Citrus Sinensis ID: 014296


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------
MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDGYGGPIYCGSGCFHRREILCGRKYDKETKIELKRENDSKREESLLELEETSKALASCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLLKGIPLFPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVCTCFTFLY
ccccccHHHHHHccccccccccccEEEEEcccccccccccccccccccccHHHHccccccccccccccccccccccccccccccHHHHHHHHccccccHHHHHHHHHHHHHHHHHccccccccccccccccccccccHHHHHHHHHHcccEEEEEccccccccccccccHHHHHHHHHccccccHHHHHccccccccccccccccHHHHHHcccccccHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccEEEccccccccHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHcc
cccccHHHHHHHHHHHcccccccEEEEEEccccccccccccccccccEEEEEEEccccccccccEEEccccEEEEEEEccccccccccccccccccccccccHHHHHHHHHHHHHcccccccccccEEEEEEccccHHHHHccHEcccccEEEEEccccccccccccccHHHHHHHHHHHHcccEEEEEcccccEEEcccccHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHccccEcccccHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHccHEEEEccHHHHHHHHHHHHHHHHHcccEEEEEEEcccccccHHcccHcHEEEEcccHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHc
MYSNNSQAVRDALCFfmdeekghefafvqfpqnfdnvtknelysNSLRIYNevefegmdgyggpiycgsgcfhrreilcgrkydkeTKIELKRENDSKREESLLELEETSKALASCTYETNtqwgkeiglkygcpvedvITGISIqcqgwksvyckperdaflgvspTTLLQFLVQRKRWSEGDFQIMLcrysparyahgkislglrlgyccyclwapnclatLFYSIvpslyllkgiplfpmisspwiipFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTlgfsessfvvtekvadedVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLInwplyqglflrkdngkmpssvTTKSLVLALSVctcftfly
MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDGYGGPIYCGSGCFHRREILcgrkydketkielkrendskreeSLLELEETSKALASCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLLKGIPLFPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVCTCFTFLY
MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDGYGGPIYCGSGCFHRREILCGRKYDKETKIELKRENDskreeslleleetskalasCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYAHGKISlglrlgyccyclWAPNCLATLFYSIVPSLYLLKGIPLFPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVCTCFTFLY
*********RDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDGYGGPIYCGSGCFHRREILCGRKYD***************************ALASCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLLKGIPLFPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVCTCFTFL*
MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDGYGGPIYCGSGCFHRREILCGRKYDKETKIELKRENDSKREESLLELEETSKALASCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLLKGIPLFPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVCTCFTFLY
********VRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDGYGGPIYCGSGCFHRREILCGRKYDKETKIELKRENDSKREESLLELEETSKALASCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLLKGIPLFPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVCTCFTFLY
MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDGYGGPIYCGSGCFHRREILCGRKYDKE**************ESLLELEETSKALASCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLLKGIPLFPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVCTCFTFLY
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHi
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiHHHHHHHHHHHHHHHHHHHHHHHo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHoooooooooooooooooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDGYGGPIYCGSGCFHRREILCGRKYDKETKIELKRENDSKREESLLELEETSKALASCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLLKGIPLFPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVCTCFTFLY
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query427 2.2.26 [Sep-21-2011]
Q8VZK9729 Cellulose synthase-like p yes no 0.978 0.573 0.615 1e-151
Q651X6728 Cellulose synthase-like p yes no 0.955 0.560 0.527 1e-136
Q0DXZ1745 Cellulose synthase-like p yes no 0.976 0.559 0.523 1e-127
Q651X7737 Cellulose synthase-like p no no 0.964 0.559 0.498 1e-126
Q8VYR4722 Cellulose synthase-like p no no 0.932 0.551 0.359 1e-77
Q0WVN5751 Cellulose synthase-like p no no 0.939 0.533 0.378 5e-77
Q570S7760 Cellulose synthase-like p no no 0.925 0.519 0.360 3e-72
Q9SWW61026 Cellulose synthase A cata no no 0.936 0.389 0.280 3e-51
Q69P511055 Cellulose synthase A cata no no 0.948 0.383 0.281 2e-50
A2Z1C81055 Cellulose synthase A cata N/A no 0.948 0.383 0.281 2e-50
>sp|Q8VZK9|CSLE1_ARATH Cellulose synthase-like protein E1 OS=Arabidopsis thaliana GN=CSLE1 PE=1 SV=1 Back     alignment and function desciption
 Score =  535 bits (1377), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 262/426 (61%), Positives = 324/426 (76%), Gaps = 8/426 (1%)

Query: 1   MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDG 60
           MY+NNS++ RDALC  +DE++G E AFVQFPQ FDNVT+N+LY + +R+  +VEF G+DG
Sbjct: 312 MYANNSKSTRDALCILLDEKEGKEIAFVQFPQCFDNVTRNDLYGSMMRVGIDVEFLGLDG 371

Query: 61  YGGPIYCGSGCFHRREILCGRKYDKETKIELKRENDSKREESLLELEETSKALASCTYET 120
            GGP+Y G+GCFHRR+++CGRKY +E + E         E  ++      KALASCTYE 
Sbjct: 372 NGGPLYIGTGCFHRRDVICGRKYGEEEEEEESERIHENLEPEMI------KALASCTYEE 425

Query: 121 NTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRW 180
           NTQWGKE+G+KYGCPVEDVITG++IQC+GWKS Y  PE+ AFLGV+PT L Q LVQ++RW
Sbjct: 426 NTQWGKEMGVKYGCPVEDVITGLTIQCRGWKSAYLNPEKQAFLGVAPTNLHQMLVQQRRW 485

Query: 181 SEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLLKGIPL 240
           SEGDFQIML +YSP  Y  GKISLGL LGYCCYCLWAP+ L  L YS++ SL L KGIPL
Sbjct: 486 SEGDFQIMLSKYSPVWYGKGKISLGLILGYCCYCLWAPSSLPVLIYSVLTSLCLFKGIPL 545

Query: 241 FPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAI 300
           FP +SS W IPF YV  A   YSLAEFLW GGT  GWWNEQR+WLY RT+SFLF F+D I
Sbjct: 546 FPKVSSSWFIPFGYVTVAATAYSLAEFLWCGGTFRGWWNEQRMWLYRRTSSFLFGFMDTI 605

Query: 301 LKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKK 360
            K LG SES+FV+T KVA+E+ ++RY++E+MEFG  S MF +L TL +LNLFC   AV +
Sbjct: 606 KKLLGVSESAFVITAKVAEEEAAERYKEEVMEFGVESPMFLVLGTLGMLNLFCFAAAVAR 665

Query: 361 VIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVC 420
           ++ GDG     +TM +Q ++   LV+INWPLY+G+ LR+D GKMP SVT KS+VLALS C
Sbjct: 666 LVSGDG--GDLKTMGMQFVITGVLVVINWPLYKGMLLRQDKGKMPMSVTVKSVVLALSAC 723

Query: 421 TCFTFL 426
           TC  FL
Sbjct: 724 TCLAFL 729




Thought to be a Golgi-localized beta-glycan synthase that polymerize the backbones of noncellulosic polysaccharides (hemicelluloses) of plant cell wall.
Arabidopsis thaliana (taxid: 3702)
EC: 2EC: .EC: 4EC: .EC: 1EC: .EC: -
>sp|Q651X6|CSLE6_ORYSJ Cellulose synthase-like protein E6 OS=Oryza sativa subsp. japonica GN=CSLE6 PE=2 SV=1 Back     alignment and function description
>sp|Q0DXZ1|CSLE2_ORYSJ Cellulose synthase-like protein E2 OS=Oryza sativa subsp. japonica GN=CSLE2 PE=2 SV=1 Back     alignment and function description
>sp|Q651X7|CSLE1_ORYSJ Cellulose synthase-like protein E1 OS=Oryza sativa subsp. japonica GN=CSLE1 PE=2 SV=2 Back     alignment and function description
>sp|Q8VYR4|CSLG2_ARATH Cellulose synthase-like protein G2 OS=Arabidopsis thaliana GN=CSLG2 PE=2 SV=1 Back     alignment and function description
>sp|Q0WVN5|CSLG3_ARATH Cellulose synthase-like protein G3 OS=Arabidopsis thaliana GN=CSLG3 PE=2 SV=2 Back     alignment and function description
>sp|Q570S7|CSLG1_ARATH Cellulose synthase-like protein G1 OS=Arabidopsis thaliana GN=CSLG1 PE=2 SV=1 Back     alignment and function description
>sp|Q9SWW6|CESA7_ARATH Cellulose synthase A catalytic subunit 7 [UDP-forming] OS=Arabidopsis thaliana GN=CESA7 PE=1 SV=1 Back     alignment and function description
>sp|Q69P51|CESA9_ORYSJ Cellulose synthase A catalytic subunit 9 [UDP-forming] OS=Oryza sativa subsp. japonica GN=CESA9 PE=2 SV=1 Back     alignment and function description
>sp|A2Z1C8|CESA9_ORYSI Cellulose synthase A catalytic subunit 9 [UDP-forming] OS=Oryza sativa subsp. indica GN=CESA9 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query427
224061234 736 predicted protein [Populus trichocarpa] 0.997 0.578 0.708 0.0
429326508 736 cellulose synthase-like protein [Populus 0.997 0.578 0.701 0.0
255563554 728 cellulose synthase, putative [Ricinus co 0.983 0.576 0.704 1e-179
449462403 730 PREDICTED: cellulose synthase-like prote 0.997 0.583 0.658 1e-171
449444188 733 PREDICTED: cellulose synthase-like prote 0.997 0.581 0.660 1e-171
449522323 748 PREDICTED: cellulose synthase-like prote 0.997 0.569 0.660 1e-171
255563552 938 coated vesicle membrane protein, putativ 0.983 0.447 0.646 1e-168
296089936 710 unnamed protein product [Vitis vinifera] 0.985 0.592 0.701 1e-166
359494189 922 PREDICTED: cellulose synthase-like prote 0.967 0.447 0.707 1e-165
356527159 736 PREDICTED: cellulose synthase-like prote 0.995 0.577 0.665 1e-164
>gi|224061234|ref|XP_002300383.1| predicted protein [Populus trichocarpa] gi|222847641|gb|EEE85188.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  655 bits (1690), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 302/426 (70%), Positives = 354/426 (83%)

Query: 1   MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDG 60
           MYSN+   VRDALCFFMDEEK H+ AFVQFPQ F NVTKN+LYS+SLR+   VEF G DG
Sbjct: 311 MYSNDPLTVRDALCFFMDEEKSHDIAFVQFPQWFANVTKNDLYSSSLRVITNVEFHGTDG 370

Query: 61  YGGPIYCGSGCFHRREILCGRKYDKETKIELKRENDSKREESLLELEETSKALASCTYET 120
           YGGP+Y G+GCFHRR+ LCGR++ +++KIE K+ ND +R++S+ EL E +K LASCTYE 
Sbjct: 371 YGGPLYVGTGCFHRRDTLCGREFSQDSKIEWKKHNDHRRQQSVHELVEETKTLASCTYEQ 430

Query: 121 NTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRW 180
           NT+WG E GLKYGCPVEDVITG+SIQC+GWKS Y  PER AFLG++PTTL Q LVQ KRW
Sbjct: 431 NTKWGNETGLKYGCPVEDVITGLSIQCKGWKSAYFNPERKAFLGLAPTTLPQVLVQHKRW 490

Query: 181 SEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLLKGIPL 240
           SEGDFQI+L +YSPA YAHG+I LGL+LGYCCYC WA NC ATL+YSIVPSL+LLKGI L
Sbjct: 491 SEGDFQILLSKYSPAWYAHGRIRLGLQLGYCCYCFWASNCFATLYYSIVPSLFLLKGISL 550

Query: 241 FPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAI 300
           FP +SSPW +PFAYV+FAKY YSL EFLW+ GT LGWWN+QR+WLY RT+S+LFA ID I
Sbjct: 551 FPQVSSPWFLPFAYVIFAKYIYSLVEFLWADGTVLGWWNDQRIWLYKRTSSYLFATIDTI 610

Query: 301 LKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKK 360
           LKTLGF E++FV+T+KVADEDVSQRYEKE+MEFGA S MF ILSTLA+LNLFC+ G VKK
Sbjct: 611 LKTLGFGETAFVITDKVADEDVSQRYEKEMMEFGATSPMFEILSTLAMLNLFCLAGTVKK 670

Query: 361 VIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVC 420
           VI+ D   + +ETM LQILLC  LVL+N PLYQGL LRKD G+MP SV  KS + AL VC
Sbjct: 671 VIMNDSIDRLHETMPLQILLCGVLVLVNLPLYQGLLLRKDKGRMPCSVAVKSSLAALLVC 730

Query: 421 TCFTFL 426
           T F+FL
Sbjct: 731 TTFSFL 736




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|429326508|gb|AFZ78594.1| cellulose synthase-like protein [Populus tomentosa] Back     alignment and taxonomy information
>gi|255563554|ref|XP_002522779.1| cellulose synthase, putative [Ricinus communis] gi|223538017|gb|EEF39630.1| cellulose synthase, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|449462403|ref|XP_004148930.1| PREDICTED: cellulose synthase-like protein E1-like [Cucumis sativus] gi|449518729|ref|XP_004166388.1| PREDICTED: cellulose synthase-like protein E1-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449444188|ref|XP_004139857.1| PREDICTED: cellulose synthase-like protein E1-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|449522323|ref|XP_004168176.1| PREDICTED: cellulose synthase-like protein E1-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|255563552|ref|XP_002522778.1| coated vesicle membrane protein, putative [Ricinus communis] gi|223538016|gb|EEF39629.1| coated vesicle membrane protein, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|296089936|emb|CBI39755.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|359494189|ref|XP_002266196.2| PREDICTED: cellulose synthase-like protein E1 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356527159|ref|XP_003532180.1| PREDICTED: cellulose synthase-like protein E1-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query427
TAIR|locus:2012050729 CSLE1 "AT1G55850" [Arabidopsis 0.978 0.573 0.575 5.2e-136
TAIR|locus:2138106751 CSLG3 "AT4G23990" [Arabidopsis 0.983 0.559 0.355 4.4e-68
TAIR|locus:2138116722 CSLG2 "AT4G24000" [Arabidopsis 0.939 0.555 0.342 1.5e-67
TAIR|locus:2138126760 CSLG1 "AT4G24010" [Arabidopsis 0.929 0.522 0.348 1.8e-64
TAIR|locus:21363081084 CESA2 "cellulose synthase A2" 0.688 0.271 0.310 7.7e-53
TAIR|locus:20525761088 CESA9 "cellulose synthase A9" 0.688 0.270 0.310 4.4e-52
TAIR|locus:21781931069 CESA5 "cellulose synthase 5" [ 0.690 0.275 0.295 1.5e-51
TAIR|locus:21789351026 IRX3 "IRREGULAR XYLEM 3" [Arab 0.681 0.283 0.289 2e-51
TAIR|locus:20400801065 CESA10 "cellulose synthase 10" 0.686 0.275 0.291 3.1e-50
TAIR|locus:21277761081 CESA1 "cellulose synthase 1" [ 0.686 0.271 0.298 3.3e-50
TAIR|locus:2012050 CSLE1 "AT1G55850" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1332 (473.9 bits), Expect = 5.2e-136, P = 5.2e-136
 Identities = 245/426 (57%), Positives = 305/426 (71%)

Query:     1 MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDG 60
             MY+NNS++ RDALC  +DE++G E AFVQFPQ FDNVT+N+LY + +R+  +VEF G+DG
Sbjct:   312 MYANNSKSTRDALCILLDEKEGKEIAFVQFPQCFDNVTRNDLYGSMMRVGIDVEFLGLDG 371

Query:    61 YGGPIYCGSGCFHRREILCGRKYDKETKIELKRENDXXXXXXXXXXXXXXXXXXXCTYET 120
              GGP+Y G+GCFHRR+++CGRKY +E       E +                   CTYE 
Sbjct:   372 NGGPLYIGTGCFHRRDVICGRKYGEE------EEEEESERIHENLEPEMIKALASCTYEE 425

Query:   121 NTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRW 180
             NTQWGKE+G+KYGCPVEDVITG++IQC+GWKS Y  PE+ AFLGV+PT L Q LVQ++RW
Sbjct:   426 NTQWGKEMGVKYGCPVEDVITGLTIQCRGWKSAYLNPEKQAFLGVAPTNLHQMLVQQRRW 485

Query:   181 SEGDFQIMLCRYSPARYAHGKISXXXXXXXXXXXXWAPNCLATLFYSIVPSLYLLKGIPL 240
             SEGDFQIML +YSP  Y  GKIS            WAP+ L  L YS++ SL L KGIPL
Sbjct:   486 SEGDFQIMLSKYSPVWYGKGKISLGLILGYCCYCLWAPSSLPVLIYSVLTSLCLFKGIPL 545

Query:   241 FPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAI 300
             FP +SS W IPF YV  A   YSLAEFLW GGT  GWWNEQR+WLY RT+SFLF F+D I
Sbjct:   546 FPKVSSSWFIPFGYVTVAATAYSLAEFLWCGGTFRGWWNEQRMWLYRRTSSFLFGFMDTI 605

Query:   301 LKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKK 360
              K LG SES+FV+T KVA+E+ ++RY++E+MEFG  S MF +L TL +LNLFC   AV +
Sbjct:   606 KKLLGVSESAFVITAKVAEEEAAERYKEEVMEFGVESPMFLVLGTLGMLNLFCFAAAVAR 665

Query:   361 VIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLVLALSVC 420
             ++ GDG     +TM +Q ++   LV+INWPLY+G+ LR+D GKMP SVT KS+VLALS C
Sbjct:   666 LVSGDG--GDLKTMGMQFVITGVLVVINWPLYKGMLLRQDKGKMPMSVTVKSVVLALSAC 723

Query:   421 TCFTFL 426
             TC  FL
Sbjct:   724 TCLAFL 729




GO:0000271 "polysaccharide biosynthetic process" evidence=ISS
GO:0005794 "Golgi apparatus" evidence=ISM
GO:0009832 "plant-type cell wall biogenesis" evidence=ISS
GO:0016020 "membrane" evidence=IEA
GO:0016757 "transferase activity, transferring glycosyl groups" evidence=ISS
GO:0016759 "cellulose synthase activity" evidence=ISS
GO:0016760 "cellulose synthase (UDP-forming) activity" evidence=IEA
GO:0030244 "cellulose biosynthetic process" evidence=IEA
GO:0005783 "endoplasmic reticulum" evidence=IDA
GO:0005886 "plasma membrane" evidence=IDA
GO:0046482 "para-aminobenzoic acid metabolic process" evidence=RCA
TAIR|locus:2138106 CSLG3 "AT4G23990" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2138116 CSLG2 "AT4G24000" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2138126 CSLG1 "AT4G24010" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2136308 CESA2 "cellulose synthase A2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2052576 CESA9 "cellulose synthase A9" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2178193 CESA5 "cellulose synthase 5" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2178935 IRX3 "IRREGULAR XYLEM 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2040080 CESA10 "cellulose synthase 10" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2127776 CESA1 "cellulose synthase 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

Prediction LevelEC numberConfidence of Prediction
3rd Layer2.4.10.691
4th Layer2.4.1.12LOW CONFIDENCE prediction!

Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
fgenesh4_pm.C_LG_I001167
hypothetical protein (736 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query427
PLN02893734 PLN02893, PLN02893, Cellulose synthase-like protei 1e-120
PLN021891040 PLN02189, PLN02189, cellulose synthase 3e-60
pfam03552716 pfam03552, Cellulose_synt, Cellulose synthase 1e-51
PLN02190756 PLN02190, PLN02190, cellulose synthase-like protei 1e-49
PLN029151044 PLN02915, PLN02915, cellulose synthase A [UDP-form 2e-40
PLN024001085 PLN02400, PLN02400, cellulose synthase 3e-40
PLN024361094 PLN02436, PLN02436, cellulose synthase A 3e-38
PLN026381079 PLN02638, PLN02638, cellulose synthase A (UDP-form 2e-37
PLN02195977 PLN02195, PLN02195, cellulose synthase A 9e-36
PLN022481135 PLN02248, PLN02248, cellulose synthase-like protei 2e-24
PLN024361094 PLN02436, PLN02436, cellulose synthase A 2e-20
PLN02195977 PLN02195, PLN02195, cellulose synthase A 5e-19
PLN026381079 PLN02638, PLN02638, cellulose synthase A (UDP-form 2e-18
PLN02915 1044 PLN02915, PLN02915, cellulose synthase A [UDP-form 3e-18
cd06421234 cd06421, CESA_CelA_like, CESA_CelA_like are involv 4e-18
PLN024001085 PLN02400, PLN02400, cellulose synthase 2e-16
PLN022481135 PLN02248, PLN02248, cellulose synthase-like protei 7e-15
COG1215439 COG1215, COG1215, Glycosyltransferases, probably i 3e-06
TIGR03030 713 TIGR03030, CelA, cellulose synthase catalytic subu 7e-05
pfam13641229 pfam13641, Glyco_tranf_2_3, Glycosyltransferase li 0.001
>gnl|CDD|215483 PLN02893, PLN02893, Cellulose synthase-like protein Back     alignment and domain information
 Score =  365 bits (940), Expect = e-120
 Identities = 164/422 (38%), Positives = 236/422 (55%), Gaps = 14/422 (3%)

Query: 1   MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDG 60
           MYSN+ Q    ALC+ +D     +  +VQFPQ F  + KN++Y+  L+   ++   GMDG
Sbjct: 309 MYSNDPQTPLRALCYLLDPSMDPKLGYVQFPQIFHGINKNDIYAGELKRLFQINMIGMDG 368

Query: 61  YGGPIYCGSGCFHRREILCG---RKYDKETKIELKRE---NDSKREESLLELEETSKALA 114
             GP Y G+GCF RR +  G        E   EL  +   + S + + +L L   +  +A
Sbjct: 369 LAGPNYVGTGCFFRRRVFYGGPSSLILPEIP-ELNPDHLVDKSIKSQEVLAL---AHHVA 424

Query: 115 SCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFL 174
            C YE  T WG ++G +YG  VED  TG  +QC+GWKS++C P+R AFLG SP  L   L
Sbjct: 425 GCNYENQTNWGSKMGFRYGSLVEDYYTGYRLQCEGWKSIFCNPKRPAFLGDSPINLHDVL 484

Query: 175 VQRKRWSEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYL 234
            Q+KRWS G  ++   +YSP  +    I L + LGY  Y  W    +    Y+ +P L L
Sbjct: 485 NQQKRWSVGLLEVAFSKYSPITFGVKSIGLLMGLGYAHYAFWPIWSIPITIYAFLPQLAL 544

Query: 235 LKGIPLFPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLF 294
           L G+ +FP  S PW   + ++    Y   L +FL SGGT   WWN+QR+W+    +SFLF
Sbjct: 545 LNGVSIFPKASDPWFFLYIFLFLGAYGQDLLDFLLSGGTIQRWWNDQRMWMIRGLSSFLF 604

Query: 295 AFIDAILKTLGFSESSFVVTEKVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCM 354
             ++ +LKTLG S   F VT KV DE+ S+RYE+ I EFG +S MF  L+T A++NL   
Sbjct: 605 GLVEFLLKTLGISTFGFNVTSKVVDEEQSKRYEQGIFEFGVSSPMFLPLTTAAIINLVSF 664

Query: 355 IGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNGKMPSSVTTKSLV 414
           +  + ++          E + LQ+ L    V+  WP+Y+ + LR D+GK+P  +T  S+V
Sbjct: 665 LWGIAQIFRQRN----LEGLFLQMFLAGFAVVNCWPIYEAMVLRTDDGKLPVKITLISIV 720

Query: 415 LA 416
           LA
Sbjct: 721 LA 722


Length = 734

>gnl|CDD|215121 PLN02189, PLN02189, cellulose synthase Back     alignment and domain information
>gnl|CDD|217611 pfam03552, Cellulose_synt, Cellulose synthase Back     alignment and domain information
>gnl|CDD|215122 PLN02190, PLN02190, cellulose synthase-like protein Back     alignment and domain information
>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic subunit Back     alignment and domain information
>gnl|CDD|215224 PLN02400, PLN02400, cellulose synthase Back     alignment and domain information
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A Back     alignment and domain information
>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic subunit Back     alignment and domain information
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A Back     alignment and domain information
>gnl|CDD|215138 PLN02248, PLN02248, cellulose synthase-like protein Back     alignment and domain information
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A Back     alignment and domain information
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A Back     alignment and domain information
>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic subunit Back     alignment and domain information
>gnl|CDD|215494 PLN02915, PLN02915, cellulose synthase A [UDP-forming], catalytic subunit Back     alignment and domain information
>gnl|CDD|133043 cd06421, CESA_CelA_like, CESA_CelA_like are involved in the elongation of the glucan chain of cellulose Back     alignment and domain information
>gnl|CDD|215224 PLN02400, PLN02400, cellulose synthase Back     alignment and domain information
>gnl|CDD|215138 PLN02248, PLN02248, cellulose synthase-like protein Back     alignment and domain information
>gnl|CDD|224136 COG1215, COG1215, Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>gnl|CDD|234095 TIGR03030, CelA, cellulose synthase catalytic subunit (UDP-forming) Back     alignment and domain information
>gnl|CDD|222281 pfam13641, Glyco_tranf_2_3, Glycosyltransferase like family 2 Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 427
PF03552720 Cellulose_synt: Cellulose synthase; InterPro: IPR0 100.0
PLN026381079 cellulose synthase A (UDP-forming), catalytic subu 100.0
PLN021891040 cellulose synthase 100.0
PLN02893734 Cellulose synthase-like protein 100.0
PLN024001085 cellulose synthase 100.0
PLN024361094 cellulose synthase A 100.0
PLN02190756 cellulose synthase-like protein 100.0
PLN02195977 cellulose synthase A 100.0
PLN029151044 cellulose synthase A [UDP-forming], catalytic subu 100.0
PLN022481135 cellulose synthase-like protein 100.0
TIGR03030 713 CelA cellulose synthase catalytic subunit (UDP-for 100.0
PRK11498 852 bcsA cellulose synthase catalytic subunit; Provisi 100.0
PRK05454691 glucosyltransferase MdoH; Provisional 99.85
cd04191254 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elon 99.84
COG1215439 Glycosyltransferases, probably involved in cell wa 99.78
PRK14583444 hmsR N-glycosyltransferase; Provisional 99.77
PRK11204420 N-glycosyltransferase; Provisional 99.74
PRK14716504 bacteriophage N4 adsorption protein B; Provisional 99.74
PRK11234 727 nfrB bacteriophage N4 adsorption protein B; Provis 99.69
PF13632193 Glyco_trans_2_3: Glycosyl transferase family group 99.68
TIGR03111439 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR0 99.62
cd06435236 CESA_NdvC_like NdvC_like proteins in this family a 99.58
cd06437232 CESA_CaSu_A2 Cellulose synthase catalytic subunit 99.54
cd06421234 CESA_CelA_like CESA_CelA_like are involved in the 99.53
PRK15489 703 nfrB bacteriophage N4 adsorption protein B; Provis 99.51
cd06427241 CESA_like_2 CESA_like_2 is a member of the cellulo 99.5
PF13641228 Glyco_tranf_2_3: Glycosyltransferase like family 2 99.46
cd04190244 Chitin_synth_C C-terminal domain of Chitin Synthas 99.41
cd04192229 GT_2_like_e Subfamily of Glycosyltransferase Famil 99.09
cd06434235 GT2_HAS Hyaluronan synthases catalyze polymerizati 99.09
TIGR03472373 HpnI hopanoid biosynthesis associated glycosyl tra 99.03
cd02520196 Glucosylceramide_synthase Glucosylceramide synthas 98.98
PF13506175 Glyco_transf_21: Glycosyl transferase family 21 98.83
cd06439251 CESA_like_1 CESA_like_1 is a member of the cellulo 98.67
cd02525249 Succinoglycan_BP_ExoA ExoA is involved in the bios 98.53
cd06436191 GlcNAc-1-P_transferase N-acetyl-glucosamine transf 98.26
COG2943 736 MdoH Membrane glycosyltransferase [Cell envelope b 98.22
TIGR03469384 HonB hopene-associated glycosyltransferase HpnB. T 98.1
PF03142527 Chitin_synth_2: Chitin synthase; InterPro: IPR0048 97.89
KOG2571862 consensus Chitin synthase/hyaluronan synthase (gly 97.63
cd06438183 EpsO_like EpsO protein participates in the methano 97.39
cd04184202 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like prot 96.58
cd04195201 GT2_AmsE_like GT2_AmsE_like is involved in exopoly 96.26
cd06433202 GT_2_WfgS_like WfgS and WfeV are involved in O-ant 96.0
cd02522221 GT_2_like_a GT_2_like_a represents a glycosyltrans 95.98
cd02526237 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl tr 95.89
cd04186166 GT_2_like_c Subfamily of Glycosyltransferase Famil 95.38
cd06420182 GT2_Chondriotin_Pol_N N-terminal domain of Chondro 95.02
TIGR01556281 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids 94.41
cd02510299 pp-GalNAc-T pp-GalNAc-T initiates the formation of 94.35
COG1216305 Predicted glycosyltransferases [General function p 93.4
cd04185202 GT_2_like_b Subfamily of Glycosyltransferase Famil 92.56
cd06423180 CESA_like CESA_like is the cellulose synthase supe 90.75
KOG2547431 consensus Ceramide glucosyltransferase [Lipid tran 90.34
PLN02726243 dolichyl-phosphate beta-D-mannosyltransferase 89.77
cd06442224 DPM1_like DPM1_like represents putative enzymes si 89.0
cd06913219 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltran 88.8
PF10111281 Glyco_tranf_2_2: Glycosyltransferase like family 2 88.7
cd04196214 GT_2_like_d Subfamily of Glycosyltransferase Famil 88.53
>PF03552 Cellulose_synt: Cellulose synthase; InterPro: IPR005150 Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals Back     alignment and domain information
Probab=100.00  E-value=5.7e-119  Score=952.83  Aligned_cols=418  Identities=42%  Similarity=0.766  Sum_probs=382.8

Q ss_pred             CccCChhHHHHHhhhhcCCCCCCeEEEEecCCccccCCCccchHhHHHHhhhhhcccccccCCceeccccceeehhhhcC
Q 014296            1 MYSNNSQAVRDALCFFMDEEKGHEFAFVQFPQNFDNVTKNELYSNSLRIYNEVEFEGMDGYGGPIYCGSGCFHRREILCG   80 (427)
Q Consensus         1 my~n~p~~l~~~l~~f~Dp~~~~~vafVQ~PQ~F~n~~~~d~~~~~~~~f~~~~~~g~dg~~g~~~~GTg~~~RR~aL~~   80 (427)
                      ||+|||+.+|+|||||+||+.|+++|||||||+|+|+++||+|+|++++||+++++|+||+|||+|+||||+|||+||||
T Consensus       211 ~y~nn~~~~~~amc~~~d~~~g~~~~~vQfpq~f~~i~~~d~y~~~~~~~~~~~~~g~dG~~gp~y~Gtgc~~rR~al~g  290 (720)
T PF03552_consen  211 MYINNSQALREAMCFFMDPKIGKKIAFVQFPQRFDGIDKNDRYGNQNRVFFDINMRGLDGLQGPFYVGTGCFFRREALYG  290 (720)
T ss_pred             ccccchHHHHHHHHhhccCCCCCeeEEEeCCceeCCCCcCCCCCccceeeeeccccccccCCCceeeecCcceechhhhC
Confidence            89999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCCCchhHH-----------------------HHh----hc-----------c------------------------c--
Q 014296           81 RKYDKETKI-----------------------ELK----RE-----------N------------------------D--   96 (427)
Q Consensus        81 ~~~~~~~~~-----------------------~~~----~~-----------~------------------------g--   96 (427)
                      ..+.+..++                       .-.    +.           +                        |  
T Consensus       291 ~~~~~~~~~~~~~~~~~~~c~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~FG~S  370 (720)
T PF03552_consen  291 FDPPRYEKDPEKTCCCCSCCFGRRKKKKSKKKPKKRASKRRESSSPIFALEDIEEGAEGSDEERSSLMSQKELEKKFGQS  370 (720)
T ss_pred             CCCCchhcccCcceeeeecccCCcccccccccchhccccccccccccccccccccccccchhhhhhcchhHHHHHHhcCC
Confidence            765421000                       000    00           0                        0  


Q ss_pred             -hh------------hhhhHHHHHHhhcccccccccccccccccccccCCCCCchHHHHHHHHhCCcEEEEecCCccccc
Q 014296           97 -SK------------REESLLELEETSKALASCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYCKPERDAFL  163 (427)
Q Consensus        97 -~~------------~~~~~~~~~~~a~~va~c~Ye~~t~wG~e~G~~y~svtED~~tg~~lh~~GWrs~y~~p~~~af~  163 (427)
                       .+            ...+..+++|||++|+||+||++|+||+||||.|||+|||+.||++|||+||||+||+|+++||+
T Consensus       371 ~~fi~S~~~~~~~~~~~~~~~~~L~EA~~V~sC~YE~~T~WGkevGwiYGSvtEDv~TG~rmH~rGWrSvYc~p~r~AF~  450 (720)
T PF03552_consen  371 PEFIASTLMAQGGVPRSPSPASLLEEAIHVASCGYEDKTEWGKEVGWIYGSVTEDVLTGFRMHCRGWRSVYCNPKRPAFL  450 (720)
T ss_pred             HHHHHHHHHHhcCCCCCCChHHHHHHHHHHhcCCccccCCcccccceEEEecccccccceeEeeCceeeEEeccccchhc
Confidence             00            01134567799999999999999999999999999999999999999999999999999999999


Q ss_pred             cCCCcCHHHHHHHHHHHhhhhHHHHHhhcCccccc-cCcCChhhhHhhhcccchhHHHHHHHHHHHHHHHHHHhCCCccc
Q 014296          164 GVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYA-HGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLLKGIPLFP  242 (427)
Q Consensus       164 G~aP~~l~~~l~Qr~RWa~G~~qi~~~~~~Pl~~~-~~~l~~~qrl~y~~~~~~~~~~~~~l~~~~~P~l~Ll~g~~~~p  242 (427)
                      |+||+|+.+.|.|++|||+|++||+++|+||+++| .+||+++||++|++.++||++++|+++|+++|++||++|++++|
T Consensus       451 G~AP~nL~d~L~Q~~RWA~GslEI~fSr~~Pl~~g~~~rL~~lQrLaY~~~~~ypl~Sipll~Y~~lPalcLLtG~~i~P  530 (720)
T PF03552_consen  451 GSAPINLSDRLHQVKRWATGSLEIFFSRHCPLWYGYGGRLKFLQRLAYLNYMLYPLTSIPLLCYCFLPALCLLTGIFIFP  530 (720)
T ss_pred             ccCCCChhhhceeeeeEeeeeEeeehhcCCchhccCCCCCcHHHHHHHHHHhhhHHHHHHHHHHHHhHHHHhhCCCcccC
Confidence            99999999999999999999999999999999987 57999999999999999999999999999999999999999999


Q ss_pred             ccchhhHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhCCCCCCeeeCcCCCccch
Q 014296          243 MISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTLGFSESSFVVTEKVADEDV  322 (427)
Q Consensus       243 ~~~~~~~l~~~~~~~~~~~~~lle~~~~G~s~~~ww~~qr~w~i~~~s~~l~a~~~~llk~lg~~~~~F~VT~K~~~~~~  322 (427)
                      +++++|+++|+.+++++++++++|++|+|+++++||||||||+|.++++|++|++++++|.+|+++++|+||+|+.++++
T Consensus       531 k~s~~~~~~f~~lf~~~~~~~llE~~wsG~si~~WWrnQq~W~I~~tSa~LfAvl~~iLK~lg~s~t~F~VTsK~~dde~  610 (720)
T PF03552_consen  531 KVSSPWFIYFLALFVSIYAYSLLEFRWSGVSIREWWRNQQFWMIGGTSAHLFAVLQGILKVLGGSETSFTVTSKVSDDED  610 (720)
T ss_pred             ccccchhHHHHHHHHHHHHHHHHHHHhccCcHHHhhcccceeeehhhHHHHHHHHHHHHHHHcCCccceeeccccccccc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999987554


Q ss_pred             hhhhhhhheeecccccchHHHHHHHHHHHHHHHHHHHHHHHcCccchhhHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC
Q 014296          323 SQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGLFLRKDNG  402 (427)
Q Consensus       323 ~~~y~~~~~~f~~~s~l~~p~~~l~~lnl~a~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~p~~~gl~~r~~k~  402 (427)
                      .+.+  |.++|+ ||++++|++|++++|++|+++|+.+++++ ++ +.|+++++|+++|+|+|+++|||+|||++|||  
T Consensus       611 ~~~~--ely~f~-wS~LfiP~tTllilNLva~v~Gi~r~i~~-g~-~~~g~l~g~lf~~~wVvv~lyPf~kGL~~R~~--  683 (720)
T PF03552_consen  611 DKYA--ELYIFK-WSPLFIPPTTLLILNLVAFVVGISRAINS-GY-GSWGPLLGQLFFSFWVVVHLYPFLKGLFGRKD--  683 (720)
T ss_pred             cccc--cccccc-ccchhhHHHHHHHHHHHHHHHHHHHHhcc-CC-CchhHHHHHHHHHHHHHHHhhHHHHhhhcccC--
Confidence            4433  467885 78999999999999999999999999976 33 56899999999999999999999999999986  


Q ss_pred             CCChhHHHHHHHHHHHHHHHhhh
Q 014296          403 KMPSSVTTKSLVLALSVCTCFTF  425 (427)
Q Consensus       403 ~~P~~~~~~s~~l~~~~~~~~~~  425 (427)
                      |+|+||++||++||++||++|+-
T Consensus       684 r~P~~v~v~S~lla~i~~llwv~  706 (720)
T PF03552_consen  684 RIPTSVIVWSVLLASIFSLLWVR  706 (720)
T ss_pred             CcceeehHHHHHHHHHHHHHhee
Confidence            69999999999999999999864



The genes that synthesize cellulose in higher plants differ greatly from the well-characterised genes found in Acetobacter and Agrobacterium spp. More correctly designated as "cellulose synthase catalytic subunits", plant cellulose synthase (CesA) proteins are integral membrane proteins, approximately 1,000 amino acids in length. There are a number of highly conserved residues, including several motifs shown to be necessary for processive glycosyltransferase activity [].; GO: 0016760 cellulose synthase (UDP-forming) activity, 0030244 cellulose biosynthetic process, 0016020 membrane

>PLN02638 cellulose synthase A (UDP-forming), catalytic subunit Back     alignment and domain information
>PLN02189 cellulose synthase Back     alignment and domain information
>PLN02893 Cellulose synthase-like protein Back     alignment and domain information
>PLN02400 cellulose synthase Back     alignment and domain information
>PLN02436 cellulose synthase A Back     alignment and domain information
>PLN02190 cellulose synthase-like protein Back     alignment and domain information
>PLN02195 cellulose synthase A Back     alignment and domain information
>PLN02915 cellulose synthase A [UDP-forming], catalytic subunit Back     alignment and domain information
>PLN02248 cellulose synthase-like protein Back     alignment and domain information
>TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming) Back     alignment and domain information
>PRK11498 bcsA cellulose synthase catalytic subunit; Provisional Back     alignment and domain information
>PRK05454 glucosyltransferase MdoH; Provisional Back     alignment and domain information
>cd04191 Glucan_BSP_ModH Glucan_BSP_ModH catalyzes the elongation of beta-1,2 polyglucose chains of glucan Back     alignment and domain information
>COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>PRK14583 hmsR N-glycosyltransferase; Provisional Back     alignment and domain information
>PRK11204 N-glycosyltransferase; Provisional Back     alignment and domain information
>PRK14716 bacteriophage N4 adsorption protein B; Provisional Back     alignment and domain information
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional Back     alignment and domain information
>PF13632 Glyco_trans_2_3: Glycosyl transferase family group 2 Back     alignment and domain information
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111 Back     alignment and domain information
>cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase Back     alignment and domain information
>cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex Back     alignment and domain information
>cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose Back     alignment and domain information
>PRK15489 nfrB bacteriophage N4 adsorption protein B; Provisional Back     alignment and domain information
>cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily Back     alignment and domain information
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A Back     alignment and domain information
>cd04190 Chitin_synth_C C-terminal domain of Chitin Synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin Back     alignment and domain information
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function Back     alignment and domain information
>cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan Back     alignment and domain information
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI Back     alignment and domain information
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis Back     alignment and domain information
>PF13506 Glyco_transf_21: Glycosyl transferase family 21 Back     alignment and domain information
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily Back     alignment and domain information
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan Back     alignment and domain information
>cd06436 GlcNAc-1-P_transferase N-acetyl-glucosamine transferase is involved in the synthesis of Poly-beta-1,6-N-acetyl-D-glucosamine Back     alignment and domain information
>COG2943 MdoH Membrane glycosyltransferase [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB Back     alignment and domain information
>PF03142 Chitin_synth_2: Chitin synthase; InterPro: IPR004835 Chitin synthase (2 Back     alignment and domain information
>KOG2571 consensus Chitin synthase/hyaluronan synthase (glycosyltransferases) [Cell wall/membrane/envelope biogenesis] Back     alignment and domain information
>cd06438 EpsO_like EpsO protein participates in the methanolan synthesis Back     alignment and domain information
>cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis Back     alignment and domain information
>cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis Back     alignment and domain information
>cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis Back     alignment and domain information
>cd02522 GT_2_like_a GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function Back     alignment and domain information
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase Back     alignment and domain information
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function Back     alignment and domain information
>cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase Back     alignment and domain information
>TIGR01556 rhamnosyltran L-rhamnosyltransferase Back     alignment and domain information
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans Back     alignment and domain information
>COG1216 Predicted glycosyltransferases [General function prediction only] Back     alignment and domain information
>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function Back     alignment and domain information
>cd06423 CESA_like CESA_like is the cellulose synthase superfamily Back     alignment and domain information
>KOG2547 consensus Ceramide glucosyltransferase [Lipid transport and metabolism; Cell wall/membrane/envelope biogenesis] Back     alignment and domain information
>PLN02726 dolichyl-phosphate beta-D-mannosyltransferase Back     alignment and domain information
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1 Back     alignment and domain information
>cd06913 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine Back     alignment and domain information
>PF10111 Glyco_tranf_2_2: Glycosyltransferase like family 2; InterPro: IPR019290 This conserved domain is found in a set of prokaryotic proteins including putative glucosyltransferases, which are involved in bacterial capsule biosynthesis [, ] Back     alignment and domain information
>cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query427
4hg6_A 802 Structure Of A Cellulose Synthase - Cellulose Trans 3e-06
>pdb|4HG6|A Chain A, Structure Of A Cellulose Synthase - Cellulose Translocation Intermediate Length = 802 Back     alignment and structure

Iteration: 1

Score = 49.7 bits (117), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 53/222 (23%), Positives = 79/222 (35%), Gaps = 58/222 (26%) Query: 27 FVQFPQ---NFDNVTKNELYSNSLRIYNEVEF----EGMDGYGGPIYCGSGCFHRREILC 79 VQ P N D + +N + NE+ + G+D +GG +CGS RR L Sbjct: 271 LVQTPHFFINPDPIQRNLALGDRCPPENEMFYGKIHRGLDRWGGAFFCGSAAVLRRRALD 330 Query: 80 GRKYDKETKIELKRENDXXXXXXXXXXXXXXXXXXXCTYETNTQWGKEIGLKYGCPVEDV 139 E G+ I ED Sbjct: 331 ---------------------------------------EAGGFAGETI-------TEDA 344 Query: 140 ITGISIQCQGWKSVYCKPERDAFLGVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYAH 199 T + I +GWKS+Y +R G+ P T F+ QR RW+ G Q++L + R Sbjct: 345 ETALEIHSRGWKSLYI--DRAMIAGLQPETFASFIQQRGRWATGMMQMLLLKNPLFRRGL 402 Query: 200 GKISXXXXXXXXXXXXWAPNCLATLFYSIVPSLYLLKGIPLF 241 G I+ + P L + + + P +YL GI +F Sbjct: 403 G-IAQRLCYLNSMSFWFFP--LVRMMFLVAPLIYLFFGIEIF 441

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query427
4hg6_A 802 Cellulose synthase subunit A; membrane translocati 99.98
2z86_A625 Chondroitin synthase; GT-A, glycosyltransferase A, 97.09
2fy7_A287 Beta-1,4-galactosyltransferase 1; M339H mutant, AP 96.79
2z86_A625 Chondroitin synthase; GT-A, glycosyltransferase A, 95.6
1xhb_A472 Polypeptide N-acetylgalactosaminyltransferase 1; g 95.51
1qg8_A255 Protein (spore coat polysaccharide biosynthesis P 95.49
2d7i_A570 Polypeptide N-acetylgalactosaminyltransferase 10; 95.46
3bcv_A240 Putative glycosyltransferase protein; protein stru 95.0
2ffu_A501 Ppgalnact-2, polypeptide N-acetylgalactosaminyltra 93.54
4fix_A657 UDP-galactofuranosyl transferase GLFT2; CAZY GT-2 92.1
3l7i_A 729 Teichoic acid biosynthesis protein F; GT-B fold, m 83.47
3ckj_A329 Putative uncharacterized protein; mycobacteria, un 81.59
>4hg6_A Cellulose synthase subunit A; membrane translocation, cellulose synthesis, UDP-GLC binding membrane, transferase; HET: BGC UDP LDA; 3.25A {Rhodobacter sphaeroides} Back     alignment and structure
Probab=99.98  E-value=1.1e-30  Score=288.85  Aligned_cols=326  Identities=20%  Similarity=0.280  Sum_probs=239.2

Q ss_pred             CChhHHHHHhhhhc-CCCCCCeEEEEecCCccccCCCc-------cchHhHHHHhhhhhcccccccCCceeccccceeeh
Q 014296            4 NNSQAVRDALCFFM-DEEKGHEFAFVQFPQNFDNVTKN-------ELYSNSLRIYNEVEFEGMDGYGGPIYCGSGCFHRR   75 (427)
Q Consensus         4 n~p~~l~~~l~~f~-Dp~~~~~vafVQ~PQ~F~n~~~~-------d~~~~~~~~f~~~~~~g~dg~~g~~~~GTg~~~RR   75 (427)
                      .+||++++.+.+|. ||    ++++||+|+.+.|.++.       +.+.++.+.+++..+.+.+.+++++++|+++++||
T Consensus       251 ~~pd~L~~lv~~~~~dp----~v~~V~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~Rr  326 (802)
T 4hg6_A          251 PSRDFLARTVGYFVEDP----DLFLVQTPHFFINPDPIQRNLALGDRCPPENEMFYGKIHRGLDRWGGAFFCGSAAVLRR  326 (802)
T ss_dssp             ECTTHHHHHHHHHHHSS----SCCEEECCCCBSSCCHHHHHHTCCTTSCCTTHHHHHTHHHHHHHTTCCCCCSSSEEEEH
T ss_pred             cChHHHHHHHHHHhcCC----CeEEEeccEEEeCCchHhhhhhHHhhhhHHHHHHHHHHHhhHhhcCCceecccchhhhH
Confidence            47999999999995 88    89999999999987631       23445668889999999999999999999999999


Q ss_pred             hhhcCCCCCchhHHHHhhccchhhhhhHHHHHHhhcccccccccccccccccccccCCCCCchHHHHHHHHhCCcEEEEe
Q 014296           76 EILCGRKYDKETKIELKRENDSKREESLLELEETSKALASCTYETNTQWGKEIGLKYGCPVEDVITGISIQCQGWKSVYC  155 (427)
Q Consensus        76 ~aL~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~va~c~Ye~~t~wG~e~G~~y~svtED~~tg~~lh~~GWrs~y~  155 (427)
                      +++++              +|                                ||+.++++||.+++++++++|||+.|+
T Consensus       327 ~al~~--------------vG--------------------------------gf~~~~~~ED~~l~~rl~~~G~ri~~~  360 (802)
T 4hg6_A          327 RALDE--------------AG--------------------------------GFAGETITEDAETALEIHSRGWKSLYI  360 (802)
T ss_dssp             HHHHH--------------HT--------------------------------TCCCSSSSHHHHHHHHHHTTTCCEEEC
T ss_pred             HHHHH--------------cC--------------------------------CcCCCCcchHHHHHHHHHHcCCeEEEe
Confidence            99998              35                                899999999999999999999999999


Q ss_pred             cCCccccccCCCcCHHHHHHHHHHHhhhhHHHHHhhcCccccccCcCChhhhHhhhcccchhHHHHHHHHHHHHHHHHHH
Q 014296          156 KPERDAFLGVSPTTLLQFLVQRKRWSEGDFQIMLCRYSPARYAHGKISLGLRLGYCCYCLWAPNCLATLFYSIVPSLYLL  235 (427)
Q Consensus       156 ~p~~~af~G~aP~~l~~~l~Qr~RWa~G~~qi~~~~~~Pl~~~~~~l~~~qrl~y~~~~~~~~~~~~~l~~~~~P~l~Ll  235 (427)
                       |+.. +.+.+|+|+.++++||+||+.|.+|+++.+ +|++  .+++++.||++|+....+++.+++.++++++|+++++
T Consensus       361 -~~~~-~~~~~p~t~~~~~~Qr~RW~~G~~q~l~~~-~pl~--~~~l~~~~rl~~l~~~~~~~~~~~~li~ll~p~~~ll  435 (802)
T 4hg6_A          361 -DRAM-IAGLQPETFASFIQQRGRWATGMMQMLLLK-NPLF--RRGLGIAQRLCYLNSMSFWFFPLVRMMFLVAPLIYLF  435 (802)
T ss_dssp             -CCCC-EEECCCCSHHHHHHHHHHHHHHHHHHHHHS-CTTS--CSSCCHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHH
T ss_pred             -cCCE-EEecCCCCHHHHHHHHHHHHccHHHHHHHh-Cccc--cCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence             4444 479999999999999999999999998775 7876  4689999999999998899999999999999999999


Q ss_pred             hCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhCCCCCCeeeCc
Q 014296          236 KGIPLFPMISSPWIIPFAYVMFAKYTYSLAEFLWSGGTALGWWNEQRLWLYLRTTSFLFAFIDAILKTLGFSESSFVVTE  315 (427)
Q Consensus       236 ~g~~~~p~~~~~~~l~~~~~~~~~~~~~lle~~~~G~s~~~ww~~qr~w~i~~~s~~l~a~~~~llk~lg~~~~~F~VT~  315 (427)
                      +++.+++.....++..++..++.   ..++.....+.....||+.     +..+...++.+...+...+++++.+|+||+
T Consensus       436 ~~~~~~~~~~~~~~~~~lp~~l~---~~~~~~~~~~~~r~~~~~~-----l~~~~~~~~~~~a~l~~l~~~~~~~f~VT~  507 (802)
T 4hg6_A          436 FGIEIFVATFEEVLAYMPGYLAV---SFLVQNALFARQRWPLVSE-----VYEVAQAPYLARAIVTTLLRPRSARFAVTA  507 (802)
T ss_dssp             HCCCCSCCCHHHHHHHHHHHHHH---HHHHHHHHHTTTSCTTHHH-----HHHHHHHHHHHHHHHHHHHSTTCCCCCCCC
T ss_pred             HhHHHhhcCHHHHHHHHHHHHHH---HHHHHHHHhcCcHHHHHHH-----HHHHHHHHHHHHHHHHHHhCCCCCcceECC
Confidence            99988876443322211111110   1111112233333345543     222222222222333334558899999999


Q ss_pred             CCCccchhhhhhhhheeecccccchHHHHHHHHHHHHHHHHHHHHHHHcCccchhhHHHHHHHHHHHHHHHHHHHHHHHh
Q 014296          316 KVADEDVSQRYEKEIMEFGAASSMFTILSTLALLNLFCMIGAVKKVIVGDGYVKFYETMLLQILLCSALVLINWPLYQGL  395 (427)
Q Consensus       316 K~~~~~~~~~y~~~~~~f~~~s~l~~p~~~l~~lnl~a~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~p~~~gl  395 (427)
                      |+.+.+.  .+      |   +.+..|..++++++++++++|+++.... .  ........-.+++.+.++.+.+.+...
T Consensus       508 Kg~~~~~--~~------~---~~~~~p~~~~~~l~~~~~~~~~~~~~~~-~--~~~~~~~~~~~w~~~~l~~l~~~~~~~  573 (802)
T 4hg6_A          508 KDETLSE--NY------I---SPIYRPLLFTFLLCLSGVLATLVRWVAF-P--GDRSVLLVVGGWAVLNVLLVGFALRAV  573 (802)
T ss_dssp             CCCCCSS--CC------B---CTTCHHHHHHHHHHHHHHHHHHHHHHHC-G--GGHHHHHHHHHHHHHHHHHHHHHHTTT
T ss_pred             CCccccc--cc------h---hhHHHHHHHHHHHHHHHHHHHHHHHhcc-C--CccchhhhhhHHHHHHHHHHHHHHHHH
Confidence            9865432  11      2   3678899999999999999999998764 1  112223334445555555566666666


Q ss_pred             hcccCCCCCCh
Q 014296          396 FLRKDNGKMPS  406 (427)
Q Consensus       396 ~~r~~k~~~P~  406 (427)
                      ..|+++++-|+
T Consensus       574 ~~~~~~R~~~R  584 (802)
T 4hg6_A          574 AEKQQRRAAPR  584 (802)
T ss_dssp             BCCCCSCCCCC
T ss_pred             hcCCCcccccc
Confidence            76766655554



>2z86_A Chondroitin synthase; GT-A, glycosyltransferase A, fold; HET: UGA UDP; 2.40A {Escherichia coli} PDB: 2z87_A* Back     alignment and structure
>2fy7_A Beta-1,4-galactosyltransferase 1; M339H mutant, APO enzyme; HET: PGE; 1.70A {Homo sapiens} PDB: 2aec_A* 2aes_A* 2ae7_A* 2ah9_A* 2agd_A* 2fya_A* 2fyb_A* 3ee5_A* 2fyc_B* 1tw1_A* 1tw5_A* 1tvy_A* 1nmm_B* 1o0r_A* 1yro_B* 1nf5_B* 1nhe_B* 1nkh_B* 1nqi_B* 1nwg_B* ... Back     alignment and structure
>2z86_A Chondroitin synthase; GT-A, glycosyltransferase A, fold; HET: UGA UDP; 2.40A {Escherichia coli} PDB: 2z87_A* Back     alignment and structure
>1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 Back     alignment and structure
>1qg8_A Protein (spore coat polysaccharide biosynthesis P SPSA); glycosyltransferase, transferase; 1.50A {Bacillus subtilis} SCOP: c.68.1.1 PDB: 1h7q_A* 1h7l_A 1qgq_A* 1qgs_A* Back     alignment and structure
>2d7i_A Polypeptide N-acetylgalactosaminyltransferase 10; beta trefoil, rossmann fold; HET: NAG NGA UDP; 2.50A {Homo sapiens} PDB: 2d7r_A* Back     alignment and structure
>3bcv_A Putative glycosyltransferase protein; protein structure initiative II, PSI-II NYSGXRC, structural genomics; 2.35A {Bacteroides fragilis} Back     alignment and structure
>2ffu_A Ppgalnact-2, polypeptide N-acetylgalactosaminyltransferase 2, protein-UDP; ppgalnact, mucin, glycosyltransferase; HET: UDP; 1.64A {Homo sapiens} PDB: 2ffv_A* Back     alignment and structure
>4fix_A UDP-galactofuranosyl transferase GLFT2; CAZY GT-2 family, glycosyltrans carbohydrate binding, membrane; 2.45A {Mycobacterium tuberculosis} PDB: 4fiy_A* Back     alignment and structure
>3l7i_A Teichoic acid biosynthesis protein F; GT-B fold, monotopic membrane protein, structural protein; 2.70A {Staphylococcus epidermidis} PDB: 3l7j_A 3l7k_A* 3l7l_A* 3l7m_A* Back     alignment and structure
>3ckj_A Putative uncharacterized protein; mycobacteria, unknown function; HET: CIT; 1.80A {Mycobacterium paratuberculosis} PDB: 3ckn_A* 3cko_A* 3ckq_A* 3ckv_A* 3e26_A 3e25_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query427
d1xhba2328 Polypeptide N-acetylgalactosaminyltransferase 1, N 96.22
>d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Nucleotide-diphospho-sugar transferases
superfamily: Nucleotide-diphospho-sugar transferases
family: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain
domain: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=96.22  E-value=0.0033  Score=58.52  Aligned_cols=48  Identities=15%  Similarity=0.023  Sum_probs=38.9

Q ss_pred             ceeccccceeehhhhcCCCCCchhHHHHhhccchhhhhhHHHHHHhhcccccccccccccccccccccCCCC---CchHH
Q 014296           64 PIYCGSGCFHRREILCGRKYDKETKIELKRENDSKREESLLELEETSKALASCTYETNTQWGKEIGLKYGCP---VEDVI  140 (427)
Q Consensus        64 ~~~~GTg~~~RR~aL~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~va~c~Ye~~t~wG~e~G~~y~sv---tED~~  140 (427)
                      +..+|++.++||+++..+              |                                ||+.+-.   .||++
T Consensus       195 ~~~~g~~~~irr~~f~~v--------------G--------------------------------gfDe~~~~~g~ED~D  228 (328)
T d1xhba2         195 PTMAGGLFSIDRDYFQEI--------------G--------------------------------TYDAGMDIWGGENLE  228 (328)
T ss_dssp             SBCCSSSEEEEHHHHHHT--------------T--------------------------------SCCTTSCTTCCCCSH
T ss_pred             ceecceeeeeeHHHHHHh--------------C--------------------------------CCCCCCcCcCchHHH
Confidence            445777778999999984              5                                7766543   49999


Q ss_pred             HHHHHHhCCcEEEEecCC
Q 014296          141 TGISIQCQGWKSVYCKPE  158 (427)
Q Consensus       141 tg~~lh~~GWrs~y~~p~  158 (427)
                      .++|+..+||+..|+ |+
T Consensus       229 l~~R~~~~G~~i~~~-p~  245 (328)
T d1xhba2         229 ISFRIWQCGGTLEIV-TC  245 (328)
T ss_dssp             HHHHHHHTTCEEEEE-EE
T ss_pred             HHHHHHHhCCeEEEe-CC
Confidence            999999999999998 53