Psyllid ID: psy11641


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------
MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRPDWSS
ccccccccccEEEEEEcccEEEEEccccccEEEEcccccccccccHHHHHHHccEEEEEccHHHHHHHHcccccccccccccHHHHHHHHHHccccccHHHHHHHcccccccccccccccEEccccccccccccccccccccccEEEEEccccccccEEEEcccccEEcccccccccccHHHHHHHccccEEEccccEEEEEEccccEEEEEEcccccccccccEEEEccccEEEEccccccccEEccccccEEEccccccccccEEEEEccccccc
ccccccEEEEEEEEEccccEEEEEcccccccccccccccccccEEEcccHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHcccccHHHHHHccccccccccccccccccEEEEEccccccEEcccccccccEEEEEccccccccEEEEEEccccccccccEEEcccccccccccEcccccccccccEEEEccccccEEEEEccccccccccEEEEEccccEEEEcccccEcccccccccEEEEEccccccccEEEEEcccccccc
mgrtapavigtsychgsgssqlfrlnaegqlghgercvdaDKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDvytkfpelppnlfwgeaknlgtqkcldsmgrtapavigtsychgsgssqlfrlnaegqlghgercvdaDKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTvlnlfncfnssgtvdgpwrfhnstRQLEHAILKRClsvhpetnqlamlrcdennSYQQWRfkevrpdwss
mgrtapavigtsychgsgsSQLFRLNAEGQLGHGERCVDADKQVISTQksslelrkrsswlKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQqwrfkevrpdwss
MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRPDWSS
********IGTSYCHG****QLFRLNAEGQLGHGERCVD*****************RSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF*********
*****PA**GTSYCHGSGSSQLFRLNAEGQLGHGERCVDA*****STQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP****
MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK********
***TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRPDW**
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRPDWSS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query277 2.2.26 [Sep-21-2011]
Q8MV48591 N-acetylgalactosaminyltra yes N/A 0.628 0.294 0.519 3e-59
O61397601 Probable N-acetylgalactos yes N/A 0.671 0.309 0.354 3e-35
Q80VA0657 N-acetylgalactosaminyltra yes N/A 0.682 0.287 0.279 7e-21
Q86SF2657 N-acetylgalactosaminyltra yes N/A 0.682 0.287 0.270 7e-21
Q5RFJ6657 N-acetylgalactosaminyltra yes N/A 0.682 0.287 0.270 8e-21
Q9R0C5657 N-acetylgalactosaminyltra yes N/A 0.682 0.287 0.275 2e-20
Q95ZJ1626 Polypeptide N-acetylgalac no N/A 0.559 0.247 0.296 1e-17
Q10473559 Polypeptide N-acetylgalac no N/A 0.462 0.228 0.358 3e-17
Q10472559 Polypeptide N-acetylgalac no N/A 0.462 0.228 0.358 3e-17
O08912559 Polypeptide N-acetylgalac no N/A 0.462 0.228 0.350 9e-17
>sp|Q8MV48|GALT7_DROME N-acetylgalactosaminyltransferase 7 OS=Drosophila melanogaster GN=GalNAc-T2 PE=2 SV=2 Back     alignment and function desciption
 Score =  228 bits (582), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++  CD N+SYQQW FKE+RP W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIRPRW 591




Glycopeptide transferase involved in O-linked oligosaccharide biosynthesis, which catalyzes the transfer of an N-acetyl-D-galactosamine residue to an already glycosylated peptide. In contrast to other proteins of the family, it does not act as a peptide transferase that transfers GalNAc onto serine or threonine residue on the protein receptor, but instead requires the prior addition of a GalNAc on a peptide before adding additional GalNAc moieties. Some peptide transferase activity is however not excluded, considering that its appropriate peptide substrate may remain unidentified. Prefers the monoglycosylated Muc5AC-3 as substrate.
Drosophila melanogaster (taxid: 7227)
EC: 2EC: .EC: 4EC: .EC: 1EC: .EC: -
>sp|O61397|GALT7_CAEEL Probable N-acetylgalactosaminyltransferase 7 OS=Caenorhabditis elegans GN=gly-7 PE=2 SV=1 Back     alignment and function description
>sp|Q80VA0|GALT7_MOUSE N-acetylgalactosaminyltransferase 7 OS=Mus musculus GN=Galnt7 PE=2 SV=2 Back     alignment and function description
>sp|Q86SF2|GALT7_HUMAN N-acetylgalactosaminyltransferase 7 OS=Homo sapiens GN=GALNT7 PE=1 SV=1 Back     alignment and function description
>sp|Q5RFJ6|GALT7_PONAB N-acetylgalactosaminyltransferase 7 OS=Pongo abelii GN=GALNT7 PE=2 SV=1 Back     alignment and function description
>sp|Q9R0C5|GALT7_RAT N-acetylgalactosaminyltransferase 7 OS=Rattus norvegicus GN=Galnt7 PE=2 SV=1 Back     alignment and function description
>sp|Q95ZJ1|GALT5_CAEEL Polypeptide N-acetylgalactosaminyltransferase 5 OS=Caenorhabditis elegans GN=gly-5 PE=2 SV=2 Back     alignment and function description
>sp|Q10473|GALT1_RAT Polypeptide N-acetylgalactosaminyltransferase 1 OS=Rattus norvegicus GN=Galnt1 PE=1 SV=1 Back     alignment and function description
>sp|Q10472|GALT1_HUMAN Polypeptide N-acetylgalactosaminyltransferase 1 OS=Homo sapiens GN=GALNT1 PE=1 SV=1 Back     alignment and function description
>sp|O08912|GALT1_MOUSE Polypeptide N-acetylgalactosaminyltransferase 1 OS=Mus musculus GN=Galnt1 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query277
195567537 411 GD15630 [Drosophila simulans] gi|1942047 0.624 0.420 0.522 1e-57
193683588 588 PREDICTED: n-acetylgalactosaminyltransfe 0.631 0.297 0.507 1e-57
194766810 591 GF22410 [Drosophila ananassae] gi|190619 0.628 0.294 0.524 1e-57
194892500 591 GG18114 [Drosophila erecta] gi|190649322 0.628 0.294 0.519 2e-57
21552985 591 UDP-N-acetylgalactosamine: polypeptide N 0.628 0.294 0.519 2e-57
24643052 591 UDP-N-acetyl-alpha-D-galactosamine:polyp 0.628 0.294 0.519 2e-57
195481361 591 GE15519 [Drosophila yakuba] gi|194189143 0.628 0.294 0.519 2e-57
16198165 486 LD36616p [Drosophila melanogaster] 0.628 0.358 0.519 2e-57
195447414 587 GK25256 [Drosophila willistoni] gi|19416 0.628 0.296 0.509 2e-57
16769336214 LD27021p [Drosophila melanogaster] 0.628 0.813 0.519 3e-57
>gi|195567537|ref|XP_002107316.1| GD15630 [Drosophila simulans] gi|194204722|gb|EDX18298.1| GD15630 [Drosophila simulans] Back     alignment and taxonomy information
 Score =  229 bits (584), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 106/203 (52%), Positives = 137/203 (67%), Gaps = 30/203 (14%)

Query: 73  PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
           PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +  
Sbjct: 239 PLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASDG 298

Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
           CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q             
Sbjct: 299 CLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------- 345

Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
               G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T QL
Sbjct: 346 ----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQL 388

Query: 253 AMLRCDENNSYQQWRFKEVRPDW 275
           ++  CD N+SYQQW FKE+RP W
Sbjct: 389 SLGHCDVNDSYQQWWFKEIRPRW 411




Source: Drosophila simulans

Species: Drosophila simulans

Genus: Drosophila

Family: Drosophilidae

Order: Diptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|193683588|ref|XP_001951150.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Acyrthosiphon pisum] Back     alignment and taxonomy information
>gi|194766810|ref|XP_001965517.1| GF22410 [Drosophila ananassae] gi|190619508|gb|EDV35032.1| GF22410 [Drosophila ananassae] Back     alignment and taxonomy information
>gi|194892500|ref|XP_001977673.1| GG18114 [Drosophila erecta] gi|190649322|gb|EDV46600.1| GG18114 [Drosophila erecta] Back     alignment and taxonomy information
>gi|21552985|gb|AAM62412.1|AF493067_1 UDP-N-acetylgalactosamine: polypeptide N-acetylgalactosaminyltransferase 2 [Drosophila melanogaster] Back     alignment and taxonomy information
>gi|24643052|ref|NP_573301.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 2, isoform A [Drosophila melanogaster] gi|24643054|ref|NP_728178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 2, isoform B [Drosophila melanogaster] gi|51316019|sp|Q8MV48.2|GALT7_DROME RecName: Full=N-acetylgalactosaminyltransferase 7; AltName: Full=Protein-UDP acetylgalactosaminyltransferase 7; AltName: Full=UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7; AltName: Full=dGalNAc-T2 gi|7293476|gb|AAF48851.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 2, isoform A [Drosophila melanogaster] gi|22832507|gb|AAN09470.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 2, isoform B [Drosophila melanogaster] gi|34043004|gb|AAQ56704.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Drosophila melanogaster] gi|54650858|gb|AAV37008.1| LD01328p [Drosophila melanogaster] gi|220950352|gb|ACL87719.1| GalNAc-T2-PA [synthetic construct] Back     alignment and taxonomy information
>gi|195481361|ref|XP_002101619.1| GE15519 [Drosophila yakuba] gi|194189143|gb|EDX02727.1| GE15519 [Drosophila yakuba] Back     alignment and taxonomy information
>gi|16198165|gb|AAL13889.1| LD36616p [Drosophila melanogaster] Back     alignment and taxonomy information
>gi|195447414|ref|XP_002071203.1| GK25256 [Drosophila willistoni] gi|194167288|gb|EDW82189.1| GK25256 [Drosophila willistoni] Back     alignment and taxonomy information
>gi|16769336|gb|AAL28887.1| LD27021p [Drosophila melanogaster] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query277
FB|FBgn0030930591 GalNAc-T2 "UDP-N-acetyl-alpha- 0.386 0.181 0.644 1.4e-56
WB|WBGene00001632601 gly-7 [Caenorhabditis elegans 0.444 0.204 0.475 5.9e-35
UNIPROTKB|H0YAH3454 GALNT7 "N-acetylgalactosaminyl 0.472 0.288 0.352 6.5e-23
UNIPROTKB|F1PX34608 F1PX34 "Uncharacterized protei 0.415 0.189 0.382 1.7e-22
MGI|MGI:1349449657 Galnt7 "UDP-N-acetyl-alpha-D-g 0.411 0.173 0.396 1.8e-22
UNIPROTKB|F1Q3F3657 GALNT7 "Uncharacterized protei 0.415 0.175 0.382 2.3e-22
UNIPROTKB|Q86SF2657 GALNT7 "N-acetylgalactosaminyl 0.472 0.199 0.352 3.8e-22
UNIPROTKB|F1LVI8622 Galnt7 "N-acetylgalactosaminyl 0.411 0.183 0.396 3.9e-22
RGD|620362657 Galnt7 "UDP-N-acetyl-alpha-D-g 0.411 0.173 0.396 4.8e-22
UNIPROTKB|Q9R0C5657 Galnt7 "N-acetylgalactosaminyl 0.411 0.173 0.396 4.8e-22
FB|FBgn0030930 GalNAc-T2 "UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 2" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 400 (145.9 bits), Expect = 1.4e-56, Sum P(2) = 1.4e-56
 Identities = 69/107 (64%), Positives = 86/107 (80%)

Query:    73 PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
             PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +  
Sbjct:   419 PLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASDG 478

Query:   133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
             CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct:   479 CLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ 525


GO:0004653 "polypeptide N-acetylgalactosaminyltransferase activity" evidence=ISS;IDA;NAS
GO:0009312 "oligosaccharide biosynthetic process" evidence=IDA
GO:0005795 "Golgi stack" evidence=NAS
WB|WBGene00001632 gly-7 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
UNIPROTKB|H0YAH3 GALNT7 "N-acetylgalactosaminyltransferase 7" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1PX34 F1PX34 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
MGI|MGI:1349449 Galnt7 "UDP-N-acetyl-alpha-D-galactosamine: polypeptide N-acetylgalactosaminyltransferase 7" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|F1Q3F3 GALNT7 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|Q86SF2 GALNT7 "N-acetylgalactosaminyltransferase 7" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1LVI8 Galnt7 "N-acetylgalactosaminyltransferase 7" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
RGD|620362 Galnt7 "UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 7 (GalNAc-T7)" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|Q9R0C5 Galnt7 "N-acetylgalactosaminyltransferase 7" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q8MV48GALT7_DROME2, ., 4, ., 1, ., -0.51960.62810.2944yesN/A

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query277
cd02510299 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the fo 2e-10
cd00161124 cd00161, RICIN, Ricin-type beta-trefoil; Carbohydr 5e-10
pfam00652124 pfam00652, Ricin_B_lectin, Ricin-type beta-trefoil 1e-09
smart00458118 smart00458, RICIN, Ricin-type beta-trefoil 1e-08
cd00161124 cd00161, RICIN, Ricin-type beta-trefoil; Carbohydr 0.001
cd00161124 cd00161, RICIN, Ricin-type beta-trefoil; Carbohydr 0.002
>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of mucin-type O-linked glycans Back     alignment and domain information
 Score = 59.9 bits (146), Expect = 2e-10
 Identities = 17/47 (36%), Positives = 29/47 (61%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVA 106
           W+  +K+      P    +D GD+SE+ ALR++L CK F W++++V 
Sbjct: 253 WMDEYKEYFYKARPELRNIDYGDLSERKALRERLKCKSFKWYLENVY 299


UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins. Length = 299

>gnl|CDD|238092 cd00161, RICIN, Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication Back     alignment and domain information
>gnl|CDD|216044 pfam00652, Ricin_B_lectin, Ricin-type beta-trefoil lectin domain Back     alignment and domain information
>gnl|CDD|214672 smart00458, RICIN, Ricin-type beta-trefoil Back     alignment and domain information
>gnl|CDD|238092 cd00161, RICIN, Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication Back     alignment and domain information
>gnl|CDD|238092 cd00161, RICIN, Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 277
KOG3736|consensus578 100.0
KOG3737|consensus603 100.0
KOG3738|consensus559 100.0
PF00652124 Ricin_B_lectin: Ricin-type beta-trefoil lectin dom 99.72
cd00161124 RICIN Ricin-type beta-trefoil; Carbohydrate-bindin 99.7
smart00458117 RICIN Ricin-type beta-trefoil. Carbohydrate-bindin 99.64
cd02510299 pp-GalNAc-T pp-GalNAc-T initiates the formation of 99.46
cd00161124 RICIN Ricin-type beta-trefoil; Carbohydrate-bindin 99.05
PF00652124 Ricin_B_lectin: Ricin-type beta-trefoil lectin dom 98.92
smart00458117 RICIN Ricin-type beta-trefoil. Carbohydrate-bindin 98.85
KOG3737|consensus603 98.81
PF14200105 RicinB_lectin_2: Ricin-type beta-trefoil lectin do 98.76
KOG3736|consensus578 98.54
PF14200105 RicinB_lectin_2: Ricin-type beta-trefoil lectin do 98.36
KOG3738|consensus559 97.3
PF03498150 CDtoxinA: Cytolethal distending toxin A/C family; 94.92
PLN02899633 alpha-galactosidase 82.3
>KOG3736|consensus Back     alignment and domain information
Probab=100.00  E-value=1.8e-56  Score=427.95  Aligned_cols=226  Identities=26%  Similarity=0.508  Sum_probs=197.2

Q ss_pred             eeEEEeccCCCCceeEEeccCCeeEEEEeeccccccccc-hhhccccceEEEeeccchhhHhhcCCCccccCCCCChHHH
Q psy11641          8 VIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVIS-TQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQ   86 (277)
Q Consensus         8 ~v~l~~C~g~~~nq~~~~~~~s~vgh~~rc~~~~~~~~~-~~~~~N~~R~aevWmDeyk~~~y~~~p~~~~~~~gdis~r   86 (277)
                      .+++|+|||+     +++.||||||||||..+||.++.+ ..+.+|++|+|||||||||++||.++|.++.+|+||||+|
T Consensus       348 SfrvWqCGG~-----lei~PCSrVGHifRk~~pY~~p~~~~~~~~N~~RlAeVWmDeyK~~~y~~~P~~~~~d~GDvseR  422 (578)
T KOG3736|consen  348 SFRVWQCGGR-----LEIVPCSRVGHIFRKRKPYTFPDGTDTATRNLKRLAEVWMDEYKEQFYKRMPGLRNIDEGDLTER  422 (578)
T ss_pred             eEEEeccCCe-----EEecCccceeeeeecCCCccCCCcchhhhhchhhhhhhhhHHHHHHHHhhCccccccCCCCchhH
Confidence            3578999998     899999999999999999999765 5568999999999999999999999999999999999999


Q ss_pred             HHHHhhcCCcCcchhhhhccccccccCCCCCCCCceeeEEecCccccccccCC--CCCceEEEEecCCCCCcceEEECCC
Q psy11641         87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGR--TAPAVIGTSYCHGSGSSQLFRLNAE  164 (277)
Q Consensus        87 ~~LR~~l~CksF~Wyl~nv~p~~~~~~p~~~~~~~~G~Irn~~s~~CLd~~~~--~~g~~v~l~~C~~~~~~Q~W~~t~~  164 (277)
                      ++||++|+||||+|||+|||||++.  |.. ...+.|+|+|..++.|+|....  ..|.++++++||+.+.+|.|.||..
T Consensus       423 ~~LR~~L~CKsFkWyL~nVyPel~~--~~~-~~~~~G~i~~~~~~~cld~~~~~~~~~~~~~~~~Ch~~~~~Q~~~yT~~  499 (578)
T KOG3736|consen  423 KALRERLNCKSFKWYLENVYPELYL--PTP-HVYASGEIRNGNPNLCLDTERAPAGQGMAVGLYPCHGPGGNQYFPYTKQ  499 (578)
T ss_pred             HHHHHhcCCccccchHhhcCccccC--CCC-cccccceeccCCcchhhhhhchhccCCCcceEecCCCccccccccccCC
Confidence            9999999999999999999999973  332 2368899999889999999863  2357999999999989999999999


Q ss_pred             CccccCCcceeecCCchhhhhhcccceeeecccEEEEeecCCCceeeeeccCCCCCCCCceEEeCCCCceeeCccCccce
Q psy11641        165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLS  244 (277)
Q Consensus       165 g~ir~~~~CLd~~~~~~~~~~~~~p~~~~~~~~v~l~~C~~~~~~~~~~C~~~~~~~~Q~W~~~~~~~~l~~~~s~~CLd  244 (277)
                      ++||.+..||++...                ..|+           +++||..+   +|.|.|+.+ ++|+|..||+||+
T Consensus       500 ~eir~~~~cl~~~~~----------------~~v~-----------l~~C~~~~---~q~w~~~~~-~~i~~~~sg~CL~  548 (578)
T KOG3736|consen  500 GEIRIGDLCLDVDDA----------------GKVT-----------LYDCHKMG---NQLWHYDKD-GTLYHRNSGKCLE  548 (578)
T ss_pred             cceEECCEEeccccC----------------CceE-----------EEeccccc---ccceEEcCC-CceEcCCCCcccc
Confidence            999999999998521                1244           45554444   789999985 9999999999999


Q ss_pred             eecCCCceEEecCCCCCCCCceEEeeeC
Q psy11641        245 VHPETNQLAMLRCDENNSYQQWRFKEVR  272 (277)
Q Consensus       245 ~~~~~~~v~l~~C~~~~~~Q~W~~~~~~  272 (277)
                      +......+.+..|+.+++.|+|.|.+++
T Consensus       549 ~~~~~~~~~l~~c~~~~~~Q~W~~~~~~  576 (578)
T KOG3736|consen  549 AAVDKNGLILVACDPSDPTQQWLFEHVN  576 (578)
T ss_pred             ccCCCCCceEeecCCCCCcceEEEEecC
Confidence            9865445999999998889999999876



>KOG3737|consensus Back     alignment and domain information
>KOG3738|consensus Back     alignment and domain information
>PF00652 Ricin_B_lectin: Ricin-type beta-trefoil lectin domain; InterPro: IPR000772 Ricin is a legume lectin from the seeds of the castor bean plant, Ricinus communis Back     alignment and domain information
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication Back     alignment and domain information
>smart00458 RICIN Ricin-type beta-trefoil Back     alignment and domain information
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans Back     alignment and domain information
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication Back     alignment and domain information
>PF00652 Ricin_B_lectin: Ricin-type beta-trefoil lectin domain; InterPro: IPR000772 Ricin is a legume lectin from the seeds of the castor bean plant, Ricinus communis Back     alignment and domain information
>smart00458 RICIN Ricin-type beta-trefoil Back     alignment and domain information
>KOG3737|consensus Back     alignment and domain information
>PF14200 RicinB_lectin_2: Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A Back     alignment and domain information
>KOG3736|consensus Back     alignment and domain information
>PF14200 RicinB_lectin_2: Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A Back     alignment and domain information
>KOG3738|consensus Back     alignment and domain information
>PF03498 CDtoxinA: Cytolethal distending toxin A/C family; InterPro: IPR003558 Escherichia coli, Haemophilus spp and Campylobacter spp Back     alignment and domain information
>PLN02899 alpha-galactosidase Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query277
1xhb_A472 The Crystal Structure Of Udp-Galnac: Polypeptide Al 5e-18
2d7i_A570 Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac 9e-11
2ffu_A501 Crystal Structure Of Human Ppgalnact-2 Complexed Wi 5e-10
>pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N- Acetylgalactosaminyltransferase-T1 Length = 472 Back     alignment and structure

Iteration: 1

Score = 87.8 bits (216), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 46/129 (35%), Positives = 70/129 (54%), Gaps = 6/129 (4%) Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119 W+ FK + SP +D GDIS ++ LR KL CK FSW+++++ D ++P + Sbjct: 286 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPD-----SQIPRH 340 Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178 F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K Sbjct: 341 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 400 Query: 179 QEKLVEMLK 187 V MLK Sbjct: 401 LNGPVTMLK 409
>pdb|2D7I|A Chain A, Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac And Mn2+ Length = 570 Back     alignment and structure
>pdb|2FFU|A Chain A, Crystal Structure Of Human Ppgalnact-2 Complexed With Udp And Ea2 Length = 501 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query277
2ffu_A501 Ppgalnact-2, polypeptide N-acetylgalactosaminyltra 6e-38
2ffu_A501 Ppgalnact-2, polypeptide N-acetylgalactosaminyltra 4e-06
1xhb_A472 Polypeptide N-acetylgalactosaminyltransferase 1; g 1e-36
1xhb_A472 Polypeptide N-acetylgalactosaminyltransferase 1; g 2e-06
2d7i_A570 Polypeptide N-acetylgalactosaminyltransferase 10; 4e-35
3a07_A118 Actinohivin; carbohydrate-binding module family 13 8e-08
1vcl_A 432 Hemolytic lectin CEL-III; hemolysis, hemagglutinat 5e-06
1vcl_A 432 Hemolytic lectin CEL-III; hemolysis, hemagglutinat 1e-05
1vcl_A 432 Hemolytic lectin CEL-III; hemolysis, hemagglutinat 3e-05
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-04
2vlc_A570 Cinnamomin, type 2 ribosome-inactivating protein c 6e-04
>2ffu_A Ppgalnact-2, polypeptide N-acetylgalactosaminyltransferase 2, protein-UDP; ppgalnact, mucin, glycosyltransferase; HET: UDP; 1.64A {Homo sapiens} PDB: 2ffv_A* Length = 501 Back     alignment and structure
 Score =  138 bits (349), Expect = 6e-38
 Identities = 44/214 (20%), Positives = 78/214 (36%), Gaps = 31/214 (14%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V  ++     
Sbjct: 313 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENVYPEL---RV 369

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
               ++ +G  +      CLD++G  A  V+G   CH +G +Q + L  E  + H + C+
Sbjct: 370 PDHQDIAFGALQQ--GTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDLCL 427

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
               +                          G+++ L  C  +       W       +L
Sbjct: 428 TVVDRAP------------------------GSLIKLQGCRENDS--RQKWEQIEGNSKL 461

Query: 235 EHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            H     CL      +    +        QQW+F
Sbjct: 462 RHVGSNLCLDSRTAKSGGLSVEVCGPALSQQWKF 495


>2ffu_A Ppgalnact-2, polypeptide N-acetylgalactosaminyltransferase 2, protein-UDP; ppgalnact, mucin, glycosyltransferase; HET: UDP; 1.64A {Homo sapiens} PDB: 2ffv_A* Length = 501 Back     alignment and structure
>1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 Length = 472 Back     alignment and structure
>1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 Length = 472 Back     alignment and structure
>2d7i_A Polypeptide N-acetylgalactosaminyltransferase 10; beta trefoil, rossmann fold; HET: NAG NGA UDP; 2.50A {Homo sapiens} PDB: 2d7r_A* Length = 570 Back     alignment and structure
>3a07_A Actinohivin; carbohydrate-binding module family 13, antiviral protein, LE; 1.19A {Actinomycete} Length = 118 Back     alignment and structure
>1vcl_A Hemolytic lectin CEL-III; hemolysis, hemagglutination, pore-forming, calcium, toxin; HET: BTB; 1.70A {Cucumaria echinata} SCOP: b.42.2.1 b.42.2.1 d.281.1.1 PDB: 2z48_A* 2z49_A* Length = 432 Back     alignment and structure
>1vcl_A Hemolytic lectin CEL-III; hemolysis, hemagglutination, pore-forming, calcium, toxin; HET: BTB; 1.70A {Cucumaria echinata} SCOP: b.42.2.1 b.42.2.1 d.281.1.1 PDB: 2z48_A* 2z49_A* Length = 432 Back     alignment and structure
>1vcl_A Hemolytic lectin CEL-III; hemolysis, hemagglutination, pore-forming, calcium, toxin; HET: BTB; 1.70A {Cucumaria echinata} SCOP: b.42.2.1 b.42.2.1 d.281.1.1 PDB: 2z48_A* 2z49_A* Length = 432 Back     alignment and structure
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
>2vlc_A Cinnamomin, type 2 ribosome-inactivating protein cinnamomin III; ribosome inhibiting proteins, toxin, hydrolase, plant defense; HET: AS5 BMA XYP XYS; 2.95A {Cinnamomum camphora} Length = 570 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query277
2d7i_A570 Polypeptide N-acetylgalactosaminyltransferase 10; 100.0
2ffu_A501 Ppgalnact-2, polypeptide N-acetylgalactosaminyltra 100.0
1xhb_A472 Polypeptide N-acetylgalactosaminyltransferase 1; g 100.0
3a21_A614 Putative secreted alpha-galactosidase; beta-alpha- 100.0
2aai_B262 Ricin (B chain); glycosidase; HET: GAL BGC NDG NAG 99.92
1ggp_B254 TKL-1, protein (lectin 1 B chain); sugar binding p 99.91
1abr_B267 Abrin-A; glycosidase/carbohydrate complex, lectin, 99.9
1m2t_B263 Mistletoe lectin I B chain; ribosome inactivation, 99.9
3c9z_A258 Agglutinin II, SNA-II; beta-trefoil, ricin-B domai 99.89
2vlc_A570 Cinnamomin, type 2 ribosome-inactivating protein c 99.87
1vcl_A 432 Hemolytic lectin CEL-III; hemolysis, hemagglutinat 99.87
2aai_B262 Ricin (B chain); glycosidase; HET: GAL BGC NDG NAG 99.86
3c9z_A258 Agglutinin II, SNA-II; beta-trefoil, ricin-B domai 99.86
1knl_A130 Endo-1,4-beta-xylanase A; carbohydrate binding mod 99.84
1m2t_B263 Mistletoe lectin I B chain; ribosome inactivation, 99.83
1vcl_A 432 Hemolytic lectin CEL-III; hemolysis, hemagglutinat 99.83
3a07_A118 Actinohivin; carbohydrate-binding module family 13 99.82
2dry_A130 29-kDa galactose-binding lectin; earthworm lumbric 99.8
2ao3_A130 29-kDa galactose-binding lectin; earthworm lumbric 99.76
1abr_B 267 Abrin-A; glycosidase/carbohydrate complex, lectin, 99.75
3pg0_A165 Threefoil; symmetric design, beta-trefoil, enginee 99.74
1ggp_B 254 TKL-1, protein (lectin 1 B chain); sugar binding p 99.72
2d1z_A436 Endo-1,4-beta-D-xylanase; TIM-barrel, retaining en 99.72
2vlc_A 570 Cinnamomin, type 2 ribosome-inactivating protein c 99.63
3phz_A 286 Ricin B-related lectin; beta trefoil, saccharide b 99.6
3vsf_A526 Ricin B lectin; GH43 CBM13, EXO-beta-1,3-galactana 99.57
3aj6_A286 Main hemagglutinin component; toxin, beta-trefoil; 99.49
3ef2_A 293 Agglutinin, lectin; beta-trefoil, calcium-binding, 99.48
2dry_A130 29-kDa galactose-binding lectin; earthworm lumbric 99.46
1knl_A130 Endo-1,4-beta-xylanase A; carbohydrate binding mod 99.4
3nbc_A148 Ricin B-like lectin; lactose, sugar BIND protein; 99.4
1ybi_A288 HA33A, HA33/A, non-toxin haemagglutinin HA34; beta 99.39
2vse_A 841 MTX holotoxin, mosquitocidal toxin; ADP-ribosyltra 99.37
2vse_A 841 MTX holotoxin, mosquitocidal toxin; ADP-ribosyltra 99.37
3a07_A118 Actinohivin; carbohydrate-binding module family 13 99.34
2ffu_A501 Ppgalnact-2, polypeptide N-acetylgalactosaminyltra 99.34
3aj6_A286 Main hemagglutinin component; toxin, beta-trefoil; 99.33
3pg0_A165 Threefoil; symmetric design, beta-trefoil, enginee 99.33
2ao3_A130 29-kDa galactose-binding lectin; earthworm lumbric 99.32
1xhb_A472 Polypeptide N-acetylgalactosaminyltransferase 1; g 99.32
2d7i_A570 Polypeptide N-acetylgalactosaminyltransferase 10; 99.26
2d1z_A436 Endo-1,4-beta-D-xylanase; TIM-barrel, retaining en 99.19
1ybi_A288 HA33A, HA33/A, non-toxin haemagglutinin HA34; beta 99.14
3a21_A614 Putative secreted alpha-galactosidase; beta-alpha- 99.09
3vsf_A526 Ricin B lectin; GH43 CBM13, EXO-beta-1,3-galactana 99.06
3phz_A286 Ricin B-related lectin; beta trefoil, saccharide b 98.99
3ef2_A293 Agglutinin, lectin; beta-trefoil, calcium-binding, 98.89
3nbc_A148 Ricin B-like lectin; lactose, sugar BIND protein; 98.32
1xez_A 721 Hemolysin, cytolysin; pore-forming toxin, Pro-toxi 98.02
1dqg_A135 Mannose receptor; beta trefoil, multilectin recept 97.84
1sr4_A206 CDT A, cytolethal distending toxin subunit A; bact 97.81
1xez_A 721 Hemolysin, cytolysin; pore-forming toxin, Pro-toxi 97.46
1dqg_A135 Mannose receptor; beta trefoil, multilectin recept 96.68
1sr4_A206 CDT A, cytolethal distending toxin subunit A; bact 96.14
2x2s_A153 Agglutinin, agglutinin SSA; fungal lectin, beta-tr 95.65
1sr4_C166 CDT C, cytolethal distending toxin protein C; bact 86.68
2x2s_A153 Agglutinin, agglutinin SSA; fungal lectin, beta-tr 85.45
>2d7i_A Polypeptide N-acetylgalactosaminyltransferase 10; beta trefoil, rossmann fold; HET: NAG NGA UDP; 2.50A {Homo sapiens} PDB: 2d7r_A* Back     alignment and structure
Probab=100.00  E-value=4.8e-47  Score=370.76  Aligned_cols=230  Identities=23%  Similarity=0.477  Sum_probs=201.0

Q ss_pred             EEEeccCCCCceeEEeccCCeeEEEEeeccccccccchhhccccceEEEeeccchhhHhhcCCCccccCCCCChHHHHHH
Q psy11641         10 GTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVAL   89 (277)
Q Consensus        10 ~l~~C~g~~~nq~~~~~~~s~vgh~~rc~~~~~~~~~~~~~~N~~R~aevWmDeyk~~~y~~~p~~~~~~~gdis~r~~L   89 (277)
                      ++|.|++.     +.+.|+++|.|++|-..|+..+.+....+|.+|++++|||+|+++||.++|.++.+++||+++|++|
T Consensus       319 Rl~~~G~~-----i~~~p~~~v~H~~r~~~~~~~~~~~~~~~n~~r~~~~W~d~~~~~~y~~~p~~~~~~~g~~~~r~~l  393 (570)
T 2d7i_A          319 KVWMCGGR-----MEDIPCSRVGHIYRKYVPYKVPAGVSLARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAVQKKL  393 (570)
T ss_dssp             HHHHTTCE-----EEEEEEEEEEECCCSSCCSCCCSSCCHHHHHHHHHHHHCGGGHHHHHTTCGGGTTSCCCCCHHHHHH
T ss_pred             HHHhCCCe-----EEEccCeEEEEEccccCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCcccccCCccchHHHHHH
Confidence            45677665     7778999999999988787766554557999999999999999999999999999999999999999


Q ss_pred             HhhcCCcCcchhhhhccccccccCCCC-CCCCceeeEEecCccccccccCCCCCceEEEEecCCC------CCcceEEEC
Q psy11641         90 RDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGS------GSSQLFRLN  162 (277)
Q Consensus        90 R~~l~CksF~Wyl~nv~p~~~~~~p~~-~~~~~~G~Irn~~s~~CLd~~~~~~g~~v~l~~C~~~------~~~Q~W~~t  162 (277)
                      |++|+||+|+|||+|||||+..+||.+ ++..++|+|+|.++++|||+.+...|+++++++||+.      +.+|+|.|+
T Consensus       394 r~~l~ck~f~wyl~~v~~~~~~~~p~~~~~~~~~G~irn~~s~~CLD~~~~~~g~~v~l~~C~g~~~~~~~~~nQ~w~~t  473 (570)
T 2d7i_A          394 RSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFT  473 (570)
T ss_dssp             HHHHTCCCHHHHHHHTSTTHHHHSCSSCCCCSEEEEEEETTTCCEECCCSCTTTCCCCEECCCTTCSCGGGCCCCCEEEC
T ss_pred             HHHcCCCCcHHHHHHhhhhcccccCcccccccccceeeecccCccccccCCCCCCeEEEEECCCCCcCCCCCCCceEEEe
Confidence            999999999999999999998778864 4567899999999999999988667899999999983      579999999


Q ss_pred             CCCcccc------CCcceeecCCchhhhhhcccceeeecccEEEEeecCCCceeeeeccCCCCCCCCceEEeCCCCceee
Q psy11641        163 AEGQLGH------GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH  236 (277)
Q Consensus       163 ~~g~ir~------~~~CLd~~~~~~~~~~~~~p~~~~~~~~v~l~~C~~~~~~~~~~C~~~~~~~~Q~W~~~~~~~~l~~  236 (277)
                      .+++|+.      .++|||+.+.                          ++.|++|+|+++.+  +|+|.|+. +|+|+|
T Consensus       474 ~~g~ir~~~~~~~~~~CLd~~~~--------------------------g~~v~l~~C~~~~~--nQ~W~~~~-~g~i~~  524 (570)
T 2d7i_A          474 WREDIRPGDPQHTKKFCFDAISH--------------------------TSPVTLYDCHSMKG--NQLWKYRK-DKTLYH  524 (570)
T ss_dssp             TTSCEEESCTTSCCSBCCBCCSS--------------------------SSBCBCCBCCSSSS--TTCCEECT-TSCEEB
T ss_pred             cCCceeecccccCCCcceeecCC--------------------------CCeEEEEeCCCCCc--CCeEEECC-CCcEEe
Confidence            9999984      5899998642                          46678888888654  78999985 899999


Q ss_pred             CccCccceeecCCCceEEecCCCCCCCCceEEeeeCC
Q psy11641        237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP  273 (277)
Q Consensus       237 ~~s~~CLd~~~~~~~v~l~~C~~~~~~Q~W~~~~~~~  273 (277)
                      ..||+|||+.+.+..|++++|++++++|+|.|.++++
T Consensus       525 ~~sg~CLd~~~~~~~v~l~~C~~~~~~Q~W~~~~~~~  561 (570)
T 2d7i_A          525 PVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHTNS  561 (570)
T ss_dssp             TTTTBEEEECTTTCCEEEECCCTTCSTTCEEEEECCH
T ss_pred             CCCCeeEeeeCCCCeEEEEeCCCcCCCCEEEEecCCH
Confidence            9999999998667789999999777799999999874



>2ffu_A Ppgalnact-2, polypeptide N-acetylgalactosaminyltransferase 2, protein-UDP; ppgalnact, mucin, glycosyltransferase; HET: UDP; 1.64A {Homo sapiens} PDB: 2ffv_A* Back     alignment and structure
>1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 Back     alignment and structure
>3a21_A Putative secreted alpha-galactosidase; beta-alpha-barrel, greek KEY motif, beta-jellyroll, beta-TRE hydrolase; HET: GOL 1PG EPE; 1.51A {Streptomyces avermitilis} PDB: 3a22_A* 3a23_A* Back     alignment and structure
>2aai_B Ricin (B chain); glycosidase; HET: GAL BGC NDG NAG BMA MAN; 2.50A {Ricinus communis} SCOP: b.42.2.1 b.42.2.1 PDB: 3rti_B* 3rtj_B* 1rzo_B* Back     alignment and structure
>1ggp_B TKL-1, protein (lectin 1 B chain); sugar binding protein; 2.70A {Trichosanthes kirilowii} SCOP: b.42.2.1 b.42.2.1 Back     alignment and structure
>1abr_B Abrin-A; glycosidase/carbohydrate complex, lectin, lectin (agglutinin), glycoprotein, plant SEED protein, plant toxin; HET: NDG NGZ MAN BMA BGC; 2.14A {Abrus precatorius} SCOP: b.42.2.1 b.42.2.1 PDB: 2zr1_B* 2q3n_B* Back     alignment and structure
>1m2t_B Mistletoe lectin I B chain; ribosome inactivation, ribosome inhibitor, hydrolase; HET: NAG FUC ADE; 1.89A {Viscum album} SCOP: b.42.2.1 b.42.2.1 PDB: 2r9k_B* 3d7w_B* 3cef_B* 1sz6_B* 1pum_B* 1puu_B* 2rg9_B* 3o5w_B* 1onk_B* 1oql_B* 1tfm_B* 1pc8_B* 1yf8_B* 1ce7_B* 2mll_B* Back     alignment and structure
>3c9z_A Agglutinin II, SNA-II; beta-trefoil, ricin-B domain, glycosylation, glycoprotein, L sugar binding protein, plant protein; HET: NAG FUC SO4; 1.35A {Sambucus nigra} PDB: 3ca0_A* 3ca1_A* 3ca3_A* 3ca4_A* 3ca5_A* 3ca6_A* 3cah_A* 1hwm_B* 1hwn_B* 1hwo_B* 1hwp_B* Back     alignment and structure
>2vlc_A Cinnamomin, type 2 ribosome-inactivating protein cinnamomin III; ribosome inhibiting proteins, toxin, hydrolase, plant defense; HET: AS5 BMA XYP XYS; 2.95A {Cinnamomum camphora} Back     alignment and structure
>1vcl_A Hemolytic lectin CEL-III; hemolysis, hemagglutination, pore-forming, calcium, toxin; HET: BTB; 1.70A {Cucumaria echinata} SCOP: b.42.2.1 b.42.2.1 d.281.1.1 PDB: 2z48_A* 2z49_A* Back     alignment and structure
>2aai_B Ricin (B chain); glycosidase; HET: GAL BGC NDG NAG BMA MAN; 2.50A {Ricinus communis} SCOP: b.42.2.1 b.42.2.1 PDB: 3rti_B* 3rtj_B* 1rzo_B* Back     alignment and structure
>3c9z_A Agglutinin II, SNA-II; beta-trefoil, ricin-B domain, glycosylation, glycoprotein, L sugar binding protein, plant protein; HET: NAG FUC SO4; 1.35A {Sambucus nigra} PDB: 3ca0_A* 3ca1_A* 3ca3_A* 3ca4_A* 3ca5_A* 3ca6_A* 3cah_A* 1hwm_B* 1hwn_B* 1hwo_B* 1hwp_B* Back     alignment and structure
>1knl_A Endo-1,4-beta-xylanase A; carbohydrate binding module, CBM13 xylan binding domain, LEC ricin B-like, beta-trefoil fold, hydrolase; 1.20A {Streptomyces lividans} SCOP: b.42.2.1 PDB: 1knm_A* 1mc9_A* Back     alignment and structure
>1m2t_B Mistletoe lectin I B chain; ribosome inactivation, ribosome inhibitor, hydrolase; HET: NAG FUC ADE; 1.89A {Viscum album} SCOP: b.42.2.1 b.42.2.1 PDB: 2r9k_B* 3d7w_B* 3cef_B* 1sz6_B* 1pum_B* 1puu_B* 2rg9_B* 3o5w_B* 1onk_B* 1oql_B* 1tfm_B* 1pc8_B* 1yf8_B* 1ce7_B* 2mll_B* Back     alignment and structure
>1vcl_A Hemolytic lectin CEL-III; hemolysis, hemagglutination, pore-forming, calcium, toxin; HET: BTB; 1.70A {Cucumaria echinata} SCOP: b.42.2.1 b.42.2.1 d.281.1.1 PDB: 2z48_A* 2z49_A* Back     alignment and structure
>3a07_A Actinohivin; carbohydrate-binding module family 13, antiviral protein, LE; 1.19A {Actinomycete} Back     alignment and structure
>2dry_A 29-kDa galactose-binding lectin; earthworm lumbricus terrestris, sialic acid, in vitro evolution, beta-trefoil fold, sugar binding protein; HET: PGE; 1.80A {Lumbricus terrestris} PDB: 2drz_A* 2ds0_A* 2zqn_A* 2zqo_A* Back     alignment and structure
>1abr_B Abrin-A; glycosidase/carbohydrate complex, lectin, lectin (agglutinin), glycoprotein, plant SEED protein, plant toxin; HET: NDG NGZ MAN BMA BGC; 2.14A {Abrus precatorius} SCOP: b.42.2.1 b.42.2.1 PDB: 2zr1_B* 2q3n_B* Back     alignment and structure
>3pg0_A Threefoil; symmetric design, beta-trefoil, engineered module, sugar BIN NOVO protein; HET: BTB GOL; 1.62A {Artificial gene} Back     alignment and structure
>1ggp_B TKL-1, protein (lectin 1 B chain); sugar binding protein; 2.70A {Trichosanthes kirilowii} SCOP: b.42.2.1 b.42.2.1 Back     alignment and structure
>2d1z_A Endo-1,4-beta-D-xylanase; TIM-barrel, retaining enzyme, catalytic-site mutant, chemica hydrolase; 1.60A {Streptomyces olivaceoviridis} PDB: 2d20_A* 2d22_A 2d23_A 2d24_A* 1xyf_A 1isw_A* 1isx_A* 1isy_A* 1isv_A* 1it0_A* 1v6u_A* 1v6v_A* 1v6w_A* 1v6x_A* 1isz_A Back     alignment and structure
>2vlc_A Cinnamomin, type 2 ribosome-inactivating protein cinnamomin III; ribosome inhibiting proteins, toxin, hydrolase, plant defense; HET: AS5 BMA XYP XYS; 2.95A {Cinnamomum camphora} Back     alignment and structure
>3phz_A Ricin B-related lectin; beta trefoil, saccharide binding lectin, 2,6-sialyl-lactosam sugar binding protein; HET: NAG GAL SIA; 1.70A {Polyporus squamosus} Back     alignment and structure
>3vsf_A Ricin B lectin; GH43 CBM13, EXO-beta-1,3-galactanase, sugar binding protein; 2.76A {Clostridium thermocellum} PDB: 3vsz_A* 3vt0_A* 3vt1_B* 3vt2_A* Back     alignment and structure
>3aj6_A Main hemagglutinin component; toxin, beta-trefoil; HET: NGA; 1.48A {Clostridium botulinum} PDB: 1qxm_A 3aj5_A* 2ehm_A* 2ehi_A* 2ehn_A* 3ah1_A* 3ah2_A* 3ah4_A* 2e4m_A Back     alignment and structure
>3ef2_A Agglutinin, lectin; beta-trefoil, calcium-binding, carbohydrate-binding, sugar-binding, sugar binding protein; HET: FUC GAL GLA; 1.80A {Marasmius oreades} PDB: 2iho_A* Back     alignment and structure
>2dry_A 29-kDa galactose-binding lectin; earthworm lumbricus terrestris, sialic acid, in vitro evolution, beta-trefoil fold, sugar binding protein; HET: PGE; 1.80A {Lumbricus terrestris} PDB: 2drz_A* 2ds0_A* 2zqn_A* 2zqo_A* Back     alignment and structure
>1knl_A Endo-1,4-beta-xylanase A; carbohydrate binding module, CBM13 xylan binding domain, LEC ricin B-like, beta-trefoil fold, hydrolase; 1.20A {Streptomyces lividans} SCOP: b.42.2.1 PDB: 1knm_A* 1mc9_A* Back     alignment and structure
>3nbc_A Ricin B-like lectin; lactose, sugar BIND protein; HET: LAT; 1.01A {Clitocybe nebularis} PDB: 3nbd_A* 3nbe_A* Back     alignment and structure
>1ybi_A HA33A, HA33/A, non-toxin haemagglutinin HA34; beta-trefoil; 1.50A {Clostridium botulinum} Back     alignment and structure
>2vse_A MTX holotoxin, mosquitocidal toxin; ADP-ribosyltransferase, lectin, ricin-B-like domain; 2.5A {Lysinibacillus sphaericus} PDB: 2vsa_A Back     alignment and structure
>2vse_A MTX holotoxin, mosquitocidal toxin; ADP-ribosyltransferase, lectin, ricin-B-like domain; 2.5A {Lysinibacillus sphaericus} PDB: 2vsa_A Back     alignment and structure
>3a07_A Actinohivin; carbohydrate-binding module family 13, antiviral protein, LE; 1.19A {Actinomycete} Back     alignment and structure
>2ffu_A Ppgalnact-2, polypeptide N-acetylgalactosaminyltransferase 2, protein-UDP; ppgalnact, mucin, glycosyltransferase; HET: UDP; 1.64A {Homo sapiens} PDB: 2ffv_A* Back     alignment and structure
>3aj6_A Main hemagglutinin component; toxin, beta-trefoil; HET: NGA; 1.48A {Clostridium botulinum} PDB: 1qxm_A 3aj5_A* 2ehm_A* 2ehi_A* 2ehn_A* 3ah1_A* 3ah2_A* 3ah4_A* 2e4m_A Back     alignment and structure
>3pg0_A Threefoil; symmetric design, beta-trefoil, engineered module, sugar BIN NOVO protein; HET: BTB GOL; 1.62A {Artificial gene} Back     alignment and structure
>1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 Back     alignment and structure
>2d7i_A Polypeptide N-acetylgalactosaminyltransferase 10; beta trefoil, rossmann fold; HET: NAG NGA UDP; 2.50A {Homo sapiens} PDB: 2d7r_A* Back     alignment and structure
>2d1z_A Endo-1,4-beta-D-xylanase; TIM-barrel, retaining enzyme, catalytic-site mutant, chemica hydrolase; 1.60A {Streptomyces olivaceoviridis} PDB: 2d20_A* 2d22_A 2d23_A 2d24_A* 1xyf_A 1isw_A* 1isx_A* 1isy_A* 1isv_A* 1it0_A* 1v6u_A* 1v6v_A* 1v6w_A* 1v6x_A* 1isz_A Back     alignment and structure
>1ybi_A HA33A, HA33/A, non-toxin haemagglutinin HA34; beta-trefoil; 1.50A {Clostridium botulinum} Back     alignment and structure
>3a21_A Putative secreted alpha-galactosidase; beta-alpha-barrel, greek KEY motif, beta-jellyroll, beta-TRE hydrolase; HET: GOL 1PG EPE; 1.51A {Streptomyces avermitilis} PDB: 3a22_A* 3a23_A* Back     alignment and structure
>3vsf_A Ricin B lectin; GH43 CBM13, EXO-beta-1,3-galactanase, sugar binding protein; 2.76A {Clostridium thermocellum} PDB: 3vsz_A* 3vt0_A* 3vt1_B* 3vt2_A* Back     alignment and structure
>3phz_A Ricin B-related lectin; beta trefoil, saccharide binding lectin, 2,6-sialyl-lactosam sugar binding protein; HET: NAG GAL SIA; 1.70A {Polyporus squamosus} Back     alignment and structure
>3ef2_A Agglutinin, lectin; beta-trefoil, calcium-binding, carbohydrate-binding, sugar-binding, sugar binding protein; HET: FUC GAL GLA; 1.80A {Marasmius oreades} PDB: 2iho_A* Back     alignment and structure
>3nbc_A Ricin B-like lectin; lactose, sugar BIND protein; HET: LAT; 1.01A {Clitocybe nebularis} PDB: 3nbd_A* 3nbe_A* Back     alignment and structure
>1xez_A Hemolysin, cytolysin; pore-forming toxin, Pro-toxin, water- soluble monomer, beta-prism, beta-trefoil; HET: BOG; 2.30A {Vibrio cholerae} PDB: 3o44_A Back     alignment and structure
>1dqg_A Mannose receptor; beta trefoil, multilectin receptor, pituitary hormones, sulfated carbohydrate, sugar binding protein; 1.70A {Mus musculus} SCOP: b.42.2.2 PDB: 1dqo_A* 1fwu_A* 1fwv_A* Back     alignment and structure
>1sr4_A CDT A, cytolethal distending toxin subunit A; bacterial, virulence, DNA damage, genotoxin, cytotoxins, cell cycle, apoptosis, lectin; 2.00A {Haemophilus ducreyi} SCOP: b.42.2.1 PDB: 2f2f_A Back     alignment and structure
>1xez_A Hemolysin, cytolysin; pore-forming toxin, Pro-toxin, water- soluble monomer, beta-prism, beta-trefoil; HET: BOG; 2.30A {Vibrio cholerae} PDB: 3o44_A Back     alignment and structure
>1dqg_A Mannose receptor; beta trefoil, multilectin receptor, pituitary hormones, sulfated carbohydrate, sugar binding protein; 1.70A {Mus musculus} SCOP: b.42.2.2 PDB: 1dqo_A* 1fwu_A* 1fwv_A* Back     alignment and structure
>1sr4_A CDT A, cytolethal distending toxin subunit A; bacterial, virulence, DNA damage, genotoxin, cytotoxins, cell cycle, apoptosis, lectin; 2.00A {Haemophilus ducreyi} SCOP: b.42.2.1 PDB: 2f2f_A Back     alignment and structure
>2x2s_A Agglutinin, agglutinin SSA; fungal lectin, beta-trefoil domain, cell adhesion; 1.60A {Sclerotinia sclerotiorum} PDB: 2x2t_A* Back     alignment and structure
>1sr4_C CDT C, cytolethal distending toxin protein C; bacterial, virulence, DNA damage, genotoxin, cytotoxins, cell cycle, apoptosis, lectin; 2.00A {Haemophilus ducreyi} SCOP: b.42.2.1 Back     alignment and structure
>2x2s_A Agglutinin, agglutinin SSA; fungal lectin, beta-trefoil domain, cell adhesion; 1.60A {Sclerotinia sclerotiorum} PDB: 2x2t_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 277
d1xhba1131 b.42.2.1 (A:423-553) Polypeptide N-acetylgalactosa 2e-17
d1xhba1131 b.42.2.1 (A:423-553) Polypeptide N-acetylgalactosa 5e-05
d1xhba2328 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosa 3e-12
d1vcla2133 b.42.2.1 (A:151-283) Hemolytic lectin CEL-III, dom 7e-10
d1vcla1150 b.42.2.1 (A:1-150) Hemolytic lectin CEL-III, domai 9e-08
d1knma_129 b.42.2.1 (A:) Xylan binding domain, CBM13 (Endo-1, 4e-06
d1ggpb1126 b.42.2.1 (B:11-140) Plant cytotoxin B-chain (lecti 5e-05
>d1xhba1 b.42.2.1 (A:423-553) Polypeptide N-acetylgalactosaminyltransferase 1, C-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 131 Back     information, alignment and structure

class: All beta proteins
fold: beta-Trefoil
superfamily: Ricin B-like lectins
family: Ricin B-like
domain: Polypeptide N-acetylgalactosaminyltransferase 1, C-terminal domain
species: Mouse (Mus musculus) [TaxId: 10090]
 Score = 74.5 bits (182), Expect = 2e-17
 Identities = 36/160 (22%), Positives = 58/160 (36%), Gaps = 32/160 (20%)

Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P      GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 1   QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 60

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +D  K    V MLK                                  +  W +      
Sbjct: 61  LDVSKLNGPVTMLK----------------------------CHHLKGNQLWEYDPVKLT 92

Query: 234 LEHAILKRCLSVHPETN--QLAMLRCDENNSYQQWRFKEV 271
           L+H    +CL    E +    ++  C   +  QQW  + V
Sbjct: 93  LQHVNSNQCLDKATEEDSQVPSIRDCT-GSRSQQWLLRNV 131


>d1xhba1 b.42.2.1 (A:423-553) Polypeptide N-acetylgalactosaminyltransferase 1, C-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 131 Back     information, alignment and structure
>d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 328 Back     information, alignment and structure
>d1vcla2 b.42.2.1 (A:151-283) Hemolytic lectin CEL-III, domains 1 and 2 {Cucumaria echinata [TaxId: 40245]} Length = 133 Back     information, alignment and structure
>d1vcla1 b.42.2.1 (A:1-150) Hemolytic lectin CEL-III, domains 1 and 2 {Cucumaria echinata [TaxId: 40245]} Length = 150 Back     information, alignment and structure
>d1knma_ b.42.2.1 (A:) Xylan binding domain, CBM13 (Endo-1,4-beta-xylanase C-terminal domain) {Streptomyces lividans [TaxId: 1916]} Length = 129 Back     information, alignment and structure
>d1ggpb1 b.42.2.1 (B:11-140) Plant cytotoxin B-chain (lectin) {Mongolian snake-gourd (Trichosanthes kirilowii), Lectin 1 [TaxId: 3677]} Length = 126 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query277
d1xhba1131 Polypeptide N-acetylgalactosaminyltransferase 1, C 99.92
d1ggpb1126 Plant cytotoxin B-chain (lectin) {Mongolian snake- 99.86
d1vcla2133 Hemolytic lectin CEL-III, domains 1 and 2 {Cucumar 99.86
d1knma_129 Xylan binding domain, CBM13 (Endo-1,4-beta-xylanas 99.86
d1vcla1150 Hemolytic lectin CEL-III, domains 1 and 2 {Cucumar 99.82
d1m2tb2126 Plant cytotoxin B-chain (lectin) {European mistlet 99.8
d2aaib2127 Plant cytotoxin B-chain (lectin) {Castor bean (Ric 99.79
d1hwmb2131 Plant cytotoxin B-chain (lectin) {Sambucus ebulus, 99.78
d1xhba2328 Polypeptide N-acetylgalactosaminyltransferase 1, N 99.78
d1hwmb1133 Plant cytotoxin B-chain (lectin) {Sambucus ebulus, 99.78
d1ggpb2128 Plant cytotoxin B-chain (lectin) {Mongolian snake- 99.77
d1abrb1140 Plant cytotoxin B-chain (lectin) {Abrus precatoriu 99.77
d1abrb2127 Plant cytotoxin B-chain (lectin) {Abrus precatoriu 99.76
d1m2tb1136 Plant cytotoxin B-chain (lectin) {European mistlet 99.76
d1rzob1135 Plant cytotoxin B-chain (lectin) {Castor bean (Ric 99.76
d2zqna1130 29-kDa galactose-binding lectin {Lumbricus terrest 99.68
d1dqga_134 Mannose receptor {Mouse (Mus musculus) [TaxId: 100 99.57
d2ihoa1154 Agglutinin MOA, N-terminal domain {Fairy-ring mush 99.55
d1m2tb2126 Plant cytotoxin B-chain (lectin) {European mistlet 99.51
d1abrb2127 Plant cytotoxin B-chain (lectin) {Abrus precatoriu 99.47
d1knma_129 Xylan binding domain, CBM13 (Endo-1,4-beta-xylanas 99.45
d2aaib2127 Plant cytotoxin B-chain (lectin) {Castor bean (Ric 99.45
d1xhba1131 Polypeptide N-acetylgalactosaminyltransferase 1, C 99.39
d1hwmb2131 Plant cytotoxin B-chain (lectin) {Sambucus ebulus, 99.38
d1ggpb2128 Plant cytotoxin B-chain (lectin) {Mongolian snake- 99.36
d1vcla2133 Hemolytic lectin CEL-III, domains 1 and 2 {Cucumar 99.36
d1ggpb1126 Plant cytotoxin B-chain (lectin) {Mongolian snake- 99.35
d1qxma2138 Hemagglutinin component Ha1 {Clostridium botulinum 99.34
d1vcla1150 Hemolytic lectin CEL-III, domains 1 and 2 {Cucumar 99.29
d1qxma1145 Hemagglutinin component Ha1 {Clostridium botulinum 99.27
d1m2tb1136 Plant cytotoxin B-chain (lectin) {European mistlet 99.25
d1abrb1140 Plant cytotoxin B-chain (lectin) {Abrus precatoriu 99.25
d1hwmb1133 Plant cytotoxin B-chain (lectin) {Sambucus ebulus, 99.18
d2zqna1130 29-kDa galactose-binding lectin {Lumbricus terrest 99.12
d1rzob1135 Plant cytotoxin B-chain (lectin) {Castor bean (Ric 99.08
d1qxma2138 Hemagglutinin component Ha1 {Clostridium botulinum 99.06
d1dqga_134 Mannose receptor {Mouse (Mus musculus) [TaxId: 100 99.06
d2ihoa1154 Agglutinin MOA, N-terminal domain {Fairy-ring mush 99.02
d1sr4a_167 Cytolethal distending toxin subunit A {Haemophilus 99.0
d1qxma1145 Hemagglutinin component Ha1 {Clostridium botulinum 98.77
d1upsa2131 GlcNAc-alpha-1,4-Gal-releasing endo-beta-galactosi 98.72
d1sr4a_167 Cytolethal distending toxin subunit A {Haemophilus 98.6
d1upsa2131 GlcNAc-alpha-1,4-Gal-releasing endo-beta-galactosi 97.6
d2bo4a1381 Mannosylglycerate synthase, MGS {Rhodothermus mari 90.46
>d1xhba1 b.42.2.1 (A:423-553) Polypeptide N-acetylgalactosaminyltransferase 1, C-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: All beta proteins
fold: beta-Trefoil
superfamily: Ricin B-like lectins
family: Ricin B-like
domain: Polypeptide N-acetylgalactosaminyltransferase 1, C-terminal domain
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=99.92  E-value=1.4e-25  Score=176.87  Aligned_cols=124  Identities=27%  Similarity=0.592  Sum_probs=103.2

Q ss_pred             CCceeeEEecCccccccccCCCCCceEEEEecCCCCCcceEEECCCCccccCCcceeecCCchhhhhhcccceeeecccE
Q psy11641        119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGV  198 (277)
Q Consensus       119 ~~~~G~Irn~~s~~CLd~~~~~~g~~v~l~~C~~~~~~Q~W~~t~~g~ir~~~~CLd~~~~~~~~~~~~~p~~~~~~~~v  198 (277)
                      ..++|+|||.++++|||+.+...|.+|.+++|++...+|+|.|+.+++|++.++|||+...+               +.|
T Consensus         6 ~~~~G~Irn~~s~~CLd~~~~~~g~~v~l~~C~~~~~~Q~w~~t~~~~i~~~~~Cld~~~~~---------------~~v   70 (131)
T d1xhba1           6 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLN---------------GPV   70 (131)
T ss_dssp             EEEEECEEETTTCEEEECTTCCTTEECEEEECCSCCGGGCEEEETTSCEEETTEEEECCSTT---------------CCC
T ss_pred             cEEEEEEEECCCCccEecCCCCCCCeEEEEEcCCCCCceEEEEecCceEeeCCEeeeccCCc---------------ccE
Confidence            35789999999999999988777889999999987778999999999999999999987541               344


Q ss_pred             EEEeecCCCceeeeeccCCCCCCCCceEEeCCCCceeeCccCccceeecC--CCceEEecCCCCCCCCceEEeee
Q psy11641        199 KLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPE--TNQLAMLRCDENNSYQQWRFKEV  271 (277)
Q Consensus       199 ~l~~C~~~~~~~~~~C~~~~~~~~Q~W~~~~~~~~l~~~~s~~CLd~~~~--~~~v~l~~C~~~~~~Q~W~~~~~  271 (277)
                      +|..|           ++..+  +|+|.|+..++.|+|..+++|||+...  +..|.+.+|+++ ++|+|.|+++
T Consensus        71 ~l~~C-----------~~~~~--~Q~W~~~~~~~~i~~~~s~~CLd~~~~~~~~~l~~~~C~~~-~~Q~W~~~~v  131 (131)
T d1xhba1          71 TMLKC-----------HHLKG--NQLWEYDPVKLTLQHVNSNQCLDKATEEDSQVPSIRDCTGS-RSQQWLLRNV  131 (131)
T ss_dssp             EEEEC-----------CTTCG--GGCEEEETTTTEEEESSSCEEEESCCSSSTTSCEEEECCCC-GGGCEEECCC
T ss_pred             EEEEe-----------cCCCc--eEEEEEcCCCceEECCCCCceEeCcccCCCceEEEeCCCCC-CccEEEEEEC
Confidence            55555           44342  689999986789999999999998653  467999999985 7999999874



>d1ggpb1 b.42.2.1 (B:11-140) Plant cytotoxin B-chain (lectin) {Mongolian snake-gourd (Trichosanthes kirilowii), Lectin 1 [TaxId: 3677]} Back     information, alignment and structure
>d1vcla2 b.42.2.1 (A:151-283) Hemolytic lectin CEL-III, domains 1 and 2 {Cucumaria echinata [TaxId: 40245]} Back     information, alignment and structure
>d1knma_ b.42.2.1 (A:) Xylan binding domain, CBM13 (Endo-1,4-beta-xylanase C-terminal domain) {Streptomyces lividans [TaxId: 1916]} Back     information, alignment and structure
>d1vcla1 b.42.2.1 (A:1-150) Hemolytic lectin CEL-III, domains 1 and 2 {Cucumaria echinata [TaxId: 40245]} Back     information, alignment and structure
>d1m2tb2 b.42.2.1 (B:385-510) Plant cytotoxin B-chain (lectin) {European mistletoe (Viscum album) [TaxId: 3972]} Back     information, alignment and structure
>d2aaib2 b.42.2.1 (B:136-262) Plant cytotoxin B-chain (lectin) {Castor bean (Ricinus communis), Ricin [TaxId: 3988]} Back     information, alignment and structure
>d1hwmb2 b.42.2.1 (B:136-266) Plant cytotoxin B-chain (lectin) {Sambucus ebulus, ebulin [TaxId: 28503]} Back     information, alignment and structure
>d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1hwmb1 b.42.2.1 (B:3-135) Plant cytotoxin B-chain (lectin) {Sambucus ebulus, ebulin [TaxId: 28503]} Back     information, alignment and structure
>d1ggpb2 b.42.2.1 (B:141-267) Plant cytotoxin B-chain (lectin) {Mongolian snake-gourd (Trichosanthes kirilowii), Lectin 1 [TaxId: 3677]} Back     information, alignment and structure
>d1abrb1 b.42.2.1 (B:1-140) Plant cytotoxin B-chain (lectin) {Abrus precatorius [TaxId: 3816]} Back     information, alignment and structure
>d1abrb2 b.42.2.1 (B:141-267) Plant cytotoxin B-chain (lectin) {Abrus precatorius [TaxId: 3816]} Back     information, alignment and structure
>d1m2tb1 b.42.2.1 (B:249-384) Plant cytotoxin B-chain (lectin) {European mistletoe (Viscum album) [TaxId: 3972]} Back     information, alignment and structure
>d1rzob1 b.42.2.1 (B:2001-2135) Plant cytotoxin B-chain (lectin) {Castor bean (Ricinus communis), Ricin [TaxId: 3988]} Back     information, alignment and structure
>d2zqna1 b.42.2.1 (A:131-260) 29-kDa galactose-binding lectin {Lumbricus terrestris [TaxId: 6398]} Back     information, alignment and structure
>d1dqga_ b.42.2.2 (A:) Mannose receptor {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2ihoa1 b.42.2.1 (A:2-155) Agglutinin MOA, N-terminal domain {Fairy-ring mushroom (Marasmius oreades) [TaxId: 181124]} Back     information, alignment and structure
>d1m2tb2 b.42.2.1 (B:385-510) Plant cytotoxin B-chain (lectin) {European mistletoe (Viscum album) [TaxId: 3972]} Back     information, alignment and structure
>d1abrb2 b.42.2.1 (B:141-267) Plant cytotoxin B-chain (lectin) {Abrus precatorius [TaxId: 3816]} Back     information, alignment and structure
>d1knma_ b.42.2.1 (A:) Xylan binding domain, CBM13 (Endo-1,4-beta-xylanase C-terminal domain) {Streptomyces lividans [TaxId: 1916]} Back     information, alignment and structure
>d2aaib2 b.42.2.1 (B:136-262) Plant cytotoxin B-chain (lectin) {Castor bean (Ricinus communis), Ricin [TaxId: 3988]} Back     information, alignment and structure
>d1xhba1 b.42.2.1 (A:423-553) Polypeptide N-acetylgalactosaminyltransferase 1, C-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1hwmb2 b.42.2.1 (B:136-266) Plant cytotoxin B-chain (lectin) {Sambucus ebulus, ebulin [TaxId: 28503]} Back     information, alignment and structure
>d1ggpb2 b.42.2.1 (B:141-267) Plant cytotoxin B-chain (lectin) {Mongolian snake-gourd (Trichosanthes kirilowii), Lectin 1 [TaxId: 3677]} Back     information, alignment and structure
>d1vcla2 b.42.2.1 (A:151-283) Hemolytic lectin CEL-III, domains 1 and 2 {Cucumaria echinata [TaxId: 40245]} Back     information, alignment and structure
>d1ggpb1 b.42.2.1 (B:11-140) Plant cytotoxin B-chain (lectin) {Mongolian snake-gourd (Trichosanthes kirilowii), Lectin 1 [TaxId: 3677]} Back     information, alignment and structure
>d1qxma2 b.42.2.1 (A:149-286) Hemagglutinin component Ha1 {Clostridium botulinum D phage [TaxId: 29342]} Back     information, alignment and structure
>d1vcla1 b.42.2.1 (A:1-150) Hemolytic lectin CEL-III, domains 1 and 2 {Cucumaria echinata [TaxId: 40245]} Back     information, alignment and structure
>d1qxma1 b.42.2.1 (A:4-148) Hemagglutinin component Ha1 {Clostridium botulinum D phage [TaxId: 29342]} Back     information, alignment and structure
>d1m2tb1 b.42.2.1 (B:249-384) Plant cytotoxin B-chain (lectin) {European mistletoe (Viscum album) [TaxId: 3972]} Back     information, alignment and structure
>d1abrb1 b.42.2.1 (B:1-140) Plant cytotoxin B-chain (lectin) {Abrus precatorius [TaxId: 3816]} Back     information, alignment and structure
>d1hwmb1 b.42.2.1 (B:3-135) Plant cytotoxin B-chain (lectin) {Sambucus ebulus, ebulin [TaxId: 28503]} Back     information, alignment and structure
>d2zqna1 b.42.2.1 (A:131-260) 29-kDa galactose-binding lectin {Lumbricus terrestris [TaxId: 6398]} Back     information, alignment and structure
>d1rzob1 b.42.2.1 (B:2001-2135) Plant cytotoxin B-chain (lectin) {Castor bean (Ricinus communis), Ricin [TaxId: 3988]} Back     information, alignment and structure
>d1qxma2 b.42.2.1 (A:149-286) Hemagglutinin component Ha1 {Clostridium botulinum D phage [TaxId: 29342]} Back     information, alignment and structure
>d1dqga_ b.42.2.2 (A:) Mannose receptor {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2ihoa1 b.42.2.1 (A:2-155) Agglutinin MOA, N-terminal domain {Fairy-ring mushroom (Marasmius oreades) [TaxId: 181124]} Back     information, alignment and structure
>d1sr4a_ b.42.2.1 (A:) Cytolethal distending toxin subunit A {Haemophilus ducreyi [TaxId: 730]} Back     information, alignment and structure
>d1qxma1 b.42.2.1 (A:4-148) Hemagglutinin component Ha1 {Clostridium botulinum D phage [TaxId: 29342]} Back     information, alignment and structure
>d1upsa2 b.42.2.3 (A:290-420) GlcNAc-alpha-1,4-Gal-releasing endo-beta-galactosidase, GngC, C-terminal domain {Clostridium perfringens [TaxId: 1502]} Back     information, alignment and structure
>d1sr4a_ b.42.2.1 (A:) Cytolethal distending toxin subunit A {Haemophilus ducreyi [TaxId: 730]} Back     information, alignment and structure
>d1upsa2 b.42.2.3 (A:290-420) GlcNAc-alpha-1,4-Gal-releasing endo-beta-galactosidase, GngC, C-terminal domain {Clostridium perfringens [TaxId: 1502]} Back     information, alignment and structure
>d2bo4a1 c.68.1.18 (A:2-382) Mannosylglycerate synthase, MGS {Rhodothermus marinus [TaxId: 29549]} Back     information, alignment and structure