Psyllid ID: psy3378


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------24
MADGGCSSVHGRQNLFPSRFYIYFLRRRFVVLKLILLVCLIWLTVAALLFMEDRNRSGVVIESLDNLNNVDRQSLKNVDDNDNLIQPVAQAQIAETPSAETVDKNADKMVLHAPEKGEFDEDKNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMRKYL
cccccccccccccccccccccEEEEHHHHHHHHHHHHHHHHHHHHHHHHEEEccccccccccccccccHHHHHHccccccccccccHHHHHHHccccccccccccccccccccccccccccccccccccccccEEEccccccHHHHHHHHHHHHHccHHHHHHccccccccccccccccccccccccccccccEEEEEEEccHHHHHHHHHHHHHccccHHHHHEEEEEEccccccccc
ccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHcHcHcccHHHHccccccccccccccHccccccccccccccccccccccccccccccccccccccccccEEEccccccHHHHHHHHHHHHHHcHHHHHHHHccccccccccccHHHcccccccccccccEEEEEEccHHHHHHHHHHHHHHcccHHHHHHHEEEEcccccccccc
madggcssvhgrqnlfpsrFYIYFLRRRFVVLKLILLVCLIWLTVAALLFMedrnrsgvviesldnlnnvdrqslknvddndnliqPVAQAqiaetpsaetvdknADKMVlhapekgefdedknkmqygewgkpvilpqnlsSDIKKLVEEGWQRNAFNQYVSDLISVkrklpdprdewckvpgrylknlpatsVIICFHNEAWSVLLRTVHSvldrspahlLKEIILVDDYSDMRKYL
madggcssvhgrqnlfpsRFYIYFLRRRFVVLKLILLVCLIWLTVAALLFMEDRNRSGVViesldnlnnvdrqslknvddndnLIQPVAQaqiaetpsaetvdknADKMVLHapekgefdedkNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISvkrklpdprdewCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVldrspahllkeiilvddysdMRKYL
MADGGCSSVHGRQNLFPSRFYIYFLRRRFvvlklillvcliwltvAALLFMEDRNRSGVVIESLDNLNNVDRQSLKNVDDNDNLIQPVAQAQIAETPSAETVDKNADKMVLHAPEKGEFDEDKNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMRKYL
************QNLFPSRFYIYFLRRRFVVLKLILLVCLIWLTVAALLFMEDRNRSGVVIESLDNLN***********************************************************YGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDY*******
********VHGRQNLFPSRFYIYFLRRRFVVLKLILLVCLIWLTVAALLFMED*********************************************************************************VIL***************WQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSD****L
********VHGRQNLFPSRFYIYFLRRRFVVLKLILLVCLIWLTVAALLFMEDRNRSGVVIESLDNLNNVDRQSLKNVDDNDNLIQPVAQAQIAETPSAETVDKNADKMVLHAPEKGEFDEDKNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMRKYL
*****CSSVHGRQNLFPSRFYIYFLRRRFVVLKLILLVCLIWLTVAALLFMEDRNRS***IES*DN********************************************LHAPEKGEFDEDKNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDY*******
iiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MADGGCSSVHGRQNLFPSRFYIYFLRRRFVVLKLILLVCLIWLTVAALLFMEDRNRSGVVIESLDNLNNVDRQSLKNVDDNDNLIQPVAQAQIAETPSAETVDKNADKMVLHAPEKGEFDEDKNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMRKYL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query239 2.2.26 [Sep-21-2011]
Q8MRC9 650 Putative polypeptide N-ac yes N/A 0.878 0.323 0.425 3e-44
Q95ZJ1 626 Polypeptide N-acetylgalac yes N/A 0.861 0.329 0.399 1e-30
P34678 612 Polypeptide N-acetylgalac no N/A 0.828 0.323 0.366 4e-24
Q07537 559 Polypeptide N-acetylgalac yes N/A 0.661 0.282 0.341 1e-23
Q29121 559 Polypeptide N-acetylgalac no N/A 0.661 0.282 0.341 1e-23
O08912 559 Polypeptide N-acetylgalac yes N/A 0.422 0.180 0.518 2e-23
Q10472 559 Polypeptide N-acetylgalac yes N/A 0.661 0.282 0.336 2e-23
Q6P9S7 603 Polypeptide N-acetylgalac no N/A 0.443 0.175 0.522 6e-23
Q6WV17 630 Polypeptide N-acetylgalac no N/A 0.439 0.166 0.527 9e-23
Q10473 559 Polypeptide N-acetylgalac yes N/A 0.422 0.180 0.509 1e-22
>sp|Q8MRC9|GALT9_DROME Putative polypeptide N-acetylgalactosaminyltransferase 9 OS=Drosophila melanogaster GN=pgant9 PE=2 SV=2 Back     alignment and function desciption
 Score =  178 bits (452), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 105/247 (42%), Positives = 133/247 (53%), Gaps = 37/247 (14%)

Query: 26  RRRFVVLKLILLVCLIWLTVAALLFMEDRNRSGVVIESLDNLNNVDRQ------------ 73
           RR   ++KL+     IW  +A L++ +D  R      +  +                   
Sbjct: 7   RRSTTIVKLVAFALAIWFCIAFLVYTDDTRRRAAQEGAGASGAGSAPGVGGGAGGLGDPI 66

Query: 74  --SLKNVDDNDNLI------------QPVAQAQIAETPSAETVDKNADKMVLHAPE--KG 117
             +L+N    ++              Q   +A I  T      D  A++M   A E  K 
Sbjct: 67  ALALRNEPAGEDFGINGNVIGGGGQKQAHDEADIPPTVGKHKADLQAERMRKKAAEQPKK 126

Query: 118 EFDEDKNKM---------QYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISV 168
           +  ED  K+           GE GKPV LP+ +S ++KK V++GW +NAFNQYVSDLISV
Sbjct: 127 KPQEDSKKVIDPPANFEENPGELGKPVRLPKEMSDEMKKAVDDGWTKNAFNQYVSDLISV 186

Query: 169 KRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIIL 228
            R LPDPRD WCK   RYL NLP T VIICFHNEAW+VLLRTVHSVLDRSP HL+ +IIL
Sbjct: 187 HRTLPDPRDAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIIL 246

Query: 229 VDDYSDM 235
           VDDYSDM
Sbjct: 247 VDDYSDM 253




May catalyze the initial reaction in O-linked oligosaccharide biosynthesis, the transfer of an N-acetyl-D-galactosamine residue to a serine or threonine residue on the protein receptor.
Drosophila melanogaster (taxid: 7227)
EC: 2EC: .EC: 4EC: .EC: 1EC: .EC: 4EC: 1
>sp|Q95ZJ1|GALT5_CAEEL Polypeptide N-acetylgalactosaminyltransferase 5 OS=Caenorhabditis elegans GN=gly-5 PE=2 SV=2 Back     alignment and function description
>sp|P34678|GALT3_CAEEL Polypeptide N-acetylgalactosaminyltransferase 3 OS=Caenorhabditis elegans GN=gly-3 PE=2 SV=2 Back     alignment and function description
>sp|Q07537|GALT1_BOVIN Polypeptide N-acetylgalactosaminyltransferase 1 OS=Bos taurus GN=GALNT1 PE=1 SV=1 Back     alignment and function description
>sp|Q29121|GALT1_PIG Polypeptide N-acetylgalactosaminyltransferase 1 OS=Sus scrofa GN=GALNT1 PE=2 SV=1 Back     alignment and function description
>sp|O08912|GALT1_MOUSE Polypeptide N-acetylgalactosaminyltransferase 1 OS=Mus musculus GN=Galnt1 PE=1 SV=1 Back     alignment and function description
>sp|Q10472|GALT1_HUMAN Polypeptide N-acetylgalactosaminyltransferase 1 OS=Homo sapiens GN=GALNT1 PE=1 SV=1 Back     alignment and function description
>sp|Q6P9S7|GLT10_MOUSE Polypeptide N-acetylgalactosaminyltransferase 10 OS=Mus musculus GN=Galnt10 PE=2 SV=1 Back     alignment and function description
>sp|Q6WV17|GALT5_DROME Polypeptide N-acetylgalactosaminyltransferase 5 OS=Drosophila melanogaster GN=pgant5 PE=2 SV=2 Back     alignment and function description
>sp|Q10473|GALT1_RAT Polypeptide N-acetylgalactosaminyltransferase 1 OS=Rattus norvegicus GN=Galnt1 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query239
328713087 674 PREDICTED: putative polypeptide N-acetyl 0.887 0.314 0.493 1e-56
91089275 586 PREDICTED: similar to n-acetylgalactosam 0.769 0.313 0.533 1e-52
270011456 621 hypothetical protein TcasGA2_TC005479 [T 0.769 0.296 0.533 1e-52
350426661 602 PREDICTED: putative polypeptide N-acetyl 0.832 0.330 0.523 2e-51
350426664 637 PREDICTED: putative polypeptide N-acetyl 0.832 0.312 0.523 2e-51
340723540 602 PREDICTED: putative polypeptide N-acetyl 0.832 0.330 0.518 3e-51
340723544 637 PREDICTED: putative polypeptide N-acetyl 0.832 0.312 0.518 3e-51
345484988 646 PREDICTED: putative polypeptide N-acetyl 0.870 0.321 0.518 5e-51
345484986 610 PREDICTED: putative polypeptide N-acetyl 0.870 0.340 0.518 6e-51
383857913 638 PREDICTED: putative polypeptide N-acetyl 0.836 0.313 0.527 2e-50
>gi|328713087|ref|XP_001951943.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase 9-like isoform 1 [Acyrthosiphon pisum] Back     alignment and taxonomy information
 Score =  225 bits (573), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 115/233 (49%), Positives = 152/233 (65%), Gaps = 21/233 (9%)

Query: 23  YFLRRRFVVLKLILLVCLIWLTVAALLFMEDRN--------RSGVVIESLDNLNNVDRQS 74
           Y  RR+  +LK ++L C +WLTV  LLFMEDR          +    +    +    R+ 
Sbjct: 6   YLYRRKSALLKGLVLGCTVWLTVTVLLFMEDRRITPASNDPSAAGGYQQQQPIAAAQRRL 65

Query: 75  LKNVDDNDNLI------------QPVAQAQIAETPSAETVDKNADKMVLHAPEKGEFDED 122
            +++D ++ L+            QP A   + +   A  + +N + ++      G  DE+
Sbjct: 66  PEDLDADNRLLDDEFPLEGGVEQQPAAPVIVQQQAGAAGL-QNGEGVLPQPNRGGVNDEE 124

Query: 123 KNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKV 182
           + +  YGE GKPV+LP NL++D+KKLV+EGW+ NAFNQY SDLIS+ R LPDPRDEWCK 
Sbjct: 125 RGRKVYGEMGKPVVLPANLTADVKKLVDEGWKNNAFNQYASDLISLHRTLPDPRDEWCKK 184

Query: 183 PGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDM 235
           PGRYL NLP TSVI+CFHNEAWSVLLRTVHS+LDRSP HL++EIILVDD+SDM
Sbjct: 185 PGRYLDNLPQTSVIVCFHNEAWSVLLRTVHSILDRSPEHLIREIILVDDFSDM 237




Source: Acyrthosiphon pisum

Species: Acyrthosiphon pisum

Genus: Acyrthosiphon

Family: Aphididae

Order: Hemiptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|91089275|ref|XP_970398.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium castaneum] Back     alignment and taxonomy information
>gi|270011456|gb|EFA07904.1| hypothetical protein TcasGA2_TC005479 [Tribolium castaneum] Back     alignment and taxonomy information
>gi|350426661|ref|XP_003494505.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase 9-like isoform 1 [Bombus impatiens] Back     alignment and taxonomy information
>gi|350426664|ref|XP_003494506.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase 9-like isoform 2 [Bombus impatiens] Back     alignment and taxonomy information
>gi|340723540|ref|XP_003400147.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase 9-like isoform 1 [Bombus terrestris] gi|340723542|ref|XP_003400148.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase 9-like isoform 2 [Bombus terrestris] Back     alignment and taxonomy information
>gi|340723544|ref|XP_003400149.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase 9-like isoform 3 [Bombus terrestris] Back     alignment and taxonomy information
>gi|345484988|ref|XP_001605337.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase 9-like isoform 1 [Nasonia vitripennis] Back     alignment and taxonomy information
>gi|345484986|ref|XP_003425168.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase 9-like isoform 2 [Nasonia vitripennis] Back     alignment and taxonomy information
>gi|383857913|ref|XP_003704448.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase 9-like [Megachile rotundata] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query239
FB|FBgn0050463 650 CG30463 [Drosophila melanogast 0.610 0.224 0.585 2.8e-41
WB|WBGene00001630 626 gly-5 [Caenorhabditis elegans 0.451 0.172 0.609 4.2e-29
UNIPROTKB|K7EJV8 387 GALNT1 "Polypeptide N-acetylga 0.439 0.271 0.531 2e-24
UNIPROTKB|Q5ZJL1 559 GALNT1 "Uncharacterized protei 0.439 0.187 0.522 6.6e-24
UNIPROTKB|Q07537 559 GALNT1 "Polypeptide N-acetylga 0.439 0.187 0.540 6.6e-24
UNIPROTKB|E2RM54 559 GALNT1 "Uncharacterized protei 0.439 0.187 0.540 6.6e-24
MGI|MGI:1890480 603 Galnt10 "UDP-N-acetyl-alpha-D- 0.602 0.238 0.454 8.2e-24
UNIPROTKB|Q10472 559 GALNT1 "Polypeptide N-acetylga 0.439 0.187 0.531 1.1e-23
UNIPROTKB|F1SAI6 559 LOC100738545 "Uncharacterized 0.439 0.187 0.540 1.1e-23
UNIPROTKB|Q29121 559 GALNT1 "Polypeptide N-acetylga 0.439 0.187 0.540 1.1e-23
FB|FBgn0050463 CG30463 [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 440 (159.9 bits), Expect = 2.8e-41, P = 2.8e-41
 Identities = 92/157 (58%), Positives = 107/157 (68%)

Query:    90 QAQIAETPSAETVDKNADKMVLHAPE--KGEFDEDKNKM---------QYGEWGKPVILP 138
             +A I  T      D  A++M   A E  K +  ED  K+           GE GKPV LP
Sbjct:    97 EADIPPTVGKHKADLQAERMRKKAAEQPKKKPQEDSKKVIDPPANFEENPGELGKPVRLP 156

Query:   139 QNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIIC 198
             + +S ++KK V++GW +NAFNQYVSDLISV R LPDPRD WCK   RYL NLP T VIIC
Sbjct:   157 KEMSDEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPRDAWCKDEARYLTNLPKTDVIIC 216

Query:   199 FHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDM 235
             FHNEAW+VLLRTVHSVLDRSP HL+ +IILVDDYSDM
Sbjct:   217 FHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDM 253




GO:0004653 "polypeptide N-acetylgalactosaminyltransferase activity" evidence=ISS
WB|WBGene00001630 gly-5 [Caenorhabditis elegans (taxid:6239)] Back     alignment and assigned GO terms
UNIPROTKB|K7EJV8 GALNT1 "Polypeptide N-acetylgalactosaminyltransferase 1 soluble form" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|Q5ZJL1 GALNT1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|Q07537 GALNT1 "Polypeptide N-acetylgalactosaminyltransferase 1" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|E2RM54 GALNT1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
MGI|MGI:1890480 Galnt10 "UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 10" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|Q10472 GALNT1 "Polypeptide N-acetylgalactosaminyltransferase 1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|F1SAI6 LOC100738545 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms
UNIPROTKB|Q29121 GALNT1 "Polypeptide N-acetylgalactosaminyltransferase 1" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q10473GALT1_RAT2, ., 4, ., 1, ., 4, 10.50940.42250.1806yesN/A
O08912GALT1_MOUSE2, ., 4, ., 1, ., 4, 10.51880.42250.1806yesN/A

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query239
cd02510 299 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the fo 3e-24
pfam00535 168 pfam00535, Glycos_transf_2, Glycosyl transferase f 2e-05
>gnl|CDD|133004 cd02510, pp-GalNAc-T, pp-GalNAc-T initiates the formation of mucin-type O-linked glycans Back     alignment and domain information
 Score = 97.3 bits (243), Expect = 3e-24
 Identities = 32/41 (78%), Positives = 36/41 (87%)

Query: 194 SVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSD 234
           SVII FHNEA S LLRTVHSV++R+P  LLKEIILVDD+SD
Sbjct: 1   SVIIIFHNEALSTLLRTVHSVINRTPPELLKEIILVDDFSD 41


UDP-GalNAc: polypeptide alpha-N-acetylgalactosaminyltransferases (pp-GalNAc-T) initiate the formation of mucin-type, O-linked glycans by catalyzing the transfer of alpha-N-acetylgalactosamine (GalNAc) from UDP-GalNAc to hydroxyl groups of Ser or Thr residues of core proteins to form the Tn antigen (GalNAc-a-1-O-Ser/Thr). These enzymes are type II membrane proteins with a GT-A type catalytic domain and a lectin domain located on the lumen side of the Golgi apparatus. In human, there are 15 isozymes of pp-GalNAc-Ts, representing the largest of all glycosyltransferase families. Each isozyme has unique but partially redundant substrate specificity for glycosylation sites on acceptor proteins. Length = 299

>gnl|CDD|215980 pfam00535, Glycos_transf_2, Glycosyl transferase family 2 Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 239
KOG3737|consensus 603 100.0
KOG3736|consensus 578 99.97
KOG3738|consensus 559 99.96
TIGR03469 384 HonB hopene-associated glycosyltransferase HpnB. T 97.98
cd06439 251 CESA_like_1 CESA_like_1 is a member of the cellulo 97.82
TIGR03111 439 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR0 97.76
PRK11204 420 N-glycosyltransferase; Provisional 97.55
PRK14583 444 hmsR N-glycosyltransferase; Provisional 97.49
TIGR03472 373 HpnI hopanoid biosynthesis associated glycosyl tra 97.4
PRK13915 306 putative glucosyl-3-phosphoglycerate synthase; Pro 97.39
TIGR03030 713 CelA cellulose synthase catalytic subunit (UDP-for 97.12
COG1215 439 Glycosyltransferases, probably involved in cell wa 97.01
PRK11498 852 bcsA cellulose synthase catalytic subunit; Provisi 96.95
PTZ00260 333 dolichyl-phosphate beta-glucosyltransferase; Provi 96.92
PRK14716 504 bacteriophage N4 adsorption protein B; Provisional 94.68
PRK05454 691 glucosyltransferase MdoH; Provisional 92.09
PRK11234 727 nfrB bacteriophage N4 adsorption protein B; Provis 82.92
KOG3916|consensus 372 80.99
>KOG3737|consensus Back     alignment and domain information
Probab=100.00  E-value=7e-39  Score=303.79  Aligned_cols=127  Identities=43%  Similarity=0.684  Sum_probs=119.9

Q ss_pred             ccccccCCCCCCCccc---cCcCCCCCCCceeecCCCCCHHHHHHHHhcccccccchhhcccccCCCCCCCCCCccccCc
Q psy3378         107 DKMVLHAPEKGEFDED---KNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVP  183 (239)
Q Consensus       107 ~~~~~~~p~~g~~~~d---~~~~~pGE~G~~v~l~~~~~~e~k~~~e~~~~~~~FN~~~SD~Isl~RslpD~R~~~Ck~~  183 (239)
                      -.|++.+..+||||++   ...++|||.|+|+.+    ++|.+.+.+..-+++|||+++||+||++|+++|+|.++|+. 
T Consensus        73 ~~p~l~~G~LGNfEPKepe~P~~gPGE~gkp~~l----~pe~k~a~~AsekEfGfN~~~SDmISm~R~v~D~R~EECkh-  147 (603)
T KOG3737|consen   73 HDPVLRPGILGNFEPKEPEPPVGGPGEKGKPLVL----GPEFKQAIQASEKEFGFNMVASDMISMDRNVNDLRQEECKH-  147 (603)
T ss_pred             CCCcccccccCCCCCCCCCCCCCCCCcCCccccc----ChhHHHHHHHHHHhhCcceeehhhhhcccCccccCHhhccc-
Confidence            4578888899999998   367799999999977    67788888999999999999999999999999999999999 


Q ss_pred             cccCCCCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCCCC
Q psy3378         184 GRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMRKY  238 (239)
Q Consensus       184 ~~Y~~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~~L  238 (239)
                      ..|+.+||++||||+||||.||||+|||||||.|||.++|+|||||||+||++||
T Consensus       148 WdYpe~Lpt~SVviVFHNEGws~LmRTVHSVi~RsP~~~l~eivlvDDfSdKehL  202 (603)
T KOG3737|consen  148 WDYPENLPTSSVVIVFHNEGWSTLMRTVHSVIKRSPRKYLAEIVLVDDFSDKEHL  202 (603)
T ss_pred             cCCcccCCcceEEEEEecCccHHHHHHHHHHHhcCcHHhhheEEEeccCCccHHH
Confidence            9999999999999999999999999999999999999999999999999999987



>KOG3736|consensus Back     alignment and domain information
>KOG3738|consensus Back     alignment and domain information
>TIGR03469 HonB hopene-associated glycosyltransferase HpnB Back     alignment and domain information
>cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily Back     alignment and domain information
>TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111 Back     alignment and domain information
>PRK11204 N-glycosyltransferase; Provisional Back     alignment and domain information
>PRK14583 hmsR N-glycosyltransferase; Provisional Back     alignment and domain information
>TIGR03472 HpnI hopanoid biosynthesis associated glycosyl transferase protein HpnI Back     alignment and domain information
>PRK13915 putative glucosyl-3-phosphoglycerate synthase; Provisional Back     alignment and domain information
>TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming) Back     alignment and domain information
>COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane] Back     alignment and domain information
>PRK11498 bcsA cellulose synthase catalytic subunit; Provisional Back     alignment and domain information
>PTZ00260 dolichyl-phosphate beta-glucosyltransferase; Provisional Back     alignment and domain information
>PRK14716 bacteriophage N4 adsorption protein B; Provisional Back     alignment and domain information
>PRK05454 glucosyltransferase MdoH; Provisional Back     alignment and domain information
>PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional Back     alignment and domain information
>KOG3916|consensus Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query239
2d7i_A 570 Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac 2e-23
1xhb_A 472 The Crystal Structure Of Udp-Galnac: Polypeptide Al 4e-19
2ffu_A 501 Crystal Structure Of Human Ppgalnact-2 Complexed Wi 1e-18
>pdb|2D7I|A Chain A, Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac And Mn2+ Length = 570 Back     alignment and structure

Iteration: 1

Score = 105 bits (262), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 59/120 (49%), Positives = 78/120 (65%), Gaps = 7/120 (5%) Query: 115 EKGEFDEDKNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPD 174 +K D ++ GE G+P + +D ++ V++ ++ N FN YVSD IS+ R LPD Sbjct: 43 DKEAIRRDAQRVGNGEQGRPYPM-----TDAER-VDQAYRENGFNIYVSDKISLNRSLPD 96 Query: 175 PRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSD 234 R C RYL+ LP TS+II FHNE WS LLRTVHSVL+RSP L+ EI+LVDD+SD Sbjct: 97 IRHPNCNS-KRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSD 155
>pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N- Acetylgalactosaminyltransferase-T1 Length = 472 Back     alignment and structure
>pdb|2FFU|A Chain A, Crystal Structure Of Human Ppgalnact-2 Complexed With Udp And Ea2 Length = 501 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query239
2ffu_A 501 Ppgalnact-2, polypeptide N-acetylgalactosaminyltra 2e-40
2d7i_A 570 Polypeptide N-acetylgalactosaminyltransferase 10; 1e-36
1xhb_A 472 Polypeptide N-acetylgalactosaminyltransferase 1; g 2e-32
>2ffu_A Ppgalnact-2, polypeptide N-acetylgalactosaminyltransferase 2, protein-UDP; ppgalnact, mucin, glycosyltransferase; HET: UDP; 1.64A {Homo sapiens} PDB: 2ffv_A* Length = 501 Back     alignment and structure
 Score =  144 bits (363), Expect = 2e-40
 Identities = 49/114 (42%), Positives = 73/114 (64%), Gaps = 5/114 (4%)

Query: 122 DKNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCK 181
           D  K+++ ++ +   +       + +  ++ + RN FNQ  SD + + R +PD R + C+
Sbjct: 2   DALKVRWPDFNQEAYV----GGTMVRSGQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQ 57

Query: 182 VPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDM 235
              ++  +LPATSV+I FHNEA S LLRTV SVL +SP HL+KEIILVDDYS+ 
Sbjct: 58  RK-QWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSND 110


>2d7i_A Polypeptide N-acetylgalactosaminyltransferase 10; beta trefoil, rossmann fold; HET: NAG NGA UDP; 2.50A {Homo sapiens} PDB: 2d7r_A* Length = 570 Back     alignment and structure
>1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 Length = 472 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query239
2d7i_A 570 Polypeptide N-acetylgalactosaminyltransferase 10; 99.9
2ffu_A 501 Ppgalnact-2, polypeptide N-acetylgalactosaminyltra 99.84
1xhb_A 472 Polypeptide N-acetylgalactosaminyltransferase 1; g 99.77
2z86_A 625 Chondroitin synthase; GT-A, glycosyltransferase A, 98.7
3ckj_A 329 Putative uncharacterized protein; mycobacteria, un 97.87
2z86_A 625 Chondroitin synthase; GT-A, glycosyltransferase A, 97.71
3f1y_A 387 Mannosyl-3-phosphoglycerate synthase; GT-A type gl 97.63
2fy7_A 287 Beta-1,4-galactosyltransferase 1; M339H mutant, AP 97.44
4hg6_A 802 Cellulose synthase subunit A; membrane translocati 97.25
4fix_A 657 UDP-galactofuranosyl transferase GLFT2; CAZY GT-2 96.82
2wvl_A 391 Mannosyl-3-phosphoglycerate synthase; GT-A fold, t 96.62
2nxv_A 249 ATP synthase subunits region ORF 6; majastridin, A 96.06
2zu9_A 394 Mannosyl-3-phosphoglycerate synthase; GT-A fold, g 95.85
>2d7i_A Polypeptide N-acetylgalactosaminyltransferase 10; beta trefoil, rossmann fold; HET: NAG NGA UDP; 2.50A {Homo sapiens} PDB: 2d7r_A* Back     alignment and structure
Probab=99.90  E-value=4.3e-25  Score=210.92  Aligned_cols=111  Identities=51%  Similarity=0.848  Sum_probs=100.2

Q ss_pred             ccccCcCCCCCCCceeecCCCCCHHHHHHHHhcccccccchhhcccccCCCCCCCCCCccccCccccCCCCCceeEEEEe
Q psy3378         120 DEDKNKMQYGEWGKPVILPQNLSSDIKKLVEEGWQRNAFNQYVSDLISVKRKLPDPRDEWCKVPGRYLKNLPATSVIICF  199 (239)
Q Consensus       120 ~~d~~~~~pGE~G~~v~l~~~~~~e~k~~~e~~~~~~~FN~~~SD~Isl~RslpD~R~~~Ck~~~~Y~~~LP~~SVII~F  199 (239)
                      +.|..+.+|||+|+||.+++.    +  +.++.|++++||+++||+|+++|++||+|++.|+. ..|+.++|.+||||++
T Consensus        48 ~~~~~~~~~g~~g~~~~~~~~----~--~~~~~~~~~~~n~~~sd~i~l~R~~~d~r~~~~~~-~~~~~~~P~vSVIIp~  120 (570)
T 2d7i_A           48 RRDAQRVGNGEQGRPYPMTDA----E--RVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNS-KRYLETLPNTSIIIPF  120 (570)
T ss_dssp             HHHHTCCSTTGGGCCCCCCTT----T--TSSHHHHTTSSCHHHHHTSCTTCCCCCCSCGGGGT-CCEESSCCCEEEEEEE
T ss_pred             cccccCCCCCCCCCceECCHH----H--HHHHHHHHcCCCHHHHhccCCCccCCCCCCcccee-cccCCCCCCeEEEEEE
Confidence            344677899999999998531    2  23677999999999999999999999999999999 7999999999999999


Q ss_pred             ecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCCC
Q psy3378         200 HNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMRK  237 (239)
Q Consensus       200 hNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~~  237 (239)
                      |||+|++|.|+|+||+++|++.++.|||+|||+|+.+.
T Consensus       121 yNe~~~~L~~~L~Sll~qt~~~~~~EIIVVDDgS~D~t  158 (570)
T 2d7i_A          121 HNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH  158 (570)
T ss_dssp             SSCCHHHHHHHHHHHHHHSCGGGEEEEEEEECSCCCGG
T ss_pred             CCCCHHHHHHHHHHHHhcCCccCcEEEEEEECCCCcHH
Confidence            99999999999999999999998889999999998765



>2ffu_A Ppgalnact-2, polypeptide N-acetylgalactosaminyltransferase 2, protein-UDP; ppgalnact, mucin, glycosyltransferase; HET: UDP; 1.64A {Homo sapiens} PDB: 2ffv_A* Back     alignment and structure
>1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 Back     alignment and structure
>2z86_A Chondroitin synthase; GT-A, glycosyltransferase A, fold; HET: UGA UDP; 2.40A {Escherichia coli} PDB: 2z87_A* Back     alignment and structure
>3ckj_A Putative uncharacterized protein; mycobacteria, unknown function; HET: CIT; 1.80A {Mycobacterium paratuberculosis} PDB: 3ckn_A* 3cko_A* 3ckq_A* 3ckv_A* 3e26_A 3e25_A Back     alignment and structure
>2z86_A Chondroitin synthase; GT-A, glycosyltransferase A, fold; HET: UGA UDP; 2.40A {Escherichia coli} PDB: 2z87_A* Back     alignment and structure
>3f1y_A Mannosyl-3-phosphoglycerate synthase; GT-A type glycosyltransferase, GT-81, mannosyl-3-phosphoglyc synthase, GDP-mannose, transferas; 2.20A {Rubrobacter xylanophilus} PDB: 3kia_A* 3o3p_A* Back     alignment and structure
>2fy7_A Beta-1,4-galactosyltransferase 1; M339H mutant, APO enzyme; HET: PGE; 1.70A {Homo sapiens} PDB: 2aec_A* 2aes_A* 2ae7_A* 2ah9_A* 2agd_A* 2fya_A* 2fyb_A* 3ee5_A* 2fyc_B* 1tw1_A* 1tw5_A* 1tvy_A* 1nmm_B* 1o0r_A* 1yro_B* 1nf5_B* 1nhe_B* 1nkh_B* 1nqi_B* 1nwg_B* ... Back     alignment and structure
>4hg6_A Cellulose synthase subunit A; membrane translocation, cellulose synthesis, UDP-GLC binding membrane, transferase; HET: BGC UDP LDA; 3.25A {Rhodobacter sphaeroides} Back     alignment and structure
>4fix_A UDP-galactofuranosyl transferase GLFT2; CAZY GT-2 family, glycosyltrans carbohydrate binding, membrane; 2.45A {Mycobacterium tuberculosis} PDB: 4fiy_A* Back     alignment and structure
>2wvl_A Mannosyl-3-phosphoglycerate synthase; GT-A fold, transferase, glycosyltransferase, retaining mecha glucosyl transferase; HET: GDD; 2.81A {Thermus thermophilus} PDB: 2wvk_A* 2wvm_A* Back     alignment and structure
>2nxv_A ATP synthase subunits region ORF 6; majastridin, ATPase operon, glycosyl transferase, rossmann F sulphur SAD, transferase; 1.10A {Rhodobacter blasticus} PDB: 2qgi_A* Back     alignment and structure
>2zu9_A Mannosyl-3-phosphoglycerate synthase; GT-A fold, glycosyltransferase, GT55, GDP, cytoplasm, magnesium, transferase; HET: GDP; 2.00A {Pyrococcus horikoshii} PDB: 2zu7_A* 2zu8_A* Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 239
d1xhba2 328 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosa 4e-21
d2bo4a1 381 c.68.1.18 (A:2-382) Mannosylglycerate synthase, MG 2e-05
d1qg8a_ 255 c.68.1.1 (A:) Spore coat polysaccharide biosynthes 0.001
>d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 328 Back     information, alignment and structure

class: Alpha and beta proteins (a/b)
fold: Nucleotide-diphospho-sugar transferases
superfamily: Nucleotide-diphospho-sugar transferases
family: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain
domain: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain
species: Mouse (Mus musculus) [TaxId: 10090]
 Score = 88.1 bits (217), Expect = 4e-21
 Identities = 42/65 (64%), Positives = 50/65 (76%), Gaps = 1/65 (1%)

Query: 170 RKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILV 229
           R LPD R E CK    Y  NLP TSV+I FHNEAWS LLRTVHSV++RSP H+++EI+LV
Sbjct: 2   RSLPDVRLEGCK-TKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLV 60

Query: 230 DDYSD 234
           DD S+
Sbjct: 61  DDASE 65


>d2bo4a1 c.68.1.18 (A:2-382) Mannosylglycerate synthase, MGS {Rhodothermus marinus [TaxId: 29549]} Length = 381 Back     information, alignment and structure
>d1qg8a_ c.68.1.1 (A:) Spore coat polysaccharide biosynthesis protein SpsA {Bacillus subtilis [TaxId: 1423]} Length = 255 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query239
d1xhba2 328 Polypeptide N-acetylgalactosaminyltransferase 1, N 99.81
>d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Nucleotide-diphospho-sugar transferases
superfamily: Nucleotide-diphospho-sugar transferases
family: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain
domain: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=99.81  E-value=1.6e-20  Score=161.43  Aligned_cols=68  Identities=62%  Similarity=0.956  Sum_probs=65.3

Q ss_pred             CCCCCCCCCccccCccccCCCCCceeEEEEeecChhHHHHHHHHHHhhcCcccccceEEEEeCCCCCCC
Q psy3378         169 KRKLPDPRDEWCKVPGRYLKNLPATSVIICFHNEAWSVLLRTVHSVLDRSPAHLLKEIILVDDYSDMRK  237 (239)
Q Consensus       169 ~RslpD~R~~~Ck~~~~Y~~~LP~~SVII~FhNEa~StLLRTVhSVl~rTP~~LLkEIILVDD~S~~~~  237 (239)
                      +|++||.|++.|+. +.|++++|++||||++|||++++|.|||+||+++||+..+.|||+|||+|+.+.
T Consensus         1 ~~~~~~~r~~~~~~-~~~~~~~P~vSIIIp~yNe~~~~l~~~l~Si~~qt~~~~~~EIIvVDdgS~d~~   68 (328)
T d1xhba2           1 NRSLPDVRLEGCKT-KVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDF   68 (328)
T ss_dssp             CCCCCCCSCGGGGT-CCCCSCCCCEEEEEEESSCCHHHHHHHHHHHHHSSCGGGEEEEEEEECSCCCGG
T ss_pred             CCCCCCCCChhhhh-ccCCCCCCCEEEEEeccCCcHHHHHHHHHHHHhcCCCCCCeEEEEEECCCChhh
Confidence            69999999999999 899999999999999999999999999999999999999999999999998753