Citrus Sinensis ID: 021847


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300------
MRPLWGFPKSDNSSFSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTFEWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEGLEISQPALDPAKSEVHHQITARSQRSIVHRRTYKPGLCDKNSTAPPCTGWIEMMAPVFSRAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGVPEKKTDLLEIGFSPETENITHVMTKTSSHSHKKDPRLEVRRQSFLEFKIFKRRWIQAAKKDNCWIDPYSQSLK
cccccccccccccccEEEEEEEccccHHHHHHHHHHHcccccEEEEEEEcccccccccccccccEEEEEEcccHHHHHHHHcccHHHHccEEEEEEEccccccccccHHHHHHHHHHHccccccccccccccEEEEEEEEEEcccEEEcEEcccccccccccccccccEEEEEcccccHHHHHHHHHHHccccccccHHHHccccccccccccEEEEEEEEEEEEcccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccc
cccccccccccccccEEEEEEEcHcHHHHHHHHHHHccccccEEEEEEEcccccccccccccccEEEEEEccccHHHHHHHcccccHcccccEEEEEccccccccccHHHHHHHHHHccccEcccccccccccEEEEEEEEccccEEEEEEEccccccccccccccHHHHHEHcHHccHHHHHHHHHHHHHHHHccccccccccHcccccccccEEEEEcHEEEEcccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccc
mrplwgfpksdnssfSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYdgvvdqwktfewcdrVIHVSAANQTKWWFakrflhpdiVAEYSYIFlwdedlgvedfnpqKYVSIVKSegleisqpaldpaksevhhqitarsqrsivhrrtykpglcdknstappctgwieMMAPVFSRAAWRCVWYLIQNDLIHAWGldmqlgycaqgdrtknvgvvdAEYVvhygrptlgvpekktdlleigfspetenithvmtktsshshkkdprleVRRQSFLEFKIFKRRWIQAAkkdncwidpysqslk
mrplwgfpksdnSSFSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTFEWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEGLEISQPALDPAKSEVHhqitarsqrsivhrrtykpglcdknstAPPCTGWIEMMAPVFSRAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHygrptlgvpekktDLLEIGFSPETENITHVmtktsshshkkdprlevrRQSFLEFKIFKrrwiqaakkdncwidpysqslk
MRPLWGFPKSDNSSFSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTFEWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEGLEISQPALDPAKSEVHHQITARSQRSIVHRRTYKPGLCDKNSTAPPCTGWIEMMAPVFSRAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGVPEKKTDLLEIGFSPETENITHVMTKTSSHSHKKDPRLEVRRQSFLEFKIFKRRWIQAAKKDNCWIDPYSQSLK
*************SFSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTFEWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKS****************************IVHRRTYKPGLCDKNSTAPPCTGWIEMMAPVFSRAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGVPEKKTDLLEIGF****************************RQSFLEFKIFKRRWIQAAKKDNCWIDP******
**P*************LFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTFEWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEGLEISQPALDPAKSEVHHQITARSQRSIVHRRTYKPGLCDKNSTAPPCTGWIEMMAPVFSRAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGV****************************************RQSFLEFKIFKRRWIQAAKKDNCW**PY*****
MRPLWGFPKSDNSSFSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTFEWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEGLEISQPAL******************IVHRRTYKPGLCDKNSTAPPCTGWIEMMAPVFSRAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGVPEKKTDLLEIGFSPETENITHV*************RLEVRRQSFLEFKIFKRRWIQAAKKDNCWIDPYSQSLK
****WGFPKSDNSSFSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTFEWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEGLEISQPALDPAKSEVHHQITARSQRSIVHRRTYKPGLCDKNSTAPPCTGWIEMMAPVFSRAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTL*************************************RLEVRRQSFLEFKIFKRRWIQAAKKDNCWIDPYS****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MRPLWGFPKSDNSSFSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTFEWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEGLEISQPALDPAKSEVHHQITARSQRSIVHRRTYKPGLCDKNSTAPPCTGWIEMMAPVFSRAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGVPEKKTDLLEIGFSPETENITHVMTKTSSHSHKKDPRLEVRRQSFLEFKIFKRRWIQAAKKDNCWIDPYSQSLK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query306
255543122389 conserved hypothetical protein [Ricinus 0.934 0.735 0.736 1e-133
449449621388 PREDICTED: uncharacterized protein LOC10 0.934 0.737 0.685 1e-126
359476620 466 PREDICTED: uncharacterized protein LOC10 0.986 0.648 0.671 1e-121
297735287 413 unnamed protein product [Vitis vinifera] 0.986 0.731 0.671 1e-121
356566943382 PREDICTED: uncharacterized protein LOC10 0.911 0.730 0.666 1e-118
356574842387 PREDICTED: uncharacterized protein LOC10 0.911 0.720 0.672 1e-117
356530023343 PREDICTED: uncharacterized protein LOC10 0.911 0.813 0.660 1e-117
356533684326 PREDICTED: uncharacterized protein LOC10 0.905 0.849 0.675 1e-117
359486223 413 PREDICTED: uncharacterized protein LOC10 0.986 0.731 0.649 1e-114
240255999389 uncharacterized protein [Arabidopsis tha 0.947 0.745 0.642 1e-110
>gi|255543122|ref|XP_002512624.1| conserved hypothetical protein [Ricinus communis] gi|223548585|gb|EEF50076.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  479 bits (1234), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 226/307 (73%), Positives = 256/307 (83%), Gaps = 21/307 (6%)

Query: 1   MRPLWGFPKSD-NSSFSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTF 59
           MRPLWGFP++D  SS +LF +A GIKQ D+VDKMV KFLSS FSVMLFHYDGVVD+W  +
Sbjct: 90  MRPLWGFPENDETSSINLFTLAVGIKQRDIVDKMVKKFLSSKFSVMLFHYDGVVDEWNDY 149

Query: 60  EWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEG 119
           EW D+VIH+SA NQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVE+F+PQ+Y+SIVKS+G
Sbjct: 150 EWKDQVIHISAHNQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVENFDPQQYLSIVKSKG 209

Query: 120 LEISQPALDPAKSEVHHQITARSQRSIVHRRTYKPGLCDKNSTAPPCTGWIEMMAPVFSR 179
           LEISQPALDP KS +H QITAR +RSIVH RT+KPG CD NSTAPPCTGW+EMMAPVFSR
Sbjct: 210 LEISQPALDPGKSAIHQQITARLRRSIVHSRTFKPGTCDGNSTAPPCTGWVEMMAPVFSR 269

Query: 180 AAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGVPEKKTD 239
           AAWRCVWY+IQNDLIHAWGLD QLGYCAQGDR KN+GVVDAEY+VHYGRPTLG   +   
Sbjct: 270 AAWRCVWYMIQNDLIHAWGLDYQLGYCAQGDRVKNIGVVDAEYIVHYGRPTLGGTGE--- 326

Query: 240 LLEIGFSPETENITHVMTKTSSHSHKKDPRLEVRRQSFLEFKIFKRRWIQAAKKDNCWID 299
                            +K  S S+KKDPRLEVRRQSF+EFKIF++RW +AAK+D CWID
Sbjct: 327 -----------------SKEPSRSNKKDPRLEVRRQSFVEFKIFQKRWEKAAKEDKCWID 369

Query: 300 PYSQSLK 306
           PY Q+ K
Sbjct: 370 PYEQAEK 376




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449449621|ref|XP_004142563.1| PREDICTED: uncharacterized protein LOC101221459 [Cucumis sativus] Back     alignment and taxonomy information
>gi|359476620|ref|XP_002272495.2| PREDICTED: uncharacterized protein LOC100244499 [Vitis vinifera] Back     alignment and taxonomy information
>gi|297735287|emb|CBI17649.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356566943|ref|XP_003551684.1| PREDICTED: uncharacterized protein LOC100798633 [Glycine max] Back     alignment and taxonomy information
>gi|356574842|ref|XP_003555553.1| PREDICTED: uncharacterized protein LOC100802412 [Glycine max] Back     alignment and taxonomy information
>gi|356530023|ref|XP_003533584.1| PREDICTED: uncharacterized protein LOC100814260 [Glycine max] Back     alignment and taxonomy information
>gi|356533684|ref|XP_003535390.1| PREDICTED: uncharacterized protein LOC100807140 [Glycine max] Back     alignment and taxonomy information
>gi|359486223|ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255698 [Vitis vinifera] gi|297739491|emb|CBI29673.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|240255999|ref|NP_193588.5| uncharacterized protein [Arabidopsis thaliana] gi|332658658|gb|AEE84058.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query306
TAIR|locus:2124608389 AT4G18530 "AT4G18530" [Arabido 0.947 0.745 0.642 7.5e-105
TAIR|locus:2202124438 AT1G11170 [Arabidopsis thalian 0.950 0.664 0.547 4.6e-89
TAIR|locus:2008425425 AT1G61240 [Arabidopsis thalian 0.960 0.691 0.541 1.6e-86
TAIR|locus:504956279381 AT1G24570 [Arabidopsis thalian 0.748 0.601 0.508 7.7e-66
TAIR|locus:2008640404 AT1G67850 "AT1G67850" [Arabido 0.767 0.581 0.520 4.7e-64
TAIR|locus:2086721398 AT3G27470 "AT3G27470" [Arabido 0.741 0.570 0.512 1.6e-63
TAIR|locus:2079291396 AT3G26440 [Arabidopsis thalian 0.745 0.575 0.512 8.8e-63
TAIR|locus:2031905401 AT1G13000 [Arabidopsis thalian 0.745 0.568 0.497 1.1e-62
TAIR|locus:2205140382 AT1G08040 [Arabidopsis thalian 0.745 0.596 0.493 1e-61
TAIR|locus:2057557374 AT2G28310 "AT2G28310" [Arabido 0.745 0.609 0.493 1e-61
TAIR|locus:2124608 AT4G18530 "AT4G18530" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1038 (370.5 bits), Expect = 7.5e-105, P = 7.5e-105
 Identities = 196/305 (64%), Positives = 225/305 (73%)

Query:     4 LWGFP----KSDNSSFSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTF 59
             LW +     +  N S SL A+A GIKQ +LV+K++ KF   +F+VMLFHYDGVVD WK +
Sbjct:    93 LWNYDDTKKRRPNHSMSLLAMAVGIKQKELVNKVIQKFPPRDFAVMLFHYDGVVDDWKQY 152

Query:    60 EWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEG 119
              W +  IHVS  NQTKWWFAKRFLHPDIVAEY YIFLWDEDLGV  FNPQ+Y+SIVK EG
Sbjct:   153 PWNNHAIHVSVMNQTKWWFAKRFLHPDIVAEYEYIFLWDEDLGVGHFNPQRYLSIVKEEG 212

Query:   120 LEISQPALDPAKSEVHHQITARSQRSIVHRRTYK---PGLCDKNSTAPPCTGWIEMMAPV 176
             LEISQPALD +KSEVHH ITAR ++S VHRR YK    G CD +ST PPC GW+EMMAPV
Sbjct:   213 LEISQPALDTSKSEVHHPITARRKKSKVHRRMYKYKGSGRCDDHSTNPPCIGWVEMMAPV 272

Query:   177 FSRAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGVPEK 236
             FSRAAWRC WY+IQNDLIHAWGLD QLGYCAQGDR KNVGVVDAEY++HYG PTLGV E 
Sbjct:   273 FSRAAWRCSWYMIQNDLIHAWGLDTQLGYCAQGDRKKNVGVVDAEYIIHYGLPTLGVVET 332

Query:   237 KTDLLEIGFSPETENITHVMTKTSSHSHKKDPRLEVRRQSFLEFKIFKRRWIQAAKKDNC 296
              +  L         N T   +  S  S + D R EVR +SF+E K FK RW +A + D C
Sbjct:   333 ASSALR--------NETDSKSTESLESREVDNRPEVRMKSFVEMKRFKERWKKAVRDDTC 384

Query:   297 WIDPY 301
             W+DPY
Sbjct:   385 WVDPY 389




GO:0008150 "biological_process" evidence=ND
GO:0016757 "transferase activity, transferring glycosyl groups" evidence=ISM
TAIR|locus:2202124 AT1G11170 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2008425 AT1G61240 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504956279 AT1G24570 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2008640 AT1G67850 "AT1G67850" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2086721 AT3G27470 "AT3G27470" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2079291 AT3G26440 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031905 AT1G13000 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2205140 AT1G08040 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2057557 AT2G28310 "AT2G28310" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00035311001
SubName- Full=Chromosome undetermined scaffold_77, whole genome shotgun sequence; (411 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query306
pfam05212294 pfam05212, DUF707, Protein of unknown function (DU 1e-130
>gnl|CDD|218501 pfam05212, DUF707, Protein of unknown function (DUF707) Back     alignment and domain information
 Score =  371 bits (955), Expect = e-130
 Identities = 141/296 (47%), Positives = 179/296 (60%), Gaps = 29/296 (9%)

Query: 1   MRPLWGFPKSDNSS--FSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKT 58
           +R LWG P+ D +S    L A   G  Q   VD  V KF S NF+++LFHYDG   +W  
Sbjct: 26  LRRLWGSPEEDVASKPKYLLAFTVGYSQKANVDACVKKF-SDNFTIVLFHYDGRTTEWDE 84

Query: 59  FEWCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSE 118
            EW  + IHVSA  QTKWWFAKRFLHPDIVA Y YIFLWDEDLGV++F+ ++Y+ IVK  
Sbjct: 85  LEWSKKAIHVSAKKQTKWWFAKRFLHPDIVAPYEYIFLWDEDLGVDNFDAEEYIKIVKKH 144

Query: 119 GLEISQPALDPAKSEVHHQITARSQRSIVHRRTYKPGLCDKNSTAPPCTGWIEMMAPVFS 178
           GLEISQP LDP + ++  +IT R     VH+ T + G C  NST PPCTG++E+MAPVFS
Sbjct: 145 GLEISQPGLDPDRGQITWRITKRRGDGEVHKDTREKGRCCDNSTGPPCTGFVEIMAPVFS 204

Query: 179 RAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGVPEKKT 238
           R AWRCVW++IQNDL+H WGLD  L  C      + +GVVD+++VVH G P+LG      
Sbjct: 205 RDAWRCVWHMIQNDLVHGWGLDFALRRCV-EPAHEKIGVVDSQWVVHQGIPSLGSQGTAE 263

Query: 239 DLLEIGFSPETENITHVMTKTSSHSHKKDPRLEVRRQSFLEFKIFKRRWIQAAKKD 294
           +                          K P   VR +   E+ +F+RRW  A K+ 
Sbjct: 264 N-------------------------GKAPWQGVRDRCKAEWTMFQRRWAAAEKEY 294


This family consists of several uncharacterized proteins from Arabidopsis thaliana. Length = 294

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 306
PF05212294 DUF707: Protein of unknown function (DUF707); Inte 100.0
cd04185202 GT_2_like_b Subfamily of Glycosyltransferase Famil 93.94
TIGR01556281 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids 92.67
cd02510299 pp-GalNAc-T pp-GalNAc-T initiates the formation of 92.58
cd04186166 GT_2_like_c Subfamily of Glycosyltransferase Famil 92.38
cd02526237 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl tr 89.37
PF13641228 Glyco_tranf_2_3: Glycosyltransferase like family 2 87.46
cd02520196 Glucosylceramide_synthase Glucosylceramide synthas 86.69
PF01762195 Galactosyl_T: Galactosyltransferase; InterPro: IPR 86.43
cd02525249 Succinoglycan_BP_ExoA ExoA is involved in the bios 86.16
cd04195201 GT2_AmsE_like GT2_AmsE_like is involved in exopoly 84.92
cd06442224 DPM1_like DPM1_like represents putative enzymes si 81.03
PLN02726243 dolichyl-phosphate beta-D-mannosyltransferase 80.39
cd06433202 GT_2_WfgS_like WfgS and WfeV are involved in O-ant 80.23
>PF05212 DUF707: Protein of unknown function (DUF707); InterPro: IPR007877 This family consists of uncharacterised proteins from Arabidopsis thaliana Back     alignment and domain information
Probab=100.00  E-value=1.1e-119  Score=847.46  Aligned_cols=266  Identities=64%  Similarity=1.173  Sum_probs=255.2

Q ss_pred             CCCCCCCCCCC--CCCCeeEEEEecccccccHHHHhhhccCCCceEEEEeecCccCccccccccCceeEEEeeccchhhh
Q 021847            1 MRPLWGFPKSD--NSSFSLFAIAAGIKQSDLVDKMVTKFLSSNFSVMLFHYDGVVDQWKTFEWCDRVIHVSAANQTKWWF   78 (306)
Q Consensus         1 ~r~lwg~~~~~--~~~k~Lv~~~VG~kqk~~v~~~v~kf~~~~Fdv~LfhYdg~~d~w~~~ews~~aIhv~~~~qtKww~   78 (306)
                      ||||||+|+++  .++||||||+||+|||++||++|+|| ++|||||||||||++|+|++||||++||||++.|||||||
T Consensus        26 ~r~lw~~p~~~~~~~~k~Lla~~VG~kqk~~vd~~v~Kf-~~nF~i~LfhYDg~vd~w~~~~ws~~aiHv~~~kqtKww~  104 (294)
T PF05212_consen   26 LRPLWGNPSEDLPKKPKYLLAMTVGIKQKDNVDAIVKKF-SDNFDIMLFHYDGRVDEWDDFEWSDRAIHVSARKQTKWWF  104 (294)
T ss_pred             eeecCCCccccccCCCceEEEEEecHHHHhhhhHHHhhh-ccCceEEEEEecCCcCchhhcccccceEEEEeccceEEee
Confidence            79999999999  66799999999999999999999999 8999999999999999999999999999999999999999


Q ss_pred             hccccCcccccCccEEEEeccccCCCCCChHHHHHHHHHhCCcccCCCcCCCCCceeeeeeeeeccceecceeccCCCCC
Q 021847           79 AKRFLHPDIVAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEGLEISQPALDPAKSEVHHQITARSQRSIVHRRTYKPGLCD  158 (306)
Q Consensus        79 akRflhPdiv~~YdYIflwDDDl~vd~f~i~ry~~ivr~~gLeIsQPALd~~~S~~sh~iT~r~~~~~vhr~~~~~~~C~  158 (306)
                      ||||||||||++|||||||||||+||+|+|+|||+||++||||||||||++++|++||+||+|++.++|||.+.+.+.|.
T Consensus       105 akrfLHPdiv~~YdYiflwDeDL~vd~f~~~ry~~Ivk~~gLeISQPALd~~~~~~~~~iT~R~~~~~vhr~~~~~~~~~  184 (294)
T PF05212_consen  105 AKRFLHPDIVAPYDYIFLWDEDLGVDHFDINRYFEIVKKEGLEISQPALDPDSSEIHHPITKRRPDSEVHRKTRGGPRCC  184 (294)
T ss_pred             hhhhcChhhhccceeEEecCCccCcCcCCHHHHHHHHHHhCCcccCcccCCCCceeeeeEEeecCCceeEeccCCCCCcC
Confidence            99999999999999999999999999999999999999999999999999998889999999999999999777788888


Q ss_pred             CCCCCCCCcceEEEeccccchhHHHHhhhhhcCCCcceehhhHhhhhhhcCCCCCcEEEEeeeeEEEccccCCCCCCCcc
Q 021847          159 KNSTAPPCTGWIEMMAPVFSRAAWRCVWYLIQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLGVPEKKT  238 (306)
Q Consensus       159 ~~~~~ppct~fVEiMaPVFSR~Awrcvw~~iqNdLvhGWGLD~~~~~c~~~~~~~kiGVVDa~~V~H~g~Ptlg~~~~~~  238 (306)
                      +++++||||||||||||||||+|||||||||||||+|||||||+|++|+ +++++||||||||||+|+|+||||+++...
T Consensus       185 ~~~~~ppct~fVEiMAPVFSr~Awrcvw~miqNDLvhGWGLDf~~~~c~-~~~~~kiGVVDs~~VvH~gvptLG~~~~~~  263 (294)
T PF05212_consen  185 DDSTGPPCTGFVEIMAPVFSRAAWRCVWHMIQNDLVHGWGLDFKWGYCA-GDRHKKIGVVDSQYVVHTGVPTLGGQGNSE  263 (294)
T ss_pred             CCCCCCCcceEEEEecceechHHHHHHHhcccCCCccccchhhhHHHHh-ccccccEEEEeeEEEEEcCCCcCCCccccc
Confidence            8999999999999999999999999999999999999999999999999 789999999999999999999999965321


Q ss_pred             ccccccCCCCcccccccccccccCCCCCCChHHHHhhhHHHHHHHHHHHHHHHhc
Q 021847          239 DLLEIGFSPETENITHVMTKTSSHSHKKDPRLEVRRQSFLEFKIFKRRWIQAAKK  293 (306)
Q Consensus       239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vr~r~~~E~~~f~~R~~~A~~~  293 (306)
                                               .+.++|.+||+||++||++|++||++|+++
T Consensus       264 -------------------------~~~~~~~~Vr~r~~~E~~~F~~R~~~a~~~  293 (294)
T PF05212_consen  264 -------------------------KGKDPREEVRRRSFAEMRIFQKRWANAVKE  293 (294)
T ss_pred             -------------------------cCCchHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence                                     145789999999999999999999999986



>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function Back     alignment and domain information
>TIGR01556 rhamnosyltran L-rhamnosyltransferase Back     alignment and domain information
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans Back     alignment and domain information
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function Back     alignment and domain information
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase Back     alignment and domain information
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A Back     alignment and domain information
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis Back     alignment and domain information
>PF01762 Galactosyl_T: Galactosyltransferase; InterPro: IPR002659 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases Back     alignment and domain information
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan Back     alignment and domain information
>cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis Back     alignment and domain information
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1 Back     alignment and domain information
>PLN02726 dolichyl-phosphate beta-D-mannosyltransferase Back     alignment and domain information
>cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query306
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 5e-08
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 53.3 bits (127), Expect = 5e-08
 Identities = 45/309 (14%), Positives = 82/309 (26%), Gaps = 82/309 (26%)

Query: 3   PLWGFPKSDNSSFSLFAIAAGIKQSDLVDKMVTK--FLSSNFSVMLFHYDGVVDQWKTFE 60
                   +    SL       +  DL  +++T      S  +  +       D WK   
Sbjct: 294 HSMTLTPDE--VKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIRDGLATWDNWKHVN 351

Query: 61  WCDRVIHVSAANQTKWWFAKRFLHPDIVAEYSY----IF-------------LWDEDLGV 103
            CD++  +  +       +   L P    +  +    +F             +W +   V
Sbjct: 352 -CDKLTTIIES-------SLNVLEPAEYRKM-FDRLSVFPPSAHIPTILLSLIWFD---V 399

Query: 104 EDFNPQKYVSIVKSEGLEISQPALDPAKSEVHH-QITARSQRSI---VHRR---TYKPGL 156
              +    V+ +    L   QP    +   +    +  + +      +HR     Y    
Sbjct: 400 IKSDVMVVVNKLHKYSLVEKQP--KESTISIPSIYLELKVKLENEYALHRSIVDHYNIPK 457

Query: 157 C-DKNSTAPPCT-GWI-----------EM--MAPVFSRAAWRCVWYL----IQNDLIH-- 195
             D +   PP    +            E      +F         +L    ++  + H  
Sbjct: 458 TFDSDDLIPPYLDQYFYSHIGHHLKNIEHPERMTLFRM------VFLDFRFLEQKIRHDS 511

Query: 196 -AWG---------LDMQL--GYCAQGDRTKNVGVVDAEYVVHYGRPTLGVPEKKTDLLEI 243
            AW            ++    Y    D      V      +      L +  K TDLL I
Sbjct: 512 TAWNASGSILNTLQQLKFYKPYICDNDPKYERLVNAILDFLPKIEENL-ICSKYTDLLRI 570

Query: 244 GFSPETENI 252
               E E I
Sbjct: 571 ALMAEDEAI 579


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query306
1qg8_A255 Protein (spore coat polysaccharide biosynthesis P 92.02
1xhb_A 472 Polypeptide N-acetylgalactosaminyltransferase 1; g 84.3
>1qg8_A Protein (spore coat polysaccharide biosynthesis P SPSA); glycosyltransferase, transferase; 1.50A {Bacillus subtilis} SCOP: c.68.1.1 PDB: 1h7q_A* 1h7l_A 1qgq_A* 1qgs_A* Back     alignment and structure
Probab=92.02  E-value=0.41  Score=40.43  Aligned_cols=119  Identities=13%  Similarity=-0.013  Sum_probs=67.4

Q ss_pred             cCccEEEEeccccCCCCCChHHHHHHHHHh-CCcccCCCc-----CCCCCceeeeeeeeeccceecceeccCCCCCCCCC
Q 021847           89 AEYSYIFLWDEDLGVEDFNPQKYVSIVKSE-GLEISQPAL-----DPAKSEVHHQITARSQRSIVHRRTYKPGLCDKNST  162 (306)
Q Consensus        89 ~~YdYIflwDDDl~vd~f~i~ry~~ivr~~-gLeIsQPAL-----d~~~S~~sh~iT~r~~~~~vhr~~~~~~~C~~~~~  162 (306)
                      +..|||++.|+|..++...+.++++.+.++ +..+..+..     +.+ +...+.. .+ +...+.     .        
T Consensus        87 a~g~~i~~lD~Dd~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~-~~~~~~~-~~-~~~~~~-----~--------  150 (255)
T 1qg8_A           87 AEGEYITYATDDNIYMPDRLLKMVRELDTHPEKAVIYSASKTYHLNEN-RDIVKET-VR-PAAQVT-----W--------  150 (255)
T ss_dssp             CCCSEEEEEETTEEECTTHHHHHHHHHHHCTTCCEEEEEEEEEEEC----CEEEEE-EE-CCCSCB-----S--------
T ss_pred             cCCCEEEEeCCCCccChHHHHHHHHHHHhCCCceEEEeceEEEEeCCC-Ccchhhc-cC-chHHHH-----H--------
Confidence            578999999999999999999999999876 666655543     222 1111110 00 100000     0        


Q ss_pred             CCCCcceEEEeccccchhHHHHhh-hh---hcCCCcceehhhHhhhhhhcCCCCCcEEEEeeeeEEEcc
Q 021847          163 APPCTGWIEMMAPVFSRAAWRCVW-YL---IQNDLIHAWGLDMQLGYCAQGDRTKNVGVVDAEYVVHYG  227 (306)
Q Consensus       163 ~ppct~fVEiMaPVFSR~Awrcvw-~~---iqNdLvhGWGLD~~~~~c~~~~~~~kiGVVDa~~V~H~g  227 (306)
                        ....++-..+=+|+|++++-+- .|   ...+.....+-|+.+...+.. . .++..++...+.|+.
T Consensus       151 --~~~~~~~~~~~~~rr~~~~~~g~~f~~~~~~~~~~~~~eD~~~~~r~~~-~-g~~~~~~~~~~~~r~  215 (255)
T 1qg8_A          151 --NAPCAIDHCSVMHRYSVLEKVKEKFGSYWDESPAFYRIGDARFFWRVNH-F-YPFYPLDEELDLNYI  215 (255)
T ss_dssp             --CCTTTSCGGGEEEETHHHHHHHHHHSSSSCCCGGGSTTHHHHHHHHHTT-T-CCBEEEEEEEEEEEE
T ss_pred             --hhcCCCccceEEEEHHHHHhhcccccccccCCcccchHHHHHHHHHHHH-h-CCEEEecCcEEEEEE
Confidence              0111222234578999997664 34   111111234567666544422 2 369999998888873



>1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query306
d1xhba2328 Polypeptide N-acetylgalactosaminyltransferase 1, N 86.02
>d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Nucleotide-diphospho-sugar transferases
superfamily: Nucleotide-diphospho-sugar transferases
family: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain
domain: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=86.02  E-value=0.44  Score=40.25  Aligned_cols=140  Identities=13%  Similarity=0.071  Sum_probs=72.5

Q ss_pred             ccCccEEEEeccccCCCCCChHHHHHHHHHhCCcccCCCcCCCCCc-eee----eeeeeec--cceecceecc---CCCC
Q 021847           88 VAEYSYIFLWDEDLGVEDFNPQKYVSIVKSEGLEISQPALDPAKSE-VHH----QITARSQ--RSIVHRRTYK---PGLC  157 (306)
Q Consensus        88 v~~YdYIflwDDDl~vd~f~i~ry~~ivr~~gLeIsQPALd~~~S~-~sh----~iT~r~~--~~~vhr~~~~---~~~C  157 (306)
                      .+..|||+|.|+|..++.-.++++++.+.+..-.+..|.++.-... ..+    ..+.-.-  ....+.....   ....
T Consensus       105 ~a~gd~i~flD~D~~~~p~~l~~l~~~~~~~~~~~v~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  184 (328)
T d1xhba2         105 VSRGQVITFLDAHCECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRR  184 (328)
T ss_dssp             HCCSSEEEEEESSEEECTTCHHHHHHHHHHCTTEEEEEEEEEECSSSCCEECCCTTEEEEECTTCCEEEEECCHHHHHHT
T ss_pred             hhhcceeeecCcccccChhHHHHHHHHHhcCCCeEEecceeeeccCceeeccCCccccccccccccccccccchhhhhcc
Confidence            3678999999999999999999999999998877777764321000 000    0000000  0000000000   0000


Q ss_pred             CCCCCCCCCcceEEEeccccchhHHHHhhhhhcCCCcceeh---hhHhhhhhhcCCCCCcEEEEeeeeEEEccccCCC
Q 021847          158 DKNSTAPPCTGWIEMMAPVFSRAAWRCVWYLIQNDLIHAWG---LDMQLGYCAQGDRTKNVGVVDAEYVVHYGRPTLG  232 (306)
Q Consensus       158 ~~~~~~ppct~fVEiMaPVFSR~Awrcvw~~iqNdLvhGWG---LD~~~~~c~~~~~~~kiGVVDa~~V~H~g~Ptlg  232 (306)
                      ....+.+..+.++-.-+=.++|++|.-+=.|-. . ...||   +|+.+.....   +.+|..+-...|.|..+-..+
T Consensus       185 ~~~~~~~~~~~~~~g~~~~irr~~f~~vGgfDe-~-~~~~g~ED~Dl~~R~~~~---G~~i~~~p~~~v~H~~~~~~~  257 (328)
T d1xhba2         185 KGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDA-G-MDIWGGENLEISFRIWQC---GGTLEIVTCSHVGHVFRKATP  257 (328)
T ss_dssp             TTCTTSCEECSBCCSSSEEEEHHHHHHTTSCCT-T-SCTTCCCCSHHHHHHHHT---TCEEEEEEEEEEEEEC-----
T ss_pred             ccccccccccceecceeeeeeHHHHHHhCCCCC-C-CcCcCchHHHHHHHHHHh---CCeEEEeCCeEEEEeCCCCCC
Confidence            011112222223322233468999986533322 2 23443   5566655543   468999999999998544443