Citrus Sinensis ID: 017407


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370--
MMKTTNSISVLSDPPSRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSAKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNTVGQASDDLEQIANPVALVCILNLKLPFSTFFIRT
cccccccccccccccccccHHHHHHHHHHHHHHHHccEEEEEEcccHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEEccccHHHHHHHHHccccccEEEEEEEEcccccccccccccccEEEEEEcccHHHHHHHHcccHHHHHHccEEEEEccccccccccHHHHHHHHHHHcccccccccccccccccccEEEEcccccccEEEEEccccccccccccccccccEEEEEEccccHHHHHHHHHHcccccccccHHHHHHHHcccccccccEEEEEEEEEEEccccccccccccccccccccccHHHHHcccEEEEEEEEcccccEEEEEEc
cccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccEEccccccEcccccccccccccccccEEEEEEEcHHHHHHHHHHHHHccccccEEEEEEEcccccccccccccccEEEEEEccccHHHHHHHcccccHcccccEEEEEccccccccccHHHHHHHHHHccccEcccccccccccEEEEEEEEccccEEEEEEEcccccccccccccccccHHHHHHHcHHccHHHHHHHHHHHHHHHHccccccccccHcccccccccEEEEEcHEEEEcccccccccccccccccccccccHHHccccEEEEEEEEccccccEEEEEc
mmkttnsisvlsdppsrscLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSmysakpetcknqqcrlpgtealpegivsktsnlemrplwsspsklnnqrppmNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFakrflhpdivAEYNYIFlwdedigvenfnprrylsivkdegleisqpaldpvksevhhpitarrrnsKAHRRMYkykgsgrcddystappcigwvemMAPVFSRAAWRCAWYMIQNDLIHAWGLdiqlgycaqgdrtknvgvvdseyivhlglptlgvttepelntvgqasddleqIANPVALVCILnlklpfstffirt
mmkttnsisvlsdppsRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSAKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSspsklnnqrPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEIsqpaldpvksevhhpitarrrnskahrrmykykgsgrcddysTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNTVGQASDDLEQIANPVALVCILNlklpfstffirt
MMKTTNSISVLSDPPSRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSAKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNTVGQASDDLEQIANPVALVCILNLKLPFSTFFIRT
*****************SCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSAK***********************************************NLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLE******************************YKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNTVGQASDDLEQIANPVALVCILNLKLPFSTFFI**
***********************FIAAALICSVYFIGSSFVAKENKERLMRWGL****************************IVSKTSNLEM****************MNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKAHR*MYKYK*SGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGL***************************VALVCILNLKLPFSTFFIRT
*************PPSRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSAKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKS*****************RMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNTVGQASDDLEQIANPVALVCILNLKLPFSTFFIRT
**************PSRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMY*********QQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTL****************DLEQIANPVALVCILNLKLPFSTFFIRT
iiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHoooo
oooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MMKTTNSISVLSDPPSRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSAKPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPELNTVGQASDDLEQIANPVALVCILNLKLPFSTFFIRT
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query372
255575938370 conserved hypothetical protein [Ricinus 0.897 0.902 0.710 1e-142
359486223413 PREDICTED: uncharacterized protein LOC10 0.948 0.854 0.674 1e-141
356560121385 PREDICTED: uncharacterized protein LOC10 0.930 0.898 0.66 1e-135
356541848387 PREDICTED: uncharacterized protein LOC10 0.922 0.886 0.667 1e-134
92893672355 Protein of unknown function DUF707 [Medi 0.892 0.935 0.683 1e-133
240255999389 uncharacterized protein [Arabidopsis tha 0.844 0.807 0.688 1e-132
357445211406 hypothetical protein MTR_2g005330 [Medic 0.916 0.839 0.644 1e-131
357445209368 hypothetical protein MTR_2g005330 [Medic 0.892 0.902 0.658 1e-130
449515369402 PREDICTED: uncharacterized LOC101205845, 0.865 0.800 0.658 1e-128
297735287413 unnamed protein product [Vitis vinifera] 0.919 0.828 0.635 1e-128
>gi|255575938|ref|XP_002528866.1| conserved hypothetical protein [Ricinus communis] gi|223531717|gb|EEF33540.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  511 bits (1315), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 238/335 (71%), Positives = 277/335 (82%), Gaps = 1/335 (0%)

Query: 2   MKTTNSISVLSDPPSRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSA 61
           MK+    S   DP SRS LC+LF+ A+LICS YFIG SF+ KE KERL RW ++ ++ S 
Sbjct: 1   MKSLYCASASPDPKSRSYLCTLFVVASLICSAYFIGGSFIGKEYKERLARWQVIETVQST 60

Query: 62  KPETCKNQQCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQK 121
           K   C++Q C+  GT+ALP+GIV KTS+ EMRPLW+S  + N Q+   +LLA+A GI QK
Sbjct: 61  KSTNCEDQ-CKPTGTKALPQGIVRKTSDFEMRPLWNSSLEDNKQKLSKSLLALAVGINQK 119

Query: 122 KIVDQIVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDI 181
            +VDQIV+KFP  DFVVMLFHYDGVVD+W+DL W+D AIHVSA NQTKWWFAKRFLHPDI
Sbjct: 120 VVVDQIVKKFPLSDFVVMLFHYDGVVDKWRDLPWSDHAIHVSAVNQTKWWFAKRFLHPDI 179

Query: 182 VAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKA 241
           V+EY+Y+FLWDED+GVENFNP+RYLSI++DEGLEISQPALDP KS V+HPITAR+  S  
Sbjct: 180 VSEYDYLFLWDEDLGVENFNPKRYLSIIRDEGLEISQPALDPTKSAVYHPITARQPKSTV 239

Query: 242 HRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLG 301
           HRR+YK+KGSGRC   ST+PPCIGWVEMMAPVFS AAWRCAW+MIQNDLIHAWGLD QLG
Sbjct: 240 HRRIYKFKGSGRCYGNSTSPPCIGWVEMMAPVFSTAAWRCAWHMIQNDLIHAWGLDFQLG 299

Query: 302 YCAQGDRTKNVGVVDSEYIVHLGLPTLGVTTEPEL 336
           YCAQGDRTKNVGVVDSEYIVHLGL TLGV    E+
Sbjct: 300 YCAQGDRTKNVGVVDSEYIVHLGLLTLGVFNGTEV 334




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|359486223|ref|XP_002265374.2| PREDICTED: uncharacterized protein LOC100255698 [Vitis vinifera] gi|297739491|emb|CBI29673.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356560121|ref|XP_003548344.1| PREDICTED: uncharacterized protein LOC100797710 [Glycine max] Back     alignment and taxonomy information
>gi|356541848|ref|XP_003539384.1| PREDICTED: uncharacterized protein LOC100526994 [Glycine max] Back     alignment and taxonomy information
>gi|92893672|gb|ABE91850.1| Protein of unknown function DUF707 [Medicago truncatula] gi|92893916|gb|ABE91966.1| Protein of unknown function DUF707 [Medicago truncatula] Back     alignment and taxonomy information
>gi|240255999|ref|NP_193588.5| uncharacterized protein [Arabidopsis thaliana] gi|332658658|gb|AEE84058.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|357445211|ref|XP_003592883.1| hypothetical protein MTR_2g005330 [Medicago truncatula] gi|355481931|gb|AES63134.1| hypothetical protein MTR_2g005330 [Medicago truncatula] Back     alignment and taxonomy information
>gi|357445209|ref|XP_003592882.1| hypothetical protein MTR_2g005330 [Medicago truncatula] gi|355481930|gb|AES63133.1| hypothetical protein MTR_2g005330 [Medicago truncatula] Back     alignment and taxonomy information
>gi|449515369|ref|XP_004164722.1| PREDICTED: uncharacterized LOC101205845, partial [Cucumis sativus] Back     alignment and taxonomy information
>gi|297735287|emb|CBI17649.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query372
TAIR|locus:2124608389 AT4G18530 "AT4G18530" [Arabido 0.889 0.850 0.660 8.9e-125
TAIR|locus:2202124438 AT1G11170 [Arabidopsis thalian 0.690 0.586 0.606 1.1e-87
TAIR|locus:2008425425 AT1G61240 [Arabidopsis thalian 0.696 0.609 0.593 3.3e-86
TAIR|locus:2031905401 AT1G13000 [Arabidopsis thalian 0.696 0.645 0.496 5.6e-68
TAIR|locus:2086721398 AT3G27470 "AT3G27470" [Arabido 0.677 0.633 0.494 2.4e-67
TAIR|locus:2205140382 AT1G08040 [Arabidopsis thalian 0.682 0.664 0.486 3.6e-66
TAIR|locus:2057557374 AT2G28310 "AT2G28310" [Arabido 0.666 0.663 0.482 5.2e-65
TAIR|locus:2008640404 AT1G67850 "AT1G67850" [Arabido 0.666 0.613 0.490 6.7e-65
TAIR|locus:2079291396 AT3G26440 [Arabidopsis thalian 0.669 0.628 0.490 1.4e-64
TAIR|locus:504956279381 AT1G24570 [Arabidopsis thalian 0.663 0.648 0.484 2.3e-64
TAIR|locus:2124608 AT4G18530 "AT4G18530" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1226 (436.6 bits), Expect = 8.9e-125, P = 8.9e-125
 Identities = 228/345 (66%), Positives = 274/345 (79%)

Query:    16 SRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSAKPE--------TCK 67
             +RSCLCS+ I  ALIC  YFI ++++AK+ KE+L++W +   M+++  +        TCK
Sbjct:     7 NRSCLCSVLITTALICGAYFICNAYLAKDFKEKLLKWEITDKMHNSTDKMQNATTTSTCK 66

Query:    68 NQQCRLPGTEALPEGIVSKTSNLEMRPLWS-SPSKLNNQRPPMNLLAIAAGIKQKKIVDQ 126
             N   +  GTEALP+GI+ KTSNLE + LW+   +K       M+LLA+A GIKQK++V++
Sbjct:    67 NFN-KPVGTEALPQGIIEKTSNLETQHLWNYDDTKKRRPNHSMSLLAMAVGIKQKELVNK 125

Query:   127 IVRKFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYN 186
             +++KFP +DF VMLFHYDGVVD+WK   W + AIHVS  NQTKWWFAKRFLHPDIVAEY 
Sbjct:   126 VIQKFPPRDFAVMLFHYDGVVDDWKQYPWNNHAIHVSVMNQTKWWFAKRFLHPDIVAEYE 185

Query:   187 YIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKAHRRMY 246
             YIFLWDED+GV +FNP+RYLSIVK+EGLEISQPALD  KSEVHHPITARR+ SK HRRMY
Sbjct:   186 YIFLWDEDLGVGHFNPQRYLSIVKEEGLEISQPALDTSKSEVHHPITARRKKSKVHRRMY 245

Query:   247 KYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQG 306
             KYKGSGRCDD+ST PPCIGWVEMMAPVFSRAAWRC+WYMIQNDLIHAWGLD QLGYCAQG
Sbjct:   246 KYKGSGRCDDHSTNPPCIGWVEMMAPVFSRAAWRCSWYMIQNDLIHAWGLDTQLGYCAQG 305

Query:   307 DRTKNVGVVDSEYIVHLGLPTLGV--TTEPELN--TVGQASDDLE 347
             DR KNVGVVD+EYI+H GLPTLGV  T    L   T  ++++ LE
Sbjct:   306 DRKKNVGVVDAEYIIHYGLPTLGVVETASSALRNETDSKSTESLE 350




GO:0008150 "biological_process" evidence=ND
GO:0016757 "transferase activity, transferring glycosyl groups" evidence=ISM
TAIR|locus:2202124 AT1G11170 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2008425 AT1G61240 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031905 AT1G13000 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2086721 AT3G27470 "AT3G27470" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2205140 AT1G08040 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2057557 AT2G28310 "AT2G28310" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2008640 AT1G67850 "AT1G67850" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2079291 AT3G26440 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:504956279 AT1G24570 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query372
pfam05212294 pfam05212, DUF707, Protein of unknown function (DU 1e-137
>gnl|CDD|218501 pfam05212, DUF707, Protein of unknown function (DUF707) Back     alignment and domain information
 Score =  393 bits (1012), Expect = e-137
 Identities = 137/266 (51%), Positives = 170/266 (63%), Gaps = 6/266 (2%)

Query: 70  QCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVR 129
                G E LP GIV   S+  +R LW SP + +    P  LLA   G  QK  VD  V+
Sbjct: 4   PTNPRGAERLPPGIVVSESDFYLRRLWGSPEE-DVASKPKYLLAFTVGYSQKANVDACVK 62

Query: 130 KFPSKDFVVMLFHYDGVVDEWKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIF 189
           KF S +F ++LFHYDG   EW +L W+ +AIHVSA  QTKWWFAKRFLHPDIVA Y YIF
Sbjct: 63  KF-SDNFTIVLFHYDGRTTEWDELEWSKKAIHVSAKKQTKWWFAKRFLHPDIVAPYEYIF 121

Query: 190 LWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVHHPITARRRNSKAHRRMYKYK 249
           LWDED+GV+NF+   Y+ IVK  GLEISQP LDP + ++   IT RR + + H+   +  
Sbjct: 122 LWDEDLGVDNFDAEEYIKIVKKHGLEISQPGLDPDRGQITWRITKRRGDGEVHKDTREKG 181

Query: 250 GSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRT 309
                   ST PPC G+VE+MAPVFSR AWRC W+MIQNDL+H WGLD  L  C      
Sbjct: 182 RCCD---NSTGPPCTGFVEIMAPVFSRDAWRCVWHMIQNDLVHGWGLDFALRRCV-EPAH 237

Query: 310 KNVGVVDSEYIVHLGLPTLGVTTEPE 335
           + +GVVDS+++VH G+P+LG     E
Sbjct: 238 EKIGVVDSQWVVHQGIPSLGSQGTAE 263


This family consists of several uncharacterized proteins from Arabidopsis thaliana. Length = 294

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 372
PF05212294 DUF707: Protein of unknown function (DUF707); Inte 100.0
cd04185202 GT_2_like_b Subfamily of Glycosyltransferase Famil 94.0
cd04186166 GT_2_like_c Subfamily of Glycosyltransferase Famil 93.18
TIGR01556281 rhamnosyltran L-rhamnosyltransferase. Rhamnolipids 93.07
cd02510299 pp-GalNAc-T pp-GalNAc-T initiates the formation of 89.22
cd02526237 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl tr 89.0
cd02525249 Succinoglycan_BP_ExoA ExoA is involved in the bios 86.95
cd02520196 Glucosylceramide_synthase Glucosylceramide synthas 83.88
PF13641228 Glyco_tranf_2_3: Glycosyltransferase like family 2 83.79
cd06442224 DPM1_like DPM1_like represents putative enzymes si 80.77
cd06421234 CESA_CelA_like CESA_CelA_like are involved in the 80.26
>PF05212 DUF707: Protein of unknown function (DUF707); InterPro: IPR007877 This family consists of uncharacterised proteins from Arabidopsis thaliana Back     alignment and domain information
Probab=100.00  E-value=5e-123  Score=888.51  Aligned_cols=276  Identities=63%  Similarity=1.132  Sum_probs=265.4

Q ss_pred             cCCCCCCCCCCCCceecCCCcceecCCCCCCCCCCCCCCCcEEEEEecCccccchhHHhhcCCCCCcEEEEEEecCccCc
Q 017407           70 QCRLPGTEALPEGIVSKTSNLEMRPLWSSPSKLNNQRPPMNLLAIAAGIKQKKIVDQIVRKFPSKDFVVMLFHYDGVVDE  149 (372)
Q Consensus        70 q~~~~g~e~Lp~giv~~~sd~~lr~Lwg~p~~~~~~~~~k~Lla~~VG~kqk~~Vd~~v~kf~~~nFdvmLFhYDg~vd~  149 (372)
                      ||+|+|+|+||+|||+++||||||||||.|+++. +.++|||||||||+|||++||++|+|| ++|||||||||||+||+
T Consensus         4 ~~~p~g~e~Lp~giv~~~sd~~~r~lw~~p~~~~-~~~~k~Lla~~VG~kqk~~vd~~v~Kf-~~nF~i~LfhYDg~vd~   81 (294)
T PF05212_consen    4 PCNPRGAERLPPGIVVRESDLELRPLWGNPSEDL-PKKPKYLLAMTVGIKQKDNVDAIVKKF-SDNFDIMLFHYDGRVDE   81 (294)
T ss_pred             CCCCCccccCCCCccccCCCceeeecCCCccccc-cCCCceEEEEEecHHHHhhhhHHHhhh-ccCceEEEEEecCCcCc
Confidence            8999999999999999999999999999999886 568899999999999999999999999 99999999999999999


Q ss_pred             ccccccccceeEEEeecccchhhhccccChhhhccccEEEEeccccccCCCCHHHHHHHHHHhCCcccCCCCCCCCCcee
Q 017407          150 WKDLVWADRAIHVSAANQTKWWFAKRFLHPDIVAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDPVKSEVH  229 (372)
Q Consensus       150 w~d~ews~~aiHvsa~kqtKWwfakRfLHPdiv~~YDYIFLwDDDL~vd~f~i~ry~~Ivr~~gLeISQPALd~~s~~i~  229 (372)
                      |+|||||++||||++.|||||||||||||||+|++|||||||||||+||+|+|+|||+||++|||||||||||++++++|
T Consensus        82 w~~~~ws~~aiHv~~~kqtKww~akrfLHPdiv~~YdYiflwDeDL~vd~f~~~ry~~Ivk~~gLeISQPALd~~~~~~~  161 (294)
T PF05212_consen   82 WDDFEWSDRAIHVSARKQTKWWFAKRFLHPDIVAPYDYIFLWDEDLGVDHFDINRYFEIVKKEGLEISQPALDPDSSEIH  161 (294)
T ss_pred             hhhcccccceEEEEeccceEEeehhhhcChhhhccceeEEecCCccCcCcCCHHHHHHHHHHhCCcccCcccCCCCceee
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999998999


Q ss_pred             ecccccccCcccceeeecccCCCCCCCCCCCCCccceEEeecccccHHHHHHHhhhhcCCCcccchhhhhhhhhhcCCCC
Q 017407          230 HPITARRRNSKAHRRMYKYKGSGRCDDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWGLDIQLGYCAQGDRT  309 (372)
Q Consensus       230 h~iT~R~~~~~vHrr~~~~~~~~~C~~~~~~ppcTgFVEiMAPVFSR~AwrcvwhmiqNDLvhGWGLDf~w~~caqg~~~  309 (372)
                      |+||+|++++++||.   .++.+.|.+++++||||||||||||||||+|||||||||||||+|||||||+|+||+ ++++
T Consensus       162 ~~iT~R~~~~~vhr~---~~~~~~~~~~~~~ppct~fVEiMAPVFSr~Awrcvw~miqNDLvhGWGLDf~~~~c~-~~~~  237 (294)
T PF05212_consen  162 HPITKRRPDSEVHRK---TRGGPRCCDDSTGPPCTGFVEIMAPVFSRAAWRCVWHMIQNDLVHGWGLDFKWGYCA-GDRH  237 (294)
T ss_pred             eeEEeecCCceeEec---cCCCCCcCCCCCCCCcceEEEEecceechHHHHHHHhcccCCCccccchhhhHHHHh-cccc
Confidence            999999999999984   577888889999999999999999999999999999999999999999999999999 6899


Q ss_pred             CcEEEEeeeeEEEeccccCCCCCCCccc-----ccCCCchhHHhhhC
Q 017407          310 KNVGVVDSEYIVHLGLPTLGVTTEPELN-----TVGQASDDLEQIAN  351 (372)
Q Consensus       310 ~kiGVVDa~~VvH~g~Ptlg~~~~~~~~-----~~~~~~~~~~~~~~  351 (372)
                      +||||||||||+|+|+|||||++.+++.     .++.++++|+++|+
T Consensus       238 ~kiGVVDs~~VvH~gvptLG~~~~~~~~~~~~~~Vr~r~~~E~~~F~  284 (294)
T PF05212_consen  238 KKIGVVDSQYVVHTGVPTLGGQGNSEKGKDPREEVRRRSFAEMRIFQ  284 (294)
T ss_pred             ccEEEEeeEEEEEcCCCcCCCccccccCCchHHHHHHHHHHHHHHHH
Confidence            9999999999999999999999876443     68889999999985



>cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function Back     alignment and domain information
>cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function Back     alignment and domain information
>TIGR01556 rhamnosyltran L-rhamnosyltransferase Back     alignment and domain information
>cd02510 pp-GalNAc-T pp-GalNAc-T initiates the formation of mucin-type O-linked glycans Back     alignment and domain information
>cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase Back     alignment and domain information
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan Back     alignment and domain information
>cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis Back     alignment and domain information
>PF13641 Glyco_tranf_2_3: Glycosyltransferase like family 2; PDB: 4FIY_B 4FIX_A Back     alignment and domain information
>cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1 Back     alignment and domain information
>cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query372
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-05
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 3e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 44.8 bits (105), Expect = 4e-05
 Identities = 40/284 (14%), Positives = 77/284 (27%), Gaps = 87/284 (30%)

Query: 12  SDPPSRSCLCSLFIAAALICSVYFIGSSFVAKENKERLMRWGLVHSMYSAKPETCKNQQC 71
            D P      +         S+  I  S   ++       W   H          ++   
Sbjct: 316 QDLPREVLTTNP-----RRLSI--IAESI--RDGLATWDNW--KHVNCDKLTTIIESSLN 364

Query: 72  RLPGTEA---------LPEG--IVSKTSNLEMRPLWSSPSK--------------LNNQR 106
            L   E           P    I +    L +  +W    K              L  ++
Sbjct: 365 VLEPAEYRKMFDRLSVFPPSAHIPTIL--LSL--IWFDVIKSDVMVVVNKLHKYSLVEKQ 420

Query: 107 PPMNLLAI----------AAGIKQ--KKIVDQ--IVRKFPSKDFVVM-----LFHYDG-- 145
           P  + ++I                  + IVD   I + F S D +        + + G  
Sbjct: 421 PKESTISIPSIYLELKVKLENEYALHRSIVDHYNIPKTFDSDDLIPPYLDQYFYSHIGHH 480

Query: 146 -----VVDEWK-------DLVWADRAI-HVSAANQTKWWFAKRFLHPDI--VAEY-NYIF 189
                  +          D  + ++ I H S A     W A   +   +  +  Y  YI 
Sbjct: 481 LKNIEHPERMTLFRMVFLDFRFLEQKIRHDSTA-----WNASGSILNTLQQLKFYKPYIC 535

Query: 190 LWDED-----IGVENFNPRRYLSIVKDEGLEISQPALDPVKSEV 228
             D         + +F P+   +++  +  ++ + AL      +
Sbjct: 536 DNDPKYERLVNAILDFLPKIEENLICSKYTDLLRIALMAEDEAI 579


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query372
1qg8_A255 Protein (spore coat polysaccharide biosynthesis P 87.28
1xhb_A 472 Polypeptide N-acetylgalactosaminyltransferase 1; g 82.19
>1qg8_A Protein (spore coat polysaccharide biosynthesis P SPSA); glycosyltransferase, transferase; 1.50A {Bacillus subtilis} SCOP: c.68.1.1 PDB: 1h7q_A* 1h7l_A 1qgq_A* 1qgs_A* Back     alignment and structure
Probab=87.28  E-value=1.7  Score=37.28  Aligned_cols=120  Identities=8%  Similarity=-0.068  Sum_probs=68.6

Q ss_pred             ccccEEEEeccccccCCCCHHHHHHHHHHh-CCcccCCCC-----CCCCCceeecccccccCcccceeeecccCCCCCCC
Q 017407          183 AEYNYIFLWDEDIGVENFNPRRYLSIVKDE-GLEISQPAL-----DPVKSEVHHPITARRRNSKAHRRMYKYKGSGRCDD  256 (372)
Q Consensus       183 ~~YDYIFLwDDDL~vd~f~i~ry~~Ivr~~-gLeISQPAL-----d~~s~~i~h~iT~R~~~~~vHrr~~~~~~~~~C~~  256 (372)
                      +..|||++.|+|..++...++++++.++++ +..+..+..     +.+. ...+.. .+ +...+.          .   
T Consensus        87 a~g~~i~~lD~Dd~~~~~~l~~~~~~~~~~~~~~~v~~~~~~~~~~~~~-~~~~~~-~~-~~~~~~----------~---  150 (255)
T 1qg8_A           87 AEGEYITYATDDNIYMPDRLLKMVRELDTHPEKAVIYSASKTYHLNENR-DIVKET-VR-PAAQVT----------W---  150 (255)
T ss_dssp             CCCSEEEEEETTEEECTTHHHHHHHHHHHCTTCCEEEEEEEEEEEC----CEEEEE-EE-CCCSCB----------S---
T ss_pred             cCCCEEEEeCCCCccChHHHHHHHHHHHhCCCceEEEeceEEEEeCCCC-cchhhc-cC-chHHHH----------H---
Confidence            688999999999999988899999999876 666665543     2221 111110 00 000000          0   


Q ss_pred             CCCCCCccceEEeecccccHHHHHHHh-hh--hcC-CCcccchhhhhhhhhhcCCCCCcEEEEeeeeEEEecc
Q 017407          257 YSTAPPCIGWVEMMAPVFSRAAWRCAW-YM--IQN-DLIHAWGLDIQLGYCAQGDRTKNVGVVDSEYIVHLGL  325 (372)
Q Consensus       257 ~~~~ppcTgFVEiMAPVFSR~Awrcvw-hm--iqN-DLvhGWGLDf~w~~caqg~~~~kiGVVDa~~VvH~g~  325 (372)
                           ....++-..+=+|+|++++.+- .+  ..+ +.....+-|+.+...+.. .+ ++..++...+.|+..
T Consensus       151 -----~~~~~~~~~~~~~rr~~~~~~g~~f~~~~~~~~~~~~~eD~~~~~r~~~-~g-~~~~~~~~~~~~r~~  216 (255)
T 1qg8_A          151 -----NAPCAIDHCSVMHRYSVLEKVKEKFGSYWDESPAFYRIGDARFFWRVNH-FY-PFYPLDEELDLNYIT  216 (255)
T ss_dssp             -----CCTTTSCGGGEEEETHHHHHHHHHHSSSSCCCGGGSTTHHHHHHHHHTT-TC-CBEEEEEEEEEEEEC
T ss_pred             -----hhcCCCccceEEEEHHHHHhhcccccccccCCcccchHHHHHHHHHHHH-hC-CEEEecCcEEEEEEc
Confidence                 0111222234578999998763 21  011 222334567776544432 23 699999998888843



>1xhb_A Polypeptide N-acetylgalactosaminyltransferase 1; glycosyltransferase-A (GT-A); HET: NAG BMA; 2.50A {Mus musculus} SCOP: b.42.2.1 c.68.1.17 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query372
d1xhba2328 Polypeptide N-acetylgalactosaminyltransferase 1, N 83.44
>d1xhba2 c.68.1.17 (A:95-422) Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Nucleotide-diphospho-sugar transferases
superfamily: Nucleotide-diphospho-sugar transferases
family: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain
domain: Polypeptide N-acetylgalactosaminyltransferase 1, N-terminal domain
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=83.44  E-value=0.68  Score=39.77  Aligned_cols=143  Identities=14%  Similarity=0.026  Sum_probs=74.9

Q ss_pred             hccccEEEEeccccccCCCCHHHHHHHHHHhCCcccCCCCCC---CCCcee--ecccccccCcccce--eeecccCCCCC
Q 017407          182 VAEYNYIFLWDEDIGVENFNPRRYLSIVKDEGLEISQPALDP---VKSEVH--HPITARRRNSKAHR--RMYKYKGSGRC  254 (372)
Q Consensus       182 v~~YDYIFLwDDDL~vd~f~i~ry~~Ivr~~gLeISQPALd~---~s~~i~--h~iT~R~~~~~vHr--r~~~~~~~~~C  254 (372)
                      .+.-|||++.|+|..++.-.++++++.+++..-.+..|.++.   +.....  ...+...-....+.  ..+........
T Consensus       105 ~a~gd~i~flD~D~~~~p~~l~~l~~~~~~~~~~~v~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  184 (328)
T d1xhba2         105 VSRGQVITFLDAHCECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRR  184 (328)
T ss_dssp             HCCSSEEEEEESSEEECTTCHHHHHHHHHHCTTEEEEEEEEEECSSSCCEECCCTTEEEEECTTCCEEEEECCHHHHHHT
T ss_pred             hhhcceeeecCcccccChhHHHHHHHHHhcCCCeEEecceeeeccCceeeccCCccccccccccccccccccchhhhhcc
Confidence            478899999999999999999999999998887777776432   110000  00000000000000  00000000000


Q ss_pred             CCCCCCCCccceEEeecccccHHHHHHHhhhhcCCCcccch-hh--hhhhhhhcCCCCCcEEEEeeeeEEEeccccCC
Q 017407          255 DDYSTAPPCIGWVEMMAPVFSRAAWRCAWYMIQNDLIHAWG-LD--IQLGYCAQGDRTKNVGVVDSEYIVHLGLPTLG  329 (372)
Q Consensus       255 ~~~~~~ppcTgFVEiMAPVFSR~AwrcvwhmiqNDLvhGWG-LD--f~w~~caqg~~~~kiGVVDa~~VvH~g~Ptlg  329 (372)
                      ....+.|..+.++..-+=.++|++|..+= . +++....|| =|  +-+.-..   .+.+|..+-...|.|..+-..+
T Consensus       185 ~~~~~~~~~~~~~~g~~~~irr~~f~~vG-g-fDe~~~~~g~ED~Dl~~R~~~---~G~~i~~~p~~~v~H~~~~~~~  257 (328)
T d1xhba2         185 KGDRTLPVRTPTMAGGLFSIDRDYFQEIG-T-YDAGMDIWGGENLEISFRIWQ---CGGTLEIVTCSHVGHVFRKATP  257 (328)
T ss_dssp             TTCTTSCEECSBCCSSSEEEEHHHHHHTT-S-CCTTSCTTCCCCSHHHHHHHH---TTCEEEEEEEEEEEEEC-----
T ss_pred             ccccccccccceecceeeeeeHHHHHHhC-C-CCCCCcCcCchHHHHHHHHHH---hCCeEEEeCCeEEEEeCCCCCC
Confidence            11112222233333323347899999873 2 456556675 34  4444333   3568999999999998554443