Citrus Sinensis ID: 039138


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------
MKDDESLPTTTTLNVTKKESSDSSAFGKGRYKFWALAAILLLAFWSMFTGTVTLRWSAGNLNRLSDDLGSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAYEELSSDVAEVRDAAITEIAKMSVRSLVNIDSPPFHSTSAHQFSKSLKLAERGRALLTSSGTG
cccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHccEEEEEEccccccccccccccccccccHHHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHcccccccccc
cccccccccccccccccccccccHHccccHHHHHHHHHHHHHHHHHHHHccEEEEEccccHHcccccccccccccccHHHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHccccEEEccccc
mkddeslpttttlnvtkkessdssafgkgRYKFWALAAILLLAFWSMFTGTVTLRWSAgnlnrlsddlgspihddldvLEMEEREKVVKHMWDVytnsrrirlPRFWQEAFEAAYEELSSDVAEVRDAAITEIAKMSVRslvnidsppfhstsahqFSKSLKLAERGRALLTSSGTG
mkddeslpttttlnvtkkessdssafgkGRYKFWALAAILLLAFWSMFTGTVTLRWSAGNLNrlsddlgspihDDLDVLEMEEREKVVKHMwdvytnsrrirlPRFWQEAFEAAYEELSSDVAEVRDAAITEIAKMSVRSLVNIDSPPFHSTSAHQFSKSLKLAERGRAlltssgtg
MKDDESLPTTTTLNVTKKESSDSSAFGKGRYKFWalaailllaFWSMFTGTVTLRWSAGNLNRLSDDLGSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAYEELSSDVAEVRDAAITEIAKMSVRSLVNIDSPPFHSTSAHQFSKSLKLAERGRALLTSSGTG
*************************FGKGRYKFWALAAILLLAFWSMFTGTVTLRWSAGNLNRLSDDLGSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAYEELSSDVAEVRDAAITEIAKMSVRSLVN**********************************
**************************GKGRYKFWALAAILLLAFWSMFTGTVTLRWSAGNLNRLSDDLGSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAYEELSSDVAEVRDAAITEIAKMSVRSLVN**********************************
**********TTLN***********FGKGRYKFWALAAILLLAFWSMFTGTVTLRWSAGNLNRLSDDLGSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAYEELSSDVAEVRDAAITEIAKMSVRSLVNIDSPPFHSTSAHQFSKSLKLAERGRALLTSSGTG
*************************FGKGRYKFWALAAILLLAFWSMFTGTVTLRWSAGNLNRLSDDLGSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAYEELSSDVAEVRDAAITEIAKMSVRSLVNIDSPPFHST*************RG**********
oooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKDDESLPTTTTLNVTKKESSDSSAFGKGRYKFWALAAILLLAFWSMFTGTVTLRWSAGNLNRLSDDLGSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAYEELSSDVAEVRDAAITEIAKMSVRSLVNIDSPPFHSTSAHQFSKSLKLAERGRALLTSSGTG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query177
255561142 382 sugar transporter, putative [Ricinus com 0.983 0.455 0.752 2e-65
356496751 331 PREDICTED: putative polyol transporter 2 0.920 0.492 0.785 6e-65
357483369171 hypothetical protein MTR_5g019900 [Medic 0.949 0.982 0.789 1e-64
356540751 373 PREDICTED: probable polyol transporter 6 0.943 0.447 0.771 2e-64
255646709168 unknown [Glycine max] 0.926 0.976 0.779 1e-63
388509002176 unknown [Lotus japonicus] 0.937 0.943 0.775 2e-62
118483265171 unknown [Populus trichocarpa] 0.960 0.994 0.728 2e-61
224094739162 predicted protein [Populus trichocarpa] 0.915 1.0 0.768 3e-61
224134354171 predicted protein [Populus trichocarpa] 0.966 1.0 0.739 6e-61
297735178178 unnamed protein product [Vitis vinifera] 0.994 0.988 0.700 4e-60
>gi|255561142|ref|XP_002521583.1| sugar transporter, putative [Ricinus communis] gi|223539261|gb|EEF40854.1| sugar transporter, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  253 bits (647), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 137/182 (75%), Positives = 153/182 (84%), Gaps = 8/182 (4%)

Query: 3   DDESLPTTTTLNVT-------KKESSDSSAFGKGRYKFWALAAILLLAFWSMFTGTVTLR 55
           DD+SLPT+T    T       KKESSDSS FGKGRYKFWALAAILLLAFWSMFTGTVTLR
Sbjct: 201 DDDSLPTSTIAATTTTSATNSKKESSDSSVFGKGRYKFWALAAILLLAFWSMFTGTVTLR 260

Query: 56  WSAGNLNRLSDDLGSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAY 115
           WSAGNLNRLSDD+ +PIHDDLDVLEMEERE+VVKHMWDVYTNS RIRLPRFWQEAFEAAY
Sbjct: 261 WSAGNLNRLSDDIDAPIHDDLDVLEMEERERVVKHMWDVYTNSHRIRLPRFWQEAFEAAY 320

Query: 116 EELSSDVAEVRDAAITEIAKMSVRSLVNIDSPPFHSTSAHQFSKSLKLAERGRALLTSSG 175
           E+L+SDV  ++DAAI+EIAKMSVRS++ +D PP  STSA + SKSLKL  +  A  TS G
Sbjct: 321 EDLTSDVPGIKDAAISEIAKMSVRSII-LDPPPVQSTSARELSKSLKLVGKDGAARTSKG 379

Query: 176 TG 177
           +G
Sbjct: 380 SG 381




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356496751|ref|XP_003517229.1| PREDICTED: putative polyol transporter 2-like [Glycine max] Back     alignment and taxonomy information
>gi|357483369|ref|XP_003611971.1| hypothetical protein MTR_5g019900 [Medicago truncatula] gi|358344391|ref|XP_003636273.1| hypothetical protein MTR_036s0057 [Medicago truncatula] gi|355502208|gb|AES83411.1| hypothetical protein MTR_036s0057 [Medicago truncatula] gi|355513306|gb|AES94929.1| hypothetical protein MTR_5g019900 [Medicago truncatula] gi|388512483|gb|AFK44303.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|356540751|ref|XP_003538848.1| PREDICTED: probable polyol transporter 6-like [Glycine max] Back     alignment and taxonomy information
>gi|255646709|gb|ACU23828.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|388509002|gb|AFK42567.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|118483265|gb|ABK93535.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224094739|ref|XP_002310215.1| predicted protein [Populus trichocarpa] gi|222853118|gb|EEE90665.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224134354|ref|XP_002327817.1| predicted protein [Populus trichocarpa] gi|222836902|gb|EEE75295.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297735178|emb|CBI17540.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query177
TAIR|locus:2155735185 AT5G65650 "AT5G65650" [Arabido 0.949 0.908 0.660 3.6e-57
TAIR|locus:2115225179 AT4G36660 "AT4G36660" [Arabido 0.909 0.899 0.670 1.6e-54
TAIR|locus:2016372147 AT1G19380 "AT1G19380" [Arabido 0.779 0.938 0.469 1.2e-24
TAIR|locus:2155735 AT5G65650 "AT5G65650" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 588 (212.0 bits), Expect = 3.6e-57, P = 3.6e-57
 Identities = 115/174 (66%), Positives = 136/174 (78%)

Query:     1 MKDDESLPTTTT----LNVT-KKESSDSSAFGKGRYKFWXXXXXXXXXFWSMFTGTVTLR 55
             MKD +SLP +T+      VT KKE+  S+ F KGRYKFW         FWSM TGTV LR
Sbjct:     1 MKDGDSLPISTSSVAATTVTGKKETGYSALFSKGRYKFWALAAILLLAFWSMLTGTVNLR 60

Query:    56 WSAGNLNRLSDDLGSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAY 115
             WSAGN+N  +DDL  PIH+DLDVLEMEEREKVVKHMWDVY N RRIRLPRFWQEAFEAAY
Sbjct:    61 WSAGNINHFTDDLVFPIHEDLDVLEMEEREKVVKHMWDVYNNGRRIRLPRFWQEAFEAAY 120

Query:   116 EELSSDVAEVRDAAITEIAKMSVRSLVNIDSPPFHSTSAHQFSKSLKLAERGRA 169
             EEL+SDV +V +AAI+EIA+MS+RS+V ID PP HST+  + +K+LKLA++GRA
Sbjct:   121 EELTSDVPDVVEAAISEIARMSIRSIV-IDPPPLHSTNVRELTKTLKLADKGRA 173




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2115225 AT4G36660 "AT4G36660" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2016372 AT1G19380 "AT1G19380" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query177
pfam06708156 pfam06708, DUF1195, Protein of unknown function (D 8e-86
>gnl|CDD|191590 pfam06708, DUF1195, Protein of unknown function (DUF1195) Back     alignment and domain information
 Score =  249 bits (636), Expect = 8e-86
 Identities = 114/157 (72%), Positives = 133/157 (84%), Gaps = 6/157 (3%)

Query: 1   MKDDESLPTTTTLNVT-----KKESSDSSAFGKGRYKFWALAAILLLAFWSMFTGTVTLR 55
           MKDD+S  +T +         KKE++ S+ FGKGRYKFWALAAILLLAFWSMFTG+VTLR
Sbjct: 1   MKDDDSPISTPSPATKITMNSKKETAGSALFGKGRYKFWALAAILLLAFWSMFTGSVTLR 60

Query: 56  WSAGNLNRLSDDLGSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAY 115
           WSAGNLNR SDDL  P+HDDLDVLEMEEREK+V+HMWDVYT S R+RLPRFWQEAFEAAY
Sbjct: 61  WSAGNLNRASDDLDFPLHDDLDVLEMEEREKLVRHMWDVYTRSGRVRLPRFWQEAFEAAY 120

Query: 116 EELSSDVAEVRDAAITEIAKMSVRSLVNIDSPPFHST 152
           EEL+SDV +VR+AA++EIAKMSVRS+  ++ PP HST
Sbjct: 121 EELASDVPQVREAAVSEIAKMSVRSIE-LEPPPLHST 156


This family consists of several plant specific hypothetical proteins of around 160 residues in length. The function of this family is unknown. Length = 156

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 177
PF06708157 DUF1195: Protein of unknown function (DUF1195); In 100.0
TIGR0257463 stabl_TIGR02574 putative addiction module componen 83.07
>PF06708 DUF1195: Protein of unknown function (DUF1195); InterPro: IPR010608 This family consists of several plant specific hypothetical proteins of around 160 residues in length Back     alignment and domain information
Probab=100.00  E-value=1.5e-93  Score=574.14  Aligned_cols=151  Identities=81%  Similarity=1.275  Sum_probs=148.1

Q ss_pred             CCCCCCCCccccc------ccccccCCCccccCCchhHHHHHHHHHHHHHHhhhhceEEEEeccCCcccccCCCCCCCCC
Q 039138            1 MKDDESLPTTTTL------NVTKKESSDSSAFGKGRYKFWALAAILLLAFWSMFTGTVTLRWSAGNLNRLSDDLGSPIHD   74 (177)
Q Consensus         1 Mkddd~lp~st~t------~~~kkes~~~~l~GkgrYKfWaLaAIlLLAfWSMftgsVtLrwSag~Ln~~s~dld~p~~d   74 (177)
                      |||+|++|++|++      .++|+++++++|||||||||||||||+|||||||||||||||||+||||++++|+|+|++|
T Consensus         1 mk~~~~~~~~t~~~at~~~~~~~~~~~~~~l~gKgrYK~WaLaAIlLLAfWSM~tgsvtLrwS~g~l~~~~~dl~~~~~d   80 (157)
T PF06708_consen    1 MKDDDSLPTSTPTPATTTTASSKKESSESALFGKGRYKFWALAAILLLAFWSMFTGSVTLRWSAGNLNSVSDDLDFPIHD   80 (157)
T ss_pred             CCccccCCcCCCCCcccceeeecccccccccccCchhHHHHHHHHHHHHHHHHhheeeEEEeccCcccccccccCCcccc
Confidence            9999999988885      4889999999999999999999999999999999999999999999999999999999999


Q ss_pred             CCchhhHHHHHHHHhHhhhhhcCCCCcCcchHHHHHHHHHHHHhhcchhhhHHHHHHHHHhhhhhccccCCCCCCCcc
Q 039138           75 DLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAYEELSSDVAEVRDAAITEIAKMSVRSLVNIDSPPFHST  152 (177)
Q Consensus        75 DlDvLEmEeREKvVr~MWDvYt~s~~vrLprFWqEAFeAAYeeL~sD~~~vrdaAisEIAkmS~r~li~~dp~p~~S~  152 (177)
                      |||||||||||||||||||||||++++|||||||||||||||+|+||+|+|||||||||||||+|+ |++||||.||+
T Consensus        81 DlDvLEmEeREKvVr~MWDvYt~s~~vrLPrFWqEAFeAAYe~L~sD~~~VrdaAisEIAkmS~r~-~~~~~~~~~st  157 (157)
T PF06708_consen   81 DLDVLEMEEREKVVRHMWDVYTRSRRVRLPRFWQEAFEAAYEELASDVPQVRDAAISEIAKMSVRS-IELDPPPNQST  157 (157)
T ss_pred             cccHHHHHHHHHHHHHHHHHhcCCCCccCchHHHHHHHHHHHHHhccCcchhHHHHHHHHHHhhcc-ccCCCCCCCCC
Confidence            999999999999999999999999999999999999999999999999999999999999999999 99999999986



The function of this family is unknown.

>TIGR02574 stabl_TIGR02574 putative addiction module component, TIGR02574 family Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query177
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 40.2 bits (93), Expect = 2e-04
 Identities = 26/202 (12%), Positives = 46/202 (22%), Gaps = 57/202 (28%)

Query: 12  TLNVTKKESSDSSAFGKGRYKFWALAAILLLAFWSMFTGTVTLRWSAGNLNRLSDDL--- 68
            L +T +    +               I L       T           L+    DL   
Sbjct: 268 IL-LTTRFKQVTDFLSAATTT-----HISLDHHSMTLTPDEVKSLLLKYLDCRPQDLPRE 321

Query: 69  ---GSPIHDDLDVLEMEEREKVVKHMWDVYTNSRRIRLPRFWQEAFEAAYEELSSDVAEV 125
               +P    +    + +        WD   N + +   +      E++   L     E 
Sbjct: 322 VLTTNPRRLSIIAESIRDG----LATWD---NWKHVNCDKL-TTIIESSLNVLEPA--EY 371

Query: 126 R-----------DAAITE------------------IAKMSVRSLVNIDSPPFHSTSAH- 155
           R            A I                    + K+   SLV        + S   
Sbjct: 372 RKMFDRLSVFPPSAHIPTILLSLIWFDVIKSDVMVVVNKLHKYSLVEKQPKES-TISIPS 430

Query: 156 -QFSKSLKLAERG---RALLTS 173
                 +KL       R+++  
Sbjct: 431 IYLELKVKLENEYALHRSIVDH 452


Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00