Citrus Sinensis ID: 046004


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------
MGNCLVLEEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSLPEIRHLRPDFKLVGGNLYFLVPVPLPSQKVQKKKVRFSDEEAGAGAKERGGVVRIKLVISKQELEELLQKQGVSVKDMVSRIQSKQSADDFQSGDNTKAWKPELESIPEID
ccccccccccEEEEEEcccEEEEEcccEEHHHHHHHccccEEccccccccccccccccccccEEEEEEcccccHHHHHHHHHccHHHHcccccccccEEEEEEEccHHHHHHHHHHccccHHHHHHHHHHHcccccccccccccccccccccccccc
cccEEEEEcEEEEEEEccccEEEEccccEHHHHHHHccccEEEccccccccccccHHcccccEEEEEccccccccccEEEEEEccccccccccccccEEEEEEEEcHHHHHHHHHcccccHHHHHHHHHHHHccccccccccccccccccccccccc
MGNCLVLEEKVIKVMKTdgkileynqpirVQDVLAEfsghaisdslpeirhlrpdfklvggnlyflvpvplpsqkvqkkkvrfsdeeagagakergGVVRIKLVISKQELEELLQKQGVSVKDMVSRIQskqsaddfqsgdntkawkpelesipeid
MGNCLVLEEKVIKVMktdgkileynQPIRVQDVLAEFSGHAISDSLPEIRHLRPDFKLVGGNLYFLvpvplpsqkvqkkkvrfsdeeagagakerggvvRIKLVISKQELEELLQKQGVSVKDMVSRIQSkqsaddfqsgdntkawkpelesipeid
MGNCLVLEEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSLPEIRHLRPDFKLVGGNLYFLVPVPLPSQKVQKKKVRFSDeeagagakerggVVRIKLVISKQELEELLQKQGVSVKDMVSRIQSKQSADDFQSGDNTKAWKPELESIPEID
***CLVLEEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSLPEIRHLRPDFKLVGGNLYFLVPVPL*************************GVVRIKLVISKQ*************************************************
MGNC***EEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSLPE*RH*RP*FKLVGGNLYFLVPVPL*******************************************************************************ESI****
MGNCLVLEEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSLPEIRHLRPDFKLVGGNLYFLVPVPLPS********************ERGGVVRIKLVISKQELEELLQKQG************************TKAWKPELESIPEID
*GNCLVLEEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSLPEIRHLRPDFKLVGGNLYFLVPVPLPSQKVQKKKVRFSDEEAGAG**ERGGVVRIKLVISKQELEELLQKQGVSVKDMVSRIQSKQS************************
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGNCLVLEEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSLPEIRHLRPDFKLVGGNLYFLVPVPLPSQKVQKKKVRFSDEEAGAGAKERGGVVRIKLVISKQELEELLQKQGVSVKDMVSRIQSKQSADDFQSGDNTKAWKPELESIPEID
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query157 2.2.26 [Sep-21-2011]
Q6NLC8225 Uncharacterized protein A no no 0.598 0.417 0.314 0.0001
>sp|Q6NLC8|Y1648_ARATH Uncharacterized protein At1g66480 OS=Arabidopsis thaliana GN=At1g66480 PE=1 SV=1 Back     alignment and function desciption
 Score = 45.8 bits (107), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 54/108 (50%), Gaps = 14/108 (12%)

Query: 1   MGNCLVLEEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSLPEIRH-------LR 53
           MGN + ++ K  KVMK DG+      P+  ++V A++ G+ + DS   ++H       L 
Sbjct: 1   MGNSITVKRKRAKVMKIDGETFRIKTPVTAREVTADYPGYVLLDSQ-AVKHFGVRSKPLE 59

Query: 54  PDFKLVGGNLYFLVPVP-LPSQKV---QKKKVRFSDEEAG--AGAKER 95
           P+  L     YFLV +P LP +      + K+ +    +G   GAKER
Sbjct: 60  PNQTLKPKKTYFLVELPKLPPETTAVDTENKLPYRRVMSGIHVGAKER 107





Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query157
255575713157 conserved hypothetical protein [Ricinus 0.980 0.980 0.631 5e-45
225440876160 PREDICTED: uncharacterized protein LOC10 0.987 0.968 0.561 3e-44
224088567164 predicted protein [Populus trichocarpa] 0.987 0.945 0.614 3e-43
224140107159 predicted protein [Populus trichocarpa] 0.980 0.968 0.566 5e-40
147794499193 hypothetical protein VITISV_021816 [Viti 0.885 0.720 0.574 3e-38
359807179156 uncharacterized protein LOC100780959 [Gl 0.974 0.980 0.525 9e-38
356534211162 PREDICTED: uncharacterized protein LOC10 0.987 0.956 0.518 1e-37
351724183155 uncharacterized protein LOC100500042 [Gl 0.968 0.980 0.518 2e-36
356548433161 PREDICTED: uncharacterized protein LOC10 0.987 0.962 0.509 2e-35
388518935164 unknown [Lotus japonicus] 0.987 0.945 0.5 3e-32
>gi|255575713|ref|XP_002528756.1| conserved hypothetical protein [Ricinus communis] gi|223531850|gb|EEF33668.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  185 bits (470), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 101/160 (63%), Positives = 121/160 (75%), Gaps = 6/160 (3%)

Query: 1   MGNCLVLEEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSLPEIRHLRPDFKLVG 60
           MGNC+VL+EKV+KVMK DGKILEY  P+RVQ VL+EFSGHAISDSL + +HL PD KL G
Sbjct: 1   MGNCIVLQEKVVKVMKPDGKILEYRAPVRVQQVLSEFSGHAISDSLQDFQHLLPDTKLHG 60

Query: 61  GNLYFLVPVPLPSQKVQKKKVRFSDEEAGAGAKERGGVVRIKLVISKQELEELLQKQGVS 120
           G+LY+LV +PLPS +V KKKVRFS  E     K+  G VRIKLVISKQEL+E+L+K GVS
Sbjct: 61  GSLYYLVALPLPSPEV-KKKVRFSIPE--EENKKETGAVRIKLVISKQELQEILRKGGVS 117

Query: 121 VKDMVSRIQSKQSADDFQSGDN---TKAWKPELESIPEID 157
           V  M+S++Q +Q      + DN    K WKP LESIPEID
Sbjct: 118 VDYMISQLQGQQRVHRVDTSDNDDCHKGWKPVLESIPEID 157




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|225440876|ref|XP_002282533.1| PREDICTED: uncharacterized protein LOC100254636 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224088567|ref|XP_002308476.1| predicted protein [Populus trichocarpa] gi|222854452|gb|EEE91999.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224140107|ref|XP_002323428.1| predicted protein [Populus trichocarpa] gi|222868058|gb|EEF05189.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|147794499|emb|CAN62766.1| hypothetical protein VITISV_021816 [Vitis vinifera] Back     alignment and taxonomy information
>gi|359807179|ref|NP_001241101.1| uncharacterized protein LOC100780959 [Glycine max] gi|255640840|gb|ACU20703.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356534211|ref|XP_003535651.1| PREDICTED: uncharacterized protein LOC100780937 [Glycine max] Back     alignment and taxonomy information
>gi|351724183|ref|NP_001236025.1| uncharacterized protein LOC100500042 [Glycine max] gi|255628759|gb|ACU14724.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356548433|ref|XP_003542606.1| PREDICTED: uncharacterized protein LOC100793955 [Glycine max] Back     alignment and taxonomy information
>gi|388518935|gb|AFK47529.1| unknown [Lotus japonicus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query157
TAIR|locus:2150695179 AT5G03890 [Arabidopsis thalian 0.802 0.703 0.421 1.9e-21
TAIR|locus:2100073173 AT3G10120 "AT3G10120" [Arabido 1.0 0.907 0.387 1e-20
TAIR|locus:2010931203 AT1G64700 "AT1G64700" [Arabido 0.433 0.334 0.4 3.3e-09
TAIR|locus:2089945119 AT3G21680 "AT3G21680" [Arabido 0.566 0.747 0.303 7.8e-08
TAIR|locus:2079572187 AT3G61920 "AT3G61920" [Arabido 0.834 0.700 0.263 2.9e-07
TAIR|locus:2038776215 At17.1 "AT2G01340" [Arabidopsi 0.477 0.348 0.348 4.1e-07
TAIR|locus:2170693161 AT5G62900 "AT5G62900" [Arabido 0.910 0.888 0.258 2.1e-06
TAIR|locus:2028942225 AT1G66480 "AT1G66480" [Arabido 0.464 0.324 0.329 8e-06
TAIR|locus:2156035214 AT5G37840 "AT5G37840" [Arabido 0.515 0.378 0.318 0.00027
TAIR|locus:2141682129 AT4G21920 "AT4G21920" [Arabido 0.356 0.434 0.353 0.00041
TAIR|locus:2150695 AT5G03890 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 251 (93.4 bits), Expect = 1.9e-21, P = 1.9e-21
 Identities = 64/152 (42%), Positives = 88/152 (57%)

Query:     1 MGNCLVLEEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSLPEIRHLRPDFKLVG 60
             MGNCLV+E+KVIK+++ DGK+LEY +PI V  +L +FSGH+IS +     HL PD KL+ 
Sbjct:     1 MGNCLVMEKKVIKIVRDDGKVLEYREPISVHHILTQFSGHSISHNNT---HLLPDAKLLS 57

Query:    61 GNLYFLVPVPLPSQKVQKKKVRF--------------------SDXXXXXXXXXXXXVVR 100
             G LY+L+P  +  +KV KK V F                    S+            VVR
Sbjct:    58 GRLYYLLPTTMTKKKVNKK-VTFANPEVEGDERLLREEEDSSESNSNIDGDDTKNVTVVR 116

Query:   101 IKLVISKQELEELLQKQGVSVKDMVSRIQSKQ 132
             +K+V+ KQELE+LLQ  G SV +M+ +   KQ
Sbjct:   117 MKIVVHKQELEKLLQ--GGSVHEMMYQTLEKQ 146


GO:0005634 "nucleus" evidence=ISM
TAIR|locus:2100073 AT3G10120 "AT3G10120" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2010931 AT1G64700 "AT1G64700" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2089945 AT3G21680 "AT3G21680" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2079572 AT3G61920 "AT3G61920" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2038776 At17.1 "AT2G01340" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2170693 AT5G62900 "AT5G62900" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2028942 AT1G66480 "AT1G66480" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2156035 AT5G37840 "AT5G37840" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2141682 AT4G21920 "AT4G21920" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query157
pfam14009181 pfam14009, DUF4228, Domain of unknown function (DU 2e-34
>gnl|CDD|222492 pfam14009, DUF4228, Domain of unknown function (DUF4228) Back     alignment and domain information
 Score =  118 bits (298), Expect = 2e-34
 Identities = 61/183 (33%), Positives = 88/183 (48%), Gaps = 30/183 (16%)

Query: 1   MGNCLVL------EEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDS-----LPEI 49
           MGNCL            +KV+  DGK+ EY++P+   +++ E+ GH + DS        I
Sbjct: 1   MGNCLSCCLAPSGAAATVKVVHPDGKVREYSRPVTAAELMLEYPGHFVCDSDSLYIGRRI 60

Query: 50  RHLRPDFKLVGGNLYFLVPVP-LPSQKVQK--KKVRFSDEEAGAGA-------------- 92
             L PD +L  G LYFL+P   L S    K    +  S   A   A              
Sbjct: 61  PALPPDDELERGQLYFLLPAERLQSVLAAKDMASLASSASSALKSASAKRSSSSRPSPVR 120

Query: 93  KERGGVVRIKLVISKQELEELLQKQGVSVKDMVSRIQSKQSADDFQSGDNTKAWKPELES 152
           K  GGVVR+K+V+SK+ELEELL  +  SV   +S +           G  +++W+P+LE+
Sbjct: 121 KSDGGVVRVKVVVSKEELEELL--EEGSVSASLSELCETPELRKRSRGSRSRSWRPKLET 178

Query: 153 IPE 155
           I E
Sbjct: 179 ISE 181


This domain is found in plants. The function is not known. Length = 181

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 157
PF14009181 DUF4228: Domain of unknown function (DUF4228) 100.0
>PF14009 DUF4228: Domain of unknown function (DUF4228) Back     alignment and domain information
Probab=100.00  E-value=4.1e-40  Score=253.05  Aligned_cols=153  Identities=37%  Similarity=0.643  Sum_probs=115.3

Q ss_pred             CCCcccc------CCceeEEEecCCcEEEEeCCCcHHHHHhhcCCcEEeCCC-----CCccCCCCCCccCCCCeEEEeeC
Q 046004            1 MGNCLVL------EEKVIKVMKTDGKILEYNQPIRVQDVLAEFSGHAISDSL-----PEIRHLRPDFKLVGGNLYFLVPV   69 (157)
Q Consensus         1 MGNC~~~------~~~~ikV~~~dG~v~e~~~pv~a~~vm~~~Pgh~v~~s~-----~~~~~L~~d~~L~~G~~YfLlP~   69 (157)
                      ||||++.      ..++||||++||+|++|+.||+|+|||.+|||||||++.     ..+++|+||++|++|++|||||.
T Consensus         1 MGn~~~~~~~~~~~~~~vkvv~~~G~v~~~~~pv~a~evm~~~P~h~v~~~~~~~~~~~~~~l~~d~~L~~G~~Y~llP~   80 (181)
T PF14009_consen    1 MGNCVSCCLASSSSAATVKVVHPDGKVEEFKRPVTAAEVMLENPGHFVCDSDSFRFGRRIKPLPPDEELQPGQIYFLLPM   80 (181)
T ss_pred             CCCcccccccccCCCceEEEEcCCCcEEEeCCCcCHHHHHHHCCCCEEeccccccCCCcccCCCccCeecCCCEEEEEEc
Confidence            9999985      689999999999999999999999999999999998763     46799999999999999999999


Q ss_pred             CCCCchhc---cccccccchhcc------------c--CCcCCCceEEEEEEeCHHHHHHHHHhccCCHHHHHHHHHhhc
Q 046004           70 PLPSQKVQ---KKKVRFSDEEAG------------A--GAKERGGVVRIKLVISKQELEELLQKQGVSVKDMVSRIQSKQ  132 (157)
Q Consensus        70 ~~~~~~~~---~~~vr~~~~~~~------------~--~~~~~~g~~rvkl~i~k~~L~~ll~~~~~s~e~~l~~l~~~~  132 (157)
                      +.......   .....+......            .  ....++|++++|++++++||++++++.  +.++++....+..
T Consensus        81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~rvki~isk~el~~~l~~~--s~~~~~~~~~~~~  158 (181)
T PF14009_consen   81 SRLQSVLSASDMASLASSASSASSSSSARKSSSRPFSRSRSSNGGVVRVKIVISKEELEELLSEG--SDEEMLSESCRRP  158 (181)
T ss_pred             cccCcccccchhcccccchhhccccccccccccccccccccccCcccccccccCHHHHHHHHhcc--ccchhhhhhhccc
Confidence            98654211   111111111100            0  123467889999999999999999866  5666666555432


Q ss_pred             cccccccCCCCCCccccCcCcCC
Q 046004          133 SADDFQSGDNTKAWKPELESIPE  155 (157)
Q Consensus       133 ~~~~~~~~~~~~~WrP~LeSIpE  155 (157)
                      .........+.++|||+||||||
T Consensus       159 ~~~~~~~~~~~~~WrP~LesI~E  181 (181)
T PF14009_consen  159 RRRSSRRGSRSRSWRPALESIPE  181 (181)
T ss_pred             cccccccCCCCCCccCCCCCcCc
Confidence            21112233466999999999998




Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00