Citrus Sinensis ID: 031133


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-----
MGNCLMLEEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSFPEIRHLMPDFKLLGGNLYFLVPVPLPSPKVEKKKVRFSEEEARDGAKETSSVVRIKVVISKQELQDLQMLQKGGVVSVQDVASWLQGKQSTNSQAVGFQDGGNNNEGWKPELESIPEVD
ccccccccccEEEEEEcccEEEEEEccccHHHHHHHccccEEEcccccccccccccccccccEEEEEcccccccHHHHHHHHHcHHHHHccccccccEEEEEEEccHHHHHHHHHHHccccccHHHHHHHHHHcccccccccccccccccccccccccccccccc
cccEEEEEcEEEEEEEccccEEEEccccEHHHHHHHccccEEEccccccccccccHHHccccEEEEEEcccccccccccEEEEccccccccccccccEEEEEEEEcHHHHHHHHHHccccEEEHHHHHHHHHHcccccccEcccccccccccccccccccccccc
MGNCLMLEEKVIKVMKTDgkileykapmKVQDVLAEFAghaisdsfpeirhlmpdfkllggnlyflvpvplpspkvekkkvrfseeeardgaketSSVVRIKVVISKQELQDLQMLQKGGVVSVQDVASWlqgkqstnsqavgfqdggnnnegwkpelesipevd
MGNCLMLEEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSFPEIRHLMPDFKLLGGNLYFLVPvplpspkvekkkvrfseeeardgaketssvvrIKVVISKQELQDLQMLQKGGVVSVQDVASWLQGKqstnsqavgfqdggnnnegwkpelesipevd
MGNCLMLEEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSFPEIRHLMPDFKLLGGNLYFLVPVPLPSPKVEKKKVRFSEEEARDGAKETssvvrikvviskQELQDLQMLQKGGVVSVQDVASWLQGKQSTNSQAVGFQDGGNNNEGWKPELESIPEVD
***CLMLEEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSFPEIRHLMPDFKLLGGNLYFLVPVPL**************************VVRIKVVISKQELQDLQMLQKGGVVSVQDVASWL**********************************
MGNC***EEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSF**********KLLGGNLYFLVPV*****************************************************************************************ESI****
MGNCLMLEEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSFPEIRHLMPDFKLLGGNLYFLVPVPLPSPK*********************SVVRIKVVISKQELQDLQMLQKGGVVSVQDVASWLQGKQSTNSQAVGFQDGGNNNEGWKPE********
*GNCLMLEEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSFPEIRHLMPDFKLLGGNLYFLVPVPLPSPKVEKKKVRFSEE*A*****ETSSVVRIKVVISKQELQDLQMLQKGGVVSVQDVASWLQGK*****QA*G**********WKPE********
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiihhhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MGNCLMLEEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSFPEIRHLMPDFKLLGGNLYFLVPVPLPSPKVEKKKVRFSEEEARDGAKETSSVVRIKVVISKQELQDLQMLQKGGVVSVQDVASWLQGKQSTNSQAVGFQDGGNNNEGWKPELESIPEVD
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query165 2.2.26 [Sep-21-2011]
Q6NLC8225 Uncharacterized protein A no no 0.418 0.306 0.337 0.0001
>sp|Q6NLC8|Y1648_ARATH Uncharacterized protein At1g66480 OS=Arabidopsis thaliana GN=At1g66480 PE=1 SV=1 Back     alignment and function desciption
 Score = 45.8 bits (107), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 8/77 (10%)

Query: 1  MGNCLMLEEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSFPEIRH-------LM 53
          MGN + ++ K  KVMK DG+    K P+  ++V A++ G+ + DS   ++H       L 
Sbjct: 1  MGNSITVKRKRAKVMKIDGETFRIKTPVTAREVTADYPGYVLLDS-QAVKHFGVRSKPLE 59

Query: 54 PDFKLLGGNLYFLVPVP 70
          P+  L     YFLV +P
Sbjct: 60 PNQTLKPKKTYFLVELP 76





Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query165
224088567164 predicted protein [Populus trichocarpa] 0.957 0.963 0.626 1e-46
255575713157 conserved hypothetical protein [Ricinus 0.939 0.987 0.610 2e-43
225440876160 PREDICTED: uncharacterized protein LOC10 0.969 1.0 0.557 2e-42
224140107159 predicted protein [Populus trichocarpa] 0.927 0.962 0.578 4e-39
356534211162 PREDICTED: uncharacterized protein LOC10 0.963 0.981 0.542 3e-38
351724183155 uncharacterized protein LOC100500042 [Gl 0.933 0.993 0.548 2e-37
359807179156 uncharacterized protein LOC100780959 [Gl 0.939 0.993 0.542 2e-36
356548433161 PREDICTED: uncharacterized protein LOC10 0.939 0.962 0.520 1e-35
147794499193 hypothetical protein VITISV_021816 [Viti 0.806 0.689 0.565 3e-35
388518935164 unknown [Lotus japonicus] 0.921 0.926 0.536 2e-34
>gi|224088567|ref|XP_002308476.1| predicted protein [Populus trichocarpa] gi|222854452|gb|EEE91999.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  191 bits (484), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 104/166 (62%), Positives = 130/166 (78%), Gaps = 8/166 (4%)

Query: 1   MGNCLMLEEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSFPEIRHLMPDFKLLG 60
           MGNCL+L+  VIKVMK DGKILEY+AP+KVQ VL++F+GHAI+DS    +HL PD  LLG
Sbjct: 1   MGNCLVLQANVIKVMKPDGKILEYQAPIKVQQVLSDFSGHAIADSLQAFQHLPPDTSLLG 60

Query: 61  G-NLYFLVPVPLPSPKVEKKKVRFS--EEEARDGAKETSSVVRIKVVISKQELQDLQMLQ 117
           G +LY+LVP+ LPSP+ +KKKVRFS  EEEA+D  ++TS+VVRIK+VISKQELQ+  ML+
Sbjct: 61  GDHLYYLVPLQLPSPQAKKKKVRFSIPEEEAKDVQEKTSTVVRIKLVISKQELQE--MLR 118

Query: 118 KGGVVSVQDVASWLQGKQSTNSQAVGFQDGGNNNEGWKPELESIPE 163
           KGG VSV D+ S LQG+Q    Q V      + ++GWKPELESIPE
Sbjct: 119 KGG-VSVDDMVSHLQGQQRV--QKVDISGSDSEHKGWKPELESIPE 161




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255575713|ref|XP_002528756.1| conserved hypothetical protein [Ricinus communis] gi|223531850|gb|EEF33668.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|225440876|ref|XP_002282533.1| PREDICTED: uncharacterized protein LOC100254636 [Vitis vinifera] Back     alignment and taxonomy information
>gi|224140107|ref|XP_002323428.1| predicted protein [Populus trichocarpa] gi|222868058|gb|EEF05189.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356534211|ref|XP_003535651.1| PREDICTED: uncharacterized protein LOC100780937 [Glycine max] Back     alignment and taxonomy information
>gi|351724183|ref|NP_001236025.1| uncharacterized protein LOC100500042 [Glycine max] gi|255628759|gb|ACU14724.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|359807179|ref|NP_001241101.1| uncharacterized protein LOC100780959 [Glycine max] gi|255640840|gb|ACU20703.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|356548433|ref|XP_003542606.1| PREDICTED: uncharacterized protein LOC100793955 [Glycine max] Back     alignment and taxonomy information
>gi|147794499|emb|CAN62766.1| hypothetical protein VITISV_021816 [Vitis vinifera] Back     alignment and taxonomy information
>gi|388518935|gb|AFK47529.1| unknown [Lotus japonicus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query165
TAIR|locus:2150695179 AT5G03890 [Arabidopsis thalian 0.963 0.888 0.352 4.3e-22
TAIR|locus:2100073173 AT3G10120 "AT3G10120" [Arabido 0.981 0.936 0.375 3e-21
TAIR|locus:2156035214 AT5G37840 "AT5G37840" [Arabido 0.563 0.434 0.336 6.2e-07
TAIR|locus:2089945119 AT3G21680 "AT3G21680" [Arabido 0.581 0.806 0.3 6.5e-07
TAIR|locus:2028942225 AT1G66480 "AT1G66480" [Arabido 0.430 0.315 0.35 5.6e-05
TAIR|locus:2038776215 At17.1 "AT2G01340" [Arabidopsi 0.515 0.395 0.336 8.3e-05
TAIR|locus:2079572187 AT3G61920 "AT3G61920" [Arabido 0.369 0.326 0.333 0.00012
TAIR|locus:2010931203 AT1G64700 "AT1G64700" [Arabido 0.412 0.334 0.373 0.00022
TAIR|locus:2202690173 AT1G60010 "AT1G60010" [Arabido 0.284 0.271 0.319 0.00026
TAIR|locus:2150695 AT5G03890 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 257 (95.5 bits), Expect = 4.3e-22, P = 4.3e-22
 Identities = 62/176 (35%), Positives = 95/176 (53%)

Query:     1 MGNCLMLEEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSFPEIRHLMPDFKLLG 60
             MGNCL++E+KVIK+++ DGK+LEY+ P+ V  +L +F+GH+IS +     HL+PD KLL 
Sbjct:     1 MGNCLVMEKKVIKIVRDDGKVLEYREPISVHHILTQFSGHSISHNNT---HLLPDAKLLS 57

Query:    61 GNLYFLVPVPLPSPKVEKKKVRFSEEEARDGAK---ETXXXXXXXXXXXXQELQDLQMLQ 117
             G LY+L+P  +   KV KK V F+  E     +   E              + +++ +++
Sbjct:    58 GRLYYLLPTTMTKKKVNKK-VTFANPEVEGDERLLREEEDSSESNSNIDGDDTKNVTVVR 116

Query:   118 KGGVVSVQDVASWLQGK-------QSTNSQAV---GFQDGGNNNEGWKPELESIPE 163
                VV  Q++   LQG        Q+   Q +      D    N GW+P L+SIPE
Sbjct:   117 MKIVVHKQELEKLLQGGSVHEMMYQTLEKQLLLTSSDDDDLECNSGWRPALDSIPE 172




GO:0005634 "nucleus" evidence=ISM
TAIR|locus:2100073 AT3G10120 "AT3G10120" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2156035 AT5G37840 "AT5G37840" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2089945 AT3G21680 "AT3G21680" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2028942 AT1G66480 "AT1G66480" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2038776 At17.1 "AT2G01340" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2079572 AT3G61920 "AT3G61920" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2010931 AT1G64700 "AT1G64700" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2202690 AT1G60010 "AT1G60010" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
grail3.0013005301
hypothetical protein (164 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query165
pfam14009181 pfam14009, DUF4228, Domain of unknown function (DU 1e-31
>gnl|CDD|222492 pfam14009, DUF4228, Domain of unknown function (DUF4228) Back     alignment and domain information
 Score =  111 bits (280), Expect = 1e-31
 Identities = 57/191 (29%), Positives = 88/191 (46%), Gaps = 38/191 (19%)

Query: 1   MGNCLML------EEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDS---FPEIR- 50
           MGNCL            +KV+  DGK+ EY  P+   +++ E+ GH + DS   +   R 
Sbjct: 1   MGNCLSCCLAPSGAAATVKVVHPDGKVREYSRPVTAAELMLEYPGHFVCDSDSLYIGRRI 60

Query: 51  -HLMPDFKLLGGNLYFLVPVP-LPSPK--VEKKKVRFSEEEARDGAKETSS--------- 97
             L PD +L  G LYFL+P   L S     +   +  S   A   A    S         
Sbjct: 61  PALPPDDELERGQLYFLLPAERLQSVLAAKDMASLASSASSALKSASAKRSSSSRPSPVR 120

Query: 98  -----VVRIKVVISKQELQDLQMLQKGGVVSVQDVASWLQGKQSTNSQAVGFQDGGNNNE 152
                VVR+KVV+SK+EL++  +L++G   SV    S     +   +  +  +  G+ + 
Sbjct: 121 KSDGGVVRVKVVVSKEELEE--LLEEG---SVSASLS-----ELCETPELRKRSRGSRSR 170

Query: 153 GWKPELESIPE 163
            W+P+LE+I E
Sbjct: 171 SWRPKLETISE 181


This domain is found in plants. The function is not known. Length = 181

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 165
PF14009181 DUF4228: Domain of unknown function (DUF4228) 100.0
>PF14009 DUF4228: Domain of unknown function (DUF4228) Back     alignment and domain information
Probab=100.00  E-value=3.8e-38  Score=244.80  Aligned_cols=153  Identities=35%  Similarity=0.603  Sum_probs=112.8

Q ss_pred             CCCcccC------CCceeEEEecCCcEEEEeCCccHHHHHhhcCCcEEeCCC-----CCccCCCCCCccCCCCeEEEeeC
Q 031133            1 MGNCLML------EEKVIKVMKTDGKILEYKAPMKVQDVLAEFAGHAISDSF-----PEIRHLMPDFKLLGGNLYFLVPV   69 (165)
Q Consensus         1 MGNC~~~------~~~~iKV~~~dG~v~e~~~Pv~a~evl~~~Pgh~v~~s~-----~~~~~L~~de~L~~G~lYfLlP~   69 (165)
                      ||||++.      ..++||||++||+|++|+.||+|+|||.+|||||||++.     ..+++|+||++|++|++|||||.
T Consensus         1 MGn~~~~~~~~~~~~~~vkvv~~~G~v~~~~~pv~a~evm~~~P~h~v~~~~~~~~~~~~~~l~~d~~L~~G~~Y~llP~   80 (181)
T PF14009_consen    1 MGNCVSCCLASSSSAATVKVVHPDGKVEEFKRPVTAAEVMLENPGHFVCDSDSFRFGRRIKPLPPDEELQPGQIYFLLPM   80 (181)
T ss_pred             CCCcccccccccCCCceEEEEcCCCcEEEeCCCcCHHHHHHHCCCCEEeccccccCCCcccCCCccCeecCCCEEEEEEc
Confidence            9999974      689999999999999999999999999999999998774     24689999999999999999999


Q ss_pred             CCCCCcc---ccceeecchhhhh--------------cCcCCCCceEEEEEEEcHHHHHHHHHHhcCCcccHHHHHHHHh
Q 031133           70 PLPSPKV---EKKKVRFSEEEAR--------------DGAKETSSVVRIKVVISKQELQDLQMLQKGGVVSVQDVASWLQ  132 (165)
Q Consensus        70 ~~~~~~~---~~~~vr~~~~~~e--------------~~~~~~~~~~rvKi~itk~~l~~l~~~~~~~~~s~e~~l~~l~  132 (165)
                      ++.....   ....+.+......              .....++|.++||++++|+||++  ++.++   +.+++++.+.
T Consensus        81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~rvki~isk~el~~--~l~~~---s~~~~~~~~~  155 (181)
T PF14009_consen   81 SRLQSVLSASDMASLASSASSASSSSSARKSSSRPFSRSRSSNGGVVRVKIVISKEELEE--LLSEG---SDEEMLSESC  155 (181)
T ss_pred             cccCcccccchhcccccchhhccccccccccccccccccccccCcccccccccCHHHHHH--HHhcc---ccchhhhhhh
Confidence            9865421   1222222221110              11355678999999999999999  55533   4455555544


Q ss_pred             hcccccccccccccCCCCCCCccccCCCCCC
Q 031133          133 GKQSTNSQAVGFQDGGNNNEGWKPELESIPE  163 (165)
Q Consensus       133 ~~~~~~~~~~~~~~~~~~~~~WrP~LeSIpE  163 (165)
                      ........     ......++|||+||||||
T Consensus       156 ~~~~~~~~-----~~~~~~~~WrP~LesI~E  181 (181)
T PF14009_consen  156 RRPRRRSS-----RRGSRSRSWRPALESIPE  181 (181)
T ss_pred             cccccccc-----ccCCCCCCccCCCCCcCc
Confidence            43221100     123456899999999998




Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00