Citrus Sinensis ID: 031597


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150------
MAATAYAAVLTPRVPSTTTVKVKSSHCFALPCLPPRSSTPPFSSSIKQVSESRRFPLLQVRASSSEETSTVDADELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSVPLLPKLLELIGLGYTGWFVYRYLLFKVRLRDCKSSHAFS
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHcccccccccHHHHHHHHHHHHHHHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHEEEEEEHHHccccccHHHHHHHcccc
cccccHHHHcccccccccccccccccccccccccccccccccccccHHcccccccccEEEccccccccccccHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHEHHHHHHHHHHHHHHcccHHHHHHccccc
MAATAYAAvltprvpstttvkvksshcfalpclpprsstppfsssikqvsesrrfpllqvrassseetstvdaDELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVgainsvplLPKLLELIGLGYTGWFVYRYLLFKVRlrdcksshafs
maatayaavltprvpstttvkVKSSHCFALPCLPPRSSTPPFSSSIKQVSESRRFPLLqvrassseetstvdaDELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSVPLLPKLLELIGLGYTGWFVYRYLLFKVRLRDCKSSHAFS
MAATAYAAVLTPRVPSTTTVKVKSSHCFALPCLpprsstppfsssIKQVSESRRFPLLQVRASSSEETSTVDADELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSVpllpklleliglgYTGWFVYRYLLFKVRLRDCKSSHAFS
*****YAAVLTPRVPSTTTVKVKSSHCFALPCL*******************************************FSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSVPLLPKLLELIGLGYTGWFVYRYLLFKVRLRDC*******
**************************************************************************ELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSVPLLPKLLELIGLGYTGWFVYRYLLFKVRLRDCKSSHAFS
MAATAYAAVLTPRVP*********SHCFALPCLPPR**************ESRRFPLLQV***********DADELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSVPLLPKLLELIGLGYTGWFVYRYLLFKVRLRDCKSSHAFS
*********************************************************************TVDADELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSVPLLPKLLELIGLGYTGWFVYRYLLFKVRL**********
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHoooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooooHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAATAYAAVLTPRVPSTTTVKVKSSHCFALPCLPPRSSTPPFSSSIKQVSESRRFPLLQVRASSSEETSTVDADELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSVPLLPKLLELIGLGYTGWFVYRYLLFKVRLRDCKSSHAFS
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query156 2.2.26 [Sep-21-2011]
O04616164 Uncharacterized protein A yes no 0.871 0.829 0.623 4e-44
Q8LCA1174 Thylakoid membrane phosph no no 0.403 0.362 0.428 2e-11
>sp|O04616|Y4115_ARATH Uncharacterized protein At4g01150, chloroplastic OS=Arabidopsis thaliana GN=At4g01150 PE=1 SV=1 Back     alignment and function desciption
 Score =  176 bits (447), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 91/146 (62%), Positives = 114/146 (78%), Gaps = 10/146 (6%)

Query: 1   MAATAYAAVLTPRVPSTTTVKVKSSHCFALPCLPPRS-STPPFSSSIKQVSES--RRFPL 57
           +AA++  AV+ PRVP+ +T       C A+P LPPRS     F+  +K VS +  ++  L
Sbjct: 5   VAASSSMAVMVPRVPAVST------RCSAVPYLPPRSFGRSSFTVPLKLVSGNGLQKVEL 58

Query: 58  LQVRASSSEETSTVDADELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSV 117
           L+ RASS EETS++D +EL +DLKEKWD +ENKSTVL+YGGGAIVAVWLSS +VGAINSV
Sbjct: 59  LKTRASS-EETSSIDTNELITDLKEKWDGLENKSTVLIYGGGAIVAVWLSSIVVGAINSV 117

Query: 118 PLLPKLLELIGLGYTGWFVYRYLLFK 143
           PLLPK++EL+GLGYTGWFVYRYLLFK
Sbjct: 118 PLLPKVMELVGLGYTGWFVYRYLLFK 143





Arabidopsis thaliana (taxid: 3702)
>sp|Q8LCA1|TMP14_ARATH Thylakoid membrane phosphoprotein 14 kDa, chloroplastic OS=Arabidopsis thaliana GN=TMP14 PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query156
388500696163 unknown [Lotus japonicus] 0.884 0.846 0.687 1e-46
224067952155 predicted protein [Populus trichocarpa] 0.775 0.780 0.737 8e-46
351725951163 uncharacterized protein LOC100305578 [Gl 0.833 0.797 0.688 4e-45
255541776167 Thylakoid membrane phosphoprotein 14 kDa 0.769 0.718 0.735 8e-45
224130462151 predicted protein [Populus trichocarpa] 0.775 0.801 0.722 5e-44
118487811164 unknown [Populus trichocarpa] 0.769 0.731 0.728 1e-43
356560639153 PREDICTED: uncharacterized protein At4g0 0.788 0.803 0.674 2e-43
356560637163 PREDICTED: uncharacterized protein At4g0 0.833 0.797 0.659 5e-43
18411555164 uncharacterized protein [Arabidopsis tha 0.871 0.829 0.623 2e-42
687677164 unknown [Arabidopsis thaliana] 0.871 0.829 0.616 4e-42
>gi|388500696|gb|AFK38414.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
 Score =  191 bits (484), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 101/147 (68%), Positives = 117/147 (79%), Gaps = 9/147 (6%)

Query: 1   MAATAYAAVLTP-RVPSTTTVKVKSSHCFALPCLPPRSSTPP---FSSSIKQVSESRRFP 56
           MAA A + VL P R P+TT +    + C ALP LPPR ST     FS S+K  SESR+  
Sbjct: 1   MAAAAASTVLLPHRFPTTTNI----TRCSALPYLPPRVSTTTTTLFSPSLKHFSESRKPS 56

Query: 57  LLQVRASSSEETSTVDADELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINS 116
           LLQ RASS +ETS+VD +EL SDLKEKWDAVENKSTV++YGGGA+VAVWLSS +VGAINS
Sbjct: 57  LLQTRASS-DETSSVDTNELISDLKEKWDAVENKSTVIIYGGGALVAVWLSSILVGAINS 115

Query: 117 VPLLPKLLELIGLGYTGWFVYRYLLFK 143
           VPLLPK++EL+GLGYTGWFVYRYLLFK
Sbjct: 116 VPLLPKIMELVGLGYTGWFVYRYLLFK 142




Source: Lotus japonicus

Species: Lotus japonicus

Genus: Lotus

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224067952|ref|XP_002302615.1| predicted protein [Populus trichocarpa] gi|222844341|gb|EEE81888.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|351725951|ref|NP_001237366.1| uncharacterized protein LOC100305578 [Glycine max] gi|255625961|gb|ACU13325.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|255541776|ref|XP_002511952.1| Thylakoid membrane phosphoprotein 14 kDa, chloroplast precursor, putative [Ricinus communis] gi|223549132|gb|EEF50621.1| Thylakoid membrane phosphoprotein 14 kDa, chloroplast precursor, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|224130462|ref|XP_002320843.1| predicted protein [Populus trichocarpa] gi|222861616|gb|EEE99158.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|118487811|gb|ABK95729.1| unknown [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356560639|ref|XP_003548598.1| PREDICTED: uncharacterized protein At4g01150, chloroplastic-like [Glycine max] Back     alignment and taxonomy information
>gi|356560637|ref|XP_003548597.1| PREDICTED: uncharacterized protein At4g01150, chloroplastic-like [Glycine max] Back     alignment and taxonomy information
>gi|18411555|ref|NP_567210.1| uncharacterized protein [Arabidopsis thaliana] gi|75097110|sp|O04616.1|Y4115_ARATH RecName: Full=Uncharacterized protein At4g01150, chloroplastic; Flags: Precursor gi|14488088|gb|AAK63864.1|AF389292_1 AT4g01150/F2N1_18 [Arabidopsis thaliana] gi|2191138|gb|AAB61025.1| A_IG002N01.18 gene product [Arabidopsis thaliana] gi|7267612|emb|CAB80924.1| hypothetical protein [Arabidopsis thaliana] gi|20147123|gb|AAM10278.1| AT4g01150/F2N1_18 [Arabidopsis thaliana] gi|332656587|gb|AEE81987.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|687677|gb|AAB00107.1| unknown [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query156
TAIR|locus:2125018164 AT4G01150 "AT4G01150" [Arabido 0.871 0.829 0.520 4.9e-30
TAIR|locus:2037435156 AT1G52220 "AT1G52220" [Arabido 0.544 0.544 0.325 1.2e-08
TAIR|locus:2121065193 AT4G38100 [Arabidopsis thalian 0.551 0.445 0.366 5.3e-08
TAIR|locus:2044335174 PSI-P "photosystem I P subunit 0.916 0.821 0.2 2.9e-07
TAIR|locus:2125018 AT4G01150 "AT4G01150" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 332 (121.9 bits), Expect = 4.9e-30, P = 4.9e-30
 Identities = 76/146 (52%), Positives = 95/146 (65%)

Query:     1 MAATAYAAVLTPRVPSTTTVKVKSSHCFALPCLXXXXXXXXXXXX-IKQVSES--RRFPL 57
             +AA++  AV+ PRVP+ +T       C A+P L             +K VS +  ++  L
Sbjct:     5 VAASSSMAVMVPRVPAVST------RCSAVPYLPPRSFGRSSFTVPLKLVSGNGLQKVEL 58

Query:    58 LQVRASSSEETSTVDADELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSV 117
             L+ RASS EETS++D +EL +DLKEKWD +ENKSTVL+YGGGAIVAVWLSS +VGAINSV
Sbjct:    59 LKTRASS-EETSSIDTNELITDLKEKWDGLENKSTVLIYGGGAIVAVWLSSIVVGAINSV 117

Query:   118 XXXXXXXXXXXXXYTGWFVYRYLLFK 143
                          YTGWFVYRYLLFK
Sbjct:   118 PLLPKVMELVGLGYTGWFVYRYLLFK 143




GO:0003674 "molecular_function" evidence=ND
GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009941 "chloroplast envelope" evidence=IDA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0010287 "plastoglobule" evidence=IDA
GO:0009579 "thylakoid" evidence=IDA
GO:0009534 "chloroplast thylakoid" evidence=IDA
TAIR|locus:2037435 AT1G52220 "AT1G52220" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2121065 AT4G38100 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2044335 PSI-P "photosystem I P subunit" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
O04616Y4115_ARATHNo assigned EC number0.62320.87170.8292yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
gw1.II.1221.1
hypothetical protein (155 aa)
(Populus trichocarpa)
Predicted Functional Partners:
eugene3.00020518
SubName- Full=Putative uncharacterized protein; (262 aa)
      0.521
estExt_fgenesh4_pg.C_LG_V1224
SubName- Full=Putative uncharacterized protein; (262 aa)
      0.520
gw1.I.6006.1
hypothetical protein (200 aa)
      0.517
estExt_fgenesh4_pg.C_LG_II2395
SubName- Full=Putative uncharacterized protein; (210 aa)
      0.508
eugene3.00090150
hypothetical protein (129 aa)
      0.508
gw1.XV.3200.1
hypothetical protein (265 aa)
       0.506
gw1.XIII.568.1
SubName- Full=Putative uncharacterized protein; (212 aa)
       0.506
gw1.X.3020.1
hypothetical protein (83 aa)
       0.506
gw1.VIII.1629.1
hypothetical protein (88 aa)
       0.506
gw1.III.1425.1
hypothetical protein (172 aa)
       0.506

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query156
pfam1415990 pfam14159, DUF4308, Domain of unknown function (DU 8e-29
PLN02777167 PLN02777, PLN02777, photosystem I P subunit (PSI-P 1e-19
>gnl|CDD|222574 pfam14159, DUF4308, Domain of unknown function (DUF4308) Back     alignment and domain information
 Score =  101 bits (253), Expect = 8e-29
 Identities = 39/71 (54%), Positives = 48/71 (67%), Gaps = 4/71 (5%)

Query: 77  FSDLKEKWDAVENKSTVLLYGGGAI----VAVWLSSTIVGAINSVPLLPKLLELIGLGYT 132
              L   W   E+K    L G GAI    VA+WLS+ ++ AI+S+PLLP LLEL+GLGY+
Sbjct: 1   LKKLPNYWGKFEDKYKRPLLGVGAIIAVIVALWLSAAVLDAIDSIPLLPGLLELVGLGYS 60

Query: 133 GWFVYRYLLFK 143
           GWFVYRYLLF 
Sbjct: 61  GWFVYRYLLFS 71


This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is approximately 90 amino acids in length. The domain is found in several amino-acyl tRNA synthetase enzymes as well as in isolation in single domain proteins. Length = 90

>gnl|CDD|178376 PLN02777, PLN02777, photosystem I P subunit (PSI-P) Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 156
PLN02777167 photosystem I P subunit (PSI-P) 100.0
PF1415990 CAAD: CAAD domains of cyanobacterial aminoacyl-tRN 99.95
>PLN02777 photosystem I P subunit (PSI-P) Back     alignment and domain information
Probab=100.00  E-value=1.7e-42  Score=278.27  Aligned_cols=142  Identities=30%  Similarity=0.623  Sum_probs=119.2

Q ss_pred             hhhhCCCCcccccccccccccccccCCCCCC----CCCCCCCccccccccCccceeeeecccC--CCCCccchhHHHHHH
Q 031597            7 AAVLTPRVPSTTTVKVKSSHCFALPCLPPRS----STPPFSSSIKQVSESRRFPLLQVRASSS--EETSTVDADELFSDL   80 (156)
Q Consensus         7 ~~~~~~~~~~~~~~~~~~~~~~~lp~lp~R~----~~~~~~~~~~~~~~s~~~~~l~vrass~--~esss~~~~Ei~~~l   80 (156)
                      ..+....-|....  +++++|+.+|.|||.+    .++++++.||++..++.+++    |+++  ++.++.+.+|+++++
T Consensus        10 ~~~~~~~~~~~~~--a~~~~~~~lp~lppp~~~~~~~~~~~~~~c~~~~r~vv~~----a~ge~s~~~~~~~~~ei~k~~   83 (167)
T PLN02777         10 STLIDSKAPRSSA--AASPQCVSLPTLPPPPVQSHNRPAKATAYCRKIARNVVTM----ATGEAPAEVETTELPEIVKTV   83 (167)
T ss_pred             cccccCCCCCcCc--ccCCccccCCCCCCCCcccCCCcchhHHHHHHhHHHHHHH----hccCCCcccccccHHHHHHHH
Confidence            3344444444332  3469999999999755    36788999999998886554    4442  333455778999999


Q ss_pred             HHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHhhccCChhhHHHHHhhhheehhhhhhhccchhhHhhhhhcc
Q 031597           81 KEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSVPLLPKLLELIGLGYTGWFVYRYLLFKVRLRDCKSSHA  154 (156)
Q Consensus        81 ~e~Wd~~e~k~~vl~~g~gaival~v~~~vl~AId~iPLlp~llELVGlgyt~WF~yRyLl~~~~R~eL~~~~~  154 (156)
                      ||+||++|||++++++++++||++|++.+||+|||+|||+|++||||||||++||+||||+|++|||||++++.
T Consensus        84 ~e~Wd~~EdK~av~~l~~aaiVal~v~~~VL~AId~lPLlP~lLELVGigYs~WF~yRyLLfke~ReeL~~ki~  157 (167)
T PLN02777         84 QEAWDKVEDKYAVSSLAFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFAYKNLVFKPDREALIEKIK  157 (167)
T ss_pred             HHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhccccccchHHHhhhhhhhhhhhhHhcCcccHHHHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999874



>PF14159 CAAD: CAAD domains of cyanobacterial aminoacyl-tRNA synthetase Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

No hit with probability above 80.00


Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00