Citrus Sinensis ID: 031597
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 156 | ||||||
| 388500696 | 163 | unknown [Lotus japonicus] | 0.884 | 0.846 | 0.687 | 1e-46 | |
| 224067952 | 155 | predicted protein [Populus trichocarpa] | 0.775 | 0.780 | 0.737 | 8e-46 | |
| 351725951 | 163 | uncharacterized protein LOC100305578 [Gl | 0.833 | 0.797 | 0.688 | 4e-45 | |
| 255541776 | 167 | Thylakoid membrane phosphoprotein 14 kDa | 0.769 | 0.718 | 0.735 | 8e-45 | |
| 224130462 | 151 | predicted protein [Populus trichocarpa] | 0.775 | 0.801 | 0.722 | 5e-44 | |
| 118487811 | 164 | unknown [Populus trichocarpa] | 0.769 | 0.731 | 0.728 | 1e-43 | |
| 356560639 | 153 | PREDICTED: uncharacterized protein At4g0 | 0.788 | 0.803 | 0.674 | 2e-43 | |
| 356560637 | 163 | PREDICTED: uncharacterized protein At4g0 | 0.833 | 0.797 | 0.659 | 5e-43 | |
| 18411555 | 164 | uncharacterized protein [Arabidopsis tha | 0.871 | 0.829 | 0.623 | 2e-42 | |
| 687677 | 164 | unknown [Arabidopsis thaliana] | 0.871 | 0.829 | 0.616 | 4e-42 |
| >gi|388500696|gb|AFK38414.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
Score = 191 bits (484), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 101/147 (68%), Positives = 117/147 (79%), Gaps = 9/147 (6%)
Query: 1 MAATAYAAVLTP-RVPSTTTVKVKSSHCFALPCLPPRSSTPP---FSSSIKQVSESRRFP 56
MAA A + VL P R P+TT + + C ALP LPPR ST FS S+K SESR+
Sbjct: 1 MAAAAASTVLLPHRFPTTTNI----TRCSALPYLPPRVSTTTTTLFSPSLKHFSESRKPS 56
Query: 57 LLQVRASSSEETSTVDADELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINS 116
LLQ RASS +ETS+VD +EL SDLKEKWDAVENKSTV++YGGGA+VAVWLSS +VGAINS
Sbjct: 57 LLQTRASS-DETSSVDTNELISDLKEKWDAVENKSTVIIYGGGALVAVWLSSILVGAINS 115
Query: 117 VPLLPKLLELIGLGYTGWFVYRYLLFK 143
VPLLPK++EL+GLGYTGWFVYRYLLFK
Sbjct: 116 VPLLPKIMELVGLGYTGWFVYRYLLFK 142
|
Source: Lotus japonicus Species: Lotus japonicus Genus: Lotus Family: Fabaceae Order: Fabales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|224067952|ref|XP_002302615.1| predicted protein [Populus trichocarpa] gi|222844341|gb|EEE81888.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|351725951|ref|NP_001237366.1| uncharacterized protein LOC100305578 [Glycine max] gi|255625961|gb|ACU13325.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|255541776|ref|XP_002511952.1| Thylakoid membrane phosphoprotein 14 kDa, chloroplast precursor, putative [Ricinus communis] gi|223549132|gb|EEF50621.1| Thylakoid membrane phosphoprotein 14 kDa, chloroplast precursor, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|224130462|ref|XP_002320843.1| predicted protein [Populus trichocarpa] gi|222861616|gb|EEE99158.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|118487811|gb|ABK95729.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|356560639|ref|XP_003548598.1| PREDICTED: uncharacterized protein At4g01150, chloroplastic-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356560637|ref|XP_003548597.1| PREDICTED: uncharacterized protein At4g01150, chloroplastic-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|18411555|ref|NP_567210.1| uncharacterized protein [Arabidopsis thaliana] gi|75097110|sp|O04616.1|Y4115_ARATH RecName: Full=Uncharacterized protein At4g01150, chloroplastic; Flags: Precursor gi|14488088|gb|AAK63864.1|AF389292_1 AT4g01150/F2N1_18 [Arabidopsis thaliana] gi|2191138|gb|AAB61025.1| A_IG002N01.18 gene product [Arabidopsis thaliana] gi|7267612|emb|CAB80924.1| hypothetical protein [Arabidopsis thaliana] gi|20147123|gb|AAM10278.1| AT4g01150/F2N1_18 [Arabidopsis thaliana] gi|332656587|gb|AEE81987.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|687677|gb|AAB00107.1| unknown [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 156 | ||||||
| TAIR|locus:2125018 | 164 | AT4G01150 "AT4G01150" [Arabido | 0.871 | 0.829 | 0.520 | 4.9e-30 | |
| TAIR|locus:2037435 | 156 | AT1G52220 "AT1G52220" [Arabido | 0.544 | 0.544 | 0.325 | 1.2e-08 | |
| TAIR|locus:2121065 | 193 | AT4G38100 [Arabidopsis thalian | 0.551 | 0.445 | 0.366 | 5.3e-08 | |
| TAIR|locus:2044335 | 174 | PSI-P "photosystem I P subunit | 0.916 | 0.821 | 0.2 | 2.9e-07 |
| TAIR|locus:2125018 AT4G01150 "AT4G01150" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 332 (121.9 bits), Expect = 4.9e-30, P = 4.9e-30
Identities = 76/146 (52%), Positives = 95/146 (65%)
Query: 1 MAATAYAAVLTPRVPSTTTVKVKSSHCFALPCLXXXXXXXXXXXX-IKQVSES--RRFPL 57
+AA++ AV+ PRVP+ +T C A+P L +K VS + ++ L
Sbjct: 5 VAASSSMAVMVPRVPAVST------RCSAVPYLPPRSFGRSSFTVPLKLVSGNGLQKVEL 58
Query: 58 LQVRASSSEETSTVDADELFSDLKEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSV 117
L+ RASS EETS++D +EL +DLKEKWD +ENKSTVL+YGGGAIVAVWLSS +VGAINSV
Sbjct: 59 LKTRASS-EETSSIDTNELITDLKEKWDGLENKSTVLIYGGGAIVAVWLSSIVVGAINSV 117
Query: 118 XXXXXXXXXXXXXYTGWFVYRYLLFK 143
YTGWFVYRYLLFK
Sbjct: 118 PLLPKVMELVGLGYTGWFVYRYLLFK 143
|
|
| TAIR|locus:2037435 AT1G52220 "AT1G52220" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2121065 AT4G38100 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2044335 PSI-P "photosystem I P subunit" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| gw1.II.1221.1 | hypothetical protein (155 aa) | ||||||||||
(Populus trichocarpa) | |||||||||||
| eugene3.00020518 | • | • | 0.521 | ||||||||
| estExt_fgenesh4_pg.C_LG_V1224 | • | • | 0.520 | ||||||||
| gw1.I.6006.1 | • | • | 0.517 | ||||||||
| estExt_fgenesh4_pg.C_LG_II2395 | • | • | 0.508 | ||||||||
| eugene3.00090150 | • | • | 0.508 | ||||||||
| gw1.XV.3200.1 | • | 0.506 | |||||||||
| gw1.XIII.568.1 | • | 0.506 | |||||||||
| gw1.X.3020.1 | • | 0.506 | |||||||||
| gw1.VIII.1629.1 | • | 0.506 | |||||||||
| gw1.III.1425.1 | • | 0.506 |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 156 | |||
| pfam14159 | 90 | pfam14159, DUF4308, Domain of unknown function (DU | 8e-29 | |
| PLN02777 | 167 | PLN02777, PLN02777, photosystem I P subunit (PSI-P | 1e-19 |
| >gnl|CDD|222574 pfam14159, DUF4308, Domain of unknown function (DUF4308) | Back alignment and domain information |
|---|
Score = 101 bits (253), Expect = 8e-29
Identities = 39/71 (54%), Positives = 48/71 (67%), Gaps = 4/71 (5%)
Query: 77 FSDLKEKWDAVENKSTVLLYGGGAI----VAVWLSSTIVGAINSVPLLPKLLELIGLGYT 132
L W E+K L G GAI VA+WLS+ ++ AI+S+PLLP LLEL+GLGY+
Sbjct: 1 LKKLPNYWGKFEDKYKRPLLGVGAIIAVIVALWLSAAVLDAIDSIPLLPGLLELVGLGYS 60
Query: 133 GWFVYRYLLFK 143
GWFVYRYLLF
Sbjct: 61 GWFVYRYLLFS 71
|
This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is approximately 90 amino acids in length. The domain is found in several amino-acyl tRNA synthetase enzymes as well as in isolation in single domain proteins. Length = 90 |
| >gnl|CDD|178376 PLN02777, PLN02777, photosystem I P subunit (PSI-P) | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 156 | |||
| PLN02777 | 167 | photosystem I P subunit (PSI-P) | 100.0 | |
| PF14159 | 90 | CAAD: CAAD domains of cyanobacterial aminoacyl-tRN | 99.95 |
| >PLN02777 photosystem I P subunit (PSI-P) | Back alignment and domain information |
|---|
Probab=100.00 E-value=1.7e-42 Score=278.27 Aligned_cols=142 Identities=30% Similarity=0.623 Sum_probs=119.2
Q ss_pred hhhhCCCCcccccccccccccccccCCCCCC----CCCCCCCccccccccCccceeeeecccC--CCCCccchhHHHHHH
Q 031597 7 AAVLTPRVPSTTTVKVKSSHCFALPCLPPRS----STPPFSSSIKQVSESRRFPLLQVRASSS--EETSTVDADELFSDL 80 (156)
Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~lp~lp~R~----~~~~~~~~~~~~~~s~~~~~l~vrass~--~esss~~~~Ei~~~l 80 (156)
..+....-|.... +++++|+.+|.|||.+ .++++++.||++..++.+++ |+++ ++.++.+.+|+++++
T Consensus 10 ~~~~~~~~~~~~~--a~~~~~~~lp~lppp~~~~~~~~~~~~~~c~~~~r~vv~~----a~ge~s~~~~~~~~~ei~k~~ 83 (167)
T PLN02777 10 STLIDSKAPRSSA--AASPQCVSLPTLPPPPVQSHNRPAKATAYCRKIARNVVTM----ATGEAPAEVETTELPEIVKTV 83 (167)
T ss_pred cccccCCCCCcCc--ccCCccccCCCCCCCCcccCCCcchhHHHHHHhHHHHHHH----hccCCCcccccccHHHHHHHH
Confidence 3344444444332 3469999999999755 36788999999998886554 4442 333455778999999
Q ss_pred HHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHhhccCChhhHHHHHhhhheehhhhhhhccchhhHhhhhhcc
Q 031597 81 KEKWDAVENKSTVLLYGGGAIVAVWLSSTIVGAINSVPLLPKLLELIGLGYTGWFVYRYLLFKVRLRDCKSSHA 154 (156)
Q Consensus 81 ~e~Wd~~e~k~~vl~~g~gaival~v~~~vl~AId~iPLlp~llELVGlgyt~WF~yRyLl~~~~R~eL~~~~~ 154 (156)
||+||++|||++++++++++||++|++.+||+|||+|||+|++||||||||++||+||||+|++|||||++++.
T Consensus 84 ~e~Wd~~EdK~av~~l~~aaiVal~v~~~VL~AId~lPLlP~lLELVGigYs~WF~yRyLLfke~ReeL~~ki~ 157 (167)
T PLN02777 84 QEAWDKVEDKYAVSSLAFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFAYKNLVFKPDREALIEKIK 157 (167)
T ss_pred HHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhccccccchHHHhhhhhhhhhhhhHhcCcccHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999874
|
|
| >PF14159 CAAD: CAAD domains of cyanobacterial aminoacyl-tRNA synthetase | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00