Citrus Sinensis ID: 028427
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
| GI | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 209 | | | | | |
| 225469778 | 204 | PREDICTED: uncharacterized protein LOC10 | 0.961 | 0.985 | 0.575 | 8e-59 |
| 359495653 | 204 | PREDICTED: uncharacterized protein LOC10 | 0.961 | 0.985 | 0.575 | 2e-58 |
| 388506910 | 199 | unknown [Lotus japonicus] | 0.842 | 0.884 | 0.528 | 9e-42 |
| 351723571 | 197 | uncharacterized protein LOC100499731 [Gl | 0.799 | 0.847 | 0.531 | 6e-40 |
| 351727288 | 200 | uncharacterized protein LOC100305966 [Gl | 0.727 | 0.76 | 0.550 | 2e-37 |
| 357485585 | 182 | Glutamyl-tRNA synthetase [Medicago trunc | 0.722 | 0.829 | 0.540 | 3e-36 |
| 297798056 | 194 | threonine endopeptidase [Arabidopsis lyr | 0.909 | 0.979 | 0.427 | 2e-35 |
| 18420175 | 193 | uncharacterized protein [Arabidopsis tha | 0.904 | 0.979 | 0.436 | 1e-34 |
| 4467116 | 153 | hypothetical protein [Arabidopsis thalia | 0.645 | 0.882 | 0.496 | 4e-31 |
| 449508380 | 210 | PREDICTED: uncharacterized protein At4g0 | 0.727 | 0.723 | 0.385 | 6e-31 |
>gi|225469778|ref|XP_002274430.1| PREDICTED: uncharacterized protein LOC100261101 [Vitis vinifera] gi|297735928|emb|CBI18704.3| unnamed protein product [Vitis vinifera]
Score = 232 bits (591), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 122/212 (57%), Positives = 147/212 (69%), Gaps = 11/212 (5%)
Query: 1 MELCTAIQTQSAISNHHHPFLTATTAPALARTKSALSLKQTTVSRSGYGLRSRILYYTNP 60
MELCT A SN HH L A L ++ KQ+ +SR+ G LY+ NP
Sbjct: 1 MELCTT----RAFSNLHHRTLFNPLANRLRWKTISIPFKQSPISRTSPG----SLYFNNP 52
Query: 61 LPKAT-SEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLT-- 117
L +A+ SE +SSG DQY+ ++RD ED+PA E+NVYNE + T P E+S V+ T
Sbjct: 53 LLRASISEGSSSGADQYIGEERDSVLVMEDIPATEENVYNEVIPTEAPIEDSQVEEQTVA 112
Query: 118 NELLDNLKIKFDSEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLW 177
E LDNL IKFDSED YS+ LYGTGAL ALW + +VGAIDSIP+FPKLME+VGLGYTLW
Sbjct: 113 FEFLDNLNIKFDSEDPYSIFLYGTGALTALWFASAIVGAIDSIPIFPKLMEIVGLGYTLW 172
Query: 178 FSWRYLLFKKNRDELATKIEELKQQVLGSNDD 209
FS RYL+FK+NRDELA KIEELKQQVLGS D+
Sbjct: 173 FSARYLIFKQNRDELAAKIEELKQQVLGSEDE 204
Source: Vitis vinifera; Species: Vitis vinifera; Genus: Vitis; Family: Vitaceae; Order: Vitales; Class: (not listed); Phylum: Streptophyta; Superkingdom: Eukaryota
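The Q cover, H cover, and Identity columns in the summary table appear to be derived from the alignment statistics printed for each hit. The sketch below rests on an assumption inferred from the numbers on this page, not on a documented definition from the server: it treats coverage as the number of gap-free alignment columns divided by the sequence length, and truncates each ratio to three decimals. The function name `hit_metrics` is hypothetical.

```python
def hit_metrics(identities, align_len, gaps, query_len, hit_len):
    """Return (q_cover, h_cover, identity), truncated to 3 decimals.

    Assumed definitions (inferred from this report, not documented):
      q_cover  = gap-free columns / query length
      h_cover  = gap-free columns / hit length
      identity = identical columns / alignment length
    """
    paired = align_len - gaps  # columns where both sequences have a residue

    def trunc3(num, den):
        # Integer arithmetic avoids floating-point rounding surprises.
        return (num * 1000 // den) / 1000

    return (trunc3(paired, query_len),
            trunc3(paired, hit_len),
            trunc3(identities, align_len))


# First hit (gi 225469778): Identities = 122/212, Gaps = 11/212,
# query length 209, hit length 204.
print(hit_metrics(122, 212, 11, 209, 204))  # (0.961, 0.985, 0.575)
```

With these inputs the sketch reproduces the 0.961, 0.985, and 0.575 listed for the first hit in the summary table.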
>gi|359495653|ref|XP_003635047.1| PREDICTED: uncharacterized protein LOC100853187 isoform 1 [Vitis vinifera] gi|359495655|ref|XP_003635048.1| PREDICTED: uncharacterized protein LOC100853187 isoform 2 [Vitis vinifera] gi|297735931|emb|CBI18707.3| unnamed protein product [Vitis vinifera]

>gi|388506910|gb|AFK41521.1| unknown [Lotus japonicus]

>gi|351723571|ref|NP_001235236.1| uncharacterized protein LOC100499731 [Glycine max] gi|255626121|gb|ACU13405.1| unknown [Glycine max]

>gi|351727288|ref|NP_001235620.1| uncharacterized protein LOC100305966 [Glycine max] gi|255627137|gb|ACU13913.1| unknown [Glycine max]

>gi|357485585|ref|XP_003613080.1| Glutamyl-tRNA synthetase [Medicago truncatula] gi|355514415|gb|AES96038.1| Glutamyl-tRNA synthetase [Medicago truncatula] gi|388495348|gb|AFK35740.1| unknown [Medicago truncatula]

>gi|297798056|ref|XP_002866912.1| threonine endopeptidase [Arabidopsis lyrata subsp. lyrata] gi|297312748|gb|EFH43171.1| threonine endopeptidase [Arabidopsis lyrata subsp. lyrata]

>gi|18420175|ref|NP_568035.1| uncharacterized protein [Arabidopsis thaliana] gi|21554198|gb|AAM63277.1| unknown [Arabidopsis thaliana] gi|107738205|gb|ABF83661.1| At4g38100 [Arabidopsis thaliana] gi|332661478|gb|AEE86878.1| uncharacterized protein [Arabidopsis thaliana]

>gi|4467116|emb|CAB37550.1| hypothetical protein [Arabidopsis thaliana] gi|7270793|emb|CAB80475.1| hypothetical protein [Arabidopsis thaliana]

>gi|449508380|ref|XP_004163298.1| PREDICTED: uncharacterized protein At4g01150, chloroplastic-like [Cucumis sativus]
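The hit headers above use the legacy NCBI `gi|number|database|accession|` layout, with several identical-sequence records concatenated on one line. As a rough sketch, assuming that layout holds for every header, they can be split into individual records with a regular expression. The function name `parse_defline` and the tuple layout are illustrative choices, not part of any NCBI tool.

```python
import re

# One record: gi|<number>|<db>|<accession>| followed by a description that
# runs up to the next "gi|<number>|" or the end of the line.
RECORD = re.compile(r"gi\|(\d+)\|(\w+)\|([^|\s]+)\|\s*(.*?)(?=\s+gi\|\d+\||$)")


def parse_defline(line):
    """Split a concatenated defline into (gi, db, accession, description) tuples."""
    return [(int(gi), db, acc, desc) for gi, db, acc, desc in RECORD.findall(line)]


header = (">gi|351723571|ref|NP_001235236.1| uncharacterized protein "
          "LOC100499731 [Glycine max] gi|255626121|gb|ACU13405.1| unknown [Glycine max]")
records = parse_defline(header)
for record in records:
    print(record)
```

Keeping the GI as an integer and the accession as a separate field makes it straightforward to match these records against the GI column of the summary table.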
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
| ID | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 209 | | | | | |
| TAIR|locus:2121065 | 193 | AT4G38100 [Arabidopsis thalian | 0.918 | 0.994 | 0.385 | 2.4e-28 |
| TAIR|locus:2125018 | 164 | AT4G01150 "AT4G01150" [Arabido | 0.502 | 0.640 | 0.449 | 5.8e-18 |
| TAIR|locus:2044335 | 174 | PSI-P "photosystem I P subunit | 0.550 | 0.660 | 0.260 | 1.8e-09 |
| TAIR|locus:2037435 | 156 | AT1G52220 "AT1G52220" [Arabido | 0.502 | 0.673 | 0.261 | 2.9e-07 |
TAIR|locus:2121065 AT4G38100 [Arabidopsis thaliana (taxid:3702)]
Score = 316 (116.3 bits), Expect = 2.4e-28, P = 2.4e-28
Identities = 81/210 (38%), Positives = 118/210 (56%)
Query: 1 MELCTAIQTQSAISNHHHPFXXXXXXXXXXXXKSALSLKQTTVSRSGYGLRSRILYYTNP 60
MELCT ++ + I++ F + +L L++ S L+SR L +
Sbjct: 1 MELCT--RSTTIITHLPASFNGHGYLAGKSVDRISLPLQRNVASLV---LQSRTLRCSRK 55
Query: 61 LPKAT-SEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNE 119
P T +EETS+G +++ V+ RDG A A EKN +E A E+ L E
Sbjct: 56 FPGETVTEETSTGVNEFGVEDRDGVVVA----AEEKNSNSE----APQAEDEETQAL--E 105
Query: 120 LLDNLKIKFDSEDKYSLVLYXXXXXXXXXXXXXXXXXIDSIPLFPKLMEVVGLGYTLWFS 179
L+++K+ DS+ YS++LY +++IPLFPKLMEVVGLGYTLWF+
Sbjct: 106 FLNDIKL--DSDKTYSILLYGSGAIVALYLTSAIVSSLEAIPLFPKLMEVVGLGYTLWFT 163
Query: 180 WRYLLFKKNRDELATKIEELKQQVLGSNDD 209
RYLLFK+NR+EL TK+ E+K+QVLGS+ +
Sbjct: 164 TRYLLFKRNREELKTKVSEIKKQVLGSDSE 193
TAIR|locus:2125018 AT4G01150 "AT4G01150" [Arabidopsis thaliana (taxid:3702)]

TAIR|locus:2044335 PSI-P "photosystem I P subunit" [Arabidopsis thaliana (taxid:3702)]

TAIR|locus:2037435 AT1G52220 "AT1G52220" [Arabidopsis thaliana (taxid:3702)]
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by EzyPred Server
Original result from EzyPred Server
Failed to connect to the EzyPred server.
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
GSVIVG00011469001: SubName- Full=Chromosome undetermined scaffold_310, whole genome shotgun sequence; (204 aa) (Vitis vinifera)

| Associated protein | Evidence | Score |
|---|---|---|
| GSVIVG00027603001 | • | 0.505 |
| GSVIVG00002625001 | • | 0.505 |
| GSVIVG00007397001 | • | 0.504 |
| GSVIVG00004124001 | • | 0.502 |
| GSVIVG00035652001 | • | 0.501 |
| GSVIVG00020301001 | • | 0.501 |
| cemA | • | 0.500 |
| GSVIVG00017320001 | • | 0.500 |
| GSVIVG00009465001 | • | 0.500 |
| GSVIVG00034898001 | • | 0.499 |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
| ID | Length | Definition | E-value |
|---|---|---|---|
| Query | 209 | | |
| pfam14159 | 90 | pfam14159, DUF4308, Domain of unknown function (DU | 1e-33 |
| PLN02777 | 167 | PLN02777, PLN02777, photosystem I P subunit (PSI-P | 1e-16 |
>gnl|CDD|222574 pfam14159, DUF4308, Domain of unknown function (DUF4308)
Score = 115 bits (290), Expect = 1e-33
Identities = 45/81 (55%), Positives = 59/81 (72%), Gaps = 4/81 (4%)
Query: 129 DSEDKYSLVLYGTGA----LLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLL 184
EDKY L G GA ++ALWL+ V+ AIDSIPL P L+E+VGLGY+ WF +RYLL
Sbjct: 10 KFEDKYKRPLLGVGAIIAVIVALWLSAAVLDAIDSIPLLPGLLELVGLGYSGWFVYRYLL 69
Query: 185 FKKNRDELATKIEELKQQVLG 205
F ++R EL KI+ELK+++LG
Sbjct: 70 FSEDRQELLAKIQELKKEILG 90
This presumed domain is functionally uncharacterized. The domain family is found in bacteria and eukaryotes and is approximately 90 amino acids in length. The domain occurs in several aminoacyl-tRNA synthetase enzymes as well as in isolation in single-domain proteins. Length = 90

>gnl|CDD|178376 PLN02777, PLN02777, photosystem I P subunit (PSI-P)
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
| ID | Length | Definition | Probability |
|---|---|---|---|
| Query | 209 | | |
| PLN02777 | 167 | photosystem I P subunit (PSI-P) | 100.0 |
| PF14159 | 90 | CAAD: CAAD domains of cyanobacterial aminoacyl-tRN | 99.95 |
>PLN02777 photosystem I P subunit (PSI-P)
Probab=100.00 E-value=5.1e-34 Score=238.71 Aligned_cols=90 Identities=36% Similarity=0.764 Sum_probs=87.9
Q ss_pred hHHHHHHhhhhccc-cCcchhhhhhHHHHHHHHHHHHHHHHhccCCCccchheeeeeeeeeeehhhhcccccchHHHHHH
Q 028427 117 TNELLDNLKIKFDS-EDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLLFKKNRDELATK 195 (209)
Q Consensus 117 ~~Evl~~L~~kwd~-e~K~~v~l~g~gaiVal~v~~aVL~AIdsIPLLp~lLELVGLgYt~WFvyRyLLf~e~RqEL~~k 195 (209)
.+|+++.++++||. |||++++++++++||++|++.+||+|||+|||+|++||||||||++||+||||+|+++|+||+++
T Consensus 76 ~~ei~k~~~e~Wd~~EdK~av~~l~~aaiVal~v~~~VL~AId~lPLlP~lLELVGigYs~WF~yRyLLfke~ReeL~~k 155 (167)
T PLN02777 76 LPEIVKTVQEAWDKVEDKYAVSSLAFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFAYKNLVFKPDREALIEK 155 (167)
T ss_pred HHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhccccccchHHHhhhhhhhhhhhhHhcCcccHHHHHHH
Confidence 56999999999995 99999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhhhhcCC
Q 028427 196 IEELKQQVLGS 206 (209)
Q Consensus 196 I~~lk~~IlG~ 206 (209)
|+++|++|+|+
T Consensus 156 i~~lk~~IlG~ 166 (167)
T PLN02777 156 IKDTYKEIIGS 166 (167)
T ss_pred HHHHHHHhhCC
Confidence 99999999996
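The HHsearch summary line above (`Probab=... E-value=... Score=...`) is a sequence of `key=value` pairs. A minimal sketch for turning it into a dictionary follows, assuming every token has that form and that a trailing `%` marks a percentage; `parse_hh_summary` is a hypothetical helper, not part of the HH-suite.

```python
def parse_hh_summary(line):
    """Parse an HHsearch hit summary line into a dict of floats."""
    stats = {}
    for token in line.split():
        key, _, value = token.partition("=")
        # Percentages such as "36%" are stored as plain numbers (36.0).
        stats[key] = float(value.rstrip("%"))
    return stats


line = ("Probab=100.00 E-value=5.1e-34 Score=238.71 Aligned_cols=90 "
        "Identities=36% Similarity=0.764 Sum_probs=87.9")
stats = parse_hh_summary(line)
print(stats["Probab"], stats["E-value"], stats["Aligned_cols"])
```

Storing every field as a float keeps the sketch short; a stricter parser might keep `Aligned_cols` as an integer and reject tokens without an `=` sign.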
>PF14159 CAAD: CAAD domains of cyanobacterial aminoacyl-tRNA synthetase
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00