Citrus Sinensis ID: 028840
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
| GI | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 203 | | | | | |
| 225469778 | 204 | PREDICTED: uncharacterized protein LOC10 | 0.980 | 0.975 | 0.586 | 2e-59 |
| 359495653 | 204 | PREDICTED: uncharacterized protein LOC10 | 0.980 | 0.975 | 0.586 | 4e-59 |
| 388506910 | 199 | unknown [Lotus japonicus] | 0.837 | 0.854 | 0.523 | 2e-40 |
| 351723571 | 197 | uncharacterized protein LOC100499731 [Gl | 0.798 | 0.822 | 0.523 | 4e-39 |
| 351727288 | 200 | uncharacterized protein LOC100305966 [Gl | 0.763 | 0.775 | 0.529 | 3e-37 |
| 357485585 | 182 | Glutamyl-tRNA synthetase [Medicago trunc | 0.773 | 0.862 | 0.532 | 7e-37 |
| 297798056 | 194 | threonine endopeptidase [Arabidopsis lyr | 0.921 | 0.963 | 0.423 | 1e-34 |
| 18420175 | 193 | uncharacterized protein [Arabidopsis tha | 0.911 | 0.958 | 0.436 | 6e-34 |
| 4467116 | 153 | hypothetical protein [Arabidopsis thalia | 0.714 | 0.947 | 0.487 | 8e-32 |
| 449508380 | 210 | PREDICTED: uncharacterized protein At4g0 | 0.763 | 0.738 | 0.384 | 4e-31 |
| >gi|225469778|ref|XP_002274430.1| PREDICTED: uncharacterized protein LOC100261101 [Vitis vinifera] gi|297735928|emb|CBI18704.3| unnamed protein product [Vitis vinifera] |
|---|
Score = 234 bits (597), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 122/208 (58%), Positives = 147/208 (70%), Gaps = 9/208 (4%)
Query: 1 MELCTAIQTQSAISNHHHPFLTATTAPALARTKSALSLKQTTVSRS--GILYYTNPLPKA 58
MELCT A SN HH L A L ++ KQ+ +SR+ G LY+ NPL +A
Sbjct: 1 MELCTT----RAFSNLHHRTLFNPLANRLRWKTISIPFKQSPISRTSPGSLYFNNPLLRA 56
Query: 59 T-SEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLT--NELL 115
+ SE +SSG DQY+ ++RD ED+PA E+NVYNE + T P E+S V+ T E L
Sbjct: 57 SISEGSSSGADQYIGEERDSVLVMEDIPATEENVYNEVIPTEAPIEDSQVEEQTVAFEFL 116
Query: 116 DNLKIKFDSEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWR 175
DNL IKFDSED YS+ LYGTGAL ALW + +VGAIDSIP+FPKLME+VGLGYTLWFS R
Sbjct: 117 DNLNIKFDSEDPYSIFLYGTGALTALWFASAIVGAIDSIPIFPKLMEIVGLGYTLWFSAR 176
Query: 176 YLLFKKNRDELATKIEELKQQVLGSNDD 203
YL+FK+NRDELA KIEELKQQVLGS D+
Sbjct: 177 YLIFKQNRDELAAKIEELKQQVLGSEDE 204
Source: Vitis vinifera; Species: Vitis vinifera; Genus: Vitis; Family: Vitaceae; Order: Vitales; Class: (not listed); Phylum: Streptophyta; Superkingdom: Eukaryota
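The Identity value in the summary table appears to come from the ratio on the "Identities" line of the alignment (122/208 ≈ 0.5865 for the top hit, listed as 0.586, presumably truncated). A minimal sketch of that arithmetic, assuming the BLAST text layout shown above; the helper name `parse_blast_stats` is hypothetical and not part of any BLAST toolkit:

```python
import re

def parse_blast_stats(line):
    """Extract matched/total ratios from a BLAST 'Identities = ...' summary line."""
    stats = {}
    # Each field looks like: Name = matched/total (percent%)
    for name, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[name.lower()] = int(num) / int(den)
    return stats

line = "Identities = 122/208 (58%), Positives = 147/208 (70%), Gaps = 9/208 (4%)"
stats = parse_blast_stats(line)
for name, fraction in stats.items():
    print(f"{name}: {fraction:.4f}")
```

The same helper can be applied to the summary lines of the other alignments in this report, including those that omit the Gaps field.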
| >gi|359495653|ref|XP_003635047.1| PREDICTED: uncharacterized protein LOC100853187 isoform 1 [Vitis vinifera] gi|359495655|ref|XP_003635048.1| PREDICTED: uncharacterized protein LOC100853187 isoform 2 [Vitis vinifera] gi|297735931|emb|CBI18707.3| unnamed protein product [Vitis vinifera] |
|---|
| >gi|388506910|gb|AFK41521.1| unknown [Lotus japonicus] |
|---|
| >gi|351723571|ref|NP_001235236.1| uncharacterized protein LOC100499731 [Glycine max] gi|255626121|gb|ACU13405.1| unknown [Glycine max] |
|---|
| >gi|351727288|ref|NP_001235620.1| uncharacterized protein LOC100305966 [Glycine max] gi|255627137|gb|ACU13913.1| unknown [Glycine max] |
|---|
| >gi|357485585|ref|XP_003613080.1| Glutamyl-tRNA synthetase [Medicago truncatula] gi|355514415|gb|AES96038.1| Glutamyl-tRNA synthetase [Medicago truncatula] gi|388495348|gb|AFK35740.1| unknown [Medicago truncatula] |
|---|
| >gi|297798056|ref|XP_002866912.1| threonine endopeptidase [Arabidopsis lyrata subsp. lyrata] gi|297312748|gb|EFH43171.1| threonine endopeptidase [Arabidopsis lyrata subsp. lyrata] |
|---|
| >gi|18420175|ref|NP_568035.1| uncharacterized protein [Arabidopsis thaliana] gi|21554198|gb|AAM63277.1| unknown [Arabidopsis thaliana] gi|107738205|gb|ABF83661.1| At4g38100 [Arabidopsis thaliana] gi|332661478|gb|AEE86878.1| uncharacterized protein [Arabidopsis thaliana] |
|---|
| >gi|4467116|emb|CAB37550.1| hypothetical protein [Arabidopsis thaliana] gi|7270793|emb|CAB80475.1| hypothetical protein [Arabidopsis thaliana] |
|---|
| >gi|449508380|ref|XP_004163298.1| PREDICTED: uncharacterized protein At4g01150, chloroplastic-like [Cucumis sativus] |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
| ID | Length | Definition | Q cover | H cover | Identity | E-value |
|---|---|---|---|---|---|---|
| Query | 203 | | | | | |
| TAIR\|locus:2121065 | 193 | AT4G38100 [Arabidopsis thalian | 0.931 | 0.979 | 0.381 | 1.3e-27 |
| TAIR\|locus:2125018 | 164 | AT4G01150 "AT4G01150" [Arabido | 0.517 | 0.640 | 0.449 | 5.8e-18 |
| TAIR\|locus:2044335 | 174 | PSI-P "photosystem I P subunit | 0.566 | 0.660 | 0.260 | 1.8e-09 |
| TAIR\|locus:2037435 | 156 | AT1G52220 "AT1G52220" [Arabido | 0.517 | 0.673 | 0.261 | 2.9e-07 |
| TAIR|locus:2121065 AT4G38100 [Arabidopsis thaliana (taxid:3702)] |
|---|
Score = 309 (113.8 bits), Expect = 1.3e-27, P = 1.3e-27
Identities = 79/207 (38%), Positives = 116/207 (56%)
Query: 1 MELCTAIQTQSAISNHHHPFXXXXXXXXXXXXKSALSLKQTTVS---RSGILYYTNPLPK 57
MELCT ++ + I++ F + +L L++ S +S L + P
Sbjct: 1 MELCT--RSTTIITHLPASFNGHGYLAGKSVDRISLPLQRNVASLVLQSRTLRCSRKFPG 58
Query: 58 AT-SEETSSGTDQYVVDKRDGATAAEDVPAVEKNVYNESVATAVPKEESPVDGLTNELLD 116
T +EETS+G +++ V+ RDG A A EKN +E A E+ L E L+
Sbjct: 59 ETVTEETSTGVNEFGVEDRDGVVVA----AEEKNSNSE----APQAEDEETQAL--EFLN 108
Query: 117 NLKIKFDSEDKYSLVLYXXXXXXXXXXXXXXXXXIDSIPLFPKLMEVVGLGYTLWFSWRY 176
++K+ DS+ YS++LY +++IPLFPKLMEVVGLGYTLWF+ RY
Sbjct: 109 DIKL--DSDKTYSILLYGSGAIVALYLTSAIVSSLEAIPLFPKLMEVVGLGYTLWFTTRY 166
Query: 177 LLFKKNRDELATKIEELKQQVLGSNDD 203
LLFK+NR+EL TK+ E+K+QVLGS+ +
Sbjct: 167 LLFKRNREELKTKVSEIKKQVLGSDSE 193
|
|
| TAIR|locus:2125018 AT4G01150 "AT4G01150" [Arabidopsis thaliana (taxid:3702)] |
|---|
| TAIR|locus:2044335 PSI-P "photosystem I P subunit" [Arabidopsis thaliana (taxid:3702)] |
|---|
| TAIR|locus:2037435 AT1G52220 "AT1G52220" [Arabidopsis thaliana (taxid:3702)] |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Failed to connect to the Ezypred server.
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Query: GSVIVG00011469001 (Vitis vinifera), SubName- Full=Chromosome undetermined scaffold_310, whole genome shotgun sequence; (204 aa)

| Protein | Evidence | Score |
|---|---|---|
| GSVIVG00027603001 | • | 0.505 |
| GSVIVG00002625001 | • | 0.505 |
| GSVIVG00007397001 | • | 0.504 |
| GSVIVG00004124001 | • | 0.502 |
| GSVIVG00035652001 | • | 0.501 |
| GSVIVG00020301001 | • | 0.501 |
| cemA | • | 0.500 |
| GSVIVG00017320001 | • | 0.500 |
| GSVIVG00009465001 | • | 0.500 |
| GSVIVG00034898001 | • | 0.499 |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
| ID | Length | Definition | E-value |
|---|---|---|---|
| Query | 203 | | |
| pfam14159 | 90 | pfam14159, DUF4308, Domain of unknown function (DU | 9e-34 |
| PLN02777 | 167 | PLN02777, PLN02777, photosystem I P subunit (PSI-P | 1e-16 |
| >gnl|CDD|222574 pfam14159, DUF4308, Domain of unknown function (DUF4308) |
|---|
Score = 115 bits (290), Expect = 9e-34
Identities = 45/81 (55%), Positives = 59/81 (72%), Gaps = 4/81 (4%)
Query: 123 DSEDKYSLVLYGTGA----LLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLL 178
EDKY L G GA ++ALWL+ V+ AIDSIPL P L+E+VGLGY+ WF +RYLL
Sbjct: 10 KFEDKYKRPLLGVGAIIAVIVALWLSAAVLDAIDSIPLLPGLLELVGLGYSGWFVYRYLL 69
Query: 179 FKKNRDELATKIEELKQQVLG 199
F ++R EL KI+ELK+++LG
Sbjct: 70 FSEDRQELLAKIQELKKEILG 90
|
This presumed domain is functionally uncharacterized. The family is found in bacteria and eukaryotes and is approximately 90 amino acids long. The domain occurs in several aminoacyl-tRNA synthetases as well as on its own in single-domain proteins. Length = 90
| >gnl|CDD|178376 PLN02777, PLN02777, photosystem I P subunit (PSI-P) |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
| ID | Length | Definition | Probability |
|---|---|---|---|
| Query | 203 | | |
| PLN02777 | 167 | photosystem I P subunit (PSI-P) | 100.0 |
| PF14159 | 90 | CAAD: CAAD domains of cyanobacterial aminoacyl-tRN | 99.96 |
| >PLN02777 photosystem I P subunit (PSI-P) |
|---|
Probab=100.00 E-value=3.8e-34 Score=238.56 Aligned_cols=93 Identities=34% Similarity=0.725 Sum_probs=89.4
Q ss_pred chhHHHHHHhhhhcc-ccCchhhhhHHHHHHHHHHHHHHHHHHhccCCCcccceeeeeeeeeeeehhhhcccccchHHHH
Q 028840 109 GLTNELLDNLKIKFD-SEDKYSLVLYGTGALLALWLTTVVVGAIDSIPLFPKLMEVVGLGYTLWFSWRYLLFKKNRDELA 187 (203)
Q Consensus 109 ~q~~E~l~~l~~kwd-~edK~av~~~g~gaiVAL~v~~aVL~AID~IPLLp~lLELVGLgYs~WFvyRyLLfke~RqEL~ 187 (203)
.+..|+++.++++|| .|||+++++++++++|++|++.+||+|||+|||+|++||||||||++||+||||+|+++|+||+
T Consensus 74 ~~~~ei~k~~~e~Wd~~EdK~av~~l~~aaiVal~v~~~VL~AId~lPLlP~lLELVGigYs~WF~yRyLLfke~ReeL~ 153 (167)
T PLN02777 74 TELPEIVKTVQEAWDKVEDKYAVSSLAFAGVVALWGSAGMISAIDRLPLVPGVLELVGIGYTGWFAYKNLVFKPDREALI 153 (167)
T ss_pred ccHHHHHHHHHHHHhhhcchhHHHHHHHHHHHHHHHHHHHHHHHhccccccchHHHhhhhhhhhhhhhHhcCcccHHHHH
Confidence 345799999999999 6999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHhhhhcCCC
Q 028840 188 TKIEELKQQVLGSN 201 (203)
Q Consensus 188 ~kI~~lk~~IlG~~ 201 (203)
++|+++|++|+|++
T Consensus 154 ~ki~~lk~~IlG~s 167 (167)
T PLN02777 154 EKIKDTYKEIIGSS 167 (167)
T ss_pred HHHHHHHHHhhCCC
Confidence 99999999999963
|
|
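The HHsearch hit header above packs its statistics into `Key=value` tokens on a single line. A minimal sketch of reading that line into named fields, assuming every token has the `Key=value` shape seen in this report; the function name `parse_hh_header` is hypothetical and not part of the HH-suite tools:

```python
def parse_hh_header(line):
    """Split an HHsearch hit header of 'Key=value' tokens into a dict of floats."""
    fields = {}
    for token in line.split():
        key, _, value = token.partition("=")
        # Percentages such as 'Identities=34%' are stored as plain numbers.
        fields[key] = float(value.rstrip("%"))
    return fields

header = ("Probab=100.00 E-value=3.8e-34 Score=238.56 Aligned_cols=93 "
          "Identities=34% Similarity=0.725 Sum_probs=89.4")
hit = parse_hh_header(header)
print(hit["Probab"], hit["E-value"], hit["Aligned_cols"])
```

Storing every field as a float keeps the sketch short; a stricter parser might keep `Aligned_cols` as an integer.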
| >PF14159 CAAD: CAAD domains of cyanobacterial aminoacyl-tRNA synthetase |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00
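The structure searches above report a hit only when its E-value is below 0.005 (BLAST, RPS-BLAST) or its probability is above 80.00 (HHsearch). A minimal sketch of those two filters follows. The function names are hypothetical, and applying the cutoffs to the CDD hits is only an illustration with sample values copied from the tables earlier in this report; the report itself states these cutoffs only for the structure searches.

```python
E_VALUE_CUTOFF = 0.005      # "e-value below 0.005" in the report
PROBABILITY_CUTOFF = 80.0   # "probability above 80.00" in the report

def keep_by_evalue(hits, cutoff=E_VALUE_CUTOFF):
    """Keep (id, e_value) pairs whose E-value is strictly below the cutoff."""
    return [(hit_id, e) for hit_id, e in hits if e < cutoff]

def keep_by_probability(hits, cutoff=PROBABILITY_CUTOFF):
    """Keep (id, probability) pairs whose probability is strictly above the cutoff."""
    return [(hit_id, p) for hit_id, p in hits if p > cutoff]

# Sample input only: values copied from the CDD tables in this report.
rps_blast_hits = [("pfam14159", 9e-34), ("PLN02777", 1e-16)]
hhsearch_hits = [("PLN02777", 100.0), ("PF14159", 99.96)]

print(keep_by_evalue(rps_blast_hits))
print(keep_by_probability(hhsearch_hits))
# An empty hit list falls through to the message the report prints.
print(keep_by_evalue([]) or "No hit with e-value below 0.005")
```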