Citrus Sinensis ID: 026654
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 235 | ||||||
| 356543780 | 297 | PREDICTED: protein THYLAKOID FORMATION1, | 0.974 | 0.771 | 0.847 | 1e-112 | |
| 356549970 | 297 | PREDICTED: protein THYLAKOID FORMATION1, | 0.978 | 0.774 | 0.834 | 1e-112 | |
| 255636566 | 297 | unknown [Glycine max] | 0.978 | 0.774 | 0.834 | 1e-112 | |
| 388514959 | 303 | unknown [Medicago truncatula] | 0.987 | 0.765 | 0.819 | 1e-112 | |
| 359485791 | 299 | PREDICTED: protein THYLAKOID FORMATION1, | 0.982 | 0.772 | 0.840 | 1e-111 | |
| 296084957 | 243 | unnamed protein product [Vitis vinifera] | 0.982 | 0.950 | 0.840 | 1e-111 | |
| 255553917 | 299 | Protein THYLAKOID FORMATION1, chloroplas | 0.974 | 0.765 | 0.830 | 1e-111 | |
| 449438054 | 298 | PREDICTED: protein THYLAKOID FORMATION 1 | 0.948 | 0.748 | 0.856 | 1e-110 | |
| 356542877 | 291 | PREDICTED: protein THYLAKOID FORMATION1, | 0.936 | 0.756 | 0.859 | 1e-109 | |
| 224124656 | 298 | predicted protein [Populus trichocarpa] | 0.961 | 0.758 | 0.841 | 1e-109 |
| >gi|356543780|ref|XP_003540338.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like [Glycine max] | Back alignment and taxonomy information |
|---|
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 194/229 (84%), Positives = 214/229 (93%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
+DVPPTV+ETK+NFLK YKRPIPSIYNTVLQELIVQQHLM+YKR+Y+YDPVFALGFVT+Y
Sbjct: 66 TDVPPTVSETKLNFLKAYKRPIPSIYNTVLQELIVQQHLMKYKRSYRYDPVFALGFVTIY 125
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS+EDR+AIFQAYI ALKEDPEQYRIDA+KLEEWAR Q +SLVEF SKEGEV
Sbjct: 126 DKLMEGYPSDEDRDAIFQAYIKALKEDPEQYRIDARKLEEWARVQKPTSLVEFSSKEGEV 185
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
EG+LKDIAERA GKG FSYSRFFAVGLFRLLELANATEPT+L+KLC LN+NKRSVDRDL
Sbjct: 186 EGILKDIAERAGGKGEFSYSRFFAVGLFRLLELANATEPTILDKLCVALNINKRSVDRDL 245
Query: 183 DVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYL 231
DVYR LLSKL+QAKELLKEY+DREKKKR+ER EPQKANEAI CLG+ L
Sbjct: 246 DVYRILLSKLVQAKELLKEYIDREKKKRDERAEPQKANEAITTCLGQQL 294
|
Source: Glycine max Species: Glycine max Genus: Glycine Family: Fabaceae Order: Fabales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|356549970|ref|XP_003543363.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|255636566|gb|ACU18621.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|388514959|gb|AFK45541.1| unknown [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|359485791|ref|XP_002275686.2| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|296084957|emb|CBI28372.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|255553917|ref|XP_002517999.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative [Ricinus communis] gi|223542981|gb|EEF44517.1| Protein THYLAKOID FORMATION1, chloroplast precursor, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|449438054|ref|XP_004136805.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like [Cucumis sativus] gi|449493105|ref|XP_004159194.1| PREDICTED: protein THYLAKOID FORMATION 1, chloroplastic-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|356542877|ref|XP_003539891.1| PREDICTED: protein THYLAKOID FORMATION1, chloroplastic-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|224124656|ref|XP_002319386.1| predicted protein [Populus trichocarpa] gi|222857762|gb|EEE95309.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 235 | ||||||
| TAIR|locus:2051353 | 300 | PSB29 "AT2G20890" [Arabidopsis | 0.978 | 0.766 | 0.705 | 2.7e-84 | |
| UNIPROTKB|P73956 | 240 | thf1 "Protein thf1" [Synechocy | 0.765 | 0.75 | 0.340 | 2.1e-22 |
| TAIR|locus:2051353 PSB29 "AT2G20890" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 844 (302.2 bits), Expect = 2.7e-84, P = 2.7e-84
Identities = 163/231 (70%), Positives = 189/231 (81%)
Query: 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVY 62
+DVPP V+ETK FLK YKRPIPSIYNTVLQELIVQQHLMRYK+TY+YDPVFALGFVTVY
Sbjct: 62 ADVPP-VSETKSKFLKAYKRPIPSIYNTVLQELIVQQHLMRYKKTYRYDPVFALGFVTVY 120
Query: 63 DRLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEV 122
D+LMEGYPS++DR+AIF+AYI AL EDP+QYRIDAQK+EEWAR QT++SLV+F SKEG++
Sbjct: 121 DQLMEGYPSDQDRDAIFKAYIEALNEDPKQYRIDAQKMEEWARSQTSASLVDFSSKEGDI 180
Query: 123 EGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDL 182
E +LKDIA RA K FSYSRFFAVGLFRLLELA+AT+PTVL+KLCA LN+NK+SVDRDL
Sbjct: 181 EAVLKDIAGRAGSKEGFSYSRFFAVGLFRLLELASATDPTVLDKLCASLNINKKSVDRDL 240
Query: 183 DVYRNXXXXXXXXXXXXXEYVDXXXXXXXXXTEPQKANEAIKKCLGEYLYS 233
DVYRN EYV+ + QKANE I KCLG+ LY+
Sbjct: 241 DVYRNLLSKLVQAKELLKEYVEREKKKQGERAQSQKANETISKCLGDTLYN 291
|
|
| UNIPROTKB|P73956 thf1 "Protein thf1" [Synechocystis sp. PCC 6803 substr. Kazusa (taxid:1111708)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 235 | |||
| PLN00047 | 283 | PLN00047, PLN00047, photosystem II biogenesis prot | 1e-154 | |
| PLN03060 | 206 | PLN03060, PLN03060, inositol phosphatase-like prot | 1e-110 | |
| pfam11264 | 216 | pfam11264, ThylakoidFormat, Thylakoid formation pr | 6e-89 | |
| PRK13266 | 225 | PRK13266, PRK13266, Thf1-like protein; Reviewed | 2e-59 | |
| TIGR03060 | 214 | TIGR03060, PS_II_psb29, photosystem II biogenesis | 6e-51 |
| >gnl|CDD|177679 PLN00047, PLN00047, photosystem II biogenesis protein Psb29; Provisional | Back alignment and domain information |
|---|
Score = 428 bits (1102), Expect = e-154
Identities = 190/229 (82%), Positives = 207/229 (90%)
Query: 4 DVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYD 63
DVPPTVAETK FLK YKRPIPSIY+TVLQEL+VQQHLMRYK+TY+YDPVFALGFVTVYD
Sbjct: 53 DVPPTVAETKAKFLKSYKRPIPSIYSTVLQELLVQQHLMRYKKTYRYDPVFALGFVTVYD 112
Query: 64 RLMEGYPSEEDREAIFQAYITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVE 123
+LMEGYPS+EDR+AIF+AYI AL EDPEQYR DA KLEEWAR QT SSLV+F SKEGE+E
Sbjct: 113 QLMEGYPSDEDRDAIFKAYIKALGEDPEQYRKDAAKLEEWARSQTGSSLVDFSSKEGEIE 172
Query: 124 GLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPTVLEKLCAVLNVNKRSVDRDLD 183
G+LKDIAERA KG FSYSRFFA+GLFRLLELANATEPT LEKLCA LN+NKRSVDRDLD
Sbjct: 173 GILKDIAERAGSKGKFSYSRFFAIGLFRLLELANATEPTALEKLCAALNINKRSVDRDLD 232
Query: 184 VYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLY 232
VYR LLSKL+QAKELLKEYV+REKKK+EER E QKANEA+ KCLGE
Sbjct: 233 VYRGLLSKLVQAKELLKEYVEREKKKQEERAESQKANEAVTKCLGELEQ 281
|
Length = 283 |
| >gnl|CDD|215554 PLN03060, PLN03060, inositol phosphatase-like protein; Provisional | Back alignment and domain information |
|---|
| >gnl|CDD|221050 pfam11264, ThylakoidFormat, Thylakoid formation protein | Back alignment and domain information |
|---|
| >gnl|CDD|237324 PRK13266, PRK13266, Thf1-like protein; Reviewed | Back alignment and domain information |
|---|
| >gnl|CDD|213765 TIGR03060, PS_II_psb29, photosystem II biogenesis protein Psp29 | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 235 | |||
| PLN00047 | 283 | photosystem II biogenesis protein Psb29; Provision | 100.0 | |
| PRK13266 | 225 | Thf1-like protein; Reviewed | 100.0 | |
| PF11264 | 216 | ThylakoidFormat: Thylakoid formation protein; Inte | 100.0 | |
| PLN03060 | 206 | inositol phosphatase-like protein; Provisional | 100.0 | |
| TIGR03060 | 214 | PS_II_psb29 photosystem II biogenesis protein Psp2 | 100.0 | |
| PF11264 | 216 | ThylakoidFormat: Thylakoid formation protein; Inte | 95.16 | |
| PRK13266 | 225 | Thf1-like protein; Reviewed | 94.9 | |
| TIGR03060 | 214 | PS_II_psb29 photosystem II biogenesis protein Psp2 | 94.7 | |
| PLN03060 | 206 | inositol phosphatase-like protein; Provisional | 93.46 | |
| PLN00047 | 283 | photosystem II biogenesis protein Psb29; Provision | 91.12 |
| >PLN00047 photosystem II biogenesis protein Psb29; Provisional | Back alignment and domain information |
|---|
Probab=100.00 E-value=3.8e-100 Score=688.46 Aligned_cols=232 Identities=82% Similarity=1.242 Sum_probs=227.8
Q ss_pred CCCCCchhHhHHHHHhhCCCcCCcchhHHHHHHHHHHhhhcccCccccccchhhhHHHHHHHHhcCCCCchhHHHHHHHH
Q 026654 3 SDVPPTVAETKMNFLKLYKRPIPSIYNTVLQELIVQQHLMRYKRTYQYDPVFALGFVTVYDRLMEGYPSEEDREAIFQAY 82 (235)
Q Consensus 3 ~~~~~TVSDTKr~F~~~~~~pI~siYrrvv~ELLVe~HLl~~n~~F~yD~lfAlG~vt~fd~fm~GY~pe~~~~~IF~Al 82 (235)
++++||||||||+||++|||||||||||||||||||||||+||++|+|||||||||||+||+||+||||++|+++||+||
T Consensus 52 ~~~~~TVSDTKr~F~~~yp~pIpsiYrrvvdELLVElHLLs~n~~F~yDplFALGlVtvfd~fm~GY~Pee~~~~IF~Al 131 (283)
T PLN00047 52 TDVPPTVAETKAKFLKSYKRPIPSIYSTVLQELLVQQHLMRYKKTYRYDPVFALGFVTVYDQLMEGYPSDEDRDAIFKAY 131 (283)
T ss_pred cCCCCcHHHHHHHHHHhCCCCCcHHHHHHHHHHHHHHHHHHhccCceeCchhhhhhHHHHHHHHccCCChHHHHHHHHHH
Confidence 68899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHhcCCCHHHHHHHHHHHHHHHhcCCccccccccCCcchhHHHHHHHHHHhcCCCCCcchHHHHHHHHHHHhhcCCCCHH
Q 026654 83 ITALKEDPEQYRIDAQKLEEWARGQTASSLVEFPSKEGEVEGLLKDIAERASGKGNFSYSRFFAVGLFRLLELANATEPT 162 (235)
Q Consensus 83 c~a~~~Dp~q~r~dA~~l~~~A~~~s~~~l~~~~~~~g~~~~~l~~Ia~~~~~~~~f~YSRlfAIGLf~LLE~~~~~d~~ 162 (235)
|+|+|+||++||+||++|++||+|+|.+++.+|+...|+++++|++||+++++|++||||||||||||+|||.++++||+
T Consensus 132 c~a~g~Dp~qyr~dA~~l~~~A~~~s~~~l~~~l~~~~~l~~~l~~IA~~a~~~~~f~YSRlfAIGLf~LLe~a~~~d~~ 211 (283)
T PLN00047 132 IKALGEDPEQYRKDAAKLEEWARSQTGSSLVDFSSKEGEIEGILKDIAERAGSKGKFSYSRFFAIGLFRLLELANATEPT 211 (283)
T ss_pred HHHcCCCHHHHHHHHHHHHHHHhcCCHHHHHHHHhcchHHHHHHHHHHHhhccCCCcchHHHHHHHHHHHHHhcCCCCHH
Confidence 99999999999999999999999999999999999999999999999988778999999999999999999999999999
Q ss_pred HHHHHHHhcCCChhhhHhhHHHHHhhHHHHHHHHHHHHHHHHHHhhhHHhhccchhhhHHHHHhhhhccccC
Q 026654 163 VLEKLCAVLNVNKRSVDRDLDVYRNLLSKLLQAKELLKEYVDREKKKREERTEPQKANEAIKKCLGEYLYSH 234 (235)
Q Consensus 163 ~l~~l~~~Lgls~~kv~kDL~lYrs~LeKm~QA~el~eE~~~~ekKKre~r~~~~~~~~~~~~~~~~~~~~~ 234 (235)
.+++||++|||++++|+|||+|||||||||+||+|||||++++|||||++|+++||+|+++|+|+|++++.+
T Consensus 212 ~l~~l~e~Lgls~~kv~KDLdlYrsnLeKm~QA~elmeE~~~~EkKKre~r~~~~~~~~~~~~~~~~~~~~~ 283 (283)
T PLN00047 212 ALEKLCAALNINKRSVDRDLDVYRGLLSKLVQAKELLKEYVEREKKKQEERAESQKANEAVTKCLGELEQAG 283 (283)
T ss_pred HHHHHHHHcCCCHHHHHhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHhcccchhccC
Confidence 999999999999999999999999999999999999999999999999999999999999999999998764
|
|
| >PRK13266 Thf1-like protein; Reviewed | Back alignment and domain information |
|---|
| >PF11264 ThylakoidFormat: Thylakoid formation protein; InterPro: IPR017499 Psp29, originally designated sll1414 (P73956 from SWISSPROT) in Synechocystis sp | Back alignment and domain information |
|---|
| >PLN03060 inositol phosphatase-like protein; Provisional | Back alignment and domain information |
|---|
| >TIGR03060 PS_II_psb29 photosystem II biogenesis protein Psp29 | Back alignment and domain information |
|---|
| >PF11264 ThylakoidFormat: Thylakoid formation protein; InterPro: IPR017499 Psp29, originally designated sll1414 (P73956 from SWISSPROT) in Synechocystis sp | Back alignment and domain information |
|---|
| >PRK13266 Thf1-like protein; Reviewed | Back alignment and domain information |
|---|
| >TIGR03060 PS_II_psb29 photosystem II biogenesis protein Psp29 | Back alignment and domain information |
|---|
| >PLN03060 inositol phosphatase-like protein; Provisional | Back alignment and domain information |
|---|
| >PLN00047 photosystem II biogenesis protein Psb29; Provisional | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 235 | |||
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 6e-05 |
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
Score = 42.9 bits (100), Expect = 6e-05
Identities = 40/267 (14%), Positives = 76/267 (28%), Gaps = 67/267 (25%)
Query: 23 PIPSIYNTVLQELIVQQHLMRYKRTYQ-----YDP----VFALG---FVT-----VYDRL 65
I +++ EL Y+ + F L +T V D L
Sbjct: 222 NIKLRIHSIQAELRRLLKSKPYENCLLVLLNVQNAKAWNAFNLSCKILLTTRFKQVTDFL 281
Query: 66 MEGYP------------SEEDREAIFQAYITALKED-PEQYRIDAQKLEEWARGQTASSL 112
+ ++ +++ Y+ +D P + A S+
Sbjct: 282 SAATTTHISLDHHSMTLTPDEVKSLLLKYLDCRPQDLPRE----VLTTNPRRLSIIAESI 337
Query: 113 VEFPSKEGEVEGLLKDIAERASGKGNFSYS---------RFFAVGLFR--------LLEL 155
+ + + + D S + F + +F LL L
Sbjct: 338 RDGLATWDNWKHVNCDKLTTIIES---SLNVLEPAEYRKMFDRLSVFPPSAHIPTILLSL 394
Query: 156 --ANATE---PTVLEKLCAVLNVNKRSVDRDL---DVYRNLLSKLLQAKELLKEYVDREK 207
+ + V+ KL V K+ + + +Y L KL L + VD
Sbjct: 395 IWFDVIKSDVMVVVNKLHKYSLVEKQPKESTISIPSIYLELKVKLENEYALHRSIVDHYN 454
Query: 208 KKREERTEPQKANEAIKKCLGEYLYSH 234
+ + ++ I L +Y YSH
Sbjct: 455 IPKTFDS-----DDLIPPYLDQYFYSH 476
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 235 | |||
| 3g80_A | 97 | Protein B2; RNA-binding, viral protein, suppressor | 82.08 |
| >3g80_A Protein B2; RNA-binding, viral protein, suppressor of RNAI, RNA interference; 2.50A {Nodamura virus} | Back alignment and structure |
|---|
Probab=82.08 E-value=1.8 Score=33.22 Aligned_cols=28 Identities=21% Similarity=0.353 Sum_probs=24.7
Q ss_pred HhcCCChhhhHhhHHHHHhhHHHHHHHH
Q 026654 169 AVLNVNKRSVDRDLDVYRNLLSKLLQAK 196 (235)
Q Consensus 169 ~~Lgls~~kv~kDL~lYrs~LeKm~QA~ 196 (235)
.+|....++|.|||+-|+.-|.|+.|-.
T Consensus 49 ~sl~~qDpnV~kDLdn~~acL~k~~~t~ 76 (97)
T 3g80_A 49 QTLMIADPNVNKDLRAFCEFLTVQHQRA 76 (97)
T ss_dssp HTCCCCCHHHHHHHHHHHHHHHHHHHHH
T ss_pred HhccccCchHHHHHHHHHHHHHHHHHHH
Confidence 5677889999999999999999998743
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00