Citrus Sinensis ID: 031557
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 157 | ||||||
| 224142377 | 196 | predicted protein [Populus trichocarpa] | 0.949 | 0.760 | 0.745 | 1e-59 | |
| 255577037 | 195 | conserved hypothetical protein [Ricinus | 0.942 | 0.758 | 0.745 | 4e-56 | |
| 225445842 | 196 | PREDICTED: UPF0133 protein Synpcc7942_04 | 0.949 | 0.760 | 0.712 | 2e-52 | |
| 356543329 | 196 | PREDICTED: UPF0133 protein Synpcc7942_04 | 0.949 | 0.760 | 0.679 | 1e-50 | |
| 351724105 | 195 | uncharacterized protein LOC100305795 [Gl | 0.942 | 0.758 | 0.653 | 2e-47 | |
| 388500608 | 188 | unknown [Lotus japonicus] | 0.923 | 0.771 | 0.657 | 6e-46 | |
| 223948417 | 186 | unknown [Zea mays] gi|413926357|gb|AFW66 | 0.904 | 0.763 | 0.652 | 6e-46 | |
| 413935880 | 265 | hypothetical protein ZEAMMB73_296576 [Ze | 0.910 | 0.539 | 0.630 | 2e-45 | |
| 388507246 | 188 | unknown [Lotus japonicus] | 0.923 | 0.771 | 0.651 | 2e-45 | |
| 226501168 | 186 | uncharacterized protein LOC100273390 [Ze | 0.910 | 0.768 | 0.630 | 3e-45 |
| >gi|224142377|ref|XP_002324535.1| predicted protein [Populus trichocarpa] gi|222865969|gb|EEF03100.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 234 bits (596), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 114/153 (74%), Positives = 133/153 (86%), Gaps = 4/153 (2%)
Query: 1 MASTSAFTAQISSIH----WKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSL 56
MAST+A TAQ+++++ WK+P +S NLNLN+N+ GK++LS+S +QKS +RS
Sbjct: 1 MASTTAVTAQVTNLNGLGDWKRPFPASLPLGNLNLNANVVGKQLLSWSGKQKSERNYRSF 60
Query: 57 RVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD 116
RV GLFGGKK+N+EK DDAPSKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELA+AEFD
Sbjct: 61 RVIGLFGGKKENSEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELASAEFD 120
Query: 117 GYCEGELIKVTLSGNQQPVRTEITEAAMELGAE 149
GYCEGELIKVTL+GNQQPVRTEITEAAMELGAE
Sbjct: 121 GYCEGELIKVTLTGNQQPVRTEITEAAMELGAE 153
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|255577037|ref|XP_002529403.1| conserved hypothetical protein [Ricinus communis] gi|223531151|gb|EEF32999.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|225445842|ref|XP_002277907.1| PREDICTED: UPF0133 protein Synpcc7942_0464 [Vitis vinifera] gi|297743656|emb|CBI36539.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|356543329|ref|XP_003540114.1| PREDICTED: UPF0133 protein Synpcc7942_0464-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|351724105|ref|NP_001236790.1| uncharacterized protein LOC100305795 [Glycine max] gi|255626621|gb|ACU13655.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|388500608|gb|AFK38370.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
| >gi|223948417|gb|ACN28292.1| unknown [Zea mays] gi|413926357|gb|AFW66289.1| hypothetical protein ZEAMMB73_706905 [Zea mays] | Back alignment and taxonomy information |
|---|
| >gi|413935880|gb|AFW70431.1| hypothetical protein ZEAMMB73_296576 [Zea mays] | Back alignment and taxonomy information |
|---|
| >gi|388507246|gb|AFK41689.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
| >gi|226501168|ref|NP_001141299.1| uncharacterized protein LOC100273390 [Zea mays] gi|194703866|gb|ACF86017.1| unknown [Zea mays] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 157 | ||||||
| TAIR|locus:2118721 | 180 | AT4G30620 "AT4G30620" [Arabido | 0.707 | 0.616 | 0.730 | 1.6e-38 | |
| TAIR|locus:2061471 | 182 | AT2G24020 "AT2G24020" [Arabido | 0.675 | 0.582 | 0.770 | 2.6e-38 |
| TAIR|locus:2118721 AT4G30620 "AT4G30620" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 412 (150.1 bits), Expect = 1.6e-38, P = 1.6e-38
Identities = 84/115 (73%), Positives = 100/115 (86%)
Query: 36 AGKEMLSFSSQQKSGHQFRSLRVYGLFGG-KKDNNEKGDDAPSKAGILGNMQNLYETVKK 94
+ ++ + + ++++SG+ RSLRV GLFGG KKDN E G SKAGILGNMQNLYETVKK
Sbjct: 27 SSRQNIVWLNRKQSGNNNRSLRVNGLFGGGKKDNKEDGQ---SKAGILGNMQNLYETVKK 83
Query: 95 AQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAE 149
AQMVVQVEAVRVQKELA AEFDGYC+GEL+KVTLSGNQQP+RT+IT+AAMELG+E
Sbjct: 84 AQMVVQVEAVRVQKELAVAEFDGYCQGELVKVTLSGNQQPIRTDITDAAMELGSE 138
|
|
| TAIR|locus:2061471 AT2G24020 "AT2G24020" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 157 | |||
| PRK00153 | 104 | PRK00153, PRK00153, hypothetical protein; Validate | 6e-10 | |
| pfam02575 | 92 | pfam02575, YbaB_DNA_bd, YbaB/EbfC DNA-binding fami | 1e-09 | |
| COG0718 | 105 | COG0718, COG0718, Uncharacterized protein conserve | 7e-06 | |
| TIGR00103 | 101 | TIGR00103, DNA_YbaB_EbfC, DNA-binding protein, Yba | 1e-04 |
| >gnl|CDD|234669 PRK00153, PRK00153, hypothetical protein; Validated | Back alignment and domain information |
|---|
Score = 52.9 bits (128), Expect = 6e-10
Identities = 24/62 (38%), Positives = 40/62 (64%), Gaps = 4/62 (6%)
Query: 83 GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEA 142
GNMQNL +K+AQ + Q + ++Q+ELA E +G G L+KVT++G ++ R +I +
Sbjct: 3 GNMQNL---MKQAQQM-QEKMQKMQEELAQMEVEGEAGGGLVKVTMTGKKEVKRVKIDPS 58
Query: 143 AM 144
+
Sbjct: 59 LV 60
|
Length = 104 |
| >gnl|CDD|217115 pfam02575, YbaB_DNA_bd, YbaB/EbfC DNA-binding family | Back alignment and domain information |
|---|
| >gnl|CDD|223790 COG0718, COG0718, Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
| >gnl|CDD|211546 TIGR00103, DNA_YbaB_EbfC, DNA-binding protein, YbaB/EbfC family | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 157 | |||
| PRK14624 | 115 | hypothetical protein; Provisional | 99.94 | |
| PRK14626 | 110 | hypothetical protein; Provisional | 99.93 | |
| PRK14625 | 109 | hypothetical protein; Provisional | 99.93 | |
| COG0718 | 105 | Uncharacterized protein conserved in bacteria [Fun | 99.92 | |
| PRK14627 | 100 | hypothetical protein; Provisional | 99.92 | |
| PRK14622 | 103 | hypothetical protein; Provisional | 99.92 | |
| PRK03762 | 103 | hypothetical protein; Provisional | 99.92 | |
| PRK14621 | 111 | hypothetical protein; Provisional | 99.91 | |
| PRK14623 | 106 | hypothetical protein; Provisional | 99.91 | |
| TIGR00103 | 102 | DNA_YbaB_EbfC DNA-binding protein, YbaB/EbfC famil | 99.91 | |
| PRK00587 | 99 | hypothetical protein; Provisional | 99.91 | |
| PRK14629 | 99 | hypothetical protein; Provisional | 99.9 | |
| PRK00153 | 104 | hypothetical protein; Validated | 99.9 | |
| PRK14628 | 118 | hypothetical protein; Provisional | 99.9 | |
| PF02575 | 93 | YbaB_DNA_bd: YbaB/EbfC DNA-binding family; InterPr | 99.74 | |
| PRK14628 | 118 | hypothetical protein; Provisional | 92.8 | |
| PRK00153 | 104 | hypothetical protein; Validated | 83.78 | |
| PF10921 | 109 | DUF2710: Protein of unknown function (DUF2710); In | 83.6 |
| >PRK14624 hypothetical protein; Provisional | Back alignment and domain information |
|---|
Probab=99.94 E-value=8.1e-27 Score=177.95 Aligned_cols=75 Identities=15% Similarity=0.217 Sum_probs=72.7
Q ss_pred CCCChhhHHHHHHHHHHHHHHHHHHHHHHHhccEEEEEecCCeEEEEEeecceeEEEEeCccccc-CChhHHhhhhc
Q 031557 81 ILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAME-LGAEVKTFTLS 156 (157)
Q Consensus 81 m~gnM~nm~emmKQAQk~MQ~km~k~QeEL~~~eveGssGGGlVkVt~nG~~eI~~I~IDpelLd-eD~EmLEDLI~ 156 (157)
||.+|+||++||||||+ ||++|+++|++|++++|+|+||||+|+||+||+++|++|+|||++++ +|+|+|||||.
T Consensus 1 ~~~~~~nm~~~mkqAq~-mQ~km~~~QeeL~~~~v~g~sGgG~VkV~~nG~~~i~~i~Idp~lld~eD~E~LeDLI~ 76 (115)
T PRK14624 1 MFDKIKNMSEALSNMGN-IREKMEEVKKRIASIRVVGDAGAGMVTVTATGEGQITNVFINKQLFDADDNKMLEDLVM 76 (115)
T ss_pred CcchHHhHHHHHHHHHH-HHHHHHHHHHHHhccEEEEEECCcEEEEEEEcCccEEEEEECHHHcCcccHHHHHHHHH
Confidence 67789999999999999 99999999999999999999999999999999999999999999998 78999999985
|
|
| >PRK14626 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PRK14625 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >COG0718 Uncharacterized protein conserved in bacteria [Function unknown] | Back alignment and domain information |
|---|
| >PRK14627 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PRK14622 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PRK03762 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PRK14621 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PRK14623 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >TIGR00103 DNA_YbaB_EbfC DNA-binding protein, YbaB/EbfC family | Back alignment and domain information |
|---|
| >PRK00587 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PRK14629 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PRK00153 hypothetical protein; Validated | Back alignment and domain information |
|---|
| >PRK14628 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PF02575 YbaB_DNA_bd: YbaB/EbfC DNA-binding family; InterPro: IPR004401 The function of this protein is unknown | Back alignment and domain information |
|---|
| >PRK14628 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PRK00153 hypothetical protein; Validated | Back alignment and domain information |
|---|
| >PF10921 DUF2710: Protein of unknown function (DUF2710); InterPro: IPR024296 This family of uncharacterised proteins appears to be restricted to Mycobacteriaceae | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 157 | |||
| 1ybx_A | 143 | Conserved hypothetical protein; ST genomics, PSI, | 2e-06 | |
| 1j8b_A | 112 | YBAB; hypothetical protein, structural genomics, s | 6e-06 | |
| 3f42_A | 99 | Protein HP0035; helicobacter pylori unknown-functi | 2e-05 |
| >1ybx_A Conserved hypothetical protein; ST genomics, PSI, protein structure initiative, southeast COLL for structural genomics, secsg; HET: MSE; 1.80A {Clostridium thermocellum} Length = 143 | Back alignment and structure |
|---|
Score = 43.6 bits (103), Expect = 2e-06
Identities = 19/66 (28%), Positives = 30/66 (45%), Gaps = 4/66 (6%)
Query: 80 GILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEI 139
G GN+ NL VK+AQ + Q + RVQ+EL + G + V +G + I
Sbjct: 38 GFGGNINNL---VKQAQKM-QRDMERVQEELKEKTVEASAGGGAVTVVATGRKDIKEITI 93
Query: 140 TEAAME 145
++
Sbjct: 94 KPEVVD 99
|
| >1j8b_A YBAB; hypothetical protein, structural genomics, structure function project, S2F, unknown function; HET: MSE; 1.75A {Haemophilus influenzae RD} SCOP: d.222.1.1 PDB: 1pug_A Length = 112 | Back alignment and structure |
|---|
| >3f42_A Protein HP0035; helicobacter pylori unknown-function, structural genomics, P protein structure initiative; HET: MSE; 1.78A {Helicobacter pylori} Length = 99 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 157 | |||
| 1ybx_A | 143 | Conserved hypothetical protein; ST genomics, PSI, | 99.93 | |
| 1j8b_A | 112 | YBAB; hypothetical protein, structural genomics, s | 99.92 | |
| 3f42_A | 99 | Protein HP0035; helicobacter pylori unknown-functi | 99.89 | |
| 1ybx_A | 143 | Conserved hypothetical protein; ST genomics, PSI, | 83.18 |
| >1ybx_A Conserved hypothetical protein; ST genomics, PSI, protein structure initiative, southeast COLL for structural genomics, secsg; HET: MSE; 1.80A {Clostridium thermocellum} | Back alignment and structure |
|---|
Probab=99.93 E-value=2.6e-26 Score=179.65 Aligned_cols=76 Identities=22% Similarity=0.246 Sum_probs=66.7
Q ss_pred CCCCC-hhhHHHHHHHHHHHHHHHHHHHHHHHhccEEEEEecCCeEEEEEeecceeEEEEeCccccc-CChhHHhhhhc
Q 031557 80 GILGN-MQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAME-LGAEVKTFTLS 156 (157)
Q Consensus 80 Gm~gn-M~nm~emmKQAQk~MQ~km~k~QeEL~~~eveGssGGGlVkVt~nG~~eI~~I~IDpelLd-eD~EmLEDLI~ 156 (157)
|+|+| |+||++||||||+ ||++|+++|+||++++|+|+||||+|+||+||+++|++|+|||++++ +|+|+|||||.
T Consensus 34 ~~~~~m~gnm~~mmkQAQk-mQ~km~k~QeeL~~~eveg~sGgGlVkVtvnG~~ev~~I~Idp~lldpeD~E~LeDLI~ 111 (143)
T 1ybx_A 34 GGFPGFGGNINNLVKQAQK-MQRDMERVQEELKEKTVEASAGGGAVTVVATGRKDIKEITIKPEVVDPDDVEMLQDLIL 111 (143)
T ss_dssp ---------CHHHHHHHHH-HHHHHHHHHHHHHHCEEEEEETTTTEEEEEETTCCEEEEEECGGGCCTTCHHHHHHHHH
T ss_pred cCCcccchhHHHHHHHHHH-HHHHHHHHHHHHhcCEEEEEECCCEEEEEEecCceEEEEEECHHHcCCcCHHHHHHHHH
Confidence 33444 3689999999999 99999999999999999999999999999999999999999999999 89999999984
|
| >1j8b_A YBAB; hypothetical protein, structural genomics, structure function project, S2F, unknown function; HET: MSE; 1.75A {Haemophilus influenzae RD} SCOP: d.222.1.1 PDB: 1pug_A | Back alignment and structure |
|---|
| >3f42_A Protein HP0035; helicobacter pylori unknown-function, structural genomics, P protein structure initiative; HET: MSE; 1.78A {Helicobacter pylori} | Back alignment and structure |
|---|
| >1ybx_A Conserved hypothetical protein; ST genomics, PSI, protein structure initiative, southeast COLL for structural genomics, secsg; HET: MSE; 1.80A {Clostridium thermocellum} | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 157 | ||||
| d1j8ba_ | 92 | d.222.1.1 (A:) Hypothetical protein HI0442 {Haemop | 5e-07 |
| >d1j8ba_ d.222.1.1 (A:) Hypothetical protein HI0442 {Haemophilus influenzae [TaxId: 727]} Length = 92 | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a+b) fold: YbaB-like superfamily: YbaB-like family: YbaB-like domain: Hypothetical protein HI0442 species: Haemophilus influenzae [TaxId: 727]
Score = 43.5 bits (103), Expect = 5e-07
Identities = 19/66 (28%), Positives = 35/66 (53%), Gaps = 4/66 (6%)
Query: 85 MQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAM 144
+ L +K+AQ + Q + ++Q+E+A E G L+K+T++G R +I + M
Sbjct: 1 LGGL---MKQAQQM-QEKMQKMQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPSLM 56
Query: 145 ELGAEV 150
E E+
Sbjct: 57 EDDKEM 62
|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 157 | |||
| d1j8ba_ | 92 | Hypothetical protein HI0442 {Haemophilus influenza | 99.9 |
| >d1j8ba_ d.222.1.1 (A:) Hypothetical protein HI0442 {Haemophilus influenzae [TaxId: 727]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a+b) fold: YbaB-like superfamily: YbaB-like family: YbaB-like domain: Hypothetical protein HI0442 species: Haemophilus influenzae [TaxId: 727]
Probab=99.90 E-value=2.3e-24 Score=154.31 Aligned_cols=67 Identities=27% Similarity=0.327 Sum_probs=64.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccEEEEEecCCeEEEEEeecceeEEEEeCcccccCChhHHhhhhc
Q 031557 89 YETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEVKTFTLS 156 (157)
Q Consensus 89 ~emmKQAQk~MQ~km~k~QeEL~~~eveGssGGGlVkVt~nG~~eI~~I~IDpelLdeD~EmLEDLI~ 156 (157)
.+||||||+ ||++|+++|++|++++|+|++|||+|+||+||+++|++|+|||+++++|+|+|||||.
T Consensus 2 ~~mmkqaq~-mq~~m~~~q~eL~~~~v~~~sg~g~V~V~~~G~~~v~~i~Id~~~l~~d~e~LedlI~ 68 (92)
T d1j8ba_ 2 GGLMKQAQQ-MQEKMQKMQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPSLMEDDKEMLEDLIA 68 (92)
T ss_dssp CCHHHHHHH-HHHHHHHHHHHHTTSEEEEEEGGGTEEEEEETTCCEEEEEECGGGGGSCHHHHHHHHH
T ss_pred hHHHHHHHH-HHHHHHHHHHHHhCeEEEEEECCCEEEEEEECCeEEEEEEECHHHhcCcHHHHHHHHH
Confidence 468999999 9999999999999999999999999999999999999999999999889999999984
|