Citrus Sinensis ID: 046231
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 83 | ||||||
| 224108792 | 182 | predicted protein [Populus trichocarpa] | 1.0 | 0.456 | 0.654 | 4e-23 | |
| 255556478 | 181 | conserved hypothetical protein [Ricinus | 1.0 | 0.458 | 0.674 | 1e-22 | |
| 15240514 | 171 | uncharacterized protein [Arabidopsis tha | 0.963 | 0.467 | 0.641 | 3e-22 | |
| 225450543 | 166 | PREDICTED: uncharacterized protein LOC10 | 0.975 | 0.487 | 0.619 | 6e-22 | |
| 330318610 | 167 | hypothetical protein [Camellia sinensis] | 0.987 | 0.491 | 0.623 | 7e-22 | |
| 330318682 | 144 | hypothetical protein [Camellia sinensis] | 0.987 | 0.569 | 0.623 | 8e-22 | |
| 356519899 | 164 | PREDICTED: uncharacterized protein LOC10 | 1.0 | 0.506 | 0.602 | 1e-21 | |
| 449443512 | 196 | PREDICTED: uncharacterized protein LOC10 | 0.975 | 0.413 | 0.607 | 2e-20 | |
| 449449036 | 181 | PREDICTED: uncharacterized protein LOC10 | 0.987 | 0.453 | 0.6 | 1e-19 | |
| 449449038 | 170 | PREDICTED: uncharacterized protein LOC10 | 0.987 | 0.482 | 0.6 | 1e-19 |
| >gi|224108792|ref|XP_002314969.1| predicted protein [Populus trichocarpa] gi|222864009|gb|EEF01140.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Score = 112 bits (279), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 55/84 (65%), Positives = 65/84 (77%), Gaps = 1/84 (1%)
Query: 1 ECGHVKELGFVWLKHKQKKK-HKFENVVVWFDTEVTAYFERNKIKNLTGVKAKEFLIWIS 59
E ++K+ GFV LKH K+K HKF+ V V +D EVTAYFE N+I+NLTGVKAKEFLIWI+
Sbjct: 67 EYRYIKDTGFVSLKHNSKRKDHKFDKVAVCYDNEVTAYFEPNRIRNLTGVKAKEFLIWIT 126
Query: 60 LCEIYVNGSSPNGSITFKTPAGLS 83
L EIYV+G P ITFKTPAG S
Sbjct: 127 LSEIYVSGDIPVALITFKTPAGFS 150
|
Source: Populus trichocarpa Species: Populus trichocarpa Genus: Populus Family: Salicaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|255556478|ref|XP_002519273.1| conserved hypothetical protein [Ricinus communis] gi|223541588|gb|EEF43137.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|15240514|ref|NP_199771.1| uncharacterized protein [Arabidopsis thaliana] gi|10177627|dbj|BAB10774.1| unnamed protein product [Arabidopsis thaliana] gi|52354533|gb|AAU44587.1| hypothetical protein AT5G49600 [Arabidopsis thaliana] gi|55740681|gb|AAV63933.1| hypothetical protein At5g49600 [Arabidopsis thaliana] gi|332008451|gb|AED95834.1| uncharacterized protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|225450543|ref|XP_002281636.1| PREDICTED: uncharacterized protein LOC100245914 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|330318610|gb|AEC10970.1| hypothetical protein [Camellia sinensis] | Back alignment and taxonomy information |
|---|
| >gi|330318682|gb|AEC11001.1| hypothetical protein [Camellia sinensis] | Back alignment and taxonomy information |
|---|
| >gi|356519899|ref|XP_003528606.1| PREDICTED: uncharacterized protein LOC100784975 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|449443512|ref|XP_004139521.1| PREDICTED: uncharacterized protein LOC101214389 [Cucumis sativus] gi|449530432|ref|XP_004172199.1| PREDICTED: uncharacterized LOC101214389 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|449449036|ref|XP_004142271.1| PREDICTED: uncharacterized protein LOC101204611 isoform 1 [Cucumis sativus] gi|449525441|ref|XP_004169726.1| PREDICTED: uncharacterized protein LOC101227935 isoform 1 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|449449038|ref|XP_004142272.1| PREDICTED: uncharacterized protein LOC101204611 isoform 2 [Cucumis sativus] gi|449525443|ref|XP_004169727.1| PREDICTED: uncharacterized protein LOC101227935 isoform 2 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 83 | ||||||
| TAIR|locus:2157824 | 171 | AT5G49600 "AT5G49600" [Arabido | 0.963 | 0.467 | 0.576 | 4e-19 | |
| TAIR|locus:2027579 | 166 | SVB "SMALLER WITH VARIABLE BRA | 1.0 | 0.5 | 0.458 | 7.9e-14 | |
| TAIR|locus:2203043 | 179 | AT1G09310 [Arabidopsis thalian | 0.975 | 0.452 | 0.482 | 1.3e-13 | |
| TAIR|locus:2170463 | 143 | AT5G46230 [Arabidopsis thalian | 0.987 | 0.573 | 0.423 | 1.3e-13 | |
| TAIR|locus:2134966 | 157 | AT4G24130 [Arabidopsis thalian | 0.975 | 0.515 | 0.357 | 3.2e-10 | |
| TAIR|locus:2198247 | 157 | AT1G30020 [Arabidopsis thalian | 0.783 | 0.414 | 0.373 | 3.6e-09 | |
| TAIR|locus:2133264 | 167 | AT4G02370 "AT4G02370" [Arabido | 0.542 | 0.269 | 0.333 | 0.00012 | |
| TAIR|locus:505006096 | 166 | AT1G02816 "AT1G02816" [Arabido | 0.542 | 0.271 | 0.333 | 0.00048 |
| TAIR|locus:2157824 AT5G49600 "AT5G49600" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 229 (85.7 bits), Expect = 4.0e-19, P = 4.0e-19
Identities = 53/92 (57%), Positives = 60/92 (65%)
Query: 1 ECGHVKELGFVWLXXXXXXXXX----------FENVVVWFDTEVTAYFERNKIKNLTGVK 50
ECG+VK+ GFVWL F+NVVV F+ EVTAYFE N+IK LTGVK
Sbjct: 44 ECGYVKDTGFVWLKHKEKKKEDQKRRYQDLLRFDNVVVCFEDEVTAYFEPNRIKKLTGVK 103
Query: 51 AKEFLIWISLCEIYVNGSSPNGSITFKTPAGL 82
AKEF++WISL EI VN SS G ITFKT GL
Sbjct: 104 AKEFMVWISLGEIQVNRSS--GLITFKTEVGL 133
|
|
| TAIR|locus:2027579 SVB "SMALLER WITH VARIABLE BRANCHES" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2203043 AT1G09310 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2170463 AT5G46230 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2134966 AT4G24130 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2198247 AT1G30020 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2133264 AT4G02370 "AT4G02370" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:505006096 AT1G02816 "AT1G02816" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 83 | |||
| pfam04398 | 108 | pfam04398, DUF538, Protein of unknown function, DU | 7e-22 |
| >gnl|CDD|218064 pfam04398, DUF538, Protein of unknown function, DUF538 | Back alignment and domain information |
|---|
Score = 81.1 bits (201), Expect = 7e-22
Identities = 21/78 (26%), Positives = 33/78 (42%), Gaps = 3/78 (3%)
Query: 1 ECGHVKELGFVWLKHKQKKKHKFENVVVWFDTEVTAYFERNKIKNLTGVKAKEFLIWISL 60
E + G W+ + FE V +D VT Y + ++ L GVK K +W+ +
Sbjct: 22 EYTLDETTGKFWVYLNGTCEFTFEGYSVRYDPTVTGYISKGRLSGLEGVKVKVLFLWVPI 81
Query: 61 CEIYVNGSSPNGSITFKT 78
EI V+ +TF
Sbjct: 82 VEISVDDG---DKLTFSV 96
|
This family consists of several plant proteins of unknown function. Length = 108 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 83 | |||
| PF04398 | 110 | DUF538: Protein of unknown function, DUF538; Inter | 99.97 |
| >PF04398 DUF538: Protein of unknown function, DUF538; InterPro: IPR007493 This family consists of several plant proteins of unknown function | Back alignment and domain information |
|---|
Probab=99.97 E-value=1.1e-31 Score=180.44 Aligned_cols=78 Identities=45% Similarity=0.836 Sum_probs=60.4
Q ss_pred CcceeccccEEEEEecCeEEEEEceEEEEEccEEEEEEecCceeeceeeeEEEEEEeeEeeEEEEcCCCCCCeEEEEcCC
Q 046231 1 ECGHVKELGFVWLKHKQKKKHKFENVVVWFDTEVTAYFERNKIKNLTGVKAKEFLIWISLCEIYVNGSSPNGSITFKTPA 80 (83)
Q Consensus 1 e~g~~~~tG~~~v~l~~~c~~~f~~~~v~Y~~~Itg~l~~gki~~l~GVk~K~lf~W~~v~eI~vdg~~~sgkI~F~~~~ 80 (83)
||++|++||+|||+|+++|+|+|++++|+|+++|||+|++|+|++|+||++|+||+|++|+||.++ +++|+|++ +
T Consensus 22 ~y~l~~~tG~f~v~l~~~C~~~~~~~~v~Y~~~ItG~i~~g~i~~L~GVk~k~l~~W~~v~~i~~~----~~~i~F~~-g 96 (110)
T PF04398_consen 22 EYGLNRDTGFFWVKLKSPCEFRFEGYLVSYDSEITGYIEKGKIKNLTGVKVKELFLWVPVTEISVD----GDKIYFKV-G 96 (110)
T ss_dssp EEEE-TTT-SEEEE-SS-EEEESTTSEEEE-SEEEEEE-SS-EEEEES-EEE-SSSEES---BEE-----SSSEE-TT-S
T ss_pred EEEEecCCcEEEEEecCCEEEEEEEEEEEEcCeEEEEECCCcCccccCEEEEEEEEEeeEEEEEEc----CCEEEEEE-e
Confidence 689999999999999999999999978999999999999999999999999999999999999996 88999999 6
Q ss_pred CCC
Q 046231 81 GLS 83 (83)
Q Consensus 81 g~~ 83 (83)
+++
T Consensus 97 ~~s 99 (110)
T PF04398_consen 97 GIS 99 (110)
T ss_dssp SSS
T ss_pred eEe
Confidence 653
|
; PDB: 1YDU_A. |
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 83 | |||
| 1ydu_A | 170 | AT5G01610; DUF538, structural genomics, protein st | 2e-16 | |
| 1qzv_F | 154 | Plant photosystem I: subunit PSAF; photosynthesis, | 8e-06 |
| >1ydu_A AT5G01610; DUF538, structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG unknown function; NMR {Arabidopsis thaliana} SCOP: b.162.1.1 Length = 170 | Back alignment and structure |
|---|
Score = 68.3 bits (167), Expect = 2e-16
Identities = 15/79 (18%), Positives = 34/79 (43%), Gaps = 6/79 (7%)
Query: 1 ECGHVKELGFVWLKHKQKKKHKFEN-VVVWFDTEVTAYFERNKIKNLTGVKAKEFLIWIS 59
+E + + + +++ V+ F T VT + E+ K+ ++ G+K K +IW+
Sbjct: 74 NYEFDEETKKLTVLIPSICEVGYKDSSVLKFTTTVTGHLEKGKLTDVEGIKTK-VMIWVK 132
Query: 60 LCEIYVNGSSPNGSITFKT 78
+ I + + F
Sbjct: 133 VTSISTDA----SKVYFTA 147
|
| >1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant photosynthetic reaction center, peripheral antenna; HET: CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1 Length = 154 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 83 | |||
| 1ydu_A | 170 | AT5G01610; DUF538, structural genomics, protein st | 99.95 |
| >1ydu_A AT5G01610; DUF538, structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG unknown function; NMR {Arabidopsis thaliana} SCOP: b.162.1.1 | Back alignment and structure |
|---|
Probab=99.95 E-value=1.7e-29 Score=180.15 Aligned_cols=75 Identities=21% Similarity=0.392 Sum_probs=71.0
Q ss_pred CcceeccccEEEEEecCeEEEEEceE-EEEEccEEEEEEecCceeeceeeeEEEEEEeeEeeEEEEcCCCCCCeEEEEcC
Q 046231 1 ECGHVKELGFVWLKHKQKKKHKFENV-VVWFDTEVTAYFERNKIKNLTGVKAKEFLIWISLCEIYVNGSSPNGSITFKTP 79 (83)
Q Consensus 1 e~g~~~~tG~~~v~l~~~c~~~f~~~-~v~Y~~~Itg~l~~gki~~l~GVk~K~lf~W~~v~eI~vdg~~~sgkI~F~~~ 79 (83)
|||||++||+|||+|+++|+|+|++. +|+|+++|||+|++|+|++|+||++|+| +|++|++|.++ +++|+|++
T Consensus 74 ~Y~l~~~tG~f~V~l~~~C~~~f~~~~~v~Y~~~VtG~l~~GkI~~L~GVk~K~L-lWv~V~eI~v~----~~kI~F~v- 147 (170)
T 1ydu_A 74 NYEFDEETKKLTVLIPSICEVGYKDSSVLKFTTTVTGHLEKGKLTDVEGIKTKVM-IWVKVTSISTD----ASKVYFTA- 147 (170)
T ss_dssp EEEECTTTCSEEEECSSCEEEESTTSSEEEECSEEEEEECSSCEEEEESCEEESS-SEESCCCBEEC----SSSEECTT-
T ss_pred EEEEECCCcEEEEEeCCCEEEEecCccEEEEcCEEEEEEcCCcCccccCEEEEEE-EEeeEEEEEEe----CCEEEEEE-
Confidence 68999999999999999999999865 8999999999999999999999999999 99999999996 78999997
Q ss_pred CCC
Q 046231 80 AGL 82 (83)
Q Consensus 80 ~g~ 82 (83)
|+
T Consensus 148 -Gi 149 (170)
T 1ydu_A 148 -GM 149 (170)
T ss_dssp -SS
T ss_pred -cC
Confidence 55
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 83 | ||||
| d1ydua1 | 169 | b.162.1.1 (A:2-170) Hypothetical protein At5g01610 | 1e-16 |
| >d1ydua1 b.162.1.1 (A:2-170) Hypothetical protein At5g01610 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 169 | Back information, alignment and structure |
|---|
class: All beta proteins fold: At5g01610-like superfamily: At5g01610-like family: At5g01610-like domain: Hypothetical protein At5g01610 species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Score = 67.4 bits (165), Expect = 1e-16
Identities = 15/79 (18%), Positives = 34/79 (43%), Gaps = 6/79 (7%)
Query: 1 ECGHVKELGFVWLKHKQKKKHKFEN-VVVWFDTEVTAYFERNKIKNLTGVKAKEFLIWIS 59
+E + + + +++ V+ F T VT + E+ K+ ++ G+K K +IW+
Sbjct: 73 NYEFDEETKKLTVLIPSICEVGYKDSSVLKFTTTVTGHLEKGKLTDVEGIKTK-VMIWVK 131
Query: 60 LCEIYVNGSSPNGSITFKT 78
+ I + + F
Sbjct: 132 VTSISTDA----SKVYFTA 146
|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 83 | |||
| d1ydua1 | 169 | Hypothetical protein At5g01610 {Thale cress (Arabi | 99.95 |
| >d1ydua1 b.162.1.1 (A:2-170) Hypothetical protein At5g01610 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: At5g01610-like superfamily: At5g01610-like family: At5g01610-like domain: Hypothetical protein At5g01610 species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=99.95 E-value=2.2e-29 Score=177.80 Aligned_cols=75 Identities=21% Similarity=0.406 Sum_probs=70.3
Q ss_pred CcceeccccEEEEEecCeEEEEEceE-EEEEccEEEEEEecCceeeceeeeEEEEEEeeEeeEEEEcCCCCCCeEEEEcC
Q 046231 1 ECGHVKELGFVWLKHKQKKKHKFENV-VVWFDTEVTAYFERNKIKNLTGVKAKEFLIWISLCEIYVNGSSPNGSITFKTP 79 (83)
Q Consensus 1 e~g~~~~tG~~~v~l~~~c~~~f~~~-~v~Y~~~Itg~l~~gki~~l~GVk~K~lf~W~~v~eI~vdg~~~sgkI~F~~~ 79 (83)
|++||++||+|||+|+++|+|+|++. +|+|+++|||+|++|+|++|+|||+| +|+|++|+||.+| +++|+|++
T Consensus 73 ~Y~l~~~TG~f~V~l~~~C~~~f~~sy~v~Y~~~VtG~ls~Gki~~L~GVkvK-lflWv~V~eI~vd----~~kI~F~v- 146 (169)
T d1ydua1 73 NYEFDEETKKLTVLIPSICEVGYKDSSVLKFTTTVTGHLEKGKLTDVEGIKTK-VMIWVKVTSISTD----ASKVYFTA- 146 (169)
T ss_dssp EEEECTTTCSEEEECSSCEEEESTTSSEEEECSEEEEEECSSCEEEEESCEEE-SSSEESCCCBEEC----SSSEECTT-
T ss_pred EEEEECCCcEEEEEeCCCEEEEecCceEEEEcCEEEEEEcCCcCccccceEEE-EEEEEEEEEEEEc----CCeEEEEe-
Confidence 57999999999999999999999875 89999999999999999999999999 7899999999998 67999995
Q ss_pred CCC
Q 046231 80 AGL 82 (83)
Q Consensus 80 ~g~ 82 (83)
||
T Consensus 147 -GI 148 (169)
T d1ydua1 147 -GM 148 (169)
T ss_dssp -SS
T ss_pred -cc
Confidence 55
|