Citrus Sinensis ID: 047759
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 102 | ||||||
| 449436677 | 137 | PREDICTED: uncharacterized protein LOC10 | 1.0 | 0.744 | 0.764 | 6e-38 | |
| 356542189 | 136 | PREDICTED: uncharacterized protein LOC10 | 0.980 | 0.735 | 0.87 | 1e-36 | |
| 225443814 | 133 | PREDICTED: uncharacterized protein LOC10 | 0.980 | 0.751 | 0.77 | 1e-33 | |
| 147778776 | 133 | hypothetical protein VITISV_003230 [Viti | 0.980 | 0.751 | 0.76 | 7e-33 | |
| 356546962 | 136 | PREDICTED: uncharacterized protein LOC10 | 0.980 | 0.735 | 0.84 | 8e-33 | |
| 224076966 | 136 | predicted protein [Populus trichocarpa] | 0.980 | 0.735 | 0.77 | 6e-32 | |
| 226509585 | 139 | uncharacterized protein LOC100278998 [Ze | 1.0 | 0.733 | 0.627 | 1e-31 | |
| 255563132 | 137 | conserved hypothetical protein [Ricinus | 1.0 | 0.744 | 0.754 | 2e-31 | |
| 226532856 | 138 | uncharacterized protein LOC100276192 [Ze | 1.0 | 0.739 | 0.627 | 2e-31 | |
| 224079557 | 136 | predicted protein [Populus trichocarpa] | 0.980 | 0.735 | 0.72 | 2e-31 |
| >gi|449436677|ref|XP_004136119.1| PREDICTED: uncharacterized protein LOC101210896 [Cucumis sativus] gi|449489173|ref|XP_004158237.1| PREDICTED: uncharacterized LOC101210896 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
Score = 161 bits (407), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 78/102 (76%), Positives = 91/102 (89%)
Query: 1 MAAEKEGGIVKKGHEEGLKLAVSLLKKFELPEGLLPLANVVEVGYVESTGYMWIVQQNKV 60
MA K GGIVKKG EEGL+LAV+LLK+FELPEGLLPLA+VVEVGYV+ TGY+WIVQ+ KV
Sbjct: 1 MADPKAGGIVKKGQEEGLELAVALLKEFELPEGLLPLADVVEVGYVKETGYVWIVQRKKV 60
Query: 61 EHEFKMISKLVSYDTEINGYVDKMKIKKLRGVKAKELMLWPP 102
EHEFKM+SKLVSYDTEI G++ +IKKL+GVKAKE +LWPP
Sbjct: 61 EHEFKMVSKLVSYDTEITGFILNKRIKKLKGVKAKEFLLWPP 102
|
Source: Cucumis sativus Species: Cucumis sativus Genus: Cucumis Family: Cucurbitaceae Order: Cucurbitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|356542189|ref|XP_003539552.1| PREDICTED: uncharacterized protein LOC100784386 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|225443814|ref|XP_002273711.1| PREDICTED: uncharacterized protein LOC100261780 [Vitis vinifera] gi|297740497|emb|CBI30679.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|147778776|emb|CAN71580.1| hypothetical protein VITISV_003230 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|356546962|ref|XP_003541888.1| PREDICTED: uncharacterized protein LOC100791850 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|224076966|ref|XP_002305071.1| predicted protein [Populus trichocarpa] gi|222848035|gb|EEE85582.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|226509585|ref|NP_001145546.1| uncharacterized protein LOC100278998 [Zea mays] gi|195657859|gb|ACG48397.1| hypothetical protein [Zea mays] | Back alignment and taxonomy information |
|---|
| >gi|255563132|ref|XP_002522570.1| conserved hypothetical protein [Ricinus communis] gi|223538261|gb|EEF39870.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|226532856|ref|NP_001143509.1| uncharacterized protein LOC100276192 [Zea mays] gi|195621716|gb|ACG32688.1| hypothetical protein [Zea mays] gi|413946620|gb|AFW79269.1| hypothetical protein ZEAMMB73_054395 [Zea mays] | Back alignment and taxonomy information |
|---|
| >gi|224079557|ref|XP_002305889.1| predicted protein [Populus trichocarpa] gi|222848853|gb|EEE86400.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 102 | ||||||
| TAIR|locus:2134966 | 157 | AT4G24130 [Arabidopsis thalian | 1.0 | 0.649 | 0.421 | 3.1e-19 | |
| TAIR|locus:2170463 | 143 | AT5G46230 [Arabidopsis thalian | 0.950 | 0.678 | 0.402 | 6.5e-19 | |
| TAIR|locus:2198247 | 157 | AT1G30020 [Arabidopsis thalian | 0.833 | 0.541 | 0.458 | 1.2e-17 | |
| TAIR|locus:2027579 | 166 | SVB "SMALLER WITH VARIABLE BRA | 0.745 | 0.457 | 0.526 | 1.6e-17 | |
| TAIR|locus:2203043 | 179 | AT1G09310 [Arabidopsis thalian | 0.745 | 0.424 | 0.513 | 2e-17 | |
| TAIR|locus:2157824 | 171 | AT5G49600 "AT5G49600" [Arabido | 0.764 | 0.456 | 0.372 | 8.4e-10 | |
| TAIR|locus:505006096 | 166 | AT1G02816 "AT1G02816" [Arabido | 0.872 | 0.536 | 0.333 | 6.1e-07 | |
| TAIR|locus:2133264 | 167 | AT4G02370 "AT4G02370" [Arabido | 0.813 | 0.497 | 0.344 | 7.8e-07 | |
| TAIR|locus:2166178 | 170 | AT5G37070 "AT5G37070" [Arabido | 0.950 | 0.570 | 0.281 | 5.5e-06 | |
| TAIR|locus:2149770 | 170 | AT5G01610 "AT5G01610" [Arabido | 0.901 | 0.541 | 0.275 | 5.8e-05 |
| TAIR|locus:2134966 AT4G24130 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 230 (86.0 bits), Expect = 3.1e-19, P = 3.1e-19
Identities = 43/102 (42%), Positives = 65/102 (63%)
Query: 1 MAAEKEGGIVKKGHEEGLKLAVSLLKKFELPEGLLPLANVVEVGYVESTGYMWIVQQNKV 60
+ E+EG + G EE K ++ LL++ P+G++PL N+VE G V +TGY+W+ Q
Sbjct: 9 VGGEREGAEIVYGAEECYKQSLELLEELGFPKGVMPLKNLVECGRVRATGYVWMKQDTPY 68
Query: 61 EHEFKMISKLVSYDTEINGYVDKMKIKKLRGVKAKELMLWPP 102
EH F+ + VSY E+ YVDK +KK+ GVK+K++ LW P
Sbjct: 69 EHFFEATNTRVSYGLEVTAYVDKCCMKKMTGVKSKQMFLWVP 110
|
|
| TAIR|locus:2170463 AT5G46230 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2198247 AT1G30020 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2027579 SVB "SMALLER WITH VARIABLE BRANCHES" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2203043 AT1G09310 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2157824 AT5G49600 "AT5G49600" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:505006096 AT1G02816 "AT1G02816" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2133264 AT4G02370 "AT4G02370" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2166178 AT5G37070 "AT5G37070" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2149770 AT5G01610 "AT5G01610" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| GSVIVG00035882001 | SubName- Full=Chromosome chr10 scaffold_81, whole genome shotgun sequence; (133 aa) | |||||||
(Vitis vinifera) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 102 | |||
| pfam04398 | 108 | pfam04398, DUF538, Protein of unknown function, DU | 1e-24 |
| >gnl|CDD|218064 pfam04398, DUF538, Protein of unknown function, DUF538 | Back alignment and domain information |
|---|
Score = 88.8 bits (221), Expect = 1e-24
Identities = 33/82 (40%), Positives = 43/82 (52%), Gaps = 3/82 (3%)
Query: 21 AVSLLKKFELPEGLLPLANVVEVGYVESTGYMWIVQQNKVEHEFKMISKLVSYDTEINGY 80
A LL+++ LP GLLP V E E+TG W+ E F+ S V YD + GY
Sbjct: 2 AYELLEEYGLPVGLLPK-GVTEYTLDETTGKFWVYLNGTCEFTFEGYS--VRYDPTVTGY 58
Query: 81 VDKMKIKKLRGVKAKELMLWPP 102
+ K ++ L GVK K L LW P
Sbjct: 59 ISKGRLSGLEGVKVKVLFLWVP 80
|
This family consists of several plant proteins of unknown function. Length = 108 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 102 | |||
| PF04398 | 110 | DUF538: Protein of unknown function, DUF538; Inter | 100.0 |
| >PF04398 DUF538: Protein of unknown function, DUF538; InterPro: IPR007493 This family consists of several plant proteins of unknown function | Back alignment and domain information |
|---|
Probab=100.00 E-value=5.5e-37 Score=213.31 Aligned_cols=80 Identities=51% Similarity=0.912 Sum_probs=63.3
Q ss_pred hHHHHHHhCCCCCCCcCCCCceEEEEEccccEEEEEecCeEEEEEeecCeEEEEceEEEEEEecCceeccceeEEEEEee
Q 047759 20 LAVSLLKKFELPEGLLPLANVVEVGYVESTGYMWIVQQNKVEHEFKMISKLVSYDTEINGYVDKMKIKKLRGVKAKELML 99 (102)
Q Consensus 20 ~a~elL~~~gLP~GLLP~~~V~~y~l~~~tG~f~v~l~~~c~~~f~~~~~~v~Y~~~ItG~l~~g~i~~L~GVk~k~lf~ 99 (102)
||||+|++||||+||||+ +|++|+||++||+|||+|+++|+++|+. |+|+|+++|||+|++|+|++|+|||+|+||+
T Consensus 1 tayelL~~~glP~GLLP~-~v~~y~l~~~tG~f~v~l~~~C~~~~~~--~~v~Y~~~ItG~i~~g~i~~L~GVk~k~l~~ 77 (110)
T PF04398_consen 1 TAYELLEEYGLPRGLLPL-GVTEYGLNRDTGFFWVKLKSPCEFRFEG--YLVSYDSEITGYIEKGKIKNLTGVKVKELFL 77 (110)
T ss_dssp --HHHHHHHS-TT-TTTS-SS-EEEE-TTT-SEEEE-SS-EEEESTT--SEEEE-SEEEEEE-SS-EEEEES-EEE-SSS
T ss_pred CHHHhHHHcCCCCCcCCC-CceEEEEecCCcEEEEEecCCEEEEEEE--EEEEEcCeEEEEECCCcCccccCEEEEEEEE
Confidence 699999999999999995 9999999999999999999999999964 8999999999999999999999999999999
Q ss_pred cCC
Q 047759 100 WPP 102 (102)
Q Consensus 100 W~~ 102 (102)
|++
T Consensus 78 W~~ 80 (110)
T PF04398_consen 78 WVP 80 (110)
T ss_dssp EES
T ss_pred Eee
Confidence 986
|
; PDB: 1YDU_A. |
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 102 | |||
| 1ydu_A | 170 | AT5G01610; DUF538, structural genomics, protein st | 9e-20 |
| >1ydu_A AT5G01610; DUF538, structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG unknown function; NMR {Arabidopsis thaliana} SCOP: b.162.1.1 Length = 170 | Back alignment and structure |
|---|
Score = 77.6 bits (191), Expect = 9e-20
Identities = 21/79 (26%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 24 LLKKFELPEGLLPLANVVEVGYVESTGYMWIVQQNKVEHEFKMISKLVSYDTEINGYVDK 83
LLK+++LP G+ P + + E T + ++ + E +K S ++ + T + G+++K
Sbjct: 57 LLKEYDLPIGIFP-GDATNYEFDEETKKLTVLIPSICEVGYK-DSSVLKFTTTVTGHLEK 114
Query: 84 MKIKKLRGVKAKELMLWPP 102
K+ + G+K K +M+W
Sbjct: 115 GKLTDVEGIKTK-VMIWVK 132
|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 102 | |||
| 1ydu_A | 170 | AT5G01610; DUF538, structural genomics, protein st | 100.0 |
| >1ydu_A AT5G01610; DUF538, structural genomics, protein structure initiative, center for eukaryotic structural genomics, CESG unknown function; NMR {Arabidopsis thaliana} SCOP: b.162.1.1 | Back alignment and structure |
|---|
Probab=100.00 E-value=2.7e-35 Score=217.23 Aligned_cols=84 Identities=25% Similarity=0.489 Sum_probs=80.5
Q ss_pred cchhhHHHHHHhCCCCCCCcCCCCceEEEEEccccEEEEEecCeEEEEEeecCeEEEEceEEEEEEecCceeccceeEEE
Q 047759 16 EGLKLAVSLLKKFELPEGLLPLANVVEVGYVESTGYMWIVQQNKVEHEFKMISKLVSYDTEINGYVDKMKIKKLRGVKAK 95 (102)
Q Consensus 16 ~~~~~a~elL~~~gLP~GLLP~~~V~~y~l~~~tG~f~v~l~~~c~~~f~~~~~~v~Y~~~ItG~l~~g~i~~L~GVk~k 95 (102)
.+++++||+|++||||+|||| ++|++|+||++||+|||+|+++|+|.|+. +++|+|+++|||+|++|+|++|+||++|
T Consensus 49 ~~~~ta~elL~e~gLP~GLLP-~~V~~Y~l~~~tG~f~V~l~~~C~~~f~~-~~~v~Y~~~VtG~l~~GkI~~L~GVk~K 126 (170)
T 1ydu_A 49 KMQKPLPELLKEYDLPIGIFP-GDATNYEFDEETKKLTVLIPSICEVGYKD-SSVLKFTTTVTGHLEKGKLTDVEGIKTK 126 (170)
T ss_dssp TTTSSCHHHHHHHSCTTCTTT-SSSCEEEECTTTCSEEEECSSCEEEESTT-SSEEEECSEEEEEECSSCEEEEESCEEE
T ss_pred cccccHHHHHHHcCCCCCcCC-CCCeEEEEECCCcEEEEEeCCCEEEEecC-ccEEEEcCEEEEEEcCCcCccccCEEEE
Confidence 367899999999999999999 79999999999999999999999999985 8999999999999999999999999999
Q ss_pred EEeecCC
Q 047759 96 ELMLWPP 102 (102)
Q Consensus 96 ~lf~W~~ 102 (102)
+| +|+|
T Consensus 127 ~L-lWv~ 132 (170)
T 1ydu_A 127 VM-IWVK 132 (170)
T ss_dssp SS-SEES
T ss_pred EE-EEee
Confidence 99 9986
|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 102 | ||||
| d1ydua1 | 169 | b.162.1.1 (A:2-170) Hypothetical protein At5g01610 | 1e-20 |
| >d1ydua1 b.162.1.1 (A:2-170) Hypothetical protein At5g01610 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 169 | Back information, alignment and structure |
|---|
class: All beta proteins fold: At5g01610-like superfamily: At5g01610-like family: At5g01610-like domain: Hypothetical protein At5g01610 species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Score = 79.0 bits (195), Expect = 1e-20
Identities = 21/79 (26%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 24 LLKKFELPEGLLPLANVVEVGYVESTGYMWIVQQNKVEHEFKMISKLVSYDTEINGYVDK 83
LLK+++LP G+ P + + E T + ++ + E +K S ++ + T + G+++K
Sbjct: 56 LLKEYDLPIGIFP-GDATNYEFDEETKKLTVLIPSICEVGYK-DSSVLKFTTTVTGHLEK 113
Query: 84 MKIKKLRGVKAKELMLWPP 102
K+ + G+K K +M+W
Sbjct: 114 GKLTDVEGIKTK-VMIWVK 131
|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 102 | |||
| d1ydua1 | 169 | Hypothetical protein At5g01610 {Thale cress (Arabi | 100.0 |
| >d1ydua1 b.162.1.1 (A:2-170) Hypothetical protein At5g01610 {Thale cress (Arabidopsis thaliana) [TaxId: 3702]} | Back information, alignment and structure |
|---|
class: All beta proteins fold: At5g01610-like superfamily: At5g01610-like family: At5g01610-like domain: Hypothetical protein At5g01610 species: Thale cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=100.00 E-value=1.7e-34 Score=211.01 Aligned_cols=83 Identities=27% Similarity=0.511 Sum_probs=79.0
Q ss_pred chhhHHHHHHhCCCCCCCcCCCCceEEEEEccccEEEEEecCeEEEEEeecCeEEEEceEEEEEEecCceeccceeEEEE
Q 047759 17 GLKLAVSLLKKFELPEGLLPLANVVEVGYVESTGYMWIVQQNKVEHEFKMISKLVSYDTEINGYVDKMKIKKLRGVKAKE 96 (102)
Q Consensus 17 ~~~~a~elL~~~gLP~GLLP~~~V~~y~l~~~tG~f~v~l~~~c~~~f~~~~~~v~Y~~~ItG~l~~g~i~~L~GVk~k~ 96 (102)
.+.+++|||++||||+|||| ++|++|+||++||+|||+|+++|++.|+. +|+|+|+++|||+|++|+|++|+|||+|
T Consensus 49 ~~k~l~ELL~eygLP~GLLP-~~V~~Y~l~~~TG~f~V~l~~~C~~~f~~-sy~v~Y~~~VtG~ls~Gki~~L~GVkvK- 125 (169)
T d1ydua1 49 MQKPLPELLKEYDLPIGIFP-GDATNYEFDEETKKLTVLIPSICEVGYKD-SSVLKFTTTVTGHLEKGKLTDVEGIKTK- 125 (169)
T ss_dssp TTSSCHHHHHHHSCTTCTTT-SSSCEEEECTTTCSEEEECSSCEEEESTT-SSEEEECSEEEEEECSSCEEEEESCEEE-
T ss_pred CCCCHHHHHHhcCCCCccCC-CCceEEEEECCCcEEEEEeCCCEEEEecC-ceEEEEcCEEEEEEcCCcCccccceEEE-
Confidence 56789999999999999999 79999999999999999999999999974 8999999999999999999999999999
Q ss_pred EeecCC
Q 047759 97 LMLWPP 102 (102)
Q Consensus 97 lf~W~~ 102 (102)
+|+|++
T Consensus 126 lflWv~ 131 (169)
T d1ydua1 126 VMIWVK 131 (169)
T ss_dssp SSSEES
T ss_pred EEEEEE
Confidence 789986
|