Citrus Sinensis ID: 047235
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 276 | ||||||
| 449436415 | 267 | PREDICTED: uncharacterized protein LOC10 | 0.902 | 0.932 | 0.652 | 2e-98 | |
| 449507826 | 267 | PREDICTED: uncharacterized LOC101220175 | 0.902 | 0.932 | 0.648 | 8e-98 | |
| 225437593 | 261 | PREDICTED: uncharacterized protein LOC10 | 0.913 | 0.965 | 0.627 | 2e-95 | |
| 357442215 | 260 | hypothetical protein MTR_1g086840 [Medic | 0.782 | 0.830 | 0.718 | 3e-91 | |
| 363807932 | 264 | uncharacterized protein LOC100788939 [Gl | 0.775 | 0.810 | 0.728 | 8e-91 | |
| 255548419 | 282 | conserved hypothetical protein [Ricinus | 0.931 | 0.911 | 0.660 | 5e-90 | |
| 224068582 | 200 | predicted protein [Populus trichocarpa] | 0.724 | 1.0 | 0.727 | 2e-84 | |
| 351720764 | 265 | uncharacterized protein LOC100499918 [Gl | 0.775 | 0.807 | 0.691 | 3e-83 | |
| 242089393 | 277 | hypothetical protein SORBIDRAFT_09g00257 | 0.775 | 0.772 | 0.649 | 3e-79 | |
| 297814129 | 263 | hypothetical protein ARALYDRAFT_912030 [ | 0.746 | 0.783 | 0.693 | 3e-77 |
| >gi|449436415|ref|XP_004135988.1| PREDICTED: uncharacterized protein LOC101220175 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
Score = 364 bits (935), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 171/262 (65%), Positives = 219/262 (83%), Gaps = 13/262 (4%)
Query: 26 LHPQLQSP---RFSVLRSSIQPQP--------QAPPIKRESDTSRTEYKPGVLDDLFLSS 74
LHP SP RF+ +S ++ +P Q+ IK + +S+ EYKPG+LDD FL+
Sbjct: 8 LHPPTISPAPLRFTSSKSHLRNRPFILSCSALQSGSIK-DGASSKAEYKPGILDDFFLNV 66
Query: 75 FRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSSDARDAAQIRILVSLFPPLVLKLYK 134
FR+K+VQEVG DSEKPGYDGLIE+ + L M GK++S+ +A+ +RIL++LFPPL+LKLY+
Sbjct: 67 FRSKMVQEVGWDSEKPGYDGLIEVASRLTMTGKTNSETIEAS-VRILIALFPPLLLKLYR 125
Query: 135 ILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESK 194
IL+SP+AGGK+AA+MVARVTALTCQWLMG CTVNS++LPDG+SCQSGVFVE+CKYLEESK
Sbjct: 126 ILVSPIAGGKVAAIMVARVTALTCQWLMGTCTVNSIELPDGSSCQSGVFVEKCKYLEESK 185
Query: 195 CVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILPPLPKDDTTLKEPCLDI 254
C+G+CINTCKLPTQ+FFKD MG+PLLMEPNF+DYSCQFKFG+LPPLP++D+ LKEPCL+I
Sbjct: 186 CIGICINTCKLPTQSFFKDQMGIPLLMEPNFTDYSCQFKFGVLPPLPEEDSILKEPCLEI 245
Query: 255 CPTSSRRKEVAMNSNVEQCPKA 276
CP ++RR+EV+ + QCPKA
Sbjct: 246 CPNATRRREVSGKISAAQCPKA 267
|
Source: Cucumis sativus Species: Cucumis sativus Genus: Cucumis Family: Cucurbitaceae Order: Cucurbitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|449507826|ref|XP_004163139.1| PREDICTED: uncharacterized LOC101220175 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|225437593|ref|XP_002271003.1| PREDICTED: uncharacterized protein LOC100253777 [Vitis vinifera] gi|297743993|emb|CBI36963.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|357442215|ref|XP_003591385.1| hypothetical protein MTR_1g086840 [Medicago truncatula] gi|355480433|gb|AES61636.1| hypothetical protein MTR_1g086840 [Medicago truncatula] | Back alignment and taxonomy information |
|---|
| >gi|363807932|ref|NP_001242708.1| uncharacterized protein LOC100788939 [Glycine max] gi|255647168|gb|ACU24052.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|255548419|ref|XP_002515266.1| conserved hypothetical protein [Ricinus communis] gi|223545746|gb|EEF47250.1| conserved hypothetical protein [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|224068582|ref|XP_002302776.1| predicted protein [Populus trichocarpa] gi|222844502|gb|EEE82049.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|351720764|ref|NP_001237955.1| uncharacterized protein LOC100499918 [Glycine max] gi|255627671|gb|ACU14180.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|242089393|ref|XP_002440529.1| hypothetical protein SORBIDRAFT_09g002570 [Sorghum bicolor] gi|241945814|gb|EES18959.1| hypothetical protein SORBIDRAFT_09g002570 [Sorghum bicolor] | Back alignment and taxonomy information |
|---|
| >gi|297814129|ref|XP_002874948.1| hypothetical protein ARALYDRAFT_912030 [Arabidopsis lyrata subsp. lyrata] gi|297320785|gb|EFH51207.1| hypothetical protein ARALYDRAFT_912030 [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 276 | ||||||
| TAIR|locus:504955232 | 258 | AT4G01995 "AT4G01995" [Arabido | 0.789 | 0.844 | 0.659 | 3.8e-78 | |
| TAIR|locus:2010856 | 250 | AT1G64680 "AT1G64680" [Arabido | 0.706 | 0.78 | 0.347 | 6.2e-30 | |
| TAIR|locus:505006097 | 264 | D27 "AT1G03055" [Arabidopsis t | 0.721 | 0.753 | 0.303 | 6.6e-26 |
| TAIR|locus:504955232 AT4G01995 "AT4G01995" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 786 (281.7 bits), Expect = 3.8e-78, P = 3.8e-78
Identities = 149/226 (65%), Positives = 179/226 (79%)
Query: 51 IKRESDTSRTEYKPGVLDDLFLSSFRNKLVQEVGLDSEKPGYDGLIELVNHLMMKGKSSS 110
+K + + EYKPG LDD F+ SFRNKLV+EVG DSEKPGY GLIELV L++KG++ S
Sbjct: 41 VKSDEGAPKLEYKPGPLDDFFMQSFRNKLVEEVGSDSEKPGYVGLIELVKLLLLKGRTRS 100
Query: 111 DARDAAQIRILVSLFPPLVLKLYKILISPLAGGKIAAMMVARVTALTCQWLMGHCTVNSV 170
+ DAA +RIL SLFPPL+L+LYK+LI+P+A GK+AA+MVARVT LTCQWLMG VN +
Sbjct: 101 ETSDAA-VRILKSLFPPLILELYKLLIAPIAQGKLAALMVARVTVLTCQWLMGPSKVNII 159
Query: 171 DLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSC 230
DLP+G S SGVFVE+C+YLEESKCVGVCINTCKLPTQTFFKDYMGVPL+MEPNF DYSC
Sbjct: 160 DLPNGESWDSGVFVEKCQYLEESKCVGVCINTCKLPTQTFFKDYMGVPLVMEPNFKDYSC 219
Query: 231 QFKFGILPPLPKDDTTLKEPCLDICPTSSRRKEVAMNSNVEQCPKA 276
QFKFG+ PP +DD + EPC + C + RRK + S +CP A
Sbjct: 220 QFKFGVAPP--EDDGNVNEPCFETCSIAGRRK---LKSG--ECPLA 258
|
|
| TAIR|locus:2010856 AT1G64680 "AT1G64680" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:505006097 D27 "AT1G03055" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| gw1.II.884.1 | hypothetical protein (200 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 276 | |||
| pfam13225 | 85 | pfam13225, DUF4033, Domain of unknown function (DU | 2e-47 |
| >gnl|CDD|221990 pfam13225, DUF4033, Domain of unknown function (DUF4033) | Back alignment and domain information |
|---|
Score = 152 bits (387), Expect = 2e-47
Identities = 50/85 (58%), Positives = 63/85 (74%)
Query: 160 WLMGHCTVNSVDLPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPL 219
WL+G C VN V+ G +SGV +++C+YLEES C G+C+NTCKLPTQTFFK+ G+PL
Sbjct: 1 WLVGPCEVNEVENGLGRGEKSGVLIKKCRYLEESGCAGMCVNTCKLPTQTFFKEEFGLPL 60
Query: 220 LMEPNFSDYSCQFKFGILPPLPKDD 244
MEPNF DYSCQ FG PP ++D
Sbjct: 61 TMEPNFEDYSCQMIFGQTPPPIEED 85
|
This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is approximately 80 amino acids in length. Length = 85 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 276 | |||
| PF13225 | 86 | DUF4033: Domain of unknown function (DUF4033) | 100.0 |
| >PF13225 DUF4033: Domain of unknown function (DUF4033) | Back alignment and domain information |
|---|
Probab=100.00 E-value=6.4e-55 Score=343.15 Aligned_cols=85 Identities=56% Similarity=1.171 Sum_probs=82.7
Q ss_pred ccccceeeeeee-CCCCceeccceEeeccccccccCcccccccccCCchhhhhhhhcCCceeeccCcccceeeeeeCCCC
Q 047235 160 WLMGHCTVNSVD-LPDGTSCQSGVFVERCKYLEESKCVGVCINTCKLPTQTFFKDYMGVPLLMEPNFSDYSCQFKFGILP 238 (276)
Q Consensus 160 WLVGP~eV~~ve-v~dG~~~~sgV~IeKCRyLEeSgC~GmCvN~CKiPTQ~FF~e~~GlPLtM~PNFED~SCqm~FG~~P 238 (276)
||||||+|+++| ..+|++++|||+||||||||||||+|||+|+||+|||+||+|+|||||||+||||||||||+||++|
T Consensus 1 WLvGp~~v~~~e~~~~~~~~~sgV~i~kCRyLEes~C~g~C~N~CK~PtQ~Ff~~~~Glpl~M~PNfed~SC~~~FG~~P 80 (86)
T PF13225_consen 1 WLVGPCEVNEVEENGNGRGQKSGVHIEKCRYLEESGCAGMCVNMCKIPTQTFFKEEFGLPLTMEPNFEDFSCQMIFGQTP 80 (86)
T ss_pred CcccccEeeEeeccCCCccccceEEEEEeEEeecCCceeeeehhcccchHHHHHhccCCceEecCCCcCcEEEEEcCCCC
Confidence 999999999999 3488889999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCCC
Q 047235 239 PLPKDD 244 (276)
Q Consensus 239 Pp~eeD 244 (276)
||+|||
T Consensus 81 pp~eeD 86 (86)
T PF13225_consen 81 PPIEED 86 (86)
T ss_pred CCCCCC
Confidence 999998
|
|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
No hit with probability above 80.00