Citrus Sinensis ID: 026716
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 234 | ||||||
| 255537455 | 347 | DNA binding protein, putative [Ricinus c | 0.867 | 0.585 | 0.588 | 3e-59 | |
| 225426407 | 346 | PREDICTED: uncharacterized protein LOC10 | 0.858 | 0.580 | 0.606 | 8e-59 | |
| 224053919 | 343 | predicted protein [Populus trichocarpa] | 0.846 | 0.577 | 0.594 | 3e-53 | |
| 224074919 | 346 | predicted protein [Populus trichocarpa] | 0.867 | 0.586 | 0.575 | 6e-53 | |
| 356540448 | 352 | PREDICTED: uncharacterized protein LOC10 | 0.841 | 0.559 | 0.549 | 2e-50 | |
| 297742528 | 309 | unnamed protein product [Vitis vinifera] | 0.735 | 0.556 | 0.556 | 1e-48 | |
| 356528260 | 352 | PREDICTED: uncharacterized protein LOC10 | 0.833 | 0.553 | 0.5 | 6e-47 | |
| 356513399 | 352 | PREDICTED: uncharacterized protein LOC10 | 0.833 | 0.553 | 0.5 | 1e-46 | |
| 356497236 | 357 | PREDICTED: uncharacterized protein LOC10 | 0.841 | 0.551 | 0.510 | 1e-44 | |
| 449452330 | 343 | PREDICTED: uncharacterized protein LOC10 | 0.867 | 0.591 | 0.539 | 7e-44 |
| >gi|255537455|ref|XP_002509794.1| DNA binding protein, putative [Ricinus communis] gi|223549693|gb|EEF51181.1| DNA binding protein, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 233 bits (595), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 133/226 (58%), Positives = 157/226 (69%), Gaps = 23/226 (10%)
Query: 1 MDGREAMVASGGVSGPYYIQQHRGAFSGSHSATQSGLMQTPPGFRSSSNLNIPTQPNV-- 58
MDGREAM + G S PYYI + G + +PPGFR +N N+ N
Sbjct: 1 MDGREAMALASG-STPYYIHRGGGVGGSGSGSQAG-GFHSPPGFRPLANPNLLAHSNTRP 58
Query: 59 ---GSTFAVEPKHVNFGH--NMSVPSGVPLSDPVKKKRGRPRKYAPDGQVSLGLSPLPAR 113
GS+F++EP ++NF H N++VPSG+P+ +PVKKKRGRPRKYAPDGQVSLGLSPLP +
Sbjct: 59 GSSGSSFSIEPSNINFVHGMNVAVPSGLPVGEPVKKKRGRPRKYAPDGQVSLGLSPLPVK 118
Query: 114 PKRSPASDSQALKRSRGRPPGTGRKQQLATLGEWMNSSAGIAFAPHVISIGVGE------ 167
PK S D + KR+RGRPPGTGRKQQLA LGEWMNSSAGIAF+PHVI IGVGE
Sbjct: 119 PKPSSGQDPLSPKRARGRPPGTGRKQQLALLGEWMNSSAGIAFSPHVIRIGVGEDIVAKV 178
Query: 168 ------RPRVVCILSGRGTVSSVTLRQPATSVPTVTYE--VEALCL 205
RPR +CILSG GTVSSVTLRQPA+S PT+T+E E LCL
Sbjct: 179 LSFAQQRPRALCILSGTGTVSSVTLRQPASSGPTLTFEGRFEILCL 224
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|225426407|ref|XP_002273061.1| PREDICTED: uncharacterized protein LOC100249560 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224053919|ref|XP_002298038.1| predicted protein [Populus trichocarpa] gi|222845296|gb|EEE82843.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|224074919|ref|XP_002304491.1| predicted protein [Populus trichocarpa] gi|222841923|gb|EEE79470.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|356540448|ref|XP_003538701.1| PREDICTED: uncharacterized protein LOC100790569 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|297742528|emb|CBI34677.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|356528260|ref|XP_003532722.1| PREDICTED: uncharacterized protein LOC100813888 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356513399|ref|XP_003525401.1| PREDICTED: uncharacterized protein LOC100798706 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356497236|ref|XP_003517468.1| PREDICTED: uncharacterized protein LOC100795781 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|449452330|ref|XP_004143912.1| PREDICTED: uncharacterized protein LOC101219973 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 234 | ||||||
| TAIR|locus:2031321 | 378 | AT1G63470 [Arabidopsis thalian | 0.504 | 0.312 | 0.637 | 1.6e-49 | |
| TAIR|locus:2031306 | 361 | AT1G63480 [Arabidopsis thalian | 0.799 | 0.518 | 0.475 | 4.4e-36 | |
| TAIR|locus:2050766 | 348 | AT2G45850 [Arabidopsis thalian | 0.803 | 0.540 | 0.416 | 2.3e-30 | |
| TAIR|locus:2098861 | 354 | AT3G61310 [Arabidopsis thalian | 0.807 | 0.533 | 0.410 | 7.3e-27 | |
| TAIR|locus:2051038 | 351 | AT2G33620 [Arabidopsis thalian | 0.474 | 0.316 | 0.480 | 7.8e-23 | |
| TAIR|locus:2122684 | 404 | AHL3 "AT-HOOK MOTIF NUCLEAR LO | 0.688 | 0.398 | 0.278 | 4.7e-15 | |
| TAIR|locus:2141045 | 439 | AT4G17950 [Arabidopsis thalian | 0.329 | 0.175 | 0.442 | 1.5e-14 | |
| TAIR|locus:2153142 | 419 | AHL4 "AT-HOOK MOTIF NUCLEAR LO | 0.410 | 0.229 | 0.411 | 2.1e-14 | |
| TAIR|locus:2178505 | 386 | AT5G46640 [Arabidopsis thalian | 0.316 | 0.191 | 0.448 | 2e-12 | |
| TAIR|locus:2084983 | 411 | AT3G04590 [Arabidopsis thalian | 0.559 | 0.318 | 0.347 | 8.4e-11 |
| TAIR|locus:2031321 AT1G63470 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 412 (150.1 bits), Expect = 1.6e-49, Sum P(2) = 1.6e-49
Identities = 86/135 (63%), Positives = 101/135 (74%)
Query: 82 PLSDP-VKKKRGRPRKYAPDGQVSLGLSPLPARPKRSP----ASDSQALKRSRGRPPGTG 136
P P VKKKRGRPRKY PDGQVSLGLSP+P K+S SD A KR+RGRPPGTG
Sbjct: 98 PSEQPMVKKKRGRPRKYVPDGQVSLGLSPMPCVSKKSKDSSSMSDPNAPKRARGRPPGTG 157
Query: 137 RKQQLATLGEWMNSSAGIAFAPHVISIGVGE------------RPRVVCILSGRGTVSSV 184
RKQ+LA LGEWMN+SAG+AFAPHVIS+G GE RPR +CI+SG GTVSSV
Sbjct: 158 RKQRLANLGEWMNTSAGLAFAPHVISVGSGEDIVSKVLSFSQKRPRALCIMSGTGTVSSV 217
Query: 185 TLRQPATSVPTVTYE 199
TLR+PA++ P++T+E
Sbjct: 218 TLREPASTTPSLTFE 232
|
|
| TAIR|locus:2031306 AT1G63480 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2050766 AT2G45850 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2098861 AT3G61310 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2051038 AT2G33620 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2122684 AHL3 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 3" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2141045 AT4G17950 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2153142 AHL4 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 4" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2178505 AT5G46640 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| TAIR|locus:2084983 AT3G04590 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| gw1.I.8978.1 | hypothetical protein (349 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 234 | |||
| pfam03479 | 120 | pfam03479, DUF296, Domain of unknown function (DUF | 6e-04 |
| >gnl|CDD|217587 pfam03479, DUF296, Domain of unknown function (DUF296) | Back alignment and domain information |
|---|
Score = 38.3 bits (90), Expect = 6e-04
Identities = 23/59 (38%), Positives = 27/59 (45%), Gaps = 15/59 (25%)
Query: 156 FAPHVISIGVGE------------RPRVVCILSGRGTVSSVTLRQP---ATSVPTVTYE 199
PHV+ + GE R +LSG G VS+VTLRQP A S VT E
Sbjct: 1 GRPHVLRLEPGEDLVESLEAFARQRGIGAAVLSGIGAVSNVTLRQPDEEAKSYGVVTLE 59
|
This putative domain is found in proteins that contain AT-hook motifs pfam02178, which strongly suggests a DNA-binding function for the proteins as a whole. There are three highly conserved histidine residues, eg at 117, 119 and 133 in Reut_B5223, which should be a structurally conserved metal-binding unit, based on structural comparison with known metal-binding structures. The proteins should work as trimers. Length = 120 |
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 234 | |||
| PF03479 | 120 | DUF296: Domain of unknown function (DUF296); Inter | 98.57 | |
| PF02178 | 13 | AT_hook: AT hook motif; InterPro: IPR017956 AT hoo | 96.56 | |
| smart00384 | 26 | AT_hook DNA binding domain with preference for A/T | 96.51 | |
| COG1661 | 141 | Predicted DNA-binding protein with PD1-like DNA-bi | 90.35 | |
| PF14621 | 219 | RFX5_DNA_bdg: RFX5 DNA-binding domain | 86.33 | |
| PF13546 | 273 | DDE_5: DDE superfamily endonuclease | 82.9 |
| >PF03479 DUF296: Domain of unknown function (DUF296); InterPro: IPR005175 This putative conserved domain is found in proteins that contain AT-hook motifs IPR000637 from INTERPRO, suggesting a DNA-binding function for the proteins as a whole, however, the function of this domain is unknown | Back alignment and domain information |
|---|
Probab=98.57 E-value=3.8e-08 Score=78.34 Aligned_cols=56 Identities=25% Similarity=0.218 Sum_probs=47.6
Q ss_pred ceeEEEEEcCCC------------CceeEEEecCCCceeEEEEcCCC--CCCCceEEE--EEeEeeeccccc
Q 026716 156 FAPHVISIGVGE------------RPRVVCILSGRGTVSSVTLRQPA--TSVPTVTYE--VEALCLYLFLFW 211 (234)
Q Consensus 156 ftPHVItV~~GE------------g~raICILSA~G~VSnVTLRQP~--ssggtvTyE--FEILSLSGSflp 211 (234)
|+++++.+.+|| .....|++|+.|++++|+|++++ ....+.+|| |||+||+|++.+
T Consensus 1 ~r~~~~rl~~Gedl~~~l~~~~~~~~i~~~~is~iGsl~~~~l~~~~~~~~~~~~~~~g~~Ei~sl~G~i~~ 72 (120)
T PF03479_consen 1 GRVFVIRLDPGEDLLESLEAFAREHGIRSGVISGIGSLSNVTLGYYDPPSYYEPLEFEGPFEIISLSGTISP 72 (120)
T ss_dssp EEEEEEEEETTSBHHHHHHHHHHHHT-SSEEEEEEEEEEEEEEEEEETTTEEEEEEEESEEEEEEEEEEEEE
T ss_pred CcEEEEEECCCCHHHHHHHHHHHHCCCcEEEEEEEeEEeEEEEEEecccCCcceEEecccEEEEEeEEEEEC
Confidence 789999999999 34667888999999999999974 334567788 999999999997
|
Overexpression of a protein containing this domain, Q9S7C9 from SWISSPROT, in Arabidopsis thaliana causes late flowering and modified leaf development []. ; PDB: 2DT4_A 2P6Y_A 3HWU_A 3HTN_A 2NMU_A 2H6L_A 2HX0_A. |
| >PF02178 AT_hook: AT hook motif; InterPro: IPR017956 AT hooks are DNA-binding motifs with a preference for A/T rich regions | Back alignment and domain information |
|---|
| >smart00384 AT_hook DNA binding domain with preference for A/T rich regions | Back alignment and domain information |
|---|
| >COG1661 Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only] | Back alignment and domain information |
|---|
| >PF14621 RFX5_DNA_bdg: RFX5 DNA-binding domain | Back alignment and domain information |
|---|
| >PF13546 DDE_5: DDE superfamily endonuclease | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 234 | |||
| 2hx0_A | 154 | Putative DNA-binding protein; NESG, PSI-2, SCR59, | 98.87 | |
| 2p6y_A | 142 | Hypothetical protein VCA0587; NESG, Q9KM02_vibch, | 98.84 | |
| 2h6l_A | 146 | Hypothetical protein; NESG GR103, structural genom | 98.42 | |
| 2dt4_A | 143 | Hypothetical protein PH0802; PPC domain, structura | 98.41 | |
| 3htn_A | 149 | Putative DNA binding protein; DUF269 family protei | 97.99 | |
| 2ezd_A | 26 | High mobility group protein HMG-I/HMG-Y; DNA bindi | 96.75 | |
| 3hwu_A | 147 | Putative DNA-binding protein; YP_299413.1, structu | 95.77 | |
| 4i95_A | 142 | Putative uncharacterized protein; lipocalin-like d | 83.96 |
| >2hx0_A Putative DNA-binding protein; NESG, PSI-2, SCR59, structural genomics, protein structure initiative; 1.55A {Salmonella choleraesuis} SCOP: d.290.1.3 PDB: 2nmu_A | Back alignment and structure |
|---|
Probab=98.87 E-value=3.4e-09 Score=86.84 Aligned_cols=61 Identities=18% Similarity=0.175 Sum_probs=54.5
Q ss_pred CCCCCceeEEEEEcCCC-------------CceeEEEecCCCceeEEEEcCCCCCCCceEEE--EEeEeeecccccc
Q 026716 151 SAGIAFAPHVISIGVGE-------------RPRVVCILSGRGTVSSVTLRQPATSVPTVTYE--VEALCLYLFLFWK 212 (234)
Q Consensus 151 s~g~~ftPHVItV~~GE-------------g~raICILSA~G~VSnVTLRQP~ssggtvTyE--FEILSLSGSflp~ 212 (234)
+....+++|+|++.+|| +-++.|||++.|++++|+||+++... +++|+ ||||||+|++.+.
T Consensus 13 ~~~~~~r~~vlrL~~Gedl~~~i~~~~~~~gi~~a~v~s~iGsl~~~~l~~~~~~~-~~~~~g~~EIlsl~Gti~~~ 88 (154)
T 2hx0_A 13 HNASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTLELT 88 (154)
T ss_dssp CSCCCCEEEEEEECTTCBHHHHHHHHHHHHTCSSEEEEEEEEEEEEEEEECTTCSS-CEEEEEEEEEEEEEEEEETT
T ss_pred CCCCCccEEEEEECCCCcHHHHHHHHHHHhCCCEEEEEEeEeeeEEeEEEccCCCc-cEecCCcEEEEEeeeeEeCC
Confidence 55678999999999999 67789999999999999999998765 67888 9999999999873
|
| >2p6y_A Hypothetical protein VCA0587; NESG, Q9KM02_vibch, VCR80, structural genomics, PSI-2, prote structure initiative; 1.63A {Vibrio cholerae} | Back alignment and structure |
|---|
| >2h6l_A Hypothetical protein; NESG GR103, structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium; 2.00A {Archaeoglobus fulgidus} SCOP: d.290.1.3 | Back alignment and structure |
|---|
| >2dt4_A Hypothetical protein PH0802; PPC domain, structural genomics, unknown function; 1.60A {Pyrococcus horikoshii} | Back alignment and structure |
|---|
| >3htn_A Putative DNA binding protein; DUF269 family protein, structural genomics, joint center for structural genomics, JCSG; HET: MSE 1PE; 1.50A {Bacteroides thetaiotaomicron vpi-5482} SCOP: d.290.1.0 | Back alignment and structure |
|---|
| >2ezd_A High mobility group protein HMG-I/HMG-Y; DNA binding protein, minor groove DNA binding, transcriptional CO-activator, architectural factor; HET: DNA; NMR {Homo sapiens} SCOP: j.10.1.1 PDB: 2eze_A* | Back alignment and structure |
|---|
| >3hwu_A Putative DNA-binding protein; YP_299413.1, structural genomi center for structural genomics, JCSG, protein structure INI PSI-2; HET: MSE; 1.30A {Ralstonia eutropha} | Back alignment and structure |
|---|
| >4i95_A Putative uncharacterized protein; lipocalin-like domain of PF13924 family, structural genomics center for structural genomics, JCSG; HET: MSE; 1.81A {Bacteroides eggerthii} PDB: 4gzv_A | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 234 | |||
| d2hx0a1 | 136 | Hypothetical protein STM3071 {Salmonella typhimuri | 98.08 | |
| d2h6la1 | 138 | Hypothetical protein AF0104 {Archaeoglobus fulgidu | 97.52 |
| >d2hx0a1 d.290.1.3 (A:6-141) Hypothetical protein STM3071 {Salmonella typhimurium [TaxId: 90371]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a+b) fold: AF0104/ALDC/Ptd012-like superfamily: AF0104/ALDC/Ptd012-like family: AF0104-like domain: Hypothetical protein STM3071 species: Salmonella typhimurium [TaxId: 90371]
Probab=98.08 E-value=2.9e-06 Score=66.20 Aligned_cols=60 Identities=18% Similarity=0.175 Sum_probs=52.0
Q ss_pred CCCceeEEEEEcCCC-------------CceeEEEecCCCceeEEEEcCCCCCCCceEEE--EEeEeeeccccccC
Q 026716 153 GIAFAPHVISIGVGE-------------RPRVVCILSGRGTVSSVTLRQPATSVPTVTYE--VEALCLYLFLFWKK 213 (234)
Q Consensus 153 g~~ftPHVItV~~GE-------------g~raICILSA~G~VSnVTLRQP~ssggtvTyE--FEILSLSGSflp~~ 213 (234)
++..+-+++.+.+|| +-++.||+++-|++++|+|+++.... ..+++ |||+||+|++.+.+
T Consensus 3 ~~~~R~~~lrl~~Gedl~~~i~~~~~~~~I~~a~V~~~iGs~~~~~~~~~~~~~-~~~~~g~~Ei~sl~G~I~~~~ 77 (136)
T d2hx0a1 3 ASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVISLNGTLELTG 77 (136)
T ss_dssp CCCCEEEEEEECTTCBHHHHHHHHHHHHTCSSEEEEEEEEEEEEEEEECTTCSS-CEEEEEEEEEEEEEEEEETTE
T ss_pred CCCCcEEEEEECCCChHHHHHHHHHHHhCCCEEEEEEEeeeeEEEEEEeCCCCC-cEEecCcEEEEEEEEEeccCC
Confidence 467899999999999 66899999999999999999987544 45677 99999999997654
|
| >d2h6la1 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {Archaeoglobus fulgidus [TaxId: 2234]} | Back information, alignment and structure |
|---|