Citrus Sinensis ID: 029699
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 189 | ||||||
| 359491684 | 167 | PREDICTED: uncharacterized protein LOC10 | 0.857 | 0.970 | 0.782 | 1e-73 | |
| 449440818 | 181 | PREDICTED: uncharacterized protein LOC10 | 0.835 | 0.872 | 0.787 | 5e-72 | |
| 255547311 | 173 | endonuclease, putative [Ricinus communis | 0.867 | 0.947 | 0.766 | 1e-70 | |
| 224119042 | 174 | predicted protein [Populus trichocarpa] | 0.873 | 0.948 | 0.747 | 9e-70 | |
| 118481005 | 180 | unknown [Populus trichocarpa] | 0.783 | 0.822 | 0.786 | 1e-68 | |
| 297850254 | 179 | predicted protein [Arabidopsis lyrata su | 0.777 | 0.821 | 0.802 | 1e-66 | |
| 15221835 | 186 | HNH endonuclease domain-containing prote | 0.783 | 0.795 | 0.783 | 2e-66 | |
| 351722082 | 162 | uncharacterized protein LOC100527904 [Gl | 0.809 | 0.944 | 0.740 | 1e-65 | |
| 218184321 | 178 | hypothetical protein OsI_33113 [Oryza sa | 0.904 | 0.960 | 0.648 | 2e-62 | |
| 115481558 | 178 | Os10g0335200 [Oryza sativa Japonica Grou | 0.904 | 0.960 | 0.648 | 3e-62 |
| >gi|359491684|ref|XP_003634305.1| PREDICTED: uncharacterized protein LOC100852985 [Vitis vinifera] gi|297733935|emb|CBI15182.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
Score = 281 bits (718), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 133/170 (78%), Positives = 148/170 (87%), Gaps = 8/170 (4%)
Query: 6 SSSSPNRSRSSNSNNSNEERPRYFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRF 65
SSSSP+RSR EERPR+FD KAKS+CWAKA+ VPGRHP+RWRKDAAGNIVCKRF
Sbjct: 2 SSSSPHRSRD------KEERPRFFDPKAKSLCWAKADAVPGRHPDRWRKDAAGNIVCKRF 55
Query: 66 CNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFT 125
CNC GCLCFEYDHI+PFSKGGES +NCQILQTRVNRFKSDK+ VD T+L+G+SCDVKFT
Sbjct: 56 CNCQGCLCFEYDHIIPFSKGGESVPENCQILQTRVNRFKSDKDNVDKTQLQGYSCDVKFT 115
Query: 126 DKELDIIEMAVYGDVIRPGNQCRCKTVAEMLGQFKSKDRWLLANCHMMVN 175
DKELDIIEMAVYGDVIRPGNQCRC+TVAEMLGQ+K KDR +A C + N
Sbjct: 116 DKELDIIEMAVYGDVIRPGNQCRCRTVAEMLGQYKPKDR--MAACKLPFN 163
|
Source: Vitis vinifera Species: Vitis vinifera Genus: Vitis Family: Vitaceae Order: Vitales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|449440818|ref|XP_004138181.1| PREDICTED: uncharacterized protein LOC101204599 [Cucumis sativus] gi|449477195|ref|XP_004154957.1| PREDICTED: uncharacterized protein LOC101224154 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|255547311|ref|XP_002514713.1| endonuclease, putative [Ricinus communis] gi|223546317|gb|EEF47819.1| endonuclease, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
| >gi|224119042|ref|XP_002317971.1| predicted protein [Populus trichocarpa] gi|222858644|gb|EEE96191.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|118481005|gb|ABK92456.1| unknown [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|297850254|ref|XP_002893008.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297338850|gb|EFH69267.1| predicted protein [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|15221835|ref|NP_173303.1| HNH endonuclease domain-containing protein [Arabidopsis thaliana] gi|6730716|gb|AAF27111.1|AC011809_20 Unknown protein [Arabidopsis thaliana] gi|26451201|dbj|BAC42704.1| unknown protein [Arabidopsis thaliana] gi|28973585|gb|AAO64117.1| unknown protein [Arabidopsis thaliana] gi|332191624|gb|AEE29745.1| HNH endonuclease domain-containing protein [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
| >gi|351722082|ref|NP_001235440.1| uncharacterized protein LOC100527904 [Glycine max] gi|255633516|gb|ACU17116.1| unknown [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|218184321|gb|EEC66748.1| hypothetical protein OsI_33113 [Oryza sativa Indica Group] | Back alignment and taxonomy information |
|---|
| >gi|115481558|ref|NP_001064372.1| Os10g0335200 [Oryza sativa Japonica Group] gi|21671981|gb|AAM74343.1|AC115686_10 Putative HNH endonuclease [Oryza sativa Japonica Group] gi|23477807|gb|AAN34947.1| Putative HNH endonuclease [Oryza sativa Japonica Group] gi|31431266|gb|AAP53074.1| HNH endonuclease family protein, expressed [Oryza sativa Japonica Group] gi|113638981|dbj|BAF26286.1| Os10g0335200 [Oryza sativa Japonica Group] gi|125574417|gb|EAZ15701.1| hypothetical protein OsJ_31114 [Oryza sativa Japonica Group] gi|215693775|dbj|BAG88974.1| unnamed protein product [Oryza sativa Japonica Group] gi|215707241|dbj|BAG93701.1| unnamed protein product [Oryza sativa Japonica Group] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 189 | ||||||
| TAIR|locus:2035020 | 186 | AT1G18680 [Arabidopsis thalian | 0.719 | 0.731 | 0.801 | 9e-61 | |
| TAIR|locus:2079137 | 234 | AT3G47490 [Arabidopsis thalian | 0.645 | 0.521 | 0.459 | 4.4e-29 |
| TAIR|locus:2035020 AT1G18680 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 622 (224.0 bits), Expect = 9.0e-61, P = 9.0e-61
Identities = 109/136 (80%), Positives = 123/136 (90%)
Query: 28 YFDGKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGCLCFEYDHIVPFSKGGE 87
+FDGKAK+ CWA A+ VPGRHPERWRKD AGNIVCKRF NC+GCLCFEYDHIVP+SKGGE
Sbjct: 42 FFDGKAKNKCWANADIVPGRHPERWRKDVAGNIVCKRFGNCNGCLCFEYDHIVPYSKGGE 101
Query: 88 STADNCQILQTRVNRFKSDKEQVDATKLKGFSCDVKFTDKELDIIEMAVYGDVIRPGNQC 147
S A+NCQILQTRVNRFKS +E VDAT LK +SC ++FTDKELD+IEMAVYGDV+RPG +C
Sbjct: 102 SIAENCQILQTRVNRFKSAQENVDATTLKSYSCGLQFTDKELDVIEMAVYGDVLRPGKEC 161
Query: 148 RCKTVAEMLGQFKSKD 163
RCKTVAE+LGQ KSKD
Sbjct: 162 RCKTVAELLGQSKSKD 177
|
|
| TAIR|locus:2079137 AT3G47490 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
Fail to connect to STRING server
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 189 | |||
| pfam01844 | 47 | pfam01844, HNH, HNH endonuclease | 6e-06 | |
| cd00085 | 57 | cd00085, HNHc, HNH nucleases; HNH endonuclease sig | 8e-04 | |
| smart00507 | 52 | smart00507, HNHc, HNH nucleases | 0.004 |
| >gnl|CDD|216737 pfam01844, HNH, HNH endonuclease | Back alignment and domain information |
|---|
Score = 41.5 bits (98), Expect = 6e-06
Identities = 15/47 (31%), Positives = 19/47 (40%), Gaps = 2/47 (4%)
Query: 61 VCKRFCNCHGCLCFEYDHIVPFSKGGESTADNCQILQTRVNRFKSDK 107
C R E DHI+P SKGG N +L +R K +
Sbjct: 3 YCGR--PFKDTDALEVDHIIPRSKGGADDLSNLVLLCRSCHRRKHAR 47
|
Length = 47 |
| >gnl|CDD|238038 cd00085, HNHc, HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins | Back alignment and domain information |
|---|
| >gnl|CDD|214702 smart00507, HNHc, HNH nucleases | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 189 | |||
| PF13395 | 54 | HNH_4: HNH endonuclease | 99.24 | |
| PF01844 | 47 | HNH: HNH endonuclease; InterPro: IPR002711 HNH end | 99.14 | |
| cd00085 | 57 | HNHc HNH nucleases; HNH endonuclease signature whi | 99.11 | |
| TIGR01865 | 805 | cas_Csn1 CRISPR-associated protein, Csn1 family. C | 98.89 | |
| COG1403 | 146 | McrA Restriction endonuclease [Defense mechanisms] | 98.88 | |
| smart00507 | 52 | HNHc HNH nucleases. | 98.8 | |
| TIGR02646 | 144 | conserved hypothetical protein TIGR02646. Members | 98.52 | |
| PRK11295 | 113 | hypothetical protein; Provisional | 98.49 | |
| PF14279 | 71 | HNH_5: HNH endonuclease | 96.69 | |
| PHA02565 | 157 | 49 recombination endonuclease VII; Provisional | 91.84 | |
| COG3513 | 1088 | Predicted CRISPR-associated nuclease, contains Mcr | 90.12 | |
| TIGR02986 | 424 | restrict_Alw26I type II restriction endonuclease, | 88.21 | |
| PF13391 | 66 | HNH_2: HNH endonuclease | 86.79 | |
| TIGR03031 | 802 | cas_csx12 CRISPR-associated protein, Csx12 family. | 84.18 | |
| PF05766 | 189 | NinG: Bacteriophage Lambda NinG protein; InterPro: | 83.61 | |
| COG3183 | 272 | Predicted restriction endonuclease [Defense mechan | 83.45 |
| >PF13395 HNH_4: HNH endonuclease | Back alignment and domain information |
|---|
Probab=99.24 E-value=2.6e-12 Score=87.47 Aligned_cols=36 Identities=36% Similarity=0.557 Sum_probs=35.0
Q ss_pred CccceeEeccCCCCChHhHHHHHhHHHHhhccCCch
Q 029699 74 FEYDHIVPFSKGGESTADNCQILQTRVNRFKSDKEQ 109 (189)
Q Consensus 74 ~evDHIiP~S~GG~~~~~NLq~lC~~CN~~K~n~~~ 109 (189)
+++|||+|+|.+|.+++.||+++|..||+.|++++|
T Consensus 18 ~~iDHiiP~s~~~~~s~~Nlvl~~~~~N~~K~~k~P 53 (54)
T PF13395_consen 18 YEIDHIIPRSRGGDDSFWNLVLCCKECNRSKGNKTP 53 (54)
T ss_pred ceeEEEecccccCCCCcchhheECHHHhhcccccCC
Confidence 799999999999999999999999999999999987
|
|
| >PF01844 HNH: HNH endonuclease; InterPro: IPR002711 HNH endonuclease is found in bacteria and viruses [, , ] | Back alignment and domain information |
|---|
| >cd00085 HNHc HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins | Back alignment and domain information |
|---|
| >TIGR01865 cas_Csn1 CRISPR-associated protein, Csn1 family | Back alignment and domain information |
|---|
| >COG1403 McrA Restriction endonuclease [Defense mechanisms] | Back alignment and domain information |
|---|
| >smart00507 HNHc HNH nucleases | Back alignment and domain information |
|---|
| >TIGR02646 conserved hypothetical protein TIGR02646 | Back alignment and domain information |
|---|
| >PRK11295 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PF14279 HNH_5: HNH endonuclease | Back alignment and domain information |
|---|
| >PHA02565 49 recombination endonuclease VII; Provisional | Back alignment and domain information |
|---|
| >COG3513 Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms] | Back alignment and domain information |
|---|
| >TIGR02986 restrict_Alw26I type II restriction endonuclease, Alw26I/Eco31I/Esp3I family | Back alignment and domain information |
|---|
| >PF13391 HNH_2: HNH endonuclease | Back alignment and domain information |
|---|
| >TIGR03031 cas_csx12 CRISPR-associated protein, Csx12 family | Back alignment and domain information |
|---|
| >PF05766 NinG: Bacteriophage Lambda NinG protein; InterPro: IPR008713 The ninR region of phage lambda contains two recombination genes, ninB (also known as orf) and ninG (also known as rap) | Back alignment and domain information |
|---|
| >COG3183 Predicted restriction endonuclease [Defense mechanisms] | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
No hit with e-value below 0.005
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
No hit with probability above 80.00
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 189 | |||
| d2gykb1 | 130 | DNase domain of colicin E9 {Escherichia coli [TaxI | 90.92 | |
| d2jb0b1 | 124 | DNase domain of colicin E7 {Escherichia coli [TaxI | 89.99 |
| >d2gykb1 d.4.1.1 (B:4-133) DNase domain of colicin E9 {Escherichia coli [TaxId: 562]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a+b) fold: His-Me finger endonucleases superfamily: His-Me finger endonucleases family: HNH-motif domain: DNase domain of colicin E9 species: Escherichia coli [TaxId: 562]
Probab=90.92 E-value=0.014 Score=30.41 Aligned_cols=75 Identities=20% Similarity=0.299 Sum_probs=44.9
Q ss_pred CCCCCCCC--HHHHHHHHHHCCCCCCCCCCCHHCCCCCCEEEECCCCCCCC---------CCCCCCEEEECCCCCC-CHH
Q ss_conf 88889887--88999999871899999951022013577375069987888---------8772113675369999-907
Q 029699 23 EERPRYFD--GKAKSMCWAKAETVPGRHPERWRKDAAGNIVCKRFCNCHGC---------LCFEYDHIVPFSKGGE-STA 90 (189)
Q Consensus 23 ~~~~R~f~--~~~R~~~w~ka~~v~Gr~~~r~R~da~gnvv~r~~~~C~~C---------~~~evDHIiP~s~GG~-~~~ 90 (189)
+.+-|.|. .+-|++.|.... .+|+...+-.-.|...-..+.-++- ..||++|++|.+.||. -++
T Consensus 38 kLrGr~F~sFd~FR~afW~~Va----~DpeL~~QF~~~N~~rmk~G~AP~a~~~e~~G~r~kfElHH~~~I~~GG~VYdi 113 (130)
T d2gykb1 38 KLRDKEFKSFDDFRKAVWEEVS----KDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRKVYELHHDKPISQGGEVYDM 113 (130)
T ss_dssp HHTTCEESSHHHHHHHHHHHHH----HCTTTGGGSCHHHHHHHHTTCCCBCCGGGCBTTBCBCEEEESSCGGGTCCSSBG
T ss_pred HHCCCCCCCHHHHHHHHHHHHH----CCHHHHHHHCHHHHHHHHCCCCCCCCHHHHCCCEEEEEECCCCCCCCCCEEEEC
T ss_conf 8658856888899999999981----099999985888899986889998982364198227994267432469845435
Q ss_pred HHHHHHHHHHH
Q ss_conf 68999939998
Q 029699 91 DNCQILQTRVN 101 (189)
Q Consensus 91 ~NLq~LC~~CN 101 (189)
+||.++-..-|
T Consensus 114 DNLrI~TPK~H 124 (130)
T d2gykb1 114 DNIRVTTPKRH 124 (130)
T ss_dssp GGEEEECHHHH
T ss_pred CCEEECCCHHH
T ss_conf 66055380746
|
| >d2jb0b1 d.4.1.1 (B:449-572) DNase domain of colicin E7 {Escherichia coli [TaxId: 562]} | Back information, alignment and structure |
|---|