Citrus Sinensis ID: 028336
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 210 | ||||||
| 255568954 | 275 | endonuclease, putative [Ricinus communis | 0.957 | 0.730 | 0.657 | 2e-72 | |
| 449446780 | 277 | PREDICTED: uncharacterized protein LOC10 | 0.980 | 0.743 | 0.634 | 1e-70 | |
| 225441641 | 277 | PREDICTED: uncharacterized protein LOC10 | 0.971 | 0.736 | 0.654 | 1e-70 | |
| 449489536 | 250 | PREDICTED: uncharacterized protein LOC10 | 0.980 | 0.824 | 0.634 | 2e-70 | |
| 224123804 | 272 | predicted protein [Populus trichocarpa] | 0.942 | 0.727 | 0.657 | 2e-68 | |
| 388521737 | 270 | unknown [Lotus japonicus] | 0.980 | 0.762 | 0.584 | 3e-65 | |
| 356571688 | 288 | PREDICTED: uncharacterized protein LOC10 | 0.976 | 0.711 | 0.576 | 7e-62 | |
| 297853378 | 283 | HNH endonuclease domain-containing prote | 0.990 | 0.734 | 0.576 | 4e-61 | |
| 356560967 | 282 | PREDICTED: uncharacterized protein LOC10 | 0.980 | 0.730 | 0.590 | 2e-60 | |
| 30682222 | 284 | HNH endonuclease [Arabidopsis thaliana] | 0.990 | 0.732 | 0.573 | 9e-60 |
| >gi|255568954|ref|XP_002525447.1| endonuclease, putative [Ricinus communis] gi|223535260|gb|EEF36937.1| endonuclease, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 277 bits (709), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 142/216 (65%), Positives = 163/216 (75%), Gaps = 15/216 (6%)
Query: 1 MAQITSQGGLNLLFNGDGSSFGVEPRDPFQYKLGSGL----RSIEVHKRKL--IRPRNGV 54
MAQ T+QG L LLFN DG + LG+G S V K K I +
Sbjct: 1 MAQFTAQGRLKLLFNSDGIQCKL---------LGNGFVGGCTSFRVQKGKFRYIGSSTRL 51
Query: 55 WSSMNKKKSSKFSVDASLNVSGKGISGESEEYDSDEEFDELACFRGLVLDISYRPVNVVC 114
+S KKS F+ +ASLN + GESE+YDS+ E D+LACFRGLVLDISYRPVNVVC
Sbjct: 52 HASSVSKKSRHFNAEASLNAEEENDIGESEDYDSEYETDDLACFRGLVLDISYRPVNVVC 111
Query: 115 WKRAICLEFMEKADVLEYYDQTINSPNGSFYIPAVLRVRHLLQVVKRRRIKNNLSRKNLM 174
WKRAICLEFMEKADVLEYYDQT+NSP+GSFYIPAVLRV HLLQVVKRRRIK+NLSRKN++
Sbjct: 112 WKRAICLEFMEKADVLEYYDQTVNSPSGSFYIPAVLRVPHLLQVVKRRRIKSNLSRKNVL 171
Query: 175 YRDNFTCQYCSSRENLTIDHVVPASRGGEWKWENLV 210
+RDN+TCQYCSSRENLTIDHV+P +RGG+W WENLV
Sbjct: 172 HRDNYTCQYCSSRENLTIDHVLPTARGGQWTWENLV 207
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|449446780|ref|XP_004141149.1| PREDICTED: uncharacterized protein LOC101207660 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|225441641|ref|XP_002282210.1| PREDICTED: uncharacterized protein LOC100264031 [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|449489536|ref|XP_004158341.1| PREDICTED: uncharacterized protein LOC101226520 [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|224123804|ref|XP_002319168.1| predicted protein [Populus trichocarpa] gi|222857544|gb|EEE95091.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|388521737|gb|AFK48930.1| unknown [Lotus japonicus] | Back alignment and taxonomy information |
|---|
| >gi|356571688|ref|XP_003554006.1| PREDICTED: uncharacterized protein LOC100787021 isoform 1 [Glycine max] gi|356571690|ref|XP_003554007.1| PREDICTED: uncharacterized protein LOC100787021 isoform 2 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|297853378|ref|XP_002894570.1| HNH endonuclease domain-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297340412|gb|EFH70829.1| HNH endonuclease domain-containing protein [Arabidopsis lyrata subsp. lyrata] | Back alignment and taxonomy information |
|---|
| >gi|356560967|ref|XP_003548757.1| PREDICTED: uncharacterized protein LOC100810482 [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|30682222|ref|NP_179964.2| HNH endonuclease [Arabidopsis thaliana] gi|38603958|gb|AAR24724.1| At2g23840 [Arabidopsis thaliana] gi|44681452|gb|AAS47666.1| At2g23840 [Arabidopsis thaliana] gi|330252404|gb|AEC07498.1| HNH endonuclease [Arabidopsis thaliana] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 210 | ||||||
| TAIR|locus:2061476 | 284 | AT2G23840 [Arabidopsis thalian | 0.552 | 0.408 | 0.844 | 1.1e-58 | |
| UNIPROTKB|Q0BWS6 | 197 | HNE_3394 "HNH endonuclease dom | 0.485 | 0.517 | 0.358 | 5.3e-17 | |
| TIGR_CMR|SPO_2123 | 194 | SPO_2123 "HNH endonuclease fam | 0.485 | 0.525 | 0.367 | 2.3e-16 | |
| UNIPROTKB|O53196 | 222 | Rv2469c "Uncharacterized prote | 0.514 | 0.486 | 0.391 | 3.7e-16 | |
| UNIPROTKB|Q3Z918 | 177 | DET0537 "HNH endonuclease doma | 0.509 | 0.604 | 0.387 | 3.3e-15 | |
| TIGR_CMR|DET_0537 | 177 | DET_0537 "HNH endonuclease dom | 0.509 | 0.604 | 0.387 | 3.3e-15 |
| TAIR|locus:2061476 AT2G23840 [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 553 (199.7 bits), Expect = 1.1e-58, Sum P(2) = 1.1e-58
Identities = 98/116 (84%), Positives = 111/116 (95%)
Query: 95 LACFRGLVLDISYRPVNVVCWKRAICLEFMEKADVLEYYDQTINSPNGSFYIPAVLRVRH 154
L+CFRGLVLDISYRPVNVVCWKRAICLE+M+KADVLEYYDQT++SP GSFYIPAVLRV H
Sbjct: 101 LSCFRGLVLDISYRPVNVVCWKRAICLEYMDKADVLEYYDQTVSSPTGSFYIPAVLRVPH 160
Query: 155 LLQVVKRRRIKNNLSRKNLMYRDNFTCQYCSSRENLTIDHVVPASRGGEWKWENLV 210
LLQVVKRRR+KN+LSRKN++ RD++TCQYCSSRENLTIDHV+P SRGGEW W+NLV
Sbjct: 161 LLQVVKRRRVKNSLSRKNILLRDDYTCQYCSSRENLTIDHVMPVSRGGEWTWQNLV 216
|
|
| UNIPROTKB|Q0BWS6 HNE_3394 "HNH endonuclease domain protein" [Hyphomonas neptunium ATCC 15444 (taxid:228405)] | Back alignment and assigned GO terms |
|---|
| TIGR_CMR|SPO_2123 SPO_2123 "HNH endonuclease family protein" [Ruegeria pomeroyi DSS-3 (taxid:246200)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|O53196 Rv2469c "Uncharacterized protein" [Mycobacterium tuberculosis H37Rv (taxid:83332)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q3Z918 DET0537 "HNH endonuclease domain protein" [Dehalococcoides ethenogenes 195 (taxid:243164)] | Back alignment and assigned GO terms |
|---|
| TIGR_CMR|DET_0537 DET_0537 "HNH endonuclease domain protein" [Dehalococcoides ethenogenes 195 (taxid:243164)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| GSVIVG00025489001 | SubName- Full=Chromosome chr8 scaffold_34, whole genome shotgun sequence; (276 aa) | ||||||||||
(Vitis vinifera) | |||||||||||
| GSVIVG00022843001 | • | 0.506 | |||||||||
| GSVIVG00023128001 | • | 0.468 |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 210 | |||
| COG1403 | 146 | COG1403, McrA, Restriction endonuclease [Defense m | 4e-12 | |
| cd00085 | 57 | cd00085, HNHc, HNH nucleases; HNH endonuclease sig | 3e-05 | |
| pfam01844 | 47 | pfam01844, HNH, HNH endonuclease | 7e-05 | |
| smart00507 | 52 | smart00507, HNHc, HNH nucleases | 1e-04 |
| >gnl|CDD|224321 COG1403, McrA, Restriction endonuclease [Defense mechanisms] | Back alignment and domain information |
|---|
Score = 61.1 bits (148), Expect = 4e-12
Identities = 29/97 (29%), Positives = 46/97 (47%), Gaps = 4/97 (4%)
Query: 115 WKRAICLEFMEKADVLEYYDQTINSPNGSFYIPAVLRVRHLLQVVKRRRIKNNLSRKNLM 174
+ A+ ++A + Y + S + P+V + + +R R+ ++
Sbjct: 9 IRPALDRLVSQRAKHVAEYPC-LASESKDLRRPSVTDLTGESKRPSEKRPAKT--RRAVL 65
Query: 175 YRDNFTCQYCSS-RENLTIDHVVPASRGGEWKWENLV 210
RDN CQYC S +L +DH+VP SRGG WENL
Sbjct: 66 LRDNGLCQYCGSVGTDLEVDHIVPLSRGGASAWENLE 102
|
Length = 146 |
| >gnl|CDD|238038 cd00085, HNHc, HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins | Back alignment and domain information |
|---|
| >gnl|CDD|216737 pfam01844, HNH, HNH endonuclease | Back alignment and domain information |
|---|
| >gnl|CDD|214702 smart00507, HNHc, HNH nucleases | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 210 | |||
| COG1403 | 146 | McrA Restriction endonuclease [Defense mechanisms] | 99.52 | |
| smart00507 | 52 | HNHc HNH nucleases. | 98.96 | |
| cd00085 | 57 | HNHc HNH nucleases; HNH endonuclease signature whi | 98.83 | |
| PF13395 | 54 | HNH_4: HNH endonuclease | 98.66 | |
| TIGR01865 | 805 | cas_Csn1 CRISPR-associated protein, Csn1 family. C | 98.57 | |
| PF01844 | 47 | HNH: HNH endonuclease; InterPro: IPR002711 HNH end | 98.37 | |
| TIGR02646 | 144 | conserved hypothetical protein TIGR02646. Members | 98.06 | |
| PRK11295 | 113 | hypothetical protein; Provisional | 97.68 | |
| PF14279 | 71 | HNH_5: HNH endonuclease | 96.5 | |
| TIGR03031 | 802 | cas_csx12 CRISPR-associated protein, Csx12 family. | 93.68 | |
| COG3513 | 1088 | Predicted CRISPR-associated nuclease, contains Mcr | 92.09 | |
| smart00782 | 47 | PhnA_Zn_Ribbon PhnA Zinc-Ribbon. This protein fami | 90.49 | |
| PF14239 | 176 | RRXRR: RRXRR protein | 90.3 | |
| PF05198 | 76 | IF3_N: Translation initiation factor IF-3, N-termi | 85.34 |
| >COG1403 McrA Restriction endonuclease [Defense mechanisms] | Back alignment and domain information |
|---|
Probab=99.52 E-value=1.2e-14 Score=111.10 Aligned_cols=101 Identities=30% Similarity=0.519 Sum_probs=87.9
Q ss_pred CceeeeecHHHHHHHHHhCCeeEEEecCCeeeCCCcceecCeEEEEecceeeeeccccCChhhHHHHHhHcCCcccccCC
Q 028336 107 YRPVNVVCWKRAICLEFMEKADVLEYYDQTINSPNGSFYIPAVLRVRHLLQVVKRRRIKNNLSRKNLMYRDNFTCQYCSS 186 (210)
Q Consensus 107 y~Pl~~vswqrAi~Ll~~gkAevle~~~~~IrS~sg~~~vPsVIrL~~yv~~p~r~~~~~~~tR~~Vl~RD~~~CqYCG~ 186 (210)
|.|....+|++++..+..+.|..+..+. ...++.....+|.+..+..+...+... .++.+|.+++.||.+.|+|||.
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~r~~~~~~d~~~c~~c~~ 77 (146)
T COG1403 1 YNPLESKSIRPALDRLVSQRAKHVAEYP-CLASESKDLRRPSVTDLTGESKRPSEK--RPAKTRRAVLLRDNGLCQYCGS 77 (146)
T ss_pred CccccccchhHHHHHHhhcccccccCcc-eecceecccccCccceeeeeecccCCC--CchHHHHHHHccccccccccCC
Confidence 5788999999999999999999887654 455667788899999999887766543 5789999999999999999998
Q ss_pred Cc-CCeeCeeecCcCCCCCCcCcCC
Q 028336 187 RE-NLTIDHVVPASRGGEWKWENLV 210 (210)
Q Consensus 187 ~~-~LtvDHIIPrSrGG~dtweNLV 210 (210)
.. .+++|||||++.||.+.|+||+
T Consensus 78 ~~~~~~~dHiip~~~g~~~~~~Nl~ 102 (146)
T COG1403 78 VGTDLEVDHIVPLSRGGASAWENLE 102 (146)
T ss_pred cCCCCceeeEeecccCCcchHHHHH
Confidence 74 7999999999999999999974
|
|
| >smart00507 HNHc HNH nucleases | Back alignment and domain information |
|---|
| >cd00085 HNHc HNH nucleases; HNH endonuclease signature which is found in viral, prokaryotic, and eukaryotic proteins | Back alignment and domain information |
|---|
| >PF13395 HNH_4: HNH endonuclease | Back alignment and domain information |
|---|
| >TIGR01865 cas_Csn1 CRISPR-associated protein, Csn1 family | Back alignment and domain information |
|---|
| >PF01844 HNH: HNH endonuclease; InterPro: IPR002711 HNH endonuclease is found in bacteria and viruses [, , ] | Back alignment and domain information |
|---|
| >TIGR02646 conserved hypothetical protein TIGR02646 | Back alignment and domain information |
|---|
| >PRK11295 hypothetical protein; Provisional | Back alignment and domain information |
|---|
| >PF14279 HNH_5: HNH endonuclease | Back alignment and domain information |
|---|
| >TIGR03031 cas_csx12 CRISPR-associated protein, Csx12 family | Back alignment and domain information |
|---|
| >COG3513 Predicted CRISPR-associated nuclease, contains McrA/HNH-nuclease and RuvC-like nuclease domain [Defense mechanisms] | Back alignment and domain information |
|---|
| >smart00782 PhnA_Zn_Ribbon PhnA Zinc-Ribbon | Back alignment and domain information |
|---|
| >PF14239 RRXRR: RRXRR protein | Back alignment and domain information |
|---|
| >PF05198 IF3_N: Translation initiation factor IF-3, N-terminal domain; InterPro: IPR019814 Initiation factor 3 (IF-3) (gene infC) is one of the three factors required for the initiation of protein biosynthesis in bacteria | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
No homologous structure with e-value below 0.005
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 210 | |||
| 2qgp_A | 112 | HNH endonuclease; Q39X46, GMR87, X-RAY, NESG, stru | 2e-20 |
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 210 | |||
| 2qgp_A | 112 | HNH endonuclease; Q39X46, GMR87, X-RAY, NESG, stru | 98.99 | |
| 1tif_A | 78 | IF3-N, translation initiation factor 3; IF3 N-term | 81.24 |
| >1tif_A IF3-N, translation initiation factor 3; IF3 N-terminal domain, ribosome binding factor; 1.80A {Geobacillus stearothermophilus} SCOP: d.15.8.1 | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
No hit with e-value below 0.005
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 210 | |||
| d1e7la2 | 103 | Recombination endonuclease VII, N-terminal domain | 85.32 |
| >d1e7la2 d.4.1.5 (A:1-103) Recombination endonuclease VII, N-terminal domain {Bacteriophage T4 [TaxId: 10665]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a+b) fold: His-Me finger endonucleases superfamily: His-Me finger endonucleases family: Recombination endonuclease VII, N-terminal domain domain: Recombination endonuclease VII, N-terminal domain species: Bacteriophage T4 [TaxId: 10665]
Probab=85.32 E-value=0.24 Score=36.26 Aligned_cols=31 Identities=16% Similarity=0.298 Sum_probs=25.2
Q ss_pred ChhhHHHHHhHcCCcccccCCC-----cCCeeCeee
Q 028336 166 NNLSRKNLMYRDNFTCQYCSSR-----ENLTIDHVV 196 (210)
Q Consensus 166 ~~~tR~~Vl~RD~~~CqYCG~~-----~~LtvDHII 196 (210)
.+..++++++.++++|+.||.+ ..+.+||-.
T Consensus 8 ~~ee~~~l~~~Q~g~C~IC~~~~~~~~~~~~vDHdH 43 (103)
T d1e7la2 8 YKEEKQKFYDAQNGKCLICQRELNPDVQANHLDHDH 43 (103)
T ss_dssp HHHHHHHHHHHTTTBCTTTCCBCCSSGGGSEEEECC
T ss_pred CHHHHHHHHHHhCCcCCCCCCCcccccccceeeccc
Confidence 3567889999999999999976 347888864
|