Citrus Sinensis ID: 001223
Local Sequence Feature Prediction
| Prediction and (Method) | Result |
|---|
Close Homologs for Annotation Transfer
Close Homologs in the Non-Redundant Database Detected by BLAST 
Original result of BLAST against Nonredundant Database
GI ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 1120 | ||||||
| 255538898 | 1113 | 5'->3' exoribonuclease, putative [Ricinu | 0.920 | 0.926 | 0.756 | 0.0 | |
| 359492219 | 1065 | PREDICTED: 5'-3' exoribonuclease 3-like | 0.935 | 0.984 | 0.723 | 0.0 | |
| 224082986 | 998 | predicted protein [Populus trichocarpa] | 0.868 | 0.974 | 0.784 | 0.0 | |
| 302142638 | 870 | unnamed protein product [Vitis vinifera] | 0.771 | 0.993 | 0.837 | 0.0 | |
| 449452066 | 1101 | PREDICTED: 5'-3' exoribonuclease 3-like | 0.916 | 0.931 | 0.709 | 0.0 | |
| 356562399 | 1065 | PREDICTED: 5'-3' exoribonuclease 3-like | 0.893 | 0.939 | 0.687 | 0.0 | |
| 356553393 | 1075 | PREDICTED: 5'-3' exoribonuclease 3-like | 0.935 | 0.974 | 0.672 | 0.0 | |
| 242056911 | 1066 | hypothetical protein SORBIDRAFT_03g01004 | 0.882 | 0.926 | 0.717 | 0.0 | |
| 224065931 | 851 | predicted protein [Populus trichocarpa] | 0.739 | 0.972 | 0.816 | 0.0 | |
| 414876883 | 1066 | TPA: hypothetical protein ZEAMMB73_40276 | 0.882 | 0.926 | 0.710 | 0.0 |
| >gi|255538898|ref|XP_002510514.1| 5'->3' exoribonuclease, putative [Ricinus communis] gi|223551215|gb|EEF52701.1| 5'->3' exoribonuclease, putative [Ricinus communis] | Back alignment and taxonomy information |
|---|
Score = 1594 bits (4127), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 803/1062 (75%), Positives = 880/1062 (82%), Gaps = 31/1062 (2%)
Query: 1 MGVPAFYRWLAEKYPLVVADVIEEEPVVIDGVKIPVDTSKPNPNGLEYDNLYLDMNGIIH 60
MGVPAFYRWLAEKYPLVV D IEEEPVVIDGVKIPVD S+PNPN +EYDNLYLDMNGIIH
Sbjct: 1 MGVPAFYRWLAEKYPLVVVDAIEEEPVVIDGVKIPVDASRPNPNNIEYDNLYLDMNGIIH 60
Query: 61 PCFHPEDRPAPTTFDEVFQCMFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQRSRRFR 120
PCFHPEDRP+PT+F+EVFQCMFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQRSRRFR
Sbjct: 61 PCFHPEDRPSPTSFEEVFQCMFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQRSRRFR 120
Query: 121 ASKDAADSAAEEERLRQEFEREGRKLPPKSDSQVFDSNVITPGTEFMAVLSIALQYYIHL 180
A+KD ++AAEEERLRQEFEREGRKLPPK SQVFDSN+ITPGTEFMAVLSIALQYYIHL
Sbjct: 121 AAKDREEAAAEEERLRQEFEREGRKLPPKESSQVFDSNIITPGTEFMAVLSIALQYYIHL 180
Query: 181 RLNNDPGWEKIKVILSDANVPGEGEHKIMSYVRLQRNLPGYDPNTRHCLYGLDADLIMLA 240
RLNNDPGW+K+KVILSDANVPGEGEHK+MSY+RLQRNLPGYDPNTRHCLYGLDADLIMLA
Sbjct: 181 RLNNDPGWKKVKVILSDANVPGEGEHKVMSYIRLQRNLPGYDPNTRHCLYGLDADLIMLA 240
Query: 241 LATHEVHFSILREVVFTPGQQDKCFLCGQPGHLAANCEGKAKRKAGEFDEKGDEVVVPKK 300
LATHE+HFSILRE+VFTPGQQDKCFLCGQ GHLAA+CEGKAKRKAGEFDEKGDE V KK
Sbjct: 241 LATHEIHFSILREIVFTPGQQDKCFLCGQMGHLAADCEGKAKRKAGEFDEKGDEGAVAKK 300
Query: 301 PYQFLHIWTLREYLDYEFRIPNPPFEIDLECIVDDFIFMCFFVGNDFLPHMPTLEIREGA 360
PYQFL+IWTLREYL+ E RIPNPPF+ID ECIVDDFIFMCFFVGNDFLPHMPTLEIREGA
Sbjct: 301 PYQFLNIWTLREYLELEMRIPNPPFKIDFECIVDDFIFMCFFVGNDFLPHMPTLEIREGA 360
Query: 361 INLLMAVYKKEFRALGGYLTDGSKPSLRRVEHFIQAVGSYEDRIFQKRARLHQRHAERIK 420
INLL+AVYKKEF A+GGYLTDG KP+L +VEHFIQAVGSYED+IFQKRARLHQR +ERIK
Sbjct: 361 INLLLAVYKKEFTAMGGYLTDGCKPNLSKVEHFIQAVGSYEDKIFQKRARLHQRQSERIK 420
Query: 421 REKAQ--ARRGDDVEPLAQPDSLVPVSRFHGSRLASGPSPSPYQQSECVGRLD------- 471
REKAQ +RRGDD +P QP+SLVPV+RFHGSRLAS PSPSP+Q S LD
Sbjct: 421 REKAQSRSRRGDDAQPQVQPESLVPVARFHGSRLASAPSPSPFQHSLEAADLDVRSAHFS 480
Query: 472 --NGKGTSGRPHKVPRLSSGATIGAAIVEAENSFETDPQENKEEFKAKLKELLRDKSDAF 529
+GKG+S + HKV RLSS A++GAAIVEAENS E + ENK+E KAKLKE+LR+KSDAF
Sbjct: 481 ALDGKGSSVQAHKVARLSSSASVGAAIVEAENSLEIEVHENKDELKAKLKEILREKSDAF 540
Query: 530 NSDNPEEDKVKLGEPGWKERYYEEKFNAKTPEEREETRKDVVLRYTEGLCWVMHYYYEGV 589
NS NPEEDK++LG+PGWKERYYEEKF+ KTPEE E+ R+DVVLRYTEGLCWVMHYYYEGV
Sbjct: 541 NSKNPEEDKIRLGDPGWKERYYEEKFSGKTPEELEDIRRDVVLRYTEGLCWVMHYYYEGV 600
Query: 590 CSWQWFYPYHYAPFASDLKDLDQFNISFELGHPFKPFNQLLGVFPSASSHALPEHYRKLM 649
CSWQWFYPYHYAPFASDLK L Q +I FELG PFKPFNQLLGVFP+ASSHALP HYRKLM
Sbjct: 601 CSWQWFYPYHYAPFASDLKHLGQLDIKFELGSPFKPFNQLLGVFPAASSHALPVHYRKLM 660
Query: 650 TDPNSPISDFYPTDFEVDMNGKRYSWQGIAKLPFIDEARLLDEVKKIEHTLTEEEARRNS 709
+DPNSPI DFYPTDFEVDMNGKRY+WQGIAKLPFIDE RLL EVKKIEHTLTEEEARRNS
Sbjct: 661 SDPNSPIIDFYPTDFEVDMNGKRYAWQGIAKLPFIDEVRLLAEVKKIEHTLTEEEARRNS 720
Query: 710 IMADMLFVLLSHPLSASIYTLDEHCKQLKIKERVEVKERLNPELSDGMNGYISPCAGDPH 769
M DMLFVL SH L+ SIY LD +CKQL +ERVEVKER+NPELSDGMNGYISPC+GD H
Sbjct: 721 AMFDMLFVLSSHSLAESIYLLDNNCKQLTDRERVEVKERINPELSDGMNGYISPCSGDTH 780
Query: 770 PPVFRSPVASMEDIMGNQVLCAIYKLPDAHKHVTRPPAGVIFPKKIVQLEDLKPAPVLWH 829
PP+FRSPVA MEDI+ N V+CAIY+LPD HKHVTRPPAGVIFPKKIV + DLKP PVLWH
Sbjct: 781 PPIFRSPVAGMEDILDNGVICAIYRLPDPHKHVTRPPAGVIFPKKIVNVVDLKPDPVLWH 840
Query: 830 EDSGRRPHENGRHNPHGTISGRQLGEAAHRLVANSLQHMKVDRNGYGDRMHGSPLPYAAA 889
EDSGR+P E+ R NP GTISGRQLGEA+HRLV NSLQ MKVD NGY + +H PYAA
Sbjct: 841 EDSGRKPWESDRRNPPGTISGRQLGEASHRLVVNSLQ-MKVDHNGYANHLHAPIPPYAAT 899
Query: 890 PYVPPVPSYQEYGVYDQGYNRFSQPRTDSYPAGHSQSSISAAQPPYDGGYSQHYAS-NTS 948
+VP SY G +D +R QPR D AG ++ + Q + GY Q YAS +
Sbjct: 900 AHVPVHSSYAN-GSHDPRRDRTGQPRMDYSHAGQNR-FFNPTQYNNNHGYGQSYASPGNA 957
Query: 949 HHPNRRYHPQYDRN--SSGEHPTHLYNPSGIHQNGGPRYPPRPMGSTSSGANLYPPQGGY 1006
H R PQY+R S+G+H +Y HQN GPRY M TS Y Q Y
Sbjct: 958 HFSGSR--PQYERENRSNGQHSRQVYLQPEFHQN-GPRYLHGSMAGTSES---YAYQDDY 1011
Query: 1007 SGYQS---PAAGSFNQWGG-----ANQSMPRGYGQGQHHQRR 1040
GYQS P S QWG N + PRGY Q Q R
Sbjct: 1012 DGYQSHQPPRDASHQQWGNGLPPPVNLNFPRGYDQSQQRGNR 1053
|
Source: Ricinus communis Species: Ricinus communis Genus: Ricinus Family: Euphorbiaceae Order: Malpighiales Class: Phylum: Streptophyta Superkingdom: Eukaryota |
| >gi|359492219|ref|XP_002280236.2| PREDICTED: 5'-3' exoribonuclease 3-like [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|224082986|ref|XP_002306919.1| predicted protein [Populus trichocarpa] gi|222856368|gb|EEE93915.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|302142638|emb|CBI19841.3| unnamed protein product [Vitis vinifera] | Back alignment and taxonomy information |
|---|
| >gi|449452066|ref|XP_004143781.1| PREDICTED: 5'-3' exoribonuclease 3-like [Cucumis sativus] | Back alignment and taxonomy information |
|---|
| >gi|356562399|ref|XP_003549459.1| PREDICTED: 5'-3' exoribonuclease 3-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|356553393|ref|XP_003545041.1| PREDICTED: 5'-3' exoribonuclease 3-like [Glycine max] | Back alignment and taxonomy information |
|---|
| >gi|242056911|ref|XP_002457601.1| hypothetical protein SORBIDRAFT_03g010040 [Sorghum bicolor] gi|241929576|gb|EES02721.1| hypothetical protein SORBIDRAFT_03g010040 [Sorghum bicolor] | Back alignment and taxonomy information |
|---|
| >gi|224065931|ref|XP_002301985.1| predicted protein [Populus trichocarpa] gi|222843711|gb|EEE81258.1| predicted protein [Populus trichocarpa] | Back alignment and taxonomy information |
|---|
| >gi|414876883|tpg|DAA54014.1| TPA: hypothetical protein ZEAMMB73_402768 [Zea mays] | Back alignment and taxonomy information |
|---|
Prediction of Gene Ontology (GO) Terms
Close Homologs with Gene Ontology terms Detected by BLAST 
Original result of BLAST against Gene Ontology (AMIGO)
ID ![]() |
Alignment graph ![]() |
Length ![]() |
Definition ![]() |
Q cover ![]() |
H cover ![]() |
Identity ![]() |
E-value ![]() |
| Query | 1120 | ||||||
| TAIR|locus:2005614 | 1020 | XRN3 "5'-3' exoribonuclease 3" | 0.767 | 0.843 | 0.723 | 0.0 | |
| TAIR|locus:2020073 | 947 | XRN4 "exoribonuclease 4" [Arab | 0.381 | 0.450 | 0.631 | 3.1e-251 | |
| UNIPROTKB|F1MKX7 | 951 | XRN2 "Uncharacterized protein" | 0.423 | 0.498 | 0.497 | 3.7e-189 | |
| MGI|MGI:894687 | 951 | Xrn2 "5'-3' exoribonuclease 2" | 0.444 | 0.523 | 0.484 | 4.7e-187 | |
| UNIPROTKB|Q5ZIP4 | 949 | XRN2 "5'-3' exoribonuclease 2" | 0.400 | 0.473 | 0.504 | 2.6e-186 | |
| UNIPROTKB|Q9H0D6 | 950 | XRN2 "5'-3' exoribonuclease 2" | 0.444 | 0.524 | 0.486 | 8.5e-186 | |
| UNIPROTKB|E2RMS9 | 950 | XRN2 "Uncharacterized protein" | 0.444 | 0.524 | 0.490 | 1.1e-185 | |
| ZFIN|ZDB-GENE-040426-2874 | 952 | xrn2 "5'-3' exoribonuclease 2" | 0.380 | 0.447 | 0.535 | 1.4e-185 | |
| UNIPROTKB|Q5R4L5 | 950 | XRN2 "5'-3' exoribonuclease 2" | 0.469 | 0.553 | 0.468 | 2.3e-185 | |
| UNIPROTKB|F1SAU5 | 950 | XRN2 "Uncharacterized protein" | 0.443 | 0.523 | 0.489 | 2.9e-185 |
| TAIR|locus:2005614 XRN3 "5'-3' exoribonuclease 3" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
Score = 3420 (1209.0 bits), Expect = 0., Sum P(2) = 0.
Identities = 643/889 (72%), Positives = 721/889 (81%)
Query: 1 MGVPAFYRWLAEKYPLVVADVIEEEPVVIDGVKIPVDTSKPNPNGLEYDNLYLDMNGIIH 60
MGVP+FYRWLAEKYPL+VADVIEEEPV I+G+KIPVDTSKPNPN LEYDNLYLDMNGIIH
Sbjct: 1 MGVPSFYRWLAEKYPLLVADVIEEEPVEIEGIKIPVDTSKPNPNNLEYDNLYLDMNGIIH 60
Query: 61 PCFHPEDRPAPTTFDEVFQCMFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQXXXXXX 120
PCFHPEDRP+PTTF+EVFQCMFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQ
Sbjct: 61 PCFHPEDRPSPTTFEEVFQCMFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQRSRRFR 120
Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXKLPPKSDSQVFDSNVITPGTEFMAVLSIALQYYIHL 180
+LPPK DSQVFDSNVITPGTEFM VLSIALQYY+HL
Sbjct: 121 SAKDASDAAAEEERLREEFEREGRRLPPKVDSQVFDSNVITPGTEFMGVLSIALQYYVHL 180
Query: 181 RLNNDPGWEKIKVILSDANVPGEGEHKIMSYVRLQRNLPGYDPNTRHCLYGLDADLIMLA 240
RLN+D GW+ IKVILSDANVPGEGEHKIMSY+RLQRNLPG+DPNTRHCLYGLDADLIML
Sbjct: 181 RLNHDVGWKNIKVILSDANVPGEGEHKIMSYIRLQRNLPGFDPNTRHCLYGLDADLIMLG 240
Query: 241 LATHEVHFSILREVVFTPGQQDKCFLCGQPGHLAANCEGKAKRKAGEFDEKGDEVVVPKK 300
LATHEVHFSILREVV+TPGQQ++CFLCGQ GH A+NCEGK K++AGE DEKGD KK
Sbjct: 241 LATHEVHFSILREVVYTPGQQERCFLCGQMGHFASNCEGKPKKRAGESDEKGDGNDFVKK 300
Query: 301 PYQFLHIWTLREYLDYEFRIPNPPFEIDLECIVDDFIFMCFFVGNDFLPHMPTLEIREGA 360
PYQFLHIW LREYL+ E RIPNPPFEIDLE IVDDFIF+CFFVGNDFLPHMPTLEIREGA
Sbjct: 301 PYQFLHIWVLREYLELEMRIPNPPFEIDLERIVDDFIFICFFVGNDFLPHMPTLEIREGA 360
Query: 361 INLLMAVYKKEFRALGGYLTDGSKPSLRRVEHFIQAVGSYEDRIFQKRARLHQRHAERIK 420
INLLMAVYKKEFR+ GYLTDG KP+L+RVE FIQAVGS+ED+IFQKRA HQR AER+K
Sbjct: 361 INLLMAVYKKEFRSFDGYLTDGCKPNLKRVEQFIQAVGSFEDKIFQKRAMQHQRQAERVK 420
Query: 421 REKA--QARRGDDVEPLAQPDSLVPVSRFHGSRLASGPSPSPYQQSECVGRLDNGKGTSG 478
R+KA +R DD P QPD LVPV+RF GSRLAS P+PSP+Q ++ GR
Sbjct: 421 RDKAGKATKRMDDEAPTVQPD-LVPVARFSGSRLASAPTPSPFQSND--GR--------S 469
Query: 479 RPH-KVPRLSSGATIGAAIVEAENSFETDPQXXXXXXXXXXXXXXRDKSDAFNSDNPEED 537
PH KV RLS G+++GAAIV+ ENS E+D + R+KSDAFNSD EED
Sbjct: 470 APHQKVRRLSPGSSVGAAIVDVENSLESDERENKEELKTKLKELIREKSDAFNSDTTEED 529
Query: 538 KVKLGEPGWKERYYEEKFNAKTPEEREETRKDVVLRYTEGLCWVMHYYYEGVCSWQWFYP 597
KVKLG+PGW+ERYYEEKF+ TPEE E RKDVVL+YTEGLCWVMHYY EGVCSWQWFYP
Sbjct: 530 KVKLGQPGWRERYYEEKFSVVTPEEMERVRKDVVLKYTEGLCWVMHYYMEGVCSWQWFYP 589
Query: 598 YHYAPFASDLKDLDQFNISFELGHPFKPFNQLLGVFPSASSHALPEHYRKLMTDPNSPIS 657
YHYAPFASDLKDL + +I FELG PFKPFNQLLGVFP+ASSHALPE YR LMTDPNSPI
Sbjct: 590 YHYAPFASDLKDLGEMDIKFELGTPFKPFNQLLGVFPAASSHALPERYRTLMTDPNSPII 649
Query: 658 DFYPTDFEVDMNGKRYSWQGIAKLPFIDEARLLDEVKKIEHTLTEEEARRNSIMADMLFV 717
DFYPTDFEVDMNGKR+SWQGIAKLPFIDE RLL+ V ++E TLT+EE RRNS M DMLF+
Sbjct: 650 DFYPTDFEVDMNGKRFSWQGIAKLPFIDERRLLEAVSEVEFTLTDEEKRRNSRMCDMLFI 709
Query: 718 LLSHPLSASIYTLDEHCKQLKIKERVEVKERLNPELSDGMNGYISPCAGDPHPPVFRSPV 777
SH L+ +++LD HC+QL +ERV+ K ++ P+LSDGMNGY++PC+G+ HPPVFRSP+
Sbjct: 710 ATSHRLAELVFSLDNHCRQLSARERVDFKVKIKPKLSDGMNGYLTPCSGETHPPVFRSPM 769
Query: 778 ASMEDIMGNQVLCAIYKLPDAHKHVTRPPAGVIFPKKIVQLEDLKPAPVLWHEDSGRRP- 836
MEDI+ NQV+C IY+LPDAH+H+TRPP GVIFPKK V + DLKP P LWHED+GRRP
Sbjct: 770 EGMEDILTNQVICCIYRLPDAHEHITRPPPGVIFPKKTVDIGDLKPPPALWHEDNGRRPM 829
Query: 837 HEN-------------GRHNPHGTISGRQLGEAAHRLVANSLQHMKVDR 872
H N GR NP G++SGR LG AAHRLV+NSLQ M DR
Sbjct: 830 HNNHGMHNNHGMHNNQGRQNPPGSVSGRHLGNAAHRLVSNSLQ-MGTDR 877
|
|
| TAIR|locus:2020073 XRN4 "exoribonuclease 4" [Arabidopsis thaliana (taxid:3702)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1MKX7 XRN2 "Uncharacterized protein" [Bos taurus (taxid:9913)] | Back alignment and assigned GO terms |
|---|
| MGI|MGI:894687 Xrn2 "5'-3' exoribonuclease 2" [Mus musculus (taxid:10090)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q5ZIP4 XRN2 "5'-3' exoribonuclease 2" [Gallus gallus (taxid:9031)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q9H0D6 XRN2 "5'-3' exoribonuclease 2" [Homo sapiens (taxid:9606)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|E2RMS9 XRN2 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] | Back alignment and assigned GO terms |
|---|
| ZFIN|ZDB-GENE-040426-2874 xrn2 "5'-3' exoribonuclease 2" [Danio rerio (taxid:7955)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|Q5R4L5 XRN2 "5'-3' exoribonuclease 2" [Pongo abelii (taxid:9601)] | Back alignment and assigned GO terms |
|---|
| UNIPROTKB|F1SAU5 XRN2 "Uncharacterized protein" [Sus scrofa (taxid:9823)] | Back alignment and assigned GO terms |
|---|
Prediction of Enzyme Commission (EC) Number
EC Number Prediction by Ezypred Server 
Original result from Ezypred Server
Fail to connect to Ezypred Server
Prediction of Functionally Associated Proteins
Functionally Associated Proteins Detected by STRING 
Original result from the STRING server
| estExt_Genewise1_v1.C_LG_V1706 | hypothetical protein (998 aa) | |||||||
(Populus trichocarpa) | ||||||||
| Sorry, there are no predicted associations at the current settings. |
Conserved Domains and Related Protein Families
Conserved Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against CDD database part I
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 1120 | |||
| COG5049 | 953 | COG5049, XRN1, 5'-3' exonuclease [DNA replication, | 0.0 | |
| pfam03159 | 237 | pfam03159, XRN_N, XRN 5'-3' exonuclease N-terminus | 1e-144 | |
| smart00157 | 218 | smart00157, PRP, Major prion protein | 6e-06 | |
| pfam09606 | 768 | pfam09606, Med15, ARC105 or Med15 subunit of Media | 0.004 |
| >gnl|CDD|227382 COG5049, XRN1, 5'-3' exonuclease [DNA replication, recombination, and repair / Cell division and chromosome partitioning / Translation] | Back alignment and domain information |
|---|
Score = 716 bits (1849), Expect = 0.0
Identities = 340/860 (39%), Positives = 455/860 (52%), Gaps = 74/860 (8%)
Query: 1 MGVPAFYRWLAEKYPLVVADVIEEEPVVIDGVKIPVDTSKPNPNGLEYDNLYLDMNGIIH 60
MGVP+F+RWL+E+YP ++ + E K P DNLYLDMNGI+H
Sbjct: 1 MGVPSFFRWLSERYPKIIQLIEE----------------KQIPEF---DNLYLDMNGILH 41
Query: 61 PCFHPEDRPAPTTFDEVFQCMFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQRSRRFR 120
C HP D P T +E+++ +F+YID + + +RPRKLLYMA+DGVAPRAKMNQQR+RRFR
Sbjct: 42 NCTHPNDGSPPETEEEMYKAVFEYIDHILLKIRPRKLLYMAVDGVAPRAKMNQQRARRFR 101
Query: 121 ASKDAADSAAEEERLRQEFEREGRKLPPKSDSQV-----FDSNVITPGTEFMAVLSIALQ 175
++KDA+ +A + E +E E ++ + D+ FDSN ITPGT FM L+ L+
Sbjct: 102 SAKDASAAALKAEPNGEEIPEEKDEIGNEIDTIDVEKKKFDSNCITPGTPFMERLAKVLR 161
Query: 176 YYIHLRLNNDPGWEKIKVILSDANVPGEGEHKIMSYVRLQRNLPGYDPNTRHCLYGLDAD 235
YYIH +L++DP W +++I S VPGEGEHKIM+++R Q+ P Y+PNTRHC+YGLDAD
Sbjct: 162 YYIHCKLSSDPEWRNLRIIFSGHLVPGEGEHKIMNFIRSQKAQPSYNPNTRHCIYGLDAD 221
Query: 236 LIMLALATHEVHFSILREVVFTPGQQDKCFLCGQPGHLAANCEGKAKRKAGEFDEKGDEV 295
LIML L+THE HF ILRE VF + + + K G +E
Sbjct: 222 LIMLGLSTHEPHFLILREDVFFG---------------SKSRRKRKCTKCGRTGHSDEEC 266
Query: 296 VVP-KKPYQFLHIWTLREYLDYEFRIPNPPFEIDLECIVDDFIFMCFFVGNDFLPHMPTL 354
V +P+ LHI LREYL+ EFR P PF DLE I+DD+IF+CFFVGNDFLPH+P L
Sbjct: 267 KVLTHQPFYLLHISLLREYLEREFREPTLPFTFDLERILDDWIFLCFFVGNDFLPHLPCL 326
Query: 355 EIREGAINLLMAVYKKEFRALGGYLTDGSKPSLRRVEHFIQAVGSYEDRIFQKRARLHQR 414
+IREGAI L ++KK + GY+T +L R+E + +GS+ED IF+K +R
Sbjct: 327 DIREGAIETLTEIWKKSLPHMKGYITCDGVINLARLEVILAILGSFEDDIFKKDHIQEER 386
Query: 415 HAE---RIKREKAQARRGDDVEPLAQPDSLVPVSRFHGSRLASGPSPSPYQQSE-CVGRL 470
E R K + + + + S SP E + L
Sbjct: 387 KNESLERFSLRKERKEGLKGMPRVVYEQKKLIGSIKPTLMDQLQEKKSPDLPDEEFIDTL 446
Query: 471 DNGKGTSGRPHKVPRLSSGATIGAAIVEAENS----------FETDPQENKEEFKAKLKE 520
K + H++ +G +I +A S P E++EEF++++
Sbjct: 447 ALPKDLDMKNHELFLKRFANDLGLSISKAIKSKGNYSLEMDIASDSPDEDEEEFESEVDS 506
Query: 521 LLR--DKSDAFNSDNPEE----DKVKLGEPGWKERYYEEKFNAKTPEEREETRKDVVLRY 574
+ + DK + EE V L PGWKERYY K + T + EE +D+ Y
Sbjct: 507 IRKIPDKYVNIIVEEEEENETEKTVNLRFPGWKERYYTSKLHF-TTDS-EEKIRDMAKEY 564
Query: 575 TEGLCWVMHYYYEGVCSWQWFYPYHYAPFASDLKDLDQFNISFELGHPFKPFNQLLGVFP 634
EGL WV+ YYY G SW W+YPYHYAP A+DL L +I FELG PF+PF QL+ V P
Sbjct: 565 VEGLQWVLSYYYRGCPSWDWYYPYHYAPLAADLSKLSDNDIKFELGTPFRPFEQLMAVLP 624
Query: 635 SASSHALPEHYRKLMTDPNSPISDFYPTDFEVDMNGKRYSWQGIAKLPFIDEARLLDEVK 694
+ S + +PE +R LM D SPI DFYP +F++DMNGK SWQ + LPFIDE RLL V
Sbjct: 625 ARSKNLVPEGFRPLMDDEKSPIIDFYPEEFKLDMNGKTASWQAVVLLPFIDERRLLSAVA 684
Query: 695 KIEHTLTEEEARRNSIMADMLFVLLSHP-LSASIYTLDEHCKQLK--IKERVEVKERLNP 751
TL+EEE +RN D+LF LS L CKQ + E L
Sbjct: 685 VKYPTLSEEERKRNLRGLDLLFSSNKKSDLSELFKDLYSKCKQKEYITMCSKESPYGLFG 744
Query: 752 ELSDGMNGYISPCAGDPHPPVFRSPVASMEDIMGNQVLCAIYKLPDAHKHVTRPPAGVIF 811
+ G G P M + ++ A + D VT
Sbjct: 745 TVKLGAEGLAPNLLSLCPISFLSYP-GLMVFLEYSKNQSARLVIEDPKSTVTN------- 796
Query: 812 PKKIVQLEDLKPAPVLWHED 831
K IV +KP VLW
Sbjct: 797 -KSIVLRGFIKPINVLWPYL 815
|
Length = 953 |
| >gnl|CDD|202558 pfam03159, XRN_N, XRN 5'-3' exonuclease N-terminus | Back alignment and domain information |
|---|
| >gnl|CDD|197548 smart00157, PRP, Major prion protein | Back alignment and domain information |
|---|
| >gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex non-fungal | Back alignment and domain information |
|---|
Conserved Domains Detected by HHsearch 
Original result of HHsearch against CDD database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 1120 | |||
| KOG2044 | 931 | consensus 5'-3' exonuclease HKE1/RAT1 [Replication | 100.0 | |
| COG5049 | 953 | XRN1 5'-3' exonuclease [DNA replication, recombina | 100.0 | |
| KOG2045 | 1493 | consensus 5'-3' exonuclease XRN1/KEM1/SEP1 involve | 100.0 | |
| PF03159 | 237 | XRN_N: XRN 5'-3' exonuclease N-terminus; InterPro: | 100.0 | |
| cd00128 | 316 | XPG Xeroderma pigmentosum G N- and I-regions (XPGN | 98.49 | |
| PTZ00217 | 393 | flap endonuclease-1; Provisional | 98.08 | |
| KOG2044 | 931 | consensus 5'-3' exonuclease HKE1/RAT1 [Replication | 97.74 | |
| PRK03980 | 292 | flap endonuclease-1; Provisional | 97.23 | |
| TIGR03674 | 338 | fen_arch flap structure-specific endonuclease. End | 96.79 | |
| cd00008 | 240 | 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonucleas | 95.44 | |
| smart00475 | 259 | 53EXOc 5'-3' exonuclease. | 95.04 | |
| PF00752 | 101 | XPG_N: XPG N-terminal domain; InterPro: IPR006085 | 93.58 | |
| PRK14976 | 281 | 5'-3' exonuclease; Provisional | 93.05 | |
| PF00867 | 94 | XPG_I: XPG I-region; InterPro: IPR006086 This entr | 91.55 | |
| smart00485 | 99 | XPGN Xeroderma pigmentosum G N-region. domain in n | 90.83 | |
| PF00098 | 18 | zf-CCHC: Zinc knuckle; InterPro: IPR001878 Zinc fi | 89.62 | |
| PRK05755 | 880 | DNA polymerase I; Provisional | 86.24 | |
| KOG2519 | 449 | consensus 5'-3' exonuclease [Replication, recombin | 85.03 | |
| COG0258 | 310 | Exo 5'-3' exonuclease (including N-terminal domain | 84.03 |
| >KOG2044 consensus 5'-3' exonuclease HKE1/RAT1 [Replication, recombination and repair; RNA processing and modification] | Back alignment and domain information |
|---|
Probab=100.00 E-value=1.1e-269 Score=2298.30 Aligned_cols=806 Identities=58% Similarity=0.974 Sum_probs=721.9
Q ss_pred CcchhHHHHHHhhCCcccccccccCccccCCccccCCCCCCCCCCCccCeEEEeccccccccccCCCCCCCCCHHHHHHH
Q 001223 1 MGVPAFYRWLAEKYPLVVADVIEEEPVVIDGVKIPVDTSKPNPNGLEYDNLYLDMNGIIHPCFHPEDRPAPTTFDEVFQC 80 (1120)
Q Consensus 1 MGVP~FfRWL~~rYP~ii~~viE~~~~~~dG~~iP~d~s~pNPNg~efDNLYLDMNGIIH~C~Hpe~~~~p~te~Emf~~ 80 (1120)
||||+|||||++|||++|++|+|++|+++||+.||+|.|+|||||+||||||||||||||||+|||++|+|+||+|||++
T Consensus 1 MGVPaffRWLs~kyp~~I~~viEe~p~~~~g~~ip~D~s~pNPNg~E~DNLYLDMNGIIHPC~HPEdkPaP~tedEm~~a 80 (931)
T KOG2044|consen 1 MGVPAFFRWLSRKYPKTISPVIEEEPVDVDGVKIPVDYSKPNPNGVEFDNLYLDMNGIIHPCTHPEDKPAPETEDEMFVA 80 (931)
T ss_pred CCchHHHHHHHHhcchhhhhhhhcCcccCCCcccccccCCCCCCcccccceeeecCcccccCCCCCCCCCCccHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHhhccccceEEEEecCCCchhhhHHHHhhhccchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCCccCcc
Q 001223 81 MFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQRSRRFRASKDAADSAAEEERLRQEFEREGRKLPPKSDSQVFDSNVI 160 (1120)
Q Consensus 81 If~yIDrL~~~VRPrKlLyiAIDGVAPRAKMNQQRsRRFrsakea~~~~~~~~~l~~e~~~eg~~lp~~~~~~~fDSN~I 160 (1120)
||+||||||.||||||||||||||||||||||||||||||||||+.++++|++++++|++.+|..||++.++++||||||
T Consensus 81 vFeyiDrlf~mvRPRkLLymAIDGVAPRAKMNQQRsRRFRaaKeaae~~~e~e~~ree~~~~G~~lpp~~~~e~fDSNcI 160 (931)
T KOG2044|consen 81 VFEYIDRLFSMVRPRKLLYMAIDGVAPRAKMNQQRSRRFRAAKEAAEKEAEIERLREEFEAEGKFLPPKVKKETFDSNCI 160 (931)
T ss_pred HHHHHHHHHHhccchheeEEeecccCchhhhhHHHHHHHhhhhHHHHHHHHHHHHHHHHHhcCCcCCchhhccccccCcc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCcHHHHHHHHHHHHHHHHHhcCCCCCcccEEEEcCCCCCCChhhhHHHHHHHhhcCCCCCCCCcEEEEecChhHHHHH
Q 001223 161 TPGTEFMAVLSIALQYYIHLRLNNDPGWEKIKVILSDANVPGEGEHKIMSYVRLQRNLPGYDPNTRHCLYGLDADLIMLA 240 (1120)
Q Consensus 161 TPGT~FM~~Ls~~L~~yI~~kl~~Dp~W~~lkVIlSds~VPGEGEHKImdfIR~qr~~p~ydPNt~HcIYGlDADLImL~ 240 (1120)
||||+||++|+.+|+|||+.|||+||+|+||+||||||+||||||||||+|||+||++|+|||||+|||||+||||||||
T Consensus 161 TPGTpFM~~La~aLrYyI~~rLn~DPgWkNikvIlSDAnVPGEGEHKIM~yIR~QR~~P~~dPNT~HclyGlDADLImLg 240 (931)
T KOG2044|consen 161 TPGTPFMDRLAKALRYYIHDRLNSDPGWKNIKVILSDANVPGEGEHKIMSYIRSQRAQPGYDPNTHHCLYGLDADLIMLG 240 (931)
T ss_pred CCCChHHHHHHHHHHHHHHHhhcCCccccceEEEEecCCCCCcchhHHHHHHHHccCCCCCCCCceeeeecCCccceeee
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhcCCceEEEeecccccCCCCCcccccCCCCCcccccccccccccCCCCcCCCcccCCCCCeEEEehhHHHHHHHHHhcC
Q 001223 241 LATHEVHFSILREVVFTPGQQDKCFLCGQPGHLAANCEGKAKRKAGEFDEKGDEVVVPKKPYQFLHIWTLREYLDYEFRI 320 (1120)
Q Consensus 241 LatHep~f~ILRE~V~~~~~~~~c~~cgQ~gH~~~~c~g~~k~k~~e~~~~~~~~~~~~~~f~~l~I~vLREYL~~el~~ 320 (1120)
|||||+||+||||+|| |+++++|++|||+||.+++|.|+.+ .++.++..+.. ..+++|+||+||||||||+.||.+
T Consensus 241 LATHE~hF~IlRE~~~-P~~~~~C~~cgq~gh~~~dc~g~~~--~~~~~~~~~~~-~~ek~fifl~I~vLREYLe~El~~ 316 (931)
T KOG2044|consen 241 LATHEPHFSILREEFF-PNKPRRCFLCGQTGHEAKDCEGKPR--LGETNELADVP-GVEKPFIFLNISVLREYLERELRM 316 (931)
T ss_pred ccccCCceEEeeeeec-CCCcccchhhcccCCcHhhcCCcCC--cccccccccCc-ccccceEEEEHHHHHHHHHHHhcC
Confidence 9999999999999976 9999999999999999999999866 56666666555 789999999999999999999999
Q ss_pred CCCCCccchhhhhhHHHHhhhhcccCCCCCCCccccccchHHHHHHHHHHHHHhcCCccccCCccCHHHHHHHHHHHhhh
Q 001223 321 PNPPFEIDLECIVDDFIFMCFFVGNDFLPHMPTLEIREGAINLLMAVYKKEFRALGGYLTDGSKPSLRRVEHFIQAVGSY 400 (1120)
Q Consensus 321 ~~~pF~~dlERiIDDfVfLcffvGNDFLPhLPsL~IreGaId~Li~~YKk~L~~~ggYLT~~G~INl~rl~~fl~~L~~~ 400 (1120)
|++||.||+||+||||||||||||||||||||||+|||||||+|+++||+.|++|+||||++|.|||.||+.||+.||.+
T Consensus 317 p~lPf~fd~ER~iDDwVF~CFFvGNDFLPHlPsLeIRegAId~L~~iyk~~~~~~kgYLT~~g~vNL~rve~~~~avg~~ 396 (931)
T KOG2044|consen 317 PNLPFTFDLERAIDDWVFLCFFVGNDFLPHLPSLEIREGAIDRLMEIYKKSFPKMKGYLTDSGKVNLDRVEMFMQAVGSV 396 (931)
T ss_pred CCCCccccHHhhhcceEEEEeeecCccCCCCCchhhhhcHHHHHHHHHHHHHHhhcceeccCCcccHHHHHHHHHHHhhh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCccccccccCCCCCCCCCCCCCccccccccccCCCCCCCCC
Q 001223 401 EDRIFQKRARLHQRHAERIKREKAQARRGDDVEPLAQPDSLVPVSRFHGSRLASGPSPSPYQQSECVGRLDNGKGTSGRP 480 (1120)
Q Consensus 401 Ed~iF~~r~~~e~r~~e~~kr~k~~~~~~~~~~~~~~p~~~~p~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (1120)
||+||++|. |++|+++|+|+..++.+..+ ++++ .+.++.....+ ..+...
T Consensus 397 Ed~IFkkR~----r~~e~frrrk~~rk~~~~~~------------~~sg-~~~~~~~~~~~-------------~~~~p~ 446 (931)
T KOG2044|consen 397 EDDIFKKRQ----RREERFRRRKAARKRQDRNA------------QDSG-TNFSLAGSREL-------------EASEPA 446 (931)
T ss_pred cchHHHHhH----hHHHHHHHhhhhhhhhhhhc------------cccc-ccccccccccc-------------cccchh
Confidence 999999994 45566666555433332211 1122 11111110000 012223
Q ss_pred CcccccCCCcchhhHHHhhhccCCCCccccHHHHHHHHHHHHhhhcccCCCCCchhhhhccCCCchhHHHHHHhcCCCCh
Q 001223 481 HKVPRLSSGATIGAAIVEAENSFETDPQENKEEFKAKLKELLRDKSDAFNSDNPEEDKVKLGEPGWKERYYEEKFNAKTP 560 (1120)
Q Consensus 481 ~k~~~~s~~~~~~~~~~~~e~~~~~~~~~~~eelk~~l~~~~~~k~d~~~~~~~~eD~VkL~e~gwK~rYY~~KF~v~~~ 560 (1120)
+++.+++..++.++++++ .+++++++.+++..++++++.+++++..+|+|||||+|||+|||++||++++.
T Consensus 447 ~~~~~~~~~~~~~~~~~~---------~~~~e~~~~~~~~~~~~k~~~~~~~~~~~D~VkL~e~G~keRYY~~KF~v~~~ 517 (931)
T KOG2044|consen 447 QKALKVSLEKNESAANVE---------RDNTEDLKTKLKHGQRRKSEDSESEEENTDKVKLYEPGWKERYYEEKFDVTPD 517 (931)
T ss_pred hhhhhhccccccchhhhc---------ccchhhccccccccccccCccccCCCCCCcceeecCCchhhhhhhhhcCCCCH
Confidence 344556666666666653 24566777888888888998888999999999999999999999999999876
Q ss_pred hHHHHHHHHHHHHHHhhHHHHhhhhccCcCcccccccCCCCcccccccccccccceecCCCCCChhhhhhhcCCCCCCCC
Q 001223 561 EEREETRKDVVLRYTEGLCWVMHYYYEGVCSWQWFYPYHYAPFASDLKDLDQFNISFELGHPFKPFNQLLGVFPSASSHA 640 (1120)
Q Consensus 561 ~di~~~r~~vv~~YvEGL~WVL~YYyqGcpSW~WYYPYHYAPfaSDl~~l~~~~i~Fe~G~Pf~PfEQLMsVLPaaS~~~ 640 (1120)
+ ++||++||++|||||||||+||||||+||+||||||||||||||++|..++|+|++|+||+||||||+||||||+++
T Consensus 518 e--eq~R~~vv~~YveGLcWVl~YYyqGc~SW~WyYPYHYAPfAsDf~~l~~ldikFe~g~PFkP~eQLmgVlPAAS~~~ 595 (931)
T KOG2044|consen 518 E--EQIRKDVVLKYVEGLCWVLRYYYQGCASWNWYYPYHYAPFASDFKGLSDLDIKFELGKPFKPLEQLMGVLPAASSHA 595 (931)
T ss_pred H--HHHHHHHHHHHhcchhhhhhhhhccccccccccccccchhhhhhhcccccccccccCCCCCcHHHHhhhcchhhcCC
Confidence 6 78999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CcHHHHhhhcCCCCCCcccCCCcccccCCCCcccccceeccCCCChHHHHHHHHhhcCCCCHHHHhhccCCCceEEeecC
Q 001223 641 LPEHYRKLMTDPNSPISDFYPTDFEVDMNGKRYSWQGIAKLPFIDEARLLDEVKKIEHTLTEEEARRNSIMADMLFVLLS 720 (1120)
Q Consensus 641 LP~~~r~LMtdp~SpIiDFYP~dF~ID~NGKk~~WqGVaLLPFIDe~rLl~a~~~~~~~LT~eE~~RN~~g~d~lfv~~~ 720 (1120)
||+.||+||+||+||||||||+||+||||||||+||||||||||||+|||+|+++++++||+||++||+.|.|+|||+.+
T Consensus 596 LPe~~r~LMsdpdSpIiDFYPedF~iDmNGKk~aWQGIalLPFiDe~rLl~a~~~~y~~Lt~EE~~RN~rg~d~Lfi~~~ 675 (931)
T KOG2044|consen 596 LPEEWRKLMSDPDSPIIDFYPEDFEIDMNGKKYAWQGIALLPFIDERRLLSAVAKVYPTLTDEEKRRNSRGPDLLFISDK 675 (931)
T ss_pred CcHHHHhhhcCCCCcccccccccceeeccCceeeccccccccccchhhHHHHHHhhccccCHHHHhccccCCceEEecCC
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCcchhhhhhhhHhhhhhhhhhhhhhhcCCcCcCCCcceeeecCCCCCCCCcccCCCCCCcCCcCCceEEEEecCCCCCC
Q 001223 721 HPLSASIYTLDEHCKQLKIKERVEVKERLNPELSDGMNGYISPCAGDPHPPVFRSPVASMEDIMGNQVLCAIYKLPDAHK 800 (1120)
Q Consensus 721 ~~l~~~i~~l~~~~~~~~~~~~~~~~~~i~~~~s~g~~G~v~~~~~~~~~~~~~sPi~~l~dI~~n~~i~~~y~~P~~~~ 800 (1120)
||++++|.+||..+++ ...++ ..+-..++.|++|.+++++......+++||+..+-++..+..++++|..|..+.
T Consensus 676 hp~~e~i~~lysk~k~-~~~~~----v~~~~~~~p~~~~~~~~~~~~~~~~i~~~p~~~~~~~~~~~~v~~~~~~~~~~e 750 (931)
T KOG2044|consen 676 HPLFEFILQLYSKKKK-SNEKN----VKLAHGVDPGLNGAISKDPEGLESGISKSPPGGLSDYNTNTGVCLKYVDPEYPE 750 (931)
T ss_pred CchHHHHHHHHHhhcc-Ccccc----cccccccCcccceeeccCccccccccccCChhhcccCCccceeeecccCccccc
Confidence 9999999999887765 22221 234455667999999998866556788899999999999999999999998742
Q ss_pred C---CCCCCCCCcccCCCCCcccCCCCCcccCCCCCCCCCCCC-CCCCCCCCCCCccchhHHHHHHhhhc
Q 001223 801 H---VTRPPAGVIFPKKIVQLEDLKPAPVLWHEDSGRRPHENG-RHNPHGTISGRQLGEAAHRLVANSLQ 866 (1120)
Q Consensus 801 h---~s~ll~gv~~P~~~l~~~d~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~l~~a~~r~~~~~~~ 866 (1120)
. .++.|+|++.|.++|+ |-||...++ ..+.++ +|+.. ..+..+.+++||.|.+.+.
T Consensus 751 d~~~~a~~l~G~~~p~~~lk-------P~dwn~sR~-~~~~~~~~~~~~--~~~~~~~~~~~~~v~~~~~ 810 (931)
T KOG2044|consen 751 DYIFPAIRLDGAKEPEKVLK-------PDDWNDRRD-GRYQPQAGFNRN--APRGSLPQSGHRQVHHYVR 810 (931)
T ss_pred cccchhhhcCCCCCCccccC-------cchhhhhhh-hhcCcccccccC--CCCCcCCcccccccCcCCC
Confidence 2 4788888888888888 346765433 234444 66665 4478999999999888887
|
|
| >COG5049 XRN1 5'-3' exonuclease [DNA replication, recombination, and repair / Cell division and chromosome partitioning / Translation] | Back alignment and domain information |
|---|
| >KOG2045 consensus 5'-3' exonuclease XRN1/KEM1/SEP1 involved in DNA strand exchange and mRNA turnover [Replication, recombination and repair; Cell cycle control, cell division, chromosome partitioning] | Back alignment and domain information |
|---|
| >PF03159 XRN_N: XRN 5'-3' exonuclease N-terminus; InterPro: IPR004859 Signatures of this entry align residues towards the N terminus of several proteins with multiple functions | Back alignment and domain information |
|---|
| >cd00128 XPG Xeroderma pigmentosum G N- and I-regions (XPGN, XPGI); contains the HhH2 motif; domain in nucleases | Back alignment and domain information |
|---|
| >PTZ00217 flap endonuclease-1; Provisional | Back alignment and domain information |
|---|
| >KOG2044 consensus 5'-3' exonuclease HKE1/RAT1 [Replication, recombination and repair; RNA processing and modification] | Back alignment and domain information |
|---|
| >PRK03980 flap endonuclease-1; Provisional | Back alignment and domain information |
|---|
| >TIGR03674 fen_arch flap structure-specific endonuclease | Back alignment and domain information |
|---|
| >cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes | Back alignment and domain information |
|---|
| >smart00475 53EXOc 5'-3' exonuclease | Back alignment and domain information |
|---|
| >PF00752 XPG_N: XPG N-terminal domain; InterPro: IPR006085 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer | Back alignment and domain information |
|---|
| >PRK14976 5'-3' exonuclease; Provisional | Back alignment and domain information |
|---|
| >PF00867 XPG_I: XPG I-region; InterPro: IPR006086 This entry represents endonucleases that cleave the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment | Back alignment and domain information |
|---|
| >smart00485 XPGN Xeroderma pigmentosum G N-region | Back alignment and domain information |
|---|
| >PF00098 zf-CCHC: Zinc knuckle; InterPro: IPR001878 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule | Back alignment and domain information |
|---|
| >PRK05755 DNA polymerase I; Provisional | Back alignment and domain information |
|---|
| >KOG2519 consensus 5'-3' exonuclease [Replication, recombination and repair] | Back alignment and domain information |
|---|
| >COG0258 Exo 5'-3' exonuclease (including N-terminal domain of PolI) [DNA replication, recombination, and repair] | Back alignment and domain information |
|---|
Homologous Structure Templates
Structure Templates Detected by BLAST 
Original result of BLAST against Protein Data Bank
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() | |
| Query | 1120 | ||||
| 3fqd_A | 899 | Crystal Structure Of The S. Pombe Rat1-Rai1 Complex | 1e-162 | ||
| 3pie_A | 1155 | Crystal Structure Of The 5'->3' Exoribonuclease Xrn | 4e-73 | ||
| 3pie_A | 1155 | Crystal Structure Of The 5'->3' Exoribonuclease Xrn | 5e-39 | ||
| 2y35_A | 1140 | Crystal Structure Of Xrn1-Substrate Complex Length | 2e-72 | ||
| 2y35_A | 1140 | Crystal Structure Of Xrn1-Substrate Complex Length | 4e-40 |
| >pdb|3FQD|A Chain A, Crystal Structure Of The S. Pombe Rat1-Rai1 Complex Length = 899 | Back alignment and structure |
|
| >pdb|3PIE|A Chain A, Crystal Structure Of The 5'->3' Exoribonuclease Xrn1, E178q Mutant Length = 1155 | Back alignment and structure |
| >pdb|3PIE|A Chain A, Crystal Structure Of The 5'->3' Exoribonuclease Xrn1, E178q Mutant Length = 1155 | Back alignment and structure |
| >pdb|2Y35|A Chain A, Crystal Structure Of Xrn1-Substrate Complex Length = 1140 | Back alignment and structure |
| >pdb|2Y35|A Chain A, Crystal Structure Of Xrn1-Substrate Complex Length = 1140 | Back alignment and structure |
Structure Templates Detected by RPS-BLAST 
Original result of RPS-BLAST against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| Query | 1120 | |||
| 3fqd_A | 899 | Protein DHP1, 5'-3' exoribonuclease 2; protein-pro | 0.0 | |
| 3pie_A | 1155 | 5'->3' exoribonuclease (XRN1); beta berrel, tudor | 0.0 | |
| 2y35_A | 1140 | LD22664P; hydrolase-DNA complex, RNA degradation, | 1e-138 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 5e-09 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 2e-06 | |
| 1vt4_I | 1221 | APAF-1 related killer DARK; drosophila apoptosome, | 3e-05 | |
| 1m2v_B | 926 | SEC24, protein transport protein SEC24, SEC24P, SE | 1e-07 | |
| 1m2v_B | 926 | SEC24, protein transport protein SEC24, SEC24P, SE | 2e-07 | |
| 1m2v_B | 926 | SEC24, protein transport protein SEC24, SEC24P, SE | 4e-05 | |
| 1twf_A | 1733 | B220, DNA-directed RNA polymerase II largest subun | 5e-05 |
| >3fqd_A Protein DHP1, 5'-3' exoribonuclease 2; protein-protein complex, exonuclease, hydrolase, mRNA proces nuclease, nucleus, rRNA processing, transcription; 2.20A {Schizosaccharomyces pombe} Length = 899 | Back alignment and structure |
|---|
Score = 669 bits (1725), Expect = 0.0
Identities = 345/911 (37%), Positives = 485/911 (53%), Gaps = 87/911 (9%)
Query: 1 MGVPAFYRWLAEKYPLVVADVIEEEPVVI-DGVKIPVDTSKPNPNGLEYDNLYLDMNGII 59
MGVPA +R L+ K+ V+ VIE + DG +I D S PNPNG+E DNLYLDMNGI+
Sbjct: 4 MGVPALFRLLSRKFAKVITPVIEAPTEKLPDGTEIEPDLSLPNPNGVECDNLYLDMNGIV 63
Query: 60 HPCFHPEDRPAPTTFDEVFQCMFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQRSRRF 119
HPC HPEDRPAP T DE+ +F+Y DR+ MVRPR+LL++AIDGVAPRAKMNQQRSRRF
Sbjct: 64 HPCSHPEDRPAPETEDEMMVAVFEYTDRILAMVRPRQLLFIAIDGVAPRAKMNQQRSRRF 123
Query: 120 RASKDAADSAAEEERLRQEFEREGRKLPPKS-DSQVFDSNVITPGTEFMAVLSIALQYYI 178
R+S++AA E + +E +++G + + + +DSN ITPGT FM L+ +L+YYI
Sbjct: 124 RSSREAALKEEELQAFIEEAKQQGIPIDENATKKKSWDSNCITPGTPFMDTLAKSLRYYI 183
Query: 179 HLRLNNDPGWEKIKVILSDANVPGEGEHKIMSYVRLQRNLPGYDPNTRHCLYGLDADLIM 238
+LN+DP W ++ ILSDA+VPGEGEHKIM ++R QR P YDPNT H +YGLDADLIM
Sbjct: 184 INKLNSDPCWRNVRFILSDASVPGEGEHKIMEFIRSQRVKPEYDPNTHHVVYGLDADLIM 243
Query: 239 LALATHEVHFSILREVVFTPGQQDKCFLCGQPGHLAANCEGKAKRKAGEFDEKGDEVVVP 298
L LATHE HF +LRE VF F G G + D V
Sbjct: 244 LGLATHEPHFRVLREDVF--------FQQGSTKKTKEERLGIKR----LDDVSETNKVPV 291
Query: 299 KKPYQFLHIWTLREYLDYEFRIPNPPFEIDLECIVDDFIFMCFFVGNDFLPHMPTLEIRE 358
KKP+ +L++ LREYL+ E +PN PF DLE +DD++F FFVGNDFLPH+P+L+IR+
Sbjct: 292 KKPFIWLNVSILREYLEVELYVPNLPFPFDLERAIDDWVFFIFFVGNDFLPHLPSLDIRD 351
Query: 359 GAINLLMAVYKKEFRALGGYLTDGSKPSLRRVEHFIQAVGSYEDRIFQKRARLHQRHAER 418
GA+ L +++ +GGYLT +L R E + AVG+ ED IF++ + R E
Sbjct: 352 GAVERLTEIWRASLPHMGGYLTLDGSVNLARAEVILSAVGNQEDDIFKRLKQQEDRRNEN 411
Query: 419 IKREKAQARRGDD--------VEPLAQPDSLVPVSRFHGSRLASGPS------------- 457
+R + + + ++ + S V+ + + + P
Sbjct: 412 YRRRQQRESNQESESYVDNVVIQRSVETQSTEVVTSSKSTSVDTKPPKKTQKIDAPAPVD 471
Query: 458 --PSPYQQSECVGRLDNGKGTSGRP-----------------------HKVPRLSSGATI 492
+ S N + + R K +S+
Sbjct: 472 LVNLSEKTSNRSLGATNRELINNRAANRLGLSREAAAVSSVNKLAASALKAQLVSNETLQ 531
Query: 493 GAAIVEAENSFETDPQENKEEFKAKLKELLRDKSDAFNSDNPEEDKVKLGEP-------- 544
+ ++ S + E + + K D + E
Sbjct: 532 NVPLEDSIASSSAYEDTDSIESSTPVVHPIDTKVSNVGQKRKAPDSTEENENTDTVRLYE 591
Query: 545 -GWKERYYEEKFNAKTPEEREETRKDVVLRYTEGLCWVMHYYYEGVCSWQWFYPYHYAPF 603
G++ERYYE+KF+ E E ++ V Y GLCWV+ YYY+G SW W+YPYHYAPF
Sbjct: 592 PGYRERYYEQKFHISPDE--PEKIREAVKHYVHGLCWVLLYYYQGCPSWTWYYPYHYAPF 649
Query: 604 ASDLKDLDQFNISFELGHPFKPFNQLLGVFPSASSHALPEHYRKLMTDPNSPISDFYPTD 663
A+D KDL ++ FEL PFKP+ QLLGV P+AS + LPE + LMTD NS I DFYP +
Sbjct: 650 AADFKDLASIDVKFELNQPFKPYEQLLGVLPAASKNNLPEKLQTLMTDENSEIIDFYPEN 709
Query: 664 FEVDMNGKRYSWQGIAKLPFIDEARLLDEVKKIEHTLTEEEARRNSIMADMLFVLLSHPL 723
F +D+NGK++ WQG+A LPFIDE RLL+ V KI LTEEE++RN + +LF+ HP+
Sbjct: 710 FTIDLNGKKFEWQGVALLPFIDENRLLNAVSKIYPQLTEEESKRNEDGSTLLFISEHHPM 769
Query: 724 SASIYTLDEHCKQLKIKERVEVKERLNPELSDGMNGYISPCAGDPHPPVFRSPV------ 777
+ + KQL K+R +L+ +++ G+ G ++ + P+
Sbjct: 770 FSEL------VKQLYSKKRQGKPLKLSGKMAHGLFGKVNTNDSVIPNVSVQCPIDVTSAD 823
Query: 778 --ASMEDIMGNQVLCAIYKLPD-AHKHVTRPPAGVIFPKKIVQLEDLKPAPVLWHEDSGR 834
I NQ + ++++P H + GV P +++ ED+ S R
Sbjct: 824 ALQKYGSIDDNQSISLVFEVPKSHFVHKSMLLRGVKMPNRVLTPEDINQVR-AERSFSSR 882
Query: 835 RPHENGRHNPH 845
R + N
Sbjct: 883 RNNGNSAAALE 893
|
| >3pie_A 5'->3' exoribonuclease (XRN1); beta berrel, tudor domain, chromo domain, mRNA turnover, RRN processing, RNA binding, DNA binding; 2.90A {Kluyveromyces lactis} PDB: 3pif_A Length = 1155 | Back alignment and structure |
|---|
| >2y35_A LD22664P; hydrolase-DNA complex, RNA degradation, exonuclease 5'-3', R interference; 3.20A {Drosophila melanogaster} Length = 1140 | Back alignment and structure |
|---|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
| >1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 | Back alignment and structure |
|---|
| >1m2v_B SEC24, protein transport protein SEC24, SEC24P, SEC24 protein, abnormal nuclear; zinc-finger, beta barrel, VWA domain, gelsolin domain,; 2.75A {Saccharomyces cerevisiae} SCOP: a.71.2.1 b.2.8.1 c.62.1.2 d.109.2.1 g.41.10.1 Length = 926 | Back alignment and structure |
|---|
| >1m2v_B SEC24, protein transport protein SEC24, SEC24P, SEC24 protein, abnormal nuclear; zinc-finger, beta barrel, VWA domain, gelsolin domain,; 2.75A {Saccharomyces cerevisiae} SCOP: a.71.2.1 b.2.8.1 c.62.1.2 d.109.2.1 g.41.10.1 Length = 926 | Back alignment and structure |
|---|
| >1m2v_B SEC24, protein transport protein SEC24, SEC24P, SEC24 protein, abnormal nuclear; zinc-finger, beta barrel, VWA domain, gelsolin domain,; 2.75A {Saccharomyces cerevisiae} SCOP: a.71.2.1 b.2.8.1 c.62.1.2 d.109.2.1 g.41.10.1 Length = 926 | Back alignment and structure |
|---|
| >1twf_A B220, DNA-directed RNA polymerase II largest subunit; transcription, mRNA, multiprotein complex; HET: UTP; 2.30A {Saccharomyces cerevisiae} SCOP: e.29.1.2 PDB: 1i3q_A 1i6h_A 1k83_A* 1nik_A 1nt9_A 1pqv_A 1r5u_A 1r9s_A* 1r9t_A* 1sfo_A* 1twa_A* 1twc_A* 1i50_A* 1twg_A* 1twh_A* 1wcm_A 1y1v_A 1y1w_A 1y1y_A 1y77_A* ... Length = 1733 | Back alignment and structure |
|---|
Structure Templates Detected by HHsearch 
Original result of HHsearch against PDB70 database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 1120 | |||
| 3fqd_A | 899 | Protein DHP1, 5'-3' exoribonuclease 2; protein-pro | 100.0 | |
| 3pie_A | 1155 | 5'->3' exoribonuclease (XRN1); beta berrel, tudor | 100.0 | |
| 2y35_A | 1140 | LD22664P; hydrolase-DNA complex, RNA degradation, | 100.0 | |
| 1b43_A | 340 | Protein (FEN-1); nuclease, DNA repair, DNA replica | 98.45 | |
| 1rxw_A | 336 | Flap structure-specific endonuclease; helical clam | 98.16 | |
| 1a76_A | 326 | Flap endonuclease-1 protein; 5'-3' EXO/endo nuclea | 98.16 | |
| 3qe9_Y | 352 | Exonuclease 1; exonuclease, hydrolase-DNA complex; | 98.07 | |
| 3ory_A | 363 | Flap endonuclease 1; hydrolase; 2.00A {Desulfuroco | 97.83 | |
| 1ul1_X | 379 | Flap endonuclease-1; protein complex, DNA-binding | 97.49 | |
| 2izo_A | 346 | FEN1, flap structure-specific endonuclease; hydrol | 97.34 | |
| 3q8k_A | 341 | Flap endonuclease 1; helix-3 turn-helix, hydrophob | 97.2 | |
| 1exn_A | 290 | 5'-exonuclease, 5'-nuclease; hydrolase; 2.50A {Ent | 96.14 | |
| 1dsq_A | 26 | Nucleic acid binding protein P14; CCHC type zinc f | 85.0 | |
| 1nc8_A | 29 | Nucleocapsid protein; HIV-2, RNA recognition, zinc | 84.82 | |
| 1a6b_B | 40 | Momulv, zinc finger protein NCP10; nucleocapsid pr | 83.37 |
| >3fqd_A Protein DHP1, 5'-3' exoribonuclease 2; protein-protein complex, exonuclease, hydrolase, mRNA proces nuclease, nucleus, rRNA processing, transcription; 2.20A {Schizosaccharomyces pombe} | Back alignment and structure |
|---|
Probab=100.00 E-value=1.8e-233 Score=2066.42 Aligned_cols=802 Identities=43% Similarity=0.760 Sum_probs=634.3
Q ss_pred CcchhHHHHHHhhCCcccccccccCcccc-CCccccCCCCCCCCCCCccCeEEEeccccccccccCCCCCCCCCHHHHHH
Q 001223 1 MGVPAFYRWLAEKYPLVVADVIEEEPVVI-DGVKIPVDTSKPNPNGLEYDNLYLDMNGIIHPCFHPEDRPAPTTFDEVFQ 79 (1120)
Q Consensus 1 MGVP~FfRWL~~rYP~ii~~viE~~~~~~-dG~~iP~d~s~pNPNg~efDNLYLDMNGIIH~C~Hpe~~~~p~te~Emf~ 79 (1120)
||||+|||||++|||+|+++|+|+.|.++ ||..||+|+++|||||.||||||||||||||+|+|+++.+.+.|++|||.
T Consensus 4 MGVP~ffRWLs~rYP~i~~~~~e~~~~~~~~g~~~p~d~~~pnpng~e~DnLYlDmNgIIH~c~h~~~~~~~~te~e~~~ 83 (899)
T 3fqd_A 4 MGVPALFRLLSRKFAKVITPVIEAPTEKLPDGTEIEPDLSLPNPNGVECDNLYLDMNGIVHPCSHPEDRPAPETEDEMMV 83 (899)
T ss_dssp CCHHHHHHHHHHHCGGGEEECCCCCCEECTTSCEECCCTTSCCTTSSCEEEEEEETHHHHHHHHSCSSSCCCCSHHHHHH
T ss_pred cchHHHHHHHHHHCCccchhhcccccccccCCccccccccCCCCCCcccceEEEecchhhhhhcCCCCCCCCCCHHHHHH
Confidence 99999999999999999999999999888 89999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhhccccceEEEEecCCCchhhhHHHHhhhccchhHHHHHHHHHHHHHHHHHHhCCCCCCC--CCCCCCcc
Q 001223 80 CMFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQRSRRFRASKDAADSAAEEERLRQEFEREGRKLPPK--SDSQVFDS 157 (1120)
Q Consensus 80 ~If~yIDrL~~~VRPrKlLyiAIDGVAPRAKMNQQRsRRFrsakea~~~~~~~~~l~~e~~~eg~~lp~~--~~~~~fDS 157 (1120)
+||+|||+||.+|||||+||||||||||||||||||+||||+|++++++.++.++++++++++| .+++. ...+.|||
T Consensus 84 ~If~yid~l~~~vrPrklly~AiDGVAPrAKmnQQRsRRfrsa~~~~~~~~~~~~~~~~~~~~g-~~~~~~~~~~~~fDs 162 (899)
T 3fqd_A 84 AVFEYTDRILAMVRPRQLLFIAIDGVAPRAKMNQQRSRRFRSSREAALKEEELQAFIEEAKQQG-IPIDENATKKKSWDS 162 (899)
T ss_dssp HHHHHHHHHHHHHCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT-CCBCHHHHSCCCCCG
T ss_pred HHHHHHHHHHHHcCcceeEEEeecCCCCchHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHhcC-CCCccccccccCCCc
Confidence 9999999999999999999999999999999999999999999999998888888888998888 33321 24578999
Q ss_pred CcccCCcHHHHHHHHHHHHHHHHHhcCCCCCcccEEEEcCCCCCCChhhhHHHHHHHhhcCCCCCCCCcEEEEecChhHH
Q 001223 158 NVITPGTEFMAVLSIALQYYIHLRLNNDPGWEKIKVILSDANVPGEGEHKIMSYVRLQRNLPGYDPNTRHCLYGLDADLI 237 (1120)
Q Consensus 158 N~ITPGT~FM~~Ls~~L~~yI~~kl~~Dp~W~~lkVIlSds~VPGEGEHKImdfIR~qr~~p~ydPNt~HcIYGlDADLI 237 (1120)
|||||||+||++|+.+|+|||+.||++||+|++++|||||++||||||||||+|||+||++|+|||||+|||||+|||||
T Consensus 163 N~ITPGT~FM~~L~~~L~~~i~~kl~~dp~W~~~~VIlSd~~vPGEGEHKIm~fIR~~r~~p~ydpN~~HcIyGlDADLI 242 (899)
T 3fqd_A 163 NCITPGTPFMDTLAKSLRYYIINKLNSDPCWRNVRFILSDASVPGEGEHKIMEFIRSQRVKPEYDPNTHHVVYGLDADLI 242 (899)
T ss_dssp GGSSTTSHHHHHHHHHHHHHHHHHHTSCGGGTTCEEEEECTTSCSCHHHHHHHHHHHHHTSTTSCTTCCEEEECCCTTHH
T ss_pred CccCCccHHHHHHHHHHHHHHHHHhhcCcccccceEEEeCCCCCCccHHHHHHHHHHHhcCCCCCCCCeEEEEccCccHh
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhhcCCceEEEeecccccCCCCCcccccCCCCCcccccccccccccCCCCcCCCcccCCCCCeEEEehhHHHHHHHHH
Q 001223 238 MLALATHEVHFSILREVVFTPGQQDKCFLCGQPGHLAANCEGKAKRKAGEFDEKGDEVVVPKKPYQFLHIWTLREYLDYE 317 (1120)
Q Consensus 238 mL~LatHep~f~ILRE~V~~~~~~~~c~~cgQ~gH~~~~c~g~~k~k~~e~~~~~~~~~~~~~~f~~l~I~vLREYL~~e 317 (1120)
||||+||||||+||||+|++.. +|.||.+.+|.|+.|. ++++++. .....++|+||||++|||||+.|
T Consensus 243 mL~LatHep~f~ILRE~v~~~~--------~q~~~~~~~~~~~~k~--~~~~~~~--~~~~~~~f~~l~i~iLREYL~~E 310 (899)
T 3fqd_A 243 MLGLATHEPHFRVLREDVFFQQ--------GSTKKTKEERLGIKRL--DDVSETN--KVPVKKPFIWLNVSILREYLEVE 310 (899)
T ss_dssp HHHHHTTCSSEEEEEECCC-----------------CTTTTTCCBT--TC------------CCEEEEEHHHHHHHHHHH
T ss_pred HHhhhccCCceEEEeeecccCc--------Cccccchhhhcccccc--ccccccc--cccccCceEEEeHHHHHHHHHHH
Confidence 9999999999999999998753 6889999999997665 4555432 23456899999999999999999
Q ss_pred hcCCCCCCccchhhhhhHHHHhhhhcccCCCCCCCccccccchHHHHHHHHHHHHHhcCCccccCCccCHHHHHHHHHHH
Q 001223 318 FRIPNPPFEIDLECIVDDFIFMCFFVGNDFLPHMPTLEIREGAINLLMAVYKKEFRALGGYLTDGSKPSLRRVEHFIQAV 397 (1120)
Q Consensus 318 l~~~~~pF~~dlERiIDDfVfLcffvGNDFLPhLPsL~IreGaId~Li~~YKk~L~~~ggYLT~~G~INl~rl~~fl~~L 397 (1120)
|.++++||++|+|||||||||||||||||||||||+|+|++||||+|+.+||++|+++|||||++|+|||+||+.||+.|
T Consensus 311 ~~~~~~~f~~d~ERiIDDfVfmcFfvGNDFLPhlP~l~I~egaid~L~~~Yk~~l~~~~gYlt~~G~inl~rl~~~l~~L 390 (899)
T 3fqd_A 311 LYVPNLPFPFDLERAIDDWVFFIFFVGNDFLPHLPSLDIRDGAVERLTEIWRASLPHMGGYLTLDGSVNLARAEVILSAV 390 (899)
T ss_dssp HCCTTCSSCCCHHHHHHHHHHHGGGGCCSSSCCCTTCCGGGTHHHHHHHHHHHHHHHHSSCSEETTEECHHHHHHHHHHH
T ss_pred hcccCCCCCchhhhhhhhhhhhhHhhCcccCCCCCccCcCCChHHHHHHHHHHHHHHcCCeeecCCEEcHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC------CCC--CCCCCc---cc-------cccccCCCCCCCCCCC
Q 001223 398 GSYEDRIFQKRARLHQRHAERIKREKAQARRGDDV------EPL--AQPDSL---VP-------VSRFHGSRLASGPSPS 459 (1120)
Q Consensus 398 ~~~Ed~iF~~r~~~e~r~~e~~kr~k~~~~~~~~~------~~~--~~p~~~---~p-------~~~~~~~~l~~~~~~~ 459 (1120)
|++|++||++|.+.++++.++.+|++.+.+..... ... ..+... .+ .....+......+...
T Consensus 391 ~~~E~~iF~~r~~~e~~~~~~~krr~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (899)
T 3fqd_A 391 GNQEDDIFKRLKQQEDRRNENYRRRQQRESNQESESYVDNVVIQRSVETQSTEVVTSSKSTSVDTKPPKKTQKIDAPAPV 470 (899)
T ss_dssp HTTHHHHHHHHHHHHTTTC-------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhcccccccccccccccccccccccccccccccccccccccccCcccc
Confidence 99999999999998887777777665443211110 000 000000 00 0000000000000000
Q ss_pred CCccc--c---------ccccccCCC-CCCCCCCcccccCCCcchhhHHHhhh-------ccCCCCcc-cc--------H
Q 001223 460 PYQQS--E---------CVGRLDNGK-GTSGRPHKVPRLSSGATIGAAIVEAE-------NSFETDPQ-EN--------K 511 (1120)
Q Consensus 460 ~~~~~--~---------~~~~~~~~~-~~~~~~~k~~~~s~~~~~~~~~~~~e-------~~~~~~~~-~~--------~ 511 (1120)
++... . ..+.+.+.. ......+.....+..+..+++.++.. .....++. +. .
T Consensus 471 ~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~~~n~~~a~~lk~~l~~~~~~~~~~~~~~~~~~~~~e~~~~ 550 (899)
T 3fqd_A 471 DLVNLSEKTSNRSLGATNRELINNRAANRLGLSREAAAVSSVNKLAASALKAQLVSNETLQNVPLEDSIASSSAYEDTDS 550 (899)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred ccccccccccccchhhhhhhhhhcchhhhhhhhhhhhhhcccchhHHHHHHHHhhccccccccccccccccccccccccc
Confidence 00000 0 000000000 00000000000000000001111100 00000000 00 0
Q ss_pred HHHHHHHHHHHhh-------hcc--cCCCCCchhhhhccCCCchhHHHHHHhcCCCChhHHHHHHHHHHHHHHhhHHHHh
Q 001223 512 EEFKAKLKELLRD-------KSD--AFNSDNPEEDKVKLGEPGWKERYYEEKFNAKTPEEREETRKDVVLRYTEGLCWVM 582 (1120)
Q Consensus 512 eelk~~l~~~~~~-------k~d--~~~~~~~~eD~VkL~e~gwK~rYY~~KF~v~~~~di~~~r~~vv~~YvEGL~WVL 582 (1120)
.+......+.+++ ++. ....+.+.+|.|+|||+|||+|||++||+++..+ ++++++||++|||||||||
T Consensus 551 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~v~l~e~g~K~rYY~~KF~~~~~d--~e~~~~v~~~YvEGL~WVL 628 (899)
T 3fqd_A 551 IESSTPVVHPIDTKVSNVGQKRKAPDSTEENENTDTVRLYEPGYRERYYEQKFHISPDE--PEKIREAVKHYVHGLCWVL 628 (899)
T ss_dssp ----------------------------------CCSCTTSTTHHHHHHHHTSCCCTTC--THHHHHHHHHHHHHHHHHH
T ss_pred ccccccchhhhhccccccccccccccccccccchhhhhhcccchHHHHHHHhcCCCCCC--HHHHHHHHHHHHHHHHHHH
Confidence 0000000000000 000 0011123568899999999999999999997644 4789999999999999999
Q ss_pred hhhccCcCcccccccCCCCcccccccccccccceecCCCCCChhhhhhhcCCCCCCCCCcHHHHhhhcCCCCCCcccCCC
Q 001223 583 HYYYEGVCSWQWFYPYHYAPFASDLKDLDQFNISFELGHPFKPFNQLLGVFPSASSHALPEHYRKLMTDPNSPISDFYPT 662 (1120)
Q Consensus 583 ~YYyqGcpSW~WYYPYHYAPfaSDl~~l~~~~i~Fe~G~Pf~PfEQLMsVLPaaS~~~LP~~~r~LMtdp~SpIiDFYP~ 662 (1120)
+|||||||||+||||||||||||||+++.+++|+|++|+||+|||||||||||+|+++||++||+||+||+|||+||||+
T Consensus 629 ~YYYqGc~SW~WyYPYHYAPfaSD~~~~~~~~i~Fe~G~PF~PfEQLMaVLPaaS~~~LP~~y~~LMtdp~SpIiDFYP~ 708 (899)
T 3fqd_A 629 LYYYQGCPSWTWYYPYHYAPFAADFKDLASIDVKFELNQPFKPYEQLLGVLPAASKNNLPEKLQTLMTDENSEIIDFYPE 708 (899)
T ss_dssp HHHHTCCSCSSCCCCCSSCCCGGGCCSCTTCCCCCCCCCCCCHHHHHHHHCCGGGGGGSCHHHHHHHHCTTCTTGGGCCS
T ss_pred HHHhCCCCCCcccCcccccchHHHHhhcccCceecCCCCCCChHHHHHhhcCHhhhCcccHHHHHHhcCCCCcchhhCcc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cccccCCCCcccccceeccCCCChHHHHHHHHhhcCCCCHHHHhhccCCCceEEeecCCCcchhhhhhhhHhhhhhhhhh
Q 001223 663 DFEVDMNGKRYSWQGIAKLPFIDEARLLDEVKKIEHTLTEEEARRNSIMADMLFVLLSHPLSASIYTLDEHCKQLKIKER 742 (1120)
Q Consensus 663 dF~ID~NGKk~~WqGVaLLPFIDe~rLl~a~~~~~~~LT~eE~~RN~~g~d~lfv~~~~~l~~~i~~l~~~~~~~~~~~~ 742 (1120)
+|+|||||||++||||||||||||+|||+||++++++||+||++||++|.++||++++||+++.+ ++.+|++++
T Consensus 709 dFeiDmNGKk~~WqgVvlLPFIDE~RLL~A~~~~~~~LT~eEk~RN~~G~~~lf~~~~~~l~~~~------~~~~y~~~~ 782 (899)
T 3fqd_A 709 NFTIDLNGKKFEWQGVALLPFIDENRLLNAVSKIYPQLTEEESKRNEDGSTLLFISEHHPMFSEL------VKQLYSKKR 782 (899)
T ss_dssp CCCEECSSSSCTTSSEECCCCCCHHHHHHHHHTTGGGSCHHHHHTTSCCCCEEEEETTSTTHHHH------HHHHTSSSC
T ss_pred cccccCCCCccccceEEecCccCHHHHHHHHHHhhccCCHHHHHhccCCCcEEEEeCCCchHHHH------HHHhccccc
Confidence 99999999999999999999999999999999999999999999999999999999999998754 345565555
Q ss_pred hhhhhcCCcCcCCCcceeeecCCCCCCCCcccCCCC--CCcC------CcCCceEEEEecCCC-CCCCCCCCCCCCcccC
Q 001223 743 VEVKERLNPELSDGMNGYISPCAGDPHPPVFRSPVA--SMED------IMGNQVLCAIYKLPD-AHKHVTRPPAGVIFPK 813 (1120)
Q Consensus 743 ~~~~~~i~~~~s~g~~G~v~~~~~~~~~~~~~sPi~--~l~d------I~~n~~i~~~y~~P~-~~~h~s~ll~gv~~P~ 813 (1120)
...+++|||++|+||+|+|++++.+.+.++|.|||. +||| |.+|++|||.|++|+ .+.|+|+||+||++|+
T Consensus 783 ~~~~~~~~~~~s~gl~G~v~~~~~~~p~~~~~sP~~~~~~~~~~~~~~i~~n~~~~~~y~~P~~~~~~~s~lL~g~~~p~ 862 (899)
T 3fqd_A 783 QGKPLKLSGKMAHGLFGKVNTNDSVIPNVSVQCPIDVTSADALQKYGSIDDNQSISLVFEVPKSHFVHKSMLLRGVKMPN 862 (899)
T ss_dssp SSSCEECCGGGGTTCCSEEEECTTSCSCCCBCCSSCCCSHHHHHHSCCBSSCCEEEEEEECCCCSSCCCCSCCTTCCCCC
T ss_pred cCcccccCcccccCcceEEEeCCCcCCCCeecCCCCccCCCCccccccccCCcEEEEEEeCCCCCCCcCCcCCCCCCCCC
Confidence 555688999999999999999999888899999997 9999 999999999999999 6999999999999999
Q ss_pred CCCCcccCCC
Q 001223 814 KIVQLEDLKP 823 (1120)
Q Consensus 814 ~~l~~~d~~~ 823 (1120)
++|++.|++.
T Consensus 863 ~~l~~~D~~~ 872 (899)
T 3fqd_A 863 RVLTPEDINQ 872 (899)
T ss_dssp CCCCHHHHHH
T ss_pred CCCCHHHHHH
Confidence 9999999883
|
| >3pie_A 5'->3' exoribonuclease (XRN1); beta berrel, tudor domain, chromo domain, mRNA turnover, RRN processing, RNA binding, DNA binding; 2.90A {Kluyveromyces lactis} PDB: 3pif_A | Back alignment and structure |
|---|
| >2y35_A LD22664P; hydrolase-DNA complex, RNA degradation, exonuclease 5'-3', R interference; 3.20A {Drosophila melanogaster} | Back alignment and structure |
|---|
| >1b43_A Protein (FEN-1); nuclease, DNA repair, DNA replication, transferase; 2.00A {Pyrococcus furiosus} SCOP: a.60.7.1 c.120.1.2 PDB: 1mc8_A | Back alignment and structure |
|---|
| >1rxw_A Flap structure-specific endonuclease; helical clamp, helix-3 turn-helix, hydrophobic wedge, 3' FLA site, hydrolase-DNA complex; 2.00A {Archaeoglobus fulgidus} SCOP: a.60.7.1 c.120.1.2 PDB: 1rxv_A | Back alignment and structure |
|---|
| >1a76_A Flap endonuclease-1 protein; 5'-3' EXO/endo nuclease, DNA replication, RTH, RAD27, DNA repair; 2.00A {Methanocaldococcus jannaschii} SCOP: a.60.7.1 c.120.1.2 PDB: 1a77_A | Back alignment and structure |
|---|
| >3qe9_Y Exonuclease 1; exonuclease, hydrolase-DNA complex; HET: DNA; 2.51A {Homo sapiens} PDB: 3qeb_Z* 3qea_Z* | Back alignment and structure |
|---|
| >3ory_A Flap endonuclease 1; hydrolase; 2.00A {Desulfurococcus amylolyticus} | Back alignment and structure |
|---|
| >1ul1_X Flap endonuclease-1; protein complex, DNA-binding protein, flap DNA, flap endonuclease, sliding clamp, DNA clamp; 2.90A {Homo sapiens} SCOP: a.60.7.1 c.120.1.2 | Back alignment and structure |
|---|
| >2izo_A FEN1, flap structure-specific endonuclease; hydrolase, DNA repair, DNA-binding, endonuclease, metal-BIND excision repair, DNA replication, PCNA; HET: DNA; 2.9A {Sulfolobus solfataricus} | Back alignment and structure |
|---|
| >3q8k_A Flap endonuclease 1; helix-3 turn-helix, hydrophobic wedge, 3' flap binding site, hydrolase-DNA complex, DNA repair, replication; HET: DNA; 2.20A {Homo sapiens} PDB: 3q8l_A* 3q8m_A* | Back alignment and structure |
|---|
| >1exn_A 5'-exonuclease, 5'-nuclease; hydrolase; 2.50A {Enterobacteria phage T5} SCOP: a.60.7.1 c.120.1.2 PDB: 1ut5_A 1ut8_A 1xo1_A | Back alignment and structure |
|---|
| >1dsq_A Nucleic acid binding protein P14; CCHC type zinc finger, virus/viral protein; NMR {Mouse mammary tumor virus} SCOP: g.40.1.1 | Back alignment and structure |
|---|
| >1nc8_A Nucleocapsid protein; HIV-2, RNA recognition, zinc finger, viral protein; NMR {Human immunodeficiency virus 2} SCOP: g.40.1.1 PDB: 2di2_A | Back alignment and structure |
|---|
| >1a6b_B Momulv, zinc finger protein NCP10; nucleocapsid protein, intercalation, nucleic acid, retrovirus, viral protein/DNA complex; HET: DNA; NMR {Synthetic} SCOP: g.40.1.1 | Back alignment and structure |
|---|
Homologous Structure Domains
Structure Domains Detected by RPS-BLAST 
Original result of RPS-BLAST against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
E-value ![]() |
| 1120 | ||||
| d1nc8a_ | 29 | g.40.1.1 (A:) HIV nucleocapsid {Human immunodefici | 0.002 |
| >d1nc8a_ g.40.1.1 (A:) HIV nucleocapsid {Human immunodeficiency virus type 2 [TaxId: 11709]} Length = 29 | Back information, alignment and structure |
|---|
class: Small proteins fold: Retrovirus zinc finger-like domains superfamily: Retrovirus zinc finger-like domains family: Retrovirus zinc finger-like domains domain: HIV nucleocapsid species: Human immunodeficiency virus type 2 [TaxId: 11709]
Score = 34.4 bits (79), Expect = 0.002
Identities = 10/24 (41%), Positives = 15/24 (62%), Gaps = 2/24 (8%)
Query: 263 KCFLCGQPGHLAANCEGKAKRKAG 286
+C+ CG+ GH A C +A R+ G
Sbjct: 8 RCWNCGKEGHSARQC--RAPRRQG 29
|
Homologous Domains Detected by HHsearch 
Original result of HHsearch against SCOP70(version1.75) database
ID ![]() | Alignment Graph ![]() | Length ![]() |
Definition ![]() |
Probability ![]() |
| Query | 1120 | |||
| d1ul1x2 | 216 | Flap endonuclease-1 (Fen-1 nuclease) {Human (Homo | 97.1 | |
| d1rxwa2 | 217 | Flap endonuclease-1 (Fen-1 nuclease) {Archaeon Arc | 96.01 | |
| d1b43a2 | 219 | Flap endonuclease-1 (Fen-1 nuclease) {Archaeon Pyr | 95.35 | |
| d1a77a2 | 207 | Flap endonuclease-1 (Fen-1 nuclease) {Archaeon Met | 92.91 | |
| d1nc8a_ | 29 | HIV nucleocapsid {Human immunodeficiency virus typ | 89.85 |
| >d1ul1x2 c.120.1.2 (X:2-217) Flap endonuclease-1 (Fen-1 nuclease) {Human (Homo sapiens) [TaxId: 9606]} | Back information, alignment and structure |
|---|
class: Alpha and beta proteins (a/b) fold: PIN domain-like superfamily: PIN domain-like family: 5' to 3' exonuclease catalytic domain domain: Flap endonuclease-1 (Fen-1 nuclease) species: Human (Homo sapiens) [TaxId: 9606]
Probab=97.10 E-value=0.00027 Score=71.39 Aligned_cols=179 Identities=14% Similarity=0.133 Sum_probs=88.6
Q ss_pred cchhHHHHHHhhCCcccccccccCccccCCccccCCCCCCCCCCCccCeEEEeccccccccccCCC---CCCCCCH---H
Q 001223 2 GVPAFYRWLAEKYPLVVADVIEEEPVVIDGVKIPVDTSKPNPNGLEYDNLYLDMNGIIHPCFHPED---RPAPTTF---D 75 (1120)
Q Consensus 2 GVP~FfRWL~~rYP~ii~~viE~~~~~~dG~~iP~d~s~pNPNg~efDNLYLDMNGIIH~C~Hpe~---~~~p~te---~ 75 (1120)
||.++..+|.+.-|.+++.+- -.+..| --|=||.+..||.+..... .+-.... .
T Consensus 1 GIkgL~~~l~~~~~~~i~~~~---l~~l~g-----------------k~vaIDas~~l~~~l~~~~~~~~~l~~~~~~~~ 60 (216)
T d1ul1x2 1 GIQGLAKLIADVAPSAIREND---IKSYFG-----------------RKVAIDASMSIYQFLIAVRQGGDVLQNEEGETT 60 (216)
T ss_dssp CCTTHHHHHHTTCTTSCCEEC---GGGGTT-----------------CCEEEEHHHHHHHHHSCC-------------CC
T ss_pred CcchHHHHHHHhCcCceEEec---HHHcCC-----------------CEEEEEeHHHHHHHHHHhccccchhhccCCCCc
Confidence 899999999999988776431 011122 2577899999987432211 0000000 1
Q ss_pred HHHHHHHHHHHHHHhhccccceEEEEecCCCchhhhHHHHhhhccchhHHHHHHHHHHHHHHHHHHhCCCCCCCCCCCCC
Q 001223 76 EVFQCMFDYIDRLFVMVRPRKLLYMAIDGVAPRAKMNQQRSRRFRASKDAADSAAEEERLRQEFEREGRKLPPKSDSQVF 155 (1120)
Q Consensus 76 Emf~~If~yIDrL~~~VRPrKlLyiAIDGVAPRAKMNQQRsRRFrsakea~~~~~~~~~l~~e~~~eg~~lp~~~~~~~f 155 (1120)
.-+..+|..+..|.. =.=..++.+||-+|-.|......||-+.....+... ..+ ..+. ......+
T Consensus 61 ~~l~~~~~~~~~l~~---~~I~pifVFDG~~~~~K~~~~~~R~~~r~~~~~~~~----~~~----~~~~----~~~~~~~ 125 (216)
T d1ul1x2 61 SHLMGMFYRTIRMME---NGIKPVYVFDGKPPQLKSGELAKRSERRAEAEKQLQ----QAQ----AAGA----EQEVEKF 125 (216)
T ss_dssp HHHHHHHHHHHHHHH---TTCCEEEEECCSCCSCCCCCCCCC--------------------------------------
T ss_pred HHHHHHHHHHHHHHH---cCCeEEEEEcCCCCccccchhhhhhhhHHHhhhhHH----hhh----hcch----HHHHHHh
Confidence 122233333333322 222347889999998887766555543221110000 000 0000 0011122
Q ss_pred ccCcccCCcHHHHHHHHHHHHHHHHHhcCCCCCcccEEEEcCCCCCCChhhhHHHHHHHhhcCCCCCCCCcEEEEecChh
Q 001223 156 DSNVITPGTEFMAVLSIALQYYIHLRLNNDPGWEKIKVILSDANVPGEGEHKIMSYVRLQRNLPGYDPNTRHCLYGLDAD 235 (1120)
Q Consensus 156 DSN~ITPGT~FM~~Ls~~L~~yI~~kl~~Dp~W~~lkVIlSds~VPGEGEHKImdfIR~qr~~p~ydPNt~HcIYGlDAD 235 (1120)
...+++.-+.....+.+.|+.+ ++.+|. .|||+|.-+-.+.+. +.-..|++-|+|
T Consensus 126 ~~~~~~i~~~~~~~~~~ll~~~------------gv~~i~----Ap~EAdaq~A~L~~~---------g~vd~v~S~DsD 180 (216)
T d1ul1x2 126 TKRLVKVTKQHNDECKHLLSLM------------GIPYLD----APSEAEASCAALVKA---------GKVYAAATEDMD 180 (216)
T ss_dssp ---CCCCCCSCHHHHHHHHHHH------------TCCEEE----CSSCHHHHHHHHHHH---------TSSSEEECSCTH
T ss_pred hccCeeccHHHHHHHHHHHHhc------------CeeeEe----ccchHHHHHHHHHhc---------CceEEEEccccc
Confidence 3333433333444444434322 466776 399999887776653 344678999999
Q ss_pred HHHHH
Q 001223 236 LIMLA 240 (1120)
Q Consensus 236 LImL~ 240 (1120)
+++.|
T Consensus 181 ~l~fG 185 (216)
T d1ul1x2 181 CLTFG 185 (216)
T ss_dssp HHHTT
T ss_pred eeccC
Confidence 99998
|
| >d1rxwa2 c.120.1.2 (A:3-219) Flap endonuclease-1 (Fen-1 nuclease) {Archaeon Archaeoglobus fulgidus [TaxId: 2234]} | Back information, alignment and structure |
|---|
| >d1b43a2 c.120.1.2 (A:1-219) Flap endonuclease-1 (Fen-1 nuclease) {Archaeon Pyrococcus furiosus [TaxId: 2261]} | Back information, alignment and structure |
|---|
| >d1a77a2 c.120.1.2 (A:2-208) Flap endonuclease-1 (Fen-1 nuclease) {Archaeon Methanococcus jannaschii [TaxId: 2190]} | Back information, alignment and structure |
|---|
| >d1nc8a_ g.40.1.1 (A:) HIV nucleocapsid {Human immunodeficiency virus type 2 [TaxId: 11709]} | Back information, alignment and structure |
|---|