Query         004647
Match_columns 740
No_of_seqs    232 out of 534
Neff          5.6 
Searched_HMMs 46136
Date          Fri Mar 29 02:49:06 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/004647.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/004647hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2044 5'-3' exonuclease HKE1 100.0  5E-248  1E-252 2045.9  55.4  712    1-721     1-771 (931)
  2 COG5049 XRN1 5'-3' exonuclease 100.0  5E-209  1E-213 1709.9  47.5  682    1-725     1-817 (953)
  3 KOG2045 5'-3' exonuclease XRN1 100.0  4E-181  8E-186 1510.0  39.9  561    1-624     1-655 (1493)
  4 PF03159 XRN_N:  XRN 5'-3' exon 100.0   5E-87 1.1E-91  688.5  19.9  237    1-256     1-237 (237)
  5 cd00128 XPG Xeroderma pigmento  98.7 2.3E-07   5E-12  100.6  15.7  237    1-369     1-244 (316)
  6 PTZ00217 flap endonuclease-1;   98.5 6.2E-06 1.3E-10   92.3  18.0  231    1-357     1-246 (393)
  7 TIGR03674 fen_arch flap struct  97.6  0.0014   3E-08   72.3  15.5   66   51-120    23-96  (338)
  8 PRK03980 flap endonuclease-1;   97.6  0.0017 3.7E-08   70.3  15.5   25  334-358   177-201 (292)
  9 smart00475 53EXOc 5'-3' exonuc  95.7    0.61 1.3E-05   49.8  17.8  133   52-241     4-139 (259)
 10 cd00008 53EXOc 5'-3' exonuclea  95.4    0.67 1.4E-05   48.9  16.6   56   52-108     4-62  (240)
 11 PF00752 XPG_N:  XPG N-terminal  95.0    0.03 6.5E-07   50.7   4.3   95    1-122     1-98  (101)
 12 smart00485 XPGN Xeroderma pigm  94.8   0.039 8.4E-07   49.9   4.4   93    1-122     1-96  (99)
 13 PF00867 XPG_I:  XPG I-region;   93.9    0.18 3.9E-06   45.4   6.8   90  192-346     5-94  (94)
 14 PRK14976 5'-3' exonuclease; Pr  93.2     3.4 7.5E-05   44.7  16.3   57   51-108     5-67  (281)
 15 PF00098 zf-CCHC:  Zinc knuckle  89.4    0.21 4.5E-06   32.0   1.3   16  264-279     2-17  (18)
 16 KOG2518 5'-3' exonuclease [Rep  89.1     3.3 7.1E-05   48.1  11.4   91    1-122     1-96  (556)
 17 KOG2519 5'-3' exonuclease [Rep  88.1      16 0.00036   41.9  15.9   34  334-369   217-250 (449)
 18 COG0258 Exo 5'-3' exonuclease   87.7      12 0.00025   41.0  14.3   62   51-116    13-82  (310)
 19 PRK05755 DNA polymerase I; Pro  83.9      23 0.00049   44.4  15.9   55   52-107     5-62  (880)
 20 TIGR00593 pola DNA polymerase   81.2      52  0.0011   41.4  17.4   56   52-108     2-61  (887)
 21 TIGR00600 rad2 DNA excision re  75.0     4.8  0.0001   50.7   6.0   39  191-242   785-823 (1034)
 22 TIGR00600 rad2 DNA excision re  69.2      20 0.00042   45.5   9.4   65   51-121    26-94  (1034)
 23 PF13696 zf-CCHC_2:  Zinc knuck  67.1     2.3   5E-05   31.4   0.5   20  261-280     7-26  (32)
 24 cd00080 HhH2_motif Helix-hairp  65.0     7.5 0.00016   33.7   3.4   22  334-355     8-31  (75)
 25 PF13917 zf-CCHC_3:  Zinc knuck  44.5      12 0.00026   29.4   1.1   19  262-280     4-22  (42)
 26 smart00484 XPGI Xeroderma pigm  44.1      13 0.00028   32.3   1.5   34  201-243    10-43  (73)
 27 smart00279 HhH2 Helix-hairpin-  38.7      26 0.00056   26.4   2.2   19  335-354     3-21  (36)
 28 PF02739 5_3_exonuc_N:  5'-3' e  38.6      29 0.00063   34.7   3.2   56   52-108     4-63  (169)
 29 smart00343 ZnF_C2HC zinc finge  36.6      16 0.00035   25.0   0.7   16  264-279     1-16  (26)
 30 PHA02567 rnh RnaseH; Provision  26.5      69  0.0015   35.3   3.8   57   50-106    15-73  (304)
 31 PF14392 zf-CCHC_4:  Zinc knuck  26.1      30 0.00066   27.5   0.8   20  260-279    29-48  (49)
 32 PF12513 SUV3_C:  Mitochondrial  24.7      40 0.00087   26.9   1.2   15    5-19     12-26  (49)
 33 COG5350 Predicted protein tyro  22.1      99  0.0022   31.1   3.6   42  190-234    56-105 (172)
 34 PF14787 zf-CCHC_5:  GAG-polypr  22.0      44 0.00096   25.4   0.9   20  263-282     3-22  (36)
 35 PF15288 zf-CCHC_6:  Zinc knuck  21.6      43 0.00094   26.1   0.8   13  263-275     2-14  (40)
 36 COG5082 AIR1 Arginine methyltr  20.7      42  0.0009   34.6   0.7   45  225-279    68-114 (190)
 37 PRK09482 flap endonuclease-lik  20.5 1.5E+02  0.0033   31.9   4.9   55   51-108     5-59  (256)

No 1  
>KOG2044 consensus 5'-3' exonuclease HKE1/RAT1 [Replication, recombination and repair; RNA processing and modification]
Probab=100.00  E-value=4.7e-248  Score=2045.89  Aligned_cols=712  Identities=58%  Similarity=0.993  Sum_probs=652.7

Q ss_pred             CccchHHHHHHhhCCCcccccccCCCccCCCCcccCCCCCCCCCCCcccCeEEeeccccccccccCCCCCCCCCHHHHHH
Q 004647            1 MGVPAFYRWLADRYPLSIVDVVEEDPQVDGEGVARPVDVSKPNPNGMEFDNLYLDMNGIIHPCFHPDGKPAPTSYDDVFK   80 (740)
Q Consensus         1 MGVP~ffrWL~~rYP~i~~~~~e~~~~~~~~g~~~p~d~s~pn~n~~e~DnLYlDmNgIIH~c~h~~~~~~p~te~e~~~   80 (740)
                      ||||+|||||++|||++|++|+|++|.+. ||+.+|+|.|.|||||+||||||||||||||||+||+++|+|+||+|||.
T Consensus         1 MGVPaffRWLs~kyp~~I~~viEe~p~~~-~g~~ip~D~s~pNPNg~E~DNLYLDMNGIIHPC~HPEdkPaP~tedEm~~   79 (931)
T KOG2044|consen    1 MGVPAFFRWLSRKYPKTISPVIEEEPVDV-DGVKIPVDYSKPNPNGVEFDNLYLDMNGIIHPCTHPEDKPAPETEDEMFV   79 (931)
T ss_pred             CCchHHHHHHHHhcchhhhhhhhcCcccC-CCcccccccCCCCCCcccccceeeecCcccccCCCCCCCCCCccHHHHHH
Confidence            99999999999999999999999999887 88999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHhhcccceeEEEeecCCCchhhhHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHhCcccCCCCCCCCCCCcc
Q 004647           81 SIFDYIDHIFLLVRPRKLLYLAIDGVAPRAKMNQQRTRRFRAAKDAAEAEAEEERLRKEFEEAGKLLSAKEKPETCDSNV  160 (740)
Q Consensus        81 ~If~yid~lv~~vrPrkllyiAiDGVAPrAKmnQQRsRRfrsa~e~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~fDsN~  160 (740)
                      .||+||||||.+|||||||||||||||||||||||||||||||||++++++++++++++++++|..++++.++++|||||
T Consensus        80 avFeyiDrlf~mvRPRkLLymAIDGVAPRAKMNQQRsRRFRaaKeaae~~~e~e~~ree~~~~G~~lpp~~~~e~fDSNc  159 (931)
T KOG2044|consen   80 AVFEYIDRLFSMVRPRKLLYMAIDGVAPRAKMNQQRSRRFRAAKEAAEKEAEIERLREEFEAEGKFLPPKVKKETFDSNC  159 (931)
T ss_pred             HHHHHHHHHHHhccchheeEEeecccCchhhhhHHHHHHHhhhhHHHHHHHHHHHHHHHHHhcCCcCCchhhccccccCc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             cccccHHHHHHHHHHHHHHHHHHhcCCCCcccEEEEcCCCCCCChhhHHHHHHHHhhcCCCCCCCCcEEEEecChhHHHH
Q 004647          161 ITPGTQFMAVLSAALQYFIQARLNQIPGWQFTKVILSDANVPGEGEHKIMSYIRLQRNLPGFDPNTRHCLYGLDADLIML  240 (740)
Q Consensus       161 ITPGT~FM~~L~~~L~~yi~~kl~~dp~w~~l~VI~Sds~vPGEGEHKIm~fIR~qr~~p~ydpn~~H~IyG~DADLImL  240 (740)
                      |||||+||++|+.+|+|||+.||++||+|++|+||+|||+||||||||||+|||+||++|+|||||+|||||+|||||||
T Consensus       160 ITPGTpFM~~La~aLrYyI~~rLn~DPgWkNikvIlSDAnVPGEGEHKIM~yIR~QR~~P~~dPNT~HclyGlDADLImL  239 (931)
T KOG2044|consen  160 ITPGTPFMDRLAKALRYYIHDRLNSDPGWKNIKVILSDANVPGEGEHKIMSYIRSQRAQPGYDPNTHHCLYGLDADLIML  239 (931)
T ss_pred             cCCCChHHHHHHHHHHHHHHHhhcCCccccceEEEEecCCCCCcchhHHHHHHHHccCCCCCCCCceeeeecCCccceee
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HhhcCCceEEEeeccccCCCCCcccccccccCcccccccCCCCCCCCCCCCCCCCcccccceEEEehHHHHHHHHhhhCC
Q 004647          241 SLATHEIHFSILREVITLPGQQEKCFVCGQVGHLAAECHGKPGDNPADWNGVDDTPIHKKKYQFLNIWVLREYLQYELDI  320 (740)
Q Consensus       241 ~Lathe~~f~ILRE~v~~~~~~~~c~~c~~~~h~~~~c~~~~~~~~~~~~~~~~~~~~~~~f~~l~i~~LREyL~~e~~~  320 (740)
                      |||||||||+||||+++ |+++++|++|||+||.+.+|.|+......  +.....+..+++|+||+|++|||||+.||.+
T Consensus       240 gLATHE~hF~IlRE~~~-P~~~~~C~~cgq~gh~~~dc~g~~~~~~~--~~~~~~~~~ek~fifl~I~vLREYLe~El~~  316 (931)
T KOG2044|consen  240 GLATHEPHFSILREEFF-PNKPRRCFLCGQTGHEAKDCEGKPRLGET--NELADVPGVEKPFIFLNISVLREYLERELRM  316 (931)
T ss_pred             eccccCCceEEeeeeec-CCCcccchhhcccCCcHhhcCCcCCcccc--cccccCcccccceEEEEHHHHHHHHHHHhcC
Confidence            99999999999999977 99999999999999999999998542111  1122234678999999999999999999999


Q ss_pred             CCCCCCCchhhhHHHHHHHhhhhcCCCCCCCCcccccchhHHHHHHHHHHHhhhcCCccccCCeechHHHHHHHHHHHHh
Q 004647          321 PNPPFPINFERIVDDFVFLCFFVGNDFLPHMPTLEIREGAINLLMHVYRREFTAMGGYLTDAGEVLLDRVEKFIQSVAVY  400 (740)
Q Consensus       321 ~~~~~~~d~eriIDDfVfLcf~vGNDFLPhlPsl~I~egaid~Li~~Yk~~l~~~~gYLt~~g~inl~~l~~fl~~l~~~  400 (740)
                      |++||+||+||+||||||||||||||||||||||+|||||||+|+++||+.+++|+||||++|.+||.||+.||+.||..
T Consensus       317 p~lPf~fd~ER~iDDwVF~CFFvGNDFLPHlPsLeIRegAId~L~~iyk~~~~~~kgYLT~~g~vNL~rve~~~~avg~~  396 (931)
T KOG2044|consen  317 PNLPFTFDLERAIDDWVFLCFFVGNDFLPHLPSLEIREGAIDRLMEIYKKSFPKMKGYLTDSGKVNLDRVEMFMQAVGSV  396 (931)
T ss_pred             CCCCccccHHhhhcceEEEEeeecCccCCCCCchhhhhcHHHHHHHHHHHHHHhhcceeccCCcccHHHHHHHHHHHhhh
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhhccc-----------------------------------------hhhhh-----
Q 004647          401 EDQIFQKRTRIQQAYEYNEAMKLNARRES-----------------------------------------SEELL-----  434 (740)
Q Consensus       401 E~~if~~r~~~~~~~~~~~~~~~~~r~~~-----------------------------------------~~~~~-----  434 (740)
                      |++||++|.+.+++++.....+.+.+++.                                         ..+.+     
T Consensus       397 Ed~IFkkR~r~~e~frrrk~~rk~~~~~~~~sg~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~  476 (931)
T KOG2044|consen  397 EDDIFKKRQRREERFRRRKAARKRQDRNAQDSGTNFSLAGSRELEASEPAQKALKVSLEKNESAANVERDNTEDLKTKLK  476 (931)
T ss_pred             cchHHHHhHhHHHHHHHhhhhhhhhhhhcccccccccccccccccccchhhhhhhhccccccchhhhcccchhhcccccc
Confidence            99999999766655543221111100000                                         00000     


Q ss_pred             ----------cCccccccccccCCcchHHhHHHHhhCCCChhhHHHHHHHHHHHHHHHHHHHhhhhccCccccccccCCC
Q 004647          435 ----------QAPVAVADTVKLGEPGYKERYYADKFEISNPEEIDKVKKDVVLKYVEGLCWVCRYYYQDVCSWQWFYPYH  504 (740)
Q Consensus       435 ----------~~~~~~~d~v~l~~~~~k~~YY~~Kf~~~~~~~~~~~~~~v~~~YleGL~WVl~YYy~G~~SW~WyYPyH  504 (740)
                                .+..+..|+|+|+|+|||+|||++||++++++  +++|++||.+|+|||||||+||||||+||+||||||
T Consensus       477 ~~~~~k~~~~~~~~~~~D~VkL~e~G~keRYY~~KF~v~~~e--eq~R~~vv~~YveGLcWVl~YYyqGc~SW~WyYPYH  554 (931)
T KOG2044|consen  477 HGQRRKSEDSESEEENTDKVKLYEPGWKERYYEEKFDVTPDE--EQIRKDVVLKYVEGLCWVLRYYYQGCASWNWYYPYH  554 (931)
T ss_pred             ccccccCccccCCCCCCcceeecCCchhhhhhhhhcCCCCHH--HHHHHHHHHHHhcchhhhhhhhhccccccccccccc
Confidence                      11235678999999999999999999998775  789999999999999999999999999999999999


Q ss_pred             CCccccchhcccCccccccCCCCCChHHHhhhcCCCCCCCCCchhhhhccCCCCCcccccCCCcccccCCCCccceeeee
Q 004647          505 YAPFASDLKDLSDLEITFFLGEPFRPFDQLMGTLPAASSSALPEKYRNLMTDPSSPIYKFYPPDFQIDMNGKRFAWQGVV  584 (740)
Q Consensus       505 YAPfasDl~~l~~~~i~F~~g~Pf~P~eQLm~VLP~~S~~~LP~~~~~Lm~~~~SpI~dfYP~~f~iD~nGk~~~WqgV~  584 (740)
                      ||||||||.++++++|+|++|+||+||||||+||||+|+++||+.||.||+||+|||+||||+||+||||||+|+|||||
T Consensus       555 YAPfAsDf~~l~~ldikFe~g~PFkP~eQLmgVlPAAS~~~LPe~~r~LMsdpdSpIiDFYPedF~iDmNGKk~aWQGIa  634 (931)
T KOG2044|consen  555 YAPFASDFKGLSDLDIKFELGKPFKPLEQLMGVLPAASSHALPEEWRKLMSDPDSPIIDFYPEDFEIDMNGKKYAWQGIA  634 (931)
T ss_pred             cchhhhhhhcccccccccccCCCCCcHHHHhhhcchhhcCCCcHHHHhhhcCCCCcccccccccceeeccCceeeccccc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             cccCCChHHHHHHHhhhccCCCHHHHhhccCCCcEEEEcCCCccHHHHHHHhhhcCCCCCCCCcceeccCCcCCCCccce
Q 004647          585 KLPFIDEKLLLRQTKKLEVFLTEEELFRNSVMLDLLYVHPQHPLYQQITLYCQLYHQLPPQDRFAWEIDVNASGGMNGYI  664 (740)
Q Consensus       585 lLPFIDe~~Ll~a~~~~~~~Lt~eE~~RN~~g~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~g~~  664 (740)
                      |||||||+|||+|+++++++||+||++||..|.|+||++++||++..+.++|++..+. .  ....++-...+.|++|.+
T Consensus       635 lLPFiDe~rLl~a~~~~y~~Lt~EE~~RN~rg~d~Lfi~~~hp~~e~i~~lysk~k~~-~--~~~v~~~~~~~p~~~~~~  711 (931)
T KOG2044|consen  635 LLPFIDERRLLSAVAKVYPTLTDEEKRRNSRGPDLLFISDKHPLFEFILQLYSKKKKS-N--EKNVKLAHGVDPGLNGAI  711 (931)
T ss_pred             cccccchhhHHHHHHhhccccCHHHHhccccCCceEEecCCCchHHHHHHHHHhhccC-c--ccccccccccCcccceee
Confidence            9999999999999999999999999999999999999999999999999999976541 1  111245555667899999


Q ss_pred             ecccCCCCCcccCCCCCCCCCCCCCcEEEEEecCCCCCCC---CCCCCCCCCCCcccccc
Q 004647          665 WLCERNGLRSIIPSPVKGLPDIERNQAINVTYLNPQKHRH---IPEPPKGATIPAKVCEQ  721 (740)
Q Consensus       665 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~y~~p~~~~~---~~~~l~g~~~p~~~l~~  721 (740)
                      ..++.........+|..++.+...+..+++.+..|.++..   .++.|+|++.|.++|+.
T Consensus       712 ~~~~~~~~~~i~~~p~~~~~~~~~~~~v~~~~~~~~~~ed~~~~a~~l~G~~~p~~~lkP  771 (931)
T KOG2044|consen  712 SKDPEGLESGISKSPPGGLSDYNTNTGVCLKYVDPEYPEDYIFPAIRLDGAKEPEKVLKP  771 (931)
T ss_pred             ccCccccccccccCChhhcccCCccceeeecccCccccccccchhhhcCCCCCCccccCc
Confidence            8877654445678888899999999999999999987644   38999999999999983


No 2  
>COG5049 XRN1 5'-3' exonuclease [DNA replication, recombination, and repair / Cell division and chromosome partitioning / Translation]
Probab=100.00  E-value=5.4e-209  Score=1709.93  Aligned_cols=682  Identities=44%  Similarity=0.781  Sum_probs=588.8

Q ss_pred             CccchHHHHHHhhCCCcccccccCCCccCCCCcccCCCCCCCCCCCcccCeEEeeccccccccccCCCCCCCCCHHHHHH
Q 004647            1 MGVPAFYRWLADRYPLSIVDVVEEDPQVDGEGVARPVDVSKPNPNGMEFDNLYLDMNGIIHPCFHPDGKPAPTSYDDVFK   80 (740)
Q Consensus         1 MGVP~ffrWL~~rYP~i~~~~~e~~~~~~~~g~~~p~d~s~pn~n~~e~DnLYlDmNgIIH~c~h~~~~~~p~te~e~~~   80 (740)
                      ||||+|||||++|||+|++.|.|+                 +|||   +||||||||||+|+|+||+++++|.||+||+.
T Consensus         1 MGVPsfFRwlS~r~p~ii~~I~e~-----------------~~P~---~DNLYLDMNgIlH~CtHp~d~~~petEeEm~~   60 (953)
T COG5049           1 MGVPSFFRWLSERYPKIIQLIEEK-----------------QIPE---FDNLYLDMNGILHNCTHPNDGSPPETEEEMYK   60 (953)
T ss_pred             CCchHHHHHHHhhhhHhhhhhhcc-----------------CCCC---cceeEEecccccccCCCCCCCCCCCCHHHHHH
Confidence            999999999999999999887653                 2344   69999999999999999999999999999999


Q ss_pred             HHHHHHHHHHhhcccceeEEEeecCCCchhhhHHHHHhhhhhHHHHHHHHHHHHHHHHHHHH----hCcccCC-CCCCCC
Q 004647           81 SIFDYIDHIFLLVRPRKLLYLAIDGVAPRAKMNQQRTRRFRAAKDAAEAEAEEERLRKEFEE----AGKLLSA-KEKPET  155 (740)
Q Consensus        81 ~If~yid~lv~~vrPrkllyiAiDGVAPrAKmnQQRsRRfrsa~e~~~~~~~~~~~~~~~~~----eg~~~~~-~~~~~~  155 (740)
                      .||+|||||+.++||||+||||||||||||||||||+||||+|++|..++.+++.-.+++..    .|..+.. ..+.+.
T Consensus        61 aVf~Yidhil~~irPrKllymAVDGvAPRAKMNQQRaRRFRsAkda~~A~~Kae~~~e~~~e~~~e~g~~id~~~~~kk~  140 (953)
T COG5049          61 AVFEYIDHILLKIRPRKLLYMAVDGVAPRAKMNQQRARRFRSAKDASAAALKAEPNGEEIPEEKDEIGNEIDTIDVEKKK  140 (953)
T ss_pred             HHHHHHHHHHHhcCcceEEEEEecccCchhhhhHHHHHhhhhhhhhHHHHhhccccccccchhccccCCccchhhhhhcc
Confidence            99999999999999999999999999999999999999999999976554433322222221    2333322 235678


Q ss_pred             CCCcccccccHHHHHHHHHHHHHHHHHHhcCCCCcccEEEEcCCCCCCChhhHHHHHHHHhhcCCCCCCCCcEEEEecCh
Q 004647          156 CDSNVITPGTQFMAVLSAALQYFIQARLNQIPGWQFTKVILSDANVPGEGEHKIMSYIRLQRNLPGFDPNTRHCLYGLDA  235 (740)
Q Consensus       156 fDsN~ITPGT~FM~~L~~~L~~yi~~kl~~dp~w~~l~VI~Sds~vPGEGEHKIm~fIR~qr~~p~ydpn~~H~IyG~DA  235 (740)
                      ||||||||||+||++|+..|+|||+.||++||.|++++||+||+.||||||||||+|||+||++|+|+|||+|||||+||
T Consensus       141 fDSNcITPGTpFMerLak~L~Y~i~~KlssDp~Wrnl~iI~S~~~vPGEGEHKIM~FIRsqkaqp~ynpNT~HciYGLDA  220 (953)
T COG5049         141 FDSNCITPGTPFMERLAKVLRYYIHCKLSSDPEWRNLRIIFSGHLVPGEGEHKIMNFIRSQKAQPSYNPNTRHCIYGLDA  220 (953)
T ss_pred             ccccCCCCCChHHHHHHHHHHHHHHhhhcCCccceeEEEEEecCcCCCccHHHHHHHHHhcccCCCcCCCceeEEeccCc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hHHHHHhhcCCceEEEeeccccCCC---CCcccccccccCcccccccCCCCCCCCCCCCCCCCcccccceEEEehHHHHH
Q 004647          236 DLIMLSLATHEIHFSILREVITLPG---QQEKCFVCGQVGHLAAECHGKPGDNPADWNGVDDTPIHKKKYQFLNIWVLRE  312 (740)
Q Consensus       236 DLImL~Lathe~~f~ILRE~v~~~~---~~~~c~~c~~~~h~~~~c~~~~~~~~~~~~~~~~~~~~~~~f~~l~i~~LRE  312 (740)
                      |||||||+||+|||.||||+|+++.   .+.+|..||.+||....|+..                ..++|.||||++|||
T Consensus       221 DLImLGLstH~PHF~iLREdVff~~~~~~k~k~~~~g~t~~~~e~~k~~----------------~~q~F~~LhiSlLRE  284 (953)
T COG5049         221 DLIMLGLSTHEPHFLILREDVFFGSKSRRKRKCTKCGRTGHSDEECKVL----------------THQPFYLLHISLLRE  284 (953)
T ss_pred             cceeeecccCCCeeEEeechhccCcccccccccccccccccchhhhccc----------------ccCceEEEEHHHHHH
Confidence            9999999999999999999999985   346899999999998888542                356899999999999


Q ss_pred             HHHhhhCCCCCCCCCchhhhHHHHHHHhhhhcCCCCCCCCcccccchhHHHHHHHHHHHhhhcCCccccCCeechHHHHH
Q 004647          313 YLQYELDIPNPPFPINFERIVDDFVFLCFFVGNDFLPHMPTLEIREGAINLLMHVYRREFTAMGGYLTDAGEVLLDRVEK  392 (740)
Q Consensus       313 yL~~e~~~~~~~~~~d~eriIDDfVfLcf~vGNDFLPhlPsl~I~egaid~Li~~Yk~~l~~~~gYLt~~g~inl~~l~~  392 (740)
                      ||+.||..++.+|+||+|||||||||||||||||||||||+|+|++|||++|+++||+.++.++||||++|.||+.||+.
T Consensus       285 YLe~Ef~~~~~~ftfdlERilDDwIf~~FfvGNDFLPhLP~Ldir~gai~~l~ei~k~~lp~~~gYit~~G~iNl~rle~  364 (953)
T COG5049         285 YLEREFREPTLPFTFDLERILDDWIFLCFFVGNDFLPHLPCLDIREGAIETLTEIWKKSLPHMKGYITCDGVINLARLEV  364 (953)
T ss_pred             HHHHHhhccCCCccccHHHhhhhheeeeeeeccccCCCCCccccccchHHHHHHHHHHHhhhcCceeecCceecHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHhHHHHHHHHHHHHHHH----H---HHHHHHH-----------h-----------------hhc-----cc---
Q 004647          393 FIQSVAVYEDQIFQKRTRIQQAY----E---YNEAMKL-----------N-----------------ARR-----ES---  429 (740)
Q Consensus       393 fl~~l~~~E~~if~~r~~~~~~~----~---~~~~~~~-----------~-----------------~r~-----~~---  429 (740)
                      ++..|+.+|+.||+++...+.+.    +   ..++++.           +                 ++.     +.   
T Consensus       365 ~L~~L~~~E~~iFk~~~~qe~r~ne~~~~~~~~k~~~e~~~~~~~vv~eq~~~~gS~k~t~~d~~~~kk~~~l~~e~~id  444 (953)
T COG5049         365 ILAILGSFEDDIFKKDHIQEERKNESLERFSLRKERKEGLKGMPRVVYEQKKLIGSIKPTLMDQLQEKKSPDLPDEEFID  444 (953)
T ss_pred             HHHHHhhhhcchhhhhhhHhhhhccchhhHHHHhhhhhhhcccchhhhhhhhhcccccchhhhhhhhhccccCCCccccc
Confidence            99999999999998765332110    0   0000000           0                 000     00   


Q ss_pred             ----------------------------hh-------hhh----------------c---------------------Cc
Q 004647          430 ----------------------------SE-------ELL----------------Q---------------------AP  437 (740)
Q Consensus       430 ----------------------------~~-------~~~----------------~---------------------~~  437 (740)
                                                  +.       .++                +                     ..
T Consensus       445 ~~a~~k~~d~kn~el~~~~~~ndl~ls~ska~ks~~n~~le~~iasds~~ed~ee~ese~d~i~~i~dk~vn~~v~ee~e  524 (953)
T COG5049         445 TLALPKDLDMKNHELFLKRFANDLGLSISKAIKSKGNYSLEMDIASDSPDEDEEEFESEVDSIRKIPDKYVNIIVEEEEE  524 (953)
T ss_pred             hhhchhhhhhhhhHHHHHHHHhhhhhhHHhhhhccCCchhhhhhhccccccchhhhhhccchhhhhhhhhhccccccchh
Confidence                                        00       000                0                     00


Q ss_pred             cccccccccCCcchHHhHHHHhhCCCChhhHHHHHHHHHHHHHHHHHHHhhhhccCccccccccCCCCCccccchhcccC
Q 004647          438 VAVADTVKLGEPGYKERYYADKFEISNPEEIDKVKKDVVLKYVEGLCWVCRYYYQDVCSWQWFYPYHYAPFASDLKDLSD  517 (740)
Q Consensus       438 ~~~~d~v~l~~~~~k~~YY~~Kf~~~~~~~~~~~~~~v~~~YleGL~WVl~YYy~G~~SW~WyYPyHYAPfasDl~~l~~  517 (740)
                      .+..++|++-++||++|||.+||+++++ +.+++ ++||++|||||||||.|||+|||||+|||||||||+|+||.++.+
T Consensus       525 ~~~~~Tv~l~~~g~~erYY~~K~~~t~~-~~E~i-rdm~k~YVeGL~WVL~YYY~GC~SW~WyYpyHyAP~aaD~~k~~~  602 (953)
T COG5049         525 NETEKTVNLRFPGWKERYYTSKLHFTTD-SEEKI-RDMAKEYVEGLQWVLSYYYRGCPSWDWYYPYHYAPLAADLSKLSD  602 (953)
T ss_pred             cccccchhhcccchhhhhhhhhcCCCcC-CHHHH-HHHHHHHhhhhhhhhhhhhcCCCCcccccccccchhhhhhhhccc
Confidence            1234677888999999999999999775 34555 499999999999999999999999999999999999999999999


Q ss_pred             ccccccCCCCCChHHHhhhcCCCCCCCCCchhhhhccCCCCCcccccCCCcccccCCCCccceeeeecccCCChHHHHHH
Q 004647          518 LEITFFLGEPFRPFDQLMGTLPAASSSALPEKYRNLMTDPSSPIYKFYPPDFQIDMNGKRFAWQGVVKLPFIDEKLLLRQ  597 (740)
Q Consensus       518 ~~i~F~~g~Pf~P~eQLm~VLP~~S~~~LP~~~~~Lm~~~~SpI~dfYP~~f~iD~nGk~~~WqgV~lLPFIDe~~Ll~a  597 (740)
                      .+|+|++|+||+||||||+||||+|+++||+.||.||+|++|||+||||++|.+|||||+++|||||||||||++||++|
T Consensus       603 ~dIkFe~g~PF~P~EQLm~VLPa~Sk~~vP~~fr~LM~d~~S~IiDFYPe~f~lD~NGK~~~Wq~VvLlpFiDe~RLl~A  682 (953)
T COG5049         603 NDIKFELGTPFRPFEQLMAVLPARSKNLVPEGFRPLMDDEKSPIIDFYPEEFKLDMNGKTASWQAVVLLPFIDERRLLSA  682 (953)
T ss_pred             ceeeecCCCCCCcHHHHHhhcchhhcCcCchhhhhhhcCCCCcccccchhhcccccCCceeeeeeeEEeeecchhHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HhhhccCCCHHHHhhccCCCcEEEEcCCCcc-HHHHHHHhhhcCCCCCCCCcceeccCCcCC-CCccceecccC-CCCCc
Q 004647          598 TKKLEVFLTEEELFRNSVMLDLLYVHPQHPL-YQQITLYCQLYHQLPPQDRFAWEIDVNASG-GMNGYIWLCER-NGLRS  674 (740)
Q Consensus       598 ~~~~~~~Lt~eE~~RN~~g~~~lf~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~g~~~~~~~-~~~~~  674 (740)
                      +++.++.||+||+.||.+|.++||..+.++- ...+..+|+++...     ..+.++.+.+. |+.|.+..... ..+..
T Consensus       683 ~~~~~~~Ls~eE~~RN~~g~~llf~s~~~~~~~~l~~~lysk~~~~-----~~~~m~~~~~~~GL~g~v~~~ae~~~pn~  757 (953)
T COG5049         683 VAVKYPTLSEEERKRNLRGLDLLFSSNKKSDLSELFKDLYSKCKQK-----EYITMCSKESPYGLFGTVKLGAEGLAPNL  757 (953)
T ss_pred             HHhhcccCCHHHHhccccCCceeEeccCCccHHHHHHHHHHhhccC-----Cceeeeccccccccccccccccccccccc
Confidence            9999999999999999999999999988874 44566788866432     33456666655 99999977643 34334


Q ss_pred             ccCCCCCC--------CCCCCCCcEEEEEecCCC-CCCCCCCCCCCCCCCcccccccchh
Q 004647          675 IIPSPVKG--------LPDIERNQAINVTYLNPQ-KHRHIPEPPKGATIPAKVCEQCIFI  725 (740)
Q Consensus       675 ~~~~p~~~--------~~~i~~~~~~~~~y~~p~-~~~~~~~~l~g~~~p~~~l~~~~~~  725 (740)
                      ...||+..        +..++.|+++.+.+.+|. ...|++.+++|++.|-.|++.++.-
T Consensus       758 ~~lcp~~~~s~~~l~~~~~~~~n~S~~lv~~~pks~~~~ksi~~rg~~~~~~vl~py~~e  817 (953)
T COG5049         758 LSLCPISFLSYPGLMVFLEYSKNQSARLVIEDPKSTVTNKSIVLRGFIKPINVLWPYLRE  817 (953)
T ss_pred             cccCccccccccchhhhcccccCCceEEEeeccccccchHHHHHHhcCCccccCCHHHHH
Confidence            56677642        335678999999999995 4578899999999999999988764


No 3  
>KOG2045 consensus 5'-3' exonuclease XRN1/KEM1/SEP1 involved in DNA strand exchange and mRNA turnover [Replication, recombination and repair; Cell cycle control, cell division, chromosome partitioning]
Probab=100.00  E-value=3.5e-181  Score=1510.04  Aligned_cols=561  Identities=46%  Similarity=0.847  Sum_probs=490.6

Q ss_pred             CccchHHHHHHhhCCCcccccccCCCccCCCCcccCCCCCCCCCCCcccCeEEeeccccccccccCCCCC--CCCCHHHH
Q 004647            1 MGVPAFYRWLADRYPLSIVDVVEEDPQVDGEGVARPVDVSKPNPNGMEFDNLYLDMNGIIHPCFHPDGKP--APTSYDDV   78 (740)
Q Consensus         1 MGVP~ffrWL~~rYP~i~~~~~e~~~~~~~~g~~~p~d~s~pn~n~~e~DnLYlDmNgIIH~c~h~~~~~--~p~te~e~   78 (740)
                      ||||+||||+++|||.+ ++++|+.        ++|           ||||||||||||||+|+|+++..  .+.|||||
T Consensus         1 MGvPKFfR~iSERyP~l-seliee~--------qIP-----------EFDNLYLDMNgIlHNCsH~nDddvt~rLtEeEi   60 (1493)
T KOG2045|consen    1 MGVPKFFRYISERYPCL-SELIEEH--------QIP-----------EFDNLYLDMNGILHNCSHPNDDDVTFRLTEEEI   60 (1493)
T ss_pred             CCchHHHHHhhhhchHH-HHHhhhc--------cCC-----------cccceeeecccccccCCCCCCCccCcCCCHHHH
Confidence            99999999999999995 5677643        455           99999999999999999998754  57899999


Q ss_pred             HHHHHHHHHHHHhhcccceeEEEeecCCCchhhhHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHhCcccCCCCCCCCCCC
Q 004647           79 FKSIFDYIDHIFLLVRPRKLLYLAIDGVAPRAKMNQQRTRRFRAAKDAAEAEAEEERLRKEFEEAGKLLSAKEKPETCDS  158 (740)
Q Consensus        79 ~~~If~yid~lv~~vrPrkllyiAiDGVAPrAKmnQQRsRRfrsa~e~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~fDs  158 (740)
                      |.+||+|||+||.++||+|++|||||||||||||||||+||||+|++|..+....       ...|+..+    .+.|||
T Consensus        61 f~~IfnYIdhLf~~IkPqKlffMAVDGvAPRAKMNQQRsRRFrTArdAe~qlaKA-------~enGe~~p----~erFDS  129 (1493)
T KOG2045|consen   61 FQEIFNYIDHLFYLIKPQKLFFMAVDGVAPRAKMNQQRSRRFRTARDAEQQLAKA-------AENGELRP----HERFDS  129 (1493)
T ss_pred             HHHHHHHHHHHHHhhCcceEEEEeecccCchhhhhHHHHHhhhhhhhHHHHHHHH-------HhccccCc----cccccc
Confidence            9999999999999999999999999999999999999999999999987654332       23565432    378999


Q ss_pred             cccccccHHHHHHHHHHHHHHHHHHhcCCCCcccEEEEcCCCCCCChhhHHHHHHHHhhcCCCCCCCCcEEEEecChhHH
Q 004647          159 NVITPGTQFMAVLSAALQYFIQARLNQIPGWQFTKVILSDANVPGEGEHKIMSYIRLQRNLPGFDPNTRHCLYGLDADLI  238 (740)
Q Consensus       159 N~ITPGT~FM~~L~~~L~~yi~~kl~~dp~w~~l~VI~Sds~vPGEGEHKIm~fIR~qr~~p~ydpn~~H~IyG~DADLI  238 (740)
                      |||||||+||.||++.|+|||+.|+++|+.|++++||+||++||||||||||+|||.++++|+||||||||+||+|||||
T Consensus       130 NcITPGTeFM~rl~~~L~yfIktKistDs~Wq~~~vIlSGhevPGEGEHKIMdyIRt~kaq~dydpNTRHClYGLDADLI  209 (1493)
T KOG2045|consen  130 NCITPGTEFMVRLQEGLRYFIKTKISTDSLWQRCTVILSGHEVPGEGEHKIMDYIRTMKAQPDYDPNTRHCLYGLDADLI  209 (1493)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHhccccchhhcccEEEEeCCcCCCcchHHHHHHHHHhhcCCCCCCCcceeecccchhhh
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHhhcCCceEEEeeccccCCCCCcccccccccCcccccccCCCCCCCCCCCCCCCCcccccceEEEehHHHHHHHHhhh
Q 004647          239 MLSLATHEIHFSILREVITLPGQQEKCFVCGQVGHLAAECHGKPGDNPADWNGVDDTPIHKKKYQFLNIWVLREYLQYEL  318 (740)
Q Consensus       239 mL~Lathe~~f~ILRE~v~~~~~~~~c~~c~~~~h~~~~c~~~~~~~~~~~~~~~~~~~~~~~f~~l~i~~LREyL~~e~  318 (740)
                      ||||.||+|||.+|||+|+|+.+.+                              .+....++|.+||+++|||||+.||
T Consensus       210 mLGL~tHepHF~lLREEVtFgrrn~------------------------------~k~lehqkFyLLHLsLLREYlelEF  259 (1493)
T KOG2045|consen  210 MLGLCTHEPHFVLLREEVTFGRRNK------------------------------RKSLEHQKFYLLHLSLLREYLELEF  259 (1493)
T ss_pred             eeeeccCCcceeeeeeeeecccccc------------------------------cchhhhhhhhhhHHHHHHHHHHHHH
Confidence            9999999999999999999863211                              1123456899999999999999999


Q ss_pred             CC--CCCCCCCchhhhHHHHHHHhhhhcCCCCCCCCcccccchhHHHHHHHHHHHhhhcCCccccCCeechHHHHHHHHH
Q 004647          319 DI--PNPPFPINFERIVDDFVFLCFFVGNDFLPHMPTLEIREGAINLLMHVYRREFTAMGGYLTDAGEVLLDRVEKFIQS  396 (740)
Q Consensus       319 ~~--~~~~~~~d~eriIDDfVfLcf~vGNDFLPhlPsl~I~egaid~Li~~Yk~~l~~~~gYLt~~g~inl~~l~~fl~~  396 (740)
                      .-  ...+|++|+|||+||||+|.||||||||||||+|+|.+||+..|+.+||+++|++||||.++|+||+.||+.|+.+
T Consensus       260 ~e~rdt~~fkyd~erIlDD~ILl~flVGNDFLPhLP~LHIn~gAlpllystykkvlpt~~GyINE~G~lNl~Rle~~L~e  339 (1493)
T KOG2045|consen  260 DELRDTDEFKYDIERILDDWILLGFLVGNDFLPHLPCLHINSGALPLLYSTYKKVLPTLGGYINENGKLNLRRLEIFLSE  339 (1493)
T ss_pred             HHhhhccchhhhHHHHHHHHHHHHHhhccccccCCCccccCCChHHHHHHHHHHHhccCCccccccceecHHHHHHHHHH
Confidence            73  3568999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHhHHHHHHHHHHHHHHHHHHHH--------------HHH---hhhc-----cchhhh---------------------
Q 004647          397 VAVYEDQIFQKRTRIQQAYEYNEA--------------MKL---NARR-----ESSEEL---------------------  433 (740)
Q Consensus       397 l~~~E~~if~~r~~~~~~~~~~~~--------------~~~---~~r~-----~~~~~~---------------------  433 (740)
                      |..+|.++|+++.+..+..++...              .+.   +...     +.++.+                     
T Consensus       340 L~nfeke~Fke~led~k~~nskr~r~~~~~~~~~~~~dika~t~sq~~d~l~~~~~~~p~i~~~a~ld~dD~~Fl~~~~e  419 (1493)
T KOG2045|consen  340 LTNFEKEHFKEHLEDLKYMNSKRERFDDPEQQELAEMDIKAITESQNLDSLLGEESKDPLINKSALLDDDDSAFLSDHEE  419 (1493)
T ss_pred             HHhhhHHHHHHHHHhhhhccccccccccHHHHhhhcccHHhhhhhhhhhhhccccccccccccccccccchHHHHHHhhh
Confidence            999999999988754432211000              000   0000     000000                     


Q ss_pred             --------------h-c--Ccccccccc----------------------ccCC--------cchHHhHHHHhhCCCChh
Q 004647          434 --------------L-Q--APVAVADTV----------------------KLGE--------PGYKERYYADKFEISNPE  466 (740)
Q Consensus       434 --------------~-~--~~~~~~d~v----------------------~l~~--------~~~k~~YY~~Kf~~~~~~  466 (740)
                                    + .  ...+.+|++                      -+||        ..||..||++|++.++.+
T Consensus       420 Dl~~~~~~s~s~~~~ld~~~~de~EdEf~~~~~t~~ls~~~~~~~~n~eea~~eKti~n~~F~rwK~~yYrdKlkf~~~d  499 (1493)
T KOG2045|consen  420 DLSDLEPGSGSDELLLDNLDADELEDEFAVELATLALSGMNDADFANDEEACWEKTILNKEFQRWKRNYYRDKLKFDPND  499 (1493)
T ss_pred             hccccccccCccchhhccccchhhhHHHHHHHHHHHHhccccccccchHHHhhhhhhHHHHHHHHHHHHhhhhhcCCCcc
Confidence                          0 0  000001110                      0121        269999999999997653


Q ss_pred             hHHHHHHHHHHHHHHHHHHHhhhhccCccccccccCCCCCccccchhcccCccccccCCCCCChHHHhhhcCCCCCCCCC
Q 004647          467 EIDKVKKDVVLKYVEGLCWVCRYYYQDVCSWQWFYPYHYAPFASDLKDLSDLEITFFLGEPFRPFDQLMGTLPAASSSAL  546 (740)
Q Consensus       467 ~~~~~~~~v~~~YleGL~WVl~YYy~G~~SW~WyYPyHYAPfasDl~~l~~~~i~F~~g~Pf~P~eQLm~VLP~~S~~~L  546 (740)
                        ++..+++|..|||||||||.|||+||+||+||||||||||+||+.+.-+++|.|++|+||+||||||||||++|+.+|
T Consensus       500 --ee~lrelae~YVeaLQWvL~YYYrGc~SWsWyYphHyaP~ISDl~kgldv~ieF~mgtPF~PFqQLlAVLPaaSa~ll  577 (1493)
T KOG2045|consen  500 --EELLRELAEHYVEALQWVLDYYYRGCQSWSWYYPHHYAPFISDLKKGLDVEIEFHMGTPFLPFQQLLAVLPAASAKLL  577 (1493)
T ss_pred             --HHHHHHHHHHHHHHHHHHHHHHhcCCccccccccccccchhHhHhcccceeEEEecCCCCCcHHHHHHhchhhhhccC
Confidence              356689999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             chhhhhccCCCCCcccccCCCcccccCCCCccceeeeecccCCChHHHHHHHhhhccCCCHHHHhhccCCCcEEEEcC
Q 004647          547 PEKYRNLMTDPSSPIYKFYPPDFQIDMNGKRFAWQGVVKLPFIDEKLLLRQTKKLEVFLTEEELFRNSVMLDLLYVHP  624 (740)
Q Consensus       547 P~~~~~Lm~~~~SpI~dfYP~~f~iD~nGk~~~WqgV~lLPFIDe~~Ll~a~~~~~~~Lt~eE~~RN~~g~~~lf~~~  624 (740)
                      |.+||+||.+++|||+||||.+|+.|+|||+.+|++|||||||||+||++||.+.+..||.||+.||++|.+++|.+.
T Consensus       578 Pp~frdLM~~~~SPI~DFYPaefelD~NGKtadWEAVVLIpFIdEkRLleAm~pk~~~Ls~EEr~RNs~g~~~vys~~  655 (1493)
T KOG2045|consen  578 PPAFRDLMLLPTSPIADFYPAEFELDLNGKTADWEAVVLIPFIDEKRLLEAMLPKEAQLSLEERERNSHGPMYVYSYS  655 (1493)
T ss_pred             ChhhhHhhcCCCCchhhcchhhheecccCCccceeEEEEEeecchHHHHHHHhhHhhhcCHHHhhhcccCCceeeecc
Confidence            999999999999999999999999999999999999999999999999999999999999999999999999999764


No 4  
>PF03159 XRN_N:  XRN 5'-3' exonuclease N-terminus;  InterPro: IPR004859 Signatures of this entry align residues towards the N terminus of several proteins with multiple functions. The members of this family all appear to possess 5'-3' exonuclease activity 3.1.11 from EC. Thus, the aligned region may be necessary for 5'-3' exonuclease function.; GO: 0003676 nucleic acid binding, 0004527 exonuclease activity, 0005622 intracellular; PDB: 2Y35_A 3PIE_B 3PIF_C 3FQD_A.
Probab=100.00  E-value=5e-87  Score=688.52  Aligned_cols=237  Identities=57%  Similarity=1.028  Sum_probs=193.2

Q ss_pred             CccchHHHHHHhhCCCcccccccCCCccCCCCcccCCCCCCCCCCCcccCeEEeeccccccccccCCCCCCCCCHHHHHH
Q 004647            1 MGVPAFYRWLADRYPLSIVDVVEEDPQVDGEGVARPVDVSKPNPNGMEFDNLYLDMNGIIHPCFHPDGKPAPTSYDDVFK   80 (740)
Q Consensus         1 MGVP~ffrWL~~rYP~i~~~~~e~~~~~~~~g~~~p~d~s~pn~n~~e~DnLYlDmNgIIH~c~h~~~~~~p~te~e~~~   80 (740)
                      ||||+|||||++|||.++..+.+...                   ..+|||||||||||||+|+|++..+.+.++++||+
T Consensus         1 MGVp~f~~wl~~ryp~~~~~~~~~~~-------------------~~~~D~LYiDmN~IIH~~~~~~~~~~~~~~~~~~~   61 (237)
T PF03159_consen    1 MGVPGFFRWLSERYPLIVRPISENSI-------------------PSEFDNLYIDMNGIIHNCIHPNDSSIPKTEEEIFQ   61 (237)
T ss_dssp             --CCHHHHHHHHHSGGGEEEECTTTS-------------------EE-ESEEEEETHHHHHHHHS-SSS----SHHHHHH
T ss_pred             CCHHHHHHHHHHhCCcceeeccccCC-------------------CCcCCEEEEEcchhhhHhcCCcccCCCccHHHHHH
Confidence            99999999999999999887654321                   12699999999999999999998888899999999


Q ss_pred             HHHHHHHHHHhhcccceeEEEeecCCCchhhhHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHhCcccCCCCCCCCCCCcc
Q 004647           81 SIFDYIDHIFLLVRPRKLLYLAIDGVAPRAKMNQQRTRRFRAAKDAAEAEAEEERLRKEFEEAGKLLSAKEKPETCDSNV  160 (740)
Q Consensus        81 ~If~yid~lv~~vrPrkllyiAiDGVAPrAKmnQQRsRRfrsa~e~~~~~~~~~~~~~~~~~eg~~~~~~~~~~~fDsN~  160 (740)
                      +||+|||+||++|||||+||||||||||||||||||+|||+++++++....+..+.+++...+|...+.......||||+
T Consensus        62 ~i~~~id~l~~~v~P~k~l~iavDGvaP~AKm~qQR~RRf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fdsn~  141 (237)
T PF03159_consen   62 RIFNYIDRLVRIVRPRKLLYIAVDGVAPRAKMNQQRSRRFKSAKESEENNKEESEIKEEIDEEGEQLPPEDQEEKFDSNC  141 (237)
T ss_dssp             HHHHHHHHHHHHH-ESSEEEEE---S--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT--B-HHHHS----GGG
T ss_pred             HHHHHHHHhheeecCceEEEEEcCCCCCchHHHHHHHHHHHHhhcchhHHHHHHHHhhhhhhccccccccccccccccce
Confidence            99999999999999999999999999999999999999999999998887777777777777776555545568999999


Q ss_pred             cccccHHHHHHHHHHHHHHHHHHhcCCCCcccEEEEcCCCCCCChhhHHHHHHHHhhcCCCCCCCCcEEEEecChhHHHH
Q 004647          161 ITPGTQFMAVLSAALQYFIQARLNQIPGWQFTKVILSDANVPGEGEHKIMSYIRLQRNLPGFDPNTRHCLYGLDADLIML  240 (740)
Q Consensus       161 ITPGT~FM~~L~~~L~~yi~~kl~~dp~w~~l~VI~Sds~vPGEGEHKIm~fIR~qr~~p~ydpn~~H~IyG~DADLImL  240 (740)
                      |||||+||.+|+++|++|+++|+++||.|++++||+||++||||||||||+|||+++++|+|+||++|||||+|||||||
T Consensus       142 ITPGT~FM~~l~~~L~~~~~~k~~~~~~~~~~~vi~S~~~vpGEGE~KI~~~IR~~~~~~~~~~n~~h~i~g~DaDlIll  221 (237)
T PF03159_consen  142 ITPGTEFMEKLSDALRYYIKKKLNSDPKWQNLKVIFSGSDVPGEGEHKIMDFIRSQRSQPDYDPNTSHCIYGSDADLILL  221 (237)
T ss_dssp             SSTTSHHHHHHHHHHHHHHHHHHHH-GGGCCSEEEEE-TTSSS-HHHHHHHHHHHHHHSTTS-TT--EEEE-SSTHHHHH
T ss_pred             eccCCHHHHHHHHHHHHHHHHHhcCCCCcCceEEEEeCCCCCCccHHHHHHHHHHhhhcCCCCCCceEEEEecCHhHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HhhcCCceEEEeeccc
Q 004647          241 SLATHEIHFSILREVI  256 (740)
Q Consensus       241 ~Lathe~~f~ILRE~v  256 (740)
                      ||++|++||+||||+|
T Consensus       222 ~L~~~~~~~~ilre~~  237 (237)
T PF03159_consen  222 SLATHEPNIYILREEV  237 (237)
T ss_dssp             HHHTT-SSEEEEEESS
T ss_pred             HHccCCCeEEEEeccC
Confidence            9999999999999974


No 5  
>cd00128 XPG Xeroderma pigmentosum G N- and I-regions (XPGN, XPGI); contains the HhH2 motif; domain in nucleases. XPG is a eukaryotic enzyme that functions in nucleotide-excision repair and transcription-coupled repair of oxidative DNA damage. Functionally/structurally related to FEN-1;  divalent metal ion-dependent exo- and endonuclease, and bacterial and bacteriophage 5'3' exonucleases.
Probab=98.72  E-value=2.3e-07  Score=100.57  Aligned_cols=237  Identities=17%  Similarity=0.239  Sum_probs=126.9

Q ss_pred             CccchHHHHHHhhCCCcccccccCCCccCCCCcccCCCCCCCCCCCcccCeEEeeccccccccccCCCCC--CCCCHHHH
Q 004647            1 MGVPAFYRWLADRYPLSIVDVVEEDPQVDGEGVARPVDVSKPNPNGMEFDNLYLDMNGIIHPCFHPDGKP--APTSYDDV   78 (740)
Q Consensus         1 MGVP~ffrWL~~rYP~i~~~~~e~~~~~~~~g~~~p~d~s~pn~n~~e~DnLYlDmNgIIH~c~h~~~~~--~p~te~e~   78 (740)
                      |||++|..||...-+.+  .+ ++              .     .|   --|=||.++.+|.+.......  ........
T Consensus         1 MGI~gL~~~l~~~~~~~--~i-~~--------------l-----~g---k~laID~~~~l~r~~~a~~~~~~~~g~~~~~   55 (316)
T cd00128           1 MGIKGLWPLLKPVARPV--HL-EE--------------L-----RG---KKVAIDASIWLYQFLKACRQELGSGGETTSH   55 (316)
T ss_pred             CchhhHHHHHHhhCCCC--CH-HH--------------h-----CC---cEEEecHHHHHHHHHHHhhhhccCCCCCcHH
Confidence            99999999999876653  11 10              0     01   147899999999875432110  11111233


Q ss_pred             HHHHHHHHHHHHhhcccceeEEEeecCCCchhhhHHHHHhhhhhH--HHHHHHHHH---HHHHHHHHHHhCcccCCCCCC
Q 004647           79 FKSIFDYIDHIFLLVRPRKLLYLAIDGVAPRAKMNQQRTRRFRAA--KDAAEAEAE---EERLRKEFEEAGKLLSAKEKP  153 (740)
Q Consensus        79 ~~~If~yid~lv~~vrPrkllyiAiDGVAPrAKmnQQRsRRfrsa--~e~~~~~~~---~~~~~~~~~~eg~~~~~~~~~  153 (740)
                      ...++..+.+|+.. .. + ..+++||.+|-.|......||-+..  ++......+   .++..+..             
T Consensus        56 l~~~~~rl~~L~~~-~i-~-pvfVFDG~~~~~K~~~~~~R~~~r~~~~~~~~~~~~~~~~~~~~~~~-------------  119 (316)
T cd00128          56 LQGFFYRTCRLLEL-GI-K-PVFVFDGKPPPLKAETLAKRRERREEAEEEAKEALEKGLEEEAKKLE-------------  119 (316)
T ss_pred             HHHHHHHHHHHHHC-CC-E-EEEEEcCCCchhhHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHH-------------
Confidence            44555555555542 22 3 3467999999888877665554322  211111000   00111000             


Q ss_pred             CCCCCcccccccHHHHHHHHHHHHHHHHHHhcCCCCcccEEEEcCCCCCCChhhHHHHHHHHhhcCCCCCCCCcEEEEec
Q 004647          154 ETCDSNVITPGTQFMAVLSAALQYFIQARLNQIPGWQFTKVILSDANVPGEGEHKIMSYIRLQRNLPGFDPNTRHCLYGL  233 (740)
Q Consensus       154 ~~fDsN~ITPGT~FM~~L~~~L~~yi~~kl~~dp~w~~l~VI~Sds~vPGEGEHKIm~fIR~qr~~p~ydpn~~H~IyG~  233 (740)
                      .  .+..+||.+  ...    ++..+..        -++.+|.+    |||+|-=+-..-+.         .....|+|.
T Consensus       120 ~--~~~~~~~~~--~~~----~~~lL~~--------~gi~~i~a----p~EAdaq~a~l~~~---------g~v~~i~S~  170 (316)
T cd00128         120 R--RAVRVTPQM--IEE----AKELLRL--------MGIPYIVA----PYEAEAQCAYLAKK---------GLVDAIITE  170 (316)
T ss_pred             h--ccCcCCHHH--HHH----HHHHHHH--------cCCCEEEC----CcCHHHHHHHHHhC---------CCeeEEEec
Confidence            0  012244432  122    2322221        15677764    79999765544331         235789999


Q ss_pred             ChhHHHHHhhcCCceEEEeeccccCCCCCcccccccccCcccccccCCCCCCCCCCCCCCCCcccccceEEEehHHHHHH
Q 004647          234 DADLIMLSLATHEIHFSILREVITLPGQQEKCFVCGQVGHLAAECHGKPGDNPADWNGVDDTPIHKKKYQFLNIWVLREY  313 (740)
Q Consensus       234 DADLImL~Lathe~~f~ILRE~v~~~~~~~~c~~c~~~~h~~~~c~~~~~~~~~~~~~~~~~~~~~~~f~~l~i~~LREy  313 (740)
                      |+|+++++-    ++  ++|.- ...                                      ....++.++..-+.+.
T Consensus       171 DsD~l~fg~----~~--vi~~~-~~~--------------------------------------~~~~~~~~~~~~~~~~  205 (316)
T cd00128         171 DSDLLLFGA----PR--VYRNL-FDS--------------------------------------GAKPVEEIDLEKILKE  205 (316)
T ss_pred             CCCeeeecC----ce--EEEec-ccC--------------------------------------CCCceEEEEHHHHHHH
Confidence            999988762    22  33321 000                                      0023456776555444


Q ss_pred             HHhhhCCCCCCCCCchhhhHHHHHHHhhhhcCCCCCCCCcccccchhHHHHHHHHH
Q 004647          314 LQYELDIPNPPFPINFERIVDDFVFLCFFVGNDFLPHMPTLEIREGAINLLMHVYR  369 (740)
Q Consensus       314 L~~e~~~~~~~~~~d~eriIDDfVfLcf~vGNDFLPhlPsl~I~egaid~Li~~Yk  369 (740)
                      +    .         +.  -+.|+.+|.|+|+||+|++|++-+.. |+. |+..|.
T Consensus       206 l----g---------l~--~~q~id~~~L~G~Dy~~gv~giG~k~-A~~-li~~~~  244 (316)
T cd00128         206 L----G---------LT--REKLIDLAILLGCDYTEGIPGIGPVT-ALK-LIKKYG  244 (316)
T ss_pred             c----C---------CC--HHHHHHHHHhcCCCCCCCCCCccHHH-HHH-HHHHcC
Confidence            3    2         11  24788999999999999999866542 333 444444


No 6  
>PTZ00217 flap endonuclease-1; Provisional
Probab=98.45  E-value=6.2e-06  Score=92.28  Aligned_cols=231  Identities=19%  Similarity=0.286  Sum_probs=126.0

Q ss_pred             CccchHHHHHHhhCCCcccccccCCCccCCCCcccCCCCCCCCCCCcccCeEEeeccccccccccCC-----CCCC----
Q 004647            1 MGVPAFYRWLADRYPLSIVDVVEEDPQVDGEGVARPVDVSKPNPNGMEFDNLYLDMNGIIHPCFHPD-----GKPA----   71 (740)
Q Consensus         1 MGVP~ffrWL~~rYP~i~~~~~e~~~~~~~~g~~~p~d~s~pn~n~~e~DnLYlDmNgIIH~c~h~~-----~~~~----   71 (740)
                      |||.++..+|....|.+++.+-    ...--|                 =-+=||....+|.....-     +.+.    
T Consensus         1 MGI~gL~~~l~~~~p~~~~~~~----l~~l~g-----------------k~vaIDa~~~lyr~~~a~~~~~~~~~l~~~~   59 (393)
T PTZ00217          1 MGIKGLSKFLADKAPNAIKEQE----LKNYFG-----------------RVIAIDASMALYQFLIAIRDDSQGGNLTNEA   59 (393)
T ss_pred             CChhhHHHHHhhhccccccccC----HHHhCC-----------------cEEEEeHHHHHHHHHHHcccccccccchhcc
Confidence            9999999999999999875431    000011                 147899999999754211     1110    


Q ss_pred             CCCHHHHHHHHHHHHHHHHhh-cccceeEEEeecCCCchhhhHHHHHhh--hhhHHHHHHHHH---HHHHHHHHHHHhCc
Q 004647           72 PTSYDDVFKSIFDYIDHIFLL-VRPRKLLYLAIDGVAPRAKMNQQRTRR--FRAAKDAAEAEA---EEERLRKEFEEAGK  145 (740)
Q Consensus        72 p~te~e~~~~If~yid~lv~~-vrPrkllyiAiDGVAPrAKmnQQRsRR--frsa~e~~~~~~---~~~~~~~~~~~eg~  145 (740)
                      -... .-+.-+|..+-+|+.. ++|    .+++||.+|-.|...-..||  ...|.+......   ..+++++...    
T Consensus        60 G~~t-~~l~g~~~r~~~Ll~~gikP----v~VFDG~~p~~K~~~~~~Rk~~R~~a~~~l~~a~~~g~~~~a~k~~~----  130 (393)
T PTZ00217         60 GEVT-SHISGLFNRTIRLLEAGIKP----VYVFDGKPPELKSGELEKRRERREEAEEELEKAIEEGDDEEIKKQSK----  130 (393)
T ss_pred             CCcc-HHHHHHHHHHHHHHHCCCCE----EEEEcCCCchhhHHHHHHHHHHHHHhHHHHHHHHhcCCHHHHHHHHh----
Confidence            0011 2334566666677764 788    47999999976665443333  333322111100   0011111110    


Q ss_pred             ccCCCCCCCCCCCcccccccHHHHHHHHHHHHHHHHHHhcCCCCcccEEEEcCCCCCCChhhHHHHHHHHhhcCCCCCCC
Q 004647          146 LLSAKEKPETCDSNVITPGTQFMAVLSAALQYFIQARLNQIPGWQFTKVILSDANVPGEGEHKIMSYIRLQRNLPGFDPN  225 (740)
Q Consensus       146 ~~~~~~~~~~fDsN~ITPGT~FM~~L~~~L~~yi~~kl~~dp~w~~l~VI~Sds~vPGEGEHKIm~fIR~qr~~p~ydpn  225 (740)
                                 .+.-||+  +-+..+.+.|+    .        -++.+|.+    |||+|.=|-..-+         .+
T Consensus       131 -----------r~~~vt~--~~~~~~~~lL~----~--------~Gip~i~A----P~EAdaq~A~L~~---------~g  172 (393)
T PTZ00217        131 -----------RTVRVTK--EQNEDAKKLLR----L--------MGIPVIEA----PCEAEAQCAELVK---------KG  172 (393)
T ss_pred             -----------hcccCCH--HHHHHHHHHHH----H--------cCCceEEC----CcCHHHHHHHHHH---------CC
Confidence                       0112332  12222222222    1        24667754    8999995544432         24


Q ss_pred             CcEEEEecChhHHHHHhhcCCceEEEeeccccCCCCCcccccccccCcccccccCCCCCCCCCCCCCCCCcccccceEEE
Q 004647          226 TRHCLYGLDADLIMLSLATHEIHFSILREVITLPGQQEKCFVCGQVGHLAAECHGKPGDNPADWNGVDDTPIHKKKYQFL  305 (740)
Q Consensus       226 ~~H~IyG~DADLImL~Lathe~~f~ILRE~v~~~~~~~~c~~c~~~~h~~~~c~~~~~~~~~~~~~~~~~~~~~~~f~~l  305 (740)
                      ....|++-|.|+++++-    +  .++|.- ...+                                    .....++.+
T Consensus       173 ~v~~ViS~D~D~l~fg~----~--~vi~~l-~~~~------------------------------------~~~~~~~~~  209 (393)
T PTZ00217        173 KVYAVATEDMDALTFGT----P--VLLRNL-NFSE------------------------------------AKKRPIQEI  209 (393)
T ss_pred             CeEEEeCCCcCeeecCC----c--EEEEcc-cccc------------------------------------cCCCCeEEE
Confidence            56789999999998872    1  335431 1000                                    011235667


Q ss_pred             ehHHHHHHHHhhhCCCCCCCCCchhhhHHHHHHHhhhhcCCCCCCCCccccc
Q 004647          306 NIWVLREYLQYELDIPNPPFPINFERIVDDFVFLCFFVGNDFLPHMPTLEIR  357 (740)
Q Consensus       306 ~i~~LREyL~~e~~~~~~~~~~d~eriIDDfVfLcf~vGNDFLPhlPsl~I~  357 (740)
                      +...+.+.+    .         +.  -+.||-+|.|+|.||+|.+|++-..
T Consensus       210 ~~~~v~~~~----g---------l~--~~q~id~~iL~G~Dy~pgi~GIG~k  246 (393)
T PTZ00217        210 NLSTVLEEL----G---------LS--MDQFIDLCILCGCDYCDTIKGIGPK  246 (393)
T ss_pred             EHHHHHHHh----C---------CC--HHHHHHHHHHhCCCCCCCCCCccHH
Confidence            776554432    2         11  2478889999999999999987553


No 7  
>TIGR03674 fen_arch flap structure-specific endonuclease. Endonuclease that cleaves the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. Has 5'-endo-/exonuclease and 5'-pseudo-Y-endonuclease activities. Cleaves the junction between single and double-stranded regions of flap DNA
Probab=97.59  E-value=0.0014  Score=72.34  Aligned_cols=66  Identities=14%  Similarity=0.172  Sum_probs=40.4

Q ss_pred             eEEeeccccccccccCC---CCCCCCC----HHHHHHHHHHHHHHHHhh-cccceeEEEeecCCCchhhhHHHHHhhh
Q 004647           51 NLYLDMNGIIHPCFHPD---GKPAPTS----YDDVFKSIFDYIDHIFLL-VRPRKLLYLAIDGVAPRAKMNQQRTRRF  120 (740)
Q Consensus        51 nLYlDmNgIIH~c~h~~---~~~~p~t----e~e~~~~If~yid~lv~~-vrPrkllyiAiDGVAPrAKmnQQRsRRf  120 (740)
                      -+-||....+|.+...-   ++..-.+    ...-+..+|..+-+++.. ++|    .+++||.+|-.|..+-..||-
T Consensus        23 ~vaIDas~~L~r~~~a~~~~~g~~l~~~~G~~t~~l~g~~~~~~~ll~~~i~P----v~VFDG~~p~~K~~~~~~R~~   96 (338)
T TIGR03674        23 VVAVDAFNALYQFLSSIRQPDGTPLMDSRGRITSHLSGLFYRTINLLENGIKP----VYVFDGKPPELKAETLEERRE   96 (338)
T ss_pred             EEEEeHHHHHHHHHHHHhccccchhhhccCCCcHHHHHHHHHHHHHHHCCCeE----EEEECCCChhhhHhhHHHHHH
Confidence            47899999999765421   1100000    012234445555666665 778    799999999877776666654


No 8  
>PRK03980 flap endonuclease-1; Provisional
Probab=97.57  E-value=0.0017  Score=70.26  Aligned_cols=25  Identities=16%  Similarity=0.491  Sum_probs=21.5

Q ss_pred             HHHHHHhhhhcCCCCCCCCcccccc
Q 004647          334 DDFVFLCFFVGNDFLPHMPTLEIRE  358 (740)
Q Consensus       334 DDfVfLcf~vGNDFLPhlPsl~I~e  358 (740)
                      +.|+-+|.|+|.||.|++|++-+..
T Consensus       177 ~q~id~~iL~G~Dy~~GI~GIG~kt  201 (292)
T PRK03980        177 EQLIDIAILVGTDYNPGIKGIGPKT  201 (292)
T ss_pred             HHHHHHHHhcCCCCCCCCCCccHHH
Confidence            4678899999999999999887753


No 9  
>smart00475 53EXOc 5'-3' exonuclease.
Probab=95.74  E-value=0.61  Score=49.83  Aligned_cols=133  Identities=15%  Similarity=0.175  Sum_probs=73.5

Q ss_pred             EEeeccccccccccCCCC--CCCCCHHHHHHHHHHHHHHHHhhcccceeEEEeecCCCchhhhHHHHHhhhhhHHHHHHH
Q 004647           52 LYLDMNGIIHPCFHPDGK--PAPTSYDDVFKSIFDYIDHIFLLVRPRKLLYLAIDGVAPRAKMNQQRTRRFRAAKDAAEA  129 (740)
Q Consensus        52 LYlDmNgIIH~c~h~~~~--~~p~te~e~~~~If~yid~lv~~vrPrkllyiAiDGVAPrAKmnQQRsRRfrsa~e~~~~  129 (740)
                      |-||.|++||-+.|.-..  ...-........++..+-+++...+|..+ .+|+||-.|.-     |..-+-..|     
T Consensus         4 llIDg~~~i~R~~~a~~~l~~~~G~~t~a~~g~~~~l~~l~~~~~p~~~-~~~fD~~~~~~-----R~~l~p~YK-----   72 (259)
T smart00475        4 LLVDGSSLAFRAYFALPPLKNSKGEPTNAVYGFLRMLLKLIKEEKPTYV-AVVFDAKGKTF-----RHELYPEYK-----   72 (259)
T ss_pred             EEEeCcHHHHHHHHCCCcccCCCCCcccHHHHHHHHHHHHHHHcCCCeE-EEEEeCCCCcc-----ccchhHHHH-----
Confidence            689999999998885311  00001123344556666677777799875 59999854421     111111111     


Q ss_pred             HHHHHHHHHHHHHhCcccCCCCCCCCCCCcccccccHHHHHHHHHHHHHHHHHHhcCCCCcccEEEEcCCCCCC-ChhhH
Q 004647          130 EAEEERLRKEFEEAGKLLSAKEKPETCDSNVITPGTQFMAVLSAALQYFIQARLNQIPGWQFTKVILSDANVPG-EGEHK  208 (740)
Q Consensus       130 ~~~~~~~~~~~~~eg~~~~~~~~~~~fDsN~ITPGT~FM~~L~~~L~~yi~~kl~~dp~w~~l~VI~Sds~vPG-EGEHK  208 (740)
                                  + +..              =+|.     .|..++ .++++-+..    -++.+|    .+|| |++==
T Consensus        73 ------------a-~R~--------------~~pe-----~L~~q~-~~~~~~l~~----~gi~~i----~~~g~EADD~  111 (259)
T smart00475       73 ------------A-NRP--------------KTPD-----ELLEQI-PLIKELLDA----LGIPVL----EVEGYEADDV  111 (259)
T ss_pred             ------------h-CCC--------------CCCH-----HHHHHH-HHHHHHHHH----CCCCEE----eeCCcCHHHH
Confidence                        1 000              0111     144444 344443322    223443    3788 99987


Q ss_pred             HHHHHHHhhcCCCCCCCCcEEEEecChhHHHHH
Q 004647          209 IMSYIRLQRNLPGFDPNTRHCLYGLDADLIMLS  241 (740)
Q Consensus       209 Im~fIR~qr~~p~ydpn~~H~IyG~DADLImL~  241 (740)
                      |--..++....     +...+|++.|-|+.-|.
T Consensus       112 iatla~~~~~~-----g~~~~IvS~DkDl~ql~  139 (259)
T smart00475      112 IATLAKKAEAE-----GYEVRIVSGDKDLLQLV  139 (259)
T ss_pred             HHHHHHHHHhC-----CCeEEEEeCCCcHhhcC
Confidence            77666654331     34689999999998764


No 10 
>cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity.
Probab=95.42  E-value=0.67  Score=48.86  Aligned_cols=56  Identities=7%  Similarity=0.082  Sum_probs=38.4

Q ss_pred             EEeeccccccccccCCCCCC---CCCHHHHHHHHHHHHHHHHhhcccceeEEEeecCCCc
Q 004647           52 LYLDMNGIIHPCFHPDGKPA---PTSYDDVFKSIFDYIDHIFLLVRPRKLLYLAIDGVAP  108 (740)
Q Consensus        52 LYlDmNgIIH~c~h~~~~~~---p~te~e~~~~If~yid~lv~~vrPrkllyiAiDGVAP  108 (740)
                      |.||.|+++|.+.|......   .-........++..+.+++...+|.+. .+|+||-+|
T Consensus         4 llIDg~~l~yr~~~a~~~~~~~~~g~~t~ai~g~~~~l~~~~~~~~p~~~-~~~fD~~~~   62 (240)
T cd00008           4 LLIDGSSLAYRAYFALPPLKNSPKGLPTNAVYGFLNMLLKLIKEYKPTYV-AVVFDAGGK   62 (240)
T ss_pred             EEEEChHHHHHHHHCCCCcCCCCCCcCchHHHHHHHHHHHHHHhcCCCeE-EEEEeCCCC
Confidence            68999999999888652111   011223445566677778888899997 699999643


No 11 
>PF00752 XPG_N:  XPG N-terminal domain;  InterPro: IPR006085 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. People's skin cells with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-G is one of the most rare and phenotypically heterogeneous of XP, showing anything from slight to extreme dysfunction in DNA excision repair [, ]. XP-G can be corrected by a 133 Kd nuclear protein, XPGC []. XPGC is an acidic protein that confers normal UV resistance in expressing cells []. It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms [, ]. XPGC cleaves one strand of the duplex at the border with the single-stranded region []. XPG belongs to a family of proteins that includes RAD2 from Saccharomyces cerevisiae (Baker's yeast) and rad13 from Schizosaccharomyces pombe (Fission yeast), which are single-stranded DNA endonucleases [, ]; mouse and human FEN-1, a structure-specific endonuclease; RAD2 from fission yeast and RAD27 from budding yeast; fission yeast exo1, a 5'-3' double-stranded DNA exonuclease that may act in a pathway that corrects mismatched base pairs; yeast DHS1, and yeast DIN7. Sequence alignment of this family of proteins reveals that similarities are largely confined to two regions. The first is located at the N-terminal extremity (N-region) and corresponds to the first 95 to 105 amino acids. The second region is internal (I-region) and found towards the C terminus; it spans about 140 residues and contains a highly conserved core of 27 amino acids that includes a conserved pentapeptide (E-A-[DE]-A-[QS]). It is possible that the conserved acidic residues are involved in the catalytic mechanism of DNA excision repair in XPG. The amino acids linking the N- and I-regions are not conserved. This entry represents the N-terminal of XPG.; GO: 0004518 nuclease activity, 0006281 DNA repair; PDB: 1A77_A 1A76_A 1MC8_B 3QEB_Z 3QEA_Z 3QE9_Y 1UL1_Z 3Q8K_A 3Q8M_A 3Q8L_A ....
Probab=94.98  E-value=0.03  Score=50.71  Aligned_cols=95  Identities=18%  Similarity=0.350  Sum_probs=56.4

Q ss_pred             CccchHHHHHHhhCCCcccccccCCCccCCCCcccCCCCCCCCCCCcccCeEEeeccccccccccCCCCCCCC--CHHHH
Q 004647            1 MGVPAFYRWLADRYPLSIVDVVEEDPQVDGEGVARPVDVSKPNPNGMEFDNLYLDMNGIIHPCFHPDGKPAPT--SYDDV   78 (740)
Q Consensus         1 MGVP~ffrWL~~rYP~i~~~~~e~~~~~~~~g~~~p~d~s~pn~n~~e~DnLYlDmNgIIH~c~h~~~~~~p~--te~e~   78 (740)
                      |||++|..+|....  .+..+.-+             +.     +|   --|=||.+..+|.+.+....+...  .....
T Consensus         1 MGI~gL~~~l~~~~--~v~~~~~~-------------~l-----~g---~~vaID~s~wl~~~~~~~~~~~~~~~~~~~~   57 (101)
T PF00752_consen    1 MGIKGLWQLLKPAA--AVRKVSLS-------------EL-----RG---KRVAIDASCWLHQFLFSCREELGQGVGTDSH   57 (101)
T ss_dssp             ---TTHHHHCHHHE--GEEEEEGG-------------GG-----TT---CEEEEEHHHHHHHHHHHSBCTTSCB-BS-HH
T ss_pred             CCcccHHHHHHhhc--cCCccCHH-------------Hh-----CC---CEEEEEcHHHHHHHHHHhHHHhccccchHHH
Confidence            99999999999976  22211100             00     11   347899999999875544321111  11456


Q ss_pred             HHHHHHHHHHHHh-hcccceeEEEeecCCCchhhhHHHHHhhhhh
Q 004647           79 FKSIFDYIDHIFL-LVRPRKLLYLAIDGVAPRAKMNQQRTRRFRA  122 (740)
Q Consensus        79 ~~~If~yid~lv~-~vrPrkllyiAiDGVAPrAKmnQQRsRRfrs  122 (740)
                      +..++..+..|.+ -|+|    ++.+||.+|-+|..+...||-+.
T Consensus        58 ~~~~~~r~~~L~~~gI~P----ifVFDG~~~~~K~~~~~~R~~~r   98 (101)
T PF00752_consen   58 LRGLFSRLCRLLEHGIKP----IFVFDGKPPPLKRETIQKRRKRR   98 (101)
T ss_dssp             HHHHHHHHHHHHHTTEEE----EEEE--STTGGCHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHCCCEE----EEEECCCCchhhHHHHHHHHHHH
Confidence            6677777776653 4565    68899999999998888776544


No 12 
>smart00485 XPGN Xeroderma pigmentosum G N-region. domain in nucleases
Probab=94.75  E-value=0.039  Score=49.91  Aligned_cols=93  Identities=19%  Similarity=0.295  Sum_probs=57.1

Q ss_pred             CccchHHHHHHhhCCCcccccccCCCccCCCCcccCCCCCCCCCCCcccCeEEeeccccccccccCCCCC--CCCCHHHH
Q 004647            1 MGVPAFYRWLADRYPLSIVDVVEEDPQVDGEGVARPVDVSKPNPNGMEFDNLYLDMNGIIHPCFHPDGKP--APTSYDDV   78 (740)
Q Consensus         1 MGVP~ffrWL~~rYP~i~~~~~e~~~~~~~~g~~~p~d~s~pn~n~~e~DnLYlDmNgIIH~c~h~~~~~--~p~te~e~   78 (740)
                      |||+++..||...    .+.+    +...-.|                 --+=||.+..+|.+......+  ........
T Consensus         1 MGI~gL~~~l~~~----~~~~----~i~~l~g-----------------~~vaIDa~~wl~~~~~~~~~~~~~~~~~~~~   55 (99)
T smart00485        1 MGIKGLWPLLKPV----VREV----PLEALRG-----------------KTLAIDASIWLYQFLTACREKLGTPLPNSKH   55 (99)
T ss_pred             CCHhHHHHHHHHh----cccC----CHHHhCC-----------------ceEeccHHHHHHHHHHHHhhhhcCCCCchHH
Confidence            9999999999875    1110    1000012                 135678889998765432110  11222235


Q ss_pred             HHHHHHHHHHHHh-hcccceeEEEeecCCCchhhhHHHHHhhhhh
Q 004647           79 FKSIFDYIDHIFL-LVRPRKLLYLAIDGVAPRAKMNQQRTRRFRA  122 (740)
Q Consensus        79 ~~~If~yid~lv~-~vrPrkllyiAiDGVAPrAKmnQQRsRRfrs  122 (740)
                      ...+|..+.+|++ -|.|    ++.+||.+|-+|...+..||-+.
T Consensus        56 l~~~~~rl~~L~~~~I~P----ifVFDG~~~~~K~~t~~~R~~~r   96 (99)
T smart00485       56 LMGLFYRTCRLLEFGIKP----IFVFDGKPPPLKSETLAKRRERR   96 (99)
T ss_pred             HHHHHHHHHHHHHCCCeE----EEEECCCCchhhHHHHHHHHHHH
Confidence            5666666666653 3444    58899999999999998887654


No 13 
>PF00867 XPG_I:  XPG I-region;  InterPro: IPR006086 This entry represents endonucleases that cleave the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. Has 5'-endo-/exonuclease and 5'-pseudo-Y-endonuclease activities. Cleaves the junction between single and double-stranded regions of flap DNA. The endonuclease binds 2 magnesium ions per subunit. which probably participate in the reaction catalyzed by the enzyme. May bind an additional third magnesium ion after substrate binding.; GO: 0004518 nuclease activity, 0006281 DNA repair; PDB: 1UL1_Z 3Q8K_A 3Q8M_A 3Q8L_A 2IZO_A 1A77_A 1A76_A 3QEA_Z 3QE9_Y 3QEB_Z ....
Probab=93.94  E-value=0.18  Score=45.43  Aligned_cols=90  Identities=20%  Similarity=0.355  Sum_probs=54.8

Q ss_pred             cEEEEcCCCCCCChhhHHHHHHHHhhcCCCCCCCCcEEEEecChhHHHHHhhcCCceEEEeeccccCCCCCccccccccc
Q 004647          192 TKVILSDANVPGEGEHKIMSYIRLQRNLPGFDPNTRHCLYGLDADLIMLSLATHEIHFSILREVITLPGQQEKCFVCGQV  271 (740)
Q Consensus       192 l~VI~Sds~vPGEGEHKIm~fIR~qr~~p~ydpn~~H~IyG~DADLImL~Lathe~~f~ILRE~v~~~~~~~~c~~c~~~  271 (740)
                      +.+|.    -|||+|.=.--.-|+         +..+.|++.|+|+++.|--      .|+|...  ......|      
T Consensus         5 v~~i~----AP~EAeAq~A~L~~~---------g~vd~V~t~DsD~l~fG~~------~vi~~~~--~~~~~~~------   57 (94)
T PF00867_consen    5 VPYIV----APYEAEAQCAYLERN---------GLVDAVITEDSDLLLFGAP------KVIRKLS--DKSSGKC------   57 (94)
T ss_dssp             -EEEE-----SS-HHHHHHHHHHT---------TSSSEEE-SSSHHHHTT-S------EEEESST---CSCCST------
T ss_pred             CeEEE----cCchHHHHHHHHHHh---------cceeEEEecCCCEEeeCCC------EEEEecc--ccccCCc------
Confidence            45555    589999988866543         4578999999999999754      5666531  0000000      


Q ss_pred             CcccccccCCCCCCCCCCCCCCCCcccccceEEEehHHHHHHHHhhhCCCCCCCCCchhhhHHHHHHHhhhhcCC
Q 004647          272 GHLAAECHGKPGDNPADWNGVDDTPIHKKKYQFLNIWVLREYLQYELDIPNPPFPINFERIVDDFVFLCFFVGND  346 (740)
Q Consensus       272 ~h~~~~c~~~~~~~~~~~~~~~~~~~~~~~f~~l~i~~LREyL~~e~~~~~~~~~~d~eriIDDfVfLcf~vGND  346 (740)
                                             .......+.+++...+.+.+..  .             -+.|+.+|+|+|+|
T Consensus        58 -----------------------~~~~~~~~~~~~~~~i~~~l~l--~-------------~~~fi~~~iL~G~D   94 (94)
T PF00867_consen   58 -----------------------SSKSEKEVEVIDLDDILKELGL--T-------------REQFIDLCILCGCD   94 (94)
T ss_dssp             -----------------------S-CCESEEEEEEHHHHHHHHTT--S-------------HHHHHHHHHHHHET
T ss_pred             -----------------------ccccccceEEEEHHHHHHHcCC--C-------------HHHHHHHheecCCC
Confidence                                   0012345788888777666532  1             24799999999998


No 14 
>PRK14976 5'-3' exonuclease; Provisional
Probab=93.20  E-value=3.4  Score=44.70  Aligned_cols=57  Identities=18%  Similarity=0.243  Sum_probs=38.4

Q ss_pred             eEEeeccccccccccCCC--CCCC----CCHHHHHHHHHHHHHHHHhhcccceeEEEeecCCCc
Q 004647           51 NLYLDMNGIIHPCFHPDG--KPAP----TSYDDVFKSIFDYIDHIFLLVRPRKLLYLAIDGVAP  108 (740)
Q Consensus        51 nLYlDmNgIIH~c~h~~~--~~~p----~te~e~~~~If~yid~lv~~vrPrkllyiAiDGVAP  108 (740)
                      =|.||.|++|+.++|...  .+..    -.......-+++.+-+++...+|..+ -+|+||-.|
T Consensus         5 ~lliDg~~~~~ra~~a~~~~~~~l~~~~G~~t~a~~gf~~~l~~ll~~~~p~~~-~v~fD~~~~   67 (281)
T PRK14976          5 ALLIDGNSLIFRSYYATLKQGPKLKNNKGLPTNAIHTFLTMIFKILKKLNPSYI-LIAFDAGRK   67 (281)
T ss_pred             EEEEeCcHHHHHHHHccCccCCCccCCCCCCchHHHHHHHHHHHHHHhcCCCEE-EEEEECCCC
Confidence            368999999999888741  1110    11123445567777788888899886 689998544


No 15 
>PF00098 zf-CCHC:  Zinc knuckle;  InterPro: IPR001878 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the CysCysHisCys (CCHC) type zinc finger domains, and have the sequence:  C-X2-C-X4-H-X4-C  where X can be any amino acid, and number indicates the number of residues. These 18 residues CCHC zinc finger domains are mainly found in the nucleocapsid protein of retroviruses. It is required for viral genome packaging and for early infection process [, , ]. It is also found in eukaryotic proteins involved in RNA binding or single-stranded DNA binding []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding; PDB: 2L44_A 1A1T_A 1WWG_A 1U6P_A 1WWD_A 1WWE_A 1A6B_B 1F6U_A 1MFS_A 1NCP_C ....
Probab=89.43  E-value=0.21  Score=31.99  Aligned_cols=16  Identities=56%  Similarity=1.410  Sum_probs=14.6

Q ss_pred             ccccccccCccccccc
Q 004647          264 KCFVCGQVGHLAAECH  279 (740)
Q Consensus       264 ~c~~c~~~~h~~~~c~  279 (740)
                      .|+.||+.||++.+|.
T Consensus         2 ~C~~C~~~GH~~~~Cp   17 (18)
T PF00098_consen    2 KCFNCGEPGHIARDCP   17 (18)
T ss_dssp             BCTTTSCSSSCGCTSS
T ss_pred             cCcCCCCcCcccccCc
Confidence            6999999999999985


No 16 
>KOG2518 consensus 5'-3' exonuclease [Replication, recombination and repair]
Probab=89.09  E-value=3.3  Score=48.07  Aligned_cols=91  Identities=22%  Similarity=0.342  Sum_probs=51.5

Q ss_pred             CccchHHHHHHhhCCCcccccccCCCccCCCCcccCCCCCCCCCCCcccCeEEeeccccccc----cccCCCCCCCCCHH
Q 004647            1 MGVPAFYRWLADRYPLSIVDVVEEDPQVDGEGVARPVDVSKPNPNGMEFDNLYLDMNGIIHP----CFHPDGKPAPTSYD   76 (740)
Q Consensus         1 MGVP~ffrWL~~rYP~i~~~~~e~~~~~~~~g~~~p~d~s~pn~n~~e~DnLYlDmNgIIH~----c~h~~~~~~p~te~   76 (740)
                      ||+++|+-.+..    +.+++                ..+     ..+.+-|=+|.-+-+|.    |.+.-....|+  +
T Consensus         1 MGI~GLlp~~k~----~~~~~----------------hi~-----~~~g~tvavD~y~WLhrg~~~Ca~el~~~~pT--~   53 (556)
T KOG2518|consen    1 MGIQGLLPLLKP----ALKPI----------------HIS-----EYKGKTVAVDGYCWLHRGALACAEKLAKGKPT--D   53 (556)
T ss_pred             CCcchhHHHHHH----Hhhhh----------------hHH-----HhcCceEEEehhhHHhhhHHhHHHHHhcCCCh--H
Confidence            999999988876    22211                000     01346678888888885    33322222222  3


Q ss_pred             HHHHHHHHHHHHHHh-hcccceeEEEeecCCCchhhhHHHHHhhhhh
Q 004647           77 DVFKSIFDYIDHIFL-LVRPRKLLYLAIDGVAPRAKMNQQRTRRFRA  122 (740)
Q Consensus        77 e~~~~If~yid~lv~-~vrPrkllyiAiDGVAPrAKmnQQRsRRfrs  122 (740)
                      .-++=...++..|.. -|+|    +|.+||=+=-+|--+-|.||-+.
T Consensus        54 ryi~y~ik~v~lL~~~gikP----ilVFDG~~LP~K~~te~~Rr~~R   96 (556)
T KOG2518|consen   54 RYIQFFIKRVKLLLSYGIKP----ILVFDGDPLPSKKETERKRRERR   96 (556)
T ss_pred             HHHHHHHHHHHHHHhcCCeE----EEEecCCCcccccccchHHHHHH
Confidence            333333334443332 3455    79999988777877776666544


No 17 
>KOG2519 consensus 5'-3' exonuclease [Replication, recombination and repair]
Probab=88.07  E-value=16  Score=41.88  Aligned_cols=34  Identities=24%  Similarity=0.476  Sum_probs=23.2

Q ss_pred             HHHHHHhhhhcCCCCCCCCcccccchhHHHHHHHHH
Q 004647          334 DDFVFLCFFVGNDFLPHMPTLEIREGAINLLMHVYR  369 (740)
Q Consensus       334 DDfVfLcf~vGNDFLPhlPsl~I~egaid~Li~~Yk  369 (740)
                      .-||-||+|+|+||.|.+-+  |..+.-=.|++.|+
T Consensus       217 ~~fidL~lLlGCDYc~~I~G--ig~~~al~lir~~~  250 (449)
T KOG2519|consen  217 ESFIDLCLLLGCDYCPTIRG--IGPKKALKLIRQHG  250 (449)
T ss_pred             HHHHHHHHHhcCcccccccc--cChHHHHHHHHHhc
Confidence            46788999999999999754  44333333555554


No 18 
>COG0258 Exo 5'-3' exonuclease (including N-terminal domain of PolI) [DNA replication, recombination, and repair]
Probab=87.73  E-value=12  Score=41.01  Aligned_cols=62  Identities=18%  Similarity=0.354  Sum_probs=44.2

Q ss_pred             eEEeeccccccccccCCCC------CCCCCHHHHHHHHHHHHHHHHhhcccceeEEEeecCCCc--hhhhHHHH
Q 004647           51 NLYLDMNGIIHPCFHPDGK------PAPTSYDDVFKSIFDYIDHIFLLVRPRKLLYLAIDGVAP--RAKMNQQR  116 (740)
Q Consensus        51 nLYlDmNgIIH~c~h~~~~------~~p~te~e~~~~If~yid~lv~~vrPrkllyiAiDGVAP--rAKmnQQR  116 (740)
                      -+-||.+++++.+.|....      ..+++   ...-+...+.++++..+|.+ ..+++||-+|  |.++...|
T Consensus        13 l~~IDg~~~lyr~~~a~~~~~~~~~g~~~~---~~~~~~~~l~~~~~~~~~~~-~~~vFD~~~~tfR~~~~~~y   82 (310)
T COG0258          13 LLLIDGSSLLYRALHALPQPLGNPLGDPTG---AVSGFLGMLYRLIRLLEPTH-PVVVFDGKPPTFRHELLEEY   82 (310)
T ss_pred             EEEEechHHHHHHHHhcchhcCCCCCCCcc---HHHHHHHHHHHHHHhcCCCc-EEEEEcCCCCcchHHHHHHH
Confidence            4789999999999886521      12333   44556677789999999966 4699999777  55555444


No 19 
>PRK05755 DNA polymerase I; Provisional
Probab=83.94  E-value=23  Score=44.40  Aligned_cols=55  Identities=11%  Similarity=0.155  Sum_probs=35.8

Q ss_pred             EEeeccccccccccCCCCC---CCCCHHHHHHHHHHHHHHHHhhcccceeEEEeecCCC
Q 004647           52 LYLDMNGIIHPCFHPDGKP---APTSYDDVFKSIFDYIDHIFLLVRPRKLLYLAIDGVA  107 (740)
Q Consensus        52 LYlDmNgIIH~c~h~~~~~---~p~te~e~~~~If~yid~lv~~vrPrkllyiAiDGVA  107 (740)
                      |.||.|.+++.+.|.....   ..-........+++.+-+++...+|..+ .+|+||-.
T Consensus         5 ~liDg~~~~~r~~~a~~~~~~~~~g~~~~a~~g~~~~l~~~~~~~~p~~~-~v~fD~~~   62 (880)
T PRK05755          5 LLIDGSSLLFRAFYALLPTLRNSDGLPTGAVYGFLNMLLKLLKEEKPTHV-AVAFDAKG   62 (880)
T ss_pred             EEEeCcHHHHHHHHCCCCcccCCCCCcccHHHHHHHHHHHHHHhcCCCEE-EEEEECCC
Confidence            6899999999988864110   0111123344455666677777899875 69999843


No 20 
>TIGR00593 pola DNA polymerase I. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=81.24  E-value=52  Score=41.37  Aligned_cols=56  Identities=13%  Similarity=0.204  Sum_probs=34.5

Q ss_pred             EEeeccccccccccCCCC-CC---CCCHHHHHHHHHHHHHHHHhhcccceeEEEeecCCCc
Q 004647           52 LYLDMNGIIHPCFHPDGK-PA---PTSYDDVFKSIFDYIDHIFLLVRPRKLLYLAIDGVAP  108 (740)
Q Consensus        52 LYlDmNgIIH~c~h~~~~-~~---p~te~e~~~~If~yid~lv~~vrPrkllyiAiDGVAP  108 (740)
                      +.||.|+++|-++|.-.. +-   .-.......-+++.+-+++...+|..+ .+|+||-.|
T Consensus         2 ~lIDg~~l~~Ra~~a~~~~~l~~~~G~~t~av~Gf~~~l~~ll~~~~p~~i-~v~FD~~~~   61 (887)
T TIGR00593         2 LLIDGHSLAFRAYFALKNKPLTNSKGEPTNAVYGFTKMLLKLLKEEKPTYV-AVAFDSGTP   61 (887)
T ss_pred             EEEeCcHHHHHHHHCCCcccCcCCCCCEecHHHHHHHHHHHHHHhcCCCEE-EEEEcCCCC
Confidence            579999999998885421 00   001112333345555566666799875 699998654


No 21 
>TIGR00600 rad2 DNA excision repair protein (rad2). All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=75.04  E-value=4.8  Score=50.74  Aligned_cols=39  Identities=18%  Similarity=0.126  Sum_probs=30.4

Q ss_pred             ccEEEEcCCCCCCChhhHHHHHHHHhhcCCCCCCCCcEEEEecChhHHHHHh
Q 004647          191 FTKVILSDANVPGEGEHKIMSYIRLQRNLPGFDPNTRHCLYGLDADLIMLSL  242 (740)
Q Consensus       191 ~l~VI~Sds~vPGEGEHKIm~fIR~qr~~p~ydpn~~H~IyG~DADLImL~L  242 (740)
                      ++.+|.+    |||||.=+-..-+.         .....|++-|+|+++.|=
T Consensus       785 GIP~i~A----P~EAEAqcA~L~~~---------G~vd~V~TeDsD~llFGa  823 (1034)
T TIGR00600       785 GIPYIVA----PMEAEAQCAILDLL---------DQTSGTITDDSDIWLFGA  823 (1034)
T ss_pred             CCCeeeC----CccHHHHHHHHHhC---------CCeEEEEccccceeccCC
Confidence            4667753    89999988877542         567899999999997764


No 22 
>TIGR00600 rad2 DNA excision repair protein (rad2). All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=69.22  E-value=20  Score=45.54  Aligned_cols=65  Identities=22%  Similarity=0.356  Sum_probs=37.0

Q ss_pred             eEEeeccccccccc---cCCCCCCCCCHHHHHHHHHHHHHHHHh-hcccceeEEEeecCCCchhhhHHHHHhhhh
Q 004647           51 NLYLDMNGIIHPCF---HPDGKPAPTSYDDVFKSIFDYIDHIFL-LVRPRKLLYLAIDGVAPRAKMNQQRTRRFR  121 (740)
Q Consensus        51 nLYlDmNgIIH~c~---h~~~~~~p~te~e~~~~If~yid~lv~-~vrPrkllyiAiDGVAPrAKmnQQRsRRfr  121 (740)
                      -|=||...-||.+.   +........+ .. +..+|..|.+|+. -|+|    ++.+||.+|-.|...-..||-|
T Consensus        26 ~vAIDasiWL~q~l~~vr~~~g~~l~n-~h-l~g~f~Ri~~Ll~~gI~P----VfVFDG~~p~lK~~t~~~R~~r   94 (1034)
T TIGR00600        26 RLAVDISIWLNQALKGVRDREGNAIKN-SH-LLTLFHRLCKLLFFRIRP----IFVFDGGAPLLKRQTLAKRRQR   94 (1034)
T ss_pred             EEEechHHHHHHHHHHHHhccCCccCC-HH-HHHHHHHHHHHHHCCCeE----EEEECCCCchHhHHHHHHHHHH
Confidence            35667777777542   2222212222 23 2445555556553 4555    6789999999888765555443


No 23 
>PF13696 zf-CCHC_2:  Zinc knuckle
Probab=67.07  E-value=2.3  Score=31.41  Aligned_cols=20  Identities=35%  Similarity=0.763  Sum_probs=17.0

Q ss_pred             CCcccccccccCcccccccC
Q 004647          261 QQEKCFVCGQVGHLAAECHG  280 (740)
Q Consensus       261 ~~~~c~~c~~~~h~~~~c~~  280 (740)
                      ....|.+|++.||+..+|-.
T Consensus         7 ~~Y~C~~C~~~GH~i~dCP~   26 (32)
T PF13696_consen    7 PGYVCHRCGQKGHWIQDCPT   26 (32)
T ss_pred             CCCEeecCCCCCccHhHCCC
Confidence            34679999999999999964


No 24 
>cd00080 HhH2_motif Helix-hairpin-helix class 2 (Pol1 family) motif. HhH2 domains are found in Rad2 family of prokaryotic and eukaryotic replication and repair nucleases, i.e., DNA polymerase I, Taq DNA polymerase, DNA repair protein Rad2 endonuclease, flap endonuclease, exonuclease I and IX, 5'-3' exonuclease and also bacteriophage Rnase H. These nucleases degrade RNA-DNA or DNA-DNA duplexes, or both and play essential roles in DNA duplication, repair, and recombination.
Probab=64.98  E-value=7.5  Score=33.75  Aligned_cols=22  Identities=32%  Similarity=0.732  Sum_probs=18.7

Q ss_pred             HHHHHHhhhhc--CCCCCCCCccc
Q 004647          334 DDFVFLCFFVG--NDFLPHMPTLE  355 (740)
Q Consensus       334 DDfVfLcf~vG--NDFLPhlPsl~  355 (740)
                      +.|+-+|.|+|  .|++|++|++-
T Consensus         8 ~q~~d~~~L~GD~~D~i~gv~giG   31 (75)
T cd00080           8 EQFIDLAILVGDKSDNIPGVPGIG   31 (75)
T ss_pred             HHHHHHHHHcCCccccCCCCCccc
Confidence            46777999999  99999999843


No 25 
>PF13917 zf-CCHC_3:  Zinc knuckle
Probab=44.50  E-value=12  Score=29.37  Aligned_cols=19  Identities=42%  Similarity=0.935  Sum_probs=16.8

Q ss_pred             CcccccccccCcccccccC
Q 004647          262 QEKCFVCGQVGHLAAECHG  280 (740)
Q Consensus       262 ~~~c~~c~~~~h~~~~c~~  280 (740)
                      ...|..|++.||...+|..
T Consensus         4 ~~~CqkC~~~GH~tyeC~~   22 (42)
T PF13917_consen    4 RVRCQKCGQKGHWTYECPN   22 (42)
T ss_pred             CCcCcccCCCCcchhhCCC
Confidence            4579999999999999984


No 26 
>smart00484 XPGI Xeroderma pigmentosum G I-region. domain in nucleases
Probab=44.06  E-value=13  Score=32.31  Aligned_cols=34  Identities=18%  Similarity=0.257  Sum_probs=27.7

Q ss_pred             CCCChhhHHHHHHHHhhcCCCCCCCCcEEEEecChhHHHHHhh
Q 004647          201 VPGEGEHKIMSYIRLQRNLPGFDPNTRHCLYGLDADLIMLSLA  243 (740)
Q Consensus       201 vPGEGEHKIm~fIR~qr~~p~ydpn~~H~IyG~DADLImL~La  243 (740)
                      -|||+|.-.--.-++         ..-+.|+|.|+|+++.|--
T Consensus        10 AP~eAeAq~A~L~~~---------g~vdav~s~D~D~llfG~~   43 (73)
T smart00484       10 APYEAEAQCAYLAKS---------GLVDAIITEDSDLLLFGAP   43 (73)
T ss_pred             cCCcHHHHHHHHHhC---------CCeeEEEcCccceEecCCc
Confidence            589999988777652         4678999999999998763


No 27 
>smart00279 HhH2 Helix-hairpin-helix class 2 (Pol1 family) motifs.
Probab=38.68  E-value=26  Score=26.39  Aligned_cols=19  Identities=21%  Similarity=0.597  Sum_probs=15.2

Q ss_pred             HHHHHhhhhcCCCCCCCCcc
Q 004647          335 DFVFLCFFVGNDFLPHMPTL  354 (740)
Q Consensus       335 DfVfLcf~vGNDFLPhlPsl  354 (740)
                      -|+-+|.|+| |+.+.+|++
T Consensus         3 q~~~~~~L~G-D~~dni~Gv   21 (36)
T smart00279        3 QLIDYAILVG-DYSDNIPGV   21 (36)
T ss_pred             HHHHHHHHhC-cCCCCCCCC
Confidence            5788999999 999955553


No 28 
>PF02739 5_3_exonuc_N:  5'-3' exonuclease, N-terminal resolvase-like domain;  InterPro: IPR020046 The N-terminal and internal 5'3'-exonuclease domains are commonly found together, and are most often associated with 5' to 3' nuclease activities. The XPG protein signatures (PDOC00658 from PROSITEDOC) are never found outside the '53EXO' domains. The latter are found in more diverse proteins [, , ]. The number of amino acids that separate the two 53EXO domains, and the presence of accompanying motifs allow the diagnosis of several protein families.  In the eubacterial type A DNA-polymerases, the N-terminal and internal domains are separated by a few amino acids, usually four. The pattern DNA_POLYMERASE_A (IPR001098 from INTERPRO) is always present towards the C terminus. Several eukaryotic structure-dependent endonucleases and exonucleases have the 53EXO domains separated by 24 to 27 amino acids, and the XPG protein signatures are always present. In several proteins from herpesviridae, the two 53EXO domains are separated by 50 to 120 amino acids. These proteins are implicated in the inhibition of the expression of the host genes. Eukaryotic DNA repair proteins with 600 to 700 amino acids between the 53_EXO domains all carry the XPG protein signatures.  This entry represents the N-terminal resolvase-like domain, which has a 3-layer alpha/beta/alpha core structure and contains an alpha-helical arch [, ].; GO: 0003677 DNA binding, 0008409 5'-3' exonuclease activity; PDB: 1TAQ_A 1BGX_T 1TAU_A 1XO1_B 1EXN_A 1UT8_A 1UT5_B 3H7I_A 3H8J_A 3H8S_A ....
Probab=38.62  E-value=29  Score=34.73  Aligned_cols=56  Identities=14%  Similarity=0.243  Sum_probs=39.4

Q ss_pred             EEeeccccccccccCCCCCC----CCCHHHHHHHHHHHHHHHHhhcccceeEEEeecCCCc
Q 004647           52 LYLDMNGIIHPCFHPDGKPA----PTSYDDVFKSIFDYIDHIFLLVRPRKLLYLAIDGVAP  108 (740)
Q Consensus        52 LYlDmNgIIH~c~h~~~~~~----p~te~e~~~~If~yid~lv~~vrPrkllyiAiDGVAP  108 (740)
                      |.||.|+++|.+.|.-....    .-..-......++.|.+++...+|.. +.+|+||-.+
T Consensus         4 lLIDg~~l~~Ra~~a~~~~~l~~~~G~~t~ai~g~~~~l~~l~~~~~p~~-~vv~fD~~~~   63 (169)
T PF02739_consen    4 LLIDGNSLLFRAYYALPKDPLRNSDGEPTNAIYGFLRMLLKLLKDFKPDY-VVVAFDSKGP   63 (169)
T ss_dssp             EEEEHHHHHHHCCCCCTTST-BETTSEB-HHHHHHHHHHHHHHHHTTEEE-EEEEEEBSSC
T ss_pred             EEEechHHHHHHHHhhccCCCcCCCCCChHHHHHHHHHHHHHHHHcCCce-EEEEecCCCc
Confidence            68999999999988654211    00111344556777778888889987 4699999987


No 29 
>smart00343 ZnF_C2HC zinc finger.
Probab=36.58  E-value=16  Score=25.00  Aligned_cols=16  Identities=50%  Similarity=1.364  Sum_probs=14.1

Q ss_pred             ccccccccCccccccc
Q 004647          264 KCFVCGQVGHLAAECH  279 (740)
Q Consensus       264 ~c~~c~~~~h~~~~c~  279 (740)
                      .|..|++.||.+.+|.
T Consensus         1 ~C~~CG~~GH~~~~C~   16 (26)
T smart00343        1 KCYNCGKEGHIARDCP   16 (26)
T ss_pred             CCccCCCCCcchhhCC
Confidence            4889999999999885


No 30 
>PHA02567 rnh RnaseH; Provisional
Probab=26.51  E-value=69  Score=35.33  Aligned_cols=57  Identities=18%  Similarity=0.121  Sum_probs=41.7

Q ss_pred             CeEEeeccccccccccCCCCCC-CCCHHHHHHHHHHHHHHHHhhcccce-eEEEeecCC
Q 004647           50 DNLYLDMNGIIHPCFHPDGKPA-PTSYDDVFKSIFDYIDHIFLLVRPRK-LLYLAIDGV  106 (740)
Q Consensus        50 DnLYlDmNgIIH~c~h~~~~~~-p~te~e~~~~If~yid~lv~~vrPrk-llyiAiDGV  106 (740)
                      +-+.||++.|+-.|++.+-.+. ..++..+...+++-|-.++..++|.- -+.+|+|+-
T Consensus        15 ~~~LiDgs~i~~~~~~a~l~~~~~~~~~~ir~~v~nsL~~~v~~~k~~~~~i~vaFD~~   73 (304)
T PHA02567         15 GVNLIDFSQIIIATIMANFKPKDKINEAMVRHLVLNSIRYNVKKFKEEYPEIVLAFDNS   73 (304)
T ss_pred             CEEEEehHHHHHHHHHhhCCCCCCCcHHHHHHHHHHHHHHHHHHhcCCCCeEEEEEeCC
Confidence            5689999999999999875444 33444444558888888888877662 167999974


No 31 
>PF14392 zf-CCHC_4:  Zinc knuckle
Probab=26.09  E-value=30  Score=27.53  Aligned_cols=20  Identities=40%  Similarity=0.954  Sum_probs=17.0

Q ss_pred             CCCcccccccccCccccccc
Q 004647          260 GQQEKCFVCGQVGHLAAECH  279 (740)
Q Consensus       260 ~~~~~c~~c~~~~h~~~~c~  279 (740)
                      ..+..|+.||..||...+|.
T Consensus        29 ~lp~~C~~C~~~gH~~~~C~   48 (49)
T PF14392_consen   29 RLPRFCFHCGRIGHSDKECP   48 (49)
T ss_pred             CcChhhcCCCCcCcCHhHcC
Confidence            34568999999999999884


No 32 
>PF12513 SUV3_C:  Mitochondrial degradasome RNA helicase subunit C terminal;  InterPro: IPR022192  This domain family is found in bacteria and eukaryotes, and is approximately 50 amino acids in length. The family is found in association with PF00271 from PFAM. The yeast mitochondrial degradosome (mtEXO) is an NTP-dependent exoribonuclease involved in mitochondrial RNA metabolism. mtEXO is made up of two subunits: an RNase (DSS1) and an RNA helicase (SUV3). These co-purify with mitochondrial ribosomes. ; GO: 0016817 hydrolase activity, acting on acid anhydrides; PDB: 3RC8_A 3RC3_A.
Probab=24.73  E-value=40  Score=26.86  Aligned_cols=15  Identities=33%  Similarity=0.786  Sum_probs=11.8

Q ss_pred             hHHHHHHhhCCCccc
Q 004647            5 AFYRWLADRYPLSIV   19 (740)
Q Consensus         5 ~ffrWL~~rYP~i~~   19 (740)
                      ..|-||+.|||.+..
T Consensus        12 ~lYlWLs~Rfp~~F~   26 (49)
T PF12513_consen   12 DLYLWLSYRFPDVFP   26 (49)
T ss_dssp             HHHHHHHCC-TTTST
T ss_pred             HHHHHHHHHcccccC
Confidence            479999999999764


No 33 
>COG5350 Predicted protein tyrosine phosphatase [General function prediction only]
Probab=22.12  E-value=99  Score=31.07  Aligned_cols=42  Identities=24%  Similarity=0.559  Sum_probs=34.7

Q ss_pred             cccEEEEcCCCCCCCh--------hhHHHHHHHHhhcCCCCCCCCcEEEEecC
Q 004647          190 QFTKVILSDANVPGEG--------EHKIMSYIRLQRNLPGFDPNTRHCLYGLD  234 (740)
Q Consensus       190 ~~l~VI~Sds~vPGEG--------EHKIm~fIR~qr~~p~ydpn~~H~IyG~D  234 (740)
                      +.+.+.+.|-.+||+|        =++|+||++   ..|.+.|=--||.-|.-
T Consensus        56 rhL~l~fnDI~~~~~g~~ap~e~Hv~~i~DF~~---~wp~~apllIHC~aGIS  105 (172)
T COG5350          56 RHLTLHFNDIAEPDDGWIAPGEAHVRAIIDFAD---EWPRFAPLLIHCYAGIS  105 (172)
T ss_pred             hceeEeeccccCCCccccCCCHHHHHHHHHHHh---cCccccceeeeeccccc
Confidence            4689999999999999        368999998   46777777888887753


No 34 
>PF14787 zf-CCHC_5:  GAG-polyprotein viral zinc-finger; PDB: 1CL4_A 1DSV_A.
Probab=22.04  E-value=44  Score=25.44  Aligned_cols=20  Identities=40%  Similarity=0.702  Sum_probs=12.6

Q ss_pred             cccccccccCcccccccCCC
Q 004647          263 EKCFVCGQVGHLAAECHGKP  282 (740)
Q Consensus       263 ~~c~~c~~~~h~~~~c~~~~  282 (740)
                      ..|..|++-.|-+.+|+.+.
T Consensus         3 ~~CprC~kg~Hwa~~C~sk~   22 (36)
T PF14787_consen    3 GLCPRCGKGFHWASECRSKT   22 (36)
T ss_dssp             -C-TTTSSSCS-TTT---TC
T ss_pred             ccCcccCCCcchhhhhhhhh
Confidence            36999999999999998864


No 35 
>PF15288 zf-CCHC_6:  Zinc knuckle
Probab=21.56  E-value=43  Score=26.09  Aligned_cols=13  Identities=46%  Similarity=1.027  Sum_probs=11.4

Q ss_pred             cccccccccCccc
Q 004647          263 EKCFVCGQVGHLA  275 (740)
Q Consensus       263 ~~c~~c~~~~h~~  275 (740)
                      .+|..||+.||.+
T Consensus         2 ~kC~~CG~~GH~~   14 (40)
T PF15288_consen    2 VKCKNCGAFGHMR   14 (40)
T ss_pred             ccccccccccccc
Confidence            4799999999985


No 36 
>COG5082 AIR1 Arginine methyltransferase-interacting protein, contains RING Zn-finger [Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion]
Probab=20.71  E-value=42  Score=34.58  Aligned_cols=45  Identities=22%  Similarity=0.527  Sum_probs=28.6

Q ss_pred             CCcEEEEecChhHHHHHhhc--CCceEEEeeccccCCCCCcccccccccCccccccc
Q 004647          225 NTRHCLYGLDADLIMLSLAT--HEIHFSILREVITLPGQQEKCFVCGQVGHLAAECH  279 (740)
Q Consensus       225 n~~H~IyG~DADLImL~Lat--he~~f~ILRE~v~~~~~~~~c~~c~~~~h~~~~c~  279 (740)
                      +.-|..+-..++|=... +.  |..+=.         .+..+|+.||+.||++.+|.
T Consensus        68 ~~GH~~~DCP~~iC~~C-~~~~H~s~~C---------~~~~~C~~Cg~~GH~~~dC~  114 (190)
T COG5082          68 QNGHLRRDCPHSICYNC-SWDGHRSNHC---------PKPKKCYNCGETGHLSRDCN  114 (190)
T ss_pred             ccCcccccCChhHhhhc-CCCCcccccC---------CcccccccccccCccccccC
Confidence            35677777776555444 21  311110         12368999999999999994


No 37 
>PRK09482 flap endonuclease-like protein; Provisional
Probab=20.46  E-value=1.5e+02  Score=31.88  Aligned_cols=55  Identities=16%  Similarity=0.229  Sum_probs=39.0

Q ss_pred             eEEeeccccccccccCCCCCCCCCHHHHHHHHHHHHHHHHhhcccceeEEEeecCCCc
Q 004647           51 NLYLDMNGIIHPCFHPDGKPAPTSYDDVFKSIFDYIDHIFLLVRPRKLLYLAIDGVAP  108 (740)
Q Consensus        51 nLYlDmNgIIH~c~h~~~~~~p~te~e~~~~If~yid~lv~~vrPrkllyiAiDGVAP  108 (740)
                      =|.||-++++|-.+|....  +.....+....++.+.+|+...+|..+ .+|.||-++
T Consensus         5 llLiDg~~l~~R~~~a~~~--~~g~t~av~gf~~~l~~ll~~~~p~~i-~v~fD~~~~   59 (256)
T PRK09482          5 LLIIDALNLIRRIHAVQPS--PNDINACVETCQHALDKLIRHSQPTHA-VAVFDGDAR   59 (256)
T ss_pred             EEEEeCcHHHHHHHhCCCC--CCCcchHHHHHHHHHHHHHHHcCCCEE-EEEEeCCCC
Confidence            3789999999998776421  122234455566777788888899886 599999654


Done!