Query         psy16749
Match_columns 821
No_of_seqs    296 out of 1655
Neff          5.4 
Searched_HMMs 46136
Date          Fri Aug 16 21:00:16 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy16749.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16749hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 COG0328 RnhA Ribonuclease HI [  99.8 1.3E-20 2.8E-25  184.9  13.8  130  675-806     3-146 (154)
  2 PF00075 RNase_H:  RNase H;  In  99.8 6.1E-21 1.3E-25  180.0   8.4  125  674-804     2-132 (132)
  3 PRK06548 ribonuclease H; Provi  99.8 2.7E-20 5.8E-25  184.8  13.0  129  674-807     4-144 (161)
  4 PRK08719 ribonuclease H; Revie  99.8 3.7E-20 8.1E-25  181.5  12.0  126  673-804     2-146 (147)
  5 PRK00203 rnhA ribonuclease H;   99.8 5.4E-20 1.2E-24  180.3  11.7  130  675-808     3-145 (150)
  6 cd06222 RnaseH RNase H (RNase   99.7 1.8E-16 3.8E-21  143.9  12.8  125  677-803     1-130 (130)
  7 KOG3752|consensus               99.7 1.5E-16 3.3E-21  173.0  13.4  131  675-805   212-365 (371)
  8 PRK13907 rnhA ribonuclease H;   99.6 4.6E-15   1E-19  141.0  12.6  122  676-805     2-127 (128)
  9 PRK07708 hypothetical protein;  99.5 1.4E-13 2.9E-18  143.5  13.9  130  670-807    68-210 (219)
 10 PRK07238 bifunctional RNase H/  99.3 4.9E-12 1.1E-16  141.0  13.0  123  675-807     2-134 (372)
 11 PF13456 RVT_3:  Reverse transc  99.0 2.2E-09 4.7E-14   93.8  10.4   85  713-804     2-86  (87)
 12 PF09004 DUF1891:  Domain of un  89.5   0.051 1.1E-06   43.0  -1.7   41  467-509     1-41  (42)
 13 PHA03378 EBNA-3B; Provisional   88.2     6.1 0.00013   47.6  12.9   24  156-179   510-533 (991)
 14 PHA03378 EBNA-3B; Provisional   87.9     6.4 0.00014   47.5  12.8   15   36-50    476-490 (991)
 15 PRK14950 DNA polymerase III su  81.4     7.5 0.00016   46.9  10.1    8  525-532   511-518 (585)
 16 PF05380 Peptidase_A17:  Pao re  79.9     3.7   8E-05   41.0   5.8   54  674-729    80-143 (159)
 17 PRK14900 valS valyl-tRNA synth  79.6    0.61 1.3E-05   59.7   0.1   15  135-149   907-921 (1052)
 18 PRK14900 valS valyl-tRNA synth  69.6     7.5 0.00016   50.1   6.1   10   89-98    877-886 (1052)
 19 COG3341 Predicted double-stran  68.7      11 0.00024   39.9   6.0  139  675-819    65-215 (225)
 20 PRK14950 DNA polymerase III su  66.3      28 0.00061   42.1   9.8    8  477-484   480-487 (585)
 21 PHA03269 envelope glycoprotein  58.7      31 0.00068   40.6   7.8   17  504-522   216-232 (566)
 22 PF07174 FAP:  Fibronectin-atta  56.1      79  0.0017   34.8   9.8   51  675-725   223-283 (297)
 23 KOG1924|consensus               44.5 1.2E+02  0.0026   38.0   9.7   28  526-554   692-719 (1102)
 24 PF00336 DNA_pol_viral_C:  DNA   42.2      24 0.00053   37.2   3.3   63  673-746    92-154 (245)
 25 COG3889 Predicted solute bindi  40.9      17 0.00038   44.7   2.3   21  417-437   830-850 (872)
 26 PRK07764 DNA polymerase III su  40.5 1.2E+02  0.0026   38.5   9.5    8  536-543   572-579 (824)
 27 COG3147 DedD Uncharacterized p  39.9 1.2E+02  0.0026   32.4   7.9    6  172-177    67-72  (226)
 28 PRK12270 kgd alpha-ketoglutara  33.5 1.3E+02  0.0029   38.7   8.2   12  511-522   421-432 (1228)
 29 KOG1812|consensus               31.0      91   0.002   36.0   6.0   37  710-746    47-83  (384)
 30 KOG1924|consensus               30.7   3E+02  0.0065   34.7  10.2    9  105-113   428-436 (1102)
 31 PRK12270 kgd alpha-ketoglutara  30.5 1.7E+02  0.0036   38.0   8.3    9  734-742   650-658 (1228)
 32 KOG0559|consensus               29.2 1.8E+02  0.0039   33.4   7.6   13  249-261   175-187 (457)
 33 PTZ00436 60S ribosomal protein  28.9 2.3E+02  0.0049   31.8   8.1    6  108-113   154-159 (357)
 34 PF15324 TALPID3:  Hedgehog sig  23.3 1.3E+03   0.028   30.5  13.9   10  441-450  1133-1142(1252)
 35 COG0296 GlgB 1,4-alpha-glucan   21.4      92   0.002   38.2   3.9   77  717-795   164-247 (628)
 36 PF07462 MSP1_C:  Merozoite sur  21.3 1.9E+02  0.0041   34.9   6.1    8  572-579   415-422 (574)

No 1  
>COG0328 RnhA Ribonuclease HI [DNA replication, recombination, and repair]
Probab=99.84  E-value=1.3e-20  Score=184.92  Aligned_cols=130  Identities=22%  Similarity=0.163  Sum_probs=104.2

Q ss_pred             cEEEeccccccC--CCcceEEecCCCc--cceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhh
Q psy16749        675 AICIYTDASKKN--EKVGAAWFCPTYK--SKACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSK  750 (821)
Q Consensus       675 ~i~IYTDGS~~~--g~vGagvv~~~~~--~~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~  750 (821)
                      .+.||+||+|..  |-.|+|+|+..++  ...+..+. .+||+++||+|++.||+++.+.+...|.|+|||++|+++|..
T Consensus         3 ~v~if~DGa~~gNpG~gG~g~vl~~~~~~~~~s~~~~-~tTNNraEl~A~i~AL~~l~~~~~~~v~l~tDS~yv~~~i~~   81 (154)
T COG0328           3 KVEIFTDGACLGNPGPGGWGAVLRYGDGEKELSGGEG-RTTNNRAELRALIEALEALKELGACEVTLYTDSKYVVEGITR   81 (154)
T ss_pred             ceEEEecCccCCCCCCceEEEEEEcCCceEEEeeeee-cccChHHHHHHHHHHHHHHHhcCCceEEEEecHHHHHHHHHH
Confidence            578999999994  5566777665333  23444444 889999999999999999999888999999999999999998


Q ss_pred             cccC----CCC------CCchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcC
Q psy16749        751 NCLN----TTP------THLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHG  806 (821)
Q Consensus       751 ~~~~----~~p------~~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~  806 (821)
                      |..+    +|.      ..+..+++++..++ .....|.+.||+||.|.++||+||+||+.|+...
T Consensus        82 w~~~w~~~~w~~~~~~pvkn~dl~~~~~~~~-~~~~~v~~~WVkgH~g~~~NeraD~LA~~~~~~~  146 (154)
T COG0328          82 WIVKWKKNGWKTADKKPVKNKDLWEELDELL-KRHELVFWEWVKGHAGHPENERADQLAREAARAA  146 (154)
T ss_pred             HHhhccccCccccccCccccHHHHHHHHHHH-hhCCeEEEEEeeCCCCChHHHHHHHHHHHHHHhh
Confidence            7553    333      23445677777666 4556999999999999999999999999998765


No 2  
>PF00075 RNase_H:  RNase H;  InterPro: IPR002156 The RNase H domain is responsible for hydrolysis of the RNA portion of RNA x DNA hybrids, and this activity requires the presence of divalent cations (Mg2+ or Mn2+) that bind its active site. This domain is a part of a large family of homologous RNase H enzymes of which the RNase HI protein from Escherichia coli is the best characterised []. Secondary structure predictions for the enzymes from E. coli, yeast, human liver and diverse retroviruses (such as Rous sarcoma virus and the Foamy viruses) supported, in every case, the five beta-strands (1 to 5) and four or five alpha-helices (A, B/C, D, E) that have been identified by crystallography in the RNase H domain of Human immunodeficiency virus 1 (HIV-1) reverse transcriptase and in E. coli RNase H []. Reverse transcriptase (RT) is a modular enzyme carrying polymerase and ribonuclease H (RNase H) activities in separable domains. Reverse transcriptase (RT) converts the single-stranded RNA genome of a retrovirus into a double-stranded DNA copy for integration into the host genome. This process requires ribonuclease H as well as RNA- and DNA-directed DNA polymerase activities. Retroviral RNase H is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. Bacterial RNase H 3.1.26.4 from EC catalyses endonucleolytic cleavage to 5'-phosphomonoester acting on RNA-DNA hybrids.  The 3D structure of the RNase H domain from diverse bacteria and retroviruses has been solved [, , ]. All have four beta strands and four to five alpha helices. The E. coli RNase H1 protein binds a single Mg2+ ion cofactor in the active site of the enzyme. The divalent cation is bound by the carboxyl groups of four acidic residues, Asp-10, Glu-48, Asp-70, and Asp-134 []. The first three acidic residues are highly conserved in all bacterial and retroviral RNase H sequences. ; GO: 0003676 nucleic acid binding, 0004523 ribonuclease H activity; PDB: 3LP3_B 2KW4_A 3P1G_A 1RIL_A 2RPI_A 4EQJ_G 4EP2_B 3OTY_P 3U3G_D 2ZQB_D ....
Probab=99.83  E-value=6.1e-21  Score=180.03  Aligned_cols=125  Identities=32%  Similarity=0.365  Sum_probs=95.8

Q ss_pred             CcEEEecccccc--CCCcceEEecCCCccceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhc
Q psy16749        674 NAICIYTDASKK--NEKVGAAWFCPTYKSKACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKN  751 (821)
Q Consensus       674 ~~i~IYTDGS~~--~g~vGagvv~~~~~~~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~  751 (821)
                      +.+.||||||+.  .+..|+|++++.+ ....+.++ ..+++.+||.||++||+.+ .  ...+.|||||+++++.|..|
T Consensus         2 ~~~~iytDgS~~~~~~~~~~g~v~~~~-~~~~~~~~-~~s~~~aEl~Ai~~AL~~~-~--~~~v~I~tDS~~v~~~l~~~   76 (132)
T PF00075_consen    2 KAIIIYTDGSCRPNPGKGGAGYVVWGG-RNFSFRLG-GQSNNRAELQAIIEALKAL-E--HRKVTIYTDSQYVLNALNKW   76 (132)
T ss_dssp             TSEEEEEEEEECTTTTEEEEEEEEETT-EEEEEEEE-SECHHHHHHHHHHHHHHTH-S--TSEEEEEES-HHHHHHHHTH
T ss_pred             CcEEEEEeCCccCCCCceEEEEEEECC-eEEEeccc-ccchhhhheehHHHHHHHh-h--cccccccccHHHHHHHHHHh
Confidence            578999999987  3445666655434 34445555 7899999999999999944 2  38999999999999988886


Q ss_pred             ccC-CCCCCc-h-HHHHHHHHHHHhcCCcEEEEEecCCCCC-cccHHHHHHHHHHhh
Q psy16749        752 CLN-TTPTHL-E-LEILSSYKHLQNTCKTVKLAWIKGHEGI-KGNVEVDRLAKYATV  804 (821)
Q Consensus       752 ~~~-~~p~~~-s-~~i~~i~~~L~~~~~~V~~~WVPGHsGI-~GNE~AD~LAK~A~~  804 (821)
                      ..+ +|+... . .++.++++++ ..+..|+|+|||||+|+ .||+.||+|||.|+.
T Consensus        77 ~~~~~~~~~~~~~~i~~~i~~~~-~~~~~v~~~~V~~H~~~~~~N~~aD~lAk~a~~  132 (132)
T PF00075_consen   77 LHGNGWKKTSNGRPIKNEIWELL-SRGIKVRFRWVPGHSGVPQGNERADRLAKEAAQ  132 (132)
T ss_dssp             HHHTTSBSCTSSSBHTHHHHHHH-HHSSEEEEEESSSSSSSHHHHHHHHHHHHHHHH
T ss_pred             ccccccccccccccchhheeecc-ccceEEeeeeccCcCCCchhHHHHHHHHHHhcC
Confidence            543 333222 2 4666777777 45899999999999999 799999999999873


No 3  
>PRK06548 ribonuclease H; Provisional
Probab=99.83  E-value=2.7e-20  Score=184.85  Aligned_cols=129  Identities=24%  Similarity=0.271  Sum_probs=99.1

Q ss_pred             CcEEEeccccccC--CCcceEEecCCCccceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhc
Q psy16749        674 NAICIYTDASKKN--EKVGAAWFCPTYKSKACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKN  751 (821)
Q Consensus       674 ~~i~IYTDGS~~~--g~vGagvv~~~~~~~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~  751 (821)
                      ..++|||||++..  |..|+|+|+.+.. ..... ....||++|||.||+.||+.+.. ....|.|||||++++++|..|
T Consensus         4 ~~~~IytDGa~~gnpg~~G~g~~~~~~~-~~~g~-~~~~TNnraEl~Aii~aL~~~~~-~~~~v~I~TDS~yvi~~i~~W   80 (161)
T PRK06548          4 NEIIAATDGSSLANPGPSGWAWYVDENT-WDSGG-WDIATNNIAELTAVRELLIATRH-TDRPILILSDSKYVINSLTKW   80 (161)
T ss_pred             CEEEEEEeeccCCCCCceEEEEEEeCCc-EEccC-CCCCCHHHHHHHHHHHHHHhhhc-CCceEEEEeChHHHHHHHHHH
Confidence            3589999999994  5678888875322 11222 24579999999999999986643 345799999999999999965


Q ss_pred             cc----CCCCC------CchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcCC
Q psy16749        752 CL----NTTPT------HLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHGE  807 (821)
Q Consensus       752 ~~----~~~p~------~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~~  807 (821)
                      ..    ++|..      .+.++|++|..++.+  ..|+|.||+||+|+.|||.||+||++|+....
T Consensus        81 ~~~Wk~~gWk~s~G~pV~N~dL~~~l~~l~~~--~~v~~~wVkgHsg~~gNe~aD~LA~~aa~~~~  144 (161)
T PRK06548         81 VYSWKMRKWRKADGKPVLNQEIIQEIDSLMEN--RNIRMSWVNAHTGHPLNEAADSLARQAANNFS  144 (161)
T ss_pred             HHHHHHCCCcccCCCccccHHHHHHHHHHHhc--CceEEEEEecCCCCHHHHHHHHHHHHHHHHhc
Confidence            44    44432      345678888777743  47999999999999999999999999986544


No 4  
>PRK08719 ribonuclease H; Reviewed
Probab=99.82  E-value=3.7e-20  Score=181.45  Aligned_cols=126  Identities=20%  Similarity=0.188  Sum_probs=93.6

Q ss_pred             CCcEEEeccccccCC-----CcceEEecCC--Ccc--ceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHH
Q psy16749        673 PNAICIYTDASKKNE-----KVGAAWFCPT--YKS--KACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKS  743 (821)
Q Consensus       673 ~~~i~IYTDGS~~~g-----~vGagvv~~~--~~~--~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqs  743 (821)
                      +..++||||||+..+     ..|+|+++..  +..  ...+.+....+|++|||.||+.||+.+.+.    ..|++||++
T Consensus         2 ~~~~~iYtDGs~~~n~~~~~~~G~G~vv~~~~~~~~~~~~~~~~~~~Tnn~aEl~A~~~aL~~~~~~----~~i~tDS~y   77 (147)
T PRK08719          2 RASYSIYIDGAAPNNQHGCVRGGIGLVVYDEAGEIVDEQSITVNRYTDNAELELLALIEALEYARDG----DVIYSDSDY   77 (147)
T ss_pred             CceEEEEEecccCCCCCCCCCcEEEEEEEeCCCCeeEEEEecCCCCccHHHHHHHHHHHHHHHcCCC----CEEEechHH
Confidence            356899999999853     2478877543  222  234445556799999999999999998753    489999999


Q ss_pred             HHHHHhhcc----cCCCCCC------chHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhh
Q psy16749        744 ACQKLSKNC----LNTTPTH------LELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATV  804 (821)
Q Consensus       744 AL~~L~~~~----~~~~p~~------~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~  804 (821)
                      |+++|+.|.    .++|...      +.++++.+..++.  ...|+|.|||||+|++||+.||+||++|..
T Consensus        78 vi~~i~~~~~~W~~~~w~~s~g~~v~n~dl~~~i~~l~~--~~~i~~~~VkgH~g~~~Ne~aD~lA~~a~~  146 (147)
T PRK08719         78 CVRGFNEWLDTWKQKGWRKSDKKPVANRDLWQQVDELRA--RKYVEVEKVTAHSGIEGNEAADMLAQAAAE  146 (147)
T ss_pred             HHHHHHHHHHHHHhCCcccCCCcccccHHHHHHHHHHhC--CCcEEEEEecCCCCChhHHHHHHHHHHHhh
Confidence            999986543    3445333      2345555554443  467999999999999999999999999875


No 5  
>PRK00203 rnhA ribonuclease H; Reviewed
Probab=99.82  E-value=5.4e-20  Score=180.30  Aligned_cols=130  Identities=22%  Similarity=0.158  Sum_probs=98.7

Q ss_pred             cEEEeccccccC--CCcceEEecCCCccce-eeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhc
Q psy16749        675 AICIYTDASKKN--EKVGAAWFCPTYKSKA-CFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKN  751 (821)
Q Consensus       675 ~i~IYTDGS~~~--g~vGagvv~~~~~~~~-~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~  751 (821)
                      .+.||||||+..  +..|+|+|+...+... ........++++|||+||+.||+++.+  ...+.|||||++++++|..|
T Consensus         3 ~v~iytDGs~~~n~~~~g~g~v~~~~~~~~~~~~~~~~~TN~~aEL~Ai~~AL~~~~~--~~~v~I~tDS~yvi~~i~~w   80 (150)
T PRK00203          3 QVEIYTDGACLGNPGPGGWGAILRYKGHEKELSGGEALTTNNRMELMAAIEALEALKE--PCEVTLYTDSQYVRQGITEW   80 (150)
T ss_pred             eEEEEEEecccCCCCceEEEEEEEECCeeEEEecCCCCCcHHHHHHHHHHHHHHHcCC--CCeEEEEECHHHHHHHHHHH
Confidence            488999999983  5667777764333221 112234678999999999999998865  36799999999999999875


Q ss_pred             cc----CCCCC------CchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcCCC
Q psy16749        752 CL----NTTPT------HLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHGEA  808 (821)
Q Consensus       752 ~~----~~~p~------~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~~~  808 (821)
                      ..    ++|..      .+.++++++..++.  ...|.|.|||||+|++||+.||+|||+|+.....
T Consensus        81 ~~~Wk~~~~~~~~g~~v~n~dl~~~i~~l~~--~~~v~~~wV~~H~~~~~N~~AD~lA~~a~~~~~~  145 (150)
T PRK00203         81 IHGWKKNGWKTADKKPVKNVDLWQRLDAALK--RHQIKWHWVKGHAGHPENERCDELARAGAEEATL  145 (150)
T ss_pred             HHHHHHcCCcccCCCccccHHHHHHHHHHhc--cCceEEEEecCCCCCHHHHHHHHHHHHHHHHhhh
Confidence            43    34432      23456777776663  3789999999999999999999999999876543


No 6  
>cd06222 RnaseH RNase H (RNase HI) is an endonuclease that cleaves the RNA strand of an RNA/DNA hybrid in a not sequence-specific manner. One of the important functions of RNase H is to remove Okazaki fragments during DNA replication.  RNase H knockout mice lack mitochondrial DNA replication and die as embryos. The retroviral reverse transcriptase contains an RNase H domain that plays an important role in converting a single stranded retroviral genomic RNA into a dsDNA for integration into host chromosomes. RNase H inhibitors have been explored as an anti-HIV drug target because RNase H inactivation inhibits reverse transcription.
Probab=99.70  E-value=1.8e-16  Score=143.89  Aligned_cols=125  Identities=28%  Similarity=0.314  Sum_probs=97.5

Q ss_pred             EEeccccccC--CCcceEEecCCCcc--ce-eeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhc
Q psy16749        677 CIYTDASKKN--EKVGAAWFCPTYKS--KA-CFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKN  751 (821)
Q Consensus       677 ~IYTDGS~~~--g~vGagvv~~~~~~--~~-~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~  751 (821)
                      .+|+|||...  ++.|+|+++.+.+.  .. ........+++.+|+.||+.||+++...+...+.|++||+.+++.++++
T Consensus         1 ~~~~Dgs~~~~~~~~g~g~v~~~~~~~~~~~~~~~~~~~s~~~aEl~al~~al~~~~~~~~~~i~i~~Ds~~~~~~~~~~   80 (130)
T cd06222           1 VIYTDGSCRGNPGPAGAGVVLRDPGGEVLLSGGLLGGNTTNNRAELLALIEALELALELGGKKVNIYTDSQYVINALTGW   80 (130)
T ss_pred             CEEecccCCCCCCceEEEEEEEeCCCeEEEeccccCCCCcHHHHHHHHHHHHHHHHHhCCCceEEEEECHHHHHHHhhcc
Confidence            4899999985  48888888754432  11 1122256799999999999999999988889999999999999999984


Q ss_pred             ccCCCCCCchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHh
Q psy16749        752 CLNTTPTHLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYAT  803 (821)
Q Consensus       752 ~~~~~p~~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~  803 (821)
                      ... .......++.++.+.+ +.+..+.|.|||+|+|+.+|+.||.|||.++
T Consensus        81 ~~~-~~~~~~~~~~~i~~~~-~~~~~~~i~~v~~h~~~~~n~~ad~la~~~~  130 (130)
T cd06222          81 YEG-KPVKNVDLWQRLLALL-KRFHKVRFEWVPGHSGIEGNERADALAKEAA  130 (130)
T ss_pred             ccC-CChhhHHHHHHHHHHH-hCCCeEEEEEcCCCCCCcchHHHHHHHHhhC
Confidence            321 1123344566666655 5678999999999999999999999999874


No 7  
>KOG3752|consensus
Probab=99.69  E-value=1.5e-16  Score=173.03  Aligned_cols=131  Identities=25%  Similarity=0.223  Sum_probs=106.4

Q ss_pred             cEEEeccccccC-----CCcceEEecCCCc-cceeeccC-CCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHH
Q psy16749        675 AICIYTDASKKN-----EKVGAAWFCPTYK-SKACFKLH-PATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQK  747 (821)
Q Consensus       675 ~i~IYTDGS~~~-----g~vGagvv~~~~~-~~~~~~l~-~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~  747 (821)
                      ..++|||||+..     ..+|+|||++++. ....+.+. +..++|+|||.||.+||+.+.+..+.+|+|.|||.++++.
T Consensus       212 ~~vvytDGS~~~ng~~~~~AGyGvywg~~~e~N~s~pv~~g~qtNnrAEl~Av~~ALkka~~~~~~kv~I~TDS~~~i~~  291 (371)
T KOG3752|consen  212 IQVVYTDGSSSGNGRKSSRAGYGVYWGPGHELNVSGPLAGGRQTNNRAELIAAIEALKKARSKNINKVVIRTDSEYFINS  291 (371)
T ss_pred             ceEEEecCccccCCCCCCcceeEEeeCCCCcccccccCCCCcccccHHHHHHHHHHHHHHHhcCCCcEEEEechHHHHHH
Confidence            489999999995     2689999997664 46667775 7899999999999999999999888899999999999999


Q ss_pred             HhhcccC----CCCC-----------CchHHHHHHHHHHHh-cCCcEEEEEecCCCCCcccHHHHHHHHHHhhc
Q psy16749        748 LSKNCLN----TTPT-----------HLELEILSSYKHLQN-TCKTVKLAWIKGHEGIKGNVEVDRLAKYATVH  805 (821)
Q Consensus       748 L~~~~~~----~~p~-----------~~s~~i~~i~~~L~~-~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~  805 (821)
                      |+.|..+    +|-.           .....+.++.++++. .+.+|.+.||+||.||.||++||.||+++..+
T Consensus       292 l~~wv~~~k~~~~k~~~~~~~i~~~v~n~~~~~e~~~l~q~~~~~~vq~~~V~Gh~gi~gne~Ad~lARkgs~~  365 (371)
T KOG3752|consen  292 LTLWVQGWKKNGWKTSNGSDRICAYVKNQDFFNELDELEQEISNKKVQQEYVGGHSGILGNEMADALARKGSTL  365 (371)
T ss_pred             HHHHHhhhccCccccccCCCccceeeecchHHHHHHHHHhhhccCceEEEEecCcCCcchHHHHHHHHhhhhhh
Confidence            9988763    1211           112235555565554 57899999999999999999999999998543


No 8  
>PRK13907 rnhA ribonuclease H; Provisional
Probab=99.61  E-value=4.6e-15  Score=141.02  Aligned_cols=122  Identities=21%  Similarity=0.264  Sum_probs=95.9

Q ss_pred             EEEeccccccC--CCcceEEecCCCcc--ceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhc
Q psy16749        676 ICIYTDASKKN--EKVGAAWFCPTYKS--KACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKN  751 (821)
Q Consensus       676 i~IYTDGS~~~--g~vGagvv~~~~~~--~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~  751 (821)
                      +.||+|||+..  |..|+|+|+.+.+.  ...+. ....+++.||+.|++.||+++.+.+...+.|++||+++++.|+++
T Consensus         2 ~~iy~DGa~~~~~g~~G~G~vi~~~~~~~~~~~~-~~~~tn~~AE~~All~aL~~a~~~g~~~v~i~sDS~~vi~~~~~~   80 (128)
T PRK13907          2 IEVYIDGASKGNPGPSGAGVFIKGVQPAVQLSLP-LGTMSNHEAEYHALLAALKYCTEHNYNIVSFRTDSQLVERAVEKE   80 (128)
T ss_pred             EEEEEeeCCCCCCCccEEEEEEEECCeeEEEEec-ccccCCcHHHHHHHHHHHHHHHhCCCCEEEEEechHHHHHHHhHH
Confidence            68999999994  67888988854322  22222 245799999999999999999998889999999999999999882


Q ss_pred             ccCCCCCCchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhc
Q psy16749        752 CLNTTPTHLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVH  805 (821)
Q Consensus       752 ~~~~~p~~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~  805 (821)
                      ...  ......++.++..++ ..+..+.|.|||+    .+|+.||.||+.|+..
T Consensus        81 ~~~--~~~~~~l~~~~~~l~-~~f~~~~~~~v~r----~~N~~Ad~LA~~a~~~  127 (128)
T PRK13907         81 YAK--NKMFAPLLEEALQYI-KSFDLFFIKWIPS----SQNKVADELARKAILQ  127 (128)
T ss_pred             Hhc--ChhHHHHHHHHHHHH-hcCCceEEEEcCc----hhchhHHHHHHHHHhc
Confidence            221  123456777777766 4677888999996    5899999999999864


No 9  
>PRK07708 hypothetical protein; Validated
Probab=99.51  E-value=1.4e-13  Score=143.50  Aligned_cols=130  Identities=18%  Similarity=0.090  Sum_probs=96.1

Q ss_pred             hhCCCcEEEecccccc--CCCcceEEecCC--Cccce----eeccCCCCCcchhHHHHHHHHHHHHHhcCCCc--EEEEc
Q psy16749        670 EKCPNAICIYTDASKK--NEKVGAAWFCPT--YKSKA----CFKLHPATSTYTAEVIGIWEALKYSASLKNNE--ILILT  739 (821)
Q Consensus       670 ~~~~~~i~IYTDGS~~--~g~vGagvv~~~--~~~~~----~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~--I~I~S  739 (821)
                      ...++.+++|+|||+.  .++.|+|+++..  ++...    ...+....+|+.||+.|++.||+++.+.+.+.  |.|++
T Consensus        68 ~~ep~~~~vY~DGs~~~n~g~aG~GvVI~~~~g~~~~~~~~~~~l~~~~TNN~AEy~Ali~aL~~A~e~g~~~~~V~I~~  147 (219)
T PRK07708         68 EEEPHEILVYFDGGFDKETKLAGLGIVIYYKQGNKRYRIRRNAYIEGIYDNNEAEYAALYYAMQELEELGVKHEPVTFRG  147 (219)
T ss_pred             ccCCCcEEEEEeeccCCCCCCcEEEEEEEECCCCEEEEEEeeccccccccCcHHHHHHHHHHHHHHHHcCCCcceEEEEe
Confidence            4456789999999997  467788887642  32221    22455568999999999999999999987655  89999


Q ss_pred             chHHHHHHHhhcccCCCCCCc---hHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcCC
Q psy16749        740 DSKSACQKLSKNCLNTTPTHL---ELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHGE  807 (821)
Q Consensus       740 DSqsAL~~L~~~~~~~~p~~~---s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~~  807 (821)
                      ||+.+++.|++    .|....   ..+..++..++.+....+.|.|||    +..|+.||+||+.|++...
T Consensus       148 DSqlVi~qi~g----~wk~~~~~l~~y~~~i~~l~~~~~l~~~~~~Vp----R~~N~~AD~LAk~Al~~~~  210 (219)
T PRK07708        148 DSQVVLNQLAG----EWPCYDEHLNHWLDRIEQKLKQLKLTPVYEPIS----RKQNKEADQLATQALEGTV  210 (219)
T ss_pred             ccHHHHHHhCC----CceeCChhHHHHHHHHHHHHhhCCceEEEEECC----chhhhHHHHHHHHHHhcCC
Confidence            99999999987    222222   234455555565555567888987    5779999999999998543


No 10 
>PRK07238 bifunctional RNase H/acid phosphatase; Provisional
Probab=99.35  E-value=4.9e-12  Score=141.05  Aligned_cols=123  Identities=18%  Similarity=0.172  Sum_probs=94.8

Q ss_pred             cEEEeccccccC--CCcceEEecCCC--c-c--ceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHH
Q psy16749        675 AICIYTDASKKN--EKVGAAWFCPTY--K-S--KACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQK  747 (821)
Q Consensus       675 ~i~IYTDGS~~~--g~vGagvv~~~~--~-~--~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~  747 (821)
                      .+.||+|||+..  |..|+|+++.+.  + .  .....+ ...+|+.||+.|++.||+++.+.+...|.|++||+++++.
T Consensus         2 ~~~i~~DGa~~~n~g~aG~G~vi~~~~~~~~~~~~~~~~-~~~tnn~AE~~All~gL~~a~~~g~~~v~i~~DS~lvi~~   80 (372)
T PRK07238          2 KVVVEADGGSRGNPGPAGYGAVVWDADRGEVLAERAEAI-GRATNNVAEYRGLIAGLEAAAELGATEVEVRMDSKLVVEQ   80 (372)
T ss_pred             eEEEEecCCCCCCCCceEEEEEEEeCCCCcEEEEeeccc-CCCCchHHHHHHHHHHHHHHHhCCCCeEEEEeCcHHHHHH
Confidence            368999999995  567888887543  2 1  122333 3678999999999999999999988999999999999999


Q ss_pred             HhhcccCCCCCCc---hHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcCC
Q psy16749        748 LSKNCLNTTPTHL---ELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHGE  807 (821)
Q Consensus       748 L~~~~~~~~p~~~---s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~~  807 (821)
                      |++    +|...+   ..++.++..++ +++..|.|.|||+    .+|+.||.||+.|+....
T Consensus        81 i~~----~~~~~~~~l~~~~~~i~~l~-~~f~~~~i~~v~r----~~N~~AD~LA~~a~~~~~  134 (372)
T PRK07238         81 MSG----RWKVKHPDMKPLAAQARELA-SQFGRVTYTWIPR----ARNAHADRLANEAMDAAA  134 (372)
T ss_pred             hCC----CCccCChHHHHHHHHHHHHH-hcCCceEEEECCc----hhhhHHHHHHHHHHHhhc
Confidence            976    222222   23455555554 5678899999995    789999999999987654


No 11 
>PF13456 RVT_3:  Reverse transcriptase-like; PDB: 3ALY_A 2EHG_A 3HST_B.
Probab=99.01  E-value=2.2e-09  Score=93.81  Aligned_cols=85  Identities=24%  Similarity=0.256  Sum_probs=67.6

Q ss_pred             chhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhcccCCCCCCchHHHHHHHHHHHhcCCcEEEEEecCCCCCccc
Q psy16749        713 YTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKNCLNTTPTHLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGN  792 (821)
Q Consensus       713 ~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~~~~~~p~~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GN  792 (821)
                      ..||+.|++.||+++.+.+.++|.|++||+.+++.|++...  .......++.++...+. ....+.|.||+    +++|
T Consensus         2 ~~aE~~al~~al~~a~~~g~~~i~v~sDs~~vv~~i~~~~~--~~~~~~~~~~~i~~~~~-~~~~~~~~~i~----r~~N   74 (87)
T PF13456_consen    2 LEAEALALLEALQLAWELGIRKIIVESDSQLVVDAINGRSS--SRSELRPLIQDIRSLLD-RFWNVSVSHIP----REQN   74 (87)
T ss_dssp             HHHHHHHHHHHHHHHHCCT-SCEEEEES-HHHHHHHTTSS-----SCCHHHHHHHHHHHC-CCSCEEEEE------GGGS
T ss_pred             cHHHHHHHHHHHHHHHHCCCCEEEEEecCcccccccccccc--ccccccccchhhhhhhc-cccceEEEEEC----hHHh
Confidence            57999999999999999999999999999999999988211  12344567778777774 58899999998    7999


Q ss_pred             HHHHHHHHHHhh
Q psy16749        793 VEVDRLAKYATV  804 (821)
Q Consensus       793 E~AD~LAK~A~~  804 (821)
                      ..||.|||.|..
T Consensus        75 ~~A~~LA~~a~~   86 (87)
T PF13456_consen   75 KVADALAKFALS   86 (87)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             HHHHHHHHHHhh
Confidence            999999999874


No 12 
>PF09004 DUF1891:  Domain of unknown function (DUF1891);  InterPro: IPR015095 This domain is found at the extreme N terminus of eukaryotic alkylated DNA repair protein homologs. ; GO: 0008168 methyltransferase activity, 0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors, 0055114 oxidation-reduction process; PDB: 2CQ2_A 3THT_C 3THP_A.
Probab=89.53  E-value=0.051  Score=42.99  Aligned_cols=41  Identities=7%  Similarity=0.184  Sum_probs=7.7

Q ss_pred             chHHHHHHHHHHHHHHHHHHhhccCCCCCChHHHHHHHHHHhH
Q psy16749        467 IHYPMIENLPDCNKYLNIMKMICNKHWGMNPTIGLNYYKATIR  509 (821)
Q Consensus       467 f~~HI~~i~~KA~k~ln~Lr~L~~~~wGls~~~lr~LYkAvVr  509 (821)
                      |+.|+..+.+|+..++..||+|..  .+++...+..+|+++|.
T Consensus         1 W~~n~~~~~KKa~qRlyFLRkl~k--~~~~~~~l~lfY~s~Ie   41 (42)
T PF09004_consen    1 WTSNTTSLYKKAQQRLYFLRKLRK--FNVDSKLLTLFYHSVIE   41 (42)
T ss_dssp             ---------------------------SS-----SHTTTTT--
T ss_pred             CCcchHHHHHHHHhHHHHHHHHHH--ccchhHHHHHHHHHHhc
Confidence            889999999999999999999997  46788888999988875


No 13 
>PHA03378 EBNA-3B; Provisional
Probab=88.15  E-value=6.1  Score=47.62  Aligned_cols=24  Identities=21%  Similarity=0.371  Sum_probs=11.9

Q ss_pred             cCcchhhhccCCcccccCCCCcch
Q psy16749        156 ETNSETAEKETPLSEVPVIPQEAQ  179 (821)
Q Consensus       156 ~~~~~~~~~~~~~~~~~~~~~~~~  179 (821)
                      |+.-|-.+|.--+--+|-+|.++-
T Consensus       510 e~d~e~~eq~vmat~l~~~p~~p~  533 (991)
T PHA03378        510 EKDDEDMEQRVMATLLPPSPPQPR  533 (991)
T ss_pred             hhchHHHHHHHHHhhCCCCCCCCc
Confidence            344445555544444555555553


No 14 
>PHA03378 EBNA-3B; Provisional
Probab=87.90  E-value=6.4  Score=47.50  Aligned_cols=15  Identities=13%  Similarity=-0.132  Sum_probs=6.3

Q ss_pred             CCCcCCcchhhhhcc
Q psy16749         36 KPQEHDDLTFETKES   50 (821)
Q Consensus        36 ~~~~~~~~~~~~~~~   50 (821)
                      .|.++-++-|++--+
T Consensus       476 ~p~~pl~~~~~~~vi  490 (991)
T PHA03378        476 EPWQPLPHPQVTPVI  490 (991)
T ss_pred             CCCCCCCCcceeeeE
Confidence            344444444444433


No 15 
>PRK14950 DNA polymerase III subunits gamma and tau; Provisional
Probab=81.37  E-value=7.5  Score=46.86  Aligned_cols=8  Identities=0%  Similarity=0.127  Sum_probs=3.5

Q ss_pred             hhhhhhhH
Q psy16749        525 SKLKTLDK  532 (821)
Q Consensus       525 ~~ikkLe~  532 (821)
                      .+.++|+.
T Consensus       511 ~~~~~~~~  518 (585)
T PRK14950        511 FHKDKIEE  518 (585)
T ss_pred             HHHHHhhc
Confidence            34444444


No 16 
>PF05380 Peptidase_A17:  Pao retrotransposon peptidase ;  InterPro: IPR008042 This signature identifies members of the Pao retrotransposon family.
Probab=79.92  E-value=3.7  Score=41.04  Aligned_cols=54  Identities=11%  Similarity=0.183  Sum_probs=36.1

Q ss_pred             CcEEEeccccccCCCcceEEecCC---Ccccee-----ecc--CCCCCcchhHHHHHHHHHHHHHh
Q psy16749        674 NAICIYTDASKKNEKVGAAWFCPT---YKSKAC-----FKL--HPATSTYTAEVIGIWEALKYSAS  729 (821)
Q Consensus       674 ~~i~IYTDGS~~~g~vGagvv~~~---~~~~~~-----~~l--~~~~T~~~AEL~AI~~ALe~a~~  729 (821)
                      ..+++|+|+|.  .++|+.+|+..   ++....     .++  -...++-+-||+|+..|.+++..
T Consensus        80 ~~L~~F~DAS~--~aygavvYlr~~~~~~~~~~ll~aKsrv~P~k~~tIPRlEL~a~~l~~~l~~~  143 (159)
T PF05380_consen   80 VELHVFCDASE--SAYGAVVYLRSYSDGSVQVRLLFAKSRVAPLKTVTIPRLELLAALLGVRLANT  143 (159)
T ss_pred             eeeeEeecccc--cceeeEeEeeeccCCceeeeeeeecccccCCCCCcHHHHHHHHHHHHHHHHHH
Confidence            36899999994  46676666533   221111     122  13458999999999999998864


No 17 
>PRK14900 valS valyl-tRNA synthetase; Provisional
Probab=79.58  E-value=0.61  Score=59.75  Aligned_cols=15  Identities=20%  Similarity=0.209  Sum_probs=7.3

Q ss_pred             CCccccccccccccc
Q psy16749        135 PAVDLTQDIVEEKEA  149 (821)
Q Consensus       135 ~~~~~~~~~~~~~~~  149 (821)
                      ..|.+..+-.|+|+.
T Consensus       907 ~~~~~~~~~~~~~~~  921 (1052)
T PRK14900        907 EANSARRDTMEIQNE  921 (1052)
T ss_pred             ccccccccccccccc
Confidence            445555555544443


No 18 
>PRK14900 valS valyl-tRNA synthetase; Provisional
Probab=69.57  E-value=7.5  Score=50.14  Aligned_cols=10  Identities=0%  Similarity=0.209  Sum_probs=4.2

Q ss_pred             chhcccccCC
Q psy16749         89 DFISAHTEKT   98 (821)
Q Consensus        89 ~~~~~~~~~~   98 (821)
                      +|+....+|.
T Consensus       877 ~~veka~~kl  886 (1052)
T PRK14900        877 AVVEKDRARA  886 (1052)
T ss_pred             HHHHHHHHHH
Confidence            4444444443


No 19 
>COG3341 Predicted double-stranded RNA/RNA-DNA hybrid binding protein [General function prediction only]
Probab=68.68  E-value=11  Score=39.94  Aligned_cols=139  Identities=10%  Similarity=-0.034  Sum_probs=85.8

Q ss_pred             cEEEecccccc--CCCcceEEecCCCcccee-----eccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEE-E----cchH
Q psy16749        675 AICIYTDASKK--NEKVGAAWFCPTYKSKAC-----FKLHPATSTYTAEVIGIWEALKYSASLKNNEILI-L----TDSK  742 (821)
Q Consensus       675 ~i~IYTDGS~~--~g~vGagvv~~~~~~~~~-----~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I-~----SDSq  742 (821)
                      .+.-+.+|+..  ++..++-++.....+...     ..+ ...++..+|.+|++.+|+.+...+.++.+| |    .||+
T Consensus        65 ~~i~~~~G~y~~~p~t~~~k~yr~k~~~~~~~lt~~~~~-~~~~n~s~d~la~ly~~~~~~~~~nrk~~i~y~~~~~ds~  143 (225)
T COG3341          65 YIISWAKGDYDAKPGTQEFKEYRGKCTIEYSWLTESSEF-SIKSNDSGDVLAKLYGLRYEVPLDNRKSVINYLTPGNDSW  143 (225)
T ss_pred             ccceeccCCccccCCCcceeEEeccccccceeeeeeccc-ccccCchHHHHHHhccccccccccCccceeeccCCcchhH
Confidence            45566777666  344455544432211111     112 334678899999999999998877766666 6    6999


Q ss_pred             HHHHHHhhcccCCCCCCchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcCCCCccccCCCCCC
Q psy16749        743 SACQKLSKNCLNTTPTHLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHGEASTIVISPTNTE  819 (821)
Q Consensus       743 sAL~~L~~~~~~~~p~~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~~~~~i~~~~~~di  819 (821)
                      +.+.-+.+-..     ....+....-.-+.-.+....+.|+--|.++.-+...+.+..........+...++-|+++
T Consensus       144 a~~k~~k~~~~-----~k~~~~~~~e~~~~~~~v~h~~k~i~~~~~~~~~~~~~s~~~~~~k~~~~~~~~~~~~~~~  215 (225)
T COG3341         144 AYFKYVKDKCE-----TKVNLLKDREFFSIGKGVFHDEKDINIHIWIFESKKGNSHVYNTSKKELSPKYHDIKFEDY  215 (225)
T ss_pred             HHHHHHhhhhh-----hHHHHHHhHHhhhccchhhhhhhhcccccccchhhhhhhhhhchhhhhccccccccccccc
Confidence            88887766110     0011111100011112345567899999999999999999888887777776667666664


No 20 
>PRK14950 DNA polymerase III subunits gamma and tau; Provisional
Probab=66.26  E-value=28  Score=42.05  Aligned_cols=8  Identities=0%  Similarity=-0.126  Sum_probs=3.1

Q ss_pred             HHHHHHHH
Q psy16749        477 DCNKYLNI  484 (821)
Q Consensus       477 KA~k~ln~  484 (821)
                      +.+.++..
T Consensus       480 ~~~~~~~~  487 (585)
T PRK14950        480 PRSPAVQA  487 (585)
T ss_pred             hcCHHHHH
Confidence            33444433


No 21 
>PHA03269 envelope glycoprotein C; Provisional
Probab=58.73  E-value=31  Score=40.61  Aligned_cols=17  Identities=24%  Similarity=0.403  Sum_probs=10.2

Q ss_pred             HHHHhHHHHhhcccccccc
Q psy16749        504 YKATIRATLDFGSVFYSES  522 (821)
Q Consensus       504 YkAvVrPiL~YGa~VW~~~  522 (821)
                      |+.+|||  +.-..+|...
T Consensus       216 ~~~~~~~--~~~~~~~~~~  232 (566)
T PHA03269        216 WKPLIRP--DLHIEIWFKK  232 (566)
T ss_pred             cccccCC--ceeEEEEeec
Confidence            5667777  3445677643


No 22 
>PF07174 FAP:  Fibronectin-attachment protein (FAP);  InterPro: IPR010801 This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix [].; GO: 0050840 extracellular matrix binding, 0005576 extracellular region
Probab=56.06  E-value=79  Score=34.80  Aligned_cols=51  Identities=16%  Similarity=0.071  Sum_probs=31.1

Q ss_pred             cEEEeccccccCCCcceEEecC-------CC--ccceeeccCC-CCCcchhHHHHHHHHHH
Q psy16749        675 AICIYTDASKKNEKVGAAWFCP-------TY--KSKACFKLHP-ATSTYTAEVIGIWEALK  725 (821)
Q Consensus       675 ~i~IYTDGS~~~g~vGagvv~~-------~~--~~~~~~~l~~-~~T~~~AEL~AI~~ALe  725 (821)
                      +-+-|+|.++-+|.+..|||-.       .+  +..+...|+. ..-+-.++..++.+.|+
T Consensus       223 yevkf~d~~kp~gqiw~~vvg~p~~~~~~~~~~~rwfvvwlgt~~~pvd~~~a~~la~si~  283 (297)
T PF07174_consen  223 YEVKFTDANKPNGQIWAGVVGSPVAPGTPRGTPQRWFVVWLGTANNPVDKGAAKALAESIR  283 (297)
T ss_pred             EEEEeccCCCCCCceEEEeecCcCCCCCCCCCCceEEEEEecCCCCCCCHHHHHHHHhhcc
Confidence            3468999999999999888853       01  1233333432 33455666666655543


No 23 
>KOG1924|consensus
Probab=44.55  E-value=1.2e+02  Score=37.99  Aligned_cols=28  Identities=29%  Similarity=0.506  Sum_probs=14.5

Q ss_pred             hhhhhhHHHHHHHHHHhCCCCCChHHHHH
Q psy16749        526 KLKTLDKVQNQALRLAMGYLNSTPIDNIL  554 (821)
Q Consensus       526 ~ikkLe~vQr~AlR~ItGAfRTTp~~aLl  554 (821)
                      .++.|+.--.+-|-+.+|.|| .|++.|.
T Consensus       692 el~ilDsKtaQnLsIflgS~r-mpyeeik  719 (1102)
T KOG1924|consen  692 ELRILDSKTAQNLSIFLGSFR-MPYEEIK  719 (1102)
T ss_pred             hheecchHHHHHHHHHHhhcc-CCHHHHH
Confidence            344455433334445666666 5555543


No 24 
>PF00336 DNA_pol_viral_C:  DNA polymerase (viral) C-terminal domain;  InterPro: IPR001462 This domain is at the C terminus of hepatitis B-type viruses P proteins and represents a functional domain that controls the RNase H activities of the protein. The domain is always associated with IPR000201 from INTERPRO and .; GO: 0004523 ribonuclease H activity
Probab=42.23  E-value=24  Score=37.19  Aligned_cols=63  Identities=19%  Similarity=0.181  Sum_probs=39.3

Q ss_pred             CCcEEEeccccccCCCcceEEecCCCccceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHH
Q psy16749        673 PNAICIYTDASKKNEKVGAAWFCPTYKSKACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQ  746 (821)
Q Consensus       673 ~~~i~IYTDGS~~~g~vGagvv~~~~~~~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~  746 (821)
                      ..--.||+|+.-.    |.|+.+..+.....|..  -..++.+||++...|.-+..    .++ |.+||..|+.
T Consensus        92 ~~lc~VfaDATpT----gwgi~i~~~~~~~Tfs~--~l~IhtaELlaaClAr~~~~----~r~-l~tDnt~Vls  154 (245)
T PF00336_consen   92 PGLCQVFADATPT----GWGISITGQRMRGTFSK--PLPIHTAELLAACLARLMSG----ARC-LGTDNTVVLS  154 (245)
T ss_pred             CCCCceeccCCCC----cceeeecCceeeeeecc--cccchHHHHHHHHHHHhccC----CcE-EeecCcEEEe
Confidence            3345799998876    33333332333333332  34688999999988766542    344 8999986643


No 25 
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=40.92  E-value=17  Score=44.74  Aligned_cols=21  Identities=48%  Similarity=0.423  Sum_probs=8.4

Q ss_pred             CcCCCCCccccCCCCCCcccc
Q psy16749        417 ATATRPATTTSKPATTTSTDI  437 (821)
Q Consensus       417 ~~~~~~~~~~~~~~~~~~~~~  437 (821)
                      .+.+.+.|+|+-+.+||.++|
T Consensus       830 Qts~~TtTtTsps~ttt~~~i  850 (872)
T COG3889         830 QTSTSTTTTTSPSQTTTGGGI  850 (872)
T ss_pred             cccccceeeeccccccccccc
Confidence            333333444444444444433


No 26 
>PRK07764 DNA polymerase III subunits gamma and tau; Validated
Probab=40.52  E-value=1.2e+02  Score=38.55  Aligned_cols=8  Identities=38%  Similarity=0.517  Sum_probs=3.2

Q ss_pred             HHHHHHhC
Q psy16749        536 QALRLAMG  543 (821)
Q Consensus       536 ~AlR~ItG  543 (821)
                      .+++-.+|
T Consensus       572 ~al~e~lG  579 (824)
T PRK07764        572 TALAEELG  579 (824)
T ss_pred             HHHHHHhC
Confidence            33444443


No 27 
>COG3147 DedD Uncharacterized protein conserved in bacteria [Function unknown]
Probab=39.92  E-value=1.2e+02  Score=32.38  Aligned_cols=6  Identities=50%  Similarity=0.927  Sum_probs=2.1

Q ss_pred             cCCCCc
Q psy16749        172 PVIPQE  177 (821)
Q Consensus       172 ~~~~~~  177 (821)
                      |-.+.|
T Consensus        67 P~~~pe   72 (226)
T COG3147          67 PTQPPE   72 (226)
T ss_pred             CCCCCc
Confidence            333333


No 28 
>PRK12270 kgd alpha-ketoglutarate decarboxylase; Reviewed
Probab=33.47  E-value=1.3e+02  Score=38.72  Aligned_cols=12  Identities=17%  Similarity=0.354  Sum_probs=6.5

Q ss_pred             HHhhcccccccc
Q psy16749        511 TLDFGSVFYSES  522 (821)
Q Consensus       511 iL~YGa~VW~~~  522 (821)
                      ++.||..+|-..
T Consensus       421 ~~~hGLTlWDLd  432 (1228)
T PRK12270        421 VLTHGLTLWDLD  432 (1228)
T ss_pred             ccccCCCHHhcC
Confidence            344666666543


No 29 
>KOG1812|consensus
Probab=31.01  E-value=91  Score=35.95  Aligned_cols=37  Identities=22%  Similarity=0.285  Sum_probs=32.6

Q ss_pred             CCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHH
Q psy16749        710 TSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQ  746 (821)
Q Consensus       710 ~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~  746 (821)
                      .+...+|++|+..+|..+.+.++..+.+++|+.....
T Consensus        47 ~~~~~ae~~al~~~l~ea~~~~~~~~~~~~d~~~~~~   83 (384)
T KOG1812|consen   47 ITPLEAELMALKRGLTEALELGLNHIVIYCDDELIYE   83 (384)
T ss_pred             cchhhHHHHHHhhccHHHHhhccccceEecccHHHHH
Confidence            6788999999999999999999999999999654444


No 30 
>KOG1924|consensus
Probab=30.67  E-value=3e+02  Score=34.72  Aligned_cols=9  Identities=33%  Similarity=0.575  Sum_probs=5.1

Q ss_pred             hHHHHhhhh
Q psy16749        105 KEEVLDDLV  113 (821)
Q Consensus       105 ~~~~~~~~~  113 (821)
                      .+|-+-|+|
T Consensus       428 IEecISqIv  436 (1102)
T KOG1924|consen  428 IEECISQIV  436 (1102)
T ss_pred             HHHHHHHHH
Confidence            555555555


No 31 
>PRK12270 kgd alpha-ketoglutarate decarboxylase; Reviewed
Probab=30.51  E-value=1.7e+02  Score=37.95  Aligned_cols=9  Identities=33%  Similarity=0.471  Sum_probs=4.5

Q ss_pred             cEEEEcchH
Q psy16749        734 EILILTDSK  742 (821)
Q Consensus       734 ~I~I~SDSq  742 (821)
                      .|.|.-|..
T Consensus       650 pi~~hGdaa  658 (1228)
T PRK12270        650 PILLHGDAA  658 (1228)
T ss_pred             EEEEecccc
Confidence            445555543


No 32 
>KOG0559|consensus
Probab=29.22  E-value=1.8e+02  Score=33.41  Aligned_cols=13  Identities=38%  Similarity=0.560  Sum_probs=5.4

Q ss_pred             CCCCCCCCCCCCC
Q psy16749        249 AKPAISPVKKTAT  261 (821)
Q Consensus       249 ~~p~~~~~~~~~~  261 (821)
                      |+|+.++.++.++
T Consensus       175 p~~~s~~~p~~~a  187 (457)
T KOG0559|consen  175 PKPSSKPPPKEAA  187 (457)
T ss_pred             CCccCCCCccccC
Confidence            4444444444333


No 33 
>PTZ00436 60S ribosomal protein L19-like protein; Provisional
Probab=28.95  E-value=2.3e+02  Score=31.83  Aligned_cols=6  Identities=17%  Similarity=0.285  Sum_probs=2.5

Q ss_pred             HHhhhh
Q psy16749        108 VLDDLV  113 (821)
Q Consensus       108 ~~~~~~  113 (821)
                      +|.+++
T Consensus       154 ~L~dQ~  159 (357)
T PTZ00436        154 QLAEQL  159 (357)
T ss_pred             HHHHHH
Confidence            444443


No 34 
>PF15324 TALPID3:  Hedgehog signalling target
Probab=23.28  E-value=1.3e+03  Score=30.46  Aligned_cols=10  Identities=20%  Similarity=0.610  Sum_probs=4.8

Q ss_pred             cCCCCChHHH
Q psy16749        441 MNQPFTPEEL  450 (821)
Q Consensus       441 ~~~pi~~e~~  450 (821)
                      .+.+|...|+
T Consensus      1133 ~dRpiSEGEi 1142 (1252)
T PF15324_consen 1133 TDRPISEGEI 1142 (1252)
T ss_pred             cCCcCCccee
Confidence            4556653333


No 35 
>COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism]
Probab=21.37  E-value=92  Score=38.23  Aligned_cols=77  Identities=17%  Similarity=0.129  Sum_probs=49.9

Q ss_pred             HHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhcccCC----CCCCc---hHHHHHHHHHHHhcCCcEEEEEecCCCCC
Q psy16749        717 VIGIWEALKYSASLKNNEILILTDSKSACQKLSKNCLNT----TPTHL---ELEILSSYKHLQNTCKTVKLAWIKGHEGI  789 (821)
Q Consensus       717 L~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~~~~~----~p~~~---s~~i~~i~~~L~~~~~~V~~~WVPGHsGI  789 (821)
                      ..++..-|.++.+++.+.|.+.-=++...+  .+|.+.+    .|...   -+-++.+.+.+-+.|+.|-|-|||+|.|.
T Consensus       164 ~e~a~~llpYl~elG~T~IELMPv~e~p~~--~sWGYq~~g~yAp~sryGtPedfk~fVD~aH~~GIgViLD~V~~HF~~  241 (628)
T COG0296         164 FELAIELLPYLKELGITHIELMPVAEHPGD--RSWGYQGTGYYAPTSRYGTPEDFKALVDAAHQAGIGVILDWVPNHFPP  241 (628)
T ss_pred             HHHHHHHhHHHHHhCCCEEEEcccccCCCC--CCCCCCcceeccccccCCCHHHHHHHHHHHHHcCCEEEEEecCCcCCC
Confidence            344566678888888888877543321111  2344421    01111   12377788888889999999999999999


Q ss_pred             cccHHH
Q psy16749        790 KGNVEV  795 (821)
Q Consensus       790 ~GNE~A  795 (821)
                      .||-++
T Consensus       242 d~~~L~  247 (628)
T COG0296         242 DGNYLA  247 (628)
T ss_pred             Ccchhh
Confidence            998744


No 36 
>PF07462 MSP1_C:  Merozoite surface protein 1 (MSP1) C-terminus;  InterPro: IPR010901 This entry represents the C-terminal region of merozoite surface protein 1 (MSP1), which is found in a number of Plasmodium species. MSP-1 is a 200 kDa protein expressed on the surface of the Plasmodium vivax merozoite. MSP-1 of Plasmodium species is synthesised as a high-molecular-weight precursor and then processed into several fragments. At the time of red cell invasion by the merozoite, only the 19 kDa C-terminal fragment (MSP-119), which contains two epidermal growth factor-like domains, remains on the surface. Antibodies against MSP-119 inhibit merozoite entry into red cells, and immunisation with MSP-119 protects monkeys from challenging infections. Hence, MSP-119 is considered a promising vaccine candidate [].; GO: 0009405 pathogenesis, 0016020 membrane
Probab=21.27  E-value=1.9e+02  Score=34.85  Aligned_cols=8  Identities=25%  Similarity=0.712  Sum_probs=3.7

Q ss_pred             HHHHhhcc
Q psy16749        572 RFVLKVIS  579 (821)
Q Consensus       572 ~~~~Rl~t  579 (821)
                      .|++.++.
T Consensus       415 nyFL~VLn  422 (574)
T PF07462_consen  415 NYFLDVLN  422 (574)
T ss_pred             HHHHHHHh
Confidence            44554443


Done!