Query psy16749
Match_columns 821
No_of_seqs 296 out of 1655
Neff 5.4
Searched_HMMs 46136
Date Fri Aug 16 21:00:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16749.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16749hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG0328 RnhA Ribonuclease HI [ 99.8 1.3E-20 2.8E-25 184.9 13.8 130 675-806 3-146 (154)
2 PF00075 RNase_H: RNase H; In 99.8 6.1E-21 1.3E-25 180.0 8.4 125 674-804 2-132 (132)
3 PRK06548 ribonuclease H; Provi 99.8 2.7E-20 5.8E-25 184.8 13.0 129 674-807 4-144 (161)
4 PRK08719 ribonuclease H; Revie 99.8 3.7E-20 8.1E-25 181.5 12.0 126 673-804 2-146 (147)
5 PRK00203 rnhA ribonuclease H; 99.8 5.4E-20 1.2E-24 180.3 11.7 130 675-808 3-145 (150)
6 cd06222 RnaseH RNase H (RNase 99.7 1.8E-16 3.8E-21 143.9 12.8 125 677-803 1-130 (130)
7 KOG3752|consensus 99.7 1.5E-16 3.3E-21 173.0 13.4 131 675-805 212-365 (371)
8 PRK13907 rnhA ribonuclease H; 99.6 4.6E-15 1E-19 141.0 12.6 122 676-805 2-127 (128)
9 PRK07708 hypothetical protein; 99.5 1.4E-13 2.9E-18 143.5 13.9 130 670-807 68-210 (219)
10 PRK07238 bifunctional RNase H/ 99.3 4.9E-12 1.1E-16 141.0 13.0 123 675-807 2-134 (372)
11 PF13456 RVT_3: Reverse transc 99.0 2.2E-09 4.7E-14 93.8 10.4 85 713-804 2-86 (87)
12 PF09004 DUF1891: Domain of un 89.5 0.051 1.1E-06 43.0 -1.7 41 467-509 1-41 (42)
13 PHA03378 EBNA-3B; Provisional 88.2 6.1 0.00013 47.6 12.9 24 156-179 510-533 (991)
14 PHA03378 EBNA-3B; Provisional 87.9 6.4 0.00014 47.5 12.8 15 36-50 476-490 (991)
15 PRK14950 DNA polymerase III su 81.4 7.5 0.00016 46.9 10.1 8 525-532 511-518 (585)
16 PF05380 Peptidase_A17: Pao re 79.9 3.7 8E-05 41.0 5.8 54 674-729 80-143 (159)
17 PRK14900 valS valyl-tRNA synth 79.6 0.61 1.3E-05 59.7 0.1 15 135-149 907-921 (1052)
18 PRK14900 valS valyl-tRNA synth 69.6 7.5 0.00016 50.1 6.1 10 89-98 877-886 (1052)
19 COG3341 Predicted double-stran 68.7 11 0.00024 39.9 6.0 139 675-819 65-215 (225)
20 PRK14950 DNA polymerase III su 66.3 28 0.00061 42.1 9.8 8 477-484 480-487 (585)
21 PHA03269 envelope glycoprotein 58.7 31 0.00068 40.6 7.8 17 504-522 216-232 (566)
22 PF07174 FAP: Fibronectin-atta 56.1 79 0.0017 34.8 9.8 51 675-725 223-283 (297)
23 KOG1924|consensus 44.5 1.2E+02 0.0026 38.0 9.7 28 526-554 692-719 (1102)
24 PF00336 DNA_pol_viral_C: DNA 42.2 24 0.00053 37.2 3.3 63 673-746 92-154 (245)
25 COG3889 Predicted solute bindi 40.9 17 0.00038 44.7 2.3 21 417-437 830-850 (872)
26 PRK07764 DNA polymerase III su 40.5 1.2E+02 0.0026 38.5 9.5 8 536-543 572-579 (824)
27 COG3147 DedD Uncharacterized p 39.9 1.2E+02 0.0026 32.4 7.9 6 172-177 67-72 (226)
28 PRK12270 kgd alpha-ketoglutara 33.5 1.3E+02 0.0029 38.7 8.2 12 511-522 421-432 (1228)
29 KOG1812|consensus 31.0 91 0.002 36.0 6.0 37 710-746 47-83 (384)
30 KOG1924|consensus 30.7 3E+02 0.0065 34.7 10.2 9 105-113 428-436 (1102)
31 PRK12270 kgd alpha-ketoglutara 30.5 1.7E+02 0.0036 38.0 8.3 9 734-742 650-658 (1228)
32 KOG0559|consensus 29.2 1.8E+02 0.0039 33.4 7.6 13 249-261 175-187 (457)
33 PTZ00436 60S ribosomal protein 28.9 2.3E+02 0.0049 31.8 8.1 6 108-113 154-159 (357)
34 PF15324 TALPID3: Hedgehog sig 23.3 1.3E+03 0.028 30.5 13.9 10 441-450 1133-1142(1252)
35 COG0296 GlgB 1,4-alpha-glucan 21.4 92 0.002 38.2 3.9 77 717-795 164-247 (628)
36 PF07462 MSP1_C: Merozoite sur 21.3 1.9E+02 0.0041 34.9 6.1 8 572-579 415-422 (574)
No 1
>COG0328 RnhA Ribonuclease HI [DNA replication, recombination, and repair]
Probab=99.84 E-value=1.3e-20 Score=184.92 Aligned_cols=130 Identities=22% Similarity=0.163 Sum_probs=104.2
Q ss_pred cEEEeccccccC--CCcceEEecCCCc--cceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhh
Q psy16749 675 AICIYTDASKKN--EKVGAAWFCPTYK--SKACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSK 750 (821)
Q Consensus 675 ~i~IYTDGS~~~--g~vGagvv~~~~~--~~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~ 750 (821)
.+.||+||+|.. |-.|+|+|+..++ ...+..+. .+||+++||+|++.||+++.+.+...|.|+|||++|+++|..
T Consensus 3 ~v~if~DGa~~gNpG~gG~g~vl~~~~~~~~~s~~~~-~tTNNraEl~A~i~AL~~l~~~~~~~v~l~tDS~yv~~~i~~ 81 (154)
T COG0328 3 KVEIFTDGACLGNPGPGGWGAVLRYGDGEKELSGGEG-RTTNNRAELRALIEALEALKELGACEVTLYTDSKYVVEGITR 81 (154)
T ss_pred ceEEEecCccCCCCCCceEEEEEEcCCceEEEeeeee-cccChHHHHHHHHHHHHHHHhcCCceEEEEecHHHHHHHHHH
Confidence 578999999994 5566777665333 23444444 889999999999999999999888999999999999999998
Q ss_pred cccC----CCC------CCchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcC
Q psy16749 751 NCLN----TTP------THLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHG 806 (821)
Q Consensus 751 ~~~~----~~p------~~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~ 806 (821)
|..+ +|. ..+..+++++..++ .....|.+.||+||.|.++||+||+||+.|+...
T Consensus 82 w~~~w~~~~w~~~~~~pvkn~dl~~~~~~~~-~~~~~v~~~WVkgH~g~~~NeraD~LA~~~~~~~ 146 (154)
T COG0328 82 WIVKWKKNGWKTADKKPVKNKDLWEELDELL-KRHELVFWEWVKGHAGHPENERADQLAREAARAA 146 (154)
T ss_pred HHhhccccCccccccCccccHHHHHHHHHHH-hhCCeEEEEEeeCCCCChHHHHHHHHHHHHHHhh
Confidence 7553 333 23445677777666 4556999999999999999999999999998765
No 2
>PF00075 RNase_H: RNase H; InterPro: IPR002156 The RNase H domain is responsible for hydrolysis of the RNA portion of RNA x DNA hybrids, and this activity requires the presence of divalent cations (Mg2+ or Mn2+) that bind its active site. This domain is a part of a large family of homologous RNase H enzymes of which the RNase HI protein from Escherichia coli is the best characterised []. Secondary structure predictions for the enzymes from E. coli, yeast, human liver and diverse retroviruses (such as Rous sarcoma virus and the Foamy viruses) supported, in every case, the five beta-strands (1 to 5) and four or five alpha-helices (A, B/C, D, E) that have been identified by crystallography in the RNase H domain of Human immunodeficiency virus 1 (HIV-1) reverse transcriptase and in E. coli RNase H []. Reverse transcriptase (RT) is a modular enzyme carrying polymerase and ribonuclease H (RNase H) activities in separable domains. Reverse transcriptase (RT) converts the single-stranded RNA genome of a retrovirus into a double-stranded DNA copy for integration into the host genome. This process requires ribonuclease H as well as RNA- and DNA-directed DNA polymerase activities. Retroviral RNase H is synthesised as part of the POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, RNase H and integrase. POL polyprotein undergoes specific enzymatic cleavage to yield the mature proteins. Bacterial RNase H 3.1.26.4 from EC catalyses endonucleolytic cleavage to 5'-phosphomonoester acting on RNA-DNA hybrids. The 3D structure of the RNase H domain from diverse bacteria and retroviruses has been solved [, , ]. All have four beta strands and four to five alpha helices. The E. coli RNase H1 protein binds a single Mg2+ ion cofactor in the active site of the enzyme. The divalent cation is bound by the carboxyl groups of four acidic residues, Asp-10, Glu-48, Asp-70, and Asp-134 []. The first three acidic residues are highly conserved in all bacterial and retroviral RNase H sequences. ; GO: 0003676 nucleic acid binding, 0004523 ribonuclease H activity; PDB: 3LP3_B 2KW4_A 3P1G_A 1RIL_A 2RPI_A 4EQJ_G 4EP2_B 3OTY_P 3U3G_D 2ZQB_D ....
Probab=99.83 E-value=6.1e-21 Score=180.03 Aligned_cols=125 Identities=32% Similarity=0.365 Sum_probs=95.8
Q ss_pred CcEEEecccccc--CCCcceEEecCCCccceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhc
Q psy16749 674 NAICIYTDASKK--NEKVGAAWFCPTYKSKACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKN 751 (821)
Q Consensus 674 ~~i~IYTDGS~~--~g~vGagvv~~~~~~~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~ 751 (821)
+.+.||||||+. .+..|+|++++.+ ....+.++ ..+++.+||.||++||+.+ . ...+.|||||+++++.|..|
T Consensus 2 ~~~~iytDgS~~~~~~~~~~g~v~~~~-~~~~~~~~-~~s~~~aEl~Ai~~AL~~~-~--~~~v~I~tDS~~v~~~l~~~ 76 (132)
T PF00075_consen 2 KAIIIYTDGSCRPNPGKGGAGYVVWGG-RNFSFRLG-GQSNNRAELQAIIEALKAL-E--HRKVTIYTDSQYVLNALNKW 76 (132)
T ss_dssp TSEEEEEEEEECTTTTEEEEEEEEETT-EEEEEEEE-SECHHHHHHHHHHHHHHTH-S--TSEEEEEES-HHHHHHHHTH
T ss_pred CcEEEEEeCCccCCCCceEEEEEEECC-eEEEeccc-ccchhhhheehHHHHHHHh-h--cccccccccHHHHHHHHHHh
Confidence 578999999987 3445666655434 34445555 7899999999999999944 2 38999999999999988886
Q ss_pred ccC-CCCCCc-h-HHHHHHHHHHHhcCCcEEEEEecCCCCC-cccHHHHHHHHHHhh
Q psy16749 752 CLN-TTPTHL-E-LEILSSYKHLQNTCKTVKLAWIKGHEGI-KGNVEVDRLAKYATV 804 (821)
Q Consensus 752 ~~~-~~p~~~-s-~~i~~i~~~L~~~~~~V~~~WVPGHsGI-~GNE~AD~LAK~A~~ 804 (821)
..+ +|+... . .++.++++++ ..+..|+|+|||||+|+ .||+.||+|||.|+.
T Consensus 77 ~~~~~~~~~~~~~~i~~~i~~~~-~~~~~v~~~~V~~H~~~~~~N~~aD~lAk~a~~ 132 (132)
T PF00075_consen 77 LHGNGWKKTSNGRPIKNEIWELL-SRGIKVRFRWVPGHSGVPQGNERADRLAKEAAQ 132 (132)
T ss_dssp HHHTTSBSCTSSSBHTHHHHHHH-HHSSEEEEEESSSSSSSHHHHHHHHHHHHHHHH
T ss_pred ccccccccccccccchhheeecc-ccceEEeeeeccCcCCCchhHHHHHHHHHHhcC
Confidence 543 333222 2 4666777777 45899999999999999 799999999999873
No 3
>PRK06548 ribonuclease H; Provisional
Probab=99.83 E-value=2.7e-20 Score=184.85 Aligned_cols=129 Identities=24% Similarity=0.271 Sum_probs=99.1
Q ss_pred CcEEEeccccccC--CCcceEEecCCCccceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhc
Q psy16749 674 NAICIYTDASKKN--EKVGAAWFCPTYKSKACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKN 751 (821)
Q Consensus 674 ~~i~IYTDGS~~~--g~vGagvv~~~~~~~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~ 751 (821)
..++|||||++.. |..|+|+|+.+.. ..... ....||++|||.||+.||+.+.. ....|.|||||++++++|..|
T Consensus 4 ~~~~IytDGa~~gnpg~~G~g~~~~~~~-~~~g~-~~~~TNnraEl~Aii~aL~~~~~-~~~~v~I~TDS~yvi~~i~~W 80 (161)
T PRK06548 4 NEIIAATDGSSLANPGPSGWAWYVDENT-WDSGG-WDIATNNIAELTAVRELLIATRH-TDRPILILSDSKYVINSLTKW 80 (161)
T ss_pred CEEEEEEeeccCCCCCceEEEEEEeCCc-EEccC-CCCCCHHHHHHHHHHHHHHhhhc-CCceEEEEeChHHHHHHHHHH
Confidence 3589999999994 5678888875322 11222 24579999999999999986643 345799999999999999965
Q ss_pred cc----CCCCC------CchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcCC
Q psy16749 752 CL----NTTPT------HLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHGE 807 (821)
Q Consensus 752 ~~----~~~p~------~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~~ 807 (821)
.. ++|.. .+.++|++|..++.+ ..|+|.||+||+|+.|||.||+||++|+....
T Consensus 81 ~~~Wk~~gWk~s~G~pV~N~dL~~~l~~l~~~--~~v~~~wVkgHsg~~gNe~aD~LA~~aa~~~~ 144 (161)
T PRK06548 81 VYSWKMRKWRKADGKPVLNQEIIQEIDSLMEN--RNIRMSWVNAHTGHPLNEAADSLARQAANNFS 144 (161)
T ss_pred HHHHHHCCCcccCCCccccHHHHHHHHHHHhc--CceEEEEEecCCCCHHHHHHHHHHHHHHHHhc
Confidence 44 44432 345678888777743 47999999999999999999999999986544
No 4
>PRK08719 ribonuclease H; Reviewed
Probab=99.82 E-value=3.7e-20 Score=181.45 Aligned_cols=126 Identities=20% Similarity=0.188 Sum_probs=93.6
Q ss_pred CCcEEEeccccccCC-----CcceEEecCC--Ccc--ceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHH
Q psy16749 673 PNAICIYTDASKKNE-----KVGAAWFCPT--YKS--KACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKS 743 (821)
Q Consensus 673 ~~~i~IYTDGS~~~g-----~vGagvv~~~--~~~--~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqs 743 (821)
+..++||||||+..+ ..|+|+++.. +.. ...+.+....+|++|||.||+.||+.+.+. ..|++||++
T Consensus 2 ~~~~~iYtDGs~~~n~~~~~~~G~G~vv~~~~~~~~~~~~~~~~~~~Tnn~aEl~A~~~aL~~~~~~----~~i~tDS~y 77 (147)
T PRK08719 2 RASYSIYIDGAAPNNQHGCVRGGIGLVVYDEAGEIVDEQSITVNRYTDNAELELLALIEALEYARDG----DVIYSDSDY 77 (147)
T ss_pred CceEEEEEecccCCCCCCCCCcEEEEEEEeCCCCeeEEEEecCCCCccHHHHHHHHHHHHHHHcCCC----CEEEechHH
Confidence 356899999999853 2478877543 222 234445556799999999999999998753 489999999
Q ss_pred HHHHHhhcc----cCCCCCC------chHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhh
Q psy16749 744 ACQKLSKNC----LNTTPTH------LELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATV 804 (821)
Q Consensus 744 AL~~L~~~~----~~~~p~~------~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~ 804 (821)
|+++|+.|. .++|... +.++++.+..++. ...|+|.|||||+|++||+.||+||++|..
T Consensus 78 vi~~i~~~~~~W~~~~w~~s~g~~v~n~dl~~~i~~l~~--~~~i~~~~VkgH~g~~~Ne~aD~lA~~a~~ 146 (147)
T PRK08719 78 CVRGFNEWLDTWKQKGWRKSDKKPVANRDLWQQVDELRA--RKYVEVEKVTAHSGIEGNEAADMLAQAAAE 146 (147)
T ss_pred HHHHHHHHHHHHHhCCcccCCCcccccHHHHHHHHHHhC--CCcEEEEEecCCCCChhHHHHHHHHHHHhh
Confidence 999986543 3445333 2345555554443 467999999999999999999999999875
No 5
>PRK00203 rnhA ribonuclease H; Reviewed
Probab=99.82 E-value=5.4e-20 Score=180.30 Aligned_cols=130 Identities=22% Similarity=0.158 Sum_probs=98.7
Q ss_pred cEEEeccccccC--CCcceEEecCCCccce-eeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhc
Q psy16749 675 AICIYTDASKKN--EKVGAAWFCPTYKSKA-CFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKN 751 (821)
Q Consensus 675 ~i~IYTDGS~~~--g~vGagvv~~~~~~~~-~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~ 751 (821)
.+.||||||+.. +..|+|+|+...+... ........++++|||+||+.||+++.+ ...+.|||||++++++|..|
T Consensus 3 ~v~iytDGs~~~n~~~~g~g~v~~~~~~~~~~~~~~~~~TN~~aEL~Ai~~AL~~~~~--~~~v~I~tDS~yvi~~i~~w 80 (150)
T PRK00203 3 QVEIYTDGACLGNPGPGGWGAILRYKGHEKELSGGEALTTNNRMELMAAIEALEALKE--PCEVTLYTDSQYVRQGITEW 80 (150)
T ss_pred eEEEEEEecccCCCCceEEEEEEEECCeeEEEecCCCCCcHHHHHHHHHHHHHHHcCC--CCeEEEEECHHHHHHHHHHH
Confidence 488999999983 5667777764333221 112234678999999999999998865 36799999999999999875
Q ss_pred cc----CCCCC------CchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcCCC
Q psy16749 752 CL----NTTPT------HLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHGEA 808 (821)
Q Consensus 752 ~~----~~~p~------~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~~~ 808 (821)
.. ++|.. .+.++++++..++. ...|.|.|||||+|++||+.||+|||+|+.....
T Consensus 81 ~~~Wk~~~~~~~~g~~v~n~dl~~~i~~l~~--~~~v~~~wV~~H~~~~~N~~AD~lA~~a~~~~~~ 145 (150)
T PRK00203 81 IHGWKKNGWKTADKKPVKNVDLWQRLDAALK--RHQIKWHWVKGHAGHPENERCDELARAGAEEATL 145 (150)
T ss_pred HHHHHHcCCcccCCCccccHHHHHHHHHHhc--cCceEEEEecCCCCCHHHHHHHHHHHHHHHHhhh
Confidence 43 34432 23456777776663 3789999999999999999999999999876543
No 6
>cd06222 RnaseH RNase H (RNase HI) is an endonuclease that cleaves the RNA strand of an RNA/DNA hybrid in a not sequence-specific manner. One of the important functions of RNase H is to remove Okazaki fragments during DNA replication. RNase H knockout mice lack mitochondrial DNA replication and die as embryos. The retroviral reverse transcriptase contains an RNase H domain that plays an important role in converting a single stranded retroviral genomic RNA into a dsDNA for integration into host chromosomes. RNase H inhibitors have been explored as an anti-HIV drug target because RNase H inactivation inhibits reverse transcription.
Probab=99.70 E-value=1.8e-16 Score=143.89 Aligned_cols=125 Identities=28% Similarity=0.314 Sum_probs=97.5
Q ss_pred EEeccccccC--CCcceEEecCCCcc--ce-eeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhc
Q psy16749 677 CIYTDASKKN--EKVGAAWFCPTYKS--KA-CFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKN 751 (821)
Q Consensus 677 ~IYTDGS~~~--g~vGagvv~~~~~~--~~-~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~ 751 (821)
.+|+|||... ++.|+|+++.+.+. .. ........+++.+|+.||+.||+++...+...+.|++||+.+++.++++
T Consensus 1 ~~~~Dgs~~~~~~~~g~g~v~~~~~~~~~~~~~~~~~~~s~~~aEl~al~~al~~~~~~~~~~i~i~~Ds~~~~~~~~~~ 80 (130)
T cd06222 1 VIYTDGSCRGNPGPAGAGVVLRDPGGEVLLSGGLLGGNTTNNRAELLALIEALELALELGGKKVNIYTDSQYVINALTGW 80 (130)
T ss_pred CEEecccCCCCCCceEEEEEEEeCCCeEEEeccccCCCCcHHHHHHHHHHHHHHHHHhCCCceEEEEECHHHHHHHhhcc
Confidence 4899999985 48888888754432 11 1122256799999999999999999988889999999999999999984
Q ss_pred ccCCCCCCchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHh
Q psy16749 752 CLNTTPTHLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYAT 803 (821)
Q Consensus 752 ~~~~~p~~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~ 803 (821)
... .......++.++.+.+ +.+..+.|.|||+|+|+.+|+.||.|||.++
T Consensus 81 ~~~-~~~~~~~~~~~i~~~~-~~~~~~~i~~v~~h~~~~~n~~ad~la~~~~ 130 (130)
T cd06222 81 YEG-KPVKNVDLWQRLLALL-KRFHKVRFEWVPGHSGIEGNERADALAKEAA 130 (130)
T ss_pred ccC-CChhhHHHHHHHHHHH-hCCCeEEEEEcCCCCCCcchHHHHHHHHhhC
Confidence 321 1123344566666655 5678999999999999999999999999874
No 7
>KOG3752|consensus
Probab=99.69 E-value=1.5e-16 Score=173.03 Aligned_cols=131 Identities=25% Similarity=0.223 Sum_probs=106.4
Q ss_pred cEEEeccccccC-----CCcceEEecCCCc-cceeeccC-CCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHH
Q psy16749 675 AICIYTDASKKN-----EKVGAAWFCPTYK-SKACFKLH-PATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQK 747 (821)
Q Consensus 675 ~i~IYTDGS~~~-----g~vGagvv~~~~~-~~~~~~l~-~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~ 747 (821)
..++|||||+.. ..+|+|||++++. ....+.+. +..++|+|||.||.+||+.+.+..+.+|+|.|||.++++.
T Consensus 212 ~~vvytDGS~~~ng~~~~~AGyGvywg~~~e~N~s~pv~~g~qtNnrAEl~Av~~ALkka~~~~~~kv~I~TDS~~~i~~ 291 (371)
T KOG3752|consen 212 IQVVYTDGSSSGNGRKSSRAGYGVYWGPGHELNVSGPLAGGRQTNNRAELIAAIEALKKARSKNINKVVIRTDSEYFINS 291 (371)
T ss_pred ceEEEecCccccCCCCCCcceeEEeeCCCCcccccccCCCCcccccHHHHHHHHHHHHHHHhcCCCcEEEEechHHHHHH
Confidence 489999999995 2689999997664 46667775 7899999999999999999999888899999999999999
Q ss_pred HhhcccC----CCCC-----------CchHHHHHHHHHHHh-cCCcEEEEEecCCCCCcccHHHHHHHHHHhhc
Q psy16749 748 LSKNCLN----TTPT-----------HLELEILSSYKHLQN-TCKTVKLAWIKGHEGIKGNVEVDRLAKYATVH 805 (821)
Q Consensus 748 L~~~~~~----~~p~-----------~~s~~i~~i~~~L~~-~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~ 805 (821)
|+.|..+ +|-. .....+.++.++++. .+.+|.+.||+||.||.||++||.||+++..+
T Consensus 292 l~~wv~~~k~~~~k~~~~~~~i~~~v~n~~~~~e~~~l~q~~~~~~vq~~~V~Gh~gi~gne~Ad~lARkgs~~ 365 (371)
T KOG3752|consen 292 LTLWVQGWKKNGWKTSNGSDRICAYVKNQDFFNELDELEQEISNKKVQQEYVGGHSGILGNEMADALARKGSTL 365 (371)
T ss_pred HHHHHhhhccCccccccCCCccceeeecchHHHHHHHHHhhhccCceEEEEecCcCCcchHHHHHHHHhhhhhh
Confidence 9988763 1211 112235555565554 57899999999999999999999999998543
No 8
>PRK13907 rnhA ribonuclease H; Provisional
Probab=99.61 E-value=4.6e-15 Score=141.02 Aligned_cols=122 Identities=21% Similarity=0.264 Sum_probs=95.9
Q ss_pred EEEeccccccC--CCcceEEecCCCcc--ceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhc
Q psy16749 676 ICIYTDASKKN--EKVGAAWFCPTYKS--KACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKN 751 (821)
Q Consensus 676 i~IYTDGS~~~--g~vGagvv~~~~~~--~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~ 751 (821)
+.||+|||+.. |..|+|+|+.+.+. ...+. ....+++.||+.|++.||+++.+.+...+.|++||+++++.|+++
T Consensus 2 ~~iy~DGa~~~~~g~~G~G~vi~~~~~~~~~~~~-~~~~tn~~AE~~All~aL~~a~~~g~~~v~i~sDS~~vi~~~~~~ 80 (128)
T PRK13907 2 IEVYIDGASKGNPGPSGAGVFIKGVQPAVQLSLP-LGTMSNHEAEYHALLAALKYCTEHNYNIVSFRTDSQLVERAVEKE 80 (128)
T ss_pred EEEEEeeCCCCCCCccEEEEEEEECCeeEEEEec-ccccCCcHHHHHHHHHHHHHHHhCCCCEEEEEechHHHHHHHhHH
Confidence 68999999994 67888988854322 22222 245799999999999999999998889999999999999999882
Q ss_pred ccCCCCCCchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhc
Q psy16749 752 CLNTTPTHLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVH 805 (821)
Q Consensus 752 ~~~~~p~~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~ 805 (821)
... ......++.++..++ ..+..+.|.|||+ .+|+.||.||+.|+..
T Consensus 81 ~~~--~~~~~~l~~~~~~l~-~~f~~~~~~~v~r----~~N~~Ad~LA~~a~~~ 127 (128)
T PRK13907 81 YAK--NKMFAPLLEEALQYI-KSFDLFFIKWIPS----SQNKVADELARKAILQ 127 (128)
T ss_pred Hhc--ChhHHHHHHHHHHHH-hcCCceEEEEcCc----hhchhHHHHHHHHHhc
Confidence 221 123456777777766 4677888999996 5899999999999864
No 9
>PRK07708 hypothetical protein; Validated
Probab=99.51 E-value=1.4e-13 Score=143.50 Aligned_cols=130 Identities=18% Similarity=0.090 Sum_probs=96.1
Q ss_pred hhCCCcEEEecccccc--CCCcceEEecCC--Cccce----eeccCCCCCcchhHHHHHHHHHHHHHhcCCCc--EEEEc
Q psy16749 670 EKCPNAICIYTDASKK--NEKVGAAWFCPT--YKSKA----CFKLHPATSTYTAEVIGIWEALKYSASLKNNE--ILILT 739 (821)
Q Consensus 670 ~~~~~~i~IYTDGS~~--~g~vGagvv~~~--~~~~~----~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~--I~I~S 739 (821)
...++.+++|+|||+. .++.|+|+++.. ++... ...+....+|+.||+.|++.||+++.+.+.+. |.|++
T Consensus 68 ~~ep~~~~vY~DGs~~~n~g~aG~GvVI~~~~g~~~~~~~~~~~l~~~~TNN~AEy~Ali~aL~~A~e~g~~~~~V~I~~ 147 (219)
T PRK07708 68 EEEPHEILVYFDGGFDKETKLAGLGIVIYYKQGNKRYRIRRNAYIEGIYDNNEAEYAALYYAMQELEELGVKHEPVTFRG 147 (219)
T ss_pred ccCCCcEEEEEeeccCCCCCCcEEEEEEEECCCCEEEEEEeeccccccccCcHHHHHHHHHHHHHHHHcCCCcceEEEEe
Confidence 4456789999999997 467788887642 32221 22455568999999999999999999987655 89999
Q ss_pred chHHHHHHHhhcccCCCCCCc---hHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcCC
Q psy16749 740 DSKSACQKLSKNCLNTTPTHL---ELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHGE 807 (821)
Q Consensus 740 DSqsAL~~L~~~~~~~~p~~~---s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~~ 807 (821)
||+.+++.|++ .|.... ..+..++..++.+....+.|.||| +..|+.||+||+.|++...
T Consensus 148 DSqlVi~qi~g----~wk~~~~~l~~y~~~i~~l~~~~~l~~~~~~Vp----R~~N~~AD~LAk~Al~~~~ 210 (219)
T PRK07708 148 DSQVVLNQLAG----EWPCYDEHLNHWLDRIEQKLKQLKLTPVYEPIS----RKQNKEADQLATQALEGTV 210 (219)
T ss_pred ccHHHHHHhCC----CceeCChhHHHHHHHHHHHHhhCCceEEEEECC----chhhhHHHHHHHHHHhcCC
Confidence 99999999987 222222 234455555565555567888987 5779999999999998543
No 10
>PRK07238 bifunctional RNase H/acid phosphatase; Provisional
Probab=99.35 E-value=4.9e-12 Score=141.05 Aligned_cols=123 Identities=18% Similarity=0.172 Sum_probs=94.8
Q ss_pred cEEEeccccccC--CCcceEEecCCC--c-c--ceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHH
Q psy16749 675 AICIYTDASKKN--EKVGAAWFCPTY--K-S--KACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQK 747 (821)
Q Consensus 675 ~i~IYTDGS~~~--g~vGagvv~~~~--~-~--~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~ 747 (821)
.+.||+|||+.. |..|+|+++.+. + . .....+ ...+|+.||+.|++.||+++.+.+...|.|++||+++++.
T Consensus 2 ~~~i~~DGa~~~n~g~aG~G~vi~~~~~~~~~~~~~~~~-~~~tnn~AE~~All~gL~~a~~~g~~~v~i~~DS~lvi~~ 80 (372)
T PRK07238 2 KVVVEADGGSRGNPGPAGYGAVVWDADRGEVLAERAEAI-GRATNNVAEYRGLIAGLEAAAELGATEVEVRMDSKLVVEQ 80 (372)
T ss_pred eEEEEecCCCCCCCCceEEEEEEEeCCCCcEEEEeeccc-CCCCchHHHHHHHHHHHHHHHhCCCCeEEEEeCcHHHHHH
Confidence 368999999995 567888887543 2 1 122333 3678999999999999999999988999999999999999
Q ss_pred HhhcccCCCCCCc---hHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcCC
Q psy16749 748 LSKNCLNTTPTHL---ELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHGE 807 (821)
Q Consensus 748 L~~~~~~~~p~~~---s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~~ 807 (821)
|++ +|...+ ..++.++..++ +++..|.|.|||+ .+|+.||.||+.|+....
T Consensus 81 i~~----~~~~~~~~l~~~~~~i~~l~-~~f~~~~i~~v~r----~~N~~AD~LA~~a~~~~~ 134 (372)
T PRK07238 81 MSG----RWKVKHPDMKPLAAQARELA-SQFGRVTYTWIPR----ARNAHADRLANEAMDAAA 134 (372)
T ss_pred hCC----CCccCChHHHHHHHHHHHHH-hcCCceEEEECCc----hhhhHHHHHHHHHHHhhc
Confidence 976 222222 23455555554 5678899999995 789999999999987654
No 11
>PF13456 RVT_3: Reverse transcriptase-like; PDB: 3ALY_A 2EHG_A 3HST_B.
Probab=99.01 E-value=2.2e-09 Score=93.81 Aligned_cols=85 Identities=24% Similarity=0.256 Sum_probs=67.6
Q ss_pred chhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhcccCCCCCCchHHHHHHHHHHHhcCCcEEEEEecCCCCCccc
Q psy16749 713 YTAEVIGIWEALKYSASLKNNEILILTDSKSACQKLSKNCLNTTPTHLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGN 792 (821)
Q Consensus 713 ~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~~~~~~p~~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GN 792 (821)
..||+.|++.||+++.+.+.++|.|++||+.+++.|++... .......++.++...+. ....+.|.||+ +++|
T Consensus 2 ~~aE~~al~~al~~a~~~g~~~i~v~sDs~~vv~~i~~~~~--~~~~~~~~~~~i~~~~~-~~~~~~~~~i~----r~~N 74 (87)
T PF13456_consen 2 LEAEALALLEALQLAWELGIRKIIVESDSQLVVDAINGRSS--SRSELRPLIQDIRSLLD-RFWNVSVSHIP----REQN 74 (87)
T ss_dssp HHHHHHHHHHHHHHHHCCT-SCEEEEES-HHHHHHHTTSS-----SCCHHHHHHHHHHHC-CCSCEEEEE------GGGS
T ss_pred cHHHHHHHHHHHHHHHHCCCCEEEEEecCcccccccccccc--ccccccccchhhhhhhc-cccceEEEEEC----hHHh
Confidence 57999999999999999999999999999999999988211 12344567778777774 58899999998 7999
Q ss_pred HHHHHHHHHHhh
Q psy16749 793 VEVDRLAKYATV 804 (821)
Q Consensus 793 E~AD~LAK~A~~ 804 (821)
..||.|||.|..
T Consensus 75 ~~A~~LA~~a~~ 86 (87)
T PF13456_consen 75 KVADALAKFALS 86 (87)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHHhh
Confidence 999999999874
No 12
>PF09004 DUF1891: Domain of unknown function (DUF1891); InterPro: IPR015095 This domain is found at the extreme N terminus of eukaryotic alkylated DNA repair protein homologs. ; GO: 0008168 methyltransferase activity, 0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors, 0055114 oxidation-reduction process; PDB: 2CQ2_A 3THT_C 3THP_A.
Probab=89.53 E-value=0.051 Score=42.99 Aligned_cols=41 Identities=7% Similarity=0.184 Sum_probs=7.7
Q ss_pred chHHHHHHHHHHHHHHHHHHhhccCCCCCChHHHHHHHHHHhH
Q psy16749 467 IHYPMIENLPDCNKYLNIMKMICNKHWGMNPTIGLNYYKATIR 509 (821)
Q Consensus 467 f~~HI~~i~~KA~k~ln~Lr~L~~~~wGls~~~lr~LYkAvVr 509 (821)
|+.|+..+.+|+..++..||+|.. .+++...+..+|+++|.
T Consensus 1 W~~n~~~~~KKa~qRlyFLRkl~k--~~~~~~~l~lfY~s~Ie 41 (42)
T PF09004_consen 1 WTSNTTSLYKKAQQRLYFLRKLRK--FNVDSKLLTLFYHSVIE 41 (42)
T ss_dssp ---------------------------SS-----SHTTTTT--
T ss_pred CCcchHHHHHHHHhHHHHHHHHHH--ccchhHHHHHHHHHHhc
Confidence 889999999999999999999997 46788888999988875
No 13
>PHA03378 EBNA-3B; Provisional
Probab=88.15 E-value=6.1 Score=47.62 Aligned_cols=24 Identities=21% Similarity=0.371 Sum_probs=11.9
Q ss_pred cCcchhhhccCCcccccCCCCcch
Q psy16749 156 ETNSETAEKETPLSEVPVIPQEAQ 179 (821)
Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~ 179 (821)
|+.-|-.+|.--+--+|-+|.++-
T Consensus 510 e~d~e~~eq~vmat~l~~~p~~p~ 533 (991)
T PHA03378 510 EKDDEDMEQRVMATLLPPSPPQPR 533 (991)
T ss_pred hhchHHHHHHHHHhhCCCCCCCCc
Confidence 344445555544444555555553
No 14
>PHA03378 EBNA-3B; Provisional
Probab=87.90 E-value=6.4 Score=47.50 Aligned_cols=15 Identities=13% Similarity=-0.132 Sum_probs=6.3
Q ss_pred CCCcCCcchhhhhcc
Q psy16749 36 KPQEHDDLTFETKES 50 (821)
Q Consensus 36 ~~~~~~~~~~~~~~~ 50 (821)
.|.++-++-|++--+
T Consensus 476 ~p~~pl~~~~~~~vi 490 (991)
T PHA03378 476 EPWQPLPHPQVTPVI 490 (991)
T ss_pred CCCCCCCCcceeeeE
Confidence 344444444444433
No 15
>PRK14950 DNA polymerase III subunits gamma and tau; Provisional
Probab=81.37 E-value=7.5 Score=46.86 Aligned_cols=8 Identities=0% Similarity=0.127 Sum_probs=3.5
Q ss_pred hhhhhhhH
Q psy16749 525 SKLKTLDK 532 (821)
Q Consensus 525 ~~ikkLe~ 532 (821)
.+.++|+.
T Consensus 511 ~~~~~~~~ 518 (585)
T PRK14950 511 FHKDKIEE 518 (585)
T ss_pred HHHHHhhc
Confidence 34444444
No 16
>PF05380 Peptidase_A17: Pao retrotransposon peptidase ; InterPro: IPR008042 This signature identifies members of the Pao retrotransposon family.
Probab=79.92 E-value=3.7 Score=41.04 Aligned_cols=54 Identities=11% Similarity=0.183 Sum_probs=36.1
Q ss_pred CcEEEeccccccCCCcceEEecCC---Ccccee-----ecc--CCCCCcchhHHHHHHHHHHHHHh
Q psy16749 674 NAICIYTDASKKNEKVGAAWFCPT---YKSKAC-----FKL--HPATSTYTAEVIGIWEALKYSAS 729 (821)
Q Consensus 674 ~~i~IYTDGS~~~g~vGagvv~~~---~~~~~~-----~~l--~~~~T~~~AEL~AI~~ALe~a~~ 729 (821)
..+++|+|+|. .++|+.+|+.. ++.... .++ -...++-+-||+|+..|.+++..
T Consensus 80 ~~L~~F~DAS~--~aygavvYlr~~~~~~~~~~ll~aKsrv~P~k~~tIPRlEL~a~~l~~~l~~~ 143 (159)
T PF05380_consen 80 VELHVFCDASE--SAYGAVVYLRSYSDGSVQVRLLFAKSRVAPLKTVTIPRLELLAALLGVRLANT 143 (159)
T ss_pred eeeeEeecccc--cceeeEeEeeeccCCceeeeeeeecccccCCCCCcHHHHHHHHHHHHHHHHHH
Confidence 36899999994 46676666533 221111 122 13458999999999999998864
No 17
>PRK14900 valS valyl-tRNA synthetase; Provisional
Probab=79.58 E-value=0.61 Score=59.75 Aligned_cols=15 Identities=20% Similarity=0.209 Sum_probs=7.3
Q ss_pred CCccccccccccccc
Q psy16749 135 PAVDLTQDIVEEKEA 149 (821)
Q Consensus 135 ~~~~~~~~~~~~~~~ 149 (821)
..|.+..+-.|+|+.
T Consensus 907 ~~~~~~~~~~~~~~~ 921 (1052)
T PRK14900 907 EANSARRDTMEIQNE 921 (1052)
T ss_pred ccccccccccccccc
Confidence 445555555544443
No 18
>PRK14900 valS valyl-tRNA synthetase; Provisional
Probab=69.57 E-value=7.5 Score=50.14 Aligned_cols=10 Identities=0% Similarity=0.209 Sum_probs=4.2
Q ss_pred chhcccccCC
Q psy16749 89 DFISAHTEKT 98 (821)
Q Consensus 89 ~~~~~~~~~~ 98 (821)
+|+....+|.
T Consensus 877 ~~veka~~kl 886 (1052)
T PRK14900 877 AVVEKDRARA 886 (1052)
T ss_pred HHHHHHHHHH
Confidence 4444444443
No 19
>COG3341 Predicted double-stranded RNA/RNA-DNA hybrid binding protein [General function prediction only]
Probab=68.68 E-value=11 Score=39.94 Aligned_cols=139 Identities=10% Similarity=-0.034 Sum_probs=85.8
Q ss_pred cEEEecccccc--CCCcceEEecCCCcccee-----eccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEE-E----cchH
Q psy16749 675 AICIYTDASKK--NEKVGAAWFCPTYKSKAC-----FKLHPATSTYTAEVIGIWEALKYSASLKNNEILI-L----TDSK 742 (821)
Q Consensus 675 ~i~IYTDGS~~--~g~vGagvv~~~~~~~~~-----~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I-~----SDSq 742 (821)
.+.-+.+|+.. ++..++-++.....+... ..+ ...++..+|.+|++.+|+.+...+.++.+| | .||+
T Consensus 65 ~~i~~~~G~y~~~p~t~~~k~yr~k~~~~~~~lt~~~~~-~~~~n~s~d~la~ly~~~~~~~~~nrk~~i~y~~~~~ds~ 143 (225)
T COG3341 65 YIISWAKGDYDAKPGTQEFKEYRGKCTIEYSWLTESSEF-SIKSNDSGDVLAKLYGLRYEVPLDNRKSVINYLTPGNDSW 143 (225)
T ss_pred ccceeccCCccccCCCcceeEEeccccccceeeeeeccc-ccccCchHHHHHHhccccccccccCccceeeccCCcchhH
Confidence 45566777666 344455544432211111 112 334678899999999999998877766666 6 6999
Q ss_pred HHHHHHhhcccCCCCCCchHHHHHHHHHHHhcCCcEEEEEecCCCCCcccHHHHHHHHHHhhcCCCCccccCCCCCC
Q psy16749 743 SACQKLSKNCLNTTPTHLELEILSSYKHLQNTCKTVKLAWIKGHEGIKGNVEVDRLAKYATVHGEASTIVISPTNTE 819 (821)
Q Consensus 743 sAL~~L~~~~~~~~p~~~s~~i~~i~~~L~~~~~~V~~~WVPGHsGI~GNE~AD~LAK~A~~~~~~~~i~~~~~~di 819 (821)
+.+.-+.+-.. ....+....-.-+.-.+....+.|+--|.++.-+...+.+..........+...++-|+++
T Consensus 144 a~~k~~k~~~~-----~k~~~~~~~e~~~~~~~v~h~~k~i~~~~~~~~~~~~~s~~~~~~k~~~~~~~~~~~~~~~ 215 (225)
T COG3341 144 AYFKYVKDKCE-----TKVNLLKDREFFSIGKGVFHDEKDINIHIWIFESKKGNSHVYNTSKKELSPKYHDIKFEDY 215 (225)
T ss_pred HHHHHHhhhhh-----hHHHHHHhHHhhhccchhhhhhhhcccccccchhhhhhhhhhchhhhhccccccccccccc
Confidence 88887766110 0011111100011112345567899999999999999999888887777776667666664
No 20
>PRK14950 DNA polymerase III subunits gamma and tau; Provisional
Probab=66.26 E-value=28 Score=42.05 Aligned_cols=8 Identities=0% Similarity=-0.126 Sum_probs=3.1
Q ss_pred HHHHHHHH
Q psy16749 477 DCNKYLNI 484 (821)
Q Consensus 477 KA~k~ln~ 484 (821)
+.+.++..
T Consensus 480 ~~~~~~~~ 487 (585)
T PRK14950 480 PRSPAVQA 487 (585)
T ss_pred hcCHHHHH
Confidence 33444433
No 21
>PHA03269 envelope glycoprotein C; Provisional
Probab=58.73 E-value=31 Score=40.61 Aligned_cols=17 Identities=24% Similarity=0.403 Sum_probs=10.2
Q ss_pred HHHHhHHHHhhcccccccc
Q psy16749 504 YKATIRATLDFGSVFYSES 522 (821)
Q Consensus 504 YkAvVrPiL~YGa~VW~~~ 522 (821)
|+.+||| +.-..+|...
T Consensus 216 ~~~~~~~--~~~~~~~~~~ 232 (566)
T PHA03269 216 WKPLIRP--DLHIEIWFKK 232 (566)
T ss_pred cccccCC--ceeEEEEeec
Confidence 5667777 3445677643
No 22
>PF07174 FAP: Fibronectin-attachment protein (FAP); InterPro: IPR010801 This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix [].; GO: 0050840 extracellular matrix binding, 0005576 extracellular region
Probab=56.06 E-value=79 Score=34.80 Aligned_cols=51 Identities=16% Similarity=0.071 Sum_probs=31.1
Q ss_pred cEEEeccccccCCCcceEEecC-------CC--ccceeeccCC-CCCcchhHHHHHHHHHH
Q psy16749 675 AICIYTDASKKNEKVGAAWFCP-------TY--KSKACFKLHP-ATSTYTAEVIGIWEALK 725 (821)
Q Consensus 675 ~i~IYTDGS~~~g~vGagvv~~-------~~--~~~~~~~l~~-~~T~~~AEL~AI~~ALe 725 (821)
+-+-|+|.++-+|.+..|||-. .+ +..+...|+. ..-+-.++..++.+.|+
T Consensus 223 yevkf~d~~kp~gqiw~~vvg~p~~~~~~~~~~~rwfvvwlgt~~~pvd~~~a~~la~si~ 283 (297)
T PF07174_consen 223 YEVKFTDANKPNGQIWAGVVGSPVAPGTPRGTPQRWFVVWLGTANNPVDKGAAKALAESIR 283 (297)
T ss_pred EEEEeccCCCCCCceEEEeecCcCCCCCCCCCCceEEEEEecCCCCCCCHHHHHHHHhhcc
Confidence 3468999999999999888853 01 1233333432 33455666666655543
No 23
>KOG1924|consensus
Probab=44.55 E-value=1.2e+02 Score=37.99 Aligned_cols=28 Identities=29% Similarity=0.506 Sum_probs=14.5
Q ss_pred hhhhhhHHHHHHHHHHhCCCCCChHHHHH
Q psy16749 526 KLKTLDKVQNQALRLAMGYLNSTPIDNIL 554 (821)
Q Consensus 526 ~ikkLe~vQr~AlR~ItGAfRTTp~~aLl 554 (821)
.++.|+.--.+-|-+.+|.|| .|++.|.
T Consensus 692 el~ilDsKtaQnLsIflgS~r-mpyeeik 719 (1102)
T KOG1924|consen 692 ELRILDSKTAQNLSIFLGSFR-MPYEEIK 719 (1102)
T ss_pred hheecchHHHHHHHHHHhhcc-CCHHHHH
Confidence 344455433334445666666 5555543
No 24
>PF00336 DNA_pol_viral_C: DNA polymerase (viral) C-terminal domain; InterPro: IPR001462 This domain is at the C terminus of hepatitis B-type viruses P proteins and represents a functional domain that controls the RNase H activities of the protein. The domain is always associated with IPR000201 from INTERPRO and .; GO: 0004523 ribonuclease H activity
Probab=42.23 E-value=24 Score=37.19 Aligned_cols=63 Identities=19% Similarity=0.181 Sum_probs=39.3
Q ss_pred CCcEEEeccccccCCCcceEEecCCCccceeeccCCCCCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHH
Q psy16749 673 PNAICIYTDASKKNEKVGAAWFCPTYKSKACFKLHPATSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQ 746 (821)
Q Consensus 673 ~~~i~IYTDGS~~~g~vGagvv~~~~~~~~~~~l~~~~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~ 746 (821)
..--.||+|+.-. |.|+.+..+.....|.. -..++.+||++...|.-+.. .++ |.+||..|+.
T Consensus 92 ~~lc~VfaDATpT----gwgi~i~~~~~~~Tfs~--~l~IhtaELlaaClAr~~~~----~r~-l~tDnt~Vls 154 (245)
T PF00336_consen 92 PGLCQVFADATPT----GWGISITGQRMRGTFSK--PLPIHTAELLAACLARLMSG----ARC-LGTDNTVVLS 154 (245)
T ss_pred CCCCceeccCCCC----cceeeecCceeeeeecc--cccchHHHHHHHHHHHhccC----CcE-EeecCcEEEe
Confidence 3345799998876 33333332333333332 34688999999988766542 344 8999986643
No 25
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=40.92 E-value=17 Score=44.74 Aligned_cols=21 Identities=48% Similarity=0.423 Sum_probs=8.4
Q ss_pred CcCCCCCccccCCCCCCcccc
Q psy16749 417 ATATRPATTTSKPATTTSTDI 437 (821)
Q Consensus 417 ~~~~~~~~~~~~~~~~~~~~~ 437 (821)
.+.+.+.|+|+-+.+||.++|
T Consensus 830 Qts~~TtTtTsps~ttt~~~i 850 (872)
T COG3889 830 QTSTSTTTTTSPSQTTTGGGI 850 (872)
T ss_pred cccccceeeeccccccccccc
Confidence 333333444444444444433
No 26
>PRK07764 DNA polymerase III subunits gamma and tau; Validated
Probab=40.52 E-value=1.2e+02 Score=38.55 Aligned_cols=8 Identities=38% Similarity=0.517 Sum_probs=3.2
Q ss_pred HHHHHHhC
Q psy16749 536 QALRLAMG 543 (821)
Q Consensus 536 ~AlR~ItG 543 (821)
.+++-.+|
T Consensus 572 ~al~e~lG 579 (824)
T PRK07764 572 TALAEELG 579 (824)
T ss_pred HHHHHHhC
Confidence 33444443
No 27
>COG3147 DedD Uncharacterized protein conserved in bacteria [Function unknown]
Probab=39.92 E-value=1.2e+02 Score=32.38 Aligned_cols=6 Identities=50% Similarity=0.927 Sum_probs=2.1
Q ss_pred cCCCCc
Q psy16749 172 PVIPQE 177 (821)
Q Consensus 172 ~~~~~~ 177 (821)
|-.+.|
T Consensus 67 P~~~pe 72 (226)
T COG3147 67 PTQPPE 72 (226)
T ss_pred CCCCCc
Confidence 333333
No 28
>PRK12270 kgd alpha-ketoglutarate decarboxylase; Reviewed
Probab=33.47 E-value=1.3e+02 Score=38.72 Aligned_cols=12 Identities=17% Similarity=0.354 Sum_probs=6.5
Q ss_pred HHhhcccccccc
Q psy16749 511 TLDFGSVFYSES 522 (821)
Q Consensus 511 iL~YGa~VW~~~ 522 (821)
++.||..+|-..
T Consensus 421 ~~~hGLTlWDLd 432 (1228)
T PRK12270 421 VLTHGLTLWDLD 432 (1228)
T ss_pred ccccCCCHHhcC
Confidence 344666666543
No 29
>KOG1812|consensus
Probab=31.01 E-value=91 Score=35.95 Aligned_cols=37 Identities=22% Similarity=0.285 Sum_probs=32.6
Q ss_pred CCcchhHHHHHHHHHHHHHhcCCCcEEEEcchHHHHH
Q psy16749 710 TSTYTAEVIGIWEALKYSASLKNNEILILTDSKSACQ 746 (821)
Q Consensus 710 ~T~~~AEL~AI~~ALe~a~~~~~~~I~I~SDSqsAL~ 746 (821)
.+...+|++|+..+|..+.+.++..+.+++|+.....
T Consensus 47 ~~~~~ae~~al~~~l~ea~~~~~~~~~~~~d~~~~~~ 83 (384)
T KOG1812|consen 47 ITPLEAELMALKRGLTEALELGLNHIVIYCDDELIYE 83 (384)
T ss_pred cchhhHHHHHHhhccHHHHhhccccceEecccHHHHH
Confidence 6788999999999999999999999999999654444
No 30
>KOG1924|consensus
Probab=30.67 E-value=3e+02 Score=34.72 Aligned_cols=9 Identities=33% Similarity=0.575 Sum_probs=5.1
Q ss_pred hHHHHhhhh
Q psy16749 105 KEEVLDDLV 113 (821)
Q Consensus 105 ~~~~~~~~~ 113 (821)
.+|-+-|+|
T Consensus 428 IEecISqIv 436 (1102)
T KOG1924|consen 428 IEECISQIV 436 (1102)
T ss_pred HHHHHHHHH
Confidence 555555555
No 31
>PRK12270 kgd alpha-ketoglutarate decarboxylase; Reviewed
Probab=30.51 E-value=1.7e+02 Score=37.95 Aligned_cols=9 Identities=33% Similarity=0.471 Sum_probs=4.5
Q ss_pred cEEEEcchH
Q psy16749 734 EILILTDSK 742 (821)
Q Consensus 734 ~I~I~SDSq 742 (821)
.|.|.-|..
T Consensus 650 pi~~hGdaa 658 (1228)
T PRK12270 650 PILLHGDAA 658 (1228)
T ss_pred EEEEecccc
Confidence 445555543
No 32
>KOG0559|consensus
Probab=29.22 E-value=1.8e+02 Score=33.41 Aligned_cols=13 Identities=38% Similarity=0.560 Sum_probs=5.4
Q ss_pred CCCCCCCCCCCCC
Q psy16749 249 AKPAISPVKKTAT 261 (821)
Q Consensus 249 ~~p~~~~~~~~~~ 261 (821)
|+|+.++.++.++
T Consensus 175 p~~~s~~~p~~~a 187 (457)
T KOG0559|consen 175 PKPSSKPPPKEAA 187 (457)
T ss_pred CCccCCCCccccC
Confidence 4444444444333
No 33
>PTZ00436 60S ribosomal protein L19-like protein; Provisional
Probab=28.95 E-value=2.3e+02 Score=31.83 Aligned_cols=6 Identities=17% Similarity=0.285 Sum_probs=2.5
Q ss_pred HHhhhh
Q psy16749 108 VLDDLV 113 (821)
Q Consensus 108 ~~~~~~ 113 (821)
+|.+++
T Consensus 154 ~L~dQ~ 159 (357)
T PTZ00436 154 QLAEQL 159 (357)
T ss_pred HHHHHH
Confidence 444443
No 34
>PF15324 TALPID3: Hedgehog signalling target
Probab=23.28 E-value=1.3e+03 Score=30.46 Aligned_cols=10 Identities=20% Similarity=0.610 Sum_probs=4.8
Q ss_pred cCCCCChHHH
Q psy16749 441 MNQPFTPEEL 450 (821)
Q Consensus 441 ~~~pi~~e~~ 450 (821)
.+.+|...|+
T Consensus 1133 ~dRpiSEGEi 1142 (1252)
T PF15324_consen 1133 TDRPISEGEI 1142 (1252)
T ss_pred cCCcCCccee
Confidence 4556653333
No 35
>COG0296 GlgB 1,4-alpha-glucan branching enzyme [Carbohydrate transport and metabolism]
Probab=21.37 E-value=92 Score=38.23 Aligned_cols=77 Identities=17% Similarity=0.129 Sum_probs=49.9
Q ss_pred HHHHHHHHHHHHhcCCCcEEEEcchHHHHHHHhhcccCC----CCCCc---hHHHHHHHHHHHhcCCcEEEEEecCCCCC
Q psy16749 717 VIGIWEALKYSASLKNNEILILTDSKSACQKLSKNCLNT----TPTHL---ELEILSSYKHLQNTCKTVKLAWIKGHEGI 789 (821)
Q Consensus 717 L~AI~~ALe~a~~~~~~~I~I~SDSqsAL~~L~~~~~~~----~p~~~---s~~i~~i~~~L~~~~~~V~~~WVPGHsGI 789 (821)
..++..-|.++.+++.+.|.+.-=++...+ .+|.+.+ .|... -+-++.+.+.+-+.|+.|-|-|||+|.|.
T Consensus 164 ~e~a~~llpYl~elG~T~IELMPv~e~p~~--~sWGYq~~g~yAp~sryGtPedfk~fVD~aH~~GIgViLD~V~~HF~~ 241 (628)
T COG0296 164 FELAIELLPYLKELGITHIELMPVAEHPGD--RSWGYQGTGYYAPTSRYGTPEDFKALVDAAHQAGIGVILDWVPNHFPP 241 (628)
T ss_pred HHHHHHHhHHHHHhCCCEEEEcccccCCCC--CCCCCCcceeccccccCCCHHHHHHHHHHHHHcCCEEEEEecCCcCCC
Confidence 344566678888888888877543321111 2344421 01111 12377788888889999999999999999
Q ss_pred cccHHH
Q psy16749 790 KGNVEV 795 (821)
Q Consensus 790 ~GNE~A 795 (821)
.||-++
T Consensus 242 d~~~L~ 247 (628)
T COG0296 242 DGNYLA 247 (628)
T ss_pred Ccchhh
Confidence 998744
No 36
>PF07462 MSP1_C: Merozoite surface protein 1 (MSP1) C-terminus; InterPro: IPR010901 This entry represents the C-terminal region of merozoite surface protein 1 (MSP1), which is found in a number of Plasmodium species. MSP-1 is a 200 kDa protein expressed on the surface of the Plasmodium vivax merozoite. MSP-1 of Plasmodium species is synthesised as a high-molecular-weight precursor and then processed into several fragments. At the time of red cell invasion by the merozoite, only the 19 kDa C-terminal fragment (MSP-119), which contains two epidermal growth factor-like domains, remains on the surface. Antibodies against MSP-119 inhibit merozoite entry into red cells, and immunisation with MSP-119 protects monkeys from challenging infections. Hence, MSP-119 is considered a promising vaccine candidate [].; GO: 0009405 pathogenesis, 0016020 membrane
Probab=21.27 E-value=1.9e+02 Score=34.85 Aligned_cols=8 Identities=25% Similarity=0.712 Sum_probs=3.7
Q ss_pred HHHHhhcc
Q psy16749 572 RFVLKVIS 579 (821)
Q Consensus 572 ~~~~Rl~t 579 (821)
.|++.++.
T Consensus 415 nyFL~VLn 422 (574)
T PF07462_consen 415 NYFLDVLN 422 (574)
T ss_pred HHHHHHHh
Confidence 44554443
Done!