Query 537021.9.peg.817_1 Match_columns 148 No_of_seqs 127 out of 965 Neff 6.6 Searched_HMMs 39220 Date Wed May 25 02:05:58 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_817.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00024 radC DNA repair prote 100.0 3.6E-36 9.1E-41 254.5 14.5 131 15-145 2-132 (224) 2 COG2003 RadC DNA repair protei 100.0 1.2E-36 3.2E-41 257.4 11.1 130 16-145 3-132 (224) 3 TIGR00608 radc DNA repair prot 100.0 2.5E-36 6.3E-41 255.5 11.3 122 24-145 1-129 (223) 4 PRK13482 DNA integrity scannin 98.0 8.7E-05 2.2E-09 51.7 9.9 87 24-113 245-350 (352) 5 PRK13766 Hef nuclease; Provisi 97.9 1.4E-05 3.6E-10 56.8 5.0 70 28-99 689-758 (764) 6 KOG2841 consensus 97.7 4.8E-05 1.2E-09 53.4 4.8 45 55-99 203-247 (254) 7 COG1948 MUS81 ERCC4-type nucle 97.4 0.00035 8.9E-09 47.8 5.9 68 30-99 166-234 (254) 8 PRK00558 uvrC excinuclease ABC 96.9 0.0017 4.3E-08 43.4 4.8 40 59-98 566-605 (609) 9 COG1623 Predicted nucleic-acid 96.7 0.005 1.3E-07 40.4 5.8 42 57-98 303-344 (349) 10 cd00056 ENDO3c endonuclease II 96.2 0.0029 7.4E-08 41.9 2.3 113 23-137 23-149 (158) 11 PRK07956 ligA NAD-dependent DN 96.0 0.022 5.7E-07 36.2 6.1 60 40-100 505-564 (668) 12 COG0322 UvrC Nuclease subunit 96.0 0.0098 2.5E-07 38.5 4.3 37 60-97 543-579 (581) 13 TIGR00575 dnlj DNA ligase, NAD 95.9 0.024 6.2E-07 35.9 5.8 88 45-133 520-628 (706) 14 COG0272 Lig NAD-dependent DNA 95.4 0.05 1.3E-06 33.9 5.8 58 42-100 507-564 (667) 15 PRK10702 endonuclease III; Pro 95.4 0.014 3.6E-07 37.4 3.0 59 36-97 66-127 (211) 16 pfam00730 HhH-GPD HhH-GPD supe 95.1 0.029 7.4E-07 35.4 3.8 96 34-129 33-141 (144) 17 PRK10702 endonuclease III; Pro 94.7 0.068 1.7E-06 33.1 5.0 64 38-101 28-95 (211) 18 smart00478 ENDO3c endonuclease 94.4 0.081 2.1E-06 32.6 4.8 95 34-133 27-134 (149) 19 PRK01172 ski2-like helicase; P 94.3 0.14 3.7E-06 31.0 5.9 65 25-98 599-664 (674) 20 COG0177 Nth Predicted EndoIII- 94.3 0.18 4.6E-06 30.3 6.3 57 42-98 33-92 (211) 21 TIGR01259 comE comEA protein; 94.2 0.032 8.1E-07 35.2 2.4 32 69-100 62-93 (124) 22 pfam00633 HHH Helix-hairpin-he 94.1 0.03 7.6E-07 35.4 2.1 29 69-97 1-29 (30) 23 PRK10880 adenine DNA glycosyla 92.9 0.12 2.9E-06 31.6 3.4 69 27-98 53-128 (350) 24 PRK07758 hypothetical protein; 92.3 0.24 6.1E-06 29.5 4.4 41 60-100 46-88 (95) 25 PRK00254 ski2-like helicase; P 92.3 0.26 6.6E-06 29.3 4.5 83 21-114 627-710 (717) 26 PRK08097 ligB NAD-dependent DN 92.2 0.59 1.5E-05 27.0 6.3 35 69-103 450-484 (563) 27 cd00056 ENDO3c endonuclease II 91.9 1.2 3E-05 25.1 7.8 64 41-107 3-69 (158) 28 COG0177 Nth Predicted EndoIII- 91.6 0.15 3.8E-06 30.8 2.7 96 36-134 66-171 (211) 29 PTZ00035 Rad51; Provisional 91.6 0.51 1.3E-05 27.4 5.4 54 42-98 34-87 (350) 30 COG2231 Uncharacterized protei 91.3 0.22 5.5E-06 29.8 3.3 96 34-131 66-174 (215) 31 smart00478 ENDO3c endonuclease 90.7 0.4 1E-05 28.0 4.2 53 49-101 3-58 (149) 32 PRK13913 3-methyladenine DNA g 90.6 0.32 8E-06 28.7 3.5 93 35-130 73-178 (218) 33 COG5241 RAD10 Nucleotide excis 90.5 0.22 5.6E-06 29.7 2.7 40 58-97 178-217 (224) 34 PRK00116 ruvA Holliday junctio 89.8 1.5 3.9E-05 24.3 6.5 105 30-140 59-173 (198) 35 TIGR01083 nth endonuclease III 89.3 0.84 2.2E-05 26.0 4.8 66 38-103 26-95 (192) 36 COG1194 MutY A/G-specific DNA 89.1 0.63 1.6E-05 26.8 4.1 72 24-98 54-132 (342) 37 PRK13910 DNA glycosylase MutY; 88.8 0.87 2.2E-05 25.9 4.7 70 26-98 16-92 (290) 38 PRK04301 radA DNA repair and r 88.8 1.5 3.7E-05 24.5 5.8 47 51-97 12-58 (318) 39 COG1555 ComEA DNA uptake prote 87.7 0.51 1.3E-05 27.4 2.9 34 68-101 86-119 (149) 40 COG0122 AlkA 3-methyladenine D 87.3 0.88 2.3E-05 25.8 3.9 57 40-96 155-215 (285) 41 TIGR02236 recomb_radA DNA repa 87.1 0.75 1.9E-05 26.3 3.5 45 58-102 11-57 (333) 42 KOG2875 consensus 84.5 1 2.7E-05 25.4 3.2 50 47-96 179-235 (323) 43 TIGR00426 TIGR00426 competence 82.8 0.78 2E-05 26.2 1.9 33 68-100 6-39 (70) 44 PRK13901 ruvA Holliday junctio 82.4 4.4 0.00011 21.3 5.6 104 30-140 58-164 (196) 45 PTZ00188 adrenodoxin reductase 81.0 3 7.7E-05 22.4 4.4 82 51-132 423-504 (506) 46 KOG1921 consensus 80.9 1.3 3.3E-05 24.8 2.5 78 47-132 130-222 (286) 47 TIGR01954 nusA_Cterm_rpt trans 80.9 3.8 9.6E-05 21.8 4.9 42 57-98 3-47 (52) 48 pfam03118 RNA_pol_A_CTD Bacter 80.8 4.1 0.0001 21.6 5.0 40 60-99 19-60 (62) 49 TIGR01083 nth endonuclease III 80.6 0.57 1.5E-05 27.1 0.6 56 40-98 68-126 (192) 50 TIGR01448 recD_rel helicase, R 80.1 1.4 3.6E-05 24.5 2.5 88 38-126 84-176 (769) 51 pfam10391 DNA_pol_lambd_f Fing 78.4 1.6 4E-05 24.2 2.3 42 79-123 2-43 (52) 52 PRK12766 50S ribosomal protein 78.1 1.7 4.4E-05 24.0 2.4 54 39-98 8-62 (238) 53 TIGR02027 rpoA DNA-directed RN 76.5 4.3 0.00011 21.4 4.0 42 59-100 271-314 (324) 54 COG1379 PHP family phosphoeste 75.1 7.4 0.00019 19.9 5.4 77 19-99 277-364 (403) 55 PRK03980 flap endonuclease-1; 72.4 3.8 9.6E-05 21.8 2.9 55 71-127 183-237 (295) 56 TIGR00084 ruvA Holliday juncti 72.4 4.4 0.00011 21.3 3.3 75 29-107 68-145 (217) 57 TIGR00375 TIGR00375 conserved 70.1 3.9 9.8E-05 21.7 2.6 62 36-99 305-367 (384) 58 COG0632 RuvA Holliday junction 69.5 9.9 0.00025 19.1 5.4 74 29-107 58-134 (201) 59 smart00278 HhH1 Helix-hairpin- 68.8 2.2 5.7E-05 23.2 1.1 22 79-100 1-22 (26) 60 CHL00013 rpoA RNA polymerase a 68.8 10 0.00026 19.0 4.6 41 60-100 283-325 (333) 61 PRK05182 DNA-directed RNA poly 67.8 10 0.00026 19.0 4.3 41 60-100 256-298 (306) 62 TIGR00588 ogg 8-oxoguanine DNA 66.8 7.9 0.0002 19.7 3.6 50 47-96 228-291 (379) 63 PRK02362 ski2-like helicase; P 66.8 11 0.00029 18.7 5.8 33 60-94 665-698 (736) 64 TIGR01084 mutY A/G-specific ad 66.6 3.8 9.7E-05 21.8 1.9 49 47-98 89-137 (297) 65 TIGR00615 recR recombination p 63.2 5.4 0.00014 20.8 2.2 19 78-96 11-29 (205) 66 pfam05559 DUF763 Protein of un 62.9 9.4 0.00024 19.2 3.4 28 75-102 265-292 (319) 67 COG4277 Predicted DNA-binding 61.7 5.9 0.00015 20.5 2.2 64 36-101 288-352 (404) 68 smart00483 POLXc DNA polymeras 60.6 6.9 0.00018 20.1 2.3 39 56-95 60-105 (334) 69 pfam11731 Cdd1 Pathogenicity l 60.1 7.6 0.00019 19.8 2.5 22 76-97 9-30 (92) 70 LOAD_Hrd consensus 59.9 12 0.00031 18.5 3.5 36 70-105 38-74 (77) 71 PRK08609 hypothetical protein; 58.1 14 0.00035 18.1 3.5 40 57-96 58-105 (570) 72 pfam02371 Transposase_20 Trans 57.6 7.6 0.00019 19.8 2.1 37 79-126 2-38 (87) 73 COG1796 POL4 DNA polymerase IV 56.1 13 0.00032 18.4 3.1 44 39-94 65-108 (326) 74 TIGR03674 fen_arch flap struct 55.6 13 0.00033 18.3 3.0 53 70-127 229-281 (338) 75 TIGR02238 recomb_DMC1 meiotic 53.8 19 0.00048 17.3 3.7 50 53-105 8-57 (314) 76 smart00540 LEM in nuclear memb 53.1 12 0.00031 18.5 2.6 33 34-66 3-35 (44) 77 COG1415 Uncharacterized conser 52.1 18 0.00046 17.4 3.3 34 69-102 268-301 (373) 78 PRK07847 amidophosphoribosyltr 52.0 20 0.00051 17.1 5.4 47 22-68 107-160 (489) 79 PRK10308 3-methyl-adenine DNA 50.6 5.9 0.00015 20.5 0.6 47 50-96 176-224 (283) 80 PRK11057 ATP-dependent DNA hel 49.2 15 0.00038 17.9 2.5 35 71-105 567-602 (607) 81 cd00141 NT_POLXc Nucleotidyltr 48.3 23 0.00058 16.7 4.3 53 78-133 84-157 (307) 82 PRK02515 psbU photosystem II c 47.8 17 0.00044 17.5 2.7 35 62-96 56-90 (144) 83 cd00080 HhH2_motif Helix-hairp 47.8 6.5 0.00016 20.3 0.5 26 72-97 15-40 (75) 84 TIGR00593 pola DNA polymerase 44.4 25 0.00062 16.5 3.0 16 60-75 205-220 (1005) 85 KOG1918 consensus 43.9 13 0.00033 18.3 1.5 57 39-95 120-181 (254) 86 COG0758 Smf Predicted Rossmann 42.1 28 0.00072 16.1 6.4 75 37-113 1-75 (350) 87 COG0353 RecR Recombinational D 41.2 22 0.00056 16.9 2.4 18 78-95 11-28 (198) 88 PRK00076 recR recombination pr 40.5 24 0.00062 16.6 2.5 52 79-134 108-163 (197) 89 PRK13844 recombination protein 39.7 24 0.00061 16.6 2.4 62 66-131 99-164 (200) 90 cd07644 I-BAR_IMD_BAIAP2L2 Inv 39.0 32 0.00081 15.8 3.7 54 54-107 5-59 (215) 91 smart00279 HhH2 Helix-hairpin- 37.6 12 0.0003 18.6 0.5 29 69-97 6-34 (36) 92 PRK12278 50S ribosomal protein 36.5 15 0.00039 17.9 0.9 22 76-97 150-171 (216) 93 pfam03020 LEM LEM domain. The 36.1 34 0.00086 15.6 2.7 32 34-65 3-34 (43) 94 cd07645 I-BAR_IMD_BAIAP2L1 Inv 36.0 31 0.00079 15.9 2.5 51 56-106 7-58 (226) 95 KOG2534 consensus 36.0 35 0.0009 15.5 2.7 45 54-102 35-79 (353) 96 TIGR00920 2A060605 3-hydroxy-3 35.6 20 0.00051 17.1 1.4 36 23-59 509-552 (988) 97 pfam00416 Ribosomal_S13 Riboso 35.1 15 0.00039 17.9 0.8 26 79-104 15-40 (106) 98 cd00128 XPG Xeroderma pigmento 34.9 37 0.00094 15.4 4.2 55 71-127 217-274 (316) 99 smart00341 HRDC Helicase and R 34.1 38 0.00096 15.3 3.0 26 70-95 38-63 (81) 100 COG0258 Exo 5'-3' exonuclease 34.1 23 0.00057 16.8 1.5 31 70-100 189-219 (310) 101 PRK05179 rpsM 30S ribosomal pr 33.7 29 0.00075 16.0 2.0 24 79-102 17-40 (122) 102 TIGR03631 bact_S13 30S ribosom 33.3 20 0.0005 17.2 1.1 24 79-102 15-38 (113) 103 CHL00137 rps13 ribosomal prote 30.7 21 0.00054 16.9 0.9 24 80-103 18-41 (122) 104 pfam01367 5_3_exonuc 5'-3' exo 30.1 44 0.0011 14.9 4.4 40 59-105 30-69 (100) 105 cd00008 53EXOc 5'-3' exonuclea 29.4 22 0.00057 16.8 0.9 30 70-99 174-203 (240) 106 PRK04053 rps13p 30S ribosomal 29.4 24 0.00061 16.6 1.0 25 79-103 25-49 (149) 107 COG3743 Uncharacterized conser 28.6 25 0.00063 16.5 1.0 18 79-96 67-84 (133) 108 pfam00570 HRDC HRDC domain. Th 28.4 47 0.0012 14.7 3.0 26 70-95 35-60 (68) 109 pfam10678 DUF2492 Protein of u 27.6 48 0.0012 14.6 3.8 40 35-75 2-41 (78) 110 cd02434 Nodulin-21_like_3 Nodu 27.2 49 0.0013 14.6 2.9 47 19-65 63-109 (225) 111 PRK08937 adenylosuccinate lyas 26.8 50 0.0013 14.5 5.1 11 22-32 334-344 (425) 112 TIGR01258 pgm_1 phosphoglycera 26.7 27 0.00069 16.2 0.9 19 34-52 194-212 (248) 113 pfam02541 Ppx-GppA Ppx/GppA ph 25.8 52 0.0013 14.4 6.1 60 70-129 222-281 (285) 114 TIGR03629 arch_S13P archaeal r 25.5 36 0.00092 15.4 1.3 23 80-102 22-44 (144) 115 cd00715 GPATase_N Glutamine am 25.1 54 0.0014 14.3 4.7 62 22-83 107-170 (252) 116 pfam07088 GvpD GvpD gas vesicl 24.3 40 0.001 15.1 1.4 48 91-140 87-134 (484) 117 CHL00194 ycf39 Ycf39; Provisio 23.8 33 0.00085 15.7 0.9 23 33-55 204-226 (319) 118 PRK12495 hypothetical protein; 23.6 53 0.0014 14.4 1.9 32 19-50 7-38 (221) 119 PRK03968 DNA primase large sub 23.5 50 0.0013 14.5 1.7 21 59-79 13-33 (388) 120 PTZ00217 flap endonuclease-1; 22.9 59 0.0015 14.1 4.1 35 70-106 228-262 (394) 121 PRK09425 prpD 2-methylcitrate 22.6 60 0.0015 14.0 3.9 16 31-46 133-148 (480) 122 cd02433 Nodulin-21_like_2 Nodu 22.3 61 0.0015 14.0 3.6 50 15-66 78-127 (234) 123 cd05839 BR140_related The PWWP 22.0 43 0.0011 15.0 1.2 13 133-145 51-63 (111) 124 PTZ00205 DNA polymerase kappa; 21.6 43 0.0011 14.9 1.1 17 81-97 311-327 (571) 125 PRK08434 consensus 21.4 33 0.00085 15.7 0.5 47 19-66 383-434 (887) 126 PRK05797 consensus 20.6 34 0.00088 15.6 0.4 117 17-133 378-539 (869) 127 PRK07300 consensus 20.6 38 0.00097 15.3 0.7 16 60-75 209-224 (880) 128 COG2176 PolC DNA polymerase II 20.2 37 0.00094 15.4 0.5 45 81-127 1383-1427(1444) 129 pfam07747 MTH865 MTH865-like f 20.1 67 0.0017 13.7 2.9 36 33-68 39-74 (75) No 1 >PRK00024 radC DNA repair protein RadC; Reviewed Probab=100.00 E-value=3.6e-36 Score=254.46 Aligned_cols=131 Identities=27% Similarity=0.359 Sum_probs=127.5 Q ss_pred CCCCCCHHHHHHHHHHHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHH Q ss_conf 86300165019999982392218989999999733799898899999999737997898509978997378988999999 Q 537021.9.peg.8 15 HLDYIGHHNRLRDRFLQKGENALAYYEILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALE 94 (148) Q Consensus 15 ~~~~~ghr~R~RErll~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~ 94 (148) +.+++...+||||||+++|+++|||+||||++|++|++++||+++|++||++||||.+++++++++|++|+|||++||++ T Consensus 2 ~ik~~~~~~RpRER~l~~G~~~LsD~ELLallL~~g~~~~d~~~lA~~ll~~~g~l~~l~~a~~~eL~~i~GiG~~kA~~ 81 (224) T PRK00024 2 RIKDWPEEERPRERLLRRGAAALSDAELLAILLRTGTKGKSVLDLARELLERFGSLRGLLDASLEELQEIKGIGPAKAAQ 81 (224) T ss_pred CCCCCCHHHCHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHHCCHHHHHCCCCCCHHHHHH T ss_conf 60518811384899986093207779999999846999999899999999985999999870889884478988999999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEEEEECC Q ss_conf 754999999999987640677889899999999984597800025212102 Q 537021.9.peg.8 95 LKLVSVASQRILKANWSIKKFLTPGQPCLISVERHYPMKNANNFAYCFSIN 145 (148) Q Consensus 95 l~a~~El~rR~~~~~~~~~~~l~s~~~v~~yl~~~l~~~~~E~F~vLF~~N 145 (148) |+|+.|+++|+...+..++.+++||+++++||+..|++.++|+|+|+|+|+ T Consensus 82 l~a~~El~rR~~~~~~~~~~~l~s~~~v~~~~~~~l~~~~~E~f~vl~Ld~ 132 (224) T PRK00024 82 LKAALELARRILAERLREREVLTSPEDVADYLQAELRDEEQEVFVVLFLDT 132 (224) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEEECC T ss_conf 999999999997521046987489999999999986566786799999878 No 2 >COG2003 RadC DNA repair proteins [DNA replication, recombination, and repair] Probab=100.00 E-value=1.2e-36 Score=257.42 Aligned_cols=130 Identities=21% Similarity=0.280 Sum_probs=126.7 Q ss_pred CCCCCHHHHHHHHHHHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHH Q ss_conf 63001650199999823922189899999997337998988999999997379978985099789973789889999997 Q 537021.9.peg.8 16 LDYIGHHNRLRDRFLQKGENALAYYEILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALEL 95 (148) Q Consensus 16 ~~~~ghr~R~RErll~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l 95 (148) .+.+.+++|||||++++|+++|||+|||||+|+||++++||+++|++||++||||.+|+.||.++|+.++|||++|++|+ T Consensus 3 i~~~~~~~rPRErll~~G~~~Lsd~ELLailLrtG~~~~~~~~la~~lL~~fg~L~~l~~a~~~el~~v~GiG~aka~~l 82 (224) T COG2003 3 IKDNPENERPRERLLKLGAEALSDAELLAILLRTGTKGESVLDLAKELLQEFGSLAELLKASVEELSSVKGIGLAKAIQI 82 (224) T ss_pred CCCCCCCCCHHHHHHHHCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHHCCHHHHHHCCCCCHHHHHHH T ss_conf 44465310618999872845346689999999628999878999999999732588887379999951788338899999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEEEEECC Q ss_conf 54999999999987640677889899999999984597800025212102 Q 537021.9.peg.8 96 KLVSVASQRILKANWSIKKFLTPGQPCLISVERHYPMKNANNFAYCFSIN 145 (148) Q Consensus 96 ~a~~El~rR~~~~~~~~~~~l~s~~~v~~yl~~~l~~~~~E~F~vLF~~N 145 (148) +|+.|+++|++.....++.+++||+++.+||+..|++.++|+|.|||+|| T Consensus 83 ~a~~El~~R~~~~~~~~~~~i~sp~~~~~~l~~~l~~~~~E~f~vL~Ld~ 132 (224) T COG2003 83 KAAIELGKRILAERLREGVVITSPEAVAEYLRAELGGEEREHFVVLYLDS 132 (224) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999999999987505775489999999999986134889999999547 No 3 >TIGR00608 radc DNA repair protein RadC; InterPro: IPR001405 RadC is a DNA repair protein found in many bacteria. They share a region of similarity at the C-terminus that contains two perfectly conserved histidine residues.; GO: 0006281 DNA repair. Probab=100.00 E-value=2.5e-36 Score=255.48 Aligned_cols=122 Identities=23% Similarity=0.306 Sum_probs=120.6 Q ss_pred HHHHHHHHCCHHHCCHHHHHHHHHHCCCCCC--CHHHHHHHHHHHCC-----CHHHHHCCCHHHHHHCCCCCHHHHHHHH Q ss_conf 1999998239221898999999973379989--88999999997379-----9789850997899737898899999975 Q 537021.9.peg.8 24 RLRDRFLQKGENALAYYEILELILFRLIPRK--DTKSIAKALLKRFA-----TLGGVFGAPLHLLQEINRIGKRVALELK 96 (148) Q Consensus 24 R~RErll~~G~~~LsD~ELLallL~~g~~~~--~~~~lA~~lL~~fG-----sL~~ll~as~~eL~~i~GIG~aka~~l~ 96 (148) |||||||++|+++|||+|||||||+||++++ ||++||++||+.|| +|..|.++++++|+.|+|||.+|+.||+ T Consensus 1 ~PREkLLk~Ga~~L~d~ELLAi~L~tG~~k~k~dVl~Lsk~lL~~fG~~~~d~l~~L~~~~~~~L~~v~GiG~ak~~qlk 80 (223) T TIGR00608 1 MPREKLLKFGAEALSDYELLAILLRTGTKKGKADVLSLSKELLKQFGQEELDSLDLLLSAEPEELSSVPGIGLAKAIQLK 80 (223) T ss_pred CCHHHHHHCCHHCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCHHCCCCHHHHHCCCHHHHHCCCCCHHHHHHHHH T ss_conf 95032333163005646799999843899976662678999998707000130014321585785046885088999999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEEEEECC Q ss_conf 4999999999987640677889899999999984597800025212102 Q 537021.9.peg.8 97 LVSVASQRILKANWSIKKFLTPGQPCLISVERHYPMKNANNFAYCFSIN 145 (148) Q Consensus 97 a~~El~rR~~~~~~~~~~~l~s~~~v~~yl~~~l~~~~~E~F~vLF~~N 145 (148) |+.|+++|+.+.++.++++++||+++.+||+..|.++++|+|+|||+|+ T Consensus 81 a~~El~~R~~~~~~~~~~~i~sPe~~~~~l~~~l~~~~~E~F~vlfLD~ 129 (223) T TIGR00608 81 AIVELAKRYAKSRMLERPVIRSPEAAAEFLSTDLAHEEREHFMVLFLDL 129 (223) T ss_pred HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEEEECC T ss_conf 9999986775244214666489899999999863347642678887168 No 4 >PRK13482 DNA integrity scanning protein DisA; Provisional Probab=98.01 E-value=8.7e-05 Score=51.74 Aligned_cols=87 Identities=16% Similarity=0.187 Sum_probs=66.9 Q ss_pred HHHHHHHHCCHHHCCHHHHHHHHHHCCCCCCC-------------------HHHHHHHHHHHCCCHHHHHCCCHHHHHHC Q ss_conf 19999982392218989999999733799898-------------------89999999973799789850997899737 Q 537021.9.peg.8 24 RLRDRFLQKGENALAYYEILELILFRLIPRKD-------------------TKSIAKALLKRFATLGGVFGAPLHLLQEI 84 (148) Q Consensus 24 R~RErll~~G~~~LsD~ELLallL~~g~~~~~-------------------~~~lA~~lL~~fGsL~~ll~as~~eL~~i 84 (148) ...+.+-+...+-|.|...++=+|+++..... -..+..++.++||+|..+++||.+||..| T Consensus 245 ~v~~~l~~l~~~eLld~~~ia~~LGy~~~~~~lD~~v~pRGYR~L~kiprlp~~vie~lV~~Fg~L~~ll~As~eeL~~V 324 (352) T PRK13482 245 EILEELQALSSEELLDLSAIAKALGYPGGSEALDTAVSPRGYRILAKIPRLPSAVIENLVKHFGTLQGLLEASIEDLDDV 324 (352) T ss_pred HHHHHHHHCCHHHHCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHCCHHHHHHCCHHHHHHC T ss_conf 99999983899885188999999589987653346778725888605999999999999998527999986899888656 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 89889999997549999999999876406 Q 537021.9.peg.8 85 NRIGKRVALELKLVSVASQRILKANWSIK 113 (148) Q Consensus 85 ~GIG~aka~~l~a~~El~rR~~~~~~~~~ 113 (148) +|||+.+|..| .|--+|+......++ T Consensus 325 eGIGe~RAr~I---regL~Rl~e~~~~~r 350 (352) T PRK13482 325 EGIGEVRARAI---REGLSRLAEQLILDR 350 (352) T ss_pred CCCCHHHHHHH---HHHHHHHHHHHHHHH T ss_conf 79679999999---999999999887760 No 5 >PRK13766 Hef nuclease; Provisional Probab=97.95 E-value=1.4e-05 Score=56.82 Aligned_cols=70 Identities=23% Similarity=0.363 Sum_probs=51.2 Q ss_pred HHHHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 998239221898999999973379989889999999973799789850997899737898899999975499 Q 537021.9.peg.8 28 RFLQKGENALAYYEILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVS 99 (148) Q Consensus 28 rll~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~ 99 (148) ..+..+..+.+-.|.-..+| .+.|+..- ..|++||.+|||+.++++||.+||++|+|||+.+|..|.=+. T Consensus 689 ~~~~~~k~~~~~~e~q~~~l-~~~pgvg~-~~a~~ll~~fgsi~~i~~a~~~eL~~v~giG~~~A~~i~~~~ 758 (764) T PRK13766 689 VSIHGEKKAMTLKEQQEYIV-SSLPDVGP-VLARNLLDHFGSVENVMTASEEELKAVEGIGEKTAKKIREVV 758 (764) T ss_pred CCCCCCCCCCCHHHHHHHHH-HHCCCCCH-HHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHH T ss_conf 66677777687789999999-83899999-999999997199999965999999547496999999999985 No 6 >KOG2841 consensus Probab=97.74 E-value=4.8e-05 Score=53.42 Aligned_cols=45 Identities=24% Similarity=0.246 Sum_probs=39.5 Q ss_pred CHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 889999999973799789850997899737898899999975499 Q 537021.9.peg.8 55 DTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVS 99 (148) Q Consensus 55 ~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~ 99 (148) =-+.=|..||.+||||..+++||.+||.+++|+|++||-+|-..+ T Consensus 203 VnKtda~~LL~~FgsLq~~~~AS~~ele~~~G~G~~kak~l~~~l 247 (254) T KOG2841 203 VNKTDAQLLLQKFGSLQQISNASEGELEQCPGLGPAKAKRLHKFL 247 (254) T ss_pred CCCCCHHHHHHHCCCHHHHHHCCHHHHHHCCCCCHHHHHHHHHHH T ss_conf 770018999885535999874177679867573789999999998 No 7 >COG1948 MUS81 ERCC4-type nuclease [DNA replication, recombination, and repair] Probab=97.45 E-value=0.00035 Score=47.85 Aligned_cols=68 Identities=28% Similarity=0.354 Sum_probs=50.8 Q ss_pred HHCCH-HHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 82392-21898999999973379989889999999973799789850997899737898899999975499 Q 537021.9.peg.8 30 LQKGE-NALAYYEILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVS 99 (148) Q Consensus 30 l~~G~-~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~ 99 (148) ..+|. .+.+..|+...+| ...|+. ....|.+++.+|||..+++.||+++|++++|||+.+|..|.=++ T Consensus 166 ~~~~~~~~~t~~e~q~~il-~s~pgi-g~~~a~~ll~~fgS~~~~~tas~~eL~~v~gig~k~A~~I~~~~ 234 (254) T COG1948 166 NPHGKKKAKTLKELQLYIL-ESIPGI-GPKLAERLLKKFGSVEDVLTASEEELMKVKGIGEKKAREIYRFL 234 (254) T ss_pred CCCCCCCCCCHHHHHHHHH-HCCCCC-CHHHHHHHHHHHCCHHHHHHCCHHHHHHHCCCCHHHHHHHHHHH T ss_conf 6554322245578999999-708996-48999999998568887765599999774484688999999998 No 8 >PRK00558 uvrC excinuclease ABC subunit C; Validated Probab=96.91 E-value=0.0017 Score=43.44 Aligned_cols=40 Identities=30% Similarity=0.377 Sum_probs=36.2 Q ss_pred HHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 9999997379978985099789973789889999997549 Q 537021.9.peg.8 59 IAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLV 98 (148) Q Consensus 59 lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~ 98 (148) -.+.||.+|||+.++..||.+||.+++|||+..|..|.-- T Consensus 566 r~~~Ll~~Fgs~~~i~~As~eeL~~v~Gi~~~~A~~I~~~ 605 (609) T PRK00558 566 RRKALLKHFGSLKAIKEASVEELAKVPGISKKLAEKIYEA 605 (609) T ss_pred HHHHHHHHCCCHHHHHHCCHHHHHHCCCCCHHHHHHHHHH T ss_conf 9999999707999997389999964899899999999999 No 9 >COG1623 Predicted nucleic-acid-binding protein (contains the HHH domain) [General function prediction only] Probab=96.68 E-value=0.005 Score=40.36 Aligned_cols=42 Identities=29% Similarity=0.405 Sum_probs=37.8 Q ss_pred HHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 999999997379978985099789973789889999997549 Q 537021.9.peg.8 57 KSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLV 98 (148) Q Consensus 57 ~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~ 98 (148) +.+..++...||.|.+++.||.++|.+++|||.++|..|+.- T Consensus 303 ~~iv~nlV~~F~~l~~il~As~edL~~VeGIGe~rAr~i~~G 344 (349) T COG1623 303 FAIVENLVRAFGTLDGILEASAEDLDAVEGIGEARARAIKEG 344 (349) T ss_pred HHHHHHHHHHHHHHHHHHHHCHHHHHHHCCHHHHHHHHHHHH T ss_conf 789999999975289998714767766213669999999976 No 10 >cd00056 ENDO3c endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases Probab=96.21 E-value=0.0029 Score=41.91 Aligned_cols=113 Identities=21% Similarity=0.144 Sum_probs=72.0 Q ss_pred HHHHHHHH--HCCHHHCCHHHHHHHHHHCCCCC--CCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 01999998--23922189899999997337998--988999999997379978985099789973789889999997549 Q 537021.9.peg.8 23 NRLRDRFL--QKGENALAYYEILELILFRLIPR--KDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLV 98 (148) Q Consensus 23 ~R~RErll--~~G~~~LsD~ELLallL~~g~~~--~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~ 98 (148) .|+.+++. -......++.||.+++...|.++ ...+.+|+.+.+.++++..=.....++|+++||||+..|..+..- T Consensus 23 ~~l~~~~~pt~~~l~~~~~~~l~~~~~~~gy~~Ka~~i~~~a~~i~~~~~~~~~~~~~~~~~L~~l~GIG~~TA~~vl~~ 102 (158) T cd00056 23 ERLFERYGPTPEALAAADEEELRELIRSLGYRRKAKYLKELARAIVEGFGGLVLDDPDAREELLALPGVGRKTANVVLLF 102 (158) T ss_pred HHHHHHCCCCHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 99998549989999809999999997335689999999998888898608957898889999875898289999999999 Q ss_pred HHHH----------HHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEE Q ss_conf 9999----------99999876406778898999999999845978000 Q 537021.9.peg.8 99 SVAS----------QRILKANWSIKKFLTPGQPCLISVERHYPMKNANN 137 (148) Q Consensus 99 ~El~----------rR~~~~~~~~~~~l~s~~~v~~yl~~~l~~~~~E~ 137 (148) .++ +|+... ..--..-.+++++...++..+....... T Consensus 103 -~~~~~~~~vD~~v~R~~~r-l~~~~~~~~~~~~~~~~~~~~p~~~~~~ 149 (158) T cd00056 103 -ALGPDAFPVDTHVRRVLKR-LGLIPKKKTPEELEELLEELLPKPYWGE 149 (158) T ss_pred -HCCCCCCCCCHHHHHHHHH-HCCCCCCCCHHHHHHHHHHHCCCCHHHH T ss_conf -8799836225999999999-5787789999999999998589201999 No 11 >PRK07956 ligA NAD-dependent DNA ligase LigA; Validated Probab=96.03 E-value=0.022 Score=36.18 Aligned_cols=60 Identities=22% Similarity=0.217 Sum_probs=47.3 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 9999999733799898899999999737997898509978997378988999999754999 Q 537021.9.peg.8 40 YEILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVSV 100 (148) Q Consensus 40 ~ELLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~E 100 (148) ..|--+|-..|.+.. -...|+.|.++|+++..+.+|+.++|..|+|||+..|..|..-+. T Consensus 505 ~~l~r~L~ALGI~~V-G~~~Ak~La~~f~sl~~l~~as~e~L~~I~giG~~~A~si~~ff~ 564 (668) T PRK07956 505 TPLARFLYALGIRHV-GEKAAKALARHFGSLEALEAASEEELAAVEGIGEEVAQSIVEFFA 564 (668) T ss_pred CCHHHHHHHCCCCCC-CHHHHHHHHHHHCCHHHHHHCCHHHHHCCCCCCHHHHHHHHHHHC T ss_conf 758889986278641-299999999996689999708999985768844999999999970 No 12 >COG0322 UvrC Nuclease subunit of the excinuclease complex [DNA replication, recombination, and repair] Probab=96.02 E-value=0.0098 Score=38.47 Aligned_cols=37 Identities=35% Similarity=0.502 Sum_probs=34.4 Q ss_pred HHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999737997898509978997378988999999754 Q 537021.9.peg.8 60 AKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKL 97 (148) Q Consensus 60 A~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a 97 (148) .+.||++|||+.++..||.+||.++ ||+++.|..|.. T Consensus 543 ~~~LL~~Fgs~~~i~~As~eel~~v-gi~~~~a~~i~~ 579 (581) T COG0322 543 RKALLKHFGSLKGIKSASVEELAKV-GISKKLAEKIYE 579 (581) T ss_pred HHHHHHHHHCHHHHHHCCHHHHHHC-CCCHHHHHHHHH T ss_conf 9999998317888885599999874-999999999985 No 13 >TIGR00575 dnlj DNA ligase, NAD-dependent; InterPro: IPR001679 DNA ligase (polydeoxyribonucleotide synthase) is the enzyme that joins two DNA fragments by catalyzing the formation of an internucleotide ester bond between phosphate and deoxyribose. It is active during DNA replication, DNA repair and DNA recombination. There are two forms of DNA ligase: one requires ATP (6.5.1.1 from EC), the other NAD (6.5.1.2 from EC). This family is predominantly composed of NAD-dependent bacterial DNA ligases. They are proteins of about 75 to 85 Kd whose sequence is well conserved , . They also show similarity to yicF, an Escherichia coli hypothetical protein of 63 Kd.; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair. Probab=95.90 E-value=0.024 Score=35.95 Aligned_cols=88 Identities=19% Similarity=0.123 Sum_probs=60.3 Q ss_pred HHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHH-----------HHHHCCCCCHHHHHHHHHHHHHH-HH-------- Q ss_conf 99733799898899999999737997898509978-----------99737898899999975499999-99-------- Q 537021.9.peg.8 45 LILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLH-----------LLQEINRIGKRVALELKLVSVAS-QR-------- 104 (148) Q Consensus 45 llL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~-----------eL~~i~GIG~aka~~l~a~~El~-rR-------- 104 (148) +|.+-|++. =-...|+.|.++|+||..|..|+.+ +|++|+|||+..|..|..-+.-- .| T Consensus 520 lL~aLGIr~-VG~~~A~~La~~f~tl~~L~~A~~e~~~~~~~f~~s~L~~~~g~G~~vA~~~~~~F~~~~~~~~qPYRta 598 (706) T TIGR00575 520 LLFALGIRH-VGEVTAKLLAKHFGTLDKLKAASLETLESIYQFDRSELLSVEGVGPKVAESIVNFFHDPNNLSLQPYRTA 598 (706) T ss_pred HHHHCCCHH-HHHHHHHHHHHHCCCHHHHHHCCHHHHHHHHHHCCHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCHHHHH T ss_conf 998628603-5799999999855886899850821677887516055641014027899999998712000135604689 Q ss_pred HHHHHHHH-CCCCCCHHHHHHHHHHHHCCC Q ss_conf 99987640-677889899999999984597 Q 537021.9.peg.8 105 ILKANWSI-KKFLTPGQPCLISVERHYPMK 133 (148) Q Consensus 105 ~~~~~~~~-~~~l~s~~~v~~yl~~~l~~~ 133 (148) -.-..+.. .-.+.+.+.+.+.|...-... T Consensus 599 ~~~~~L~~~gv~~~~~D~~~~LL~~~~~~~ 628 (706) T TIGR00575 599 ELIEKLEELGVNMESEDEFCRLLDQALKEK 628 (706) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 999999883163101135787776420110 No 14 >COG0272 Lig NAD-dependent DNA ligase (contains BRCT domain type II) [DNA replication, recombination, and repair] Probab=95.38 E-value=0.05 Score=33.93 Aligned_cols=58 Identities=24% Similarity=0.256 Sum_probs=45.4 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999733799898899999999737997898509978997378988999999754999 Q 537021.9.peg.8 42 ILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVSV 100 (148) Q Consensus 42 LLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~E 100 (148) |=-+|..-|++.. -...|+.|-.+||++..++.|+.++|..++|||+..|..|..-+. T Consensus 507 l~r~l~aLGIr~V-G~~~Ak~La~~f~sl~~l~~a~~e~l~~i~giG~~vA~si~~ff~ 564 (667) T COG0272 507 LARFLYALGIRHV-GETTAKSLARHFGTLEALLAASEEELASIPGIGEVVARSIIEFFA 564 (667) T ss_pred HHHHHHHCCCCHH-HHHHHHHHHHHHHHHHHHHHCCHHHHHHCCCHHHHHHHHHHHHHC T ss_conf 9999998797114-089999999876029999842999995066612899999999972 No 15 >PRK10702 endonuclease III; Provisional Probab=95.38 E-value=0.014 Score=37.45 Aligned_cols=59 Identities=12% Similarity=0.095 Sum_probs=43.5 Q ss_pred HCCHHHHHHHHHHCCC---CCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHH Q ss_conf 1898999999973379---9898899999999737997898509978997378988999999754 Q 537021.9.peg.8 36 ALAYYEILELILFRLI---PRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKL 97 (148) Q Consensus 36 ~LsD~ELLallL~~g~---~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a 97 (148) ..+..||.+++=+.|. +-++....|+.++++||| -+-.+.++|.++||||+..|.-+.+ T Consensus 66 ~a~~~el~~~i~~~G~y~~KA~~L~~~a~~i~~~~~G---~vP~~~~~L~~LpGIG~kTA~aIl~ 127 (211) T PRK10702 66 ELGVEGVKTYIKTIGLYNSKAENVIKTCRILLEKHNG---EVPEDRAALEALPGVGRKTANVVLN 127 (211) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC---CCHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 0999999999998635999999999999999999099---8766699999876635889999999 No 16 >pfam00730 HhH-GPD HhH-GPD superfamily base excision DNA repair protein. This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate. This includes endonuclease III, EC:4.2.99.18 and MutY an A/G-specific adenine glycosylase, both have a C terminal 4Fe-4S cluster. The family also includes 8-oxoguanine DNA glycosylases. The methyl-CPG binding protein MBD4 also contains a related domain that is a thymine DNA glycosylase. The family also includes DNA-3-methyladenine glycosylase II EC:3.2.2.21 and other members of the AlkA family. Probab=95.06 E-value=0.029 Score=35.43 Aligned_cols=96 Identities=18% Similarity=0.064 Sum_probs=60.4 Q ss_pred HHHCCHHHHHHHHHHCCC---CCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH----------HH Q ss_conf 221898999999973379---98988999999997379978985099789973789889999997549----------99 Q 537021.9.peg.8 34 ENALAYYEILELILFRLI---PRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLV----------SV 100 (148) Q Consensus 34 ~~~LsD~ELLallL~~g~---~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~----------~E 100 (148) ...++..||-+++=..|. +-+....+|+.+.+.+++.........++|++++|||+..|..+..- .. T Consensus 33 l~~~~~~~l~~~i~~~G~~~~Ka~~I~~~a~~~~~~~~~~~~~~~~~~~~L~~l~GIG~~ta~~~l~~~~~~~d~~~~~D 112 (144) T pfam00730 33 LAEADEEELRELIKGLGFYRRKAKYIKELARILVEGYLGLVPLDLEELEALLALPGVGRWTAEAVLLFALGRPDVFPAVD 112 (144) T ss_pred HHCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCEECCC T ss_conf 98599999999987089769999999999988898628978861156999860889769999999999869998732644 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 99999998764067788989999999998 Q 537021.9.peg.8 101 ASQRILKANWSIKKFLTPGQPCLISVERH 129 (148) Q Consensus 101 l~rR~~~~~~~~~~~l~s~~~v~~yl~~~ 129 (148) ++-+-...++..-+.-.+++.+...+... T Consensus 113 ~~v~r~~~rl~~~~~~~~~~~~~~~l~~~ 141 (144) T pfam00730 113 THVRRVAKRLGLIKEKPTPEEVERELEEL 141 (144) T ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHHHH T ss_conf 99999999977998999999999998713 No 17 >PRK10702 endonuclease III; Provisional Probab=94.74 E-value=0.068 Score=33.06 Aligned_cols=64 Identities=13% Similarity=0.113 Sum_probs=53.8 Q ss_pred CHHH-HHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHH-CCCCCH--HHHHHHHHHHHH Q ss_conf 9899-999997337998988999999997379978985099789973-789889--999997549999 Q 537021.9.peg.8 38 AYYE-ILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQE-INRIGK--RVALELKLVSVA 101 (148) Q Consensus 38 sD~E-LLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~-i~GIG~--aka~~l~a~~El 101 (148) +.+| |++.+|..-++..+|.....+|.++|.+...+..++.+++.+ |+|+|- .||..|..+.+. T Consensus 28 ~P~~vLVs~ILsqqTtd~~v~~~~~~L~~~~~t~e~la~a~~~el~~~i~~~G~y~~KA~~L~~~a~~ 95 (211) T PRK10702 28 SPFELLIAVLLSAQATDVSVNKATAKLYPVANTPAAMLELGVEGVKTYIKTIGLYNSKAENVIKTCRI 95 (211) T ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 85899999999741858999999999999779999987099999999999863599999999999999 No 18 >smart00478 ENDO3c endonuclease III. includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases Probab=94.42 E-value=0.081 Score=32.55 Aligned_cols=95 Identities=17% Similarity=0.108 Sum_probs=64.2 Q ss_pred HHHCCHHHHHHHHHHCCC---CCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHH-------- Q ss_conf 221898999999973379---989889999999973799789850997899737898899999975499999-------- Q 537021.9.peg.8 34 ENALAYYEILELILFRLI---PRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVSVAS-------- 102 (148) Q Consensus 34 ~~~LsD~ELLallL~~g~---~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~-------- 102 (148) ....+..||.+++=..|. +-...+.+|+.+.+.|++- ..-+.++|.++||||+..|..+++.. +. T Consensus 27 l~~a~~~~l~~~i~~~g~~~~ka~~i~~~a~~i~~~~~~~---~p~~~~~L~~lpGVG~~tA~~vl~~~-~~~~~~~vD~ 102 (149) T smart00478 27 LAAADEEELEELIRPLGFYRRKAKYLIELARILVEEYGGE---VPDDREELLKLPGVGRKTANAVLSFA-LGKPFIPVDT 102 (149) T ss_pred HHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCC---CCCHHHHHHCCCCCCHHHHHHHHHHH-CCCCCCCCCC T ss_conf 9868999999999986889999999999999999866555---88559998758986599999999998-7998351341 Q ss_pred --HHHHHHHHHHCCCCCCHHHHHHHHHHHHCCC Q ss_conf --9999987640677889899999999984597 Q 537021.9.peg.8 103 --QRILKANWSIKKFLTPGQPCLISVERHYPMK 133 (148) Q Consensus 103 --rR~~~~~~~~~~~l~s~~~v~~yl~~~l~~~ 133 (148) +|+... ..--+.-.+++.+...++..+... T Consensus 103 ~v~Rv~~R-~~~~~~~~~~~~~~~~l~~~~p~~ 134 (149) T smart00478 103 HVLRIAKR-LGLVDKKSTPEEVEKLLEKLLPKE 134 (149) T ss_pred CHHHHHHH-HCCCCCCCCHHHHHHHHHHHCCCC T ss_conf 39999999-847888898999999999878934 No 19 >PRK01172 ski2-like helicase; Provisional Probab=94.31 E-value=0.14 Score=30.95 Aligned_cols=65 Identities=18% Similarity=0.293 Sum_probs=49.0 Q ss_pred HHHHHHHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHH-HCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 99999823922189899999997337998988999999997-379978985099789973789889999997549 Q 537021.9.peg.8 25 LRDRFLQKGENALAYYEILELILFRLIPRKDTKSIAKALLK-RFATLGGVFGAPLHLLQEINRIGKRVALELKLV 98 (148) Q Consensus 25 ~RErll~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~-~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~ 98 (148) ++.| +.+|.. .|||++.==.|..+. -|+.|.+ .|.|+..+.+|++++|.+|.|+|+.-|.+|.+. T Consensus 599 l~~R-l~~Gv~----~eLl~L~~I~gigr~----RAR~Ly~aG~~s~~dia~a~~~~L~~i~g~~~~~A~~Ii~~ 664 (674) T PRK01172 599 LNIR-IKEGIR----EDLIDLVLIPKVGRV----RARRLYDAGFKTVDDIARSSPERIKKIYGFSDTLANAIVNR 664 (674) T ss_pred HHHH-HHCCCH----HHHHHHCCCCCCCHH----HHHHHHHCCCCCHHHHHHCCHHHHHHCCCCCHHHHHHHHHH T ss_conf 9999-981986----888977188999989----99999986999999997099989876419899999999999 No 20 >COG0177 Nth Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair] Probab=94.27 E-value=0.18 Score=30.28 Aligned_cols=57 Identities=23% Similarity=0.284 Sum_probs=27.9 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHH-CC--CCCHHHHHHHHHH Q ss_conf 999997337998988999999997379978985099789973-78--9889999997549 Q 537021.9.peg.8 42 ILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQE-IN--RIGKRVALELKLV 98 (148) Q Consensus 42 LLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~-i~--GIG~aka~~l~a~ 98 (148) |++.+|..-+....|.....+|.++|++..++++++.++|.. |+ |.-..||..|+.+ T Consensus 33 Lva~iLSaqttD~~vn~at~~Lf~~~~t~e~l~~a~~~~l~~~I~~iGlyr~KAk~I~~~ 92 (211) T COG0177 33 LVAVILSAQTTDEVVNKATPALFKRYPTPEDLLNADEEELEELIKSIGLYRNKAKNIKEL 92 (211) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 999999446744889999999999759999997499999999998638718999999999 No 21 >TIGR01259 comE comEA protein; InterPro: IPR004787 The comE locus is obligatory for bacterial cell competence - the process of internalizing the exogenous added DNA. comEA and comEC are required for transformability, whereas the products of comEB and of the overlapping comER, which is transcribed in the reverse direction, are dispensable . ComEA has been shown to be an integral membrane protein, as predicted from hydropathy analysis, with its C-terminal domain outside the cytoplasmic membrane. This C-terminal domain possesses a sequence with similarity to those of several proteins known to be involved in nucleic acid transactions including UvrC and a human protein that binds to the replication origin of the Human herpesvirus 4 (Epstein-Barr virus) .. Probab=94.22 E-value=0.032 Score=35.18 Aligned_cols=32 Identities=22% Similarity=0.094 Sum_probs=26.5 Q ss_pred CHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 97898509978997378988999999754999 Q 537021.9.peg.8 69 TLGGVFGAPLHLLQEINRIGKRVALELKLVSV 100 (148) Q Consensus 69 sL~~ll~as~~eL~~i~GIG~aka~~l~a~~E 100 (148) -.-+|=.||.+||..++||||+||.-|-.=+| T Consensus 62 ~~VnlN~As~~EL~~l~GiGP~kA~aIi~YRe 93 (124) T TIGR01259 62 AAVNLNKASLEELQALPGIGPAKAKAIIEYRE 93 (124) T ss_pred CHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 01344067899986369998133799999998 No 22 >pfam00633 HHH Helix-hairpin-helix motif. The helix-hairpin-helix DNA-binding motif is found to be duplicated in the central domain of RuvA. The HhH domain of DisA, a bacterial checkpoint control protein, is a DNA-binding domain. Probab=94.14 E-value=0.03 Score=35.35 Aligned_cols=29 Identities=21% Similarity=0.277 Sum_probs=25.1 Q ss_pred CHHHHHCCCHHHHHHCCCCCHHHHHHHHH Q ss_conf 97898509978997378988999999754 Q 537021.9.peg.8 69 TLGGVFGAPLHLLQEINRIGKRVALELKL 97 (148) Q Consensus 69 sL~~ll~as~~eL~~i~GIG~aka~~l~a 97 (148) ++.++..++.+||++++|||+.+|..|.. T Consensus 1 ~~~~~~~as~eeL~~lpGVG~~tA~~I~~ 29 (30) T pfam00633 1 SLEGLIPASREELLALPGVGPKTAEAILS 29 (30) T ss_pred CCCCCCCCCHHHHHHCCCCCHHHHHHHHC T ss_conf 96443523599997288977688998853 No 23 >PRK10880 adenine DNA glycosylase; Provisional Probab=92.90 E-value=0.12 Score=31.56 Aligned_cols=69 Identities=16% Similarity=0.233 Sum_probs=50.9 Q ss_pred HHHHHC--CHHHC---CHHHHHHHH--HHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 999823--92218---989999999--7337998988999999997379978985099789973789889999997549 Q 537021.9.peg.8 27 DRFLQK--GENAL---AYYEILELI--LFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLV 98 (148) Q Consensus 27 Erll~~--G~~~L---sD~ELLall--L~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~ 98 (148) +||++. ..++| +..|+|.+- |++=++-.+-...|+.+.++||| .+=.+.++|.+.||||+..|.-|.++ T Consensus 53 ~rf~~~fP~~~~LA~A~~~~vl~~W~GLGYY~RArnLh~aA~~i~~~~~G---~~P~~~~~L~~LPGIG~yTA~AI~si 128 (350) T PRK10880 53 ERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGG---KFPETFEEVAALPGVGRSTAGAILSL 128 (350) T ss_pred HHHHHHCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCC---CCCCHHHHHHHCCCCCHHHHHHHHHH T ss_conf 99999883999997799999999864069389999999999999997589---89825999862668872799999999 No 24 >PRK07758 hypothetical protein; Provisional Probab=92.29 E-value=0.24 Score=29.53 Aligned_cols=41 Identities=12% Similarity=0.105 Sum_probs=34.2 Q ss_pred HHHHHHHC--CCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999737--997898509978997378988999999754999 Q 537021.9.peg.8 60 AKALLKRF--ATLGGVFGAPLHLLQEINRIGKRVALELKLVSV 100 (148) Q Consensus 60 A~~lL~~f--GsL~~ll~as~~eL~~i~GIG~aka~~l~a~~E 100 (148) |++-|.+- -+|..|...+..||.+++||||+....|..+.. T Consensus 46 ARRAL~~~GI~TLedLskytEkELL~LHGmGP~ai~~L~~aLk 88 (95) T PRK07758 46 ARRALEHHGIHTVEELSKYSEKEILKLHGMGPASLPKLRKALE 88 (95) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 8999998281149999885199999984868888999999999 No 25 >PRK00254 ski2-like helicase; Provisional Probab=92.29 E-value=0.26 Score=29.28 Aligned_cols=83 Identities=20% Similarity=0.268 Sum_probs=58.0 Q ss_pred HHHHHHHHHHHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHH-HCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 650199999823922189899999997337998988999999997-3799789850997899737898899999975499 Q 537021.9.peg.8 21 HHNRLRDRFLQKGENALAYYEILELILFRLIPRKDTKSIAKALLK-RFATLGGVFGAPLHLLQEINRIGKRVALELKLVS 99 (148) Q Consensus 21 hr~R~RErll~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~-~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~ 99 (148) +-.+++.| +++|.. .|||++.= .|+.. ..-|+.|.+ .|.|+..+.+|++.+|..|+|+|+.-|-.|. . T Consensus 627 ~l~~l~~R-l~~Gv~----~ELl~L~~---I~gvg-r~RAR~Ly~aGi~s~~~ia~A~p~~l~~i~g~g~~~a~~i~--~ 695 (717) T PRK00254 627 YLETLRLR-VKHGIR----EELIPLME---LPMIG-RKRARALYNAGFRDLEDIMNAKPSELLAVEGIGAKIVEGIF--K 695 (717) T ss_pred HHHHHHHH-HHCCCC----HHHHHHCC---CCCCC-HHHHHHHHHCCCCCHHHHHCCCHHHEECCCCCCHHHHHHHH--H T ss_conf 99999999-980997----65683564---89989-89999999869999999965999990302372789999999--9 Q ss_pred HHHHHHHHHHHHHCC Q ss_conf 999999998764067 Q 537021.9.peg.8 100 VASQRILKANWSIKK 114 (148) Q Consensus 100 El~rR~~~~~~~~~~ 114 (148) ++++.+.-....++. T Consensus 696 ~~~~~~~~~~~~~~~ 710 (717) T PRK00254 696 HLGKEVKINEKPRKG 710 (717) T ss_pred HHCCEEEHHHCCCCC T ss_conf 838644300034667 No 26 >PRK08097 ligB NAD-dependent DNA ligase LigB; Reviewed Probab=92.19 E-value=0.59 Score=26.98 Aligned_cols=35 Identities=20% Similarity=0.041 Sum_probs=15.9 Q ss_pred CHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHH Q ss_conf 97898509978997378988999999754999999 Q 537021.9.peg.8 69 TLGGVFGAPLHLLQEINRIGKRVALELKLVSVASQ 103 (148) Q Consensus 69 sL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~r 103 (148) ++.+++..+.++|.+++|+|+.+|.-|..+.|.++ T Consensus 450 ~~~Dly~L~~~~L~~l~g~geKsa~nLl~aIe~SK 484 (563) T PRK08097 450 HLFSWLALTPEQLANTPGIGKARAAQLWHQFNLAR 484 (563) T ss_pred CHHHHHHCCHHHHHCCCCHHHHHHHHHHHHHHHHC T ss_conf 97997508999985589857899999999999972 No 27 >cd00056 ENDO3c endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases Probab=91.91 E-value=1.2 Score=25.05 Aligned_cols=64 Identities=25% Similarity=0.307 Sum_probs=50.3 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHCC-CHHHHHCCCHHHHHH-CCCCC-HHHHHHHHHHHHHHHHHHH Q ss_conf 9999997337998988999999997379-978985099789973-78988-9999997549999999999 Q 537021.9.peg.8 41 EILELILFRLIPRKDTKSIAKALLKRFA-TLGGVFGAPLHLLQE-INRIG-KRVALELKLVSVASQRILK 107 (148) Q Consensus 41 ELLallL~~g~~~~~~~~lA~~lL~~fG-sL~~ll~as~~eL~~-i~GIG-~aka~~l~a~~El~rR~~~ 107 (148) -|+..+|..-+....+.....++.++|| +...+..++.++|.+ +.++| ..||..|+ ++++.+.. T Consensus 3 ~Li~~Il~qq~s~~~a~~~~~~l~~~~~pt~~~l~~~~~~~l~~~~~~~gy~~Ka~~i~---~~a~~i~~ 69 (158) T cd00056 3 VLVSEILSQQTTDKAVNKAYERLFERYGPTPEALAAADEEELRELIRSLGYRRKAKYLK---ELARAIVE 69 (158) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHCCCCHHHHHHHHH---HHHHHHHH T ss_conf 99999998145299999999999985499899998099999999973356899999999---98888898 No 28 >COG0177 Nth Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair] Probab=91.63 E-value=0.15 Score=30.81 Aligned_cols=96 Identities=16% Similarity=0.062 Sum_probs=60.2 Q ss_pred HCCHHHHHHHHHHCCCC---CCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH----HHHHHHHH Q ss_conf 18989999999733799---898899999999737997898509978997378988999999754999----99999998 Q 537021.9.peg.8 36 ALAYYEILELILFRLIP---RKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVSV----ASQRILKA 108 (148) Q Consensus 36 ~LsD~ELLallL~~g~~---~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~E----l~rR~~~~ 108 (148) .+...||=++|=..|.. -++.+.+|+.|+++||| -.-.+.++|.+.||||+..|--.+...- ++-=.+-. T Consensus 66 ~a~~~~l~~~I~~iGlyr~KAk~I~~~~~~l~e~~~g---~vP~~~~eL~~LPGVGrKTAnvVL~~a~g~p~i~VDTHV~ 142 (211) T COG0177 66 NADEEELEELIKSIGLYRNKAKNIKELARILLEKFGG---EVPDTREELLSLPGVGRKTANVVLSFAFGIPAIAVDTHVH 142 (211) T ss_pred CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCC---CCCCHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCHHH T ss_conf 4999999999986387189999999999999997499---9981599997489966577898998655998652124299 Q ss_pred HHHHCCC---CCCHHHHHHHHHHHHCCCC Q ss_conf 7640677---8898999999999845978 Q 537021.9.peg.8 109 NWSIKKF---LTPGQPCLISVERHYPMKN 134 (148) Q Consensus 109 ~~~~~~~---l~s~~~v~~yl~~~l~~~~ 134 (148) ++.++.- =.+|+++-.-|+..+.... T Consensus 143 Rvs~R~gl~~~~~p~~ve~~L~~~iP~~~ 171 (211) T COG0177 143 RVSNRLGLVPGKTPEEVEEALMKLIPKEL 171 (211) T ss_pred HHHHHHCCCCCCCHHHHHHHHHHHCCHHH T ss_conf 99998477889999999999999789788 No 29 >PTZ00035 Rad51; Provisional Probab=91.56 E-value=0.51 Score=27.37 Aligned_cols=54 Identities=22% Similarity=0.179 Sum_probs=40.4 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 999997337998988999999997379978985099789973789889999997549 Q 537021.9.peg.8 42 ILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLV 98 (148) Q Consensus 42 LLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~ 98 (148) .++-|...|. ++-++.+--=..|.++.++..++..+|+.|+||+++||..|+.+ T Consensus 34 ~i~~l~~~Gi---~~~di~kl~~aG~~tv~~v~~~~~k~L~~ikgise~k~~Ki~~~ 87 (350) T PTZ00035 34 KIEQLLAKGF---VKRDLELLKEGGLQTVECVAYAPMRTLCAIKGISEQKAEKLKKA 87 (350) T ss_pred CHHHHHHCCC---CHHHHHHHHHCCCCHHHHHHHCCHHHHHHCCCCCHHHHHHHHHH T ss_conf 2999987799---98999999984912489998509998977379469999999999 No 30 >COG2231 Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair] Probab=91.33 E-value=0.22 Score=29.78 Aligned_cols=96 Identities=25% Similarity=0.171 Sum_probs=67.8 Q ss_pred HHHCCHHHHHHHHHHCCCCCCCHH---HHHHHHHHHCCCHHHHHCCC-HHHHHHCCCCCHHHHHHHHH---------HHH Q ss_conf 221898999999973379989889---99999997379978985099-78997378988999999754---------999 Q 537021.9.peg.8 34 ENALAYYEILELILFRLIPRKDTK---SIAKALLKRFATLGGVFGAP-LHLLQEINRIGKRVALELKL---------VSV 100 (148) Q Consensus 34 ~~~LsD~ELLallL~~g~~~~~~~---~lA~~lL~~fGsL~~ll~as-~~eL~~i~GIG~aka~~l~a---------~~E 100 (148) ...++..||-++|=.+|-=.+-++ .+.+.+...|.|+.+.-.-. .++|..|+|||+-.|-.|.. +-. T Consensus 66 I~~~~~~~L~elIrpsGFYnqKa~rLk~l~k~l~~~~~~~~~~~~~~~R~~LL~iKGIG~ETaDsILlYa~~rp~FVvD~ 145 (215) T COG2231 66 ILKLDEEELAELIRPSGFYNQKAKRLKALSKNLAKFFINLESFKSEVLREELLSIKGIGKETADSILLYALDRPVFVVDK 145 (215) T ss_pred HHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHH T ss_conf 84589999999870424089999999999999999864231115188999987268866223999999980486446329 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHC Q ss_conf 9999999876406778898999999999845 Q 537021.9.peg.8 101 ASQRILKANWSIKKFLTPGQPCLISVERHYP 131 (148) Q Consensus 101 l~rR~~~~~~~~~~~l~s~~~v~~yl~~~l~ 131 (148) -++|++..-.... .++.+++...++..+. T Consensus 146 YtrR~l~rlg~i~--~k~ydeik~~fe~~l~ 174 (215) T COG2231 146 YTRRLLSRLGGIE--EKKYDEIKELFEENLP 174 (215) T ss_pred HHHHHHHHHCCCC--CCCHHHHHHHHHHCCH T ss_conf 9999999945510--2549999999985350 No 31 >smart00478 ENDO3c endonuclease III. includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases Probab=90.72 E-value=0.4 Score=28.05 Aligned_cols=53 Identities=21% Similarity=0.250 Sum_probs=42.5 Q ss_pred CCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHH-CCCCC--HHHHHHHHHHHHH Q ss_conf 37998988999999997379978985099789973-78988--9999997549999 Q 537021.9.peg.8 49 RLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQE-INRIG--KRVALELKLVSVA 101 (148) Q Consensus 49 ~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~-i~GIG--~aka~~l~a~~El 101 (148) .-+....|.....+++++|+++..+..++.+++.+ +.|+| ..||..|+.+.+. T Consensus 3 qqt~~~~v~~~~~~l~~~~pt~~~l~~a~~~~l~~~i~~~g~~~~ka~~i~~~a~~ 58 (149) T smart00478 3 QQTSDEAVNKATERLFEKFPTPEDLAAADEEELEELIRPLGFYRRKAKYLIELARI 58 (149) T ss_pred CCCCHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 86528999999999999885999998689999999999868899999999999999 No 32 >PRK13913 3-methyladenine DNA glycosylase; Provisional Probab=90.59 E-value=0.32 Score=28.74 Aligned_cols=93 Identities=15% Similarity=0.066 Sum_probs=64.5 Q ss_pred HHCCHHHHHHHHHHCCCCCC---CHHHHHHHHHHHCCCHHHHH-CCCHHHHHHCCCCCHHHHHHHHHHH---------HH Q ss_conf 21898999999973379989---88999999997379978985-0997899737898899999975499---------99 Q 537021.9.peg.8 35 NALAYYEILELILFRLIPRK---DTKSIAKALLKRFATLGGVF-GAPLHLLQEINRIGKRVALELKLVS---------VA 101 (148) Q Consensus 35 ~~LsD~ELLallL~~g~~~~---~~~~lA~~lL~~fGsL~~ll-~as~~eL~~i~GIG~aka~~l~a~~---------El 101 (148) .++++.+|=++|=.+|--+. -.+.+|+.++..||++.++- +.+.++|..++|||+-.|-.|.+=. .- T Consensus 73 ~~l~~e~La~lIrPaGFy~~KA~rLk~l~~~~~~d~~~~~~~~~~~~Re~LL~lkGIG~ETADsILlYa~~~p~FVVDaY 152 (218) T PRK13913 73 AYIEFSKLAECVRPSGFYNQKAKRLIDLSKNILKDFQSFENFKQEVTREWLLDQKGIGKESADAILCYVCAKEVMVVDKY 152 (218) T ss_pred HCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCCEEECHHH T ss_conf 71899999999504015899999999999999987525751453658999974898663339999999749984511188 Q ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 99999987640677889899999999984 Q 537021.9.peg.8 102 SQRILKANWSIKKFLTPGQPCLISVERHY 130 (148) Q Consensus 102 ~rR~~~~~~~~~~~l~s~~~v~~yl~~~l 130 (148) .+|++.. + .....+.+++..++...+ T Consensus 153 T~Ri~~r-l--G~~~~~Ydelq~~fe~~l 178 (218) T PRK13913 153 SYLFLKK-L--GIEIEDYDELQHFFEKGV 178 (218) T ss_pred HHHHHHH-C--CCCCCCHHHHHHHHHHCC T ss_conf 9999998-1--998579999999999622 No 33 >COG5241 RAD10 Nucleotide excision repair endonuclease NEF1, RAD10 subunit [DNA replication, recombination, and repair] Probab=90.51 E-value=0.22 Score=29.74 Aligned_cols=40 Identities=18% Similarity=0.071 Sum_probs=35.6 Q ss_pred HHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHH Q ss_conf 9999999737997898509978997378988999999754 Q 537021.9.peg.8 58 SIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKL 97 (148) Q Consensus 58 ~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a 97 (148) .=|..+|..||+|...+.||.+|+..+.|.|+.||..+.- T Consensus 178 sds~~~L~d~g~L~rf~gaSrdE~e~l~g~g~~ka~~~ie 217 (224) T COG5241 178 SDSEDTLNDIGKLCRFNGASRDEFELLLGFGFEKAAKYIE 217 (224) T ss_pred CCHHHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHH T ss_conf 4189899888789998564615889997008889999999 No 34 >PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed Probab=89.84 E-value=1.5 Score=24.29 Aligned_cols=105 Identities=11% Similarity=0.089 Sum_probs=65.3 Q ss_pred HHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCC--C-HHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 823922189899999997337998988999999997379--9-7898509978997378988999999754999999999 Q 537021.9.peg.8 30 LQKGENALAYYEILELILFRLIPRKDTKSIAKALLKRFA--T-LGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRIL 106 (148) Q Consensus 30 l~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fG--s-L~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~ 106 (148) .-+|-....+.|+...++. .++--. -.|-.+|..++ . ...+.+-+.+.|.++||||+..|..|. .||--++. T Consensus 59 ~LyGF~~~~Er~~F~~Li~--V~GIGp-K~AL~ILs~~~~~~l~~aI~~~D~~~L~~vpGIG~KtA~rIi--~ELk~K~~ 133 (198) T PRK00116 59 LLYGFLTKEERELFRLLIS--VSGVGP-KLALAILSGLSPEELAQAIANGDIKALTKVPGVGKKTAERIV--LELKDKLA 133 (198) T ss_pred EEEEECCHHHHHHHHHHHC--CCCCCH-HHHHHHHCCCCHHHHHHHHHHCCHHHHCCCCCCCHHHHHHHH--HHHHHHHH T ss_conf 5784088899999999856--688578-999988702999999999985899997068897889999999--99998887 Q ss_pred HHHHHHC------CC-CCCHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 9876406------77-8898999999999845978000252 Q 537021.9.peg.8 107 KANWSIK------KF-LTPGQPCLISVERHYPMKNANNFAY 140 (148) Q Consensus 107 ~~~~~~~------~~-l~s~~~v~~yl~~~l~~~~~E~F~v 140 (148) .-..... .. -+.-+++..-|.. +|+.+.|.-.+ T Consensus 134 ~~~~~~~~~~~~~~~~~~~~~d~~~AL~~-LGy~~~ea~~a 173 (198) T PRK00116 134 ALADAGAAAAAAASAANDALEEAVSALVA-LGYKPKEAQKA 173 (198) T ss_pred HHCCCCCCCCCCCCCCCCHHHHHHHHHHH-CCCCHHHHHHH T ss_conf 42124554545667775259999999998-59999999999 No 35 >TIGR01083 nth endonuclease III; InterPro: IPR005759 The spectrum of DNA damage caused by reactive oxygen species includes a wide variety of modifications of purine and pyrimidine bases. Among these modified bases, 7,8-dihydro-8-oxoguanine (8-oxoG) is an important mutagenic lesion. Base excision repair is a critical mechanism for preventing mutations by removing the oxidative lesion from the DNA. Escherichia coli Nth protein (endonuclease III) has an 8-oxoG DNA glycosylase/AP lyase activity which removes 8-oxoG preferentially from 8-oxoG/G mispairs. Human hNTH1 protein, a homolog of E. coli Nth protein, has similar DNA glycosylase/AP lyase activity that removes 8-oxoG from 8-oxoG/G mispairs . ; GO: 0003906 DNA-(apurinic or apyrimidinic site) lyase activity, 0006284 base-excision repair, 0005622 intracellular. Probab=89.26 E-value=0.84 Score=25.98 Aligned_cols=66 Identities=23% Similarity=0.269 Sum_probs=45.2 Q ss_pred CHHHH-HHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHH-CCCCC--HHHHHHHHHHHHHHH Q ss_conf 98999-99997337998988999999997379978985099789973-78988--999999754999999 Q 537021.9.peg.8 38 AYYEI-LELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQE-INRIG--KRVALELKLVSVASQ 103 (148) Q Consensus 38 sD~EL-LallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~-i~GIG--~aka~~l~a~~El~r 103 (148) ++.|| +|.||..=....+|.-..+.|.+.|.+-.+++.++.+||++ |+-|| ..||..|..+...-- T Consensus 26 ~PFeLLVAtiLSAQ~TD~~VNkaT~~LF~~Y~tp~~~a~a~~eel~~~Ik~iGlYr~KAk~I~~~~~~Lv 95 (192) T TIGR01083 26 NPFELLVATILSAQATDKSVNKATKKLFEVYPTPQALAAAGLEELEEYIKSIGLYRNKAKNIIALCRKLV 95 (192) T ss_pred CCHHHHHHHHHHHHHCCHHHHHCCHHHHHCCCCHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 7078999999986531326763167865127786899608931347764225864568999999999999 No 36 >COG1194 MutY A/G-specific DNA glycosylase [DNA replication, recombination, and repair] Probab=89.08 E-value=0.63 Score=26.82 Aligned_cols=72 Identities=19% Similarity=0.158 Sum_probs=51.9 Q ss_pred HHHHHHHHC--CHHHCC---HHHHHHHHHHCC--CCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHH Q ss_conf 199999823--922189---899999997337--9989889999999973799789850997899737898899999975 Q 537021.9.peg.8 24 RLRDRFLQK--GENALA---YYEILELILFRL--IPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELK 96 (148) Q Consensus 24 R~RErll~~--G~~~Ls---D~ELLallL~~g--~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~ 96 (148) +.-+||++. +.++|. ..|++.+--+-| ++-.+....|+.+.++||| .+--+.++|.+++|||+..|.-+. T Consensus 54 ~yy~~fl~rfPti~~LA~A~~~evl~~W~gLGYysRArnL~~~A~~v~~~~~G---~~P~~~~~l~~LpGiG~yTa~Ail 130 (342) T COG1194 54 PYYERFLERFPTIKALAAAPEDEVLKAWEGLGYYSRARNLHKAAQEVVERHGG---EFPDDEEELAALPGVGPYTAGAIL 130 (342) T ss_pred HHHHHHHHHCCCHHHHHCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCC---CCCCCHHHHHHCCCCCHHHHHHHH T ss_conf 56999998689989986688889999987167378999999999999998199---799999999867897388999999 Q ss_pred HH Q ss_conf 49 Q 537021.9.peg.8 97 LV 98 (148) Q Consensus 97 a~ 98 (148) +. T Consensus 131 ~~ 132 (342) T COG1194 131 SF 132 (342) T ss_pred HH T ss_conf 98 No 37 >PRK13910 DNA glycosylase MutY; Provisional Probab=88.84 E-value=0.87 Score=25.89 Aligned_cols=70 Identities=21% Similarity=0.119 Sum_probs=50.1 Q ss_pred HHHHHHC--CHHHCC---HHHHHHHHHHCC--CCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 9999823--922189---899999997337--998988999999997379978985099789973789889999997549 Q 537021.9.peg.8 26 RDRFLQK--GENALA---YYEILELILFRL--IPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLV 98 (148) Q Consensus 26 RErll~~--G~~~Ls---D~ELLallL~~g--~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~ 98 (148) -+||++. -.++|+ ..|+|.+-=+-| ++-.+-...|+.+.++|+|- +=.+.++|.++||||+..|.-|.++ T Consensus 16 ~~~f~~~fP~~~~la~a~~~~vl~~W~GLGYY~RArnl~~~a~~i~~~~~g~---~P~~~~~L~~LPGIG~yTA~AI~si 92 (290) T PRK13910 16 YSPFLEAFPTLKDLANAQLEEVLLLWRGLGYYSRAKNLKKSAEICVKEHHSQ---LPNDYQSLLKLPGIGAYTANAILCF 92 (290) T ss_pred HHHHHHHCCCHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCC---CCCHHHHHHHCCCCCCHHHHHHHHH T ss_conf 9999998839999977899999999874684899999999999999983898---9852999975889982699999998 No 38 >PRK04301 radA DNA repair and recombination protein RadA; Validated Probab=88.81 E-value=1.5 Score=24.45 Aligned_cols=47 Identities=19% Similarity=0.176 Sum_probs=38.2 Q ss_pred CCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHH Q ss_conf 99898899999999737997898509978997378988999999754 Q 537021.9.peg.8 51 IPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKL 97 (148) Q Consensus 51 ~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a 97 (148) .|+-+...+++--=..|.++..+..+++.+|+++.||++++|..|.. T Consensus 12 lpGv~~~~~~kL~~aG~~tv~~l~~~~~~~L~~~~gis~~~a~ki~~ 58 (318) T PRK04301 12 LPGVGPATAEKLREAGYDTVEAIAVASPKELSEIAGISESTAAKIIE 58 (318) T ss_pred CCCCCHHHHHHHHHCCCCCHHHHHCCCHHHHHHHHCCCHHHHHHHHH T ss_conf 89989999999998699549998748999999850999999999999 No 39 >COG1555 ComEA DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair] Probab=87.68 E-value=0.51 Score=27.40 Aligned_cols=34 Identities=18% Similarity=0.062 Sum_probs=26.7 Q ss_pred CCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 9978985099789973789889999997549999 Q 537021.9.peg.8 68 ATLGGVFGAPLHLLQEINRIGKRVALELKLVSVA 101 (148) Q Consensus 68 GsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El 101 (148) ...-++-.||.+||..++|||+++|.-|..-.|- T Consensus 86 ~~~vNiNtAs~eeL~~lpgIG~~kA~aIi~yRe~ 119 (149) T COG1555 86 EKKVNINTASAEELQALPGIGPKKAQAIIDYREE 119 (149) T ss_pred CCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 5413666108999988679899999999999997 No 40 >COG0122 AlkA 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair] Probab=87.28 E-value=0.88 Score=25.85 Aligned_cols=57 Identities=18% Similarity=-0.013 Sum_probs=39.7 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCH----HHHHHCCCCCHHHHHHHH Q ss_conf 999999973379989889999999973799789850997----899737898899999975 Q 537021.9.peg.8 40 YEILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPL----HLLQEINRIGKRVALELK 96 (148) Q Consensus 40 ~ELLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~----~eL~~i~GIG~aka~~l~ 96 (148) .|.|.-.-.++.+-.-.+.+|+.+.+..-++..+..++. +.|++|+|||+..|-.+. T Consensus 155 ~~~l~~~g~s~~Ka~yi~~~A~~~~~g~~~~~~l~~~~~e~a~e~L~~i~GIG~WTAe~~l 215 (285) T COG0122 155 EEALRRCGLSGRKAEYIISLARAAAEGELDLSELKPLSDEEAIEELTALKGIGPWTAEMFL 215 (285) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 9999883785778999999999998599656766258899999998737886799999999 No 41 >TIGR02236 recomb_radA DNA repair and recombination protein RadA; InterPro: IPR011938 This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO), and archaeal RadB (IPR011939 from INTERPRO). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=87.13 E-value=0.75 Score=26.33 Aligned_cols=45 Identities=27% Similarity=0.305 Sum_probs=37.2 Q ss_pred HHHHHHHH-HCCCHHHHHCCCHHHHHHCCCCCHHHHHHH-HHHHHHH Q ss_conf 99999997-379978985099789973789889999997-5499999 Q 537021.9.peg.8 58 SIAKALLK-RFATLGGVFGAPLHLLQEINRIGKRVALEL-KLVSVAS 102 (148) Q Consensus 58 ~lA~~lL~-~fGsL~~ll~as~~eL~~i~GIG~aka~~l-~a~~El~ 102 (148) ..|++|-+ .|..+..+.-||+.||..+.|||+..|..| .|+.+++ T Consensus 11 ~TA~KL~EaGy~t~~~iA~A~~~EL~~~~gI~E~~A~kiI~AAR~a~ 57 (333) T TIGR02236 11 ATAEKLREAGYDTLEAIAVASPKELSEIAGIGEGTAAKIIQAARKAA 57 (333) T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 68899886107889998445857953203787778999999999984 No 42 >KOG2875 consensus Probab=84.55 E-value=1 Score=25.37 Aligned_cols=50 Identities=20% Similarity=0.102 Sum_probs=40.3 Q ss_pred HHCCCCCCCHHHHHHHHHHHCCC---HHHHHCCCHH----HHHHCCCCCHHHHHHHH Q ss_conf 73379989889999999973799---7898509978----99737898899999975 Q 537021.9.peg.8 47 LFRLIPRKDTKSIAKALLKRFAT---LGGVFGAPLH----LLQEINRIGKRVALELK 96 (148) Q Consensus 47 L~~g~~~~~~~~lA~~lL~~fGs---L~~ll~as~~----eL~~i~GIG~aka~~l~ 96 (148) ++.|.+-+=+-..|+.|++..|+ |..+-.++++ .|+.+||||+..|.-|. T Consensus 179 ~gfGYRAkYI~~ta~~l~~~~g~~~wLqslr~~~yeear~~L~~lpGVG~KVADCI~ 235 (323) T KOG2875 179 LGFGYRAKYISATARALQEKQGGLAWLQSLRKSSYEEAREALCSLPGVGPKVADCIC 235 (323) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 176446899999999999723500599988545289999998528887614756223 No 43 >TIGR00426 TIGR00426 competence protein ComEA helix-hairpin-helix repeat region; InterPro: IPR004509 This domain is found in competence protein ComEA and closely related proteins from a number of species that exhibit competence for transformation by exongenous DNA, including Streptococcus pneumoniae, Bacillus subtilis, Neisseria meningitidis, and Haemophilus influenzae. This domain represents a region of two tandem copies of a helix-hairpin-helix domain, each about 30 residues in length. Limited sequence similarity can be found among some members of this family N-terminal to this domain.. Probab=82.83 E-value=0.78 Score=26.19 Aligned_cols=33 Identities=15% Similarity=0.114 Sum_probs=24.1 Q ss_pred CCHHHHHCCCHHHHHH-CCCCCHHHHHHHHHHHH Q ss_conf 9978985099789973-78988999999754999 Q 537021.9.peg.8 68 ATLGGVFGAPLHLLQE-INRIGKRVALELKLVSV 100 (148) Q Consensus 68 GsL~~ll~as~~eL~~-i~GIG~aka~~l~a~~E 100 (148) |..-.+=+|+.+||++ +.|||+.||--|.+=+| T Consensus 6 G~~vnINtAtaeElq~~~~GvG~kKAeAIv~YRE 39 (70) T TIGR00426 6 GAKVNINTATAEELQKALSGVGAKKAEAIVAYRE 39 (70) T ss_pred CCEEECCHHCHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 4524101104788887642887237899988753 No 44 >PRK13901 ruvA Holliday junction DNA helicase motor protein; Provisional Probab=82.40 E-value=4.4 Score=21.34 Aligned_cols=104 Identities=12% Similarity=0.036 Sum_probs=63.6 Q ss_pred HHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCC--CH-HHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 823922189899999997337998988999999997379--97-898509978997378988999999754999999999 Q 537021.9.peg.8 30 LQKGENALAYYEILELILFRLIPRKDTKSIAKALLKRFA--TL-GGVFGAPLHLLQEINRIGKRVALELKLVSVASQRIL 106 (148) Q Consensus 30 l~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fG--sL-~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~ 106 (148) .-||-..-.+.++.+.++.- .+--.+ .|-.+|..+. .| ..+.+-+...|.++||||+..|..| +.||--.+. T Consensus 58 ~LyGF~~~~Er~~F~~LisV--sGIGpk-~Al~iLs~~~~~~l~~aI~~~D~~~L~~vpGIG~KtA~rI--i~ELk~Kl~ 132 (196) T PRK13901 58 KLFGFLNSSEREVFEELIGV--DGIGPR-AALRVLSGIKYNEFRDAIDREDIELISKVKGIGNKMAGKI--FLKLRGKLV 132 (196) T ss_pred EEECCCCHHHHHHHHHHHCC--CCCCHH-HHHHHHCCCCHHHHHHHHHHCCHHHHHHCCCCCHHHHHHH--HHHHHHHHC T ss_conf 13365988999999998765--882689-9999975799999999999289999831999589999999--999976531 Q ss_pred HHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 9876406778898999999999845978000252 Q 537021.9.peg.8 107 KANWSIKKFLTPGQPCLISVERHYPMKNANNFAY 140 (148) Q Consensus 107 ~~~~~~~~~l~s~~~v~~yl~~~l~~~~~E~F~v 140 (148) ....... .....+++..-|. .+|+.+.|.-.+ T Consensus 133 ~~~~~~~-~~~~~~e~~~AL~-~LGy~~~~a~~a 164 (196) T PRK13901 133 KNDELES-SLFKFKELEQSIV-NMGFDRKLVNSA 164 (196) T ss_pred CCCCCCC-CCCCHHHHHHHHH-HCCCCHHHHHHH T ss_conf 5665565-5344899999999-849998999999 No 45 >PTZ00188 adrenodoxin reductase; Provisional Probab=80.99 E-value=3 Score=22.42 Aligned_cols=82 Identities=9% Similarity=0.045 Sum_probs=59.9 Q ss_pred CCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 99898899999999737997898509978997378988999999754999999999987640677889899999999984 Q 537021.9.peg.8 51 IPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRILKANWSIKKFLTPGQPCLISVERHY 130 (148) Q Consensus 51 ~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~~~~~~~~~~l~s~~~v~~yl~~~l 130 (148) +...|+.+.|..+++...+.......+..+|..-.|+-.+....++.+-+.=+..-.+.-+.|..+.+.++..+.++.+| T Consensus 423 Ttm~da~~ta~~vl~dl~~~~~~~~~~i~~lL~~r~v~~Vt~~~W~~ID~~E~~~G~~~GkpReK~~~~~eML~vl~gr~ 502 (506) T PTZ00188 423 SQILNSKNSTHLVLNFLQKVDSFFDNDISSLLQEKQIPYVSFDDWTYLHQMEKQMGAQQNKIAQKFSQTGEVLRVLKGRM 502 (506) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEECHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 78677899999999877546877702699999857998007899999999999716513898446499999999853043 Q ss_pred CC Q ss_conf 59 Q 537021.9.peg.8 131 PM 132 (148) Q Consensus 131 ~~ 132 (148) +. T Consensus 503 ~~ 504 (506) T PTZ00188 503 GK 504 (506) T ss_pred CC T ss_conf 67 No 46 >KOG1921 consensus Probab=80.93 E-value=1.3 Score=24.77 Aligned_cols=78 Identities=18% Similarity=0.221 Sum_probs=45.4 Q ss_pred HHCCCCCCCHHHHHHHHHHHCC-CHHHHHCCCHHHHHHCCCCCHHHHHHHHHHH--------------HHHHHHHHHHHH Q ss_conf 7337998988999999997379-9789850997899737898899999975499--------------999999998764 Q 537021.9.peg.8 47 LFRLIPRKDTKSIAKALLKRFA-TLGGVFGAPLHLLQEINRIGKRVALELKLVS--------------VASQRILKANWS 111 (148) Q Consensus 47 L~~g~~~~~~~~lA~~lL~~fG-sL~~ll~as~~eL~~i~GIG~aka~~l~a~~--------------El~rR~~~~~~~ 111 (148) =++..+-...+..|+-|.++|+ ++. .+.++|...|||||..|-..+.+. -++.|.. +. T Consensus 130 gFy~rKA~ylkkta~IL~d~f~gDIP----~~v~dLlsLPGVGPKMa~L~m~~AWn~i~GI~VDtHVHRi~nrlg---Wv 202 (286) T KOG1921 130 GFYTRKAKYLKKTAKILQDKFDGDIP----DTVEDLLSLPGVGPKMAHLTMQVAWNKIVGICVDTHVHRICNRLG---WV 202 (286) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC----HHHHHHHCCCCCCHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHHC---CC T ss_conf 21578889999999999987079975----559988558997659999999998566405786328999888756---33 Q ss_pred HCCCCCCHHHHHHHHHHHHCC Q ss_conf 067788989999999998459 Q 537021.9.peg.8 112 IKKFLTPGQPCLISVERHYPM 132 (148) Q Consensus 112 ~~~~l~s~~~v~~yl~~~l~~ 132 (148) ...-.+|++...-|+..+.. T Consensus 203 -~~ktkspE~TR~aLq~wLPk 222 (286) T KOG1921 203 -DTKTKSPEQTRVALQQWLPK 222 (286) T ss_pred -CCCCCCHHHHHHHHHHHCCH T ss_conf -56668878999999986768 No 47 >TIGR01954 nusA_Cterm_rpt transcription termination factor NusA, C-terminal duplication; InterPro: IPR010214 NusA, or N utilisation substance protein A, is a bacterial transcription termination factor. It binds to RNA polymerase alpha subunit and promotes termination at certain RNA hairpin structures. It is named for the interaction in Escherichia coli of phage lambda antitermination protein N with the N-utilisation substance, consisting of NusA, NusB, NusE (ribosomal protein S10), and NusG . This entry represents an acidic 50-residue region found in two copies toward the C terminus of most proteobacterial NusA proteins, spaced about 26 residues apart. Analogous C-terminal extensions in some other bacterial lineages lack apparent homology but appear similarly acidic ; GO: 0030528 transcription regulator activity. Probab=80.86 E-value=3.8 Score=21.78 Aligned_cols=42 Identities=26% Similarity=0.317 Sum_probs=36.2 Q ss_pred HHHHHHHHH-HCCCHHHHHCCCHHHHHH--CCCCCHHHHHHHHHH Q ss_conf 999999997-379978985099789973--789889999997549 Q 537021.9.peg.8 57 KSIAKALLK-RFATLGGVFGAPLHLLQE--INRIGKRVALELKLV 98 (148) Q Consensus 57 ~~lA~~lL~-~fGsL~~ll~as~~eL~~--i~GIG~aka~~l~a~ 98 (148) ..+|+.|.. .|-++-.|.+.+++||.. |.|+..-.|..|+.. T Consensus 3 e~~a~~Lv~eGf~t~EdlAy~~~~EL~~fGIeG~~Ee~A~~L~~~ 47 (52) T TIGR01954 3 EEIAQLLVEEGFTTVEDLAYVPVDELLSFGIEGLDEETAKELINR 47 (52) T ss_pred HHHHHHHHHHCCCHHHHHHCCCHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 788999997367317887504556688635899887889999999 No 48 >pfam03118 RNA_pol_A_CTD Bacterial RNA polymerase, alpha chain C terminal domain. The alpha subunit of RNA polymerase consists of two independently folded domains, referred to as amino-terminal and carboxyl terminal domains. The amino terminal domain is involved in the interaction with the other subunits of the RNA polymerase. The carboxyl-terminal domain interacts with the DNA and activators. The amino acid sequence of the alpha subunit is conserved in prokaryotic and chloroplast RNA polymerases. There are three regions of particularly strong conservation, two in the amino-terminal and one in the carboxyl- terminal. Probab=80.81 E-value=4.1 Score=21.58 Aligned_cols=40 Identities=28% Similarity=0.325 Sum_probs=32.9 Q ss_pred HHHHHHH--CCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 9999973--799789850997899737898899999975499 Q 537021.9.peg.8 60 AKALLKR--FATLGGVFGAPLHLLQEINRIGKRVALELKLVS 99 (148) Q Consensus 60 A~~lL~~--fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~ 99 (148) +.+.|.+ ...++.|...+.++|.+++++|....-.|+-.. T Consensus 19 ~~N~Lk~~~I~tv~dL~~~s~~dLl~i~N~G~kSl~EI~~~L 60 (62) T pfam03118 19 SYNCLKRAGINTVGDLLSKSEEDLLKIKNFGKKSLEEIKEKL 60 (62) T ss_pred HHHHHHHCCCCCHHHHHHCCHHHHHHCCCCCHHHHHHHHHHH T ss_conf 999998949967999985899999748898685799999998 No 49 >TIGR01083 nth endonuclease III; InterPro: IPR005759 The spectrum of DNA damage caused by reactive oxygen species includes a wide variety of modifications of purine and pyrimidine bases. Among these modified bases, 7,8-dihydro-8-oxoguanine (8-oxoG) is an important mutagenic lesion. Base excision repair is a critical mechanism for preventing mutations by removing the oxidative lesion from the DNA. Escherichia coli Nth protein (endonuclease III) has an 8-oxoG DNA glycosylase/AP lyase activity which removes 8-oxoG preferentially from 8-oxoG/G mispairs. Human hNTH1 protein, a homolog of E. coli Nth protein, has similar DNA glycosylase/AP lyase activity that removes 8-oxoG from 8-oxoG/G mispairs . ; GO: 0003906 DNA-(apurinic or apyrimidinic site) lyase activity, 0006284 base-excision repair, 0005622 intracellular. Probab=80.60 E-value=0.57 Score=27.06 Aligned_cols=56 Identities=20% Similarity=0.202 Sum_probs=40.1 Q ss_pred HHHHHHHHHC---CCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 9999999733---7998988999999997379978985099789973789889999997549 Q 537021.9.peg.8 40 YEILELILFR---LIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLV 98 (148) Q Consensus 40 ~ELLallL~~---g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~ 98 (148) .||=++|=.. -++-++...+|+.|+++||| .| =.+.+||.+.||+|...|=.+... T Consensus 68 eel~~~Ik~iGlYr~KAk~I~~~~~~LvE~y~G--eV-P~~~~eL~~LPGVGRKTANVVL~~ 126 (192) T TIGR01083 68 EELEEYIKSIGLYRNKAKNIIALCRKLVERYGG--EV-PEDREELVKLPGVGRKTANVVLNV 126 (192) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCC--CC-CCCHHHHHCCCCCCCHHHHHHHHH T ss_conf 134776422586456899999999999998189--87-755376617899871145624334 No 50 >TIGR01448 recD_rel helicase, RecD/TraA family; InterPro: IPR006345 These sequences represent a family similar to RecD, the exodeoxyribonuclease V alpha chain of IPR006344 from INTERPRO. Members of this family, however, are not found in a context of RecB and RecC and are longer by about 200 amino acids at the amino end. Chlamydia muridarum has both a member of this family and a RecD. . Probab=80.08 E-value=1.4 Score=24.53 Aligned_cols=88 Identities=13% Similarity=0.025 Sum_probs=54.4 Q ss_pred CHHHHHHHHHHCCCCCCCHHHHHHHHHHHCC-CHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHH---HHHHHHHHH-H Q ss_conf 9899999997337998988999999997379-9789850997899737898899999975499999---999998764-0 Q 537021.9.peg.8 38 AYYEILELILFRLIPRKDTKSIAKALLKRFA-TLGGVFGAPLHLLQEINRIGKRVALELKLVSVAS---QRILKANWS-I 112 (148) Q Consensus 38 sD~ELLallL~~g~~~~~~~~lA~~lL~~fG-sL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~---rR~~~~~~~-~ 112 (148) +-.|=+...|.++.=+-=-+-+|++|.++|| ..=.++.-+|+-|.+|.||+.++.-.+.--.... ++++ ..+. - T Consensus 84 ~~~~gi~~Yl~S~~~KGvGk~~Aq~Iv~TfGe~~~~~l~~~P~kL~~V~G~s~~~~~~~~~~~~~~~~~~~~l-~~L~~l 162 (769) T TIGR01448 84 TSKEGIVAYLSSSSIKGVGKKLAQRIVETFGEAAIDVLDDDPEKLLEVSGISKANLEKIVSQLSKQKDERKLL-LKLQQL 162 (769) T ss_pred CHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHEEEECCCCHHHHHHHHHHHHHHHHHHHHH-HHHHHC T ss_conf 5067876555225416720577899999876999988740860103307756677899999999755899999-999853 Q ss_pred CCCCCCHHHHHHHH Q ss_conf 67788989999999 Q 537021.9.peg.8 113 KKFLTPGQPCLISV 126 (148) Q Consensus 113 ~~~l~s~~~v~~yl 126 (148) ....+=...+++++ T Consensus 163 G~~~~l~~~~~k~y 176 (769) T TIGR01448 163 GIGIKLALRIYKFY 176 (769) T ss_pred CCCHHHHHHHHHHH T ss_conf 87678999999984 No 51 >pfam10391 DNA_pol_lambd_f Fingers domain of DNA polymerase lambda. DNA polymerases catalyse the addition of dNMPs onto the 3-prime ends of DNA chains. There is a general polymerase fold consisting of three subdomains that have been likened to the fingers, palm, and thumb of a right hand. DNA_pol_lambd_f is the central three-helical region of DNA polymerase lambda referred to as the F and G helices of the fingers domain. Contacts with DNA involve this conserved helix-hairpin-helix motif in the fingers region which interacts with the primer strand. This motif is common to several DNA binding proteins and confers a sequence-independent interaction with the DNA backbone. Probab=78.35 E-value=1.6 Score=24.23 Aligned_cols=42 Identities=24% Similarity=0.147 Sum_probs=29.7 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 899737898899999975499999999998764067788989999 Q 537021.9.peg.8 79 HLLQEINRIGKRVALELKLVSVASQRILKANWSIKKFLTPGQPCL 123 (148) Q Consensus 79 ~eL~~i~GIG~aka~~l~a~~El~rR~~~~~~~~~~~l~s~~~v~ 123 (148) ..|++|.||||++|..+.. .+-|....-...+..++..+.+- T Consensus 2 ~~f~~I~GvGp~~A~~~~~---~G~~ti~dL~~~~~~L~~~Q~iG 43 (52) T pfam10391 2 KLFTNIWGVGPKTARKWYR---QGIRTLEDLRENKASLTRAQQIG 43 (52) T ss_pred HHHHHCCCCCHHHHHHHHH---HCCCCHHHHHHCCCCCCHHHHHH T ss_conf 0366354406999999999---47888999974631089999999 No 52 >PRK12766 50S ribosomal protein L32e; Provisional Probab=78.08 E-value=1.7 Score=23.97 Aligned_cols=54 Identities=26% Similarity=0.225 Sum_probs=39.3 Q ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHH-HHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 89999999733799898899999999-7379978985099789973789889999997549 Q 537021.9.peg.8 39 YYEILELILFRLIPRKDTKSIAKALL-KRFATLGGVFGAPLHLLQEINRIGKRVALELKLV 98 (148) Q Consensus 39 D~ELLallL~~g~~~~~~~~lA~~lL-~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~ 98 (148) ++|=|+=|-+-|... |..|- ..|-++..+-.|+..+|..+.|||-+-|+.|+|- T Consensus 8 ~~~~l~disgvg~~~------a~~l~~ag~e~~~d~~~a~q~~l~~~~gignalaarikad 62 (238) T PRK12766 8 EYEELTDISGVGPSK------AESLREAGFESVEDVRAADQSELADVDGIGNALAARIKAD 62 (238) T ss_pred CHHHHHCCCCCCHHH------HHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHCC T ss_conf 767764123668003------6788870401289897642776642013138999987414 No 53 >TIGR02027 rpoA DNA-directed RNA polymerase, alpha subunit; InterPro: IPR011773 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This family consists of the bacterial (and chloroplast) DNA-directed RNA polymerase alpha subunit, encoded by the rpoA gene. The RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. The amino terminal domain is involved in dimerizing and assembling the other RNA polymerase subunits into a transcriptionally active enzyme. The carboxy-terminal domain contains determinants for interaction with DNA and with transcriptional activator proteins , .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription DNA-dependent. Probab=76.47 E-value=4.3 Score=21.42 Aligned_cols=42 Identities=29% Similarity=0.267 Sum_probs=35.5 Q ss_pred HHHHHHHHCC--CHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 9999997379--97898509978997378988999999754999 Q 537021.9.peg.8 59 IAKALLKRFA--TLGGVFGAPLHLLQEINRIGKRVALELKLVSV 100 (148) Q Consensus 59 lA~~lL~~fG--sL~~ll~as~~eL~~i~GIG~aka~~l~a~~E 100 (148) =|-+.|++-| +++.|...|.+||.+|+-+|......|+...+ T Consensus 271 Rs~NCLk~~gI~t~geL~~~~e~eLl~i~NfGkKSl~EI~ekL~ 314 (324) T TIGR02027 271 RSYNCLKRAGIHTLGELVSKSEEELLKIKNFGKKSLDEIKEKLA 314 (324) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 98864646035417888860389884178857333899999988 No 54 >COG1379 PHP family phosphoesterase with a Zn ribbon [General function prediction only] Probab=75.05 E-value=7.4 Score=19.89 Aligned_cols=77 Identities=21% Similarity=0.285 Sum_probs=55.2 Q ss_pred CCHHHHHHHHHHHCCHH----------HCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCC-HHHHHCCCHHHHHHCCCC Q ss_conf 01650199999823922----------1898999999973379989889999999973799-789850997899737898 Q 537021.9.peg.8 19 IGHHNRLRDRFLQKGEN----------ALAYYEILELILFRLIPRKDTKSIAKALLKRFAT-LGGVFGAPLHLLQEINRI 87 (148) Q Consensus 19 ~ghr~R~RErll~~G~~----------~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fGs-L~~ll~as~~eL~~i~GI 87 (148) .|-++|.+|=--.. ++ -..=+|++.+-|+-|..-+.|+.+=.++...||+ +.-|.+|++++|+.+. T Consensus 277 KGV~dRv~ELad~~-~~~p~~RPPYlhliPLaeIi~~~~g~gi~tK~V~~~we~lv~~FGtEi~vLi~a~~e~La~V~-- 353 (403) T COG1379 277 KGVSDRVLELADTE-PEHPKHRPPYLHLIPLAEIISMALGKGITTKAVKRTWERLVRAFGTEIDVLIDAPIEELARVD-- 353 (403) T ss_pred HHHHHHHHHHHCCC-CCCCCCCCCCEECCCHHHHHHHHHCCCEECHHHHHHHHHHHHHHCCHHHHHHCCCHHHHHHHH-- T ss_conf 30889999862057-336899998222064998888773466000268999999999842131567528989996645-- Q ss_pred CHHHHHHHHHHH Q ss_conf 899999975499 Q 537021.9.peg.8 88 GKRVALELKLVS 99 (148) Q Consensus 88 G~aka~~l~a~~ 99 (148) +..|..|.+.. T Consensus 354 -~~vA~aI~~~R 364 (403) T COG1379 354 -PKVAEAIVAFR 364 (403) T ss_pred -HHHHHHHHHHH T ss_conf -89999999984 No 55 >PRK03980 flap endonuclease-1; Provisional Probab=72.39 E-value=3.8 Score=21.79 Aligned_cols=55 Identities=15% Similarity=0.028 Sum_probs=30.5 Q ss_pred HHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 898509978997378988999999754999999999987640677889899999999 Q 537021.9.peg.8 71 GGVFGAPLHLLQEINRIGKRVALELKLVSVASQRILKANWSIKKFLTPGQPCLISVE 127 (148) Q Consensus 71 ~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~~~~~~~~~~l~s~~~v~~yl~ 127 (148) .=|+.-|+. ..|+||||.+|..+.--.--...+....-...+..-+++++.+++. T Consensus 183 cIL~GcDY~--~gI~gIGpk~Alklikk~~~ie~il~~~~~~~~~~~~~~e~r~lF~ 237 (295) T PRK03980 183 AILVGTDYN--PGVKGIGPKTALKLIKKHGDLEKVLEAVGEGIDEPVDPEEIREFFL 237 (295) T ss_pred HHHCCCCCC--CCCCCCCHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCHHHHHHHHC T ss_conf 984588899--9999842999999999969999999863767788999799999847 No 56 >TIGR00084 ruvA Holliday junction DNA helicase RuvA; InterPro: IPR000085 In prokaryotes, RuvA, RuvB, and RuvC process the universal DNA intermediate of homologous recombination, termed Holliday junction. The tetrameric DNA helicase RuvA specifically binds to the Holliday junction and facilitates the isomerization of the junction from the stacked folded configuration to the square-planar structure . In the RuvA tetramer, each subunit consists of three domains, I, II and III, where I and II form the major core that is responsible for Holliday junction binding and base pair rearrangements of Holliday junction executed at the crossover point, whereas domain III regulates branch migration through direct contact with RuvB.; GO: 0003678 DNA helicase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=72.36 E-value=4.4 Score=21.33 Aligned_cols=75 Identities=19% Similarity=0.149 Sum_probs=49.7 Q ss_pred HHHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHC--CCHHHHHC-CCHHHHHHCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 982392218989999999733799898899999999737--99789850-997899737898899999975499999999 Q 537021.9.peg.8 29 FLQKGENALAYYEILELILFRLIPRKDTKSIAKALLKRF--ATLGGVFG-APLHLLQEINRIGKRVALELKLVSVASQRI 105 (148) Q Consensus 29 ll~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~f--GsL~~ll~-as~~eL~~i~GIG~aka~~l~a~~El~rR~ 105 (148) ..-+|-..+.+-+|-.=||...-=+ =-+|=++|..+ +.+....+ -+++.|+++||||...|-+|.+ .||--|. T Consensus 68 ~~LfGF~~~~Er~lF~~Li~~nGvG---pk~ALaiL~~~~~~~~~~ai~~~~~~~L~k~pGvGKK~A~~l~~-leL~gk~ 143 (217) T TIGR00084 68 HLLFGFNTLEERELFKELIKVNGVG---PKLALAILSNMSPEEFVQAIETEEVKALVKIPGVGKKTAERLLA-LELKGKL 143 (217) T ss_pred HHHHCCCCHHHHHHHHHHHHCCCCH---HHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH-HHHHHHH T ss_conf 9973479877899999985148802---89999986678875898888641044420458857378999987-7754544 Q ss_pred HH Q ss_conf 99 Q 537021.9.peg.8 106 LK 107 (148) Q Consensus 106 ~~ 107 (148) .. T Consensus 144 ~~ 145 (217) T TIGR00084 144 KG 145 (217) T ss_pred CC T ss_conf 05 No 57 >TIGR00375 TIGR00375 conserved hypothetical protein TIGR00375; InterPro: IPR005287 This family of conserved hypothetical proteins has no known function. . Probab=70.11 E-value=3.9 Score=21.72 Aligned_cols=62 Identities=21% Similarity=0.257 Sum_probs=51.2 Q ss_pred HCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCC-HHHHHCCCHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 1898999999973379989889999999973799-789850997899737898899999975499 Q 537021.9.peg.8 36 ALAYYEILELILFRLIPRKDTKSIAKALLKRFAT-LGGVFGAPLHLLQEINRIGKRVALELKLVS 99 (148) Q Consensus 36 ~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fGs-L~~ll~as~~eL~~i~GIG~aka~~l~a~~ 99 (148) -+.=+|++.+-..+|.--+-|..+=.+|+..||+ +.=|++|+.+||.+| ..|..|..|.+.. T Consensus 305 liPLae~ig~~~~kG~~TK~VQ~~wekl~k~fGtEi~VL~~A~~edla~~--~pPkvA~~i~~fR 367 (384) T TIGR00375 305 LIPLAEVIGLKIDKGVFTKAVQSLWEKLVKKFGTEIEVLLEAALEDLAKV--DPPKVAALIEKFR 367 (384) T ss_pred ECCCEEECCCCCCCCEEEHHHHHHHHHHHHHHCCHHHHHHCCCHHHHHCC--CCCHHHHHHHHHH T ss_conf 05630003534587313034677899986410415667642784587335--8835889999863 No 58 >COG0632 RuvA Holliday junction resolvasome, DNA-binding subunit [DNA replication, recombination, and repair] Probab=69.52 E-value=9.9 Score=19.08 Aligned_cols=74 Identities=19% Similarity=0.174 Sum_probs=49.3 Q ss_pred HHHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCC--CH-HHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 9823922189899999997337998988999999997379--97-89850997899737898899999975499999999 Q 537021.9.peg.8 29 FLQKGENALAYYEILELILFRLIPRKDTKSIAKALLKRFA--TL-GGVFGAPLHLLQEINRIGKRVALELKLVSVASQRI 105 (148) Q Consensus 29 ll~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fG--sL-~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~ 105 (148) ..-||-.+..+-||...++.-..=+ . -+|=.+|+.+. .| ..+.+.++..|+++||||+..|-+|. .||--+. T Consensus 58 ~~LyGF~~~~ER~lF~~LisVnGIG--p-K~ALaiLs~~~~~~l~~aI~~~d~~~L~k~PGIGkKtAeriv--leLk~K~ 132 (201) T COG0632 58 HLLYGFLTEEERELFRLLISVNGIG--P-KLALAILSNLDPEELAQAIANEDVKALSKIPGIGKKTAERIV--LELKGKL 132 (201) T ss_pred HHHCCCCCHHHHHHHHHHHCCCCCC--H-HHHHHHHCCCCHHHHHHHHHHCCHHHHHCCCCCCHHHHHHHH--HHHHHHH T ss_conf 8870899889999999987118805--8-999999848999999999983286764418987788999999--9976056 Q ss_pred HH Q ss_conf 99 Q 537021.9.peg.8 106 LK 107 (148) Q Consensus 106 ~~ 107 (148) .. T Consensus 133 ~~ 134 (201) T COG0632 133 AA 134 (201) T ss_pred HH T ss_conf 41 No 59 >smart00278 HhH1 Helix-hairpin-helix DNA-binding motif class 1. Probab=68.83 E-value=2.2 Score=23.23 Aligned_cols=22 Identities=18% Similarity=0.212 Sum_probs=18.0 Q ss_pred HHHHHCCCCCHHHHHHHHHHHH Q ss_conf 8997378988999999754999 Q 537021.9.peg.8 79 HLLQEINRIGKRVALELKLVSV 100 (148) Q Consensus 79 ~eL~~i~GIG~aka~~l~a~~E 100 (148) .+|++|+|||+.+|..|..... T Consensus 1 ~~L~~v~GIG~k~A~~ll~~~~ 22 (26) T smart00278 1 EELLKVPGIGPKTAEKILEAXX 22 (26) T ss_pred CCCCCCCCCCCHHHHHHHHHHH T ss_conf 9210179988115999999762 No 60 >CHL00013 rpoA RNA polymerase alpha subunit Probab=68.77 E-value=10 Score=18.98 Aligned_cols=41 Identities=22% Similarity=0.173 Sum_probs=34.3 Q ss_pred HHHHHHH--CCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 9999973--7997898509978997378988999999754999 Q 537021.9.peg.8 60 AKALLKR--FATLGGVFGAPLHLLQEINRIGKRVALELKLVSV 100 (148) Q Consensus 60 A~~lL~~--fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~E 100 (148) |.+.|++ ...++.|+..+.+||.++++.|+...-+|+...+ T Consensus 283 s~NCLKra~I~tvgdLv~~se~dLl~ikNfGkKSl~EI~ekL~ 325 (333) T CHL00013 283 AYNCLKRANIHTLLDLLNYSQEDLLKIKNFGQKSVKQVLDALE 325 (333) T ss_pred HHHHHHHCCCCCHHHHHHCCHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 7743637389379998638999994189986302999999998 No 61 >PRK05182 DNA-directed RNA polymerase subunit alpha; Provisional Probab=67.80 E-value=10 Score=19.00 Aligned_cols=41 Identities=24% Similarity=0.242 Sum_probs=34.2 Q ss_pred HHHHHHHC--CCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999737--997898509978997378988999999754999 Q 537021.9.peg.8 60 AKALLKRF--ATLGGVFGAPLHLLQEINRIGKRVALELKLVSV 100 (148) Q Consensus 60 A~~lL~~f--GsL~~ll~as~~eL~~i~GIG~aka~~l~a~~E 100 (148) |.+.|++- ..++.|+..+.+||.++++.|....-.|+...+ T Consensus 256 s~NcLk~a~I~tI~dLv~~se~dLl~ikNfGkKSl~EI~~kL~ 298 (306) T PRK05182 256 SYNCLKRAGINTIGDLVQRTEEELLKTPNLGKKSLEEIKEKLA 298 (306) T ss_pred HHHHHHHCCCCCHHHHHHCCHHHHHHCCCCCCCCHHHHHHHHH T ss_conf 8755878599369998747999996189976203999999999 No 62 >TIGR00588 ogg 8-oxoguanine DNA-glycosylase (ogg); InterPro: IPR004577 All proteins in this family for which functions are known are 8-oxo-guanaine DNA glycosylases that function in base excision repair. The enzyme incises DNA at 8-oxoG residues, and excises 7,8-dihydro-8-oxoguanine from damaged DNA. It has beta-lyase activity that nicks DNA 3' to the lesion.; GO: 0008534 oxidized purine base lesion DNA N-glycosylase activity, 0006281 DNA repair, 0005634 nucleus. Probab=66.85 E-value=7.9 Score=19.70 Aligned_cols=50 Identities=24% Similarity=0.155 Sum_probs=39.9 Q ss_pred HHCCCCCCCHHHHHHHHHHHCCC---------HHHH-HCCCHHH----HHHCCCCCHHHHHHHH Q ss_conf 73379989889999999973799---------7898-5099789----9737898899999975 Q 537021.9.peg.8 47 LFRLIPRKDTKSIAKALLKRFAT---------LGGV-FGAPLHL----LQEINRIGKRVALELK 96 (148) Q Consensus 47 L~~g~~~~~~~~lA~~lL~~fGs---------L~~l-l~as~~e----L~~i~GIG~aka~~l~ 96 (148) |+-|.+-+=+.+.|+.|+++-|. |..| -.++++| |+..+|||+..|-=|. T Consensus 228 lG~GYRA~Yi~~tar~l~ee~~~~nitsdta~LQ~ic~~~~Yedar~~L~~l~GVG~KVADCic 291 (379) T TIGR00588 228 LGLGYRARYIRETARALLEEQGGRNITSDTAWLQQICKDADYEDAREALLELPGVGPKVADCIC 291 (379) T ss_pred CCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 4887540689999999884126642002357999860668867899997216999704888888 No 63 >PRK02362 ski2-like helicase; Provisional Probab=66.81 E-value=11 Score=18.73 Aligned_cols=33 Identities=18% Similarity=0.314 Sum_probs=13.9 Q ss_pred HHHHHH-HCCCHHHHHCCCHHHHHHCCCCCHHHHHH Q ss_conf 999997-37997898509978997378988999999 Q 537021.9.peg.8 60 AKALLK-RFATLGGVFGAPLHLLQEINRIGKRVALE 94 (148) Q Consensus 60 A~~lL~-~fGsL~~ll~as~~eL~~i~GIG~aka~~ 94 (148) |+.|.+ .|.|+..|.+|+++++..+ +|+..|.+ T Consensus 665 AR~Ly~aGi~s~~dla~A~p~~l~~i--lg~~~a~~ 698 (736) T PRK02362 665 ARRLYNAGITSRADLRAADKEVVAAI--LGPKIAER 698 (736) T ss_pred HHHHHHCCCCCHHHHHHCCHHHHHHH--HHHHHHHH T ss_conf 99999879999999970999999999--77789999 No 64 >TIGR01084 mutY A/G-specific adenine glycosylase; InterPro: IPR005760 The DNA repair enzyme MutY plays an important role in the prevention of DNA mutations resulting from the presence of the oxidatively damaged lesion 7,8-dihydro-8-oxo-2'-deoxyguanosine (8-OxoG). 8-OxoG can mispair with 2'-deoxycytidine 5'-triphosphate or with 2'-deoxyadenosine triphosphate during DNA replication, forming C*8-oxoG and A*8-oxoG mispairs. Human MutY is responsible for recognition and removal of the inappropriately inserted adenine in an A=8-oxoG mispair. If unrepaired, the A=8-oxoG mispairs can result in deleterious C:G to A:T transversions. Multiple forms of mammalian MutY are formed by alternative splicing and locate to the nucleus or mitochondrion, where they have been shown to interact with several other proteins involved in the repair of DNA damage . The HhH-GPD domain within the protein binds the phosphate backbone of the substrate. This family represents bacterial MutY and a limited number of murine homologs. In rat, the Escherichia coli MutY DNA glycosylase homolog (MYH) is induced in response to mitochondrial DNA damage . ; GO: 0019104 DNA N-glycosylase activity, 0006284 base-excision repair, 0005622 intracellular. Probab=66.56 E-value=3.8 Score=21.75 Aligned_cols=49 Identities=18% Similarity=0.261 Sum_probs=36.7 Q ss_pred HHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 7337998988999999997379978985099789973789889999997549 Q 537021.9.peg.8 47 LFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLV 98 (148) Q Consensus 47 L~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~ 98 (148) |++=++-.|-.-=|+.+-+.|| |-+=-+.+++..++|||+..|.-|.+. T Consensus 89 LGYYaRARNL~kAA~~v~~~fG---G~fP~d~~~~~~L~GVG~yTAgAils~ 137 (297) T TIGR01084 89 LGYYARARNLHKAAQEVVEEFG---GEFPQDLEDLKALPGVGRYTAGAILSF 137 (297) T ss_pred CCHHHHHHHHHHHHHHHHHHHC---CCCCCCHHHHHHCCCCCHHHHHHHHHH T ss_conf 7867888999999999998718---817723797851789762179999998 No 65 >TIGR00615 recR recombination protein RecR; InterPro: IPR000093 The bacterial protein recR seems to play a role in a recombinational process of DNA repair . It may act with recF and recO. RecR is a protein of about 200 amino acid residues. This protein contains a putative C4-type zinc finger in the N-terminal section.; GO: 0006281 DNA repair, 0006310 DNA recombination. Probab=63.24 E-value=5.4 Score=20.76 Aligned_cols=19 Identities=26% Similarity=0.443 Sum_probs=11.5 Q ss_pred HHHHHHCCCCCHHHHHHHH Q ss_conf 7899737898899999975 Q 537021.9.peg.8 78 LHLLQEINRIGKRVALELK 96 (148) Q Consensus 78 ~~eL~~i~GIG~aka~~l~ 96 (148) .+.|.++||||+..|..|. T Consensus 11 ie~L~kLPgiG~KsA~RlA 29 (205) T TIGR00615 11 IESLKKLPGIGPKSAQRLA 29 (205) T ss_pred HHHHHHCCCCCHHHHHHHH T ss_conf 9986407898714789999 No 66 >pfam05559 DUF763 Protein of unknown function (DUF763). This family consists of several uncharacterized bacterial and archaeal proteins of unknown function. Probab=62.89 E-value=9.4 Score=19.21 Aligned_cols=28 Identities=18% Similarity=0.163 Sum_probs=24.5 Q ss_pred CCCHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 0997899737898899999975499999 Q 537021.9.peg.8 75 GAPLHLLQEINRIGKRVALELKLVSVAS 102 (148) Q Consensus 75 ~as~~eL~~i~GIG~aka~~l~a~~El~ 102 (148) -.+.+||..++||||....-|..+.|+. T Consensus 265 p~dfeeLLl~~GvGp~TlRALaLvaElI 292 (319) T pfam05559 265 PEDFEELLLLKGVGPSTLRALALVAEVI 292 (319) T ss_pred CCCHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 0169999714798889999999999998 No 67 >COG4277 Predicted DNA-binding protein with the Helix-hairpin-helix motif [General function prediction only] Probab=61.71 E-value=5.9 Score=20.51 Aligned_cols=64 Identities=27% Similarity=0.200 Sum_probs=37.4 Q ss_pred HCCHHHHHHHHHHCCCCCCCHHH-HHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 18989999999733799898899-9999997379978985099789973789889999997549999 Q 537021.9.peg.8 36 ALAYYEILELILFRLIPRKDTKS-IAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVSVA 101 (148) Q Consensus 36 ~LsD~ELLallL~~g~~~~~~~~-lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El 101 (148) +++-.|+++.==..-.+..|-+. -|-+-.+.|. -.+-.|+.++|..|||||...|..|....-. T Consensus 288 gF~~~Ei~~~g~~~ld~~lDPK~~wAl~~~d~FP--Vdvn~A~~~~llRVPGiG~ksa~rIv~~Rr~ 352 (404) T COG4277 288 GFSADEILASGGDFLDPDLDPKTAWALKHMDRFP--VDVNKAPYKELLRVPGIGVKSARRIVMTRRR 352 (404) T ss_pred CCCHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCC--CCCCCCCHHHHCCCCCCCHHHHHHHHHHHHH T ss_conf 8887888744897557887834577773355365--1334567778211688773788999887423 No 68 >smart00483 POLXc DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases Probab=60.61 E-value=6.9 Score=20.10 Aligned_cols=39 Identities=15% Similarity=0.285 Sum_probs=23.8 Q ss_pred HHHHHHHHHHHCCCHHHH---HC----CCHHHHHHCCCCCHHHHHHH Q ss_conf 899999999737997898---50----99789973789889999997 Q 537021.9.peg.8 56 TKSIAKALLKRFATLGGV---FG----APLHLLQEINRIGKRVALEL 95 (148) Q Consensus 56 ~~~lA~~lL~~fGsL~~l---l~----as~~eL~~i~GIG~aka~~l 95 (148) .-..-.++++ -|.+..+ +. ....+|++|+||||.+|..+ T Consensus 60 ia~kI~Eil~-TG~~~~~e~~~~~~~~~~l~el~~I~GvGpk~a~~l 105 (334) T smart00483 60 IKKKIEEIIE-TGKSSKVLEILNDEVYKSLKLFTNVFGVGPKTAAKW 105 (334) T ss_pred HHHHHHHHHH-CCCCHHHHHHHCCCCCHHHHHHHCCCCCCHHHHHHH T ss_conf 9999999998-499489999872865168999853888778999999 No 69 >pfam11731 Cdd1 Pathogenicity locus. Cdd1 is expressed as part of the pathogenicity locus operon in several different orders of bacteria. Many members of the family are annotated as being putative mitomycin resistance proteins but this could not be confirmed. Probab=60.09 E-value=7.6 Score=19.81 Aligned_cols=22 Identities=36% Similarity=0.591 Sum_probs=18.3 Q ss_pred CCHHHHHHCCCCCHHHHHHHHH Q ss_conf 9978997378988999999754 Q 537021.9.peg.8 76 APLHLLQEINRIGKRVALELKL 97 (148) Q Consensus 76 as~~eL~~i~GIG~aka~~l~a 97 (148) ....+|.+|||||++.|..+.. T Consensus 9 ~~l~~L~~lPnIG~a~a~DL~~ 30 (92) T pfam11731 9 SALKELTDLPNIGKATAKDLRL 30 (92) T ss_pred HHHHHHHCCCCCCHHHHHHHHH T ss_conf 9999874189746999999999 No 70 >LOAD_Hrd consensus Probab=59.93 E-value=12 Score=18.50 Aligned_cols=36 Identities=11% Similarity=0.159 Sum_probs=25.0 Q ss_pred HHHHHCCCHHHHHHCCCCCHHHHHHHHH-HHHHHHHH Q ss_conf 7898509978997378988999999754-99999999 Q 537021.9.peg.8 70 LGGVFGAPLHLLQEINRIGKRVALELKL-VSVASQRI 105 (148) Q Consensus 70 L~~ll~as~~eL~~i~GIG~aka~~l~a-~~El~rR~ 105 (148) +....=.+.++|.+|+|||+.|+...-. +.++.+++ T Consensus 38 ia~~~P~t~~eL~~I~Gig~~k~~~yG~~il~ii~~~ 74 (77) T LOAD_Hrd 38 MAALLPTTVSELLAIDGVGEAKVEKYGDRILEVIRRS 74 (77) T ss_pred HHHHCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 9997899999982899969999999999999999987 No 71 >PRK08609 hypothetical protein; Provisional Probab=58.10 E-value=14 Score=18.13 Aligned_cols=40 Identities=15% Similarity=0.118 Sum_probs=22.9 Q ss_pred HHHHHHHHH--HCCCHHH---HHCC---CHHHHHHCCCCCHHHHHHHH Q ss_conf 999999997--3799789---8509---97899737898899999975 Q 537021.9.peg.8 57 KSIAKALLK--RFATLGG---VFGA---PLHLLQEINRIGKRVALELK 96 (148) Q Consensus 57 ~~lA~~lL~--~fGsL~~---ll~a---s~~eL~~i~GIG~aka~~l~ 96 (148) +.+|+.+-. +-|.+.. +..- ..-+|.+|+||||.++.+|- T Consensus 58 k~Ia~KI~Eil~TG~l~~le~L~~~~P~gl~eLl~IpGlGPKka~~L~ 105 (570) T PRK08609 58 KGTAEVIQEYRETGESSVLQELQKEVPEGLLPLLKLPGLGGKKIAKLY 105 (570) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHCCHHHHHHHCCCCCCHHHHHHHH T ss_conf 999999999997299089999985487779999778987789999999 No 72 >pfam02371 Transposase_20 Transposase IS116/IS110/IS902 family. Transposases are needed for efficient transposition of the insertion sequence or transposon DNA. This family includes transposases for IS116, IS110 and IS902. This region is often found with pfam01548. The exact function of this region is uncertain. This family contains a HHH motif suggesting a DNA-binding function. Probab=57.65 E-value=7.6 Score=19.81 Aligned_cols=37 Identities=19% Similarity=0.052 Sum_probs=24.4 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 899737898899999975499999999998764067788989999999 Q 537021.9.peg.8 79 HLLQEINRIGKRVALELKLVSVASQRILKANWSIKKFLTPGQPCLISV 126 (148) Q Consensus 79 ~eL~~i~GIG~aka~~l~a~~El~rR~~~~~~~~~~~l~s~~~v~~yl 126 (148) +-|+.|||||+..++.|.+...=.. .+.|+.++..|+ T Consensus 2 ~~L~sipGiG~~~a~~l~aeigd~~-----------rF~~~~~~~s~~ 38 (87) T pfam02371 2 ELLLSIPGIGPITAAALLAEIGDIS-----------RFKSARQLAAYA 38 (87) T ss_pred CHHHCCCCCCHHHHHHHHHHHCCHH-----------HCCCHHHHHHHC T ss_conf 2342699952999999999929853-----------278999999983 No 73 >COG1796 POL4 DNA polymerase IV (family X) [DNA replication, recombination, and repair] Probab=56.12 E-value=13 Score=18.40 Aligned_cols=44 Identities=18% Similarity=0.106 Sum_probs=21.2 Q ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHH Q ss_conf 89999999733799898899999999737997898509978997378988999999 Q 537021.9.peg.8 39 YYEILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALE 94 (148) Q Consensus 39 D~ELLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~ 94 (148) -++-++.++.||.-. ....+-..|+ .+..+|++++|+||.|.+. T Consensus 65 ia~~I~e~l~tG~~~-----~le~lk~~~P-------~gl~~Ll~v~GlGpkKi~~ 108 (326) T COG1796 65 IAEKISEYLDTGEVK-----KLEALKKEVP-------EGLEPLLKVPGLGPKKIVS 108 (326) T ss_pred HHHHHHHHHHCCCCH-----HHHHHHHHCC-------CCHHHHHHCCCCCCHHHHH T ss_conf 999999999728607-----8999988579-------5558786077989288999 No 74 >TIGR03674 fen_arch flap structure-specific endonuclease. Endonuclease that cleaves the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. Has 5'-endo-/exonuclease and 5'-pseudo-Y-endonuclease activities. Cleaves the junction between single and double-stranded regions of flap DNA Probab=55.58 E-value=13 Score=18.34 Aligned_cols=53 Identities=17% Similarity=0.154 Sum_probs=29.8 Q ss_pred HHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 7898509978997378988999999754999999999987640677889899999999 Q 537021.9.peg.8 70 LGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRILKANWSIKKFLTPGQPCLISVE 127 (148) Q Consensus 70 L~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~~~~~~~~~~l~s~~~v~~yl~ 127 (148) +.=|+..|+. ..|+||||.+|..+.--.--...+.... .....+++++.+++. T Consensus 229 lcIL~GcDY~--~gI~GIG~k~A~klIkk~~~ie~i~~~~---~~~~~~~~~~r~lF~ 281 (338) T TIGR03674 229 IAILVGTDYN--EGVKGIGPKTALKLIKEHGDLEKVLKAR---GEDIENYDEIREFFL 281 (338) T ss_pred HHHHHCCCCC--CCCCCCCHHHHHHHHHHHCCHHHHHHHC---CCCCCCHHHHHHHHC T ss_conf 9997087567--8999856899999999859999999865---688778999999848 No 75 >TIGR02238 recomb_DMC1 meiotic recombinase Dmc1; InterPro: IPR011940 This entry describes DMC1, a subfamily of a larger family of DNA repair and recombination proteins. It is eukaryotic only and most closely related to eukaryotic RAD51. It also resembles archaeal RadA (IPR011938 from INTERPRO) and RadB (IPR011939 from INTERPRO) and bacterial RecA. It has been characterised for human as a recombinase active only in meiosis .; GO: 0003677 DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0007131 meiotic recombination, 0005634 nucleus. Probab=53.76 E-value=19 Score=17.27 Aligned_cols=50 Identities=20% Similarity=0.210 Sum_probs=35.5 Q ss_pred CCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 89889999999973799789850997899737898899999975499999999 Q 537021.9.peg.8 53 RKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRI 105 (148) Q Consensus 53 ~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~ 105 (148) +-|+.++.+-=-...-+..++...+-..|++|+|+.++|+-.|+ |.+..+ T Consensus 8 GIn~~Di~KLk~~g~~T~~~~~~~T~r~L~~iKGlSEaKVdKi~---EaA~K~ 57 (314) T TIGR02238 8 GINAADIKKLKSAGICTVNGVIMTTKRELLKIKGLSEAKVDKIK---EAASKI 57 (314) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH---HHHHHH T ss_conf 88788999988636899999999888887403785167799999---999876 No 76 >smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin. Probab=53.08 E-value=12 Score=18.50 Aligned_cols=33 Identities=12% Similarity=0.113 Sum_probs=27.9 Q ss_pred HHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 221898999999973379989889999999973 Q 537021.9.peg.8 34 ENALAYYEILELILFRLIPRKDTKSIAKALLKR 66 (148) Q Consensus 34 ~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~ 66 (148) ...|||.||-+-++.+|.+..++.+--+.+..+ T Consensus 3 v~~Lsd~eLr~~L~~~G~~~gPIt~sTR~vy~K 35 (44) T smart00540 3 VDRLSDAELRAELKQYGLPPGPITDTTRKLYEK 35 (44) T ss_pred HHHHCHHHHHHHHHHCCCCCCCCCCCHHHHHHH T ss_conf 878269999999998399999968330999999 No 77 >COG1415 Uncharacterized conserved protein [Function unknown] Probab=52.08 E-value=18 Score=17.39 Aligned_cols=34 Identities=21% Similarity=0.117 Sum_probs=27.9 Q ss_pred CHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 9789850997899737898899999975499999 Q 537021.9.peg.8 69 TLGGVFGAPLHLLQEINRIGKRVALELKLVSVAS 102 (148) Q Consensus 69 sL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~ 102 (148) .+...--.+.++|.-++||||....-|..+.|+. T Consensus 268 ~~ye~~p~Df~elLl~~GiGpstvRALalVAEvI 301 (373) T COG1415 268 RLYELNPDDFEELLLVPGIGPSTVRALALVAEVI 301 (373) T ss_pred HHHHCCCCCHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 6644683249998740687889999999999998 No 78 >PRK07847 amidophosphoribosyltransferase; Provisional Probab=52.03 E-value=20 Score=17.10 Aligned_cols=47 Identities=19% Similarity=0.138 Sum_probs=38.4 Q ss_pred HHHHHHHHHHCC-------HHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCC Q ss_conf 501999998239-------22189899999997337998988999999997379 Q 537021.9.peg.8 22 HNRLRDRFLQKG-------ENALAYYEILELILFRLIPRKDTKSIAKALLKRFA 68 (148) Q Consensus 22 r~R~RErll~~G-------~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fG 68 (148) -+-+|+.+.+.| ..+-||.|+++-+|.......+..+..++++++.. T Consensus 107 ~~eLr~~L~~~G~i~~~~~f~s~sDTEvi~~lia~~~~~~~i~~~i~~~~~~i~ 160 (489) T PRK07847 107 TAELAARARDLGLIAGRCPAPATTDSDLLTALLAHGAADSTLEQAALELLPTVR 160 (489) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCC T ss_conf 999999999668743334678774799999999734676779999999998606 No 79 >PRK10308 3-methyl-adenine DNA glycosylase II; Provisional Probab=50.56 E-value=5.9 Score=20.54 Aligned_cols=47 Identities=21% Similarity=0.139 Sum_probs=23.8 Q ss_pred CCCCCCHHHHHHHHHHHCCCHHHHH--CCCHHHHHHCCCCCHHHHHHHH Q ss_conf 7998988999999997379978985--0997899737898899999975 Q 537021.9.peg.8 50 LIPRKDTKSIAKALLKRFATLGGVF--GAPLHLLQEINRIGKRVALELK 96 (148) Q Consensus 50 g~~~~~~~~lA~~lL~~fGsL~~ll--~as~~eL~~i~GIG~aka~~l~ 96 (148) +.+......+|+.+.+.--.|..-. .....+|+.|+||||..|-.+. T Consensus 176 ~~ra~tl~~lA~a~~~g~l~l~~~~d~~~~~~~L~~l~GIGpWTa~Yv~ 224 (283) T PRK10308 176 LKRAEALIHLANAALEGTLPLTAPGDVEQAMKTLQTFPGIGRWTANYFA 224 (283) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 6899999999999966986677789999999998736797889999999 No 80 >PRK11057 ATP-dependent DNA helicase RecQ; Provisional Probab=49.19 E-value=15 Score=17.91 Aligned_cols=35 Identities=9% Similarity=0.043 Sum_probs=20.5 Q ss_pred HHHHCCCHHHHHHCCCCCHHHHHHHH-HHHHHHHHH Q ss_conf 89850997899737898899999975-499999999 Q 537021.9.peg.8 71 GGVFGAPLHLLQEINRIGKRVALELK-LVSVASQRI 105 (148) Q Consensus 71 ~~ll~as~~eL~~i~GIG~aka~~l~-a~~El~rR~ 105 (148) ....=.+.+||.+|+|||++|+..-- ...|+.+.+ T Consensus 567 A~~rP~T~eELl~I~GVG~~KlekYG~~FLevI~~y 602 (607) T PRK11057 567 AEQMPITASEMLSVNGVGQRKLERFGKPFMALIRAH 602 (607) T ss_pred HHHCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 997889999981899989999999999999999999 No 81 >cd00141 NT_POLXc Nucleotidyltransferase (NT) domain of family X DNA Polymerases. X family polymerases fill in short gaps during DNA repair. They are relatively inaccurate enzymes and play roles in base excision repair, in non-homologous end joining (NHEJ) which acts mainly to repair damage due to ionizing radiation, and in V(D)J recombination. This family includes eukaryotic Pol beta, Pol lambda, Pol mu, and terminal deoxyribonucleotidyl transferase (TdT). Pol beta and Pol lambda are primarily DNA template-dependent polymerases. TdT is a DNA template-independent polymerase. Pol mu has both template dependent and template independent activities. This subgroup belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal ion mechanism of nucleotide addition. These three carboxylate residues are fairly well conserved in this Probab=48.26 E-value=23 Score=16.72 Aligned_cols=53 Identities=15% Similarity=0.126 Sum_probs=30.7 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHHHH---------------------HHHHHCCCCCCHHHHHHHHHHHHCCC Q ss_conf 789973789889999997549999999999---------------------87640677889899999999984597 Q 537021.9.peg.8 78 LHLLQEINRIGKRVALELKLVSVASQRILK---------------------ANWSIKKFLTPGQPCLISVERHYPMK 133 (148) Q Consensus 78 ~~eL~~i~GIG~aka~~l~a~~El~rR~~~---------------------~~~~~~~~l~s~~~v~~yl~~~l~~~ 133 (148) ..+|++|+||||.+|..|- +++-+... .....+........+.+.+...+... T Consensus 84 l~~l~~I~GiGpk~a~~l~---~~Gi~sl~dL~~a~g~k~~~~~~~Gl~~~~~~~~ripr~e~~~~~~~i~~~l~~~ 157 (307) T cd00141 84 LLLLLRVPGVGPKTARKLY---ELGIRTLEDLRKAAGAKLEQNILIGLEYYEDFQQRIPREEALAIAEIIKEALREV 157 (307) T ss_pred HHHHHCCCCCCHHHHHHHH---HCCCCCHHHHHHHHCHHHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHHHHHHHC T ss_conf 9999647887889999999---8299979999975011038999999999998515677999999999999999838 No 82 >PRK02515 psbU photosystem II complex extrinsic protein precursor U; Provisional Probab=47.85 E-value=17 Score=17.53 Aligned_cols=35 Identities=6% Similarity=-0.002 Sum_probs=29.7 Q ss_pred HHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHH Q ss_conf 99973799789850997899737898899999975 Q 537021.9.peg.8 62 ALLKRFATLGGVFGAPLHLLQEINRIGKRVALELK 96 (148) Q Consensus 62 ~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~ 96 (148) .+-.+||.=-.|-++++.++++.+||.|.-|-.|. T Consensus 56 kl~t~~g~KIDlNNa~vr~f~q~pGmYPtlA~kIv 90 (144) T PRK02515 56 KLATERGEKIDLNNSSVRAFRQFPGMYPTLAGKIV 90 (144) T ss_pred HHHHHHCCCCCCCCHHHHHHHHCCCCCHHHHHHHH T ss_conf 88887156402562749999868884679999998 No 83 >cd00080 HhH2_motif Helix-hairpin-helix class 2 (Pol1 family) motif. HhH2 domains are found in Rad2 family of prokaryotic and eukaryotic replication and repair nucleases, i.e., DNA polymerase I, Taq DNA polymerase, DNA repair protein Rad2 endonuclease, flap endonuclease, exonuclease I and IX, 5'-3' exonuclease and also bacteriophage Rnase H. These nucleases degrade RNA-DNA or DNA-DNA duplexes, or both and play essential roles in DNA duplication, repair, and recombination. Probab=47.78 E-value=6.5 Score=20.27 Aligned_cols=26 Identities=23% Similarity=0.277 Sum_probs=10.8 Q ss_pred HHHCCCHHHHHHCCCCCHHHHHHHHH Q ss_conf 98509978997378988999999754 Q 537021.9.peg.8 72 GVFGAPLHLLQEINRIGKRVALELKL 97 (148) Q Consensus 72 ~ll~as~~eL~~i~GIG~aka~~l~a 97 (148) .|..-+-+-...|+|||+.+|..|.. T Consensus 15 ~L~GD~sDnipGV~GIG~ktA~~ll~ 40 (75) T cd00080 15 ILVGDKSDNIPGVPGIGPKTALKLLK 40 (75) T ss_pred HHHCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99196545887758637999999999 No 84 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=44.41 E-value=25 Score=16.53 Aligned_cols=16 Identities=31% Similarity=0.656 Sum_probs=8.9 Q ss_pred HHHHHHHCCCHHHHHC Q ss_conf 9999973799789850 Q 537021.9.peg.8 60 AKALLKRFATLGGVFG 75 (148) Q Consensus 60 A~~lL~~fGsL~~ll~ 75 (148) |..||.+||||-+|+. T Consensus 205 A~kLL~~fgsLe~iy~ 220 (1005) T TIGR00593 205 AAKLLQEFGSLENIYE 220 (1005) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 8999987210899998 No 85 >KOG1918 consensus Probab=43.91 E-value=13 Score=18.35 Aligned_cols=57 Identities=18% Similarity=0.101 Sum_probs=29.8 Q ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHH-CCCHHHHHCCCHHHHH----HCCCCCHHHHHHH Q ss_conf 8999999973379989889999999973-7997898509978997----3789889999997 Q 537021.9.peg.8 39 YYEILELILFRLIPRKDTKSIAKALLKR-FATLGGVFGAPLHLLQ----EINRIGKRVALEL 95 (148) Q Consensus 39 D~ELLallL~~g~~~~~~~~lA~~lL~~-fGsL~~ll~as~~eL~----~i~GIG~aka~~l 95 (148) |.|-|.-.=.++.+-.....+|..-.+. ..+..++.+.|.++|. +|+|||+..+-+. T Consensus 120 ~~~~lrkcG~S~rK~~yLh~lA~~~~ng~I~s~~~i~~mseEeL~~~LT~VKGIg~Wtv~Mf 181 (254) T KOG1918 120 DCEELRKCGFSKRKASYLHSLAEAYTNGYIPSKSGIEKMSEEELIERLTNVKGIGRWTVEMF 181 (254) T ss_pred CHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHCCHHHHHHHHHHCCCCCCEEEEEE T ss_conf 89999985733246799999999986477776677761579999999986047551124432 No 86 >COG0758 Smf Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake [DNA replication, recombination, and repair / Intracellular trafficking and secretion] Probab=42.08 E-value=28 Score=16.11 Aligned_cols=75 Identities=17% Similarity=-0.007 Sum_probs=49.8 Q ss_pred CCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 89899999997337998988999999997379978985099789973789889999997549999999999876406 Q 537021.9.peg.8 37 LAYYEILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRILKANWSIK 113 (148) Q Consensus 37 LsD~ELLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~~~~~~~~ 113 (148) ++|.|.++. |+-..-..-.-..-+.++.+||+.......+..++..+ |.....+..+.+..+...+....-.+.. T Consensus 1 m~~~e~~~~-l~L~~~~~l~~~~~~~ll~~~~~~~~~~~~~~~~l~~~-g~~~~~~~~~~~~~~~~~~~le~~~~~~ 75 (350) T COG0758 1 MSDRERLAW-LKLSSTNNLGPLNLLLLLKHFGNKESILKIPLPQLEIE-GLSSRRLLAELARIELLELELEKIEKPG 75 (350) T ss_pred CCHHHHHHH-HHHHHCCCCCHHHHHHHHHHCCCHHHHHHCCHHHHHHH-CCCHHHHHHHHCCHHHHHHHHHHHHHCC T ss_conf 963689999-99861799776999999982587766633337899986-3554777754025778999999999549 No 87 >COG0353 RecR Recombinational DNA repair protein (RecF pathway) [DNA replication, recombination, and repair] Probab=41.18 E-value=22 Score=16.85 Aligned_cols=18 Identities=22% Similarity=0.475 Sum_probs=10.4 Q ss_pred HHHHHHCCCCCHHHHHHH Q ss_conf 789973789889999997 Q 537021.9.peg.8 78 LHLLQEINRIGKRVALEL 95 (148) Q Consensus 78 ~~eL~~i~GIG~aka~~l 95 (148) .+.|.++||||+..|..+ T Consensus 11 I~~l~kLPGvG~KsA~R~ 28 (198) T COG0353 11 IDALKKLPGVGPKSAQRL 28 (198) T ss_pred HHHHHHCCCCCHHHHHHH T ss_conf 999976899883279999 No 88 >PRK00076 recR recombination protein RecR; Reviewed Probab=40.49 E-value=24 Score=16.55 Aligned_cols=52 Identities=15% Similarity=0.123 Sum_probs=23.8 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC----CHHHHHHHHHHHHCCCC Q ss_conf 899737898899999975499999999998764067788----98999999999845978 Q 537021.9.peg.8 79 HLLQEINRIGKRVALELKLVSVASQRILKANWSIKKFLT----PGQPCLISVERHYPMKN 134 (148) Q Consensus 79 ~eL~~i~GIG~aka~~l~a~~El~rR~~~~~~~~~~~l~----s~~~v~~yl~~~l~~~~ 134 (148) ..+.-+.|+||.. .-+.+|-.|+....+.+-..=. ..+....|+...+...+ T Consensus 108 G~lSpl~gi~pe~----l~i~~L~~Ri~~~~v~EVIlA~~~t~EGe~Ta~yi~~~lk~~~ 163 (197) T PRK00076 108 GLLSPLDGIGPED----LNIDELLERLADGEVKEVILATNPTVEGEATAHYIARLLKPLG 163 (197) T ss_pred CCCCCCCCCCHHH----CCHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCC T ss_conf 6248212898322----6889999986579951799864898650899999999865549 No 89 >PRK13844 recombination protein RecR; Provisional Probab=39.66 E-value=24 Score=16.59 Aligned_cols=62 Identities=13% Similarity=0.107 Sum_probs=28.5 Q ss_pred HCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC----CHHHHHHHHHHHHC Q ss_conf 3799789850997899737898899999975499999999998764067788----98999999999845 Q 537021.9.peg.8 66 RFATLGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRILKANWSIKKFLT----PGQPCLISVERHYP 131 (148) Q Consensus 66 ~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~~~~~~~~~~l~----s~~~v~~yl~~~l~ 131 (148) +-|...|+...=-..+.-+.||||.. |. +.+|-.|+......+-..=. ..+....|+...+. T Consensus 99 ~s~~y~G~YhVL~G~iSpl~gigp~~---l~-i~~L~~ri~~~~i~EVIlA~~~t~EGe~Ta~yi~~~l~ 164 (200) T PRK13844 99 EAGIYRGKYFVLNGRISPLDGIGPSE---LK-LDILQQIIADRKIDEVILAISPTVEGETTAHFISQMIA 164 (200) T ss_pred HHCCCCEEEEECCCCCCCCCCCCHHH---HC-HHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHC T ss_conf 74410668887487447323888223---06-79999886379974799967999650999999999844 No 90 >cd07644 I-BAR_IMD_BAIAP2L2 Inverse (I)-BAR, also known as the IRSp53/MIM homology Domain (IMD), of Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 2. The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a dimerization and lipid-binding module that bends membranes and induces membrane protrusions. This group is composed of uncharacterized proteins known as BAIAP2L2 (Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 2). They contain an N-terminal IMD, an SH3 domain, and a WASP homology 2 (WH2) actin-binding motif at the C-terminus. The related proteins, BAIAP2L1 and IRSp53, function as regulators of membrane dynamics and the actin cytoskeleton. The IMD domain binds and bundles actin filaments, binds membranes and produces membrane protrusions, and interacts with the small GTPase Rac. Probab=38.96 E-value=32 Score=15.81 Aligned_cols=54 Identities=15% Similarity=0.285 Sum_probs=45.4 Q ss_pred CCHHHHHHHHHHHC-CCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 98899999999737-9978985099789973789889999997549999999999 Q 537021.9.peg.8 54 KDTKSIAKALLKRF-ATLGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRILK 107 (148) Q Consensus 54 ~~~~~lA~~lL~~f-GsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~~ 107 (148) .++..+=+.+++.| ++|+++.++.-..+..+.|+..+.-.-.-|+.-+|..... T Consensus 5 r~t~~vYknimeqFNPgLRnLV~lGk~YeKAl~a~s~Aak~YfdAl~KiGe~A~~ 59 (215) T cd07644 5 RSTISIYKSIMEQFNPALENLVYLGNNYLRAFHALSEAAEVYFSAIAKIGEQALQ 59 (215) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9989999999986498899999983789999999999899999999999998624 No 91 >smart00279 HhH2 Helix-hairpin-helix class 2 (Pol1 family) motifs. Probab=37.60 E-value=12 Score=18.56 Aligned_cols=29 Identities=21% Similarity=0.244 Sum_probs=18.7 Q ss_pred CHHHHHCCCHHHHHHCCCCCHHHHHHHHH Q ss_conf 97898509978997378988999999754 Q 537021.9.peg.8 69 TLGGVFGAPLHLLQEINRIGKRVALELKL 97 (148) Q Consensus 69 sL~~ll~as~~eL~~i~GIG~aka~~l~a 97 (148) .+..|..-+.+-.-.|+|||+..|..|.. T Consensus 6 d~~~L~GD~sDnipGV~GIG~ktA~~ll~ 34 (36) T smart00279 6 DYAILVGDYSDNIPGVKGIGPKTALKLLR 34 (36) T ss_pred HHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99998187646889999747899999998 No 92 >PRK12278 50S ribosomal protein L21/unknown domain fusion protein; Provisional Probab=36.46 E-value=15 Score=17.88 Aligned_cols=22 Identities=23% Similarity=0.270 Sum_probs=17.4 Q ss_pred CCHHHHHHCCCCCHHHHHHHHH Q ss_conf 9978997378988999999754 Q 537021.9.peg.8 76 APLHLLQEINRIGKRVALELKL 97 (148) Q Consensus 76 as~~eL~~i~GIG~aka~~l~a 97 (148) .-.++|.+|+||||.-+..|.. T Consensus 150 g~aDDLk~I~GIGP~~e~~Ln~ 171 (216) T PRK12278 150 GEADDLTKITGVGPALAKKLNE 171 (216) T ss_pred CCCCCCCEECCCCHHHHHHHHH T ss_conf 8996543602658899999998 No 93 >pfam03020 LEM LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin, and Man1. Probab=36.10 E-value=34 Score=15.64 Aligned_cols=32 Identities=13% Similarity=-0.003 Sum_probs=26.2 Q ss_pred HHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 22189899999997337998988999999997 Q 537021.9.peg.8 34 ENALAYYEILELILFRLIPRKDTKSIAKALLK 65 (148) Q Consensus 34 ~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~ 65 (148) ...|||.||-+-+..+|.+--++.+--+.+.. T Consensus 3 v~~LsD~eL~~~L~~~G~~~GPI~~sTRklYe 34 (43) T pfam03020 3 VAQLSDAELRSELRKYGISPGPITGTTRKLYE 34 (43) T ss_pred HHHHCHHHHHHHHHHCCCCCCCCCCCHHHHHH T ss_conf 77718599999999839999996632089999 No 94 >cd07645 I-BAR_IMD_BAIAP2L1 Inverse (I)-BAR, also known as the IRSp53/MIM homology Domain (IMD), of Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 1. The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a dimerization and lipid-binding module that bends membranes and induces membrane protrusions. BAIAP2L1 (Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 1) is also known as IRTKS (Insulin Receptor Tyrosine Kinase Substrate). It is widely expressed, serves as a substrate for the insulin receptor, and binds the small GTPase Rac. It plays a role in regulating the actin cytoskeleton and colocalizes with F-actin, cortactin, VASP, and vinculin. BAIAP2L1 expression leads to the formation of short actin bundles, distinct from filopodia-like protrusions induced by the expression of the related protein IRSp53. It contains an N-terminal IMD, an SH3 domain, and a WASP homology 2 (WH2) actin-binding motif at the C-terminus. The IMD domain of Probab=36.02 E-value=31 Score=15.89 Aligned_cols=51 Identities=14% Similarity=0.081 Sum_probs=43.2 Q ss_pred HHHHHHHHHHHC-CCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 899999999737-997898509978997378988999999754999999999 Q 537021.9.peg.8 56 TKSIAKALLKRF-ATLGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRIL 106 (148) Q Consensus 56 ~~~lA~~lL~~f-GsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~ 106 (148) +..+=+.+++.| ++|++++++.-..+..+.|+..+.-.-.-|+.-+|.... T Consensus 7 te~~YknimeqFNPglRnLi~lGk~YeKAl~a~~~Aa~~YfdAl~Kige~A~ 58 (226) T cd07645 7 TESTYKNVMEQFNPGLRNLINLGKNYEKAVNAMVLAGKAYYDGVAKIGEIAA 58 (226) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8999998998549889999998378999999999988999999999999861 No 95 >KOG2534 consensus Probab=36.00 E-value=35 Score=15.52 Aligned_cols=45 Identities=13% Similarity=0.116 Sum_probs=31.2 Q ss_pred CCHHHHHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 9889999999973799789850997899737898899999975499999 Q 537021.9.peg.8 54 KDTKSIAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVSVAS 102 (148) Q Consensus 54 ~~~~~lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~ 102 (148) ..+.-.|...|+.|..- --|.+|++++||||+..|-.|.-+.|-| T Consensus 35 ~~~y~~Aasvlk~~p~~----I~S~~ea~~lP~iG~kia~ki~EiletG 79 (353) T KOG2534 35 ARAYRRAASVLKSLPFP----ITSGEEAEKLPGIGPKIAEKIQEILETG 79 (353) T ss_pred HHHHHHHHHHHHHCCCC----CCCHHHHCCCCCCCHHHHHHHHHHHHCC T ss_conf 99999999999849997----6657885579997777999999999708 No 96 >TIGR00920 2A060605 3-hydroxy-3-methylglutaryl-coenzyme A reductase; InterPro: IPR004816 Synonym(s): 3-hydroxy-3-methylglutaryl-coenzyme A reductase, HMG-CoA reductase. There are two distinct classes of hydroxymethylglutaryl-coenzyme A (HMG-CoA) reductase enzymes: class I consists of eukaryotic and most archaeal enzymes (1.1.1.34 from EC), while class II consists of prokaryotic enzymes (1.1.1.88 from EC) , . Class I HMG-CoA reductases catalyse the NADP-dependent synthesis of mevalonate from 3-hydroxy-3-methylglutaryl-CoA (HMG-CoA). In vertebrates, membrane-bound HMG-CoA reductase is the rate-limiting enzyme in the biosynthesis of cholesterol and other isoprenoids. In plants, mevalonate is the precursor of all isoprenoid compounds . The reduction of HMG-CoA to mevalonate is regulated by feedback inhibition by sterols and non-sterol metabolites derived from mevalonate, including cholesterol. In archaea, HMG-CoA reductase is a cytoplasmic enzyme involved in the biosynthesis of the isoprenoids side chains of lipids . Class I HMG-CoA reductases consist of an N-terminal membrane domain (lacking in archaeal enzymes), and a C-terminal catalytic region. The catalytic region can be subdivided into three domains: an N-domain (N-terminal), a large L-domain, and a small S-domain (inserted within the L-domain). The L-domain binds the substrate, while the S-domain binds NADP. Class II HMG-CoA reductases catalyse the reverse reaction of class I enzymes, namely the NAD-dependent synthesis of HMG-CoA from mevalonate and CoA . Some bacteria, such as Pseudomonas mevalonii, can use mevalonate as the sole carbon source. Class II enzymes lack a membrane domain. Their catalytic region is structurally related to that of class I enzymes, but it consists of only two domains: a large L-domain and a small S-domain (inserted within the L-domain). As with class I enzymes, the L-domain binds substrate, but the S-domain binds NAD (instead of NADP in class I). This entry represents Metazoan class I HMG-CoA reductases, which are membrane-bound glycoproteins that remains in the endoplasmic reticulum after synthesis and glycosylation .; GO: 0004420 hydroxymethylglutaryl-CoA reductase (NADPH) activity, 0050661 NADP binding, 0008299 isoprenoid biosynthetic process, 0005789 endoplasmic reticulum membrane. Probab=35.61 E-value=20 Score=17.09 Aligned_cols=36 Identities=25% Similarity=0.294 Sum_probs=23.5 Q ss_pred HHHHHHHHH------CC--HHHCCHHHHHHHHHHCCCCCCCHHHH Q ss_conf 019999982------39--22189899999997337998988999 Q 537021.9.peg.8 23 NRLRDRFLQ------KG--ENALAYYEILELILFRLIPRKDTKSI 59 (148) Q Consensus 23 ~R~RErll~------~G--~~~LsD~ELLallL~~g~~~~~~~~l 59 (148) .||-+=+|+ .| |.+|||.|+..|+=-.| .+..+..| T Consensus 509 PRP~~EcL~iL~s~~~GsGa~aLsD~Ev~~LV~AkG-tHiP~YkL 552 (988) T TIGR00920 509 PRPVDECLEILNSTEKGSGAQALSDAEVISLVNAKG-THIPAYKL 552 (988) T ss_pred CCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCCC-CCCHHHHH T ss_conf 887489999844788886603366578999872787-31206677 No 97 >pfam00416 Ribosomal_S13 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes. Probab=35.12 E-value=15 Score=17.87 Aligned_cols=26 Identities=23% Similarity=0.202 Sum_probs=19.4 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHH Q ss_conf 89973789889999997549999999 Q 537021.9.peg.8 79 HLLQEINRIGKRVALELKLVSVASQR 104 (148) Q Consensus 79 ~eL~~i~GIG~aka~~l~a~~El~rR 104 (148) -.|..|.|||+..|.+|.....+... T Consensus 15 ~ALt~I~GIG~~~A~~Ic~~lgi~~~ 40 (106) T pfam00416 15 IALTYIKGIGRRKANQILKKAGVDKD 40 (106) T ss_pred EEECCCCCCCHHHHHHHHHHCCCCCC T ss_conf 44112105289999999999195977 No 98 >cd00128 XPG Xeroderma pigmentosum G N- and I-regions (XPGN, XPGI); contains the HhH2 motif; domain in nucleases. XPG is a eukaryotic enzyme that functions in nucleotide-excision repair and transcription-coupled repair of oxidative DNA damage. Functionally/structurally related to FEN-1; divalent metal ion-dependent exo- and endonuclease, and bacterial and bacteriophage 5'3' exonucleases. Probab=34.85 E-value=37 Score=15.39 Aligned_cols=55 Identities=15% Similarity=-0.004 Sum_probs=28.8 Q ss_pred HHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH---HHHHHCCCCCCHHHHHHHHH Q ss_conf 8985099789973789889999997549999999999---87640677889899999999 Q 537021.9.peg.8 71 GGVFGAPLHLLQEINRIGKRVALELKLVSVASQRILK---ANWSIKKFLTPGQPCLISVE 127 (148) Q Consensus 71 ~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~~---~~~~~~~~l~s~~~v~~yl~ 127 (148) .-|+..|+. ..|+||||.+|..|.--..-...+.. ......+...++..+.+++. T Consensus 217 ~iL~G~Dy~--~gi~giG~k~A~kli~~~~~~e~~~~~l~~~~~~~~~~~~~~~~~~~f~ 274 (316) T cd00128 217 AILLGCDYT--EGIPGIGPVTALKLIKKYGDIEKDIERLKKKLYRSPEDFPLKEAREFFL 274 (316) T ss_pred HHHHCCCCC--CCCCCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHC T ss_conf 997298878--9999735999999999919999999999834876999999799999857 No 99 >smart00341 HRDC Helicase and RNase D C-terminal. Hypothetical role in nucleic acid binding. Mutations in the HRDC domain cause human disease. Probab=34.08 E-value=38 Score=15.31 Aligned_cols=26 Identities=15% Similarity=0.266 Sum_probs=20.2 Q ss_pred HHHHHCCCHHHHHHCCCCCHHHHHHH Q ss_conf 78985099789973789889999997 Q 537021.9.peg.8 70 LGGVFGAPLHLLQEINRIGKRVALEL 95 (148) Q Consensus 70 L~~ll~as~~eL~~i~GIG~aka~~l 95 (148) +....=.+.++|..|+|+|+.++..- T Consensus 38 iA~~~P~~~~~L~~i~g~~~~~~~~~ 63 (81) T smart00341 38 MAAALPTNVSELLAIDGVGEEKARRY 63 (81) T ss_pred HHHHCCCCHHHHHCCCCCCHHHHHHH T ss_conf 99988799999846899999999999 No 100 >COG0258 Exo 5'-3' exonuclease (including N-terminal domain of PolI) [DNA replication, recombination, and repair] Probab=34.05 E-value=23 Score=16.76 Aligned_cols=31 Identities=23% Similarity=0.276 Sum_probs=15.6 Q ss_pred HHHHHCCCHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 7898509978997378988999999754999 Q 537021.9.peg.8 70 LGGVFGAPLHLLQEINRIGKRVALELKLVSV 100 (148) Q Consensus 70 L~~ll~as~~eL~~i~GIG~aka~~l~a~~E 100 (148) +..|+.-+.+-.-.|+|||+.+|..|.--.. T Consensus 189 ~~~L~Gd~sDnipGV~GIG~ktA~~Ll~~~g 219 (310) T COG0258 189 LKALVGDSSDNIPGVKGIGPKTALKLLQEYG 219 (310) T ss_pred HHHHHCCCCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 9998296535789999838999999999838 No 101 >PRK05179 rpsM 30S ribosomal protein S13; Validated Probab=33.67 E-value=29 Score=16.01 Aligned_cols=24 Identities=25% Similarity=0.163 Sum_probs=17.4 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 899737898899999975499999 Q 537021.9.peg.8 79 HLLQEINRIGKRVALELKLVSVAS 102 (148) Q Consensus 79 ~eL~~i~GIG~aka~~l~a~~El~ 102 (148) -.|..|.|||+..|.+|....-+. T Consensus 17 ~ALt~I~GIG~~~A~~Ic~~lgId 40 (122) T PRK05179 17 IALTYIYGIGRTRAKEILAAAGID 40 (122) T ss_pred HHHHHHCCCCHHHHHHHHHHCCCC T ss_conf 477300275899999999982999 No 102 >TIGR03631 bact_S13 30S ribosomal protein S13. This model describes bacterial ribosomal protein S13, to the exclusion of the homologous archaeal S13P and eukaryotic ribosomal protein S18. This model identifies some (but not all) instances of chloroplast and mitochondrial S13, which is of bacterial type. Probab=33.31 E-value=20 Score=17.16 Aligned_cols=24 Identities=21% Similarity=0.082 Sum_probs=18.7 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 899737898899999975499999 Q 537021.9.peg.8 79 HLLQEINRIGKRVALELKLVSVAS 102 (148) Q Consensus 79 ~eL~~i~GIG~aka~~l~a~~El~ 102 (148) -.|..|.|||+..|.+|.....+. T Consensus 15 ~ALt~I~GIG~~~A~~Ic~~lgId 38 (113) T TIGR03631 15 IALTYIYGIGRTRARKILEKAGID 38 (113) T ss_pred HHHHCEECCCHHHHHHHHHHCCCC T ss_conf 065200275899999999992999 No 103 >CHL00137 rps13 ribosomal protein S13; Validated Probab=30.66 E-value=21 Score=16.93 Aligned_cols=24 Identities=21% Similarity=0.073 Sum_probs=18.6 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHHH Q ss_conf 997378988999999754999999 Q 537021.9.peg.8 80 LLQEINRIGKRVALELKLVSVASQ 103 (148) Q Consensus 80 eL~~i~GIG~aka~~l~a~~El~r 103 (148) .|..|.|||+..|.+|.....+.. T Consensus 18 aLt~I~GIG~~~A~~Ic~~lgi~~ 41 (122) T CHL00137 18 ALTYIYGIGLTSAKKILEKANIDP 41 (122) T ss_pred HHHHHCCCCHHHHHHHHHHCCCCC T ss_conf 111000618999999999849898 No 104 >pfam01367 5_3_exonuc 5'-3' exonuclease, C-terminal SAM fold. Probab=30.14 E-value=44 Score=14.90 Aligned_cols=40 Identities=18% Similarity=0.243 Sum_probs=25.2 Q ss_pred HHHHHHHHCCCHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99999973799789850997899737898899999975499999999 Q 537021.9.peg.8 59 IAKALLKRFATLGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRI 105 (148) Q Consensus 59 lA~~lL~~fGsL~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~ 105 (148) .|..||++|||+.+++.. .+ ++++ .....|....|.+++. T Consensus 30 tA~~Ll~~~gsle~i~~n-~d---~i~~---k~~~~l~~~~~~~~ls 69 (100) T pfam01367 30 TAAKLLKEYGSLENIYEN-LD---KLKG---KLREKLLNGKEDAFLS 69 (100) T ss_pred HHHHHHHHCCCHHHHHHH-HH---HCCH---HHHHHHHHHHHHHHHH T ss_conf 899999981989999981-98---7179---9999999789999999 No 105 >cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity. Probab=29.40 E-value=22 Score=16.78 Aligned_cols=30 Identities=17% Similarity=0.232 Sum_probs=17.3 Q ss_pred HHHHHCCCHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 789850997899737898899999975499 Q 537021.9.peg.8 70 LGGVFGAPLHLLQEINRIGKRVALELKLVS 99 (148) Q Consensus 70 L~~ll~as~~eL~~i~GIG~aka~~l~a~~ 99 (148) +..|..-+.+-.-.|+|||+..|..|..-. T Consensus 174 ~~aL~GD~sDnIpGV~GiG~KtA~kLl~~~ 203 (240) T cd00008 174 YKALMGDSSDNIPGVPGIGEKTAAKLLKEY 203 (240) T ss_pred HHHHHCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 699728764688999985789999999980 No 106 >PRK04053 rps13p 30S ribosomal protein S13P; Reviewed Probab=29.38 E-value=24 Score=16.61 Aligned_cols=25 Identities=24% Similarity=0.189 Sum_probs=19.7 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHH Q ss_conf 8997378988999999754999999 Q 537021.9.peg.8 79 HLLQEINRIGKRVALELKLVSVASQ 103 (148) Q Consensus 79 ~eL~~i~GIG~aka~~l~a~~El~r 103 (148) -.|..|.|||+..|.+|.....+.. T Consensus 25 ~ALt~IyGIG~~~A~~Ic~~lgId~ 49 (149) T PRK04053 25 YALTGIKGIGRRTARAIARKLGLDP 49 (149) T ss_pred EECCCCCCCCHHHHHHHHHHHCCCC T ss_conf 4411114848999999999918998 No 107 >COG3743 Uncharacterized conserved protein [Function unknown] Probab=28.61 E-value=25 Score=16.49 Aligned_cols=18 Identities=28% Similarity=0.372 Sum_probs=14.4 Q ss_pred HHHHHCCCCCHHHHHHHH Q ss_conf 899737898899999975 Q 537021.9.peg.8 79 HLLQEINRIGKRVALELK 96 (148) Q Consensus 79 ~eL~~i~GIG~aka~~l~ 96 (148) ++|..|.||||+-...|. T Consensus 67 DDLt~I~GIGPk~e~~Ln 84 (133) T COG3743 67 DDLTRISGIGPKLEKVLN 84 (133) T ss_pred CCCHHHCCCCHHHHHHHH T ss_conf 542110043788998998 No 108 >pfam00570 HRDC HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease. It is interesting to note that the RecQ helicase in Deinococcus radiodurans has three tandem HRDC domains. Probab=28.43 E-value=47 Score=14.71 Aligned_cols=26 Identities=19% Similarity=0.203 Sum_probs=19.5 Q ss_pred HHHHHCCCHHHHHHCCCCCHHHHHHH Q ss_conf 78985099789973789889999997 Q 537021.9.peg.8 70 LGGVFGAPLHLLQEINRIGKRVALEL 95 (148) Q Consensus 70 L~~ll~as~~eL~~i~GIG~aka~~l 95 (148) +....=.+.++|.+|+|+|+.++... T Consensus 35 ia~~~P~s~~~L~~i~g~~~~~~~~~ 60 (68) T pfam00570 35 IAEKLPRTLEELLRIPGVGPRKLERY 60 (68) T ss_pred HHHHCCCCHHHHHCCCCCCHHHHHHH T ss_conf 99878499999808999999999999 No 109 >pfam10678 DUF2492 Protein of unknown function (DUF2492). This is a bacterial family of uncharacterized proteins. Probab=27.64 E-value=48 Score=14.62 Aligned_cols=40 Identities=18% Similarity=0.248 Sum_probs=31.4 Q ss_pred HHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHC Q ss_conf 21898999999973379989889999999973799789850 Q 537021.9.peg.8 35 NALAYYEILELILFRLIPRKDTKSIAKALLKRFATLGGVFG 75 (148) Q Consensus 35 ~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~ 75 (148) +++.-+|+|.++|..|.|- +--+|...+.++||.=...-. T Consensus 2 ~sIHgH~VL~mml~~~~~~-t~~~L~~~i~~~FG~~arFhT 41 (78) T pfam10678 2 QSIHGHEVLNLMLASGEPY-TREELKEAIAQEFGEEARFHT 41 (78) T ss_pred CCHHHHHHHHHHHHCCCCC-CHHHHHHHHHHHHCCCCEEEE T ss_conf 6205999999999858975-999999999998586656766 No 110 >cd02434 Nodulin-21_like_3 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_3: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter. Probab=27.17 E-value=49 Score=14.57 Aligned_cols=47 Identities=13% Similarity=0.169 Sum_probs=31.3 Q ss_pred CCHHHHHHHHHHHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 01650199999823922189899999997337998988999999997 Q 537021.9.peg.8 19 IGHHNRLRDRFLQKGENALAYYEILELILFRLIPRKDTKSIAKALLK 65 (148) Q Consensus 19 ~ghr~R~RErll~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~ 65 (148) .=|++|.||+-.-.---.-.-.|+.++.-..|.+..++.++++.+.+ T Consensus 63 ~~~~~~~~E~~ei~~~pe~E~~El~~iy~~~G~~~~~A~~v~~~l~~ 109 (225) T cd02434 63 FLHSEKKREEWEIENYPEGEKSEMVEIYSLKGLSEEVADQVVELLSK 109 (225) T ss_pred HHHHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 99876889999988594989999999998779998999999999983 No 111 >PRK08937 adenylosuccinate lyase; Provisional Probab=26.83 E-value=50 Score=14.53 Aligned_cols=11 Identities=9% Similarity=0.184 Sum_probs=3.9 Q ss_pred HHHHHHHHHHC Q ss_conf 50199999823 Q 537021.9.peg.8 22 HNRLRDRFLQK 32 (148) Q Consensus 22 r~R~RErll~~ 32 (148) .+|+++-+-.. T Consensus 334 ~e~m~~nl~~~ 344 (425) T PRK08937 334 PENMERNIRQE 344 (425) T ss_pred HHHHHHHHHHH T ss_conf 99999999984 No 112 >TIGR01258 pgm_1 phosphoglycerate mutase 1 family; InterPro: IPR005952 Most members of this family are phosphoglycerate mutase (5.4.2.1 from EC). This enzyme interconverts 2-phosphoglycerate and 3-phosphoglycerate.2-phospho-D-glycerate + 2,3-diphosphoglycerate = 3-phospho-D-glycerate + 2,3-diphosphoglycerate. The enzyme is transiently phosphorylated on an active site histidine by 2,3-diphosphoglyerate, which is both substrate and product. Some members of this family have are phosphoglycerate mutase as a minor activity and act primarily as a bisphoglycerate mutase, interconverting 2,3-diphosphoglycerate and 1,3-diphosphoglycerate (5.4.2.4 from EC). ; GO: 0016868 intramolecular transferase activity phosphotransferases, 0006096 glycolysis. Probab=26.72 E-value=27 Score=16.25 Aligned_cols=19 Identities=37% Similarity=0.342 Sum_probs=10.0 Q ss_pred HHHCCHHHHHHHHHHCCCC Q ss_conf 2218989999999733799 Q 537021.9.peg.8 34 ENALAYYEILELILFRLIP 52 (148) Q Consensus 34 ~~~LsD~ELLallL~~g~~ 52 (148) .+.+||.|++++=|-||.| T Consensus 194 L~~~Sd~~Il~LNIPTGiP 212 (248) T TIGR01258 194 LEGISDEEILELNIPTGIP 212 (248) T ss_pred HHCCCHHHHHHCCCCCCCC T ss_conf 6127946885045987553 No 113 >pfam02541 Ppx-GppA Ppx/GppA phosphatase family. This family consists of the N-terminal region of exopolyphosphatase (Ppx) EC:3.6.1.11 and guanosine pentaphosphate phospho-hydrolase (GppA) EC:3.6.1.40. Probab=25.77 E-value=52 Score=14.40 Aligned_cols=60 Identities=7% Similarity=-0.108 Sum_probs=45.4 Q ss_pred HHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 789850997899737898899999975499999999998764067788989999999998 Q 537021.9.peg.8 70 LGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRILKANWSIKKFLTPGQPCLISVERH 129 (148) Q Consensus 70 L~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~~~~~~~~~~l~s~~~v~~yl~~~ 129 (148) +..+...+.++..+++|+.+..+-.|.+..-+...++..--.+...++.+.=---++... T Consensus 222 ~~~l~~~~~~~r~~~~gl~~~Rad~I~~g~~Il~~il~~~~~~~i~vs~~glReG~l~~~ 281 (285) T pfam02541 222 IEKLSQITREDRLELAGVSDERADVIVAGALILSAVFEALSIEAMIISDGALREGVLYSL 281 (285) T ss_pred HHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHH T ss_conf 999863999999762599977999999999999999997699989999854899999999 No 114 >TIGR03629 arch_S13P archaeal ribosomal protein S13P. This model describes exclusively the archaeal ribosomal protein S13P. It excludes the homologous eukaryotic 40S ribosomal protein S18 and bacterial 30S ribosomal protein S13. Probab=25.50 E-value=36 Score=15.44 Aligned_cols=23 Identities=26% Similarity=0.260 Sum_probs=17.5 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 99737898899999975499999 Q 537021.9.peg.8 80 LLQEINRIGKRVALELKLVSVAS 102 (148) Q Consensus 80 eL~~i~GIG~aka~~l~a~~El~ 102 (148) .|..|.|||+..|.+|.....+. T Consensus 22 aLt~I~GIG~~~A~~Ic~~~gid 44 (144) T TIGR03629 22 ALTGIKGIGRRFARAIARKLGVD 44 (144) T ss_pred EEEEECCCCHHHHHHHHHHHCCC T ss_conf 72212372899999999990999 No 115 >cd00715 GPATase_N Glutamine amidotransferases class-II (GN-AT)_GPAT- type. This domain is found at the N-terminus of glutamine phosphoribosylpyrophosphate (Prpp) amidotransferase (GPATase) . The glutaminase domain catalyzes amide nitrogen transfer from glutamine to the appropriate substrate. In this process, glutamine is hydrolyzed to glutamic acid and ammonia. GPATase catalyzes the first step in purine biosynthesis, an amide transfer from glutamine to PRPP, resulting in phosphoribosylamine, pyrophosphate and glutamate. GPATase crystalizes as a homotetramer, but can also exist as a homdimer. Probab=25.05 E-value=54 Score=14.32 Aligned_cols=62 Identities=15% Similarity=0.148 Sum_probs=43.0 Q ss_pred HHHHHHHHHHCCH--HHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHCCCHHHHHH Q ss_conf 5019999982392--2189899999997337998988999999997379978985099789973 Q 537021.9.peg.8 22 HNRLRDRFLQKGE--NALAYYEILELILFRLIPRKDTKSIAKALLKRFATLGGVFGAPLHLLQE 83 (148) Q Consensus 22 r~R~RErll~~G~--~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fGsL~~ll~as~~eL~~ 83 (148) -+.+|+.+.+.|. .+=||.|+++-++.......+..+--++.++++.+-..+.-.+.+.+-. T Consensus 107 ~~eL~~~l~~~g~~f~s~sDTEvI~~li~~~~~~~~~~~~i~~~~~~l~Gaysl~~l~~~~l~~ 170 (252) T cd00715 107 AKELREELEEEGRIFQTTSDSEVILHLIARSLAKDDLFEAIIDALERVKGAYSLVIMTADGLIA 170 (252) T ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCEEEEEEECCEEEE T ss_conf 9999999985699876588899999999865410529999999999612125899999898999 No 116 >pfam07088 GvpD GvpD gas vesicle protein. This family consists of several archaeal GvpD gas vesicle proteins. GvpD is thought to be involved in the regulation of gas vesicle formation. Probab=24.31 E-value=40 Score=15.13 Aligned_cols=48 Identities=8% Similarity=-0.104 Sum_probs=34.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 99997549999999999876406778898999999999845978000252 Q 537021.9.peg.8 91 VALELKLVSVASQRILKANWSIKKFLTPGQPCLISVERHYPMKNANNFAY 140 (148) Q Consensus 91 ka~~l~a~~El~rR~~~~~~~~~~~l~s~~~v~~yl~~~l~~~~~E~F~v 140 (148) .-..+-+..|...++.....+.-..++||+.+++|+..+.. ..|.|.+ T Consensus 87 ~~l~l~sl~e~vd~i~a~~er~~Ia~DSW~~i~eyLa~r~d--~pe~f~~ 134 (484) T pfam07088 87 EKLNLDSLLEWVDAINAAGERLTIAFDSWELIYEYLAERHD--IPPDILT 134 (484) T ss_pred CCCCHHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHHHHC--CCHHHHH T ss_conf 42355579999987641245747998328999998777616--8466756 No 117 >CHL00194 ycf39 Ycf39; Provisional Probab=23.77 E-value=33 Score=15.66 Aligned_cols=23 Identities=17% Similarity=0.076 Sum_probs=9.8 Q ss_pred CHHHCCHHHHHHHHHHCCCCCCC Q ss_conf 92218989999999733799898 Q 537021.9.peg.8 33 GENALAYYEILELILFRLIPRKD 55 (148) Q Consensus 33 G~~~LsD~ELLallL~~g~~~~~ 55 (148) ||+++|-.|+.+++-+...+... T Consensus 204 GP~a~T~~EIa~l~~~~~Gk~~~ 226 (319) T CHL00194 204 GPKSWNSSEIISLCEQLSGQKAK 226 (319) T ss_pred CCCCCCHHHHHHHHHHHHCCCCC T ss_conf 98638999999999998599987 No 118 >PRK12495 hypothetical protein; Provisional Probab=23.62 E-value=53 Score=14.36 Aligned_cols=32 Identities=19% Similarity=0.286 Sum_probs=19.0 Q ss_pred CCHHHHHHHHHHHCCHHHCCHHHHHHHHHHCC Q ss_conf 01650199999823922189899999997337 Q 537021.9.peg.8 19 IGHHNRLRDRFLQKGENALAYYEILELILFRL 50 (148) Q Consensus 19 ~ghr~R~RErll~~G~~~LsD~ELLallL~~g 50 (148) -..|+|||||+-+---.-=.-.-+=+|||... T Consensus 7 EAEREkLREKye~d~~~ReaTqrMSeLLLqGA 38 (221) T PRK12495 7 EAEREKLREKYEQDQQDREETERMSELLLQGA 38 (221) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 89999999998764888999999999997332 No 119 >PRK03968 DNA primase large subunit; Validated Probab=23.46 E-value=50 Score=14.53 Aligned_cols=21 Identities=29% Similarity=0.396 Sum_probs=13.1 Q ss_pred HHHHHHHHCCCHHHHHCCCHH Q ss_conf 999999737997898509978 Q 537021.9.peg.8 59 IAKALLKRFATLGGVFGAPLH 79 (148) Q Consensus 59 lA~~lL~~fGsL~~ll~as~~ 79 (148) =|++|+++|||+..+++..+. T Consensus 13 ~AkeivkefGsin~~l~~ip~ 33 (388) T PRK03968 13 EAKELVKEFGSINELLDAIPS 33 (388) T ss_pred HHHHHHHHHCCHHHHHHCCCC T ss_conf 999999984359888750721 No 120 >PTZ00217 flap endonuclease-1; Provisional Probab=22.92 E-value=59 Score=14.06 Aligned_cols=35 Identities=29% Similarity=0.281 Sum_probs=19.2 Q ss_pred HHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 7898509978997378988999999754999999999 Q 537021.9.peg.8 70 LGGVFGAPLHLLQEINRIGKRVALELKLVSVASQRIL 106 (148) Q Consensus 70 L~~ll~as~~eL~~i~GIG~aka~~l~a~~El~rR~~ 106 (148) +.=|+.-|+. ..|+||||.+|..|.--.--..++. T Consensus 228 lcIL~GcDY~--~~I~GIGpk~A~klIk~~~sie~il 262 (394) T PTZ00217 228 LCILCGCDYC--DTIEGIGPKTAYELIKKYGSIEEIL 262 (394) T ss_pred HHHHHCCCCC--CCCCCCCHHHHHHHHHHHCCHHHHH T ss_conf 9998186446--8998748899999999959999999 No 121 >PRK09425 prpD 2-methylcitrate dehydratase; Provisional Probab=22.58 E-value=60 Score=14.02 Aligned_cols=16 Identities=19% Similarity=0.453 Sum_probs=8.4 Q ss_pred HCCHHHCCHHHHHHHH Q ss_conf 2392218989999999 Q 537021.9.peg.8 31 QKGENALAYYEILELI 46 (148) Q Consensus 31 ~~G~~~LsD~ELLall 46 (148) ..|...+|-.+||.-+ T Consensus 133 ~~g~~~lsg~dlL~a~ 148 (480) T PRK09425 133 AAGKEPLTMKDVLTAM 148 (480) T ss_pred HCCCCCCCHHHHHHHH T ss_conf 0367886399999999 No 122 >cd02433 Nodulin-21_like_2 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_2: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter. Probab=22.34 E-value=61 Score=13.99 Aligned_cols=50 Identities=10% Similarity=0.036 Sum_probs=35.7 Q ss_pred CCCCCCHHHHHHHHHHHCCHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 8630016501999998239221898999999973379989889999999973 Q 537021.9.peg.8 15 HLDYIGHHNRLRDRFLQKGENALAYYEILELILFRLIPRKDTKSIAKALLKR 66 (148) Q Consensus 15 ~~~~~ghr~R~RErll~~G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~ 66 (148) +..+..+++|.+..+.+. |+. ...||.++.-..|.+..++..++..+.+. T Consensus 78 ~d~~~~e~~rE~~ei~~~-p~~-E~~El~~iy~~kG~~~~~A~~va~~l~~~ 127 (234) T cd02433 78 RELLEASIPDERRELRKH-PLE-EAAELALIYRAKGLDEEEAKRVASQLMND 127 (234) T ss_pred HHHHHHHHHHHHHHHHHC-HHH-HHHHHHHHHHHCCCCHHHHHHHHHHHHHC T ss_conf 889998899999998509-588-99999999987699989999999999859 No 123 >cd05839 BR140_related The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2. BR140 is the only family to contain the PWWP domain at the C terminus, with PHD and bromo domains in the N-terminal region. In myeloid leukemias, BR140 is disrupted by chromosomal translocations, similar to translocations of WHSC1 in lymphoid multiple myeloma. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes. Probab=21.97 E-value=43 Score=14.96 Aligned_cols=13 Identities=15% Similarity=0.120 Sum_probs=9.0 Q ss_pred CCEEEEEEEEECC Q ss_conf 7800025212102 Q 537021.9.peg.8 133 KNANNFAYCFSIN 145 (148) Q Consensus 133 ~~~E~F~vLF~~N 145 (148) .+...|.|+|||| T Consensus 51 ~~e~~~LVlFFD~ 63 (111) T cd05839 51 ADERLYLVLFFDN 63 (111) T ss_pred CCCEEEEEEEECC T ss_conf 6871899999648 No 124 >PTZ00205 DNA polymerase kappa; Provisional Probab=21.58 E-value=43 Score=14.93 Aligned_cols=17 Identities=29% Similarity=0.327 Sum_probs=14.0 Q ss_pred HHHCCCCCHHHHHHHHH Q ss_conf 97378988999999754 Q 537021.9.peg.8 81 LQEINRIGKRVALELKL 97 (148) Q Consensus 81 L~~i~GIG~aka~~l~a 97 (148) +.+|+|||++....|.+ T Consensus 311 vRKIpGIGkVte~~L~a 327 (571) T PTZ00205 311 LRSVPGVGKVTEALLKG 327 (571) T ss_pred CCCCCCCCHHHHHHHHH T ss_conf 76689756888999987 No 125 >PRK08434 consensus Probab=21.40 E-value=33 Score=15.66 Aligned_cols=47 Identities=23% Similarity=0.219 Sum_probs=31.0 Q ss_pred CCHHHHHHHHHH-H-CCH---HHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 016501999998-2-392---21898999999973379989889999999973 Q 537021.9.peg.8 19 IGHHNRLRDRFL-Q-KGE---NALAYYEILELILFRLIPRKDTKSIAKALLKR 66 (148) Q Consensus 19 ~ghr~R~RErll-~-~G~---~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~ 66 (148) .||.-.--=+++ + +|. ...-|..|.+.+|..+. +.+-..+|.+.+.. T Consensus 383 IghNlKyDl~vL~~~~gi~l~g~~fDTmLAaYLLdp~~-r~~Ld~La~~yL~~ 434 (887) T PRK08434 383 IGHNLKYDFKIIQNNFGLELPQKYADTMILAWLKDPSS-RVGLDDLAKRLFNY 434 (887) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCC-CCCHHHHHHHHCCC T ss_conf 36477999999998569877773148999998669866-66667899997087 No 126 >PRK05797 consensus Probab=20.62 E-value=34 Score=15.58 Aligned_cols=117 Identities=13% Similarity=0.008 Sum_probs=58.8 Q ss_pred CCCCHHHHH-HHHHHHCCHH--H-CCHHHHHHHHHHCCCCCCCHHHHHHHHHHHC-C----------------------- Q ss_conf 300165019-9999823922--1-8989999999733799898899999999737-9----------------------- Q 537021.9.peg.8 17 DYIGHHNRL-RDRFLQKGEN--A-LAYYEILELILFRLIPRKDTKSIAKALLKRF-A----------------------- 68 (148) Q Consensus 17 ~~~ghr~R~-RErll~~G~~--~-LsD~ELLallL~~g~~~~~~~~lA~~lL~~f-G----------------------- 68 (148) .-+||.-.- ..-|.++|.. + .-|..|.+.+|..+..+.+-..+|.+.|..- . T Consensus 378 ~KIGhNlK~dl~vL~~~GI~l~gi~FDTMLAsYLLnP~~~~h~Ld~La~~yL~~~~~~~e~~ya~~dA~~~~~L~~~L~~ 457 (869) T PRK05797 378 KKIGHDIKNFLTILKKLGIEFKGLAFDTAIAAYLIDPSKSEYDLSDLAKEYLSKDVNSEEEEYKIKEVSVMKELYEKLKE 457 (869) T ss_pred CEEEECCHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78984306999999975985566166899999972899888899999999808777514789999999999999999999 Q ss_pred -----CHHHHHC---CCH---HHHHHCCCCCHHHHHHHHHHHHHHHHHHHH------HHHHCCCCCCHHHHHHHHHHHHC Q ss_conf -----9789850---997---899737898899999975499999999998------76406778898999999999845 Q 537021.9.peg.8 69 -----TLGGVFG---APL---HLLQEINRIGKRVALELKLVSVASQRILKA------NWSIKKFLTPGQPCLISVERHYP 131 (148) Q Consensus 69 -----sL~~ll~---as~---~eL~~i~GIG~aka~~l~a~~El~rR~~~~------~~~~~~~l~s~~~v~~yl~~~l~ 131 (148) ++..++. .+. =.=+...||.--+-..=....++..++..- ...+.--++||.++.+.|-..|+ T Consensus 458 ~L~~~~l~~L~~~IE~PLi~VLa~ME~~GI~VD~~~L~~l~~~l~~~l~~le~~I~~~aG~eFNinSPKQL~~iLFe~Lg 537 (869) T PRK05797 458 KIEEMDMEKLLYEVELPLVEVLASMESEGFKVDKDILDELSKKFKEEIEKTQKEIYELAGEEFNINSPKQLGKILFEKLD 537 (869) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCC T ss_conf 99861289999999867899999998468099999999999999999999999999862886678998999999898649 Q ss_pred CC Q ss_conf 97 Q 537021.9.peg.8 132 MK 133 (148) Q Consensus 132 ~~ 133 (148) .. T Consensus 538 Lp 539 (869) T PRK05797 538 LP 539 (869) T ss_pred CC T ss_conf 99 No 127 >PRK07300 consensus Probab=20.55 E-value=38 Score=15.29 Aligned_cols=16 Identities=31% Similarity=0.692 Sum_probs=7.4 Q ss_pred HHHHHHHCCCHHHHHC Q ss_conf 9999973799789850 Q 537021.9.peg.8 60 AKALLKRFATLGGVFG 75 (148) Q Consensus 60 A~~lL~~fGsL~~ll~ 75 (148) |..||++||||.+++. T Consensus 209 A~kLL~eyGsLE~I~~ 224 (880) T PRK07300 209 GLKLLHEFGSLEGIYE 224 (880) T ss_pred HHHHHHHCCCHHHHHH T ss_conf 9999997786999998 No 128 >COG2176 PolC DNA polymerase III, alpha subunit (gram-positive type) [DNA replication, recombination, and repair] Probab=20.19 E-value=37 Score=15.38 Aligned_cols=45 Identities=11% Similarity=0.156 Sum_probs=30.3 Q ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 97378988999999754999999999987640677889899999999 Q 537021.9.peg.8 81 LQEINRIGKRVALELKLVSVASQRILKANWSIKKFLTPGQPCLISVE 127 (148) Q Consensus 81 L~~i~GIG~aka~~l~a~~El~rR~~~~~~~~~~~l~s~~~v~~yl~ 127 (148) +..|+|+|.+.|-+|..+.|=+.=...+.+++|..++++ +.+++. T Consensus 1383 Fi~i~GlGe~vA~~IV~AR~Ek~FlS~eDlkkRtkis~t--~ie~~~ 1427 (1444) T COG2176 1383 FIAIPGLGENVAKSIVEAREEKEFLSKEDLKKRTKISKT--HIEKLD 1427 (1444) T ss_pred EECCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCHH--HHHHHH T ss_conf 001587168999999998634886778999874175588--999998 No 129 >pfam07747 MTH865 MTH865-like family. This domain has an EF-hand like fold. Probab=20.13 E-value=67 Score=13.70 Aligned_cols=36 Identities=19% Similarity=0.284 Sum_probs=24.8 Q ss_pred CHHHCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHCC Q ss_conf 922189899999997337998988999999997379 Q 537021.9.peg.8 33 GENALAYYEILELILFRLIPRKDTKSIAKALLKRFA 68 (148) Q Consensus 33 G~~~LsD~ELLallL~~g~~~~~~~~lA~~lL~~fG 68 (148) |--.++-.|+..++-.+--|-+|+.++|..|+++.| T Consensus 39 g~~~vtA~El~kll~~~DFP~k~ae~lad~i~~r~g 74 (75) T pfam07747 39 GGVEITAGELVKLLTDADFPFKSAEDLADKLVDKAG 74 (75) T ss_pred CCEEEEHHHHHHHCCCCCCCCCCHHHHHHHHHHHCC T ss_conf 898464999986567656886799999999998706 Done!