Query gi|254780636|ref|YP_003065049.1| ribonuclease D [Candidatus Liberibacter asiaticus str. psy62] Match_columns 207 No_of_seqs 126 out of 2436 Neff 6.8 Searched_HMMs 39220 Date Sun May 29 23:22:26 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780636.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG0349 Rnd Ribonuclease D [Tr 100.0 0 0 349.0 13.9 198 7-207 2-204 (361) 2 TIGR01388 rnd ribonuclease D; 100.0 0 0 328.4 12.4 186 7-194 4-190 (374) 3 PRK10829 ribonuclease D; Provi 100.0 0 0 323.7 16.6 185 2-189 1-188 (373) 4 cd06142 RNaseD_exo Ribonucleas 100.0 0 0 313.2 16.4 175 11-188 2-177 (178) 5 cd06147 Rrp6p_like_exo Yeast R 100.0 0 0 297.1 14.0 178 8-188 11-190 (192) 6 cd06148 Egl_like_exo The Egali 100.0 3.1E-43 0 270.5 15.5 169 14-183 4-189 (197) 7 cd06129 RNaseD_like The RNase 100.0 8.7E-43 0 267.9 13.8 148 16-164 11-158 (159) 8 pfam01612 3_5_exonuc 3'-5' exo 100.0 2.3E-42 0 265.4 12.8 161 10-172 9-172 (172) 9 smart00474 35EXOc 3'-5' exonuc 100.0 5.8E-41 1.4E-45 257.3 13.8 162 10-172 9-172 (172) 10 cd06146 mut-7_like_exo The mut 100.0 1.9E-39 4.8E-44 248.6 11.8 159 8-167 7-191 (191) 11 cd06141 WRN_exo WRN is a uniqu 100.0 3.8E-39 9.8E-44 246.7 12.6 150 17-167 15-168 (168) 12 cd06139 DNA_polA_I_Ecoli_like_ 100.0 3.1E-38 8E-43 241.4 15.2 174 17-192 2-193 (193) 13 cd00007 35EXOc 3'-5' exonuclea 100.0 7.8E-38 2E-42 239.1 12.9 151 21-171 1-155 (155) 14 PRK07556 consensus 100.0 4.1E-35 1.1E-39 223.3 14.4 190 11-203 377-593 (977) 15 PRK06887 consensus 100.0 2E-33 5.1E-38 213.5 14.5 189 11-203 346-576 (954) 16 PRK05755 DNA polymerase I; Pro 100.0 2.4E-33 6.1E-38 213.0 14.8 189 11-202 309-511 (889) 17 PRK07997 consensus 100.0 2.1E-33 5.5E-38 213.3 13.5 190 11-203 339-549 (928) 18 PRK07456 consensus 100.0 3.1E-32 8E-37 206.6 15.7 177 20-203 382-579 (975) 19 PRK07625 consensus 100.0 9.4E-32 2.4E-36 203.8 15.3 189 11-203 334-543 (922) 20 PRK08786 consensus 100.0 2.9E-32 7.4E-37 206.8 12.5 190 11-203 338-548 (927) 21 cd06128 DNA_polA_exo The 3'-5' 100.0 5.8E-32 1.5E-36 205.0 12.3 147 20-168 1-165 (166) 22 PRK08835 consensus 100.0 5.7E-32 1.5E-36 205.0 11.8 190 11-203 341-551 (931) 23 cd06140 DNA_polA_I_Bacillus_li 100.0 1.2E-30 3.1E-35 197.3 14.8 170 20-192 3-178 (178) 24 PRK07898 consensus 100.0 2.2E-30 5.7E-35 195.8 15.5 183 17-203 334-522 (902) 25 PRK05797 consensus 100.0 9.1E-30 2.3E-34 192.3 13.3 169 19-203 318-491 (869) 26 PRK08434 consensus 100.0 1.8E-28 4.5E-33 184.8 15.9 187 11-203 308-514 (887) 27 PRK08076 consensus 100.0 3.8E-28 9.7E-33 182.8 14.1 180 20-203 314-498 (877) 28 PRK05929 consensus 99.9 1.3E-26 3.3E-31 173.9 13.6 181 19-203 309-496 (870) 29 KOG2206 consensus 99.9 1.3E-27 3.3E-32 179.8 6.2 156 17-175 209-364 (687) 30 PRK08928 consensus 99.9 8.7E-25 2.2E-29 163.3 13.9 160 17-203 321-482 (861) 31 COG0749 PolA DNA polymerase I 99.9 1E-24 2.5E-29 163.0 13.0 183 17-203 19-213 (593) 32 PRK07300 consensus 99.9 1E-20 2.6E-25 139.7 12.8 173 23-203 322-499 (880) 33 cd06126 DEDDy DEDDy exonucleas 99.8 2E-18 5E-23 126.4 11.4 143 22-164 1-180 (181) 34 KOG2207 consensus 99.8 2.9E-19 7.4E-24 131.2 5.6 155 17-172 409-586 (617) 35 TIGR00593 pola DNA polymerase 99.5 2.6E-14 6.5E-19 102.5 6.9 134 66-202 444-621 (1005) 36 PRK06309 DNA polymerase III su 99.4 4.4E-12 1.1E-16 89.5 11.7 143 20-182 2-177 (232) 37 KOG4373 consensus 99.4 1.6E-12 4.2E-17 92.0 7.9 129 36-164 146-281 (319) 38 PRK07740 hypothetical protein; 99.3 4.3E-11 1.1E-15 83.7 11.5 142 17-180 54-231 (240) 39 PRK06310 DNA polymerase III su 99.3 4.4E-11 1.1E-15 83.6 10.5 136 16-172 3-175 (250) 40 PRK06631 consensus 99.3 4.2E-11 1.1E-15 83.8 10.0 130 19-169 3-174 (229) 41 PRK07942 DNA polymerase III su 99.2 3E-10 7.6E-15 78.8 11.9 136 19-174 5-183 (234) 42 TIGR01405 polC_Gram_pos DNA po 99.2 1.2E-10 3.1E-15 81.1 8.6 147 17-183 201-386 (1264) 43 PRK08074 bifunctional ATP-depe 99.2 1.3E-09 3.4E-14 75.1 13.1 136 20-176 3-174 (932) 44 PRK06022 consensus 99.1 4.3E-10 1.1E-14 77.9 9.7 127 22-168 3-167 (234) 45 PRK08517 DNA polymerase III su 99.1 6.7E-10 1.7E-14 76.8 10.5 135 17-173 67-235 (259) 46 PRK08858 consensus 99.1 6.2E-10 1.6E-14 77.0 10.1 133 18-168 8-184 (246) 47 PRK07246 bifunctional ATP-depe 99.1 2.5E-09 6.4E-14 73.5 13.1 136 19-176 6-175 (820) 48 PRK07009 consensus 99.1 8.6E-10 2.2E-14 76.2 10.5 127 22-168 3-169 (241) 49 PRK05711 DNA polymerase III su 99.1 1.2E-09 2.9E-14 75.4 9.9 129 21-169 5-174 (240) 50 PRK08816 consensus 99.1 1.5E-09 3.8E-14 74.7 10.1 128 21-168 2-170 (244) 51 cd06131 DNA_pol_III_epsilon_Ec 99.1 1.3E-09 3.3E-14 75.1 9.5 125 23-167 2-166 (167) 52 PRK06807 DNA polymerase III su 99.1 2.9E-09 7.5E-14 73.1 11.3 133 20-174 8-175 (313) 53 cd06125 DnaQ_like_exo The DnaQ 99.0 2.5E-10 6.3E-15 79.3 5.1 85 23-107 1-118 (130) 54 PRK07982 consensus 99.0 3E-09 7.5E-14 73.0 9.8 130 21-168 7-179 (243) 55 TIGR01406 dnaQ_proteo DNA poly 99.0 4.6E-09 1.2E-13 71.9 10.4 130 22-170 2-179 (246) 56 cd06134 RNaseT RNase T is a DE 99.0 3.6E-09 9.2E-14 72.5 9.8 132 19-170 4-188 (189) 57 PRK00448 polC DNA polymerase I 99.0 8.1E-09 2.1E-13 70.5 11.5 145 14-179 412-591 (1436) 58 PRK07883 hypothetical protein; 99.0 1.4E-08 3.5E-13 69.2 12.2 143 16-179 20-199 (575) 59 PRK06859 consensus 99.0 6.6E-09 1.7E-13 71.0 10.4 132 18-168 3-177 (253) 60 PRK05168 ribonuclease T; Provi 99.0 6.2E-09 1.6E-13 71.2 10.1 140 20-179 17-209 (213) 61 COG2176 PolC DNA polymerase II 99.0 3.5E-09 9E-14 72.6 8.6 149 13-182 414-597 (1444) 62 PRK09145 DNA polymerase III su 99.0 4E-09 1E-13 72.3 8.8 131 18-169 27-199 (203) 63 cd06127 DEDDh DEDDh exonucleas 98.9 6.7E-09 1.7E-13 71.0 7.9 122 23-164 1-158 (159) 64 TIGR01407 dinG_rel DnaQ family 98.9 7E-08 1.8E-12 65.0 11.7 134 22-176 2-171 (944) 65 cd06144 REX4_like This subfami 98.8 1.1E-08 2.8E-13 69.7 7.1 123 23-165 1-152 (152) 66 PRK09146 DNA polymerase III su 98.8 2E-08 5E-13 68.3 8.1 150 16-170 43-226 (239) 67 PRK06063 DNA polymerase III su 98.8 1.2E-07 3.1E-12 63.6 11.5 131 19-170 14-178 (313) 68 pfam00929 Exonuc_X-T Exonuclea 98.8 8.1E-08 2.1E-12 64.7 10.1 124 22-164 1-162 (162) 69 smart00479 EXOIII exonuclease 98.8 4.4E-08 1.1E-12 66.2 8.4 131 22-172 2-168 (169) 70 PRK07247 DNA polymerase III su 98.7 5E-07 1.3E-11 60.1 11.2 136 20-178 5-176 (195) 71 PRK06195 DNA polymerase III su 98.6 6.7E-07 1.7E-11 59.4 10.1 139 21-181 2-174 (309) 72 cd06137 DEDDh_RNase This group 98.6 1.3E-07 3.4E-12 63.4 6.1 124 23-164 1-160 (161) 73 cd06136 TREX1_2 Three prime re 98.5 1.1E-06 2.7E-11 58.2 10.1 79 66-165 88-175 (177) 74 KOG2249 consensus 98.5 1.7E-06 4.3E-11 57.0 10.0 134 20-173 105-268 (280) 75 cd06130 DNA_pol_III_epsilon_li 98.4 1.5E-06 3.9E-11 57.3 7.8 119 22-164 1-154 (156) 76 cd06138 ExoI_N This subfamily 98.4 8.9E-06 2.3E-10 52.8 10.8 123 23-164 1-182 (183) 77 cd06149 ISG20 Interferon (IFN) 98.3 1.9E-06 4.8E-11 56.8 7.0 125 23-165 1-157 (157) 78 cd06145 REX1_like This subfami 98.3 1.5E-06 3.9E-11 57.3 5.6 123 23-164 1-149 (150) 79 cd06143 PAN2_exo PAN2 is the c 98.3 1.7E-06 4.3E-11 57.0 5.6 75 71-165 99-174 (174) 80 KOG2405 consensus 98.2 4.8E-07 1.2E-11 60.2 2.3 150 22-179 197-365 (458) 81 PRK07983 exodeoxyribonuclease 98.2 2E-05 5.2E-10 50.7 10.0 125 22-169 2-152 (219) 82 cd06133 ERI-1_3'hExo_like This 98.2 4.4E-05 1.1E-09 48.8 11.4 130 22-167 1-175 (176) 83 cd05781 DNA_polB_B3_exo The 3' 98.2 3.4E-05 8.7E-10 49.4 10.2 145 20-167 3-188 (188) 84 COG0847 DnaQ DNA polymerase II 98.1 5.9E-05 1.5E-09 48.1 10.4 140 20-175 13-187 (243) 85 COG3359 Predicted exonuclease 97.9 0.00027 7E-09 44.2 10.6 153 21-174 99-272 (278) 86 TIGR00573 dnaq exonuclease, DN 97.8 0.00054 1.4E-08 42.5 11.1 136 20-174 7-182 (228) 87 cd05780 DNA_polB_Kod1_like_exo 97.8 0.00014 3.5E-09 45.9 7.5 145 20-167 3-195 (195) 88 cd05160 DNA_polB_exo The 3'-5' 97.7 0.00018 4.6E-09 45.2 7.7 142 22-165 1-200 (200) 89 KOG3657 consensus 97.7 0.00013 3.3E-09 46.1 6.3 98 74-173 241-385 (1075) 90 PRK09182 DNA polymerase III su 97.6 0.00017 4.4E-09 45.4 5.6 82 22-104 39-152 (293) 91 cd05779 DNA_polB_epsilon_exo T 97.6 0.00056 1.4E-08 42.4 8.1 88 71-164 87-203 (204) 92 pfam04857 CAF1 CAF1 family rib 97.5 0.0006 1.5E-08 42.2 7.4 113 16-129 18-212 (235) 93 cd05785 DNA_polB_like2_exo A s 97.4 0.0031 7.8E-08 38.1 10.4 142 17-163 6-205 (207) 94 PRK05762 DNA polymerase II; Re 97.4 0.0021 5.5E-08 39.0 9.1 139 21-168 156-347 (785) 95 KOG1275 consensus 97.2 0.00045 1.1E-08 42.9 4.5 95 68-185 1008-1102(1118) 96 cd05784 DNA_polB_II_exo The 3' 97.1 0.0019 4.9E-08 39.3 6.4 139 20-165 3-193 (193) 97 cd05782 DNA_polB_like1_exo A s 97.0 0.011 2.8E-07 34.8 9.8 97 65-165 84-207 (208) 98 KOG2248 consensus 96.7 0.0021 5.4E-08 39.0 4.3 83 66-166 287-370 (380) 99 cd05777 DNA_polB_delta_exo The 96.7 0.0097 2.5E-07 35.2 7.4 104 66-171 78-227 (230) 100 KOG2405 consensus 96.7 4.3E-05 1.1E-09 48.9 -4.9 88 19-109 58-147 (458) 101 PRK11779 sbcB exonuclease I; P 96.5 0.058 1.5E-06 30.7 10.8 145 19-170 6-198 (477) 102 PTZ00166 DNA polymerase delta 96.5 0.031 7.9E-07 32.3 8.7 150 19-172 289-513 (1081) 103 cd05776 DNA_polB_alpha_exo The 95.2 0.032 8E-07 32.2 4.4 95 71-168 96-227 (234) 104 PRK06722 exonuclease; Provisio 94.8 0.16 4.1E-06 28.1 7.0 46 57-104 36-86 (242) 105 TIGR01298 RNaseT ribonuclease 94.6 0.2 5.2E-06 27.5 7.1 135 21-175 9-197 (201) 106 smart00486 POLBc DNA polymeras 94.3 0.38 9.8E-06 25.9 8.2 95 71-167 83-220 (471) 107 cd06135 Orn Oligoribonuclease 92.8 0.28 7.2E-06 26.7 5.2 19 22-40 1-19 (173) 108 cd05783 DNA_polB_B1_exo The 3' 92.7 0.62 1.6E-05 24.7 6.9 140 20-164 5-201 (204) 109 KOG0969 consensus 86.8 1.5 3.9E-05 22.4 4.9 156 17-177 271-500 (1066) 110 KOG0304 consensus 78.0 3.9 9.9E-05 20.1 4.2 66 70-136 138-220 (239) 111 pfam03104 DNA_pol_B_exo DNA po 74.4 6.2 0.00016 18.9 4.7 75 19-94 85-188 (254) 112 PRK05359 oligoribonuclease; Pr 70.3 5.7 0.00015 19.1 3.5 22 19-40 2-23 (181) 113 KOG1798 consensus 64.6 10 0.00026 17.6 8.6 150 16-170 242-453 (2173) 114 COG1949 Orn Oligoribonuclease 63.6 8.1 0.00021 18.2 3.1 27 18-46 4-30 (184) 115 COG2925 SbcB Exonuclease I [DN 63.2 11 0.00028 17.5 10.3 145 18-170 7-200 (475) 116 PRK07748 sporulation inhibitor 54.1 16 0.0004 16.5 5.3 87 66-169 86-178 (205) 117 pfam06369 Anemone_cytotox Sea 51.5 9.8 0.00025 17.7 1.9 76 22-97 32-119 (176) 118 KOG0970 consensus 48.5 19 0.00049 16.0 5.1 96 71-169 621-753 (1429) 119 COG0417 PolB DNA polymerase el 47.8 20 0.0005 15.9 9.0 145 18-165 152-348 (792) 120 pfam01707 Peptidase_C9 Peptida 46.2 21 0.00053 15.8 4.5 79 66-145 30-126 (202) 121 cd05778 DNA_polB_zeta_exo The 42.4 9.1 0.00023 17.9 0.6 50 115-164 176-230 (231) 122 KOG3242 consensus 41.2 25 0.00064 15.3 3.0 34 9-42 15-48 (208) 123 TIGR02093 P_ylase glycogen/sta 41.1 5.8 0.00015 19.0 -0.6 40 66-105 508-554 (822) 124 PRK05601 DNA polymerase III su 39.7 27 0.00068 15.2 8.2 77 16-94 42-145 (377) 125 pfam11567 PfUIS3 Plasmodium fa 37.0 29 0.00075 14.9 4.4 43 144-188 41-83 (101) 126 TIGR01720 NRPS-para261 non-rib 36.7 30 0.00076 14.9 3.5 53 96-151 117-169 (171) 127 KOG0217 consensus 32.5 35 0.00089 14.5 4.9 20 21-47 393-412 (1125) 128 TIGR02238 recomb_DMC1 meiotic 29.3 29 0.00074 15.0 1.4 19 10-28 121-139 (314) 129 TIGR02236 recomb_radA DNA repa 28.8 30 0.00077 14.9 1.4 28 17-49 138-167 (333) 130 pfam03175 DNA_pol_B_2 DNA poly 28.5 41 0.001 14.1 3.6 31 144-174 122-152 (452) 131 pfam09210 DUF1957 Domain of un 23.4 51 0.0013 13.5 4.4 41 135-175 36-76 (100) 132 TIGR01744 XPRTase xanthine pho 23.2 51 0.0013 13.5 2.6 42 46-87 116-166 (191) 133 TIGR02242 tail_TIGR02242 phage 23.0 52 0.0013 13.5 2.4 41 105-145 83-134 (138) 134 TIGR01074 rep ATP-dependent DN 22.5 53 0.0014 13.4 5.0 101 65-178 67-183 (677) 135 pfam09281 Taq-exonuc Taq polym 22.3 54 0.0014 13.4 2.9 58 82-150 69-126 (138) No 1 >COG0349 Rnd Ribonuclease D [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=349.01 Aligned_cols=198 Identities=42% Similarity=0.644 Sum_probs=180.1 Q ss_pred CCCCCCHH-H-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCH Q ss_conf 25788857-8-424898279971777898544507999984189428997465778570677677732364023010000 Q gi|254780636|r 7 HEGDIPAE-C-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRF 84 (207) Q Consensus 7 ~~~dl~~~-~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~ 84 (207) +.+|+++. + .+++++.||+||||.|+++|+|++||||++++++ +.+|++.....+.++|..+|.|++|+||||+|+| T Consensus 2 ~~~~~l~~~~~~~~~~~~iAIDTEf~r~~t~~p~LcLIQi~~~e~-~~lIdp~~~~~d~~~l~~Ll~d~~v~KIfHaa~~ 80 (361) T COG0349 2 TTGDLLAAACALLRGSKAIAIDTEFMRLRTYYPRLCLIQISDGEG-ASLIDPLAGILDLPPLVALLADPNVVKIFHAARF 80 (361) T ss_pred CCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCEEEEEEECCCC-CEEECCCCCCCCCCHHHHHHCCCCEEEEECCCCC T ss_conf 742679999997428980798322246643688337999844888-6686226554666369998628760255025635 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 16888654200000004578988652110000008877765420024675200365654432368999986459999999 Q gi|254780636|r 85 DIAVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALR 164 (207) Q Consensus 85 D~~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~ 164 (207) |+.+|.+.||+.|.|+|||+||+++++.+. +|||++||++++|++|||++|+|||++||||++|+.|||.||+||++|| T Consensus 81 DL~~l~~~~g~~p~plfdTqiAa~l~g~~~-~~gl~~Lv~~ll~v~ldK~~q~SDW~~RPLs~~Ql~YAa~DV~yL~~l~ 159 (361) T COG0349 81 DLEVLLNLFGLLPTPLFDTQIAAKLAGFGT-SHGLADLVEELLGVELDKSEQRSDWLARPLSEAQLEYAAADVEYLLPLY 159 (361) T ss_pred CHHHHHHHCCCCCCCHHHHHHHHHHCCCCC-CCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 199999861888773468999999809863-2329999999968853644454655557999999999999999999999 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHH--HH-HHHCCCCCCCCCCCC Q ss_conf 99999999749189999999640068--99-986679888844359 Q gi|254780636|r 165 LQFTEKLQRLGRSDLATSCCNFLMDR--AE-LDLLGWENVDIFSHS 207 (207) Q Consensus 165 ~~l~~~L~~~~~~~~~~~e~~~l~~~--~~-me~~g~~~~~~~~~~ 207 (207) ++|.++|.++|+.+|+++||++++.+ +. .+..+|+++++ .|+ T Consensus 160 ~~L~~~L~~~~r~~~a~~ef~~l~~r~~~~~~~~~~w~~i~~-a~~ 204 (361) T COG0349 160 DKLTEELAREGRLEWAEDEFRLLPTRRTYKVLPEDAWREIKI-AHS 204 (361) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH-HHC T ss_conf 999999987485499999999731023455681767888666-504 No 2 >TIGR01388 rnd ribonuclease D; InterPro: IPR006292 These proteins represent ribonuclease D, a 3-exonuclease shown to act on tRNA both in vitro and when overexpressed in vivo. Members of this family are restricted to the Proteobacteria; Aquifex, Mycobacteria and eukaryotes. Ribonuclease D is not essential in Escherichia coli but is deleterious when overexpressed. The precise biological role of ribonuclease D is still unknown. ; GO: 0004540 ribonuclease activity, 0008033 tRNA processing, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=328.40 Aligned_cols=186 Identities=30% Similarity=0.479 Sum_probs=168.8 Q ss_pred CCCCCCHHHH-CCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHH Q ss_conf 2578885784-248982799717778985445079999841894289974657785706776777323640230100001 Q gi|254780636|r 7 HEGDIPAECA-ARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFD 85 (207) Q Consensus 7 ~~~dl~~~~~-l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D 85 (207) ..++|...|. .+..++||+||||.+.+||+|++||||++.++...+++|+.-.. +...|..+|+|++|+||+|.|+.| T Consensus 4 ~~~~La~~c~~~~~~p~valDTEF~R~~TFwP~LgLiQ~a~g~~~~LIDPL~~~~-~~~Pl~~lL~d~~v~KvlHAaseD 82 (374) T TIGR01388 4 TDDELATVCEALRTEPFVALDTEFVRERTFWPQLGLIQVADGEQLALIDPLVIDI-DLSPLLELLADESVVKVLHAASED 82 (374) T ss_pred CHHHHHHHHHHHCCCCEEECCCCEECCCCCCHHHCCEECCCCCEEEEECCCCCCC-CHHHHHHHHHCCCCEEEECCCCHH T ss_conf 6688999998614788374155411265320204003113787568765888752-347999997389815853045047 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 68886542000000045789886521100000088777654200246752003656544323689999864599999999 Q gi|254780636|r 86 IAVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRL 165 (207) Q Consensus 86 ~~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~ 165 (207) +.+|.+-+|..|.|+|||||++-++ |++.+.||+.||+++++++|||++++|||..||||+.|+.|||.||+||+++|. T Consensus 83 lEvF~~l~g~lP~PLfDTQi~Aa~~-G~g~s~gY~~LV~~~~~v~LdKse~rtDW~~RPLtd~Ql~YAaaDV~yL~~~y~ 161 (374) T TIGR01388 83 LEVFLNLFGVLPKPLFDTQIAAAFL-GFGESMGYDKLVQEVLGVELDKSETRTDWLARPLTDKQLEYAAADVTYLLPLYE 161 (374) T ss_pred HHHHHHHHCCCCCCCHHHHHHHHHH-CCCCCHHHHHHHHHHHCEECCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 9999987057988830588999983-678521089999997370317888722210478987789999999999999999 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHHH Q ss_conf 99999997491899999996400689998 Q gi|254780636|r 166 QFTEKLQRLGRSDLATSCCNFLMDRAELD 194 (207) Q Consensus 166 ~l~~~L~~~~~~~~~~~e~~~l~~~~~me 194 (207) +|.++|.+.||..|.++||+.+..+...+ T Consensus 162 ~L~~~l~~~~R~~~l~ee~~~~~~~~~~~ 190 (374) T TIGR01388 162 KLTERLAEEGRLAWLEEECELLTDRRTYD 190 (374) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999988645999972059873024207 No 3 >PRK10829 ribonuclease D; Provisional Probab=100.00 E-value=0 Score=323.69 Aligned_cols=185 Identities=24% Similarity=0.344 Sum_probs=166.7 Q ss_pred CCEEECCC--CCCHHHH-CCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEE Q ss_conf 72153257--8885784-24898279971777898544507999984189428997465778570677677732364023 Q gi|254780636|r 2 TTIRVHEG--DIPAECA-ARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKI 78 (207) Q Consensus 2 ~~~~~~~~--dl~~~~~-l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV 78 (207) |+|+++.. +|.+.+. ++.++.|||||||.+.++|+|++|||||++++..++++++. . .+...|+++|+||+|+|| T Consensus 1 m~y~~I~t~~~L~~~~~~l~~~~~iavDTEf~r~~ty~p~l~LiQi~~~~~~~lIDpl~-~-~dl~~l~~ll~~~~i~kv 78 (373) T PRK10829 1 MNYQMITTDDALASLCEAVRAFPAIALDTEFVRTRTYYPQLGLIQLFDGEQLALIDPLG-I-TDWSPFKALLRDPSITKF 78 (373) T ss_pred CEEEEECCHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCEEEEECCC-C-CCHHHHHHHHCCCCCEEE T ss_conf 91599798999999999970799189857745677789845899985189679994787-7-776999999709884165 Q ss_pred ECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHH Q ss_conf 01000016888654200000004578988652110000008877765420024675200365654432368999986459 Q gi|254780636|r 79 FHYGRFDIAVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVV 158 (207) Q Consensus 79 ~Hn~~~D~~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~ 158 (207) +|+|++|+.+|.+.+|+.|.|+||||||+.++ |++.+.||+.||+++||++++|++|+|||++||||++|+.|||.||. T Consensus 79 ~Haa~qDle~l~~~~g~~p~~iFDTQiAa~~l-g~~~~~gy~~Lv~~~l~v~l~K~~~~sDW~~RPLs~~ql~YA~~DV~ 157 (373) T PRK10829 79 LHAGSEDLEVFLNAFGELPQPLIDTQILAAFC-GRPLSWGFASMVEEYSGVALDKSESRTDWLARPLSERQCEYAAADVW 157 (373) T ss_pred EECCCCHHHHHHHHCCCCCCHHHHHHHHHHHC-CCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 20642159999998198874077648899865-78655589999999939665666462757889999899999999999 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 9999999999999974918999999964006 Q gi|254780636|r 159 HLHALRLQFTEKLQRLGRSDLATSCCNFLMD 189 (207) Q Consensus 159 ~l~~L~~~l~~~L~~~~~~~~~~~e~~~l~~ 189 (207) ||++||+.|.++|.+.|+++|+.+||..+.. T Consensus 158 yL~~l~~~L~~~l~~~~r~~w~~eE~~~l~~ 188 (373) T PRK10829 158 YLLPIAAKLMVETEAAGWLPAALDECRLMQQ 188 (373) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 9999999999999835755899999999972 No 4 >cd06142 RNaseD_exo Ribonuclease (RNase) D is a bacterial enzyme involved in the maturation of small stable RNAs and the 3' maturation of tRNA. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. In vivo, RNase D only becomes essential upon removal of other ribonucleases. Eukaryotic RNase D homologs include yeast Rrp6p, human PM/Scl-100 and the Drosophila melanogaster egalitarian protein. Probab=100.00 E-value=0 Score=313.18 Aligned_cols=175 Identities=36% Similarity=0.526 Sum_probs=160.5 Q ss_pred CCHHH-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHHHHH Q ss_conf 88578-42489827997177789854450799998418942899746577857067767773236402301000016888 Q gi|254780636|r 11 IPAEC-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDIAVL 89 (207) Q Consensus 11 l~~~~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L 89 (207) |.+.+ .++++++|||||||++++++++++||||+|+++ .+|++++.... ..+.|+++|+|++|+|||||+++|+.+| T Consensus 2 l~~~~~~l~~~~~i~~DtE~~~~~~~~~~l~LiQia~~~-~~ylid~~~~~-~~~~L~~lL~d~~i~KVgh~~~~Dl~~L 79 (178) T cd06142 2 LADLCERLAGAGVIAVDTEFMRLRTYYPRLCLIQISTGG-QAYLIDPLAIG-DLSPLKELLADPNIVKVFHAAREDLELL 79 (178) T ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCCEEEEEEEEECC-CEEEEECCCCC-CHHHHHHHHCCCCCEEEEEEHHHHHHHH T ss_conf 899999970799489989867885588728999998689-06999531234-4699999975999439996279999999 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 65420000000457898865211000000887776542002467520036565443236899998645999999999999 Q gi|254780636|r 90 FYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTE 169 (207) Q Consensus 90 ~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~ 169 (207) .+++|+.+.|+||||+|+++++ ...++||++|++++||++++|+.|+|||++||||++|+.|||.||+||++||+.+.+ T Consensus 80 ~~~~~~~~~~v~Dt~ia~~~l~-~~~~~~L~~L~~~~lg~~l~K~~q~s~W~~rPLs~~qi~YAa~Da~~l~~l~~~l~~ 158 (178) T cd06142 80 KRDFGILPVNLFDTQIAARLLG-LGDSVGLAALVEELLGVELDKGEQRSDWLKRPLTDEQLEYAANDVRYLLPLYEKLKE 158 (178) T ss_pred HHHHCCCCCCEEEHHHHHHHCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9973967654530899999828-867378999999992987777736466433678289999999979999999999999 Q ss_pred HHHHCCCHHHHHHHHHHHH Q ss_conf 9997491899999996400 Q gi|254780636|r 170 KLQRLGRSDLATSCCNFLM 188 (207) Q Consensus 170 ~L~~~~~~~~~~~e~~~l~ 188 (207) +|.++|+.+|+++||+.++ T Consensus 159 ~L~~~~~~~~~~~e~~~~~ 177 (178) T cd06142 159 ELEEEGRLEWAEEECELLL 177 (178) T ss_pred HHHHCCHHHHHHHHHHHCC T ss_conf 9998683999999998626 No 5 >cd06147 Rrp6p_like_exo Yeast Rrp6p and its human homolog, the Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100), are exosome-associated proteins involved in the degradation and processing of precursors to stable RNAs. Both proteins contain a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PM/Scl-100, an autoantigen present in the nucleolar compartment of the cell, reacts with autoantibodies produced by about 50% of patients with polymyositis-scleroderma overlap syndrome. Probab=100.00 E-value=0 Score=297.09 Aligned_cols=178 Identities=22% Similarity=0.237 Sum_probs=153.9 Q ss_pred CCCCCHHH-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHH Q ss_conf 57888578-42489827997177789854450799998418942899746577857067767773236402301000016 Q gi|254780636|r 8 EGDIPAEC-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDI 86 (207) Q Consensus 8 ~~dl~~~~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~ 86 (207) +++|.+.+ .+++.++||+||||++.++|++++|||||++++.. |++++.......+.|+++|+||+|+||||++++|+ T Consensus 11 ~~~L~~~~~~L~~~~~ialDtE~~~~~~~~~~l~LiQiat~~~~-yliD~l~l~~~~~~L~~ll~~~~I~KV~H~~~~Dl 89 (192) T cd06147 11 EEKLEELVEKLKNCKEIAVDLEHHSYRSYLGFTCLMQISTREED-YIVDTLKLRDDMHILNEVFTDPNILKVFHGADSDI 89 (192) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCEEEEEEEECCCCC-EEEEECCCCCCHHHHHHHHCCCCEEEEEECHHHHH T ss_conf 99999999998269969997973788569982799999509988-79874343003799999975988379985578779 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 88865420000000457898865211000000887776542002467520036565443236899998645999999999 Q gi|254780636|r 87 AVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQ 166 (207) Q Consensus 87 ~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~ 166 (207) ++|++++|+.+.|+||||+|+++++. .++||++|+++++|++++|+.|+|||++||||++|+.|||.||+||++||++ T Consensus 90 ~~L~~~~gi~~~n~fDT~iaa~~l~~--~~~sL~~Lv~~~lg~~l~K~~q~SdW~~RPLs~~Ql~YAA~Dv~~Ll~ly~~ 167 (192) T cd06147 90 IWLQRDFGLYVVNLFDTGQAARVLNL--PRHSLAYLLQKYCNVDADKKYQLADWRIRPLPEEMIKYAREDTHYLLYIYDR 167 (192) T ss_pred HHHHHHHCCCCCCCHHHHHHHHHHCC--CCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99999858987660428999998488--8111999999993958873231143799989999999999999999999999 Q ss_pred HHHHHHHCC-CHHHHHHHHHHHH Q ss_conf 999999749-1899999996400 Q gi|254780636|r 167 FTEKLQRLG-RSDLATSCCNFLM 188 (207) Q Consensus 167 l~~~L~~~~-~~~~~~~e~~~l~ 188 (207) |.++|.+++ ++.-.-.||.+.. T Consensus 168 L~~~L~e~~~~~~~~~~~~~~~~ 190 (192) T cd06147 168 LRNELLERANALAPNLLESVLNC 190 (192) T ss_pred HHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99999984286416788999972 No 6 >cd06148 Egl_like_exo The Egalitarian (Egl) protein subfamily is composed of Drosophila Egl and similar proteins. Egl is a component of an mRNA-binding complex which is required for oocyte specification. Egl contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The conservation of this subfamily throughout eukaryotes suggests that its members may be part of ancient RNA processing complexes that are likely to participate in the regulated processing of specific mRNAs. Probab=100.00 E-value=3.1e-43 Score=270.53 Aligned_cols=169 Identities=24% Similarity=0.319 Sum_probs=147.3 Q ss_pred HHHCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCC--CHHHHHHHHCCCCCEEECCCCHHHHHHHH Q ss_conf 7842489827997177789854450799998418942899746577857--06776777323640230100001688865 Q gi|254780636|r 14 ECAARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKN--APNLVGMLVDEKREKIFHYGRFDIAVLFY 91 (207) Q Consensus 14 ~~~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~--~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~ 91 (207) ...++++++|||||||.+++++ +++||||+|++++.+|+|++...... .+.|+++|||++|+||||+|++|+.+|++ T Consensus 4 i~~L~~~~vi~iD~E~~~~~~~-g~l~LiQiat~~~~vyl~D~~~~~~~~~~~~Lk~iLe~~~I~KV~H~~r~D~~~L~~ 82 (197) T cd06148 4 IIHLKKQKVIGLDCEGVNLGRK-GKLCLVQIATRTGQIYLFDILKLGSIVFINGLKDILESKKILKVIHDCRRDSDALYH 82 (197) T ss_pred HHHHHHCCEEEEECCCCCCCCC-CCEEEEEEEECCCCEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHH T ss_conf 9998759999998886788989-988799996489988999616676344568999995699823999610999999999 Q ss_pred HHHCCCCCHHHHHHHHHHHHHHC-------CCHHHHHHHHHHHCCCCCCC--------CEECCCCCCCCCHHHHHHHHHH Q ss_conf 42000000045789886521100-------00008877765420024675--------2003656544323689999864 Q gi|254780636|r 92 TFGVRVRPVFCTKIASRLTRTYT-------NQHGLKDNLKELLGINISKA--------QQSSDWSADDLSDEQLQYAASD 156 (207) Q Consensus 92 ~~gi~~~~i~DT~ias~l~~~~~-------~~~~L~~L~~~~lg~~ldK~--------~q~SdW~~rpLs~~Qi~YAA~D 156 (207) ++|+.+.|+||||+|+.+++... ...||++++++++++.++|. .+.|||++||||++|+.|||.| T Consensus 83 ~~gi~~~nvfDTqiA~~ll~~~~~~~~~~~~~~sl~~l~~~~l~~~~~~~~~~~~~~~~~~s~W~~RPLt~~ql~YAa~D 162 (197) T cd06148 83 QYGIKLNNVFDTQVADALLQEQETGGFNPDRVISLVQLLDKYLYISISLKEDVKKLMREDPKFWALRPLTEDMIRYAALD 162 (197) T ss_pred HHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHH T ss_conf 70997067031999999975313147677564159999999957640177877888750655446799999999999998 Q ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 599999999999999974918999999 Q gi|254780636|r 157 VVHLHALRLQFTEKLQRLGRSDLATSC 183 (207) Q Consensus 157 v~~l~~L~~~l~~~L~~~~~~~~~~~e 183 (207) |.||++||+.+.++|.++.+...++.- T Consensus 163 V~yL~~L~~~L~~~Li~~~~~~~~~~~ 189 (197) T cd06148 163 VLCLLPLYYAMLDALISKFLKAVFKYL 189 (197) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999998 No 7 >cd06129 RNaseD_like The RNase D-like group is composed of RNase D, WRN, and similar proteins. They contain a DEDDy-type, DnaQ-like, 3'-5' exonuclease domain that contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. RNase D is involved in the 3'-end processing of tRNA precursors. RNase D-like proteins in eukaryotes include yeast Rrp6p, human PM/Scl-100 and Drosophila melanogaster egalitarian (Egl) protein. WRN is a unique DNA helicase possessing exonuclease activity. Mutation in the WRN gene is implicated in Werner syndrome, a disease associated with premature aging and increased predisposition to cancer. Yeast Rrp6p and the human Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100) are exosome-associated proteins involved in the degradation and processing of prec Probab=100.00 E-value=8.7e-43 Score=267.92 Aligned_cols=148 Identities=25% Similarity=0.304 Sum_probs=136.5 Q ss_pred HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHC Q ss_conf 42489827997177789854450799998418942899746577857067767773236402301000016888654200 Q gi|254780636|r 16 AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFGV 95 (207) Q Consensus 16 ~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi 95 (207) ..+.+.+|||||||.+..++++++|+||+|+++ .++++++......++.|+++|+|++|+||||++++|+.+|++++|+ T Consensus 11 ~~~~~~vig~DtE~~~~~~~~~~v~LiQlat~~-~~~lid~~~~~~~~~~L~~lL~~~~i~Kvgh~~~~D~~~L~~~~g~ 89 (159) T cd06129 11 AKNESIVVALDCEWIPLRLYYGEVTLIQIGTTE-HAFLFDVLSLSAFDGGLKTVLENPSITKVGHGCRGDLAKLLRDFGI 89 (159) T ss_pred HCCCCEEEEEECCCCCCCCCCCCEEEEEEEECC-EEEEEECCCCCCCHHHHHHHHHCCCCEEEEECHHHHHHHHHHHCCC T ss_conf 545980899979768987998978999994599-6899954888767799999982999249995669999999675084 Q ss_pred CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 000004578988652110000008877765420024675200365654432368999986459999999 Q gi|254780636|r 96 RVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALR 164 (207) Q Consensus 96 ~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~ 164 (207) .++|++||++++..++....++||+.++++++|++++|++|+|||++||||++|+.|||.||+++++|| T Consensus 90 ~~~~~~Dt~~~~~~l~~~~~~~sL~~l~~~~lg~~ldK~~q~SnW~~rpLt~~Qi~YAA~Da~~ll~ly 158 (159) T cd06129 90 LLRNLFDTQAAHAILGYPEEYWSLNSLVEQYLGPCLDKSRQLSDWAKRPLTEEQMLYAAADVYVLLHIY 158 (159) T ss_pred CCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 324303699999995888431659999999808787710245888999999999999999999999976 No 8 >pfam01612 3_5_exonuc 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in E. coli polI it is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Werner syndrome is a human genetic disorder causing premature aging; the WRN protein has helicase activity in the 3'-5' direction. The FFA-1 protein is required for formation of a replication foci and also has helicase activity; it is a homologue of the WRN protein. RNase D is a 3'-5' exonuclease involved in tRNA processing. Also found in this family is the autoantigen PM/Scl thought to be involved in polymyositis-scleroderma overlap syndrome. Probab=100.00 E-value=2.3e-42 Score=265.44 Aligned_cols=161 Identities=33% Similarity=0.479 Sum_probs=142.0 Q ss_pred CCCHHH-HCCCCCCEEEECCCCCCCCC--CCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHH Q ss_conf 888578-42489827997177789854--450799998418942899746577857067767773236402301000016 Q gi|254780636|r 10 DIPAEC-AARYVDAIAVDTETLGLMPR--RDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDI 86 (207) Q Consensus 10 dl~~~~-~l~~~~~iaiDtEt~~l~~~--~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~ 86 (207) ++.+.+ .+..+++|||||||+++++. +.+++++|++++++ +|+++........+.|+++|+|++|.|||||+++|+ T Consensus 9 ~l~~~~~~l~~~~~ia~DtEt~~~~~~~~~~~~~llq~~~~~~-~yli~~~~~~~~~~~L~~ll~~~~i~Kvghn~k~D~ 87 (172) T pfam01612 9 ELEDLIEELLNAKVVAVDTETTSLDTYSYTLVGCLIQISTGEG-AYIIDPLALGDDLEGLKRLLEDPNITKVGHNAKFDL 87 (172) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEEECCCC-CEEEECCCCHHHHHHHHHHHCCCCCCEEEEEHHHHH T ss_conf 9999999994099089980628988676134336999967986-077643475667999999975999879997589999 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 88865420000000457898865211000000887776542002467520036565443236899998645999999999 Q gi|254780636|r 87 AVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQ 166 (207) Q Consensus 87 ~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~ 166 (207) .+|.+++|+.+.|+||||+|+++++ ...+|||+.|++++||++++|+.|+|||++|||+++|+.|||.||+++++||+. T Consensus 88 ~~L~~~~~~~~~~~~Dt~~a~~~l~-~~~~~gL~~L~~~~lg~~~~K~~~~s~w~~rpL~~~qi~YAa~Da~~~~~l~~~ 166 (172) T pfam01612 88 EVLARDFGIKLENLFDTMLAAYLLG-YPRSHSLDDLAEKYLGVELDKSEQCADWRARPLSEEQLRYAAEDADYLLRLYDK 166 (172) T ss_pred HHHHHHCCCCCCCCHHHHHHHHHCC-CCCCCCHHHHHHHHCCCCCCHHHEECCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 9998854766687327899998819-964356999999980987874415667887898299999999999999999999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254780636|r 167 FTEKLQ 172 (207) Q Consensus 167 l~~~L~ 172 (207) |.++|+ T Consensus 167 l~~~l~ 172 (172) T pfam01612 167 LRKELE 172 (172) T ss_pred HHHHHC T ss_conf 999749 No 9 >smart00474 35EXOc 3'-5' exonuclease. 3\' -5' exonuclease proofreading domain present in DNA polymerase I, Werner syndrome helicase, RNase D and other enzymes Probab=100.00 E-value=5.8e-41 Score=257.31 Aligned_cols=162 Identities=35% Similarity=0.456 Sum_probs=142.2 Q ss_pred CCCHHHH-CC-CCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHHH Q ss_conf 8885784-24-898279971777898544507999984189428997465778570677677732364023010000168 Q gi|254780636|r 10 DIPAECA-AR-YVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDIA 87 (207) Q Consensus 10 dl~~~~~-l~-~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~ 87 (207) .+.+.+. ++ ..++|||||||+|++++++++|+||+|++.+.+|+++........+.|+++|+|+++.|||||+++|+. T Consensus 9 ~l~~~~~~l~~~~~~i~~DtE~~~~~~~~~~l~~iql~~~~~~~yi~~~~~~~~~~~~l~~ll~~~~i~kv~hn~k~D~~ 88 (172) T smart00474 9 TLEELLEKLRAAGGEVALDTETTGLNSYSGKLVLIQISVTGEGAFIIDPLALGDDLEILKDLLEDETITKVGHNAKFDLH 88 (172) T ss_pred HHHHHHHHHHHCCCEEEEECCCCCCCCCCCEEEEEEEEECCCEEEEEECCCCHHHHHHHHHHHCCCCCCEEEECHHHHHH T ss_conf 99999999984299599988306885378758999999689848997012242179999999809998599974599999 Q ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88654200000004578988652110000008877765420024675200365654432368999986459999999999 Q gi|254780636|r 88 VLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQF 167 (207) Q Consensus 88 ~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l 167 (207) +|.+ +|+.++|+||||+|++++++...++||+.++++++|++++|+.|+|||.+|||+++|+.|||.||++|++||+.| T Consensus 89 ~L~~-~gi~l~~v~Dt~ia~~l~~~~~~~~~L~~l~~~~~~~~l~k~~~~~~w~~rpl~~~~~~YAa~Da~~~~~l~~~l 167 (172) T smart00474 89 VLAR-FGIELENIFDTMLAAYLLLGGPSKHGLATLLKEYLGVELDKEEQKSDWGARPLSEEQLQYAAEDADALLRLYEKL 167 (172) T ss_pred HHHH-CCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHCCEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 9997-798567524679987750579864899999999749231555564666789990999999999999999999999 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780636|r 168 TEKLQ 172 (207) Q Consensus 168 ~~~L~ 172 (207) .++|. T Consensus 168 ~~~L~ 172 (172) T smart00474 168 EKELE 172 (172) T ss_pred HHHHC T ss_conf 99729 No 10 >cd06146 mut-7_like_exo The mut-7 subfamily is composed of Caenorhabditis elegans mut-7 and similar proteins found in plants and metazoans. Mut-7 is implicated in posttranscriptional gene silencing. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs, termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=100.00 E-value=1.9e-39 Score=248.55 Aligned_cols=159 Identities=23% Similarity=0.264 Sum_probs=131.2 Q ss_pred CCCCCHHH---HCCCCCCEEEECCCC--CCCCCCCEEEEEEEEECCCCEEEECCCCCCCC-----CHHHHHHHHCCCCCE Q ss_conf 57888578---424898279971777--89854450799998418942899746577857-----067767773236402 Q gi|254780636|r 8 EGDIPAEC---AARYVDAIAVDTETL--GLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKN-----APNLVGMLVDEKREK 77 (207) Q Consensus 8 ~~dl~~~~---~l~~~~~iaiDtEt~--~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~-----~~~L~~ll~d~~i~K 77 (207) ++++.+.+ .++..++|||||||. ...++++++||||||++++ +|++++...... ++.|+++|+||+|+| T Consensus 7 ~~~l~~~~~~~~l~~~~vvg~D~Ew~~~~~~~~~~~vaLiQlat~~~-~~liD~~~~~~~~~~~~~~~L~~ll~~~~i~K 85 (191) T cd06146 7 EEELEALLLALSLEAGRVVGIDSEWKPSFLGDSDPRVAILQLATEDE-VFLLDLLALENLESEDWDRLLKRLFEDPDVLK 85 (191) T ss_pred HHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCEEEEEEECCCC-CEEEEEECCCCCCHHHHHHHHHHHHCCCCEEE T ss_conf 99999999998426898899978757877788889527999952997-58999704565434778999999947999689 Q ss_pred EECCCCHHHHHHHHHHHC------CCCCHHHHHHHHHHHHHH----------CCCHHHHHHHHHHHCCCCCCCCEECCCC Q ss_conf 301000016888654200------000004578988652110----------0000088777654200246752003656 Q gi|254780636|r 78 IFHYGRFDIAVLFYTFGV------RVRPVFCTKIASRLTRTY----------TNQHGLKDNLKELLGINISKAQQSSDWS 141 (207) Q Consensus 78 V~Hn~~~D~~~L~~~~gi------~~~~i~DT~ias~l~~~~----------~~~~~L~~L~~~~lg~~ldK~~q~SdW~ 141 (207) |||++++|+..|.+.+|+ .++|++||+.+++.+.+. ...+||++|+++++|+.++|+.|+|||+ T Consensus 86 vg~~~~~D~~~L~~~~~~~~~~~~~~~~~~Dl~~~a~~~~~~~~~~~~~~~~~~~~gL~~L~~~~lg~~l~K~~q~SnW~ 165 (191) T cd06146 86 LGFGFKQDLKALSASYPALKCMFERVQNVLDLQNLAKELQKSDMGRLKGNLPSKTKGLADLVQAVLGKPLDKSEQCSNWE 165 (191) T ss_pred EEECHHHHHHHHHHHCCCCCCCCCCCCEEEEHHHHHHHHHCCCCCCHHHCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 99443868999998658643233364403579999999731122101100355675699999999098788565138999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 54432368999986459999999999 Q gi|254780636|r 142 ADDLSDEQLQYAASDVVHLHALRLQF 167 (207) Q Consensus 142 ~rpLs~~Qi~YAA~Dv~~l~~L~~~l 167 (207) +||||++|+.|||.||+++++||++| T Consensus 166 ~rPLt~~Qi~YAA~Da~~ll~iy~~L 191 (191) T cd06146 166 RRPLREEQILYAALDAYCLLEVFDKL 191 (191) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 99899999999999899999999739 No 11 >cd06141 WRN_exo WRN is a unique RecQ DNA helicase exhibiting an exonuclease activity. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Mutations in the WRN gene cause Werner syndrome, an autosomal recessive disorder associated with premature aging and increased susceptibility to cancer and type II diabetes. WRN interacts with key proteins involved in DNA replication, recombination, and repair. It is believed to maintain genomic stability and life span by participating in DNA processes. WRN is stimulated by Ku70/80, an important regulator of genomic stability. Probab=100.00 E-value=3.8e-39 Score=246.74 Aligned_cols=150 Identities=24% Similarity=0.275 Sum_probs=133.2 Q ss_pred CCCCCCEEEECCCCCCC--CCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHH Q ss_conf 24898279971777898--5445079999841894289974657785706776777323640230100001688865420 Q gi|254780636|r 17 ARYVDAIAVDTETLGLM--PRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFG 94 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~--~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~g 94 (207) +...++|||||||.... .+++++||||||+++ .|+++++......++.|+++|+|++|+||||++++|+..|.+++| T Consensus 15 l~~~~vvG~D~Ew~~~~~~~~~~~~aliQlat~~-~~~l~d~~~~~~~~~~L~~lL~~~~i~Kvg~~~~~D~~~L~~~~g 93 (168) T cd06141 15 LDDEKVVGFDLEWRPSFRKGKRNKVALLQLASES-RCLLFQLSHMDKLPPSLKQLLEDPSILKVGVGIKGDARKLRRDFG 93 (168) T ss_pred HCCCCEEEEECEECCCCCCCCCCCCEEEEEECCC-CEEEEEECCCCHHHHHHHHHHCCCCEEEEEEEEHHHHHHHHHHCC T ss_conf 5689889996645587777888971599982499-659997403312049999997599807999631889999987445 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC--CCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00000045789886521100000088777654200246--75200365654432368999986459999999999 Q gi|254780636|r 95 VRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINIS--KAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQF 167 (207) Q Consensus 95 i~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ld--K~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l 167 (207) +.++|++||+.+++..++...++||++|++++||+.++ |+.|+|||++||||++|+.|||.||+++++||++| T Consensus 94 ~~~~~~~Dl~~~a~~~~~~~~~~sL~~l~~~~lg~~l~K~k~~q~SdW~~rpLs~~Qi~YAA~Da~~~~~l~~~L 168 (168) T cd06141 94 IEVRGVVDLSHLAKRVGPRRKLVSLARLVEEVLGLPLSKDKKVRCSNWEARPLSKEQILYAATDAYASLELYRKL 168 (168) T ss_pred CCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 024876569999998577777632999999983962154665203888999999999999999999999999739 No 12 >cd06139 DNA_polA_I_Ecoli_like_exo The 3'-5' exonuclease domain of DNA polymerases has a fundamental role in reducing polymerase errors and is involved in proofreading activity. E.coli-like Polymerase I (pol I), a subgroup of family-A DNA polymerases, contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain in the same polypeptide chain as the polymerase domain. The exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. E. coli DNA pol I is involved in genome replication but is not the main replicating enzyme. It is also implicated in DNA repair. Probab=100.00 E-value=3.1e-38 Score=241.43 Aligned_cols=174 Identities=25% Similarity=0.275 Sum_probs=150.3 Q ss_pred CCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCC--------CHHHHHHHHCCCCCEEECCCCHHHHH Q ss_conf 2489827997177789854450799998418942899746577857--------06776777323640230100001688 Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKN--------APNLVGMLVDEKREKIFHYGRFDIAV 88 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~--------~~~L~~ll~d~~i~KV~Hn~~~D~~~ 88 (207) +..+++|||||||+|++|+++++++||+|++.+..+++|+.+.... ...|+++|+|+++.||+||+|||+.+ T Consensus 2 l~~~~~ia~DtETtgl~~~~~~ivGisls~~~~~~~YiP~~h~~~~~~l~~~~v~~~lk~~led~~i~Kv~hn~k~D~~~ 81 (193) T cd06139 2 LEKAKVFAFDTETTSLDPMQAELVGISFAVEPGEAYYIPLGHDYGGEQLPREEVLAALKPLLEDPSIKKVGQNLKFDLHV 81 (193) T ss_pred CCCCCEEEEECCCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEECCHHHHHH T ss_conf 75598789978369998688849999998079968998723656544679999999999997399986887547978999 Q ss_pred HHHHHHCCCCC-HHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC-------CCCCEECCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 86542000000-04578988652110000008877765420024-------67520036565443236899998645999 Q gi|254780636|r 89 LFYTFGVRVRP-VFCTKIASRLTRTYTNQHGLKDNLKELLGINI-------SKAQQSSDWSADDLSDEQLQYAASDVVHL 160 (207) Q Consensus 89 L~~~~gi~~~~-i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~l-------dK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l 160 (207) |.++ |+.+.+ +|||||||+++++...+|||+.|++++||.++ .|+...++|...|+ +.+..|||.||.++ T Consensus 82 L~~~-gi~~~~~~~DTmiaa~ll~~~~~~~~L~~La~~~l~~~~~~~~~l~gkg~~~~~~~~~~~-~~~~~YAa~Da~~t 159 (193) T cd06139 82 LANH-GIELRGPAFDTMLASYLLNPGRRRHGLDDLAERYLGHKTISFEDLVGKGKKQITFDQVPL-EKAAEYAAEDADIT 159 (193) T ss_pred HHHC-CCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCHHHHHCCCCCCCCHHHCCH-HHHHHHHHHHHHHH T ss_conf 9983-986678302189999872799888899999999818876638998377855378313329-99999999999999 Q ss_pred HHHHHHHHHHHHHC-CCHHHHHH-HHHHHHHHHH Q ss_conf 99999999999974-91899999-9964006899 Q gi|254780636|r 161 HALRLQFTEKLQRL-GRSDLATS-CCNFLMDRAE 192 (207) Q Consensus 161 ~~L~~~l~~~L~~~-~~~~~~~~-e~~~l~~~~~ 192 (207) ++||+.|.++|++. ++.++|++ |+++++++++ T Consensus 160 ~~L~~~l~~~L~~~~~l~~ly~~iE~PL~~VLa~ 193 (193) T cd06139 160 LRLYELLKPKLKEEPGLLELYEEIEMPLIPVLAE 193 (193) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCC T ss_conf 9999999999984220999999984569998279 No 13 >cd00007 35EXOc 3'-5' exonuclease. The 35EXOc domain is responsible for the 3'-5' exonuclease proofreading activity of prokaryotic DNA polymerase I (pol I) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in E. coli pol I. 35EXOc is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Probab=100.00 E-value=7.8e-38 Score=239.13 Aligned_cols=151 Identities=32% Similarity=0.329 Sum_probs=134.7 Q ss_pred CCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHCCCCCH Q ss_conf 82799717778985445079999841894289974657785706776777323640230100001688865420000000 Q gi|254780636|r 21 DAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFGVRVRPV 100 (207) Q Consensus 21 ~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~~~~i 100 (207) +.||+||||+|++++++++|+||+|++++.+|++.........+.|+++|+|+++.|||||++||+.+|.+++++.++++ T Consensus 1 ~~ia~D~Et~~l~~~~~~i~~iql~~~~~~~yi~~~~~~~~~~~~l~~~l~~~~i~Kv~hn~k~D~~~L~~~~~~~~~~~ 80 (155) T cd00007 1 KEVAFDTETTGLNYHRGKLVGIQIATAGEAAYIPDELELEEDLEALKELLEDEDITKVGHDAKFDLVVLARDGIELPGNI 80 (155) T ss_pred CEEEEECCCCCCCCCCCCEEEEEEEECCCEEEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHCCCCCCCCE T ss_conf 98999898799977888099999998993599981475524699999998299973898734000999986286013501 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCC----CCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 457898865211000000887776542002467520036565----44323689999864599999999999999 Q gi|254780636|r 101 FCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSA----DDLSDEQLQYAASDVVHLHALRLQFTEKL 171 (207) Q Consensus 101 ~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~----rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L 171 (207) ||||+|++++++...++||++++++++|.+++|..+.++|+. ||++++|+.|||.||.++++||+.+.++| T Consensus 81 ~Dt~iaa~ll~~~~~~~~L~~l~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~~~~YAa~D~~~~~~l~~~l~~~l 155 (155) T cd00007 81 FDTMLAAYLLNPGEGSHSLDDLAKEYLGIELDKDEQIYGKGAKTFARPLSEELLEYAAEDADALLRLYEKLLEEL 155 (155) T ss_pred EHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 149999998477855236999999983987674888617765444479989999999999999999999999659 No 14 >PRK07556 consensus Probab=100.00 E-value=4.1e-35 Score=223.29 Aligned_cols=190 Identities=23% Similarity=0.203 Sum_probs=160.7 Q ss_pred CCHHH-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCC------------------CHHHHHHHH Q ss_conf 88578-42489827997177789854450799998418942899746577857------------------067767773 Q gi|254780636|r 11 IPAEC-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKN------------------APNLVGMLV 71 (207) Q Consensus 11 l~~~~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~------------------~~~L~~ll~ 71 (207) |...+ .+...+.++|||||+++++..+++++|.+|...+.++++|+.+.... ...|+++|+ T Consensus 377 L~~li~~l~~~~~~a~DtETtsld~~~a~lVGisls~~~~~a~YIPl~h~~~~~~~~~~~~~~~ql~~~~vL~~LkplLe 456 (977) T PRK07556 377 LKAWIARARETGIVAFDTETTSLDPMQAELVGFSLALAPGRACYIPLGHKSGDGDLFGGGLAEGQIPLRDALAALKPVLE 456 (977) T ss_pred HHHHHHHHHHCCEEEEEEECCCCCHHHCCEEEEEEEECCCCEEEEECCCCCCCCCHHCCCCCHHHCCHHHHHHHHHHHHC T ss_conf 99999998767958999731887830155357999806897899961444564201102222323059999999898853 Q ss_pred CCCCCEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC-------CCCEECCCCCCC Q ss_conf 2364023010000168886542000000045789886521100000088777654200246-------752003656544 Q gi|254780636|r 72 DEKREKIFHYGRFDIAVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINIS-------KAQQSSDWSADD 144 (207) Q Consensus 72 d~~i~KV~Hn~~~D~~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ld-------K~~q~SdW~~rp 144 (207) |+.+.|||||+|+|..+|.++ |+.+.+++||||||+++++...+|+|..|+++|||.+.. |......+...| T Consensus 457 d~~i~KighnlK~d~~vL~~~-gi~l~~i~DtmLaaYLLdp~~~~~~Ld~La~~yL~~~~~~~~dl~g~gk~~~~f~~v~ 535 (977) T PRK07556 457 DPSVLKIGQNMKYDALVLARH-GITVAPFDDTMLISYALDAGRGGHGMDALSERWLGHTPIAYKDLTGSGKSAITFDQVD 535 (977) T ss_pred CCCHHHHHHHHHHHHHHHHHC-CCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCHHCCCCCCCCCCHHHHH T ss_conf 873114441256889999976-9602685318888888348866478799999964887621000034550104765520 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 32368999986459999999999999997491899999-996400689998667988884 Q gi|254780636|r 145 LSDEQLQYAASDVVHLHALRLQFTEKLQRLGRSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 145 Ls~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) + +.+..|||.||.++++||+.|.++|.+.++..+|++ |+++++++++||.+|++ +|. T Consensus 536 ~-~~~~~YaaedA~~t~~L~~~L~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~-vD~ 593 (977) T PRK07556 536 I-DRATAYAAEDADVTLRLWQVLKPRLAAEGLTTVYERLERPLVPVLARMEERGIS-VDR 593 (977) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCEE-ECH T ss_conf 6-899999999999999999999999984235789999997899999999856969-869 No 15 >PRK06887 consensus Probab=100.00 E-value=2e-33 Score=213.52 Aligned_cols=189 Identities=21% Similarity=0.196 Sum_probs=155.1 Q ss_pred CCHHH-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCC---------------------------- Q ss_conf 88578-4248982799717778985445079999841894289974657785---------------------------- Q gi|254780636|r 11 IPAEC-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQK---------------------------- 61 (207) Q Consensus 11 l~~~~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~---------------------------- 61 (207) +...+ .+..++.+|+||||+++++..+.+++|.+|..++.++++|+.+... T Consensus 346 l~~~~~~l~~~~~va~DtEt~~ld~~~a~lVGis~s~~~g~a~YiPl~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~ql~ 425 (954) T PRK06887 346 LTSWIEKLNQAKLIAVDTETDSLDYMSANLVGISFALENGEACYIPLAHKQQVAEQPQSDLFAEEAETQTDYLLAPKQLN 425 (954) T ss_pred HHHHHHHHHHCCEEEEEEECCCCCCCCCCEEEEEEEECCCCEEEEECCCCCCCCCCCCHHHHHHCCCCHHHHHCCHHHCC T ss_conf 99999987637927999635998713266578999835897899864442100013200233200100122210232127 Q ss_pred ---CCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHCCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEE Q ss_conf ---706776777323640230100001688865420000000-4578988652110000008877765420024675200 Q gi|254780636|r 62 ---NAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFGVRVRPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQS 137 (207) Q Consensus 62 ---~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~~~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~ 137 (207) ....|+++|+|+++.|||||+|+|+.+|.++ |+.++++ |||||||+++++ ..+|+|.+|+++|||.+..+.... T Consensus 426 ~~~vl~~Lkplled~~i~KIghNlK~dl~vL~~~-Gi~l~g~~fDTmLAaYLLdp-~~r~~Ld~LA~~yL~~~~i~~edl 503 (954) T PRK06887 426 KSTCLAALKPLLENPDIHKIGQNIKYDLTIFANH-GIELQGVAFDTMLESYTLNS-TGRHNMDDLAERYLGHETIAFESI 503 (954) T ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHC-CCCCCCCCCCHHHHHHHHCC-CCCCCHHHHHHHHCCCCCCCHHHH T ss_conf 9999999899852975022211478899999977-96247765169899875187-555788999999638865334642 Q ss_pred -------CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf -------3656544323689999864599999999999999974-91899999-996400689998667988884 Q gi|254780636|r 138 -------SDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRL-GRSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 138 -------SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~-~~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) ..+...|+ +....|||.||.++++||+.|.++|.++ ++..+|++ |+++++++++||..|++ +|. T Consensus 504 ~gkGk~ql~f~~v~l-~~~~~YAaedA~itl~L~~~L~~~L~~~~~L~~Ly~~IE~PLi~VLa~ME~~GI~-vD~ 576 (954) T PRK06887 504 AGKGKNQLTFDQIPL-EQATEYAAEDADVTMKLHQVLWLELQKEPTLVKLYKTMELPLVHVLSRIERNGVL-IDS 576 (954) T ss_pred CCCCCCCCCHHHCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEE-ECH T ss_conf 254433378443446-7999999999999999999999999854547899999997999999999973858-879 No 16 >PRK05755 DNA polymerase I; Provisional Probab=100.00 E-value=2.4e-33 Score=213.04 Aligned_cols=189 Identities=23% Similarity=0.247 Sum_probs=159.1 Q ss_pred CCHHH-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCC-----CHHHHHHHHCCCCCEEECCCCH Q ss_conf 88578-42489827997177789854450799998418942899746577857-----0677677732364023010000 Q gi|254780636|r 11 IPAEC-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKN-----APNLVGMLVDEKREKIFHYGRF 84 (207) Q Consensus 11 l~~~~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~-----~~~L~~ll~d~~i~KV~Hn~~~ 84 (207) +...+ .+..++.+|+||||+++++..+++.+|.+|..++..+++|+.+.... ...|+++|+|+++.||+||+|+ T Consensus 309 l~~ll~~l~~~~~va~dtEttsld~~~a~lVGis~s~~~g~a~YIPl~~~~~~~~~~vl~~Lk~lLed~~i~KIghNlK~ 388 (889) T PRK05755 309 LEAWLAKLKKAGLVAFDTETTSLDPMSANLVGLSFAVEEGEAAYIPLAHDDQLDRDDALAALKPLLEDPAIPKVGQNLKY 388 (889) T ss_pred HHHHHHHHHCCCEEEEEEECCCCCCCCCCEEEEEEECCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCEEEECHHH T ss_conf 99999986528849999841788756576259999827996899844431336889999999999709887243203189 Q ss_pred HHHHHHHHHHCCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCE------ECCCCCCCCCHHHHHHHHHHH Q ss_conf 1688865420000000-457898865211000000887776542002467520------036565443236899998645 Q gi|254780636|r 85 DIAVLFYTFGVRVRPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQ------SSDWSADDLSDEQLQYAASDV 157 (207) Q Consensus 85 D~~~L~~~~gi~~~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q------~SdW~~rpLs~~Qi~YAA~Dv 157 (207) |+.+|.++ |+.+.++ |||||||+++++...+|+|.+|+++|||..+.+... ..++...|+ +....|||.|| T Consensus 389 dl~vL~~~-GI~l~g~~fDTmLAaYLLdP~~~~~~Ld~La~~yL~~~~~~~eel~gkg~~~~~~~v~~-e~~~~YaaedA 466 (889) T PRK05755 389 DRHILANY-GIELRGIAFDTMLASYLLDPGAGRHDLDSLAERYLGHKTISFEEVAGKGNQLTFAQVAL-EEAAEYAAEDA 466 (889) T ss_pred HHHHHHHC-CEECCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCHHHHCCCCCCCCCCCCCH-HHHHHHHHHHH T ss_conf 99999977-92357763019999987478877677899999863234420343136565566100787-89999999988 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHH-HHHHHHHHHHHHHCCCCCCC Q ss_conf 9999999999999997491899999-99640068999866798888 Q gi|254780636|r 158 VHLHALRLQFTEKLQRLGRSDLATS-CCNFLMDRAELDLLGWENVD 202 (207) Q Consensus 158 ~~l~~L~~~l~~~L~~~~~~~~~~~-e~~~l~~~~~me~~g~~~~~ 202 (207) .++++||+.|.++|.++++..+|++ |++++|++++||..|++ +| T Consensus 467 ~~~~~L~~~L~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~-VD 511 (889) T PRK05755 467 DVTLRLHQVLKPELEEEGLLELYEEIELPLVPVLARMERNGIK-VD 511 (889) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCEE-EC T ss_conf 6899999999999986047889999875199999999960868-66 No 17 >PRK07997 consensus Probab=100.00 E-value=2.1e-33 Score=213.34 Aligned_cols=190 Identities=22% Similarity=0.219 Sum_probs=156.3 Q ss_pred CCHHH-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCC----------HHHHHHHHCCCCCEEE Q ss_conf 88578-424898279971777898544507999984189428997465778570----------6776777323640230 Q gi|254780636|r 11 IPAEC-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNA----------PNLVGMLVDEKREKIF 79 (207) Q Consensus 11 l~~~~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~----------~~L~~ll~d~~i~KV~ 79 (207) +.+.+ .++.++++++||||+++++..+.+.++++|...+..+++++.+....+ ..|+++|+|+++.||| T Consensus 339 l~~~~~~l~~~~~~a~DtEt~sld~~~a~lvGiS~s~~~g~a~YiPl~h~~l~~~~ql~~~~vl~~LkplLed~~i~KIg 418 (928) T PRK07997 339 LKAWIEKLKKAPVFAFDTETDSLDNISANLVGLSFAIEPGVAAYLPVAHDYLDAPDQISRERVLELLKPLLEDEKALKVG 418 (928) T ss_pred HHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCEEEEECCCCEEEEECCCCCCCCHHHCCHHHHHHHHHHHHCCCCCCHHC T ss_conf 99999986429869999512786600024443389862774687402212347600124999999989875286530321 Q ss_pred CCCCHHHHHHHHHHHCCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC-------CCEECCCCCCCCCHHHHH Q ss_conf 100001688865420000000-457898865211000000887776542002467-------520036565443236899 Q gi|254780636|r 80 HYGRFDIAVLFYTFGVRVRPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINISK-------AQQSSDWSADDLSDEQLQ 151 (207) Q Consensus 80 Hn~~~D~~~L~~~~gi~~~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK-------~~q~SdW~~rpLs~~Qi~ 151 (207) ||+|+|+.+|.++ |+.+.++ |||||||+++++...+|+|..|+++|||....+ ......|...|+ +.... T Consensus 419 hnlK~d~~vL~~~-GI~l~gi~fDTmLAaYLLnp~~~~~~Ld~La~~yL~~~~i~~~el~gkgk~q~~f~~v~~-~~~~~ 496 (928) T PRK07997 419 QNLKFDRGILANY-GIELRGIAFDTMLESYILNSVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIAL-EEAGR 496 (928) T ss_pred CHHHHHHHHHHHC-CCEECCCCCCHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCHHHHCCCCCCCCCHHHHHH-HHHHH T ss_conf 2168899999977-941056430345532222443344788999999713466433543244645477435359-99999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-CCCHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 9986459999999999999997-491899999-996400689998667988884 Q gi|254780636|r 152 YAASDVVHLHALRLQFTEKLQR-LGRSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 152 YAA~Dv~~l~~L~~~l~~~L~~-~~~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) |||.||.++++||..|.++|.+ +++.++|++ |+++++++++||.+|++ +|. T Consensus 497 Ya~edA~~t~~L~~~L~~~L~~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~-vD~ 549 (928) T PRK07997 497 YAAEDADVTLQLHLKMWPKLQQHKGPLNVFENIEMPLVPVLSRIERNGVK-IDP 549 (928) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEE-ECH T ss_conf 99999999999999999998621449999999998999999999853959-989 No 18 >PRK07456 consensus Probab=100.00 E-value=3.1e-32 Score=206.56 Aligned_cols=177 Identities=25% Similarity=0.280 Sum_probs=148.3 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEEE--CCCCEEEECCCCCCC--------CCHHHHHHHHCCCCCEEECCCCHHHHHH Q ss_conf 982799717778985445079999841--894289974657785--------7067767773236402301000016888 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRLCIVQLSP--GDGTVDIIRIAAGQK--------NAPNLVGMLVDEKREKIFHYGRFDIAVL 89 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l~LiQl~~--~~~~~~l~~~~~~~~--------~~~~L~~ll~d~~i~KV~Hn~~~D~~~L 89 (207) +..+|+||||+++++.++++++|.+|. ..+.++++|+.+... ....|+++|+|+++.||+||+|+|..+| T Consensus 382 ~~~vA~DtETtsLdp~~A~lVGis~s~~~~~~~a~YIPl~h~~~~~~l~~~~vl~~LkpiLed~~i~KighNlKyD~~vL 461 (975) T PRK07456 382 EAPVAWDTETTSLNPRDAELVGIGCCWGEELDDLAYIPLGHQQAGEQLPLETVLEALRPWLESPEHPKALQNAKFDRLIL 461 (975) T ss_pred CCEEEEECCCCCCCHHHCCEEEEEEECCCCCCCEEEEECCCCCCHHHCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH T ss_conf 76589980468877423210266641155787269977345561310579999999999862887531021114679999 Q ss_pred HHHHHCCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC-------CCCCEECCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 65420000000-4578988652110000008877765420024-------675200365654432368999986459999 Q gi|254780636|r 90 FYTFGVRVRPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINI-------SKAQQSSDWSADDLSDEQLQYAASDVVHLH 161 (207) Q Consensus 90 ~~~~gi~~~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~l-------dK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~ 161 (207) .++ |+.++++ |||||||+++++ ..+|+|.+|+++|||... .|+.. +...|+ +.+..|||.||.+++ T Consensus 462 ~~~-GI~l~g~~fDTmLAsYLLnP-~~~h~Ld~La~~yL~~~~i~~edl~gkgk~---~~~v~~-~~~~~ya~~dA~~~~ 535 (975) T PRK07456 462 LRH-GIALKGVVFDTLLADYLLNP-EAKHNLDDLAQRELGFTPTSYKDLVGKGQT---FADVDI-ATASQYCGMDVYLTY 535 (975) T ss_pred HHC-CCCCCCCCCCHHHHHHHHCC-CCCCCHHHHHHHHHCCCCCCHHHHCCCCCC---CCCCCH-HHHHHHHHHHHHHHH T ss_conf 976-95106755139999987687-655898999999718887749986288876---020897-899999999999999 Q ss_pred HHHHHHHHHHHHCC--CHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999999749--1899999-996400689998667988884 Q gi|254780636|r 162 ALRLQFTEKLQRLG--RSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 162 ~L~~~l~~~L~~~~--~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) +||..|.++|++.+ +.++|++ |+++++++++||..|++ +|. T Consensus 536 ~L~~~l~~~L~~~~~~L~~l~~~iE~PLi~VLa~ME~~GI~-vD~ 579 (975) T PRK07456 536 RLAPILREQLAKTGPELIKLLEEVELPLEPVLAEMEATGIR-IDI 579 (975) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCH T ss_conf 99999999998623699999999873699999999971850-169 No 19 >PRK07625 consensus Probab=100.00 E-value=9.4e-32 Score=203.80 Aligned_cols=189 Identities=20% Similarity=0.195 Sum_probs=155.1 Q ss_pred CCHHH-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCC----------CHHHHHHHHCCCCCEEE Q ss_conf 88578-42489827997177789854450799998418942899746577857----------06776777323640230 Q gi|254780636|r 11 IPAEC-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKN----------APNLVGMLVDEKREKIF 79 (207) Q Consensus 11 l~~~~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~----------~~~L~~ll~d~~i~KV~ 79 (207) +...+ .+..++.+++||||+++++..+++.+|.+|...+..+++|+.+...+ ...|+++|+|+++.||+ T Consensus 334 l~~~~~~l~~~~~~a~dtEttsld~~~a~lvGis~s~~~g~a~YiPl~h~~~d~~~~l~~~~vl~~Lk~~Led~~i~KIg 413 (922) T PRK07625 334 FDAWLAKISAAELTAFDTETTSLDPMLAQLVGLSFSVEPGHAAYIPVAHRGPDMPAQLPRDEVLARLRPWLEDAGRKKLG 413 (922) T ss_pred HHHHHHHHHCCCEEEEEEECCCCCHHHCEEEEEEEEECCCCEEEEEECCCCCCHHHHCCHHHHHHHHHHHHHCCCCHHHH T ss_conf 99999986408848999522776500130565797743886799970215865232158999999879987434403554 Q ss_pred CCCCHHHHHHHHHHHCCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC-------CCEECCCCCCCCCHHHHH Q ss_conf 100001688865420000000-457898865211000000887776542002467-------520036565443236899 Q gi|254780636|r 80 HYGRFDIAVLFYTFGVRVRPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINISK-------AQQSSDWSADDLSDEQLQ 151 (207) Q Consensus 80 Hn~~~D~~~L~~~~gi~~~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK-------~~q~SdW~~rpLs~~Qi~ 151 (207) ||+|+|+.+|.++ |+.+.++ |||||||+++++ ..+|+|.+|+++|||....+ ....-.+...|+ +.... T Consensus 414 hNlK~d~~vL~~~-GI~l~g~~fDTmLAaYLL~~-~r~~~Ld~La~~yL~~~~i~~edl~Gkg~~q~~~~~v~~-~~~~~ 490 (922) T PRK07625 414 QHLKYDAQVLANY-GIALNGIEHDTLLESYVLES-HRTHDMDSLALRHLGVKTIKYEDVAGKGAKQIGFDEVAL-AQASE 490 (922) T ss_pred HHHHHHHHHHHHC-CCCCCCCCHHHHHHHHHHCC-CCCCCHHHHHHHHHCCCCCCHHHHCCCCCCCCCHHHCCH-HHHHH T ss_conf 2577899999977-96458754039999887546-767887999998725565541321233333378322467-99999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHC-CCHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 99864599999999999999974-91899999-996400689998667988884 Q gi|254780636|r 152 YAASDVVHLHALRLQFTEKLQRL-GRSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 152 YAA~Dv~~l~~L~~~l~~~L~~~-~~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) |||.|+.++++||..|.++|.++ ++..+|++ |+++++++++||.+|++ +|. T Consensus 491 Ya~eda~~t~~L~~~L~~~L~~e~~L~~ly~~iE~PLi~VLa~ME~~GI~-vD~ 543 (922) T PRK07625 491 YAAEDADITLQLHHALYPQVAREPGLERVYRDIELPVSLVLRKMERTGVL-IDA 543 (922) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCH T ss_conf 99999999999999999998340547999999995899999999982840-269 No 20 >PRK08786 consensus Probab=100.00 E-value=2.9e-32 Score=206.77 Aligned_cols=190 Identities=20% Similarity=0.211 Sum_probs=156.5 Q ss_pred CCHHH-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCC----------HHHHHHHHCCCCCEEE Q ss_conf 88578-424898279971777898544507999984189428997465778570----------6776777323640230 Q gi|254780636|r 11 IPAEC-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNA----------PNLVGMLVDEKREKIF 79 (207) Q Consensus 11 l~~~~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~----------~~L~~ll~d~~i~KV~ 79 (207) +...+ .++.++.+++||||+++++..+.+.+|.+|...+..+++++.+..... ..|+++|+|+.+.||+ T Consensus 338 L~~~i~~l~~~~~~afdtET~sld~~~a~lVGiS~s~~~g~a~YiP~~h~~~g~~~qld~~~~l~~LkplLed~~I~KIg 417 (927) T PRK08786 338 LDSWIARLRAAGQFAFDTETDSLDPLQADLIGLSVAAEPGQAAYLPFGHDFPGAPVQLDRTQALAQLAPLLTDPAVRKVG 417 (927) T ss_pred HHHHHHHHHHCCEEEEEEECCCCCHHHHHHHCEEEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHH T ss_conf 99999998736838999723886655531212245426786677532445678603413999999989986086511232 Q ss_pred CCCCHHHHHHHHHHHCCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC-------CCEECCCCCCCCCHHHHH Q ss_conf 100001688865420000000-457898865211000000887776542002467-------520036565443236899 Q gi|254780636|r 80 HYGRFDIAVLFYTFGVRVRPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINISK-------AQQSSDWSADDLSDEQLQ 151 (207) Q Consensus 80 Hn~~~D~~~L~~~~gi~~~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK-------~~q~SdW~~rpLs~~Qi~ 151 (207) ||+|+|+.+|.++ |+.+.++ |||||||+++++...+|+|..|+++|||.++.+ ......|+..|+ ++... T Consensus 418 hnlK~Dl~vL~~~-GI~l~g~~~DTmLAaYLL~p~~~~~~Ld~La~~yLg~~~~~~~~l~gkg~k~~~~~~v~~-~~~~~ 495 (927) T PRK08786 418 QHGKYDLHVMRRH-GIALAGYADDTLLESFVLNSGSARHDMDSLAKRYLGYDTVKYEDVCGKGAKQIPFAQVSL-EDATR 495 (927) T ss_pred HHHHHHHHHHHHC-CCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCCCCCHHHCCH-HHHHH T ss_conf 1178889999967-971167664258888623776454788999999669775426754244322288333259-99999 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HCCCHHHHH-HHHHHHHHHHHHHHCCCCCCCC Q ss_conf 998645999999999999999-749189999-9996400689998667988884 Q gi|254780636|r 152 YAASDVVHLHALRLQFTEKLQ-RLGRSDLAT-SCCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 152 YAA~Dv~~l~~L~~~l~~~L~-~~~~~~~~~-~e~~~l~~~~~me~~g~~~~~~ 203 (207) |||.|+.++++||..|.++|. +.++..+|+ .|+++++++++||.+|++ +|. T Consensus 496 Ya~eda~~tl~L~~~l~~kL~~e~~l~~ly~~iE~PLi~VLa~ME~~GI~-vD~ 548 (927) T PRK08786 496 YAAEDADITLRLHCVLGPKLAAEPGLERVYRDIEMPLVDVLERIEANGVC-VDA 548 (927) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEE-ECH T ss_conf 99999999999999999999850529999999997999999998856879-889 No 21 >cd06128 DNA_polA_exo The 3'-5' exonuclease domain of family-A DNA polymerases has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-A DNA polymerases contain a DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-B DNA polymerases. The exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, which are clustered around the active site and contain four invariant acidic residues that serve as ligands for the two metal ions required for catalysis. The Klenow fragment (KF) of Escherichia coli pol I, the Thermus aquaticus (Taq) pol I, and Bacillus stearothermophilus (BF) pol I are examples of family-A DNA polymerases. They are involved in nucleotide excision repair and in the processing of Okazaki fragments that are generated during lagging strand synthesis. The N-terminal domains of BF pol I and Taq pol I resemble the fold of the 3'-5' exonuclease domain of KF Probab=99.98 E-value=5.8e-32 Score=205.03 Aligned_cols=147 Identities=24% Similarity=0.255 Sum_probs=125.6 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCC----------HHHHHHHHCCCCCEEECCCCHHHHHH Q ss_conf 98279971777898544507999984189428997465778570----------67767773236402301000016888 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNA----------PNLVGMLVDEKREKIFHYGRFDIAVL 89 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~----------~~L~~ll~d~~i~KV~Hn~~~D~~~L 89 (207) ++++||||||+|+++.++++++|.+|...+..+++|+.+..... +.|+++|+|+++.|||||+|||+.+| T Consensus 1 a~~~a~DtETt~ld~~~a~iVGis~s~~~~~a~YiP~~h~~~~~~~~l~~~~vl~~L~p~led~~i~KighNlK~D~~vL 80 (166) T cd06128 1 APVFAFDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRERALELLKPLLEDEKALKVGQNLKYDRGIL 80 (166) T ss_pred CCEEEEECCCCCCCCCCCEEEEEEEECCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEECHHHHHHHH T ss_conf 99899978879989043549999998008969999526776675257789999999999973999764531156679999 Q ss_pred HHHHHCCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC-------CCCCEECCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 65420000000-4578988652110000008877765420024-------675200365654432368999986459999 Q gi|254780636|r 90 FYTFGVRVRPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINI-------SKAQQSSDWSADDLSDEQLQYAASDVVHLH 161 (207) Q Consensus 90 ~~~~gi~~~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~l-------dK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~ 161 (207) .++ |+.++++ |||||||+++++...+|+|+.|++++||.++ .|+.....++..|+ +....|||+||.+++ T Consensus 81 ~~~-Gi~l~g~~fDTMlaaylldp~~~~h~Ld~La~~~L~~~~i~~~dl~GkGk~q~~f~~v~i-~~~~~YA~eDAd~t~ 158 (166) T cd06128 81 ANY-GIELRGIAFDTMLESYILNSVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIAL-EEAGRYAAEDADVTL 158 (166) T ss_pred HHC-CCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCEEHHHHHCCCCCCCCCCCCCH-HHHHHHHHHHHHHHH T ss_conf 619-966899984576798762899988996999999839987059998378857589322899-999998999999999 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254780636|r 162 ALRLQFT 168 (207) Q Consensus 162 ~L~~~l~ 168 (207) +||+.|. T Consensus 159 rL~~~l~ 165 (166) T cd06128 159 QLHLKMW 165 (166) T ss_pred HHHHHHC T ss_conf 9998538 No 22 >PRK08835 consensus Probab=99.98 E-value=5.7e-32 Score=205.04 Aligned_cols=190 Identities=17% Similarity=0.170 Sum_probs=155.2 Q ss_pred CCHHH-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCC----------CHHHHHHHHCCCCCEEE Q ss_conf 88578-42489827997177789854450799998418942899746577857----------06776777323640230 Q gi|254780636|r 11 IPAEC-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKN----------APNLVGMLVDEKREKIF 79 (207) Q Consensus 11 l~~~~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~----------~~~L~~ll~d~~i~KV~ 79 (207) +...+ .+..++.+++||||+++++..+.+.+|.+|..++..+++++.+.... ...|+++|+|+.+.||+ T Consensus 341 l~~~l~~l~~~~~~a~dtEt~~ld~~~a~lvGis~~~~~g~a~YiPl~h~~l~~~~ql~~d~vl~~LkplLed~~i~KIg 420 (931) T PRK08835 341 FQLWLDKLKAAELFAFDTETDSLDYMVANLVGLSFAVAEGEAAYVPVAHDYLDAPEQLDRDWVLAQLKPLLEDDAKAKVG 420 (931) T ss_pred HHHHHHHHHCCCEEEEEEECCCCCHHHCCCCCEEEEECCCCEEEEEECCCCCCCHHHCCHHHHHHHHHHHHCCCCHHHHH T ss_conf 99999985338859999504874322213354268764785799971233345510136999999979984284243662 Q ss_pred CCCCHHHHHHHHHHHCCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC-------CCEECCCCCCCCCHHHHH Q ss_conf 100001688865420000000-457898865211000000887776542002467-------520036565443236899 Q gi|254780636|r 80 HYGRFDIAVLFYTFGVRVRPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINISK-------AQQSSDWSADDLSDEQLQ 151 (207) Q Consensus 80 Hn~~~D~~~L~~~~gi~~~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK-------~~q~SdW~~rpLs~~Qi~ 151 (207) ||+|+|+.+|.++ |+.+.++ ||||||++++++...+|+|..|+.++|+...-+ +...-.+...++ +.... T Consensus 421 hNlK~d~~vL~~~-gI~l~gi~fDTmLAaYLLnp~~~~~~Ld~La~~~L~~~~i~~e~l~gkgk~q~~f~~v~~-~~~~~ 498 (931) T PRK08835 421 QNLKYDASVLARY-GIEMKGIKHDTMLASYVYNSVGGKHDMDSLALRFLQHSCISFEQVAGKGKNQLTFNQIDL-EEAAP 498 (931) T ss_pred CCHHHHHHHHHHC-CCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCCCCCHHHHHH-HHHHH T ss_conf 2115789999977-977678661243575300477787898999998648776347765144534478334328-89999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-CCCHHHHH-HHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9986459999999999999997-49189999-9996400689998667988884 Q gi|254780636|r 152 YAASDVVHLHALRLQFTEKLQR-LGRSDLAT-SCCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 152 YAA~Dv~~l~~L~~~l~~~L~~-~~~~~~~~-~e~~~l~~~~~me~~g~~~~~~ 203 (207) |||.||.++++||+.|.++|.+ +++..+|+ .|+++++++++||.+|++ +|. T Consensus 499 YaaedA~~t~~L~~~L~~~L~~~~~L~~Ly~~IE~PLi~VLa~ME~~GI~-vD~ 551 (931) T PRK08835 499 YAAEDADVTLRLHNRLFANIEQDEKLKSVYEEIEMPLVPVLSRIERTGVL-IDD 551 (931) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEE-ECH T ss_conf 99999999999999999999854778899999986899999999972848-789 No 23 >cd06140 DNA_polA_I_Bacillus_like_exo Bacillus stearothermophilus-like Polymerase I (pol I), a subgroup of the family-A DNA polymerases, contains an inactive DnaQ-like 3'-5' exonuclease domain in the same polypeptide chain as the polymerase region. The exonuclease-like domain of these proteins possess the same fold as the Klenow fragment (KF) of Escherichia coli pol I, but does not contain the four critical metal-binding residues necessary for activity. The function of this domain is unknown. It might act as a spacer between the polymerase and the 5'-3' exonuclease domains. Some members of this subgroup, such as those from Bacillus sphaericus and Thermus aquaticus, are thermostable DNA polymerases. Probab=99.97 E-value=1.2e-30 Score=197.34 Aligned_cols=170 Identities=21% Similarity=0.189 Sum_probs=145.9 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHCCCCC Q ss_conf 98279971777898544507999984189428997465778570677677732364023010000168886542000000 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFGVRVRP 99 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~~~~ 99 (207) .+.+|+|+||+|++++++++++|.+|.+ +.++++++.+.....+.|+++|+|+++.||+||+|+|+.+|.++ |+.++| T Consensus 3 ~~~~av~~e~~~~~~~~~~i~gi~l~~~-~~~~yi~~~~~~~~~~~lk~~l~d~~i~Ki~hn~K~~~~~L~~~-Gi~l~g 80 (178) T cd06140 3 ADEVALYVELLGENYHTADIIGLALANG-GGAYYIPLELALLDLAALKEWLEDEKIPKVGHDAKRAYVALKRH-GIELAG 80 (178) T ss_pred CCCEEEEEECCCCCCCCCEEEEEEEEEC-CEEEEEECCCCHHHHHHHHHHHCCCCCCEEEECHHHHHHHHHHC-CCCCCC T ss_conf 8866999737999845354999999889-97999956673646999999972999988986579999999988-980688 Q ss_pred H-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCE----ECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 0-457898865211000000887776542002467520----03656544323689999864599999999999999974 Q gi|254780636|r 100 V-FCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQ----SSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRL 174 (207) Q Consensus 100 i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q----~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~ 174 (207) + |||||||+++++...+|+|++|+.+|++.++..... ..++.. +-.+.+..|||.||.++++||+.|.++|++. T Consensus 81 ~~fDtmiaayLl~p~~~~~~L~~L~~~yl~~~~~~~~~~~gk~~~~~~-~~~~~~~~ya~~~a~~~~~L~~~l~~~L~e~ 159 (178) T cd06140 81 VAFDTMLAAYLLDPTRSSYDLADLAKRYLGRELPSDEEVYGKGAKFAV-PDEEVLAEHLARKAAAILRLAPKLEEELEEN 159 (178) T ss_pred CHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCHHHHHCCCCCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 303688898875489777999999999828666547886176656243-9979999999999999999999999999982 Q ss_pred CCHHHHHH-HHHHHHHHHH Q ss_conf 91899999-9964006899 Q gi|254780636|r 175 GRSDLATS-CCNFLMDRAE 192 (207) Q Consensus 175 ~~~~~~~~-e~~~l~~~~~ 192 (207) ++.++|++ |+|+++++++ T Consensus 160 ~l~~Ly~~iE~PL~~VLa~ 178 (178) T cd06140 160 EQLELYYEVELPLAEVLAE 178 (178) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 4899999987129998189 No 24 >PRK07898 consensus Probab=99.97 E-value=2.2e-30 Score=195.81 Aligned_cols=183 Identities=19% Similarity=0.136 Sum_probs=154.0 Q ss_pred CCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCC-CCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHC Q ss_conf 24898279971777898544507999984189428997465778-57067767773236402301000016888654200 Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQ-KNAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFGV 95 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~-~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi 95 (207) ......+++|+|++++ ++.+.+.++.++..++..+++++.... .....|+++|+|+++.||+||+|+|+.+|.++ |+ T Consensus 334 ~~~~~~~a~~~~~t~~-~~~a~lvgisl~~~~g~a~yip~~~~~~~~~~~L~~~Led~~i~KIghNlK~d~~vL~~~-Gi 411 (902) T PRK07898 334 AGDGRRLGLAVVGTHL-PGDGDATALAIAAADGHAAYIDTADLTPDDEAALAAWLADPARPKALHEAKGAMHALAGR-GW 411 (902) T ss_pred HCCCCEEEEEEECCCC-CCCCCEEEEEEEEECCEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEECHHHHHHHHHHC-CC T ss_conf 4359869999853678-665738999999538808997220179789999999860877654554048999999966-98 Q ss_pred CCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCC---EECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00000-45789886521100000088777654200246752---003656544323689999864599999999999999 Q gi|254780636|r 96 RVRPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQ---QSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKL 171 (207) Q Consensus 96 ~~~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~---q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L 171 (207) .+.++ |||||||+++++...+|+|.+|+.+|||.++.... ...++...+. +....|||.||.++++||+.|.++| T Consensus 412 ~l~g~~fDTmLAaYLLdp~~~~~~Ld~La~~yL~~~~~~~~~~~~~~~~~~~~~-~~~~~yaa~dA~~~~~L~~~l~~~L 490 (902) T PRK07898 412 TLAGVTSDTALAAYLVRPGQRSFTLDDLSLRYLRRELRAEAPAQQQLSLDDAVD-AAAAQTLILRARAVLDLADALDAEL 490 (902) T ss_pred CCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCH-HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 657753228999987568878787999999974976651565354347132676-8999999999999999999999986 Q ss_pred HHCCCHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 97491899999-996400689998667988884 Q gi|254780636|r 172 QRLGRSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 172 ~~~~~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) ++.++..+|++ |+++++++++||.+|++ +|. T Consensus 491 ~~~~l~~L~~~IE~PLi~VLa~ME~~GI~-vD~ 522 (902) T PRK07898 491 ARIDSAALLRDMELPLQRVLAEMERAGIA-VDL 522 (902) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCEE-ECH T ss_conf 16558999999760589999999850819-819 No 25 >PRK05797 consensus Probab=99.97 E-value=9.1e-30 Score=192.25 Aligned_cols=169 Identities=17% Similarity=0.133 Sum_probs=145.9 Q ss_pred CCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCC---CCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHC Q ss_conf 8982799717778985445079999841894289974657785---7067767773236402301000016888654200 Q gi|254780636|r 19 YVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQK---NAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFGV 95 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~---~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi 95 (207) ....+++|+||+++++..+.++.+.++...+.++++++..... ....|+++|+|+++.|||||+|+|+.+|.++ |+ T Consensus 318 ~~~~~~~~~ett~~~~~~~~~vg~s~~~~~~~a~yip~~~~~~~~~~L~~Lk~lLed~~i~KIGhNlK~dl~vL~~~-GI 396 (869) T PRK05797 318 KNNQVYINFTLEDENLYSKIEIKKIFINFGEKTYYIDFKELIDEEEFIEDLKEIFENEEIKKIGHDIKNFLTILKKL-GI 396 (869) T ss_pred HCCCEEEEEECCCCCHHHHHHEEEEEEECCCEEEEEECHHHCCHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHC-CC T ss_conf 34987999973897612030418999961997999831451677899999999984899878984306999999975-98 Q ss_pred CCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 00000-45789886521100000088777654200246752003656544323689999864599999999999999974 Q gi|254780636|r 96 RVRPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRL 174 (207) Q Consensus 96 ~~~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~ 174 (207) .+.++ |||||||+++++...+|+|.+|+++|||..+.. ..-.|||.||.++++||+.|.++|++. T Consensus 397 ~l~gi~FDTMLAsYLLnP~~~~h~Ld~La~~yL~~~~~~--------------~e~~ya~~dA~~~~~L~~~L~~~L~~~ 462 (869) T PRK05797 397 EFKGLAFDTAIAAYLIDPSKSEYDLSDLAKEYLSKDVNS--------------EEEEYKIKEVSVMKELYEKLKEKIEEM 462 (869) T ss_pred CCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCC--------------HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 556616689999997289988889999999980877751--------------478999999999999999999999861 Q ss_pred CCHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 91899999-996400689998667988884 Q gi|254780636|r 175 GRSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 175 ~~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) ++..+|++ |++++|++++||.+|++ +|. T Consensus 463 ~l~~L~~~IE~PLi~VLa~ME~~GI~-VD~ 491 (869) T PRK05797 463 DMEKLLYEVELPLVEVLASMESEGFK-VDK 491 (869) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCEE-ECH T ss_conf 28999999986789999999846809-999 No 26 >PRK08434 consensus Probab=99.96 E-value=1.8e-28 Score=184.76 Aligned_cols=187 Identities=21% Similarity=0.259 Sum_probs=147.8 Q ss_pred CCHHH-HCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCC----------HHHHHHHHCCCCCEEE Q ss_conf 88578-424898279971777898544507999984189428997465778570----------6776777323640230 Q gi|254780636|r 11 IPAEC-AARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNA----------PNLVGMLVDEKREKIF 79 (207) Q Consensus 11 l~~~~-~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~----------~~L~~ll~d~~i~KV~ 79 (207) +...+ .+...+.+++||||+++++..+++.++.+|..++..+++|+.+..... ..+++++++ .+|| T Consensus 308 l~~~l~~l~~~~~~a~d~Ett~ld~~~a~lvGis~~~~~~~a~YiPi~h~~l~~~~~~~~~~~~~~l~~~~~~---~~Ig 384 (887) T PRK08434 308 LEEILNTLPKESIIAFDTETTGLDTKEAKLVGFSFCFNEEEAYYVPLAHSYLGVGEQISLESAKKAIEKIFNH---FVIG 384 (887) T ss_pred HHHHHHHCCCCCEEEEEEECCCCCCCCCCEEEEEEEECCCCEEEEECCCCCCCCHHHCCHHHHHHHHHHHHHC---CCHH T ss_conf 9999985311676999951278661334305899995598589997521125773335999999997876337---4036 Q ss_pred CCCCHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCC----EECCCCCCCCCHHHHHHHH Q ss_conf 10000168886542000000-045789886521100000088777654200246752----0036565443236899998 Q gi|254780636|r 80 HYGRFDIAVLFYTFGVRVRP-VFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQ----QSSDWSADDLSDEQLQYAA 154 (207) Q Consensus 80 Hn~~~D~~~L~~~~gi~~~~-i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~----q~SdW~~rpLs~~Qi~YAA 154 (207) ||+|+|+.+|.+++|+.+.+ +|||||||+++++. .+|+|..|+++|++.++.+.. .-.++...|+ +....||| T Consensus 385 hNlKyDl~vL~~~~gi~l~g~~fDTmLAaYLLdp~-~r~~Ld~La~~yL~~~~i~~e~l~~kg~~f~~v~i-~~a~~YAa 462 (887) T PRK08434 385 HNLKYDFKIIQNNFGLELPQKYADTMILAWLKDPS-SRVGLDDLAKRLFNYEMIHFESVVKKGENFSSVDL-EKACKYAA 462 (887) T ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCC-CCCCHHHHHHHHCCCCCCCHHHHHCCCCCCCCCCH-HHHHHHHH T ss_conf 47799999999856987777314899999866986-66666789999708787748998546777020788-99999999 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCH---HHHH-HHHHHHHHHHHHHHCCCCCCCC Q ss_conf 64599999999999999974918---9999-9996400689998667988884 Q gi|254780636|r 155 SDVVHLHALRLQFTEKLQRLGRS---DLAT-SCCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 155 ~Dv~~l~~L~~~l~~~L~~~~~~---~~~~-~e~~~l~~~~~me~~g~~~~~~ 203 (207) .||.++++||..|.++|++.+.. ++|+ .|+++++++++||.+|++ +|. T Consensus 463 edA~it~~L~~~l~~~L~~~~~~~L~~Ly~~IElPLi~VLa~ME~~GI~-VD~ 514 (887) T PRK08434 463 EDAWITLRFYKKFLENLEKRGATHLLELAKNVEFPFIKTLLMMEENGIK-LDT 514 (887) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCH T ss_conf 9999999999999999986551779999999886599999999971950-269 No 27 >PRK08076 consensus Probab=99.96 E-value=3.8e-28 Score=182.82 Aligned_cols=180 Identities=16% Similarity=0.130 Sum_probs=146.1 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHCCCCC Q ss_conf 98279971777898544507999984189428997465778570677677732364023010000168886542000000 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFGVRVRP 99 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~~~~ 99 (207) ....++|+||++.++....+.++.+|.+++ +++++.... .....++++++|+++.|++||+|+|+.+|.++ |+.+.+ T Consensus 314 ~~~~a~~~Et~~~~~~~a~lvGis~s~~~~-a~yiP~~~~-~~~~~lk~~Led~~i~KighNlK~dl~vL~~~-GI~l~g 390 (877) T PRK08076 314 QQKSALVVEVLEDNYHKADIQGFAIVNENG-CYFIPTEIA-LKSDAFKEWLEDEEKKKWVFDAKRAIVALKWN-GIDLQG 390 (877) T ss_pred CCCCEEEEEECCCCCCCCCEEEEEEECCCC-EEEECCCHH-HHHHHHHHHHCCCCCCEEEECHHHHHHHHHHC-CCCCCC T ss_conf 688549999647885637789999985894-599550044-21589999853976540265479999999968-964477 Q ss_pred H-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCE---ECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 0-457898865211000000887776542002467520---036565443236899998645999999999999999749 Q gi|254780636|r 100 V-FCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQ---SSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLG 175 (207) Q Consensus 100 i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q---~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~ 175 (207) + |||||||+++++...+|+|..|+++|++..+..... .+--...|-.+....|||.|+..+++||+.|.++|.+.+ T Consensus 391 i~FDTMLAAYLLdP~~~~~~Ld~La~~yl~~~~~~~e~v~Gkg~k~~~p~~~~~~eYaaedA~~t~~L~~~L~~~L~~~~ 470 (877) T PRK08076 391 IDFDLLLAAYLLNPADSDDDFASVAKMKETHAVKSDEAVYGKGAKRAVPEEEILAEHLARKAHAIYDLKQTFVEELEKNE 470 (877) T ss_pred CCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 64029999987188767666999999855777763476537652224877899999999999999999999999986557 Q ss_pred CHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 1899999-996400689998667988884 Q gi|254780636|r 176 RSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 176 ~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) +..+|++ |++++|++++||.+|++ +|. T Consensus 471 l~~Ly~~IE~PLi~VLa~ME~~GI~-VD~ 498 (877) T PRK08076 471 QLELFEELELPLARVLAEMEVKGVK-VDT 498 (877) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCEE-ECH T ss_conf 8999999970488999999854838-989 No 28 >PRK05929 consensus Probab=99.95 E-value=1.3e-26 Score=173.90 Aligned_cols=181 Identities=17% Similarity=0.108 Sum_probs=142.9 Q ss_pred CCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCC-CCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHCCC Q ss_conf 8982799717778985445079999841894289974657785-706776777323640230100001688865420000 Q gi|254780636|r 19 YVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQK-NAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFGVRV 97 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~-~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~~ 97 (207) ..+.+++|+||++.++...++.++.+|.++ .++++++.+... ....|+++|+|+++.|||||+|+|..+|.++ |+.+ T Consensus 309 ~~~~~a~~~~~~~~~~~~~~lvGisls~~~-~a~yip~~~~~~~vl~~Lk~~led~~i~kig~NlK~d~~vL~~~-GI~l 386 (870) T PRK05929 309 QGGEVAFCVAYTGNHLPSLKLHGVALAGGS-QVFYIEVEESGDQEIALLKDFFARKDTTFYGYNLKRDNHALKNA-GIDV 386 (870) T ss_pred CCCCEEEEEEECCCCCCCCCEEEEEEECCC-EEEEEECCCCCHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHC-CCCC T ss_conf 179747999745887677728899997698-08999765555238999999860987745322377799999968-9856 Q ss_pred CCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC----CCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 000-4578988652110000008877765420024----67520036565443236899998645999999999999999 Q gi|254780636|r 98 RPV-FCTKIASRLTRTYTNQHGLKDNLKELLGINI----SKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQ 172 (207) Q Consensus 98 ~~i-~DT~ias~l~~~~~~~~~L~~L~~~~lg~~l----dK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~ 172 (207) .++ ||||+|++++++ ..++++..+..++..... .|......-....+++....|||.|+.+++.|++.|.++|+ T Consensus 387 ~g~~fDtmLAsYLLn~-~~~~~~~~ll~~~~~~~~~~~~~k~~g~~~l~~~~~~~~~~~yaae~a~~~~~L~~~L~~~L~ 465 (870) T PRK05929 387 HEITADLALAEHLVNG-GAKISFQTLLVEHGHIEEAHRFAKEWGASSLPIQRLPEQPAQYFGEFVSYLPIIKNYLLEELE 465 (870) T ss_pred CCCHHHHHHHHHHHCC-CCCCCHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8727799999986299-998883999976667315556555407666642007388999999999999999999999998 Q ss_pred HCCCHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 7491899999-996400689998667988884 Q gi|254780636|r 173 RLGRSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 173 ~~~~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) +.++.++|++ |+++++++++||.+|++ +|. T Consensus 466 ~~~L~~Ly~eIE~PLi~VLa~ME~~GI~-vD~ 496 (870) T PRK05929 466 EKGLKDIFSNVEMPLEKVLFAMERAGMP-LDV 496 (870) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCEE-EEH T ss_conf 8779999999874489999998738989-819 No 29 >KOG2206 consensus Probab=99.94 E-value=1.3e-27 Score=179.76 Aligned_cols=156 Identities=27% Similarity=0.275 Sum_probs=139.2 Q ss_pred CCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHCC Q ss_conf 24898279971777898544507999984189428997465778570677677732364023010000168886542000 Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFGVR 96 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~ 96 (207) +.....+|+|+|+.+++.+.+-.|||||+|.+..+.++.+.. .....-|.+.+.||+|+||+|++..|+.||.+.|||. T Consensus 209 l~~~~Efavdlehhsyrsf~gltclmqISTr~ed~iIDt~~l-~~~i~~l~e~fsdp~ivkvfhgaD~diiwlqrdfgiy 287 (687) T KOG2206 209 LDSVIEFAVDLEHHSYRSFLGLTCLMQISTRTEDFIIDTFKL-RDHIGILNEVFSDPGIVKVFHGADTDIIWLQRDFGIY 287 (687) T ss_pred HHHHHHHHHHCCCCCHHHHCCCEEEEEEECCCHHHEEHHHHH-HHHHHHHHHHCCCCCEEEEEECCCCCHHHHHCCCEEE T ss_conf 855356532036520453337206899530420312114788-8888774221058874789734764034340230489 Q ss_pred CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 0000457898865211000000887776542002467520036565443236899998645999999999999999749 Q gi|254780636|r 97 VRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLG 175 (207) Q Consensus 97 ~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~ 175 (207) .-|+|||--|++++. -++++|..|.+.|+|+..+|..|..||..|||+++|+.||-.|+.||+.||+.|...+.+.+ T Consensus 288 vvnLfdt~~a~r~L~--~~r~sL~~ll~~~~~v~~nk~yqladwR~rpLp~~Mv~yar~dthyllyiyD~lr~el~~~a 364 (687) T KOG2206 288 VVNLFDTIQASRLLG--LPRPSLAYLLECVCGVLTNKKYQLADWRIRPLPEEMVRYAREDTHYLLYIYDVLRKELKRLA 364 (687) T ss_pred EEECHHHHHHHHHHC--CCCCCHHHHHHHHHHHHHHHHHHHCHHCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHH T ss_conf 985110588998857--99400999999987453114664110003658588988875123158999999999999870 No 30 >PRK08928 consensus Probab=99.93 E-value=8.7e-25 Score=163.31 Aligned_cols=160 Identities=14% Similarity=0.010 Sum_probs=138.5 Q ss_pred CCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCC-CHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHC Q ss_conf 2489827997177789854450799998418942899746577857-067767773236402301000016888654200 Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKN-APNLVGMLVDEKREKIFHYGRFDIAVLFYTFGV 95 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~-~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi 95 (207) +...+.+|+|+|+. .+.++.+.+|..++.+++++....... ...++++|+|+++.||+||+|+|+.+|.++ |+ T Consensus 321 ~~~~~~~a~~le~~-----~~~~vgis~s~~~~~~~~i~~~~~~~~~~~~l~~lLed~~i~KIghNlK~dl~vL~~~-gi 394 (861) T PRK08928 321 CEKEGIFAIYLLQE-----KGTNNAISLSLQNQSYIIIISNEITDAFSDIIFNLLTDKSILKITYDLKPLLKFYANQ-SH 394 (861) T ss_pred HHHCCEEEEEEECC-----CCEEEEEEEEECCCEEEEECCCCCHHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHC-CC T ss_conf 76579689992057-----8738999999559849995575424789999999843999754454579999999856-96 Q ss_pred CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 00000457898865211000000887776542002467520036565443236899998645999999999999999749 Q gi|254780636|r 96 RVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLG 175 (207) Q Consensus 96 ~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~ 175 (207) ...+++||||||+++++...+|+|.+++.++... ..|||.|+..++.||+.|.++|.+.+ T Consensus 395 ~l~~i~DTmLasYLLnp~~~~h~Ld~La~~~~~~--------------------~~~aa~~a~~~~~L~~~L~~~L~~~~ 454 (861) T PRK08928 395 DITAIDDLMLMSYSLSAGLHDHSLFNIILKNTNI--------------------INESAKIVCTLISLHKQLILELFDNK 454 (861) T ss_pred CCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCC--------------------CCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7576371889998649998768989999762220--------------------01479999999999999999998866 Q ss_pred CHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 1899999-996400689998667988884 Q gi|254780636|r 176 RSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 176 ~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) +.++|++ |++++|++++||.+|++ +|. T Consensus 455 L~~Ly~eIE~PLi~VLa~ME~~GI~-VD~ 482 (861) T PRK08928 455 LFTIYYRIDLPLCFVLDKMEKIGIK-VDA 482 (861) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCC-CCH T ss_conf 8999999887799999999981960-169 No 31 >COG0749 PolA DNA polymerase I - 3'-5' exonuclease and polymerase domains [DNA replication, recombination, and repair] Probab=99.92 E-value=1e-24 Score=162.95 Aligned_cols=183 Identities=25% Similarity=0.268 Sum_probs=146.0 Q ss_pred CCCCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCC---CCCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHH Q ss_conf 2489827997177789854450799998418942899746577---8570677677732364023010000168886542 Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAG---QKNAPNLVGMLVDEKREKIFHYGRFDIAVLFYTF 93 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~---~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~ 93 (207) +...+.+|+||||+++++....+.++.++.++ ..+++++.+. -.....|+++|+++.+.||+||+|+|..+|.++ T Consensus 19 ~~~~~~~a~~~et~~l~~~~~~lvg~s~~~~~-~~~yi~~~~~~~~~~~~~~l~~~l~~~~~~kv~~~~K~d~~~l~~~- 96 (593) T COG0749 19 LNAAANIAFDTETDGLDPHGADLVGLSVASEE-EAAYIPLLHGPEQLNVLAALKPLLEDEGIKKVGQNLKYDYKVLANL- 96 (593) T ss_pred HHHCCCCCEECCCCCCCCCCCCEEEEEEECCC-CCEEEEECCCHHHHHHHHHHHHHHHCCCCCHHCCCCCHHHHHHHHC- T ss_conf 74223310100014567555874678862365-4236751366555514899999863754103213413669999973- Q ss_pred HCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC-------CCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 000000045789886521100000088777654200246-------7520036565443236899998645999999999 Q gi|254780636|r 94 GVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINIS-------KAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQ 166 (207) Q Consensus 94 gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ld-------K~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~ 166 (207) |+.+...||||||+++++....+|++..|++++++.+.- |..+.-+....++ +.-..|||.||..+++|+.. T Consensus 97 Gi~~~~~~DtmlasYll~~~~~~~~~~~l~~r~l~~~~~~~~~i~~kg~~~~~~~~~~~-~~~~~y~a~~a~~~~~L~~~ 175 (593) T COG0749 97 GIEPGVAFDTMLASYLLNPGAGAHNLDDLAKRYLGLETITFEDIAGKGKKQLTFADVKL-EKATEYAAEDADATLRLESI 175 (593) T ss_pred CCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHCCCCCCCCCCCCHH-HHHHHHHHHHHHHHHHHHHH T ss_conf 97656517899999614867675777899998258774024876325445676100147-78888788888999999999 Q ss_pred HHHHHHHCC-CHHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 999999749-1899999-996400689998667988884 Q gi|254780636|r 167 FTEKLQRLG-RSDLATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 167 l~~~L~~~~-~~~~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) |.+.+.+.. +.+++.+ |++++++++.||.+|++ +|. T Consensus 176 l~~~l~~~~~L~~l~~~iE~Pl~~VLa~ME~~Gi~-vD~ 213 (593) T COG0749 176 LEPELLKTPVLLELYEEIEMPLVRVLARMERNGIK-VDV 213 (593) T ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCE-ECH T ss_conf 99876310469999999754089999986862856-429 No 32 >PRK07300 consensus Probab=99.86 E-value=1e-20 Score=139.70 Aligned_cols=173 Identities=16% Similarity=0.071 Sum_probs=130.8 Q ss_pred EEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHCCCCCH-H Q ss_conf 799717778985445079999841894289974657785706776777323640230100001688865420000000-4 Q gi|254780636|r 23 IAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDIAVLFYTFGVRVRPV-F 101 (207) Q Consensus 23 iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~~~~i-~ 101 (207) -.+++||++.++.++.+.++.++.++ .+++.........+.++.+++.+ +.+||.|+|..+|.++ |+.+.++ | T Consensus 322 ~~~~~et~~~~~~~a~lvG~s~~~~~--~~y~~~~~~~l~~~~~~~~~~~~---i~~~nlK~~~~vL~~~-GI~l~~~~f 395 (880) T PRK07300 322 DFFYFETLGDNYHREAIIGFAWGNGE--QIYASTDLSLLATDSFKQVLQKP---IATYDFKRSKVLLSHL-GIELPAPSF 395 (880) T ss_pred CEEEEEECCCCCCCCCEEEEEEECCC--EEEECCCHHHHHCHHHHHHHHHH---HHHHHHHHHHHHHHHC-CCCCCCCHH T ss_conf 02576533798130777999985597--58955813321176888876535---6550378689999977-986677403 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEE-CCCC--CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH Q ss_conf 578988652110000008877765420024675200-3656--5443236899998645999999999999999749189 Q gi|254780636|r 102 CTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQS-SDWS--ADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLGRSD 178 (207) Q Consensus 102 DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~-SdW~--~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~ 178 (207) |||||++++++ ..+|+|..++++|++..+.+.... .... ..|-.+....|||.|+..++++++.|.++|.+.++.+ T Consensus 396 DTmLAsYLL~p-~~~~~Ld~La~~yl~~~~~~~e~l~Gkg~k~~~~~~~~~~~yaa~da~~~~~L~~~L~~~L~~~~l~~ 474 (880) T PRK07300 396 DARLAKYLLST-VEDNELSTIARLYTDISLETDDTVYGKGAKRAIPEKEVLLEHLARKVKVLLDSKEQMLDKLTAHEQLD 474 (880) T ss_pred HHHHHHHCCCC-CCCCCHHHHHHHHCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 48999853687-65587789999946888873376636652103662779999999999999999999899987616999 Q ss_pred HHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 9999-996400689998667988884 Q gi|254780636|r 179 LATS-CCNFLMDRAELDLLGWENVDI 203 (207) Q Consensus 179 ~~~~-e~~~l~~~~~me~~g~~~~~~ 203 (207) +|++ |+++++++++||.+|++ +|. T Consensus 475 L~~~IE~PLi~VLa~ME~~GI~-VD~ 499 (880) T PRK07300 475 LLFDMELPLANVLAKMEIAGIK-VNR 499 (880) T ss_pred HHHHHHHHHHHHHHHHHHCCEE-ECH T ss_conf 9999986899999999965808-889 No 33 >cd06126 DEDDy DEDDy exonucleases, part of the DnaQ-like (or DEDD) exonuclease superfamily, catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. They contain four invariant acidic residues in three conserved sequence motifs termed ExoI, ExoII and ExoIII. DEDDy exonucleases are classified as such because of the presence of a specific YX(3)D pattern at ExoIII. The four conserved acidic residues serve as ligands for the two metal ions required for catalysis. DEDDy exonucleases include the proofreading domains of DNA polymerases (families A and B), as well as RNases such as RNase D and Rrp6p. The Egalitarian (Egl) and Bacillus-like DNA Polymerase I subfamilies do not possess a completely conserved YX(3)D pattern at the ExoIII motif. In addition, Bacillus-like DNA polymerases contain inactive 3'-5' exonuclease domains which do not possess the metal-binding residues necessary for activity. Probab=99.78 E-value=2e-18 Score=126.39 Aligned_cols=143 Identities=22% Similarity=0.246 Sum_probs=112.5 Q ss_pred CEEEECCCCCCCC--CCCEEEEEEEEECCCCEEEECCCCCCCC----------CHHHHHHHHCCCCCEEECCC-CHHHHH Q ss_conf 2799717778985--4450799998418942899746577857----------06776777323640230100-001688 Q gi|254780636|r 22 AIAVDTETLGLMP--RRDRLCIVQLSPGDGTVDIIRIAAGQKN----------APNLVGMLVDEKREKIFHYG-RFDIAV 88 (207) Q Consensus 22 ~iaiDtEt~~l~~--~~~~l~LiQl~~~~~~~~l~~~~~~~~~----------~~~L~~ll~d~~i~KV~Hn~-~~D~~~ 88 (207) ++|||+||++... .++++++||+|+.++.+++++....... ...+..++++...+++|||+ .||+.. T Consensus 1 vl~fDiE~~~~~~~~~~~pI~~i~~~~~~g~~~~~~~~~~~~~~~~~~~E~~ll~~f~~~i~~~~dii~GyN~~~FDlpy 80 (181) T cd06126 1 VVAFDIETTPLVGGGGKGPIILIQLAVSAGEKFLLDVLTLTLDAVDSADEKELLGELKAILEDYDDIKVGYNIDSFDLPY 80 (181) T ss_pred CEEEEEECCCCCCCCCCCCEEEEEEEECCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCHHH T ss_conf 92898623798999899988999999659978999862434675567899999999999983089989997997787999 Q ss_pred HHHHHHCCC---------------------CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCE---ECCCCCCC Q ss_conf 865420000---------------------000457898865211000000887776542002467520---03656544 Q gi|254780636|r 89 LFYTFGVRV---------------------RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQ---SSDWSADD 144 (207) Q Consensus 89 L~~~~gi~~---------------------~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q---~SdW~~rp 144 (207) |.+.+...- +..+|++...+-......+++|.+++++++|...++..+ ...|..+| T Consensus 81 L~~R~~~l~~~~~~~~~~~~~~~~~~~~~gr~~~D~~~~~~~~~~~~~~~~L~~va~~~lg~~K~~~~~~~i~~~w~~~~ 160 (181) T cd06126 81 LLNRAGELGIPLRRGRDRSSGSREEINIKGRILFDLLLAAKRLGVALPSYSLDALAEQLLGEGKEKVDQLAIKQAWRARP 160 (181) T ss_pred HHHHHHHHCCCCHHCCCCCCCCCCEEEECCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHCCC T ss_conf 99999995897010226665655048634852321999999841364653599999997499956699999999886798 Q ss_pred CCHHHHHHHHHHHHHHHHHH Q ss_conf 32368999986459999999 Q gi|254780636|r 145 LSDEQLQYAASDVVHLHALR 164 (207) Q Consensus 145 Ls~~Qi~YAA~Dv~~l~~L~ 164 (207) +.++++.||+.||..+++|| T Consensus 161 ~~~~~~~Y~~~Da~~tl~l~ 180 (181) T cd06126 161 LLERLAEYAAADAYATLQIY 180 (181) T ss_pred CHHHHHHHHHHHHHHHHHHH T ss_conf 66999999999999999976 No 34 >KOG2207 consensus Probab=99.77 E-value=2.9e-19 Score=131.21 Aligned_cols=155 Identities=24% Similarity=0.256 Sum_probs=117.7 Q ss_pred CCCC-CCEEEECCCCCC-CCCCCEEEEEEEEECCCCEEEECCCCCCCCCH-----HHHHHHHCCCCCEEECCCCHHHHHH Q ss_conf 2489-827997177789-85445079999841894289974657785706-----7767773236402301000016888 Q gi|254780636|r 17 ARYV-DAIAVDTETLGL-MPRRDRLCIVQLSPGDGTVDIIRIAAGQKNAP-----NLVGMLVDEKREKIFHYGRFDIAVL 89 (207) Q Consensus 17 l~~~-~~iaiDtEt~~l-~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~~-----~L~~ll~d~~i~KV~Hn~~~D~~~L 89 (207) ++.. -+||+|.||..- .....+++++|++.++ .+|++++......+. .+..+|+++++.|||.+.+.|+..+ T Consensus 409 l~~e~~yVGiDsEwkps~~v~dsk~~IlQif~~~-~v~Lidc~~l~~~~se~w~~~~s~if~s~~i~kvGf~~~eDL~~l 487 (617) T KOG2207 409 LSEELRYVGIDSEWKPSKKVSDSKLAILQIFFKD-CVYLIDCVKLENLASEIWHLLLSQIFESKSILKVGFSMREDLEVL 487 (617) T ss_pred HHHCCEEEEECCCCCCCCCCCHHHHHHHHHHHCC-EEEEEEHHHHHHCHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHH T ss_conf 6316779987454476567880388999987048-089965687652408999999998706886046603356239999 Q ss_pred HHH-----HHCC---CCCH-HHHHHHHHHHHH-------HCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHH Q ss_conf 654-----2000---0000-457898865211-------00000088777654200246752003656544323689999 Q gi|254780636|r 90 FYT-----FGVR---VRPV-FCTKIASRLTRT-------YTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYA 153 (207) Q Consensus 90 ~~~-----~gi~---~~~i-~DT~ias~l~~~-------~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YA 153 (207) ... +.+. +.++ .++.++.++..- ....-+|..|....+|+.++|+.|+|||..|||+.+|+.|| T Consensus 488 ~~s~pa~~~q~ki~~~~l~~~~~kl~e~~~~~~~~i~n~~~~~~~L~~Lt~~llg~~lnKteqcsnWqcrpLr~nQi~ya 567 (617) T KOG2207 488 EASSPALRFQMKIEGLQLVSCVLKLAENVIDLPLSIENLNEATKGLADLTDCLLGKKLNKTEQCSNWQCRPLRRNQIYYA 567 (617) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCCHHHCCCCHHHHHHHH T ss_conf 86516666313312318889999999988616325542121110110235777522225200050542587306688998 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 8645999999999999999 Q gi|254780636|r 154 ASDVVHLHALRLQFTEKLQ 172 (207) Q Consensus 154 A~Dv~~l~~L~~~l~~~L~ 172 (207) |.|+.++.++|.+..+.-. T Consensus 568 alDa~~~~~ifkkv~~vv~ 586 (617) T KOG2207 568 ALDAVVLVEIFKKVCSVVE 586 (617) T ss_pred HHCCHHHHHHHHHHHHHCC T ss_conf 8420266899999985130 No 35 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=99.52 E-value=2.6e-14 Score=102.46 Aligned_cols=134 Identities=19% Similarity=0.212 Sum_probs=102.3 Q ss_pred HHHHHHCCCCCEEE-------------CCCCHHHHHHHHHHHCCCC---CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 76777323640230-------------1000016888654200000---0045789886521100000088777654200 Q gi|254780636|r 66 LVGMLVDEKREKIF-------------HYGRFDIAVLFYTFGVRVR---PVFCTKIASRLTRTYTNQHGLKDNLKELLGI 129 (207) Q Consensus 66 L~~ll~d~~i~KV~-------------Hn~~~D~~~L~~~~gi~~~---~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~ 129 (207) +...++++.+.|.+ |++|++...|... |+.+. +++|||||++++++......+..++.+|... T Consensus 444 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~K~~~~~L~~~-g~~~~~~~~~~D~~laaYll~~~~~~~~l~~~~~~y~~~ 522 (1005) T TIGR00593 444 FAEWLLKETIKKALEELLVSASKVKVAHDAKFLMHLLKRK-GIELIEIGVIFDTMLAAYLLDPAQVSKDLDTLAKRYLVE 522 (1005) T ss_pred HHHHHHHCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHC-CCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHCCC T ss_conf 8998740001455521002341012288999999999743-773344211454899999843035640278999871343 Q ss_pred CCCCC----C--EEC--CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-------------------HCCCHHHHHH Q ss_conf 24675----2--003--6565443236899998645999999999999999-------------------7491899999 Q gi|254780636|r 130 NISKA----Q--QSS--DWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQ-------------------RLGRSDLATS 182 (207) Q Consensus 130 ~ldK~----~--q~S--dW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~-------------------~~~~~~~~~~ 182 (207) .+-.. + ... +....+ .+..+.|++.+|.++..|+..|.++|. +.++..++.+ T Consensus 523 ~~~~d~~~~~~~~~~~~~~a~~~-~~~~~~~~~~~~~~~~~L~~~l~~~L~~eDaDitfrlyhylklrle~~~~~~L~~e 601 (1005) T TIGR00593 523 ELILDDDRKEEGIKKLAKFADPD-LEEAIEYLARKAAATARLAEELLKELDAEDADITFRLYHYLKLRLEENKLLSLYEE 601 (1005) T ss_pred CCCCCCCHHHHHHHHHHHHHCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 46778731345544334441121-77889999999999999999988873113220466777788865163889999988 Q ss_pred -HHHHHHHHHHHHHCCCCCCC Q ss_conf -99640068999866798888 Q gi|254780636|r 183 -CCNFLMDRAELDLLGWENVD 202 (207) Q Consensus 183 -e~~~l~~~~~me~~g~~~~~ 202 (207) |+|+.+++++||..|++ +| T Consensus 602 iE~Pl~~VL~~ME~~Gi~-~D 621 (1005) T TIGR00593 602 IELPLSKVLAEMEKTGIK-VD 621 (1005) T ss_pred HCCCHHHHHHHHHHHHHH-HH T ss_conf 420078999998776678-70 No 36 >PRK06309 DNA polymerase III subunit epsilon; Validated Probab=99.41 E-value=4.4e-12 Score=89.50 Aligned_cols=143 Identities=19% Similarity=0.198 Sum_probs=90.8 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEEECC----CCE-EEECCCCC-C--------------CCCH-------HHHHHHHC Q ss_conf 98279971777898544507999984189----428-99746577-8--------------5706-------77677732 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRLCIVQLSPGD----GTV-DIIRIAAG-Q--------------KNAP-------NLVGMLVD 72 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l~LiQl~~~~----~~~-~l~~~~~~-~--------------~~~~-------~L~~ll~d 72 (207) ..+|.|||||||+++.++++ ||+|.-+ +.+ .++..... . ..+| .+.+++.+ T Consensus 2 ~~~V~fD~ETTGl~~~~DrI--IEiaav~~~~~e~f~~linP~~pIp~~a~~ihGIt~e~v~~~P~~~ev~~~~~~f~~~ 79 (232) T PRK06309 2 PALIFYDTETTGTQIDKDRI--IEIAAYNGVTSESFQTYVNPEIPIPAEATKIHGITTSEVASAPKFPEAYQKFREFCGT 79 (232) T ss_pred CCEEEEEEECCCCCCCCCEE--EEEEEEECCCCCEEEEEECCCCCCCHHHHCCCCCCHHHHHCCCCHHHHHHHHHHHHCC T ss_conf 96899971179959999458--9999892756768999979989699656210383999980599899999999998579 Q ss_pred CCCCEEECCC-CHHHHHHHHHH---HCCC--CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCC Q ss_conf 3640230100-00168886542---0000--0004578988652110000008877765420024675200365654432 Q gi|254780636|r 73 EKREKIFHYG-RFDIAVLFYTF---GVRV--RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLS 146 (207) Q Consensus 73 ~~i~KV~Hn~-~~D~~~L~~~~---gi~~--~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs 146 (207) ..+ -|+||+ +||+.+|...+ |+.+ ..++||+..++.+.+...+|+|++|+++ +|+..+.. T Consensus 80 ~~i-lVaHN~~~FD~~fL~~e~~r~g~~~~~~~~iDTl~lar~l~P~l~~h~L~~L~~~-~gi~~~~a------------ 145 (232) T PRK06309 80 DNI-LVAHNNDGFDFPLLVKECRRHSLEPLTLRTIDSLKWAQKYRPDLPKHNLQYLRQV-YGFAENQA------------ 145 (232) T ss_pred CCE-EEEECCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHH-HCCCCCCC------------ T ss_conf 988-9984896567999999999859986774053799999997589776889999998-09999888------------ Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 368999986459999999999999997491899999 Q gi|254780636|r 147 DEQLQYAASDVVHLHALRLQFTEKLQRLGRSDLATS 182 (207) Q Consensus 147 ~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~~~~~ 182 (207) +=|..||..+.+++..+-..+..+...++++. T Consensus 146 ----HrAl~Da~at~~vf~~ll~~~~~~~~~~l~~~ 177 (232) T PRK06309 146 ----HRALDDVITLHRVFSALVGDLSPQQVLDLLNE 177 (232) T ss_pred ----CCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHC T ss_conf ----88599999999999998721699999999845 No 37 >KOG4373 consensus Probab=99.38 E-value=1.6e-12 Score=91.97 Aligned_cols=129 Identities=20% Similarity=0.202 Sum_probs=100.5 Q ss_pred CCEEEEEEEEECCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCCCHHHHHHHH-HHHCCCCCHHHHHH-HHHHHHHH Q ss_conf 45079999841894289974657785706776777323640230100001688865-42000000045789-88652110 Q gi|254780636|r 36 RDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVGMLVDEKREKIFHYGRFDIAVLFY-TFGVRVRPVFCTKI-ASRLTRTY 113 (207) Q Consensus 36 ~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~-~~gi~~~~i~DT~i-as~l~~~~ 113 (207) .|...-+|||.++..|+++.+.+....+..|+.+|+|++.++||.+.+.|...|.+ .++..+.-+.|.-. +...+.+. T Consensus 146 ~P~~~~lqlcV~en~C~I~ql~~~~~IP~~LR~fl~D~~~~~vgv~~d~D~~KL~r~~hql~I~~~~dlr~~~~d~~g~~ 225 (319) T KOG4373 146 DPPPDTLQLCVGENRCLIIQLIHCKRIPHELRSFLEDPDHTFVGVWNDQDAGKLERKEHQLEIGELEDLRLLVNDSLGGS 225 (319) T ss_pred CCCCCHHHHHHCCCCEEEEEEECCCCCHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHCCCHHHHHHHHHHCCHHHCCC T ss_conf 99843222442664115677631465559999862387705873055530787766552263776640876503333367 Q ss_pred ---CCCHHHHHHHHHHHC--CCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf ---000008877765420--024675200365654432368999986459999999 Q gi|254780636|r 114 ---TNQHGLKDNLKELLG--INISKAQQSSDWSADDLSDEQLQYAASDVVHLHALR 164 (207) Q Consensus 114 ---~~~~~L~~L~~~~lg--~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~ 164 (207) ..+.++.....-+-| +.++|..+.|||+..|||.+|+.||+.||.....|+ T Consensus 226 ~~~~s~e~i~~~~~~~~~~~v~l~~~i~msdw~~~~Ls~~Ql~~asidvy~c~~lg 281 (319) T KOG4373 226 MPNDSFEEIVSETLGYYGKDVRLDKEIRMSDWSVYPLSDDQLLQASIDVYVCHKLG 281 (319) T ss_pred CCCCCHHHHHHHHHHCCCCCCCCCHHCCCCCCEEEECCHHHHHHHHHHHHHHHHHH T ss_conf 66761889999874124622025721043565146266889999876788887888 No 38 >PRK07740 hypothetical protein; Provisional Probab=99.31 E-value=4.3e-11 Score=83.74 Aligned_cols=142 Identities=17% Similarity=0.187 Sum_probs=93.8 Q ss_pred CCCCCCEEEECCCCCCCCCC-CEEEEEEEEE---CCC----CE-EEECCCCCC---------------CCCH-------H Q ss_conf 24898279971777898544-5079999841---894----28-997465778---------------5706-------7 Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPRR-DRLCIVQLSP---GDG----TV-DIIRIAAGQ---------------KNAP-------N 65 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~~-~~l~LiQl~~---~~~----~~-~l~~~~~~~---------------~~~~-------~ 65 (207) +...+.+.||+||||++|.+ +++ |+|+. .++ .+ -++...+.. ..+| . T Consensus 54 L~~~~fvV~D~ETTGl~p~~gD~I--IeIgAVkv~~g~i~d~F~tLVnP~~~Ip~~i~~ltGIT~emV~~aP~~~eVl~~ 131 (240) T PRK07740 54 LTDLPFVVFDLETTGFSPDQGDEI--LSIAAVKTVGGEVTDLYYSLVKPKKPIPEHILTLTGISAKDVVFAPPLAEVLRT 131 (240) T ss_pred CCCCEEEEEEEECCCCCCCCCCEE--EEEEEEEEECCEEEEEEEEEECCCCCCCHHHHHHCCCCHHHHHCCCCHHHHHHH T ss_conf 433838999805899898889878--999899999999968877550887989988998609899998379999999999 Q ss_pred HHHHHHCCCCCEEECCCCHHHHHHHHH----HHCCCC-CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCC Q ss_conf 767773236402301000016888654----200000-004578988652110000008877765420024675200365 Q gi|254780636|r 66 LVGMLVDEKREKIFHYGRFDIAVLFYT----FGVRVR-PVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDW 140 (207) Q Consensus 66 L~~ll~d~~i~KV~Hn~~~D~~~L~~~----~gi~~~-~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW 140 (207) +.+++.+. +-|+||+.||+.+|.+. ++..+. ++.||...++++.. ...++|.+++++ +|+.... T Consensus 132 fl~Fi~~~--vlVaHNa~FD~~FL~~~l~r~~~~~~~~~~iDT~~Lar~l~~-~~~~sL~~l~~~-fgI~~~~------- 200 (240) T PRK07740 132 FYPFINAS--TLVGYHIGHDLAFLRHALWRHYRQKFSHRFIDTQFLTEIIAH-QSFPTLDDALAC-YGITCPR------- 200 (240) T ss_pred HHHHHCCC--EEEEECCHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHCC-CCCCCHHHHHHH-CCCCCCC------- T ss_conf 99983899--899968286399999999984599878876369999998658-998789999997-6999999------- Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 6544323689999864599999999999999974918999 Q gi|254780636|r 141 SADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLGRSDLA 180 (207) Q Consensus 141 ~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~~~ 180 (207) -+=|..||..+-++|.++-+++++.|...+. T Consensus 201 ---------rHrAl~DA~aTA~lf~~ll~~l~~~Gi~TL~ 231 (240) T PRK07740 201 ---------RHTADGDVEMTAKLWAILIIEARQAGIETLH 231 (240) T ss_pred ---------CCCCHHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf ---------9882999999999999999999976999899 No 39 >PRK06310 DNA polymerase III subunit epsilon; Validated Probab=99.28 E-value=4.4e-11 Score=83.64 Aligned_cols=136 Identities=16% Similarity=0.185 Sum_probs=89.4 Q ss_pred HCCCCCCEEEECCCCCCCCCCCEEEEEEEEE----CCCC----EEEECCCCC-C--------------CCCH-------H Q ss_conf 4248982799717778985445079999841----8942----899746577-8--------------5706-------7 Q gi|254780636|r 16 AARYVDAIAVDTETLGLMPRRDRLCIVQLSP----GDGT----VDIIRIAAG-Q--------------KNAP-------N 65 (207) Q Consensus 16 ~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~----~~~~----~~l~~~~~~-~--------------~~~~-------~ 65 (207) +++....|.||+||||+++..+++ ||||. +++. ..++..... + ..+| . T Consensus 3 ll~~~~yvv~D~ETTGl~~~~DrI--IEI~~vkv~~~~i~~~~~~linP~~~I~~~~~~ihGIt~emV~d~P~f~ev~~~ 80 (250) T PRK06310 3 LLKDTEFVCLDCETTGLDVKKDRI--IEFAAIRFTFDEVIDSMEFLINPERVVSAESQRIHHISDAMLRDKPKIAEVFPQ 80 (250) T ss_pred CCCCCCEEEEEEECCCCCCCCCEE--EEEEEEEEECCEEEEEEEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHHHHHHH T ss_conf 666898899973379979999627--999999999998868899998928979988987615489998469999999999 Q ss_pred HHHHHHCCCCCEEECCCCHHHHHHHHHH---HCCC----CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEEC Q ss_conf 7677732364023010000168886542---0000----00045789886521100000088777654200246752003 Q gi|254780636|r 66 LVGMLVDEKREKIFHYGRFDIAVLFYTF---GVRV----RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSS 138 (207) Q Consensus 66 L~~ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~~----~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~S 138 (207) +.+++.+..+. |+||++||+.+|...+ |+.+ ..+.||+-.++... ...+++|.+|+++ +|+.++.. T Consensus 81 ~~~fi~~~~il-VaHNa~FD~~fL~~e~~r~g~~~~~~~~~~IDTl~lAr~~~-~~~~~~L~~L~~~-~~i~~~~a---- 153 (250) T PRK06310 81 IKGFFKEGDYI-VGHSVGFDLQVLAQEAERIGETFLSKHYYIIDTLRLAKEYG-DSPNNSLEALAVH-FNVPYDGN---- 153 (250) T ss_pred HHHHHCCCCEE-EEECHHHHHHHHHHHHHHCCCCCCCCCCCEECHHHHHHHCC-CCCCCHHHHHHHH-CCCCCCCC---- T ss_conf 99996799999-99384557999999999769986656795784999997567-6545609999997-09999866---- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6565443236899998645999999999999999 Q gi|254780636|r 139 DWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQ 172 (207) Q Consensus 139 dW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~ 172 (207) +=|..||..+.+++..+.++.+ T Consensus 154 ------------HrAl~Da~aTa~vf~~l~~r~~ 175 (250) T PRK06310 154 ------------HRAMKDVEININVFKHLCKRFR 175 (250) T ss_pred ------------CCHHHHHHHHHHHHHHHHHHCC T ss_conf ------------6819999999999999998553 No 40 >PRK06631 consensus Probab=99.28 E-value=4.2e-11 Score=83.79 Aligned_cols=130 Identities=25% Similarity=0.334 Sum_probs=82.1 Q ss_pred CCCCEEEECCCCCCCCCC-CEEEEEEEE---------ECCCCE-EEECCCCCCC-C--------------CH-------H Q ss_conf 898279971777898544-507999984---------189428-9974657785-7--------------06-------7 Q gi|254780636|r 19 YVDAIAVDTETLGLMPRR-DRLCIVQLS---------PGDGTV-DIIRIAAGQK-N--------------AP-------N 65 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~-~~l~LiQl~---------~~~~~~-~l~~~~~~~~-~--------------~~-------~ 65 (207) .-..|.|||||||++|.. +++ |+++ +++ .+ .++.+.+... . .| . T Consensus 3 ~mr~VvlD~ETTGl~~~~gdrI--IEIgav~~~~~~~t~~-~f~~~inP~r~i~~~~~~ihGIt~e~l~~~P~f~ev~~~ 79 (229) T PRK06631 3 SLREIILDTETTGLDPQQGHRI--VEIGAIEMVNKVLTGR-NFHFYINPERDMPFEAYRIHGISGEFLKDKPLFHTIADD 79 (229) T ss_pred CCCEEEEEEECCCCCCCCCCEE--EEEEEEEEECCEEECC-EEEEEECCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHH T ss_conf 7888999825799899999879--9999999999988043-799998989979761512018678998538779999999 Q ss_pred HHHHHHCCCCCEEECCCCHHHHHHHHHHH------C---CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCE Q ss_conf 76777323640230100001688865420------0---00000457898865211000000887776542002467520 Q gi|254780636|r 66 LVGMLVDEKREKIFHYGRFDIAVLFYTFG------V---RVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQ 136 (207) Q Consensus 66 L~~ll~d~~i~KV~Hn~~~D~~~L~~~~g------i---~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q 136 (207) +..++.+.. -|+||+.||+.+|.+.+. + ...++.||...|+-+-+. .+++|.+||++ +|+.. T Consensus 80 fl~Fi~~s~--LVaHNA~FD~~FLn~El~~l~~~~~~~~~~~~viDTL~lAR~~~Pg-~~~sLdaLc~r-~~I~~----- 150 (229) T PRK06631 80 FLEFISDSK--LIIHNAPFDIKFLNHELSLLKRTEIKLLELANTIDTLVMARSMFPG-SKYNLDALCKR-FKVDN----- 150 (229) T ss_pred HHHHCCCCE--EEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCC-CCCCHHHHHHH-CCCCC----- T ss_conf 999829997--9997638759899999998278888843201101269999986899-88899999998-49999----- Q ss_pred ECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 036565443236899998645999999999999 Q gi|254780636|r 137 SSDWSADDLSDEQLQYAASDVVHLHALRLQFTE 169 (207) Q Consensus 137 ~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~ 169 (207) +....+=|..||..|-++|-.|.. T Consensus 151 ---------s~r~~H~AL~DA~llA~Vyl~l~g 174 (229) T PRK06631 151 ---------SGRQLHGALKDAALLAEVYVELTG 174 (229) T ss_pred ---------CCCCCCCCHHHHHHHHHHHHHHHC T ss_conf ---------985556868999999999999717 No 41 >PRK07942 DNA polymerase III subunit epsilon; Provisional Probab=99.22 E-value=3e-10 Score=78.84 Aligned_cols=136 Identities=18% Similarity=0.220 Sum_probs=84.3 Q ss_pred CCCCEEEECCCCCCCCCCCEEEEEEEEE----CCC-----CEEEECCCCCC--------------------CCCHHHHH- Q ss_conf 8982799717778985445079999841----894-----28997465778--------------------57067767- Q gi|254780636|r 19 YVDAIAVDTETLGLMPRRDRLCIVQLSP----GDG-----TVDIIRIAAGQ--------------------KNAPNLVG- 68 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~~~l~LiQl~~----~~~-----~~~l~~~~~~~--------------------~~~~~L~~- 68 (207) ..+.++|||||||+++..+++. |+|. +++ ..++++..... ..++.+.+ T Consensus 5 ~~~lvvfD~ETTG~d~~~dRII--eia~V~~~~~g~~~~~~~~LvnPg~~IP~~a~~vhGIT~e~~~~~G~p~a~v~~ei 82 (234) T PRK07942 5 DGPLAAFDLETTGVDPETARIV--TAAIVVVDAHGEVVERREWLADPGVEIPEEASAVHGITTERARAHGRPAAEVLAEI 82 (234) T ss_pred CCCEEEEEEECCCCCCCCCEEE--EEEEEEEECCCCEEEEEEEEECCCCCCCHHHHHHHCCCHHHHHHCCCCCHHHHHHH T ss_conf 6978999801699898898389--99999998898546889998695894998897784889999974289830799999 Q ss_pred ------HHHCCCCCEEECCCCHHHHHHHHHH---HCC---CCCHHHHHHHHHHHHHHC-CCHHHHHHHHHHHCCCCCCCC Q ss_conf ------7732364023010000168886542---000---000045789886521100-000088777654200246752 Q gi|254780636|r 69 ------MLVDEKREKIFHYGRFDIAVLFYTF---GVR---VRPVFCTKIASRLTRTYT-NQHGLKDNLKELLGINISKAQ 135 (207) Q Consensus 69 ------ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~---~~~i~DT~ias~l~~~~~-~~~~L~~L~~~~lg~~ldK~~ 135 (207) ++. ..+.-|+||+.||+.+|.+.+ |+. +.+++||...++.+..+. .+++|.+||++| |++++.. T Consensus 83 a~~l~~~~~-~g~~lVahNA~FDl~fL~~El~R~g~~~l~~~~viDtl~l~r~l~~~r~gkr~L~aL~~~y-gv~l~~a- 159 (234) T PRK07942 83 AEALRAVWR-AGVPVVVYNAPYDLTVLDRELRRHGLPSLVPGPVIDPYVIDKHVDRYRKGKRTLTALCEHY-GVRLDNA- 159 (234) T ss_pred HHHHHHHHH-CCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCCEEEHHHHHHHHCCCCCCCCCHHHHHHHC-CCCCCCC- T ss_conf 999999972-7987999670764999999999859996777755619998877514777888899999984-9988775- Q ss_pred EECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 003656544323689999864599999999999999974 Q gi|254780636|r 136 QSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRL 174 (207) Q Consensus 136 q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~ 174 (207) +=|..|+..+.+++..+..+..+. T Consensus 160 ---------------H~A~aDA~Ata~l~~~l~~r~~~l 183 (234) T PRK07942 160 ---------------HDATADALAAARLAWALARRFPEL 183 (234) T ss_pred ---------------CCCHHHHHHHHHHHHHHHHHHHHH T ss_conf ---------------564152899999999999876887 No 42 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=99.19 E-value=1.2e-10 Score=81.13 Aligned_cols=147 Identities=24% Similarity=0.348 Sum_probs=105.5 Q ss_pred CCCCCCEEEECCCCCCCCCCCEEEEEEEEE---CCC-CEEEECCC-------CCC--------------CC-------CH Q ss_conf 248982799717778985445079999841---894-28997465-------778--------------57-------06 Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPRRDRLCIVQLSP---GDG-TVDIIRIA-------AGQ--------------KN-------AP 64 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~---~~~-~~~l~~~~-------~~~--------------~~-------~~ 64 (207) +..+..|-||.|||||.|..+++ ||+|. .++ .. +..+. ... .+ .+ T Consensus 201 l~DaTyVVfD~ETTGLsp~~d~i--IE~gA~k~~nGk~i-i~~~~~FikP~~~l~~~~~elT~ITq~ml~n~~~~~~vL~ 277 (1264) T TIGR01405 201 LKDATYVVFDIETTGLSPQYDEI--IEFGAVKVKNGKRI-IDKFQFFIKPKEPLSAFVTELTGITQDMLENAPEIEEVLE 277 (1264) T ss_pred ECCCEEEEEEEEECCCCCCCCCE--EEEEEEEEECCCEE-EEEHHHHCCCCCCCCCCEEEECCCCHHHHCCCCCHHHHHH T ss_conf 23620799603515888861424--67875887468576-2312323288754675426624664797379841789999 Q ss_pred HHHHHHHCCCCCEEECC-CCHHHHHHHHHH---HCCC--CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEEC Q ss_conf 77677732364023010-000168886542---0000--00045789886521100000088777654200246752003 Q gi|254780636|r 65 NLVGMLVDEKREKIFHY-GRFDIAVLFYTF---GVRV--RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSS 138 (207) Q Consensus 65 ~L~~ll~d~~i~KV~Hn-~~~D~~~L~~~~---gi~~--~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~S 138 (207) .+++++. ..|. |.|| +.||+.+|...+ ++.+ +++.||.-.|+.+.+.-.+|+|..+|++ ||+.+ T Consensus 278 k~~~f~~-d~il-VAHNGasFD~~Fl~~~~~k~~~~~~~~p~IDTL~Lar~lnP~y~sh~Lg~~~~K-l~v~~------- 347 (1264) T TIGR01405 278 KFKEFLK-DSIL-VAHNGASFDIGFLNTNFEKVGLEPLENPVIDTLELARALNPEYKSHRLGEIAKK-LGVDL------- 347 (1264) T ss_pred HHHHHHC-CEEE-EECCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHH-CCCCC------- T ss_conf 9999769-7588-862761036688988987627742336511388999983731134454223332-05521------- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CHHHHHHH Q ss_conf 6565443236899998645999999999999999749-18999999 Q gi|254780636|r 139 DWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLG-RSDLATSC 183 (207) Q Consensus 139 dW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~-~~~~~~~e 183 (207) ..+..+=|-.||.+|-+++..+.+.+.+.+ -..+-+.. T Consensus 348 -------~~e~~HRA~YDa~~t~~vF~~~~~~~~e~gGi~~l~~ln 386 (1264) T TIGR01405 348 -------DDERHHRADYDAEATAKVFIVMVEQLKEKGGITNLEELN 386 (1264) T ss_pred -------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHCC T ss_conf -------533012433479999999999999987503721123405 No 43 >PRK08074 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=99.16 E-value=1.3e-09 Score=75.06 Aligned_cols=136 Identities=27% Similarity=0.344 Sum_probs=88.9 Q ss_pred CCCEEEECCCCCCCCCC-CEEEEEEEEE---CCCCE---EEECCCCCCC-----------------C-------CHHHHH Q ss_conf 98279971777898544-5079999841---89428---9974657785-----------------7-------067767 Q gi|254780636|r 20 VDAIAVDTETLGLMPRR-DRLCIVQLSP---GDGTV---DIIRIAAGQK-----------------N-------APNLVG 68 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~-~~l~LiQl~~---~~~~~---~l~~~~~~~~-----------------~-------~~~L~~ 68 (207) ...+++|+||||+++.+ +++ ||++. .++.. |--.++.... . ++.+.+ T Consensus 3 ~~~~vvDlETTG~~~~~~d~I--Iqig~v~i~~~~i~~~f~t~vnP~~~iP~~I~~LTgI~~~~v~~AP~f~eva~~i~~ 80 (932) T PRK08074 3 KRFVVVDLETTGNSPKKGDKI--IQIAAVVVEDGEILERFSTFVNPERPIPPFITELTGISEEMVKNAPLFEDVAPMIVE 80 (932) T ss_pred CEEEEEEEECCCCCCCCCCEE--EEEEEEEEECCEEEEEEECCCCCCCCCCHHHHHHCCCCHHHHCCCCCHHHHHHHHHH T ss_conf 708999988889897779889--999799999999989985325899989977886069898998059998999999999 Q ss_pred HHHCCCCCEEECCCCHHHHHHHHHH---HCC-CC-CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCC Q ss_conf 7732364023010000168886542---000-00-004578988652110000008877765420024675200365654 Q gi|254780636|r 69 MLVDEKREKIFHYGRFDIAVLFYTF---GVR-VR-PVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSAD 143 (207) Q Consensus 69 ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~-~~-~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~r 143 (207) +++|. .-|+||+.||+.+|...+ |.. +. +.+||.-.|+++-+...+|.|.+|++. ||+..+.. T Consensus 81 ~l~~~--~~VaHNv~FD~~FL~~~l~~~G~~~~~~~~~DTveLa~i~~P~~~sy~L~~L~~~-l~i~~~~~--------- 148 (932) T PRK08074 81 LLEGA--YFVAHNVHFDLNFLNEELERAGYTEIHCPKLDTVELARILLPTADSYKLSDLSEE-LELEHDQP--------- 148 (932) T ss_pred HHCCC--EEEECCHHHHHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHH-CCCCCCCC--------- T ss_conf 96699--5994780540999999999739986668765199999997788788888999986-49999998--------- Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 432368999986459999999999999997491 Q gi|254780636|r 144 DLSDEQLQYAASDVVHLHALRLQFTEKLQRLGR 176 (207) Q Consensus 144 pLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~ 176 (207) +=|-.||..+-+|+..+.+++..... T Consensus 149 -------HrA~sDa~aTa~l~~~l~~~l~~lp~ 174 (932) T PRK08074 149 -------HRADSDAEVTAELFLQLLHKLERLPL 174 (932) T ss_pred -------CCCHHHHHHHHHHHHHHHHHHHHCCH T ss_conf -------75178899999999999999983799 No 44 >PRK06022 consensus Probab=99.14 E-value=4.3e-10 Score=77.88 Aligned_cols=127 Identities=21% Similarity=0.275 Sum_probs=79.3 Q ss_pred CEEEECCCCCCCCCCCEEEEEEEEE-----C--C-CCE-EEECCC-CC-CCC--------------C-------HHHHHH Q ss_conf 2799717778985445079999841-----8--9-428-997465-77-857--------------0-------677677 Q gi|254780636|r 22 AIAVDTETLGLMPRRDRLCIVQLSP-----G--D-GTV-DIIRIA-AG-QKN--------------A-------PNLVGM 69 (207) Q Consensus 22 ~iaiDtEt~~l~~~~~~l~LiQl~~-----~--~-~~~-~l~~~~-~~-~~~--------------~-------~~L~~l 69 (207) .|-+||||||+++..+++ |+++. . . ..+ .++.+. +. ... + +.+.++ T Consensus 3 ~IvlDtETTGl~~~~dRI--IEIg~vel~~~~~t~~~f~~~inP~~r~I~~~a~~IHGIt~e~L~~~P~F~eva~~fl~F 80 (234) T PRK06022 3 EIIFDTETTGLERREDRV--IEIGGVELVNRFPTGRTFHKFINPQGRQVHPDALAVHGISDEQLLDKPVFAEILDEFLEF 80 (234) T ss_pred EEEEEEECCCCCCCCCCE--EEEEEEEEECCCCCCCEEEEEECCCCCCCCHHHHHHHCCCHHHHCCCCCHHHHHHHHHHH T ss_conf 699982279989989707--999999997786336578999899998689889988560899870599799999999998 Q ss_pred HHCCCCCEEECCCCHHHHHHHHHH---HCC---CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCC Q ss_conf 732364023010000168886542---000---00004578988652110000008877765420024675200365654 Q gi|254780636|r 70 LVDEKREKIFHYGRFDIAVLFYTF---GVR---VRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSAD 143 (207) Q Consensus 70 l~d~~i~KV~Hn~~~D~~~L~~~~---gi~---~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~r 143 (207) +.+.. -|+||+.||+.+|.+.+ |.. ..++.||.-.|+-.-+ +.+++|.+||++ +|+..+ .| T Consensus 81 i~~a~--LVaHNA~FD~gFLn~El~r~G~~~~~~~~viDTL~LAR~~~P-g~~~sLdaLc~r-~~Id~~---------~R 147 (234) T PRK06022 81 FDGAR--LVAHNATFDLGFINAEFARLGQPEITSERVVDTLALARRKHP-MGPNSLDALCRR-YGIDNS---------HR 147 (234) T ss_pred HCCCE--EEEECHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCC-CCCCCHHHHHHH-CCCCCC---------CC T ss_conf 18985--999557872999999999859987778887469999998688-998898999998-399977---------73 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4323689999864599999999999 Q gi|254780636|r 144 DLSDEQLQYAASDVVHLHALRLQFT 168 (207) Q Consensus 144 pLs~~Qi~YAA~Dv~~l~~L~~~l~ 168 (207) . .+=|-.|+..|-.+|-.|. T Consensus 148 ~-----~HgAL~Da~lLa~VyleL~ 167 (234) T PRK06022 148 T-----KHGALLDSELLAEVYIELI 167 (234) T ss_pred C-----CCCCHHHHHHHHHHHHHHC T ss_conf 5-----3762888999999999972 No 45 >PRK08517 DNA polymerase III subunit epsilon; Provisional Probab=99.14 E-value=6.7e-10 Score=76.81 Aligned_cols=135 Identities=19% Similarity=0.193 Sum_probs=88.0 Q ss_pred CCCCCCEEEECCCCCCCCCCCEEEEEEEEE---CCCCE-----EEECCCCCC--------------CCC-------HHHH Q ss_conf 248982799717778985445079999841---89428-----997465778--------------570-------6776 Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPRRDRLCIVQLSP---GDGTV-----DIIRIAAGQ--------------KNA-------PNLV 67 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~---~~~~~-----~l~~~~~~~--------------~~~-------~~L~ 67 (207) ++.+..+.||+||||++|..+++ |+|+. .++.. -++.....+ ..+ +.+. T Consensus 67 ~ke~~fvvvDlETTGl~~~~d~I--IEIgavkv~~g~iid~f~tlVnp~~Ip~~It~lTGIt~emv~~aP~~~evl~~f~ 144 (259) T PRK08517 67 IKEQVFCFVDIETNGSKPKKGQI--IEIGAVKVKGGEIIDRFESFVKAKEVPEYITELTGITYEDLENAPSLKEVLEDFR 144 (259) T ss_pred CCCCCEEEEEEECCCCCCCCCEE--EEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHCCCCHHHHHCCCCHHHHHHHHH T ss_conf 67883899996589989999824--9999999999999998743239898998882213899899827998899999999 Q ss_pred HHHHCCCCCEEECCCCHHHHHHHHHHH---CC-C-CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCC Q ss_conf 777323640230100001688865420---00-0-000457898865211000000887776542002467520036565 Q gi|254780636|r 68 GMLVDEKREKIFHYGRFDIAVLFYTFG---VR-V-RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSA 142 (207) Q Consensus 68 ~ll~d~~i~KV~Hn~~~D~~~L~~~~g---i~-~-~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~ 142 (207) .++.|. +-|+||+.||+.+|...+. .. + ++.+||.-.|+.+-+. .+|+|.+|+++ +|+..... T Consensus 145 ~Fi~d~--ilVaHNa~FD~~FL~~~l~r~g~~~l~n~~lDTl~LAR~~~p~-~~ykL~~La~~-~~I~~~~~-------- 212 (259) T PRK08517 145 LFLGDS--VFVAHNVNFDYNFISRSLEEIGLGPLLNRKLCTIDLAKRTIES-PRYGLSFLKEL-LGIETEVH-------- 212 (259) T ss_pred HHHCCC--EEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEEHHHHHHHCCC-CCCCHHHHHHH-CCCCCCCC-------- T ss_conf 996889--5999372760999999999839997689856608999987587-78788999997-59899999-------- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4432368999986459999999999999997 Q gi|254780636|r 143 DDLSDEQLQYAASDVVHLHALRLQFTEKLQR 173 (207) Q Consensus 143 rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~ 173 (207) +=|..||..+-+++..+-++|.. T Consensus 213 --------HRAl~DA~aTaevF~~~L~~l~~ 235 (259) T PRK08517 213 --------HRAYADALAAYEIFEICLLNLPS 235 (259) T ss_pred --------CCCHHHHHHHHHHHHHHHHHCCC T ss_conf --------78458899999999999985665 No 46 >PRK08858 consensus Probab=99.13 E-value=6.2e-10 Score=77.00 Aligned_cols=133 Identities=18% Similarity=0.229 Sum_probs=83.5 Q ss_pred CCCCCEEEECCCCCCCCCC----CEEEEEEEEEC---C-----CCE-EEECCCCCCC---------------CC------ Q ss_conf 4898279971777898544----50799998418---9-----428-9974657785---------------70------ Q gi|254780636|r 18 RYVDAIAVDTETLGLMPRR----DRLCIVQLSPG---D-----GTV-DIIRIAAGQK---------------NA------ 63 (207) Q Consensus 18 ~~~~~iaiDtEt~~l~~~~----~~l~LiQl~~~---~-----~~~-~l~~~~~~~~---------------~~------ 63 (207) +....|.+||||||+++-. ..=.+|+|+.- + ..+ .++.+.+... .+ T Consensus 8 ~~~R~VVlDtETTGl~~~~~~~~~gdRIIEIgaV~i~~~~~t~~~f~~~InP~r~Ip~~a~~ihGIT~e~L~daP~F~eI 87 (246) T PRK08858 8 EYQRIVVLDTETTGMNREGGPHYEGHRIIEIGAVEIINRKLTGRHFHVYLKPDREIQPEAIDVHGITDEFLVDKPEYKDV 87 (246) T ss_pred CCCEEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCEEECCEEEEEECCCCCCCHHHHHHCCCCHHHHHCCCCHHHH T ss_conf 60479999857899997656777898799999999999998221478888979969988997739289998359758999 Q ss_pred -HHHHHHHHCCCCCEEECCCCHHHHHHHHHHHC---CC------CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC Q ss_conf -67767773236402301000016888654200---00------000457898865211000000887776542002467 Q gi|254780636|r 64 -PNLVGMLVDEKREKIFHYGRFDIAVLFYTFGV---RV------RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISK 133 (207) Q Consensus 64 -~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi---~~------~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK 133 (207) +.+..++.+..+ |+||+.||+.+|.+.+.. .+ .+++||...|+...+ +.++||.+||++ +++.. T Consensus 88 ~~efl~FI~d~~L--VaHNa~FD~~FLn~El~r~~~~~~~~~~~~~viDTL~lAR~~~P-g~~~sLdaLc~r-~~Id~-- 161 (246) T PRK08858 88 HQEFLEFIKGAEL--VAHNAPFDVGFMDYEFEKLNPAIGKTDDYCKVTDTLAMAKKIFP-GKRNNLDVLCER-YGIDN-- 161 (246) T ss_pred HHHHHHHHCCCEE--EEECCHHHHHHHHHHHHHHCCCCCCHHHHCCCEEHHHHHHHHCC-CCCCCHHHHHHH-CCCCC-- T ss_conf 9999999589989--99722888999999999828776751342010029999998689-998888999998-19898-- Q ss_pred CCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 52003656544323689999864599999999999 Q gi|254780636|r 134 AQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFT 168 (207) Q Consensus 134 ~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~ 168 (207) |....+=|..|+..|-+.|-.|. T Consensus 162 ------------s~R~~HgALlDa~lLa~Vyl~mt 184 (246) T PRK08858 162 ------------SHRTLHGALLDAEILADVYLLMT 184 (246) T ss_pred ------------CCCCCCCHHHHHHHHHHHHHHHC T ss_conf ------------88340230888999999999972 No 47 >PRK07246 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=99.13 E-value=2.5e-09 Score=73.47 Aligned_cols=136 Identities=19% Similarity=0.223 Sum_probs=89.2 Q ss_pred CCCCEEEECCCCCCCCCCCEEEEEEEEE---CCCCE---EEECCCCCCCC------------------------CHHHHH Q ss_conf 8982799717778985445079999841---89428---99746577857------------------------067767 Q gi|254780636|r 19 YVDAIAVDTETLGLMPRRDRLCIVQLSP---GDGTV---DIIRIAAGQKN------------------------APNLVG 68 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~~~l~LiQl~~---~~~~~---~l~~~~~~~~~------------------------~~~L~~ 68 (207) ....+++|+||||.++. + .+||++. .++.. |--.++..... ++.+.+ T Consensus 6 ~~~~~vvDlETTG~~~~-~--~Iiqi~~v~~~~~~i~~~f~t~vnP~~~ip~~I~~LTgI~~~~v~~ap~f~~v~~~~~~ 82 (820) T PRK07246 6 LRKYAVVDLEATGAGPN-A--SIIQVGIVIIQGNKIIDSYETDVNPHESLDEHIVHLTGITDKQLAQAPDFSQVAHHIYQ 82 (820) T ss_pred CCCEEEEEEECCCCCCC-C--CEEEEEEEEEECCEEEEEEEECCCCCCCCCHHHHHHCCCCHHHHHCCCCHHHHHHHHHH T ss_conf 67189999887898999-8--77999999999999989987311889979978886469898998529998999999999 Q ss_pred HHHCCCCCEEECCCCHHHHHHHHHH---HCCC-CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCC Q ss_conf 7732364023010000168886542---0000-00045789886521100000088777654200246752003656544 Q gi|254780636|r 69 MLVDEKREKIFHYGRFDIAVLFYTF---GVRV-RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADD 144 (207) Q Consensus 69 ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~~-~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rp 144 (207) +++|. .-|+||+.||..+|...+ |..+ .+.+||.-.|+++-+...+|.|.+|++. ||+..+..+ T Consensus 83 ~l~~~--~~VaHNv~FD~~fl~~~~~~~g~~~~~~~~DTv~La~i~~P~~~~y~L~~L~~~-l~~~~~~~H--------- 150 (820) T PRK07246 83 LIEDC--IFVAHNVKFDANLLAEALFLEGYELRTPRVDTVELAQVFFPRLEKYSLSHLSRQ-LNIDLAEAH--------- 150 (820) T ss_pred HHCCC--EEEEECCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHH-CCCCCCCCC--------- T ss_conf 96798--599837201699999999975898899826099999997798788888999997-399999887--------- Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 32368999986459999999999999997491 Q gi|254780636|r 145 LSDEQLQYAASDVVHLHALRLQFTEKLQRLGR 176 (207) Q Consensus 145 Ls~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~ 176 (207) =|..||..+-+|+..+.+++..... T Consensus 151 -------rA~~Da~ata~l~~~l~~~l~~lp~ 175 (820) T PRK07246 151 -------TAIADARATAILFLRLLQKIESLPK 175 (820) T ss_pred -------CHHHHHHHHHHHHHHHHHHHHCCCH T ss_conf -------6188999999999999999861999 No 48 >PRK07009 consensus Probab=99.12 E-value=8.6e-10 Score=76.16 Aligned_cols=127 Identities=24% Similarity=0.338 Sum_probs=79.1 Q ss_pred CEEEECCCCCCCCCC-CEEEEEEEE---------ECCCCEEEECCCCCC-CC--------------CH-------HHHHH Q ss_conf 279971777898544-507999984---------189428997465778-57--------------06-------77677 Q gi|254780636|r 22 AIAVDTETLGLMPRR-DRLCIVQLS---------PGDGTVDIIRIAAGQ-KN--------------AP-------NLVGM 69 (207) Q Consensus 22 ~iaiDtEt~~l~~~~-~~l~LiQl~---------~~~~~~~l~~~~~~~-~~--------------~~-------~L~~l 69 (207) .|.+||||||+++.. +++ |+|+ +++..-.++.+.+.. .. .| .+.++ T Consensus 3 ~VvlDtETTGl~~~~gdRI--IEIg~v~l~~~~~t~~~~~~yiNP~r~I~~~a~~IHGIt~e~L~~kP~F~eia~efl~F 80 (241) T PRK07009 3 QIILDTETTGLNARTGDRI--IEIGCVELLNRRLTGNNLHFYVNPERDSDPGALAVHGLTTEFLSDKPKFAEVVDQIRDF 80 (241) T ss_pred EEEEEEECCCCCCCCCCEE--EEEEEEEEECCEEECEEEEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHHHHHHHHHHH T ss_conf 7999825799799999878--99999999999780406899878899799757651225799872585599999999999 Q ss_pred HHCCCCCEEECCCCHHHHHHHHHH---HCCC-C----CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCC Q ss_conf 732364023010000168886542---0000-0----0045789886521100000088777654200246752003656 Q gi|254780636|r 70 LVDEKREKIFHYGRFDIAVLFYTF---GVRV-R----PVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWS 141 (207) Q Consensus 70 l~d~~i~KV~Hn~~~D~~~L~~~~---gi~~-~----~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~ 141 (207) +.+..+ |+||+.||+.+|.+.+ |..+ . ++.||...|+-..+ +.+++|.+||+| +++.. T Consensus 81 i~~a~L--VaHNA~FD~~FLn~EL~r~G~~~~~~~~~~viDTL~lAR~~~P-g~~~sLdaLc~R-~~I~~---------- 146 (241) T PRK07009 81 VQDAEL--IIHNAPFDLGFLDAEFALLGLPPFTEHCAGVIDTLVQAKQMFP-GKRNSLDALCDR-FGISN---------- 146 (241) T ss_pred HCCCEE--EEECHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCC-CCCCCHHHHHHH-CCCCC---------- T ss_conf 669927--8867699999999999983997310244447679999999789-998788999998-28698---------- Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 544323689999864599999999999 Q gi|254780636|r 142 ADDLSDEQLQYAASDVVHLHALRLQFT 168 (207) Q Consensus 142 ~rpLs~~Qi~YAA~Dv~~l~~L~~~l~ 168 (207) +....+=|-.|+..|-+.|-.|. T Consensus 147 ----~~r~~HgAL~Da~lLa~vyl~mt 169 (241) T PRK07009 147 ----AHRTLHGALLDSELLAEVYLAMT 169 (241) T ss_pred ----CCCCCCCCHHHHHHHHHHHHHHH T ss_conf ----87330242877999999999983 No 49 >PRK05711 DNA polymerase III subunit epsilon; Provisional Probab=99.09 E-value=1.2e-09 Score=75.42 Aligned_cols=129 Identities=27% Similarity=0.344 Sum_probs=77.6 Q ss_pred CCEEEECCCCCCCCCC-CEEEEEEEE---------ECCCCEEEECCCCCC---------------CCCH-------HHHH Q ss_conf 8279971777898544-507999984---------189428997465778---------------5706-------7767 Q gi|254780636|r 21 DAIAVDTETLGLMPRR-DRLCIVQLS---------PGDGTVDIIRIAAGQ---------------KNAP-------NLVG 68 (207) Q Consensus 21 ~~iaiDtEt~~l~~~~-~~l~LiQl~---------~~~~~~~l~~~~~~~---------------~~~~-------~L~~ 68 (207) ..|.+||||||++|.. +++ |+|+ +++.-..++.+.+.. ..+| .+.. T Consensus 5 R~VvlDtETTGl~~~~gdRI--IEIgav~~~~~~~t~~~f~~~inP~r~i~~~a~~IhGIT~e~L~~~P~F~ei~~efl~ 82 (240) T PRK05711 5 RQIVLDTETTGLNQREGHRI--IEIGAVELINRRLTGRNFHVYIKPDRDVDPEALAVHGITDEFLADKPTFAEVADEFLD 82 (240) T ss_pred CEEEEEEECCCCCCCCCCEE--EEEEEEEEECCEECCCEEEEEECCCCCCCHHHHHHCCCCHHHHCCCCCHHHHHHHHHH T ss_conf 28999866899699999878--9999999999988143578998968959977875408688996458759999999999 Q ss_pred HHHCCCCCEEECCCCHHHHHHHHHH---HCC---CC---CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECC Q ss_conf 7732364023010000168886542---000---00---00457898865211000000887776542002467520036 Q gi|254780636|r 69 MLVDEKREKIFHYGRFDIAVLFYTF---GVR---VR---PVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSD 139 (207) Q Consensus 69 ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~---~~---~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~Sd 139 (207) ++.+. .-|+||+.||+.+|.+.+ |.. .. +++||.-.|+-.-+ +.++||.+||++ +|+.. T Consensus 83 Fi~~s--~lVaHNa~FD~~FLn~El~r~g~~~~~~~~~~~viDTL~lAR~~~P-g~~~sLdaLc~r-~~I~~-------- 150 (240) T PRK05711 83 FIRGA--ELIIHNAPFDIGFMDYEFALLGRDIPKTTTFCKVTDTLAMARRMFP-GKRNSLDALCKR-YGIDN-------- 150 (240) T ss_pred HHCCC--EEEEECCHHHHHHHHHHHHHHCCCCCCHHHHCCHHHHHHHHHHHCC-CCCCCHHHHHHH-CCCCC-------- T ss_conf 81899--7999661998999999999868998764452121019999998789-998899999998-09898-------- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 565443236899998645999999999999 Q gi|254780636|r 140 WSADDLSDEQLQYAASDVVHLHALRLQFTE 169 (207) Q Consensus 140 W~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~ 169 (207) +.|. .+=|..|+..|-++|-.|.. T Consensus 151 -s~R~-----~HgAL~Da~llA~Vyl~l~g 174 (240) T PRK05711 151 -SHRT-----LHGALLDAEILAEVYLAMTG 174 (240) T ss_pred -CCCC-----CCCHHHHHHHHHHHHHHHCC T ss_conf -8756-----57648889999999999717 No 50 >PRK08816 consensus Probab=99.08 E-value=1.5e-09 Score=74.74 Aligned_cols=128 Identities=23% Similarity=0.309 Sum_probs=76.3 Q ss_pred CCEEEECCCCCCCCCC-CEEEEEEEE---------ECCCCEEEECCCCCC-CC--------------C-------HHHHH Q ss_conf 8279971777898544-507999984---------189428997465778-57--------------0-------67767 Q gi|254780636|r 21 DAIAVDTETLGLMPRR-DRLCIVQLS---------PGDGTVDIIRIAAGQ-KN--------------A-------PNLVG 68 (207) Q Consensus 21 ~~iaiDtEt~~l~~~~-~~l~LiQl~---------~~~~~~~l~~~~~~~-~~--------------~-------~~L~~ 68 (207) ..|.+||||||++|.. +++ |+|+ +++.--.++.+.+.. .. . +.+.. T Consensus 2 R~VVlDtETTGl~~~~gdRI--IEIgaV~~~~~~~~~~~f~~~inP~r~I~~~a~~vhGIT~e~L~dkP~F~eV~~efl~ 79 (244) T PRK08816 2 RQIILDTETTGLEWRKGNRV--VEIGAVELLERRPSGNNFHRYLKPDCDFEPGAQEVTGLTLEFLADKPLFGEVVDEFLA 79 (244) T ss_pred CEEEEEEECCCCCCCCCCEE--EEEEEEEEECCEECCCEEEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHHHHHHHHHH T ss_conf 87999722799899899889--9999999999968143789986889969988987525178998409989999999999 Q ss_pred HHHCCCCCEEECCCCHHHHHHHHHHHC---C------CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECC Q ss_conf 773236402301000016888654200---0------0000457898865211000000887776542002467520036 Q gi|254780636|r 69 MLVDEKREKIFHYGRFDIAVLFYTFGV---R------VRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSD 139 (207) Q Consensus 69 ll~d~~i~KV~Hn~~~D~~~L~~~~gi---~------~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~Sd 139 (207) ++.+.. -|+||+.||+.+|-+.+.. . ..++.||...++-..+ +.+.||.+||+|| ++.. T Consensus 80 Fi~ds~--LVaHNa~FDi~FLn~El~r~g~~~~~~~~~~~v~DTl~lAr~~~P-g~~nsLDaLc~r~-~id~-------- 147 (244) T PRK08816 80 YIDGAE--LIIHNAAFDLGFLDNELSLLGDQYGRIVDRATVVDTLMMARERYP-GQRNSLDALCKRL-GVDN-------- 147 (244) T ss_pred HHCCCE--EEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHCC-CCCCCHHHHHHHC-CCCC-------- T ss_conf 828998--999663652989999999848875676431010438999998787-9977899999974-9788-------- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 56544323689999864599999999999 Q gi|254780636|r 140 WSADDLSDEQLQYAASDVVHLHALRLQFT 168 (207) Q Consensus 140 W~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~ 168 (207) +....+=|..|+..|-+.|-.|. T Consensus 148 ------~~R~~HgALlDa~lLa~VYl~mt 170 (244) T PRK08816 148 ------SHRQLHGALLDAQILADVYIALT 170 (244) T ss_pred ------CCCCCCCHHHHHHHHHHHHHHHC T ss_conf ------77350148887999999999974 No 51 >cd06131 DNA_pol_III_epsilon_Ecoli_like This subfamily is composed of the epsilon subunit of Escherichia coli DNA polymerase III (polIII) and similar proteins. PolIII is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. It is a holoenzyme complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The polIII epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon subunit of polIII also functions as a stabilizer of the holoenzyme complex. Probab=99.07 E-value=1.3e-09 Score=75.14 Aligned_cols=125 Identities=25% Similarity=0.320 Sum_probs=75.4 Q ss_pred EEEECCCCCCCCCC-CEEEEEEEEE-----C---CCCE-EEECCCCCC-C--------------CC-------HHHHHHH Q ss_conf 79971777898544-5079999841-----8---9428-997465778-5--------------70-------6776777 Q gi|254780636|r 23 IAVDTETLGLMPRR-DRLCIVQLSP-----G---DGTV-DIIRIAAGQ-K--------------NA-------PNLVGML 70 (207) Q Consensus 23 iaiDtEt~~l~~~~-~~l~LiQl~~-----~---~~~~-~l~~~~~~~-~--------------~~-------~~L~~ll 70 (207) |.||+||||++|.+ +++ |||+. + ...+ .++...... . .+ +.+.+++ T Consensus 2 Vv~D~ETTGl~p~~~d~I--ieigav~~~~~~~~~~~~~~lv~P~~~i~~~~~~ihGIt~e~l~~~p~~~ev~~~f~~~l 79 (167) T cd06131 2 IVLDTETTGLDPREGHRI--IEIGCVELINRRLTGNTFHVYINPERDIPEEAFKVHGITDEFLADKPKFAEIADEFLDFI 79 (167) T ss_pred EEEECCCCCCCCCCCCEE--EEEEEEEEECCEEECCEEEEEECCCCCCCHHHHHHHCCCHHHHCCCCCHHHHHHHHHHHC T ss_conf 899760698898899837--999999999999924478999897898998899780968589835975899999998635 Q ss_pred HCCCCCEEECCCCHHHHHHHHHH---HCCC-----CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCC Q ss_conf 32364023010000168886542---0000-----000457898865211000000887776542002467520036565 Q gi|254780636|r 71 VDEKREKIFHYGRFDIAVLFYTF---GVRV-----RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSA 142 (207) Q Consensus 71 ~d~~i~KV~Hn~~~D~~~L~~~~---gi~~-----~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~ 142 (207) .+. .-|+||++||..+|...+ |... .+++||+..++..-+ ..+++|.+++++ +|+..+ T Consensus 80 ~~~--~lVaHN~~FD~~fL~~~~~r~~~~~~~~~~~~~~Dtl~l~r~~~p-~~~~~L~~l~~~-~gi~~~---------- 145 (167) T cd06131 80 RGA--ELVIHNASFDVGFLNAELSLLGLGKKIIDFCRVIDTLALARKKFP-GKPNSLDALCKR-FGIDNS---------- 145 (167) T ss_pred CCC--EEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCC-CCCCCHHHHHHH-CCCCCC---------- T ss_conf 699--999837187899999999981899777765541139999999849-998889999987-697999---------- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4432368999986459999999999 Q gi|254780636|r 143 DDLSDEQLQYAASDVVHLHALRLQF 167 (207) Q Consensus 143 rpLs~~Qi~YAA~Dv~~l~~L~~~l 167 (207) ..+-+=|..||..+-++|-+| T Consensus 146 ----~~~~H~Al~Da~~ta~vf~~l 166 (167) T cd06131 146 ----HRTLHGALLDAELLAEVYLEL 166 (167) T ss_pred ----CCCCCCHHHHHHHHHHHHHHH T ss_conf ----998758899999999999976 No 52 >PRK06807 DNA polymerase III subunit epsilon; Validated Probab=99.07 E-value=2.9e-09 Score=73.05 Aligned_cols=133 Identities=20% Similarity=0.289 Sum_probs=84.7 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEEE---CCCC----E-EEECCCCC-C--------------CCCH-------HHHHH Q ss_conf 982799717778985445079999841---8942----8-99746577-8--------------5706-------77677 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRLCIVQLSP---GDGT----V-DIIRIAAG-Q--------------KNAP-------NLVGM 69 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l~LiQl~~---~~~~----~-~l~~~~~~-~--------------~~~~-------~L~~l 69 (207) .+.++||+||||++|.+++++ +++. .++. + .++.+... + .++| .+.++ T Consensus 8 ~~fVV~DiETTGl~p~~D~II--EIgAVkv~ng~Ivd~f~tLInP~~~Ip~~It~ItGIT~emV~dAP~f~EVl~~f~~F 85 (313) T PRK06807 8 LDYVVIDFETTGFNPYNDKII--QVAAVKYRNHELVDQFVSYVNPERPIPDRITSLTGITNYRVSDAPTIEEVLPLFLAF 85 (313) T ss_pred CCEEEEEEECCCCCCCCCEEE--EEEEEEEECCEEEEEEEEEECCCCCCCHHHHHHCCCCHHHHHCCCCHHHHHHHHHHH T ss_conf 878999978899799998789--999999999979899999989489799667653397999982699899999999987 Q ss_pred HHCCCCCEEECCCCHHHHHHHHHH---HC-CCCC-HHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCC Q ss_conf 732364023010000168886542---00-0000-045789886521100000088777654200246752003656544 Q gi|254780636|r 70 LVDEKREKIFHYGRFDIAVLFYTF---GV-RVRP-VFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADD 144 (207) Q Consensus 70 l~d~~i~KV~Hn~~~D~~~L~~~~---gi-~~~~-i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rp 144 (207) +.+ .++ |+||+.||+.+|...+ |. .+.+ +.||.-.|+.+-+...+|.|..+++ +||++++ T Consensus 86 i~d-~iL-VAHNA~FD~~FL~~~L~r~gl~~~~n~~IDTL~LAR~l~P~L~NhKLnTVa~-~LgI~l~------------ 150 (313) T PRK06807 86 LHT-NVI-VAHNASFDMRFLKSNVNMLGLPEPKNKVIDTVFLAKKYMKHAPNHKLETLKR-MLGIRLS------------ 150 (313) T ss_pred CCC-CEE-EEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHH-HHCCCCC------------ T ss_conf 389-989-9955366199999999974998889987765999998647876566789999-8399834------------ Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 323689999864599999999999999974 Q gi|254780636|r 145 LSDEQLQYAASDVVHLHALRLQFTEKLQRL 174 (207) Q Consensus 145 Ls~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~ 174 (207) -+-|..||..+-+||.++..+-+.. T Consensus 151 -----HHrAldDA~acAeI~~k~~~~~~~~ 175 (313) T PRK06807 151 -----SHNAFDDCITCAAVYQKCASIEEEA 175 (313) T ss_pred -----CCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf -----3671778999999999999988616 No 53 >cd06125 DnaQ_like_exo The DnaQ-like exonuclease superfamily is a structurally conserved group of 3'-5' exonucleases, which catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. It is also called the DEDD superfamily, after the four invariant acidic residues present in the catalytic site of its members. The superfamily consists of DNA- and RNA-processing enzymes such as the proofreading domains of DNA polymerases, other DNA exonucleases, RNase D, RNase T, Oligoribonuclease and RNA exonucleases (REX). The DnaQ-like exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, which are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The conservation patterns of the three motifs may vary among different subfamilies. DnaQ-like exonucleases are classified as DEDDy or DEDDh exonucleases depending on the variation of motif Probab=99.05 E-value=2.5e-10 Score=79.32 Aligned_cols=85 Identities=25% Similarity=0.235 Sum_probs=56.0 Q ss_pred EEEECCCCCCCC-CCCEEEEEEEEEC-CCCEEEECCCCCCC------------------------CCHHHHHHHHCCCCC Q ss_conf 799717778985-4450799998418-94289974657785------------------------706776777323640 Q gi|254780636|r 23 IAVDTETLGLMP-RRDRLCIVQLSPG-DGTVDIIRIAAGQK------------------------NAPNLVGMLVDEKRE 76 (207) Q Consensus 23 iaiDtEt~~l~~-~~~~l~LiQl~~~-~~~~~l~~~~~~~~------------------------~~~~L~~ll~d~~i~ 76 (207) |+||+||||+++ .+++++-|+.... .+..+...++.... ..+.+.++++++... T Consensus 1 Vv~D~ETTGl~~~~~d~Iiei~~v~~~~~~~~~~~v~p~~~i~~~~~~i~gi~~~~v~~~p~~~~~~~~~~~f~~~~~~i 80 (130) T cd06125 1 VAFDIETTGLNGNARDEIIEIALVVVHIGEKFVFDVKPDDLIDPYAFTISGITDEVVERAATEAELLIELLAFLRDKDDI 80 (130) T ss_pred CEEEEECCCCCCCCCCEEEEEEEEEEECCCEEEEEECCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 98980189969999985899999997189889999799998997794103759999833060899999999997699999 Q ss_pred EEECCCCHHHHHHHHHHH---CC----CCCHHHHHHHH Q ss_conf 230100001688865420---00----00004578988 Q gi|254780636|r 77 KIFHYGRFDIAVLFYTFG---VR----VRPVFCTKIAS 107 (207) Q Consensus 77 KV~Hn~~~D~~~L~~~~g---i~----~~~i~DT~ias 107 (207) .||||++||+.+|.+.+. +. ..+++||...| T Consensus 81 lVgHN~~FD~~fL~~~~~r~g~~~~~~~~~~iDTl~la 118 (130) T cd06125 81 LVGHNISFDLPYLLNRAAELGIPYPILGRIVFDTLALA 118 (130) T ss_pred EEEECHHHHHHHHHHHHHHCCCCCCCCCCCEEECHHHH T ss_conf 99919477499999999984999987899686860984 No 54 >PRK07982 consensus Probab=99.02 E-value=3e-09 Score=73.04 Aligned_cols=130 Identities=19% Similarity=0.227 Sum_probs=79.0 Q ss_pred CCEEEECCCCCCCCC---CCEEEEEEEE---------ECCCCEEEECCCCCCC---------------CC-------HHH Q ss_conf 827997177789854---4507999984---------1894289974657785---------------70-------677 Q gi|254780636|r 21 DAIAVDTETLGLMPR---RDRLCIVQLS---------PGDGTVDIIRIAAGQK---------------NA-------PNL 66 (207) Q Consensus 21 ~~iaiDtEt~~l~~~---~~~l~LiQl~---------~~~~~~~l~~~~~~~~---------------~~-------~~L 66 (207) ..|.+||||||++|. +..=.+|+|+ +++.--.++.+.+... +. +.+ T Consensus 7 R~ivlDtETTGL~p~~~~~~gdRIIEIgaVei~~~~~t~~~fh~yiNP~r~I~~~a~~VHGIT~e~L~dkP~F~eI~~ef 86 (243) T PRK07982 7 RQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVINRRLTGNNFHVYLKPDRLVDPEAFGVHGIADEFLLDKPTFADVADEF 86 (243) T ss_pred EEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEECCEECCCEEEEEECCCCCCCHHHHHHEEECHHHHHCCCCHHHHHHHH T ss_conf 58999711799998776778987999999999999882523437888899799889852115778871699899999999 Q ss_pred HHHHHCCCCCEEECCCCHHHHHHHHHHHC---C--C-C---CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEE Q ss_conf 67773236402301000016888654200---0--0-0---004578988652110000008877765420024675200 Q gi|254780636|r 67 VGMLVDEKREKIFHYGRFDIAVLFYTFGV---R--V-R---PVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQS 137 (207) Q Consensus 67 ~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi---~--~-~---~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~ 137 (207) ..++.+..+ |+||+.||+.+|-+.+.. . + . ++.||...|+-..+ +.++||.+||+| +++.. T Consensus 87 l~Fi~~a~L--VaHNA~FD~~FLn~El~r~g~~~~~~~~~~~v~DTL~lAR~~~P-g~~~sLDaLc~R-~~Id~------ 156 (243) T PRK07982 87 LDYIRGAEL--VIHNAAFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLALARKMFP-GKRNSLDALCDR-YEIDN------ 156 (243) T ss_pred HHHHCCCEE--EEECCHHHHHHHHHHHHHCCCCCCCCCCCCCEEEHHHHHHHHCC-CCCCCHHHHHHH-CCCCC------ T ss_conf 999679989--99733888999999999708898861012431349999888788-986899999875-48776------ Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3656544323689999864599999999999 Q gi|254780636|r 138 SDWSADDLSDEQLQYAASDVVHLHALRLQFT 168 (207) Q Consensus 138 SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~ 168 (207) +....+=|..|+..|-+.|-.|. T Consensus 157 --------s~R~~HgALlDa~lLAeVYl~lt 179 (243) T PRK07982 157 --------SKRTLHGALLDAQILAEVYLAMT 179 (243) T ss_pred --------CCCCCCCHHHHHHHHHHHHHHHC T ss_conf --------56552333774889999999972 No 55 >TIGR01406 dnaQ_proteo DNA polymerase III, epsilon subunit; InterPro: IPR006309 These represent DnaQ, the DNA polymerase III epsilon subunit, as found in most Proteobacteria. It consists largely of an exonuclease region as described in IPR006055 from INTERPRO. In Gram-positive bacteria, closely related regions are found both in the Gram-positive type DNA polymerase III alpha subunit and as an additional N-terminal domain of a DinG-family helicase. Both are not included in this set of sequences, as they are smaller proteins.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=99.01 E-value=4.6e-09 Score=71.92 Aligned_cols=130 Identities=26% Similarity=0.345 Sum_probs=84.0 Q ss_pred CEEEECCCCCCCCCC-----CEEEEEEEE---------ECCCCEEEECCCCCC-----------------CCCHH----- Q ss_conf 279971777898544-----507999984---------189428997465778-----------------57067----- Q gi|254780636|r 22 AIAVDTETLGLMPRR-----DRLCIVQLS---------PGDGTVDIIRIAAGQ-----------------KNAPN----- 65 (207) Q Consensus 22 ~iaiDtEt~~l~~~~-----~~l~LiQl~---------~~~~~~~l~~~~~~~-----------------~~~~~----- 65 (207) .|=+||||||+++-. + =++|-|+ |+++-=+|+.+.+.. ..+|. T Consensus 2 ~IiLDTETTGl~~~~G~~~~G-hRivEIGavE~vnr~~tG~~fH~Y~~P~Rd~p~eA~kVHGiT~efL~dKnaP~Fk~Ia 80 (246) T TIGR01406 2 EIILDTETTGLDPKGGLPYIG-HRIVEIGAVELVNRMLTGKNFHVYVKPERDIPAEAAKVHGITDEFLKDKNAPKFKEIA 80 (246) T ss_pred EEEECCCCCCCCCCCCCCCCC-CEEEEEEEEEEECCCCCCCCEEEEECCCCCCHHHCEECCCCCHHHHCCCCCCCHHHHH T ss_conf 267115556877667768568-6268882185673503887312433688877300210137646884688989525789 Q ss_pred --HHHHHHCCCCCEEECCCCHHHHHHHHH---HH---CC----CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC Q ss_conf --767773236402301000016888654---20---00----0000457898865211000000887776542002467 Q gi|254780636|r 66 --LVGMLVDEKREKIFHYGRFDIAVLFYT---FG---VR----VRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISK 133 (207) Q Consensus 66 --L~~ll~d~~i~KV~Hn~~~D~~~L~~~---~g---i~----~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK 133 (207) +..++.+... |.|||.||+.||-+. +| +. ..++.||..+|+-. =++.+.||.+||+| ||++ T Consensus 81 ~~Fl~Fi~~~~L--vIHNA~FD~gFlN~EL~~Lg~~~~~~~~~~~~~~DTL~mAR~~-FPG~~~sLDALCkR-f~vD--- 153 (246) T TIGR01406 81 DEFLDFIKGSEL--VIHNAAFDVGFLNYELERLGLKEIKKISDFCRVIDTLAMARER-FPGKRNSLDALCKR-FKVD--- 153 (246) T ss_pred HHHHHHHCCCEE--EEECCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHC-CCCCCCCHHHHHHH-HCCC--- T ss_conf 999998459889--9975702167899999872873210001366786568999723-78986646787743-0425--- Q ss_pred CCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5200365654432368999986459999999999999 Q gi|254780636|r 134 AQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEK 170 (207) Q Consensus 134 ~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~ 170 (207) .|..|++=|-.||..|=++|-.|..- T Consensus 154 -----------ns~R~LHGAL~DA~LLAeVYl~ltGG 179 (246) T TIGR01406 154 -----------NSHRQLHGALLDAELLAEVYLALTGG 179 (246) T ss_pred -----------CCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf -----------00021347899999999999986288 No 56 >cd06134 RNaseT RNase T is a DEDDh-type DnaQ-like 3'-5' exoribonuclease E implicated in the 3' maturation of small stable RNAs and 23srRNA, and in the end turnover of tRNA. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. RNase T is related to the proofreading domain of DNA polymerase III. Despite its important role, RNase T is mainly found only in gammaproteobacteria. It is speculated that it might have originated from DNA polymerase III at the time the gamma division of proteobacteria diverged from other bacteria. RNase T is a homodimer with the catalytic residues of one monomer contacting a large basic patch on the other monomer to form a functional active site. Probab=99.01 E-value=3.6e-09 Score=72.53 Aligned_cols=132 Identities=22% Similarity=0.299 Sum_probs=77.6 Q ss_pred CCCCEEEECCCCCCCCCCCEEEEEEEEE------CCC------CE-EEECCCCC-------------C---------CCC Q ss_conf 8982799717778985445079999841------894------28-99746577-------------8---------570 Q gi|254780636|r 19 YVDAIAVDTETLGLMPRRDRLCIVQLSP------GDG------TV-DIIRIAAG-------------Q---------KNA 63 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~~~l~LiQl~~------~~~------~~-~l~~~~~~-------------~---------~~~ 63 (207) +--.|.|||||||++|.++++ |||+. +.+ .+ ..+.+... . ... T Consensus 4 ~~~~VV~D~ETTGL~p~~d~I--IEIgav~v~~d~~g~i~~~~~~~~~v~P~~~~~i~~~~~~itGIt~~~~~~~~~~~~ 81 (189) T cd06134 4 GFLPVVVDVETGGFNPQTDAL--LEIAAVTLEMDEQGNLYPDETFHFHILPFEGANLDPAALEFNGIDPFHPFRFAVDEK 81 (189) T ss_pred CCCCEEEEEECCCCCCCCCEE--EEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCHHHHHHCCCCCHHHHCCCCCHH T ss_conf 985079995289999889807--999999999768785324516899778988998985566550777001100167577 Q ss_pred HHHHHHHH---C----C---CCCEEECCCCHHHHHHHHHH---HCC--C-CC--HHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 67767773---2----3---64023010000168886542---000--0-00--04578988652110000008877765 Q gi|254780636|r 64 PNLVGMLV---D----E---KREKIFHYGRFDIAVLFYTF---GVR--V-RP--VFCTKIASRLTRTYTNQHGLKDNLKE 125 (207) Q Consensus 64 ~~L~~ll~---d----~---~i~KV~Hn~~~D~~~L~~~~---gi~--~-~~--i~DT~ias~l~~~~~~~~~L~~L~~~ 125 (207) +.+.+++. + . +-+-|+||+.||+.+|...+ |+. | .+ ++||+-.|++. + ++++|..++++ T Consensus 82 ~~~~~~~~~i~~~~~~~~~~~avlVaHNa~FD~~FL~~~~~r~g~~~~p~~~~~~~DTl~La~~~--~-~~~~L~~l~~~ 158 (189) T cd06134 82 EALKEIFKPIRKALKAQGCTRAILVGHNAHFDLGFLNAAVARCKIKRNPFHPFSTFDTATLAGLA--Y-GQTVLAKACQA 158 (189) T ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHH--H-CCCCHHHHHHH T ss_conf 88999999999998751675556875177772999999999849988888777656699998677--3-80019999998 Q ss_pred HHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 420024675200365654432368999986459999999999999 Q gi|254780636|r 126 LLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEK 170 (207) Q Consensus 126 ~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~ 170 (207) +|+.++ ..+-+=|..||.+|-+||.+|..+ T Consensus 159 -~gi~~~--------------~~~~HrAl~DA~~ta~lf~~l~~k 188 (189) T cd06134 159 -AGIEFD--------------NKEAHSALYDTQKTAELFCKIVNR 188 (189) T ss_pred -CCCCCC--------------CCCCCCCHHHHHHHHHHHHHHHHH T ss_conf -799999--------------987887299999999999999964 No 57 >PRK00448 polC DNA polymerase III PolC; Validated Probab=99.01 E-value=8.1e-09 Score=70.50 Aligned_cols=145 Identities=22% Similarity=0.351 Sum_probs=94.5 Q ss_pred HHHCCCCCCEEEECCCCCCCCCCCEEEEEEEEEC---CCCE-----EEECCCC---------------CCCCC------- Q ss_conf 7842489827997177789854450799998418---9428-----9974657---------------78570------- Q gi|254780636|r 14 ECAARYVDAIAVDTETLGLMPRRDRLCIVQLSPG---DGTV-----DIIRIAA---------------GQKNA------- 63 (207) Q Consensus 14 ~~~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~---~~~~-----~l~~~~~---------------~~~~~------- 63 (207) +..+..+..|.||.||||+++.++++ |+++.. ++.. .++.+.. +-..+ T Consensus 412 ~~~l~~~~yVvFDiETTGls~~~d~I--iEigAvki~~g~iid~f~~fi~P~~~i~~~i~~lT~Itd~mv~~~~~~~evl 489 (1436) T PRK00448 412 DRDLKDATYVVFDVETTGLSAVYDEI--IEIGAVKIKNGEIIDKFEEFIKPGHPLSAFTTELTGITDDMVKDAPSIEEVL 489 (1436) T ss_pred CCCCCCCCEEEEEECCCCCCCCCCCE--EEEEEEEEECCEEEEEEHHHCCCCCCCCHHHHHCCCCCHHHHCCCCCHHHHH T ss_conf 77467663799860068888666723--6875798637817541021038999797545321478847863898789999 Q ss_pred HHHHHHHHCCCCCEEECCCCHHHHHHHHHH---HCC-C-CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEEC Q ss_conf 677677732364023010000168886542---000-0-00045789886521100000088777654200246752003 Q gi|254780636|r 64 PNLVGMLVDEKREKIFHYGRFDIAVLFYTF---GVR-V-RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSS 138 (207) Q Consensus 64 ~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~-~-~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~S 138 (207) +.+.++..| . +-|+|||.||+.++...+ |+. + .++.||.-.|+.+.+...+|+|..++++ +++.++.. T Consensus 490 ~~f~~f~~d-~-vlVAHNa~FD~~Fi~~~~~~~~~~~~~~p~iDTl~lsr~l~p~~k~~~L~~l~k~-~~v~~~~h---- 562 (1436) T PRK00448 490 PKFKEFCGD-S-VLVAHNASFDVGFINTNYEKLGLEKIKNPVIDTLELSRFLYPEYKSHRLNTLAKK-FGVELEHH---- 562 (1436) T ss_pred HHHHHHHCC-C-EEEEECCCCCHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHH-HCCCCCCC---- T ss_conf 999998289-7-7998458656889999999808986679723029989875510111447889988-48886545---- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 65654432368999986459999999999999997491899 Q gi|254780636|r 139 DWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLGRSDL 179 (207) Q Consensus 139 dW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~~ 179 (207) +=|..|+..|-.+|..+.+++.+.+...+ T Consensus 563 ------------HRA~~Da~~t~~i~~~~l~~~~~~~i~~~ 591 (1436) T PRK00448 563 ------------HRADYDAEATAYLLIKFLKDLKEKGITNL 591 (1436) T ss_pred ------------CCCCCCHHHHHHHHHHHHHHHHHCCCCCH T ss_conf ------------44322388899999999999987487437 No 58 >PRK07883 hypothetical protein; Validated Probab=98.99 E-value=1.4e-08 Score=69.15 Aligned_cols=143 Identities=22% Similarity=0.274 Sum_probs=90.1 Q ss_pred HCCCCCCEEEECCCCCCCCCCCEEEEEEEEE---CCCCE-----EEECCCCC---------------CCCCH-------H Q ss_conf 4248982799717778985445079999841---89428-----99746577---------------85706-------7 Q gi|254780636|r 16 AARYVDAIAVDTETLGLMPRRDRLCIVQLSP---GDGTV-----DIIRIAAG---------------QKNAP-------N 65 (207) Q Consensus 16 ~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~---~~~~~-----~l~~~~~~---------------~~~~~-------~ 65 (207) .+.....+.||+||||+++..++| |+|+. .++.+ -++..... -..+| . T Consensus 20 pL~~~tfVVvDlETTG~s~~~d~I--iEIgAVkv~~g~vv~~f~tLVnP~~~Ip~~I~~LTGIt~~mV~~AP~~~~vl~~ 97 (575) T PRK07883 20 PLRDTTFVVVDLETTGGSPTGDAI--TEIGAVKVRGGEVLGEFATLVNPGRSIPPFIVVLTGITTAMVADAPPIDAVLPA 97 (575) T ss_pred CCCCCEEEEEEEECCCCCCCCCEE--EEEEEEEEECCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCCCHHHHHHH T ss_conf 356784899995079999889705--999999998999999971001938989854400369698998079999999999 Q ss_pred HHHHHHCCCCCEEECCCCHHHHHHHHHH---HCC-CCC-HHHHHHHHHHHHH--HCCCHHHHHHHHHHHCCCCCCCCEEC Q ss_conf 7677732364023010000168886542---000-000-0457898865211--00000088777654200246752003 Q gi|254780636|r 66 LVGMLVDEKREKIFHYGRFDIAVLFYTF---GVR-VRP-VFCTKIASRLTRT--YTNQHGLKDNLKELLGINISKAQQSS 138 (207) Q Consensus 66 L~~ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~-~~~-i~DT~ias~l~~~--~~~~~~L~~L~~~~lg~~ldK~~q~S 138 (207) +.+++.+ .+-|+||+.||+.+|...+ |+. +.| +.||.-.|+.+-+ ..++|+|.+|+++ ||+...-. T Consensus 98 f~~F~~~--~vlVAHNa~FD~~FL~~~~~r~g~~~~~~~~ldTl~LARrl~p~~e~~~~kL~tLa~~-~g~~~~~~---- 170 (575) T PRK07883 98 FLEFARG--SVLVAHNAPFDIGFLRAAAERCGYPWPQPEVLCTVRLARRVLSRDEAPNVRLSALARL-FGAATTPT---- 170 (575) T ss_pred HHHHHCC--CEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCCCHHHHHHH-CCCCCCCC---- T ss_conf 9999689--9899908345299999999975999878978648999998669211468998999997-59999998---- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 65654432368999986459999999999999997491899 Q gi|254780636|r 139 DWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLGRSDL 179 (207) Q Consensus 139 dW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~~ 179 (207) +=|..|+..+..++..|.+++...+...+ T Consensus 171 ------------HRAl~DA~AT~~vl~~l~~~~~~~gv~t~ 199 (575) T PRK07883 171 ------------HRALDDARATVDVLHGLIERLGNQGVHTL 199 (575) T ss_pred ------------CCHHHHHHHHHHHHHHHHHHHHHCCCCCH T ss_conf ------------66045299999999999998542242209 No 59 >PRK06859 consensus Probab=98.99 E-value=6.6e-09 Score=71.01 Aligned_cols=132 Identities=23% Similarity=0.322 Sum_probs=79.0 Q ss_pred CCCCCEEEECCCCCCCCCC---CEEEEEEEE---------ECCCCE-EEECCCCCCC---------------CCH----- Q ss_conf 4898279971777898544---507999984---------189428-9974657785---------------706----- Q gi|254780636|r 18 RYVDAIAVDTETLGLMPRR---DRLCIVQLS---------PGDGTV-DIIRIAAGQK---------------NAP----- 64 (207) Q Consensus 18 ~~~~~iaiDtEt~~l~~~~---~~l~LiQl~---------~~~~~~-~l~~~~~~~~---------------~~~----- 64 (207) .....|.+||||||++|.. ..=.+|.|+ +++ .+ .++.+.+... .+| T Consensus 3 n~~R~VvlDtETTGl~p~~~~~~g~RIIEIgaVel~~~~~t~~-~f~~yinP~r~I~~~a~~vhGIT~e~L~~kP~F~ei 81 (253) T PRK06859 3 NPNRQIVLDTETTGMNQFGAHYEGHCIIEIGAVELINRRYTGN-NFHIYIKPDRPVDPDAIKVHGITDEMLADKPEFKEV 81 (253) T ss_pred CCCCEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEECCEECCC-EEEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHHHH T ss_conf 8883799972179999876666898689999999999978240-467887869949988998745089998369979999 Q ss_pred --HHHHHHHCCCCCEEECCCCHHHHHHHHHH---HCCC--C---CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCC Q ss_conf --77677732364023010000168886542---0000--0---004578988652110000008877765420024675 Q gi|254780636|r 65 --NLVGMLVDEKREKIFHYGRFDIAVLFYTF---GVRV--R---PVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKA 134 (207) Q Consensus 65 --~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~~--~---~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~ 134 (207) .+..++.+..+ |+||+.||+.+|.+.+ |..+ . .+.||...|+-..+ +.+.||.+||+|| ++.. T Consensus 82 a~efl~Fi~~a~L--VaHNa~FD~~FLn~El~r~g~~~~~~~~~~viDTL~lAR~~~P-g~~~sLdaLc~r~-~id~--- 154 (253) T PRK06859 82 AQEFIDYIKGAEL--LIHNAPFDVGFMDYEFRKLNLNVKTDDICLVTDTLQMARQMYP-GKRNNLDALCDRL-GIDN--- 154 (253) T ss_pred HHHHHHHHCCCEE--EEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHCC-CCCCCHHHHHHHC-CCCC--- T ss_conf 9999999679979--9957188899999999974999653220011339999998787-9977999999973-9987--- Q ss_pred CEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2003656544323689999864599999999999 Q gi|254780636|r 135 QQSSDWSADDLSDEQLQYAASDVVHLHALRLQFT 168 (207) Q Consensus 135 ~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~ 168 (207) +.| ..+=|..|+..|-+.|-.|. T Consensus 155 ------~~R-----~~HgAllDa~lLa~VYl~mt 177 (253) T PRK06859 155 ------SKR-----TLHGALLDAEILADVYLMMT 177 (253) T ss_pred ------CCC-----CCCCHHHHHHHHHHHHHHHC T ss_conf ------663-----30132888999999999872 No 60 >PRK05168 ribonuclease T; Provisional Probab=98.98 E-value=6.2e-09 Score=71.18 Aligned_cols=140 Identities=19% Similarity=0.240 Sum_probs=86.4 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEEEC------C------CCE-EEECCCC-------------C---------CCCCH Q ss_conf 9827997177789854450799998418------9------428-9974657-------------7---------85706 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRLCIVQLSPG------D------GTV-DIIRIAA-------------G---------QKNAP 64 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l~LiQl~~~------~------~~~-~l~~~~~-------------~---------~~~~~ 64 (207) --.|.|||||||+++.++++ |+|+.- + ..+ .++.... . ....+ T Consensus 17 ~~pVV~D~ETTGl~p~~D~I--IEIGAV~v~~~e~g~i~~~~~f~~~V~P~~g~~I~~~~~~ltGI~~~~~l~~a~~~~~ 94 (213) T PRK05168 17 FLPVVIDVETAGFNAQTDAL--LEIAAITLKMDEQGWLYPDETLHFHVEPFEGANLEPEALAFNGIDPDNPLRGAVSEKE 94 (213) T ss_pred CCCEEEEEECCCCCCCCCCE--EEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCHHHHHHCCCCCCCCCCCCCCHHH T ss_conf 88759983079988778855--8888899886378867247548898788878878866751126786532223666788 Q ss_pred HHHHHHH---C-------CCCCEEECCCCHHHHHHHHHH---HCCCC-----CHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 7767773---2-------364023010000168886542---00000-----0045789886521100000088777654 Q gi|254780636|r 65 NLVGMLV---D-------EKREKIFHYGRFDIAVLFYTF---GVRVR-----PVFCTKIASRLTRTYTNQHGLKDNLKEL 126 (207) Q Consensus 65 ~L~~ll~---d-------~~i~KV~Hn~~~D~~~L~~~~---gi~~~-----~i~DT~ias~l~~~~~~~~~L~~L~~~~ 126 (207) .+.+++. + .+.+-|+||+.||+.+|...+ |+.-. .++||.-.|++.- .+.+|..+|++ T Consensus 95 al~~~~~~i~~~~~~~~c~~avLVaHNA~FD~~FLn~~~~r~g~~~~P~~~~~viDTl~Lar~~~---~~~~L~~lc~~- 170 (213) T PRK05168 95 ALHEIFKPVRKGIKAAGCQRAILVAHNAHFDLGFLNAAAERTGLKRNPFHPFSTFDTATLAGLAY---GQTVLAKACQA- 170 (213) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHH---CCCCHHHHHHH- T ss_conf 78999999999987504675637763418749999999998499778888776335899868874---87409999998- Q ss_pred HCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 20024675200365654432368999986459999999999999997491899 Q gi|254780636|r 127 LGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLGRSDL 179 (207) Q Consensus 127 lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~~ 179 (207) +|+.++ ..+.+=|..||..|-++|-+|-+++++.|.+.| T Consensus 171 ~gi~~~--------------~~~~HrAl~DA~~tA~vFl~ll~~l~~~g~~~~ 209 (213) T PRK05168 171 AGIEFD--------------NKEAHSALYDTEKTAELFCEIVNRWKRLGGWPL 209 (213) T ss_pred CCCCCC--------------CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 699999--------------977777389999999999999999997389999 No 61 >COG2176 PolC DNA polymerase III, alpha subunit (gram-positive type) [DNA replication, recombination, and repair] Probab=98.97 E-value=3.5e-09 Score=72.59 Aligned_cols=149 Identities=22% Similarity=0.275 Sum_probs=100.1 Q ss_pred HHHHCCCCCCEEEECCCCCCCCCCCEEEEEEEEEC---CCC------EEEECCCCCC--------------CCC------ Q ss_conf 57842489827997177789854450799998418---942------8997465778--------------570------ Q gi|254780636|r 13 AECAARYVDAIAVDTETLGLMPRRDRLCIVQLSPG---DGT------VDIIRIAAGQ--------------KNA------ 63 (207) Q Consensus 13 ~~~~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~~---~~~------~~l~~~~~~~--------------~~~------ 63 (207) .+..+..+..|-||.|||||++.++.+ ||++.. ++. .++-|-.... ..+ T Consensus 414 ~d~~l~datyVVfDiETTGLs~~~d~i--IE~aAvKikng~iId~f~~Fi~P~~pl~~~~telTgITdeml~~a~~i~~v 491 (1444) T COG2176 414 DDQKLDDATYVVFDIETTGLSPVYDEI--IEIAAVKIKNGRIIDKFQFFIKPGRPLSATITELTGITDEMLENAPEIEEV 491 (1444) T ss_pred CCCCCCCCCEEEEEEECCCCCCCCCHH--HHHEEEEEECCCCHHHHHHHCCCCCCCCHHHHHCCCCCHHHHCCCCCHHHH T ss_conf 445456550899985537767542015--620004433880437789750899967644542035679987488339999 Q ss_pred -HHHHHHHHCCCCCEEECCCCHHHHHHHHHH---HCCC--CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEE Q ss_conf -677677732364023010000168886542---0000--0004578988652110000008877765420024675200 Q gi|254780636|r 64 -PNLVGMLVDEKREKIFHYGRFDIAVLFYTF---GVRV--RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQS 137 (207) Q Consensus 64 -~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~~--~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~ 137 (207) +.+++++.| .|. |.||+.||+.+|...+ |+.+ +++.||.-.|+.+.+.-.+|+|..+++. ||+.+ T Consensus 492 L~kf~~~~~d-~Il-VAHNasFD~gFl~~~~~k~~~~~~~~pvIDTL~lar~L~P~~ksh~Lg~l~kk-~~v~l------ 562 (1444) T COG2176 492 LEKFREFIGD-SIL-VAHNASFDMGFLNTNYEKYGLEPLTNPVIDTLELARALNPEFKSHRLGTLCKK-LGVEL------ 562 (1444) T ss_pred HHHHHHHHCC-CEE-EECCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCHHHHHCCHHHHHHH-HCCCH------ T ss_conf 9999998448-589-96167556457888899838851027501199999875745432466778887-48637------ Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 365654432368999986459999999999999997491899999 Q gi|254780636|r 138 SDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLGRSDLATS 182 (207) Q Consensus 138 SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~~~~~ 182 (207) ++-+=|-.|+..|-.++..+.+.+++.|...+-+. T Consensus 563 ----------e~hHRA~yDaeat~~vf~~f~~~~ke~Gi~~l~el 597 (1444) T COG2176 563 ----------ERHHRADYDAEATAKVFFVFLKDLKEKGITNLSEL 597 (1444) T ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHH T ss_conf ----------87531010089999999999999988455659998 No 62 >PRK09145 DNA polymerase III subunit epsilon; Validated Probab=98.97 E-value=4e-09 Score=72.30 Aligned_cols=131 Identities=20% Similarity=0.281 Sum_probs=77.3 Q ss_pred CCCCCEEEECCCCCCCCCCCEEEEEEEEE---CCC------CE-EEECCCCCCC---------------CC-------HH Q ss_conf 48982799717778985445079999841---894------28-9974657785---------------70-------67 Q gi|254780636|r 18 RYVDAIAVDTETLGLMPRRDRLCIVQLSP---GDG------TV-DIIRIAAGQK---------------NA-------PN 65 (207) Q Consensus 18 ~~~~~iaiDtEt~~l~~~~~~l~LiQl~~---~~~------~~-~l~~~~~~~~---------------~~-------~~ 65 (207) ...+.|+|||||||+++.++++ |||+. .++ .+ .++....... .+ +. T Consensus 27 ~~~~fVv~D~ETTGL~~~~D~I--ieIgav~v~~~~i~~~~~f~~lV~P~~~i~~~~~~IhGIt~~~l~~ap~~~evl~~ 104 (203) T PRK09145 27 PPDEWVALDCETTGLDPRRAEI--VSIAAVKIRGNRILTSERLELLVRPPQSLSAESIKIHRLRHQDLEHGLSEEEALRQ 104 (203) T ss_pred CCCCEEEEEEECCCCCCCCCEE--EEEEEEEEECCEEECCCEEEEEECCCCCCCHHHHCCCCCCHHHHCCCCCHHHHHHH T ss_conf 9998999985689999999706--99999999999993175289998979959864520138787998139999999999 Q ss_pred HHHHHHCCCCCEEECCCCHHHHHHHHH----HHCCC-CCHHHHHH--HHHHHHH---HCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 767773236402301000016888654----20000-00045789--8865211---00000088777654200246752 Q gi|254780636|r 66 LVGMLVDEKREKIFHYGRFDIAVLFYT----FGVRV-RPVFCTKI--ASRLTRT---YTNQHGLKDNLKELLGINISKAQ 135 (207) Q Consensus 66 L~~ll~d~~i~KV~Hn~~~D~~~L~~~----~gi~~-~~i~DT~i--as~l~~~---~~~~~~L~~L~~~~lg~~ldK~~ 135 (207) +.+++.+. .-|+||+.||+.+|.+. +|... .+.+||.- ..++.+. ...+++|.+++++ +|+.. T Consensus 105 f~~fi~~~--~lVgHNa~FD~~fL~~~~~r~~g~~l~~~~id~~~l~~~~~~~~~~~~~~~~~L~~l~~~-~gi~~---- 177 (203) T PRK09145 105 LLAFIGNR--PLVGYYLEFDVAMLNRYVRPLLGIGLPNPLIEVSSLYHDKKERHLPDAYIDLRFDAILKH-LDLPV---- 177 (203) T ss_pred HHHHCCCC--EEEEECHHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHCCCCCCCCCHHHHHHH-CCCCC---- T ss_conf 99882999--899987899999999999996299999987760987999999667698778899999985-79878---- Q ss_pred EECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0036565443236899998645999999999999 Q gi|254780636|r 136 QSSDWSADDLSDEQLQYAASDVVHLHALRLQFTE 169 (207) Q Consensus 136 q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~ 169 (207) .+-+=|..||..+-+||.+|.+ T Consensus 178 ------------~~~HrAL~DA~aTa~lfl~L~k 199 (203) T PRK09145 178 ------------LGRHDALNDAIMTALIYLRLRK 199 (203) T ss_pred ------------CCCCCHHHHHHHHHHHHHHHHC T ss_conf ------------9997989999999999999867 No 63 >cd06127 DEDDh DEDDh exonucleases, part of the DnaQ-like (or DEDD) exonuclease superfamily, catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. These proteins contain four invariant acidic residues in three conserved sequence motifs termed ExoI, ExoII and ExoIII. DEDDy exonucleases are classified as such because of the presence of specific Hx(4)D conserved pattern at the ExoIII motif. The four conserved acidic residues are clustered around the active site and serve as ligands for the two metal ions required for catalysis. Most DEDDh exonucleases are the proofreading subunits (epsilon) or domains of bacterial DNA polymerase III, the main replicating enzyme in bacteria, which functions as the chromosomal replicase. Other members include other DNA and RNA exonucleases such as RNase T, Oligoribonuclease, and RNA exonuclease (REX), among others. Probab=98.90 E-value=6.7e-09 Score=70.96 Aligned_cols=122 Identities=22% Similarity=0.236 Sum_probs=73.6 Q ss_pred EEEECCCCCCCCCCCEEEEEEEEEC---C-----CCE-EEECCCCCCCC----------------------CHHHHHHHH Q ss_conf 7997177789854450799998418---9-----428-99746577857----------------------067767773 Q gi|254780636|r 23 IAVDTETLGLMPRRDRLCIVQLSPG---D-----GTV-DIIRIAAGQKN----------------------APNLVGMLV 71 (207) Q Consensus 23 iaiDtEt~~l~~~~~~l~LiQl~~~---~-----~~~-~l~~~~~~~~~----------------------~~~L~~ll~ 71 (207) |.+||||||+++..+++ ||+|.- + ..+ .++........ ...+..++. T Consensus 1 v~~D~ETTGl~~~~~~I--ieia~v~~~~~~~~~~~~~~~i~P~~~~~~~~~~i~gi~~~~~~~~~~~~~~~~~~~~~l~ 78 (159) T cd06127 1 VVLDTETTGLDPKGDRI--IEIGAVKVDGGIEIVERFETLVNPGRPIPPEATAIHGITDEMLADAPPFEEVLPEFLEFLG 78 (159) T ss_pred CEEEEECCCCCCCCCCE--EEEEEEEEECCEEEEEEEEEEECCCCCCCHHHHHHHCCCHHHHCCCCCHHHHHHHHHHHCC T ss_conf 98982679999999846--9999999999956520079998899999988987606778995899968999999987315 Q ss_pred CCCCCEEECCCCHHHHHHHHHHHC-----CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCC Q ss_conf 236402301000016888654200-----000004578988652110000008877765420024675200365654432 Q gi|254780636|r 72 DEKREKIFHYGRFDIAVLFYTFGV-----RVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLS 146 (207) Q Consensus 72 d~~i~KV~Hn~~~D~~~L~~~~gi-----~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs 146 (207) + . .-|+||+.||..+|.+.+.- .+...+|||-.++-........++..+...++|.. T Consensus 79 ~-~-~lv~hN~~FD~~~L~~~~~~~~~~~~~~~~iDt~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------- 140 (159) T cd06127 79 G-R-VLVAHNASFDLRFLNRELRRLGGPPLPNPWIDTLRLARRLLPGLRSHRLGLLLAARYGIP---------------- 140 (159) T ss_pred C-C-EEEECCHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHHCCCC---------------- T ss_conf 6-3-465158889999999999983999889966647999999738888999999999976988---------------- Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 368999986459999999 Q gi|254780636|r 147 DEQLQYAASDVVHLHALR 164 (207) Q Consensus 147 ~~Qi~YAA~Dv~~l~~L~ 164 (207) ..+-+-|..||..+.++| T Consensus 141 ~~~aH~Al~D~~at~~l~ 158 (159) T cd06127 141 LEGAHRALADALATAELL 158 (159) T ss_pred CCCCCCHHHHHHHHHHHH T ss_conf 798959899999999996 No 64 >TIGR01407 dinG_rel DnaQ family exonuclease/DinG family helicase, putative; InterPro: IPR006310 These are a family of proteins in Gram-positive bacteria. The N-terminal region of about 200 amino acids resembles the epsilon subunit of E. coli DNA polymerase III and the homologous region of the Gram-positive type DNA polymerase III alpha subunit. The epsilon subunit contains an exonuclease domain. The remainder of this protein family resembles a predicted ATP-dependent helicase, the DNA damage-inducible protein DinG of Escherichia coli. . Probab=98.85 E-value=7e-08 Score=65.04 Aligned_cols=134 Identities=24% Similarity=0.292 Sum_probs=84.9 Q ss_pred CEEEECCCCCCCCCC-CEEEEEEEEE---CCCCE---EEECCCCCCCCC-----------------H-------HHHHHH Q ss_conf 279971777898544-5079999841---89428---997465778570-----------------6-------776777 Q gi|254780636|r 22 AIAVDTETLGLMPRR-DRLCIVQLSP---GDGTV---DIIRIAAGQKNA-----------------P-------NLVGML 70 (207) Q Consensus 22 ~iaiDtEt~~l~~~~-~~l~LiQl~~---~~~~~---~l~~~~~~~~~~-----------------~-------~L~~ll 70 (207) .+-||.||||-.+-. ++| ||++. .|+.. |--.++.....+ | .+..++ T Consensus 2 YaVVDLEtTg~~~~~~d~I--IqiGiV~v~DgeIv~~f~T~VNP~~~~pp~I~eLTGisd~~l~~AP~F~qVA~~i~~~l 79 (944) T TIGR01407 2 YAVVDLETTGTDSSEFDKI--IQIGIVLVEDGEIVDTFATDVNPNEPIPPFIQELTGISDEQLKQAPYFSQVAQEIYDLL 79 (944) T ss_pred EEEEEEEECCCCCCCCCCE--EEEEEEEEECCCEEEECCCCCCCCCCCCCHHHHHCCCCHHHHCCCCCHHHHHHHHHHHH T ss_conf 6899874168886455766--89988998558143201267788987761145422877686404888879999999874 Q ss_pred HCCCCCEEECCCCHHHHHHHHHH---HCCC--CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCC Q ss_conf 32364023010000168886542---0000--000457898865211000000887776542002467520036565443 Q gi|254780636|r 71 VDEKREKIFHYGRFDIAVLFYTF---GVRV--RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDL 145 (207) Q Consensus 71 ~d~~i~KV~Hn~~~D~~~L~~~~---gi~~--~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpL 145 (207) +|. +-|.||++||+.+|.+.+ |+.| .+++||-=.++++=+...+++|..|++. ||+.++.-++ T Consensus 80 ~D~--iFVAHNV~FD~nfL~k~l~~~G~~~~~~~~iDTVELa~if~PT~esY~Ls~L~E~-LGl~h~npH~--------- 147 (944) T TIGR01407 80 EDG--IFVAHNVHFDLNFLAKELVDEGYEPLEVPRIDTVELARIFFPTEESYKLSELSEE-LGLEHENPHR--------- 147 (944) T ss_pred CCC--EEEEECHHCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHH-HCCCCCCCCC--------- T ss_conf 297--2776101506789999998568974343520056799874674435561489997-1987146676--------- Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 2368999986459999999999999997491 Q gi|254780636|r 146 SDEQLQYAASDVVHLHALRLQFTEKLQRLGR 176 (207) Q Consensus 146 s~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~ 176 (207) |-.||.+|-+|--.+.+++++.-+ T Consensus 148 -------AdSDA~~TAeLLLl~~~K~~~LPl 171 (944) T TIGR01407 148 -------ADSDAQVTAELLLLLEEKMKKLPL 171 (944) T ss_pred -------CCHHHHHHHHHHHHHHHHHHHHHH T ss_conf -------105789999999999999875108 No 65 >cd06144 REX4_like This subfamily is composed of RNA exonuclease 4 (REX4 or Rex4p), XPMC2, Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20), and similar proteins. REX4 is involved in pre-rRNA processing. It controls the ratio between the two forms of 5.8S rRNA in yeast. XPMC2 is a Xenopus gene which was identified through its ability to correct a mitotic defect in fission yeast. The human homolog of XPMC2 (hPMC2) may be involved in angiotensin II-induced adrenal cell cycle progression and cell proliferation. ISG20 is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. These proteins are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required fo Probab=98.84 E-value=1.1e-08 Score=69.73 Aligned_cols=123 Identities=24% Similarity=0.254 Sum_probs=71.1 Q ss_pred EEEECCCCCCCCCCCE--EEEEEEEECCCCEEEECCCCCC------------------CCC-------HHHHHHHHCCCC Q ss_conf 7997177789854450--7999984189428997465778------------------570-------677677732364 Q gi|254780636|r 23 IAVDTETLGLMPRRDR--LCIVQLSPGDGTVDIIRIAAGQ------------------KNA-------PNLVGMLVDEKR 75 (207) Q Consensus 23 iaiDtEt~~l~~~~~~--l~LiQl~~~~~~~~l~~~~~~~------------------~~~-------~~L~~ll~d~~i 75 (207) ||+|||++|..+.... ++-|.+-..++...++.+-+-. ..+ +.+.+++.+. + T Consensus 1 valDcEm~g~~~~~~~~~larv~vvd~~g~~v~d~~V~P~~~V~d~~T~~sGIt~~~l~~a~~~~~v~~~~~~~l~~~-i 79 (152) T cd06144 1 VALDCEMVGVGPDGSESALARVSIVNEDGNVVYDTYVKPQEPVTDYRTAVSGIRPEHLKDAPDFEEVQKKVAELLKGR-I 79 (152) T ss_pred CEEEEEECCCCCCCCEEEEEEEEEEECCCCEEEEEEECCCCCCCCCCEEECCCCHHHHCCCCCHHHHHHHHHHHHCCC-E T ss_conf 989876033528998789999999967899998872369976787635624857999738999999999999985388-7 Q ss_pred CEEECCCCHHHHHHHHHHHCCCC-CHHHHHHHHHHHH-HHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHH Q ss_conf 02301000016888654200000-0045789886521-100000088777654200246752003656544323689999 Q gi|254780636|r 76 EKIFHYGRFDIAVLFYTFGVRVR-PVFCTKIASRLTR-TYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYA 153 (207) Q Consensus 76 ~KV~Hn~~~D~~~L~~~~gi~~~-~i~DT~ias~l~~-~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YA 153 (207) -||||+.||+.+|.-. .++ .+.||.....+.+ ....++||+.|++++||.++..+. +=| T Consensus 80 -lVGH~l~~Dl~~L~l~---~~~~~i~DT~~~~~~~~~~~~~~~sLk~L~~~~Lg~~Iq~~~---------------H~s 140 (152) T cd06144 80 -LVGHALKNDLKVLKLD---HPKKLIRDTSKYKPLRKTAKGKSPSLKKLAKQLLGLDIQEGE---------------HSS 140 (152) T ss_pred -EEECCHHHHHHHHHCC---CCCCEEEEEEEEHHHHHCCCCCCCCHHHHHHHHCCCCCCCCC---------------CCC T ss_conf -9860646469996243---887606885564224541688992799999998588589979---------------696 Q ss_pred HHHHHHHHHHHH Q ss_conf 864599999999 Q gi|254780636|r 154 ASDVVHLHALRL 165 (207) Q Consensus 154 A~Dv~~l~~L~~ 165 (207) ..||..+.+||. T Consensus 141 ~eDA~at~~Lfk 152 (152) T cd06144 141 VEDARAAMRLYR 152 (152) T ss_pred HHHHHHHHHHHC T ss_conf 999999999859 No 66 >PRK09146 DNA polymerase III subunit epsilon; Validated Probab=98.83 E-value=2e-08 Score=68.28 Aligned_cols=150 Identities=19% Similarity=0.179 Sum_probs=84.9 Q ss_pred HCCCCCCEEEECCCCCCCCCCCEEE---EEEEE-----ECCCCEEEECCCCCCC-C--------------CH----HHHH Q ss_conf 4248982799717778985445079---99984-----1894289974657785-7--------------06----7767 Q gi|254780636|r 16 AARYVDAIAVDTETLGLMPRRDRLC---IVQLS-----PGDGTVDIIRIAAGQK-N--------------AP----NLVG 68 (207) Q Consensus 16 ~l~~~~~iaiDtEt~~l~~~~~~l~---LiQl~-----~~~~~~~l~~~~~~~~-~--------------~~----~L~~ 68 (207) -+...+.+++|+||||++|.++++. .|-|. .++....++....... . ++ .+.+ T Consensus 43 pl~d~~fvalD~ETTGLdp~~D~IiSIGaV~i~~~~I~l~~a~~~lV~p~~~i~~~s~~IHGIt~~~l~~ap~l~evl~~ 122 (239) T PRK09146 43 PLSEVPFVALDFETTGLDPEQDAIVSIGLVPFTLQRIRCRQARHWVVKPRRPLNEESVVIHGITHSDLQDAPDLERILDE 122 (239) T ss_pred CCCCCCEEEEECCCCCCCCCCCCEEEEEEEEEECCEEEECCEEEEEECCCCCCCCCCEEECCCCHHHHHCCCCHHHHHHH T ss_conf 32108789996415788878885489989998788886534158998899978764511069698998569999999999 Q ss_pred HHHC-CCCCEEECCCCHHHHHHHHH----HHCCC-CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCC Q ss_conf 7732-36402301000016888654----20000-000457898865211000000887776542002467520036565 Q gi|254780636|r 69 MLVD-EKREKIFHYGRFDIAVLFYT----FGVRV-RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSA 142 (207) Q Consensus 69 ll~d-~~i~KV~Hn~~~D~~~L~~~----~gi~~-~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~ 142 (207) +++- ...+-|+|++.+|..+|.+. +|..+ .+++||+-.++-... ..-..+..++.|.... +.+.++... T Consensus 123 ll~~l~g~VLVaH~a~~D~~FL~~a~~~~~g~~l~~P~IDTl~Le~~~~~----~~~~~~~~r~~~~~~~-slrL~~~r~ 197 (239) T PRK09146 123 LLEALAGKVVVVHYRRIERDFLDQALRNRIGEGIEFPVIDTMEIEARIQR----KQAGGLWNRLKGKKPE-SIRLADSRL 197 (239) T ss_pred HHHHHCCCEEEEECCHHHHHHHHHHHHHHHCCCCCCCEEEHHHHHHHHHH----HHHHHHHHHHCCCCCC-CCCHHHHHH T ss_conf 99981899899968399999999999996288988746448999999988----5420044431358986-763899999 Q ss_pred C-CCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4-432368999986459999999999999 Q gi|254780636|r 143 D-DLSDEQLQYAASDVVHLHALRLQFTEK 170 (207) Q Consensus 143 r-pLs~~Qi~YAA~Dv~~l~~L~~~l~~~ 170 (207) | .|+.-+-+=|..||..+-+|+..+-.. T Consensus 198 rYgLP~y~aH~AL~DAlATAELflAQi~~ 226 (239) T PRK09146 198 RYGLPAYPPHHALTDAIATAELLQAQIAH 226 (239) T ss_pred HCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 81989888767199999999999999998 No 67 >PRK06063 DNA polymerase III subunit epsilon; Provisional Probab=98.80 E-value=1.2e-07 Score=63.64 Aligned_cols=131 Identities=21% Similarity=0.240 Sum_probs=80.0 Q ss_pred CCCCEEEECCCCCCCCCCCEEEEEEEEE----CCCC----E-EEECCCCCC-------------CCC-------HHHHHH Q ss_conf 8982799717778985445079999841----8942----8-997465778-------------570-------677677 Q gi|254780636|r 19 YVDAIAVDTETLGLMPRRDRLCIVQLSP----GDGT----V-DIIRIAAGQ-------------KNA-------PNLVGM 69 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~~~l~LiQl~~----~~~~----~-~l~~~~~~~-------------~~~-------~~L~~l 69 (207) ..+.+.+|+||||+++.+++++ ||+. .++. + -++...... ..+ +.+.++ T Consensus 14 ~~~fvvvD~ETTGl~p~~drIi--eig~v~~~~~g~i~~~~~tlvnP~~dp~~~~IhGIt~~~v~~aP~f~ev~~~l~~~ 91 (313) T PRK06063 14 GQGWAVVDVETSGFRPGQARII--SVAVLGLDADGNVEQSVVSLLNPGVDPGPTHVHGLTAAMLEGQPQFADIAGELAEV 91 (313) T ss_pred CCCEEEEEEECCCCCCCCCEEE--EEEEEEEECCCEEEEEEEEEECCCCCCCCEEEECCCHHHHCCCCCHHHHHHHHHHH T ss_conf 9997999976899999897069--99999998997888999997591999898434187999981799899999999999 Q ss_pred HHCCCCCEEECCCCHHHHHHHHHH---HCC-C-CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCC Q ss_conf 732364023010000168886542---000-0-00045789886521100000088777654200246752003656544 Q gi|254780636|r 70 LVDEKREKIFHYGRFDIAVLFYTF---GVR-V-RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADD 144 (207) Q Consensus 70 l~d~~i~KV~Hn~~~D~~~L~~~~---gi~-~-~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rp 144 (207) +.+ . +-|.||+.||..+|...+ |+. | ..++||+-.|+.+.....+|.|.+++++ +|+...+-+. T Consensus 92 l~g-~-vlVAHNa~FD~~FL~~e~~r~g~~~p~~~~lcTl~lARr~~~~l~~~kL~tla~~-~Gi~~~~~H~-------- 160 (313) T PRK06063 92 LRG-R-TLVAHNVAFDYSFLAAEAELAGAELPVDQVMCTVELARRLDLGLPNLRLETLAAH-WGVVQQRPHD-------- 160 (313) T ss_pred HCC-C-EEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHH-CCCCCCCCCC-------- T ss_conf 789-9-7999685652999999999739988999667689988654489999747889998-0999887655-------- Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 32368999986459999999999999 Q gi|254780636|r 145 LSDEQLQYAASDVVHLHALRLQFTEK 170 (207) Q Consensus 145 Ls~~Qi~YAA~Dv~~l~~L~~~l~~~ 170 (207) |..|+..+-.|.....+. T Consensus 161 --------Al~DAra~a~il~~~l~~ 178 (313) T PRK06063 161 --------ALDDARVLAGILAPALER 178 (313) T ss_pred --------CHHHHHHHHHHHHHHHHH T ss_conf --------067789999998999986 No 68 >pfam00929 Exonuc_X-T Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III.; Probab=98.79 E-value=8.1e-08 Score=64.69 Aligned_cols=124 Identities=21% Similarity=0.258 Sum_probs=74.0 Q ss_pred CEEEECCCCCCCCCCCEEEEEEEEE----CCC----CE-EEECCCCCC-CC---------------------CHHHHHHH Q ss_conf 2799717778985445079999841----894----28-997465778-57---------------------06776777 Q gi|254780636|r 22 AIAVDTETLGLMPRRDRLCIVQLSP----GDG----TV-DIIRIAAGQ-KN---------------------APNLVGML 70 (207) Q Consensus 22 ~iaiDtEt~~l~~~~~~l~LiQl~~----~~~----~~-~l~~~~~~~-~~---------------------~~~L~~ll 70 (207) +|.+||||||+++.++++ ||++. ++. .+ .++.+.... .. .+.+.+++ T Consensus 1 ~v~~D~ETTGl~~~~~~I--ieigav~~~~~~~~~~~~~~~v~P~~~i~~~~~~i~GIt~~~l~~~~~~~~~~~~~~~~l 78 (162) T pfam00929 1 LVVIDCETTGLDPEKDRI--IEIAAVSIVGGENIGPVFDTYVKPERLITDEATKFHGITPEMLRNAPSFEEVLEAFLEFL 78 (162) T ss_pred CEEEEEECCCCCCCCCEE--EEEEEEEEECCEEEEEEEEEEECCCCCCCHHHHHHCCCCHHHHHHCCCHHHHHHHHHHHH T ss_conf 989998689998999758--999999999997765556787778888899999873978777541531177999999984 Q ss_pred HCCCCCEEECCCCHHHHHHHHHHH---CC----CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCC Q ss_conf 323640230100001688865420---00----00004578988652110000008877765420024675200365654 Q gi|254780636|r 71 VDEKREKIFHYGRFDIAVLFYTFG---VR----VRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSAD 143 (207) Q Consensus 71 ~d~~i~KV~Hn~~~D~~~L~~~~g---i~----~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~r 143 (207) .+.. +.|+||+.||..++.+.+. .. ...++||+..++...+...+++|.++++. +|++.. .+ T Consensus 79 ~~~~-~~v~~n~~fD~~~l~~~~~~~~~~~~~~~~~~~dt~~~~~~~~~~~~~~~L~~l~~~-~~i~~~---------~~ 147 (162) T pfam00929 79 KKLK-ILVGHNASFDVGFLLYDDLRFLKLPHPKLNDVIDTLILDKATYKGFKRRSLDALAEK-LGLEKI---------QR 147 (162) T ss_pred CCCC-EEECCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHH-CCCCCC---------CC T ss_conf 2585-777788778999999999985863366333304489999997504467998999988-599998---------88 Q ss_pred CCCHHHHHHHHHHHHHHHHHH Q ss_conf 432368999986459999999 Q gi|254780636|r 144 DLSDEQLQYAASDVVHLHALR 164 (207) Q Consensus 144 pLs~~Qi~YAA~Dv~~l~~L~ 164 (207) -+=|..||..+.+|| T Consensus 148 ------aH~Al~Da~~t~~lf 162 (162) T pfam00929 148 ------AHRALDDARATAELF 162 (162) T ss_pred ------CCCHHHHHHHHHHHC T ss_conf ------678299999999879 No 69 >smart00479 EXOIII exonuclease domain in DNA-polymerase alpha and epsilon chain, ribonuclease T and other exonucleases. Probab=98.77 E-value=4.4e-08 Score=66.22 Aligned_cols=131 Identities=20% Similarity=0.266 Sum_probs=80.6 Q ss_pred CEEEECCCCCCCCCCCEEEEEEEEE----CCCC---E-EEECCCC-CC--------------CC-------CHHHHHHHH Q ss_conf 2799717778985445079999841----8942---8-9974657-78--------------57-------067767773 Q gi|254780636|r 22 AIAVDTETLGLMPRRDRLCIVQLSP----GDGT---V-DIIRIAA-GQ--------------KN-------APNLVGMLV 71 (207) Q Consensus 22 ~iaiDtEt~~l~~~~~~l~LiQl~~----~~~~---~-~l~~~~~-~~--------------~~-------~~~L~~ll~ 71 (207) +|+|||||||+++.++++ ||+|. +... + .++.+.. .. .. .+.+.+++. T Consensus 2 ~v~~D~ETTGl~~~~~~I--ieigav~~~~~~~~~~~~~~i~P~~~i~~~~~~i~GIt~~~l~~~~~~~~v~~~~~~~l~ 79 (169) T smart00479 2 LVVIDCETTGLDPGKDEI--IEIAAVDVDGGRIIVVFDTYVKPDRPITDYATEIHGITPEMLDDAPTFEEVLEELLEFLK 79 (169) T ss_pred EEEEEEECCCCCCCCCEE--EEEEEEEEECCEEEEEEEEEECCCCCCCHHHHHHCCCCCHHCCCCCCHHHHHHHHHHHHC T ss_conf 999998527999999869--999999999998878889887999999989986318472120458669999999998604 Q ss_pred CCCCCEEECCCCHHHHHHHHHHH---CC---CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCC Q ss_conf 23640230100001688865420---00---0000457898865211000000887776542002467520036565443 Q gi|254780636|r 72 DEKREKIFHYGRFDIAVLFYTFG---VR---VRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDL 145 (207) Q Consensus 72 d~~i~KV~Hn~~~D~~~L~~~~g---i~---~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpL 145 (207) +..+ -++|+++||..+|.+.+. .. ...++||+-.++...+ ..+++|.++++. +|+... . T Consensus 80 ~~~~-v~~~~~~fD~~~L~~~~~~~~~~~~~~~~~~dt~~~~~~~~~-~~~~~L~~l~~~-~~i~~~---------~--- 144 (169) T smart00479 80 GKIL-VAGNALNFDLRFLKLEHPRLGIKDPPKNPVIDTLKLARALNP-GRKYSLKKLAER-LGLEVI---------G--- 144 (169) T ss_pred CCCC-HHCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCC-CCCHHHHHHHHH-CCCCCC---------C--- T ss_conf 8745-103679999999999999818998866524629999999758-854589999998-599989---------9--- Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 236899998645999999999999999 Q gi|254780636|r 146 SDEQLQYAASDVVHLHALRLQFTEKLQ 172 (207) Q Consensus 146 s~~Qi~YAA~Dv~~l~~L~~~l~~~L~ 172 (207) +-+-|..||..+.++|.++.+++. T Consensus 145 ---~~H~Al~Da~~t~~v~~~l~~~~~ 168 (169) T smart00479 145 ---RAHRALDDARATAKLFKKLVERLL 168 (169) T ss_pred ---CCCCHHHHHHHHHHHHHHHHHHCC T ss_conf ---992979999999999999997724 No 70 >PRK07247 DNA polymerase III subunit epsilon; Validated Probab=98.67 E-value=5e-07 Score=60.08 Aligned_cols=136 Identities=16% Similarity=0.050 Sum_probs=81.8 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEEEC---CC-CE----EEECCCCC-C--------------CCC-------HHHHHH Q ss_conf 9827997177789854450799998418---94-28----99746577-8--------------570-------677677 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRLCIVQLSPG---DG-TV----DIIRIAAG-Q--------------KNA-------PNLVGM 69 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l~LiQl~~~---~~-~~----~l~~~~~~-~--------------~~~-------~~L~~l 69 (207) ...|+||+||+|++.. ++ +|||+.- ++ .+ -++.+... . ..+ +.+..+ T Consensus 5 ~~yvv~DlETTg~~~~-~~--IIEIgAVk~~~g~ivd~F~slVnP~~~I~~~i~~lTGIt~~~v~~aP~~~eVl~~f~~F 81 (195) T PRK07247 5 ETYIAFDLEFNTVNDV-SH--IIQVSAVKYDHHKEVDSFDTYVYTDVPLQSFINGLTGITADKIAAAPKVEEVLAAFKNF 81 (195) T ss_pred CEEEEEEEECCCCCCC-CE--EEEEEEEEEECCEEEEEEEEEECCCCCCCHHHHHCCCCCHHHHHCCCCHHHHHHHHHHH T ss_conf 8499999778999999-80--69999999989989999983566989999555413598999983599999999999999 Q ss_pred HHCCCCCEEECCC-CHHHHHHHHHH---HCCC-CCHHHHHHHHH-HHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCC Q ss_conf 7323640230100-00168886542---0000-00045789886-52110000008877765420024675200365654 Q gi|254780636|r 70 LVDEKREKIFHYG-RFDIAVLFYTF---GVRV-RPVFCTKIASR-LTRTYTNQHGLKDNLKELLGINISKAQQSSDWSAD 143 (207) Q Consensus 70 l~d~~i~KV~Hn~-~~D~~~L~~~~---gi~~-~~i~DT~ias~-l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~r 143 (207) +.|.- -|+||+ +||+.+|.... +... .+++||...++ +..+...+|+|..|+++ ||++ . T Consensus 82 igd~~--lVaHNa~~FD~~fL~~~~~~~~~~~~~~~~~~~~~~r~~~~p~l~n~kL~tLa~~-f~I~-~----------- 146 (195) T PRK07247 82 VGELP--LIGYNAQKSDLPILAENGLDLRDQYQVDLFDEAFDRRSSDLNGIANLKLQTVATF-LGIK-G----------- 146 (195) T ss_pred HCCCE--EEEECCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHH-CCCC-C----------- T ss_conf 68991--8990884514999999877415776685699999998860588767778999997-4999-9----------- Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH Q ss_conf 43236899998645999999999999999749189 Q gi|254780636|r 144 DLSDEQLQYAASDVVHLHALRLQFTEKLQRLGRSD 178 (207) Q Consensus 144 pLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~ 178 (207) .. +=|..||..+.+||.+|.++..+.+... T Consensus 147 ---~~--HRAl~DA~aTa~If~klLe~~~~~~~~~ 176 (195) T PRK07247 147 ---RG--HNSLEDARMTARIYESFLETDTNKAYLS 176 (195) T ss_pred ---CC--CCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ---89--0818999999999999999766433677 No 71 >PRK06195 DNA polymerase III subunit epsilon; Validated Probab=98.60 E-value=6.7e-07 Score=59.37 Aligned_cols=139 Identities=19% Similarity=0.279 Sum_probs=81.8 Q ss_pred CCEEEECCCCCCCCCCCEEEEEEEE-ECC-----CCEEEECCCCCC----------------CCC-------HHHHHHHH Q ss_conf 8279971777898544507999984-189-----428997465778----------------570-------67767773 Q gi|254780636|r 21 DAIAVDTETLGLMPRRDRLCIVQLS-PGD-----GTVDIIRIAAGQ----------------KNA-------PNLVGMLV 71 (207) Q Consensus 21 ~~iaiDtEt~~l~~~~~~l~LiQl~-~~~-----~~~~l~~~~~~~----------------~~~-------~~L~~ll~ 71 (207) +.++||+||++ +.++++|=|.+. ..+ ....++.+.... ..+ +.+..++. T Consensus 2 ~fvviD~ETt~--~~~d~iieIg~v~v~~g~iv~~~~~Li~P~~~~~~p~~i~ihGIt~~~v~~aP~f~ev~~~~~~fi~ 79 (309) T PRK06195 2 DFVAIDFETAN--EKRNSPCSIGIVVVKDGEIVEKVHYLIKPKEMRFMPINIGIHGIRPHMVQDELEFDKIWEKIKDYFN 79 (309) T ss_pred CEEEEEEECCC--CCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCCCCCEEEECCCCHHHHHCCCCHHHHHHHHHHHHC T ss_conf 29999978999--9998059999999999999999999989999888874464037799999659999999999999857 Q ss_pred CCCCCEEECCCCHHHHHHHHH---HHCCC-C-CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCC Q ss_conf 236402301000016888654---20000-0-004578988652110000008877765420024675200365654432 Q gi|254780636|r 72 DEKREKIFHYGRFDIAVLFYT---FGVRV-R-PVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLS 146 (207) Q Consensus 72 d~~i~KV~Hn~~~D~~~L~~~---~gi~~-~-~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs 146 (207) +. +-|+||+.||+.+|... +|+.+ . ..+||+-.|+.+-....+|+|.+|+++ ||+++. T Consensus 80 ~~--vlVaHNa~FD~~fL~~~~~r~gl~~~~~~~~cTl~LAR~~~p~l~~~kL~~La~~-~gi~~~-------------- 142 (309) T PRK06195 80 DN--LVIAHNASFDISVLRKTLELYNIPMPDFEYICTMKLAKNFYSNIPNARLNTVNNF-LGYEFK-------------- 142 (309) T ss_pred CC--EEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHH-HCCCCC-------------- T ss_conf 99--7999562888999999999849999999777479999986557765788899998-598801-------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHH Q ss_conf 36899998645999999999999999749189999 Q gi|254780636|r 147 DEQLQYAASDVVHLHALRLQFTEKLQRLGRSDLAT 181 (207) Q Consensus 147 ~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~~~~ 181 (207) . +=|..||..+-+++..+.+.+......+|.. T Consensus 143 --h-H~Al~DA~A~a~I~~~~~~~~~~~~l~el~~ 174 (309) T PRK06195 143 --H-HDALEDAMACSNILLNISKELNSKSIEEISK 174 (309) T ss_pred --C-CCCCCCHHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf --1-6851019999999999999818877899976 No 72 >cd06137 DEDDh_RNase This group is composed of eukaryotic exoribonucleases that include PAN2, RNA exonuclease 1 (REX1 or Rex1p), REX3 (Rex3p), REX4 (or Rex4p), ISG20, and similar proteins. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. REX proteins are required for the processing and maturation of many RNA species, and ISG20 is an interferon-induced antiviral exonuclease with a strong preference for single-stranded RNA. Probab=98.58 E-value=1.3e-07 Score=63.43 Aligned_cols=124 Identities=19% Similarity=0.182 Sum_probs=71.6 Q ss_pred EEEECCCCCCCCCCCEEEEEEEEEC-CCCEEEECCCCCC------------CCCH--------------------HHHHH Q ss_conf 7997177789854450799998418-9428997465778------------5706--------------------77677 Q gi|254780636|r 23 IAVDTETLGLMPRRDRLCIVQLSPG-DGTVDIIRIAAGQ------------KNAP--------------------NLVGM 69 (207) Q Consensus 23 iaiDtEt~~l~~~~~~l~LiQl~~~-~~~~~l~~~~~~~------------~~~~--------------------~L~~l 69 (207) ||+|||..|..+....++=|.+-.. ++.+.++.+-+-. ..++ .+.++ T Consensus 1 VAlDCEMv~~~~~~~el~rvs~Vd~~~g~vl~d~~VkP~~~V~Dy~T~~SGIt~~~l~~a~~~~~~~~~~~~a~~~l~~~ 80 (161) T cd06137 1 VALDCEMVGLADGDSEVVRISAVDVLTGEVLIDSLVRPSVRVTDWRTRFSGVTPADLEEAAKAGKTIFGWEAARAALWKF 80 (161) T ss_pred CEEEEEEECCCCCCCEEEEEEEEECCCCEEEEEECCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 97964523322997258999999727991999863359843230143325799999854342153333199999999974 Q ss_pred HHCCCCCEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHH---HCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCC Q ss_conf 7323640230100001688865420000000457898865211---0000008877765420024675200365654432 Q gi|254780636|r 70 LVDEKREKIFHYGRFDIAVLFYTFGVRVRPVFCTKIASRLTRT---YTNQHGLKDNLKELLGINISKAQQSSDWSADDLS 146 (207) Q Consensus 70 l~d~~i~KV~Hn~~~D~~~L~~~~gi~~~~i~DT~ias~l~~~---~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs 146 (207) + +++.+-|||+..+|+++|. +.-..+.||.+.....++ ...+.||+.|++++||.+|-.+. T Consensus 81 i-~~~tIlVGH~L~nDL~aL~----~~H~~viDTa~l~~~~~~~~~~~~~~sLk~La~~~L~~~IQ~g~----------- 144 (161) T cd06137 81 I-DPDTILVGHSLQNDLDALR----MIHTRVVDTAILTREAVKGPLAKRQWSLRTLCRDFLGLKIQGGG----------- 144 (161) T ss_pred C-CCCCEEEECCHHHHHHHHC----CCCCCEEECEEECCHHCCCCCCCCCHHHHHHHHHHHCHHHCCCC----------- T ss_conf 5-8996798510131087862----77983543203320011578888886699999998287752899----------- Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 368999986459999999 Q gi|254780636|r 147 DEQLQYAASDVVHLHALR 164 (207) Q Consensus 147 ~~Qi~YAA~Dv~~l~~L~ 164 (207) +-+=+..||...++|| T Consensus 145 --~gHds~EDA~aam~L~ 160 (161) T cd06137 145 --EGHDSLEDALAAREVV 160 (161) T ss_pred --CCCCCHHHHHHHHHHH T ss_conf --8849199999999985 No 73 >cd06136 TREX1_2 Three prime repair exonuclease (TREX) 1 and 2 are closely related DEDDh-type DnaQ-like 3'-5' exonucleases. They contain three conserved sequence motifs known as ExoI, II, and III, with a specific Hx(4)D conserved pattern at ExoIII. These motifs contain four conserved acidic residues that participate in coordination of divalent metal ions required for catalysis. Both proteins play a role in the metabolism and clearance of DNA. TREX1 is the major 3'-5' exonuclease activity detected in mammalian cells. Mutations in the human TREX1 gene can cause Aicardi-Goutieres syndrome (AGS), which is characterized by perturbed innate immunity and presents itself as a severe neurological disease. TREX1 degrades ssDNA generated by aberrant replication intermediates to prevent checkpoint activation and autoimmune disease. There are distinct structural differences between TREX1 and 2 that point to different biological roles for these proteins. The main difference is the presence of about 7 Probab=98.55 E-value=1.1e-06 Score=58.18 Aligned_cols=79 Identities=24% Similarity=0.256 Sum_probs=51.4 Q ss_pred HHHHHHC--CCCCEEECCC-CHHHHHHHHHH---HCC-CCC--HHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCE Q ss_conf 7677732--3640230100-00168886542---000-000--0457898865211000000887776542002467520 Q gi|254780636|r 66 LVGMLVD--EKREKIFHYG-RFDIAVLFYTF---GVR-VRP--VFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQ 136 (207) Q Consensus 66 L~~ll~d--~~i~KV~Hn~-~~D~~~L~~~~---gi~-~~~--i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q 136 (207) +..++.. ..+.-|+||+ +||..+|...+ |+. |.+ ..||.-+++ . ..++|.+++++++|+..... T Consensus 88 ~~~F~~~~~~~~~LVAHNa~~FD~~fL~~e~~r~g~~~p~~~~~iDtL~l~r---~--~~~~L~~l~~~~~gi~~~~a-- 160 (177) T cd06136 88 IKLFLRRQPKPICLVAHNGNRFDFPILRSELERLGTKLPDDILCVDSLPAFR---E--LDQSLGSLYKRLFGQEPKNS-- 160 (177) T ss_pred HHHHHHHCCCCCEEEECCCHHCCHHHHHHHHHHCCCCCCCCCEEEEEHHHHH---H--HCCCHHHHHHHHCCCCCCCC-- T ss_conf 9999962677867996485123799999999986998998988998607645---5--06999999999759999877-- Q ss_pred ECCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 03656544323689999864599999999 Q gi|254780636|r 137 SSDWSADDLSDEQLQYAASDVVHLHALRL 165 (207) Q Consensus 137 ~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~ 165 (207) +=|..||..|.+++- T Consensus 161 --------------HRAl~Da~at~~vfl 175 (177) T cd06136 161 --------------HTAEGDVLALLKCAL 175 (177) T ss_pred --------------CCCHHHHHHHHHHHC T ss_conf --------------784899999999975 No 74 >KOG2249 consensus Probab=98.50 E-value=1.7e-06 Score=57.00 Aligned_cols=134 Identities=19% Similarity=0.203 Sum_probs=83.3 Q ss_pred CCCEEEECCCCCCCCC-C-CEEEEEEEEECCCCEEEECCCCCC------------------CCCHH-------HHHHHHC Q ss_conf 9827997177789854-4-507999984189428997465778------------------57067-------7677732 Q gi|254780636|r 20 VDAIAVDTETLGLMPR-R-DRLCIVQLSPGDGTVDIIRIAAGQ------------------KNAPN-------LVGMLVD 72 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~-~-~~l~LiQl~~~~~~~~l~~~~~~~------------------~~~~~-------L~~ll~d 72 (207) ..+||+|||..|..|. + +-++=+.|-...+.++++..-+-. .++.. ..++|.+ T Consensus 105 ~r~vAmDCEMVG~Gp~G~~s~lARvSIVN~~G~VvyDkyVkP~~~VtDyRT~vSGIrpehm~~A~pf~~aQ~ev~klL~g 184 (280) T KOG2249 105 TRVVAMDCEMVGVGPDGRESLLARVSIVNYHGHVVYDKYVKPTEPVTDYRTRVSGIRPEHMRDAMPFKVAQKEVLKLLKG 184 (280) T ss_pred CEEEEEEEEEECCCCCCCCEEEEEEEEEECCCCEEEEEECCCCCCCCCCEEEECCCCHHHHCCCCCHHHHHHHHHHHHHC T ss_conf 23899740475147997600034788861468676555037886633103322365877851575389999999999708 Q ss_pred CCCCEEECCCCHHHHHHHHHHHCCCCC-HHHHHHHHHHHH--HHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHH Q ss_conf 364023010000168886542000000-045789886521--10000008877765420024675200365654432368 Q gi|254780636|r 73 EKREKIFHYGRFDIAVLFYTFGVRVRP-VFCTKIASRLTR--TYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQ 149 (207) Q Consensus 73 ~~i~KV~Hn~~~D~~~L~~~~gi~~~~-i~DT~ias~l~~--~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Q 149 (207) +|+ |||+.+.|+++|... .|.. +.||.--.-+.+ ....+.||+.|.+.+||++|--+++ T Consensus 185 -RIl-VGHaLhnDl~~L~l~---hp~s~iRDTs~~~pl~k~~~~~~tpSLK~Lt~~~Lg~~IQ~GeH------------- 246 (280) T KOG2249 185 -RIL-VGHALHNDLQALKLE---HPRSMIRDTSKYPPLMKLLSKKATPSLKKLTEALLGKDIQVGEH------------- 246 (280) T ss_pred -CEE-ECCCCCCHHHHHHHH---CCHHHHCCCCCCCHHHHHHHCCCCCCHHHHHHHHHCHHHHCCCC------------- T ss_conf -777-512200079998553---74355314312754877752159831999999984525431566------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999986459999999999999997 Q gi|254780636|r 150 LQYAASDVVHLHALRLQFTEKLQR 173 (207) Q Consensus 150 i~YAA~Dv~~l~~L~~~l~~~L~~ 173 (207) =..+||..+.+||...+.+-++ T Consensus 247 --sSvEDA~AtM~LY~~vk~qwe~ 268 (280) T KOG2249 247 --SSVEDARATMELYKRVKVQWEK 268 (280) T ss_pred --CCHHHHHHHHHHHHHHHHHHHH T ss_conf --8588999999999999999999 No 75 >cd06130 DNA_pol_III_epsilon_like This subfamily is composed of uncharacterized bacterial proteins with similarity to the epsilon subunit of DNA polymerase III (polIII), a multisubunit polymerase which is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. The pol III holoenzyme is a complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The polIII epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon subunit of polIII also functions as a stabilizer of the holoenzyme complex Probab=98.40 E-value=1.5e-06 Score=57.30 Aligned_cols=119 Identities=23% Similarity=0.288 Sum_probs=70.1 Q ss_pred CEEEECCCCCCCCCCCEEEEEEEEE---CCCCE-----EEECCCCC-CC--------------C-------CHHHHHHHH Q ss_conf 2799717778985445079999841---89428-----99746577-85--------------7-------067767773 Q gi|254780636|r 22 AIAVDTETLGLMPRRDRLCIVQLSP---GDGTV-----DIIRIAAG-QK--------------N-------APNLVGMLV 71 (207) Q Consensus 22 ~iaiDtEt~~l~~~~~~l~LiQl~~---~~~~~-----~l~~~~~~-~~--------------~-------~~~L~~ll~ 71 (207) .++||+||||.. ++++ |||+. .++.. .++..... .. . .+.+.+++. T Consensus 1 fvv~D~ETT~~~--~d~I--ieIgav~i~~g~i~~~f~~lv~P~~~i~~~~~~ihGIt~~~v~~~p~~~~v~~~l~~~l~ 76 (156) T cd06130 1 FVAIDFETANAD--RASA--CSIGLVKVRDGQIVDTFYTLIRPPTRFDPFNIAIHGITPEDVADAPTFPEVWPEIKPFLG 76 (156) T ss_pred CEEEEEECCCCC--CCEE--EEEEEEEEECCEEEEEEEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHCC T ss_conf 989997789899--9969--999999999999989999987789879988932208887998409999999999998558 Q ss_pred CCCCCEEECCCCHHHHHHHHHH---HCCC--CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCC Q ss_conf 2364023010000168886542---0000--0004578988652110000008877765420024675200365654432 Q gi|254780636|r 72 DEKREKIFHYGRFDIAVLFYTF---GVRV--RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLS 146 (207) Q Consensus 72 d~~i~KV~Hn~~~D~~~L~~~~---gi~~--~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs 146 (207) +. +-|+||+.||..+|.+.+ |+.+ .+++||+-.++.+.....+++|.+++++ +|++.+ . T Consensus 77 ~~--~lVaHn~~FD~~fL~~~~~~~~~~~~~~~~iDtl~l~r~~~~~~~~~~L~~l~~~-~gi~~~-~------------ 140 (156) T cd06130 77 GS--LVVAHNASFDRSVLRAALEAYGLPPPPYQYLCTVRLARRVWPLLPNHKLNTVAEH-LGIELN-H------------ 140 (156) T ss_pred CC--EEEEECHHHHHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHH-CCCCCC-C------------ T ss_conf 98--8999577889999999999819999898588189999997388888999999998-599987-8------------ Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 368999986459999999 Q gi|254780636|r 147 DEQLQYAASDVVHLHALR 164 (207) Q Consensus 147 ~~Qi~YAA~Dv~~l~~L~ 164 (207) +=|..||..+-+|| T Consensus 141 ----H~Al~DA~ata~l~ 154 (156) T cd06130 141 ----HDALEDARACAEIL 154 (156) T ss_pred ----CCCHHHHHHHHHHH T ss_conf ----79689999999998 No 76 >cd06138 ExoI_N This subfamily is composed of the N-terminal domain of E. coli exonuclease I (ExoI) and similar proteins. ExoI is a monomeric enzyme that hydrolyzes single stranded DNA in the 3' to 5' direction. It plays a role in DNA recombination and repair. It primarily functions in repairing frameshift mutations. The N-terminal domain of ExoI is a DEDDh-type DnaQ-like 3'-5 exonuclease containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The ExoI structure is unique among DnaQ family enzymes in that there is a large distance between the two metal ions required for catalysis and the catalytic histidine is oriented away from the active site. Probab=98.35 E-value=8.9e-06 Score=52.84 Aligned_cols=123 Identities=22% Similarity=0.153 Sum_probs=65.5 Q ss_pred EEEECCCCCCCCCCCEEEEEEEEE----CC----CCE-EEECCCCCCCC-------------------------CHHHHH Q ss_conf 799717778985445079999841----89----428-99746577857-------------------------067767 Q gi|254780636|r 23 IAVDTETLGLMPRRDRLCIVQLSP----GD----GTV-DIIRIAAGQKN-------------------------APNLVG 68 (207) Q Consensus 23 iaiDtEt~~l~~~~~~l~LiQl~~----~~----~~~-~l~~~~~~~~~-------------------------~~~L~~ 68 (207) |-+|+||||+++.++++ ||+|. .+ +.+ ..+.+...... .+.+.+ T Consensus 1 i~fD~ETTGl~~~~d~I--iqiaai~~d~~~~~~~~~~~~i~p~~~~~p~p~a~~v~git~~~~~~~~~s~~e~~~~i~~ 78 (183) T cd06138 1 LFYDYETFGLNPSFDQI--LQFAAIRTDENFNEIEPFNIFCRLPPDVLPSPEALIVTGITPQQLLKEGLSEYEFIAKIHR 78 (183) T ss_pred CEEEECCCCCCCCCCCE--EEEEEEEECCCCCCCCEEEEEECCCCCCCCCHHHHHHCCCCHHHHHHCCCCHHHHHHHHHH T ss_conf 98970389979998760--6999999989987864468874048999988479987188999975318877899999999 Q ss_pred HHHCCCCCEEECC-CCHHHHHHHHHH---HCCCC-------C-HHHHH---HHHHHHHHH----------CCCHHHHHHH Q ss_conf 7732364023010-000168886542---00000-------0-04578---988652110----------0000088777 Q gi|254780636|r 69 MLVDEKREKIFHY-GRFDIAVLFYTF---GVRVR-------P-VFCTK---IASRLTRTY----------TNQHGLKDNL 123 (207) Q Consensus 69 ll~d~~i~KV~Hn-~~~D~~~L~~~~---gi~~~-------~-i~DT~---ias~l~~~~----------~~~~~L~~L~ 123 (207) ++..+...-|||| .+||..+|...+ +..|- + -+|+. -+++.+.+. ..++.|.+|+ T Consensus 79 ~~~~~~~i~vg~N~~~FD~~fL~~~~~r~~~~p~~~~~~~~~~~~D~l~~~r~~~~~~p~~i~~~~~~~g~~s~kL~~l~ 158 (183) T cd06138 79 LFNTPGTCIVGYNNIRFDDEFLRFAFYRNLYDPYTWEWKNGNSRWDLLDVVRAYYALRPDGIVWPKNDDGKPSFKLEDLA 158 (183) T ss_pred HHCCCCCEEEEECCHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHH T ss_conf 86169977998453556899999999982787156675278712307889999999782014476556897411299999 Q ss_pred HHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 65420024675200365654432368999986459999999 Q gi|254780636|r 124 KELLGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALR 164 (207) Q Consensus 124 ~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~ 164 (207) +. +|+..+ +-+=|..||..+.+|. T Consensus 159 ~~-~gi~~~----------------~aH~Al~Dv~aT~~l~ 182 (183) T cd06138 159 QA-NGIEHS----------------NAHDALSDVEATIALA 182 (183) T ss_pred HH-CCCCCC----------------CCCCCHHHHHHHHHHH T ss_conf 98-499998----------------8868289999999986 No 77 >cd06149 ISG20 Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20) is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. It was also independently identified by its response to estrogen and was called HEM45 (human estrogen regulated transcript). ISG20 is a DEDDh-type DnaQ-like 3'-5' exonuclease containing three conserved sequence motifs termed ExoI, ExoII and ExoIII with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ISG20 may be a major effector of innate immunity against pathogens including viruses, bacteria, and parasites. It is located in promyelocytic leukemia (PML) nuclear bodies, sites for oncogenic DNA viral transcription and replication. It may carry out its function by degrading viral RNAs as part of the IFN-regulated antiviral r Probab=98.34 E-value=1.9e-06 Score=56.76 Aligned_cols=125 Identities=20% Similarity=0.130 Sum_probs=71.0 Q ss_pred EEEECCCCCCCCCCC--EEEEEEEEECCCCEEEECCCCCCCC------------------C-------HHHHHHHHCCCC Q ss_conf 799717778985445--0799998418942899746577857------------------0-------677677732364 Q gi|254780636|r 23 IAVDTETLGLMPRRD--RLCIVQLSPGDGTVDIIRIAAGQKN------------------A-------PNLVGMLVDEKR 75 (207) Q Consensus 23 iaiDtEt~~l~~~~~--~l~LiQl~~~~~~~~l~~~~~~~~~------------------~-------~~L~~ll~d~~i 75 (207) ||+|||..+..+... .++-|.+-..++.+.++.+-+-... + ..+.+++.+. | T Consensus 1 vAlDCEMv~t~~g~~~~~LaRvsvVd~~G~vl~D~~VkP~~~V~dy~T~~SGIt~~~l~~a~~~~~~~~~l~~~l~~~-I 79 (157) T cd06149 1 VAIDCEMVGTGPGGRESELARCSIVNYHGDVLYDKYIRPEGPVTDYRTRWSGIRRQHLVNATPFAVAQKEILKILKGK-V 79 (157) T ss_pred CEEECEEEEECCCCCEEEEEEEEEEECCCCEEEEEEECCCCCCCCCEEECCCCCHHHHCCCCCHHHHHHHHHHHHCCC-E T ss_conf 979365342748998468999999927898986743388987786365626788889579984999999999971698-8 Q ss_pred CEEECCCCHHHHHHHHHHHCCCC-CHHHHHHH---HHHHH-HHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHH Q ss_conf 02301000016888654200000-00457898---86521-100000088777654200246752003656544323689 Q gi|254780636|r 76 EKIFHYGRFDIAVLFYTFGVRVR-PVFCTKIA---SRLTR-TYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQL 150 (207) Q Consensus 76 ~KV~Hn~~~D~~~L~~~~gi~~~-~i~DT~ia---s~l~~-~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi 150 (207) + |||+..+|+++|.- +.|+ .+.||-.. .+... +...+.||+.|++++||.++-.+.+ - T Consensus 80 l-VGH~L~nDL~aL~l---~h~~~~i~DT~~~~~~~~~~~~p~~~~~sLk~L~~~~L~~~IQ~~~~-------------g 142 (157) T cd06149 80 V-VGHAIHNDFKALKY---FHPKHMTRDTSTIPLLNRKAGFPENCRVSLKVLAKRLLHRDIQVGRQ-------------G 142 (157) T ss_pred E-EEECCHHHHHHHHC---CCCCCCEEECCCCEEHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCC-------------C T ss_conf 9-98260766988603---38987267771110013324898766875999999984850259998-------------8 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999864599999999 Q gi|254780636|r 151 QYAASDVVHLHALRL 165 (207) Q Consensus 151 ~YAA~Dv~~l~~L~~ 165 (207) +=+.+||..+++||. T Consensus 143 Hds~EDA~aa~~L~k 157 (157) T cd06149 143 HSSVEDARATMELYK 157 (157) T ss_pred CCHHHHHHHHHHHHC T ss_conf 080999999999859 No 78 >cd06145 REX1_like This subfamily is composed of RNA exonuclease 1 (REX1 or Rex1p), REX3 (or Rex3p), and similar eukaryotic proteins. In yeast, REX1 and REX3 are required for 5S rRNA and MRP (mitochondrial RNA processing) RNA maturation, respectively. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. REX1 is the major exonuclease responsible for pre-tRNA trail trimming and may also be involved in nuclear CCA turnover. REX proteins function in the processing and maturation of many RNA species, similar to the function of E.coli RNase T. Probab=98.29 E-value=1.5e-06 Score=57.26 Aligned_cols=123 Identities=20% Similarity=0.128 Sum_probs=68.4 Q ss_pred EEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCC------------------CC--------HHHHHHHHCCCCC Q ss_conf 799717778985445079999841894289974657785------------------70--------6776777323640 Q gi|254780636|r 23 IAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQK------------------NA--------PNLVGMLVDEKRE 76 (207) Q Consensus 23 iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~------------------~~--------~~L~~ll~d~~i~ 76 (207) +|+|||-.+.... ..++=|.+-..++.+.++.+-+... .+ ..+++++ +++.+ T Consensus 1 ~AlDCEMv~t~~g-~~LaRvsvVd~~g~vl~D~~V~P~~~V~DyrT~~SGIt~~~l~~a~~~~~~~~~~l~~li-~~~~I 78 (150) T cd06145 1 FALDCEMCYTTDG-LELTRVTVVDENGKVVLDELVKPDGEIVDYNTRFSGITEEMLENVTTTLEDVQKKLLSLI-SPDTI 78 (150) T ss_pred CCCCCEEEEECCC-CEEEEEEEECCCCCEEEEEEECCCCCCEECCCCCCCCCHHHHHCCCCCHHHHHHHHHHHC-CCCCE T ss_conf 9561367752799-889999999389989877433499772132010457898898546588999999999875-99988 Q ss_pred EEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHH Q ss_conf 23010000168886542000000045789886521100000088777654200246752003656544323689999864 Q gi|254780636|r 77 KIFHYGRFDIAVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAASD 156 (207) Q Consensus 77 KV~Hn~~~D~~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~D 156 (207) -|||+..+|+++|. +.-..+.||-+.....++.+.+.||+.|++.+||.++-.+.+ -+=+.+| T Consensus 79 lVGH~L~nDL~aL~----i~H~~v~DTa~l~~~~~~~~~~~sLk~L~~~~L~~~IQ~~~~-------------gHdsvED 141 (150) T cd06145 79 LVGHSLENDLKALK----LIHPRVIDTAILFPHPRGPPYKPSLKNLAKKYLGRDIQQGEG-------------GHDSVED 141 (150) T ss_pred EEECCCCCCHHHHH----CCCCCEEEHHHHHHHCCCCCCCHHHHHHHHHHHCHHHCCCCC-------------CCCCHHH T ss_conf 99626434777761----558761571662110379998978999999982877439998-------------8183999 Q ss_pred HHHHHHHH Q ss_conf 59999999 Q gi|254780636|r 157 VVHLHALR 164 (207) Q Consensus 157 v~~l~~L~ 164 (207) |..+++|+ T Consensus 142 A~aam~L~ 149 (150) T cd06145 142 ARAALELV 149 (150) T ss_pred HHHHHHHH T ss_conf 99999975 No 79 >cd06143 PAN2_exo PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. PAN catalyzes the deadenylation of poly(A) tails, which are initially synthesized to default lengths of 70 to 90, to mRNA-specific lengths of 55 to 71. Pab1p and PAN also play a role in the export and decay of mRNA. PAN2 contains a DEDDh-type DnaQ-like 3'-5' exonuclease domain with three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=98.28 E-value=1.7e-06 Score=57.01 Aligned_cols=75 Identities=20% Similarity=-0.028 Sum_probs=49.9 Q ss_pred HCCCCCEEECCCCHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHH Q ss_conf 323640230100001688865420000-0004578988652110000008877765420024675200365654432368 Q gi|254780636|r 71 VDEKREKIFHYGRFDIAVLFYTFGVRV-RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQ 149 (207) Q Consensus 71 ~d~~i~KV~Hn~~~D~~~L~~~~gi~~-~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Q 149 (207) -++..+-|||+...|+.+|.-. .| ..+.||...... +...+.||+.|++.+||.+|-.+ T Consensus 99 i~~~~ILVGHsL~nDL~aL~l~---hP~~~i~DTs~l~~~--~~~r~~sLk~La~~~Lg~~IQ~g--------------- 158 (174) T cd06143 99 VDLGCIFVGHGLAKDFRVINIQ---VPKEQVIDTVELFHL--PGQRKLSLRFLAWYLLGEKIQSE--------------- 158 (174) T ss_pred CCCCCEEECCCCHHHHHHHCCC---CCCCCEEECHHHCCC--CCCCCCCHHHHHHHHCCCCCCCC--------------- T ss_conf 4799689789636679885066---998665777885268--99887089999999808104698--------------- Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999864599999999 Q gi|254780636|r 150 LQYAASDVVHLHALRL 165 (207) Q Consensus 150 i~YAA~Dv~~l~~L~~ 165 (207) -+=..+||...+.||. T Consensus 159 ~HdSvEDArAam~LYr 174 (174) T cd06143 159 THDSIEDARTALKLYR 174 (174) T ss_pred CCCCHHHHHHHHHHHC T ss_conf 9593999999999629 No 80 >KOG2405 consensus Probab=98.24 E-value=4.8e-07 Score=60.17 Aligned_cols=150 Identities=15% Similarity=0.074 Sum_probs=95.1 Q ss_pred CEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCC-HHHHHHHHCCCCCEEECCCCHHHHHHHHHHHCCCCCH Q ss_conf 279971777898544507999984189428997465778570-6776777323640230100001688865420000000 Q gi|254780636|r 22 AIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNA-PNLVGMLVDEKREKIFHYGRFDIAVLFYTFGVRVRPV 100 (207) Q Consensus 22 ~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~-~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~~~~i 100 (207) +|+.|.-+. +.....++.++++|+++..+.++-+..+..-. .+.|..+|...+ |. +++.+..-+...+++...|+ T Consensus 197 ~i~~~~~s~-~~~~~~e~~~m~ia~~n~i~llD~~~sdi~il~~gyK~~LEs~~~--vi-Dr~r~~e~l~~~y~~~L~nV 272 (458) T KOG2405 197 FIRPVSPSL-LKILALEATYMNIADGNEIFLLDSLPSDIRILFGGYKRELESLEK--VI-DRIRLIEQLDTTYHSALKNV 272 (458) T ss_pred EEECCCHHH-HHHHHHHHHHHHHCCCCHHHHHHHCCCCCEEECCCCHHHHHHCCE--EH-HHHHHHHHHHHHHHHHHHHH T ss_conf 860376447-776234532001125303332232357847750222055662321--04-23300488776889999856 Q ss_pred HHHHHHHHHHHH--HCCCHHHHHHHH--------HHHCCCCCC-------CCEEC-CCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 457898865211--000000887776--------542002467-------52003-656544323689999864599999 Q gi|254780636|r 101 FCTKIASRLTRT--YTNQHGLKDNLK--------ELLGINISK-------AQQSS-DWSADDLSDEQLQYAASDVVHLHA 162 (207) Q Consensus 101 ~DT~ias~l~~~--~~~~~~L~~L~~--------~~lg~~ldK-------~~q~S-dW~~rpLs~~Qi~YAA~Dv~~l~~ 162 (207) +|||||+.++.. ++.++....++- .|++...-+ -.+.- +|..||-++...+-++.||.+|+. T Consensus 273 kDtQia~sLve~~e~grr~p~~~lIsft~Lq~~~~y~~~s~~~~eev~~~l~~dp~~w~irp~te~~~~~~h~dv~~Ll~ 352 (458) T KOG2405 273 KDTQIASSLVEPSEYGRRHPTSILISFTCLQTYIFYIKASGLIFEEVAKILEADPPRWVIRPSTEIADHLLHRDVISLLG 352 (458) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEHHHHHHHHHHHHHHHCCCCCCEECCCHHHHHHHHHHHHHHHHH T ss_conf 88999998751687455687655056675266514403345468888888723897633366378999998888999999 Q ss_pred HHHHHHHHHHHCCCHHH Q ss_conf 99999999997491899 Q gi|254780636|r 163 LRLQFTEKLQRLGRSDL 179 (207) Q Consensus 163 L~~~l~~~L~~~~~~~~ 179 (207) .++. +......++ T Consensus 353 ~~~~----l~a~~l~HL 365 (458) T KOG2405 353 IFDT----LVAVCLSHL 365 (458) T ss_pred HHHH----HHHHCHHHH T ss_conf 9865----764083753 No 81 >PRK07983 exodeoxyribonuclease X; Provisional Probab=98.21 E-value=2e-05 Score=50.74 Aligned_cols=125 Identities=22% Similarity=0.193 Sum_probs=71.5 Q ss_pred CEEEECCCCCCCCCCCEEEEEEEEE---CCCCE-----EEECCCCC---------------CCCCHHHHHH---HHCCCC Q ss_conf 2799717778985445079999841---89428-----99746577---------------8570677677---732364 Q gi|254780636|r 22 AIAVDTETLGLMPRRDRLCIVQLSP---GDGTV-----DIIRIAAG---------------QKNAPNLVGM---LVDEKR 75 (207) Q Consensus 22 ~iaiDtEt~~l~~~~~~l~LiQl~~---~~~~~-----~l~~~~~~---------------~~~~~~L~~l---l~d~~i 75 (207) .+-|||||||+... .||++. .++.. .++..... -..+|.+.++ +.+..+ T Consensus 2 ~~VlDtETTGl~~~-----IiEia~v~v~~g~i~~~~~~linP~~pI~~~a~~ihgIT~emv~~aP~f~ev~~~~~~~~~ 76 (219) T PRK07983 2 LRIIDTETCGLQGG-----IVEIASVDVIDGKIVNPMSHLVRPDRPISPQAMAIHRITEAMVADKPWIEDVIPHYYGSEW 76 (219) T ss_pred EEEEEEECCCCCCC-----CEEEEEEEEECCEEEEEEEEEECCCCCCCHHHHHHCCCCHHHHCCCCCHHHHHHHHCCCCE T ss_conf 59996347999998-----4899999999999976689998968969888998729899998689988999998628987 Q ss_pred CEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHH Q ss_conf 02301000016888654200000004578988652110000008877765420024675200365654432368999986 Q gi|254780636|r 76 EKIFHYGRFDIAVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQYAAS 155 (207) Q Consensus 76 ~KV~Hn~~~D~~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~ 155 (207) + |+||+.||..+|..- + ....||.-.++.+-+. .+||+.+|.. .++....+... + .-+=|-. T Consensus 77 l-VaHNa~FD~~~L~~~-~---~~~IcTl~lAR~l~p~-~~~~l~~Lry-~~~~~~~~~~~--------~---~AHrAl~ 138 (219) T PRK07983 77 Y-VAHNASFDRRVLPEM-P---GEWICTMKLARRLWPG-IKYSNMALYK-SRKLNVQTPPG--------L---HHHRALY 138 (219) T ss_pred E-EECCCCCCHHHHCCC-C---CCEEEHHHHHHHHCCC-CCHHHHHHHH-HHCCCCCCCCC--------C---CCCCHHH T ss_conf 8-753651038664155-8---7566578999987677-4241899999-84276133244--------3---3343078 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 45999999999999 Q gi|254780636|r 156 DVVHLHALRLQFTE 169 (207) Q Consensus 156 Dv~~l~~L~~~l~~ 169 (207) ||.++..++..+.+ T Consensus 139 Dv~v~~~ll~~l~~ 152 (219) T PRK07983 139 DCYITAALLIDIMR 152 (219) T ss_pred HHHHHHHHHHHHHH T ss_conf 79999999999998 No 82 >cd06133 ERI-1_3'hExo_like This subfamily is composed of Caenorhabditis elegans ERI-1, human 3' exonuclease (3'hExo), Drosophila exonuclease snipper (snp), and similar proteins from eukaryotes and bacteria. These are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ERI-1 has been implicated in the degradation of small interfering RNAs (RNAi). 3'hExo participates in the degradation of histone mRNAs. Snp is a non-essential exonuclease that efficiently degrades structured RNA and DNA substrates as long as there is a minimum of 2 nucleotides in the 3' overhang to initiate degradation. Snp is not a functional homolog of either ERI-1 or 3'hExo. Probab=98.19 E-value=4.4e-05 Score=48.82 Aligned_cols=130 Identities=13% Similarity=0.169 Sum_probs=74.7 Q ss_pred CEEEECCCCCCCCCCCE---EEEEEEEE-----CCCCE-----EEECCCCCCCC-----------------C-------H Q ss_conf 27997177789854450---79999841-----89428-----99746577857-----------------0-------6 Q gi|254780636|r 22 AIAVDTETLGLMPRRDR---LCIVQLSP-----GDGTV-----DIIRIAAGQKN-----------------A-------P 64 (207) Q Consensus 22 ~iaiDtEt~~l~~~~~~---l~LiQl~~-----~~~~~-----~l~~~~~~~~~-----------------~-------~ 64 (207) .|.||+||||+++.+++ --+|||+. ..+.. .++.+...... + + T Consensus 1 yvv~D~EtTg~~~~~~~~~~~eIIeIgav~vd~~~~~i~~~f~~lI~P~~~~~i~~~i~~itGIt~~~l~~ap~~~~v~~ 80 (176) T cd06133 1 YLVIDFEATCWEGNSKPDYPNEIIEIGAVLVDVKTKEIIDTFSSYVKPVINPKLSDFCTELTGITQEDVDNAPSFPEVLK 80 (176) T ss_pred CEEEEEECCCCCCCCCCCCCCCEEEEEEEEEECCCCEEEEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHCCCCHHHHHH T ss_conf 98999726899878898999707999999998799979899999975876887898899773818878707863999999 Q ss_pred HHHHHHHCCCCCEEECCCCHHHHHHHHHH---HCC-C----CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCE Q ss_conf 77677732364023010000168886542---000-0----000457898865211000000887776542002467520 Q gi|254780636|r 65 NLVGMLVDEKREKIFHYGRFDIAVLFYTF---GVR-V----RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQ 136 (207) Q Consensus 65 ~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~-~----~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q 136 (207) .+.+++.+.......|+++||+.++.+.+ +.. . ...+||+-.....-+...++||.+++++ +|+..+. T Consensus 81 ~f~~~i~~~~~~~~~~~~~fD~~~l~~~~~~~~~~~~~~~~~~~iD~~~~~~~~~~~~~~~sL~~l~~~-~gi~~~~--- 156 (176) T cd06133 81 EFLEWLGKNGKYAFVTWGDWDLKDLLQNQCKYKIINLPPFFRQWIDLKKEFAKFYGLKKRTGLSKALEY-LGLEFEG--- 156 (176) T ss_pred HHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHH-CCCCCCC--- T ss_conf 999997269857999606002999999999978998873011120499999998188889899999998-6999999--- Q ss_pred ECCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0365654432368999986459999999999 Q gi|254780636|r 137 SSDWSADDLSDEQLQYAASDVVHLHALRLQF 167 (207) Q Consensus 137 ~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l 167 (207) +-+-|..||..+.+++.+| T Consensus 157 ------------~~H~AL~DA~~ta~v~~~l 175 (176) T cd06133 157 ------------RHHRGLDDARNIARILKRL 175 (176) T ss_pred ------------CCCCCHHHHHHHHHHHHHH T ss_conf ------------8858599999999999987 No 83 >cd05781 DNA_polB_B3_exo The 3'-5' exonuclease domain of archaeal proteins with similarity to Sulfurisphaera ohwakuensis DNA polymerase B3. B3 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B3 exhibits both polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Archaeal proteins that are involved in DNA replication are similar to those from eukaryotes. Some archaea possess multiple family-B DNA polymerases. B3 is mainly found in crenarch Probab=98.15 E-value=3.4e-05 Score=49.43 Aligned_cols=145 Identities=19% Similarity=0.234 Sum_probs=81.1 Q ss_pred CCCEEEECCCCCC----CCCCCEEEEEEEEECCCCEEEECCCCCC--CCCHHHHHHH--HCCCCCEEECCC-CHHHHHHH Q ss_conf 9827997177789----8544507999984189428997465778--5706776777--323640230100-00168886 Q gi|254780636|r 20 VDAIAVDTETLGL----MPRRDRLCIVQLSPGDGTVDIIRIAAGQ--KNAPNLVGML--VDEKREKIFHYG-RFDIAVLF 90 (207) Q Consensus 20 ~~~iaiDtEt~~l----~~~~~~l~LiQl~~~~~~~~l~~~~~~~--~~~~~L~~ll--~d~~i~KV~Hn~-~~D~~~L~ 90 (207) -++++||.|+.+- ++.++++..|++++.++...++...... .....+.+++ .||+|+ +|+|. .||+..|. T Consensus 3 lk~lsfDIE~~~~~gfp~~~~d~Ii~Is~~~~~g~~~~~~~~~~~E~~ll~~f~~~i~~~~Pdii-~gyN~~~FD~pyl~ 81 (188) T cd05781 3 LKTLAFDIEVYSKYGTPNPRRDPIIVISLATSNGDVEFILAEGLDDRKIIREFVKYVKEYDPDII-VGYNSNAFDWPYLV 81 (188) T ss_pred CEEEEEEEEECCCCCCCCCCCCCEEEEEEEECCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCEE-EECCCCCCCHHHHH T ss_conf 41999998876899989988897899999968997699984799989999999999987299999-96698675789999 Q ss_pred HHH---HCCC--------------------CC--HHHHHHHHHHHHHHCCCHHHHHHHHHHHCCC--CCCCCE-----EC Q ss_conf 542---0000--------------------00--0457898865211000000887776542002--467520-----03 Q gi|254780636|r 91 YTF---GVRV--------------------RP--VFCTKIASRLTRTYTNQHGLKDNLKELLGIN--ISKAQQ-----SS 138 (207) Q Consensus 91 ~~~---gi~~--------------------~~--i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~--ldK~~q-----~S 138 (207) ... |+.. .+ .+|+.-..+-. -...+++|.++++ +||.. -+|... .. T Consensus 82 ~Ra~~~~i~l~~gr~~~~~~~~~~~~~~~i~Gr~~iDl~~~~~~~-~~l~sysL~~Va~-~lg~~~k~e~~~~~~~~I~~ 159 (188) T cd05781 82 ERARVLGVKLDVGRRGGSEPSTGVYGHYSITGRLNVDLYDFAEEI-PEVKVKTLENVAE-YLGVMKKSERVLIEWYRIYE 159 (188) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECHHHHHHH-CCCCCCCHHHHHH-HHCCCCCCCCCCCCHHHHHH T ss_conf 999996998501556776512254206998789999778999762-4756457999999-85412245446579999999 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 65654432368999986459999999999 Q gi|254780636|r 139 DWSADDLSDEQLQYAASDVVHLHALRLQF 167 (207) Q Consensus 139 dW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l 167 (207) -|...+-.+.-..|+..||..+++|+++| T Consensus 160 ~~~~~~~r~~l~~YnlqDa~L~~~L~~KL 188 (188) T cd05781 160 YWDDEKKRDILLKYNRDDARSTYGLAEKL 188 (188) T ss_pred HHHCCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 87084121699999788799999998429 No 84 >COG0847 DnaQ DNA polymerase III, epsilon subunit and related 3'-5' exonucleases [DNA replication, recombination, and repair] Probab=98.10 E-value=5.9e-05 Score=48.07 Aligned_cols=140 Identities=22% Similarity=0.291 Sum_probs=81.2 Q ss_pred CCCEEEECCCCCCCCCCCEE---EEEEEEEC---CCCE-EEECCCCC-CCC--------------CH-------HHHHHH Q ss_conf 98279971777898544507---99998418---9428-99746577-857--------------06-------776777 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRL---CIVQLSPG---DGTV-DIIRIAAG-QKN--------------AP-------NLVGML 70 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l---~LiQl~~~---~~~~-~l~~~~~~-~~~--------------~~-------~L~~ll 70 (207) ...+.||+||+|+++.++++ ..+++... +..+ .++..... ... .| .+..++ T Consensus 13 ~~~vv~D~ETtg~~~~~~~iieIgav~~~~~~i~~~~~~~~v~P~~~i~~~~~~i~git~e~l~~~p~~~~v~~~~~~~i 92 (243) T COG0847 13 TRFVVIDLETTGLNPKKDRIIEIGAVTLEDGRIVERSFHTLVNPERPIPPEIFKIHGITDEMLADAPKFAEVLPEFLDFI 92 (243) T ss_pred CCEEEEECCCCCCCCCCCCEEEEEEEEEECCCCCCCCCCEEECCCCCCCCCCCCCCCCCHHHHHCCCCHHHHHHHHHHHH T ss_conf 75799966148988778854899659967784024311146688987875431147738899747863899999999984 Q ss_pred HCCCCCEEECCCCHHHHHHHHHHHC---CC--CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCC Q ss_conf 3236402301000016888654200---00--000457898865211000000887776542002467520036565443 Q gi|254780636|r 71 VDEKREKIFHYGRFDIAVLFYTFGV---RV--RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDL 145 (207) Q Consensus 71 ~d~~i~KV~Hn~~~D~~~L~~~~gi---~~--~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpL 145 (207) .+.. .-|+||+.||+.++...+.. .+ ..+.||.-.++-......+++|..|++ .+|+..+ ..+ + T Consensus 93 ~~~~-~~Vahna~fD~~fl~~~~~~~~~~~~~~~~~~t~~~~r~~~~~~~~~~L~~l~~-~~gi~~~-~~~-------~- 161 (243) T COG0847 93 GGLR-LLVAHNAAFDVGFLRVESERLGIEIPGDPVLDTLALARRHFPGFDRSSLDALAE-RLGIDRN-PFH-------P- 161 (243) T ss_pred CCCC-EEEEEEHHHCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHH-HCCCCCC-CCC-------C- T ss_conf 7888-699950530889999899970898757642007999998769976134899999-6298877-777-------6- Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHH-HHHCC Q ss_conf 2368999986459999999999999-99749 Q gi|254780636|r 146 SDEQLQYAASDVVHLHALRLQFTEK-LQRLG 175 (207) Q Consensus 146 s~~Qi~YAA~Dv~~l~~L~~~l~~~-L~~~~ 175 (207) +=|..|+..+-.+|..+... +.... T Consensus 162 -----H~Al~Da~~~a~~~~~~~~~~~~~~~ 187 (243) T COG0847 162 -----HRALFDALALAELFLLLQTGLLLKAP 187 (243) T ss_pred -----CCHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf -----61678899999999998620001011 No 85 >COG3359 Predicted exonuclease [DNA replication, recombination, and repair] Probab=97.88 E-value=0.00027 Score=44.18 Aligned_cols=153 Identities=20% Similarity=0.189 Sum_probs=91.5 Q ss_pred CCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEE-CCCCC--CCCCHHHHHHHHCCCC-CEE-ECCCCHHHHHHHH---- Q ss_conf 8279971777898544507999984189428997-46577--8570677677732364-023-0100001688865---- Q gi|254780636|r 21 DAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDII-RIAAG--QKNAPNLVGMLVDEKR-EKI-FHYGRFDIAVLFY---- 91 (207) Q Consensus 21 ~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~-~~~~~--~~~~~~L~~ll~d~~i-~KV-~Hn~~~D~~~L~~---- 91 (207) .+.-||+||||+++..+.+.++..+.+.+....+ +.... ......+..++-+++. ..| +-+.+||..++++ T Consensus 99 ~~~FFDiETTGL~~ag~~I~~~g~a~~~~~~~~Vrq~~lp~p~~E~avle~fl~~~~~~~lvsfNGkaFD~PfikR~v~~ 178 (278) T COG3359 99 DVAFFDIETTGLDRAGNTITLVGGARGVDDTMHVRQHFLPAPEEEVAVLENFLHDPDFNMLVSFNGKAFDIPFIKRMVRD 178 (278) T ss_pred CEEEEEEECCCCCCCCCEEEEEEEEECCCCEEEEEEECCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHC T ss_conf 25899633146688898599998887357569997512798413568999975389845599746832670899998751 Q ss_pred HHHCCCCC-HHHHHHHHHHHHHH-CCCHHHHHHHHHHHCCCCCCCCEE-------CCCCC---CCCCHHHHHHHHHHHHH Q ss_conf 42000000-04578988652110-000008877765420024675200-------36565---44323689999864599 Q gi|254780636|r 92 TFGVRVRP-VFCTKIASRLTRTY-TNQHGLKDNLKELLGINISKAQQS-------SDWSA---DDLSDEQLQYAASDVVH 159 (207) Q Consensus 92 ~~gi~~~~-i~DT~ias~l~~~~-~~~~~L~~L~~~~lg~~ldK~~q~-------SdW~~---rpLs~~Qi~YAA~Dv~~ 159 (207) ++...+.. =||.+-.++-+... -.+-||+.. +++||++=+-..-- -.|.+ ..|-..-+.|--.||.- T Consensus 179 ~~el~l~~~H~DL~h~~RRlwk~~l~~c~Lk~V-Er~LGi~R~edtdG~~~p~lyr~~~~~~dp~ll~~l~~hN~eDvln 257 (278) T COG3359 179 RLELSLEFGHFDLYHPSRRLWKHLLPRCGLKTV-ERILGIRREEDTDGYDGPELYRLYRRYGDPGLLDGLVLHNREDVLN 257 (278) T ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHH-HHHHCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHCCHHHHHH T ss_conf 644476666312210344531202878881367-8885755011589850199999998709877778988706998875 Q ss_pred HHHHHHHHHHHHHHC Q ss_conf 999999999999974 Q gi|254780636|r 160 LHALRLQFTEKLQRL 174 (207) Q Consensus 160 l~~L~~~l~~~L~~~ 174 (207) |..|+..+.+.+.+. T Consensus 258 L~~i~~h~~~~i~~~ 272 (278) T COG3359 258 LPTIIKHVSKKILED 272 (278) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999999987 No 86 >TIGR00573 dnaq exonuclease, DNA polymerase III, epsilon subunit family; InterPro: IPR006054 All proteins in this entry for which functions are known are components of the DNA polymerase III complex (epsilon subunit). There is, however, an outgroup that includes paralogs in some gamma-proteobacteria and the N-terminal region of DinG from some low GC Gram-positive bacteria. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=97.80 E-value=0.00054 Score=42.45 Aligned_cols=136 Identities=21% Similarity=0.236 Sum_probs=82.2 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEE---------ECCC-CEEEECCCCCCCC--------------CHH-------HHH Q ss_conf 98279971777898544507999984---------1894-2899746577857--------------067-------767 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRLCIVQLS---------PGDG-TVDIIRIAAGQKN--------------APN-------LVG 68 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l~LiQl~---------~~~~-~~~l~~~~~~~~~--------------~~~-------L~~ 68 (207) --.+++|+||+|++-..+++ |-++ +++. .+|+.|-.....+ .|. +.. T Consensus 7 ~t~t~~d~e~tgl~~~~~~I--I~IGAv~~i~rr~~~~~~~~~~~P~R~~~~~~~k~hGItDd~L~dkP~F~eia~dF~~ 84 (228) T TIGR00573 7 DTFTTGDNETTGLYAKHDEI--IEIGAVEIINRRIIGNKFHTYIKPDRLIDPDAIKIHGITDDMLKDKPDFKEIAEDFAD 84 (228) T ss_pred EEEEEEECCCCCCCCCCCCE--EEEEEEEEECCCEEECCEEEEECCCCCCCCCEEEECCCCHHHHCCCCCHHHHHHHHHH T ss_conf 42687521106763224304--5453376644614401005776188888964255248882884689851378899999 Q ss_pred HHHCCCCCEEECCCCHHHHHHHHHHHCC-C------CCHHHHHHHHHHHHHH--CCCHHHHHHHHHHHCCCCCCCCEECC Q ss_conf 7732364023010000168886542000-0------0004578988652110--00000887776542002467520036 Q gi|254780636|r 69 MLVDEKREKIFHYGRFDIAVLFYTFGVR-V------RPVFCTKIASRLTRTY--TNQHGLKDNLKELLGINISKAQQSSD 139 (207) Q Consensus 69 ll~d~~i~KV~Hn~~~D~~~L~~~~gi~-~------~~i~DT~ias~l~~~~--~~~~~L~~L~~~~lg~~ldK~~q~Sd 139 (207) ++.+. ++ |+||+.||+.+|...|.-. . .+|+||.=+.+++++. ..+.+|.+|+.+ ++++- T Consensus 85 ~I~g~-~L-v~HNA~FD~GFl~~e~~~LG~~~~~~~~~v~~~~~~~~~~~~~~p~~~~~Ld~L~~~-~~~~~-------- 153 (228) T TIGR00573 85 YIKGA-VL-VIHNASFDVGFLNYEFSKLGLKKEPKTNDVIDTTDTLQAARPEFPGKRNTLDALALR-YEITN-------- 153 (228) T ss_pred HHCCC-EE-EEEHHHCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH-CCCCH-------- T ss_conf 83794-78-752003057789999985145011001144449999999972058843027777878-05440-------- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 56544323689999864599999999999999974 Q gi|254780636|r 140 WSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRL 174 (207) Q Consensus 140 W~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~ 174 (207) |...+.=|..||..|.++|..+...-.+. T Consensus 154 ------s~R~~H~A~~DA~~l~~~~~~~~~~~~~~ 182 (228) T TIGR00573 154 ------SHRALHGALLDAFILAKLYLVMTGKQTKY 182 (228) T ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ------37777317788999999999998612203 No 87 >cd05780 DNA_polB_Kod1_like_exo The 3'-5' exonuclease domain of archaeal family-B DNA polymerases with similarity to Pyrococcus kodakaraensis Kod1, including polymerases from Desulfurococcus (D. Tok Pol) and Thermococcus gorgonarius (Tgo Pol). Kod1, D. Tok Pol, and Tgo Pol are thermostable enzymes that exhibit both polymerase and 3'-5' exonuclease activities. They are family-B DNA polymerases. Their amino termini harbor a DEDDy-type DnaQ-like 3'-5' exonuclease domain that contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members of this subfamily show similarity to eukaryotic DNA polymerases involved in DNA replication. Some archaea possess multiple family-B D Probab=97.76 E-value=0.00014 Score=45.91 Aligned_cols=145 Identities=16% Similarity=0.246 Sum_probs=81.2 Q ss_pred CCCEEEECCCCCC----CCCCCEEEEEEEEECCCCEEEECCCCC-----------CCCCHHHHHHH--HCCCCCEEECCC Q ss_conf 9827997177789----854450799998418942899746577-----------85706776777--323640230100 Q gi|254780636|r 20 VDAIAVDTETLGL----MPRRDRLCIVQLSPGDGTVDIIRIAAG-----------QKNAPNLVGML--VDEKREKIFHYG 82 (207) Q Consensus 20 ~~~iaiDtEt~~l----~~~~~~l~LiQl~~~~~~~~l~~~~~~-----------~~~~~~L~~ll--~d~~i~KV~Hn~ 82 (207) -++++||.|+.+- +|.++++.-|.++...+..++. .... ......+.+++ .||.|+ +|+|+ T Consensus 3 lkilsfDIE~~~~~g~P~p~~d~Ii~Is~~~~~~~~vi~-~~~~~~~~v~~~~tE~eLL~~F~~~i~~~dPDii-~GyN~ 80 (195) T cd05780 3 LKILSFDIEVLNHEGEPNPEKDPIIMISFADEGGNKVIT-WKKFDLPFVEVVKTEKEMIKRFIEIVKEKDPDVI-YTYNG 80 (195) T ss_pred CCEEEEEEEECCCCCCCCCCCCCEEEEEEEECCCCEEEE-ECCCCCCCEEEECCHHHHHHHHHHHHHHCCCCEE-EECCC T ss_conf 429999999758999989888828999999689978998-1798877456738999999999999987299999-95798 Q ss_pred -CHHHHHHHHHHH---CCC----------------------C--CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC- Q ss_conf -001688865420---000----------------------0--00457898865211000000887776542002467- Q gi|254780636|r 83 -RFDIAVLFYTFG---VRV----------------------R--PVFCTKIASRLTRTYTNQHGLKDNLKELLGINISK- 133 (207) Q Consensus 83 -~~D~~~L~~~~g---i~~----------------------~--~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK- 133 (207) .||+..|..... +.. . ..+|.+-..+-- ....+++|.++++++||.+-.. T Consensus 81 ~~FD~pYl~~R~~~~~~~~~~gr~~~~~~~~~~g~~~~~~i~Gr~~lD~~~~~~~~-~~l~sy~L~~Va~~~Lg~~K~d~ 159 (195) T cd05780 81 DNFDFPYLKKRAEKLGIELDLGRDGSEIKIQRGGFNNASEIKGRIHVDLYPVARRT-LNLTRYTLERVYEELFGIEKEDV 159 (195) T ss_pred CCCCHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCEEEEECCEEEEEHHHHHHHH-CCCCCCCHHHHHHHHHCCCCCCC T ss_conf 78768999999999589864578886047997785079987686885549999752-56240159999999839997889 Q ss_pred C-CEECC-CCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5-20036-5654432368999986459999999999 Q gi|254780636|r 134 A-QQSSD-WSADDLSDEQLQYAASDVVHLHALRLQF 167 (207) Q Consensus 134 ~-~q~Sd-W~~rpLs~~Qi~YAA~Dv~~l~~L~~~l 167 (207) . .+..+ |...+=.++-+.|+..||..+++|++++ T Consensus 160 ~~~~i~~~~~~~~~~~~l~~Yn~~D~~L~~~L~~K~ 195 (195) T cd05780 160 PGEEIAEAWDSGENLERLFRYSMEDAKYTYEIGKEF 195 (195) T ss_pred CHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 999999999759753888844599899999998529 No 88 >cd05160 DNA_polB_exo The 3'-5' exonuclease domain of family-B DNA polymerases. This domain has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The exonuclease domain of family B polymerase also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members include E. coli DNA polymerase II, some eubacterial phage DNA polymerases, nuclear replicative DNA polymerases (alpha, delta, epsilon and zeta), and eukaryotic viral and Probab=97.74 E-value=0.00018 Score=45.23 Aligned_cols=142 Identities=18% Similarity=0.188 Sum_probs=76.1 Q ss_pred CEEEECCCCCCC----CCCCEEEEEEEEEC-CCCEEEE----------------------CCCCCCCCCHHHHHHH--HC Q ss_conf 279971777898----54450799998418-9428997----------------------4657785706776777--32 Q gi|254780636|r 22 AIAVDTETLGLM----PRRDRLCIVQLSPG-DGTVDII----------------------RIAAGQKNAPNLVGML--VD 72 (207) Q Consensus 22 ~iaiDtEt~~l~----~~~~~l~LiQl~~~-~~~~~l~----------------------~~~~~~~~~~~L~~ll--~d 72 (207) +++||.||.+-. |-++++.-|+++.. .+..... .+.........+.+++ .| T Consensus 1 vlsfDIE~~~~~gfP~p~~d~Ii~Is~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~E~eLL~~F~~~i~~~d 80 (200) T cd05160 1 VLSFDIETLPNVGGPEPSRDPIICISYADSFQGHKVVFLLKVSTHGDDIEAGFIDGIEVEYFDDEKELLKRFFDIIREYD 80 (200) T ss_pred CEEEEEEECCCCCCCCCCCCEEEEEEEEEECCCEEEEEEEECCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHCC T ss_conf 97999832799997899988089999999359999998864355676765467799699996999999999999999709 Q ss_pred CCCCEEECCC-CHHHHHHHHHH---HCCCCC-----------------------HHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 3640230100-00168886542---000000-----------------------04578988652110000008877765 Q gi|254780636|r 73 EKREKIFHYG-RFDIAVLFYTF---GVRVRP-----------------------VFCTKIASRLTRTYTNQHGLKDNLKE 125 (207) Q Consensus 73 ~~i~KV~Hn~-~~D~~~L~~~~---gi~~~~-----------------------i~DT~ias~l~~~~~~~~~L~~L~~~ 125 (207) |+++ +|+|. .||+..|.... |+.+.+ ++|.+-..+-. ....+++|.+++++ T Consensus 81 PDii-~GyN~~~FD~pyL~~R~~~l~~~~~~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~k~~-~~l~sy~Ld~Va~~ 158 (200) T cd05160 81 PDIL-TGYNIDDFDLPYLLERAKALGIKLDDIGRRSGQEKSSGETERAAVKGRVVFDLLRAVKRD-FKLKSYTLDAVAEE 158 (200) T ss_pred CCEE-EECCCCCCCHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCEEEEEECCEEEEHHHHHHHHH-HCCCCCCHHHHHHH T ss_conf 9999-972677784889999999969885554673454543575059973677860289999975-32466539999999 Q ss_pred HHCCCCCC-C-CEECCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 42002467-5-2003656544323689999864599999999 Q gi|254780636|r 126 LLGINISK-A-QQSSDWSADDLSDEQLQYAASDVVHLHALRL 165 (207) Q Consensus 126 ~lg~~ldK-~-~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~ 165 (207) +||.+-.. . ....+|....=.+.-+.|+..||..+.+|++ T Consensus 159 ~Lg~~K~d~~~~~~~~~~~~~d~~k~~~Y~~~D~~L~~~L~e 200 (200) T cd05160 159 LLGEGKEKVDGEIIEDLEWEEDLERLLEYNLKDAELTLQILE 200 (200) T ss_pred HHCCCCCCCCHHHHHHHHHCCCCHHHHHEEHHHHHHHHHHHC T ss_conf 849997878989999998738914866025999998999759 No 89 >KOG3657 consensus Probab=97.69 E-value=0.00013 Score=46.09 Aligned_cols=98 Identities=17% Similarity=0.201 Sum_probs=66.4 Q ss_pred CCCEEECCCCHHHHHHHHHHHCCC-CC-HHHHHHHH----HHHHHH---------------------------------- Q ss_conf 640230100001688865420000-00-04578988----652110---------------------------------- Q gi|254780636|r 74 KREKIFHYGRFDIAVLFYTFGVRV-RP-VFCTKIAS----RLTRTY---------------------------------- 113 (207) Q Consensus 74 ~i~KV~Hn~~~D~~~L~~~~gi~~-~~-i~DT~ias----~l~~~~---------------------------------- 113 (207) .-+-||||..||-..++..++|.= ++ +.|||=.+ .++ .+ T Consensus 241 e~liVGHNVsfDRaRirEeY~i~~Sk~rFlDTMSlHia~~Gm~-S~Qrplw~ka~k~k~a~~d~~~~ps~~d~~~pWL~~ 319 (1075) T KOG3657 241 EQLIVGHNVSFDRARIREEYNINGSKIRFLDTMSLHIAMSGMC-SRQRPLWFKARKAKSAMYDSETNPSISDYDNPWLGR 319 (1075) T ss_pred CCEEEECCCCCHHHHHHHHHHCCCCCEEEEECHHHHHHHHCCC-CCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 7568733554008999998702665406643112233332133-223316766313145565022487666643566421 Q ss_pred CCCHHHHHHHHHHHCCC-CCCCCEECCCCCCCC------CHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00000887776542002-467520036565443------2368999986459999999999999997 Q gi|254780636|r 114 TNQHGLKDNLKELLGIN-ISKAQQSSDWSADDL------SDEQLQYAASDVVHLHALRLQFTEKLQR 173 (207) Q Consensus 114 ~~~~~L~~L~~~~lg~~-ldK~~q~SdW~~rpL------s~~Qi~YAA~Dv~~l~~L~~~l~~~L~~ 173 (207) +.-.||++..+.+||.. ++|+-+-+ +.+-+. -++-+.|+|.||+.+.++|.++-+...+ T Consensus 320 SS~NSL~dVhk~~c~~~~LdKt~Rd~-Fvs~~~e~Ire~fq~L~~YCA~Dv~aThqVf~~lfP~Fle 385 (1075) T KOG3657 320 SSLNSLVDVHKFHCGIDALDKTPRDS-FVSGTKEQIRENFQPLMNYCARDVIATHQVFFRLFPLFLE 385 (1075) T ss_pred HHHHHHHHHHHHHCCCCCCCCCHHHH-HHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 00577999999617997234463776-6408689999999999998732449999999998499998 No 90 >PRK09182 DNA polymerase III subunit epsilon; Validated Probab=97.57 E-value=0.00017 Score=45.36 Aligned_cols=82 Identities=24% Similarity=0.390 Sum_probs=48.2 Q ss_pred CEEEECCCCCCCCCCCEEE---EEEEE-ECCCCEE-EECC-CC----CCCC---------------------CHHHHHHH Q ss_conf 2799717778985445079---99984-1894289-9746-57----7857---------------------06776777 Q gi|254780636|r 22 AIAVDTETLGLMPRRDRLC---IVQLS-PGDGTVD-IIRI-AA----GQKN---------------------APNLVGML 70 (207) Q Consensus 22 ~iaiDtEt~~l~~~~~~l~---LiQl~-~~~~~~~-l~~~-~~----~~~~---------------------~~~L~~ll 70 (207) .+-+||||||+++..+++- .+-+. ++++..+ +... .. .... ......++ T Consensus 39 g~~lD~ETTGl~~~~d~IIElg~v~f~~~~~G~i~~i~~~~~~~~dP~~pIp~~it~lTGItd~mV~Gq~id~~~V~~~~ 118 (293) T PRK09182 39 GVILDTETTGLDPRKDEIIEIGMVAFEYDDDGRIGDVVDTFGGLQQPSRPIPPEITRLTGITDEMVAGQTIDPAEVDALI 118 (293) T ss_pred EEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEECCEECCCCCCCCCCHHHHHHCCCCHHHHCCCCCCHHHHHHHH T ss_conf 99995305788977877999768999986998587764323110389998998798751888899688827999999986 Q ss_pred HCCCCCEEECCCCHHHHHHHHHHHCCCC-CHHHHH Q ss_conf 3236402301000016888654200000-004578 Q gi|254780636|r 71 VDEKREKIFHYGRFDIAVLFYTFGVRVR-PVFCTK 104 (207) Q Consensus 71 ~d~~i~KV~Hn~~~D~~~L~~~~gi~~~-~i~DT~ 104 (207) .+..++ |.||+.||-.++.+.+..... +.-|++ T Consensus 119 ~~adli-iAHNA~FDR~F~E~~~p~f~~k~WaCS~ 152 (293) T PRK09182 119 APADLI-IAHNAGFDRPFLERFSPVFANKPWACSV 152 (293) T ss_pred CCCCEE-EECCCCCCHHHHHHHCCCCCCCCEEEEC T ss_conf 449899-9817754677898509532587505316 No 91 >cd05779 DNA_polB_epsilon_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase epsilon. DNA polymerase epsilon is a family-B DNA polymerase with a catalytic subunit that contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (alpha and delta are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase epsilon plays a role in elongating the leading strand during DNA replication. It is also involved in DNA repair. The catalytic subunit contains both polymerase and 3'-5' exonuclease activities. The N-terminal exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. DNA polymerase epsilon also carries a unique large C-terminal domain with an unknown function. Phylogenetic analyses indicate that it is ort Probab=97.55 E-value=0.00056 Score=42.38 Aligned_cols=88 Identities=23% Similarity=0.234 Sum_probs=51.5 Q ss_pred HCCCCCEEECCC-CHHHHHHHHHH---HCCCCC-----------------H-HHHHHHHHHH-HH-H--CCCHHHHHHHH Q ss_conf 323640230100-00168886542---000000-----------------0-4578988652-11-0--00000887776 Q gi|254780636|r 71 VDEKREKIFHYG-RFDIAVLFYTF---GVRVRP-----------------V-FCTKIASRLT-RT-Y--TNQHGLKDNLK 124 (207) Q Consensus 71 ~d~~i~KV~Hn~-~~D~~~L~~~~---gi~~~~-----------------i-~DT~ias~l~-~~-~--~~~~~L~~L~~ 124 (207) .+|+++ +|.|+ .||+..|.... |+.... + +|.. +.+ +. + ..+.+|++.++ T Consensus 87 ~~pdii-~gyN~~~FD~pyl~~Ra~~~~~~~~~~~~~~~~~~~~~~~~~~vhiDly---~~v~rd~~l~~~sy~Ln~Vs~ 162 (204) T cd05779 87 VKPHII-VTYNGDFFDWPFVEARAAIHGLSMEEEIGFRKDSEGEYKSRYIIHMDCF---RWVKRDSYLPQGSQGLKAVTK 162 (204) T ss_pred CCCCEE-EECCCCCCCHHHHHHHHHHHCCCHHHHCCCCCCCCCCEECCCCEEEECH---HHHHHHCCCCCCCCCHHHHHH T ss_conf 499999-8558767767999999999599825531800067772762672563007---765101217556877899999 Q ss_pred HHHCCCCCCC--CEECC-CCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 5420024675--20036-5654432368999986459999999 Q gi|254780636|r 125 ELLGINISKA--QQSSD-WSADDLSDEQLQYAASDVVHLHALR 164 (207) Q Consensus 125 ~~lg~~ldK~--~q~Sd-W~~rpLs~~Qi~YAA~Dv~~l~~L~ 164 (207) ++||.+-.+- ...+. |...| +.-..|+..||..|+.|| T Consensus 163 ~~Lg~~K~dv~~~~m~~l~~~~~--~~la~Y~~~Da~~Ty~l~ 203 (204) T cd05779 163 AKLGYDPVELDPEDMVPLAREDP--QTLASYSVSDAVATYYLY 203 (204) T ss_pred HHHCCCCCCCCHHHHHHHHCCCH--HHHHHCCHHHHHHHHHHH T ss_conf 99689975089999999982891--555524099999999976 No 92 >pfam04857 CAF1 CAF1 family ribonuclease. The major pathways of mRNA turnover in eukaryotes initiate with shortening of the polyA tail. CAF1 encodes a critical component of the major cytoplasmic deadenylase in yeast. Both Caf1p is required for normal mRNA deadenylation in vivo and localizes to the cytoplasm. Caf1p copurifies with a Ccr4p-dependent polyA-specific exonuclease activity. Some members of this family include and inserted RNA binding domain pfam01424. This family of proteins is related to other exonucleases pfam00929 (Bateman A pers. obs.). The crystal structure of Saccharomyces cerevisiae Pop2 has been resolved at 2.3 Angstrom#resolution. Probab=97.48 E-value=0.0006 Score=42.19 Aligned_cols=113 Identities=25% Similarity=0.340 Sum_probs=65.0 Q ss_pred HCCCCCCEEEECCCCCCCCCC-------------------CEEEEEEEE----ECCCC------EEEECC-----CCCCC Q ss_conf 424898279971777898544-------------------507999984----18942------899746-----57785 Q gi|254780636|r 16 AARYVDAIAVDTETLGLMPRR-------------------DRLCIVQLS----PGDGT------VDIIRI-----AAGQK 61 (207) Q Consensus 16 ~l~~~~~iaiDtEt~~l~~~~-------------------~~l~LiQl~----~~~~~------~~l~~~-----~~~~~ 61 (207) .....+.||+|||++|+.... +.+.+||++ .+++. ++-+.+ ..... T Consensus 18 ~i~~~~fVaiDtEFpG~v~~p~~~~~~t~~~~Y~~lk~nVd~l~iiQlGlt~~~~~g~~p~~~~~wqfNf~~F~~~~d~~ 97 (235) T pfam04857 18 AIDSYDFIAIDTEFPGVVARPIGSFRSTSDYRYQALRKNVDRLKIIQLGLTLFDEKGNLPDSYYTWQFNFSLFNLEEDFY 97 (235) T ss_pred HHHCCCEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCC T ss_conf 98319989995142752468988888987899999984300123146898887468878987437850155679732211 Q ss_pred CCH-----------------------HH------HHHHHC-CCCCEEECCCCHHHHHHHHHH-H-CC------------- Q ss_conf 706-----------------------77------677732-364023010000168886542-0-00------------- Q gi|254780636|r 62 NAP-----------------------NL------VGMLVD-EKREKIFHYGRFDIAVLFYTF-G-VR------------- 96 (207) Q Consensus 62 ~~~-----------------------~L------~~ll~d-~~i~KV~Hn~~~D~~~L~~~~-g-i~------------- 96 (207) .+. .+ ..++.+ .++.||+||+-+|+..|.+.+ | -. T Consensus 98 ~~~Si~fL~~~G~DF~~~~~~GI~~~~f~e~l~~S~lv~~~~~~~wv~f~g~yD~~yl~k~l~g~~LP~~~~eF~~~l~~ 177 (235) T pfam04857 98 APSSIEFLAKQGFDFNKHRREGIPYLRFAELLGSSGLFLNISSVTWVTFHGLYDFGYLLKLLTGGPLPETLDDFLELLRE 177 (235) T ss_pred CHHHHHHHHHCCCCHHHHHHHCCCHHHHHHHHHHHHHEECCCCCEEEEECCHHHHHHHHHHHCCCCCCCCHHHHHHHHHH T ss_conf 68899999985988899987099989999899870302057998799817277899999997499899989999999999 Q ss_pred -CCCHHHHHHHHHHHHHH--CCCHHHHHHHHHHHCC Q ss_conf -00004578988652110--0000088777654200 Q gi|254780636|r 97 -VRPVFCTKIASRLTRTY--TNQHGLKDNLKELLGI 129 (207) Q Consensus 97 -~~~i~DT~ias~l~~~~--~~~~~L~~L~~~~lg~ 129 (207) ...++||...++.+.+. ..+.||..+++. ||+ T Consensus 178 ~FP~vyD~K~la~~~~~~~~~~~~gL~~lA~~-L~v 212 (235) T pfam04857 178 LFPRVYDTKYLAKFCFELDGGSNGGLQELADL-LGV 212 (235) T ss_pred HCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHH-CCC T ss_conf 77717509999985221358755789999997-499 No 93 >cd05785 DNA_polB_like2_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=97.42 E-value=0.0031 Score=38.09 Aligned_cols=142 Identities=25% Similarity=0.277 Sum_probs=76.4 Q ss_pred CCCCCCEEEECCCCCCCCC--------CCEEEEEEEEECCCCEEEECCCCCCCCCHHHHH---HH--HCCCCCEEECCC- Q ss_conf 2489827997177789854--------450799998418942899746577857067767---77--323640230100- Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPR--------RDRLCIVQLSPGDGTVDIIRIAAGQKNAPNLVG---ML--VDEKREKIFHYG- 82 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~--------~~~l~LiQl~~~~~~~~l~~~~~~~~~~~~L~~---ll--~d~~i~KV~Hn~- 82 (207) ..+-.+++||.||++.+.. .+++..|.+....+.-.++.... .....-|.. ++ .||+|+ +|+|. T Consensus 6 ~~~lr~lsfDIE~~~~~g~~~p~p~~~~d~Ii~I~~~~~~~~~~v~~~~~-~~E~~lL~~F~~~i~~~dPDII-tGyNi~ 83 (207) T cd05785 6 FDDLRRLQLDIETYSLPGFFFSNPDRGDDRIIIVALRDNRGWEEVLHAED-AAEKELLEELVAIIRERDPDVI-EGHNIF 83 (207) T ss_pred HHHEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEECCCCEEEEEECC-CCHHHHHHHHHHHHHHCCCCEE-EECCCC T ss_conf 20405999999747889977999888999789999885899779876269-9999999999999987399999-867987 Q ss_pred CHHHHHHHHHHH---CC------------------------------CCC--HHHHHHHHHHHH--H-HCCCHHHHHHHH Q ss_conf 001688865420---00------------------------------000--045789886521--1-000000887776 Q gi|254780636|r 83 RFDIAVLFYTFG---VR------------------------------VRP--VFCTKIASRLTR--T-YTNQHGLKDNLK 124 (207) Q Consensus 83 ~~D~~~L~~~~g---i~------------------------------~~~--i~DT~ias~l~~--~-~~~~~~L~~L~~ 124 (207) .||+..|.+... +. +.+ ++|.+-+.+... . ...+++|..+++ T Consensus 84 ~FD~pYL~~Ra~~~~i~~~lgR~~~~~~~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~k~~~~~~~~L~SY~L~~Va~ 163 (207) T cd05785 84 RFDLPYLRRRCRRHGVPLAIGRDGSIPRQRPSRFRFAERLIDYPRYDIPGRHVIDTYFLVQLFDVSSRDLPSYGLKAVAK 163 (207) T ss_pred CCCHHHHHHHHHHHCCCCCCCCCCCCCEEECCEEEECCCCCCEEEEEECCEEEEEHHHHHHHHCCCCCCCCCCCHHHHHH T ss_conf 75889999999995997222578987446403021223555304998736488654999986421114677652899999 Q ss_pred HHHCCC-CCCCC----EECC-CCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 542002-46752----0036-565443236899998645999999 Q gi|254780636|r 125 ELLGIN-ISKAQ----QSSD-WSADDLSDEQLQYAASDVVHLHAL 163 (207) Q Consensus 125 ~~lg~~-ldK~~----q~Sd-W~~rpLs~~Qi~YAA~Dv~~l~~L 163 (207) ++ |.. -+|.. +..+ |...+ ++-+.|+..||..+..| T Consensus 164 ~~-~l~~~~k~~~~~~~i~~~~~~d~--~~l~~Yni~Dv~~t~~L 205 (207) T cd05785 164 HF-GLASPDRTYIDGRQIAEVWRSDP--ARLLAYALDDVRETEGL 205 (207) T ss_pred HH-HHCCCCCCCCCHHHHHHHHHCCH--HHHHHHHHHHHHHHHHH T ss_conf 87-31366656388999999995298--99999829999999986 No 94 >PRK05762 DNA polymerase II; Reviewed Probab=97.37 E-value=0.0021 Score=38.98 Aligned_cols=139 Identities=15% Similarity=0.106 Sum_probs=75.5 Q ss_pred CCEEEECCCCCCCCCCCEEEEEEEEE--CCCCE-EEE----------CCCCCCCCCHHHHHHH--HCCCCCEEECCC-CH Q ss_conf 82799717778985445079999841--89428-997----------4657785706776777--323640230100-00 Q gi|254780636|r 21 DAIAVDTETLGLMPRRDRLCIVQLSP--GDGTV-DII----------RIAAGQKNAPNLVGML--VDEKREKIFHYG-RF 84 (207) Q Consensus 21 ~~iaiDtEt~~l~~~~~~l~LiQl~~--~~~~~-~l~----------~~~~~~~~~~~L~~ll--~d~~i~KV~Hn~-~~ 84 (207) .+++||.|+..-+ + ++|+|. .++.. +.+ -+.........+..++ .||+|+ +|+|. .| T Consensus 156 rvlSfDIEc~~~g----~--ii~I~l~~~~~~~v~~~g~~~~~~~v~~~~~E~eLL~~F~~~i~~~DPDII-~GyNi~~F 228 (785) T PRK05762 156 KVVSLDIETSNKG----E--LYSIGLEGCGQRQVYMLGEANGDAELEYVADEKALLERFNAWFAEFDPDVI-IGWNVVQF 228 (785) T ss_pred EEEEEEEEECCCC----C--EEEEEECCCCCCEEEEECCCCCCCEEEEECCHHHHHHHHHHHHHHHCCCEE-EECCCCCC T ss_conf 6999998977999----8--899995179984689966889985389909999999999999998599989-96274787 Q ss_pred HHHHHHHHH---HCC-------------------------CCC--HHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC--C Q ss_conf 168886542---000-------------------------000--04578988652110000008877765420024--6 Q gi|254780636|r 85 DIAVLFYTF---GVR-------------------------VRP--VFCTKIASRLTRTYTNQHGLKDNLKELLGINI--S 132 (207) Q Consensus 85 D~~~L~~~~---gi~-------------------------~~~--i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~l--d 132 (207) |+..|.... |+. +.+ ++|++-+.+-.--...+++|.++++++||..- . T Consensus 229 DlpyL~~Ra~~~gi~l~lgR~~~~~~~~~~~~~~~~~~~~i~GRv~iD~~~~~k~~~~~l~sYsL~~Va~~~Lge~K~~~ 308 (785) T PRK05762 229 DLRLLQERAERYGIPLRLGRGGSELSWRVHGHKRGHFFASVPGRLVLDGIDALKSAFYEFKSFSLENVAQELLGEGKAID 308 (785) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCCEECCCCCCCEEEEEEEEEEEEEEHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCC T ss_conf 77999999999499854687888653331565566059998428999709999875203676629999999854455556 Q ss_pred CCC-E---E-CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 752-0---0-3656544323689999864599999999999 Q gi|254780636|r 133 KAQ-Q---S-SDWSADDLSDEQLQYAASDVVHLHALRLQFT 168 (207) Q Consensus 133 K~~-q---~-SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~ 168 (207) +.. + . .-|...+ +.-..|+..||..+.+|++++. T Consensus 309 ~~~~r~~eI~~~~~~~r--~~l~~Y~l~Da~L~~~L~~kl~ 347 (785) T PRK05762 309 NPYDRMDEIDRYFADDK--PALAKYNLKDCELVTRIFEKTE 347 (785) T ss_pred CHHHHHHHHHHHHHCCH--HHHHHHHHHHHHHHHHHHHHHH T ss_conf 66666888899885468--9999999999999999999870 No 95 >KOG1275 consensus Probab=97.24 E-value=0.00045 Score=42.93 Aligned_cols=95 Identities=21% Similarity=0.118 Sum_probs=63.1 Q ss_pred HHHHCCCCCEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCH Q ss_conf 77732364023010000168886542000000045789886521100000088777654200246752003656544323 Q gi|254780636|r 68 GMLVDEKREKIFHYGRFDIAVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSD 147 (207) Q Consensus 68 ~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~ 147 (207) .+|-+-.++.|||+...|.+++-- .+...-++||-+.-.+ +-....||+.|+.++||.+| |... T Consensus 1008 ~~Li~~GviFVGHGL~nDFrvINi--~Vp~~QiiDTv~lf~~--~s~R~LSLrfLa~~lLg~~I----Q~~~-------- 1071 (1118) T KOG1275 1008 RLLIQRGVIFVGHGLQNDFRVINI--HVPEEQIIDTVTLFRL--GSQRMLSLRFLAWELLGETI----QMEA-------- 1071 (1118) T ss_pred HHHHHCCCEEECCCCCCCCEEEEE--ECCHHHHEEEEEEEEC--CCCCEEEHHHHHHHHHCCHH----HCCC-------- T ss_conf 999973828972244666159998--3484560103577861--66627877999999814132----0134-------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 68999986459999999999999997491899999996 Q gi|254780636|r 148 EQLQYAASDVVHLHALRLQFTEKLQRLGRSDLATSCCN 185 (207) Q Consensus 148 ~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~~~~~e~~ 185 (207) +=...||.+.++||++..+ |++++. ++.+++ T Consensus 1072 ---HDSIeDA~taLkLYk~Yl~-lkeq~~---~~~~l~ 1102 (1118) T KOG1275 1072 ---HDSIEDARTALKLYKKYLK-LKEQGK---LESELR 1102 (1118) T ss_pred ---CCCHHHHHHHHHHHHHHHH-HHHHHH---HHHHHH T ss_conf ---5537879999999999998-887668---999987 No 96 >cd05784 DNA_polB_II_exo The 3'-5' exonuclease domain of Escherichia coli DNA polymerase II (PolII) and similar bacterial proteins. PolII is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain has a fundamental role in the proofreading activity of polII. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. PolII is involved in a variety of cellular activities, such as the repair of DNA damaged by UV irradiation or oxidation. It plays a pivotal role in replication-restart, a process that bypasses DNA damage in a Probab=97.08 E-value=0.0019 Score=39.28 Aligned_cols=139 Identities=17% Similarity=0.096 Sum_probs=72.2 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEEECC-CCEEEEC------------CCCCCCCCHHHHHHHH--CCCCCEEECCC-C Q ss_conf 98279971777898544507999984189-4289974------------6577857067767773--23640230100-0 Q gi|254780636|r 20 VDAIAVDTETLGLMPRRDRLCIVQLSPGD-GTVDIIR------------IAAGQKNAPNLVGMLV--DEKREKIFHYG-R 83 (207) Q Consensus 20 ~~~iaiDtEt~~l~~~~~~l~LiQl~~~~-~~~~l~~------------~~~~~~~~~~L~~ll~--d~~i~KV~Hn~-~ 83 (207) -++++||.||..-. ++.-|.+.... ..++.+. +.........+.+++. ||+|+ +|+|. . T Consensus 3 lki~s~DIE~~~~~----~i~~I~~~~~~~~~v~~~~~~~~~~~~~v~~~~sE~eLL~~F~~~i~~~dPDiI-tG~N~~~ 77 (193) T cd05784 3 LKVVSLDIETSMDG----ELYSIGLYGEGQERVLMVGDPEDDAPDNIEWFADEKSLLLALIAWFAQYDPDII-IGWNVIN 77 (193) T ss_pred CEEEEEEECCCCCC----CEEEEEEECCCCCEEEEEECCCCCCCCEEEEECCHHHHHHHHHHHHHHCCCCEE-EECCCCC T ss_conf 30999996327988----888999986999989999789888997799989999999999999986299999-9779877 Q ss_pred HHHHHHHHHHH---CC------------------------CCC--HHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC-C Q ss_conf 01688865420---00------------------------000--045789886521100000088777654200246-7 Q gi|254780636|r 84 FDIAVLFYTFG---VR------------------------VRP--VFCTKIASRLTRTYTNQHGLKDNLKELLGINIS-K 133 (207) Q Consensus 84 ~D~~~L~~~~g---i~------------------------~~~--i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ld-K 133 (207) ||+..|.+... +. +.+ ++|++-+-+-..-...+++|.+.++.+||.+-. - T Consensus 78 FDiPYL~~R~~~~~i~~~~~r~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~~~~~~~l~sy~Ld~Va~~~LGe~K~~~ 157 (193) T cd05784 78 FDLRLLQRRAEAHGLPLRLGRGGSPLNWRQSGKPGQGFLSLPGRVVLDGIDALKTATYHFESFSLENVAQELLGEGKLIH 157 (193) T ss_pred CCHHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCEEEEEECCEEEECHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCC T ss_conf 76789999999858864336578850598768562699997887841729999760735114348889999838775445 Q ss_pred CC--EEC----CCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 52--003----656544323689999864599999999 Q gi|254780636|r 134 AQ--QSS----DWSADDLSDEQLQYAASDVVHLHALRL 165 (207) Q Consensus 134 ~~--q~S----dW~~rpLs~~Qi~YAA~Dv~~l~~L~~ 165 (207) .. ..+ -|...+ ++-+.|+..||....+|++ T Consensus 158 ~~~~~~~~i~~l~~~d~--~k~~~Yni~D~~Lv~rLf~ 193 (193) T cd05784 158 DVDDRGAEIERLFREDK--LALARYNLQDCELVWRIFE 193 (193) T ss_pred CCCCCHHHHHHHHHCCC--HHHEEEHHHHHHHHHHHHC T ss_conf 65555888999886590--3610320277999999749 No 97 >cd05782 DNA_polB_like1_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=97.01 E-value=0.011 Score=34.84 Aligned_cols=97 Identities=23% Similarity=0.287 Sum_probs=57.0 Q ss_pred HHHHHHHCCCCCEEECCC-CHHHHHHHHHH---HCCCCC------------------HHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 776777323640230100-00168886542---000000------------------04578988652110000008877 Q gi|254780636|r 65 NLVGMLVDEKREKIFHYG-RFDIAVLFYTF---GVRVRP------------------VFCTKIASRLTRTYTNQHGLKDN 122 (207) Q Consensus 65 ~L~~ll~d~~i~KV~Hn~-~~D~~~L~~~~---gi~~~~------------------i~DT~ias~l~~~~~~~~~L~~L 122 (207) .+.+.++..+-.-|+||+ .||+.+|.+.. |+.+.. -+|||-.-..- +...+.+|..+ T Consensus 84 ~F~~~i~~~~p~lv~~Ng~~FD~P~l~~Ral~~gi~~p~~~~~~~~~~~y~~r~~~~H~Dl~d~l~~~-~~~~~~sLd~l 162 (208) T cd05782 84 DFFQLIEKKNPRLVSFNGRGFDLPVLHLRALIHGVSAPAYFDLGNKDWNYRNRYSERHLDLMDLLAFY-GARARASLDLL 162 (208) T ss_pred HHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHCCCCCCHHHHCCCCCCCCCCCCCCCCCCHHHHHHCC-CCCCCCCHHHH T ss_conf 99999986599799636875758999999999198975554048874112488876313299998554-87577889999 Q ss_pred HHHHHCCCCCCCC----EECC-CCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 7654200246752----0036-56544323689999864599999999 Q gi|254780636|r 123 LKELLGINISKAQ----QSSD-WSADDLSDEQLQYAASDVVHLHALRL 165 (207) Q Consensus 123 ~~~~lg~~ldK~~----q~Sd-W~~rpLs~~Qi~YAA~Dv~~l~~L~~ 165 (207) + +.||+. .|.- +..+ |..-.+ ++-..|+..||.-|..+|. T Consensus 163 a-~llG~p-gK~~~~Gs~V~~~~~~g~l-~~I~~YCe~DVl~T~~v~l 207 (208) T cd05782 163 A-KLLGIP-GKMDVDGSQVWELYAEGKL-DEIAEYCETDVLNTYLLYL 207 (208) T ss_pred H-HHHCCC-CCCCCCHHHHHHHHHCCCH-HHHHHHHHHHHHHHHHHHH T ss_conf 9-983899-7578888999999981998-8999999999999999980 No 98 >KOG2248 consensus Probab=96.74 E-value=0.0021 Score=39.00 Aligned_cols=83 Identities=18% Similarity=0.131 Sum_probs=51.5 Q ss_pred HHHHHHCCCCCEEECCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHH-HCCCHHHHHHHHHHHCCCCCCCCEECCCCCCC Q ss_conf 76777323640230100001688865420000000457898865211-00000088777654200246752003656544 Q gi|254780636|r 66 LVGMLVDEKREKIFHYGRFDIAVLFYTFGVRVRPVFCTKIASRLTRT-YTNQHGLKDNLKELLGINISKAQQSSDWSADD 144 (207) Q Consensus 66 L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~~~~i~DT~ias~l~~~-~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rp 144 (207) |.+++....|+ |||+.-.|+..|+.. -..+.||.+.-..-++ +..+.+|+.|++.|+|..+. . .=.. T Consensus 287 l~~~~~~~TIL-VGHSLenDL~aLKl~----H~~ViDTa~lf~~~~g~~~~k~sLk~L~~~~L~~~Iq-~----~~~~-- 354 (380) T KOG2248 287 LLELISKNTIL-VGHSLENDLKALKLD----HPSVIDTAVLFKHPTGPYPFKSSLKNLAKSYLGKLIQ-E----GVGG-- 354 (380) T ss_pred HHHHCCCCCEE-EEECHHHHHHHHHHH----CCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHH-C----CCCC-- T ss_conf 99646867179-951003078897642----7742200588864889854259999999999889986-1----6778-- Q ss_pred CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 3236899998645999999999 Q gi|254780636|r 145 LSDEQLQYAASDVVHLHALRLQ 166 (207) Q Consensus 145 Ls~~Qi~YAA~Dv~~l~~L~~~ 166 (207) +=++.|+....+|... T Consensus 355 ------HdS~eDA~acm~Lv~~ 370 (380) T KOG2248 355 ------HDSVEDALACMKLVKL 370 (380) T ss_pred ------CCCHHHHHHHHHHHHH T ss_conf ------7608889999999999 No 99 >cd05777 DNA_polB_delta_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase delta. DNA polymerase delta is a family-B DNA polymerase with a catalytic subunit that contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (alpha and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase delta is the enzyme responsible for both elongation and maturation of Okazaki fragments on the lagging strand. It is also implicated in mismatch repair (MMR) and base excision repair (BER). The catalytic subunit displays both polymerase and 3'-5' exonuclease activities. The exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues necessary for metal binding and catalysis. The exonuclease domain of family B polyme Probab=96.70 E-value=0.0097 Score=35.17 Aligned_cols=104 Identities=13% Similarity=0.014 Sum_probs=59.6 Q ss_pred HHHHH--HCCCCCEEECCC-CHHHHHHHHHHH---CC----------------------------------CCC--HHHH Q ss_conf 76777--323640230100-001688865420---00----------------------------------000--0457 Q gi|254780636|r 66 LVGML--VDEKREKIFHYG-RFDIAVLFYTFG---VR----------------------------------VRP--VFCT 103 (207) Q Consensus 66 L~~ll--~d~~i~KV~Hn~-~~D~~~L~~~~g---i~----------------------------------~~~--i~DT 103 (207) +.+++ .||+|+ +|.|. .||+..|..... +. ..+ ++|. T Consensus 78 F~~~i~~~dPDii-tGyN~~~FD~pYL~~Ra~~l~~~~~~~lgr~~~~~~~~~~~~~~s~~~g~~~~~~~~i~Gr~~lD~ 156 (230) T cd05777 78 WRDFVQEVDPDII-TGYNICNFDLPYLLERAKALKLNTFPFLGRIKNIKSTIKDTTFSSKQMGTRETKEINIEGRIQFDL 156 (230) T ss_pred HHHHHHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCCHHHHCCCCCCCCEEECCCCCCCCCCCCEEEEEEECCEEHHHH T ss_conf 9999998099999-963666788789999999968861454176678763574165222312640026998758759999 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCCCCCC-C-CEECC-CCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 898865211000000887776542002467-5-20036-56544-323689999864599999999999999 Q gi|254780636|r 104 KIASRLTRTYTNQHGLKDNLKELLGINISK-A-QQSSD-WSADD-LSDEQLQYAASDVVHLHALRLQFTEKL 171 (207) Q Consensus 104 ~ias~l~~~~~~~~~L~~L~~~~lg~~ldK-~-~q~Sd-W~~rp-Ls~~Qi~YAA~Dv~~l~~L~~~l~~~L 171 (207) +-.-+- .....+++|.++++++||.+-.. . ...++ |...| -...-+.|+..||..+.+|.++|.-.+ T Consensus 157 ~~~~r~-~~~l~sY~L~~Va~~~Lg~~K~d~~~~~i~~~~~~~~~~~~~~~~Y~i~D~~L~~~L~~Kl~~~~ 227 (230) T cd05777 157 LQVIQR-DYKLRSYSLNSVSAHFLGEQKEDVHYSIITDLQNGNPETRRRLAVYCLKDAYLPLRLLDKLMCLV 227 (230) T ss_pred HHHHHH-HCCHHHCCHHHHHHHHHCCCCCCCCHHHHHHHHHCCHHHHHHHHEECHHHHHHHHHHHHHHHHHH T ss_conf 999997-50163364999999983899887999999999947964645600642687799999999987774 No 100 >KOG2405 consensus Probab=96.66 E-value=4.3e-05 Score=48.87 Aligned_cols=88 Identities=24% Similarity=0.315 Sum_probs=63.8 Q ss_pred CCCCEEEECCCCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCCC--HHHHHHHHCCCCCEEECCCCHHHHHHHHHHHCC Q ss_conf 898279971777898544507999984189428997465778570--677677732364023010000168886542000 Q gi|254780636|r 19 YVDAIAVDTETLGLMPRRDRLCIVQLSPGDGTVDIIRIAAGQKNA--PNLVGMLVDEKREKIFHYGRFDIAVLFYTFGVR 96 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~~~l~LiQl~~~~~~~~l~~~~~~~~~~--~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~gi~ 96 (207) ..++. +--|...... .+++|.+|.++ +..+|++++...+.-+ ..+..++++.+|.|+.|+++.=..++.+.+||. T Consensus 58 ~qn~~-~~~e~a~~~~-~~~l~~~q~~~-~~~~yl~~i~~~~~~~~~n~~q~~~~~k~i~~~~~d~~~~~~~~~~~~~i~ 134 (458) T KOG2405 58 KQNVL-VAAEGANVCR-HGKLCWLQVAT-NCRVYLFDIFLLGSRAFHNGLQMILEDKRILKVIHDCRWLSDCLSHQYGIL 134 (458) T ss_pred HHCCC-CCCCCCCCCC-CCCCHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 61676-5566754025-67633588888-899999999986159999899998604667655799999998740302244 Q ss_pred CCCHHHHHHHHHH Q ss_conf 0000457898865 Q gi|254780636|r 97 VRPVFCTKIASRL 109 (207) Q Consensus 97 ~~~i~DT~ias~l 109 (207) .+|+|||+.|-.+ T Consensus 135 ~n~v~~~q~~d~~ 147 (458) T KOG2405 135 LNNVFDTQVADVL 147 (458) T ss_pred ECCHHHHHHHHHH T ss_conf 3320333320665 No 101 >PRK11779 sbcB exonuclease I; Provisional Probab=96.51 E-value=0.058 Score=30.68 Aligned_cols=145 Identities=21% Similarity=0.153 Sum_probs=75.1 Q ss_pred CCCCEEEECCCCCCCCCCCEEEEEEEEEC--CCCE--------EEECCCCCC-----------CC--------------C Q ss_conf 89827997177789854450799998418--9428--------997465778-----------57--------------0 Q gi|254780636|r 19 YVDAIAVDTETLGLMPRRDRLCIVQLSPG--DGTV--------DIIRIAAGQ-----------KN--------------A 63 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~~~l~LiQl~~~--~~~~--------~l~~~~~~~-----------~~--------------~ 63 (207) ....+--|+||+|++|.++++ +|+|.- +..+ ..-.+..+. .. . T Consensus 6 ~~~f~fyD~ETtG~~~~~dqi--~Qfa~i~td~~ln~i~~~~~~~c~~~~~~lP~p~A~lvt~itp~~~~~~~~~~~~~~ 83 (477) T PRK11779 6 QPTFLWHDYETFGANPALDRP--AQFAGIRTDADLNIIGEPLVFYCKPADDYLPSPEAVLITGITPQEALEKGLPEAEFA 83 (477) T ss_pred CCCEEEEECCCCCCCCCCCCC--CEEEEEEECCCCCCCCCCCEEEECCCCCCCCCCCHHHCCCCCHHHHHHCCCCHHHHH T ss_conf 785799945368999877650--125778987753725774105644799989993043114889999986699879999 Q ss_pred HHHHHHHHCCCCCEEECC-CCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHC-----CC--C-CCC Q ss_conf 677677732364023010-00016888654200000004578988652110000008877765420-----02--4-675 Q gi|254780636|r 64 PNLVGMLVDEKREKIFHY-GRFDIAVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLG-----IN--I-SKA 134 (207) Q Consensus 64 ~~L~~ll~d~~i~KV~Hn-~~~D~~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg-----~~--l-dK~ 134 (207) ..+...|..|...-||+| .+||=.++++.| -+|++|--+-+.- +| ..|+-+-.++.-... +. + +|+ T Consensus 84 ~~i~~~~~~~~t~~iGyNsi~FDee~~R~~f---y~nl~dPY~~~~~-ng-NsR~Dll~~~r~~~~~~p~~i~~p~~~~G 158 (477) T PRK11779 84 ARIHAEFSQPGTCILGYNNIRFDDEVTRYIF---YRNFYDPYAREWQ-NG-NSRWDLLDVVRACYALRPEGINWPENEDG 158 (477) T ss_pred HHHHHHHCCCCCEEEECCCCCCCHHHHHHHH---HHHCCCHHHHHCC-CC-CCHHHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 9999996689848982277544899999999---9861345766424-89-70579999999998748366116737999 Q ss_pred C---EECCCC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2---003656-54432368999986459999999999999 Q gi|254780636|r 135 Q---QSSDWS-ADDLSDEQLQYAASDVVHLHALRLQFTEK 170 (207) Q Consensus 135 ~---q~SdW~-~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~ 170 (207) . ...+.+ .-.+..+.-+=|..||..+++|-..+++. T Consensus 159 ~~sfKLe~la~aNgi~h~~AHDAl~Dv~aTi~laklIk~k 198 (477) T PRK11779 159 LPSFKLEHLTKANGIEHENAHDAMSDVYATIAMAKLIKQK 198 (477) T ss_pred CEEEEHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 8544499999855987655410688999999999999876 No 102 >PTZ00166 DNA polymerase delta catalytic subunit; Provisional Probab=96.46 E-value=0.031 Score=32.26 Aligned_cols=150 Identities=15% Similarity=0.088 Sum_probs=78.1 Q ss_pred CCCCEEEECCCCCCC----C--CCCEEEEEEEEE-----CCC-CE---EEECCCCCC--------------CCCHHHHHH Q ss_conf 898279971777898----5--445079999841-----894-28---997465778--------------570677677 Q gi|254780636|r 19 YVDAIAVDTETLGLM----P--RRDRLCIVQLSP-----GDG-TV---DIIRIAAGQ--------------KNAPNLVGM 69 (207) Q Consensus 19 ~~~~iaiDtEt~~l~----~--~~~~l~LiQl~~-----~~~-~~---~l~~~~~~~--------------~~~~~L~~l 69 (207) .-.+++||.|-.|.. | -.++ .||||. ++. .. .++-+.... .....+..+ T Consensus 289 PlrilSfDIEC~~~~G~gFP~~~~Dp--VIqIs~~~~~~g~~~~~~~~~~ftl~~C~~I~g~~V~~F~tE~eLL~aF~~f 366 (1081) T PTZ00166 289 PIRILSFDIECIKLDGKGFPEANNDP--VIQISSVLHTHGDPLDSCKNFIFTLKECASLAGANVLWFDNEKTLLLAWNDF 366 (1081) T ss_pred CCEEEEEEEEECCCCCCCCCCCCCCC--EEEEEEEEEECCCCCCCCCCEEEECCCCCCCCCCEEEEECCHHHHHHHHHHH T ss_conf 81699986784588899898999995--7998579881478887646446674787788997799709999999999999 Q ss_pred H--HCCCCCEEECCC-CHHHHHHHHHHHC---C----------------------------------CCC--HHHHHHHH Q ss_conf 7--323640230100-0016888654200---0----------------------------------000--04578988 Q gi|254780636|r 70 L--VDEKREKIFHYG-RFDIAVLFYTFGV---R----------------------------------VRP--VFCTKIAS 107 (207) Q Consensus 70 l--~d~~i~KV~Hn~-~~D~~~L~~~~gi---~----------------------------------~~~--i~DT~ias 107 (207) + .||+|+ +|+|+ .||+..|...... . +.+ ++|.+... T Consensus 367 i~~~DPDII-TGYNI~nFDlpYLl~Ra~~L~i~~f~~lgRlk~~~~~~~~~~~~s~a~G~~~~k~i~i~GRv~~Dl~~~v 445 (1081) T PTZ00166 367 IIAVDPDFL-TGYNIINFDLPYLLNRATALNLKKFKFLTRIKSVKSTVKDSIFSSNQFGTHENKEINIEGRILFDVYDLI 445 (1081) T ss_pred HHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCEEEEECCEEEEEHHHHH T ss_conf 987099899-9668788888999999999588643323611577652202656655556665506867578996469999 Q ss_pred HHHHHHCCCHHHHHHHHHHHCCCCCC-CC-EECC-CCCCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 65211000000887776542002467-52-0036-565443-236899998645999999999999999 Q gi|254780636|r 108 RLTRTYTNQHGLKDNLKELLGINISK-AQ-QSSD-WSADDL-SDEQLQYAASDVVHLHALRLQFTEKLQ 172 (207) Q Consensus 108 ~l~~~~~~~~~L~~L~~~~lg~~ldK-~~-q~Sd-W~~rpL-s~~Qi~YAA~Dv~~l~~L~~~l~~~L~ 172 (207) +- .-...+++|.++++++||.+-+- .. ...+ |...|= ...-..|+..|+...+.|.++|.-... T Consensus 446 ~r-~~kL~SYkLntVs~~fLgeqKeDV~~~~I~~l~~~~~e~r~rla~YClkDa~L~lrL~~KL~~l~n 513 (1081) T PTZ00166 446 RR-DYKLKSYSLNYVSFEFLKEQKEDVHYSTINKLQNGNPEDRKRIASYCLKDSILPLRLIDKLLLLYN 513 (1081) T ss_pred HH-HCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87-558785768889999818875337999999998149378899999989989999999999989999 No 103 >cd05776 DNA_polB_alpha_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase alpha. DNA polymerase alpha is a family-B DNA polymerase with a catalytic subunit that contains a DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (delta and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase alpha is almost exclusively required for the initiation of DNA replication and the priming of Okazaki fragments during elongation. It associates with DNA primase and is the only enzyme able to start DNA synthesis de novo. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. This explains why in most organisms, that no specific repair r Probab=95.21 E-value=0.032 Score=32.20 Aligned_cols=95 Identities=17% Similarity=0.186 Sum_probs=58.7 Q ss_pred HCCCCCEEECCC-CHHHHHHHHHHH---CC----------------------------CCC-HHHHHHHHHHHHHHCCCH Q ss_conf 323640230100-001688865420---00----------------------------000-045789886521100000 Q gi|254780636|r 71 VDEKREKIFHYG-RFDIAVLFYTFG---VR----------------------------VRP-VFCTKIASRLTRTYTNQH 117 (207) Q Consensus 71 ~d~~i~KV~Hn~-~~D~~~L~~~~g---i~----------------------------~~~-i~DT~ias~l~~~~~~~~ 117 (207) .||++. ||||. .||+..|.+... +. .+. ++|+...++=.- ...++ T Consensus 96 ~DPDii-~Ghni~~fdl~~L~~R~~~lki~~wsriGRlkr~~~p~~~~~~~~~~~~~~~GRl~~D~~~~~ke~~-k~~sy 173 (234) T cd05776 96 IDPDVL-VGHDLEGFDLDVLLSRIQELKVPHWSRIGRLKRSVWPKKKGGGKFGERELTAGRLLCDTYLSAKELI-RCKSY 173 (234) T ss_pred CCCCEE-EEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCEEEEECHHHHHHHH-CCCCC T ss_conf 099899-8506377759999999998099853313550014376445656534420352358876099999874-66778 Q ss_pred HHHHHHHHHHCCCC---CCCCEECCCCCC-CCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 08877765420024---675200365654-4323689999864599999999999 Q gi|254780636|r 118 GLKDNLKELLGINI---SKAQQSSDWSAD-DLSDEQLQYAASDVVHLHALRLQFT 168 (207) Q Consensus 118 ~L~~L~~~~lg~~l---dK~~q~SdW~~r-pLs~~Qi~YAA~Dv~~l~~L~~~l~ 168 (207) +|..+++++||.+- +-..-..-|... .| ..-+.|...|+.+.++|..++. T Consensus 174 ~L~~v~~~~L~~~k~dv~~~~i~~~~~~~~~l-~~l~~~~~~Da~l~~~L~~~L~ 227 (234) T cd05776 174 DLTELSQQVLGIERQDIDPEEILNMYNDSESL-LKLLEHTEKDAYLILQLMFKLN 227 (234) T ss_pred CHHHHHHHHHCCCCCCCCHHHHHHHHHCHHHH-HHHHHHHHHHHHHHHHHHHHHC T ss_conf 88999999968676679999999998693999-9999999999999999999958 No 104 >PRK06722 exonuclease; Provisional Probab=94.77 E-value=0.16 Score=28.11 Aligned_cols=46 Identities=11% Similarity=0.067 Sum_probs=29.1 Q ss_pred CCCCCCCHHHHHHHHCCCCCEEECCCCHHHHHHHHHH---HCCCCC--HHHHH Q ss_conf 5778570677677732364023010000168886542---000000--04578 Q gi|254780636|r 57 AAGQKNAPNLVGMLVDEKREKIFHYGRFDIAVLFYTF---GVRVRP--VFCTK 104 (207) Q Consensus 57 ~~~~~~~~~L~~ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~~~~--i~DT~ 104 (207) +......|.+-++..|..+ |.||++||..+|.+.+ |+...+ ..||- T Consensus 36 P~ieeVLp~FleFIGd~~L--VAHNAsFD~gFL~~~C~~hg~e~P~~~~~~~~ 86 (242) T PRK06722 36 EKFPQIIEKFIQFIGEDSI--FVTWGKEDYRFLSHDCTLHGVECPCMEKERRI 86 (242) T ss_pred CCHHHHHHHHHHHHCCCEE--EEECCHHHHHHHHHHHHHCCCCCCCCCCHHHH T ss_conf 9789999999998278838--97344022779997752428999998421088 No 105 >TIGR01298 RNaseT ribonuclease T; InterPro: IPR005987 Ribonuclease T (3.1.13 from EC) is an enzyme found so far only in gamma-subdivision proteobacteria such as Escherichia coli and Xylella fastidiosa. Ribonuclease T is homologous to the DNA polymerase III alpha chain. It can liberate AMP from the common C-C-A terminus of uncharged tRNA. It appears also to be involved in RNA maturation. It also acts as a 3' to 5' single-strand DNA-specific exonuclease; it is distinctive for its ability to remove residues near a double-stranded stem. Ribonuclease T is a high copy suppressor in E. coli of a UV-repair defect caused by deletion of three other single-stranded DNA exonucleases.; GO: 0004540 ribonuclease activity, 0006396 RNA processing. Probab=94.58 E-value=0.2 Score=27.50 Aligned_cols=135 Identities=21% Similarity=0.280 Sum_probs=73.2 Q ss_pred CCEEEECCCCCCCCCCC---EEEEEEEEE-------CCCC--EEEECCCCCCCCCH---------------------HHH Q ss_conf 82799717778985445---079999841-------8942--89974657785706---------------------776 Q gi|254780636|r 21 DAIAVDTETLGLMPRRD---RLCIVQLSP-------GDGT--VDIIRIAAGQKNAP---------------------NLV 67 (207) Q Consensus 21 ~~iaiDtEt~~l~~~~~---~l~LiQl~~-------~~~~--~~l~~~~~~~~~~~---------------------~L~ 67 (207) =+|-||.||.|++...+ +++-|.+.- |+.. +.+.|+...+..+. .|. T Consensus 9 lPVVvDVET~GfN~~~dALLEiAait~~MDe~G~L~P~~~~~~~~~P~~Gani~P~~L~~~GI~~d~P~R~A~~E~~Al~ 88 (201) T TIGR01298 9 LPVVVDVETAGFNAKTDALLEIAAITLKMDEQGYLVPDETLHFHVEPFEGANIEPEALEFTGIDLDHPLRGAVEEKLALE 88 (201) T ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHCCCCCCEEECCCCEEEEEEECCCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHH T ss_conf 43488610677660134555554200011673001478525788522679988712002206544887522466889999 Q ss_pred HHHH----------CCCCCEEECCCCHHHHHHHHHH----HC-----CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHH-- Q ss_conf 7773----------2364023010000168886542----00-----0000045789886521100000088777654-- Q gi|254780636|r 68 GMLV----------DEKREKIFHYGRFDIAVLFYTF----GV-----RVRPVFCTKIASRLTRTYTNQHGLKDNLKEL-- 126 (207) Q Consensus 68 ~ll~----------d~~i~KV~Hn~~~D~~~L~~~~----gi-----~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~-- 126 (207) .+|. ..+-+-|+||+.||..+|.+.. ++ ++=..|||--.+.+. | |-.-|++-. T Consensus 89 ~iF~~vR~~~K~~gC~RailV~HNa~fD~~Fl~~AA~~R~~~KRnPFH~F~~FDTatLAGl~--y----GQTVL~kA~Q~ 162 (201) T TIGR01298 89 EIFKVVRKALKASGCQRAILVAHNASFDLGFLNRAAVKRTALKRNPFHPFSVFDTATLAGLA--Y----GQTVLAKACQA 162 (201) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHH--H----HHHHHHHHHHH T ss_conf 99999999997557960367600320466689999999861157898765224689998886--4----17999988864 Q ss_pred HCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 2002467520036565443236899998645999999999999999749 Q gi|254780636|r 127 LGINISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLG 175 (207) Q Consensus 127 lg~~ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~ 175 (207) -|+ +| ...|-+=|..|++-+-+|+=.+..++++.+ T Consensus 163 Ag~---------~f-----d~~~AHSA~YDtE~TA~LFC~ivN~wk~~g 197 (201) T TIGR01298 163 AGL---------DF-----DAKQAHSALYDTEKTAELFCEIVNRWKELG 197 (201) T ss_pred CCC---------CC-----CCCCCCHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 278---------86-----754431012003568999998863045504 No 106 >smart00486 POLBc DNA polymerase type-B family. DNA polymerase alpha, delta, epsilon and zeta chain (eukaryota), DNA polymerases in archaea, DNA polymerase II in e. coli, mitochondrial DNA polymerases and and virus DNA polymerases Probab=94.25 E-value=0.38 Score=25.90 Aligned_cols=95 Identities=18% Similarity=0.073 Sum_probs=52.8 Q ss_pred HCCCCCEEECCC-CHHHHHHHHHHHCCCC--------------------------------------CHHHHHHHHHHHH Q ss_conf 323640230100-0016888654200000--------------------------------------0045789886521 Q gi|254780636|r 71 VDEKREKIFHYG-RFDIAVLFYTFGVRVR--------------------------------------PVFCTKIASRLTR 111 (207) Q Consensus 71 ~d~~i~KV~Hn~-~~D~~~L~~~~gi~~~--------------------------------------~i~DT~ias~l~~ 111 (207) .||.+. +|||. +||...+.+.....-. .++|+.-..+--. T Consensus 83 ~dpdii-~g~N~~~FD~~yi~~R~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~D~~~~~~~~~ 161 (471) T smart00486 83 YDPDII-YGHNISNFDLPYIISRLEKLKIKPLSFIGRLKNIIDIKRKKPLFGSKSFGKTIKVKIKGRLVIDLYNLYKNKL 161 (471) T ss_pred CCCCEE-EECCCCCCCHHHHHHHHHHHCCCHHHHHCCCCCCCCCCEECCCCCCCCCCCEEEEEEEEEEEEEHHHHHHHHH T ss_conf 199999-9658878768999999998489647763735778764320332101256631489985599978899998860 Q ss_pred HHCCCHHHHHHHHHHHCCCCC---CCCEECCCCCC-CCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 100000088777654200246---75200365654-432368999986459999999999 Q gi|254780636|r 112 TYTNQHGLKDNLKELLGINIS---KAQQSSDWSAD-DLSDEQLQYAASDVVHLHALRLQF 167 (207) Q Consensus 112 ~~~~~~~L~~L~~~~lg~~ld---K~~q~SdW~~r-pLs~~Qi~YAA~Dv~~l~~L~~~l 167 (207) ...+.+|...++.++|..-+ ...-...|... ......+.|+..|+..+.+|..++ T Consensus 162 -~l~sy~L~~v~~~~l~~~k~d~~~~~i~~~~~~~~~~~~~~~~Y~~~D~~l~~~l~~~~ 220 (471) T smart00486 162 -KLPSYKLDTVAEYLLGKEKDDLPYKDIPELYNLNYKLRDELLEYCIQDAVLTLKLFNKL 220 (471) T ss_pred -CCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf -44668889999999577888899899999985393679999999999999999999986 No 107 >cd06135 Orn Oligoribonuclease (Orn) is a DEDDh-type DnaQ-like 3'-5' exoribonuclease that is responsible for degrading small oligoribonucleotides to mononucleotides. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Orn is essential for E.coli survival. The human homolog, also called Sfn (small fragment nuclease), is able to hydrolyze short single-stranded RNA and DNA oligomers. It plays a role in cellular nucleotide recycling. Probab=92.78 E-value=0.28 Score=26.67 Aligned_cols=19 Identities=37% Similarity=0.623 Sum_probs=15.8 Q ss_pred CEEEECCCCCCCCCCCEEE Q ss_conf 2799717778985445079 Q gi|254780636|r 22 AIAVDTETLGLMPRRDRLC 40 (207) Q Consensus 22 ~iaiDtEt~~l~~~~~~l~ 40 (207) .|=+|+|+|||+|..+++. T Consensus 1 lvWiDlEmTGLd~~~d~Ii 19 (173) T cd06135 1 LVWIDLEMTGLDPEKDRIL 19 (173) T ss_pred CEEEECCCCCCCCCCCEEE T ss_conf 9599330368888886489 No 108 >cd05783 DNA_polB_B1_exo The 3'-5' exonuclease domain of Sulfolobus solfataricus DNA polymerase B1 and similar archaeal proteins. B1 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B1displays thermostable polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family-B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Family-B DNA polymerases from thermophilic archaea are unique in that they are able to recognize the presence of uracil in the template strand, leading to the stalling of DNA synthes Probab=92.70 E-value=0.62 Score=24.70 Aligned_cols=140 Identities=17% Similarity=0.202 Sum_probs=68.1 Q ss_pred CCCEEEECCCCCC----CC--CCCEEEEEEEEEC--CC--CEEEECCC-------------------CCCCCCHHHHHHH Q ss_conf 9827997177789----85--4450799998418--94--28997465-------------------7785706776777 Q gi|254780636|r 20 VDAIAVDTETLGL----MP--RRDRLCIVQLSPG--DG--TVDIIRIA-------------------AGQKNAPNLVGML 70 (207) Q Consensus 20 ~~~iaiDtEt~~l----~~--~~~~l~LiQl~~~--~~--~~~l~~~~-------------------~~~~~~~~L~~ll 70 (207) -++++||.|+.+- .| -.+..-++|++.. ++ .+++.... ........+.+++ T Consensus 5 lrvlsfDIEv~s~~~~~FP~p~~~~~~iisI~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~~E~~LL~~F~~~i 84 (204) T cd05783 5 LKRIAIDIEVYTPIKGRIPDPKTAEYPVISVALAGSDGLKRVLVLKREGVEGLEGLLPEGAEVEFFDSEKELIREAFKII 84 (204) T ss_pred CEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEECCCCCEEEEEEEECCCCCCCCCCCCCCEEEEECCHHHHHHHHHHHH T ss_conf 74999998878899997979555886199999992799889999950871257655789989999599999999999997 Q ss_pred HCCCCCEEECCC-CHHHHHHHHHHHCC---CC--C-------------H-HHHH-------HHHHHHHHHCCCHHHHHHH Q ss_conf 323640230100-00168886542000---00--0-------------0-4578-------9886521100000088777 Q gi|254780636|r 71 VDEKREKIFHYG-RFDIAVLFYTFGVR---VR--P-------------V-FCTK-------IASRLTRTYTNQHGLKDNL 123 (207) Q Consensus 71 ~d~~i~KV~Hn~-~~D~~~L~~~~gi~---~~--~-------------i-~DT~-------ias~l~~~~~~~~~L~~L~ 123 (207) .+..++ ++.|. .||+..|.+..... .. + + .|.. +..........+++|.+++ T Consensus 85 ~~~p~I-i~wN~~~FDiPYL~~R~~~Lg~~~~~~~~~~~~~~~~~~~~~~~dl~~~~~~~~~~~y~~~~k~~sY~Ld~Va 163 (204) T cd05783 85 SEYPIV-LTFNGDNFDLPYLYNRALKLGIPKEEIPIYLKRDYATLKHGIHIDLYKFFSNRAIQVYAFGNKYREYTLDAVA 163 (204) T ss_pred HCCCEE-EECCCCCCCHHHHHHHHHHHCCCCEEEEEEECCEEEEEEEEEEEEEHHHCCCCHHHHHHCCCCCCCCCHHHHH T ss_conf 439979-9817867788999999998599820456673436999934499986463573135441202421225288999 Q ss_pred HHHHCCC-CCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 6542002-4675200365654432368999986459999999 Q gi|254780636|r 124 KELLGIN-ISKAQQSSDWSADDLSDEQLQYAASDVVHLHALR 164 (207) Q Consensus 124 ~~~lg~~-ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~ 164 (207) +.+||.+ ++=+... +.. .. +.-+.|...||....+|- T Consensus 164 ~~~LGe~K~d~~~~~--~~~-d~-~k~i~YNi~D~~Lv~~L~ 201 (204) T cd05783 164 KALLGEGKVELEKNI--SEL-NL-YELAEYNYRDAELTLELT 201 (204) T ss_pred HHHHCCCCCCCCCCC--CCC-CH-HHHHEEEHHHHHHHHHHH T ss_conf 998388860067644--425-86-785514267767777862 No 109 >KOG0969 consensus Probab=86.77 E-value=1.5 Score=22.41 Aligned_cols=156 Identities=15% Similarity=0.159 Sum_probs=86.1 Q ss_pred CCCCCCEEEECCCCCCCCCCCEE---EEEEEEE-----CCCCE------E-----------EECCCCCCCCCHHHHHHH- Q ss_conf 24898279971777898544507---9999841-----89428------9-----------974657785706776777- Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPRRDRL---CIVQLSP-----GDGTV------D-----------IIRIAAGQKNAPNLVGML- 70 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~~~~l---~LiQl~~-----~~~~~------~-----------l~~~~~~~~~~~~L~~ll- 70 (207) +..-.+++||.|..|-....|.. -.||+|. +++.. . +..........+..+.++ T Consensus 271 ~APlrvlSfDIECagrkg~FPe~~~DPvIQIan~v~~~Ge~~pfvrnvf~l~~capI~G~~V~~~~~E~elL~~W~~fir 350 (1066) T KOG0969 271 IAPLRVLSFDIECAGRKGVFPEAKIDPVIQIANLVTLQGENEPFVRNVFTLKTCAPIVGSNVHSYETEKELLESWRKFIR 350 (1066) T ss_pred CCCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCEEEEECCHHHHHHHHHHHHH T ss_conf 56500154137852577888860117499998888871688467876204667688777525771338899999999998 Q ss_pred -HCCCCCEEECC-CCHHHHHHHHH---HHCC-------CCC----HHHHHHHHHHH-----------------------H Q ss_conf -32364023010-00016888654---2000-------000----04578988652-----------------------1 Q gi|254780636|r 71 -VDEKREKIFHY-GRFDIAVLFYT---FGVR-------VRP----VFCTKIASRLT-----------------------R 111 (207) Q Consensus 71 -~d~~i~KV~Hn-~~~D~~~L~~~---~gi~-------~~~----i~DT~ias~l~-----------------------~ 111 (207) -||+++ +|.| +.||+.-+.+. +||. .+| +-||.+.|+.- | T Consensus 351 evDPDvI-~GYNi~nFDiPYll~RA~tL~Ie~Fp~LGRikn~~s~irDttfSSkq~GtRetK~v~I~GRlqfDllqvi~R 429 (1066) T KOG0969 351 EVDPDVI-IGYNICNFDIPYLLNRAKTLGIENFPYLGRIKNSRSVIRDSTFSSKQYGTRETKEVNIDGRLQFDLLQVILR 429 (1066) T ss_pred HCCCCEE-ECCCCCCCCCCEECCHHHHCCCCCCCCCCEECCCCEEEECCCCCHHHCCCCCCEEEEECCEEEEHHHHHHHH T ss_conf 6298867-133544556321307676468066650100035412441032123213751013886656565227999877 Q ss_pred HHC-CCHHHHHHHHHHHCCCCCC-------CCEECCC-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH Q ss_conf 100-0000887776542002467-------5200365-6544323689999864599999999999999974918 Q gi|254780636|r 112 TYT-NQHGLKDNLKELLGINISK-------AQQSSDW-SADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLGRS 177 (207) Q Consensus 112 ~~~-~~~~L~~L~~~~lg~~ldK-------~~q~SdW-~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~ 177 (207) .+. .+++|.+.+.+|||.+-.- ..|..|= .+|-| -.|+..||+..+.|.++|.-..+-.... T Consensus 430 dyKLrSytLNaVs~hFL~EQKEDV~~siItdLQng~~~TRRRl----A~YCLKDAYLPlRLlekLM~ivNy~EMA 500 (1066) T KOG0969 430 DYKLRSYTLNAVSAHFLGEQKEDVHHSIITDLQNGNEQTRRRL----AVYCLKDAYLPLRLLEKLMVIVNYTEMA 500 (1066) T ss_pred HHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHCCCHHHHHHH----HHHHHHHHCCHHHHHHHHHHHHHHHHHH T ss_conf 6433111011457876553201565210043314767888899----9887644102299999999998678656 No 110 >KOG0304 consensus Probab=77.96 E-value=3.9 Score=20.05 Aligned_cols=66 Identities=18% Similarity=0.233 Sum_probs=37.8 Q ss_pred HHCCCCCEEECCCCHHHHHHHHHH----------------HCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCC-CC Q ss_conf 732364023010000168886542----------------0000000457898865211000000887776542002-46 Q gi|254780636|r 70 LVDEKREKIFHYGRFDIAVLFYTF----------------GVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGIN-IS 132 (207) Q Consensus 70 l~d~~i~KV~Hn~~~D~~~L~~~~----------------gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~-ld 132 (207) ..+++|.||-..+.+|...|.+-+ ......++|+...++.+.+.+.+.||..++.. ||++ .. T Consensus 138 v~~~~V~WvTFhs~YDfgYLlK~Lt~~~LP~~~~eF~~~v~~~fp~vYDiK~l~~~c~~~~l~~GL~~lA~~-L~~~RvG 216 (239) T KOG0304 138 VLDENVTWVTFHSGYDFGYLLKILTGKPLPETEEEFFEIVRQLFPFVYDVKYLMKFCEGLSLKGGLQRLADL-LGLKRVG 216 (239) T ss_pred HCCCCEEEEEEECCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHH-HCCCEEE T ss_conf 006865799832210299999997179886018899999999740266199999760333530379999998-5887111 Q ss_pred CCCE Q ss_conf 7520 Q gi|254780636|r 133 KAQQ 136 (207) Q Consensus 133 K~~q 136 (207) +.+| T Consensus 217 ~~Hq 220 (239) T KOG0304 217 IAHQ 220 (239) T ss_pred CCCC T ss_conf 0323 No 111 >pfam03104 DNA_pol_B_exo DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold. Probab=74.44 E-value=6.2 Score=18.87 Aligned_cols=75 Identities=16% Similarity=0.112 Sum_probs=38.2 Q ss_pred CCCCEEEECCCCCCCCCC---CEEEEEEEEE-------CC----CCEEEECCCCC------------CCCCHHHHHHH-- Q ss_conf 898279971777898544---5079999841-------89----42899746577------------85706776777-- Q gi|254780636|r 19 YVDAIAVDTETLGLMPRR---DRLCIVQLSP-------GD----GTVDIIRIAAG------------QKNAPNLVGML-- 70 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~---~~l~LiQl~~-------~~----~~~~l~~~~~~------------~~~~~~L~~ll-- 70 (207) .-.+++||.|+.+..-.- .+=..||||. ++ ..+.+..+... ......+..++ T Consensus 85 plrilsfDIEc~~~~g~FP~~~~d~IiqIs~~~~~~g~~~~~~~~~~tl~~~~~i~~~~~V~~f~~E~~ll~~f~~~i~~ 164 (254) T pfam03104 85 PLRVLSFDIECTSLKGKFPENEKDPIIQISCMLYREGEPDPIEDKLFTLKSCALIEIGGEVYIFPSEKELLRRFFEFIRQ 164 (254) T ss_pred CCEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCEEEEEECCCCCCCCCCEEEEECCHHHHHHHHHHHHHH T ss_conf 85399998895888999998789838999989885678986320699807877677897799959999999999999875 Q ss_pred HCCCCCEEECCC-CHHHHHHHHHHH Q ss_conf 323640230100-001688865420 Q gi|254780636|r 71 VDEKREKIFHYG-RFDIAVLFYTFG 94 (207) Q Consensus 71 ~d~~i~KV~Hn~-~~D~~~L~~~~g 94 (207) .||+|. +|+|. .||+..|..... T Consensus 165 ~dpdii-~gyNi~~FD~~yl~~Ra~ 188 (254) T pfam03104 165 YDPDII-TGYNGDNFDWPYILNRAK 188 (254) T ss_pred CCCCEE-EECCCCCCCHHHHHHHHH T ss_conf 199899-966787788899999999 No 112 >PRK05359 oligoribonuclease; Provisional Probab=70.29 E-value=5.7 Score=19.07 Aligned_cols=22 Identities=41% Similarity=0.537 Sum_probs=17.8 Q ss_pred CCCCEEEECCCCCCCCCCCEEE Q ss_conf 8982799717778985445079 Q gi|254780636|r 19 YVDAIAVDTETLGLMPRRDRLC 40 (207) Q Consensus 19 ~~~~iaiDtEt~~l~~~~~~l~ 40 (207) ....|=+|+|+|||++..+++. T Consensus 2 ~~~lvWiDlEMTGLd~~~d~Il 23 (181) T PRK05359 2 EDNLIWIDLEMTGLDPERDRII 23 (181) T ss_pred CCCEEEEECCCCCCCCCCCEEE T ss_conf 9865899761348987877689 No 113 >KOG1798 consensus Probab=64.57 E-value=10 Score=17.61 Aligned_cols=150 Identities=18% Similarity=0.208 Sum_probs=80.6 Q ss_pred HCCCCCCEEEECCCCCCCC-----CCCEEEEEEEEECCCCEEEECCCCC--------------------------CCCCH Q ss_conf 4248982799717778985-----4450799998418942899746577--------------------------85706 Q gi|254780636|r 16 AARYVDAIAVDTETLGLMP-----RRDRLCIVQLSPGDGTVDIIRIAAG--------------------------QKNAP 64 (207) Q Consensus 16 ~l~~~~~iaiDtEt~~l~~-----~~~~l~LiQl~~~~~~~~l~~~~~~--------------------------~~~~~ 64 (207) .-....++|||.||+-+-- -.|++..|+--. +|..++|- ++. ...+. T Consensus 242 ~radp~VlAFDIETtKlPLKFPDae~DqIMMISYMi-DGqGfLIt-NREiVs~DIedfEYTPKpE~eG~F~v~Ne~dEv~ 319 (2173) T KOG1798 242 ERADPRVLAFDIETTKLPLKFPDAESDQIMMISYMI-DGQGFLIT-NREIVSEDIEDFEYTPKPEYEGPFCVFNEPDEVG 319 (2173) T ss_pred CCCCCEEEEEEEECCCCCCCCCCCCCCEEEEEEEEE-CCCEEEEE-CHHHHCCCHHHCCCCCCCCCCCCEEEECCCCHHH T ss_conf 257855899851035578779986656078999996-47468996-2234305212223688632525458866876889 Q ss_pred HHHHHHHC-----CCCCEEECCCC-HHHHHHHHHH---HCCCC---CHH-HHH---HHH---H------HH-HHH--CCC Q ss_conf 77677732-----36402301000-0168886542---00000---004-578---988---6------52-110--000 Q gi|254780636|r 65 NLVGMLVD-----EKREKIFHYGR-FDIAVLFYTF---GVRVR---PVF-CTK---IAS---R------LT-RTY--TNQ 116 (207) Q Consensus 65 ~L~~ll~d-----~~i~KV~Hn~~-~D~~~L~~~~---gi~~~---~i~-DT~---ias---~------l~-~~~--~~~ 116 (207) -|..+|+. |.|. |..|+. ||-.++.+.. |+..+ ++. |.+ .++ + +- ++| ..+ T Consensus 320 Ll~RfFeHiq~~kP~ii-vTyNGDFFDWPFve~Ra~~hGi~m~eEiGF~~D~~gEyks~~c~HmDcfrWVKRDSYLPqGS 398 (2173) T KOG1798 320 LLQRFFEHIQEVKPTII-VTYNGDFFDWPFVEARAKIHGISMNEEIGFRRDSQGEYKSPFCIHMDCFRWVKRDSYLPQGS 398 (2173) T ss_pred HHHHHHHHHHHCCCCEE-EEECCCCCCCHHHHHHHHHCCCCCCHHCCCEECCCCCCCCCCEEEHHHHHHHHHCCCCCCCC T ss_conf 99999999986189679-97368644666667788750777112108520653331256321126666652046577766 Q ss_pred HHHHHHHHHHHCCC---CCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00887776542002---4675200365654432368999986459999999999999 Q gi|254780636|r 117 HGLKDNLKELLGIN---ISKAQQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEK 170 (207) Q Consensus 117 ~~L~~L~~~~lg~~---ldK~~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~ 170 (207) +||++..+.-||.. +|-+.-+ -. +..=++.+-.|.+.||..+.-||-+.... T Consensus 399 qgLKAVTkaKLGYdPvEvdPEdM~-~~-A~EkPQ~lasYSVSDAVATYyLYMkYVhP 453 (2173) T KOG1798 399 QGLKAVTKAKLGYDPVEVDPEDMV-RM-AMEKPQTLASYSVSDAVATYYLYMKYVHP 453 (2173) T ss_pred CCHHHHHHHHHCCCCCCCCHHHHH-HH-HHHCCHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf 312688887507894437988854-66-65082565300057778888898987506 No 114 >COG1949 Orn Oligoribonuclease (3'-5' exoribonuclease) [RNA processing and modification] Probab=63.61 E-value=8.1 Score=18.19 Aligned_cols=27 Identities=33% Similarity=0.580 Sum_probs=21.2 Q ss_pred CCCCCEEEECCCCCCCCCCCEEEEEEEEE Q ss_conf 48982799717778985445079999841 Q gi|254780636|r 18 RYVDAIAVDTETLGLMPRRDRLCIVQLSP 46 (207) Q Consensus 18 ~~~~~iaiDtEt~~l~~~~~~l~LiQl~~ 46 (207) .+.+.|=+|+|-|||+|.++++ |-+|+ T Consensus 4 ~~~nLiWIDlEMTGLd~~~drI--IEiA~ 30 (184) T COG1949 4 NKNNLIWIDLEMTGLDPERDRI--IEIAT 30 (184) T ss_pred CCCCEEEEEEEECCCCCCCCEE--EEEEE T ss_conf 7786289963202688676427--88888 No 115 >COG2925 SbcB Exonuclease I [DNA replication, recombination, and repair] Probab=63.22 E-value=11 Score=17.46 Aligned_cols=145 Identities=21% Similarity=0.173 Sum_probs=79.7 Q ss_pred CCCCCEEEECCCCCCCCCCCEEEEEEEEE----------CCCCEEEECCCCCCCC------------------------- Q ss_conf 48982799717778985445079999841----------8942899746577857------------------------- Q gi|254780636|r 18 RYVDAIAVDTETLGLMPRRDRLCIVQLSP----------GDGTVDIIRIAAGQKN------------------------- 62 (207) Q Consensus 18 ~~~~~iaiDtEt~~l~~~~~~l~LiQl~~----------~~~~~~l~~~~~~~~~------------------------- 62 (207) .+...+-.|-||.|.+|-+++.+ |++. ++-.+++-... ..+. T Consensus 7 ~~~tF~~yDYETfG~~Pa~DRPa--QFAgIRTD~~~NiIgeP~~fyCkps-dDyLP~P~a~LITGITPQ~~~~~Gi~E~~ 83 (475) T COG2925 7 KQPTFLFYDYETFGVHPALDRPA--QFAGIRTDIEFNIIGEPIVFYCKPA-DDYLPQPGAVLITGITPQEAREKGINEAA 83 (475) T ss_pred CCCCEEEEECHHCCCCCCCCCCH--HHHEEECCCCCCCCCCCEEEEECCC-CCCCCCCCCEEEECCCHHHHHHCCCCHHH T ss_conf 89867997503317785666613--5411210343432478727985475-45689987366625787889764898589 Q ss_pred -CHHHHHHHHCCCCCEEECC-CCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHH-----CCCCCCCC Q ss_conf -0677677732364023010-0001688865420000000457898865211000000887776542-----00246752 Q gi|254780636|r 63 -APNLVGMLVDEKREKIFHY-GRFDIAVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELL-----GINISKAQ 135 (207) Q Consensus 63 -~~~L~~ll~d~~i~KV~Hn-~~~D~~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~l-----g~~ldK~~ 135 (207) +...-..|..|+..-+|.| .+||=++-.+ +.-+|++|-.-=|.- ++..++.|-++++--. |++-.|.. T Consensus 84 F~~~I~~~ls~P~Tcv~GYNniRfDDEvtRy---~fyRNF~DPYa~sWq--ngNSRWDLLD~~RacyALRPeGI~Wp~n~ 158 (475) T COG2925 84 FAARIHAELTQPNTCVLGYNNIRFDDEVTRY---IFYRNFYDPYAWSWQ--NGNSRWDLLDVVRACYALRPEGINWPEND 158 (475) T ss_pred HHHHHHHHHCCCCEEEECCCCCCCCHHHHHH---HHHHHCCCHHHHHHC--CCCCHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 9999999857998156235553332688988---898722766667613--89842579999999975285568788678 Q ss_pred -EECCCCC------CCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf -0036565------4432368999986459999999999999 Q gi|254780636|r 136 -QSSDWSA------DDLSDEQLQYAASDVVHLHALRLQFTEK 170 (207) Q Consensus 136 -q~SdW~~------rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~ 170 (207) +..-+.- -.+-..+-+=|..||+.++.+-...++. T Consensus 159 dG~pSFkLEhLtkANgieH~nAHdAmsDVyATIamAklvk~~ 200 (475) T COG2925 159 DGLPSFKLEHLTKANGIEHSNAHDAMSDVYATIAMAKLVKTA 200 (475) T ss_pred CCCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 897511068776425611000567889999999999998761 No 116 >PRK07748 sporulation inhibitor KapD; Provisional Probab=54.06 E-value=16 Score=16.53 Aligned_cols=87 Identities=20% Similarity=0.151 Sum_probs=48.9 Q ss_pred HHHHHHCCCCCEEECCCCHHHHHHHHHH---HCCC---CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECC Q ss_conf 7677732364023010000168886542---0000---000457898865211000000887776542002467520036 Q gi|254780636|r 66 LVGMLVDEKREKIFHYGRFDIAVLFYTF---GVRV---RPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSD 139 (207) Q Consensus 66 L~~ll~d~~i~KV~Hn~~~D~~~L~~~~---gi~~---~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~Sd 139 (207) |.++..+++.. +.+.+.+|+..|.+.. |+.. ...+|.+-.....-+...+.||+..++. +|+..+ T Consensus 86 f~~~~~~~~~~-~~tWG~~D~~~L~~~c~~~~i~~p~~~~~idlk~~f~~~~~~~~~~gL~~aL~~-lgl~~e------- 156 (205) T PRK07748 86 LAEYDKRCKPT-IVTWGNMDMKVLKHNCEKAGVPFPFKGQCRDLSLEYKKFFGERNQTGLWKAIEE-YGKEGT------- 156 (205) T ss_pred HHHHHCCCCEE-EEEECHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHCCCCCCCHHHHHHH-CCCCCC------- T ss_conf 99981778728-987178899999999999389998531006399999999688887689999997-699888------- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 565443236899998645999999999999 Q gi|254780636|r 140 WSADDLSDEQLQYAASDVVHLHALRLQFTE 169 (207) Q Consensus 140 W~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~ 169 (207) . .-+=|..||.++-+|+.++.. T Consensus 157 --G------~~H~glDDA~NtAkI~kk~~~ 178 (205) T PRK07748 157 --G------KHHCALDDAMTTYNIFKLVEK 178 (205) T ss_pred --C------CCCCHHHHHHHHHHHHHHHHC T ss_conf --9------863758999999999999761 No 117 >pfam06369 Anemone_cytotox Sea anemone cytotoxic protein. Sea anemones are a rich source of cytotoxic proteins. Cytolysins comprise a group of more than 30 highly basic proteins with molecular masses of about 20 kDa. Cytolysins isolated from the sea anemone, Heteractis magnifica, include magnificalysin I (HMg I), magnificalysin II (HMg II) and Heteractis magnifica toxin (HMgtxn). These are highly homologous at their N-terminals. HMg I and II have molecular masses of approximately 19 kDa, and pI values of 9.4 and 10.0, respectively. Cytolysins isolated from other sea anemones Actinia tenebrosa (Tenebrosin-C, TN-C), Actinia equina (Equinatoxin, EqT) and Stichodactyla helianthus (ShC) exhibit pore-forming, haemolytic, cytotoxic, and heart stimulatory activities. Probab=51.54 E-value=9.8 Score=17.72 Aligned_cols=76 Identities=14% Similarity=0.064 Sum_probs=45.5 Q ss_pred CEEEECCC----CCCCCCC-CEEEEE--EEEECCCCEEEECCCCCCC-CCH----HHHHHHHCCCCCEEECCCCHHHHHH Q ss_conf 27997177----7898544-507999--9841894289974657785-706----7767773236402301000016888 Q gi|254780636|r 22 AIAVDTET----LGLMPRR-DRLCIV--QLSPGDGTVDIIRIAAGQK-NAP----NLVGMLVDEKREKIFHYGRFDIAVL 89 (207) Q Consensus 22 ~iaiDtEt----~~l~~~~-~~l~Li--Ql~~~~~~~~l~~~~~~~~-~~~----~L~~ll~d~~i~KV~Hn~~~D~~~L 89 (207) .||||.|+ +.++.|. ...+=+ --..++..+++..-.+..- .+. .|.=.+.|.+++-|+.+..||..+- T Consensus 32 AiGvdNesg~twtaln~YFrSGtsDv~Lp~~vp~~kAlLy~~rK~~gPvatGaVGv~aY~ms~gnTLavmfSVPfDYNlY 111 (176) T pfam06369 32 AVGVDNESGLTWTALNTYFRSGTSDVTLPYEVPNSKALLYTGRKSKGPVATGAVGVLAYKMSSGNTLAVMFSVPFDYNWY 111 (176) T ss_pred EEEEECCCCCEEEHHHHHHCCCCCCEECCCCCCCCCEEEEECCCCCCCCEECEEEEEEEEECCCCEEEEEEECCCCHHHH T ss_conf 99773477865432211011687531320305775024673354679722020247888743898899999646323544 Q ss_pred HHHHHCCC Q ss_conf 65420000 Q gi|254780636|r 90 FYTFGVRV 97 (207) Q Consensus 90 ~~~~gi~~ 97 (207) .+.+++.+ T Consensus 112 sNWwnv~v 119 (176) T pfam06369 112 SNWWNVKI 119 (176) T ss_pred HCCEEEEE T ss_conf 23014676 No 118 >KOG0970 consensus Probab=48.53 E-value=19 Score=16.01 Aligned_cols=96 Identities=14% Similarity=0.138 Sum_probs=54.2 Q ss_pred HCCCCCEEECC-CCHHHHHHHHHHHC-------------------------------CCCC-HHHHHHHHHHHHHHCCCH Q ss_conf 32364023010-00016888654200-------------------------------0000-045789886521100000 Q gi|254780636|r 71 VDEKREKIFHY-GRFDIAVLFYTFGV-------------------------------RVRP-VFCTKIASRLTRTYTNQH 117 (207) Q Consensus 71 ~d~~i~KV~Hn-~~~D~~~L~~~~gi-------------------------------~~~~-i~DT~ias~l~~~~~~~~ 117 (207) +||++. |||| -.+++.+|...... ..+. +.|+.++++=+-. ..++ T Consensus 621 ~dpD~i-VgHn~~~~~l~VLl~R~~~~Kip~WS~IgRLrrS~~~kfg~~s~~~e~~~~aGRl~CD~~~~a~~lik-~~S~ 698 (1429) T KOG0970 621 EDPDVI-VGHNIQGFYLDVLLSRLHALKIPNWSSIGRLRRSWPPKFGRSSSFGEFFIIAGRLMCDLNLAARELIK-AQSY 698 (1429) T ss_pred CCCCEE-EEECCCCCHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHC-CCCC T ss_conf 299879-97141102099999999873486443432221046400177665442110211578505777776451-3566 Q ss_pred HHHHHHHHHHCC---CCCCCCEECCCCC-CCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 088777654200---2467520036565-443236899998645999999999999 Q gi|254780636|r 118 GLKDNLKELLGI---NISKAQQSSDWSA-DDLSDEQLQYAASDVVHLHALRLQFTE 169 (207) Q Consensus 118 ~L~~L~~~~lg~---~ldK~~q~SdW~~-rpLs~~Qi~YAA~Dv~~l~~L~~~l~~ 169 (207) +|..|+...|+. ++.-....--|.. ..| -.++.-.-.|+++.++|+-++.- T Consensus 699 ~LseL~q~~l~~eR~~i~~~~i~~~y~~s~~L-~~ll~~~~~d~~~~l~i~~~l~~ 753 (1429) T KOG0970 699 SLSELSQQILKEERKEINANEIPKMYEDSKSL-TYLLEHTITDAELILQIMFRLNA 753 (1429) T ss_pred CHHHHHHHHHHHHCCCCCHHHHHHHCCCHHHH-HHHHHHHHHHHHHHHHHHHHHCC T ss_conf 59999999986541557875754523670779-99999876779999999999541 No 119 >COG0417 PolB DNA polymerase elongation subunit (family B) [DNA replication, recombination, and repair] Probab=47.84 E-value=20 Score=15.95 Aligned_cols=145 Identities=20% Similarity=0.144 Sum_probs=70.1 Q ss_pred CCCCCEEEECCCCCCCCC-----CCEEEEEEEEECCCCEEE-----ECCC-----CCCCCCH---HHHHHH--HCCCCCE Q ss_conf 489827997177789854-----450799998418942899-----7465-----7785706---776777--3236402 Q gi|254780636|r 18 RYVDAIAVDTETLGLMPR-----RDRLCIVQLSPGDGTVDI-----IRIA-----AGQKNAP---NLVGML--VDEKREK 77 (207) Q Consensus 18 ~~~~~iaiDtEt~~l~~~-----~~~l~LiQl~~~~~~~~l-----~~~~-----~~~~~~~---~L~~ll--~d~~i~K 77 (207) ..-+++|||.|+.+..-. .+.+..|+.+........ .... ......+ .+...+ .||.+. T Consensus 152 p~l~~la~DiE~~~~~~~~~~~~~d~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~~~~e~e~l~~~~~~i~~~dPdVI- 230 (792) T COG0417 152 PPLRVLAFDIETLSEPGKFPDGEKDPIIMISYAIEAEGGLIEVFIYTSGEGFSVEVVISEAELLERFVELIREYDPDVI- 230 (792) T ss_pred CCCCEEEEEEEECCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCHHHHHHHHHHHHHHCCCCEE- T ss_conf 8875479950310577788987778626999970467776444322356776348725879999999999850298999- Q ss_pred EECCCC-HHHHHHHHHH---HCCCC---------------------CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC- Q ss_conf 301000-0168886542---00000---------------------004578988652110000008877765420024- Q gi|254780636|r 78 IFHYGR-FDIAVLFYTF---GVRVR---------------------PVFCTKIASRLTRTYTNQHGLKDNLKELLGINI- 131 (207) Q Consensus 78 V~Hn~~-~D~~~L~~~~---gi~~~---------------------~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~l- 131 (207) +++|+. ||...|.... |+... ..+|+.-+.+--.....+.++...+..+++..- T Consensus 231 vgyn~~~fd~pyl~~Ra~~lgi~~~~gr~~~~~~~~~~~~~~~~Gr~~iDl~~~~~~~~~~~~~ysl~~v~~~~l~~~k~ 310 (792) T COG0417 231 VGYNGDNFDWPYLAERAERLGIPLRLGRDGSELRVRKSGFSSQVGRLHIDLYPALRRRPLNLKSYSLEAVSEALLGEGKR 310 (792) T ss_pred EEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCEEEEEHHHHHHHCCCCCCCEEHHHHHHHHCCCCCH T ss_conf 98367777738999999981998511356653200212544557716741066542032355632434677664133522 Q ss_pred -CCCC-----EECCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf -6752-----003656544323689999864599999999 Q gi|254780636|r 132 -SKAQ-----QSSDWSADDLSDEQLQYAASDVVHLHALRL 165 (207) Q Consensus 132 -dK~~-----q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~ 165 (207) +... -..+|.... ...+.|.-.|+....++.. T Consensus 311 ~~~~~~~~~~~~~~~~~~~--~~~~~~~l~~~~l~~~~~~ 348 (792) T COG0417 311 EDIPYDSMEEIWPDWADSK--LRLLLYNLSDADLVLRILL 348 (792) T ss_pred HCCCCCHHHHCCCCCCCCH--HHHHHHCCCHHHHHHHHHH T ss_conf 0056201333042355107--6888834107889999986 No 120 >pfam01707 Peptidase_C9 Peptidase family C9. Probab=46.19 E-value=21 Score=15.80 Aligned_cols=79 Identities=13% Similarity=0.310 Sum_probs=47.4 Q ss_pred HHHHHHCCCCCEEECC------CCHHHHHHHHHHHCCCCCH-HHHHHHH-HHHHHHCCCHHHHHHHHHHHCCCCCCCC-- Q ss_conf 7677732364023010------0001688865420000000-4578988-6521100000088777654200246752-- Q gi|254780636|r 66 LVGMLVDEKREKIFHY------GRFDIAVLFYTFGVRVRPV-FCTKIAS-RLTRTYTNQHGLKDNLKELLGINISKAQ-- 135 (207) Q Consensus 66 L~~ll~d~~i~KV~Hn------~~~D~~~L~~~~gi~~~~i-~DT~ias-~l~~~~~~~~~L~~L~~~~lg~~ldK~~-- 135 (207) ++.+++-+.-.-+|.| +|-=..+|... ||.+..- ..|-+.+ +-=+.++.-+.|..+|-+++|++||.+. T Consensus 30 M~~i~~~~~~~d~FqnK~NVCWAKaL~pVL~TA-GI~lT~~qW~~i~p~FkeDka~S~e~aLn~iCtr~fG~DLdSGLFS 108 (202) T pfam01707 30 MKAINETAPPVDPFQNKANVCWAKALVPVLATA-GIRLTAEQWSTLFPAFKEDKAYSPEVALNEICTKFFGVDLDSGLFS 108 (202) T ss_pred HHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 999856899876211223001387778899863-8535699998886876136666889999999999846435765436 Q ss_pred --------EECCCCCCCC Q ss_conf --------0036565443 Q gi|254780636|r 136 --------QSSDWSADDL 145 (207) Q Consensus 136 --------q~SdW~~rpL 145 (207) ....|..+|= T Consensus 109 ap~vpl~y~n~HWDNsPg 126 (202) T pfam01707 109 APSVPLTYHNNHWDNSPG 126 (202) T ss_pred CCCCCEEEECCCCCCCCC T ss_conf 886643765365568998 No 121 >cd05778 DNA_polB_zeta_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase zeta. DNA polymerase zeta is a family-B DNA polymerase which is distantly related to DNA polymerase delta. It plays a major role in translesion replication and the production of either spontaneous or induced mutations. In addition, DNA polymerase zeta also appears to be involved in somatic hypermutability in B lymphocytes, an important element for the production of high affinity antibodies in response to an antigen. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The DnaQ-like 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. Probab=42.42 E-value=9.1 Score=17.89 Aligned_cols=50 Identities=14% Similarity=0.143 Sum_probs=33.0 Q ss_pred CCHHHHHHHHHHHCCCCCCC--CEECCC-C-CCCC-CHHHHHHHHHHHHHHHHHH Q ss_conf 00008877765420024675--200365-6-5443-2368999986459999999 Q gi|254780636|r 115 NQHGLKDNLKELLGINISKA--QQSSDW-S-ADDL-SDEQLQYAASDVVHLHALR 164 (207) Q Consensus 115 ~~~~L~~L~~~~lg~~ldK~--~q~SdW-~-~rpL-s~~Qi~YAA~Dv~~l~~L~ 164 (207) .+++|.+++.++||.+...- .+.++| . ..|- ...-+.|+..||...++|. T Consensus 176 ~sYsLe~V~~~~L~e~k~~~~~~~l~~~~~~g~~~~r~~v~~Yclkd~~l~l~Ll 230 (231) T cd05778 176 TNYTLENVVYHVLHQRIPLYSNKTLTEWYKSGSASERWRVLEYYLKRVRLNLEIL 230 (231) T ss_pred CCCCHHHHHHHHHCCCCCCCCHHHHHHHHHCCCHHHHHHHHEEEHHHHHHHHHHH T ss_conf 4387999999996798888999999999974998785164400075305158652 No 122 >KOG3242 consensus Probab=41.21 E-value=25 Score=15.34 Aligned_cols=34 Identities=18% Similarity=0.190 Sum_probs=24.9 Q ss_pred CCCCHHHHCCCCCCEEEECCCCCCCCCCCEEEEE Q ss_conf 7888578424898279971777898544507999 Q gi|254780636|r 9 GDIPAECAARYVDAIAVDTETLGLMPRRDRLCIV 42 (207) Q Consensus 9 ~dl~~~~~l~~~~~iaiDtEt~~l~~~~~~l~Li 42 (207) +++++....-..+.|=+|+|-|||+...++++=| T Consensus 15 ~~~~~~~~~l~q~lVWiD~EMTGLdvekd~i~Ei 48 (208) T KOG3242 15 NTLMETGDKLKQPLVWIDCEMTGLDVEKDRIIEI 48 (208) T ss_pred CCCHHHHCCCCCCEEEEEEECCCCCCCCCEEEEE T ss_conf 3232231022486489961012553231316878 No 123 >TIGR02093 P_ylase glycogen/starch/alpha-glucan phosphorylases; InterPro: IPR011833 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described . This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site . The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Proteins in this entry are glycosyltransferases with phosphorylase activities. Members use phosphate to break alpha 1,4 linkages between pairs of glucose residues at the end of long glucose polymers, releasing alpha-D-glucose 1-phosphate. The nomenclature convention is to preface the name according to the natural substrate, as in glycogen phosphorylase, starch phosphorylase, maltodextrin phosphorylase, etc. The main role of glycogen phosphorylase (GPase) is to provide phosphorylated glucose molecules (G-1-P) . GPase is a highly regulated allosteric enzyme. The net effect of the regulatory site allows the enzyme to operate at a variety of rates; the enzyme is not simply regulated as "on" or "off", but rather it can be thought of being set to operate at an ideal rate based on changing conditions at in the cell. The most important allosteric effector is the phosphate molecule covalently attached to Ser14. This switches GPase from the b (inactive) state to the a (active) state. Upon phosphorylation, GPase attains about 800f its Vmax. When the enzyme is not phosphorylated, GPase activity is practically non-existent at low AMP levels . There is some apparent controversy as to the structure of GPase. All sources agree that the enzyme is multimeric, but there is apparent controversy as to the enzyme being a tetramer or a dimer. Apparently, GPase (in the a form) forms tetramers in the crystal form. The consensus seems to be that 'regardless of the a or b form, GPase functions as a dimer in vivo . The GPase monomer is best described as consisting of two domains, an N-terminal domain and a C-terminal domain . The C-terminal domain is often referred to as the catalytic domain. It consists of a beta-sheet core surrounded by layers of helical segments . The vitamin cofactor pyridoxal phosphate (PLP) is covalently attached to the amino acid backbone. The N-terminal domain also consists of a central beta-sheet core and is surrounded by layers of helical segments. The N-terminal domain contains different allosteric effector sites to regulate the enzyme. Bacterial phosphorylases follow the same catalytic mechanisms as their plant and animal counterparts, but differ considerably in terms of their substrate specificity and regulation. The catalytic domains are highly conserved while the regulatory sites are only poorly conserved. For maltodextrin phosphorylase from Escherichia coli the physiological role of the enzyme in the utilisation of maltidextrins is known in detail; that of all the other bacterial phosphorylases is still unclear. Roles in regulatuon of endogenous glycogen metabolism in periods of starvation, and sporulation, stress response or quick adaptation to changing environments are possible .; GO: 0004645 phosphorylase activity, 0030170 pyridoxal phosphate binding, 0005975 carbohydrate metabolic process. Probab=41.05 E-value=5.8 Score=19.04 Aligned_cols=40 Identities=20% Similarity=0.177 Sum_probs=15.6 Q ss_pred HHHHHHCCCCCEEECCCCH----HH-HHHHHHHHCCC--CCHHHHHH Q ss_conf 7677732364023010000----16-88865420000--00045789 Q gi|254780636|r 66 LVGMLVDEKREKIFHYGRF----DI-AVLFYTFGVRV--RPVFCTKI 105 (207) Q Consensus 66 L~~ll~d~~i~KV~Hn~~~----D~-~~L~~~~gi~~--~~i~DT~i 105 (207) |.++-.|+...+=+++.|. ++ ..+.++.||.+ ..+||.|| T Consensus 508 L~~ya~D~~f~~~~~~vK~~NK~~LA~yi~~~~GV~~dP~siFDVQv 554 (822) T TIGR02093 508 LEPYADDSEFLEKFAQVKQANKQRLAAYIKEETGVEVDPNSIFDVQV 554 (822) T ss_pred HHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCCCCCCCH T ss_conf 11004898999999999999999999999977792768665532112 No 124 >PRK05601 DNA polymerase III subunit epsilon; Validated Probab=39.71 E-value=27 Score=15.20 Aligned_cols=77 Identities=16% Similarity=0.226 Sum_probs=45.0 Q ss_pred HCCCCCCEEEECCCCCCCCCCCEEEEEEEEE----C--CCCEEE-ECCCCCC-------------CCCH-------HHHH Q ss_conf 4248982799717778985445079999841----8--942899-7465778-------------5706-------7767 Q gi|254780636|r 16 AARYVDAIAVDTETLGLMPRRDRLCIVQLSP----G--DGTVDI-IRIAAGQ-------------KNAP-------NLVG 68 (207) Q Consensus 16 ~l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~----~--~~~~~l-~~~~~~~-------------~~~~-------~L~~ 68 (207) ....++.||+-.-|+|..|-.+++.-|---+ + .+.|+- +...... ..+| .|.+ T Consensus 42 ~~~~~p~va~~~~ttgihpst~r~v~~da~~~da~Ge~~~~f~~~lNPg~DPGP~HiHGLT~e~L~gAP~F~~ia~~l~~ 121 (377) T PRK05601 42 AIEDAPFVAVSIQSSGIHPSTSRMVTIDAVTLSADGEPVETFHAVLNPKTDPGPFHLHGVTEEEFASAKRFGQILKSLDR 121 (377) T ss_pred HHHCCCEEEEEEEECCCCCCCCEEEEEEEEEECCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHCCCCCHHHHHHHHHH T ss_conf 23128727999885575777550588878886478972234577127999999810137899997068428999999999 Q ss_pred HHHCCCCCEEECCCCHHHHHHHHHHH Q ss_conf 77323640230100001688865420 Q gi|254780636|r 69 MLVDEKREKIFHYGRFDIAVLFYTFG 94 (207) Q Consensus 69 ll~d~~i~KV~Hn~~~D~~~L~~~~g 94 (207) +| +.. +-|.||+.||..||-..+. T Consensus 122 lL-dGR-tLVaHNa~fdw~Fia~Ea~ 145 (377) T PRK05601 122 LI-DGR-TLLLHNAPRTWGFIVSEAK 145 (377) T ss_pred HH-CCC-EEEEECCCCHHHHHHHHHH T ss_conf 84-586-7998356020889999999 No 125 >pfam11567 PfUIS3 Plasmodium falciparum UIS3 membrane protein. UIS3 is a membrane protein essential for sporozoite development in infected hepatocytes. This family is 130-229 of the Plasmodium falciparum UIS3 protein which is compact and has an all alpha-helical structure.PfUIS3(130-229) interacts with lipids, phospholipid lysosomes, the human liver fatty acid-binding protein and with the lipid phosphatidylethanolamine. The interaction with liver fatty acid-binding protein provides the parasite with a method to import essential fatty acids/lipids during rapid growth phases of sporozoites. Probab=37.02 E-value=29 Score=14.95 Aligned_cols=43 Identities=16% Similarity=0.037 Sum_probs=27.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 432368999986459999999999999997491899999996400 Q gi|254780636|r 144 DLSDEQLQYAASDVVHLHALRLQFTEKLQRLGRSDLATSCCNFLM 188 (207) Q Consensus 144 pLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~~~~~~~~e~~~l~ 188 (207) .||.+|..|-..|-.|+.++-+.|.+. +.-...-+++.+.+|. T Consensus 41 ~LSn~Qq~y~indydYlrkiVQ~L~E~--rnv~lSr~QEd~AVL~ 83 (101) T pfam11567 41 HLSNDQKDSHVNNYEYLQKFVQGLQEN--RNISLSKYQENKAVLD 83 (101) T ss_pred HCCCHHHHHHHCHHHHHHHHHHHHHHC--CCCCHHHHHHHHHHHH T ss_conf 437688988735199999999988861--3633888877688852 No 126 >TIGR01720 NRPS-para261 non-ribosomal peptide synthase domain TIGR01720; InterPro: IPR010060 This entry represents a region of 171 amino acids long and contains three very highly conserved regions. At the N terminus is a nearly invariant lysine (position 11) followed by xxxRxxPxxGxGYG in which the proline and the first glycine are invariant. This is followed approximately 22 residues later by the motif FNYLG. Near the C terminus of the domain is the sequence TxSD where the serine and aspartate are nearly invariant. This domain appears to be located immediately downstream from a condensation domain (IPR001242 from INTERPRO), and is followed primarily by the end of the molecule or another condensation domain (or, in a few cases, it is followed by IPR000873 from INTERPRO, an AMP-binding module). The converse is not true, IPR001242 from INTERPRO domains are not always followed by this domain, which implicates this domain in possible post-condensation modification events. This region has so far been found in a variety of peptide synthases including Bacitracin synthetase 1 Gramicidin S synthetase 1 Linear gramicidin synthetase subunit A Surfactin synthetase subunit 1 Tyrocidine synthetase 1 . Probab=36.67 E-value=30 Score=14.92 Aligned_cols=53 Identities=19% Similarity=0.362 Sum_probs=38.9 Q ss_pred CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHH Q ss_conf 00000457898865211000000887776542002467520036565443236899 Q gi|254780636|r 96 RVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQLQ 151 (207) Q Consensus 96 ~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi~ 151 (207) .+...|++.-...|.+.+ ...|..+++|..+.+ ..+.+=||++...|+.+|+. T Consensus 117 y~~~~f~~~ti~~L~~~f--~~~L~~~i~HC~~~~-~~e~TPSDf~~~~L~~~eLd 169 (171) T TIGR01720 117 YSRELFSEETIEQLADSF--KEALEAIIEHCAEKE-GGELTPSDFSLKDLTQDELD 169 (171) T ss_pred ECHHHHCHHHHHHHHHHH--HHHHHHHHHHHCCCC-CCCCCCCCCCCCCCCHHHHH T ss_conf 460410468999999999--999999999850888-88678656888886789784 No 127 >KOG0217 consensus Probab=32.46 E-value=35 Score=14.51 Aligned_cols=20 Identities=20% Similarity=0.180 Sum_probs=10.9 Q ss_pred CCEEEECCCCCCCCCCCEEEEEEEEEC Q ss_conf 827997177789854450799998418 Q gi|254780636|r 21 DAIAVDTETLGLMPRRDRLCIVQLSPG 47 (207) Q Consensus 21 ~~iaiDtEt~~l~~~~~~l~LiQl~~~ 47 (207) ++.-+||+|- ++-..|+|+. T Consensus 393 gvc~iDtstg-------e~~~~eF~DD 412 (1125) T KOG0217 393 GVCFIDTSTG-------EINIGEFCDD 412 (1125) T ss_pred EEEEEECCCC-------EEEEEEEECC T ss_conf 5899983555-------3889874050 No 128 >TIGR02238 recomb_DMC1 meiotic recombinase Dmc1; InterPro: IPR011940 This entry describes DMC1, a subfamily of a larger family of DNA repair and recombination proteins. It is eukaryotic only and most closely related to eukaryotic RAD51. It also resembles archaeal RadA (IPR011938 from INTERPRO) and RadB (IPR011939 from INTERPRO) and bacterial RecA. It has been characterised for human as a recombinase active only in meiosis .; GO: 0003677 DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0007131 meiotic recombination, 0005634 nucleus. Probab=29.27 E-value=29 Score=14.98 Aligned_cols=19 Identities=26% Similarity=0.333 Sum_probs=9.5 Q ss_pred CCCHHHHCCCCCCEEEECC Q ss_conf 8885784248982799717 Q gi|254780636|r 10 DIPAECAARYVDAIAVDTE 28 (207) Q Consensus 10 dl~~~~~l~~~~~iaiDtE 28 (207) +||....-...+++.+||| T Consensus 121 QLP~~~~g~~GKv~yIDTE 139 (314) T TIGR02238 121 QLPREMGGGEGKVAYIDTE 139 (314) T ss_pred HCCCCCCCCCCEEEEEECC T ss_conf 0631257998428998178 No 129 >TIGR02236 recomb_radA DNA repair and recombination protein RadA; InterPro: IPR011938 This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO), and archaeal RadB (IPR011939 from INTERPRO). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=28.77 E-value=30 Score=14.86 Aligned_cols=28 Identities=29% Similarity=0.274 Sum_probs=18.2 Q ss_pred CCCCCCEEEECCCCCCCCCCCEEEEEEEEE--CCC Q ss_conf 248982799717778985445079999841--894 Q gi|254780636|r 17 ARYVDAIAVDTETLGLMPRRDRLCIVQLSP--GDG 49 (207) Q Consensus 17 l~~~~~iaiDtEt~~l~~~~~~l~LiQl~~--~~~ 49 (207) |...++|-|||| +|+||+ +..|++- ..+ T Consensus 138 L~gp~av~IDTE----nTFRPE-RI~qmA~GL~~g 167 (333) T TIGR02236 138 LEGPKAVYIDTE----NTFRPE-RIEQMAKGLARG 167 (333) T ss_pred CCCCEEEEEECC----CCCCHH-HHHHHHHHHHHH T ss_conf 678768998559----798625-799999863111 No 130 >pfam03175 DNA_pol_B_2 DNA polymerase type B, organellar and viral. Like pfam00136, members of this family are also DNA polymerase type B proteins. Those included here are found in plant and fungal mitochondria, and in viruses. Probab=28.53 E-value=41 Score=14.11 Aligned_cols=31 Identities=26% Similarity=0.470 Sum_probs=25.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 4323689999864599999999999999974 Q gi|254780636|r 144 DLSDEQLQYAASDVVHLHALRLQFTEKLQRL 174 (207) Q Consensus 144 pLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~ 174 (207) .+.++-+.|+..||..|.+++..+...+.+. T Consensus 122 ~~~~~~~~Yc~~DV~iL~~v~~~f~~~~~~~ 152 (452) T pfam03175 122 DIRKETIEYCKNDVLVLAKLLISFKNFIYEE 152 (452) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 6299999999999999999999999999998 No 131 >pfam09210 DUF1957 Domain of unknown function (DUF1957). This domain is found in a set of hypothetical bacterial proteins. Its exact function has not, as yet, been defined. Probab=23.35 E-value=51 Score=13.55 Aligned_cols=41 Identities=22% Similarity=0.213 Sum_probs=28.5 Q ss_pred CEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 20036565443236899998645999999999999999749 Q gi|254780636|r 135 QQSSDWSADDLSDEQLQYAASDVVHLHALRLQFTEKLQRLG 175 (207) Q Consensus 135 ~q~SdW~~rpLs~~Qi~YAA~Dv~~l~~L~~~l~~~L~~~~ 175 (207) .|.|||.---=+.....||..=+.-=+.-+..|...|.... T Consensus 36 aqSSDW~Fi~~~gt~~eYA~~R~~~Hl~rF~~L~~~l~~~~ 76 (100) T pfam09210 36 AQSSDWAFILTTGTAVEYARERIKEHLKRFWRLYEALEAGE 76 (100) T ss_pred HHHCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 98345308866886689999889999999999999987399 No 132 >TIGR01744 XPRTase xanthine phosphoribosyltransferase; InterPro: IPR010079 This entry represent a xanthine-specific phosphoribosyltransferase of Bacillus subtilis and closely related proteins from other species, mostly from other Gram-positive bacteria. The adjacent gene is a xanthine transporter; Bacillus subtilis can import xanthine for the purine salvage pathway or for catabolism to obtain nitrogen.; GO: 0016763 transferase activity transferring pentosyl groups, 0043101 purine salvage, 0046110 xanthine metabolic process. Probab=23.23 E-value=51 Score=13.53 Aligned_cols=42 Identities=26% Similarity=0.271 Sum_probs=23.3 Q ss_pred ECCCCEEEECCCCCCCCCHHHHHHHHCCC---------CCEEECCCCHHHH Q ss_conf 18942899746577857067767773236---------4023010000168 Q gi|254780636|r 46 PGDGTVDIIRIAAGQKNAPNLVGMLVDEK---------REKIFHYGRFDIA 87 (207) Q Consensus 46 ~~~~~~~l~~~~~~~~~~~~L~~ll~d~~---------i~KV~Hn~~~D~~ 87 (207) ..|+..+++|+.....-+..|.++.++.+ |+|=|.|+|..+. T Consensus 116 ~~D~vl~IDDFLA~G~a~~gL~~i~~qA~A~~~G~G~viEK~Fq~Gr~~L~ 166 (191) T TIGR01744 116 DQDRVLIIDDFLANGDAARGLVDIAEQAGAKIAGIGIVIEKSFQNGRQELE 166 (191) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCHHHHH T ss_conf 998899995134666167889999986199588544799814578469998 No 133 >TIGR02242 tail_TIGR02242 phage tail protein domain; InterPro: IPR011748 This entry describes a region of sequence similarity shared by a number of uncharacterised proteins in bacterial genomes, including Geobacter sulfurreducens PCA, Mesorhizobium loti, Streptomyces coelicolor A3(2), Gloeobacter violaceus PCC 7421, and Myxococcus xanthus. In all cases, the genomic region resembles a phage tail region, based on tentative identifications of neighbouring genes. A region of this domain resembles a region of IPR006521 from INTERPRO, another phage tail protein model.. Probab=22.98 E-value=52 Score=13.50 Aligned_cols=41 Identities=20% Similarity=0.324 Sum_probs=27.7 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHCCCCCCCC---------EECCCCC--CCC Q ss_conf 9886521100000088777654200246752---------0036565--443 Q gi|254780636|r 105 IASRLTRTYTNQHGLKDNLKELLGINISKAQ---------QSSDWSA--DDL 145 (207) Q Consensus 105 ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~---------q~SdW~~--rpL 145 (207) +|..+.+.++-+.||..+++-|+|+++.... ..|.|+. ++| T Consensus 83 rA~~~yr~rGTr~GL~~~~~l~~G~~~~~~~~~~~d~~~~g~~~~g~~~~~L 134 (138) T TIGR02242 83 RAVDLYRVRGTRKGLRELIRLYLGVEPRIVEPFEADKLVLGEDEFGVGDRVL 134 (138) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCHHHHHHHHHHHHHHCCCCCCC T ss_conf 9999998402488999999987388600017776435554121204545467 No 134 >TIGR01074 rep ATP-dependent DNA helicase Rep; InterPro: IPR005752 RepA hexameric DNA helicase contain ATP-binding domains similar to those seen in monomeric helicases but which are arranged in a ring. There is compelling evidence to suggest that a single ssDNA molecule passes through the centre of the hexameric ring. Activity of the enzyme is based upon two separate but coupled activities, ssDNA translocation and duplex destabilisation, and is driven by energy derived from the continuous ATP-binding and hydrolysis events that take place in the active-site cleft. The resulting conformational changes that accompany these events underpin the coupling process and allow the helicase to translocate along the DNA, destabilizing the duplex and separating the two strands in an active process ; GO: 0004003 ATP-dependent DNA helicase activity, 0006268 DNA unwinding during replication, 0005737 cytoplasm. Probab=22.51 E-value=53 Score=13.45 Aligned_cols=101 Identities=21% Similarity=0.227 Sum_probs=64.6 Q ss_pred HHHHHHHCCCCCE----EECCCCHHHHHHH-HHHHCCCC-CHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC------ Q ss_conf 7767773236402----3010000168886-54200000-0045789886521100000088777654200246------ Q gi|254780636|r 65 NLVGMLVDEKREK----IFHYGRFDIAVLF-YTFGVRVR-PVFCTKIASRLTRTYTNQHGLKDNLKELLGINIS------ 132 (207) Q Consensus 65 ~L~~ll~d~~i~K----V~Hn~~~D~~~L~-~~~gi~~~-~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ld------ 132 (207) .+.+.|..++..+ -||+...++---. .+.|..++ .+||-.=+ .+|.+++++..++ T Consensus 67 RVA~~L~~~~~~GL~isTFH~LGL~Ii~~E~~~lG~K~nFSlFD~~D~-------------~all~eL~~~~~~~d~~~l 133 (677) T TIGR01074 67 RVAKTLGKGQAKGLTISTFHTLGLKIIRREHNALGLKSNFSLFDETDQ-------------LALLKELLEGELKADKDLL 133 (677) T ss_pred HHHHHCCCCCCCCCEEECCHHHHHHHHHHHHHHCCCCCCCCCCCHHHH-------------HHHHHHHHHHHHCCCHHHH T ss_conf 998522654558544752057338999999986488999642067889-------------9999987523311424899 Q ss_pred -C-CCEECCCCCCCCCHHHHHHHHHHHHHH--HHHHHHHHHHHHHCCCHH Q ss_conf -7-520036565443236899998645999--999999999999749189 Q gi|254780636|r 133 -K-AQQSSDWSADDLSDEQLQYAASDVVHL--HALRLQFTEKLQRLGRSD 178 (207) Q Consensus 133 -K-~~q~SdW~~rpLs~~Qi~YAA~Dv~~l--~~L~~~l~~~L~~~~~~~ 178 (207) + ..+-|+|-..=+|++|..-+|.+.+.- -.+|..-...|++-|-.+ T Consensus 134 ~~~~~~IS~WKNdl~tP~qa~~~A~~~~e~~~A~~Y~~Y~~~L~~yNalD 183 (677) T TIGR01074 134 DKLKSTISNWKNDLLTPEQALASARNEREQTFAHVYARYQATLRAYNALD 183 (677) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 99999863654114790899998517668999999999999987707754 No 135 >pfam09281 Taq-exonuc Taq polymerase, exonuclease. Members of this family are found in prokaryotic Taq DNA polymerase, where they assume a ribonuclease H-like motif. The domain confers 5'-3' exonuclease activity to the polymerase. Probab=22.25 E-value=54 Score=13.42 Aligned_cols=58 Identities=16% Similarity=0.166 Sum_probs=39.2 Q ss_pred CCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCEECCCCCCCCCHHHH Q ss_conf 000168886542000000045789886521100000088777654200246752003656544323689 Q gi|254780636|r 82 GRFDIAVLFYTFGVRVRPVFCTKIASRLTRTYTNQHGLKDNLKELLGINISKAQQSSDWSADDLSDEQL 150 (207) Q Consensus 82 ~~~D~~~L~~~~gi~~~~i~DT~ias~l~~~~~~~~~L~~L~~~~lg~~ldK~~q~SdW~~rpLs~~Qi 150 (207) +-.|+.++.-.-|+.+.+=-|-|+.+++++.. .-.....++||+|-+ |...+-+.+++ T Consensus 69 ~AKdLAvla~~eG~~v~PGDDPmLlAYLlDPs--NT~p~~varRY~gge---------W~eDa~~RA~~ 126 (138) T pfam09281 69 LAKALAAHLSVRGTVVEPGDDPLLVAYLLDPA--NTNMPGVARRYLGGE---------WPEDAATRAAI 126 (138) T ss_pred HHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCC--CCCCHHHHHHHCCCC---------CCCCHHHHHHH T ss_conf 99999999870266569998602787650856--788215667623888---------87567888999 Done!