Query gi|254780661|ref|YP_003065074.1| exonuclease I [Candidatus Liberibacter asiaticus str. psy62] Match_columns 471 No_of_seqs 170 out of 1439 Neff 8.2 Searched_HMMs 39220 Date Sun May 29 20:50:09 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780661.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK11779 sbcB exonuclease I; P 100.0 0 0 932.0 45.6 470 1-470 5-476 (477) 2 COG2925 SbcB Exonuclease I [DN 100.0 0 0 770.3 38.0 467 1-468 7-475 (475) 3 pfam08411 Exonuc_X-T_C Exonucl 100.0 0 0 399.7 26.0 261 206-467 1-268 (268) 4 cd06138 ExoI_N This subfamily 100.0 0 0 360.5 17.9 183 6-189 1-183 (183) 5 TIGR01405 polC_Gram_pos DNA po 100.0 2.8E-45 0 309.3 8.1 435 3-469 204-667 (1264) 6 COG2176 PolC DNA polymerase II 100.0 5.4E-40 1.4E-44 275.8 10.7 425 3-469 421-864 (1444) 7 PRK08074 bifunctional ATP-depe 100.0 9.9E-35 2.5E-39 242.0 19.2 171 1-198 1-172 (932) 8 PRK08517 DNA polymerase III su 100.0 1.1E-34 2.7E-39 241.8 18.2 169 2-199 69-237 (259) 9 TIGR01407 dinG_rel DnaQ family 100.0 8.4E-35 2.1E-39 242.5 16.1 168 4-198 1-169 (944) 10 PRK06309 DNA polymerase III su 100.0 6.6E-34 1.7E-38 236.7 19.0 167 2-196 1-167 (232) 11 PRK00448 polC DNA polymerase I 100.0 5E-35 1.3E-39 243.9 12.0 169 3-198 418-586 (1436) 12 PRK06807 DNA polymerase III su 100.0 1.7E-33 4.3E-38 234.1 19.0 187 2-216 7-194 (313) 13 PRK07246 bifunctional ATP-depe 100.0 1.9E-33 4.9E-38 233.7 18.3 169 1-198 5-173 (820) 14 PRK06310 DNA polymerase III su 100.0 3.8E-33 9.6E-38 231.8 19.3 209 3-254 7-215 (250) 15 PRK07883 hypothetical protein; 100.0 2.2E-33 5.7E-38 233.3 17.8 171 3-198 24-194 (575) 16 PRK07740 hypothetical protein; 100.0 1.9E-32 4.9E-37 227.3 19.6 168 3-198 57-225 (240) 17 PRK07942 DNA polymerase III su 100.0 1E-31 2.5E-36 222.7 19.0 184 3-207 6-192 (234) 18 PRK09145 DNA polymerase III su 100.0 6.9E-30 1.8E-34 210.9 17.8 172 2-195 28-201 (203) 19 PRK07247 DNA polymerase III su 100.0 2.9E-29 7.5E-34 206.8 18.3 169 2-198 4-172 (195) 20 PRK06063 DNA polymerase III su 100.0 2.7E-29 7E-34 207.0 17.2 167 3-197 15-181 (313) 21 PRK06631 consensus 100.0 5E-29 1.3E-33 205.4 17.5 171 4-198 5-180 (229) 22 cd06136 TREX1_2 Three prime re 100.0 3.2E-29 8.2E-34 206.6 16.3 159 5-189 1-175 (177) 23 cd06131 DNA_pol_III_epsilon_Ec 100.0 3.8E-28 9.8E-33 199.6 18.6 161 6-191 2-166 (167) 24 smart00479 EXOIII exonuclease 100.0 3.6E-28 9.2E-33 199.8 17.3 167 4-196 1-168 (169) 25 PRK06195 DNA polymerase III su 100.0 3.1E-28 7.9E-33 200.3 16.9 162 4-194 2-163 (309) 26 PRK06022 consensus 100.0 6.7E-28 1.7E-32 198.1 17.0 168 5-198 3-174 (234) 27 cd06134 RNaseT RNase T is a DE 100.0 1.5E-27 3.7E-32 195.9 18.3 170 4-194 6-188 (189) 28 PRK05711 DNA polymerase III su 100.0 2E-27 5E-32 195.1 17.7 169 5-197 6-178 (240) 29 cd06130 DNA_pol_III_epsilon_li 100.0 2E-27 5.2E-32 195.0 17.6 154 5-188 1-154 (156) 30 PRK08858 consensus 100.0 3.6E-27 9.1E-32 193.4 18.1 172 3-198 10-190 (246) 31 PRK07009 consensus 100.0 5.7E-27 1.4E-31 192.1 18.1 166 5-195 3-172 (241) 32 pfam00929 Exonuc_X-T Exonuclea 100.0 2.7E-27 7E-32 194.2 16.1 161 5-188 1-162 (162) 33 cd06133 ERI-1_3'hExo_like This 100.0 1.5E-26 3.8E-31 189.4 18.7 168 5-191 1-175 (176) 34 PRK06859 consensus 99.9 9.9E-26 2.5E-30 184.1 19.2 168 3-195 5-180 (253) 35 PRK05168 ribonuclease T; Provi 99.9 2.2E-26 5.6E-31 188.4 15.7 174 5-199 19-205 (213) 36 PRK08816 consensus 99.9 1E-25 2.7E-30 184.0 18.9 165 4-192 2-170 (244) 37 PRK07982 consensus 99.9 6.6E-26 1.7E-30 185.3 17.8 172 2-197 5-184 (243) 38 cd06127 DEDDh DEDDh exonucleas 99.9 5.5E-26 1.4E-30 185.8 17.0 158 6-189 1-159 (159) 39 TIGR01406 dnaQ_proteo DNA poly 99.9 1.6E-25 4.1E-30 182.8 14.7 190 5-217 2-202 (246) 40 PRK09146 DNA polymerase III su 99.9 1.1E-24 2.7E-29 177.5 17.1 175 3-191 47-223 (239) 41 PRK07983 exodeoxyribonuclease 99.9 1.5E-23 3.7E-28 170.2 15.1 192 5-254 2-199 (219) 42 TIGR00573 dnaq exonuclease, DN 99.9 4.3E-23 1.1E-27 167.2 13.4 174 3-199 7-183 (228) 43 COG0847 DnaQ DNA polymerase II 99.9 6.7E-22 1.7E-26 159.5 16.0 165 4-194 14-181 (243) 44 PRK09182 DNA polymerase III su 99.9 5.4E-21 1.4E-25 153.7 12.7 172 4-208 38-215 (293) 45 cd06125 DnaQ_like_exo The DnaQ 99.8 1.9E-19 4.9E-24 143.7 11.5 116 6-134 1-117 (130) 46 PRK07748 sporulation inhibitor 99.8 7.5E-18 1.9E-22 133.5 18.2 169 2-193 3-178 (205) 47 cd06135 Orn Oligoribonuclease 99.7 4.5E-17 1.1E-21 128.5 12.5 163 5-194 1-171 (173) 48 PRK05359 oligoribonuclease; Pr 99.6 2.7E-15 7E-20 117.0 11.5 165 1-194 1-174 (181) 49 cd06144 REX4_like This subfami 99.6 2.9E-15 7.3E-20 116.9 11.6 148 6-189 1-152 (152) 50 PRK06722 exonuclease; Provisio 99.6 5.3E-15 1.4E-19 115.2 11.0 137 34-190 1-137 (242) 51 TIGR01298 RNaseT ribonuclease 99.5 1.9E-13 4.9E-18 105.1 9.3 173 6-199 11-197 (201) 52 PRK05601 DNA polymerase III su 99.4 1.5E-12 3.9E-17 99.4 11.4 158 4-184 47-226 (377) 53 COG1949 Orn Oligoribonuclease 99.4 9E-12 2.3E-16 94.4 11.7 159 1-192 4-175 (184) 54 cd06149 ISG20 Interferon (IFN) 99.1 1.6E-09 4.1E-14 79.9 10.8 151 6-189 1-157 (157) 55 cd05782 DNA_polB_like1_exo A s 99.0 1.2E-08 3.2E-13 74.2 13.2 115 14-170 45-169 (208) 56 cd06137 DEDDh_RNase This group 99.0 4.6E-09 1.2E-13 77.0 10.4 150 6-188 1-160 (161) 57 KOG3242 consensus 98.9 4.4E-08 1.1E-12 70.7 11.5 164 1-192 24-196 (208) 58 cd05781 DNA_polB_B3_exo The 3' 98.8 8.8E-08 2.2E-12 68.8 11.7 125 3-172 3-146 (188) 59 cd06139 DNA_polA_I_Ecoli_like_ 98.8 4.9E-07 1.3E-11 64.0 15.0 145 2-196 4-171 (193) 60 COG5018 KapD Inhibitor of the 98.8 3.6E-08 9.1E-13 71.3 7.8 171 2-193 3-183 (210) 61 cd05785 DNA_polB_like2_exo A s 98.7 5.8E-07 1.5E-11 63.5 13.4 129 4-173 10-171 (207) 62 KOG0542 consensus 98.7 5.9E-07 1.5E-11 63.4 13.4 167 4-193 57-240 (280) 63 cd05779 DNA_polB_epsilon_exo T 98.7 2.4E-07 6.2E-12 66.0 11.3 150 4-172 3-170 (204) 64 KOG4793 consensus 98.7 2.2E-07 5.7E-12 66.2 9.2 179 2-192 12-215 (318) 65 cd06145 REX1_like This subfami 98.6 6.6E-07 1.7E-11 63.1 10.2 144 8-189 3-150 (150) 66 pfam04857 CAF1 CAF1 family rib 98.6 5.3E-06 1.3E-10 57.3 14.6 173 3-189 22-234 (235) 67 KOG2249 consensus 98.6 2.1E-06 5.3E-11 59.9 12.0 162 4-204 106-271 (280) 68 COG3359 Predicted exonuclease 98.6 2.7E-06 6.8E-11 59.2 12.6 150 3-203 98-248 (278) 69 cd05160 DNA_polB_exo The 3'-5' 98.5 1.6E-06 4.1E-11 60.6 10.4 141 6-170 2-162 (200) 70 cd06126 DEDDy DEDDy exonucleas 98.5 7.1E-06 1.8E-10 56.5 12.3 130 6-171 2-143 (181) 71 PTZ00315 2'-phosphotransferase 98.4 2.9E-05 7.4E-10 52.6 13.4 173 3-192 56-251 (570) 72 cd05780 DNA_polB_Kod1_like_exo 98.3 1.5E-05 3.8E-10 54.4 11.3 127 4-167 4-151 (195) 73 cd05777 DNA_polB_delta_exo The 98.3 2.9E-05 7.3E-10 52.6 12.7 137 3-167 7-178 (230) 74 pfam10108 Exon_PolB Predicted 98.2 3.2E-05 8.2E-10 52.3 11.3 116 16-172 5-133 (211) 75 cd06128 DNA_polA_exo The 3'-5' 98.2 3.7E-05 9.5E-10 51.9 11.6 139 5-191 3-164 (166) 76 PRK05762 DNA polymerase II; Re 98.2 8.7E-05 2.2E-09 49.5 12.6 149 4-191 156-346 (785) 77 PTZ00166 DNA polymerase delta 98.1 3.7E-05 9.3E-10 51.9 10.3 162 4-192 291-509 (1081) 78 cd00007 35EXOc 3'-5' exonuclea 98.1 0.00012 3.1E-09 48.5 12.8 133 4-194 1-154 (155) 79 pfam03104 DNA_pol_B_exo DNA po 98.1 1.5E-05 3.9E-10 54.4 8.0 97 4-113 87-190 (254) 80 KOG0304 consensus 98.1 0.00012 3E-09 48.7 11.9 177 4-194 25-238 (239) 81 PRK06887 consensus 98.1 0.0002 5.2E-09 47.2 12.7 158 3-195 356-543 (954) 82 smart00486 POLBc DNA polymeras 98.0 0.00042 1.1E-08 45.1 13.5 164 1-191 1-220 (471) 83 pfam01612 3_5_exonuc 3'-5' exo 98.0 0.00055 1.4E-08 44.4 14.0 135 3-195 20-171 (172) 84 KOG2248 consensus 98.0 0.00014 3.6E-09 48.2 10.8 156 3-196 216-376 (380) 85 cd05784 DNA_polB_II_exo The 3' 98.0 0.00015 3.9E-09 47.9 10.3 128 3-167 3-149 (193) 86 cd05778 DNA_polB_zeta_exo The 97.9 0.00013 3.3E-09 48.4 9.4 105 4-111 5-118 (231) 87 PRK05755 DNA polymerase I; Pro 97.9 0.0011 2.7E-08 42.6 13.8 142 3-197 319-482 (889) 88 cd06143 PAN2_exo PAN2 is the c 97.9 0.0001 2.6E-09 49.1 8.5 148 7-189 2-174 (174) 89 PRK07456 consensus 97.8 0.0023 6E-08 40.3 14.1 147 3-198 382-548 (975) 90 cd05783 DNA_polB_B1_exo The 3' 97.8 0.00017 4.4E-09 47.6 8.1 103 3-118 5-115 (204) 91 cd06142 RNaseD_exo Ribonucleas 97.8 0.003 7.5E-08 39.7 14.2 132 2-196 11-161 (178) 92 smart00474 35EXOc 3'-5' exonuc 97.7 0.0018 4.6E-08 41.1 12.5 132 4-194 22-170 (172) 93 PRK07556 consensus 97.7 0.00049 1.2E-08 44.7 9.4 153 3-196 387-562 (977) 94 cd06140 DNA_polA_I_Bacillus_li 97.6 0.006 1.5E-07 37.7 13.5 137 2-197 2-158 (178) 95 PRK07997 consensus 97.5 0.0037 9.3E-08 39.1 11.4 145 3-195 349-516 (928) 96 cd05776 DNA_polB_alpha_exo The 97.5 0.0044 1.1E-07 38.6 11.6 110 4-116 4-124 (234) 97 PRK08835 consensus 97.5 0.0058 1.5E-07 37.8 12.2 145 3-195 351-518 (931) 98 PRK07625 consensus 97.4 0.0088 2.2E-07 36.6 12.6 144 3-195 344-510 (922) 99 PRK08786 consensus 97.3 0.013 3.2E-07 35.6 12.2 144 4-195 349-515 (927) 100 PRK05797 consensus 97.1 0.021 5.2E-07 34.3 13.0 138 4-196 320-460 (869) 101 PRK08434 consensus 96.9 0.031 7.9E-07 33.1 12.4 144 3-197 318-481 (887) 102 KOG1798 consensus 96.9 0.0032 8.1E-08 39.5 6.4 176 4-201 247-458 (2173) 103 COG0417 PolB DNA polymerase el 96.7 0.046 1.2E-06 32.0 10.7 155 4-192 155-351 (792) 104 cd06147 Rrp6p_like_exo Yeast R 96.1 0.07 1.8E-06 30.8 9.0 131 2-195 23-172 (192) 105 COG0749 PolA DNA polymerase I 95.8 0.14 3.7E-06 28.8 10.6 133 6-196 25-181 (593) 106 PRK05761 DNA polymerase I; Rev 95.2 0.22 5.7E-06 27.6 10.2 114 73-191 196-329 (777) 107 PRK07898 consensus 95.0 0.26 6.6E-06 27.2 13.8 137 3-197 337-492 (902) 108 KOG4793 consensus 94.5 0.027 7E-07 33.5 2.5 166 9-195 116-291 (318) 109 COG5228 POP2 mRNA deadenylase 94.4 0.18 4.5E-06 28.2 6.5 175 4-193 43-251 (299) 110 cd06129 RNaseD_like The RNase 92.1 0.83 2.1E-05 23.9 11.3 109 3-172 15-126 (159) 111 COG0349 Rnd Ribonuclease D [Tr 92.0 0.87 2.2E-05 23.8 10.2 131 3-196 17-167 (361) 112 pfam11074 DUF2779 Domain of un 91.8 0.58 1.5E-05 24.9 5.8 112 6-144 1-121 (127) 113 cd06141 WRN_exo WRN is a uniqu 91.1 1.1 2.7E-05 23.2 12.4 86 80-190 63-167 (168) 114 PRK08076 consensus 90.8 1.1 2.9E-05 23.1 12.5 133 5-197 316-468 (877) 115 KOG0969 consensus 89.8 0.02 5E-07 34.4 -3.2 90 5-107 276-371 (1066) 116 TIGR03491 RecB family nuclease 87.1 2 5.2E-05 21.4 11.8 17 247-264 281-297 (457) 117 PRK05929 consensus 80.5 3.9 9.9E-05 19.6 13.3 93 4-144 311-403 (870) 118 cd06146 mut-7_like_exo The mut 73.6 6 0.00015 18.4 8.9 104 78-190 70-190 (191) 119 PRK00047 glpK glycerol kinase; 63.9 9.6 0.00024 17.1 3.7 30 1-40 2-32 (498) 120 pfam06748 DUF1217 Protein of u 50.4 12 0.0003 16.6 2.1 39 369-407 10-48 (150) 121 TIGR01553 formate-DH-alph form 39.1 24 0.00061 14.6 2.4 94 13-111 238-352 (1043) 122 pfam06471 NSP11 NSP11. This re 35.8 27 0.00068 14.2 6.4 166 5-188 91-285 (594) 123 PRK03643 altronate oxidoreduct 33.6 29 0.00074 14.0 4.1 15 455-469 457-471 (482) 124 PRK08928 consensus 32.1 31 0.00078 13.9 10.1 89 79-196 363-451 (861) 125 KOG2481 consensus 30.3 33 0.00084 13.7 3.4 68 152-219 106-185 (570) 126 pfam08125 Mannitol_dh_C Mannit 29.8 34 0.00086 13.6 2.4 19 174-192 98-116 (245) 127 TIGR02267 TIGR02267 Myxococcus 28.7 35 0.00089 13.5 2.7 54 160-213 34-91 (123) 128 TIGR02438 catachol_actin catec 28.0 36 0.00092 13.4 2.3 27 27-57 183-209 (287) 129 pfam10866 DUF2704 Protein of u 25.7 40 0.001 13.1 6.6 79 173-262 91-169 (169) 130 TIGR00875 talC transaldolase, 24.7 41 0.0011 13.0 3.2 47 60-108 22-72 (216) 131 KOG0968 consensus 20.7 49 0.0013 12.5 5.1 55 61-116 724-779 (1488) 132 pfam11618 DUF3250 Protein of u 20.6 50 0.0013 12.5 2.9 10 53-62 21-30 (141) 133 TIGR01282 nifD nitrogenase mol 20.2 51 0.0013 12.5 4.0 27 92-118 214-240 (521) No 1 >PRK11779 sbcB exonuclease I; Provisional Probab=100.00 E-value=0 Score=931.99 Aligned_cols=470 Identities=56% Similarity=1.001 Sum_probs=457.3 Q ss_pred CCCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHH Q ss_conf 98837999752788987726048999999838972804002266217888778578989857998899607999799999 Q gi|254780661|r 1 MTNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSR 80 (471) Q Consensus 1 m~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~ 80 (471) |+.||+|||+||||++|.+|||+|+|||+||.+||++|+..+++|||.||++|+|+|++||||||+++...|+|+||+++ T Consensus 5 ~~~~f~fyD~ETtG~~~~~dqi~Qfa~i~td~~ln~i~~~~~~~c~~~~~~lP~p~A~lvt~itp~~~~~~~~~~~~~~~ 84 (477) T PRK11779 5 MQPTFLWHDYETFGANPALDRPAQFAGIRTDADLNIIGEPLVFYCKPADDYLPSPEAVLITGITPQEALEKGLPEAEFAA 84 (477) T ss_pred CCCCEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCEEEECCCCCCCCCCCHHHCCCCCHHHHHHCCCCHHHHHH T ss_conf 77857999453689998776501257789877537257741056447999899930431148899999866998799999 Q ss_pred HHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH Q ss_conf 99999511796599950557899999999985078532132568871001588886555653210024444454433108 Q gi|254780661|r 81 RIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKL 160 (471) Q Consensus 81 ~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL 160 (471) +|++.|++|++|+|||||++||++|+|++||||+++||.+.|+|||+|+|+|+++|++++++|+.|+||.+++|++|||| T Consensus 85 ~i~~~~~~~~t~~iGyNsi~FDee~~R~~fy~nl~dPY~~~~~ngNsR~Dll~~~r~~~~~~p~~i~~p~~~~G~~sfKL 164 (477) T PRK11779 85 RIHAEFSQPGTCILGYNNIRFDDEVTRYIFYRNFYDPYAREWQNGNSRWDLLDVVRACYALRPEGINWPENEDGLPSFKL 164 (477) T ss_pred HHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHCCCHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEH T ss_conf 99999668984898227754489999999998613457664248970579999999998748366116737999854449 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCEEE Q ss_conf 98998718876431013331899999999998642268999996238898997420015760799701010477875068 Q gi|254780661|r 161 QDLALANGIEHVNAHDAKADVYATLALVWLIREKKPKLFEYLYDYRNKNQLRKLIDIQNMTPLVHVSGMFGASRSNTALI 240 (471) Q Consensus 161 ~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~~~~~l~~~~K~~v~~~l~~~~~~~~~~v~~~~~~~~~~~~~v 240 (471) |+||++|||+|++||||++||+||++|+|+|++++|++|++++.+++|+.|.++|+..+..|++||+++|++.++++++| T Consensus 165 e~la~aNgi~h~~AHDAl~Dv~aTi~laklIk~k~P~l~~~~l~~r~K~~v~~~i~~~~~~p~~~vs~~fg~~~~~~~~v 244 (477) T PRK11779 165 EHLTKANGIEHENAHDAMSDVYATIAMAKLIKQKQPKLFDYLFSLRSKRKVAALIDVPQMKPLVHVSGMFGAERGCTSWV 244 (477) T ss_pred HHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCHHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEE T ss_conf 99998559876554106889999999999998769699999998437999999986502670389821016666622487 Q ss_pred EEEECCCCCCCEEEEEECCCCCCHHCCCCHHHHHHHHHCCHHHH-HCCCCCCEEEEECCCCCEECCHHHCCHHHHHHCCC Q ss_conf 86302888774478710356872000799889998740651442-02477430576536787543755703134554035 Q gi|254780661|r 241 APVAWHPRYKDSVIGCNLSGDMRVFQDLDSVQLAKRLFTSHDEL-KGLVPVPIKEVHLNKCPILMPIDYCKKEHFERWGI 319 (471) Q Consensus 241 ~~~~~~p~~~n~~i~fDL~~Dp~~~l~l~~~el~~~l~~~~~~~-~~~~~~~lr~ik~Nk~Pil~~~~~~~~~~~~~~~i 319 (471) +|+|.||.|+|+++||||++||+++++|+.++|++.+++++.++ .|..|+|+|+|++||||+++|.+++..+..+++++ T Consensus 245 ~pl~~hP~n~N~~i~fDL~~DP~~ll~ls~~eL~~~l~t~~~~l~~~~~~~plk~I~iNK~Pil~p~~~l~~e~~~rl~i 324 (477) T PRK11779 245 APLAWHPTNKNAVIVCDLAGDPSPLLELDADTLRERLYTRRADLAEGELPVPLKLVHLNKCPVLAPAKTLRPEDAERLGI 324 (477) T ss_pred EEECCCCCCCCCEEEEECCCCHHHHHCCCHHHHHHHHCCCHHHHHCCCCCCCEEEEEECCCCEECCHHHCCHHHHHHCCC T ss_conf 63024788877658887357979986699999999971543221034677650675307888017657579435877099 Q ss_pred CHHHHHHHHHHHHHCHHHHHHHHHHHHCC-CCCCCCCHHHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCHHHHH Q ss_conf 88999998999761854688887443013-34433320454102468976622168897599889862120489837999 Q gi|254780661|r 320 DHKRCLENLTLLRQQTNLRDRFKAIYNKP-YTSSSQDVDSQLYDGFFADVDRQIMDRILRTAPEQLSTLNLPFSDRRLPE 398 (471) Q Consensus 320 ~~~~l~~r~~~l~~n~~~~~~~~~~~~~~-~~~~~~~~E~~IY~gF~s~~Dk~~~~~fh~a~~~~k~~i~~~f~D~Rl~~ 398 (471) +.+.+++|++.|++|++|.+++.+++.+. .+.++.+||++||+||+|++||.+|++||.++|+++.+++..|+|+||++ T Consensus 325 d~~~~~~ra~~ik~n~~f~~kv~~i~~~~~~~~~~~d~E~~IY~GF~s~~Dk~lm~~fh~a~~~el~~~~~~F~D~RL~e 404 (477) T PRK11779 325 DRAACLDNLKLLRQNPELREKVVAVFAEEEEFAPSDDVDAQLYDGFFSDADRRLMKIIRETEPENLAALDLTFDDPRLEE 404 (477) T ss_pred CHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCCCHHH T ss_conf 99999999999986989999999998602157899997899775799989999999998199999984045787821899 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 999996311810089999999999999750678889999999999965149988999999999999975278 Q gi|254780661|r 399 LFFRYRARNFPHTLSEKEKQDWLEHRKKMLTRSRIEEYKNKLQSLSGEYKGDEGKEGLINALYEYLQWVIPK 470 (471) Q Consensus 399 l~~R~i~~n~P~~Ls~~~~~~~~~~i~~rl~~~~i~~~~~ei~~l~~~~~~d~~~~~iL~~l~~yl~~l~~~ 470 (471) |++|++|||||++||++|+.+|+++|++||+++|+++++++|++|+.++..+++++.+|++|++|+++|++. T Consensus 405 LlfRy~ARnfP~~Ls~~E~~~w~~~~~~RL~~~tl~~~~~~id~L~~~~~~d~~k~~iL~~L~~Y~e~l~~~ 476 (477) T PRK11779 405 LLFRYRARNFPETLDDEEQQRWLEHRRQRLTDETLQQYAAKLEQLAQEYEDDEEKQALLKALYDYAEELVPT 476 (477) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 999999873963399999999999999985766199999999999866018999999999999999987301 No 2 >COG2925 SbcB Exonuclease I [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=770.33 Aligned_cols=467 Identities=54% Similarity=0.984 Sum_probs=456.5 Q ss_pred CCCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHH Q ss_conf 98837999752788987726048999999838972804002266217888778578989857998899607999799999 Q gi|254780661|r 1 MTNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSR 80 (471) Q Consensus 1 m~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~ 80 (471) |.-+|.|||+||+|.+|+.|+|-|+|+|+||.+||+|++..+++|+|..+++|+|+|.+||||||+.....|.++++++. T Consensus 7 ~~~tF~~yDYETfG~~Pa~DRPaQFAgIRTD~~~NiIgeP~~fyCkpsdDyLP~P~a~LITGITPQ~~~~~Gi~E~~F~~ 86 (475) T COG2925 7 KQPTFLFYDYETFGVHPALDRPAQFAGIRTDIEFNIIGEPIVFYCKPADDYLPQPGAVLITGITPQEAREKGINEAAFAA 86 (475) T ss_pred CCCCEEEEECHHCCCCCCCCCCHHHHEEECCCCCCCCCCCEEEEECCCCCCCCCCCCEEEECCCHHHHHHCCCCHHHHHH T ss_conf 89867997503317785666613541121034343247872798547545689987366625787889764898589999 Q ss_pred HHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH Q ss_conf 99999511796599950557899999999985078532132568871001588886555653210024444454433108 Q gi|254780661|r 81 RIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKL 160 (471) Q Consensus 81 ~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL 160 (471) .|+..|.++++|++||||+.||+++.|+.||||+++||.++|+|||||||+|+++|++|+++|+.|.||.+++|.||||| T Consensus 87 ~I~~~ls~P~Tcv~GYNniRfDDEvtRy~fyRNF~DPYa~sWqngNSRWDLLD~~RacyALRPeGI~Wp~n~dG~pSFkL 166 (475) T COG2925 87 RIHAELTQPNTCVLGYNNIRFDDEVTRYIFYRNFYDPYAWSWQNGNSRWDLLDVVRACYALRPEGINWPENDDGLPSFKL 166 (475) T ss_pred HHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHH T ss_conf 99998579981562355533326889888987227666676138984257999999997528556878867889751106 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCEEE Q ss_conf 98998718876431013331899999999998642268999996238898997420015760799701010477875068 Q gi|254780661|r 161 QDLALANGIEHVNAHDAKADVYATLALVWLIREKKPKLFEYLYDYRNKNQLRKLIDIQNMTPLVHVSGMFGASRSNTALI 240 (471) Q Consensus 161 ~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~~~~~l~~~~K~~v~~~l~~~~~~~~~~v~~~~~~~~~~~~~v 240 (471) |+|+++|||.|++||||++||+||++++|+++.++|.+|++++..|+|+.+.+.++...+.|++|||+.||..++++..| T Consensus 167 EhLtkANgieH~nAHdAmsDVyATIamAklvk~~QPrLfdy~f~~R~K~~~~~lid~~~~~PlvHVSgm~ga~r~~~s~V 246 (475) T COG2925 167 EHLTKANGIEHSNAHDAMSDVYATIAMAKLVKTAQPRLFDYLFQLRKKHKLLALIDIPNMQPLVHVSGMFGAERGNTSWV 246 (475) T ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCHHHHHHHHCCCCCCCEEEEECCCCHHHCCEEEE T ss_conf 87764256110005678899999999999987618479999998511889999846223572376411032420770599 Q ss_pred EEEECCCCCCCEEEEEECCCCCCHHCCCCHHHHHHHHHCCHHHH-HCCCCCCEEEEECCCCCEECCHHHCCHHHHHHCCC Q ss_conf 86302888774478710356872000799889998740651442-02477430576536787543755703134554035 Q gi|254780661|r 241 APVAWHPRYKDSVIGCNLSGDMRVFQDLDSVQLAKRLFTSHDEL-KGLVPVPIKEVHLNKCPILMPIDYCKKEHFERWGI 319 (471) Q Consensus 241 ~~~~~~p~~~n~~i~fDL~~Dp~~~l~l~~~el~~~l~~~~~~~-~~~~~~~lr~ik~Nk~Pil~~~~~~~~~~~~~~~i 319 (471) +|++.||.|+|.+|++||..||.++++|+.++|+..|++++.++ +|..+.|+|.|++||||++.|.+++.++...+++| T Consensus 247 ~PLaWHP~N~NalIv~DL~~Di~~Ll~l~a~~lr~rLyT~~~dL~eg~~~vP~KlvHiNkcPilap~ktL~~eda~RlGi 326 (475) T COG2925 247 LPLAWHPTNRNAVIVCDLAGDISPLLELDADTLRERLYTRKADLGEGELAVPVKLVHINKCPILAPAKTLRPEDADRLGI 326 (475) T ss_pred EEECCCCCCCCEEEEEECCCCHHHHHHHCHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCCEECCHHHCCHHHHHHCCC T ss_conf 85131677886389987668959998727899998734660114577656752798724787424054349656655087 Q ss_pred CHHHHHHHHHHHHHCHHHHHHHHHHHHCCC-CCCCCCHHHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCHHHHH Q ss_conf 889999989997618546888874430133-4433320454102468976622168897599889862120489837999 Q gi|254780661|r 320 DHKRCLENLTLLRQQTNLRDRFKAIYNKPY-TSSSQDVDSQLYDGFFADVDRQIMDRILRTAPEQLSTLNLPFSDRRLPE 398 (471) Q Consensus 320 ~~~~l~~r~~~l~~n~~~~~~~~~~~~~~~-~~~~~~~E~~IY~gF~s~~Dk~~~~~fh~a~~~~k~~i~~~f~D~Rl~~ 398 (471) +...|.++++.|+.++++++|+..++++.. +....+||.+||+|||++.|+..|+.++.++++.+..+...|.|+|+.+ T Consensus 327 dr~~c~~~~~~l~~~p~~rekv~~if~~e~~f~~~~dvd~~LYnGFf~~~Dr~~~~ilR~~~pe~La~~~~~f~D~Rl~~ 406 (475) T COG2925 327 DRQHCLDNLELLRENPDVREKVVAIFAEEEPFNPSDDVDAQLYNGFFSDADRAAMKILRETEPENLAALDITFVDKRLEE 406 (475) T ss_pred CHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHCCHHHHHHCCCCCCCHHHHH T ss_conf 69999999999850941999999998334667765555588871356877788999987288645665367755415899 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHC Q ss_conf 9999963118100899999999999997506788899999999999651499889999999999999752 Q gi|254780661|r 399 LFFRYRARNFPHTLSEKEKQDWLEHRKKMLTRSRIEEYKNKLQSLSGEYKGDEGKEGLINALYEYLQWVI 468 (471) Q Consensus 399 l~~R~i~~n~P~~Ls~~~~~~~~~~i~~rl~~~~i~~~~~ei~~l~~~~~~d~~~~~iL~~l~~yl~~l~ 468 (471) |.|||+|||||++|+..|+.+|.+++++||+.+ ...+...++.+..++..+.+++.+|+.+.+|.++|+ T Consensus 407 LlFrYRARnfp~tLt~~Eq~rW~~~rr~~l~~~-~~~~~~tl~~~~~~~~~d~~k~alL~~l~~y~~~l~ 475 (475) T COG2925 407 LLFRYRARNFPETLTDAEQQRWLEHRRQRLTPE-LQGYELTLQALVQEHSDDKEKLALLKALYEYAAELV 475 (475) T ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHH-HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHC T ss_conf 999988614865358899999999999853755-555676899999873025488999999999987429 No 3 >pfam08411 Exonuc_X-T_C Exonuclease C-terminal. This bacterial domain is found at the C-terminus of Exodeoxyribonuclease I/Exonuclease I (pfam00929), which is a single-strand specific DNA nuclease affecting recombination and expression pathways. The exonuclease I protein in E. coli is associated with DNA deoxyribophosphodiesterase (dRPase). Probab=100.00 E-value=0 Score=399.67 Aligned_cols=261 Identities=50% Similarity=0.886 Sum_probs=250.1 Q ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCEEEEEEECCCCCCCEEEEEECCCCCCHHCCCCHHHHHHHHHCCHHHH- Q ss_conf 3889899742001576079970101047787506886302888774478710356872000799889998740651442- Q gi|254780661|r 206 RNKNQLRKLIDIQNMTPLVHVSGMFGASRSNTALIAPVAWHPRYKDSVIGCNLSGDMRVFQDLDSVQLAKRLFTSHDEL- 284 (471) Q Consensus 206 ~~K~~v~~~l~~~~~~~~~~v~~~~~~~~~~~~~v~~~~~~p~~~n~~i~fDL~~Dp~~~l~l~~~el~~~l~~~~~~~- 284 (471) |+|+.|.+.++....+|++|||+.|++.++++++|+|+|.||.|+|.+|||||+.||+.+++++.++|+++++++++++ T Consensus 1 R~K~~v~~lid~~~~~P~vhvSg~~~a~~~~~s~v~Pla~HP~N~N~vI~~DL~~Dp~~ll~l~~e~lr~rlft~~~~l~ 80 (268) T pfam08411 1 RSKKAVAKLIDLVAMKPLVHVSGMFGAERGCTSWVLPLAWHPTNKNAVIVCDLAQDPSPLLDLSAEELRQRLYTKREDLA 80 (268) T ss_pred CCHHHHHHHHCCCCCCCEEEECCCCCCCCCCEEEEEECCCCCCCCCCEEEEECCCCHHHHHCCCHHHHHHHHCCCHHHCC T ss_conf 97889999858278996698768664114883788751358999886798886549099876999999999557642212 Q ss_pred HCCCCCCEEEEECCCCCEECCHHHCCHHHHHHCCCCHHHHHHHHHHHHHCHHHHHHHHHHHHCC-CCCCCCCHHHHHCCC Q ss_conf 0247743057653678754375570313455403588999998999761854688887443013-344333204541024 Q gi|254780661|r 285 KGLVPVPIKEVHLNKCPILMPIDYCKKEHFERWGIDHKRCLENLTLLRQQTNLRDRFKAIYNKP-YTSSSQDVDSQLYDG 363 (471) Q Consensus 285 ~~~~~~~lr~ik~Nk~Pil~~~~~~~~~~~~~~~i~~~~l~~r~~~l~~n~~~~~~~~~~~~~~-~~~~~~~~E~~IY~g 363 (471) .|..+.|+|.|++||||+++|.+++.++..++++++.+.|++|++.|++++++.+++.+++... .+++..+||++||+| T Consensus 81 ~~~~r~plk~I~lNkcPilap~~~L~~~~a~rL~iD~~~~~~~~~~l~~~~~l~~kl~~v~~~~~~~~~~~DvD~~LY~G 160 (268) T pfam08411 81 EGELRVPLKLVHLNKCPILAPAKTLRPEDAERLGIDREQCLKNLALLRQNPELREKLKEVFEEEREFPPSTDVDQQLYDG 160 (268) T ss_pred CCCCCCCCEEEECCCCCCCCCHHHCCHHHHHHHCCCHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCCCHHHHHHCC T ss_conf 44667773365516898327655469888988489899999999999869899999999981755788878878998725 Q ss_pred CCCHHHHHHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCC-----CHHHHHH Q ss_conf 68976622168897599889862120489837999999996311810089999999999999750678-----8899999 Q gi|254780661|r 364 FFADVDRQIMDRILRTAPEQLSTLNLPFSDRRLPELFFRYRARNFPHTLSEKEKQDWLEHRKKMLTRS-----RIEEYKN 438 (471) Q Consensus 364 F~s~~Dk~~~~~fh~a~~~~k~~i~~~f~D~Rl~~l~~R~i~~n~P~~Ls~~~~~~~~~~i~~rl~~~-----~i~~~~~ 438 (471) |||++|+.+|++||.++|+++..+...|+|+||++|++||+|||||++||++|+.+|+++|++||+++ ++++|++ T Consensus 161 F~s~~Dk~~~~~ir~~~p~~L~~~~~~F~D~RL~eLlfRyrARn~P~tLs~~E~~~W~~~~~~rL~~~~~~~ltl~~y~~ 240 (268) T pfam08411 161 FFSDADKRLMEIIRETSPEQLAELELNFDDKRLPELLFRYRARNFPETLSEEEQQRWQEYCQQRLLDPAGGALTLEDYEQ 240 (268) T ss_pred CCCHHHHHHHHHHHHCCHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH T ss_conf 48878999999998099999963758888710999999999961820099999999999999980078988767999999 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 99999965149988999999999999975 Q gi|254780661|r 439 KLQSLSGEYKGDEGKEGLINALYEYLQWV 467 (471) Q Consensus 439 ei~~l~~~~~~d~~~~~iL~~l~~yl~~l 467 (471) +|++|+.++..+ +++.+|++|++|+++| T Consensus 241 ~i~~L~~~~~~~-~k~~lL~~L~~Y~~~l 268 (268) T pfam08411 241 KLEQLAAEYSDD-EKQALLQALYDYAQSL 268 (268) T ss_pred HHHHHHHHCCCH-HHHHHHHHHHHHHHHC T ss_conf 999998647876-8999999999999839 No 4 >cd06138 ExoI_N This subfamily is composed of the N-terminal domain of E. coli exonuclease I (ExoI) and similar proteins. ExoI is a monomeric enzyme that hydrolyzes single stranded DNA in the 3' to 5' direction. It plays a role in DNA recombination and repair. It primarily functions in repairing frameshift mutations. The N-terminal domain of ExoI is a DEDDh-type DnaQ-like 3'-5 exonuclease containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The ExoI structure is unique among DnaQ family enzymes in that there is a large distance between the two metal ions required for catalysis and the catalytic histidine is oriented away from the active site. Probab=100.00 E-value=0 Score=360.46 Aligned_cols=183 Identities=59% Similarity=1.080 Sum_probs=178.3 Q ss_pred EEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHH Q ss_conf 99975278898772604899999983897280400226621788877857898985799889960799979999999999 Q gi|254780661|r 6 VIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQF 85 (471) Q Consensus 6 v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~~ 85 (471) ||||+||||+++.+|+|||+|||++|.+|+++++++. +|+|+++++|+|+|+.|||||++++.++|.|++|+++++.++ T Consensus 1 i~fD~ETTGl~~~~d~Iiqiaai~~d~~~~~~~~~~~-~i~p~~~~~p~p~a~~v~git~~~~~~~~~s~~e~~~~i~~~ 79 (183) T cd06138 1 LFYDYETFGLNPSFDQILQFAAIRTDENFNEIEPFNI-FCRLPPDVLPSPEALIVTGITPQQLLKEGLSEYEFIAKIHRL 79 (183) T ss_pred CEEEECCCCCCCCCCCEEEEEEEEECCCCCCCCEEEE-EECCCCCCCCCHHHHHHCCCCHHHHHHCCCCHHHHHHHHHHH T ss_conf 9897038997999876069999999899878644688-740489999884799871889999753188778999999998 Q ss_pred HHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH Q ss_conf 51179659995055789999999998507853213256887100158888655565321002444445443310898998 Q gi|254780661|r 86 FSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLAL 165 (471) Q Consensus 86 ~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la~ 165 (471) ++.+++++||||+++||.+|||+.|||++++||.++|+++|+|+|+|+++|+++.+.|+.++||.+++|.+||||++||+ T Consensus 80 ~~~~~~i~vg~N~~~FD~~fL~~~~~r~~~~p~~~~~~~~~~~~D~l~~~r~~~~~~p~~i~~~~~~~g~~s~kL~~l~~ 159 (183) T cd06138 80 FNTPGTCIVGYNNIRFDDEFLRFAFYRNLYDPYTWEWKNGNSRWDLLDVVRAYYALRPDGIVWPKNDDGKPSFKLEDLAQ 159 (183) T ss_pred HCCCCCEEEEECCHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHH T ss_conf 61699779984535568999999999827871566752787123078899999997820144765568974112999999 Q ss_pred HHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 718876431013331899999999 Q gi|254780661|r 166 ANGIEHVNAHDAKADVYATLALVW 189 (471) Q Consensus 166 ~n~i~~~~aH~A~~D~~~t~~l~k 189 (471) +|||++++||||++||.||++|+| T Consensus 160 ~~gi~~~~aH~Al~Dv~aT~~l~k 183 (183) T cd06138 160 ANGIEHSNAHDALSDVEATIALAK 183 (183) T ss_pred HCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 849999888682899999999869 No 5 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=100.00 E-value=2.8e-45 Score=309.31 Aligned_cols=435 Identities=17% Similarity=0.117 Sum_probs=309.9 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCC-EECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 83799975278898772604899999983897-28040022662178887785789898579988996079997999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQW-EKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~-~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) .+||+||+|||||+|.+|.|||+||+++ .+| +|+|+|+.| + +|....+...+.+||||.+||.++. +.-++..+ T Consensus 204 aTyVVfD~ETTGLsp~~d~iIE~gA~k~-~nGk~ii~~~~~F-i--kP~~~l~~~~~elT~ITq~ml~n~~-~~~~vL~k 278 (1264) T TIGR01405 204 ATYVVFDIETTGLSPQYDEIIEFGAVKV-KNGKRIIDKFQFF-I--KPKEPLSAFVTELTGITQDMLENAP-EIEEVLEK 278 (1264) T ss_pred CEEEEEEEEECCCCCCCCCEEEEEEEEE-ECCCEEEEEHHHH-C--CCCCCCCCCEEEECCCCHHHHCCCC-CHHHHHHH T ss_conf 2079960351588886142467875887-4685762312323-2--8875467542662466479737984-17899999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) |..++. ++++|+|||++||.+||++.|-++++.|. .|..+|+|.++|+++ .+-+||+|+ T Consensus 279 ~~~f~~--d~ilVAHNGasFD~~Fl~~~~~k~~~~~~------~~p~IDTL~Lar~ln-------------P~y~sh~Lg 337 (1264) T TIGR01405 279 FKEFLK--DSILVAHNGASFDIGFLNTNFEKVGLEPL------ENPVIDTLELARALN-------------PEYKSHRLG 337 (1264) T ss_pred HHHHHC--CEEEEECCCCCCCHHHHHHHHHHCCCCCC------CCCCHHHHHHHHHHC-------------CCCCCCCCC T ss_conf 999769--75888627610366889889876277423------365113889999837-------------311344542 Q ss_pred HHHHHHCCCCCC--CCHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHCC-----HHHHHHHHHHCCCCEEEEEECCCCCC Q ss_conf 899871887643--10133318999999999986422689-99996238-----89899742001576079970101047 Q gi|254780661|r 162 DLALANGIEHVN--AHDAKADVYATLALVWLIREKKPKLF-EYLYDYRN-----KNQLRKLIDIQNMTPLVHVSGMFGAS 233 (471) Q Consensus 162 ~la~~n~i~~~~--aH~A~~D~~~t~~l~k~i~~~~p~~~-~~~l~~~~-----K~~v~~~l~~~~~~~~~~v~~~~~~~ 233 (471) .+|+.+|+.... ||||..||.||..|++.+.+...+.. -.-|+..| +..-.+..+......+. +-...+.| T Consensus 338 ~~~~Kl~v~~~~e~~HRA~YDa~~t~~vF~~~~~~~~e~gGi~~l~~lni~e~~~l~~~~~y~r~~~~~~~-i~~kNq~G 416 (1264) T TIGR01405 338 EIAKKLGVDLDDERHHRADYDAEATAKVFIVMVEQLKEKGGITNLEELNIQELDKLSSEELYKRLRPNHII-ILVKNQAG 416 (1264) T ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHCCHHHHHHHCCHHHHHHCCCCEEE-EEEECCHH T ss_conf 23332055215330124334799999999999999875037211234057889842052345423745157-87763034 Q ss_pred CCCCEEE-EEEECCCCCCCEEEEEECCCCCCHHCCCCHHHHHHHHHCCHHHHHCCCC-CCEEEEECCCCCEECCHHHCCH Q ss_conf 7875068-8630288877447871035687200079988999874065144202477-4305765367875437557031 Q gi|254780661|r 234 RSNTALI-APVAWHPRYKDSVIGCNLSGDMRVFQDLDSVQLAKRLFTSHDELKGLVP-VPIKEVHLNKCPILMPIDYCKK 311 (471) Q Consensus 234 ~~~~~~v-~~~~~~p~~~n~~i~fDL~~Dp~~~l~l~~~el~~~l~~~~~~~~~~~~-~~lr~ik~Nk~Pil~~~~~~~~ 311 (471) ...+|-. +.......|..+.|....-....+-|-++.......|+.. -+.+... ..-..+|..+...++|...+.. T Consensus 417 LKnLykLvS~s~~~Yfy~~Pri~~s~L~~~~egLl~Gsac~~Gelf~a--~l~~~~~~e~e~iaK~YDfIei~Plg~y~~ 494 (1264) T TIGR01405 417 LKNLYKLVSISLTKYFYTRPRILRSLLKKYREGLLIGSACSEGELFDA--LLSKTQDDELEEIAKRYDFIEIQPLGNYAH 494 (1264) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCEEECCCCCCCHHHHH--HHCCCCHHHHHHHHHHCCEEEECCCCCHHH T ss_conf 788999987998653146897517889864387077177743227687--433885899999875447456278765078 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCCHHHHHCC------CCCCHHHHHHHHHHHCCCHHHHH Q ss_conf 345540358899999899976185468888744301334433320454102------46897662216889759988986 Q gi|254780661|r 312 EHFERWGIDHKRCLENLTLLRQQTNLRDRFKAIYNKPYTSSSQDVDSQLYD------GFFADVDRQIMDRILRTAPEQLS 385 (471) Q Consensus 312 ~~~~~~~i~~~~l~~r~~~l~~n~~~~~~~~~~~~~~~~~~~~~~E~~IY~------gF~s~~Dk~~~~~fh~a~~~~k~ 385 (471) ....+...+.+.+.+-.+.|-.--+-..|++-+..+.+|-. +|+.+|- .+..+..-+...-=..+...-.- T Consensus 495 l~~~e~~~d~E~~~~~~~~li~~a~~lNK~Vv~tG~~hy~~---~~d~i~r~il~~s~G~~~~~nr~~~~k~~~~~~~~P 571 (1264) T TIGR01405 495 LIEKEQVKDKEALKEIIKKLIKLAKELNKPVVATGDVHYIE---PEDKIYRKILVASQGLENKLNRLANYKEDAREVYLP 571 (1264) T ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCC---HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 98613787777899999999999986197799967842178---678764531002778886547666665435533387 Q ss_pred HHHCCCCCHHHHHHHH--------HHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH----HHHHCCCHHH Q ss_conf 2120489837999999--------996311810089999999999999750678889999999999----9651499889 Q gi|254780661|r 386 TLNLPFSDRRLPELFF--------RYRARNFPHTLSEKEKQDWLEHRKKMLTRSRIEEYKNKLQSL----SGEYKGDEGK 453 (471) Q Consensus 386 ~i~~~f~D~Rl~~l~~--------R~i~~n~P~~Ls~~~~~~~~~~i~~rl~~~~i~~~~~ei~~l----~~~~~~d~~~ 453 (471) ++..+.+++.+.||++ +.+--++|..|++..-..--+.|+++|+++.|+++.+++.++ +.+.++++-. T Consensus 572 ~~~~rTT~EML~ef~fLgee~~~a~eiVveNtN~ia~~~e~~di~Pi~~~LytP~ieg~~e~~~~~~Y~~A~~iYGdpLP 651 (1264) T TIGR01405 572 ELHFRTTEEMLDEFSFLGEEKLLAREIVVENTNKIADQIEEDDIQPIKDKLYTPKIEGADEKIRDLTYEKAKKIYGDPLP 651 (1264) T ss_pred EEEEECHHHHHHHHHHHCCHHHHHHHHHCCCCHHHHHHHCCCCCEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCH T ss_conf 05861648899998740464676644200360257755130586333788887800385278999999998974279856 Q ss_pred HHHHHHHHHHHHHHCC Q ss_conf 9999999999997527 Q gi|254780661|r 454 EGLINALYEYLQWVIP 469 (471) Q Consensus 454 ~~iL~~l~~yl~~l~~ 469 (471) +.+-++|++.|++|+. T Consensus 652 ~IVe~RieKEL~sII~ 667 (1264) T TIGR01405 652 EIVEQRIEKELKSIIG 667 (1264) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 8999999987456540 No 6 >COG2176 PolC DNA polymerase III, alpha subunit (gram-positive type) [DNA replication, recombination, and repair] Probab=100.00 E-value=5.4e-40 Score=275.82 Aligned_cols=425 Identities=16% Similarity=0.132 Sum_probs=279.9 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) .+||+||+|||||++.+|.|||+||+++ .+|+++|+|+.+ ++|+...+...+.+||||.+|+.+++ +..++..++ T Consensus 421 atyVVfDiETTGLs~~~d~iIE~aAvKi-kng~iId~f~~F---i~P~~pl~~~~telTgITdeml~~a~-~i~~vL~kf 495 (1444) T COG2176 421 ATYVVFDIETTGLSPVYDEIIEIAAVKI-KNGRIIDKFQFF---IKPGRPLSATITELTGITDEMLENAP-EIEEVLEKF 495 (1444) T ss_pred CCEEEEEEECCCCCCCCCHHHHHEEEEE-ECCCCHHHHHHH---CCCCCCCCHHHHHCCCCCHHHHCCCC-CHHHHHHHH T ss_conf 5089998553776754201562000443-388043778975---08999676445420356799874883-399999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) .+++. +++.|+|| ++||.+||++.|-+++++|+ .|...|+|.++|+.+ |. -+||+|+. T Consensus 496 ~~~~~--d~IlVAHN-asFD~gFl~~~~~k~~~~~~------~~pvIDTL~lar~L~---P~----------~ksh~Lg~ 553 (1444) T COG2176 496 REFIG--DSILVAHN-ASFDMGFLNTNYEKYGLEPL------TNPVIDTLELARALN---PE----------FKSHRLGT 553 (1444) T ss_pred HHHHC--CCEEEECC-CCCCHHHHHHHHHHHCCCCC------CCCHHHHHHHHHHHC---HH----------HHHCCHHH T ss_conf 99844--85899616-75564578888998388510------275011999998757---45----------43246677 Q ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCEEEE Q ss_conf 998718876431013331899999999998642268-9999962388989974200157607997010104778750688 Q gi|254780661|r 163 LALANGIEHVNAHDAKADVYATLALVWLIREKKPKL-FEYLYDYRNKNQLRKLIDIQNMTPLVHVSGMFGASRSNTALIA 241 (471) Q Consensus 163 la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~-~~~~l~~~~K~~v~~~l~~~~~~~~~~v~~~~~~~~~~~~~v~ 241 (471) +|+.+|+..+.+|+|..|+.||+.|+..+.+...+. +.+..+...+....+..+........ +....+.|....|-.+ T Consensus 554 l~kk~~v~le~hHRA~yDaeat~~vf~~f~~~~ke~Gi~~l~eln~~l~~~~~ykr~r~~h~~-i~vknq~GlKnLykLv 632 (1444) T COG2176 554 LCKKLGVELERHHRADYDAEATAKVFFVFLKDLKEKGITNLSELNDKLSSEDLYKRLRPKHAT-IYVKNQVGLKNLYKLV 632 (1444) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCCEE-EEEECHHHHHHHHHHH T ss_conf 888748637875310100899999999999999884556599986764016788632566269-9995317589999998 Q ss_pred EEE-CCCCCCCEEEEEECCCCCCHHCC----CCHHHHHHHHHCCHHHHHCCCCCCEEEEECCCCCEECCHHHCCHHHHHH Q ss_conf 630-28887744787103568720007----9988999874065144202477430576536787543755703134554 Q gi|254780661|r 242 PVA-WHPRYKDSVIGCNLSGDMRVFQD----LDSVQLAKRLFTSHDELKGLVPVPIKEVHLNKCPILMPIDYCKKEHFER 316 (471) Q Consensus 242 ~~~-~~p~~~n~~i~fDL~~Dp~~~l~----l~~~el~~~l~~~~~~~~~~~~~~lr~ik~Nk~Pil~~~~~~~~~~~~~ 316 (471) ... .+..|..+.+....-....+-+- .+..|+...+..+. .......++..+...++|...+....... T Consensus 633 S~s~~kyf~~~PrI~~S~L~~~regLligsac~~Gelf~a~l~k~------~ee~e~iak~YDfIei~pp~~y~~li~~e 706 (1444) T COG2176 633 SISHTKYFYGRPRIPRSVLKKNREGLLIGSACSEGELFDAALQKP------DEEVEEIAKFYDFIEIQPPANYAHLIERE 706 (1444) T ss_pred HHHHHHHHCCCCCCCHHHHHHHHCCEEECCCCCCHHHHHHHHHCC------HHHHHHHHHHCCEEEECCCHHHHHHHHHC T ss_conf 888887642688771999997412716436865127999998279------99999998755758714820014678750 Q ss_pred CCCCHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCCHHHHHCCCCC--CHH-HHHHHHHHHCCCHHHHHHHHCCCCC Q ss_conf 0358899999899976185468888744301334433320454102468--976-6221688975998898621204898 Q gi|254780661|r 317 WGIDHKRCLENLTLLRQQTNLRDRFKAIYNKPYTSSSQDVDSQLYDGFF--ADV-DRQIMDRILRTAPEQLSTLNLPFSD 393 (471) Q Consensus 317 ~~i~~~~l~~r~~~l~~n~~~~~~~~~~~~~~~~~~~~~~E~~IY~gF~--s~~-Dk~~~~~fh~a~~~~k~~i~~~f~D 393 (471) .--+.+.+.+-.+.|- .+..++.............++++.||-.-. |.. .-.+-..|. +....++..+.+| T Consensus 707 ~~~~~e~l~~~~k~li---~~g~~l~K~Vvatgn~hyl~p~d~iyr~ilv~s~G~~~~~~r~~~---~~~~P~~~frTT~ 780 (1444) T COG2176 707 GLKDKEALKEIIKKLI---KLGKKLNKPVVATGNVHYLDPEDKIYRKILVASQGLGNPLNRTFN---EQTLPEVHFRTTD 780 (1444) T ss_pred CCCCHHHHHHHHHHHH---HHHHHHCCCEEEECCCEECCHHHHHHHHHHEECCCCCCCCCCCCC---CCCCCCEEEECHH T ss_conf 3672999999999999---999984990899577410685787888861215776765554556---5568850565179 Q ss_pred HHHHHHHH------HHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH----HHHHCCCHHHHHHHHHHHHH Q ss_conf 37999999------996311810089999999999999750678889999999999----96514998899999999999 Q gi|254780661|r 394 RRLPELFF------RYRARNFPHTLSEKEKQDWLEHRKKMLTRSRIEEYKNKLQSL----SGEYKGDEGKEGLINALYEY 463 (471) Q Consensus 394 ~Rl~~l~~------R~i~~n~P~~Ls~~~~~~~~~~i~~rl~~~~i~~~~~ei~~l----~~~~~~d~~~~~iL~~l~~y 463 (471) +.+.+|++ +-+.-++|+.+.+..-. .+.|++.|+.+.++++.+++.++ +.+.++++-...+-.+|++. T Consensus 781 EML~efsfLgee~a~eiVv~Ntn~iad~ie~--i~Pik~~LytPkie~a~e~v~~lty~~A~~iYG~pLP~IVe~RiEKE 858 (1444) T COG2176 781 EMLQEFSFLGEEKAYEIVVENTNKIADMIED--IQPIKDKLYTPKIEGAEEKVRDLTYEKAHKIYGDPLPEIVEQRIEKE 858 (1444) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHC--CEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 9999987652877778864582889988660--22366677798617777999999999999986797069999999999 Q ss_pred HHHHCC Q ss_conf 997527 Q gi|254780661|r 464 LQWVIP 469 (471) Q Consensus 464 l~~l~~ 469 (471) |++|+. T Consensus 859 L~sII~ 864 (1444) T COG2176 859 LNSIIG 864 (1444) T ss_pred HHHHHC T ss_conf 999972 No 7 >PRK08074 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=100.00 E-value=9.9e-35 Score=242.00 Aligned_cols=171 Identities=25% Similarity=0.329 Sum_probs=150.0 Q ss_pred CCCCEEEEECCCCCCCCCC-CEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHH Q ss_conf 9883799975278898772-604899999983897280400226621788877857898985799889960799979999 Q gi|254780661|r 1 MTNHFVIYDYETFGRDVAL-DRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFS 79 (471) Q Consensus 1 m~~~~v~~D~ETtG~~~~~-~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~ 79 (471) |.+.|++.|+||||.++.. |+|||||++++ ++++|+++|.++ + +|+..+++.+..+||||++++.++| ++.|++ T Consensus 1 m~~~~~vvDlETTG~~~~~~d~IIqig~v~i-~~~~i~~~f~t~-v--nP~~~iP~~I~~LTgI~~~~v~~AP-~f~eva 75 (932) T PRK08074 1 MSKRFVVVDLETTGNSPKKGDKIIQIAAVVV-EDGEILERFSTF-V--NPERPIPPFITELTGISEEMVKNAP-LFEDVA 75 (932) T ss_pred CCCEEEEEEEECCCCCCCCCCEEEEEEEEEE-ECCEEEEEEECC-C--CCCCCCCHHHHHHCCCCHHHHCCCC-CHHHHH T ss_conf 9970899998888989777988999979999-999998998532-5--8999899778860698989980599-989999 Q ss_pred HHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 99999951179659995055789999999998507853213256887100158888655565321002444445443310 Q gi|254780661|r 80 RRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFK 159 (471) Q Consensus 80 ~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~k 159 (471) .+|.+++. +++||+|| ++||.+||++.|.+.|+++. .+..+|++.++|.+ .| .-+||+ T Consensus 76 ~~i~~~l~--~~~~VaHN-v~FD~~FL~~~l~~~G~~~~------~~~~~DTveLa~i~---~P----------~~~sy~ 133 (932) T PRK08074 76 PMIVELLE--GAYFVAHN-VHFDLNFLNEELERAGYTEI------HCPKLDTVELARIL---LP----------TADSYK 133 (932) T ss_pred HHHHHHHC--CCEEEECC-HHHHHHHHHHHHHHCCCCCC------CCCEECHHHHHHHH---CC----------CCCCCC T ss_conf 99999966--99599478-05409999999997399866------68765199999997---78----------878888 Q ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 898998718876431013331899999999998642268 Q gi|254780661|r 160 LQDLALANGIEHVNAHDAKADVYATLALVWLIREKKPKL 198 (471) Q Consensus 160 L~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~ 198 (471) |++|+..+|++|+++|+|.+||+||++|+..+.++...+ T Consensus 134 L~~L~~~l~i~~~~~HrA~sDa~aTa~l~~~l~~~l~~l 172 (932) T PRK08074 134 LSDLSEELELEHDQPHRADSDAEVTAELFLQLLHKLERL 172 (932) T ss_pred HHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 899998649999998751788999999999999999837 No 8 >PRK08517 DNA polymerase III subunit epsilon; Provisional Probab=100.00 E-value=1.1e-34 Score=241.84 Aligned_cols=169 Identities=20% Similarity=0.201 Sum_probs=147.6 Q ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 88379997527889877260489999998389728040022662178887785789898579988996079997999999 Q gi|254780661|r 2 TNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) ...||++|+||||++|..|+|||||||++ .+|+++|+|+++ ++|. .+ ++.+..+||||++|+.++| +..|++++ T Consensus 69 e~~fvvvDlETTGl~~~~d~IIEIgavkv-~~g~iid~f~tl-Vnp~--~I-p~~It~lTGIt~emv~~aP-~~~evl~~ 142 (259) T PRK08517 69 EQVFCFVDIETNGSKPKKGQIIEIGAVKV-KGGEIIDRFESF-VKAK--EV-PEYITELTGITYEDLENAP-SLKEVLED 142 (259) T ss_pred CCCEEEEEEECCCCCCCCCEEEEEEEEEE-ECCEEEEEEEEE-ECCC--CC-CHHHHHHCCCCHHHHHCCC-CHHHHHHH T ss_conf 88389999658998999982499999999-999999987432-3989--89-9888221389989982799-88999999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) +.+|+. +.++|||| ++||.+||++.+.|.++++. .|.++|++.++|.++ |. ++|+|+ T Consensus 143 f~~Fi~--d~ilVaHN-a~FD~~FL~~~l~r~g~~~l------~n~~lDTl~LAR~~~---p~-----------~~ykL~ 199 (259) T PRK08517 143 FRLFLG--DSVFVAHN-VNFDYNFISRSLEEIGLGPL------LNRKLCTIDLAKRTI---ES-----------PRYGLS 199 (259) T ss_pred HHHHHC--CCEEEEEC-CHHHHHHHHHHHHHCCCCCC------CCCEEEEHHHHHHHC---CC-----------CCCCHH T ss_conf 999968--89599937-27609999999998399976------898566089999875---87-----------787889 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89987188764310133318999999999986422689 Q gi|254780661|r 162 DLALANGIEHVNAHDAKADVYATLALVWLIREKKPKLF 199 (471) Q Consensus 162 ~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~~ 199 (471) .||+.+||+++++|+|++||.||++|++.+-++.|.-. T Consensus 200 ~La~~~~I~~~~~HRAl~DA~aTaevF~~~L~~l~~~v 237 (259) T PRK08517 200 FLKELLGIETEVHHRAYADALAAYEIFEICLLNLPSYI 237 (259) T ss_pred HHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 99997598999997845889999999999998566556 No 9 >TIGR01407 dinG_rel DnaQ family exonuclease/DinG family helicase, putative; InterPro: IPR006310 These are a family of proteins in Gram-positive bacteria. The N-terminal region of about 200 amino acids resembles the epsilon subunit of E. coli DNA polymerase III and the homologous region of the Gram-positive type DNA polymerase III alpha subunit. The epsilon subunit contains an exonuclease domain. The remainder of this protein family resembles a predicted ATP-dependent helicase, the DNA damage-inducible protein DinG of Escherichia coli. . Probab=100.00 E-value=8.4e-35 Score=242.45 Aligned_cols=168 Identities=25% Similarity=0.343 Sum_probs=147.1 Q ss_pred CEEEEECCCCCCCCCC-CEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 3799975278898772-604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 4 HFVIYDYETFGRDVAL-DRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~-~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) .|++.|+||||-++.. |+|||||+|++ +|++++|+|.+. | +|+...+|.+..+|||++.|++.+| .+.+++++| T Consensus 1 rYaVVDLEtTg~~~~~~d~IIqiGiV~v-~DgeIv~~f~T~-V--NP~~~~pp~I~eLTGisd~~l~~AP-~F~qVA~~i 75 (944) T TIGR01407 1 RYAVVDLETTGTDSSEFDKIIQIGIVLV-EDGEIVDTFATD-V--NPNEPIPPFIQELTGISDEQLKQAP-YFSQVAQEI 75 (944) T ss_pred CEEEEEEEECCCCCCCCCCEEEEEEEEE-ECCCEEEECCCC-C--CCCCCCCCHHHHHCCCCHHHHCCCC-CHHHHHHHH T ss_conf 9689987416888645576689988998-558143201267-7--8898776114542287768640488-887999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) ++.+. +++||+|| ..||.+||...|.++|++|. .+.+.|++.++|. |+|. ..||+|+. T Consensus 76 ~~~l~--D~iFVAHN-V~FD~nfL~k~l~~~G~~~~------~~~~iDTVELa~i---f~PT----------~esY~Ls~ 133 (944) T TIGR01407 76 YDLLE--DGIFVAHN-VHFDLNFLAKELVDEGYEPL------EVPRIDTVELARI---FFPT----------EESYKLSE 133 (944) T ss_pred HHHHC--CCEEEEEC-HHCCHHHHHHHHHHCCCCCC------CCCCCHHHHHHHH---HCCC----------CCCCCCHH T ss_conf 98742--97277610-15067899999985689743------4352005679987---4674----------43556148 Q ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 998718876431013331899999999998642268 Q gi|254780661|r 163 LALANGIEHVNAHDAKADVYATLALVWLIREKKPKL 198 (471) Q Consensus 163 la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~ 198 (471) ||+..||.|++||+|.+||.||++|+-++.+|...+ T Consensus 134 L~E~LGl~h~npH~AdSDA~~TAeLLLl~~~K~~~L 169 (944) T TIGR01407 134 LSEELGLEHENPHRADSDAQVTAELLLLLEEKMKKL 169 (944) T ss_pred HHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 999719871466761057899999999999998751 No 10 >PRK06309 DNA polymerase III subunit epsilon; Validated Probab=100.00 E-value=6.6e-34 Score=236.70 Aligned_cols=167 Identities=21% Similarity=0.194 Sum_probs=141.5 Q ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 88379997527889877260489999998389728040022662178887785789898579988996079997999999 Q gi|254780661|r 2 TNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) |.+|||||+||||+++..|+|||+|++. +.+.++|.++ + +|++.++++|..|||||++|+.++| ++.+++++ T Consensus 1 m~~~V~fD~ETTGl~~~~DrIIEiaav~----~~~~e~f~~l-i--nP~~pIp~~a~~ihGIt~e~v~~~P-~~~ev~~~ 72 (232) T PRK06309 1 MPALIFYDTETTGTQIDKDRIIEIAAYN----GVTSESFQTY-V--NPEIPIPAEATKIHGITTSEVASAP-KFPEAYQK 72 (232) T ss_pred CCCEEEEEEECCCCCCCCCEEEEEEEEE----CCCCCEEEEE-E--CCCCCCCHHHHCCCCCCHHHHHCCC-CHHHHHHH T ss_conf 9968999711799599994589999892----7567689999-7--9989699656210383999980599-89999999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) +.+++.. .+++||||+.+||.+||+..|.|+++++. .+..+|+|.++|.. .| +.++++|+ T Consensus 73 ~~~f~~~-~~ilVaHN~~~FD~~fL~~e~~r~g~~~~------~~~~iDTl~lar~l---~P----------~l~~h~L~ 132 (232) T PRK06309 73 FREFCGT-DNILVAHNNDGFDFPLLVKECRRHSLEPL------TLRTIDSLKWAQKY---RP----------DLPKHNLQ 132 (232) T ss_pred HHHHHCC-CCEEEEECCHHHHHHHHHHHHHHCCCCCC------CCCHHHHHHHHHHH---CC----------CCCCCCHH T ss_conf 9998579-98899848965679999999998599867------74053799999997---58----------97768899 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89987188764310133318999999999986422 Q gi|254780661|r 162 DLALANGIEHVNAHDAKADVYATLALVWLIREKKP 196 (471) Q Consensus 162 ~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p 196 (471) .||+.+|+..++||+|++||.||.+|++.+-...+ T Consensus 133 ~L~~~~gi~~~~aHrAl~Da~at~~vf~~ll~~~~ 167 (232) T PRK06309 133 YLRQVYGFAENQAHRALDDVITLHRVFSALVGDLS 167 (232) T ss_pred HHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf 99998099998888859999999999999872169 No 11 >PRK00448 polC DNA polymerase III PolC; Validated Probab=100.00 E-value=5e-35 Score=243.89 Aligned_cols=169 Identities=22% Similarity=0.260 Sum_probs=146.0 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) .+||+||+||||+++.+|+|||+|||++ .+|+++|+|+.+ + +|....++.+..+||||++|+.+++ +..|+.+++ T Consensus 418 ~~yVvFDiETTGls~~~d~IiEigAvki-~~g~iid~f~~f-i--~P~~~i~~~i~~lT~Itd~mv~~~~-~~~evl~~f 492 (1436) T PRK00448 418 ATYVVFDVETTGLSAVYDEIIEIGAVKI-KNGEIIDKFEEF-I--KPGHPLSAFTTELTGITDDMVKDAP-SIEEVLPKF 492 (1436) T ss_pred CCEEEEEECCCCCCCCCCCEEEEEEEEE-ECCEEEEEEHHH-C--CCCCCCCHHHHHCCCCCHHHHCCCC-CHHHHHHHH T ss_conf 6379986006888866672368757986-378175410210-3--8999797545321478847863898-789999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) .+|.. +++.|+|| ++||.+|++..+-|+++++. .|..+|+|.++|.. +|+ -++|+|+. T Consensus 493 ~~f~~--d~vlVAHN-a~FD~~Fi~~~~~~~~~~~~------~~p~iDTl~lsr~l---~p~----------~k~~~L~~ 550 (1436) T PRK00448 493 KEFCG--DSVLVAHN-ASFDVGFINTNYEKLGLEKI------KNPVIDTLELSRFL---YPE----------YKSHRLNT 550 (1436) T ss_pred HHHHC--CCEEEEEC-CCCCHHHHHHHHHHCCCCCC------CCCEECHHHHHHHH---CCC----------CCCCCHHH T ss_conf 99828--97799845-86568899999998089866------79723029989875---510----------11144788 Q ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 998718876431013331899999999998642268 Q gi|254780661|r 163 LALANGIEHVNAHDAKADVYATLALVWLIREKKPKL 198 (471) Q Consensus 163 la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~ 198 (471) +|+.+++..+++|+|..|+.||+.++..+.+...+- T Consensus 551 l~k~~~v~~~~hHRA~~Da~~t~~i~~~~l~~~~~~ 586 (1436) T PRK00448 551 LAKKFGVELEHHHRADYDAEATAYLLIKFLKDLKEK 586 (1436) T ss_pred HHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC T ss_conf 998848886545443223888999999999999874 No 12 >PRK06807 DNA polymerase III subunit epsilon; Validated Probab=100.00 E-value=1.7e-33 Score=234.06 Aligned_cols=187 Identities=18% Similarity=0.153 Sum_probs=151.1 Q ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 88379997527889877260489999998389728040022662178887785789898579988996079997999999 Q gi|254780661|r 2 TNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) +.+||++|+||||++|..|+|||+|||++ .+++++|+|.++ + +|....++.+..+||||++|+.++| +..|++++ T Consensus 7 p~~fVV~DiETTGl~p~~D~IIEIgAVkv-~ng~Ivd~f~tL-I--nP~~~Ip~~It~ItGIT~emV~dAP-~f~EVl~~ 81 (313) T PRK06807 7 PLDYVVIDFETTGFNPYNDKIIQVAAVKY-RNHELVDQFVSY-V--NPERPIPDRITSLTGITNYRVSDAP-TIEEVLPL 81 (313) T ss_pred CCCEEEEEEECCCCCCCCCEEEEEEEEEE-ECCEEEEEEEEE-E--CCCCCCCHHHHHHCCCCHHHHHCCC-CHHHHHHH T ss_conf 98789999788997999987899999999-999798999999-8--9489799667653397999982699-89999999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) +.+|+. +.++|||| ++||.+||++.+.|.++++. .+..+|++.++|.. . .+-+.|||+ T Consensus 82 f~~Fi~--d~iLVAHN-A~FD~~FL~~~L~r~gl~~~------~n~~IDTL~LAR~l---~----------P~L~NhKLn 139 (313) T PRK06807 82 FLAFLH--TNVIVAHN-ASFDMRFLKSNVNMLGLPEP------KNKVIDTVFLAKKY---M----------KHAPNHKLE 139 (313) T ss_pred HHHHCC--CCEEEEEC-CHHHHHHHHHHHHHCCCCCC------CCCEEEHHHHHHHH---C----------CCCCCCCHH T ss_conf 998738--99899955-36619999999997499888------99877659999986---4----------787656678 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHCCHHHHHHHHH Q ss_conf 89987188764310133318999999999986422689-99996238898997420 Q gi|254780661|r 162 DLALANGIEHVNAHDAKADVYATLALVWLIREKKPKLF-EYLYDYRNKNQLRKLID 216 (471) Q Consensus 162 ~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~~-~~~l~~~~K~~v~~~l~ 216 (471) .+|+.+||+. ++|+|+.||.||+++++++.....+-- ..-.....+..|...++ T Consensus 140 TVa~~LgI~l-~HHrAldDA~acAeI~~k~~~~~~~~~~~~~~~~~~e~~~~~~i~ 194 (313) T PRK06807 140 TLKRMLGIRL-SSHNAFDDCITCAAVYQKCASIEEEAKRKSNKEVLDETAVYEAVK 194 (313) T ss_pred HHHHHHCCCC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999839983-436717789999999999999886166665553225889999999 No 13 >PRK07246 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=100.00 E-value=1.9e-33 Score=233.74 Aligned_cols=169 Identities=21% Similarity=0.256 Sum_probs=145.3 Q ss_pred CCCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHH Q ss_conf 98837999752788987726048999999838972804002266217888778578989857998899607999799999 Q gi|254780661|r 1 MTNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSR 80 (471) Q Consensus 1 m~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~ 80 (471) ++.+||+.|+||||.++. ++||||||+++ ++++++++|.++ + +|+..+++.+..+|||+++++.++| ++.|++. T Consensus 5 ~~~~~~vvDlETTG~~~~-~~Iiqi~~v~~-~~~~i~~~f~t~-v--nP~~~ip~~I~~LTgI~~~~v~~ap-~f~~v~~ 78 (820) T PRK07246 5 KLRKYAVVDLEATGAGPN-ASIIQVGIVII-QGNKIIDSYETD-V--NPHESLDEHIVHLTGITDKQLAQAP-DFSQVAH 78 (820) T ss_pred CCCCEEEEEEECCCCCCC-CCEEEEEEEEE-ECCEEEEEEEEC-C--CCCCCCCHHHHHHCCCCHHHHHCCC-CHHHHHH T ss_conf 467189999887898999-87799999999-999998998731-1--8899799788864698989985299-9899999 Q ss_pred HHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH Q ss_conf 99999511796599950557899999999985078532132568871001588886555653210024444454433108 Q gi|254780661|r 81 RIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKL 160 (471) Q Consensus 81 ~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL 160 (471) ++.+++. ++++|||| ++||.+||++.+.+.+++ + .+.++|++.++|.+ .| .-+||+| T Consensus 79 ~~~~~l~--~~~~VaHN-v~FD~~fl~~~~~~~g~~-~------~~~~~DTv~La~i~---~P----------~~~~y~L 135 (820) T PRK07246 79 HIYQLIE--DCIFVAHN-VKFDANLLAEALFLEGYE-L------RTPRVDTVELAQVF---FP----------RLEKYSL 135 (820) T ss_pred HHHHHHC--CCEEEEEC-CCCCHHHHHHHHHHCCCC-C------CCCCCCHHHHHHHH---CC----------CCCCCCH T ss_conf 9999967--98599837-201699999999975898-8------99826099999997---79----------8788888 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98998718876431013331899999999998642268 Q gi|254780661|r 161 QDLALANGIEHVNAHDAKADVYATLALVWLIREKKPKL 198 (471) Q Consensus 161 ~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~ 198 (471) ++||...|++|.++|+|.+||.||++|+..+.++...+ T Consensus 136 ~~L~~~l~~~~~~~HrA~~Da~ata~l~~~l~~~l~~l 173 (820) T PRK07246 136 SHLSRQLNIDLAEAHTAIADARATAILFLRLLQKIESL 173 (820) T ss_pred HHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999739999988761889999999999999998619 No 14 >PRK06310 DNA polymerase III subunit epsilon; Validated Probab=100.00 E-value=3.8e-33 Score=231.84 Aligned_cols=209 Identities=21% Similarity=0.186 Sum_probs=158.1 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ..||++|+||||+++..|+|||||+|+++. +.++++++.+ | +|...++++|..|||||++|+.++| ++.++++++ T Consensus 7 ~~yvv~D~ETTGl~~~~DrIIEI~~vkv~~-~~i~~~~~~l-i--nP~~~I~~~~~~ihGIt~emV~d~P-~f~ev~~~~ 81 (250) T PRK06310 7 TEFVCLDCETTGLDVKKDRIIEFAAIRFTF-DEVIDSMEFL-I--NPERVVSAESQRIHHISDAMLRDKP-KIAEVFPQI 81 (250) T ss_pred CCEEEEEEECCCCCCCCCEEEEEEEEEEEC-CEEEEEEEEE-E--CCCCCCCHHHHHHHCCCHHHHHCCC-CHHHHHHHH T ss_conf 988999733799799996279999999999-9886889999-8--9289799889876154899984699-999999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) .+++.. +.++|||| +.||.+||++.|.|.+.++.. ..+..+|++.++|.. | ..++++|+. T Consensus 82 ~~fi~~-~~ilVaHN-a~FD~~fL~~e~~r~g~~~~~----~~~~~IDTl~lAr~~----~----------~~~~~~L~~ 141 (250) T PRK06310 82 KGFFKE-GDYIVGHS-VGFDLQVLAQEAERIGETFLS----KHYYIIDTLRLAKEY----G----------DSPNNSLEA 141 (250) T ss_pred HHHHCC-CCEEEEEC-HHHHHHHHHHHHHHCCCCCCC----CCCCEECHHHHHHHC----C----------CCCCCHHHH T ss_conf 999679-99999938-455799999999976998665----679578499999756----7----------654560999 Q ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCEEEEE Q ss_conf 99871887643101333189999999999864226899999623889899742001576079970101047787506886 Q gi|254780661|r 163 LALANGIEHVNAHDAKADVYATLALVWLIREKKPKLFEYLYDYRNKNQLRKLIDIQNMTPLVHVSGMFGASRSNTALIAP 242 (471) Q Consensus 163 la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~~~~~l~~~~K~~v~~~l~~~~~~~~~~v~~~~~~~~~~~~~v~~ 242 (471) ||...|+++.+||+|++||+||++|++.+.++.+. -.....+...... .. .+..|+.+..++.- T Consensus 142 L~~~~~i~~~~aHrAl~Da~aTa~vf~~l~~r~~~----------~e~~~~i~~~p~~---~~---~f~FGKhKGk~v~e 205 (250) T PRK06310 142 LAVHFNVPYDGNHRAMKDVEININVFKHLCKRFRT----------LEQLKQVLSKPIK---MK---YMPLGKHKGRLFSE 205 (250) T ss_pred HHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCC----------HHHHHHHHCCCCC---CC---EECCCCCCCEEHHH T ss_conf 99970999986668199999999999999985537----------9999998619463---42---00577849888998 Q ss_pred EECCCCCCCEEE Q ss_conf 302888774478 Q gi|254780661|r 243 VAWHPRYKDSVI 254 (471) Q Consensus 243 ~~~~p~~~n~~i 254 (471) +. +.|-.|+. T Consensus 206 v~--~~Y~~W~~ 215 (250) T PRK06310 206 IP--LEYLQWAS 215 (250) T ss_pred HH--HHHHHHHH T ss_conf 17--99999998 No 15 >PRK07883 hypothetical protein; Validated Probab=100.00 E-value=2.2e-33 Score=233.30 Aligned_cols=171 Identities=22% Similarity=0.188 Sum_probs=149.3 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) .+||++|+||||+++..|+|||||||++ .+++++++|.++ + +|+..++|.+..+||||++|+.++| +..+++.+| T Consensus 24 ~tfVVvDlETTG~s~~~d~IiEIgAVkv-~~g~vv~~f~tL-V--nP~~~Ip~~I~~LTGIt~~mV~~AP-~~~~vl~~f 98 (575) T PRK07883 24 TTFVVVDLETTGGSPTGDAITEIGAVKV-RGGEVLGEFATL-V--NPGRSIPPFIVVLTGITTAMVADAP-PIDAVLPAF 98 (575) T ss_pred CEEEEEEEECCCCCCCCCEEEEEEEEEE-ECCEEEEEEECC-C--CCCCCCCHHHHCCCCCCHHHHCCCC-CHHHHHHHH T ss_conf 8489999507999988970599999999-899999997100-1--9389898544003696989980799-999999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) .+|+. ++++|+|| +.||..||++.+.|.++++. .|..+|++.++|.+ .| .+..++|||+. T Consensus 99 ~~F~~--~~vlVAHN-a~FD~~FL~~~~~r~g~~~~------~~~~ldTl~LARrl---~p--------~~e~~~~kL~t 158 (575) T PRK07883 99 LEFAR--GSVLVAHN-APFDIGFLRAAAERCGYPWP------QPEVLCTVRLARRV---LS--------RDEAPNVRLSA 158 (575) T ss_pred HHHHC--CCEEEEEC-CHHHHHHHHHHHHHCCCCCC------CCCEEEHHHHHHHH---CC--------CCCCCCCCHHH T ss_conf 99968--99899908-34529999999997599987------89786489999986---69--------21146899899 Q ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 998718876431013331899999999998642268 Q gi|254780661|r 163 LALANGIEHVNAHDAKADVYATLALVWLIREKKPKL 198 (471) Q Consensus 163 la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~ 198 (471) ||..+|+.+.+.|+|++|+.||++++..+.++.... T Consensus 159 La~~~g~~~~~~HRAl~DA~AT~~vl~~l~~~~~~~ 194 (575) T PRK07883 159 LARLFGAATTPTHRALDDARATVDVLHGLIERLGNQ 194 (575) T ss_pred HHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 999759999998660452999999999999985422 No 16 >PRK07740 hypothetical protein; Provisional Probab=100.00 E-value=1.9e-32 Score=227.29 Aligned_cols=168 Identities=21% Similarity=0.235 Sum_probs=142.9 Q ss_pred CCEEEEECCCCCCCCCC-CEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 83799975278898772-60489999998389728040022662178887785789898579988996079997999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVAL-DRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~-~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) -.||+||+||||++|.. |+|||||||++ .+++++|+|+++ + +|...+++.+..+||||++|+.++| +..|++.+ T Consensus 57 ~~fvV~D~ETTGl~p~~gD~IIeIgAVkv-~~g~i~d~F~tL-V--nP~~~Ip~~i~~ltGIT~emV~~aP-~~~eVl~~ 131 (240) T PRK07740 57 LPFVVFDLETTGFSPDQGDEILSIAAVKT-VGGEVTDLYYSL-V--KPKKPIPEHILTLTGISAKDVVFAP-PLAEVLRT 131 (240) T ss_pred CEEEEEEEECCCCCCCCCCEEEEEEEEEE-ECCEEEEEEEEE-E--CCCCCCCHHHHHHCCCCHHHHHCCC-CHHHHHHH T ss_conf 83899980589989888987899989999-999996887755-0--8879899889986098999983799-99999999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) +.+|++ ++++|||| ++||..||++.+.|.+..+. .+..+|++.+++.. .|. ++++|+ T Consensus 132 fl~Fi~--~~vlVaHN-a~FD~~FL~~~l~r~~~~~~------~~~~iDT~~Lar~l---~~~-----------~~~sL~ 188 (240) T PRK07740 132 FYPFIN--ASTLVGYH-IGHDLAFLRHALWRHYRQKF------SHRFIDTQFLTEII---AHQ-----------SFPTLD 188 (240) T ss_pred HHHHHC--CCEEEEEC-CHHHHHHHHHHHHHHCCCCC------CCCEEEHHHHHHHH---CCC-----------CCCCHH T ss_conf 999838--99899968-28639999999998459987------88763699999986---589-----------987899 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8998718876431013331899999999998642268 Q gi|254780661|r 162 DLALANGIEHVNAHDAKADVYATLALVWLIREKKPKL 198 (471) Q Consensus 162 ~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~ 198 (471) .+|..+||+..++|+|++||.+|++|+..+.++..+. T Consensus 189 ~l~~~fgI~~~~rHrAl~DA~aTA~lf~~ll~~l~~~ 225 (240) T PRK07740 189 DALACYGITCPRRHTADGDVEMTAKLWAILIIEARQA 225 (240) T ss_pred HHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999769999999882999999999999999999976 No 17 >PRK07942 DNA polymerase III subunit epsilon; Provisional Probab=100.00 E-value=1e-31 Score=222.69 Aligned_cols=184 Identities=24% Similarity=0.241 Sum_probs=151.9 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ..+++||+||||+++..|+|||+|+|++|.++.+.++++.+ + +|++.++++|..|||||+++++++|.+..+++.++ T Consensus 6 ~~lvvfD~ETTG~d~~~dRIIeia~V~~~~~g~~~~~~~~L-v--nPg~~IP~~a~~vhGIT~e~~~~~G~p~a~v~~ei 82 (234) T PRK07942 6 GPLAAFDLETTGVDPETARIVTAAIVVVDAHGEVVERREWL-A--DPGVEIPEEASAVHGITTERARAHGRPAAEVLAEI 82 (234) T ss_pred CCEEEEEEECCCCCCCCCEEEEEEEEEEECCCCEEEEEEEE-E--CCCCCCCHHHHHHHCCCHHHHHHCCCCCHHHHHHH T ss_conf 97899980169989889838999999998898546889998-6--95894998897784889999974289830799999 Q ss_pred HHHHHC---CCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 999511---79659995055789999999998507853213256887100158888655565321002444445443310 Q gi|254780661|r 83 HQFFSV---PNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFK 159 (471) Q Consensus 83 ~~~~~~---~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~k 159 (471) .+.+.. .|..+|+|| +.||..||++.+.|.++++. .....+|.+.+.|...-+.+ | +.+ T Consensus 83 a~~l~~~~~~g~~lVahN-A~FDl~fL~~El~R~g~~~l-----~~~~viDtl~l~r~l~~~r~----------g--kr~ 144 (234) T PRK07942 83 AEALRAVWRAGVPVVVYN-APYDLTVLDRELRRHGLPSL-----VPGPVIDPYVIDKHVDRYRK----------G--KRT 144 (234) T ss_pred HHHHHHHHHCCCEEEEEC-CHHHHHHHHHHHHHCCCCCC-----CCCCEEEHHHHHHHHCCCCC----------C--CCC T ss_conf 999999972798799967-07649999999998599967-----77755619998877514777----------8--888 Q ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 898998718876431013331899999999998642268999996238 Q gi|254780661|r 160 LQDLALANGIEHVNAHDAKADVYATLALVWLIREKKPKLFEYLYDYRN 207 (471) Q Consensus 160 L~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~~~~~l~~~~ 207 (471) |+.||..+|+++++||+|.+|+.||++|+..+..+.|++=...+.... T Consensus 145 L~aL~~~ygv~l~~aH~A~aDA~Ata~l~~~l~~r~~~l~~~~~~eL~ 192 (234) T PRK07942 145 LTALCEHYGVRLDNAHDATADALAAARLAWALARRFPELAALSAAELH 192 (234) T ss_pred HHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 999999849988775564152899999999999876887559999999 No 18 >PRK09145 DNA polymerase III subunit epsilon; Validated Probab=99.97 E-value=6.9e-30 Score=210.88 Aligned_cols=172 Identities=17% Similarity=0.179 Sum_probs=133.2 Q ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEEC--CEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHH Q ss_conf 883799975278898772604899999983897280--400226621788877857898985799889960799979999 Q gi|254780661|r 2 TNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKI--ESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFS 79 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~--~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~ 79 (471) .+.||++|+||||+++..|+|||||||+++. +.++ ++|+++ ++ |+..+++.+..|||||++++.+++ +..+++ T Consensus 28 ~~~fVv~D~ETTGL~~~~D~IieIgav~v~~-~~i~~~~~f~~l-V~--P~~~i~~~~~~IhGIt~~~l~~ap-~~~evl 102 (203) T PRK09145 28 PDEWVALDCETTGLDPRRAEIVSIAAVKIRG-NRILTSERLELL-VR--PPQSLSAESIKIHRLRHQDLEHGL-SEEEAL 102 (203) T ss_pred CCCEEEEEEECCCCCCCCCEEEEEEEEEEEC-CEEECCCEEEEE-EC--CCCCCCHHHHCCCCCCHHHHCCCC-CHHHHH T ss_conf 9989999856899999997069999999999-999317528999-89--799598645201387879981399-999999 Q ss_pred HHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 99999951179659995055789999999998507853213256887100158888655565321002444445443310 Q gi|254780661|r 80 RRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFK 159 (471) Q Consensus 80 ~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~k 159 (471) +++.+++. +..+|||| ++||..||.+.+.+.+-.+ ..+..+|+..+.... ... ....+..+++ T Consensus 103 ~~f~~fi~--~~~lVgHN-a~FD~~fL~~~~~r~~g~~------l~~~~id~~~l~~~~--~~~------~~~~~~~~~~ 165 (203) T PRK09145 103 RQLLAFIG--NRPLVGYY-LEFDVAMLNRYVRPLLGIG------LPNPLIEVSSLYHDK--KER------HLPDAYIDLR 165 (203) T ss_pred HHHHHHCC--CCEEEEEC-HHHHHHHHHHHHHHHCCCC------CCCCEEEHHHHHHHH--HHH------HCCCCCCCCC T ss_conf 99998829--99899987-8999999999999962999------999877609879999--996------6769877889 Q ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 898998718876431013331899999999998642 Q gi|254780661|r 160 LQDLALANGIEHVNAHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 160 L~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~ 195 (471) |+.+|..+||+..++|+|++||.+|++|+..+.+.. T Consensus 166 L~~l~~~~gi~~~~~HrAL~DA~aTa~lfl~L~k~~ 201 (203) T PRK09145 166 FDAILKHLDLPVLGRHDALNDAIMTALIYLRLRKGD 201 (203) T ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 999998579878999798999999999999986789 No 19 >PRK07247 DNA polymerase III subunit epsilon; Validated Probab=99.97 E-value=2.9e-29 Score=206.82 Aligned_cols=169 Identities=19% Similarity=0.139 Sum_probs=135.3 Q ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 88379997527889877260489999998389728040022662178887785789898579988996079997999999 Q gi|254780661|r 2 TNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) +++||++|+||||+++ .++|||||||+++ +++++|+|+++ +. |...+++.+..+||||++|+.++| +..|++++ T Consensus 4 ~~~yvv~DlETTg~~~-~~~IIEIgAVk~~-~g~ivd~F~sl-Vn--P~~~I~~~i~~lTGIt~~~v~~aP-~~~eVl~~ 77 (195) T PRK07247 4 LETYIAFDLEFNTVND-VSHIIQVSAVKYD-HHKEVDSFDTY-VY--TDVPLQSFINGLTGITADKIAAAP-KVEEVLAA 77 (195) T ss_pred CCEEEEEEEECCCCCC-CCEEEEEEEEEEE-CCEEEEEEEEE-EC--CCCCCCHHHHHCCCCCHHHHHCCC-CHHHHHHH T ss_conf 8849999977899999-9806999999998-99899999835-66--989999555413598999983599-99999999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) +.+|++ +.++||||..+||..||+....+...++ ....+|.....|.. . ..+-++|||+ T Consensus 78 f~~Fig--d~~lVaHNa~~FD~~fL~~~~~~~~~~~-------~~~~~~~~~~~r~~--~----------~p~l~n~kL~ 136 (195) T PRK07247 78 FKNFVG--ELPLIGYNAQKSDLPILAENGLDLRDQY-------QVDLFDEAFDRRSS--D----------LNGIANLKLQ 136 (195) T ss_pred HHHHHC--CCEEEEECCCHHHHHHHHHHHHHHCCCC-------CCCHHHHHHHHHHH--C----------CCCCCCCCHH T ss_conf 999968--9918990884514999999877415776-------68569999999886--0----------5887677789 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8998718876431013331899999999998642268 Q gi|254780661|r 162 DLALANGIEHVNAHDAKADVYATLALVWLIREKKPKL 198 (471) Q Consensus 162 ~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~ 198 (471) .||+.+||+ +++|+|+.||.+|++|++++.+..++- T Consensus 137 tLa~~f~I~-~~~HRAl~DA~aTa~If~klLe~~~~~ 172 (195) T PRK07247 137 TVATFLGIK-GRGHNSLEDARMTARIYESFLETDTNK 172 (195) T ss_pred HHHHHCCCC-CCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 999974999-989081899999999999999976643 No 20 >PRK06063 DNA polymerase III subunit epsilon; Provisional Probab=99.97 E-value=2.7e-29 Score=207.02 Aligned_cols=167 Identities=16% Similarity=0.148 Sum_probs=141.9 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) .-|+++|+||||++|..|+|||||+|.++.++++.++|.++ + +|.+. |....|||||++|+.++| .+.|++++| T Consensus 15 ~~fvvvD~ETTGl~p~~drIieig~v~~~~~g~i~~~~~tl-v--nP~~d--p~~~~IhGIt~~~v~~aP-~f~ev~~~l 88 (313) T PRK06063 15 QGWAVVDVETSGFRPGQARIISVAVLGLDADGNVEQSVVSL-L--NPGVD--PGPTHVHGLTAAMLEGQP-QFADIAGEL 88 (313) T ss_pred CCEEEEEEECCCCCCCCCEEEEEEEEEEECCCEEEEEEEEE-E--CCCCC--CCCEEEECCCHHHHCCCC-CHHHHHHHH T ss_conf 99799997689999989706999999998997888999997-5--91999--898434187999981799-899999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) .+++. +.++|+|| +.||..||+..|.|.+++.. .+.++|++.++|.. ..+-+++||.. T Consensus 89 ~~~l~--g~vlVAHN-a~FD~~FL~~e~~r~g~~~p------~~~~lcTl~lARr~-------------~~~l~~~kL~t 146 (313) T PRK06063 89 AEVLR--GRTLVAHN-VAFDYSFLAAEAELAGAELP------VDQVMCTVELARRL-------------DLGLPNLRLET 146 (313) T ss_pred HHHHC--CCEEEEEC-CHHHHHHHHHHHHHCCCCCC------CCCEEEHHHHHHHH-------------CCCCCCCCHHH T ss_conf 99978--99799968-56529999999997399889------99667689988654-------------48999974788 Q ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99871887643101333189999999999864226 Q gi|254780661|r 163 LALANGIEHVNAHDAKADVYATLALVWLIREKKPK 197 (471) Q Consensus 163 la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~ 197 (471) +|..+||...++|+|+.|+.||.+++.-..++..+ T Consensus 147 la~~~Gi~~~~~H~Al~DAra~a~il~~~l~~a~~ 181 (313) T PRK06063 147 LAAHWGVVQQRPHDALDDARVLAGILAPALERARE 181 (313) T ss_pred HHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 99980999887655067789999998999986002 No 21 >PRK06631 consensus Probab=99.97 E-value=5e-29 Score=205.35 Aligned_cols=171 Identities=16% Similarity=0.124 Sum_probs=137.1 Q ss_pred CEEEEECCCCCCCCCC-CEEEEEEEEEECCCCEECCE-EEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 3799975278898772-60489999998389728040-022662178887785789898579988996079997999999 Q gi|254780661|r 4 HFVIYDYETFGRDVAL-DRPAQFAGVRVDRQWEKIES-TEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~-~~iiq~~~i~~d~~~~~~~~-~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) ..|++|+||||++|.. |+|||||||.+. +..+.++ |+. + ++|...+++.|..|||||++++.++| .+.+++++ T Consensus 5 r~VvlD~ETTGl~~~~gdrIIEIgav~~~-~~~~t~~~f~~-~--inP~r~i~~~~~~ihGIt~e~l~~~P-~f~ev~~~ 79 (229) T PRK06631 5 REIILDTETTGLDPQQGHRIVEIGAIEMV-NKVLTGRNFHF-Y--INPERDMPFEAYRIHGISGEFLKDKP-LFHTIADD 79 (229) T ss_pred CEEEEEEECCCCCCCCCCEEEEEEEEEEE-CCEEECCEEEE-E--ECCCCCCCHHHHHCCCCCHHHHHHCC-CHHHHHHH T ss_conf 88999825799899999879999999999-99880437999-9--89899797615120186789985387-79999999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) +.+|+. +..+|+|| +.||.+||++.|.+.+.++... ..-+..+|+|.++|.. +| | .+++|+ T Consensus 80 fl~Fi~--~s~LVaHN-A~FD~~FLn~El~~l~~~~~~~--~~~~~viDTL~lAR~~---~P----------g-~~~sLd 140 (229) T PRK06631 80 FLEFIS--DSKLIIHN-APFDIKFLNHELSLLKRTEIKL--LELANTIDTLVMARSM---FP----------G-SKYNLD 140 (229) T ss_pred HHHHCC--CCEEEEEC-CHHHHHHHHHHHHHHCCCCCCC--CCCCCCCCHHHHHHHH---CC----------C-CCCCHH T ss_conf 999829--99799976-3875989999999827888884--3201101269999986---89----------9-888999 Q ss_pred HHHHHHCCCCC--CCCHHHHHHHHHHHHHHH-HHHHHHHH Q ss_conf 89987188764--310133318999999999-98642268 Q gi|254780661|r 162 DLALANGIEHV--NAHDAKADVYATLALVWL-IREKKPKL 198 (471) Q Consensus 162 ~la~~n~i~~~--~aH~A~~D~~~t~~l~k~-i~~~~p~~ 198 (471) .||+.+||+.. .+|+|+.||.+|++|.-. +..+++.+ T Consensus 141 aLc~r~~I~~s~r~~H~AL~DA~llA~Vyl~l~gG~q~~~ 180 (229) T PRK06631 141 ALCKRFKVDNSGRQLHGALKDAALLAEVYVELTGGRQSAF 180 (229) T ss_pred HHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9999849999985556868999999999999717886665 No 22 >cd06136 TREX1_2 Three prime repair exonuclease (TREX) 1 and 2 are closely related DEDDh-type DnaQ-like 3'-5' exonucleases. They contain three conserved sequence motifs known as ExoI, II, and III, with a specific Hx(4)D conserved pattern at ExoIII. These motifs contain four conserved acidic residues that participate in coordination of divalent metal ions required for catalysis. Both proteins play a role in the metabolism and clearance of DNA. TREX1 is the major 3'-5' exonuclease activity detected in mammalian cells. Mutations in the human TREX1 gene can cause Aicardi-Goutieres syndrome (AGS), which is characterized by perturbed innate immunity and presents itself as a severe neurological disease. TREX1 degrades ssDNA generated by aberrant replication intermediates to prevent checkpoint activation and autoimmune disease. There are distinct structural differences between TREX1 and 2 that point to different biological roles for these proteins. The main difference is the presence of about 7 Probab=99.97 E-value=3.2e-29 Score=206.56 Aligned_cols=159 Identities=23% Similarity=0.218 Sum_probs=129.5 Q ss_pred EEEEECCCCCCCCCC-CEEEEEEEEEECCCC------------EECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHC Q ss_conf 799975278898772-604899999983897------------2804002266217888778578989857998899607 Q gi|254780661|r 5 FVIYDYETFGRDVAL-DRPAQFAGVRVDRQW------------EKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRD 71 (471) Q Consensus 5 ~v~~D~ETtG~~~~~-~~iiq~~~i~~d~~~------------~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~ 71 (471) ||+||+||||+++.. ++|||+|+|+++.++ +++|+++++ + +|+..++|++..+||||++|+.++ T Consensus 1 fvv~D~ETTGL~~~~~~rIiEI~av~v~~~~~~~~~~~~~~~~~i~d~~~~l-v--nP~~~Ip~~it~itGIt~emv~~~ 77 (177) T cd06136 1 FVFLDLETTGLPKHNRPEITELCLVAVHRDHLLNTSRDKPALPRVLDKLSLC-F--NPGRAISPGASEITGLSNDLLEHK 77 (177) T ss_pred CEEEEEECCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEE-E--CCCCCCCHHHHHHCCCCHHHHHCC T ss_conf 9899867899798899855899999997787444311124566214225676-4--879938876887518899998159 Q ss_pred CCCHHHHHHHHHHHHHCC--CCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 999799999999995117--965999505578999999999850785321325688710015888865556532100244 Q gi|254780661|r 72 GVVEAEFSRRIHQFFSVP--NTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWP 149 (471) Q Consensus 72 ~~~~~e~~~~i~~~~~~~--~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~ 149 (471) |....++.+.+..|+..- +.++||||+..||..||+..|.|.+++.. ......|+|.++|.. T Consensus 78 ~~~~~~~~~~~~~F~~~~~~~~~LVAHNa~~FD~~fL~~e~~r~g~~~p-----~~~~~iDtL~l~r~~----------- 141 (177) T cd06136 78 APFDSDTANLIKLFLRRQPKPICLVAHNGNRFDFPILRSELERLGTKLP-----DDILCVDSLPAFREL----------- 141 (177) T ss_pred CCHHHHHHHHHHHHHHHCCCCCEEEECCCHHCCHHHHHHHHHHCCCCCC-----CCCEEEEEHHHHHHH----------- T ss_conf 9817999999999996267786799648512379999999998699899-----898899860764550----------- Q ss_pred CCCCCCCCCCHHHHHHH-HCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 44454433108989987-18876431013331899999999 Q gi|254780661|r 150 SRDDGATSFKLQDLALA-NGIEHVNAHDAKADVYATLALVW 189 (471) Q Consensus 150 ~~~~~~~~~kL~~la~~-n~i~~~~aH~A~~D~~~t~~l~k 189 (471) ..+|+.+++. +|+..++||+|++||.||++|+. T Consensus 142 -------~~~L~~l~~~~~gi~~~~aHRAl~Da~at~~vfl 175 (177) T cd06136 142 -------DQSLGSLYKRLFGQEPKNSHTAEGDVLALLKCAL 175 (177) T ss_pred -------CCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHC T ss_conf -------6999999999759999877784899999999975 No 23 >cd06131 DNA_pol_III_epsilon_Ecoli_like This subfamily is composed of the epsilon subunit of Escherichia coli DNA polymerase III (polIII) and similar proteins. PolIII is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. It is a holoenzyme complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The polIII epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon subunit of polIII also functions as a stabilizer of the holoenzyme complex. Probab=99.96 E-value=3.8e-28 Score=199.64 Aligned_cols=161 Identities=19% Similarity=0.127 Sum_probs=129.5 Q ss_pred EEEECCCCCCCCCC-CEEEEEEEEEECCCCEECC-EEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 99975278898772-6048999999838972804-002266217888778578989857998899607999799999999 Q gi|254780661|r 6 VIYDYETFGRDVAL-DRPAQFAGVRVDRQWEKIE-STEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 6 v~~D~ETtG~~~~~-~~iiq~~~i~~d~~~~~~~-~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) |++|+||||++|.. |+|||+|||+++. +.+++ +++. +++| ...++++++.|||||++++.++| ++.|++.++. T Consensus 2 Vv~D~ETTGl~p~~~d~Iieigav~~~~-~~~~~~~~~~-lv~P--~~~i~~~~~~ihGIt~e~l~~~p-~~~ev~~~f~ 76 (167) T cd06131 2 IVLDTETTGLDPREGHRIIEIGCVELIN-RRLTGNTFHV-YINP--ERDIPEEAFKVHGITDEFLADKP-KFAEIADEFL 76 (167) T ss_pred EEEECCCCCCCCCCCCEEEEEEEEEEEC-CEEECCEEEE-EECC--CCCCCHHHHHHHCCCHHHHCCCC-CHHHHHHHHH T ss_conf 8997606988988998379999999999-9992447899-9897--89899889978096858983597-5899999998 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 99511796599950557899999999985078532132568871001588886555653210024444454433108989 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) ++++ +..+|||| ++||..||++.+.|++.++.. ......+|++.++|.. .| ...++|+.+ T Consensus 77 ~~l~--~~~lVaHN-~~FD~~fL~~~~~r~~~~~~~---~~~~~~~Dtl~l~r~~---~p-----------~~~~~L~~l 136 (167) T cd06131 77 DFIR--GAELVIHN-ASFDVGFLNAELSLLGLGKKI---IDFCRVIDTLALARKK---FP-----------GKPNSLDAL 136 (167) T ss_pred HHCC--CCEEEEEC-HHHHHHHHHHHHHHHCCCCCC---CCCCCCCCHHHHHHHH---CC-----------CCCCCHHHH T ss_conf 6356--99999837-187899999999981899777---7655411399999998---49-----------998889999 Q ss_pred HHHHCCCCCC--CCHHHHHHHHHHHHHHHH Q ss_conf 9871887643--101333189999999999 Q gi|254780661|r 164 ALANGIEHVN--AHDAKADVYATLALVWLI 191 (471) Q Consensus 164 a~~n~i~~~~--aH~A~~D~~~t~~l~k~i 191 (471) |+.+||+..+ +|+|++||.+|++|+..+ T Consensus 137 ~~~~gi~~~~~~~H~Al~Da~~ta~vf~~l 166 (167) T cd06131 137 CKRFGIDNSHRTLHGALLDAELLAEVYLEL 166 (167) T ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 987697999998758899999999999976 No 24 >smart00479 EXOIII exonuclease domain in DNA-polymerase alpha and epsilon chain, ribonuclease T and other exonucleases. Probab=99.96 E-value=3.6e-28 Score=199.82 Aligned_cols=167 Identities=26% Similarity=0.259 Sum_probs=137.4 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 37999752788987726048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) ++|+||+||||+++..++|||+|+|.+|. ++++++++. +|+|. ..++|++..+||||++++.++ .++.++++++. T Consensus 1 ~~v~~D~ETTGl~~~~~~Iieigav~~~~-~~~~~~~~~-~i~P~--~~i~~~~~~i~GIt~~~l~~~-~~~~~v~~~~~ 75 (169) T smart00479 1 TLVVIDCETTGLDPGKDEIIEIAAVDVDG-GRIIVVFDT-YVKPD--RPITDYATEIHGITPEMLDDA-PTFEEVLEELL 75 (169) T ss_pred CEEEEEEECCCCCCCCCEEEEEEEEEEEC-CEEEEEEEE-EECCC--CCCCHHHHHHCCCCCHHCCCC-CCHHHHHHHHH T ss_conf 99999985279999998699999999999-988788898-87999--999989986318472120458-66999999999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 99511796599950557899999999985078532132568871001588886555653210024444454433108989 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) +++. +..++++|+.+||..+|++.+.+.+.+.. .....+|++.+.+.. . +.++++|+.+ T Consensus 76 ~~l~--~~~~v~~~~~~fD~~~L~~~~~~~~~~~~-----~~~~~~dt~~~~~~~---~-----------~~~~~~L~~l 134 (169) T smart00479 76 EFLK--GKILVAGNALNFDLRFLKLEHPRLGIKDP-----PKNPVIDTLKLARAL---N-----------PGRKYSLKKL 134 (169) T ss_pred HHHC--CCCCHHCCCHHHHHHHHHHHHHHHCCCCC-----CCCCHHHHHHHHHHH---C-----------CCCCHHHHHH T ss_conf 8604--87451036799999999999998189988-----665246299999997---5-----------8854589999 Q ss_pred HHHHCCCCC-CCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 987188764-310133318999999999986422 Q gi|254780661|r 164 ALANGIEHV-NAHDAKADVYATLALVWLIREKKP 196 (471) Q Consensus 164 a~~n~i~~~-~aH~A~~D~~~t~~l~k~i~~~~p 196 (471) |..+|++.. ++|+|++||.+|++|++.+.++.. T Consensus 135 ~~~~~i~~~~~~H~Al~Da~~t~~v~~~l~~~~~ 168 (169) T smart00479 135 AERLGLEVIGRAHRALDDARATAKLFKKLVERLL 168 (169) T ss_pred HHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf 9985999899992979999999999999997724 No 25 >PRK06195 DNA polymerase III subunit epsilon; Validated Probab=99.96 E-value=3.1e-28 Score=200.25 Aligned_cols=162 Identities=21% Similarity=0.243 Sum_probs=136.3 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 37999752788987726048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) +||++|+|||| +..|+|+|||++++ .++++++++.+ .++|.+ ...+|....|||||++|+.++| ++.|+++++. T Consensus 2 ~fvviD~ETt~--~~~d~iieIg~v~v-~~g~iv~~~~~-Li~P~~-~~~~p~~i~ihGIt~~~v~~aP-~f~ev~~~~~ 75 (309) T PRK06195 2 DFVAIDFETAN--EKRNSPCSIGIVVV-KDGEIVEKVHY-LIKPKE-MRFMPINIGIHGIRPHMVQDEL-EFDKIWEKIK 75 (309) T ss_pred CEEEEEEECCC--CCCCEEEEEEEEEE-ECCEEEEEEEE-EECCCC-CCCCCCEEEECCCCHHHHHCCC-CHHHHHHHHH T ss_conf 29999978999--99980599999999-99999999999-989999-8888744640377999996599-9999999999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 99511796599950557899999999985078532132568871001588886555653210024444454433108989 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) +++. +.++|||| ++||..||+..+.++++++. .+.++|++.++|.++ .+-+++||++| T Consensus 76 ~fi~--~~vlVaHN-a~FD~~fL~~~~~r~gl~~~------~~~~~cTl~LAR~~~-------------p~l~~~kL~~L 133 (309) T PRK06195 76 DYFN--DNLVIAHN-ASFDISVLRKTLELYNIPMP------DFEYICTMKLAKNFY-------------SNIPNARLNTV 133 (309) T ss_pred HHHC--CCEEEEEC-CHHHHHHHHHHHHHCCCCCC------CCCEEEHHHHHHHHC-------------CCCCCCCHHHH T ss_conf 9857--99799956-28889999999998499999------997774799999865-------------57765788899 Q ss_pred HHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 9871887643101333189999999999864 Q gi|254780661|r 164 ALANGIEHVNAHDAKADVYATLALVWLIREK 194 (471) Q Consensus 164 a~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~ 194 (471) |+.+||+. ++|+|++||.||++++-.+.+. T Consensus 134 a~~~gi~~-~hH~Al~DA~A~a~I~~~~~~~ 163 (309) T PRK06195 134 NNFLGYEF-KHHDALEDAMACSNILLNISKE 163 (309) T ss_pred HHHHCCCC-CCCCCCCCHHHHHHHHHHHHHH T ss_conf 99859880-1168510199999999999998 No 26 >PRK06022 consensus Probab=99.96 E-value=6.7e-28 Score=198.08 Aligned_cols=168 Identities=24% Similarity=0.272 Sum_probs=134.0 Q ss_pred EEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCC-CCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 7999752788987726048999999838972804002266217888-778578989857998899607999799999999 Q gi|254780661|r 5 FVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADD-YLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 5 ~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~-~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) .|++|+||||++|..|+|||+|+|.+.....+-.+|+. ++ +|. ..++|+|..|||||++++.++| ++.|++.+|. T Consensus 3 ~IvlDtETTGl~~~~dRIIEIg~vel~~~~~t~~~f~~-~i--nP~~r~I~~~a~~IHGIt~e~L~~~P-~F~eva~~fl 78 (234) T PRK06022 3 EIIFDTETTGLERREDRVIEIGGVELVNRFPTGRTFHK-FI--NPQGRQVHPDALAVHGISDEQLLDKP-VFAEILDEFL 78 (234) T ss_pred EEEEEEECCCCCCCCCCEEEEEEEEEECCCCCCCEEEE-EE--CCCCCCCCHHHHHHHCCCHHHHCCCC-CHHHHHHHHH T ss_conf 69998227998998970799999999778633657899-98--99998689889988560899870599-7999999999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 99511796599950557899999999985078532132568871001588886555653210024444454433108989 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) +|+. +.++|+|| ++||..||++.|.|.+.++.. .....|+|.++|.. +| |. +.+|+.| T Consensus 79 ~Fi~--~a~LVaHN-A~FD~gFLn~El~r~G~~~~~-----~~~viDTL~LAR~~---~P----------g~-~~sLdaL 136 (234) T PRK06022 79 EFFD--GARLVAHN-ATFDLGFINAEFARLGQPEIT-----SERVVDTLALARRK---HP----------MG-PNSLDAL 136 (234) T ss_pred HHHC--CCEEEEEC-HHHHHHHHHHHHHHCCCCCCC-----HHHHHHHHHHHHHH---CC----------CC-CCCHHHH T ss_conf 9818--98599955-787299999999985998777-----88874699999986---88----------99-8898999 Q ss_pred HHHHCCCCCC--CCHHHHHHHHHHHHH-HHHHHHHHHH Q ss_conf 9871887643--101333189999999-9998642268 Q gi|254780661|r 164 ALANGIEHVN--AHDAKADVYATLALV-WLIREKKPKL 198 (471) Q Consensus 164 a~~n~i~~~~--aH~A~~D~~~t~~l~-k~i~~~~p~~ 198 (471) |+.+||+..+ .|.|+.||..+++|. .+...+++.+ T Consensus 137 c~r~~Id~~~R~~HgAL~Da~lLa~VyleL~GG~q~~l 174 (234) T PRK06022 137 CRRYGIDNSHRTKHGALLDSELLAEVYIELIGGKQTAL 174 (234) T ss_pred HHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCC T ss_conf 99839997773537628889999999999728886751 No 27 >cd06134 RNaseT RNase T is a DEDDh-type DnaQ-like 3'-5' exoribonuclease E implicated in the 3' maturation of small stable RNAs and 23srRNA, and in the end turnover of tRNA. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. RNase T is related to the proofreading domain of DNA polymerase III. Despite its important role, RNase T is mainly found only in gammaproteobacteria. It is speculated that it might have originated from DNA polymerase III at the time the gamma division of proteobacteria diverged from other bacteria. RNase T is a homodimer with the catalytic residues of one monomer contacting a large basic patch on the other monomer to form a functional active site. Probab=99.96 E-value=1.5e-27 Score=195.93 Aligned_cols=170 Identities=25% Similarity=0.204 Sum_probs=131.6 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEE--CCCCEE--CCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHH Q ss_conf 379997527889877260489999998--389728--0400226621788877857898985799889960799979999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRV--DRQWEK--IESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFS 79 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~--d~~~~~--~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~ 79 (471) .+|++|+||||++|..|+|||||||++ |..+++ .+++. ..++|.++...+|.++.+||||+.++.+...++.+++ T Consensus 6 ~~VV~D~ETTGL~p~~d~IIEIgav~v~~d~~g~i~~~~~~~-~~v~P~~~~~i~~~~~~itGIt~~~~~~~~~~~~~~~ 84 (189) T cd06134 6 LPVVVDVETGGFNPQTDALLEIAAVTLEMDEQGNLYPDETFH-FHILPFEGANLDPAALEFNGIDPFHPFRFAVDEKEAL 84 (189) T ss_pred CCEEEEEECCCCCCCCCEEEEEEEEEEEECCCCCCCCCCEEE-EEECCCCCCCCCHHHHHHCCCCCHHHHCCCCCHHHHH T ss_conf 507999528999988980799999999976878532451689-9778988998985566550777001100167577889 Q ss_pred HHHHHHHHCC-------CCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9999995117-------965999505578999999999850785321325688710015888865556532100244444 Q gi|254780661|r 80 RRIHQFFSVP-------NTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRD 152 (471) Q Consensus 80 ~~i~~~~~~~-------~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~ 152 (471) +++.+++.+. ++++|||| +.||..||++.+.|+++.... ......+|++.+++.. +| T Consensus 85 ~~~~~~i~~~~~~~~~~~avlVaHN-a~FD~~FL~~~~~r~g~~~~p---~~~~~~~DTl~La~~~---~~--------- 148 (189) T cd06134 85 KEIFKPIRKALKAQGCTRAILVGHN-AHFDLGFLNAAVARCKIKRNP---FHPFSTFDTATLAGLA---YG--------- 148 (189) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEC-HHHHHHHHHHHHHHCCCCCCC---CCCCCEEEHHHHHHHH---HC--------- T ss_conf 9999999999875167555687517-777299999999984998888---8777656699998677---38--------- Q ss_pred CCCCCCCHHHHHHHHCCCCC--CCCHHHHHHHHHHHHHHHHHHH Q ss_conf 54433108989987188764--3101333189999999999864 Q gi|254780661|r 153 DGATSFKLQDLALANGIEHV--NAHDAKADVYATLALVWLIREK 194 (471) Q Consensus 153 ~~~~~~kL~~la~~n~i~~~--~aH~A~~D~~~t~~l~k~i~~~ 194 (471) +.+|+.+|+..||+.+ .+|+|++||.+|++|+-.+.++ T Consensus 149 ----~~~L~~l~~~~gi~~~~~~~HrAl~DA~~ta~lf~~l~~k 188 (189) T cd06134 149 ----QTVLAKACQAAGIEFDNKEAHSALYDTQKTAELFCKIVNR 188 (189) T ss_pred ----CCCHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf ----0019999998799999987887299999999999999964 No 28 >PRK05711 DNA polymerase III subunit epsilon; Provisional Probab=99.96 E-value=2e-27 Score=195.10 Aligned_cols=169 Identities=21% Similarity=0.167 Sum_probs=135.0 Q ss_pred EEEEECCCCCCCCCC-CEEEEEEEEEECCCCEECC-EEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 799975278898772-6048999999838972804-00226621788877857898985799889960799979999999 Q gi|254780661|r 5 FVIYDYETFGRDVAL-DRPAQFAGVRVDRQWEKIE-STEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 5 ~v~~D~ETtG~~~~~-~~iiq~~~i~~d~~~~~~~-~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) .|++|+||||++|.. |+|||||+|.+. +..+.+ +|+.+ ++|...++|+|..|||||++++.++| ++.|++.++ T Consensus 6 ~VvlDtETTGl~~~~gdRIIEIgav~~~-~~~~t~~~f~~~---inP~r~i~~~a~~IhGIT~e~L~~~P-~F~ei~~ef 80 (240) T PRK05711 6 QIVLDTETTGLNQREGHRIIEIGAVELI-NRRLTGRNFHVY---IKPDRDVDPEALAVHGITDEFLADKP-TFAEVADEF 80 (240) T ss_pred EEEEEEECCCCCCCCCCEEEEEEEEEEE-CCEECCCEEEEE---ECCCCCCCHHHHHHCCCCHHHHCCCC-CHHHHHHHH T ss_conf 8999866899699999878999999999-998814357899---89689599778754086889964587-599999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) .+|+. +.++|||| +.||..||.+.|.|.+.+.... ...+...|+|.++|.. +| |. +.+|+. T Consensus 81 l~Fi~--~s~lVaHN-a~FD~~FLn~El~r~g~~~~~~--~~~~~viDTL~lAR~~---~P----------g~-~~sLda 141 (240) T PRK05711 81 LDFIR--GAELIIHN-APFDIGFMDYEFALLGRDIPKT--TTFCKVTDTLAMARRM---FP----------GK-RNSLDA 141 (240) T ss_pred HHHHC--CCEEEEEC-CHHHHHHHHHHHHHHCCCCCCH--HHHCCHHHHHHHHHHH---CC----------CC-CCCHHH T ss_conf 99818--99799966-1998999999999868998764--4521210199999987---89----------99-889999 Q ss_pred HHHHHCCCCCC--CCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99871887643--101333189999999999864226 Q gi|254780661|r 163 LALANGIEHVN--AHDAKADVYATLALVWLIREKKPK 197 (471) Q Consensus 163 la~~n~i~~~~--aH~A~~D~~~t~~l~k~i~~~~p~ 197 (471) ||+.+||+... .|.|+.||.++++|.-.+...+.. T Consensus 142 Lc~r~~I~~s~R~~HgAL~Da~llA~Vyl~l~gGQ~~ 178 (240) T PRK05711 142 LCKRYGIDNSHRTLHGALLDAEILAEVYLAMTGGQTS 178 (240) T ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC T ss_conf 9998098988756576488899999999997178756 No 29 >cd06130 DNA_pol_III_epsilon_like This subfamily is composed of uncharacterized bacterial proteins with similarity to the epsilon subunit of DNA polymerase III (polIII), a multisubunit polymerase which is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. The pol III holoenzyme is a complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The polIII epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon subunit of polIII also functions as a stabilizer of the holoenzyme complex Probab=99.96 E-value=2e-27 Score=194.98 Aligned_cols=154 Identities=22% Similarity=0.240 Sum_probs=129.3 Q ss_pred EEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHH Q ss_conf 79997527889877260489999998389728040022662178887785789898579988996079997999999999 Q gi|254780661|r 5 FVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQ 84 (471) Q Consensus 5 ~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~ 84 (471) ||++|+|||| +..|+|||+|+|+++ ++.++++|+. +|+| ...++|.+..+||||++++.++| ++.++++++.+ T Consensus 1 fvv~D~ETT~--~~~d~IieIgav~i~-~g~i~~~f~~-lv~P--~~~i~~~~~~ihGIt~~~v~~~p-~~~~v~~~l~~ 73 (156) T cd06130 1 FVAIDFETAN--ADRASACSIGLVKVR-DGQIVDTFYT-LIRP--PTRFDPFNIAIHGITPEDVADAP-TFPEVWPEIKP 73 (156) T ss_pred CEEEEEECCC--CCCCEEEEEEEEEEE-CCEEEEEEEE-EECC--CCCCCHHHHHHHCCCHHHHHCCC-CHHHHHHHHHH T ss_conf 9899977898--999969999999999-9999899999-8778--98799889322088879984099-99999999998 Q ss_pred HHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH Q ss_conf 95117965999505578999999999850785321325688710015888865556532100244444544331089899 Q gi|254780661|r 85 FFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLA 164 (471) Q Consensus 85 ~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la 164 (471) ++. +.++|||| ++||..||++.+.|+++++. .+..+|++.++|..+ .+.+++||+.+| T Consensus 74 ~l~--~~~lVaHn-~~FD~~fL~~~~~~~~~~~~------~~~~iDtl~l~r~~~-------------~~~~~~~L~~l~ 131 (156) T cd06130 74 FLG--GSLVVAHN-ASFDRSVLRAALEAYGLPPP------PYQYLCTVRLARRVW-------------PLLPNHKLNTVA 131 (156) T ss_pred HCC--CCEEEEEC-HHHHHHHHHHHHHHCCCCCC------CCCEECHHHHHHHHC-------------CCCCCCCHHHHH T ss_conf 558--98899957-78899999999998199998------985881899999973-------------888889999999 Q ss_pred HHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 871887643101333189999999 Q gi|254780661|r 165 LANGIEHVNAHDAKADVYATLALV 188 (471) Q Consensus 165 ~~n~i~~~~aH~A~~D~~~t~~l~ 188 (471) +.+|++.. +|+|++||.+|++|+ T Consensus 132 ~~~gi~~~-~H~Al~DA~ata~l~ 154 (156) T cd06130 132 EHLGIELN-HHDALEDARACAEIL 154 (156) T ss_pred HHCCCCCC-CCCCHHHHHHHHHHH T ss_conf 98599987-879689999999998 No 30 >PRK08858 consensus Probab=99.96 E-value=3.6e-27 Score=193.42 Aligned_cols=172 Identities=20% Similarity=0.186 Sum_probs=137.9 Q ss_pred CCEEEEECCCCCCCCCC------CEEEEEEEEEECCCCEECC-EEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCH Q ss_conf 83799975278898772------6048999999838972804-0022662178887785789898579988996079997 Q gi|254780661|r 3 NHFVIYDYETFGRDVAL------DRPAQFAGVRVDRQWEKIE-STEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVE 75 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~------~~iiq~~~i~~d~~~~~~~-~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~ 75 (471) +-+|++|+||||+++.. |+|||||||.+. +..+++ +|+++ + +|.+.++++|..|||||++++.++| ++ T Consensus 10 ~R~VVlDtETTGl~~~~~~~~~gdRIIEIgaV~i~-~~~~t~~~f~~~-I--nP~r~Ip~~a~~ihGIT~e~L~daP-~F 84 (246) T PRK08858 10 QRIVVLDTETTGMNREGGPHYEGHRIIEIGAVEII-NRKLTGRHFHVY-L--KPDREIQPEAIDVHGITDEFLVDKP-EY 84 (246) T ss_pred CEEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEE-CCEEECCEEEEE-E--CCCCCCCHHHHHHCCCCHHHHHCCC-CH T ss_conf 47999985789999765677789879999999999-999822147888-8--9799699889977392899983597-58 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999999995117965999505578999999999850785321325688710015888865556532100244444544 Q gi|254780661|r 76 AEFSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGA 155 (471) Q Consensus 76 ~e~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~ 155 (471) .+++.++.+|+. +..+|||| +.||.+||++.|-|.+.++.. ...-+...|+|.++|.. + .|. T Consensus 85 ~eI~~efl~FI~--d~~LVaHN-a~FD~~FLn~El~r~~~~~~~--~~~~~~viDTL~lAR~~---~----------Pg~ 146 (246) T PRK08858 85 KDVHQEFLEFIK--GAELVAHN-APFDVGFMDYEFEKLNPAIGK--TDDYCKVTDTLAMAKKI---F----------PGK 146 (246) T ss_pred HHHHHHHHHHHC--CCEEEEEC-CHHHHHHHHHHHHHHCCCCCC--HHHHCCCEEHHHHHHHH---C----------CCC T ss_conf 999999999958--99899972-288899999999982877675--13420100299999986---8----------999 Q ss_pred CCCCHHHHHHHHCCCCC--CCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 33108989987188764--31013331899999999998642268 Q gi|254780661|r 156 TSFKLQDLALANGIEHV--NAHDAKADVYATLALVWLIREKKPKL 198 (471) Q Consensus 156 ~~~kL~~la~~n~i~~~--~aH~A~~D~~~t~~l~k~i~~~~p~~ 198 (471) +.+|+.||+.++|+.. ..|.|+.||..+++|.-.+...++.+ T Consensus 147 -~~sLdaLc~r~~Id~s~R~~HgALlDa~lLa~Vyl~mtgGQ~~l 190 (246) T PRK08858 147 -RNNLDVLCERYGIDNSHRTLHGALLDAEILADVYLLMTGGQTSL 190 (246) T ss_pred -CCCHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCC T ss_conf -88889999981989888340230888999999999972898764 No 31 >PRK07009 consensus Probab=99.96 E-value=5.7e-27 Score=192.13 Aligned_cols=166 Identities=20% Similarity=0.189 Sum_probs=134.8 Q ss_pred EEEEECCCCCCCCCC-CEEEEEEEEEECCCCEECCE-EEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 799975278898772-60489999998389728040-0226621788877857898985799889960799979999999 Q gi|254780661|r 5 FVIYDYETFGRDVAL-DRPAQFAGVRVDRQWEKIES-TEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 5 ~v~~D~ETtG~~~~~-~~iiq~~~i~~d~~~~~~~~-~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) .|++|+||||++|.. |+|||||+|.+ .++.++++ |+. + ++|....+|+|..|||||++++.++| ++.+++.++ T Consensus 3 ~VvlDtETTGl~~~~gdRIIEIg~v~l-~~~~~t~~~~~~-y--iNP~r~I~~~a~~IHGIt~e~L~~kP-~F~eia~ef 77 (241) T PRK07009 3 QIILDTETTGLNARTGDRIIEIGCVEL-LNRRLTGNNLHF-Y--VNPERDSDPGALAVHGLTTEFLSDKP-KFAEVVDQI 77 (241) T ss_pred EEEEEEECCCCCCCCCCEEEEEEEEEE-ECCEEECEEEEE-E--ECCCCCCCHHHHHHHCCCHHHHHCCC-CHHHHHHHH T ss_conf 799982579979999987899999999-999780406899-8--78899799757651225799872585-599999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) .+|+. +.++|+|| ++||..||.+.|.|.++||.. .......|+|.++|.. +| |. +.+|+. T Consensus 78 l~Fi~--~a~LVaHN-A~FD~~FLn~EL~r~G~~~~~---~~~~~viDTL~lAR~~---~P----------g~-~~sLda 137 (241) T PRK07009 78 RDFVQ--DAELIIHN-APFDLGFLDAEFALLGLPPFT---EHCAGVIDTLVQAKQM---FP----------GK-RNSLDA 137 (241) T ss_pred HHHHC--CCEEEEEC-HHHHHHHHHHHHHHCCCCCCC---HHHHHHHHHHHHHHHH---CC----------CC-CCCHHH T ss_conf 99966--99278867-699999999999983997310---2444476799999997---89----------99-878899 Q ss_pred HHHHHCCCCCC--CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99871887643--1013331899999999998642 Q gi|254780661|r 163 LALANGIEHVN--AHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 163 la~~n~i~~~~--aH~A~~D~~~t~~l~k~i~~~~ 195 (471) ||+.++|+... .|.|+.||..+++|.-.+-..+ T Consensus 138 Lc~R~~I~~~~r~~HgAL~Da~lLa~vyl~mtgGq 172 (241) T PRK07009 138 LCDRFGISNAHRTLHGALLDSELLAEVYLAMTRGQ 172 (241) T ss_pred HHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 99982869887330242877999999999983897 No 32 >pfam00929 Exonuc_X-T Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III.; Probab=99.95 E-value=2.7e-27 Score=194.16 Aligned_cols=161 Identities=27% Similarity=0.232 Sum_probs=132.3 Q ss_pred EEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHH Q ss_conf 79997527889877260489999998389728040022662178887785789898579988996079997999999999 Q gi|254780661|r 5 FVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQ 84 (471) Q Consensus 5 ~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~ 84 (471) +||+|+||||++|..++|||+|||++|.++.+.++++. +++|. ..++|++..+||||++++.+.+ +..+++.++.+ T Consensus 1 ~v~~D~ETTGl~~~~~~Iieigav~~~~~~~~~~~~~~-~v~P~--~~i~~~~~~i~GIt~~~l~~~~-~~~~~~~~~~~ 76 (162) T pfam00929 1 LVVIDCETTGLDPEKDRIIEIAAVSIVGGENIGPVFDT-YVKPE--RLITDEATKFHGITPEMLRNAP-SFEEVLEAFLE 76 (162) T ss_pred CEEEEEECCCCCCCCCEEEEEEEEEEECCEEEEEEEEE-EECCC--CCCCHHHHHHCCCCHHHHHHCC-CHHHHHHHHHH T ss_conf 98999868999899975899999999999776555678-77788--8889999987397877754153-11779999999 Q ss_pred HHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH Q ss_conf 95117965999505578999999999850785321325688710015888865556532100244444544331089899 Q gi|254780661|r 85 FFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLA 164 (471) Q Consensus 85 ~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la 164 (471) ++.. +.+++||| ..||..++++.+++.+..++ ...+..+|++.+.+..+ .+.++++|+.++ T Consensus 77 ~l~~-~~~~v~~n-~~fD~~~l~~~~~~~~~~~~----~~~~~~~dt~~~~~~~~-------------~~~~~~~L~~l~ 137 (162) T pfam00929 77 FLKK-LKILVGHN-ASFDVGFLLYDDLRFLKLPH----PKLNDVIDTLILDKATY-------------KGFKRRSLDALA 137 (162) T ss_pred HHCC-CCEEECCC-CCHHHHHHHHHHHHHCCCCC----CCCCCHHHHHHHHHHHH-------------CCCCCCCHHHHH T ss_conf 8425-85777788-77899999999998586336----63333044899999975-------------044679989999 Q ss_pred HHHCCCC-CCCCHHHHHHHHHHHHH Q ss_conf 8718876-43101333189999999 Q gi|254780661|r 165 LANGIEH-VNAHDAKADVYATLALV 188 (471) Q Consensus 165 ~~n~i~~-~~aH~A~~D~~~t~~l~ 188 (471) +.+|+.. ++||+|++||.+|++|+ T Consensus 138 ~~~~i~~~~~aH~Al~Da~~t~~lf 162 (162) T pfam00929 138 EKLGLEKIQRAHRALDDARATAELF 162 (162) T ss_pred HHCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 8859999888678299999999879 No 33 >cd06133 ERI-1_3'hExo_like This subfamily is composed of Caenorhabditis elegans ERI-1, human 3' exonuclease (3'hExo), Drosophila exonuclease snipper (snp), and similar proteins from eukaryotes and bacteria. These are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ERI-1 has been implicated in the degradation of small interfering RNAs (RNAi). 3'hExo participates in the degradation of histone mRNAs. Snp is a non-essential exonuclease that efficiently degrades structured RNA and DNA substrates as long as there is a minimum of 2 nucleotides in the 3' overhang to initiate degradation. Snp is not a functional homolog of either ERI-1 or 3'hExo. Probab=99.95 E-value=1.5e-26 Score=189.45 Aligned_cols=168 Identities=13% Similarity=0.017 Sum_probs=132.3 Q ss_pred EEEEECCCCCCCCCCCE-----EEEEEEEEEC-CCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHH Q ss_conf 79997527889877260-----4899999983-89728040022662178887785789898579988996079997999 Q gi|254780661|r 5 FVIYDYETFGRDVALDR-----PAQFAGVRVD-RQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEF 78 (471) Q Consensus 5 ~v~~D~ETtG~~~~~~~-----iiq~~~i~~d-~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~ 78 (471) ||++|+||||+++..++ |||||||++| .++.++++|++ +++|......+|.+..+||||++|+.++| ++.++ T Consensus 1 yvv~D~EtTg~~~~~~~~~~~eIIeIgav~vd~~~~~i~~~f~~-lI~P~~~~~i~~~i~~itGIt~~~l~~ap-~~~~v 78 (176) T cd06133 1 YLVIDFEATCWEGNSKPDYPNEIIEIGAVLVDVKTKEIIDTFSS-YVKPVINPKLSDFCTELTGITQEDVDNAP-SFPEV 78 (176) T ss_pred CEEEEEECCCCCCCCCCCCCCCEEEEEEEEEECCCCEEEEEEEE-EECCCCCCCCCHHHHHHHCCCHHHHHCCC-CHHHH T ss_conf 98999726899878898999707999999998799979899999-97587688789889977381887870786-39999 Q ss_pred HHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99999995117965999505578999999999850785321325688710015888865556532100244444544331 Q gi|254780661|r 79 SRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSF 158 (471) Q Consensus 79 ~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~ 158 (471) ++++.+++.... .+..+++.+||..++.+.+.+++..... ...+..+|+..+.+.. .| ..+++ T Consensus 79 ~~~f~~~i~~~~-~~~~~~~~~fD~~~l~~~~~~~~~~~~~---~~~~~~iD~~~~~~~~---~~----------~~~~~ 141 (176) T cd06133 79 LKEFLEWLGKNG-KYAFVTWGDWDLKDLLQNQCKYKIINLP---PFFRQWIDLKKEFAKF---YG----------LKKRT 141 (176) T ss_pred HHHHHHHHCCCC-CEEEEEECCCHHHHHHHHHHHHCCCCCC---CCCCCHHHHHHHHHHH---HC----------CCCCC T ss_conf 999999972698-5799960600299999999997899887---3011120499999998---18----------88898 Q ss_pred CHHHHHHHHCCCCC-CCCHHHHHHHHHHHHHHHH Q ss_conf 08989987188764-3101333189999999999 Q gi|254780661|r 159 KLQDLALANGIEHV-NAHDAKADVYATLALVWLI 191 (471) Q Consensus 159 kL~~la~~n~i~~~-~aH~A~~D~~~t~~l~k~i 191 (471) +|+++|+.+||+.+ ++|||++||.+|++|++.+ T Consensus 142 sL~~l~~~~gi~~~~~~H~AL~DA~~ta~v~~~l 175 (176) T cd06133 142 GLSKALEYLGLEFEGRHHRGLDDARNIARILKRL 175 (176) T ss_pred CHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 9999999869999998858599999999999987 No 34 >PRK06859 consensus Probab=99.95 E-value=9.9e-26 Score=184.14 Aligned_cols=168 Identities=18% Similarity=0.194 Sum_probs=136.9 Q ss_pred CCEEEEECCCCCCCC-----CCCEEEEEEEEEECCCCEECCE-EEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHH Q ss_conf 837999752788987-----7260489999998389728040-0226621788877857898985799889960799979 Q gi|254780661|r 3 NHFVIYDYETFGRDV-----ALDRPAQFAGVRVDRQWEKIES-TEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEA 76 (471) Q Consensus 3 ~~~v~~D~ETtG~~~-----~~~~iiq~~~i~~d~~~~~~~~-~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~ 76 (471) +-.|++|+||||++| .-|+|||||||.+ .++.++++ |+.+ ++|.+..+++|..|||||++++.++| ++. T Consensus 5 ~R~VvlDtETTGl~p~~~~~~g~RIIEIgaVel-~~~~~t~~~f~~y---inP~r~I~~~a~~vhGIT~e~L~~kP-~F~ 79 (253) T PRK06859 5 NRQIVLDTETTGMNQFGAHYEGHCIIEIGAVEL-INRRYTGNNFHIY---IKPDRPVDPDAIKVHGITDEMLADKP-EFK 79 (253) T ss_pred CCEEEEEEECCCCCCCCCCCCCCEEEEEEEEEE-ECCEECCCEEEEE---ECCCCCCCHHHHHHHCCCHHHHHCCC-CHH T ss_conf 837999721799998766668986899999999-9997824046788---78699499889987450899983699-799 Q ss_pred HHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 99999999951179659995055789999999998507853213256887100158888655565321002444445443 Q gi|254780661|r 77 EFSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGAT 156 (471) Q Consensus 77 e~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~ 156 (471) +++.++.+|+. +..+|+|| +.||..||.+.|-|.++++.. ...+...|+|.++|.. +| |. T Consensus 80 eia~efl~Fi~--~a~LVaHN-a~FD~~FLn~El~r~g~~~~~---~~~~~viDTL~lAR~~---~P----------g~- 139 (253) T PRK06859 80 EVAQEFIDYIK--GAELLIHN-APFDVGFMDYEFRKLNLNVKT---DDICLVTDTLQMARQM---YP----------GK- 139 (253) T ss_pred HHHHHHHHHHC--CCEEEEEC-HHHHHHHHHHHHHHCCCCCCC---CCCCCEEEHHHHHHHH---CC----------CC- T ss_conf 99999999967--99799957-188899999999974999653---2200113399999987---87----------99- Q ss_pred CCCHHHHHHHHCCCCCC--CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 31089899871887643--1013331899999999998642 Q gi|254780661|r 157 SFKLQDLALANGIEHVN--AHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 157 ~~kL~~la~~n~i~~~~--aH~A~~D~~~t~~l~k~i~~~~ 195 (471) ..+|+.||+.++|+... .|.|+.||...++|.-.+...+ T Consensus 140 ~~sLdaLc~r~~id~~~R~~HgAllDa~lLa~VYl~mtGGQ 180 (253) T PRK06859 140 RNNLDALCDRLGIDNSKRTLHGALLDAEILADVYLMMTGGQ 180 (253) T ss_pred CCCHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC T ss_conf 77999999973998766330132888999999999872675 No 35 >PRK05168 ribonuclease T; Provisional Probab=99.95 E-value=2.2e-26 Score=188.36 Aligned_cols=174 Identities=25% Similarity=0.241 Sum_probs=132.8 Q ss_pred EEEEECCCCCCCCCCCEEEEEEEEEE--CCCCEEC--CEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHH Q ss_conf 79997527889877260489999998--3897280--4002266217888778578989857998899607999799999 Q gi|254780661|r 5 FVIYDYETFGRDVALDRPAQFAGVRV--DRQWEKI--ESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSR 80 (471) Q Consensus 5 ~v~~D~ETtG~~~~~~~iiq~~~i~~--d~~~~~~--~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~ 80 (471) .|++|+||||++|..|+|||||||++ |++|++. ++|+. +++|.++..++|+++.+|||++........++.+++. T Consensus 19 pVV~D~ETTGl~p~~D~IIEIGAV~v~~~e~g~i~~~~~f~~-~V~P~~g~~I~~~~~~ltGI~~~~~l~~a~~~~~al~ 97 (213) T PRK05168 19 PVVIDVETAGFNAQTDALLEIAAITLKMDEQGWLYPDETLHF-HVEPFEGANLEPEALAFNGIDPDNPLRGAVSEKEALH 97 (213) T ss_pred CEEEEEECCCCCCCCCCEEEEEEEEEEECCCCCCCCCCEEEE-EECCCCCCCCCHHHHHHCCCCCCCCCCCCCCHHHHHH T ss_conf 759983079988778855888889988637886724754889-8788878878866751126786532223666788789 Q ss_pred HHHHHHHCC-------CCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 999995117-------9659995055789999999998507853213256887100158888655565321002444445 Q gi|254780661|r 81 RIHQFFSVP-------NTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDD 153 (471) Q Consensus 81 ~i~~~~~~~-------~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~ 153 (471) ++.+++.+. +.++|||| +.||..||++.+-|.++..- .+ ..-..+|++.++|..+ + T Consensus 98 ~~~~~i~~~~~~~~c~~avLVaHN-A~FD~~FLn~~~~r~g~~~~--P~-~~~~viDTl~Lar~~~---~---------- 160 (213) T PRK05168 98 EIFKPVRKGIKAAGCQRAILVAHN-AHFDLGFLNAAAERTGLKRN--PF-HPFSTFDTATLAGLAY---G---------- 160 (213) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEC-HHHHHHHHHHHHHHCCCCCC--CC-CCCCEEEHHHHHHHHH---C---------- T ss_conf 999999999875046756377634-18749999999998499778--88-8776335899868874---8---------- Q ss_pred CCCCCCHHHHHHHHCCCC--CCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 443310898998718876--4310133318999999999986422689 Q gi|254780661|r 154 GATSFKLQDLALANGIEH--VNAHDAKADVYATLALVWLIREKKPKLF 199 (471) Q Consensus 154 ~~~~~kL~~la~~n~i~~--~~aH~A~~D~~~t~~l~k~i~~~~p~~~ 199 (471) ...|+.+|++.||+. ..+|+|+.|+.+|++|+-.+-++.+.+. T Consensus 161 ---~~~L~~lc~~~gi~~~~~~~HrAl~DA~~tA~vFl~ll~~l~~~g 205 (213) T PRK05168 161 ---QTVLAKACQAAGIEFDNKEAHSALYDTEKTAELFCEIVNRWKRLG 205 (213) T ss_pred ---CCCHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf ---740999999869999997777738999999999999999999738 No 36 >PRK08816 consensus Probab=99.95 E-value=1e-25 Score=184.01 Aligned_cols=165 Identities=19% Similarity=0.134 Sum_probs=133.1 Q ss_pred CEEEEECCCCCCCCCC-CEEEEEEEEEECCCCEEC-CEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 3799975278898772-604899999983897280-40022662178887785789898579988996079997999999 Q gi|254780661|r 4 HFVIYDYETFGRDVAL-DRPAQFAGVRVDRQWEKI-ESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~-~~iiq~~~i~~d~~~~~~-~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) .+|++|+||||++|.. |+|||||+|.+-+ +.++ ++|+.+ ++|...++++|..|||||++|+.++| .+.+++.+ T Consensus 2 R~VVlDtETTGl~~~~gdRIIEIgaV~~~~-~~~~~~~f~~~---inP~r~I~~~a~~vhGIT~e~L~dkP-~F~eV~~e 76 (244) T PRK08816 2 RQIILDTETTGLEWRKGNRVVEIGAVELLE-RRPSGNNFHRY---LKPDCDFEPGAQEVTGLTLEFLADKP-LFGEVVDE 76 (244) T ss_pred CEEEEEEECCCCCCCCCCEEEEEEEEEEEC-CEECCCEEEEE---ECCCCCCCHHHHHHHCCCHHHHHCCC-CHHHHHHH T ss_conf 879997227998998998899999999999-96814378998---68899699889875251789984099-89999999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) +.+|+. +..+|+|| +.||..||.+.|.|.+.+.-.. ...+...|+|.++|.- +| |. ..+|+ T Consensus 77 fl~Fi~--ds~LVaHN-a~FDi~FLn~El~r~g~~~~~~--~~~~~v~DTl~lAr~~---~P----------g~-~nsLD 137 (244) T PRK08816 77 FLAYID--GAELIIHN-AAFDLGFLDNELSLLGDQYGRI--VDRATVVDTLMMARER---YP----------GQ-RNSLD 137 (244) T ss_pred HHHHHC--CCEEEEEC-CHHHHHHHHHHHHHHCCCCCCC--CCCCCEEEHHHHHHHH---CC----------CC-CCCHH T ss_conf 999828--99899966-3652989999999848875676--4310104389999987---87----------99-77899 Q ss_pred HHHHHHCCCCCC--CCHHHHHHHHHHHHHHHHH Q ss_conf 899871887643--1013331899999999998 Q gi|254780661|r 162 DLALANGIEHVN--AHDAKADVYATLALVWLIR 192 (471) Q Consensus 162 ~la~~n~i~~~~--aH~A~~D~~~t~~l~k~i~ 192 (471) .||+.++|+..+ -|.|+.||...++|.-.+. T Consensus 138 aLc~r~~id~~~R~~HgALlDa~lLa~VYl~mt 170 (244) T PRK08816 138 ALCKRLGVDNSHRQLHGALLDAQILADVYIALT 170 (244) T ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 999974978877350148887999999999974 No 37 >PRK07982 consensus Probab=99.95 E-value=6.6e-26 Score=185.28 Aligned_cols=172 Identities=19% Similarity=0.150 Sum_probs=137.0 Q ss_pred CCCEEEEECCCCCCCC-----CCCEEEEEEEEEECCCCEECCE-EEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCH Q ss_conf 8837999752788987-----7260489999998389728040-022662178887785789898579988996079997 Q gi|254780661|r 2 TNHFVIYDYETFGRDV-----ALDRPAQFAGVRVDRQWEKIES-TEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVE 75 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~-----~~~~iiq~~~i~~d~~~~~~~~-~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~ 75 (471) |.-.|++|+||||++| .-|+|||||||.+ .++.+.++ |+.+ ++|.+..+++|..|||||++++.++| ++ T Consensus 5 ~~R~ivlDtETTGL~p~~~~~~gdRIIEIgaVei-~~~~~t~~~fh~y---iNP~r~I~~~a~~VHGIT~e~L~dkP-~F 79 (243) T PRK07982 5 ITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEV-INRRLTGNNFHVY---LKPDRLVDPEAFGVHGIADEFLLDKP-TF 79 (243) T ss_pred CCEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEE-ECCEECCCEEEEE---ECCCCCCCHHHHHHEEECHHHHHCCC-CH T ss_conf 0258999711799998776778987999999999-9998825234378---88899799889852115778871699-89 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999999995117965999505578999999999850785321325688710015888865556532100244444544 Q gi|254780661|r 76 AEFSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGA 155 (471) Q Consensus 76 ~e~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~ 155 (471) .+++.++.+|+. +..+|+|| +.||..||.+.|.|.+.+... ...-+...|+|.++|.- + .|. T Consensus 80 ~eI~~efl~Fi~--~a~LVaHN-A~FD~~FLn~El~r~g~~~~~--~~~~~~v~DTL~lAR~~---~----------Pg~ 141 (243) T PRK07982 80 ADVADEFLDYIR--GAELVIHN-AAFDIGFMDYEFSLLKRDIPK--TNTFCKVTDSLALARKM---F----------PGK 141 (243) T ss_pred HHHHHHHHHHHC--CCEEEEEC-CHHHHHHHHHHHHHCCCCCCC--CCCCCCEEEHHHHHHHH---C----------CCC T ss_conf 999999999967--99899973-388899999999970889886--10124313499998887---8----------898 Q ss_pred CCCCHHHHHHHHCCCCCCC--CHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3310898998718876431--01333189999999999864226 Q gi|254780661|r 156 TSFKLQDLALANGIEHVNA--HDAKADVYATLALVWLIREKKPK 197 (471) Q Consensus 156 ~~~kL~~la~~n~i~~~~a--H~A~~D~~~t~~l~k~i~~~~p~ 197 (471) ..+|+.||+.++|+...- |.|+.||...++|.-.+...++. T Consensus 142 -~~sLDaLc~R~~Id~s~R~~HgALlDa~lLAeVYl~ltgGQ~~ 184 (243) T PRK07982 142 -RNSLDALCDRYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTS 184 (243) T ss_pred -CCCHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC T ss_conf -6899999875487765655233377488999999997289866 No 38 >cd06127 DEDDh DEDDh exonucleases, part of the DnaQ-like (or DEDD) exonuclease superfamily, catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. These proteins contain four invariant acidic residues in three conserved sequence motifs termed ExoI, ExoII and ExoIII. DEDDy exonucleases are classified as such because of the presence of specific Hx(4)D conserved pattern at the ExoIII motif. The four conserved acidic residues are clustered around the active site and serve as ligands for the two metal ions required for catalysis. Most DEDDh exonucleases are the proofreading subunits (epsilon) or domains of bacterial DNA polymerase III, the main replicating enzyme in bacteria, which functions as the chromosomal replicase. Other members include other DNA and RNA exonucleases such as RNase T, Oligoribonuclease, and RNA exonuclease (REX), among others. Probab=99.95 E-value=5.5e-26 Score=185.78 Aligned_cols=158 Identities=30% Similarity=0.295 Sum_probs=129.4 Q ss_pred EEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHH Q ss_conf 99975278898772604899999983897280400226621788877857898985799889960799979999999999 Q gi|254780661|r 6 VIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQF 85 (471) Q Consensus 6 v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~~ 85 (471) ||+|+||||++|..++|||+|+|++|.++++.++++. +++|... .++.+..+|||+.+++.. +.++.+++.++.++ T Consensus 1 v~~D~ETTGl~~~~~~Iieia~v~~~~~~~~~~~~~~-~i~P~~~--~~~~~~~i~gi~~~~~~~-~~~~~~~~~~~~~~ 76 (159) T cd06127 1 VVLDTETTGLDPKGDRIIEIGAVKVDGGIEIVERFET-LVNPGRP--IPPEATAIHGITDEMLAD-APPFEEVLPEFLEF 76 (159) T ss_pred CEEEEECCCCCCCCCCEEEEEEEEEECCEEEEEEEEE-EECCCCC--CCHHHHHHHCCCHHHHCC-CCCHHHHHHHHHHH T ss_conf 9898267999999984699999999999565200799-9889999--998898760677899589-99689999999873 Q ss_pred HHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH-HH Q ss_conf 51179659995055789999999998507853213256887100158888655565321002444445443310898-99 Q gi|254780661|r 86 FSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD-LA 164 (471) Q Consensus 86 ~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~-la 164 (471) +.. .++||||. +||..||++.+.|++.++ .....+|++.+++.... ..+..++.. .+ T Consensus 77 l~~--~~lv~hN~-~FD~~~L~~~~~~~~~~~------~~~~~iDt~~~~~~~~~-------------~~~~~~~~~~~~ 134 (159) T cd06127 77 LGG--RVLVAHNA-SFDLRFLNRELRRLGGPP------LPNPWIDTLRLARRLLP-------------GLRSHRLGLLLA 134 (159) T ss_pred CCC--CEEEECCH-HHHHHHHHHHHHHCCCCC------CCCCEEEHHHHHHHHCC-------------CCCCCCHHHHHH T ss_conf 156--34651588-899999999999839998------89966647999999738-------------888999999999 Q ss_pred HHHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 8718876431013331899999999 Q gi|254780661|r 165 LANGIEHVNAHDAKADVYATLALVW 189 (471) Q Consensus 165 ~~n~i~~~~aH~A~~D~~~t~~l~k 189 (471) ...|+...+||+|++||.+|++|+. T Consensus 135 ~~~~~~~~~aH~Al~D~~at~~l~~ 159 (159) T cd06127 135 ARYGIPLEGAHRALADALATAELLL 159 (159) T ss_pred HHCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 9769887989598999999999969 No 39 >TIGR01406 dnaQ_proteo DNA polymerase III, epsilon subunit; InterPro: IPR006309 These represent DnaQ, the DNA polymerase III epsilon subunit, as found in most Proteobacteria. It consists largely of an exonuclease region as described in IPR006055 from INTERPRO. In Gram-positive bacteria, closely related regions are found both in the Gram-positive type DNA polymerase III alpha subunit and as an additional N-terminal domain of a DinG-family helicase. Both are not included in this set of sequences, as they are smaller proteins.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=99.94 E-value=1.6e-25 Score=182.77 Aligned_cols=190 Identities=19% Similarity=0.145 Sum_probs=154.0 Q ss_pred EEEEECCCCCCCCCC------CEEEEEEEE-EECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHC--CCCH Q ss_conf 799975278898772------604899999-9838972804002266217888778578989857998899607--9997 Q gi|254780661|r 5 FVIYDYETFGRDVAL------DRPAQFAGV-RVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRD--GVVE 75 (471) Q Consensus 5 ~v~~D~ETtG~~~~~------~~iiq~~~i-~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~--~~~~ 75 (471) -|+.||||||+++.. +|||||||| ++|.-+.- +.|+. +|+| ++-.+.+|..|||||.+.|.++ | .+ T Consensus 2 ~IiLDTETTGl~~~~G~~~~GhRivEIGavE~vnr~~tG-~~fH~-Y~~P--~Rd~p~eA~kVHGiT~efL~dKnaP-~F 76 (246) T TIGR01406 2 EIILDTETTGLDPKGGLPYIGHRIVEIGAVELVNRMLTG-KNFHV-YVKP--ERDIPAEAAKVHGITDEFLKDKNAP-KF 76 (246) T ss_pred EEEECCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCC-CCEEE-EECC--CCCCHHHCEECCCCCHHHHCCCCCC-CH T ss_conf 267115556877667768568626888218567350388-73124-3368--8877300210137646884688989-52 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999999995117965999505578999999999850785321325688710015888865556532100244444544 Q gi|254780661|r 76 AEFSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGA 155 (471) Q Consensus 76 ~e~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~ 155 (471) .+++.+|.+|++ +...|.|| +.||..||.+.|-|.+++.+.- -..-++..|+|.++|-. +|+ . T Consensus 77 k~Ia~~Fl~Fi~--~~~LvIHN-A~FD~gFlN~EL~~Lg~~~~~~-~~~~~~~~DTL~mAR~~---FPG----------~ 139 (246) T TIGR01406 77 KEIADEFLDFIK--GSELVIHN-AAFDVGFLNYELERLGLKEIKK-ISDFCRVIDTLAMARER---FPG----------K 139 (246) T ss_pred HHHHHHHHHHHC--CCEEEEEC-CCHHHHHHHHHHHHCCCCCCCC-CCHHHHHHHHHHHHHHC---CCC----------C T ss_conf 578999999845--98899975-7021678999998728732100-01366786568999723---789----------8 Q ss_pred CCCCHHHHHHHHCCCCCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 331089899871887643--10133318999999999986422689999962388989974200 Q gi|254780661|r 156 TSFKLQDLALANGIEHVN--AHDAKADVYATLALVWLIREKKPKLFEYLYDYRNKNQLRKLIDI 217 (471) Q Consensus 156 ~~~kL~~la~~n~i~~~~--aH~A~~D~~~t~~l~k~i~~~~p~~~~~~l~~~~K~~v~~~l~~ 217 (471) ..+|+.||+++||+.+. -|-|+-||...++|.=.|...+-.++..+....++..+....+. T Consensus 140 -~~sLDALCkRf~vDns~R~LHGAL~DA~LLAeVYl~ltGGq~~~~~~~~~~~~~~~~~~~~~~ 202 (246) T TIGR01406 140 -RNSLDALCKRFKVDNSHRQLHGALLDAELLAEVYLALTGGQESLLEMLVASESGEAAKASKKA 202 (246) T ss_pred -CCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHH T ss_conf -664678774304250002134789999999999998628805777753122013456677754 No 40 >PRK09146 DNA polymerase III subunit epsilon; Validated Probab=99.93 E-value=1.1e-24 Score=177.54 Aligned_cols=175 Identities=18% Similarity=0.149 Sum_probs=132.9 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEEC--CEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHH Q ss_conf 83799975278898772604899999983897280--4002266217888778578989857998899607999799999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKI--ESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSR 80 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~--~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~ 80 (471) -.||.+|+||||+||..|+||+||+|.++ ++.+. +.++. .++ |....++++..|||||++++.+++ +..+++. T Consensus 47 ~~fvalD~ETTGLdp~~D~IiSIGaV~i~-~~~I~l~~a~~~-lV~--p~~~i~~~s~~IHGIt~~~l~~ap-~l~evl~ 121 (239) T PRK09146 47 VPFVALDFETTGLDPEQDAIVSIGLVPFT-LQRIRCRQARHW-VVK--PRRPLNEESVVIHGITHSDLQDAP-DLERILD 121 (239) T ss_pred CCEEEEECCCCCCCCCCCCEEEEEEEEEE-CCEEEECCEEEE-EEC--CCCCCCCCCEEECCCCHHHHHCCC-CHHHHHH T ss_conf 87899964157888788854899899987-888865341589-988--999787645110696989985699-9999999 Q ss_pred HHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH Q ss_conf 99999511796599950557899999999985078532132568871001588886555653210024444454433108 Q gi|254780661|r 81 RIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKL 160 (471) Q Consensus 81 ~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL 160 (471) ++.++++ |.++|+|| ..||..||++.+.+.+-.+. .+..+|+|.+.+....-....+--.......++++| T Consensus 122 ~ll~~l~--g~VLVaH~-a~~D~~FL~~a~~~~~g~~l------~~P~IDTl~Le~~~~~~~~~~~~~r~~~~~~~slrL 192 (239) T PRK09146 122 ELLEALA--GKVVVVHY-RRIERDFLDQALRNRIGEGI------EFPVIDTMEIEARIQRKQAGGLWNRLKGKKPESIRL 192 (239) T ss_pred HHHHHHC--CCEEEEEC-CHHHHHHHHHHHHHHHCCCC------CCCEEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCH T ss_conf 9999818--99899968-39999999999999628898------874644899999998854200444313589867638 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 9899871887643101333189999999999 Q gi|254780661|r 161 QDLALANGIEHVNAHDAKADVYATLALVWLI 191 (471) Q Consensus 161 ~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i 191 (471) ..++..+|+..-.+|+|++||.||++|+-.- T Consensus 193 ~~~r~rYgLP~y~aH~AL~DAlATAELflAQ 223 (239) T PRK09146 193 ADSRLRYGLPAYPPHHALTDAIATAELLQAQ 223 (239) T ss_pred HHHHHHCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 9999981989888767199999999999999 No 41 >PRK07983 exodeoxyribonuclease X; Provisional Probab=99.92 E-value=1.5e-23 Score=170.20 Aligned_cols=192 Identities=19% Similarity=0.197 Sum_probs=132.6 Q ss_pred EEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHH Q ss_conf 79997527889877260489999998389728040022662178887785789898579988996079997999999999 Q gi|254780661|r 5 FVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQ 84 (471) Q Consensus 5 ~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~ 84 (471) ++++|+||||++ ++|||+|+|.+ .++.++++|+.+ + +|+..++++|+.|||||++|+.++| ++.|++++ T Consensus 2 ~~VlDtETTGl~---~~IiEia~v~v-~~g~i~~~~~~l-i--nP~~pI~~~a~~ihgIT~emv~~aP-~f~ev~~~--- 70 (219) T PRK07983 2 LRIIDTETCGLQ---GGIVEIASVDV-IDGKIVNPMSHL-V--RPDRPISPQAMAIHRITEAMVADKP-WIEDVIPH--- 70 (219) T ss_pred EEEEEEECCCCC---CCCEEEEEEEE-ECCEEEEEEEEE-E--CCCCCCCHHHHHHCCCCHHHHCCCC-CHHHHHHH--- T ss_conf 599963479999---98489999999-999997668999-8--9689698889987298999986899-88999998--- Q ss_pred HHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH Q ss_conf 95117965999505578999999999850785321325688710015888865556532100244444544331089899 Q gi|254780661|r 85 FFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLA 164 (471) Q Consensus 85 ~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la 164 (471) +.. +.++|||| ++||..||.+. +....|++.++|.. .|+ . +++|..|. T Consensus 71 ~~~--~~~lVaHN-a~FD~~~L~~~---------------~~~~IcTl~lAR~l---~p~----------~-~~~l~~Lr 118 (219) T PRK07983 71 YYG--SEWYVAHN-ASFDRRVLPEM---------------PGEWICTMKLARRL---WPG----------I-KYSNMALY 118 (219) T ss_pred HCC--CCEEEECC-CCCCHHHHCCC---------------CCCEEEHHHHHHHH---CCC----------C-CHHHHHHH T ss_conf 628--98787536-51038664155---------------87566578999987---677----------4-24189999 Q ss_pred HHHCCCCC-----CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCEE Q ss_conf 87188764-----3101333189999999999864226899999623889899742001576079970101047787506 Q gi|254780661|r 165 LANGIEHV-----NAHDAKADVYATLALVWLIREKKPKLFEYLYDYRNKNQLRKLIDIQNMTPLVHVSGMFGASRSNTAL 239 (471) Q Consensus 165 ~~n~i~~~-----~aH~A~~D~~~t~~l~k~i~~~~p~~~~~~l~~~~K~~v~~~l~~~~~~~~~~v~~~~~~~~~~~~~ 239 (471) .+.++... +||+|++||.+|..|+..+.+....-.+...+.+.+ |...-. +.+|+.+... T Consensus 119 y~~~~~~~~~~~~~AHrAl~Dv~v~~~ll~~l~~~~~~~~e~l~~~s~~-------------p~~~~~--~~FGKyKG~~ 183 (219) T PRK07983 119 KSRKLNVQTPPGLHHHRALYDCYITAALLIDIMRTSGWTAEEMADITGR-------------PSLLTT--FTFGKYRGKA 183 (219) T ss_pred HHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCC-------------CCCCCC--CCCCCCCCEE T ss_conf 9842761332443334307879999999999998626899999998638-------------761230--2567748809 Q ss_pred EEEEE-CCCCCCCEEE Q ss_conf 88630-2888774478 Q gi|254780661|r 240 IAPVA-WHPRYKDSVI 254 (471) Q Consensus 240 v~~~~-~~p~~~n~~i 254 (471) +.-+. .+|+|-+|++ T Consensus 184 i~ev~~~dp~Yl~WlL 199 (219) T PRK07983 184 VSDVAERDPGYLRWLF 199 (219) T ss_pred HHHHHCCCCCHHHHHH T ss_conf 9986125832999987 No 42 >TIGR00573 dnaq exonuclease, DNA polymerase III, epsilon subunit family; InterPro: IPR006054 All proteins in this entry for which functions are known are components of the DNA polymerase III complex (epsilon subunit). There is, however, an outgroup that includes paralogs in some gamma-proteobacteria and the N-terminal region of DinG from some low GC Gram-positive bacteria. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=99.90 E-value=4.3e-23 Score=167.20 Aligned_cols=174 Identities=22% Similarity=0.205 Sum_probs=149.3 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEE-ECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 837999752788987726048999999-8389728040022662178887785789898579988996079997999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVR-VDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~-~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) .+|+.+|.||||++...++||++|||. ++. .-++++++.+ +.|++..+|++..+||||++||.++| .+.|++++ T Consensus 7 ~t~t~~d~e~tgl~~~~~~II~IGAv~~i~r-r~~~~~~~~~---~~P~R~~~~~~~k~hGItDd~L~dkP-~F~eia~d 81 (228) T TIGR00573 7 DTFTTGDNETTGLYAKHDEIIEIGAVEIINR-RIIGNKFHTY---IKPDRLIDPDAIKIHGITDDMLKDKP-DFKEIAED 81 (228) T ss_pred EEEEEEECCCCCCCCCCCCEEEEEEEEEECC-CEEECCEEEE---ECCCCCCCCCEEEECCCCHHHHCCCC-CHHHHHHH T ss_conf 4268752110676322430454533766446-1440100577---61888889642552488828846898-51378899 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) +.++++ |.+.|+|| ++||.+||.+.|-+-+.-. ....+..+|+..+++.++-.+|. ..+.|+ T Consensus 82 F~~~I~--g~~Lv~HN-A~FD~GFl~~e~~~LG~~~----~~~~~~v~~~~~~~~~~~~~~p~-----------~~~~Ld 143 (228) T TIGR00573 82 FADYIK--GAVLVIHN-ASFDVGFLNYEFSKLGLKK----EPKTNDVIDTTDTLQAARPEFPG-----------KRNTLD 143 (228) T ss_pred HHHHHC--CCEEEEEH-HHCCHHHHHHHHHHCCCCC----CHHHHHHHHHHHHHHHHHHHCCC-----------CCCHHH T ss_conf 999837--94787520-0305778999998514501----10011444499999999720588-----------430277 Q ss_pred HHHHHHCCCCC--CCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89987188764--310133318999999999986422689 Q gi|254780661|r 162 DLALANGIEHV--NAHDAKADVYATLALVWLIREKKPKLF 199 (471) Q Consensus 162 ~la~~n~i~~~--~aH~A~~D~~~t~~l~k~i~~~~p~~~ 199 (471) .||..++|+.. -.|+|+.||..+++|++.+-.+.-.+. T Consensus 144 ~L~~~~~~~~s~R~~H~A~~DA~~l~~~~~~~~~~~~~~~ 183 (228) T TIGR00573 144 ALALRYEITNSHRALHGALLDAFILAKLYLVMTGKQTKYN 183 (228) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7787805440377773177889999999999986122032 No 43 >COG0847 DnaQ DNA polymerase III, epsilon subunit and related 3'-5' exonucleases [DNA replication, recombination, and repair] Probab=99.89 E-value=6.7e-22 Score=159.54 Aligned_cols=165 Identities=25% Similarity=0.259 Sum_probs=137.3 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCE-EEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 379997527889877260489999998389728040-0226621788877857898985799889960799979999999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIES-TEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~-~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) .+|++|+||||.++..++|||+|+|... ++.+.++ |+. +++| ...+++++..+||||.+++.++| ...+++.++ T Consensus 14 ~~vv~D~ETtg~~~~~~~iieIgav~~~-~~~i~~~~~~~-~v~P--~~~i~~~~~~i~git~e~l~~~p-~~~~v~~~~ 88 (243) T COG0847 14 RFVVIDLETTGLNPKKDRIIEIGAVTLE-DGRIVERSFHT-LVNP--ERPIPPEIFKIHGITDEMLADAP-KFAEVLPEF 88 (243) T ss_pred CEEEEECCCCCCCCCCCCEEEEEEEEEE-CCCCCCCCCCE-EECC--CCCCCCCCCCCCCCCHHHHHCCC-CHHHHHHHH T ss_conf 5799966148988778854899659967-78402431114-6688--98787543114773889974786-389999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) .+++..+ ..+|+|| ..||..|++..+.+.+.++. +..+.|++.++|..+ .|.+..+|+. T Consensus 89 ~~~i~~~-~~~Vahn-a~fD~~fl~~~~~~~~~~~~------~~~~~~t~~~~r~~~-------------~~~~~~~L~~ 147 (243) T COG0847 89 LDFIGGL-RLLVAHN-AAFDVGFLRVESERLGIEIP------GDPVLDTLALARRHF-------------PGFDRSSLDA 147 (243) T ss_pred HHHHCCC-CEEEEEE-HHHCHHHHHHHHHHHCCCCC------CCCCHHHHHHHHHHC-------------CCCCCCCHHH T ss_conf 9984788-8699950-53088999989997089875------764200799999876-------------9976134899 Q ss_pred HHHHHCCC--CCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 99871887--643101333189999999999864 Q gi|254780661|r 163 LALANGIE--HVNAHDAKADVYATLALVWLIREK 194 (471) Q Consensus 163 la~~n~i~--~~~aH~A~~D~~~t~~l~k~i~~~ 194 (471) +|..+||. +...|+|+.|+.+|++++..+... T Consensus 148 l~~~~gi~~~~~~~H~Al~Da~~~a~~~~~~~~~ 181 (243) T COG0847 148 LAERLGIDRNPFHPHRALFDALALAELFLLLQTG 181 (243) T ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 9996298877777661678899999999998620 No 44 >PRK09182 DNA polymerase III subunit epsilon; Validated Probab=99.86 E-value=5.4e-21 Score=153.68 Aligned_cols=172 Identities=22% Similarity=0.190 Sum_probs=119.1 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEE--CCCCE---ECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHH Q ss_conf 379997527889877260489999998--38972---8040022662178887785789898579988996079997999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRV--DRQWE---KIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEF 78 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~--d~~~~---~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~ 78 (471) .=||.|+||||+++..|+|||+|+++. |.+|+ ++++++.+ -.|+..++|+++.+||||++|+..+.... T Consensus 38 ~g~~lD~ETTGl~~~~d~IIElg~v~f~~~~~G~i~~i~~~~~~~---~dP~~pIp~~it~lTGItd~mV~Gq~id~--- 111 (293) T PRK09182 38 LGVILDTETTGLDPRKDEIIEIGMVAFEYDDDGRIGDVVDTFGGL---QQPSRPIPPEITRLTGITDEMVAGQTIDP--- 111 (293) T ss_pred EEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEECCEECC---CCCCCCCCHHHHHHCCCCHHHHCCCCCCH--- T ss_conf 899995305788977877999768999986998587764323110---38999899879875188889968882799--- Q ss_pred HHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99999995117965999505578999999999850785321325688710015888865556532100244444544331 Q gi|254780661|r 79 SRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSF 158 (471) Q Consensus 79 ~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~ 158 (471) ..+..+++. ..+|.+||++||.+|+..-+ |.. ..-..+|+. ..+.|. ..|-.|- T Consensus 112 -~~V~~~~~~--adliiAHNA~FDR~F~E~~~-----p~f-~~k~WaCS~---------------~~IdW~--~~Gf~s~ 165 (293) T PRK09182 112 -AEVDALIAP--ADLIIAHNAGFDRPFLERFS-----PVF-ANKPWACSV---------------SEIDWS--ARGFEGT 165 (293) T ss_pred -HHHHHHHCC--CCEEEECCCCCCHHHHHHHC-----CCC-CCCCEEEEC---------------CCCCHH--HCCCCCC T ss_conf -999998644--98999817754677898509-----532-587505316---------------379703--2287741 Q ss_pred CHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHCCH Q ss_conf 08989987188764310133318999999999-986422689999962388 Q gi|254780661|r 159 KLQDLALANGIEHVNAHDAKADVYATLALVWL-IREKKPKLFEYLYDYRNK 208 (471) Q Consensus 159 kL~~la~~n~i~~~~aH~A~~D~~~t~~l~k~-i~~~~p~~~~~~l~~~~K 208 (471) ||+-||...|+=++ +|+|+.||.|++.|+-. +.......+...|+...| T Consensus 166 kLeyL~~~~G~F~~-aHRA~~Dc~All~LL~~~l~~s~~~~l~~Ll~~a~~ 215 (293) T PRK09182 166 KLGYLAGQAGFFHD-GHRAVDDCQALLELLARPLPETGQPPLAELLEASRR 215 (293) T ss_pred CHHHHHHHCCCCCC-CCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCC T ss_conf 59999986094545-653488999999997677864455499999997278 No 45 >cd06125 DnaQ_like_exo The DnaQ-like exonuclease superfamily is a structurally conserved group of 3'-5' exonucleases, which catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. It is also called the DEDD superfamily, after the four invariant acidic residues present in the catalytic site of its members. The superfamily consists of DNA- and RNA-processing enzymes such as the proofreading domains of DNA polymerases, other DNA exonucleases, RNase D, RNase T, Oligoribonuclease and RNA exonucleases (REX). The DnaQ-like exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, which are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The conservation patterns of the three motifs may vary among different subfamilies. DnaQ-like exonucleases are classified as DEDDy or DEDDh exonucleases depending on the variation of motif Probab=99.82 E-value=1.9e-19 Score=143.74 Aligned_cols=116 Identities=29% Similarity=0.212 Sum_probs=97.0 Q ss_pred EEEECCCCCCCC-CCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHH Q ss_conf 999752788987-7260489999998389728040022662178887785789898579988996079997999999999 Q gi|254780661|r 6 VIYDYETFGRDV-ALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQ 84 (471) Q Consensus 6 v~~D~ETtG~~~-~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~ 84 (471) |++|+||||+++ ..|+|+|+|++.++. + +.+.. .+ +|...++|.+..+|||+++++.++| ++.++++++.+ T Consensus 1 Vv~D~ETTGl~~~~~d~Iiei~~v~~~~-~---~~~~~-~v--~p~~~i~~~~~~i~gi~~~~v~~~p-~~~~~~~~~~~ 72 (130) T cd06125 1 VAFDIETTGLNGNARDEIIEIALVVVHI-G---EKFVF-DV--KPDDLIDPYAFTISGITDEVVERAA-TEAELLIELLA 72 (130) T ss_pred CEEEEECCCCCCCCCCEEEEEEEEEEEC-C---CEEEE-EE--CCCCCCCHHHHHCCCCCHHHHHHHH-HHHHHHHHHHH T ss_conf 9898018996999998589999999718-9---88999-97--9999899779410375999983306-08999999999 Q ss_pred HHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHH Q ss_conf 95117965999505578999999999850785321325688710015888 Q gi|254780661|r 85 FFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDV 134 (471) Q Consensus 85 ~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~ 134 (471) ++..+++++||||. +||.+||.+.+.|++++.. ..++..+|++.+ T Consensus 73 f~~~~~~ilVgHN~-~FD~~fL~~~~~r~g~~~~----~~~~~~iDTl~l 117 (130) T cd06125 73 FLRDKDDILVGHNI-SFDLPYLLNRAAELGIPYP----ILGRIVFDTLAL 117 (130) T ss_pred HHCCCCCEEEEECH-HHHHHHHHHHHHHCCCCCC----CCCCCEEECHHH T ss_conf 97699999999194-7749999999998499998----789968686098 No 46 >PRK07748 sporulation inhibitor KapD; Provisional Probab=99.81 E-value=7.5e-18 Score=133.51 Aligned_cols=169 Identities=14% Similarity=0.112 Sum_probs=131.3 Q ss_pred CCCEEEEECCCCCCCC----C--CCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCH Q ss_conf 8837999752788987----7--260489999998389728040022662178887785789898579988996079997 Q gi|254780661|r 2 TNHFVIYDYETFGRDV----A--LDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVE 75 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~----~--~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~ 75 (471) .+.|+|+|+|.|.-+. . ..+|||||||++|. .+++++|.. +|+|......++....+||||.+++.+ +.++ T Consensus 3 ~~~yiViDfEaTc~~~~~~p~~~~~EIIEiGaV~l~~-~~i~d~F~~-~VkP~~~p~Ls~fc~~LTGItq~~vd~-a~~F 79 (205) T PRK07748 3 EQRFLFLDFEFTMPQHKKKPKGFFPEIIEVGLVSVVG-CKVEDTFSS-YVKPKTFPSLTDRCKKFLGITQEDVDK-GISF 79 (205) T ss_pred CCEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEC-CEEEEEHEE-EECCCCCCCCCHHHHHHCCCCHHHHHC-CCCH T ss_conf 6669999985478887888777887517997899817-815210000-479854785137899752949999625-8988 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999999995117965999505578999999999850785321325688710015888865556532100244444544 Q gi|254780661|r 76 AEFSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGA 155 (471) Q Consensus 76 ~e~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~ 155 (471) .+++..+.++......+++.|. .+|..+|++...+++++.. + .....|+-...+..+. .. T Consensus 80 ~evl~~f~~~~~~~~~~~~tWG--~~D~~~L~~~c~~~~i~~p---~--~~~~idlk~~f~~~~~-------------~~ 139 (205) T PRK07748 80 EELVEKLAEYDKRCKPTIVTWG--NMDMKVLKHNCEKAGVPFP---F--KGQCRDLSLEYKKFFG-------------ER 139 (205) T ss_pred HHHHHHHHHHHCCCCEEEEEEC--HHHHHHHHHHHHHHCCCCC---C--CCCEEEHHHHHHHHHC-------------CC T ss_conf 9999999998177872898717--8899999999999389998---5--3100639999999968-------------88 Q ss_pred CCCCHHHHHHHHCCCCC-CCCHHHHHHHHHHHHHHHHHH Q ss_conf 33108989987188764-310133318999999999986 Q gi|254780661|r 156 TSFKLQDLALANGIEHV-NAHDAKADVYATLALVWLIRE 193 (471) Q Consensus 156 ~~~kL~~la~~n~i~~~-~aH~A~~D~~~t~~l~k~i~~ 193 (471) ...+|....+..|++.+ ..|+|+.||..|+.|++++.. T Consensus 140 ~~~gL~~aL~~lgl~~eG~~H~glDDA~NtAkI~kk~~~ 178 (205) T PRK07748 140 NQTGLWKAIEEYGKEGTGKHHCALDDAMTTYNIFKLVEK 178 (205) T ss_pred CCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 876899999976998889863758999999999999761 No 47 >cd06135 Orn Oligoribonuclease (Orn) is a DEDDh-type DnaQ-like 3'-5' exoribonuclease that is responsible for degrading small oligoribonucleotides to mononucleotides. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Orn is essential for E.coli survival. The human homolog, also called Sfn (small fragment nuclease), is able to hydrolyze short single-stranded RNA and DNA oligomers. It plays a role in cellular nucleotide recycling. Probab=99.74 E-value=4.5e-17 Score=128.51 Aligned_cols=163 Identities=18% Similarity=0.201 Sum_probs=119.3 Q ss_pred EEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCC--CCHHHHHHH---CCCHHHHHHCCCCHHHHH Q ss_conf 799975278898772604899999983897280400226621788877--857898985---799889960799979999 Q gi|254780661|r 5 FVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYL--PDPEAVFIT---GITPQKALRDGVVEAEFS 79 (471) Q Consensus 5 ~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~--p~p~a~~v~---git~~~~~~~~~~~~e~~ 79 (471) +||.|+|||||+|..|+|+|+|+|+||.+++++++-..+.+.++...+ -++.....| |++.+. .+++.+..++. T Consensus 1 lvWiDlEmTGLd~~~d~IiEIA~ivTD~~L~~~~eg~~~vI~~~~~~L~~M~~w~~~~H~~sGL~~~v-~~s~~t~~~ae 79 (173) T cd06135 1 LVWIDLEMTGLDPEKDRILEIACIITDGDLNIIAEGPELVIHQPDEVLDGMDEWCTEMHTKSGLTERV-RASTVTLAQAE 79 (173) T ss_pred CEEEECCCCCCCCCCCEEEEEEEEEECCCCCEECCCCEEEEECCHHHHHHHHHHHHHHCCCCCHHHHH-HHCCCCHHHHH T ss_conf 95993303688888864899999998899877035631888288799877656877503324416788-73699999999 Q ss_pred HHHHHHHHCC---CCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 9999995117---9659995055789999999998507853213256887100158888655565321002444445443 Q gi|254780661|r 80 RRIHQFFSVP---NTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGAT 156 (471) Q Consensus 80 ~~i~~~~~~~---~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~ 156 (471) .++.+|++.+ +.+.++.|++.||..||+...-+- ..|. .+ +.+|+-.+--++...+|+... T Consensus 80 ~~~l~Fi~~~~~~~~~pLaGNSV~~Dr~FL~k~mP~l--~~~l-hY----R~iDVSsikel~~rw~p~~~~--------- 143 (173) T cd06135 80 AELLEFIKKYVPKGKSPLAGNSVHQDRRFLDKYMPEL--EEYL-HY----RILDVSSIKELARRWYPEIYR--------- 143 (173) T ss_pred HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCHH--HHHC-CC----EEEEHHHHHHHHHHHCCHHHH--------- T ss_conf 9999999997679985315764877099999858067--9759-87----046677799999986916340--------- Q ss_pred CCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 31089899871887643101333189999999999864 Q gi|254780661|r 157 SFKLQDLALANGIEHVNAHDAKADVYATLALVWLIREK 194 (471) Q Consensus 157 ~~kL~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~ 194 (471) .+.....+|+|++|...+++-+|..++. T Consensus 144 ----------~~~~K~~~HrAl~DI~eSI~ELk~Yr~~ 171 (173) T cd06135 144 ----------KAPKKKGTHRALDDIRESIAELKYYREN 171 (173) T ss_pred ----------CCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf ----------0699877743498999999999999985 No 48 >PRK05359 oligoribonuclease; Provisional Probab=99.65 E-value=2.7e-15 Score=117.04 Aligned_cols=165 Identities=18% Similarity=0.208 Sum_probs=118.4 Q ss_pred CCCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCE-EEEEEECCCCCCC--CCHHHHHHH---CCCHHHHHHCCCC Q ss_conf 988379997527889877260489999998389728040-0226621788877--857898985---7998899607999 Q gi|254780661|r 1 MTNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIES-TEVFFCKPADDYL--PDPEAVFIT---GITPQKALRDGVV 74 (471) Q Consensus 1 m~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~-~~~~~~~~~~~~~--p~p~a~~v~---git~~~~~~~~~~ 74 (471) +.+++||.|+|+|||+|..|+|+|+|+|.||.++|++++ ++ +.+..+..++ -++.....| |++ +.+.++..+ T Consensus 1 ~~~~lvWiDlEMTGLd~~~d~IlEiA~ivTD~~Ln~~~eg~~-~vI~~~~e~L~~M~~w~~~~H~~sGL~-~~v~~S~~s 78 (181) T PRK05359 1 NEDNLIWIDLEMTGLDPERDRIIEIATIVTDADLNILAEGPV-IAIHQSDEVLAAMDEWNTRTHTRSGLI-DRVRASTVT 78 (181) T ss_pred CCCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCE-EEEECCHHHHHHHHHHHHCCCCCCCCH-HHHHCCCCC T ss_conf 998658997613489878776899999999887767023761-788589899975217875057433206-678628988 Q ss_pred HHHHHHHHHHHHHCC---CCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC Q ss_conf 799999999995117---96599950557899999999985078532132568871001588886555653210024444 Q gi|254780661|r 75 EAEFSRRIHQFFSVP---NTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSR 151 (471) Q Consensus 75 ~~e~~~~i~~~~~~~---~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~ 151 (471) ..++-+++.+|+..+ +.+..+.|++.+|..||+...=+ +..|. .+ +.+|+-.+--++...+|+..+-| T Consensus 79 ~~~AE~~~l~fi~~~~~~~~~pLaGNSV~~DR~FL~k~MP~--l~~~l-hY----R~iDVSSikEl~~rW~p~~~~~p-- 149 (181) T PRK05359 79 EAEAEAQTLEFLKQWVPAGKSPLCGNSIGQDRRFLARYMPE--LEAYF-HY----RNLDVSTLKELARRWKPEILNGF-- 149 (181) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCEEHHHHHHHHHCHH--HHHHC-CE----EEEECCCHHHHHHHHCHHHCCCC-- T ss_conf 99999999999999679999751687200328899984806--79758-85----67851259999988483220169-- Q ss_pred CCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 4544331089899871887643101333189999999999864 Q gi|254780661|r 152 DDGATSFKLQDLALANGIEHVNAHDAKADVYATLALVWLIREK 194 (471) Q Consensus 152 ~~~~~~~kL~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~ 194 (471) .-..+|+|++|..-.++-+|..++. T Consensus 150 ------------------~K~~~HrAl~DI~eSI~ELk~Yr~~ 174 (181) T PRK05359 150 ------------------KKQGTHRALADIRESIAELKYYREH 174 (181) T ss_pred ------------------CCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf ------------------9888744188999999999999981 No 49 >cd06144 REX4_like This subfamily is composed of RNA exonuclease 4 (REX4 or Rex4p), XPMC2, Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20), and similar proteins. REX4 is involved in pre-rRNA processing. It controls the ratio between the two forms of 5.8S rRNA in yeast. XPMC2 is a Xenopus gene which was identified through its ability to correct a mitotic defect in fission yeast. The human homolog of XPMC2 (hPMC2) may be involved in angiotensin II-induced adrenal cell cycle progression and cell proliferation. ISG20 is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. These proteins are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required fo Probab=99.65 E-value=2.9e-15 Score=116.90 Aligned_cols=148 Identities=16% Similarity=0.177 Sum_probs=102.7 Q ss_pred EEEECCCCCCCCCC--CEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 99975278898772--6048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 6 VIYDYETFGRDVAL--DRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 6 v~~D~ETtG~~~~~--~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) |..|+||||..+.- +.+.++++| |.+|+++ ++. +++|.. .+.+..+.+||||++++.++| +..++..++. T Consensus 1 valDcEm~g~~~~~~~~~larv~vv--d~~g~~v--~d~-~V~P~~--~V~d~~T~~sGIt~~~l~~a~-~~~~v~~~~~ 72 (152) T cd06144 1 VALDCEMVGVGPDGSESALARVSIV--NEDGNVV--YDT-YVKPQE--PVTDYRTAVSGIRPEHLKDAP-DFEEVQKKVA 72 (152) T ss_pred CEEEEEECCCCCCCCEEEEEEEEEE--ECCCCEE--EEE-EECCCC--CCCCCCEEECCCCHHHHCCCC-CHHHHHHHHH T ss_conf 9898760335289987899999999--6789999--887-236997--678763562485799973899-9999999999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 99511796599950557899999999985078532132568871001588886555653210024444454433108989 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) +++. +.++|||| +.||..+|+-..-+.. ..|+..+... . ...+..+++|..| T Consensus 73 ~~l~--~~ilVGH~-l~~Dl~~L~l~~~~~~-------------i~DT~~~~~~--------~----~~~~~~~~sLk~L 124 (152) T cd06144 73 ELLK--GRILVGHA-LKNDLKVLKLDHPKKL-------------IRDTSKYKPL--------R----KTAKGKSPSLKKL 124 (152) T ss_pred HHHC--CCEEEECC-HHHHHHHHHCCCCCCE-------------EEEEEEEHHH--------H----HCCCCCCCCHHHH T ss_conf 9853--88798606-4646999624388760-------------6885564224--------5----4168899279999 Q ss_pred HHH-HCCCC-CCCCHHHHHHHHHHHHHH Q ss_conf 987-18876-431013331899999999 Q gi|254780661|r 164 ALA-NGIEH-VNAHDAKADVYATLALVW 189 (471) Q Consensus 164 a~~-n~i~~-~~aH~A~~D~~~t~~l~k 189 (471) |+. .|++. .++|+|+.||.||++|+| T Consensus 125 ~~~~Lg~~Iq~~~H~s~eDA~at~~Lfk 152 (152) T cd06144 125 AKQLLGLDIQEGEHSSVEDARAAMRLYR 152 (152) T ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 9998588589979696999999999859 No 50 >PRK06722 exonuclease; Provisional Probab=99.62 E-value=5.3e-15 Score=115.17 Aligned_cols=137 Identities=12% Similarity=0.017 Sum_probs=105.3 Q ss_pred CEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 72804002266217888778578989857998899607999799999999995117965999505578999999999850 Q gi|254780661|r 34 WEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRN 113 (471) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~ 113 (471) ..++|+|..+ .+|+..+++.+..+||||++|+.++| +..|++.++.+|+. +.++|+|| ++||.+|||+..-.+ T Consensus 1 ~kIV~rFstL---VNPg~pIp~~It~LTGITdeMV~~aP-~ieeVLp~FleFIG--d~~LVAHN-AsFD~gFL~~~C~~h 73 (242) T PRK06722 1 MKVIGEFSEL---VKPGARLTRHTTKLTGITKKDLIGVE-KFPQIIEKFIQFIG--EDSIFVTW-GKEDYRFLSHDCTLH 73 (242) T ss_pred CCHHHHHHHH---CCCCCCCCHHHHHHCCCCHHHHHCCC-CHHHHHHHHHHHHC--CCEEEEEC-CHHHHHHHHHHHHHC T ss_conf 9327888865---19899998335653386889982799-78999999999827--88389734-402277999775242 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 78532132568871001588886555653210024444454433108989987188764310133318999999999 Q gi|254780661|r 114 FYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLALANGIEHVNAHDAKADVYATLALVWL 190 (471) Q Consensus 114 ~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la~~n~i~~~~aH~A~~D~~~t~~l~k~ 190 (471) +.++.++. ..|..++.+-+..-+|+ -..-+||.++. +..+.-.-.+..|+|++|+.+|+.++-. T Consensus 74 g~e~P~~~------~~~~~dl~k~v~~~~~e------l~~h~ps~~~A-~~q~~~t~~~~~HRAlaDA~~T~ni~l~ 137 (242) T PRK06722 74 GVECPCME------KERRIDLQKFVFQAYEE------LFEHTPSLQSA-VEQLGLIWEGKQHRALADAENTANILLK 137 (242) T ss_pred CCCCCCCC------CHHHHHHHHHHHHCCHH------HHCCCCHHHHH-HHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 89999984------21088899998631244------32138029999-9998502567036778768888999999 No 51 >TIGR01298 RNaseT ribonuclease T; InterPro: IPR005987 Ribonuclease T (3.1.13 from EC) is an enzyme found so far only in gamma-subdivision proteobacteria such as Escherichia coli and Xylella fastidiosa. Ribonuclease T is homologous to the DNA polymerase III alpha chain. It can liberate AMP from the common C-C-A terminus of uncharged tRNA. It appears also to be involved in RNA maturation. It also acts as a 3' to 5' single-strand DNA-specific exonuclease; it is distinctive for its ability to remove residues near a double-stranded stem. Ribonuclease T is a high copy suppressor in E. coli of a UV-repair defect caused by deletion of three other single-stranded DNA exonucleases.; GO: 0004540 ribonuclease activity, 0006396 RNA processing. Probab=99.49 E-value=1.9e-13 Score=105.13 Aligned_cols=173 Identities=24% Similarity=0.244 Sum_probs=129.6 Q ss_pred EEEECCCCCCCCCCCEEEEEEEE--EECCCCEEC-CEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 99975278898772604899999--983897280-400226621788877857898985799889960799979999999 Q gi|254780661|r 6 VIYDYETFGRDVALDRPAQFAGV--RVDRQWEKI-ESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 6 v~~D~ETtG~~~~~~~iiq~~~i--~~d~~~~~~-~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) |+.|.||.|+|...|-.+|||+| ..|+.|-.. |+...+.+.|-++--++|++|.+|||..+.=...-..|..++.+| T Consensus 11 VVvDVET~GfN~~~dALLEiAait~~MDe~G~L~P~~~~~~~~~P~~Gani~P~~L~~~GI~~d~P~R~A~~E~~Al~~i 90 (201) T TIGR01298 11 VVVDVETAGFNAKTDALLEIAAITLKMDEQGYLVPDETLHFHVEPFEGANIEPEALEFTGIDLDHPLRGAVEEKLALEEI 90 (201) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCCCEEECCCCEEEEEEECCCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHH T ss_conf 48861067766013455555420001167300147852578852267998871200220654488752246688999999 Q ss_pred HHHH----HCCCC---EEEEEECHHHHHHHHH-HHHHHHCCC-CCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9995----11796---5999505578999999-999850785-3213256887100158888655565321002444445 Q gi|254780661|r 83 HQFF----SVPNT---CIIGYNNIRFDDYYSR-NIFYRNFYD-SYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDD 153 (471) Q Consensus 83 ~~~~----~~~~~---~~ig~N~~~FD~~~lr-~~~~r~~~~-~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~ 153 (471) ...+ +..+| ++|||| .+||..|+. ...-|...- .+.+.. +.+|+..++-.+|.- T Consensus 91 F~~vR~~~K~~gC~RailV~HN-a~fD~~Fl~~AA~~R~~~KRnPFH~F----~~FDTatLAGl~yGQ------------ 153 (201) T TIGR01298 91 FKVVRKALKASGCQRAILVAHN-ASFDLGFLNRAAVKRTALKRNPFHPF----SVFDTATLAGLAYGQ------------ 153 (201) T ss_pred HHHHHHHHHHCCCCEEEEEECC-CCCCHHHHHHHHHHHHHCCCCCCCCC----CHHHHHHHHHHHHHH------------ T ss_conf 9999999975579603676003-20466689999999861157898765----224689998886417------------ Q ss_pred CCCCCCHHHHHHHHCCC--CCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 44331089899871887--64310133318999999999986422689 Q gi|254780661|r 154 GATSFKLQDLALANGIE--HVNAHDAKADVYATLALVWLIREKKPKLF 199 (471) Q Consensus 154 ~~~~~kL~~la~~n~i~--~~~aH~A~~D~~~t~~l~k~i~~~~p~~~ 199 (471) -=|..-|+.=|++ ..+||.|+.|++-|++|+..|-.+.+.+. T Consensus 154 ----TVL~kA~Q~Ag~~fd~~~AHSA~YDtE~TA~LFC~ivN~wk~~g 197 (201) T TIGR01298 154 ----TVLAKACQAAGLDFDAKQAHSALYDTEKTAELFCEIVNRWKELG 197 (201) T ss_pred ----HHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf ----99998886427886754431012003568999998863045504 No 52 >PRK05601 DNA polymerase III subunit epsilon; Validated Probab=99.45 E-value=1.5e-12 Score=99.37 Aligned_cols=158 Identities=13% Similarity=0.106 Sum_probs=124.3 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 37999752788987726048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) -||-.-+.|||+.|+..+++.|.||..|.+|++.+.|... ++|++ +|+-..|||+|.+++..+| .+.++++++. T Consensus 47 p~va~~~~ttgihpst~r~v~~da~~~da~Ge~~~~f~~~---lNPg~--DPGP~HiHGLT~e~L~gAP-~F~~ia~~l~ 120 (377) T PRK05601 47 PFVAVSIQSSGIHPSTSRMVTIDAVTLSADGEPVETFHAV---LNPKT--DPGPFHLHGVTEEEFASAK-RFGQILKSLD 120 (377) T ss_pred CEEEEEEEECCCCCCCCEEEEEEEEEECCCCCCCCEEEEE---CCCCC--CCCCCCCCCCCHHHHCCCC-CHHHHHHHHH T ss_conf 7279998855757775505888788864789722345771---27999--9998101378999970684-2899999999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCC----------------------CCCCCCHHHHHHHHHHH Q ss_conf 99511796599950557899999999985078532132568----------------------87100158888655565 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDN----------------------GNSRWDLLDVMRAIYAF 141 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~----------------------~~~~~D~l~~~~~~~~~ 141 (471) +.+. |.+.|.|| ..||+.||-+.|.|-...-. ..|.. .+.-+|++-+.|... T Consensus 121 ~lLd--GRtLVaHN-a~fdw~Fia~Ea~RA~~~a~-~~~R~~~r~~~~~~r~rrrqr~gh~P~P~~iv~TLAlaRR~~-- 194 (377) T PRK05601 121 RLID--GRTLLLHN-APRTWGFIVSEAKRAMNDAA-RANRNRNRGNRRGGRGRRRQRVGHIPKPLVIVDTLASARRQA-- 194 (377) T ss_pred HHHC--CCEEEEEC-CCCHHHHHHHHHHHHHHHHH-HHHHHHCCCCCCCCCCCCHHCCCCCCCCCCEEEHHHHHHHCC-- T ss_conf 9845--86799835-60208899999998655423-443210013444444310100367788752100777776418-- Q ss_pred HHHHCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 3210024444454433108989987188764310133318999 Q gi|254780661|r 142 SPDGIQWPSRDDGATSFKLQDLALANGIEHVNAHDAKADVYAT 184 (471) Q Consensus 142 ~p~~~~~~~~~~~~~~~kL~~la~~n~i~~~~aH~A~~D~~~t 184 (471) -+.+..+|+.+|...|++...||.+..++..- T Consensus 195 -----------l~~~d~RL~~vA~~~g~~a~~a~a~~~ra~~~ 226 (377) T PRK05601 195 -----------IALDDVRIRGVAHTLGLDAPAAEASVERAQVP 226 (377) T ss_pred -----------CCCCCHHHHHHHHHCCCCCCCHHHHHHHHCCC T ss_conf -----------99874355666766088863013216563355 No 53 >COG1949 Orn Oligoribonuclease (3'-5' exoribonuclease) [RNA processing and modification] Probab=99.38 E-value=9e-12 Score=94.43 Aligned_cols=159 Identities=18% Similarity=0.200 Sum_probs=114.1 Q ss_pred CCCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHH--------HHHHCCC--HHHHHH Q ss_conf 988379997527889877260489999998389728040022662178887785789--------8985799--889960 Q gi|254780661|r 1 MTNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEA--------VFITGIT--PQKALR 70 (471) Q Consensus 1 m~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a--------~~v~git--~~~~~~ 70 (471) +.+++||.|+|.||++|..|+|||+|+|.||.++|++.+--.+-+. -+++. ...||-| .+.++. T Consensus 4 ~~~nLiWIDlEMTGLd~~~drIIEiA~iVTD~~Lnilaegp~~~Ih------q~~e~L~~Mdew~~~~H~~sGL~~rV~~ 77 (184) T COG1949 4 NKNNLIWIDLEMTGLDPERDRIIEIATIVTDANLNILAEGPVIAIH------QSDEQLAKMDEWNTETHGRSGLTERVKA 77 (184) T ss_pred CCCCEEEEEEEECCCCCCCCEEEEEEEEEECCCCCCCCCCCEEEEE------CCHHHHHHHHHHHHHCCCCCCHHHHHHH T ss_conf 7786289963202688676427888887754763210358557886------7999999889999971454318999997 Q ss_pred CCCCHHHHHHHHHHHHHCC---CCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 7999799999999995117---9659995055789999999998507853213256887100158888655565321002 Q gi|254780661|r 71 DGVVEAEFSRRIHQFFSVP---NTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQ 147 (471) Q Consensus 71 ~~~~~~e~~~~i~~~~~~~---~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~ 147 (471) +..++.++-.+...|+++| +...+..|++.=|..|+....=+ +..|.+ + +.+|+-.+--.+..-.|..+. T Consensus 78 S~~t~~~aE~~~L~flkkwvp~~~spicGNSI~qDRrFl~r~MP~--Le~yfH-Y----R~lDVSTlKELa~RW~P~i~~ 150 (184) T COG1949 78 STVTEAEAEAQTLDFLKKWVPKGVSPICGNSIAQDRRFLFRYMPK--LEAYFH-Y----RYLDVSTLKELARRWNPEILA 150 (184) T ss_pred HHCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHH--HHHHHH-H----HHHHHHHHHHHHHHHCCHHHH T ss_conf 223399999999999998678998877565065789999997655--898744-6----662078799999862937765 Q ss_pred CCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 444445443310898998718876431013331899999999998 Q gi|254780661|r 148 WPSRDDGATSFKLQDLALANGIEHVNAHDAKADVYATLALVWLIR 192 (471) Q Consensus 148 ~~~~~~~~~~~kL~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~ 192 (471) - +.-+++|.|++|..-.+.=+...+ T Consensus 151 ~--------------------~~K~~~H~Al~DI~ESI~EL~~Yr 175 (184) T COG1949 151 G--------------------FKKGGTHRALDDIRESIAELRYYR 175 (184) T ss_pred H--------------------HCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 2--------------------103652346999999999999999 No 54 >cd06149 ISG20 Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20) is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. It was also independently identified by its response to estrogen and was called HEM45 (human estrogen regulated transcript). ISG20 is a DEDDh-type DnaQ-like 3'-5' exonuclease containing three conserved sequence motifs termed ExoI, ExoII and ExoIII with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ISG20 may be a major effector of innate immunity against pathogens including viruses, bacteria, and parasites. It is located in promyelocytic leukemia (PML) nuclear bodies, sites for oncogenic DNA viral transcription and replication. It may carry out its function by degrading viral RNAs as part of the IFN-regulated antiviral r Probab=99.09 E-value=1.6e-09 Score=79.92 Aligned_cols=151 Identities=17% Similarity=0.107 Sum_probs=99.8 Q ss_pred EEEECCCCCCCCC--CCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 9997527889877--26048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 6 VIYDYETFGRDVA--LDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 6 v~~D~ETtG~~~~--~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) |-.|+|.-|..+. ...+..+++| |.+|+++ ++. +++|... +...-+.++|||++++.++. +..++.+++. T Consensus 1 vAlDCEMv~t~~g~~~~~LaRvsvV--d~~G~vl--~D~-~VkP~~~--V~dy~T~~SGIt~~~l~~a~-~~~~~~~~l~ 72 (157) T cd06149 1 VAIDCEMVGTGPGGRESELARCSIV--NYHGDVL--YDK-YIRPEGP--VTDYRTRWSGIRRQHLVNAT-PFAVAQKEIL 72 (157) T ss_pred CEEECEEEEECCCCCEEEEEEEEEE--ECCCCEE--EEE-EECCCCC--CCCCEEECCCCCHHHHCCCC-CHHHHHHHHH T ss_conf 9793653427489984689999999--2789898--674-3388987--78636562678888957998-4999999999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 99511796599950557899999999985078532132568871001588886555653210024444454433108989 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) +++. +.++|||+ +.+|...|+-. .|.+ ...|+-.. .+.+..-. ...+.+.+|-.| T Consensus 73 ~~l~--~~IlVGH~-L~nDL~aL~l~-----h~~~--------~i~DT~~~-----~~~~~~~~----~p~~~~~sLk~L 127 (157) T cd06149 73 KILK--GKVVVGHA-IHNDFKALKYF-----HPKH--------MTRDTSTI-----PLLNRKAG----FPENCRVSLKVL 127 (157) T ss_pred HHHC--CCEEEEEC-CHHHHHHHHCC-----CCCC--------CEEECCCC-----EEHHHHCC----CCCCCCHHHHHH T ss_conf 9716--98899826-07669886033-----8987--------26777111-----00133248----987668759999 Q ss_pred HHHH---CCCCCC-CCHHHHHHHHHHHHHH Q ss_conf 9871---887643-1013331899999999 Q gi|254780661|r 164 ALAN---GIEHVN-AHDAKADVYATLALVW 189 (471) Q Consensus 164 a~~n---~i~~~~-aH~A~~D~~~t~~l~k 189 (471) |... .|..++ +||++.||.||++|.| T Consensus 128 ~~~~L~~~IQ~~~~gHds~EDA~aa~~L~k 157 (157) T cd06149 128 AKRLLHRDIQVGRQGHSSVEDARATMELYK 157 (157) T ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 999848502599988080999999999859 No 55 >cd05782 DNA_polB_like1_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=99.03 E-value=1.2e-08 Score=74.24 Aligned_cols=115 Identities=17% Similarity=0.108 Sum_probs=80.1 Q ss_pred CCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHCCCCEE Q ss_conf 89877260489999998389728040022662178887785789898579988996079997999999999951179659 Q gi|254780661|r 14 GRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQFFSVPNTCI 93 (471) Q Consensus 14 G~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~~~~~~~~~~ 93 (471) ++...+.+||.|+++..+.++.. ++.. ...+..|.++++.+.+.+...++.+ T Consensus 45 flp~~f~rIV~Is~~~~~~~~~~--------~~~~--------------------~~~~~~E~~ll~~F~~~i~~~~p~l 96 (208) T cd05782 45 FLPLPFHKVVSISALYRDDDGGF--------LKVR--------------------TLDGADEKELLEDFFQLIEKKNPRL 96 (208) T ss_pred CCHHHHCCEEEEEEEEEECCCCE--------EEEE--------------------ECCCCCHHHHHHHHHHHHHHCCCEE T ss_conf 54466386279999999558977--------9984--------------------1799999999999999998659979 Q ss_pred EEEECHHHHHHHHHHHHHHHCCCCCCC-C---------CCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 995055789999999998507853213-2---------568871001588886555653210024444454433108989 Q gi|254780661|r 94 IGYNNIRFDDYYSRNIFYRNFYDSYRW-S---------WDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 94 ig~N~~~FD~~~lr~~~~r~~~~~y~~-~---------~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) ||||+.+||.+||..-..+++++...+ . +.-+..++|++...... ..+.+.+|+.+ T Consensus 97 v~~Ng~~FD~P~l~~Ral~~gi~~p~~~~~~~~~~~y~~r~~~~H~Dl~d~l~~~--------------~~~~~~sLd~l 162 (208) T cd05782 97 VSFNGRGFDLPVLHLRALIHGVSAPAYFDLGNKDWNYRNRYSERHLDLMDLLAFY--------------GARARASLDLL 162 (208) T ss_pred EEECCCCCCHHHHHHHHHHCCCCCCHHHHCCCCCCCCCCCCCCCCCCHHHHHHCC--------------CCCCCCCHHHH T ss_conf 9636875758999999999198975554048874112488876313299998554--------------87577889999 Q ss_pred HHHHCCC Q ss_conf 9871887 Q gi|254780661|r 164 ALANGIE 170 (471) Q Consensus 164 a~~n~i~ 170 (471) |...|++ T Consensus 163 a~llG~p 169 (208) T cd05782 163 AKLLGIP 169 (208) T ss_pred HHHHCCC T ss_conf 9983899 No 56 >cd06137 DEDDh_RNase This group is composed of eukaryotic exoribonucleases that include PAN2, RNA exonuclease 1 (REX1 or Rex1p), REX3 (Rex3p), REX4 (or Rex4p), ISG20, and similar proteins. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. REX proteins are required for the processing and maturation of many RNA species, and ISG20 is an interferon-induced antiviral exonuclease with a strong preference for single-stranded RNA. Probab=99.01 E-value=4.6e-09 Score=77.01 Aligned_cols=150 Identities=19% Similarity=0.209 Sum_probs=99.6 Q ss_pred EEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCC-----CHHHHH- Q ss_conf 99975278898772604899999983897280400226621788877857898985799889960799-----979999- Q gi|254780661|r 6 VIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGV-----VEAEFS- 79 (471) Q Consensus 6 v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~-----~~~e~~- 79 (471) |-.|+|+.|..+..+.++++++|-. .+|+++ ++. +++|.. .+...-+.++|||++++.+++. +.++.+ T Consensus 1 VAlDCEMv~~~~~~~el~rvs~Vd~-~~g~vl--~d~-~VkP~~--~V~Dy~T~~SGIt~~~l~~a~~~~~~~~~~~~a~ 74 (161) T cd06137 1 VALDCEMVGLADGDSEVVRISAVDV-LTGEVL--IDS-LVRPSV--RVTDWRTRFSGVTPADLEEAAKAGKTIFGWEAAR 74 (161) T ss_pred CEEEEEEECCCCCCCEEEEEEEEEC-CCCEEE--EEE-CCCCCC--CCCCCEEECCCCCHHHHHHHHHCCCCCCCHHHHH T ss_conf 9796452332299725899999972-799199--986-335984--3230143325799999854342153333199999 Q ss_pred HHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 99999951179659995055789999999998507853213256887100158888655565321002444445443310 Q gi|254780661|r 80 RRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFK 159 (471) Q Consensus 80 ~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~k 159 (471) ..+.+++. .++++|||. +.+|...|+-. -| ...|+..+. |..... ....+..+ T Consensus 75 ~~l~~~i~-~~tIlVGH~-L~nDL~aL~~~------H~---------~viDTa~l~-------~~~~~~---~~~~~~~s 127 (161) T cd06137 75 AALWKFID-PDTILVGHS-LQNDLDALRMI------HT---------RVVDTAILT-------REAVKG---PLAKRQWS 127 (161) T ss_pred HHHHHHCC-CCCEEEECC-HHHHHHHHCCC------CC---------CEEECEEEC-------CHHCCC---CCCCCCHH T ss_conf 99997458-996798510-13108786277------98---------354320332-------001157---88888866 Q ss_pred HHHHHH-HHCCCC--C-CCCHHHHHHHHHHHHH Q ss_conf 898998-718876--4-3101333189999999 Q gi|254780661|r 160 LQDLAL-ANGIEH--V-NAHDAKADVYATLALV 188 (471) Q Consensus 160 L~~la~-~n~i~~--~-~aH~A~~D~~~t~~l~ 188 (471) |..||+ ..|... + +.|++..||.||++|. T Consensus 128 Lk~La~~~L~~~IQ~g~~gHds~EDA~aam~L~ 160 (161) T cd06137 128 LRTLCRDFLGLKIQGGGEGHDSLEDALAAREVV 160 (161) T ss_pred HHHHHHHHHCHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 999999982877528998849199999999985 No 57 >KOG3242 consensus Probab=98.88 E-value=4.4e-08 Score=70.69 Aligned_cols=164 Identities=16% Similarity=0.218 Sum_probs=108.2 Q ss_pred CCCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCC--HHHHHHH---CCCHHHHHHCCCCH Q ss_conf 988379997527889877260489999998389728040022662178887785--7898985---79988996079997 Q gi|254780661|r 1 MTNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPD--PEAVFIT---GITPQKALRDGVVE 75 (471) Q Consensus 1 m~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~--p~a~~v~---git~~~~~~~~~~~ 75 (471) |..-+||.|+|.||++...|+|+|+|+|.||.+++++.+--.+.++-++.+.-+ ---..-| |+| +.+..+..+. T Consensus 24 l~q~lVWiD~EMTGLdvekd~i~EiacIITD~dL~~~~egpd~vI~~~~evld~MneWc~ehhg~SGLt-~kv~~S~~tl 102 (208) T KOG3242 24 LKQPLVWIDCEMTGLDVEKDRIIEIACIITDGDLNPVAEGPDLVIHQPKEVLDKMNEWCIEHHGNSGLT-EKVLASKITL 102 (208) T ss_pred CCCCEEEEEEECCCCCCCCCEEEEEEEEEECCCCCCCCCCCCCHHCCCHHHHHHHHHHHHHHCCCHHHH-HHHHHHHCCH T ss_conf 248648996101255323131687888876587550033864042298999988889999831411479-9999731439 Q ss_pred HHHHHHHHHHHHC---CCCEEEEEECHHHHHHHHHHHHHHHC-CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999511---79659995055789999999998507-8532132568871001588886555653210024444 Q gi|254780661|r 76 AEFSRRIHQFFSV---PNTCIIGYNNIRFDDYYSRNIFYRNF-YDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSR 151 (471) Q Consensus 76 ~e~~~~i~~~~~~---~~~~~ig~N~~~FD~~~lr~~~~r~~-~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~ 151 (471) .++-.++.+++.. +|.+.++.|+..-|..|+...+=.-. +-+| +-.|+-.+--.+..-+|...... T Consensus 103 ~~aEnevl~yikk~ip~~~~~laGNSV~~DrlFl~k~mPk~~~~lhy--------rivDVStIkeL~~Rw~P~~~~~a-- 172 (208) T KOG3242 103 ADAENEVLEYIKKHIPKGKCPLAGNSVYMDRLFLKKYMPKLIKHLHY--------RIVDVSTIKELARRWYPDIKARA-- 172 (208) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCE--------EEEEHHHHHHHHHHHCCHHHCCC-- T ss_conf 99999999999986789888756860346799999985889975254--------56647879999998590322007-- Q ss_pred CCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 45443310898998718876431013331899999999998 Q gi|254780661|r 152 DDGATSFKLQDLALANGIEHVNAHDAKADVYATLALVWLIR 192 (471) Q Consensus 152 ~~~~~~~kL~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~ 192 (471) .++ ..+|+|+.|..-.+.=++..+ T Consensus 173 --PkK---------------~~~HrAldDI~ESI~ELq~Yr 196 (208) T KOG3242 173 --PKK---------------KATHRALDDIRESIKELQYYR 196 (208) T ss_pred --CCC---------------CCCCCHHHHHHHHHHHHHHHH T ss_conf --210---------------014010889999999999999 No 58 >cd05781 DNA_polB_B3_exo The 3'-5' exonuclease domain of archaeal proteins with similarity to Sulfurisphaera ohwakuensis DNA polymerase B3. B3 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B3 exhibits both polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Archaeal proteins that are involved in DNA replication are similar to those from eukaryotes. Some archaea possess multiple family-B DNA polymerases. B3 is mainly found in crenarch Probab=98.83 E-value=8.8e-08 Score=68.77 Aligned_cols=125 Identities=16% Similarity=0.121 Sum_probs=84.0 Q ss_pred CCEEEEECCCCC----CCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHH Q ss_conf 837999752788----9877260489999998389728040022662178887785789898579988996079997999 Q gi|254780661|r 3 NHFVIYDYETFG----RDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEF 78 (471) Q Consensus 3 ~~~v~~D~ETtG----~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~ 78 (471) -+.+.||+||++ -+|..|.|++||... .++.. ++ ... ...++.++ T Consensus 3 lk~lsfDIE~~~~~gfp~~~~d~Ii~Is~~~--~~g~~--~~--~~~-------------------------~~~~E~~l 51 (188) T cd05781 3 LKTLAFDIEVYSKYGTPNPRRDPIIVISLAT--SNGDV--EF--ILA-------------------------EGLDDRKI 51 (188) T ss_pred CEEEEEEEEECCCCCCCCCCCCCEEEEEEEE--CCCCE--EE--EEE-------------------------CCCCHHHH T ss_conf 4199999887689998998889789999996--89976--99--984-------------------------79998999 Q ss_pred HHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCCC--------------CCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 9999999511796-5999505578999999999850785321--------------325688710015888865556532 Q gi|254780661|r 79 SRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSYR--------------WSWDNGNSRWDLLDVMRAIYAFSP 143 (471) Q Consensus 79 ~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~--------------~~~~~~~~~~D~l~~~~~~~~~~p 143 (471) +.++.+++....| +++|||..+||.++|..-.-+.+.+.-. .....|.-.+|++..++..+ T Consensus 52 l~~f~~~i~~~~Pdii~gyN~~~FD~pyl~~Ra~~~~i~l~~gr~~~~~~~~~~~~~~~i~Gr~~iDl~~~~~~~~---- 127 (188) T cd05781 52 IREFVKYVKEYDPDIIVGYNSNAFDWPYLVERARVLGVKLDVGRRGGSEPSTGVYGHYSITGRLNVDLYDFAEEIP---- 127 (188) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECHHHHHHHC---- T ss_conf 9999999987299999966986757899999999969985015567765122542069987899997789997624---- Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHCCCCC Q ss_conf 10024444454433108989987188764 Q gi|254780661|r 144 DGIQWPSRDDGATSFKLQDLALANGIEHV 172 (471) Q Consensus 144 ~~~~~~~~~~~~~~~kL~~la~~n~i~~~ 172 (471) .-++++|+.+|+..|.... T Consensus 128 ----------~l~sysL~~Va~~lg~~~k 146 (188) T cd05781 128 ----------EVKVKTLENVAEYLGVMKK 146 (188) T ss_pred ----------CCCCCCHHHHHHHHCCCCC T ss_conf ----------7564579999998541224 No 59 >cd06139 DNA_polA_I_Ecoli_like_exo The 3'-5' exonuclease domain of DNA polymerases has a fundamental role in reducing polymerase errors and is involved in proofreading activity. E.coli-like Polymerase I (pol I), a subgroup of family-A DNA polymerases, contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain in the same polypeptide chain as the polymerase domain. The exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. E. coli DNA pol I is involved in genome replication but is not the main replicating enzyme. It is also implicated in DNA repair. Probab=98.82 E-value=4.9e-07 Score=63.95 Aligned_cols=145 Identities=19% Similarity=0.204 Sum_probs=91.5 Q ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 88379997527889877260489999998389728040022662178887785789898579988996079997999999 Q gi|254780661|r 2 TNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) ....+-||+||||++|..++|+-++. .++. + + .+++ |-.+. . + ..+.+..++... T Consensus 4 ~~~~ia~DtETtgl~~~~~~ivGisl-s~~~-~----~--~~Yi-P~~h~--~-------~-------~~~l~~~~v~~~ 58 (193) T cd06139 4 KAKVFAFDTETTSLDPMQAELVGISF-AVEP-G----E--AYYI-PLGHD--Y-------G-------GEQLPREEVLAA 58 (193) T ss_pred CCCEEEEECCCCCCCCCCCEEEEEEE-EECC-C----C--EEEE-EECCC--C-------C-------CCCCCHHHHHHH T ss_conf 59878997836999868884999999-8079-9----6--8998-72365--6-------5-------446799999999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) +.++|.+....-|||| +.||..+|+ ++++.. .+..+|++-.++. +.|+ ..++.|. T Consensus 59 lk~~led~~i~Kv~hn-~k~D~~~L~----~~gi~~-------~~~~~DTmiaa~l---l~~~----------~~~~~L~ 113 (193) T cd06139 59 LKPLLEDPSIKKVGQN-LKFDLHVLA----NHGIEL-------RGPAFDTMLASYL---LNPG----------RRRHGLD 113 (193) T ss_pred HHHHHCCCCCCEEEEC-CHHHHHHHH----HCCCCC-------CCCCHHHHHHHHH---CCCC----------CCCCCHH T ss_conf 9999739998688754-797899999----839866-------7830218999987---2799----------8888999 Q ss_pred HHHHHH-CCCC---------------------CC-CCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 899871-8876---------------------43-10133318999999999986422 Q gi|254780661|r 162 DLALAN-GIEH---------------------VN-AHDAKADVYATLALVWLIREKKP 196 (471) Q Consensus 162 ~la~~n-~i~~---------------------~~-aH~A~~D~~~t~~l~k~i~~~~p 196 (471) .+|... |.+. +. ++=|..||.+|..|...+..+.. T Consensus 114 ~La~~~l~~~~~~~~~l~gkg~~~~~~~~~~~~~~~~YAa~Da~~t~~L~~~l~~~L~ 171 (193) T cd06139 114 DLAERYLGHKTISFEDLVGKGKKQITFDQVPLEKAAEYAAEDADITLRLYELLKPKLK 171 (193) T ss_pred HHHHHHHCCCCCCHHHHHCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999981887663899837785537831332999999999999999999999999998 No 60 >COG5018 KapD Inhibitor of the KinA pathway to sporulation, predicted exonuclease [General function prediction only] Probab=98.77 E-value=3.6e-08 Score=71.29 Aligned_cols=171 Identities=15% Similarity=0.071 Sum_probs=121.1 Q ss_pred CCCEEEEECCCCC----CCCCCCEEEEEEEEEECC-CCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHH Q ss_conf 8837999752788----987726048999999838-97280400226621788877857898985799889960799979 Q gi|254780661|r 2 TNHFVIYDYETFG----RDVALDRPAQFAGVRVDR-QWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEA 76 (471) Q Consensus 2 ~~~~v~~D~ETtG----~~~~~~~iiq~~~i~~d~-~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~ 76 (471) +...++.|+|.|- -.+....||+++|.+++. +-.++|+|.+ +|||..--..+-.---+|||+-..+-.++ .+. T Consensus 3 ~~~lLIID~EaT~~eG~~~~~e~eiiei~a~lv~~id~~vvd~F~s-yVRP~~~P~Lt~~Ckslt~I~Q~~VD~ap-ifs 80 (210) T COG5018 3 TNSLLIIDFEATMPEGKYSPQEFEIIEIEAGLVKSIDDEVVDTFSS-YVRPKKFPKLTKRCKSLTKITQKQVDEAP-IFS 80 (210) T ss_pred CCEEEEEEEEEECCCCCCCCHHCEEEEEHHHHHHHHHHHHHHHHHH-HCCCCCCCHHHHHHHHHHHHHHHHCCCCC-HHH T ss_conf 7548999756535677877033433442236888756889999997-42754373288999886203464036550-488 Q ss_pred HHHHHHHHHHHCC----CCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9999999995117----965999505578999999999850785321325688710015888865556532100244444 Q gi|254780661|r 77 EFSRRIHQFFSVP----NTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRD 152 (471) Q Consensus 77 e~~~~i~~~~~~~----~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~ 152 (471) .+...+..++++. .+.++.|. ..|...+.+..+.|.++|... .|- ..|+ ..++.+....|.. T Consensus 81 ~v~E~f~r~L~~h~Pr~~~~wa~wG--~~Dm~~l~q~~~~~~~~p~~~---kgp-~vdl-------~~~yk~v~~~pr~- 146 (210) T COG5018 81 MVFEDFIRKLNEHDPRKNSTWATWG--NMDMKVLKQNCMFNHIPPFPF---KGP-MVDL-------SLEYKNVFGDPRL- 146 (210) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCC--CHHHHHHHHHHHHCCCCCCCC---CCC-CCHH-------HHHHHHHHCCCCC- T ss_conf 8999999999854955377502026--041999999998617998534---575-1008-------9998787359852- Q ss_pred CCCCCCCHHHHHHHHCCCC-CCCCHHHHHHHHHHHHHHHHHH Q ss_conf 5443310898998718876-4310133318999999999986 Q gi|254780661|r 153 DGATSFKLQDLALANGIEH-VNAHDAKADVYATLALVWLIRE 193 (471) Q Consensus 153 ~~~~~~kL~~la~~n~i~~-~~aH~A~~D~~~t~~l~k~i~~ 193 (471) -.|...++..|.+- |..|+|+.|+..+..|+|++-. T Consensus 147 -----tgln~ale~~G~sf~G~~HraldDArn~~rl~klv~~ 183 (210) T COG5018 147 -----TGLNKALEEYGDSFTGTHHRALDDARNAYRLFKLVEQ 183 (210) T ss_pred -----CCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf -----1378999984245677301447778889999998712 No 61 >cd05785 DNA_polB_like2_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=98.74 E-value=5.8e-07 Score=63.51 Aligned_cols=129 Identities=19% Similarity=0.096 Sum_probs=84.3 Q ss_pred CEEEEECCCCCCCC--------CCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCH Q ss_conf 37999752788987--------7260489999998389728040022662178887785789898579988996079997 Q gi|254780661|r 4 HFVIYDYETFGRDV--------ALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVE 75 (471) Q Consensus 4 ~~v~~D~ETtG~~~--------~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~ 75 (471) ..+.||+||++.+. ..|.|++||.. +..+.. .+ +.+ .+.++ T Consensus 10 r~lsfDIE~~~~~g~~~p~p~~~~d~Ii~I~~~--~~~~~~--~v--~~~-------------------------~~~~E 58 (207) T cd05785 10 RRLQLDIETYSLPGFFFSNPDRGDDRIIIVALR--DNRGWE--EV--LHA-------------------------EDAAE 58 (207) T ss_pred EEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEE--ECCCCE--EE--EEE-------------------------CCCCH T ss_conf 599999974788997799988899978999988--589977--98--762-------------------------69999 Q ss_pred HHHHHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCCCC------------------------CCCCCCCCCC Q ss_conf 9999999999511796-59995055789999999998507853213------------------------2568871001 Q gi|254780661|r 76 AEFSRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSYRW------------------------SWDNGNSRWD 130 (471) Q Consensus 76 ~e~~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~~------------------------~~~~~~~~~D 130 (471) .+++..+.+++....| +++|||..+||.++|..=.-+.+.+...- ....|...+| T Consensus 59 ~~lL~~F~~~i~~~dPDIItGyNi~~FD~pYL~~Ra~~~~i~~~lgR~~~~~~~~~~~~~~~~~~~~~~~~~i~Gr~~lD 138 (207) T cd05785 59 KELLEELVAIIRERDPDVIEGHNIFRFDLPYLRRRCRRHGVPLAIGRDGSIPRQRPSRFRFAERLIDYPRYDIPGRHVID 138 (207) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCEEECCEEEECCCCCCEEEEEECCEEEEE T ss_conf 99999999999873999998679877588999999999599722257898744640302122355530499873648865 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCCC Q ss_conf 5888865556532100244444544331089899871887643 Q gi|254780661|r 131 LLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLALANGIEHVN 173 (471) Q Consensus 131 ~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la~~n~i~~~~ 173 (471) ++.+++....- .-.-+||+|+.+|+..|+..++ T Consensus 139 ~~~~~k~~~~~----------~~~L~SY~L~~Va~~~~l~~~~ 171 (207) T cd05785 139 TYFLVQLFDVS----------SRDLPSYGLKAVAKHFGLASPD 171 (207) T ss_pred HHHHHHHHCCC----------CCCCCCCCHHHHHHHHHHCCCC T ss_conf 49999864211----------1467765289999987313666 No 62 >KOG0542 consensus Probab=98.74 E-value=5.9e-07 Score=63.44 Aligned_cols=167 Identities=11% Similarity=0.175 Sum_probs=112.3 Q ss_pred CEEEEECCCCCCCCCC----CEEEEEEEEEEC-CCCEECC-EEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHH Q ss_conf 3799975278898772----604899999983-8972804-002266217888778578989857998899607999799 Q gi|254780661|r 4 HFVIYDYETFGRDVAL----DRPAQFAGVRVD-RQWEKIE-STEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAE 77 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~----~~iiq~~~i~~d-~~~~~~~-~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e 77 (471) -+++.|+|+|--.... .+|||+.+|..| .+-.+|+ +|+. ++||......+..-..+|||.-+++ +++.++.+ T Consensus 57 YLliiDFEaTC~e~~~~~~~~EIIEfP~V~l~~~~~~~Ie~eF~q-YVrP~~np~LS~fC~~lTgI~Q~tV-D~a~~f~~ 134 (280) T KOG0542 57 YLLILDFEATCEEGNKPHYVQEIIEFPAVLLDNTETSIIEDEFHQ-YVRPVENPRLSDFCTSLTGIQQETV-DEAPTFPQ 134 (280) T ss_pred EEEEEEEEEECCCCCCCCCCHHHEECCEEEEECCCCHHHHHHHHH-HHCCCCCCHHHHHHHHHHCCHHHHH-CCCCCHHH T ss_conf 699986300300358888612302634456734430136788886-5265348538888988628507651-45786799 Q ss_pred HHHHHHHHHHC-------CCCEEEEEECHHHHHH-HHHHHHHHHCC--CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 99999999511-------7965999505578999-99999985078--53213256887100158888655565321002 Q gi|254780661|r 78 FSRRIHQFFSV-------PNTCIIGYNNIRFDDY-YSRNIFYRNFY--DSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQ 147 (471) Q Consensus 78 ~~~~i~~~~~~-------~~~~~ig~N~~~FD~~-~lr~~~~r~~~--~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~ 147 (471) +..++...+.. ....||.+.. -|.. ||...+-+..+ |+|..+| .|+- ..+.+..+ T Consensus 135 vl~~f~~Wlr~~~~~~k~~~~Afvtdg~--wDl~~~l~~qck~~~i~~P~~f~qw------Inir-------k~yk~~y~ 199 (280) T KOG0542 135 VLSEFDSWLRKDSLGDKNGKFAFVTDGD--WDLWVFLQYQCKLKNIRIPAFFNQW------INIR-------KIYKNFYN 199 (280) T ss_pred HHHHHHHHHHHHHCCCCCCCEEEEECCH--HHHHHHHHHHHHHHCCCCCHHHHHH------HHHH-------HHHHHHHC T ss_conf 9999999999863123347668996754--2499999999777278884899878------6799-------99999956 Q ss_pred CCCCCCCCCCCCHHHHHHHHCCCCC-CCCHHHHHHHHHHHHHHHHHH Q ss_conf 4444454433108989987188764-310133318999999999986 Q gi|254780661|r 148 WPSRDDGATSFKLQDLALANGIEHV-NAHDAKADVYATLALVWLIRE 193 (471) Q Consensus 148 ~~~~~~~~~~~kL~~la~~n~i~~~-~aH~A~~D~~~t~~l~k~i~~ 193 (471) .| ..-+++.+.+..|+.++ ++|+++.||.-++.+++.+.. T Consensus 200 ~~------~~t~it~mLe~~gL~f~Gr~HsGiDDa~Nia~I~~kM~~ 240 (280) T KOG0542 200 RP------APTNITGMLEHYGLQFEGRAHSGIDDARNIARIAQKMIR 240 (280) T ss_pred CC------CCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 76------656778899971976568756672447789999999975 No 63 >cd05779 DNA_polB_epsilon_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase epsilon. DNA polymerase epsilon is a family-B DNA polymerase with a catalytic subunit that contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (alpha and delta are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase epsilon plays a role in elongating the leading strand during DNA replication. It is also involved in DNA repair. The catalytic subunit contains both polymerase and 3'-5' exonuclease activities. The N-terminal exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. DNA polymerase epsilon also carries a unique large C-terminal domain with an unknown function. Phylogenetic analyses indicate that it is ort Probab=98.74 E-value=2.4e-07 Score=65.95 Aligned_cols=150 Identities=19% Similarity=0.098 Sum_probs=85.3 Q ss_pred CEEEEECCCCCC-----CCCCCEEEEEEEEEECCCCEECCEEEEEEECCCC-CCCCCHHHHHHHCCCHHHHHHCCCCHHH Q ss_conf 379997527889-----8772604899999983897280400226621788-8778578989857998899607999799 Q gi|254780661|r 4 HFVIYDYETFGR-----DVALDRPAQFAGVRVDRQWEKIESTEVFFCKPAD-DYLPDPEAVFITGITPQKALRDGVVEAE 77 (471) Q Consensus 4 ~~v~~D~ETtG~-----~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~-~~~p~p~a~~v~git~~~~~~~~~~~~e 77 (471) ..+-||+||++. +|..|.|||||.+.. ..+..+-........... ...|.|+. .|. -.+... .+|.+ T Consensus 3 rilsfDIE~~~~~~~fP~~~~D~Ii~Is~~~~-~~g~~i~~~~~~~~~i~~~~~~~~~~~---~g~--~~v~~~-~~E~~ 75 (204) T cd05779 3 RVLAFDIETTKLPLKFPDAETDQIMMISYMID-GQGYLIVNREIVSEDIEDFEYTPKPEY---EGP--FKVFNE-PDEKA 75 (204) T ss_pred EEEEEEEEEECCCCCCCCCCCCEEEEEEEEEE-CCCEEEEECCCCCCCCCCCCCCCCCCC---CCC--EEEEEC-CCHHH T ss_conf 79999999507999998988890899999991-687798743401353300014678566---576--489836-99999 Q ss_pred HHHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCCC-C---CCC------CCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 99999999511796-5999505578999999999850785321-3---256------88710015888865556532100 Q gi|254780661|r 78 FSRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSYR-W---SWD------NGNSRWDLLDVMRAIYAFSPDGI 146 (471) Q Consensus 78 ~~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~-~---~~~------~~~~~~D~l~~~~~~~~~~p~~~ 146 (471) ++.++.+++....+ +++|||+.+||.++|..-.-+.+++... + .+. .+-.++|++..++.-+. T Consensus 76 ll~~f~~~i~~~~pdii~gyN~~~FD~pyl~~Ra~~~~~~~~~~~~~~~~~~~~~~~~~~vhiDly~~v~rd~~------ 149 (204) T cd05779 76 LLQRFFEHIREVKPHIIVTYNGDFFDWPFVEARAAIHGLSMEEEIGFRKDSEGEYKSRYIIHMDCFRWVKRDSY------ 149 (204) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHCCCHHHHCCCCCCCCCCEECCCCEEEECHHHHHHHCC------ T ss_conf 99999999986499999855876776799999999959982553180006777276267256300776510121------ Q ss_pred CCCCCCCCCCCCCHHHHHHH-HCCCCC Q ss_conf 24444454433108989987-188764 Q gi|254780661|r 147 QWPSRDDGATSFKLQDLALA-NGIEHV 172 (471) Q Consensus 147 ~~~~~~~~~~~~kL~~la~~-n~i~~~ 172 (471) -...|++|..+|++ .|..-. T Consensus 150 ------l~~~sy~Ln~Vs~~~Lg~~K~ 170 (204) T cd05779 150 ------LPQGSQGLKAVTKAKLGYDPV 170 (204) T ss_pred ------CCCCCCCHHHHHHHHHCCCCC T ss_conf ------755687789999999689975 No 64 >KOG4793 consensus Probab=98.66 E-value=2.2e-07 Score=66.17 Aligned_cols=179 Identities=17% Similarity=0.162 Sum_probs=119.4 Q ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEE------------EECCCC-------EECCEEEEEEECCCCCCCCCHHHHHHHC Q ss_conf 883799975278898772604899999------------983897-------2804002266217888778578989857 Q gi|254780661|r 2 TNHFVIYDYETFGRDVALDRPAQFAGV------------RVDRQW-------EKIESTEVFFCKPADDYLPDPEAVFITG 62 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~~~~~iiq~~~i------------~~d~~~-------~~~~~~~~~~~~~~~~~~p~p~a~~v~g 62 (471) +.+|+|+|+|+||+......|.|++.. -+|++. .+++..+.++ -|..+-.|.|..+|| T Consensus 12 ~~tf~fldleat~lp~~~~~iteLcLlav~assle~k~~e~dq~~~~tlp~~Rvl~Klsvl~---~p~~v~~p~aeeitg 88 (318) T KOG4793 12 LRTFSFLDLEATGLPGWIPNITELCLLAVHASSLEGKAREIDQNVSTTLPGSRVLDKLSVLG---GPVPVTRPIAEEITG 88 (318) T ss_pred EEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHCC---CCCCCCCHHHHHHCC T ss_conf 03787541002568755664035668888987623876521258875478620221201105---786776715554102 Q ss_pred CCHHHHHHCCCCH--HHHHHHHHHHHHC-CCC-EEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 9988996079997--9999999999511-796-59995055789999999998507853213256887100158888655 Q gi|254780661|r 63 ITPQKALRDGVVE--AEFSRRIHQFFSV-PNT-CIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAI 138 (471) Q Consensus 63 it~~~~~~~~~~~--~e~~~~i~~~~~~-~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~ 138 (471) .+...+.-++... -.+++.+.-|+++ +++ |.|.||+-.||.+++++.+.+-++.-.. + --..|.++..++. T Consensus 89 ls~~~~~l~rr~~~D~dla~LL~afls~lp~p~CLVaHng~~~dfpil~qela~lg~~lpq---~--lvcvdslpa~~al 163 (318) T KOG4793 89 LSQPFLALQRRLAFDKDLAKLLTAFLSRLPTPGCLVAHNGNEYDFPILAQELAGLGYSLPQ---D--LVCVDSLPALNAL 163 (318) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHCCCCCHH---H--HCCCCHHHHHHHH T ss_conf 3418888888862038799999999963899716885248765428899999724756604---4--2074415888877 Q ss_pred HHHHHHHCCCCCCCCCCCCCCHHHHHHHHCC--CCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 5653210024444454433108989987188--76431013331899999999998 Q gi|254780661|r 139 YAFSPDGIQWPSRDDGATSFKLQDLALANGI--EHVNAHDAKADVYATLALVWLIR 192 (471) Q Consensus 139 ~~~~p~~~~~~~~~~~~~~~kL~~la~~n~i--~~~~aH~A~~D~~~t~~l~k~i~ 192 (471) .. .-.-.....+.+++.|+.+-...-= +-...|.|..|+.+..-++.... T Consensus 164 d~----a~s~~tr~~~~~~~~l~~If~ry~~q~eppa~~~~e~d~~~l~~~fqf~~ 215 (318) T KOG4793 164 DR----ANSMVTRPEVRRMYSLGSIFLRYVEQREPPAGHVAEGDVNGLLFIFQFRI 215 (318) T ss_pred HH----HCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCEEEECCCCHHHHHHHHHH T ss_conf 65----04754477777653002378764113599864354424311577899889 No 65 >cd06145 REX1_like This subfamily is composed of RNA exonuclease 1 (REX1 or Rex1p), REX3 (or Rex3p), and similar eukaryotic proteins. In yeast, REX1 and REX3 are required for 5S rRNA and MRP (mitochondrial RNA processing) RNA maturation, respectively. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. REX1 is the major exonuclease responsible for pre-tRNA trail trimming and may also be involved in nuclear CCA turnover. REX proteins function in the processing and maturation of many RNA species, similar to the function of E.coli RNase T. Probab=98.60 E-value=6.6e-07 Score=63.13 Aligned_cols=144 Identities=19% Similarity=0.178 Sum_probs=94.3 Q ss_pred EECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 97527889877260489999998389728040022662178887785789898579988996079997999999999951 Q gi|254780661|r 8 YDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQFFS 87 (471) Q Consensus 8 ~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~~~~ 87 (471) .|.|.-|.... .....+++| |.+|+++ ++. +++|...+ .-.-+.++|||++.+.+++.+..++-+++.+++. T Consensus 3 lDCEMv~t~~g-~~LaRvsvV--d~~g~vl--~D~-~V~P~~~V--~DyrT~~SGIt~~~l~~a~~~~~~~~~~l~~li~ 74 (150) T cd06145 3 LDCEMCYTTDG-LELTRVTVV--DENGKVV--LDE-LVKPDGEI--VDYNTRFSGITEEMLENVTTTLEDVQKKLLSLIS 74 (150) T ss_pred CCCEEEEECCC-CEEEEEEEE--CCCCCEE--EEE-EECCCCCC--EECCCCCCCCCHHHHHCCCCCHHHHHHHHHHHCC T ss_conf 61367752799-889999999--3899898--774-33499772--1320104578988985465889999999998759 Q ss_pred CCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH- Q ss_conf 1796599950557899999999985078532132568871001588886555653210024444454433108989987- Q gi|254780661|r 88 VPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLALA- 166 (471) Q Consensus 88 ~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la~~- 166 (471) .++++|||. +..|...|+-. .| ...|+-. ++|.. .......+|-.||.. T Consensus 75 -~~~IlVGH~-L~nDL~aL~i~------H~---------~v~DTa~-------l~~~~------~~~~~~~sLk~L~~~~ 124 (150) T cd06145 75 -PDTILVGHS-LENDLKALKLI------HP---------RVIDTAI-------LFPHP------RGPPYKPSLKNLAKKY 124 (150) T ss_pred -CCCEEEECC-CCCCHHHHHCC------CC---------CEEEHHH-------HHHHC------CCCCCCHHHHHHHHHH T ss_conf -998899626-43477776155------87---------6157166-------21103------7999897899999998 Q ss_pred HCCCCC---CCCHHHHHHHHHHHHHH Q ss_conf 188764---31013331899999999 Q gi|254780661|r 167 NGIEHV---NAHDAKADVYATLALVW 189 (471) Q Consensus 167 n~i~~~---~aH~A~~D~~~t~~l~k 189 (471) .|.+.. +.|++..||.||++|.| T Consensus 125 L~~~IQ~~~~gHdsvEDA~aam~L~k 150 (150) T cd06145 125 LGRDIQQGEGGHDSVEDARAALELVK 150 (150) T ss_pred HCHHHCCCCCCCCCHHHHHHHHHHHC T ss_conf 28774399988183999999999759 No 66 >pfam04857 CAF1 CAF1 family ribonuclease. The major pathways of mRNA turnover in eukaryotes initiate with shortening of the polyA tail. CAF1 encodes a critical component of the major cytoplasmic deadenylase in yeast. Both Caf1p is required for normal mRNA deadenylation in vivo and localizes to the cytoplasm. Caf1p copurifies with a Ccr4p-dependent polyA-specific exonuclease activity. Some members of this family include and inserted RNA binding domain pfam01424. This family of proteins is related to other exonucleases pfam00929 (Bateman A pers. obs.). The crystal structure of Saccharomyces cerevisiae Pop2 has been resolved at 2.3 Angstrom#resolution. Probab=98.59 E-value=5.3e-06 Score=57.34 Aligned_cols=173 Identities=19% Similarity=0.180 Sum_probs=111.8 Q ss_pred CCEEEEECCCCCCCCC---------------------CCEEEEEEEEEECCCCEECCEEEEE----EEC-CCCCCCCCH- Q ss_conf 8379997527889877---------------------2604899999983897280400226----621-788877857- Q gi|254780661|r 3 NHFVIYDYETFGRDVA---------------------LDRPAQFAGVRVDRQWEKIESTEVF----FCK-PADDYLPDP- 55 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~---------------------~~~iiq~~~i~~d~~~~~~~~~~~~----~~~-~~~~~~p~p- 55 (471) -.||-.|+|=+|+... .=+|||+|.-..|.+++......++ ..- +..+ .-++ T Consensus 22 ~~fVaiDtEFpG~v~~p~~~~~~t~~~~Y~~lk~nVd~l~iiQlGlt~~~~~g~~p~~~~~wqfNf~~F~~~~d-~~~~~ 100 (235) T pfam04857 22 YDFIAIDTEFPGVVARPIGSFRSTSDYRYQALRKNVDRLKIIQLGLTLFDEKGNLPDSYYTWQFNFSLFNLEED-FYAPS 100 (235) T ss_pred CCEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCC-CCCHH T ss_conf 99899951427524689888889878999999843001231468988874688789874378501556797322-11688 Q ss_pred --HHHHHHCCCHHHHHHCCCCHHHHHHHH--HHHHHC-CCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf --898985799889960799979999999--999511-796599950557899999999985078532132568871001 Q gi|254780661|r 56 --EAVFITGITPQKALRDGVVEAEFSRRI--HQFFSV-PNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWD 130 (471) Q Consensus 56 --~a~~v~git~~~~~~~~~~~~e~~~~i--~~~~~~-~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D 130 (471) +-+.-+|+.=..+...|++...+.+.+ ...+.. .+..||+||+ .+|..+|-+.|+..-+|. ..-+ T Consensus 101 Si~fL~~~G~DF~~~~~~GI~~~~f~e~l~~S~lv~~~~~~~wv~f~g-~yD~~yl~k~l~g~~LP~---------~~~e 170 (235) T pfam04857 101 SIEFLAKQGFDFNKHRREGIPYLRFAELLGSSGLFLNISSVTWVTFHG-LYDFGYLLKLLTGGPLPE---------TLDD 170 (235) T ss_pred HHHHHHHCCCCHHHHHHHCCCHHHHHHHHHHHHHEECCCCCEEEEECC-HHHHHHHHHHHCCCCCCC---------CHHH T ss_conf 999999859888999870999899998998703020579987998172-778999999974998999---------8999 Q ss_pred HHHHHHHHHHHHHHHCCCCCC------CCCCCCCCHHHHHHHHCCCC--CCCCHHHHHHHHHHHHHH Q ss_conf 588886555653210024444------45443310898998718876--431013331899999999 Q gi|254780661|r 131 LLDVMRAIYAFSPDGIQWPSR------DDGATSFKLQDLALANGIEH--VNAHDAKADVYATLALVW 189 (471) Q Consensus 131 ~l~~~~~~~~~~p~~~~~~~~------~~~~~~~kL~~la~~n~i~~--~~aH~A~~D~~~t~~l~k 189 (471) ....++ .+.|..+..-.- ..+..+..|+.+|...|+.- +.+|.|-+|+..|..++- T Consensus 171 F~~~l~---~~FP~vyD~K~la~~~~~~~~~~~~gL~~lA~~L~v~r~~g~~HqAGsDSllT~~~F~ 234 (235) T pfam04857 171 FLELLR---ELFPRVYDTKYLAKFCFELDGGSNGGLQELADLLGVNRRVGRLHQAGSDSLLTALVFF 234 (235) T ss_pred HHHHHH---HHCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 999999---9777175099999852213587557899999974997888986665529999998761 No 67 >KOG2249 consensus Probab=98.57 E-value=2.1e-06 Score=59.94 Aligned_cols=162 Identities=16% Similarity=0.196 Sum_probs=102.9 Q ss_pred CEEEEECCCCCCCCC--CCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 379997527889877--260489999998389728040022662178887785789898579988996079997999999 Q gi|254780661|r 4 HFVIYDYETFGRDVA--LDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 4 ~~v~~D~ETtG~~~~--~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) .+|-.|.|--|.-|. .+-.-.+. .+|..|+++ ++. +++|...+ --.-+.|+||.|+.+.++ .++..+=.+ T Consensus 106 r~vAmDCEMVG~Gp~G~~s~lARvS--IVN~~G~Vv--yDk-yVkP~~~V--tDyRT~vSGIrpehm~~A-~pf~~aQ~e 177 (280) T KOG2249 106 RVVAMDCEMVGVGPDGRESLLARVS--IVNYHGHVV--YDK-YVKPTEPV--TDYRTRVSGIRPEHMRDA-MPFKVAQKE 177 (280) T ss_pred EEEEEEEEEECCCCCCCCEEEEEEE--EEECCCCEE--EEE-ECCCCCCC--CCCEEEECCCCHHHHCCC-CCHHHHHHH T ss_conf 3899740475147997600034788--861468676--555-03788663--310332236587785157-538999999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) |.+++. |.++|||- +.-|...|....=+.+.-. -++.-.| .+. ....+.| +|- T Consensus 178 v~klL~--gRIlVGHa-LhnDl~~L~l~hp~s~iRD--------Ts~~~pl-------------~k~-~~~~~tp--SLK 230 (280) T KOG2249 178 VLKLLK--GRILVGHA-LHNDLQALKLEHPRSMIRD--------TSKYPPL-------------MKL-LSKKATP--SLK 230 (280) T ss_pred HHHHHH--CCEEECCC-CCCHHHHHHHHCCHHHHCC--------CCCCCHH-------------HHH-HHCCCCC--CHH T ss_conf 999970--87775122-0007999855374355314--------3127548-------------777-5215983--199 Q ss_pred HHHHH-HCCCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89987-188764-31013331899999999998642268999996 Q gi|254780661|r 162 DLALA-NGIEHV-NAHDAKADVYATLALVWLIREKKPKLFEYLYD 204 (471) Q Consensus 162 ~la~~-n~i~~~-~aH~A~~D~~~t~~l~k~i~~~~p~~~~~~l~ 204 (471) .|+++ .|++.. +-|+...||.||++|.++++ .-|+.... T Consensus 231 ~Lt~~~Lg~~IQ~GeHsSvEDA~AtM~LY~~vk----~qwe~~~~ 271 (280) T KOG2249 231 KLTEALLGKDIQVGEHSSVEDARATMELYKRVK----VQWEKIEA 271 (280) T ss_pred HHHHHHHCHHHHCCCCCCHHHHHHHHHHHHHHH----HHHHHHHH T ss_conf 999998452543156685889999999999999----99999861 No 68 >COG3359 Predicted exonuclease [DNA replication, recombination, and repair] Probab=98.57 E-value=2.7e-06 Score=59.23 Aligned_cols=150 Identities=20% Similarity=0.310 Sum_probs=86.6 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ++++|||+||||++++-+.|+-.|.-.. .+ +++.+ | ..-.|.|+ .|++ -+ T Consensus 98 e~~~FFDiETTGL~~ag~~I~~~g~a~~-~~----~~~~V---r--q~~lp~p~-------------------~E~a-vl 147 (278) T COG3359 98 EDVAFFDIETTGLDRAGNTITLVGGARG-VD----DTMHV---R--QHFLPAPE-------------------EEVA-VL 147 (278) T ss_pred CCEEEEEEECCCCCCCCCEEEEEEEEEC-CC----CEEEE---E--EECCCCCC-------------------HHHH-HH T ss_conf 2258996331466888985999988873-57----56999---7--51279841-------------------3568-99 Q ss_pred HHHHHCCC-CEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99951179-65999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 83 HQFFSVPN-TCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 83 ~~~~~~~~-~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) ..+...++ .++|.||+..||.+|+++ +-+..++ .+..+.++|++..+|..+-- --+..+|. T Consensus 148 e~fl~~~~~~~lvsfNGkaFD~PfikR-~v~~~~e-----l~l~~~H~DL~h~~RRlwk~------------~l~~c~Lk 209 (278) T COG3359 148 ENFLHDPDFNMLVSFNGKAFDIPFIKR-MVRDRLE-----LSLEFGHFDLYHPSRRLWKH------------LLPRCGLK 209 (278) T ss_pred HHHHCCCCCCEEEEECCCCCCCHHHHH-HHHCCCC-----CCCCCCCHHHHHHHHHHHHC------------CCCCCCHH T ss_conf 997538984559974683267089999-8751644-----47666631221034453120------------28788813 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 899871887643101333189999999999864226899999 Q gi|254780661|r 162 DLALANGIEHVNAHDAKADVYATLALVWLIREKKPKLFEYLY 203 (471) Q Consensus 162 ~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~~~~~~l 203 (471) .+-+..||.-+.-- -+..+-.....-++...|.+-.-.+ T Consensus 210 ~VEr~LGi~R~edt---dG~~~p~lyr~~~~~~dp~ll~~l~ 248 (278) T COG3359 210 TVERILGIRREEDT---DGYDGPELYRLYRRYGDPGLLDGLV 248 (278) T ss_pred HHHHHHCCCCCCCC---CCCCHHHHHHHHHHCCCHHHHHHHH T ss_conf 67888575501158---9850199999998709877778988 No 69 >cd05160 DNA_polB_exo The 3'-5' exonuclease domain of family-B DNA polymerases. This domain has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The exonuclease domain of family B polymerase also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members include E. coli DNA polymerase II, some eubacterial phage DNA polymerases, nuclear replicative DNA polymerases (alpha, delta, epsilon and zeta), and eukaryotic viral and Probab=98.52 E-value=1.6e-06 Score=60.65 Aligned_cols=141 Identities=23% Similarity=0.145 Sum_probs=87.8 Q ss_pred EEEECCCCCC----CCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 9997527889----877260489999998389728040022662178887785789898579988996079997999999 Q gi|254780661|r 6 VIYDYETFGR----DVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 6 v~~D~ETtG~----~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) +-||+||..- +|..|.|++|+.... .++ ....+... ............+.|. .+..- .+|.+++.. T Consensus 2 lsfDIE~~~~~gfP~p~~d~Ii~Is~~~~-~~~----~~~~~~~~-~~~~~~~~~~~~~~~~---~v~~~-~~E~eLL~~ 71 (200) T cd05160 2 LSFDIETLPNVGGPEPSRDPIICISYADS-FQG----HKVVFLLK-VSTHGDDIEAGFIDGI---EVEYF-DDEKELLKR 71 (200) T ss_pred EEEEEEECCCCCCCCCCCCEEEEEEEEEE-CCC----EEEEEEEE-CCCCCCCCCCCCCCCE---EEEEE-CCHHHHHHH T ss_conf 79998327999978999880899999993-599----99998864-3556767654677996---99996-999999999 Q ss_pred HHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCCCCC---------------CCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9999511796-599950557899999999985078532132---------------568871001588886555653210 Q gi|254780661|r 82 IHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSYRWS---------------WDNGNSRWDLLDVMRAIYAFSPDG 145 (471) Q Consensus 82 i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~---------------~~~~~~~~D~l~~~~~~~~~~p~~ 145 (471) +.+++....| +++|||+.+||.++|..=.-++++++.... ...|...+|++.+++... T Consensus 72 F~~~i~~~dPDii~GyN~~~FD~pyL~~R~~~l~~~~~~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~k~~~------ 145 (200) T cd05160 72 FFDIIREYDPDILTGYNIDDFDLPYLLERAKALGIKLDDIGRRSGQEKSSGETERAAVKGRVVFDLLRAVKRDF------ 145 (200) T ss_pred HHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCEEEEEECCEEEEHHHHHHHHHH------ T ss_conf 99999970999999726777848899999999698855546734545435750599736778602899999753------ Q ss_pred CCCCCCCCCCCCCCHHHHHHHHCCC Q ss_conf 0244444544331089899871887 Q gi|254780661|r 146 IQWPSRDDGATSFKLQDLALANGIE 170 (471) Q Consensus 146 ~~~~~~~~~~~~~kL~~la~~n~i~ 170 (471) .-+||+|+++|+...-. T Consensus 146 --------~l~sy~Ld~Va~~~Lg~ 162 (200) T cd05160 146 --------KLKSYTLDAVAEELLGE 162 (200) T ss_pred --------CCCCCCHHHHHHHHHCC T ss_conf --------24665399999998499 No 70 >cd06126 DEDDy DEDDy exonucleases, part of the DnaQ-like (or DEDD) exonuclease superfamily, catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. They contain four invariant acidic residues in three conserved sequence motifs termed ExoI, ExoII and ExoIII. DEDDy exonucleases are classified as such because of the presence of a specific YX(3)D pattern at ExoIII. The four conserved acidic residues serve as ligands for the two metal ions required for catalysis. DEDDy exonucleases include the proofreading domains of DNA polymerases (families A and B), as well as RNases such as RNase D and Rrp6p. The Egalitarian (Egl) and Bacillus-like DNA Polymerase I subfamilies do not possess a completely conserved YX(3)D pattern at the ExoIII motif. In addition, Bacillus-like DNA polymerases contain inactive 3'-5' exonuclease domains which do not possess the metal-binding residues necessary for activity. Probab=98.46 E-value=7.1e-06 Score=56.52 Aligned_cols=130 Identities=19% Similarity=0.017 Sum_probs=83.4 Q ss_pred EEEECCCCCCCC--CCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 999752788987--726048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 6 VIYDYETFGRDV--ALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 6 v~~D~ETtG~~~--~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) +-||+||++... ..|.|+.++.. +..+...+ +.+ ..-..+... . .++.++...+. T Consensus 2 l~fDiE~~~~~~~~~~~pI~~i~~~-~~~g~~~~-----~~~--~~~~~~~~~--------------~-~~E~~ll~~f~ 58 (181) T cd06126 2 VAFDIETTPLVGGGGKGPIILIQLA-VSAGEKFL-----LDV--LTLTLDAVD--------------S-ADEKELLGELK 58 (181) T ss_pred EEEEEECCCCCCCCCCCCEEEEEEE-ECCCCEEE-----EEE--CCCCCCCCC--------------C-CCHHHHHHHHH T ss_conf 2898623798999899988999999-65997899-----986--243467556--------------7-89999999999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCC----------CCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9951179659995055789999999998507853213----------256887100158888655565321002444445 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRW----------SWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDD 153 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~----------~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~ 153 (471) +++++...+++|||+.+||.++|..-+-..+.++... .+..|...+|++..++... . T Consensus 59 ~~i~~~~dii~GyN~~~FDlpyL~~R~~~l~~~~~~~~~~~~~~~~~~~~~gr~~~D~~~~~~~~~-------------~ 125 (181) T cd06126 59 AILEDYDDIKVGYNIDSFDLPYLLNRAGELGIPLRRGRDRSSGSREEINIKGRILFDLLLAAKRLG-------------V 125 (181) T ss_pred HHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCCHHCCCCCCCCCCEEEECCCCCCCHHHHHHHHC-------------C T ss_conf 998308998999799778799999999995897010226665655048634852321999999841-------------3 Q ss_pred CCCCCCHHHHHHHHCCCC Q ss_conf 443310898998718876 Q gi|254780661|r 154 GATSFKLQDLALANGIEH 171 (471) Q Consensus 154 ~~~~~kL~~la~~n~i~~ 171 (471) .-++++|+.+|+..+-.. T Consensus 126 ~~~~~~L~~va~~~lg~~ 143 (181) T cd06126 126 ALPSYSLDALAEQLLGEG 143 (181) T ss_pred CCCCCCHHHHHHHHCCCC T ss_conf 646535999999974999 No 71 >PTZ00315 2'-phosphotransferase; Provisional Probab=98.35 E-value=2.9e-05 Score=52.60 Aligned_cols=173 Identities=14% Similarity=0.084 Sum_probs=113.5 Q ss_pred CCEEEEECCCCCCC---CCCCEEEEEEEEEECC-CCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHH Q ss_conf 83799975278898---7726048999999838-9728040022662178887785789898579988996079997999 Q gi|254780661|r 3 NHFVIYDYETFGRD---VALDRPAQFAGVRVDR-QWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEF 78 (471) Q Consensus 3 ~~~v~~D~ETtG~~---~~~~~iiq~~~i~~d~-~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~ 78 (471) ..|++.|+|.|--. ....+|||+-++++|. .+.+..+|.. +|||.-.-+.+-.-..+||||.+++-+++ .+.++ T Consensus 56 D~y~VLDFEATCe~~~r~~~~EIIEFPvv~Vdart~~~~aEFh~-YVRPv~nPvLS~FCTELTGItQs~VD~ap-tFP~V 133 (570) T PTZ00315 56 DAYVVLDFEATCERDRRLEDPEVIEFPMVLVDARTATPVAEFQR-YVRPVVNPVLSRFCTELTGITQSVVDRAD-TFPVV 133 (570) T ss_pred CEEEEEEECCCCCCCCCCCCCCEEEEEEEEEECCCCHHHHHHHH-HCCCCCCCCHHHHHHHHHCCCHHHHCCCC-CCHHH T ss_conf 65799730020012556788642560279985353400234454-14776477278777773175888763788-50899 Q ss_pred HHHHHHHHHCCC--------C-EEEEEECHHHHHHH-HHHHHHHHC-------CCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 999999951179--------6-59995055789999-999998507-------853213256887100158888655565 Q gi|254780661|r 79 SRRIHQFFSVPN--------T-CIIGYNNIRFDDYY-SRNIFYRNF-------YDSYRWSWDNGNSRWDLLDVMRAIYAF 141 (471) Q Consensus 79 ~~~i~~~~~~~~--------~-~~ig~N~~~FD~~~-lr~~~~r~~-------~~~y~~~~~~~~~~~D~l~~~~~~~~~ 141 (471) .+.+.+++...+ . |+|+-. .||..- +-...+-.+ .|++.. ||--+.-.... .+ T Consensus 134 l~~f~~fL~engLgd~~p~~~f~~VTcG--dwDLKtmlp~Qc~~~~~~g~~i~vP~~f~-------rW~NlKk~~~~-~~ 203 (570) T PTZ00315 134 FCAALQFLAEAGLGEAPPMRSYLVVTCG--DWDLKTMLPAQMRVSGQQGTTVNVPLSFR-------RWCNLKKYMSQ-LS 203 (570) T ss_pred HHHHHHHHHHCCCCCCCCCEEEEEEECC--CCHHHHHHHHHHHHHHHCCCCCCCCHHHH-------HHHHHHHHHHH-HC T ss_conf 9999999987487767764059999617--60687775999888650476423768999-------99889999987-22 Q ss_pred HHHHCCCCCCCC-CCCCCCHHHHHHHHCCCCC-CCCHHHHHHHHHHHHHHHHH Q ss_conf 321002444445-4433108989987188764-31013331899999999998 Q gi|254780661|r 142 SPDGIQWPSRDD-GATSFKLQDLALANGIEHV-NAHDAKADVYATLALVWLIR 192 (471) Q Consensus 142 ~p~~~~~~~~~~-~~~~~kL~~la~~n~i~~~-~aH~A~~D~~~t~~l~k~i~ 192 (471) .+. .... ...+++|-++....|+++. ..|..+.||.--+.++.-+. T Consensus 204 ~~~-----~~~~ps~~p~~m~dML~~L~Lp~~GRhHSGIDDcRNIA~IL~ELl 251 (570) T PTZ00315 204 GCG-----LGATPPLGISDLPDMLQMLGLPLKGRHHSGIDDCRNIAAVLCELL 251 (570) T ss_pred CCC-----CCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 666-----666766687557999986699867875788513688999999999 No 72 >cd05780 DNA_polB_Kod1_like_exo The 3'-5' exonuclease domain of archaeal family-B DNA polymerases with similarity to Pyrococcus kodakaraensis Kod1, including polymerases from Desulfurococcus (D. Tok Pol) and Thermococcus gorgonarius (Tgo Pol). Kod1, D. Tok Pol, and Tgo Pol are thermostable enzymes that exhibit both polymerase and 3'-5' exonuclease activities. They are family-B DNA polymerases. Their amino termini harbor a DEDDy-type DnaQ-like 3'-5' exonuclease domain that contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members of this subfamily show similarity to eukaryotic DNA polymerases involved in DNA replication. Some archaea possess multiple family-B D Probab=98.32 E-value=1.5e-05 Score=54.43 Aligned_cols=127 Identities=20% Similarity=0.124 Sum_probs=81.6 Q ss_pred CEEEEECCCCC----CCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHH Q ss_conf 37999752788----98772604899999983897280400226621788877857898985799889960799979999 Q gi|254780661|r 4 HFVIYDYETFG----RDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFS 79 (471) Q Consensus 4 ~~v~~D~ETtG----~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~ 79 (471) +.+.||+||.. -+|..|.|++|+.. +..+.. + ..++... .|. |.. =.+|.|++ T Consensus 4 kilsfDIE~~~~~g~P~p~~d~Ii~Is~~--~~~~~~---v--i~~~~~~----~~~---v~~---------~~tE~eLL 60 (195) T cd05780 4 KILSFDIEVLNHEGEPNPEKDPIIMISFA--DEGGNK---V--ITWKKFD----LPF---VEV---------VKTEKEMI 60 (195) T ss_pred CEEEEEEEECCCCCCCCCCCCCEEEEEEE--ECCCCE---E--EEECCCC----CCC---EEE---------ECCHHHHH T ss_conf 29999999758999989888828999999--689978---9--9817988----774---567---------38999999 Q ss_pred HHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCC----------------CCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 999999511796-599950557899999999985078532----------------132568871001588886555653 Q gi|254780661|r 80 RRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSY----------------RWSWDNGNSRWDLLDVMRAIYAFS 142 (471) Q Consensus 80 ~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y----------------~~~~~~~~~~~D~l~~~~~~~~~~ 142 (471) ..+.+++....| +++|||+.+||.+++..=.-+.+.+.- ......|...+|++.+++.. T Consensus 61 ~~F~~~i~~~dPDii~GyN~~~FD~pYl~~R~~~~~~~~~~gr~~~~~~~~~~g~~~~~~i~Gr~~lD~~~~~~~~---- 136 (195) T cd05780 61 KRFIEIVKEKDPDVIYTYNGDNFDFPYLKKRAEKLGIELDLGRDGSEIKIQRGGFNNASEIKGRIHVDLYPVARRT---- 136 (195) T ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCEEEEECCEEEEEHHHHHHHH---- T ss_conf 9999999872999999579878768999999999589864578886047997785079987686885549999752---- Q ss_pred HHHCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 2100244444544331089899871 Q gi|254780661|r 143 PDGIQWPSRDDGATSFKLQDLALAN 167 (471) Q Consensus 143 p~~~~~~~~~~~~~~~kL~~la~~n 167 (471) -.-+||+|+++|+.. T Consensus 137 ----------~~l~sy~L~~Va~~~ 151 (195) T cd05780 137 ----------LNLTRYTLERVYEEL 151 (195) T ss_pred ----------CCCCCCCHHHHHHHH T ss_conf ----------562401599999998 No 73 >cd05777 DNA_polB_delta_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase delta. DNA polymerase delta is a family-B DNA polymerase with a catalytic subunit that contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (alpha and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase delta is the enzyme responsible for both elongation and maturation of Okazaki fragments on the lagging strand. It is also implicated in mismatch repair (MMR) and base excision repair (BER). The catalytic subunit displays both polymerase and 3'-5' exonuclease activities. The exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues necessary for metal binding and catalysis. The exonuclease domain of family B polyme Probab=98.32 E-value=2.9e-05 Score=52.62 Aligned_cols=137 Identities=21% Similarity=0.237 Sum_probs=80.6 Q ss_pred CCEEEEECCCCC-----CCCCCCEEEEEEEEEECCC-CEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHH Q ss_conf 837999752788-----9877260489999998389-7280400226621788877857898985799889960799979 Q gi|254780661|r 3 NHFVIYDYETFG-----RDVALDRPAQFAGVRVDRQ-WEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEA 76 (471) Q Consensus 3 ~~~v~~D~ETtG-----~~~~~~~iiq~~~i~~d~~-~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~ 76 (471) -+.+.||+||.. -+|..|.|++|+.+..... .....+. . ++- .+ .-|.| |. .+..- .+|. T Consensus 7 lki~sfDIE~~s~~g~fP~p~~D~Ii~I~~~~~~~~~~~~~~~~-v-~~l-~~-~~~i~------~~---~V~~~-~tE~ 72 (230) T cd05777 7 LRILSFDIECAGRKGVFPEPEKDPVIQIANVVTRQGEGEPFIRN-I-FTL-KT-CAPIV------GA---QVFSF-ETEE 72 (230) T ss_pred CEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEECCCCCCEEEE-E-EEE-CC-CCCCC------CC---EEEEC-CCHH T ss_conf 04999999952699999898888348999999827977604787-8-982-68-78889------98---79984-9999 Q ss_pred HHHHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCCC-----------C---C--------------CCCCCC Q ss_conf 999999999511796-5999505578999999999850785321-----------3---2--------------568871 Q gi|254780661|r 77 EFSRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSYR-----------W---S--------------WDNGNS 127 (471) Q Consensus 77 e~~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~-----------~---~--------------~~~~~~ 127 (471) |++..+.+++...+| +++|||+.+||.++|..=....+.+... . . ...|.. T Consensus 73 eLL~~F~~~i~~~dPDiitGyN~~~FD~pYL~~Ra~~l~~~~~~~lgr~~~~~~~~~~~~~~s~~~g~~~~~~~~i~Gr~ 152 (230) T cd05777 73 ELLLAWRDFVQEVDPDIITGYNICNFDLPYLLERAKALKLNTFPFLGRIKNIKSTIKDTTFSSKQMGTRETKEINIEGRI 152 (230) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHCCCCHHHHCCCCCCCCEEECCCCCCCCCCCCEEEEEEECCEE T ss_conf 99999999999809999996366678878999999996886145417667876357416522231264002699875875 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 0015888865556532100244444544331089899871 Q gi|254780661|r 128 RWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLALAN 167 (471) Q Consensus 128 ~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la~~n 167 (471) .+|++..+|.-+. -.||+|+++|... T Consensus 153 ~lD~~~~~r~~~~--------------l~sY~L~~Va~~~ 178 (230) T cd05777 153 QFDLLQVIQRDYK--------------LRSYSLNSVSAHF 178 (230) T ss_pred HHHHHHHHHHHCC--------------HHHCCHHHHHHHH T ss_conf 9999999997501--------------6336499999998 No 74 >pfam10108 Exon_PolB Predicted 3'-5' exonuclease related to the exonuclease domain of PolB. This domain is found in various prokaryotic 3'-5' exonucleases and hypothetical proteins. Probab=98.23 E-value=3.2e-05 Score=52.29 Aligned_cols=116 Identities=16% Similarity=0.145 Sum_probs=80.4 Q ss_pred CCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHCCCCEEEE Q ss_conf 87726048999999838972804002266217888778578989857998899607999799999999995117965999 Q gi|254780661|r 16 DVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQFFSVPNTCIIG 95 (471) Q Consensus 16 ~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~~~~~~~~~~ig 95 (471) ...+.+||-|++++.|.++.. ++.++ | +...+|.++++.+.+.+....|.+|+ T Consensus 5 p~~lhriVaIs~v~~~~~~~~--~v~sl------------------g-------~~~~~E~~ll~~F~~~i~~~~P~LVs 57 (211) T pfam10108 5 PLPLHRIVAISAVIADDDGGF--KVWSL------------------G-------DPESSEKELIQRFFDGIEKYTPQLVS 57 (211) T ss_pred CCCCEEEEEEEEEEEECCCCE--EEEEC------------------C-------CCCCCHHHHHHHHHHHHHHCCCEEEE T ss_conf 756501699999999669927--99878------------------8-------99989999999999999843982886 Q ss_pred EECHHHHHHHHHHHHHHHCCCCCCCCCCC------C-------CCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 50557899999999985078532132568------8-------7100158888655565321002444445443310898 Q gi|254780661|r 96 YNNIRFDDYYSRNIFYRNFYDSYRWSWDN------G-------NSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 96 ~N~~~FD~~~lr~~~~r~~~~~y~~~~~~------~-------~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) ||+.+||.++|..-..++++....+-.+. + ..++|+++.+... .++..-+|+. T Consensus 58 ~NG~gFDlPvL~~RAl~~gi~ap~~~~~~~~~~kw~nY~~Ry~~~H~DLmd~l~~~--------------g~~~~~~Ld~ 123 (211) T pfam10108 58 FNGRGFDLPVLHYRALKHGVSAPRYFDTGDRDFKWNNYFNRYSTRHLDLMDLLSGY--------------GARANAPLDE 123 (211) T ss_pred ECCCCCCHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHC--------------CCCCCCCHHH T ss_conf 56865548999999998188850787567876654441123565445489998732--------------8867789899 Q ss_pred HHHHHCCCCC Q ss_conf 9987188764 Q gi|254780661|r 163 LALANGIEHV 172 (471) Q Consensus 163 la~~n~i~~~ 172 (471) +|...|++.. T Consensus 124 ~a~l~G~PGK 133 (211) T pfam10108 124 VAALLGLPGK 133 (211) T ss_pred HHHHCCCCCC T ss_conf 9998199987 No 75 >cd06128 DNA_polA_exo The 3'-5' exonuclease domain of family-A DNA polymerases has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-A DNA polymerases contain a DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-B DNA polymerases. The exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, which are clustered around the active site and contain four invariant acidic residues that serve as ligands for the two metal ions required for catalysis. The Klenow fragment (KF) of Escherichia coli pol I, the Thermus aquaticus (Taq) pol I, and Bacillus stearothermophilus (BF) pol I are examples of family-A DNA polymerases. They are involved in nucleotide excision repair and in the processing of Okazaki fragments that are generated during lagging strand synthesis. The N-terminal domains of BF pol I and Taq pol I resemble the fold of the 3'-5' exonuclease domain of KF Probab=98.22 E-value=3.7e-05 Score=51.87 Aligned_cols=139 Identities=15% Similarity=0.112 Sum_probs=82.2 Q ss_pred EEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHH Q ss_conf 79997527889877260489999998389728040022662178887785789898579988996079997999999999 Q gi|254780661|r 5 FVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQ 84 (471) Q Consensus 5 ~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~ 84 (471) .+-||+||||++|...+||=++.- ++.+ ..+++-+....... +.. .+.-++...+.. T Consensus 3 ~~a~DtETt~ld~~~a~iVGis~s-~~~~-------~a~YiP~~h~~~~~----------~~~-----l~~~~vl~~L~p 59 (166) T cd06128 3 VFAFDTETDSLDNISANLVGLSFA-IEPG-------VAAYIPVAHDYLDA----------PDQ-----ISRERALELLKP 59 (166) T ss_pred EEEEECCCCCCCCCCCEEEEEEEE-CCCC-------EEEEEECCCCCCCC----------HHH-----HHHHHHHHHHHH T ss_conf 899978879989043549999998-0089-------69999526776675----------257-----789999999999 Q ss_pred HHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH Q ss_conf 95117965999505578999999999850785321325688710015888865556532100244444544331089899 Q gi|254780661|r 85 FFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLA 164 (471) Q Consensus 85 ~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la 164 (471) +|.++...-|||| +.||..+|+ |+++.. .+-.+|+|-.++ .+.|+ ..++.|+.|| T Consensus 60 ~led~~i~KighN-lK~D~~vL~----~~Gi~l-------~g~~fDTMlaay---lldp~----------~~~h~Ld~La 114 (166) T cd06128 60 LLEDEKALKVGQN-LKYDRGILA----NYGIEL-------RGIAFDTMLESY---ILNSV----------AGRHDMDSLA 114 (166) T ss_pred HHCCCCCCEEEEC-HHHHHHHHH----HCCCCC-------CCCCCCHHHHHH---HCCCC----------CCCCCHHHHH T ss_conf 9739997645311-566799996----199668-------999845767987---62899----------9889969999 Q ss_pred HHHC-C---------------------CCCC-CCHHHHHHHHHHHHHHHH Q ss_conf 8718-8---------------------7643-101333189999999999 Q gi|254780661|r 165 LANG-I---------------------EHVN-AHDAKADVYATLALVWLI 191 (471) Q Consensus 165 ~~n~-i---------------------~~~~-aH~A~~D~~~t~~l~k~i 191 (471) +..- . +.+. +.=|-.|+.+|.+|.+.+ T Consensus 115 ~~~L~~~~i~~~dl~GkGk~q~~f~~v~i~~~~~YA~eDAd~t~rL~~~l 164 (166) T cd06128 115 ERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAEDADVTLQLHLKM 164 (166) T ss_pred HHHCCCCCEEHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99839987059998378857589322899999998999999999999853 No 76 >PRK05762 DNA polymerase II; Reviewed Probab=98.17 E-value=8.7e-05 Score=49.53 Aligned_cols=149 Identities=17% Similarity=0.107 Sum_probs=89.3 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 37999752788987726048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) ..+-||+|++.. +.|++||. ...+...+ + + .+.+ .+ ...+.. -.+|.+++..+. T Consensus 156 rvlSfDIEc~~~----g~ii~I~l--~~~~~~~v--~-~-~g~~------~~---------~~~v~~-~~~E~eLL~~F~ 209 (785) T PRK05762 156 KVVSLDIETSNK----GELYSIGL--EGCGQRQV--Y-M-LGEA------NG---------DAELEY-VADEKALLERFN 209 (785) T ss_pred EEEEEEEEECCC----CCEEEEEE--CCCCCCEE--E-E-ECCC------CC---------CCEEEE-ECCHHHHHHHHH T ss_conf 699999897799----98899995--17998468--9-9-6688------99---------853899-099999999999 Q ss_pred HHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCCC--------C-C----------CCCCCCCCCHHHHHHHHHHHHH Q ss_conf 99511796-5999505578999999999850785321--------3-2----------5688710015888865556532 Q gi|254780661|r 84 QFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSYR--------W-S----------WDNGNSRWDLLDVMRAIYAFSP 143 (471) Q Consensus 84 ~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~--------~-~----------~~~~~~~~D~l~~~~~~~~~~p 143 (471) +++.+.+| +++|||..+||.++|..=.-+++.+.-. + . ...|.-.+|++.+++... T Consensus 210 ~~i~~~DPDII~GyNi~~FDlpyL~~Ra~~~gi~l~lgR~~~~~~~~~~~~~~~~~~~~i~GRv~iD~~~~~k~~~---- 285 (785) T PRK05762 210 AWFAEFDPDVIIGWNVVQFDLRLLQERAERYGIPLRLGRGGSELSWRVHGHKRGHFFASVPGRLVLDGIDALKSAF---- 285 (785) T ss_pred HHHHHHCCCEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCEECCCCCCCEEEEEEEEEEEEEEHHHHHHHH---- T ss_conf 9999859998996274787779999999994998546878886533315655660599984289997099998752---- Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHCCCCC---CCC-------------------HHHHHHHHHHHHHHHH Q ss_conf 10024444454433108989987188764---310-------------------1333189999999999 Q gi|254780661|r 144 DGIQWPSRDDGATSFKLQDLALANGIEHV---NAH-------------------DAKADVYATLALVWLI 191 (471) Q Consensus 144 ~~~~~~~~~~~~~~~kL~~la~~n~i~~~---~aH-------------------~A~~D~~~t~~l~k~i 191 (471) -.-+||+|+++|...--+.. ..| =.+.||..+.+|+.++ T Consensus 286 ---------~~l~sYsL~~Va~~~Lge~K~~~~~~~r~~eI~~~~~~~r~~l~~Y~l~Da~L~~~L~~kl 346 (785) T PRK05762 286 ---------YEFKSFSLENVAQELLGEGKAIDNPYDRMDEIDRYFADDKPALAKYNLKDCELVTRIFEKT 346 (785) T ss_pred ---------HCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf ---------0367662999999985445555666666888899885468999999999999999999987 No 77 >PTZ00166 DNA polymerase delta catalytic subunit; Provisional Probab=98.15 E-value=3.7e-05 Score=51.94 Aligned_cols=162 Identities=17% Similarity=0.174 Sum_probs=95.3 Q ss_pred CEEEEECCCCCC------CCCCCEEEEEEEEEECCCCEECCEE-EEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHH Q ss_conf 379997527889------8772604899999983897280400-226621788877857898985799889960799979 Q gi|254780661|r 4 HFVIYDYETFGR------DVALDRPAQFAGVRVDRQWEKIEST-EVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEA 76 (471) Q Consensus 4 ~~v~~D~ETtG~------~~~~~~iiq~~~i~~d~~~~~~~~~-~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~ 76 (471) .++=||+|..|. ++..|-|||||.+++- .|+..... ...++. ..+- .|....+..-+ +|. T Consensus 291 rilSfDIEC~~~~G~gFP~~~~DpVIqIs~~~~~-~g~~~~~~~~~~ftl--~~C~---------~I~g~~V~~F~-tE~ 357 (1081) T PTZ00166 291 RILSFDIECIKLDGKGFPEANNDPVIQISSVLHT-HGDPLDSCKNFIFTL--KECA---------SLAGANVLWFD-NEK 357 (1081) T ss_pred EEEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEE-CCCCCCCCCCEEEEC--CCCC---------CCCCCEEEEEC-CHH T ss_conf 6999867845888998989999957998579881-478887646446674--7877---------88997799709-999 Q ss_pred HHHHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCCC---C-------------------------CCCCCCC Q ss_conf 999999999511796-5999505578999999999850785321---3-------------------------2568871 Q gi|254780661|r 77 EFSRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSYR---W-------------------------SWDNGNS 127 (471) Q Consensus 77 e~~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~---~-------------------------~~~~~~~ 127 (471) +|...+.+++..-++ +++|||..+||.++|-.=.-....+.+. . ....|.- T Consensus 358 eLL~aF~~fi~~~DPDIITGYNI~nFDlpYLl~Ra~~L~i~~f~~lgRlk~~~~~~~~~~~~s~a~G~~~~k~i~i~GRv 437 (1081) T PTZ00166 358 TLLLAWNDFIIAVDPDFLTGYNIINFDLPYLLNRATALNLKKFKFLTRIKSVKSTVKDSIFSSNQFGTHENKEINIEGRI 437 (1081) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCEEEEECCEE T ss_conf 99999999998709989996687888889999999995886433236115776522026566555566655068675789 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH-HCCCCCC-CCHH-------------------HHHHHHHHH Q ss_conf 001588886555653210024444454433108989987-1887643-1013-------------------331899999 Q gi|254780661|r 128 RWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLALA-NGIEHVN-AHDA-------------------KADVYATLA 186 (471) Q Consensus 128 ~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la~~-n~i~~~~-aH~A-------------------~~D~~~t~~ 186 (471) .+|++..++.- -.-.||+|+.+|.. .|-.-++ .|.- +-|+..++. T Consensus 438 ~~Dl~~~v~r~--------------~kL~SYkLntVs~~fLgeqKeDV~~~~I~~l~~~~~e~r~rla~YClkDa~L~lr 503 (1081) T PTZ00166 438 LFDVYDLIRRD--------------YKLKSYSLNYVSFEFLKEQKEDVHYSTINKLQNGNPEDRKRIASYCLKDSILPLR 503 (1081) T ss_pred EEEHHHHHHHH--------------CCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 96469999875--------------5878576888999981887533799999999814937889999998998999999 Q ss_pred HHHHHH Q ss_conf 999998 Q gi|254780661|r 187 LVWLIR 192 (471) Q Consensus 187 l~k~i~ 192 (471) |+.++. T Consensus 504 L~~KL~ 509 (1081) T PTZ00166 504 LIDKLL 509 (1081) T ss_pred HHHHHH T ss_conf 999998 No 78 >cd00007 35EXOc 3'-5' exonuclease. The 35EXOc domain is responsible for the 3'-5' exonuclease proofreading activity of prokaryotic DNA polymerase I (pol I) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in E. coli pol I. 35EXOc is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Probab=98.14 E-value=0.00012 Score=48.54 Aligned_cols=133 Identities=19% Similarity=0.140 Sum_probs=78.6 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 37999752788987726048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) ..+.+|+||||+++..++|.=++ +.+... .+ + + +.... . ......+. T Consensus 1 ~~ia~D~Et~~l~~~~~~i~~iq-l~~~~~-----~~--y-i-~~~~~--~---------------------~~~~~~l~ 47 (155) T cd00007 1 KEVAFDTETTGLNYHRGKLVGIQ-IATAGE-----AA--Y-I-PDELE--L---------------------EEDLEALK 47 (155) T ss_pred CEEEEECCCCCCCCCCCCEEEEE-EEECCC-----EE--E-E-EECCC--C---------------------HHHHHHHH T ss_conf 98999898799977888099999-998993-----59--9-9-81475--5---------------------24699999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 99511796599950557899999999985078532132568871001588886555653210024444454433108989 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) ++|++.....|||| +.||..+|++.+ . ...+..+|++-..+.+ .|. ..+++|+.+ T Consensus 48 ~~l~~~~i~Kv~hn-~k~D~~~L~~~~----~-------~~~~~~~Dt~iaa~ll---~~~----------~~~~~L~~l 102 (155) T cd00007 48 ELLEDEDITKVGHD-AKFDLVVLARDG----I-------ELPGNIFDTMLAAYLL---NPG----------EGSHSLDDL 102 (155) T ss_pred HHHHCCCCCEEEEC-CCHHHHHHHHCC----C-------CCCCCEEHHHHHHHHH---CCC----------CCCHHHHHH T ss_conf 99829997389873-400099998628----6-------0135011499999984---778----------552369999 Q ss_pred HHHHC-CCCC--------------------CCCHHHHHHHHHHHHHHHHHHH Q ss_conf 98718-8764--------------------3101333189999999999864 Q gi|254780661|r 164 ALANG-IEHV--------------------NAHDAKADVYATLALVWLIREK 194 (471) Q Consensus 164 a~~n~-i~~~--------------------~aH~A~~D~~~t~~l~k~i~~~ 194 (471) ++..+ ++.. ...=|..||.+|..|...+.++ T Consensus 103 ~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~~~~YAa~D~~~~~~l~~~l~~~ 154 (155) T cd00007 103 AKEYLGIELDKDEQIYGKGAKTFARPLSEELLEYAAEDADALLRLYEKLLEE 154 (155) T ss_pred HHHHCCCCCCHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9998398767488861776544447998999999999999999999999965 No 79 >pfam03104 DNA_pol_B_exo DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold. Probab=98.13 E-value=1.5e-05 Score=54.37 Aligned_cols=97 Identities=22% Similarity=0.138 Sum_probs=61.0 Q ss_pred CEEEEECCCCCC-----CCCCCEEEEEEEEEECCC-CEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHH Q ss_conf 379997527889-----877260489999998389-72804002266217888778578989857998899607999799 Q gi|254780661|r 4 HFVIYDYETFGR-----DVALDRPAQFAGVRVDRQ-WEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAE 77 (471) Q Consensus 4 ~~v~~D~ETtG~-----~~~~~~iiq~~~i~~d~~-~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e 77 (471) ..+=||+||.|. ++..|.|||||.+..... .....+. . ++ +.+. -|.+. |. .+..- .+|.+ T Consensus 87 rilsfDIEc~~~~g~FP~~~~d~IiqIs~~~~~~g~~~~~~~~-~-~t-l~~~-~~i~~-----~~---~V~~f-~~E~~ 153 (254) T pfam03104 87 RVLSFDIECTSLKGKFPENEKDPIIQISCMLYREGEPDPIEDK-L-FT-LKSC-ALIEI-----GG---EVYIF-PSEKE 153 (254) T ss_pred EEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCEE-E-EE-ECCC-CCCCC-----CC---EEEEE-CCHHH T ss_conf 3999988958889999987898389999898856789863206-9-98-0787-76778-----97---79995-99999 Q ss_pred HHHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHH Q ss_conf 99999999511796-5999505578999999999850 Q gi|254780661|r 78 FSRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRN 113 (471) Q Consensus 78 ~~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~ 113 (471) ++..+.+++....+ +++|||..+||.++|-.=.-.. T Consensus 154 ll~~f~~~i~~~dpdii~gyNi~~FD~~yl~~Ra~~l 190 (254) T pfam03104 154 LLRRFFEFIRQYDPDIITGYNGDNFDWPYILNRAKIL 190 (254) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHH T ss_conf 9999999987519989996678778889999999996 No 80 >KOG0304 consensus Probab=98.09 E-value=0.00012 Score=48.68 Aligned_cols=177 Identities=18% Similarity=0.122 Sum_probs=115.3 Q ss_pred CEEEEECCCCC-------C--------------CCCCCEEEEEEEEEECCCCEECC------EEEEEE-ECCCCCC--CC Q ss_conf 37999752788-------9--------------87726048999999838972804------002266-2178887--78 Q gi|254780661|r 4 HFVIYDYETFG-------R--------------DVALDRPAQFAGVRVDRQWEKIE------STEVFF-CKPADDY--LP 53 (471) Q Consensus 4 ~~v~~D~ETtG-------~--------------~~~~~~iiq~~~i~~d~~~~~~~------~~~~~~-~~~~~~~--~p 53 (471) +||-.|+|=-| . |...=.+||+|.-+.|.+||..+ +|+ +. -.+..+. .- T Consensus 25 ~~IamDTEFPGvv~rp~~~f~s~~d~~Y~~lk~NVd~lklIQlGlTlsd~~Gn~p~~g~~tWqfN-F~dF~~~~D~~a~~ 103 (239) T KOG0304 25 PYIAMDTEFPGVVARPIGTFRSSDDYHYQTLKCNVDNLKLIQLGLTLSDEKGNLPDCGTDTWQFN-FSDFNLEKDMYAQD 103 (239) T ss_pred CEEEECCCCCCEEEECCCCCCCCHHHHHHHHHHCHHHHHHHHEEEEEECCCCCCCCCCCCEEEEE-CCCCCCHHHCCCHH T ss_conf 91475376785466247665677478899997222331243022256536899999988656886-40278433213666 Q ss_pred CHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHH--HCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCH Q ss_conf 578989857998899607999799999999995--117965999505578999999999850785321325688710015 Q gi|254780661|r 54 DPEAVFITGITPQKALRDGVVEAEFSRRIHQFF--SVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDL 131 (471) Q Consensus 54 ~p~a~~v~git~~~~~~~~~~~~e~~~~i~~~~--~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~ 131 (471) |.+-+.-+||.=+...+.|+.-.++...+-.-. .+....+|.+|+ +||...|-..+-+.-+|.- .-|. T Consensus 104 SIElLr~~Gidf~K~~e~GI~~~~F~ellm~sg~v~~~~V~WvTFhs-~YDfgYLlK~Lt~~~LP~~---------~~eF 173 (239) T KOG0304 104 SIELLRRSGIDFEKHREEGIDIEEFAELLMTSGLVLDENVTWVTFHS-GYDFGYLLKILTGKPLPET---------EEEF 173 (239) T ss_pred HHHHHHHCCCCHHHHHHCCCCHHHHHHHHHHHHHHCCCCEEEEEEEC-CCHHHHHHHHHCCCCCCCH---------HHHH T ss_conf 69999984967777987397899999999873250068657998322-1029999999717988601---------8899 Q ss_pred HHHHHHHHHHHHHHCCCCCCCC----CCCCCCHHHHHHHHCCC-CCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 8888655565321002444445----44331089899871887-643101333189999999999864 Q gi|254780661|r 132 LDVMRAIYAFSPDGIQWPSRDD----GATSFKLQDLALANGIE-HVNAHDAKADVYATLALVWLIREK 194 (471) Q Consensus 132 l~~~~~~~~~~p~~~~~~~~~~----~~~~~kL~~la~~n~i~-~~~aH~A~~D~~~t~~l~k~i~~~ 194 (471) ..+++ .+.|..+..-.-.+ -.-...|..+|...|+. .+.+|.|-+|...|+..+.++++. T Consensus 174 ~~~v~---~~fp~vYDiK~l~~~c~~~~l~~GL~~lA~~L~~~RvG~~HqAGSDSlLT~~~F~kl~~~ 238 (239) T KOG0304 174 FEIVR---QLFPFVYDVKYLMKFCEGLSLKGGLQRLADLLGLKRVGIAHQAGSDSLLTARVFFKLKEL 238 (239) T ss_pred HHHHH---HHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCEEECCCCCCCHHHHHHHHHHHHHHC T ss_conf 99999---974026619999976033353037999999858871110323675899999999988741 No 81 >PRK06887 consensus Probab=98.07 E-value=0.0002 Score=47.16 Aligned_cols=158 Identities=17% Similarity=0.131 Sum_probs=92.1 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCC---CCCCH----HHHHHHCCCHHHHHHCCCCH Q ss_conf 837999752788987726048999999838972804002266217888---77857----89898579988996079997 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADD---YLPDP----EAVFITGITPQKALRDGVVE 75 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~---~~p~p----~a~~v~git~~~~~~~~~~~ 75 (471) ...+.+|+||||+++...+++-++... .++.- +++ |-.+ ....| .+...++.+.........+. T Consensus 356 ~~~va~DtEt~~ld~~~a~lVGis~s~--~~g~a------~Yi-Pl~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~ql~~ 426 (954) T PRK06887 356 AKLIAVDTETDSLDYMSANLVGISFAL--ENGEA------CYI-PLAHKQQVAEQPQSDLFAEEAETQTDYLLAPKQLNK 426 (954) T ss_pred CCEEEEEEECCCCCCCCCCEEEEEEEE--CCCCE------EEE-ECCCCCCCCCCCCHHHHHHCCCCHHHHHCCHHHCCH T ss_conf 792799963599871326657899983--58978------998-644421000132002332001001222102321279 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999999995117965999505578999999999850785321325688710015888865556532100244444544 Q gi|254780661|r 76 AEFSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGA 155 (471) Q Consensus 76 ~e~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~ 155 (471) .++...+..+|.++....|||| +.||..+++ +++++. ..-.+|++-.. |.+.|. + T Consensus 427 ~~vl~~Lkplled~~i~KIghN-lK~dl~vL~----~~Gi~l-------~g~~fDTmLAa---YLLdp~---------~- 481 (954) T PRK06887 427 STCLAALKPLLENPDIHKIGQN-IKYDLTIFA----NHGIEL-------QGVAFDTMLES---YTLNST---------G- 481 (954) T ss_pred HHHHHHHHHHHCCCCCCHHHHH-HHHHHHHHH----HCCCCC-------CCCCCCHHHHH---HHHCCC---------C- T ss_conf 9999998998529750222114-788999999----779624-------77651698998---751875---------5- Q ss_pred CCCCHHHHHHHH-CCC---------------------CCC-CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 331089899871-887---------------------643-1013331899999999998642 Q gi|254780661|r 156 TSFKLQDLALAN-GIE---------------------HVN-AHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 156 ~~~kL~~la~~n-~i~---------------------~~~-aH~A~~D~~~t~~l~k~i~~~~ 195 (471) .+.|++||... |.. .+. ++-|..|+.+|..|...+..+. T Consensus 482 -r~~Ld~LA~~yL~~~~i~~edl~gkGk~ql~f~~v~l~~~~~YAaedA~itl~L~~~L~~~L 543 (954) T PRK06887 482 -RHNMDDLAERYLGHETIAFESIAGKGKNQLTFDQIPLEQATEYAAEDADVTMKLHQVLWLEL 543 (954) T ss_pred -CCCHHHHHHHHCCCCCCCHHHHCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf -57889999996388653346422544333784434467999999999999999999999999 No 82 >smart00486 POLBc DNA polymerase type-B family. DNA polymerase alpha, delta, epsilon and zeta chain (eukaryota), DNA polymerases in archaea, DNA polymerase II in e. coli, mitochondrial DNA polymerases and and virus DNA polymerases Probab=98.01 E-value=0.00042 Score=45.10 Aligned_cols=164 Identities=19% Similarity=0.106 Sum_probs=97.0 Q ss_pred CCCCEEEEECCCCCCC-----C--CCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCC Q ss_conf 9883799975278898-----7--72604899999983897280400226621788877857898985799889960799 Q gi|254780661|r 1 MTNHFVIYDYETFGRD-----V--ALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGV 73 (471) Q Consensus 1 m~~~~v~~D~ETtG~~-----~--~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~ 73 (471) +..+.+.||+||+... + ..+.|++++....+..-..+.....+. ..+.. .+.++. +..-. T Consensus 1 ~~~~i~s~DIEt~~~~~~~p~~~~~~d~i~~i~~~~~~~~~~~~~~~~~~~--~~~~~-------~~~~~~---~~~~~- 67 (471) T smart00486 1 PPLKILSFDIETYTDGGLFPDPLIFEDEIIQISLVINDGDKKGPEERICFT--LGTCK-------EIDGVE---VYEFN- 67 (471) T ss_pred CCCEEEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCEEEEEEEEEEC--CCCCC-------CCCCCE---EEECC- T ss_conf 994699999896679999999888998499999998219878788775430--36767-------789988---99809- Q ss_pred CHHHHHHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCCCC---------------------------CCCCC Q ss_conf 979999999999511796-59995055789999999998507853213---------------------------25688 Q gi|254780661|r 74 VEAEFSRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSYRW---------------------------SWDNG 125 (471) Q Consensus 74 ~~~e~~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~~---------------------------~~~~~ 125 (471) ++.+++..+.+++...++ +++|||..+||.+++..-.-+.+.++... ....| T Consensus 68 ~E~~ll~~f~~~i~~~dpdii~g~N~~~FD~~yi~~R~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 147 (471) T smart00486 68 NEKELLKAFLEFIKKYDPDIIYGHNISNFDLPYIISRLEKLKIKPLSFIGRLKNIIDIKRKKPLFGSKSFGKTIKVKIKG 147 (471) T ss_pred CHHHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHCCCHHHHHCCCCCCCCCCEECCCCCCCCCCCEEEEEEEE T ss_conf 99999999999998719999996588787689999999984896477637357787643203321012566314899855 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCCC---------------------CCHHHHHHHHH Q ss_conf 710015888865556532100244444544331089899871887643---------------------10133318999 Q gi|254780661|r 126 NSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLALANGIEHVN---------------------AHDAKADVYAT 184 (471) Q Consensus 126 ~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la~~n~i~~~~---------------------aH~A~~D~~~t 184 (471) ...+|++..++... ..++++|+.+|+...-.... ++=.+.||..+ T Consensus 148 ~~~~D~~~~~~~~~--------------~l~sy~L~~v~~~~l~~~k~d~~~~~i~~~~~~~~~~~~~~~~Y~~~D~~l~ 213 (471) T smart00486 148 RLVIDLYNLYKNKL--------------KLPSYKLDTVAEYLLGKEKDDLPYKDIPELYNLNYKLRDELLEYCIQDAVLT 213 (471) T ss_pred EEEEEHHHHHHHHH--------------CCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHH T ss_conf 99978899998860--------------4466888999999957788889989999998539367999999999999999 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254780661|r 185 LALVWLI 191 (471) Q Consensus 185 ~~l~k~i 191 (471) .+|+..+ T Consensus 214 ~~l~~~~ 220 (471) T smart00486 214 LKLFNKL 220 (471) T ss_pred HHHHHHH T ss_conf 9999986 No 83 >pfam01612 3_5_exonuc 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in E. coli polI it is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Werner syndrome is a human genetic disorder causing premature aging; the WRN protein has helicase activity in the 3'-5' direction. The FFA-1 protein is required for formation of a replication foci and also has helicase activity; it is a homologue of the WRN protein. RNase D is a 3'-5' exonuclease involved in tRNA processing. Also found in this family is the autoantigen PM/Scl thought to be involved in polymyositis-scleroderma overlap syndrome. Probab=98.00 E-value=0.00055 Score=44.35 Aligned_cols=135 Identities=20% Similarity=0.122 Sum_probs=78.8 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ...+.||+||||+++...++.-+.......+ .. + +|.+. +..++.+.+ T Consensus 20 ~~~ia~DtEt~~~~~~~~~~~~~llq~~~~~-----~~--y--------li~~~-----------------~~~~~~~~L 67 (172) T pfam01612 20 AKVVAVDTETTSLDTYSYTLVGCLIQISTGE-----GA--Y--------IIDPL-----------------ALGDDLEGL 67 (172) T ss_pred CCCEEEEEECCCCCCCCCCCCEEEEEEECCC-----CC--E--------EEECC-----------------CCHHHHHHH T ss_conf 9908998062898867613433699996798-----60--7--------76434-----------------756679999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) .++|.++..+.|||| +.||..+|++.+- .. ....+|++-..+ .+.+. .+++|.. T Consensus 68 ~~ll~~~~i~Kvghn-~k~D~~~L~~~~~---~~--------~~~~~Dt~~a~~---~l~~~-----------~~~gL~~ 121 (172) T pfam01612 68 KRLLEDPNITKVGHN-AKFDLEVLARDFG---IK--------LENLFDTMLAAY---LLGYP-----------RSHSLDD 121 (172) T ss_pred HHHHCCCCCCEEEEE-HHHHHHHHHHHCC---CC--------CCCCHHHHHHHH---HCCCC-----------CCCCHHH T ss_conf 999759998799975-8999999988547---66--------687327899998---81996-----------4356999 Q ss_pred HHHH-HCCCCC----------------CCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9987-188764----------------31013331899999999998642 Q gi|254780661|r 163 LALA-NGIEHV----------------NAHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 163 la~~-n~i~~~----------------~aH~A~~D~~~t~~l~k~i~~~~ 195 (471) ||.. .|+... .-|=|..||.+|..|...+..+. T Consensus 122 L~~~~lg~~~~K~~~~s~w~~rpL~~~qi~YAa~Da~~~~~l~~~l~~~l 171 (172) T pfam01612 122 LAEKYLGVELDKSEQCADWRARPLSEEQLRYAAEDADYLLRLYDKLRKEL 171 (172) T ss_pred HHHHHCCCCCCHHHEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998098787441566788789829999999999999999999999974 No 84 >KOG2248 consensus Probab=98.00 E-value=0.00014 Score=48.16 Aligned_cols=156 Identities=19% Similarity=0.195 Sum_probs=97.8 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) .+.+-.|.|-...... -.+..+++| |.+++++ ++. +|+|...+ .+ ......|||.+++.+.+.+..++=.++ T Consensus 216 ~~i~AlDCEm~~te~g-~el~RVt~V--D~~~~vi--~D~-fVkP~~~V-vD-y~T~~SGIT~~~~e~~t~tl~dvq~~l 287 (380) T KOG2248 216 PNIFALDCEMVVTENG-LELTRVTAV--DRDGKVI--LDT-FVKPNKPV-VD-YNTRYSGITEEDLENSTITLEDVQKEL 287 (380) T ss_pred CCEEEEEEEEEEECCC-EEEEEEEEE--ECCCCEE--EEE-EECCCCCC-CC-CCCCCCCCCHHHHHCCCCCHHHHHHHH T ss_conf 9747887565430565-036786465--0468478--678-63478861-30-125655420998725862799999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCC-CCCHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443-31089 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGAT-SFKLQ 161 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~-~~kL~ 161 (471) .+++ ..+++.|| |+..=|..-||-. ++ ...|+- +-+.....+.+ --+|. T Consensus 288 ~~~~-~~~TILVG-HSLenDL~aLKl~-----H~----------~ViDTa-------------~lf~~~~g~~~~k~sLk 337 (380) T KOG2248 288 LELI-SKNTILVG-HSLENDLKALKLD-----HP----------SVIDTA-------------VLFKHPTGPYPFKSSLK 337 (380) T ss_pred HHHC-CCCCEEEE-ECHHHHHHHHHHH-----CC----------CEEEEE-------------EEEECCCCCCCCHHHHH T ss_conf 9646-86717995-1003078897642-----77----------422005-------------88864889854259999 Q ss_pred HHHHH-HCCCCC---CCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89987-188764---310133318999999999986422 Q gi|254780661|r 162 DLALA-NGIEHV---NAHDAKADVYATLALVWLIREKKP 196 (471) Q Consensus 162 ~la~~-n~i~~~---~aH~A~~D~~~t~~l~k~i~~~~p 196 (471) .||+. .|.... .-|++..|+.|+++|.+......+ T Consensus 338 ~L~~~~L~~~Iq~~~~~HdS~eDA~acm~Lv~~k~~~~~ 376 (380) T KOG2248 338 NLAKSYLGKLIQEGVGGHDSVEDALACMKLVKLKIKNSE 376 (380) T ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 999999889986167787608889999999999985153 No 85 >cd05784 DNA_polB_II_exo The 3'-5' exonuclease domain of Escherichia coli DNA polymerase II (PolII) and similar bacterial proteins. PolII is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain has a fundamental role in the proofreading activity of polII. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. PolII is involved in a variety of cellular activities, such as the repair of DNA damaged by UV irradiation or oxidation. It plays a pivotal role in replication-restart, a process that bypasses DNA damage in a Probab=97.95 E-value=0.00015 Score=47.93 Aligned_cols=128 Identities=18% Similarity=0.061 Sum_probs=75.4 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) -+.+.||+||+. .+.|+.+|.. ....+ .. +....... . .|.+ +.--. +|.+++..+ T Consensus 3 lki~s~DIE~~~----~~~i~~I~~~--~~~~~---~v--~~~~~~~~---~------~~~~---v~~~~-sE~eLL~~F 58 (193) T cd05784 3 LKVVSLDIETSM----DGELYSIGLY--GEGQE---RV--LMVGDPED---D------APDN---IEWFA-DEKSLLLAL 58 (193) T ss_pred CEEEEEEECCCC----CCCEEEEEEE--CCCCC---EE--EEEECCCC---C------CCCE---EEEEC-CHHHHHHHH T ss_conf 309999963279----8888899998--69999---89--99978988---8------9977---99989-999999999 Q ss_pred HHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCCCC------------------CCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 999511796-59995055789999999998507853213------------------25688710015888865556532 Q gi|254780661|r 83 HQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSYRW------------------SWDNGNSRWDLLDVMRAIYAFSP 143 (471) Q Consensus 83 ~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~~------------------~~~~~~~~~D~l~~~~~~~~~~p 143 (471) .+++...+| +++|||+.+||.+.|..=.-+++.++... ....|...+|++.+++... T Consensus 59 ~~~i~~~dPDiItG~N~~~FDiPYL~~R~~~~~i~~~~~r~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~~~~~---- 134 (193) T cd05784 59 IAWFAQYDPDIIIGWNVINFDLRLLQRRAEAHGLPLRLGRGGSPLNWRQSGKPGQGFLSLPGRVVLDGIDALKTAT---- 134 (193) T ss_pred HHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCEEEEEECCEEEECHHHHHHHCC---- T ss_conf 9999862999999779877767899999998588643365788505987685626999978878417299997607---- Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 100244444544331089899871 Q gi|254780661|r 144 DGIQWPSRDDGATSFKLQDLALAN 167 (471) Q Consensus 144 ~~~~~~~~~~~~~~~kL~~la~~n 167 (471) . .-+||+|+.+|+.. T Consensus 135 --~-------~l~sy~Ld~Va~~~ 149 (193) T cd05784 135 --Y-------HFESFSLENVAQEL 149 (193) T ss_pred --C-------CHHHHHHHHHHHHH T ss_conf --3-------51143488899998 No 86 >cd05778 DNA_polB_zeta_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase zeta. DNA polymerase zeta is a family-B DNA polymerase which is distantly related to DNA polymerase delta. It plays a major role in translesion replication and the production of either spontaneous or induced mutations. In addition, DNA polymerase zeta also appears to be involved in somatic hypermutability in B lymphocytes, an important element for the production of high affinity antibodies in response to an antigen. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The DnaQ-like 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. Probab=97.91 E-value=0.00013 Score=48.38 Aligned_cols=105 Identities=11% Similarity=-0.021 Sum_probs=63.7 Q ss_pred CEEEEECCCCC-----CCCCCCEEEEEEEEEECCCCEECC---EEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCH Q ss_conf 37999752788-----987726048999999838972804---0022662178887785789898579988996079997 Q gi|254780661|r 4 HFVIYDYETFG-----RDVALDRPAQFAGVRVDRQWEKIE---STEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVE 75 (471) Q Consensus 4 ~~v~~D~ETtG-----~~~~~~~iiq~~~i~~d~~~~~~~---~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~ 75 (471) +.+-||+|+.+ -+|..|.|++|+...-+....... ......+. +...+........++....+..- .+| T Consensus 5 tiLSfDIe~~sr~~~fPdP~~D~I~~I~~~~~~~~~~~~~~~~~~g~i~~~--~~~~~~~~~~~~~~~~~~~v~~~-~~E 81 (231) T cd05778 5 TILSLEVHVNTRGDLLPDPEFDPISAIFYCIDDDVSPFILDANKVGVIIVD--ELKSNASNGRIRSGLSGIPVEVV-ESE 81 (231) T ss_pred EEEEEEEEECCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCEEEEEEC--CCCCCHHHCCCCCCCCCCEEEEE-CCH T ss_conf 899999998888999969897857899999626876544566520079955--86541121022257899649997-999 Q ss_pred HHHHHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHH Q ss_conf 9999999999511796-59995055789999999998 Q gi|254780661|r 76 AEFSRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFY 111 (471) Q Consensus 76 ~e~~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~ 111 (471) .|++..+.+++..++| +++|||..+||.+.+-.=.. T Consensus 82 ~eLl~~f~~~v~~~DPDii~Gyni~~fd~~Yl~~Ra~ 118 (231) T cd05778 82 LELFEELIDLVRRFDPDILSGYEIQRSSWGYLIERAA 118 (231) T ss_pred HHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHH T ss_conf 9999999999997099989622556686189999999 No 87 >PRK05755 DNA polymerase I; Provisional Probab=97.90 E-value=0.0011 Score=42.55 Aligned_cols=142 Identities=20% Similarity=0.203 Sum_probs=88.5 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ...+.+|+||||+++....++-++.. + ..+ ..+++-+. + . ...+..++...+ T Consensus 319 ~~~va~dtEttsld~~~a~lVGis~s-~-~~g------~a~YIPl~-~-----------~--------~~~~~~~vl~~L 370 (889) T PRK05755 319 AGLVAFDTETTSLDPMSANLVGLSFA-V-EEG------EAAYIPLA-H-----------D--------DQLDRDDALAAL 370 (889) T ss_pred CCEEEEEEECCCCCCCCCCEEEEEEE-C-CCC------CEEEEECC-C-----------C--------CCCCHHHHHHHH T ss_conf 88499998417887565762599998-2-799------68998444-3-----------1--------336889999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) ..+|.+.+...|||| +.||..+|++ ++++. .+..+|++-.+ |.+.|+ ..+|.|++ T Consensus 371 k~lLed~~i~KIghN-lK~dl~vL~~----~GI~l-------~g~~fDTmLAa---YLLdP~----------~~~~~Ld~ 425 (889) T PRK05755 371 KPLLEDPAIPKVGQN-LKYDRHILAN----YGIEL-------RGIAFDTMLAS---YLLDPG----------AGRHDLDS 425 (889) T ss_pred HHHHCCCCCCEEEEC-HHHHHHHHHH----CCEEC-------CCCCHHHHHHH---HHHCCC----------CCCCCHHH T ss_conf 999709887243203-1899999997----79235-------77630199999---874788----------77677899 Q ss_pred HHHHH-CCCC--------------------CC-CCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99871-8876--------------------43-101333189999999999864226 Q gi|254780661|r 163 LALAN-GIEH--------------------VN-AHDAKADVYATLALVWLIREKKPK 197 (471) Q Consensus 163 la~~n-~i~~--------------------~~-aH~A~~D~~~t~~l~k~i~~~~p~ 197 (471) ||... |... +. ++-|..|+.+|..|...+..+..+ T Consensus 426 La~~yL~~~~~~~eel~gkg~~~~~~~v~~e~~~~YaaedA~~~~~L~~~L~~~L~~ 482 (889) T PRK05755 426 LAERYLGHKTISFEEVAGKGNQLTFAQVALEEAAEYAAEDADVTLRLHQVLKPELEE 482 (889) T ss_pred HHHHHHCCCCCCHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999863234420343136565566100787899999999886899999999999986 No 88 >cd06143 PAN2_exo PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. PAN catalyzes the deadenylation of poly(A) tails, which are initially synthesized to default lengths of 70 to 90, to mRNA-specific lengths of 55 to 71. Pab1p and PAN also play a role in the export and decay of mRNA. PAN2 contains a DEDDh-type DnaQ-like 3'-5' exonuclease domain with three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=97.89 E-value=0.0001 Score=49.12 Aligned_cols=148 Identities=14% Similarity=0.088 Sum_probs=83.9 Q ss_pred EEECCCCCCCCCCCE--------E-----EEEEEE-EE----CCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHH Q ss_conf 997527889877260--------4-----899999-98----38972804002266217888778578989857998899 Q gi|254780661|r 7 IYDYETFGRDVALDR--------P-----AQFAGV-RV----DRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKA 68 (471) Q Consensus 7 ~~D~ETtG~~~~~~~--------i-----iq~~~i-~~----d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~ 68 (471) -.|.|--|+.|.--+ + .-+|=| .+ |..|+++ ++. +++|...+ .-.-+.+.|||++++ T Consensus 2 alDcE~Vg~g~~~~~~~~~g~~~~~~~~~~aLARvSiV~~~g~~~G~vl--~D~-yVkP~~~V--tDYrT~~SGIt~~~L 76 (174) T cd06143 2 AIDAEFVKLKPEETEIRSDGTKSTIRPSQMSLARVSVVRGEGELEGVPF--IDD-YISTTEPV--VDYLTRFSGIKPGDL 76 (174) T ss_pred CCCCEEEEECCCCCCCCCCCCEEECCCCCCCCEEEEEECCCCCCCCCEE--EEE-EECCCCCC--CCCCCCCCCCCHHHC T ss_conf 1522146644641202578844110356764302789844587168888--555-17899974--625871635688895 Q ss_pred HHCC-----CCHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 6079-----99799999999995117965999505578999999999850785321325688710015888865556532 Q gi|254780661|r 69 LRDG-----VVEAEFSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSP 143 (471) Q Consensus 69 ~~~~-----~~~~e~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p 143 (471) .... .+..++.+++..++. .++++|||- +..|...|+-. . |. ....|+-.+.+ T Consensus 77 ~~~~s~~~~~~~~~~~~~l~~li~-~~~ILVGHs-L~nDL~aL~l~-----h-P~-------~~i~DTs~l~~------- 134 (174) T cd06143 77 DPKTSSKNLTTLKSAYLKLRLLVD-LGCIFVGHG-LAKDFRVINIQ-----V-PK-------EQVIDTVELFH------- 134 (174) T ss_pred CCCCCCCCCCCHHHHHHHHHHHCC-CCCEEECCC-CHHHHHHHCCC-----C-CC-------CCEEECHHHCC------- T ss_conf 855564443409999999998547-996897896-36679885066-----9-98-------66577788526------- Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHH-CCCC-CCCCHHHHHHHHHHHHHH Q ss_conf 100244444544331089899871-8876-431013331899999999 Q gi|254780661|r 144 DGIQWPSRDDGATSFKLQDLALAN-GIEH-VNAHDAKADVYATLALVW 189 (471) Q Consensus 144 ~~~~~~~~~~~~~~~kL~~la~~n-~i~~-~~aH~A~~D~~~t~~l~k 189 (471) . .+....+|-.||... |.+. ++.|+...||.|+++|.| T Consensus 135 ----~----~~~r~~sLk~La~~~Lg~~IQ~g~HdSvEDArAam~LYr 174 (174) T cd06143 135 ----L----PGQRKLSLRFLAWYLLGEKIQSETHDSIEDARTALKLYR 174 (174) T ss_pred ----C----CCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHC T ss_conf ----8----998870899999998081046989593999999999629 No 89 >PRK07456 consensus Probab=97.79 E-value=0.0023 Score=40.32 Aligned_cols=147 Identities=19% Similarity=0.188 Sum_probs=88.8 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ..++.||+||||++|...+++-+|...-...+ . .+++ |-.+ .-.+. ..+.-++...+ T Consensus 382 ~~~vA~DtETtsLdp~~A~lVGis~s~~~~~~----~--a~YI-Pl~h-----------~~~~~-----~l~~~~vl~~L 438 (975) T PRK07456 382 EAPVAWDTETTSLNPRDAELVGIGCCWGEELD----D--LAYI-PLGH-----------QQAGE-----QLPLETVLEAL 438 (975) T ss_pred CCEEEEECCCCCCCHHHCCEEEEEEECCCCCC----C--EEEE-ECCC-----------CCCHH-----HCCHHHHHHHH T ss_conf 76589980468877423210266641155787----2--6997-7345-----------56131-----05799999999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) ..+|.+.....+||| +.||..+|+ +++++. .+-.+|++- +.|++.|+ .++.|++ T Consensus 439 kpiLed~~i~KighN-lKyD~~vL~----~~GI~l-------~g~~fDTmL---AsYLLnP~-----------~~h~Ld~ 492 (975) T PRK07456 439 RPWLESPEHPKALQN-AKFDRLILL----RHGIAL-------KGVVFDTLL---ADYLLNPE-----------AKHNLDD 492 (975) T ss_pred HHHHCCCCCCCHHHH-HHHHHHHHH----HCCCCC-------CCCCCCHHH---HHHHHCCC-----------CCCCHHH T ss_conf 998628875310211-146799999----769510-------675513999---99876876-----------5589899 Q ss_pred HHHHH-CCCC------------------CC-CCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99871-8876------------------43-1013331899999999998642268 Q gi|254780661|r 163 LALAN-GIEH------------------VN-AHDAKADVYATLALVWLIREKKPKL 198 (471) Q Consensus 163 la~~n-~i~~------------------~~-aH~A~~D~~~t~~l~k~i~~~~p~~ 198 (471) ||... |+.. .. ++-|..|+.+|..|...+..+..+. T Consensus 493 La~~yL~~~~i~~edl~gkgk~~~~v~~~~~~~ya~~dA~~~~~L~~~l~~~L~~~ 548 (975) T PRK07456 493 LAQRELGFTPTSYKDLVGKGQTFADVDIATASQYCGMDVYLTYRLAPILREQLAKT 548 (975) T ss_pred HHHHHHCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99997188877499862888760208978999999999999999999999999862 No 90 >cd05783 DNA_polB_B1_exo The 3'-5' exonuclease domain of Sulfolobus solfataricus DNA polymerase B1 and similar archaeal proteins. B1 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B1displays thermostable polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family-B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Family-B DNA polymerases from thermophilic archaea are unique in that they are able to recognize the presence of uracil in the template strand, leading to the stalling of DNA synthes Probab=97.77 E-value=0.00017 Score=47.57 Aligned_cols=103 Identities=17% Similarity=0.020 Sum_probs=58.9 Q ss_pred CCEEEEECCCC-C----C-CCCC--CEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCC Q ss_conf 83799975278-8----9-8772--6048999999838972804002266217888778578989857998899607999 Q gi|254780661|r 3 NHFVIYDYETF-G----R-DVAL--DRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVV 74 (471) Q Consensus 3 ~~~v~~D~ETt-G----~-~~~~--~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~ 74 (471) -+.+.||+||. . . +|.. +.|++|+.. +.++.. ++..+ .++.... . ......|+. +..- .+ T Consensus 5 lrvlsfDIEv~s~~~~~FP~p~~~~~~iisI~~~--~~~~~~--~~~~~-~~~~~~~--~-~~~~~~~~~---v~~f-~~ 72 (204) T cd05783 5 LKRIAIDIEVYTPIKGRIPDPKTAEYPVISVALA--GSDGLK--RVLVL-KREGVEG--L-EGLLPEGAE---VEFF-DS 72 (204) T ss_pred CEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEE--CCCCCE--EEEEE-EECCCCC--C-CCCCCCCCE---EEEE-CC T ss_conf 7499999887889999797955588619999999--279988--99999-5087125--7-655789989---9995-99 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCC Q ss_conf 79999999999511796599950557899999999985078532 Q gi|254780661|r 75 EAEFSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSY 118 (471) Q Consensus 75 ~~e~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y 118 (471) |.+++..+.+++.+. ++++|||+.+||.+.|..=.-+.+.... T Consensus 73 E~~LL~~F~~~i~~~-p~Ii~wN~~~FDiPYL~~R~~~Lg~~~~ 115 (204) T cd05783 73 EKELIREAFKIISEY-PIVLTFNGDNFDLPYLYNRALKLGIPKE 115 (204) T ss_pred HHHHHHHHHHHHHCC-CEEEECCCCCCCHHHHHHHHHHHCCCCE T ss_conf 999999999997439-9799817867788999999998599820 No 91 >cd06142 RNaseD_exo Ribonuclease (RNase) D is a bacterial enzyme involved in the maturation of small stable RNAs and the 3' maturation of tRNA. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. In vivo, RNase D only becomes essential upon removal of other ribonucleases. Eukaryotic RNase D homologs include yeast Rrp6p, human PM/Scl-100 and the Drosophila melanogaster egalitarian protein. Probab=97.76 E-value=0.003 Score=39.67 Aligned_cols=132 Identities=20% Similarity=0.127 Sum_probs=80.0 Q ss_pred CCCEEEEECCCCCCCCCCCEE--EEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHH Q ss_conf 883799975278898772604--899999983897280400226621788877857898985799889960799979999 Q gi|254780661|r 2 TNHFVIYDYETFGRDVALDRP--AQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFS 79 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~~~~~i--iq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~ 79 (471) ....|-+|+||+|+++..+++ ||+| ++. .. +.. ++.+. + .. T Consensus 11 ~~~~i~~DtE~~~~~~~~~~l~LiQia---~~~------~~--yli--------d~~~~---~---------------~~ 53 (178) T cd06142 11 GAGVIAVDTEFMRLRTYYPRLCLIQIS---TGG------QA--YLI--------DPLAI---G---------------DL 53 (178) T ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEE---ECC------CE--EEE--------ECCCC---C---------------CH T ss_conf 799489989867885588728999998---689------06--999--------53123---4---------------46 Q ss_pred HHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 99999951179659995055789999999998507853213256887100158888655565321002444445443310 Q gi|254780661|r 80 RRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFK 159 (471) Q Consensus 80 ~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~k 159 (471) ..+.++|.++..+.|||| +.||..+|++.+-.. ..+.+|++-..+.. .+ ..+++ T Consensus 54 ~~L~~lL~d~~i~KVgh~-~~~Dl~~L~~~~~~~-----------~~~v~Dt~ia~~~l---~~-----------~~~~~ 107 (178) T cd06142 54 SPLKELLADPNIVKVFHA-AREDLELLKRDFGIL-----------PVNLFDTQIAARLL---GL-----------GDSVG 107 (178) T ss_pred HHHHHHHCCCCCEEEEEE-HHHHHHHHHHHHCCC-----------CCCEEEHHHHHHHC---CC-----------CCCHH T ss_conf 999999759994399962-799999999973967-----------65453089999982---88-----------67378 Q ss_pred HHHHHH-HHCCCCC----------------CCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 898998-7188764----------------310133318999999999986422 Q gi|254780661|r 160 LQDLAL-ANGIEHV----------------NAHDAKADVYATLALVWLIREKKP 196 (471) Q Consensus 160 L~~la~-~n~i~~~----------------~aH~A~~D~~~t~~l~k~i~~~~p 196 (471) |..++. ..|+..+ .-+=|..||..+.+|...+..+.. T Consensus 108 L~~L~~~~lg~~l~K~~q~s~W~~rPLs~~qi~YAa~Da~~l~~l~~~l~~~L~ 161 (178) T cd06142 108 LAALVEELLGVELDKGEQRSDWLKRPLTDEQLEYAANDVRYLLPLYEKLKEELE 161 (178) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999992987777736466433678289999999979999999999999999 No 92 >smart00474 35EXOc 3'-5' exonuclease. 3\' -5' exonuclease proofreading domain present in DNA polymerase I, Werner syndrome helicase, RNase D and other enzymes Probab=97.71 E-value=0.0018 Score=41.07 Aligned_cols=132 Identities=18% Similarity=0.126 Sum_probs=74.8 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 37999752788987726048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) ..|.+|+||||+++..++++=+. +.++.+ .. +.. .+... .+....+. T Consensus 22 ~~i~~DtE~~~~~~~~~~l~~iq-l~~~~~-----~~--yi~--~~~~~-----------------------~~~~~~l~ 68 (172) T smart00474 22 GEVALDTETTGLNSYSGKLVLIQ-ISVTGE-----GA--FII--DPLAL-----------------------GDDLEILK 68 (172) T ss_pred CEEEEECCCCCCCCCCCEEEEEE-EEECCC-----EE--EEE--ECCCC-----------------------HHHHHHHH T ss_conf 95999883068853787589999-996898-----48--997--01224-----------------------21799999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 99511796599950557899999999985078532132568871001588886555653210024444454433108989 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) ++++++....|||| +.||..+|++ .+..+ ++ .+|++-..+. ..+ +..+++|..+ T Consensus 69 ~ll~~~~i~kv~hn-~k~D~~~L~~----~gi~l-------~~-v~Dt~ia~~l---~~~----------~~~~~~L~~l 122 (172) T smart00474 69 DLLEDETITKVGHN-AKFDLHVLAR----FGIEL-------EN-IFDTMLAAYL---LLG----------GPSKHGLATL 122 (172) T ss_pred HHHCCCCCCEEEEC-HHHHHHHHHH----CCCCC-------CC-CHHHHHHHHH---CCC----------CCCHHHHHHH T ss_conf 99809998599974-5999999997----79856-------75-2467998775---057----------9864899999 Q ss_pred HHHH-CCCCC----------------CCCHHHHHHHHHHHHHHHHHHH Q ss_conf 9871-88764----------------3101333189999999999864 Q gi|254780661|r 164 ALAN-GIEHV----------------NAHDAKADVYATLALVWLIREK 194 (471) Q Consensus 164 a~~n-~i~~~----------------~aH~A~~D~~~t~~l~k~i~~~ 194 (471) +... |+... ..+=|..||.+|..|...+..+ T Consensus 123 ~~~~~~~~l~k~~~~~~w~~rpl~~~~~~YAa~Da~~~~~l~~~l~~~ 170 (172) T smart00474 123 LKEYLGVELDKEEQKSDWGARPLSEEQLQYAAEDADALLRLYEKLEKE 170 (172) T ss_pred HHHHCCEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999749231555564666789990999999999999999999999997 No 93 >PRK07556 consensus Probab=97.70 E-value=0.00049 Score=44.71 Aligned_cols=153 Identities=16% Similarity=0.152 Sum_probs=88.2 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ..++.||+||||+++....++-++.- ++. + ..+++ |-.+.-.. ..+.-.+. .....+..++...+ T Consensus 387 ~~~~a~DtETtsld~~~a~lVGisls-~~~-~------~a~YI-Pl~h~~~~-~~~~~~~~-----~~~ql~~~~vL~~L 451 (977) T PRK07556 387 TGIVAFDTETTSLDPMQAELVGFSLA-LAP-G------RACYI-PLGHKSGD-GDLFGGGL-----AEGQIPLRDALAAL 451 (977) T ss_pred CCEEEEEEECCCCCHHHCCEEEEEEE-ECC-C------CEEEE-ECCCCCCC-CCHHCCCC-----CHHHCCHHHHHHHH T ss_conf 79589997318878301553579998-068-9------78999-61444564-20110222-----23230599999998 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) ..+|.++...-|||| +.||..+|+. +++.. .+ ..|++-. +|.+.|+ ..++.|+. T Consensus 452 kplLed~~i~Kighn-lK~d~~vL~~----~gi~l------~~--i~DtmLa---aYLLdp~----------~~~~~Ld~ 505 (977) T PRK07556 452 KPVLEDPSVLKIGQN-MKYDALVLAR----HGITV------AP--FDDTMLI---SYALDAG----------RGGHGMDA 505 (977) T ss_pred HHHHCCCCHHHHHHH-HHHHHHHHHH----CCCCC------CC--CCCHHHH---HHHHCCC----------CCCCCHHH T ss_conf 988538731144412-5688999997----69602------68--5318888---8883488----------66478799 Q ss_pred HHHHH-CCC---------------------CCC-CCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99871-887---------------------643-10133318999999999986422 Q gi|254780661|r 163 LALAN-GIE---------------------HVN-AHDAKADVYATLALVWLIREKKP 196 (471) Q Consensus 163 la~~n-~i~---------------------~~~-aH~A~~D~~~t~~l~k~i~~~~p 196 (471) ||..+ |.. .+. ..-|..|+.+|..|...+..+.. T Consensus 506 La~~yL~~~~~~~~dl~g~gk~~~~f~~v~~~~~~~YaaedA~~t~~L~~~L~~~L~ 562 (977) T PRK07556 506 LSERWLGHTPIAYKDLTGSGKSAITFDQVDIDRATAYAAEDADVTLRLWQVLKPRLA 562 (977) T ss_pred HHHHHCCCCCCCCHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999964887621000034550104765520689999999999999999999999998 No 94 >cd06140 DNA_polA_I_Bacillus_like_exo Bacillus stearothermophilus-like Polymerase I (pol I), a subgroup of the family-A DNA polymerases, contains an inactive DnaQ-like 3'-5' exonuclease domain in the same polypeptide chain as the polymerase region. The exonuclease-like domain of these proteins possess the same fold as the Klenow fragment (KF) of Escherichia coli pol I, but does not contain the four critical metal-binding residues necessary for activity. The function of this domain is unknown. It might act as a spacer between the polymerase and the 5'-3' exonuclease domains. Some members of this subgroup, such as those from Bacillus sphaericus and Thermus aquaticus, are thermostable DNA polymerases. Probab=97.57 E-value=0.006 Score=37.71 Aligned_cols=137 Identities=14% Similarity=0.071 Sum_probs=83.9 Q ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHH Q ss_conf 88379997527889877260489999998389728040022662178887785789898579988996079997999999 Q gi|254780661|r 2 TNHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRR 81 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~ 81 (471) ...-+.+|+||||+++..++|+-+|.- . . + + .+++-. .+. . ..... T Consensus 2 ~~~~~av~~e~~~~~~~~~~i~gi~l~-~-~-~----~--~~yi~~-~~~------~------------------~~~~~ 47 (178) T cd06140 2 LADEVALYVELLGENYHTADIIGLALA-N-G-G----G--AYYIPL-ELA------L------------------LDLAA 47 (178) T ss_pred CCCCEEEEEECCCCCCCCCEEEEEEEE-E-C-C----E--EEEEEC-CCC------H------------------HHHHH T ss_conf 888669997379998453549999998-8-9-9----7--999956-673------6------------------46999 Q ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 99995117965999505578999999999850785321325688710015888865556532100244444544331089 Q gi|254780661|r 82 IHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQ 161 (471) Q Consensus 82 i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~ 161 (471) +..+|.+....-+||| +.||..+++. ++++. .+-.+|++-.+ |.+.|+ ..++.|. T Consensus 48 lk~~l~d~~i~Ki~hn-~K~~~~~L~~----~Gi~l-------~g~~fDtmiaa---yLl~p~----------~~~~~L~ 102 (178) T cd06140 48 LKEWLEDEKIPKVGHD-AKRAYVALKR----HGIEL-------AGVAFDTMLAA---YLLDPT----------RSSYDLA 102 (178) T ss_pred HHHHHCCCCCCEEEEC-HHHHHHHHHH----CCCCC-------CCCHHHHHHHH---HHHCCC----------CCCCCHH T ss_conf 9999729999889865-7999999998----89806-------88303688898---875489----------7779999 Q ss_pred HHHHHHCC-CCCC-------------------CCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89987188-7643-------------------101333189999999999864226 Q gi|254780661|r 162 DLALANGI-EHVN-------------------AHDAKADVYATLALVWLIREKKPK 197 (471) Q Consensus 162 ~la~~n~i-~~~~-------------------aH~A~~D~~~t~~l~k~i~~~~p~ 197 (471) +++....- .... +.-|..||.+|.+|...+.++..+ T Consensus 103 ~L~~~yl~~~~~~~~~~~gk~~~~~~~~~~~~~~ya~~~a~~~~~L~~~l~~~L~e 158 (178) T cd06140 103 DLAKRYLGRELPSDEEVYGKGAKFAVPDEEVLAEHLARKAAAILRLAPKLEEELEE 158 (178) T ss_pred HHHHHHHCCCCCCHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999828666547886176656243997999999999999999999999999998 No 95 >PRK07997 consensus Probab=97.48 E-value=0.0037 Score=39.07 Aligned_cols=145 Identities=17% Similarity=0.134 Sum_probs=83.8 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ...+.||+||||+++....++-++.- +.. + ...++-+.....-.| .. .+.-.+...+ T Consensus 349 ~~~~a~DtEt~sld~~~a~lvGiS~s-~~~-g------~a~YiPl~h~~l~~~----------~q-----l~~~~vl~~L 405 (928) T PRK07997 349 APVFAFDTETDSLDNISANLVGLSFA-IEP-G------VAAYLPVAHDYLDAP----------DQ-----ISRERVLELL 405 (928) T ss_pred CCEEEEEEECCCCCCCCCCCCCEEEE-ECC-C------CEEEEECCCCCCCCH----------HH-----CCHHHHHHHH T ss_conf 98699995127866000244433898-627-7------468740221234760----------01-----2499999998 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) ..+|.+.....|||| +.||..+|+. ++++. .+-.+|++-.+ |.+.|. ..++.|+. T Consensus 406 kplLed~~i~KIghn-lK~d~~vL~~----~GI~l-------~gi~fDTmLAa---YLLnp~----------~~~~~Ld~ 460 (928) T PRK07997 406 KPLLEDEKALKVGQN-LKFDRGILAN----YGIEL-------RGIAFDTMLES---YILNSV----------AGRHDMDS 460 (928) T ss_pred HHHHCCCCCCHHCCH-HHHHHHHHHH----CCCEE-------CCCCCCHHHHH---HHCCCC----------CCCCCHHH T ss_conf 987528653032121-6889999997----79410-------56430345532---222443----------34478899 Q ss_pred HHHHHC-C---------------------CCCC-CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 998718-8---------------------7643-1013331899999999998642 Q gi|254780661|r 163 LALANG-I---------------------EHVN-AHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 163 la~~n~-i---------------------~~~~-aH~A~~D~~~t~~l~k~i~~~~ 195 (471) ||..+- . +.+. ++-|..|+.+|..|...+..+. T Consensus 461 La~~yL~~~~i~~~el~gkgk~q~~f~~v~~~~~~~Ya~edA~~t~~L~~~L~~~L 516 (928) T PRK07997 461 LAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAEDADVTLQLHLKMWPKL 516 (928) T ss_pred HHHHHHCCCCCCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99997134664335432446454774353599999999999999999999999998 No 96 >cd05776 DNA_polB_alpha_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase alpha. DNA polymerase alpha is a family-B DNA polymerase with a catalytic subunit that contains a DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (delta and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase alpha is almost exclusively required for the initiation of DNA replication and the priming of Okazaki fragments during elongation. It associates with DNA primase and is the only enzyme able to start DNA synthesis de novo. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. This explains why in most organisms, that no specific repair r Probab=97.46 E-value=0.0044 Score=38.59 Aligned_cols=110 Identities=9% Similarity=-0.085 Sum_probs=67.9 Q ss_pred CEEEEECCCCCCCCC--CCEEEEEEEEEECCCCEE-----CC---EEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCC Q ss_conf 379997527889877--260489999998389728-----04---00226621788877857898985799889960799 Q gi|254780661|r 4 HFVIYDYETFGRDVA--LDRPAQFAGVRVDRQWEK-----IE---STEVFFCKPADDYLPDPEAVFITGITPQKALRDGV 73 (471) Q Consensus 4 ~~v~~D~ETtG~~~~--~~~iiq~~~i~~d~~~~~-----~~---~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~ 73 (471) +.+-+.+.|. .|+. ..+|+.++++.. .+.+. .. ......++|.....|.+.-...-+.....+...+ T Consensus 4 tv~Sisi~T~-~n~k~~~~EI~~iS~~~~-~~~~~d~~~~~~~~~~~~~~~~r~~~~~~p~~~~~~~~~~~~~~v~~f~- 80 (234) T cd05776 4 TVMSLSIKTV-LNSKTNKNEIVMISMLVH-RNVSLDKPTPPPPFQSHTCTLTRPLGRSPPPDLFEKNAKKKKTKVRIFE- 80 (234) T ss_pred EEEEEEEEEE-ECCCCCCCEEEEEEEEEE-CCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHCCCCCEEECC- T ss_conf 8999998997-168888543889885141-6847789999754566089998489987997569999862698489848- Q ss_pred CHHHHHHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCC Q ss_conf 979999999999511796-5999505578999999999850785 Q gi|254780661|r 74 VEAEFSRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYD 116 (471) Q Consensus 74 ~~~e~~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~ 116 (471) +|.++...+..++...++ +++|||..+||.++|-+=......+ T Consensus 81 sEr~LL~~f~~~i~~~DPDii~Ghni~~fdl~~L~~R~~~lki~ 124 (234) T cd05776 81 NERALLNFFLAKLQKIDPDVLVGHDLEGFDLDVLLSRIQELKVP 124 (234) T ss_pred CHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCC T ss_conf 89999999999998509989985063777599999999980998 No 97 >PRK08835 consensus Probab=97.46 E-value=0.0058 Score=37.80 Aligned_cols=145 Identities=18% Similarity=0.179 Sum_probs=84.6 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ...+.||+||||+++....++-++... +. +. ..++-+..+.+-.|. . .+.-.+...+ T Consensus 351 ~~~~a~dtEt~~ld~~~a~lvGis~~~-~~-g~------a~YiPl~h~~l~~~~----------q-----l~~d~vl~~L 407 (931) T PRK08835 351 AELFAFDTETDSLDYMVANLVGLSFAV-AE-GE------AAYVPVAHDYLDAPE----------Q-----LDRDWVLAQL 407 (931) T ss_pred CCEEEEEEECCCCCHHHCCCCCEEEEE-CC-CC------EEEEEECCCCCCCHH----------H-----CCHHHHHHHH T ss_conf 885999950487432221335426876-47-85------799971233345510----------1-----3699999997 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) ..+|.+.....|||| +.||..+|+. ++... .+-.+|++-.+ |.+.|. ..++.|+. T Consensus 408 kplLed~~i~KIghN-lK~d~~vL~~----~gI~l-------~gi~fDTmLAa---YLLnp~----------~~~~~Ld~ 462 (931) T PRK08835 408 KPLLEDDAKAKVGQN-LKYDASVLAR----YGIEM-------KGIKHDTMLAS---YVYNSV----------GGKHDMDS 462 (931) T ss_pred HHHHCCCCHHHHHCC-HHHHHHHHHH----CCCCC-------CCCCCCHHHHH---HHCCCC----------CCCCCHHH T ss_conf 998428424366221-1578999997----79776-------78661243575---300477----------78789899 Q ss_pred HHHHH-C---------------------CCCCC-CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99871-8---------------------87643-1013331899999999998642 Q gi|254780661|r 163 LALAN-G---------------------IEHVN-AHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 163 la~~n-~---------------------i~~~~-aH~A~~D~~~t~~l~k~i~~~~ 195 (471) ||... + ++.+. ++-|..|+.+|..|...+..+. T Consensus 463 La~~~L~~~~i~~e~l~gkgk~q~~f~~v~~~~~~~YaaedA~~t~~L~~~L~~~L 518 (931) T PRK08835 463 LALRFLQHSCISFEQVAGKGKNQLTFNQIDLEEAAPYAAEDADVTLRLHNRLFANI 518 (931) T ss_pred HHHHHCCCCCCCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99986487763477651445344783343288999999999999999999999999 No 98 >PRK07625 consensus Probab=97.40 E-value=0.0088 Score=36.63 Aligned_cols=144 Identities=17% Similarity=0.168 Sum_probs=85.3 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ...+.||+||||+++...+++-++. .++. +. .+++-+ .+.-|+ +. ...+.-++...+ T Consensus 344 ~~~~a~dtEttsld~~~a~lvGis~-s~~~-g~------a~YiPl-~h~~~d--~~------------~~l~~~~vl~~L 400 (922) T PRK07625 344 AELTAFDTETTSLDPMLAQLVGLSF-SVEP-GH------AAYIPV-AHRGPD--MP------------AQLPRDEVLARL 400 (922) T ss_pred CCEEEEEEECCCCCHHHCEEEEEEE-EECC-CC------EEEEEE-CCCCCC--HH------------HHCCHHHHHHHH T ss_conf 8848999522776500130565797-7438-86------799970-215865--23------------215899999987 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) ..++.+.+...|||| +.||..+++. +++.. ..-.+|++-.++ .+.|. .++.|++ T Consensus 401 k~~Led~~i~KIghN-lK~d~~vL~~----~GI~l-------~g~~fDTmLAaY---LL~~~-----------r~~~Ld~ 454 (922) T PRK07625 401 RPWLEDAGRKKLGQH-LKYDAQVLAN----YGIAL-------NGIEHDTLLESY---VLESH-----------RTHDMDS 454 (922) T ss_pred HHHHHCCCCHHHHHH-HHHHHHHHHH----CCCCC-------CCCCHHHHHHHH---HHCCC-----------CCCCHHH T ss_conf 998743440355425-7789999997----79645-------875403999988---75467-----------6788799 Q ss_pred HHHHH-CCC---------------------CCC-CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99871-887---------------------643-1013331899999999998642 Q gi|254780661|r 163 LALAN-GIE---------------------HVN-AHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 163 la~~n-~i~---------------------~~~-aH~A~~D~~~t~~l~k~i~~~~ 195 (471) ||... |.. .+. ++=|..|+.+|..|...+..+. T Consensus 455 La~~yL~~~~i~~edl~Gkg~~q~~~~~v~~~~~~~Ya~eda~~t~~L~~~L~~~L 510 (922) T PRK07625 455 LALRHLGVKTIKYEDVAGKGAKQIGFDEVALAQASEYAAEDADITLQLHHALYPQV 510 (922) T ss_pred HHHHHHCCCCCCHHHHCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99987255655413212333333783224679999999999999999999999998 No 99 >PRK08786 consensus Probab=97.28 E-value=0.013 Score=35.62 Aligned_cols=144 Identities=15% Similarity=0.140 Sum_probs=85.0 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 37999752788987726048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) ..+-||+||||+++....++-++.. ++. + + ..++ |-.+..+ |.. .. .+..++...+. T Consensus 349 ~~~afdtET~sld~~~a~lVGiS~s-~~~-g----~--a~Yi-P~~h~~~--------g~~-~q-----ld~~~~l~~Lk 405 (927) T PRK08786 349 GQFAFDTETDSLDPLQADLIGLSVA-AEP-G----Q--AAYL-PFGHDFP--------GAP-VQ-----LDRTQALAQLA 405 (927) T ss_pred CEEEEEEECCCCCHHHHHHHCEEEE-CCC-C----C--EEEE-ECCCCCC--------CCC-HH-----HHHHHHHHHHH T ss_conf 8389997238866555312122454-267-8----6--6775-3244567--------860-34-----13999999989 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 99511796599950557899999999985078532132568871001588886555653210024444454433108989 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) .+|.++...-|||| +.||..+|+. ++++. .+..+|++-.. |.+.|+ ..++.|+.+ T Consensus 406 plLed~~I~KIghn-lK~Dl~vL~~----~GI~l-------~g~~~DTmLAa---YLL~p~----------~~~~~Ld~L 460 (927) T PRK08786 406 PLLTDPAVRKVGQH-GKYDLHVMRR----HGIAL-------AGYADDTLLES---FVLNSG----------SARHDMDSL 460 (927) T ss_pred HHHCCCCCCHHHHH-HHHHHHHHHH----CCCCC-------CCCCCCHHHHH---HHCCCC----------CCCCCHHHH T ss_conf 98608651123211-7888999996----79711-------67664258888---623776----------454788999 Q ss_pred HHHH-CCCC---------------------CC-CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9871-8876---------------------43-1013331899999999998642 Q gi|254780661|r 164 ALAN-GIEH---------------------VN-AHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 164 a~~n-~i~~---------------------~~-aH~A~~D~~~t~~l~k~i~~~~ 195 (471) |... |.+. +. .+-|..|+.+|..|...+..+. T Consensus 461 a~~yLg~~~~~~~~l~gkg~k~~~~~~v~~~~~~~Ya~eda~~tl~L~~~l~~kL 515 (927) T PRK08786 461 AKRYLGYDTVKYEDVCGKGAKQIPFAQVSLEDATRYAAEDADITLRLHCVLGPKL 515 (927) T ss_pred HHHHCCCCCCCHHHHCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9996697754267542443222883332599999999999999999999999999 No 100 >PRK05797 consensus Probab=97.14 E-value=0.021 Score=34.26 Aligned_cols=138 Identities=14% Similarity=0.121 Sum_probs=87.3 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 37999752788987726048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) .-+++|+|||+.++....++.+++..... + .+++ |.+.. .+.-++...+. T Consensus 320 ~~~~~~~ett~~~~~~~~~vg~s~~~~~~------~--a~yi-------p~~~~---------------~~~~~~L~~Lk 369 (869) T PRK05797 320 NQVYINFTLEDENLYSKIEIKKIFINFGE------K--TYYI-------DFKEL---------------IDEEEFIEDLK 369 (869) T ss_pred CCEEEEEECCCCCHHHHHHEEEEEEECCC------E--EEEE-------ECHHH---------------CCHHHHHHHHH T ss_conf 98799997389761203041899996199------7--9998-------31451---------------67789999999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHH Q ss_conf 99511796599950557899999999985078532132568871001588886555653210024444454433108989 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDL 163 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~l 163 (471) .+|.+.....|||| +.||..+|+. ++.+. .+-.+|++-.. |++.|+ ..+|.|+.| T Consensus 370 ~lLed~~i~KIGhN-lK~dl~vL~~----~GI~l-------~gi~FDTMLAs---YLLnP~----------~~~h~Ld~L 424 (869) T PRK05797 370 EIFENEEIKKIGHD-IKNFLTILKK----LGIEF-------KGLAFDTAIAA---YLIDPS----------KSEYDLSDL 424 (869) T ss_pred HHHHCCCCCEEEEC-CHHHHHHHHH----CCCCC-------CCHHHHHHHHH---HHHCCC----------CCCCCHHHH T ss_conf 99848998789843-0699999997----59855-------66166899999---972899----------888899999 Q ss_pred HHHH-CCCC--CCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9871-8876--4310133318999999999986422 Q gi|254780661|r 164 ALAN-GIEH--VNAHDAKADVYATLALVWLIREKKP 196 (471) Q Consensus 164 a~~n-~i~~--~~aH~A~~D~~~t~~l~k~i~~~~p 196 (471) |... |... ....-|..|+.++..|...+..+.. T Consensus 425 a~~yL~~~~~~~e~~ya~~dA~~~~~L~~~L~~~L~ 460 (869) T PRK05797 425 AKEYLSKDVNSEEEEYKIKEVSVMKELYEKLKEKIE 460 (869) T ss_pred HHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999808777514789999999999999999999998 No 101 >PRK08434 consensus Probab=96.94 E-value=0.031 Score=33.12 Aligned_cols=144 Identities=21% Similarity=0.282 Sum_probs=78.5 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ...+-+|+||||+++....++-++ +.++.+ ..+++-...... |. ++. .+..++...+ T Consensus 318 ~~~~a~d~Ett~ld~~~a~lvGis-~~~~~~-------~a~YiPi~h~~l---------~~-~~~-----~~~~~~~~~l 374 (887) T PRK08434 318 ESIIAFDTETTGLDTKEAKLVGFS-FCFNEE-------EAYYVPLAHSYL---------GV-GEQ-----ISLESAKKAI 374 (887) T ss_pred CCEEEEEEECCCCCCCCCCEEEEE-EEECCC-------CEEEEECCCCCC---------CC-HHH-----CCHHHHHHHH T ss_conf 676999951278661334305899-995598-------589997521125---------77-333-----5999999997 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) ...++.. .|||| +.||..++.+. .++.. ....+|++-. +|.+.|+. ++.|+. T Consensus 375 ~~~~~~~---~IghN-lKyDl~vL~~~---~gi~l-------~g~~fDTmLA---aYLLdp~~-----------r~~Ld~ 426 (887) T PRK08434 375 EKIFNHF---VIGHN-LKYDFKIIQNN---FGLEL-------PQKYADTMIL---AWLKDPSS-----------RVGLDD 426 (887) T ss_pred HHHHHCC---CHHHH-HHHHHHHHHHH---CCCCC-------CCCHHHHHHH---HHHCCCCC-----------CCCHHH T ss_conf 8763374---03647-79999999985---69877-------7731489999---98669866-----------666678 Q ss_pred HHHHH-CC------------------CCCC-CCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99871-88------------------7643-101333189999999999864226 Q gi|254780661|r 163 LALAN-GI------------------EHVN-AHDAKADVYATLALVWLIREKKPK 197 (471) Q Consensus 163 la~~n-~i------------------~~~~-aH~A~~D~~~t~~l~k~i~~~~p~ 197 (471) ||... +. +.+. ++-|..|+.+|..|...+..+..+ T Consensus 427 La~~yL~~~~i~~e~l~~kg~~f~~v~i~~a~~YAaedA~it~~L~~~l~~~L~~ 481 (887) T PRK08434 427 LAKRLFNYEMIHFESVVKKGENFSSVDLEKACKYAAEDAWITLRFYKKFLENLEK 481 (887) T ss_pred HHHHHCCCCCCCHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999708787748998546777020788999999999999999999999999986 No 102 >KOG1798 consensus Probab=96.93 E-value=0.0032 Score=39.47 Aligned_cols=176 Identities=20% Similarity=0.190 Sum_probs=95.9 Q ss_pred CEEEEECCCCCC-----CCCCCEEEEEEEEEECCCCEECCEEEEEEECCCC--CCCCCHHHHHHHCCCHHHHHHCCCCHH Q ss_conf 379997527889-----8772604899999983897280400226621788--877857898985799889960799979 Q gi|254780661|r 4 HFVIYDYETFGR-----DVALDRPAQFAGVRVDRQWEKIESTEVFFCKPAD--DYLPDPEAVFITGITPQKALRDGVVEA 76 (471) Q Consensus 4 ~~v~~D~ETtG~-----~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~--~~~p~p~a~~v~git~~~~~~~~~~~~ 76 (471) ..+-||+|||-+ +...|||.=|. .++|..|--|-.-++ ...=-. .+.|-|+- -| +-.+.+.+ .+. T Consensus 247 ~VlAFDIETtKlPLKFPDae~DqIMMIS-YMiDGqGfLItNREi-Vs~DIedfEYTPKpE~---eG--~F~v~Ne~-dEv 318 (2173) T KOG1798 247 RVLAFDIETTKLPLKFPDAESDQIMMIS-YMIDGQGFLITNREI-VSEDIEDFEYTPKPEY---EG--PFCVFNEP-DEV 318 (2173) T ss_pred EEEEEEEECCCCCCCCCCCCCCEEEEEE-EEECCCEEEEECHHH-HCCCHHHCCCCCCCCC---CC--CEEEECCC-CHH T ss_conf 5899851035578779986656078999-996474689962234-3052122236886325---25--45886687-688 Q ss_pred HHHHHHHHHHHCC-CCEEEEEECHHHHHHHHHHHHHHHCCCCCC-CCC---------CCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9999999995117-965999505578999999999850785321-325---------68871001588886555653210 Q gi|254780661|r 77 EFSRRIHQFFSVP-NTCIIGYNNIRFDDYYSRNIFYRNFYDSYR-WSW---------DNGNSRWDLLDVMRAIYAFSPDG 145 (471) Q Consensus 77 e~~~~i~~~~~~~-~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~-~~~---------~~~~~~~D~l~~~~~~~~~~p~~ 145 (471) .+++.+.+.+.+- +.++++||+.-||.+|+..-.-.+|+.--. +.. ..-+.++|.+.+++.=..+ T Consensus 319 ~Ll~RfFeHiq~~kP~iivTyNGDFFDWPFve~Ra~~hGi~m~eEiGF~~D~~gEyks~~c~HmDcfrWVKRDSYL---- 394 (2173) T KOG1798 319 GLLQRFFEHIQEVKPTIIVTYNGDFFDWPFVEARAKIHGISMNEEIGFRRDSQGEYKSPFCIHMDCFRWVKRDSYL---- 394 (2173) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHCCCCCCHHCCCEECCCCCCCCCCEEEHHHHHHHHHCCCC---- T ss_conf 9999999999861896799736864466666778875077711210852065333125632112666665204657---- Q ss_pred CCCCCCCCCCCCCCHHHHHHH-HCCCCC-----------------CCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 024444454433108989987-188764-----------------31013331899999999998642268999 Q gi|254780661|r 146 IQWPSRDDGATSFKLQDLALA-NGIEHV-----------------NAHDAKADVYATLALVWLIREKKPKLFEY 201 (471) Q Consensus 146 ~~~~~~~~~~~~~kL~~la~~-n~i~~~-----------------~aH~A~~D~~~t~~l~k~i~~~~p~~~~~ 201 (471) ...|..|-.++++ .|.+.- =|-=..+|+.||--|.- +=-+|-+|.- T Consensus 395 --------PqGSqgLKAVTkaKLGYdPvEvdPEdM~~~A~EkPQ~lasYSVSDAVATYyLYM--kYVhPFIFsL 458 (2173) T KOG1798 395 --------PQGSQGLKAVTKAKLGYDPVEVDPEDMVRMAMEKPQTLASYSVSDAVATYYLYM--KYVHPFIFSL 458 (2173) T ss_pred --------CCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHCCHHHHHCCHHHHHHHHHHHH--HHHHHHHHHH T ss_conf --------776631268888750789443798885466650825653000577788888989--8750678666 No 103 >COG0417 PolB DNA polymerase elongation subunit (family B) [DNA replication, recombination, and repair] Probab=96.67 E-value=0.046 Score=31.98 Aligned_cols=155 Identities=19% Similarity=0.104 Sum_probs=89.7 Q ss_pred CEEEEECCCCCCC-----CCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHH Q ss_conf 3799975278898-----77260489999998389728040022662178887785789898579988996079997999 Q gi|254780661|r 4 HFVIYDYETFGRD-----VALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEF 78 (471) Q Consensus 4 ~~v~~D~ETtG~~-----~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~ 78 (471) ..+-||+||.+.. +..|.++.++.... ..+..+..+ .+.+.++ .. +.. -.++.++ T Consensus 155 ~~la~DiE~~~~~~~~~~~~~d~~~~i~~~~~-~~~~~~~~~---~~~~~~~------~~---------v~~-~~~e~e~ 214 (792) T COG0417 155 RVLAFDIETLSEPGKFPDGEKDPIIMISYAIE-AEGGLIEVF---IYTSGEG------FS---------VEV-VISEAEL 214 (792) T ss_pred CEEEEEEEECCCCCCCCCCCCCCEEEEEEEEC-CCCCCCCCC---CCCCCCC------CC---------EEE-ECCHHHH T ss_conf 54799503105777889877786269999704-677764443---2235677------63---------487-2587999 Q ss_pred HHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCCCC-----------C--CCCCCCCCCCCHHHHHHH-HHHHHH Q ss_conf 9999999511796-599950557899999999985078532-----------1--325688710015888865-556532 Q gi|254780661|r 79 SRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYDSY-----------R--WSWDNGNSRWDLLDVMRA-IYAFSP 143 (471) Q Consensus 79 ~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~~y-----------~--~~~~~~~~~~D~l~~~~~-~~~~~p 143 (471) ...+..++..-++ +++|||+.+||.+++..-.-+.+.+.- . .....|...+|+++.++. . T Consensus 215 l~~~~~~i~~~dPdVIvgyn~~~fd~pyl~~Ra~~lgi~~~~gr~~~~~~~~~~~~~~~~Gr~~iDl~~~~~~~~----- 289 (792) T COG0417 215 LERFVELIREYDPDVIVGYNGDNFDWPYLAERAERLGIPLRLGRDGSELRVRKSGFSSQVGRLHIDLYPALRRRP----- 289 (792) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCEEEEEHHHHHHHCC----- T ss_conf 999999985029899998367777738999999981998511356653200212544557716741066542032----- Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHCCCCCC----------------------CCHHHHHHHHHHHHHHHHH Q ss_conf 100244444544331089899871887643----------------------1013331899999999998 Q gi|254780661|r 144 DGIQWPSRDDGATSFKLQDLALANGIEHVN----------------------AHDAKADVYATLALVWLIR 192 (471) Q Consensus 144 ~~~~~~~~~~~~~~~kL~~la~~n~i~~~~----------------------aH~A~~D~~~t~~l~k~i~ 192 (471) -...+|+|+.++......... +..-+.|+.++..++.... T Consensus 290 ---------~~~~~ysl~~v~~~~l~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~ 351 (792) T COG0417 290 ---------LNLKSYSLEAVSEALLGEGKREDIPYDSMEEIWPDWADSKLRLLLYNLSDADLVLRILLKNE 351 (792) T ss_pred ---------CCCCCEEHHHHHHHHCCCCCHHCCCCCHHHHCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHH T ss_conf ---------35563243467766413352200562013330423551076888834107889999986666 No 104 >cd06147 Rrp6p_like_exo Yeast Rrp6p and its human homolog, the Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100), are exosome-associated proteins involved in the degradation and processing of precursors to stable RNAs. Both proteins contain a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PM/Scl-100, an autoantigen present in the nucleolar compartment of the cell, reacts with autoantibodies produced by about 50% of patients with polymyositis-scleroderma overlap syndrome. Probab=96.11 E-value=0.07 Score=30.82 Aligned_cols=131 Identities=14% Similarity=0.038 Sum_probs=76.2 Q ss_pred CCCEEEEECCCCCCCCCCCE--EEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHH Q ss_conf 88379997527889877260--4899999983897280400226621788877857898985799889960799979999 Q gi|254780661|r 2 TNHFVIYDYETFGRDVALDR--PAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFS 79 (471) Q Consensus 2 ~~~~v~~D~ETtG~~~~~~~--iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~ 79 (471) ....|-+|+||+|.++.+++ .+|+|.- +.--++ +|-++ . +.+ T Consensus 23 ~~~~ialDtE~~~~~~~~~~l~LiQiat~---~~~yli----------------D~l~l-----~------------~~~ 66 (192) T cd06147 23 NCKEIAVDLEHHSYRSYLGFTCLMQISTR---EEDYIV----------------DTLKL-----R------------DDM 66 (192) T ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEECC---CCCEEE----------------EECCC-----C------------CCH T ss_conf 69969997973788569982799999509---988798----------------74343-----0------------037 Q ss_pred HHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 99999951179659995055789999999998507853213256887100158888655565321002444445443310 Q gi|254780661|r 80 RRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFK 159 (471) Q Consensus 80 ~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~k 159 (471) ..+...|++++.+-|+|+ ..+|..+|.+.| +..| .+.+|+.-.++.. |.+.++ T Consensus 67 ~~L~~ll~~~~I~KV~H~-~~~Dl~~L~~~~---gi~~--------~n~fDT~iaa~~l---------------~~~~~s 119 (192) T cd06147 67 HILNEVFTDPNILKVFHG-ADSDIIWLQRDF---GLYV--------VNLFDTGQAARVL---------------NLPRHS 119 (192) T ss_pred HHHHHHHCCCCEEEEEEC-HHHHHHHHHHHH---CCCC--------CCCHHHHHHHHHH---------------CCCCHH T ss_conf 999999759883799855-787799999985---8987--------6604289999984---------------888111 Q ss_pred HHHHHH-HHCCCCCC----------------CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 898998-71887643----------------1013331899999999998642 Q gi|254780661|r 160 LQDLAL-ANGIEHVN----------------AHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 160 L~~la~-~n~i~~~~----------------aH~A~~D~~~t~~l~k~i~~~~ 195 (471) |..+++ ..|+..+. -|=|..||.....+...+..+. T Consensus 120 L~~Lv~~~lg~~l~K~~q~SdW~~RPLs~~Ql~YAA~Dv~~Ll~ly~~L~~~L 172 (192) T cd06147 120 LAYLLQKYCNVDADKKYQLADWRIRPLPEEMIKYAREDTHYLLYIYDRLRNEL 172 (192) T ss_pred HHHHHHHHHCCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999395887323114379998999999999999999999999999999 No 105 >COG0749 PolA DNA polymerase I - 3'-5' exonuclease and polymerase domains [DNA replication, recombination, and repair] Probab=95.79 E-value=0.14 Score=28.82 Aligned_cols=133 Identities=19% Similarity=0.182 Sum_probs=75.7 Q ss_pred EEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHH Q ss_conf 99975278898772604899999983897280400226621788877857898985799889960799979999999999 Q gi|254780661|r 6 VIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQF 85 (471) Q Consensus 6 v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~~ 85 (471) +.+|+||+|+++...+++-++...-+ ...++-...+ ++ ..+....+..+ T Consensus 25 ~a~~~et~~l~~~~~~lvg~s~~~~~---------~~~yi~~~~~--------------~~--------~~~~~~~l~~~ 73 (593) T COG0749 25 IAFDTETDGLDPHGADLVGLSVASEE---------EAAYIPLLHG--------------PE--------QLNVLAALKPL 73 (593) T ss_pred CCEECCCCCCCCCCCCEEEEEEECCC---------CCEEEEECCC--------------HH--------HHHHHHHHHHH T ss_conf 10100014567555874678862365---------4236751366--------------55--------55148999998 Q ss_pred HHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH Q ss_conf 51179659995055789999999998507853213256887100158888655565321002444445443310898998 Q gi|254780661|r 86 FSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLAL 165 (471) Q Consensus 86 ~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la~ 165 (471) +.+.+.-.+||| ..||..++.. +++. .+.++|++-.. |.+.|+ ..++.++++++ T Consensus 74 l~~~~~~kv~~~-~K~d~~~l~~----~Gi~--------~~~~~Dtmlas---Yll~~~----------~~~~~~~~l~~ 127 (593) T COG0749 74 LEDEGIKKVGQN-LKYDYKVLAN----LGIE--------PGVAFDTMLAS---YLLNPG----------AGAHNLDDLAK 127 (593) T ss_pred HHCCCCCHHCCC-CCHHHHHHHH----CCCC--------CCCHHHHHHHH---HCCCCC----------CCCCCHHHHHH T ss_conf 637541032134-1366999997----3976--------56517899999---614867----------67577789999 Q ss_pred HHCCCCCC------------------------CCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 71887643------------------------10133318999999999986422 Q gi|254780661|r 166 ANGIEHVN------------------------AHDAKADVYATLALVWLIREKKP 196 (471) Q Consensus 166 ~n~i~~~~------------------------aH~A~~D~~~t~~l~k~i~~~~p 196 (471) .+. .+.. .--|..|+.+|..|...+..+.+ T Consensus 128 r~l-~~~~~~~~~i~~kg~~~~~~~~~~~~~~~~y~a~~a~~~~~L~~~l~~~l~ 181 (593) T COG0749 128 RYL-GLETITFEDIAGKGKKQLTFADVKLEKATEYAAEDADATLRLESILEPELL 181 (593) T ss_pred HHC-CCCCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 825-877402487632544567610014778888788888999999999998763 No 106 >PRK05761 DNA polymerase I; Reviewed Probab=95.23 E-value=0.22 Score=27.59 Aligned_cols=114 Identities=18% Similarity=0.108 Sum_probs=65.7 Q ss_pred CCHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCC------CCCCCHHHHHHHHHHHHHHHC Q ss_conf 99799999999995117965999505578999999999850785321325688------710015888865556532100 Q gi|254780661|r 73 VVEAEFSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNG------NSRWDLLDVMRAIYAFSPDGI 146 (471) Q Consensus 73 ~~~~e~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~------~~~~D~l~~~~~~~~~~p~~~ 146 (471) .++.+|+..+.+++.+.++.++ ||+.+||.+.|..-.-+.+.+.-......+ ..+.|+...... ...... T Consensus 196 ~sE~elL~~f~~~i~~~DPDIi-yN~d~FD~PYL~~Ra~~lgi~~~~~~~~~~~~~~~~~~~~d~~~~~~~---~~~~~y 271 (777) T PRK05761 196 DSEKELLAELFKIIEDYPPVIT-FNGDNFDLPYLYNRALKLGIPKEELPFNVKYASLKTGIHIDLYKFFQN---RAIRSY 271 (777) T ss_pred CCHHHHHHHHHHHHHHCCCCEE-ECCCCCCHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCEEEECCCCCCH---HHHHHE T ss_conf 9999999999999974698299-807876489999999983899000665156654024446504453121---135530 Q ss_pred CCCCCCCCCCCCCHHHHHHHH-CCCCCC-------------CCHHHHHHHHHHHHHHHH Q ss_conf 244444544331089899871-887643-------------101333189999999999 Q gi|254780661|r 147 QWPSRDDGATSFKLQDLALAN-GIEHVN-------------AHDAKADVYATLALVWLI 191 (471) Q Consensus 147 ~~~~~~~~~~~~kL~~la~~n-~i~~~~-------------aH~A~~D~~~t~~l~k~i 191 (471) .+ ...-.-++++|+.+|.+. |..-.+ ++=.+-||..|.+|+... T Consensus 272 ~~-~~~~~~~~y~Ld~Va~~lLg~~K~~~~~~~~~~~~~~l~~Y~l~Da~l~~~L~~~~ 329 (777) T PRK05761 272 AF-YGKYRHREAALDAVARALLGESKVELEKNISELNYWELAEYNYRDAEITLKLTFFN 329 (777) T ss_pred EE-CCEECCCCEEHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 10-54742474554788999759974103124663347889999899999999987530 No 107 >PRK07898 consensus Probab=95.01 E-value=0.26 Score=27.17 Aligned_cols=137 Identities=14% Similarity=0.044 Sum_probs=77.5 Q ss_pred CCEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHH Q ss_conf 83799975278898772604899999983897280400226621788877857898985799889960799979999999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRI 82 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i 82 (471) ...+.+|+|||++. ....++-++....+.. ..++ |. ..+++. ....+ T Consensus 337 ~~~~a~~~~~t~~~-~~a~lvgisl~~~~g~--------a~yi-p~------------~~~~~~-----------~~~~L 383 (902) T PRK07898 337 GRRLGLAVVGTHLP-GDGDATALAIAAADGH--------AAYI-DT------------ADLTPD-----------DEAAL 383 (902) T ss_pred CCEEEEEEECCCCC-CCCCEEEEEEEEECCE--------EEEE-CC------------CCCCHH-----------HHHHH T ss_conf 98699998536786-6573899999953880--------8997-22------------017978-----------99999 Q ss_pred HHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 99951179659995055789999999998507853213256887100158888655565321002444445443310898 Q gi|254780661|r 83 HQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQD 162 (471) Q Consensus 83 ~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~ 162 (471) ..++.+.....+||| +.||..+|+ ++++.. .+-.+|++-.+ |.+.|+ ..+|.|+. T Consensus 384 ~~~Led~~i~KIghN-lK~d~~vL~----~~Gi~l-------~g~~fDTmLAa---YLLdp~----------~~~~~Ld~ 438 (902) T PRK07898 384 AAWLADPARPKALHE-AKGAMHALA----GRGWTL-------AGVTSDTALAA---YLVRPG----------QRSFTLDD 438 (902) T ss_pred HHHHHCCCCCEEEEC-HHHHHHHHH----HCCCCC-------CCCCHHHHHHH---HHHCCC----------CCCCCHHH T ss_conf 998608776545540-489999999----669865-------77532289999---875688----------78787999 Q ss_pred HHHHH-CCCC-----------------C-CCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99871-8876-----------------4-3101333189999999999864226 Q gi|254780661|r 163 LALAN-GIEH-----------------V-NAHDAKADVYATLALVWLIREKKPK 197 (471) Q Consensus 163 la~~n-~i~~-----------------~-~aH~A~~D~~~t~~l~k~i~~~~p~ 197 (471) ||... |... + -++-|..|+.+|..|...+..+..+ T Consensus 439 La~~yL~~~~~~~~~~~~~~~~~~~~~~~~~~yaa~dA~~~~~L~~~l~~~L~~ 492 (902) T PRK07898 439 LSLRYLRRELRAEAPAQQQLSLDDAVDAAAAQTLILRARAVLDLADALDAELAR 492 (902) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999974976651565354347132676899999999999999999999998616 No 108 >KOG4793 consensus Probab=94.48 E-value=0.027 Score=33.46 Aligned_cols=166 Identities=13% Similarity=0.005 Sum_probs=90.6 Q ss_pred ECCCCC---CCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCC-CCHHHHHHHHHH Q ss_conf 752788---9877260489999998389728040022662178887785789898579988996079-997999999999 Q gi|254780661|r 9 DYETFG---RDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDG-VVEAEFSRRIHQ 84 (471) Q Consensus 9 D~ETtG---~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~-~~~~e~~~~i~~ 84 (471) |++|+| ..-.++.++.+.+.-. ..++ +..+..+. .++.+|.-.|+. + +.++..... ...|+...-++. T Consensus 116 ~lp~p~CLVaHng~~~dfpil~qel-a~lg-~~lpq~lv---cvdslpa~~ald-~--a~s~~tr~~~~~~~~l~~If~r 187 (318) T KOG4793 116 RLPTPGCLVAHNGNEYDFPILAQEL-AGLG-YSLPQDLV---CVDSLPALNALD-R--ANSMVTRPEVRRMYSLGSIFLR 187 (318) T ss_pred CCCCCCEEEEECCCCCCCHHHHHHH-HHCC-CCCHHHHC---CCCHHHHHHHHH-H--HCCCCCCCCCCCCCCCCHHHHH T ss_conf 3899716885248765428899999-7247-56604420---744158888776-5--0475447777765300237876 Q ss_pred HHHC--C-CC--EEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9511--7-96--59995055789999999998507853213256887100158888655565321002444445443310 Q gi|254780661|r 85 FFSV--P-NT--CIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFK 159 (471) Q Consensus 85 ~~~~--~-~~--~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~k 159 (471) ++.+ + +. +-+..|+..|+.-|.-+.|-|..-. +..+=...+-|...|.. ..+.. ...+-+.. T Consensus 188 y~~q~eppa~~~~e~d~~~l~~~fqf~~~ellR~~de-----qa~pw~~ir~l~~~~~~---a~~~~-----P~p~~vs~ 254 (318) T KOG4793 188 YVEQREPPAGHVAEGDVNGLLFIFQFRINELLRWSDE-----QARPWLLIRPLYLAREN---AKSVE-----PTPKLVSS 254 (318) T ss_pred HHCCCCCCCCEEEECCCCHHHHHHHHHHHHHHHHHHH-----CCCCCCCCCCHHHHHHH---CCCCC-----CCCCCCHH T ss_conf 4113599864354424311577899889999755763-----18874300220001100---00267-----88864215 Q ss_pred HHHHHHHHCC-CCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 8989987188-76431013331899999999998642 Q gi|254780661|r 160 LQDLALANGI-EHVNAHDAKADVYATLALVWLIREKK 195 (471) Q Consensus 160 L~~la~~n~i-~~~~aH~A~~D~~~t~~l~k~i~~~~ 195 (471) |+.||..... ...+||+|.+||..+-.++.+..... T Consensus 255 le~Lat~~~~~p~l~ahra~~Dv~~~~k~~q~~~idl 291 (318) T KOG4793 255 LEALATYYSLTPELDAHRALSDVLLLSKVFQKLTIDL 291 (318) T ss_pred HHHHHHHHHCCCCCCHHHHCCCCCHHHHHHHHHHHHH T ss_conf 8998888624846430221033300336888832456 No 109 >COG5228 POP2 mRNA deadenylase subunit [RNA processing and modification] Probab=94.37 E-value=0.18 Score=28.24 Aligned_cols=175 Identities=19% Similarity=0.121 Sum_probs=104.6 Q ss_pred CEEEEECCCCCC---------------------CCCCCEEEEEEEEEECCCCEECC-----EEEEEEECCCCCCCCCHH- Q ss_conf 379997527889---------------------87726048999999838972804-----002266217888778578- Q gi|254780661|r 4 HFVIYDYETFGR---------------------DVALDRPAQFAGVRVDRQWEKIE-----STEVFFCKPADDYLPDPE- 56 (471) Q Consensus 4 ~~v~~D~ETtG~---------------------~~~~~~iiq~~~i~~d~~~~~~~-----~~~~~~~~~~~~~~p~p~- 56 (471) |+|=.|+|--|. |..+=.|||+|.-+.|++||-.- +|+ +--.|..+. -+++ T Consensus 43 n~vSmdTEFpGvvArPiG~FkSs~dyhYQtlraNVD~LkiIQlGlsLSDe~GN~P~~~sTWQFN-F~F~l~~dm-ya~ES 120 (299) T COG5228 43 NHVSMDTEFPGVVARPIGTFKSSVDYHYQTLRANVDFLKIIQLGLSLSDENGNKPNGPSTWQFN-FEFDLKKDM-YATES 120 (299) T ss_pred CCEEECCCCCCEEECCCCCCCCCCHHHHHHHHCCCCHHHHHHEEEEECCCCCCCCCCCCEEEEE-EEECCHHHH-CCHHH T ss_conf 8004315567403024333435422788988605103202000004103029999997326888-875501200-35688 Q ss_pred --HHHHHCCCHHHHHHCCCCHHHHHHHHHH--HHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHH Q ss_conf --9898579988996079997999999999--951179659995055789999999998507853213256887100158 Q gi|254780661|r 57 --AVFITGITPQKALRDGVVEAEFSRRIHQ--FFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLL 132 (471) Q Consensus 57 --a~~v~git~~~~~~~~~~~~e~~~~i~~--~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l 132 (471) -+.-.||.=....+-|.--+|+.+-+-. +.-+....+|.||+ .+|..+|-..+-++-+|. ..-|.. T Consensus 121 ieLL~ksgIdFkkHe~~GI~v~eF~elLm~SGLvm~e~VtWitfHs-aYDfgyLikilt~~plP~---------~~EdFy 190 (299) T COG5228 121 IELLRKSGIDFKKHENLGIDVFEFSELLMDSGLVMDESVTWITFHS-AYDFGYLIKILTNDPLPN---------NKEDFY 190 (299) T ss_pred HHHHHHCCCCHHHHHHCCCCHHHHHHHHHCCCCEECCCEEEEEEEC-CHHHHHHHHHHHCCCCCC---------CHHHHH T ss_conf 9999980887556766397899999998626723214157999521-000899999983698875---------389999 Q ss_pred HHHHHHHHHHHHHCCCCCCCC--CCCCCCHHHHHHHHCCCCC-CCCHHHHHHHHHHHHHHHHHH Q ss_conf 888655565321002444445--4433108989987188764-310133318999999999986 Q gi|254780661|r 133 DVMRAIYAFSPDGIQWPSRDD--GATSFKLQDLALANGIEHV-NAHDAKADVYATLALVWLIRE 193 (471) Q Consensus 133 ~~~~~~~~~~p~~~~~~~~~~--~~~~~kL~~la~~n~i~~~-~aH~A~~D~~~t~~l~k~i~~ 193 (471) -++ ..+.|+-...-...+ -+.+-.|..++.-.++... ..|.|-+|+..|+..+-+.+. T Consensus 191 ~~l---~~yfP~fYDik~v~ks~~~~~KglQei~ndlql~r~g~QhQagsdaLlTa~~ff~~R~ 251 (299) T COG5228 191 WWL---HQYFPNFYDIKLVYKSVLNNSKGLQEIKNDLQLQRSGQQHQAGSDALLTADEFFLPRF 251 (299) T ss_pred HHH---HHHCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHCCCHHHHCCCHHHHHHHHHCCHHH T ss_conf 999---9977653125889876541556798853727651055655414304443677545132 No 110 >cd06129 RNaseD_like The RNase D-like group is composed of RNase D, WRN, and similar proteins. They contain a DEDDy-type, DnaQ-like, 3'-5' exonuclease domain that contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. RNase D is involved in the 3'-end processing of tRNA precursors. RNase D-like proteins in eukaryotes include yeast Rrp6p, human PM/Scl-100 and Drosophila melanogaster egalitarian (Egl) protein. WRN is a unique DNA helicase possessing exonuclease activity. Mutation in the WRN gene is implicated in Werner syndrome, a disease associated with premature aging and increased predisposition to cancer. Yeast Rrp6p and the human Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100) are exosome-associated proteins involved in the degradation and processing of prec Probab=92.11 E-value=0.83 Score=23.92 Aligned_cols=109 Identities=13% Similarity=0.021 Sum_probs=57.9 Q ss_pred CCEEEEECCCCCCCCCCCE--EEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHH Q ss_conf 8379997527889877260--48999999838972804002266217888778578989857998899607999799999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDR--PAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSR 80 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~--iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~ 80 (471) ...|-+|+||+|.....++ ++|+| +++ +. +...+. ++ +.. .. T Consensus 15 ~~vig~DtE~~~~~~~~~~v~LiQla---t~~------~~--~lid~~--------~~---~~~--------------~~ 58 (159) T cd06129 15 SIVVALDCEWIPLRLYYGEVTLIQIG---TTE------HA--FLFDVL--------SL---SAF--------------DG 58 (159) T ss_pred CEEEEEECCCCCCCCCCCCEEEEEEE---ECC------EE--EEEECC--------CC---CCC--------------HH T ss_conf 80899979768987998978999994---599------68--999548--------88---767--------------79 Q ss_pred HHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH Q ss_conf 99999511796599950557899999999985078532132568871001588886555653210024444454433108 Q gi|254780661|r 81 RIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKL 160 (471) Q Consensus 81 ~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL 160 (471) .+..++.++..+.|||+ +.+|...+++.|-- .-++.+|+...++.. . .+....+| T Consensus 59 ~L~~lL~~~~i~Kvgh~-~~~D~~~L~~~~g~-----------~~~~~~Dt~~~~~~l--------~-----~~~~~~sL 113 (159) T cd06129 59 GLKTVLENPSITKVGHG-CRGDLAKLLRDFGI-----------LLRNLFDTQAAHAIL--------G-----YPEEYWSL 113 (159) T ss_pred HHHHHHHCCCCEEEEEC-HHHHHHHHHHHCCC-----------CCCCCHHHHHHHHHH--------C-----CCCCCHHH T ss_conf 99999829992499956-69999999675084-----------324303699999995--------8-----88431659 Q ss_pred HHHHHH-HCCCCC Q ss_conf 989987-188764 Q gi|254780661|r 161 QDLALA-NGIEHV 172 (471) Q Consensus 161 ~~la~~-n~i~~~ 172 (471) ..+++. .|+..+ T Consensus 114 ~~l~~~~lg~~ld 126 (159) T cd06129 114 NSLVEQYLGPCLD 126 (159) T ss_pred HHHHHHHCCCCCC T ss_conf 9999998087877 No 111 >COG0349 Rnd Ribonuclease D [Translation, ribosomal structure and biogenesis] Probab=91.97 E-value=0.87 Score=23.81 Aligned_cols=131 Identities=22% Similarity=0.205 Sum_probs=74.2 Q ss_pred CCEEEEECCCCCCCCCCCE--EEEEEEEEECCCCEECCEEEEEEECCCCC-CCCCHHHHHHHCCCHHHHHHCCCCHHHHH Q ss_conf 8379997527889877260--48999999838972804002266217888-77857898985799889960799979999 Q gi|254780661|r 3 NHFVIYDYETFGRDVALDR--PAQFAGVRVDRQWEKIESTEVFFCKPADD-YLPDPEAVFITGITPQKALRDGVVEAEFS 79 (471) Q Consensus 3 ~~~v~~D~ETtG~~~~~~~--iiq~~~i~~d~~~~~~~~~~~~~~~~~~~-~~p~p~a~~v~git~~~~~~~~~~~~e~~ 79 (471) ..+|-.|+||.|..+.+++ .||++ |.++. ..+.|..+ .-.+| T Consensus 17 ~~~iAIDTEf~r~~t~~p~LcLIQi~----~~e~~-------~lIdp~~~~~d~~~------------------------ 61 (361) T COG0349 17 SKAIAIDTEFMRLRTYYPRLCLIQIS----DGEGA-------SLIDPLAGILDLPP------------------------ 61 (361) T ss_pred CCEEEEECCCCCCCCCCCCEEEEEEE----CCCCC-------EEECCCCCCCCCCH------------------------ T ss_conf 98079832224664368833799984----48886-------68622655466636------------------------ Q ss_pred HHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 99999951179659995055789999999998507853213256887100158888655565321002444445443310 Q gi|254780661|r 80 RRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFK 159 (471) Q Consensus 80 ~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~k 159 (471) +...+.+++ ++=.+|+.+||.+++-+.|- .+|. ..+|+--.++.+ ..|. +++ T Consensus 62 --l~~Ll~d~~-v~KIfHaa~~DL~~l~~~~g--~~p~---------plfdTqiAa~l~-------------g~~~-~~g 113 (361) T COG0349 62 --LVALLADPN-VVKIFHAARFDLEVLLNLFG--LLPT---------PLFDTQIAAKLA-------------GFGT-SHG 113 (361) T ss_pred --HHHHHCCCC-EEEEECCCCCCHHHHHHHCC--CCCC---------CHHHHHHHHHHC-------------CCCC-CCC T ss_conf --999862876-02550256351999998618--8877---------346899999980-------------9863-232 Q ss_pred HHHHHHH-HCCCCCCCCH----------------HHHHHHHHHHHHHHHHHHHH Q ss_conf 8989987-1887643101----------------33318999999999986422 Q gi|254780661|r 160 LQDLALA-NGIEHVNAHD----------------AKADVYATLALVWLIREKKP 196 (471) Q Consensus 160 L~~la~~-n~i~~~~aH~----------------A~~D~~~t~~l~k~i~~~~p 196 (471) |..|++. .|+..+..|. |.+||.-...|...+.+..- T Consensus 114 l~~Lv~~ll~v~ldK~~q~SDW~~RPLs~~Ql~YAa~DV~yL~~l~~~L~~~L~ 167 (361) T COG0349 114 LADLVEELLGVELDKSEQRSDWLARPLSEAQLEYAAADVEYLLPLYDKLTEELA 167 (361) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999996885364445465555799999999999999999999999999998 No 112 >pfam11074 DUF2779 Domain of unknown function(DUF2779). This domain is conserved in bacteria. The function is not known. Probab=91.76 E-value=0.58 Score=24.93 Aligned_cols=112 Identities=21% Similarity=0.258 Sum_probs=61.2 Q ss_pred EEEECCCC--------CCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHH Q ss_conf 99975278--------8987726048999999838972804002266217888778578989857998899607999799 Q gi|254780661|r 6 VIYDYETF--------GRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAE 77 (471) Q Consensus 6 v~~D~ETt--------G~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e 77 (471) .|+|+||. |..|-.--+-|+.....+.+|.+++-++ +.. .++ .-|-.+ T Consensus 1 yflDFEt~~~aiP~~~g~~PyqqipFQ~SlHi~~~~g~~~~h~e-fL~--~~~---------------------~DPr~~ 56 (127) T pfam11074 1 YFLDFETISSAIPVFDGTRPYQQIPFQFSLHIQKEDGKKLEHVE-FLA--EDG---------------------IDPRRA 56 (127) T ss_pred CCEEECCCCCCCCCCCCCCCCEECCEEEEEEEECCCCCCEEEEE-EEC--CCC---------------------CCHHHH T ss_conf 91731246656668789976610337888988868997202301-203--677---------------------684799 Q ss_pred HHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCC-CCCCCCCCHHHHHHHHHHHHHH Q ss_conf 999999995117965999505578999999999850785321325-6887100158888655565321 Q gi|254780661|r 78 FSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSW-DNGNSRWDLLDVMRAIYAFSPD 144 (471) Q Consensus 78 ~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~-~~~~~~~D~l~~~~~~~~~~p~ 144 (471) ++..+.+.+......+++||. +|-...|+..... +|.|.-.- ...++-.|++++++....+.|. T Consensus 57 ~~~~L~~~i~~~~Gsi~~yn~-sfEk~~l~ela~~--~p~~~~~l~~I~~r~vDL~~~f~~~~yy~p~ 121 (127) T pfam11074 57 FIEALKKAIGKDYGSIVVYNK-SFEKTRLKELAQL--FPELADKIKAIIERTVDLLDLFKKYKYYHPD 121 (127) T ss_pred HHHHHHHHHCCCCCEEEEEHH-HHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHCCEECCC T ss_conf 999999984767870999624-8899999999998--6999999999999868789999508786887 No 113 >cd06141 WRN_exo WRN is a unique RecQ DNA helicase exhibiting an exonuclease activity. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Mutations in the WRN gene cause Werner syndrome, an autosomal recessive disorder associated with premature aging and increased susceptibility to cancer and type II diabetes. WRN interacts with key proteins involved in DNA replication, recombination, and repair. It is believed to maintain genomic stability and life span by participating in DNA processes. WRN is stimulated by Ku70/80, an important regulator of genomic stability. Probab=91.08 E-value=1.1 Score=23.23 Aligned_cols=86 Identities=22% Similarity=0.213 Sum_probs=50.8 Q ss_pred HHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 99999951179659995055789999999998507853213256887100158888655565321002444445443310 Q gi|254780661|r 80 RRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFK 159 (471) Q Consensus 80 ~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~k 159 (471) +.+.+++.++..+.|||+. .+|...|++.|--. ..+.+|+-.+++.. .+ .....+ T Consensus 63 ~~L~~lL~~~~i~Kvg~~~-~~D~~~L~~~~g~~-----------~~~~~Dl~~~a~~~---~~----------~~~~~s 117 (168) T cd06141 63 PSLKQLLEDPSILKVGVGI-KGDARKLRRDFGIE-----------VRGVVDLSHLAKRV---GP----------RRKLVS 117 (168) T ss_pred HHHHHHHCCCCEEEEEEEE-HHHHHHHHHHCCCC-----------CCCEEEHHHHHHHH---CC----------CCCCCC T ss_conf 9999997599807999631-88999998744502-----------48765699999985---77----------777632 Q ss_pred HHHHHHH-HCCCC--------CC----------CCHHHHHHHHHHHHHHH Q ss_conf 8989987-18876--------43----------10133318999999999 Q gi|254780661|r 160 LQDLALA-NGIEH--------VN----------AHDAKADVYATLALVWL 190 (471) Q Consensus 160 L~~la~~-n~i~~--------~~----------aH~A~~D~~~t~~l~k~ 190 (471) |..++.. .|... ++ -|=|..||.+..+|.+. T Consensus 118 L~~l~~~~lg~~l~K~k~~q~SdW~~rpLs~~Qi~YAA~Da~~~~~l~~~ 167 (168) T cd06141 118 LARLVEEVLGLPLSKDKKVRCSNWEARPLSKEQILYAATDAYASLELYRK 167 (168) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99999998396215466520388899999999999999999999999973 No 114 >PRK08076 consensus Probab=90.78 E-value=1.1 Score=23.05 Aligned_cols=133 Identities=17% Similarity=0.108 Sum_probs=76.7 Q ss_pred EEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHHH Q ss_conf 79997527889877260489999998389728040022662178887785789898579988996079997999999999 Q gi|254780661|r 5 FVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIHQ 84 (471) Q Consensus 5 ~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~~ 84 (471) -.-+|+||+|.++....++-++.. ..++ .++ +|...++ ....+.. T Consensus 316 ~~a~~~Et~~~~~~~a~lvGis~s--~~~~-------a~y-------iP~~~~~-------------------~~~~lk~ 360 (877) T PRK08076 316 KSALVVEVLEDNYHKADIQGFAIV--NENG-------CYF-------IPTEIAL-------------------KSDAFKE 360 (877) T ss_pred CCEEEEEECCCCCCCCCEEEEEEE--CCCC-------EEE-------ECCCHHH-------------------HHHHHHH T ss_conf 854999964788563778999998--5894-------599-------5500442-------------------1589999 Q ss_pred HHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH Q ss_conf 95117965999505578999999999850785321325688710015888865556532100244444544331089899 Q gi|254780661|r 85 FFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSFKLQDLA 164 (471) Q Consensus 85 ~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~kL~~la 164 (471) ++.+.....+||| +.||..+|+. +++.. .+..+|++-.+ |++.| +..++.|+++| T Consensus 361 ~Led~~i~KighN-lK~dl~vL~~----~GI~l-------~gi~FDTMLAA---YLLdP----------~~~~~~Ld~La 415 (877) T PRK08076 361 WLEDEEKKKWVFD-AKRAIVALKW----NGIDL-------QGIDFDLLLAA---YLLNP----------ADSDDDFASVA 415 (877) T ss_pred HHCCCCCCEEEEC-HHHHHHHHHH----CCCCC-------CCCCCCHHHHH---HHCCC----------CCCCCCHHHHH T ss_conf 8539765402654-7999999996----89644-------77640299999---87188----------76766699999 Q ss_pred HHHCCCCC--------------------CCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87188764--------------------3101333189999999999864226 Q gi|254780661|r 165 LANGIEHV--------------------NAHDAKADVYATLALVWLIREKKPK 197 (471) Q Consensus 165 ~~n~i~~~--------------------~aH~A~~D~~~t~~l~k~i~~~~p~ 197 (471) .......- -+.-|..|+.+|..|...+..+..+ T Consensus 416 ~~yl~~~~~~~e~v~Gkg~k~~~p~~~~~~eYaaedA~~t~~L~~~L~~~L~~ 468 (877) T PRK08076 416 KMKETHAVKSDEAVYGKGAKRAVPEEEILAEHLARKAHAIYDLKQTFVEELEK 468 (877) T ss_pred HHHCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98557777634765376522248778999999999999999999999999865 No 115 >KOG0969 consensus Probab=89.78 E-value=0.02 Score=34.36 Aligned_cols=90 Identities=24% Similarity=0.270 Sum_probs=53.2 Q ss_pred EEEEECCCCCCC-----CCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHH Q ss_conf 799975278898-----772604899999983897280400226621788877857898985799889960799979999 Q gi|254780661|r 5 FVIYDYETFGRD-----VALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFS 79 (471) Q Consensus 5 ~v~~D~ETtG~~-----~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~ 79 (471) ..=||+|+-|.. |..|.|||||-+.+ .-|.- +. ++|.--...+. .+|...++... ..+.|+. T Consensus 276 vlSfDIECagrkg~FPe~~~DPvIQIan~v~-~~Ge~--~p---fvrnvf~l~~c------apI~G~~V~~~-~~E~elL 342 (1066) T KOG0969 276 VLSFDIECAGRKGVFPEAKIDPVIQIANLVT-LQGEN--EP---FVRNVFTLKTC------APIVGSNVHSY-ETEKELL 342 (1066) T ss_pred CCCEEEEECCCCCCCCCCCCCHHHHHHHHHH-HHCCC--CH---HHHHHHCCCCC------CCCCCCEEEEE-CCHHHHH T ss_conf 1541378525778888601174999988888-71688--46---78762046676------88777525771-3388999 Q ss_pred HHHHHHHHCCCC-EEEEEECHHHHHHHHH Q ss_conf 999999511796-5999505578999999 Q gi|254780661|r 80 RRIHQFFSVPNT-CIIGYNNIRFDDYYSR 107 (471) Q Consensus 80 ~~i~~~~~~~~~-~~ig~N~~~FD~~~lr 107 (471) .....++.+-.+ +++|||.-+||.+.+= T Consensus 343 ~~W~~firevDPDvI~GYNi~nFDiPYll 371 (1066) T KOG0969 343 ESWRKFIREVDPDVIIGYNICNFDIPYLL 371 (1066) T ss_pred HHHHHHHHHCCCCEEECCCCCCCCCCEEC T ss_conf 99999998629886713354455632130 No 116 >TIGR03491 RecB family nuclease, putative, TM0106 family. Members of this uncharacterized protein family are found broadly but sporadically among bacteria. The N-terminal region is homologous to the Cas4 protein of CRISPR systems, although this protein family shows no signs of association with CRISPR repeats. Probab=87.13 E-value=2 Score=21.41 Aligned_cols=17 Identities=18% Similarity=0.215 Sum_probs=11.5 Q ss_pred CCCCCEEEEEECCCCCCH Q ss_conf 887744787103568720 Q gi|254780661|r 247 PRYKDSVIGCNLSGDMRV 264 (471) Q Consensus 247 p~~~n~~i~fDL~~Dp~~ 264 (471) |..+. -++||...+|+. T Consensus 281 p~~~g-dlffDiEg~Pd~ 297 (457) T TIGR03491 281 PRAPG-ELIFDIESDPDE 297 (457) T ss_pred CCCCC-CEEEEECCCCCC T ss_conf 89986-379850568998 No 117 >PRK05929 consensus Probab=80.54 E-value=3.9 Score=19.64 Aligned_cols=93 Identities=17% Similarity=0.130 Sum_probs=54.5 Q ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHHHHH Q ss_conf 37999752788987726048999999838972804002266217888778578989857998899607999799999999 Q gi|254780661|r 4 HFVIYDYETFGRDVALDRPAQFAGVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSRRIH 83 (471) Q Consensus 4 ~~v~~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~~i~ 83 (471) .-+-||+||+|.++...+++-++.. .+. + .+++-+.. ..-++...+. T Consensus 311 ~~~a~~~~~~~~~~~~~~lvGisls-~~~------~--a~yip~~~------------------------~~~~vl~~Lk 357 (870) T PRK05929 311 GEVAFCVAYTGNHLPSLKLHGVALA-GGS------Q--VFYIEVEE------------------------SGDQEIALLK 357 (870) T ss_pred CCEEEEEEECCCCCCCCCEEEEEEE-CCC------E--EEEEECCC------------------------CCHHHHHHHH T ss_conf 9747999745887677728899997-698------0--89997655------------------------5523899999 Q ss_pred HHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 9951179659995055789999999998507853213256887100158888655565321 Q gi|254780661|r 84 QFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPD 144 (471) Q Consensus 84 ~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~ 144 (471) .++.++....+||| +.||..+|+. ++... .+-.+|++-. .|.+.|+ T Consensus 358 ~~led~~i~kig~N-lK~d~~vL~~----~GI~l-------~g~~fDtmLA---sYLLn~~ 403 (870) T PRK05929 358 DFFARKDTTFYGYN-LKRDNHALKN----AGIDV-------HEITADLALA---EHLVNGG 403 (870) T ss_pred HHHCCCCCCEEECC-HHHHHHHHHH----CCCCC-------CCCHHHHHHH---HHHHCCC T ss_conf 98609877453223-7779999996----89856-------8727799999---9862999 No 118 >cd06146 mut-7_like_exo The mut-7 subfamily is composed of Caenorhabditis elegans mut-7 and similar proteins found in plants and metazoans. Mut-7 is implicated in posttranscriptional gene silencing. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs, termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=73.63 E-value=6 Score=18.40 Aligned_cols=104 Identities=15% Similarity=0.150 Sum_probs=58.2 Q ss_pred HHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 99999999511796599950557899999999985078532132568871001588886555653210024444454433 Q gi|254780661|r 78 FSRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATS 157 (471) Q Consensus 78 ~~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~ 157 (471) +..-+.++|+++..+.|||+ +..|...|++.|--. .. ......+.+|+-.+++.+....+ .......+... T Consensus 70 ~~~~L~~ll~~~~i~Kvg~~-~~~D~~~L~~~~~~~--~~---~~~~~~~~~Dl~~~a~~~~~~~~---~~~~~~~~~~~ 140 (191) T cd06146 70 WDRLLKRLFEDPDVLKLGFG-FKQDLKALSASYPAL--KC---MFERVQNVLDLQNLAKELQKSDM---GRLKGNLPSKT 140 (191) T ss_pred HHHHHHHHHCCCCEEEEEEC-HHHHHHHHHHHCCCC--CC---CCCCCCEEEEHHHHHHHHHCCCC---CCHHHCCCCCC T ss_conf 89999999479996899944-386899999865864--32---33364403579999999731122---10110035567 Q ss_pred CCHHHHHHH-HCCCCCC----------------CCHHHHHHHHHHHHHHH Q ss_conf 108989987-1887643----------------10133318999999999 Q gi|254780661|r 158 FKLQDLALA-NGIEHVN----------------AHDAKADVYATLALVWL 190 (471) Q Consensus 158 ~kL~~la~~-n~i~~~~----------------aH~A~~D~~~t~~l~k~ 190 (471) .+|..+++. .|+..+. -|=|..||.+.+.|... T Consensus 141 ~gL~~L~~~~lg~~l~K~~q~SnW~~rPLt~~Qi~YAA~Da~~ll~iy~~ 190 (191) T cd06146 141 KGLADLVQAVLGKPLDKSEQCSNWERRPLREEQILYAALDAYCLLEVFDK 190 (191) T ss_pred CCHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 56999999990987885651389999989999999999989999999973 No 119 >PRK00047 glpK glycerol kinase; Provisional Probab=63.90 E-value=9.6 Score=17.11 Aligned_cols=30 Identities=13% Similarity=0.272 Sum_probs=23.5 Q ss_pred CCCCEEE-EECCCCCCCCCCCEEEEEEEEEECCCCEECCEE Q ss_conf 9883799-975278898772604899999983897280400 Q gi|254780661|r 1 MTNHFVI-YDYETFGRDVALDRPAQFAGVRVDRQWEKIEST 40 (471) Q Consensus 1 m~~~~v~-~D~ETtG~~~~~~~iiq~~~i~~d~~~~~~~~~ 40 (471) ||++|+. +|+=||+.. +++.|.+|+++..- T Consensus 2 Mm~~yvlgiD~GTts~K----------a~l~d~~G~~~a~~ 32 (498) T PRK00047 2 MMKKYILALDQGTTSSR----------AIIFDHKGNIVSVA 32 (498) T ss_pred CCCCEEEEEECCCCEEE----------EEEECCCCCEEEEE T ss_conf 87687999974633035----------66996999999999 No 120 >pfam06748 DUF1217 Protein of unknown function (DUF1217). This family represents a conserved region that is found within bacterial proteins, most of which are hypothetical. Some members contain multiple copies. Probab=50.40 E-value=12 Score=16.58 Aligned_cols=39 Identities=13% Similarity=0.273 Sum_probs=30.5 Q ss_pred HHHHHHHHHCCCHHHHHHHHCCCCCHHHHHHHHHHHHCC Q ss_conf 622168897599889862120489837999999996311 Q gi|254780661|r 369 DRQIMDRILRTAPEQLSTLNLPFSDRRLPELFFRYRARN 407 (471) Q Consensus 369 Dk~~~~~fh~a~~~~k~~i~~~f~D~Rl~~l~~R~i~~n 407 (471) -+..|++....++.+.-.++++..|+||++|+..+=+.. T Consensus 10 ~ka~ikkvL~sd~~d~~S~aN~L~D~Ry~~lA~AF~F~~ 48 (150) T pfam06748 10 AKAFIKKVLESDLSDPDSFANKLSDKRYREFAAAFNFGA 48 (150) T ss_pred CHHHHHHHHHCCCCCHHHHHHHCCCHHHHHHHHHHCCCC T ss_conf 089999998159997278998708899999999848998 No 121 >TIGR01553 formate-DH-alph formate dehydrogenase, alpha subunit; InterPro: IPR006443 This family of sequences describe a subset of formate dehydrogenase alpha chains found mainly in proteobacteria but also in Aquifex aeolicus. The alpha chain contains domains for molybdopterin dinucleotide binding and molybdopterin oxidoreductase. The holo-enzyme also contains beta and gamma subunits of 32 and 20 kDa. The enzyme catalyzes the oxidation of formate (produced from pyruvate during anaerobic growth) to carbon dioxide with the concomitant release of two electrons and two protons. The electrons are utilised mainly in the nitrate respiration by nitrate reductase . In Escherichia coli and Salmonella typhi, there are two forms of the formate dehydrogenase, one induced by nitrate which is strictly anaerobic (fdn), and one incuced during the transition from aerobic to anaerobic growth (fdo). This subunit is one of only three proteins in Escherichia coli which contain selenocysteine .; GO: 0008863 formate dehydrogenase activity, 0045333 cellular respiration, 0005737 cytoplasm. Probab=39.06 E-value=24 Score=14.56 Aligned_cols=94 Identities=15% Similarity=0.322 Sum_probs=63.5 Q ss_pred CCCCCCCCEEEEEE------------EEEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCHHHHHH Q ss_conf 88987726048999------------999838972804002266217888778578989857998899607999799999 Q gi|254780661|r 13 FGRDVALDRPAQFA------------GVRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVEAEFSR 80 (471) Q Consensus 13 tG~~~~~~~iiq~~------------~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~~e~~~ 80 (471) -|.||+.+.||-|= .|++|.=|+-.-+.-.+++.++++. .+..+.|++.-.|.++ .-..|-++ T Consensus 238 ~GgNPAEnHP~gFkWai~Akek~GAkiI~iDPRfnRTa~vaDl~apiRsGs----DIaFL~G~i~Y~Le~e-~yqkeYV~ 312 (1043) T TIGR01553 238 MGGNPAENHPVGFKWAIKAKEKKGAKIIVIDPRFNRTAAVADLYAPIRSGS----DIAFLNGLIKYVLEKE-KYQKEYVK 312 (1043) T ss_pred CCCCCCCCCCCCCEEHHHHHCCCCCEEEEECCCCCCCHHHCEEEEECCCCC----HHHHHHHHHHHHHHHC-CHHHHHHC T ss_conf 177872026765100221111378779997889871010011332168983----3888767999987403-31001120 Q ss_pred H-------HHHHHHCCCCEEEEEEC--HHHHHHHHHHHHH Q ss_conf 9-------99995117965999505--5789999999998 Q gi|254780661|r 81 R-------IHQFFSVPNTCIIGYNN--IRFDDYYSRNIFY 111 (471) Q Consensus 81 ~-------i~~~~~~~~~~~ig~N~--~~FD~~~lr~~~~ 111 (471) . |.+=|.....+|+|||- .+||-.=..+.|= T Consensus 313 ~yTnasFiv~EgF~FedGLF~GYn~e~R~YDkskW~yefD 352 (1043) T TIGR01553 313 LYTNASFIVDEGFAFEDGLFAGYNKETRKYDKSKWDYEFD 352 (1043) T ss_pred CCCCCCEEECCCCCCCCCCEECCCCCCCCCCCCCCCCCCC T ss_conf 0136540661786511574206673446657642674237 No 122 >pfam06471 NSP11 NSP11. This region of coronavirus polyproteins encodes the NSP11 protein. Probab=35.77 E-value=27 Score=14.23 Aligned_cols=166 Identities=22% Similarity=0.188 Sum_probs=79.4 Q ss_pred EEEEECCC---CCCCCCCCEEEEEEE------EEECCCCEECCEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCCCH Q ss_conf 79997527---889877260489999------998389728040022662178887785789898579988996079997 Q gi|254780661|r 5 FVIYDYET---FGRDVALDRPAQFAG------VRVDRQWEKIESTEVFFCKPADDYLPDPEAVFITGITPQKALRDGVVE 75 (471) Q Consensus 5 ~v~~D~ET---tG~~~~~~~iiq~~~------i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a~~v~git~~~~~~~~~~~ 75 (471) .|-||.|- +|-+...+-++|+|. +.+ ..|-+..+... ..+|-+-.. .|+-+.-|=| | ++..|.|. T Consensus 91 WiGfDVEgaHa~~~niGTN~PLQ~GFStGvdFvV~-peG~v~te~g~-~~~~v~Aka-PPGeQF~HLi-p--lmrkgqpW 164 (594) T pfam06471 91 WIGFDVEGAHATGDNIGTNLPLQLGFSTGVDFVVT-PEGCVDTEDGS-VFEPVVAKA-PPGEQFKHLI-P--LMRKGQPW 164 (594) T ss_pred HCCCCEECCCCCCCCCCCCCCEEECCCCCCEEEEE-CCEEEEECCCC-EEEECCCCC-CCHHHHHHHH-H--HHHCCCCC T ss_conf 03222010110266657675636512477407980-21389814786-664413779-9727766666-8--87469986 Q ss_pred HHHHHHHHHHHHC----CC-C-EEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHH-----------HHH Q ss_conf 9999999999511----79-6-59995055789999999998507853213256887100158888-----------655 Q gi|254780661|r 76 AEFSRRIHQFFSV----PN-T-CIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVM-----------RAI 138 (471) Q Consensus 76 ~e~~~~i~~~~~~----~~-~-~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~-----------~~~ 138 (471) .-+-++|.+.+.+ -+ . +||.|-. +|...-+|+. -+-+-.-.+ ...+|.-.++.. .+- T Consensus 165 ~VVR~rIVqm~~D~l~~lSD~vvfV~Wa~-glELTtmrYF-vKiG~e~~C----~C~krAtcyns~~~~y~C~~H~~gcD 238 (594) T pfam06471 165 SVVRRRIVQMLSDYLAGLSDYVVFVTWAH-GLELTTMRYF-VKIGPEQKC----FCGKRATCYNSVTGVYACWRHSLGCD 238 (594) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECCC-CEEHHHHHHH-HHHCCCEEE----CCCCCEEEECCCCCEEEEECCCCCCC T ss_conf 10399999999876057676799998058-5001355688-762761361----47981467216886067410157886 Q ss_pred HHHHHH---HCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 565321---00244444544331089899871887643101333189999999 Q gi|254780661|r 139 YAFSPD---GIQWPSRDDGATSFKLQDLALANGIEHVNAHDAKADVYATLALV 188 (471) Q Consensus 139 ~~~~p~---~~~~~~~~~~~~~~kL~~la~~n~i~~~~aH~A~~D~~~t~~l~ 188 (471) |++.|= .-.|... |.-|..-+ +.--.|.+||-|.+|+..|.=|+ T Consensus 239 YvYNP~~vDiQQWGy~--G~Ls~NHd----~~CnvH~naHVAS~DAiMTrCLA 285 (594) T pfam06471 239 YVYNPFLIDIQQWGYT--GSLSTNHD----AVCNVHRNAHVASADAIMTRCLA 285 (594) T ss_pred EEECCEEEEHHHCCCC--CCCCCCCC----CCCCCCCCCCCCCCCHHHHHHHH T ss_conf 1457778871151522--10256764----10003477642340179988899 No 123 >PRK03643 altronate oxidoreductase; Provisional Probab=33.56 E-value=29 Score=14.01 Aligned_cols=15 Identities=27% Similarity=0.479 Sum_probs=9.5 Q ss_pred HHHHHHHHHHHHHCC Q ss_conf 999999999997527 Q gi|254780661|r 455 GLINALYEYLQWVIP 469 (471) Q Consensus 455 ~iL~~l~~yl~~l~~ 469 (471) .+.+.+..|++.|.. T Consensus 457 ~f~~~v~~~l~~i~~ 471 (482) T PRK03643 457 GLVEQVAEDLDAILE 471 (482) T ss_pred HHHHHHHHHHHHHHH T ss_conf 699999999999998 No 124 >PRK08928 consensus Probab=32.09 E-value=31 Score=13.85 Aligned_cols=89 Identities=18% Similarity=0.112 Sum_probs=54.2 Q ss_pred HHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99999995117965999505578999999999850785321325688710015888865556532100244444544331 Q gi|254780661|r 79 SRRIHQFFSVPNTCIIGYNNIRFDDYYSRNIFYRNFYDSYRWSWDNGNSRWDLLDVMRAIYAFSPDGIQWPSRDDGATSF 158 (471) Q Consensus 79 ~~~i~~~~~~~~~~~ig~N~~~FD~~~lr~~~~r~~~~~y~~~~~~~~~~~D~l~~~~~~~~~~p~~~~~~~~~~~~~~~ 158 (471) ...+..++.++...-|||| +.||..+|.. ++..- + ...|++- +.|.+.| |...+ T Consensus 363 ~~~l~~lLed~~i~KIghN-lK~dl~vL~~----~gi~l-------~-~i~DTmL---asYLLnp----------~~~~h 416 (861) T PRK08928 363 SDIIFNLLTDKSILKITYD-LKPLLKFYAN----QSHDI-------T-AIDDLML---MSYSLSA----------GLHDH 416 (861) T ss_pred HHHHHHHHCCCCCCEEECC-HHHHHHHHHH----CCCCC-------C-CCCCHHH---HHHHCCC----------CCCCC T ss_conf 9999998439997544545-7999999985----69675-------7-6371889---9986499----------98768 Q ss_pred CHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 08989987188764310133318999999999986422 Q gi|254780661|r 159 KLQDLALANGIEHVNAHDAKADVYATLALVWLIREKKP 196 (471) Q Consensus 159 kL~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p 196 (471) .|++++....... --|..|+..+..|...+..+.. T Consensus 417 ~Ld~La~~~~~~~---~~aa~~a~~~~~L~~~L~~~L~ 451 (861) T PRK08928 417 SLFNIILKNTNII---NESAKIVCTLISLHKQLILELF 451 (861) T ss_pred CHHHHHHHHCCCC---CHHHHHHHHHHHHHHHHHHHHH T ss_conf 9899997622200---1479999999999999999998 No 125 >KOG2481 consensus Probab=30.34 E-value=33 Score=13.67 Aligned_cols=68 Identities=15% Similarity=0.177 Sum_probs=40.4 Q ss_pred CCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHCCHHHH--HHHHHHCC Q ss_conf 4544331089899871887643101333189999999999864----------22689999962388989--97420015 Q gi|254780661|r 152 DDGATSFKLQDLALANGIEHVNAHDAKADVYATLALVWLIREK----------KPKLFEYLYDYRNKNQL--RKLIDIQN 219 (471) Q Consensus 152 ~~~~~~~kL~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~----------~p~~~~~~l~~~~K~~v--~~~l~~~~ 219 (471) .+..|+|+|+++-+.---.-.+|-+-+.||..++-|+-.+-.. |-.+-..++...-+... .-+++... T Consensus 106 ~~~~p~Y~lDhiIKERyptf~dAlrDlddaLsmlfLfs~lP~td~~~~~~v~~crrlt~EF~hYVi~s~aLrkvF~SiKG 185 (570) T KOG2481 106 LDNKPTYKLDHIIKERYPTFIDALRDLDDALSMLFLFSTLPSTDRLHVKIVALCRRLTAEFMHYVIASSALRKVFLSIKG 185 (570) T ss_pred HHCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 61588620326788755757878877888888888874088534200668888999999999999987546666621022 No 126 >pfam08125 Mannitol_dh_C Mannitol dehydrogenase C-terminal domain. Probab=29.76 E-value=34 Score=13.60 Aligned_cols=19 Identities=21% Similarity=0.090 Sum_probs=9.6 Q ss_pred CCHHHHHHHHHHHHHHHHH Q ss_conf 1013331899999999998 Q gi|254780661|r 174 AHDAKADVYATLALVWLIR 192 (471) Q Consensus 174 aH~A~~D~~~t~~l~k~i~ 192 (471) -|+|+.|-....-+-+++. T Consensus 98 V~eam~d~~~~~~v~~~~~ 116 (245) T pfam08125 98 IHEAMNDPEIRAFVEKLML 116 (245) T ss_pred HHHHHCCHHHHHHHHHHHH T ss_conf 9998679999999999999 No 127 >TIGR02267 TIGR02267 Myxococcus xanthus paralogous family TIGR02267; InterPro: IPR011753 This family consists of at least 7 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. The function is unknown.. Probab=28.68 E-value=35 Score=13.48 Aligned_cols=54 Identities=22% Similarity=0.139 Sum_probs=36.5 Q ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHCCHHHHHH Q ss_conf 89899871887643101333189999999999864226----8999996238898997 Q gi|254780661|r 160 LQDLALANGIEHVNAHDAKADVYATLALVWLIREKKPK----LFEYLYDYRNKNQLRK 213 (471) Q Consensus 160 L~~la~~n~i~~~~aH~A~~D~~~t~~l~k~i~~~~p~----~~~~~l~~~~K~~v~~ 213 (471) |-.-|..-||...++|.|+++.-...+|++-+..+.-+ +....-.....++..| T Consensus 34 L~rsArEVai~~~dv~~Ala~d~~A~~LL~e~~~RIr~GS~RL~rAL~~a~~~~~AGD 91 (123) T TIGR02267 34 LLRSAREVAISEADVTSALADDVSALALLKEISRRIREGSRRLSRALTEAVRHRDAGD 91 (123) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 8521201489877888886318989999999988865644899999999986300268 No 128 >TIGR02438 catachol_actin catechol 1,2-dioxygenase; InterPro: IPR012800 Members of this family are catechol 1,2-dioxygenases of the actinobacteria. They are more closely related to actinobacterial chlorocatechol 1,2-dioxygenases than to proteobacterial catechol 1,2-dioxygenases, and so form this separate entry. The member from Rhodococcus rhodochrous is described as a homodimer with bound Fe, similarly active on catechol, 3-methylcatechol and 4-methylcatechol.. Probab=28.00 E-value=36 Score=13.41 Aligned_cols=27 Identities=19% Similarity=0.196 Sum_probs=14.7 Q ss_pred EEEECCCCEECCEEEEEEECCCCCCCCCHHH Q ss_conf 9998389728040022662178887785789 Q gi|254780661|r 27 GVRVDRQWEKIESTEVFFCKPADDYLPDPEA 57 (471) Q Consensus 27 ~i~~d~~~~~~~~~~~~~~~~~~~~~p~p~a 57 (471) .|.+|++| +|.+..++|.|+.+|.-++ T Consensus 183 ti~~d~~G----~f~I~T~~PaPYqIP~DG~ 209 (287) T TIGR02438 183 TIIADDEG----RFEITTLQPAPYQIPTDGP 209 (287) T ss_pred EEEECCCC----CEEEEEECCCCCCCCCCCC T ss_conf 37874888----5788752788873878886 No 129 >pfam10866 DUF2704 Protein of unknown function (DUF2704). This viral family of proteins has no known function. Probab=25.73 E-value=40 Score=13.15 Aligned_cols=79 Identities=15% Similarity=0.085 Sum_probs=52.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCEEEEEEECCCCCCCE Q ss_conf 31013331899999999998642268999996238898997420015760799701010477875068863028887744 Q gi|254780661|r 173 NAHDAKADVYATLALVWLIREKKPKLFEYLYDYRNKNQLRKLIDIQNMTPLVHVSGMFGASRSNTALIAPVAWHPRYKDS 252 (471) Q Consensus 173 ~aH~A~~D~~~t~~l~k~i~~~~p~~~~~~l~~~~K~~v~~~l~~~~~~~~~~v~~~~~~~~~~~~~v~~~~~~p~~~n~ 252 (471) |||.-.+|.....++.|.+.+.....-...-...-|..|.++++.+-.-.. ..|.-..++..|-+.+. - T Consensus 91 Na~~l~ad~~fk~~i~k~lq~qLq~~~~~~k~~TfKn~VlnvlN~eL~~~~----~~yD~s~gYIKPNCIV~-------T 159 (169) T pfam10866 91 NAAELAADAEFKDKILKILQSQLQAALADLKTNTFKNIVLNVLNNELSDDA----CKYDTSDGYIKPNCIVL-------T 159 (169) T ss_pred CHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCH----HHCCCCCCCCCCCEEEE-------E T ss_conf 798860437889989999999999999874025099999999865614512----43025357536885898-------6 Q ss_pred EEEEECCCCC Q ss_conf 7871035687 Q gi|254780661|r 253 VIGCNLSGDM 262 (471) Q Consensus 253 ~i~fDL~~Dp 262 (471) +-|.||.++| T Consensus 160 fnCC~Ltf~p 169 (169) T pfam10866 160 FNCCNLTFKP 169 (169) T ss_pred EEECCEEECC T ss_conf 4301323178 No 130 >TIGR00875 talC transaldolase, putative; InterPro: IPR004731 Transaldolase (2.2.1.2 from EC) catalyzes the reversible transfer of a three-carbon ketol unit from sedoheptulose 7-phosphate to glyceraldehyde 3-phosphate to form erythrose 4-phosphate and fructose 6-phosphate. This enzyme, together with transketolase, provides a link between the glycolytic and pentose-phosphate pathways. Transaldolase is an enzyme of about 34 Kd whose sequence has been well conserved throughout evolution. A lysine has been implicated in the catalytic mechanism of the enzyme; it acts as a nucleophilic group that attacks the carbonyl group of fructose-6-phosphate. Transaldolase is evolutionary related to a bacterial protein of about 20 Kd (known as talC in Escherichia coli), whose exact function is not yet known.; GO: 0005975 carbohydrate metabolic process. Probab=24.70 E-value=41 Score=13.03 Aligned_cols=47 Identities=17% Similarity=0.222 Sum_probs=34.4 Q ss_pred HHCCC--HHHHHHCC-CCHHHHHHHHHHHHHCCCCEEEEEECHHHHH-HHHHH Q ss_conf 85799--88996079-9979999999999511796599950557899-99999 Q gi|254780661|r 60 ITGIT--PQKALRDG-VVEAEFSRRIHQFFSVPNTCIIGYNNIRFDD-YYSRN 108 (471) Q Consensus 60 v~git--~~~~~~~~-~~~~e~~~~i~~~~~~~~~~~ig~N~~~FD~-~~lr~ 108 (471) +-|+| |..+.+++ .+++|++++|.+.+.+.|+++ ...++-|= ++.+. T Consensus 22 ~~GVTTNPsliak~~n~~f~evl~ei~~~vGd~G~vs--ae~~S~dAegMv~E 72 (216) T TIGR00875 22 LAGVTTNPSLIAKEGNRSFWEVLKEIQEVVGDEGPVS--AEVISLDAEGMVEE 72 (216) T ss_pred CCCCCCCHHHHHHCCCCCHHHHHHHHHHHHCCCCCEE--HHHHHHHHHHHHHH T ss_conf 4773456334201368878899999998745888553--43430417889999 No 131 >KOG0968 consensus Probab=20.71 E-value=49 Score=12.52 Aligned_cols=55 Identities=15% Similarity=0.058 Sum_probs=40.8 Q ss_pred HCCCHHHHHHCCCCHHHHHHHHHHHHHCCCC-EEEEEECHHHHHHHHHHHHHHHCCC Q ss_conf 5799889960799979999999999511796-5999505578999999999850785 Q gi|254780661|r 61 TGITPQKALRDGVVEAEFSRRIHQFFSVPNT-CIIGYNNIRFDDYYSRNIFYRNFYD 116 (471) Q Consensus 61 ~git~~~~~~~~~~~~e~~~~i~~~~~~~~~-~~ig~N~~~FD~~~lr~~~~r~~~~ 116 (471) +||+.-.+.-. .++.|+..++.+.+.+|++ +.+||-.+++-.+++-.-.++.+++ T Consensus 724 ~~~~~~~V~~~-~sE~elf~ev~~~i~q~DPDIl~GyEi~~~SWGyl~eR~~~l~~d 779 (1488) T KOG0968 724 GGIYGCRVVVM-ESELELFEEVAKLIVQYDPDILLGYEIHNLSWGYLIERAKLLGID 779 (1488) T ss_pred CCCCCCEEEEE-HHHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHCCH T ss_conf 78688458985-658999999999998629660553553055059999999985641 No 132 >pfam11618 DUF3250 Protein of unknown function (DUF3250). This family of proteins represents a protein with unknown function. It may be the C2 domain from KIAA1005 however this cannot be confirmed. Probab=20.62 E-value=50 Score=12.51 Aligned_cols=10 Identities=20% Similarity=0.547 Sum_probs=3.8 Q ss_pred CCHHHHHHHC Q ss_conf 8578989857 Q gi|254780661|r 53 PDPEAVFITG 62 (471) Q Consensus 53 p~p~a~~v~g 62 (471) ++|+|+..-| T Consensus 21 ~sp~~l~~~~ 30 (141) T pfam11618 21 ETPAGLELSG 30 (141) T ss_pred ECHHHHHHCC T ss_conf 8799985468 No 133 >TIGR01282 nifD nitrogenase molybdenum-iron protein alpha chain; InterPro: IPR005972 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene , . Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes . It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I .; GO: 0016163 nitrogenase activity, 0009399 nitrogen fixation, 0016612 molybdenum-iron nitrogenase complex. Probab=20.21 E-value=51 Score=12.46 Aligned_cols=27 Identities=30% Similarity=0.187 Sum_probs=12.4 Q ss_pred EEEEEECHHHHHHHHHHHHHHHCCCCC Q ss_conf 599950557899999999985078532 Q gi|254780661|r 92 CIIGYNNIRFDDYYSRNIFYRNFYDSY 118 (471) Q Consensus 92 ~~ig~N~~~FD~~~lr~~~~r~~~~~y 118 (471) .+||-.||+=|-==.|..|-.-++.-. T Consensus 214 ~~iGdYNIGGDAW~~~~lLEe~Glrvv 240 (521) T TIGR01282 214 AIIGDYNIGGDAWASRILLEEIGLRVV 240 (521) T ss_pred EEEECCCCCCHHHHHHHHHHHCCCEEE T ss_conf 997244777308899999973694899 Done!