Query gi|254781193|ref|YP_003065606.1| putative DNA polymerase from bacteriophage origin [Candidatus Liberibacter asiaticus str. psy62] Match_columns 675 No_of_seqs 184 out of 2544 Neff 8.1 Searched_HMMs 39220 Date Mon May 30 06:28:41 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781193.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK06887 consensus 100.0 0 0 918.4 44.9 539 2-674 357-951 (954) 2 PRK07556 consensus 100.0 0 0 914.9 45.3 540 2-674 388-974 (977) 3 PRK05755 DNA polymerase I; Pro 100.0 0 0 912.4 47.0 538 2-674 320-886 (889) 4 PRK07997 consensus 100.0 0 0 913.3 43.9 539 2-674 350-925 (928) 5 PRK07625 consensus 100.0 0 0 907.0 45.9 539 2-674 345-919 (922) 6 PRK08786 consensus 100.0 0 0 906.8 43.9 539 2-674 349-924 (927) 7 PRK05929 consensus 100.0 0 0 896.1 46.5 536 2-674 311-868 (870) 8 PRK08835 consensus 100.0 0 0 899.8 43.0 539 2-674 352-928 (931) 9 PRK07898 consensus 100.0 0 0 890.1 46.4 534 2-674 338-898 (902) 10 PRK07456 consensus 100.0 0 0 887.3 45.1 536 3-674 384-972 (975) 11 PRK08928 consensus 100.0 0 0 888.3 43.2 498 27-674 335-856 (861) 12 PRK05797 consensus 100.0 0 0 884.5 43.1 518 3-674 321-866 (869) 13 PRK08076 consensus 100.0 0 0 881.9 45.0 533 4-674 317-874 (877) 14 PRK08434 consensus 100.0 0 0 876.6 42.8 530 2-674 319-884 (887) 15 TIGR00593 pola DNA polymerase 100.0 0 0 875.4 29.3 492 60-674 460-1002(1005) 16 PRK07300 consensus 100.0 0 0 857.9 42.5 515 28-674 336-877 (880) 17 COG0749 PolA DNA polymerase I 100.0 0 0 809.5 36.1 536 4-674 25-590 (593) 18 pfam00476 DNA_pol_A DNA polyme 100.0 0 0 651.8 31.8 367 199-674 1-381 (383) 19 cd06444 DNA_pol_A DNA polymera 100.0 0 0 644.2 32.7 367 199-674 1-378 (379) 20 smart00482 POLAc DNA polymeras 100.0 0 0 366.6 14.6 199 362-639 2-206 (206) 21 cd06139 DNA_polA_I_Ecoli_like_ 99.9 3.3E-25 8.5E-30 181.6 16.4 173 2-199 6-190 (193) 22 cd06128 DNA_polA_exo The 3'-5' 99.9 7.6E-25 1.9E-29 179.2 16.5 151 2-177 2-166 (166) 23 cd06140 DNA_polA_I_Bacillus_li 99.8 7.5E-20 1.9E-24 146.9 15.1 168 2-199 4-175 (178) 24 cd00007 35EXOc 3'-5' exonuclea 99.8 2.4E-19 6.1E-24 143.6 15.1 149 2-178 1-154 (155) 25 cd06142 RNaseD_exo Ribonucleas 99.8 3.1E-18 7.9E-23 136.4 15.4 151 2-185 13-166 (178) 26 pfam01612 3_5_exonuc 3'-5' exo 99.8 2.4E-18 6.2E-23 137.1 14.0 148 2-179 21-171 (172) 27 KOG3657 consensus 99.8 1.3E-16 3.4E-21 125.8 22.0 248 358-654 717-998 (1075) 28 smart00474 35EXOc 3'-5' exonuc 99.8 1.4E-17 3.6E-22 132.2 15.5 146 2-179 22-171 (172) 29 cd06147 Rrp6p_like_exo Yeast R 99.7 2.9E-15 7.4E-20 117.2 14.7 154 2-187 25-181 (192) 30 cd06148 Egl_like_exo The Egali 99.7 5.4E-15 1.4E-19 115.4 15.8 157 2-182 11-180 (197) 31 COG0349 Rnd Ribonuclease D [Tr 99.7 1.1E-15 2.8E-20 120.0 11.6 159 2-192 18-179 (361) 32 cd06129 RNaseD_like The RNase 99.6 6.3E-13 1.6E-17 102.1 16.0 138 3-172 17-158 (159) 33 cd06141 WRN_exo WRN is a uniqu 99.5 9.8E-13 2.5E-17 100.9 16.1 146 2-175 19-168 (168) 34 TIGR01388 rnd ribonuclease D; 99.5 1.9E-12 4.8E-17 99.0 14.5 158 3-193 20-181 (374) 35 PRK10829 ribonuclease D; Provi 99.4 1.6E-11 4.2E-16 92.9 15.4 147 2-181 23-172 (373) 36 cd06146 mut-7_like_exo The mut 99.4 3.4E-11 8.8E-16 90.8 16.0 146 2-175 23-191 (191) 37 PRK06807 DNA polymerase III su 99.3 1.2E-10 3.1E-15 87.3 13.5 87 56-180 81-173 (313) 38 PRK06631 consensus 99.3 8.7E-11 2.2E-15 88.2 12.4 88 55-177 78-174 (229) 39 PRK08517 DNA polymerase III su 99.3 2.1E-10 5.4E-15 85.7 13.6 91 55-182 141-236 (259) 40 PRK08074 bifunctional ATP-depe 99.3 2.1E-10 5.3E-15 85.8 13.5 108 2-120 4-145 (932) 41 cd06126 DEDDy DEDDy exonucleas 99.3 4.1E-10 1.1E-14 83.9 14.3 147 3-173 1-181 (181) 42 PRK07740 hypothetical protein; 99.2 4.4E-10 1.1E-14 83.7 13.9 133 1-180 57-223 (240) 43 PRK07246 bifunctional ATP-depe 99.2 4.8E-10 1.2E-14 83.4 13.8 106 2-121 8-147 (820) 44 PRK07982 consensus 99.2 5E-10 1.3E-14 83.3 13.6 133 1-176 6-179 (243) 45 PRK07009 consensus 99.2 3.8E-10 9.7E-15 84.1 12.9 132 1-176 1-169 (241) 46 TIGR01406 dnaQ_proteo DNA poly 99.2 1.4E-10 3.6E-15 86.9 10.5 89 53-176 79-177 (246) 47 PRK08816 consensus 99.2 6.6E-10 1.7E-14 82.6 13.9 132 1-176 1-170 (244) 48 PRK05711 DNA polymerase III su 99.2 5.4E-10 1.4E-14 83.1 13.1 88 55-177 78-174 (240) 49 PRK06310 DNA polymerase III su 99.2 6.3E-10 1.6E-14 82.7 13.0 87 56-178 80-173 (250) 50 TIGR01405 polC_Gram_pos DNA po 99.2 2.1E-10 5.4E-15 85.7 10.3 69 52-122 274-349 (1264) 51 PRK06309 DNA polymerase III su 99.2 1E-09 2.6E-14 81.3 13.2 86 56-177 72-164 (232) 52 PRK06022 consensus 99.2 7.5E-10 1.9E-14 82.2 12.3 87 55-176 75-167 (234) 53 PRK06195 DNA polymerase III su 99.2 8E-10 2E-14 82.0 11.7 64 54-119 71-140 (309) 54 PRK07883 hypothetical protein; 99.1 2.7E-09 7E-14 78.5 14.0 87 55-178 96-190 (575) 55 PRK06859 consensus 99.1 2.5E-09 6.5E-14 78.7 13.7 88 54-176 82-177 (253) 56 PRK08858 consensus 99.1 3.5E-09 9E-14 77.8 13.7 88 54-176 88-184 (246) 57 cd06131 DNA_pol_III_epsilon_Ec 99.1 4.1E-09 1E-13 77.4 13.2 85 56-175 74-166 (167) 58 cd05781 DNA_polB_B3_exo The 3' 99.1 1.2E-08 3E-13 74.4 15.1 153 1-175 3-188 (188) 59 PRK07942 DNA polymerase III su 99.1 6E-09 1.5E-13 76.3 13.4 80 64-178 92-179 (234) 60 PRK09145 DNA polymerase III su 99.1 1E-08 2.7E-13 74.8 13.9 128 2-177 30-199 (203) 61 cd05782 DNA_polB_like1_exo A s 99.1 1.4E-08 3.6E-13 73.9 14.5 131 28-174 51-208 (208) 62 cd05785 DNA_polB_like2_exo A s 99.0 2.6E-08 6.6E-13 72.2 15.0 152 1-172 9-206 (207) 63 cd06134 RNaseT RNase T is a DE 99.0 1.9E-08 4.7E-13 73.2 13.3 79 66-177 103-187 (189) 64 TIGR01407 dinG_rel DnaQ family 99.0 1.8E-08 4.7E-13 73.2 12.7 109 2-121 1-143 (944) 65 cd06130 DNA_pol_III_epsilon_li 99.0 2.1E-08 5.3E-13 72.8 12.7 63 55-119 69-137 (156) 66 PRK06063 DNA polymerase III su 99.0 1.6E-08 4E-13 73.7 11.6 66 53-120 84-155 (313) 67 PRK00448 polC DNA polymerase I 98.9 1.9E-08 4.8E-13 73.1 11.5 64 56-121 491-560 (1436) 68 PRK09146 DNA polymerase III su 98.9 1.1E-07 2.9E-12 68.1 13.7 83 56-176 122-224 (239) 69 cd05780 DNA_polB_Kod1_like_exo 98.9 7.2E-08 1.8E-12 69.3 12.6 152 1-175 3-195 (195) 70 PRK05168 ribonuclease T; Provi 98.9 9.4E-08 2.4E-12 68.6 13.2 81 66-179 115-201 (213) 71 cd06125 DnaQ_like_exo The DnaQ 98.7 1.2E-07 3.1E-12 67.9 10.1 89 4-101 1-119 (130) 72 cd05784 DNA_polB_II_exo The 3' 98.7 7.9E-08 2E-12 69.1 9.0 153 1-173 3-193 (193) 73 cd06127 DEDDh DEDDh exonucleas 98.7 5.3E-07 1.4E-11 63.7 12.6 48 55-104 71-121 (159) 74 COG2176 PolC DNA polymerase II 98.7 5.4E-08 1.4E-12 70.1 6.4 66 54-121 492-563 (1444) 75 cd06136 TREX1_2 Three prime re 98.6 1.5E-06 3.7E-11 60.9 12.4 81 57-174 88-176 (177) 76 PRK07983 exodeoxyribonuclease 98.6 1.1E-06 2.9E-11 61.6 11.1 86 55-177 65-152 (219) 77 cd05160 DNA_polB_exo The 3'-5' 98.6 2E-06 5.1E-11 60.0 12.3 146 4-173 2-200 (200) 78 PRK05762 DNA polymerase II; Re 98.6 4.2E-06 1.1E-10 57.9 13.7 156 2-176 156-347 (785) 79 smart00479 EXOIII exonuclease 98.6 1.9E-06 4.9E-11 60.1 11.7 89 55-178 72-166 (169) 80 PRK07247 DNA polymerase III su 98.5 1.8E-06 4.5E-11 60.4 11.0 85 55-178 76-168 (195) 81 cd05777 DNA_polB_delta_exo The 98.5 3.4E-06 8.8E-11 58.5 12.1 158 1-179 7-227 (230) 82 cd05779 DNA_polB_epsilon_exo T 98.5 3.3E-06 8.4E-11 58.6 11.8 149 2-173 3-204 (204) 83 cd05783 DNA_polB_B1_exo The 3' 98.3 8.3E-06 2.1E-10 56.0 9.8 147 1-172 5-201 (204) 84 COG0847 DnaQ DNA polymerase II 98.3 2.2E-05 5.5E-10 53.3 11.6 89 55-177 86-180 (243) 85 pfam10108 Exon_PolB Predicted 98.3 2.3E-05 5.8E-10 53.1 11.2 132 29-176 10-172 (211) 86 smart00486 POLBc DNA polymeras 98.3 0.00024 6.2E-09 46.5 16.3 155 2-175 4-220 (471) 87 COG3359 Predicted exonuclease 98.2 3.4E-05 8.8E-10 52.0 11.7 152 3-178 100-268 (278) 88 pfam00929 Exonuc_X-T Exonuclea 98.2 2E-05 5.2E-10 53.5 10.3 65 55-120 72-144 (162) 89 cd06138 ExoI_N This subfamily 98.1 4.2E-05 1.1E-09 51.5 10.3 65 55-119 74-164 (183) 90 cd06144 REX4_like This subfami 98.1 3.8E-05 9.6E-10 51.7 9.9 80 54-173 68-152 (152) 91 KOG2206 consensus 98.1 1E-05 2.7E-10 55.3 5.9 125 30-176 231-357 (687) 92 PTZ00166 DNA polymerase delta 98.0 0.00066 1.7E-08 43.7 14.6 112 52-179 358-512 (1081) 93 PRK09182 DNA polymerase III su 98.0 1.7E-05 4.3E-10 54.0 6.5 60 56-117 113-175 (293) 94 cd06133 ERI-1_3'hExo_like This 98.0 0.00038 9.8E-09 45.2 12.5 87 55-175 80-175 (176) 95 TIGR00573 dnaq exonuclease, DN 97.8 0.00066 1.7E-08 43.7 11.9 90 53-177 78-177 (228) 96 KOG1798 consensus 97.8 0.00045 1.1E-08 44.8 10.9 150 2-175 247-450 (2173) 97 COG0417 PolB DNA polymerase el 97.7 0.002 5.1E-08 40.6 12.6 178 1-204 154-372 (792) 98 pfam00940 RNA_pol DNA-dependen 97.4 0.0094 2.4E-07 36.2 13.5 179 427-645 154-333 (391) 99 cd06149 ISG20 Interferon (IFN) 97.4 0.0029 7.5E-08 39.5 10.3 57 54-116 68-131 (157) 100 cd05778 DNA_polB_zeta_exo The 97.3 0.0084 2.1E-07 36.5 11.9 152 1-172 4-230 (231) 101 PRK05601 DNA polymerase III su 97.0 0.0026 6.6E-08 39.9 6.6 66 53-120 115-213 (377) 102 TIGR03491 RecB family nuclease 96.8 0.036 9.3E-07 32.4 12.5 14 446-459 356-369 (457) 103 cd05776 DNA_polB_alpha_exo The 96.8 0.034 8.7E-07 32.6 10.9 107 53-176 85-227 (234) 104 PRK05761 DNA polymerase I; Rev 96.8 0.0072 1.8E-07 37.0 7.2 102 50-173 198-327 (777) 105 PHA00452 T3/T7-like RNA polyme 96.3 0.081 2.1E-06 30.2 13.6 230 374-645 504-761 (816) 106 PRK06722 exonuclease; Provisio 96.1 0.011 2.8E-07 35.7 5.0 51 51-103 39-92 (242) 107 cd06137 DEDDh_RNase This group 95.6 0.021 5.4E-07 33.9 4.5 48 63-116 82-135 (161) 108 cd06145 REX1_like This subfami 95.5 0.04 1E-06 32.2 5.6 55 55-116 67-124 (150) 109 pfam03175 DNA_pol_B_2 DNA poly 95.4 0.048 1.2E-06 31.6 5.8 114 67-185 2-155 (452) 110 pfam11074 DUF2779 Domain of un 95.2 0.12 3.2E-06 29.0 7.4 74 5-83 2-87 (127) 111 pfam03104 DNA_pol_B_exo DNA po 95.0 0.17 4.3E-06 28.1 7.5 76 1-82 86-185 (254) 112 KOG0969 consensus 94.4 0.1 2.5E-06 29.6 5.2 115 49-178 337-493 (1066) 113 cd06143 PAN2_exo PAN2 is the c 94.2 0.098 2.5E-06 29.6 4.8 52 61-116 98-150 (174) 114 TIGR00593 pola DNA polymerase 94.0 0.076 1.9E-06 30.4 3.9 283 155-519 566-913 (1005) 115 pfam04857 CAF1 CAF1 family rib 93.5 0.24 6E-06 27.2 5.7 59 62-120 137-214 (235) 116 KOG2249 consensus 93.5 0.53 1.3E-05 24.9 9.7 83 55-177 176-264 (280) 117 cd06135 Orn Oligoribonuclease 92.9 0.39 1E-05 25.7 6.1 53 57-117 86-138 (173) 118 PRK08928 consensus 92.8 0.078 2E-06 30.3 2.5 29 108-136 159-187 (861) 119 PRK07748 sporulation inhibitor 91.7 0.89 2.3E-05 23.4 14.7 138 1-177 4-178 (205) 120 PRK11779 sbcB exonuclease I; P 91.2 1 2.6E-05 23.1 12.1 29 55-83 84-113 (477) 121 PRK07556 consensus 90.8 0.15 3.8E-06 28.5 2.0 29 108-136 167-195 (977) 122 COG0749 PolA DNA polymerase I 90.4 0.27 6.9E-06 26.8 3.1 57 462-520 447-503 (593) 123 PRK06887 consensus 90.3 0.18 4.6E-06 27.9 2.1 27 108-134 161-187 (954) 124 PRK08076 consensus 90.2 0.21 5.4E-06 27.5 2.3 25 108-132 161-185 (877) 125 PRK07625 consensus 90.0 0.27 6.9E-06 26.8 2.8 27 108-134 160-186 (922) 126 PRK08786 consensus 90.0 0.19 4.9E-06 27.7 2.0 25 110-134 158-182 (927) 127 PRK07300 consensus 89.7 0.17 4.5E-06 28.0 1.6 24 109-132 170-193 (880) 128 PRK07898 consensus 89.3 0.22 5.7E-06 27.3 2.0 26 109-134 176-201 (902) 129 PRK07997 consensus 88.9 0.22 5.7E-06 27.3 1.7 27 108-134 161-187 (928) 130 TIGR01298 RNaseT ribonuclease 88.8 0.32 8.3E-06 26.3 2.5 46 57-102 94-149 (201) 131 PRK08835 consensus 87.4 0.5 1.3E-05 25.1 2.7 27 108-134 161-187 (931) 132 PRK05797 consensus 86.9 0.34 8.6E-06 26.2 1.6 24 109-132 162-185 (869) 133 PRK05755 DNA polymerase I; Pro 86.9 0.44 1.1E-05 25.4 2.2 27 108-134 159-185 (889) 134 KOG0950 consensus 82.9 0.36 9.3E-06 25.9 0.3 14 406-419 656-669 (1008) 135 KOG2207 consensus 82.7 1.2 3E-05 22.6 2.8 17 404-420 455-471 (617) 136 KOG4373 consensus 79.4 4 0.0001 19.2 7.0 35 48-82 167-201 (319) 137 pfam09281 Taq-exonuc Taq polym 79.3 4 0.0001 19.2 5.0 41 75-117 71-112 (138) 138 KOG1275 consensus 76.9 0.3 7.8E-06 26.4 -1.7 26 465-490 960-985 (1118) 139 TIGR01591 Fdh-alpha formate de 54.6 6.3 0.00016 17.9 1.3 81 305-385 337-420 (694) 140 KOG2304 consensus 53.7 10 0.00025 16.6 2.2 41 434-520 231-271 (298) 141 TIGR00888 guaA_Nterm GMP synth 47.4 11 0.00028 16.4 1.6 30 430-464 67-96 (195) 142 pfam07794 DUF1633 Protein of u 43.8 19 0.00049 14.8 3.4 11 320-330 373-383 (698) 143 TIGR02201 heptsyl_trn_III lipo 43.1 17 0.00043 15.2 2.0 12 602-613 189-200 (347) 144 TIGR00922 nusG transcription t 37.9 19 0.00049 14.8 1.6 14 365-381 58-71 (193) 145 TIGR00612 ispG_gcpE 4-hydroxy- 36.9 24 0.00062 14.1 6.1 32 63-94 83-121 (633) 146 KOG1931 consensus 35.0 21 0.00054 14.5 1.5 13 626-638 934-946 (1156) 147 KOG0184 consensus 33.2 28 0.00071 13.7 4.0 70 538-631 129-203 (254) 148 cd01779 Myosin_IXb_RA Myosin_I 32.5 25 0.00063 14.1 1.5 29 602-630 33-63 (105) 149 PRK05359 oligoribonuclease; Pr 30.9 31 0.00078 13.5 5.7 11 2-12 4-14 (181) 150 COG2388 Predicted acetyltransf 30.3 31 0.0008 13.4 2.7 24 602-625 53-76 (99) 151 COG1448 TyrB Aspartate/tyrosin 29.6 32 0.00082 13.4 3.0 17 196-212 186-202 (396) 152 COG1158 Rho Transcription term 27.2 35 0.0009 13.1 2.7 65 356-442 166-235 (422) 153 pfam00372 Hemocyanin_M Hemocya 27.2 35 0.0009 13.1 1.8 14 157-170 64-77 (278) 154 TIGR02188 Ac_CoA_lig_AcsA acet 26.6 12 0.0003 16.2 -1.0 19 278-296 389-407 (643) 155 TIGR01208 rmlA_long glucose-1- 26.5 25 0.00063 14.1 0.7 74 150-223 164-248 (361) 156 COG3935 DnaD Putative primosom 25.4 38 0.00097 12.9 3.5 66 237-302 128-193 (246) 157 pfam12030 DUF3517 Domain of un 24.2 29 0.00073 13.7 0.6 16 414-429 264-279 (337) 158 KOG0922 consensus 23.5 36 0.00091 13.1 1.0 15 55-69 57-71 (674) 159 COG1029 FwdB Formylmethanofura 23.1 42 0.0011 12.6 4.2 76 365-443 167-253 (429) 160 pfam12096 DUF3572 Protein of u 21.8 37 0.00094 13.0 0.8 21 384-404 9-29 (89) 161 TIGR00584 mug mismatch-specifi 21.6 22 0.00055 14.5 -0.4 53 377-439 215-272 (347) 162 TIGR02186 alph_Pro_TM conserve 21.2 46 0.0012 12.4 1.2 60 322-402 18-78 (275) 163 pfam05551 DUF1519 Protein of u 20.8 47 0.0012 12.3 2.4 113 196-310 28-148 (245) No 1 >PRK06887 consensus Probab=100.00 E-value=0 Score=918.40 Aligned_cols=539 Identities=20% Similarity=0.218 Sum_probs=461.3 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCH------------------------------ Q ss_conf 68999814178987000440011279819999999848039999788810------------------------------ Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDP------------------------------ 51 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~------------------------------ 51 (675) +.+++|+||.+.. + ...+++|+++++.++..+|+|+.|.+ T Consensus 357 ~~va~DtEt~~ld--------~--~~a~lVGis~s~~~g~a~YiPl~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~ql~~ 426 (954) T PRK06887 357 KLIAVDTETDSLD--------Y--MSANLVGISFALENGEACYIPLAHKQQVAEQPQSDLFAEEAETQTDYLLAPKQLNK 426 (954) T ss_pred CEEEEEEECCCCC--------C--CCCCEEEEEEEECCCCEEEEECCCCCCCCCCCCHHHHHHCCCCHHHHHCCHHHCCH T ss_conf 9279996359987--------1--32665789998358978998644421000132002332001001222102321279 Q ss_pred -HHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHHHC-CCCCCHHHHHH Q ss_conf -437999998638898199973077499999860899884203316889998678--99968989982-99811004379 Q gi|254781193|r 52 -VMPQVLKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSNGL--PSALKAVGEAL-KLSSQKMEEGK 127 (675) Q Consensus 52 -~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~--~~sL~~la~~l-~l~~~k~~~~k 127 (675) .+.+.++++|+++++.||+||++||+.+|.++ |+.+. +..+||||+||++++ .++|++++.+| +..... .+ T Consensus 427 ~~vl~~Lkplled~~i~KIghNlK~dl~vL~~~-Gi~l~-g~~fDTmLAaYLLdp~~r~~Ld~LA~~yL~~~~i~---~e 501 (954) T PRK06887 427 STCLAALKPLLENPDIHKIGQNIKYDLTIFANH-GIELQ-GVAFDTMLESYTLNSTGRHNMDDLAERYLGHETIA---FE 501 (954) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHC-CCCCC-CCCCCHHHHHHHHCCCCCCCHHHHHHHHCCCCCCC---HH T ss_conf 999999899852975022211478899999977-96247-76516989987518755578899999963886533---46 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCH-----------HCCCCHHHHHHHHHHCC Q ss_conf 9998620211234332346785853289999998998999999996301040-----------00151336666531156 Q gi|254781193|r 128 ELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLS-----------SKEQELWYLDQTINDRG 196 (675) Q Consensus 128 ~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~-----------~~E~~~~~~~~~me~~G 196 (675) ++. ++......+...+.+....||++|+++|++||..|.+.+ +.|+++..++++||.+| T Consensus 502 dl~----------gkGk~ql~f~~v~l~~~~~YAaedA~itl~L~~~L~~~L~~~~~L~~Ly~~IE~PLi~VLa~ME~~G 571 (954) T PRK06887 502 SIA----------GKGKNQLTFDQIPLEQATEYAAEDADVTMKLHQVLWLELQKEPTLVKLYKTMELPLVHVLSRIERNG 571 (954) T ss_pred HHC----------CCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 422----------5443337844344679999999999999999999999998545478999999979999999999738 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCC----CCCHHHHHHHHHCCCCH Q ss_conf 21379999999999999999999999987288547678879998877775588888898----85338998721304823 Q gi|254781193|r 197 YCVDVALAHTLNQIVDVERTKLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLEDLT----ETTCQRVLSSNKKLTPQ 272 (675) Q Consensus 197 i~vD~e~l~~l~~~~~~~~~~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~~----k~t~~~~L~~~~~~~~~ 272 (675) |.||.+.+.++..++..++++++++++++ .|..||+|||+||+++||+++|+|+.+++ .+|+.++|+.++..||. T Consensus 572 I~vD~~~L~~ls~el~~~l~~le~~iy~~-aG~eFNinSpKQL~eiLFekLgLp~~KKTKtG~~STda~vLe~L~~~h~i 650 (954) T PRK06887 572 VLIDSDALLAQSNEIAQRLTALEQQAYEL-AGQPFNLASTKQLQEILFDKLGLPVLKKTPKGAPSTNEEVLEELAYQHEL 650 (954) T ss_pred EEECHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHCCCC T ss_conf 58879999999999999999999999886-09954899989999998984299988858899977469999998623532 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCC Q ss_conf 89999999997655101335788740868-96767788400013221124788844488888651134554310123333 Q gi|254781193|r 273 AIQVFNNRMRASRSAVLKLNTLVEAMNSD-GRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSIT 351 (675) Q Consensus 273 ~~~~l~~r~~~~k~~~~~~~~~~~~~~~d-gRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~ 351 (675) +..+|+|| ++.|+.+||+++++..++++ ||||++|||.+ |.|||+||++|||||||.+++.|++ T Consensus 651 ~~~il~yR-~l~KL~sTY~d~L~~~i~~~tgRiHt~f~q~~-t~TGRlSS~~PNLQNIPiRte~Gr~------------- 715 (954) T PRK06887 651 PKLLVEHR-GLSKLKSTYTDKLPQMVNSKTGRVHTSYHQAV-TATGRLSSSDPNLQNIPIRNEEGRR------------- 715 (954) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHCCCCCCEECCCCCCCC-CCCCCCCCCCCCCCCCCCCCHHHHH------------- T ss_conf 68999998-99998999898887640778987777652011-2312013799863368878602223------------- Q ss_pred CCCCCCCCCHHHHHEEEECCCCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHH Q ss_conf 45544321003333012668987899983453723333231146201134541133101014764487956706201110 Q gi|254781193|r 352 SDPDPLQIISDCVRSCVIASEGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAI 431 (675) Q Consensus 352 ~~~~~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~ 431 (675) ||+||+|+|||+|++|||||||||||||+|+|+.|++||++|.|||+.||+.+||+|.++|++++|+. T Consensus 716 ------------iR~aF~~~~g~~l~saDYSQIELRilAh~s~d~~l~~af~~g~DiH~~tA~~vf~~~~~~v~~~~R~~ 783 (954) T PRK06887 716 ------------IRQAFIARDGYKIVAADYSQIELRIMAHLSGDQGLINAFSEGKDIHRSTAAEIFGLSLDEVTSEQRRS 783 (954) T ss_pred ------------HHEEEECCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCCHHHHHHHHHCCCHHHCCHHHHHH T ss_conf ------------34005438998899714528999999999668879999863888027789999499946689889987 Q ss_pred CCCCEEEEECCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 16520210028888889987987088831134556555455556788889999887643100010220788899999999 Q gi|254781193|r 432 GKVMELALGYQGGAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWR 511 (675) Q Consensus 432 aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r 511 (675) ||++|||++||||+.+|++ .++++..||+.+++.|+ T Consensus 784 aK~vnfgi~YG~~~~gLs~--------------------------------------------~l~i~~~eA~~~i~~yf 819 (954) T PRK06887 784 AKAINFGLIYGMSAFGLSR--------------------------------------------QLGISRADAQKYMDLYF 819 (954) T ss_pred HHHHHHHHHHCCCHHHHHH--------------------------------------------HHCCCHHHHHHHHHHHH T ss_conf 6888766566168777998--------------------------------------------70989999999999998 Q ss_pred HHCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHH Q ss_conf 87887899999999999999754917962154211332223445640232476631016786621021010023456777 Q gi|254781193|r 512 QKHSRVSELWNELHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREY 591 (675) Q Consensus 512 ~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~ 591 (675) +.||+|..|..++...+++ .+++++..||||++|++++.+...+.. .+|+ T Consensus 820 ~~~~~v~~~~~~~~~~a~~--------------------------~Gyv~Tl~GRrr~~p~i~s~n~~~r~~---~eR~- 869 (954) T PRK06887 820 QRYPSVQQFMTDIREKAKA--------------------------QGYVETLFGRRLYLPDINSSNAMRRKG---AERV- 869 (954) T ss_pred HHCCHHHHHHHHHHHHHHH--------------------------CCCEECCCCCCCCCCCCCCCCHHHHHH---HHHH- T ss_conf 8690099999999999997--------------------------794688888730687666899889978---9999- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHC-----CCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEE Q ss_conf 53145655453688999999999999967-----981889952589997792489999-999998456520215995157 Q gi|254781193|r 592 TYGGKLTENIVQAVSRDILAEGMMNATKA-----GYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKA 665 (675) Q Consensus 592 ~~~~k~~n~~iQgtaadi~k~ami~l~~~-----g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~a 665 (675) ++|+|||||||||+|.||+++++. +.+|+||||||||||||+++.+.+ +.|+++|+.+.+ ..+||.+ T Consensus 870 -----aiN~piQGtAADiiK~AMi~~~~~l~~~~~~~~~lqvHDEli~e~~~~e~e~~~~~v~~~Me~a~~--l~VPL~v 942 (954) T PRK06887 870 -----AINAPMQGTAADIIKRAMIKIDKVIRGDPDIKMIMQVHDELVFEVRSEKVEFYSELIKQEMEKAAQ--LVVPLIV 942 (954) T ss_pred -----HHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEE T ss_conf -----837565267999999999999999842789769998670668855688999999999999837410--6786373 Q ss_pred EECCCCCCC Q ss_conf 401004631 Q gi|254781193|r 666 EGYESKRYR 674 (675) Q Consensus 666 e~~~~~~y~ 674 (675) |..+|+.|. T Consensus 943 ~~~~g~nW~ 951 (954) T PRK06887 943 EVGVGDNWD 951 (954) T ss_pred ECCCCCCHH T ss_conf 107287977 No 2 >PRK07556 consensus Probab=100.00 E-value=0 Score=914.91 Aligned_cols=540 Identities=19% Similarity=0.234 Sum_probs=459.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCH------------------HHHHHHHHHHCC Q ss_conf 68999814178987000440011279819999999848039999788810------------------437999998638 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDP------------------VMPQVLKEALEN 63 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~------------------~~~~~l~~~L~~ 63 (675) +.+++|+||.+.. + ...+++|++++++.+..+|+|+.|.. .+...|+++|++ T Consensus 388 ~~~a~DtETtsld--------~--~~a~lVGisls~~~~~a~YIPl~h~~~~~~~~~~~~~~~ql~~~~vL~~LkplLed 457 (977) T PRK07556 388 GIVAFDTETTSLD--------P--MQAELVGFSLALAPGRACYIPLGHKSGDGDLFGGGLAEGQIPLRDALAALKPVLED 457 (977) T ss_pred CEEEEEEECCCCC--------H--HHCCEEEEEEEECCCCEEEEECCCCCCCCCHHCCCCCHHHCCHHHHHHHHHHHHCC T ss_conf 9589997318878--------3--01553579998068978999614445642011022223230599999998988538 Q ss_pred CCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 898199973077499999860899884203316889998678---99968989982-99811004379999862021123 Q gi|254781193|r 64 TDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSNGL---PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETH 139 (675) Q Consensus 64 ~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~ 139 (675) +.+.|||||+|||+++|.++ |+.+. .++||||+||+|++ .++|+.++..| +..... .++++. T Consensus 458 ~~i~KighnlK~d~~vL~~~-gi~l~--~i~DtmLaaYLLdp~~~~~~Ld~La~~yL~~~~~~---~~dl~g-------- 523 (977) T PRK07556 458 PSVLKIGQNMKYDALVLARH-GITVA--PFDDTMLISYALDAGRGGHGMDALSERWLGHTPIA---YKDLTG-------- 523 (977) T ss_pred CCHHHHHHHHHHHHHHHHHC-CCCCC--CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCC---CHHCCC-------- T ss_conf 73114441256889999976-96026--85318888888348866478799999964887621---000034-------- Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCH----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 4332346785853289999998998999999996301040----------001513366665311562137999999999 Q gi|254781193|r 140 ECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLS----------SKEQELWYLDQTINDRGYCVDVALAHTLNQ 209 (675) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~ 209 (675) .......+...+.+....||++|+++|++||..|.+.+ +.|+++..++.+||.+||.||.+.++++.+ T Consensus 524 --~gk~~~~f~~v~~~~~~~YaaedA~~t~~L~~~L~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~vD~~~L~~l~~ 601 (977) T PRK07556 524 --SGKSAITFDQVDIDRATAYAAEDADVTLRLWQVLKPRLAAEGLTTVYERLERPLVPVLARMEERGISVDRQVLSRLSG 601 (977) T ss_pred --CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHH T ss_conf --550104765520689999999999999999999999998423578999999789999999985696986999999999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC----CCHHHHHHHHHCC-CCHHHHHHHHHHHHH Q ss_conf 99999999999999872885476788799988777755888888988----5338998721304-823899999999976 Q gi|254781193|r 210 IVDVERTKLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLEDLTE----TTCQRVLSSNKKL-TPQAIQVFNNRMRAS 284 (675) Q Consensus 210 ~~~~~~~~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~~k----~t~~~~L~~~~~~-~~~~~~~l~~r~~~~ 284 (675) ++..++.+++++|+++. |..||+|||+||+++||+++|+|+.+++| +|+.++|+.+... ||.+..+|+|| ++. T Consensus 602 ~l~~~l~~le~~I~~~a-G~eFNi~SpkQL~~iLF~~l~lp~~kktKtG~~STd~~vLe~L~~~~h~i~~~iL~yR-~l~ 679 (977) T PRK07556 602 EFAQKAARLEAEIYELA-GEPFNIGSPKQLGDILFGKMGLPGGSKTKTGQWSTSAQVLEDLAAEGHELPRKILDWR-QLS 679 (977) T ss_pred HHHHHHHHHHHHHHHHC-CCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHH-HHH T ss_conf 99999999999999870-8987878899999999986089888868889979709999998745785689999998-899 Q ss_pred HHHHHHHHHHHHHCCCC-CCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHH Q ss_conf 55101335788740868-96767788400013221124788844488888651134554310123333455443210033 Q gi|254781193|r 285 RSAVLKLNTLVEAMNSD-GRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDC 363 (675) Q Consensus 285 k~~~~~~~~~~~~~~~d-gRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 363 (675) |+.+||+++++..++++ ||||++|||.+ |.|||+||++|||||||.+++.|.. T Consensus 680 KL~sTY~d~L~~~i~~~~gRiHt~fnq~~-t~TGRlSS~~PNLQNIPir~e~G~~------------------------- 733 (977) T PRK07556 680 KLKSTYTDALPGYIHPQTGRVHTSYALAA-TTTGRLSSSDPNLQNIPVRTEEGRK------------------------- 733 (977) T ss_pred HHHHHHHHHHHHHCCCCCCEECCCHHHHC-EECCCCCCCCCCCCCCCCCCCCCCH------------------------- T ss_conf 99999999988623789880777302323-1057645699864678887632303------------------------- Q ss_pred HHEEEECCCCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCC Q ss_conf 33012668987899983453723333231146201134541133101014764487956706201110165202100288 Q gi|254781193|r 364 VRSCVIASEGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQG 443 (675) Q Consensus 364 iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~ 443 (675) ||+||+|+|||+|++|||||||||||||+|+|+.|++||++|.|||+.||+.+||+|+++|++++|+.||++|||++||| T Consensus 734 iR~aFv~~~g~~l~saDYSQIELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~~~~~v~~~~R~~aK~iNfgi~YG~ 813 (977) T PRK07556 734 IRTAFIATPGHKLVSADYSQIELRILAHIADIPALKQAFADGLDIHAMTASEMFGVPVEGMPSEVRRRAKAINFGIIYGI 813 (977) T ss_pred HHEEECCCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHCCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHEEEEHHHHCC T ss_conf 23216049998799833668899999998178899987635987789999999599935489999865260420345566 Q ss_pred CHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHH Q ss_conf 88889987987088831134556555455556788889999887643100010220788899999999878878999999 Q gi|254781193|r 444 GAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNE 523 (675) Q Consensus 444 G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~ 523 (675) |+.+|++ + ++++..||+.+++.|++.||+|.+|..+ T Consensus 814 ~~~gL~~---~-----------------------------------------l~i~~~eA~~~i~~yf~~y~~v~~~~~~ 849 (977) T PRK07556 814 SAFGLAN---Q-----------------------------------------LGIPREEAGAYIKRYFERFPGIRAYMDA 849 (977) T ss_pred CHHHHHH---H-----------------------------------------CCCCHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 8777998---7-----------------------------------------1989999999999999868039999999 Q ss_pred HHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999754917962154211332223445640232476631016786621021010023456777531456554536 Q gi|254781193|r 524 LHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQ 603 (675) Q Consensus 524 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQ 603 (675) +-..+++ . +++++..||||++|++++.+...++. .+|+ ++|+||| T Consensus 850 ~~~~a~~---~-----------------------gyv~Tl~gRrr~~p~i~s~n~~~r~~---~eR~------a~N~piQ 894 (977) T PRK07556 850 TKAFARE---H-----------------------GYVETLFGRRIHYPDIKASNPSVRAF---NERA------AINAPIQ 894 (977) T ss_pred HHHHHHH---C-----------------------CCEECCCCCEECCCCCCCCCHHHHHH---HHHH------HHCCCCC T ss_conf 9999997---7-----------------------97788878703287556899889979---9999------8277643 Q ss_pred HHHHHHHHHHHHHHHH------CCCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCC--HHCCCCCCEEEECCCCCCC Q ss_conf 8899999999999996------7981889952589997792489999-99999845652--0215995157401004631 Q gi|254781193|r 604 AVSRDILAEGMMNATK------AGYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNP--SWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 604 gtaadi~k~ami~l~~------~g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~--~w~~~~pl~ae~~~~~~y~ 674 (675) ||||||+|.||+++++ .+.+|+||||||||||||+++.+++ +.|+++|+.++ ..-.++||.+|..+|+.|. T Consensus 895 GtaAdiiK~ami~~~~~l~~~~~~~~milqvHDEli~e~~~~e~e~~~~~v~~~Me~a~~~~~~l~VPL~v~~~~G~nW~ 974 (977) T PRK07556 895 GTAADIIRRAMIRMEDALAEEKLSARMLLQVHDELIFEVPEDEVEATIPVVRHVMENAAMPAVSLSVPLTVDARAADNWD 974 (977) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCCCCCEECCCEEEECCCCCCHH T ss_conf 55999999999999999983698862899977277896468899999999999973677678031156686548186978 No 3 >PRK05755 DNA polymerase I; Provisional Probab=100.00 E-value=0 Score=912.44 Aligned_cols=538 Identities=22% Similarity=0.258 Sum_probs=461.9 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCH-----HHHHHHHHHHCCCCCEEEEEEHHHH Q ss_conf 68999814178987000440011279819999999848039999788810-----4379999986388981999730774 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDP-----VMPQVLKEALENTDVTIVAHNSYFE 76 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~-----~~~~~l~~~L~~~~~~kV~HNa~FD 76 (675) +.+++|+||.+... ...+++|++++++++.++|+|+.|.. .+...++++|+++++.||+||+||| T Consensus 320 ~~va~dtEttsld~----------~~a~lVGis~s~~~g~a~YIPl~~~~~~~~~~vl~~Lk~lLed~~i~KIghNlK~d 389 (889) T PRK05755 320 GLVAFDTETTSLDP----------MSANLVGLSFAVEEGEAAYIPLAHDDQLDRDDALAALKPLLEDPAIPKVGQNLKYD 389 (889) T ss_pred CEEEEEEECCCCCC----------CCCCEEEEEEECCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCEEEECHHHH T ss_conf 84999984178875----------65762599998279968998444313368899999999997098872432031899 Q ss_pred HHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHH Q ss_conf 99999860899884203316889998678---99968989982-998110043799998620211234332346785853 Q gi|254781193|r 77 RTVLKACFNLDIPIHRWICTLVLARSNGL---PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNKPNHRE 152 (675) Q Consensus 77 ~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~ 152 (675) +.+|.++ |+++. +.++||||+||+++| +++|+.++..| +....+. +++. ++. ....+... T Consensus 390 l~vL~~~-GI~l~-g~~fDTmLAaYLLdP~~~~~~Ld~La~~yL~~~~~~~---eel~---gkg--------~~~~~~~v 453 (889) T PRK05755 390 RHILANY-GIELR-GIAFDTMLASYLLDPGAGRHDLDSLAERYLGHKTISF---EEVA---GKG--------NQLTFAQV 453 (889) T ss_pred HHHHHHC-CEECC-CCCHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCH---HHHC---CCC--------CCCCCCCC T ss_conf 9999977-92357-7630199999874788776778999998632344203---4313---656--------55661007 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCH----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 289999998998999999996301040----------0015133666653115621379999999999999999999999 Q gi|254781193|r 153 AWRLFGEYCKGDVTVTREIFKRLEPLS----------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERTKLDQEL 222 (675) Q Consensus 153 ~~~~~~~Ya~~Dv~~t~~L~~~l~~~~----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~l~~~i 222 (675) +.+....||+.|+.+|++||..|.+.+ +.|+++..++++||.+||.||.+.++++..++..++.++++++ T Consensus 454 ~~e~~~~YaaedA~~~~~L~~~L~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~VD~~~L~~ls~el~~~l~~le~~I 533 (889) T PRK05755 454 ALEEAAEYAAEDADVTLRLHQVLKPELEEEGLLELYEEIELPLVPVLARMERNGIKVDREYLEELSAELAQRLAELEQEI 533 (889) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87899999999886899999999999986047889999875199999999960868669999999999999999999999 Q ss_pred HHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC---CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9872885476788799988777755888888988---5338998721304823899999999976551013357887408 Q gi|254781193|r 223 ESLTYGLVSSSRCLEKLKTWLFLETGLHLEDLTE---TTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLNTLVEAMN 299 (675) Q Consensus 223 ~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~~k---~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~~~~~~~~ 299 (675) +++ .|..||+|||+||+++||+++|+|+.+++| +|+.++|+.+.+.||.+..+|+|| +++|+.+||++++++.++ T Consensus 534 ~~~-aG~eFNinSPKQL~~iLFekL~Lp~~KKTKtGySTd~~vLe~L~~~hpi~~~iLeyR-~l~KL~sTYvd~L~~~i~ 611 (889) T PRK05755 534 YEL-AGEEFNLNSPKQLGEILFEKLGLPVTKKTKTGYSTDAEVLEKLADDHPIPKKILEYR-QLSKLKSTYTDGLPKMIN 611 (889) T ss_pred HHH-CCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCHHHHHHHHHH-HHHHHHHHHHHHHHHCCC T ss_conf 987-499037799899999999853999788788988785999999875076999999997-787888889888875247 Q ss_pred CC-CCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECCCCCEEEE Q ss_conf 68-96767788400013221124788844488888651134554310123333455443210033330126689878999 Q gi|254781193|r 300 SD-GRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIASEGKKLVV 378 (675) Q Consensus 300 ~d-gRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G~~lv~ 378 (675) ++ ||||++|||.+ |.|||+||++|||||||.+++.|.. ||+||+|+|||+|++ T Consensus 612 ~~tgRIHt~f~Q~~-t~TGRLSS~~PNLQNIPir~e~Gr~-------------------------IR~aFva~~g~~l~s 665 (889) T PRK05755 612 PDTGRVHTSFNQAV-TATGRLSSSDPNLQNIPIRTEEGRR-------------------------IRQAFVAPEGYKLLS 665 (889) T ss_pred CCCCEEECCHHHHH-CCCCCCCCCCCCCCCCCCCCHHHHH-------------------------HHHEEEECCCCEEEE T ss_conf 88984611001110-3202224799853468776323044-------------------------552278589988999 Q ss_pred EECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCC Q ss_conf 83453723333231146201134541133101014764487956706201110165202100288888899879870888 Q gi|254781193|r 379 ADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKMASTTGLN 458 (675) Q Consensus 379 aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~ 458 (675) |||||||||||||+|+|+.|++||++|.|+|+.||+.+||+|+++|++++|+.||++|||++||||+.+|++ T Consensus 666 aDYSQIELRilAh~S~D~~li~af~~g~DiH~~tA~~vf~v~~~~vt~~~R~~AK~vnfGiiYG~s~~gLs~-------- 737 (889) T PRK05755 666 ADYSQIELRILAHLSGDEGLIEAFAEGEDIHTATASEVFGVPLEEVTSEQRRRAKAINFGLIYGMSAFGLAR-------- 737 (889) T ss_pred ECHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHEEEHHHHCCCHHHHHH-------- T ss_conf 512089999999980899999998559986899999993999001899998662613221444668778998-------- Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCEEE Q ss_conf 31134556555455556788889999887643100010220788899999999878878999999999999997549179 Q gi|254781193|r 459 LNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQTIENGKAI 538 (675) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~ 538 (675) ..+++..||+.+++.|++.||+|.+|..++-..+++ + T Consensus 738 ------------------------------------~l~is~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~---~---- 774 (889) T PRK05755 738 ------------------------------------QLGISRKEAKEYIDRYFERYPGVKEYMEETVAQAKE---K---- 774 (889) T ss_pred ------------------------------------HHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH---C---- T ss_conf ------------------------------------819999999999999999681099999999999997---7---- Q ss_pred EECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 62154211332223445640232476631016786621021010023456777531456554536889999999999999 Q gi|254781193|r 539 IARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMMNAT 618 (675) Q Consensus 539 ~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~ 618 (675) +++++..||||++|++++.+...+.. .+|+ ++|+|||||||||+|.||++++ T Consensus 775 -------------------Gyv~Tl~gRrr~~p~i~s~n~~~r~~---~eR~------AiN~pIQGsAADiiK~AMi~v~ 826 (889) T PRK05755 775 -------------------GYVETLFGRRRYLPDINSSNGARRAF---AERA------AINAPIQGSAADIIKIAMIRVD 826 (889) T ss_pred -------------------CCEECCCCCEEECCCCCCCCHHHHHH---HHHH------HHCCCCCHHHHHHHHHHHHHHH T ss_conf -------------------97778778705278677899889989---9999------9675655569999999999999 Q ss_pred H-----CCCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf 6-----7981889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 619 K-----AGYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 619 ~-----~g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) + .+.+|+||||||||||||+++.+++ +.|+++|+.+.+ ..+||.+|..+|+.|. T Consensus 827 ~~l~~~~~~~~llQVHDEli~Ev~~~~~e~~~~~v~~~Me~a~~--l~VPl~v~~~~g~nW~ 886 (889) T PRK05755 827 DALKEGLKARMLLQVHDELVFEVPEDELEEVKKLVKEVMENAVE--LSVPLVVDVGVGDNWD 886 (889) T ss_pred HHHHHCCCCEEEEEEEEEEEEECCHHHHHHHHHHHHHHHHCCEE--CCCCEEEECCCCCCHH T ss_conf 99984778738998655777842788999999999999718500--7887586238168967 No 4 >PRK07997 consensus Probab=100.00 E-value=0 Score=913.29 Aligned_cols=539 Identities=20% Similarity=0.234 Sum_probs=460.2 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCC----------HHHHHHHHHHHCCCCCEEEEE Q ss_conf 6899981417898700044001127981999999984803999978881----------043799999863889819997 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHD----------PVMPQVLKEALENTDVTIVAH 71 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~----------~~~~~~l~~~L~~~~~~kV~H 71 (675) +.+++|+||.+...+ ..+++|+++++..+..+|+|+.|. +.+.+.++++|+++.+.||+| T Consensus 350 ~~~a~DtEt~sld~~----------~a~lvGiS~s~~~g~a~YiPl~h~~l~~~~ql~~~~vl~~LkplLed~~i~KIgh 419 (928) T PRK07997 350 PVFAFDTETDSLDNI----------SANLVGLSFAIEPGVAAYLPVAHDYLDAPDQISRERVLELLKPLLEDEKALKVGQ 419 (928) T ss_pred CEEEEEEECCCCCCC----------CCCCCCEEEEECCCCEEEEECCCCCCCCHHHCCHHHHHHHHHHHHCCCCCCHHCC T ss_conf 869999512786600----------0244433898627746874022123476001249999999898752865303212 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 3077499999860899884203316889998678---99968989982-9981100437999986202112343323467 Q gi|254781193|r 72 NSYFERTVLKACFNLDIPIHRWICTLVLARSNGL---PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNK 147 (675) Q Consensus 72 Na~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~ 147 (675) |++||+.+|.++ |+.+. +..+||||+||++++ .++|+.++..| +... ...+++. +...... T Consensus 420 nlK~d~~vL~~~-GI~l~-gi~fDTmLAaYLLnp~~~~~~Ld~La~~yL~~~~---i~~~el~----------gkgk~q~ 484 (928) T PRK07997 420 NLKFDRGILANY-GIELR-GIAFDTMLESYILNSVAGRHDMDSLAERWLKHKT---ITFEEIA----------GKGKNQL 484 (928) T ss_pred HHHHHHHHHHHC-CCEEC-CCCCCHHHHHHHCCCCCCCCCHHHHHHHHHCCCC---CCHHHHC----------CCCCCCC T ss_conf 168899999977-94105-6430345532222443344788999999713466---4335432----------4464547 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCH-----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 85853289999998998999999996301040-----------0015133666653115621379999999999999999 Q gi|254781193|r 148 PNHREAWRLFGEYCKGDVTVTREIFKRLEPLS-----------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERT 216 (675) Q Consensus 148 ~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~-----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~ 216 (675) .+...+.+....||++|+++|++||..|.+.+ +.|+++..++++||.+||.||.+.+.++..++..+++ T Consensus 485 ~f~~v~~~~~~~Ya~edA~~t~~L~~~L~~~L~~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~ 564 (928) T PRK07997 485 TFNQIALEEAGRYAAEDADVTLQLHLKMWPKLQQHKGPLNVFENIEMPLVPVLSRIERNGVKIDPAVLHAHSEELTLRLA 564 (928) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHH T ss_conf 74353599999999999999999999999998621449999999998999999999853959989999999999999999 Q ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCC---C-CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999987288547678879998877775588888898---8-533899872130482389999999997655101335 Q gi|254781193|r 217 KLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLEDLT---E-TTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLN 292 (675) Q Consensus 217 ~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~~---k-~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~ 292 (675) +++++++++ .|..||+|||+||+++||+++|+|+.+++ . +|+.++|+.++..||.++.+|+||. +.|+.+||++ T Consensus 565 ~le~~I~~~-aG~eFNinSpKQLgeILFekLgLp~~KKTktG~~ST~~~vLe~L~~~h~i~~~il~yR~-l~KL~sTY~d 642 (928) T PRK07997 565 ELEKKAHEI-AGEPFNLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELALDYPLPKVILEYRG-LAKLKSTYTD 642 (928) T ss_pred HHHHHHHHH-CCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHH-HHHHHHHHHH T ss_conf 999999986-29988878999999998986299988858899968469999972025750899999989-9998998898 Q ss_pred HHHHHCCC-CCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECC Q ss_conf 78874086-89676778840001322112478884448888865113455431012333345544321003333012668 Q gi|254781193|r 293 TLVEAMNS-DGRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIAS 371 (675) Q Consensus 293 ~~~~~~~~-dgRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~ 371 (675) +++..+++ +||||++|||.+ |.|||+||++|||||||.+++.|.. ||+||+|+ T Consensus 643 ~L~~~i~~~~gRiHt~f~Q~~-t~TGRlSS~~PNLQNIPir~~~G~~-------------------------iR~aFi~~ 696 (928) T PRK07997 643 KLPLMINPKTGRVHTSYHQAV-TATGRLSSTDPNLQNIPVRNEEGRR-------------------------IRQAFIAP 696 (928) T ss_pred HHHHHHCCCCCEEEEEEHCCC-CCCCCCCCCCCCCCCCCCCCHHHHH-------------------------HHHHHCCC T ss_conf 889753866687820301110-2125112589986758888622578-------------------------99874378 Q ss_pred CCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHH Q ss_conf 98789998345372333323114620113454113310101476448795670620111016520210028888889987 Q gi|254781193|r 372 EGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKM 451 (675) Q Consensus 372 ~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~ 451 (675) |||+|++|||||||||||||+|+|+.|++||++|.|||+.||+.+||+|.++|++++|+.||++|||++||||+.+|++ T Consensus 697 ~g~~l~saDYSQiELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~~~~~v~~~~R~~aK~vnfgi~YG~s~~gLa~- 775 (928) T PRK07997 697 EDYVIVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLEKVTSEQRRSAKAINFGLIYGMSAFGLAR- 775 (928) T ss_pred CCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCCCHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH- T ss_conf 9987997056289999999997687688775168883401179882999465798899886877766466268778998- Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 98708883113455655545555678888999988764310001022078889999999987887899999999999999 Q gi|254781193|r 452 ASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQT 531 (675) Q Consensus 452 ~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~ 531 (675) .++++..||+.+++.|++.||+|..|..++-..+++ T Consensus 776 -------------------------------------------~l~i~~~eA~~~i~~yf~~~~~v~~~~~~~~~~a~~- 811 (928) T PRK07997 776 -------------------------------------------QLNIPRKEAQKYMDLYFERYPGVLEYMERTRAQAKE- 811 (928) T ss_pred -------------------------------------------HHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH- T ss_conf -------------------------------------------709999999999999988683399999999999997- Q ss_pred HHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 75491796215421133222344564023247663101678662102101002345677753145655453688999999 Q gi|254781193|r 532 IENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILA 611 (675) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k 611 (675) .+++++..||||++|++++.+...+.. .+|+ ++|+|||||||||+| T Consensus 812 -------------------------~gyv~Tl~gRrr~~~~i~s~n~~~r~~---~eR~------a~N~piQgtaADiik 857 (928) T PRK07997 812 -------------------------QGYVETLDGRRLYLPDIKSSNAARRAA---AERA------AINAPMQGTAADIIK 857 (928) T ss_pred -------------------------CCCEECCCCCEECCCCCCCCCHHHHHH---HHHH------HHCCCCCHHHHHHHH T ss_conf -------------------------794788889614588767899889968---9999------958113378999999 Q ss_pred HHHHHHHHC------CCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf 999999967------981889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 612 EGMMNATKA------GYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 612 ~ami~l~~~------g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) .||+++++. ..+|+||||||||||||+++.+.+ +.|+++|+.+.+ ..+||.+|..+|+.|. T Consensus 858 ~Ami~~~~~l~~~~~~~~~~lqvHDEli~e~~~~~~~~~~~~~~~~Me~a~~--l~VPL~v~~~~g~nW~ 925 (928) T PRK07997 858 RAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDLDAVAKKIHQLMENCTR--LDVPLLVEVGSGENWD 925 (928) T ss_pred HHHHHHHHHHHHCCCCCEEEEEECCEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 9999999999844888648973560678842688999999999999728443--7887585359277968 No 5 >PRK07625 consensus Probab=100.00 E-value=0 Score=907.05 Aligned_cols=539 Identities=20% Similarity=0.205 Sum_probs=460.8 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCC----------HHHHHHHHHHHCCCCCEEEEE Q ss_conf 6899981417898700044001127981999999984803999978881----------043799999863889819997 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHD----------PVMPQVLKEALENTDVTIVAH 71 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~----------~~~~~~l~~~L~~~~~~kV~H 71 (675) +.+++|+||.|... ...+++|+++++..+..+|+|+.|. +.+...++++|+++++.||+| T Consensus 345 ~~~a~dtEttsld~----------~~a~lvGis~s~~~g~a~YiPl~h~~~d~~~~l~~~~vl~~Lk~~Led~~i~KIgh 414 (922) T PRK07625 345 ELTAFDTETTSLDP----------MLAQLVGLSFSVEPGHAAYIPVAHRGPDMPAQLPRDEVLARLRPWLEDAGRKKLGQ 414 (922) T ss_pred CEEEEEEECCCCCH----------HHCEEEEEEEEECCCCEEEEEECCCCCCHHHHCCHHHHHHHHHHHHHCCCCHHHHH T ss_conf 84899952277650----------01305657977438867999702158652321589999998799874344035542 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 3077499999860899884203316889998678--99968989982-99811004379999862021123433234678 Q gi|254781193|r 72 NSYFERTVLKACFNLDIPIHRWICTLVLARSNGL--PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNKP 148 (675) Q Consensus 72 Na~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~--~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~ 148 (675) |++||+.+|.++ |+.+. +..+||||+||++++ +++|++++..+ +....+. +++. ++...... T Consensus 415 NlK~d~~vL~~~-GI~l~-g~~fDTmLAaYLL~~~r~~~Ld~La~~yL~~~~i~~---edl~----------Gkg~~q~~ 479 (922) T PRK07625 415 HLKYDAQVLANY-GIALN-GIEHDTLLESYVLESHRTHDMDSLALRHLGVKTIKY---EDVA----------GKGAKQIG 479 (922) T ss_pred HHHHHHHHHHHC-CCCCC-CCCHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCH---HHHC----------CCCCCCCC T ss_conf 577899999977-96458-754039999887546767887999998725565541---3212----------33333378 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCH-----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 5853289999998998999999996301040-----------00151336666531156213799999999999999999 Q gi|254781193|r 149 NHREAWRLFGEYCKGDVTVTREIFKRLEPLS-----------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERTK 217 (675) Q Consensus 149 ~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~-----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~ 217 (675) +...+.+....||++|+++|++||..|.+.+ +.|+++..++++||.+||.||.+.++++..++..++.+ T Consensus 480 ~~~v~~~~~~~Ya~eda~~t~~L~~~L~~~L~~e~~L~~ly~~iE~PLi~VLa~ME~~GI~vD~~~L~~l~~~l~~~l~~ 559 (922) T PRK07625 480 FDEVALAQASEYAAEDADITLQLHHALYPQVAREPGLERVYRDIELPVSLVLRKMERTGVLIDADRLSRQSGEIATRLLA 559 (922) T ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH T ss_conf 32246799999999999999999999999983405479999999958999999999828402699999999999999999 Q ss_pred HHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCC---C-CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999987288547678879998877775588888898---8-5338998721304823899999999976551013357 Q gi|254781193|r 218 LDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLEDLT---E-TTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLNT 293 (675) Q Consensus 218 l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~~---k-~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~~ 293 (675) ++++++++ .|..||+|||+||+++||+++|+|+.+++ . +|+.++|+.+.+.||.+..+|+||+ ++|+.+||+++ T Consensus 560 le~~i~~~-aG~eFNinSpkQL~~iLFe~L~Lp~~kKTktG~~ST~~~vLe~L~~~hpi~~~il~yR~-l~KL~sTY~d~ 637 (922) T PRK07625 560 LEQEAYEL-AGGEFNLGSPKQIGQIFFERLQLPVVKKTPSGAPSTDEEVLQKLAEDYPLPKLLLEHRG-LSKLKSTYTDK 637 (922) T ss_pred HHHHHHHH-CCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHH-HHHHHHHHHHH T ss_conf 99987775-49977899999999999985389878858989978539999986006873899998628-98999999988 Q ss_pred HHHHCCCC-CCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECCC Q ss_conf 88740868-96767788400013221124788844488888651134554310123333455443210033330126689 Q gi|254781193|r 294 LVEAMNSD-GRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIASE 372 (675) Q Consensus 294 ~~~~~~~d-gRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~ 372 (675) +++.++++ ||||++|+|.+ |.|||+||++|||||||.+++.|.. ||+||+|+| T Consensus 638 l~~~i~~~~gRiHt~f~Q~~-t~TGRlSS~~PNLQNIPir~~~G~~-------------------------iR~aFi~~~ 691 (922) T PRK07625 638 LPRMVNPDTGRVHTNYAQAV-AVTGRLASNDPNLQNIPVRTEEGRR-------------------------IREAFIAPP 691 (922) T ss_pred HHHHCCCCCCEECCCHHEEE-CCCCCCCCCCCCCCCCCCCCCCCCH-------------------------HHEEEECCC T ss_conf 88850888787766200210-2335435799863368877622401-------------------------334473189 Q ss_pred CCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHHH Q ss_conf 87899983453723333231146201134541133101014764487956706201110165202100288888899879 Q gi|254781193|r 373 GKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKMA 452 (675) Q Consensus 373 G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~ 452 (675) ||+|++|||||||||||||+|+|+.|++||++|.|+|+.||+.+||+|.++|++++|+.||++|||++||||+.+|++ T Consensus 692 g~~l~~aDYSQiELRilAh~s~d~~l~~af~~~~DiH~~tA~~vf~v~~~~v~~~~R~~AK~vnfgiiYG~s~~gLa~-- 769 (922) T PRK07625 692 GSKIVSADYSQIELRIMAHISGDESLLRAFAHGEDIHRATAAEVFGVTPLEVTSDQRRIAKVINFGLIYGMSAFGLAS-- 769 (922) T ss_pred CCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCHHHHHCCCHHHCCHHHHHHHHEEEEHHHHCCCHHHHHH-- T ss_conf 988998155388999999995788799998659985424269996989576899998651702002365578677998-- Q ss_pred HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHH Q ss_conf 87088831134556555455556788889999887643100010220788899999999878878999999999999997 Q gi|254781193|r 453 STTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQTI 532 (675) Q Consensus 453 ~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~~ 532 (675) + .+++..||+.+++.|++.||+|..|..++...+++ T Consensus 770 -~-----------------------------------------l~i~~~eA~~~i~~yf~~y~~v~~~~~~~~~~a~~-- 805 (922) T PRK07625 770 -N-----------------------------------------LGITRDAAKLYIDRYFARYPGVARYMEETRMRAKE-- 805 (922) T ss_pred -H-----------------------------------------CCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH-- T ss_conf -7-----------------------------------------19999999999999998691299999999999987-- Q ss_pred HCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 54917962154211332223445640232476631016786621021010023456777531456554536889999999 Q gi|254781193|r 533 ENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAE 612 (675) Q Consensus 533 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ 612 (675) .+++++..||||++|++++.+...++. .+|+ ++|+|||||||||+|. T Consensus 806 ------------------------~gyv~Tl~gRrr~~p~i~s~n~~~r~~---~eR~------AiN~piQGsaADiiK~ 852 (922) T PRK07625 806 ------------------------KGYVETVFGRRLWLPEINGGNGPRRQA---AERA------AINAPMQGTAADLIKL 852 (922) T ss_pred ------------------------CCEEECCCCCCCCCCCCCCCCHHHHHH---HHHH------HHCCCCCHHHHHHHHH T ss_conf ------------------------792667778603176556888788879---9999------9650244679999999 Q ss_pred HHHHHHH------CCCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf 9999996------7981889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 613 GMMNATK------AGYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 613 ami~l~~------~g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) ||+++++ .+.+|+||||||||||||+++++.+ +.|+++|+.+.+ ..+||.+|..+|+.|. T Consensus 853 Ami~i~~~l~~~~~~~~~ilqvHDEli~e~~~~~~~~~~~~~~~~Me~~~~--l~VPL~V~~~~G~nW~ 919 (922) T PRK07625 853 SMIAVDDWLERDGLRTRMIMQVHDELVLEVPDDELSLVREKLPEMMCGVAK--LKVPLVAEVGVGANWE 919 (922) T ss_pred HHHHHHHHHHHCCCCCCEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 999999999846998716998754778840788999999999999727415--7887683328278978 No 6 >PRK08786 consensus Probab=100.00 E-value=0 Score=906.81 Aligned_cols=539 Identities=19% Similarity=0.194 Sum_probs=459.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCC----------HHHHHHHHHHHCCCCCEEEEE Q ss_conf 6899981417898700044001127981999999984803999978881----------043799999863889819997 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHD----------PVMPQVLKEALENTDVTIVAH 71 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~----------~~~~~~l~~~L~~~~~~kV~H 71 (675) +.+++|+||.+.. +. ..+++|++++++.+..+|+++.|. ..+...+.++|+++.+.||+| T Consensus 349 ~~~afdtET~sld--------~~--~a~lVGiS~s~~~g~a~YiP~~h~~~g~~~qld~~~~l~~LkplLed~~I~KIgh 418 (927) T PRK08786 349 GQFAFDTETDSLD--------PL--QADLIGLSVAAEPGQAAYLPFGHDFPGAPVQLDRTQALAQLAPLLTDPAVRKVGQ 418 (927) T ss_pred CEEEEEEECCCCC--------HH--HHHHHCEEEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 8389997238866--------55--5312122454267866775324456786034139999999899860865112321 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 3077499999860899884203316889998678---99968989982-9981100437999986202112343323467 Q gi|254781193|r 72 NSYFERTVLKACFNLDIPIHRWICTLVLARSNGL---PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNK 147 (675) Q Consensus 72 Na~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~ 147 (675) |++||+.+|.++ |+.+. +.++||||+||++++ .++|+.++..+ +....+. ++++. +..... T Consensus 419 nlK~Dl~vL~~~-GI~l~-g~~~DTmLAaYLL~p~~~~~~Ld~La~~yLg~~~~~~---~~l~g----------kg~k~~ 483 (927) T PRK08786 419 HGKYDLHVMRRH-GIALA-GYADDTLLESFVLNSGSARHDMDSLAKRYLGYDTVKY---EDVCG----------KGAKQI 483 (927) T ss_pred HHHHHHHHHHHC-CCCCC-CCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCH---HHHCC----------CCCCCC T ss_conf 178889999967-97116-7664258888623776454788999999669775426---75424----------432228 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCH-----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 85853289999998998999999996301040-----------0015133666653115621379999999999999999 Q gi|254781193|r 148 PNHREAWRLFGEYCKGDVTVTREIFKRLEPLS-----------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERT 216 (675) Q Consensus 148 ~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~-----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~ 216 (675) ++...+.+....||++|+++|++||..|.+.+ +.|+++..++++||.+||.||.+.++++..++..++. T Consensus 484 ~~~~v~~~~~~~Ya~eda~~tl~L~~~l~~kL~~e~~l~~ly~~iE~PLi~VLa~ME~~GI~vD~~~L~~l~~~l~~~l~ 563 (927) T PRK08786 484 PFAQVSLEDATRYAAEDADITLRLHCVLGPKLAAEPGLERVYRDIEMPLVDVLERIEANGVCVDAAELRRQSADLSKRML 563 (927) T ss_pred CHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHH T ss_conf 83332599999999999999999999999999850529999999997999999998856879889999999999999999 Q ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCC---C-CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999987288547678879998877775588888898---8-533899872130482389999999997655101335 Q gi|254781193|r 217 KLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLEDLT---E-TTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLN 292 (675) Q Consensus 217 ~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~~---k-~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~ 292 (675) +++++++++ .|..||+|||+||+++||+++|+|+.+++ . +|+.++|+.+.+.||.+..+|+|| ++.|+.+||++ T Consensus 564 ~le~~I~~~-aG~eFNinSpKQL~~iLFekL~Lp~~kKtktG~~ST~~~vLe~L~~~h~i~~~iL~yR-~l~KL~sTY~d 641 (927) T PRK08786 564 AAQQKATEL-AGRTFNLDSPKQLQALLFDELKLPAVVKTPKGQPSTNEEALEAIADQHELPRVILEYR-SLAKLRSTYTD 641 (927) T ss_pred HHHHHHHHH-CCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHH-HHHHHHHHHHH T ss_conf 999999997-0855578999999999975028987873889997774999998763567769999998-88877878899 Q ss_pred HHHHHCCC-CCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECC Q ss_conf 78874086-89676778840001322112478884448888865113455431012333345544321003333012668 Q gi|254781193|r 293 TLVEAMNS-DGRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIAS 371 (675) Q Consensus 293 ~~~~~~~~-dgRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~ 371 (675) ++++.+++ +||||++|||.+ |.|||+||++|||||||.+++.|.. ||+||+|+ T Consensus 642 ~L~~~i~~~~gRiHt~f~Q~~-t~TGRlSS~~PNLQNIPirt~~G~~-------------------------iR~aFi~~ 695 (927) T PRK08786 642 KLPEMIHPQSGRVHTSYHQAG-AATGRLSSSDPNLQNIPIRTEDGRR-------------------------IRRAFVAP 695 (927) T ss_pred HHHHHCCCCCCEECCCCHHHC-CCCCEECCCCCCCCCCCCCCCCCCC-------------------------HHHHEECC T ss_conf 888740766576624216311-4545252799341148877633302-------------------------26660258 Q ss_pred CCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHH Q ss_conf 98789998345372333323114620113454113310101476448795670620111016520210028888889987 Q gi|254781193|r 372 EGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKM 451 (675) Q Consensus 372 ~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~ 451 (675) |||+|++|||||||||||||+|+|+.|++||++|.|||+.||+.+||++.++|++++|+.||++|||++||||+.+|++ T Consensus 696 ~g~~l~saDYSQIELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~~~~~v~~~~R~~aK~infgiiYG~s~~gL~~- 774 (927) T PRK08786 696 AGRKLIACDYSQIELRIMAHLSGDPGLVGAFESGADVHRATAAEVFGRTIDTVSGDERRAAKAINFGLMYGMSAFGLAR- 774 (927) T ss_pred CCCEEEEECHHHHHHHHHHHHHCCHHHHHHHCCCCCCCHHHHHHHHCCCHHHCCHHHHHHHHEEEEEHHHCCCHHHHHH- T ss_conf 9887997236488999999982886477775478673032689882999576697788551521020033378777998- Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 98708883113455655545555678888999988764310001022078889999999987887899999999999999 Q gi|254781193|r 452 ASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQT 531 (675) Q Consensus 452 ~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~ 531 (675) ..+++..||+.+++.|++.||+|.+|..++-..++ T Consensus 775 -------------------------------------------~l~i~~~eA~~~i~~yf~~y~~v~~~~~~~~~~a~-- 809 (927) T PRK08786 775 -------------------------------------------QLGIGRGEAQDYIALYFSRYPGVRDFMETTRQQAR-- 809 (927) T ss_pred -------------------------------------------HCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH-- T ss_conf -------------------------------------------70999999999999999969109999999999998-- Q ss_pred HHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 75491796215421133222344564023247663101678662102101002345677753145655453688999999 Q gi|254781193|r 532 IENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILA 611 (675) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k 611 (675) ++ +++++..||||++|++++.+...+.. .+|+ ++|+|||||||||+| T Consensus 810 -~~-----------------------gyv~Tl~gRrr~~p~i~s~n~~~r~~---aeR~------A~N~piQGsAADiiK 856 (927) T PRK08786 810 -DK-----------------------GYVETVFGRRLYLDFINAGSQGQRAG---AERA------AINAPMQGTAADIIK 856 (927) T ss_pred -HC-----------------------CEEECCCCCCCCCCCCCCCCHHHHHH---HHHH------HHCCCCCHHHHHHHH T ss_conf -77-----------------------91788888603288667888778878---9999------858344357999999 Q ss_pred HHHHHHHHC------CCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf 999999967------981889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 612 EGMMNATKA------GYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 612 ~ami~l~~~------g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) .||+++++. ..+|+||||||||||||+++.+++ +.|+++|+.+.+ ..+||.+|..+|+.|. T Consensus 857 ~AMi~v~~~l~~~~~~~~mlLQVHDELifEv~~~e~~~~~~~v~~~Me~a~~--L~VPL~v~~~~G~nW~ 924 (927) T PRK08786 857 RAMVKVDGWIADHAQRALMILQVHDELVFEADEDFVDTLLSEVTARMAAAAE--LRVPLVVDSGVGDNWD 924 (927) T ss_pred HHHHHHHHHHHHCCCCCEEEEEECCEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEECCCCCCCHH T ss_conf 9999999999726878669986672568851689999999999999817445--7787573038386967 No 7 >PRK05929 consensus Probab=100.00 E-value=0 Score=896.08 Aligned_cols=536 Identities=19% Similarity=0.163 Sum_probs=451.9 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCC-CHHHHHHHHHHHCCCCCEEEEEEHHHHHHHH Q ss_conf 689998141789870004400112798199999998480399997888-1043799999863889819997307749999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVH-DPVMPQVLKEALENTDVTIVAHNSYFERTVL 80 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~-~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L 80 (675) +.+++|+||.+.. ....+++|++++. ++.++|+++.+ +..+...++++|+++++.|+|||++||+.+| T Consensus 311 ~~~a~~~~~~~~~----------~~~~~lvGisls~-~~~a~yip~~~~~~~vl~~Lk~~led~~i~kig~NlK~d~~vL 379 (870) T PRK05929 311 GEVAFCVAYTGNH----------LPSLKLHGVALAG-GSQVFYIEVEESGDQEIALLKDFFARKDTTFYGYNLKRDNHAL 379 (870) T ss_pred CCEEEEEEECCCC----------CCCCCEEEEEEEC-CCEEEEEECCCCCHHHHHHHHHHHCCCCCCEEECCHHHHHHHH T ss_conf 9747999745887----------6777288999976-9808999765555238999999860987745322377799999 Q ss_pred HHHCCCCCCCCCEEEHHHHHHHHCCC--CCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 98608998842033168899986789--9968989982998110043799998620211234332346785853289999 Q gi|254781193|r 81 KACFNLDIPIHRWICTLVLARSNGLP--SALKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFG 158 (675) Q Consensus 81 ~~~~Gi~i~~~~~~DTmi~A~~l~~~--~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) .++ |+++. +..+|||+++|++++. .+++.+...+.+.......++. .. . ...+... ..+... T Consensus 380 ~~~-GI~l~-g~~fDtmLAsYLLn~~~~~~~~~ll~~~~~~~~~~~~~k~--------~g---~--~~l~~~~-~~~~~~ 443 (870) T PRK05929 380 KNA-GIDVH-EITADLALAEHLVNGGAKISFQTLLVEHGHIEEAHRFAKE--------WG---A--SSLPIQR-LPEQPA 443 (870) T ss_pred HHC-CCCCC-CCHHHHHHHHHHHCCCCCCCHHHHHHHCCCCHHHHHHHHH--------HC---C--CCCCCCC-CHHHHH T ss_conf 968-98568-7277999999862999988839999766673155565554--------07---6--6664200-738899 Q ss_pred HHHHHHHHHHHHHHHHCCCCH----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 998998999999996301040----------0015133666653115621379999999999999999999999987288 Q gi|254781193|r 159 EYCKGDVTVTREIFKRLEPLS----------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERTKLDQELESLTYG 228 (675) Q Consensus 159 ~Ya~~Dv~~t~~L~~~l~~~~----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~l~~~i~~~~~~ 228 (675) .|++.|+..+++|+..|.+.+ +.|+++..++++||.+||.||.+.++++..++..++++++++|+++ .| T Consensus 444 ~yaae~a~~~~~L~~~L~~~L~~~~L~~Ly~eIE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~~Le~~I~~~-AG 522 (870) T PRK05929 444 QYFGEFVSYLPIIKNYLLEELEEKGLKDIFSNVEMPLEKVLFAMERAGMPLDVEGLAVLERDLSKELAVLTQEIYDL-AG 522 (870) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHH-CC T ss_conf 99999999999999999999988779999999874489999998738989819999989999999999999999997-29 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCEEEE Q ss_conf 547678879998877775588888898853389987213048238999999999765510133578874086-8967677 Q gi|254781193|r 229 LVSSSRCLEKLKTWLFLETGLHLEDLTETTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLNTLVEAMNS-DGRLRGT 307 (675) Q Consensus 229 ~~~n~~S~~ql~~~L~~~lg~~~~~~~k~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~~~~~~~~~-dgRih~~ 307 (675) ..||+|||+||+++||+++|+|+.+++|+|+.++|+.+...||.+..+|+|| +++|+.+||++++++.+++ +||||++ T Consensus 523 ~eFNinSPKQL~evLFekLgL~~~kKtkST~~evLe~L~~~hpi~~~iLeyR-~l~KL~sTYvd~L~~~i~~~tgRIHTs 601 (870) T PRK05929 523 CPFNIKSPKQLSDILYNRLGLRPIDKAKSTKAEVLEALEGEHEIIEKILAFR-AIEKLLSTYVKALPKQIDSHTQRIHPT 601 (870) T ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCCEECCE T ss_conf 9988789899999989863999888889872899997406002689999985-899875046777554225676877762 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEE-ECCCCCEEEEEECCCCCH Q ss_conf 8840001322112478884448888865113455431012333345544321003333012-668987899983453723 Q gi|254781193|r 308 LQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCV-IASEGKKLVVADLAGIEA 386 (675) Q Consensus 308 ~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f-~a~~G~~lv~aDySqiEl 386 (675) |||.+ |.|||+||++|||||||.+++.|+. ||+|| +|++||+|++|||||||| T Consensus 602 fnQt~-TaTGRLSSs~PNLQNIPiRte~Gr~-------------------------IR~aF~~~~~g~~llsaDYSQIEL 655 (870) T PRK05929 602 FNQVG-TVTGRLACQDPNLQNIPIRSERGRL-------------------------LRKAFRLTKKNDYFLSADYSQIEL 655 (870) T ss_pred EECCC-CCCCCCCCCCCCCCCCCCCCCCCHH-------------------------HHHHEEECCCCCEEEEECHHHHHH T ss_conf 33110-1113224689976536775411156-------------------------787513289997799612668899 Q ss_pred HHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 33323114620113454113310101476448795670620111016520210028888889987987088831134556 Q gi|254781193|r 387 RVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKMASTTGLNLNTFAHLV 466 (675) Q Consensus 387 RvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~~ 466 (675) |||||+|+|+.|++||++|.|||+.||+.+||+|.++|++++|+.||++|||++||||+.+|++ T Consensus 656 RilAHlS~D~~Li~AF~~g~DIH~~tA~~if~v~~~~Vt~~~Rr~AK~vNFGiiYG~s~fgLa~---------------- 719 (870) T PRK05929 656 RFLAHLSQDESLKRAFESGEDIHAFTASQVFHVPLEQVTKQQRMQAKTVNFGIVYGQQAYGLSK---------------- 719 (870) T ss_pred HHHHHHHCCHHHHHHHHCCCHHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH---------------- T ss_conf 9999983678689875056207665269884898121698789886883545675658788998---------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 55545555678888999988764310001022078889999999987887899999999999999754917962154211 Q gi|254781193|r 467 KTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQTIENGKAIIARKRRDV 546 (675) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (675) ..+++..||+.+++.|++.||+|++|..++-..+++ + T Consensus 720 ----------------------------~L~is~~eA~~~i~~Yf~~yp~v~~y~~~~~~~a~~---~------------ 756 (870) T PRK05929 720 ----------------------------ILKISVSEAQGLIDAYFARYPEAAEFITETIEQASK---N------------ 756 (870) T ss_pred ----------------------------HHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH---C------------ T ss_conf ----------------------------849899999999999998691299999999999997---6------------ Q ss_pred CCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC------ Q ss_conf 33222344564023247663101678662102101002345677753145655453688999999999999967------ Q gi|254781193|r 547 PLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMMNATKA------ 620 (675) Q Consensus 547 ~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~~~------ 620 (675) ++++|..||||++|++++.+. + ....+|+ ++|+|||||||||+|.||+++++. T Consensus 757 -----------Gyv~Tl~gRrr~ip~i~~~~~-~---r~~~eR~------AiN~piQGtAADiiK~AMi~i~~~l~~~~~ 815 (870) T PRK05929 757 -----------LRVTTMLGRERIIDDWEEFPG-S---RAASGRL------AVNTRIQGSAAELIKLAMLNISDALKSRGL 815 (870) T ss_pred -----------CCEECCCCCCCCCCCCCCCHH-H---HHHHHHH------HHCCCCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf -----------967676886002886567745-6---7788898------818766648999999999999999985698 Q ss_pred CCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf 981889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 621 GYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 621 g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) ..+|+||||||||||||+++.+.+ +.|+++|+.+.+ .++||.+|..+|+.|. T Consensus 816 ~~~mllQVHDELvfEv~~~e~~~~~~~v~~~Me~~~~--l~VpL~v~~~~g~nW~ 868 (870) T PRK05929 816 KSRMLLQIHDELLFEVPEEELEEMQRLVREKMESAMT--LSVPLVVNILIGKNWA 868 (870) T ss_pred CCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCCC T ss_conf 7649998530678842788999999999999727203--6887472228188853 No 8 >PRK08835 consensus Probab=100.00 E-value=0 Score=899.79 Aligned_cols=539 Identities=20% Similarity=0.221 Sum_probs=458.3 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCC----------HHHHHHHHHHHCCCCCEEEEE Q ss_conf 6899981417898700044001127981999999984803999978881----------043799999863889819997 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHD----------PVMPQVLKEALENTDVTIVAH 71 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~----------~~~~~~l~~~L~~~~~~kV~H 71 (675) +.+++|+||.+. +.+ ...++|+++++..+.++|+++.|. +.+.+.++++|+++.+.||+| T Consensus 352 ~~~a~dtEt~~l--------d~~--~a~lvGis~~~~~g~a~YiPl~h~~l~~~~ql~~d~vl~~LkplLed~~i~KIgh 421 (931) T PRK08835 352 ELFAFDTETDSL--------DYM--VANLVGLSFAVAEGEAAYVPVAHDYLDAPEQLDRDWVLAQLKPLLEDDAKAKVGQ 421 (931) T ss_pred CEEEEEEECCCC--------CHH--HCCCCCEEEEECCCCEEEEEECCCCCCCHHHCCHHHHHHHHHHHHCCCCHHHHHC T ss_conf 859999504874--------322--2133542687647857999712333455101369999999799842842436622 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 3077499999860899884203316889998678---99968989982-9981100437999986202112343323467 Q gi|254781193|r 72 NSYFERTVLKACFNLDIPIHRWICTLVLARSNGL---PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNK 147 (675) Q Consensus 72 Na~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~ 147 (675) |++||+.+|.++ |+.+. +..+|||++||++++ .++|+.++..+ +...- ..+++. +...... T Consensus 422 NlK~d~~vL~~~-gI~l~-gi~fDTmLAaYLLnp~~~~~~Ld~La~~~L~~~~i---~~e~l~----------gkgk~q~ 486 (931) T PRK08835 422 NLKYDASVLARY-GIEMK-GIKHDTMLASYVYNSVGGKHDMDSLALRFLQHSCI---SFEQVA----------GKGKNQL 486 (931) T ss_pred CHHHHHHHHHHC-CCCCC-CCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCC---CHHHHC----------CCCCCCC T ss_conf 115789999977-97767-86612435753004777878989999986487763---477651----------4453447 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCH-----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 85853289999998998999999996301040-----------0015133666653115621379999999999999999 Q gi|254781193|r 148 PNHREAWRLFGEYCKGDVTVTREIFKRLEPLS-----------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERT 216 (675) Q Consensus 148 ~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~-----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~ 216 (675) .+...+.+....|+++|+.+|++||..|.+.+ +.|+++..++++||.+||.||.+.++++.+++..+++ T Consensus 487 ~f~~v~~~~~~~YaaedA~~t~~L~~~L~~~L~~~~~L~~Ly~~IE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~ 566 (931) T PRK08835 487 TFNQIDLEEAAPYAAEDADVTLRLHNRLFANIEQDEKLKSVYEEIEMPLVPVLSRIERTGVLIDDMLLSAQSQEIAARLD 566 (931) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEEECHHHHHHHHHHHHHHHH T ss_conf 83343288999999999999999999999999854778899999986899999999972848789999999999999999 Q ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC----CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999872885476788799988777755888888988----533899872130482389999999997655101335 Q gi|254781193|r 217 KLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLEDLTE----TTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLN 292 (675) Q Consensus 217 ~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~~k----~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~ 292 (675) +++++++++ .|..||+|||+||+++||+++|+|+.+++| +|+.++|+.+...||.+..+|+||. +.|+.+||++ T Consensus 567 ~le~~I~~~-aG~eFNinSPKQL~~vLF~~l~Lp~~kKtktG~~ST~~~vLe~L~~~hpi~~~il~yR~-l~KL~sTy~~ 644 (931) T PRK08835 567 ELEQKAYEI-AGQEFNMNSPKQLQTILFEKMGLPVIKKTPSGTPSTNEEVLQELALDYPLPKVILEYRG-LAKLKSTYTD 644 (931) T ss_pred HHHHHHHHH-CCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHH-HHHHHHHHHH T ss_conf 999999876-27856779999999999986099878858899967559999987651777899999978-9888888775 Q ss_pred HHHHHCCC-CCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECC Q ss_conf 78874086-89676778840001322112478884448888865113455431012333345544321003333012668 Q gi|254781193|r 293 TLVEAMNS-DGRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIAS 371 (675) Q Consensus 293 ~~~~~~~~-dgRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~ 371 (675) ++++.+++ +||||++|||.+ |.|||+||++|||||||.+++.|+. ||+||+|+ T Consensus 645 ~L~~~i~~~~gRiHt~f~Q~~-t~TGRLSS~~PNLQNIPiRt~~G~~-------------------------iR~aFva~ 698 (931) T PRK08835 645 KLPKMINPSTGRVHTSYHQAV-TATGRLSSTDPNLQNIPIRNEEGRR-------------------------IRQAFVAP 698 (931) T ss_pred HHHHHHCCCCCEECCHHHHCC-CCCCCCCCCCCCCCCCCCCCHHHCC-------------------------HHHHEECC T ss_conf 557652867588787343101-5524525799630267888601110-------------------------00534669 Q ss_pred CCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHH Q ss_conf 98789998345372333323114620113454113310101476448795670620111016520210028888889987 Q gi|254781193|r 372 EGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKM 451 (675) Q Consensus 372 ~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~ 451 (675) +||+|++|||||||||||||+|+|+.|++||++|.|||+.||+.+||+|.++|++++|+.||++|||++||||+.+|++ T Consensus 699 ~g~~l~saDYSQIELRilAh~s~d~~li~af~~~~DiH~~tA~~if~v~~~~v~~~~R~~AK~vnfgi~YG~~~~gLa~- 777 (931) T PRK08835 699 HGYKILAVDYSQIELRIMAHLSGDQALLDAFRDGKDIHAATAAEIMGVSIEQVSSEQRRRAKAVNFGLIYGMSAFGLAK- 777 (931) T ss_pred CCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH- T ss_conf 9887997043189999999982998999998669980678999995999466899898775833311244578777998- Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 98708883113455655545555678888999988764310001022078889999999987887899999999999999 Q gi|254781193|r 452 ASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQT 531 (675) Q Consensus 452 ~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~ 531 (675) .++++..||+.+++.|++.||+|+.|...+-..+++ T Consensus 778 -------------------------------------------~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~- 813 (931) T PRK08835 778 -------------------------------------------QLGIPRGEAQAYMDKYFERYPGVMQYMEDTRSRAAQ- 813 (931) T ss_pred -------------------------------------------HHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH- T ss_conf -------------------------------------------709899999999999988682299999999999987- Q ss_pred HHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 75491796215421133222344564023247663101678662102101002345677753145655453688999999 Q gi|254781193|r 532 IENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILA 611 (675) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k 611 (675) .+++++..||||++|++++.+...++. .+|+ ++|+|||||||||+| T Consensus 814 -------------------------~g~v~Tl~gRrr~~p~i~s~n~~~r~~---~eR~------a~N~piQGtaADiiK 859 (931) T PRK08835 814 -------------------------QGYVETIFGRRLHLPEITSRNGMRRKA---AERA------AINAPMQGTAADIIK 859 (931) T ss_pred -------------------------CCCEECCCCCCCCCCCCCCCCHHHHHH---HHHH------HHCCCCCHHHHHHHH T ss_conf -------------------------791788788731488656888778868---9999------957124455999999 Q ss_pred HHHHHHHH----CC---CCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf 99999996----79---81889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 612 EGMMNATK----AG---YDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 612 ~ami~l~~----~g---~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) .||+++++ .| .+|+||||||||||||+++.+++ +.|+++|+.+.+ ..+||.+|..+|+.|. T Consensus 860 ~AMi~~~~~l~~~~~~~~~~~lqvHDEli~e~~~~~~~~~~~~v~~~Me~~~~--l~Vpl~v~~~~G~nW~ 928 (931) T PRK08835 860 KAMLLVDEWIEEEGNGRVKLLMQVHDELVFEVEESSLSEIESKVQQLMESAAE--LDVPLVAEAGHGDNWD 928 (931) T ss_pred HHHHHHHHHHHHCCCCCCEEEEEECCEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 99999999998556777628986672778831688999999999999727632--7786374238076978 No 9 >PRK07898 consensus Probab=100.00 E-value=0 Score=890.07 Aligned_cols=534 Identities=19% Similarity=0.179 Sum_probs=455.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCC-HHHHHHHHHHHCCCCCEEEEEEHHHHHHHH Q ss_conf 6899981417898700044001127981999999984803999978881-043799999863889819997307749999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHD-PVMPQVLKEALENTDVTIVAHNSYFERTVL 80 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~-~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L 80 (675) +.+++|+||.+. + .+.+++++++++.++.++|+|..+. +.....+.++|+++.+.|++||++||+.+| T Consensus 338 ~~~a~~~~~t~~-~----------~~a~lvgisl~~~~g~a~yip~~~~~~~~~~~L~~~Led~~i~KIghNlK~d~~vL 406 (902) T PRK07898 338 RRLGLAVVGTHL-P----------GDGDATALAIAAADGHAAYIDTADLTPDDEAALAAWLADPARPKALHEAKGAMHAL 406 (902) T ss_pred CEEEEEEECCCC-C----------CCCCEEEEEEEEECCEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEECHHHHHHHH T ss_conf 869999853678-6----------65738999999538808997220179789999999860877654554048999999 Q ss_pred HHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 9860899884203316889998678---9996898998299811004379999862021123433234678585328999 Q gi|254781193|r 81 KACFNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLF 157 (675) Q Consensus 81 ~~~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~ 157 (675) .++ |+.+. +..+||||+||++++ +++|+.++.+| ++...... ..+. ....+.....+.. T Consensus 407 ~~~-Gi~l~-g~~fDTmLAaYLLdp~~~~~~Ld~La~~y-L~~~~~~~-------------~~~~--~~~~~~~~~~~~~ 468 (902) T PRK07898 407 AGR-GWTLA-GVTSDTALAAYLVRPGQRSFTLDDLSLRY-LRRELRAE-------------APAQ--QQLSLDDAVDAAA 468 (902) T ss_pred HHC-CCCCC-CCCHHHHHHHHHHCCCCCCCCHHHHHHHH-CCCCCCCC-------------CCCC--CCCCCCCCCHHHH T ss_conf 966-98657-75322899998756887878799999997-49766515-------------6535--4347132676899 Q ss_pred HHHHHHHHHHHHHHHHHCCCCH----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9998998999999996301040----------001513366665311562137999999999999999999999998728 Q gi|254781193|r 158 GEYCKGDVTVTREIFKRLEPLS----------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERTKLDQELESLTY 227 (675) Q Consensus 158 ~~Ya~~Dv~~t~~L~~~l~~~~----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~l~~~i~~~~~ 227 (675) ..|++.|+.+|++||..|.+.+ +.|+++..++.+||.+||.||.+.++++..++..++++++++|+++. T Consensus 469 ~~yaa~dA~~~~~L~~~l~~~L~~~~l~~L~~~IE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~~le~~I~~~a- 547 (902) T PRK07898 469 AQTLILRARAVLDLADALDAELARIDSAALLRDMELPLQRVLAEMERAGIAVDLAHLTELQSQFAGQVRDAAEAAYAVI- 547 (902) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHH- T ss_conf 9999999999999999999986165589999997605899999998508198199999899999999999999999981- Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCCCCCCC---CCHHHHHHHHHC--CCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 85476788799988777755888888988---533899872130--4823899999999976551013357887408689 Q gi|254781193|r 228 GLVSSSRCLEKLKTWLFLETGLHLEDLTE---TTCQRVLSSNKK--LTPQAIQVFNNRMRASRSAVLKLNTLVEAMNSDG 302 (675) Q Consensus 228 ~~~~n~~S~~ql~~~L~~~lg~~~~~~~k---~t~~~~L~~~~~--~~~~~~~~l~~r~~~~k~~~~~~~~~~~~~~~dg 302 (675) |..||+|||+||+++||+++|+|+.+++| +|+.++|+.+.. .||.+..+|+|| +++|+.+| ++++++.+++|| T Consensus 548 G~eFNinSpKQL~eiLFdkL~Lp~~KKtKtGySTd~~vLe~L~~~~~h~i~~~iLeyR-~l~KL~sT-~d~L~~~i~~~g 625 (902) T PRK07898 548 GKQINLGSPKQLQVVLFDELGMPKTKRTKTGYTTDADALQSLADKTGHPFLQHLLAHR-DATKLKVT-VDGLIKAVADDG 625 (902) T ss_pred CCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHH-HHHHHHHH-HHHHHHHHCCCC T ss_conf 8956899989999998997099988878899878899999998557992699999999-99899989-988898726577 Q ss_pred CEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECCCCC-EEEEEEC Q ss_conf 676778840001322112478884448888865113455431012333345544321003333012668987-8999834 Q gi|254781193|r 303 RLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIASEGK-KLVVADL 381 (675) Q Consensus 303 Rih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G~-~lv~aDy 381 (675) |||++|||.+ |.|||+||++|||||||.+++.|+. ||++|+|++|| +|++||| T Consensus 626 RIHtsfnQ~~-T~TGRLSS~~PNLQNIPir~~~G~~-------------------------iR~aFv~~~g~~~~~saDY 679 (902) T PRK07898 626 RIHTTFNQTV-AATGRLSSTDPNLQNIPIRTEAGRR-------------------------IRDAFVVGEGYETLLTADY 679 (902) T ss_pred CCCCHHHHHH-CCCCEECCCCCCCCCCCCCCCCHHH-------------------------HHHEEECCCCCCEEEECCH T ss_conf 2301265421-2112002589976677776610155-------------------------6760435899846774041 Q ss_pred CCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCH Q ss_conf 53723333231146201134541133101014764487956706201110165202100288888899879870888311 Q gi|254781193|r 382 AGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKMASTTGLNLNT 461 (675) Q Consensus 382 SqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~ 461 (675) ||||||||||+|+|+.|++||++|.|||+.||+.+||+|.++|++++|+.||++|||++||||+.+|++ T Consensus 680 SQiELRilAh~s~d~~l~~af~~~~DiH~~tA~~if~v~~~~vt~~~R~~AK~vNfgiiYG~~~~gLs~----------- 748 (902) T PRK07898 680 SQIEMRIMAHLSGDEGLIEAFNSGEDLHSFVASRVFGVPIDEVTPELRRRVKAMSYGLAYGLSAYGLSQ----------- 748 (902) T ss_pred HHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH----------- T ss_conf 367999999980889999998449988899999984999254899999898887747766758889998----------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCEEEEEC Q ss_conf 34556555455556788889999887643100010220788899999999878878999999999999997549179621 Q gi|254781193|r 462 FAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQTIENGKAIIAR 541 (675) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~ 541 (675) ..+++..||+.+++.|++.||+|..|..++-..+++ + T Consensus 749 ---------------------------------~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~---~------- 785 (902) T PRK07898 749 ---------------------------------QLKISTEEAKELMDAYFARFGGVRDYLREVVEQARK---D------- 785 (902) T ss_pred ---------------------------------HHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH---C------- T ss_conf ---------------------------------729899999999999998690699999999999997---7------- Q ss_pred CCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC- Q ss_conf 5421133222344564023247663101678662102101002345677753145655453688999999999999967- Q gi|254781193|r 542 KRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMMNATKA- 620 (675) Q Consensus 542 ~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~~~- 620 (675) +++++..||||++|++++.+...++. .+|+ ++|+|||||||||+|.||+++++. T Consensus 786 ----------------gyv~Tl~gRrr~~p~i~s~n~~~r~~---~eR~------A~N~piQGtaADiiK~AMi~i~~~l 840 (902) T PRK07898 786 ----------------GYTETLFGRRRYLPDLTSDNRQRREA---AERA------ALNAPIQGSAADIIKVAMIRVDRAL 840 (902) T ss_pred ----------------CEEECCCCCCCCCCCCCCCCHHHHHH---HHHH------HHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf ----------------90678787711388777889778989---9999------8286666069999999999999999 Q ss_pred -----CCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf -----981889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 621 -----GYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 621 -----g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) ..+|+||||||||||||+++.+++ +.|+++|+.+.+ ..+||.+|..+|+.|. T Consensus 841 ~~~~l~~~~lLQvHDELi~ev~~~e~e~~~~~v~~~Me~~~~--L~VPL~v~~~~G~nW~ 898 (902) T PRK07898 841 REAGLKSRMLLQVHDELVLEVAPGEREQVEALVRDEMGSAYD--LRVPLEVSVGYGRSWD 898 (902) T ss_pred HHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 846998738998641788940788999999999999728631--7787572007386978 No 10 >PRK07456 consensus Probab=100.00 E-value=0 Score=887.30 Aligned_cols=536 Identities=21% Similarity=0.241 Sum_probs=444.5 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEEC--CEEEEEEECCCCH--------HHHHHHHHHHCCCCCEEEEEE Q ss_conf 899981417898700044001127981999999984--8039999788810--------437999998638898199973 Q gi|254781193|r 3 TLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFD--DHPVELWDCVHDP--------VMPQVLKEALENTDVTIVAHN 72 (675) Q Consensus 3 ~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~--~~~v~~~d~~~~~--------~~~~~l~~~L~~~~~~kV~HN 72 (675) .+++|+||.|... .+.+++|+++||+ .+..+|+|+.|.. .+.+.++++|+++.+.|++|| T Consensus 384 ~vA~DtETtsLdp----------~~A~lVGis~s~~~~~~~a~YIPl~h~~~~~~l~~~~vl~~LkpiLed~~i~KighN 453 (975) T PRK07456 384 PVAWDTETTSLNP----------RDAELVGIGCCWGEELDDLAYIPLGHQQAGEQLPLETVLEALRPWLESPEHPKALQN 453 (975) T ss_pred EEEEECCCCCCCH----------HHCCEEEEEEECCCCCCCEEEEECCCCCCHHHCCHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 5899804688774----------232102666411557872699773455613105799999999998628875310211 Q ss_pred HHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 077499999860899884203316889998678--99968989982-998110043799998620211234332346785 Q gi|254781193|r 73 SYFERTVLKACFNLDIPIHRWICTLVLARSNGL--PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNKPN 149 (675) Q Consensus 73 a~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~--~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~ 149 (675) ++||+++|.++ |+.+. +..+||||+||+|++ .++|++++..| +.... ..++++. +. + .+ T Consensus 454 lKyD~~vL~~~-GI~l~-g~~fDTmLAsYLLnP~~~h~Ld~La~~yL~~~~i---~~edl~g---kg--------k--~~ 515 (975) T PRK07456 454 AKFDRLILLRH-GIALK-GVVFDTLLADYLLNPEAKHNLDDLAQRELGFTPT---SYKDLVG---KG--------Q--TF 515 (975) T ss_pred HHHHHHHHHHC-CCCCC-CCCCCHHHHHHHHCCCCCCCHHHHHHHHHCCCCC---CHHHHCC---CC--------C--CC T ss_conf 14679999976-95106-7551399999876876558989999997188877---4998628---88--------7--60 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCCH------------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 853289999998998999999996301040------------00151336666531156213799999999999999999 Q gi|254781193|r 150 HREAWRLFGEYCKGDVTVTREIFKRLEPLS------------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERTK 217 (675) Q Consensus 150 ~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~------------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~ 217 (675) ...+.+....||+.|+.+|++|+..|.+.+ +.|+++..++++||.+||.||.+.++++..++..++.+ T Consensus 516 ~~v~~~~~~~ya~~dA~~~~~L~~~l~~~L~~~~~~L~~l~~~iE~PLi~VLa~ME~~GI~vD~~~L~~l~~~l~~~l~~ 595 (975) T PRK07456 516 ADVDIATASQYCGMDVYLTYRLAPILREQLAKTGPELIKLLEEVELPLEPVLAEMEATGIRIDIPYLKELSQELESTLER 595 (975) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 20897899999999999999999999999986236999999998736999999999718501699999999999999999 Q ss_pred HHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCC--CCC---CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999872885476788799988777755888888--988---533899872130482389999999997655101335 Q gi|254781193|r 218 LDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLED--LTE---TTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLN 292 (675) Q Consensus 218 l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~--~~k---~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~ 292 (675) ++++++++ .|..||+|||+||+++||+++|+|+++ ++| +|++++|+.+...||.+..+|+||. +.|+.+||++ T Consensus 596 le~~i~~~-aG~eFNinSpkQL~evLFekL~L~~~k~~KtKtGySTd~~vLe~L~~~hpi~~~il~~R~-l~Kl~sTY~~ 673 (975) T PRK07456 596 LEQQAYEL-AGEDFNLSSPKQLGELLFEKLGLDRKKSRKTKTGYSTDAAVLEKLEGDHPIVPLILEHRT-LSKLKSTYVD 673 (975) T ss_pred HHHHHHHH-CCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHH-HHHHHHHHHH T ss_conf 99999986-489889899899999989863999877777889988629999975204727999999878-9999999999 Q ss_pred HHHHHCCCC-CCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECC Q ss_conf 788740868-9676778840001322112478884448888865113455431012333345544321003333012668 Q gi|254781193|r 293 TLVEAMNSD-GRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIAS 371 (675) Q Consensus 293 ~~~~~~~~d-gRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~ 371 (675) ++++.++++ ||||++|||.+ |.|||+||++|||||||.+++.|.. ||+||+|+ T Consensus 674 ~L~~~i~~~tgriht~f~Q~~-t~TGRlSS~~PNLQNIP~r~~~g~~-------------------------iR~~Fv~~ 727 (975) T PRK07456 674 ALPQLVEAETGRVHTDFNQAV-TATGRLSSSNPNLQNIPIRTEFSRR-------------------------IRKAFLPQ 727 (975) T ss_pred HHHHHHCCCCCEECCHHHHHC-CCCCCCCCCCCCCCCCCCCCCCCHH-------------------------HHEEECCC T ss_conf 988752877587657555410-6656504789751357776410312-------------------------34124358 Q ss_pred CCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHH Q ss_conf 98789998345372333323114620113454113310101476448795670620111016520210028888889987 Q gi|254781193|r 372 EGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKM 451 (675) Q Consensus 372 ~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~ 451 (675) |||+|++|||||||||||||+|+|+.|++||++|.|||+.||+.+|++ ++|++++|++||++|||++||||+.+|++ T Consensus 728 ~g~~l~saDySQiELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~--~~v~~~~R~~AK~infgiiYG~~~~gLa~- 804 (975) T PRK07456 728 EGWLLLSADYSQIELRILAHLSQEPVLLEAYRSGDDVHTLTARLLFEK--EEISSDERRLGKTINFGVIYGMGAQRFAR- 804 (975) T ss_pred CCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHCCC--CCCCHHHHHHHHHHHHHHHHCCCHHHHHH- T ss_conf 998899963208799999998588899998717998689999986393--01999999997898876776468778898- Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 98708883113455655545555678888999988764310001022078889999999987887899999999999999 Q gi|254781193|r 452 ASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQT 531 (675) Q Consensus 452 ~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~ 531 (675) ..+++..||+.+++.|++.||+|.+|...+...+ T Consensus 805 -------------------------------------------~l~is~~eA~~~i~~yf~~yp~v~~~~~~~~~~a--- 838 (975) T PRK07456 805 -------------------------------------------ETGVSQSEAKEFLSRYKERYPKVFAFLELQERLA--- 838 (975) T ss_pred -------------------------------------------HHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHH--- T ss_conf -------------------------------------------8597999999999999986901999999999999--- Q ss_pred HHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCH--------------HHHC-CHHHHHHHHHHHHHH Q ss_conf 7549179621542113322234456402324766310167866210--------------2101-002345677753145 Q gi|254781193|r 532 IENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNR--------------SYLN-TTTRQLTREYTYGGK 596 (675) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~--------------~~~~-~~~~~~~r~~~~~~k 596 (675) .+.+ ++++..||||++|..+... ...+ ......+| . T Consensus 839 ~~~g-----------------------yv~Tl~gRrr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R------~ 889 (975) T PRK07456 839 LSKG-----------------------YVETILGRRRYFHFDKNGLGRLLGKDPEEIDLDVARRAGMEAQQLR------A 889 (975) T ss_pred HHCC-----------------------CEECCCCCEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH------H T ss_conf 9769-----------------------8876667700464444310001356822134065666777758889------8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHC------CCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECC Q ss_conf 655453688999999999999967------981889952589997792489999-9999984565202159951574010 Q gi|254781193|r 597 LTENIVQAVSRDILAEGMMNATKA------GYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYE 669 (675) Q Consensus 597 ~~n~~iQgtaadi~k~ami~l~~~------g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~ 669 (675) ++|+|||||||||+|.||+++++. ..+|+||||||||||||+++.+++ +.|+++|+.+.+ ..+||.+|..+ T Consensus 890 A~N~piQGtaADiiK~AMi~~~~~l~~~~~~~~~llqvHDElifev~~~~~~~~~~~v~~~Me~a~~--l~Vpl~v~~~~ 967 (975) T PRK07456 890 AANAPIQGSSADIIKIAMVQLQKKLLEQNLPARLLLQVHDELVLEVPPEELEELKPLVKSTMENAVK--LSVPLVVEIGA 967 (975) T ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCC T ss_conf 8370244679999999999999999737988538998643778953688999999999999827435--78875841173 Q ss_pred CCCCC Q ss_conf 04631 Q gi|254781193|r 670 SKRYR 674 (675) Q Consensus 670 ~~~y~ 674 (675) |+.|. T Consensus 968 G~nW~ 972 (975) T PRK07456 968 GDNWM 972 (975) T ss_pred CCCHH T ss_conf 88968 No 11 >PRK08928 consensus Probab=100.00 E-value=0 Score=888.31 Aligned_cols=498 Identities=18% Similarity=0.189 Sum_probs=437.1 Q ss_pred CCEEEEEEEEECCEEEEEEECCCC-HHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC Q ss_conf 981999999984803999978881-0437999998638898199973077499999860899884203316889998678 Q gi|254781193|r 27 DVCVILFAFGFDDHPVELWDCVHD-PVMPQVLKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSNGL 105 (675) Q Consensus 27 ~~~i~~i~~a~~~~~v~~~d~~~~-~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~ 105 (675) +.++++++++++.+..++++..+. +...+.+.++|+++.+.||+||+|||+.+|.++ |+++ ..++||||+||+|++ T Consensus 335 ~~~~vgis~s~~~~~~~~i~~~~~~~~~~~~l~~lLed~~i~KIghNlK~dl~vL~~~-gi~l--~~i~DTmLasYLLnp 411 (861) T PRK08928 335 KGTNNAISLSLQNQSYIIIISNEITDAFSDIIFNLLTDKSILKITYDLKPLLKFYANQ-SHDI--TAIDDLMLMSYSLSA 411 (861) T ss_pred CCEEEEEEEEECCCEEEEECCCCCHHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHC-CCCC--CCCCCHHHHHHHCCC T ss_conf 8738999999559849995575424789999999843999754454579999999856-9675--763718899986499 Q ss_pred C---CCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCH--- Q ss_conf 9---9968989982998110043799998620211234332346785853289999998998999999996301040--- Q gi|254781193|r 106 P---SALKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLS--- 179 (675) Q Consensus 106 ~---~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~--- 179 (675) + ++|++++.++.. ...|++.|+..+++||+.|.+.+ T Consensus 412 ~~~~h~Ld~La~~~~~--------------------------------------~~~~aa~~a~~~~~L~~~L~~~L~~~ 453 (861) T PRK08928 412 GLHDHSLFNIILKNTN--------------------------------------IINESAKIVCTLISLHKQLILELFDN 453 (861) T ss_pred CCCCCCHHHHHHHHCC--------------------------------------CCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9876898999976222--------------------------------------00147999999999999999999886 Q ss_pred -------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCC Q ss_conf -------0015133666653115621379999999999999999999999987288547678879998877775588888 Q gi|254781193|r 180 -------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERTKLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLE 252 (675) Q Consensus 180 -------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~ 252 (675) +.|+++..++++||.+||.||.+.++++..++..++++++++|+++. |..||+|||+||+++||++||+|+. T Consensus 454 ~L~~Ly~eIE~PLi~VLa~ME~~GI~VD~~~L~~ls~el~~~l~~Le~~I~~lA-G~eFNInSPKQLgeILFekL~Lp~~ 532 (861) T PRK08928 454 KLFTIYYRIDLPLCFVLDKMEKIGIKVDANYLKQLSAEFGTEILKLEEEIYKLS-GTKFNIASPKQLGEILFEKMGLPKG 532 (861) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCHHHHHHHHHHHCCCCCC T ss_conf 689999998877999999999819601699999999999999999999999974-9874759999999997997299867 Q ss_pred CCC-----CCCHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCEEEEEEEECCCCCCCCCCCCCC Q ss_conf 898-----85338998721304-8238999999999765510133578874086-8967677884000132211247888 Q gi|254781193|r 253 DLT-----ETTCQRVLSSNKKL-TPQAIQVFNNRMRASRSAVLKLNTLVEAMNS-DGRLRGTLQFLGASRTGRWSGCVFP 325 (675) Q Consensus 253 ~~~-----k~t~~~~L~~~~~~-~~~~~~~l~~r~~~~k~~~~~~~~~~~~~~~-dgRih~~~~~~gat~TGR~S~~~pN 325 (675) +++ .||+.++|+.++.. ||.+..+|+|| +++|+.+||++++++.+++ +||||++|||.+ |.|||+||++|| T Consensus 533 KKt~KtG~ySTda~vLe~La~~~h~i~~~ILeyR-~l~KLkSTYvd~L~~~i~~~tgRIHTsfnQt~-taTGRLSS~~PN 610 (861) T PRK08928 533 KKSAKSGSYSTDAEVLEKLSEHGVEIADLLLKWR-HLTKLKNTYTDSLPKQIDNNTGRVHTTFSQTS-TTTGRLSSSNPN 610 (861) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCCEECCCHHHHH-HHCCEECCCCCC T ss_conf 7677889968549999986441791489999998-99887517888888874279985725210454-311400036887 Q ss_pred CCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECCCCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHC Q ss_conf 44488888651134554310123333455443210033330126689878999834537233332311462011345411 Q gi|254781193|r 326 PQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIASEGKKLVVADLAGIEARVLAWMAGEQWKLKAFETG 405 (675) Q Consensus 326 lQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g 405 (675) |||||.+++.|.+ ||+||+|++||+|++|||||||||||||+|+|+.|++||++| T Consensus 611 LQNIPiRte~Gr~-------------------------iR~aFva~~g~~l~saDYSQIELRilAhlS~D~~Li~AF~~~ 665 (861) T PRK08928 611 LQNIPIRSEEGNK-------------------------IRQAFIAEKGYKLISADYSQIELRILAHIANVDAFKQAFING 665 (861) T ss_pred CCCCCCCCCCHHH-------------------------HHEEECCCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHCCC T ss_conf 3567777620214-------------------------124034699988998537788999999981788999986068 Q ss_pred CCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 33101014764487956706201110165202100288888899879870888311345565554555567888899998 Q gi|254781193|r 406 EDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWM 485 (675) Q Consensus 406 ~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (675) .|||+.||+.+||+|.++|++++|+.||++|||++||||+.+|++ T Consensus 666 ~DIH~~TA~~vf~v~~~~vt~~~Rr~AK~vNFGiiYG~s~fgLs~----------------------------------- 710 (861) T PRK08928 666 DDIHTQTACQIFNLQKHELDSELRRKAKSINFGIIYGISAFGLAK----------------------------------- 710 (861) T ss_pred CCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH----------------------------------- T ss_conf 866688888873898211898899999999999887568888998----------------------------------- Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCC Q ss_conf 87643100010220788899999999878878999999999999997549179621542113322234456402324766 Q gi|254781193|r 486 QENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLPSG 565 (675) Q Consensus 486 ~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~G 565 (675) ..+++..||+.+++.|++.||+|+.|..++-..++ +. ++++|..| T Consensus 711 ---------~l~i~~~eA~~~i~~Yf~~yp~v~~~~~~~~~~a~---~~-----------------------Gyv~Tl~g 755 (861) T PRK08928 711 ---------QLNITNSEASEYIKKYFAEYPEIQEYMEQTKAYAR---SN-----------------------GYVTTFFG 755 (861) T ss_pred ---------HHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH---HC-----------------------CCEECCCC T ss_conf ---------82989999999999999968149999999999999---77-----------------------98768888 Q ss_pred CCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--CCCEEEEECCEEEEECCCCHHHH Q ss_conf 3101678662102101002345677753145655453688999999999999967--98188995258999779248999 Q gi|254781193|r 566 RSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMMNATKA--GYDIVLTVHDEIVCETPDTDEFN 643 (675) Q Consensus 566 R~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~~~--g~~ivl~VHDEiv~evp~~~~~~ 643 (675) |||++|++++.+...++. .+|. ++|+|||||||||+|.||+++++. ..+|+||||||||||||+++.++ T Consensus 756 Rrr~~p~i~s~n~~~r~~---~eR~------A~N~piQGtAADiiK~AMi~~~~~l~~~~mllQVHDEli~E~~~~~~~~ 826 (861) T PRK08928 756 RKCFIPLINSKKHTLRQF---AERA------AINAPIQGTAADIIKIAMINLDDELEKGRLILQVHDELLFEVPEDEVDT 826 (861) T ss_pred CEECCCCCCCCCHHHHHH---HHHH------HHCHHHHHHHHHHHHHHHHHHHHHHHHCEEEEEEEEEEEEEECHHHHHH T ss_conf 603287555778778858---9999------9751212889999999999999986247168745446789646889999 Q ss_pred H-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf 9-999998456520215995157401004631 Q gi|254781193|r 644 A-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 644 ~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) + +.|+++|+.+.+ ..+||.+|..+|+.|. T Consensus 827 ~~~~~~~~Me~a~~--l~Vpl~v~~~~g~nW~ 856 (861) T PRK08928 827 VAKLIKEIMENSTN--MSVPLKVEIRIGDNWG 856 (861) T ss_pred HHHHHHHHHCCCCC--CCCCEEEECCCCCCHH T ss_conf 99999999628743--7787684107589978 No 12 >PRK05797 consensus Probab=100.00 E-value=0 Score=884.47 Aligned_cols=518 Identities=19% Similarity=0.202 Sum_probs=450.4 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCC---CHHHHHHHHHHHCCCCCEEEEEEHHHHHHH Q ss_conf 89998141789870004400112798199999998480399997888---104379999986388981999730774999 Q gi|254781193|r 3 TLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVH---DPVMPQVLKEALENTDVTIVAHNSYFERTV 79 (675) Q Consensus 3 ~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~---~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~ 79 (675) .+++|+||.+...+ ...++++++++.++.++|+|+.+ .+.+...++++|+++.+.|||||+|||+.+ T Consensus 321 ~~~~~~ett~~~~~----------~~~~vg~s~~~~~~~a~yip~~~~~~~~~~L~~Lk~lLed~~i~KIGhNlK~dl~v 390 (869) T PRK05797 321 QVYINFTLEDENLY----------SKIEIKKIFINFGEKTYYIDFKELIDEEEFIEDLKEIFENEEIKKIGHDIKNFLTI 390 (869) T ss_pred CEEEEEECCCCCHH----------HHHHEEEEEEECCCEEEEEECHHHCCHHHHHHHHHHHHHCCCCCEEEECCHHHHHH T ss_conf 87999973897612----------03041899996199799983145167789999999998489987898430699999 Q ss_pred HHHHCCCCCCCCCEEEHHHHHHHHCCC---CCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 998608998842033168899986789---9968989982-998110043799998620211234332346785853289 Q gi|254781193|r 80 LKACFNLDIPIHRWICTLVLARSNGLP---SALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWR 155 (675) Q Consensus 80 L~~~~Gi~i~~~~~~DTmi~A~~l~~~---~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~ 155 (675) |.++ |+.+. +.++||||+||++++. ++|+.++..+ +... . . T Consensus 391 L~~~-GI~l~-gi~FDTMLAsYLLnP~~~~h~Ld~La~~yL~~~~---------------------------~------~ 435 (869) T PRK05797 391 LKKL-GIEFK-GLAFDTAIAAYLIDPSKSEYDLSDLAKEYLSKDV---------------------------N------S 435 (869) T ss_pred HHHC-CCCCC-CHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCC---------------------------C------C T ss_conf 9975-98556-6166899999972899888899999999808777---------------------------5------1 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCH----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999998998999999996301040----------0015133666653115621379999999999999999999999987 Q gi|254781193|r 156 LFGEYCKGDVTVTREIFKRLEPLS----------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERTKLDQELESL 225 (675) Q Consensus 156 ~~~~Ya~~Dv~~t~~L~~~l~~~~----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~l~~~i~~~ 225 (675) .-..|++.|+.++++||..|.+.+ +.|+++..++++||.+||.||.+.++++..++..++++++++|+++ T Consensus 436 ~e~~ya~~dA~~~~~L~~~L~~~L~~~~l~~L~~~IE~PLi~VLa~ME~~GI~VD~~~L~~l~~~l~~~l~~le~~I~~~ 515 (869) T PRK05797 436 EEEEYKIKEVSVMKELYEKLKEKIEEMDMEKLLYEVELPLVEVLASMESEGFKVDKDILDELSKKFKEEIEKTQKEIYEL 515 (869) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 47899999999999999999999986128999999986789999999846809999999999999999999999999986 Q ss_pred HCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC---CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 2885476788799988777755888888988---5338998721304823899999999976551013357887408689 Q gi|254781193|r 226 TYGLVSSSRCLEKLKTWLFLETGLHLEDLTE---TTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLNTLVEAMNSDG 302 (675) Q Consensus 226 ~~~~~~n~~S~~ql~~~L~~~lg~~~~~~~k---~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~~~~~~~~~dg 302 (675) .|..||+|||+||+++||+++|+|+.+++| +|+.++|+.+.+.||.+..+|+||. ++|+.+||+++++..+++|| T Consensus 516 -aG~eFNinSPKQL~~iLFe~LgLp~~KKtKtG~STd~~vLe~L~~~hpi~~~iLeyR~-l~KL~sTY~d~L~~~i~~~g 593 (869) T PRK05797 516 -AGEEFNINSPKQLGKILFEKLDLPVIKKTKTGYSTNAEVLEKLRDKHPIIEKITEYRQ-LTKLYSTYVEGLKNVIDEDG 593 (869) T ss_pred -CCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHH-HHHHHHHHHHHHHHHCCCCC T ss_conf -2886678998999999898649998887889987859999976554885199999879-98999889878997426246 Q ss_pred CEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECCCC-CEEEEEEC Q ss_conf 67677884000132211247888444888886511345543101233334554432100333301266898-78999834 Q gi|254781193|r 303 RLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIASEG-KKLVVADL 381 (675) Q Consensus 303 Rih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G-~~lv~aDy 381 (675) |||++|||.+ |.|||+||++|||||||.+++.|. .||+||+|++| ++|++||| T Consensus 594 RiHt~f~Q~~-t~TGRLSS~~PNLQNIP~r~e~G~-------------------------~iR~aFi~~~~~~~l~saDY 647 (869) T PRK05797 594 RIHSNFNQTV-TTTGRLSSTEPNLQNIPIKYEMGR-------------------------EIRKVFIPNTEDSVILSADY 647 (869) T ss_pred CCCCHHHHHH-CCCCCCCCCCCCCCCCCCCCCCHH-------------------------HHHHEEECCCCCCEEEECCH T ss_conf 5054266543-440211468986456877652012-------------------------25535714777615885227 Q ss_pred CCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCH Q ss_conf 53723333231146201134541133101014764487956706201110165202100288888899879870888311 Q gi|254781193|r 382 AGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKMASTTGLNLNT 461 (675) Q Consensus 382 SqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~ 461 (675) ||||||||||+|+|+.|++||++|.|+|+.||+.+||+|.++|++++|+.||++|||++||||+.+|++ T Consensus 648 SQIELRilAh~s~d~~l~~af~~~~DiH~~tA~~vf~~~~~~v~~~~R~~AK~infGiiYG~~~~gLs~----------- 716 (869) T PRK05797 648 SQIELRVLAHISGDENLIDAFNHHEDIHTKTASEVFKVPIEEVTPLMRSNAKAVNFGIVYGIGDFSLSQ----------- 716 (869) T ss_pred HHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH----------- T ss_conf 789999999984689999988559976899999995989111798899998898856655568677898----------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCEEEEEC Q ss_conf 34556555455556788889999887643100010220788899999999878878999999999999997549179621 Q gi|254781193|r 462 FAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQTIENGKAIIAR 541 (675) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~ 541 (675) ..+++..||+.+++.|++.||+|.+|..++...|++ T Consensus 717 ---------------------------------~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~----------- 752 (869) T PRK05797 717 ---------------------------------DLNISRKEAKEYIDNYFERYPNVKEYMEKIVEEAKE----------- 752 (869) T ss_pred ---------------------------------HCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH----------- T ss_conf ---------------------------------529899999999999999680499999999999997----------- Q ss_pred CCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC- Q ss_conf 5421133222344564023247663101678662102101002345677753145655453688999999999999967- Q gi|254781193|r 542 KRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMMNATKA- 620 (675) Q Consensus 542 ~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~~~- 620 (675) .+++++..||||++|++++.+...+.. .+|+ ++|+|||||||||+|.||+++++. T Consensus 753 ---------------~g~v~Tl~gRrr~~p~i~s~n~~~r~~---~eR~------AiN~piQGtaADiiK~AMi~~~~~l 808 (869) T PRK05797 753 ---------------KGYVTTILNRRRYIPEIKSSNKIVRAF---GERL------AMNTPIQGSAADIIKLAMVNVYNKL 808 (869) T ss_pred ---------------CCEEECCCCCEECCCCCCCCCHHHHHH---HHHH------HHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf ---------------890788888733588777889889977---8887------8274655679999999999999999 Q ss_pred -----CCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf -----981889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 621 -----GYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 621 -----g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) ..+|+||||||||||||+++.+++ +.|+++|+.+.+ ..+||.+|..+|+.|. T Consensus 809 ~~~~~~~~~~lqVHDElv~ev~~~~~~~~~~~v~~~Me~~~~--l~VPl~v~~~~g~nW~ 866 (869) T PRK05797 809 KELNLKSKLILQVHDELILNVYKDELEEVKKIVKEEMENVIK--LKVPLDVDINVGNNWY 866 (869) T ss_pred HHCCCCCEEEEEEEEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 854998649999732668951688999999999999728425--7887473048278967 No 13 >PRK08076 consensus Probab=100.00 E-value=0 Score=881.94 Aligned_cols=533 Identities=17% Similarity=0.179 Sum_probs=449.0 Q ss_pred EEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHH Q ss_conf 99981417898700044001127981999999984803999978881043799999863889819997307749999986 Q gi|254781193|r 4 LFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLKAC 83 (675) Q Consensus 4 i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~ 83 (675) .++|+||.+... .+..++|++++.+++ .+|++.... ...+.++++|+++.+.|++||++||+.+|.++ T Consensus 317 ~a~~~Et~~~~~----------~~a~lvGis~s~~~~-a~yiP~~~~-~~~~~lk~~Led~~i~KighNlK~dl~vL~~~ 384 (877) T PRK08076 317 SALVVEVLEDNY----------HKADIQGFAIVNENG-CYFIPTEIA-LKSDAFKEWLEDEEKKKWVFDAKRAIVALKWN 384 (877) T ss_pred CEEEEEECCCCC----------CCCCEEEEEEECCCC-EEEECCCHH-HHHHHHHHHHCCCCCCEEEECHHHHHHHHHHC T ss_conf 549999647885----------637789999985894-599550044-21589999853976540265479999999968 Q ss_pred CCCCCCCCCEEEHHHHHHHHCCC---CCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 08998842033168899986789---996898998299811004379999862021123433234678585328999999 Q gi|254781193|r 84 FNLDIPIHRWICTLVLARSNGLP---SALKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEY 160 (675) Q Consensus 84 ~Gi~i~~~~~~DTmi~A~~l~~~---~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Y 160 (675) |+++. +.++||||+||++++. ++|+.++..+.... ....+.+. ++. .. +. .+. .....+| T Consensus 385 -GI~l~-gi~FDTMLAAYLLdP~~~~~~Ld~La~~yl~~~--~~~~e~v~---Gkg-----~k-~~---~p~-~~~~~eY 447 (877) T PRK08076 385 -GIDLQ-GIDFDLLLAAYLLNPADSDDDFASVAKMKETHA--VKSDEAVY---GKG-----AK-RA---VPE-EEILAEH 447 (877) T ss_pred -CCCCC-CCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCC--CCCHHHHC---CCC-----CC-CC---CCC-HHHHHHH T ss_conf -96447-764029999987188767666999999855777--76347653---765-----22-24---877-8999999 Q ss_pred HHHHHHHHHHHHHHCCCCH----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8998999999996301040----------001513366665311562137999999999999999999999998728854 Q gi|254781193|r 161 CKGDVTVTREIFKRLEPLS----------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERTKLDQELESLTYGLV 230 (675) Q Consensus 161 a~~Dv~~t~~L~~~l~~~~----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~l~~~i~~~~~~~~ 230 (675) |+.|+.+|++||+.|.+.+ +.|+++..++++||.+||.||.+.++++..++..++.+++++|+++. |.. T Consensus 448 aaedA~~t~~L~~~L~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~VD~~~L~~ls~~l~~~l~~le~~I~~la-G~e 526 (877) T PRK08076 448 LARKAHAIYDLKQTFVEELEKNEQLELFEELELPLARVLAEMEVKGVKVDTERLENMGEELAGRLKELEQEIHELA-GEE 526 (877) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHC-CCC T ss_conf 9999999999999999998655789999999704889999998548389899999999999999999999999972-898 Q ss_pred CCCCCHHHHHHHHHHHCCCCCCCCCC---CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCEEE Q ss_conf 76788799988777755888888988---533899872130482389999999997655101335788740868-96767 Q gi|254781193|r 231 SSSRCLEKLKTWLFLETGLHLEDLTE---TTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLNTLVEAMNSD-GRLRG 306 (675) Q Consensus 231 ~n~~S~~ql~~~L~~~lg~~~~~~~k---~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~~~~~~~~~d-gRih~ 306 (675) ||+|||+||+++||+++|+|+.+++| +|+.++|+.+.+.||.+..+|+|| +++|+.+||++++++.++++ ||||+ T Consensus 527 FNinSpKQL~~ILFekL~Lp~~KKTKtG~STd~~vLe~L~~~h~i~~~iLeyR-~l~KL~sTY~d~L~~~i~~~tgRiHt 605 (877) T PRK08076 527 FNINSPKQLGVILFEKLGLPVIKKTKTGYSTSADVLEKLADKHEIIPNILHYR-QLGKLQSTYIEGLLKVVHKDTHKVHT 605 (877) T ss_pred CCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCCEEEC T ss_conf 78899999999999853898888788998786999997543285479999998-99888748999988741877545630 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEEC-CCCCEEEEEECCCCC Q ss_conf 7884000132211247888444888886511345543101233334554432100333301266-898789998345372 Q gi|254781193|r 307 TLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIA-SEGKKLVVADLAGIE 385 (675) Q Consensus 307 ~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a-~~G~~lv~aDySqiE 385 (675) +|||.+ |.|||+||++|||||||.+++.|+. ||+||+| +|||+|++||||||| T Consensus 606 ~fnQ~~-t~TGRLSS~~PNLQNIPirt~~G~~-------------------------IR~aFi~~~~g~~l~saDYSQIE 659 (877) T PRK08076 606 RFNQAL-TQTGRLSSTDPNLQNIPIRLEEGRK-------------------------IRQAFVPSEEGWIMFAADYSQIE 659 (877) T ss_pred CCCHHC-CCCCCCCCCCCCCCCCCCCCHHHHH-------------------------HHHEEEECCCCCEEEEECHHHHH T ss_conf 100000-4556402578752367777644577-------------------------87614307999679983266889 Q ss_pred HHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCHHHHH Q ss_conf 33332311462011345411331010147644879567062011101652021002888888998798708883113455 Q gi|254781193|r 386 ARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKMASTTGLNLNTFAHL 465 (675) Q Consensus 386 lRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~ 465 (675) ||||||+|+|+.|++||++|.|||+.||+.+||+|.++|++++|+.||++|||++||||+.+|++ T Consensus 660 LRilAh~s~d~~l~~af~~~~DiH~~tA~~if~~~~~~vt~~~R~~AK~infGiiYG~~~~gLs~--------------- 724 (877) T PRK08076 660 LRVLAHIAEDENLIEAFQHDMDIHTKTAMDVFHVEKDEVTSNMRRQAKAVNFGIVYGISDYGLSQ--------------- 724 (877) T ss_pred HHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHH--------------- T ss_conf 99999982789999987159757588889984898010899999998898877876558788998--------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 65554555567888899998876431000102207888999999998788789999999999999975491796215421 Q gi|254781193|r 466 VKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQTIENGKAIIARKRRD 545 (675) Q Consensus 466 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (675) ..+++..||+.+++.|++.||+|.+|..++...+++ T Consensus 725 -----------------------------~l~i~~~eA~~~i~~Yf~~yp~v~~~~~~~~~~a~~--------------- 760 (877) T PRK08076 725 -----------------------------NLGITRKEAAEFIERYFESFPGVKEYMEDIVQEAKQ--------------- 760 (877) T ss_pred -----------------------------HCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH--------------- T ss_conf -----------------------------629899999999999999782499999999999997--------------- Q ss_pred CCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC----- Q ss_conf 133222344564023247663101678662102101002345677753145655453688999999999999967----- Q gi|254781193|r 546 VPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMMNATKA----- 620 (675) Q Consensus 546 ~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~~~----- 620 (675) .+++++..||||++|++++.+...++. .+|+ ++|+|||||||||+|.||+++++. T Consensus 761 -----------~Gyv~Tl~gRrr~~p~i~s~n~~~r~~---~eR~------A~N~piQGtAADiiK~AMi~i~~~l~~~~ 820 (877) T PRK08076 761 -----------KGYVTTLLHRRRYIPEITSRNFNLRSF---AERT------AMNTPIQGSAADIIKKAMIDMADRLKEEG 820 (877) T ss_pred -----------CCCEECCCCCEEECCCCCCCCHHHHHH---HHHH------HHCCCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf -----------688864677703087655778778888---9999------93614456799999999999999998469 Q ss_pred -CCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf -981889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 621 -GYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 621 -g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) ..+|+||||||||||||+++.+++ +.++++|+.+.+ ..+||.+|..+|+.|. T Consensus 821 ~~~~mlLqVHDELvfev~~~~~~~~~~~~~~~Me~a~~--L~VPL~v~~~~G~nW~ 874 (877) T PRK08076 821 LQARLLLQVHDELIFEAPKEEIEKLEKLVPEVMEHAVE--LDVPLKVDYSYGPTWY 874 (877) T ss_pred CCCEEEEEEEEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 98518998642668961688999999999999827423--6887584138387967 No 14 >PRK08434 consensus Probab=100.00 E-value=0 Score=876.56 Aligned_cols=530 Identities=19% Similarity=0.269 Sum_probs=442.5 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCH-----HH-HH----HHHHHHCCCCCEEEEE Q ss_conf 68999814178987000440011279819999999848039999788810-----43-79----9999863889819997 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDP-----VM-PQ----VLKEALENTDVTIVAH 71 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~-----~~-~~----~l~~~L~~~~~~kV~H 71 (675) +.+++|+||.+. | ....+++|++++++.+..+|+|+.|.. .+ ++ .+.++++ ..+||| T Consensus 319 ~~~a~d~Ett~l-d---------~~~a~lvGis~~~~~~~a~YiPi~h~~l~~~~~~~~~~~~~~l~~~~~---~~~Igh 385 (887) T PRK08434 319 SIIAFDTETTGL-D---------TKEAKLVGFSFCFNEEEAYYVPLAHSYLGVGEQISLESAKKAIEKIFN---HFVIGH 385 (887) T ss_pred CEEEEEEECCCC-C---------CCCCCEEEEEEEECCCCEEEEECCCCCCCCHHHCCHHHHHHHHHHHHH---CCCHHH T ss_conf 769999512786-6---------133430589999559858999752112577333599999999787633---740364 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 3077499999860899884203316889998678--99968989982-99811004379999862021123433234678 Q gi|254781193|r 72 NSYFERTVLKACFNLDIPIHRWICTLVLARSNGL--PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNKP 148 (675) Q Consensus 72 Na~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~--~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~ 148 (675) |++||+++|.+.+|+++. +.++||||+||++++ .++|+.++..+ +... ...++++. +. . . T Consensus 386 NlKyDl~vL~~~~gi~l~-g~~fDTmLAaYLLdp~~r~~Ld~La~~yL~~~~---i~~e~l~~---kg--------~--~ 448 (887) T PRK08434 386 NLKYDFKIIQNNFGLELP-QKYADTMILAWLKDPSSRVGLDDLAKRLFNYEM---IHFESVVK---KG--------E--N 448 (887) T ss_pred HHHHHHHHHHHHCCCCCC-CCHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCC---CCHHHHHC---CC--------C--C T ss_conf 779999999985698777-731489999986698666666789999708787---74899854---67--------7--7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCH-------------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH Q ss_conf 5853289999998998999999996301040-------------001513366665311562137999999999999999 Q gi|254781193|r 149 NHREAWRLFGEYCKGDVTVTREIFKRLEPLS-------------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVER 215 (675) Q Consensus 149 ~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~-------------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~ 215 (675) +...+.+....||++|+++|++||..|.+.+ +.|+++..++++||.+||.||.+.++++..++..++ T Consensus 449 f~~v~i~~a~~YAaedA~it~~L~~~l~~~L~~~~~~~L~~Ly~~IElPLi~VLa~ME~~GI~VD~~~L~~ls~el~~~i 528 (887) T PRK08434 449 FSSVDLEKACKYAAEDAWITLRFYKKFLENLEKRGATHLLELAKNVEFPFIKTLLMMEENGIKLDTNFLEELKKKFEEEI 528 (887) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 02078899999999999999999999999998655177999999988659999999997195026999999999999999 Q ss_pred HHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC---CCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999872885476788799988777755888888988---533899872130482389999999997655101335 Q gi|254781193|r 216 TKLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLEDLTE---TTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLN 292 (675) Q Consensus 216 ~~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~~k---~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~ 292 (675) ++++++|+++ .|..||+|||+||+++||+++|+|+.+++| ||+.++|+.+.+.||.+..+|+|| +++|+.+||++ T Consensus 529 ~~Le~~Iy~l-AG~eFNInSPKQLgeILFekLgLp~~KKTKTGySTd~evLe~L~~~hpi~~~iLeyR-~l~KL~sTYvd 606 (887) T PRK08434 529 KELTEEIYEL-CGEEFNINSTKQLGAVLFEKLKLPSGKKTKTGYSTDEKVLNALLDKHPVIPKILEYR-ELYKLYSTYIE 606 (887) T ss_pred HHHHHHHHHH-CCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHH-HHHHHHHHHHH T ss_conf 9999999985-576647799999999989864998888888998880999999887387389999860-89999999998 Q ss_pred HHHHHCC--CCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEEC Q ss_conf 7887408--68967677884000132211247888444888886511345543101233334554432100333301266 Q gi|254781193|r 293 TLVEAMN--SDGRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIA 370 (675) Q Consensus 293 ~~~~~~~--~dgRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a 370 (675) ++++.+. .+||||++|||.+ |.|||+||++|||||||.+++.|.. ||+||+| T Consensus 607 ~L~~~~~~d~tgRIHttfnQ~~-t~TGRLSS~~PNLQNIPirt~~Gr~-------------------------iR~aFv~ 660 (887) T PRK08434 607 PLLKLALKDKNSRIYTSFLQTG-TATGRLSSKNPNLQNIPARGELAKD-------------------------IREAFVA 660 (887) T ss_pred HHHHHCCCCCCCEECCHHHHHC-CCCEEECCCCCCCCCCCCCCCCCHH-------------------------HHHHEEE T ss_conf 8877514788870310115313-2210123689975678777642155-------------------------4644340 Q ss_pred CCCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHH Q ss_conf 89878999834537233332311462011345411331010147644879567062011101652021002888888998 Q gi|254781193|r 371 SEGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQK 450 (675) Q Consensus 371 ~~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~ 450 (675) ++||+|++|||||||||||||+|+|+.|++||++|.|||+.||+.+|| +++++++|+.||++|||++||||+.+|++ T Consensus 661 ~~g~~l~~aDysQiElRilAh~s~d~~l~~af~~~~DiH~~tA~~vf~---~~~~~~~R~~aK~vnfgi~YG~~~~gLa~ 737 (887) T PRK08434 661 KEGYSLISLDYSQIELRLLAHFSKDEALLEAFKNDEDIHARTAIKIFG---EENAKAKRAVAKSINFGLIYGMGSKKLSE 737 (887) T ss_pred CCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHC---CCCCHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 688779970573899999999718899999984499848999999837---77888899897777756655468777998 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH Q ss_conf 79870888311345565554555567888899998876431000102207888999999998788789999999999999 Q gi|254781193|r 451 MASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQ 530 (675) Q Consensus 451 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~ 530 (675) ..+++..||+.+++.|++.||+|++|..++...+++ T Consensus 738 --------------------------------------------~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~ 773 (887) T PRK08434 738 --------------------------------------------TLGISAKEAKEYIESYFASFPTIKNFLESIKNFAKE 773 (887) T ss_pred --------------------------------------------HCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH T ss_conf --------------------------------------------709899999999999999680199999999999997 Q ss_pred HHHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 97549179621542113322234456402324766310167866210210100234567775314565545368899999 Q gi|254781193|r 531 TIENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDIL 610 (675) Q Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~ 610 (675) + +++++..||||++ ++.+.+...+ ...+|+ ++|+|||||||||+ T Consensus 774 ---~-----------------------gyv~Tl~gRrr~~-~~~~~~~~~~---~~~eR~------A~N~piQGtaADii 817 (887) T PRK08434 774 ---N-----------------------GYVETLLGRKRYF-DFENATPMQK---AMYERE------AVNTIFQGSAADLI 817 (887) T ss_pred ---C-----------------------CCEECCCCCCCCC-CCCCCCHHHH---HHHHHH------HHCCCCCHHHHHHH T ss_conf ---6-----------------------8877888872068-8779898899---899999------83823366799999 Q ss_pred HHHHHHHHH----CCCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf 999999996----7981889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 611 AEGMMNATK----AGYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 611 k~ami~l~~----~g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) |.||+++++ .+++|+||||||||||||+++.+++ +.|+++|+.+.+ ..+||.+|..+|+.|. T Consensus 818 k~Ami~~~~~l~~~~~~~~lqvHDElv~e~~~~~~~~~~~~v~~~Me~a~~--L~VPL~v~~~~g~nW~ 884 (887) T PRK08434 818 KLAMNEINKLLNEEDAKLLLQIHDELIFEVKDELAEEFAKKFKDIMENIVK--LNVPLKTSLSIAKNWG 884 (887) T ss_pred HHHHHHHHHHHHHCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 999999999986449869997542678962686999999999999728501--6787686428278946 No 15 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=100.00 E-value=0 Score=875.36 Aligned_cols=492 Identities=22% Similarity=0.262 Sum_probs=408.4 Q ss_pred HHCCCCCEEEEEEHHHHHHHHHHHCCCCCC-CCCEEEHHHHHHHHCCCC---CHHHHHHHCCCCC-CHHH-HHHHHHHHH Q ss_conf 863889819997307749999986089988-420331688999867899---9689899829981-1004-379999862 Q gi|254781193|r 60 ALENTDVTIVAHNSYFERTVLKACFNLDIP-IHRWICTLVLARSNGLPS---ALKAVGEALKLSS-QKME-EGKELIARF 133 (675) Q Consensus 60 ~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~-~~~~~DTmi~A~~l~~~~---sL~~la~~l~l~~-~k~~-~~k~l~~~~ 133 (675) ...+...++..||+||++.+|++. |+++. ...++||||+||+|++.. .|+.++..|.... ..+. .++.-+. T Consensus 460 ~~~~a~~~~~~~~~K~~~~~L~~~-g~~~~~~~~~~D~~laaYll~~~~~~~~l~~~~~~y~~~~~~~d~~~~~~~~~-- 536 (1005) T TIGR00593 460 LLVSASKVKVAHDAKFLMHLLKRK-GIELIEIGVIFDTMLAAYLLDPAQVSKDLDTLAKRYLVEELILDDDRKEEGIK-- 536 (1005) T ss_pred HCCCCCCCCCHHHHHHHHHHHHHC-CCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHH-- T ss_conf 002341012288999999999743-77334421145489999984303564027899987134346778731345544-- Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH-----------------------------CCCC Q ss_conf 02112343323467858532899999989989999999963010400-----------------------------0151 Q gi|254781193|r 134 CKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLSS-----------------------------KEQE 184 (675) Q Consensus 134 ~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~~-----------------------------~E~~ 184 (675) ..........+....|+..++.++.+|++.|.+.+. .|++ T Consensus 537 -----------~~~~~a~~~~~~~~~~~~~~~~~~~~L~~~l~~~L~~eDaDitfrlyhylklrle~~~~~~L~~eiE~P 605 (1005) T TIGR00593 537 -----------KLAKFADPDLEEAIEYLARKAAATARLAEELLKELDAEDADITFRLYHYLKLRLEENKLLSLYEEIELP 605 (1005) T ss_pred -----------HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHCCC T ss_conf -----------334441121778899999999999999999888731132204667777888651638899999884200 Q ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCC---C-CCCHH Q ss_conf 3366665311562137999999999999999999999998728854767887999887777558888889---8-85338 Q gi|254781193|r 185 LWYLDQTINDRGYCVDVALAHTLNQIVDVERTKLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLEDL---T-ETTCQ 260 (675) Q Consensus 185 ~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~---~-k~t~~ 260 (675) +..++++||.+||.||.++++++..++..++..++.+|+++ .|..||+||||||..+||++|++|+.++ + .||+. T Consensus 606 l~~VL~~ME~~Gi~~D~~~L~~L~~e~~~~l~~lE~~Iy~~-aG~~FNinSPKQL~~vLF~KL~Lp~~kK~KtG~YSTd~ 684 (1005) T TIGR00593 606 LSKVLAEMEKTGIKVDKDYLQELSKEFGEELLDLEEEIYEL-AGEEFNINSPKQLGEVLFEKLGLPVGKKTKTGSYSTDA 684 (1005) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCEECH T ss_conf 78999998776678708999999999999999999999987-49520688733555886998478820034678621338 Q ss_pred HHHHHHHCCCCH-HHHHH-HHHHHHHHHHHHHHHHHHHHCCCC-CCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 998721304823-89999-999997655101335788740868-967677884000132211247888444888886511 Q gi|254781193|r 261 RVLSSNKKLTPQ-AIQVF-NNRMRASRSAVLKLNTLVEAMNSD-GRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHE 337 (675) Q Consensus 261 ~~L~~~~~~~~~-~~~~l-~~r~~~~k~~~~~~~~~~~~~~~d-gRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~ 337 (675) ++|+++...||. |..+| +|| +++|+.+||++++++.++++ ||||++|+|.+ |.|||+||++|||||||.+++.|. T Consensus 685 ~vLe~L~~~h~~~i~~~Ll~yR-~L~KLksTY~d~Ll~~v~~~TgRiHT~F~Qt~-TaTGRLSS~~PNLQNIPiR~~~Gr 762 (1005) T TIGR00593 685 DVLEKLAEKHPIHIALILLEYR-QLTKLKSTYVDGLLELVNPDTGRIHTTFNQTG-TATGRLSSSNPNLQNIPIRSEEGR 762 (1005) T ss_pred HHHHHHHHCCCCCHHHHHHHHH-HHHHHHHCCHHHHHHHHHHCCCCEEHHHHHHH-HHCCCCCCCCCCCCCCCCCCCCHH T ss_conf 8999887428883689998766-66543101201338984000282731314545-635620047888544332272001 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECC-CCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHH Q ss_conf 3455431012333345544321003333012668-987899983453723333231146201134541133101014764 Q gi|254781193|r 338 ELDQIITHLPTSITSDPDPLQIISDCVRSCVIAS-EGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARS 416 (675) Q Consensus 338 ~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~-~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~ 416 (675) + ||++|+|. .||.||+|||||||||||||+|+|+.|++||.+|.|||+.||+.+ T Consensus 763 ~-------------------------IR~AFva~~~g~~L~~ADYSQIELRiLAHlS~De~Li~AF~~~~DIH~~TA~~~ 817 (1005) T TIGR00593 763 K-------------------------IRKAFVAESKGWLLISADYSQIELRILAHLSQDENLIEAFKNGEDIHTQTASRL 817 (1005) T ss_pred H-------------------------HHHCCCCCCCCCEEEECCCCHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHH T ss_conf 3-------------------------322002468860688526337999999864078679999754887138899997 Q ss_pred CCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 48795670620111016520210028888889987987088831134556555455556788889999887643100010 Q gi|254781193|r 417 FNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIED 496 (675) Q Consensus 417 ~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (675) ||+++++||+++|++||++|||++||||+.+|++ .. T Consensus 818 Fgv~~~~vtp~~RR~AK~vNFGivYGmSafgLA~--------------------------------------------~L 853 (1005) T TIGR00593 818 FGVEEEDVTPNMRRIAKAVNFGIVYGMSAFGLAR--------------------------------------------EL 853 (1005) T ss_pred HCCCHHHCCHHHHCCCCEEEEHHHHCCCHHHHHH--------------------------------------------HH T ss_conf 2989774496460474144301321012057777--------------------------------------------50 Q ss_pred CCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCC Q ss_conf 22078889999999987887899999999999999754917962154211332223445640232476631016786621 Q gi|254781193|r 497 LFIGTACERVKKAWRQKHSRVSELWNELHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNN 576 (675) Q Consensus 497 ~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~ 576 (675) +++..||+.+|+.||+.||+|+.|.. +.+..+.+.|+++| ..||||++|++++. T Consensus 854 ~Is~~EA~~fIe~YF~~yPgv~~Y~~---~~~~~ar~~GYV~T-----------------------l~gRRRY~~~i~s~ 907 (1005) T TIGR00593 854 GISRKEAKEFIERYFERYPGVKEYLE---NTVEEARKKGYVTT-----------------------LLGRRRYIPDINSR 907 (1005) T ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHH---HHHHHHHHCCCEEE-----------------------ECCCEECCCCCCHH T ss_conf 88888999999999763413899999---99999973685563-----------------------04656115763223 Q ss_pred HHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----CC--CCEEEEECCEEEEECCCCHHHHH-HHHHH Q ss_conf 0210100234567775314565545368899999999999996----79--81889952589997792489999-99999 Q gi|254781193|r 577 RSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMMNATK----AG--YDIVLTVHDEIVCETPDTDEFNA-SMLYH 649 (675) Q Consensus 577 ~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~~----~g--~~ivl~VHDEiv~evp~~~~~~~-~~i~~ 649 (675) +...|+. .||.+ .|.|||||||||||.|||+|++ .+ .+|+|||||||+||+|+++.+.+ +.|++ T Consensus 908 n~~~R~~---AER~A------~NaPIQGSAADI~K~AMikl~~~L~~~~L~~rlLLQVHDELl~E~P~~E~e~v~~l~k~ 978 (1005) T TIGR00593 908 NRNLREA---AERMA------INAPIQGSAADIMKIAMIKLDKALKERKLKSRLLLQVHDELLFEAPEEEAEEVKALVKE 978 (1005) T ss_pred HHHHHHH---HHHHH------HCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 5777999---99987------36753468789999999999999973250333467865588723444137899999998 Q ss_pred HHC-CCCHHCCCCCCEEEECCCCCCC Q ss_conf 845-6520215995157401004631 Q gi|254781193|r 650 LMT-SNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 650 ~M~-~~~~w~~~~pl~ae~~~~~~y~ 674 (675) +|+ ++-+ ..+||.+|-..|..|. T Consensus 979 ~MED~A~~--L~VPL~v~~~~G~~W~ 1002 (1005) T TIGR00593 979 VMEDEAYT--LSVPLEVEVETGKNWG 1002 (1005) T ss_pred HHCCEEEE--CCCCCEEECCCCCCHH T ss_conf 71461010--2576367524587711 No 16 >PRK07300 consensus Probab=100.00 E-value=0 Score=857.93 Aligned_cols=515 Identities=19% Similarity=0.211 Sum_probs=426.7 Q ss_pred CEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC-- Q ss_conf 819999999848039999788810437999998638898199973077499999860899884203316889998678-- Q gi|254781193|r 28 VCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSNGL-- 105 (675) Q Consensus 28 ~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~-- 105 (675) ..+++++| +.++..|+...........+...+.. .++.||++||..+|.++ |++++ +..+||||+||++++ T Consensus 336 a~lvG~s~--~~~~~~y~~~~~~~l~~~~~~~~~~~---~i~~~nlK~~~~vL~~~-GI~l~-~~~fDTmLAsYLL~p~~ 408 (880) T PRK07300 336 EAIIGFAW--GNGEQIYASTDLSLLATDSFKQVLQK---PIATYDFKRSKVLLSHL-GIELP-APSFDARLAKYLLSTVE 408 (880) T ss_pred CCEEEEEE--ECCCEEEECCCHHHHHCHHHHHHHHH---HHHHHHHHHHHHHHHHC-CCCCC-CCHHHHHHHHHCCCCCC T ss_conf 77799998--55975895581332117688887653---56550378689999977-98667-74034899985368765 Q ss_pred CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCH----- Q ss_conf 99968989982-998110043799998620211234332346785853289999998998999999996301040----- Q gi|254781193|r 106 PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLS----- 179 (675) Q Consensus 106 ~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~----- 179 (675) .++|+.+++.+ +..... .+.+. ++.. .... + ..+....|++.|+.++++|+..|.+.+ T Consensus 409 ~~~Ld~La~~yl~~~~~~---~e~l~---Gkg~------k~~~---~-~~~~~~~yaa~da~~~~~L~~~L~~~L~~~~l 472 (880) T PRK07300 409 DNELSTIARLYTDISLET---DDTVY---GKGA------KRAI---P-EKEVLLEHLARKVKVLLDSKEQMLDKLTAHEQ 472 (880) T ss_pred CCCHHHHHHHHCCCCCCC---HHHHC---CCCC------CCCC---C-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 587789999946888873---37663---6652------1036---6-27799999999999999999998999876169 Q ss_pred -----HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCC-- Q ss_conf -----0015133666653115621379999999999999999999999987288547678879998877775588888-- Q gi|254781193|r 180 -----SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERTKLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLE-- 252 (675) Q Consensus 180 -----~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~-- 252 (675) +.|+++..++++||.+||.||.+.+.++..++..++++++++|+++ .|..||+|||+||+++||++||+|+. T Consensus 473 ~~L~~~IE~PLi~VLa~ME~~GI~VD~~~L~~ls~el~~~l~~Le~~Iy~l-AG~eFNInSPKQLgeILFekLgLP~~~~ 551 (880) T PRK07300 473 LDLLFDMELPLANVLAKMEIAGIKVNRETLQEMAAENEVVIEALTQEIYDL-AGQEFNINSPKQLGVILFEKMGLPLEMT 551 (880) T ss_pred HHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCCCCCCHHHHHHHHHHHCCCCCCCC T ss_conf 999999986899999999965808889999999999999999999999986-5876577999999999998449987677 Q ss_pred CC---CCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCC Q ss_conf 89---885338998721304823899999999976551013357887408689676778840001322112478884448 Q gi|254781193|r 253 DL---TETTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLNTLVEAMNSDGRLRGTLQFLGASRTGRWSGCVFPPQNL 329 (675) Q Consensus 253 ~~---~k~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~~~~~~~~~dgRih~~~~~~gat~TGR~S~~~pNlQnl 329 (675) ++ +.+|+.++|+.+++.||.+..+|+|| +++|+.+||++++++.+++|||||++|||.+ |.|||+||++|||||| T Consensus 552 KKTKTGySTd~evLe~L~~~hpi~~~ILeyR-~l~KLkSTY~d~L~~~i~~~gRIHTsf~Qt~-TaTGRLSS~~PNLQNI 629 (880) T PRK07300 552 KKTKTGYSTAVDVLERLAPIAPIVAKILEYR-QITKLQSTYVIGLQDYILADGKIHTRYVQDL-TQTGRLSSVDPNLQNI 629 (880) T ss_pred CCCCCCCCCHHHHHHHHHHCCCCHHHHHHHH-HHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH-CCCCEECCCCCCCCCC T ss_conf 7788887857999999766187489999984-8999999989989985475461221356423-1334211478763468 Q ss_pred CCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECC-CCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCC Q ss_conf 888865113455431012333345544321003333012668-9878999834537233332311462011345411331 Q gi|254781193|r 330 PRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIAS-EGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDL 408 (675) Q Consensus 330 Pr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~-~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~ 408 (675) |.+++.|.. ||+||+|. +||+|++|||||||||||||+|+|+.|++||++|.|| T Consensus 630 PiRte~Gr~-------------------------IR~aFi~~~~~~~llsaDYSQIELRilAhlS~D~~Li~AF~~~~DI 684 (880) T PRK07300 630 PVRLEQGRL-------------------------IRKAFVPSHEDAVLLSSDYSQIELRVLAHISGDEHLIAAFKEGADI 684 (880) T ss_pred CCCCCCCHH-------------------------HHHEEECCCCCEEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCH T ss_conf 877743520-------------------------4412732788559998232268999999981888899987158877 Q ss_pred HHHHHHHHCCCC-HHHHCCHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 010147644879-5670620111016520210028888889987987088831134556555455556788889999887 Q gi|254781193|r 409 YVTTYARSFNVA-PAQVTKDQRAIGKVMELALGYQGGAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQE 487 (675) Q Consensus 409 h~~tA~~~~g~~-~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (675) |+.||+.+||+| .++|++++|+.||++|||++||||+.+|++ T Consensus 685 H~~TA~~if~v~~~~~Vt~~~Rr~AK~iNFGiiYG~s~fgLs~------------------------------------- 727 (880) T PRK07300 685 HTSTAMRVFGIEKAEDVTANDRRNAKAVNFGIVYGISDFGLSN------------------------------------- 727 (880) T ss_pred HHHHHHHHHCCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH------------------------------------- T ss_conf 8999999949973200899999998631211112788677888------------------------------------- Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCC Q ss_conf 64310001022078889999999987887899999999999999754917962154211332223445640232476631 Q gi|254781193|r 488 NHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRS 567 (675) Q Consensus 488 ~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~ 567 (675) ..+++..||+.+++.|++.||+|++|..++-..+++ .++++|..||| T Consensus 728 -------~l~i~~~eA~~~I~~Yf~~yp~v~~~~~~~~~~a~~--------------------------~Gyv~Tl~gRr 774 (880) T PRK07300 728 -------NLGITRKQAKSYIDTYFERYPGIKNYMENVVREAKD--------------------------KGYVETLFKRR 774 (880) T ss_pred -------HCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH--------------------------CCCEECCCCCE T ss_conf -------649999999999999999691099999999999997--------------------------68877777881 Q ss_pred CCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC------CCCEEEEECCEEEEECCCCHH Q ss_conf 01678662102101002345677753145655453688999999999999967------981889952589997792489 Q gi|254781193|r 568 LVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMMNATKA------GYDIVLTVHDEIVCETPDTDE 641 (675) Q Consensus 568 ~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~~~------g~~ivl~VHDEiv~evp~~~~ 641 (675) |++|++++.+...+.. .+|+ ++|+|||||||||+|.||+++++. ..+|+||||||||||||+++. T Consensus 775 r~ip~i~s~n~~~r~~---~eR~------AiN~piQGtAADiiK~AMi~~~~~l~~~~~~~~mlLQVHDELvfEv~~~~~ 845 (880) T PRK07300 775 RELPDINSRNFNVRSF---AERT------AINSPIQGSAADILKIAMINLDKALQAGGFKSKMLLQVHDEIVLEVPNDEL 845 (880) T ss_pred ECCCCCCCCCHHHHHH---HHHH------HHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHH T ss_conf 3589867888778879---9898------816024178999999999999999984598851899974045885178899 Q ss_pred HHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf 999-999998456520215995157401004631 Q gi|254781193|r 642 FNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 642 ~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) +++ +.|+++|+.+.+ ..+||.+|..+|+.|. T Consensus 846 ~~~~~~v~~~Me~~~~--l~VPL~v~~~~G~nW~ 877 (880) T PRK07300 846 TAIKKLVKETMEAAIE--LAVPLIADENTGQTWY 877 (880) T ss_pred HHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 9999999999837524--7888373018488967 No 17 >COG0749 PolA DNA polymerase I - 3'-5' exonuclease and polymerase domains [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=809.46 Aligned_cols=536 Identities=22% Similarity=0.258 Sum_probs=451.4 Q ss_pred EEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCH---HHHHHHHHHHCCCCCEEEEEEHHHHHHHH Q ss_conf 999814178987000440011279819999999848039999788810---43799999863889819997307749999 Q gi|254781193|r 4 LFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDP---VMPQVLKEALENTDVTIVAHNSYFERTVL 80 (675) Q Consensus 4 i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~---~~~~~l~~~L~~~~~~kV~HNa~FD~~~L 80 (675) +++|+||.+. +. ....++++++++.. .+.|++..+++ .....++.+|++++..+|+||++||.++| T Consensus 25 ~a~~~et~~l--------~~--~~~~lvg~s~~~~~-~~~yi~~~~~~~~~~~~~~l~~~l~~~~~~kv~~~~K~d~~~l 93 (593) T COG0749 25 IAFDTETDGL--------DP--HGADLVGLSVASEE-EAAYIPLLHGPEQLNVLAALKPLLEDEGIKKVGQNLKYDYKVL 93 (593) T ss_pred CCEECCCCCC--------CC--CCCCEEEEEEECCC-CCEEEEECCCHHHHHHHHHHHHHHHCCCCCHHCCCCCHHHHHH T ss_conf 1010001456--------75--55874678862365-4236751366555514899999863754103213413669999 Q ss_pred HHHCCCCCCCCCEEEHHHHHHHHCCC---CCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 98608998842033168899986789---996898998299811004379999862021123433234678585328999 Q gi|254781193|r 81 KACFNLDIPIHRWICTLVLARSNGLP---SALKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLF 157 (675) Q Consensus 81 ~~~~Gi~i~~~~~~DTmi~A~~l~~~---~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~ 157 (675) +++ |+. .+..+||||++|+++++ +++++++.++ +.. .....+.+.. + ......+.....+.. T Consensus 94 ~~~-Gi~--~~~~~DtmlasYll~~~~~~~~~~~l~~r~-l~~-~~~~~~~i~~---k-------g~~~~~~~~~~~~~~ 158 (593) T COG0749 94 ANL-GIE--PGVAFDTMLASYLLNPGAGAHNLDDLAKRY-LGL-ETITFEDIAG---K-------GKKQLTFADVKLEKA 158 (593) T ss_pred HHC-CCC--CCCHHHHHHHHHCCCCCCCCCCHHHHHHHH-CCC-CCCHHHHHHC---C-------CCCCCCCCCCHHHHH T ss_conf 973-976--565178999996148676757778999982-587-7402487632---5-------445676100147788 Q ss_pred HHHHHHHHHHHHHHHHHCCCCH-----------HCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9998998999999996301040-----------00151336666531156213799999999999999999999999872 Q gi|254781193|r 158 GEYCKGDVTVTREIFKRLEPLS-----------SKEQELWYLDQTINDRGYCVDVALAHTLNQIVDVERTKLDQELESLT 226 (675) Q Consensus 158 ~~Ya~~Dv~~t~~L~~~l~~~~-----------~~E~~~~~~~~~me~~Gi~vD~e~l~~l~~~~~~~~~~l~~~i~~~~ 226 (675) ..|++.|+.+|++|+..|.+.+ +.|+++..++..||.+||.||.+.+..+..++..++.+++.+++++ T Consensus 159 ~~y~a~~a~~~~~L~~~l~~~l~~~~~L~~l~~~iE~Pl~~VLa~ME~~Gi~vD~~~L~~l~~el~~~l~~le~eiy~l- 237 (593) T COG0749 159 TEYAAEDADATLRLESILEPELLKTPVLLELYEEIEMPLVRVLARMERNGIKVDVQYLKELSKELGCELAELEEEIYEL- 237 (593) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCEECHHHHHHHHHHHHHHHHHHHHHHHHH- T ss_conf 8878888899999999999876310469999999754089999986862856429999999999999999999999998- Q ss_pred CCCCCCCCCHHHHHHHHHHHCCCCC-CCCC----CCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8854767887999887777558888-8898----8533899872130482389999999997655101335788740868 Q gi|254781193|r 227 YGLVSSSRCLEKLKTWLFLETGLHL-EDLT----ETTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLNTLVEAMNSD 301 (675) Q Consensus 227 ~~~~~n~~S~~ql~~~L~~~lg~~~-~~~~----k~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~~~~~~~~~d 301 (675) .|..||++||+||+.+||+++|+|+ .+++ .+|..++|+.+...||.++.+|+||+ +.|+++||.+++++.++++ T Consensus 238 aG~~FNi~SPKQL~~ILfeKl~Lp~~~kKtktG~yST~~~vLe~L~~~h~i~~~iL~~Rq-l~KLksTY~d~L~~~i~~~ 316 (593) T COG0749 238 AGEEFNINSPKQLGEILFEKLGLPPGLKKTKTGNYSTDAEVLEKLADDHPLPKLILEYRQ-LAKLKSTYTDGLPKLINPD 316 (593) T ss_pred HCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHH-HHHHHHHHHHCCHHHHCCC T ss_conf 268578997789999988851898555545789976279999986524873899999987-8888777640228751888 Q ss_pred -CCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECCCCCEEEEEE Q ss_conf -9676778840001322112478884448888865113455431012333345544321003333012668987899983 Q gi|254781193|r 302 -GRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVIASEGKKLVVAD 380 (675) Q Consensus 302 -gRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G~~lv~aD 380 (675) ||||++|+|.+ |.|||+||++|||||||.++..|.+ ||++|+|++||.+++|| T Consensus 317 t~RIHTsf~Q~~-t~TGRLSSsdPNLQNIPiRse~Gr~-------------------------IR~aFva~~g~~~i~aD 370 (593) T COG0749 317 TGRIHTSFNQTG-TATGRLSSSDPNLQNIPIRSEEGRK-------------------------IRKAFVAEKGYTLISAD 370 (593) T ss_pred CCCCCCCHHHHH-HHHHCCCCCCCCCCCCCCCCHHHHH-------------------------HHHCEECCCCCEEEEEC T ss_conf 993276168777-7750335799975567767776786-------------------------66405579998688701 Q ss_pred CCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCC Q ss_conf 45372333323114620113454113310101476448795670620111016520210028888889987987088831 Q gi|254781193|r 381 LAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKMASTTGLNLN 460 (675) Q Consensus 381 ySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~ 460 (675) |||||||||||+|+|+.|++||.+|.|+|+.||+.+||+|+++|++++|+.||++|||++|||++.+|++ T Consensus 371 YSQIELRilAHls~D~~Ll~AF~~g~DiH~~TA~~vFgv~~~~Vt~e~Rr~AKaINFGiiYG~safgLa~---------- 440 (593) T COG0749 371 YSQIELRILAHLSQDEGLLRAFTEGEDIHTATAAEVFGVPIEEVTSEQRRKAKAINFGLIYGMSAFGLAQ---------- 440 (593) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHCCCEEECCCHHHHHH---------- T ss_conf 5799999999861889999988658637789999994998233899987565530354143451334798---------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCEEEEE Q ss_conf 13455655545555678888999988764310001022078889999999987887899999999999999754917962 Q gi|254781193|r 461 TFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELWNELHQAFEQTIENGKAIIA 540 (675) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~~~ 540 (675) ..+++..||+.+++.|++.||+|..|..++...++ ..+ T Consensus 441 ----------------------------------~L~I~~~eA~~~I~~YF~rypgv~~ym~~~~~~ar---~~G----- 478 (593) T COG0749 441 ----------------------------------QLGIPRKEAKEYIDRYFERYPGVKEYMERTKEEAR---EDG----- 478 (593) T ss_pred ----------------------------------HCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHH---HCC----- T ss_conf ----------------------------------74988688899999999868689999999999999---749----- Q ss_pred CCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 15421133222344564023247663101678662102101002345677753145655453688999999999999967 Q gi|254781193|r 541 RKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMMNATKA 620 (675) Q Consensus 541 ~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~~~ 620 (675) ++++..|||+++|++++.+...+. ..+|. ++|.|||||||||+|.|||.+++. T Consensus 479 ------------------yV~Tl~gRRry~p~i~s~n~~~R~---~aER~------AiNaPIQGTAADiiK~AMI~vd~~ 531 (593) T COG0749 479 ------------------YVETLFGRRRYLPDINSSNRVVRA---AAERA------AINAPIQGTAADIIKLAMIKVDKA 531 (593) T ss_pred ------------------CEEECCCCCCCCCCCCCCCHHHHH---HHHHH------HHCCCCCCCHHHHHHHHHHHHHHH T ss_conf ------------------354102660248621017788887---88898------835767444899999999867999 Q ss_pred ------CCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCEEEECCCCCCC Q ss_conf ------981889952589997792489999-999998456520215995157401004631 Q gi|254781193|r 621 ------GYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLKAEGYESKRYR 674 (675) Q Consensus 621 ------g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ae~~~~~~y~ 674 (675) ..+++||||||+|||||+++.+.+ +.|.+.|+.+.. ..+||..|-.+|..|. T Consensus 532 l~~~~~~~rllLQVHDELvfEv~~~e~e~~~~~v~~~Me~a~~--L~VPL~vdv~~g~nW~ 590 (593) T COG0749 532 LKEEKLKARLLLQVHDELVFEVPKEELEEVKKLLKAIMENAVN--LSVPLEVDVGIGKNWD 590 (593) T ss_pred HHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHC--CCCCEEEECCCCCCHH T ss_conf 8634321456775545564117686899999999999987650--4786698617776834 No 18 >pfam00476 DNA_pol_A DNA polymerase family A. Probab=100.00 E-value=0 Score=651.80 Aligned_cols=367 Identities=24% Similarity=0.285 Sum_probs=324.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCC-----CCCCHHHHHHHHH-CCCCH Q ss_conf 37999999999999999999999998728854767887999887777558888889-----8853389987213-04823 Q gi|254781193|r 199 VDVALAHTLNQIVDVERTKLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLEDL-----TETTCQRVLSSNK-KLTPQ 272 (675) Q Consensus 199 vD~e~l~~l~~~~~~~~~~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~-----~k~t~~~~L~~~~-~~~~~ 272 (675) ||.+.++++..++..++.+++.+++.+. |..||++||+|++++||+++|+|+.+. ..+|++++|+.+. +.||. T Consensus 1 vd~~~l~~~~~~l~~~~~~l~~~i~~~a-G~~fN~~S~~ql~~vLf~~l~l~~~~~t~~~g~~st~~~~L~~l~~~~hp~ 79 (383) T pfam00476 1 VDVEYLEELSNELGAQLADLEFKIPELA-GEEFNLGSPKQLGVLLFEELGLPKTKKTDKTGARSTNADVLESLREDAHEI 79 (383) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCHH T ss_conf 9889999999999999999999999995-997699998999998497359999997789999867899999876168869 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCC Q ss_conf 8999999999765510133578874086-896767788400013221124788844488888651134554310123333 Q gi|254781193|r 273 AIQVFNNRMRASRSAVLKLNTLVEAMNS-DGRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSIT 351 (675) Q Consensus 273 ~~~~l~~r~~~~k~~~~~~~~~~~~~~~-dgRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~ 351 (675) +..+++||. ++|+.+||+++++..+++ |||||++|||+| |+|||+||++|||||||++++.+ T Consensus 80 ~~~il~~R~-~~Kl~~t~~~~l~~~~~~~~gRih~~~nq~g-t~TGRlSs~~PNlQniP~~~~~g--------------- 142 (383) T pfam00476 80 IKIILEYRQ-LSKLQSTYVDKLPLMIDPDDGRIHTSYNQAG-TATGRLSSTDPNLQNIPIRNEYG--------------- 142 (383) T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHCCCCCEECCEEECCC-CCEEECCCCCCCCCCCCCCCCCC--------------- T ss_conf 999999999-9999999999988862678986801242264-20220356798645799888430--------------- Q ss_pred CCCCCCCCCHHHHHEEEECCCCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHH Q ss_conf 45544321003333012668987899983453723333231146201134541133101014764487956706201110 Q gi|254781193|r 352 SDPDPLQIISDCVRSCVIASEGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAI 431 (675) Q Consensus 352 ~~~~~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~ 431 (675) ..||+||+|+|||+||+|||||||+||+||+|+|+.|+++|++|.|+|+.||+.+||+|++++++++|+. T Consensus 143 ----------~~iR~~Fva~~G~~lv~aDySQiElRilA~ls~D~~l~~af~~g~Dih~~tA~~ifg~~~~~v~~~~R~~ 212 (383) T pfam00476 143 ----------REIRAAFIAEPGYVLVAADYSQIELRILAHLSGDENLIEAFRTGADIHTLTAADIFGVDLHEVTGNQRRN 212 (383) T ss_pred ----------HHHHHEEECCCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHH T ss_conf ----------5567436279998788505877668888752054889998702765889889998499978868556767 Q ss_pred CCCCEEEEECCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 16520210028888889987987088831134556555455556788889999887643100010220788899999999 Q gi|254781193|r 432 GKVMELALGYQGGAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWR 511 (675) Q Consensus 432 aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r 511 (675) ||++|||++||||+.+|++ ..+++..||+.+++.|+ T Consensus 213 AK~~~fGi~YG~g~~~La~--------------------------------------------~l~is~~eA~~~i~~~~ 248 (383) T pfam00476 213 AKTFNFGRIYGASAKGLSQ--------------------------------------------LLGISREEAKEFIEKYF 248 (383) T ss_pred HHHHHHHHHHCCCHHHHHH--------------------------------------------HHCCCHHHHHHHHHHHH T ss_conf 7771316644846889998--------------------------------------------85888999999999999 Q ss_pred HHCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHH Q ss_conf 87887899999999999999754917962154211332223445640232476631016786621021010023456777 Q gi|254781193|r 512 QKHSRVSELWNELHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREY 591 (675) Q Consensus 512 ~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~ 591 (675) +.||+|.+|+.++...++. .+++.++.||++++|++++.+...+. ..+| T Consensus 249 ~~yp~i~~~~~~~~~~a~~--------------------------~g~v~T~~GRrr~lp~~~~~~~~~~~---~~~r-- 297 (383) T pfam00476 249 ERFPGVKRYREKTRKEAKK--------------------------GGYVETLFGRRRYLPDIDSRNRSLRE---AAER-- 297 (383) T ss_pred HHCHHHHHHHHHHHHHHHH--------------------------CCCCCCCCCCEEECCCCCCCCHHHHH---HHHH-- T ss_conf 9783699999999999996--------------------------89611456876755775666788886---7767-- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHC------CCCEEEEECCEEEEECCCCHHHHH-HHHHHHHCCCCHHCCCCCCE Q ss_conf 53145655453688999999999999967------981889952589997792489999-99999845652021599515 Q gi|254781193|r 592 TYGGKLTENIVQAVSRDILAEGMMNATKA------GYDIVLTVHDEIVCETPDTDEFNA-SMLYHLMTSNPSWAKGLPLK 664 (675) Q Consensus 592 ~~~~k~~n~~iQgtaadi~k~ami~l~~~------g~~ivl~VHDEiv~evp~~~~~~~-~~i~~~M~~~~~w~~~~pl~ 664 (675) .++|++||||||||+|.||+++++. +.+||+|||||||||||+++++++ ..|++||+.+.+ ..+||. T Consensus 298 ----~a~N~~IQGsAADi~k~Ami~i~~~l~~~~~~~~lvlqVHDElv~evp~~~~~~~~~~v~~~Me~~~~--l~VPl~ 371 (383) T pfam00476 298 ----AALNTPIQGSAADILKLAMIKLDEALVEKGLDARMCLQVHDEIVFEVPKEEAEAVAAQIKELMERAMF--LDVPLL 371 (383) T ss_pred ----HHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHHHCC--CCCCEE T ss_conf ----65353268689999999999999999854999789996354118997689999999999999960427--899679 Q ss_pred EEECCCCCCC Q ss_conf 7401004631 Q gi|254781193|r 665 AEGYESKRYR 674 (675) Q Consensus 665 ae~~~~~~y~ 674 (675) +|..+|+.|. T Consensus 372 v~~~~G~~Wg 381 (383) T pfam00476 372 VEVGQGRNWF 381 (383) T ss_pred EECCCCCCHH T ss_conf 6538278801 No 19 >cd06444 DNA_pol_A DNA polymerase family A, 5'-3' polymerase domain. Family A polymerase (polymerase I) functions primarily to fill DNA gaps that arise during DNA repair, recombination and replication. DNA-dependent DNA polymerases can be classified into six main groups based upon phylogenetic relationships with E. coli polymerase I (classA), E. coli polymerase II (class B), E.coli polymerase III (class C), euryarchaeota polymerase II (class D), human polymerase beta (class X), E. coli UmuC/DinB and eukaryotic RAP 30/Xeroderma pigmentosum variant (class Y). Family A polymerases are found primarily in organisms related to prokaryotes and include prokaryotic DNA polymerase I, mitochondrial polymerase gamma, and several bacteriophage polymerases including those from odd-numbered phage (T3, T5, and T7). Prokaryotic Pol I has two functional domains located on the same polypeptide; a 5'-3' polymerase and a 5'-3' exonuclease. Pol I uses its 5' nuclease activity to remove the ribonucleotide po Probab=100.00 E-value=0 Score=644.23 Aligned_cols=367 Identities=22% Similarity=0.294 Sum_probs=326.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC---CCHHHHHHHHHCCCCHHHH Q ss_conf 3799999999999999999999999872885476788799988777755888888988---5338998721304823899 Q gi|254781193|r 199 VDVALAHTLNQIVDVERTKLDQELESLTYGLVSSSRCLEKLKTWLFLETGLHLEDLTE---TTCQRVLSSNKKLTPQAIQ 275 (675) Q Consensus 199 vD~e~l~~l~~~~~~~~~~l~~~i~~~~~~~~~n~~S~~ql~~~L~~~lg~~~~~~~k---~t~~~~L~~~~~~~~~~~~ 275 (675) +|.+.++.+...+..++++++.+++.++| ..||++||+|++++||+.+|+|++++++ +|+++.|+.+...||.+.. T Consensus 1 id~~~~~~~~~~l~~~l~~l~~~~~~~~g-~~fn~~S~~ql~~~L~~~lg~~~~~~~~~~~st~~~~l~~~~~~hp~~~~ 79 (379) T cd06444 1 VDKEELEELSAELEKELEELEKEIYELAG-EEFNINSPKQLGEVLFEKLGLPVSKKTKTGYSTDEEVLEKLAEAHPIIAL 79 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHH T ss_conf 98899999999999999999999999969-90799999999999998549998988889999859999976433848999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCC-CCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCC Q ss_conf 999999976551013357887408-6896767788400013221124788844488888651134554310123333455 Q gi|254781193|r 276 VFNNRMRASRSAVLKLNTLVEAMN-SDGRLRGTLQFLGASRTGRWSGCVFPPQNLPRPSRSHEELDQIITHLPTSITSDP 354 (675) Q Consensus 276 ~l~~r~~~~k~~~~~~~~~~~~~~-~dgRih~~~~~~gat~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~ 354 (675) ++++| .++|+.++|.+.+++.++ .|||||++|+++| |+|||+||++|||||+|++++.+ T Consensus 80 ile~R-~~~kl~st~~~~~~~~~~~~~gRih~~~~q~g-t~TGR~Ss~~PNlQniP~~~~~~------------------ 139 (379) T cd06444 80 ILEYR-KLAKLKSTYVDALPKLINPKTGRIHTSFNQTV-TATGRLSSSNPNLQNIPIRTEEG------------------ 139 (379) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHCCCCCEEEEEEECCC-CCCCCCCCCCCCCCCCCCCCHHH------------------ T ss_conf 99999-99999999999998760689997966885277-54001246787645788887116------------------ Q ss_pred CCCCCCHHHHHEEEECCCCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCC Q ss_conf 44321003333012668987899983453723333231146201134541133101014764487956706201110165 Q gi|254781193|r 355 DPLQIISDCVRSCVIASEGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKV 434 (675) Q Consensus 355 ~~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~ 434 (675) ..+|+||+|+|||+||++||||||+||+||||+|+.|+++|++|.|+|+.||+.+||+|++++++++|+.||+ T Consensus 140 -------~~~R~~f~~~~G~~lv~aDySqiElRvlA~ls~D~~l~~~f~~g~Dih~~~A~~~~~~~~~~v~~~~R~~aK~ 212 (379) T cd06444 140 -------REIRKAFVAPPGYVLLSADYSQIELRILAHLSGDPALIEAFNNGEDIHSRTAAEVFGVPEEEVTKEERRKAKA 212 (379) T ss_pred -------HHHHHEEECCCCCEEEEECHHHCHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHH T ss_conf -------7675247479998799842311218888887176999999873887899999998399844359889988657 Q ss_pred CEEEEECCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC Q ss_conf 20210028888889987987088831134556555455556788889999887643100010220788899999999878 Q gi|254781193|r 435 MELALGYQGGAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKH 514 (675) Q Consensus 435 ~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~ 514 (675) ++||++||||+.+|+. +. +++..||+.+++.|++.| T Consensus 213 ~~fgl~YG~G~~~la~---~l-----------------------------------------~~s~~eA~~~~~~f~~~~ 248 (379) T cd06444 213 INFGIIYGMGAFGLAE---QL-----------------------------------------GISREEAKEFIDRYFATY 248 (379) T ss_pred HHHHHHHCCCHHHHHH---CC-----------------------------------------CCCHHHHHHHHHHHHHHC T ss_conf 6757532834532975---01-----------------------------------------999999999999999978 Q ss_pred HHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHH Q ss_conf 87899999999999999754917962154211332223445640232476631016786621021010023456777531 Q gi|254781193|r 515 SRVSELWNELHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYG 594 (675) Q Consensus 515 p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~ 594 (675) |+|.+|++.+...++. .+++++++||++++|++++.+...++. .+| T Consensus 249 p~v~~~~~~~~~~a~~--------------------------~g~v~T~~GR~r~~~~~~~~~~~~~~~---~~r----- 294 (379) T cd06444 249 PGVKEYLEKTVEEARE--------------------------GGYVETLFGRRRYLPEINSPNKNVRSA---AER----- 294 (379) T ss_pred CCHHHHHHHHHHHHHH--------------------------CCCEECCCCCEECCCCCCCCCHHHHHH---HHH----- T ss_conf 6566789999999997--------------------------797766778853477556788889878---766----- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHC------CCCEEEEECCEEEEECCCCHHHH-HHHHHHHHCCCCHHCCCCCCEEEE Q ss_conf 45655453688999999999999967------98188995258999779248999-999999845652021599515740 Q gi|254781193|r 595 GKLTENIVQAVSRDILAEGMMNATKA------GYDIVLTVHDEIVCETPDTDEFN-ASMLYHLMTSNPSWAKGLPLKAEG 667 (675) Q Consensus 595 ~k~~n~~iQgtaadi~k~ami~l~~~------g~~ivl~VHDEiv~evp~~~~~~-~~~i~~~M~~~~~w~~~~pl~ae~ 667 (675) .++|++||||||||+|.||+++++. +++||+|||||||+|||+++.++ ++.+++||+++.++ .+||.+|. T Consensus 295 -~a~N~~iQGsaAdi~k~ami~i~~~l~~~~~~~~lv~~VHDEiv~ev~~~~~~~~~~~i~~~Me~a~~l--~VPl~v~~ 371 (379) T cd06444 295 -QAINTPIQGSAADIIKLAMIKIEKLLKEKGLDARLLLQVHDELIFEVPEEELEEAAALVKENMENAVKL--SVPLKVDI 371 (379) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCCC--CCCEEEEC T ss_conf -567787465899999999999999999629997899972542489987999999999999998563577--88378663 Q ss_pred CCCCCCC Q ss_conf 1004631 Q gi|254781193|r 668 YESKRYR 674 (675) Q Consensus 668 ~~~~~y~ 674 (675) .+|+.|. T Consensus 372 ~iG~~Wg 378 (379) T cd06444 372 KIGKNWG 378 (379) T ss_pred CCCCCCC T ss_conf 7269999 No 20 >smart00482 POLAc DNA polymerase A domain. Probab=100.00 E-value=0 Score=366.62 Aligned_cols=199 Identities=27% Similarity=0.378 Sum_probs=171.6 Q ss_pred HHHHEEEECCCCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEEEC Q ss_conf 33330126689878999834537233332311462011345411331010147644879567062011101652021002 Q gi|254781193|r 362 DCVRSCVIASEGKKLVVADLAGIEARVLAWMAGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELALGY 441 (675) Q Consensus 362 ~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~~Y 441 (675) +.+|+||+|+|||+||++||||||+||+||||+|+.|+++|.+|.|+|+.+|+.+||+|++++++++|+.||+++||++| T Consensus 2 ~~~R~~f~a~~G~~lv~aD~sqiE~Rv~A~la~D~~l~~~~~~g~D~h~~~A~~~f~~~~~~~~~~~R~~aK~~~~g~~Y 81 (206) T smart00482 2 REIRRAFVAPPGYVLVSADYSQIELRILAHLSGDENLLEAFNNGGDIHSKTAAQVFGVPEEEVTKELRRAAKAINFGIIY 81 (206) T ss_pred CHHHCEEECCCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHCCCCHHHCCHHHHHHHCHHHHHHHC T ss_conf 04322676799999999886689999999982987799987438986899999964988011575669885678886651 Q ss_pred CCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHH Q ss_conf 88888899879870888311345565554555567888899998876431000102207888999999998788789999 Q gi|254781193|r 442 QGGAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSELW 521 (675) Q Consensus 442 G~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~~ 521 (675) |||+.+|. ...| ++..|++.+++.|++.||.|.+|| T Consensus 82 G~G~~~la---~~~g-----------------------------------------~~~~ea~~~~~~~~~~~p~i~~~~ 117 (206) T smart00482 82 GMGAKGLA---EQLG-----------------------------------------ISEAEAKELIKAYFARFPGVKRYI 117 (206) T ss_pred CCCCCCHH---HHCC-----------------------------------------CCHHHHHHHHHHHHHHCHHHHHHH T ss_conf 86511024---5518-----------------------------------------899999999999999783599999 Q ss_pred HHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999997549179621542113322234456402324766310167866210210100234567775314565545 Q gi|254781193|r 522 NELHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENI 601 (675) Q Consensus 522 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~ 601 (675) +++...+.. .+++.++.||++++|+.......... ..+ ++++|++ T Consensus 118 ~~~~~~~~~--------------------------~g~v~t~~Gr~~~~~~~~~~~~~~~~---~~~------~~~~N~~ 162 (206) T smart00482 118 KRTLEEARR--------------------------KGYVTTLFGRRRYIPDIDSRNPVLRA---AAE------RAAVNAP 162 (206) T ss_pred HHHHHHHHH--------------------------CCCEECCCCCEEECCCCCCCCHHHHH---HHH------HHHCCHH T ss_conf 999999987--------------------------79666788877767876875654553---578------7660710 Q ss_pred HHHHHHHHHHHHHHHHHHC------CCCEEEEECCEEEEECCCC Q ss_conf 3688999999999999967------9818899525899977924 Q gi|254781193|r 602 VQAVSRDILAEGMMNATKA------GYDIVLTVHDEIVCETPDT 639 (675) Q Consensus 602 iQgtaadi~k~ami~l~~~------g~~ivl~VHDEiv~evp~~ 639 (675) ||||||||++.||+++++. +.++++|||||||+|||++ T Consensus 163 vQgtaadi~~~al~~~~~~~~~~~~~~~iv~~VHDeiv~evp~e 206 (206) T smart00482 163 IQGSAADILKLAMIKMDEALKEKGLRARLLLQVHDELVFEVPEE 206 (206) T ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCC T ss_conf 22899999999999999999853999779985360259961899 No 21 >cd06139 DNA_polA_I_Ecoli_like_exo The 3'-5' exonuclease domain of DNA polymerases has a fundamental role in reducing polymerase errors and is involved in proofreading activity. E.coli-like Polymerase I (pol I), a subgroup of family-A DNA polymerases, contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain in the same polypeptide chain as the polymerase domain. The exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. E. coli DNA pol I is involved in genome replication but is not the main replicating enzyme. It is also implicated in DNA repair. Probab=99.94 E-value=3.3e-25 Score=181.56 Aligned_cols=173 Identities=18% Similarity=0.170 Sum_probs=131.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCC--------HHHHHHHHHHHCCCCCEEEEEEH Q ss_conf 6899981417898700044001127981999999984803999978881--------04379999986388981999730 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHD--------PVMPQVLKEALENTDVTIVAHNS 73 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~--------~~~~~~l~~~L~~~~~~kV~HNa 73 (675) +.+|+|+||++. +++ ..+|+|||++++.+..+|+|+.|. +.+.+.++++|+++.+.||+||+ T Consensus 6 ~~ia~DtETtgl--------~~~--~~~ivGisls~~~~~~~YiP~~h~~~~~~l~~~~v~~~lk~~led~~i~Kv~hn~ 75 (193) T cd06139 6 KVFAFDTETTSL--------DPM--QAELVGISFAVEPGEAYYIPLGHDYGGEQLPREEVLAALKPLLEDPSIKKVGQNL 75 (193) T ss_pred CEEEEECCCCCC--------CCC--CCEEEEEEEEECCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEECC T ss_conf 878997836999--------868--8849999998079968998723656544679999999999997399986887547 Q ss_pred HHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 77499999860899884203316889998678---99968989982-998110043799998620211234332346785 Q gi|254781193|r 74 YFERTVLKACFNLDIPIHRWICTLVLARSNGL---PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNKPN 149 (675) Q Consensus 74 ~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~ 149 (675) +||+.+|.++ |+.++ +.++||||+||++++ +++|+.+++.+ +....+ .++++. + ..+..++ T Consensus 76 k~D~~~L~~~-gi~~~-~~~~DTmiaa~ll~~~~~~~~L~~La~~~l~~~~~~---~~~l~g---k-------g~~~~~~ 140 (193) T cd06139 76 KFDLHVLANH-GIELR-GPAFDTMLASYLLNPGRRRHGLDDLAERYLGHKTIS---FEDLVG---K-------GKKQITF 140 (193) T ss_pred HHHHHHHHHC-CCCCC-CCCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCC---HHHHHC---C-------CCCCCCH T ss_conf 9789999983-98667-830218999987279988889999999981887663---899837---7-------8553783 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCHHHHHHHHHHCCCCC Q ss_conf 85328999999899899999999630104000151336666531156213 Q gi|254781193|r 150 HREAWRLFGEYCKGDVTVTREIFKRLEPLSSKEQELWYLDQTINDRGYCV 199 (675) Q Consensus 150 ~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~~~E~~~~~~~~~me~~Gi~v 199 (675) ...+.+.+..||+.|+.+|++||+.|.+.+..+..+..+..++|++-++| T Consensus 141 ~~~~~~~~~~YAa~Da~~t~~L~~~l~~~L~~~~~l~~ly~~iE~PL~~V 190 (193) T cd06139 141 DQVPLEKAAEYAAEDADITLRLYELLKPKLKEEPGLLELYEEIEMPLIPV 190 (193) T ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH T ss_conf 13329999999999999999999999999984220999999984569998 No 22 >cd06128 DNA_polA_exo The 3'-5' exonuclease domain of family-A DNA polymerases has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-A DNA polymerases contain a DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-B DNA polymerases. The exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, which are clustered around the active site and contain four invariant acidic residues that serve as ligands for the two metal ions required for catalysis. The Klenow fragment (KF) of Escherichia coli pol I, the Thermus aquaticus (Taq) pol I, and Bacillus stearothermophilus (BF) pol I are examples of family-A DNA polymerases. They are involved in nucleotide excision repair and in the processing of Okazaki fragments that are generated during lagging strand synthesis. The N-terminal domains of BF pol I and Taq pol I resemble the fold of the 3'-5' exonuclease domain of KF Probab=99.93 E-value=7.6e-25 Score=179.22 Aligned_cols=151 Identities=18% Similarity=0.189 Sum_probs=115.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCC----------HHHHHHHHHHHCCCCCEEEEE Q ss_conf 6899981417898700044001127981999999984803999978881----------043799999863889819997 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHD----------PVMPQVLKEALENTDVTIVAH 71 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~----------~~~~~~l~~~L~~~~~~kV~H 71 (675) ..+|||+||+|... .+.+|+|||+++..+..+|+|+.|. +.+.+.|+++|+|+++.|||| T Consensus 2 ~~~a~DtETt~ld~----------~~a~iVGis~s~~~~~a~YiP~~h~~~~~~~~l~~~~vl~~L~p~led~~i~Kigh 71 (166) T cd06128 2 PVFAFDTETDSLDN----------ISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRERALELLKPLLEDEKALKVGQ 71 (166) T ss_pred CEEEEECCCCCCCC----------CCCEEEEEEEECCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 98999788799890----------43549999998008969999526776675257789999999999973999764531 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 3077499999860899884203316889998678---99968989982-9981100437999986202112343323467 Q gi|254781193|r 72 NSYFERTVLKACFNLDIPIHRWICTLVLARSNGL---PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNK 147 (675) Q Consensus 72 Na~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~ 147 (675) |+|||+.+|.++ |+.+. +..+||||+||++++ .++|+.++..+ +.. ....++++. | ..... T Consensus 72 NlK~D~~vL~~~-Gi~l~-g~~fDTMlaaylldp~~~~h~Ld~La~~~L~~~---~i~~~dl~G---k-------Gk~q~ 136 (166) T cd06128 72 NLKYDRGILANY-GIELR-GIAFDTMLESYILNSVAGRHDMDSLAERWLKHK---TITFEEIAG---K-------GKNQL 136 (166) T ss_pred CHHHHHHHHHHC-CCCCC-CCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCC---CEEHHHHHC---C-------CCCCC T ss_conf 156679999619-96689-998457679876289998899699999983998---705999837---8-------85758 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 858532899999989989999999963010 Q gi|254781193|r 148 PNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 148 ~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) .+...+.+....||++|+++|++||++|.| T Consensus 137 ~f~~v~i~~~~~YA~eDAd~t~rL~~~l~P 166 (166) T cd06128 137 TFNQIALEEAGRYAAEDADVTLQLHLKMWP 166 (166) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 932289999999899999999999985389 No 23 >cd06140 DNA_polA_I_Bacillus_like_exo Bacillus stearothermophilus-like Polymerase I (pol I), a subgroup of the family-A DNA polymerases, contains an inactive DnaQ-like 3'-5' exonuclease domain in the same polypeptide chain as the polymerase region. The exonuclease-like domain of these proteins possess the same fold as the Klenow fragment (KF) of Escherichia coli pol I, but does not contain the four critical metal-binding residues necessary for activity. The function of this domain is unknown. It might act as a spacer between the polymerase and the 5'-3' exonuclease domains. Some members of this subgroup, such as those from Bacillus sphaericus and Thermus aquaticus, are thermostable DNA polymerases. Probab=99.85 E-value=7.5e-20 Score=146.93 Aligned_cols=168 Identities=14% Similarity=0.016 Sum_probs=125.3 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 68999814178987000440011279819999999848039999788810437999998638898199973077499999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLK 81 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~ 81 (675) +++++|+||.+.. + -+.+|+||+++.+ +.++|+|..+.......++++|+|+++.||+||+|||+.+|. T Consensus 4 ~~~av~~e~~~~~--------~--~~~~i~gi~l~~~-~~~~yi~~~~~~~~~~~lk~~l~d~~i~Ki~hn~K~~~~~L~ 72 (178) T cd06140 4 DEVALYVELLGEN--------Y--HTADIIGLALANG-GGAYYIPLELALLDLAALKEWLEDEKIPKVGHDAKRAYVALK 72 (178) T ss_pred CCEEEEEECCCCC--------C--CCCEEEEEEEEEC-CEEEEEECCCCHHHHHHHHHHHCCCCCCEEEECHHHHHHHHH T ss_conf 8669997379998--------4--5354999999889-979999566736469999999729999889865799999999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCCC---CCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 8608998842033168899986789---9968989982-99811004379999862021123433234678585328999 Q gi|254781193|r 82 ACFNLDIPIHRWICTLVLARSNGLP---SALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLF 157 (675) Q Consensus 82 ~~~Gi~i~~~~~~DTmi~A~~l~~~---~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~ 157 (675) ++ |++++ +.++||||+||++++. ++|+.++..+ +...... .. ..++. ..+...+.+.. T Consensus 73 ~~-Gi~l~-g~~fDtmiaayLl~p~~~~~~L~~L~~~yl~~~~~~~---~~---~~gk~----------~~~~~~~~~~~ 134 (178) T cd06140 73 RH-GIELA-GVAFDTMLAAYLLDPTRSSYDLADLAKRYLGRELPSD---EE---VYGKG----------AKFAVPDEEVL 134 (178) T ss_pred HC-CCCCC-CCHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCH---HH---HHCCC----------CCCCCCCHHHH T ss_conf 88-98068-8303688898875489777999999999828666547---88---61766----------56243997999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHCCCCHHHHHHHHHHCCCCC Q ss_conf 999899899999999630104000151336666531156213 Q gi|254781193|r 158 GEYCKGDVTVTREIFKRLEPLSSKEQELWYLDQTINDRGYCV 199 (675) Q Consensus 158 ~~Ya~~Dv~~t~~L~~~l~~~~~~E~~~~~~~~~me~~Gi~v 199 (675) ..||+.|+.++++||..|.+.+.+ ..+..+..++|++=++| T Consensus 135 ~~ya~~~a~~~~~L~~~l~~~L~e-~~l~~Ly~~iE~PL~~V 175 (178) T cd06140 135 AEHLARKAAAILRLAPKLEEELEE-NEQLELYYEVELPLAEV 175 (178) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999998-24899999987129998 No 24 >cd00007 35EXOc 3'-5' exonuclease. The 35EXOc domain is responsible for the 3'-5' exonuclease proofreading activity of prokaryotic DNA polymerase I (pol I) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in E. coli pol I. 35EXOc is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Probab=99.83 E-value=2.4e-19 Score=143.63 Aligned_cols=149 Identities=17% Similarity=0.170 Sum_probs=107.6 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCC-HHHHHHHHHHHCCCCCEEEEEEHHHHHHHH Q ss_conf 6899981417898700044001127981999999984803999978881-043799999863889819997307749999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHD-PVMPQVLKEALENTDVTIVAHNSYFERTVL 80 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~-~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L 80 (675) |.|++|+||++.. . ..++|.+|++++++ +.+|++..+. ...++.++++|+++++.||+||++||+.+| T Consensus 1 ~~ia~D~Et~~l~--------~--~~~~i~~iql~~~~-~~~yi~~~~~~~~~~~~l~~~l~~~~i~Kv~hn~k~D~~~L 69 (155) T cd00007 1 KEVAFDTETTGLN--------Y--HRGKLVGIQIATAG-EAAYIPDELELEEDLEALKELLEDEDITKVGHDAKFDLVVL 69 (155) T ss_pred CEEEEECCCCCCC--------C--CCCCEEEEEEEECC-CEEEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHH T ss_conf 9899989879997--------7--88809999999899-35999814755246999999982999738987340009999 Q ss_pred HHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 9860899884203316889998678---99968989982-9981100437999986202112343323467858532899 Q gi|254781193|r 81 KACFNLDIPIHRWICTLVLARSNGL---PSALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRL 156 (675) Q Consensus 81 ~~~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~ 156 (675) .++ |+.++ +.++|||++|+++++ ++||+.+++.+ +....+... ...+ ... .......+. T Consensus 70 ~~~-~~~~~-~~~~Dt~iaa~ll~~~~~~~~L~~l~~~~l~~~~~k~~~---~~~~----------~~~--~~~~~l~~~ 132 (155) T cd00007 70 ARD-GIELP-GNIFDTMLAAYLLNPGEGSHSLDDLAKEYLGIELDKDEQ---IYGK----------GAK--TFARPLSEE 132 (155) T ss_pred HHC-CCCCC-CCEEHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCHHHH---HCCC----------CCC--CCCCCCHHH T ss_conf 862-86013-501149999998477855236999999983987674888---6177----------654--444799899 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999899899999999630104 Q gi|254781193|r 157 FGEYCKGDVTVTREIFKRLEPL 178 (675) Q Consensus 157 ~~~Ya~~Dv~~t~~L~~~l~~~ 178 (675) +..||+.||.++++||+.|.+. T Consensus 133 ~~~YAa~D~~~~~~l~~~l~~~ 154 (155) T cd00007 133 LLEYAAEDADALLRLYEKLLEE 154 (155) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999965 No 25 >cd06142 RNaseD_exo Ribonuclease (RNase) D is a bacterial enzyme involved in the maturation of small stable RNAs and the 3' maturation of tRNA. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. In vivo, RNase D only becomes essential upon removal of other ribonucleases. Eukaryotic RNase D homologs include yeast Rrp6p, human PM/Scl-100 and the Drosophila melanogaster egalitarian protein. Probab=99.80 E-value=3.1e-18 Score=136.44 Aligned_cols=151 Identities=22% Similarity=0.153 Sum_probs=112.2 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 68999814178987000440011279819999999848039999788810437999998638898199973077499999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLK 81 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~ 81 (675) +.|++|+||.+. +.| .+++.+++++.++ .++++|+..... ...+.++|++++++|||||++||+.+|. T Consensus 13 ~~i~~DtE~~~~--------~~~--~~~l~LiQia~~~-~~ylid~~~~~~-~~~L~~lL~d~~i~KVgh~~~~Dl~~L~ 80 (178) T cd06142 13 GVIAVDTEFMRL--------RTY--YPRLCLIQISTGG-QAYLIDPLAIGD-LSPLKELLADPNIVKVFHAAREDLELLK 80 (178) T ss_pred CEEEEECCCCCC--------CCC--CCEEEEEEEEECC-CEEEEECCCCCC-HHHHHHHHCCCCCEEEEEEHHHHHHHHH T ss_conf 948998986788--------558--8728999998689-069995312344-6999999759994399962799999999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHH-HCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 860899884203316889998678--999689899-82998110043799998620211234332346785853289999 Q gi|254781193|r 82 ACFNLDIPIHRWICTLVLARSNGL--PSALKAVGE-ALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFG 158 (675) Q Consensus 82 ~~~Gi~i~~~~~~DTmi~A~~l~~--~~sL~~la~-~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) +++|+. +.+++|||+++++++. ++||+.++. .+|+..+|.... .+|...|. .+.+. T Consensus 81 ~~~~~~--~~~v~Dt~ia~~~l~~~~~~~L~~L~~~~lg~~l~K~~q~------------------s~W~~rPL-s~~qi 139 (178) T cd06142 81 RDFGIL--PVNLFDTQIAARLLGLGDSVGLAALVEELLGVELDKGEQR------------------SDWLKRPL-TDEQL 139 (178) T ss_pred HHHCCC--CCCEEEHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCC------------------CCCCCCCC-CHHHH T ss_conf 973967--6545308999998288673789999999929877777364------------------66433678-28999 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHCCCCH Q ss_conf 998998999999996301040001513 Q gi|254781193|r 159 EYCKGDVTVTREIFKRLEPLSSKEQEL 185 (675) Q Consensus 159 ~Ya~~Dv~~t~~L~~~l~~~~~~E~~~ 185 (675) .||+.||.++++||..|.+.+..+... T Consensus 140 ~YAa~Da~~l~~l~~~l~~~L~~~~~~ 166 (178) T cd06142 140 EYAANDVRYLLPLYEKLKEELEEEGRL 166 (178) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCHH T ss_conf 999997999999999999999986839 No 26 >pfam01612 3_5_exonuc 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in E. coli polI it is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Werner syndrome is a human genetic disorder causing premature aging; the WRN protein has helicase activity in the 3'-5' direction. The FFA-1 protein is required for formation of a replication foci and also has helicase activity; it is a homologue of the WRN protein. RNase D is a 3'-5' exonuclease involved in tRNA processing. Also found in this family is the autoantigen PM/Scl thought to be involved in polymyositis-scleroderma overlap syndrome. Probab=99.80 E-value=2.4e-18 Score=137.15 Aligned_cols=148 Identities=20% Similarity=0.198 Sum_probs=105.4 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 68999814178987000440011279819999999848039999788810437999998638898199973077499999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLK 81 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~ 81 (675) +.|++|+||.+. .+......+.+++++.++ ..++++.......++.+.++|+++.+.|||||++||+.+|. T Consensus 21 ~~ia~DtEt~~~--------~~~~~~~~~~llq~~~~~-~~yli~~~~~~~~~~~L~~ll~~~~i~Kvghn~k~D~~~L~ 91 (172) T pfam01612 21 KVVAVDTETTSL--------DTYSYTLVGCLIQISTGE-GAYIIDPLALGDDLEGLKRLLEDPNITKVGHNAKFDLEVLA 91 (172) T ss_pred CCEEEEEECCCC--------CCCCCCCCEEEEEEECCC-CCEEEECCCCHHHHHHHHHHHCCCCCCEEEEEHHHHHHHHH T ss_conf 908998062898--------867613433699996798-60776434756679999999759998799975899999998 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHHH-CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 860899884203316889998678--9996898998-2998110043799998620211234332346785853289999 Q gi|254781193|r 82 ACFNLDIPIHRWICTLVLARSNGL--PSALKAVGEA-LKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFG 158 (675) Q Consensus 82 ~~~Gi~i~~~~~~DTmi~A~~l~~--~~sL~~la~~-l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) +++|+.+ .+++|||++++++++ ++||+.+++. +++...|.... .+|...| ..+.+. T Consensus 92 ~~~~~~~--~~~~Dt~~a~~~l~~~~~~gL~~L~~~~lg~~~~K~~~~------------------s~w~~rp-L~~~qi 150 (172) T pfam01612 92 RDFGIKL--ENLFDTMLAAYLLGYPRSHSLDDLAEKYLGVELDKSEQC------------------ADWRARP-LSEEQL 150 (172) T ss_pred HHCCCCC--CCCHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCHHHEE------------------CCCCCCC-CCHHHH T ss_conf 8547666--873278999988199643569999999809878744156------------------6788789-829999 Q ss_pred HHHHHHHHHHHHHHHHCCCCH Q ss_conf 998998999999996301040 Q gi|254781193|r 159 EYCKGDVTVTREIFKRLEPLS 179 (675) Q Consensus 159 ~Ya~~Dv~~t~~L~~~l~~~~ 179 (675) .||+.||.++++||++|.+.+ T Consensus 151 ~YAa~Da~~~~~l~~~l~~~l 171 (172) T pfam01612 151 RYAAEDADYLLRLYDKLRKEL 171 (172) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999974 No 27 >KOG3657 consensus Probab=99.79 E-value=1.3e-16 Score=125.85 Aligned_cols=248 Identities=17% Similarity=0.128 Sum_probs=125.7 Q ss_pred CCCHHHHHEEEECCCCCEEEEEECCCCCHHHHHHHCCC-----CCHH--------HHHHHCCCCHHHHHHHHCCCCHHHH Q ss_conf 21003333012668987899983453723333231146-----2011--------3454113310101476448795670 Q gi|254781193|r 358 QIISDCVRSCVIASEGKKLVVADLAGIEARVLAWMAGE-----QWKL--------KAFETGEDLYVTTYARSFNVAPAQV 424 (675) Q Consensus 358 ~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d-----~~l~--------~af~~g~D~h~~tA~~~~g~~~~~~ 424 (675) ..++..++++..|||||.||+||--..|+-+.|-|+.- .+.. ..-.+|-|+|+.||+.+ T Consensus 717 ~Rigselkamvqappgy~LVgaDvdsqElwiaa~lgda~~~~~hg~ta~gwM~Lag~ks~gtdlhs~ta~~l-------- 788 (1075) T KOG3657 717 DRIGSELKAMVQAPPGYRLVGADVDSQELWIAALLGDASAEGVHGKTAFGWMTLAGSKSDGTDLHSKTASQL-------- 788 (1075) T ss_pred HHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHC-------- T ss_conf 455078897610788616850563479999999963154412247522102111376656730767666650-------- Q ss_pred CCHHHHHCCCCEEEEECCCCHHHHHHHHHHCC--CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 62011101652021002888888998798708--8831134556555455556788889999887643100010220788 Q gi|254781193|r 425 TKDQRAIGKVMELALGYQGGAKAFQKMASTTG--LNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTA 502 (675) Q Consensus 425 ~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 502 (675) +..|..||++|+|-+||.|..=-.....+.+ +..++++..........+-..++ ....-.+.. +....-.....+ T Consensus 789 -giSr~hakv~Ny~riygag~~fa~~ll~~fnp~l~~~Ea~~~A~~l~~~tkG~~~~-rlk~e~~~~-e~~~~~~~~~~~ 865 (1075) T KOG3657 789 -GISRNHAKVFNYARIYGAGQTFAEKLLMRFNPSLTQSEAKSKASQLFKLTKGDRAK-RLKVEVRMV-ENSLVCKILTID 865 (1075) T ss_pred -CCCHHHHHHCCHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCEEE-EHHHHHHHH-HHHHHCHHHHHH T ss_conf -52276665404787745886167776886189986678899999998862575110-057677776-653200045430 Q ss_pred HHHHHHHHHHHCHHHHHHH---------HHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECC-CCCCCCCCC Q ss_conf 8999999998788789999---------999999999975491796215421133222344564023247-663101678 Q gi|254781193|r 503 CERVKKAWRQKHSRVSELW---------NELHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLP-SGRSLVYRD 572 (675) Q Consensus 503 a~~~~~~~r~~~p~i~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p-~GR~~~~~~ 572 (675) .--++..|...--....+| +.+|..|.... . ++| .|-.+.+. T Consensus 866 ~~~~~~~~~~~~~~~~~~w~gg~es~~fn~lesia~~~~--p-------------------------rtpvlgc~is~s- 917 (1075) T KOG3657 866 GIYLIYSIYENEVEPRRLWVGGTESSMFNKLESIATAHD--P-------------------------RTPVLGCQISRS- 917 (1075) T ss_pred HHHHHHHHHHCCCCHHHHCCCCHHHHHHHHHHHHHHCCC--C-------------------------CCCEECCCHHHH- T ss_conf 155543323054310453057518888888887764068--8-------------------------887102413564- Q ss_pred CCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHCCCCEEEEECCEEEEECCCCHH--HHH Q ss_conf 662102101002345677753145655453688999999999999------967981889952589997792489--999 Q gi|254781193|r 573 VGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMMNA------TKAGYDIVLTVHDEIVCETPDTDE--FNA 644 (675) Q Consensus 573 ~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l------~~~g~~ivl~VHDEiv~evp~~~~--~~~ 644 (675) ..+..++...+-|.-+|-.||.+|-|-+-+-++.+ ++...+.++-||||+=+-|.|+.. .+. T Consensus 918 ----------l~~~~~~~~~f~~srinwvvqssavd~lhlllvsm~wl~~~y~i~~rfcisihdevrylv~e~d~~raal 987 (1075) T KOG3657 918 ----------LEKLPEGEPKFLPSRINWVVQSSAVDFLHLLLVSMQWLCDTYKIDARFCISIHDEVRYLVKEEDAPRAAL 987 (1075) T ss_pred ----------HCCCCCCCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEHHHHHHHHCCCCCHHHHH T ss_conf ----------4336667754444323577633406599999999999986415442799874176788862664277899 Q ss_pred -HHHHHHHCCC Q ss_conf -9999984565 Q gi|254781193|r 645 -SMLYHLMTSN 654 (675) Q Consensus 645 -~~i~~~M~~~ 654 (675) -.|..+|+++ T Consensus 988 alqisnl~tr~ 998 (1075) T KOG3657 988 ALQISNLLTRA 998 (1075) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 28 >smart00474 35EXOc 3'-5' exonuclease. 3\' -5' exonuclease proofreading domain present in DNA polymerase I, Werner syndrome helicase, RNase D and other enzymes Probab=99.78 E-value=1.4e-17 Score=132.19 Aligned_cols=146 Identities=18% Similarity=0.184 Sum_probs=103.6 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 68999814178987000440011279819999999848039999788810437999998638898199973077499999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLK 81 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~ 81 (675) +.|++|+||.+ +..| ..+++++++++.++..++++.......+..++++|+++.++||+||++||+.+|. T Consensus 22 ~~i~~DtE~~~--------~~~~--~~~l~~iql~~~~~~~yi~~~~~~~~~~~~l~~ll~~~~i~kv~hn~k~D~~~L~ 91 (172) T smart00474 22 GEVALDTETTG--------LNSY--SGKLVLIQISVTGEGAFIIDPLALGDDLEILKDLLEDETITKVGHNAKFDLHVLA 91 (172) T ss_pred CEEEEECCCCC--------CCCC--CCEEEEEEEEECCCEEEEEECCCCHHHHHHHHHHHCCCCCCEEEECHHHHHHHHH T ss_conf 95999883068--------8537--8758999999689848997012242179999999809998599974599999999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCCC---CCHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 8608998842033168899986789---9968989982-99811004379999862021123433234678585328999 Q gi|254781193|r 82 ACFNLDIPIHRWICTLVLARSNGLP---SALKAVGEAL-KLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLF 157 (675) Q Consensus 82 ~~~Gi~i~~~~~~DTmi~A~~l~~~---~sL~~la~~l-~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~ 157 (675) + +|+.++ +++|||+++++++++ +||+.++..+ +....|. . ...++...| ..+.+ T Consensus 92 ~-~gi~l~--~v~Dt~ia~~l~~~~~~~~~L~~l~~~~~~~~l~k~-------------~-----~~~~w~~rp-l~~~~ 149 (172) T smart00474 92 R-FGIELE--NIFDTMLAAYLLLGGPSKHGLATLLKEYLGVELDKE-------------E-----QKSDWGARP-LSEEQ 149 (172) T ss_pred H-CCCCCC--CCHHHHHHHHHCCCCCCHHHHHHHHHHHCCEECCCC-------------C-----CCCCCCCCC-CCHHH T ss_conf 7-798567--524679987750579864899999999749231555-------------5-----646667899-90999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCH Q ss_conf 9998998999999996301040 Q gi|254781193|r 158 GEYCKGDVTVTREIFKRLEPLS 179 (675) Q Consensus 158 ~~Ya~~Dv~~t~~L~~~l~~~~ 179 (675) ..||+.||.++++||+.|.+.+ T Consensus 150 ~~YAa~Da~~~~~l~~~l~~~L 171 (172) T smart00474 150 LQYAAEDADALLRLYEKLEKEL 171 (172) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999972 No 29 >cd06147 Rrp6p_like_exo Yeast Rrp6p and its human homolog, the Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100), are exosome-associated proteins involved in the degradation and processing of precursors to stable RNAs. Both proteins contain a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PM/Scl-100, an autoantigen present in the nucleolar compartment of the cell, reacts with autoantibodies produced by about 50% of patients with polymyositis-scleroderma overlap syndrome. Probab=99.68 E-value=2.9e-15 Score=117.21 Aligned_cols=154 Identities=16% Similarity=0.080 Sum_probs=111.1 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 68999814178987000440011279819999999848039999788810437999998638898199973077499999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLK 81 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~ 81 (675) +.|++|+|+.+. ..| .+++.++++++.+ .++.+|+.........+.++|++++++||+||++||+.+|. T Consensus 25 ~~ialDtE~~~~--------~~~--~~~l~LiQiat~~-~~yliD~l~l~~~~~~L~~ll~~~~I~KV~H~~~~Dl~~L~ 93 (192) T cd06147 25 KEIAVDLEHHSY--------RSY--LGFTCLMQISTRE-EDYIVDTLKLRDDMHILNEVFTDPNILKVFHGADSDIIWLQ 93 (192) T ss_pred CEEEEECCCCCC--------CCC--CCEEEEEEEECCC-CCEEEEECCCCCCHHHHHHHHCCCCEEEEEECHHHHHHHHH T ss_conf 969997973788--------569--9827999995099-88798743430037999999759883799855787799999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCC-CCCHHHHHH-HCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 860899884203316889998678-999689899-829981100437999986202112343323467858532899999 Q gi|254781193|r 82 ACFNLDIPIHRWICTLVLARSNGL-PSALKAVGE-ALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGE 159 (675) Q Consensus 82 ~~~Gi~i~~~~~~DTmi~A~~l~~-~~sL~~la~-~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~ 159 (675) +.+|+.+ .+++||++++++++. +.||..+.+ .+|+..+|.... .+|...|. .+.++. T Consensus 94 ~~~gi~~--~n~fDT~iaa~~l~~~~~sL~~Lv~~~lg~~l~K~~q~------------------SdW~~RPL-s~~Ql~ 152 (192) T cd06147 94 RDFGLYV--VNLFDTGQAARVLNLPRHSLAYLLQKYCNVDADKKYQL------------------ADWRIRPL-PEEMIK 152 (192) T ss_pred HHHCCCC--CCCHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCEE------------------CCCCCCCC-CHHHHH T ss_conf 9858987--66042899999848881119999999939588732311------------------43799989-999999 Q ss_pred HHHHHHHHHHHHHHHCCCCH-HCCCCHHH Q ss_conf 98998999999996301040-00151336 Q gi|254781193|r 160 YCKGDVTVTREIFKRLEPLS-SKEQELWY 187 (675) Q Consensus 160 Ya~~Dv~~t~~L~~~l~~~~-~~E~~~~~ 187 (675) ||+.||...+.||.+|...+ +.+..+.. T Consensus 153 YAA~Dv~~Ll~ly~~L~~~L~e~~~~~~~ 181 (192) T cd06147 153 YAREDTHYLLYIYDRLRNELLERANALAP 181 (192) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCHH T ss_conf 99999999999999999999984286416 No 30 >cd06148 Egl_like_exo The Egalitarian (Egl) protein subfamily is composed of Drosophila Egl and similar proteins. Egl is a component of an mRNA-binding complex which is required for oocyte specification. Egl contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The conservation of this subfamily throughout eukaryotes suggests that its members may be part of ancient RNA processing complexes that are likely to participate in the regulated processing of specific mRNAs. Probab=99.67 E-value=5.4e-15 Score=115.45 Aligned_cols=157 Identities=15% Similarity=0.069 Sum_probs=111.4 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHH--HHHHHHHHHCCCCCEEEEEEHHHHHHH Q ss_conf 689998141789870004400112798199999998480399997888104--379999986388981999730774999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPV--MPQVLKEALENTDVTIVAHNSYFERTV 79 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~--~~~~l~~~L~~~~~~kV~HNa~FD~~~ 79 (675) +.|++|+|+.. +.. .+++.++++++++++++.+|+..... ....++.+|++++++||+|++.+|..+ T Consensus 11 ~vi~iD~E~~~--------~~~---~g~l~LiQiat~~~~vyl~D~~~~~~~~~~~~Lk~iLe~~~I~KV~H~~r~D~~~ 79 (197) T cd06148 11 KVIGLDCEGVN--------LGR---KGKLCLVQIATRTGQIYLFDILKLGSIVFINGLKDILESKKILKVIHDCRRDSDA 79 (197) T ss_pred CEEEEECCCCC--------CCC---CCCEEEEEEEECCCCEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECHHHHHHH T ss_conf 99999888678--------898---9988799996489988999616676344568999995699823999610999999 Q ss_pred HHHHCCCCCCCCCEEEHHHHHHHHCC----------CCCHHHHHHH-CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 99860899884203316889998678----------9996898998-299811004379999862021123433234678 Q gi|254781193|r 80 LKACFNLDIPIHRWICTLVLARSNGL----------PSALKAVGEA-LKLSSQKMEEGKELIARFCKGETHECLYDRNKP 148 (675) Q Consensus 80 L~~~~Gi~i~~~~~~DTmi~A~~l~~----------~~sL~~la~~-l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~ 148 (675) |.+++|+.+ .+++||++++.++.. ..||..+... ++.+..+....+.... ....+|. T Consensus 80 L~~~~gi~~--~nvfDTqiA~~ll~~~~~~~~~~~~~~sl~~l~~~~l~~~~~~~~~~~~~~~----------~~~s~W~ 147 (197) T cd06148 80 LYHQYGIKL--NNVFDTQVADALLQEQETGGFNPDRVISLVQLLDKYLYISISLKEDVKKLMR----------EDPKFWA 147 (197) T ss_pred HHHHHCCCC--CCCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHH----------CCCCCCC T ss_conf 999709970--6703199999997531314767756415999999995764017787788875----------0655446 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCC Q ss_conf 5853289999998998999999996301040001 Q gi|254781193|r 149 NHREAWRLFGEYCKGDVTVTREIFKRLEPLSSKE 182 (675) Q Consensus 149 ~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~~~E 182 (675) ..|. .+.+..||+.||.....||..|...+..+ T Consensus 148 ~RPL-t~~ql~YAa~DV~yL~~L~~~L~~~Li~~ 180 (197) T cd06148 148 LRPL-TEDMIRYAALDVLCLLPLYYAMLDALISK 180 (197) T ss_pred CCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7999-99999999998999999999999999999 No 31 >COG0349 Rnd Ribonuclease D [Translation, ribosomal structure and biogenesis] Probab=99.67 E-value=1.1e-15 Score=119.98 Aligned_cols=159 Identities=19% Similarity=0.178 Sum_probs=117.6 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 68999814178987000440011279819999999848039999788810437999998638898199973077499999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLK 81 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~ 81 (675) +.|+||+||-+. ..| .++..+|+++-+++ +..+|...+..-...+..+|.|++++||+|++.||+.+|. T Consensus 18 ~~iAIDTEf~r~--------~t~--~p~LcLIQi~~~e~-~~lIdp~~~~~d~~~l~~Ll~d~~v~KIfHaa~~DL~~l~ 86 (361) T COG0349 18 KAIAIDTEFMRL--------RTY--YPRLCLIQISDGEG-ASLIDPLAGILDLPPLVALLADPNVVKIFHAARFDLEVLL 86 (361) T ss_pred CEEEEECCCCCC--------CCC--CCCEEEEEEECCCC-CEEECCCCCCCCCCHHHHHHCCCCEEEEECCCCCCHHHHH T ss_conf 807983222466--------436--88337999844888-6686226554666369998628760255025635199999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCCC--CCHHHHHHH-CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 8608998842033168899986789--996898998-2998110043799998620211234332346785853289999 Q gi|254781193|r 82 ACFNLDIPIHRWICTLVLARSNGLP--SALKAVGEA-LKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFG 158 (675) Q Consensus 82 ~~~Gi~i~~~~~~DTmi~A~~l~~~--~sL~~la~~-l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) +.+|.. +..+|||.|+|++.+.+ +||+++.+. +++..+|.....++ ...|. .+.+. T Consensus 87 ~~~g~~--p~plfdTqiAa~l~g~~~~~gl~~Lv~~ll~v~ldK~~q~SDW------------------~~RPL-s~~Ql 145 (361) T COG0349 87 NLFGLL--PTPLFDTQIAAKLAGFGTSHGLADLVEELLGVELDKSEQRSDW------------------LARPL-SEAQL 145 (361) T ss_pred HHCCCC--CCCHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCCCCCC------------------CCCCC-CHHHH T ss_conf 861888--7734689999998098632329999999968853644454655------------------55799-99999 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHCCCCHHHHHHHH Q ss_conf 9989989999999963010400015133666653 Q gi|254781193|r 159 EYCKGDVTVTREIFKRLEPLSSKEQELWYLDQTI 192 (675) Q Consensus 159 ~Ya~~Dv~~t~~L~~~l~~~~~~E~~~~~~~~~m 192 (675) +||+.||...+.||.+|...+..+........++ T Consensus 146 ~YAa~DV~yL~~l~~~L~~~L~~~~r~~~a~~ef 179 (361) T COG0349 146 EYAAADVEYLLPLYDKLTEELAREGRLEWAEDEF 179 (361) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 9999999999999999999998748549999999 No 32 >cd06129 RNaseD_like The RNase D-like group is composed of RNase D, WRN, and similar proteins. They contain a DEDDy-type, DnaQ-like, 3'-5' exonuclease domain that contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. RNase D is involved in the 3'-end processing of tRNA precursors. RNase D-like proteins in eukaryotes include yeast Rrp6p, human PM/Scl-100 and Drosophila melanogaster egalitarian (Egl) protein. WRN is a unique DNA helicase possessing exonuclease activity. Mutation in the WRN gene is implicated in Werner syndrome, a disease associated with premature aging and increased predisposition to cancer. Yeast Rrp6p and the human Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100) are exosome-associated proteins involved in the degradation and processing of prec Probab=99.55 E-value=6.3e-13 Score=102.09 Aligned_cols=138 Identities=21% Similarity=0.130 Sum_probs=102.1 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHH Q ss_conf 89998141789870004400112798199999998480399997888104379999986388981999730774999998 Q gi|254781193|r 3 TLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLKA 82 (675) Q Consensus 3 ~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~ 82 (675) .|++|+||.+. .. ..+++..++++.++ .++.+|+..-+..++.++++|++++++|||||+.+|...|.+ T Consensus 17 vig~DtE~~~~--------~~--~~~~v~LiQlat~~-~~~lid~~~~~~~~~~L~~lL~~~~i~Kvgh~~~~D~~~L~~ 85 (159) T cd06129 17 VVALDCEWIPL--------RL--YYGEVTLIQIGTTE-HAFLFDVLSLSAFDGGLKTVLENPSITKVGHGCRGDLAKLLR 85 (159) T ss_pred EEEEECCCCCC--------CC--CCCCEEEEEEEECC-EEEEEECCCCCCCHHHHHHHHHCCCCEEEEECHHHHHHHHHH T ss_conf 89997976898--------79--98978999994599-689995488876779999998299924999566999999967 Q ss_pred HCCCCCCCCCEEEHHHHHHHHCCC---CCHHHHHHH-CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 608998842033168899986789---996898998-2998110043799998620211234332346785853289999 Q gi|254781193|r 83 CFNLDIPIHRWICTLVLARSNGLP---SALKAVGEA-LKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFG 158 (675) Q Consensus 83 ~~Gi~i~~~~~~DTmi~A~~l~~~---~sL~~la~~-l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) .+|+.+ .+++||++++..++.+ .||..+++. +|+..+|... ..+|...|. .+.++ T Consensus 86 ~~g~~~--~~~~Dt~~~~~~l~~~~~~~sL~~l~~~~lg~~ldK~~q------------------~SnW~~rpL-t~~Qi 144 (159) T cd06129 86 DFGILL--RNLFDTQAAHAILGYPEEYWSLNSLVEQYLGPCLDKSRQ------------------LSDWAKRPL-TEEQM 144 (159) T ss_pred HCCCCC--CCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCC------------------CCCCCCCCC-CHHHH T ss_conf 508432--430369999999588843165999999980878771024------------------588899999-99999 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99899899999999 Q gi|254781193|r 159 EYCKGDVTVTREIF 172 (675) Q Consensus 159 ~Ya~~Dv~~t~~L~ 172 (675) .||+.||.+.+.|| T Consensus 145 ~YAA~Da~~ll~ly 158 (159) T cd06129 145 LYAAADVYVLLHIY 158 (159) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999976 No 33 >cd06141 WRN_exo WRN is a unique RecQ DNA helicase exhibiting an exonuclease activity. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Mutations in the WRN gene cause Werner syndrome, an autosomal recessive disorder associated with premature aging and increased susceptibility to cancer and type II diabetes. WRN interacts with key proteins involved in DNA replication, recombination, and repair. It is believed to maintain genomic stability and life span by participating in DNA processes. WRN is stimulated by Ku70/80, an important regulator of genomic stability. Probab=99.54 E-value=9.8e-13 Score=100.86 Aligned_cols=146 Identities=20% Similarity=0.199 Sum_probs=106.2 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 68999814178987000440011279819999999848039999788810437999998638898199973077499999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLK 81 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~ 81 (675) +.|+||+|+..... + ...+.+.++++++++ .++.+++.+.+.+|..|.++|+++.++|||||+.+|+..|. T Consensus 19 ~vvG~D~Ew~~~~~--~------~~~~~~aliQlat~~-~~~l~d~~~~~~~~~~L~~lL~~~~i~Kvg~~~~~D~~~L~ 89 (168) T cd06141 19 KVVGFDLEWRPSFR--K------GKRNKVALLQLASES-RCLLFQLSHMDKLPPSLKQLLEDPSILKVGVGIKGDARKLR 89 (168) T ss_pred CEEEEECEECCCCC--C------CCCCCCEEEEEECCC-CEEEEEECCCCHHHHHHHHHHCCCCEEEEEEEEHHHHHHHH T ss_conf 88999664558777--7------888971599982499-65999740331204999999759980799963188999998 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHH-CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 860899884203316889998678---9996898998-299811004379999862021123433234678585328999 Q gi|254781193|r 82 ACFNLDIPIHRWICTLVLARSNGL---PSALKAVGEA-LKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLF 157 (675) Q Consensus 82 ~~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~-l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~ 157 (675) +.+|+.+ .+++||..+++.++. +.||..+++. +|....|.+. ....+|...|. .+.+ T Consensus 90 ~~~g~~~--~~~~Dl~~~a~~~~~~~~~~sL~~l~~~~lg~~l~K~k~----------------~q~SdW~~rpL-s~~Q 150 (168) T cd06141 90 RDFGIEV--RGVVDLSHLAKRVGPRRKLVSLARLVEEVLGLPLSKDKK----------------VRCSNWEARPL-SKEQ 150 (168) T ss_pred HHCCCCC--CCEEEHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCC----------------CCCCCCCCCCC-CHHH T ss_conf 7445024--876569999998577777632999999983962154665----------------20388899999-9999 Q ss_pred HHHHHHHHHHHHHHHHHC Q ss_conf 999899899999999630 Q gi|254781193|r 158 GEYCKGDVTVTREIFKRL 175 (675) Q Consensus 158 ~~Ya~~Dv~~t~~L~~~l 175 (675) +.||+.|+.+.+.||++| T Consensus 151 i~YAA~Da~~~~~l~~~L 168 (168) T cd06141 151 ILYAATDAYASLELYRKL 168 (168) T ss_pred HHHHHHHHHHHHHHHHHC T ss_conf 999999999999999739 No 34 >TIGR01388 rnd ribonuclease D; InterPro: IPR006292 These proteins represent ribonuclease D, a 3-exonuclease shown to act on tRNA both in vitro and when overexpressed in vivo. Members of this family are restricted to the Proteobacteria; Aquifex, Mycobacteria and eukaryotes. Ribonuclease D is not essential in Escherichia coli but is deleterious when overexpressed. The precise biological role of ribonuclease D is still unknown. ; GO: 0004540 ribonuclease activity, 0008033 tRNA processing, 0005737 cytoplasm. Probab=99.49 E-value=1.9e-12 Score=99.01 Aligned_cols=158 Identities=21% Similarity=0.191 Sum_probs=109.9 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEE-ECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 8999814178987000440011279819999999848039999-788810437999998638898199973077499999 Q gi|254781193|r 3 TLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELW-DCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLK 81 (675) Q Consensus 3 ~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~-d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~ 81 (675) +|++|||=- .+.+- =|..-+|+++.++. ...| |+..+ ...+-|..+|.|+.|+||+|-+.=|+-++. T Consensus 20 ~valDTEF~-----R~~TF-----wP~LgLiQ~a~g~~-~~LIDPL~~~-~~~~Pl~~lL~d~~v~KvlHAaseDlEvF~ 87 (374) T TIGR01388 20 FVALDTEFV-----RERTF-----WPQLGLIQVADGEQ-LALIDPLVID-IDLSPLLELLADESVVKVLHAASEDLEVFL 87 (374) T ss_pred EEECCCCEE-----CCCCC-----CHHHCCEECCCCCE-EEEECCCCCC-CCHHHHHHHHHCCCCEEEECCCCHHHHHHH T ss_conf 374155411-----26532-----02040031137875-6876588875-234799999738981585304504799999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCCCCCHHH---HHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 860899884203316889998678999689---89982998110043799998620211234332346785853289999 Q gi|254781193|r 82 ACFNLDIPIHRWICTLVLARSNGLPSALKA---VGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFG 158 (675) Q Consensus 82 ~~~Gi~i~~~~~~DTmi~A~~l~~~~sL~~---la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) +.+|. +| ..+|||+|+|.++|.+.|... +...+++..+|...-.+++. .|. .+.+. T Consensus 88 ~l~g~-lP-~PLfDTQi~Aa~~G~g~s~gY~~LV~~~~~v~LdKse~rtDW~~------------------RPL-td~Ql 146 (374) T TIGR01388 88 NLFGV-LP-KPLFDTQIAAAFLGFGESMGYDKLVQEVLGVELDKSETRTDWLA------------------RPL-TDKQL 146 (374) T ss_pred HHHCC-CC-CCCHHHHHHHHHHCCCCCHHHHHHHHHHHCEECCCCCCCCHHHC------------------CCC-CHHHH T ss_conf 87057-98-88305889999836785210899999973703178887222104------------------789-87789 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHCCCCHHHHHHHHH Q ss_conf 99899899999999630104000151336666531 Q gi|254781193|r 159 EYCKGDVTVTREIFKRLEPLSSKEQELWYLDQTIN 193 (675) Q Consensus 159 ~Ya~~Dv~~t~~L~~~l~~~~~~E~~~~~~~~~me 193 (675) +||+.||...+.||.+|...+..+.....+..+|+ T Consensus 147 ~YAaaDV~yL~~~y~~L~~~l~~~~R~~~l~ee~~ 181 (374) T TIGR01388 147 EYAAADVTYLLPLYEKLTERLAEEGRLAWLEEECE 181 (374) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999988645999972059 No 35 >PRK10829 ribonuclease D; Provisional Probab=99.43 E-value=1.6e-11 Score=92.94 Aligned_cols=147 Identities=18% Similarity=0.063 Sum_probs=103.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 68999814178987000440011279819999999848039999788810437999998638898199973077499999 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLK 81 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~ 81 (675) +.|++|+|-.+.. -| -|++.+|+++.+ +..+.||.-.... ...|.++|.|++++||+|-+..|+.+|. T Consensus 23 ~~iavDTEf~r~~--------ty--~p~l~LiQi~~~-~~~~lIDpl~~~d-l~~l~~ll~~~~i~kv~Haa~qDle~l~ 90 (373) T PRK10829 23 PAIALDTEFVRTR--------TY--YPQLGLIQLFDG-EQLALIDPLGITD-WSPFKALLRDPSITKFLHAGSEDLEVFL 90 (373) T ss_pred CCEEEECCCCCCC--------CC--CCEEEEEEEEEC-CCEEEEECCCCCC-HHHHHHHHCCCCCEEEEECCCCHHHHHH T ss_conf 9189857745677--------78--984589998518-9679994787777-6999999709884165206421599999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCCCC--CHHHHHH-HCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 86089988420331688999867899--9689899-82998110043799998620211234332346785853289999 Q gi|254781193|r 82 ACFNLDIPIHRWICTLVLARSNGLPS--ALKAVGE-ALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFG 158 (675) Q Consensus 82 ~~~Gi~i~~~~~~DTmi~A~~l~~~~--sL~~la~-~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) +.+|+.+ .++|||+|+|.+++.+. |+..+.+ .+++..+|.... -+|...|. .+.++ T Consensus 91 ~~~g~~p--~~iFDTQiAa~~lg~~~~~gy~~Lv~~~l~v~l~K~~~~------------------sDW~~RPL-s~~ql 149 (373) T PRK10829 91 NAFGELP--QPLIDTQILAAFCGRPLSWGFASMVEEYSGVALDKSESR------------------TDWLARPL-SERQC 149 (373) T ss_pred HHCCCCC--CHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCC------------------CCCCCCCC-CHHHH T ss_conf 9819887--407764889986578655589999999939665666462------------------75788999-98999 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHC Q ss_conf 99899899999999630104000 Q gi|254781193|r 159 EYCKGDVTVTREIFKRLEPLSSK 181 (675) Q Consensus 159 ~Ya~~Dv~~t~~L~~~l~~~~~~ 181 (675) .||+.||...+.||..|...+.. T Consensus 150 ~YA~~DV~yL~~l~~~L~~~l~~ 172 (373) T PRK10829 150 EYAAADVWYLLPIAAKLMVETEA 172 (373) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999983 No 36 >cd06146 mut-7_like_exo The mut-7 subfamily is composed of Caenorhabditis elegans mut-7 and similar proteins found in plants and metazoans. Mut-7 is implicated in posttranscriptional gene silencing. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs, termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=99.41 E-value=3.4e-11 Score=90.84 Aligned_cols=146 Identities=20% Similarity=0.123 Sum_probs=100.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCC-----CHHHHHHHHHHHCCCCCEEEEEEHHHH Q ss_conf 689998141789870004400112798199999998480399997888-----104379999986388981999730774 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVH-----DPVMPQVLKEALENTDVTIVAHNSYFE 76 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~-----~~~~~~~l~~~L~~~~~~kV~HNa~FD 76 (675) +.|+||+|+..... ....+++.++++|+++ .++.+|+.. .+.++..+..+|++++++||||++.+| T Consensus 23 ~vvg~D~Ew~~~~~--------~~~~~~vaLiQlat~~-~~~liD~~~~~~~~~~~~~~~L~~ll~~~~i~Kvg~~~~~D 93 (191) T cd06146 23 RVVGIDSEWKPSFL--------GDSDPRVAILQLATED-EVFLLDLLALENLESEDWDRLLKRLFEDPDVLKLGFGFKQD 93 (191) T ss_pred CEEEEECCCCCCCC--------CCCCCCEEEEEEECCC-CCEEEEEECCCCCCHHHHHHHHHHHHCCCCEEEEEECHHHH T ss_conf 88999787578777--------8888952799995299-75899970456543477899999994799968999443868 Q ss_pred HHHHHHHCCCC----CCCCCEEEHHHHHHHHC-------------CCCCHHHHHHH-CCCCCCHHHHHHHHHHHHCCCCC Q ss_conf 99999860899----88420331688999867-------------89996898998-29981100437999986202112 Q gi|254781193|r 77 RTVLKACFNLD----IPIHRWICTLVLARSNG-------------LPSALKAVGEA-LKLSSQKMEEGKELIARFCKGET 138 (675) Q Consensus 77 ~~~L~~~~Gi~----i~~~~~~DTmi~A~~l~-------------~~~sL~~la~~-l~l~~~k~~~~k~l~~~~~k~~~ 138 (675) +..|.+.+|+. ..+.+++||..+++.+. .+.||.++++. +|...+|... T Consensus 94 ~~~L~~~~~~~~~~~~~~~~~~Dl~~~a~~~~~~~~~~~~~~~~~~~~gL~~L~~~~lg~~l~K~~q------------- 160 (191) T cd06146 94 LKALSASYPALKCMFERVQNVLDLQNLAKELQKSDMGRLKGNLPSKTKGLADLVQAVLGKPLDKSEQ------------- 160 (191) T ss_pred HHHHHHHCCCCCCCCCCCCEEEEHHHHHHHHHCCCCCCHHHCCCCCCCCHHHHHHHHHCCCCCCCCC------------- T ss_conf 9999986586432333644035799999997311221011003556756999999990987885651------------- Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 3433234678585328999999899899999999630 Q gi|254781193|r 139 HECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRL 175 (675) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l 175 (675) ..+|...|. .+.++.||+.||.+.+.||++| T Consensus 161 -----~SnW~~rPL-t~~Qi~YAA~Da~~ll~iy~~L 191 (191) T cd06146 161 -----CSNWERRPL-REEQILYAALDAYCLLEVFDKL 191 (191) T ss_pred -----CCCCCCCCC-CHHHHHHHHHHHHHHHHHHHHC T ss_conf -----389999989-9999999999899999999739 No 37 >PRK06807 DNA polymerase III subunit epsilon; Validated Probab=99.30 E-value=1.2e-10 Score=87.33 Aligned_cols=87 Identities=21% Similarity=0.290 Sum_probs=63.7 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHH---HCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCCCHHHHHHHH Q ss_conf 999986388981999730774999998---60899884203316889998678---999689899829981100437999 Q gi|254781193|r 56 VLKEALENTDVTIVAHNSYFERTVLKA---CFNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSSQKMEEGKEL 129 (675) Q Consensus 56 ~l~~~L~~~~~~kV~HNa~FD~~~L~~---~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~~k~~~~k~l 129 (675) ++.+++. +.++||||+.||+.||.. .+|+..+...+.||+-+||.+-| .+.|+.+++.|+++.+ T Consensus 81 ~f~~Fi~--d~iLVAHNA~FD~~FL~~~L~r~gl~~~~n~~IDTL~LAR~l~P~L~NhKLnTVa~~LgI~l~-------- 150 (313) T PRK06807 81 LFLAFLH--TNVIVAHNASFDMRFLKSNVNMLGLPEPKNKVIDTVFLAKKYMKHAPNHKLETLKRMLGIRLS-------- 150 (313) T ss_pred HHHHHCC--CCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHHCCCCC-------- T ss_conf 9998738--998999553661999999999749988899877659999986478765667899998399834-------- Q ss_pred HHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 986202112343323467858532899999989989999999963010400 Q gi|254781193|r 130 IARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLSS 180 (675) Q Consensus 130 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~~ 180 (675) ..-|..|+.+|..++.++..... T Consensus 151 ----------------------------HHrAldDA~acAeI~~k~~~~~~ 173 (313) T PRK06807 151 ----------------------------SHNAFDDCITCAAVYQKCASIEE 173 (313) T ss_pred ----------------------------CCCCHHHHHHHHHHHHHHHHHHH T ss_conf ----------------------------36717789999999999999886 No 38 >PRK06631 consensus Probab=99.29 E-value=8.7e-11 Score=88.24 Aligned_cols=88 Identities=23% Similarity=0.243 Sum_probs=61.7 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHHC---CC-CC---CCCCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCHHHH Q ss_conf 999998638898199973077499999860---89-98---84203316889998678--99968989982998110043 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKACF---NL-DI---PIHRWICTLVLARSNGL--PSALKAVGEALKLSSQKMEE 125 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~---Gi-~i---~~~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k~~~ 125 (675) +++.+++. +..+|+||+.||+.||.+-+ |. .+ ...+++||+.+||-+-| ++||+++|++|+++...- T Consensus 78 ~~fl~Fi~--~s~LVaHNA~FD~~FLn~El~~l~~~~~~~~~~~~viDTL~lAR~~~Pg~~~sLdaLc~r~~I~~s~r-- 153 (229) T PRK06631 78 DDFLEFIS--DSKLIIHNAPFDIKFLNHELSLLKRTEIKLLELANTIDTLVMARSMFPGSKYNLDALCKRFKVDNSGR-- 153 (229) T ss_pred HHHHHHCC--CCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHCCCCCCCC-- T ss_conf 99999829--99799976387598999999982788888432011012699999868998889999999849999985-- Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 7999986202112343323467858532899999989989999999963010 Q gi|254781193|r 126 GKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 126 ~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) . ..-|..|+.++..+|-.|.. T Consensus 154 ----------------------~---------~H~AL~DA~llA~Vyl~l~g 174 (229) T PRK06631 154 ----------------------Q---------LHGALKDAALLAEVYVELTG 174 (229) T ss_pred ----------------------C---------CCCCHHHHHHHHHHHHHHHC T ss_conf ----------------------5---------56868999999999999717 No 39 >PRK08517 DNA polymerase III subunit epsilon; Provisional Probab=99.27 E-value=2.1e-10 Score=85.72 Aligned_cols=91 Identities=23% Similarity=0.264 Sum_probs=66.0 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHC--CCCCHHHHHHHCCCCCCHHHHHHHH Q ss_conf 99999863889819997307749999986---089988420331688999867--8999689899829981100437999 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPIHRWICTLVLARSNG--LPSALKAVGEALKLSSQKMEEGKEL 129 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~--~~~sL~~la~~l~l~~~k~~~~k~l 129 (675) .++..++. +.++||||+.||+.||... .|+......++||+-+|+.+- +.++|+.+++.++++... T Consensus 141 ~~f~~Fi~--d~ilVaHNa~FD~~FL~~~l~r~g~~~l~n~~lDTl~LAR~~~p~~~ykL~~La~~~~I~~~~------- 211 (259) T PRK08517 141 EDFRLFLG--DSVFVAHNVNFDYNFISRSLEEIGLGPLLNRKLCTIDLAKRTIESPRYGLSFLKELLGIETEV------- 211 (259) T ss_pred HHHHHHHC--CCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEEHHHHHHHCCCCCCCHHHHHHHCCCCCCC------- T ss_conf 99999968--895999372760999999999839997689856608999987587787889999975989999------- Q ss_pred HHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCC Q ss_conf 98620211234332346785853289999998998999999996301040001 Q gi|254781193|r 130 IARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLSSKE 182 (675) Q Consensus 130 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~~~E 182 (675) .| -|..|+.+|.+|+..+...+..+ T Consensus 212 -------------------~H---------RAl~DA~aTaevF~~~L~~l~~~ 236 (259) T PRK08517 212 -------------------HH---------RAYADALAAYEIFEICLLNLPSY 236 (259) T ss_pred -------------------CC---------CCHHHHHHHHHHHHHHHHHCCCC T ss_conf -------------------97---------84588999999999999856655 No 40 >PRK08074 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=99.27 E-value=2.1e-10 Score=85.79 Aligned_cols=108 Identities=21% Similarity=0.165 Sum_probs=70.9 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEE-CCEEEE-EE--ECCCCHHH------------------------ Q ss_conf 689998141789870004400112798199999998-480399-99--78881043------------------------ Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGF-DDHPVE-LW--DCVHDPVM------------------------ 53 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~-~~~~v~-~~--d~~~~~~~------------------------ 53 (675) |++++|+||++ +++- ...+|.-||... .++.+. .+ .+-++.++ T Consensus 4 ~~~vvDlETTG--------~~~~-~~d~IIqig~v~i~~~~i~~~f~t~vnP~~~iP~~I~~LTgI~~~~v~~AP~f~ev 74 (932) T PRK08074 4 RFVVVDLETTG--------NSPK-KGDKIIQIAAVVVEDGEILERFSTFVNPERPIPPFITELTGISEEMVKNAPLFEDV 74 (932) T ss_pred EEEEEEEECCC--------CCCC-CCCEEEEEEEEEEECCEEEEEEECCCCCCCCCCHHHHHHCCCCHHHHCCCCCHHHH T ss_conf 08999988889--------8977-79889999799999999989985325899989977886069898998059998999 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHHC---CCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCC Q ss_conf 7999998638898199973077499999860---899884203316889998678---999689899829981 Q gi|254781193|r 54 PQVLKEALENTDVTIVAHNSYFERTVLKACF---NLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSS 120 (675) Q Consensus 54 ~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~---Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~ 120 (675) ...+.++|+ +.++||||+.||+.||...+ |+......++||+=+|+.+-| +++|.+++..++++. T Consensus 75 a~~i~~~l~--~~~~VaHNv~FD~~FL~~~l~~~G~~~~~~~~~DTveLa~i~~P~~~sy~L~~L~~~l~i~~ 145 (932) T PRK08074 75 APMIVELLE--GAYFVAHNVHFDLNFLNEELERAGYTEIHCPKLDTVELARILLPTADSYKLSDLSEELELEH 145 (932) T ss_pred HHHHHHHHC--CCEEEECCHHHHHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHHCCCCC T ss_conf 999999966--99599478054099999999973998666876519999999778878888899998649999 No 41 >cd06126 DEDDy DEDDy exonucleases, part of the DnaQ-like (or DEDD) exonuclease superfamily, catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. They contain four invariant acidic residues in three conserved sequence motifs termed ExoI, ExoII and ExoIII. DEDDy exonucleases are classified as such because of the presence of a specific YX(3)D pattern at ExoIII. The four conserved acidic residues serve as ligands for the two metal ions required for catalysis. DEDDy exonucleases include the proofreading domains of DNA polymerases (families A and B), as well as RNases such as RNase D and Rrp6p. The Egalitarian (Egl) and Bacillus-like DNA Polymerase I subfamilies do not possess a completely conserved YX(3)D pattern at the ExoIII motif. In addition, Bacillus-like DNA polymerases contain inactive 3'-5' exonuclease domains which do not possess the metal-binding residues necessary for activity. Probab=99.26 E-value=4.1e-10 Score=83.86 Aligned_cols=147 Identities=20% Similarity=0.190 Sum_probs=96.3 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCC----------HHHHHHHHHHHCCCCCEEEEEE Q ss_conf 899981417898700044001127981999999984803999978881----------0437999998638898199973 Q gi|254781193|r 3 TLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHD----------PVMPQVLKEALENTDVTIVAHN 72 (675) Q Consensus 3 ~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~----------~~~~~~l~~~L~~~~~~kV~HN 72 (675) .+++|+||+...- + .....|.++|++..++...+++..+. ..++..+..++++.+.+++||| T Consensus 1 vl~fDiE~~~~~~----~----~~~~pI~~i~~~~~~g~~~~~~~~~~~~~~~~~~~E~~ll~~f~~~i~~~~dii~GyN 72 (181) T cd06126 1 VVAFDIETTPLVG----G----GGKGPIILIQLAVSAGEKFLLDVLTLTLDAVDSADEKELLGELKAILEDYDDIKVGYN 72 (181) T ss_pred CEEEEEECCCCCC----C----CCCCCEEEEEEEECCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 9289862379899----9----8999889999996599789998624346755678999999999999830899899979 Q ss_pred H-HHHHHHHHHH---CCCCC----------------CCCCEEEHHHHHHHHC---CCCCHHHHHHHCCCCCCHHH-HHHH Q ss_conf 0-7749999986---08998----------------8420331688999867---89996898998299811004-3799 Q gi|254781193|r 73 S-YFERTVLKAC---FNLDI----------------PIHRWICTLVLARSNG---LPSALKAVGEALKLSSQKME-EGKE 128 (675) Q Consensus 73 a-~FD~~~L~~~---~Gi~i----------------~~~~~~DTmi~A~~l~---~~~sL~~la~~l~l~~~k~~-~~k~ 128 (675) . .||++.|.+. +|+.. +....+|++..++-.+ .+++|+.+|+.+. +..|.. .+.. T Consensus 73 ~~~FDlpyL~~R~~~l~~~~~~~~~~~~~~~~~~~~~gr~~~D~~~~~~~~~~~~~~~~L~~va~~~l-g~~K~~~~~~~ 151 (181) T cd06126 73 IDSFDLPYLLNRAGELGIPLRRGRDRSSGSREEINIKGRILFDLLLAAKRLGVALPSYSLDALAEQLL-GEGKEKVDQLA 151 (181) T ss_pred CCCCCHHHHHHHHHHHCCCCHHCCCCCCCCCCEEEECCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHC-CCCCCCCCHHH T ss_conf 97787999999999958970102266656550486348523219999998413646535999999974-99956699999 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 998620211234332346785853289999998998999999996 Q gi|254781193|r 129 LIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFK 173 (675) Q Consensus 129 l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~ 173 (675) .... +...+ ..+.+.+||..||.+|++||. T Consensus 152 i~~~--------------w~~~~-~~~~~~~Y~~~Da~~tl~l~l 181 (181) T cd06126 152 IKQA--------------WRARP-LLERLAEYAAADAYATLQIYL 181 (181) T ss_pred HHHH--------------HHCCC-CHHHHHHHHHHHHHHHHHHHC T ss_conf 9998--------------86798-669999999999999999769 No 42 >PRK07740 hypothetical protein; Provisional Probab=99.25 E-value=4.4e-10 Score=83.70 Aligned_cols=133 Identities=17% Similarity=0.182 Sum_probs=82.6 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEE-EEECCEEE---EEEECCCCHHHH---------------------- Q ss_conf 9689998141789870004400112798199999-99848039---999788810437---------------------- Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFA-FGFDDHPV---ELWDCVHDPVMP---------------------- 54 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~-~a~~~~~v---~~~d~~~~~~~~---------------------- 54 (675) +.++++|+||+ |+++...| +|+-|+ +.+.++.+ ++.-+-++.++| T Consensus 57 ~~fvV~D~ETT--------Gl~p~~gD-~IIeIgAVkv~~g~i~d~F~tLVnP~~~Ip~~i~~ltGIT~emV~~aP~~~e 127 (240) T PRK07740 57 LPFVVFDLETT--------GFSPDQGD-EILSIAAVKTVGGEVTDLYYSLVKPKKPIPEHILTLTGISAKDVVFAPPLAE 127 (240) T ss_pred CEEEEEEEECC--------CCCCCCCC-EEEEEEEEEEECCEEEEEEEEEECCCCCCCHHHHHHCCCCHHHHHCCCCHHH T ss_conf 83899980589--------98988898-7899989999999996887755088798998899860989999837999999 Q ss_pred --HHHHHHHCCCCCEEEEEEHHHHHHHHHH----HCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCHHHHH Q ss_conf --9999986388981999730774999998----60899884203316889998678--999689899829981100437 Q gi|254781193|r 55 --QVLKEALENTDVTIVAHNSYFERTVLKA----CFNLDIPIHRWICTLVLARSNGL--PSALKAVGEALKLSSQKMEEG 126 (675) Q Consensus 55 --~~l~~~L~~~~~~kV~HNa~FD~~~L~~----~~Gi~i~~~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k~~~~ 126 (675) .++.+++ .+.++||||+.||+.||.+ +++..++ ..+.||+.+++.+.+ .++|++++++++++... T Consensus 128 Vl~~fl~Fi--~~~vlVaHNa~FD~~FL~~~l~r~~~~~~~-~~~iDT~~Lar~l~~~~~~sL~~l~~~fgI~~~~---- 200 (240) T PRK07740 128 VLRTFYPFI--NASTLVGYHIGHDLAFLRHALWRHYRQKFS-HRFIDTQFLTEIIAHQSFPTLDDALACYGITCPR---- 200 (240) T ss_pred HHHHHHHHH--CCCEEEEECCHHHHHHHHHHHHHHCCCCCC-CCEEEHHHHHHHHCCCCCCCHHHHHHHCCCCCCC---- T ss_conf 999999983--899899968286399999999984599878-8763699999986589987899999976999999---- Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 999986202112343323467858532899999989989999999963010400 Q gi|254781193|r 127 KELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLSS 180 (675) Q Consensus 127 k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~~ 180 (675) - .-|..|+.+|..||.+|.+.+. T Consensus 201 ----------------------r---------HrAl~DA~aTA~lf~~ll~~l~ 223 (240) T PRK07740 201 ----------------------R---------HTADGDVEMTAKLWAILIIEAR 223 (240) T ss_pred ----------------------C---------CCCHHHHHHHHHHHHHHHHHHH T ss_conf ----------------------9---------8829999999999999999999 No 43 >PRK07246 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=99.24 E-value=4.8e-10 Score=83.44 Aligned_cols=106 Identities=15% Similarity=0.242 Sum_probs=69.0 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEE-ECCEEE-EEEE--CCCCHHH------------------------ Q ss_conf 68999814178987000440011279819999999-848039-9997--8881043------------------------ Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFG-FDDHPV-ELWD--CVHDPVM------------------------ 53 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a-~~~~~v-~~~d--~~~~~~~------------------------ 53 (675) +++++|+||++ +++ .+ +|.-||.. +.++.+ ..+. +-++.++ T Consensus 8 ~~~vvDlETTG--------~~~--~~-~Iiqi~~v~~~~~~i~~~f~t~vnP~~~ip~~I~~LTgI~~~~v~~ap~f~~v 76 (820) T PRK07246 8 KYAVVDLEATG--------AGP--NA-SIIQVGIVIIQGNKIIDSYETDVNPHESLDEHIVHLTGITDKQLAQAPDFSQV 76 (820) T ss_pred CEEEEEEECCC--------CCC--CC-CEEEEEEEEEECCEEEEEEEECCCCCCCCCHHHHHHCCCCHHHHHCCCCHHHH T ss_conf 18999988789--------899--98-77999999999999989987311889979978886469898998529998999 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHHC---CCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCCC Q ss_conf 7999998638898199973077499999860---899884203316889998678---9996898998299811 Q gi|254781193|r 54 PQVLKEALENTDVTIVAHNSYFERTVLKACF---NLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSSQ 121 (675) Q Consensus 54 ~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~---Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~~ 121 (675) ...+.++|+ +.++||||+.||+.||...+ |+++. ..++||+=+|+.+-| +++|.++++.++++.+ T Consensus 77 ~~~~~~~l~--~~~~VaHNv~FD~~fl~~~~~~~g~~~~-~~~~DTv~La~i~~P~~~~y~L~~L~~~l~~~~~ 147 (820) T PRK07246 77 AHHIYQLIE--DCIFVAHNVKFDANLLAEALFLEGYELR-TPRVDTVELAQVFFPRLEKYSLSHLSRQLNIDLA 147 (820) T ss_pred HHHHHHHHC--CCEEEEECCCCCHHHHHHHHHHCCCCCC-CCCCCHHHHHHHHCCCCCCCCHHHHHHHCCCCCC T ss_conf 999999967--9859983720169999999997589889-9826099999997798788888999997399999 No 44 >PRK07982 consensus Probab=99.23 E-value=5e-10 Score=83.31 Aligned_cols=133 Identities=17% Similarity=0.161 Sum_probs=82.7 Q ss_pred CCEEEEEECCCCCCCCCCCCCHH---HCCCCEEEEEE-EEECCE-EE---EEEECCCC---------------------- Q ss_conf 96899981417898700044001---12798199999-998480-39---99978881---------------------- Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICK---YAEDVCVILFA-FGFDDH-PV---ELWDCVHD---------------------- 50 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~---y~~~~~i~~i~-~a~~~~-~v---~~~d~~~~---------------------- 50 (675) |+.|.||+||+ |+++ |-+.-+|+-|+ +.+.++ .+ +...+-++ T Consensus 6 ~R~ivlDtETT--------GL~p~~~~~~gdRIIEIgaVei~~~~~t~~~fh~yiNP~r~I~~~a~~VHGIT~e~L~dkP 77 (243) T PRK07982 6 TRQIVLDTETT--------GMNQIGAHYEGHKIIEIGAVEVINRRLTGNNFHVYLKPDRLVDPEAFGVHGIADEFLLDKP 77 (243) T ss_pred CEEEEEEEECC--------CCCCCCCCCCCCEEEEEEEEEEECCEECCCEEEEEECCCCCCCHHHHHHEEECHHHHHCCC T ss_conf 25899971179--------9998776778987999999999999882523437888899799889852115778871699 Q ss_pred --HHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCC----CCEEEHHHHHHHHCC--CCCHHHHHHHCCCC Q ss_conf --043799999863889819997307749999986---0899884----203316889998678--99968989982998 Q gi|254781193|r 51 --PVMPQVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPI----HRWICTLVLARSNGL--PSALKAVGEALKLS 119 (675) Q Consensus 51 --~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~----~~~~DTmi~A~~l~~--~~sL~~la~~l~l~ 119 (675) ..+.+++..++.+ ..+|+||+.||+.||..- .|.+++. .++.||+.+||-+-| +.||++|+++|+++ T Consensus 78 ~F~eI~~efl~Fi~~--a~LVaHNA~FD~~FLn~El~r~g~~~~~~~~~~~v~DTL~lAR~~~Pg~~~sLDaLc~R~~Id 155 (243) T PRK07982 78 TFADVADEFLDYIRG--AELVIHNAAFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLALARKMFPGKRNSLDALCDRYEID 155 (243) T ss_pred CHHHHHHHHHHHHCC--CEEEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCEEEHHHHHHHHCCCCCCCHHHHHHHCCCC T ss_conf 899999999999679--989997338889999999997088988610124313499998887889868999998754877 Q ss_pred CCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 110043799998620211234332346785853289999998998999999996301 Q gi|254781193|r 120 SQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 120 ~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) ...- -..-|..|+.+...+|-.|. T Consensus 156 ~s~R---------------------------------~~HgALlDa~lLAeVYl~lt 179 (243) T PRK07982 156 NSKR---------------------------------TLHGALLDAQILAEVYLAMT 179 (243) T ss_pred CCCC---------------------------------CCCCHHHHHHHHHHHHHHHC T ss_conf 6565---------------------------------52333774889999999972 No 45 >PRK07009 consensus Probab=99.23 E-value=3.8e-10 Score=84.07 Aligned_cols=132 Identities=17% Similarity=0.178 Sum_probs=80.2 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEE-EEE-CCEEE---E--EEEC----C------CC------------H Q ss_conf 9689998141789870004400112798199999-998-48039---9--9978----8------81------------0 Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFA-FGF-DDHPV---E--LWDC----V------HD------------P 51 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~-~a~-~~~~v---~--~~d~----~------~~------------~ 51 (675) |+.|.||+||+ |+++.. ..+|+-|+ +.. ++..+ + |+.- . || . T Consensus 1 Mr~VvlDtETT--------Gl~~~~-gdRIIEIg~v~l~~~~~t~~~~~~yiNP~r~I~~~a~~IHGIt~e~L~~kP~F~ 71 (241) T PRK07009 1 MRQIILDTETT--------GLNART-GDRIIEIGCVELLNRRLTGNNLHFYVNPERDSDPGALAVHGLTTEFLSDKPKFA 71 (241) T ss_pred CCEEEEEEECC--------CCCCCC-CCEEEEEEEEEEECCEEECEEEEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHH T ss_conf 98799982579--------979999-987899999999999780406899878899799757651225799872585599 Q ss_pred HHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCC---CCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCHH Q ss_conf 43799999863889819997307749999986---0899884---203316889998678--999689899829981100 Q gi|254781193|r 52 VMPQVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPI---HRWICTLVLARSNGL--PSALKAVGEALKLSSQKM 123 (675) Q Consensus 52 ~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~---~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k~ 123 (675) .+.+++.+++.+ ..+|+||+.||+.||..- .|.+... .++.||+.+||-+.| +.||++|+++|+++...- T Consensus 72 eia~efl~Fi~~--a~LVaHNA~FD~~FLn~EL~r~G~~~~~~~~~~viDTL~lAR~~~Pg~~~sLdaLc~R~~I~~~~r 149 (241) T PRK07009 72 EVVDQIRDFVQD--AELIIHNAPFDLGFLDAEFALLGLPPFTEHCAGVIDTLVQAKQMFPGKRNSLDALCDRFGISNAHR 149 (241) T ss_pred HHHHHHHHHHCC--CEEEEECHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCC T ss_conf 999999999669--927886769999999999998399731024444767999999978999878899999828698873 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 43799998620211234332346785853289999998998999999996301 Q gi|254781193|r 124 EEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 124 ~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) -..-|..|+.++..+|-.|. T Consensus 150 ---------------------------------~~HgAL~Da~lLa~vyl~mt 169 (241) T PRK07009 150 ---------------------------------TLHGALLDSELLAEVYLAMT 169 (241) T ss_pred ---------------------------------CCCCCHHHHHHHHHHHHHHH T ss_conf ---------------------------------30242877999999999983 No 46 >TIGR01406 dnaQ_proteo DNA polymerase III, epsilon subunit; InterPro: IPR006309 These represent DnaQ, the DNA polymerase III epsilon subunit, as found in most Proteobacteria. It consists largely of an exonuclease region as described in IPR006055 from INTERPRO. In Gram-positive bacteria, closely related regions are found both in the Gram-positive type DNA polymerase III alpha subunit and as an additional N-terminal domain of a DinG-family helicase. Both are not included in this set of sequences, as they are smaller proteins.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=99.22 E-value=1.4e-10 Score=86.87 Aligned_cols=89 Identities=20% Similarity=0.250 Sum_probs=64.8 Q ss_pred HHHHHHHHHCCCCCEEEEEEHHHHHHHHHH---HCC---CC-C-CCCCEEEHHHHHHHHCCC--CCHHHHHHHCCCCCCH Q ss_conf 379999986388981999730774999998---608---99-8-842033168899986789--9968989982998110 Q gi|254781193|r 53 MPQVLKEALENTDVTIVAHNSYFERTVLKA---CFN---LD-I-PIHRWICTLVLARSNGLP--SALKAVGEALKLSSQK 122 (675) Q Consensus 53 ~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~---~~G---i~-i-~~~~~~DTmi~A~~l~~~--~sL~~la~~l~l~~~k 122 (675) +-+++..++.+. .+|.|||+||+.||.. -+| +. + +..+|.||+.|||-.-|+ .|||+||++|+++..+ T Consensus 79 Ia~~Fl~Fi~~~--~LvIHNA~FD~gFlN~EL~~Lg~~~~~~~~~~~~~~DTL~mAR~~FPG~~~sLDALCkRf~vDns~ 156 (246) T TIGR01406 79 IADEFLDFIKGS--ELVIHNAAFDVGFLNYELERLGLKEIKKISDFCRVIDTLAMARERFPGKRNSLDALCKRFKVDNSH 156 (246) T ss_pred HHHHHHHHHCCC--EEEEECCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCC T ss_conf 899999984598--899975702167899999872873210001366786568999723789866467877430425000 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 043799998620211234332346785853289999998998999999996301 Q gi|254781193|r 123 MEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 123 ~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) - ...-|..|+.+...+|-.|. T Consensus 157 R---------------------------------~LHGAL~DA~LLAeVYl~lt 177 (246) T TIGR01406 157 R---------------------------------QLHGALLDAELLAEVYLALT 177 (246) T ss_pred C---------------------------------HHHHHHHHHHHHHHHHHHHC T ss_conf 2---------------------------------13478999999999999862 No 47 >PRK08816 consensus Probab=99.22 E-value=6.6e-10 Score=82.55 Aligned_cols=132 Identities=16% Similarity=0.176 Sum_probs=80.4 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEE-EEE-CCEEE---EEEECCCCH------------------------ Q ss_conf 9689998141789870004400112798199999-998-48039---999788810------------------------ Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFA-FGF-DDHPV---ELWDCVHDP------------------------ 51 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~-~a~-~~~~v---~~~d~~~~~------------------------ 51 (675) |+.|.||+||+ |+++. ..-+|+-|+ +.+ ++..+ +...+-++. T Consensus 1 mR~VVlDtETT--------Gl~~~-~gdRIIEIgaV~~~~~~~~~~~f~~~inP~r~I~~~a~~vhGIT~e~L~dkP~F~ 71 (244) T PRK08816 1 MRQIILDTETT--------GLEWR-KGNRVVEIGAVELLERRPSGNNFHRYLKPDCDFEPGAQEVTGLTLEFLADKPLFG 71 (244) T ss_pred CCEEEEEEECC--------CCCCC-CCCEEEEEEEEEEECCEECCCEEEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHH T ss_conf 98799972279--------98998-9988999999999999681437899868899699889875251789984099899 Q ss_pred HHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCC----CCCCCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCH Q ss_conf 43799999863889819997307749999986---0899----884203316889998678--99968989982998110 Q gi|254781193|r 52 VMPQVLKEALENTDVTIVAHNSYFERTVLKAC---FNLD----IPIHRWICTLVLARSNGL--PSALKAVGEALKLSSQK 122 (675) Q Consensus 52 ~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~----i~~~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k 122 (675) .+.+++..++. +..+|+||+.||+.||..- .|.+ .....+.||+.+||-..| +.||++|+++|+++... T Consensus 72 eV~~efl~Fi~--ds~LVaHNa~FDi~FLn~El~r~g~~~~~~~~~~~v~DTl~lAr~~~Pg~~nsLDaLc~r~~id~~~ 149 (244) T PRK08816 72 EVVDEFLAYID--GAELIIHNAAFDLGFLDNELSLLGDQYGRIVDRATVVDTLMMARERYPGQRNSLDALCKRLGVDNSH 149 (244) T ss_pred HHHHHHHHHHC--CCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHCCCCCCCHHHHHHHCCCCCCC T ss_conf 99999999828--9989996636529899999998488756764310104389999987879977899999974978877 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 043799998620211234332346785853289999998998999999996301 Q gi|254781193|r 123 MEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 123 ~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) -. ..-|..|+.+...+|-.|. T Consensus 150 R~---------------------------------~HgALlDa~lLa~VYl~mt 170 (244) T PRK08816 150 RQ---------------------------------LHGALLDAQILADVYIALT 170 (244) T ss_pred CC---------------------------------CCCHHHHHHHHHHHHHHHC T ss_conf 35---------------------------------0148887999999999974 No 48 >PRK05711 DNA polymerase III subunit epsilon; Provisional Probab=99.21 E-value=5.4e-10 Score=83.08 Aligned_cols=88 Identities=16% Similarity=0.186 Sum_probs=61.7 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCC----CCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCHHHH Q ss_conf 99999863889819997307749999986---0899884----203316889998678--99968989982998110043 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPI----HRWICTLVLARSNGL--PSALKAVGEALKLSSQKMEE 125 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~----~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k~~~ 125 (675) +++..++. +..+|+||+.||+.||.+- .|.+.+. ..++||+.+||-+-| +.||++++++|+++...- T Consensus 78 ~efl~Fi~--~s~lVaHNa~FD~~FLn~El~r~g~~~~~~~~~~~viDTL~lAR~~~Pg~~~sLdaLc~r~~I~~s~R-- 153 (240) T PRK05711 78 DEFLDFIR--GAELIIHNAPFDIGFMDYEFALLGRDIPKTTTFCKVTDTLAMARRMFPGKRNSLDALCKRYGIDNSHR-- 153 (240) T ss_pred HHHHHHHC--CCEEEEECCHHHHHHHHHHHHHHCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCC-- T ss_conf 99999818--99799966199899999999986899876445212101999999878999889999999809898875-- Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 7999986202112343323467858532899999989989999999963010 Q gi|254781193|r 126 GKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 126 ~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) . ..-|..|+.++..+|-.|.. T Consensus 154 ----------------------~---------~HgAL~Da~llA~Vyl~l~g 174 (240) T PRK05711 154 ----------------------T---------LHGALLDAEILAEVYLAMTG 174 (240) T ss_pred ----------------------C---------CCCHHHHHHHHHHHHHHHCC T ss_conf ----------------------6---------57648889999999999717 No 49 >PRK06310 DNA polymerase III subunit epsilon; Validated Probab=99.20 E-value=6.3e-10 Score=82.68 Aligned_cols=87 Identities=24% Similarity=0.242 Sum_probs=61.5 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCC--CCCEEEHHHHHHHH--CCCCCHHHHHHHCCCCCCHHHHHHH Q ss_conf 9999863889819997307749999986---089988--42033168899986--7899968989982998110043799 Q gi|254781193|r 56 VLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIP--IHRWICTLVLARSN--GLPSALKAVGEALKLSSQKMEEGKE 128 (675) Q Consensus 56 ~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~--~~~~~DTmi~A~~l--~~~~sL~~la~~l~l~~~k~~~~k~ 128 (675) ++.+++.+.. ++||||+.||+.+|... .|++.. ...+.||+-+|+.+ .+.++|++++.+++++... T Consensus 80 ~~~~fi~~~~-ilVaHNa~FD~~fL~~e~~r~g~~~~~~~~~~IDTl~lAr~~~~~~~~~L~~L~~~~~i~~~~------ 152 (250) T PRK06310 80 QIKGFFKEGD-YIVGHSVGFDLQVLAQEAERIGETFLSKHYYIIDTLRLAKEYGDSPNNSLEALAVHFNVPYDG------ 152 (250) T ss_pred HHHHHHCCCC-EEEEECHHHHHHHHHHHHHHCCCCCCCCCCCEECHHHHHHHCCCCCCCHHHHHHHHCCCCCCC------ T ss_conf 9999967999-999938455799999999976998665679578499999756765456099999970999986------ Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99862021123433234678585328999999899899999999630104 Q gi|254781193|r 129 LIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPL 178 (675) Q Consensus 129 l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~ 178 (675) . .-|..|+.+|..|+..|... T Consensus 153 --------------------a---------HrAl~Da~aTa~vf~~l~~r 173 (250) T PRK06310 153 --------------------N---------HRAMKDVEININVFKHLCKR 173 (250) T ss_pred --------------------C---------CCHHHHHHHHHHHHHHHHHH T ss_conf --------------------6---------68199999999999999985 No 50 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=99.20 E-value=2.1e-10 Score=85.72 Aligned_cols=69 Identities=28% Similarity=0.297 Sum_probs=51.5 Q ss_pred HHHHHHHHHHCCCCCEEEEEE-HHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCCCH Q ss_conf 437999998638898199973-07749999986---0899884203316889998678---99968989982998110 Q gi|254781193|r 52 VMPQVLKEALENTDVTIVAHN-SYFERTVLKAC---FNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSSQK 122 (675) Q Consensus 52 ~~~~~l~~~L~~~~~~kV~HN-a~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~~k 122 (675) .++.++.+++. +.+.|||| |+||+.||... +++..-...++||+-+||.|+| +|+|+.+|+.|++.... T Consensus 274 ~vL~k~~~f~~--d~ilVAHNGasFD~~Fl~~~~~k~~~~~~~~p~IDTL~Lar~lnP~y~sh~Lg~~~~Kl~v~~~~ 349 (1264) T TIGR01405 274 EVLEKFKEFLK--DSILVAHNGASFDIGFLNTNFEKVGLEPLENPVIDTLELARALNPEYKSHRLGEIAKKLGVDLDD 349 (1264) T ss_pred HHHHHHHHHHC--CEEEEECCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHCCCCCCC T ss_conf 99999999769--75888627610366889889876277423365113889999837311344542233320552153 No 51 >PRK06309 DNA polymerase III subunit epsilon; Validated Probab=99.18 E-value=1e-09 Score=81.35 Aligned_cols=86 Identities=23% Similarity=0.244 Sum_probs=62.2 Q ss_pred HHHHHHCCCCCEEEEEEH-HHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCCCHHHHHHH Q ss_conf 999986388981999730-7749999986---0899884203316889998678---99968989982998110043799 Q gi|254781193|r 56 VLKEALENTDVTIVAHNS-YFERTVLKAC---FNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSSQKMEEGKE 128 (675) Q Consensus 56 ~l~~~L~~~~~~kV~HNa-~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~~k~~~~k~ 128 (675) ++.+++. ++.+.||||+ .||+.+|... .|+..+..+++||+.+|+.+-+ .++|+.++..+++..+. T Consensus 72 ~~~~f~~-~~~ilVaHN~~~FD~~fL~~e~~r~g~~~~~~~~iDTl~lar~l~P~l~~h~L~~L~~~~gi~~~~------ 144 (232) T PRK06309 72 KFREFCG-TDNILVAHNNDGFDFPLLVKECRRHSLEPLTLRTIDSLKWAQKYRPDLPKHNLQYLRQVYGFAENQ------ 144 (232) T ss_pred HHHHHHC-CCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCC------ T ss_conf 9999857-998899848965679999999998599867740537999999975897768899999980999988------ Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9986202112343323467858532899999989989999999963010 Q gi|254781193|r 129 LIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 129 l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) .| =|..|+.+|..|+..|.. T Consensus 145 --------------------aH---------rAl~Da~at~~vf~~ll~ 164 (232) T PRK06309 145 --------------------AH---------RALDDVITLHRVFSALVG 164 (232) T ss_pred --------------------CC---------CHHHHHHHHHHHHHHHHH T ss_conf --------------------88---------859999999999999872 No 52 >PRK06022 consensus Probab=99.18 E-value=7.5e-10 Score=82.18 Aligned_cols=87 Identities=22% Similarity=0.261 Sum_probs=61.7 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCC-CCCCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCHHHHHHH Q ss_conf 99999863889819997307749999986---08998-84203316889998678--99968989982998110043799 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDI-PIHRWICTLVLARSNGL--PSALKAVGEALKLSSQKMEEGKE 128 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i-~~~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k~~~~k~ 128 (675) +++.+++.+ ..+|+||+.||+.||..- .|... ...++.||+.+||-+-| +.||++|+++++++...- T Consensus 75 ~~fl~Fi~~--a~LVaHNA~FD~gFLn~El~r~G~~~~~~~~viDTL~LAR~~~Pg~~~sLdaLc~r~~Id~~~R----- 147 (234) T PRK06022 75 DEFLEFFDG--ARLVAHNATFDLGFINAEFARLGQPEITSERVVDTLALARRKHPMGPNSLDALCRRYGIDNSHR----- 147 (234) T ss_pred HHHHHHHCC--CEEEEECHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCC----- T ss_conf 999998189--8599955787299999999985998777888746999999868899889899999839997773----- Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 998620211234332346785853289999998998999999996301 Q gi|254781193|r 129 LIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 129 l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) . ..-|..|+.++..+|-.|. T Consensus 148 -------------------~---------~HgAL~Da~lLa~VyleL~ 167 (234) T PRK06022 148 -------------------T---------KHGALLDSELLAEVYIELI 167 (234) T ss_pred -------------------C---------CCCCHHHHHHHHHHHHHHC T ss_conf -------------------5---------3762888999999999972 No 53 >PRK06195 DNA polymerase III subunit epsilon; Validated Probab=99.16 E-value=8e-10 Score=81.98 Aligned_cols=64 Identities=28% Similarity=0.482 Sum_probs=50.6 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHH---HCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCC Q ss_conf 79999986388981999730774999998---60899884203316889998678---99968989982998 Q gi|254781193|r 54 PQVLKEALENTDVTIVAHNSYFERTVLKA---CFNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLS 119 (675) Q Consensus 54 ~~~l~~~L~~~~~~kV~HNa~FD~~~L~~---~~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~ 119 (675) ..++.+++. +.++||||+.||+.||.+ .+|+.++...|.||+-+||.+.+ +++|..|++.+++. T Consensus 71 ~~~~~~fi~--~~vlVaHNa~FD~~fL~~~~~r~gl~~~~~~~~cTl~LAR~~~p~l~~~kL~~La~~~gi~ 140 (309) T PRK06195 71 WEKIKDYFN--DNLVIAHNASFDISVLRKTLELYNIPMPDFEYICTMKLAKNFYSNIPNARLNTVNNFLGYE 140 (309) T ss_pred HHHHHHHHC--CCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHHCCC T ss_conf 999999857--9979995628889999999998499999997774799999865577657888999985988 No 54 >PRK07883 hypothetical protein; Validated Probab=99.15 E-value=2.7e-09 Score=78.54 Aligned_cols=87 Identities=23% Similarity=0.266 Sum_probs=62.4 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHH---HCCCCCCCCCEEEHHHHHHHHCC-----CCCHHHHHHHCCCCCCHHHHH Q ss_conf 9999986388981999730774999998---60899884203316889998678-----999689899829981100437 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKA---CFNLDIPIHRWICTLVLARSNGL-----PSALKAVGEALKLSSQKMEEG 126 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~---~~Gi~i~~~~~~DTmi~A~~l~~-----~~sL~~la~~l~l~~~k~~~~ 126 (675) .++.+++. +.++|+||+.||+.||.. ..|+..+...+.||+-+|+.+-+ +++|+.+++++++.... T Consensus 96 ~~f~~F~~--~~vlVAHNa~FD~~FL~~~~~r~g~~~~~~~~ldTl~LARrl~p~~e~~~~kL~tLa~~~g~~~~~---- 169 (575) T PRK07883 96 PAFLEFAR--GSVLVAHNAPFDIGFLRAAAERCGYPWPQPEVLCTVRLARRVLSRDEAPNVRLSALARLFGAATTP---- 169 (575) T ss_pred HHHHHHHC--CCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCC---- T ss_conf 99999968--998999083452999999999759998789786489999986692114689989999975999999---- Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999862021123433234678585328999999899899999999630104 Q gi|254781193|r 127 KELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPL 178 (675) Q Consensus 127 k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~ 178 (675) .| -|..|+.+|..++..|... T Consensus 170 ----------------------~H---------RAl~DA~AT~~vl~~l~~~ 190 (575) T PRK07883 170 ----------------------TH---------RALDDARATVDVLHGLIER 190 (575) T ss_pred ----------------------CC---------CHHHHHHHHHHHHHHHHHH T ss_conf ----------------------86---------6045299999999999998 No 55 >PRK06859 consensus Probab=99.14 E-value=2.5e-09 Score=78.74 Aligned_cols=88 Identities=17% Similarity=0.189 Sum_probs=61.0 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCC---CCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCHHHH Q ss_conf 799999863889819997307749999986---0899884---203316889998678--99968989982998110043 Q gi|254781193|r 54 PQVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPI---HRWICTLVLARSNGL--PSALKAVGEALKLSSQKMEE 125 (675) Q Consensus 54 ~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~---~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k~~~ 125 (675) .+++..++. +..+|+||+.||+.||..- .|.+.+. .++.||+.+||-+.| +.||++|+++|+++...- T Consensus 82 a~efl~Fi~--~a~LVaHNa~FD~~FLn~El~r~g~~~~~~~~~~viDTL~lAR~~~Pg~~~sLdaLc~r~~id~~~R-- 157 (253) T PRK06859 82 AQEFIDYIK--GAELLIHNAPFDVGFMDYEFRKLNLNVKTDDICLVTDTLQMARQMYPGKRNNLDALCDRLGIDNSKR-- 157 (253) T ss_pred HHHHHHHHC--CCEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHCCCCCCCHHHHHHHCCCCCCCC-- T ss_conf 999999967--9979995718889999999997499965322001133999999878799779999999739987663-- Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 799998620211234332346785853289999998998999999996301 Q gi|254781193|r 126 GKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 126 ~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) -..-|..|+.+...+|-.|. T Consensus 158 -------------------------------~~HgAllDa~lLa~VYl~mt 177 (253) T PRK06859 158 -------------------------------TLHGALLDAEILADVYLMMT 177 (253) T ss_pred -------------------------------CCCCHHHHHHHHHHHHHHHC T ss_conf -------------------------------30132888999999999872 No 56 >PRK08858 consensus Probab=99.12 E-value=3.5e-09 Score=77.83 Aligned_cols=88 Identities=18% Similarity=0.169 Sum_probs=61.7 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHHC---CCCCC----CCCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCHHH Q ss_conf 7999998638898199973077499999860---89988----4203316889998678--9996898998299811004 Q gi|254781193|r 54 PQVLKEALENTDVTIVAHNSYFERTVLKACF---NLDIP----IHRWICTLVLARSNGL--PSALKAVGEALKLSSQKME 124 (675) Q Consensus 54 ~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~---Gi~i~----~~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k~~ 124 (675) .+++.+++.+ ..+|+||+.||+.||..-+ |..+. ..++.||+.+||.+-| +.||++|+++|+++...- T Consensus 88 ~~efl~FI~d--~~LVaHNa~FD~~FLn~El~r~~~~~~~~~~~~~viDTL~lAR~~~Pg~~~sLdaLc~r~~Id~s~R- 164 (246) T PRK08858 88 HQEFLEFIKG--AELVAHNAPFDVGFMDYEFEKLNPAIGKTDDYCKVTDTLAMAKKIFPGKRNNLDVLCERYGIDNSHR- 164 (246) T ss_pred HHHHHHHHCC--CEEEEECCHHHHHHHHHHHHHHCCCCCCHHHHCCCEEHHHHHHHHCCCCCCCHHHHHHHCCCCCCCC- T ss_conf 9999999589--9899972288899999999982877675134201002999999868999888899999819898883- Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 3799998620211234332346785853289999998998999999996301 Q gi|254781193|r 125 EGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 125 ~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) -..-|..|+.+...+|-.|. T Consensus 165 --------------------------------~~HgALlDa~lLa~Vyl~mt 184 (246) T PRK08858 165 --------------------------------TLHGALLDAEILADVYLLMT 184 (246) T ss_pred --------------------------------CCCCHHHHHHHHHHHHHHHC T ss_conf --------------------------------40230888999999999972 No 57 >cd06131 DNA_pol_III_epsilon_Ecoli_like This subfamily is composed of the epsilon subunit of Escherichia coli DNA polymerase III (polIII) and similar proteins. PolIII is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. It is a holoenzyme complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The polIII epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon subunit of polIII also functions as a stabilizer of the holoenzyme complex. Probab=99.10 E-value=4.1e-09 Score=77.43 Aligned_cols=85 Identities=25% Similarity=0.305 Sum_probs=58.7 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCC---CCCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCHHHHHH Q ss_conf 9999863889819997307749999986---089988---4203316889998678--9996898998299811004379 Q gi|254781193|r 56 VLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIP---IHRWICTLVLARSNGL--PSALKAVGEALKLSSQKMEEGK 127 (675) Q Consensus 56 ~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~---~~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k~~~~k 127 (675) ++.+++. +..+||||+.||+.||.+. +|...+ ...|.|||.+++.+.+ +++|+.++.+|+++.... T Consensus 74 ~f~~~l~--~~~lVaHN~~FD~~fL~~~~~r~~~~~~~~~~~~~~Dtl~l~r~~~p~~~~~L~~l~~~~gi~~~~~---- 147 (167) T cd06131 74 EFLDFIR--GAELVIHNASFDVGFLNAELSLLGLGKKIIDFCRVIDTLALARKKFPGKPNSLDALCKRFGIDNSHR---- 147 (167) T ss_pred HHHHHCC--CCEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHCCCCCCCC---- T ss_conf 9986356--9999983718789999999998189977776554113999999984999888999998769799999---- Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999862021123433234678585328999999899899999999630 Q gi|254781193|r 128 ELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRL 175 (675) Q Consensus 128 ~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l 175 (675) ..| =|..|+.+|.+||.+| T Consensus 148 --------------------~~H---------~Al~Da~~ta~vf~~l 166 (167) T cd06131 148 --------------------TLH---------GALLDAELLAEVYLEL 166 (167) T ss_pred --------------------CCC---------CHHHHHHHHHHHHHHH T ss_conf --------------------875---------8899999999999976 No 58 >cd05781 DNA_polB_B3_exo The 3'-5' exonuclease domain of archaeal proteins with similarity to Sulfurisphaera ohwakuensis DNA polymerase B3. B3 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B3 exhibits both polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Archaeal proteins that are involved in DNA replication are similar to those from eukaryotes. Some archaea possess multiple family-B DNA polymerases. B3 is mainly found in crenarch Probab=99.09 E-value=1.2e-08 Score=74.43 Aligned_cols=153 Identities=17% Similarity=0.190 Sum_probs=92.5 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCC--CHHHHHHHHHHHC--CCCCEEEEEEH-HH Q ss_conf 9689998141789870004400112798199999998480399997888--1043799999863--88981999730-77 Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVH--DPVMPQVLKEALE--NTDVTIVAHNS-YF 75 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~--~~~~~~~l~~~L~--~~~~~kV~HNa-~F 75 (675) .|.+++|+||++..... -.+...|+.||++..+++..++.... ...++..+..++. +++++ +|||. .| T Consensus 3 lk~lsfDIE~~~~~gfp------~~~~d~Ii~Is~~~~~g~~~~~~~~~~~E~~ll~~f~~~i~~~~Pdii-~gyN~~~F 75 (188) T cd05781 3 LKTLAFDIEVYSKYGTP------NPRRDPIIVISLATSNGDVEFILAEGLDDRKIIREFVKYVKEYDPDII-VGYNSNAF 75 (188) T ss_pred CEEEEEEEEECCCCCCC------CCCCCCEEEEEEEECCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCEE-EECCCCCC T ss_conf 41999998876899989------988897899999968997699984799989999999999987299999-96698675 Q ss_pred HHHHHHH---HCCCCCCC-------------------CC-EEEHHHHHHH-HC-CCCCHHHHHHHCCCCC--C-HHHHHH Q ss_conf 4999998---60899884-------------------20-3316889998-67-8999689899829981--1-004379 Q gi|254781193|r 76 ERTVLKA---CFNLDIPI-------------------HR-WICTLVLARS-NG-LPSALKAVGEALKLSS--Q-KMEEGK 127 (675) Q Consensus 76 D~~~L~~---~~Gi~i~~-------------------~~-~~DTmi~A~~-l~-~~~sL~~la~~l~l~~--~-k~~~~k 127 (675) |++.|.. .+|+++.. ++ .+|.+-..+- .. ..++|+.+|..++... + .....+ T Consensus 76 D~pyl~~Ra~~~~i~l~~gr~~~~~~~~~~~~~~~i~Gr~~iDl~~~~~~~~~l~sysL~~Va~~lg~~~k~e~~~~~~~ 155 (188) T cd05781 76 DWPYLVERARVLGVKLDVGRRGGSEPSTGVYGHYSITGRLNVDLYDFAEEIPEVKVKTLENVAEYLGVMKKSERVLIEWY 155 (188) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCCCCCHH T ss_conf 78999999999699850155677651225420699878999977899976247564579999998541224544657999 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999862021123433234678585328999999899899999999630 Q gi|254781193|r 128 ELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRL 175 (675) Q Consensus 128 ~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l 175 (675) +.-.. + ..+...+.+.+||.+|+.+|++|+++| T Consensus 156 ~I~~~--------------~-~~~~~r~~l~~YnlqDa~L~~~L~~KL 188 (188) T cd05781 156 RIYEY--------------W-DDEKKRDILLKYNRDDARSTYGLAEKL 188 (188) T ss_pred HHHHH--------------H-HCCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99998--------------7-084121699999788799999998429 No 59 >PRK07942 DNA polymerase III subunit epsilon; Provisional Probab=99.08 E-value=6e-09 Score=76.31 Aligned_cols=80 Identities=19% Similarity=0.165 Sum_probs=58.8 Q ss_pred CCCEEEEEEHHHHHHHHHHH---CCCC-CCCCCEEEHHHHHHHHCC----CCCHHHHHHHCCCCCCHHHHHHHHHHHHCC Q ss_conf 89819997307749999986---0899-884203316889998678----999689899829981100437999986202 Q gi|254781193|r 64 TDVTIVAHNSYFERTVLKAC---FNLD-IPIHRWICTLVLARSNGL----PSALKAVGEALKLSSQKMEEGKELIARFCK 135 (675) Q Consensus 64 ~~~~kV~HNa~FD~~~L~~~---~Gi~-i~~~~~~DTmi~A~~l~~----~~sL~~la~~l~l~~~k~~~~k~l~~~~~k 135 (675) .+..+|+||+.||+.||.+. +|+. +....++||+++++.+++ .++|++++++|+++.+. T Consensus 92 ~g~~lVahNA~FDl~fL~~El~R~g~~~l~~~~viDtl~l~r~l~~~r~gkr~L~aL~~~ygv~l~~------------- 158 (234) T PRK07942 92 AGVPVVVYNAPYDLTVLDRELRRHGLPSLVPGPVIDPYVIDKHVDRYRKGKRTLTALCEHYGVRLDN------------- 158 (234) T ss_pred CCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCCEEEHHHHHHHHCCCCCCCCCHHHHHHHCCCCCCC------------- T ss_conf 7987999670764999999999859996777755619998877514777888899999984998877------------- Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 1123433234678585328999999899899999999630104 Q gi|254781193|r 136 GETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPL 178 (675) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~ 178 (675) .| =|..|+..|.+|...|... T Consensus 159 -------------aH---------~A~aDA~Ata~l~~~l~~r 179 (234) T PRK07942 159 -------------AH---------DATADALAAARLAWALARR 179 (234) T ss_pred -------------CC---------CCHHHHHHHHHHHHHHHHH T ss_conf -------------55---------6415289999999999987 No 60 >PRK09145 DNA polymerase III subunit epsilon; Validated Probab=99.06 E-value=1e-08 Score=74.78 Aligned_cols=128 Identities=16% Similarity=0.131 Sum_probs=75.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEE-EECCEEE-----EEEECCCC---------------------H--- Q ss_conf 6899981417898700044001127981999999-9848039-----99978881---------------------0--- Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAF-GFDDHPV-----ELWDCVHD---------------------P--- 51 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~-a~~~~~v-----~~~d~~~~---------------------~--- 51 (675) ++++||+||+ |+++. ..+|+-|+. .+.++.+ +..-+-.+ + T Consensus 30 ~fVv~D~ETT--------GL~~~--~D~IieIgav~v~~~~i~~~~~f~~lV~P~~~i~~~~~~IhGIt~~~l~~ap~~~ 99 (203) T PRK09145 30 EWVALDCETT--------GLDPR--RAEIVSIAAVKIRGNRILTSERLELLVRPPQSLSAESIKIHRLRHQDLEHGLSEE 99 (203) T ss_pred CEEEEEEECC--------CCCCC--CCEEEEEEEEEEECCEEECCCEEEEEECCCCCCCHHHHCCCCCCHHHHCCCCCHH T ss_conf 8999985689--------99999--9706999999999999931752899989799598645201387879981399999 Q ss_pred HHHHHHHHHHCCCCCEEEEEEHHHHHHHHHH----HCCCCCCCCCEEEHHHHH--HHH--CC----CCCHHHHHHHCCCC Q ss_conf 4379999986388981999730774999998----608998842033168899--986--78----99968989982998 Q gi|254781193|r 52 VMPQVLKEALENTDVTIVAHNSYFERTVLKA----CFNLDIPIHRWICTLVLA--RSN--GL----PSALKAVGEALKLS 119 (675) Q Consensus 52 ~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~----~~Gi~i~~~~~~DTmi~A--~~l--~~----~~sL~~la~~l~l~ 119 (675) .+..++..++. +.++||||+.||+.||.+ ++|+.++ ..+.||+.+. +.. .+ .++|+.+++.|+++ T Consensus 100 evl~~f~~fi~--~~~lVgHNa~FD~~fL~~~~~r~~g~~l~-~~~id~~~l~~~~~~~~~~~~~~~~~L~~l~~~~gi~ 176 (203) T PRK09145 100 EALRQLLAFIG--NRPLVGYYLEFDVAMLNRYVRPLLGIGLP-NPLIEVSSLYHDKKERHLPDAYIDLRFDAILKHLDLP 176 (203) T ss_pred HHHHHHHHHCC--CCEEEEECHHHHHHHHHHHHHHHCCCCCC-CCEEEHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCC T ss_conf 99999998829--99899987899999999999996299999-9877609879999996676987788999999857987 Q ss_pred CCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 1100437999986202112343323467858532899999989989999999963010 Q gi|254781193|r 120 SQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 120 ~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) ... .| =|..|+.+|..||.+|.+ T Consensus 177 ~~~--------------------------~H---------rAL~DA~aTa~lfl~L~k 199 (203) T PRK09145 177 VLG--------------------------RH---------DALNDAIMTALIYLRLRK 199 (203) T ss_pred CCC--------------------------CC---------CHHHHHHHHHHHHHHHHC T ss_conf 899--------------------------97---------989999999999999867 No 61 >cd05782 DNA_polB_like1_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=99.06 E-value=1.4e-08 Score=73.95 Aligned_cols=131 Identities=17% Similarity=0.243 Sum_probs=86.9 Q ss_pred CEEEEEEEEECC---EEEEEEECCC--CHHHHHHHHHHHCCCCCEEEEEEH-HHHHHHHHH---HCCCCCCC-------- Q ss_conf 819999999848---0399997888--104379999986388981999730-774999998---60899884-------- Q gi|254781193|r 28 VCVILFAFGFDD---HPVELWDCVH--DPVMPQVLKEALENTDVTIVAHNS-YFERTVLKA---CFNLDIPI-------- 90 (675) Q Consensus 28 ~~i~~i~~a~~~---~~v~~~d~~~--~~~~~~~l~~~L~~~~~~kV~HNa-~FD~~~L~~---~~Gi~i~~-------- 90 (675) -+|+|||+.+.+ .......... ...+.+.+.+.++...-.+|+||. .||+++|.+ .+|+.+|. T Consensus 51 ~rIV~Is~~~~~~~~~~~~~~~~~~~~E~~ll~~F~~~i~~~~p~lv~~Ng~~FD~P~l~~Ral~~gi~~p~~~~~~~~~ 130 (208) T cd05782 51 HKVVSISALYRDDDGGFLKVRTLDGADEKELLEDFFQLIEKKNPRLVSFNGRGFDLPVLHLRALIHGVSAPAYFDLGNKD 130 (208) T ss_pred CCEEEEEEEEEECCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHCCCCCCHHHHCCCCC T ss_conf 86279999999558977998417999999999999999986599799636875758999999999198975554048874 Q ss_pred ----CC----EEEHHHHHHHHC--CCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf ----20----331688999867--89996898998299811004379999862021123433234678585328999999 Q gi|254781193|r 91 ----HR----WICTLVLARSNG--LPSALKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEY 160 (675) Q Consensus 91 ----~~----~~DTmi~A~~l~--~~~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Y 160 (675) .+ ..|||-.-..-+ ...||+.+|..+|++.+-...|++....+. ....+...+| T Consensus 131 ~~y~~r~~~~H~Dl~d~l~~~~~~~~~sLd~la~llG~pgK~~~~Gs~V~~~~~----------------~g~l~~I~~Y 194 (208) T cd05782 131 WNYRNRYSERHLDLMDLLAFYGARARASLDLLAKLLGIPGKMDVDGSQVWELYA----------------EGKLDEIAEY 194 (208) T ss_pred CCCCCCCCCCCCCHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHH----------------CCCHHHHHHH T ss_conf 112488876313299998554875778899999983899757888899999998----------------1998899999 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 89989999999963 Q gi|254781193|r 161 CKGDVTVTREIFKR 174 (675) Q Consensus 161 a~~Dv~~t~~L~~~ 174 (675) |..||..|+.||.+ T Consensus 195 Ce~DVl~T~~v~lR 208 (208) T cd05782 195 CETDVLNTYLLYLR 208 (208) T ss_pred HHHHHHHHHHHHHC T ss_conf 99999999999809 No 62 >cd05785 DNA_polB_like2_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=99.03 E-value=2.6e-08 Score=72.20 Aligned_cols=152 Identities=22% Similarity=0.202 Sum_probs=90.6 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECC--CCHHHHHHHHHHHC--CCCCEEEEEEH-HH Q ss_conf 968999814178987000440011279819999999848039999788--81043799999863--88981999730-77 Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCV--HDPVMPQVLKEALE--NTDVTIVAHNS-YF 75 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~--~~~~~~~~l~~~L~--~~~~~kV~HNa-~F 75 (675) +|.+++||||++....-- ..+-.++-.|+.|++..+++...++... ....++..+..++. +|+++ +|+|. .| T Consensus 9 lr~lsfDIE~~~~~g~~~--p~p~~~~d~Ii~I~~~~~~~~~~v~~~~~~~E~~lL~~F~~~i~~~dPDII-tGyNi~~F 85 (207) T cd05785 9 LRRLQLDIETYSLPGFFF--SNPDRGDDRIIIVALRDNRGWEEVLHAEDAAEKELLEELVAIIRERDPDVI-EGHNIFRF 85 (207) T ss_pred EEEEEEEEEECCCCCCCC--CCCCCCCCCEEEEEEEECCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCEE-EECCCCCC T ss_conf 059999997478899779--998889997899998858997798762699999999999999987399999-86798775 Q ss_pred HHHHHHHH---CCCCCCC-----------------------------CC-EEEHHHHHHHH-----C-CCCCHHHHHHHC Q ss_conf 49999986---0899884-----------------------------20-33168899986-----7-899968989982 Q gi|254781193|r 76 ERTVLKAC---FNLDIPI-----------------------------HR-WICTLVLARSN-----G-LPSALKAVGEAL 116 (675) Q Consensus 76 D~~~L~~~---~Gi~i~~-----------------------------~~-~~DTmi~A~~l-----~-~~~sL~~la~~l 116 (675) |++.|.+. +|+++.. ++ ++|.+-+.+.. . .+++|+.+|+.+ T Consensus 86 D~pYL~~Ra~~~~i~~~lgR~~~~~~~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~k~~~~~~~~L~SY~L~~Va~~~ 165 (207) T cd05785 86 DLPYLRRRCRRHGVPLAIGRDGSIPRQRPSRFRFAERLIDYPRYDIPGRHVIDTYFLVQLFDVSSRDLPSYGLKAVAKHF 165 (207) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCEEECCEEEECCCCCCEEEEEECCEEEEEHHHHHHHHCCCCCCCCCCCHHHHHHHH T ss_conf 88999999999599722257898744640302122355530499873648865499998642111467765289999987 Q ss_pred CCCC-CHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9981-1004-379999862021123433234678585328999999899899999999 Q gi|254781193|r 117 KLSS-QKME-EGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIF 172 (675) Q Consensus 117 ~l~~-~k~~-~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~ 172 (675) ++.. ++.. .+.++-.. ...+++.+.+||.+||++|.+|. T Consensus 166 ~l~~~~k~~~~~~~i~~~-----------------~~~d~~~l~~Yni~Dv~~t~~La 206 (207) T cd05785 166 GLASPDRTYIDGRQIAEV-----------------WRSDPARLLAYALDDVRETEGLA 206 (207) T ss_pred HHCCCCCCCCCHHHHHHH-----------------HHCCHHHHHHHHHHHHHHHHHHC T ss_conf 313666563889999999-----------------95298999998299999999860 No 63 >cd06134 RNaseT RNase T is a DEDDh-type DnaQ-like 3'-5' exoribonuclease E implicated in the 3' maturation of small stable RNAs and 23srRNA, and in the end turnover of tRNA. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. RNase T is related to the proofreading domain of DNA polymerase III. Despite its important role, RNase T is mainly found only in gammaproteobacteria. It is speculated that it might have originated from DNA polymerase III at the time the gamma division of proteobacteria diverged from other bacteria. RNase T is a homodimer with the catalytic residues of one monomer contacting a large basic patch on the other monomer to form a functional active site. Probab=99.00 E-value=1.9e-08 Score=73.16 Aligned_cols=79 Identities=20% Similarity=0.175 Sum_probs=56.8 Q ss_pred CEEEEEEHHHHHHHHHHH---CCCCC-C--CCCEEEHHHHHHHHCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 819997307749999986---08998-8--42033168899986789996898998299811004379999862021123 Q gi|254781193|r 66 VTIVAHNSYFERTVLKAC---FNLDI-P--IHRWICTLVLARSNGLPSALKAVGEALKLSSQKMEEGKELIARFCKGETH 139 (675) Q Consensus 66 ~~kV~HNa~FD~~~L~~~---~Gi~i-~--~~~~~DTmi~A~~l~~~~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~ 139 (675) .++||||+.||+.||... .|+.. | +..++|||-+|+.+-+..+|+.+++.++++.... T Consensus 103 avlVaHNa~FD~~FL~~~~~r~g~~~~p~~~~~~~DTl~La~~~~~~~~L~~l~~~~gi~~~~~---------------- 166 (189) T cd06134 103 AILVGHNAHFDLGFLNAAVARCKIKRNPFHPFSTFDTATLAGLAYGQTVLAKACQAAGIEFDNK---------------- 166 (189) T ss_pred CEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHHCCCCHHHHHHHCCCCCCCC---------------- T ss_conf 5687517777299999999984998888877765669999867738001999999879999998---------------- Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 43323467858532899999989989999999963010 Q gi|254781193|r 140 ECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) . ..-|..|+.+|..||.+|.. T Consensus 167 --------~---------~HrAl~DA~~ta~lf~~l~~ 187 (189) T cd06134 167 --------E---------AHSALYDTQKTAELFCKIVN 187 (189) T ss_pred --------C---------CCCCHHHHHHHHHHHHHHHH T ss_conf --------7---------88729999999999999996 No 64 >TIGR01407 dinG_rel DnaQ family exonuclease/DinG family helicase, putative; InterPro: IPR006310 These are a family of proteins in Gram-positive bacteria. The N-terminal region of about 200 amino acids resembles the epsilon subunit of E. coli DNA polymerase III and the homologous region of the Gram-positive type DNA polymerase III alpha subunit. The epsilon subunit contains an exonuclease domain. The remainder of this protein family resembles a predicted ATP-dependent helicase, the DNA damage-inducible protein DinG of Escherichia coli. . Probab=98.99 E-value=1.8e-08 Score=73.19 Aligned_cols=109 Identities=23% Similarity=0.186 Sum_probs=73.8 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEE-CCEEEE-EE--ECCCC------------------------HHH Q ss_conf 689998141789870004400112798199999998-480399-99--78881------------------------043 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGF-DDHPVE-LW--DCVHD------------------------PVM 53 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~-~~~~v~-~~--d~~~~------------------------~~~ 53 (675) .+..+|+||++.-. +.-.+|+-||+-+ .|+++. .+ ++-+. ..+ T Consensus 1 rYaVVDLEtTg~~~---------~~~d~IIqiGiV~v~DgeIv~~f~T~VNP~~~~pp~I~eLTGisd~~l~~AP~F~qV 71 (944) T TIGR01407 1 RYAVVDLETTGTDS---------SEFDKIIQIGIVLVEDGEIVDTFATDVNPNEPIPPFIQELTGISDEQLKQAPYFSQV 71 (944) T ss_pred CEEEEEEEECCCCC---------CCCCCEEEEEEEEEECCCEEEECCCCCCCCCCCCCHHHHHCCCCHHHHCCCCCHHHH T ss_conf 96899874168886---------455766899889985581432012677889877611454228776864048888799 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCCC Q ss_conf 799999863889819997307749999986---0899884203316889998678---9996898998299811 Q gi|254781193|r 54 PQVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSSQ 121 (675) Q Consensus 54 ~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~~ 121 (675) ..++.+.|+| .++||||..||+-+|++. -|+++....++||.=+|+++-| +++|..|++.||+..+ T Consensus 72 A~~i~~~l~D--~iFVAHNV~FD~nfL~k~l~~~G~~~~~~~~iDTVELa~if~PT~esY~Ls~L~E~LGl~h~ 143 (944) T TIGR01407 72 AQEIYDLLED--GIFVAHNVHFDLNFLAKELVDEGYEPLEVPRIDTVELARIFFPTEESYKLSELSEELGLEHE 143 (944) T ss_pred HHHHHHHHCC--CEEEEECHHCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHCCCCC T ss_conf 9999987429--72776101506789999998568974343520056799874674435561489997198714 No 65 >cd06130 DNA_pol_III_epsilon_like This subfamily is composed of uncharacterized bacterial proteins with similarity to the epsilon subunit of DNA polymerase III (polIII), a multisubunit polymerase which is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. The pol III holoenzyme is a complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The polIII epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon subunit of polIII also functions as a stabilizer of the holoenzyme complex Probab=98.98 E-value=2.1e-08 Score=72.81 Aligned_cols=63 Identities=35% Similarity=0.525 Sum_probs=48.6 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCC Q ss_conf 99999863889819997307749999986---0899884203316889998678---99968989982998 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLS 119 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~ 119 (675) .++.+++. +.++||||+.||+.+|.+. +|+..+...|+|||-+++.+.+ +++|+.+++++|++ T Consensus 69 ~~l~~~l~--~~~lVaHn~~FD~~fL~~~~~~~~~~~~~~~~iDtl~l~r~~~~~~~~~~L~~l~~~~gi~ 137 (156) T cd06130 69 PEIKPFLG--GSLVVAHNASFDRSVLRAALEAYGLPPPPYQYLCTVRLARRVWPLLPNHKLNTVAEHLGIE 137 (156) T ss_pred HHHHHHCC--CCEEEEECHHHHHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHHCCCC T ss_conf 99998558--9889995778899999999998199998985881899999973888889999999985999 No 66 >PRK06063 DNA polymerase III subunit epsilon; Provisional Probab=98.96 E-value=1.6e-08 Score=73.65 Aligned_cols=66 Identities=29% Similarity=0.423 Sum_probs=51.6 Q ss_pred HHHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCC Q ss_conf 3799999863889819997307749999986---0899884203316889998678---999689899829981 Q gi|254781193|r 53 MPQVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSS 120 (675) Q Consensus 53 ~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~ 120 (675) +..++.++|. +.++|+||+.||+.||... .|+..+...+.||+-+||.+.+ .+.|..+|..+|+.. T Consensus 84 v~~~l~~~l~--g~vlVAHNa~FD~~FL~~e~~r~g~~~p~~~~lcTl~lARr~~~~l~~~kL~tla~~~Gi~~ 155 (313) T PRK06063 84 IAGELAEVLR--GRTLVAHNVAFDYSFLAAEAELAGAELPVDQVMCTVELARRLDLGLPNLRLETLAAHWGVVQ 155 (313) T ss_pred HHHHHHHHHC--CCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHCCCCC T ss_conf 9999999978--99799968565299999999973998899966768998865448999974788999809998 No 67 >PRK00448 polC DNA polymerase III PolC; Validated Probab=98.95 E-value=1.9e-08 Score=73.12 Aligned_cols=64 Identities=23% Similarity=0.243 Sum_probs=47.8 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCCC Q ss_conf 9999863889819997307749999986---0899884203316889998678---9996898998299811 Q gi|254781193|r 56 VLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSSQ 121 (675) Q Consensus 56 ~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~~ 121 (675) .+.+++ .+.+.|||||.||+.|+... +|++.....++||+.+||.|.| .++|+.+++++++... T Consensus 491 ~f~~f~--~d~vlVAHNa~FD~~Fi~~~~~~~~~~~~~~p~iDTl~lsr~l~p~~k~~~L~~l~k~~~v~~~ 560 (1436) T PRK00448 491 KFKEFC--GDSVLVAHNASFDVGFINTNYEKLGLEKIKNPVIDTLELSRFLYPEYKSHRLNTLAKKFGVELE 560 (1436) T ss_pred HHHHHH--CCCEEEEECCCCCHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHHHCCCCC T ss_conf 999982--8977998458656889999999808986679723029989875510111447889988488865 No 68 >PRK09146 DNA polymerase III subunit epsilon; Validated Probab=98.89 E-value=1.1e-07 Score=68.10 Aligned_cols=83 Identities=22% Similarity=0.149 Sum_probs=54.9 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHH----HCCCCCCCCCEEEHHHHHHHHC--------------C--CCCHHHHHHH Q ss_conf 999986388981999730774999998----6089988420331688999867--------------8--9996898998 Q gi|254781193|r 56 VLKEALENTDVTIVAHNSYFERTVLKA----CFNLDIPIHRWICTLVLARSNG--------------L--PSALKAVGEA 115 (675) Q Consensus 56 ~l~~~L~~~~~~kV~HNa~FD~~~L~~----~~Gi~i~~~~~~DTmi~A~~l~--------------~--~~sL~~la~~ 115 (675) ++.+.+ .+.+.|+||+.||+.||.+ ++|..+. ..++|||.+++.+. . +..|.++..+ T Consensus 122 ~ll~~l--~g~VLVaH~a~~D~~FL~~a~~~~~g~~l~-~P~IDTl~Le~~~~~~~~~~~~~r~~~~~~~slrL~~~r~r 198 (239) T PRK09146 122 ELLEAL--AGKVVVVHYRRIERDFLDQALRNRIGEGIE-FPVIDTMEIEARIQRKQAGGLWNRLKGKKPESIRLADSRLR 198 (239) T ss_pred HHHHHH--CCCEEEEECCHHHHHHHHHHHHHHHCCCCC-CCEEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH T ss_conf 999981--899899968399999999999996288988-74644899999998854200444313589867638999998 Q ss_pred CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 2998110043799998620211234332346785853289999998998999999996301 Q gi|254781193|r 116 LKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 116 l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) ||++.... | =|..|+..|.+|+.... T Consensus 199 YgLP~y~a--------------------------H---------~AL~DAlATAELflAQi 224 (239) T PRK09146 199 YGLPAYPP--------------------------H---------HALTDAIATAELLQAQI 224 (239) T ss_pred CCCCCCCC--------------------------C---------CCHHHHHHHHHHHHHHH T ss_conf 19898887--------------------------6---------71999999999999999 No 69 >cd05780 DNA_polB_Kod1_like_exo The 3'-5' exonuclease domain of archaeal family-B DNA polymerases with similarity to Pyrococcus kodakaraensis Kod1, including polymerases from Desulfurococcus (D. Tok Pol) and Thermococcus gorgonarius (Tgo Pol). Kod1, D. Tok Pol, and Tgo Pol are thermostable enzymes that exhibit both polymerase and 3'-5' exonuclease activities. They are family-B DNA polymerases. Their amino termini harbor a DEDDy-type DnaQ-like 3'-5' exonuclease domain that contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members of this subfamily show similarity to eukaryotic DNA polymerases involved in DNA replication. Some archaea possess multiple family-B D Probab=98.88 E-value=7.2e-08 Score=69.33 Aligned_cols=152 Identities=22% Similarity=0.241 Sum_probs=89.6 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEE-EEEEEC---------CCCHHHHHHHHHHHC--CCCCEE Q ss_conf 96899981417898700044001127981999999984803-999978---------881043799999863--889819 Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHP-VELWDC---------VHDPVMPQVLKEALE--NTDVTI 68 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~-v~~~d~---------~~~~~~~~~l~~~L~--~~~~~k 68 (675) .|.+++||||+++... +-.+...|..||++..++. +..+.- .....++..+..++. +++++ T Consensus 3 lkilsfDIE~~~~~g~------P~p~~d~Ii~Is~~~~~~~~vi~~~~~~~~~v~~~~tE~eLL~~F~~~i~~~dPDii- 75 (195) T cd05780 3 LKILSFDIEVLNHEGE------PNPEKDPIIMISFADEGGNKVITWKKFDLPFVEVVKTEKEMIKRFIEIVKEKDPDVI- 75 (195) T ss_pred CCEEEEEEEECCCCCC------CCCCCCCEEEEEEEECCCCEEEEECCCCCCCEEEECCHHHHHHHHHHHHHHCCCCEE- T ss_conf 4299999997589999------898888289999996899789981798877456738999999999999987299999- Q ss_pred EEEEH-HHHHHHHHHH---CCCCCC---------------------CCC-EEEHHHHHHH-HC-CCCCHHHHHHHCCCCC Q ss_conf 99730-7749999986---089988---------------------420-3316889998-67-8999689899829981 Q gi|254781193|r 69 VAHNS-YFERTVLKAC---FNLDIP---------------------IHR-WICTLVLARS-NG-LPSALKAVGEALKLSS 120 (675) Q Consensus 69 V~HNa-~FD~~~L~~~---~Gi~i~---------------------~~~-~~DTmi~A~~-l~-~~~sL~~la~~l~l~~ 120 (675) +|||. .||++.|.+. +|++++ .++ ..|.+-+.+- .. .+++|+.+|..+ ++. T Consensus 76 ~GyN~~~FD~pYl~~R~~~~~~~~~~gr~~~~~~~~~~g~~~~~~i~Gr~~lD~~~~~~~~~~l~sy~L~~Va~~~-Lg~ 154 (195) T cd05780 76 YTYNGDNFDFPYLKKRAEKLGIELDLGRDGSEIKIQRGGFNNASEIKGRIHVDLYPVARRTLNLTRYTLERVYEEL-FGI 154 (195) T ss_pred EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCEEEEECCEEEEEHHHHHHHHCCCCCCCHHHHHHHH-HCC T ss_conf 9579878768999999999589864578886047997785079987686885549999752562401599999998-399 Q ss_pred CHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 1004-379999862021123433234678585328999999899899999999630 Q gi|254781193|r 121 QKME-EGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRL 175 (675) Q Consensus 121 ~k~~-~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l 175 (675) .|.. ..+.+..... .....+.+.+||.+|+.+|++|++++ T Consensus 155 ~K~d~~~~~i~~~~~---------------~~~~~~~l~~Yn~~D~~L~~~L~~K~ 195 (195) T cd05780 155 EKEDVPGEEIAEAWD---------------SGENLERLFRYSMEDAKYTYEIGKEF 195 (195) T ss_pred CCCCCCHHHHHHHHH---------------CCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 978899999999997---------------59753888844599899999998529 No 70 >PRK05168 ribonuclease T; Provisional Probab=98.88 E-value=9.4e-08 Score=68.61 Aligned_cols=81 Identities=22% Similarity=0.144 Sum_probs=57.0 Q ss_pred CEEEEEEHHHHHHHHHHH---CCCCC-C--CCCEEEHHHHHHHHCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 819997307749999986---08998-8--42033168899986789996898998299811004379999862021123 Q gi|254781193|r 66 VTIVAHNSYFERTVLKAC---FNLDI-P--IHRWICTLVLARSNGLPSALKAVGEALKLSSQKMEEGKELIARFCKGETH 139 (675) Q Consensus 66 ~~kV~HNa~FD~~~L~~~---~Gi~i-~--~~~~~DTmi~A~~l~~~~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~ 139 (675) .++|+||+.||+.||... .|+.- | +..++||+-+|+++-...+|+.++++++++.... T Consensus 115 avLVaHNA~FD~~FLn~~~~r~g~~~~P~~~~~viDTl~Lar~~~~~~~L~~lc~~~gi~~~~~---------------- 178 (213) T PRK05168 115 AILVAHNAHFDLGFLNAAAERTGLKRNPFHPFSTFDTATLAGLAYGQTVLAKACQAAGIEFDNK---------------- 178 (213) T ss_pred CEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHHCCCCHHHHHHHCCCCCCCC---------------- T ss_conf 6377634187499999999984997788887763358998688748740999999869999997---------------- Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCH Q ss_conf 4332346785853289999998998999999996301040 Q gi|254781193|r 140 ECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLS 179 (675) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~ 179 (675) . ..-|..|+.+|..||-+|.+.+ T Consensus 179 --------~---------~HrAl~DA~~tA~vFl~ll~~l 201 (213) T PRK05168 179 --------E---------AHSALYDTEKTAELFCEIVNRW 201 (213) T ss_pred --------C---------CCCCHHHHHHHHHHHHHHHHHH T ss_conf --------7---------7773899999999999999999 No 71 >cd06125 DnaQ_like_exo The DnaQ-like exonuclease superfamily is a structurally conserved group of 3'-5' exonucleases, which catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. It is also called the DEDD superfamily, after the four invariant acidic residues present in the catalytic site of its members. The superfamily consists of DNA- and RNA-processing enzymes such as the proofreading domains of DNA polymerases, other DNA exonucleases, RNase D, RNase T, Oligoribonuclease and RNA exonucleases (REX). The DnaQ-like exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, which are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The conservation patterns of the three motifs may vary among different subfamilies. DnaQ-like exonucleases are classified as DEDDy or DEDDh exonucleases depending on the variation of motif Probab=98.75 E-value=1.2e-07 Score=67.89 Aligned_cols=89 Identities=24% Similarity=0.166 Sum_probs=57.1 Q ss_pred EEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEEC-CEEEEEEECCCCH------------------------HHHHHHH Q ss_conf 99981417898700044001127981999999984-8039999788810------------------------4379999 Q gi|254781193|r 4 LFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFD-DHPVELWDCVHDP------------------------VMPQVLK 58 (675) Q Consensus 4 i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~-~~~v~~~d~~~~~------------------------~~~~~l~ 58 (675) +++|+||++. +.- ...+|+.++.... .++.+...+.+.. .+..++. T Consensus 1 Vv~D~ETTGl--------~~~-~~d~Iiei~~v~~~~~~~~~~~v~p~~~i~~~~~~i~gi~~~~v~~~p~~~~~~~~~~ 71 (130) T cd06125 1 VAFDIETTGL--------NGN-ARDEIIEIALVVVHIGEKFVFDVKPDDLIDPYAFTISGITDEVVERAATEAELLIELL 71 (130) T ss_pred CEEEEECCCC--------CCC-CCCEEEEEEEEEEECCCEEEEEECCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 9898018996--------999-9985899999997189889999799998997794103759999833060899999999 Q ss_pred HHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCC--CCEEEHHHHHH Q ss_conf 9863889819997307749999986---0899884--20331688999 Q gi|254781193|r 59 EALENTDVTIVAHNSYFERTVLKAC---FNLDIPI--HRWICTLVLAR 101 (675) Q Consensus 59 ~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~--~~~~DTmi~A~ 101 (675) +++.+.+.++||||+.||+.+|.+. +|++.+. ..++||+.+|+ T Consensus 72 ~f~~~~~~ilVgHN~~FD~~fL~~~~~r~g~~~~~~~~~~iDTl~lar 119 (130) T cd06125 72 AFLRDKDDILVGHNISFDLPYLLNRAAELGIPYPILGRIVFDTLALAL 119 (130) T ss_pred HHHCCCCCEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCEEECHHHHH T ss_conf 997699999999194774999999999849999878996868609842 No 72 >cd05784 DNA_polB_II_exo The 3'-5' exonuclease domain of Escherichia coli DNA polymerase II (PolII) and similar bacterial proteins. PolII is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain has a fundamental role in the proofreading activity of polII. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. PolII is involved in a variety of cellular activities, such as the repair of DNA damaged by UV irradiation or oxidation. It plays a pivotal role in replication-restart, a process that bypasses DNA damage in a Probab=98.74 E-value=7.9e-08 Score=69.07 Aligned_cols=153 Identities=20% Similarity=0.241 Sum_probs=83.7 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHC--CCCCEEEEEEH-HHHH Q ss_conf 96899981417898700044001127981999999984803999978881043799999863--88981999730-7749 Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELWDCVHDPVMPQVLKEALE--NTDVTIVAHNS-YFER 77 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~--~~~~~kV~HNa-~FD~ 77 (675) .|.++||+||.+...+...|++-.. .-.+.+++.+-....+.+.-+.....++..+..++. +|++ ++|||. .||+ T Consensus 3 lki~s~DIE~~~~~~i~~I~~~~~~-~~~v~~~~~~~~~~~~~v~~~~sE~eLL~~F~~~i~~~dPDi-ItG~N~~~FDi 80 (193) T cd05784 3 LKVVSLDIETSMDGELYSIGLYGEG-QERVLMVGDPEDDAPDNIEWFADEKSLLLALIAWFAQYDPDI-IIGWNVINFDL 80 (193) T ss_pred CEEEEEEECCCCCCCEEEEEEECCC-CCEEEEEECCCCCCCCEEEEECCHHHHHHHHHHHHHHCCCCE-EEECCCCCCCH T ss_conf 3099999632798888899998699-998999978988899779998999999999999998629999-99779877767 Q ss_pred HHHHHH---CCCCCC-----------------------CCC-EEEHHHHHHH--HC-CCCCHHHHHHHCCCCCCHHHH-- Q ss_conf 999986---089988-----------------------420-3316889998--67-899968989982998110043-- Q gi|254781193|r 78 TVLKAC---FNLDIP-----------------------IHR-WICTLVLARS--NG-LPSALKAVGEALKLSSQKMEE-- 125 (675) Q Consensus 78 ~~L~~~---~Gi~i~-----------------------~~~-~~DTmi~A~~--l~-~~~sL~~la~~l~l~~~k~~~-- 125 (675) +.|.+. +|+++. .++ ++|.|-+-+- .. .+++|+.+|+.+ ++.+|... T Consensus 81 PYL~~R~~~~~i~~~~~r~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~~~~~~~l~sy~Ld~Va~~~-LGe~K~~~~~ 159 (193) T cd05784 81 RLLQRRAEAHGLPLRLGRGGSPLNWRQSGKPGQGFLSLPGRVVLDGIDALKTATYHFESFSLENVAQEL-LGEGKLIHDV 159 (193) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCEEEEEECCEEEECHHHHHHHCCCCHHHHHHHHHHHHH-HCCCCCCCCC T ss_conf 899999998588643365788505987685626999978878417299997607351143488899998-3877544565 Q ss_pred ---HHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf ---799998620211234332346785853289999998998999999996 Q gi|254781193|r 126 ---GKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFK 173 (675) Q Consensus 126 ---~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~ 173 (675) ..++-+.+ ..+++.+.+||.+||.++.+|++ T Consensus 160 ~~~~~~i~~l~-----------------~~d~~k~~~Yni~D~~Lv~rLf~ 193 (193) T cd05784 160 DDRGAEIERLF-----------------REDKLALARYNLQDCELVWRIFE 193 (193) T ss_pred CCCHHHHHHHH-----------------HCCCHHHEEEHHHHHHHHHHHHC T ss_conf 55588899988-----------------65903610320277999999749 No 73 >cd06127 DEDDh DEDDh exonucleases, part of the DnaQ-like (or DEDD) exonuclease superfamily, catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. These proteins contain four invariant acidic residues in three conserved sequence motifs termed ExoI, ExoII and ExoIII. DEDDy exonucleases are classified as such because of the presence of specific Hx(4)D conserved pattern at the ExoIII motif. The four conserved acidic residues are clustered around the active site and serve as ligands for the two metal ions required for catalysis. Most DEDDh exonucleases are the proofreading subunits (epsilon) or domains of bacterial DNA polymerase III, the main replicating enzyme in bacteria, which functions as the chromosomal replicase. Other members include other DNA and RNA exonucleases such as RNase T, Oligoribonuclease, and RNA exonuclease (REX), among others. Probab=98.72 E-value=5.3e-07 Score=63.73 Aligned_cols=48 Identities=31% Similarity=0.279 Sum_probs=34.7 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHC Q ss_conf 99999863889819997307749999986---089988420331688999867 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPIHRWICTLVLARSNG 104 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~ 104 (675) ..+..++.+ .++||||+.||+.+|.+. ++...+..+|+|||.+++.+. T Consensus 71 ~~~~~~l~~--~~lv~hN~~FD~~~L~~~~~~~~~~~~~~~~iDt~~~~~~~~ 121 (159) T cd06127 71 PEFLEFLGG--RVLVAHNASFDLRFLNRELRRLGGPPLPNPWIDTLRLARRLL 121 (159) T ss_pred HHHHHHCCC--CEEEECCHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHC T ss_conf 999873156--346515888999999999998399988996664799999973 No 74 >COG2176 PolC DNA polymerase III, alpha subunit (gram-positive type) [DNA replication, recombination, and repair] Probab=98.67 E-value=5.4e-08 Score=70.14 Aligned_cols=66 Identities=26% Similarity=0.316 Sum_probs=49.0 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCCC Q ss_conf 799999863889819997307749999986---0899884203316889998678---9996898998299811 Q gi|254781193|r 54 PQVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSSQ 121 (675) Q Consensus 54 ~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~~ 121 (675) ..++++++. +.+.|+|||.||+.||... +|+..-...++||+-+||.|.| +++|..+++.+++... T Consensus 492 L~kf~~~~~--d~IlVAHNasFD~gFl~~~~~k~~~~~~~~pvIDTL~lar~L~P~~ksh~Lg~l~kk~~v~le 563 (1444) T COG2176 492 LEKFREFIG--DSILVAHNASFDMGFLNTNYEKYGLEPLTNPVIDTLELARALNPEFKSHRLGTLCKKLGVELE 563 (1444) T ss_pred HHHHHHHHC--CCEEEECCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCHHHHHCCHHHHHHHHCCCHH T ss_conf 999999844--858996167556457888899838851027501199999875745432466778887486378 No 75 >cd06136 TREX1_2 Three prime repair exonuclease (TREX) 1 and 2 are closely related DEDDh-type DnaQ-like 3'-5' exonucleases. They contain three conserved sequence motifs known as ExoI, II, and III, with a specific Hx(4)D conserved pattern at ExoIII. These motifs contain four conserved acidic residues that participate in coordination of divalent metal ions required for catalysis. Both proteins play a role in the metabolism and clearance of DNA. TREX1 is the major 3'-5' exonuclease activity detected in mammalian cells. Mutations in the human TREX1 gene can cause Aicardi-Goutieres syndrome (AGS), which is characterized by perturbed innate immunity and presents itself as a severe neurological disease. TREX1 degrades ssDNA generated by aberrant replication intermediates to prevent checkpoint activation and autoimmune disease. There are distinct structural differences between TREX1 and 2 that point to different biological roles for these proteins. The main difference is the presence of about 7 Probab=98.62 E-value=1.5e-06 Score=60.87 Aligned_cols=81 Identities=20% Similarity=0.234 Sum_probs=51.6 Q ss_pred HHHHHC--CCCCEEEEEEH-HHHHHHHHHH---CCCCCCC-CCEEEHHHHHHHHCCCCCHHHHHHHC-CCCCCHHHHHHH Q ss_conf 999863--88981999730-7749999986---0899884-20331688999867899968989982-998110043799 Q gi|254781193|r 57 LKEALE--NTDVTIVAHNS-YFERTVLKAC---FNLDIPI-HRWICTLVLARSNGLPSALKAVGEAL-KLSSQKMEEGKE 128 (675) Q Consensus 57 l~~~L~--~~~~~kV~HNa-~FD~~~L~~~---~Gi~i~~-~~~~DTmi~A~~l~~~~sL~~la~~l-~l~~~k~~~~k~ 128 (675) +..++. ..++.+||||+ +||+.||... .|+.+|. ..+.||+-+++.+. .+|..+.+++ +++.. T Consensus 88 ~~~F~~~~~~~~~LVAHNa~~FD~~fL~~e~~r~g~~~p~~~~~iDtL~l~r~~~--~~L~~l~~~~~gi~~~------- 158 (177) T cd06136 88 IKLFLRRQPKPICLVAHNGNRFDFPILRSELERLGTKLPDDILCVDSLPAFRELD--QSLGSLYKRLFGQEPK------- 158 (177) T ss_pred HHHHHHHCCCCCEEEECCCHHCCHHHHHHHHHHCCCCCCCCCEEEEEHHHHHHHC--CCHHHHHHHHCCCCCC------- T ss_conf 9999962677867996485123799999999986998998988998607645506--9999999997599998------- Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9986202112343323467858532899999989989999999963 Q gi|254781193|r 129 LIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKR 174 (675) Q Consensus 129 l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~ 174 (675) .. .-|..||.+|.+++.+ T Consensus 159 -------------------~a---------HRAl~Da~at~~vflh 176 (177) T cd06136 159 -------------------NS---------HTAEGDVLALLKCALH 176 (177) T ss_pred -------------------CC---------CCCHHHHHHHHHHHCC T ss_conf -------------------77---------7848999999999757 No 76 >PRK07983 exodeoxyribonuclease X; Provisional Probab=98.59 E-value=1.1e-06 Score=61.60 Aligned_cols=86 Identities=26% Similarity=0.255 Sum_probs=54.9 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 999998638898199973077499999860899884203316889998678--999689899829981100437999986 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSNGL--PSALKAVGEALKLSSQKMEEGKELIAR 132 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k~~~~k~l~~~ 132 (675) +++.+-+.+ +...||||+.||+.+|.. + ..+|+||+-+||.+.| .+|+..+...+++..+... T Consensus 65 ~ev~~~~~~-~~~lVaHNa~FD~~~L~~-----~-~~~~IcTl~lAR~l~p~~~~~l~~Lry~~~~~~~~~~-------- 129 (219) T PRK07983 65 EDVIPHYYG-SEWYVAHNASFDRRVLPE-----M-PGEWICTMKLARRLWPGIKYSNMALYKSRKLNVQTPP-------- 129 (219) T ss_pred HHHHHHHCC-CCEEEECCCCCCHHHHCC-----C-CCCEEEHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCC-------- T ss_conf 999998628-987875365103866415-----5-8756657899998767742418999998427613324-------- Q ss_pred HCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 202112343323467858532899999989989999999963010 Q gi|254781193|r 133 FCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 133 ~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) . .+ ..=|..|+.+|..|+..+.. T Consensus 130 -------------~--~~-------AHrAl~Dv~v~~~ll~~l~~ 152 (219) T PRK07983 130 -------------G--LH-------HHRALYDCYITAALLIDIMR 152 (219) T ss_pred -------------C--CC-------CCCHHHHHHHHHHHHHHHHH T ss_conf -------------4--33-------34307879999999999998 No 77 >cd05160 DNA_polB_exo The 3'-5' exonuclease domain of family-B DNA polymerases. This domain has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The exonuclease domain of family B polymerase also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members include E. coli DNA polymerase II, some eubacterial phage DNA polymerases, nuclear replicative DNA polymerases (alpha, delta, epsilon and zeta), and eukaryotic viral and Probab=98.59 E-value=2e-06 Score=60.02 Aligned_cols=146 Identities=19% Similarity=0.203 Sum_probs=83.1 Q ss_pred EEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEEC--CEE---------------------EEEEECCCCHHHHHHHHHH Q ss_conf 99981417898700044001127981999999984--803---------------------9999788810437999998 Q gi|254781193|r 4 LFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFD--DHP---------------------VELWDCVHDPVMPQVLKEA 60 (675) Q Consensus 4 i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~--~~~---------------------v~~~d~~~~~~~~~~l~~~ 60 (675) +++||||.+... ..++ +...|.+|++... +.. +.+..+.....+...+..+ T Consensus 2 lsfDIE~~~~~g----fP~p--~~d~Ii~Is~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~E~eLL~~F~~~ 75 (200) T cd05160 2 LSFDIETLPNVG----GPEP--SRDPIICISYADSFQGHKVVFLLKVSTHGDDIEAGFIDGIEVEYFDDEKELLKRFFDI 75 (200) T ss_pred EEEEEEECCCCC----CCCC--CCCEEEEEEEEEECCCEEEEEEEECCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHH T ss_conf 799983279999----7899--9880899999993599999988643556767654677996999969999999999999 Q ss_pred HC--CCCCEEEEEEH-HHHHHHHHH---HCCCCCC--------------------CCC-EEEHHHHHHHH--CCCCCHHH Q ss_conf 63--88981999730-774999998---6089988--------------------420-33168899986--78999689 Q gi|254781193|r 61 LE--NTDVTIVAHNS-YFERTVLKA---CFNLDIP--------------------IHR-WICTLVLARSN--GLPSALKA 111 (675) Q Consensus 61 L~--~~~~~kV~HNa-~FD~~~L~~---~~Gi~i~--------------------~~~-~~DTmi~A~~l--~~~~sL~~ 111 (675) +. +|+++ +|||. .||++.|.+ .+|+++. .++ ..|.+-..+-. -++++|+. T Consensus 76 i~~~dPDii-~GyN~~~FD~pyL~~R~~~l~~~~~~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~k~~~~l~sy~Ld~ 154 (200) T cd05160 76 IREYDPDIL-TGYNIDDFDLPYLLERAKALGIKLDDIGRRSGQEKSSGETERAAVKGRVVFDLLRAVKRDFKLKSYTLDA 154 (200) T ss_pred HHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCEEEEEECCEEEEHHHHHHHHHHCCCCCCHHH T ss_conf 997099999-9726777848899999999698855546734545435750599736778602899999753246653999 Q ss_pred HHHHCCCCCCHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8998299811004-3799998620211234332346785853289999998998999999996 Q gi|254781193|r 112 VGEALKLSSQKME-EGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFK 173 (675) Q Consensus 112 la~~l~l~~~k~~-~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~ 173 (675) +|+.+ ++..|.. .++.....+ ....++.+.+||.+||.++++|++ T Consensus 155 Va~~~-Lg~~K~d~~~~~~~~~~----------------~~~d~~k~~~Y~~~D~~L~~~L~e 200 (200) T cd05160 155 VAEEL-LGEGKEKVDGEIIEDLE----------------WEEDLERLLEYNLKDAELTLQILE 200 (200) T ss_pred HHHHH-HCCCCCCCCHHHHHHHH----------------HCCCCHHHHHEEHHHHHHHHHHHC T ss_conf 99998-49997878989999998----------------738914866025999998999759 No 78 >PRK05762 DNA polymerase II; Reviewed Probab=98.57 E-value=4.2e-06 Score=57.93 Aligned_cols=156 Identities=17% Similarity=0.239 Sum_probs=80.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCC-EEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHC--CCCCEEEEEEH-HHHH Q ss_conf 689998141789870004400112798-1999999984803999978881043799999863--88981999730-7749 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDV-CVILFAFGFDDHPVELWDCVHDPVMPQVLKEALE--NTDVTIVAHNS-YFER 77 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~-~i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~--~~~~~kV~HNa-~FD~ 77 (675) +.+++|||++...++-..|+. .++. .+..++-..++..+.. +.....++..+..++. |++++ +|||. .||+ T Consensus 156 rvlSfDIEc~~~g~ii~I~l~--~~~~~~v~~~g~~~~~~~v~~--~~~E~eLL~~F~~~i~~~DPDII-~GyNi~~FDl 230 (785) T PRK05762 156 KVVSLDIETSNKGELYSIGLE--GCGQRQVYMLGEANGDAELEY--VADEKALLERFNAWFAEFDPDVI-IGWNVVQFDL 230 (785) T ss_pred EEEEEEEEECCCCCEEEEEEC--CCCCCEEEEECCCCCCCEEEE--ECCHHHHHHHHHHHHHHHCCCEE-EECCCCCCCH T ss_conf 699999897799988999951--799846899668899853899--09999999999999998599989-9627478777 Q ss_pred HHHHH---HCCCCCCC------------------------CC-EEEHHHHHHH--HC-CCCCHHHHHHHCCCCCCHHHHH Q ss_conf 99998---60899884------------------------20-3316889998--67-8999689899829981100437 Q gi|254781193|r 78 TVLKA---CFNLDIPI------------------------HR-WICTLVLARS--NG-LPSALKAVGEALKLSSQKMEEG 126 (675) Q Consensus 78 ~~L~~---~~Gi~i~~------------------------~~-~~DTmi~A~~--l~-~~~sL~~la~~l~l~~~k~~~~ 126 (675) +.|.+ .+|+.+.. ++ ++|++-+.+- .. .+++|+.+|..+ +...|.... T Consensus 231 pyL~~Ra~~~gi~l~lgR~~~~~~~~~~~~~~~~~~~~i~GRv~iD~~~~~k~~~~~l~sYsL~~Va~~~-Lge~K~~~~ 309 (785) T PRK05762 231 RLLQERAERYGIPLRLGRGGSELSWRVHGHKRGHFFASVPGRLVLDGIDALKSAFYEFKSFSLENVAQEL-LGEGKAIDN 309 (785) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCEECCCCCCCEEEEEEEEEEEEEEHHHHHHHHHCCCCCCHHHHHHHH-HCCCCCCCC T ss_conf 9999999994998546878886533315655660599984289997099998752036766299999998-544555566 Q ss_pred -HHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf -99998620211234332346785853289999998998999999996301 Q gi|254781193|r 127 -KELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 127 -k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) ...+.... .. ...+...+..||.+|+.++++|+.++. T Consensus 310 ~~~r~~eI~----------~~---~~~~r~~l~~Y~l~Da~L~~~L~~kl~ 347 (785) T PRK05762 310 PYDRMDEID----------RY---FADDKPALAKYNLKDCELVTRIFEKTE 347 (785) T ss_pred HHHHHHHHH----------HH---HHCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 666688889----------98---854689999999999999999999870 No 79 >smart00479 EXOIII exonuclease domain in DNA-polymerase alpha and epsilon chain, ribonuclease T and other exonucleases. Probab=98.56 E-value=1.9e-06 Score=60.08 Aligned_cols=89 Identities=26% Similarity=0.273 Sum_probs=59.4 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCC-CCCEEEHHHHHHHHCC--CCCHHHHHHHCCCCCCHHHHHHH Q ss_conf 99999863889819997307749999986---089988-4203316889998678--99968989982998110043799 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIP-IHRWICTLVLARSNGL--PSALKAVGEALKLSSQKMEEGKE 128 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~-~~~~~DTmi~A~~l~~--~~sL~~la~~l~l~~~k~~~~k~ 128 (675) +++..++.+. .++++||+.||..+|.+. ++...+ ...|+||+-.++...+ ..+|+.++++++++... T Consensus 72 ~~~~~~l~~~-~~v~~~~~~fD~~~L~~~~~~~~~~~~~~~~~~dt~~~~~~~~~~~~~~L~~l~~~~~i~~~~------ 144 (169) T smart00479 72 EELLEFLKGK-ILVAGNALNFDLRFLKLEHPRLGIKDPPKNPVIDTLKLARALNPGRKYSLKKLAERLGLEVIG------ 144 (169) T ss_pred HHHHHHHCCC-CCHHCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCC------ T ss_conf 9999860487-451036799999999999998189988665246299999997588545899999985999899------ Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99862021123433234678585328999999899899999999630104 Q gi|254781193|r 129 LIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPL 178 (675) Q Consensus 129 l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~ 178 (675) .. .-|..|+..|.+|+.+|.+. T Consensus 145 -------------------~~---------H~Al~Da~~t~~v~~~l~~~ 166 (169) T smart00479 145 -------------------RA---------HRALDDARATAKLFKKLVER 166 (169) T ss_pred -------------------CC---------CCHHHHHHHHHHHHHHHHHH T ss_conf -------------------99---------29799999999999999977 No 80 >PRK07247 DNA polymerase III subunit epsilon; Validated Probab=98.54 E-value=1.8e-06 Score=60.37 Aligned_cols=85 Identities=13% Similarity=0.149 Sum_probs=56.2 Q ss_pred HHHHHHHCCCCCEEEEEEH-HHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHC----CCCCHHHHHHHCCCCCCHHHHH Q ss_conf 9999986388981999730-7749999986---089988420331688999867----8999689899829981100437 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNS-YFERTVLKAC---FNLDIPIHRWICTLVLARSNG----LPSALKAVGEALKLSSQKMEEG 126 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa-~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~----~~~sL~~la~~l~l~~~k~~~~ 126 (675) .++.+++. +.++||||+ +||+.+|... ++.... ...+||...++.+. ++++|+.+++.++++.. T Consensus 76 ~~f~~Fig--d~~lVaHNa~~FD~~fL~~~~~~~~~~~~-~~~~~~~~~~r~~~~p~l~n~kL~tLa~~f~I~~~----- 147 (195) T PRK07247 76 AAFKNFVG--ELPLIGYNAQKSDLPILAENGLDLRDQYQ-VDLFDEAFDRRSSDLNGIANLKLQTVATFLGIKGR----- 147 (195) T ss_pred HHHHHHHC--CCEEEEECCCHHHHHHHHHHHHHHCCCCC-CCHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCC----- T ss_conf 99999968--99189908845149999998774157766-85699999998860588767778999997499998----- Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999862021123433234678585328999999899899999999630104 Q gi|254781193|r 127 KELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPL 178 (675) Q Consensus 127 k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~ 178 (675) . .-|..|+.+|.+||.+|... T Consensus 148 ----------------------~---------HRAl~DA~aTa~If~klLe~ 168 (195) T PRK07247 148 ----------------------G---------HNSLEDARMTARIYESFLET 168 (195) T ss_pred ----------------------C---------CCCHHHHHHHHHHHHHHHHH T ss_conf ----------------------9---------08189999999999999997 No 81 >cd05777 DNA_polB_delta_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase delta. DNA polymerase delta is a family-B DNA polymerase with a catalytic subunit that contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (alpha and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase delta is the enzyme responsible for both elongation and maturation of Okazaki fragments on the lagging strand. It is also implicated in mismatch repair (MMR) and base excision repair (BER). The catalytic subunit displays both polymerase and 3'-5' exonuclease activities. The exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues necessary for metal binding and catalysis. The exonuclease domain of family B polyme Probab=98.52 E-value=3.4e-06 Score=58.47 Aligned_cols=158 Identities=17% Similarity=0.192 Sum_probs=81.5 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEE--CCEE------------------EEEEECCCCHHHHHHHHHH Q ss_conf 9689998141789870004400112798199999998--4803------------------9999788810437999998 Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGF--DDHP------------------VELWDCVHDPVMPQVLKEA 60 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~--~~~~------------------v~~~d~~~~~~~~~~l~~~ 60 (675) +|.+++|+||+|.. |.-+-.+...|.+|+..+ .+.+ +.++.+.....+...+..+ T Consensus 7 lki~sfDIE~~s~~-----g~fP~p~~D~Ii~I~~~~~~~~~~~~~~~~v~~l~~~~~i~~~~V~~~~tE~eLL~~F~~~ 81 (230) T cd05777 7 LRILSFDIECAGRK-----GVFPEPEKDPVIQIANVVTRQGEGEPFIRNIFTLKTCAPIVGAQVFSFETEEELLLAWRDF 81 (230) T ss_pred CEEEEEEEEECCCC-----CCCCCCCCCCEEEEEEEEEECCCCCCEEEEEEEECCCCCCCCCEEEECCCHHHHHHHHHHH T ss_conf 04999999952699-----9998988883489999998279776047878982687888998799849999999999999 Q ss_pred HC--CCCCEEEEEEH-HHHHHHHHHH---CCCCC----------C-----------------------CCC-EEEHHHHH Q ss_conf 63--88981999730-7749999986---08998----------8-----------------------420-33168899 Q gi|254781193|r 61 LE--NTDVTIVAHNS-YFERTVLKAC---FNLDI----------P-----------------------IHR-WICTLVLA 100 (675) Q Consensus 61 L~--~~~~~kV~HNa-~FD~~~L~~~---~Gi~i----------~-----------------------~~~-~~DTmi~A 100 (675) +. +++++ +|+|. .||++.|... ++++. + +++ ++|.+-+- T Consensus 82 i~~~dPDii-tGyN~~~FD~pYL~~Ra~~l~~~~~~~lgr~~~~~~~~~~~~~~s~~~g~~~~~~~~i~Gr~~lD~~~~~ 160 (230) T cd05777 82 VQEVDPDII-TGYNICNFDLPYLLERAKALKLNTFPFLGRIKNIKSTIKDTTFSSKQMGTRETKEINIEGRIQFDLLQVI 160 (230) T ss_pred HHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCCHHHHCCCCCCCCEEECCCCCCCCCCCCEEEEEEECCEEHHHHHHHH T ss_conf 998099999-9636667887899999999688614541766787635741652223126400269987587599999999 Q ss_pred HH-HC-CCCCHHHHHHHCCCCCCHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 98-67-89996898998299811004-37999986202112343323467858532899999989989999999963010 Q gi|254781193|r 101 RS-NG-LPSALKAVGEALKLSSQKME-EGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 101 ~~-l~-~~~sL~~la~~l~l~~~k~~-~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) +- .. .+++|+.+|+.+ ++..|.. ..+.+.++. ...++.+..+.+||.+|+.++.+|..+|.- T Consensus 161 r~~~~l~sY~L~~Va~~~-Lg~~K~d~~~~~i~~~~--------------~~~~~~~~~~~~Y~i~D~~L~~~L~~Kl~~ 225 (230) T cd05777 161 QRDYKLRSYSLNSVSAHF-LGEQKEDVHYSIITDLQ--------------NGNPETRRRLAVYCLKDAYLPLRLLDKLMC 225 (230) T ss_pred HHHCCHHHCCHHHHHHHH-HCCCCCCCCHHHHHHHH--------------HCCHHHHHHHHEECHHHHHHHHHHHHHHHH T ss_conf 975016336499999998-38998879999999999--------------479646456006426877999999999877 Q ss_pred CH Q ss_conf 40 Q gi|254781193|r 178 LS 179 (675) Q Consensus 178 ~~ 179 (675) .+ T Consensus 226 ~~ 227 (230) T cd05777 226 LV 227 (230) T ss_pred HH T ss_conf 74 No 82 >cd05779 DNA_polB_epsilon_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase epsilon. DNA polymerase epsilon is a family-B DNA polymerase with a catalytic subunit that contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (alpha and delta are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase epsilon plays a role in elongating the leading strand during DNA replication. It is also involved in DNA repair. The catalytic subunit contains both polymerase and 3'-5' exonuclease activities. The N-terminal exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. DNA polymerase epsilon also carries a unique large C-terminal domain with an unknown function. Phylogenetic analyses indicate that it is ort Probab=98.51 E-value=3.3e-06 Score=58.60 Aligned_cols=149 Identities=20% Similarity=0.241 Sum_probs=78.5 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEE---------------------------EEEEECCCCHHHH Q ss_conf 6899981417898700044001127981999999984803---------------------------9999788810437 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHP---------------------------VELWDCVHDPVMP 54 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~---------------------------v~~~d~~~~~~~~ 54 (675) +.+|+||||++.. . + -+-.+...|..||+...+.. +.++.......++ T Consensus 3 rilsfDIE~~~~~-~---~-fP~~~~D~Ii~Is~~~~~~g~~i~~~~~~~~~i~~~~~~~~~~~~g~~~v~~~~~E~~ll 77 (204) T cd05779 3 RVLAFDIETTKLP-L---K-FPDAETDQIMMISYMIDGQGYLIVNREIVSEDIEDFEYTPKPEYEGPFKVFNEPDEKALL 77 (204) T ss_pred EEEEEEEEEECCC-C---C-CCCCCCCEEEEEEEEEECCCEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEECCCHHHHH T ss_conf 7999999950799-9---9-989888908999999916877987434013533000146785665764898369999999 Q ss_pred HHHHHHHC--CCCCEEEEEEH-HHHHHHHHH---HCCCCCCC---------------CCE-EEHHH-HHH--HHC-CCCC Q ss_conf 99999863--88981999730-774999998---60899884---------------203-31688-999--867-8999 Q gi|254781193|r 55 QVLKEALE--NTDVTIVAHNS-YFERTVLKA---CFNLDIPI---------------HRW-ICTLV-LAR--SNG-LPSA 108 (675) Q Consensus 55 ~~l~~~L~--~~~~~kV~HNa-~FD~~~L~~---~~Gi~i~~---------------~~~-~DTmi-~A~--~l~-~~~s 108 (675) ..+.+++. ++++ ++|+|. .||+++|.. .+|+.+.. ..+ +|-+- +.+ .+. .+++ T Consensus 78 ~~f~~~i~~~~pdi-i~gyN~~~FD~pyl~~Ra~~~~~~~~~~~~~~~~~~~~~~~~~~vhiDly~~v~rd~~l~~~sy~ 156 (204) T cd05779 78 QRFFEHIREVKPHI-IVTYNGDFFDWPFVEARAAIHGLSMEEEIGFRKDSEGEYKSRYIIHMDCFRWVKRDSYLPQGSQG 156 (204) T ss_pred HHHHHHHHHCCCCE-EEECCCCCCCHHHHHHHHHHHCCCHHHHCCCCCCCCCCEECCCCEEEECHHHHHHHCCCCCCCCC T ss_conf 99999998649999-98558767767999999999599825531800067772762672563007765101217556877 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 68989982998110043799998620211234332346785853289999998998999999996 Q gi|254781193|r 109 LKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFK 173 (675) Q Consensus 109 L~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~ 173 (675) |+.+++.+ ++.+|..-....|..+. ....+.+.+||.+|+.+|+.||. T Consensus 157 Ln~Vs~~~-Lg~~K~dv~~~~m~~l~----------------~~~~~~la~Y~~~Da~~Ty~l~~ 204 (204) T cd05779 157 LKAVTKAK-LGYDPVELDPEDMVPLA----------------REDPQTLASYSVSDAVATYYLYM 204 (204) T ss_pred HHHHHHHH-HCCCCCCCCHHHHHHHH----------------CCCHHHHHHCCHHHHHHHHHHHC T ss_conf 89999999-68997508999999998----------------28915555240999999999769 No 83 >cd05783 DNA_polB_B1_exo The 3'-5' exonuclease domain of Sulfolobus solfataricus DNA polymerase B1 and similar archaeal proteins. B1 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B1displays thermostable polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family-B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Family-B DNA polymerases from thermophilic archaea are unique in that they are able to recognize the presence of uracil in the template strand, leading to the stalling of DNA synthes Probab=98.31 E-value=8.3e-06 Score=56.01 Aligned_cols=147 Identities=21% Similarity=0.163 Sum_probs=76.4 Q ss_pred CCEEEEEECCCCCCCCCCCC-CHHHCCCCEEEEEEEEECCEE--EE-------------------EEECCCCHHHHHHHH Q ss_conf 96899981417898700044-001127981999999984803--99-------------------997888104379999 Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCG-ICKYAEDVCVILFAFGFDDHP--VE-------------------LWDCVHDPVMPQVLK 58 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~G-l~~y~~~~~i~~i~~a~~~~~--v~-------------------~~d~~~~~~~~~~l~ 58 (675) +|.+++||||.|+.. .| .++-.+.-.|..|++.-.++. +. ++.+.....++..+. T Consensus 5 lrvlsfDIEv~s~~~---~~FP~p~~~~~~iisI~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~~E~~LL~~F~ 81 (204) T cd05783 5 LKRIAIDIEVYTPIK---GRIPDPKTAEYPVISVALAGSDGLKRVLVLKREGVEGLEGLLPEGAEVEFFDSEKELIREAF 81 (204) T ss_pred CEEEEEEEEECCCCC---CCCCCCCCCCCEEEEEEEECCCCCEEEEEEEECCCCCCCCCCCCCCEEEEECCHHHHHHHHH T ss_conf 749999988788999---97979555886199999992799889999950871257655789989999599999999999 Q ss_pred HHHCCCCCEEEEEEH-HHHHHHHHHH---CCCCC---CC----C------CE-EEH-------HHHHHHHC---CCCCHH Q ss_conf 986388981999730-7749999986---08998---84----2------03-316-------88999867---899968 Q gi|254781193|r 59 EALENTDVTIVAHNS-YFERTVLKAC---FNLDI---PI----H------RW-ICT-------LVLARSNG---LPSALK 110 (675) Q Consensus 59 ~~L~~~~~~kV~HNa-~FD~~~L~~~---~Gi~i---~~----~------~~-~DT-------mi~A~~l~---~~~sL~ 110 (675) .++.+.. .++++|. .||++.|.+. +|+.- +. . .+ .|- ++..+..+ .+++|+ T Consensus 82 ~~i~~~p-~Ii~wN~~~FDiPYL~~R~~~Lg~~~~~~~~~~~~~~~~~~~~~~~dl~~~~~~~~~~~y~~~~k~~sY~Ld 160 (204) T cd05783 82 KIISEYP-IVLTFNGDNFDLPYLYNRALKLGIPKEEIPIYLKRDYATLKHGIHIDLYKFFSNRAIQVYAFGNKYREYTLD 160 (204) T ss_pred HHHHCCC-EEEECCCCCCCHHHHHHHHHHHCCCCEEEEEEECCEEEEEEEEEEEEEHHHCCCCHHHHHHCCCCCCCCCHH T ss_conf 9974399-799817867788999999998599820456673436999934499986463573135441202421225288 Q ss_pred HHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98998299811004379999862021123433234678585328999999899899999999 Q gi|254781193|r 111 AVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIF 172 (675) Q Consensus 111 ~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~ 172 (675) .+|+.. ++..|..... .....+++.+.+||.+||+++.+|- T Consensus 161 ~Va~~~-LGe~K~d~~~--------------------~~~~~d~~k~i~YNi~D~~Lv~~L~ 201 (204) T cd05783 161 AVAKAL-LGEGKVELEK--------------------NISELNLYELAEYNYRDAELTLELT 201 (204) T ss_pred HHHHHH-HCCCCCCCCC--------------------CCCCCCHHHHHEEEHHHHHHHHHHH T ss_conf 999998-3888600676--------------------4442586785514267767777862 No 84 >COG0847 DnaQ DNA polymerase III, epsilon subunit and related 3'-5' exonucleases [DNA replication, recombination, and repair] Probab=98.29 E-value=2.2e-05 Score=53.31 Aligned_cols=89 Identities=26% Similarity=0.310 Sum_probs=61.2 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCCCHHHHHHH Q ss_conf 99999863889819997307749999986---0899884203316889998678---99968989982998110043799 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDIPIHRWICTLVLARSNGL---PSALKAVGEALKLSSQKMEEGKE 128 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i~~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~~k~~~~k~ 128 (675) +++.+++.+. ...|+||+.||+.|+... ++..++...+.||+.+++...+ .++|+.++.+++++.... T Consensus 86 ~~~~~~i~~~-~~~Vahna~fD~~fl~~~~~~~~~~~~~~~~~~t~~~~r~~~~~~~~~~L~~l~~~~gi~~~~~----- 159 (243) T COG0847 86 PEFLDFIGGL-RLLVAHNAAFDVGFLRVESERLGIEIPGDPVLDTLALARRHFPGFDRSSLDALAERLGIDRNPF----- 159 (243) T ss_pred HHHHHHHCCC-CEEEEEEHHHCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHCCCCCCCC----- T ss_conf 9999984788-8699950530889999899970898757642007999998769976134899999629887777----- Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9986202112343323467858532899999989989999999963010 Q gi|254781193|r 129 LIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 129 l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) .. .=+..|+.++..++..+.. T Consensus 160 ---------------------~~-------H~Al~Da~~~a~~~~~~~~ 180 (243) T COG0847 160 ---------------------HP-------HRALFDALALAELFLLLQT 180 (243) T ss_pred ---------------------CC-------CCHHHHHHHHHHHHHHHHH T ss_conf ---------------------76-------6167889999999999862 No 85 >pfam10108 Exon_PolB Predicted 3'-5' exonuclease related to the exonuclease domain of PolB. This domain is found in various prokaryotic 3'-5' exonucleases and hypothetical proteins. Probab=98.26 E-value=2.3e-05 Score=53.15 Aligned_cols=132 Identities=19% Similarity=0.335 Sum_probs=83.9 Q ss_pred EEEEEEEEE--CCEEEEEEECCC----CHHHHHHHHHHHCCCCCEEEEEEH-HHHHHHHHH---HCCCCCCC-------- Q ss_conf 199999998--480399997888----104379999986388981999730-774999998---60899884-------- Q gi|254781193|r 29 CVILFAFGF--DDHPVELWDCVH----DPVMPQVLKEALENTDVTIVAHNS-YFERTVLKA---CFNLDIPI-------- 90 (675) Q Consensus 29 ~i~~i~~a~--~~~~v~~~d~~~----~~~~~~~l~~~L~~~~~~kV~HNa-~FD~~~L~~---~~Gi~i~~-------- 90 (675) +|+.||... +++...++.+.. ...+++.+...++...-.+|.||. .||+++|.. .+|+..|. T Consensus 10 riVaIs~v~~~~~~~~~v~slg~~~~~E~~ll~~F~~~i~~~~P~LVs~NG~gFDlPvL~~RAl~~gi~ap~~~~~~~~~ 89 (211) T pfam10108 10 RIVAISAVIADDDGGFKVWSLGDPESSEKELIQRFFDGIEKYTPQLVSFNGRGFDLPVLHYRALKHGVSAPRYFDTGDRD 89 (211) T ss_pred EEEEEEEEEEECCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHCCCCCHHHHHCCCCC T ss_conf 16999999996699279987889998999999999999984398288656865548999999998188850787567876 Q ss_pred CCE-----------EEHHHHHHHHC--CCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 203-----------31688999867--89996898998299811004379999862021123433234678585328999 Q gi|254781193|r 91 HRW-----------ICTLVLARSNG--LPSALKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLF 157 (675) Q Consensus 91 ~~~-----------~DTmi~A~~l~--~~~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~ 157 (675) .+| .|+|=.-...+ ...+|+.+|.-+|+|.+-...|......+ ..-.+++. T Consensus 90 ~kw~nY~~Ry~~~H~DLmd~l~~~g~~~~~~Ld~~a~l~G~PGK~~~~G~~V~~~~----------------~~G~l~~I 153 (211) T pfam10108 90 FKWNNYFNRYSTRHLDLMDLLSGYGARANAPLDEVAALLGLPGKMGVDGSQVWELY----------------QAGKIEEI 153 (211) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHHH----------------HCCCHHHH T ss_conf 65444112356544548999873288677898999998199987877645899999----------------85988899 Q ss_pred HHHHHHHHHHHHHHHHHCC Q ss_conf 9998998999999996301 Q gi|254781193|r 158 GEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 158 ~~Ya~~Dv~~t~~L~~~l~ 176 (675) ..||.-||.-|+-||-++. T Consensus 154 ~~YCe~DvlnTylv~LR~~ 172 (211) T pfam10108 154 RDYCETDVLNTYLVYLRFQ 172 (211) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 9998876899999999999 No 86 >smart00486 POLBc DNA polymerase type-B family. DNA polymerase alpha, delta, epsilon and zeta chain (eukaryota), DNA polymerases in archaea, DNA polymerase II in e. coli, mitochondrial DNA polymerases and and virus DNA polymerases Probab=98.25 E-value=0.00024 Score=46.51 Aligned_cols=155 Identities=19% Similarity=0.169 Sum_probs=79.3 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEE--------------------EEEECCCCHHHHHHHHHHH Q ss_conf 68999814178987000440011279819999999848039--------------------9997888104379999986 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPV--------------------ELWDCVHDPVMPQVLKEAL 61 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v--------------------~~~d~~~~~~~~~~l~~~L 61 (675) +.+++|+||++.... -..+......|.+++....+... .+..+.....+...+...+ T Consensus 4 ~i~s~DIEt~~~~~~---~p~~~~~~d~i~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~ll~~f~~~i 80 (471) T smart00486 4 KILSFDIETYTDGGL---FPDPLIFEDEIIQISLVINDGDKKGPEERICFTLGTCKEIDGVEVYEFNNEKELLKAFLEFI 80 (471) T ss_pred EEEEEEEEECCCCCC---CCCCCCCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCEEEECCCHHHHHHHHHHHH T ss_conf 699999896679999---99988899849999999821987878877543036767789988998099999999999999 Q ss_pred --CCCCCEEEEEEH-HHHHHHHHHH---CCCCCC--------------------------------CC-CEEEHHHHHHH Q ss_conf --388981999730-7749999986---089988--------------------------------42-03316889998 Q gi|254781193|r 62 --ENTDVTIVAHNS-YFERTVLKAC---FNLDIP--------------------------------IH-RWICTLVLARS 102 (675) Q Consensus 62 --~~~~~~kV~HNa-~FD~~~L~~~---~Gi~i~--------------------------------~~-~~~DTmi~A~~ 102 (675) .+++ +.+|||. .||+.++.+. +++... .+ ..+|++...+- T Consensus 81 ~~~dpd-ii~g~N~~~FD~~yi~~R~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~D~~~~~~~ 159 (471) T smart00486 81 KKYDPD-IIYGHNISNFDLPYIISRLEKLKIKPLSFIGRLKNIIDIKRKKPLFGSKSFGKTIKVKIKGRLVIDLYNLYKN 159 (471) T ss_pred HHCCCC-EEEECCCCCCCHHHHHHHHHHHCCCHHHHHCCCCCCCCCCEECCCCCCCCCCCEEEEEEEEEEEEEHHHHHHH T ss_conf 871999-9996588787689999999984896477637357787643203321012566314899855999788999988 Q ss_pred -H-CCCCCHHHHHHHCCCCCCHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf -6-789996898998299811004-379999862021123433234678585328999999899899999999630 Q gi|254781193|r 103 -N-GLPSALKAVGEALKLSSQKME-EGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRL 175 (675) Q Consensus 103 -l-~~~~sL~~la~~l~l~~~k~~-~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l 175 (675) . -.+++|+.++..+. ...|.. ..+.+...+. ..+.....+..||.+|+.++.+|..++ T Consensus 160 ~~~l~sy~L~~v~~~~l-~~~k~d~~~~~i~~~~~--------------~~~~~~~~~~~Y~~~D~~l~~~l~~~~ 220 (471) T smart00486 160 KLKLPSYKLDTVAEYLL-GKEKDDLPYKDIPELYN--------------LNYKLRDELLEYCIQDAVLTLKLFNKL 220 (471) T ss_pred HHCCCCCCHHHHHHHHH-CCCCCCCCHHHHHHHHH--------------CCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 60446688899999995-77888899899999985--------------393679999999999999999999986 No 87 >COG3359 Predicted exonuclease [DNA replication, recombination, and repair] Probab=98.24 E-value=3.4e-05 Score=51.99 Aligned_cols=152 Identities=15% Similarity=0.080 Sum_probs=94.1 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEE---ECCCCH---HHHHHHHHHHCCCCCEEEEEEH-HH Q ss_conf 8999814178987000440011279819999999848039999---788810---4379999986388981999730-77 Q gi|254781193|r 3 TLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELW---DCVHDP---VMPQVLKEALENTDVTIVAHNS-YF 75 (675) Q Consensus 3 ~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~---d~~~~~---~~~~~l~~~L~~~~~~kV~HNa-~F 75 (675) ..++||||+ |++. .+..|..++.+++.+++.++ .++..+ .+.+.+.... +...+|-.|. .| T Consensus 100 ~~FFDiETT--------GL~~--ag~~I~~~g~a~~~~~~~~Vrq~~lp~p~~E~avle~fl~~~--~~~~lvsfNGkaF 167 (278) T COG3359 100 VAFFDIETT--------GLDR--AGNTITLVGGARGVDDTMHVRQHFLPAPEEEVAVLENFLHDP--DFNMLVSFNGKAF 167 (278) T ss_pred EEEEEEECC--------CCCC--CCCEEEEEEEEECCCCEEEEEEECCCCCCHHHHHHHHHHCCC--CCCEEEEECCCCC T ss_conf 589963314--------6688--898599998887357569997512798413568999975389--8455997468326 Q ss_pred HHHHHHHH--CCCCCCC-CCEEEHHHHHHHHC----CCCCHHHHHHHCCCCCCHHHHHH---HHHHHHCCCCCCCCCCCC Q ss_conf 49999986--0899884-20331688999867----89996898998299811004379---999862021123433234 Q gi|254781193|r 76 ERTVLKAC--FNLDIPI-HRWICTLVLARSNG----LPSALKAVGEALKLSSQKMEEGK---ELIARFCKGETHECLYDR 145 (675) Q Consensus 76 D~~~L~~~--~Gi~i~~-~~~~DTmi~A~~l~----~~~sL~~la~~l~l~~~k~~~~k---~l~~~~~k~~~~~~~~~~ 145 (675) |+++++|. ..+++.+ ..-+|-|-.+|-+. +..+|+.+-+.+|+...-+..|. .+.+.+.+ T Consensus 168 D~PfikR~v~~~~el~l~~~H~DL~h~~RRlwk~~l~~c~Lk~VEr~LGi~R~edtdG~~~p~lyr~~~~---------- 237 (278) T COG3359 168 DIPFIKRMVRDRLELSLEFGHFDLYHPSRRLWKHLLPRCGLKTVERILGIRREEDTDGYDGPELYRLYRR---------- 237 (278) T ss_pred CCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHH---------- T ss_conf 7089999875164447666631221034453120287888136788857550115898501999999987---------- Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 678585328999999899899999999630104 Q gi|254781193|r 146 NKPNHREAWRLFGEYCKGDVTVTREIFKRLEPL 178 (675) Q Consensus 146 ~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~ 178 (675) ...+........|+..|+.-...|+..+.+. T Consensus 238 --~~dp~ll~~l~~hN~eDvlnL~~i~~h~~~~ 268 (278) T COG3359 238 --YGDPGLLDGLVLHNREDVLNLPTIIKHVSKK 268 (278) T ss_pred --CCCHHHHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf --0987777898870699887599999999999 No 88 >pfam00929 Exonuc_X-T Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III.; Probab=98.23 E-value=2e-05 Score=53.48 Aligned_cols=65 Identities=26% Similarity=0.270 Sum_probs=43.6 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHHC---CC-CCC-CCCEEEHHHHHHHHCC---CCCHHHHHHHCCCCC Q ss_conf 999998638898199973077499999860---89-988-4203316889998678---999689899829981 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKACF---NL-DIP-IHRWICTLVLARSNGL---PSALKAVGEALKLSS 120 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~---Gi-~i~-~~~~~DTmi~A~~l~~---~~sL~~la~~l~l~~ 120 (675) .++..++.+. .++|+||..||..++.+.+ +. ..+ ...++||+.+++.+.+ ..+|+.+++.++++. T Consensus 72 ~~~~~~l~~~-~~~v~~n~~fD~~~l~~~~~~~~~~~~~~~~~~~dt~~~~~~~~~~~~~~~L~~l~~~~~i~~ 144 (162) T pfam00929 72 EAFLEFLKKL-KILVGHNASFDVGFLLYDDLRFLKLPHPKLNDVIDTLILDKATYKGFKRRSLDALAEKLGLEK 144 (162) T ss_pred HHHHHHHCCC-CEEECCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCC T ss_conf 9999984258-577778877899999999998586336633330448999999750446799899998859999 No 89 >cd06138 ExoI_N This subfamily is composed of the N-terminal domain of E. coli exonuclease I (ExoI) and similar proteins. ExoI is a monomeric enzyme that hydrolyzes single stranded DNA in the 3' to 5' direction. It plays a role in DNA recombination and repair. It primarily functions in repairing frameshift mutations. The N-terminal domain of ExoI is a DEDDh-type DnaQ-like 3'-5 exonuclease containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The ExoI structure is unique among DnaQ family enzymes in that there is a large distance between the two metal ions required for catalysis and the catalytic histidine is oriented away from the active site. Probab=98.13 E-value=4.2e-05 Score=51.47 Aligned_cols=65 Identities=15% Similarity=0.101 Sum_probs=37.5 Q ss_pred HHHHHHHCCCCCEEEEEE-HHHHHHHHHHH---CCCCCCC------CCEEEHHHHHHH---HCC-------------CCC Q ss_conf 999998638898199973-07749999986---0899884------203316889998---678-------------999 Q gi|254781193|r 55 QVLKEALENTDVTIVAHN-SYFERTVLKAC---FNLDIPI------HRWICTLVLARS---NGL-------------PSA 108 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HN-a~FD~~~L~~~---~Gi~i~~------~~~~DTmi~A~~---l~~-------------~~s 108 (675) .++...+...+.++|||| +.||..+|... .|..... ...+|++=.+++ +.| +.+ T Consensus 74 ~~i~~~~~~~~~i~vg~N~~~FD~~fL~~~~~r~~~~p~~~~~~~~~~~~D~l~~~r~~~~~~p~~i~~~~~~~g~~s~k 153 (183) T cd06138 74 AKIHRLFNTPGTCIVGYNNIRFDDEFLRFAFYRNLYDPYTWEWKNGNSRWDLLDVVRAYYALRPDGIVWPKNDDGKPSFK 153 (183) T ss_pred HHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 99999861699779984535568999999999827871566752787123078899999997820144765568974112 Q ss_pred HHHHHHHCCCC Q ss_conf 68989982998 Q gi|254781193|r 109 LKAVGEALKLS 119 (675) Q Consensus 109 L~~la~~l~l~ 119 (675) |+.+++.+|++ T Consensus 154 L~~l~~~~gi~ 164 (183) T cd06138 154 LEDLAQANGIE 164 (183) T ss_pred HHHHHHHCCCC T ss_conf 99999984999 No 90 >cd06144 REX4_like This subfamily is composed of RNA exonuclease 4 (REX4 or Rex4p), XPMC2, Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20), and similar proteins. REX4 is involved in pre-rRNA processing. It controls the ratio between the two forms of 5.8S rRNA in yeast. XPMC2 is a Xenopus gene which was identified through its ability to correct a mitotic defect in fission yeast. The human homolog of XPMC2 (hPMC2) may be involved in angiotensin II-induced adrenal cell cycle progression and cell proliferation. ISG20 is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. These proteins are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required fo Probab=98.13 E-value=3.8e-05 Score=51.74 Aligned_cols=80 Identities=24% Similarity=0.264 Sum_probs=48.8 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHH----CCCCCHHHHHHH-CCCCCCHHHHHHH Q ss_conf 79999986388981999730774999998608998842033168899986----789996898998-2998110043799 Q gi|254781193|r 54 PQVLKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSN----GLPSALKAVGEA-LKLSSQKMEEGKE 128 (675) Q Consensus 54 ~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l----~~~~sL~~la~~-l~l~~~k~~~~k~ 128 (675) ..++..++.+ .++||||+.||+.+|. +..+...++||.....+. ....||+.+++. |+...+. T Consensus 68 ~~~~~~~l~~--~ilVGH~l~~Dl~~L~----l~~~~~~i~DT~~~~~~~~~~~~~~~sLk~L~~~~Lg~~Iq~------ 135 (152) T cd06144 68 QKKVAELLKG--RILVGHALKNDLKVLK----LDHPKKLIRDTSKYKPLRKTAKGKSPSLKKLAKQLLGLDIQE------ 135 (152) T ss_pred HHHHHHHHCC--CEEEECCHHHHHHHHH----CCCCCCEEEEEEEEHHHHHCCCCCCCCHHHHHHHHCCCCCCC------ T ss_conf 9999998538--8798606464699962----438876068855642245416889927999999985885899------ Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 998620211234332346785853289999998998999999996 Q gi|254781193|r 129 LIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFK 173 (675) Q Consensus 129 l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~ 173 (675) ..| =+..|+.+|++||+ T Consensus 136 -------------------~~H---------~s~eDA~at~~Lfk 152 (152) T cd06144 136 -------------------GEH---------SSVEDARAAMRLYR 152 (152) T ss_pred -------------------CCC---------CCHHHHHHHHHHHC T ss_conf -------------------796---------96999999999859 No 91 >KOG2206 consensus Probab=98.05 E-value=1e-05 Score=55.34 Aligned_cols=125 Identities=17% Similarity=0.132 Sum_probs=87.1 Q ss_pred EEEEEEEECCEEEEEEECCCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCC-CC Q ss_conf 99999998480399997888104379999986388981999730774999998608998842033168899986789-99 Q gi|254781193|r 30 VILFAFGFDDHPVELWDCVHDPVMPQVLKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSNGLP-SA 108 (675) Q Consensus 30 i~~i~~a~~~~~v~~~d~~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~~-~s 108 (675) ..++++++..++ +.+|...-.....-+.+.+.++++.||+|-+--|+.||.|-|||.+ .+.|||.-++++|+.| .| T Consensus 231 tclmqISTr~ed-~iIDt~~l~~~i~~l~e~fsdp~ivkvfhgaD~diiwlqrdfgiyv--vnLfdt~~a~r~L~~~r~s 307 (687) T KOG2206 231 TCLMQISTRTED-FIIDTFKLRDHIGILNEVFSDPGIVKVFHGADTDIIWLQRDFGIYV--VNLFDTIQASRLLGLPRPS 307 (687) T ss_pred EEEEEEECCCHH-HEEHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHCCCEEEE--EECHHHHHHHHHHCCCCCC T ss_conf 068995304203-1211478888887742210588747897347640343402304899--8511058899885799400 Q ss_pred HHHHHHH-CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 6898998-2998110043799998620211234332346785853289999998998999999996301 Q gi|254781193|r 109 LKAVGEA-LKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 109 L~~la~~-l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) |..|... .++..+|.. ++..| -..|. -+.|..||..|+...+-||..|. T Consensus 308 L~~ll~~~~~v~~nk~y---qladw---------------R~rpL-p~~Mv~yar~dthyllyiyD~lr 357 (687) T KOG2206 308 LAYLLECVCGVLTNKKY---QLADW---------------RIRPL-PEEMVRYAREDTHYLLYIYDVLR 357 (687) T ss_pred HHHHHHHHHHHHHHHHH---HHCHH---------------CCCCC-CHHHHHHHHHCCHHHHHHHHHHH T ss_conf 99999998745311466---41100---------------03658-58898887512315899999999 No 92 >PTZ00166 DNA polymerase delta catalytic subunit; Provisional Probab=98.02 E-value=0.00066 Score=43.71 Aligned_cols=112 Identities=20% Similarity=0.243 Sum_probs=65.7 Q ss_pred HHHHHHHHHH--CCCCCEEEEEEH-HHHHHHHHH---HCCCC----------------------------------CCCC Q ss_conf 4379999986--388981999730-774999998---60899----------------------------------8842 Q gi|254781193|r 52 VMPQVLKEAL--ENTDVTIVAHNS-YFERTVLKA---CFNLD----------------------------------IPIH 91 (675) Q Consensus 52 ~~~~~l~~~L--~~~~~~kV~HNa-~FD~~~L~~---~~Gi~----------------------------------i~~~ 91 (675) .++..+..++ .|++++ +|||+ .||++.|.. ++++. +. + T Consensus 358 eLL~aF~~fi~~~DPDII-TGYNI~nFDlpYLl~Ra~~L~i~~f~~lgRlk~~~~~~~~~~~~s~a~G~~~~k~i~i~-G 435 (1081) T PTZ00166 358 TLLLAWNDFIIAVDPDFL-TGYNIINFDLPYLLNRATALNLKKFKFLTRIKSVKSTVKDSIFSSNQFGTHENKEINIE-G 435 (1081) T ss_pred HHHHHHHHHHHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCEEEEEC-C T ss_conf 999999999987099899-96687888889999999995886433236115776522026566555566655068675-7 Q ss_pred C-EEEHHHHHHHH-C-CCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 0-33168899986-7-8999689899829981100437999986202112343323467858532899999989989999 Q gi|254781193|r 92 R-WICTLVLARSN-G-LPSALKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVT 168 (675) Q Consensus 92 ~-~~DTmi~A~~l-~-~~~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t 168 (675) + .+|.+...+-- . .+++|+.++..+ ++.+|..--...+..+ +...++....+..||.+|+.++ T Consensus 436 Rv~~Dl~~~v~r~~kL~SYkLntVs~~f-LgeqKeDV~~~~I~~l-------------~~~~~e~r~rla~YClkDa~L~ 501 (1081) T PTZ00166 436 RILFDVYDLIRRDYKLKSYSLNYVSFEF-LKEQKEDVHYSTINKL-------------QNGNPEDRKRIASYCLKDSILP 501 (1081) T ss_pred EEEEEHHHHHHHHCCCCCCCHHHHHHHH-HCCCCCCCCHHHHHHH-------------HCCCHHHHHHHHHHHHHHHHHH T ss_conf 8996469999875587857688899998-1887533799999999-------------8149378899999989989999 Q ss_pred HHHHHHCCCCH Q ss_conf 99996301040 Q gi|254781193|r 169 REIFKRLEPLS 179 (675) Q Consensus 169 ~~L~~~l~~~~ 179 (675) ++|..+|.-++ T Consensus 502 lrL~~KL~~l~ 512 (1081) T PTZ00166 502 LRLIDKLLLLY 512 (1081) T ss_pred HHHHHHHHHHH T ss_conf 99999998999 No 93 >PRK09182 DNA polymerase III subunit epsilon; Validated Probab=98.02 E-value=1.7e-05 Score=53.99 Aligned_cols=60 Identities=18% Similarity=0.285 Sum_probs=37.5 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHH--HC-CCCCHHHHHHHCC Q ss_conf 99998638898199973077499999860899884203316889998--67-8999689899829 Q gi|254781193|r 56 VLKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARS--NG-LPSALKAVGEALK 117 (675) Q Consensus 56 ~l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~--l~-~~~sL~~la~~l~ 117 (675) .+..++.+ ..++|+|||.||++|+.+.+.. .....|-|||----- .+ .+.+|+.|+-.+| T Consensus 113 ~V~~~~~~-adliiAHNA~FDR~F~E~~~p~-f~~k~WaCS~~~IdW~~~Gf~s~kLeyL~~~~G 175 (293) T PRK09182 113 EVDALIAP-ADLIIAHNAGFDRPFLERFSPV-FANKPWACSVSEIDWSARGFEGTKLGYLAGQAG 175 (293) T ss_pred HHHHHHCC-CCEEEECCCCCCHHHHHHHCCC-CCCCCEEEECCCCCHHHCCCCCCCHHHHHHHCC T ss_conf 99998644-9899981775467789850953-258750531637970322877415999998609 No 94 >cd06133 ERI-1_3'hExo_like This subfamily is composed of Caenorhabditis elegans ERI-1, human 3' exonuclease (3'hExo), Drosophila exonuclease snipper (snp), and similar proteins from eukaryotes and bacteria. These are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ERI-1 has been implicated in the degradation of small interfering RNAs (RNAi). 3'hExo participates in the degradation of histone mRNAs. Snp is a non-essential exonuclease that efficiently degrades structured RNA and DNA substrates as long as there is a minimum of 2 nucleotides in the 3' overhang to initiate degradation. Snp is not a functional homolog of either ERI-1 or 3'hExo. Probab=97.96 E-value=0.00038 Score=45.21 Aligned_cols=87 Identities=18% Similarity=0.138 Sum_probs=55.4 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHH----HCCCCCC--CCCEEEHHHHHHHH-C--CCCCHHHHHHHCCCCCCHHHH Q ss_conf 9999986388981999730774999998----6089988--42033168899986-7--899968989982998110043 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKA----CFNLDIP--IHRWICTLVLARSN-G--LPSALKAVGEALKLSSQKMEE 125 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~----~~Gi~i~--~~~~~DTmi~A~~l-~--~~~sL~~la~~l~l~~~k~~~ 125 (675) .++..++.+.....+.||+.||..++.+ +.+...+ ...|+||+-.++.. . .+.||+.++++++++... T Consensus 80 ~~f~~~i~~~~~~~~~~~~~fD~~~l~~~~~~~~~~~~~~~~~~~iD~~~~~~~~~~~~~~~sL~~l~~~~gi~~~~--- 156 (176) T cd06133 80 KEFLEWLGKNGKYAFVTWGDWDLKDLLQNQCKYKIINLPPFFRQWIDLKKEFAKFYGLKKRTGLSKALEYLGLEFEG--- 156 (176) T ss_pred HHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCC--- T ss_conf 99999972698579996060029999999999789988730111204999999981888898999999986999999--- Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 79999862021123433234678585328999999899899999999630 Q gi|254781193|r 126 GKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRL 175 (675) Q Consensus 126 ~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l 175 (675) .. .-|..|+..|..++.+| T Consensus 157 ----------------------~~---------H~AL~DA~~ta~v~~~l 175 (176) T cd06133 157 ----------------------RH---------HRGLDDARNIARILKRL 175 (176) T ss_pred ----------------------CC---------CCCHHHHHHHHHHHHHH T ss_conf ----------------------88---------58599999999999987 No 95 >TIGR00573 dnaq exonuclease, DNA polymerase III, epsilon subunit family; InterPro: IPR006054 All proteins in this entry for which functions are known are components of the DNA polymerase III complex (epsilon subunit). There is, however, an outgroup that includes paralogs in some gamma-proteobacteria and the N-terminal region of DinG from some low GC Gram-positive bacteria. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=97.83 E-value=0.00066 Score=43.70 Aligned_cols=90 Identities=13% Similarity=0.149 Sum_probs=60.1 Q ss_pred HHHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCC--CCCCEEEHHHHHHHHCC--C---CCHHHHHHHCCCCCCH Q ss_conf 3799999863889819997307749999986---08998--84203316889998678--9---9968989982998110 Q gi|254781193|r 53 MPQVLKEALENTDVTIVAHNSYFERTVLKAC---FNLDI--PIHRWICTLVLARSNGL--P---SALKAVGEALKLSSQK 122 (675) Q Consensus 53 ~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~---~Gi~i--~~~~~~DTmi~A~~l~~--~---~sL~~la~~l~l~~~k 122 (675) +.+++.+++. +..+|+|||.||+.||..- +|... ....+.||.-+.+++-+ | .+|+.++.+|++...+ T Consensus 78 ia~dF~~~I~--g~~Lv~HNA~FD~GFl~~e~~~LG~~~~~~~~~v~~~~~~~~~~~~~~p~~~~~Ld~L~~~~~~~~s~ 155 (228) T TIGR00573 78 IAEDFADYIK--GAVLVIHNASFDVGFLNYEFSKLGLKKEPKTNDVIDTTDTLQAARPEFPGKRNTLDALALRYEITNSH 155 (228) T ss_pred HHHHHHHHHC--CCEEEEEHHHCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCHHH T ss_conf 8899999837--94787520030577899999851450110011444499999999720588430277778780544037 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 0437999986202112343323467858532899999989989999999963010 Q gi|254781193|r 123 MEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 123 ~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) .+.--|..|+.++..||..+.. T Consensus 156 ---------------------------------R~~H~A~~DA~~l~~~~~~~~~ 177 (228) T TIGR00573 156 ---------------------------------RALHGALLDAFILAKLYLVMTG 177 (228) T ss_pred ---------------------------------HHHHHHHHHHHHHHHHHHHHHH T ss_conf ---------------------------------7773177889999999999986 No 96 >KOG1798 consensus Probab=97.82 E-value=0.00045 Score=44.79 Aligned_cols=150 Identities=18% Similarity=0.190 Sum_probs=81.5 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEEEEE--------ECCC----------C----------HHH Q ss_conf 68999814178987000440011279819999999848039999--------7888----------1----------043 Q gi|254781193|r 2 ATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPVELW--------DCVH----------D----------PVM 53 (675) Q Consensus 2 k~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v~~~--------d~~~----------~----------~~~ 53 (675) +.+|+||||+ ..+||= +.++..+|..|++-++|. ++.| |+.. | -.+ T Consensus 247 ~VlAFDIETt-KlPLKF----PDae~DqIMMISYMiDGq-GfLItNREiVs~DIedfEYTPKpE~eG~F~v~Ne~dEv~L 320 (2173) T KOG1798 247 RVLAFDIETT-KLPLKF----PDAESDQIMMISYMIDGQ-GFLITNREIVSEDIEDFEYTPKPEYEGPFCVFNEPDEVGL 320 (2173) T ss_pred EEEEEEEECC-CCCCCC----CCCCCCEEEEEEEEECCC-EEEEECHHHHCCCHHHCCCCCCCCCCCCEEEECCCCHHHH T ss_conf 5899851035-578779----986656078999996474-6899622343052122236886325254588668768899 Q ss_pred HHHHHHHHCC-CCCEEEEEEH-HHHHHHHHH---HCCCCCCCC-CE-EE-----------HHHH-HHHH------CCCCC Q ss_conf 7999998638-8981999730-774999998---608998842-03-31-----------6889-9986------78999 Q gi|254781193|r 54 PQVLKEALEN-TDVTIVAHNS-YFERTVLKA---CFNLDIPIH-RW-IC-----------TLVL-ARSN------GLPSA 108 (675) Q Consensus 54 ~~~l~~~L~~-~~~~kV~HNa-~FD~~~L~~---~~Gi~i~~~-~~-~D-----------Tmi~-A~~l------~~~~s 108 (675) ++.+.+.+.+ ...+.|-.|. .||++|+.+ ++|+++... .+ .| -|=+ .++- .-+++ T Consensus 321 l~RfFeHiq~~kP~iivTyNGDFFDWPFve~Ra~~hGi~m~eEiGF~~D~~gEyks~~c~HmDcfrWVKRDSYLPqGSqg 400 (2173) T KOG1798 321 LQRFFEHIQEVKPTIIVTYNGDFFDWPFVEARAKIHGISMNEEIGFRRDSQGEYKSPFCIHMDCFRWVKRDSYLPQGSQG 400 (2173) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHCCCCCCHHCCCEECCCCCCCCCCEEEHHHHHHHHHCCCCCCCCCC T ss_conf 99999999861896799736864466666778875077711210852065333125632112666665204657776631 Q ss_pred HHHHHH-HCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 689899-8299811004379999862021123433234678585328999999899899999999630 Q gi|254781193|r 109 LKAVGE-ALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRL 175 (675) Q Consensus 109 L~~la~-~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l 175 (675) |+++++ .||.++.-. .-++|- .+..+.-+.+..|...|+++|+-||.+. T Consensus 401 LKAVTkaKLGYdPvEv--dPEdM~----------------~~A~EkPQ~lasYSVSDAVATYyLYMkY 450 (2173) T KOG1798 401 LKAVTKAKLGYDPVEV--DPEDMV----------------RMAMEKPQTLASYSVSDAVATYYLYMKY 450 (2173) T ss_pred HHHHHHHHHCCCCCCC--CHHHHH----------------HHHHHCCHHHHHCCHHHHHHHHHHHHHH T ss_conf 2688887507894437--988854----------------6665082565300057778888898987 No 97 >COG0417 PolB DNA polymerase elongation subunit (family B) [DNA replication, recombination, and repair] Probab=97.70 E-value=0.002 Score=40.58 Aligned_cols=178 Identities=18% Similarity=0.140 Sum_probs=90.6 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEEE-------------EEEECCCCHHHHHHHHHHH--CCCC Q ss_conf 968999814178987000440011279819999999848039-------------9997888104379999986--3889 Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHPV-------------ELWDCVHDPVMPQVLKEAL--ENTD 65 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~v-------------~~~d~~~~~~~~~~l~~~L--~~~~ 65 (675) ++.+++|+||++.... ..-.....+..++++...+.. .+..+.....+.+.+..++ .|++ T Consensus 154 l~~la~DiE~~~~~~~-----~~~~~~d~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~~~~e~e~l~~~~~~i~~~dPd 228 (792) T COG0417 154 LRVLAFDIETLSEPGK-----FPDGEKDPIIMISYAIEAEGGLIEVFIYTSGEGFSVEVVISEAELLERFVELIREYDPD 228 (792) T ss_pred CCEEEEEEEECCCCCC-----CCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCHHHHHHHHHHHHHHCCCC T ss_conf 7547995031057778-----89877786269999704677764443223567763487258799999999998502989 Q ss_pred CEEEEEEH-HHHHHHHHHH---CCCCCCC------------------CCE-EEHHHHHH--HH-CCCCCHHHHHHHCCCC Q ss_conf 81999730-7749999986---0899884------------------203-31688999--86-7899968989982998 Q gi|254781193|r 66 VTIVAHNS-YFERTVLKAC---FNLDIPI------------------HRW-ICTLVLAR--SN-GLPSALKAVGEALKLS 119 (675) Q Consensus 66 ~~kV~HNa-~FD~~~L~~~---~Gi~i~~------------------~~~-~DTmi~A~--~l-~~~~sL~~la~~l~l~ 119 (675) ++ ++||. .||++.|.+. +|++... ++. .|..-..+ .. ..+++|+.++..+... T Consensus 229 VI-vgyn~~~fd~pyl~~Ra~~lgi~~~~gr~~~~~~~~~~~~~~~~Gr~~iDl~~~~~~~~~~~~~ysl~~v~~~~l~~ 307 (792) T COG0417 229 VI-VGYNGDNFDWPYLAERAERLGIPLRLGRDGSELRVRKSGFSSQVGRLHIDLYPALRRRPLNLKSYSLEAVSEALLGE 307 (792) T ss_pred EE-EEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCEEEEEHHHHHHHCCCCCCCEEHHHHHHHHCCC T ss_conf 99-98367777738999999981998511356653200212544557716741066542032355632434677664133 Q ss_pred CCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCHHHHHHHHHHCCCCC Q ss_conf 11004379999862021123433234678585328999999899899999999630104000151336666531156213 Q gi|254781193|r 120 SQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLSSKEQELWYLDQTINDRGYCV 199 (675) Q Consensus 120 ~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~~~E~~~~~~~~~me~~Gi~v 199 (675) ..+..... ......+...+..+.....|+..|++++.++..+..-+. .+.......|+++ T Consensus 308 ~k~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~l~-------~~~e~s~vsg~~~ 367 (792) T COG0417 308 GKREDIPY-------------DSMEEIWPDWADSKLRLLLYNLSDADLVLRILLKNELLP-------LLIELSRVSGLPL 367 (792) T ss_pred CCHHCCCC-------------CHHHHCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHH-------HHHHHHHHHCCCH T ss_conf 52200562-------------013330423551076888834107889999986666789-------9999999849879 Q ss_pred CHHHH Q ss_conf 79999 Q gi|254781193|r 200 DVALA 204 (675) Q Consensus 200 D~e~l 204 (675) |.-.. T Consensus 368 ~~~~~ 372 (792) T COG0417 368 DDVTR 372 (792) T ss_pred HHHHH T ss_conf 99861 No 98 >pfam00940 RNA_pol DNA-dependent RNA polymerase. This is a family of single chain RNA polymerases. Probab=97.44 E-value=0.0094 Score=36.23 Aligned_cols=179 Identities=11% Similarity=0.042 Sum_probs=100.4 Q ss_pred HHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 01110165202100288888899879870888311345565554555567888899998876431000102207888999 Q gi|254781193|r 427 DQRAIGKVMELALGYQGGAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERV 506 (675) Q Consensus 427 ~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~ 506 (675) ..|+..|..+--+.||....++.++....=.+..... ..........-.++.+ T Consensus 154 i~Rk~vKr~vMT~~Yg~t~~g~~~qi~~~l~~~~~~~---------------------------~~~~~~~~~~yla~~i 206 (391) T pfam00940 154 ITRKVVKQTVMTLVYGVTFYGAREQIKDQLKEKGTDE---------------------------SKVLSYKAANYLAKII 206 (391) T ss_pred CCHHHHHCCEEEEECCCCCCHHHHHHHHHHHHCCCCC---------------------------CHHHHHHHHHHHHHHH T ss_conf 6686640321664226662009999999998436873---------------------------1456789999999999 Q ss_pred HHHHHHHCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECC-CCCCCCCCCCCCCHHHHCCHHH Q ss_conf 999998788789999999999999975491796215421133222344564023247-6631016786621021010023 Q gi|254781193|r 507 KKAWRQKHSRVSELWNELHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLP-SGRSLVYRDVGNNRSYLNTTTR 585 (675) Q Consensus 507 ~~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p-~GR~~~~~~~~~~~~~~~~~~~ 585 (675) .+...+.+|+..+...=++..++...+.+.++.+..+-+.+..-..+......|++. .|+.+....... ...++ T Consensus 207 ~~si~~v~~~a~~~m~wL~~~a~~~~~~~~pv~W~TP~Gf~V~Q~Y~k~~~~~V~t~l~~~~~~~~~~~~-----~~~d~ 281 (391) T pfam00940 207 FEAIKEVFPGAREIMDWLNECAKLIAKLNIPVVWTTPLGLPVVQPYRKSKKKRVKTSLQGVSRIKVITLT-----PVSDK 281 (391) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCEEEEEECCCCCEEEEEEECCEEEECCCCCC-----CCCCH T ss_conf 9999998574999999999999999828996599999999898641688757999861566886135666-----65675 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCEEEEECCCCHHHHHH Q ss_conf 456777531456554536889999999999999679818899525899977924899999 Q gi|254781193|r 586 QLTREYTYGGKLTENIVQAVSRDILAEGMMNATKAGYDIVLTVHDEIVCETPDTDEFNAS 645 (675) Q Consensus 586 ~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~~~g~~ivl~VHDEiv~evp~~~~~~~~ 645 (675) +..+.. -+-|+ |-+.-|..+......+.+.|.. ...|||...... ...+... T Consensus 282 ~Kq~~a----~~PNf-IHSlDAshl~~t~~~~~~~gi~-f~~IHDsf~tha--~d~~~l~ 333 (391) T pfam00940 282 RKQKSA----FPPNF-IHSLDASHLMLTALACIKAGIP-FASVHDSFWTHA--NDVDKMN 333 (391) T ss_pred HHHHCC----CCCCH-HHHHHHHHHHHHHHHHHHCCCC-EEEEEECCCCCC--CCHHHHH T ss_conf 654236----79741-3334489999999999856998-788710788772--1499999 No 99 >cd06149 ISG20 Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20) is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. It was also independently identified by its response to estrogen and was called HEM45 (human estrogen regulated transcript). ISG20 is a DEDDh-type DnaQ-like 3'-5' exonuclease containing three conserved sequence motifs termed ExoI, ExoII and ExoIII with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ISG20 may be a major effector of innate immunity against pathogens including viruses, bacteria, and parasites. It is located in promyelocytic leukemia (PML) nuclear bodies, sites for oncogenic DNA viral transcription and replication. It may carry out its function by degrading viral RNAs as part of the IFN-regulated antiviral r Probab=97.42 E-value=0.0029 Score=39.50 Aligned_cols=57 Identities=26% Similarity=0.294 Sum_probs=35.6 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHH---HHHHHCCC----CCHHHHHHHC Q ss_conf 799999863889819997307749999986089988420331688---99986789----9968989982 Q gi|254781193|r 54 PQVLKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLV---LARSNGLP----SALKAVGEAL 116 (675) Q Consensus 54 ~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi---~A~~l~~~----~sL~~la~~l 116 (675) ..++..++.+ .++||||+.+|+.+|. +.-|...++||-. ..+..+.| .||+.+++.+ T Consensus 68 ~~~l~~~l~~--~IlVGH~L~nDL~aL~----l~h~~~~i~DT~~~~~~~~~~~~p~~~~~sLk~L~~~~ 131 (157) T cd06149 68 QKEILKILKG--KVVVGHAIHNDFKALK----YFHPKHMTRDTSTIPLLNRKAGFPENCRVSLKVLAKRL 131 (157) T ss_pred HHHHHHHHCC--CEEEEECCHHHHHHHH----CCCCCCCEEECCCCEEHHHHCCCCCCCCHHHHHHHHHH T ss_conf 9999997169--8899826076698860----33898726777111001332489876687599999998 No 100 >cd05778 DNA_polB_zeta_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase zeta. DNA polymerase zeta is a family-B DNA polymerase which is distantly related to DNA polymerase delta. It plays a major role in translesion replication and the production of either spontaneous or induced mutations. In addition, DNA polymerase zeta also appears to be involved in somatic hypermutability in B lymphocytes, an important element for the production of high affinity antibodies in response to an antigen. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The DnaQ-like 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. Probab=97.34 E-value=0.0084 Score=36.54 Aligned_cols=152 Identities=16% Similarity=0.095 Sum_probs=77.7 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCEE---------E--EEEEC---------------------- Q ss_conf 96899981417898700044001127981999999984803---------9--99978---------------------- Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDDHP---------V--ELWDC---------------------- 47 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~~~---------v--~~~d~---------------------- 47 (675) |+.+++|+|..+..+ +-+-.+...|.+|+|++.+.. . ..+.. T Consensus 4 LtiLSfDIe~~sr~~-----~fPdP~~D~I~~I~~~~~~~~~~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~~v~~~ 78 (231) T cd05778 4 LTILSLEVHVNTRGD-----LLPDPEFDPISAIFYCIDDDVSPFILDANKVGVIIVDELKSNASNGRIRSGLSGIPVEVV 78 (231) T ss_pred EEEEEEEEEECCCCC-----CCCCCCCCCEEEEEEEECCCCCCCCCCCCCEEEEEECCCCCCHHHCCCCCCCCCCEEEEE T ss_conf 289999999888899-----996989785789999962687654456652007995586541121022257899649997 Q ss_pred CCCHHHHHHHHHHH--CCCCCEEEEEEH-HHHHHHHHH---HCCCCC-------------------------------C- Q ss_conf 88104379999986--388981999730-774999998---608998-------------------------------8- Q gi|254781193|r 48 VHDPVMPQVLKEAL--ENTDVTIVAHNS-YFERTVLKA---CFNLDI-------------------------------P- 89 (675) Q Consensus 48 ~~~~~~~~~l~~~L--~~~~~~kV~HNa-~FD~~~L~~---~~Gi~i-------------------------------~- 89 (675) .....+...+..++ .|++++ +|+|. .||++.|.. ++++.. . T Consensus 79 ~~E~eLl~~f~~~v~~~DPDii-~Gyni~~fd~~Yl~~Ra~~l~~~~~~~~lsR~~~~~~~~~~~~~~~~g~~~~~~i~i 157 (231) T cd05778 79 ESELELFEELIDLVRRFDPDIL-SGYEIQRSSWGYLIERAAALGIDDLLDEISRVPSDSNGKFGDRDDEWGYTHTSGIKI 157 (231) T ss_pred CCHHHHHHHHHHHHHHHCCCEE-EECCCCCCCCHHHHHHHHHHCCCHHHHHCCCCCCCCCCEECCCCCCCCCCCCCEEEE T ss_conf 9999999999999997099989-622556686189999999858620466607787665421023432355644415763 Q ss_pred CCC-EEEHHHHHH-HHC-CCCCHHHHHHHCCCCCCHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 420-331688999-867-89996898998299811004-37999986202112343323467858532899999989989 Q gi|254781193|r 90 IHR-WICTLVLAR-SNG-LPSALKAVGEALKLSSQKME-EGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDV 165 (675) Q Consensus 90 ~~~-~~DTmi~A~-~l~-~~~sL~~la~~l~l~~~k~~-~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv 165 (675) +++ ++|.+-+-+ -+. .+++|+.++..+ +...|.. .-+.+..++.. ..+........||..|+ T Consensus 158 ~GRi~lDl~~~~r~e~kL~sYsLe~V~~~~-L~e~k~~~~~~~l~~~~~~-------------g~~~~r~~v~~Yclkd~ 223 (231) T cd05778 158 VGRHILNVWRLMRSELALTNYTLENVVYHV-LHQRIPLYSNKTLTEWYKS-------------GSASERWRVLEYYLKRV 223 (231) T ss_pred CCEEEEEHHHHHHHHCCCCCCCHHHHHHHH-HCCCCCCCCHHHHHHHHHC-------------CCHHHHHHHHEEEHHHH T ss_conf 418998519999753584438799999999-6798888999999999974-------------99878516440007530 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254781193|r 166 TVTREIF 172 (675) Q Consensus 166 ~~t~~L~ 172 (675) .+.++|- T Consensus 224 ~l~l~Ll 230 (231) T cd05778 224 RLNLEIL 230 (231) T ss_pred HHHHHHH T ss_conf 5158652 No 101 >PRK05601 DNA polymerase III subunit epsilon; Validated Probab=97.02 E-value=0.0026 Score=39.86 Aligned_cols=66 Identities=18% Similarity=0.194 Sum_probs=42.6 Q ss_pred HHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCC-----------------------------CCC-CCCCEEEHHHHHHH Q ss_conf 379999986388981999730774999998608-----------------------------998-84203316889998 Q gi|254781193|r 53 MPQVLKEALENTDVTIVAHNSYFERTVLKACFN-----------------------------LDI-PIHRWICTLVLARS 102 (675) Q Consensus 53 ~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~G-----------------------------i~i-~~~~~~DTmi~A~~ 102 (675) +..+|.+.|. +.+.|+||+.||+.||..-+. ..+ .+..+.||+.+|+- T Consensus 115 ia~~l~~lLd--GRtLVaHNa~fdw~Fia~Ea~RA~~~a~~~~R~~~r~~~~~~r~rrrqr~gh~P~P~~iv~TLAlaRR 192 (377) T PRK05601 115 ILKSLDRLID--GRTLLLHNAPRTWGFIVSEAKRAMNDAARANRNRNRGNRRGGRGRRRQRVGHIPKPLVIVDTLASARR 192 (377) T ss_pred HHHHHHHHHC--CCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCCCCCCCCCEEEHHHHHHH T ss_conf 9999999845--86799835602088999999986554234432100134444443101003677887521007777764 Q ss_pred HCCC---CCHHHHHHHCCCCC Q ss_conf 6789---99689899829981 Q gi|254781193|r 103 NGLP---SALKAVGEALKLSS 120 (675) Q Consensus 103 l~~~---~sL~~la~~l~l~~ 120 (675) ++.+ .-|..++..++++- T Consensus 193 ~~l~~~d~RL~~vA~~~g~~a 213 (377) T PRK05601 193 QAIALDDVRIRGVAHTLGLDA 213 (377) T ss_pred CCCCCCCHHHHHHHHHCCCCC T ss_conf 189987435566676608886 No 102 >TIGR03491 RecB family nuclease, putative, TM0106 family. Members of this uncharacterized protein family are found broadly but sporadically among bacteria. The N-terminal region is homologous to the Cas4 protein of CRISPR systems, although this protein family shows no signs of association with CRISPR repeats. Probab=96.84 E-value=0.036 Score=32.43 Aligned_cols=14 Identities=14% Similarity=0.302 Sum_probs=7.4 Q ss_pred HHHHHHHHHCCCCC Q ss_conf 88998798708883 Q gi|254781193|r 446 KAFQKMASTTGLNL 459 (675) Q Consensus 446 ~~l~~~~~~~G~~~ 459 (675) .+|..++..+|... T Consensus 356 ~al~rLa~~~~~~e 369 (457) T TIGR03491 356 DSLRRLAKRYGTPE 369 (457) T ss_pred HHHHHHHHHCCCCH T ss_conf 99999998719986 No 103 >cd05776 DNA_polB_alpha_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase alpha. DNA polymerase alpha is a family-B DNA polymerase with a catalytic subunit that contains a DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (delta and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase alpha is almost exclusively required for the initiation of DNA replication and the priming of Okazaki fragments during elongation. It associates with DNA primase and is the only enzyme able to start DNA synthesis de novo. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. This explains why in most organisms, that no specific repair r Probab=96.82 E-value=0.034 Score=32.62 Aligned_cols=107 Identities=18% Similarity=0.222 Sum_probs=62.8 Q ss_pred HHHHHHHHH--CCCCCEEEEEEH-HHHHHHHHH---HCCCC--------------------------CCCCC-EEEHHHH Q ss_conf 379999986--388981999730-774999998---60899--------------------------88420-3316889 Q gi|254781193|r 53 MPQVLKEAL--ENTDVTIVAHNS-YFERTVLKA---CFNLD--------------------------IPIHR-WICTLVL 99 (675) Q Consensus 53 ~~~~l~~~L--~~~~~~kV~HNa-~FD~~~L~~---~~Gi~--------------------------i~~~~-~~DTmi~ 99 (675) +...+...+ .|++++ +|||+ .||+.+|.. .+++. +-.++ +.|+... T Consensus 85 LL~~f~~~i~~~DPDii-~Ghni~~fdl~~L~~R~~~lki~~wsriGRlkr~~~p~~~~~~~~~~~~~~~GRl~~D~~~~ 163 (234) T cd05776 85 LLNFFLAKLQKIDPDVL-VGHDLEGFDLDVLLSRIQELKVPHWSRIGRLKRSVWPKKKGGGKFGERELTAGRLLCDTYLS 163 (234) T ss_pred HHHHHHHHHHHCCCCEE-EEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCEEEEECHHH T ss_conf 99999999985099899-85063777599999999980998533135500143764456565344203523588760999 Q ss_pred HHHHC--CCCCHHHHHHHCCCCCCHH-HHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99867--8999689899829981100-43799998620211234332346785853289999998998999999996301 Q gi|254781193|r 100 ARSNG--LPSALKAVGEALKLSSQKM-EEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLE 176 (675) Q Consensus 100 A~~l~--~~~sL~~la~~l~l~~~k~-~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~ 176 (675) ++-+- .+++|..++..+ +..+|. ...++....+ ..++....+..||..|+.++++|..+|. T Consensus 164 ~ke~~k~~sy~L~~v~~~~-L~~~k~dv~~~~i~~~~---------------~~~~~l~~l~~~~~~Da~l~~~L~~~L~ 227 (234) T cd05776 164 AKELIRCKSYDLTELSQQV-LGIERQDIDPEEILNMY---------------NDSESLLKLLEHTEKDAYLILQLMFKLN 227 (234) T ss_pred HHHHHCCCCCCHHHHHHHH-HCCCCCCCCHHHHHHHH---------------HCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9987466778889999999-68676679999999998---------------6939999999999999999999999958 No 104 >PRK05761 DNA polymerase I; Reviewed Probab=96.77 E-value=0.0072 Score=36.96 Aligned_cols=102 Identities=21% Similarity=0.166 Sum_probs=55.2 Q ss_pred CHHHHHHHHHHHCCCCCEEEEEEH-HHHHHHHHH---HCCCCCC---C---------CCEEEH------H-HHHH----- Q ss_conf 104379999986388981999730-774999998---6089988---4---------203316------8-8999----- Q gi|254781193|r 50 DPVMPQVLKEALENTDVTIVAHNS-YFERTVLKA---CFNLDIP---I---------HRWICT------L-VLAR----- 101 (675) Q Consensus 50 ~~~~~~~l~~~L~~~~~~kV~HNa-~FD~~~L~~---~~Gi~i~---~---------~~~~DT------m-i~A~----- 101 (675) ...++..+..++.+-+.-++ +|. .||++.|.+ .+|+... . ....|. . +-.+ T Consensus 198 E~elL~~f~~~i~~~DPDIi-yN~d~FD~PYL~~Ra~~lgi~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~y~~~~~ 276 (777) T PRK05761 198 EKELLAELFKIIEDYPPVIT-FNGDNFDLPYLYNRALKLGIPKEELPFNVKYASLKTGIHIDLYKFFQNRAIRSYAFYGK 276 (777) T ss_pred HHHHHHHHHHHHHHCCCCEE-ECCCCCCHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCEEEECCCCCCHHHHHHEEECCE T ss_conf 99999999999974698299-80787648999999998389900066515665402444650445312113553010547 Q ss_pred HHCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 867899968989982998110043799998620211234332346785853289999998998999999996 Q gi|254781193|r 102 SNGLPSALKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFK 173 (675) Q Consensus 102 ~l~~~~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~ 173 (675) +--.+++|+.+++.+ ++..|... ........++.+..||.+|+.+|++|.. T Consensus 277 ~~~~~y~Ld~Va~~l-Lg~~K~~~--------------------~~~~~~~~~~~l~~Y~l~Da~l~~~L~~ 327 (777) T PRK05761 277 YRHREAALDAVARAL-LGESKVEL--------------------EKNISELNYWELAEYNYRDAEITLKLTF 327 (777) T ss_pred ECCCCEEHHHHHHHH-CCCCCCCC--------------------CCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 424745547889997-59974103--------------------1246633478899998999999999875 No 105 >PHA00452 T3/T7-like RNA polymerase Probab=96.30 E-value=0.081 Score=30.19 Aligned_cols=230 Identities=14% Similarity=0.085 Sum_probs=134.8 Q ss_pred CEEEEEECCCCCHHHHHHHCCCCCHHHHHH-----HCCCCHHHHHHHHCCCCHHHH-----------------CCHHHHH Q ss_conf 789998345372333323114620113454-----113310101476448795670-----------------6201110 Q gi|254781193|r 374 KKLVVADLAGIEARVLAWMAGEQWKLKAFE-----TGEDLYVTTYARSFNVAPAQV-----------------TKDQRAI 431 (675) Q Consensus 374 ~~lv~aDySqiElRvlA~ls~d~~l~~af~-----~g~D~h~~tA~~~~g~~~~~~-----------------~~~~R~~ 431 (675) +.-|..|-|-==+--.|.|.+|+..-.+-+ ...|+|+..|..+-..-.++. .+..|+. T Consensus 504 ~lpv~~DgsCnGlQH~aal~rD~~ga~~VNL~p~d~p~DvY~~Va~~v~~~~~~d~~~~~~~~~~~~~~~~l~~~i~Rk~ 583 (816) T PHA00452 504 GLPVAMDGTCSGLQHFSAMLRDEVGGAAVNLLPSDKPQDIYKIVAEKVNERLERDAINGTDNETRGLAGQWLAYGITRSV 583 (816) T ss_pred CCCCCCCCCCCHHHHHHHHHCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCHHH T ss_conf 77411078872999999972586321452367888865189999999999999887438763268999998861666766 Q ss_pred CCCCEEEEECCCCHHHHHHHHHH----CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 16520210028888889987987----08883113455655545555678888999988764310001022078889999 Q gi|254781193|r 432 GKVMELALGYQGGAKAFQKMAST----TGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVK 507 (675) Q Consensus 432 aK~~~~g~~YG~G~~~l~~~~~~----~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~ 507 (675) .|..+--..||....+...+-.. .|++.+.... .....+.-.++.+. T Consensus 584 vK~~vMT~~Yg~t~~g~~~qi~~~l~~~~~~~~~~~~-----------------------------~~~~~~~yla~~i~ 634 (816) T PHA00452 584 TKRPVMTLPYGSTRFGFRDQVREDLVQPAIDEGKGAM-----------------------------FTNQAAGYMAKLIW 634 (816) T ss_pred HCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCCCHH-----------------------------HHHHHHHHHHHHHH T ss_conf 3253167520661000899999999730556674303-----------------------------47899999999999 Q ss_pred HHHHHHCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCEEECC-CCCC-CCCCCCCCCHHHHCCHHH Q ss_conf 99998788789999999999999975491796215421133222344564023247-6631-016786621021010023 Q gi|254781193|r 508 KAWRQKHSRVSELWNELHQAFEQTIENGKAIIARKRRDVPLVYMKKDKNNLILMLP-SGRS-LVYRDVGNNRSYLNTTTR 585 (675) Q Consensus 508 ~~~r~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p-~GR~-~~~~~~~~~~~~~~~~~~ 585 (675) +.-.+.+|+..+...=++..++...+.+..+.+..+-+.+.+-..+....-.+.+. .|+. +..+.... ..+. T Consensus 635 ~ai~~~~~~a~~im~wl~~~a~~~~~~~~~v~W~TP~G~pV~Q~Y~~~~~~~v~t~l~g~~~~~~~~~~~------~~~~ 708 (816) T PHA00452 635 DAISETVPAAVEAMNWLQQAAKLLAKEVKDLHWTTPTGFPVWQEYRKTETKRVRLRLMGSARIQLTVNTD------EIDA 708 (816) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCEEEEECCCCCCEEEEEEECCEEEEEECCCCC------CCCH T ss_conf 9999997709999999999999997169972898999998987737888468998741527998337889------8897 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCEEEEECCCCHHHHHH Q ss_conf 456777531456554536889999999999999679818899525899977924899999 Q gi|254781193|r 586 QLTREYTYGGKLTENIVQAVSRDILAEGMMNATKAGYDIVLTVHDEIVCETPDTDEFNAS 645 (675) Q Consensus 586 ~~~r~~~~~~k~~n~~iQgtaadi~k~ami~l~~~g~~ivl~VHDEiv~evp~~~~~~~~ 645 (675) +..+.. +.=|-|-+.-|.-|.+..+.+.+.|......|||+ |=++-.+.+... T Consensus 709 ~kq~~a-----~~PNfIHSlDashm~~t~~~~~~~gi~~fa~VHDs--f~tha~dv~~m~ 761 (816) T PHA00452 709 RKHASG-----IAPNFVHSLDASHLRLTVVACAEKGIESFAVIHDS--FGTHAADADNLF 761 (816) T ss_pred HHHHCC-----CCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECC--CCCCCCCHHHHH T ss_conf 787243-----79488888999999999999998399627886327--887715899999 No 106 >PRK06722 exonuclease; Provisional Probab=96.13 E-value=0.011 Score=35.74 Aligned_cols=51 Identities=10% Similarity=0.062 Sum_probs=35.8 Q ss_pred HHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHH---HCCCCCCCCCEEEHHHHHHHH Q ss_conf 04379999986388981999730774999998---608998842033168899986 Q gi|254781193|r 51 PVMPQVLKEALENTDVTIVAHNSYFERTVLKA---CFNLDIPIHRWICTLVLARSN 103 (675) Q Consensus 51 ~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~---~~Gi~i~~~~~~DTmi~A~~l 103 (675) +.+...+.+++.+. ..|+||++||+.||.+ .+|++-|.....||.-+++.+ T Consensus 39 eeVLp~FleFIGd~--~LVAHNAsFD~gFL~~~C~~hg~e~P~~~~~~~~dl~k~v 92 (242) T PRK06722 39 PQIIEKFIQFIGED--SIFVTWGKEDYRFLSHDCTLHGVECPCMEKERRIDLQKFV 92 (242) T ss_pred HHHHHHHHHHHCCC--EEEEECCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHH T ss_conf 99999999982788--3897344022779997752428999998421088899998 No 107 >cd06137 DEDDh_RNase This group is composed of eukaryotic exoribonucleases that include PAN2, RNA exonuclease 1 (REX1 or Rex1p), REX3 (Rex3p), REX4 (or Rex4p), ISG20, and similar proteins. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. REX proteins are required for the processing and maturation of many RNA species, and ISG20 is an interferon-induced antiviral exonuclease with a strong preference for single-stranded RNA. Probab=95.56 E-value=0.021 Score=33.93 Aligned_cols=48 Identities=19% Similarity=0.212 Sum_probs=33.0 Q ss_pred CCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHH-CC-----CCCHHHHHHHC Q ss_conf 88981999730774999998608998842033168899986-78-----99968989982 Q gi|254781193|r 63 NTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSN-GL-----PSALKAVGEAL 116 (675) Q Consensus 63 ~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l-~~-----~~sL~~la~~l 116 (675) +++.++|||++.+|+.+|.. . -.+++||.++.... .. +.||+.+++.+ T Consensus 82 ~~~tIlVGH~L~nDL~aL~~----~--H~~viDTa~l~~~~~~~~~~~~~~sLk~La~~~ 135 (161) T cd06137 82 DPDTILVGHSLQNDLDALRM----I--HTRVVDTAILTREAVKGPLAKRQWSLRTLCRDF 135 (161) T ss_pred CCCCEEEECCHHHHHHHHCC----C--CCCEEECEEECCHHCCCCCCCCCHHHHHHHHHH T ss_conf 89967985101310878627----7--983543203320011578888886699999998 No 108 >cd06145 REX1_like This subfamily is composed of RNA exonuclease 1 (REX1 or Rex1p), REX3 (or Rex3p), and similar eukaryotic proteins. In yeast, REX1 and REX3 are required for 5S rRNA and MRP (mitochondrial RNA processing) RNA maturation, respectively. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. REX1 is the major exonuclease responsible for pre-tRNA trail trimming and may also be involved in nuclear CCA turnover. REX proteins function in the processing and maturation of many RNA species, similar to the function of E.coli RNase T. Probab=95.47 E-value=0.04 Score=32.15 Aligned_cols=55 Identities=27% Similarity=0.302 Sum_probs=35.3 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHH-CCC--CCHHHHHHHC Q ss_conf 9999986388981999730774999998608998842033168899986-789--9968989982 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSN-GLP--SALKAVGEAL 116 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l-~~~--~sL~~la~~l 116 (675) +.+..++ +++.++|||.+.+|+.+|. +.- .+++||.++-..- +++ .||+.+++.+ T Consensus 67 ~~l~~li-~~~~IlVGH~L~nDL~aL~----i~H--~~v~DTa~l~~~~~~~~~~~sLk~L~~~~ 124 (150) T cd06145 67 KKLLSLI-SPDTILVGHSLENDLKALK----LIH--PRVIDTAILFPHPRGPPYKPSLKNLAKKY 124 (150) T ss_pred HHHHHHC-CCCCEEEECCCCCCHHHHH----CCC--CCEEEHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 9999875-9998899626434777761----558--76157166211037999897899999998 No 109 >pfam03175 DNA_pol_B_2 DNA polymerase type B, organellar and viral. Like pfam00136, members of this family are also DNA polymerase type B proteins. Those included here are found in plant and fungal mitochondria, and in viruses. Probab=95.39 E-value=0.048 Score=31.62 Aligned_cols=114 Identities=22% Similarity=0.240 Sum_probs=56.4 Q ss_pred EEEEEEH-HHHHHHHHHHCCC----C-------------CC----CCCEEEHHHHHHHHCCCCCHHHHHHHCCCCCCHHH Q ss_conf 1999730-7749999986089----9-------------88----42033168899986789996898998299811004 Q gi|254781193|r 67 TIVAHNS-YFERTVLKACFNL----D-------------IP----IHRWICTLVLARSNGLPSALKAVGEALKLSSQKME 124 (675) Q Consensus 67 ~kV~HNa-~FD~~~L~~~~Gi----~-------------i~----~~~~~DTmi~A~~l~~~~sL~~la~~l~l~~~k~~ 124 (675) ++.+||+ .||..++..++.- . +. .-.+.||... .|.||+.+++.+++...|.. T Consensus 2 ~~y~HNl~~fD~~fil~~l~~~~~~~~~i~~~~~~~~~~~~~~~~~i~~~DS~~l-----~~~sL~~~~~~f~l~~~Kg~ 76 (452) T pfam03175 2 IPYAHNLSKFDGTFILPNLIRKKIIDKKIWKNGIIIKIDFKYKYIKIMVRDSYLL-----LPTSLRKLAKAFNLEVQKGC 76 (452) T ss_pred CEECCCCCCCCHHHHHHHHHHCCCCCCEECCCCCEEEEEEECCCEEEEEECCHHH-----HHHHHHHHHHHHCCCCCCCC T ss_conf 3663576424775555446642887634714894798887526279999814554-----33649999998398665786 Q ss_pred HHHHHHHHHCCCCCCC--------------CC----CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCH Q ss_conf 3799998620211234--------------33----2346785853289999998998999999996301040001513 Q gi|254781193|r 125 EGKELIARFCKGETHE--------------CL----YDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEPLSSKEQEL 185 (675) Q Consensus 125 ~~k~l~~~~~k~~~~~--------------~~----~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~~~E~~~ 185 (675) -=...+.......... .. ...........-+....||.+||.++++++.++......+..+ T Consensus 77 fPy~~~~~~~~~~~~~~p~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Yc~~DV~iL~~v~~~f~~~~~~~~~l 155 (452) T pfam03175 77 FPYDFVNKNYMWDKGGFPIEKYWKDFKEYILNKEIWNLKNRPNGYDIRKETIEYCKNDVLVLAKLLISFKNFIYEEFNI 155 (452) T ss_pred CCHHHHCHHHHCCCCCCCCHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 6817746566526778884554256134566777876640466676299999999999999999999999999998097 No 110 >pfam11074 DUF2779 Domain of unknown function(DUF2779). This domain is conserved in bacteria. The function is not known. Probab=95.22 E-value=0.12 Score=28.98 Aligned_cols=74 Identities=34% Similarity=0.396 Sum_probs=43.2 Q ss_pred EEEECCCCCC-CCCCCCCHHHCCCCEEEEEEEEE-----CCEE---EEEEECCC---CHHHHHHHHHHHCCCCCEEEEEE Q ss_conf 9981417898-70004400112798199999998-----4803---99997888---10437999998638898199973 Q gi|254781193|r 5 FIDFETRSPI-DLTKCGICKYAEDVCVILFAFGF-----DDHP---VELWDCVH---DPVMPQVLKEALENTDVTIVAHN 72 (675) Q Consensus 5 ~lD~ET~~~~-dl~~~Gl~~y~~~~~i~~i~~a~-----~~~~---v~~~d~~~---~~~~~~~l~~~L~~~~~~kV~HN 72 (675) +||+||.+.. ++ -.|+-+| +-+.++|+. ++++ ..++.... ...+.+.|...|....-.+|.+| T Consensus 2 flDFEt~~~aiP~-~~g~~Py----qqipFQ~SlHi~~~~g~~~~h~efL~~~~~DPr~~~~~~L~~~i~~~~Gsi~~yn 76 (127) T pfam11074 2 FLDFETISSAIPV-FDGTRPY----QQIPFQFSLHIQKEDGKKLEHVEFLAEDGIDPRRAFIEALKKAIGKDYGSIVVYN 76 (127) T ss_pred CEEECCCCCCCCC-CCCCCCC----EECCEEEEEEEECCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEEH T ss_conf 1731246656668-7899766----1033788898886899720230120367768479999999998476787099962 Q ss_pred HHHHHHHHHHH Q ss_conf 07749999986 Q gi|254781193|r 73 SYFERTVLKAC 83 (675) Q Consensus 73 a~FD~~~L~~~ 83 (675) +.|+..+|... T Consensus 77 ~sfEk~~l~el 87 (127) T pfam11074 77 KSFEKTRLKEL 87 (127) T ss_pred HHHHHHHHHHH T ss_conf 48899999999 No 111 >pfam03104 DNA_pol_B_exo DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold. Probab=94.95 E-value=0.17 Score=28.13 Aligned_cols=76 Identities=16% Similarity=0.138 Sum_probs=39.1 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECC----E-----------------EEEEEECCCCHHHHHHHHH Q ss_conf 968999814178987000440011279819999999848----0-----------------3999978881043799999 Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFAFGFDD----H-----------------PVELWDCVHDPVMPQVLKE 59 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~a~~~----~-----------------~v~~~d~~~~~~~~~~l~~ 59 (675) ++.+++||||.+.. |-=+-.+...|..|+..+.. + .+.+........++..+.. T Consensus 86 lrilsfDIEc~~~~-----g~FP~~~~d~IiqIs~~~~~~g~~~~~~~~~~tl~~~~~i~~~~~V~~f~~E~~ll~~f~~ 160 (254) T pfam03104 86 LRVLSFDIECTSLK-----GKFPENEKDPIIQISCMLYREGEPDPIEDKLFTLKSCALIEIGGEVYIFPSEKELLRRFFE 160 (254) T ss_pred CEEEEEEEEECCCC-----CCCCCCCCCEEEEEEEEEEECCCCCCCCEEEEEECCCCCCCCCCEEEEECCHHHHHHHHHH T ss_conf 53999988958889-----9999878983899998988567898632069980787767789779995999999999999 Q ss_pred HHC--CCCCEEEEEEH-HHHHHHHHH Q ss_conf 863--88981999730-774999998 Q gi|254781193|r 60 ALE--NTDVTIVAHNS-YFERTVLKA 82 (675) Q Consensus 60 ~L~--~~~~~kV~HNa-~FD~~~L~~ 82 (675) ++. +++++ +|||. .||++.|.. T Consensus 161 ~i~~~dpdii-~gyNi~~FD~~yl~~ 185 (254) T pfam03104 161 FIRQYDPDII-TGYNGDNFDWPYILN 185 (254) T ss_pred HHHHCCCCEE-EECCCCCCCHHHHHH T ss_conf 9875199899-966787788899999 No 112 >KOG0969 consensus Probab=94.39 E-value=0.1 Score=29.59 Aligned_cols=115 Identities=16% Similarity=0.269 Sum_probs=63.9 Q ss_pred CCHHHHHHHHHHHC--CCCCEEEEEEH-HHHHHHHH---HHCCCC-C------C--CCCEEEHHHHHHHHCC-------- Q ss_conf 81043799999863--88981999730-77499999---860899-8------8--4203316889998678-------- Q gi|254781193|r 49 HDPVMPQVLKEALE--NTDVTIVAHNS-YFERTVLK---ACFNLD-I------P--IHRWICTLVLARSNGL-------- 105 (675) Q Consensus 49 ~~~~~~~~l~~~L~--~~~~~kV~HNa-~FD~~~L~---~~~Gi~-i------~--~~~~~DTmi~A~~l~~-------- 105 (675) ....+++.+..++. |++++ +|.|+ .||++.|. ..+|++ + + ...+.||+.-|+..+- T Consensus 337 ~E~elL~~W~~firevDPDvI-~GYNi~nFDiPYll~RA~tL~Ie~Fp~LGRikn~~s~irDttfSSkq~GtRetK~v~I 415 (1066) T KOG0969 337 TEKELLESWRKFIREVDPDVI-IGYNICNFDIPYLLNRAKTLGIENFPYLGRIKNSRSVIRDSTFSSKQYGTRETKEVNI 415 (1066) T ss_pred CHHHHHHHHHHHHHHCCCCEE-ECCCCCCCCCCEECCHHHHCCCCCCCCCCEECCCCEEEECCCCCHHHCCCCCCEEEEE T ss_conf 388999999999986298867-1335445563213076764680666501000354124410321232137510138866 Q ss_pred -------------------CCCHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf -------------------9996898998299811004379999862021123433234678585328999999899899 Q gi|254781193|r 106 -------------------PSALKAVGEALKLSSQKMEEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVT 166 (675) Q Consensus 106 -------------------~~sL~~la~~l~l~~~k~~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~ 166 (675) +++|.+++... ++++|..--+..|-.+ . .-..+--..+..||..|+. T Consensus 416 ~GRlqfDllqvi~RdyKLrSytLNaVs~hF-L~EQKEDV~~siItdL------------Q-ng~~~TRRRlA~YCLKDAY 481 (1066) T KOG0969 416 DGRLQFDLLQVILRDYKLRSYTLNAVSAHF-LGEQKEDVHHSIITDL------------Q-NGNEQTRRRLAVYCLKDAY 481 (1066) T ss_pred CCEEEEHHHHHHHHHHHHHHCCHHHHHHHH-HHHHCCCCCCCCHHHH------------H-CCCHHHHHHHHHHHHHHHC T ss_conf 565652279998776433111011457876-5532015652100433------------1-4767888899988764410 Q ss_pred HHHHHHHHCCCC Q ss_conf 999999630104 Q gi|254781193|r 167 VTREIFKRLEPL 178 (675) Q Consensus 167 ~t~~L~~~l~~~ 178 (675) +.++|.++|.-. T Consensus 482 LPlRLlekLM~i 493 (1066) T KOG0969 482 LPLRLLEKLMVI 493 (1066) T ss_pred CHHHHHHHHHHH T ss_conf 229999999999 No 113 >cd06143 PAN2_exo PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. PAN catalyzes the deadenylation of poly(A) tails, which are initially synthesized to default lengths of 70 to 90, to mRNA-specific lengths of 55 to 71. Pab1p and PAN also play a role in the export and decay of mRNA. PAN2 contains a DEDDh-type DnaQ-like 3'-5' exonuclease domain with three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=94.18 E-value=0.098 Score=29.63 Aligned_cols=52 Identities=23% Similarity=0.236 Sum_probs=35.1 Q ss_pred HCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHC-CCCCHHHHHHHC Q ss_conf 63889819997307749999986089988420331688999867-899968989982 Q gi|254781193|r 61 LENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSNG-LPSALKAVGEAL 116 (675) Q Consensus 61 L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~-~~~sL~~la~~l 116 (675) |-+.+.++|||-+..|+.+| ++..|...++||..+-+.-. ...||+.|++.+ T Consensus 98 li~~~~ILVGHsL~nDL~aL----~l~hP~~~i~DTs~l~~~~~~r~~sLk~La~~~ 150 (174) T cd06143 98 LVDLGCIFVGHGLAKDFRVI----NIQVPKEQVIDTVELFHLPGQRKLSLRFLAWYL 150 (174) T ss_pred HCCCCCEEECCCCHHHHHHH----CCCCCCCCEEECHHHCCCCCCCCCCHHHHHHHH T ss_conf 54799689789636679885----066998665777885268998870899999998 No 114 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=93.96 E-value=0.076 Score=30.36 Aligned_cols=283 Identities=16% Similarity=0.130 Sum_probs=139.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHC-----------------CCC---H-HCCCCHHHHHHH--------------------HH Q ss_conf 999999899899999999630-----------------104---0-001513366665--------------------31 Q gi|254781193|r 155 RLFGEYCKGDVTVTREIFKRL-----------------EPL---S-SKEQELWYLDQT--------------------IN 193 (675) Q Consensus 155 ~~~~~Ya~~Dv~~t~~L~~~l-----------------~~~---~-~~E~~~~~~~~~--------------------me 193 (675) +...++.+.|+++|++||..| .|. + ..|-.-+.++++ .+ T Consensus 566 ~l~~~L~~eDaDitfrlyhylklrle~~~~~~L~~eiE~Pl~~VL~~ME~~Gi~~D~~~L~~L~~e~~~~l~~lE~~Iy~ 645 (1005) T TIGR00593 566 ELLKELDAEDADITFRLYHYLKLRLEENKLLSLYEEIELPLSKVLAEMEKTGIKVDKDYLQELSKEFGEELLDLEEEIYE 645 (1005) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98887311322046677778886516388999998842007899999877667870899999999999999999999998 Q ss_pred HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCH-HHHHHHHHHHCCCCCCCCCCCC-HHHHHHH------ Q ss_conf 1562137999999999999999999999998728854767887-9998877775588888898853-3899872------ Q gi|254781193|r 194 DRGYCVDVALAHTLNQIVDVERTKLDQELESLTYGLVSSSRCL-EKLKTWLFLETGLHLEDLTETT-CQRVLSS------ 265 (675) Q Consensus 194 ~~Gi~vD~e~l~~l~~~~~~~~~~l~~~i~~~~~~~~~n~~S~-~ql~~~L~~~lg~~~~~~~k~t-~~~~L~~------ 265 (675) .-|..++...=+|+...+=+++.-=.. =+..||+ -|+ ..+.+-|++.- +. ....|-. T Consensus 646 ~aG~~FNinSPKQL~~vLF~KL~Lp~~-kK~KtG~-----YSTd~~vLe~L~~~h---------~~~i~~~Ll~yR~L~K 710 (1005) T TIGR00593 646 LAGEEFNINSPKQLGEVLFEKLGLPVG-KKTKTGS-----YSTDADVLEKLAEKH---------PIHIALILLEYRQLTK 710 (1005) T ss_pred HCCCCCCCCCHHHHHHHHHHHHCCCCC-CCCCCCC-----CEECHHHHHHHHHCC---------CCCHHHHHHHHHHHHH T ss_conf 749520688733555886998478820-0346786-----213388999887428---------8836899987666654 Q ss_pred HHCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEE---EEEEE--ECCCCCCCCCCCCCCCCCCCC-CCCCCHH Q ss_conf 13048-23899999999976551013357887408689676---77884--000132211247888444888-8865113 Q gi|254781193|r 266 NKKLT-PQAIQVFNNRMRASRSAVLKLNTLVEAMNSDGRLR---GTLQF--LGASRTGRWSGCVFPPQNLPR-PSRSHEE 338 (675) Q Consensus 266 ~~~~~-~~~~~~l~~r~~~~k~~~~~~~~~~~~~~~dgRih---~~~~~--~gat~TGR~S~~~pNlQnlPr-~~~~~~~ 338 (675) +...+ ..+..++.. .-.|..++|.++. ...||+- |++|= .. +.-||-==+.+ +|. .+....+ T Consensus 711 LksTY~d~Ll~~v~~--~TgRiHT~F~Qt~----TaTGRLSS~~PNLQNIPiR-~~~Gr~IR~AF----va~~~g~~L~~ 779 (1005) T TIGR00593 711 LKSTYVDGLLELVNP--DTGRIHTTFNQTG----TATGRLSSSNPNLQNIPIR-SEEGRKIRKAF----VAESKGWLLIS 779 (1005) T ss_pred HHHCCHHHHHHHHHH--CCCCEEHHHHHHH----HHCCCCCCCCCCCCCCCCC-CCCHHHHHHCC----CCCCCCCEEEE T ss_conf 310120133898400--0282731314545----6356200478885443322-72001332200----24688606885 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCHHHHHEEEECCCCCEEEEEECCCCCHHHHHHHCCCCC----HHHHHHH-CCCC---HH Q ss_conf 45543101233334554432100333301266898789998345372333323114620----1134541-1331---01 Q gi|254781193|r 339 LDQIITHLPTSITSDPDPLQIISDCVRSCVIASEGKKLVVADLAGIEARVLAWMAGEQW----KLKAFET-GEDL---YV 410 (675) Q Consensus 339 ~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~----l~~af~~-g~D~---h~ 410 (675) .| +|-+ .+| |=|=+||.|-=+-|.--+.+. -...|.- ..|+ ++ T Consensus 780 AD-------------YSQI-----ELR-----------iLAHlS~De~Li~AF~~~~DIH~~TA~~~Fgv~~~~vtp~~R 830 (1005) T TIGR00593 780 AD-------------YSQI-----ELR-----------ILAHLSQDENLIEAFKNGEDIHTQTASRLFGVEEEDVTPNMR 830 (1005) T ss_pred CC-------------CCHH-----HHH-----------HHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHH T ss_conf 26-------------3379-----999-----------998640786799997548871388999972989774496460 Q ss_pred HHHHH-HCCCCHHHHCCHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 01476-44879567062011101652021002888888998798708883113455655545555678888999988764 Q gi|254781193|r 411 TTYAR-SFNVAPAQVTKDQRAIGKVMELALGYQGGAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENH 489 (675) Q Consensus 411 ~tA~~-~~g~~~~~~~~~~R~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (675) ..|.. -||+-|== +. -..|+..+ ++..++..++..||++||+++.|...+.+.+ T Consensus 831 R~AK~vNFGivYGm--Sa-fgLA~~L~----------------------Is~~EA~~fIe~YF~~yPgv~~Y~~~~~~~a 885 (1005) T TIGR00593 831 RIAKAVNFGIVYGM--SA-FGLARELG----------------------ISRKEAKEFIERYFERYPGVKEYLENTVEEA 885 (1005) T ss_pred CCCCEEEEHHHHCC--CH-HHHHHHHC----------------------CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 47414430132101--20-57777508----------------------8888999999999763413899999999999 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCHHHHH Q ss_conf 310001022078889999999987887899 Q gi|254781193|r 490 PEHAIEDLFIGTACERVKKAWRQKHSRVSE 519 (675) Q Consensus 490 ~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~ 519 (675) ...|++.++.++. +++..-..+...+++ T Consensus 886 r~~GYV~Tl~gRR--RY~~~i~s~n~~~R~ 913 (1005) T TIGR00593 886 RKKGYVTTLLGRR--RYIPDINSRNRNLRE 913 (1005) T ss_pred HHCCCEEEECCCE--ECCCCCCHHHHHHHH T ss_conf 7368556304656--115763223577799 No 115 >pfam04857 CAF1 CAF1 family ribonuclease. The major pathways of mRNA turnover in eukaryotes initiate with shortening of the polyA tail. CAF1 encodes a critical component of the major cytoplasmic deadenylase in yeast. Both Caf1p is required for normal mRNA deadenylation in vivo and localizes to the cytoplasm. Caf1p copurifies with a Ccr4p-dependent polyA-specific exonuclease activity. Some members of this family include and inserted RNA binding domain pfam01424. This family of proteins is related to other exonucleases pfam00929 (Bateman A pers. obs.). The crystal structure of Saccharomyces cerevisiae Pop2 has been resolved at 2.3 Angstrom#resolution. Probab=93.47 E-value=0.24 Score=27.17 Aligned_cols=59 Identities=19% Similarity=0.145 Sum_probs=42.5 Q ss_pred CCCCCEEEEEEHHHHHHHHHHHC-CCCCCC-------------CCEEEHHHHHHHHC-----CCCCHHHHHHHCCCCC Q ss_conf 38898199973077499999860-899884-------------20331688999867-----8999689899829981 Q gi|254781193|r 62 ENTDVTIVAHNSYFERTVLKACF-NLDIPI-------------HRWICTLVLARSNG-----LPSALKAVGEALKLSS 120 (675) Q Consensus 62 ~~~~~~kV~HNa~FD~~~L~~~~-Gi~i~~-------------~~~~DTmi~A~~l~-----~~~sL~~la~~l~l~~ 120 (675) .+.+++||+||..||...|-+.+ |-++|. ..++||-.+++... ...+|..+|..+++.. T Consensus 137 ~~~~~~wv~f~g~yD~~yl~k~l~g~~LP~~~~eF~~~l~~~FP~vyD~K~la~~~~~~~~~~~~gL~~lA~~L~v~r 214 (235) T pfam04857 137 NISSVTWVTFHGLYDFGYLLKLLTGGPLPETLDDFLELLRELFPRVYDTKYLAKFCFELDGGSNGGLQELADLLGVNR 214 (235) T ss_pred CCCCCEEEEECCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCC T ss_conf 579987998172778999999974998999899999999997771750999998522135875578999999749978 No 116 >KOG2249 consensus Probab=93.46 E-value=0.53 Score=24.92 Aligned_cols=83 Identities=23% Similarity=0.191 Sum_probs=49.8 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHH---HHH--CCCCCHHHHHHHC-CCCCCHHHHHHH Q ss_conf 9999986388981999730774999998608998842033168899---986--7899968989982-998110043799 Q gi|254781193|r 55 QVLKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLA---RSN--GLPSALKAVGEAL-KLSSQKMEEGKE 128 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A---~~l--~~~~sL~~la~~l-~l~~~k~~~~k~ 128 (675) .++.++|. +.++|||-+.-|+.+|.- .-|...++||---- .++ ...-||+.|++.+ |.+.+ T Consensus 176 ~ev~klL~--gRIlVGHaLhnDl~~L~l----~hp~s~iRDTs~~~pl~k~~~~~~tpSLK~Lt~~~Lg~~IQ------- 242 (280) T KOG2249 176 KEVLKLLK--GRILVGHALHNDLQALKL----EHPRSMIRDTSKYPPLMKLLSKKATPSLKKLTEALLGKDIQ------- 242 (280) T ss_pred HHHHHHHH--CCEEECCCCCCHHHHHHH----HCCHHHHCCCCCCCHHHHHHHCCCCCCHHHHHHHHHCHHHH------- T ss_conf 99999970--877751220007999855----37435531431275487775215983199999998452543------- Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9986202112343323467858532899999989989999999963010 Q gi|254781193|r 129 LIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 129 l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) ...+ =..+|+.+|.+||.+... T Consensus 243 ------------------~GeH---------sSvEDA~AtM~LY~~vk~ 264 (280) T KOG2249 243 ------------------VGEH---------SSVEDARATMELYKRVKV 264 (280) T ss_pred ------------------CCCC---------CCHHHHHHHHHHHHHHHH T ss_conf ------------------1566---------858899999999999999 No 117 >cd06135 Orn Oligoribonuclease (Orn) is a DEDDh-type DnaQ-like 3'-5' exoribonuclease that is responsible for degrading small oligoribonucleotides to mononucleotides. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Orn is essential for E.coli survival. The human homolog, also called Sfn (small fragment nuclease), is able to hydrolyze short single-stranded RNA and DNA oligomers. It plays a role in cellular nucleotide recycling. Probab=92.93 E-value=0.39 Score=25.75 Aligned_cols=53 Identities=13% Similarity=0.177 Sum_probs=28.1 Q ss_pred HHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCHHHHHHHCC Q ss_conf 9998638898199973077499999860899884203316889998678999689899829 Q gi|254781193|r 57 LKEALENTDVTIVAHNSYFERTVLKACFNLDIPIHRWICTLVLARSNGLPSALKAVGEALK 117 (675) Q Consensus 57 l~~~L~~~~~~kV~HNa~FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~~~sL~~la~~l~ 117 (675) +.++......+..|+|..||+.||.++ +|. .+..+-=|.+| -.|++.+++++. T Consensus 86 i~~~~~~~~~pLaGNSV~~Dr~FL~k~----mP~---l~~~lhYR~iD-VSsikel~~rw~ 138 (173) T cd06135 86 IKKYVPKGKSPLAGNSVHQDRRFLDKY----MPE---LEEYLHYRILD-VSSIKELARRWY 138 (173) T ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHH----CCH---HHHHCCCEEEE-HHHHHHHHHHHC T ss_conf 999767998531576487709999985----806---79759870466-777999999869 No 118 >PRK08928 consensus Probab=92.85 E-value=0.078 Score=30.27 Aligned_cols=29 Identities=17% Similarity=0.131 Sum_probs=18.9 Q ss_pred CHHHHHHHCCCCCCHHHHHHHHHHHHCCC Q ss_conf 96898998299811004379999862021 Q gi|254781193|r 108 ALKAVGEALKLSSQKMEEGKELIARFCKG 136 (675) Q Consensus 108 sL~~la~~l~l~~~k~~~~k~l~~~~~k~ 136 (675) .-+.+-+.+|+++++..+...|+-.-+-+ T Consensus 159 ~~~~V~ek~GV~P~qiiD~laL~GDsSDN 187 (861) T PRK08928 159 TEDDVVEKFGVTSDKLLDVMALTGDRSDN 187 (861) T ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCCCC T ss_conf 89999999797989999999980876468 No 119 >PRK07748 sporulation inhibitor KapD; Provisional Probab=91.69 E-value=0.89 Score=23.43 Aligned_cols=138 Identities=19% Similarity=0.098 Sum_probs=74.9 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEE-EECCEEEE-----EE-ECC--------------------CCHHH Q ss_conf 96899981417898700044001127981999999-98480399-----99-788--------------------81043 Q gi|254781193|r 1 MATLFIDFETRSPIDLTKCGICKYAEDVCVILFAF-GFDDHPVE-----LW-DCV--------------------HDPVM 53 (675) Q Consensus 1 mk~i~lD~ET~~~~dl~~~Gl~~y~~~~~i~~i~~-a~~~~~v~-----~~-d~~--------------------~~~~~ 53 (675) |++|.+|+|.++...-. .+-...+||+-|+. .++++.+. |+ |.. .+..+ T Consensus 4 ~~yiViDfEaTc~~~~~----~p~~~~~EIIEiGaV~l~~~~i~d~F~~~VkP~~~p~Ls~fc~~LTGItq~~vd~a~~F 79 (205) T PRK07748 4 QRFLFLDFEFTMPQHKK----KPKGFFPEIIEVGLVSVVGCKVEDTFSSYVKPKTFPSLTDRCKKFLGITQEDVDKGISF 79 (205) T ss_pred CEEEEEEEECCCCCCCC----CCCCCCCCEEEEEEEEEECCEEEEEHEEEECCCCCCCCCHHHHHHCCCCHHHHHCCCCH T ss_conf 66999998547888788----87778875179978998178152100004798547851378997529499996258988 Q ss_pred ---HHHHHHHHCCCCCEEEEEEHHHHHHHHHH---HCCCCCCC-CCEEEHHHHH-HHHC--CCCCHHHHHHHCCCCCCHH Q ss_conf ---79999986388981999730774999998---60899884-2033168899-9867--8999689899829981100 Q gi|254781193|r 54 ---PQVLKEALENTDVTIVAHNSYFERTVLKA---CFNLDIPI-HRWICTLVLA-RSNG--LPSALKAVGEALKLSSQKM 123 (675) Q Consensus 54 ---~~~l~~~L~~~~~~kV~HNa~FD~~~L~~---~~Gi~i~~-~~~~DTmi~A-~~l~--~~~sL~~la~~l~l~~~k~ 123 (675) .+.+.+++.+.+.. +++...+|+.+|.+ .+|+++|. ..|+|....- ...+ ...||+..-..+|++... T Consensus 80 ~evl~~f~~~~~~~~~~-~~tWG~~D~~~L~~~c~~~~i~~p~~~~~idlk~~f~~~~~~~~~~gL~~aL~~lgl~~eG- 157 (205) T PRK07748 80 EELVEKLAEYDKRCKPT-IVTWGNMDMKVLKHNCEKAGVPFPFKGQCRDLSLEYKKFFGERNQTGLWKAIEEYGKEGTG- 157 (205) T ss_pred HHHHHHHHHHHCCCCEE-EEEECHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHCCCCCCCHHHHHHHCCCCCCC- T ss_conf 99999999981778728-9871788999999999993899985310063999999996888876899999976998889- Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 437999986202112343323467858532899999989989999999963010 Q gi|254781193|r 124 EEGKELIARFCKGETHECLYDRNKPNHREAWRLFGEYCKGDVTVTREIFKRLEP 177 (675) Q Consensus 124 ~~~k~l~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~ 177 (675) ..| =+..|+..|.+++.+|.. T Consensus 158 ------------------------~~H---------~glDDA~NtAkI~kk~~~ 178 (205) T PRK07748 158 ------------------------KHH---------CALDDAMTTYNIFKLVEK 178 (205) T ss_pred ------------------------CCC---------CHHHHHHHHHHHHHHHHC T ss_conf ------------------------863---------758999999999999761 No 120 >PRK11779 sbcB exonuclease I; Provisional Probab=91.17 E-value=1 Score=23.09 Aligned_cols=29 Identities=14% Similarity=0.219 Sum_probs=21.2 Q ss_pred HHHHHHHCCCCCEEEEEE-HHHHHHHHHHH Q ss_conf 999998638898199973-07749999986 Q gi|254781193|r 55 QVLKEALENTDVTIVAHN-SYFERTVLKAC 83 (675) Q Consensus 55 ~~l~~~L~~~~~~kV~HN-a~FD~~~L~~~ 83 (675) ..+...+..++...||+| +.||-.++.+. T Consensus 84 ~~i~~~~~~~~t~~iGyNsi~FDee~~R~~ 113 (477) T PRK11779 84 ARIHAEFSQPGTCILGYNNIRFDDEVTRYI 113 (477) T ss_pred HHHHHHHCCCCCEEEECCCCCCCHHHHHHH T ss_conf 999999668984898227754489999999 No 121 >PRK07556 consensus Probab=90.79 E-value=0.15 Score=28.48 Aligned_cols=29 Identities=14% Similarity=0.178 Sum_probs=19.2 Q ss_pred CHHHHHHHCCCCCCHHHHHHHHHHHHCCC Q ss_conf 96898998299811004379999862021 Q gi|254781193|r 108 ALKAVGEALKLSSQKMEEGKELIARFCKG 136 (675) Q Consensus 108 sL~~la~~l~l~~~k~~~~k~l~~~~~k~ 136 (675) ..+.+-+.+|+.+++..+...|+-.-+-. T Consensus 167 d~~~V~eK~GV~P~qv~D~lAL~GDsSDN 195 (977) T PRK07556 167 GIPEVIEKFGVPPEKVVDVQALAGDSVDN 195 (977) T ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCCCC T ss_conf 79999998696989999999971876467 No 122 >COG0749 PolA DNA polymerase I - 3'-5' exonuclease and polymerase domains [DNA replication, recombination, and repair] Probab=90.39 E-value=0.27 Score=26.76 Aligned_cols=57 Identities=11% Similarity=0.086 Sum_probs=45.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHH Q ss_conf 34556555455556788889999887643100010220788899999999878878999 Q gi|254781193|r 462 FAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQKHSRVSEL 520 (675) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~~p~i~~~ 520 (675) .++..++..||++||+++.|.....+.+.+.|++.++.++. +++..++..+..++++ T Consensus 447 ~eA~~~I~~YF~rypgv~~ym~~~~~~ar~~GyV~Tl~gRR--ry~p~i~s~n~~~R~~ 503 (593) T COG0749 447 KEAKEYIDRYFERYPGVKEYMERTKEEAREDGYVETLFGRR--RYLPDINSSNRVVRAA 503 (593) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEEECCCCC--CCCCCCCCCCHHHHHH T ss_conf 88899999999868689999999999999749354102660--2486210177888878 No 123 >PRK06887 consensus Probab=90.32 E-value=0.18 Score=27.93 Aligned_cols=27 Identities=11% Similarity=0.049 Sum_probs=18.2 Q ss_pred CHHHHHHHCCCCCCHHHHHHHHHHHHC Q ss_conf 968989982998110043799998620 Q gi|254781193|r 108 ALKAVGEALKLSSQKMEEGKELIARFC 134 (675) Q Consensus 108 sL~~la~~l~l~~~k~~~~k~l~~~~~ 134 (675) .-+.+-+.+|+.+++..+...|+-.-+ T Consensus 161 d~~~V~ek~GV~P~qiiD~lAL~GDsS 187 (954) T PRK06887 161 DREGVIEKYGIPPELIIDYLALMGDSS 187 (954) T ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCC T ss_conf 799998987979899999999718852 No 124 >PRK08076 consensus Probab=90.18 E-value=0.21 Score=27.48 Aligned_cols=25 Identities=20% Similarity=0.236 Sum_probs=14.9 Q ss_pred CHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 9689899829981100437999986 Q gi|254781193|r 108 ALKAVGEALKLSSQKMEEGKELIAR 132 (675) Q Consensus 108 sL~~la~~l~l~~~k~~~~k~l~~~ 132 (675) .-+.+-+.+|+.+++..+...|+-. T Consensus 161 ~~~~v~ek~Gv~P~q~~D~~aL~GD 185 (877) T PRK08076 161 TPEALFEKYGLTPKQIIDMKGLMGD 185 (877) T ss_pred CHHHHHHHHCCCHHHHHHHHHHCCC T ss_conf 8999999979698999999997187 No 125 >PRK07625 consensus Probab=90.01 E-value=0.27 Score=26.77 Aligned_cols=27 Identities=11% Similarity=0.106 Sum_probs=17.9 Q ss_pred CHHHHHHHCCCCCCHHHHHHHHHHHHC Q ss_conf 968989982998110043799998620 Q gi|254781193|r 108 ALKAVGEALKLSSQKMEEGKELIARFC 134 (675) Q Consensus 108 sL~~la~~l~l~~~k~~~~k~l~~~~~ 134 (675) .-+.+-+.+|+.+++..+...|+-.-+ T Consensus 160 d~~~V~ek~GV~P~qviD~lAL~GDsS 186 (922) T PRK07625 160 DRDGVLAKFGVPPERIVDYLSLIGDTV 186 (922) T ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCC T ss_conf 899999987979899999999719763 No 126 >PRK08786 consensus Probab=90.00 E-value=0.19 Score=27.72 Aligned_cols=25 Identities=12% Similarity=0.097 Sum_probs=16.1 Q ss_pred HHHHHHCCCCCCHHHHHHHHHHHHC Q ss_conf 8989982998110043799998620 Q gi|254781193|r 110 KAVGEALKLSSQKMEEGKELIARFC 134 (675) Q Consensus 110 ~~la~~l~l~~~k~~~~k~l~~~~~ 134 (675) +.+-+.+|+.+.+..+...|+-.-+ T Consensus 158 ~~V~eK~GV~P~qviD~lAL~GDsS 182 (927) T PRK08786 158 AAVIAKFGVRPDQIVDLLALMGDTV 182 (927) T ss_pred HHHHHHHCCCHHHHHHHHHHCCCCC T ss_conf 9999996969899999999808764 No 127 >PRK07300 consensus Probab=89.68 E-value=0.17 Score=28.01 Aligned_cols=24 Identities=17% Similarity=0.189 Sum_probs=13.9 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 689899829981100437999986 Q gi|254781193|r 109 LKAVGEALKLSSQKMEEGKELIAR 132 (675) Q Consensus 109 L~~la~~l~l~~~k~~~~k~l~~~ 132 (675) -+.+-+.+|+.+++..+...|+-. T Consensus 170 ~~~v~ek~GV~P~qiiD~lAL~GD 193 (880) T PRK07300 170 PAYLMEKMGLTPNQFIDLKALMGD 193 (880) T ss_pred HHHHHHHHCCCHHHHHHHHHHCCC T ss_conf 999999978898999999998088 No 128 >PRK07898 consensus Probab=89.34 E-value=0.22 Score=27.31 Aligned_cols=26 Identities=19% Similarity=0.101 Sum_probs=17.3 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHC Q ss_conf 68989982998110043799998620 Q gi|254781193|r 109 LKAVGEALKLSSQKMEEGKELIARFC 134 (675) Q Consensus 109 L~~la~~l~l~~~k~~~~k~l~~~~~ 134 (675) -+.+-+.+|+.+++..+...|+-.-+ T Consensus 176 ~~~V~ek~GV~P~qviD~lAL~GDsS 201 (902) T PRK07898 176 PEAVEEKYGLTPAQYPDFAALRGDPS 201 (902) T ss_pred HHHHHHHHCCCHHHHHHHHHHCCCCC T ss_conf 99999986979899999999818864 No 129 >PRK07997 consensus Probab=88.93 E-value=0.22 Score=27.31 Aligned_cols=27 Identities=7% Similarity=0.006 Sum_probs=18.1 Q ss_pred CHHHHHHHCCCCCCHHHHHHHHHHHHC Q ss_conf 968989982998110043799998620 Q gi|254781193|r 108 ALKAVGEALKLSSQKMEEGKELIARFC 134 (675) Q Consensus 108 sL~~la~~l~l~~~k~~~~k~l~~~~~ 134 (675) .-+.+-+.+|+.+++..+...|+-.-+ T Consensus 161 ~~~~V~ek~GV~P~qv~D~laL~GDsS 187 (928) T PRK07997 161 GPEEVVNKYGVPPELIIDFLALMGDSS 187 (928) T ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCC T ss_conf 799999987979898899999728865 No 130 >TIGR01298 RNaseT ribonuclease T; InterPro: IPR005987 Ribonuclease T (3.1.13 from EC) is an enzyme found so far only in gamma-subdivision proteobacteria such as Escherichia coli and Xylella fastidiosa. Ribonuclease T is homologous to the DNA polymerase III alpha chain. It can liberate AMP from the common C-C-A terminus of uncharged tRNA. It appears also to be involved in RNA maturation. It also acts as a 3' to 5' single-strand DNA-specific exonuclease; it is distinctive for its ability to remove residues near a double-stranded stem. Ribonuclease T is a high copy suppressor in E. coli of a UV-repair defect caused by deletion of three other single-stranded DNA exonucleases.; GO: 0004540 ribonuclease activity, 0006396 RNA processing. Probab=88.80 E-value=0.32 Score=26.27 Aligned_cols=46 Identities=26% Similarity=0.290 Sum_probs=27.1 Q ss_pred HHHHHCC---CCCEEEEEEHHHHHHHHHHH----CCCC---CCCCCEEEHHHHHHH Q ss_conf 9998638---89819997307749999986----0899---884203316889998 Q gi|254781193|r 57 LKEALEN---TDVTIVAHNSYFERTVLKAC----FNLD---IPIHRWICTLVLARS 102 (675) Q Consensus 57 l~~~L~~---~~~~kV~HNa~FD~~~L~~~----~Gi~---i~~~~~~DTmi~A~~ 102 (675) ++..|.. ...+.|+||+.||..||.+. .+++ .-+-..|||..+|=+ T Consensus 94 vR~~~K~~gC~RailV~HNa~fD~~Fl~~AA~~R~~~KRnPFH~F~~FDTatLAGl 149 (201) T TIGR01298 94 VRKALKASGCQRAILVAHNASFDLGFLNRAAVKRTALKRNPFHPFSVFDTATLAGL 149 (201) T ss_pred HHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHH T ss_conf 99999755796036760032046668999999986115789876522468999888 No 131 >PRK08835 consensus Probab=87.38 E-value=0.5 Score=25.05 Aligned_cols=27 Identities=11% Similarity=0.081 Sum_probs=18.1 Q ss_pred CHHHHHHHCCCCCCHHHHHHHHHHHHC Q ss_conf 968989982998110043799998620 Q gi|254781193|r 108 ALKAVGEALKLSSQKMEEGKELIARFC 134 (675) Q Consensus 108 sL~~la~~l~l~~~k~~~~k~l~~~~~ 134 (675) .-+.+-+.+|+++++..+...|+-.-+ T Consensus 161 d~~~V~ek~GV~P~qviD~lAL~GDsS 187 (931) T PRK08835 161 DREGVVEKFGIPPELIIDYLALMGDKV 187 (931) T ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCC T ss_conf 899999987969899899999718864 No 132 >PRK05797 consensus Probab=86.94 E-value=0.34 Score=26.16 Aligned_cols=24 Identities=13% Similarity=0.165 Sum_probs=12.9 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 689899829981100437999986 Q gi|254781193|r 109 LKAVGEALKLSSQKMEEGKELIAR 132 (675) Q Consensus 109 L~~la~~l~l~~~k~~~~k~l~~~ 132 (675) -+.+-+.+|+.+.+..+...|+-. T Consensus 162 ~~~v~ek~Gv~P~q~~D~~aL~GD 185 (869) T PRK05797 162 KNKMVEEYGVTPTQFIDVKGLMGD 185 (869) T ss_pred HHHHHHHHCCCHHHHHHHHHHCCC T ss_conf 999899977899999999998088 No 133 >PRK05755 DNA polymerase I; Provisional Probab=86.87 E-value=0.44 Score=25.39 Aligned_cols=27 Identities=11% Similarity=0.079 Sum_probs=17.8 Q ss_pred CHHHHHHHCCCCCCHHHHHHHHHHHHC Q ss_conf 968989982998110043799998620 Q gi|254781193|r 108 ALKAVGEALKLSSQKMEEGKELIARFC 134 (675) Q Consensus 108 sL~~la~~l~l~~~k~~~~k~l~~~~~ 134 (675) .-+.+-+.+|+.+.+..+...|+-.-+ T Consensus 159 ~~~~v~ek~Gv~P~q~~D~~aL~GD~s 185 (889) T PRK05755 159 DPEGVIEKYGVTPEQIIDFLALMGDSS 185 (889) T ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCC T ss_conf 899999997969899999999808864 No 134 >KOG0950 consensus Probab=82.95 E-value=0.36 Score=25.95 Aligned_cols=14 Identities=14% Similarity=0.133 Sum_probs=6.7 Q ss_pred CCCHHHHHHHHCCC Q ss_conf 33101014764487 Q gi|254781193|r 406 EDLYVTTYARSFNV 419 (675) Q Consensus 406 ~D~h~~tA~~~~g~ 419 (675) .|+|...|...|.. T Consensus 656 ~di~~~va~tl~s~ 669 (1008) T KOG0950 656 EDILHFVAVTLLSA 669 (1008) T ss_pred HHHHHHHHHHHHHC T ss_conf 78887777742012 No 135 >KOG2207 consensus Probab=82.72 E-value=1.2 Score=22.63 Aligned_cols=17 Identities=6% Similarity=0.231 Sum_probs=10.7 Q ss_pred HCCCCHHHHHHHHCCCC Q ss_conf 11331010147644879 Q gi|254781193|r 404 TGEDLYVTTYARSFNVA 420 (675) Q Consensus 404 ~g~D~h~~tA~~~~g~~ 420 (675) ...|++....+.+|+.+ T Consensus 455 ~~se~w~~~~s~if~s~ 471 (617) T KOG2207 455 LASEIWHLLLSQIFESK 471 (617) T ss_pred CHHHHHHHHHHHHCCCC T ss_conf 40899999999870688 No 136 >KOG4373 consensus Probab=79.43 E-value=4 Score=19.20 Aligned_cols=35 Identities=26% Similarity=0.298 Sum_probs=16.5 Q ss_pred CCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHH Q ss_conf 88104379999986388981999730774999998 Q gi|254781193|r 48 VHDPVMPQVLKEALENTDVTIVAHNSYFERTVLKA 82 (675) Q Consensus 48 ~~~~~~~~~l~~~L~~~~~~kV~HNa~FD~~~L~~ 82 (675) .+-..+|..|+.+|+|++.++||-+..-|..=|.| T Consensus 167 ~~~~~IP~~LR~fl~D~~~~~vgv~~d~D~~KL~r 201 (319) T KOG4373 167 IHCKRIPHELRSFLEDPDHTFVGVWNDQDAGKLER 201 (319) T ss_pred ECCCCCHHHHHHHHCCCCCEEEECCCCCCHHHHHH T ss_conf 31465559999862387705873055530787766 No 137 >pfam09281 Taq-exonuc Taq polymerase, exonuclease. Members of this family are found in prokaryotic Taq DNA polymerase, where they assume a ribonuclease H-like motif. The domain confers 5'-3' exonuclease activity to the polymerase. Probab=79.31 E-value=4 Score=19.18 Aligned_cols=41 Identities=7% Similarity=-0.134 Sum_probs=30.2 Q ss_pred HHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCC-CHHHHHHHCC Q ss_conf 749999986089988420331688999867899-9689899829 Q gi|254781193|r 75 FERTVLKACFNLDIPIHRWICTLVLARSNGLPS-ALKAVGEALK 117 (675) Q Consensus 75 FD~~~L~~~~Gi~i~~~~~~DTmi~A~~l~~~~-sL~~la~~l~ 117 (675) =|+.++.-.-|..++++ .|.|+.||++|++. ....++++|+ T Consensus 71 KdLAvla~~eG~~v~PG--DDPmLlAYLlDPsNT~p~~varRY~ 112 (138) T pfam09281 71 KALAAHLSVRGTVVEPG--DDPLLVAYLLDPANTNMPGVARRYL 112 (138) T ss_pred HHHHHHHHCCCCCCCCC--CCHHHHHHHCCCCCCCCHHHHHHHC T ss_conf 99999987026656999--8602787650856788215667623 No 138 >KOG1275 consensus Probab=76.88 E-value=0.3 Score=26.45 Aligned_cols=26 Identities=4% Similarity=-0.003 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 56555455556788889999887643 Q gi|254781193|r 465 LVKTNSAPEEWEDAENFCRWMQENHP 490 (675) Q Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (675) ..+++.|-.+..++..|...|.-..+ T Consensus 960 iPFiDDYv~T~d~VvDYLTqySGI~P 985 (1118) T KOG1275 960 IPFIDDYVSTDDKVVDYLTQYSGIKP 985 (1118) T ss_pred CCCCCCEECCHHHHHHHHHHHCCCCC T ss_conf 86203233140588888887448776 No 139 >TIGR01591 Fdh-alpha formate dehydrogenase, alpha subunit; InterPro: IPR006478 This group of sequences describe a subset of formate dehydrogenase alpha chains found mainly in the archaea but also in alpha and gamma proteobacteria and a small number of Gram-positive bacteria. The alpha chain contains domains for molybdopterin dinucleotide binding and molybdopterin oxidoreductase. The holo-enzyme also contains beta and gamma subunits. The enzyme catalyzes the oxidation of formate (produced from pyruvate during anaerobic growth) to carbon dioxide with the concomitant release of two electrons and two protons. The enzyme's purpose is to allow growth on formate in some circumstances and, in the case of FdhH in gamma proteobacteria, to pass electrons to hydrogenase (by which process acid is neutralized) . The alpha subunit of a version of nitrate reductase is closely related.; GO: 0008863 formate dehydrogenase activity, 0015942 formate metabolic process, 0009326 formate dehydrogenase complex. Probab=54.59 E-value=6.3 Score=17.92 Aligned_cols=81 Identities=17% Similarity=0.174 Sum_probs=40.9 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCCC-CCCCHHHHHHHHCCCCCCC-CCCCCC-CCCHHHHHEEEECCCCCEEEEEEC Q ss_conf 6778840001322112478884448888-8651134554310123333-455443-210033330126689878999834 Q gi|254781193|r 305 RGTLQFLGASRTGRWSGCVFPPQNLPRP-SRSHEELDQIITHLPTSIT-SDPDPL-QIISDCVRSCVIASEGKKLVVADL 381 (675) Q Consensus 305 h~~~~~~gat~TGR~S~~~pNlQnlPr~-~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~iR~~f~a~~G~~lv~aDy 381 (675) |+.-|.=||..=|-++..-|.-|-+--. -....+.-+....+....+ +.+... ....+.+|.+|+-.|+=++..+|. T Consensus 337 RGQNNVQGacDmG~lp~~lPGY~~~~d~~~~~~~~~~WG~~~L~~~~GL~Ipe~~da~~~G~~kalYI~GEdp~~sDp~~ 416 (694) T TIGR01591 337 RGQNNVQGACDMGALPEFLPGYRKLSDEEVREKFEKLWGVVKLPAEPGLRIPEMIDAALDGDLKALYIVGEDPVISDPDT 416 (694) T ss_pred CCCCCCCCCCCCCCCCCCCCCCEECCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCEEEEEEECCCCEEECCCH T ss_conf 77555322101478888877867448368999998627831186688973223789985497169998358502237985 Q ss_pred CCCC Q ss_conf 5372 Q gi|254781193|r 382 AGIE 385 (675) Q Consensus 382 SqiE 385 (675) +..+ T Consensus 417 ~~V~ 420 (694) T TIGR01591 417 KKVR 420 (694) T ss_pred HHHH T ss_conf 8999 No 140 >KOG2304 consensus Probab=53.73 E-value=10 Score=16.64 Aligned_cols=41 Identities=24% Similarity=0.433 Sum_probs=31.8 Q ss_pred CCEEEEECCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 52021002888888998798708883113455655545555678888999988764310001022078889999999987 Q gi|254781193|r 434 VMELALGYQGGAKAFQKMASTTGLNLNTFAHLVKTNSAPEEWEDAENFCRWMQENHPEHAIEDLFIGTACERVKKAWRQK 513 (675) Q Consensus 434 ~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~r~~ 513 (675) +.-||.+|-||+..|..+ .| ...++-+.+.|++. T Consensus 231 aMklGagyPMGPfEL~Dy---vG-------------------------------------------LDt~kfvmdgwhe~ 264 (298) T KOG2304 231 AMKLGAGYPMGPFELADY---VG-------------------------------------------LDTCKFVMDGWHEG 264 (298) T ss_pred HHHCCCCCCCCHHHHHHH---HH-------------------------------------------HHHHHHHHHHHHHC T ss_conf 985258998870888777---43-------------------------------------------79999999999742 Q ss_pred CHHHHHH Q ss_conf 8878999 Q gi|254781193|r 514 HSRVSEL 520 (675) Q Consensus 514 ~p~i~~~ 520 (675) ||+...| T Consensus 265 ~pe~~~f 271 (298) T KOG2304 265 YPEDSLF 271 (298) T ss_pred CCCCCCC T ss_conf 8764335 No 141 >TIGR00888 guaA_Nterm GMP synthase, N-terminal domain; InterPro: IPR004739 The N-terminal region of GMP synthase 6.3.5.2 from EC is recognised by this signature. GMP synthase catalyzes the synthesis of GMP from XMP , .ATP + xanthosine 5'-phosphate + L-glutamine + H(2)O = AMP + diphosphate + GMP + L-glutamate ; GO: 0003922 GMP synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0006177 GMP biosynthetic process. Probab=47.42 E-value=11 Score=16.38 Aligned_cols=30 Identities=23% Similarity=0.077 Sum_probs=23.0 Q ss_pred HHCCCCEEEEECCCCHHHHHHHHHHCCCCCCHHHH Q ss_conf 10165202100288888899879870888311345 Q gi|254781193|r 430 AIGKVMELALGYQGGAKAFQKMASTTGLNLNTFAH 464 (675) Q Consensus 430 ~~aK~~~~g~~YG~G~~~l~~~~~~~G~~~~~~~~ 464 (675) ....+-+||+.||| |-||.++|=....+++ T Consensus 67 f~LgVPvLGICYG~-----Ql~A~~lGG~V~~~~~ 96 (195) T TIGR00888 67 FELGVPVLGICYGM-----QLMAKQLGGEVGRAEK 96 (195) T ss_pred HHCCCCEEEEEHHH-----HHHHHHHCCEEEECCC T ss_conf 62799688762457-----8999982987765688 No 142 >pfam07794 DUF1633 Protein of unknown function (DUF1633). This family contains sequences derived from a group of hypothetical proteins expressed by Arabidopsis thaliana. These sequences are highly similar and the region concerned is about 100 residues long. Probab=43.79 E-value=19 Score=14.80 Aligned_cols=11 Identities=9% Similarity=-0.224 Sum_probs=5.4 Q ss_pred CCCCCCCCCCC Q ss_conf 24788844488 Q gi|254781193|r 320 SGCVFPPQNLP 330 (675) Q Consensus 320 S~~~pNlQnlP 330 (675) |.++|.|+-+- T Consensus 373 SaShPsLpevn 383 (698) T pfam07794 373 SASHPSLPAAN 383 (698) T ss_pred CCCCCCCCCCC T ss_conf 24688752123 No 143 >TIGR02201 heptsyl_trn_III lipopolysaccharide heptosyltransferase III, putative; InterPro: IPR011916 This family consists of examples of the putative ADP-heptose:LPS heptosyltransferase III, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria. This enzyme may be less widely distributed than heptosyltransferases I and II.. Probab=43.08 E-value=17 Score=15.18 Aligned_cols=12 Identities=25% Similarity=0.097 Sum_probs=4.8 Q ss_pred HHHHHHHHHHHH Q ss_conf 368899999999 Q gi|254781193|r 602 VQAVSRDILAEG 613 (675) Q Consensus 602 iQgtaadi~k~a 613 (675) ||=|||=++||+ T Consensus 189 IqPtsRwfFKCW 200 (347) T TIGR02201 189 IQPTSRWFFKCW 200 (347) T ss_pred ECCCCCCCCCCC T ss_conf 736736310166 No 144 >TIGR00922 nusG transcription termination/antitermination factor NusG; InterPro: IPR001062 Bacterial transcription antitermination protein, nusG, is a component of the transcription complex and interacts with the termination factor Rho and RNA polymerase , . NusG is a bacterial transcriptional elongation factor involved in transcription termination and anti-termination .; GO: 0003711 transcription elongation regulator activity, 0006355 regulation of transcription DNA-dependent. Probab=37.87 E-value=19 Score=14.81 Aligned_cols=14 Identities=43% Similarity=0.472 Sum_probs=9.0 Q ss_pred HEEEECCCCCEEEEEEC Q ss_conf 30126689878999834 Q gi|254781193|r 365 RSCVIASEGKKLVVADL 381 (675) Q Consensus 365 R~~f~a~~G~~lv~aDy 381 (675) |++| |||+||-+|+ T Consensus 58 rK~f---PGYvLv~m~l 71 (193) T TIGR00922 58 RKIF---PGYVLVKMDL 71 (193) T ss_pred EEEC---CCEEEEEEEC T ss_conf 1154---8648999973 No 145 >TIGR00612 ispG_gcpE 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; InterPro: IPR004588 This protein previously of unknown biochemical function is essential in Escherichia coli. It has now been characterised as 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase, which converts 2C-methyl-D-erythritol 2,4-cyclodiphosphate (ME-2,4CPP) into 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate in the sixth step of nonmevalonate terpenoid biosynthesis. The family is restricted to bacteria, where it is widely but not universally distributed. No homology can be detected between this family and other proteins.; GO: 0016114 terpenoid biosynthetic process. Probab=36.92 E-value=24 Score=14.12 Aligned_cols=32 Identities=16% Similarity=0.036 Sum_probs=14.2 Q ss_pred CCCCEEEE------EEHHHHHHHHHHH-CCCCCCCCCEE Q ss_conf 88981999------7307749999986-08998842033 Q gi|254781193|r 63 NTDVTIVA------HNSYFERTVLKAC-FNLDIPIHRWI 94 (675) Q Consensus 63 ~~~~~kV~------HNa~FD~~~L~~~-~Gi~i~~~~~~ 94 (675) ...+++|| |..+.+....+.+ -++.|.|+++. T Consensus 83 ~~~vPLVADiHf~Ghk~~~~~~~~~~~v~kvRINPGNig 121 (633) T TIGR00612 83 GLNVPLVADIHFAGHKYKKAALAVAKGVAKVRINPGNIG 121 (633) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHHCCCEEEECCCCCC T ss_conf 688874887302566146899998611782543787643 No 146 >KOG1931 consensus Probab=34.99 E-value=21 Score=14.55 Aligned_cols=13 Identities=31% Similarity=0.419 Sum_probs=5.9 Q ss_pred EEECCEEEEECCC Q ss_conf 9952589997792 Q gi|254781193|r 626 LTVHDEIVCETPD 638 (675) Q Consensus 626 l~VHDEiv~evp~ 638 (675) +++||-+++.+|. T Consensus 934 l~~~~~~l~~~~~ 946 (1156) T KOG1931 934 LFKEDQILVLLPG 946 (1156) T ss_pred HHHHCEEEEECCC T ss_conf 2331147883486 No 147 >KOG0184 consensus Probab=33.18 E-value=28 Score=13.74 Aligned_cols=70 Identities=21% Similarity=0.271 Sum_probs=41.3 Q ss_pred EEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q ss_conf 962154211332223445640232476631016786621021010023456777531456554536889999999999-- Q gi|254781193|r 538 IIARKRRDVPLVYMKKDKNNLILMLPSGRSLVYRDVGNNRSYLNTTTRQLTREYTYGGKLTENIVQAVSRDILAEGMM-- 615 (675) Q Consensus 538 ~~~~~~~~~~~~~~~~~~~~~~~~~p~GR~~~~~~~~~~~~~~~~~~~~~~r~~~~~~k~~n~~iQgtaadi~k~ami-- 615 (675) +.+.+...+.-.+. ..+..|++.-|||-...|.-.-... + -|..-++|=|+-|- T Consensus 129 vRpfG~~~~~~~yd-~~g~~LymiepSG~~~~Y~~aaiGK----------g-------------rq~aKtElEKL~~~~m 184 (254) T KOG0184 129 VRPFGASTILGSYD-DEGPQLYMIEPSGSSYGYKGAAIGK----------G-------------RQAAKTELEKLKIDEM 184 (254) T ss_pred CCCCCCEEEEEEEE-CCCCEEEEECCCCCCCCEEEEECCC----------H-------------HHHHHHHHHHCCCCCC T ss_conf 15554337888880-8984599975788733033565051----------0-------------5788878886154525 Q ss_pred H---HHHCCCCEEEEECCE Q ss_conf 9---996798188995258 Q gi|254781193|r 616 N---ATKAGYDIVLTVHDE 631 (675) Q Consensus 616 ~---l~~~g~~ivl~VHDE 631 (675) . +-+...+|+..|||| T Consensus 185 t~~e~VkeaakIiY~~HDe 203 (254) T KOG0184 185 TCKELVKEAAKIIYKVHDE 203 (254) T ss_pred CHHHHHHHHHHEEEEECCC T ss_conf 1999999877246761263 No 148 >cd01779 Myosin_IXb_RA Myosin_IXb_RA RasGTP binding domain from guanine nucleotide exchange factors. In some proteins the domain acts as a RasGTP effector (AF6, canoe and RalGDS, for example), but in other cases it may not bind to RasGTP at all. Probab=32.50 E-value=25 Score=14.10 Aligned_cols=29 Identities=7% Similarity=0.202 Sum_probs=20.1 Q ss_pred HHHHHHHHHHHHHHHH--HHCCCCEEEEECC Q ss_conf 3688999999999999--9679818899525 Q gi|254781193|r 602 VQAVSRDILAEGMMNA--TKAGYDIVLTVHD 630 (675) Q Consensus 602 iQgtaadi~k~ami~l--~~~g~~ivl~VHD 630 (675) -|.|||||+..++.++ +..+...+.-||+ T Consensus 33 K~tTaaeVI~~vi~~L~Ld~~k~YvLaEVke 63 (105) T cd01779 33 KDSTAADVIDDVIASLQLDGTKCYVLAEVKE 63 (105) T ss_pred CCCCHHHHHHHHHHHHCCCCCCCEEEEEEEC T ss_conf 6675999999999985778764279999741 No 149 >PRK05359 oligoribonuclease; Provisional Probab=30.90 E-value=31 Score=13.50 Aligned_cols=11 Identities=27% Similarity=0.567 Sum_probs=8.3 Q ss_pred CEEEEEECCCC Q ss_conf 68999814178 Q gi|254781193|r 2 ATLFIDFETRS 12 (675) Q Consensus 2 k~i~lD~ET~~ 12 (675) +.+++|+|+++ T Consensus 4 ~lvWiDlEMTG 14 (181) T PRK05359 4 NLIWIDLEMTG 14 (181) T ss_pred CEEEEECCCCC T ss_conf 65899761348 No 150 >COG2388 Predicted acetyltransferase [General function prediction only] Probab=30.26 E-value=31 Score=13.43 Aligned_cols=24 Identities=21% Similarity=0.475 Sum_probs=21.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEE Q ss_conf 368899999999999996798188 Q gi|254781193|r 602 VQAVSRDILAEGMMNATKAGYDIV 625 (675) Q Consensus 602 iQgtaadi~k~ami~l~~~g~~iv 625 (675) =||.|+.+++.|+-.+.++|++|+ T Consensus 53 GqGia~~L~~~al~~ar~~g~kii 76 (99) T COG2388 53 GQGIAQKLVEKALEEAREAGLKII 76 (99) T ss_pred CCCHHHHHHHHHHHHHHHCCCEEC T ss_conf 874799999999999998198573 No 151 >COG1448 TyrB Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism] Probab=29.63 E-value=32 Score=13.36 Aligned_cols=17 Identities=12% Similarity=0.118 Sum_probs=7.5 Q ss_pred CCCCCHHHHHHHHHHHH Q ss_conf 62137999999999999 Q gi|254781193|r 196 GYCVDVALAHTLNQIVD 212 (675) Q Consensus 196 Gi~vD~e~l~~l~~~~~ 212 (675) |+....+.+.++.+.+. T Consensus 186 G~D~t~~qW~~l~~~~~ 202 (396) T COG1448 186 GIDPTEEQWQELADLIK 202 (396) T ss_pred CCCCCHHHHHHHHHHHH T ss_conf 99999999999999999 No 152 >COG1158 Rho Transcription termination factor [Transcription] Probab=27.21 E-value=35 Score=13.09 Aligned_cols=65 Identities=18% Similarity=0.202 Sum_probs=31.6 Q ss_pred CCCCCHHHHHEEEECCCCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHH-----CCCCHHHHHHHHCCCCHHHHCCHHHH Q ss_conf 4321003333012668987899983453723333231146201134541-----13310101476448795670620111 Q gi|254781193|r 356 PLQIISDCVRSCVIASEGKKLVVADLAGIEARVLAWMAGEQWKLKAFET-----GEDLYVTTYARSFNVAPAQVTKDQRA 430 (675) Q Consensus 356 ~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~l~~af~~-----g~D~h~~tA~~~~g~~~~~~~~~~R~ 430 (675) -..++++..|.+++|||- ||-..+++-..+ ..|.|... .+...-++|||.-+|. T Consensus 166 L~~PIGkGQR~LIVAPPk-------------------aGKT~lLq~IA~aIt~N~Pe~~LiV--LLIDERPEEVTdmqrs 224 (422) T COG1158 166 LISPIGKGQRGLIVAPPK-------------------AGKTTLLQNIANAITTNHPECELIV--LLIDERPEEVTDMQRS 224 (422) T ss_pred HHCCCCCCCEEEEECCCC-------------------CCCHHHHHHHHHHHHCCCCCEEEEE--EEECCCCHHHHHHHHH T ss_conf 526567884656866987-------------------8733899999999863799649999--9934780677778875 Q ss_pred HCCCCEEEEECC Q ss_conf 016520210028 Q gi|254781193|r 431 IGKVMELALGYQ 442 (675) Q Consensus 431 ~aK~~~~g~~YG 442 (675) .- .-+++.-|- T Consensus 225 V~-geViaSTFD 235 (422) T COG1158 225 VK-GEVVASTFD 235 (422) T ss_pred HC-CEEEEECCC T ss_conf 24-069864488 No 153 >pfam00372 Hemocyanin_M Hemocyanin, copper containing domain. This family includes arthropod hemocyanins and insect larval storage proteins. Probab=27.16 E-value=35 Score=13.08 Aligned_cols=14 Identities=14% Similarity=0.116 Sum_probs=7.0 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99998998999999 Q gi|254781193|r 157 FGEYCKGDVTVTRE 170 (675) Q Consensus 157 ~~~Ya~~Dv~~t~~ 170 (675) -+.|=.+|+-+..- T Consensus 64 ~lsYF~EDiGlNs~ 77 (278) T pfam00372 64 RLAYFTEDIGLNTY 77 (278) T ss_pred HHHHHHHCCCHHHH T ss_conf 44456650377988 No 154 >TIGR02188 Ac_CoA_lig_AcsA acetate--CoA ligase; InterPro: IPR011904 Acetyl-CoA synthetase (also known as acetate-CoA ligase and acetyl-activating enzyme) is a ubiquitous enzyme, found in both prokaryotes and eukaryotes, which catalyses the formation of acetyl-CoA from acetate, coenzyme A (CoA) and ATP as shown below :ATP + acetate + CoA = AMP + diphosphate + acetyl-CoA The activity of this enzyme is crucial for maintaining the required levels of acetyl-CoA, a key intermediate in many important biosynthetic and catabolic processes. It is especially important in eukayotic species as it is the only route for the activation of acetate to acetyl-CoA in these organisms (some prokaryotic species can also activate acetate by either acetate kinase/phosphotransacetylase or by ADP-forming acetyl-CoA synthase). Eukaryotes typically have two isoforms of acetyl-CoA synthase, a cytosolic form involved in biosynthetic processes and a mitochondrial form primarily involved in energy generation. The crystal structures of a eukaryotic (Q01574 from SWISSPROT, from yeast) and bacterial (Q8ZKF6 from SWISSPROT, from Salmonella) form of this enzyme have been determined , . The yeast enzyme is trimeric, while the bacterial enzyme is monomeric. The trimeric state of the yeast protein may be unique to this organism however, as the residues involved in the trimer interface are poorly conserved in other sequences. Despite differences in the oligomeric state of the two enzyme, the structures of the monomers are almost identical. A large N-terminal domain (~500 residues) containing two parallel beta sheets is followed by a small (~110 residues) C-terminal domain containing a three-stranded beta sheet with helices. The active site occurs at the domain interface, with its contents determining the orientation of the C-terminal domain.; GO: 0003987 acetate-CoA ligase activity, 0016208 AMP binding. Probab=26.59 E-value=12 Score=16.19 Aligned_cols=19 Identities=5% Similarity=-0.118 Sum_probs=7.3 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9999976551013357887 Q gi|254781193|r 278 NNRMRASRSAVLKLNTLVE 296 (675) Q Consensus 278 ~~r~~~~k~~~~~~~~~~~ 296 (675) +|++...+....-.++|.+ T Consensus 389 WY~~~vG~~~CpiVDTWWQ 407 (643) T TIGR02188 389 WYYKVVGKERCPIVDTWWQ 407 (643) T ss_pred HHHHCCCCCEEEEECCCCC T ss_conf 9987048952007616545 No 155 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=26.54 E-value=25 Score=14.08 Aligned_cols=74 Identities=16% Similarity=0.168 Sum_probs=53.8 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCHHHHHHHHHHC----------CCCCCHHHHHHHHHHHHHHHH-HH Q ss_conf 8532899999989989999999963010400015133666653115----------621379999999999999999-99 Q gi|254781193|r 150 HREAWRLFGEYCKGDVTVTREIFKRLEPLSSKEQELWYLDQTINDR----------GYCVDVALAHTLNQIVDVERT-KL 218 (675) Q Consensus 150 ~~~~~~~~~~Ya~~Dv~~t~~L~~~l~~~~~~E~~~~~~~~~me~~----------Gi~vD~e~l~~l~~~~~~~~~-~l 218 (675) .|.+......|+=.+-+.-+.....|.|--+-|.+.....|.|=.+ |+=.|+=+.+.+++....-++ +. T Consensus 164 PPSNLAvvGlY~F~pPe~if~~~~~~kPSwRGElEITD~IQ~lIe~Gy~V~~~~v~GWWkDTGk~eDLL~AN~~iLd~~~ 243 (361) T TIGR01208 164 PPSNLAVVGLYMFRPPELIFEAIKNIKPSWRGELEITDAIQYLIEKGYKVGGSKVKGWWKDTGKPEDLLDANRLILDEEV 243 (361) T ss_pred CCCCCEEEEEEECCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEEEEECCCCCCHHHHHHHHHHHHHHC T ss_conf 78770123345338777988898528887766215775775653158377779997887015980238999999744322 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254781193|r 219 DQELE 223 (675) Q Consensus 219 ~~~i~ 223 (675) +.+++ T Consensus 244 ~~~v~ 248 (361) T TIGR01208 244 EREVK 248 (361) T ss_pred CHHCC T ss_conf 32006 No 156 >COG3935 DnaD Putative primosome component and related proteins [DNA replication, recombination, and repair] Probab=25.40 E-value=38 Score=12.88 Aligned_cols=66 Identities=11% Similarity=0.105 Sum_probs=37.5 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999887777558888889885338998721304823899999999976551013357887408689 Q gi|254781193|r 237 EKLKTWLFLETGLHLEDLTETTCQRVLSSNKKLTPQAIQVFNNRMRASRSAVLKLNTLVEAMNSDG 302 (675) Q Consensus 237 ~ql~~~L~~~lg~~~~~~~k~t~~~~L~~~~~~~~~~~~~l~~r~~~~k~~~~~~~~~~~~~~~dg 302 (675) +.+...+.+.+|.+.....-.++...+.........|.+.|.......|.+-.|+++++.....+| T Consensus 128 k~l~~~F~~e~Gr~lsP~e~E~L~~wld~d~~~~elI~~ALkeAv~~gK~n~~YI~~IL~nW~k~g 193 (246) T COG3935 128 KELFRDFEEEFGRMLSPFEIEDLQKWLDEDSHDPELIKAALKEAVENGKLNFKYIDAILRNWKKNG 193 (246) T ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCC T ss_conf 889999999818747845199999999832898999999999999858746899999999999838 No 157 >pfam12030 DUF3517 Domain of unknown function (DUF3517). This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 340 amino acids in length. This domain is found associated with pfam00443. Probab=24.17 E-value=29 Score=13.66 Aligned_cols=16 Identities=13% Similarity=0.133 Sum_probs=7.4 Q ss_pred HHHCCCCHHHHCCHHH Q ss_conf 7644879567062011 Q gi|254781193|r 414 ARSFNVAPAQVTKDQR 429 (675) Q Consensus 414 ~~~~g~~~~~~~~~~R 429 (675) ..+|..+.+++..+.| T Consensus 264 e~LF~~e~e~~~~~~~ 279 (337) T pfam12030 264 QILFSGEAEDMGEDWQ 279 (337) T ss_pred HHHCCCCCCCCCHHHH T ss_conf 9873898511176899 No 158 >KOG0922 consensus Probab=23.51 E-value=36 Score=13.06 Aligned_cols=15 Identities=33% Similarity=0.408 Sum_probs=7.8 Q ss_pred HHHHHHHCCCCCEEE Q ss_conf 999998638898199 Q gi|254781193|r 55 QVLKEALENTDVTIV 69 (675) Q Consensus 55 ~~l~~~L~~~~~~kV 69 (675) +++...+++..+++| T Consensus 57 ~~il~~ve~nqvlIv 71 (674) T KOG0922 57 DQILYAVEDNQVLIV 71 (674) T ss_pred HHHHHHHHHCCEEEE T ss_conf 999999987877999 No 159 >COG1029 FwdB Formylmethanofuran dehydrogenase subunit B [Energy production and conversion] Probab=23.10 E-value=42 Score=12.60 Aligned_cols=76 Identities=14% Similarity=0.156 Sum_probs=46.4 Q ss_pred HEEEECC--CCCEEEEEECCCCCHHHHHHH------CCCCCHHHHHHH---CCCCHHHHHHHHCCCCHHHHCCHHHHHCC Q ss_conf 3012668--987899983453723333231------146201134541---13310101476448795670620111016 Q gi|254781193|r 365 RSCVIAS--EGKKLVVADLAGIEARVLAWM------AGEQWKLKAFET---GEDLYVTTYARSFNVAPAQVTKDQRAIGK 433 (675) Q Consensus 365 R~~f~a~--~G~~lv~aDySqiElRvlA~l------s~d~~l~~af~~---g~D~h~~tA~~~~g~~~~~~~~~~R~~aK 433 (675) |..|.++ ++..+|+.|----+---+|-+ ..|-.++.+++. |.-+|.. ..++|+|++++ .+.-..-| T Consensus 167 RG~~~~rGr~dRtvIvVD~RkT~TAklad~~~qi~p~sDyelisAl~~~l~G~~~~~~--eev~gvp~~~i-~e~a~~mK 243 (429) T COG1029 167 RGFFRPRGREDRTVIVVDPRKTATAKLADNHVQIKPNSDYELISALRAALHGKEPHRS--EEVAGVPIEEI-EELADMMK 243 (429) T ss_pred CCCCCCCCCCCCEEEEEECCCCCHHHHHHHEEECCCCCCHHHHHHHHHHHCCCCCCCC--HHHCCCCHHHH-HHHHHHHH T ss_conf 6422457766644999806767256654321756899867999999998558999872--42148798999-99999986 Q ss_pred CCEEEEECCC Q ss_conf 5202100288 Q gi|254781193|r 434 VMELALGYQG 443 (675) Q Consensus 434 ~~~~g~~YG~ 443 (675) -++||.+|++ T Consensus 244 na~Fg~if~G 253 (429) T COG1029 244 NAKFGAIFVG 253 (429) T ss_pred CCCCCEEEEE T ss_conf 3981249983 No 160 >pfam12096 DUF3572 Protein of unknown function (DUF3572). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are about 100 amino acids in length. Probab=21.84 E-value=37 Score=12.96 Aligned_cols=21 Identities=38% Similarity=0.643 Sum_probs=16.9 Q ss_pred CCHHHHHHHCCCCCHHHHHHH Q ss_conf 723333231146201134541 Q gi|254781193|r 384 IEARVLAWMAGEQWKLKAFET 404 (675) Q Consensus 384 iElRvlA~ls~d~~l~~af~~ 404 (675) |=+.+++|+++|+.++..|.. T Consensus 9 ~Al~aL~wla~d~ell~~FL~ 29 (89) T pfam12096 9 IALQALAWLAGDPELLPRFLA 29 (89) T ss_pred HHHHHHHHHCCCHHHHHHHHH T ss_conf 999999999389889999998 No 161 >TIGR00584 mug mismatch-specific thymine-DNA glycosylate (mug); InterPro: IPR003310 G:U mismatches resulting from deamination of cytosine are the most common promutagenic lesions occurring in DNA. Uracil is removed in a base-excision repair pathway by uracil DNA-glycosylase (UDG), which excises uracil from both single- and double-stranded DNA. A biochemically distinct family of DNA repair enzymes which excises both uracil and thymine, but only from mispairs with guanine has been identified . Crystal structures of the mismatch-specific uracil DNA-glycosylase (MUG) from Escherichia coli, and of a DNA complex, reveal structural and functional homology to UDG despite low sequence identity .; GO: 0016798 hydrolase activity acting on glycosyl bonds, 0030983 mismatched DNA binding, 0006298 mismatch repair. Probab=21.62 E-value=22 Score=14.48 Aligned_cols=53 Identities=25% Similarity=0.257 Sum_probs=39.1 Q ss_pred EEEECCCCCHHHHHHH-----CCCCCHHHHHHHCCCCHHHHHHHHCCCCHHHHCCHHHHHCCCCEEEE Q ss_conf 9983453723333231-----14620113454113310101476448795670620111016520210 Q gi|254781193|r 377 VVADLAGIEARVLAWM-----AGEQWKLKAFETGEDLYVTTYARSFNVAPAQVTKDQRAIGKVMELAL 439 (675) Q Consensus 377 v~aDySqiElRvlA~l-----s~d~~l~~af~~g~D~h~~tA~~~~g~~~~~~~~~~R~~aK~~~~g~ 439 (675) -++|+|.-|+|=-|-+ -.=.--+..|..|.-||..+...+||+ ++-|...||+ T Consensus 215 ~~~DL~~~E~~~G~RIL~~K~~~Y~P~~~~F~sGK~IyE~~~K~~~G~----------K~PK~l~FG~ 272 (347) T TIGR00584 215 GSKDLSKKEFREGARILVEKLKKYRPRVAVFISGKCIYEIFSKEVFGV----------KVPKNLEFGL 272 (347) T ss_pred CCCCHHHHHHHCCHHHHHHHHHHCCCCEEEEECCCEEEEEHHHHEECE----------ECCCCEECCC T ss_conf 611000565322205678887531880578844832112110010021----------2353013067 No 162 >TIGR02186 alph_Pro_TM conserved hypothetical protein; InterPro: IPR011734 This family consists of predicted transmembrane proteins of about 270 amino acids. Members are found, so far, only among the Alphaproteobacteria and only once in each genome.. Probab=21.22 E-value=46 Score=12.36 Aligned_cols=60 Identities=13% Similarity=0.088 Sum_probs=40.6 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCHHHHHEEEE-CCCCCEEEEEECCCCCHHHHHHHCCCCCHHH Q ss_conf 788844488888651134554310123333455443210033330126-6898789998345372333323114620113 Q gi|254781193|r 322 CVFPPQNLPRPSRSHEELDQIITHLPTSITSDPDPLQIISDCVRSCVI-ASEGKKLVVADLAGIEARVLAWMAGEQWKLK 400 (675) Q Consensus 322 ~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~-a~~G~~lv~aDySqiElRvlA~ls~d~~l~~ 400 (675) +-|.+||+|+--+.|.......+ ++-+ ..--.+=|..||++-|+-+...+-+++.++. T Consensus 18 ~~a~aQ~~~~alPPGPQ~~p~~E---------------------~leig~St~~i~itsnF~G~~LtlFGa~~~~d~~~~ 76 (275) T TIGR02186 18 TVAAAQVLELALPPGPQVRPLEE---------------------RLEIGVSTEEIAITSNFRGADLTLFGALDNADPLLL 76 (275) T ss_pred CCHHHCCCCCCCCCCCCCCCCCE---------------------EEEEEEECCEEEECCCCCCCCEEEEECCCCCCCCHH T ss_conf 76224264102777788887430---------------------168865357565156889876078601266330167 Q ss_pred HH Q ss_conf 45 Q gi|254781193|r 401 AF 402 (675) Q Consensus 401 af 402 (675) +- T Consensus 77 ~~ 78 (275) T TIGR02186 77 AQ 78 (275) T ss_pred HC T ss_conf 53 No 163 >pfam05551 DUF1519 Protein of unknown function (DUF1519). This family consists of several putative homing endonuclease proteins of around 245 residues in length which appear to be found exclusively in Naegleria species. The function of this family is unclear. Probab=20.79 E-value=47 Score=12.30 Aligned_cols=113 Identities=12% Similarity=-0.005 Sum_probs=58.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCC---CCCCCCHHHHH----HHHHHHCCCCCC-CCCCCCHHHHHHHHH Q ss_conf 6213799999999999999999999999872885---47678879998----877775588888-898853389987213 Q gi|254781193|r 196 GYCVDVALAHTLNQIVDVERTKLDQELESLTYGL---VSSSRCLEKLK----TWLFLETGLHLE-DLTETTCQRVLSSNK 267 (675) Q Consensus 196 Gi~vD~e~l~~l~~~~~~~~~~l~~~i~~~~~~~---~~n~~S~~ql~----~~L~~~lg~~~~-~~~k~t~~~~L~~~~ 267 (675) -+.+|.+..+..++-+...+.--...+..++... ...++..+|.. +||...-|-... .++|++.=+.|..+. T Consensus 28 tVkID~~sFk~GyDpLv~tidYGYsKmAkitv~k~drlskL~~~K~akniFneWl~nrkg~kGRsGkQkp~~fdeLkKld 107 (245) T pfam05551 28 TVKIDRSSFKDGYDPLVNTIDYGYSKMAKITVVKEDRLSKLSARKQAKNIFNEWLNNRKGGKGRSGKQKPTCFDELKKLD 107 (245) T ss_pred EEEEEHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 99960788424652355520453788743764465435442113578999999986077877766786874044556531 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEE Q ss_conf 0482389999999997655101335788740868967677884 Q gi|254781193|r 268 KLTPQAIQVFNNRMRASRSAVLKLNTLVEAMNSDGRLRGTLQF 310 (675) Q Consensus 268 ~~~~~~~~~l~~r~~~~k~~~~~~~~~~~~~~~dgRih~~~~~ 310 (675) -..++.-+.|.- .++|+.+.+--.|+..-.+|+.+|+--.+ T Consensus 108 ~C~HeFGECLiG--AanKtKsGFKvrFln~kGSDsYVHHVS~f 148 (245) T pfam05551 108 ICEHEFGECLIG--AANKTKSGFKVRFLNDKGSDSYVHHVSIL 148 (245) T ss_pred HHHCCCCHHHCC--CCCCCCCCCEEEEECCCCCCCCEEEHHHH T ss_conf 221344121115--35466666378850278876301102355 Done!