Query gi|254780127|ref|YP_003064540.1| putative DNA polymerase from bacteriophage origin [Candidatus Liberibacter asiaticus str. psy62] Match_columns 675 No_of_seqs 183 out of 2594 Neff 8.1 Searched_HMMs 39220 Date Sun May 22 11:13:06 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780127.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK07556 consensus 100.0 0 0 881.4 42.1 539 2-674 388-974 (977) 2 PRK06887 consensus 100.0 0 0 879.0 42.9 538 2-674 357-951 (954) 3 PRK07997 consensus 100.0 0 0 873.5 40.8 538 2-674 350-925 (928) 4 PRK05755 DNA polymerase I; Pro 100.0 0 0 870.2 42.0 537 2-674 320-886 (889) 5 PRK08786 consensus 100.0 0 0 870.1 40.4 538 2-674 349-924 (927) 6 PRK07625 consensus 100.0 0 0 864.8 42.6 538 2-674 345-919 (922) 7 PRK08835 consensus 100.0 0 0 862.0 40.9 538 2-674 352-928 (931) 8 PRK05929 consensus 100.0 0 0 852.8 42.9 535 2-674 311-868 (870) 9 PRK07456 consensus 100.0 0 0 852.1 42.8 540 3-674 384-972 (975) 10 PRK07898 consensus 100.0 0 0 850.3 42.4 532 2-674 338-898 (902) 11 PRK08928 consensus 100.0 0 0 850.6 41.2 497 27-674 335-856 (861) 12 PRK08076 consensus 100.0 0 0 841.1 42.1 531 4-674 317-874 (877) 13 PRK05797 consensus 100.0 0 0 843.2 40.3 517 3-674 321-866 (869) 14 PRK08434 consensus 100.0 0 0 839.0 40.9 532 2-674 319-884 (887) 15 PRK07300 consensus 100.0 0 0 818.8 39.7 527 5-674 323-877 (880) 16 TIGR00593 pola DNA polymerase 100.0 0 0 822.0 26.0 502 64-674 464-1002(1005) 17 COG0749 PolA DNA polymerase I 100.0 0 0 767.9 32.0 534 4-674 25-590 (593) 18 cd06444 DNA_pol_A DNA polymera 100.0 0 0 618.0 29.5 366 201-674 1-378 (379) 19 pfam00476 DNA_pol_A DNA polyme 100.0 0 0 620.3 27.0 366 201-674 1-381 (383) 20 smart00482 POLAc DNA polymeras 100.0 0 0 356.5 10.8 200 361-639 1-206 (206) 21 cd06128 DNA_polA_exo The 3'-5' 99.9 5.5E-24 1.4E-28 176.9 15.8 151 2-179 2-166 (166) 22 cd06139 DNA_polA_I_Ecoli_like_ 99.9 1.1E-23 2.8E-28 174.9 16.6 173 2-201 6-190 (193) 23 cd06140 DNA_polA_I_Bacillus_li 99.8 3.2E-18 8.1E-23 138.7 16.3 164 2-197 4-171 (178) 24 cd00007 35EXOc 3'-5' exonuclea 99.8 4.8E-18 1.2E-22 137.6 15.5 148 2-179 1-153 (155) 25 KOG3657 consensus 99.8 2.9E-17 7.3E-22 132.4 15.4 271 357-666 716-1014(1075) 26 cd06142 RNaseD_exo Ribonucleas 99.8 8.2E-17 2.1E-21 129.4 17.1 150 2-186 13-165 (178) 27 pfam01612 3_5_exonuc 3'-5' exo 99.7 7.9E-17 2E-21 129.5 14.4 148 2-181 21-171 (172) 28 smart00474 35EXOc 3'-5' exonuc 99.7 1.4E-16 3.6E-21 127.8 14.9 146 2-181 22-171 (172) 29 cd06147 Rrp6p_like_exo Yeast R 99.6 3.2E-14 8.1E-19 112.3 15.6 156 2-191 25-183 (192) 30 COG0349 Rnd Ribonuclease D [Tr 99.6 2.7E-14 6.9E-19 112.8 14.2 160 2-195 18-180 (361) 31 cd06148 Egl_like_exo The Egali 99.6 1.6E-13 4E-18 107.7 15.8 157 2-184 11-180 (197) 32 cd06129 RNaseD_like The RNase 99.5 3.8E-12 9.6E-17 98.5 14.9 138 3-174 17-158 (159) 33 cd06141 WRN_exo WRN is a uniqu 99.4 8.8E-12 2.2E-16 96.1 14.8 146 2-177 19-168 (168) 34 PRK10829 ribonuclease D; Provi 99.3 2.6E-09 6.6E-14 79.8 20.6 145 2-181 23-170 (373) 35 cd06146 mut-7_like_exo The mut 99.3 3.1E-10 8E-15 85.9 15.4 146 2-177 23-191 (191) 36 PRK06631 consensus 99.3 2.9E-10 7.3E-15 86.1 13.2 85 57-178 80-173 (229) 37 PRK05711 DNA polymerase III su 99.2 1.6E-09 4E-14 81.2 14.0 63 56-120 79-150 (240) 38 PRK06807 DNA polymerase III su 99.2 2.7E-09 6.9E-14 79.7 15.2 80 64-181 87-172 (313) 39 PRK07009 consensus 99.2 1.4E-09 3.5E-14 81.6 13.6 64 56-121 76-147 (241) 40 PRK08074 bifunctional ATP-depe 99.2 3.6E-09 9.2E-14 78.8 15.6 110 2-122 4-147 (932) 41 PRK08816 consensus 99.2 2.6E-09 6.6E-14 79.8 14.8 110 1-121 1-148 (244) 42 PRK06022 consensus 99.1 1.2E-09 3E-14 82.1 11.3 64 56-121 76-145 (234) 43 PRK07982 consensus 99.1 5.5E-09 1.4E-13 77.6 14.4 65 55-121 84-157 (243) 44 TIGR01388 rnd ribonuclease D; 99.1 1.9E-09 4.9E-14 80.7 11.9 157 3-195 20-181 (374) 45 PRK06859 consensus 99.1 8.9E-09 2.3E-13 76.2 14.8 64 56-121 84-155 (253) 46 PRK07740 hypothetical protein; 99.1 6E-09 1.5E-13 77.4 13.9 130 2-180 58-221 (240) 47 cd06126 DEDDy DEDDy exonucleas 99.1 5.3E-09 1.3E-13 77.7 13.3 146 3-174 1-180 (181) 48 PRK08517 DNA polymerase III su 99.1 4.5E-09 1.2E-13 78.2 12.8 87 56-181 142-233 (259) 49 PRK06195 DNA polymerase III su 99.1 2.8E-09 7.1E-14 79.6 11.5 62 56-119 73-140 (309) 50 cd05781 DNA_polB_B3_exo The 3' 99.1 1E-08 2.6E-13 75.9 14.3 155 1-177 3-188 (188) 51 TIGR01406 dnaQ_proteo DNA poly 99.1 1.7E-09 4.2E-14 81.1 9.9 67 53-121 79-155 (246) 52 PRK07246 bifunctional ATP-depe 99.1 7.5E-09 1.9E-13 76.7 13.2 106 2-121 8-147 (820) 53 PRK06310 DNA polymerase III su 99.0 1.2E-08 3E-13 75.4 13.5 85 57-179 81-172 (250) 54 cd06131 DNA_pol_III_epsilon_Ec 99.0 1.8E-08 4.5E-13 74.3 13.7 79 64-177 80-166 (167) 55 PRK06309 DNA polymerase III su 99.0 1.8E-08 4.5E-13 74.3 13.4 85 57-179 73-164 (232) 56 PRK08858 consensus 99.0 3.6E-08 9.2E-13 72.2 14.8 65 55-121 89-162 (246) 57 PRK07883 hypothetical protein; 99.0 2.8E-08 7E-13 73.0 14.0 63 57-121 98-168 (575) 58 PRK07942 DNA polymerase III su 99.0 3.1E-07 7.9E-12 66.0 18.7 79 64-179 92-178 (234) 59 cd05785 DNA_polB_like2_exo A s 98.9 7.1E-08 1.8E-12 70.3 14.1 154 1-174 9-206 (207) 60 cd06130 DNA_pol_III_epsilon_li 98.9 4.9E-08 1.2E-12 71.4 12.9 63 56-120 70-138 (156) 61 TIGR01405 polC_Gram_pos DNA po 98.9 2.4E-08 6E-13 73.4 11.2 65 56-122 278-349 (1264) 62 cd06134 RNaseT RNase T is a DE 98.9 4.5E-08 1.2E-12 71.6 12.6 79 66-179 103-187 (189) 63 PRK09145 DNA polymerase III su 98.9 1.5E-07 3.8E-12 68.1 14.8 128 2-179 30-199 (203) 64 PRK06063 DNA polymerase III su 98.9 4E-08 1E-12 71.9 11.7 66 54-121 85-156 (313) 65 TIGR01407 dinG_rel DnaQ family 98.8 3.7E-07 9.5E-12 65.5 14.0 108 2-121 1-143 (944) 66 PRK05168 ribonuclease T; Provi 98.8 4.1E-07 1E-11 65.2 13.6 79 66-179 115-199 (213) 67 cd05782 DNA_polB_like1_exo A s 98.8 4.5E-07 1.2E-11 65.0 13.8 131 28-176 51-208 (208) 68 cd05780 DNA_polB_Kod1_like_exo 98.7 4.6E-07 1.2E-11 64.9 12.0 152 1-177 3-195 (195) 69 PRK00448 polC DNA polymerase I 98.7 6.4E-07 1.6E-11 64.0 12.1 57 64-120 497-559 (1436) 70 PRK09146 DNA polymerase III su 98.6 7.9E-07 2E-11 63.3 11.3 56 64-120 128-203 (239) 71 cd06125 DnaQ_like_exo The DnaQ 98.6 5.5E-07 1.4E-11 64.4 10.3 89 4-101 1-119 (130) 72 cd05784 DNA_polB_II_exo The 3' 98.6 1.1E-07 2.8E-12 69.0 6.6 156 1-175 3-193 (193) 73 PRK07983 exodeoxyribonuclease 98.6 8.6E-07 2.2E-11 63.1 10.8 84 56-178 66-151 (219) 74 PRK07247 DNA polymerase III su 98.5 4.2E-06 1.1E-10 58.6 12.6 83 57-180 78-168 (195) 75 PRK05762 DNA polymerase II; Re 98.5 4.6E-06 1.2E-10 58.3 12.6 156 1-178 155-347 (785) 76 cd06127 DEDDh DEDDh exonucleas 98.5 4.8E-06 1.2E-10 58.2 12.0 46 56-103 72-120 (159) 77 COG2176 PolC DNA polymerase II 98.5 5.7E-07 1.5E-11 64.3 6.8 57 64-120 500-562 (1444) 78 smart00479 EXOIII exonuclease 98.4 9.5E-06 2.4E-10 56.2 12.1 87 56-179 73-165 (169) 79 cd05777 DNA_polB_delta_exo The 98.4 8.1E-06 2.1E-10 56.7 11.6 157 1-180 7-226 (230) 80 cd05160 DNA_polB_exo The 3'-5' 98.4 6.6E-06 1.7E-10 57.3 11.0 147 3-175 1-200 (200) 81 cd06136 TREX1_2 Three prime re 98.4 2.7E-06 6.9E-11 59.8 9.0 54 64-119 97-156 (177) 82 TIGR00593 pola DNA polymerase 98.3 1.1E-06 2.9E-11 62.3 4.7 125 377-536 789-923 (1005) 83 COG0847 DnaQ DNA polymerase II 98.2 3.4E-05 8.8E-10 52.5 11.5 64 56-120 87-156 (243) 84 PRK09182 DNA polymerase III su 98.2 5E-06 1.3E-10 58.1 6.6 106 3-117 39-175 (293) 85 cd05779 DNA_polB_epsilon_exo T 98.1 8E-05 2.1E-09 50.1 11.8 147 2-174 3-203 (204) 86 cd06144 REX4_like This subfami 98.0 5.3E-05 1.3E-09 51.3 9.2 60 55-120 69-133 (152) 87 smart00486 POLBc DNA polymeras 98.0 0.00037 9.4E-09 45.7 13.4 157 1-177 3-220 (471) 88 COG3359 Predicted exonuclease 98.0 0.00024 6.2E-09 46.9 12.4 156 3-179 100-267 (278) 89 pfam00929 Exonuc_X-T Exonuclea 98.0 0.00013 3.3E-09 48.7 10.8 63 57-120 74-144 (162) 90 cd05783 DNA_polB_B1_exo The 3' 97.9 0.00015 3.8E-09 48.3 9.8 145 1-174 5-201 (204) 91 cd06133 ERI-1_3'hExo_like This 97.8 0.00075 1.9E-08 43.6 11.3 86 56-177 81-175 (176) 92 pfam10108 Exon_PolB Predicted 97.8 0.0024 6.2E-08 40.3 13.9 131 29-177 10-171 (211) 93 KOG2206 consensus 97.7 8.2E-05 2.1E-09 50.0 6.1 125 30-178 231-357 (687) 94 PTZ00166 DNA polymerase delta 97.7 0.003 7.7E-08 39.6 13.9 109 54-180 360-511 (1081) 95 COG0417 PolB DNA polymerase el 97.7 0.0023 5.7E-08 40.5 12.7 178 1-206 154-372 (792) 96 COG0749 PolA DNA polymerase I 97.6 8.7E-05 2.2E-09 49.9 4.7 63 463-534 448-510 (593) 97 cd06138 ExoI_N This subfamily 97.6 0.00078 2E-08 43.5 9.1 65 56-120 75-165 (183) 98 KOG1798 consensus 97.4 0.0059 1.5E-07 37.7 11.5 149 2-177 247-450 (2173) 99 TIGR00573 dnaq exonuclease, DN 97.3 0.0021 5.4E-08 40.7 8.2 90 52-178 77-176 (228) 100 PRK05601 DNA polymerase III su 97.0 0.0081 2.1E-07 36.8 9.0 29 54-84 116-144 (377) 101 cd05778 DNA_polB_zeta_exo The 97.0 0.015 3.7E-07 35.1 10.1 152 1-174 4-230 (231) 102 PRK06722 exonuclease; Provisio 96.9 0.033 8.5E-07 32.8 11.8 53 51-103 36-92 (242) 103 cd06149 ISG20 Interferon (IFN) 96.8 0.016 4.1E-07 34.9 9.0 60 55-120 69-136 (157) 104 PRK08928 consensus 96.7 0.002 5.1E-08 40.9 4.1 67 459-534 714-780 (861) 105 PRK07300 consensus 96.6 0.0023 5.8E-08 40.5 3.7 68 459-535 731-798 (880) 106 PRK06887 consensus 96.6 0.0028 7.2E-08 39.9 4.0 66 460-534 807-872 (954) 107 PRK07898 consensus 96.6 0.0033 8.5E-08 39.4 4.2 60 463-531 756-815 (902) 108 PRK07625 consensus 96.5 0.0036 9.1E-08 39.2 4.3 67 460-535 774-840 (922) 109 PRK07556 consensus 96.5 0.0031 7.8E-08 39.6 3.9 28 32-59 63-96 (977) 110 PRK08786 consensus 96.5 0.0034 8.6E-08 39.3 4.1 64 461-533 780-843 (927) 111 PRK05755 DNA polymerase I; Pro 96.5 0.0027 6.9E-08 40.0 3.6 68 460-536 742-809 (889) 112 PRK08835 consensus 96.5 0.0038 9.6E-08 39.0 4.1 62 462-532 784-845 (931) 113 PRK08076 consensus 96.4 0.0045 1.1E-07 38.5 4.2 67 459-534 728-794 (877) 114 PRK05929 consensus 96.4 0.0064 1.6E-07 37.5 4.7 64 459-532 723-786 (870) 115 PRK07997 consensus 96.3 0.0045 1.2E-07 38.5 3.8 64 460-532 780-843 (928) 116 PRK05797 consensus 96.2 0.0063 1.6E-07 37.5 3.8 62 460-530 721-782 (869) 117 cd06137 DEDDh_RNase This group 96.0 0.023 5.9E-07 33.8 6.1 56 58-120 78-140 (161) 118 PRK05761 DNA polymerase I; Rev 95.9 0.044 1.1E-06 32.0 7.1 122 52-203 200-350 (777) 119 cd05776 DNA_polB_alpha_exo The 95.6 0.16 4.2E-06 28.2 11.0 104 56-178 88-227 (234) 120 KOG0969 consensus 95.5 0.17 4.4E-06 28.1 10.5 113 50-179 338-492 (1066) 121 pfam00940 RNA_pol DNA-dependen 95.3 0.17 4.4E-06 28.0 8.3 175 426-638 153-328 (391) 122 PRK08434 consensus 95.2 0.033 8.5E-07 32.8 4.5 43 460-502 742-784 (887) 123 KOG2249 consensus 94.9 0.26 6.6E-06 26.9 8.5 83 54-178 175-263 (280) 124 TIGR03491 RecB family nuclease 94.8 0.28 7.2E-06 26.6 12.1 16 446-461 356-371 (457) 125 cd06145 REX1_like This subfami 94.8 0.17 4.3E-06 28.1 7.1 58 56-120 68-129 (150) 126 pfam03104 DNA_pol_B_exo DNA po 94.7 0.17 4.4E-06 28.0 7.1 76 1-82 86-185 (254) 127 pfam11074 DUF2779 Domain of un 94.3 0.27 6.9E-06 26.8 7.2 74 5-83 2-87 (127) 128 pfam03175 DNA_pol_B_2 DNA poly 93.7 0.17 4.4E-06 28.0 5.4 113 67-184 2-152 (452) 129 pfam04857 CAF1 CAF1 family rib 93.6 0.22 5.7E-06 27.3 5.8 59 62-120 137-214 (235) 130 pfam00476 DNA_pol_A DNA polyme 93.4 0.092 2.3E-06 29.8 3.4 57 460-525 236-292 (383) 131 PRK07456 consensus 92.6 0.17 4.3E-06 28.1 3.9 40 462-501 811-850 (975) 132 cd06135 Orn Oligoribonuclease 92.0 0.49 1.2E-05 25.0 5.6 49 61-117 90-138 (173) 133 KOG0950 consensus 91.3 0.17 4.2E-06 28.2 2.7 16 404-419 654-669 (1008) 134 cd06444 DNA_pol_A DNA polymera 91.3 0.21 5.2E-06 27.5 3.1 51 460-519 233-283 (379) 135 cd06143 PAN2_exo PAN2 is the c 91.1 0.47 1.2E-05 25.2 4.8 56 61-120 98-155 (174) 136 PHA00452 T3/T7-like RNA polyme 89.3 1.4 3.6E-05 22.0 9.7 229 375-645 505-761 (816) 137 TIGR01298 RNaseT ribonuclease 87.9 0.39 9.9E-06 25.7 2.4 37 65-101 105-148 (201) 138 PRK07748 sporulation inhibitor 82.0 3.2 8.2E-05 19.6 13.7 138 1-179 4-178 (205) 139 KOG4373 consensus 74.1 5.4 0.00014 18.1 6.2 20 194-213 185-204 (319) 140 TIGR00888 guaA_Nterm GMP synth 62.2 1.9 5E-05 21.1 -0.2 33 430-467 67-99 (195) 141 cd05531 POLBc_B2 DNA polymeras 56.8 12 0.0003 15.9 9.8 23 42-77 21-43 (352) 142 pfam09281 Taq-exonuc Taq polym 55.6 12 0.00031 15.8 4.3 40 75-116 71-111 (138) 143 TIGR00612 ispG_gcpE 4-hydroxy- 55.2 12 0.00031 15.8 5.8 21 620-640 600-623 (633) 144 PRK10017 putative pyruvyl tran 54.2 11 0.00028 16.1 2.5 71 363-458 298-368 (426) 145 COG1158 Rho Transcription term 52.8 13 0.00034 15.5 4.3 66 355-442 165-235 (422) 146 KOG2207 consensus 51.0 14 0.00036 15.3 7.0 15 406-420 457-471 (617) 147 KOG1275 consensus 49.7 5.4 0.00014 18.1 0.3 26 467-492 962-987 (1118) 148 KOG2304 consensus 49.5 15 0.00038 15.2 2.5 41 434-520 231-271 (298) 149 TIGR02336 TIGR02336 conserved 41.1 2.2 5.5E-05 20.8 -2.8 54 382-443 338-400 (767) 150 TIGR00922 nusG transcription t 41.0 16 0.00041 15.0 1.7 15 365-382 58-72 (193) 151 pfam07794 DUF1633 Protein of u 40.9 20 0.00052 14.3 2.9 10 322-331 374-383 (698) 152 TIGR01512 ATPase-IB2_Cd cadmiu 37.2 23 0.00059 14.0 2.8 22 265-286 267-288 (494) 153 COG1029 FwdB Formylmethanofura 36.8 21 0.00054 14.2 1.7 76 365-443 167-253 (429) 154 TIGR01591 Fdh-alpha formate de 33.9 8 0.0002 17.0 -0.8 24 360-383 395-418 (694) 155 cd05532 POLBc_alpha DNA polyme 33.9 26 0.00067 13.6 6.9 27 39-78 22-48 (400) 156 COG2388 Predicted acetyltransf 31.4 29 0.00073 13.3 2.6 24 603-626 54-77 (99) 157 cd07922 CarBa CarBa is the A s 28.5 25 0.00064 13.7 1.0 52 391-451 12-63 (81) 158 cd01779 Myosin_IXb_RA Myosin_I 28.2 31 0.00078 13.2 1.4 38 602-639 33-76 (105) 159 PRK06487 glycerate dehydrogena 28.1 33 0.00083 13.0 4.3 64 1-83 1-64 (317) 160 KOG0184 consensus 28.0 33 0.00083 13.0 3.1 76 550-648 140-239 (254) 161 TIGR01075 uvrD DNA helicase II 27.3 16 0.0004 15.1 -0.2 13 167-179 190-202 (741) 162 TIGR02049 gshA_ferroox glutama 27.0 8.3 0.00021 16.9 -1.7 20 187-208 219-238 (436) 163 PRK05359 oligoribonuclease; Pr 27.0 34 0.00087 12.9 6.8 11 2-12 4-14 (181) 164 TIGR02093 P_ylase glycogen/sta 26.1 35 0.0009 12.8 6.9 48 238-285 482-529 (822) 165 TIGR01418 PEP_synth phosphoeno 25.8 36 0.00091 12.7 1.4 41 613-668 710-755 (877) 166 KOG1401 consensus 25.7 33 0.00083 13.0 1.2 12 642-653 415-426 (433) 167 PRK00050 mraW S-adenosyl-methy 24.6 38 0.00096 12.6 1.7 14 235-248 177-190 (309) 168 TIGR01113 mtrE tetrahydrometha 23.1 40 0.001 12.4 2.0 26 216-242 12-37 (298) 169 TIGR02201 heptsyl_trn_III lipo 21.9 38 0.00097 12.5 0.9 26 617-649 212-238 (347) 170 TIGR00972 3a0107s01c2 phosphat 21.7 37 0.00095 12.6 0.8 25 64-88 61-88 (248) 171 TIGR00629 uvde UV damage endon 21.6 34 0.00087 12.8 0.6 90 157-253 151-241 (345) 172 TIGR00946 2a69 Auxin Efflux Ca 20.9 44 0.0011 12.1 1.8 20 383-402 345-364 (608) 173 TIGR02186 alph_Pro_TM conserve 20.9 44 0.0011 12.1 1.8 156 323-519 18-176 (275) 174 TIGR02109 PQQ_syn_pqqE coenzym 20.1 39 0.001 12.5 0.6 14 501-514 203-216 (363) No 1 >PRK07556 consensus Probab=100.00 E-value=0 Score=881.43 Aligned_cols=539 Identities=21% Similarity=0.231 Sum_probs=451.2 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCC------------------HHHHHHHHHHCC Q ss_conf 78999804067887000460000268829999999838228999788842------------------217999998648 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQS------------------AMPSDLLQYLRD 63 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~------------------~~~~~l~~~L~~ 63 (675) +.+++|+||.|.. +. +++++||++|++.+..+|+|+.|.. .+...|+++|++ T Consensus 388 ~~~a~DtETtsld--------~~--~a~lVGisls~~~~~a~YIPl~h~~~~~~~~~~~~~~~ql~~~~vL~~LkplLed 457 (977) T PRK07556 388 GIVAFDTETTSLD--------PM--QAELVGFSLALAPGRACYIPLGHKSGDGDLFGGGLAEGQIPLRDALAALKPVLED 457 (977) T ss_pred CEEEEEEECCCCC--------HH--HCCEEEEEEEECCCCEEEEECCCCCCCCCHHCCCCCHHHCCHHHHHHHHHHHHCC T ss_conf 9589997318878--------30--1553579998068978999614445642011022223230599999998988538 Q ss_pred CCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHH Q ss_conf 898199863077599999862998786504545889998588---999789999-8088986527663765766410023 Q gi|254780127|r 64 ETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLARTNGL---PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGS 139 (675) Q Consensus 64 ~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~ 139 (675) +++.||+||+|||+.+|.++ |+.+.+ ++||||+||++++ .++|+.+++ +|+.....+++..|+.-. T Consensus 458 ~~i~KighnlK~d~~vL~~~-gi~l~~--i~DtmLaaYLLdp~~~~~~Ld~La~~yL~~~~~~~~dl~g~gk~------- 527 (977) T PRK07556 458 PSVLKIGQNMKYDALVLARH-GITVAP--FDDTMLISYALDAGRGGHGMDALSERWLGHTPIAYKDLTGSGKS------- 527 (977) T ss_pred CCHHHHHHHHHHHHHHHHHC-CCCCCC--CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCHHCCCCCCC------- T ss_conf 73114441256889999976-960268--53188888883488664787999999648876210000345501------- Q ss_pred HHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC----------HHHHHHHHHHHHHHHHCCEECCHHHHHHH Q ss_conf 32013453112587689998988775399999999872002----------14310178999987516843088999999 Q gi|254780127|r 140 IDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIPL----------SDGERDLWLLDQTINDRGYRIDLDLVLKL 209 (675) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~----------~~~E~~l~~~~~~me~~Gi~vD~e~l~~~ 209 (675) ... ....+.+....||+.|++++++||..|.+- .+.|+++..++.+||.+||.||.+.++++ T Consensus 528 ------~~~--f~~v~~~~~~~YaaedA~~t~~L~~~L~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~vD~~~L~~l 599 (977) T PRK07556 528 ------AIT--FDQVDIDRATAYAAEDADVTLRLWQVLKPRLAAEGLTTVYERLERPLVPVLARMEERGISVDRQVLSRL 599 (977) T ss_pred ------CCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHH T ss_conf ------047--655206899999999999999999999999984235789999997899999999856969869999999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCH----HHHHHHHHHC--CCCCHHHHHHHHHHH Q ss_conf 9999999999999998630576688885688999999972899988896----5889999744--898548999999999 Q gi|254780127|r 210 QELIAQERKKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDMSE----GTLKSILSHS--NITQLAKDLILNRLA 283 (675) Q Consensus 210 ~~~~~~~~~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~~k----~~~~~~l~~~--~~~~~~~~ll~~r~~ 283 (675) .+++.++..+++.+|++++|.++ |+|||+||+++||+.+|+|..+.+| +|..++|+.. +.+|.++.+++||+. T Consensus 600 ~~~l~~~l~~le~~I~~~aG~eF-Ni~SpkQL~~iLF~~l~lp~~kktKtG~~STd~~vLe~L~~~~h~i~~~iL~yR~l 678 (977) T PRK07556 600 SGEFAQKAARLEAEIYELAGEPF-NIGSPKQLGDILFGKMGLPGGSKTKTGQWSTSAQVLEDLAAEGHELPRKILDWRQL 678 (977) T ss_pred HHHHHHHHHHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 99999999999999998708987-87889999999998608988886888997970999999874578568999999889 Q ss_pred HHHHCCHHHHHHHHHHCCC-CCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCH Q ss_conf 7543000356688730568-845220021132210011106888554777766523456544310244455444322101 Q gi|254780127|r 284 SSGSAILKLNTLSEAVSSD-GRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASD 362 (675) Q Consensus 284 ~~~~~~~~~~~l~~~~~~d-GRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (675) .|+.+ +|++.|.+.++++ ||||++|+|.+ |.|||+||++||||||| ++++.|+ T Consensus 679 ~KL~s-TY~d~L~~~i~~~~gRiHt~fnq~~-t~TGRlSS~~PNLQNIP------------------------ir~e~G~ 732 (977) T PRK07556 679 SKLKS-TYTDALPGYIHPQTGRVHTSYALAA-TTTGRLSSSDPNLQNIP------------------------VRTEEGR 732 (977) T ss_pred HHHHH-HHHHHHHHHCCCCCCEECCCHHHHC-EECCCCCCCCCCCCCCC------------------------CCCCCCC T ss_conf 99999-9999988623789880777302323-10576456998646788------------------------8763230 Q ss_pred HHHEEEECCCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHC Q ss_conf 32121476998789997335485058866228767866540444100001577608751105702210064210013307 Q gi|254780127|r 363 CVRSCIIASNGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQ 442 (675) Q Consensus 363 ~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG 442 (675) .||+||+|+||++|++|||||||||||||+|+|+.|++||++|.|||+.||+.+||+|.++|++++|+.||++|||++|| T Consensus 733 ~iR~aFv~~~g~~l~saDYSQIELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~~~~~v~~~~R~~aK~iNfgi~YG 812 (977) T PRK07556 733 KIRTAFIATPGHKLVSADYSQIELRILAHIADIPALKQAFADGLDIHAMTASEMFGVPVEGMPSEVRRRAKAINFGIIYG 812 (977) T ss_pred HHHEEECCCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHCCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHEEEEHHHHC T ss_conf 32321604999879983366889999999817889998763598778999999959993548999986526042034556 Q ss_pred CCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 88778988788708886403666665666765478899999999987511101031088978899999986730789999 Q gi|254780127|r 443 GGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWK 522 (675) Q Consensus 443 ~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~ 522 (675) ||+.+|.+ ..+++..||+.+++.||+.||+|..|.. T Consensus 813 ~~~~gL~~--------------------------------------------~l~i~~~eA~~~i~~yf~~y~~v~~~~~ 848 (977) T PRK07556 813 ISAFGLAN--------------------------------------------QLGIPREEAGAYIKRYFERFPGIRAYMD 848 (977) T ss_pred CCHHHHHH--------------------------------------------HCCCCHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 68777998--------------------------------------------7198999999999999986803999999 Q ss_pred HHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999963372142013222344321244564112246311310245201233203100123577653355777674 Q gi|254780127|r 523 DLTEGFACVVQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIV 602 (675) Q Consensus 523 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~i 602 (675) .+.+.|+. +| ++++..|||+++|+++..+.... ...+|+ .+|+|| T Consensus 849 ~~~~~a~~---~g-----------------------yv~Tl~gRrr~~p~i~s~n~~~r---~~~eR~------a~N~pi 893 (977) T PRK07556 849 ATKAFARE---HG-----------------------YVETLFGRRIHYPDIKASNPSVR---AFNERA------AINAPI 893 (977) T ss_pred HHHHHHHH---CC-----------------------CEECCCCCEECCCCCCCCCHHHH---HHHHHH------HHCCCC T ss_conf 99999997---79-----------------------77888787032875568998899---799999------827764 Q ss_pred HHHHHHHHHHHHHHHHH------CCCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCC--CCCCCCCCEEECCCCCCC Q ss_conf 79899999999999996------798189995358999678478999999-999846763--021698415730311463 Q gi|254780127|r 603 QAISRDILCEGMKNATK------NGYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNP--SWAKGLPLKAEGYEAKRY 673 (675) Q Consensus 603 QgtAaDi~k~am~~l~~------~g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p--~wa~~~pl~a~~~~~~~y 673 (675) |||||||+|.||+++++ .+.+|+||||||||||||+++++++.. |+++|++++ ..-.++||.+|..+|+.| T Consensus 894 QGtaAdiiK~ami~~~~~l~~~~~~~~milqvHDEli~e~~~~e~e~~~~~v~~~Me~a~~~~~~l~VPL~v~~~~G~nW 973 (977) T PRK07556 894 QGTAADIIRRAMIRMEDALAEEKLSARMLLQVHDELIFEVPEDEVEATIPVVRHVMENAAMPAVSLSVPLTVDARAADNW 973 (977) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCCCCCEECCCEEEECCCCCCH T ss_conf 35599999999999999998369886289997727789646889999999999997367767803115668654818697 Q ss_pred C Q ss_conf 1 Q gi|254780127|r 674 R 674 (675) Q Consensus 674 ~ 674 (675) . T Consensus 974 ~ 974 (977) T PRK07556 974 D 974 (977) T ss_pred H T ss_conf 8 No 2 >PRK06887 consensus Probab=100.00 E-value=0 Score=879.00 Aligned_cols=538 Identities=22% Similarity=0.231 Sum_probs=450.6 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCH----------------------------- Q ss_conf 789998040678870004600002688299999998382289997888422----------------------------- Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSA----------------------------- 52 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~----------------------------- 52 (675) +.+++|+||.|...+ .++++|++++++.+..+|+|+.+... T Consensus 357 ~~va~DtEt~~ld~~----------~a~lVGis~s~~~g~a~YiPl~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~ql~~ 426 (954) T PRK06887 357 KLIAVDTETDSLDYM----------SANLVGISFALENGEACYIPLAHKQQVAEQPQSDLFAEEAETQTDYLLAPKQLNK 426 (954) T ss_pred CEEEEEEECCCCCCC----------CCCEEEEEEEECCCCEEEEECCCCCCCCCCCCHHHHHHCCCCHHHHHCCHHHCCH T ss_conf 927999635998713----------2665789998358978998644421000132002332001001222102321279 Q ss_pred --HHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHH-HHCCCCCCHHHHH Q ss_conf --17999998648898199863077599999862998786504545889998588--999789999-8088986527663 Q gi|254780127|r 53 --MPSDLLQYLRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLARTNGL--PSSLKNACL-ALNFSEHLTKMEE 127 (675) Q Consensus 53 --~~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~--p~sL~~la~-~lgl~~~~~kd~~ 127 (675) +...|+++|+++++.||+||+|||+.+|.++ |+++.. ..+||||+||++++ .++|+.+++ +|+......++.. T Consensus 427 ~~vl~~Lkplled~~i~KIghNlK~dl~vL~~~-Gi~l~g-~~fDTmLAaYLLdp~~r~~Ld~LA~~yL~~~~i~~edl~ 504 (954) T PRK06887 427 STCLAALKPLLENPDIHKIGQNIKYDLTIFANH-GIELQG-VAFDTMLESYTLNSTGRHNMDDLAERYLGHETIAFESIA 504 (954) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHC-CCCCCC-CCCCHHHHHHHHCCCCCCCHHHHHHHHCCCCCCCHHHHC T ss_conf 999999899852975022211478899999977-962477-651698998751875557889999996388653346422 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----------HHHHHHHHHHHHHHHH Q ss_conf 76576641002332013453112587689998988775399999999872002-----------1431017899998751 Q gi|254780127|r 128 GKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIPL-----------SDGERDLWLLDQTIND 196 (675) Q Consensus 128 gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~-----------~~~E~~l~~~~~~me~ 196 (675) |+...+ ......+.+....||+.|++++++|+..|.+- .+.|+++..++.+||. T Consensus 505 gkGk~q---------------l~f~~v~l~~~~~YAaedA~itl~L~~~L~~~L~~~~~L~~Ly~~IE~PLi~VLa~ME~ 569 (954) T PRK06887 505 GKGKNQ---------------LTFDQIPLEQATEYAAEDADVTMKLHQVLWLELQKEPTLVKLYKTMELPLVHVLSRIER 569 (954) T ss_pred CCCCCC---------------CCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 544333---------------78443446799999999999999999999999985454789999999799999999997 Q ss_pred CCEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCC----HHHHHHHHHH-CCCC Q ss_conf 6843088999999999999999999999863057668888568899999997289998889----6588999974-4898 Q gi|254780127|r 197 RGYRIDLDLVLKLQELIAQERKKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDMS----EGTLKSILSH-SNIT 271 (675) Q Consensus 197 ~Gi~vD~e~l~~~~~~~~~~~~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~~----k~~~~~~l~~-~~~~ 271 (675) +||.||.+.++++.+++..+..++++++++++|.++ |+|||+||+++||+++|+|+.+.+ .+|..++|+. ...+ T Consensus 570 ~GI~vD~~~L~~ls~el~~~l~~le~~iy~~aG~eF-NinSpKQL~eiLFekLgLp~~KKTKtG~~STda~vLe~L~~~h 648 (954) T PRK06887 570 NGVLIDSDALLAQSNEIAQRLTALEQQAYELAGQPF-NLASTKQLQEILFDKLGLPVLKKTPKGAPSTNEEVLEELAYQH 648 (954) T ss_pred HCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHCC T ss_conf 385887999999999999999999999988609954-8999899999989842999888588999774699999986235 Q ss_pred CHHHHHHHHHHHHHHHCCHHHHHHHHHHCCC-CCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCC Q ss_conf 5489999999997543000356688730568-845220021132210011106888554777766523456544310244 Q gi|254780127|r 272 QLAKDLILNRLASSGSAILKLNTLSEAVSSD-GRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRG 350 (675) Q Consensus 272 ~~~~~ll~~r~~~~~~~~~~~~~l~~~~~~d-GRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~ 350 (675) |.++.+++||+..|+.+ +|+++|...++++ ||||++|+|.+ |.|||+||++||||||| T Consensus 649 ~i~~~il~yR~l~KL~s-TY~d~L~~~i~~~tgRiHt~f~q~~-t~TGRlSS~~PNLQNIP------------------- 707 (954) T PRK06887 649 ELPKLLVEHRGLSKLKS-TYTDKLPQMVNSKTGRVHTSYHQAV-TATGRLSSSDPNLQNIP------------------- 707 (954) T ss_pred CCHHHHHHHHHHHHHHH-HHHHHHHHHCCCCCCEECCCCCCCC-CCCCCCCCCCCCCCCCC------------------- T ss_conf 32689999989999899-9898887640778987777652011-23120137998633688------------------- Q ss_pred CCCCCCCCCCCHHHHEEEECCCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHH Q ss_conf 45544432210132121476998789997335485058866228767866540444100001577608751105702210 Q gi|254780127|r 351 ETIADPLGLASDCVRSCIIASNGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRA 430 (675) Q Consensus 351 ~~~~~~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~ 430 (675) ++++.|+.||+||+|+|||+|++|||||||||||||+|+|+.|++||++|.|||+.||+.+||+|.++|++++|+ T Consensus 708 -----iRte~Gr~iR~aF~~~~g~~l~saDYSQIELRilAh~s~d~~l~~af~~g~DiH~~tA~~vf~~~~~~v~~~~R~ 782 (954) T PRK06887 708 -----IRNEEGRRIRQAFIARDGYKIVAADYSQIELRIMAHLSGDQGLINAFSEGKDIHRSTAAEIFGLSLDEVTSEQRR 782 (954) T ss_pred -----CCCHHHHHHHEEEECCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCCHHHHHHHHHCCCHHHCCHHHHH T ss_conf -----786022233400543899889971452899999999966887999986388802778999949994668988998 Q ss_pred CCCCCHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHH Q ss_conf 06421001330788778988788708886403666665666765478899999999987511101031088978899999 Q gi|254780127|r 431 IGKVMELALGYQGGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAW 510 (675) Q Consensus 431 ~aK~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~ 510 (675) .||++|||++||||+.+|.+ ..+++..||+.+++.| T Consensus 783 ~aK~vnfgi~YG~~~~gLs~--------------------------------------------~l~i~~~eA~~~i~~y 818 (954) T PRK06887 783 SAKAINFGLIYGMSAFGLSR--------------------------------------------QLGISRADAQKYMDLY 818 (954) T ss_pred HHHHHHHHHHHCCCHHHHHH--------------------------------------------HHCCCHHHHHHHHHHH T ss_conf 76888766566168777998--------------------------------------------7098999999999999 Q ss_pred HHHCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHH Q ss_conf 98673078999999999999963372142013222344321244564112246311310245201233203100123577 Q gi|254780127|r 511 RAKHQGVLQLWKDLTEGFACVVQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRE 590 (675) Q Consensus 511 ~~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~ 590 (675) |+.||+|..|...+.+.++. +| ++++..|||+++|++++.+.... ...+|+ T Consensus 819 f~~~~~v~~~~~~~~~~a~~---~G-----------------------yv~Tl~GRrr~~p~i~s~n~~~r---~~~eR~ 869 (954) T PRK06887 819 FQRYPSVQQFMTDIREKAKA---QG-----------------------YVETLFGRRLYLPDINSSNAMRR---KGAERV 869 (954) T ss_pred HHHCCHHHHHHHHHHHHHHH---CC-----------------------CEECCCCCCCCCCCCCCCCHHHH---HHHHHH T ss_conf 88690099999999999997---79-----------------------46888887306876668998899---789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-----CCCEEEEECCEEEEECCCCHHHHHHHH-HHHHCCCCCCCCCCCCE Q ss_conf 653355777674798999999999999967-----981899953589996784789999999-99846763021698415 Q gi|254780127|r 591 RTYGGKLTENIVQAISRDILCEGMKNATKN-----GYDIVLTVHDEIVSETPDTPYFSVGTL-CSLMTKNPSWAKGLPLK 664 (675) Q Consensus 591 ~~~g~k~~n~~iQgtAaDi~k~am~~l~~~-----g~~ivl~VHDElv~evp~~~~~~~~~i-~~~M~~~p~wa~~~pl~ 664 (675) ++|+|||||||||+|.||++++++ +.+|+||||||||||||+++++++..| +++|+.+.. ..+||. T Consensus 870 ------aiN~piQGtAADiiK~AMi~~~~~l~~~~~~~~~lqvHDEli~e~~~~e~e~~~~~v~~~Me~a~~--l~VPL~ 941 (954) T PRK06887 870 ------AINAPMQGTAADIIKRAMIKIDKVIRGDPDIKMIMQVHDELVFEVRSEKVEFYSELIKQEMEKAAQ--LVVPLI 941 (954) T ss_pred ------HHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEE T ss_conf ------837565267999999999999999842789769998670668855688999999999999837410--678637 Q ss_pred EECCCCCCCC Q ss_conf 7303114631 Q gi|254780127|r 665 AEGYEAKRYR 674 (675) Q Consensus 665 a~~~~~~~y~ 674 (675) +|..+|+.|- T Consensus 942 v~~~~g~nW~ 951 (954) T PRK06887 942 VEVGVGDNWD 951 (954) T ss_pred EECCCCCCHH T ss_conf 3107287977 No 3 >PRK07997 consensus Probab=100.00 E-value=0 Score=873.52 Aligned_cols=538 Identities=21% Similarity=0.222 Sum_probs=450.2 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCC----------CHHHHHHHHHHCCCCCEEEEE Q ss_conf 7899980406788700046000026882999999983822899978884----------221799999864889819986 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQ----------SAMPSDLLQYLRDETVMCVAH 71 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~----------~~~~~~l~~~L~~~~~~kv~H 71 (675) +.+++|+||.|...+ .++++|+++|+..+..+|+++.+. +.+...+.++|+++++.||+| T Consensus 350 ~~~a~DtEt~sld~~----------~a~lvGiS~s~~~g~a~YiPl~h~~l~~~~ql~~~~vl~~LkplLed~~i~KIgh 419 (928) T PRK07997 350 PVFAFDTETDSLDNI----------SANLVGLSFAIEPGVAAYLPVAHDYLDAPDQISRERVLELLKPLLEDEKALKVGQ 419 (928) T ss_pred CEEEEEEECCCCCCC----------CCCCCCEEEEECCCCEEEEECCCCCCCCHHHCCHHHHHHHHHHHHCCCCCCHHCC T ss_conf 869999512786600----------0244433898627746874022123476001249999999898752865303212 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCC Q ss_conf 3077599999862998786504545889998588---999789999-808898652766376576641002332013453 Q gi|254780127|r 72 NSLFERILFKKTLGIDIPSKRWICTSVLARTNGL---PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDC 147 (675) Q Consensus 72 Na~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~ 147 (675) |+|||+.+|.++ |+++.. ..+||||+||++++ .++|+.+++ +|+......++..|+.-. T Consensus 420 nlK~d~~vL~~~-GI~l~g-i~fDTmLAaYLLnp~~~~~~Ld~La~~yL~~~~i~~~el~gkgk~--------------- 482 (928) T PRK07997 420 NLKFDRGILANY-GIELRG-IAFDTMLESYILNSVAGRHDMDSLAERWLKHKTITFEEIAGKGKN--------------- 482 (928) T ss_pred HHHHHHHHHHHC-CCEECC-CCCCHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCHHHHCCCCCC--------------- T ss_conf 168899999977-941056-430345532222443344788999999713466433543244645--------------- Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----------HHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHH Q ss_conf 112587689998988775399999999872002-----------143101789999875168430889999999999999 Q gi|254780127|r 148 TRANHVQAWQLFGEYCKRDVEATRELFKRLIPL-----------SDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQE 216 (675) Q Consensus 148 ~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~-----------~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~ 216 (675) ....+....+....|++.|++++++|+..|.+. .+.|+++..++.+||.+||.||.+.++++..++.++ T Consensus 483 q~~f~~v~~~~~~~Ya~edA~~t~~L~~~L~~~L~~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~ 562 (928) T PRK07997 483 QLTFNQIALEEAGRYAAEDADVTLQLHLKMWPKLQQHKGPLNVFENIEMPLVPVLSRIERNGVKIDPAVLHAHSEELTLR 562 (928) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHH T ss_conf 47743535999999999999999999999999986214499999999989999999998539599899999999999999 Q ss_pred HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCC---CH-HHHHHHHHHC-CCCCHHHHHHHHHHHHHHHCCHH Q ss_conf 9999999986305766888856889999999728999888---96-5889999744-89854899999999975430003 Q gi|254780127|r 217 RKKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDM---SE-GTLKSILSHS-NITQLAKDLILNRLASSGSAILK 291 (675) Q Consensus 217 ~~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~---~k-~~~~~~l~~~-~~~~~~~~ll~~r~~~~~~~~~~ 291 (675) ..++++++++++|.++ |+|||+||+++||+.+|+|+.+. +. +|.+++|+.. ..+|.++.+++||...|+.+ +| T Consensus 563 l~~le~~I~~~aG~eF-NinSpKQLgeILFekLgLp~~KKTktG~~ST~~~vLe~L~~~h~i~~~il~yR~l~KL~s-TY 640 (928) T PRK07997 563 LAELEKKAHEIAGEPF-NLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKS-TY 640 (928) T ss_pred HHHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHH-HH T ss_conf 9999999998629988-878999999998986299988858899968469999972025750899999989999899-88 Q ss_pred HHHHHHHHCC-CCCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEEC Q ss_conf 5668873056-884522002113221001110688855477776652345654431024445544432210132121476 Q gi|254780127|r 292 LNTLSEAVSS-DGRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIA 370 (675) Q Consensus 292 ~~~l~~~~~~-dGRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a 370 (675) +++|...+++ |||||++|+|.+ |.|||+||++||||||| ++++.|+.||+||+| T Consensus 641 ~d~L~~~i~~~~gRiHt~f~Q~~-t~TGRlSS~~PNLQNIP------------------------ir~~~G~~iR~aFi~ 695 (928) T PRK07997 641 TDKLPLMINPKTGRVHTSYHQAV-TATGRLSSTDPNLQNIP------------------------VRNEEGRRIRQAFIA 695 (928) T ss_pred HHHHHHHHCCCCCEEEEEEHCCC-CCCCCCCCCCCCCCCCC------------------------CCCHHHHHHHHHHCC T ss_conf 98889753866687820301110-21251125899867588------------------------886225789987437 Q ss_pred CCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHH Q ss_conf 99878999733548505886622876786654044410000157760875110570221006421001330788778988 Q gi|254780127|r 371 SNGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKT 450 (675) Q Consensus 371 ~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~ 450 (675) +|||+|++|||||||||||||+|+|+.|++||++|.|+|+.||+.|||+|.++|++++|+.||++|||++||||+.+|.+ T Consensus 696 ~~g~~l~saDYSQiELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~~~~~v~~~~R~~aK~vnfgi~YG~s~~gLa~ 775 (928) T PRK07997 696 PEDYVIVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLPLEKVTSEQRRSAKAINFGLIYGMSAFGLAR 775 (928) T ss_pred CCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCCCHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 89987997056289999999997687688775168883401179882999465798899886877766466268778998 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 78870888640366666566676547889999999998751110103108897889999998673078999999999999 Q gi|254780127|r 451 MASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFAC 530 (675) Q Consensus 451 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~ 530 (675) ..+++..||+.+++.||.+||+|+.|...+.+.|+. T Consensus 776 --------------------------------------------~l~i~~~eA~~~i~~yf~~~~~v~~~~~~~~~~a~~ 811 (928) T PRK07997 776 --------------------------------------------QLNIPRKEAQKYMDLYFERYPGVLEYMERTRAQAKE 811 (928) T ss_pred --------------------------------------------HHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf --------------------------------------------709999999999999988683399999999999997 Q ss_pred HHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 96337214201322234432124456411224631131024520123320310012357765335577767479899999 Q gi|254780127|r 531 VVQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDIL 610 (675) Q Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~ 610 (675) +| ++++..|||++.|++...+... +...+|+ ++|+|||||||||+ T Consensus 812 ---~g-----------------------yv~Tl~gRrr~~~~i~s~n~~~---r~~~eR~------a~N~piQgtaADii 856 (928) T PRK07997 812 ---QG-----------------------YVETLDGRRLYLPDIKSSNAAR---RAAAERA------AINAPMQGTAADII 856 (928) T ss_pred ---CC-----------------------CEECCCCCEECCCCCCCCCHHH---HHHHHHH------HHCCCCCHHHHHHH T ss_conf ---79-----------------------4788889614588767899889---9689999------95811337899999 Q ss_pred HHHHHHHHH----C--CCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 999999996----7--98189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 611 CEGMKNATK----N--GYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 611 k~am~~l~~----~--g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) |.||+++++ . ..+|+||||||||||||+++++++.. |+++|+.+.+ ..+||.+|..+|+.|. T Consensus 857 k~Ami~~~~~l~~~~~~~~~~lqvHDEli~e~~~~~~~~~~~~~~~~Me~a~~--l~VPL~v~~~~g~nW~ 925 (928) T PRK07997 857 KRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDLDAVAKKIHQLMENCTR--LDVPLLVEVGSGENWD 925 (928) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEECCEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 99999999999844888648973560678842688999999999999728443--7887585359277968 No 4 >PRK05755 DNA polymerase I; Provisional Probab=100.00 E-value=0 Score=870.24 Aligned_cols=537 Identities=21% Similarity=0.250 Sum_probs=450.9 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCH-----HHHHHHHHHCCCCCEEEEEEHHHH Q ss_conf 789998040678870004600002688299999998382289997888422-----179999986488981998630775 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSA-----MPSDLLQYLRDETVMCVAHNSLFE 76 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~-----~~~~l~~~L~~~~~~kv~HNa~FD 76 (675) +.+++|+||.|.... .++++|+++|++++..+|+|+.+... ....+.++|+++++.||+||+||| T Consensus 320 ~~va~dtEttsld~~----------~a~lVGis~s~~~g~a~YIPl~~~~~~~~~~vl~~Lk~lLed~~i~KIghNlK~d 389 (889) T PRK05755 320 GLVAFDTETTSLDPM----------SANLVGLSFAVEEGEAAYIPLAHDDQLDRDDALAALKPLLEDPAIPKVGQNLKYD 389 (889) T ss_pred CEEEEEEECCCCCCC----------CCCEEEEEEECCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCEEEECHHHH T ss_conf 849999841788756----------5762599998279968998444313368899999999997098872432031899 Q ss_pred HHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999862998786504545889998588---999789999-80889865276637657664100233201345311258 Q gi|254780127|r 77 RILFKKTLGIDIPSKRWICTSVLARTNGL---PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANH 152 (675) Q Consensus 77 ~~vL~~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~ 152 (675) +.+|.++ |+++.. .++||||+||++++ +++|+.+++ +|+......++..|+.- ..... T Consensus 390 l~vL~~~-GI~l~g-~~fDTmLAaYLLdP~~~~~~Ld~La~~yL~~~~~~~eel~gkg~--------------~~~~~-- 451 (889) T PRK05755 390 RHILANY-GIELRG-IAFDTMLASYLLDPGAGRHDLDSLAERYLGHKTISFEEVAGKGN--------------QLTFA-- 451 (889) T ss_pred HHHHHHC-CEECCC-CCHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCHHHHCCCCC--------------CCCCC-- T ss_conf 9999977-923577-63019999987478877677899999863234420343136565--------------56610-- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCC----------HHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHHHHHHH Q ss_conf 7689998988775399999999872002----------143101789999875168430889999999999999999999 Q gi|254780127|r 153 VQAWQLFGEYCKRDVEATRELFKRLIPL----------SDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQERKKLDE 222 (675) Q Consensus 153 ~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~----------~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~~~~~~ 222 (675) ..+.+....||+.|++++++||..|.+. .+.|+++..++.+||.+||.||.+.++++..++..+..++++ T Consensus 452 ~v~~e~~~~YaaedA~~~~~L~~~L~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~VD~~~L~~ls~el~~~l~~le~ 531 (889) T PRK05755 452 QVALEEAAEYAAEDADVTLRLHQVLKPELEEEGLLELYEEIELPLVPVLARMERNGIKVDREYLEELSAELAQRLAELEQ 531 (889) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHH T ss_conf 07878999999998868999999999999860478899998751999999999608686699999999999999999999 Q ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCH---HHHHHHHHH-CCCCCHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 998630576688885688999999972899988896---588999974-4898548999999999754300035668873 Q gi|254780127|r 223 DIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDMSE---GTLKSILSH-SNITQLAKDLILNRLASSGSAILKLNTLSEA 298 (675) Q Consensus 223 ~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~~k---~~~~~~l~~-~~~~~~~~~ll~~r~~~~~~~~~~~~~l~~~ 298 (675) ++++++|.++ |+|||+||+++||+.+|+|..+.+| +|..+.|+. .+.+|.+..+++||+..|+.+ +|+++|.+. T Consensus 532 ~I~~~aG~eF-NinSPKQL~~iLFekL~Lp~~KKTKtGySTd~~vLe~L~~~hpi~~~iLeyR~l~KL~s-TYvd~L~~~ 609 (889) T PRK05755 532 EIYELAGEEF-NLNSPKQLGEILFEKLGLPVTKKTKTGYSTDAEVLEKLADDHPIPKKILEYRQLSKLKS-TYTDGLPKM 609 (889) T ss_pred HHHHHCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHH-HHHHHHHHC T ss_conf 9998749903-77998999999998539997887889887859999998750769999999977878888-898888752 Q ss_pred HCCC-CCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEECCCCCEEE Q ss_conf 0568-845220021132210011106888554777766523456544310244455444322101321214769987899 Q gi|254780127|r 299 VSSD-GRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIASNGKKLV 377 (675) Q Consensus 299 ~~~d-GRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G~~lv 377 (675) ++++ ||||++|+|.| |.|||+||++||||||| ++.+.|+.||+||+|+|||+|+ T Consensus 610 i~~~tgRIHt~f~Q~~-t~TGRLSS~~PNLQNIP------------------------ir~e~Gr~IR~aFva~~g~~l~ 664 (889) T PRK05755 610 INPDTGRVHTSFNQAV-TATGRLSSSDPNLQNIP------------------------IRTEEGRRIRQAFVAPEGYKLL 664 (889) T ss_pred CCCCCCEEECCHHHHH-CCCCCCCCCCCCCCCCC------------------------CCCHHHHHHHHEEEECCCCEEE T ss_conf 4788984611001110-32022247998534687------------------------7632304455227858998899 Q ss_pred EEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCC Q ss_conf 97335485058866228767866540444100001577608751105702210064210013307887789887887088 Q gi|254780127|r 378 VADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKTMASHCGL 457 (675) Q Consensus 378 ~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~ 457 (675) +|||||||||||||+|+|+.|++||++|.|+|+.||+.+||+|.++|++++|+.||++|||++||||+.+|.+ T Consensus 665 saDYSQIELRilAh~S~D~~li~af~~g~DiH~~tA~~vf~v~~~~vt~~~R~~AK~vnfGiiYG~s~~gLs~------- 737 (889) T PRK05755 665 SADYSQIELRILAHLSGDEGLIEAFAEGEDIHTATASEVFGVPLEEVTSEQRRRAKAINFGLIYGMSAFGLAR------- 737 (889) T ss_pred EECHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHEEEHHHHCCCHHHHHH------- T ss_conf 9512089999999980899999998559986899999993999001899998662613221444668778998------- Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCE Q ss_conf 86403666665666765478899999999987511101031088978899999986730789999999999999633721 Q gi|254780127|r 458 DLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEGGS 537 (675) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~ 537 (675) ..+++..||+.+++.|+.+||+|..|...+.+.|+. +| T Consensus 738 -------------------------------------~l~is~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~---~G-- 775 (889) T PRK05755 738 -------------------------------------QLGISRKEAKEYIDRYFERYPGVKEYMEETVAQAKE---KG-- 775 (889) T ss_pred -------------------------------------HHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH---CC-- T ss_conf -------------------------------------819999999999999999681099999999999997---79-- Q ss_pred EEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 42013222344321244564112246311310245201233203100123577653355777674798999999999999 Q gi|254780127|r 538 ISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILCEGMKNA 617 (675) Q Consensus 538 ~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~am~~l 617 (675) ++++..|||+++|++.+.+.... ...+|+ ++|+|||||||||+|.||+++ T Consensus 776 ---------------------yv~Tl~gRrr~~p~i~s~n~~~r---~~~eR~------AiN~pIQGsAADiiK~AMi~v 825 (889) T PRK05755 776 ---------------------YVETLFGRRRYLPDINSSNGARR---AFAERA------AINAPIQGSAADIIKIAMIRV 825 (889) T ss_pred ---------------------CEECCCCCEEECCCCCCCCHHHH---HHHHHH------HHCCCCCHHHHHHHHHHHHHH T ss_conf ---------------------77787787052786778998899---899999------967565556999999999999 Q ss_pred HH-----CCCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 96-----798189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 618 TK-----NGYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 618 ~~-----~g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) ++ .+.+|+||||||||||||+++++++.. |+++|+++.+ ..+||.+|..+|+.|. T Consensus 826 ~~~l~~~~~~~~llQVHDEli~Ev~~~~~e~~~~~v~~~Me~a~~--l~VPl~v~~~~g~nW~ 886 (889) T PRK05755 826 DDALKEGLKARMLLQVHDELVFEVPEDELEEVKKLVKEVMENAVE--LSVPLVVDVGVGDNWD 886 (889) T ss_pred HHHHHHCCCCEEEEEEEEEEEEECCHHHHHHHHHHHHHHHHCCEE--CCCCEEEECCCCCCHH T ss_conf 999984778738998655777842788999999999999718500--7887586238168967 No 5 >PRK08786 consensus Probab=100.00 E-value=0 Score=870.06 Aligned_cols=538 Identities=19% Similarity=0.190 Sum_probs=448.8 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCC----------HHHHHHHHHHCCCCCEEEEE Q ss_conf 78999804067887000460000268829999999838228999788842----------21799999864889819986 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQS----------AMPSDLLQYLRDETVMCVAH 71 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~----------~~~~~l~~~L~~~~~~kv~H 71 (675) +.+++|+||.|...+ ++.++|++++++.+..+|+++.|.. .....+.++|+++++.||+| T Consensus 349 ~~~afdtET~sld~~----------~a~lVGiS~s~~~g~a~YiP~~h~~~g~~~qld~~~~l~~LkplLed~~I~KIgh 418 (927) T PRK08786 349 GQFAFDTETDSLDPL----------QADLIGLSVAAEPGQAAYLPFGHDFPGAPVQLDRTQALAQLAPLLTDPAVRKVGQ 418 (927) T ss_pred CEEEEEEECCCCCHH----------HHHHHCEEEECCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 838999723886655----------5312122454267866775324456786034139999999899860865112321 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCC Q ss_conf 3077599999862998786504545889998588---999789999-808898652766376576641002332013453 Q gi|254780127|r 72 NSLFERILFKKTLGIDIPSKRWICTSVLARTNGL---PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDC 147 (675) Q Consensus 72 Na~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~ 147 (675) |+|||+.+|.++ |+.+.. ..+||||+||++++ .++|+.+++ +||++...+++..||.-.+ . T Consensus 419 nlK~Dl~vL~~~-GI~l~g-~~~DTmLAaYLL~p~~~~~~Ld~La~~yLg~~~~~~~~l~gkg~k~---------~---- 483 (927) T PRK08786 419 HGKYDLHVMRRH-GIALAG-YADDTLLESFVLNSGSARHDMDSLAKRYLGYDTVKYEDVCGKGAKQ---------I---- 483 (927) T ss_pred HHHHHHHHHHHC-CCCCCC-CCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCCC---------C---- T ss_conf 178889999967-971167-6642588886237764547889999996697754267542443222---------8---- Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC----------C-HHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHH Q ss_conf 11258768999898877539999999987200----------2-143101789999875168430889999999999999 Q gi|254780127|r 148 TRANHVQAWQLFGEYCKRDVEATRELFKRLIP----------L-SDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQE 216 (675) Q Consensus 148 ~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~----------~-~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~ 216 (675) ... ....+....|++.|++++++||..|.+ + .+.|+++..++.+||.+||.||.+.++++..++..+ T Consensus 484 ~~~--~v~~~~~~~Ya~eda~~tl~L~~~l~~kL~~e~~l~~ly~~iE~PLi~VLa~ME~~GI~vD~~~L~~l~~~l~~~ 561 (927) T PRK08786 484 PFA--QVSLEDATRYAAEDADITLRLHCVLGPKLAAEPGLERVYRDIEMPLVDVLERIEANGVCVDAAELRRQSADLSKR 561 (927) T ss_pred CHH--HCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHH T ss_conf 833--325999999999999999999999999998505299999999979999999988568798899999999999999 Q ss_pred HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCC----HHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHCCHH Q ss_conf 99999999863057668888568899999997289998889----6588999974-489854899999999975430003 Q gi|254780127|r 217 RKKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDMS----EGTLKSILSH-SNITQLAKDLILNRLASSGSAILK 291 (675) Q Consensus 217 ~~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~~----k~~~~~~l~~-~~~~~~~~~ll~~r~~~~~~~~~~ 291 (675) ..++++++++++|.++ |+|||+||+++||+.+|+|+.+.+ .+|.+++|+. .+.+|.++.+++||+..|+.+ +| T Consensus 562 l~~le~~I~~~aG~eF-NinSpKQL~~iLFekL~Lp~~kKtktG~~ST~~~vLe~L~~~h~i~~~iL~yR~l~KL~s-TY 639 (927) T PRK08786 562 MLAAQQKATELAGRTF-NLDSPKQLQALLFDELKLPAVVKTPKGQPSTNEEALEAIADQHELPRVILEYRSLAKLRS-TY 639 (927) T ss_pred HHHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHH-HH T ss_conf 9999999999708555-789999999999750289878738899977749999987635677699999988887787-88 Q ss_pred HHHHHHHHCC-CCCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEEC Q ss_conf 5668873056-884522002113221001110688855477776652345654431024445544432210132121476 Q gi|254780127|r 292 LNTLSEAVSS-DGRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIA 370 (675) Q Consensus 292 ~~~l~~~~~~-dGRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a 370 (675) +++|.+.+++ |||||++|+|.+ |.|||+||++||||||| ++++.|+.||+||+| T Consensus 640 ~d~L~~~i~~~~gRiHt~f~Q~~-t~TGRlSS~~PNLQNIP------------------------irt~~G~~iR~aFi~ 694 (927) T PRK08786 640 TDKLPEMIHPQSGRVHTSYHQAG-AATGRLSSSDPNLQNIP------------------------IRTEDGRRIRRAFVA 694 (927) T ss_pred HHHHHHHCCCCCCEECCCCHHHC-CCCCEECCCCCCCCCCC------------------------CCCCCCCCHHHHEEC T ss_conf 99888740766576624216311-45452527993411488------------------------776333022666025 Q ss_pred CCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHH Q ss_conf 99878999733548505886622876786654044410000157760875110570221006421001330788778988 Q gi|254780127|r 371 SNGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKT 450 (675) Q Consensus 371 ~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~ 450 (675) +||++|++|||||||||||||+|+|+.|++||++|.|+|+.||+.+||+|.++|++++|+.||++|||++||||+.+|.+ T Consensus 695 ~~g~~l~saDYSQIELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~~~~~v~~~~R~~aK~infgiiYG~s~~gL~~ 774 (927) T PRK08786 695 PAGRKLIACDYSQIELRIMAHLSGDPGLVGAFESGADVHRATAAEVFGRTIDTVSGDERRAAKAINFGLMYGMSAFGLAR 774 (927) T ss_pred CCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHCCCCCCCHHHHHHHHCCCHHHCCHHHHHHHHEEEEEHHHCCCHHHHHH T ss_conf 89887997236488999999982886477775478673032689882999576697788551521020033378777998 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 78870888640366666566676547889999999998751110103108897889999998673078999999999999 Q gi|254780127|r 451 MASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFAC 530 (675) Q Consensus 451 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~ 530 (675) ..+++..||+.+++.||..||+|+.|...+.+.|+. T Consensus 775 --------------------------------------------~l~i~~~eA~~~i~~yf~~y~~v~~~~~~~~~~a~~ 810 (927) T PRK08786 775 --------------------------------------------QLGIGRGEAQDYIALYFSRYPGVRDFMETTRQQARD 810 (927) T ss_pred --------------------------------------------HCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf --------------------------------------------709999999999999999691099999999999987 Q ss_pred HHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 96337214201322234432124456411224631131024520123320310012357765335577767479899999 Q gi|254780127|r 531 VVQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDIL 610 (675) Q Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~ 610 (675) +| ++++..|||++.|+++..+.... ...+|+ ++|+|||||||||+ T Consensus 811 ---~g-----------------------yv~Tl~gRrr~~p~i~s~n~~~r---~~aeR~------A~N~piQGsAADii 855 (927) T PRK08786 811 ---KG-----------------------YVETVFGRRLYLDFINAGSQGQR---AGAERA------AINAPMQGTAADII 855 (927) T ss_pred ---CC-----------------------EEECCCCCCCCCCCCCCCCHHHH---HHHHHH------HHCCCCCHHHHHHH T ss_conf ---79-----------------------17888886032886678887788---789999------85834435799999 Q ss_pred HHHHHHHHH------CCCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 999999996------798189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 611 CEGMKNATK------NGYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 611 k~am~~l~~------~g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) |.||+++++ ...+|+||||||||||||+++++++.. |+++|+.+.. ..+||.+|..+|+.|. T Consensus 856 K~AMi~v~~~l~~~~~~~~mlLQVHDELifEv~~~e~~~~~~~v~~~Me~a~~--L~VPL~v~~~~G~nW~ 924 (927) T PRK08786 856 KRAMVKVDGWIADHAQRALMILQVHDELVFEADEDFVDTLLSEVTARMAAAAE--LRVPLVVDSGVGDNWD 924 (927) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEECCEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEECCCCCCCHH T ss_conf 99999999999726878669986672568851689999999999999817445--7787573038386967 No 6 >PRK07625 consensus Probab=100.00 E-value=0 Score=864.79 Aligned_cols=538 Identities=22% Similarity=0.231 Sum_probs=450.3 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCC----------HHHHHHHHHHCCCCCEEEEE Q ss_conf 78999804067887000460000268829999999838228999788842----------21799999864889819986 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQS----------AMPSDLLQYLRDETVMCVAH 71 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~----------~~~~~l~~~L~~~~~~kv~H 71 (675) +.+++|+||.|...+ .++++|+++|++.+..+|+++.|.. .+...++++|+++++.|++| T Consensus 345 ~~~a~dtEttsld~~----------~a~lvGis~s~~~g~a~YiPl~h~~~d~~~~l~~~~vl~~Lk~~Led~~i~KIgh 414 (922) T PRK07625 345 ELTAFDTETTSLDPM----------LAQLVGLSFSVEPGHAAYIPVAHRGPDMPAQLPRDEVLARLRPWLEDAGRKKLGQ 414 (922) T ss_pred CEEEEEEECCCCCHH----------HCEEEEEEEEECCCCEEEEEECCCCCCHHHHCCHHHHHHHHHHHHHCCCCHHHHH T ss_conf 848999522776500----------1305657977438867999702158652321589999998799874344035542 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCC Q ss_conf 3077599999862998786504545889998588--999789999-8088986527663765766410023320134531 Q gi|254780127|r 72 NSLFERILFKKTLGIDIPSKRWICTSVLARTNGL--PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCT 148 (675) Q Consensus 72 Na~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~--p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~ 148 (675) |++||+.+|.++ |+.+.. ..+||||+||+++. +++|+.+++ +|++....+++..||...+ . T Consensus 415 NlK~d~~vL~~~-GI~l~g-~~fDTmLAaYLL~~~r~~~Ld~La~~yL~~~~i~~edl~Gkg~~q-------------~- 478 (922) T PRK07625 415 HLKYDAQVLANY-GIALNG-IEHDTLLESYVLESHRTHDMDSLALRHLGVKTIKYEDVAGKGAKQ-------------I- 478 (922) T ss_pred HHHHHHHHHHHC-CCCCCC-CCHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCHHHHCCCCCCC-------------C- T ss_conf 577899999977-964587-540399998875467678879999987255655413212333333-------------7- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----------HHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHH Q ss_conf 12587689998988775399999999872002-----------1431017899998751684308899999999999999 Q gi|254780127|r 149 RANHVQAWQLFGEYCKRDVEATRELFKRLIPL-----------SDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQER 217 (675) Q Consensus 149 ~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~-----------~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~ 217 (675) .....+.+....|++.|++++++||..|.+- .+.|+++..++.+||.+||.||.+.++++..++.++. T Consensus 479 -~~~~v~~~~~~~Ya~eda~~t~~L~~~L~~~L~~e~~L~~ly~~iE~PLi~VLa~ME~~GI~vD~~~L~~l~~~l~~~l 557 (922) T PRK07625 479 -GFDEVALAQASEYAAEDADITLQLHHALYPQVAREPGLERVYRDIELPVSLVLRKMERTGVLIDADRLSRQSGEIATRL 557 (922) T ss_pred -CHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf -8322467999999999999999999999999834054799999999589999999998284026999999999999999 Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCC---H-HHHHHHHHH-CCCCCHHHHHHHHHHHHHHHCCHHH Q ss_conf 9999999863057668888568899999997289998889---6-588999974-4898548999999999754300035 Q gi|254780127|r 218 KKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDMS---E-GTLKSILSH-SNITQLAKDLILNRLASSGSAILKL 292 (675) Q Consensus 218 ~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~~---k-~~~~~~l~~-~~~~~~~~~ll~~r~~~~~~~~~~~ 292 (675) .++++++++++|.++ |+|||+||+++||+.+|+|+.+.+ . +|.++.|+. .+.+|.++.+++||+..|+.+ +|+ T Consensus 558 ~~le~~i~~~aG~eF-NinSpkQL~~iLFe~L~Lp~~kKTktG~~ST~~~vLe~L~~~hpi~~~il~yR~l~KL~s-TY~ 635 (922) T PRK07625 558 LALEQEAYELAGGEF-NLGSPKQIGQIFFERLQLPVVKKTPSGAPSTDEEVLQKLAEDYPLPKLLLEHRGLSKLKS-TYT 635 (922) T ss_pred HHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHH-HHH T ss_conf 999998777549977-899999999999985389878858989978539999986006873899998628989999-999 Q ss_pred HHHHHHHCCC-CCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEECC Q ss_conf 6688730568-845220021132210011106888554777766523456544310244455444322101321214769 Q gi|254780127|r 293 NTLSEAVSSD-GRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIAS 371 (675) Q Consensus 293 ~~l~~~~~~d-GRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~ 371 (675) ++|.+.++++ ||||++|+|.+ |.|||+||++||||||| ++++.|+.||+||+|+ T Consensus 636 d~l~~~i~~~~gRiHt~f~Q~~-t~TGRlSS~~PNLQNIP------------------------ir~~~G~~iR~aFi~~ 690 (922) T PRK07625 636 DKLPRMVNPDTGRVHTNYAQAV-AVTGRLASNDPNLQNIP------------------------VRTEEGRRIREAFIAP 690 (922) T ss_pred HHHHHHCCCCCCEECCCHHEEE-CCCCCCCCCCCCCCCCC------------------------CCCCCCCHHHEEEECC T ss_conf 8888850888787766200210-23354357998633688------------------------7762240133447318 Q ss_pred CCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHH Q ss_conf 98789997335485058866228767866540444100001577608751105702210064210013307887789887 Q gi|254780127|r 372 NGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKTM 451 (675) Q Consensus 372 ~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~ 451 (675) ||++|++|||||||||||||+|+|+.|++||++|.|+|+.||+.|||+|.++|++++|+.||++|||++||||+.+|.+ T Consensus 691 ~g~~l~~aDYSQiELRilAh~s~d~~l~~af~~~~DiH~~tA~~vf~v~~~~v~~~~R~~AK~vnfgiiYG~s~~gLa~- 769 (922) T PRK07625 691 PGSKIVSADYSQIELRIMAHISGDESLLRAFAHGEDIHRATAAEVFGVTPLEVTSDQRRIAKVINFGLIYGMSAFGLAS- 769 (922) T ss_pred CCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCHHHHHCCCHHHCCHHHHHHHHEEEEHHHHCCCHHHHHH- T ss_conf 9988998155388999999995788799998659985424269996989576899998651702002365578677998- Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 88708886403666665666765478899999999987511101031088978899999986730789999999999999 Q gi|254780127|r 452 ASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACV 531 (675) Q Consensus 452 ~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~ 531 (675) ..+++..||+.+++.||..||+|..|...+.+.|+. T Consensus 770 -------------------------------------------~l~i~~~eA~~~i~~yf~~y~~v~~~~~~~~~~a~~- 805 (922) T PRK07625 770 -------------------------------------------NLGITRDAAKLYIDRYFARYPGVARYMEETRMRAKE- 805 (922) T ss_pred -------------------------------------------HCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH- T ss_conf -------------------------------------------719999999999999998691299999999999987- Q ss_pred HHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 63372142013222344321244564112246311310245201233203100123577653355777674798999999 Q gi|254780127|r 532 VQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILC 611 (675) Q Consensus 532 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k 611 (675) +| ++++..|||+++|++.+.+... +...+|+ ++|+|||||||||+| T Consensus 806 --~g-----------------------yv~Tl~gRrr~~p~i~s~n~~~---r~~~eR~------AiN~piQGsaADiiK 851 (922) T PRK07625 806 --KG-----------------------YVETVFGRRLWLPEINGGNGPR---RQAAERA------AINAPMQGTAADLIK 851 (922) T ss_pred --CC-----------------------EEECCCCCCCCCCCCCCCCHHH---HHHHHHH------HHCCCCCHHHHHHHH T ss_conf --79-----------------------2667778603176556888788---8799999------965024467999999 Q ss_pred HHHHHHHH------CCCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 99999996------798189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 612 EGMKNATK------NGYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 612 ~am~~l~~------~g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) .||+++++ .+.+|+||||||||||||+++++++.. |+++|+.+.. ..+||.+|..+|+.|. T Consensus 852 ~Ami~i~~~l~~~~~~~~~ilqvHDEli~e~~~~~~~~~~~~~~~~Me~~~~--l~VPL~V~~~~G~nW~ 919 (922) T PRK07625 852 LSMIAVDDWLERDGLRTRMIMQVHDELVLEVPDDELSLVREKLPEMMCGVAK--LKVPLVAEVGVGANWE 919 (922) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 9999999999846998716998754778840788999999999999727415--7887683328278978 No 7 >PRK08835 consensus Probab=100.00 E-value=0 Score=861.98 Aligned_cols=538 Identities=20% Similarity=0.227 Sum_probs=447.4 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCC----------CHHHHHHHHHHCCCCCEEEEE Q ss_conf 7899980406788700046000026882999999983822899978884----------221799999864889819986 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQ----------SAMPSDLLQYLRDETVMCVAH 71 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~----------~~~~~~l~~~L~~~~~~kv~H 71 (675) +.+++|+||.| ++.+ .+.+.|++++++.+..+|+++.|. +.+...++++|+++++.||+| T Consensus 352 ~~~a~dtEt~~--------ld~~--~a~lvGis~~~~~g~a~YiPl~h~~l~~~~ql~~d~vl~~LkplLed~~i~KIgh 421 (931) T PRK08835 352 ELFAFDTETDS--------LDYM--VANLVGLSFAVAEGEAAYVPVAHDYLDAPEQLDRDWVLAQLKPLLEDDAKAKVGQ 421 (931) T ss_pred CEEEEEEECCC--------CCHH--HCCCCCEEEEECCCCEEEEEECCCCCCCHHHCCHHHHHHHHHHHHCCCCHHHHHC T ss_conf 85999950487--------4322--2133542687647857999712333455101369999999799842842436622 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCC Q ss_conf 3077599999862998786504545889998588---999789999-808898652766376576641002332013453 Q gi|254780127|r 72 NSLFERILFKKTLGIDIPSKRWICTSVLARTNGL---PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDC 147 (675) Q Consensus 72 Na~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~ 147 (675) |++||+.+|.++ |+.+.. ..+|||++||++++ .++|+.++. +|+......++..|+.-.+ T Consensus 422 NlK~d~~vL~~~-gI~l~g-i~fDTmLAaYLLnp~~~~~~Ld~La~~~L~~~~i~~e~l~gkgk~q-------------- 485 (931) T PRK08835 422 NLKYDASVLARY-GIEMKG-IKHDTMLASYVYNSVGGKHDMDSLALRFLQHSCISFEQVAGKGKNQ-------------- 485 (931) T ss_pred CHHHHHHHHHHC-CCCCCC-CCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCCC-------------- T ss_conf 115789999977-977678-6612435753004777878989999986487763477651445344-------------- Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----------HHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHH Q ss_conf 112587689998988775399999999872002-----------143101789999875168430889999999999999 Q gi|254780127|r 148 TRANHVQAWQLFGEYCKRDVEATRELFKRLIPL-----------SDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQE 216 (675) Q Consensus 148 ~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~-----------~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~ 216 (675) ........+....|++.|++++++|+..|.+. .+.|+++..++.+||.+||.||.+.++++.+++..+ T Consensus 486 -~~f~~v~~~~~~~YaaedA~~t~~L~~~L~~~L~~~~~L~~Ly~~IE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~ 564 (931) T PRK08835 486 -LTFNQIDLEEAAPYAAEDADVTLRLHNRLFANIEQDEKLKSVYEEIEMPLVPVLSRIERTGVLIDDMLLSAQSQEIAAR 564 (931) T ss_pred -CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEEECHHHHHHHHHHHHHH T ss_conf -7833432889999999999999999999999998547788999999868999999999728487899999999999999 Q ss_pred HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCC----HHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHCCHH Q ss_conf 99999999863057668888568899999997289998889----6588999974-489854899999999975430003 Q gi|254780127|r 217 RKKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDMS----EGTLKSILSH-SNITQLAKDLILNRLASSGSAILK 291 (675) Q Consensus 217 ~~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~~----k~~~~~~l~~-~~~~~~~~~ll~~r~~~~~~~~~~ 291 (675) .++++++++++.|.++ |+|||+||+++||+.+|+|+.+.+ .+|.++.|+. ...+|.++.+++||+..|+.+ +| T Consensus 565 l~~le~~I~~~aG~eF-NinSPKQL~~vLF~~l~Lp~~kKtktG~~ST~~~vLe~L~~~hpi~~~il~yR~l~KL~s-Ty 642 (931) T PRK08835 565 LDELEQKAYEIAGQEF-NMNSPKQLQTILFEKMGLPVIKKTPSGTPSTNEEVLQELALDYPLPKVILEYRGLAKLKS-TY 642 (931) T ss_pred HHHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHH-HH T ss_conf 9999999987627856-779999999999986099878858899967559999987651777899999978988888-87 Q ss_pred HHHHHHHHCC-CCCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEEC Q ss_conf 5668873056-884522002113221001110688855477776652345654431024445544432210132121476 Q gi|254780127|r 292 LNTLSEAVSS-DGRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIA 370 (675) Q Consensus 292 ~~~l~~~~~~-dGRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a 370 (675) +++|.+.+++ +||||++|+|.+ |.|||+||++||||||| ++++.|+.||+||+| T Consensus 643 ~~~L~~~i~~~~gRiHt~f~Q~~-t~TGRLSS~~PNLQNIP------------------------iRt~~G~~iR~aFva 697 (931) T PRK08835 643 TDKLPKMINPSTGRVHTSYHQAV-TATGRLSSTDPNLQNIP------------------------IRNEEGRRIRQAFVA 697 (931) T ss_pred HHHHHHHHCCCCCEECCHHHHCC-CCCCCCCCCCCCCCCCC------------------------CCCHHHCCHHHHEEC T ss_conf 75557652867588787343101-55245257996302678------------------------886011100053466 Q ss_pred CCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHH Q ss_conf 99878999733548505886622876786654044410000157760875110570221006421001330788778988 Q gi|254780127|r 371 SNGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKT 450 (675) Q Consensus 371 ~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~ 450 (675) ++||+|++|||||||||||||+|+|+.|++||++|.|+|+.||+.+||+|.++|++++|+.||++|||++||||+.+|.+ T Consensus 698 ~~g~~l~saDYSQIELRilAh~s~d~~li~af~~~~DiH~~tA~~if~v~~~~v~~~~R~~AK~vnfgi~YG~~~~gLa~ 777 (931) T PRK08835 698 PHGYKILAVDYSQIELRIMAHLSGDQALLDAFRDGKDIHAATAAEIMGVSIEQVSSEQRRRAKAVNFGLIYGMSAFGLAK 777 (931) T ss_pred CCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 99887997043189999999982998999998669980678999995999466899898775833311244578777998 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 78870888640366666566676547889999999998751110103108897889999998673078999999999999 Q gi|254780127|r 451 MASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFAC 530 (675) Q Consensus 451 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~ 530 (675) ..+++..||+.+++.|+++||+|+.|...+.+.|+. T Consensus 778 --------------------------------------------~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~ 813 (931) T PRK08835 778 --------------------------------------------QLGIPRGEAQAYMDKYFERYPGVMQYMEDTRSRAAQ 813 (931) T ss_pred --------------------------------------------HHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf --------------------------------------------709899999999999988682299999999999987 Q ss_pred HHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 96337214201322234432124456411224631131024520123320310012357765335577767479899999 Q gi|254780127|r 531 VVQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDIL 610 (675) Q Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~ 610 (675) +| ++++.+|||++.|++.+.+.... ...+|+ ++|+|||||||||+ T Consensus 814 ---~g-----------------------~v~Tl~gRrr~~p~i~s~n~~~r---~~~eR~------a~N~piQGtaADii 858 (931) T PRK08835 814 ---QG-----------------------YVETIFGRRLHLPEITSRNGMRR---KAAERA------AINAPMQGTAADII 858 (931) T ss_pred ---CC-----------------------CEECCCCCCCCCCCCCCCCHHHH---HHHHHH------HHCCCCCHHHHHHH T ss_conf ---79-----------------------17887887314886568887788---689999------95712445599999 Q ss_pred HHHHHHHHH----CC---CCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 999999996----79---8189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 611 CEGMKNATK----NG---YDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 611 k~am~~l~~----~g---~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) |.||+++++ .| .+|+||||||||||||+++++++.. |+++|+++.+ ..+||.+|..+|+.|. T Consensus 859 K~AMi~~~~~l~~~~~~~~~~~lqvHDEli~e~~~~~~~~~~~~v~~~Me~~~~--l~Vpl~v~~~~G~nW~ 928 (931) T PRK08835 859 KKAMLLVDEWIEEEGNGRVKLLMQVHDELVFEVEESSLSEIESKVQQLMESAAE--LDVPLVAEAGHGDNWD 928 (931) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECCEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 999999999998556777628986672778831688999999999999727632--7786374238076978 No 8 >PRK05929 consensus Probab=100.00 E-value=0 Score=852.81 Aligned_cols=535 Identities=17% Similarity=0.164 Sum_probs=439.8 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCC-CCHHHHHHHHHHCCCCCEEEEEEHHHHHHHH Q ss_conf 789998040678870004600002688299999998382289997888-4221799999864889819986307759999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTE-QSAMPSDLLQYLRDETVMCVAHNSLFERILF 80 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~-~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL 80 (675) +.+++|+||.+.. ....++.++++|.+ +.++++++.+ +......++++|+++++.|++||+|||+.+| T Consensus 311 ~~~a~~~~~~~~~----------~~~~~lvGisls~~-~~a~yip~~~~~~~vl~~Lk~~led~~i~kig~NlK~d~~vL 379 (870) T PRK05929 311 GEVAFCVAYTGNH----------LPSLKLHGVALAGG-SQVFYIEVEESGDQEIALLKDFFARKDTTFYGYNLKRDNHAL 379 (870) T ss_pred CCEEEEEEECCCC----------CCCCCEEEEEEECC-CEEEEEECCCCCHHHHHHHHHHHCCCCCCEEECCHHHHHHHH T ss_conf 9747999745887----------67772889999769-808999765555238999999860987745322377799999 Q ss_pred HHHCCCCCCCCCEEEHHHHHHHHCCC--CCHHHHHHHHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 98629987865045458899985889--9978999980889865276637657664100233201345311258768999 Q gi|254780127|r 81 KKTLGIDIPSKRWICTSVLARTNGLP--SSLKNACLALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQL 158 (675) Q Consensus 81 ~~~lGi~i~~~~~~DTmi~A~~l~~p--~sL~~la~~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) .++ |+++.. ..+|||+++|+++.. .+++.+...+++.... ...++.. ...... ....+ +. T Consensus 380 ~~~-GI~l~g-~~fDtmLAsYLLn~~~~~~~~~ll~~~~~~~~~--~~~~k~~-------g~~~l~----~~~~~---~~ 441 (870) T PRK05929 380 KNA-GIDVHE-ITADLALAEHLVNGGAKISFQTLLVEHGHIEEA--HRFAKEW-------GASSLP----IQRLP---EQ 441 (870) T ss_pred HHC-CCCCCC-CHHHHHHHHHHHCCCCCCCHHHHHHHCCCCHHH--HHHHHHH-------CCCCCC----CCCCH---HH T ss_conf 968-985687-277999999862999988839999766673155--5655540-------766664----20073---88 Q ss_pred HHHHHHHHHHHHHHHHHHHCCC----------HHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 8988775399999999872002----------143101789999875168430889999999999999999999998630 Q gi|254780127|r 159 FGEYCKRDVEATRELFKRLIPL----------SDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQERKKLDEDIVKLT 228 (675) Q Consensus 159 ~~~Ya~~Dv~~~~~L~~~L~~~----------~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~~~~~~~i~~~~ 228 (675) ...|++.|+..+++|+..|.+. .+.|+++..++.+||.+||.||.+.++++.+++..+..++++++++++ T Consensus 442 ~~~yaae~a~~~~~L~~~L~~~L~~~~L~~Ly~eIE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~~Le~~I~~~A 521 (870) T PRK05929 442 PAQYFGEFVSYLPIIKNYLLEELEEKGLKDIFSNVEMPLEKVLFAMERAGMPLDVEGLAVLERDLSKELAVLTQEIYDLA 521 (870) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99999999999999999999999887799999998744899999987389898199999899999999999999999972 Q ss_pred CCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHCC-CCCCC Q ss_conf 576688885688999999972899988896588999974-4898548999999999754300035668873056-88452 Q gi|254780127|r 229 DGIIRSSRQTYALRMYLFLITGIDLVDMSEGTLKSILSH-SNITQLAKDLILNRLASSGSAILKLNTLSEAVSS-DGRLR 306 (675) Q Consensus 229 g~~~~n~~S~~ql~~~L~~~~g~~~~~~~k~~~~~~l~~-~~~~~~~~~ll~~r~~~~~~~~~~~~~l~~~~~~-dGRih 306 (675) |.++ |+|||+||+++||+++|+|+.+.+|+|..++|+. ...+|.+..+++||...|+.+ +|+++|.+.+++ ||||| T Consensus 522 G~eF-NinSPKQL~evLFekLgL~~~kKtkST~~evLe~L~~~hpi~~~iLeyR~l~KL~s-TYvd~L~~~i~~~tgRIH 599 (870) T PRK05929 522 GCPF-NIKSPKQLSDILYNRLGLRPIDKAKSTKAEVLEALEGEHEIIEKILAFRAIEKLLS-TYVKALPKQIDSHTQRIH 599 (870) T ss_pred CCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHCCCCCEEC T ss_conf 9998-87898999999898639998888898728999974060026899999858998750-467775542256768777 Q ss_pred CCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEE-ECCCCCEEEEEECCCCC Q ss_conf 20021132210011106888554777766523456544310244455444322101321214-76998789997335485 Q gi|254780127|r 307 GTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCI-IASNGKKLVVADLAGIE 385 (675) Q Consensus 307 ~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f-~a~~G~~lv~aDySqiE 385 (675) ++|+|.+ |.|||+||++||||||| ++++.|+.||+|| +|++|++|++||||||| T Consensus 600 TsfnQt~-TaTGRLSSs~PNLQNIP------------------------iRte~Gr~IR~aF~~~~~g~~llsaDYSQIE 654 (870) T PRK05929 600 PTFNQVG-TVTGRLACQDPNLQNIP------------------------IRSERGRLLRKAFRLTKKNDYFLSADYSQIE 654 (870) T ss_pred CEEECCC-CCCCCCCCCCCCCCCCC------------------------CCCCCCHHHHHHEEECCCCCEEEEECHHHHH T ss_conf 6233110-11132246899765367------------------------7541115678751328999779961266889 Q ss_pred EEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCCCCCHHHHH Q ss_conf 05886622876786654044410000157760875110570221006421001330788778988788708886403666 Q gi|254780127|r 386 ARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKTMASHCGLDLQQFSQN 465 (675) Q Consensus 386 lRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~ 465 (675) ||||||+|+|+.|++||++|.|||+.||+.+||+|.++||+++|+.||++|||++||||+.+|.+ T Consensus 655 LRilAHlS~D~~Li~AF~~g~DIH~~tA~~if~v~~~~Vt~~~Rr~AK~vNFGiiYG~s~fgLa~--------------- 719 (870) T PRK05929 655 LRFLAHLSQDESLKRAFESGEDIHAFTASQVFHVPLEQVTKQQRMQAKTVNFGIVYGQQAYGLSK--------------- 719 (870) T ss_pred HHHHHHHHCCHHHHHHHHCCCHHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH--------------- T ss_conf 99999983678689875056207665269884898121698789886883545675658788998--------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 66566676547889999999998751110103108897889999998673078999999999999963372142013222 Q gi|254780127|r 466 VKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEGGSISARRVAN 545 (675) Q Consensus 466 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (675) ..+++..||+.+++.||++||+|+.|...+.+.|+. +| T Consensus 720 -----------------------------~L~is~~eA~~~i~~Yf~~yp~v~~y~~~~~~~a~~---~G---------- 757 (870) T PRK05929 720 -----------------------------ILKISVSEAQGLIDAYFARYPEAAEFITETIEQASK---NL---------- 757 (870) T ss_pred -----------------------------HHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH---CC---------- T ss_conf -----------------------------849899999999999998691299999999999997---69---------- Q ss_pred CCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC----- Q ss_conf 344321244564112246311310245201233203100123577653355777674798999999999999967----- Q gi|254780127|r 546 VPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILCEGMKNATKN----- 620 (675) Q Consensus 546 ~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~am~~l~~~----- 620 (675) ++++..|||++.|++...+ ..+...+|+ ++|+|||||||||+|.||+++++. T Consensus 758 -------------yv~Tl~gRrr~ip~i~~~~----~~r~~~eR~------AiN~piQGtAADiiK~AMi~i~~~l~~~~ 814 (870) T PRK05929 758 -------------RVTTMLGRERIIDDWEEFP----GSRAASGRL------AVNTRIQGSAAELIKLAMLNISDALKSRG 814 (870) T ss_pred -------------CEECCCCCCCCCCCCCCCH----HHHHHHHHH------HHCCCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf -------------6767688600288656774----567788898------81876664899999999999999998569 Q ss_pred -CCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf -98189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 621 -GYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 621 -g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) ..+|+||||||||||||+++++++.. |+++|+.+.. ..+||.+|..+|+.|- T Consensus 815 ~~~~mllQVHDELvfEv~~~e~~~~~~~v~~~Me~~~~--l~VpL~v~~~~g~nW~ 868 (870) T PRK05929 815 LKSRMLLQIHDELLFEVPEEELEEMQRLVREKMESAMT--LSVPLVVNILIGKNWA 868 (870) T ss_pred CCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCCC T ss_conf 87649998530678842788999999999999727203--6887472228188853 No 9 >PRK07456 consensus Probab=100.00 E-value=0 Score=852.09 Aligned_cols=540 Identities=21% Similarity=0.231 Sum_probs=440.8 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEEC--CCEEEEEECCCCC--------HHHHHHHHHHCCCCCEEEEEE Q ss_conf 899980406788700046000026882999999983--8228999788842--------217999998648898199863 Q gi|254780127|r 3 KLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWD--DEPVKLWDRTEQS--------AMPSDLLQYLRDETVMCVAHN 72 (675) Q Consensus 3 ~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~--~~~~~~~d~~~~~--------~~~~~l~~~L~~~~~~kv~HN 72 (675) .+++|+||.|...+ +++++|+++||. .+..+|+|+.|.. .+...|+++|+++++.|++|| T Consensus 384 ~vA~DtETtsLdp~----------~A~lVGis~s~~~~~~~a~YIPl~h~~~~~~l~~~~vl~~LkpiLed~~i~KighN 453 (975) T PRK07456 384 PVAWDTETTSLNPR----------DAELVGIGCCWGEELDDLAYIPLGHQQAGEQLPLETVLEALRPWLESPEHPKALQN 453 (975) T ss_pred EEEEECCCCCCCHH----------HCCEEEEEEECCCCCCCEEEEECCCCCCHHHCCHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 58998046887742----------32102666411557872699773455613105799999999998628875310211 Q ss_pred HHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCC Q ss_conf 077599999862998786504545889998588--999789999-80889865276637657664100233201345311 Q gi|254780127|r 73 SLFERILFKKTLGIDIPSKRWICTSVLARTNGL--PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTR 149 (675) Q Consensus 73 a~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~--p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~ 149 (675) +|||+.+|.++ |+++.. ..+||||+||++++ .++|+.+++ +|++....+++..||.. .. T Consensus 454 lKyD~~vL~~~-GI~l~g-~~fDTmLAsYLLnP~~~h~Ld~La~~yL~~~~i~~edl~gkgk----------------~~ 515 (975) T PRK07456 454 AKFDRLILLRH-GIALKG-VVFDTLLADYLLNPEAKHNLDDLAQRELGFTPTSYKDLVGKGQ----------------TF 515 (975) T ss_pred HHHHHHHHHHC-CCCCCC-CCCCHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCHHHHCCCCC----------------CC T ss_conf 14679999976-951067-5513999998768765589899999971888774998628887----------------60 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCH------------HHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHH Q ss_conf 25876899989887753999999998720021------------431017899998751684308899999999999999 Q gi|254780127|r 150 ANHVQAWQLFGEYCKRDVEATRELFKRLIPLS------------DGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQER 217 (675) Q Consensus 150 ~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~~------------~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~ 217 (675) .. .+.+....||+.|++++++|+..|.+.+ +.|+++..++.+||.+||.||.+.++++.+++..+. T Consensus 516 ~~--v~~~~~~~ya~~dA~~~~~L~~~l~~~L~~~~~~L~~l~~~iE~PLi~VLa~ME~~GI~vD~~~L~~l~~~l~~~l 593 (975) T PRK07456 516 AD--VDIATASQYCGMDVYLTYRLAPILREQLAKTGPELIKLLEEVELPLEPVLAEMEATGIRIDIPYLKELSQELESTL 593 (975) T ss_pred CC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 20--8978999999999999999999999999862369999999987369999999997185016999999999999999 Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCC--CCH---HHHHHHHHH-CCCCCHHHHHHHHHHHHHHHCCHH Q ss_conf 99999998630576688885688999999972899988--896---588999974-489854899999999975430003 Q gi|254780127|r 218 KKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVD--MSE---GTLKSILSH-SNITQLAKDLILNRLASSGSAILK 291 (675) Q Consensus 218 ~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~--~~k---~~~~~~l~~-~~~~~~~~~ll~~r~~~~~~~~~~ 291 (675) .++++++++++|.++ |+|||+||+++||+.+|+|.++ .+| +|.++.|+. ...+|.++.+++||+..|+.+ +| T Consensus 594 ~~le~~i~~~aG~eF-NinSpkQL~evLFekL~L~~~k~~KtKtGySTd~~vLe~L~~~hpi~~~il~~R~l~Kl~s-TY 671 (975) T PRK07456 594 ERLEQQAYELAGEDF-NLSSPKQLGELLFEKLGLDRKKSRKTKTGYSTDAAVLEKLEGDHPIVPLILEHRTLSKLKS-TY 671 (975) T ss_pred HHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHH-HH T ss_conf 999999998648988-9899899999989863999877777889988629999975204727999999878999999-99 Q ss_pred HHHHHHHHCCC-CCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEEC Q ss_conf 56688730568-84522002113221001110688855477776652345654431024445544432210132121476 Q gi|254780127|r 292 LNTLSEAVSSD-GRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIA 370 (675) Q Consensus 292 ~~~l~~~~~~d-GRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a 370 (675) +++|.+.++++ ||||++|+|.+ |.|||+||++||||||| ++++.|+.||+||+| T Consensus 672 ~~~L~~~i~~~tgriht~f~Q~~-t~TGRlSS~~PNLQNIP------------------------~r~~~g~~iR~~Fv~ 726 (975) T PRK07456 672 VDALPQLVEAETGRVHTDFNQAV-TATGRLSSSNPNLQNIP------------------------IRTEFSRRIRKAFLP 726 (975) T ss_pred HHHHHHHHCCCCCEECCHHHHHC-CCCCCCCCCCCCCCCCC------------------------CCCCCCHHHHEEECC T ss_conf 99988752877587657555410-66565047897513577------------------------764103123412435 Q ss_pred CCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHH Q ss_conf 99878999733548505886622876786654044410000157760875110570221006421001330788778988 Q gi|254780127|r 371 SNGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKT 450 (675) Q Consensus 371 ~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~ 450 (675) +|||+|++|||||||||||||+|+|+.|++||++|.|+|+.||+.+|++ ++|++++|+.||++|||++||||+.+|.+ T Consensus 727 ~~g~~l~saDySQiELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~--~~v~~~~R~~AK~infgiiYG~~~~gLa~ 804 (975) T PRK07456 727 QEGWLLLSADYSQIELRILAHLSQEPVLLEAYRSGDDVHTLTARLLFEK--EEISSDERRLGKTINFGVIYGMGAQRFAR 804 (975) T ss_pred CCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHCCC--CCCCHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 8998899963208799999998588899998717998689999986393--01999999997898876776468778898 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 78870888640366666566676547889999999998751110103108897889999998673078999999999999 Q gi|254780127|r 451 MASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFAC 530 (675) Q Consensus 451 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~ 530 (675) ..+++..||+.+++.||+.||+|..|...+.+.|+. T Consensus 805 --------------------------------------------~l~is~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~ 840 (975) T PRK07456 805 --------------------------------------------ETGVSQSEAKEFLSRYKERYPKVFAFLELQERLALS 840 (975) T ss_pred --------------------------------------------HHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH T ss_conf --------------------------------------------859799999999999998690199999999999997 Q ss_pred HHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHH----------CCCHHHHHHHHHHHHHHHHH Q ss_conf 96337214201322234432124456411224631131024520123320----------31001235776533557776 Q gi|254780127|r 531 VVQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYL----------NTATSQLMRERTYGGKLTEN 600 (675) Q Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~----------~~~~~~~~r~~~~g~k~~n~ 600 (675) +| ++++..|||++++..+...... +...+...+ +..-..++|+ T Consensus 841 ---~g-----------------------yv~Tl~gRrr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~R~A~N~ 893 (975) T PRK07456 841 ---KG-----------------------YVETILGRRRYFHFDKNGLGRLLGKDPEEIDLDVARRAGME-AQQLRAAANA 893 (975) T ss_pred ---CC-----------------------CEECCCCCEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH-HHHHHHHHCC T ss_conf ---69-----------------------88766677004644443100013568221340656667777-5888988370 Q ss_pred HHHHHHHHHHHHHHHHHHHC------CCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCC Q ss_conf 74798999999999999967------98189995358999678478999999-999846763021698415730311463 Q gi|254780127|r 601 IVQAISRDILCEGMKNATKN------GYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRY 673 (675) Q Consensus 601 ~iQgtAaDi~k~am~~l~~~------g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y 673 (675) |||||||||+|.||++++++ ..+++||||||||||||+++++++.. |+++|+++.+ ..+||.++..+|+.| T Consensus 894 piQGtaADiiK~AMi~~~~~l~~~~~~~~~llqvHDElifev~~~~~~~~~~~v~~~Me~a~~--l~Vpl~v~~~~G~nW 971 (975) T PRK07456 894 PIQGSSADIIKIAMVQLQKKLLEQNLPARLLLQVHDELVLEVPPEELEELKPLVKSTMENAVK--LSVPLVVEIGAGDNW 971 (975) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCH T ss_conf 244679999999999999999737988538998643778953688999999999999827435--788758411738896 Q ss_pred C Q ss_conf 1 Q gi|254780127|r 674 R 674 (675) Q Consensus 674 ~ 674 (675) . T Consensus 972 ~ 972 (975) T PRK07456 972 M 972 (975) T ss_pred H T ss_conf 8 No 10 >PRK07898 consensus Probab=100.00 E-value=0 Score=850.27 Aligned_cols=532 Identities=21% Similarity=0.184 Sum_probs=443.1 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCH-HHHHHHHHHCCCCCEEEEEEHHHHHHHH Q ss_conf 789998040678870004600002688299999998382289997888422-1799999864889819986307759999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSA-MPSDLLQYLRDETVMCVAHNSLFERILF 80 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~-~~~~l~~~L~~~~~~kv~HNa~FD~~vL 80 (675) +.+++|+||.+. ++ ++++.++++++.++..++++..+..+ ....+.++|+++++.|++||+|||+.+| T Consensus 338 ~~~a~~~~~t~~---------~~--~a~lvgisl~~~~g~a~yip~~~~~~~~~~~L~~~Led~~i~KIghNlK~d~~vL 406 (902) T PRK07898 338 RRLGLAVVGTHL---------PG--DGDATALAIAAADGHAAYIDTADLTPDDEAALAAWLADPARPKALHEAKGAMHAL 406 (902) T ss_pred CEEEEEEECCCC---------CC--CCCEEEEEEEEECCEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEECHHHHHHHH T ss_conf 869999853678---------66--5738999999538808997220179789999999860877654554048999999 Q ss_pred HHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHH Q ss_conf 9862998786504545889998588---9997899998-08898652766376576641002332013453112587689 Q gi|254780127|r 81 KKTLGIDIPSKRWICTSVLARTNGL---PSSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAW 156 (675) Q Consensus 81 ~~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~ 156 (675) .++ |+.+.. ..+||||+||++++ +++|+.++.+ |+..... ...++.. .... ...- T Consensus 407 ~~~-Gi~l~g-~~fDTmLAaYLLdp~~~~~~Ld~La~~yL~~~~~~--~~~~~~~---------------~~~~--~~~~ 465 (902) T PRK07898 407 AGR-GWTLAG-VTSDTALAAYLVRPGQRSFTLDDLSLRYLRRELRA--EAPAQQQ---------------LSLD--DAVD 465 (902) T ss_pred HHC-CCCCCC-CCHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCC--CCCCCCC---------------CCCC--CCCH T ss_conf 966-986577-53228999987568878787999999974976651--5653543---------------4713--2676 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCC----------HHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 998988775399999999872002----------1431017899998751684308899999999999999999999986 Q gi|254780127|r 157 QLFGEYCKRDVEATRELFKRLIPL----------SDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQERKKLDEDIVK 226 (675) Q Consensus 157 ~~~~~Ya~~Dv~~~~~L~~~L~~~----------~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~~~~~~~i~~ 226 (675) +....|++.|+.++++||..|.+. .+.|+++..++.+||.+||.||.+.++++..++..+..++++++++ T Consensus 466 ~~~~~yaa~dA~~~~~L~~~l~~~L~~~~l~~L~~~IE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~~le~~I~~ 545 (902) T PRK07898 466 AAAAQTLILRARAVLDLADALDAELARIDSAALLRDMELPLQRVLAEMERAGIAVDLAHLTELQSQFAGQVRDAAEAAYA 545 (902) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999999999999999861655899999976058999999985081981999998999999999999999999 Q ss_pred HCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCH---HHHHHHHHHC---CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHC Q ss_conf 30576688885688999999972899988896---5889999744---89854899999999975430003566887305 Q gi|254780127|r 227 LTDGIIRSSRQTYALRMYLFLITGIDLVDMSE---GTLKSILSHS---NITQLAKDLILNRLASSGSAILKLNTLSEAVS 300 (675) Q Consensus 227 ~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~~k---~~~~~~l~~~---~~~~~~~~ll~~r~~~~~~~~~~~~~l~~~~~ 300 (675) ++|.++ |+|||+||+++||+.+|+|..+.+| +|.++.|+.. ..+|.++.+++||...|+.+| ++++.+.++ T Consensus 546 ~aG~eF-NinSpKQL~eiLFdkL~Lp~~KKtKtGySTd~~vLe~L~~~~~h~i~~~iLeyR~l~KL~sT--~d~L~~~i~ 622 (902) T PRK07898 546 VIGKQI-NLGSPKQLQVVLFDELGMPKTKRTKTGYTTDADALQSLADKTGHPFLQHLLAHRDATKLKVT--VDGLIKAVA 622 (902) T ss_pred HHCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH--HHHHHHHHC T ss_conf 818956-89998999999899709998887889987889999999855799269999999999899989--988898726 Q ss_pred CCCCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEECCCCC-EEEEE Q ss_conf 68845220021132210011106888554777766523456544310244455444322101321214769987-89997 Q gi|254780127|r 301 SDGRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIASNGK-KLVVA 379 (675) Q Consensus 301 ~dGRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G~-~lv~a 379 (675) +|||||++|+|.+ |.|||+||++||||||| ++++.|+.||+||+|++|+ +|++| T Consensus 623 ~~gRIHtsfnQ~~-T~TGRLSS~~PNLQNIP------------------------ir~~~G~~iR~aFv~~~g~~~~~sa 677 (902) T PRK07898 623 DDGRIHTTFNQTV-AATGRLSSTDPNLQNIP------------------------IRTEAGRRIRDAFVVGEGYETLLTA 677 (902) T ss_pred CCCCCCCHHHHHH-CCCCEECCCCCCCCCCC------------------------CCCCCHHHHHHEEECCCCCCEEEEC T ss_conf 5772301265421-21120025899766777------------------------7661015567604358998467740 Q ss_pred ECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCCCC Q ss_conf 33548505886622876786654044410000157760875110570221006421001330788778988788708886 Q gi|254780127|r 380 DLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKTMASHCGLDL 459 (675) Q Consensus 380 DySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~~~ 459 (675) ||||||||||||+|+|+.|++||++|.|+|+.||+.+||+|.++|++++|+.||++|||++||||+.+|.+ T Consensus 678 DYSQiELRilAh~s~d~~l~~af~~~~DiH~~tA~~if~v~~~~vt~~~R~~AK~vNfgiiYG~~~~gLs~--------- 748 (902) T PRK07898 678 DYSQIEMRIMAHLSGDEGLIEAFNSGEDLHSFVASRVFGVPIDEVTPELRRRVKAMSYGLAYGLSAYGLSQ--------- 748 (902) T ss_pred CHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH--------- T ss_conf 41367999999980889999998449988899999984999254899999898887747766758889998--------- Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 40366666566676547889999999998751110103108897889999998673078999999999999963372142 Q gi|254780127|r 460 QQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEGGSIS 539 (675) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~~ 539 (675) ..+++..||+.+++.||..||+|..|...+.+.++. +| T Consensus 749 -----------------------------------~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~---~g---- 786 (902) T PRK07898 749 -----------------------------------QLKISTEEAKELMDAYFARFGGVRDYLREVVEQARK---DG---- 786 (902) T ss_pred -----------------------------------HHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH---CC---- T ss_conf -----------------------------------729899999999999998690699999999999997---79---- Q ss_pred ECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 01322234432124456411224631131024520123320310012357765335577767479899999999999996 Q gi|254780127|r 540 ARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILCEGMKNATK 619 (675) Q Consensus 540 ~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~am~~l~~ 619 (675) ++++..|||++.|++...+.... ...+|+ ++|+|||||||||+|.||+++++ T Consensus 787 -------------------yv~Tl~gRrr~~p~i~s~n~~~r---~~~eR~------A~N~piQGtaADiiK~AMi~i~~ 838 (902) T PRK07898 787 -------------------YTETLFGRRRYLPDLTSDNRQRR---EAAERA------ALNAPIQGSAADIIKVAMIRVDR 838 (902) T ss_pred -------------------EEECCCCCCCCCCCCCCCCHHHH---HHHHHH------HHCCCCCHHHHHHHHHHHHHHHH T ss_conf -------------------06787877113887778897789---899999------82866660699999999999999 Q ss_pred C------CCCEEEEECCEEEEECCCCHHHHHHHH-HHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 7------981899953589996784789999999-998467630216984157303114631 Q gi|254780127|r 620 N------GYDIVLTVHDEIVSETPDTPYFSVGTL-CSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 620 ~------g~~ivl~VHDElv~evp~~~~~~~~~i-~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) . +.+|+||||||||||||+++++++.++ +++|+++.+ ..+||.+|..+|+.|- T Consensus 839 ~l~~~~l~~~~lLQvHDELi~ev~~~e~e~~~~~v~~~Me~~~~--L~VPL~v~~~~G~nW~ 898 (902) T PRK07898 839 ALREAGLKSRMLLQVHDELVLEVAPGEREQVEALVRDEMGSAYD--LRVPLEVSVGYGRSWD 898 (902) T ss_pred HHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 99846998738998641788940788999999999999728631--7787572007386978 No 11 >PRK08928 consensus Probab=100.00 E-value=0 Score=850.57 Aligned_cols=497 Identities=20% Similarity=0.209 Sum_probs=423.0 Q ss_pred CCEEEEEEEEECCCEEEEEECCCC-CHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC Q ss_conf 882999999983822899978884-2217999998648898199863077599999862998786504545889998588 Q gi|254780127|r 27 QAVITLCAYAWDDEPVKLWDRTEQ-SAMPSDLLQYLRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLARTNGL 105 (675) Q Consensus 27 ~~~I~~i~~a~~~~~~~~~d~~~~-~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~ 105 (675) +..+.++++|++.+..++++..+. ......+.++|+++++.|++||+|||+.+|.++ |+.+. .++|||++||+|++ T Consensus 335 ~~~~vgis~s~~~~~~~~i~~~~~~~~~~~~l~~lLed~~i~KIghNlK~dl~vL~~~-gi~l~--~i~DTmLasYLLnp 411 (861) T PRK08928 335 KGTNNAISLSLQNQSYIIIISNEITDAFSDIIFNLLTDKSILKITYDLKPLLKFYANQ-SHDIT--AIDDLMLMSYSLSA 411 (861) T ss_pred CCEEEEEEEEECCCEEEEECCCCCHHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHC-CCCCC--CCCCHHHHHHHCCC T ss_conf 8738999999559849995575424789999999843999754454579999999856-96757--63718899986499 Q ss_pred C---CCHHHHHHHHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC--- Q ss_conf 9---9978999980889865276637657664100233201345311258768999898877539999999987200--- Q gi|254780127|r 106 P---SSLKNACLALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIP--- 179 (675) Q Consensus 106 p---~sL~~la~~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~--- 179 (675) . ++|+.++.++.. ...|++.|+..+++|++.|.+ T Consensus 412 ~~~~h~Ld~La~~~~~----------------------------------------~~~~aa~~a~~~~~L~~~L~~~L~ 451 (861) T PRK08928 412 GLHDHSLFNIILKNTN----------------------------------------IINESAKIVCTLISLHKQLILELF 451 (861) T ss_pred CCCCCCHHHHHHHHCC----------------------------------------CCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 9876898999976222----------------------------------------001479999999999999999998 Q ss_pred -------CHHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf -------2143101789999875168430889999999999999999999998630576688885688999999972899 Q gi|254780127|r 180 -------LSDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQERKKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGID 252 (675) Q Consensus 180 -------~~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~ 252 (675) ..+.|+++..++.+||.+||.||.+.++++.+++..+.++++++|++++|.+| |+|||+||+++||+++|+| T Consensus 452 ~~~L~~Ly~eIE~PLi~VLa~ME~~GI~VD~~~L~~ls~el~~~l~~Le~~I~~lAG~eF-NInSPKQLgeILFekL~Lp 530 (861) T PRK08928 452 DNKLFTIYYRIDLPLCFVLDKMEKIGIKVDANYLKQLSAEFGTEILKLEEEIYKLSGTKF-NIASPKQLGEILFEKMGLP 530 (861) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHCCCC T ss_conf 866899999988779999999998196016999999999999999999999999749874-7599999999979972998 Q ss_pred CCCCC-----HHHHHHHHHHC-CC-CCHHHHHHHHHHHHHHHCCHHHHHHHHHHCC-CCCCCCCEEECCCCCCHHHHCCC Q ss_conf 98889-----65889999744-89-8548999999999754300035668873056-88452200211322100111068 Q gi|254780127|r 253 LVDMS-----EGTLKSILSHS-NI-TQLAKDLILNRLASSGSAILKLNTLSEAVSS-DGRLRGTLQFYGASRTGRWSGCV 324 (675) Q Consensus 253 ~~~~~-----k~~~~~~l~~~-~~-~~~~~~ll~~r~~~~~~~~~~~~~l~~~~~~-dGRih~~~~~~gAt~TGR~S~~~ 324 (675) ..+.+ -+|.+++|+.. .+ +|.++.+++||...|+.|| |+++|.+.+++ |||||++|+|.+ |.|||+||++ T Consensus 531 ~~KKt~KtG~ySTda~vLe~La~~~h~i~~~ILeyR~l~KLkST-Yvd~L~~~i~~~tgRIHTsfnQt~-taTGRLSS~~ 608 (861) T PRK08928 531 KGKKSAKSGSYSTDAEVLEKLSEHGVEIADLLLKWRHLTKLKNT-YTDSLPKQIDNNTGRVHTTFSQTS-TTTGRLSSSN 608 (861) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH-HHHHHHHHHCCCCCEECCCHHHHH-HHCCEECCCC T ss_conf 67767788996854999998644179148999999899887517-888888874279985725210454-3114000368 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEECCCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHC Q ss_conf 88554777766523456544310244455444322101321214769987899973354850588662287678665404 Q gi|254780127|r 325 FQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIASNGKKLVVADLAGIEARVLAWIAGEDWKIKAFAN 404 (675) Q Consensus 325 pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~ 404 (675) ||||||| ++++.|+.||+||+|++||+|++|||||||||||||+|+|+.|++||++ T Consensus 609 PNLQNIP------------------------iRte~Gr~iR~aFva~~g~~l~saDYSQIELRilAhlS~D~~Li~AF~~ 664 (861) T PRK08928 609 PNLQNIP------------------------IRSEEGNKIRQAFIAEKGYKLISADYSQIELRILAHIANVDAFKQAFIN 664 (861) T ss_pred CCCCCCC------------------------CCCCCHHHHHEEECCCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHCC T ss_conf 8735677------------------------7762021412403469998899853778899999998178899998606 Q ss_pred CCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 44100001577608751105702210064210013307887789887887088864036666656667654788999999 Q gi|254780127|r 405 GEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLW 484 (675) Q Consensus 405 g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (675) |.|||+.||+.+||+|.++||+++|+.||++|||++||||+.+|++ T Consensus 665 ~~DIH~~TA~~vf~v~~~~vt~~~Rr~AK~vNFGiiYG~s~fgLs~---------------------------------- 710 (861) T PRK08928 665 GDDIHTQTACQIFNLQKHELDSELRRKAKSINFGIIYGISAFGLAK---------------------------------- 710 (861) T ss_pred CCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH---------------------------------- T ss_conf 8866688888873898211898899999999999887568888998---------------------------------- Q ss_pred HHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCC Q ss_conf 99987511101031088978899999986730789999999999999633721420132223443212445641122463 Q gi|254780127|r 485 MQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEGGSISARRVANVPRLVMRRHKRDVHIVLPS 564 (675) Q Consensus 485 ~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~ps 564 (675) ..+++..||+.+++.||++||+|+.|...+.+.|+. +| ++++.. T Consensus 711 ----------~l~i~~~eA~~~i~~Yf~~yp~v~~~~~~~~~~a~~---~G-----------------------yv~Tl~ 754 (861) T PRK08928 711 ----------QLNITNSEASEYIKKYFAEYPEIQEYMEQTKAYARS---NG-----------------------YVTTFF 754 (861) T ss_pred ----------HHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH---CC-----------------------CEECCC T ss_conf ----------829899999999999999681499999999999997---79-----------------------876888 Q ss_pred CHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--CCCCEEEEECCEEEEECCCCHHH Q ss_conf 1131024520123320310012357765335577767479899999999999996--79818999535899967847899 Q gi|254780127|r 565 SRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILCEGMKNATK--NGYDIVLTVHDEIVSETPDTPYF 642 (675) Q Consensus 565 GR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~am~~l~~--~g~~ivl~VHDElv~evp~~~~~ 642 (675) |||++.|++.+.+.... ...+|+ ++|+|||||||||+|.||+++++ ...+|+||||||||||||+++++ T Consensus 755 gRrr~~p~i~s~n~~~r---~~~eR~------A~N~piQGtAADiiK~AMi~~~~~l~~~~mllQVHDEli~E~~~~~~~ 825 (861) T PRK08928 755 GRKCFIPLINSKKHTLR---QFAERA------AINAPIQGTAADIIKIAMINLDDELEKGRLILQVHDELLFEVPEDEVD 825 (861) T ss_pred CCEECCCCCCCCCHHHH---HHHHHH------HHCHHHHHHHHHHHHHHHHHHHHHHHHCEEEEEEEEEEEEEECHHHHH T ss_conf 86032875557787788---589999------975121288999999999999998624716874544678964688999 Q ss_pred HHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 9999-9998467630216984157303114631 Q gi|254780127|r 643 SVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 643 ~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) ++.+ |+++|+++.. ..+||.+|..+|+.|- T Consensus 826 ~~~~~~~~~Me~a~~--l~Vpl~v~~~~g~nW~ 856 (861) T PRK08928 826 TVAKLIKEIMENSTN--MSVPLKVEIRIGDNWG 856 (861) T ss_pred HHHHHHHHHHCCCCC--CCCCEEEECCCCCCHH T ss_conf 999999999628743--7787684107589978 No 12 >PRK08076 consensus Probab=100.00 E-value=0 Score=841.14 Aligned_cols=531 Identities=18% Similarity=0.186 Sum_probs=439.3 Q ss_pred EEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHH Q ss_conf 99980406788700046000026882999999983822899978884221799999864889819986307759999986 Q gi|254780127|r 4 LFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFKKT 83 (675) Q Consensus 4 l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~ 83 (675) .++|+||.++.. .++.+.|++++.+++ .++++.... ...+.++++|+++++.|++||+|||+.+|.++ T Consensus 317 ~a~~~Et~~~~~----------~~a~lvGis~s~~~~-a~yiP~~~~-~~~~~lk~~Led~~i~KighNlK~dl~vL~~~ 384 (877) T PRK08076 317 SALVVEVLEDNY----------HKADIQGFAIVNENG-CYFIPTEIA-LKSDAFKEWLEDEEKKKWVFDAKRAIVALKWN 384 (877) T ss_pred CEEEEEECCCCC----------CCCCEEEEEEECCCC-EEEECCCHH-HHHHHHHHHHCCCCCCEEEECHHHHHHHHHHC T ss_conf 549999647885----------637789999985894-599550044-21589999853976540265479999999968 Q ss_pred CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHH-CCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH Q ss_conf 2998786504545889998588---99978999980-8898652766376576641002332013453112587689998 Q gi|254780127|r 84 LGIDIPSKRWICTSVLARTNGL---PSSLKNACLAL-NFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLF 159 (675) Q Consensus 84 lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~l-gl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (675) |+++.. ..+||||+||++++ .++|+.+++.+ +......++..|+...+ ..+. -... T Consensus 385 -GI~l~g-i~FDTMLAAYLLdP~~~~~~Ld~La~~yl~~~~~~~e~v~Gkg~k~-----------------~~p~-~~~~ 444 (877) T PRK08076 385 -GIDLQG-IDFDLLLAAYLLNPADSDDDFASVAKMKETHAVKSDEAVYGKGAKR-----------------AVPE-EEIL 444 (877) T ss_pred -CCCCCC-CCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCCC-----------------CCCC-HHHH T ss_conf -964477-6402999998718876766699999985577776347653765222-----------------4877-8999 Q ss_pred HHHHHHHHHHHHHHHHHHCCC----------HHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 988775399999999872002----------1431017899998751684308899999999999999999999986305 Q gi|254780127|r 160 GEYCKRDVEATRELFKRLIPL----------SDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQERKKLDEDIVKLTD 229 (675) Q Consensus 160 ~~Ya~~Dv~~~~~L~~~L~~~----------~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~~~~~~~i~~~~g 229 (675) .+||+.|++++++||..|.+. .+.|+++..++.+||.+||.||.+.++++..++.++..++++++++++| T Consensus 445 ~eYaaedA~~t~~L~~~L~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~VD~~~L~~ls~~l~~~l~~le~~I~~laG 524 (877) T PRK08076 445 AEHLARKAHAIYDLKQTFVEELEKNEQLELFEELELPLARVLAEMEVKGVKVDTERLENMGEELAGRLKELEQEIHELAG 524 (877) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999999999999999999986557899999997048899999985483898999999999999999999999999728 Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCCCCCCH---HHHHHHHHH-CCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHCCC-CC Q ss_conf 76688885688999999972899988896---588999974-48985489999999997543000356688730568-84 Q gi|254780127|r 230 GIIRSSRQTYALRMYLFLITGIDLVDMSE---GTLKSILSH-SNITQLAKDLILNRLASSGSAILKLNTLSEAVSSD-GR 304 (675) Q Consensus 230 ~~~~n~~S~~ql~~~L~~~~g~~~~~~~k---~~~~~~l~~-~~~~~~~~~ll~~r~~~~~~~~~~~~~l~~~~~~d-GR 304 (675) .++ |+|||+||+++||+.+|+|+.+.+| +|.+++|+. .+.+|.+..+++||...|+.+ +|+++|.+.++++ || T Consensus 525 ~eF-NinSpKQL~~ILFekL~Lp~~KKTKtG~STd~~vLe~L~~~h~i~~~iLeyR~l~KL~s-TY~d~L~~~i~~~tgR 602 (877) T PRK08076 525 EEF-NINSPKQLGVILFEKLGLPVIKKTKTGYSTSADVLEKLADKHEIIPNILHYRQLGKLQS-TYIEGLLKVVHKDTHK 602 (877) T ss_pred CCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHH-HHHHHHHHHHCCCCCE T ss_conf 987-88999999999998538988887889987869999975432854799999989988874-8999988741877545 Q ss_pred CCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEEC-CCCCEEEEEECCC Q ss_conf 522002113221001110688855477776652345654431024445544432210132121476-9987899973354 Q gi|254780127|r 305 LRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIA-SNGKKLVVADLAG 383 (675) Q Consensus 305 ih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a-~~G~~lv~aDySq 383 (675) ||++|+|.+ |.|||+||++||||||| ++++.|+.||+||+| +|||+|++||||| T Consensus 603 iHt~fnQ~~-t~TGRLSS~~PNLQNIP------------------------irt~~G~~IR~aFi~~~~g~~l~saDYSQ 657 (877) T PRK08076 603 VHTRFNQAL-TQTGRLSSTDPNLQNIP------------------------IRLEEGRKIRQAFVPSEEGWIMFAADYSQ 657 (877) T ss_pred EECCCCHHC-CCCCCCCCCCCCCCCCC------------------------CCCHHHHHHHHEEEECCCCCEEEEECHHH T ss_conf 630100000-45564025787523677------------------------77644577876143079996799832668 Q ss_pred CCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCCCCCHHH Q ss_conf 85058866228767866540444100001577608751105702210064210013307887789887887088864036 Q gi|254780127|r 384 IEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKTMASHCGLDLQQFS 463 (675) Q Consensus 384 iElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~ 463 (675) ||||||||+|+|+.|++||++|.|+|+.||+.|||+|.++|++++|+.||++|||++||||+.+|.+ T Consensus 658 IELRilAh~s~d~~l~~af~~~~DiH~~tA~~if~~~~~~vt~~~R~~AK~infGiiYG~~~~gLs~------------- 724 (877) T PRK08076 658 IELRVLAHIAEDENLIEAFQHDMDIHTKTAMDVFHVEKDEVTSNMRRQAKAVNFGIVYGISDYGLSQ------------- 724 (877) T ss_pred HHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHH------------- T ss_conf 8999999982789999987159757588889984898010899999998898877876558788998------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 66665666765478899999999987511101031088978899999986730789999999999999633721420132 Q gi|254780127|r 464 QNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEGGSISARRV 543 (675) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 543 (675) ..+++..||+.+++.||.+||+|..|...+.+.++. +| T Consensus 725 -------------------------------~l~i~~~eA~~~i~~Yf~~yp~v~~~~~~~~~~a~~---~G-------- 762 (877) T PRK08076 725 -------------------------------NLGITRKEAAEFIERYFESFPGVKEYMEDIVQEAKQ---KG-------- 762 (877) T ss_pred -------------------------------HCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH---CC-------- T ss_conf -------------------------------629899999999999999782499999999999997---68-------- Q ss_pred CCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Q ss_conf 2234432124456411224631131024520123320310012357765335577767479899999999999996---- Q gi|254780127|r 544 ANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILCEGMKNATK---- 619 (675) Q Consensus 544 ~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~am~~l~~---- 619 (675) ++++..|||++.|++.+.+.... ...+|+ ++|+|||||||||+|.||+++++ T Consensus 763 ---------------yv~Tl~gRrr~~p~i~s~n~~~r---~~~eR~------A~N~piQGtAADiiK~AMi~i~~~l~~ 818 (877) T PRK08076 763 ---------------YVTTLLHRRRYIPEITSRNFNLR---SFAERT------AMNTPIQGSAADIIKKAMIDMADRLKE 818 (877) T ss_pred ---------------CEECCCCCEEECCCCCCCCHHHH---HHHHHH------HHCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf ---------------88646777030876557787788---889999------936144567999999999999999984 Q ss_pred C--CCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 7--98189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 620 N--GYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 620 ~--g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) . ..+|+||||||||||||+++++++.. |+++|+++... .+||.+|..+|+.|- T Consensus 819 ~~~~~~mlLqVHDELvfev~~~~~~~~~~~~~~~Me~a~~L--~VPL~v~~~~G~nW~ 874 (877) T PRK08076 819 EGLQARLLLQVHDELIFEAPKEEIEKLEKLVPEVMEHAVEL--DVPLKVDYSYGPTWY 874 (877) T ss_pred CCCCCEEEEEEEEEEEEEECHHHHHHHHHHHHHHHHCCCCC--CCCEEEECCCCCCHH T ss_conf 69985189986426689616889999999999998274236--887584138387967 No 13 >PRK05797 consensus Probab=100.00 E-value=0 Score=843.25 Aligned_cols=517 Identities=19% Similarity=0.226 Sum_probs=437.9 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCC---CCHHHHHHHHHHCCCCCEEEEEEHHHHHHH Q ss_conf 89998040678870004600002688299999998382289997888---422179999986488981998630775999 Q gi|254780127|r 3 KLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTE---QSAMPSDLLQYLRDETVMCVAHNSLFERIL 79 (675) Q Consensus 3 ~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~---~~~~~~~l~~~L~~~~~~kv~HNa~FD~~v 79 (675) .+++|+||.+...+ ...+.++++++..+..+++++.+ .+.+...++++|+++++.||+||+|||+.+ T Consensus 321 ~~~~~~ett~~~~~----------~~~~vg~s~~~~~~~a~yip~~~~~~~~~~L~~Lk~lLed~~i~KIGhNlK~dl~v 390 (869) T PRK05797 321 QVYINFTLEDENLY----------SKIEIKKIFINFGEKTYYIDFKELIDEEEFIEDLKEIFENEEIKKIGHDIKNFLTI 390 (869) T ss_pred CEEEEEECCCCCHH----------HHHHEEEEEEECCCEEEEEECHHHCCHHHHHHHHHHHHHCCCCCEEEECCHHHHHH T ss_conf 87999973897612----------03041899996199799983145167789999999998489987898430699999 Q ss_pred HHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHH Q ss_conf 99862998786504545889998588---9997899998-0889865276637657664100233201345311258768 Q gi|254780127|r 80 FKKTLGIDIPSKRWICTSVLARTNGL---PSSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQA 155 (675) Q Consensus 80 L~~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~ 155 (675) |.++ |+++.. .++||||+||++++ .++|+.++++ |+.+.... T Consensus 391 L~~~-GI~l~g-i~FDTMLAsYLLnP~~~~h~Ld~La~~yL~~~~~~~-------------------------------- 436 (869) T PRK05797 391 LKKL-GIEFKG-LAFDTAIAAYLIDPSKSEYDLSDLAKEYLSKDVNSE-------------------------------- 436 (869) T ss_pred HHHC-CCCCCC-HHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCH-------------------------------- T ss_conf 9975-985566-166899999972899888899999999808777514-------------------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCC----------HHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9998988775399999999872002----------143101789999875168430889999999999999999999998 Q gi|254780127|r 156 WQLFGEYCKRDVEATRELFKRLIPL----------SDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQERKKLDEDIV 225 (675) Q Consensus 156 ~~~~~~Ya~~Dv~~~~~L~~~L~~~----------~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~~~~~~~i~ 225 (675) -..|++.|+.++++||..|.+- .+.|+++..++.+||.+||.||.+.++++..++..+..+++++|+ T Consensus 437 ---e~~ya~~dA~~~~~L~~~L~~~L~~~~l~~L~~~IE~PLi~VLa~ME~~GI~VD~~~L~~l~~~l~~~l~~le~~I~ 513 (869) T PRK05797 437 ---EEEYKIKEVSVMKELYEKLKEKIEEMDMEKLLYEVELPLVEVLASMESEGFKVDKDILDELSKKFKEEIEKTQKEIY 513 (869) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ---78999999999999999999999861289999999867899999998468099999999999999999999999999 Q ss_pred HHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCH---HHHHHHHHH-CCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHCC Q ss_conf 630576688885688999999972899988896---588999974-4898548999999999754300035668873056 Q gi|254780127|r 226 KLTDGIIRSSRQTYALRMYLFLITGIDLVDMSE---GTLKSILSH-SNITQLAKDLILNRLASSGSAILKLNTLSEAVSS 301 (675) Q Consensus 226 ~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~~k---~~~~~~l~~-~~~~~~~~~ll~~r~~~~~~~~~~~~~l~~~~~~ 301 (675) +++|.++ |+|||+||+++||+.+|+|..+.+| +|..+.|+. .+.+|.+..+++||...|+.+| |+++|...+++ T Consensus 514 ~~aG~eF-NinSPKQL~~iLFe~LgLp~~KKtKtG~STd~~vLe~L~~~hpi~~~iLeyR~l~KL~sT-Y~d~L~~~i~~ 591 (869) T PRK05797 514 ELAGEEF-NINSPKQLGKILFEKLDLPVIKKTKTGYSTNAEVLEKLRDKHPIIEKITEYRQLTKLYST-YVEGLKNVIDE 591 (869) T ss_pred HHCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHH-HHHHHHHHCCC T ss_conf 8628866-789989999998986499988878899878599999765548851999998799899988-98789974262 Q ss_pred CCCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEECCCC-CEEEEEE Q ss_conf 884522002113221001110688855477776652345654431024445544432210132121476998-7899973 Q gi|254780127|r 302 DGRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIASNG-KKLVVAD 380 (675) Q Consensus 302 dGRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G-~~lv~aD 380 (675) |||||++|+|.+ |.|||+||++||||||| ++++.|+.||+||+|++| ++|++|| T Consensus 592 ~gRiHt~f~Q~~-t~TGRLSS~~PNLQNIP------------------------~r~e~G~~iR~aFi~~~~~~~l~saD 646 (869) T PRK05797 592 DGRIHSNFNQTV-TTTGRLSSTEPNLQNIP------------------------IKYEMGREIRKVFIPNTEDSVILSAD 646 (869) T ss_pred CCCCCCHHHHHH-CCCCCCCCCCCCCCCCC------------------------CCCCCHHHHHHEEECCCCCCEEEECC T ss_conf 465054266543-44021146898645687------------------------76520122553571477761588522 Q ss_pred CCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCCCCC Q ss_conf 35485058866228767866540444100001577608751105702210064210013307887789887887088864 Q gi|254780127|r 381 LAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKTMASHCGLDLQ 460 (675) Q Consensus 381 ySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~~~~ 460 (675) |||||||||||+|+|+.|++||++|.|+|+.||+.+||+|.++|++++|+.||++|||++||||+.+|.+ T Consensus 647 YSQIELRilAh~s~d~~l~~af~~~~DiH~~tA~~vf~~~~~~v~~~~R~~AK~infGiiYG~~~~gLs~---------- 716 (869) T PRK05797 647 YSQIELRVLAHISGDENLIDAFNHHEDIHTKTASEVFKVPIEEVTPLMRSNAKAVNFGIVYGIGDFSLSQ---------- 716 (869) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH---------- T ss_conf 7789999999984689999988559976899999995989111798899998898856655568677898---------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 03666665666765478899999999987511101031088978899999986730789999999999999633721420 Q gi|254780127|r 461 QFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEGGSISA 540 (675) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~~~ 540 (675) ..+++..||+.+++.||.+||+|+.|...+.+.|+. +| T Consensus 717 ----------------------------------~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~---~g----- 754 (869) T PRK05797 717 ----------------------------------DLNISRKEAKEYIDNYFERYPNVKEYMEKIVEEAKE---KG----- 754 (869) T ss_pred ----------------------------------HCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH---CC----- T ss_conf ----------------------------------529899999999999999680499999999999997---89----- Q ss_pred CCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 13222344321244564112246311310245201233203100123577653355777674798999999999999967 Q gi|254780127|r 541 RRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILCEGMKNATKN 620 (675) Q Consensus 541 ~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~am~~l~~~ 620 (675) ++++..|||++.|++++.+.... ...+|+ ++|+|||||||||+|.||+++++. T Consensus 755 ------------------~v~Tl~gRrr~~p~i~s~n~~~r---~~~eR~------AiN~piQGtaADiiK~AMi~~~~~ 807 (869) T PRK05797 755 ------------------YVTTILNRRRYIPEIKSSNKIVR---AFGERL------AMNTPIQGSAADIIKLAMVNVYNK 807 (869) T ss_pred ------------------EEECCCCCEECCCCCCCCCHHHH---HHHHHH------HHCCCCCHHHHHHHHHHHHHHHHH T ss_conf ------------------07888887335887778898899---778887------827465567999999999999999 Q ss_pred ------CCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf ------98189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 621 ------GYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 621 ------g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) ..+|+||||||||||||+++++++.. |+++|+.+.+ ..+||.+|..+|+.|- T Consensus 808 l~~~~~~~~~~lqVHDElv~ev~~~~~~~~~~~v~~~Me~~~~--l~VPl~v~~~~g~nW~ 866 (869) T PRK05797 808 LKELNLKSKLILQVHDELILNVYKDELEEVKKIVKEEMENVIK--LKVPLDVDINVGNNWY 866 (869) T ss_pred HHHCCCCCEEEEEEEEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 9854998649999732668951688999999999999728425--7887473048278967 No 14 >PRK08434 consensus Probab=100.00 E-value=0 Score=839.03 Aligned_cols=532 Identities=18% Similarity=0.236 Sum_probs=434.0 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCC-----HH-HHHHHHHHCC-CCCEEEEEEHH Q ss_conf 78999804067887000460000268829999999838228999788842-----21-7999998648-89819986307 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQS-----AM-PSDLLQYLRD-ETVMCVAHNSL 74 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~-----~~-~~~l~~~L~~-~~~~kv~HNa~ 74 (675) +.+++|+||.|... ..+++.|+++|++.+..+|+++.+.. .+ +......+.. ....+|+||++ T Consensus 319 ~~~a~d~Ett~ld~----------~~a~lvGis~~~~~~~a~YiPi~h~~l~~~~~~~~~~~~~~l~~~~~~~~IghNlK 388 (887) T PRK08434 319 SIIAFDTETTGLDT----------KEAKLVGFSFCFNEEEAYYVPLAHSYLGVGEQISLESAKKAIEKIFNHFVIGHNLK 388 (887) T ss_pred CEEEEEEECCCCCC----------CCCCEEEEEEEECCCCEEEEECCCCCCCCHHHCCHHHHHHHHHHHHHCCCHHHHHH T ss_conf 76999951278661----------33430589999559858999752112577333599999999787633740364779 Q ss_pred HHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 7599999862998786504545889998588--999789999-8088986527663765766410023320134531125 Q gi|254780127|r 75 FERILFKKTLGIDIPSKRWICTSVLARTNGL--PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRAN 151 (675) Q Consensus 75 FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~--p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~ 151 (675) ||+.+|.+.+|+++.. .++||||+||++++ .++|+.+++ +|+.+...+++..+|.. +.. T Consensus 389 yDl~vL~~~~gi~l~g-~~fDTmLAaYLLdp~~r~~Ld~La~~yL~~~~i~~e~l~~kg~----------------~f~- 450 (887) T PRK08434 389 YDFKIIQNNFGLELPQ-KYADTMILAWLKDPSSRVGLDDLAKRLFNYEMIHFESVVKKGE----------------NFS- 450 (887) T ss_pred HHHHHHHHHCCCCCCC-CHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCHHHHHCCCC----------------CCC- T ss_conf 9999999856987777-3148999998669866666678999970878774899854677----------------702- Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCH-------------HHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHHH Q ss_conf 876899989887753999999998720021-------------4310178999987516843088999999999999999 Q gi|254780127|r 152 HVQAWQLFGEYCKRDVEATRELFKRLIPLS-------------DGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQERK 218 (675) Q Consensus 152 ~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~~-------------~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~~ 218 (675) ....+....|++.|++++++||..|.+.+ +.|+++..++.+||.+||.||.+.++++..++..+.. T Consensus 451 -~v~i~~a~~YAaedA~it~~L~~~l~~~L~~~~~~~L~~Ly~~IElPLi~VLa~ME~~GI~VD~~~L~~ls~el~~~i~ 529 (887) T PRK08434 451 -SVDLEKACKYAAEDAWITLRFYKKFLENLEKRGATHLLELAKNVEFPFIKTLLMMEENGIKLDTNFLEELKKKFEEEIK 529 (887) T ss_pred -CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH T ss_conf -0788999999999999999999999999986551779999999886599999999971950269999999999999999 Q ss_pred HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCH---HHHHHHHHH-CCCCCHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 9999998630576688885688999999972899988896---588999974-489854899999999975430003566 Q gi|254780127|r 219 KLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDMSE---GTLKSILSH-SNITQLAKDLILNRLASSGSAILKLNT 294 (675) Q Consensus 219 ~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~~k---~~~~~~l~~-~~~~~~~~~ll~~r~~~~~~~~~~~~~ 294 (675) ++++++++++|.+| |+|||+||+++||+++|+|..+.+| +|..++|+. .+.+|.+..+++||+..|+.+ +|+++ T Consensus 530 ~Le~~Iy~lAG~eF-NInSPKQLgeILFekLgLp~~KKTKTGySTd~evLe~L~~~hpi~~~iLeyR~l~KL~s-TYvd~ 607 (887) T PRK08434 530 ELTEEIYELCGEEF-NINSTKQLGAVLFEKLKLPSGKKTKTGYSTDEKVLNALLDKHPVIPKILEYRELYKLYS-TYIEP 607 (887) T ss_pred HHHHHHHHHCCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHH-HHHHH T ss_conf 99999998557664-77999999999898649988888889988809999998873873899998608999999-99988 Q ss_pred HHHHHC--CCCCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEECCC Q ss_conf 887305--688452200211322100111068885547777665234565443102444554443221013212147699 Q gi|254780127|r 295 LSEAVS--SDGRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIASN 372 (675) Q Consensus 295 l~~~~~--~dGRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~ 372 (675) |.+.+. .+||||++|+|.+ |.|||+||++||||||| ++++.|+.||+||+|++ T Consensus 608 L~~~~~~d~tgRIHttfnQ~~-t~TGRLSS~~PNLQNIP------------------------irt~~Gr~iR~aFv~~~ 662 (887) T PRK08434 608 LLKLALKDKNSRIYTSFLQTG-TATGRLSSKNPNLQNIP------------------------ARGELAKDIREAFVAKE 662 (887) T ss_pred HHHHCCCCCCCEECCHHHHHC-CCCEEECCCCCCCCCCC------------------------CCCCCCHHHHHHEEECC T ss_conf 877514788870310115313-22101236899756787------------------------77642155464434068 Q ss_pred CCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHHH Q ss_conf 87899973354850588662287678665404441000015776087511057022100642100133078877898878 Q gi|254780127|r 373 GKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKTMA 452 (675) Q Consensus 373 G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~~ 452 (675) ||+|++|||||||||||||+|+|+.|++||++|.|+|+.||+.||| +++++++|+.||++|||++||||+.+|.+ T Consensus 663 g~~l~~aDysQiElRilAh~s~d~~l~~af~~~~DiH~~tA~~vf~---~~~~~~~R~~aK~vnfgi~YG~~~~gLa~-- 737 (887) T PRK08434 663 GYSLISLDYSQIELRLLAHFSKDEALLEAFKNDEDIHARTAIKIFG---EENAKAKRAVAKSINFGLIYGMGSKKLSE-- 737 (887) T ss_pred CCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHC---CCCCHHHHHHHHHHHHHHHHCCCHHHHHH-- T ss_conf 8779970573899999999718899999984499848999999837---77888899897777756655468777998-- Q ss_pred HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 87088864036666656667654788999999999875111010310889788999999867307899999999999996 Q gi|254780127|r 453 SHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVV 532 (675) Q Consensus 453 ~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~ 532 (675) ..+++..||+.+++.|+..||+|+.|...+.+.|+. T Consensus 738 ------------------------------------------~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~-- 773 (887) T PRK08434 738 ------------------------------------------TLGISAKEAKEYIESYFASFPTIKNFLESIKNFAKE-- 773 (887) T ss_pred ------------------------------------------HCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH-- T ss_conf ------------------------------------------709899999999999999680199999999999997-- Q ss_pred HCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 33721420132223443212445641122463113102452012332031001235776533557776747989999999 Q gi|254780127|r 533 QEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILCE 612 (675) Q Consensus 533 ~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~ 612 (675) +| ++++..|||++. +..+.+. ...+..+|+ ++|+|||||||||+|. T Consensus 774 -~g-----------------------yv~Tl~gRrr~~-~~~~~~~---~~~~~~eR~------A~N~piQGtaADiik~ 819 (887) T PRK08434 774 -NG-----------------------YVETLLGRKRYF-DFENATP---MQKAMYERE------AVNTIFQGSAADLIKL 819 (887) T ss_pred -CC-----------------------CEECCCCCCCCC-CCCCCCH---HHHHHHHHH------HHCCCCCHHHHHHHHH T ss_conf -68-----------------------877888872068-8779898---899899999------8382336679999999 Q ss_pred HHHHHHH----CCCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 9999996----798189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 613 GMKNATK----NGYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 613 am~~l~~----~g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) ||+++++ .+++|+||||||||||||+++++++.. |+++|+.+.+ ..+||.+|..+|+.|. T Consensus 820 Ami~~~~~l~~~~~~~~lqvHDElv~e~~~~~~~~~~~~v~~~Me~a~~--L~VPL~v~~~~g~nW~ 884 (887) T PRK08434 820 AMNEINKLLNEEDAKLLLQIHDELIFEVKDELAEEFAKKFKDIMENIVK--LNVPLKTSLSIAKNWG 884 (887) T ss_pred HHHHHHHHHHHCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 9999999986449869997542678962686999999999999728501--6787686428278946 No 15 >PRK07300 consensus Probab=100.00 E-value=0 Score=818.76 Aligned_cols=527 Identities=18% Similarity=0.188 Sum_probs=422.7 Q ss_pred EEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHC Q ss_conf 99804067887000460000268829999999838228999788842217999998648898199863077599999862 Q gi|254780127|r 5 FIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFKKTL 84 (675) Q Consensus 5 ~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~l 84 (675) .+++||.++ ... ...+++|++ ..++..++...........+...+.. .+..||++||..+|.++ T Consensus 323 ~~~~et~~~--------~~~--~a~lvG~s~--~~~~~~y~~~~~~~l~~~~~~~~~~~---~i~~~nlK~~~~vL~~~- 386 (880) T PRK07300 323 FFYFETLGD--------NYH--REAIIGFAW--GNGEQIYASTDLSLLATDSFKQVLQK---PIATYDFKRSKVLLSHL- 386 (880) T ss_pred EEEEEECCC--------CCC--CCCEEEEEE--ECCCEEEECCCHHHHHCHHHHHHHHH---HHHHHHHHHHHHHHHHC- T ss_conf 257653379--------813--077799998--55975895581332117688887653---56550378689999977- Q ss_pred CCCCCCCCEEEHHHHHHHHCC--CCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHH Q ss_conf 998786504545889998588--9997899998-0889865276637657664100233201345311258768999898 Q gi|254780127|r 85 GIDIPSKRWICTSVLARTNGL--PSSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGE 161 (675) Q Consensus 85 Gi~i~~~~~~DTmi~A~~l~~--p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 161 (675) |+++.. ..+||||++|++++ .++|+.+++. ++......++..|+.-. ...++ .+.... T Consensus 387 GI~l~~-~~fDTmLAsYLL~p~~~~~Ld~La~~yl~~~~~~~e~l~Gkg~k-----------------~~~~~-~~~~~~ 447 (880) T PRK07300 387 GIELPA-PSFDARLAKYLLSTVEDNELSTIARLYTDISLETDDTVYGKGAK-----------------RAIPE-KEVLLE 447 (880) T ss_pred CCCCCC-CHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCC-----------------CCCCC-HHHHHH T ss_conf 986677-40348999853687655877899999468888733766366521-----------------03662-779999 Q ss_pred HHHHHHHHHHHHHHHHCCC----------HHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8775399999999872002----------143101789999875168430889999999999999999999998630576 Q gi|254780127|r 162 YCKRDVEATRELFKRLIPL----------SDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQERKKLDEDIVKLTDGI 231 (675) Q Consensus 162 Ya~~Dv~~~~~L~~~L~~~----------~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~~~~~~~i~~~~g~~ 231 (675) |++.|++++++++..|.+. .+.|+++..++.+||.+||.||.+.++++..++.++..++++++++++|.+ T Consensus 448 yaa~da~~~~~L~~~L~~~L~~~~l~~L~~~IE~PLi~VLa~ME~~GI~VD~~~L~~ls~el~~~l~~Le~~Iy~lAG~e 527 (880) T PRK07300 448 HLARKVKVLLDSKEQMLDKLTAHEQLDLLFDMELPLANVLAKMEIAGIKVNRETLQEMAAENEVVIEALTQEIYDLAGQE 527 (880) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 99999999999999989998761699999999868999999999658088899999999999999999999999865876 Q ss_pred CCCCCCHHHHHHHHHHHCCCCCC--C---CCHHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCC Q ss_conf 68888568899999997289998--8---896588999974-48985489999999997543000356688730568845 Q gi|254780127|r 232 IRSSRQTYALRMYLFLITGIDLV--D---MSEGTLKSILSH-SNITQLAKDLILNRLASSGSAILKLNTLSEAVSSDGRL 305 (675) Q Consensus 232 ~~n~~S~~ql~~~L~~~~g~~~~--~---~~k~~~~~~l~~-~~~~~~~~~ll~~r~~~~~~~~~~~~~l~~~~~~dGRi 305 (675) | |+|||+||+++||+++|+|+. + |+-+|..++|+. .+.+|.+..+++||+..|+.|| |+++|.+.+.+|||| T Consensus 528 F-NInSPKQLgeILFekLgLP~~~~KKTKTGySTd~evLe~L~~~hpi~~~ILeyR~l~KLkST-Y~d~L~~~i~~~gRI 605 (880) T PRK07300 528 F-NINSPKQLGVILFEKMGLPLEMTKKTKTGYSTAVDVLERLAPIAPIVAKILEYRQITKLQST-YVIGLQDYILADGKI 605 (880) T ss_pred C-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH-HHHHHHHHCCCCCCC T ss_conf 5-77999999999998449987677778888785799999976618748999998489999999-899899854754612 Q ss_pred CCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEECC-CCCEEEEEECCCC Q ss_conf 220021132210011106888554777766523456544310244455444322101321214769-9878999733548 Q gi|254780127|r 306 RGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIAS-NGKKLVVADLAGI 384 (675) Q Consensus 306 h~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~-~G~~lv~aDySqi 384 (675) |++|+|.+ |.|||+||++||||||| ++++.|+.||+||+|. +|++|++|||||| T Consensus 606 HTsf~Qt~-TaTGRLSS~~PNLQNIP------------------------iRte~Gr~IR~aFi~~~~~~~llsaDYSQI 660 (880) T PRK07300 606 HTRYVQDL-TQTGRLSSVDPNLQNIP------------------------VRLEQGRLIRKAFVPSHEDAVLLSSDYSQI 660 (880) T ss_pred CCHHHHHH-CCCCEECCCCCCCCCCC------------------------CCCCCCHHHHHEEECCCCCEEEEEECHHHH T ss_conf 21356423-13342114787634688------------------------777435204412732788559998232268 Q ss_pred CEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCC-CCCCCHHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCCCCCHHH Q ss_conf 505886622876786654044410000157760875-1105702210064210013307887789887887088864036 Q gi|254780127|r 385 EARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTP-IDKVSKEQRAIGKVMELALGYQGGAKVFKTMASHCGLDLQQFS 463 (675) Q Consensus 385 ElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~-~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~ 463 (675) |||||||+|+|+.|++||++|.|||+.||+.|||+| .++||+++|+.||++|||++||||+.+|.+ T Consensus 661 ELRilAhlS~D~~Li~AF~~~~DIH~~TA~~if~v~~~~~Vt~~~Rr~AK~iNFGiiYG~s~fgLs~------------- 727 (880) T PRK07300 661 ELRVLAHISGDEHLIAAFKEGADIHTSTAMRVFGIEKAEDVTANDRRNAKAVNFGIVYGISDFGLSN------------- 727 (880) T ss_pred HHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH------------- T ss_conf 9999999818888999871588778999999949973200899999998631211112788677888------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 66665666765478899999999987511101031088978899999986730789999999999999633721420132 Q gi|254780127|r 464 QNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEGGSISARRV 543 (675) Q Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 543 (675) ..+++..||+.+++.||..||+|+.|...+.+.|+. +| T Consensus 728 -------------------------------~l~i~~~eA~~~I~~Yf~~yp~v~~~~~~~~~~a~~---~G-------- 765 (880) T PRK07300 728 -------------------------------NLGITRKQAKSYIDTYFERYPGIKNYMENVVREAKD---KG-------- 765 (880) T ss_pred -------------------------------HCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH---CC-------- T ss_conf -------------------------------649999999999999999691099999999999997---68-------- Q ss_pred CCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Q ss_conf 2234432124456411224631131024520123320310012357765335577767479899999999999996---- Q gi|254780127|r 544 ANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILCEGMKNATK---- 619 (675) Q Consensus 544 ~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~am~~l~~---- 619 (675) ++++..|||++.|++++.+.... ...+|+ ++|+|||||||||+|.||+++++ T Consensus 766 ---------------yv~Tl~gRrr~ip~i~s~n~~~r---~~~eR~------AiN~piQGtAADiiK~AMi~~~~~l~~ 821 (880) T PRK07300 766 ---------------YVETLFKRRRELPDINSRNFNVR---SFAERT------AINSPIQGSAADILKIAMINLDKALQA 821 (880) T ss_pred ---------------CEECCCCCEECCCCCCCCCHHHH---HHHHHH------HHCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf ---------------87777788135898678887788---799898------816024178999999999999999984 Q ss_pred C--CCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 7--98189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 620 N--GYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 620 ~--g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) . ..+|+||||||||||||+++++++.. |+++|+.+.+ ..+||.+|..+|+.|- T Consensus 822 ~~~~~~mlLQVHDELvfEv~~~~~~~~~~~v~~~Me~~~~--l~VPL~v~~~~G~nW~ 877 (880) T PRK07300 822 GGFKSKMLLQVHDEIVLEVPNDELTAIKKLVKETMEAAIE--LAVPLIADENTGQTWY 877 (880) T ss_pred CCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCC--CCCCEEEECCCCCCHH T ss_conf 5988518999740458851788999999999999837524--7888373018488967 No 16 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=100.00 E-value=0 Score=822.04 Aligned_cols=502 Identities=22% Similarity=0.245 Sum_probs=396.2 Q ss_pred CCCEEEEEEHHHHHHHHHHHCCCCCC-CCCEEEHHHHHHHHCCCC---CHHHHHHHHCCCCCCH----HHHHCHHHHHHH Q ss_conf 89819986307759999986299878-650454588999858899---9789999808898652----766376576641 Q gi|254780127|r 64 ETVMCVAHNSLFERILFKKTLGIDIP-SKRWICTSVLARTNGLPS---SLKNACLALNFSEHLT----KMEEGKALIARF 135 (675) Q Consensus 64 ~~~~kv~HNa~FD~~vL~~~lGi~i~-~~~~~DTmi~A~~l~~p~---sL~~la~~lgl~~~~~----kd~~gk~li~~~ 135 (675) ...++.+||+||+..+|++. |++.. ....+||||+||++++.. .|+.++..+....... +....++....+ T Consensus 464 a~~~~~~~~~K~~~~~L~~~-g~~~~~~~~~~D~~laaYll~~~~~~~~l~~~~~~y~~~~~~~d~~~~~~~~~~~~~~a 542 (1005) T TIGR00593 464 ASKVKVAHDAKFLMHLLKRK-GIELIEIGVIFDTMLAAYLLDPAQVSKDLDTLAKRYLVEELILDDDRKEEGIKKLAKFA 542 (1005) T ss_pred CCCCCCHHHHHHHHHHHHHC-CCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 41012288999999999743-77334421145489999984303564027899987134346778731345544334441 Q ss_pred HHHHHHHCCCCCCCC-CCHHH-HHHHHHHHHHHHHHHHHHHHHHCC----------CHHHHHHHHHHHHHHHHCCEECCH Q ss_conf 002332013453112-58768-999898877539999999987200----------214310178999987516843088 Q gi|254780127|r 136 CKGSIDSPPYDCTRA-NHVQA-WQLFGEYCKRDVEATRELFKRLIP----------LSDGERDLWLLDQTINDRGYRIDL 203 (675) Q Consensus 136 ~~~~~~~~~~~~~~~-~~~~~-~~~~~~Ya~~Dv~~~~~L~~~L~~----------~~~~E~~l~~~~~~me~~Gi~vD~ 203 (675) ............... ..... .+...++.+.|+++|.+||+-|.= +-+.|+++..++.+||.+||.||. T Consensus 543 ~~~~~~~~~~~~~~~~~~~~L~~~l~~~L~~eDaDitfrlyhylklrle~~~~~~L~~eiE~Pl~~VL~~ME~~Gi~~D~ 622 (1005) T TIGR00593 543 DPDLEEAIEYLARKAAATARLAEELLKELDAEDADITFRLYHYLKLRLEENKLLSLYEEIELPLSKVLAEMEKTGIKVDK 622 (1005) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 12177889999999999999999988873113220466777788865163889999988420078999998776678708 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCC---C-HHHHHHHHHH-CCCCCH-HHHH Q ss_conf 99999999999999999999986305766888856889999999728999888---9-6588999974-489854-8999 Q gi|254780127|r 204 DLVLKLQELIAQERKKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDM---S-EGTLKSILSH-SNITQL-AKDL 277 (675) Q Consensus 204 e~l~~~~~~~~~~~~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~---~-k~~~~~~l~~-~~~~~~-~~~l 277 (675) ++++++..++..++..++.+|++++|.+| |+||||||..+||+++++|..++ + -||..++|+. .+.+|. +..+ T Consensus 623 ~~L~~L~~e~~~~l~~lE~~Iy~~aG~~F-NinSPKQL~~vLF~KL~Lp~~kK~KtG~YSTd~~vLe~L~~~h~~~i~~~ 701 (1005) T TIGR00593 623 DYLQELSKEFGEELLDLEEEIYELAGEEF-NINSPKQLGEVLFEKLGLPVGKKTKTGSYSTDADVLEKLAEKHPIHIALI 701 (1005) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCHHHHHHHHHHHHCCCCCCCCCCCCCEECHHHHHHHHHCCCCCHHHH T ss_conf 99999999999999999999998749520-68873355588699847882003467862133889998874288836899 Q ss_pred H-HHHHHHHHHCCHHHHHHHHHHCCC-CCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCC Q ss_conf 9-999997543000356688730568-84522002113221001110688855477776652345654431024445544 Q gi|254780127|r 278 I-LNRLASSGSAILKLNTLSEAVSSD-GRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIAD 355 (675) Q Consensus 278 l-~~r~~~~~~~~~~~~~l~~~~~~d-GRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~ 355 (675) | +||+..|+.+| |++.|++.++++ ||||++|+|.+ |.|||+||++||||||| T Consensus 702 Ll~yR~L~KLksT-Y~d~Ll~~v~~~TgRiHT~F~Qt~-TaTGRLSS~~PNLQNIP------------------------ 755 (1005) T TIGR00593 702 LLEYRQLTKLKST-YVDGLLELVNPDTGRIHTTFNQTG-TATGRLSSSNPNLQNIP------------------------ 755 (1005) T ss_pred HHHHHHHHHHHHC-CHHHHHHHHHHCCCCEEHHHHHHH-HHCCCCCCCCCCCCCCC------------------------ T ss_conf 9876666543101-201338984000282731314545-63562004788854433------------------------ Q ss_pred CCCCCCHHHHEEEECC-CCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCC Q ss_conf 4322101321214769-987899973354850588662287678665404441000015776087511057022100642 Q gi|254780127|r 356 PLGLASDCVRSCIIAS-NGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKV 434 (675) Q Consensus 356 ~~~~~~~~iR~~f~a~-~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~ 434 (675) ++.+.|+.||++|+|. .||.||+|||||||||||||+|+|+.||+||.+|.|||+.||+.+||++.++||+++|+.||+ T Consensus 756 iR~~~Gr~IR~AFva~~~g~~L~~ADYSQIELRiLAHlS~De~Li~AF~~~~DIH~~TA~~~Fgv~~~~vtp~~RR~AK~ 835 (1005) T TIGR00593 756 IRSEEGRKIRKAFVAESKGWLLISADYSQIELRILAHLSQDENLIEAFKNGEDIHTQTASRLFGVEEEDVTPNMRRIAKA 835 (1005) T ss_pred CCCCCHHHHHHCCCCCCCCCEEEECCCCHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHCCCCE T ss_conf 22720013322002468860688526337999999864078679999754887138899997298977449646047414 Q ss_pred CHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHC Q ss_conf 10013307887789887887088864036666656667654788999999999875111010310889788999999867 Q gi|254780127|r 435 MELALGYQGGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKH 514 (675) Q Consensus 435 ~~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~ 514 (675) +|||++||||+.+|.+ ..+||..||+.+|+.||++| T Consensus 836 vNFGivYGmSafgLA~--------------------------------------------~L~Is~~EA~~fIe~YF~~y 871 (1005) T TIGR00593 836 VNFGIVYGMSAFGLAR--------------------------------------------ELGISRKEAKEFIERYFERY 871 (1005) T ss_pred EEEHHHHCCCHHHHHH--------------------------------------------HHCCCHHHHHHHHHHHHHHC T ss_conf 4301321012057777--------------------------------------------50888889999999997634 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHH Q ss_conf 30789999999999999633721420132223443212445641122463113102452012332031001235776533 Q gi|254780127|r 515 QGVLQLWKDLTEGFACVVQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYG 594 (675) Q Consensus 515 p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g 594 (675) |+|+.|..++-+ .+.++|+ +.+..||||+.|++.+.+...++ ..+|.+ T Consensus 872 Pgv~~Y~~~~~~---~ar~~GY-----------------------V~Tl~gRRRY~~~i~s~n~~~R~---~AER~A--- 919 (1005) T TIGR00593 872 PGVKEYLENTVE---EARKKGY-----------------------VTTLLGRRRYIPDINSRNRNLRE---AAERMA--- 919 (1005) T ss_pred CHHHHHHHHHHH---HHHHCCC-----------------------EEEECCCEECCCCCCHHHHHHHH---HHHHHH--- T ss_conf 138999999999---9973685-----------------------56304656115763223577799---999987--- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH----CC--CCEEEEECCEEEEECCCCHHHHHHHH-HHHHC-CCCCCCCCCCCEEE Q ss_conf 5577767479899999999999996----79--81899953589996784789999999-99846-76302169841573 Q gi|254780127|r 595 GKLTENIVQAISRDILCEGMKNATK----NG--YDIVLTVHDEIVSETPDTPYFSVGTL-CSLMT-KNPSWAKGLPLKAE 666 (675) Q Consensus 595 ~k~~n~~iQgtAaDi~k~am~~l~~----~g--~~ivl~VHDElv~evp~~~~~~~~~i-~~~M~-~~p~wa~~~pl~a~ 666 (675) .|.|||||||||||.||+.|++ .+ .+|+||||||||||+|.+++++|+.| ++.|+ ++-. ..+||.++ T Consensus 920 ---~NaPIQGSAADI~K~AMikl~~~L~~~~L~~rlLLQVHDELl~E~P~~E~e~v~~l~k~~MED~A~~--L~VPL~v~ 994 (1005) T TIGR00593 920 ---INAPIQGSAADIMKIAMIKLDKALKERKLKSRLLLQVHDELLFEAPEEEAEEVKALVKEVMEDEAYT--LSVPLEVE 994 (1005) T ss_pred ---HCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEE--CCCCCEEE T ss_conf ---3675346878999999999999997325033346786558872344413789999999871461010--25763675 Q ss_pred CCCCCCCC Q ss_conf 03114631 Q gi|254780127|r 667 GYEAKRYR 674 (675) Q Consensus 667 ~~~~~~y~ 674 (675) -..|..|. T Consensus 995 ~~~G~~W~ 1002 (1005) T TIGR00593 995 VETGKNWG 1002 (1005) T ss_pred CCCCCCHH T ss_conf 24587711 No 17 >COG0749 PolA DNA polymerase I - 3'-5' exonuclease and polymerase domains [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=767.94 Aligned_cols=534 Identities=21% Similarity=0.231 Sum_probs=435.4 Q ss_pred EEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCC---CHHHHHHHHHHCCCCCEEEEEEHHHHHHHH Q ss_conf 99980406788700046000026882999999983822899978884---221799999864889819986307759999 Q gi|254780127|r 4 LFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQ---SAMPSDLLQYLRDETVMCVAHNSLFERILF 80 (675) Q Consensus 4 l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~---~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL 80 (675) +++|+||.+. +.. +..+.++++++.. .+.++++.++ -.....+++.|++++..||+||++||..+| T Consensus 25 ~a~~~et~~l--------~~~--~~~lvg~s~~~~~-~~~yi~~~~~~~~~~~~~~l~~~l~~~~~~kv~~~~K~d~~~l 93 (593) T COG0749 25 IAFDTETDGL--------DPH--GADLVGLSVASEE-EAAYIPLLHGPEQLNVLAALKPLLEDEGIKKVGQNLKYDYKVL 93 (593) T ss_pred CCEECCCCCC--------CCC--CCCEEEEEEECCC-CCEEEEECCCHHHHHHHHHHHHHHHCCCCCHHCCCCCHHHHHH T ss_conf 1010001456--------755--5874678862365-4236751366555514899999863754103213413669999 Q ss_pred HHHCCCCCCCCCEEEHHHHHHHHCCC---CCHHHHHHHH-CCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHH Q ss_conf 98629987865045458899985889---9978999980-8898652766376576641002332013453112587689 Q gi|254780127|r 81 KKTLGIDIPSKRWICTSVLARTNGLP---SSLKNACLAL-NFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAW 156 (675) Q Consensus 81 ~~~lGi~i~~~~~~DTmi~A~~l~~p---~sL~~la~~l-gl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~ 156 (675) .++ |+. + ...+||||++|++++. ++++.+++++ ++....+++..||.... ........ T Consensus 94 ~~~-Gi~-~-~~~~DtmlasYll~~~~~~~~~~~l~~r~l~~~~~~~~~i~~kg~~~---------------~~~~~~~~ 155 (593) T COG0749 94 ANL-GIE-P-GVAFDTMLASYLLNPGAGAHNLDDLAKRYLGLETITFEDIAGKGKKQ---------------LTFADVKL 155 (593) T ss_pred HHC-CCC-C-CCHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHCCCCCC---------------CCCCCCHH T ss_conf 973-976-5-65178999996148676757778999982587740248763254456---------------76100147 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCC-----------CHHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99898877539999999987200-----------2143101789999875168430889999999999999999999998 Q gi|254780127|r 157 QLFGEYCKRDVEATRELFKRLIP-----------LSDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQERKKLDEDIV 225 (675) Q Consensus 157 ~~~~~Ya~~Dv~~~~~L~~~L~~-----------~~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~~~~~~~i~ 225 (675) +....|++.|++++.+|+..|.+ ..+.|+++..++.+||.+||.||.+.+..+..++..+..+++++++ T Consensus 156 ~~~~~y~a~~a~~~~~L~~~l~~~l~~~~~L~~l~~~iE~Pl~~VLa~ME~~Gi~vD~~~L~~l~~el~~~l~~le~eiy 235 (593) T COG0749 156 EKATEYAAEDADATLRLESILEPELLKTPVLLELYEEIEMPLVRVLARMERNGIKVDVQYLKELSKELGCELAELEEEIY 235 (593) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCEECHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78888788888999999999998763104699999997540899999868628564299999999999999999999999 Q ss_pred HHCCCCCCCCCCHHHHHHHHHHHCCCCC--CCCC---HHHHHHHHHHCC-CCCHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 6305766888856889999999728999--8889---658899997448-985489999999997543000356688730 Q gi|254780127|r 226 KLTDGIIRSSRQTYALRMYLFLITGIDL--VDMS---EGTLKSILSHSN-ITQLAKDLILNRLASSGSAILKLNTLSEAV 299 (675) Q Consensus 226 ~~~g~~~~n~~S~~ql~~~L~~~~g~~~--~~~~---k~~~~~~l~~~~-~~~~~~~ll~~r~~~~~~~~~~~~~l~~~~ 299 (675) ++.|.++ |++||+||+.+||+++|+|+ ++|. -+|..++++... .+|.++.++++|+..|+.+ +|.+.+.+.+ T Consensus 236 ~laG~~F-Ni~SPKQL~~ILfeKl~Lp~~~kKtktG~yST~~~vLe~L~~~h~i~~~iL~~Rql~KLks-TY~d~L~~~i 313 (593) T COG0749 236 ELAGEEF-NINSPKQLGEILFEKLGLPPGLKKTKTGNYSTDAEVLEKLADDHPLPKLILEYRQLAKLKS-TYTDGLPKLI 313 (593) T ss_pred HHHCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHH-HHHHCCHHHH T ss_conf 9826857-8997789999988851898555545789976279999986524873899999987888877-7640228751 Q ss_pred CCC-CCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEECCCCCEEEE Q ss_conf 568-8452200211322100111068885547777665234565443102444554443221013212147699878999 Q gi|254780127|r 300 SSD-GRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIASNGKKLVV 378 (675) Q Consensus 300 ~~d-GRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G~~lv~ 378 (675) +|+ ||||++|+|.+ |.|||+||++||||||| ++...|..||++|+|++||.+++ T Consensus 314 ~~~t~RIHTsf~Q~~-t~TGRLSSsdPNLQNIP------------------------iRse~Gr~IR~aFva~~g~~~i~ 368 (593) T COG0749 314 NPDTGRIHTSFNQTG-TATGRLSSSDPNLQNIP------------------------IRSEEGRKIRKAFVAEKGYTLIS 368 (593) T ss_pred CCCCCCCCCCHHHHH-HHHHCCCCCCCCCCCCC------------------------CCCHHHHHHHHCEECCCCCEEEE T ss_conf 888993276168777-77503357999755677------------------------67776786664055799986887 Q ss_pred EECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCCC Q ss_conf 73354850588662287678665404441000015776087511057022100642100133078877898878870888 Q gi|254780127|r 379 ADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKTMASHCGLD 458 (675) Q Consensus 379 aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~~ 458 (675) |||||||||||||+|+|+.|++||.+|.|+|+.||+.+||+|+++|++++|+.||++|||++|||++.+|.+ + T Consensus 369 aDYSQIELRilAHls~D~~Ll~AF~~g~DiH~~TA~~vFgv~~~~Vt~e~Rr~AKaINFGiiYG~safgLa~---~---- 441 (593) T COG0749 369 ADYSQIELRILAHLSQDEGLLRAFTEGEDIHTATAAEVFGVPIEEVTSEQRRKAKAINFGLIYGMSAFGLAQ---Q---- 441 (593) T ss_pred ECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHCCCEEECCCHHHHHH---H---- T ss_conf 015799999999861889999988658637789999994998233899987565530354143451334798---7---- Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 64036666656667654788999999999875111010310889788999999867307899999999999996337214 Q gi|254780127|r 459 LQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEGGSI 538 (675) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~ 538 (675) .+++..||+.+++.||+.||+|..|..++.+.++. +| T Consensus 442 -------------------------------------L~I~~~eA~~~I~~YF~rypgv~~ym~~~~~~ar~---~G--- 478 (593) T COG0749 442 -------------------------------------LGIPRKEAKEYIDRYFERYPGVKEYMERTKEEARE---DG--- 478 (593) T ss_pred -------------------------------------CCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH---CC--- T ss_conf -------------------------------------49886888999999998686899999999999997---49--- Q ss_pred EECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 20132223443212445641122463113102452012332031001235776533557776747989999999999999 Q gi|254780127|r 539 SARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILCEGMKNAT 618 (675) Q Consensus 539 ~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~am~~l~ 618 (675) ++++.+|||+++|+.++.+... +...+|. .+|.|||||||||+|.||+.++ T Consensus 479 --------------------yV~Tl~gRRry~p~i~s~n~~~---R~~aER~------AiNaPIQGTAADiiK~AMI~vd 529 (593) T COG0749 479 --------------------YVETLFGRRRYLPDINSSNRVV---RAAAERA------AINAPIQGTAADIIKLAMIKVD 529 (593) T ss_pred --------------------CEEECCCCCCCCCCCCCCCHHH---HHHHHHH------HHCCCCCCCHHHHHHHHHHHHH T ss_conf --------------------3541026602486210177888---8788898------8357674448999999998679 Q ss_pred H----C--CCCEEEEECCEEEEECCCCHHHHHHH-HHHHHCCCCCCCCCCCCEEECCCCCCCC Q ss_conf 6----7--98189995358999678478999999-9998467630216984157303114631 Q gi|254780127|r 619 K----N--GYDIVLTVHDEIVSETPDTPYFSVGT-LCSLMTKNPSWAKGLPLKAEGYEAKRYR 674 (675) Q Consensus 619 ~----~--g~~ivl~VHDElv~evp~~~~~~~~~-i~~~M~~~p~wa~~~pl~a~~~~~~~y~ 674 (675) + . ..+++||||||||||||+++.+++++ |.+.|+++-. ..+||.+|-.+|+.|- T Consensus 530 ~~l~~~~~~~rllLQVHDELvfEv~~~e~e~~~~~v~~~Me~a~~--L~VPL~vdv~~g~nW~ 590 (593) T COG0749 530 KALKEEKLKARLLLQVHDELVFEVPKEELEEVKKLLKAIMENAVN--LSVPLEVDVGIGKNWD 590 (593) T ss_pred HHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHC--CCCCEEEECCCCCCHH T ss_conf 998634321456775545564117686899999999999987650--4786698617776834 No 18 >cd06444 DNA_pol_A DNA polymerase family A, 5'-3' polymerase domain. Family A polymerase (polymerase I) functions primarily to fill DNA gaps that arise during DNA repair, recombination and replication. DNA-dependent DNA polymerases can be classified into six main groups based upon phylogenetic relationships with E. coli polymerase I (classA), E. coli polymerase II (class B), E.coli polymerase III (class C), euryarchaeota polymerase II (class D), human polymerase beta (class X), E. coli UmuC/DinB and eukaryotic RAP 30/Xeroderma pigmentosum variant (class Y). Family A polymerases are found primarily in organisms related to prokaryotes and include prokaryotic DNA polymerase I, mitochondrial polymerase gamma, and several bacteriophage polymerases including those from odd-numbered phage (T3, T5, and T7). Prokaryotic Pol I has two functional domains located on the same polypeptide; a 5'-3' polymerase and a 5'-3' exonuclease. Pol I uses its 5' nuclease activity to remove the ribonucleotide po Probab=100.00 E-value=0 Score=618.02 Aligned_cols=366 Identities=22% Similarity=0.287 Sum_probs=315.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCH---HHHHHHHHH-CCCCCHHHH Q ss_conf 0889999999999999999999998630576688885688999999972899988896---588999974-489854899 Q gi|254780127|r 201 IDLDLVLKLQELIAQERKKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDMSE---GTLKSILSH-SNITQLAKD 276 (675) Q Consensus 201 vD~e~l~~~~~~~~~~~~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~~k---~~~~~~l~~-~~~~~~~~~ 276 (675) ||.+.++++...+..++.+++++++.++|.++ |++||+|++++||+.+|+|++++++ +|.+..+.. ...+|.++. T Consensus 1 id~~~~~~~~~~l~~~l~~l~~~~~~~~g~~f-n~~S~~ql~~~L~~~lg~~~~~~~~~~~st~~~~l~~~~~~hp~~~~ 79 (379) T cd06444 1 VDKEELEELSAELEKELEELEKEIYELAGEEF-NINSPKQLGEVLFEKLGLPVSKKTKTGYSTDEEVLEKLAEAHPIIAL 79 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHH T ss_conf 98899999999999999999999999969907-99999999999998549998988889999859999976433848999 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHC-CCCCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCC Q ss_conf 999999975430003566887305-6884522002113221001110688855477776652345654431024445544 Q gi|254780127|r 277 LILNRLASSGSAILKLNTLSEAVS-SDGRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGETIAD 355 (675) Q Consensus 277 ll~~r~~~~~~~~~~~~~l~~~~~-~dGRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~ 355 (675) ++++|+..+..+ ++++.|.+.++ .|||||++|+++| |+||||||++|||||+|+... T Consensus 80 ile~R~~~kl~s-t~~~~~~~~~~~~~gRih~~~~q~g-t~TGR~Ss~~PNlQniP~~~~-------------------- 137 (379) T cd06444 80 ILEYRKLAKLKS-TYVDALPKLINPKTGRIHTSFNQTV-TATGRLSSSNPNLQNIPIRTE-------------------- 137 (379) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHCCCCCEEEEEEECCC-CCCCCCCCCCCCCCCCCCCCH-------------------- T ss_conf 999999999999-9999998760689997966885277-540012467876457888871-------------------- Q ss_pred CCCCCCHHHHEEEECCCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCC Q ss_conf 43221013212147699878999733548505886622876786654044410000157760875110570221006421 Q gi|254780127|r 356 PLGLASDCVRSCIIASNGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVM 435 (675) Q Consensus 356 ~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~ 435 (675) .+..+|+||+|+|||+||++||||||+||+||+|+|+.|+++|++|.|+|+.||+.+||+|+++|++++|+.||++ T Consensus 138 ----~~~~~R~~f~~~~G~~lv~aDySqiElRvlA~ls~D~~l~~~f~~g~Dih~~~A~~~~~~~~~~v~~~~R~~aK~~ 213 (379) T cd06444 138 ----EGREIRKAFVAPPGYVLLSADYSQIELRILAHLSGDPALIEAFNNGEDIHSRTAAEVFGVPEEEVTKEERRKAKAI 213 (379) T ss_pred ----HHHHHHHEEECCCCCEEEEECHHHCHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHH T ss_conf ----1676752474799987998423112188888871769999998738878999999983998443598899886576 Q ss_pred HHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCC Q ss_conf 00133078877898878870888640366666566676547889999999998751110103108897889999998673 Q gi|254780127|r 436 ELALGYQGGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQ 515 (675) Q Consensus 436 ~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p 515 (675) +||++||||+.+|.. ..+++.+||+.+++.|+++|| T Consensus 214 ~fgl~YG~G~~~la~--------------------------------------------~l~~s~~eA~~~~~~f~~~~p 249 (379) T cd06444 214 NFGIIYGMGAFGLAE--------------------------------------------QLGISREEAKEFIDRYFATYP 249 (379) T ss_pred HHHHHHCCCHHHHHH--------------------------------------------CCCCCHHHHHHHHHHHHHHCC T ss_conf 757532834532975--------------------------------------------019999999999999999786 Q ss_pred CHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHH Q ss_conf 07899999999999996337214201322234432124456411224631131024520123320310012357765335 Q gi|254780127|r 516 GVLQLWKDLTEGFACVVQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGG 595 (675) Q Consensus 516 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~ 595 (675) +|..|+..+.+.+.. .+ ++++++||++++|+......... ....| T Consensus 250 ~v~~~~~~~~~~a~~---~g-----------------------~v~T~~GR~r~~~~~~~~~~~~~---~~~~r------ 294 (379) T cd06444 250 GVKEYLEKTVEEARE---GG-----------------------YVETLFGRRRYLPEINSPNKNVR---SAAER------ 294 (379) T ss_pred CHHHHHHHHHHHHHH---CC-----------------------CEECCCCCEECCCCCCCCCHHHH---HHHHH------ T ss_conf 566789999999997---79-----------------------77667788534775567888898---78766------ Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHC------CCCEEEEECCEEEEECCCCHHHHHH-HHHHHHCCCCCCCCCCCCEEECC Q ss_conf 5777674798999999999999967------9818999535899967847899999-99998467630216984157303 Q gi|254780127|r 596 KLTENIVQAISRDILCEGMKNATKN------GYDIVLTVHDEIVSETPDTPYFSVG-TLCSLMTKNPSWAKGLPLKAEGY 668 (675) Q Consensus 596 k~~n~~iQgtAaDi~k~am~~l~~~------g~~ivl~VHDElv~evp~~~~~~~~-~i~~~M~~~p~wa~~~pl~a~~~ 668 (675) ..+|++||||||||+|.||+++++. ++++|+|||||||+|||+++++++. .+++||+++.++ .+||.+|.. T Consensus 295 ~a~N~~iQGsaAdi~k~ami~i~~~l~~~~~~~~lv~~VHDEiv~ev~~~~~~~~~~~i~~~Me~a~~l--~VPl~v~~~ 372 (379) T cd06444 295 QAINTPIQGSAADIIKLAMIKIEKLLKEKGLDARLLLQVHDELIFEVPEEELEEAAALVKENMENAVKL--SVPLKVDIK 372 (379) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCCCC--CCCEEEECC T ss_conf 567787465899999999999999999629997899972542489987999999999999998563577--883786637 Q ss_pred CCCCCC Q ss_conf 114631 Q gi|254780127|r 669 EAKRYR 674 (675) Q Consensus 669 ~~~~y~ 674 (675) +|++|. T Consensus 373 iG~~Wg 378 (379) T cd06444 373 IGKNWG 378 (379) T ss_pred CCCCCC T ss_conf 269999 No 19 >pfam00476 DNA_pol_A DNA polymerase family A. Probab=100.00 E-value=0 Score=620.30 Aligned_cols=366 Identities=23% Similarity=0.273 Sum_probs=313.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCC-----CHHHHHHHHHHC--CCCCH Q ss_conf 08899999999999999999999986305766888856889999999728999888-----965889999744--89854 Q gi|254780127|r 201 IDLDLVLKLQELIAQERKKLDEDIVKLTDGIIRSSRQTYALRMYLFLITGIDLVDM-----SEGTLKSILSHS--NITQL 273 (675) Q Consensus 201 vD~e~l~~~~~~~~~~~~~~~~~i~~~~g~~~~n~~S~~ql~~~L~~~~g~~~~~~-----~k~~~~~~l~~~--~~~~~ 273 (675) ||.+.++++..++..+..+++.+++.+.|.++ |++||+|++++||+.+|+|+... ..+|.++.|+.. +.+|. T Consensus 1 vd~~~l~~~~~~l~~~~~~l~~~i~~~aG~~f-N~~S~~ql~~vLf~~l~l~~~~~t~~~g~~st~~~~L~~l~~~~hp~ 79 (383) T pfam00476 1 VDVEYLEELSNELGAQLADLEFKIPELAGEEF-NLGSPKQLGVLLFEELGLPKTKKTDKTGARSTNADVLESLREDAHEI 79 (383) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCHH T ss_conf 98899999999999999999999999959976-99998999998497359999997789999867899999876168869 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHCC-CCCCCCCEEECCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCC Q ss_conf 8999999999754300035668873056-884522002113221001110688855477776652345654431024445 Q gi|254780127|r 274 AKDLILNRLASSGSAILKLNTLSEAVSS-DGRLRGTLQFYGASRTGRWSGCVFQPQNLPRQERSEEILTQTIQALNRGET 352 (675) Q Consensus 274 ~~~ll~~r~~~~~~~~~~~~~l~~~~~~-dGRih~~~~~~gAt~TGR~S~~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~ 352 (675) +..++++|...|..+ +|++.+...+++ |||||++|+++| |+|||+||++|||||||+.+ T Consensus 80 ~~~il~~R~~~Kl~~-t~~~~l~~~~~~~~gRih~~~nq~g-t~TGRlSs~~PNlQniP~~~------------------ 139 (383) T pfam00476 80 IKIILEYRQLSKLQS-TYVDKLPLMIDPDDGRIHTSYNQAG-TATGRLSSTDPNLQNIPIRN------------------ 139 (383) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHCCCCCEECCEEECCC-CCEEECCCCCCCCCCCCCCC------------------ T ss_conf 999999999999999-9999988862678986801242264-20220356798645799888------------------ Q ss_pred CCCCCCCCCHHHHEEEECCCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCC Q ss_conf 54443221013212147699878999733548505886622876786654044410000157760875110570221006 Q gi|254780127|r 353 IADPLGLASDCVRSCIIASNGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIG 432 (675) Q Consensus 353 ~~~~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~a 432 (675) +++..||+||+|+||++||+|||||||+||+||+|+|+.|+++|++|.|+|+.||+.+||+|+++|++++|+.| T Consensus 140 ------~~g~~iR~~Fva~~G~~lv~aDySQiElRilA~ls~D~~l~~af~~g~Dih~~tA~~ifg~~~~~v~~~~R~~A 213 (383) T pfam00476 140 ------EYGREIRAAFIAEPGYVLVAADYSQIELRILAHLSGDENLIEAFRTGADIHTLTAADIFGVDLHEVTGNQRRNA 213 (383) T ss_pred ------CCCHHHHHEEECCCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHH T ss_conf ------43055674362799987885058776688887520548899987027658898899984999788685567677 Q ss_pred CCCHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHH Q ss_conf 42100133078877898878870888640366666566676547889999999998751110103108897889999998 Q gi|254780127|r 433 KVMELALGYQGGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRA 512 (675) Q Consensus 433 K~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~ 512 (675) |++|||++||||+.+|.. ..+++.+||+.+++.|++ T Consensus 214 K~~~fGi~YG~g~~~La~--------------------------------------------~l~is~~eA~~~i~~~~~ 249 (383) T pfam00476 214 KTFNFGRIYGASAKGLSQ--------------------------------------------LLGISREEAKEFIEKYFE 249 (383) T ss_pred HHHHHHHHHCCCHHHHHH--------------------------------------------HHCCCHHHHHHHHHHHHH T ss_conf 771316644846889998--------------------------------------------858889999999999999 Q ss_pred HCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHH Q ss_conf 67307899999999999996337214201322234432124456411224631131024520123320310012357765 Q gi|254780127|r 513 KHQGVLQLWKDLTEGFACVVQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERT 592 (675) Q Consensus 513 ~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~ 592 (675) .||+|..|+..+...++. .+ ++.++.||++++|+......... ....| T Consensus 250 ~yp~i~~~~~~~~~~a~~---~g-----------------------~v~T~~GRrr~lp~~~~~~~~~~---~~~~r--- 297 (383) T pfam00476 250 RFPGVKRYREKTRKEAKK---GG-----------------------YVETLFGRRRYLPDIDSRNRSLR---EAAER--- 297 (383) T ss_pred HCHHHHHHHHHHHHHHHH---CC-----------------------CCCCCCCCEEECCCCCCCCHHHH---HHHHH--- T ss_conf 783699999999999996---89-----------------------61145687675577566678888---67767--- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHC------CCCEEEEECCEEEEECCCCHHHHHH-HHHHHHCCCCCCCCCCCCEE Q ss_conf 3355777674798999999999999967------9818999535899967847899999-99998467630216984157 Q gi|254780127|r 593 YGGKLTENIVQAISRDILCEGMKNATKN------GYDIVLTVHDEIVSETPDTPYFSVG-TLCSLMTKNPSWAKGLPLKA 665 (675) Q Consensus 593 ~g~k~~n~~iQgtAaDi~k~am~~l~~~------g~~ivl~VHDElv~evp~~~~~~~~-~i~~~M~~~p~wa~~~pl~a 665 (675) ..+|++||||||||+|.||+++++. +.++|+|||||||+|||+++++++. .|++||+++.. ..+||.+ T Consensus 298 ---~a~N~~IQGsAADi~k~Ami~i~~~l~~~~~~~~lvlqVHDElv~evp~~~~~~~~~~v~~~Me~~~~--l~VPl~v 372 (383) T pfam00476 298 ---AALNTPIQGSAADILKLAMIKLDEALVEKGLDARMCLQVHDEIVFEVPKEEAEAVAAQIKELMERAMF--LDVPLLV 372 (383) T ss_pred ---HHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHHHCC--CCCCEEE T ss_conf ---65353268689999999999999999854999789996354118997689999999999999960427--8996796 Q ss_pred ECCCCCCCC Q ss_conf 303114631 Q gi|254780127|r 666 EGYEAKRYR 674 (675) Q Consensus 666 ~~~~~~~y~ 674 (675) |..+|+.|- T Consensus 373 ~~~~G~~Wg 381 (383) T pfam00476 373 EVGQGRNWF 381 (383) T ss_pred ECCCCCCHH T ss_conf 538278801 No 20 >smart00482 POLAc DNA polymerase A domain. Probab=100.00 E-value=0 Score=356.48 Aligned_cols=200 Identities=29% Similarity=0.371 Sum_probs=171.3 Q ss_pred CHHHHEEEECCCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHH Q ss_conf 01321214769987899973354850588662287678665404441000015776087511057022100642100133 Q gi|254780127|r 361 SDCVRSCIIASNGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALG 440 (675) Q Consensus 361 ~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~ 440 (675) |+.||+||+|+|||+||++||||||+||+||+|+|+.|+++|.+|.|+|+.+|+.+||+|++++++++|+.||+++||++ T Consensus 1 g~~~R~~f~a~~G~~lv~aD~sqiE~Rv~A~la~D~~l~~~~~~g~D~h~~~A~~~f~~~~~~~~~~~R~~aK~~~~g~~ 80 (206) T smart00482 1 GREIRRAFVAPPGYVLVSADYSQIELRILAHLSGDENLLEAFNNGGDIHSKTAAQVFGVPEEEVTKELRRAAKAINFGII 80 (206) T ss_pred CCHHHCEEECCCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHCCCCHHHCCHHHHHHHCHHHHHHH T ss_conf 90432267679999999988668999999998298779998743898689999996498801157566988567888665 Q ss_pred HCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 07887789887887088864036666656667654788999999999875111010310889788999999867307899 Q gi|254780127|r 441 YQGGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQL 520 (675) Q Consensus 441 YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~ 520 (675) ||||+.+|.. . .+++..|++.+++.|++.||+|.+| T Consensus 81 YG~G~~~la~---~-----------------------------------------~g~~~~ea~~~~~~~~~~~p~i~~~ 116 (206) T smart00482 81 YGMGAKGLAE---Q-----------------------------------------LGISEAEAKELIKAYFARFPGVKRY 116 (206) T ss_pred CCCCCCCHHH---H-----------------------------------------CCCCHHHHHHHHHHHHHHCHHHHHH T ss_conf 1865110245---5-----------------------------------------1889999999999999978359999 Q ss_pred HHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 99999999999633721420132223443212445641122463113102452012332031001235776533557776 Q gi|254780127|r 521 WKDLTEGFACVVQEGGSISARRVANVPRLVMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTEN 600 (675) Q Consensus 521 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~ 600 (675) |..+...+.. .+ +++++.||++++|+.......... ......+|+ T Consensus 117 ~~~~~~~~~~---~g-----------------------~v~t~~Gr~~~~~~~~~~~~~~~~---------~~~~~~~N~ 161 (206) T smart00482 117 IKRTLEEARR---KG-----------------------YVTTLFGRRRYIPDIDSRNPVLRA---------AAERAAVNA 161 (206) T ss_pred HHHHHHHHHH---CC-----------------------CEECCCCCEEECCCCCCCCHHHHH---------HHHHHHCCH T ss_conf 9999999987---79-----------------------666788877767876875654553---------578766071 Q ss_pred HHHHHHHHHHHHHHHHHHHC------CCCEEEEECCEEEEECCCC Q ss_conf 74798999999999999967------9818999535899967847 Q gi|254780127|r 601 IVQAISRDILCEGMKNATKN------GYDIVLTVHDEIVSETPDT 639 (675) Q Consensus 601 ~iQgtAaDi~k~am~~l~~~------g~~ivl~VHDElv~evp~~ 639 (675) +||||||||++.||+++++. +.++++|||||||+|||+| T Consensus 162 ~vQgtaadi~~~al~~~~~~~~~~~~~~~iv~~VHDeiv~evp~e 206 (206) T smart00482 162 PIQGSAADILKLAMIKMDEALKEKGLRARLLLQVHDELVFEVPEE 206 (206) T ss_pred HCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCC T ss_conf 022899999999999999999853999779985360259961899 No 21 >cd06128 DNA_polA_exo The 3'-5' exonuclease domain of family-A DNA polymerases has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-A DNA polymerases contain a DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-B DNA polymerases. The exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, which are clustered around the active site and contain four invariant acidic residues that serve as ligands for the two metal ions required for catalysis. The Klenow fragment (KF) of Escherichia coli pol I, the Thermus aquaticus (Taq) pol I, and Bacillus stearothermophilus (BF) pol I are examples of family-A DNA polymerases. They are involved in nucleotide excision repair and in the processing of Okazaki fragments that are generated during lagging strand synthesis. The N-terminal domains of BF pol I and Taq pol I resemble the fold of the 3'-5' exonuclease domain of KF Probab=99.92 E-value=5.5e-24 Score=176.87 Aligned_cols=151 Identities=18% Similarity=0.149 Sum_probs=118.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCC----------HHHHHHHHHHCCCCCEEEEE Q ss_conf 78999804067887000460000268829999999838228999788842----------21799999864889819986 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQS----------AMPSDLLQYLRDETVMCVAH 71 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~----------~~~~~l~~~L~~~~~~kv~H 71 (675) ..+++|+||+|...+ +++|+|||+|++.+..+|+|+.|.. .+...|++.|+|+++.||+| T Consensus 2 ~~~a~DtETt~ld~~----------~a~iVGis~s~~~~~a~YiP~~h~~~~~~~~l~~~~vl~~L~p~led~~i~Kigh 71 (166) T cd06128 2 PVFAFDTETDSLDNI----------SANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRERALELLKPLLEDEKALKVGQ 71 (166) T ss_pred CEEEEECCCCCCCCC----------CCEEEEEEEECCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 989997887998904----------3549999998008969999526776675257789999999999973999764531 Q ss_pred EHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCC Q ss_conf 3077599999862998786504545889998588---999789999-808898652766376576641002332013453 Q gi|254780127|r 72 NSLFERILFKKTLGIDIPSKRWICTSVLARTNGL---PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDC 147 (675) Q Consensus 72 Na~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~ 147 (675) |+|||+.+|.++ |+++.. ..+||||+||++++ .++|+.+++ +|+.....+++..||.- .. . T Consensus 72 NlK~D~~vL~~~-Gi~l~g-~~fDTMlaaylldp~~~~h~Ld~La~~~L~~~~i~~~dl~GkGk------~q-------~ 136 (166) T cd06128 72 NLKYDRGILANY-GIELRG-IAFDTMLESYILNSVAGRHDMDSLAERWLKHKTITFEEIAGKGK------NQ-------L 136 (166) T ss_pred CHHHHHHHHHHC-CCCCCC-CCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCEEHHHHHCCCC------CC-------C T ss_conf 156679999619-966899-98457679876289998899699999983998705999837885------75-------8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 11258768999898877539999999987200 Q gi|254780127|r 148 TRANHVQAWQLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 148 ~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) +.. ..+.+....||++|+++|++||++|.| T Consensus 137 ~f~--~v~i~~~~~YA~eDAd~t~rL~~~l~P 166 (166) T cd06128 137 TFN--QIALEEAGRYAAEDADVTLQLHLKMWP 166 (166) T ss_pred CCC--CCCHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 932--289999999899999999999985389 No 22 >cd06139 DNA_polA_I_Ecoli_like_exo The 3'-5' exonuclease domain of DNA polymerases has a fundamental role in reducing polymerase errors and is involved in proofreading activity. E.coli-like Polymerase I (pol I), a subgroup of family-A DNA polymerases, contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain in the same polypeptide chain as the polymerase domain. The exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. E. coli DNA pol I is involved in genome replication but is not the main replicating enzyme. It is also implicated in DNA repair. Probab=99.92 E-value=1.1e-23 Score=174.90 Aligned_cols=173 Identities=18% Similarity=0.112 Sum_probs=130.8 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCC--------HHHHHHHHHHCCCCCEEEEEEH Q ss_conf 78999804067887000460000268829999999838228999788842--------2179999986488981998630 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQS--------AMPSDLLQYLRDETVMCVAHNS 73 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~--------~~~~~l~~~L~~~~~~kv~HNa 73 (675) +.+++|+||+| ++++ ..+|+|||+|++.+..+|+|+.|.. .+...++++|+++++.||+||+ T Consensus 6 ~~ia~DtETtg--------l~~~--~~~ivGisls~~~~~~~YiP~~h~~~~~~l~~~~v~~~lk~~led~~i~Kv~hn~ 75 (193) T cd06139 6 KVFAFDTETTS--------LDPM--QAELVGISFAVEPGEAYYIPLGHDYGGEQLPREEVLAALKPLLEDPSIKKVGQNL 75 (193) T ss_pred CEEEEECCCCC--------CCCC--CCEEEEEEEEECCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEECC T ss_conf 87899783699--------9868--8849999998079968998723656544679999999999997399986887547 Q ss_pred HHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCC Q ss_conf 77599999862998786504545889998588---999789999-80889865276637657664100233201345311 Q gi|254780127|r 74 LFERILFKKTLGIDIPSKRWICTSVLARTNGL---PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTR 149 (675) Q Consensus 74 ~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~ 149 (675) +||+.+|.++ |+.++ ..++||||+|+++++ +++|+.+++ +||.+..+.++..||.- ... T Consensus 76 k~D~~~L~~~-gi~~~-~~~~DTmiaa~ll~~~~~~~~L~~La~~~l~~~~~~~~~l~gkg~---------------~~~ 138 (193) T cd06139 76 KFDLHVLANH-GIELR-GPAFDTMLASYLLNPGRRRHGLDDLAERYLGHKTISFEDLVGKGK---------------KQI 138 (193) T ss_pred HHHHHHHHHC-CCCCC-CCCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCHHHHHCCCC---------------CCC T ss_conf 9789999983-98667-830218999987279988889999999981887663899837785---------------537 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCEEC Q ss_conf 2587689998988775399999999872002143101789999875168430 Q gi|254780127|r 150 ANHVQAWQLFGEYCKRDVEATRELFKRLIPLSDGERDLWLLDQTINDRGYRI 201 (675) Q Consensus 150 ~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~~~~E~~l~~~~~~me~~Gi~v 201 (675) .....+.+....||+.|++++++|++.|.+.++.+..+..+..++|+.-++| T Consensus 139 ~~~~~~~~~~~~YAa~Da~~t~~L~~~l~~~L~~~~~l~~ly~~iE~PL~~V 190 (193) T cd06139 139 TFDQVPLEKAAEYAAEDADITLRLYELLKPKLKEEPGLLELYEEIEMPLIPV 190 (193) T ss_pred CHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH T ss_conf 8313329999999999999999999999999984220999999984569998 No 23 >cd06140 DNA_polA_I_Bacillus_like_exo Bacillus stearothermophilus-like Polymerase I (pol I), a subgroup of the family-A DNA polymerases, contains an inactive DnaQ-like 3'-5' exonuclease domain in the same polypeptide chain as the polymerase region. The exonuclease-like domain of these proteins possess the same fold as the Klenow fragment (KF) of Escherichia coli pol I, but does not contain the four critical metal-binding residues necessary for activity. The function of this domain is unknown. It might act as a spacer between the polymerase and the 5'-3' exonuclease domains. Some members of this subgroup, such as those from Bacillus sphaericus and Thermus aquaticus, are thermostable DNA polymerases. Probab=99.81 E-value=3.2e-18 Score=138.72 Aligned_cols=164 Identities=16% Similarity=0.035 Sum_probs=121.6 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 78999804067887000460000268829999999838228999788842217999998648898199863077599999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFK 81 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~ 81 (675) ..+++|+||.+. +++ +.+|.||+++.+ +..+++|+.+.......++++|+|+++.||+||+|||+.+|. T Consensus 4 ~~~av~~e~~~~--------~~~--~~~i~gi~l~~~-~~~~yi~~~~~~~~~~~lk~~l~d~~i~Ki~hn~K~~~~~L~ 72 (178) T cd06140 4 DEVALYVELLGE--------NYH--TADIIGLALANG-GGAYYIPLELALLDLAALKEWLEDEKIPKVGHDAKRAYVALK 72 (178) T ss_pred CCEEEEEECCCC--------CCC--CCEEEEEEEEEC-CEEEEEECCCCHHHHHHHHHHHCCCCCCEEEECHHHHHHHHH T ss_conf 866999737999--------845--354999999889-979999566736469999999729999889865799999999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 862998786504545889998588---9997899998-088986527663765766410023320134531125876899 Q gi|254780127|r 82 KTLGIDIPSKRWICTSVLARTNGL---PSSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQ 157 (675) Q Consensus 82 ~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (675) ++ |++++ ..++||||+||++++ +++|+.++.. ++......++..|+.. . ....+.+ T Consensus 73 ~~-Gi~l~-g~~fDtmiaayLl~p~~~~~~L~~L~~~yl~~~~~~~~~~~gk~~----------------~--~~~~~~~ 132 (178) T cd06140 73 RH-GIELA-GVAFDTMLAAYLLDPTRSSYDLADLAKRYLGRELPSDEEVYGKGA----------------K--FAVPDEE 132 (178) T ss_pred HC-CCCCC-CCHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCHHHHHCCCC----------------C--CCCCCHH T ss_conf 88-98068-830368889887548977799999999982866654788617665----------------6--2439979 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHC Q ss_conf 9898877539999999987200214310178999987516 Q gi|254780127|r 158 LFGEYCKRDVEATRELFKRLIPLSDGERDLWLLDQTINDR 197 (675) Q Consensus 158 ~~~~Ya~~Dv~~~~~L~~~L~~~~~~E~~l~~~~~~me~~ 197 (675) ....|++.|+.++.+|+..|.+.++. ..+..+..++|+. T Consensus 133 ~~~~ya~~~a~~~~~L~~~l~~~L~e-~~l~~Ly~~iE~P 171 (178) T cd06140 133 VLAEHLARKAAAILRLAPKLEEELEE-NEQLELYYEVELP 171 (178) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH T ss_conf 99999999999999999999999998-2489999998712 No 24 >cd00007 35EXOc 3'-5' exonuclease. The 35EXOc domain is responsible for the 3'-5' exonuclease proofreading activity of prokaryotic DNA polymerase I (pol I) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in E. coli pol I. 35EXOc is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Probab=99.80 E-value=4.8e-18 Score=137.56 Aligned_cols=148 Identities=19% Similarity=0.201 Sum_probs=105.9 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCC-CCHHHHHHHHHHCCCCCEEEEEEHHHHHHHH Q ss_conf 789998040678870004600002688299999998382289997888-4221799999864889819986307759999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTE-QSAMPSDLLQYLRDETVMCVAHNSLFERILF 80 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~-~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL 80 (675) |.+++|+||++ ++.+ .++|.+|+++++++ .++++..+ ....+..++++|+++++.||+||++||..+| T Consensus 1 ~~ia~D~Et~~--------l~~~--~~~i~~iql~~~~~-~~yi~~~~~~~~~~~~l~~~l~~~~i~Kv~hn~k~D~~~L 69 (155) T cd00007 1 KEVAFDTETTG--------LNYH--RGKLVGIQIATAGE-AAYIPDELELEEDLEALKELLEDEDITKVGHDAKFDLVVL 69 (155) T ss_pred CEEEEECCCCC--------CCCC--CCCEEEEEEEECCC-EEEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHH T ss_conf 98999898799--------9778--88099999998993-5999814755246999999982999738987340009999 Q ss_pred HHHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHH Q ss_conf 9862998786504545889998588---9997899998-08898652766376576641002332013453112587689 Q gi|254780127|r 81 KKTLGIDIPSKRWICTSVLARTNGL---PSSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAW 156 (675) Q Consensus 81 ~~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~ 156 (675) .++ |+.++ ..++|||++|++++. ++||+.+++. ++.+..+.+...++. . .+.. .... T Consensus 70 ~~~-~~~~~-~~~~Dt~iaa~ll~~~~~~~~L~~l~~~~l~~~~~k~~~~~~~~---------------~-~~~~-~~l~ 130 (155) T cd00007 70 ARD-GIELP-GNIFDTMLAAYLLNPGEGSHSLDDLAKEYLGIELDKDEQIYGKG---------------A-KTFA-RPLS 130 (155) T ss_pred HHC-CCCCC-CCEEHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCHHHHHCCCC---------------C-CCCC-CCCH T ss_conf 862-86013-50114999999847785523699999998398767488861776---------------5-4444-7998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99898877539999999987200 Q gi|254780127|r 157 QLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 157 ~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) +.+..||+.|+..+++|++.|.+ T Consensus 131 ~~~~~YAa~D~~~~~~l~~~l~~ 153 (155) T cd00007 131 EELLEYAAEDADALLRLYEKLLE 153 (155) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999996 No 25 >KOG3657 consensus Probab=99.77 E-value=2.9e-17 Score=132.42 Aligned_cols=271 Identities=17% Similarity=0.101 Sum_probs=136.6 Q ss_pred CCCCCHHHHEEEECCCCCEEEEEECCCCCEEEEEEECCCH----HHHHHH---------HCCCCCCCCCHHHHHCCCCCC Q ss_conf 3221013212147699878999733548505886622876----786654---------044410000157760875110 Q gi|254780127|r 357 LGLASDCVRSCIIASNGKKLVVADLAGIEARVLAWIAGED----WKIKAF---------ANGEDIYVTTYAKAFNTPIDK 423 (675) Q Consensus 357 ~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~----~li~~f---------~~g~D~h~~tA~~~~~~~~~~ 423 (675) .+.+|..|+....|||||.||+||--..|+-|.|-++.-- .--.+| .+|-|+|+.||+.+ T Consensus 716 ~~Rigselkamvqappgy~LVgaDvdsqElwiaa~lgda~~~~~hg~ta~gwM~Lag~ks~gtdlhs~ta~~l------- 788 (1075) T KOG3657 716 PDRIGSELKAMVQAPPGYRLVGADVDSQELWIAALLGDASAEGVHGKTAFGWMTLAGSKSDGTDLHSKTASQL------- 788 (1075) T ss_pred HHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHC------- T ss_conf 2455078897610788616850563479999999963154412247522102111376656730767666650------- Q ss_pred CCHHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHCCCEECCHHHH Q ss_conf 570221006421001330788778988788708886403666665666765478-8999999999875111010310889 Q gi|254780127|r 424 VSKEQRAIGKVMELALGYQGGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQ-AESHHLWMQDQYPEFAVKDKLIGTA 502 (675) Q Consensus 424 v~~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~e 502 (675) +..|..||++|+|-+||.|..--.....+.+-.....++...+...+..--+ ....+...++...+. . .....+. T Consensus 789 --giSr~hakv~Ny~riygag~~fa~~ll~~fnp~l~~~Ea~~~A~~l~~~tkG~~~~rlk~e~~~~e~~-~-~~~~~~~ 864 (1075) T KOG3657 789 --GISRNHAKVFNYARIYGAGQTFAEKLLMRFNPSLTQSEAKSKASQLFKLTKGDRAKRLKVEVRMVENS-L-VCKILTI 864 (1075) T ss_pred --CCCHHHHHHCCHHHHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHH-H-HCHHHHH T ss_conf --52276665404787745886167776886189986678899999998862575110057677776653-2-0004543 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECC-CCHHHCCCCCCHHHHHHC Q ss_conf 7889999998673078999999999999963372142013222344321244564112246-311310245201233203 Q gi|254780127|r 503 CELVKKAWRAKHQGVLQLWKDLTEGFACVVQEGGSISARRVANVPRLVMRRHKRDVHIVLP-SSRRLVYSDVKGDCSYLN 581 (675) Q Consensus 503 a~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~p-sGR~l~~~~~~~~~~~~~ 581 (675) +.. | +..-|+.-+...++. ..||++..- ..++..-.... .-++| -|=++.+.=.+ T Consensus 865 ~~~----~------~~~~~~~~~~~~~~~-w~gg~es~~----fn~lesia~~~--~prtpvlgc~is~sl~~------- 920 (1075) T KOG3657 865 DGI----Y------LIYSIYENEVEPRRL-WVGGTESSM----FNKLESIATAH--DPRTPVLGCQISRSLEK------- 920 (1075) T ss_pred HHH----H------HHHHHHHCCCCHHHH-CCCCHHHHH----HHHHHHHHHCC--CCCCCEECCCHHHHHCC------- T ss_conf 015----5------543323054310453-057518888----88888776406--88887102413564433------- Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHH---HHCCCCEEEEECCEEEEECCCCHHHHH---HHHHHHHC Q ss_conf 1001235776533557776747989999999---99999---967981899953589996784789999---99999846 Q gi|254780127|r 582 TATSQLMRERTYGGKLTENIVQAISRDILCE---GMKNA---TKNGYDIVLTVHDEIVSETPDTPYFSV---GTLCSLMT 652 (675) Q Consensus 582 ~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~---am~~l---~~~g~~ivl~VHDElv~evp~~~~~~~---~~i~~~M~ 652 (675) ..++...+=+.-+|-.||.+|-|-+-. +|-=+ ++...+.++.+|||+=+-|.|+.+..+ -+|..+|+ T Consensus 921 ----~~~~~~~f~~srinwvvqssavd~lhlllvsm~wl~~~y~i~~rfcisihdevrylv~e~d~~raalalqisnl~t 996 (1075) T KOG3657 921 ----LPEGEPKFLPSRINWVVQSSAVDFLHLLLVSMQWLCDTYKIDARFCISIHDEVRYLVKEEDAPRAALALQISNLLT 996 (1075) T ss_pred ----CCCCCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf ----6667754444323577633406599999999999986415442799874176788862664277899999999999 Q ss_pred CCCC----CCCCCCCEEE Q ss_conf 7630----2169841573 Q gi|254780127|r 653 KNPS----WAKGLPLKAE 666 (675) Q Consensus 653 ~~p~----wa~~~pl~a~ 666 (675) ++.= =.-+||-.++ T Consensus 997 r~~~~~r~gi~~lp~sva 1014 (1075) T KOG3657 997 RAMFSQRVGINDLPQSVA 1014 (1075) T ss_pred HHHHHHHHHHCCCCHHHH T ss_conf 999998732200533442 No 26 >cd06142 RNaseD_exo Ribonuclease (RNase) D is a bacterial enzyme involved in the maturation of small stable RNAs and the 3' maturation of tRNA. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. In vivo, RNase D only becomes essential upon removal of other ribonucleases. Eukaryotic RNase D homologs include yeast Rrp6p, human PM/Scl-100 and the Drosophila melanogaster egalitarian protein. Probab=99.77 E-value=8.2e-17 Score=129.39 Aligned_cols=150 Identities=20% Similarity=0.145 Sum_probs=111.4 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 78999804067887000460000268829999999838228999788842217999998648898199863077599999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFK 81 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~ 81 (675) +.|++|+||.+ +..| .+++.+++++.+ +.++.+|+..... ...+.++|+++++.||+||++||+.+|. T Consensus 13 ~~i~~DtE~~~--------~~~~--~~~l~LiQia~~-~~~ylid~~~~~~-~~~L~~lL~d~~i~KVgh~~~~Dl~~L~ 80 (178) T cd06142 13 GVIAVDTEFMR--------LRTY--YPRLCLIQISTG-GQAYLIDPLAIGD-LSPLKELLADPNIVKVFHAAREDLELLK 80 (178) T ss_pred CEEEEECCCCC--------CCCC--CCEEEEEEEEEC-CCEEEEECCCCCC-HHHHHHHHCCCCCEEEEEEHHHHHHHHH T ss_conf 94899898678--------8558--872899999868-9069995312344-6999999759994399962799999999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 862998786504545889998588--999789999-80889865276637657664100233201345311258768999 Q gi|254780127|r 82 KTLGIDIPSKRWICTSVLARTNGL--PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQL 158 (675) Q Consensus 82 ~~lGi~i~~~~~~DTmi~A~~l~~--p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) +++|+.+ .+++|||+++++++. ++||+.+++ .||...++.. .. ..|..-| ..+. T Consensus 81 ~~~~~~~--~~v~Dt~ia~~~l~~~~~~~L~~L~~~~lg~~l~K~~---------------q~-----s~W~~rP-Ls~~ 137 (178) T cd06142 81 RDFGILP--VNLFDTQIAARLLGLGDSVGLAALVEELLGVELDKGE---------------QR-----SDWLKRP-LTDE 137 (178) T ss_pred HHHCCCC--CCEEEHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCC---------------CC-----CCCCCCC-CCHH T ss_conf 9739676--5453089999982886737899999999298777773---------------64-----6643367-8289 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 8988775399999999872002143101 Q gi|254780127|r 159 FGEYCKRDVEATRELFKRLIPLSDGERD 186 (675) Q Consensus 159 ~~~Ya~~Dv~~~~~L~~~L~~~~~~E~~ 186 (675) +..||+.||..++.||..|.+.+..+.. T Consensus 138 qi~YAa~Da~~l~~l~~~l~~~L~~~~~ 165 (178) T cd06142 138 QLEYAANDVRYLLPLYEKLKEELEEEGR 165 (178) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 9999999799999999999999998683 No 27 >pfam01612 3_5_exonuc 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in E. coli polI it is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Werner syndrome is a human genetic disorder causing premature aging; the WRN protein has helicase activity in the 3'-5' direction. The FFA-1 protein is required for formation of a replication foci and also has helicase activity; it is a homologue of the WRN protein. RNase D is a 3'-5' exonuclease involved in tRNA processing. Also found in this family is the autoantigen PM/Scl thought to be involved in polymyositis-scleroderma overlap syndrome. Probab=99.75 E-value=7.9e-17 Score=129.50 Aligned_cols=148 Identities=19% Similarity=0.151 Sum_probs=104.0 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 78999804067887000460000268829999999838228999788842217999998648898199863077599999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFK 81 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~ 81 (675) +.|++|+||.+. .+......+.+++++.+ ...+.++.......+..+.++|+++++.|||||++||..+|. T Consensus 21 ~~ia~DtEt~~~--------~~~~~~~~~~llq~~~~-~~~yli~~~~~~~~~~~L~~ll~~~~i~Kvghn~k~D~~~L~ 91 (172) T pfam01612 21 KVVAVDTETTSL--------DTYSYTLVGCLIQISTG-EGAYIIDPLALGDDLEGLKRLLEDPNITKVGHNAKFDLEVLA 91 (172) T ss_pred CCEEEEEECCCC--------CCCCCCCCEEEEEEECC-CCCEEEECCCCHHHHHHHHHHHCCCCCCEEEEEHHHHHHHHH T ss_conf 908998062898--------86761343369999679-860776434756679999999759998799975899999998 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 862998786504545889998588--9997899998-0889865276637657664100233201345311258768999 Q gi|254780127|r 82 KTLGIDIPSKRWICTSVLARTNGL--PSSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQL 158 (675) Q Consensus 82 ~~lGi~i~~~~~~DTmi~A~~l~~--p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) +++|+.+. +++|||+++++++. ++||+.+++. ||++.++.+ .. ..|...| ..+. T Consensus 92 ~~~~~~~~--~~~Dt~~a~~~l~~~~~~gL~~L~~~~lg~~~~K~~---------------~~-----s~w~~rp-L~~~ 148 (172) T pfam01612 92 RDFGIKLE--NLFDTMLAAYLLGYPRSHSLDDLAEKYLGVELDKSE---------------QC-----ADWRARP-LSEE 148 (172) T ss_pred HHCCCCCC--CCHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCHHH---------------EE-----CCCCCCC-CCHH T ss_conf 85476668--732789999881996435699999998098787441---------------56-----6788789-8299 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCH Q ss_conf 89887753999999998720021 Q gi|254780127|r 159 FGEYCKRDVEATRELFKRLIPLS 181 (675) Q Consensus 159 ~~~Ya~~Dv~~~~~L~~~L~~~~ 181 (675) +..||+.||.++++||+.|.+.+ T Consensus 149 qi~YAa~Da~~~~~l~~~l~~~l 171 (172) T pfam01612 149 QLRYAAEDADYLLRLYDKLRKEL 171 (172) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999974 No 28 >smart00474 35EXOc 3'-5' exonuclease. 3\' -5' exonuclease proofreading domain present in DNA polymerase I, Werner syndrome helicase, RNase D and other enzymes Probab=99.74 E-value=1.4e-16 Score=127.82 Aligned_cols=146 Identities=18% Similarity=0.156 Sum_probs=104.5 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 78999804067887000460000268829999999838228999788842217999998648898199863077599999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFK 81 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~ 81 (675) +.+++|+||++ +..| .+++.+++++++++..+.++.......+..++++|+++++.||+||++||..+|. T Consensus 22 ~~i~~DtE~~~--------~~~~--~~~l~~iql~~~~~~~yi~~~~~~~~~~~~l~~ll~~~~i~kv~hn~k~D~~~L~ 91 (172) T smart00474 22 GEVALDTETTG--------LNSY--SGKLVLIQISVTGEGAFIIDPLALGDDLEILKDLLEDETITKVGHNAKFDLHVLA 91 (172) T ss_pred CEEEEECCCCC--------CCCC--CCEEEEEEEEECCCEEEEEECCCCHHHHHHHHHHHCCCCCCEEEECHHHHHHHHH T ss_conf 95999883068--------8537--8758999999689848997012242179999999809998599974599999999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCCC---CCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 8629987865045458899985889---997899998-088986527663765766410023320134531125876899 Q gi|254780127|r 82 KTLGIDIPSKRWICTSVLARTNGLP---SSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQ 157 (675) Q Consensus 82 ~~lGi~i~~~~~~DTmi~A~~l~~p---~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (675) + +|+.++ +++|||+++++++.+ +||+.+++. +|.+.++. .....|...| ..+ T Consensus 92 ~-~gi~l~--~v~Dt~ia~~l~~~~~~~~~L~~l~~~~~~~~l~k~--------------------~~~~~w~~rp-l~~ 147 (172) T smart00474 92 R-FGIELE--NIFDTMLAAYLLLGGPSKHGLATLLKEYLGVELDKE--------------------EQKSDWGARP-LSE 147 (172) T ss_pred H-CCCCCC--CCHHHHHHHHHCCCCCCHHHHHHHHHHHCCEECCCC--------------------CCCCCCCCCC-CCH T ss_conf 7-798567--524679987750579864899999999749231555--------------------5646667899-909 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCH Q ss_conf 989887753999999998720021 Q gi|254780127|r 158 LFGEYCKRDVEATRELFKRLIPLS 181 (675) Q Consensus 158 ~~~~Ya~~Dv~~~~~L~~~L~~~~ 181 (675) .+..||+.|+.++++||+.|.+.+ T Consensus 148 ~~~~YAa~Da~~~~~l~~~l~~~L 171 (172) T smart00474 148 EQLQYAAEDADALLRLYEKLEKEL 171 (172) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999972 No 29 >cd06147 Rrp6p_like_exo Yeast Rrp6p and its human homolog, the Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100), are exosome-associated proteins involved in the degradation and processing of precursors to stable RNAs. Both proteins contain a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PM/Scl-100, an autoantigen present in the nucleolar compartment of the cell, reacts with autoantibodies produced by about 50% of patients with polymyositis-scleroderma overlap syndrome. Probab=99.63 E-value=3.2e-14 Score=112.27 Aligned_cols=156 Identities=16% Similarity=0.098 Sum_probs=111.3 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 78999804067887000460000268829999999838228999788842217999998648898199863077599999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFK 81 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~ 81 (675) +.|++|+|+.+ ...| .+++-+++++..+ .++.+|..........+.++|+++++.||+||+.||+.+|. T Consensus 25 ~~ialDtE~~~--------~~~~--~~~l~LiQiat~~-~~yliD~l~l~~~~~~L~~ll~~~~I~KV~H~~~~Dl~~L~ 93 (192) T cd06147 25 KEIAVDLEHHS--------YRSY--LGFTCLMQISTRE-EDYIVDTLKLRDDMHILNEVFTDPNILKVFHGADSDIIWLQ 93 (192) T ss_pred CEEEEECCCCC--------CCCC--CCEEEEEEEECCC-CCEEEEECCCCCCHHHHHHHHCCCCEEEEEECHHHHHHHHH T ss_conf 96999797378--------8569--9827999995099-88798743430037999999759883799855787799999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCCC-CCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH Q ss_conf 8629987865045458899985889-99789999-808898652766376576641002332013453112587689998 Q gi|254780127|r 82 KTLGIDIPSKRWICTSVLARTNGLP-SSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLF 159 (675) Q Consensus 82 ~~lGi~i~~~~~~DTmi~A~~l~~p-~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (675) +.+|+.+ .+++||++++..++.+ .||..+.+ .||.+.++.. .. ..|..-| ..+.+ T Consensus 94 ~~~gi~~--~n~fDT~iaa~~l~~~~~sL~~Lv~~~lg~~l~K~~---------------q~-----SdW~~RP-Ls~~Q 150 (192) T cd06147 94 RDFGLYV--VNLFDTGQAARVLNLPRHSLAYLLQKYCNVDADKKY---------------QL-----ADWRIRP-LPEEM 150 (192) T ss_pred HHHCCCC--CCCHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCC---------------EE-----CCCCCCC-CCHHH T ss_conf 9858987--660428999998488811199999999395887323---------------11-----4379998-99999 Q ss_pred HHHHHHHHHHHHHHHHHHCCC-HHHHHHHHHHH Q ss_conf 988775399999999872002-14310178999 Q gi|254780127|r 160 GEYCKRDVEATRELFKRLIPL-SDGERDLWLLD 191 (675) Q Consensus 160 ~~Ya~~Dv~~~~~L~~~L~~~-~~~E~~l~~~~ 191 (675) ..||+.||...+.||.+|... .+.+.++...+ T Consensus 151 l~YAA~Dv~~Ll~ly~~L~~~L~e~~~~~~~~~ 183 (192) T cd06147 151 IKYAREDTHYLLYIYDRLRNELLERANALAPNL 183 (192) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 999999999999999999999998428641678 No 30 >COG0349 Rnd Ribonuclease D [Translation, ribosomal structure and biogenesis] Probab=99.62 E-value=2.7e-14 Score=112.75 Aligned_cols=160 Identities=18% Similarity=0.140 Sum_probs=115.5 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 78999804067887000460000268829999999838228999788842217999998648898199863077599999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFK 81 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~ 81 (675) +.|+||+||- |+..| .++.-+|+++-+++ +..+|...+..-..-+...|.|++++||+|++.||+.+|. T Consensus 18 ~~iAIDTEf~--------r~~t~--~p~LcLIQi~~~e~-~~lIdp~~~~~d~~~l~~Ll~d~~v~KIfHaa~~DL~~l~ 86 (361) T COG0349 18 KAIAIDTEFM--------RLRTY--YPRLCLIQISDGEG-ASLIDPLAGILDLPPLVALLADPNVVKIFHAARFDLEVLL 86 (361) T ss_pred CEEEEECCCC--------CCCCC--CCCEEEEEEECCCC-CEEECCCCCCCCCCHHHHHHCCCCEEEEECCCCCCHHHHH T ss_conf 8079832224--------66436--88337999844888-6686226554666369998628760255025635199999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCCC--CCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 8629987865045458899985889--99789999-80889865276637657664100233201345311258768999 Q gi|254780127|r 82 KTLGIDIPSKRWICTSVLARTNGLP--SSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQL 158 (675) Q Consensus 82 ~~lGi~i~~~~~~DTmi~A~~l~~p--~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) +.+|... ..++||.|+|.+.+.+ +||+.+++ .+|+..++.. ..+.|... | ..+. T Consensus 87 ~~~g~~p--~plfdTqiAa~l~g~~~~~gl~~Lv~~ll~v~ldK~~---------------q~SDW~~R-----P-Ls~~ 143 (361) T COG0349 87 NLFGLLP--TPLFDTQIAAKLAGFGTSHGLADLVEELLGVELDKSE---------------QRSDWLAR-----P-LSEA 143 (361) T ss_pred HHCCCCC--CCHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCC---------------CCCCCCCC-----C-CCHH T ss_conf 8618887--7346899999980986323299999999688536444---------------54655557-----9-9999 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 8988775399999999872002143101789999875 Q gi|254780127|r 159 FGEYCKRDVEATRELFKRLIPLSDGERDLWLLDQTIN 195 (675) Q Consensus 159 ~~~Ya~~Dv~~~~~L~~~L~~~~~~E~~l~~~~~~me 195 (675) +.+||..||..-+.||.+|...+..+.+.......++ T Consensus 144 Ql~YAa~DV~yL~~l~~~L~~~L~~~~r~~~a~~ef~ 180 (361) T COG0349 144 QLEYAAADVEYLLPLYDKLTEELAREGRLEWAEDEFR 180 (361) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 9999999999999999999999987485499999999 No 31 >cd06148 Egl_like_exo The Egalitarian (Egl) protein subfamily is composed of Drosophila Egl and similar proteins. Egl is a component of an mRNA-binding complex which is required for oocyte specification. Egl contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The conservation of this subfamily throughout eukaryotes suggests that its members may be part of ancient RNA processing complexes that are likely to participate in the regulated processing of specific mRNAs. Probab=99.59 E-value=1.6e-13 Score=107.67 Aligned_cols=157 Identities=17% Similarity=0.063 Sum_probs=108.9 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCH--HHHHHHHHHCCCCCEEEEEEHHHHHHH Q ss_conf 789998040678870004600002688299999998382289997888422--179999986488981998630775999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSA--MPSDLLQYLRDETVMCVAHNSLFERIL 79 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~--~~~~l~~~L~~~~~~kv~HNa~FD~~v 79 (675) +.|++|+|+.+ +..| +++-+++++.++++++.+|+...+. ....++.+|++++++||+|++.+|..+ T Consensus 11 ~vi~iD~E~~~--------~~~~---g~l~LiQiat~~~~vyl~D~~~~~~~~~~~~Lk~iLe~~~I~KV~H~~r~D~~~ 79 (197) T cd06148 11 KVIGLDCEGVN--------LGRK---GKLCLVQIATRTGQIYLFDILKLGSIVFINGLKDILESKKILKVIHDCRRDSDA 79 (197) T ss_pred CEEEEECCCCC--------CCCC---CCEEEEEEEECCCCEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECHHHHHHH T ss_conf 99999888678--------8989---988799996489988999616676344568999995699823999610999999 Q ss_pred HHHHCCCCCCCCCEEEHHHHHHHHCC----------CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCC Q ss_conf 99862998786504545889998588----------999789999-8088986527663765766410023320134531 Q gi|254780127|r 80 FKKTLGIDIPSKRWICTSVLARTNGL----------PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCT 148 (675) Q Consensus 80 L~~~lGi~i~~~~~~DTmi~A~~l~~----------p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~ 148 (675) |.+++|+.+ .+++||++++.++.. ..||..+.+ .++.+....+.. +.. .. ..... T Consensus 80 L~~~~gi~~--~nvfDTqiA~~ll~~~~~~~~~~~~~~sl~~l~~~~l~~~~~~~~~~--~~~--------~~--~~~s~ 145 (197) T cd06148 80 LYHQYGIKL--NNVFDTQVADALLQEQETGGFNPDRVISLVQLLDKYLYISISLKEDV--KKL--------MR--EDPKF 145 (197) T ss_pred HHHHHCCCC--CCCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCHHHHHH--HHH--------HH--CCCCC T ss_conf 999709970--67031999999975313147677564159999999957640177877--888--------75--06554 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 125876899989887753999999998720021431 Q gi|254780127|r 149 RANHVQAWQLFGEYCKRDVEATRELFKRLIPLSDGE 184 (675) Q Consensus 149 ~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~~~~E 184 (675) |..-| ..+.+..||+.||.....||..|...+-.+ T Consensus 146 W~~RP-Lt~~ql~YAa~DV~yL~~L~~~L~~~Li~~ 180 (197) T cd06148 146 WALRP-LTEDMIRYAALDVLCLLPLYYAMLDALISK 180 (197) T ss_pred CCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 46799-999999999998999999999999999999 No 32 >cd06129 RNaseD_like The RNase D-like group is composed of RNase D, WRN, and similar proteins. They contain a DEDDy-type, DnaQ-like, 3'-5' exonuclease domain that contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. RNase D is involved in the 3'-end processing of tRNA precursors. RNase D-like proteins in eukaryotes include yeast Rrp6p, human PM/Scl-100 and Drosophila melanogaster egalitarian (Egl) protein. WRN is a unique DNA helicase possessing exonuclease activity. Mutation in the WRN gene is implicated in Werner syndrome, a disease associated with premature aging and increased predisposition to cancer. Yeast Rrp6p and the human Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100) are exosome-associated proteins involved in the degradation and processing of prec Probab=99.47 E-value=3.8e-12 Score=98.55 Aligned_cols=138 Identities=19% Similarity=0.136 Sum_probs=101.7 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHH Q ss_conf 89998040678870004600002688299999998382289997888422179999986488981998630775999998 Q gi|254780127|r 3 KLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFKK 82 (675) Q Consensus 3 ~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~~ 82 (675) .|++|+||.+ +..+ .+++.++++|.+ +.++.+|+......|..+.++|+++++.||+||+.+|...|.+ T Consensus 17 vig~DtE~~~--------~~~~--~~~v~LiQlat~-~~~~lid~~~~~~~~~~L~~lL~~~~i~Kvgh~~~~D~~~L~~ 85 (159) T cd06129 17 VVALDCEWIP--------LRLY--YGEVTLIQIGTT-EHAFLFDVLSLSAFDGGLKTVLENPSITKVGHGCRGDLAKLLR 85 (159) T ss_pred EEEEECCCCC--------CCCC--CCCEEEEEEEEC-CEEEEEECCCCCCCHHHHHHHHHCCCCEEEEECHHHHHHHHHH T ss_conf 8999797689--------8799--897899999459-9689995488876779999998299924999566999999967 Q ss_pred HCCCCCCCCCEEEHHHHHHHHCCC---CCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 629987865045458899985889---997899998-0889865276637657664100233201345311258768999 Q gi|254780127|r 83 TLGIDIPSKRWICTSVLARTNGLP---SSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQL 158 (675) Q Consensus 83 ~lGi~i~~~~~~DTmi~A~~l~~p---~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) .+|+.+ .+++||++++..++.+ .||+.+++. +|...++.. .. .+|..-| ..+. T Consensus 86 ~~g~~~--~~~~Dt~~~~~~l~~~~~~~sL~~l~~~~lg~~ldK~~---------------q~-----SnW~~rp-Lt~~ 142 (159) T cd06129 86 DFGILL--RNLFDTQAAHAILGYPEEYWSLNSLVEQYLGPCLDKSR---------------QL-----SDWAKRP-LTEE 142 (159) T ss_pred HCCCCC--CCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCC---------------CC-----CCCCCCC-CCHH T ss_conf 508432--43036999999958884316599999998087877102---------------45-----8889999-9999 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 8988775399999999 Q gi|254780127|r 159 FGEYCKRDVEATRELF 174 (675) Q Consensus 159 ~~~Ya~~Dv~~~~~L~ 174 (675) +..||+.||...+.|| T Consensus 143 Qi~YAA~Da~~ll~ly 158 (159) T cd06129 143 QMLYAAADVYVLLHIY 158 (159) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999976 No 33 >cd06141 WRN_exo WRN is a unique RecQ DNA helicase exhibiting an exonuclease activity. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Mutations in the WRN gene cause Werner syndrome, an autosomal recessive disorder associated with premature aging and increased susceptibility to cancer and type II diabetes. WRN interacts with key proteins involved in DNA replication, recombination, and repair. It is believed to maintain genomic stability and life span by participating in DNA processes. WRN is stimulated by Ku70/80, an important regulator of genomic stability. Probab=99.44 E-value=8.8e-12 Score=96.13 Aligned_cols=146 Identities=21% Similarity=0.226 Sum_probs=105.5 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 78999804067887000460000268829999999838228999788842217999998648898199863077599999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFK 81 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~ 81 (675) +.|++|+|+..... + | ..+.+.++++|.+ +.++.+|+.....+|..|.++|++++++|||||+.+|...|. T Consensus 19 ~vvG~D~Ew~~~~~--~-~-----~~~~~aliQlat~-~~~~l~d~~~~~~~~~~L~~lL~~~~i~Kvg~~~~~D~~~L~ 89 (168) T cd06141 19 KVVGFDLEWRPSFR--K-G-----KRNKVALLQLASE-SRCLLFQLSHMDKLPPSLKQLLEDPSILKVGVGIKGDARKLR 89 (168) T ss_pred CEEEEECEECCCCC--C-C-----CCCCCEEEEEECC-CCEEEEEECCCCHHHHHHHHHHCCCCEEEEEEEEHHHHHHHH T ss_conf 88999664558777--7-8-----8897159998249-965999740331204999999759980799963188999998 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 862998786504545889998588---999789999-8088986527663765766410023320134531125876899 Q gi|254780127|r 82 KTLGIDIPSKRWICTSVLARTNGL---PSSLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQ 157 (675) Q Consensus 82 ~~lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (675) +.+|+.+ .+++||+.++..++. +.||+.+++ .||....+.| .. . ..+|..-| ..+ T Consensus 90 ~~~g~~~--~~~~Dl~~~a~~~~~~~~~~sL~~l~~~~lg~~l~K~k-----~~--------q-----~SdW~~rp-Ls~ 148 (168) T cd06141 90 RDFGIEV--RGVVDLSHLAKRVGPRRKLVSLARLVEEVLGLPLSKDK-----KV--------R-----CSNWEARP-LSK 148 (168) T ss_pred HHCCCCC--CCEEEHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCC-----CC--------C-----CCCCCCCC-CCH T ss_conf 7445024--87656999999857777763299999998396215466-----52--------0-----38889999-999 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 98988775399999999872 Q gi|254780127|r 158 LFGEYCKRDVEATRELFKRL 177 (675) Q Consensus 158 ~~~~Ya~~Dv~~~~~L~~~L 177 (675) .+..||+.|+.+.+.||++| T Consensus 149 ~Qi~YAA~Da~~~~~l~~~L 168 (168) T cd06141 149 EQILYAATDAYASLELYRKL 168 (168) T ss_pred HHHHHHHHHHHHHHHHHHHC T ss_conf 99999999999999999739 No 34 >PRK10829 ribonuclease D; Provisional Probab=99.31 E-value=2.6e-09 Score=79.78 Aligned_cols=145 Identities=17% Similarity=0.109 Sum_probs=100.4 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHH Q ss_conf 78999804067887000460000268829999999838228999788842217999998648898199863077599999 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFK 81 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~ 81 (675) +.|++|+|--+ ...| -+.+-+|+++. ++..+.||-..... ..-+.++|.+++++||+|-+..|+.+|. T Consensus 23 ~~iavDTEf~r--------~~ty--~p~l~LiQi~~-~~~~~lIDpl~~~d-l~~l~~ll~~~~i~kv~Haa~qDle~l~ 90 (373) T PRK10829 23 PAIALDTEFVR--------TRTY--YPQLGLIQLFD-GEQLALIDPLGITD-WSPFKALLRDPSITKFLHAGSEDLEVFL 90 (373) T ss_pred CCEEEECCCCC--------CCCC--CCEEEEEEEEE-CCCEEEEECCCCCC-HHHHHHHHCCCCCEEEEECCCCHHHHHH T ss_conf 91898577456--------7778--98458999851-89679994787777-6999999709884165206421599999 Q ss_pred HHCCCCCCCCCEEEHHHHHHHHCCCC--CHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 86299878650454588999858899--9789999-80889865276637657664100233201345311258768999 Q gi|254780127|r 82 KTLGIDIPSKRWICTSVLARTNGLPS--SLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQL 158 (675) Q Consensus 82 ~~lGi~i~~~~~~DTmi~A~~l~~p~--sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (675) +.+|.. |.++|||+|+|.+++.+. |+..+.+ .+|+.-++.. ..+ .|..-| ..+. T Consensus 91 ~~~g~~--p~~iFDTQiAa~~lg~~~~~gy~~Lv~~~l~v~l~K~~---------------~~s-----DW~~RP-Ls~~ 147 (373) T PRK10829 91 NAFGEL--PQPLIDTQILAAFCGRPLSWGFASMVEEYSGVALDKSE---------------SRT-----DWLARP-LSER 147 (373) T ss_pred HHCCCC--CCHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCC---------------CCC-----CCCCCC-CCHH T ss_conf 981988--74077648899865786555899999999396656664---------------627-----578899-9989 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCH Q ss_conf 89887753999999998720021 Q gi|254780127|r 159 FGEYCKRDVEATRELFKRLIPLS 181 (675) Q Consensus 159 ~~~Ya~~Dv~~~~~L~~~L~~~~ 181 (675) +..||+.||.....+|..|...+ T Consensus 148 ql~YA~~DV~yL~~l~~~L~~~l 170 (373) T PRK10829 148 QCEYAAADVWYLLPIAAKLMVET 170 (373) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999 No 35 >cd06146 mut-7_like_exo The mut-7 subfamily is composed of Caenorhabditis elegans mut-7 and similar proteins found in plants and metazoans. Mut-7 is implicated in posttranscriptional gene silencing. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs, termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=99.30 E-value=3.1e-10 Score=85.86 Aligned_cols=146 Identities=14% Similarity=0.031 Sum_probs=99.0 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCC-----CHHHHHHHHHHCCCCCEEEEEEHHHH Q ss_conf 7899980406788700046000026882999999983822899978884-----22179999986488981998630775 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQ-----SAMPSDLLQYLRDETVMCVAHNSLFE 76 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~-----~~~~~~l~~~L~~~~~~kv~HNa~FD 76 (675) +.|++|+|+.... .. ...+++.++++|.++ .++.+|+..- +.++.-+..+|++++++||||++.+| T Consensus 23 ~vvg~D~Ew~~~~------~~--~~~~~vaLiQlat~~-~~~liD~~~~~~~~~~~~~~~L~~ll~~~~i~Kvg~~~~~D 93 (191) T cd06146 23 RVVGIDSEWKPSF------LG--DSDPRVAILQLATED-EVFLLDLLALENLESEDWDRLLKRLFEDPDVLKLGFGFKQD 93 (191) T ss_pred CEEEEECCCCCCC------CC--CCCCCEEEEEEECCC-CCEEEEEECCCCCCHHHHHHHHHHHHCCCCEEEEEECHHHH T ss_conf 8899978757877------78--888952799995299-75899970456543477899999994799968999443868 Q ss_pred HHHHHHHCCCC----CCCCCEEEHHHHHHHHC-------------CCCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHH Q ss_conf 99999862998----78650454588999858-------------89997899998-08898652766376576641002 Q gi|254780127|r 77 RILFKKTLGID----IPSKRWICTSVLARTNG-------------LPSSLKNACLA-LNFSEHLTKMEEGKALIARFCKG 138 (675) Q Consensus 77 ~~vL~~~lGi~----i~~~~~~DTmi~A~~l~-------------~p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~ 138 (675) +..|.+.+|+. ..+.+++||..++..+. .+.||..+++. +|...++.. T Consensus 94 ~~~L~~~~~~~~~~~~~~~~~~Dl~~~a~~~~~~~~~~~~~~~~~~~~gL~~L~~~~lg~~l~K~~-------------- 159 (191) T cd06146 94 LKALSASYPALKCMFERVQNVLDLQNLAKELQKSDMGRLKGNLPSKTKGLADLVQAVLGKPLDKSE-------------- 159 (191) T ss_pred HHHHHHHCCCCCCCCCCCCEEEEHHHHHHHHHCCCCCCHHHCCCCCCCCHHHHHHHHHCCCCCCCC-------------- T ss_conf 999998658643233364403579999999731122101100355675699999999098788565-------------- Q ss_pred HHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 332013453112587689998988775399999999872 Q gi|254780127|r 139 SIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRL 177 (675) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L 177 (675) . ..+|..-| ..+.+..||+.||.+.+.||++| T Consensus 160 -q-----~SnW~~rP-Lt~~Qi~YAA~Da~~ll~iy~~L 191 (191) T cd06146 160 -Q-----CSNWERRP-LREEQILYAALDAYCLLEVFDKL 191 (191) T ss_pred -C-----CCCCCCCC-CCHHHHHHHHHHHHHHHHHHHHC T ss_conf -1-----38999998-99999999999899999999739 No 36 >PRK06631 consensus Probab=99.25 E-value=2.9e-10 Score=86.10 Aligned_cols=85 Identities=19% Similarity=0.235 Sum_probs=58.9 Q ss_pred HHHHHCCCCCEEEEEEHHHHHHHHHHHC---CC-CC---CCCCEEEHHHHHHHHC--CCCCHHHHHHHHCCCCCCHHHHH Q ss_conf 9998648898199863077599999862---99-87---8650454588999858--89997899998088986527663 Q gi|254780127|r 57 LLQYLRDETVMCVAHNSLFERILFKKTL---GI-DI---PSKRWICTSVLARTNG--LPSSLKNACLALNFSEHLTKMEE 127 (675) Q Consensus 57 l~~~L~~~~~~kv~HNa~FD~~vL~~~l---Gi-~i---~~~~~~DTmi~A~~l~--~p~sL~~la~~lgl~~~~~kd~~ 127 (675) +.+++ .+..+|+||+.||+.||...+ |. ++ ...++.||+.+|+-+- .++||+++|++|+++..... T Consensus 80 fl~Fi--~~s~LVaHNA~FD~~FLn~El~~l~~~~~~~~~~~~viDTL~lAR~~~Pg~~~sLdaLc~r~~I~~s~r~--- 154 (229) T PRK06631 80 FLEFI--SDSKLIIHNAPFDIKFLNHELSLLKRTEIKLLELANTIDTLVMARSMFPGSKYNLDALCKRFKVDNSGRQ--- 154 (229) T ss_pred HHHHC--CCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHCCCCCCCCC--- T ss_conf 99982--9997999763875989999999827888884320110126999998689988899999998499999855--- Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 765766410023320134531125876899989887753999999998720 Q gi|254780127|r 128 GKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLI 178 (675) Q Consensus 128 gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~ 178 (675) ..-|..|++++..+|..|. T Consensus 155 --------------------------------~H~AL~DA~llA~Vyl~l~ 173 (229) T PRK06631 155 --------------------------------LHGALKDAALLAEVYVELT 173 (229) T ss_pred --------------------------------CCCCHHHHHHHHHHHHHHH T ss_conf --------------------------------5686899999999999971 No 37 >PRK05711 DNA polymerase III subunit epsilon; Provisional Probab=99.18 E-value=1.6e-09 Score=81.21 Aligned_cols=63 Identities=25% Similarity=0.405 Sum_probs=46.3 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCC----CCEEEHHHHHHHH--CCCCCHHHHHHHHCCCC Q ss_conf 9999864889819986307759999986---2998786----5045458899985--88999789999808898 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKT---LGIDIPS----KRWICTSVLARTN--GLPSSLKNACLALNFSE 120 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~----~~~~DTmi~A~~l--~~p~sL~~la~~lgl~~ 120 (675) ++.+++ .+..+|+||+.||+.||... +|.+.++ ..+.||+.+|+-+ +.+.||+++|++|+++. T Consensus 79 efl~Fi--~~s~lVaHNa~FD~~FLn~El~r~g~~~~~~~~~~~viDTL~lAR~~~Pg~~~sLdaLc~r~~I~~ 150 (240) T PRK05711 79 EFLDFI--RGAELIIHNAPFDIGFMDYEFALLGRDIPKTTTFCKVTDTLAMARRMFPGKRNSLDALCKRYGIDN 150 (240) T ss_pred HHHHHH--CCCEEEEECCHHHHHHHHHHHHHHCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCHHHHHHHCCCCC T ss_conf 999981--899799966199899999999986899876445212101999999878999889999999809898 No 38 >PRK06807 DNA polymerase III subunit epsilon; Validated Probab=99.18 E-value=2.7e-09 Score=79.65 Aligned_cols=80 Identities=21% Similarity=0.262 Sum_probs=59.5 Q ss_pred CCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHHCCCCCCHHHHHCHHHHHHHHH Q ss_conf 89819986307759999986---2998786504545889998588---99978999980889865276637657664100 Q gi|254780127|r 64 ETVMCVAHNSLFERILFKKT---LGIDIPSKRWICTSVLARTNGL---PSSLKNACLALNFSEHLTKMEEGKALIARFCK 137 (675) Q Consensus 64 ~~~~kv~HNa~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~lgl~~~~~kd~~gk~li~~~~~ 137 (675) .+.++||||+.||+.||... +|++-+...+.||+-+|+.+-+ .+.|..+++.||++-. T Consensus 87 ~d~iLVAHNA~FD~~FL~~~L~r~gl~~~~n~~IDTL~LAR~l~P~L~NhKLnTVa~~LgI~l~---------------- 150 (313) T PRK06807 87 HTNVIVAHNASFDMRFLKSNVNMLGLPEPKNKVIDTVFLAKKYMKHAPNHKLETLKRMLGIRLS---------------- 150 (313) T ss_pred CCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHHCCCCC---------------- T ss_conf 8998999553661999999999749988899877659999986478765667899998399834---------------- Q ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCH Q ss_conf 23320134531125876899989887753999999998720021 Q gi|254780127|r 138 GSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIPLS 181 (675) Q Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~~ 181 (675) ...|..|+.+|..++.++..+- T Consensus 151 ----------------------HHrAldDA~acAeI~~k~~~~~ 172 (313) T PRK06807 151 ----------------------SHNAFDDCITCAAVYQKCASIE 172 (313) T ss_pred ----------------------CCCCHHHHHHHHHHHHHHHHHH T ss_conf ----------------------3671778999999999999988 No 39 >PRK07009 consensus Probab=99.18 E-value=1.4e-09 Score=81.63 Aligned_cols=64 Identities=20% Similarity=0.239 Sum_probs=46.5 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCC---CCEEEHHHHHHHHC--CCCCHHHHHHHHCCCCC Q ss_conf 9999864889819986307759999986---2998786---50454588999858--89997899998088986 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKT---LGIDIPS---KRWICTSVLARTNG--LPSSLKNACLALNFSEH 121 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~---~~~~DTmi~A~~l~--~p~sL~~la~~lgl~~~ 121 (675) ++.+++. +..+|+||+.||+.||... +|.+.-. .++.||+.+|+-+- .+.||+++|++|+++.. T Consensus 76 efl~Fi~--~a~LVaHNA~FD~~FLn~EL~r~G~~~~~~~~~~viDTL~lAR~~~Pg~~~sLdaLc~R~~I~~~ 147 (241) T PRK07009 76 QIRDFVQ--DAELIIHNAPFDLGFLDAEFALLGLPPFTEHCAGVIDTLVQAKQMFPGKRNSLDALCDRFGISNA 147 (241) T ss_pred HHHHHHC--CCEEEEECHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCC T ss_conf 9999966--99278867699999999999983997310244447679999999789998788999998286988 No 40 >PRK08074 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=99.17 E-value=3.6e-09 Score=78.84 Aligned_cols=110 Identities=20% Similarity=0.160 Sum_probs=72.1 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEE-EECCCEEE-EE--ECCCCCHHHH---------------------- Q ss_conf 7899980406788700046000026882999999-98382289-99--7888422179---------------------- Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAY-AWDDEPVK-LW--DRTEQSAMPS---------------------- 55 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~-a~~~~~~~-~~--d~~~~~~~~~---------------------- 55 (675) |++.+|+||++ +++-. .-+|.-|+. -+.++.+. .+ .+-++.++|. T Consensus 4 ~~~vvDlETTG--------~~~~~-~d~IIqig~v~i~~~~i~~~f~t~vnP~~~iP~~I~~LTgI~~~~v~~AP~f~ev 74 (932) T PRK08074 4 RFVVVDLETTG--------NSPKK-GDKIIQIAAVVVEDGEILERFSTFVNPERPIPPFITELTGISEEMVKNAPLFEDV 74 (932) T ss_pred EEEEEEEECCC--------CCCCC-CCEEEEEEEEEEECCEEEEEEECCCCCCCCCCHHHHHHCCCCHHHHCCCCCHHHH T ss_conf 08999988889--------89777-9889999799999999989985325899989977886069898998059998999 Q ss_pred --HHHHHHCCCCCEEEEEEHHHHHHHHHHHC---CCCCCCCCEEEHHHHHHHHC---CCCCHHHHHHHHCCCCCC Q ss_conf --99998648898199863077599999862---99878650454588999858---899978999980889865 Q gi|254780127|r 56 --DLLQYLRDETVMCVAHNSLFERILFKKTL---GIDIPSKRWICTSVLARTNG---LPSSLKNACLALNFSEHL 122 (675) Q Consensus 56 --~l~~~L~~~~~~kv~HNa~FD~~vL~~~l---Gi~i~~~~~~DTmi~A~~l~---~p~sL~~la~~lgl~~~~ 122 (675) ++.++| ++...||||+.||..||...+ |++.....++||+-+|+.+- .+++|.++++.||++.+. T Consensus 75 a~~i~~~l--~~~~~VaHNv~FD~~FL~~~l~~~G~~~~~~~~~DTveLa~i~~P~~~sy~L~~L~~~l~i~~~~ 147 (932) T PRK08074 75 APMIVELL--EGAYFVAHNVHFDLNFLNEELERAGYTEIHCPKLDTVELARILLPTADSYKLSDLSEELELEHDQ 147 (932) T ss_pred HHHHHHHH--CCCEEEECCHHHHHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHHCCCCCCC T ss_conf 99999996--69959947805409999999997399866687651999999977887888889999864999999 No 41 >PRK08816 consensus Probab=99.17 E-value=2.6e-09 Score=79.77 Aligned_cols=110 Identities=19% Similarity=0.261 Sum_probs=66.2 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEE-EEEE-CCCEE---EEEECCCCCHH---------------------- Q ss_conf 978999804067887000460000268829999-9998-38228---99978884221---------------------- Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLC-AYAW-DDEPV---KLWDRTEQSAM---------------------- 53 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i-~~a~-~~~~~---~~~d~~~~~~~---------------------- 53 (675) |+.|.+|+||+ |+++... -+|.-| ++.+ +...+ +..-+-++.++ T Consensus 1 mR~VVlDtETT--------Gl~~~~g-dRIIEIgaV~~~~~~~~~~~f~~~inP~r~I~~~a~~vhGIT~e~L~dkP~F~ 71 (244) T PRK08816 1 MRQIILDTETT--------GLEWRKG-NRVVEIGAVELLERRPSGNNFHRYLKPDCDFEPGAQEVTGLTLEFLADKPLFG 71 (244) T ss_pred CCEEEEEEECC--------CCCCCCC-CEEEEEEEEEEECCEECCCEEEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHH T ss_conf 98799972279--------9899899-88999999999999681437899868899699889875251789984099899 Q ss_pred --HHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCC----CCCCCEEEHHHHHHHHC--CCCCHHHHHHHHCCCCC Q ss_conf --799999864889819986307759999986---2998----78650454588999858--89997899998088986 Q gi|254780127|r 54 --PSDLLQYLRDETVMCVAHNSLFERILFKKT---LGID----IPSKRWICTSVLARTNG--LPSSLKNACLALNFSEH 121 (675) Q Consensus 54 --~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~----i~~~~~~DTmi~A~~l~--~p~sL~~la~~lgl~~~ 121 (675) ..++..++ .+..+|+||+.||+.||... +|.+ .....+.||+.+|+-+- .+.||+++|++|+++.. T Consensus 72 eV~~efl~Fi--~ds~LVaHNa~FDi~FLn~El~r~g~~~~~~~~~~~v~DTl~lAr~~~Pg~~nsLDaLc~r~~id~~ 148 (244) T PRK08816 72 EVVDEFLAYI--DGAELIIHNAAFDLGFLDNELSLLGDQYGRIVDRATVVDTLMMARERYPGQRNSLDALCKRLGVDNS 148 (244) T ss_pred HHHHHHHHHH--CCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHCCCCCCCHHHHHHHCCCCCC T ss_conf 9999999982--8998999663652989999999848875676431010438999998787997789999997497887 No 42 >PRK06022 consensus Probab=99.13 E-value=1.2e-09 Score=82.05 Aligned_cols=64 Identities=28% Similarity=0.402 Sum_probs=47.2 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCC-CCCCEEEHHHHHHHHC--CCCCHHHHHHHHCCCCC Q ss_conf 9999864889819986307759999986---29987-8650454588999858--89997899998088986 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKT---LGIDI-PSKRWICTSVLARTNG--LPSSLKNACLALNFSEH 121 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i-~~~~~~DTmi~A~~l~--~p~sL~~la~~lgl~~~ 121 (675) ++.+++. +..+|+||+.||+.||... +|.+. ...++.||+.+|+-+- .+.||+++|++|+++.. T Consensus 76 ~fl~Fi~--~a~LVaHNA~FD~gFLn~El~r~G~~~~~~~~viDTL~LAR~~~Pg~~~sLdaLc~r~~Id~~ 145 (234) T PRK06022 76 EFLEFFD--GARLVAHNATFDLGFINAEFARLGQPEITSERVVDTLALARRKHPMGPNSLDALCRRYGIDNS 145 (234) T ss_pred HHHHHHC--CCEEEEECHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCC T ss_conf 9999818--985999557872999999999859987778887469999998688998898999998399977 No 43 >PRK07982 consensus Probab=99.12 E-value=5.5e-09 Score=77.61 Aligned_cols=65 Identities=26% Similarity=0.383 Sum_probs=47.2 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHHC---CCCCCC----CCEEEHHHHHHHH--CCCCCHHHHHHHHCCCCC Q ss_conf 999998648898199863077599999862---998786----5045458899985--889997899998088986 Q gi|254780127|r 55 SDLLQYLRDETVMCVAHNSLFERILFKKTL---GIDIPS----KRWICTSVLARTN--GLPSSLKNACLALNFSEH 121 (675) Q Consensus 55 ~~l~~~L~~~~~~kv~HNa~FD~~vL~~~l---Gi~i~~----~~~~DTmi~A~~l--~~p~sL~~la~~lgl~~~ 121 (675) +++..++. +..+|+||+.||+.||...+ |..+++ .++.||+.+|+-+ +.+.||+++|++|+++.. T Consensus 84 ~efl~Fi~--~a~LVaHNA~FD~~FLn~El~r~g~~~~~~~~~~~v~DTL~lAR~~~Pg~~~sLDaLc~R~~Id~s 157 (243) T PRK07982 84 DEFLDYIR--GAELVIHNAAFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLALARKMFPGKRNSLDALCDRYEIDNS 157 (243) T ss_pred HHHHHHHC--CCEEEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCEEEHHHHHHHHCCCCCCCHHHHHHHCCCCCC T ss_conf 99999967--998999733888999999999708898861012431349999888788986899999875487765 No 44 >TIGR01388 rnd ribonuclease D; InterPro: IPR006292 These proteins represent ribonuclease D, a 3-exonuclease shown to act on tRNA both in vitro and when overexpressed in vivo. Members of this family are restricted to the Proteobacteria; Aquifex, Mycobacteria and eukaryotes. Ribonuclease D is not essential in Escherichia coli but is deleterious when overexpressed. The precise biological role of ribonuclease D is still unknown. ; GO: 0004540 ribonuclease activity, 0008033 tRNA processing, 0005737 cytoplasm. Probab=99.11 E-value=1.9e-09 Score=80.67 Aligned_cols=157 Identities=22% Similarity=0.180 Sum_probs=103.5 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEE-CCC-CCHHHHHHHHHHCCCCCEEEEEEHHHHHHHH Q ss_conf 89998040678870004600002688299999998382289997-888-4221799999864889819986307759999 Q gi|254780127|r 3 KLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWD-RTE-QSAMPSDLLQYLRDETVMCVAHNSLFERILF 80 (675) Q Consensus 3 ~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d-~~~-~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL 80 (675) +|++|||=- ++.+ + =|..-+||++. ++....|| +.. .+. .=+..+|+|++|+||+|-+.=|+-|+ T Consensus 20 ~valDTEF~-----R~~T-F----wP~LgLiQ~a~-g~~~~LIDPL~~~~~~--~Pl~~lL~d~~v~KvlHAaseDlEvF 86 (374) T TIGR01388 20 FVALDTEFV-----RERT-F----WPQLGLIQVAD-GEQLALIDPLVIDIDL--SPLLELLADESVVKVLHAASEDLEVF 86 (374) T ss_pred EEECCCCEE-----CCCC-C----CHHHCCEECCC-CCEEEEECCCCCCCCH--HHHHHHHHCCCCEEEECCCCHHHHHH T ss_conf 374155411-----2653-2----02040031137-8756876588875234--79999973898158530450479999 Q ss_pred HHHCCCCCCCCCEEEHHHHHHHHCCCCCHHH--HH-HHHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 9862998786504545889998588999789--99-98088986527663765766410023320134531125876899 Q gi|254780127|r 81 KKTLGIDIPSKRWICTSVLARTNGLPSSLKN--AC-LALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQ 157 (675) Q Consensus 81 ~~~lGi~i~~~~~~DTmi~A~~l~~p~sL~~--la-~~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (675) .+..|. +| ..++||.|+|.++|.++|... +- +.+++.-++.+- .-.|..=| ..+ T Consensus 87 ~~l~g~-lP-~PLfDTQi~Aa~~G~g~s~gY~~LV~~~~~v~LdKse~--------------------rtDW~~RP-Ltd 143 (374) T TIGR01388 87 LNLFGV-LP-KPLFDTQIAAAFLGFGESMGYDKLVQEVLGVELDKSET--------------------RTDWLARP-LTD 143 (374) T ss_pred HHHHCC-CC-CCCHHHHHHHHHHCCCCCHHHHHHHHHHHCEECCCCCC--------------------CCHHHCCC-CCH T ss_conf 987057-98-88305889999836785210899999973703178887--------------------22210478-987 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 98988775399999999872002143101789999875 Q gi|254780127|r 158 LFGEYCKRDVEATRELFKRLIPLSDGERDLWLLDQTIN 195 (675) Q Consensus 158 ~~~~Ya~~Dv~~~~~L~~~L~~~~~~E~~l~~~~~~me 195 (675) ...+||+.||..-..+|.+|..-+..+.....+..+|+ T Consensus 144 ~Ql~YAaaDV~yL~~~y~~L~~~l~~~~R~~~l~ee~~ 181 (374) T TIGR01388 144 KQLEYAAADVTYLLPLYEKLTERLAEEGRLAWLEEECE 181 (374) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78999999999999999999999988645999972059 No 45 >PRK06859 consensus Probab=99.10 E-value=8.9e-09 Score=76.24 Aligned_cols=64 Identities=17% Similarity=0.271 Sum_probs=47.2 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCC---CCEEEHHHHHHHHC--CCCCHHHHHHHHCCCCC Q ss_conf 9999864889819986307759999986---2998786---50454588999858--89997899998088986 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKT---LGIDIPS---KRWICTSVLARTNG--LPSSLKNACLALNFSEH 121 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~---~~~~DTmi~A~~l~--~p~sL~~la~~lgl~~~ 121 (675) +++.++. +..+|+||+.||+.||... +|.+.+. .++.||+.+|+-+- .+.||+++|++|+++.. T Consensus 84 efl~Fi~--~a~LVaHNa~FD~~FLn~El~r~g~~~~~~~~~~viDTL~lAR~~~Pg~~~sLdaLc~r~~id~~ 155 (253) T PRK06859 84 EFIDYIK--GAELLIHNAPFDVGFMDYEFRKLNLNVKTDDICLVTDTLQMARQMYPGKRNNLDALCDRLGIDNS 155 (253) T ss_pred HHHHHHC--CCEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHCCCCCCCHHHHHHHCCCCCC T ss_conf 9999967--99799957188899999999974999653220011339999998787997799999997399876 No 46 >PRK07740 hypothetical protein; Provisional Probab=99.10 E-value=6e-09 Score=77.39 Aligned_cols=130 Identities=17% Similarity=0.151 Sum_probs=80.0 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEE-EEEECCCEEEE-E--ECCCCCHHHH---------------------- Q ss_conf 78999804067887000460000268829999-99983822899-9--7888422179---------------------- Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLC-AYAWDDEPVKL-W--DRTEQSAMPS---------------------- 55 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i-~~a~~~~~~~~-~--d~~~~~~~~~---------------------- 55 (675) .++.+|+||+ |+++...| +|+-| ++.+.++.+.- + -+-++.++|. T Consensus 58 ~fvV~D~ETT--------Gl~p~~gD-~IIeIgAVkv~~g~i~d~F~tLVnP~~~Ip~~i~~ltGIT~emV~~aP~~~eV 128 (240) T PRK07740 58 PFVVFDLETT--------GFSPDQGD-EILSIAAVKTVGGEVTDLYYSLVKPKKPIPEHILTLTGISAKDVVFAPPLAEV 128 (240) T ss_pred EEEEEEEECC--------CCCCCCCC-EEEEEEEEEEECCEEEEEEEEEECCCCCCCHHHHHHCCCCHHHHHCCCCHHHH T ss_conf 3899980589--------98988898-78999899999999968877550887989988998609899998379999999 Q ss_pred --HHHHHHCCCCCEEEEEEHHHHHHHHHH----HCCCCCCCCCEEEHHHHHHHHCC--CCCHHHHHHHHCCCCCCHHHHH Q ss_conf --999986488981998630775999998----62998786504545889998588--9997899998088986527663 Q gi|254780127|r 56 --DLLQYLRDETVMCVAHNSLFERILFKK----TLGIDIPSKRWICTSVLARTNGL--PSSLKNACLALNFSEHLTKMEE 127 (675) Q Consensus 56 --~l~~~L~~~~~~kv~HNa~FD~~vL~~----~lGi~i~~~~~~DTmi~A~~l~~--p~sL~~la~~lgl~~~~~kd~~ 127 (675) ++.+++ .+..+||||+.||+.||.+ +++..+. ..+.||+.+++.+.. .++|++++++||++... T Consensus 129 l~~fl~Fi--~~~vlVaHNa~FD~~FL~~~l~r~~~~~~~-~~~iDT~~Lar~l~~~~~~sL~~l~~~fgI~~~~----- 200 (240) T PRK07740 129 LRTFYPFI--NASTLVGYHIGHDLAFLRHALWRHYRQKFS-HRFIDTQFLTEIIAHQSFPTLDDALACYGITCPR----- 200 (240) T ss_pred HHHHHHHH--CCCEEEEECCHHHHHHHHHHHHHHCCCCCC-CCEEEHHHHHHHHCCCCCCCHHHHHHHCCCCCCC----- T ss_conf 99999983--899899968286399999999984599878-8763699999986589987899999976999999----- Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 76576641002332013453112587689998988775399999999872002 Q gi|254780127|r 128 GKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIPL 180 (675) Q Consensus 128 gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~ 180 (675) ..-|..|+..|..||.+|.+. T Consensus 201 --------------------------------rHrAl~DA~aTA~lf~~ll~~ 221 (240) T PRK07740 201 --------------------------------RHTADGDVEMTAKLWAILIIE 221 (240) T ss_pred --------------------------------CCCCHHHHHHHHHHHHHHHHH T ss_conf --------------------------------988299999999999999999 No 47 >cd06126 DEDDy DEDDy exonucleases, part of the DnaQ-like (or DEDD) exonuclease superfamily, catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. They contain four invariant acidic residues in three conserved sequence motifs termed ExoI, ExoII and ExoIII. DEDDy exonucleases are classified as such because of the presence of a specific YX(3)D pattern at ExoIII. The four conserved acidic residues serve as ligands for the two metal ions required for catalysis. DEDDy exonucleases include the proofreading domains of DNA polymerases (families A and B), as well as RNases such as RNase D and Rrp6p. The Egalitarian (Egl) and Bacillus-like DNA Polymerase I subfamilies do not possess a completely conserved YX(3)D pattern at the ExoIII motif. In addition, Bacillus-like DNA polymerases contain inactive 3'-5' exonuclease domains which do not possess the metal-binding residues necessary for activity. Probab=99.09 E-value=5.3e-09 Score=77.74 Aligned_cols=146 Identities=20% Similarity=0.146 Sum_probs=93.2 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCC----------CHHHHHHHHHHCCCCCEEEEEE Q ss_conf 899980406788700046000026882999999983822899978884----------2217999998648898199863 Q gi|254780127|r 3 KLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQ----------SAMPSDLLQYLRDETVMCVAHN 72 (675) Q Consensus 3 ~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~----------~~~~~~l~~~L~~~~~~kv~HN 72 (675) .+++|+||+..... .....|.+|+++..++...+++..+. ..++..+.+++++.+.++++|| T Consensus 1 vl~fDiE~~~~~~~--------~~~~pI~~i~~~~~~g~~~~~~~~~~~~~~~~~~~E~~ll~~f~~~i~~~~dii~GyN 72 (181) T cd06126 1 VVAFDIETTPLVGG--------GGKGPIILIQLAVSAGEKFLLDVLTLTLDAVDSADEKELLGELKAILEDYDDIKVGYN 72 (181) T ss_pred CEEEEEECCCCCCC--------CCCCCEEEEEEEECCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 92898623798999--------8999889999996599789998624346755678999999999999830899899979 Q ss_pred H-HHHHHHHHHH---CCCCC----------------CCCCEEEHHHHHHHHC---CCCCHHHHHHHH-CCCCCCHHHHHC Q ss_conf 0-7759999986---29987----------------8650454588999858---899978999980-889865276637 Q gi|254780127|r 73 S-LFERILFKKT---LGIDI----------------PSKRWICTSVLARTNG---LPSSLKNACLAL-NFSEHLTKMEEG 128 (675) Q Consensus 73 a-~FD~~vL~~~---lGi~i----------------~~~~~~DTmi~A~~l~---~p~sL~~la~~l-gl~~~~~kd~~g 128 (675) + .||++.|.+. +|++. +.....|++...+-.. ..++|+.+|+.+ |...... .+ T Consensus 73 ~~~FDlpyL~~R~~~l~~~~~~~~~~~~~~~~~~~~~gr~~~D~~~~~~~~~~~~~~~~L~~va~~~lg~~K~~~---~~ 149 (181) T cd06126 73 IDSFDLPYLLNRAGELGIPLRRGRDRSSGSREEINIKGRILFDLLLAAKRLGVALPSYSLDALAEQLLGEGKEKV---DQ 149 (181) T ss_pred CCCCCHHHHHHHHHHHCCCCHHCCCCCCCCCCEEEECCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCC---CH T ss_conf 977879999999999589701022666565504863485232199999984136465359999999749995669---99 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6576641002332013453112587689998988775399999999 Q gi|254780127|r 129 KALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELF 174 (675) Q Consensus 129 k~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~ 174 (675) .... ..+...+ ..+.+.+||..|+.+++.|| T Consensus 150 ~~i~--------------~~w~~~~-~~~~~~~Y~~~Da~~tl~l~ 180 (181) T cd06126 150 LAIK--------------QAWRARP-LLERLAEYAAADAYATLQIY 180 (181) T ss_pred HHHH--------------HHHHCCC-CHHHHHHHHHHHHHHHHHHH T ss_conf 9999--------------9886798-66999999999999999976 No 48 >PRK08517 DNA polymerase III subunit epsilon; Provisional Probab=99.08 E-value=4.5e-09 Score=78.17 Aligned_cols=87 Identities=22% Similarity=0.248 Sum_probs=61.8 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHH--CCCCCHHHHHHHHCCCCCCHHHHHCHH Q ss_conf 9999864889819986307759999986---29987865045458899985--889997899998088986527663765 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKT---LGIDIPSKRWICTSVLARTN--GLPSSLKNACLALNFSEHLTKMEEGKA 130 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l--~~p~sL~~la~~lgl~~~~~kd~~gk~ 130 (675) ++...+. +.+.||||+.||+.||... .|+......+.||+-+|+.+ +..++|+.+++.||++... T Consensus 142 ~f~~Fi~--d~ilVaHNa~FD~~FL~~~l~r~g~~~l~n~~lDTl~LAR~~~p~~~ykL~~La~~~~I~~~~-------- 211 (259) T PRK08517 142 DFRLFLG--DSVFVAHNVNFDYNFISRSLEEIGLGPLLNRKLCTIDLAKRTIESPRYGLSFLKELLGIETEV-------- 211 (259) T ss_pred HHHHHHC--CCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEEHHHHHHHCCCCCCCHHHHHHHCCCCCCC-------- T ss_conf 9999968--895999372760999999999839997689856608999987587787889999975989999-------- Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCH Q ss_conf 766410023320134531125876899989887753999999998720021 Q gi|254780127|r 131 LIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIPLS 181 (675) Q Consensus 131 li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~~ 181 (675) ..-|..|+.+|..+++.+..-+ T Consensus 212 -----------------------------~HRAl~DA~aTaevF~~~L~~l 233 (259) T PRK08517 212 -----------------------------HHRAYADALAAYEIFEICLLNL 233 (259) T ss_pred -----------------------------CCCCHHHHHHHHHHHHHHHHHC T ss_conf -----------------------------9784588999999999999856 No 49 >PRK06195 DNA polymerase III subunit epsilon; Validated Probab=99.08 E-value=2.8e-09 Score=79.58 Aligned_cols=62 Identities=26% Similarity=0.450 Sum_probs=48.8 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHHCCC Q ss_conf 9999864889819986307759999986---2998786504545889998588---99978999980889 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKT---LGIDIPSKRWICTSVLARTNGL---PSSLKNACLALNFS 119 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~lgl~ 119 (675) ++..+|. +..+|+||+.||+.||... +|++.+...|.||+-+|+.+.+ .++|..+++.||++ T Consensus 73 ~~~~fi~--~~vlVaHNa~FD~~fL~~~~~r~gl~~~~~~~~cTl~LAR~~~p~l~~~kL~~La~~~gi~ 140 (309) T PRK06195 73 KIKDYFN--DNLVIAHNASFDISVLRKTLELYNIPMPDFEYICTMKLAKNFYSNIPNARLNTVNNFLGYE 140 (309) T ss_pred HHHHHHC--CCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHHCCC T ss_conf 9999857--9979995628889999999998499999997774799999865577657888999985988 No 50 >cd05781 DNA_polB_B3_exo The 3'-5' exonuclease domain of archaeal proteins with similarity to Sulfurisphaera ohwakuensis DNA polymerase B3. B3 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B3 exhibits both polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Archaeal proteins that are involved in DNA replication are similar to those from eukaryotes. Some archaea possess multiple family-B DNA polymerases. B3 is mainly found in crenarch Probab=99.07 E-value=1e-08 Score=75.87 Aligned_cols=155 Identities=17% Similarity=0.234 Sum_probs=92.9 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCC--HHHHHHHHHHC--CCCCEEEEEEH-HH Q ss_conf 978999804067887000460000268829999999838228999788842--21799999864--88981998630-77 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQS--AMPSDLLQYLR--DETVMCVAHNS-LF 75 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~--~~~~~l~~~L~--~~~~~kv~HNa-~F 75 (675) .|.+++||||+|...... ++.-.|..||++..+++..++...... .+...+.+++. +|+++ +|||. .| T Consensus 3 lk~lsfDIE~~~~~gfp~------~~~d~Ii~Is~~~~~g~~~~~~~~~~~E~~ll~~f~~~i~~~~Pdii-~gyN~~~F 75 (188) T cd05781 3 LKTLAFDIEVYSKYGTPN------PRRDPIIVISLATSNGDVEFILAEGLDDRKIIREFVKYVKEYDPDII-VGYNSNAF 75 (188) T ss_pred CEEEEEEEEECCCCCCCC------CCCCCEEEEEEEECCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCEE-EECCCCCC T ss_conf 419999988768999899------88897899999968997699984799989999999999987299999-96698675 Q ss_pred HHHHHHH---HCCCCCC--------------------CCCEEEHHHHHH-HHC-CCCCHHHHHHHHCCCCCCHH-HHHCH Q ss_conf 5999998---6299878--------------------650454588999-858-89997899998088986527-66376 Q gi|254780127|r 76 ERILFKK---TLGIDIP--------------------SKRWICTSVLAR-TNG-LPSSLKNACLALNFSEHLTK-MEEGK 129 (675) Q Consensus 76 D~~vL~~---~lGi~i~--------------------~~~~~DTmi~A~-~l~-~p~sL~~la~~lgl~~~~~k-d~~gk 129 (675) |++.|.. .+|+++. .....|.+-..+ -.. ..++|+.+|+.||......+ +...+ T Consensus 76 D~pyl~~Ra~~~~i~l~~gr~~~~~~~~~~~~~~~i~Gr~~iDl~~~~~~~~~l~sysL~~Va~~lg~~~k~e~~~~~~~ 155 (188) T cd05781 76 DWPYLVERARVLGVKLDVGRRGGSEPSTGVYGHYSITGRLNVDLYDFAEEIPEVKVKTLENVAEYLGVMKKSERVLIEWY 155 (188) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECHHHHHHHCCCCCCCHHHHHHHHCCCCCCCCCCCCHH T ss_conf 78999999999699850155677651225420699878999977899976247564579999998541224544657999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 576641002332013453112587689998988775399999999872 Q gi|254780127|r 130 ALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRL 177 (675) Q Consensus 130 ~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L 177 (675) ..-+ .+ ..+...+.+.+||.+|+.++++|.++| T Consensus 156 ~I~~--------------~~-~~~~~r~~l~~YnlqDa~L~~~L~~KL 188 (188) T cd05781 156 RIYE--------------YW-DDEKKRDILLKYNRDDARSTYGLAEKL 188 (188) T ss_pred HHHH--------------HH-HCCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999--------------87-084121699999788799999998429 No 51 >TIGR01406 dnaQ_proteo DNA polymerase III, epsilon subunit; InterPro: IPR006309 These represent DnaQ, the DNA polymerase III epsilon subunit, as found in most Proteobacteria. It consists largely of an exonuclease region as described in IPR006055 from INTERPRO. In Gram-positive bacteria, closely related regions are found both in the Gram-positive type DNA polymerase III alpha subunit and as an additional N-terminal domain of a DinG-family helicase. Both are not included in this set of sequences, as they are smaller proteins.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=99.07 E-value=1.7e-09 Score=81.06 Aligned_cols=67 Identities=25% Similarity=0.377 Sum_probs=49.8 Q ss_pred HHHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CC---CC-C-CCCCEEEHHHHHHHHC--CCCCHHHHHHHHCCCCC Q ss_conf 1799999864889819986307759999986---29---98-7-8650454588999858--89997899998088986 Q gi|254780127|r 53 MPSDLLQYLRDETVMCVAHNSLFERILFKKT---LG---ID-I-PSKRWICTSVLARTNG--LPSSLKNACLALNFSEH 121 (675) Q Consensus 53 ~~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lG---i~-i-~~~~~~DTmi~A~~l~--~p~sL~~la~~lgl~~~ 121 (675) +.++++.+|.+. .+|.|||+||+.||... +| +. + +..+|.||+.||+-+= ...|||+||+|||++.. T Consensus 79 Ia~~Fl~Fi~~~--~LvIHNA~FD~gFlN~EL~~Lg~~~~~~~~~~~~~~DTL~mAR~~FPG~~~sLDALCkRf~vDns 155 (246) T TIGR01406 79 IADEFLDFIKGS--ELVIHNAAFDVGFLNYELERLGLKEIKKISDFCRVIDTLAMARERFPGKRNSLDALCKRFKVDNS 155 (246) T ss_pred HHHHHHHHHCCC--EEEEECCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCC T ss_conf 899999984598--89997570216789999987287321000136678656899972378986646787743042500 No 52 >PRK07246 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=99.06 E-value=7.5e-09 Score=76.73 Aligned_cols=106 Identities=19% Similarity=0.230 Sum_probs=70.0 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEE-EECCCEEE-EEE--CCCCCHHHHH--------------------- Q ss_conf 7899980406788700046000026882999999-98382289-997--8884221799--------------------- Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAY-AWDDEPVK-LWD--RTEQSAMPSD--------------------- 56 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~-a~~~~~~~-~~d--~~~~~~~~~~--------------------- 56 (675) +++.+|+||++ ++. .+ +|.-|+. .+.++.+. .+. +-++.++|.. T Consensus 8 ~~~vvDlETTG--------~~~--~~-~Iiqi~~v~~~~~~i~~~f~t~vnP~~~ip~~I~~LTgI~~~~v~~ap~f~~v 76 (820) T PRK07246 8 KYAVVDLEATG--------AGP--NA-SIIQVGIVIIQGNKIIDSYETDVNPHESLDEHIVHLTGITDKQLAQAPDFSQV 76 (820) T ss_pred CEEEEEEECCC--------CCC--CC-CEEEEEEEEEECCEEEEEEEECCCCCCCCCHHHHHHCCCCHHHHHCCCCHHHH T ss_conf 18999988789--------899--98-77999999999999989987311889979978886469898998529998999 Q ss_pred ---HHHHHCCCCCEEEEEEHHHHHHHHHHHC---CCCCCCCCEEEHHHHHHHHC---CCCCHHHHHHHHCCCCC Q ss_conf ---9998648898199863077599999862---99878650454588999858---89997899998088986 Q gi|254780127|r 57 ---LLQYLRDETVMCVAHNSLFERILFKKTL---GIDIPSKRWICTSVLARTNG---LPSSLKNACLALNFSEH 121 (675) Q Consensus 57 ---l~~~L~~~~~~kv~HNa~FD~~vL~~~l---Gi~i~~~~~~DTmi~A~~l~---~p~sL~~la~~lgl~~~ 121 (675) +...| .+...||||+.||+.||..++ |+++. ..+.||+-+|+.+- .+++|.++++.||++.+ T Consensus 77 ~~~~~~~l--~~~~~VaHNv~FD~~fl~~~~~~~g~~~~-~~~~DTv~La~i~~P~~~~y~L~~L~~~l~~~~~ 147 (820) T PRK07246 77 AHHIYQLI--EDCIFVAHNVKFDANLLAEALFLEGYELR-TPRVDTVELAQVFFPRLEKYSLSHLSRQLNIDLA 147 (820) T ss_pred HHHHHHHH--CCCEEEEECCCCCHHHHHHHHHHCCCCCC-CCCCCHHHHHHHHCCCCCCCCHHHHHHHCCCCCC T ss_conf 99999996--79859983720169999999997589889-9826099999997798788888999997399999 No 53 >PRK06310 DNA polymerase III subunit epsilon; Validated Probab=99.04 E-value=1.2e-08 Score=75.40 Aligned_cols=85 Identities=21% Similarity=0.273 Sum_probs=58.9 Q ss_pred HHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCC--CCCEEEHHHHHHHHC--CCCCHHHHHHHHCCCCCCHHHHHCH Q ss_conf 999864889819986307759999986---299878--650454588999858--8999789999808898652766376 Q gi|254780127|r 57 LLQYLRDETVMCVAHNSLFERILFKKT---LGIDIP--SKRWICTSVLARTNG--LPSSLKNACLALNFSEHLTKMEEGK 129 (675) Q Consensus 57 l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~--~~~~~DTmi~A~~l~--~p~sL~~la~~lgl~~~~~kd~~gk 129 (675) +.+.+.+.+ +.||||+.||+.+|... .|++.. ...|.||+-+|+.+. ..++|+.+++.+|++... T Consensus 81 ~~~fi~~~~-ilVaHNa~FD~~fL~~e~~r~g~~~~~~~~~~IDTl~lAr~~~~~~~~~L~~L~~~~~i~~~~------- 152 (250) T PRK06310 81 IKGFFKEGD-YIVGHSVGFDLQVLAQEAERIGETFLSKHYYIIDTLRLAKEYGDSPNNSLEALAVHFNVPYDG------- 152 (250) T ss_pred HHHHHCCCC-EEEEECHHHHHHHHHHHHHHCCCCCCCCCCCEECHHHHHHHCCCCCCCHHHHHHHHCCCCCCC------- T ss_conf 999967999-999938455799999999976998665679578499999756765456099999970999986------- Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 57664100233201345311258768999898877539999999987200 Q gi|254780127|r 130 ALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 130 ~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) ..-|..|+.+|..+++.|.. T Consensus 153 ------------------------------aHrAl~Da~aTa~vf~~l~~ 172 (250) T PRK06310 153 ------------------------------NHRAMKDVEININVFKHLCK 172 (250) T ss_pred ------------------------------CCCHHHHHHHHHHHHHHHHH T ss_conf ------------------------------66819999999999999998 No 54 >cd06131 DNA_pol_III_epsilon_Ecoli_like This subfamily is composed of the epsilon subunit of Escherichia coli DNA polymerase III (polIII) and similar proteins. PolIII is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. It is a holoenzyme complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The polIII epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon subunit of polIII also functions as a stabilizer of the holoenzyme complex. Probab=99.02 E-value=1.8e-08 Score=74.26 Aligned_cols=79 Identities=25% Similarity=0.255 Sum_probs=55.0 Q ss_pred CCCEEEEEEHHHHHHHHHHH---CCCCCC---CCCEEEHHHHHHHHC--CCCCHHHHHHHHCCCCCCHHHHHCHHHHHHH Q ss_conf 89819986307759999986---299878---650454588999858--8999789999808898652766376576641 Q gi|254780127|r 64 ETVMCVAHNSLFERILFKKT---LGIDIP---SKRWICTSVLARTNG--LPSSLKNACLALNFSEHLTKMEEGKALIARF 135 (675) Q Consensus 64 ~~~~kv~HNa~FD~~vL~~~---lGi~i~---~~~~~DTmi~A~~l~--~p~sL~~la~~lgl~~~~~kd~~gk~li~~~ 135 (675) .+...|+||+.||+.||... .|+..+ ...|.|||.+++.+- .+++|+.++++||++.... T Consensus 80 ~~~~lVaHN~~FD~~fL~~~~~r~~~~~~~~~~~~~~Dtl~l~r~~~p~~~~~L~~l~~~~gi~~~~~------------ 147 (167) T cd06131 80 RGAELVIHNASFDVGFLNAELSLLGLGKKIIDFCRVIDTLALARKKFPGKPNSLDALCKRFGIDNSHR------------ 147 (167) T ss_pred CCCEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHCCCCCCCC------------ T ss_conf 69999983718789999999998189977776554113999999984999888999998769799999------------ Q ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 002332013453112587689998988775399999999872 Q gi|254780127|r 136 CKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRL 177 (675) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L 177 (675) . ..-|..|+.+|..+|.+| T Consensus 148 --------------~---------~H~Al~Da~~ta~vf~~l 166 (167) T cd06131 148 --------------T---------LHGALLDAELLAEVYLEL 166 (167) T ss_pred --------------C---------CCCHHHHHHHHHHHHHHH T ss_conf --------------8---------758899999999999976 No 55 >PRK06309 DNA polymerase III subunit epsilon; Validated Probab=99.01 E-value=1.8e-08 Score=74.29 Aligned_cols=85 Identities=20% Similarity=0.201 Sum_probs=60.3 Q ss_pred HHHHHCCCCCEEEEEEH-HHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHC---CCCCHHHHHHHHCCCCCCHHHHHCH Q ss_conf 99986488981998630-7759999986---299878650454588999858---8999789999808898652766376 Q gi|254780127|r 57 LLQYLRDETVMCVAHNS-LFERILFKKT---LGIDIPSKRWICTSVLARTNG---LPSSLKNACLALNFSEHLTKMEEGK 129 (675) Q Consensus 57 l~~~L~~~~~~kv~HNa-~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l~---~p~sL~~la~~lgl~~~~~kd~~gk 129 (675) +.+++. ++.+.||||+ .||+.||... .|+..+..++.||+.+|+.+- ..++|+.+++.+|+.... T Consensus 73 ~~~f~~-~~~ilVaHN~~~FD~~fL~~e~~r~g~~~~~~~~iDTl~lar~l~P~l~~h~L~~L~~~~gi~~~~------- 144 (232) T PRK06309 73 FREFCG-TDNILVAHNNDGFDFPLLVKECRRHSLEPLTLRTIDSLKWAQKYRPDLPKHNLQYLRQVYGFAENQ------- 144 (232) T ss_pred HHHHHC-CCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCC------- T ss_conf 999857-998899848965679999999998599867740537999999975897768899999980999988------- Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 57664100233201345311258768999898877539999999987200 Q gi|254780127|r 130 ALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 130 ~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) ..-|..|+.+|..++..|.. T Consensus 145 ------------------------------aHrAl~Da~at~~vf~~ll~ 164 (232) T PRK06309 145 ------------------------------AHRALDDVITLHRVFSALVG 164 (232) T ss_pred ------------------------------CCCHHHHHHHHHHHHHHHHH T ss_conf ------------------------------88859999999999999872 No 56 >PRK08858 consensus Probab=99.01 E-value=3.6e-08 Score=72.21 Aligned_cols=65 Identities=17% Similarity=0.221 Sum_probs=46.8 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHHC---CCCCCC----CCEEEHHHHHHHHC--CCCCHHHHHHHHCCCCC Q ss_conf 999998648898199863077599999862---998786----50454588999858--89997899998088986 Q gi|254780127|r 55 SDLLQYLRDETVMCVAHNSLFERILFKKTL---GIDIPS----KRWICTSVLARTNG--LPSSLKNACLALNFSEH 121 (675) Q Consensus 55 ~~l~~~L~~~~~~kv~HNa~FD~~vL~~~l---Gi~i~~----~~~~DTmi~A~~l~--~p~sL~~la~~lgl~~~ 121 (675) .++.+++. +..+|+||+.||+.||...+ |..+.. .++.||+.+|+.+- .+.||+++|++|+++.. T Consensus 89 ~efl~FI~--d~~LVaHNa~FD~~FLn~El~r~~~~~~~~~~~~~viDTL~lAR~~~Pg~~~sLdaLc~r~~Id~s 162 (246) T PRK08858 89 QEFLEFIK--GAELVAHNAPFDVGFMDYEFEKLNPAIGKTDDYCKVTDTLAMAKKIFPGKRNNLDVLCERYGIDNS 162 (246) T ss_pred HHHHHHHC--CCEEEEECCHHHHHHHHHHHHHHCCCCCCHHHHCCCEEHHHHHHHHCCCCCCCHHHHHHHCCCCCC T ss_conf 99999958--998999722888999999999828776751342010029999998689998888999998198988 No 57 >PRK07883 hypothetical protein; Validated Probab=99.00 E-value=2.8e-08 Score=72.99 Aligned_cols=63 Identities=22% Similarity=0.295 Sum_probs=48.5 Q ss_pred HHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC-----CCCHHHHHHHHCCCCC Q ss_conf 999864889819986307759999986---2998786504545889998588-----9997899998088986 Q gi|254780127|r 57 LLQYLRDETVMCVAHNSLFERILFKKT---LGIDIPSKRWICTSVLARTNGL-----PSSLKNACLALNFSEH 121 (675) Q Consensus 57 l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l~~-----p~sL~~la~~lgl~~~ 121 (675) +.+++ .+.++|+||+.||+.||... .|++.+...+.||+-+|+.+-+ .++|..+++.||+... T Consensus 98 f~~F~--~~~vlVAHNa~FD~~FL~~~~~r~g~~~~~~~~ldTl~LARrl~p~~e~~~~kL~tLa~~~g~~~~ 168 (575) T PRK07883 98 FLEFA--RGSVLVAHNAPFDIGFLRAAAERCGYPWPQPEVLCTVRLARRVLSRDEAPNVRLSALARLFGAATT 168 (575) T ss_pred HHHHH--CCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCC T ss_conf 99996--899899908345299999999975999878978648999998669211468998999997599999 No 58 >PRK07942 DNA polymerase III subunit epsilon; Provisional Probab=98.98 E-value=3.1e-07 Score=66.04 Aligned_cols=79 Identities=15% Similarity=0.134 Sum_probs=56.3 Q ss_pred CCCEEEEEEHHHHHHHHHHH---CCCC-CCCCCEEEHHHHHHHHCC----CCCHHHHHHHHCCCCCCHHHHHCHHHHHHH Q ss_conf 89819986307759999986---2998-786504545889998588----999789999808898652766376576641 Q gi|254780127|r 64 ETVMCVAHNSLFERILFKKT---LGID-IPSKRWICTSVLARTNGL----PSSLKNACLALNFSEHLTKMEEGKALIARF 135 (675) Q Consensus 64 ~~~~kv~HNa~FD~~vL~~~---lGi~-i~~~~~~DTmi~A~~l~~----p~sL~~la~~lgl~~~~~kd~~gk~li~~~ 135 (675) .+..+|+||+.||+.||.+. .|++ +.+..+.||+++++.++. +++|+.+|+.||++.+.. T Consensus 92 ~g~~lVahNA~FDl~fL~~El~R~g~~~l~~~~viDtl~l~r~l~~~r~gkr~L~aL~~~ygv~l~~a------------ 159 (234) T PRK07942 92 AGVPVVVYNAPYDLTVLDRELRRHGLPSLVPGPVIDPYVIDKHVDRYRKGKRTLTALCEHYGVRLDNA------------ 159 (234) T ss_pred CCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCCEEEHHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCC------------ T ss_conf 79879996707649999999998599967777556199988775147778888999999849988775------------ Q ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 00233201345311258768999898877539999999987200 Q gi|254780127|r 136 CKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) .-|..|+..+.+|...|.. T Consensus 160 -------------------------H~A~aDA~Ata~l~~~l~~ 178 (234) T PRK07942 160 -------------------------HDATADALAAARLAWALAR 178 (234) T ss_pred -------------------------CCCHHHHHHHHHHHHHHHH T ss_conf -------------------------5641528999999999998 No 59 >cd05785 DNA_polB_like2_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=98.94 E-value=7.1e-08 Score=70.25 Aligned_cols=154 Identities=18% Similarity=0.180 Sum_probs=89.2 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECC--CCCHHHHHHHHHHC--CCCCEEEEEEH-HH Q ss_conf 978999804067887000460000268829999999838228999788--84221799999864--88981998630-77 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRT--EQSAMPSDLLQYLR--DETVMCVAHNS-LF 75 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~--~~~~~~~~l~~~L~--~~~~~kv~HNa-~F 75 (675) +|++++||||+|.....-.. +-.+.-.|..|++..+++...++... .-..+...+..++. +|+++ +|+|. .| T Consensus 9 lr~lsfDIE~~~~~g~~~p~--p~~~~d~Ii~I~~~~~~~~~~v~~~~~~~E~~lL~~F~~~i~~~dPDII-tGyNi~~F 85 (207) T cd05785 9 LRRLQLDIETYSLPGFFFSN--PDRGDDRIIIVALRDNRGWEEVLHAEDAAEKELLEELVAIIRERDPDVI-EGHNIFRF 85 (207) T ss_pred EEEEEEEEEECCCCCCCCCC--CCCCCCCEEEEEEEECCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCEE-EECCCCCC T ss_conf 05999999747889977999--8889997899998858997798762699999999999999987399999-86798775 Q ss_pred HHHHHHH---HCCCCCC------------------------------CCCEEEHHHHHHHH-----C-CCCCHHHHHHHH Q ss_conf 5999998---6299878------------------------------65045458899985-----8-899978999980 Q gi|254780127|r 76 ERILFKK---TLGIDIP------------------------------SKRWICTSVLARTN-----G-LPSSLKNACLAL 116 (675) Q Consensus 76 D~~vL~~---~lGi~i~------------------------------~~~~~DTmi~A~~l-----~-~p~sL~~la~~l 116 (675) |++.|.. .+|+++. .....|.+-+.+.. . .+++|+.+|+.+ T Consensus 86 D~pYL~~Ra~~~~i~~~lgR~~~~~~~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~k~~~~~~~~L~SY~L~~Va~~~ 165 (207) T cd05785 86 DLPYLRRRCRRHGVPLAIGRDGSIPRQRPSRFRFAERLIDYPRYDIPGRHVIDTYFLVQLFDVSSRDLPSYGLKAVAKHF 165 (207) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCEEECCEEEECCCCCCEEEEEECCEEEEEHHHHHHHHCCCCCCCCCCCHHHHHHHH T ss_conf 88999999999599722257898744640302122355530499873648865499998642111467765289999987 Q ss_pred CCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8898652766376576641002332013453112587689998988775399999999 Q gi|254780127|r 117 NFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELF 174 (675) Q Consensus 117 gl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~ 174 (675) ++......+..+..+... + ...++.+..||.+||++|.+|. T Consensus 166 ~l~~~~k~~~~~~~i~~~--------------~---~~d~~~l~~Yni~Dv~~t~~La 206 (207) T cd05785 166 GLASPDRTYIDGRQIAEV--------------W---RSDPARLLAYALDDVRETEGLA 206 (207) T ss_pred HHCCCCCCCCCHHHHHHH--------------H---HCCHHHHHHHHHHHHHHHHHHC T ss_conf 313666563889999999--------------9---5298999998299999999860 No 60 >cd06130 DNA_pol_III_epsilon_like This subfamily is composed of uncharacterized bacterial proteins with similarity to the epsilon subunit of DNA polymerase III (polIII), a multisubunit polymerase which is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. The pol III holoenzyme is a complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The polIII epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon subunit of polIII also functions as a stabilizer of the holoenzyme complex Probab=98.92 E-value=4.9e-08 Score=71.36 Aligned_cols=63 Identities=25% Similarity=0.445 Sum_probs=48.0 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHC---CCCCHHHHHHHHCCCC Q ss_conf 9999864889819986307759999986---299878650454588999858---8999789999808898 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKT---LGIDIPSKRWICTSVLARTNG---LPSSLKNACLALNFSE 120 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l~---~p~sL~~la~~lgl~~ 120 (675) ++.+++ .+..+|+||+.||+.+|.+. .|++.+...|.|||-+++.+. .+++|+.++++||++. T Consensus 70 ~l~~~l--~~~~lVaHn~~FD~~fL~~~~~~~~~~~~~~~~iDtl~l~r~~~~~~~~~~L~~l~~~~gi~~ 138 (156) T cd06130 70 EIKPFL--GGSLVVAHNASFDRSVLRAALEAYGLPPPPYQYLCTVRLARRVWPLLPNHKLNTVAEHLGIEL 138 (156) T ss_pred HHHHHC--CCCEEEEECHHHHHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHHCCCCC T ss_conf 999855--898899957788999999999981999989858818999999738888899999999859998 No 61 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=98.92 E-value=2.4e-08 Score=73.43 Aligned_cols=65 Identities=22% Similarity=0.246 Sum_probs=47.7 Q ss_pred HHHHHHCCCCCEEEEEE-HHHHHHHHHHHC---CCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHHCCCCCC Q ss_conf 99998648898199863-077599999862---998786504545889998588---99978999980889865 Q gi|254780127|r 56 DLLQYLRDETVMCVAHN-SLFERILFKKTL---GIDIPSKRWICTSVLARTNGL---PSSLKNACLALNFSEHL 122 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HN-a~FD~~vL~~~l---Gi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~lgl~~~~ 122 (675) ++...+ .+.+.|||| |.||+.||...+ ++.--...++||+-+|+.|++ +|+|..+|+.|+++-.. T Consensus 278 k~~~f~--~d~ilVAHNGasFD~~Fl~~~~~k~~~~~~~~p~IDTL~Lar~lnP~y~sh~Lg~~~~Kl~v~~~~ 349 (1264) T TIGR01405 278 KFKEFL--KDSILVAHNGASFDIGFLNTNFEKVGLEPLENPVIDTLELARALNPEYKSHRLGEIAKKLGVDLDD 349 (1264) T ss_pred HHHHHH--CCEEEEECCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHCCCCCCC T ss_conf 999976--975888627610366889889876277423365113889999837311344542233320552153 No 62 >cd06134 RNaseT RNase T is a DEDDh-type DnaQ-like 3'-5' exoribonuclease E implicated in the 3' maturation of small stable RNAs and 23srRNA, and in the end turnover of tRNA. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. RNase T is related to the proofreading domain of DNA polymerase III. Despite its important role, RNase T is mainly found only in gammaproteobacteria. It is speculated that it might have originated from DNA polymerase III at the time the gamma division of proteobacteria diverged from other bacteria. RNase T is a homodimer with the catalytic residues of one monomer contacting a large basic patch on the other monomer to form a functional active site. Probab=98.92 E-value=4.5e-08 Score=71.56 Aligned_cols=79 Identities=22% Similarity=0.201 Sum_probs=55.6 Q ss_pred CEEEEEEHHHHHHHHHHH---CCCCCC---CCCEEEHHHHHHHHCCCCCHHHHHHHHCCCCCCHHHHHCHHHHHHHHHHH Q ss_conf 819986307759999986---299878---65045458899985889997899998088986527663765766410023 Q gi|254780127|r 66 VMCVAHNSLFERILFKKT---LGIDIP---SKRWICTSVLARTNGLPSSLKNACLALNFSEHLTKMEEGKALIARFCKGS 139 (675) Q Consensus 66 ~~kv~HNa~FD~~vL~~~---lGi~i~---~~~~~DTmi~A~~l~~p~sL~~la~~lgl~~~~~kd~~gk~li~~~~~~~ 139 (675) .+.||||+.||+.||... .|+.-. +..+.|||-+|+.+-...+|+.+++.+|++.... T Consensus 103 avlVaHNa~FD~~FL~~~~~r~g~~~~p~~~~~~~DTl~La~~~~~~~~L~~l~~~~gi~~~~~---------------- 166 (189) T cd06134 103 AILVGHNAHFDLGFLNAAVARCKIKRNPFHPFSTFDTATLAGLAYGQTVLAKACQAAGIEFDNK---------------- 166 (189) T ss_pred CEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHHCCCCHHHHHHHCCCCCCCC---------------- T ss_conf 5687517777299999999984998888877765669999867738001999999879999998---------------- Q ss_pred HHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 3201345311258768999898877539999999987200 Q gi|254780127|r 140 IDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) . ..-|..|+.+|..|+.+|.+ T Consensus 167 ----------~---------~HrAl~DA~~ta~lf~~l~~ 187 (189) T cd06134 167 ----------E---------AHSALYDTQKTAELFCKIVN 187 (189) T ss_pred ----------C---------CCCCHHHHHHHHHHHHHHHH T ss_conf ----------7---------88729999999999999996 No 63 >PRK09145 DNA polymerase III subunit epsilon; Validated Probab=98.90 E-value=1.5e-07 Score=68.12 Aligned_cols=128 Identities=18% Similarity=0.125 Sum_probs=74.1 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEE-EECCCEEE-----EEECCC---------------------CCHHH Q ss_conf 7899980406788700046000026882999999-98382289-----997888---------------------42217 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAY-AWDDEPVK-----LWDRTE---------------------QSAMP 54 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~-a~~~~~~~-----~~d~~~---------------------~~~~~ 54 (675) ++++||+||+ |+++.. -+|+-|+. .+.++.+. ..-+-+ +.++. T Consensus 30 ~fVv~D~ETT--------GL~~~~--D~IieIgav~v~~~~i~~~~~f~~lV~P~~~i~~~~~~IhGIt~~~l~~ap~~~ 99 (203) T PRK09145 30 EWVALDCETT--------GLDPRR--AEIVSIAAVKIRGNRILTSERLELLVRPPQSLSAESIKIHRLRHQDLEHGLSEE 99 (203) T ss_pred CEEEEEEECC--------CCCCCC--CEEEEEEEEEEECCEEECCCEEEEEECCCCCCCHHHHCCCCCCHHHHCCCCCHH T ss_conf 8999985689--------999999--706999999999999931752899989799598645201387879981399999 Q ss_pred ---HHHHHHHCCCCCEEEEEEHHHHHHHHHH----HCCCCCCCCCEEEHHHHH--HH---H-C--CCCCHHHHHHHHCCC Q ss_conf ---9999986488981998630775999998----629987865045458899--98---5-8--899978999980889 Q gi|254780127|r 55 ---SDLLQYLRDETVMCVAHNSLFERILFKK----TLGIDIPSKRWICTSVLA--RT---N-G--LPSSLKNACLALNFS 119 (675) Q Consensus 55 ---~~l~~~L~~~~~~kv~HNa~FD~~vL~~----~lGi~i~~~~~~DTmi~A--~~---l-~--~p~sL~~la~~lgl~ 119 (675) .++.+++. +..+||||+.||+.||.+ ++|+.++ ..+.||+-+. +. . . ..++|+.+++.||++ T Consensus 100 evl~~f~~fi~--~~~lVgHNa~FD~~fL~~~~~r~~g~~l~-~~~id~~~l~~~~~~~~~~~~~~~~~L~~l~~~~gi~ 176 (203) T PRK09145 100 EALRQLLAFIG--NRPLVGYYLEFDVAMLNRYVRPLLGIGLP-NPLIEVSSLYHDKKERHLPDAYIDLRFDAILKHLDLP 176 (203) T ss_pred HHHHHHHHHCC--CCEEEEECHHHHHHHHHHHHHHHCCCCCC-CCEEEHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCC T ss_conf 99999998829--99899987899999999999996299999-9877609879999996676987788999999857987 Q ss_pred CCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 865276637657664100233201345311258768999898877539999999987200 Q gi|254780127|r 120 EHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 120 ~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) ... ..-|..|+.+|..||.+|.+ T Consensus 177 ~~~-------------------------------------~HrAL~DA~aTa~lfl~L~k 199 (203) T PRK09145 177 VLG-------------------------------------RHDALNDAIMTALIYLRLRK 199 (203) T ss_pred CCC-------------------------------------CCCHHHHHHHHHHHHHHHHC T ss_conf 899-------------------------------------97989999999999999867 No 64 >PRK06063 DNA polymerase III subunit epsilon; Provisional Probab=98.90 E-value=4e-08 Score=71.95 Aligned_cols=66 Identities=26% Similarity=0.386 Sum_probs=50.6 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHHCCCCC Q ss_conf 799999864889819986307759999986---2998786504545889998588---9997899998088986 Q gi|254780127|r 54 PSDLLQYLRDETVMCVAHNSLFERILFKKT---LGIDIPSKRWICTSVLARTNGL---PSSLKNACLALNFSEH 121 (675) Q Consensus 54 ~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~lgl~~~ 121 (675) ..++.++|. +.++|+||+.||+.||... .|++.+...+.||+.+|+.+.+ .+.|..+|+.||+... T Consensus 85 ~~~l~~~l~--g~vlVAHNa~FD~~FL~~e~~r~g~~~p~~~~lcTl~lARr~~~~l~~~kL~tla~~~Gi~~~ 156 (313) T PRK06063 85 AGELAEVLR--GRTLVAHNVAFDYSFLAAEAELAGAELPVDQVMCTVELARRLDLGLPNLRLETLAAHWGVVQQ 156 (313) T ss_pred HHHHHHHHC--CCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHCCCCCC T ss_conf 999999978--997999685652999999999739988999667689988654489999747889998099988 No 65 >TIGR01407 dinG_rel DnaQ family exonuclease/DinG family helicase, putative; InterPro: IPR006310 These are a family of proteins in Gram-positive bacteria. The N-terminal region of about 200 amino acids resembles the epsilon subunit of E. coli DNA polymerase III and the homologous region of the Gram-positive type DNA polymerase III alpha subunit. The epsilon subunit contains an exonuclease domain. The remainder of this protein family resembles a predicted ATP-dependent helicase, the DNA damage-inducible protein DinG of Escherichia coli. . Probab=98.80 E-value=3.7e-07 Score=65.52 Aligned_cols=108 Identities=21% Similarity=0.214 Sum_probs=71.7 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCC-CEEEEEEEE-ECCCEEEEE---ECCCCCHHHH--------------------- Q ss_conf 78999804067887000460000268-829999999-838228999---7888422179--------------------- Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQ-AVITLCAYA-WDDEPVKLW---DRTEQSAMPS--------------------- 55 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~-~~I~~i~~a-~~~~~~~~~---d~~~~~~~~~--------------------- 55 (675) .+..+|+||++.... + .+|.-||+- +.||++.-- ++-+..++|. T Consensus 1 rYaVVDLEtTg~~~~----------~~d~IIqiGiV~v~DgeIv~~f~T~VNP~~~~pp~I~eLTGisd~~l~~AP~F~q 70 (944) T TIGR01407 1 RYAVVDLETTGTDSS----------EFDKIIQIGIVLVEDGEIVDTFATDVNPNEPIPPFIQELTGISDEQLKQAPYFSQ 70 (944) T ss_pred CEEEEEEEECCCCCC----------CCCCEEEEEEEEEECCCEEEECCCCCCCCCCCCCHHHHHCCCCHHHHCCCCCHHH T ss_conf 968998741688864----------5576689988998558143201267788987761145422877686404888879 Q ss_pred ---HHHHHHCCCCCEEEEEEHHHHHHHHHHHC---CCCCCCCCEEEHHHHHHHHC---CCCCHHHHHHHHCCCCC Q ss_conf ---99998648898199863077599999862---99878650454588999858---89997899998088986 Q gi|254780127|r 56 ---DLLQYLRDETVMCVAHNSLFERILFKKTL---GIDIPSKRWICTSVLARTNG---LPSSLKNACLALNFSEH 121 (675) Q Consensus 56 ---~l~~~L~~~~~~kv~HNa~FD~~vL~~~l---Gi~i~~~~~~DTmi~A~~l~---~p~sL~~la~~lgl~~~ 121 (675) ++.+.|+ +.+.||||+.||+-+|++.+ |++..+...+||.=+|+.+= .+++|..+++.||+..+ T Consensus 71 VA~~i~~~l~--D~iFVAHNV~FD~nfL~k~l~~~G~~~~~~~~iDTVELa~if~PT~esY~Ls~L~E~LGl~h~ 143 (944) T TIGR01407 71 VAQEIYDLLE--DGIFVAHNVHFDLNFLAKELVDEGYEPLEVPRIDTVELARIFFPTEESYKLSELSEELGLEHE 143 (944) T ss_pred HHHHHHHHHC--CCEEEEECHHCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHCCCCC T ss_conf 9999998742--972776101506789999998568974343520056799874674435561489997198714 No 66 >PRK05168 ribonuclease T; Provisional Probab=98.78 E-value=4.1e-07 Score=65.25 Aligned_cols=79 Identities=24% Similarity=0.233 Sum_probs=55.0 Q ss_pred CEEEEEEHHHHHHHHHHH---CCCC---CCCCCEEEHHHHHHHHCCCCCHHHHHHHHCCCCCCHHHHHCHHHHHHHHHHH Q ss_conf 819986307759999986---2998---7865045458899985889997899998088986527663765766410023 Q gi|254780127|r 66 VMCVAHNSLFERILFKKT---LGID---IPSKRWICTSVLARTNGLPSSLKNACLALNFSEHLTKMEEGKALIARFCKGS 139 (675) Q Consensus 66 ~~kv~HNa~FD~~vL~~~---lGi~---i~~~~~~DTmi~A~~l~~p~sL~~la~~lgl~~~~~kd~~gk~li~~~~~~~ 139 (675) .++|+||+.||+.||... .|+. +.+..++||+-+|+.+-...+|+.+|+++|++.+.. T Consensus 115 avLVaHNA~FD~~FLn~~~~r~g~~~~P~~~~~viDTl~Lar~~~~~~~L~~lc~~~gi~~~~~---------------- 178 (213) T PRK05168 115 AILVAHNAHFDLGFLNAAAERTGLKRNPFHPFSTFDTATLAGLAYGQTVLAKACQAAGIEFDNK---------------- 178 (213) T ss_pred CEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHHCCCCHHHHHHHCCCCCCCC---------------- T ss_conf 6377634187499999999984997788887763358998688748740999999869999997---------------- Q ss_pred HHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 3201345311258768999898877539999999987200 Q gi|254780127|r 140 IDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) ...-|..|+++|..+|-+|.+ T Consensus 179 -------------------~~HrAl~DA~~tA~vFl~ll~ 199 (213) T PRK05168 179 -------------------EAHSALYDTEKTAELFCEIVN 199 (213) T ss_pred -------------------CCCCCHHHHHHHHHHHHHHHH T ss_conf -------------------777738999999999999999 No 67 >cd05782 DNA_polB_like1_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=98.78 E-value=4.5e-07 Score=64.95 Aligned_cols=131 Identities=19% Similarity=0.258 Sum_probs=80.3 Q ss_pred CEEEEEEEEECC---CEEEEEECCC--CCHHHHHHHHHHCCCCCEEEEEEH-HHHHHHHHH---HCCCCCCC-------- Q ss_conf 829999999838---2289997888--422179999986488981998630-775999998---62998786-------- Q gi|254780127|r 28 AVITLCAYAWDD---EPVKLWDRTE--QSAMPSDLLQYLRDETVMCVAHNS-LFERILFKK---TLGIDIPS-------- 90 (675) Q Consensus 28 ~~I~~i~~a~~~---~~~~~~d~~~--~~~~~~~l~~~L~~~~~~kv~HNa-~FD~~vL~~---~lGi~i~~-------- 90 (675) .+|++||+.+.+ ..+....... -..+...+.+.++.....+|+||. .||+++|.+ .+|+.+|. T Consensus 51 ~rIV~Is~~~~~~~~~~~~~~~~~~~~E~~ll~~F~~~i~~~~p~lv~~Ng~~FD~P~l~~Ral~~gi~~p~~~~~~~~~ 130 (208) T cd05782 51 HKVVSISALYRDDDGGFLKVRTLDGADEKELLEDFFQLIEKKNPRLVSFNGRGFDLPVLHLRALIHGVSAPAYFDLGNKD 130 (208) T ss_pred CCEEEEEEEEEECCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHCCCCCCHHHHCCCCC T ss_conf 86279999999558977998417999999999999999986599799636875758999999999198975554048874 Q ss_pred ----CCE----EEHHHHHHHHC--CCCCHHHHHHHHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHH Q ss_conf ----504----54588999858--89997899998088986527663765766410023320134531125876899989 Q gi|254780127|r 91 ----KRW----ICTSVLARTNG--LPSSLKNACLALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFG 160 (675) Q Consensus 91 ----~~~----~DTmi~A~~l~--~p~sL~~la~~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (675) .+| .|||-.-..-+ ...||+.+|+.||++.+ -+..|....... .. ...+... T Consensus 131 ~~y~~r~~~~H~Dl~d~l~~~~~~~~~sLd~la~llG~pgK--~~~~Gs~V~~~~--------------~~--g~l~~I~ 192 (208) T cd05782 131 WNYRNRYSERHLDLMDLLAFYGARARASLDLLAKLLGIPGK--MDVDGSQVWELY--------------AE--GKLDEIA 192 (208) T ss_pred CCCCCCCCCCCCCHHHHHHCCCCCCCCCHHHHHHHHCCCCC--CCCCHHHHHHHH--------------HC--CCHHHHH T ss_conf 11248887631329999855487577889999998389975--788889999999--------------81--9988999 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 8877539999999987 Q gi|254780127|r 161 EYCKRDVEATRELFKR 176 (675) Q Consensus 161 ~Ya~~Dv~~~~~L~~~ 176 (675) +||..||..|..+|.+ T Consensus 193 ~YCe~DVl~T~~v~lR 208 (208) T cd05782 193 EYCETDVLNTYLLYLR 208 (208) T ss_pred HHHHHHHHHHHHHHHC T ss_conf 9999999999999809 No 68 >cd05780 DNA_polB_Kod1_like_exo The 3'-5' exonuclease domain of archaeal family-B DNA polymerases with similarity to Pyrococcus kodakaraensis Kod1, including polymerases from Desulfurococcus (D. Tok Pol) and Thermococcus gorgonarius (Tgo Pol). Kod1, D. Tok Pol, and Tgo Pol are thermostable enzymes that exhibit both polymerase and 3'-5' exonuclease activities. They are family-B DNA polymerases. Their amino termini harbor a DEDDy-type DnaQ-like 3'-5' exonuclease domain that contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members of this subfamily show similarity to eukaryotic DNA polymerases involved in DNA replication. Some archaea possess multiple family-B D Probab=98.71 E-value=4.6e-07 Score=64.88 Aligned_cols=152 Identities=20% Similarity=0.288 Sum_probs=84.8 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCE-EEEEEC---------CCCCHHHHHHHHHHC--CCCCEE Q ss_conf 97899980406788700046000026882999999983822-899978---------884221799999864--889819 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEP-VKLWDR---------TEQSAMPSDLLQYLR--DETVMC 68 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~-~~~~d~---------~~~~~~~~~l~~~L~--~~~~~k 68 (675) +|++++||||+|...... ++...|..|+++..++. +..+.. ..-..+...+.+++. +|+++ T Consensus 3 lkilsfDIE~~~~~g~P~------p~~d~Ii~Is~~~~~~~~vi~~~~~~~~~v~~~~tE~eLL~~F~~~i~~~dPDii- 75 (195) T cd05780 3 LKILSFDIEVLNHEGEPN------PEKDPIIMISFADEGGNKVITWKKFDLPFVEVVKTEKEMIKRFIEIVKEKDPDVI- 75 (195) T ss_pred CCEEEEEEEECCCCCCCC------CCCCCEEEEEEEECCCCEEEEECCCCCCCEEEECCHHHHHHHHHHHHHHCCCCEE- T ss_conf 429999999758999989------8888289999996899789981798877456738999999999999987299999- Q ss_pred EEEEH-HHHHHHHHH---HCCCCCC----------------------CCCEEEHHHHHH-HHC-CCCCHHHHHHH-HCCC Q ss_conf 98630-775999998---6299878----------------------650454588999-858-89997899998-0889 Q gi|254780127|r 69 VAHNS-LFERILFKK---TLGIDIP----------------------SKRWICTSVLAR-TNG-LPSSLKNACLA-LNFS 119 (675) Q Consensus 69 v~HNa-~FD~~vL~~---~lGi~i~----------------------~~~~~DTmi~A~-~l~-~p~sL~~la~~-lgl~ 119 (675) +|||. .||++.|.. .+|++++ .....|.+-+.+ ... ..++|+.+|+. ||.. T Consensus 76 ~GyN~~~FD~pYl~~R~~~~~~~~~~gr~~~~~~~~~~g~~~~~~i~Gr~~lD~~~~~~~~~~l~sy~L~~Va~~~Lg~~ 155 (195) T cd05780 76 YTYNGDNFDFPYLKKRAEKLGIELDLGRDGSEIKIQRGGFNNASEIKGRIHVDLYPVARRTLNLTRYTLERVYEELFGIE 155 (195) T ss_pred EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCEEEEECCEEEEEHHHHHHHHCCCCCCCHHHHHHHHHCCC T ss_conf 95798787689999999995898645788860479977850799876868855499997525624015999999983999 Q ss_pred CCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8652766376576641002332013453112587689998988775399999999872 Q gi|254780127|r 120 EHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRL 177 (675) Q Consensus 120 ~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L 177 (675) ... ..++.+..+. .. .+..+.+.+||.+|++++++|.+++ T Consensus 156 K~d---~~~~~i~~~~--------------~~-~~~~~~l~~Yn~~D~~L~~~L~~K~ 195 (195) T cd05780 156 KED---VPGEEIAEAW--------------DS-GENLERLFRYSMEDAKYTYEIGKEF 195 (195) T ss_pred CCC---CCHHHHHHHH--------------HC-CCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 788---9999999999--------------75-9753888844599899999998529 No 69 >PRK00448 polC DNA polymerase III PolC; Validated Probab=98.68 E-value=6.4e-07 Score=63.97 Aligned_cols=57 Identities=21% Similarity=0.171 Sum_probs=44.3 Q ss_pred CCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHHCCCC Q ss_conf 89819986307759999986---2998786504545889998588---999789999808898 Q gi|254780127|r 64 ETVMCVAHNSLFERILFKKT---LGIDIPSKRWICTSVLARTNGL---PSSLKNACLALNFSE 120 (675) Q Consensus 64 ~~~~kv~HNa~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~lgl~~ 120 (675) .+.++|+|||.||..|+... +|++-....++||+.+|+.+.+ .++|+.+++++++.- T Consensus 497 ~d~vlVAHNa~FD~~Fi~~~~~~~~~~~~~~p~iDTl~lsr~l~p~~k~~~L~~l~k~~~v~~ 559 (1436) T PRK00448 497 GDSVLVAHNASFDVGFINTNYEKLGLEKIKNPVIDTLELSRFLYPEYKSHRLNTLAKKFGVEL 559 (1436) T ss_pred CCCEEEEECCCCCHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHHHCCCC T ss_conf 897799845865688999999980898667972302998987551011144788998848886 No 70 >PRK09146 DNA polymerase III subunit epsilon; Validated Probab=98.63 E-value=7.9e-07 Score=63.34 Aligned_cols=56 Identities=20% Similarity=0.192 Sum_probs=38.5 Q ss_pred CCCEEEEEEHHHHHHHHHH----HCCCCCCCCCEEEHHHHHHHHC--------------CC--CCHHHHHHHHCCCC Q ss_conf 8981998630775999998----6299878650454588999858--------------89--99789999808898 Q gi|254780127|r 64 ETVMCVAHNSLFERILFKK----TLGIDIPSKRWICTSVLARTNG--------------LP--SSLKNACLALNFSE 120 (675) Q Consensus 64 ~~~~kv~HNa~FD~~vL~~----~lGi~i~~~~~~DTmi~A~~l~--------------~p--~sL~~la~~lgl~~ 120 (675) .+.++|+||+.||+.||.+ ++|..+. ..+.|||.+++.+. .+ ..|..+.++|||+. T Consensus 128 ~g~VLVaH~a~~D~~FL~~a~~~~~g~~l~-~P~IDTl~Le~~~~~~~~~~~~~r~~~~~~~slrL~~~r~rYgLP~ 203 (239) T PRK09146 128 AGKVVVVHYRRIERDFLDQALRNRIGEGIE-FPVIDTMEIEARIQRKQAGGLWNRLKGKKPESIRLADSRLRYGLPA 203 (239) T ss_pred CCCEEEEECCHHHHHHHHHHHHHHHCCCCC-CCEEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCC T ss_conf 899899968399999999999996288988-7464489999999885420044431358986763899999819898 No 71 >cd06125 DnaQ_like_exo The DnaQ-like exonuclease superfamily is a structurally conserved group of 3'-5' exonucleases, which catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. It is also called the DEDD superfamily, after the four invariant acidic residues present in the catalytic site of its members. The superfamily consists of DNA- and RNA-processing enzymes such as the proofreading domains of DNA polymerases, other DNA exonucleases, RNase D, RNase T, Oligoribonuclease and RNA exonucleases (REX). The DnaQ-like exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, which are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The conservation patterns of the three motifs may vary among different subfamilies. DnaQ-like exonucleases are classified as DEDDy or DEDDh exonucleases depending on the variation of motif Probab=98.62 E-value=5.5e-07 Score=64.41 Aligned_cols=89 Identities=26% Similarity=0.264 Sum_probs=56.6 Q ss_pred EEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEEC-CCEEEEEECCCCC------------------------HHHHHHH Q ss_conf 99980406788700046000026882999999983-8228999788842------------------------2179999 Q gi|254780127|r 4 LFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWD-DEPVKLWDRTEQS------------------------AMPSDLL 58 (675) Q Consensus 4 l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~-~~~~~~~d~~~~~------------------------~~~~~l~ 58 (675) +++|+||++ +.. ....+|..|+.... .+..+...+-+.. .+..++. T Consensus 1 Vv~D~ETTG--------l~~-~~~d~Iiei~~v~~~~~~~~~~~v~p~~~i~~~~~~i~gi~~~~v~~~p~~~~~~~~~~ 71 (130) T cd06125 1 VAFDIETTG--------LNG-NARDEIIEIALVVVHIGEKFVFDVKPDDLIDPYAFTISGITDEVVERAATEAELLIELL 71 (130) T ss_pred CEEEEECCC--------CCC-CCCCEEEEEEEEEEECCCEEEEEECCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 989801899--------699-99985899999997189889999799998997794103759999833060899999999 Q ss_pred HHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCC--CCEEEHHHHHH Q ss_conf 9864889819986307759999986---2998786--50454588999 Q gi|254780127|r 59 QYLRDETVMCVAHNSLFERILFKKT---LGIDIPS--KRWICTSVLAR 101 (675) Q Consensus 59 ~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~--~~~~DTmi~A~ 101 (675) +.+++.+.++||||+.||+.||.+. +|++.++ ..+.||+.+|+ T Consensus 72 ~f~~~~~~ilVgHN~~FD~~fL~~~~~r~g~~~~~~~~~~iDTl~lar 119 (130) T cd06125 72 AFLRDKDDILVGHNISFDLPYLLNRAAELGIPYPILGRIVFDTLALAL 119 (130) T ss_pred HHHCCCCCEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCEEECHHHHH T ss_conf 997699999999194774999999999849999878996868609842 No 72 >cd05784 DNA_polB_II_exo The 3'-5' exonuclease domain of Escherichia coli DNA polymerase II (PolII) and similar bacterial proteins. PolII is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain has a fundamental role in the proofreading activity of polII. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. PolII is involved in a variety of cellular activities, such as the repair of DNA damaged by UV irradiation or oxidation. It plays a pivotal role in replication-restart, a process that bypasses DNA damage in a Probab=98.61 E-value=1.1e-07 Score=68.98 Aligned_cols=156 Identities=17% Similarity=0.244 Sum_probs=82.2 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHC--CCCCEEEEEEH-HHHH Q ss_conf 97899980406788700046000026882999999983822899978884221799999864--88981998630-7759 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLR--DETVMCVAHNS-LFER 77 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~--~~~~~kv~HNa-~FD~ 77 (675) .|.+++||||.++..+-..|++-.. .-.+.+++.+-....+.+.-+..-..+...+..++. +|+++ +|||. .||+ T Consensus 3 lki~s~DIE~~~~~~i~~I~~~~~~-~~~v~~~~~~~~~~~~~v~~~~sE~eLL~~F~~~i~~~dPDiI-tG~N~~~FDi 80 (193) T cd05784 3 LKVVSLDIETSMDGELYSIGLYGEG-QERVLMVGDPEDDAPDNIEWFADEKSLLLALIAWFAQYDPDII-IGWNVINFDL 80 (193) T ss_pred CEEEEEEECCCCCCCEEEEEEECCC-CCEEEEEECCCCCCCCEEEEECCHHHHHHHHHHHHHHCCCCEE-EECCCCCCCH T ss_conf 3099999632798888899998699-9989999789888997799989999999999999986299999-9779877767 Q ss_pred HHHHHH---CCCCC------------------------CCCCEEEHHHHHHH--HC-CCCCHHHHHHH-HCCCCCCHH-H Q ss_conf 999986---29987------------------------86504545889998--58-89997899998-088986527-6 Q gi|254780127|r 78 ILFKKT---LGIDI------------------------PSKRWICTSVLART--NG-LPSSLKNACLA-LNFSEHLTK-M 125 (675) Q Consensus 78 ~vL~~~---lGi~i------------------------~~~~~~DTmi~A~~--l~-~p~sL~~la~~-lgl~~~~~k-d 125 (675) +.|..- +|+++ +.....|.|-+-+- .. .+++|+.+|+. ||....... + T Consensus 81 PYL~~R~~~~~i~~~~~r~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~~~~~~~l~sy~Ld~Va~~~LGe~K~~~~~~ 160 (193) T cd05784 81 RLLQRRAEAHGLPLRLGRGGSPLNWRQSGKPGQGFLSLPGRVVLDGIDALKTATYHFESFSLENVAQELLGEGKLIHDVD 160 (193) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCEEEEEECCEEEECHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 89999999858864336578850598768562699997887841729999760735114348889999838775445655 Q ss_pred HHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 63765766410023320134531125876899989887753999999998 Q gi|254780127|r 126 EEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFK 175 (675) Q Consensus 126 ~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~ 175 (675) .....+.+. + ...++.+.+||.+|++++.+|++ T Consensus 161 ~~~~~i~~l--------------~---~~d~~k~~~Yni~D~~Lv~rLf~ 193 (193) T cd05784 161 DRGAEIERL--------------F---REDKLALARYNLQDCELVWRIFE 193 (193) T ss_pred CCHHHHHHH--------------H---HCCCHHHEEEHHHHHHHHHHHHC T ss_conf 558889998--------------8---65903610320277999999749 No 73 >PRK07983 exodeoxyribonuclease X; Provisional Probab=98.60 E-value=8.6e-07 Score=63.09 Aligned_cols=84 Identities=21% Similarity=0.244 Sum_probs=53.5 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHC--CCCCHHHHHHHHCCCCCCHHHHHCHHHHH Q ss_conf 9999864889819986307759999986299878650454588999858--89997899998088986527663765766 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLARTNG--LPSSLKNACLALNFSEHLTKMEEGKALIA 133 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~--~p~sL~~la~~lgl~~~~~kd~~gk~li~ 133 (675) ++.+.+.+ +...||||+.||+.+|.. ....|+||+-+|+.+- .++|+..+...+++...... T Consensus 66 ev~~~~~~-~~~lVaHNa~FD~~~L~~------~~~~~IcTl~lAR~l~p~~~~~l~~Lry~~~~~~~~~~--------- 129 (219) T PRK07983 66 DVIPHYYG-SEWYVAHNASFDRRVLPE------MPGEWICTMKLARRLWPGIKYSNMALYKSRKLNVQTPP--------- 129 (219) T ss_pred HHHHHHCC-CCEEEECCCCCCHHHHCC------CCCCEEEHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCC--------- T ss_conf 99998628-987875365103866415------58756657899998767742418999998427613324--------- Q ss_pred HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 410023320134531125876899989887753999999998720 Q gi|254780127|r 134 RFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLI 178 (675) Q Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~ 178 (675) .. ...-|..|+.++..++..|. T Consensus 130 --------------~~---------~AHrAl~Dv~v~~~ll~~l~ 151 (219) T PRK07983 130 --------------GL---------HHHRALYDCYITAALLIDIM 151 (219) T ss_pred --------------CC---------CCCCHHHHHHHHHHHHHHHH T ss_conf --------------43---------33430787999999999999 No 74 >PRK07247 DNA polymerase III subunit epsilon; Validated Probab=98.52 E-value=4.2e-06 Score=58.57 Aligned_cols=83 Identities=10% Similarity=0.142 Sum_probs=53.7 Q ss_pred HHHHHCCCCCEEEEEEH-HHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHC----CCCCHHHHHHHHCCCCCCHHHHHC Q ss_conf 99986488981998630-7759999986---299878650454588999858----899978999980889865276637 Q gi|254780127|r 57 LLQYLRDETVMCVAHNS-LFERILFKKT---LGIDIPSKRWICTSVLARTNG----LPSSLKNACLALNFSEHLTKMEEG 128 (675) Q Consensus 57 l~~~L~~~~~~kv~HNa-~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l~----~p~sL~~la~~lgl~~~~~kd~~g 128 (675) +...+ .+..+||||+ +||+.+|... ++.... ....||...++.+. ..++|+.+++.||++.. T Consensus 78 f~~Fi--gd~~lVaHNa~~FD~~fL~~~~~~~~~~~~-~~~~~~~~~~r~~~~p~l~n~kL~tLa~~f~I~~~------- 147 (195) T PRK07247 78 FKNFV--GELPLIGYNAQKSDLPILAENGLDLRDQYQ-VDLFDEAFDRRSSDLNGIANLKLQTVATFLGIKGR------- 147 (195) T ss_pred HHHHH--CCCEEEEECCCHHHHHHHHHHHHHHCCCCC-CCHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCC------- T ss_conf 99996--899189908845149999998774157766-85699999998860588767778999997499998------- Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 6576641002332013453112587689998988775399999999872002 Q gi|254780127|r 129 KALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIPL 180 (675) Q Consensus 129 k~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~ 180 (675) ..-|..|+..|..+|.+|..+ T Consensus 148 -------------------------------~HRAl~DA~aTa~If~klLe~ 168 (195) T PRK07247 148 -------------------------------GHNSLEDARMTARIYESFLET 168 (195) T ss_pred -------------------------------CCCCHHHHHHHHHHHHHHHHH T ss_conf -------------------------------908189999999999999997 No 75 >PRK05762 DNA polymerase II; Reviewed Probab=98.51 E-value=4.6e-06 Score=58.26 Aligned_cols=156 Identities=16% Similarity=0.205 Sum_probs=78.7 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCC-EEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHC--CCCCEEEEEEH-HHH Q ss_conf 9789998040678870004600002688-2999999983822899978884221799999864--88981998630-775 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQA-VITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLR--DETVMCVAHNS-LFE 76 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~-~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~--~~~~~kv~HNa-~FD 76 (675) .+.+|+|||++...++-..|+.. ++. .+..++-..++..+.. +..-..+...+..++. ||+++ +|||+ .|| T Consensus 155 lrvlSfDIEc~~~g~ii~I~l~~--~~~~~v~~~g~~~~~~~v~~--~~~E~eLL~~F~~~i~~~DPDII-~GyNi~~FD 229 (785) T PRK05762 155 LKVVSLDIETSNKGELYSIGLEG--CGQRQVYMLGEANGDAELEY--VADEKALLERFNAWFAEFDPDVI-IGWNVVQFD 229 (785) T ss_pred CEEEEEEEEECCCCCEEEEEECC--CCCCEEEEECCCCCCCEEEE--ECCHHHHHHHHHHHHHHHCCCEE-EECCCCCCC T ss_conf 86999998977999889999517--99846899668899853899--09999999999999998599989-962747877 Q ss_pred HHHHHH---HCCCCCC-------------------------CCCEEEHHHHHHH--HC-CCCCHHHHHHHH-CCCCCCH- Q ss_conf 999998---6299878-------------------------6504545889998--58-899978999980-8898652- Q gi|254780127|r 77 RILFKK---TLGIDIP-------------------------SKRWICTSVLART--NG-LPSSLKNACLAL-NFSEHLT- 123 (675) Q Consensus 77 ~~vL~~---~lGi~i~-------------------------~~~~~DTmi~A~~--l~-~p~sL~~la~~l-gl~~~~~- 123 (675) ++.|.. .+|+++. ..-..|++-.-+- .. ..++|+.+++.+ |-..... T Consensus 230 lpyL~~Ra~~~gi~l~lgR~~~~~~~~~~~~~~~~~~~~i~GRv~iD~~~~~k~~~~~l~sYsL~~Va~~~Lge~K~~~~ 309 (785) T PRK05762 230 LRLLQERAERYGIPLRLGRGGSELSWRVHGHKRGHFFASVPGRLVLDGIDALKSAFYEFKSFSLENVAQELLGEGKAIDN 309 (785) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCEECCCCCCCEEEEEEEEEEEEEEHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCC T ss_conf 79999999994998546878886533315655660599984289997099998752036766299999998544555566 Q ss_pred HHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 7663765766410023320134531125876899989887753999999998720 Q gi|254780127|r 124 KMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLI 178 (675) Q Consensus 124 kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~ 178 (675) .+....+...+ +. +....+..||.+|+.+++.|..++. T Consensus 310 ~~~r~~eI~~~--------------~~---~~r~~l~~Y~l~Da~L~~~L~~kl~ 347 (785) T PRK05762 310 PYDRMDEIDRY--------------FA---DDKPALAKYNLKDCELVTRIFEKTE 347 (785) T ss_pred HHHHHHHHHHH--------------HH---CCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 66668888998--------------85---4689999999999999999999870 No 76 >cd06127 DEDDh DEDDh exonucleases, part of the DnaQ-like (or DEDD) exonuclease superfamily, catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. These proteins contain four invariant acidic residues in three conserved sequence motifs termed ExoI, ExoII and ExoIII. DEDDy exonucleases are classified as such because of the presence of specific Hx(4)D conserved pattern at the ExoIII motif. The four conserved acidic residues are clustered around the active site and serve as ligands for the two metal ions required for catalysis. Most DEDDh exonucleases are the proofreading subunits (epsilon) or domains of bacterial DNA polymerase III, the main replicating enzyme in bacteria, which functions as the chromosomal replicase. Other members include other DNA and RNA exonucleases such as RNase T, Oligoribonuclease, and RNA exonuclease (REX), among others. Probab=98.48 E-value=4.8e-06 Score=58.15 Aligned_cols=46 Identities=33% Similarity=0.478 Sum_probs=33.3 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHH Q ss_conf 9999864889819986307759999986---29987865045458899985 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKT---LGIDIPSKRWICTSVLARTN 103 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l 103 (675) ++...+.+ ..+|+||+.||+.+|.+. ++...+...|+|||.+++.+ T Consensus 72 ~~~~~l~~--~~lv~hN~~FD~~~L~~~~~~~~~~~~~~~~iDt~~~~~~~ 120 (159) T cd06127 72 EFLEFLGG--RVLVAHNASFDLRFLNRELRRLGGPPLPNPWIDTLRLARRL 120 (159) T ss_pred HHHHHCCC--CEEEECCHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHH T ss_conf 99873156--34651588899999999999839998899666479999997 No 77 >COG2176 PolC DNA polymerase III, alpha subunit (gram-positive type) [DNA replication, recombination, and repair] Probab=98.46 E-value=5.7e-07 Score=64.27 Aligned_cols=57 Identities=25% Similarity=0.282 Sum_probs=44.1 Q ss_pred CCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHCC---CCCHHHHHHHHCCCC Q ss_conf 89819986307759999986---2998786504545889998588---999789999808898 Q gi|254780127|r 64 ETVMCVAHNSLFERILFKKT---LGIDIPSKRWICTSVLARTNGL---PSSLKNACLALNFSE 120 (675) Q Consensus 64 ~~~~kv~HNa~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l~~---p~sL~~la~~lgl~~ 120 (675) .+.+.|+|||.||+.||... +|+..-...++||+-+|+.|++ +++|..+++.|++.- T Consensus 500 ~d~IlVAHNasFD~gFl~~~~~k~~~~~~~~pvIDTL~lar~L~P~~ksh~Lg~l~kk~~v~l 562 (1444) T COG2176 500 GDSILVAHNASFDMGFLNTNYEKYGLEPLTNPVIDTLELARALNPEFKSHRLGTLCKKLGVEL 562 (1444) T ss_pred CCCEEEECCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCHHHHHCCHHHHHHHHCCCH T ss_conf 485899616755645788889983885102750119999987574543246677888748637 No 78 >smart00479 EXOIII exonuclease domain in DNA-polymerase alpha and epsilon chain, ribonuclease T and other exonucleases. Probab=98.41 E-value=9.5e-06 Score=56.22 Aligned_cols=87 Identities=28% Similarity=0.332 Sum_probs=56.8 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHHC---CCCCC-CCCEEEHHHHHHHHCC--CCCHHHHHHHHCCCCCCHHHHHCH Q ss_conf 99998648898199863077599999862---99878-6504545889998588--999789999808898652766376 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKTL---GIDIP-SKRWICTSVLARTNGL--PSSLKNACLALNFSEHLTKMEEGK 129 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~l---Gi~i~-~~~~~DTmi~A~~l~~--p~sL~~la~~lgl~~~~~kd~~gk 129 (675) ++.+.+.+ ....++||+.||..+|.+.+ |...+ ...|.|||-.++.... ..+|+.+++.+|++.... T Consensus 73 ~~~~~l~~-~~~v~~~~~~fD~~~L~~~~~~~~~~~~~~~~~~dt~~~~~~~~~~~~~~L~~l~~~~~i~~~~~------ 145 (169) T smart00479 73 ELLEFLKG-KILVAGNALNFDLRFLKLEHPRLGIKDPPKNPVIDTLKLARALNPGRKYSLKKLAERLGLEVIGR------ 145 (169) T ss_pred HHHHHHCC-CCCHHCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCC------ T ss_conf 99986048-74510367999999999999981899886652462999999975885458999999859998999------ Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 57664100233201345311258768999898877539999999987200 Q gi|254780127|r 130 ALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 130 ~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) ..-|..|+..|..++.+|.. T Consensus 146 ------------------------------~H~Al~Da~~t~~v~~~l~~ 165 (169) T smart00479 146 ------------------------------AHRALDDARATAKLFKKLVE 165 (169) T ss_pred ------------------------------CCCHHHHHHHHHHHHHHHHH T ss_conf ------------------------------92979999999999999997 No 79 >cd05777 DNA_polB_delta_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase delta. DNA polymerase delta is a family-B DNA polymerase with a catalytic subunit that contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (alpha and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase delta is the enzyme responsible for both elongation and maturation of Okazaki fragments on the lagging strand. It is also implicated in mismatch repair (MMR) and base excision repair (BER). The catalytic subunit displays both polymerase and 3'-5' exonuclease activities. The exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues necessary for metal binding and catalysis. The exonuclease domain of family B polyme Probab=98.40 E-value=8.1e-06 Score=56.67 Aligned_cols=157 Identities=13% Similarity=0.167 Sum_probs=77.9 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEE--CCCE--E----------------EEEECCCCCHHHHHHHHH Q ss_conf 9789998040678870004600002688299999998--3822--8----------------999788842217999998 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAW--DDEP--V----------------KLWDRTEQSAMPSDLLQY 60 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~--~~~~--~----------------~~~d~~~~~~~~~~l~~~ 60 (675) +|.+|+||||+|... ..+-++.-.|..|+..+ .+.. . .++.+..-..+...+..+ T Consensus 7 lki~sfDIE~~s~~g-----~fP~p~~D~Ii~I~~~~~~~~~~~~~~~~v~~l~~~~~i~~~~V~~~~tE~eLL~~F~~~ 81 (230) T cd05777 7 LRILSFDIECAGRKG-----VFPEPEKDPVIQIANVVTRQGEGEPFIRNIFTLKTCAPIVGAQVFSFETEEELLLAWRDF 81 (230) T ss_pred CEEEEEEEEECCCCC-----CCCCCCCCCEEEEEEEEEECCCCCCEEEEEEEECCCCCCCCCEEEECCCHHHHHHHHHHH T ss_conf 049999999526999-----998988883489999998279776047878982687888998799849999999999999 Q ss_pred HC--CCCCEEEEEEH-HHHHHHHHHH---CCCCC----------------------------------CCCCEEEHHHHH Q ss_conf 64--88981998630-7759999986---29987----------------------------------865045458899 Q gi|254780127|r 61 LR--DETVMCVAHNS-LFERILFKKT---LGIDI----------------------------------PSKRWICTSVLA 100 (675) Q Consensus 61 L~--~~~~~kv~HNa-~FD~~vL~~~---lGi~i----------------------------------~~~~~~DTmi~A 100 (675) +. +|+++ +|+|. .||++.|... ++++. +.....|.+-.- T Consensus 82 i~~~dPDii-tGyN~~~FD~pYL~~Ra~~l~~~~~~~lgr~~~~~~~~~~~~~~s~~~g~~~~~~~~i~Gr~~lD~~~~~ 160 (230) T cd05777 82 VQEVDPDII-TGYNICNFDLPYLLERAKALKLNTFPFLGRIKNIKSTIKDTTFSSKQMGTRETKEINIEGRIQFDLLQVI 160 (230) T ss_pred HHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCCHHHHCCCCCCCCEEECCCCCCCCCCCCEEEEEEECCEEHHHHHHHH T ss_conf 998099999-9636667887899999999688614541766787635741652223126400269987587599999999 Q ss_pred HH-HC-CCCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98-58-89997899998-08898652766376576641002332013453112587689998988775399999999872 Q gi|254780127|r 101 RT-NG-LPSSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRL 177 (675) Q Consensus 101 ~~-l~-~p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L 177 (675) +- .. ..++|+.+++. ||-..... ..+.+.. -+...++.+..+.+||.+|++++.+|.++| T Consensus 161 r~~~~l~sY~L~~Va~~~Lg~~K~d~---~~~~i~~--------------~~~~~~~~~~~~~~Y~i~D~~L~~~L~~Kl 223 (230) T cd05777 161 QRDYKLRSYSLNSVSAHFLGEQKEDV---HYSIITD--------------LQNGNPETRRRLAVYCLKDAYLPLRLLDKL 223 (230) T ss_pred HHHCCHHHCCHHHHHHHHHCCCCCCC---CHHHHHH--------------HHHCCHHHHHHHHEECHHHHHHHHHHHHHH T ss_conf 97501633649999999838998879---9999999--------------994796464560064268779999999998 Q ss_pred CCC Q ss_conf 002 Q gi|254780127|r 178 IPL 180 (675) Q Consensus 178 ~~~ 180 (675) .-+ T Consensus 224 ~~~ 226 (230) T cd05777 224 MCL 226 (230) T ss_pred HHH T ss_conf 777 No 80 >cd05160 DNA_polB_exo The 3'-5' exonuclease domain of family-B DNA polymerases. This domain has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The exonuclease domain of family B polymerase also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members include E. coli DNA polymerase II, some eubacterial phage DNA polymerases, nuclear replicative DNA polymerases (alpha, delta, epsilon and zeta), and eukaryotic viral and Probab=98.40 E-value=6.6e-06 Score=57.27 Aligned_cols=147 Identities=16% Similarity=0.115 Sum_probs=78.1 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECC-CEEE----------------------EEECCCCCHHHHHHHH Q ss_conf 8999804067887000460000268829999999838-2289----------------------9978884221799999 Q gi|254780127|r 3 KLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDD-EPVK----------------------LWDRTEQSAMPSDLLQ 59 (675) Q Consensus 3 ~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~-~~~~----------------------~~d~~~~~~~~~~l~~ 59 (675) .+++||||+|.... +.+ +.-.|..|+++... +... +..+..-..+...+.. T Consensus 1 vlsfDIE~~~~~gf----P~p--~~d~Ii~Is~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~E~eLL~~F~~ 74 (200) T cd05160 1 VLSFDIETLPNVGG----PEP--SRDPIICISYADSFQGHKVVFLLKVSTHGDDIEAGFIDGIEVEYFDDEKELLKRFFD 74 (200) T ss_pred CEEEEEEECCCCCC----CCC--CCCEEEEEEEEEECCCEEEEEEEECCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHH T ss_conf 97999832799997----899--988089999999359999998864355676765467799699996999999999999 Q ss_pred HHC--CCCCEEEEEEH-HHHHHHHHH---HCCCCCC---------------------CCCEEEHHHHHHHH-C-CCCCHH Q ss_conf 864--88981998630-775999998---6299878---------------------65045458899985-8-899978 Q gi|254780127|r 60 YLR--DETVMCVAHNS-LFERILFKK---TLGIDIP---------------------SKRWICTSVLARTN-G-LPSSLK 110 (675) Q Consensus 60 ~L~--~~~~~kv~HNa-~FD~~vL~~---~lGi~i~---------------------~~~~~DTmi~A~~l-~-~p~sL~ 110 (675) ++. +|+++ +|||. .||++.|.. .+|+++. .....|.+-..+-. . ..++|+ T Consensus 75 ~i~~~dPDii-~GyN~~~FD~pyL~~R~~~l~~~~~~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~k~~~~l~sy~Ld 153 (200) T cd05160 75 IIREYDPDIL-TGYNIDDFDLPYLLERAKALGIKLDDIGRRSGQEKSSGETERAAVKGRVVFDLLRAVKRDFKLKSYTLD 153 (200) T ss_pred HHHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCEEEEEECCEEEEHHHHHHHHHHCCCCCCHH T ss_conf 9997099999-972677784889999999969885554673454543575059973677860289999975324665399 Q ss_pred HHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99998-088986527663765766410023320134531125876899989887753999999998 Q gi|254780127|r 111 NACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFK 175 (675) Q Consensus 111 ~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~ 175 (675) .+|+. ||..... ..++...... . ...++.+.+||.+|+.++++|++ T Consensus 154 ~Va~~~Lg~~K~d---~~~~~~~~~~-------------~---~~d~~k~~~Y~~~D~~L~~~L~e 200 (200) T cd05160 154 AVAEELLGEGKEK---VDGEIIEDLE-------------W---EEDLERLLEYNLKDAELTLQILE 200 (200) T ss_pred HHHHHHHCCCCCC---CCHHHHHHHH-------------H---CCCCHHHHHEEHHHHHHHHHHHC T ss_conf 9999984999787---8989999998-------------7---38914866025999998999759 No 81 >cd06136 TREX1_2 Three prime repair exonuclease (TREX) 1 and 2 are closely related DEDDh-type DnaQ-like 3'-5' exonucleases. They contain three conserved sequence motifs known as ExoI, II, and III, with a specific Hx(4)D conserved pattern at ExoIII. These motifs contain four conserved acidic residues that participate in coordination of divalent metal ions required for catalysis. Both proteins play a role in the metabolism and clearance of DNA. TREX1 is the major 3'-5' exonuclease activity detected in mammalian cells. Mutations in the human TREX1 gene can cause Aicardi-Goutieres syndrome (AGS), which is characterized by perturbed innate immunity and presents itself as a severe neurological disease. TREX1 degrades ssDNA generated by aberrant replication intermediates to prevent checkpoint activation and autoimmune disease. There are distinct structural differences between TREX1 and 2 that point to different biological roles for these proteins. The main difference is the presence of about 7 Probab=98.39 E-value=2.7e-06 Score=59.80 Aligned_cols=54 Identities=22% Similarity=0.261 Sum_probs=38.1 Q ss_pred CCCEEEEEEH-HHHHHHHHHH---CCCCCCC-CCEEEHHHHHHHHCCCCCHHHHHHHH-CCC Q ss_conf 8981998630-7759999986---2998786-50454588999858899978999980-889 Q gi|254780127|r 64 ETVMCVAHNS-LFERILFKKT---LGIDIPS-KRWICTSVLARTNGLPSSLKNACLAL-NFS 119 (675) Q Consensus 64 ~~~~kv~HNa-~FD~~vL~~~---lGi~i~~-~~~~DTmi~A~~l~~p~sL~~la~~l-gl~ 119 (675) .++.+||||+ .||+.||... +|+++|. ..+.||+.+++-+ .++|..+++++ |.+ T Consensus 97 ~~~~LVAHNa~~FD~~fL~~e~~r~g~~~p~~~~~iDtL~l~r~~--~~~L~~l~~~~~gi~ 156 (177) T cd06136 97 KPICLVAHNGNRFDFPILRSELERLGTKLPDDILCVDSLPAFREL--DQSLGSLYKRLFGQE 156 (177) T ss_pred CCCEEEECCCHHCCHHHHHHHHHHCCCCCCCCCEEEEEHHHHHHH--CCCHHHHHHHHCCCC T ss_conf 786799648512379999999998699899898899860764550--699999999975999 No 82 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=98.27 E-value=1.1e-06 Score=62.31 Aligned_cols=125 Identities=14% Similarity=0.122 Sum_probs=86.1 Q ss_pred EEEECCCCCEEEEEEECC-C---HHHHHHHH-CCCCC---CCCCH-HHHHCCCCCCCCHHH-HHCCCCCHHHHHHCCCHH Q ss_conf 997335485058866228-7---67866540-44410---00015-776087511057022-100642100133078877 Q gi|254780127|r 377 VVADLAGIEARVLAWIAG-E---DWKIKAFA-NGEDI---YVTTY-AKAFNTPIDKVSKEQ-RAIGKVMELALGYQGGAK 446 (675) Q Consensus 377 v~aDySqiElRvlA~ls~-d---~~li~~f~-~g~D~---h~~tA-~~~~~~~~~~v~~~~-R~~aK~~~~gl~YG~G~~ 446 (675) |=|=.||.|-=+-|.--+ | .+=...|. +-.|+ ++..| +.-||+-| +.. -..|+..+ T Consensus 789 iLAHlS~De~Li~AF~~~~DIH~~TA~~~Fgv~~~~vtp~~RR~AK~vNFGivY----GmSafgLA~~L~---------- 854 (1005) T TIGR00593 789 ILAHLSQDENLIEAFKNGEDIHTQTASRLFGVEEEDVTPNMRRIAKAVNFGIVY----GMSAFGLARELG---------- 854 (1005) T ss_pred HHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHCCCCEEEEHHHH----CCCHHHHHHHHC---------- T ss_conf 998640786799997548871388999972989774496460474144301321----012057777508---------- Q ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 89887887088864036666656667654788999999999875111010310889788999999867307899999999 Q gi|254780127|r 447 VFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTE 526 (675) Q Consensus 447 ~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~ 526 (675) ++..+|..++..||++||+++.|+..+...+++.|++.++.+| ||-.|.|.+-.+..+. T Consensus 855 ------------Is~~EA~~fIe~YF~~yPgv~~Y~~~~~~~ar~~GYV~Tl~gR---------RRY~~~i~s~n~~~R~ 913 (1005) T TIGR00593 855 ------------ISRKEAKEFIERYFERYPGVKEYLENTVEEARKKGYVTTLLGR---------RRYIPDINSRNRNLRE 913 (1005) T ss_pred ------------CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEEEECCC---------EECCCCCCHHHHHHHH T ss_conf ------------8888999999999763413899999999999736855630465---------6115763223577799 Q ss_pred HHHHHHHCCC Q ss_conf 9999963372 Q gi|254780127|r 527 GFACVVQEGG 536 (675) Q Consensus 527 ~~~~~~~~~~ 536 (675) +|.+.+-|-- T Consensus 914 ~AER~A~NaP 923 (1005) T TIGR00593 914 AAERMAINAP 923 (1005) T ss_pred HHHHHHHCCC T ss_conf 9999873675 No 83 >COG0847 DnaQ DNA polymerase III, epsilon subunit and related 3'-5' exonucleases [DNA replication, recombination, and repair] Probab=98.23 E-value=3.4e-05 Score=52.51 Aligned_cols=64 Identities=30% Similarity=0.396 Sum_probs=47.1 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCCCCCCEEEHHHHHHHHC---CCCCHHHHHHHHCCCC Q ss_conf 9999864889819986307759999986---299878650454588999858---8999789999808898 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKT---LGIDIPSKRWICTSVLARTNG---LPSSLKNACLALNFSE 120 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i~~~~~~DTmi~A~~l~---~p~sL~~la~~lgl~~ 120 (675) ++.+.+.+. ...|+||+.||+.|+... .+..++...+.||+.+++... ...+|+.+++++|++. T Consensus 87 ~~~~~i~~~-~~~Vahna~fD~~fl~~~~~~~~~~~~~~~~~~t~~~~r~~~~~~~~~~L~~l~~~~gi~~ 156 (243) T COG0847 87 EFLDFIGGL-RLLVAHNAAFDVGFLRVESERLGIEIPGDPVLDTLALARRHFPGFDRSSLDALAERLGIDR 156 (243) T ss_pred HHHHHHCCC-CEEEEEEHHHCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHCCCCC T ss_conf 999984788-8699950530889999899970898757642007999998769976134899999629887 No 84 >PRK09182 DNA polymerase III subunit epsilon; Validated Probab=98.19 E-value=5e-06 Score=58.08 Aligned_cols=106 Identities=19% Similarity=0.324 Sum_probs=54.5 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCC-CEEEEEEEEEC-CCEEEEE-E-----CCCCCHHHHH------------------ Q ss_conf 8999804067887000460000268-82999999983-8228999-7-----8884221799------------------ Q gi|254780127|r 3 KLFIDIETRSPQPLPKVGVWAYAEQ-AVITLCAYAWD-DEPVKLW-D-----RTEQSAMPSD------------------ 56 (675) Q Consensus 3 ~l~lD~ET~s~~dl~~~Gl~~y~~~-~~I~~i~~a~~-~~~~~~~-d-----~~~~~~~~~~------------------ 56 (675) -+.||+||++ +++-.+. -+|-.+.+.++ +|.+..+ + ..++.++|.+ T Consensus 39 g~~lD~ETTG--------l~~~~d~IIElg~v~f~~~~~G~i~~i~~~~~~~~dP~~pIp~~it~lTGItd~mV~Gq~id 110 (293) T PRK09182 39 GVILDTETTG--------LDPRKDEIIEIGMVAFEYDDDGRIGDVVDTFGGLQQPSRPIPPEITRLTGITDEMVAGQTID 110 (293) T ss_pred EEEEECCCCC--------CCCCCCEEEEEEEEEEEECCCCCEEEEECCEECCCCCCCCCCHHHHHHCCCCHHHHCCCCCC T ss_conf 9999530578--------89778779997689999869985877643231103899989987987518888996888279 Q ss_pred --HHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHH-HH-HC-CCCCHHHHHHHHC Q ss_conf --99986488981998630775999998629987865045458899-98-58-8999789999808 Q gi|254780127|r 57 --LLQYLRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLA-RT-NG-LPSSLKNACLALN 117 (675) Q Consensus 57 --l~~~L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A-~~-l~-~p~sL~~la~~lg 117 (675) ....+-++..++|+|||.||++|+.+.+-. +....|-|||--- |. .+ .+.+|+.++-.+| T Consensus 111 ~~~V~~~~~~adliiAHNA~FDR~F~E~~~p~-f~~k~WaCS~~~IdW~~~Gf~s~kLeyL~~~~G 175 (293) T PRK09182 111 PAEVDALIAPADLIIAHNAGFDRPFLERFSPV-FANKPWACSVSEIDWSARGFEGTKLGYLAGQAG 175 (293) T ss_pred HHHHHHHHCCCCEEEECCCCCCHHHHHHHCCC-CCCCCEEEECCCCCHHHCCCCCCCHHHHHHHCC T ss_conf 99999986449899981775467789850953-258750531637970322877415999998609 No 85 >cd05779 DNA_polB_epsilon_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase epsilon. DNA polymerase epsilon is a family-B DNA polymerase with a catalytic subunit that contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (alpha and delta are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase epsilon plays a role in elongating the leading strand during DNA replication. It is also involved in DNA repair. The catalytic subunit contains both polymerase and 3'-5' exonuclease activities. The N-terminal exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. DNA polymerase epsilon also carries a unique large C-terminal domain with an unknown function. Phylogenetic analyses indicate that it is ort Probab=98.14 E-value=8e-05 Score=50.07 Aligned_cols=147 Identities=19% Similarity=0.269 Sum_probs=74.6 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCE---------------------------EEEEECCCCCHHH Q ss_conf 7899980406788700046000026882999999983822---------------------------8999788842217 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEP---------------------------VKLWDRTEQSAMP 54 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~---------------------------~~~~d~~~~~~~~ 54 (675) +.+|+||||++... + -+-++.-.|..||+...+.. +.++.+..-..+. T Consensus 3 rilsfDIE~~~~~~----~-fP~~~~D~Ii~Is~~~~~~g~~i~~~~~~~~~i~~~~~~~~~~~~g~~~v~~~~~E~~ll 77 (204) T cd05779 3 RVLAFDIETTKLPL----K-FPDAETDQIMMISYMIDGQGYLIVNREIVSEDIEDFEYTPKPEYEGPFKVFNEPDEKALL 77 (204) T ss_pred EEEEEEEEEECCCC----C-CCCCCCCEEEEEEEEEECCCEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEECCCHHHHH T ss_conf 79999999507999----9-989888908999999916877987434013533000146785665764898369999999 Q ss_pred HHHHHHHC--CCCCEEEEEEH-HHHHHHHHH---HCCCCCCC------CC---------E-EEHHH-HHH--HHC-CCCC Q ss_conf 99999864--88981998630-775999998---62998786------50---------4-54588-999--858-8999 Q gi|254780127|r 55 SDLLQYLR--DETVMCVAHNS-LFERILFKK---TLGIDIPS------KR---------W-ICTSV-LAR--TNG-LPSS 108 (675) Q Consensus 55 ~~l~~~L~--~~~~~kv~HNa-~FD~~vL~~---~lGi~i~~------~~---------~-~DTmi-~A~--~l~-~p~s 108 (675) ..+.+++. +|+++ ++.|. .||++.|.. .+|+++.. .. . +|-+- +.+ .+. ..++ T Consensus 78 ~~f~~~i~~~~pdii-~gyN~~~FD~pyl~~Ra~~~~~~~~~~~~~~~~~~~~~~~~~~vhiDly~~v~rd~~l~~~sy~ 156 (204) T cd05779 78 QRFFEHIREVKPHII-VTYNGDFFDWPFVEARAAIHGLSMEEEIGFRKDSEGEYKSRYIIHMDCFRWVKRDSYLPQGSQG 156 (204) T ss_pred HHHHHHHHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCHHHHCCCCCCCCCCEECCCCEEEECHHHHHHHCCCCCCCCC T ss_conf 999999986499999-8558767767999999999599825531800067772762672563007765101217556877 Q ss_pred HHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7899998-08898652766376576641002332013453112587689998988775399999999 Q gi|254780127|r 109 LKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELF 174 (675) Q Consensus 109 L~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~ 174 (675) |+.+++. ||...... ..+.+..+ +..++ +.+.+||.+|+.+|+.|| T Consensus 157 Ln~Vs~~~Lg~~K~dv---~~~~m~~l--------------~~~~~---~~la~Y~~~Da~~Ty~l~ 203 (204) T cd05779 157 LKAVTKAKLGYDPVEL---DPEDMVPL--------------AREDP---QTLASYSVSDAVATYYLY 203 (204) T ss_pred HHHHHHHHHCCCCCCC---CHHHHHHH--------------HCCCH---HHHHHCCHHHHHHHHHHH T ss_conf 8999999968997508---99999999--------------82891---555524099999999976 No 86 >cd06144 REX4_like This subfamily is composed of RNA exonuclease 4 (REX4 or Rex4p), XPMC2, Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20), and similar proteins. REX4 is involved in pre-rRNA processing. It controls the ratio between the two forms of 5.8S rRNA in yeast. XPMC2 is a Xenopus gene which was identified through its ability to correct a mitotic defect in fission yeast. The human homolog of XPMC2 (hPMC2) may be involved in angiotensin II-induced adrenal cell cycle progression and cell proliferation. ISG20 is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. These proteins are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required fo Probab=98.04 E-value=5.3e-05 Score=51.29 Aligned_cols=60 Identities=23% Similarity=0.220 Sum_probs=37.3 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHH----CCCCCHHHHHHH-HCCCC Q ss_conf 9999986488981998630775999998629987865045458899985----889997899998-08898 Q gi|254780127|r 55 SDLLQYLRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLARTN----GLPSSLKNACLA-LNFSE 120 (675) Q Consensus 55 ~~l~~~L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l----~~p~sL~~la~~-lgl~~ 120 (675) .++.+++. +.++||||+.||+.+|. +..+....+||.....+. ....||+.+++. ||.+. T Consensus 69 ~~~~~~l~--~~ilVGH~l~~Dl~~L~----l~~~~~~i~DT~~~~~~~~~~~~~~~sLk~L~~~~Lg~~I 133 (152) T cd06144 69 KKVAELLK--GRILVGHALKNDLKVLK----LDHPKKLIRDTSKYKPLRKTAKGKSPSLKKLAKQLLGLDI 133 (152) T ss_pred HHHHHHHC--CCEEEECCHHHHHHHHH----CCCCCCEEEEEEEEHHHHHCCCCCCCCHHHHHHHHCCCCC T ss_conf 99999853--88798606464699962----4388760688556422454168899279999999858858 No 87 >smart00486 POLBc DNA polymerase type-B family. DNA polymerase alpha, delta, epsilon and zeta chain (eukaryota), DNA polymerases in archaea, DNA polymerase II in e. coli, mitochondrial DNA polymerases and and virus DNA polymerases Probab=98.03 E-value=0.00037 Score=45.71 Aligned_cols=157 Identities=18% Similarity=0.147 Sum_probs=76.9 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEE--------------------EEEECCCCCHHHHHHHHH Q ss_conf 978999804067887000460000268829999999838228--------------------999788842217999998 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPV--------------------KLWDRTEQSAMPSDLLQY 60 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~--------------------~~~d~~~~~~~~~~l~~~ 60 (675) .+.+++|+||+++.... +.+....-.|..++....+... .+..+..-..+...+... T Consensus 3 ~~i~s~DIEt~~~~~~~---p~~~~~~d~i~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~ll~~f~~~ 79 (471) T smart00486 3 LKILSFDIETYTDGGLF---PDPLIFEDEIIQISLVINDGDKKGPEERICFTLGTCKEIDGVEVYEFNNEKELLKAFLEF 79 (471) T ss_pred CEEEEEEEEECCCCCCC---CCCCCCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCCCCCCEEEECCCHHHHHHHHHHH T ss_conf 46999998966799999---998889984999999982198787887754303676778998899809999999999999 Q ss_pred HC--CCCCEEEEEEH-HHHHHHHHHH---CCCCC---------------------------------CCCCEEEHHHHHH Q ss_conf 64--88981998630-7759999986---29987---------------------------------8650454588999 Q gi|254780127|r 61 LR--DETVMCVAHNS-LFERILFKKT---LGIDI---------------------------------PSKRWICTSVLAR 101 (675) Q Consensus 61 L~--~~~~~kv~HNa-~FD~~vL~~~---lGi~i---------------------------------~~~~~~DTmi~A~ 101 (675) +. +++++ ++||. .||+.+|... +++.. +.....|++...+ T Consensus 80 i~~~dpdii-~g~N~~~FD~~yi~~R~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~D~~~~~~ 158 (471) T smart00486 80 IKKYDPDII-YGHNISNFDLPYIISRLEKLKIKPLSFIGRLKNIIDIKRKKPLFGSKSFGKTIKVKIKGRLVIDLYNLYK 158 (471) T ss_pred HHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCHHHHHCCCCCCCCCCEECCCCCCCCCCCEEEEEEEEEEEEEHHHHHH T ss_conf 987199999-9658878768999999998489647763735778764320332101256631489985599978899998 Q ss_pred H-HC-CCCCHHHHHHHHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8-58-8999789999808898652766376576641002332013453112587689998988775399999999872 Q gi|254780127|r 102 T-NG-LPSSLKNACLALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRL 177 (675) Q Consensus 102 ~-l~-~p~sL~~la~~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L 177 (675) - .. .+++|+.+++.+.- ..+ .+...+.+... +...++....+..||.+|+.++..|..++ T Consensus 159 ~~~~l~sy~L~~v~~~~l~-~~k-~d~~~~~i~~~--------------~~~~~~~~~~~~~Y~~~D~~l~~~l~~~~ 220 (471) T smart00486 159 NKLKLPSYKLDTVAEYLLG-KEK-DDLPYKDIPEL--------------YNLNYKLRDELLEYCIQDAVLTLKLFNKL 220 (471) T ss_pred HHHCCCCCCHHHHHHHHHC-CCC-CCCCHHHHHHH--------------HHCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8604466888999999957-788-88998999999--------------85393679999999999999999999986 No 88 >COG3359 Predicted exonuclease [DNA replication, recombination, and repair] Probab=98.02 E-value=0.00024 Score=46.88 Aligned_cols=156 Identities=17% Similarity=0.067 Sum_probs=91.9 Q ss_pred EEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEE--CCCCCH-HHHHHHHHHCCC-CCEEEEEEH-HHHH Q ss_conf 89998040678870004600002688299999998382289997--888422-179999986488-981998630-7759 Q gi|254780127|r 3 KLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWD--RTEQSA-MPSDLLQYLRDE-TVMCVAHNS-LFER 77 (675) Q Consensus 3 ~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d--~~~~~~-~~~~l~~~L~~~-~~~kv~HNa-~FD~ 77 (675) ..++||||+ |++. +...|...+.+++.+.+.++. +.++.. ....+..++-++ ...+|..|. .||+ T Consensus 100 ~~FFDiETT--------GL~~--ag~~I~~~g~a~~~~~~~~Vrq~~lp~p~~E~avle~fl~~~~~~~lvsfNGkaFD~ 169 (278) T COG3359 100 VAFFDIETT--------GLDR--AGNTITLVGGARGVDDTMHVRQHFLPAPEEEVAVLENFLHDPDFNMLVSFNGKAFDI 169 (278) T ss_pred EEEEEEECC--------CCCC--CCCEEEEEEEEECCCCEEEEEEECCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCC T ss_conf 589963314--------6688--898599998887357569997512798413568999975389845599746832670 Q ss_pred HHHHHHC--CCCCCCCC-EEEHHHHHHHHC----CCCCHHHHHHHHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9999862--99878650-454588999858----8999789999808898652766376576641002332013453112 Q gi|254780127|r 78 ILFKKTL--GIDIPSKR-WICTSVLARTNG----LPSSLKNACLALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRA 150 (675) Q Consensus 78 ~vL~~~l--Gi~i~~~~-~~DTmi~A~~l~----~p~sL~~la~~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~ 150 (675) ++++|.. .++..+.. =+|-|-.++-+. +..+|+.+-+.||.... .|..|..-.. .-..+..+ T Consensus 170 PfikR~v~~~~el~l~~~H~DL~h~~RRlwk~~l~~c~Lk~VEr~LGi~R~--edtdG~~~p~---------lyr~~~~~ 238 (278) T COG3359 170 PFIKRMVRDRLELSLEFGHFDLYHPSRRLWKHLLPRCGLKTVERILGIRRE--EDTDGYDGPE---------LYRLYRRY 238 (278) T ss_pred HHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCC--CCCCCCCHHH---------HHHHHHHC T ss_conf 899998751644476666312210344531202878881367888575501--1589850199---------99999870 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 58768999898877539999999987200 Q gi|254780127|r 151 NHVQAWQLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 151 ~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) .+|.....+..|+..|+.....|+..+.+ T Consensus 239 ~dp~ll~~l~~hN~eDvlnL~~i~~h~~~ 267 (278) T COG3359 239 GDPGLLDGLVLHNREDVLNLPTIIKHVSK 267 (278) T ss_pred CCHHHHHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 98777789887069988759999999999 No 89 >pfam00929 Exonuc_X-T Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III.; Probab=98.01 E-value=0.00013 Score=48.67 Aligned_cols=63 Identities=21% Similarity=0.232 Sum_probs=41.5 Q ss_pred HHHHHCCCCCEEEEEEHHHHHHHHHHHC---C-CCCC-CCCEEEHHHHHHHHC---CCCCHHHHHHHHCCCC Q ss_conf 9998648898199863077599999862---9-9878-650454588999858---8999789999808898 Q gi|254780127|r 57 LLQYLRDETVMCVAHNSLFERILFKKTL---G-IDIP-SKRWICTSVLARTNG---LPSSLKNACLALNFSE 120 (675) Q Consensus 57 l~~~L~~~~~~kv~HNa~FD~~vL~~~l---G-i~i~-~~~~~DTmi~A~~l~---~p~sL~~la~~lgl~~ 120 (675) +...+.+ ....|+||+.||..++.+.. + ...+ ...+.||+..++.+- ...+|+.+++.+|++. T Consensus 74 ~~~~l~~-~~~~v~~n~~fD~~~l~~~~~~~~~~~~~~~~~~~dt~~~~~~~~~~~~~~~L~~l~~~~~i~~ 144 (162) T pfam00929 74 FLEFLKK-LKILVGHNASFDVGFLLYDDLRFLKLPHPKLNDVIDTLILDKATYKGFKRRSLDALAEKLGLEK 144 (162) T ss_pred HHHHHCC-CCEEECCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCC T ss_conf 9998425-8577778877899999999998586336633330448999999750446799899998859999 No 90 >cd05783 DNA_polB_B1_exo The 3'-5' exonuclease domain of Sulfolobus solfataricus DNA polymerase B1 and similar archaeal proteins. B1 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B1displays thermostable polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family-B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Family-B DNA polymerases from thermophilic archaea are unique in that they are able to recognize the presence of uracil in the template strand, leading to the stalling of DNA synthes Probab=97.92 E-value=0.00015 Score=48.28 Aligned_cols=145 Identities=21% Similarity=0.168 Sum_probs=72.2 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCC--EEEEEEEEECCCEE--EEE-------------------ECCCCCHHHHHH Q ss_conf 9789998040678870004600002688--29999999838228--999-------------------788842217999 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQA--VITLCAYAWDDEPV--KLW-------------------DRTEQSAMPSDL 57 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~--~I~~i~~a~~~~~~--~~~-------------------d~~~~~~~~~~l 57 (675) +|++|+||||+|+..- | -+-.+.+ .|..|++.-.++.. .++ .+..-..+.... T Consensus 5 lrvlsfDIEv~s~~~~---~-FP~p~~~~~~iisI~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~~E~~LL~~F 80 (204) T cd05783 5 LKRIAIDIEVYTPIKG---R-IPDPKTAEYPVISVALAGSDGLKRVLVLKREGVEGLEGLLPEGAEVEFFDSEKELIREA 80 (204) T ss_pred CEEEEEEEEECCCCCC---C-CCCCCCCCCEEEEEEEECCCCCEEEEEEEECCCCCCCCCCCCCCEEEEECCHHHHHHHH T ss_conf 7499999887889999---7-97955588619999999279988999995087125765578998999959999999999 Q ss_pred HHHHCCCCCEEEEEEH-HHHHHHHHH---HCCCCC---CCC--C---------EEEH-------HHHHHHHC---CCCCH Q ss_conf 9986488981998630-775999998---629987---865--0---------4545-------88999858---89997 Q gi|254780127|r 58 LQYLRDETVMCVAHNS-LFERILFKK---TLGIDI---PSK--R---------WICT-------SVLARTNG---LPSSL 109 (675) Q Consensus 58 ~~~L~~~~~~kv~HNa-~FD~~vL~~---~lGi~i---~~~--~---------~~DT-------mi~A~~l~---~p~sL 109 (675) .+++.+.. ..+++|. .||++.|.. .+|+.- +.. + ..|- ++..+..+ .+++| T Consensus 81 ~~~i~~~p-~Ii~wN~~~FDiPYL~~R~~~Lg~~~~~~~~~~~~~~~~~~~~~~~dl~~~~~~~~~~~y~~~~k~~sY~L 159 (204) T cd05783 81 FKIISEYP-IVLTFNGDNFDLPYLYNRALKLGIPKEEIPIYLKRDYATLKHGIHIDLYKFFSNRAIQVYAFGNKYREYTL 159 (204) T ss_pred HHHHHCCC-EEEECCCCCCCHHHHHHHHHHHCCCCEEEEEEECCEEEEEEEEEEEEEHHHCCCCHHHHHHCCCCCCCCCH T ss_conf 99974399-79981786778899999999859982045667343699993449998646357313544120242122528 Q ss_pred HHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 899998-08898652766376576641002332013453112587689998988775399999999 Q gi|254780127|r 110 KNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELF 174 (675) Q Consensus 110 ~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~ 174 (675) +.+|+. ||-.... ..+ . .....++.+..||.+|++++.+|- T Consensus 160 d~Va~~~LGe~K~d---~~~-------------------~--~~~~d~~k~i~YNi~D~~Lv~~L~ 201 (204) T cd05783 160 DAVAKALLGEGKVE---LEK-------------------N--ISELNLYELAEYNYRDAELTLELT 201 (204) T ss_pred HHHHHHHHCCCCCC---CCC-------------------C--CCCCCHHHHHEEEHHHHHHHHHHH T ss_conf 89999983888600---676-------------------4--442586785514267767777862 No 91 >cd06133 ERI-1_3'hExo_like This subfamily is composed of Caenorhabditis elegans ERI-1, human 3' exonuclease (3'hExo), Drosophila exonuclease snipper (snp), and similar proteins from eukaryotes and bacteria. These are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ERI-1 has been implicated in the degradation of small interfering RNAs (RNAi). 3'hExo participates in the degradation of histone mRNAs. Snp is a non-essential exonuclease that efficiently degrades structured RNA and DNA substrates as long as there is a minimum of 2 nucleotides in the 3' overhang to initiate degradation. Snp is not a functional homolog of either ERI-1 or 3'hExo. Probab=97.77 E-value=0.00075 Score=43.65 Aligned_cols=86 Identities=16% Similarity=0.163 Sum_probs=52.9 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHH----HCCCCCC--CCCEEEHHHHHHH-HC--CCCCHHHHHHHHCCCCCCHHHH Q ss_conf 999986488981998630775999998----6299878--6504545889998-58--8999789999808898652766 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKK----TLGIDIP--SKRWICTSVLART-NG--LPSSLKNACLALNFSEHLTKME 126 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~----~lGi~i~--~~~~~DTmi~A~~-l~--~p~sL~~la~~lgl~~~~~kd~ 126 (675) ++.+.+.+.....+.||+.||..++.. +.+...+ ...|.||+-.++. +. .+.||+.+++++|++.... T Consensus 81 ~f~~~i~~~~~~~~~~~~~fD~~~l~~~~~~~~~~~~~~~~~~~iD~~~~~~~~~~~~~~~sL~~l~~~~gi~~~~~--- 157 (176) T cd06133 81 EFLEWLGKNGKYAFVTWGDWDLKDLLQNQCKYKIINLPPFFRQWIDLKKEFAKFYGLKKRTGLSKALEYLGLEFEGR--- 157 (176) T ss_pred HHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCC--- T ss_conf 99999726985799960600299999999997899887301112049999999818888989999999869999998--- Q ss_pred HCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 376576641002332013453112587689998988775399999999872 Q gi|254780127|r 127 EGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRL 177 (675) Q Consensus 127 ~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L 177 (675) ..-|..|+..|..++.+| T Consensus 158 ---------------------------------~H~AL~DA~~ta~v~~~l 175 (176) T cd06133 158 ---------------------------------HHRGLDDARNIARILKRL 175 (176) T ss_pred ---------------------------------CCCCHHHHHHHHHHHHHH T ss_conf ---------------------------------858599999999999987 No 92 >pfam10108 Exon_PolB Predicted 3'-5' exonuclease related to the exonuclease domain of PolB. This domain is found in various prokaryotic 3'-5' exonucleases and hypothetical proteins. Probab=97.76 E-value=0.0024 Score=40.26 Aligned_cols=131 Identities=18% Similarity=0.271 Sum_probs=77.7 Q ss_pred EEEEEEEE--ECCCEEEEEECCCCC----HHHHHHHHHHCCCCCEEEEEEH-HHHHHHHHH---HCCCCCCC-------- Q ss_conf 29999999--838228999788842----2179999986488981998630-775999998---62998786-------- Q gi|254780127|r 29 VITLCAYA--WDDEPVKLWDRTEQS----AMPSDLLQYLRDETVMCVAHNS-LFERILFKK---TLGIDIPS-------- 90 (675) Q Consensus 29 ~I~~i~~a--~~~~~~~~~d~~~~~----~~~~~l~~~L~~~~~~kv~HNa-~FD~~vL~~---~lGi~i~~-------- 90 (675) +|+.|+.. .+++...++.+...+ .+...+.+.++...-.+|.||. .||+++|.. .+|+..|. T Consensus 10 riVaIs~v~~~~~~~~~v~slg~~~~~E~~ll~~F~~~i~~~~P~LVs~NG~gFDlPvL~~RAl~~gi~ap~~~~~~~~~ 89 (211) T pfam10108 10 RIVAISAVIADDDGGFKVWSLGDPESSEKELIQRFFDGIEKYTPQLVSFNGRGFDLPVLHYRALKHGVSAPRYFDTGDRD 89 (211) T ss_pred EEEEEEEEEEECCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHCCCCCHHHHHCCCCC T ss_conf 16999999996699279987889998999999999999984398288656865548999999998188850787567876 Q ss_pred CCE-----------EEHHHHHHHHC--CCCCHHHHHHHHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 504-----------54588999858--89997899998088986527663765766410023320134531125876899 Q gi|254780127|r 91 KRW-----------ICTSVLARTNG--LPSSLKNACLALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQ 157 (675) Q Consensus 91 ~~~-----------~DTmi~A~~l~--~p~sL~~la~~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (675) .+| .|+|-.-...+ ...+|+.+|..+|++.+. +..|...... +. .-.++ T Consensus 90 ~kw~nY~~Ry~~~H~DLmd~l~~~g~~~~~~Ld~~a~l~G~PGK~--~~~G~~V~~~--------------~~--~G~l~ 151 (211) T pfam10108 90 FKWNNYFNRYSTRHLDLMDLLSGYGARANAPLDEVAALLGLPGKM--GVDGSQVWEL--------------YQ--AGKIE 151 (211) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHCCCCCCC--CCCHHHHHHH--------------HH--CCCHH T ss_conf 654441123565445489998732886778989999981999878--7764589999--------------98--59888 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 98988775399999999872 Q gi|254780127|r 158 LFGEYCKRDVEATRELFKRL 177 (675) Q Consensus 158 ~~~~Ya~~Dv~~~~~L~~~L 177 (675) ....||.-||..|+-+|.+. T Consensus 152 ~I~~YCe~DvlnTylv~LR~ 171 (211) T pfam10108 152 EIRDYCETDVLNTYLVYLRF 171 (211) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99999887689999999999 No 93 >KOG2206 consensus Probab=97.75 E-value=8.2e-05 Score=50.02 Aligned_cols=125 Identities=18% Similarity=0.121 Sum_probs=81.5 Q ss_pred EEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCC-CC Q ss_conf 99999998382289997888422179999986488981998630775999998629987865045458899985889-99 Q gi|254780127|r 30 ITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLARTNGLP-SS 108 (675) Q Consensus 30 I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~p-~s 108 (675) .-+++++.-.+ -+.+|...-..-..-+.+++.++++.||+|-+-=|+.||.+-+||.+ .+.+||..++++|++| .| T Consensus 231 tclmqISTr~e-d~iIDt~~l~~~i~~l~e~fsdp~ivkvfhgaD~diiwlqrdfgiyv--vnLfdt~~a~r~L~~~r~s 307 (687) T KOG2206 231 TCLMQISTRTE-DFIIDTFKLRDHIGILNEVFSDPGIVKVFHGADTDIIWLQRDFGIYV--VNLFDTIQASRLLGLPRPS 307 (687) T ss_pred EEEEEEECCCH-HHEEHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHCCCEEEE--EECHHHHHHHHHHCCCCCC T ss_conf 06899530420-31211478888887742210588747897347640343402304899--8511058899885799400 Q ss_pred HHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 7899998-088986527663765766410023320134531125876899989887753999999998720 Q gi|254780127|r 109 LKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLI 178 (675) Q Consensus 109 L~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~ 178 (675) |..+.+. .|+..++. . .+ ..|-...-| +.|..|+..|.-..+-||..|. T Consensus 308 L~~ll~~~~~v~~nk~-y----ql-------------adwR~rpLp---~~Mv~yar~dthyllyiyD~lr 357 (687) T KOG2206 308 LAYLLECVCGVLTNKK-Y----QL-------------ADWRIRPLP---EEMVRYAREDTHYLLYIYDVLR 357 (687) T ss_pred HHHHHHHHHHHHHHHH-H----HH-------------CHHCCCCCC---HHHHHHHHHCCHHHHHHHHHHH T ss_conf 9999999874531146-6----41-------------100036585---8898887512315899999999 No 94 >PTZ00166 DNA polymerase delta catalytic subunit; Provisional Probab=97.73 E-value=0.003 Score=39.65 Aligned_cols=109 Identities=17% Similarity=0.178 Sum_probs=61.1 Q ss_pred HHHHHHHHC--CCCCEEEEEEH-HHHHHHHHH---HCCCC----------------------------------CCCCCE Q ss_conf 799999864--88981998630-775999998---62998----------------------------------786504 Q gi|254780127|r 54 PSDLLQYLR--DETVMCVAHNS-LFERILFKK---TLGID----------------------------------IPSKRW 93 (675) Q Consensus 54 ~~~l~~~L~--~~~~~kv~HNa-~FD~~vL~~---~lGi~----------------------------------i~~~~~ 93 (675) +..+..++. ||+++ +|+|+ .||++.|.. ++++. +...-. T Consensus 360 L~aF~~fi~~~DPDII-TGYNI~nFDlpYLl~Ra~~L~i~~f~~lgRlk~~~~~~~~~~~~s~a~G~~~~k~i~i~GRv~ 438 (1081) T PTZ00166 360 LLAWNDFIIAVDPDFL-TGYNIINFDLPYLLNRATALNLKKFKFLTRIKSVKSTVKDSIFSSNQFGTHENKEINIEGRIL 438 (1081) T ss_pred HHHHHHHHHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCEEEEECCEEE T ss_conf 9999999987099899-966878888899999999958864332361157765220265665555666550686757899 Q ss_pred EEHHHHHHH-HC-CCCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 545889998-58-89997899998-0889865276637657664100233201345311258768999898877539999 Q gi|254780127|r 94 ICTSVLART-NG-LPSSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEAT 170 (675) Q Consensus 94 ~DTmi~A~~-l~-~p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~ 170 (675) +|.+...+- .. ..++|+.+++. ||-. |+....+.+. ..+..+++....+..||.+|+.+. T Consensus 439 ~Dl~~~v~r~~kL~SYkLntVs~~fLgeq----KeDV~~~~I~-------------~l~~~~~e~r~rla~YClkDa~L~ 501 (1081) T PTZ00166 439 FDVYDLIRRDYKLKSYSLNYVSFEFLKEQ----KEDVHYSTIN-------------KLQNGNPEDRKRIASYCLKDSILP 501 (1081) T ss_pred EEHHHHHHHHCCCCCCCHHHHHHHHHCCC----CCCCCHHHHH-------------HHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 64699998755878576888999981887----5337999999-------------998149378899999989989999 Q ss_pred HHHHHHHCCC Q ss_conf 9999872002 Q gi|254780127|r 171 RELFKRLIPL 180 (675) Q Consensus 171 ~~L~~~L~~~ 180 (675) ++|..+|.-+ T Consensus 502 lrL~~KL~~l 511 (1081) T PTZ00166 502 LRLIDKLLLL 511 (1081) T ss_pred HHHHHHHHHH T ss_conf 9999999899 No 95 >COG0417 PolB DNA polymerase elongation subunit (family B) [DNA replication, recombination, and repair] Probab=97.68 E-value=0.0023 Score=40.50 Aligned_cols=178 Identities=19% Similarity=0.109 Sum_probs=89.3 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEE------EE------EE-CCCCCHHHHHHHHHHC--CCC Q ss_conf 978999804067887000460000268829999999838228------99------97-8884221799999864--889 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPV------KL------WD-RTEQSAMPSDLLQYLR--DET 65 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~------~~------~d-~~~~~~~~~~l~~~L~--~~~ 65 (675) ++.+++|+||++.......+ ....+..++++.+.+.. +. +. +.....+...+..++. |++ T Consensus 154 l~~la~DiE~~~~~~~~~~~-----~~d~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~~~~e~e~l~~~~~~i~~~dPd 228 (792) T COG0417 154 LRVLAFDIETLSEPGKFPDG-----EKDPIIMISYAIEAEGGLIEVFIYTSGEGFSVEVVISEAELLERFVELIREYDPD 228 (792) T ss_pred CCEEEEEEEECCCCCCCCCC-----CCCCEEEEEEEECCCCCCCCCCCCCCCCCCCEEEECCHHHHHHHHHHHHHHCCCC T ss_conf 75479950310577788987-----7786269999704677764443223567763487258799999999998502989 Q ss_pred CEEEEEEHH-HHHHHHHHH---CCCCCCC-------------------CCEEEHHHHHH--HHC-CCCCHHHHHHHHCCC Q ss_conf 819986307-759999986---2998786-------------------50454588999--858-899978999980889 Q gi|254780127|r 66 VMCVAHNSL-FERILFKKT---LGIDIPS-------------------KRWICTSVLAR--TNG-LPSSLKNACLALNFS 119 (675) Q Consensus 66 ~~kv~HNa~-FD~~vL~~~---lGi~i~~-------------------~~~~DTmi~A~--~l~-~p~sL~~la~~lgl~ 119 (675) ++ ++||.. ||++.|... +|++... ....|..-... ... ..+||+.++..+... T Consensus 229 VI-vgyn~~~fd~pyl~~Ra~~lgi~~~~gr~~~~~~~~~~~~~~~~Gr~~iDl~~~~~~~~~~~~~ysl~~v~~~~l~~ 307 (792) T COG0417 229 VI-VGYNGDNFDWPYLAERAERLGIPLRLGRDGSELRVRKSGFSSQVGRLHIDLYPALRRRPLNLKSYSLEAVSEALLGE 307 (792) T ss_pred EE-EEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCEEEEEHHHHHHHCCCCCCCEEHHHHHHHHCCC T ss_conf 99-98367777738999999981998511356653200212544557716741066542032355632434677664133 Q ss_pred CCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCE Q ss_conf 86527663765766410023320134531125876899989887753999999998720021431017899998751684 Q gi|254780127|r 120 EHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIPLSDGERDLWLLDQTINDRGY 199 (675) Q Consensus 120 ~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~~~~E~~l~~~~~~me~~Gi 199 (675) . +..+.. +... ...+...+..+.....|+..|.+++.++..+..-+. .+.......|+ T Consensus 308 ~-k~~~~~------~~~~--------~~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~~~~l~-------~~~e~s~vsg~ 365 (792) T COG0417 308 G-KREDIP------YDSM--------EEIWPDWADSKLRLLLYNLSDADLVLRILLKNELLP-------LLIELSRVSGL 365 (792) T ss_pred C-CHHCCC------CCHH--------HHCCCCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHH-------HHHHHHHHHCC T ss_conf 5-220056------2013--------330423551076888834107889999986666789-------99999998498 Q ss_pred ECCHHHH Q ss_conf 3088999 Q gi|254780127|r 200 RIDLDLV 206 (675) Q Consensus 200 ~vD~e~l 206 (675) ++|.-.. T Consensus 366 ~~~~~~~ 372 (792) T COG0417 366 PLDDVTR 372 (792) T ss_pred CHHHHHH T ss_conf 7999861 No 96 >COG0749 PolA DNA polymerase I - 3'-5' exonuclease and polymerase domains [DNA replication, recombination, and repair] Probab=97.62 E-value=8.7e-05 Score=49.85 Aligned_cols=63 Identities=10% Similarity=0.023 Sum_probs=52.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHC Q ss_conf 666665666765478899999999987511101031088978899999986730789999999999999633 Q gi|254780127|r 463 SQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQE 534 (675) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~ 534 (675) ++..+...||++||+++.|+..+...+++.|++.++.++ |+..|.|.+.....+.++.++.-| T Consensus 448 eA~~~I~~YF~rypgv~~ym~~~~~~ar~~GyV~Tl~gR---------Rry~p~i~s~n~~~R~~aER~AiN 510 (593) T COG0749 448 EAKEYIDRYFERYPGVKEYMERTKEEAREDGYVETLFGR---------RRYLPDINSSNRVVRAAAERAAIN 510 (593) T ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEEECCCC---------CCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 889999999986868999999999999974935410266---------024862101778888788898835 No 97 >cd06138 ExoI_N This subfamily is composed of the N-terminal domain of E. coli exonuclease I (ExoI) and similar proteins. ExoI is a monomeric enzyme that hydrolyzes single stranded DNA in the 3' to 5' direction. It plays a role in DNA recombination and repair. It primarily functions in repairing frameshift mutations. The N-terminal domain of ExoI is a DEDDh-type DnaQ-like 3'-5 exonuclease containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The ExoI structure is unique among DnaQ family enzymes in that there is a large distance between the two metal ions required for catalysis and the catalytic histidine is oriented away from the active site. Probab=97.58 E-value=0.00078 Score=43.54 Aligned_cols=65 Identities=11% Similarity=0.017 Sum_probs=36.6 Q ss_pred HHHHHHCCCCCEEEEEE-HHHHHHHHHHHC---CCCCC------CCCEEEHHHHHHH---HC-------------CCCCH Q ss_conf 99998648898199863-077599999862---99878------6504545889998---58-------------89997 Q gi|254780127|r 56 DLLQYLRDETVMCVAHN-SLFERILFKKTL---GIDIP------SKRWICTSVLART---NG-------------LPSSL 109 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HN-a~FD~~vL~~~l---Gi~i~------~~~~~DTmi~A~~---l~-------------~p~sL 109 (675) ++...+.....+.|||| +.||..+|...+ |++.. ....+|++=.+++ +. ...+| T Consensus 75 ~i~~~~~~~~~i~vg~N~~~FD~~fL~~~~~r~~~~p~~~~~~~~~~~~D~l~~~r~~~~~~p~~i~~~~~~~g~~s~kL 154 (183) T cd06138 75 KIHRLFNTPGTCIVGYNNIRFDDEFLRFAFYRNLYDPYTWEWKNGNSRWDLLDVVRAYYALRPDGIVWPKNDDGKPSFKL 154 (183) T ss_pred HHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCH T ss_conf 99998616997799845355689999999998278715667527871230788999999978201447655689741129 Q ss_pred HHHHHHHCCCC Q ss_conf 89999808898 Q gi|254780127|r 110 KNACLALNFSE 120 (675) Q Consensus 110 ~~la~~lgl~~ 120 (675) +.+++.+|++. T Consensus 155 ~~l~~~~gi~~ 165 (183) T cd06138 155 EDLAQANGIEH 165 (183) T ss_pred HHHHHHCCCCC T ss_conf 99999849999 No 98 >KOG1798 consensus Probab=97.38 E-value=0.0059 Score=37.72 Aligned_cols=149 Identities=21% Similarity=0.251 Sum_probs=80.9 Q ss_pred CEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEE--------EC-----CCC--------------C-HH Q ss_conf 78999804067887000460000268829999999838228999--------78-----884--------------2-21 Q gi|254780127|r 2 SKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLW--------DR-----TEQ--------------S-AM 53 (675) Q Consensus 2 k~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~--------d~-----~~~--------------~-~~ 53 (675) +.+|+||||+ ..+|+= +-++.-+|..|+|-+|+ ..+.| |+ ++. + .+ T Consensus 247 ~VlAFDIETt-KlPLKF----PDae~DqIMMISYMiDG-qGfLItNREiVs~DIedfEYTPKpE~eG~F~v~Ne~dEv~L 320 (2173) T KOG1798 247 RVLAFDIETT-KLPLKF----PDAESDQIMMISYMIDG-QGFLITNREIVSEDIEDFEYTPKPEYEGPFCVFNEPDEVGL 320 (2173) T ss_pred EEEEEEEECC-CCCCCC----CCCCCCEEEEEEEEECC-CEEEEECHHHHCCCHHHCCCCCCCCCCCCEEEECCCCHHHH T ss_conf 5899851035-578779----98665607899999647-46899622343052122236886325254588668768899 Q ss_pred HHHHHHHHC--CCCCEEEEEE-HHHHHHHHHH---HCCCCCCC-CCE------------EEHHHH-HHHH-C--C---CC Q ss_conf 799999864--8898199863-0775999998---62998786-504------------545889-9985-8--8---99 Q gi|254780127|r 54 PSDLLQYLR--DETVMCVAHN-SLFERILFKK---TLGIDIPS-KRW------------ICTSVL-ARTN-G--L---PS 107 (675) Q Consensus 54 ~~~l~~~L~--~~~~~kv~HN-a~FD~~vL~~---~lGi~i~~-~~~------------~DTmi~-A~~l-~--~---p~ 107 (675) ...+.+++. .|.++ |..| ..||++|+.+ +.|+++.. ..+ -+-|-. .|+- + + ++ T Consensus 321 l~RfFeHiq~~kP~ii-vTyNGDFFDWPFve~Ra~~hGi~m~eEiGF~~D~~gEyks~~c~HmDcfrWVKRDSYLPqGSq 399 (2173) T KOG1798 321 LQRFFEHIQEVKPTII-VTYNGDFFDWPFVEARAKIHGISMNEEIGFRRDSQGEYKSPFCIHMDCFRWVKRDSYLPQGSQ 399 (2173) T ss_pred HHHHHHHHHHCCCCEE-EEECCCCCCCHHHHHHHHHCCCCCCHHCCCEECCCCCCCCCCEEEHHHHHHHHHCCCCCCCCC T ss_conf 9999999986189679-973686446666677887507771121085206533312563211266666520465777663 Q ss_pred CHHHHHH-HHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9789999-808898652766376576641002332013453112587689998988775399999999872 Q gi|254780127|r 108 SLKNACL-ALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRL 177 (675) Q Consensus 108 sL~~la~-~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L 177 (675) +|+++.+ .||++.... |.+ .+.++. .+.-+.+..|..-|+..|+=||.+. T Consensus 400 gLKAVTkaKLGYdPvEv-dPE--dM~~~A-----------------~EkPQ~lasYSVSDAVATYyLYMkY 450 (2173) T KOG1798 400 GLKAVTKAKLGYDPVEV-DPE--DMVRMA-----------------MEKPQTLASYSVSDAVATYYLYMKY 450 (2173) T ss_pred CHHHHHHHHHCCCCCCC-CHH--HHHHHH-----------------HHCCHHHHHCCHHHHHHHHHHHHHH T ss_conf 12688887507894437-988--854666-----------------5082565300057778888898987 No 99 >TIGR00573 dnaq exonuclease, DNA polymerase III, epsilon subunit family; InterPro: IPR006054 All proteins in this entry for which functions are known are components of the DNA polymerase III complex (epsilon subunit). There is, however, an outgroup that includes paralogs in some gamma-proteobacteria and the N-terminal region of DinG from some low GC Gram-positive bacteria. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=97.27 E-value=0.0021 Score=40.65 Aligned_cols=90 Identities=17% Similarity=0.237 Sum_probs=57.9 Q ss_pred HHHHHHHHHHCCCCCEEEEEEHHHHHHHHHHH---CCCCC--CCCCEEEHHHHHHHH--CCC---CCHHHHHHHHCCCCC Q ss_conf 21799999864889819986307759999986---29987--865045458899985--889---997899998088986 Q gi|254780127|r 52 AMPSDLLQYLRDETVMCVAHNSLFERILFKKT---LGIDI--PSKRWICTSVLARTN--GLP---SSLKNACLALNFSEH 121 (675) Q Consensus 52 ~~~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~---lGi~i--~~~~~~DTmi~A~~l--~~p---~sL~~la~~lgl~~~ 121 (675) .+..++.++|. +..+|+|||.||+.||.+. +|... ....+.||.-+.+++ ..| .+|+.+|.+++++.. T Consensus 77 eia~dF~~~I~--g~~Lv~HNA~FD~GFl~~e~~~LG~~~~~~~~~v~~~~~~~~~~~~~~p~~~~~Ld~L~~~~~~~~s 154 (228) T TIGR00573 77 EIAEDFADYIK--GAVLVIHNASFDVGFLNYEFSKLGLKKEPKTNDVIDTTDTLQAARPEFPGKRNTLDALALRYEITNS 154 (228) T ss_pred HHHHHHHHHHC--CCEEEEEHHHCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCHH T ss_conf 78899999837--9478752003057789999985145011001144449999999972058843027777878054403 Q ss_pred CHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 527663765766410023320134531125876899989887753999999998720 Q gi|254780127|r 122 LTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLI 178 (675) Q Consensus 122 ~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~ 178 (675) .. ...-|..|+.++..+|..+. T Consensus 155 ~R-----------------------------------~~H~A~~DA~~l~~~~~~~~ 176 (228) T TIGR00573 155 HR-----------------------------------ALHGALLDAFILAKLYLVMT 176 (228) T ss_pred HH-----------------------------------HHHHHHHHHHHHHHHHHHHH T ss_conf 77-----------------------------------77317788999999999998 No 100 >PRK05601 DNA polymerase III subunit epsilon; Validated Probab=97.01 E-value=0.0081 Score=36.81 Aligned_cols=29 Identities=10% Similarity=-0.032 Sum_probs=20.6 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHHC Q ss_conf 7999998648898199863077599999862 Q gi|254780127|r 54 PSDLLQYLRDETVMCVAHNSLFERILFKKTL 84 (675) Q Consensus 54 ~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~l 84 (675) ..+|.+.|. +..+|+||+.||+.||...+ T Consensus 116 a~~l~~lLd--GRtLVaHNa~fdw~Fia~Ea 144 (377) T PRK05601 116 LKSLDRLID--GRTLLLHNAPRTWGFIVSEA 144 (377) T ss_pred HHHHHHHHC--CCEEEEECCCCHHHHHHHHH T ss_conf 999999845--86799835602088999999 No 101 >cd05778 DNA_polB_zeta_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase zeta. DNA polymerase zeta is a family-B DNA polymerase which is distantly related to DNA polymerase delta. It plays a major role in translesion replication and the production of either spontaneous or induced mutations. In addition, DNA polymerase zeta also appears to be involved in somatic hypermutability in B lymphocytes, an important element for the production of high affinity antibodies in response to an antigen. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The DnaQ-like 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. Probab=96.98 E-value=0.015 Score=35.13 Aligned_cols=152 Identities=18% Similarity=0.184 Sum_probs=74.9 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCE---------EEEEEC------------------------ Q ss_conf 97899980406788700046000026882999999983822---------899978------------------------ Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEP---------VKLWDR------------------------ 47 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~---------~~~~d~------------------------ 47 (675) |+.+++|+|..+..++. +++ +.=.|.+|.+++.+.. ..++-+ T Consensus 4 LtiLSfDIe~~sr~~~f---PdP--~~D~I~~I~~~~~~~~~~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~~v~~~ 78 (231) T cd05778 4 LTILSLEVHVNTRGDLL---PDP--EFDPISAIFYCIDDDVSPFILDANKVGVIIVDELKSNASNGRIRSGLSGIPVEVV 78 (231) T ss_pred EEEEEEEEEECCCCCCC---CCC--CCCCEEEEEEEECCCCCCCCCCCCCEEEEEECCCCCCHHHCCCCCCCCCCEEEEE T ss_conf 28999999988889999---698--9785789999962687654456652007995586541121022257899649997 Q ss_pred CCCCHHHHHHHHHHC--CCCCEEEEEEH-HHHHHHHH-H--HCCCC---------------------------------C Q ss_conf 884221799999864--88981998630-77599999-8--62998---------------------------------7 Q gi|254780127|r 48 TEQSAMPSDLLQYLR--DETVMCVAHNS-LFERILFK-K--TLGID---------------------------------I 88 (675) Q Consensus 48 ~~~~~~~~~l~~~L~--~~~~~kv~HNa-~FD~~vL~-~--~lGi~---------------------------------i 88 (675) ..-..+...+..++. ||+++ +|.|+ .||++.|. | ++|+. + T Consensus 79 ~~E~eLl~~f~~~v~~~DPDii-~Gyni~~fd~~Yl~~Ra~~l~~~~~~~~lsR~~~~~~~~~~~~~~~~g~~~~~~i~i 157 (231) T cd05778 79 ESELELFEELIDLVRRFDPDIL-SGYEIQRSSWGYLIERAAALGIDDLLDEISRVPSDSNGKFGDRDDEWGYTHTSGIKI 157 (231) T ss_pred CCHHHHHHHHHHHHHHHCCCEE-EECCCCCCCCHHHHHHHHHHCCCHHHHHCCCCCCCCCCEECCCCCCCCCCCCCEEEE T ss_conf 9999999999999997099989-622556686189999999858620466607787665421023432355644415763 Q ss_pred CCCCEEEHHHHHH-HHC-CCCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 8650454588999-858-89997899998-08898652766376576641002332013453112587689998988775 Q gi|254780127|r 89 PSKRWICTSVLAR-TNG-LPSSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKR 165 (675) Q Consensus 89 ~~~~~~DTmi~A~-~l~-~p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~ 165 (675) +..-.+|.+-.-+ -+. ..++|+.++.. ||-... +..-+.+..++. ...+........||.+ T Consensus 158 ~GRi~lDl~~~~r~e~kL~sYsLe~V~~~~L~e~k~---~~~~~~l~~~~~-------------~g~~~~r~~v~~Yclk 221 (231) T cd05778 158 VGRHILNVWRLMRSELALTNYTLENVVYHVLHQRIP---LYSNKTLTEWYK-------------SGSASERWRVLEYYLK 221 (231) T ss_pred CCEEEEEHHHHHHHHCCCCCCCHHHHHHHHHCCCCC---CCCHHHHHHHHH-------------CCCHHHHHHHHEEEHH T ss_conf 418998519999753584438799999999679888---899999999997-------------4998785164400075 Q ss_pred HHHHHHHHH Q ss_conf 399999999 Q gi|254780127|r 166 DVEATRELF 174 (675) Q Consensus 166 Dv~~~~~L~ 174 (675) |+.+.++|. T Consensus 222 d~~l~l~Ll 230 (231) T cd05778 222 RVRLNLEIL 230 (231) T ss_pred HHHHHHHHH T ss_conf 305158652 No 102 >PRK06722 exonuclease; Provisional Probab=96.90 E-value=0.033 Score=32.77 Aligned_cols=53 Identities=8% Similarity=0.021 Sum_probs=34.4 Q ss_pred CHHHHHHHHHHCC-CCCEEEEEEHHHHHHHHHH---HCCCCCCCCCEEEHHHHHHHH Q ss_conf 2217999998648-8981998630775999998---629987865045458899985 Q gi|254780127|r 51 SAMPSDLLQYLRD-ETVMCVAHNSLFERILFKK---TLGIDIPSKRWICTSVLARTN 103 (675) Q Consensus 51 ~~~~~~l~~~L~~-~~~~kv~HNa~FD~~vL~~---~lGi~i~~~~~~DTmi~A~~l 103 (675) .++.+.+-+.|+- .+..+|+||++||..||.+ ..|++.|.....||.-+++.+ T Consensus 36 P~ieeVLp~FleFIGd~~LVAHNAsFD~gFL~~~C~~hg~e~P~~~~~~~~dl~k~v 92 (242) T PRK06722 36 EKFPQIIEKFIQFIGEDSIFVTWGKEDYRFLSHDCTLHGVECPCMEKERRIDLQKFV 92 (242) T ss_pred CCHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHH T ss_conf 978999999999827883897344022779997752428999998421088899998 No 103 >cd06149 ISG20 Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20) is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. It was also independently identified by its response to estrogen and was called HEM45 (human estrogen regulated transcript). ISG20 is a DEDDh-type DnaQ-like 3'-5' exonuclease containing three conserved sequence motifs termed ExoI, ExoII and ExoIII with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ISG20 may be a major effector of innate immunity against pathogens including viruses, bacteria, and parasites. It is located in promyelocytic leukemia (PML) nuclear bodies, sites for oncogenic DNA viral transcription and replication. It may carry out its function by degrading viral RNAs as part of the IFN-regulated antiviral r Probab=96.78 E-value=0.016 Score=34.86 Aligned_cols=60 Identities=27% Similarity=0.241 Sum_probs=37.0 Q ss_pred HHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHH---HHHHHCCC----CCHHHHHHH-HCCCC Q ss_conf 99999864889819986307759999986299878650454588---99985889----997899998-08898 Q gi|254780127|r 55 SDLLQYLRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSV---LARTNGLP----SSLKNACLA-LNFSE 120 (675) Q Consensus 55 ~~l~~~L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi---~A~~l~~p----~sL~~la~~-lgl~~ 120 (675) .++.+++.+ .++||||+.||+.+|. +.-|...++||-. ..+..+.| .||+.+++. ||.+. T Consensus 69 ~~l~~~l~~--~IlVGH~L~nDL~aL~----l~h~~~~i~DT~~~~~~~~~~~~p~~~~~sLk~L~~~~L~~~I 136 (157) T cd06149 69 KEILKILKG--KVVVGHAIHNDFKALK----YFHPKHMTRDTSTIPLLNRKAGFPENCRVSLKVLAKRLLHRDI 136 (157) T ss_pred HHHHHHHCC--CEEEEECCHHHHHHHH----CCCCCCCEEECCCCEEHHHHCCCCCCCCHHHHHHHHHHHCCCC T ss_conf 999997169--8899826076698860----3389872677711100133248987668759999999848502 No 104 >PRK08928 consensus Probab=96.73 E-value=0.002 Score=40.87 Aligned_cols=67 Identities=4% Similarity=-0.073 Sum_probs=52.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHC Q ss_conf 6403666665666765478899999999987511101031088978899999986730789999999999999633 Q gi|254780127|r 459 LQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQE 534 (675) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~ 534 (675) ++..++..+...||++||+++.|+..+...+++.|++.++.++ |+..|+|.+-...++.++.+..-| T Consensus 714 i~~~eA~~~i~~Yf~~yp~v~~~~~~~~~~a~~~Gyv~Tl~gR---------rr~~p~i~s~n~~~r~~~eR~A~N 780 (861) T PRK08928 714 ITNSEASEYIKKYFAEYPEIQEYMEQTKAYARSNGYVTTFFGR---------KCFIPLINSKKHTLRQFAERAAIN 780 (861) T ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCCC---------EECCCCCCCCCHHHHHHHHHHHHC T ss_conf 8999999999999996814999999999999977987688886---------032875557787788589999975 No 105 >PRK07300 consensus Probab=96.60 E-value=0.0023 Score=40.46 Aligned_cols=68 Identities=6% Similarity=-0.039 Sum_probs=55.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 64036666656667654788999999999875111010310889788999999867307899999999999996337 Q gi|254780127|r 459 LQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEG 535 (675) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~ 535 (675) ++..++..+...||++||+++.|+..+...+++.|++.++.++ |+..|+|.+-...++.++.++.-|- T Consensus 731 i~~~eA~~~I~~Yf~~yp~v~~~~~~~~~~a~~~Gyv~Tl~gR---------rr~ip~i~s~n~~~r~~~eR~AiN~ 798 (880) T PRK07300 731 ITRKQAKSYIDTYFERYPGIKNYMENVVREAKDKGYVETLFKR---------RRELPDINSRNFNVRSFAERTAINS 798 (880) T ss_pred CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCEECCCCC---------EECCCCCCCCCHHHHHHHHHHHHCC T ss_conf 9999999999999996910999999999999976887777788---------1358986788877887998988160 No 106 >PRK06887 consensus Probab=96.59 E-value=0.0028 Score=39.86 Aligned_cols=66 Identities=9% Similarity=0.034 Sum_probs=44.8 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHC Q ss_conf 403666665666765478899999999987511101031088978899999986730789999999999999633 Q gi|254780127|r 460 QQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQE 534 (675) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~ 534 (675) +..++..+...||+.||+++.|+..+...+++.|++.++.++ |+..|+|.+-....+.++.++.-| T Consensus 807 ~~~eA~~~i~~yf~~~~~v~~~~~~~~~~a~~~Gyv~Tl~GR---------rr~~p~i~s~n~~~r~~~eR~aiN 872 (954) T PRK06887 807 SRADAQKYMDLYFQRYPSVQQFMTDIREKAKAQGYVETLFGR---------RLYLPDINSSNAMRRKGAERVAIN 872 (954) T ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCCC---------CCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 999999999999886900999999999999977946888887---------306876668998899789999837 No 107 >PRK07898 consensus Probab=96.55 E-value=0.0033 Score=39.38 Aligned_cols=60 Identities=10% Similarity=0.004 Sum_probs=27.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 666665666765478899999999987511101031088978899999986730789999999999999 Q gi|254780127|r 463 SQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACV 531 (675) Q Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~ 531 (675) ++..+...||++||+++.|+..+...+++.|++.++.++ |+..|+|.+-....+.++.++ T Consensus 756 eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~gyv~Tl~gR---------rr~~p~i~s~n~~~r~~~eR~ 815 (902) T PRK07898 756 EAKELMDAYFARFGGVRDYLREVVEQARKDGYTETLFGR---------RRYLPDLTSDNRQRREAAERA 815 (902) T ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCEEECCCCC---------CCCCCCCCCCCHHHHHHHHHH T ss_conf 999999999986906999999999999977906787877---------113887778897789899999 No 108 >PRK07625 consensus Probab=96.54 E-value=0.0036 Score=39.18 Aligned_cols=67 Identities=6% Similarity=-0.091 Sum_probs=47.4 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 4036666656667654788999999999875111010310889788999999867307899999999999996337 Q gi|254780127|r 460 QQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEG 535 (675) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~ 535 (675) +..++..+...||++||+++.|+..+...+++.|++.++.++ |+..|+|.+-....+.++.++.-|- T Consensus 774 ~~~eA~~~i~~yf~~y~~v~~~~~~~~~~a~~~gyv~Tl~gR---------rr~~p~i~s~n~~~r~~~eR~AiN~ 840 (922) T PRK07625 774 TRDAAKLYIDRYFARYPGVARYMEETRMRAKEKGYVETVFGR---------RLWLPEINGGNGPRRQAAERAAINA 840 (922) T ss_pred CHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCEEECCCCC---------CCCCCCCCCCCHHHHHHHHHHHHCC T ss_conf 999999999999986912999999999999877926677786---------0317655688878887999999650 No 109 >PRK07556 consensus Probab=96.53 E-value=0.0031 Score=39.61 Aligned_cols=28 Identities=21% Similarity=0.191 Sum_probs=15.1 Q ss_pred EEEEEECC-CEEEEEEC-----CCCCHHHHHHHH Q ss_conf 99999838-22899978-----884221799999 Q gi|254780127|r 32 LCAYAWDD-EPVKLWDR-----TEQSAMPSDLLQ 59 (675) Q Consensus 32 ~i~~a~~~-~~~~~~d~-----~~~~~~~~~l~~ 59 (675) .++++||. ++++--.+ ++..+.|++|.. T Consensus 63 ~iavaFD~~~~TFRhe~y~~YKA~R~~~PedL~~ 96 (977) T PRK07556 63 HLAVIFDYSSKTFRNEIYPQYKANRPAPPEDLIP 96 (977) T ss_pred EEEEEEECCCCCCCHHHHHHHCCCCCCCCHHHHH T ss_conf 8999980889960122205652799999889999 No 110 >PRK08786 consensus Probab=96.53 E-value=0.0034 Score=39.34 Aligned_cols=64 Identities=8% Similarity=-0.061 Sum_probs=36.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 0366666566676547889999999998751110103108897889999998673078999999999999963 Q gi|254780127|r 461 QFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQ 533 (675) Q Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~ 533 (675) ..++..+...||++||+++.|+..+...+++.|++.++.++ |+..|.|.+-...++.++.++.- T Consensus 780 ~~eA~~~i~~yf~~y~~v~~~~~~~~~~a~~~gyv~Tl~gR---------rr~~p~i~s~n~~~r~~aeR~A~ 843 (927) T PRK08786 780 RGEAQDYIALYFSRYPGVRDFMETTRQQARDKGYVETVFGR---------RLYLDFINAGSQGQRAGAERAAI 843 (927) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEEECCCCC---------CCCCCCCCCCCHHHHHHHHHHHH T ss_conf 99999999999996910999999999999877917888886---------03288667888778878999985 No 111 >PRK05755 DNA polymerase I; Provisional Probab=96.53 E-value=0.0027 Score=39.99 Aligned_cols=68 Identities=10% Similarity=-0.076 Sum_probs=54.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 40366666566676547889999999998751110103108897889999998673078999999999999963372 Q gi|254780127|r 460 QQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQEGG 536 (675) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~ 536 (675) +..++..+...||++||+++.|+..+...+++.|++.++.++ |+..|+|.+-....+.++.++.-|.. T Consensus 742 s~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~Gyv~Tl~gR---------rr~~p~i~s~n~~~r~~~eR~AiN~p 809 (889) T PRK05755 742 SRKEAKEYIDRYFERYPGVKEYMEETVAQAKEKGYVETLFGR---------RRYLPDINSSNGARRAFAERAAINAP 809 (889) T ss_pred CHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCEECCCCC---------EEECCCCCCCCHHHHHHHHHHHHCCC T ss_conf 999999999999996810999999999999977977787787---------05278677899889989999996756 No 112 >PRK08835 consensus Probab=96.47 E-value=0.0038 Score=39.03 Aligned_cols=62 Identities=6% Similarity=-0.056 Sum_probs=32.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 36666656667654788999999999875111010310889788999999867307899999999999996 Q gi|254780127|r 462 FSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVV 532 (675) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~ 532 (675) .++..+...||+.||+++.|+..+...+++.|++.++.++ |+..|+|.+-....+.++.++. T Consensus 784 ~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~g~v~Tl~gR---------rr~~p~i~s~n~~~r~~~eR~a 845 (931) T PRK08835 784 GEAQAYMDKYFERYPGVMQYMEDTRSRAAQQGYVETIFGR---------RLHLPEITSRNGMRRKAAERAA 845 (931) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCCC---------CCCCCCCCCCCHHHHHHHHHHH T ss_conf 9999999999886822999999999999877917887887---------3148865688877886899999 No 113 >PRK08076 consensus Probab=96.41 E-value=0.0045 Score=38.54 Aligned_cols=67 Identities=7% Similarity=-0.071 Sum_probs=52.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHC Q ss_conf 6403666665666765478899999999987511101031088978899999986730789999999999999633 Q gi|254780127|r 459 LQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVVQE 534 (675) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~~~ 534 (675) ++..++..+...||++||+++.|+..+...+++.|++.++.++ |+..|+|.+-...++.++.+..-| T Consensus 728 i~~~eA~~~i~~Yf~~yp~v~~~~~~~~~~a~~~Gyv~Tl~gR---------rr~~p~i~s~n~~~r~~~eR~A~N 794 (877) T PRK08076 728 ITRKEAAEFIERYFESFPGVKEYMEDIVQEAKQKGYVTTLLHR---------RRYIPEITSRNFNLRSFAERTAMN 794 (877) T ss_pred CCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCC---------EEECCCCCCCCHHHHHHHHHHHHC T ss_conf 8999999999999997824999999999999976888646777---------030876557787788889999936 No 114 >PRK05929 consensus Probab=96.36 E-value=0.0064 Score=37.51 Aligned_cols=64 Identities=6% Similarity=-0.049 Sum_probs=45.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 64036666656667654788999999999875111010310889788999999867307899999999999996 Q gi|254780127|r 459 LQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVV 532 (675) Q Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~ 532 (675) ++..++..+...||++||+++.|+..+...+++.|++.++.++ |+..|++.+-. ..+.++.++. T Consensus 723 is~~eA~~~i~~Yf~~yp~v~~y~~~~~~~a~~~Gyv~Tl~gR---------rr~ip~i~~~~-~~r~~~eR~A 786 (870) T PRK05929 723 ISVSEAQGLIDAYFARYPEAAEFITETIEQASKNLRVTTMLGR---------ERIIDDWEEFP-GSRAASGRLA 786 (870) T ss_pred CCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCC---------CCCCCCCCCCH-HHHHHHHHHH T ss_conf 8999999999999986912999999999999976967676886---------00288656774-5677888988 No 115 >PRK07997 consensus Probab=96.32 E-value=0.0045 Score=38.50 Aligned_cols=64 Identities=11% Similarity=-0.057 Sum_probs=34.6 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 4036666656667654788999999999875111010310889788999999867307899999999999996 Q gi|254780127|r 460 QQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFACVV 532 (675) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~~ 532 (675) +..++..+...||++||+++.|+..+...+++.|++.++.++ |+..|+|.+-....+.++.++. T Consensus 780 ~~~eA~~~i~~yf~~~~~v~~~~~~~~~~a~~~gyv~Tl~gR---------rr~~~~i~s~n~~~r~~~eR~a 843 (928) T PRK07997 780 PRKEAQKYMDLYFERYPGVLEYMERTRAQAKEQGYVETLDGR---------RLYLPDIKSSNAARRAAAERAA 843 (928) T ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCCC---------EECCCCCCCCCHHHHHHHHHHH T ss_conf 999999999999886833999999999999977947888896---------1458876789988996899999 No 116 >PRK05797 consensus Probab=96.15 E-value=0.0063 Score=37.53 Aligned_cols=62 Identities=8% Similarity=-0.062 Sum_probs=32.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 40366666566676547889999999998751110103108897889999998673078999999999999 Q gi|254780127|r 460 QQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLTEGFAC 530 (675) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~ 530 (675) +..++..+...||++||+++.|+..+...+++.|++.++.++ |+..|.|.+-....+.++.+ T Consensus 721 ~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~g~v~Tl~gR---------rr~~p~i~s~n~~~r~~~eR 782 (869) T PRK05797 721 SRKEAKEYIDNYFERYPNVKEYMEKIVEEAKEKGYVTTILNR---------RRYIPEIKSSNKIVRAFGER 782 (869) T ss_pred CHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCEEECCCCC---------EECCCCCCCCCHHHHHHHHH T ss_conf 999999999999996804999999999999978907888887---------33588777889889977888 No 117 >cd06137 DEDDh_RNase This group is composed of eukaryotic exoribonucleases that include PAN2, RNA exonuclease 1 (REX1 or Rex1p), REX3 (Rex3p), REX4 (or Rex4p), ISG20, and similar proteins. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. REX proteins are required for the processing and maturation of many RNA species, and ISG20 is an interferon-induced antiviral exonuclease with a strong preference for single-stranded RNA. Probab=96.00 E-value=0.023 Score=33.78 Aligned_cols=56 Identities=20% Similarity=0.237 Sum_probs=35.7 Q ss_pred HHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHH-CC-----CCCHHHHHH-HHCCCC Q ss_conf 9986488981998630775999998629987865045458899985-88-----999789999-808898 Q gi|254780127|r 58 LQYLRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLARTN-GL-----PSSLKNACL-ALNFSE 120 (675) Q Consensus 58 ~~~L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l-~~-----p~sL~~la~-~lgl~~ 120 (675) .++| +++.++|||++.+|+.+|.. . . .+++||.++.... .. +.||+.+++ .||... T Consensus 78 ~~~i-~~~tIlVGH~L~nDL~aL~~----~-H-~~viDTa~l~~~~~~~~~~~~~~sLk~La~~~L~~~I 140 (161) T cd06137 78 WKFI-DPDTILVGHSLQNDLDALRM----I-H-TRVVDTAILTREAVKGPLAKRQWSLRTLCRDFLGLKI 140 (161) T ss_pred HHHC-CCCCEEEECCHHHHHHHHCC----C-C-CCEEECEEECCHHCCCCCCCCCHHHHHHHHHHHCHHH T ss_conf 9745-89967985101310878627----7-9-8354320332001157888888669999999828775 No 118 >PRK05761 DNA polymerase I; Reviewed Probab=95.87 E-value=0.044 Score=31.97 Aligned_cols=122 Identities=20% Similarity=0.135 Sum_probs=59.6 Q ss_pred HHHHHHHHHHCCCCCEEEEEEH-HHHHHHHHH---HCCCCCCC------------CCEEEH------H-HHHHH----HC Q ss_conf 2179999986488981998630-775999998---62998786------------504545------8-89998----58 Q gi|254780127|r 52 AMPSDLLQYLRDETVMCVAHNS-LFERILFKK---TLGIDIPS------------KRWICT------S-VLART----NG 104 (675) Q Consensus 52 ~~~~~l~~~L~~~~~~kv~HNa-~FD~~vL~~---~lGi~i~~------------~~~~DT------m-i~A~~----l~ 104 (675) .+...+..++.+-+.-.+ +|. .||++.|.. .+|++... ....|. . +-.+. +. T Consensus 200 elL~~f~~~i~~~DPDIi-yN~d~FD~PYL~~Ra~~lgi~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~y~~~~~~~ 278 (777) T PRK05761 200 ELLAELFKIIEDYPPVIT-FNGDNFDLPYLYNRALKLGIPKEELPFNVKYASLKTGIHIDLYKFFQNRAIRSYAFYGKYR 278 (777) T ss_pred HHHHHHHHHHHHCCCCEE-ECCCCCCHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCEEEECCCCCCHHHHHHEEECCEEC T ss_conf 999999999974698299-8078764899999999838990006651566540244465044531211355301054742 Q ss_pred -CCCCHHHHHHHH-CCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH Q ss_conf -899978999980-889865276637657664100233201345311258768999898877539999999987200214 Q gi|254780127|r 105 -LPSSLKNACLAL-NFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIPLSD 182 (675) Q Consensus 105 -~p~sL~~la~~l-gl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~~~ 182 (675) .+++|+.+++.+ |... ... ........+..+..||.+|+.+++.|...... T Consensus 279 ~~~y~Ld~Va~~lLg~~K--~~~----------------------~~~~~~~~~~~l~~Y~l~Da~l~~~L~~~~~~--- 331 (777) T PRK05761 279 HREAALDAVARALLGESK--VEL----------------------EKNISELNYWELAEYNYRDAEITLKLTFFNNE--- 331 (777) T ss_pred CCCEEHHHHHHHHCCCCC--CCC----------------------CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH--- T ss_conf 474554788999759974--103----------------------12466334788999989999999998753045--- Q ss_pred HHHHHHHHHHHHHHCCEECCH Q ss_conf 310178999987516843088 Q gi|254780127|r 183 GERDLWLLDQTINDRGYRIDL 203 (675) Q Consensus 183 ~E~~l~~~~~~me~~Gi~vD~ 203 (675) .-++++..+.++ .|+++|. T Consensus 332 ~~l~~~~elsri--~glpl~~ 350 (777) T PRK05761 332 LVLKLILLLSRI--SKLPIED 350 (777) T ss_pred HHHHHHHHHHHH--HCCCHHH T ss_conf 448889999999--7989999 No 119 >cd05776 DNA_polB_alpha_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase alpha. DNA polymerase alpha is a family-B DNA polymerase with a catalytic subunit that contains a DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (delta and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase alpha is almost exclusively required for the initiation of DNA replication and the priming of Okazaki fragments during elongation. It associates with DNA primase and is the only enzyme able to start DNA synthesis de novo. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. This explains why in most organisms, that no specific repair r Probab=95.60 E-value=0.16 Score=28.19 Aligned_cols=104 Identities=18% Similarity=0.169 Sum_probs=59.3 Q ss_pred HHHHHHC--CCCCEEEEEEH-HHHHHHHHH---HCCCCC------------C--------------CCC-EEEHHHHHHH Q ss_conf 9999864--88981998630-775999998---629987------------8--------------650-4545889998 Q gi|254780127|r 56 DLLQYLR--DETVMCVAHNS-LFERILFKK---TLGIDI------------P--------------SKR-WICTSVLART 102 (675) Q Consensus 56 ~l~~~L~--~~~~~kv~HNa-~FD~~vL~~---~lGi~i------------~--------------~~~-~~DTmi~A~~ 102 (675) .+...+. ||+++ +|||+ .||+.+|.. .++++. + .++ ..|+...++- T Consensus 88 ~f~~~i~~~DPDii-~Ghni~~fdl~~L~~R~~~lki~~wsriGRlkr~~~p~~~~~~~~~~~~~~~GRl~~D~~~~~ke 166 (234) T cd05776 88 FFLAKLQKIDPDVL-VGHDLEGFDLDVLLSRIQELKVPHWSRIGRLKRSVWPKKKGGGKFGERELTAGRLLCDTYLSAKE 166 (234) T ss_pred HHHHHHHHCCCCEE-EEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCEEEEECHHHHHH T ss_conf 99999985099899-85063777599999999980998533135500143764456565344203523588760999998 Q ss_pred H-C-CCCCHHHHHHH-HCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 5-8-89997899998-088986527663765766410023320134531125876899989887753999999998720 Q gi|254780127|r 103 N-G-LPSSLKNACLA-LNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLI 178 (675) Q Consensus 103 l-~-~p~sL~~la~~-lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~ 178 (675) + . ..++|..+++. ||-... |...+...++ ...++....+..||.+|+.++++|..+|. T Consensus 167 ~~k~~sy~L~~v~~~~L~~~k~---dv~~~~i~~~---------------~~~~~~l~~l~~~~~~Da~l~~~L~~~L~ 227 (234) T cd05776 167 LIRCKSYDLTELSQQVLGIERQ---DIDPEEILNM---------------YNDSESLLKLLEHTEKDAYLILQLMFKLN 227 (234) T ss_pred HHCCCCCCHHHHHHHHHCCCCC---CCCHHHHHHH---------------HHCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 7466778889999999686766---7999999999---------------86939999999999999999999999958 No 120 >KOG0969 consensus Probab=95.54 E-value=0.17 Score=28.05 Aligned_cols=113 Identities=18% Similarity=0.303 Sum_probs=60.2 Q ss_pred CCHHHHHHHHHHC--CCCCEEEEEEH-HHHHHHHH-H--HCCCCC-C--------CCCEEEHHHHHHHHCC--------- Q ss_conf 4221799999864--88981998630-77599999-8--629987-8--------6504545889998588--------- Q gi|254780127|r 50 QSAMPSDLLQYLR--DETVMCVAHNS-LFERILFK-K--TLGIDI-P--------SKRWICTSVLARTNGL--------- 105 (675) Q Consensus 50 ~~~~~~~l~~~L~--~~~~~kv~HNa-~FD~~vL~-~--~lGi~i-~--------~~~~~DTmi~A~~l~~--------- 105 (675) -..++.....++. ||+++ +|-|+ .||++-|. | .||++- | ...+.||+.-|...+- T Consensus 338 E~elL~~W~~firevDPDvI-~GYNi~nFDiPYll~RA~tL~Ie~Fp~LGRikn~~s~irDttfSSkq~GtRetK~v~I~ 416 (1066) T KOG0969 338 EKELLESWRKFIREVDPDVI-IGYNICNFDIPYLLNRAKTLGIENFPYLGRIKNSRSVIRDSTFSSKQYGTRETKEVNID 416 (1066) T ss_pred HHHHHHHHHHHHHHCCCCEE-ECCCCCCCCCCEECCHHHHCCCCCCCCCCEECCCCEEEECCCCCHHHCCCCCCEEEEEC T ss_conf 88999999999986298867-13354455632130767646806665010003541244103212321375101388665 Q ss_pred ------------------CCCHHHHHHHHCCCCCCHHHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf ------------------99978999980889865276637657664100233201345311258768999898877539 Q gi|254780127|r 106 ------------------PSSLKNACLALNFSEHLTKMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDV 167 (675) Q Consensus 106 ------------------p~sL~~la~~lgl~~~~~kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv 167 (675) +++|.+++..+ |.+++ +|. -...| . ....-+.+....+.-||.+|+ T Consensus 417 GRlqfDllqvi~RdyKLrSytLNaVs~hF-L~EQK-EDV-~~siI---------t----dLQng~~~TRRRlA~YCLKDA 480 (1066) T KOG0969 417 GRLQFDLLQVILRDYKLRSYTLNAVSAHF-LGEQK-EDV-HHSII---------T----DLQNGNEQTRRRLAVYCLKDA 480 (1066) T ss_pred CEEEEHHHHHHHHHHHHHHCCHHHHHHHH-HHHHC-CCC-CCCCH---------H----HHHCCCHHHHHHHHHHHHHHH T ss_conf 65652279998776433111011457876-55320-156-52100---------4----331476788889998876441 Q ss_pred HHHHHHHHHHCC Q ss_conf 999999987200 Q gi|254780127|r 168 EATRELFKRLIP 179 (675) Q Consensus 168 ~~~~~L~~~L~~ 179 (675) .+.++|.++|.- T Consensus 481 YLPlRLlekLM~ 492 (1066) T KOG0969 481 YLPLRLLEKLMV 492 (1066) T ss_pred CCHHHHHHHHHH T ss_conf 022999999999 No 121 >pfam00940 RNA_pol DNA-dependent RNA polymerase. This is a family of single chain RNA polymerases. Probab=95.29 E-value=0.17 Score=28.04 Aligned_cols=175 Identities=13% Similarity=0.074 Sum_probs=88.1 Q ss_pred HHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHH Q ss_conf 02210064210013307887789887887088864036666656667654788999999999875111010310889788 Q gi|254780127|r 426 KEQRAIGKVMELALGYQGGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACEL 505 (675) Q Consensus 426 ~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~ 505 (675) +..|+..|..+--+.||....++...-.. ...... ..+. .......+.--++. T Consensus 153 ~i~Rk~vKr~vMT~~Yg~t~~g~~~qi~~---------~l~~~~----~~~~--------------~~~~~~~~~yla~~ 205 (391) T pfam00940 153 KITRKVVKQTVMTLVYGVTFYGAREQIKD---------QLKEKG----TDES--------------KVLSYKAANYLAKI 205 (391) T ss_pred CCCHHHHHCCEEEEECCCCCCHHHHHHHH---------HHHHCC----CCCC--------------HHHHHHHHHHHHHH T ss_conf 66686640321664226662009999999---------998436----8731--------------45678999999999 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECC-CCHHHCCCCCCHHHHHHCCCH Q ss_conf 9999998673078999999999999963372142013222344321244564112246-311310245201233203100 Q gi|254780127|r 506 VKKAWRAKHQGVLQLWKDLTEGFACVVQEGGSISARRVANVPRLVMRRHKRDVHIVLP-SSRRLVYSDVKGDCSYLNTAT 584 (675) Q Consensus 506 ~~~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~p-sGR~l~~~~~~~~~~~~~~~~ 584 (675) +.+.....+|+....-..+++-++...+.+.++.+.-+....-.-.-+......|.+. .|+.+.... . ...... T Consensus 206 i~~si~~v~~~a~~~m~wL~~~a~~~~~~~~pv~W~TP~Gf~V~Q~Y~k~~~~~V~t~l~~~~~~~~~--~---~~~~~d 280 (391) T pfam00940 206 IFEAIKEVFPGAREIMDWLNECAKLIAKLNIPVVWTTPLGLPVVQPYRKSKKKRVKTSLQGVSRIKVI--T---LTPVSD 280 (391) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCEEEEEECCCCCEEEEEEECCEEEECCC--C---CCCCCC T ss_conf 99999998574999999999999999828996599999999898641688757999861566886135--6---666567 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCEEEEECCC Q ss_conf 123577653355777674798999999999999967981899953589996784 Q gi|254780127|r 585 SQLMRERTYGGKLTENIVQAISRDILCEGMKNATKNGYDIVLTVHDEIVSETPD 638 (675) Q Consensus 585 ~~~~r~~~~g~k~~n~~iQgtAaDi~k~am~~l~~~g~~ivl~VHDElv~evp~ 638 (675) +.-++. .+.=|.|-+.-|..|......+.+.|+. ...|||......-+ T Consensus 281 ~~Kq~~-----a~~PNfIHSlDAshl~~t~~~~~~~gi~-f~~IHDsf~tha~d 328 (391) T pfam00940 281 KRKQKS-----AFPPNFIHSLDASHLMLTALACIKAGIP-FASVHDSFWTHAND 328 (391) T ss_pred HHHHHC-----CCCCCHHHHHHHHHHHHHHHHHHHCCCC-EEEEEECCCCCCCC T ss_conf 565423-----6797413334489999999999856998-78871078877214 No 122 >PRK08434 consensus Probab=95.23 E-value=0.033 Score=32.77 Aligned_cols=43 Identities=12% Similarity=0.082 Sum_probs=36.6 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHH Q ss_conf 4036666656667654788999999999875111010310889 Q gi|254780127|r 460 QQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTA 502 (675) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 502 (675) +..++..+...||++||+++.|+..+...+++.|++.++.++. T Consensus 742 ~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~gyv~Tl~gRr 784 (887) T PRK08434 742 SAKEAKEYIESYFASFPTIKNFLESIKNFAKENGYVETLLGRK 784 (887) T ss_pred CHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCC T ss_conf 9999999999999968019999999999999768877888872 No 123 >KOG2249 consensus Probab=94.94 E-value=0.26 Score=26.86 Aligned_cols=83 Identities=24% Similarity=0.241 Sum_probs=49.8 Q ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHH----HHHH-HHCCCCCHHHHHHH-HCCCCCCHHHHH Q ss_conf 79999986488981998630775999998629987865045458----8999-85889997899998-088986527663 Q gi|254780127|r 54 PSDLLQYLRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTS----VLAR-TNGLPSSLKNACLA-LNFSEHLTKMEE 127 (675) Q Consensus 54 ~~~l~~~L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTm----i~A~-~l~~p~sL~~la~~-lgl~~~~~kd~~ 127 (675) ..++.++|. +.++|||-+.-|+.+|.. .-|-...+||- ++.. .-...-||+.|++. ||.+.+..+ T Consensus 175 Q~ev~klL~--gRIlVGHaLhnDl~~L~l----~hp~s~iRDTs~~~pl~k~~~~~~tpSLK~Lt~~~Lg~~IQ~Ge--- 245 (280) T KOG2249 175 QKEVLKLLK--GRILVGHALHNDLQALKL----EHPRSMIRDTSKYPPLMKLLSKKATPSLKKLTEALLGKDIQVGE--- 245 (280) T ss_pred HHHHHHHHH--CCEEECCCCCCHHHHHHH----HCCHHHHCCCCCCCHHHHHHHCCCCCCHHHHHHHHHCHHHHCCC--- T ss_conf 999999970--877751220007999855----37435531431275487775215983199999998452543156--- Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 765766410023320134531125876899989887753999999998720 Q gi|254780127|r 128 GKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLI 178 (675) Q Consensus 128 gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~ 178 (675) .-..+|+.++++||.+.. T Consensus 246 ---------------------------------HsSvEDA~AtM~LY~~vk 263 (280) T KOG2249 246 ---------------------------------HSSVEDARATMELYKRVK 263 (280) T ss_pred ---------------------------------CCCHHHHHHHHHHHHHHH T ss_conf ---------------------------------685889999999999999 No 124 >TIGR03491 RecB family nuclease, putative, TM0106 family. Members of this uncharacterized protein family are found broadly but sporadically among bacteria. The N-terminal region is homologous to the Cas4 protein of CRISPR systems, although this protein family shows no signs of association with CRISPR repeats. Probab=94.79 E-value=0.28 Score=26.61 Aligned_cols=16 Identities=13% Similarity=0.266 Sum_probs=8.8 Q ss_pred HHHHHHHHHCCCCCCH Q ss_conf 7898878870888640 Q gi|254780127|r 446 KVFKTMASHCGLDLQQ 461 (675) Q Consensus 446 ~~l~~~~~~~G~~~~~ 461 (675) .+|+.++..||....+ T Consensus 356 ~al~rLa~~~~~~e~e 371 (457) T TIGR03491 356 DSLRRLAKRYGTPEAE 371 (457) T ss_pred HHHHHHHHHCCCCHHH T ss_conf 9999999871998699 No 125 >cd06145 REX1_like This subfamily is composed of RNA exonuclease 1 (REX1 or Rex1p), REX3 (or Rex3p), and similar eukaryotic proteins. In yeast, REX1 and REX3 are required for 5S rRNA and MRP (mitochondrial RNA processing) RNA maturation, respectively. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. REX1 is the major exonuclease responsible for pre-tRNA trail trimming and may also be involved in nuclear CCA turnover. REX proteins function in the processing and maturation of many RNA species, similar to the function of E.coli RNase T. Probab=94.75 E-value=0.17 Score=28.08 Aligned_cols=58 Identities=29% Similarity=0.304 Sum_probs=36.2 Q ss_pred HHHHHHCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHH-CCC--CCHHHHHHH-HCCCC Q ss_conf 999986488981998630775999998629987865045458899985-889--997899998-08898 Q gi|254780127|r 56 DLLQYLRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLARTN-GLP--SSLKNACLA-LNFSE 120 (675) Q Consensus 56 ~l~~~L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l-~~p--~sL~~la~~-lgl~~ 120 (675) ++..+| +++.++|||.+.+|+.+|. +.-+ +++||.++-..- +.+ .||+.+++. ||.+. T Consensus 68 ~l~~li-~~~~IlVGH~L~nDL~aL~----i~H~--~v~DTa~l~~~~~~~~~~~sLk~L~~~~L~~~I 129 (150) T cd06145 68 KLLSLI-SPDTILVGHSLENDLKALK----LIHP--RVIDTAILFPHPRGPPYKPSLKNLAKKYLGRDI 129 (150) T ss_pred HHHHHC-CCCCEEEECCCCCCHHHHH----CCCC--CEEEHHHHHHHCCCCCCCHHHHHHHHHHHCHHH T ss_conf 999875-9998899626434777761----5587--615716621103799989789999999828774 No 126 >pfam03104 DNA_pol_B_exo DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold. Probab=94.75 E-value=0.17 Score=28.01 Aligned_cols=76 Identities=16% Similarity=0.245 Sum_probs=38.3 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECC----CE-----------------EEEEECCCCCHHHHHHHH Q ss_conf 978999804067887000460000268829999999838----22-----------------899978884221799999 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDD----EP-----------------VKLWDRTEQSAMPSDLLQ 59 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~----~~-----------------~~~~d~~~~~~~~~~l~~ 59 (675) ++.+++||||.|... --+-++.-.|..|+..+.. ++ +.+..+..-..++..+.. T Consensus 86 lrilsfDIEc~~~~g-----~FP~~~~d~IiqIs~~~~~~g~~~~~~~~~~tl~~~~~i~~~~~V~~f~~E~~ll~~f~~ 160 (254) T pfam03104 86 LRVLSFDIECTSLKG-----KFPENEKDPIIQISCMLYREGEPDPIEDKLFTLKSCALIEIGGEVYIFPSEKELLRRFFE 160 (254) T ss_pred CEEEEEEEEECCCCC-----CCCCCCCCEEEEEEEEEEECCCCCCCCEEEEEECCCCCCCCCCEEEEECCHHHHHHHHHH T ss_conf 539999889588899-----999878983899998988567898632069980787767789779995999999999999 Q ss_pred HHC--CCCCEEEEEEH-HHHHHHHHH Q ss_conf 864--88981998630-775999998 Q gi|254780127|r 60 YLR--DETVMCVAHNS-LFERILFKK 82 (675) Q Consensus 60 ~L~--~~~~~kv~HNa-~FD~~vL~~ 82 (675) ++. +++++ +|+|+ .||++.|.. T Consensus 161 ~i~~~dpdii-~gyNi~~FD~~yl~~ 185 (254) T pfam03104 161 FIRQYDPDII-TGYNGDNFDWPYILN 185 (254) T ss_pred HHHHCCCCEE-EECCCCCCCHHHHHH T ss_conf 9875199899-966787788899999 No 127 >pfam11074 DUF2779 Domain of unknown function(DUF2779). This domain is conserved in bacteria. The function is not known. Probab=94.29 E-value=0.27 Score=26.76 Aligned_cols=74 Identities=22% Similarity=0.296 Sum_probs=41.9 Q ss_pred EEEECCCCCC-CCCCCCCHHHCCCCEEEEEEEEE----CCC-EEE---EEECC---CCCHHHHHHHHHHCCCCCEEEEEE Q ss_conf 9980406788-70004600002688299999998----382-289---99788---842217999998648898199863 Q gi|254780127|r 5 FIDIETRSPQ-PLPKVGVWAYAEQAVITLCAYAW----DDE-PVK---LWDRT---EQSAMPSDLLQYLRDETVMCVAHN 72 (675) Q Consensus 5 ~lD~ET~s~~-dl~~~Gl~~y~~~~~I~~i~~a~----~~~-~~~---~~d~~---~~~~~~~~l~~~L~~~~~~kv~HN 72 (675) +||+||.+.. ++- .|+.+| +-+.++||. ++| ... ++... +...+.+.|++.|.......+.+| T Consensus 2 flDFEt~~~aiP~~-~g~~Py----qqipFQ~SlHi~~~~g~~~~h~efL~~~~~DPr~~~~~~L~~~i~~~~Gsi~~yn 76 (127) T pfam11074 2 FLDFETISSAIPVF-DGTRPY----QQIPFQFSLHIQKEDGKKLEHVEFLAEDGIDPRRAFIEALKKAIGKDYGSIVVYN 76 (127) T ss_pred CEEECCCCCCCCCC-CCCCCC----EECCEEEEEEEECCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEEH T ss_conf 17312466566687-899766----1033788898886899720230120367768479999999998476787099962 Q ss_pred HHHHHHHHHHH Q ss_conf 07759999986 Q gi|254780127|r 73 SLFERILFKKT 83 (675) Q Consensus 73 a~FD~~vL~~~ 83 (675) ..|+..+|... T Consensus 77 ~sfEk~~l~el 87 (127) T pfam11074 77 KSFEKTRLKEL 87 (127) T ss_pred HHHHHHHHHHH T ss_conf 48899999999 No 128 >pfam03175 DNA_pol_B_2 DNA polymerase type B, organellar and viral. Like pfam00136, members of this family are also DNA polymerase type B proteins. Those included here are found in plant and fungal mitochondria, and in viruses. Probab=93.71 E-value=0.17 Score=28.01 Aligned_cols=113 Identities=20% Similarity=0.106 Sum_probs=52.9 Q ss_pred EEEEEEH-HHHHHHHHHHCCC----C---CC--------------CCCEEEHHHHHHHHCCCCCHHHHHHHHCCCCCCHH Q ss_conf 1998630-7759999986299----8---78--------------65045458899985889997899998088986527 Q gi|254780127|r 67 MCVAHNS-LFERILFKKTLGI----D---IP--------------SKRWICTSVLARTNGLPSSLKNACLALNFSEHLTK 124 (675) Q Consensus 67 ~kv~HNa-~FD~~vL~~~lGi----~---i~--------------~~~~~DTmi~A~~l~~p~sL~~la~~lgl~~~~~k 124 (675) ...+||+ .||..++..++-. + +. .-.+.||.. -.|.||+.+++.+++...+.. T Consensus 2 ~~y~HNl~~fD~~fil~~l~~~~~~~~~i~~~~~~~~~~~~~~~~~i~~~DS~~-----l~~~sL~~~~~~f~l~~~Kg~ 76 (452) T pfam03175 2 IPYAHNLSKFDGTFILPNLIRKKIIDKKIWKNGIIIKIDFKYKYIKIMVRDSYL-----LLPTSLRKLAKAFNLEVQKGC 76 (452) T ss_pred CEECCCCCCCCHHHHHHHHHHCCCCCCEECCCCCEEEEEEECCCEEEEEECCHH-----HHHHHHHHHHHHHCCCCCCCC T ss_conf 366357642477555544664288763471489479888752627999981455-----433649999998398665786 Q ss_pred ----HHH-------C-HHHHHHHHHHHH----HHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf ----663-------7-657664100233----20134531125876899989887753999999998720021431 Q gi|254780127|r 125 ----MEE-------G-KALIARFCKGSI----DSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIPLSDGE 184 (675) Q Consensus 125 ----d~~-------g-k~li~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~~~~~E 184 (675) ... + -.-..++..... ...+...........-+....||.+||.+...++.++......+ T Consensus 77 fPy~~~~~~~~~~~~~~p~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Yc~~DV~iL~~v~~~f~~~~~~~ 152 (452) T pfam03175 77 FPYDFVNKNYMWDKGGFPIEKYWKDFKEYILNKEIWNLKNRPNGYDIRKETIEYCKNDVLVLAKLLISFKNFIYEE 152 (452) T ss_pred CCHHHHCHHHHCCCCCCCCHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 6817746566526778884554256134566777876640466676299999999999999999999999999998 No 129 >pfam04857 CAF1 CAF1 family ribonuclease. The major pathways of mRNA turnover in eukaryotes initiate with shortening of the polyA tail. CAF1 encodes a critical component of the major cytoplasmic deadenylase in yeast. Both Caf1p is required for normal mRNA deadenylation in vivo and localizes to the cytoplasm. Caf1p copurifies with a Ccr4p-dependent polyA-specific exonuclease activity. Some members of this family include and inserted RNA binding domain pfam01424. This family of proteins is related to other exonucleases pfam00929 (Bateman A pers. obs.). The crystal structure of Saccharomyces cerevisiae Pop2 has been resolved at 2.3 Angstrom#resolution. Probab=93.65 E-value=0.22 Score=27.29 Aligned_cols=59 Identities=22% Similarity=0.174 Sum_probs=41.8 Q ss_pred CCCCCEEEEEEHHHHHHHHHHHC-CCCCCC-------------CCEEEHHHHHHHHC-----CCCCHHHHHHHHCCCC Q ss_conf 48898199863077599999862-998786-------------50454588999858-----8999789999808898 Q gi|254780127|r 62 RDETVMCVAHNSLFERILFKKTL-GIDIPS-------------KRWICTSVLARTNG-----LPSSLKNACLALNFSE 120 (675) Q Consensus 62 ~~~~~~kv~HNa~FD~~vL~~~l-Gi~i~~-------------~~~~DTmi~A~~l~-----~p~sL~~la~~lgl~~ 120 (675) .+.++.+|+||..||...|.+.+ |-++|. ..++||..++.... ...+|..+|+.||+.. T Consensus 137 ~~~~~~wv~f~g~yD~~yl~k~l~g~~LP~~~~eF~~~l~~~FP~vyD~K~la~~~~~~~~~~~~gL~~lA~~L~v~r 214 (235) T pfam04857 137 NISSVTWVTFHGLYDFGYLLKLLTGGPLPETLDDFLELLRELFPRVYDTKYLAKFCFELDGGSNGGLQELADLLGVNR 214 (235) T ss_pred CCCCCEEEEECCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCC T ss_conf 579987998172778999999974998999899999999997771750999998522135875578999999749978 No 130 >pfam00476 DNA_pol_A DNA polymerase family A. Probab=93.35 E-value=0.092 Score=29.84 Aligned_cols=57 Identities=11% Similarity=0.026 Sum_probs=44.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 403666665666765478899999999987511101031088978899999986730789999999 Q gi|254780127|r 460 QQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQLWKDLT 525 (675) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~~~~~~~ 525 (675) +..++..+...|++.||+++.|+..+...+.+.|++.++.++ ++.+|.+........ T Consensus 236 s~~eA~~~i~~~~~~yp~i~~~~~~~~~~a~~~g~v~T~~GR---------rr~lp~~~~~~~~~~ 292 (383) T pfam00476 236 SREEAKEFIEKYFERFPGVKRYREKTRKEAKKGGYVETLFGR---------RRYLPDIDSRNRSLR 292 (383) T ss_pred CHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCCCCCCCC---------EEECCCCCCCCHHHH T ss_conf 899999999999997836999999999999968961145687---------675577566678888 No 131 >PRK07456 consensus Probab=92.62 E-value=0.17 Score=28.13 Aligned_cols=40 Identities=5% Similarity=-0.075 Sum_probs=19.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHH Q ss_conf 3666665666765478899999999987511101031088 Q gi|254780127|r 462 FSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGT 501 (675) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (675) .++..+...||+.||+++.|+..+...+++.|++.++.++ T Consensus 811 ~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~gyv~Tl~gR 850 (975) T PRK07456 811 SEAKEFLSRYKERYPKVFAFLELQERLALSKGYVETILGR 850 (975) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCC T ss_conf 9999999999986901999999999999976988766677 No 132 >cd06135 Orn Oligoribonuclease (Orn) is a DEDDh-type DnaQ-like 3'-5' exoribonuclease that is responsible for degrading small oligoribonucleotides to mononucleotides. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Orn is essential for E.coli survival. The human homolog, also called Sfn (small fragment nuclease), is able to hydrolyze short single-stranded RNA and DNA oligomers. It plays a role in cellular nucleotide recycling. Probab=91.97 E-value=0.49 Score=25.04 Aligned_cols=49 Identities=12% Similarity=0.012 Sum_probs=25.7 Q ss_pred HCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCHHHHHHHHC Q ss_conf 648898199863077599999862998786504545889998588999789999808 Q gi|254780127|r 61 LRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLARTNGLPSSLKNACLALN 117 (675) Q Consensus 61 L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~p~sL~~la~~lg 117 (675) +........|+|+.||+.||.++ +| . +. ..+ .|-+-.-.|++.+++++. T Consensus 90 ~~~~~~pLaGNSV~~Dr~FL~k~----mP-~-l~-~~l-hYR~iDVSsikel~~rw~ 138 (173) T cd06135 90 VPKGKSPLAGNSVHQDRRFLDKY----MP-E-LE-EYL-HYRILDVSSIKELARRWY 138 (173) T ss_pred CCCCCCCCCCCCHHHHHHHHHHH----CC-H-HH-HHC-CCEEEEHHHHHHHHHHHC T ss_conf 67998531576487709999985----80-6-79-759-870466777999999869 No 133 >KOG0950 consensus Probab=91.34 E-value=0.17 Score=28.16 Aligned_cols=16 Identities=19% Similarity=0.198 Sum_probs=9.2 Q ss_pred CCCCCCCCCHHHHHCC Q ss_conf 4441000015776087 Q gi|254780127|r 404 NGEDIYVTTYAKAFNT 419 (675) Q Consensus 404 ~g~D~h~~tA~~~~~~ 419 (675) +-.|+|...|...|.. T Consensus 654 t~~di~~~va~tl~s~ 669 (1008) T KOG0950 654 TAEDILHFVAVTLLSA 669 (1008) T ss_pred HHHHHHHHHHHHHHHC T ss_conf 2778887777742012 No 134 >cd06444 DNA_pol_A DNA polymerase family A, 5'-3' polymerase domain. Family A polymerase (polymerase I) functions primarily to fill DNA gaps that arise during DNA repair, recombination and replication. DNA-dependent DNA polymerases can be classified into six main groups based upon phylogenetic relationships with E. coli polymerase I (classA), E. coli polymerase II (class B), E.coli polymerase III (class C), euryarchaeota polymerase II (class D), human polymerase beta (class X), E. coli UmuC/DinB and eukaryotic RAP 30/Xeroderma pigmentosum variant (class Y). Family A polymerases are found primarily in organisms related to prokaryotes and include prokaryotic DNA polymerase I, mitochondrial polymerase gamma, and several bacteriophage polymerases including those from odd-numbered phage (T3, T5, and T7). Prokaryotic Pol I has two functional domains located on the same polypeptide; a 5'-3' polymerase and a 5'-3' exonuclease. Pol I uses its 5' nuclease activity to remove the ribonucleotide po Probab=91.32 E-value=0.21 Score=27.54 Aligned_cols=51 Identities=10% Similarity=-0.003 Sum_probs=41.3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHCCCHHH Q ss_conf 403666665666765478899999999987511101031088978899999986730789 Q gi|254780127|r 460 QQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAKHQGVLQ 519 (675) Q Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~~p~i~~ 519 (675) +..++......|++.||+++.|+..+...+.+.|++.+++++ ++.+|.+.. T Consensus 233 s~~eA~~~~~~f~~~~p~v~~~~~~~~~~a~~~g~v~T~~GR---------~r~~~~~~~ 283 (379) T cd06444 233 SREEAKEFIDRYFATYPGVKEYLEKTVEEAREGGYVETLFGR---------RRYLPEINS 283 (379) T ss_pred CHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCEECCCCC---------EECCCCCCC T ss_conf 999999999999997865667899999999977977667788---------534775567 No 135 >cd06143 PAN2_exo PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. PAN catalyzes the deadenylation of poly(A) tails, which are initially synthesized to default lengths of 70 to 90, to mRNA-specific lengths of 55 to 71. Pab1p and PAN also play a role in the export and decay of mRNA. PAN2 contains a DEDDh-type DnaQ-like 3'-5' exonuclease domain with three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=91.11 E-value=0.47 Score=25.15 Aligned_cols=56 Identities=21% Similarity=0.115 Sum_probs=37.1 Q ss_pred HCCCCCEEEEEEHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHH-CCCCCHHHHHHH-HCCCC Q ss_conf 6488981998630775999998629987865045458899985-889997899998-08898 Q gi|254780127|r 61 LRDETVMCVAHNSLFERILFKKTLGIDIPSKRWICTSVLARTN-GLPSSLKNACLA-LNFSE 120 (675) Q Consensus 61 L~~~~~~kv~HNa~FD~~vL~~~lGi~i~~~~~~DTmi~A~~l-~~p~sL~~la~~-lgl~~ 120 (675) |-+.+.++|||-+..|+.+| ++..|...++||..+-+.- ..+.||+.+++. ||.+. T Consensus 98 li~~~~ILVGHsL~nDL~aL----~l~hP~~~i~DTs~l~~~~~~r~~sLk~La~~~Lg~~I 155 (174) T cd06143 98 LVDLGCIFVGHGLAKDFRVI----NIQVPKEQVIDTVELFHLPGQRKLSLRFLAWYLLGEKI 155 (174) T ss_pred HCCCCCEEECCCCHHHHHHH----CCCCCCCCEEECHHHCCCCCCCCCCHHHHHHHHCCCCC T ss_conf 54799689789636679885----06699866577788526899887089999999808104 No 136 >PHA00452 T3/T7-like RNA polymerase Probab=89.31 E-value=1.4 Score=22.02 Aligned_cols=229 Identities=12% Similarity=0.087 Sum_probs=116.9 Q ss_pred EEEEEECCCCCEEEEEEECCCHHHHHHHH-----CCCCCCCCCHHHHHCCCCCCC-----------------CHHHHHCC Q ss_conf 89997335485058866228767866540-----444100001577608751105-----------------70221006 Q gi|254780127|r 375 KLVVADLAGIEARVLAWIAGEDWKIKAFA-----NGEDIYVTTYAKAFNTPIDKV-----------------SKEQRAIG 432 (675) Q Consensus 375 ~lv~aDySqiElRvlA~ls~d~~li~~f~-----~g~D~h~~tA~~~~~~~~~~v-----------------~~~~R~~a 432 (675) .-|..|-|-==+--.|.|.+|+..-.+-+ ...|+|+..|..+-..-.++. .+..|+.. T Consensus 505 lpv~~DgsCnGlQH~aal~rD~~ga~~VNL~p~d~p~DvY~~Va~~v~~~~~~d~~~~~~~~~~~~~~~~l~~~i~Rk~v 584 (816) T PHA00452 505 LPVAMDGTCSGLQHFSAMLRDEVGGAAVNLLPSDKPQDIYKIVAEKVNERLERDAINGTDNETRGLAGQWLAYGITRSVT 584 (816) T ss_pred CCCCCCCCCCHHHHHHHHHCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCHHHH T ss_conf 74110788729999999725863214523678888651899999999999998874387632689999988616667663 Q ss_pred CCCHHHHHHCCCHHHHHHHHH----HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHH Q ss_conf 421001330788778988788----7088864036666656667654788999999999875111010310889788999 Q gi|254780127|r 433 KVMELALGYQGGAKVFKTMAS----HCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKK 508 (675) Q Consensus 433 K~~~~gl~YG~G~~~l~~~~~----~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~ 508 (675) |..+--..||....+....-. ..|.+.+.. .. + ....+.=.++.+.+ T Consensus 585 K~~vMT~~Yg~t~~g~~~qi~~~l~~~~~~~~~~--------------~~--~-------------~~~~~~yla~~i~~ 635 (816) T PHA00452 585 KRPVMTLPYGSTRFGFRDQVREDLVQPAIDEGKG--------------AM--F-------------TNQAAGYMAKLIWD 635 (816) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCCC--------------HH--H-------------HHHHHHHHHHHHHH T ss_conf 2531675206610008999999997305566743--------------03--4-------------78999999999999 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECC-CCHH-HCCCCCCHHHHHHCCCHHH Q ss_conf 9998673078999999999999963372142013222344321244564112246-3113-1024520123320310012 Q gi|254780127|r 509 AWRAKHQGVLQLWKDLTEGFACVVQEGGSISARRVANVPRLVMRRHKRDVHIVLP-SSRR-LVYSDVKGDCSYLNTATSQ 586 (675) Q Consensus 509 ~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~i~~p-sGR~-l~~~~~~~~~~~~~~~~~~ 586 (675) .-...+|+-...-..+++.++.+.+.+..+.+.-+....-.-.-+......|.+- .|+. +.... . .+...+. T Consensus 636 ai~~~~~~a~~im~wl~~~a~~~~~~~~~v~W~TP~G~pV~Q~Y~~~~~~~v~t~l~g~~~~~~~~---~---~~~~~~~ 709 (816) T PHA00452 636 AISETVPAAVEAMNWLQQAAKLLAKEVKDLHWTTPTGFPVWQEYRKTETKRVRLRLMGSARIQLTV---N---TDEIDAR 709 (816) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCEEEEECCCCCCEEEEEEECCEEEEEECC---C---CCCCCHH T ss_conf 999997709999999999999997169972898999998987737888468998741527998337---8---8988977 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCEEEEECCCCHHHHHH Q ss_conf 35776533557776747989999999999999679818999535899967847899999 Q gi|254780127|r 587 LMRERTYGGKLTENIVQAISRDILCEGMKNATKNGYDIVLTVHDEIVSETPDTPYFSVG 645 (675) Q Consensus 587 ~~r~~~~g~k~~n~~iQgtAaDi~k~am~~l~~~g~~ivl~VHDElv~evp~~~~~~~~ 645 (675) -++. .+.=|.|-+.-|.-|.+..+.+.++|......|||..=...-+ ++.+. T Consensus 710 kq~~-----a~~PNfIHSlDashm~~t~~~~~~~gi~~fa~VHDsf~tha~d--v~~m~ 761 (816) T PHA00452 710 KHAS-----GIAPNFVHSLDASHLRLTVVACAEKGIESFAVIHDSFGTHAAD--ADNLF 761 (816) T ss_pred HHHC-----CCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCC--HHHHH T ss_conf 8724-----3794888889999999999999983996278863278877158--99999 No 137 >TIGR01298 RNaseT ribonuclease T; InterPro: IPR005987 Ribonuclease T (3.1.13 from EC) is an enzyme found so far only in gamma-subdivision proteobacteria such as Escherichia coli and Xylella fastidiosa. Ribonuclease T is homologous to the DNA polymerase III alpha chain. It can liberate AMP from the common C-C-A terminus of uncharged tRNA. It appears also to be involved in RNA maturation. It also acts as a 3' to 5' single-strand DNA-specific exonuclease; it is distinctive for its ability to remove residues near a double-stranded stem. Ribonuclease T is a high copy suppressor in E. coli of a UV-repair defect caused by deletion of three other single-stranded DNA exonucleases.; GO: 0004540 ribonuclease activity, 0006396 RNA processing. Probab=87.89 E-value=0.39 Score=25.70 Aligned_cols=37 Identities=22% Similarity=0.263 Sum_probs=23.8 Q ss_pred CCEEEEEEHHHHHHHHHHHC----CC---CCCCCCEEEHHHHHH Q ss_conf 98199863077599999862----99---878650454588999 Q gi|254780127|r 65 TVMCVAHNSLFERILFKKTL----GI---DIPSKRWICTSVLAR 101 (675) Q Consensus 65 ~~~kv~HNa~FD~~vL~~~l----Gi---~i~~~~~~DTmi~A~ 101 (675) ..+.|+||+.||..||.+.- ++ ++-+-..+||.-+|= T Consensus 105 RailV~HNa~fD~~Fl~~AA~~R~~~KRnPFH~F~~FDTatLAG 148 (201) T TIGR01298 105 RAILVAHNASFDLGFLNRAAVKRTALKRNPFHPFSVFDTATLAG 148 (201) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHH T ss_conf 03676003204666899999998611578987652246899988 No 138 >PRK07748 sporulation inhibitor KapD; Provisional Probab=82.00 E-value=3.2 Score=19.64 Aligned_cols=138 Identities=16% Similarity=0.102 Sum_probs=74.4 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEE-EECCCEEE-----EE---------------------ECCCCCHH Q ss_conf 97899980406788700046000026882999999-98382289-----99---------------------78884221 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAY-AWDDEPVK-----LW---------------------DRTEQSAM 53 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~-a~~~~~~~-----~~---------------------d~~~~~~~ 53 (675) +.+|.+|+|.++..+-.. ...+ .++|.=|+. .++++.+. ++ ++..+.++ T Consensus 4 ~~yiViDfEaTc~~~~~~--p~~~--~~EIIEiGaV~l~~~~i~d~F~~~VkP~~~p~Ls~fc~~LTGItq~~vd~a~~F 79 (205) T PRK07748 4 QRFLFLDFEFTMPQHKKK--PKGF--FPEIIEVGLVSVVGCKVEDTFSSYVKPKTFPSLTDRCKKFLGITQEDVDKGISF 79 (205) T ss_pred CEEEEEEEECCCCCCCCC--CCCC--CCCEEEEEEEEEECCEEEEEHEEEECCCCCCCCCHHHHHHCCCCHHHHHCCCCH T ss_conf 669999985478887888--7778--875179978998178152100004798547851378997529499996258988 Q ss_pred HHHH---HHHHCCCCCEEEEEEHHHHHHHHHH---HCCCCCC-CCCEEEHHH-HHHHHCC--CCCHHHHHHHHCCCCCCH Q ss_conf 7999---9986488981998630775999998---6299878-650454588-9998588--999789999808898652 Q gi|254780127|r 54 PSDL---LQYLRDETVMCVAHNSLFERILFKK---TLGIDIP-SKRWICTSV-LARTNGL--PSSLKNACLALNFSEHLT 123 (675) Q Consensus 54 ~~~l---~~~L~~~~~~kv~HNa~FD~~vL~~---~lGi~i~-~~~~~DTmi-~A~~l~~--p~sL~~la~~lgl~~~~~ 123 (675) ++.+ .+.+.+++.. ++++..+|+.+|.+ ..|+++| ...|+|... .+.+.+. +.||+.+.+.+|++.... T Consensus 80 ~evl~~f~~~~~~~~~~-~~tWG~~D~~~L~~~c~~~~i~~p~~~~~idlk~~f~~~~~~~~~~gL~~aL~~lgl~~eG~ 158 (205) T PRK07748 80 EELVEKLAEYDKRCKPT-IVTWGNMDMKVLKHNCEKAGVPFPFKGQCRDLSLEYKKFFGERNQTGLWKAIEEYGKEGTGK 158 (205) T ss_pred HHHHHHHHHHHCCCCEE-EEEECHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCC T ss_conf 99999999981778728-98717889999999999938999853100639999999968888768999999769988898 Q ss_pred HHHHCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 76637657664100233201345311258768999898877539999999987200 Q gi|254780127|r 124 KMEEGKALIARFCKGSIDSPPYDCTRANHVQAWQLFGEYCKRDVEATRELFKRLIP 179 (675) Q Consensus 124 kd~~gk~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ya~~Dv~~~~~L~~~L~~ 179 (675) ..-+..|+..+.+++.+|.. T Consensus 159 ------------------------------------~H~glDDA~NtAkI~kk~~~ 178 (205) T PRK07748 159 ------------------------------------HHCALDDAMTTYNIFKLVEK 178 (205) T ss_pred ------------------------------------CCCHHHHHHHHHHHHHHHHC T ss_conf ------------------------------------63758999999999999761 No 139 >KOG4373 consensus Probab=74.08 E-value=5.4 Score=18.13 Aligned_cols=20 Identities=15% Similarity=-0.215 Sum_probs=9.9 Q ss_pred HHHCCEECCHHHHHHHHHHH Q ss_conf 75168430889999999999 Q gi|254780127|r 194 INDRGYRIDLDLVLKLQELI 213 (675) Q Consensus 194 me~~Gi~vD~e~l~~~~~~~ 213 (675) ...-|+..|.+.-+-...+. T Consensus 185 ~~~vgv~~d~D~~KL~r~~h 204 (319) T KOG4373 185 HTFVGVWNDQDAGKLERKEH 204 (319) T ss_pred CEEEECCCCCCHHHHHHHHH T ss_conf 05873055530787766552 No 140 >TIGR00888 guaA_Nterm GMP synthase, N-terminal domain; InterPro: IPR004739 The N-terminal region of GMP synthase 6.3.5.2 from EC is recognised by this signature. GMP synthase catalyzes the synthesis of GMP from XMP , .ATP + xanthosine 5'-phosphate + L-glutamine + H(2)O = AMP + diphosphate + GMP + L-glutamate ; GO: 0003922 GMP synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0006177 GMP biosynthetic process. Probab=62.24 E-value=1.9 Score=21.08 Aligned_cols=33 Identities=18% Similarity=0.071 Sum_probs=26.3 Q ss_pred HCCCCCHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 00642100133078877898878870888640366666 Q gi|254780127|r 430 AIGKVMELALGYQGGAKVFKTMASHCGLDLQQFSQNVK 467 (675) Q Consensus 430 ~~aK~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~~~ 467 (675) ....+.+||++||| |.||..+|=.+.......+ T Consensus 67 f~LgVPvLGICYG~-----Ql~A~~lGG~V~~~~~~EY 99 (195) T TIGR00888 67 FELGVPVLGICYGM-----QLMAKQLGGEVGRAEKREY 99 (195) T ss_pred HHCCCCEEEEEHHH-----HHHHHHHCCEEEECCCCCC T ss_conf 62799688762457-----8999982987765688873 No 141 >cd05531 POLBc_B2 DNA polymerase type-B B2 subfamily catalytic domain. Archaeal proteins that are involved in DNA replication are similar to those from eukaryotes. Some archaeal members also possess multiple family B DNA polymerases (B1, B2 and B3). So far there is no specific function(s) has been assigned for different members of the archaea type B DNA polymerases. Phylogenetic analyses of eubacterial, archaeal, and eukaryotic family B DNA polymerases are support independent gene duplications during the evolution of archaeal and eukaryotic family B DNA polymerases. Probab=56.81 E-value=12 Score=15.94 Aligned_cols=23 Identities=22% Similarity=0.112 Sum_probs=12.4 Q ss_pred EEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHH Q ss_conf 899978884221799999864889819986307759 Q gi|254780127|r 42 VKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFER 77 (675) Q Consensus 42 ~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~ 77 (675) |.+.|+.+-.| +++ +.||+.|+. T Consensus 21 V~~lDF~SLYP------------SIi-~~~Ni~~~T 43 (352) T cd05531 21 VAQIDFSSMYP------------SII-VKYNISPET 43 (352) T ss_pred EEEEECCCCCH------------HHH-HHHCCCHHH T ss_conf 37888443549------------999-996979777 No 142 >pfam09281 Taq-exonuc Taq polymerase, exonuclease. Members of this family are found in prokaryotic Taq DNA polymerase, where they assume a ribonuclease H-like motif. The domain confers 5'-3' exonuclease activity to the polymerase. Probab=55.60 E-value=12 Score=15.82 Aligned_cols=40 Identities=5% Similarity=-0.122 Sum_probs=29.4 Q ss_pred HHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCC-CCHHHHHHHH Q ss_conf 75999998629987865045458899985889-9978999980 Q gi|254780127|r 75 FERILFKKTLGIDIPSKRWICTSVLARTNGLP-SSLKNACLAL 116 (675) Q Consensus 75 FD~~vL~~~lGi~i~~~~~~DTmi~A~~l~~p-~sL~~la~~l 116 (675) =|+.++.-.-|..++|. .|.|+.||++++. .+...++.+| T Consensus 71 KdLAvla~~eG~~v~PG--DDPmLlAYLlDPsNT~p~~varRY 111 (138) T pfam09281 71 KALAAHLSVRGTVVEPG--DDPLLVAYLLDPANTNMPGVARRY 111 (138) T ss_pred HHHHHHHHCCCCCCCCC--CCHHHHHHHCCCCCCCCHHHHHHH T ss_conf 99999987026656999--860278765085678821566762 No 143 >TIGR00612 ispG_gcpE 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; InterPro: IPR004588 This protein previously of unknown biochemical function is essential in Escherichia coli. It has now been characterised as 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase, which converts 2C-methyl-D-erythritol 2,4-cyclodiphosphate (ME-2,4CPP) into 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate in the sixth step of nonmevalonate terpenoid biosynthesis. The family is restricted to bacteria, where it is widely but not universally distributed. No homology can be detected between this family and other proteins.; GO: 0016114 terpenoid biosynthetic process. Probab=55.22 E-value=12 Score=15.77 Aligned_cols=21 Identities=19% Similarity=0.310 Sum_probs=9.9 Q ss_pred CCC--CEEEEECCEE-EEECCCCH Q ss_conf 798--1899953589-99678478 Q gi|254780127|r 620 NGY--DIVLTVHDEI-VSETPDTP 640 (675) Q Consensus 620 ~g~--~ivl~VHDEl-v~evp~~~ 640 (675) +|+ .+.+.+|-+. .-.++++. T Consensus 600 ~G~tPt~~lf~~gk~v~~~~~~~~ 623 (633) T TIGR00612 600 AGATPTIILFKKGKPVKRKVPEED 623 (633) T ss_pred CCCCCCEEEECCCEEEEEECCCCH T ss_conf 988653545408726877317002 No 144 >PRK10017 putative pyruvyl transferase; Provisional Probab=54.24 E-value=11 Score=16.09 Aligned_cols=71 Identities=17% Similarity=0.287 Sum_probs=47.7 Q ss_pred HHHEEEECCCCCEEEEEECCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHC Q ss_conf 32121476998789997335485058866228767866540444100001577608751105702210064210013307 Q gi|254780127|r 363 CVRSCIIASNGKKLVVADLAGIEARVLAWIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQ 442 (675) Q Consensus 363 ~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG 442 (675) .++..+-.++...++.-||+-.| .++.++++.-++.. -+|+.+++...|+| ..|+.|. T Consensus 298 ~i~~~m~~~~~~~il~~~~~~~E--~~~ii~~~dl~IG~-----RLHslIfA~~~gvP---------------~i~IsYd 355 (426) T PRK10017 298 NLRQHISDPARYHVVMDELNDLE--MGKILGACELTVGT-----RLHSAIISMNFATP---------------AIAINYE 355 (426) T ss_pred HHHHHHCCCCCEEEECCCCCHHH--HHHHHHHCHHHHHH-----HHHHHHHHHHCCCC---------------EEEEEEC T ss_conf 99997268763698389999899--99999739229988-----89999999975999---------------6984022 Q ss_pred CCHHHHHHHHHHCCCC Q ss_conf 8877898878870888 Q gi|254780127|r 443 GGAKVFKTMASHCGLD 458 (675) Q Consensus 443 ~G~~~l~~~~~~~G~~ 458 (675) --..++ ..+.|++ T Consensus 356 ~K~~g~---~~~lGl~ 368 (426) T PRK10017 356 HKSAGI---MQQLGLP 368 (426) T ss_pred HHHHHH---HHHCCCC T ss_conf 878999---9975993 No 145 >COG1158 Rho Transcription termination factor [Transcription] Probab=52.82 E-value=13 Score=15.53 Aligned_cols=66 Identities=18% Similarity=0.200 Sum_probs=37.0 Q ss_pred CCCCCCCHHHHEEEECCCCCEEEEEECCCCCEEEEEEECCCHHHHHHHH-----CCCCCCCCCHHHHHCCCCCCCCHHHH Q ss_conf 4432210132121476998789997335485058866228767866540-----44410000157760875110570221 Q gi|254780127|r 355 DPLGLASDCVRSCIIASNGKKLVVADLAGIEARVLAWIAGEDWKIKAFA-----NGEDIYVTTYAKAFNTPIDKVSKEQR 429 (675) Q Consensus 355 ~~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~li~~f~-----~g~D~h~~tA~~~~~~~~~~v~~~~R 429 (675) +...++|+--|.+++|||- ||-..|++-.. |..|.|... ..-..-.|+||..+| T Consensus 165 DL~~PIGkGQR~LIVAPPk-------------------aGKT~lLq~IA~aIt~N~Pe~~LiV--LLIDERPEEVTdmqr 223 (422) T COG1158 165 DLISPIGKGQRGLIVAPPK-------------------AGKTTLLQNIANAITTNHPECELIV--LLIDERPEEVTDMQR 223 (422) T ss_pred HHHCCCCCCCEEEEECCCC-------------------CCCHHHHHHHHHHHHCCCCCEEEEE--EEECCCCHHHHHHHH T ss_conf 6526567884656866987-------------------8733899999999863799649999--993478067777887 Q ss_pred HCCCCCHHHHHHC Q ss_conf 0064210013307 Q gi|254780127|r 430 AIGKVMELALGYQ 442 (675) Q Consensus 430 ~~aK~~~~gl~YG 442 (675) +.- .-+++.-|- T Consensus 224 sV~-geViaSTFD 235 (422) T COG1158 224 SVK-GEVVASTFD 235 (422) T ss_pred HHC-CEEEEECCC T ss_conf 524-069864488 No 146 >KOG2207 consensus Probab=51.04 E-value=14 Score=15.34 Aligned_cols=15 Identities=13% Similarity=0.379 Sum_probs=9.6 Q ss_pred CCCCCCCHHHHHCCC Q ss_conf 410000157760875 Q gi|254780127|r 406 EDIYVTTYAKAFNTP 420 (675) Q Consensus 406 ~D~h~~tA~~~~~~~ 420 (675) .|++....|.+|+.+ T Consensus 457 se~w~~~~s~if~s~ 471 (617) T KOG2207 457 SEIWHLLLSQIFESK 471 (617) T ss_pred HHHHHHHHHHHCCCC T ss_conf 899999999870688 No 147 >KOG1275 consensus Probab=49.70 E-value=5.4 Score=18.14 Aligned_cols=26 Identities=4% Similarity=-0.230 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 65666765478899999999987511 Q gi|254780127|r 467 KSTSTFEDWEQAESHHLWMQDQYPEF 492 (675) Q Consensus 467 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (675) +...|-....++..|..++....|.. T Consensus 962 FiDDYv~T~d~VvDYLTqySGI~PGD 987 (1118) T KOG1275 962 FIDDYVSTDDKVVDYLTQYSGIKPGD 987 (1118) T ss_pred CCCCEECCHHHHHHHHHHHCCCCCCC T ss_conf 20323314058888888744877666 No 148 >KOG2304 consensus Probab=49.50 E-value=15 Score=15.25 Aligned_cols=41 Identities=20% Similarity=0.325 Sum_probs=31.6 Q ss_pred CCHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHH Q ss_conf 21001330788778988788708886403666665666765478899999999987511101031088978899999986 Q gi|254780127|r 434 VMELALGYQGGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAESHHLWMQDQYPEFAVKDKLIGTACELVKKAWRAK 513 (675) Q Consensus 434 ~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ea~~~~~~~~~~ 513 (675) +.-||.+|-||+..|.-+ +..+.++-+.+.|++. T Consensus 231 aMklGagyPMGPfEL~Dy----------------------------------------------vGLDt~kfvmdgwhe~ 264 (298) T KOG2304 231 AMKLGAGYPMGPFELADY----------------------------------------------VGLDTCKFVMDGWHEG 264 (298) T ss_pred HHHCCCCCCCCHHHHHHH----------------------------------------------HHHHHHHHHHHHHHHC T ss_conf 985258998870888777----------------------------------------------4379999999999742 Q ss_pred CCCHHHH Q ss_conf 7307899 Q gi|254780127|r 514 HQGVLQL 520 (675) Q Consensus 514 ~p~i~~~ 520 (675) ||+...| T Consensus 265 ~pe~~~f 271 (298) T KOG2304 265 YPEDSLF 271 (298) T ss_pred CCCCCCC T ss_conf 8764335 No 149 >TIGR02336 TIGR02336 conserved hypothetical protein TIGR02336; InterPro: IPR012711 The function of proteins in this family is unknown. Members are found so far only in a small number of phylogenetically diverse bacteria, including Clostridium perfringens (in the firmicutes), Bifidobacterium longum (in the actinobacteria), and Vibrio vulnificus (in the proteobacteria).. Probab=41.09 E-value=2.2 Score=20.78 Aligned_cols=54 Identities=30% Similarity=0.377 Sum_probs=28.7 Q ss_pred CCCCEEEEEEECCCHH---------HHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCC Q ss_conf 5485058866228767---------8665404441000015776087511057022100642100133078 Q gi|254780127|r 382 AGIEARVLAWIAGEDW---------KIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQG 443 (675) Q Consensus 382 SqiElRvlA~ls~d~~---------li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~ 443 (675) .+.=+|++|-.-|++- .=+.|++|.|.=-+-.+.+ .+.+|-+=+...==++||| T Consensus 338 dGtt~R~iaDipGvkytEgRFlPYFFPDvF~EgnDPv~E~~~NW--------~kaRRAiLr~PldRiGYGG 400 (767) T TIGR02336 338 DGTTLRMIADIPGVKYTEGRFLPYFFPDVFREGNDPVVESLENW--------LKARRAILRSPLDRIGYGG 400 (767) T ss_pred CCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCEECCHHHH--------HHHHHHHHCCCHHHCCCCH T ss_conf 73013245417984133134377537861114878513034668--------9999997407620325543 No 150 >TIGR00922 nusG transcription termination/antitermination factor NusG; InterPro: IPR001062 Bacterial transcription antitermination protein, nusG, is a component of the transcription complex and interacts with the termination factor Rho and RNA polymerase , . NusG is a bacterial transcriptional elongation factor involved in transcription termination and anti-termination .; GO: 0003711 transcription elongation regulator activity, 0006355 regulation of transcription DNA-dependent. Probab=40.98 E-value=16 Score=15.00 Aligned_cols=15 Identities=40% Similarity=0.465 Sum_probs=11.7 Q ss_pred HEEEECCCCCEEEEEECC Q ss_conf 121476998789997335 Q gi|254780127|r 365 RSCIIASNGKKLVVADLA 382 (675) Q Consensus 365 R~~f~a~~G~~lv~aDyS 382 (675) |+|| |||+||-+|+. T Consensus 58 rK~f---PGYvLv~m~l~ 72 (193) T TIGR00922 58 RKIF---PGYVLVKMDLT 72 (193) T ss_pred EEEC---CCEEEEEEECC T ss_conf 1154---86489999738 No 151 >pfam07794 DUF1633 Protein of unknown function (DUF1633). This family contains sequences derived from a group of hypothetical proteins expressed by Arabidopsis thaliana. These sequences are highly similar and the region concerned is about 100 residues long. Probab=40.92 E-value=20 Score=14.33 Aligned_cols=10 Identities=0% Similarity=-0.326 Sum_probs=4.7 Q ss_pred CCCCCCCCCC Q ss_conf 0688855477 Q gi|254780127|r 322 GCVFQPQNLP 331 (675) Q Consensus 322 ~~~pNlQnlP 331 (675) .++|.|+-+- T Consensus 374 aShPsLpevn 383 (698) T pfam07794 374 ASHPSLPAAN 383 (698) T ss_pred CCCCCCCCCC T ss_conf 4688752123 No 152 >TIGR01512 ATPase-IB2_Cd cadmium-translocating P-type ATPase; InterPro: IPR006404 These sequences describe the P-type ATPase primarily responsible for translocating cadmium ions (and other closely-related divalent heavy metals such as cobalt, mercury, lead and zinc) across biological membranes. These transporters are found in prokaryotes and plants. Experimentally characterised members include P37617 from SWISSPROT from Escherichia coli, Q10866 from SWISSPROT from Mycobacterium tuberculosis and Q59998 from SWISSPROT from Synechocystis PCC6803. The cadmium P-type ATPases have been characterised as Type IB based on a phylogenetic analysis which combines the copper-translocating ATPases with the cadmium-translocating species . This family and that describing the copper-ATPases (IPR006403 from INTERPRO) are well separated, and thus the copper-ATPases can be typed as IB1 and the cadmium-ATPases as IB2. ; GO: 0005524 ATP binding, 0016818 hydrolase activity acting on acid anhydrides in phosphorus-containing anhydrides, 0046872 metal ion binding, 0046873 metal ion transmembrane transporter activity, 0030001 metal ion transport, 0016020 membrane. Probab=37.21 E-value=23 Score=13.96 Aligned_cols=22 Identities=9% Similarity=0.152 Sum_probs=12.2 Q ss_pred HHHCCCCCHHHHHHHHHHHHHH Q ss_conf 9744898548999999999754 Q gi|254780127|r 265 LSHSNITQLAKDLILNRLASSG 286 (675) Q Consensus 265 l~~~~~~~~~~~ll~~r~~~~~ 286 (675) ++..-.||.++-+.+++..... T Consensus 267 ~E~~S~HPlA~AIv~~~e~pG~ 288 (494) T TIGR01512 267 AEQQSSHPLARAIVAAEEVPGE 288 (494) T ss_pred HHHCCCCHHHHHHHHHHHCCCC T ss_conf 7310668689999974204777 No 153 >COG1029 FwdB Formylmethanofuran dehydrogenase subunit B [Energy production and conversion] Probab=36.81 E-value=21 Score=14.23 Aligned_cols=76 Identities=17% Similarity=0.219 Sum_probs=45.3 Q ss_pred HEEEEC--CCCCEEEEEECCCCCEEEEE------EECCCHHHHHHHH---CCCCCCCCCHHHHHCCCCCCCCHHHHHCCC Q ss_conf 121476--99878999733548505886------6228767866540---444100001577608751105702210064 Q gi|254780127|r 365 RSCIIA--SNGKKLVVADLAGIEARVLA------WIAGEDWKIKAFA---NGEDIYVTTYAKAFNTPIDKVSKEQRAIGK 433 (675) Q Consensus 365 R~~f~a--~~G~~lv~aDySqiElRvlA------~ls~d~~li~~f~---~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK 433 (675) |..|.+ +++..+|+.|----+---+| --..|-.++.+++ .|.-.|.. ..++|+|++++ .+.-..-| T Consensus 167 RG~~~~rGr~dRtvIvVD~RkT~TAklad~~~qi~p~sDyelisAl~~~l~G~~~~~~--eev~gvp~~~i-~e~a~~mK 243 (429) T COG1029 167 RGFFRPRGREDRTVIVVDPRKTATAKLADNHVQIKPNSDYELISALRAALHGKEPHRS--EEVAGVPIEEI-EELADMMK 243 (429) T ss_pred CCCCCCCCCCCCEEEEEECCCCCHHHHHHHEEECCCCCCHHHHHHHHHHHCCCCCCCC--HHHCCCCHHHH-HHHHHHHH T ss_conf 6422457766644999806767256654321756899867999999998558999872--42148798999-99999986 Q ss_pred CCHHHHHHCC Q ss_conf 2100133078 Q gi|254780127|r 434 VMELALGYQG 443 (675) Q Consensus 434 ~~~~gl~YG~ 443 (675) -++||.+|+| T Consensus 244 na~Fg~if~G 253 (429) T COG1029 244 NAKFGAIFVG 253 (429) T ss_pred CCCCCEEEEE T ss_conf 3981249983 No 154 >TIGR01591 Fdh-alpha formate dehydrogenase, alpha subunit; InterPro: IPR006478 This group of sequences describe a subset of formate dehydrogenase alpha chains found mainly in the archaea but also in alpha and gamma proteobacteria and a small number of Gram-positive bacteria. The alpha chain contains domains for molybdopterin dinucleotide binding and molybdopterin oxidoreductase. The holo-enzyme also contains beta and gamma subunits. The enzyme catalyzes the oxidation of formate (produced from pyruvate during anaerobic growth) to carbon dioxide with the concomitant release of two electrons and two protons. The enzyme's purpose is to allow growth on formate in some circumstances and, in the case of FdhH in gamma proteobacteria, to pass electrons to hydrogenase (by which process acid is neutralized) . The alpha subunit of a version of nitrate reductase is closely related.; GO: 0008863 formate dehydrogenase activity, 0015942 formate metabolic process, 0009326 formate dehydrogenase complex. Probab=33.90 E-value=8 Score=17.01 Aligned_cols=24 Identities=8% Similarity=0.057 Sum_probs=17.1 Q ss_pred CCHHHHEEEECCCCCEEEEEECCC Q ss_conf 101321214769987899973354 Q gi|254780127|r 360 ASDCVRSCIIASNGKKLVVADLAG 383 (675) Q Consensus 360 ~~~~iR~~f~a~~G~~lv~aDySq 383 (675) ....+|.+|+--++=++..+|.+. T Consensus 395 ~~G~~kalYI~GEdp~~sDp~~~~ 418 (694) T TIGR01591 395 LDGDLKALYIVGEDPVISDPDTKK 418 (694) T ss_pred HCCCEEEEEEECCCCEEECCCHHH T ss_conf 549716999835850223798589 No 155 >cd05532 POLBc_alpha DNA polymerase type-B alpha subfamily catalytic domain. Three DNA-dependent DNA polymerases type B (alpha, delta, and epsilon) have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase (Pol) alpha is almost exclusively required for the initiation of DNA replication and the priming of Okazaki fragments during elongation. In most organisms no specific repair role, other than check point control, has been assigned to this enzyme. Pol alpha contains both polymerase and exonuclease domains, but lacks exonuclease activity suggesting that the exonuclease domain may be for structural purposes only. Probab=33.87 E-value=26 Score=13.61 Aligned_cols=27 Identities=19% Similarity=0.238 Sum_probs=17.0 Q ss_pred CCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHH Q ss_conf 8228999788842217999998648898199863077599 Q gi|254780127|r 39 DEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERI 78 (675) Q Consensus 39 ~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~ 78 (675) +.||.+.|+.+-.| +++ ++||+.|+.. T Consensus 22 ~~~V~vlDF~SLYP------------SIi-~~~Ni~~~T~ 48 (400) T cd05532 22 DKFILLLDFNSLYP------------SII-QEYNICFTTV 48 (400) T ss_pred CCCEEEEECCCCCH------------HHH-HHHCCCHHHC T ss_conf 59879988321229------------999-9968797652 No 156 >COG2388 Predicted acetyltransferase [General function prediction only] Probab=31.40 E-value=29 Score=13.35 Aligned_cols=24 Identities=21% Similarity=0.466 Sum_probs=20.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 798999999999999967981899 Q gi|254780127|r 603 QAISRDILCEGMKNATKNGYDIVL 626 (675) Q Consensus 603 QgtAaDi~k~am~~l~~~g~~ivl 626 (675) ||.|+.+++.|+....++|++|+- T Consensus 54 qGia~~L~~~al~~ar~~g~kiiP 77 (99) T COG2388 54 QGIAQKLVEKALEEAREAGLKIIP 77 (99) T ss_pred CCHHHHHHHHHHHHHHHCCCEECC T ss_conf 747999999999999981985733 No 157 >cd07922 CarBa CarBa is the A subunit of 2-aminophenol 1,6-dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 2-aminophenyl-2,3-diol. CarBa is the A subunit of 2-aminophenol 1,6-dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 2-aminophenyl-2,3-diol. 2-aminophenol 1,6-dioxygenase is a key enzyme in the carbazole degradation pathway isolated from bacterial strains with carbazole degradation ability. The enzyme is a heterotetramer composed of two A and two B subunits. CarB belongs to the class III extradiol dioxygenase family, composed of enzymes which use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Although the enzyme was originally isolated as a meta-cleavage enzyme for 2'-aminobiphenyl-2,3-diol involved in carbazole degradation, the enzyme has also shown high specificity for 2,3-dihydroxybiphenyl. Probab=28.52 E-value=25 Score=13.74 Aligned_cols=52 Identities=13% Similarity=0.072 Sum_probs=34.7 Q ss_pred EECCCHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHH Q ss_conf 6228767866540444100001577608751105702210064210013307887789887 Q gi|254780127|r 391 WIAGEDWKIKAFANGEDIYVTTYAKAFNTPIDKVSKEQRAIGKVMELALGYQGGAKVFKTM 451 (675) Q Consensus 391 ~ls~d~~li~~f~~g~D~h~~tA~~~~~~~~~~v~~~~R~~aK~~~~gl~YG~G~~~l~~~ 451 (675) .+..||.+++.|++..+ .+|. + =..+.++|+.--...+|.+|+.|.+....| T Consensus 12 eL~r~p~~~e~~~~dp~-------av~e-~-y~Lt~~E~~AL~eg~~gaL~~iGVhp~l~M 63 (81) T cd07922 12 ELFKDPGLIERFQDDPS-------AVFE-E-YGLTPAERAALREGTFGALTSIGVHPILQM 63 (81) T ss_pred HHHCCHHHHHHHHHCHH-------HHHH-H-CCCCHHHHHHHHCCCHHHHHHCCCCHHHHH T ss_conf 99369888999874989-------9877-5-289989999986388646773673699999 No 158 >cd01779 Myosin_IXb_RA Myosin_IXb_RA RasGTP binding domain from guanine nucleotide exchange factors. In some proteins the domain acts as a RasGTP effector (AF6, canoe and RalGDS, for example), but in other cases it may not bind to RasGTP at all. Probab=28.19 E-value=31 Score=13.17 Aligned_cols=38 Identities=13% Similarity=0.233 Sum_probs=23.9 Q ss_pred HHHHHHHHHHHHHHHH--HHCCCCEEEEEC----CEEEEECCCC Q ss_conf 4798999999999999--967981899953----5899967847 Q gi|254780127|r 602 VQAISRDILCEGMKNA--TKNGYDIVLTVH----DEIVSETPDT 639 (675) Q Consensus 602 iQgtAaDi~k~am~~l--~~~g~~ivl~VH----DElv~evp~~ 639 (675) =|.||||++..++.++ +..+...+.-|| +|-|++--+- T Consensus 33 K~tTaaeVI~~vi~~L~Ld~~k~YvLaEVke~GgEEwiL~psD~ 76 (105) T cd01779 33 KDSTAADVIDDVIASLQLDGTKCYVLAEVKESGGEEWVLDPTDS 76 (105) T ss_pred CCCCHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEECCCCCC T ss_conf 66759999999999857787642799997416870654187668 No 159 >PRK06487 glycerate dehydrogenase; Provisional Probab=28.10 E-value=33 Score=12.98 Aligned_cols=64 Identities=14% Similarity=0.200 Sum_probs=39.1 Q ss_pred CCEEEEEECCCCCCCCCCCCCHHHCCCCEEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEHHHHHHHH Q ss_conf 97899980406788700046000026882999999983822899978884221799999864889819986307759999 Q gi|254780127|r 1 MSKLFIDIETRSPQPLPKVGVWAYAEQAVITLCAYAWDDEPVKLWDRTEQSAMPSDLLQYLRDETVMCVAHNSLFERILF 80 (675) Q Consensus 1 mk~l~lD~ET~s~~dl~~~Gl~~y~~~~~I~~i~~a~~~~~~~~~d~~~~~~~~~~l~~~L~~~~~~kv~HNa~FD~~vL 80 (675) ||.++||-.|.+..|+.-.++.. . -+++.+++.+ -|+++.+.+.+.+++ +.....+|..+| T Consensus 1 mk~v~ld~~~~~~~~~~~~~l~~---~-----------~~~~~~~~~t----~~~el~~~~~dadi~-i~~~~~i~~~~l 61 (317) T PRK06487 1 MRAVFLDHGSLDLGDLDLSPLEQ---A-----------FDELQLHAAT----RPEQVAERLQGAQVA-ISNKVALDAAAL 61 (317) T ss_pred CEEEEECCCCCCCCCCCHHHHHH---H-----------CCCEEEECCC----CHHHHHHHHCCCCEE-EECCCCCCHHHH T ss_conf 95999574448877779599985---2-----------9957995699----989999984899199-968971289999 Q ss_pred HHH Q ss_conf 986 Q gi|254780127|r 81 KKT 83 (675) Q Consensus 81 ~~~ 83 (675) ... T Consensus 62 ~~a 64 (317) T PRK06487 62 AAA 64 (317) T ss_pred HCC T ss_conf 349 No 160 >KOG0184 consensus Probab=27.98 E-value=33 Score=12.97 Aligned_cols=76 Identities=16% Similarity=0.124 Sum_probs=46.3 Q ss_pred CCCCCCCCCEEECCCCHHHCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--H---HHHHCCCCE Q ss_conf 21244564112246311310245201233203100123577653355777674798999999999--9---999679818 Q gi|254780127|r 550 VMRRHKRDVHIVLPSSRRLVYSDVKGDCSYLNTATSQLMRERTYGGKLTENIVQAISRDILCEGM--K---NATKNGYDI 624 (675) Q Consensus 550 ~~~~~~~~l~i~~psGR~l~~~~~~~~~~~~~~~~~~~~r~~~~g~k~~n~~iQgtAaDi~k~am--~---~l~~~g~~i 624 (675) ++..++..|+..-|||-...|.-.-... --|..-++|=|+-| + .+-+...+| T Consensus 140 ~yd~~g~~LymiepSG~~~~Y~~aaiGK-----------------------grq~aKtElEKL~~~~mt~~e~VkeaakI 196 (254) T KOG0184 140 SYDDEGPQLYMIEPSGSSYGYKGAAIGK-----------------------GRQAAKTELEKLKIDEMTCKELVKEAAKI 196 (254) T ss_pred EEECCCCEEEEECCCCCCCCEEEEECCC-----------------------HHHHHHHHHHHCCCCCCCHHHHHHHHHHE T ss_conf 8808984599975788733033565051-----------------------05788878886154525199999987724 Q ss_pred EEEECCEE-------------------EEECCCCHHHHHHHHH Q ss_conf 99953589-------------------9967847899999999 Q gi|254780127|r 625 VLTVHDEI-------------------VSETPDTPYFSVGTLC 648 (675) Q Consensus 625 vl~VHDEl-------------------v~evp~~~~~~~~~i~ 648 (675) +..||||. .-.||.+-.+++++-. T Consensus 197 iY~~HDe~KdK~feiEm~wvg~eTnG~h~~vp~el~~ea~~~a 239 (254) T KOG0184 197 IYKVHDENKDKEFEIEMGWVGEETNGLHEKVPSELLEEAEKYA 239 (254) T ss_pred EEEECCCCCCCCEEEEEEEEEEECCCCCCCCCHHHHHHHHHHH T ss_conf 6761263568513799989886138843429388999999999 No 161 >TIGR01075 uvrD DNA helicase II; InterPro: IPR005753 The SOS response is required for efficient nucleotide excision repair (NER) of the major ultraviolet light (UV) induced DNA lesions. The expression of three NER genes, uvrA, uvrB, and uvrD, is upregulated as part of the SOS response. UvrD differs from the others in that it is not involved in lesion recognition but rather in promoting the post-incision steps of NER, including turnover of the UvrBC incision complex . The form of the UvrD helicase with optimal helicase activity is oligomeric with at least two sites for binding the DNA substrate, where one site contacts regions of the 3'-ssDNA tail that are distal from the single-stranded/double-stranded DNA junction.; GO: 0004003 ATP-dependent DNA helicase activity, 0006268 DNA unwinding during replication, 0005737 cytoplasm. Probab=27.30 E-value=16 Score=15.05 Aligned_cols=13 Identities=15% Similarity=0.079 Sum_probs=7.5 Q ss_pred HHHHHHHHHHHCC Q ss_conf 9999999987200 Q gi|254780127|r 167 VEATRELFKRLIP 179 (675) Q Consensus 167 v~~~~~L~~~L~~ 179 (675) +++.++-|+.|.. T Consensus 190 AELLLRa~ELl~~ 202 (741) T TIGR01075 190 AELLLRAYELLRN 202 (741) T ss_pred HHHHHHHHHHHCC T ss_conf 9999999997528 No 162 >TIGR02049 gshA_ferroox glutamate--cysteine ligase; InterPro: IPR011718 This entry represents a rare family of glutamate--cysteine ligases, demonstrated first in Thiobacillus ferrooxidans and present in a few other Proteobacteria . It is the first of two enzymes for glutathione biosynthesis. It is also called gamma-glutamylcysteine synthetase.. Probab=27.04 E-value=8.3 Score=16.92 Aligned_cols=20 Identities=10% Similarity=-0.099 Sum_probs=9.4 Q ss_pred HHHHHHHHHHCCEECCHHHHHH Q ss_conf 7899998751684308899999 Q gi|254780127|r 187 LWLLDQTINDRGYRIDLDLVLK 208 (675) Q Consensus 187 l~~~~~~me~~Gi~vD~e~l~~ 208 (675) .-.+..++. ||-++.+.-+. T Consensus 219 i~p~f~~c~--G~DF~~~~g~~ 238 (436) T TIGR02049 219 INPYFEKCD--GIDFDDREGED 238 (436) T ss_pred CCCHHHCCC--CCCCCCCCCHH T ss_conf 120000106--63356103536 No 163 >PRK05359 oligoribonuclease; Provisional Probab=27.01 E-value=34 Score=12.86 Aligned_cols=11 Identities=27% Similarity=0.604 Sum_probs=8.5 Q ss_pred CEEEEEECCCC Q ss_conf 78999804067 Q gi|254780127|r 2 SKLFIDIETRS 12 (675) Q Consensus 2 k~l~lD~ET~s 12 (675) +.++||+|+++ T Consensus 4 ~lvWiDlEMTG 14 (181) T PRK05359 4 NLIWIDLEMTG 14 (181) T ss_pred CEEEEECCCCC T ss_conf 65899761348 No 164 >TIGR02093 P_ylase glycogen/starch/alpha-glucan phosphorylases; InterPro: IPR011833 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described . This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site . The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Proteins in this entry are glycosyltransferases with phosphorylase activities. Members use phosphate to break alpha 1,4 linkages between pairs of glucose residues at the end of long glucose polymers, releasing alpha-D-glucose 1-phosphate. The nomenclature convention is to preface the name according to the natural substrate, as in glycogen phosphorylase, starch phosphorylase, maltodextrin phosphorylase, etc. The main role of glycogen phosphorylase (GPase) is to provide phosphorylated glucose molecules (G-1-P) . GPase is a highly regulated allosteric enzyme. The net effect of the regulatory site allows the enzyme to operate at a variety of rates; the enzyme is not simply regulated as "on" or "off", but rather it can be thought of being set to operate at an ideal rate based on changing conditions at in the cell. The most important allosteric effector is the phosphate molecule covalently attached to Ser14. This switches GPase from the b (inactive) state to the a (active) state. Upon phosphorylation, GPase attains about 800f its Vmax. When the enzyme is not phosphorylated, GPase activity is practically non-existent at low AMP levels . There is some apparent controversy as to the structure of GPase. All sources agree that the enzyme is multimeric, but there is apparent controversy as to the enzyme being a tetramer or a dimer. Apparently, GPase (in the a form) forms tetramers in the crystal form. The consensus seems to be that 'regardless of the a or b form, GPase functions as a dimer in vivo . The GPase monomer is best described as consisting of two domains, an N-terminal domain and a C-terminal domain . The C-terminal domain is often referred to as the catalytic domain. It consists of a beta-sheet core surrounded by layers of helical segments . The vitamin cofactor pyridoxal phosphate (PLP) is covalently attached to the amino acid backbone. The N-terminal domain also consists of a central beta-sheet core and is surrounded by layers of helical segments. The N-terminal domain contains different allosteric effector sites to regulate the enzyme. Bacterial phosphorylases follow the same catalytic mechanisms as their plant and animal counterparts, but differ considerably in terms of their substrate specificity and regulation. The catalytic domains are highly conserved while the regulatory sites are only poorly conserved. For maltodextrin phosphorylase from Escherichia coli the physiological role of the enzyme in the utilisation of maltidextrins is known in detail; that of all the other bacterial phosphorylases is still unclear. Roles in regulatuon of endogenous glycogen metabolism in periods of starvation, and sporulation, stress response or quick adaptation to changing environments are possible .; GO: 0004645 phosphorylase activity, 0030170 pyridoxal phosphate binding, 0005975 carbohydrate metabolic process. Probab=26.11 E-value=35 Score=12.75 Aligned_cols=48 Identities=15% Similarity=0.062 Sum_probs=24.8 Q ss_pred HHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 688999999972899988896588999974489854899999999975 Q gi|254780127|r 238 TYALRMYLFLITGIDLVDMSEGTLKSILSHSNITQLAKDLILNRLASS 285 (675) Q Consensus 238 ~~ql~~~L~~~~g~~~~~~~k~~~~~~l~~~~~~~~~~~ll~~r~~~~ 285 (675) ++-|+.+|-+.+|-.--.++-+.++++....+.+.+.+++....+..| T Consensus 482 NP~LS~l~~e~iGt~~W~~~l~~L~~L~~ya~D~~f~~~~~~vK~~NK 529 (822) T TIGR02093 482 NPGLSALLTETIGTDEWLTDLDLLKKLEPYADDSEFLEKFAQVKQANK 529 (822) T ss_pred CHHHHHHHHHHHCCCHHHCCHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 813689998762467222147988511100489899999999999999 No 165 >TIGR01418 PEP_synth phosphoenolpyruvate synthase; InterPro: IPR006319 This family represents phosphoenolpyruvate synthase; also called pyruvate,water dikinase and PEP synthase. The member from Methanococcus jannaschii contains a large intein. This enzyme generates phosphoenolpyruvate (PEP) from pyruvate, hydrolyzing ATP to AMP and releasing inorganic phosphate in the process. The enzyme shows extensive homology to other enzymes that use PEP as substrate or product. This enzyme may provide PEP for gluconeogenesis, for PTS-type carbohydrate transport systems, or for other processes. ; GO: 0008986 pyruvate water dikinase activity, 0006094 gluconeogenesis. Probab=25.83 E-value=36 Score=12.72 Aligned_cols=41 Identities=17% Similarity=0.245 Sum_probs=23.6 Q ss_pred HHHHHH-HCCCC----EEEEECCEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCEEECC Q ss_conf 999999-67981----899953589996784789999999998467630216984157303 Q gi|254780127|r 613 GMKNAT-KNGYD----IVLTVHDEIVSETPDTPYFSVGTLCSLMTKNPSWAKGLPLKAEGY 668 (675) Q Consensus 613 am~~l~-~~g~~----ivl~VHDElv~evp~~~~~~~~~i~~~M~~~p~wa~~~pl~a~~~ 668 (675) ||.++. +.|+. |+-.|- ..++++++.++|+.. ||+.+-+|. T Consensus 710 A~k~VR~eMGLtNV~vMiPFvR----------T~~E~~rv~~im~~~-----GL~~g~nGl 755 (877) T TIGR01418 710 AIKRVREEMGLTNVEVMIPFVR----------TPEEGKRVLEIMAEE-----GLKRGKNGL 755 (877) T ss_pred HHHHHHHHHCCCCCEECCCCCC----------CHHHHHHHHHHHHHC-----CCCCCCCCC T ss_conf 9998876508513311357548----------889999999999972-----888477855 No 166 >KOG1401 consensus Probab=25.74 E-value=33 Score=12.97 Aligned_cols=12 Identities=17% Similarity=0.271 Sum_probs=5.7 Q ss_pred HHHHHHHHHHCC Q ss_conf 999999998467 Q gi|254780127|r 642 FSVGTLCSLMTK 653 (675) Q Consensus 642 ~~~~~i~~~M~~ 653 (675) |..+...++|++ T Consensus 415 E~i~~~l~i~~k 426 (433) T KOG1401 415 EVIQRLLTILEK 426 (433) T ss_pred HHHHHHHHHHHH T ss_conf 999999999999 No 167 >PRK00050 mraW S-adenosyl-methyltransferase MraW; Provisional Probab=24.57 E-value=38 Score=12.57 Aligned_cols=14 Identities=14% Similarity=-0.076 Sum_probs=7.3 Q ss_pred CCCHHHHHHHHHHH Q ss_conf 88568899999997 Q gi|254780127|r 235 SRQTYALRMYLFLI 248 (675) Q Consensus 235 ~~S~~ql~~~L~~~ 248 (675) +.++.+|.+++... T Consensus 177 i~tt~~L~~ii~~~ 190 (309) T PRK00050 177 ITTTKELAEIIASA 190 (309) T ss_pred CCCHHHHHHHHHHH T ss_conf 72489999999996 No 168 >TIGR01113 mtrE tetrahydromethanopterin S-methyltransferase, subunit E; InterPro: IPR005780 This model describes N5-methyltetrahydromethanopterin: coenzyme M methyltransferase subunit E in methanogenic archaea. This methyltranferase is a membrane-associated enzyme complex that uses methyl-transfer reaction to drive sodium-ion pump.5-methyl-5,6,7,8-tetrahydromethanopterin + 2-mercaptoethanesulphonate = 5,6,7,8-tetrahydromethanopterin + 2-(methylthio)ethanesulphonate. Archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase (encoded by subunit A) is involved in the transfer of 'methyl' group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0006814 sodium ion transport, 0005737 cytoplasm, 0012506 vesicle membrane. Probab=23.10 E-value=40 Score=12.38 Aligned_cols=26 Identities=12% Similarity=0.009 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 999999999863057668888568899 Q gi|254780127|r 216 ERKKLDEDIVKLTDGIIRSSRQTYALR 242 (675) Q Consensus 216 ~~~~~~~~i~~~~g~~~~n~~S~~ql~ 242 (675) ......|+++.-.|..- ||||.-||+ T Consensus 12 tIAGa~EDlESDvGSQS-NPNSQVQLA 37 (298) T TIGR01113 12 TIAGAAEDLESDVGSQS-NPNSQVQLA 37 (298) T ss_pred HHHHHHHHHHHCCCCCC-CCCHHHCCC T ss_conf 99756765551577646-886021253 No 169 >TIGR02201 heptsyl_trn_III lipopolysaccharide heptosyltransferase III, putative; InterPro: IPR011916 This family consists of examples of the putative ADP-heptose:LPS heptosyltransferase III, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria. This enzyme may be less widely distributed than heptosyltransferases I and II.. Probab=21.90 E-value=38 Score=12.52 Aligned_cols=26 Identities=27% Similarity=0.553 Sum_probs=12.1 Q ss_pred HHHCCCCEEEEECCEEEEECCCC-HHHHHHHHHH Q ss_conf 99679818999535899967847-8999999999 Q gi|254780127|r 617 ATKNGYDIVLTVHDEIVSETPDT-PYFSVGTLCS 649 (675) Q Consensus 617 l~~~g~~ivl~VHDElv~evp~~-~~~~~~~i~~ 649 (675) +...||.+|| ...||. |.+-|.+|.+ T Consensus 212 L~a~Gy~vVL-------TsgPD~~e~~mV~~I~~ 238 (347) T TIGR02201 212 LAAEGYEVVL-------TSGPDKDELKMVDEIAA 238 (347) T ss_pred HHHCCCEEEE-------CCCCCHHHHHHHHHHHH T ss_conf 8518956988-------28989867999999971 No 170 >TIGR00972 3a0107s01c2 phosphate ABC transporter, ATP-binding protein; InterPro: IPR005670 This is a family of phosphate transport system permease proteins.; GO: 0005315 inorganic phosphate transmembrane transporter activity, 0015114 phosphate transmembrane transporter activity, 0006817 phosphate transport, 0016020 membrane. Probab=21.71 E-value=37 Score=12.60 Aligned_cols=25 Identities=12% Similarity=0.275 Sum_probs=14.2 Q ss_pred CCCEEEEEEH---HHHHHHHHHHCCCCC Q ss_conf 8981998630---775999998629987 Q gi|254780127|r 64 ETVMCVAHNS---LFERILFKKTLGIDI 88 (675) Q Consensus 64 ~~~~kv~HNa---~FD~~vL~~~lGi~i 88 (675) ..|+.-++|+ ++|..-|.+.-|.=+ T Consensus 61 G~v~f~G~dIy~~~~D~~~LR~~vGMVF 88 (248) T TIGR00972 61 GKVLFDGQDIYDKKIDVVELRKRVGMVF 88 (248) T ss_pred EEEEECCCCCCCCCCCHHHHHHHCCCCC T ss_conf 8898645114565668788762258521 No 171 >TIGR00629 uvde UV damage endonuclease UvdE; InterPro: IPR004601 Schizosaccharomyces pombe ultraviolet damage endonuclease (UVDE or Uve1p) performs the initial step in an alternative excision repair pathway for UV-induced DNA damage. This DNA repair pathway was originally thought to be specific for UV damage, however Uve1p also recognises UV-induced bipyrimidine photoadducts and other non-UV-induced DNA adducts . The Deinococcus radiodurans UVSE protein has also shown to be a UV DNA damage endonuclease that catalyzes repair of UV-induced DNA damage by a similar mechanism .; GO: 0004519 endonuclease activity, 0006289 nucleotide-excision repair. Probab=21.64 E-value=34 Score=12.84 Aligned_cols=90 Identities=16% Similarity=0.009 Sum_probs=42.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-CC Q ss_conf 99898877539999999987200214310178999987516843088999999999999999999999863057668-88 Q gi|254780127|r 157 QLFGEYCKRDVEATRELFKRLIPLSDGERDLWLLDQTINDRGYRIDLDLVLKLQELIAQERKKLDEDIVKLTDGIIR-SS 235 (675) Q Consensus 157 ~~~~~Ya~~Dv~~~~~L~~~L~~~~~~E~~l~~~~~~me~~Gi~vD~e~l~~~~~~~~~~~~~~~~~i~~~~g~~~~-n~ 235 (675) +.+.+-|..|...-.++...|. +.+...+-..+...| -|-.=|.+.. .+.+.+..+++..++++-.=-+.. .- T Consensus 151 e~Vv~sAirDlaYHd~~L~~mk-l~e~~N~DaV~~IH~--GG~fg~K~~t---l~rF~~Ny~rLsq~~K~RlvLENDD~~ 224 (345) T TIGR00629 151 ESVVKSAIRDLAYHDELLSAMK-LAEQLNKDAVLIIHL--GGAFGNKDTT---LARFKKNYKRLSQSIKERLVLENDDVT 224 (345) T ss_pred HHHHHHHHHHHHHHHHHHHHCC-CCCCCCCCCEEEEEC--CCCCCCHHHH---HHHHHHHHHHCCHHHHCCCCEECCCCC T ss_conf 5676777654899999998606-700116653888743--8876767899---999998674044676403511027863 Q ss_pred CCHHHHHHHHHHHCCCCC Q ss_conf 856889999999728999 Q gi|254780127|r 236 RQTYALRMYLFLITGIDL 253 (675) Q Consensus 236 ~S~~ql~~~L~~~~g~~~ 253 (675) .|...+..++ +.+.+|. T Consensus 225 wtv~dlLpvC-e~lniP~ 241 (345) T TIGR00629 225 WTVEDLLPVC-EELNIPL 241 (345) T ss_pred CCHHHHHHHH-HHCCCCE T ss_conf 1054254321-2217986 No 172 >TIGR00946 2a69 Auxin Efflux Carrier; InterPro: IPR014024 This entry is mostly composed of known or predicted PIN proteins from plants, though some homologous prokaryotic proteins are also included. The PIN proteins are components of auxin efflux systems from plants. These carriers are saturable, auxin-specific, and localized to the basal ends of auxin transport-competent cells , . Plants typically posses several of these proteins, each displaying a unique tissue-specific expression pattern. They are expressed in almost all plant tissues including vascular tissues and roots, and influence many processes including the establishment of embryonic polarity, plant growth, apical hook formation in seedlings and the photo- and gravitrophic responses. These plant proteins are typically 600-700 amino acyl residues long and exhibit 8-12 transmembrane segments.. Probab=20.90 E-value=44 Score=12.10 Aligned_cols=20 Identities=15% Similarity=0.195 Sum_probs=11.7 Q ss_pred CCCEEEEEEECCCHHHHHHH Q ss_conf 48505886622876786654 Q gi|254780127|r 383 GIEARVLAWIAGEDWKIKAF 402 (675) Q Consensus 383 qiElRvlA~ls~d~~li~~f 402 (675) .-|+.+..|=|.-.-.-+++ T Consensus 345 ~~el~MFVWsS~aSPVSd~~ 364 (608) T TIGR00946 345 AKELHMFVWSSSASPVSDAA 364 (608) T ss_pred CCEEEEEEEECCCCCHHHHC T ss_conf 72235788605788501103 No 173 >TIGR02186 alph_Pro_TM conserved hypothetical protein; InterPro: IPR011734 This family consists of predicted transmembrane proteins of about 270 amino acids. Members are found, so far, only among the Alphaproteobacteria and only once in each genome.. Probab=20.88 E-value=44 Score=12.09 Aligned_cols=156 Identities=17% Similarity=0.158 Sum_probs=72.7 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHEEEECCCCCEEEEEECCCCCEEEEEEECCCHHHHHHH Q ss_conf 68885547777665234565443102444554443221013212147699878999733548505886622876786654 Q gi|254780127|r 323 CVFQPQNLPRQERSEEILTQTIQALNRGETIADPLGLASDCVRSCIIASNGKKLVVADLAGIEARVLAWIAGEDWKIKAF 402 (675) Q Consensus 323 ~~pNlQnlPr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iR~~f~a~~G~~lv~aDySqiElRvlA~ls~d~~li~~f 402 (675) +-|++||+|+--++.++....-. |=-+-...-.+=|..||++-||-+-..+-++..++.+- T Consensus 18 ~~a~aQ~~~~alPPGPQ~~p~~E-------------------~leig~St~~i~itsnF~G~~LtlFGa~~~~d~~~~~~ 78 (275) T TIGR02186 18 TVAAAQVLELALPPGPQVRPLEE-------------------RLEIGVSTEEIAITSNFRGADLTLFGALDNADPLLLAQ 78 (275) T ss_pred CCHHHCCCCCCCCCCCCCCCCCE-------------------EEEEEEECCEEEECCCCCCCCEEEEECCCCCCCCHHHC T ss_conf 76224264102777788887430-------------------16886535756515688987607860126633016753 Q ss_pred HCCCCCCCCCHHHHHCCCCCCCC--HHHHHCCCCCHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 04441000015776087511057--0221006421001330788778988788708886403666665666765478899 Q gi|254780127|r 403 ANGEDIYVTTYAKAFNTPIDKVS--KEQRAIGKVMELALGYQGGAKVFKTMASHCGLDLQQFSQNVKSTSTFEDWEQAES 480 (675) Q Consensus 403 ~~g~D~h~~tA~~~~~~~~~~v~--~~~R~~aK~~~~gl~YG~G~~~l~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~ 480 (675) .-.||=-.. .=|.++++ |.+|-.|=.+|=... .|...=..| .+.....++..+ ..+ T Consensus 79 -g~YdivV~l-----~GP~~~~vvRkKeR~~GiWiNt~~~------tF~~vP~~y---------sL~stR~~~~ia-~~~ 136 (275) T TIGR02186 79 -GKYDIVVTL-----EGPRDDMVVRKKERVFGIWINTDSR------TFLQVPESY---------SLASTRNIDDIA-SPK 136 (275) T ss_pred -CCCCEEEEE-----ECCCCCCEEEEECCEEEEEECCCCH------HHHHHHHHH---------HHHHCCCCCCCC-CHH T ss_conf -695379997-----5787771361210276577615503------444116778---------886437533438-988 Q ss_pred HHHHHHHHHHHCCCEECCHHHHHHHHH-HHHHHHCCCHHH Q ss_conf 999999987511101031088978899-999986730789 Q gi|254780127|r 481 HHLWMQDQYPEFAVKDKLIGTACELVK-KAWRAKHQGVLQ 519 (675) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~ea~~~~-~~~~~~~p~i~~ 519 (675) -...++-+..+.-.+..++...+..+. |.|+++|.+|+. T Consensus 137 ~~r~~~iGv~~~~L~~~~~~~~~d~v~nD~F~~afrrlr~ 176 (275) T TIGR02186 137 TLRRMGIGVDNLRLVPRVFGGYADNVVNDEFRNAFRRLRL 176 (275) T ss_pred HHHHHCCCEEEEEEEECCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 9997427631257300678888765565206789999887 No 174 >TIGR02109 PQQ_syn_pqqE coenzyme PQQ biosynthesis protein E; InterPro: IPR011843 This entry describes coenzyme PQQ biosynthesis protein E, a gene required for the biosynthesis of pyrrolo-quinoline-quinone (coenzyme PQQ). PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.; GO: 0051539 4 iron 4 sulfur cluster binding, 0018189 pyrroloquinoline quinone biosynthetic process. Probab=20.09 E-value=39 Score=12.45 Aligned_cols=14 Identities=7% Similarity=0.064 Sum_probs=4.9 Q ss_pred HHHHHHHHHHHHHC Q ss_conf 89788999999867 Q gi|254780127|r 501 TACELVKKAWRAKH 514 (675) Q Consensus 501 ~ea~~~~~~~~~~~ 514 (675) ++|.++++.+|+++ T Consensus 203 ~~a~r~V~~aRer~ 216 (363) T TIGR02109 203 EEATRIVEEARERL 216 (363) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999998 Done!