Query gi|254780277|ref|YP_003064690.1| DNA polymerase I [Candidatus Liberibacter asiaticus str. psy62] Match_columns 976 No_of_seqs 263 out of 3266 Neff 6.9 Searched_HMMs 39220 Date Tue May 24 13:05:25 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780277.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK07556 consensus 100.0 0 0 2406.9 80.8 965 1-976 4-977 (977) 2 PRK07997 consensus 100.0 0 0 2352.3 75.2 914 1-976 4-928 (928) 3 PRK06887 consensus 100.0 0 0 2342.7 77.4 920 3-976 5-954 (954) 4 PRK07625 consensus 100.0 0 0 2322.2 75.6 911 1-976 3-922 (922) 5 PRK08835 consensus 100.0 0 0 2317.4 78.0 915 3-976 5-931 (931) 6 PRK08786 consensus 100.0 0 0 2315.8 75.1 920 5-976 2-927 (927) 7 PRK05755 DNA polymerase I; Pro 100.0 0 0 2301.9 77.1 885 1-976 1-889 (889) 8 PRK08928 consensus 100.0 0 0 2293.5 75.1 858 1-976 1-859 (861) 9 PRK07898 consensus 100.0 0 0 2259.9 76.9 881 2-976 12-901 (902) 10 PRK07456 consensus 100.0 0 0 2256.9 76.6 916 2-976 5-975 (975) 11 PRK08434 consensus 100.0 0 0 2253.3 74.0 874 5-975 2-886 (887) 12 TIGR00593 pola DNA polymerase 100.0 0 0 2259.4 55.0 930 7-976 1-1005(1005) 13 PRK08076 consensus 100.0 0 0 2230.1 73.6 868 1-976 1-877 (877) 14 PRK05929 consensus 100.0 0 0 2232.3 71.0 861 5-975 2-870 (870) 15 PRK05797 consensus 100.0 0 0 2222.3 75.7 860 5-976 3-869 (869) 16 PRK07300 consensus 100.0 0 0 2216.2 72.9 862 1-976 1-880 (880) 17 COG0749 PolA DNA polymerase I 100.0 0 0 1385.1 35.9 586 367-976 2-593 (593) 18 pfam00476 DNA_pol_A DNA polyme 100.0 0 0 973.1 34.6 382 590-975 1-383 (383) 19 cd06444 DNA_pol_A DNA polymera 100.0 0 0 907.0 33.5 379 590-974 1-379 (379) 20 smart00475 53EXOc 5'-3' exonuc 100.0 0 0 639.0 26.9 255 5-265 1-259 (259) 21 PRK09482 xni exonuclease IX; P 100.0 0 0 625.9 24.8 253 1-265 1-255 (256) 22 cd00008 53EXOc 5'-3' exonuclea 100.0 0 0 580.5 24.3 240 5-249 1-240 (240) 23 COG0258 Exo 5'-3' exonuclease 100.0 0 0 519.0 24.3 286 1-291 7-300 (310) 24 smart00482 POLAc DNA polymeras 100.0 0 0 441.0 17.7 206 730-935 1-206 (206) 25 pfam02739 5_3_exonuc_N 5'-3' e 100.0 0 0 391.5 17.7 164 5-172 1-169 (169) 26 cd06139 DNA_polA_I_Ecoli_like_ 100.0 5.9E-39 1.5E-43 302.7 9.8 187 384-582 2-193 (193) 27 cd06128 DNA_polA_exo The 3'-5' 100.0 1.3E-36 3.3E-41 285.6 8.1 162 387-558 1-166 (166) 28 cd06140 DNA_polA_I_Bacillus_li 100.0 2.7E-32 6.8E-37 254.2 8.5 175 387-582 3-178 (178) 29 PHA00439 exonuclease 100.0 1.3E-30 3.3E-35 241.8 15.1 241 1-258 1-277 (288) 30 cd00128 XPG Xeroderma pigmento 100.0 7.3E-29 1.9E-33 229.1 19.6 261 5-290 24-312 (316) 31 pfam01367 5_3_exonuc 5'-3' exo 100.0 1.2E-30 3.1E-35 242.1 8.3 100 174-274 1-100 (100) 32 smart00474 35EXOc 3'-5' exonuc 99.9 9.4E-25 2.4E-29 199.1 10.1 170 369-561 2-172 (172) 33 pfam01612 3_5_exonuc 3'-5' exo 99.9 3E-24 7.7E-29 195.4 10.3 170 368-561 1-172 (172) 34 cd00007 35EXOc 3'-5' exonuclea 99.9 1.5E-23 3.7E-28 190.5 7.1 151 388-560 1-155 (155) 35 cd06142 RNaseD_exo Ribonucleas 99.9 3.5E-22 8.9E-27 180.4 8.1 171 376-574 1-173 (178) 36 PRK03980 flap endonuclease-1; 99.9 8.2E-21 2.1E-25 170.4 14.7 237 29-291 2-282 (295) 37 PTZ00217 flap endonuclease-1; 99.9 3.6E-19 9.3E-24 158.4 22.4 264 5-291 28-333 (394) 38 TIGR03674 fen_arch flap struct 99.9 8.3E-20 2.1E-24 163.0 18.5 256 5-290 21-325 (338) 39 cd00080 HhH2_motif Helix-hairp 99.8 1.2E-20 3E-25 169.2 6.3 75 170-247 1-75 (75) 40 cd06147 Rrp6p_like_exo Yeast R 99.8 1.5E-18 3.8E-23 153.9 10.2 172 366-564 3-176 (192) 41 COG0349 Rnd Ribonuclease D [Tr 99.7 4.4E-16 1.1E-20 135.9 9.7 175 371-571 1-177 (361) 42 PRK10829 ribonuclease D; Provi 99.6 2.2E-15 5.6E-20 130.8 11.0 177 367-570 2-180 (373) 43 KOG3657 consensus 99.6 2.1E-14 5.3E-19 123.7 13.9 211 728-948 718-999 (1075) 44 cd06148 Egl_like_exo The Egali 99.6 4.7E-15 1.2E-19 128.4 9.0 175 379-572 2-189 (197) 45 KOG2519 consensus 99.6 9.6E-14 2.5E-18 118.8 14.8 256 6-283 29-316 (449) 46 cd06141 WRN_exo WRN is a uniqu 99.5 5.4E-13 1.4E-17 113.4 9.2 162 372-556 2-168 (168) 47 cd06129 RNaseD_like The RNase 99.4 4.4E-13 1.1E-17 114.0 7.3 149 380-553 5-158 (159) 48 cd06146 mut-7_like_exo The mut 99.4 3.7E-12 9.3E-17 107.3 10.1 168 369-556 2-191 (191) 49 TIGR01388 rnd ribonuclease D; 99.3 8E-12 2E-16 104.8 9.6 182 370-577 1-183 (374) 50 smart00279 HhH2 Helix-hairpin- 99.3 1.1E-12 2.7E-17 111.2 3.2 36 176-211 1-36 (36) 51 TIGR01405 polC_Gram_pos DNA po 99.3 2.7E-12 7E-17 108.2 4.4 190 361-573 178-386 (1264) 52 PRK06309 DNA polymerase III su 99.2 9.1E-11 2.3E-15 97.1 7.9 167 388-575 3-181 (232) 53 PRK07740 hypothetical protein; 99.1 1.1E-10 2.7E-15 96.6 6.2 158 384-568 54-230 (240) 54 PRK06310 DNA polymerase III su 99.1 3.8E-10 9.6E-15 92.6 6.6 182 384-591 4-204 (250) 55 cd06126 DEDDy DEDDy exonucleas 99.0 5.7E-10 1.5E-14 91.3 6.6 156 389-554 1-181 (181) 56 COG2176 PolC DNA polymerase II 99.0 3.5E-10 9E-15 92.8 5.1 139 365-512 404-558 (1444) 57 PRK06807 DNA polymerase III su 99.0 7.4E-10 1.9E-14 90.5 6.4 154 387-563 8-175 (313) 58 PRK08517 DNA polymerase III su 99.0 6.1E-10 1.6E-14 91.1 5.4 156 384-562 67-235 (259) 59 PRK00448 polC DNA polymerase I 99.0 1E-09 2.6E-14 89.4 6.5 126 383-512 414-555 (1436) 60 PRK08074 bifunctional ATP-depe 99.0 2E-09 5.1E-14 87.3 7.2 154 387-562 3-171 (932) 61 PRK09145 DNA polymerase III su 99.0 1.5E-09 3.8E-14 88.3 6.2 157 380-558 22-199 (203) 62 PRK06631 consensus 98.9 1.7E-09 4.5E-14 87.8 6.1 150 387-558 4-174 (229) 63 KOG2518 consensus 98.9 1E-07 2.6E-12 74.9 15.1 198 5-219 24-253 (556) 64 PRK07942 DNA polymerase III su 98.9 4.4E-09 1.1E-13 84.8 7.5 152 387-560 6-180 (234) 65 PRK07246 bifunctional ATP-depe 98.9 3.9E-09 1E-13 85.2 6.9 154 387-563 7-173 (820) 66 PRK07883 hypothetical protein; 98.9 2.1E-09 5.2E-14 87.2 5.3 165 381-567 18-198 (575) 67 KOG2206 consensus 98.9 4.8E-09 1.2E-13 84.6 6.9 175 361-563 186-363 (687) 68 cd06131 DNA_pol_III_epsilon_Ec 98.9 3.7E-09 9.4E-14 85.4 6.3 145 390-556 2-166 (167) 69 TIGR00600 rad2 DNA excision re 98.9 4.6E-08 1.2E-12 77.4 11.2 73 4-85 23-96 (1127) 70 PRK07009 consensus 98.8 4.5E-09 1.2E-13 84.7 5.2 147 389-557 3-169 (241) 71 PRK05711 DNA polymerase III su 98.8 8.8E-09 2.2E-13 82.6 6.4 148 388-557 5-173 (240) 72 PRK06022 consensus 98.8 6.5E-09 1.6E-13 83.6 5.6 148 389-557 3-167 (234) 73 PRK06195 DNA polymerase III su 98.8 1.3E-08 3.4E-13 81.4 6.7 157 388-569 2-173 (309) 74 PRK08816 consensus 98.8 1.2E-08 3E-13 81.7 5.2 146 389-557 3-170 (244) 75 TIGR01406 dnaQ_proteo DNA poly 98.8 3.1E-08 7.8E-13 78.7 7.1 144 389-557 2-177 (246) 76 PRK05168 ribonuclease T; Provi 98.8 5.5E-08 1.4E-12 76.9 8.4 165 382-567 12-208 (213) 77 PRK08858 consensus 98.7 1.4E-08 3.6E-13 81.1 5.3 148 388-557 11-184 (246) 78 KOG2520 consensus 98.7 8.1E-07 2.1E-11 68.3 14.1 27 192-218 63-92 (815) 79 TIGR01407 dinG_rel DnaQ family 98.7 3.3E-07 8.4E-12 71.2 11.5 148 389-562 2-168 (944) 80 PRK06859 consensus 98.7 1.9E-08 4.8E-13 80.2 4.5 122 387-513 5-151 (253) 81 PRK07982 consensus 98.7 3.4E-08 8.8E-13 78.3 5.6 148 388-557 7-179 (243) 82 PRK07247 DNA polymerase III su 98.7 4.8E-08 1.2E-12 77.2 6.2 155 387-567 5-176 (195) 83 smart00479 EXOIII exonuclease 98.7 4.6E-08 1.2E-12 77.4 5.8 150 389-561 2-168 (169) 84 cd06134 RNaseT RNase T is a DE 98.7 7.7E-08 2E-12 75.8 6.8 151 388-559 6-188 (189) 85 PRK06063 DNA polymerase III su 98.7 4.5E-08 1.1E-12 77.5 5.6 148 387-556 15-175 (313) 86 PRK09146 DNA polymerase III su 98.6 9.6E-08 2.4E-12 75.1 7.1 153 384-558 44-225 (239) 87 cd06127 DEDDh DEDDh exonucleas 98.6 1.2E-07 3.1E-12 74.3 7.2 142 390-553 1-158 (159) 88 cd06130 DNA_pol_III_epsilon_li 98.6 8.6E-08 2.2E-12 75.4 5.7 138 389-553 1-154 (156) 89 cd06136 TREX1_2 Three prime re 98.5 2.1E-07 5.4E-12 72.6 5.4 142 389-554 1-175 (177) 90 pfam00929 Exonuc_X-T Exonuclea 98.4 3.9E-07 9.9E-12 70.6 5.8 144 389-553 1-162 (162) 91 cd06125 DnaQ_like_exo The DnaQ 98.4 1.8E-07 4.6E-12 73.0 2.3 104 390-493 1-118 (130) 92 cd06144 REX4_like This subfami 98.3 1.3E-06 3.4E-11 66.7 4.7 138 390-554 1-152 (152) 93 PRK09182 DNA polymerase III su 98.2 9.4E-06 2.4E-10 60.5 7.8 172 389-591 39-228 (293) 94 cd06138 ExoI_N This subfamily 98.2 4.9E-06 1.3E-10 62.6 6.0 144 390-553 1-182 (183) 95 PRK07983 exodeoxyribonuclease 98.2 6.1E-06 1.5E-10 61.9 6.4 85 453-558 68-152 (219) 96 COG0847 DnaQ DNA polymerase II 98.1 3E-05 7.7E-10 56.9 8.4 153 387-559 13-181 (243) 97 cd06133 ERI-1_3'hExo_like This 98.0 1.8E-05 4.5E-10 58.6 6.6 149 389-556 1-175 (176) 98 cd05780 DNA_polB_Kod1_like_exo 97.9 2.9E-05 7.3E-10 57.0 6.3 158 387-556 3-195 (195) 99 cd05781 DNA_polB_B3_exo The 3' 97.8 8.7E-05 2.2E-09 53.5 6.5 149 387-556 3-188 (188) 100 cd05779 DNA_polB_epsilon_exo T 97.8 5.7E-05 1.5E-09 54.8 5.5 162 389-553 4-203 (204) 101 PRK05762 DNA polymerase II; Re 97.7 0.00012 3.1E-09 52.4 6.0 156 387-556 155-346 (785) 102 KOG0950 consensus 97.6 1.6E-05 4E-10 58.9 1.2 266 613-937 715-990 (1008) 103 PTZ00166 DNA polymerase delta 97.6 0.00054 1.4E-08 47.7 8.6 206 385-598 288-545 (1081) 104 cd05160 DNA_polB_exo The 3'-5' 97.6 0.00014 3.6E-09 51.9 5.6 162 390-554 2-200 (200) 105 cd05785 DNA_polB_like2_exo A s 97.6 0.00017 4.4E-09 51.4 5.9 149 385-553 7-206 (207) 106 PRK00558 uvrC excinuclease ABC 97.5 0.0008 2.1E-08 46.5 8.7 32 805-836 573-607 (609) 107 cd05777 DNA_polB_delta_exo The 97.5 0.00015 3.9E-09 51.7 4.6 168 385-558 5-225 (230) 108 cd06137 DEDDh_RNase This group 97.5 0.00021 5.4E-09 50.7 5.0 141 390-553 1-160 (161) 109 pfam05991 DUF901 Protein of un 97.4 0.00058 1.5E-08 47.5 6.5 108 8-149 1-111 (166) 110 cd05784 DNA_polB_II_exo The 3' 97.4 0.00029 7.4E-09 49.7 4.9 156 387-554 3-193 (193) 111 TIGR00573 dnaq exonuclease, DN 97.4 0.00032 8.2E-09 49.3 5.1 155 385-560 5-179 (228) 112 PRK13766 Hef nuclease; Provisi 97.3 0.00033 8.4E-09 49.3 4.1 74 61-142 32-117 (764) 113 cd06149 ISG20 Interferon (IFN) 97.3 0.00044 1.1E-08 48.4 4.7 90 444-554 62-157 (157) 114 cd05782 DNA_polB_like1_exo A s 97.2 0.00087 2.2E-08 46.2 5.3 141 395-554 42-207 (208) 115 cd06143 PAN2_exo PAN2 is the c 97.1 0.002 5.1E-08 43.5 6.9 87 446-554 88-174 (174) 116 cd06145 REX1_like This subfami 96.8 0.0026 6.7E-08 42.7 5.4 90 445-554 61-150 (150) 117 cd05783 DNA_polB_B1_exo The 3' 96.8 0.0019 5E-08 43.7 4.4 101 446-553 73-201 (204) 118 cd06135 Orn Oligoribonuclease 96.7 0.003 7.6E-08 42.3 4.6 26 391-416 3-28 (173) 119 pfam00940 RNA_pol DNA-dependen 96.6 0.023 6E-07 35.8 8.9 145 798-952 156-344 (391) 120 COG3359 Predicted exonuclease 96.3 0.018 4.6E-07 36.6 6.9 157 388-562 99-271 (278) 121 KOG1798 consensus 96.3 0.0049 1.3E-07 40.7 3.9 36 161-196 1100-1135(2173) 122 KOG0969 consensus 96.3 0.033 8.5E-07 34.7 8.0 118 447-574 339-499 (1066) 123 cd05776 DNA_polB_alpha_exo The 96.2 0.0089 2.3E-07 38.8 4.9 109 447-557 83-227 (234) 124 pfam00867 XPG_I XPG I-region. 96.1 0.029 7.3E-07 35.1 6.9 79 104-187 2-87 (89) 125 pfam09281 Taq-exonuc Taq polym 95.7 0.015 3.8E-07 37.2 3.9 68 470-560 70-138 (138) 126 KOG2249 consensus 95.6 0.034 8.7E-07 34.6 5.5 148 388-562 106-268 (280) 127 COG3688 Predicted RNA-binding 95.5 0.11 2.9E-06 30.8 8.1 106 5-143 3-110 (173) 128 COG0322 UvrC Nuclease subunit 95.4 0.23 5.9E-06 28.5 9.6 12 449-460 188-199 (581) 129 smart00486 POLBc DNA polymeras 95.3 0.062 1.6E-06 32.7 6.0 165 388-556 4-220 (471) 130 pfam01550 consensus 95.1 0.13 3.4E-06 30.3 7.3 84 92-179 169-252 (443) 131 PRK05601 DNA polymerase III su 95.1 0.049 1.3E-06 33.4 5.1 92 382-478 41-142 (377) 132 PRK11779 sbcB exonuclease I; P 95.1 0.084 2.1E-06 31.7 6.2 152 388-559 8-198 (477) 133 PRK07956 ligA NAD-dependent DN 94.9 0.019 4.7E-07 36.5 2.5 12 629-641 507-518 (668) 134 PRK08609 hypothetical protein; 94.7 0.038 9.8E-07 34.2 3.7 48 195-242 92-148 (570) 135 pfam10108 Exon_PolB Predicted 94.6 0.097 2.5E-06 31.3 5.5 110 445-556 36-171 (211) 136 TIGR01448 recD_rel helicase, R 94.3 0.03 7.7E-07 35.0 2.4 63 177-247 87-156 (769) 137 cd05778 DNA_polB_zeta_exo The 94.3 0.065 1.6E-06 32.6 4.1 54 500-553 175-230 (231) 138 PRK00116 ruvA Holliday junctio 94.0 0.17 4.4E-06 29.5 5.8 131 132-284 23-168 (198) 139 pfam04857 CAF1 CAF1 family rib 93.7 0.28 7.1E-06 27.9 6.4 52 460-511 139-209 (235) 140 pfam00752 XPG_N XPG N-terminal 93.6 0.19 4.9E-06 29.1 5.4 67 5-79 25-92 (100) 141 TIGR02236 recomb_radA DNA repa 93.5 0.058 1.5E-06 32.9 2.6 34 192-225 1-36 (333) 142 KOG2207 consensus 93.4 0.33 8.3E-06 27.4 6.3 175 362-559 386-584 (617) 143 smart00485 XPGN Xeroderma pigm 93.0 0.32 8.3E-06 27.5 5.7 59 5-71 24-83 (99) 144 PRK07748 sporulation inhibitor 92.8 0.62 1.6E-05 25.4 7.0 96 444-559 77-179 (205) 145 PHA00452 T3/T7-like RNA polyme 92.5 0.81 2.1E-05 24.6 9.2 142 798-952 581-772 (816) 146 COG1948 MUS81 ERCC4-type nucle 92.3 0.17 4.4E-06 29.5 3.7 27 195-221 186-212 (254) 147 TIGR00575 dnlj DNA ligase, NAD 92.3 0.086 2.2E-06 31.6 2.1 57 763-819 461-531 (706) 148 TIGR00084 ruvA Holliday juncti 92.0 0.17 4.3E-06 29.5 3.3 111 164-284 62-187 (217) 149 PRK10702 endonuclease III; Pro 91.9 0.2 5.2E-06 28.9 3.6 26 702-727 174-201 (211) 150 PRK06722 exonuclease; Provisio 91.7 0.18 4.5E-06 29.3 3.1 51 446-498 38-95 (242) 151 pfam10391 DNA_pol_lambd_f Fing 91.7 0.13 3.4E-06 30.3 2.5 32 194-225 5-37 (52) 152 smart00483 POLXc DNA polymeras 91.4 0.22 5.7E-06 28.6 3.4 31 192-222 90-121 (334) 153 cd00141 NT_POLXc Nucleotidyltr 91.4 0.2 5.1E-06 29.0 3.1 29 193-221 87-116 (307) 154 PRK05761 DNA polymerase I; Rev 91.3 1 2.6E-05 23.8 6.7 102 446-556 198-329 (777) 155 smart00484 XPGI Xeroderma pigm 91.2 0.77 2E-05 24.7 6.0 66 104-174 2-73 (73) 156 PRK13901 ruvA Holliday junctio 91.0 0.37 9.3E-06 27.1 4.2 121 132-284 25-159 (196) 157 COG0417 PolB DNA polymerase el 91.0 1.2 3E-05 23.4 7.7 162 386-555 153-349 (792) 158 KOG2248 consensus 90.9 0.26 6.7E-06 28.1 3.4 98 440-557 274-372 (380) 159 TIGR03491 RecB family nuclease 90.9 0.76 1.9E-05 24.8 5.7 151 388-556 285-457 (457) 160 COG0272 Lig NAD-dependent DNA 90.6 0.16 4.1E-06 29.7 2.0 19 369-387 86-104 (667) 161 pfam09293 RNaseH_C T4 RNase H, 89.4 0.58 1.5E-05 25.6 4.1 103 175-289 2-120 (122) 162 KOG1275 consensus 88.9 0.41 1E-05 26.7 3.1 22 238-259 245-266 (1118) 163 PRK05359 oligoribonuclease; Pr 88.7 0.54 1.4E-05 25.9 3.5 29 388-416 4-32 (181) 164 KOG4373 consensus 88.7 0.84 2.1E-05 24.4 4.5 104 449-553 172-281 (319) 165 TIGR00630 uvra excinuclease AB 87.4 0.8 2E-05 24.6 3.7 38 817-860 816-853 (956) 166 TIGR01298 RNaseT ribonuclease 86.8 0.47 1.2E-05 26.3 2.3 151 390-563 11-196 (201) 167 COG1796 POL4 DNA polymerase IV 86.6 0.56 1.4E-05 25.7 2.6 20 796-819 270-289 (326) 168 PRK13901 ruvA Holliday junctio 86.5 0.28 7.2E-06 27.9 1.0 36 174-211 92-127 (196) 169 PRK00116 ruvA Holliday junctio 85.5 0.37 9.4E-06 27.0 1.2 28 184-211 101-128 (198) 170 COG0177 Nth Predicted EndoIII- 85.5 0.9 2.3E-05 24.2 3.2 16 709-724 181-198 (211) 171 COG0632 RuvA Holliday junction 84.4 1 2.6E-05 23.8 3.1 49 165-222 54-103 (201) 172 TIGR01259 comE comEA protein; 82.9 1.1 2.9E-05 23.5 2.8 43 166-208 72-119 (124) 173 COG2925 SbcB Exonuclease I [DN 82.9 3.2 8.2E-05 20.2 6.4 113 387-499 9-150 (475) 174 PRK13636 cbiO cobalt transport 82.5 2.4 6.2E-05 21.1 4.4 15 822-836 265-279 (285) 175 KOG2841 consensus 81.9 0.99 2.5E-05 23.9 2.2 10 696-705 240-249 (254) 176 pfam00633 HHH Helix-hairpin-he 81.4 0.33 8.3E-06 27.4 -0.4 25 185-209 5-29 (30) 177 TIGR00615 recR recombination p 81.2 0.53 1.4E-05 25.9 0.6 19 190-208 11-29 (205) 178 smart00278 HhH1 Helix-hairpin- 80.6 0.94 2.4E-05 24.1 1.7 21 192-212 2-22 (26) 179 PRK13646 cbiO cobalt transport 80.3 3.5 8.8E-05 20.0 4.5 13 823-835 270-282 (286) 180 cd00431 cysteine_hydrolases Cy 79.9 4 0.0001 19.5 9.9 118 7-139 2-142 (161) 181 cd01702 Pol_eta Pol eta is mem 79.6 2.7 6.9E-05 20.7 3.8 102 786-912 237-339 (358) 182 PRK10017 putative pyruvyl tran 79.2 1.7 4.3E-05 22.2 2.6 105 731-857 297-411 (426) 183 pfam04599 Pox_G5 Poxvirus G5 p 79.2 4.2 0.00011 19.3 6.1 94 98-199 149-248 (426) 184 cd01014 nicotinamidase_related 79.1 4.2 0.00011 19.3 7.8 97 39-139 22-131 (155) 185 pfam03104 DNA_pol_B_exo DNA po 78.0 4.1 0.0001 19.4 4.3 91 386-478 85-185 (254) 186 pfam00857 Isochorismatase Isoc 77.9 4.6 0.00012 19.0 9.5 116 7-139 3-142 (172) 187 COG1949 Orn Oligoribonuclease 76.6 3.2 8.3E-05 20.2 3.5 30 387-416 6-35 (184) 188 KOG2304 consensus 75.9 2.2 5.6E-05 21.4 2.4 55 803-867 231-290 (298) 189 COG0353 RecR Recombinational D 75.8 1.3 3.3E-05 23.1 1.2 46 191-243 12-57 (198) 190 cd06167 LabA_like LabA_like pr 75.3 5.4 0.00014 18.6 9.2 52 98-149 57-117 (149) 191 PRK08306 dipicolinate synthase 75.0 5.4 0.00014 18.5 8.7 18 637-659 263-280 (296) 192 COG1029 FwdB Formylmethanofura 74.7 1.7 4.4E-05 22.1 1.6 49 762-813 203-255 (429) 193 pfam02371 Transposase_20 Trans 73.8 2 5.2E-05 21.6 1.8 31 192-223 3-33 (87) 194 KOG0917 consensus 73.7 5.8 0.00015 18.3 4.6 22 666-687 127-148 (338) 195 PRK13844 recombination protein 73.4 1.6 4.1E-05 22.4 1.2 19 191-209 15-33 (200) 196 PRK00254 ski2-like helicase; P 73.3 2.5 6.3E-05 21.0 2.2 12 941-952 687-698 (717) 197 PRK01216 DNA polymerase IV; Va 73.3 3.9 0.0001 19.5 3.2 12 196-207 183-194 (351) 198 PRK08097 ligB NAD-dependent DN 72.4 6.2 0.00016 18.1 5.1 13 371-383 105-117 (563) 199 COG4987 CydC ABC-type transpor 72.4 6.3 0.00016 18.1 4.3 10 464-473 80-89 (573) 200 TIGR01169 rplA_bact ribosomal 72.2 1.6 4.2E-05 22.3 1.0 40 693-740 163-214 (227) 201 PRK03352 DNA polymerase IV; Va 71.5 6.5 0.00017 17.9 5.4 68 843-921 267-335 (345) 202 cd01012 YcaC_related YcaC rela 71.5 6.5 0.00017 17.9 9.7 114 7-139 2-120 (157) 203 PRK13910 DNA glycosylase MutY; 71.4 4.9 0.00012 18.9 3.3 36 166-212 58-95 (290) 204 PRK00076 recR recombination pr 70.0 2.1 5.4E-05 21.5 1.2 17 192-208 12-28 (197) 205 KOG2534 consensus 68.8 3.2 8.1E-05 20.2 1.9 14 745-758 279-292 (353) 206 PRK02362 ski2-like helicase; P 68.7 7.4 0.00019 17.5 3.9 11 132-142 94-104 (736) 207 pfam01936 DUF88 Protein of unk 68.4 7.5 0.00019 17.5 5.4 52 98-149 49-107 (140) 208 pfam01927 DUF82 Protein of unk 67.6 7.8 0.0002 17.4 6.4 75 97-177 11-85 (146) 209 PRK13639 cbiO cobalt transport 67.5 7.8 0.0002 17.4 5.3 13 823-835 260-272 (275) 210 COG1936 Predicted nucleotide k 66.6 2.9 7.5E-05 20.5 1.3 20 193-212 5-24 (180) 211 KOG0687 consensus 66.3 8.2 0.00021 17.2 5.2 55 552-616 40-94 (393) 212 COG1569 Predicted nucleic acid 66.0 5.6 0.00014 18.4 2.7 31 136-172 107-137 (142) 213 pfam11798 IMS_HHH IMS family H 65.9 3.5 8.9E-05 19.9 1.6 20 192-212 13-32 (33) 214 KOG2144 consensus 64.5 1.5 3.8E-05 22.6 -0.5 54 684-740 193-252 (360) 215 PRK10880 adenine DNA glycosyla 64.0 8.6 0.00022 17.1 3.3 34 165-209 93-127 (350) 216 cd01701 Pol_zeta Pol_zeta, a m 63.7 9.2 0.00023 16.9 5.7 127 766-921 259-398 (405) 217 pfam07377 DUF1493 Protein of u 63.6 8 0.0002 17.3 3.1 24 816-839 31-54 (110) 218 pfam11977 RNase_Zc3h12a Zc3h12 61.3 10 0.00026 16.6 8.3 104 5-146 2-116 (155) 219 PRK02406 DNA polymerase IV; Va 61.1 9 0.00023 16.9 3.0 25 895-921 312-336 (355) 220 PRK13647 cbiO cobalt transport 60.7 10 0.00026 16.5 4.3 15 822-836 257-271 (273) 221 PRK10917 ATP-dependent DNA hel 60.5 6.4 0.00016 18.0 2.2 31 190-221 7-39 (677) 222 PRK07945 hypothetical protein; 59.6 10 0.00026 16.5 3.1 90 173-282 36-126 (335) 223 cd01013 isochorismatase Isocho 59.1 11 0.00028 16.3 5.0 42 98-139 133-174 (203) 224 KOG3416 consensus 58.3 7.8 0.0002 17.4 2.3 28 717-744 11-50 (134) 225 pfam10266 Strumpellin Heredita 58.1 2.6 6.6E-05 20.9 -0.2 44 42-94 157-200 (1080) 226 PRK13634 cbiO cobalt transport 57.2 12 0.0003 16.1 4.6 10 824-833 258-267 (276) 227 TIGR00706 SppA_dom signal pept 56.8 5.9 0.00015 18.2 1.5 59 633-701 124-196 (224) 228 TIGR03269 met_CoM_red_A2 methy 56.4 12 0.00031 16.0 3.9 12 925-936 503-514 (520) 229 KOG2213 consensus 56.3 8.8 0.00022 17.0 2.3 18 205-223 102-119 (460) 230 PRK03348 DNA polymerase IV; Pr 55.8 8.8 0.00022 17.0 2.2 37 175-218 168-204 (456) 231 COG2019 AdkA Archaeal adenylat 55.6 6.8 0.00017 17.8 1.6 48 193-241 9-57 (189) 232 PRK13641 cbiO cobalt transport 55.4 12 0.00032 15.9 4.5 13 823-835 268-280 (286) 233 PRK13507 formate--tetrahydrofo 55.2 9.9 0.00025 16.6 2.4 35 602-638 473-507 (587) 234 COG1555 ComEA DNA uptake prote 55.2 13 0.00032 15.9 3.4 11 667-677 109-119 (149) 235 TIGR00194 uvrC excinuclease AB 55.1 6.4 0.00016 18.0 1.4 10 834-843 586-595 (601) 236 PRK03839 putative kinase; Prov 55.0 5.7 0.00015 18.3 1.2 19 193-211 5-23 (180) 237 PRK06294 coproporphyrinogen II 54.9 13 0.00032 15.8 5.9 21 848-868 334-354 (374) 238 TIGR01084 mutY A/G-specific ad 54.8 13 0.00032 15.9 2.9 139 553-716 44-203 (297) 239 TIGR02093 P_ylase glycogen/sta 54.3 6.9 0.00018 17.8 1.5 26 627-661 483-509 (822) 240 COG1194 MutY A/G-specific DNA 54.1 12 0.00032 15.9 2.7 18 700-717 182-199 (342) 241 PRK13651 cobalt transporter AT 53.7 13 0.00034 15.7 3.8 13 824-836 284-296 (304) 242 PRK11440 hypothetical protein; 52.8 14 0.00035 15.6 5.2 42 98-139 112-153 (188) 243 COG2390 DeoR Transcriptional r 51.5 14 0.00036 15.5 5.1 12 854-865 239-250 (321) 244 PRK04301 radA DNA repair and r 51.4 14 0.00037 15.4 2.8 32 189-220 5-37 (318) 245 cd01011 nicotinamidase Nicotin 51.1 14 0.00037 15.4 8.6 43 97-139 128-170 (196) 246 COG4277 Predicted DNA-binding 50.8 7.9 0.0002 17.3 1.3 19 742-760 281-299 (404) 247 PRK09057 coproporphyrinogen II 50.3 15 0.00038 15.3 6.9 19 850-868 345-364 (381) 248 TIGR02607 antidote_HigA addict 49.7 14 0.00034 15.6 2.4 23 811-833 20-42 (81) 249 pfam07592 Transposase_36 Rhodo 48.9 16 0.0004 15.2 2.9 111 742-874 90-227 (311) 250 cd00245 Glm_e Coenzyme B12-dep 48.5 15 0.00038 15.3 2.5 79 777-860 278-360 (428) 251 pfam11731 Cdd1 Pathogenicity l 48.5 14 0.00035 15.6 2.2 13 196-208 17-29 (92) 252 TIGR03614 RutB pyrimidine util 48.2 16 0.00041 15.1 5.5 42 98-139 142-183 (226) 253 PRK08238 hypothetical protein; 47.9 9.4 0.00024 16.8 1.3 20 695-714 254-273 (481) 254 KOG2405 consensus 47.5 8.5 0.00022 17.1 1.1 47 450-496 101-148 (458) 255 COG1747 Uncharacterized N-term 47.3 16 0.00042 15.0 6.3 11 116-126 16-26 (711) 256 cd04891 ACT_AK-LysC-DapG-like_ 46.9 17 0.00042 15.0 5.7 44 894-945 13-61 (61) 257 COG1938 Archaeal enzymes of AT 46.9 17 0.00043 15.0 3.9 38 577-616 193-230 (244) 258 PRK13851 type IV secretion sys 46.8 13 0.00034 15.7 2.0 43 732-786 228-277 (343) 259 cd01015 CSHase N-carbamoylsarc 46.6 17 0.00043 14.9 5.4 44 96-139 103-146 (179) 260 cd01703 Pol_iota Pol iota is m 46.5 8.8 0.00022 17.0 1.0 67 785-868 262-329 (394) 261 COG1412 Uncharacterized protei 46.4 17 0.00043 14.9 5.6 54 104-162 74-128 (136) 262 TIGR02788 VirB11 P-type DNA tr 46.1 4.2 0.00011 19.3 -0.7 72 681-774 171-258 (328) 263 PRK13631 cbiO cobalt transport 46.0 17 0.00044 14.9 4.0 11 801-811 289-299 (320) 264 PRK01810 DNA polymerase IV; Va 45.1 18 0.00045 14.8 3.0 15 193-207 181-195 (410) 265 TIGR00758 UDG_fam4 uracil-DNA 44.9 18 0.00046 14.8 2.4 24 83-106 79-102 (185) 266 KOG1931 consensus 44.7 13 0.00034 15.7 1.7 76 694-774 836-916 (1156) 267 pfam02882 THF_DHG_CYH_C Tetrah 44.5 14 0.00037 15.4 1.9 47 123-169 51-99 (159) 268 TIGR02313 HpaI-NOT-DapA 2,4-di 43.6 17 0.00044 14.9 2.1 13 265-277 16-28 (294) 269 TIGR00761 argB acetylglutamate 43.4 19 0.00048 14.6 2.4 54 597-659 15-77 (254) 270 TIGR03556 photolyase_8HDF deox 43.3 19 0.00048 14.6 8.1 37 91-128 53-89 (471) 271 PRK09536 btuD corrinoid ABC tr 43.1 19 0.00048 14.6 4.7 11 813-823 300-310 (409) 272 KOG0577 consensus 42.9 17 0.00043 14.9 2.0 30 580-610 511-540 (948) 273 COG0178 UvrA Excinuclease ATPa 42.7 19 0.00049 14.5 4.3 39 816-860 777-815 (935) 274 PRK02515 psbU photosystem II c 42.2 17 0.00043 14.9 1.9 11 648-658 93-103 (144) 275 PRK05686 fliG flagellar motor 42.0 20 0.0005 14.4 3.4 19 655-673 121-140 (337) 276 COG1897 MetA Homoserine trans- 41.9 20 0.0005 14.4 2.7 75 134-217 19-95 (307) 277 PRK06761 hypothetical protein; 41.5 12 0.0003 16.1 1.0 18 193-210 7-24 (281) 278 PRK06582 coproporphyrinogen II 41.2 20 0.00051 14.4 7.4 17 849-866 355-371 (390) 279 KOG0304 consensus 41.2 20 0.00051 14.4 7.2 23 376-398 13-35 (239) 280 PRK07379 coproporphyrinogen II 41.1 20 0.00052 14.4 7.7 21 849-869 356-376 (399) 281 pfam09992 DUF2233 Predicted pe 40.8 19 0.00049 14.5 2.0 13 696-708 126-138 (212) 282 cd01700 Pol_V Pol V was discov 40.7 13 0.00032 15.9 1.1 74 842-921 263-338 (344) 283 COG5290 IkappaB kinase complex 40.6 6.3 0.00016 18.1 -0.5 12 929-940 1210-1221(1243) 284 TIGR02784 addA_alphas double-s 40.3 21 0.00053 14.3 2.6 91 29-147 110-203 (1190) 285 cd07922 CarBa CarBa is the A s 40.3 13 0.00033 15.8 1.1 52 757-817 9-60 (81) 286 COG2251 Predicted nuclease (Re 40.2 20 0.00051 14.4 2.0 16 470-485 142-157 (474) 287 TIGR02778 ligD_pol DNA polymer 40.1 21 0.00053 14.3 3.6 16 633-648 146-161 (251) 288 PRK01172 ski2-like helicase; P 39.8 21 0.00054 14.2 2.9 14 130-143 90-103 (674) 289 COG1335 PncA Amidases related 39.6 21 0.00054 14.2 5.2 45 96-140 122-166 (205) 290 TIGR02074 PBP_1a_fam penicilli 39.6 19 0.00049 14.5 1.9 22 499-520 112-133 (700) 291 cd03586 Pol_IV_kappa Pol_IV_ka 39.6 21 0.00054 14.2 2.3 27 607-633 120-148 (337) 292 COG0122 AlkA 3-methyladenine D 39.5 15 0.00038 15.3 1.3 17 580-597 174-190 (285) 293 TIGR01455 glmM phosphoglucosam 39.4 17 0.00043 15.0 1.5 19 761-779 418-436 (450) 294 TIGR01282 nifD nitrogenase mol 39.3 21 0.00055 14.2 5.6 16 754-769 236-251 (521) 295 TIGR00426 TIGR00426 competence 39.2 19 0.00047 14.6 1.7 17 193-209 19-36 (70) 296 PRK03858 DNA polymerase IV; Va 38.7 22 0.00056 14.1 2.8 20 192-212 174-193 (398) 297 PTZ00112 origin recognition co 38.3 11 0.00027 16.4 0.4 25 741-765 458-482 (650) 298 PRK00285 ihfA integration host 38.3 22 0.00057 14.1 3.0 22 816-837 12-33 (100) 299 PRK03103 DNA polymerase IV; Re 38.2 22 0.00057 14.0 2.1 15 193-207 184-198 (410) 300 COG1875 NYN ribonuclease and A 38.2 18 0.00045 14.8 1.5 34 167-200 219-257 (436) 301 PRK03695 vitamin B12-transport 38.1 22 0.00057 14.0 4.6 16 729-746 227-242 (245) 302 TIGR02061 aprA adenylylsulfate 37.6 19 0.00049 14.5 1.6 17 897-913 567-583 (651) 303 pfam04748 Polysacc_deac_2 Dive 37.5 8.5 0.00022 17.1 -0.2 49 815-865 134-182 (213) 304 PHA00666 putative protease 37.1 23 0.00059 13.9 5.7 14 274-287 2-15 (233) 305 PRK13506 formate--tetrahydrofo 37.0 15 0.00038 15.4 0.9 52 600-656 463-518 (577) 306 pfam09507 CDC27 DNA polymerase 36.8 23 0.0006 13.9 2.8 14 369-382 58-71 (418) 307 pfam01312 Bac_export_2 FlhB Hr 36.7 24 0.0006 13.9 4.7 38 720-765 252-292 (343) 308 PRK10982 galactose/methyl gala 35.2 25 0.00063 13.7 4.1 20 905-926 431-450 (491) 309 PRK02794 DNA polymerase IV; Pr 35.0 17 0.00044 14.8 1.0 63 841-912 295-358 (417) 310 cd04913 ACT_AKii-LysC-BS-like_ 34.9 25 0.00064 13.7 6.7 51 893-951 13-68 (75) 311 TIGR03631 bact_S13 30S ribosom 34.7 19 0.00049 14.5 1.3 15 196-210 20-34 (113) 312 KOG1921 consensus 34.6 14 0.00035 15.6 0.5 10 584-593 182-191 (286) 313 TIGR00922 nusG transcription t 34.3 24 0.00062 13.8 1.7 13 734-749 58-70 (193) 314 KOG1521 consensus 33.8 9.2 0.00023 16.9 -0.5 40 695-737 214-254 (338) 315 TIGR01862 N2-ase-Ialpha nitrog 33.4 16 0.00041 15.1 0.6 15 628-642 345-359 (510) 316 KOG2404 consensus 33.1 3.6 9.3E-05 19.8 -2.7 38 815-868 349-386 (477) 317 COG1031 Uncharacterized Fe-S o 32.8 17 0.00042 15.0 0.7 31 701-731 457-487 (560) 318 TIGR02005 PTS-IIBC-alpha PTS s 32.8 27 0.00069 13.5 1.8 22 267-288 83-104 (533) 319 PRK11145 pflA pyruvate formate 32.7 27 0.00069 13.4 7.5 29 629-660 205-233 (246) 320 TIGR01057 topA_arch DNA topois 32.6 27 0.00069 13.4 4.6 101 609-720 287-393 (637) 321 PRK06746 peptide chain release 32.5 27 0.00069 13.4 2.5 71 201-286 10-80 (326) 322 PRK04040 adenylate kinase; Pro 32.5 21 0.00053 14.3 1.1 17 193-209 7-23 (189) 323 PRK11609 nicotinamidase/pyrazi 32.3 27 0.0007 13.4 5.0 44 96-139 132-175 (213) 324 PRK03968 DNA primase large sub 32.0 28 0.00071 13.4 1.9 24 202-225 12-35 (388) 325 TIGR00977 LeuA_rel 2-isopropyl 31.9 28 0.00071 13.3 4.1 95 697-804 232-369 (543) 326 KOG0523 consensus 31.7 16 0.00041 15.1 0.4 15 906-920 522-536 (632) 327 pfam03753 HHV6-IE Human herpes 31.6 26 0.00067 13.5 1.5 187 694-900 733-934 (993) 328 pfam04614 Pex19 Pex19 protein 31.5 26 0.00067 13.5 1.5 10 280-289 34-43 (245) 329 COG0776 HimA Bacterial nucleoi 31.5 28 0.00072 13.3 2.6 22 816-837 10-31 (94) 330 TIGR01418 PEP_synth phosphoeno 31.2 28 0.00073 13.3 2.3 42 821-864 793-837 (877) 331 PRK08561 rps15p 30S ribosomal 31.0 29 0.00073 13.3 2.3 35 606-643 30-64 (151) 332 TIGR00737 nifR3_yhdG putative 30.8 29 0.00074 13.2 2.8 10 807-816 234-243 (336) 333 cd03183 GST_C_Theta GST_C fami 30.6 25 0.00063 13.7 1.2 27 586-612 34-60 (126) 334 PRK10664 transcriptional regul 30.3 29 0.00075 13.2 3.2 22 816-837 9-30 (90) 335 pfam07367 FB_lectin Fungal fru 30.0 19 0.00048 14.6 0.5 45 698-745 90-134 (139) 336 pfam08972 DUF1902 Domain of un 30.0 30 0.00076 13.1 1.9 16 207-222 24-39 (54) 337 PRK13900 type IV secretion sys 29.9 16 0.00041 15.1 0.2 43 732-786 227-276 (332) 338 TIGR01703 hybrid_clust hydroxy 29.9 18 0.00046 14.7 0.5 83 741-835 450-549 (567) 339 cd01748 GATase1_IGP_Synthase T 29.8 30 0.00077 13.1 2.3 11 741-751 154-164 (198) 340 PRK12402 replication factor C 29.7 30 0.00075 13.2 1.5 21 190-210 33-58 (337) 341 COG1168 MalY Bifunctional PLP- 29.6 25 0.00063 13.7 1.1 67 737-836 313-381 (388) 342 TIGR01420 pilT_fam twitching m 29.6 13 0.00033 15.7 -0.3 10 634-643 109-119 (350) 343 PRK05179 rpsM 30S ribosomal pr 29.3 27 0.00068 13.5 1.2 12 197-208 23-34 (122) 344 pfam00809 Pterin_bind Pterin b 29.2 31 0.00078 13.0 1.9 15 595-609 140-154 (208) 345 pfam02022 Integrase_Zn Integra 29.1 31 0.00078 13.0 3.0 25 810-834 8-32 (40) 346 TIGR02624 rhamnu_1P_ald rhamnu 29.0 22 0.00056 14.1 0.8 34 588-621 35-70 (273) 347 pfam00735 Septin Septin. Membe 28.9 31 0.00079 13.0 5.3 38 470-509 92-131 (280) 348 PRK09490 metH B12-dependent me 28.7 31 0.0008 13.0 6.6 27 92-118 194-223 (1229) 349 KOG3218 consensus 28.7 29 0.00074 13.2 1.3 20 194-213 75-94 (208) 350 COG4332 Uncharacterized protei 28.3 11 0.00027 16.4 -0.9 14 538-551 47-60 (203) 351 pfam04396 consensus 28.3 32 0.00081 12.9 4.9 62 98-159 55-130 (149) 352 cd00477 FTHFS Formyltetrahydro 28.2 32 0.00081 12.9 2.2 35 602-638 428-462 (524) 353 TIGR01208 rmlA_long glucose-1- 28.2 22 0.00057 14.1 0.7 23 733-755 339-361 (361) 354 cd01080 NAD_bind_m-THF_DH_Cycl 28.1 32 0.00081 12.9 3.6 14 196-209 147-161 (168) 355 PRK07194 fliG flagellar motor 28.1 32 0.00082 12.9 4.4 21 654-674 117-138 (334) 356 PTZ00205 DNA polymerase kappa; 27.8 32 0.00082 12.9 3.0 41 472-515 321-362 (571) 357 TIGR01083 nth endonuclease III 27.7 22 0.00057 14.0 0.6 17 626-646 103-120 (192) 358 TIGR01286 nifK nitrogenase mol 27.6 33 0.00083 12.8 3.4 24 368-391 152-175 (526) 359 PRK01395 V-type ATP synthase s 27.6 18 0.00046 14.7 0.1 53 761-833 8-61 (104) 360 PRK03609 umuC DNA polymerase V 27.5 21 0.00053 14.3 0.4 17 842-858 267-283 (422) 361 COG2759 MIS1 Formyltetrahydrof 27.3 31 0.0008 13.0 1.3 36 602-639 442-477 (554) 362 pfam05503 Pox_G7 Poxvirus G7-l 27.3 33 0.00084 12.8 5.6 34 541-579 84-117 (363) 363 KOG0334 consensus 27.2 33 0.00084 12.8 8.5 10 836-845 904-913 (997) 364 PRK08599 coproporphyrinogen II 27.0 33 0.00085 12.8 7.0 19 849-867 339-357 (377) 365 COG3083 Predicted hydrolase of 27.0 33 0.00085 12.8 2.8 42 37-85 20-62 (600) 366 pfam03221 Transposase_Tc5 Tc5 26.8 34 0.00086 12.7 1.9 36 571-606 3-38 (62) 367 COG1737 RpiR Transcriptional r 26.8 34 0.00086 12.7 5.0 58 214-285 6-66 (281) 368 cd01966 Nitrogenase_NifN_1 Nit 26.7 34 0.00086 12.7 4.1 45 630-675 238-284 (417) 369 PRK06298 type III secretion sy 26.7 34 0.00086 12.7 4.8 25 736-760 259-286 (360) 370 TIGR01591 Fdh-alpha formate de 26.5 15 0.00039 15.3 -0.4 54 578-632 448-511 (694) 371 KOG0923 consensus 26.4 34 0.00087 12.7 4.0 47 587-635 478-526 (902) 372 KOG4302 consensus 26.4 34 0.00087 12.7 4.5 89 166-259 64-152 (660) 373 pfam01268 FTHFS Formate--tetra 26.3 34 0.00087 12.7 2.3 28 603-630 444-471 (555) 374 KOG4379 consensus 26.1 35 0.00088 12.7 2.3 18 206-223 182-199 (596) 375 TIGR01487 SPP-like SPP-like hy 26.0 31 0.00079 13.0 1.1 17 503-520 158-174 (223) 376 PRK00106 hypothetical protein; 25.6 35 0.00088 12.7 1.3 14 817-830 365-378 (535) 377 PRK13505 formate--tetrahydrofo 25.5 35 0.0009 12.6 2.2 29 603-631 445-473 (556) 378 cd00424 Pol_Y Y-family of DNA 25.4 30 0.00077 13.1 0.9 27 607-633 120-148 (341) 379 KOG0970 consensus 25.2 36 0.00091 12.6 4.2 97 458-556 621-751 (1429) 380 pfam09249 tRNA_NucTransf2 tRNA 25.2 36 0.00091 12.5 1.5 24 205-228 31-54 (114) 381 KOG1282 consensus 25.2 36 0.00091 12.5 2.4 119 402-560 122-244 (454) 382 KOG0276 consensus 25.2 34 0.00088 12.7 1.2 11 149-159 162-172 (794) 383 TIGR02877 spore_yhbH sporulati 25.1 36 0.00092 12.5 2.1 16 366-381 110-125 (392) 384 KOG3666 consensus 25.0 35 0.00089 12.6 1.2 61 550-613 602-665 (1141) 385 COG0505 CarA Carbamoylphosphat 24.7 29 0.00073 13.3 0.7 30 192-221 105-134 (368) 386 TIGR02397 dnaX_nterm DNA polym 24.7 37 0.00093 12.5 2.7 74 371-477 127-201 (363) 387 smart00411 BHL bacterial (prok 24.6 37 0.00094 12.5 3.0 22 816-837 9-30 (90) 388 PRK00024 radC DNA repair prote 24.6 31 0.00078 13.1 0.8 15 623-637 103-117 (224) 389 TIGR02506 NrdE_NrdA ribonucleo 24.3 37 0.00095 12.4 4.6 45 811-857 420-469 (685) 390 COG2805 PilT Tfp pilus assembl 24.0 38 0.00096 12.4 3.2 23 887-913 303-325 (353) 391 PTZ00126 tyrosyl-tRNA syntheta 23.9 28 0.00072 13.3 0.6 30 710-740 271-303 (399) 392 PRK07394 hypothetical protein; 23.9 38 0.00096 12.4 2.6 37 620-661 97-139 (342) 393 pfam00450 Peptidase_S10 Serine 23.4 39 0.00098 12.3 4.9 47 446-500 115-161 (415) 394 COG2256 MGS1 ATPase related to 23.3 16 0.00041 15.1 -0.8 60 189-250 49-110 (436) 395 KOG2379 consensus 23.2 39 0.00099 12.3 2.1 24 106-129 55-78 (501) 396 PRK09483 response regulator; P 23.2 39 0.00099 12.3 5.3 62 792-859 148-209 (216) 397 pfam06368 Met_asp_mut_E Methyl 23.2 39 0.00099 12.3 2.8 79 777-860 282-364 (441) 398 PRK12285 tryptophanyl-tRNA syn 23.0 31 0.0008 13.0 0.6 23 697-720 239-261 (369) 399 PRK13824 replication initiatio 22.7 26 0.00067 13.5 0.2 16 871-886 363-378 (404) 400 KOG2093 consensus 22.7 40 0.001 12.2 1.2 18 480-497 508-526 (1016) 401 PRK08156 surface presentation 22.6 40 0.001 12.2 3.6 16 737-752 254-269 (367) 402 TIGR01116 ATPase-IIA1_Ca calci 22.6 40 0.001 12.2 2.9 22 263-284 45-66 (800) 403 pfam09091 consensus 22.5 40 0.001 12.2 2.1 38 569-606 7-44 (64) 404 cd04780 HTH_MerR-like_sg5 Heli 22.4 40 0.001 12.2 2.4 28 633-660 19-47 (95) 405 PRK10070 glycine betaine trans 22.4 40 0.001 12.2 7.0 20 650-672 246-265 (400) 406 pfam03796 DnaB_C DnaB-like hel 22.4 40 0.001 12.2 5.9 14 568-581 52-65 (186) 407 pfam04318 DUF468 Protein of un 22.4 40 0.001 12.2 1.1 39 680-723 39-77 (84) 408 PRK04195 replication factor C 22.3 40 0.001 12.2 1.5 18 193-210 45-62 (403) 409 TIGR03624 conserved hypothetic 22.2 41 0.001 12.1 3.1 16 585-600 188-203 (345) 410 TIGR01279 DPOR_bchN light-inde 22.1 30 0.00075 13.2 0.3 79 551-632 298-386 (458) 411 PRK12704 phosphodiesterase; Pr 22.0 41 0.001 12.1 1.3 13 818-830 286-298 (455) 412 COG4619 ABC-type uncharacteriz 21.8 32 0.00081 12.9 0.5 30 591-620 109-138 (223) 413 smart00674 CENPB Putative DNA- 21.6 42 0.0011 12.1 2.2 38 569-606 4-41 (66) 414 TIGR01113 mtrE tetrahydrometha 21.6 42 0.0011 12.1 1.8 29 599-631 10-38 (298) 415 KOG1759 consensus 21.3 42 0.0011 12.0 1.3 18 574-591 1-18 (115) 416 pfam00416 Ribosomal_S13 Riboso 21.3 42 0.0011 12.0 1.5 10 635-644 33-42 (106) 417 TIGR02320 PEP_mutase phosphoen 21.2 42 0.0011 12.0 1.5 48 611-660 168-222 (272) 418 PRK05702 flhB flagellar biosyn 21.2 42 0.0011 12.0 4.7 40 719-766 256-298 (352) 419 PRK12721 secretion system appa 21.1 43 0.0011 12.0 2.4 15 738-752 260-274 (349) 420 PTZ00072 40S ribosomal protein 21.1 43 0.0011 12.0 2.2 35 606-643 27-61 (148) 421 PRK11886 biotin--protein ligas 21.0 43 0.0011 12.0 5.4 15 738-753 291-305 (319) 422 cd07361 MEMO_like Memo (mediat 21.0 43 0.0011 12.0 4.0 15 372-386 11-25 (266) 423 smart00670 PINc Large family o 21.0 43 0.0011 12.0 1.8 25 118-146 87-111 (111) 424 cd03008 TryX_like_RdCVF Trypar 21.0 37 0.00095 12.4 0.7 72 125-212 58-138 (146) 425 PRK00440 rfc replication facto 20.9 43 0.0011 12.0 1.6 16 195-210 44-59 (318) 426 PRK10563 6-phosphogluconate ph 20.8 34 0.00086 12.7 0.4 69 73-141 68-136 (221) 427 TIGR02176 pyruv_ox_red pyruvat 20.7 43 0.0011 11.9 3.0 70 827-925 223-299 (1194) 428 TIGR03333 salvage_mtnX 2-hydro 20.7 43 0.0011 11.9 4.1 71 69-145 24-99 (214) 429 pfam00216 Bac_DNA_binding Bact 20.6 44 0.0011 11.9 3.0 21 817-837 10-30 (90) 430 pfam05918 API5 Apoptosis inhib 20.5 44 0.0011 11.9 2.4 17 828-844 42-58 (543) 431 pfam04025 DUF370 Domain of unk 20.4 44 0.0011 11.9 2.0 20 729-748 44-63 (73) 432 CHL00137 rps13 ribosomal prote 20.4 44 0.0011 11.9 1.4 20 193-212 19-38 (122) 433 KOG1968 consensus 20.4 40 0.001 12.2 0.7 14 192-206 361-374 (871) 434 KOG2875 consensus 20.3 44 0.0011 11.9 1.0 13 196-208 223-235 (323) 435 COG1415 Uncharacterized conser 20.2 44 0.0011 11.9 2.8 34 163-207 261-294 (373) 436 cd00635 PLPDE_III_YBL036c_like 20.2 44 0.0011 11.9 2.0 51 171-221 131-184 (222) 437 TIGR01362 KDO8P_synth 3-deoxy- 20.1 45 0.0011 11.9 2.1 30 81-110 45-78 (279) 438 PRK13331 pantothenate kinase; 20.0 45 0.0011 11.8 1.1 38 27-64 175-212 (251) 439 TIGR00375 TIGR00375 conserved 20.0 44 0.0011 11.9 0.9 42 196-237 311-367 (384) No 1 >PRK07556 consensus Probab=100.00 E-value=0 Score=2406.90 Aligned_cols=965 Identities=50% Similarity=0.857 Sum_probs=892.5 Q ss_pred CCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHH Q ss_conf 98897289996626999887426754468988550069999999999999848554798799997274887132408652 Q gi|254780277|r 1 MQKENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDY 80 (976) Q Consensus 1 m~~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~Y 80 (976) |+++++|+|||||+|+||||||+|+++++++|+||||||||+|||+++|++++++..|+|++||||.++||||||+||+| T Consensus 4 m~~~~~L~LIDG~sl~fRAyyAlp~l~~~s~G~pTnAvyGF~~mL~kll~~~~~~~~p~~iavaFD~~~~TFRhe~y~~Y 83 (977) T PRK07556 4 MGKGDHLFLVDGSGYIFRAYHALPPLTRKSDGLPVGAVSGFCNMLWKLLRDARNTDVPTHLAVIFDYSSKTFRNEIYPQY 83 (977) T ss_pred CCCCCEEEEEECCHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCHHHHHHH T ss_conf 89999289983637999998179976779999471369999999999998623567998899998088996012220565 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC Q ss_conf 16679879758864899999999709979956890166589999999984899499986870121223797599966788 Q gi|254780277|r 81 KANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE 160 (976) Q Consensus 81 Ka~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~ 160 (976) ||||++|||+|.+|||+|++++++||||+++.+||||||||||||++++++|++|+|+|||||++|||+++|++|+|+++ T Consensus 84 KA~R~~~PedL~~Qip~i~e~l~algIp~l~~~GyEADDiIgTlA~~~~~~g~~v~IvSgDKDl~QLV~d~V~i~~~~k~ 163 (977) T PRK07556 84 KANRPAPPEDLIPQFPLIREATRAFNLPCIEKEGFEADDLIATYARLAREAGGDVTIISSDKDLMQLVGDGVSMYDPMKD 163 (977) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHCCCCEEEEECCCC T ss_conf 27999998899999999999999879988546895489999999999997799299981898675627798699978898 Q ss_pred CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 35389999888489887877778751874358998776360249989841367899986454200112232200357899 Q gi|254780277|r 161 EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETA 240 (976) Q Consensus 161 ~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a 240 (976) +.+|+++|.|||||+|+||+||+||+||+|||||||||||||||+|||++|||||+||+|+|+|+++|++++|++++|+| T Consensus 164 ~~~d~~~V~eK~GV~P~qv~D~lAL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~Iy~n~deIk~~K~ke~L~~~~d~A 243 (977) T PRK07556 164 KRIGIPEVIEKFGVPPEKVVDVQALAGDSVDNVPGIPGIGIKTAAQLINEYGDLDTLLARAGEIKQPKRRETLIENAEQA 243 (977) T ss_pred CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHCHHHH T ss_conf 48179999998696989999999971876467898998548999999997578999999886502102778998579999 Q ss_pred HHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 97655432035642247635521267676889986532336899987653114454433421112343322223332222 Q gi|254780277|r 241 RLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAIETEI 320 (976) Q Consensus 241 ~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 320 (976) ++||+||||++|||++++++++++.++|.+.|.++|++|||++|++|+......+.+..++............... . T Consensus 244 ~LSkeLaTI~~Dvpl~~~le~l~~~~~d~~~L~~~f~eLEF~sLlkrl~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~ 320 (977) T PRK07556 244 RISRQLVLLDTDVPLDVPLDDLAVREPDGPKLIAFLKAMEFTTLTRRVAEATDTDAPDIEPAAVEVEWGADAHGPD---L 320 (977) T ss_pred HHHHHHHHHHHCCCCCCCHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCHHCCCCCCCCCCCC---C T ss_conf 9989986766279899998991458999999999999848277999998752344321110000000011123333---2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 22233332222233332112232112001121100001235556665035479999999999998668523233222433 Q gi|254780277|r 321 DRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIITDTMD 400 (976) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld 400 (976) ..........................|..............+.....|.+|.+.+.+..|++++...+.++||+||+++| T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Y~~I~~~e~L~~li~~l~~~~~~a~DtETtsld 400 (977) T PRK07556 321 DVPAAADAAAGDLSSAAPSAATPPPTPAALAAARAEAAEAQPIDRSAYVTIRDLATLKAWIARARETGIVAFDTETTSLD 400 (977) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHHHHHCCEEEEEEECCCCC T ss_conf 11112211222211123321234456034444323343115555443167388999999999987679589997318878 Q ss_pred CCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCC------CCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHH Q ss_conf 11001244201025776654434550231374433345------650017889999744442022210011003212378 Q gi|254780277|r 401 AFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQD------STGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKL 474 (976) Q Consensus 401 ~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d------~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~ 474 (976) ++.++++|+|+|+.++. ++|+|+.|..... ....+...+++++.|+++|+++.+.|||||+|||++ T Consensus 401 ~~~a~lVGisls~~~~~--------a~YIPl~h~~~~~~~~~~~~~~~ql~~~~vL~~LkplLed~~i~KighnlK~d~~ 472 (977) T PRK07556 401 PMQAELVGFSLALAPGR--------ACYIPLGHKSGDGDLFGGGLAEGQIPLRDALAALKPVLEDPSVLKIGQNMKYDAL 472 (977) T ss_pred HHHCCEEEEEEEECCCC--------EEEEECCCCCCCCCHHCCCCCHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 30155357999806897--------8999614445642011022223230599999998988538731144412568899 Q ss_pred HHHHCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCC---CCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHH Q ss_conf 898618866642100234554316876677246677622444---22335654322210000001233332221111124 Q gi|254780277|r 475 VLHRYGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYT---RKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQ 551 (976) Q Consensus 475 ~L~~~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~---~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~ 551 (976) +|.++||.+.+++|||||||||||+.++|+|++|+.+||++. .+++.|+++++++|+.++.+.+..|||+||++|++ T Consensus 473 vL~~~gi~l~~i~DtmLaaYLLdp~~~~~~Ld~La~~yL~~~~~~~~dl~g~gk~~~~f~~v~~~~~~~YaaedA~~t~~ 552 (977) T PRK07556 473 VLARHGITVAPFDDTMLISYALDAGRGGHGMDALSERWLGHTPIAYKDLTGSGKSAITFDQVDIDRATAYAAEDADVTLR 552 (977) T ss_pred HHHHCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99976960268531888888834886647879999996488762100003455010476552068999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 56776322210001345788607501233221014310058999986434368999999999740345577776566679 Q gi|254780277|r 552 LWLLLRPKLIVEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGD 631 (976) Q Consensus 552 L~~~l~~~L~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~ 631 (976) ||+.|+++|++.+++.+|++|||||++||++||.+||.||.+.|++++.++..++.+++++||++||++|||||||||++ T Consensus 553 L~~~L~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~vD~~~L~~l~~~l~~~l~~le~~I~~~aG~eFNi~SpkQL~~ 632 (977) T PRK07556 553 LWQVLKPRLAAEGLTTVYERLERPLVPVLARMEERGISVDRQVLSRLSGEFAQKAARLEAEIYELAGEPFNIGSPKQLGD 632 (977) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH T ss_conf 99999999984235789999997899999999856969869999999999999999999999987089878788999999 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 98742078875556654535317887765304762799999998788898865320011110012333453300222100 Q gi|254780277|r 632 ILFTKLKFPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTG 711 (976) Q Consensus 632 vLf~~LgLp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TG 711 (976) |||++||||++||||||+||||++|||+|..+.||+++.||+||+++||+|||+|+|+.+|++.||||||+|||++|+|| T Consensus 633 iLF~~l~lp~~kktKtG~~STd~~vLe~L~~~~h~i~~~iL~yR~l~KL~sTY~d~L~~~i~~~~gRiHt~fnq~~t~TG 712 (977) T PRK07556 633 ILFGKMGLPGGSKTKTGQWSTSAQVLEDLAAEGHELPRKILDWRQLSKLKSTYTDALPGYIHPQTGRVHTSYALAATTTG 712 (977) T ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCCHHHHCEECC T ss_conf 99986089888868889979709999998745785689999998899999999999886237898807773023231057 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHH Q ss_conf 00013334333455633222243024248994899963265443687665103777873406988899999898099968 Q gi|254780277|r 712 RLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIE 791 (976) Q Consensus 712 RlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~ 791 (976) ||||++|||||||+|+++||+||+||+|++||+|++|||||||||||||+|+|+.|++||++|+|||+.||+.|||+|.+ T Consensus 713 RlSS~~PNLQNIPir~e~G~~iR~aFv~~~g~~l~saDYSQIELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~~~~ 792 (977) T PRK07556 713 RLSSSDPNLQNIPVRTEEGRKIRTAFIATPGHKLVSADYSQIELRILAHIADIPALKQAFADGLDIHAMTASEMFGVPVE 792 (977) T ss_pred CCCCCCCCCCCCCCCCCCCCHHHEEECCCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHCCCCCHHHHHHHHHHCCCHH T ss_conf 64569986467888763230323216049998799833668899999998178899987635987789999999599935 Q ss_pred HCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCC Q ss_conf 89988886302513564407770356788199989999999999998888999999999999984988827787601898 Q gi|254780277|r 792 KVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDE 871 (976) Q Consensus 792 ~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~ 871 (976) +||++||+.||+|||||+||||+||||++||||++||++||++||++||+|+.||++++++|+++|||+|++|||||+|+ T Consensus 793 ~v~~~~R~~aK~iNfgi~YG~~~~gL~~~l~i~~~eA~~~i~~yf~~y~~v~~~~~~~~~~a~~~gyv~Tl~gRrr~~p~ 872 (977) T PRK07556 793 GMPSEVRRRAKAINFGIIYGISAFGLANQLGIPREEAGAYIKRYFERFPGIRAYMDATKAFAREHGYVETLFGRRIHYPD 872 (977) T ss_pred HCCHHHHHHHHEEEEHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCCCEECCCC T ss_conf 48999986526042034556687779987198999999999999986803999999999999977977888787032875 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHC Q ss_conf 75883788877878770134565389999999999999998659981899972638999559879999999999996510 Q gi|254780277|r 872 INSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKA 951 (976) Q Consensus 872 i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~ 951 (976) |+|.|.+.|++|||+|+|||||||||||||+||++|+++|+..+++++||||||||||||||++++++++.+|+++||+| T Consensus 873 i~s~n~~~r~~~eR~a~N~piQGtaAdiiK~ami~~~~~l~~~~~~~~milqvHDEli~e~~~~e~e~~~~~v~~~Me~a 952 (977) T PRK07556 873 IKASNPSVRAFNERAAINAPIQGTAADIIRRAMIRMEDALAEEKLSARMLLQVHDELIFEVPEDEVEATIPVVRHVMENA 952 (977) T ss_pred CCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCC T ss_conf 56899889979999982776435599999999999999998369886289997727789646889999999999997367 Q ss_pred CCCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 1357444464388833208937819 Q gi|254780277|r 952 CLPKINLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 952 ~~~~~~l~VPl~v~~~~G~~W~e~~ 976 (976) +.+.++|+|||+|++++|+||+||| T Consensus 953 ~~~~~~l~VPL~v~~~~G~nW~eah 977 (977) T PRK07556 953 AMPAVSLSVPLTVDARAADNWDEAH 977 (977) T ss_pred CCCCCEECCCEEEECCCCCCHHHHC T ss_conf 7678031156686548186978819 No 2 >PRK07997 consensus Probab=100.00 E-value=0 Score=2352.31 Aligned_cols=914 Identities=39% Similarity=0.646 Sum_probs=858.2 Q ss_pred CCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHH Q ss_conf 98897289996626999887426754468988550069999999999999848554798799997274887132408652 Q gi|254780277|r 1 MQKENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDY 80 (976) Q Consensus 1 m~~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~Y 80 (976) |. +++|+|||||+++||||||+|+++ +++|+||||||||+|||+++|++++|+ |++||||.+++|||||+||+| T Consensus 4 ~~-~~~l~LIDg~slifRAfyAl~~l~-~s~G~pTnAi~GF~~mL~kli~~~~P~----~iavaFD~~~~TFR~e~y~~Y 77 (928) T PRK07997 4 IP-ENPLILVDGSSYLYRAYHAFPPLT-NSAGEPTGAMYGVLNMLRSLIMQYKPT----HAAVVFDAKGKTFRDELFEHY 77 (928) T ss_pred CC-CCCEEEEECCHHHHHHHCCCCCCC-CCCCCEECHHHHHHHHHHHHHHHCCCC----EEEEEEECCCCCCCHHHHHHH T ss_conf 99-997799967579999980788677-998966207999999999999864998----799998088996034330565 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC Q ss_conf 16679879758864899999999709979956890166589999999984899499986870121223797599966788 Q gi|254780277|r 81 KANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE 160 (976) Q Consensus 81 Ka~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~ 160 (976) ||||++|||||.+|||+|++++++||||+++.+||||||||||||++++++|++|+|+|||||++|||+++|+||+|+++ T Consensus 78 KA~R~~~PedL~~Qip~i~e~l~a~gIp~~~~~GyEADDiIgTla~~~~~~g~~v~IvSgDKDl~QLV~~~v~v~~~~~~ 157 (928) T PRK07997 78 KSHRPPMPDDLRAQIEPLHAMVKAMGLPLLAVSGVEADDVIGTLAREAEKAGRPVLISTGDKDMAQLVTPNITLINTMTN 157 (928) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCCCEEEEECCCC T ss_conf 28999998789999999999999879988555892499999999999996799599983898765537898599988898 Q ss_pred CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHH------HHHHHHHC Q ss_conf 353899998884898878777787518743589987763602499898413678999864542001------12232200 Q gi|254780277|r 161 EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQ------KKRRENIL 234 (976) Q Consensus 161 ~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~------~k~~~~l~ 234 (976) ..+|++.|.|||||+|+||+||+||+||+|||||||||||||||+|||++|||||+||+|+|+|++ +|++++|+ T Consensus 158 ~~~~~~~V~ek~GV~P~qv~D~laL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~I~~n~d~i~g~~~r~~~kl~e~L~ 237 (928) T PRK07997 158 TILGPEEVVNKYGVPPELIIDFLALMGDSSDNIPGVPGVGEKTAQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLE 237 (928) T ss_pred EEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 48779999998797989889999972886568899997548999999998277999998098604542211778999999 Q ss_pred CCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 35789997655432035642247635521267676889986532336899987653114454433421112343322223 Q gi|254780277|r 235 EYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGI 314 (976) Q Consensus 235 ~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~ 314 (976) +++|+|++||+||||++|||++++++++++.++|.++|.++|++|||++|++++............+...... T Consensus 238 ~~ke~A~LSkeLvTI~~Dvpl~~~l~~l~~~~~d~e~L~~lf~elEFksll~~l~~~~~~~~~~~~~~~~~~~------- 310 (928) T PRK07997 238 QNKEVAYLSYQLATIKTDVELELTCEQLEVQQPDAEELLGLFKKYEFKRWTADVEAGKWLQAKGAKPAAKPQE------- 310 (928) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCHHHHCCCCCCHHHHHHHHHHCCCHHHHHHHHCCCCHHCCCCCCCCCCCC------- T ss_conf 7299999999874651366778998994548999999999999808778999873442000024565566655------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCC Q ss_conf 33222222233332222233332112232112001121100001235556665035479999999999998668523233 Q gi|254780277|r 315 AIETEIDRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKI 394 (976) Q Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~Dt 394 (976) .... +..............|..|.+.+++..|++++...+.++||+ T Consensus 311 -------------------------~~~~---------~~~~~~~~~~~~~~~y~~i~~~~~l~~~~~~l~~~~~~a~Dt 356 (928) T PRK07997 311 -------------------------TVVA---------DEAPEVTAAVLSYDNYVTILDEETLKAWIEKLKKAPVFAFDT 356 (928) T ss_pred -------------------------CCCC---------CCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHCCCEEEEEE T ss_conf -------------------------4443---------445321223357555078478999999999864298699995 Q ss_pred CCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHH Q ss_conf 22243311001244201025776654434550231374433345650017889999744442022210011003212378 Q gi|254780277|r 395 ITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKL 474 (976) Q Consensus 395 ETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~ 474 (976) ||+++|++.++++|+|+|+.++. ++|+|+.|.... ...+...+.+++.|+++|+++++.|||||+|||++ T Consensus 357 Et~sld~~~a~lvGiS~s~~~g~--------a~YiPl~h~~l~--~~~ql~~~~vl~~LkplLed~~i~KIghnlK~d~~ 426 (928) T PRK07997 357 ETDSLDNISANLVGLSFAIEPGV--------AAYLPVAHDYLD--APDQISRERVLELLKPLLEDEKALKVGQNLKFDRG 426 (928) T ss_pred ECCCCCCCCCCCCCEEEEECCCC--------EEEEECCCCCCC--CHHHCCHHHHHHHHHHHHCCCCCCHHCCHHHHHHH T ss_conf 12786600024443389862774--------687402212347--60012499999998987528653032121688999 Q ss_pred HHHHCCCCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCC---CCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHH Q ss_conf 898618866642-100234554316876677246677622444---2233565432221000000123333222111112 Q gi|254780277|r 475 VLHRYGISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYT---RKEILKSRKSSIPIDKISDSQVQEHAIENSNVIL 550 (976) Q Consensus 475 ~L~~~GI~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~---~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~ 550 (976) +|.++||.+.++ ||||||||||||+.++|+|++|+.+||++. .+++.|+++++++|+.++.+.+..|||+||++|+ T Consensus 427 vL~~~GI~l~gi~fDTmLAaYLLnp~~~~~~Ld~La~~yL~~~~i~~~el~gkgk~q~~f~~v~~~~~~~Ya~edA~~t~ 506 (928) T PRK07997 427 ILANYGIELRGIAFDTMLESYILNSVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYAAEDADVTL 506 (928) T ss_pred HHHHCCCEECCCCCCHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 99977941056430345532222443344788999999713466433543244645477435359999999999999999 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHH Q ss_conf 456776322210-0013457886075012332210143100589999864343689999999997403455777765666 Q gi|254780277|r 551 QLWLLLRPKLIV-EKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQL 629 (976) Q Consensus 551 ~L~~~l~~~L~~-~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL 629 (976) +||..|+++|++ ++++++|++|||||++||++||.+||+||.+.|++++.+++.++++++++||++||++||||||||| T Consensus 507 ~L~~~L~~~L~~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~~le~~I~~~aG~eFNinSpKQL 586 (928) T PRK07997 507 QLHLKMWPKLQQHKGPLNVFENIEMPLVPVLSRIERNGVKIDPAVLHAHSEELTLRLAELEKKAHEIAGEPFNLSSTKQL 586 (928) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHH T ss_conf 99999999986214499999999989999999998539599899999999999999999999999862998887899999 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 79987420788755566545353178877653047627999999987888988653200111100123334533002221 Q gi|254780277|r 630 GDILFTKLKFPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTM 709 (976) Q Consensus 630 ~~vLf~~LgLp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~ 709 (976) |+|||++||||++||||||.||||++|||+| +..||+++.||+||+++||+|||+|+|+.+|++.||||||+|||++|+ T Consensus 587 geILFekLgLp~~KKTktG~~ST~~~vLe~L-~~~h~i~~~il~yR~l~KL~sTY~d~L~~~i~~~~gRiHt~f~Q~~t~ 665 (928) T PRK07997 587 QTILFEKQGIKPLKKTPGGAPSTSEEVLEEL-ALDYPLPKVILEYRGLAKLKSTYTDKLPLMINPKTGRVHTSYHQAVTA 665 (928) T ss_pred HHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH-HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEHCCCCC T ss_conf 9998986299988858899968469999972-025750899999989999899889888975386668782030111021 Q ss_pred EECCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCC Q ss_conf 00000133343334556332222430242489948999632654436876651037778734069888999998980999 Q gi|254780277|r 710 TGRLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVN 789 (976) Q Consensus 710 TGRlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~ 789 (976) ||||||++|||||||+|+++||+||+||+|+|||+|++|||||||||||||+|+|+.|++||++|+|||+.||+.|||+| T Consensus 666 TGRlSS~~PNLQNIPir~~~G~~iR~aFi~~~g~~l~saDYSQiELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~~ 745 (928) T PRK07997 666 TGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYVIVSADYSQIELRIMAHLSRDKGLLTAFAEGKDIHRATAAEVFGLP 745 (928) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCCCHHHHHHHHCCC T ss_conf 25112589986758888622578998743789987997056289999999997687688775168883401179882999 Q ss_pred HHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECC Q ss_conf 68899888863025135644077703567881999899999999999988889999999999999849888277876018 Q gi|254780277|r 790 IEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHY 869 (976) Q Consensus 790 ~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~ 869 (976) .++||++||+.||+|||||+||||+||||++||||++||++||++||++||+|+.||++++++|+++|||+|++|||||+ T Consensus 746 ~~~v~~~~R~~aK~vnfgi~YG~s~~gLa~~l~i~~~eA~~~i~~yf~~~~~v~~~~~~~~~~a~~~gyv~Tl~gRrr~~ 825 (928) T PRK07997 746 LEKVTSEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLEYMERTRAQAKEQGYVETLDGRRLYL 825 (928) T ss_pred HHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCCCEECC T ss_conf 46579889988687776646626877899870999999999999998868339999999999999779478888961458 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHH Q ss_conf 98758837888778787701345653899999999999999986599818999726389995598799999999999965 Q gi|254780277|r 870 DEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSME 949 (976) Q Consensus 870 p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me 949 (976) |+|+|.|++.|++|||+|+|||||||||||||+||++++++++..+++++||||||||||||||++++++++++|+++|| T Consensus 826 ~~i~s~n~~~r~~~eR~a~N~piQgtaADiik~Ami~~~~~l~~~~~~~~~~lqvHDEli~e~~~~~~~~~~~~~~~~Me 905 (928) T PRK07997 826 PDIKSSNAARRAAAERAAINAPMQGTAADIIKRAMIAVDAWLQAEQPRVRMIMQVHDELVFEVHKDDLDAVAKKIHQLME 905 (928) T ss_pred CCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEEECHHHHHHHHHHHHHHHH T ss_conf 87678998899689999958113378999999999999999984488864897356067884268899999999999972 Q ss_pred HCCCCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 101357444464388833208937819 Q gi|254780277|r 950 KACLPKINLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 950 ~~~~~~~~l~VPl~v~~~~G~~W~e~~ 976 (976) +|. +|+|||+|++++|+||+||| T Consensus 906 ~a~----~l~VPL~v~~~~g~nW~eah 928 (928) T PRK07997 906 NCT----RLDVPLLVEVGSGENWDQAH 928 (928) T ss_pred CCC----CCCCCEEEECCCCCCHHHCC T ss_conf 844----37887585359277968809 No 3 >PRK06887 consensus Probab=100.00 E-value=0 Score=2342.73 Aligned_cols=920 Identities=39% Similarity=0.626 Sum_probs=854.0 Q ss_pred CCCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHC Q ss_conf 89728999662699988742675446898855006999999999999984855479879999727488713240865216 Q gi|254780277|r 3 KENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKA 82 (976) Q Consensus 3 ~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa 82 (976) ++++|+|||||+++||||||+|+++ +++|+||||||||+|||++++++++|+ |++||||.++||||||+||+||| T Consensus 5 ~~~~L~LIDg~sl~fRAfyAl~~l~-ts~G~pTnAi~GF~~ml~kli~~~kP~----~iavaFD~~~~TFR~e~y~~YKa 79 (954) T PRK06887 5 ATNPLVLVDGSSYLYRAFHAFPPLT-NSAGEPTGAMYGVLNMLKSLISQVQPS----HIAVVFDAKGKTFRDELFEQYKS 79 (954) T ss_pred CCCCEEEEECCHHHHHHHCCCCCCC-CCCCCEECHHHHHHHHHHHHHHHCCCC----EEEEEEECCCCCCCHHHHHHHCC T ss_conf 8998899967589999980788675-998987335999999999999871998----89999808899603323056537 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCCCE Q ss_conf 67987975886489999999970997995689016658999999998489949998687012122379759996678835 Q gi|254780277|r 83 NRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKEEK 162 (976) Q Consensus 83 ~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~~~ 162 (976) ||++|||||.+|||+|++++++||||+++.+||||||||||+|++++++|++|+|+|||||++|||+++|.+|+|++++. T Consensus 80 ~R~~~PeeL~~Qip~i~e~l~a~gip~~~~~GyEADDiIgTla~~~~~~g~~v~IvS~DKDl~QLV~~~v~i~~~~~~~~ 159 (954) T PRK06887 80 HRPPMPDDLRKQIQPLHDIIRALGIPLLSVEGVEADDVIGTLAVQASSAGKKVLISTGDKDMAQLVDDNIMLINTMNNSL 159 (954) T ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHCCCCEEEEECCCCEE T ss_conf 99999878999999999999987998855689349999999999999789929997588770121679879998989827 Q ss_pred ECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHH------HHHHHHHCCC Q ss_conf 3899998884898878777787518743589987763602499898413678999864542001------1223220035 Q gi|254780277|r 163 IDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQ------KKRRENILEY 236 (976) Q Consensus 163 ~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~------~k~~~~l~~~ 236 (976) +|++.|.|||||+|+||+||+||+||+|||||||||||||||+|||++|||||+||+|+|+|++ +|++++|+++ T Consensus 160 ~d~~~V~ek~GV~P~qiiD~lAL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLe~Iy~nld~i~~~~~rg~~Kl~e~L~~~ 239 (954) T PRK06887 160 LDREGVIEKYGIPPELIIDYLALMGDSSDNIPGVAGVGEKTALGLLQGIGSMAEIYANLDKVAELPIRGAKKLGEKLLAE 239 (954) T ss_pred ECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCEEHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 57999989879798999999997188524778878541699999999957799999869873255420057899999998 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 78999765543203564224763552126767688998653233689998765311445443342111234332222333 Q gi|254780277|r 237 AETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAI 316 (976) Q Consensus 237 ~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~ 316 (976) +|+|++||+||||++|||+++++++++++++|.+++.++|++|||++|++++...........+... T Consensus 240 ~e~a~LSk~LvtI~~Dvpl~~~~~~l~~~~~d~~~L~~~f~elEFksll~~l~~~~~~~~~~~~~~~------------- 306 (954) T PRK06887 240 KANADLSYLLATIKTDVELDVTPEQLLLGESNKDELIEYFARYEFKRWLNEVMNGANSITQTTEQPV------------- 306 (954) T ss_pred HHHHHHHHHHHCCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCCHHCC------------- T ss_conf 9989999986014557888899899444899999999999983485899998435453334400012------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 22222223333222223333211223211200112110000123555666503547999999999999866852323322 Q gi|254780277|r 317 ETEIDRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIIT 396 (976) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtET 396 (976) ........++. ...++ ..............|..|.+.+++..|++++...+.+|+|||| T Consensus 307 ----------~~~~~~~~~~~---~~~~~--------~~~~~~~~~~~~~~y~~i~~~~~l~~~~~~l~~~~~va~DtEt 365 (954) T PRK06887 307 ----------KMNQYQATSAV---PQDQS--------AVKNTPKVKIDRTKYETLLTQAQLTSWIEKLNQAKLIAVDTET 365 (954) T ss_pred ----------CCCCCCCCCCC---CCCCC--------CCCCCCCCCCCCCCCEECCCHHHHHHHHHHHHHCCEEEEEEEC T ss_conf ----------22311246555---43210--------0002443444533134408999999999987637927999635 Q ss_pred CCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCC-------------------CCCCCCCCHHHHHHHHHHHH Q ss_conf 24331100124420102577665443455023137443334-------------------56500178899997444420 Q gi|254780277|r 397 DTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQ-------------------DSTGKNTLTKEILSYLKKFF 457 (976) Q Consensus 397 t~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~-------------------d~~~~~~~~~evl~~L~~ll 457 (976) ++++++.++++|+|+|+.++. ++|+|+.|.... .....++...++++.|+++| T Consensus 366 ~~ld~~~a~lVGis~s~~~g~--------a~YiPl~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~ql~~~~vl~~Lkpll 437 (954) T PRK06887 366 DSLDYMSANLVGISFALENGE--------ACYIPLAHKQQVAEQPQSDLFAEEAETQTDYLLAPKQLNKSTCLAALKPLL 437 (954) T ss_pred CCCCCCCCCEEEEEEEECCCC--------EEEEECCCCCCCCCCCCHHHHHHCCCCHHHHHCCHHHCCHHHHHHHHHHHH T ss_conf 998713266578999835897--------899864442100013200233200100122210232127999999989985 Q ss_pred CCCCCHHHHCCCCHHHHHHHHCCCCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCC---CCCCCCCCCCCCCCHHH Q ss_conf 22210011003212378898618866642-100234554316876677246677622444---22335654322210000 Q gi|254780277|r 458 ENEHFLKIGHNIKYDKLVLHRYGISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYT---RKEILKSRKSSIPIDKI 533 (976) Q Consensus 458 e~~~i~kIgHN~KfD~~~L~~~GI~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~---~~e~~~~~k~~~~~~~i 533 (976) +|+++.|||||+|||+++|.++||.++++ ||||||+|||||.. +|+|++|+.+||++. .+++.|+++++++|+.+ T Consensus 438 ed~~i~KIghNlK~dl~vL~~~Gi~l~g~~fDTmLAaYLLdp~~-r~~Ld~LA~~yL~~~~i~~edl~gkGk~ql~f~~v 516 (954) T PRK06887 438 ENPDIHKIGQNIKYDLTIFANHGIELQGVAFDTMLESYTLNSTG-RHNMDDLAERYLGHETIAFESIAGKGKNQLTFDQI 516 (954) T ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCC-CCCHHHHHHHHCCCCCCCHHHHCCCCCCCCCHHHC T ss_conf 29750222114788999999779624776516989987518755-57889999996388653346422544333784434 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 001233332221111124567763222100-0134578860750123322101431005899998643436899999999 Q gi|254780277|r 534 SDSQVQEHAIENSNVILQLWLLLRPKLIVE-KLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEEN 612 (976) Q Consensus 534 ~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~-~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~ 612 (976) +.+.+.+|||+||++|++||..|+++|.+. +|+++|++|||||++||++||.+||+||.+.|++++.+++.++++++++ T Consensus 517 ~l~~~~~YAaedA~itl~L~~~L~~~L~~~~~L~~Ly~~IE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~~le~~ 596 (954) T PRK06887 517 PLEQATEYAAEDADVTMKLHQVLWLELQKEPTLVKLYKTMELPLVHVLSRIERNGVLIDSDALLAQSNEIAQRLTALEQQ 596 (954) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEEECHHHHHHHHHHHHHHHHHHHHH T ss_conf 46799999999999999999999999985454789999999799999999997385887999999999999999999999 Q ss_pred HHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 97403455777765666799874207887555665453531788776530476279999999878889886532001111 Q gi|254780277|r 613 IYDLSGEKFNLSSPKQLGDILFTKLKFPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHI 692 (976) Q Consensus 613 i~~~aG~~FNinSpkQL~~vLf~~LgLp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i 692 (976) ||++||++|||||||||++|||++||||++||||||.||||++|||+| +..||+++.||+||+++||+|||+|+|+++| T Consensus 597 iy~~aG~eFNinSpKQL~eiLFekLgLp~~KKTKtG~~STda~vLe~L-~~~h~i~~~il~yR~l~KL~sTY~d~L~~~i 675 (954) T PRK06887 597 AYELAGQPFNLASTKQLQEILFDKLGLPVLKKTPKGAPSTNEEVLEEL-AYQHELPKLLVEHRGLSKLKSTYTDKLPQMV 675 (954) T ss_pred HHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH-HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 988609954899989999998984299988858899977469999998-6235326899999899998999898887640 Q ss_pred HHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHH Q ss_conf 00123334533002221000001333433345563322224302424899489996326544368766510377787340 Q gi|254780277|r 693 NKRTQRVHTFYSLASTMTGRLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFE 772 (976) Q Consensus 693 ~~~dgRIH~~f~q~~t~TGRlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~ 772 (976) ++.||||||+|||++|+||||||++|||||||+||++||+||+||+|++||+|++|||||||||||||+|+|+.|++||+ T Consensus 676 ~~~tgRiHt~f~q~~t~TGRlSS~~PNLQNIPiRte~Gr~iR~aF~~~~g~~l~saDYSQIELRilAh~s~d~~l~~af~ 755 (954) T PRK06887 676 NSKTGRVHTSYHQAVTATGRLSSSDPNLQNIPIRNEEGRRIRQAFIARDGYKIVAADYSQIELRIMAHLSGDQGLINAFS 755 (954) T ss_pred CCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHEEEECCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 77898777765201123120137998633688786022233400543899889971452899999999966887999986 Q ss_pred CCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHH Q ss_conf 69888999998980999688998888630251356440777035678819998999999999999888899999999999 Q gi|254780277|r 773 NSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNF 852 (976) Q Consensus 773 ~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~ 852 (976) +|+|||+.||+.|||+|.++||++||+.||+|||||+||||+||||++||||+.||++||++||++||+|+.|+++++++ T Consensus 756 ~g~DiH~~tA~~vf~~~~~~v~~~~R~~aK~vnfgi~YG~~~~gLs~~l~i~~~eA~~~i~~yf~~~~~v~~~~~~~~~~ 835 (954) T PRK06887 756 EGKDIHRSTAAEIFGLSLDEVTSEQRRSAKAINFGLIYGMSAFGLSRQLGISRADAQKYMDLYFQRYPSVQQFMTDIREK 835 (954) T ss_pred CCCCCHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 38880277899994999466898899876888766566168777998709899999999999988690099999999999 Q ss_pred HHHCCCEECCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEEC Q ss_conf 99849888277876018987588378887787877013456538999999999999999865998189997263899955 Q gi|254780277|r 853 VRQNGYVETIFGRRIHYDEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEV 932 (976) Q Consensus 853 a~~~gyv~T~~GRrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev 932 (976) |+++|||+|++|||||+|+|+|.|.++|++|||+|+|||||||||||||+||++++++++..+ +++||||||||||||| T Consensus 836 a~~~Gyv~Tl~GRrr~~p~i~s~n~~~r~~~eR~aiN~piQGtAADiiK~AMi~~~~~l~~~~-~~~~~lqvHDEli~e~ 914 (954) T PRK06887 836 AKAQGYVETLFGRRLYLPDINSSNAMRRKGAERVAINAPMQGTAADIIKRAMIKIDKVIRGDP-DIKMIMQVHDELVFEV 914 (954) T ss_pred HHHCCCEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCC-CCEEEEEECEEEEEEE T ss_conf 997794688888730687666899889978999983756526799999999999999984278-9769998670668855 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 98799999999999965101357444464388833208937819 Q gi|254780277|r 933 PEEEITTASQIIVHSMEKACLPKINLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 933 ~~~~~~~~~~~~~~~Me~~~~~~~~l~VPl~v~~~~G~~W~e~~ 976 (976) |++++++++++|+++||+|. +|+|||+|++++|+||+||| T Consensus 915 ~~~e~e~~~~~v~~~Me~a~----~l~VPL~v~~~~g~nW~eah 954 (954) T PRK06887 915 RSEKVEFYSELIKQEMEKAA----QLVVPLIVEVGVGDNWDEAH 954 (954) T ss_pred CHHHHHHHHHHHHHHHHCCC----CCCCCEEEECCCCCCHHHCC T ss_conf 68899999999999983741----06786373107287977719 No 4 >PRK07625 consensus Probab=100.00 E-value=0 Score=2322.20 Aligned_cols=911 Identities=38% Similarity=0.625 Sum_probs=852.4 Q ss_pred CCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHH Q ss_conf 98897289996626999887426754468988550069999999999999848554798799997274887132408652 Q gi|254780277|r 1 MQKENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDY 80 (976) Q Consensus 1 m~~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~Y 80 (976) |+ +|+|+|||||+++||||||+|+++ +++|+||||||||+|||++++++++|+ |++||||.+++||||++||+| T Consensus 3 ~~-~k~l~liDg~~lifRAfyAlp~l~-ts~G~pTnAi~GF~~mL~kli~e~~P~----~iavaFD~~~~TFR~e~y~eY 76 (922) T PRK07625 3 LE-GKTLLLVDGSSYLYRAYHAMPDLR-GPGGEPTGALYGIVNMLRRMRKDVSAE----YSACVFDAKGKTFRDDLYADY 76 (922) T ss_pred CC-CCEEEEECCCHHHHHHHHCCCCCC-CCCCCEECHHHHHHHHHHHHHHHCCCC----EEEEEECCCCCCCCHHHCHHH T ss_conf 99-997999847389999982788887-998987117999999999999865998----899997198996340013565 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC Q ss_conf 16679879758864899999999709979956890166589999999984899499986870121223797599966788 Q gi|254780277|r 81 KANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE 160 (976) Q Consensus 81 Ka~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~ 160 (976) ||||++|||+|.+|||+|++++++||||+++.+||||||||||||++++++|++|+|+|||||++|||+++|++|+|+++ T Consensus 77 KAnR~~~PedL~~Qip~i~e~l~a~gIp~~e~~GyEADDiIgTla~~a~~~g~~v~IvSgDKDl~QLV~~~v~i~~~~k~ 156 (922) T PRK07625 77 KAHRPPMPPDLALQIEPIHEAVRALGWPLLMIEGVEADDVIGTLARQAEQHGMNVIVSTGDKDLAQLVTDRVTLVNTMTN 156 (922) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCCCEEEEECCCC T ss_conf 18999998889998999999999879988452891589999999999997889399981898686618998699978899 Q ss_pred CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 35389999888489887877778751874358998776360249989841367899986454200112232200357899 Q gi|254780277|r 161 EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETA 240 (976) Q Consensus 161 ~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a 240 (976) +.||++.|.|||||+|+||+||+||+||+|||||||||||||||++||++|||||+||+|+|+|+| |++++|++++|+| T Consensus 157 ~~~d~~~V~ek~GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~I~~n~d~Ikg-k~~e~L~~~~e~a 235 (922) T PRK07625 157 EVLDRDGVLAKFGVPPERIVDYLSLIGDTVDNVPGVEKCGPKTAVKWLTQYGSLDGVVEHAGEIKG-VVGDNLRRALDFL 235 (922) T ss_pred CCCCHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCC-HHHHHHHHHHHHH T ss_conf 270899999987979899999999719763578999985679899999974789999998873020-3678999769999 Q ss_pred HHHHHHHHCCCCCCCCCCHHH----HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 976554320356422476355----2126767688998653233689998765311445443342111234332222333 Q gi|254780277|r 241 RLSRKLVMLRTDVPITIPLEH----LILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAI 316 (976) Q Consensus 241 ~ls~~L~tL~~Dvpl~~~~~~----l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~ 316 (976) ++||+||||++||||+.++++ +++.++|.+.+.++|++|||++|++++................. T Consensus 236 ~LSk~LvTI~~Dv~l~~~l~d~~~~l~~~~~d~~~L~~lf~eleFksllk~l~~~~~~~~~~~~~~~~~----------- 304 (922) T PRK07625 236 PLARKLVTVETDCDLAPHVESFDASLATDGEDRDALRAIFARYGFKTWLRELDSEPAAGGGAAAAAAAG----------- 304 (922) T ss_pred HHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHCHHHHHHHHCCCCCCCCCCCCCCCCC----------- T ss_conf 998998752446888876230677741378899999999998341899998515755554321112344----------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 22222223333222223333211223211200112110000123555666503547999999999999866852323322 Q gi|254780277|r 317 ETEIDRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIIT 396 (976) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtET 396 (976) .. .. .... ............|..|.+.++++.|++.+...+.+++|+|| T Consensus 305 ---------------~~-~~-----~~~~----------~~~~~~~~~e~~y~~i~~~e~l~~~~~~l~~~~~~a~dtEt 353 (922) T PRK07625 305 ---------------AA-PD-----PAAG----------AAAELAAPTERNYATVQTWEQFDAWLAKISAAELTAFDTET 353 (922) T ss_pred ---------------CC-CC-----CCCC----------CCCCCCCCCCCCEEEECCHHHHHHHHHHHHCCCEEEEEEEC T ss_conf ---------------56-65-----3234----------43323565434307837899999999986408848999522 Q ss_pred CCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHH Q ss_conf 24331100124420102577665443455023137443334565001788999974444202221001100321237889 Q gi|254780277|r 397 DTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVL 476 (976) Q Consensus 397 t~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L 476 (976) ++++++.++++|+|+|+.++. ++|+|+.|.... ...++..+++++.|++||+++.+.|||||+|||+++| T Consensus 354 tsld~~~a~lvGis~s~~~g~--------a~YiPl~h~~~d--~~~~l~~~~vl~~Lk~~Led~~i~KIghNlK~d~~vL 423 (922) T PRK07625 354 TSLDPMLAQLVGLSFSVEPGH--------AAYIPVAHRGPD--MPAQLPRDEVLARLRPWLEDAGRKKLGQHLKYDAQVL 423 (922) T ss_pred CCCCHHHCEEEEEEEEECCCC--------EEEEEECCCCCC--HHHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 776500130565797743886--------799970215865--2321589999998799874344035542577899999 Q ss_pred HHCCCCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCC---CCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 8618866642-100234554316876677246677622444---223356543222100000012333322211111245 Q gi|254780277|r 477 HRYGISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYT---RKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQL 552 (976) Q Consensus 477 ~~~GI~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~---~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L 552 (976) .++||.++++ ||||||+|||+|.+ +|+|++|+.+||++. .+++.|+++++++|++++.+.+..|||+|+++|++| T Consensus 424 ~~~GI~l~g~~fDTmLAaYLL~~~r-~~~Ld~La~~yL~~~~i~~edl~Gkg~~q~~~~~v~~~~~~~Ya~eda~~t~~L 502 (922) T PRK07625 424 ANYGIALNGIEHDTLLESYVLESHR-THDMDSLALRHLGVKTIKYEDVAGKGAKQIGFDEVALAQASEYAAEDADITLQL 502 (922) T ss_pred HHCCCCCCCCCHHHHHHHHHHCCCC-CCCHHHHHHHHHCCCCCCHHHHCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHH T ss_conf 9779645875403999988754676-788799999872556554132123333337832246799999999999999999 Q ss_pred HHHHHHHHHH-HHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 6776322210-001345788607501233221014310058999986434368999999999740345577776566679 Q gi|254780277|r 553 WLLLRPKLIV-EKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGD 631 (976) Q Consensus 553 ~~~l~~~L~~-~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~ 631 (976) |..|.++|.+ .+|+++|++|||||++||++||.+||+||.++|++++.++..++.+++++||++||++|||||||||++ T Consensus 503 ~~~L~~~L~~e~~L~~ly~~iE~PLi~VLa~ME~~GI~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~eFNinSpkQL~~ 582 (922) T PRK07625 503 HHALYPQVAREPGLERVYRDIELPVSLVLRKMERTGVLIDADRLSRQSGEIATRLLALEQEAYELAGGEFNLGSPKQIGQ 582 (922) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH T ss_conf 99999998340547999999995899999999982840269999999999999999999987775499778999999999 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 98742078875556654535317887765304762799999998788898865320011110012333453300222100 Q gi|254780277|r 632 ILFTKLKFPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTG 711 (976) Q Consensus 632 vLf~~LgLp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TG 711 (976) |||++||||++||||||.||||++|||+| ++.||+++.||+||+++||+|||+|+|+++|++.||||||+|||++|+|| T Consensus 583 iLFe~L~Lp~~kKTktG~~ST~~~vLe~L-~~~hpi~~~il~yR~l~KL~sTY~d~l~~~i~~~~gRiHt~f~Q~~t~TG 661 (922) T PRK07625 583 IFFERLQLPVVKKTPSGAPSTDEEVLQKL-AEDYPLPKLLLEHRGLSKLKSTYTDKLPRMVNPDTGRVHTNYAQAVAVTG 661 (922) T ss_pred HHHHHCCCCCCCCCCCCCCCCHHHHHHHH-HCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCCHHEEECCCC T ss_conf 99985389878858989978539999986-00687389999862898999999988888508887877662002102335 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHH Q ss_conf 00013334333455633222243024248994899963265443687665103777873406988899999898099968 Q gi|254780277|r 712 RLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIE 791 (976) Q Consensus 712 RlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~ 791 (976) ||||++|||||||+|+++||+||+||+|+|||+|++|||||||||||||+|+|+.|++||++|+|||+.||++|||+|.+ T Consensus 662 RlSS~~PNLQNIPir~~~G~~iR~aFi~~~g~~l~~aDYSQiELRilAh~s~d~~l~~af~~~~DiH~~tA~~vf~v~~~ 741 (922) T PRK07625 662 RLASNDPNLQNIPVRTEEGRRIREAFIAPPGSKIVSADYSQIELRIMAHISGDESLLRAFAHGEDIHRATAAEVFGVTPL 741 (922) T ss_pred CCCCCCCCCCCCCCCCCCCCHHHEEEECCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCCHHHHHCCCHH T ss_conf 43579986336887762240133447318998899815538899999999578879999865998542426999698957 Q ss_pred HCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCC Q ss_conf 89988886302513564407770356788199989999999999998888999999999999984988827787601898 Q gi|254780277|r 792 KVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDE 871 (976) Q Consensus 792 ~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~ 871 (976) +||++||+.||+|||||+||||+||||++||||++||++||++||++||+|+.||++++++|+++|||+|++|||||+|+ T Consensus 742 ~v~~~~R~~AK~vnfgiiYG~s~~gLa~~l~i~~~eA~~~i~~yf~~y~~v~~~~~~~~~~a~~~gyv~Tl~gRrr~~p~ 821 (922) T PRK07625 742 EVTSDQRRIAKVINFGLIYGMSAFGLASNLGITRDAAKLYIDRYFARYPGVARYMEETRMRAKEKGYVETVFGRRLWLPE 821 (922) T ss_pred HCCHHHHHHHHEEEEHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCEEECCCCCCCCCCC T ss_conf 68999986517020023655786779987199999999999999986912999999999999877926677786031765 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHC Q ss_conf 75883788877878770134565389999999999999998659981899972638999559879999999999996510 Q gi|254780277|r 872 INSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKA 951 (976) Q Consensus 872 i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~ 951 (976) |+|.|++.|++|||+|+|+|||||||||+|+||++++++++.++++++||||||||||||||++++++++++|+++||+| T Consensus 822 i~s~n~~~r~~~eR~AiN~piQGsaADiiK~Ami~i~~~l~~~~~~~~~ilqvHDEli~e~~~~~~~~~~~~~~~~Me~~ 901 (922) T PRK07625 822 INGGNGPRRQAAERAAINAPMQGTAADLIKLSMIAVDDWLERDGLRTRMIMQVHDELVLEVPDDELSLVREKLPEMMCGV 901 (922) T ss_pred CCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECEEEEEEECHHHHHHHHHHHHHHHHCC T ss_conf 56888788879999996502446799999999999999998469987169987547788407889999999999997274 Q ss_pred CCCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 1357444464388833208937819 Q gi|254780277|r 952 CLPKINLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 952 ~~~~~~l~VPl~v~~~~G~~W~e~~ 976 (976) . +|+|||+|++++|+||+||| T Consensus 902 ~----~l~VPL~V~~~~G~nW~eAh 922 (922) T PRK07625 902 A----KLKVPLVAEVGVGANWEEAH 922 (922) T ss_pred C----CCCCCEEEECCCCCCHHHCC T ss_conf 1----57887683328278978719 No 5 >PRK08835 consensus Probab=100.00 E-value=0 Score=2317.37 Aligned_cols=915 Identities=39% Similarity=0.635 Sum_probs=855.2 Q ss_pred CCCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHC Q ss_conf 89728999662699988742675446898855006999999999999984855479879999727488713240865216 Q gi|254780277|r 3 KENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKA 82 (976) Q Consensus 3 ~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa 82 (976) ++++|+|||||+++||||||+|+++ |++|+||||||||+|||+++|++++|+ |++||||.+++||||++||+||| T Consensus 5 ~~~~L~LIDg~slifRAfyA~p~l~-ts~G~pTnAv~GF~~mL~kli~~~~P~----~iavaFD~~~~TfR~~~y~~YKa 79 (931) T PRK08835 5 PENPLILIDGSSYLYRAFHAYPGTM-SNGEIPTNAVYGVVNMLRSMMRQFASD----RIAVIFDAKGKTFRDDMYPEYKA 79 (931) T ss_pred CCCCEEEEECCHHHHHHHHCCCCCC-CCCCCEECHHHHHHHHHHHHHHHCCCC----EEEEEECCCCCCCCHHHHHHHCC T ss_conf 9997799965579999982799878-999988306999999999999864998----79999749899623434057517 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCCCE Q ss_conf 67987975886489999999970997995689016658999999998489949998687012122379759996678835 Q gi|254780277|r 83 NRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKEEK 162 (976) Q Consensus 83 ~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~~~ 162 (976) ||++|||||.+|||+|++++++||||+++.+||||||||||+|++++++|++|+|||||||++|||+++|.||+|+++.. T Consensus 80 ~R~~~pedL~~Q~p~i~e~l~a~gIp~~~~~GyEADDiIgTla~~~~~~g~~v~IvS~DKDl~QLV~~~v~v~~~~~~~~ 159 (931) T PRK08835 80 NRPPMPDDLRCQIEPLHQVIKAMGLPLIAIEGVEADDVIGTLAYQASQAGMPVLISTGDKDMAQLVDDNITLINTMTNVV 159 (931) T ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHCCCCEEEEECCCCEE T ss_conf 99999878999999999999987998844589149999999999999779939998289976341789859998889868 Q ss_pred ECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHH------HHHHHHHCCC Q ss_conf 3899998884898878777787518743589987763602499898413678999864542001------1223220035 Q gi|254780277|r 163 IDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQ------KKRRENILEY 236 (976) Q Consensus 163 ~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~------~k~~~~l~~~ 236 (976) +|++.|.|||||+|+||+||+||+||+|||||||||||||||+|||++|||||+||+|+|+|++ ++++++|+++ T Consensus 160 ~d~~~V~ek~GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~Iy~nld~Ik~~~~rg~kk~~ekL~e~ 239 (931) T PRK08835 160 MDREGVVEKFGIPPELIIDYLALMGDKVDNIPGVPGVGDKTATALLQGIGGLDALYDNLDDIAALGFRGSKTMAKKLVDN 239 (931) T ss_pred ECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHH T ss_conf 78999999879698998999997188646889999745888999999848699999769861444331056899999987 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 78999765543203564224763552126767688998653233689998765311445443342111234332222333 Q gi|254780277|r 237 AETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAI 316 (976) Q Consensus 237 ~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~ 316 (976) +|+|++||+||||++|||++++++++++.++|.+++.++|++|||++|++++...........+... .. T Consensus 240 ~e~A~LSk~LaTI~~Dvpl~~~le~l~~~~pd~~~L~~~f~eleFksll~~l~~~~~~~~~~~~~~~------~~----- 308 (931) T PRK08835 240 KDNAYLSYELATIKLDVELEETPESLLKQEPNKDELIKLYGQLTFKSWLNELLDGGTGVVEADEKAS------TS----- 308 (931) T ss_pred HHHHHHHHHHHHHCCCCCCCCCHHHHHCCCCCHHHHHHHHHHHHCHHHHHHHHHCCCCCCCCCCCCC------CC----- T ss_conf 9999998987420335567889899730799999999999997478999998734455444443234------55----- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 22222223333222223333211223211200112110000123555666503547999999999999866852323322 Q gi|254780277|r 317 ETEIDRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIIT 396 (976) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtET 396 (976) . ......... .............|..|.+.+++..|++.+...+.++||+|| T Consensus 309 -------------~------~~~~~~~~~---------~~~~~~~~~~~~~y~~i~~~~~l~~~l~~l~~~~~~a~dtEt 360 (931) T PRK08835 309 -------------V------RSGASHAAA---------EPESVAATIDRSQYETILDEASFQLWLDKLKAAELFAFDTET 360 (931) T ss_pred -------------C------CCCCCCCCC---------CCCCCCCCCCCCCCEEECCHHHHHHHHHHHHCCCEEEEEEEC T ss_conf -------------4------456665433---------433234445623325736889999999985338859999504 Q ss_pred CCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHH Q ss_conf 24331100124420102577665443455023137443334565001788999974444202221001100321237889 Q gi|254780277|r 397 DTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVL 476 (976) Q Consensus 397 t~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L 476 (976) ++++++.++++|+|+|+.++. ++|+|+.|.... ...+...+.+++.|+++|+++.+.|||||+|||+++| T Consensus 361 ~~ld~~~a~lvGis~~~~~g~--------a~YiPl~h~~l~--~~~ql~~d~vl~~LkplLed~~i~KIghNlK~d~~vL 430 (931) T PRK08835 361 DSLDYMVANLVGLSFAVAEGE--------AAYVPVAHDYLD--APEQLDRDWVLAQLKPLLEDDAKAKVGQNLKYDASVL 430 (931) T ss_pred CCCCHHHCCCCCEEEEECCCC--------EEEEEECCCCCC--CHHHCCHHHHHHHHHHHHCCCCHHHHHCCHHHHHHHH T ss_conf 874322213354268764785--------799971233345--5101369999999799842842436622115789999 Q ss_pred HHCCCCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCC---CCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 8618866642-100234554316876677246677622444---223356543222100000012333322211111245 Q gi|254780277|r 477 HRYGISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYT---RKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQL 552 (976) Q Consensus 477 ~~~GI~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~---~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L 552 (976) .++||.+.++ ||||||+||+||+.++|+|++|+.+||++. .+++.|+++++.+|+.++.+.+..|||+||++|++| T Consensus 431 ~~~gI~l~gi~fDTmLAaYLLnp~~~~~~Ld~La~~~L~~~~i~~e~l~gkgk~q~~f~~v~~~~~~~YaaedA~~t~~L 510 (931) T PRK08835 431 ARYGIEMKGIKHDTMLASYVYNSVGGKHDMDSLALRFLQHSCISFEQVAGKGKNQLTFNQIDLEEAAPYAAEDADVTLRL 510 (931) T ss_pred HHCCCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 97797767866124357530047778789899999864877634776514453447833432889999999999999999 Q ss_pred HHHHHHHHHH-HHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 6776322210-001345788607501233221014310058999986434368999999999740345577776566679 Q gi|254780277|r 553 WLLLRPKLIV-EKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGD 631 (976) Q Consensus 553 ~~~l~~~L~~-~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~ 631 (976) |+.|.++|++ ++++.+|++|||||++||++||.+||+||.+.|++++.+++.++++++++||++||++||+||||||++ T Consensus 511 ~~~L~~~L~~~~~L~~Ly~~IE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~~le~~I~~~aG~eFNinSPKQL~~ 590 (931) T PRK08835 511 HNRLFANIEQDEKLKSVYEEIEMPLVPVLSRIERTGVLIDDMLLSAQSQEIAARLDELEQKAYEIAGQEFNMNSPKQLQT 590 (931) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH T ss_conf 99999999854778899999986899999999972848789999999999999999999999876278567799999999 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 98742078875556654535317887765304762799999998788898865320011110012333453300222100 Q gi|254780277|r 632 ILFTKLKFPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTG 711 (976) Q Consensus 632 vLf~~LgLp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TG 711 (976) |||++||||++||||||.||||+++||+|. ..||+++.||+||+++||+|||+++|+++|++.||||||+|||++|+|| T Consensus 591 vLF~~l~Lp~~kKtktG~~ST~~~vLe~L~-~~hpi~~~il~yR~l~KL~sTy~~~L~~~i~~~~gRiHt~f~Q~~t~TG 669 (931) T PRK08835 591 ILFEKMGLPVIKKTPSGTPSTNEEVLQELA-LDYPLPKVILEYRGLAKLKSTYTDKLPKMINPSTGRVHTSYHQAVTATG 669 (931) T ss_pred HHHHHCCCCCCCCCCCCCCCCHHHHHHHHH-HHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCHHHHCCCCCC T ss_conf 999860998788588999675599999876-5177789999997898888887755576528675887873431015524 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHH Q ss_conf 00013334333455633222243024248994899963265443687665103777873406988899999898099968 Q gi|254780277|r 712 RLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIE 791 (976) Q Consensus 712 RlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~ 791 (976) ||||++|||||||+|+++||+||+||+|++||+|++|||||||||||||+|+|+.|++||++|+|||+.||+.|||+|.+ T Consensus 670 RLSS~~PNLQNIPiRt~~G~~iR~aFva~~g~~l~saDYSQIELRilAh~s~d~~li~af~~~~DiH~~tA~~if~v~~~ 749 (931) T PRK08835 670 RLSSTDPNLQNIPIRNEEGRRIRQAFVAPHGYKILAVDYSQIELRIMAHLSGDQALLDAFRDGKDIHAATAAEIMGVSIE 749 (931) T ss_pred CCCCCCCCCCCCCCCCHHHCCHHHHEECCCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHH T ss_conf 52579963026788860111000534669988799704318999999998299899999866998067899999599946 Q ss_pred HCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCC Q ss_conf 89988886302513564407770356788199989999999999998888999999999999984988827787601898 Q gi|254780277|r 792 KVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDE 871 (976) Q Consensus 792 ~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~ 871 (976) +||++||++||+|||||+||||+||||++||||+.||++||++||++||+|+.||++++++|+++|||+|++|||||||+ T Consensus 750 ~v~~~~R~~AK~vnfgi~YG~~~~gLa~~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~g~v~Tl~gRrr~~p~ 829 (931) T PRK08835 750 QVSSEQRRRAKAVNFGLIYGMSAFGLAKQLGIPRGEAQAYMDKYFERYPGVMQYMEDTRSRAAQQGYVETIFGRRLHLPE 829 (931) T ss_pred HCCHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCCCCCCCCC T ss_conf 68998987758333112445787779987098999999999999886822999999999999877917887887314886 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHH Q ss_conf 758837888778787701345653899999999999999986599-8189997263899955987999999999999651 Q gi|254780277|r 872 INSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKL-ATKMLLQIHDELVFEVPEEEITTASQIIVHSMEK 950 (976) Q Consensus 872 i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~-~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~ 950 (976) |+|.|+++|++|||+|+|||||||||||||+||++++++++..+. +++||||||||||||||++++++++++|+++||+ T Consensus 830 i~s~n~~~r~~~eR~a~N~piQGtaADiiK~AMi~~~~~l~~~~~~~~~~~lqvHDEli~e~~~~~~~~~~~~v~~~Me~ 909 (931) T PRK08835 830 ITSRNGMRRKAAERAAINAPMQGTAADIIKKAMLLVDEWIEEEGNGRVKLLMQVHDELVFEVEESSLSEIESKVQQLMES 909 (931) T ss_pred CCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCEEEEEECHHHHHHHHHHHHHHHHC T ss_conf 56888778868999995712445599999999999999998556777628986672778831688999999999999727 Q ss_pred CCCCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 01357444464388833208937819 Q gi|254780277|r 951 ACLPKINLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 951 ~~~~~~~l~VPl~v~~~~G~~W~e~~ 976 (976) |. +|+|||+|++++|+||+||| T Consensus 910 ~~----~l~Vpl~v~~~~G~nW~eah 931 (931) T PRK08835 910 AA----ELDVPLVAEAGHGDNWDQAH 931 (931) T ss_pred CC----CCCCCEEEECCCCCCHHHHC T ss_conf 63----27786374238076978819 No 6 >PRK08786 consensus Probab=100.00 E-value=0 Score=2315.76 Aligned_cols=920 Identities=39% Similarity=0.631 Sum_probs=852.9 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHCCC Q ss_conf 72899966269998874267544689885500699999999999998485547987999972748871324086521667 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKANR 84 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa~R 84 (976) ++|+|||||+|+||||||+|+++ +++|+||||||||+|||.+++++ +| +|++||||.+++||||++||+||||| T Consensus 2 ~~l~LIDg~slifRAy~A~~~l~-~~~G~pTnAv~GF~~mL~kl~~e-~p----~~~~v~fD~~~~tfR~~~~~~YKa~R 75 (927) T PRK08786 2 SRLVLIDGSSYLYRAFHALPPLT-NAQGEPTGALFGVVNMLRATLKE-RP----AYVAFVVDAPGKTFRDALYADYKANR 75 (927) T ss_pred CEEEEEECCHHHHHHHCCCCCCC-CCCCCEECHHHHHHHHHHHHHHH-CC----CEEEEEEECCCCCCCCCCCHHHCCCC T ss_conf 71899927489999980799887-99998820799999999999987-88----98999996959974511035760899 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCCCEE- Q ss_conf 9879758864899999999709979956890166589999999984899499986870121223797599966788353- Q gi|254780277|r 85 PKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKEEKI- 163 (976) Q Consensus 85 ~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~~~~- 163 (976) ++|||+|.+|||+|++++++||||+++.+||||||||||+|++++.+|++|+|+|||||++|||+++|++|+|+++..+ T Consensus 76 ~~~p~~l~~Q~~~i~~~~~~~gi~~~~~~g~EADDii~tla~~~~~~g~~v~IvSgDKD~~QLV~~~v~i~~~~kg~~~~ 155 (927) T PRK08786 76 PSMPDDLRAQVQPMCDIVHALGIDILRIDGVEADDVIGTLALQAAADGLAVTISTGDKDFAQLVRPGVELVNTMSGSRMD 155 (927) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCCCEEEEECCCCCEEC T ss_conf 99988999999999999998799885438926899999999999978991999828987524178985999889997765 Q ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 89999888489887877778751874358998776360249989841367899986454200112232200357899976 Q gi|254780277|r 164 DIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETARLS 243 (976) Q Consensus 164 ~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~ls 243 (976) +++.|.|||||+|+||+||+||+||+|||||||||||||||+|||++|||||+||+|+|+|++ |++++|++++|++++| T Consensus 156 ~~~~V~eK~GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~I~~n~d~Ikg-K~~e~L~~~~e~a~lS 234 (927) T PRK08786 156 SDAAVIAKFGVRPDQIVDLLALMGDTVDNVPGVEKCGPKTAAKWLAEYDSLDGVIANADKIKG-KIGDNLRAALPRLPLN 234 (927) T ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH T ss_conf 789999996969899999999808764678999985689999999975769999999875546-6788999889999974 Q ss_pred HHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 55432035642247635521267676889986532336899987653114454433421112343322223332222222 Q gi|254780277|r 244 RKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAIETEIDRD 323 (976) Q Consensus 244 ~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~ 323 (976) |+|+||++|||++++++++++.++|.+.+..+|++|||+++++++.......+....+...... +.... T Consensus 235 k~LatI~~Dvpl~~~le~l~~~~~d~e~L~~~f~eleF~sllk~l~~~~~~~~~~~~~~~~~~~-----------~~~~~ 303 (927) T PRK08786 235 RELVTIKTDVVLASGPRALDLREPNTEALAVLYARYGFTQALRELGGAAAQAGLLAEPVALART-----------EPGRA 303 (927) T ss_pred HHHHHEEECCCCCCCHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCCCCCC-----------CCCCC T ss_conf 8876522178788898993458999999999999813389999874111003433454211224-----------65433 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 33332222233332112232112001121100001235556665035479999999999998668523233222433110 Q gi|254780277|r 324 TTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIITDTMDAFH 403 (976) Q Consensus 324 ~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~~ 403 (976) ... .....|.... ............|..|.+.+++..|+.++...+.++||+||++++++. T Consensus 304 ---~~~------------~~~~~p~~~~----~~~~~~~~~~~~y~~i~~~e~L~~~i~~l~~~~~~afdtET~sld~~~ 364 (927) T PRK08786 304 ---RGT------------GFVSAPANAA----VELDPALSAPGQYETILTQEQLDSWIARLRAAGQFAFDTETDSLDPLQ 364 (927) T ss_pred ---CCC------------CCCCCCCCCC----CCCCCCCCCCCCEEEECCHHHHHHHHHHHHHCCEEEEEEECCCCCHHH T ss_conf ---344------------3335753100----124410157552023267999999999987368389997238866555 Q ss_pred CCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCCCCC Q ss_conf 01244201025776654434550231374433345650017889999744442022210011003212378898618866 Q gi|254780277|r 404 SKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYGISM 483 (976) Q Consensus 404 ~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l 483 (976) ++++|+|+|+..+. ++|+|+.|.... ...++...+++..|+++|+++.+.|||||+|||+++|.++||.+ T Consensus 365 a~lVGiS~s~~~g~--------a~YiP~~h~~~g--~~~qld~~~~l~~LkplLed~~I~KIghnlK~Dl~vL~~~GI~l 434 (927) T PRK08786 365 ADLIGLSVAAEPGQ--------AAYLPFGHDFPG--APVQLDRTQALAQLAPLLTDPAVRKVGQHGKYDLHVMRRHGIAL 434 (927) T ss_pred HHHHCEEEECCCCC--------EEEEECCCCCCC--CCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCC T ss_conf 31212245426786--------677532445678--60341399999998998608651123211788899999679711 Q ss_pred CCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCC---CCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 642-100234554316876677246677622444---2233565432221000000123333222111112456776322 Q gi|254780277|r 484 RGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYT---RKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPK 559 (976) Q Consensus 484 ~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~---~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~ 559 (976) .++ +|||||+|||||+.++|+|++++++||++. .+++.|+++++++|..++.+.+..|||+|+++|++||..|.++ T Consensus 435 ~g~~~DTmLAaYLL~p~~~~~~Ld~La~~yLg~~~~~~~~l~gkg~k~~~~~~v~~~~~~~Ya~eda~~tl~L~~~l~~k 514 (927) T PRK08786 435 AGYADDTLLESFVLNSGSARHDMDSLAKRYLGYDTVKYEDVCGKGAKQIPFAQVSLEDATRYAAEDADITLRLHCVLGPK 514 (927) T ss_pred CCCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 67664258888623776454788999999669775426754244322288333259999999999999999999999999 Q ss_pred H-HHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 2-100013457886075012332210143100589999864343689999999997403455777765666799874207 Q gi|254780277|r 560 L-IVEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLK 638 (976) Q Consensus 560 L-~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~Lg 638 (976) | ++.+|+++|++|||||++||++||.+||+||.+.|++++.+++.++..++++||++||++|||||||||++|||++|| T Consensus 515 L~~e~~l~~ly~~iE~PLi~VLa~ME~~GI~vD~~~L~~l~~~l~~~l~~le~~I~~~aG~eFNinSpKQL~~iLFekL~ 594 (927) T PRK08786 515 LAAEPGLERVYRDIEMPLVDVLERIEANGVCVDAAELRRQSADLSKRMLAAQQKATELAGRTFNLDSPKQLQALLFDELK 594 (927) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCC T ss_conf 98505299999999979999999988568798899999999999999999999999970855578999999999975028 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCC Q ss_conf 88755566545353178877653047627999999987888988653200111100123334533002221000001333 Q gi|254780277|r 639 FPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTGRLASLEP 718 (976) Q Consensus 639 Lp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~P 718 (976) ||+.||||||.||||++||++| +..||+++.||+||+++||+|||+|+|+++|++.||||||+|||++|+||||||++| T Consensus 595 Lp~~kKtktG~~ST~~~vLe~L-~~~h~i~~~iL~yR~l~KL~sTY~d~L~~~i~~~~gRiHt~f~Q~~t~TGRlSS~~P 673 (927) T PRK08786 595 LPAVVKTPKGQPSTNEEALEAI-ADQHELPRVILEYRSLAKLRSTYTDKLPEMIHPQSGRVHTSYHQAGAATGRLSSSDP 673 (927) T ss_pred CCCCCCCCCCCCCCCHHHHHHH-HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEECCCCHHHCCCCCEECCCCC T ss_conf 9878738899977749999987-635677699999988887787889988874076657662421631145452527993 Q ss_pred CCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHH Q ss_conf 43334556332222430242489948999632654436876651037778734069888999998980999688998888 Q gi|254780277|r 719 NLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMR 798 (976) Q Consensus 719 NlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R 798 (976) ||||||+||++||+||+||||++||+|++|||||||||||||+|+|+.|++||++|+|||+.||+.|||+|.++||++|| T Consensus 674 NLQNIPirt~~G~~iR~aFi~~~g~~l~saDYSQIELRilAh~s~d~~li~af~~~~DiH~~tA~~if~~~~~~v~~~~R 753 (927) T PRK08786 674 NLQNIPIRTEDGRRIRRAFVAPAGRKLIACDYSQIELRIMAHLSGDPGLVGAFESGADVHRATAAEVFGRTIDTVSGDER 753 (927) T ss_pred CCCCCCCCCCCCCCHHHHEECCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHCCCCCCCHHHHHHHHCCCHHHCCHHHH T ss_conf 41148877633302266602589887997236488999999982886477775478673032689882999576697788 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHH Q ss_conf 63025135644077703567881999899999999999988889999999999999849888277876018987588378 Q gi|254780277|r 799 RHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPKSS 878 (976) Q Consensus 799 ~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~~~ 878 (976) +.||+|||||+||||+||||++||||++||++||++||++||+|+.||++++++|+++|||+|++|||||||+|+|.|++ T Consensus 754 ~~aK~infgiiYG~s~~gL~~~l~i~~~eA~~~i~~yf~~y~~v~~~~~~~~~~a~~~gyv~Tl~gRrr~~p~i~s~n~~ 833 (927) T PRK08786 754 RAAKAINFGLMYGMSAFGLARQLGIGRGEAQDYIALYFSRYPGVRDFMETTRQQARDKGYVETVFGRRLYLDFINAGSQG 833 (927) T ss_pred HHHHEEEEEHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCCHH T ss_conf 55152102003337877799870999999999999999969109999999999998779178888860328866788877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 88778787701345653899999999999999986599818999726389995598799999999999965101357444 Q gi|254780277|r 879 IRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKACLPKINL 958 (976) Q Consensus 879 ~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~~~~~~l 958 (976) +|++|||+|+|||||||||||||+|||+++++++..+++++||||||||||||||++++++++++|+++||+|. +| T Consensus 834 ~r~~aeR~A~N~piQGsAADiiK~AMi~v~~~l~~~~~~~~mlLQVHDELifEv~~~e~~~~~~~v~~~Me~a~----~L 909 (927) T PRK08786 834 QRAGAERAAINAPMQGTAADIIKRAMVKVDGWIADHAQRALMILQVHDELVFEADEDFVDTLLSEVTARMAAAA----EL 909 (927) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEEECHHHHHHHHHHHHHHHHCCC----CC T ss_conf 88789999858344357999999999999999972687866998667256885168999999999999981744----57 Q ss_pred CCCEEEEEEEECCHHHHC Q ss_conf 464388833208937819 Q gi|254780277|r 959 RVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 959 ~VPl~v~~~~G~~W~e~~ 976 (976) +|||+|++++|+||+||| T Consensus 910 ~VPL~v~~~~G~nW~eah 927 (927) T PRK08786 910 RVPLVVDSGVGDNWDEAH 927 (927) T ss_pred CCCEEECCCCCCCHHHCC T ss_conf 787573038386967709 No 7 >PRK05755 DNA polymerase I; Provisional Probab=100.00 E-value=0 Score=2301.87 Aligned_cols=885 Identities=47% Similarity=0.780 Sum_probs=851.0 Q ss_pred CCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHH Q ss_conf 98897289996626999887426754468988550069999999999999848554798799997274887132408652 Q gi|254780277|r 1 MQKENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDY 80 (976) Q Consensus 1 m~~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~Y 80 (976) |.++++|+|||||+++||||||+|++. +++|.||||||||+|||++++++++|+ |++||||.+++||||++||+| T Consensus 1 m~~k~~l~liDg~~~~~Ra~~a~~~l~-~~~G~~tnai~Gf~~~l~~~~~~~~p~----~~~v~fD~~~~tfR~~~y~~Y 75 (889) T PRK05755 1 MIKKKTLLLVDGSSLAFRAFYALPPLT-NSDGLPTGAVYGFLNMLLKLLKDEKPT----HIAVAFDAKGKTFRHELYPEY 75 (889) T ss_pred CCCCCCEEEEECCHHHHHHHCCCCCCC-CCCCCCCCCHHHHHHHHHHHHHHCCCC----EEEEEECCCCCCCCCCCCHHH T ss_conf 998894899958589999980789887-999987252999999999999862998----899996398996352105564 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC Q ss_conf 16679879758864899999999709979956890166589999999984899499986870121223797599966788 Q gi|254780277|r 81 KANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE 160 (976) Q Consensus 81 Ka~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~ 160 (976) ||||++||++|.+|+|+|++++++||||+++.+||||||||||+|++++++|++|+|+|||||++|||+++|++|+|+++ T Consensus 76 Ka~R~~~P~~l~~Q~~~i~~~l~~~gi~~~~~~g~EADDii~tla~~~~~~~~~v~i~s~DkD~~QLv~~~v~~~~~~k~ 155 (889) T PRK05755 76 KANRPPMPEDLREQIPLLREALEALGIPLLEVEGYEADDVIGTLAKQAEAAGFEVLIVTGDKDLLQLVDDNVTVLNTMKN 155 (889) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHCCCCEEEEECCCC T ss_conf 27999999899999999999999879988646892399999999999997799599980898785618898699989999 Q ss_pred CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 35389999888489887877778751874358998776360249989841367899986454200112232200357899 Q gi|254780277|r 161 EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETA 240 (976) Q Consensus 161 ~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a 240 (976) +.+|++.|.|||||+|+||+||+||+||+|||||||||||||||++||++|||||+||+|+|+|+|+|++++|++++|+| T Consensus 156 ~~~~~~~v~ek~Gv~P~q~~D~~aL~GD~sDNIPGVpGIG~KtA~kLl~~ygsle~i~~~~d~i~g~k~~~~l~~~~e~a 235 (889) T PRK05755 156 ERIDPEGVIEKYGVTPEQIIDFLALMGDSSDNIPGVPGVGEKTAAKLLKEFGSLEGLYENLDEIKGKKLKEKLRENKEQA 235 (889) T ss_pred CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 28589999999796989999999980886457899886269999999986786899999898523256788898469999 Q ss_pred HHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 97655432035642247635521267676889986532336899987653114454433421112343322223332222 Q gi|254780277|r 241 RLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAIETEI 320 (976) Q Consensus 241 ~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 320 (976) ++||+|+||++|||++++++++++.++|.+.+.++|++|||++|++|+......... T Consensus 236 ~ls~~L~ti~~dvpl~~~le~l~~~~~d~~~l~~lf~elEFksLlkrl~~~~~~~~~----------------------- 292 (889) T PRK05755 236 FLSRKLATIKTDVPLDVDLEDLELKPPDREKLIALLKKLEFKSLLRRVEAGAAEAEA----------------------- 292 (889) T ss_pred HHHHHHHHHHCCCCCCCCHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHCCCCCCC----------------------- T ss_conf 999998565228989999999225899999999999982859999998521266677----------------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 22233332222233332112232112001121100001235556665035479999999999998668523233222433 Q gi|254780277|r 321 DRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIITDTMD 400 (976) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld 400 (976) .......|..|.+.+++..|++.+...+.+++|+||++++ T Consensus 293 ----------------------------------------~~~~~~~~~~I~~~~el~~ll~~l~~~~~va~dtEttsld 332 (889) T PRK05755 293 ----------------------------------------AVIDEIEYETILDEEELEAWLAKLKKAGLVAFDTETTSLD 332 (889) T ss_pred ----------------------------------------CCCCCCCCEEECCHHHHHHHHHHHHCCCEEEEEEECCCCC T ss_conf ----------------------------------------8777788566189999999999865288499998417887 Q ss_pred CCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCC Q ss_conf 11001244201025776654434550231374433345650017889999744442022210011003212378898618 Q gi|254780277|r 401 AFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYG 480 (976) Q Consensus 401 ~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~G 480 (976) ++.++++|+|+|+.++. ++|+|+.+. .+....++++.|++||+++++.|||||+|||+++|.++| T Consensus 333 ~~~a~lVGis~s~~~g~--------a~YIPl~~~-------~~~~~~~vl~~Lk~lLed~~i~KIghNlK~dl~vL~~~G 397 (889) T PRK05755 333 PMSANLVGLSFAVEEGE--------AAYIPLAHD-------DQLDRDDALAALKPLLEDPAIPKVGQNLKYDRHILANYG 397 (889) T ss_pred CCCCCEEEEEEECCCCC--------EEEEECCCC-------CCCCHHHHHHHHHHHHCCCCCCEEEECHHHHHHHHHHCC T ss_conf 56576259999827996--------899844431-------336889999999999709887243203189999999779 Q ss_pred CCCCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCC---CCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 86664-21002345543168766772466776224442---233565432221000000123333222111112456776 Q gi|254780277|r 481 ISMRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTR---KEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 481 I~l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~---~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) |.+.+ +||||||||||||+.++|+|++|+.+||++.. +++.|++ ++.+|+.++.+.+.+|||+||++|++||..| T Consensus 398 I~l~g~~fDTmLAaYLLdP~~~~~~Ld~La~~yL~~~~~~~eel~gkg-~~~~~~~v~~e~~~~YaaedA~~~~~L~~~L 476 (889) T PRK05755 398 IELRGIAFDTMLASYLLDPGAGRHDLDSLAERYLGHKTISFEEVAGKG-NQLTFAQVALEEAAEYAAEDADVTLRLHQVL 476 (889) T ss_pred EECCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCHHHHCCCC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 235776301999998747887767789999986323442034313656-5566100787899999999886899999999 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHH Q ss_conf 32221000134578860750123322101431005899998643436899999999974034557777656667998742 Q gi|254780277|r 557 RPKLIVEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTK 636 (976) Q Consensus 557 ~~~L~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~ 636 (976) +++|++.+++++|++|||||++||++||.+||.||.+.|++++.+++.++++++++||++||++|||||||||++|||++ T Consensus 477 ~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~VD~~~L~~ls~el~~~l~~le~~I~~~aG~eFNinSPKQL~~iLFek 556 (889) T PRK05755 477 KPELEEEGLLELYEEIELPLVPVLARMERNGIKVDREYLEELSAELAQRLAELEQEIYELAGEEFNLNSPKQLGEILFEK 556 (889) T ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 99998604788999987519999999996086866999999999999999999999998749903779989999999985 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 07887555665453531788776530476279999999878889886532001111001233345330022210000013 Q gi|254780277|r 637 LKFPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTGRLASL 716 (976) Q Consensus 637 LgLp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~ 716 (976) ||||++|||||| ||||++|||+|. ..||+++.||+||+++||+|||+|+|+++|++.||||||+|||++|+||||||+ T Consensus 557 L~Lp~~KKTKtG-ySTd~~vLe~L~-~~hpi~~~iLeyR~l~KL~sTYvd~L~~~i~~~tgRIHt~f~Q~~t~TGRLSS~ 634 (889) T PRK05755 557 LGLPVTKKTKTG-YSTDAEVLEKLA-DDHPIPKKILEYRQLSKLKSTYTDGLPKMINPDTGRVHTSFNQAVTATGRLSSS 634 (889) T ss_pred CCCCCCCCCCCC-CCCCHHHHHHHH-HCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCHHHHHCCCCCCCCC T ss_conf 399978878898-878599999987-507699999999778788888988887524788984611001110320222479 Q ss_pred CCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHH Q ss_conf 33433345563322224302424899489996326544368766510377787340698889999989809996889988 Q gi|254780277|r 717 EPNLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQ 796 (976) Q Consensus 717 ~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~ 796 (976) +|||||||+|+++||+||+||+|+|||+|++|||||||||||||+|+|+.|++||++|+|||+.||++|||+|.++||++ T Consensus 635 ~PNLQNIPir~e~Gr~IR~aFva~~g~~l~saDYSQIELRilAh~S~D~~li~af~~g~DiH~~tA~~vf~v~~~~vt~~ 714 (889) T PRK05755 635 DPNLQNIPIRTEEGRRIRQAFVAPEGYKLLSADYSQIELRILAHLSGDEGLIEAFAEGEDIHTATASEVFGVPLEEVTSE 714 (889) T ss_pred CCCCCCCCCCCHHHHHHHHEEEECCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHH T ss_conf 98534687763230445522785899889995120899999999808999999985599868999999939990018999 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCC Q ss_conf 88630251356440777035678819998999999999999888899999999999998498882778760189875883 Q gi|254780277|r 797 MRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPK 876 (976) Q Consensus 797 ~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~ 876 (976) ||+.||+|||||+||||+||||++||||++||++||++||++||+|+.||++++++|+++|||+|++|||||||+|+|.| T Consensus 715 ~R~~AK~vnfGiiYG~s~~gLs~~l~is~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~Gyv~Tl~gRrr~~p~i~s~n 794 (889) T PRK05755 715 QRRRAKAINFGLIYGMSAFGLARQLGISRKEAKEYIDRYFERYPGVKEYMEETVAQAKEKGYVETLFGRRRYLPDINSSN 794 (889) T ss_pred HHHHHHHEEEHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCEECCCCCEEECCCCCCCC T ss_conf 98662613221444668778998819999999999999999681099999999999997797778778705278677899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 78887787877013456538999999999999999865998189997263899955987999999999999651013574 Q gi|254780277|r 877 SSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKACLPKI 956 (976) Q Consensus 877 ~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~~~~~ 956 (976) .+.|++|||+|+|||||||||||||+||++++++++ .+++++||||||||||||||++++++++++|+++||+|. T Consensus 795 ~~~r~~~eR~AiN~pIQGsAADiiK~AMi~v~~~l~-~~~~~~~llQVHDEli~Ev~~~~~e~~~~~v~~~Me~a~---- 869 (889) T PRK05755 795 GARRAFAERAAINAPIQGSAADIIKIAMIRVDDALK-EGLKARMLLQVHDELVFEVPEDELEEVKKLVKEVMENAV---- 869 (889) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH-HCCCCEEEEEEEEEEEEECCHHHHHHHHHHHHHHHHCCE---- T ss_conf 889989999996756555699999999999999998-477873899865577784278899999999999971850---- Q ss_pred CCCCCEEEEEEEECCHHHHC Q ss_conf 44464388833208937819 Q gi|254780277|r 957 NLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 957 ~l~VPl~v~~~~G~~W~e~~ 976 (976) +|+|||.|++++|+||+||. T Consensus 870 ~l~VPl~v~~~~g~nW~eaK 889 (889) T PRK05755 870 ELSVPLVVDVGVGDNWDEAK 889 (889) T ss_pred ECCCCEEEECCCCCCHHHCC T ss_conf 07887586238168967729 No 8 >PRK08928 consensus Probab=100.00 E-value=0 Score=2293.50 Aligned_cols=858 Identities=43% Similarity=0.685 Sum_probs=814.8 Q ss_pred CCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHH Q ss_conf 98897289996626999887426754468988550069999999999999848554798799997274887132408652 Q gi|254780277|r 1 MQKENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDY 80 (976) Q Consensus 1 m~~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~Y 80 (976) |+++++|+|||||+++||||||+|++. |++|+||||||||+|||++++++++|+ |++||||.+++|||||+||+| T Consensus 1 m~~k~~l~liDg~sl~fRAf~Al~~l~-t~~G~pTnAi~GF~~mL~~~~~~~~P~----~~~v~fD~~~~TFR~e~~~~Y 75 (861) T PRK08928 1 MTKKNTFLLIDGYGFLFRAYYALQPLT-TSKGEPIGGVYGFINMLLKLLSDFKPD----YLAVVFDSGGKNFRHDIYPEY 75 (861) T ss_pred CCCCCCEEEEECCHHHHHHHCCCCCCC-CCCCCEECHHHHHHHHHHHHHHHCCCC----EEEEEEECCCCCCCHHHHHHH T ss_conf 999994899967489999980799887-999988244999999999999863998----899999795996224332565 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC Q ss_conf 16679879758864899999999709979956890166589999999984899499986870121223797599966788 Q gi|254780277|r 81 KANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE 160 (976) Q Consensus 81 Ka~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~ 160 (976) ||||++|||||.+|||+|++++++||||+++.+||||||||||+|++++++|++|+|+|||||++|||+++|+||+|+++ T Consensus 76 Ka~R~~~pedl~~Qip~i~~~~~~~gi~~~~~~gyEADDvIgTla~~~~~~g~~v~IvSgDKDl~QLV~~~v~v~~~~k~ 155 (861) T PRK08928 76 KANRPPPPEDLISQFPLLREAVSALNIPIEEKNGYEADDIIATLATKYASLGDEVVIISSDKDLLQLMNENIKIYDPIKN 155 (861) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCCCEEEEECCCC T ss_conf 38999998889999999999999879998544893589999999999997799099984899750318898599988888 Q ss_pred CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 35389999888489887877778751874358998776360249989841367899986454200112232200357899 Q gi|254780277|r 161 EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETA 240 (976) Q Consensus 161 ~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a 240 (976) ..+|++.|.|||||+|+||+||+||+||+|||||||||||||||++||++|||||+||+|+|+|+++|++++|++++|+| T Consensus 156 ~~~~~~~V~ek~GV~P~qiiD~laL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~Iy~n~d~Ik~~K~re~L~e~ke~a 235 (861) T PRK08928 156 KYITEDDVVEKFGVTSDKLLDVMALTGDRSDNIPGVPSIGPKTAAKLITQFGSLENILNSLDQISSNKQRETLQNSREQA 235 (861) T ss_pred CCCCHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 17689999999797989999999980876468899988562899999996797999999886113024678888558889 Q ss_pred HHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 97655432035642247635521267676889986532336899987653114454433421112343322223332222 Q gi|254780277|r 241 RLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAIETEI 320 (976) Q Consensus 241 ~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 320 (976) ++||+||||++|||+++++++++++++|.++|.++|++|||++|++|+...+......... T Consensus 236 ~lSkeLvTI~~Dvpl~~~l~~l~~~~~d~~~L~~lf~elEF~sLlkrl~~~~~~~~~~~~~------------------- 296 (861) T PRK08928 236 LISRQLIGLCSNVDLDFDLNKLEWSPPNSEKLTGFLKKYEFKSLIKKVEKLFKIKINDHKE------------------- 296 (861) T ss_pred HHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCC------------------- T ss_conf 9989986435478876467873448999999999999826789999987650577677544------------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 22233332222233332112232112001121100001235556665035479999999999998668523233222433 Q gi|254780277|r 321 DRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIITDTMD 400 (976) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld 400 (976) ........+..+.+.+++..|++.+...+.+++++|++ T Consensus 297 ---------------------------------------~~~~~~~~~~~i~~~~~l~~~~~~~~~~~~~a~~le~~--- 334 (861) T PRK08928 297 ---------------------------------------IVDNKVTEIKEIYNAEELEDFAEKCEKEGIFAIYLLQE--- 334 (861) T ss_pred ---------------------------------------CCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEEECC--- T ss_conf ---------------------------------------44554432111589899999999876579689992057--- Q ss_pred CCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCC Q ss_conf 11001244201025776654434550231374433345650017889999744442022210011003212378898618 Q gi|254780277|r 401 AFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYG 480 (976) Q Consensus 401 ~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~G 480 (976) .++++|+++|+.++. .+|++..+. ...+...++++|+++.+.||+||+|||+++|.++| T Consensus 335 --~~~~vgis~s~~~~~--------~~~i~~~~~-----------~~~~~~~l~~lLed~~i~KIghNlK~dl~vL~~~g 393 (861) T PRK08928 335 --KGTNNAISLSLQNQS--------YIIIISNEI-----------TDAFSDIIFNLLTDKSILKITYDLKPLLKFYANQS 393 (861) T ss_pred --CCEEEEEEEEECCCE--------EEEECCCCC-----------HHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHCC T ss_conf --873899999955984--------999557542-----------47899999998439997544545799999998569 Q ss_pred CCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 86664210023455431687667724667762244422335654322210000001233332221111124567763222 Q gi|254780277|r 481 ISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPKL 560 (976) Q Consensus 481 I~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L 560 (976) +.+.+++|||||+|||||+.++|+|++++.+|++. ..|+++++..+++||+.|+++| T Consensus 394 i~l~~i~DTmLasYLLnp~~~~h~Ld~La~~~~~~-----------------------~~~aa~~a~~~~~L~~~L~~~L 450 (861) T PRK08928 394 HDITAIDDLMLMSYSLSAGLHDHSLFNIILKNTNI-----------------------INESAKIVCTLISLHKQLILEL 450 (861) T ss_pred CCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCC-----------------------CCHHHHHHHHHHHHHHHHHHHH T ss_conf 67576371889998649998768989999762220-----------------------0147999999999999999999 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 10001345788607501233221014310058999986434368999999999740345577776566679987420788 Q gi|254780277|r 561 IVEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLKFP 640 (976) Q Consensus 561 ~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~LgLp 640 (976) ++.+++++|++|||||++||++||.+||+||.+.|++++.++..++++++++||++||++||||||||||+|||++|||| T Consensus 451 ~~~~L~~Ly~eIE~PLi~VLa~ME~~GI~VD~~~L~~ls~el~~~l~~Le~~I~~lAG~eFNInSPKQLgeILFekL~Lp 530 (861) T PRK08928 451 FDNKLFTIYYRIDLPLCFVLDKMEKIGIKVDANYLKQLSAEFGTEILKLEEEIYKLSGTKFNIASPKQLGEILFEKMGLP 530 (861) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCC T ss_conf 88668999999887799999999981960169999999999999999999999997498747599999999979972998 Q ss_pred CCCCC-CCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCC Q ss_conf 75556-65453531788776530476279999999878889886532001111001233345330022210000013334 Q gi|254780277|r 641 SGAKT-KTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTGRLASLEPN 719 (976) Q Consensus 641 ~~kKT-ktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~PN 719 (976) ++||| |||+||||++|||+|..+.||+++.||+||+++||+|||+|+|+++|++.||||||||||++|+||||||++|| T Consensus 531 ~~KKt~KtG~ySTda~vLe~La~~~h~i~~~ILeyR~l~KLkSTYvd~L~~~i~~~tgRIHTsfnQt~taTGRLSS~~PN 610 (861) T PRK08928 531 KGKKSAKSGSYSTDAEVLEKLSEHGVEIADLLLKWRHLTKLKNTYTDSLPKQIDNNTGRVHTTFSQTSTTTGRLSSSNPN 610 (861) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCHHHHHHHCCEECCCCCC T ss_conf 67767788996854999998644179148999999899887517888888874279985725210454311400036887 Q ss_pred CCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHH Q ss_conf 33345563322224302424899489996326544368766510377787340698889999989809996889988886 Q gi|254780277|r 720 LQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRR 799 (976) Q Consensus 720 lQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~ 799 (976) |||||+||++||+||+||||++||+|++|||||||||||||+|+|+.|++||++|+|||+.||+.|||+|.++||++||+ T Consensus 611 LQNIPiRte~Gr~iR~aFva~~g~~l~saDYSQIELRilAhlS~D~~Li~AF~~~~DIH~~TA~~vf~v~~~~vt~~~Rr 690 (861) T PRK08928 611 LQNIPIRSEEGNKIRQAFIAEKGYKLISADYSQIELRILAHIANVDAFKQAFINGDDIHTQTACQIFNLQKHELDSELRR 690 (861) T ss_pred CCCCCCCCCCHHHHHEEECCCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHCCCCCHHHHHHHHHHCCCHHHCCHHHHH T ss_conf 35677776202141240346999889985377889999999817889999860688666888888738982118988999 Q ss_pred HHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHHH Q ss_conf 30251356440777035678819998999999999999888899999999999998498882778760189875883788 Q gi|254780277|r 800 HAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPKSSI 879 (976) Q Consensus 800 ~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~~~~ 879 (976) .||+|||||+|||||||||++||||++||++||++||++||+|+.||++++++|+++|||+|++|||||||+|+|+|++. T Consensus 691 ~AK~vNFGiiYG~s~fgLs~~l~i~~~eA~~~i~~Yf~~yp~v~~~~~~~~~~a~~~Gyv~Tl~gRrr~~p~i~s~n~~~ 770 (861) T PRK08928 691 KAKSINFGIIYGISAFGLAKQLNITNSEASEYIKKYFAEYPEIQEYMEQTKAYARSNGYVTTFFGRKCFIPLINSKKHTL 770 (861) T ss_pred HHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHHH T ss_conf 99999999887568888998829899999999999999681499999999999997798768888603287555778778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 87787877013456538999999999999999865998189997263899955987999999999999651013574444 Q gi|254780277|r 880 RNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKACLPKINLR 959 (976) Q Consensus 880 r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~~~~~~l~ 959 (976) |++|||+|+|||||||||||||+||++++++++ +++||||||||||||||++++++++++|+++||+|. +|+ T Consensus 771 r~~~eR~A~N~piQGtAADiiK~AMi~~~~~l~----~~~mllQVHDEli~E~~~~~~~~~~~~~~~~Me~a~----~l~ 842 (861) T PRK08928 771 RQFAERAAINAPIQGTAADIIKIAMINLDDELE----KGRLILQVHDELLFEVPEDEVDTVAKLIKEIMENST----NMS 842 (861) T ss_pred HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH----HCEEEEEEEEEEEEEECHHHHHHHHHHHHHHHCCCC----CCC T ss_conf 858999997512128899999999999999862----471687454467896468899999999999962874----377 Q ss_pred CCEEEEEEEECCHHHHC Q ss_conf 64388833208937819 Q gi|254780277|r 960 VPLKVNIKVSNNWQEEN 976 (976) Q Consensus 960 VPl~v~~~~G~~W~e~~ 976 (976) |||+|++++|+||+||| T Consensus 843 Vpl~v~~~~g~nW~e~h 859 (861) T PRK08928 843 VPLKVEIRIGDNWGEIH 859 (861) T ss_pred CCEEEECCCCCCHHHCC T ss_conf 87684107589978738 No 9 >PRK07898 consensus Probab=100.00 E-value=0 Score=2259.85 Aligned_cols=881 Identities=33% Similarity=0.549 Sum_probs=827.5 Q ss_pred CCCCEEEEEECCHHHHHHHHCCCC-CCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHH Q ss_conf 889728999662699988742675-4468988550069999999999999848554798799997274887132408652 Q gi|254780277|r 2 QKENHLFLVDGSSFIYRAFYATPL-LSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDY 80 (976) Q Consensus 2 ~~~~~~~liDg~~~~~ra~~a~p~-~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~Y 80 (976) +++++|+|||||+++||||||+|+ .+.+++|+||||||||+|||+|++++++|+ |++||||.+++|||||+||+| T Consensus 12 ~~k~~l~liDg~sl~fRAy~Al~~~~l~~~~G~pTnAi~GF~~ml~kli~~~~P~----~iavaFD~~~~TFRhe~y~~Y 87 (902) T PRK07898 12 TEKPTLMLLDGHSLAFRAFYALPAENFSTSGGQTTNAVYGFTAMLINLLRDEQPT----HVAVAFDVSRQTFRTEEYPEY 87 (902) T ss_pred CCCCEEEEEECCHHHHHHHCCCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCC----EEEEEEECCCCCCCHHHHHHH T ss_conf 8898089995848999997169976683999987025999999999999873998----899999795996031013675 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC Q ss_conf 16679879758864899999999709979956890166589999999984899499986870121223797599966788 Q gi|254780277|r 81 KANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE 160 (976) Q Consensus 81 Ka~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~ 160 (976) ||||+||||||.+|||+|++++++||||+++.+||||||||||+|++++++|++|+|||||||++|||+++|+||+|+++ T Consensus 88 KA~R~~~PedL~~Qip~i~e~l~a~gIp~le~~GyEADDiIgTla~~a~~~g~~v~IvSgDKDl~QLV~d~v~vl~~~kg 167 (902) T PRK07898 88 KATRSATPDEFKGQVDLIKEVLGALGITVLAKPGFEADDIIATLATQAEAEGYRVLVVTGDRDALQLVSDDVTVLYPRKG 167 (902) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCCCEEEEECCCC T ss_conf 38999998899998999999999879988563896589999999999997799699976899775517898799977899 Q ss_pred ----CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf ----3538999988848988787777875187435899877636024998984136789998645420011223220035 Q gi|254780277|r 161 ----EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEY 236 (976) Q Consensus 161 ----~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~ 236 (976) +.||++.|.|||||+|+||+||+||+||+|||||||||||||||+|||++|||||+||+|+|+|+| |++++|+++ T Consensus 168 ~~~~~~~d~~~V~ek~GV~P~qviD~lAL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~I~~n~d~Ikg-K~~e~L~~~ 246 (902) T PRK07898 168 VSDLTRFTPEAVEEKYGLTPAQYPDFAALRGDPSDNLPGIPGVGEKTAAKWIAQYGSLDGLVDHADEIKG-KVGDALRAN 246 (902) T ss_pred CCCCEECCHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHH-HHHHHHHHH T ss_conf 8635442899999986979899999999818864579999984478899999976778999999875215-778899965 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 78999765543203564224763552126767688998653233689998765311445443342111234332222333 Q gi|254780277|r 237 AETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAI 316 (976) Q Consensus 237 ~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~ 316 (976) +|+|++||+||||++|||++++++++++.++|.++|.++|++|||++|++|+............ T Consensus 247 ~d~a~LSk~LvTI~~Dvpl~~~le~l~~~~~d~~~L~~lf~eLEF~sL~~rl~~~~~~~~~~~~---------------- 310 (902) T PRK07898 247 LESVVRNRELTELVRDVPLPVTPDDLRLQPWDRDAVHRLFDDLEFRVLRDRLFATLAAAGEPEV---------------- 310 (902) T ss_pred HHHHHHHHHHHHHHHCCCCCCCHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCC---------------- T ss_conf 9999998998688754888878887124899999999999984667899999864177788776---------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHH-HHHCCCCCCCCC Q ss_conf 2222222333322222333321122321120011211000012355566650354799999999999-986685232332 Q gi|254780277|r 317 ETEIDRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQK-LETIGSASFKII 395 (976) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~-L~~~~~ia~DtE 395 (976) .........+.+.+++..|+.. +.....++++++ T Consensus 311 ---------------------------------------------~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~a~~~~ 345 (902) T PRK07898 311 ---------------------------------------------DEGFEVRGGVLAPGELAQWLAEHAGDGRRLGLAVV 345 (902) T ss_pred ---------------------------------------------CCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEE T ss_conf ---------------------------------------------55666554348989999999997435986999985 Q ss_pred CCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHH Q ss_conf 22433110012442010257766544345502313744333456500178899997444420222100110032123788 Q gi|254780277|r 396 TDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLV 475 (976) Q Consensus 396 Tt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~ 475 (976) ++++ ++.++++|+++++.++. ++|+|..+.. .+....|++||+++.+.|||||+|||+++ T Consensus 346 ~t~~-~~~a~lvgisl~~~~g~--------a~yip~~~~~-----------~~~~~~L~~~Led~~i~KIghNlK~d~~v 405 (902) T PRK07898 346 GTHL-PGDGDATALAIAAADGH--------AAYIDTADLT-----------PDDEAALAAWLADPARPKALHEAKGAMHA 405 (902) T ss_pred CCCC-CCCCCEEEEEEEEECCE--------EEEECCCCCC-----------HHHHHHHHHHHHCCCCCEEEECHHHHHHH T ss_conf 3678-66573899999953880--------8997220179-----------78999999986087765455404899999 Q ss_pred HHHCCCCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 98618866642-10023455431687667724667762244422335654322210000001233332221111124567 Q gi|254780277|r 476 LHRYGISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWL 554 (976) Q Consensus 476 L~~~GI~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~ 554 (976) |.++||.+.++ ||||||||||||+.++|+|++|+.+||++.... .+.++++.+|..++.+.+..|+|+||++|++||. T Consensus 406 L~~~Gi~l~g~~fDTmLAaYLLdp~~~~~~Ld~La~~yL~~~~~~-~~~~~~~~~~~~~~~~~~~~yaa~dA~~~~~L~~ 484 (902) T PRK07898 406 LAGRGWTLAGVTSDTALAAYLVRPGQRSFTLDDLSLRYLRRELRA-EAPAQQQLSLDDAVDAAAAQTLILRARAVLDLAD 484 (902) T ss_pred HHHCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCC-CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 996698657753228999987568878787999999974976651-5653543471326768999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHH Q ss_conf 76322210001345788607501233221014310058999986434368999999999740345577776566679987 Q gi|254780277|r 555 LLRPKLIVEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILF 634 (976) Q Consensus 555 ~l~~~L~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf 634 (976) .|+++|++.+++++|++|||||++||++||.+||+||.++|++++.++..++++++++||++||++|||||||||++||| T Consensus 485 ~l~~~L~~~~l~~L~~~IE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~~le~~I~~~aG~eFNinSpKQL~eiLF 564 (902) T PRK07898 485 ALDAELARIDSAALLRDMELPLQRVLAEMERAGIAVDLAHLTELQSQFAGQVRDAAEAAYAVIGKQINLGSPKQLQVVLF 564 (902) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH T ss_conf 99998616558999999760589999999850819819999989999999999999999998189568999899999989 Q ss_pred HHCCCCCCCCCCCCCCCCCHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECC Q ss_conf 4207887555665453531788776530-476279999999878889886532001111001233345330022210000 Q gi|254780277|r 635 TKLKFPSGAKTKTGQWKTTAQDLEQINC-GDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTGRL 713 (976) Q Consensus 635 ~~LgLp~~kKTktG~~ST~~~~Le~L~~-~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRl 713 (976) ++||||++|||||| ||||++||++|.. ..||+++.||+||+++||+|| +++|+++|++ ||||||+|||++|+|||| T Consensus 565 dkL~Lp~~KKtKtG-ySTd~~vLe~L~~~~~h~i~~~iLeyR~l~KL~sT-~d~L~~~i~~-~gRIHtsfnQ~~T~TGRL 641 (902) T PRK07898 565 DELGMPKTKRTKTG-YTTDADALQSLADKTGHPFLQHLLAHRDATKLKVT-VDGLIKAVAD-DGRIHTTFNQTVAATGRL 641 (902) T ss_pred HHCCCCCCCCCCCC-CCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH-HHHHHHHHCC-CCCCCCHHHHHHCCCCEE T ss_conf 97099988878899-87889999999855799269999999999899989-9888987265-772301265421211200 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEEECCCCC-EEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHH Q ss_conf 013334333455633222243024248994-8999632654436876651037778734069888999998980999688 Q gi|254780277|r 714 ASLEPNLQNIPIKTDLGQKIRRAFISPPTK-KLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEK 792 (976) Q Consensus 714 Ss~~PNlQNiP~r~~~g~~iR~~F~a~~g~-~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~ 792 (976) ||++|||||||+|+++||+||+||+|++|| +|++|||||||||||||+|+|+.|++||++|+|||+.||+.|||+|.++ T Consensus 642 SS~~PNLQNIPir~~~G~~iR~aFv~~~g~~~~~saDYSQiELRilAh~s~d~~l~~af~~~~DiH~~tA~~if~v~~~~ 721 (902) T PRK07898 642 SSTDPNLQNIPIRTEAGRRIRDAFVVGEGYETLLTADYSQIEMRIMAHLSGDEGLIEAFNSGEDLHSFVASRVFGVPIDE 721 (902) T ss_pred CCCCCCCCCCCCCCCCHHHHHHEEECCCCCCEEEECCHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHH T ss_conf 25899766777766101556760435899846774041367999999980889999998449988899999984999254 Q ss_pred CCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCC Q ss_conf 99888863025135644077703567881999899999999999988889999999999999849888277876018987 Q gi|254780277|r 793 VSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEI 872 (976) Q Consensus 793 vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i 872 (976) ||+++|++||+|||||+||||+||||++||||++||++||++||++||+|+.||++++++|+++|||+|++|||||||+| T Consensus 722 vt~~~R~~AK~vNfgiiYG~~~~gLs~~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~gyv~Tl~gRrr~~p~i 801 (902) T PRK07898 722 VTPELRRRVKAMSYGLAYGLSAYGLSQQLKISTEEAKELMDAYFARFGGVRDYLREVVEQARKDGYTETLFGRRRYLPDL 801 (902) T ss_pred CCHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCC T ss_conf 89999989888774776675888999872989999999999999869069999999999999779067878771138877 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCC Q ss_conf 58837888778787701345653899999999999999986599818999726389995598799999999999965101 Q gi|254780277|r 873 NSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKAC 952 (976) Q Consensus 873 ~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~ 952 (976) +|.|++.|++|||+|||||||||||||||+||++++++++..+++++||||||||||||||++++++++++|+++||+|. T Consensus 802 ~s~n~~~r~~~eR~A~N~piQGtaADiiK~AMi~i~~~l~~~~l~~~~lLQvHDELi~ev~~~e~e~~~~~v~~~Me~~~ 881 (902) T PRK07898 802 TSDNRQRREAAERAALNAPIQGSAADIIKVAMIRVDRALREAGLKSRMLLQVHDELVLEVAPGEREQVEALVRDEMGSAY 881 (902) T ss_pred CCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCC T ss_conf 78897789899999828666606999999999999999984699873899864178894078899999999999972863 Q ss_pred CCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 357444464388833208937819 Q gi|254780277|r 953 LPKINLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 953 ~~~~~l~VPl~v~~~~G~~W~e~~ 976 (976) +|+|||+|++++|+||+||- T Consensus 882 ----~L~VPL~v~~~~G~nW~eA~ 901 (902) T PRK07898 882 ----DLRVPLEVSVGYGRSWDAAA 901 (902) T ss_pred ----CCCCCEEEECCCCCCHHHHC T ss_conf ----17787572007386978834 No 10 >PRK07456 consensus Probab=100.00 E-value=0 Score=2256.87 Aligned_cols=916 Identities=36% Similarity=0.555 Sum_probs=826.2 Q ss_pred CCCCEEEEEECCHHHHHHHHCCCC-----CCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHC Q ss_conf 889728999662699988742675-----446898855006999999999999984855479879999727488713240 Q gi|254780277|r 2 QKENHLFLVDGSSFIYRAFYATPL-----LSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEI 76 (976) Q Consensus 2 ~~~~~~~liDg~~~~~ra~~a~p~-----~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~ 76 (976) +++++|+|||||+++||||||+|. + ++++|+||||||||+|||+++|++++|+ |++||||.++||||||+ T Consensus 5 ~~k~~l~liDg~~~~fRay~A~~~~~~~pl-~~~~G~~tnAv~GF~~~l~kli~~~~P~----~iavaFD~~~~TFR~e~ 79 (975) T PRK07456 5 SKKPLLLLVDGHSLAFRSFYAFSKGGEGGL-RTKTGIPTSVTYGFLKSLLDNCKTQKPQ----GVAIAFDTAEPTFRHEA 79 (975) T ss_pred CCCCEEEEEECCHHHHHHHHCCCCCCCCCC-CCCCCCEECHHHHHHHHHHHHHHHCCCC----EEEEEEECCCCCCCHHH T ss_conf 789908999555789999807786788886-4899987214999999999999863998----89999828899525423 Q ss_pred CHHHHCCCCCCCHHHHHHHHHHHHHHH-HCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCE--E Q ss_conf 865216679879758864899999999-7099799568901665899999999848994999868701212237975--9 Q gi|254780277|r 77 YPDYKANRPKIPEMLLPQLPLVRLATQ-AFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTT--C 153 (976) Q Consensus 77 ~~~YKa~R~~~p~~l~~q~~~i~~~l~-~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v--~ 153 (976) ||+|||||++|||||.+|||+|+++|+ +||||+++.+||||||||||+|++++++|++|+|||||||++|||+|+| . T Consensus 80 y~~YKA~R~~~PedL~~Qip~i~e~l~~algIp~~e~~GyEADDiIaTla~~a~~~g~~v~IvSgDKDl~QLV~d~~~v~ 159 (975) T PRK07456 80 DPNYKANRDVAPDDFFQDLEQLQEILEEALNLPICTAPGYEADDVLGTLANRAADQGWRVRILSGDRDLFQLVDDERDIA 159 (975) T ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCCCCCEE T ss_conf 13640799999888999999999999983699874178801999999999999977990899788824886087876669 Q ss_pred EEE----CCCC----CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 996----6788----35389999888489887877778751874358998776360249989841367899986454200 Q gi|254780277|r 154 LYD----TVKE----EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIK 225 (976) Q Consensus 154 ~~~----~~~~----~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~ 225 (976) +++ |.++ ..||+++|.|||||+|+||+||+||+||+|||||||||||||||++||++|||||+||+|+|+++ T Consensus 160 vl~~~~~p~~~~~~~~~~d~~~V~ek~GV~P~qiiD~lAL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~Iyen~d~i~ 239 (975) T PRK07456 160 VLYMGGGPYAKSGGPTLINEAGVKEKLGVAPEQVVDLKALTGDSSDNIPGVKGVGPKTAINLLKENGDLDGIYKALDEIE 239 (975) T ss_pred EEECCCCCCCCCCCCEEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHC T ss_conf 98216875446677656269999998695989989999972886568899998559999999998499999998287620 Q ss_pred H---------HHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 1---------1223220035789997655432035642247635521267676889986532336899987653114454 Q gi|254780277|r 226 Q---------KKRRENILEYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDA 296 (976) Q Consensus 226 ~---------~k~~~~l~~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~ 296 (976) + +|++++|++++|+|++||+||||++||||+++ +++++.++|.+.+.++|++|||++|++++.......+ T Consensus 240 ~~k~~~~~IKgkl~e~L~~~~d~A~LSkeLvTI~~Dvpl~~~-~dl~~~~~d~~~L~~lf~elEF~sLl~rl~~~~~~~~ 318 (975) T PRK07456 240 GEKAYRGAIKGALKKKLKNDKDNAYLSRFLAEILVDVPLPLD-ESLELTGFDQELLSPLLEKLELNSLLRQVDIFQATFS 318 (975) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHEEECCCCCCC-HHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHC T ss_conf 011111234448899999979999988997650017988865-5542689999999999999554789999887776414 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHH Q ss_conf 43342111234332222333222222233332222233332112232112001121100001235556665035479999 Q gi|254780277|r 297 ANIEPVVLDINTTENKGIAIETEIDRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKD 376 (976) Q Consensus 297 ~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~ 376 (976) ..... .+....... ......... .................+.+|.+.++ T Consensus 319 ~~~~~----~~~~~~~~~-----------~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~I~~~~~ 367 (975) T PRK07456 319 GGGFG----ANATAAEPK-----------KGESDDKKF----------------SSAEELENSENTIPPLQPQIINTPEQ 367 (975) T ss_pred CCCCC----CCCCCCCCC-----------CCCCCCCCC----------------CCCCCCCCCCCCCCCCCCEEECCHHH T ss_conf 66656----444323445-----------544443223----------------32111101112356544278589999 Q ss_pred HHHHHHHHHH----CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 9999999986----685232332224331100124420102577665443455023137443334565001788999974 Q gi|254780277|r 377 IQQWVQKLET----IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSY 452 (976) Q Consensus 377 L~~li~~L~~----~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~ 452 (976) +..|++.+.. ...+|+|+||+++||+.+++||+|+||..... .++|+|+.|.. ...+...+++++. T Consensus 368 l~~lv~~L~~~~~~~~~vA~DtETtsLdp~~A~lVGis~s~~~~~~------~a~YIPl~h~~----~~~~l~~~~vl~~ 437 (975) T PRK07456 368 LQALVQRLEKCTDPEAPVAWDTETTSLNPRDAELVGIGCCWGEELD------DLAYIPLGHQQ----AGEQLPLETVLEA 437 (975) T ss_pred HHHHHHHHHHCCCCCCEEEEECCCCCCCHHHCCEEEEEEECCCCCC------CEEEEECCCCC----CHHHCCHHHHHHH T ss_conf 9999999983635476589980468877423210266641155787------26997734556----1310579999999 Q ss_pred HHHHHCCCCCHHHHCCCCHHHHHHHHCCCCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCCC---CCCCCCCCCCC Q ss_conf 4442022210011003212378898618866642-1002345543168766772466776224442---23356543222 Q gi|254780277|r 453 LKKFFENEHFLKIGHNIKYDKLVLHRYGISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYTR---KEILKSRKSSI 528 (976) Q Consensus 453 L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~~---~e~~~~~k~~~ 528 (976) |++||+|+.+.|||||+|||+++|.++||.+.++ ||||||+|||||+. +|+|++|+.+||++.. +++.++++ T Consensus 438 LkpiLed~~i~KighNlKyD~~vL~~~GI~l~g~~fDTmLAsYLLnP~~-~h~Ld~La~~yL~~~~i~~edl~gkgk--- 513 (975) T PRK07456 438 LRPWLESPEHPKALQNAKFDRLILLRHGIALKGVVFDTLLADYLLNPEA-KHNLDDLAQRELGFTPTSYKDLVGKGQ--- 513 (975) T ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCC-CCCHHHHHHHHHCCCCCCHHHHCCCCC--- T ss_conf 9998628875310211146799999769510675513999998768765-589899999971888774998628887--- Q ss_pred CCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 10000001233332221111124567763222100--0134578860750123322101431005899998643436899 Q gi|254780277|r 529 PIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVE--KLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNL 606 (976) Q Consensus 529 ~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~--~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l 606 (976) ++.+++.+.+.+|||+||++|++||+.|+++|++. +|+++|++|||||++||++||.+||+||.+.|++++.++..++ T Consensus 514 ~~~~v~~~~~~~ya~~dA~~~~~L~~~l~~~L~~~~~~L~~l~~~iE~PLi~VLa~ME~~GI~vD~~~L~~l~~~l~~~l 593 (975) T PRK07456 514 TFADVDIATASQYCGMDVYLTYRLAPILREQLAKTGPELIKLLEEVELPLEPVLAEMEATGIRIDIPYLKELSQELESTL 593 (975) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 60208978999999999999999999999999862369999999987369999999997185016999999999999999 Q ss_pred HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCC--CCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999740345577776566679987420788755--56654535317887765304762799999998788898865 Q gi|254780277|r 607 LSLEENIYDLSGEKFNLSSPKQLGDILFTKLKFPSGA--KTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTY 684 (976) Q Consensus 607 ~~le~~i~~~aG~~FNinSpkQL~~vLf~~LgLp~~k--KTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY 684 (976) .+++++||++||++|||||||||++|||++||||++| ||||| ||||++|||+| +..||+++.||+||+++||+||| T Consensus 594 ~~le~~i~~~aG~eFNinSpkQL~evLFekL~L~~~k~~KtKtG-ySTd~~vLe~L-~~~hpi~~~il~~R~l~Kl~sTY 671 (975) T PRK07456 594 ERLEQQAYELAGEDFNLSSPKQLGELLFEKLGLDRKKSRKTKTG-YSTDAAVLEKL-EGDHPIVPLILEHRTLSKLKSTY 671 (975) T ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCC-CCCHHHHHHHH-HHCCHHHHHHHHHHHHHHHHHHH T ss_conf 99999999864898898998999999898639998777778899-88629999975-20472799999987899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEH Q ss_conf 32001111001233345330022210000013334333455633222243024248994899963265443687665103 Q gi|254780277|r 685 ADSLPNHINKRTQRVHTFYSLASTMTGRLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAHIAKI 764 (976) Q Consensus 685 ~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D 764 (976) +|+|+.+|++.||||||+|||++|+||||||++|||||||+|+++||+||+||||++||+|++|||||||||||||+|+| T Consensus 672 ~~~L~~~i~~~tgriht~f~Q~~t~TGRlSS~~PNLQNIP~r~~~g~~iR~~Fv~~~g~~l~saDySQiELRilAh~s~d 751 (975) T PRK07456 672 VDALPQLVEAETGRVHTDFNQAVTATGRLSSSNPNLQNIPIRTEFSRRIRKAFLPQEGWLLLSADYSQIELRILAHLSQE 751 (975) T ss_pred HHHHHHHHCCCCCEECCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHEEECCCCCCEEEEECHHHHHHHHHHHHHCC T ss_conf 99988752877587657555410665650478975135777641031234124358998899963208799999998588 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHH Q ss_conf 77787340698889999989809996889988886302513564407770356788199989999999999998888999 Q gi|254780277|r 765 TPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHE 844 (976) Q Consensus 765 ~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~ 844 (976) +.|++||++|+|||+.||+.||++ ++||++||+.||+|||||+||||+||||++||||++||++||++||++||+|++ T Consensus 752 ~~li~af~~~~DiH~~tA~~if~~--~~v~~~~R~~AK~infgiiYG~~~~gLa~~l~is~~eA~~~i~~yf~~yp~v~~ 829 (975) T PRK07456 752 PVLLEAYRSGDDVHTLTARLLFEK--EEISSDERRLGKTINFGVIYGMGAQRFARETGVSQSEAKEFLSRYKERYPKVFA 829 (975) T ss_pred HHHHHHHHCCCCHHHHHHHHHCCC--CCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHH T ss_conf 899998717998689999986393--019999999978988767764687788988597999999999999986901999 Q ss_pred HHHHHHHHHHHCCCEECCCCCEECCCCC------------------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999849888277876018987------------------5883788877878770134565389999999999 Q gi|254780277|r 845 YIEKTKNFVRQNGYVETIFGRRIHYDEI------------------NSPKSSIRNINERAAINAPIQGSAADITRRAMIS 906 (976) Q Consensus 845 ~~~~~~~~a~~~gyv~T~~GRrr~~p~i------------------~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~ 906 (976) ||++++++|+++|||+|++|||||+|.+ .+.+...++++||+|+|||||||||||||+||++ T Consensus 830 ~~~~~~~~a~~~gyv~Tl~gRrr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~A~N~piQGtaADiiK~AMi~ 909 (975) T PRK07456 830 FLELQERLALSKGYVETILGRRRYFHFDKNGLGRLLGKDPEEIDLDVARRAGMEAQQLRAAANAPIQGSSADIIKIAMVQ 909 (975) T ss_pred HHHHHHHHHHHCCCEECCCCCEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 99999999997698876667700464444310001356822134065666777758889883702446799999999999 Q ss_pred HHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 9999986599818999726389995598799999999999965101357444464388833208937819 Q gi|254780277|r 907 VHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKACLPKINLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 907 i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~~~~~~l~VPl~v~~~~G~~W~e~~ 976 (976) ++++++..+++++||||||||||||||++++++++.+|+++||+|. +|+|||+|++++|+||+||- T Consensus 910 ~~~~l~~~~~~~~~llqvHDElifev~~~~~~~~~~~v~~~Me~a~----~l~Vpl~v~~~~G~nW~eaK 975 (975) T PRK07456 910 LQKKLLEQNLPARLLLQVHDELVLEVPPEELEELKPLVKSTMENAV----KLSVPLVVEIGAGDNWMEAK 975 (975) T ss_pred HHHHHHHCCCCCEEEEEEEEEEEEEECHHHHHHHHHHHHHHHHCCC----CCCCCEEEECCCCCCHHHCC T ss_conf 9999973798853899864377895368899999999999982743----57887584117388968709 No 11 >PRK08434 consensus Probab=100.00 E-value=0 Score=2253.25 Aligned_cols=874 Identities=34% Similarity=0.579 Sum_probs=821.0 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHCCC Q ss_conf 72899966269998874267544689885500699999999999998485547987999972748871324086521667 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKANR 84 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa~R 84 (976) |+|+||||||++||||||+|++. +++|+||||||||+|||+|++++++|+ |++||||.++|||||++||+||||| T Consensus 2 k~l~liDg~sl~fRAy~A~~~l~-t~~G~pTnAv~GF~~mL~kli~~~~P~----~iavaFD~~~~TFRhe~y~~YKAnR 76 (887) T PRK08434 2 KTLTIIDTFGFFFRSYYALPPLK-NSQGFPTGLLSGFANFIYSLKNEFKSD----YIVFALDSKGKTFRSEIDPNYKANR 76 (887) T ss_pred CCEEEEECCHHHHHHHCCCCCCC-CCCCCEECHHHHHHHHHHHHHHHCCCC----EEEEEEECCCCCCCHHHHHHHCCCC T ss_conf 80899806489999970799987-999988035999999999999865998----8999984959962210205752899 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCC-CEEEEECCCCCEE Q ss_conf 987975886489999999970997995689016658999999998489949998687012122379-7599966788353 Q gi|254780277|r 85 PKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSP-TTCLYDTVKEEKI 163 (976) Q Consensus 85 ~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~-~v~~~~~~~~~~~ 163 (976) ++|||||++|||+|++++++||||+++.+||||||||||||++++++|++|+|||||||++|||+| +|.+|+|+++..| T Consensus 77 ~~~PedL~~Qip~i~e~l~algIp~~e~~GyEADDiIgTla~~a~~~g~~v~IvSgDKDl~QLV~d~~v~l~~~~k~~~~ 156 (887) T PRK08434 77 PPPPEELLEQLPIAIEWIEKMGFKSLSQEGYEADDIIASFVKFAKEQDIFVRIVSHDKDLYQLIDDGKVSIYDPIKKNEI 156 (887) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCCCEEEEEECCCCEEE T ss_conf 99998999999999999998799884558924999999999999978993999808987776183980899978998687 Q ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 89999888489887877778751874358998776360249989841367899986454200112232200357899976 Q gi|254780277|r 164 DIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETARLS 243 (976) Q Consensus 164 ~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~ls 243 (976) |++.|.|||||+|+||+||+||+||+|||||||||||||||+|||++|||||+||+|+|+|+++|++++|++++|+|++| T Consensus 157 d~~~V~eK~GV~P~qv~D~lAL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~I~~n~d~Ik~~K~re~L~~~~e~A~lS 236 (887) T PRK08434 157 DEAACLEKYGVKPKQIRDFLALCGDSADNIPGVKGIGAKGAKKLLDEFGSLEGIYENLELVRNERIRNLLLEGKENAFLS 236 (887) T ss_pred CHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 79999999796989999999981886457899998568999999998288899999887124077999998547888987 Q ss_pred HHHHHCCCCCCCCCCHHHHCCCCCC-HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 5543203564224763552126767-688998653233689998765311445443342111234332222333222222 Q gi|254780277|r 244 RKLVMLRTDVPITIPLEHLILEDCN-GPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAIETEIDR 322 (976) Q Consensus 244 ~~L~tL~~Dvpl~~~~~~l~~~~~d-~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~ 322 (976) |+||||++|||++++++++++.+.+ ...+.++|+++||+++++++......... T Consensus 237 k~LvtI~~Dvpl~~~le~l~~~~~~~~~~l~~~l~~~ef~~ll~~l~~~~~~~~~------------------------- 291 (887) T PRK08434 237 KKLVTLYDDLEEQDDLEKATFPDENPLLKILDELEKYELNRILKKLQQPGNSSNK------------------------- 291 (887) T ss_pred HHHCEEECCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC------------------------- T ss_conf 7434663068999998880689813899999999998789999973467887766------------------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 23333222223333211223211200112110000123555666503547999999999999866852323322243311 Q gi|254780277|r 323 DTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIITDTMDAF 402 (976) Q Consensus 323 ~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~ 402 (976) .......|..|.+.+++..|++.+...+.+++|+||++++++ T Consensus 292 --------------------------------------~~~~~~~~~~v~~~~~l~~~l~~l~~~~~~a~d~Ett~ld~~ 333 (887) T PRK08434 292 --------------------------------------DKNLGFKAILLDDENKLEEILNTLPKESIIAFDTETTGLDTK 333 (887) T ss_pred --------------------------------------CCCCCCCEEEECCHHHHHHHHHHCCCCCEEEEEEECCCCCCC T ss_conf --------------------------------------665667308838999999999853116769999512786613 Q ss_pred CCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHH-CCC Q ss_conf 0012442010257766544345502313744333456500178899997444420222100110032123788986-188 Q gi|254780277|r 403 HSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHR-YGI 481 (976) Q Consensus 403 ~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~-~GI 481 (976) .++++|+|+|+.++. ++|+|+.|... ..+.+.....++..+.++++++ +||||+|||+++|.+ +|| T Consensus 334 ~a~lvGis~~~~~~~--------a~YiPi~h~~l--~~~~~~~~~~~~~~l~~~~~~~---~IghNlKyDl~vL~~~~gi 400 (887) T PRK08434 334 EAKLVGFSFCFNEEE--------AYYVPLAHSYL--GVGEQISLESAKKAIEKIFNHF---VIGHNLKYDFKIIQNNFGL 400 (887) T ss_pred CCCEEEEEEEECCCC--------EEEEECCCCCC--CCHHHCCHHHHHHHHHHHHHCC---CHHHHHHHHHHHHHHHCCC T ss_conf 343058999955985--------89997521125--7733359999999978763374---0364779999999985698 Q ss_pred CCCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCC---CCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6664-21002345543168766772466776224442---2335654322210000001233332221111124567763 Q gi|254780277|r 482 SMRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTR---KEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 482 ~l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~---~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) .+.+ +|||||||||+||+ .+|+|++|+.+||++.. +++.+++ .+|+.++.+.+..|||+||++|++||..|. T Consensus 401 ~l~g~~fDTmLAaYLLdp~-~r~~Ld~La~~yL~~~~i~~e~l~~kg---~~f~~v~i~~a~~YAaedA~it~~L~~~l~ 476 (887) T PRK08434 401 ELPQKYADTMILAWLKDPS-SRVGLDDLAKRLFNYEMIHFESVVKKG---ENFSSVDLEKACKYAAEDAWITLRFYKKFL 476 (887) T ss_pred CCCCCHHHHHHHHHHCCCC-CCCCHHHHHHHHCCCCCCCHHHHHCCC---CCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7777314899999866986-666667899997087877489985467---770207889999999999999999999999 Q ss_pred HHHHHH---HHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHH Q ss_conf 222100---01345788607501233221014310058999986434368999999999740345577776566679987 Q gi|254780277|r 558 PKLIVE---KLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILF 634 (976) Q Consensus 558 ~~L~~~---~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf 634 (976) ++|++. +++++|++|||||++||++||.+||+||.+.|++++.+++.++++++++||++||++||||||||||+||| T Consensus 477 ~~L~~~~~~~L~~Ly~~IElPLi~VLa~ME~~GI~VD~~~L~~ls~el~~~i~~Le~~Iy~lAG~eFNInSPKQLgeILF 556 (887) T PRK08434 477 ENLEKRGATHLLELAKNVEFPFIKTLLMMEENGIKLDTNFLEELKKKFEEEIKELTEEIYELCGEEFNINSTKQLGAVLF 556 (887) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHH T ss_conf 99986551779999999886599999999971950269999999999999999999999985576647799999999989 Q ss_pred HHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHCCCCCEECC Q ss_conf 4207887555665453531788776530476279999999878889886532001111-001233345330022210000 Q gi|254780277|r 635 TKLKFPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHI-NKRTQRVHTFYSLASTMTGRL 713 (976) Q Consensus 635 ~~LgLp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i-~~~dgRIH~~f~q~~t~TGRl 713 (976) ++||||++|||||| ||||++|||+|. +.||+++.||+||+++||+|||+|+|++++ ++.||||||||||++|+|||| T Consensus 557 ekLgLp~~KKTKTG-ySTd~evLe~L~-~~hpi~~~iLeyR~l~KL~sTYvd~L~~~~~~d~tgRIHttfnQ~~t~TGRL 634 (887) T PRK08434 557 EKLKLPSGKKTKTG-YSTDEKVLNALL-DKHPVIPKILEYRELYKLYSTYIEPLLKLALKDKNSRIYTSFLQTGTATGRL 634 (887) T ss_pred HHCCCCCCCCCCCC-CCCCHHHHHHHH-HCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEECCHHHHHCCCCEEE T ss_conf 86499888888899-888099999988-7387389999860899999999988877514788870310115313221012 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHC Q ss_conf 01333433345563322224302424899489996326544368766510377787340698889999989809996889 Q gi|254780277|r 714 ASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKV 793 (976) Q Consensus 714 Ss~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~v 793 (976) ||++|||||||+|+++||+||+||+|++||+|++|||||||||||||+|+|+.|++||++|+|||+.||+.||| ++| T Consensus 635 SS~~PNLQNIPirt~~Gr~iR~aFv~~~g~~l~~aDysQiElRilAh~s~d~~l~~af~~~~DiH~~tA~~vf~---~~~ 711 (887) T PRK08434 635 SSKNPNLQNIPARGELAKDIREAFVAKEGYSLISLDYSQIELRLLAHFSKDEALLEAFKNDEDIHARTAIKIFG---EEN 711 (887) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHEEECCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHC---CCC T ss_conf 36899756787776421554644340688779970573899999999718899999984499848999999837---778 Q ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCC Q ss_conf 98888630251356440777035678819998999999999999888899999999999998498882778760189875 Q gi|254780277|r 794 SPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEIN 873 (976) Q Consensus 794 t~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~ 873 (976) |+++|+.||+|||||+||||+||||++||||++||++||++||++||+|+.||++++++|+++|||+|++|||||+ ++. T Consensus 712 ~~~~R~~aK~vnfgi~YG~~~~gLa~~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~gyv~Tl~gRrr~~-~~~ 790 (887) T PRK08434 712 AKAKRAVAKSINFGLIYGMGSKKLSETLGISAKEAKEYIESYFASFPTIKNFLESIKNFAKENGYVETLLGRKRYF-DFE 790 (887) T ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCCCCC-CCC T ss_conf 8889989777775665546877799870989999999999999968019999999999999768877888872068-877 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCCC Q ss_conf 88378887787877013456538999999999999999865998189997263899955987999999999999651013 Q gi|254780277|r 874 SPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKACL 953 (976) Q Consensus 874 s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~~ 953 (976) |.|.+.|+++||+|+|||||||||||||+||+++++++++. +++||||||||||||||++++++++++|+++||+|. T Consensus 791 ~~~~~~~~~~eR~A~N~piQGtaADiik~Ami~~~~~l~~~--~~~~~lqvHDElv~e~~~~~~~~~~~~v~~~Me~a~- 867 (887) T PRK08434 791 NATPMQKAMYEREAVNTIFQGSAADLIKLAMNEINKLLNEE--DAKLLLQIHDELIFEVKDELAEEFAKKFKDIMENIV- 867 (887) T ss_pred CCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHC--CCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCC- T ss_conf 98988998999998382336679999999999999998644--986999754267896268699999999999972850- Q ss_pred CCCCCCCCEEEEEEEECCHHHH Q ss_conf 5744446438883320893781 Q gi|254780277|r 954 PKINLRVPLKVNIKVSNNWQEE 975 (976) Q Consensus 954 ~~~~l~VPl~v~~~~G~~W~e~ 975 (976) +|+|||+|++++|+||+|| T Consensus 868 ---~L~VPL~v~~~~g~nW~e~ 886 (887) T PRK08434 868 ---KLNVPLKTSLSIAKNWGDL 886 (887) T ss_pred ---CCCCCEEEECCCCCCHHHC T ss_conf ---1678768642827894653 No 12 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=100.00 E-value=0 Score=2259.42 Aligned_cols=930 Identities=41% Similarity=0.659 Sum_probs=791.5 Q ss_pred EEEEECCHHHHHHHHCCCC---CCCCCCCCEECHHHHHHHHHHHHHHHHCC-CCCCCEEEEEEECCCCCHHHHCCHHHHC Q ss_conf 8999662699988742675---44689885500699999999999998485-5479879999727488713240865216 Q gi|254780277|r 7 LFLVDGSSFIYRAFYATPL---LSRKHDGLPVNAIAGFCNMLWKLLQNSRK-ESIASHFAVIFDYPAVTFRNEIYPDYKA 82 (976) Q Consensus 7 ~~liDg~~~~~ra~~a~p~---~~~~~~G~~t~ai~gf~~~l~~~i~~~~~-~~~~~~~~v~fD~~~~tfR~~~~~~YKa 82 (976) |+||||||++||||||+|. ..+|+.|.||||||||++||++++++.+| + ||+||||+++||||||.|++||| T Consensus 1 L~LiDG~~lafRayfAl~~~~~~L~~~~G~~t~A~yGF~~~L~~~l~~~~Pm~----~~~~aFD~~~~TFR~e~Y~~YKa 76 (1005) T TIGR00593 1 LLLIDGHSLAFRAYFALPNNKKPLTNSKGEPTNAVYGFTKMLLKLLKEEKPMT----YVAVAFDSGTKTFRHEAYEEYKA 76 (1005) T ss_pred CEEEEHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCE----EEEEEECCCCCCHHHHHCHHHCC T ss_conf 92340347999998862211466666788711677888999999997418951----69988768888516662533157 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCC-CEEEEEC--CC Q ss_conf 67987975886489999999970997995689016658999999998489949998687012122379-7599966--78 Q gi|254780277|r 83 NRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSP-TTCLYDT--VK 159 (976) Q Consensus 83 ~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~-~v~~~~~--~~ 159 (976) ||++||+||..|||+|||+|.+||||+++.+||||||||||||++++++|++|+|+|||||++||||+ +|.|+.+ ++ T Consensus 77 ~R~~tP~el~~Qi~~i~e~l~algi~~~e~~GYEADD~IaTLA~~a~~~G~~V~I~sgDrD~lQLvs~e~V~V~~~~~~~ 156 (1005) T TIGR00593 77 NRAKTPEELIEQIPLIKELLEALGIPILEVEGYEADDVIATLAKQAEKEGYEVRIISGDRDLLQLVSDENVKVLIPERKK 156 (1005) T ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCEEEEECCCCC T ss_conf 88888874687689999999971981882288710128999888775468548998378462112178614886435456 Q ss_pred C----CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHH-----------HHHHH Q ss_conf 8----35389999888489887877778751874358998776360249989841367899986-----------45420 Q gi|254780277|r 160 E----EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILI-----------NASRI 224 (976) Q Consensus 160 ~----~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~-----------~~d~i 224 (976) + ..+|++.|.|||||.|.|++||+|||||+|||||||||||||||++||++|||||+||+ |+|+| T Consensus 157 g~~~~~~~t~e~V~eKyGv~P~Q~~D~~AL~GD~SDNiPGV~GIGeKTA~kLL~~fgsLe~iy~~~ylkealr~~~~~~~ 236 (1005) T TIGR00593 157 GKTEFTEITPEYVVEKYGVTPSQLVDLKALVGDSSDNIPGVKGIGEKTAAKLLQEFGSLENIYENTYLKEALRLDNLDQI 236 (1005) T ss_pred CCCCCEECCHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 75431011778987541867457887620468888785998973765689999872108999988778888747758622 Q ss_pred HHHHHHHHHCCCHHHHHH--HHHHHHCCCCCCCCCCHHHH-CCCCCCHHHHHH-HHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 011223220035789997--65543203564224763552-126767688998-65323368999876531144544334 Q gi|254780277|r 225 KQKKRRENILEYAETARL--SRKLVMLRTDVPITIPLEHL-ILEDCNGPRLIS-FLKALEFTKLINRVATTYDCDAANIE 300 (976) Q Consensus 225 ~~~k~~~~l~~~~e~a~l--s~~L~tL~~Dvpl~~~~~~l-~~~~~d~~~l~~-~~~~lef~~ll~rL~~~~~~~~~~~~ 300 (976) ++.+.+++|++++|+|++ |++|+||++|||++++++++ +.+++|.+.+.+ +|++|||+++.++|............ T Consensus 237 k~~~~~~~L~~~~e~a~~vfS~~La~i~~d~pl~~~~~~~~~~~~~~~~~l~~~~l~elef~s~~~~l~~~~~~~~~~~~ 316 (1005) T TIGR00593 237 KSEKLREKLIAHKEDAFLVFSKELATIVTDVPLEVDLEDLLRLSEPDREKLVEKLLQELEFKSLLKKLEKLESPEIDDHA 316 (1005) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 86578899998775378773176500037997434668873036775678999999975089999997202420004567 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEE-------EC- Q ss_conf 2111234332222333222222233332222233332112232112001121100001235556665035-------47- Q gi|254780277|r 301 PVVLDINTTENKGIAIETEIDRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTK-------IV- 372 (976) Q Consensus 301 ~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~-------I~- 372 (976) |.... +......... ...............+........ |. T Consensus 317 p~~~~-------------~l~~~~~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~V~~ 365 (1005) T TIGR00593 317 PALEA-------------DLSQAEAPEE------------------KEEFTLVAKESEEASPLANVAEKAERGGRSDVVL 365 (1005) T ss_pred CCCCC-------------CCCCCCCCCC------------------CCCCHHHHCCCCCCCCCCCCCHHHHHCCCCHHHH T ss_conf 66510-------------0012676421------------------1242112013300267222103555136511343 Q ss_pred --CHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCC-CC-CCCCCEECCCCCCC-----C--CCCCC Q ss_conf --99999999999986685232332224331100124420102577665-44-34550231374433-----3--45650 Q gi|254780277|r 373 --NTKDIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDD-TS-FKVETIFIDLSVHT-----S--QDSTG 441 (976) Q Consensus 373 --~~~~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~-~~-~~~~~~yip~~~~~-----~--~d~~~ 441 (976) +.--+..+.+.......++++++..+...+.++..|++.+..+.... .+ ....++|.+..... . +.... T Consensus 366 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (1005) T TIGR00593 366 ARDAIVLEDLLDQLKKARALAFATENQSLVAYISEADGISIADTELEGFSILVAEAEALYVALIKEDVSLLTILTDAKFA 445 (1005) T ss_pred HHHHHHHHCCHHHHHHCHHHHHHHHCCCCCCCCCCCCCCHHHHCCHHHCCCCCCCCCCCCCCCCCCCHHHHCCHHHHHHH T ss_conf 33323220001121202033222121333310000014203322132123112132455223357722342001112789 Q ss_pred CCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCCCCCC--C-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCC Q ss_conf 0178899997444420222100110032123788986188666--4-210023455431687667724667762244422 Q gi|254780277|r 442 KNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYGISMR--G-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRK 518 (976) Q Consensus 442 ~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~--~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~ 518 (976) ..+......+.+...+.+....+..||+|++.++|.++|+.+. + ++|||||+||++|......|+.++.+|+..... T Consensus 446 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~K~~~~~L~~~g~~~~~~~~~~D~~laaYll~~~~~~~~l~~~~~~y~~~~~~ 525 (1005) T TIGR00593 446 EWLLKETIKKALEELLVSASKVKVAHDAKFLMHLLKRKGIELIEIGVIFDTMLAAYLLDPAQVSKDLDTLAKRYLVEELI 525 (1005) T ss_pred HHHHHCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCCCHHHHHHHHHCCCCCC T ss_conf 98740001455521002341012288999999999743773344211454899999843035640278999871343467 Q ss_pred -C---CCCCCCCCCCCHHHCCHHHHHHHHH-------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf -3---3565432221000000123333222-------------------1111124567763222100013457886075 Q gi|254780277|r 519 -E---ILKSRKSSIPIDKISDSQVQEHAIE-------------------NSNVILQLWLLLRPKLIVEKLLHVYERLDKP 575 (976) Q Consensus 519 -e---~~~~~k~~~~~~~i~~e~~~~Yaa~-------------------Da~~t~~L~~~l~~~L~~~~l~~l~~~iE~P 575 (976) + .....++...+...+.+.+.+|++. |+++||+||+.|+-+|++.++.+||.+||+| T Consensus 526 ~d~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~L~~~l~~~L~~eDaDitfrlyhylklrle~~~~~~L~~eiE~P 605 (1005) T TIGR00593 526 LDDDRKEEGIKKLAKFADPDLEEAIEYLARKAAATARLAEELLKELDAEDADITFRLYHYLKLRLEENKLLSLYEEIELP 605 (1005) T ss_pred CCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHCCC T ss_conf 78731345544334441121778899999999999999999888731132204667777888651638899999884200 Q ss_pred CHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHH Q ss_conf 01233221014310058999986434368999999999740345577776566679987420788755566545353178 Q gi|254780277|r 576 MIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLKFPSGAKTKTGQWKTTAQ 655 (976) Q Consensus 576 l~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~LgLp~~kKTktG~~ST~~~ 655 (976) |++||++||.+||+||.++|++|+.++..+|..+|++||++||++|||||||||++||||+|+||++||+|||+||||++ T Consensus 606 l~~VL~~ME~~Gi~~D~~~L~~L~~e~~~~l~~lE~~Iy~~aG~~FNinSPKQL~~vLF~KL~Lp~~kK~KtG~YSTd~~ 685 (1005) T TIGR00593 606 LSKVLAEMEKTGIKVDKDYLQELSKEFGEELLDLEEEIYELAGEEFNINSPKQLGEVLFEKLGLPVGKKTKTGSYSTDAD 685 (1005) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCEECHH T ss_conf 78999998776678708999999999999999999999987495206887335558869984788200346786213388 Q ss_pred HHHHHHCCCCHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 877653047627-99999-9987888988653200111100123334533002221000001333433345563322224 Q gi|254780277|r 656 DLEQINCGDNPI-IKNIL-EWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTGRLASLEPNLQNIPIKTDLGQKI 733 (976) Q Consensus 656 ~Le~L~~~~hp~-~~~iL-eyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~PNlQNiP~r~~~g~~i 733 (976) |||+|. ..||+ |..+| +||+|+||+|||+|+||+.|++.||||||+|||++|+||||||+||||||||||+++||+| T Consensus 686 vLe~L~-~~h~~~i~~~Ll~yR~L~KLksTY~d~Ll~~v~~~TgRiHT~F~Qt~TaTGRLSS~~PNLQNIPiR~~~Gr~I 764 (1005) T TIGR00593 686 VLEKLA-EKHPIHIALILLEYRQLTKLKSTYVDGLLELVNPDTGRIHTTFNQTGTATGRLSSSNPNLQNIPIRSEEGRKI 764 (1005) T ss_pred HHHHHH-HCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCEEHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHH T ss_conf 999887-4288836899987666654310120133898400028273131454563562004788854433227200133 Q ss_pred CEEEECC-CCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCC Q ss_conf 3024248-994899963265443687665103777873406988899999898099968899888863025135644077 Q gi|254780277|r 734 RRAFISP-PTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGI 812 (976) Q Consensus 734 R~~F~a~-~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~ 812 (976) |+||||. .||.||+|||||||||||||+|+|+.|++||.+|.|||+.||+.+||++.++|||++||.||||||||+||| T Consensus 765 R~AFva~~~g~~L~~ADYSQIELRiLAHlS~De~Li~AF~~~~DIH~~TA~~~Fgv~~~~vtp~~RR~AK~vNFGivYGm 844 (1005) T TIGR00593 765 RKAFVAESKGWLLISADYSQIELRILAHLSQDENLIEAFKNGEDIHTQTASRLFGVEEEDVTPNMRRIAKAVNFGIVYGM 844 (1005) T ss_pred HHCCCCCCCCCEEEECCCCHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHCCCCEEEEHHHHCC T ss_conf 22002468860688526337999999864078679999754887138899997298977449646047414430132101 Q ss_pred CHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 70356788199989999999999998888999999999999984988827787601898758837888778787701345 Q gi|254780277|r 813 SPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPKSSIRNINERAAINAPI 892 (976) Q Consensus 813 ~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~~~~r~~aeR~a~N~~i 892 (976) |+||||++||||.+||++||++||++||||++||++++++||++|||+|++|||||||+|+|+|+..|++|||.|+|+|| T Consensus 845 SafgLA~~L~Is~~EA~~fIe~YF~~yPgv~~Y~~~~~~~ar~~GYV~Tl~gRRRY~~~i~s~n~~~R~~AER~A~NaPI 924 (1005) T TIGR00593 845 SAFGLARELGISRKEAKEFIERYFERYPGVKEYLENTVEEARKKGYVTTLLGRRRYIPDINSRNRNLREAAERMAINAPI 924 (1005) T ss_pred CHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEEEECCCEECCCCCCHHHHHHHHHHHHHHHCCCC T ss_conf 20577775088888999999999763413899999999999736855630465611576322357779999998736753 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHH-HCCCCCCCCCCCEEEEEEEECC Q ss_conf 653899999999999999986599818999726389995598799999999999965-1013574444643888332089 Q gi|254780277|r 893 QGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSME-KACLPKINLRVPLKVNIKVSNN 971 (976) Q Consensus 893 QGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me-~~~~~~~~l~VPl~v~~~~G~~ 971 (976) |||||||||+|||+|+++|++.+|++||+||||||||||+|.+|+++|+++|+++|| +|. +|+|||.|+++.|+| T Consensus 925 QGSAADI~K~AMikl~~~L~~~~L~~rlLLQVHDELl~E~P~~E~e~v~~l~k~~MED~A~----~L~VPL~v~~~~G~~ 1000 (1005) T TIGR00593 925 QGSAADIMKIAMIKLDKALKERKLKSRLLLQVHDELLFEAPEEEAEEVKALVKEVMEDEAY----TLSVPLEVEVETGKN 1000 (1005) T ss_pred CCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEE----ECCCCCEEECCCCCC T ss_conf 4687899999999999999732503334678655887234441378999999987146101----025763675245877 Q ss_pred HHHHC Q ss_conf 37819 Q gi|254780277|r 972 WQEEN 976 (976) Q Consensus 972 W~e~~ 976 (976) |+|+| T Consensus 1001 W~e~k 1005 (1005) T TIGR00593 1001 WGEAK 1005 (1005) T ss_pred HHCCC T ss_conf 11129 No 13 >PRK08076 consensus Probab=100.00 E-value=0 Score=2230.08 Aligned_cols=868 Identities=38% Similarity=0.626 Sum_probs=808.6 Q ss_pred CCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHH Q ss_conf 98897289996626999887426754468988550069999999999999848554798799997274887132408652 Q gi|254780277|r 1 MQKENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDY 80 (976) Q Consensus 1 m~~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~Y 80 (976) |+| +|+||||++++||||||+|+++ +++|+||||||||+|||++++++++|+ |++||||++++|||||+||+| T Consensus 1 m~k--~l~liDg~~~~~Ra~~a~p~l~-~~~G~~t~ai~Gf~~~l~k~~~~~~p~----~~~v~fD~~~~tfR~~~~~~Y 73 (877) T PRK08076 1 LEK--KLVLVDGNSIAYRAFFALPLLN-NDKGIHTNAVYGFTMMLMKILEEEKPT----HMLVAFDAGKTTFRHKTFKEY 73 (877) T ss_pred CCC--EEEEEECCHHHHHHHCCCCCCC-CCCCCCCCHHHHHHHHHHHHHHHCCCC----EEEEEEECCCCCCCHHHHHHH T ss_conf 997--1999957479999981898787-999987134999999999999864998----899998694896322341676 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC Q ss_conf 16679879758864899999999709979956890166589999999984899499986870121223797599966788 Q gi|254780277|r 81 KANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE 160 (976) Q Consensus 81 Ka~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~ 160 (976) ||||++|||+|.+|||+|++++++||||+++.+||||||||||+|++++++|++|+|+|||||++|||+++|+||+|+++ T Consensus 74 Ka~R~~~p~~L~~Q~~~i~~~~~~~gi~~~~~~g~EADDvi~tla~~~~~~~~~v~i~s~DkD~~QLv~~~v~v~~~~k~ 153 (877) T PRK08076 74 KGGRQKTPPELSEQFPFIRELLDAFNIPRYELENYEADDIIGTLAKEAEKDGFEVKVISGDKDLLQLVSDNTTVCITKKG 153 (877) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHCCCCEEEEECCCC T ss_conf 18999998889999999999999879988443890099999999999997799099982898762527498699975899 Q ss_pred ----CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf ----3538999988848988787777875187435899877636024998984136789998645420011223220035 Q gi|254780277|r 161 ----EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEY 236 (976) Q Consensus 161 ----~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~ 236 (976) +.+|++.|.|||||+|+|++||+||+||+|||||||||||||||++||++|||||+||+|+|+|+++|++++|+++ T Consensus 154 ~~~~~~~~~~~v~ek~Gv~P~q~~D~~aL~GDssDNIPGVpGiG~KtA~~ll~~~gsle~i~~~~~~ikg~k~re~L~e~ 233 (877) T PRK08076 154 ITDVEEYTPEALFEKYGLTPKQIIDMKGLMGDSSDNIPGVPGVGEKTAIKLLKQFGTVEEVLESIDEVSGKKLKEKLEEN 233 (877) T ss_pred CCCCEECCHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 86441058999999979698999999997187646799999863799999999668668899877732644678898848 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 78999765543203564224763552126767688998653233689998765311445443342111234332222333 Q gi|254780277|r 237 AETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAI 316 (976) Q Consensus 237 ~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~ 316 (976) +|+|++||+||||++|||++++++++++.++|.++|.++|++|||++|++|+.......... T Consensus 234 ~e~A~LSkeLaTI~~Dvpl~~~l~~l~~~~~d~~~L~~lf~eLEFksLl~rl~~~~~~~~~~------------------ 295 (877) T PRK08076 234 KEQALMSKELATIITDAPIEVTVDDLEYKGYEAEDVIPLFKELGFTSLLEKLGETPEETAPA------------------ 295 (877) T ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHCCCCCCCHHHHHHHHHHCCHHHHHHHHCCCCCCCCCC------------------ T ss_conf 99999999998853189899998881757999999999999806099999845675666787------------------ Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 22222223333222223333211223211200112110000123555666503547999999999999866852323322 Q gi|254780277|r 317 ETEIDRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIIT 396 (976) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtET 396 (976) . .....+.. ...+.+. ......++++|| T Consensus 296 ----------------~-----------------------------~~~~~~~~------~~~~~~~-~l~~~~a~~~Et 323 (877) T PRK08076 296 ----------------E-----------------------------LDDITFDI------VEEVTEE-MLQQKSALVVEV 323 (877) T ss_pred ----------------C-----------------------------CCCCCHHH------HHHHHHH-HHCCCCEEEEEE T ss_conf ----------------6-----------------------------55321466------6465698-746885499996 Q ss_pred CCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHH Q ss_conf 24331100124420102577665443455023137443334565001788999974444202221001100321237889 Q gi|254780277|r 397 DTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVL 476 (976) Q Consensus 397 t~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L 476 (976) ++.+++.+.++|+|+|+.++ ++|+|.... ...+.++++|+++.+.|++||+|||+++| T Consensus 324 ~~~~~~~a~lvGis~s~~~~---------a~yiP~~~~-------------~~~~~lk~~Led~~i~KighNlK~dl~vL 381 (877) T PRK08076 324 LEDNYHKADIQGFAIVNENG---------CYFIPTEIA-------------LKSDAFKEWLEDEEKKKWVFDAKRAIVAL 381 (877) T ss_pred CCCCCCCCCEEEEEEECCCC---------EEEECCCHH-------------HHHHHHHHHHCCCCCCEEEECHHHHHHHH T ss_conf 47885637789999985894---------599550044-------------21589999853976540265479999999 Q ss_pred HHCCCCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCCC---CCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 8618866642-1002345543168766772466776224442---23356543222100000012333322211111245 Q gi|254780277|r 477 HRYGISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYTR---KEILKSRKSSIPIDKISDSQVQEHAIENSNVILQL 552 (976) Q Consensus 477 ~~~GI~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~~---~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L 552 (976) .++||.+.++ |||||||||+||+.++|+|++++.+|+++.. +++.|+++++...+ .+.+.+|||+|+++|++| T Consensus 382 ~~~GI~l~gi~FDTMLAAYLLdP~~~~~~Ld~La~~yl~~~~~~~e~v~Gkg~k~~~p~---~~~~~eYaaedA~~t~~L 458 (877) T PRK08076 382 KWNGIDLQGIDFDLLLAAYLLNPADSDDDFASVAKMKETHAVKSDEAVYGKGAKRAVPE---EEILAEHLARKAHAIYDL 458 (877) T ss_pred HHCCCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCCHHHHCCCCCCCCCCC---HHHHHHHHHHHHHHHHHH T ss_conf 96896447764029999987188767666999999855777763476537652224877---899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH Q ss_conf 67763222100013457886075012332210143100589999864343689999999997403455777765666799 Q gi|254780277|r 553 WLLLRPKLIVEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDI 632 (976) Q Consensus 553 ~~~l~~~L~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~v 632 (976) |+.|+++|++.+++++|++|||||++||++||.+||.||.++|++++.++..++++++++||++||++|||||||||++| T Consensus 459 ~~~L~~~L~~~~l~~Ly~~IE~PLi~VLa~ME~~GI~VD~~~L~~ls~~l~~~l~~le~~I~~laG~eFNinSpKQL~~I 538 (877) T PRK08076 459 KQTFVEELEKNEQLELFEELELPLARVLAEMEVKGVKVDTERLENMGEELAGRLKELEQEIHELAGEEFNINSPKQLGVI 538 (877) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH T ss_conf 99999998655789999999704889999998548389899999999999999999999999972898788999999999 Q ss_pred HHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEC Q ss_conf 87420788755566545353178877653047627999999987888988653200111100123334533002221000 Q gi|254780277|r 633 LFTKLKFPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTGR 712 (976) Q Consensus 633 Lf~~LgLp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGR 712 (976) ||++||||++|||||| ||||++|||+|. ..||+++.||+||+++||+|||+|+|+++|++.||||||+|||++|+||| T Consensus 539 LFekL~Lp~~KKTKtG-~STd~~vLe~L~-~~h~i~~~iLeyR~l~KL~sTY~d~L~~~i~~~tgRiHt~fnQ~~t~TGR 616 (877) T PRK08076 539 LFEKLGLPVIKKTKTG-YSTSADVLEKLA-DKHEIIPNILHYRQLGKLQSTYIEGLLKVVHKDTHKVHTRFNQALTQTGR 616 (877) T ss_pred HHHHCCCCCCCCCCCC-CCCCHHHHHHHH-HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCHHCCCCCC T ss_conf 9985389888878899-878699999754-32854799999989988874899998874187754563010000045564 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEEEC-CCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHH Q ss_conf 001333433345563322224302424-8994899963265443687665103777873406988899999898099968 Q gi|254780277|r 713 LASLEPNLQNIPIKTDLGQKIRRAFIS-PPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIE 791 (976) Q Consensus 713 lSs~~PNlQNiP~r~~~g~~iR~~F~a-~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~ 791 (976) |||++|||||||+||++||+||+|||| ++||+|+||||||||||||||+|+|+.|++||++|+|||+.||+.|||+|.+ T Consensus 617 LSS~~PNLQNIPirt~~G~~IR~aFi~~~~g~~l~saDYSQIELRilAh~s~d~~l~~af~~~~DiH~~tA~~if~~~~~ 696 (877) T PRK08076 617 LSSTDPNLQNIPIRLEEGRKIRQAFVPSEEGWIMFAADYSQIELRVLAHIAEDENLIEAFQHDMDIHTKTAMDVFHVEKD 696 (877) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHEEEECCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCCC T ss_conf 02578752367777644577876143079996799832668899999998278999998715975758888998489801 Q ss_pred HCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCC Q ss_conf 89988886302513564407770356788199989999999999998888999999999999984988827787601898 Q gi|254780277|r 792 KVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDE 871 (976) Q Consensus 792 ~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~ 871 (976) +||+++|+.||+|||||+||||+||||++||||+.||++||++||++||+|+.||++++++|+++|||+|++|||||||+ T Consensus 697 ~vt~~~R~~AK~infGiiYG~~~~gLs~~l~i~~~eA~~~i~~Yf~~yp~v~~~~~~~~~~a~~~Gyv~Tl~gRrr~~p~ 776 (877) T PRK08076 697 EVTSNMRRQAKAVNFGIVYGISDYGLSQNLGITRKEAAEFIERYFESFPGVKEYMEDIVQEAKQKGYVTTLLHRRRYIPE 776 (877) T ss_pred CCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEEECCC T ss_conf 08999999988988778765587889986298999999999999997824999999999999976888646777030876 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHC Q ss_conf 75883788877878770134565389999999999999998659981899972638999559879999999999996510 Q gi|254780277|r 872 INSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKA 951 (976) Q Consensus 872 i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~ 951 (976) |+|.|++.|++|||+|+|||||||||||||+||++++++++.++++++||||||||||||||++++++++++++++||+| T Consensus 777 i~s~n~~~r~~~eR~A~N~piQGtAADiiK~AMi~i~~~l~~~~~~~~mlLqVHDELvfev~~~~~~~~~~~~~~~Me~a 856 (877) T PRK08076 777 ITSRNFNLRSFAERTAMNTPIQGSAADIIKKAMIDMADRLKEEGLQARLLLQVHDELIFEAPKEEIEKLEKLVPEVMEHA 856 (877) T ss_pred CCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEEEEECHHHHHHHHHHHHHHHHCC T ss_conf 55778778888999993614456799999999999999998469985189986426689616889999999999998274 Q ss_pred CCCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 1357444464388833208937819 Q gi|254780277|r 952 CLPKINLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 952 ~~~~~~l~VPl~v~~~~G~~W~e~~ 976 (976) . +|+|||+|++++|+||+||- T Consensus 857 ~----~L~VPL~v~~~~G~nW~eaK 877 (877) T PRK08076 857 V----ELDVPLKVDYSYGPTWYDAK 877 (877) T ss_pred C----CCCCCEEEECCCCCCHHHCC T ss_conf 2----36887584138387967729 No 14 >PRK05929 consensus Probab=100.00 E-value=0 Score=2232.34 Aligned_cols=861 Identities=35% Similarity=0.555 Sum_probs=805.0 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEC-CCCCHHHHCCHHHHCC Q ss_conf 728999662699988742675446898855006999999999999984855479879999727-4887132408652166 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDY-PAVTFRNEIYPDYKAN 83 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~-~~~tfR~~~~~~YKa~ 83 (976) |+|+|||||+++||||||+|++. +++|+||||||||+|||+|++++++|+ |++||||+ +++||||++||+|||| T Consensus 2 kkl~LiDg~sl~fRAy~Alp~l~-~s~G~pTnAv~GF~~mL~kll~~~~p~----~~avaFD~~~~~tfRhe~~~~YKan 76 (870) T PRK05929 2 KKLFVLDASGFVFRAYFALPEMK-NPNGESTQAVFGFIRSLNKLIKEFSPE----YMVAVFDGPNNKQSRQEIYADYKSN 76 (870) T ss_pred CEEEEEECCHHHHHHHCCCCCCC-CCCCCCCCCHHHHHHHHHHHHHHCCCC----EEEEEEECCCCCCCHHHHHHHHCCC T ss_conf 71999917579999980799887-999987050999999999999865998----8999987979997164412565389 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC-CE Q ss_conf 79879758864899999999709979956890166589999999984899499986870121223797599966788-35 Q gi|254780277|r 84 RPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE-EK 162 (976) Q Consensus 84 R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~-~~ 162 (976) |+||||+|.+|||+|++++++||||+++.+||||||||||||++++++|++|+|+|||||++|||+++|++|+|+++ +. T Consensus 77 R~~~p~~l~~Qip~i~e~l~a~gip~~~~~GyEADDiIgTla~~~~~~g~~v~IvTgDKD~~QLv~~~v~v~~~~k~~~~ 156 (870) T PRK05929 77 RDKKFEDLPEQIALVKEYCSLLGLSYLEKEGVEADDVIASITKKAVAEGYEVCICTADKDLLQLVNSHVVAWNPWKDQEV 156 (870) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCCCEEEEECCCCCCC T ss_conf 99998899999999999999879988350893589999999999997799299984899762417798799976888812 Q ss_pred ECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 38999988848988787777875187435899877636024998984136789998645420011223220035789997 Q gi|254780277|r 163 IDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETARL 242 (976) Q Consensus 163 ~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~l 242 (976) +|++.|.|||||+|+|++||+||+||+|||||||||||||||+|||++|||||+||+|+|+|++ |++++|++++|+|++ T Consensus 157 ~d~~~V~ek~Gv~P~qi~D~laL~GDsSDNIPGVpGIG~KTA~kLL~~ygsLE~Iy~n~d~ikg-K~~e~L~e~~d~a~l 235 (870) T PRK05929 157 IGYNEVIEQYGVPPGQIADYLALVGDSSDNIPGVSGCGPKKAAALLKEFQSVEGLLENLDRLSG-KTKQMISEQKETLLL 235 (870) T ss_pred CCHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHCH-HHHHHHHHCHHHHHH T ss_conf 6799999997969899999999728764689999976099999999867877799999986320-207889857887898 Q ss_pred HHHHHHCCCCCCCCCCHHHHCCC--CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 65543203564224763552126--7676889986532336899987653114454433421112343322223332222 Q gi|254780277|r 243 SRKLVMLRTDVPITIPLEHLILE--DCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAIETEI 320 (976) Q Consensus 243 s~~L~tL~~Dvpl~~~~~~l~~~--~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 320 (976) ||+||||++|||++++++++++. ++|.+++..+|.++||++|++++... T Consensus 236 Sk~LatI~~Dvpl~~~le~l~~~~~~~d~~~L~~~~~~leF~sL~~~~~~~----------------------------- 286 (870) T PRK05929 236 SKRLATLDSNIPIPLPIEELAFPQHPVDEEQLNTFYLQHGFKTLVKHKETE----------------------------- 286 (870) T ss_pred HHHHHHHHCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHCCHHHHHHCCCCC----------------------------- T ss_conf 888756531688999978971576787999999999970447541222345----------------------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 22233332222233332112232112001121100001235556665035479999999999998668523233222433 Q gi|254780277|r 321 DRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIITDTMD 400 (976) Q Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld 400 (976) ..+..+..+.+.+.+..+++.+.. +.++++++|++.+ T Consensus 287 ------------------------------------------~~~~~~~~i~~~~~l~~~~~~l~~-~~~a~~~~~~~~~ 323 (870) T PRK05929 287 ------------------------------------------ASSIDVQIIKDPESLTNVLNLLQG-GEVAFCVAYTGNH 323 (870) T ss_pred ------------------------------------------CCCCCCEEECCHHHHHHHHHHHCC-CCEEEEEEECCCC T ss_conf ------------------------------------------577771530898999999998517-9747999745887 Q ss_pred CCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCC Q ss_conf 11001244201025776654434550231374433345650017889999744442022210011003212378898618 Q gi|254780277|r 401 AFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYG 480 (976) Q Consensus 401 ~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~G 480 (976) +..++++|+++|..++ ++|+|+.+.. .+++..|+++|+++++.|+|||+|||+++|.++| T Consensus 324 ~~~~~lvGisls~~~~---------a~yip~~~~~-----------~~vl~~Lk~~led~~i~kig~NlK~d~~vL~~~G 383 (870) T PRK05929 324 LPSLKLHGVALAGGSQ---------VFYIEVEESG-----------DQEIALLKDFFARKDTTFYGYNLKRDNHALKNAG 383 (870) T ss_pred CCCCCEEEEEEECCCE---------EEEEECCCCC-----------HHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHCC T ss_conf 6777288999976980---------8999765555-----------2389999998609877453223777999999689 Q ss_pred CCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCC--CCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 866642-100234554316876677246677622444223356--54322210000001233332221111124567763 Q gi|254780277|r 481 ISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILK--SRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 481 I~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~--~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) |.+.++ ||||+|+||+||+. +++++.+...|.........+ .++.+.++...+ +....|+|+|++.+++|+..|. T Consensus 384 I~l~g~~fDtmLAsYLLn~~~-~~~~~~ll~~~~~~~~~~~~~k~~g~~~l~~~~~~-~~~~~yaae~a~~~~~L~~~L~ 461 (870) T PRK05929 384 IDVHEITADLALAEHLVNGGA-KISFQTLLVEHGHIEEAHRFAKEWGASSLPIQRLP-EQPAQYFGEFVSYLPIIKNYLL 461 (870) T ss_pred CCCCCCHHHHHHHHHHHCCCC-CCCHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCH-HHHHHHHHHHHHHHHHHHHHHH T ss_conf 856872779999998629999-88839999766673155565554076666420073-8899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHC Q ss_conf 22210001345788607501233221014310058999986434368999999999740345577776566679987420 Q gi|254780277|r 558 PKLIVEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKL 637 (976) Q Consensus 558 ~~L~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~L 637 (976) ++|++.++.++|++|||||++||++||.+||+||.++|++++.+++.++++++++||++||++||||||||||+|||++| T Consensus 462 ~~L~~~~L~~Ly~eIE~PLi~VLa~ME~~GI~vD~~~L~~ls~el~~~l~~Le~~I~~~AG~eFNinSPKQL~evLFekL 541 (870) T PRK05929 462 EELEEKGLKDIFSNVEMPLEKVLFAMERAGMPLDVEGLAVLERDLSKELAVLTQEIYDLAGCPFNIKSPKQLSDILYNRL 541 (870) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEEHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHC T ss_conf 99988779999999874489999998738989819999989999999999999999997299988789899999989863 Q ss_pred CCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCC Q ss_conf 78875556654535317887765304762799999998788898865320011110012333453300222100000133 Q gi|254780277|r 638 KFPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTGRLASLE 717 (976) Q Consensus 638 gLp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~ 717 (976) |||+.|||| |||++|||+|+ ..||+++.||+||+++||+|||+|+|+++|++.||||||+|||++|+||||||++ T Consensus 542 gL~~~kKtk----ST~~evLe~L~-~~hpi~~~iLeyR~l~KL~sTYvd~L~~~i~~~tgRIHTsfnQt~TaTGRLSSs~ 616 (870) T PRK05929 542 GLRPIDKAK----STKAEVLEALE-GEHEIIEKILAFRAIEKLLSTYVKALPKQIDSHTQRIHPTFNQVGTVTGRLACQD 616 (870) T ss_pred CCCCCCCCC----CCHHHHHHHHH-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCEEECCCCCCCCCCCCC T ss_conf 999888889----87289999740-6002689999985899875046777554225676877762331101113224689 Q ss_pred CCCCCCCCCCCCCCCCCEEE-ECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHH Q ss_conf 34333455633222243024-24899489996326544368766510377787340698889999989809996889988 Q gi|254780277|r 718 PNLQNIPIKTDLGQKIRRAF-ISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQ 796 (976) Q Consensus 718 PNlQNiP~r~~~g~~iR~~F-~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~ 796 (976) |||||||+||++||+||+|| +|++||+|+||||||||||||||+|+|++|++||++|+|||+.||+.|||+|.++||++ T Consensus 617 PNLQNIPiRte~Gr~IR~aF~~~~~g~~llsaDYSQIELRilAHlS~D~~Li~AF~~g~DIH~~tA~~if~v~~~~Vt~~ 696 (870) T PRK05929 617 PNLQNIPIRSERGRLLRKAFRLTKKNDYFLSADYSQIELRFLAHLSQDESLKRAFESGEDIHAFTASQVFHVPLEQVTKQ 696 (870) T ss_pred CCCCCCCCCCCCCHHHHHHEEECCCCCEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCHHHHHHHHHHHCCCHHHCCHH T ss_conf 97653677541115678751328999779961266889999999836786898750562076652698848981216987 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCC Q ss_conf 88630251356440777035678819998999999999999888899999999999998498882778760189875883 Q gi|254780277|r 797 MRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPK 876 (976) Q Consensus 797 ~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~ 876 (976) ||+.||+|||||+||||+||||++||||++||++||++||++||+|++||++++++|+++|||+|++|||||+|+|+|.| T Consensus 697 ~Rr~AK~vNFGiiYG~s~fgLa~~L~is~~eA~~~i~~Yf~~yp~v~~y~~~~~~~a~~~Gyv~Tl~gRrr~ip~i~~~~ 776 (870) T PRK05929 697 QRMQAKTVNFGIVYGQQAYGLSKILKISVSEAQGLIDAYFARYPEAAEFITETIEQASKNLRVTTMLGRERIIDDWEEFP 776 (870) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCCCCH T ss_conf 89886883545675658788998849899999999999998691299999999999997696767688600288656774 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 78887787877013456538999999999999999865998189997263899955987999999999999651013574 Q gi|254780277|r 877 SSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKACLPKI 956 (976) Q Consensus 877 ~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~~~~~ 956 (976) +.|++|||+|||||||||||||||+|||++++++++++++++||||||||||||||++++++++++|+++||+|. T Consensus 777 -~~r~~~eR~AiN~piQGtAADiiK~AMi~i~~~l~~~~~~~~mllQVHDELvfEv~~~e~~~~~~~v~~~Me~~~---- 851 (870) T PRK05929 777 -GSRAASGRLAVNTRIQGSAAELIKLAMLNISDALKSRGLKSRMLLQIHDELLFEVPEEELEEMQRLVREKMESAM---- 851 (870) T ss_pred -HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCC---- T ss_conf -567788898818766648999999999999999985698764999853067884278899999999999972720---- Q ss_pred CCCCCEEEEEEEECCHHHH Q ss_conf 4446438883320893781 Q gi|254780277|r 957 NLRVPLKVNIKVSNNWQEE 975 (976) Q Consensus 957 ~l~VPl~v~~~~G~~W~e~ 975 (976) +|+|||+|++++|+||+|- T Consensus 852 ~l~VpL~v~~~~g~nW~e~ 870 (870) T PRK05929 852 TLSVPLVVNILIGKNWAEC 870 (870) T ss_pred CCCCCEEEECCCCCCCCCC T ss_conf 3688747222818885359 No 15 >PRK05797 consensus Probab=100.00 E-value=0 Score=2222.33 Aligned_cols=860 Identities=39% Similarity=0.659 Sum_probs=819.0 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHCCC Q ss_conf 72899966269998874267544689885500699999999999998485547987999972748871324086521667 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKANR 84 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa~R 84 (976) |+|+||||++++||||||+|++. +++|.||||||||+|||++++++++|+ |++||||.+++||||++||+||||| T Consensus 3 k~l~liDg~~~~~Ra~~a~~~l~-~~~G~pt~ai~Gf~~~l~~~~~~~~p~----~~~v~fD~~~~tfR~~~y~~YKa~R 77 (869) T PRK05797 3 KRLLILDGNSLMNRAFYALPPLT-NSEGIHTNAIYGFTNMLLKMKEEIKPD----YIVVAFDKKAPTFRHKEYKDYKAGR 77 (869) T ss_pred CEEEEEECCHHHHHHHCCCCCCC-CCCCCEECHHHHHHHHHHHHHHHCCCC----EEEEEEECCCCCCCHHHCHHHCCCC T ss_conf 75999937589999980799888-998987054999999999999873998----8999985939960421233642899 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC---- Q ss_conf 9879758864899999999709979956890166589999999984899499986870121223797599966788---- Q gi|254780277|r 85 PKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE---- 160 (976) Q Consensus 85 ~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~---- 160 (976) +||||||.+|||+|++++++||||+++.+||||||||||+|+++.++|++|+|+|||||++|||+++|+||+|+++ T Consensus 78 ~~~P~~l~~Q~~~i~~~~~~~gi~~~~~~g~EADDvi~tla~~~~~~g~~v~i~s~DkD~~QLv~~~v~v~~~~k~~~~~ 157 (869) T PRK05797 78 KKMPEELAEQFPILKELLDALNINIFEIDGFEADDLIGTLSKFAEEKGIEVYIVTGDKDALQLASDNVKVVITKKGITEK 157 (869) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCEEEEECCCCCCCC T ss_conf 99997999999999999998899874458915898999999999978995899807986001087998999888999745 Q ss_pred CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 35389999888489887877778751874358998776360249989841367899986454200112232200357899 Q gi|254780277|r 161 EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETA 240 (976) Q Consensus 161 ~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a 240 (976) +.+|++.|.|||||+|+||+||+||+||+|||||||||||||||++||++|||||+||+|+|+|++++++++|++++|+| T Consensus 158 ~~~~~~~v~ek~Gv~P~q~~D~~aL~GD~sDNIPGVpGIG~KTA~kLL~eyGsLE~I~~n~deIkgkk~~e~L~e~~d~a 237 (869) T PRK05797 158 EIYDKNKMVEEYGVTPTQFIDVKGLMGDKSDNIPGVPGIGEKTAFKLIKEYGSIENVLENIDNIKGKKLKENLEENREQA 237 (869) T ss_pred EEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 56379998999778999999999980886457899998781899999986787889998788505388999999548888 Q ss_pred HHHHHHHHCCCCCCCCCCHHHHCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 97655432035642247635521267-67688998653233689998765311445443342111234332222333222 Q gi|254780277|r 241 RLSRKLVMLRTDVPITIPLEHLILED-CNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTENKGIAIETE 319 (976) Q Consensus 241 ~ls~~L~tL~~Dvpl~~~~~~l~~~~-~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 319 (976) ++||+||||++|||++++++++++++ +|.+++.++|++|||++|++++....... T Consensus 238 ~LSkeLvTI~~Dvpl~~~le~l~~~~~~d~~~L~~~f~elEFksLl~rl~~~~~~~------------------------ 293 (869) T PRK05797 238 IFSKKLATIMRNVPIEIDLEEIKSKEEYDIEKLRELFEKLEFKSLIDKIGKNDEEE------------------------ 293 (869) T ss_pred HHHHHHHHCCCCCCCCCCHHHHCCCCCCCHHHHHHHHHHHCCHHHHHHHCCCCCCC------------------------ T ss_conf 99877774111798899989951478899999999999826599999752046766------------------------ Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 22223333222223333211223211200112110000123555666503547999999999999866852323322243 Q gi|254780277|r 320 IDRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIITDTM 399 (976) Q Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~l 399 (976) .......|..+.+.+++..|++.+.. ..+++|+||+++ T Consensus 294 -----------------------------------------~~~~~~~~~~i~~~~~l~~l~~~l~~-~~~~~~~ett~~ 331 (869) T PRK05797 294 -----------------------------------------EVEEEVEIKNIVDIEELKDLFNNIKN-NQVYINFTLEDE 331 (869) T ss_pred -----------------------------------------CCCCCCCCEEECCHHHHHHHHHHHHC-CCEEEEEECCCC T ss_conf -----------------------------------------78755674553899999999986134-987999973897 Q ss_pred CCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHC Q ss_conf 31100124420102577665443455023137443334565001788999974444202221001100321237889861 Q gi|254780277|r 400 DAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRY 479 (976) Q Consensus 400 d~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~ 479 (976) +++.+.++|+++++.++. ++|+|+.+. .....+++.|+++|+++++.|||||+|||+++|.++ T Consensus 332 ~~~~~~~vg~s~~~~~~~--------a~yip~~~~---------~~~~~~L~~Lk~lLed~~i~KIGhNlK~dl~vL~~~ 394 (869) T PRK05797 332 NLYSKIEIKKIFINFGEK--------TYYIDFKEL---------IDEEEFIEDLKEIFENEEIKKIGHDIKNFLTILKKL 394 (869) T ss_pred CHHHHHHEEEEEEECCCE--------EEEEECHHH---------CCHHHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHC T ss_conf 612030418999961997--------999831451---------677899999999984899878984306999999975 Q ss_pred CCCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8866642-100234554316876677246677622444223356543222100000012333322211111245677632 Q gi|254780277|r 480 GISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRP 558 (976) Q Consensus 480 GI~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~ 558 (976) ||.+.++ ||||||+|||||+.++|+|++|+.+||++...+ ....|||+|+.++++||..|++ T Consensus 395 GI~l~gi~FDTMLAsYLLnP~~~~h~Ld~La~~yL~~~~~~-----------------~e~~ya~~dA~~~~~L~~~L~~ 457 (869) T PRK05797 395 GIEFKGLAFDTAIAAYLIDPSKSEYDLSDLAKEYLSKDVNS-----------------EEEEYKIKEVSVMKELYEKLKE 457 (869) T ss_pred CCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCC-----------------HHHHHHHHHHHHHHHHHHHHHH T ss_conf 98556616689999997289988889999999980877751-----------------4789999999999999999999 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 22100013457886075012332210143100589999864343689999999997403455777765666799874207 Q gi|254780277|r 559 KLIVEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLK 638 (976) Q Consensus 559 ~L~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~Lg 638 (976) +|++.+++++|++|||||++||++||.+||.||.+.|++++.++..+++.++++||++||++||+||||||++|||++|| T Consensus 458 ~L~~~~l~~L~~~IE~PLi~VLa~ME~~GI~VD~~~L~~l~~~l~~~l~~le~~I~~~aG~eFNinSPKQL~~iLFe~Lg 537 (869) T PRK05797 458 KIEEMDMEKLLYEVELPLVEVLASMESEGFKVDKDILDELSKKFKEEIEKTQKEIYELAGEEFNINSPKQLGKILFEKLD 537 (869) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCC T ss_conf 99861289999999867899999998468099999999999999999999999999862886678998999999898649 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCC Q ss_conf 88755566545353178877653047627999999987888988653200111100123334533002221000001333 Q gi|254780277|r 639 FPSGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTGRLASLEP 718 (976) Q Consensus 639 Lp~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~P 718 (976) ||++|||||| ||||++|||+|. ..||+++.||+||+++||+|||+|+|+++|+ .||||||+|||++|+||||||++| T Consensus 538 Lp~~KKtKtG-~STd~~vLe~L~-~~hpi~~~iLeyR~l~KL~sTY~d~L~~~i~-~~gRiHt~f~Q~~t~TGRLSS~~P 614 (869) T PRK05797 538 LPVIKKTKTG-YSTNAEVLEKLR-DKHPIIEKITEYRQLTKLYSTYVEGLKNVID-EDGRIHSNFNQTVTTTGRLSSTEP 614 (869) T ss_pred CCCCCCCCCC-CCCCHHHHHHHH-HCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCCCCHHHHHHCCCCCCCCCCC T ss_conf 9988878899-878599999765-5488519999987998999889878997426-246505426654344021146898 Q ss_pred CCCCCCCCCCCCCCCCEEEECCCC-CEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHH Q ss_conf 433345563322224302424899-4899963265443687665103777873406988899999898099968899888 Q gi|254780277|r 719 NLQNIPIKTDLGQKIRRAFISPPT-KKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQM 797 (976) Q Consensus 719 NlQNiP~r~~~g~~iR~~F~a~~g-~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~ 797 (976) ||||||+|+++||+||+||+|++| ++|+||||||||||||||+|+|+.|++||++|+|||+.||+.|||+|.++||+++ T Consensus 615 NLQNIP~r~e~G~~iR~aFi~~~~~~~l~saDYSQIELRilAh~s~d~~l~~af~~~~DiH~~tA~~vf~~~~~~v~~~~ 694 (869) T PRK05797 615 NLQNIPIKYEMGREIRKVFIPNTEDSVILSADYSQIELRVLAHISGDENLIDAFNHHEDIHTKTASEVFKVPIEEVTPLM 694 (869) T ss_pred CCCCCCCCCCCHHHHHHEEECCCCCCEEEECCHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHH T ss_conf 64568776520122553571477761588522778999999998468999998855997689999999598911179889 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCH Q ss_conf 86302513564407770356788199989999999999998888999999999999984988827787601898758837 Q gi|254780277|r 798 RRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPKS 877 (976) Q Consensus 798 R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~~ 877 (976) |+.||+|||||+||||+||||++||||++||++||++||++||+|++||++++++|+++|||+|++|||||+|+|+|.|+ T Consensus 695 R~~AK~infGiiYG~~~~gLs~~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~g~v~Tl~gRrr~~p~i~s~n~ 774 (869) T PRK05797 695 RSNAKAVNFGIVYGIGDFSLSQDLNISRKEAKEYIDNYFERYPNVKEYMEKIVEEAKEKGYVTTILNRRRYIPEIKSSNK 774 (869) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCEEECCCCCEECCCCCCCCCH T ss_conf 99988988566555686778985298999999999999996804999999999999978907888887335887778898 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 88877878770134565389999999999999998659981899972638999559879999999999996510135744 Q gi|254780277|r 878 SIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKACLPKIN 957 (976) Q Consensus 878 ~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~~~~~~ 957 (976) +.|++|||+|+|+|||||||||+|+||++++++++.++++++||||||||||||||++++++++.+|+++||+|. + T Consensus 775 ~~r~~~eR~AiN~piQGtaADiiK~AMi~~~~~l~~~~~~~~~~lqVHDElv~ev~~~~~~~~~~~v~~~Me~~~----~ 850 (869) T PRK05797 775 IVRAFGERLAMNTPIQGSAADIIKLAMVNVYNKLKELNLKSKLILQVHDELILNVYKDELEEVKKIVKEEMENVI----K 850 (869) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEEEEECHHHHHHHHHHHHHHHHCCC----C T ss_conf 899778887827465567999999999999999985499864999973266895168899999999999972842----5 Q ss_pred CCCCEEEEEEEECCHHHHC Q ss_conf 4464388833208937819 Q gi|254780277|r 958 LRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 958 l~VPl~v~~~~G~~W~e~~ 976 (976) |+|||+|++++|+||+||- T Consensus 851 l~VPl~v~~~~g~nW~eaK 869 (869) T PRK05797 851 LKVPLDVDINVGNNWYEAK 869 (869) T ss_pred CCCCEEEECCCCCCHHHCC T ss_conf 7887473048278967709 No 16 >PRK07300 consensus Probab=100.00 E-value=0 Score=2216.18 Aligned_cols=862 Identities=36% Similarity=0.585 Sum_probs=787.2 Q ss_pred CCCCCEEEEEECCHHHHHHHHCC----CCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHC Q ss_conf 98897289996626999887426----75446898855006999999999999984855479879999727488713240 Q gi|254780277|r 1 MQKENHLFLVDGSSFIYRAFYAT----PLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEI 76 (976) Q Consensus 1 m~~~~~~~liDg~~~~~ra~~a~----p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~ 76 (976) |+++|+|+|||||+|+||||||+ |++ +|++|+||||||||+|||++++++++|+ |++||||++++||||++ T Consensus 1 m~~kk~l~LIDG~slifRAfyAl~~~~~~l-~ts~G~pTnAi~GF~~mL~kli~~~~P~----~iavaFD~~~~TFR~~~ 75 (880) T PRK07300 1 MENKNKLLLIDGSSVAFRAFFALYNQIDRF-KNHSGLHTNAIYGFHLMLDHMMKRVQPT----HVLVAFDAGKTTFRTEM 75 (880) T ss_pred CCCCCCEEEEECCHHHHHHHCCCCCCCCCC-CCCCCCEECHHHHHHHHHHHHHHHCCCC----EEEEEEECCCCCCCHHH T ss_conf 998981899967488999871788777887-2999988235999999999999864998----89999969599601423 Q ss_pred CHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCC--CCEEEECCCCHHHEECCCCEEE Q ss_conf 86521667987975886489999999970997995689016658999999998489--9499986870121223797599 Q gi|254780277|r 77 YPDYKANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEG--FAVTIISTDKDLMQLVSPTTCL 154 (976) Q Consensus 77 ~~~YKa~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~--~~v~i~s~DkD~~Qlv~~~v~~ 154 (976) ||+|||||++|||||.+|+|+|++++++||||+++.+||||||||||+|+++...| ++|+|+|||||++|||+++|.+ T Consensus 76 ~~~YKa~R~~~P~~l~~Q~~~i~~~~~~~~i~~~~~~g~EADDvi~tlak~a~~~~~~~~v~Ivs~DkD~~QLV~~~v~v 155 (880) T PRK07300 76 YADYKAGRAKTPDEFREQFPYIREMLTALGIAYYELEHYEADDIIGTLDKMAERTEVPFDVTIVSGDKDLIQLTDENTVV 155 (880) T ss_pred HHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEE T ss_conf 15752799999889999999999999987998825289149999999999999649997189974898616107698589 Q ss_pred EECCCC----CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 966788----3538999988848988787777875187435899877636024998984136789998645420011223 Q gi|254780277|r 155 YDTVKE----EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRR 230 (976) Q Consensus 155 ~~~~~~----~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~ 230 (976) ++++++ +.+|++.|.|||||+|+||+||+||+||+|||||||||||||||++||++|||||+||+|+|+|+++|++ T Consensus 156 ~~~~~~~~~~~~~~~~~v~ek~GV~P~qiiD~lAL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~I~~n~d~Ik~~K~r 235 (880) T PRK07300 156 EISKKGVAEFEEFTPAYLMEKMGLTPNQFIDLKALMGDKSDNIPGVTKIGEKTGLKLLHEFGSLEGIYENIDGMKASKMK 235 (880) T ss_pred EECCCCCCCCEECCHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHCCHHHH T ss_conf 74689974121058999999978898999999998088624677889853699999999778699999857531325688 Q ss_pred HHHCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 22003578999765543203564224763552126767688998653233689998765311445443342111234332 Q gi|254780277|r 231 ENILEYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKLINRVATTYDCDAANIEPVVLDINTTE 310 (976) Q Consensus 231 ~~l~~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~ll~rL~~~~~~~~~~~~~~~~~~~~~e 310 (976) ++|++++|+|++||+||||++|||++++++++++.++|.+++..+|++|||++|.+++............ T Consensus 236 e~L~~~ke~a~lSk~LvTI~~Dvpl~~~ledl~~~~~d~~~L~~~f~elEFksL~~~l~~~~~~~~~~~~---------- 305 (880) T PRK07300 236 ENLINDKEQAFLSKTLATINTASPITIGLDDIVYKGPDVASLSQFYDEMGFVQLKKALGSQMPQEPVAAI---------- 305 (880) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHCHHHHHHHHCCCCCCCCCCCC---------- T ss_conf 9998638888888652010207989999899555899999999999986649999985133788776544---------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHCCCC Q ss_conf 22233322222223333222223333211223211200112110000123555666503547999999999999866852 Q gi|254780277|r 311 NKGIAIETEIDRDTTIKTSQFSSLSDQTSKINSEQTPQKLFLERLQKLSQYPINNDSYTKIVNTKDIQQWVQKLETIGSA 390 (976) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~p~~~~~~~~~~~~~~~~~~~~y~~I~~~~~L~~li~~L~~~~~i 390 (976) . +..+.... .+. ....- T Consensus 306 ----------------~----------------------------------------~~~~~~~~-----~~~--~~~~~ 322 (880) T PRK07300 306 ----------------S----------------------------------------FQEVTQVS-----ADM--FSAED 322 (880) T ss_pred ----------------C----------------------------------------CCCCCCCC-----HHH--CCCCC T ss_conf ----------------4----------------------------------------42124024-----524--59460 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCC Q ss_conf 32332224331100124420102577665443455023137443334565001788999974444202221001100321 Q gi|254780277|r 391 SFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIK 470 (976) Q Consensus 391 a~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~K 470 (976) .+++||++++++.++++|+|++..+ .+|++....... . ..+++++.. .+.+||+| T Consensus 323 ~~~~et~~~~~~~a~lvG~s~~~~~----------~~y~~~~~~~l~---------~---~~~~~~~~~---~i~~~nlK 377 (880) T PRK07300 323 FFYFETLGDNYHREAIIGFAWGNGE----------QIYASTDLSLLA---------T---DSFKQVLQK---PIATYDFK 377 (880) T ss_pred EEEEEECCCCCCCCCEEEEEEECCC----------EEEECCCHHHHH---------C---HHHHHHHHH---HHHHHHHH T ss_conf 2576533798130777999985597----------589558133211---------7---688887653---56550378 Q ss_pred HHHHHHHHCCCCCCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCC---CCCCCCCCCCCCCHHHCCHHHHHHHHHHH Q ss_conf 237889861886664-21002345543168766772466776224442---23356543222100000012333322211 Q gi|254780277|r 471 YDKLVLHRYGISMRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTR---KEILKSRKSSIPIDKISDSQVQEHAIENS 546 (976) Q Consensus 471 fD~~~L~~~GI~l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~---~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da 546 (976) +|+.+|.++||.+.+ .||||||+|||+|.+ +|+|+.++.+|+++.. +++.|+++++.. ...+....|+|+|+ T Consensus 378 ~~~~vL~~~GI~l~~~~fDTmLAsYLL~p~~-~~~Ld~La~~yl~~~~~~~e~l~Gkg~k~~~---~~~~~~~~yaa~da 453 (880) T PRK07300 378 RSKVLLSHLGIELPAPSFDARLAKYLLSTVE-DNELSTIARLYTDISLETDDTVYGKGAKRAI---PEKEVLLEHLARKV 453 (880) T ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHCCCCCC-CCCHHHHHHHHCCCCCCCHHHHCCCCCCCCC---CCHHHHHHHHHHHH T ss_conf 6899999779866774034899985368765-5877899999468888733766366521036---62779999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCH Q ss_conf 11124567763222100013457886075012332210143100589999864343689999999997403455777765 Q gi|254780277|r 547 NVILQLWLLLRPKLIVEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSP 626 (976) Q Consensus 547 ~~t~~L~~~l~~~L~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSp 626 (976) +++++|++.|+++|++.+++++|++|||||++||++||.+||+||.++|++++.+++.+++.||++||++||++|||||| T Consensus 454 ~~~~~L~~~L~~~L~~~~l~~L~~~IE~PLi~VLa~ME~~GI~VD~~~L~~ls~el~~~l~~Le~~Iy~lAG~eFNInSP 533 (880) T PRK07300 454 KVLLDSKEQMLDKLTAHEQLDLLFDMELPLANVLAKMEIAGIKVNRETLQEMAAENEVVIEALTQEIYDLAGQEFNINSP 533 (880) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCH T ss_conf 99999999989998761699999999868999999999658088899999999999999999999999865876577999 Q ss_pred HHHHHHHHHHCCCCC--CCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 666799874207887--555665453531788776530476279999999878889886532001111001233345330 Q gi|254780277|r 627 KQLGDILFTKLKFPS--GAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYS 704 (976) Q Consensus 627 kQL~~vLf~~LgLp~--~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~ 704 (976) ||||+|||++||||+ .|||||| ||||++|||+| +..||++..||+||+++||+|||+|+|+++|+ .||||||||| T Consensus 534 KQLgeILFekLgLP~~~~KKTKTG-ySTd~evLe~L-~~~hpi~~~ILeyR~l~KLkSTY~d~L~~~i~-~~gRIHTsf~ 610 (880) T PRK07300 534 KQLGVILFEKMGLPLEMTKKTKTG-YSTAVDVLERL-APIAPIVAKILEYRQITKLQSTYVIGLQDYIL-ADGKIHTRYV 610 (880) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCC-CCCHHHHHHHH-HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCCCCHHH T ss_conf 999999998449987677778888-78579999997-66187489999984899999998998998547-5461221356 Q ss_pred CCCCCEECCCCCCCCCCCCCCCCCCCCCCCEEEECC-CCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHH Q ss_conf 022210000013334333455633222243024248-9948999632654436876651037778734069888999998 Q gi|254780277|r 705 LASTMTGRLASLEPNLQNIPIKTDLGQKIRRAFISP-PTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAA 783 (976) Q Consensus 705 q~~t~TGRlSs~~PNlQNiP~r~~~g~~iR~~F~a~-~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~ 783 (976) |++|+||||||++|||||||+||++||+||+||||. +||+|+||||||||||||||+|+|+.|++||++|+|||+.||+ T Consensus 611 Qt~TaTGRLSS~~PNLQNIPiRte~Gr~IR~aFi~~~~~~~llsaDYSQIELRilAhlS~D~~Li~AF~~~~DIH~~TA~ 690 (880) T PRK07300 611 QDLTQTGRLSSVDPNLQNIPVRLEQGRLIRKAFVPSHEDAVLLSSDYSQIELRVLAHISGDEHLIAAFKEGADIHTSTAM 690 (880) T ss_pred HHHCCCCEECCCCCCCCCCCCCCCCCHHHHHEEECCCCCEEEEEECHHHHHHHHHHHHHCCHHHHHHHHCCCCHHHHHHH T ss_conf 42313342114787634688777435204412732788559998232268999999981888899987158877899999 Q ss_pred HHHCCC-HHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECC Q ss_conf 980999-6889988886302513564407770356788199989999999999998888999999999999984988827 Q gi|254780277|r 784 EIFGVN-IEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETI 862 (976) Q Consensus 784 ~vf~~~-~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~ 862 (976) +|||+| .++||++||+.||+|||||+|||||||||++||||++||++||++||++||+|+.||++++++|+++|||+|+ T Consensus 691 ~if~v~~~~~Vt~~~Rr~AK~iNFGiiYG~s~fgLs~~l~i~~~eA~~~I~~Yf~~yp~v~~~~~~~~~~a~~~Gyv~Tl 770 (880) T PRK07300 691 RVFGIEKAEDVTANDRRNAKAVNFGIVYGISDFGLSNNLGITRKQAKSYIDTYFERYPGIKNYMENVVREAKDKGYVETL 770 (880) T ss_pred HHHCCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCEECC T ss_conf 99499732008999999986312111127886778886499999999999999996910999999999999976887777 Q ss_pred CCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHH Q ss_conf 78760189875883788877878770134565389999999999999998659981899972638999559879999999 Q gi|254780277|r 863 FGRRIHYDEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQ 942 (976) Q Consensus 863 ~GRrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~ 942 (976) +|||||||+|+|.|+++|++|||+|+|||||||||||||+|||+++++++..+++++||||||||||||||+++++++++ T Consensus 771 ~gRrr~ip~i~s~n~~~r~~~eR~AiN~piQGtAADiiK~AMi~~~~~l~~~~~~~~mlLQVHDELvfEv~~~~~~~~~~ 850 (880) T PRK07300 771 FKRRRELPDINSRNFNVRSFAERTAINSPIQGSAADILKIAMINLDKALQAGGFKSKMLLQVHDEIVLEVPNDELTAIKK 850 (880) T ss_pred CCCEECCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHH T ss_conf 78813589867888778879989881602417899999999999999998459885189997404588517889999999 Q ss_pred HHHHHHHHCCCCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 9999965101357444464388833208937819 Q gi|254780277|r 943 IIVHSMEKACLPKINLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 943 ~~~~~Me~~~~~~~~l~VPl~v~~~~G~~W~e~~ 976 (976) +|+++||+|. +|+|||+|++++|+||+||- T Consensus 851 ~v~~~Me~~~----~l~VPL~v~~~~G~nW~eaK 880 (880) T PRK07300 851 LVKETMEAAI----ELAVPLIADENTGQTWYEAK 880 (880) T ss_pred HHHHHHHCCC----CCCCCEEEECCCCCCHHHCC T ss_conf 9999983752----47888373018488967719 No 17 >COG0749 PolA DNA polymerase I - 3'-5' exonuclease and polymerase domains [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=1385.06 Aligned_cols=586 Identities=45% Similarity=0.757 Sum_probs=560.9 Q ss_pred CCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCH Q ss_conf 50354799999999999986685232332224331100124420102577665443455023137443334565001788 Q gi|254780277|r 367 SYTKIVNTKDIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLT 446 (976) Q Consensus 367 ~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~ 446 (976) .|..+++.+.+..|+..+...+.+++|+||++++++.+.++|+++++.+ +++|+|+.+. +.+. T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~et~~l~~~~~~lvg~s~~~~~---------~~~yi~~~~~------~~~~-- 64 (593) T COG0749 2 PYGTITDLAVLNAWLTKLNAAANIAFDTETDGLDPHGADLVGLSVASEE---------EAAYIPLLHG------PEQL-- 64 (593) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCEEEEEEECCC---------CCEEEEECCC------HHHH-- T ss_conf 5125667888999999874223310100014567555874678862365---------4236751366------5555-- Q ss_pred HHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCCCCCCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCC---CCCCCC Q ss_conf 999974444202221001100321237889861886664-2100234554316876677246677622444---223356 Q gi|254780277|r 447 KEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYGISMRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYT---RKEILK 522 (976) Q Consensus 447 ~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~---~~e~~~ 522 (976) +++..|++||+++...|++||+|||+++|.++||. .+ .+|||||+|+++|+.+.|++++|+++|+++. .+++.+ T Consensus 65 -~~~~~l~~~l~~~~~~kv~~~~K~d~~~l~~~Gi~-~~~~~DtmlasYll~~~~~~~~~~~l~~r~l~~~~~~~~~i~~ 142 (593) T COG0749 65 -NVLAALKPLLEDEGIKKVGQNLKYDYKVLANLGIE-PGVAFDTMLASYLLNPGAGAHNLDDLAKRYLGLETITFEDIAG 142 (593) T ss_pred -HHHHHHHHHHHCCCCCHHCCCCCHHHHHHHHCCCC-CCCHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHC T ss_conf -14899999863754103213413669999973976-5651789999961486767577789999825877402487632 Q ss_pred CCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 54322210000001233332221111124567763222100-01345788607501233221014310058999986434 Q gi|254780277|r 523 SRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVE-KLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAE 601 (976) Q Consensus 523 ~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~-~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~ 601 (976) ++.++.++..++.+....|+++|+++|++|+..|++++.+. .|..+|++|||||++||+.||.+||.||.+.|+.++.+ T Consensus 143 kg~~~~~~~~~~~~~~~~y~a~~a~~~~~L~~~l~~~l~~~~~L~~l~~~iE~Pl~~VLa~ME~~Gi~vD~~~L~~l~~e 222 (593) T COG0749 143 KGKKQLTFADVKLEKATEYAAEDADATLRLESILEPELLKTPVLLELYEEIEMPLVRVLARMERNGIKVDVQYLKELSKE 222 (593) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCEECHHHHHHHHHH T ss_conf 54456761001477888878888899999999999876310469999999754089999986862856429999999999 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCC-CCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 368999999999740345577776566679987420788-7555665453531788776530476279999999878889 Q gi|254780277|r 602 ISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLKFP-SGAKTKTGQWKTTAQDLEQINCGDNPIIKNILEWRQLSKI 680 (976) Q Consensus 602 l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~LgLp-~~kKTktG~~ST~~~~Le~L~~~~hp~~~~iLeyR~l~KL 680 (976) +..++.+++++||++||++||||||||||.|||++|||| +.||||||+|||+++|||+|.. .||+|+.||+||+++|| T Consensus 223 l~~~l~~le~eiy~laG~~FNi~SPKQL~~ILfeKl~Lp~~~kKtktG~yST~~~vLe~L~~-~h~i~~~iL~~Rql~KL 301 (593) T COG0749 223 LGCELAELEEEIYELAGEEFNINSPKQLGEILFEKLGLPPGLKKTKTGNYSTDAEVLEKLAD-DHPLPKLILEYRQLAKL 301 (593) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHH-CCCCHHHHHHHHHHHHH T ss_conf 99999999999999826857899778999998885189855554578997627999998652-48738999999878888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCCEEEEEE Q ss_conf 88653200111100123334533002221000001333433345563322224302424899489996326544368766 Q gi|254780277|r 681 KSTYADSLPNHINKRTQRVHTFYSLASTMTGRLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIELRILAH 760 (976) Q Consensus 681 ~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ 760 (976) +|||+|+||++|+|.||||||+|||++|+||||||++|||||||+|+++||+||+||+|++||++++||||||||||||| T Consensus 302 ksTY~d~L~~~i~~~t~RIHTsf~Q~~t~TGRLSSsdPNLQNIPiRse~Gr~IR~aFva~~g~~~i~aDYSQIELRilAH 381 (593) T COG0749 302 KSTYTDGLPKLINPDTGRIHTSFNQTGTATGRLSSSDPNLQNIPIRSEEGRKIRKAFVAEKGYTLISADYSQIELRILAH 381 (593) T ss_pred HHHHHHCCHHHHCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHCEECCCCCEEEEECHHHHHHHHHHH T ss_conf 77764022875188899327616877777503357999755677677767866640557999868870157999999998 Q ss_pred EEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCH Q ss_conf 51037778734069888999998980999688998888630251356440777035678819998999999999999888 Q gi|254780277|r 761 IAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFP 840 (976) Q Consensus 761 ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p 840 (976) +|+|+.|++||++|.|||+.||++|||+|.++||+++|+.||+|||||+||||+||||++||||++||+.||++||++|| T Consensus 382 ls~D~~Ll~AF~~g~DiH~~TA~~vFgv~~~~Vt~e~Rr~AKaINFGiiYG~safgLa~~L~I~~~eA~~~I~~YF~ryp 461 (593) T COG0749 382 LSQDEGLLRAFTEGEDIHTATAAEVFGVPIEEVTSEQRRKAKAINFGLIYGMSAFGLAQQLGIPRKEAKEYIDRYFERYP 461 (593) T ss_pred HCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHCCCEEECCCHHHHHHHCCCCHHHHHHHHHHHHHHCH T ss_conf 61889999988658637789999994998233899987565530354143451334798749886888999999998686 Q ss_pred HHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 89999999999999849888277876018987588378887787877013456538999999999999999865998189 Q gi|254780277|r 841 GIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKM 920 (976) Q Consensus 841 ~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ 920 (976) +|++||++++++|+++|||+|++|||||+|+|+|.|+..|++|||.|+|+|||||||||+|+|||.++++|+..++++|| T Consensus 462 gv~~ym~~~~~~ar~~GyV~Tl~gRRry~p~i~s~n~~~R~~aER~AiNaPIQGTAADiiK~AMI~vd~~l~~~~~~~rl 541 (593) T COG0749 462 GVKEYMERTKEEAREDGYVETLFGRRRYLPDINSSNRVVRAAAERAAINAPIQGTAADIIKLAMIKVDKALKEEKLKARL 541 (593) T ss_pred HHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 89999999999999749354102660248621017788887888988357674448999999998679998634321456 Q ss_pred EEEECCEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 99726389995598799999999999965101357444464388833208937819 Q gi|254780277|r 921 LLQIHDELVFEVPEEEITTASQIIVHSMEKACLPKINLRVPLKVNIKVSNNWQEEN 976 (976) Q Consensus 921 ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~~~~~~l~VPl~v~~~~G~~W~e~~ 976 (976) +||||||||||||++++++++++|++.||+|. +|+|||+|++++|+||+||| T Consensus 542 lLQVHDELvfEv~~~e~e~~~~~v~~~Me~a~----~L~VPL~vdv~~g~nW~ea~ 593 (593) T COG0749 542 LLQVHDELVFEVPKEELEEVKKLLKAIMENAV----NLSVPLEVDVGIGKNWDEAH 593 (593) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH----CCCCCEEEECCCCCCHHHCC T ss_conf 77554556411768689999999999998765----04786698617776834139 No 18 >pfam00476 DNA_pol_A DNA polymerase family A. Probab=100.00 E-value=0 Score=973.08 Aligned_cols=382 Identities=49% Similarity=0.761 Sum_probs=375.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCC-CCCCCCCHHHHHHHHCCCCHHH Q ss_conf 058999986434368999999999740345577776566679987420788755566-5453531788776530476279 Q gi|254780277|r 590 IDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLKFPSGAKTK-TGQWKTTAQDLEQINCGDNPII 668 (976) Q Consensus 590 vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~LgLp~~kKTk-tG~~ST~~~~Le~L~~~~hp~~ 668 (976) ||.++|++++.++..++.+++++||+++|++||||||+||++|||++||||+.+||+ +|+||||+++|++|..+.||++ T Consensus 1 vd~~~l~~~~~~l~~~~~~l~~~i~~~aG~~fN~~S~~ql~~vLf~~l~l~~~~~t~~~g~~st~~~~L~~l~~~~hp~~ 80 (383) T pfam00476 1 VDVEYLEELSNELGAQLADLEFKIPELAGEEFNLGSPKQLGVLLFEELGLPKTKKTDKTGARSTNADVLESLREDAHEII 80 (383) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHH T ss_conf 98899999999999999999999999959976999989999984973599999977899998678999998761688699 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEE Q ss_conf 99999987888988653200111100123334533002221000001333433345563322224302424899489996 Q gi|254780277|r 669 KNILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTGRLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLISA 748 (976) Q Consensus 669 ~~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~a 748 (976) +.||+||+++||+|||+++|+..+++.||||||+|||+||+||||||++|||||||.|+++|++||+||+|+|||+||+| T Consensus 81 ~~il~~R~~~Kl~~t~~~~l~~~~~~~~gRih~~~nq~gt~TGRlSs~~PNlQniP~~~~~g~~iR~~Fva~~G~~lv~a 160 (383) T pfam00476 81 KIILEYRQLSKLQSTYVDKLPLMIDPDDGRIHTSYNQAGTATGRLSSTDPNLQNIPIRNEYGREIRAAFIAEPGYVLVAA 160 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCEEECCCCCEEECCCCCCCCCCCCCCCCCCHHHHHEEECCCCCEEEEE T ss_conf 99999999999999999998886267898680124226420220356798645799888430556743627999878850 Q ss_pred ECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHH Q ss_conf 32654436876651037778734069888999998980999688998888630251356440777035678819998999 Q gi|254780277|r 749 DYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEA 828 (976) Q Consensus 749 DySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA 828 (976) |||||||||+||+|+|+.|+++|++|.|+|+.||+.+||+|.++|++++|+.||++|||++||||+++||++||+|.+|| T Consensus 161 DySQiElRilA~ls~D~~l~~af~~g~Dih~~tA~~ifg~~~~~v~~~~R~~AK~~~fGi~YG~g~~~La~~l~is~~eA 240 (383) T pfam00476 161 DYSQIELRILAHLSGDENLIEAFRTGADIHTLTAADIFGVDLHEVTGNQRRNAKTFNFGRIYGASAKGLSQLLGISREEA 240 (383) T ss_pred CHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHH T ss_conf 58776688887520548899987027658898899984999788685567677771316644846889998858889999 Q ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999988889999999999999849888277876018987588378887787877013456538999999999999 Q gi|254780277|r 829 ADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVH 908 (976) Q Consensus 829 ~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~ 908 (976) ++++++||++||+|++|++++.+.|+++|||+|++||||++|++++.++..|+.++|+|+|+|||||||||+|+||++++ T Consensus 241 ~~~i~~~~~~yp~i~~~~~~~~~~a~~~g~v~T~~GRrr~lp~~~~~~~~~~~~~~r~a~N~~IQGsAADi~k~Ami~i~ 320 (383) T pfam00476 241 KEFIEKYFERFPGVKRYREKTRKEAKKGGYVETLFGRRRYLPDIDSRNRSLREAAERAALNTPIQGSAADILKLAMIKLD 320 (383) T ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 99999999978369999999999999689611456876755775666788886776765353268689999999999999 Q ss_pred HHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCHHHH Q ss_conf 9998659981899972638999559879999999999996510135744446438883320893781 Q gi|254780277|r 909 KSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKACLPKINLRVPLKVNIKVSNNWQEE 975 (976) Q Consensus 909 ~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~~~~~~l~VPl~v~~~~G~~W~e~ 975 (976) +++++.+++++|++|||||||||||++++++++.++++|||++. +|+|||.|++++|+||+|| T Consensus 321 ~~l~~~~~~~~lvlqVHDElv~evp~~~~~~~~~~v~~~Me~~~----~l~VPl~v~~~~G~~Wgea 383 (383) T pfam00476 321 EALVEKGLDARMCLQVHDEIVFEVPKEEAEAVAAQIKELMERAM----FLDVPLLVEVGQGRNWFSA 383 (383) T ss_pred HHHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHHHC----CCCCCEEEECCCCCCHHCC T ss_conf 99985499978999635411899768999999999999996042----7899679653827880147 No 19 >cd06444 DNA_pol_A DNA polymerase family A, 5'-3' polymerase domain. Family A polymerase (polymerase I) functions primarily to fill DNA gaps that arise during DNA repair, recombination and replication. DNA-dependent DNA polymerases can be classified into six main groups based upon phylogenetic relationships with E. coli polymerase I (classA), E. coli polymerase II (class B), E.coli polymerase III (class C), euryarchaeota polymerase II (class D), human polymerase beta (class X), E. coli UmuC/DinB and eukaryotic RAP 30/Xeroderma pigmentosum variant (class Y). Family A polymerases are found primarily in organisms related to prokaryotes and include prokaryotic DNA polymerase I, mitochondrial polymerase gamma, and several bacteriophage polymerases including those from odd-numbered phage (T3, T5, and T7). Prokaryotic Pol I has two functional domains located on the same polypeptide; a 5'-3' polymerase and a 5'-3' exonuclease. Pol I uses its 5' nuclease activity to remove the ribonucleotide po Probab=100.00 E-value=0 Score=906.99 Aligned_cols=379 Identities=54% Similarity=0.882 Sum_probs=371.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHHH Q ss_conf 05899998643436899999999974034557777656667998742078875556654535317887765304762799 Q gi|254780277|r 590 IDQDLLKQVSAEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLKFPSGAKTKTGQWKTTAQDLEQINCGDNPIIK 669 (976) Q Consensus 590 vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~LgLp~~kKTktG~~ST~~~~Le~L~~~~hp~~~ 669 (976) ||.+.|+++..++..++.++++++++++|.+||++||+||+++||++||+|+.++|++| +||++++|+.+ ...||+++ T Consensus 1 id~~~~~~~~~~l~~~l~~l~~~~~~~~g~~fn~~S~~ql~~~L~~~lg~~~~~~~~~~-~st~~~~l~~~-~~~hp~~~ 78 (379) T cd06444 1 VDKEELEELSAELEKELEELEKEIYELAGEEFNINSPKQLGEVLFEKLGLPVSKKTKTG-YSTDEEVLEKL-AEAHPIIA 78 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCC-CCCCHHHHHHH-HHCCCHHH T ss_conf 98899999999999999999999999969907999999999999985499989888899-99859999976-43384899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEE Q ss_conf 99999878889886532001111001233345330022210000013334333455633222243024248994899963 Q gi|254780277|r 670 NILEWRQLSKIKSTYADSLPNHINKRTQRVHTFYSLASTMTGRLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLISAD 749 (976) Q Consensus 670 ~iLeyR~l~KL~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv~aD 749 (976) .||+||+++|++|||+++|++.+++.|||||++|||+||+||||||++|||||||.+++.|+++|+||+|+|||+||+|| T Consensus 79 ~ile~R~~~kl~st~~~~~~~~~~~~~gRih~~~~q~gt~TGR~Ss~~PNlQniP~~~~~~~~~R~~f~~~~G~~lv~aD 158 (379) T cd06444 79 LILEYRKLAKLKSTYVDALPKLINPKTGRIHTSFNQTVTATGRLSSSNPNLQNIPIRTEEGREIRKAFVAPPGYVLLSAD 158 (379) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHEEECCCCCEEEEEC T ss_conf 99999999999999999998760689997966885277540012467876457888871167675247479998799842 Q ss_pred CCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHH Q ss_conf 26544368766510377787340698889999989809996889988886302513564407770356788199989999 Q gi|254780277|r 750 YSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAA 829 (976) Q Consensus 750 ySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~ 829 (976) ||||||||+||+|+|+.|+++|++|.|+|+.||+.|||+|.++|++++|+.||++|||++||||+++||++||+|.+||+ T Consensus 159 ySqiElRvlA~ls~D~~l~~~f~~g~Dih~~~A~~~~~~~~~~v~~~~R~~aK~~~fgl~YG~G~~~la~~l~~s~~eA~ 238 (379) T cd06444 159 YSQIELRILAHLSGDPALIEAFNNGEDIHSRTAAEVFGVPEEEVTKEERRKAKAINFGIIYGMGAFGLAEQLGISREEAK 238 (379) T ss_pred HHHCHHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHCCCCCHHHHH T ss_conf 31121888888717699999987388789999999839984435988998865767575328345329750199999999 Q ss_pred HHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999888899999999999998498882778760189875883788877878770134565389999999999999 Q gi|254780277|r 830 DYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVHK 909 (976) Q Consensus 830 ~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~ 909 (976) ++++.||.+||+|++|+++++++|+++|||+|++||||++|++++.+...|++++|+|+|+|||||||||+|.||+++++ T Consensus 239 ~~~~~f~~~~p~v~~~~~~~~~~a~~~g~v~T~~GR~r~~~~~~~~~~~~~~~~~r~a~N~~iQGsaAdi~k~ami~i~~ 318 (379) T cd06444 239 EFIDRYFATYPGVKEYLEKTVEEAREGGYVETLFGRRRYLPEINSPNKNVRSAAERQAINTPIQGSAADIIKLAMIKIEK 318 (379) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999786566789999999997797766778853477556788889878766567787465899999999999999 Q ss_pred HHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCHHH Q ss_conf 99865998189997263899955987999999999999651013574444643888332089378 Q gi|254780277|r 910 SLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKACLPKINLRVPLKVNIKVSNNWQE 974 (976) Q Consensus 910 ~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~~~~~~l~VPl~v~~~~G~~W~e 974 (976) +++..+++++|++|||||||+|||+++++++.+++++|||++. +|+|||.|++++|+||+| T Consensus 319 ~l~~~~~~~~lv~~VHDEiv~ev~~~~~~~~~~~i~~~Me~a~----~l~VPl~v~~~iG~~Wge 379 (379) T cd06444 319 LLKEKGLDARLLLQVHDELIFEVPEEELEEAAALVKENMENAV----KLSVPLKVDIKIGKNWGE 379 (379) T ss_pred HHHHCCCCCEEEEEECEEEEEEECHHHHHHHHHHHHHHHHCCC----CCCCCEEEECCCCCCCCC T ss_conf 9996299978999725424899879999999999999985635----778837866372699999 No 20 >smart00475 53EXOc 5'-3' exonuclease. Probab=100.00 E-value=0 Score=638.95 Aligned_cols=255 Identities=51% Similarity=0.841 Sum_probs=246.3 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHCCC Q ss_conf 72899966269998874267544689885500699999999999998485547987999972748871324086521667 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKANR 84 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa~R 84 (976) |+++||||++++||||||+|.+. +++|.||||||||+++|.+++++++| +|++||||++++||||++||+||||| T Consensus 1 k~lllIDg~~l~~Ray~a~~~l~-~~~G~~t~ai~gf~~~l~~l~~~~~p----~~i~v~fD~~~~~~R~~i~p~YKanR 75 (259) T smart00475 1 KKLLLVDGSSLAFRAYFALPPLK-NSKGEPTNAVYGFLRMLLKLIKEEKP----TYVAVVFDAKGKTFRHELYPEYKANR 75 (259) T ss_pred CEEEEEECCHHHHHHHHCCCCCC-CCCCCCCCHHHHHHHHHHHHHHHCCC----CEEEEEEECCCCCCHHHHCHHHHCCC T ss_conf 95999936389999994898775-99999835799999999999987699----77999981899865144455553289 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC---- Q ss_conf 9879758864899999999709979956890166589999999984899499986870121223797599966788---- Q gi|254780277|r 85 PKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE---- 160 (976) Q Consensus 85 ~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~---- 160 (976) ++||++|..|++.+++++++||||+++.+||||||+|||+|+++..++.+|+|+|+|||++|||+++|.+|+|.++ T Consensus 76 ~~~p~~l~~q~~~i~~~l~~~gi~~~~~~g~EADDvIatla~~~~~~~~~v~IvS~DkDl~QLv~~~v~v~~~~~~~~~~ 155 (259) T smart00475 76 PKTPDELLEQIPLIKELLDALGIPVLEVEGYEADDVIATLAKKAEAEGYEVRIVSGDKDLLQLVSDKVSVLDPTKGIKEF 155 (259) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHCCCCEEEEECCCCCCCC T ss_conf 99978999999999999997699876238854799999999999986992699508837888488871999815899656 Q ss_pred CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 35389999888489887877778751874358998776360249989841367899986454200112232200357899 Q gi|254780277|r 161 EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETA 240 (976) Q Consensus 161 ~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a 240 (976) +.++++.|.|+|||+|+|++||+||+||+|||||||||||||||++||++|||||+||+|+|++++ |++++|++++|++ T Consensus 156 ~~~~~~~v~e~~gv~P~q~~d~~aL~GD~sDnIpGV~GIG~KtA~kLL~~ygsle~i~~~~d~i~~-k~~~~l~~~~~~~ 234 (259) T smart00475 156 ELYTPENVIEKYGLTPEQIIDYKALMGDSSDNIPGVPGIGEKTAAKLLKEFGSLENILENLDKLKK-KLREKLLAHKEDA 234 (259) T ss_pred EEECHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCH-HHHHHHHHHHHHH T ss_conf 796599999884899899863764046453589999984789999999983999999983985488-9999999779999 Q ss_pred HHHHHHHHCCCCCCCCCCHHHHCCC Q ss_conf 9765543203564224763552126 Q gi|254780277|r 241 RLSRKLVMLRTDVPITIPLEHLILE 265 (976) Q Consensus 241 ~ls~~L~tL~~Dvpl~~~~~~l~~~ 265 (976) ++||+||||++|+|++++++++++. T Consensus 235 ~ls~~L~~l~~d~~l~~~l~~~~~k 259 (259) T smart00475 235 KLSRKLATIETDVPLEVDLEDLRLK 259 (259) T ss_pred HHHHHHHHEEECCCCCCCHHHHCCC T ss_conf 9889862040389799999994159 No 21 >PRK09482 xni exonuclease IX; Provisional Probab=100.00 E-value=0 Score=625.87 Aligned_cols=253 Identities=35% Similarity=0.529 Sum_probs=241.4 Q ss_pred CCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECC--CCCHHHHCCH Q ss_conf 98897289996626999887426754468988550069999999999999848554798799997274--8871324086 Q gi|254780277|r 1 MQKENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYP--AVTFRNEIYP 78 (976) Q Consensus 1 m~~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~--~~tfR~~~~~ 78 (976) |.+ +++||||+|++||+|||+|.. +.++|+++||+++|.++|++++| +|++||||++ ++||||++|| T Consensus 1 Mm~--~llLIDG~~l~~Ray~a~p~~-----~~~~~~~~~~~~~l~kli~~~~P----~~i~v~fD~~~~~~~fR~e~~~ 69 (256) T PRK09482 1 MMN--HLLIIDALNLIRRIHAVQPSP-----NDIAACVETCQHALDKLIRHSQP----THAVAVFDGDERSSGWRHQLLP 69 (256) T ss_pred CCC--CEEEECCCHHHHHHHHCCCCC-----CCCCCHHHHHHHHHHHHHHHCCC----CEEEEEEECCCCCCCCHHHHHH T ss_conf 997--289984628999998437789-----99886789999999999997199----8799998479987653777648 Q ss_pred HHHCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECC Q ss_conf 52166798797588648999999997099799568901665899999999848994999868701212237975999667 Q gi|254780277|r 79 DYKANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTV 158 (976) Q Consensus 79 ~YKa~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~ 158 (976) +|||||++||++|.+|++.++++|++||||+++.+||||||+|||+|+++...+++|+|+|+|||++|||+++|++|++. T Consensus 70 ~YKanR~~~pd~L~~Q~~~i~~~l~~~gi~~~~~~g~EADDiIatla~~~~~~~~~v~IvS~DkDl~QLv~~~v~v~~~~ 149 (256) T PRK09482 70 DYKAGRKPMPEALAQGLPAIRAAFEELGIDSWLADGNEADDLIATLAVKVAQAGHQATIVSTDKGYCQLLSPTIQIRDYF 149 (256) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHCCCCEEEEECC T ss_conf 87609999998999999999999997799898357814889999999999977996999966888999675984999786 Q ss_pred CCCEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHH Q ss_conf 88353899998884898878777787518743589987763602499898413678999864542001122322003578 Q gi|254780277|r 159 KEEKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAE 238 (976) Q Consensus 159 ~~~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e 238 (976) +.+.++++.|.++|||+|+|++||+||+||+|||||||||||||||++||++|||||+||+|+|+++ +|++++|++++| T Consensus 150 ~~~~~~~~~v~ek~Gv~P~q~~d~~aL~GD~SDNIPGV~GIG~KtA~~LL~~fgsle~i~~n~d~i~-~k~~~~l~~~~e 228 (256) T PRK09482 150 QKRWLDIPFIEQEFGVEPQQLPDYWGLAGISSSKIPGVAGIGPKSAAELLNQFRSLEGIYESLDALP-EKWRKKLEEHKE 228 (256) T ss_pred CCCCCCHHHHHHHCCCCHHHHHHEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHCC-HHHHHHHHHCHH T ss_conf 7713179998785282873532302474654368999998588899999998550999998377556-899999996899 Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHCCC Q ss_conf 999765543203564224763552126 Q gi|254780277|r 239 TARLSRKLVMLRTDVPITIPLEHLILE 265 (976) Q Consensus 239 ~a~ls~~L~tL~~Dvpl~~~~~~l~~~ 265 (976) +|++||+|+||++|+|++++++++++. T Consensus 229 ~a~lsk~L~tl~~d~~l~~~l~~l~l~ 255 (256) T PRK09482 229 MARLCRKLAQLQTDLPLGGNLQQLRLN 255 (256) T ss_pred HHHHHHHHHHCCCCCCCCCCHHHHCCC T ss_conf 999989850440189899899992058 No 22 >cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity. Probab=100.00 E-value=0 Score=580.55 Aligned_cols=240 Identities=50% Similarity=0.806 Sum_probs=233.6 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHCCC Q ss_conf 72899966269998874267544689885500699999999999998485547987999972748871324086521667 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKANR 84 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa~R 84 (976) |+++||||++++||+|||+|....+.+|.||||++||+++|.+++++++| +|++||||++++||||++||+||||| T Consensus 1 k~lllIDg~~l~~R~~~a~~~~~~~~~G~~t~ai~gf~~~l~~l~~~~~p----~~i~v~fD~~~~t~R~~l~p~YKanR 76 (240) T cd00008 1 KRLLLIDGSSLAYRAYFALPPLKNSPKGLPTNAVYGFLNMLLKLIKEYKP----TYVAVVFDAGGKTFRHELYPEYKANR 76 (240) T ss_pred CEEEEECCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC----CEEEEEECCCCCCCHHHHHHHHHHCC T ss_conf 95999926299999997187778899998712899999999999986599----88999972899862387728877378 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCCCEEC Q ss_conf 98797588648999999997099799568901665899999999848994999868701212237975999667883538 Q gi|254780277|r 85 PKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKEEKID 164 (976) Q Consensus 85 ~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~~~~~ 164 (976) +++|++|..|++.+++++++||||++..+||||||+|||+|+++...+.+|+|+|+|||++|||+++|.+|+|.+++.++ T Consensus 77 ~~~p~~l~~q~~~~~~~l~~~gi~~~~~~~~EADD~ia~la~~~~~~~~~v~IvS~DkD~~QLv~~~v~~~~~~~~~~~~ 156 (240) T cd00008 77 KKMPEELREQIPLIKELLEALGIPVLEIEGYEADDVIGTLAKKAEAEGYKVVIVSGDKDLLQLVSDNVKVVSPMKKKLVT 156 (240) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHCCCCCCCCCCCC T ss_conf 99958999999999999997799789746843899999999999866970699628806888724330013555576578 Q ss_pred HHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 99998884898878777787518743589987763602499898413678999864542001122322003578999765 Q gi|254780277|r 165 IEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETARLSR 244 (976) Q Consensus 165 ~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~ls~ 244 (976) .+.|.++|||+|+|++||+||+||+|||||||||||||||++||++|||||+||+|++++++ |++++|++++|++++|| T Consensus 157 ~~~v~~~~gv~p~q~~d~~aL~GD~sDnIpGV~GiG~KtA~kLl~~~gsle~i~~~~~~~~~-k~~~~l~~~~~~a~ls~ 235 (240) T cd00008 157 EENVIEKYGVTPAQIIDYKALMGDSSDNIPGVPGIGEKTAAKLLKEYGSLEGILENLDKIKG-KLREKLEEGKEMAFLSK 235 (240) T ss_pred HHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHCCH-HHHHHHHHCHHHHHHHH T ss_conf 89999960989999987699728764688999985789999999980999999982985278-99999996899999999 Q ss_pred HHHHC Q ss_conf 54320 Q gi|254780277|r 245 KLVML 249 (976) Q Consensus 245 ~L~tL 249 (976) +|+|| T Consensus 236 ~L~tl 240 (240) T cd00008 236 RLATI 240 (240) T ss_pred HHHCC T ss_conf 85349 No 23 >COG0258 Exo 5'-3' exonuclease (including N-terminal domain of PolI) [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=518.97 Aligned_cols=286 Identities=44% Similarity=0.679 Sum_probs=269.7 Q ss_pred CCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHH Q ss_conf 98897289996626999887426754468988550069999999999999848554798799997274887132408652 Q gi|254780277|r 1 MQKENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDY 80 (976) Q Consensus 1 m~~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~Y 80 (976) |...+++++|||++++||+|||+|....+..|.||||++||.+||.++++..+| +|+++|||++.+||||++|++| T Consensus 7 ~~~~~~l~~IDg~~~lyr~~~a~~~~~~~~~g~~~~~~~~~~~~l~~~~~~~~~----~~~~~vFD~~~~tfR~~~~~~y 82 (310) T COG0258 7 MNKSGKLLLIDGSSLLYRALHALPQPLGNPLGDPTGAVSGFLGMLYRLIRLLEP----THPVVVFDGKPPTFRHELLEEY 82 (310) T ss_pred HHCCCCEEEEECCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC----CCEEEEECCCCCCHHHHHHHHH T ss_conf 011685699968578999997033012687778444999999999999985678----6179997699962278889999 Q ss_pred HCCCCC-CCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCC Q ss_conf 166798-7975886489999999970997995689016658999999998489949998687012122379759996678 Q gi|254780277|r 81 KANRPK-IPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVK 159 (976) Q Consensus 81 Ka~R~~-~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~ 159 (976) |++|.+ +|++|.+|++.+++++.++|+++++.+||||||+|||+|+++...|..+.|+|+|||++||+++++.++++++ T Consensus 83 K~~R~~~~p~~l~~q~~~i~~~~~~~~~~~l~~~G~eadd~i~t~A~~a~~~g~~~~I~S~DkD~lql~~~~~~~~~~~~ 162 (310) T COG0258 83 KANREKEMPDELAPQIPILTELLVALGIPLLELMGIEADDPIETLAQKAYKKGDVVLIISGDKDLLQLVSPNVLVINGKK 162 (310) T ss_pred HHCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCHHHHHHHHCCCCEEEEECCCCEEEECCCCEEEEECCC T ss_conf 84689558899999889999999983825532789683144408999988649957999789670011489617862577 Q ss_pred CC---EECHHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 83---53899998884-898878777787518743589987763602499898413678999864542001122322003 Q gi|254780277|r 160 EE---KIDIEHVIKKW-GIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILE 235 (976) Q Consensus 160 ~~---~~~~~~v~~~~-gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~ 235 (976) +. .++...|.++| |++|.|++||++|+||+|||||||||||||||.+||++|||+|+||++++.++ ++.+++|.+ T Consensus 163 ~~~~~~~~~~~~~e~~~g~~p~qliD~~~L~Gd~sDnipGV~GIG~ktA~~Ll~~~gs~e~i~~~~~~~~-~~~~~~l~~ 241 (310) T COG0258 163 GEPEKFLDLEEVEEKFKGLTPEQLIDLKALVGDSSDNIPGVKGIGPKTALKLLQEYGSLEGLYENLDIIK-KKTREKLLE 241 (310) T ss_pred CCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHCC-CCHHHHHHH T ss_conf 8852348999999985699989988999982965357899998389999999998385999998775212-202566787 Q ss_pred CHHHHHHHHHHHHCCCCCCCCCCHHH---HCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 57899976554320356422476355---212676768899865323368999876531 Q gi|254780277|r 236 YAETARLSRKLVMLRTDVPITIPLEH---LILEDCNGPRLISFLKALEFTKLINRVATT 291 (976) Q Consensus 236 ~~e~a~ls~~L~tL~~Dvpl~~~~~~---l~~~~~d~~~l~~~~~~lef~~ll~rL~~~ 291 (976) +++++++|+.|+|+.+|+|++++.++ +...+.|.+.+..++.+++|+++.+++... T Consensus 242 ~~~~afl~~~l~t~~~d~~l~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~l~~~~~~~ 300 (310) T COG0258 242 DKEKAFLSKPLATIKTDVPLEFDLEDILELLVPEHDFSKLLEERVELGFKRLLKAIGSV 300 (310) T ss_pred HHHHHHCCCHHHCCCCCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 79998607453112566644667340145426621266679999986169998775443 No 24 >smart00482 POLAc DNA polymerase A domain. Probab=100.00 E-value=0 Score=440.99 Aligned_cols=206 Identities=54% Similarity=0.855 Sum_probs=202.3 Q ss_pred CCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHH Q ss_conf 22243024248994899963265443687665103777873406988899999898099968899888863025135644 Q gi|254780277|r 730 GQKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSII 809 (976) Q Consensus 730 g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~ 809 (976) |++||+||+|+|||+||++||||||+||+||+|+|+.|+++|++|.|+|+.+|+.+||+|.+++++++|+.||++|||++ T Consensus 1 g~~~R~~f~a~~G~~lv~aD~sqiE~Rv~A~la~D~~l~~~~~~g~D~h~~~A~~~f~~~~~~~~~~~R~~aK~~~~g~~ 80 (206) T smart00482 1 GREIRRAFVAPPGYVLVSADYSQIELRILAHLSGDENLLEAFNNGGDIHSKTAAQVFGVPEEEVTKELRRAAKAINFGII 80 (206) T ss_pred CCHHHCEEECCCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHHHHHCCCCHHHCCHHHHHHHCHHHHHHH T ss_conf 90432267679999999988668999999998298779998743898689999996498801157566988567888665 Q ss_pred HCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHHHHHHHHHHHHH Q ss_conf 07770356788199989999999999998888999999999999984988827787601898758837888778787701 Q gi|254780277|r 810 YGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPKSSIRNINERAAIN 889 (976) Q Consensus 810 YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~~~~r~~aeR~a~N 889 (976) ||+|+.+|++++|+|.+||++++++||+.||++++|.+++.+.+.+.||++|++||+|++|++.+.....+...+|+|+| T Consensus 81 YG~G~~~la~~~g~~~~ea~~~~~~~~~~~p~i~~~~~~~~~~~~~~g~v~t~~Gr~~~~~~~~~~~~~~~~~~~~~~~N 160 (206) T smart00482 81 YGMGAKGLAEQLGISEAEAKELIKAYFARFPGVKRYIKRTLEEARRKGYVTTLFGRRRYIPDIDSRNPVLRAAAERAAVN 160 (206) T ss_pred CCCCCCCHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEEECCCCCCCCHHHHHHHHHHHCC T ss_conf 18651102455188999999999999997835999999999999877966678887776787687565455357876607 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCEEEEECCHH Q ss_conf 3456538999999999999999865998189997263899955987 Q gi|254780277|r 890 APIQGSAADITRRAMISVHKSLEDHKLATKMLLQIHDELVFEVPEE 935 (976) Q Consensus 890 ~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~ 935 (976) +|||||||||+|.||+++++++...+++.++++|||||||+|||+| T Consensus 161 ~~vQgtaadi~~~al~~~~~~~~~~~~~~~iv~~VHDeiv~evp~e 206 (206) T smart00482 161 APIQGSAADILKLAMIKMDEALKEKGLRARLLLQVHDELVFEVPEE 206 (206) T ss_pred HHCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCC T ss_conf 1022899999999999999999853999779985360259961899 No 25 >pfam02739 5_3_exonuc_N 5'-3' exonuclease, N-terminal resolvase-like domain. Probab=100.00 E-value=0 Score=391.48 Aligned_cols=164 Identities=46% Similarity=0.725 Sum_probs=153.6 Q ss_pred CEEEEEECCHHHHHHHHCCCC-CCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHCC Q ss_conf 728999662699988742675-4468988550069999999999999848554798799997274887132408652166 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPL-LSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKAN 83 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~-~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa~ 83 (976) |+++||||++++||||||+|. ...+++|.||||||||+++|.+++++++|+ |++||||++++|||||+||+|||| T Consensus 1 KklllIDg~~l~~Ray~a~p~~~l~~~~G~pt~ai~gf~~~l~~li~~~~p~----~i~v~fD~~~~t~R~elyp~YKa~ 76 (169) T pfam02739 1 KKLLLIDGSSLAFRAFFALPKVPLTNSKGEPTNAVYGFLRMLLKLLKEEKPT----YVAVVFDAGAPTFRHELYEEYKAN 76 (169) T ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCCEEHHHHHHHHHHHHHHHHCCCC----EEEEEECCCCCCCHHHHHHHHHCC T ss_conf 9699991649999999728878773899988028999999999999974997----699996599997278879988619 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC--- Q ss_conf 79879758864899999999709979956890166589999999984899499986870121223797599966788--- Q gi|254780277|r 84 RPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE--- 160 (976) Q Consensus 84 R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~--- 160 (976) |++||++|..|++.++++++++||++++.+||||||+|||+|+++.+++.+|+|+|+||||+|||+++|+||.+.++ T Consensus 77 R~~~p~~l~~Q~~~i~~~l~~~gi~~~~~~g~EADDvIatla~~~~~~~~~v~I~S~DkDl~QLv~~~~~V~~~~~~~~~ 156 (169) T pfam02739 77 RAKTPDELPPQIPLIKELLDALGIPVLEVPGYEADDVIGTLAKKAEKEGYDVRIVSGDKDLLQLVSDKVNVAVLNPGRTE 156 (169) T ss_pred CCCCCHHHHHHHHHHHHHHHHCEEEEEEECCCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHCCCCCEEEEECCCCCC T ss_conf 99993789999999999987501447852485315679999999985899199988999858853299039994189977 Q ss_pred -CEECHHHHHHHH Q ss_conf -353899998884 Q gi|254780277|r 161 -EKIDIEHVIKKW 172 (976) Q Consensus 161 -~~~~~~~v~~~~ 172 (976) +.+|++.|.||| T Consensus 157 ~~~~~~~~v~ek~ 169 (169) T pfam02739 157 FTLYTPEEVVEKY 169 (169) T ss_pred CEEECHHHHHCCC T ss_conf 7798889972149 No 26 >cd06139 DNA_polA_I_Ecoli_like_exo The 3'-5' exonuclease domain of DNA polymerases has a fundamental role in reducing polymerase errors and is involved in proofreading activity. E.coli-like Polymerase I (pol I), a subgroup of family-A DNA polymerases, contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain in the same polypeptide chain as the polymerase domain. The exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. E. coli DNA pol I is involved in genome replication but is not the main replicating enzyme. It is also implicated in DNA repair. Probab=100.00 E-value=5.9e-39 Score=302.72 Aligned_cols=187 Identities=30% Similarity=0.543 Sum_probs=172.1 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCH Q ss_conf 98668523233222433110012442010257766544345502313744333456500178899997444420222100 Q gi|254780277|r 384 LETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFL 463 (976) Q Consensus 384 L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~ 463 (976) +.+.+.+||||||||+||+.++++|+|+|+.++. ++|+|+.|... ..+...+++++.|+++|+|+++. T Consensus 2 l~~~~~ia~DtETtgl~~~~~~ivGisls~~~~~--------~~YiP~~h~~~----~~~l~~~~v~~~lk~~led~~i~ 69 (193) T cd06139 2 LEKAKVFAFDTETTSLDPMQAELVGISFAVEPGE--------AYYIPLGHDYG----GEQLPREEVLAALKPLLEDPSIK 69 (193) T ss_pred CCCCCEEEEECCCCCCCCCCCEEEEEEEEECCCC--------EEEEEECCCCC----CCCCCHHHHHHHHHHHHCCCCCC T ss_conf 7559878997836999868884999999807996--------89987236565----44679999999999997399986 Q ss_pred HHHCCCCHHHHHHHHCCCCCCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCC---CCCCCCCCCCCCCHHHCCHHHH Q ss_conf 1100321237889861886664-21002345543168766772466776224442---2335654322210000001233 Q gi|254780277|r 464 KIGHNIKYDKLVLHRYGISMRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTR---KEILKSRKSSIPIDKISDSQVQ 539 (976) Q Consensus 464 kIgHN~KfD~~~L~~~GI~l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~---~e~~~~~k~~~~~~~i~~e~~~ 539 (976) ||+||+|||+++|.++||.+.+ +|||||||||+||+.++|+|++++++||++.+ +++.|+++++.+++.++.+.+. T Consensus 70 Kv~hn~k~D~~~L~~~gi~~~~~~~DTmiaa~ll~~~~~~~~L~~La~~~l~~~~~~~~~l~gkg~~~~~~~~~~~~~~~ 149 (193) T cd06139 70 KVGQNLKFDLHVLANHGIELRGPAFDTMLASYLLNPGRRRHGLDDLAERYLGHKTISFEDLVGKGKKQITFDQVPLEKAA 149 (193) T ss_pred EEEECCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCHHHHHCCCCCCCCHHHCCHHHHH T ss_conf 88754797899999839866783021899998727998888999999998188766389983778553783133299999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHCCCCHHHHHH Q ss_conf 33222111112456776322210-00134578860750123322 Q gi|254780277|r 540 EHAIENSNVILQLWLLLRPKLIV-EKLLHVYERLDKPMIDVVSQ 582 (976) Q Consensus 540 ~Yaa~Da~~t~~L~~~l~~~L~~-~~l~~l~~~iE~Pl~~vLa~ 582 (976) .|||+||++|++||+.|.++|++ +++.++|++|||||++||++ T Consensus 150 ~YAa~Da~~t~~L~~~l~~~L~~~~~l~~ly~~iE~PL~~VLa~ 193 (193) T cd06139 150 EYAAEDADITLRLYELLKPKLKEEPGLLELYEEIEMPLIPVLAE 193 (193) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCC T ss_conf 99999999999999999999984220999999984569998279 No 27 >cd06128 DNA_polA_exo The 3'-5' exonuclease domain of family-A DNA polymerases has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-A DNA polymerases contain a DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-B DNA polymerases. The exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, which are clustered around the active site and contain four invariant acidic residues that serve as ligands for the two metal ions required for catalysis. The Klenow fragment (KF) of Escherichia coli pol I, the Thermus aquaticus (Taq) pol I, and Bacillus stearothermophilus (BF) pol I are examples of family-A DNA polymerases. They are involved in nucleotide excision repair and in the processing of Okazaki fragments that are generated during lagging strand synthesis. The N-terminal domains of BF pol I and Taq pol I resemble the fold of the 3'-5' exonuclease domain of KF Probab=100.00 E-value=1.3e-36 Score=285.65 Aligned_cols=162 Identities=30% Similarity=0.498 Sum_probs=149.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHH Q ss_conf 68523233222433110012442010257766544345502313744333456500178899997444420222100110 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIG 466 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIg 466 (976) +..+||||||||+|++.++|||||+|+.++. ++|+|+.|.... ...+...+.+++.|+++|+|+++.||| T Consensus 1 a~~~a~DtETt~ld~~~a~iVGis~s~~~~~--------a~YiP~~h~~~~--~~~~l~~~~vl~~L~p~led~~i~Kig 70 (166) T cd06128 1 APVFAFDTETDSLDNISANLVGLSFAIEPGV--------AAYIPVAHDYLD--APDQISRERALELLKPLLEDEKALKVG 70 (166) T ss_pred CCEEEEECCCCCCCCCCCEEEEEEEECCCCE--------EEEEECCCCCCC--CHHHHHHHHHHHHHHHHHCCCCCCEEE T ss_conf 9989997887998904354999999800896--------999952677667--525778999999999997399976453 Q ss_pred CCCCHHHHHHHHCCCCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCCC---CCCCCCCCCCCCCHHHCCHHHHHHH Q ss_conf 03212378898618866642-1002345543168766772466776224442---2335654322210000001233332 Q gi|254780277|r 467 HNIKYDKLVLHRYGISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYTR---KEILKSRKSSIPIDKISDSQVQEHA 542 (976) Q Consensus 467 HN~KfD~~~L~~~GI~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~~---~e~~~~~k~~~~~~~i~~e~~~~Ya 542 (976) ||+|||+++|.++||.++++ |||||||||+||++++|+|+.|+++||++.+ +++.|+|+++.+|++++++.+..|| T Consensus 71 hNlK~D~~vL~~~Gi~l~g~~fDTMlaaylldp~~~~h~Ld~La~~~L~~~~i~~~dl~GkGk~q~~f~~v~i~~~~~YA 150 (166) T cd06128 71 QNLKYDRGILANYGIELRGIAFDTMLESYILNSVAGRHDMDSLAERWLKHKTITFEEIAGKGKNQLTFNQIALEEAGRYA 150 (166) T ss_pred ECHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHHHCCCCCEEHHHHHCCCCCCCCCCCCCHHHHHHHH T ss_conf 11566799996199668999845767987628999889969999998399870599983788575893228999999989 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 2211111245677632 Q gi|254780277|r 543 IENSNVILQLWLLLRP 558 (976) Q Consensus 543 a~Da~~t~~L~~~l~~ 558 (976) |+||++||+||+.|++ T Consensus 151 ~eDAd~t~rL~~~l~P 166 (166) T cd06128 151 AEDADVTLQLHLKMWP 166 (166) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 9999999999985389 No 28 >cd06140 DNA_polA_I_Bacillus_like_exo Bacillus stearothermophilus-like Polymerase I (pol I), a subgroup of the family-A DNA polymerases, contains an inactive DnaQ-like 3'-5' exonuclease domain in the same polypeptide chain as the polymerase region. The exonuclease-like domain of these proteins possess the same fold as the Klenow fragment (KF) of Escherichia coli pol I, but does not contain the four critical metal-binding residues necessary for activity. The function of this domain is unknown. It might act as a spacer between the polymerase and the 5'-3' exonuclease domains. Some members of this subgroup, such as those from Bacillus sphaericus and Thermus aquaticus, are thermostable DNA polymerases. Probab=99.97 E-value=2.7e-32 Score=254.18 Aligned_cols=175 Identities=24% Similarity=0.346 Sum_probs=152.9 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHH Q ss_conf 68523233222433110012442010257766544345502313744333456500178899997444420222100110 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIG 466 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIg 466 (976) .+..++|+||++++++.++++|+++|..+ .++|+|+.+.. ...+.|++||+|+++.||+ T Consensus 3 ~~~~av~~e~~~~~~~~~~i~gi~l~~~~---------~~~yi~~~~~~------------~~~~~lk~~l~d~~i~Ki~ 61 (178) T cd06140 3 ADEVALYVELLGENYHTADIIGLALANGG---------GAYYIPLELAL------------LDLAALKEWLEDEKIPKVG 61 (178) T ss_pred CCCEEEEEECCCCCCCCCEEEEEEEEECC---------EEEEEECCCCH------------HHHHHHHHHHCCCCCCEEE T ss_conf 88669997379998453549999998899---------79999566736------------4699999997299998898 Q ss_pred CCCCHHHHHHHHCCCCCCCC-CCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHH Q ss_conf 03212378898618866642-10023455431687667724667762244422335654322210000001233332221 Q gi|254780277|r 467 HNIKYDKLVLHRYGISMRGF-EDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIEN 545 (976) Q Consensus 467 HN~KfD~~~L~~~GI~l~~i-~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~D 545 (976) ||+|||+++|.++||+++|+ ||||||+||+||++++|+|++++.+||++...+......++..+..++.+....|||+| T Consensus 62 hn~K~~~~~L~~~Gi~l~g~~fDtmiaayLl~p~~~~~~L~~L~~~yl~~~~~~~~~~~gk~~~~~~~~~~~~~~ya~~~ 141 (178) T cd06140 62 HDAKRAYVALKRHGIELAGVAFDTMLAAYLLDPTRSSYDLADLAKRYLGRELPSDEEVYGKGAKFAVPDEEVLAEHLARK 141 (178) T ss_pred ECHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCHHHHHCCCCCCCCCCHHHHHHHHHHH T ss_conf 65799999999889806883036888988754897779999999998286665478861766562439979999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 1111245677632221000134578860750123322 Q gi|254780277|r 546 SNVILQLWLLLRPKLIVEKLLHVYERLDKPMIDVVSQ 582 (976) Q Consensus 546 a~~t~~L~~~l~~~L~~~~l~~l~~~iE~Pl~~vLa~ 582 (976) |.++++||+.|+++|++.++.++|++|||||++||++ T Consensus 142 a~~~~~L~~~l~~~L~e~~l~~Ly~~iE~PL~~VLa~ 178 (178) T cd06140 142 AAAILRLAPKLEEELEENEQLELYYEVELPLAEVLAE 178 (178) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999999824899999987129998189 No 29 >PHA00439 exonuclease Probab=99.97 E-value=1.3e-30 Score=241.83 Aligned_cols=241 Identities=20% Similarity=0.234 Sum_probs=158.6 Q ss_pred CCCCCEEEEEECCHHHHHHHHCCCCCCCCCCCC------EECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHH Q ss_conf 988972899966269998874267544689885------50069999999999999848554798799997274887132 Q gi|254780277|r 1 MQKENHLFLVDGSSFIYRAFYATPLLSRKHDGL------PVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRN 74 (976) Q Consensus 1 m~~~~~~~liDg~~~~~ra~~a~p~~~~~~~G~------~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~ 74 (976) |+ +|..|||||-.++|||-.|.-.-..=-++. ..-|..-|.+.+.++...+.. ...+.+++||-. .+|||+ T Consensus 1 ~~-~k~~LLIDaD~lvYka~~aaE~eidwgddv~tl~~d~~ea~~~v~~~i~~i~~~~~a-~~~~~iIl~fsd-~~NFRk 77 (288) T PHA00439 1 MS-DKGVLVMDGDYLVFQAMAAAEVETDWGEDIWTLECDHAKARQILEDSIKSYKTRKKA-WEDAPIVLAFTD-SVNWRK 77 (288) T ss_pred CC-CCCEEEEECHHHHHHHHHHHCCCCCCCCCEEEEECCHHHHHHHHHHHHHHHHHHHCC-CCCCCEEEEECC-CCCCCC T ss_conf 98-676699942099999877605554478987999489999999999999999987444-577878999589-866254 Q ss_pred HCC-HHHHCCCCCC--CHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCC-CEEEECCCCHHHEECCC Q ss_conf 408-6521667987--9758864899999999709979956890166589999999984899-49998687012122379 Q gi|254780277|r 75 EIY-PDYKANRPKI--PEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGF-AVTIISTDKDLMQLVSP 150 (976) Q Consensus 75 ~~~-~~YKa~R~~~--p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~-~v~i~s~DkD~~Qlv~~ 150 (976) +++ |+||+||+.. |-- ..++++.+.+.-..+++..+|+|||||||++|.+....|. .++|||+||||.|+... T Consensus 78 ~i~~P~YK~nRk~~rKP~g---y~~l~~~~~~~~~y~~i~~p~lEADDviGilat~~~~~g~~~~iivS~DKDm~qiPg~ 154 (288) T PHA00439 78 ELVDPTYKANRKAKRKPVG---YKKFLEELMEREEWKSILEPGLEGDDVMGIIATNPSLFGADKAVLVSCDKDFKTIPNC 154 (288) T ss_pred CCCCCCCCCCCCCCCCCCC---HHHHHHHHHHCCCCCEEEECCCCCCCEEEEEECCCCCCCCCCEEEEECCCCCCCCCCE T ss_conf 4578550003578888605---9999999862367536974585512013553126654689862898279882308860 Q ss_pred CEEEEECCCCC-EECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHH---HHHHH----- Q ss_conf 75999667883-5389999888489887877778751874358998776360249989841367899---98645----- Q gi|254780277|r 151 TTCLYDTVKEE-KIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLEN---ILINA----- 221 (976) Q Consensus 151 ~v~~~~~~~~~-~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~---i~~~~----- 221 (976) .. .+...+.. ..+++ .+..+-.+++|+||++||+|||||||+|||.+||.+-+.+|. .+.+- T Consensus 155 ~~-~~~~~~~~~~~t~~--------~ad~~~~~QtL~GD~tDgy~GvpGiG~ktA~klL~~~~~~~~~~~~~k~g~~kg~ 225 (288) T PHA00439 155 DF-LWCTTGNILTQTPE--------TADRWHLFQTIKGDSTDGYSGIPGWGPDTAEAFLNNPYIFEQVEKVLKSGKNKGQ 225 (288) T ss_pred EE-ECCCCCCEEECCHH--------HHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCHHCCCCCCCCC T ss_conf 55-02687663653765--------6889888886247476897899884889999986374422210000023444554 Q ss_pred -----------------HHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCC Q ss_conf -----------------4200112232200357899976554320356422476 Q gi|254780277|r 222 -----------------SRIKQKKRRENILEYAETARLSRKLVMLRTDVPITIP 258 (976) Q Consensus 222 -----------------d~i~~~k~~~~l~~~~e~a~ls~~L~tL~~Dvpl~~~ 258 (976) +.+.. ..++=--..+.|++.-+|+.|.+.=+-++. T Consensus 226 t~~~w~k~~~~~~~~~W~~vv~--~y~k~g~te~dAl~qARlARILR~~dyd~~ 277 (288) T PHA00439 226 TVTKWVKRAPGDEETLWDCIVS--LGAKAGMTEEDIIKQAQMARILRAEDYDFI 277 (288) T ss_pred CCCCCCCCCCCCCHHHHHHHHH--HHHHCCCCHHHHHHHHHHHHHCCCCCCCCC T ss_conf 3110012466542028999999--999859999999999899987054245766 No 30 >cd00128 XPG Xeroderma pigmentosum G N- and I-regions (XPGN, XPGI); contains the HhH2 motif; domain in nucleases. XPG is a eukaryotic enzyme that functions in nucleotide-excision repair and transcription-coupled repair of oxidative DNA damage. Functionally/structurally related to FEN-1; divalent metal ion-dependent exo- and endonuclease, and bacterial and bacteriophage 5'3' exonucleases. Probab=99.97 E-value=7.3e-29 Score=229.11 Aligned_cols=261 Identities=25% Similarity=0.329 Sum_probs=199.5 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHCCC Q ss_conf 72899966269998874267544689885500699999999999998485547987999972748871324086521667 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKANR 84 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa~R 84 (976) .+.+.|||+..+||+.+|.+.... ++|.+|+.+.||++++.+++..- -..++|||++.|.+..+.....+..| T Consensus 24 gk~iAIDas~~Lyq~l~~~~~~~~-~~~~~~~~l~g~~~ri~~Ll~~~------I~PifVFDG~~p~~K~~~~~~R~~~r 96 (316) T cd00128 24 GKKVAIDASIWLYQFLKACRQELG-SGGETTSHLQGFFYRTCRLLELG------IKPVFVFDGKPPPLKAETLAKRRERR 96 (316) T ss_pred CCEEEEECHHHHHHHHHHCCHHHC-CCCCCCHHHHHHHHHHHHHHHCC------CEEEEEECCCCCCCHHHHHHHHHHHH T ss_conf 998999718999999985372331-48986679999999999999779------95999989989864347889889999 Q ss_pred CCCCHHHH-----------------------HHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCC Q ss_conf 98797588-----------------------6489999999970997995689016658999999998489949998687 Q gi|254780277|r 85 PKIPEMLL-----------------------PQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTD 141 (976) Q Consensus 85 ~~~p~~l~-----------------------~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~D 141 (976) .+..+.+. .++..++++|+.||||++..|+ |||...|.|++ .|.--.|+|.| T Consensus 97 ~~a~~~~~~~~~~~~~~~~~k~~~~~~~it~~~i~~~k~LL~~~GIpyi~Ap~-EAEAqcA~L~k----~g~vd~v~SeD 171 (316) T cd00128 97 EEAEEEAKEALEKGLEEEAKKLERRAVRVTPQMIEEAKELLRLMGIPYIVAPY-EAEAQCAYLAK----KGLVDAIITED 171 (316) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCEEECCC-HHHHHHHHHHH----CCCEEEEEECC T ss_conf 99999999999715999999998501558767789999999986998773580-39999999997----79858999656 Q ss_pred CHHHEECCCCEEEEECCC----CCEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHH Q ss_conf 012122379759996678----8353899998884898878777787518743589987763602499898413678999 Q gi|254780277|r 142 KDLMQLVSPTTCLYDTVK----EEKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENI 217 (976) Q Consensus 142 kD~~Qlv~~~v~~~~~~~----~~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i 217 (976) .|+++.-.+.|.--.... ...++.+.+.+.+|++++|++|+..|+| +|+.|||||||||||.+|+++|||+|++ T Consensus 172 sD~l~fGa~~vlr~l~~~~~~~~~~~~~~~i~~~l~l~~~~~id~~iL~G--~Dy~~gi~giG~k~A~kli~~~~~~e~~ 249 (316) T cd00128 172 SDLLLFGAPRVYRNLFDSGAKPVEEIDLEKILKELGLTREKLIDLAILLG--CDYTEGIPGIGPVTALKLIKKYGDIEKD 249 (316) T ss_pred CHHHHCCCCEEEEECCCCCCCCEEEEEHHHHHHHCCCCHHHHHHHHHHHC--CCCCCCCCCCCHHHHHHHHHHHCCHHHH T ss_conf 11453288778861454689876998889978773999999999999729--8878999973599999999991999999 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHH-HHHHHHHHHHHHHH Q ss_conf 86454200112232200357899976554320356422476355212676768899865-32336899987653 Q gi|254780277|r 218 LINASRIKQKKRRENILEYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFL-KALEFTKLINRVAT 290 (976) Q Consensus 218 ~~~~d~i~~~k~~~~l~~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~-~~lef~~ll~rL~~ 290 (976) ++++++-+. ..-.. -..+.+ +++- .+-++.-+...+.|..+|.+.+..|+ .+.+|. -.|+.. T Consensus 250 ~~~l~~~~~-~~~~~--~~~~~~---~~~f---~~p~v~~~~~~~~~~~pd~~~l~~fl~~e~~f~--~~rv~~ 312 (316) T cd00128 250 IERLKKKLY-RSPED--FPLKEA---REFF---LNPEVTDDFIDLRWRDPDEEGIIEFLCKEHGFN--EDRVLK 312 (316) T ss_pred HHHHHHCCC-CCCCC--CCHHHH---HHHH---CCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCC--HHHHHH T ss_conf 999983487-69999--997999---9985---799999986676569999899999974366989--999997 No 31 >pfam01367 5_3_exonuc 5'-3' exonuclease, C-terminal SAM fold. Probab=99.97 E-value=1.2e-30 Score=242.10 Aligned_cols=100 Identities=52% Similarity=0.777 Sum_probs=96.4 Q ss_pred CCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC Q ss_conf 98878777787518743589987763602499898413678999864542001122322003578999765543203564 Q gi|254780277|r 174 IPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETARLSRKLVMLRTDV 253 (976) Q Consensus 174 v~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~ls~~L~tL~~Dv 253 (976) |+|+|++||+||+||+|||||||||||||||++||++|||||+|++|+|++++ |++++|++++|++++||+|+||++|+ T Consensus 1 v~P~q~~d~~aL~GD~sDnIPGv~GiG~KtA~~Ll~~~gsle~i~~n~d~i~~-k~~~~l~~~~~~~~lsr~L~tL~~dv 79 (100) T pfam01367 1 LTPEQIIDYKALVGDSSDNIPGVPGIGEKTAAKLLKEYGSLENIYENLDKLKG-KLREKLLNGKEDAFLSRKLATIKTDV 79 (100) T ss_pred CCHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHCCH-HHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 98899999999929764588999998816899999981989999981987179-99999997899999999871511489 Q ss_pred CCCCCHHHHCCCCCCHHHHHH Q ss_conf 224763552126767688998 Q gi|254780277|r 254 PITIPLEHLILEDCNGPRLIS 274 (976) Q Consensus 254 pl~~~~~~l~~~~~d~~~l~~ 274 (976) |++++++++++.++|.++|.+ T Consensus 80 ~l~~~~~~l~~~~~d~~~L~e 100 (100) T pfam01367 80 PLELTLEDLRLKPPDKEELIE 100 (100) T ss_pred CCCCCHHHCCCCCCCHHHHCC T ss_conf 899999990478999888549 No 32 >smart00474 35EXOc 3'-5' exonuclease. 3\' -5' exonuclease proofreading domain present in DNA polymerase I, Werner syndrome helicase, RNase D and other enzymes Probab=99.92 E-value=9.4e-25 Score=199.14 Aligned_cols=170 Identities=25% Similarity=0.405 Sum_probs=142.7 Q ss_pred EEECCHHHHHHHHHHHHHC-CCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHH Q ss_conf 3547999999999999866-852323322243311001244201025776654434550231374433345650017889 Q gi|254780277|r 369 TKIVNTKDIQQWVQKLETI-GSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTK 447 (976) Q Consensus 369 ~~I~~~~~L~~li~~L~~~-~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~ 447 (976) ++|.+.+++..+++.+... +.++|||||++++++.++++|+++|+... ++|++..+.. . T Consensus 2 ~~i~~~~~l~~~~~~l~~~~~~i~~DtE~~~~~~~~~~l~~iql~~~~~---------~~yi~~~~~~-----------~ 61 (172) T smart00474 2 RVVTDSETLEELLEKLRAAGGEVALDTETTGLNSYSGKLVLIQISVTGE---------GAFIIDPLAL-----------G 61 (172) T ss_pred EEECCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCEEEEEEEEECCC---------EEEEEECCCC-----------H T ss_conf 6989899999999999842995999883068853787589999996898---------4899701224-----------2 Q ss_pred HHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 99974444202221001100321237889861886664210023455431687667724667762244422335654322 Q gi|254780277|r 448 EILSYLKKFFENEHFLKIGHNIKYDKLVLHRYGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSS 527 (976) Q Consensus 448 evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~ 527 (976) ...+.|+++|+++.+.|||||+|||+++|.++|+.+.+++|||+|+|+++++.+.|+|+.++.+|+++..+.. .+.+ T Consensus 62 ~~~~~l~~ll~~~~i~kv~hn~k~D~~~L~~~gi~l~~v~Dt~ia~~l~~~~~~~~~L~~l~~~~~~~~l~k~---~~~~ 138 (172) T smart00474 62 DDLEILKDLLEDETITKVGHNAKFDLHVLARFGIELENIFDTMLAAYLLLGGPSKHGLATLLKEYLGVELDKE---EQKS 138 (172) T ss_pred HHHHHHHHHHCCCCCCEEEECHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHCCEECCCC---CCCC T ss_conf 1799999998099985999745999999997798567524679987750579864899999999749231555---5646 Q ss_pred CCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2100000012333322211111245677632221 Q gi|254780277|r 528 IPIDKISDSQVQEHAIENSNVILQLWLLLRPKLI 561 (976) Q Consensus 528 ~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~ 561 (976) ........+.+..|||.||.++++||+.|.++|+ T Consensus 139 ~w~~rpl~~~~~~YAa~Da~~~~~l~~~l~~~L~ 172 (172) T smart00474 139 DWGARPLSEEQLQYAAEDADALLRLYEKLEKELE 172 (172) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 6678999099999999999999999999999729 No 33 >pfam01612 3_5_exonuc 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in E. coli polI it is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Werner syndrome is a human genetic disorder causing premature aging; the WRN protein has helicase activity in the 3'-5' direction. The FFA-1 protein is required for formation of a replication foci and also has helicase activity; it is a homologue of the WRN protein. RNase D is a 3'-5' exonuclease involved in tRNA processing. Also found in this family is the autoantigen PM/Scl thought to be involved in polymyositis-scleroderma overlap syndrome. Probab=99.91 E-value=3e-24 Score=195.43 Aligned_cols=170 Identities=24% Similarity=0.384 Sum_probs=141.2 Q ss_pred CEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHH Q ss_conf 03547999999999999866852323322243311001244201025776654434550231374433345650017889 Q gi|254780277|r 368 YTKIVNTKDIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTK 447 (976) Q Consensus 368 y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~ 447 (976) |..|++.+++.++++.+...+.++|||||++++++...++|++++...+. .+|++..+.. . T Consensus 1 y~~I~~~~~l~~~~~~l~~~~~ia~DtEt~~~~~~~~~~~~~llq~~~~~--------~~yli~~~~~-----------~ 61 (172) T pfam01612 1 YRIVTTEDELEDLIEELLNAKVVAVDTETTSLDTYSYTLVGCLIQISTGE--------GAYIIDPLAL-----------G 61 (172) T ss_pred CEEECCHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEEECCC--------CCEEEECCCC-----------H T ss_conf 97988899999999999409908998062898867613433699996798--------6077643475-----------6 Q ss_pred HHHHHHHHHHCCCCCHHHHCCCCHHHHHHHH-CCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9997444420222100110032123788986-188666421002345543168766772466776224442233565432 Q gi|254780277|r 448 EILSYLKKFFENEHFLKIGHNIKYDKLVLHR-YGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKS 526 (976) Q Consensus 448 evl~~L~~lle~~~i~kIgHN~KfD~~~L~~-~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~ 526 (976) ++++.|+++|+++++.|||||++||+++|.+ +|+.+.+++|||+|+|++++.+ +|||+.|+++|||+.... ... T Consensus 62 ~~~~~L~~ll~~~~i~Kvghn~k~D~~~L~~~~~~~~~~~~Dt~~a~~~l~~~~-~~gL~~L~~~~lg~~~~K---~~~- 136 (172) T pfam01612 62 DDLEGLKRLLEDPNITKVGHNAKFDLEVLARDFGIKLENLFDTMLAAYLLGYPR-SHSLDDLAEKYLGVELDK---SEQ- 136 (172) T ss_pred HHHHHHHHHHCCCCCCEEEEEHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCC-CCCHHHHHHHHCCCCCCH---HHE- T ss_conf 679999999759998799975899999998854766687327899998819964-356999999980987874---415- Q ss_pred CCCC-HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2210-0000012333322211111245677632221 Q gi|254780277|r 527 SIPI-DKISDSQVQEHAIENSNVILQLWLLLRPKLI 561 (976) Q Consensus 527 ~~~~-~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~ 561 (976) ..++ .....+.+..|||.||.++++||+.|.++|+ T Consensus 137 ~s~w~~rpL~~~qi~YAa~Da~~~~~l~~~l~~~l~ 172 (172) T pfam01612 137 CADWRARPLSEEQLRYAAEDADYLLRLYDKLRKELE 172 (172) T ss_pred ECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 667887898299999999999999999999999749 No 34 >cd00007 35EXOc 3'-5' exonuclease. The 35EXOc domain is responsible for the 3'-5' exonuclease proofreading activity of prokaryotic DNA polymerase I (pol I) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in E. coli pol I. 35EXOc is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Probab=99.89 E-value=1.5e-23 Score=190.45 Aligned_cols=151 Identities=28% Similarity=0.465 Sum_probs=127.5 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHC Q ss_conf 85232332224331100124420102577665443455023137443334565001788999974444202221001100 Q gi|254780277|r 388 GSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGH 467 (976) Q Consensus 388 ~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgH 467 (976) +.+|+|+||++++++.++++|+++|+... ++|+|..+.. ..+++.|+++|+++++.||+| T Consensus 1 ~~ia~D~Et~~l~~~~~~i~~iql~~~~~---------~~yi~~~~~~-----------~~~~~~l~~~l~~~~i~Kv~h 60 (155) T cd00007 1 KEVAFDTETTGLNYHRGKLVGIQIATAGE---------AAYIPDELEL-----------EEDLEALKELLEDEDITKVGH 60 (155) T ss_pred CEEEEECCCCCCCCCCCCEEEEEEEECCC---------EEEEEECCCC-----------HHHHHHHHHHHHCCCCCEEEE T ss_conf 98999898799977888099999998993---------5999814755-----------246999999982999738987 Q ss_pred CCCHHHHHHHHCCCCCCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCC---CCCCCCCCCCCHHHCCHHHHHHHH Q ss_conf 321237889861886664-2100234554316876677246677622444223---356543222100000012333322 Q gi|254780277|r 468 NIKYDKLVLHRYGISMRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKE---ILKSRKSSIPIDKISDSQVQEHAI 543 (976) Q Consensus 468 N~KfD~~~L~~~GI~l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e---~~~~~k~~~~~~~i~~e~~~~Yaa 543 (976) |+|||+.+|.++|+.+.+ ++|||+|+||+++.+++|+|++++++|+++.... ..+++ ...+...+.+.+..||+ T Consensus 61 n~k~D~~~L~~~~~~~~~~~~Dt~iaa~ll~~~~~~~~L~~l~~~~l~~~~~k~~~~~~~~--~~~~~~~l~~~~~~YAa 138 (155) T cd00007 61 DAKFDLVVLARDGIELPGNIFDTMLAAYLLNPGEGSHSLDDLAKEYLGIELDKDEQIYGKG--AKTFARPLSEELLEYAA 138 (155) T ss_pred CCCHHHHHHHHCCCCCCCCEEHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCHHHHHCCCC--CCCCCCCCHHHHHHHHH T ss_conf 3400099998628601350114999999847785523699999998398767488861776--54444799899999999 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 21111124567763222 Q gi|254780277|r 544 ENSNVILQLWLLLRPKL 560 (976) Q Consensus 544 ~Da~~t~~L~~~l~~~L 560 (976) .||.++++||+.|.++| T Consensus 139 ~D~~~~~~l~~~l~~~l 155 (155) T cd00007 139 EDADALLRLYEKLLEEL 155 (155) T ss_pred HHHHHHHHHHHHHHHHC T ss_conf 99999999999999659 No 35 >cd06142 RNaseD_exo Ribonuclease (RNase) D is a bacterial enzyme involved in the maturation of small stable RNAs and the 3' maturation of tRNA. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. In vivo, RNase D only becomes essential upon removal of other ribonucleases. Eukaryotic RNase D homologs include yeast Rrp6p, human PM/Scl-100 and the Drosophila melanogaster egalitarian protein. Probab=99.87 E-value=3.5e-22 Score=180.37 Aligned_cols=171 Identities=21% Similarity=0.199 Sum_probs=140.4 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 99999999986685232332224331100124420102577665443455023137443334565001788999974444 Q gi|254780277|r 376 DIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKK 455 (976) Q Consensus 376 ~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~ 455 (976) ++..+++.+...+.++|||||++++++.+++.+++++..+. .+.++.... ..+..|++ T Consensus 1 ~l~~~~~~l~~~~~i~~DtE~~~~~~~~~~l~LiQia~~~~---------~ylid~~~~-------------~~~~~L~~ 58 (178) T cd06142 1 ELADLCERLAGAGVIAVDTEFMRLRTYYPRLCLIQISTGGQ---------AYLIDPLAI-------------GDLSPLKE 58 (178) T ss_pred CHHHHHHHHCCCCEEEEECCCCCCCCCCCEEEEEEEEECCC---------EEEEECCCC-------------CCHHHHHH T ss_conf 98999999707994899898678855887289999986890---------699953123-------------44699999 Q ss_pred HHCCCCCHHHHCCCCHHHHHHHHC-CCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHC Q ss_conf 202221001100321237889861-8866642100234554316876677246677622444223356543222100000 Q gi|254780277|r 456 FFENEHFLKIGHNIKYDKLVLHRY-GISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKIS 534 (976) Q Consensus 456 lle~~~i~kIgHN~KfD~~~L~~~-GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~ 534 (976) +|+++++.|||||++||+.+|.++ |+.+.+++|||+|++++++. .++||+.++++|||+... +.....+|..-| T Consensus 59 lL~d~~i~KVgh~~~~Dl~~L~~~~~~~~~~v~Dt~ia~~~l~~~-~~~~L~~L~~~~lg~~l~----K~~q~s~W~~rP 133 (178) T cd06142 59 LLADPNIVKVFHAAREDLELLKRDFGILPVNLFDTQIAARLLGLG-DSVGLAALVEELLGVELD----KGEQRSDWLKRP 133 (178) T ss_pred HHCCCCCEEEEEEHHHHHHHHHHHHCCCCCCEEEHHHHHHHCCCC-CCHHHHHHHHHHHCCCCC----CCCCCCCCCCCC T ss_conf 975999439996279999999997396765453089999982886-737899999999298777----773646643367 Q ss_pred C-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 0-123333222111112456776322210001345788607 Q gi|254780277|r 535 D-SQVQEHAIENSNVILQLWLLLRPKLIVEKLLHVYERLDK 574 (976) Q Consensus 535 ~-e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~~~iE~ 574 (976) . +.+..|||.||.++++||..|.++|++.++.+.+. .|+ T Consensus 134 Ls~~qi~YAa~Da~~l~~l~~~l~~~L~~~~~~~~~~-~e~ 173 (178) T cd06142 134 LTDEQLEYAANDVRYLLPLYEKLKEELEEEGRLEWAE-EEC 173 (178) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH-HHH T ss_conf 8289999999979999999999999999868399999-999 No 36 >PRK03980 flap endonuclease-1; Provisional Probab=99.87 E-value=8.2e-21 Score=170.38 Aligned_cols=237 Identities=24% Similarity=0.306 Sum_probs=165.5 Q ss_pred CCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHCCCCCC------------CHHH----- Q ss_conf 89885500699999999999998485547987999972748871324086521667987------------9758----- Q gi|254780277|r 29 KHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKANRPKI------------PEML----- 91 (976) Q Consensus 29 ~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa~R~~~------------p~~l----- 91 (976) +++|..|+-+.||++-..++++.- -.-+.+||++-|..-.+....=+..|.+. .++. T Consensus 2 n~~G~~TSHl~G~f~R~~~ll~~g------I~PvfVFDGkpp~~K~~~l~kR~~~r~~a~~~~~~a~~~g~~e~~~k~~~ 75 (295) T PRK03980 2 DSKGRITSHLSGLFYRTINLLENG------IKPVYVFDGKPPELKAEEIEERRKVREEAEEKYEEAKEEGDLEEARKYAQ 75 (295) T ss_pred CCCCCCCHHHHHHHHHHHHHHHCC------CEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 989874078879999999999789------98999979989454599999999999999999999997589999999987 Q ss_pred ------HHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEE----E----EC Q ss_conf ------864899999999709979956890166589999999984899499986870121223797599----9----66 Q gi|254780277|r 92 ------LPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCL----Y----DT 157 (976) Q Consensus 92 ------~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~----~----~~ 157 (976) ...+.-++++|+.||||++..|+ |||...|.|++. |.---++|-|-|++..-.+.+.- . .+ T Consensus 76 r~v~it~~~i~~~k~LL~~~Gipyi~Ap~-EAeaqca~L~~~----g~v~~v~SeD~D~l~~G~~~~lrnl~~~~~~k~~ 150 (295) T PRK03980 76 RASRLTDEIVEDSKKLLDLMGIPYVQAPS-EGEAQAAYMVKK----GDAWAVASQDYDSLLFGAPRLVRNLTISGKRKLP 150 (295) T ss_pred HHCCCCHHHHHHHHHHHHHCCCCCCCCCC-HHHHHHHHHHHC----CCEEEEEECCCCHHHCCCHHHHHHCCCCCCCCCC T ss_conf 62669999999999999986987121872-599999999976----9758997046341211760576420135660245 Q ss_pred CC-------CCEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 78-------83538999988848988787777875187435899877636024998984136789998645420011223 Q gi|254780277|r 158 VK-------EEKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRR 230 (976) Q Consensus 158 ~~-------~~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~ 230 (976) .+ .+.++.+.+.+.+|+..+|++|+--|+| +|..|||||||||||.+|+++|||+|+|+++++.-... T Consensus 151 ~~~~~~~~~~e~i~l~~il~~l~lt~eqfidlcIL~G--cDY~~gI~gIGpk~Alklikk~~~ie~il~~~~~~~~~--- 225 (295) T PRK03980 151 GKNVYVEVKPELIELEEVLKELGITREQLIDIAILVG--TDYNPGVKGIGPKTALKLIKKHGDLEKVLEAVGEGIDE--- 225 (295) T ss_pred CCCCCCCCCHHHEEHHHHHHHCCCCHHHHHHHHHHCC--CCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHCCCCCCC--- T ss_conf 6443212465442399899883999999999998458--88999999842999999999969999999863767788--- Q ss_pred HHHCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHH-HHHHHHH-----HHHHHHHHH Q ss_conf 220035789997655432035642247635521267676889986-5323368-----999876531 Q gi|254780277|r 231 ENILEYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISF-LKALEFT-----KLINRVATT 291 (976) Q Consensus 231 ~~l~~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~-~~~lef~-----~ll~rL~~~ 291 (976) ..+.+.++ +|- .+-|+. +-.++.|..+|.+.|.+| +.+.+|+ +-+++|... T Consensus 226 ---~~~~~e~r---~lF---~~p~v~-~~~~l~~~~pd~e~l~~fL~~e~~fse~RV~~~i~kl~k~ 282 (295) T PRK03980 226 ---PVDPEEIR---EFF---LNPPVT-DDYELKWKKPDKEGIIEFLVEEHDFSEDRVEKALERLKKA 282 (295) T ss_pred ---CCCHHHHH---HHH---CCCCCC-CCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf ---99979999---984---799988-8788787999989999998764298999999999999987 No 37 >PTZ00217 flap endonuclease-1; Provisional Probab=99.86 E-value=3.6e-19 Score=158.38 Aligned_cols=264 Identities=23% Similarity=0.282 Sum_probs=182.2 Q ss_pred CEEEEEECCHHHHHHHHCCC------CCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCH Q ss_conf 72899966269998874267------544689885500699999999999998485547987999972748871324086 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATP------LLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYP 78 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p------~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~ 78 (976) .+.+-||++..+||--+|+. .++ +++|..|.-+.||+....+++.. . -.-+.|||++-|..-.+-.. T Consensus 28 GK~VAIDas~~LYqfl~air~~~~~~~l~-n~~G~~tshl~gl~~r~~~ll~~-----g-IkPVfVFDGkpp~~K~~tl~ 100 (394) T PTZ00217 28 GRRIAIDASMALYQFIIAIRSSGQGSELT-NESGEVTSHLNGLFNRTIRLLEE-----G-IKPIYVFDGKPPELKSGELE 100 (394) T ss_pred CCEEEEECHHHHHHHHHHHCCCCCCCCCC-CCCCCHHHHHHHHHHHHHHHHHC-----C-CEEEEEECCCCCHHHHHHHH T ss_conf 98899961499999999864442344210-46787029999999999999987-----9-97999978999435588999 Q ss_pred HHHCCCCCC------------CHHH-----------HHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCE Q ss_conf 521667987------------9758-----------86489999999970997995689016658999999998489949 Q gi|254780277|r 79 DYKANRPKI------------PEML-----------LPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAV 135 (976) Q Consensus 79 ~YKa~R~~~------------p~~l-----------~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v 135 (976) .=+..|... .++. ...+.-++++|..||||++..|+ |||-..|.|++. |.-= T Consensus 101 kRr~~R~eA~~~~~~a~e~gd~e~a~k~~~r~v~vT~e~~~~~k~LL~~mGIpyI~AP~-EAEaqcA~L~~~----g~vd 175 (394) T PTZ00217 101 KRRQRREEAEAELEKAKEEGDDEQMKKQSKRTVRVTRKQNEEAKKLLRLMGIPVIEAPS-EAEAQCAELVKK----GKAW 175 (394) T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCEEECCC-HHHHHHHHHHHC----CCEE T ss_conf 99999999999999998758999999987543538899999999999976997686580-599999999966----9657 Q ss_pred EEECCCCHHHEECCCCEEEEECC---CC---CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 99868701212237975999667---88---3538999988848988787777875187435899877636024998984 Q gi|254780277|r 136 TIISTDKDLMQLVSPTTCLYDTV---KE---EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQ 209 (976) Q Consensus 136 ~i~s~DkD~~Qlv~~~v~~~~~~---~~---~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~ 209 (976) .++|-|-|++-.=.+.+.--... +. ..++.+.+.+..|++.+|++|+--|+| .|=.|||||||||||.+|+. T Consensus 176 av~SeD~D~l~fG~p~vlr~l~~s~~~k~~~~~i~l~~il~~l~lt~eqfidlcIL~G--cDY~~~I~GIGpk~A~klIk 253 (394) T PTZ00217 176 AVATEDMDALTFGTPVLLRNLNFSEAKKRPIVEIHLDKVLKGLGLSMDQFIDLCILCG--CDYCDTIEGIGPKTAYELIK 253 (394) T ss_pred EEEECCCCCCCCCCCEEEEECCCCCCCCCCEEEEEHHHHHHHCCCCHHHHHHHHHHHC--CCCCCCCCCCCHHHHHHHHH T ss_conf 9970442300037866887313466665544998899988774999999999999818--64468998748899999999 Q ss_pred HCCCHHHHHHHHHHHHHHHHHHHHC-CCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHH-HHHHHHH----- Q ss_conf 1367899986454200112232200-35789997655432035642247635521267676889986-5323368----- Q gi|254780277|r 210 EYGNLENILINASRIKQKKRRENIL-EYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISF-LKALEFT----- 282 (976) Q Consensus 210 ~~gsle~i~~~~d~i~~~k~~~~l~-~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~-~~~lef~----- 282 (976) +|||+|+|+++++.-+- .+-+.+. +..-.+|+ .-+|. +.+.-++.|..+|.+.|..| +++.+|+ T Consensus 254 ~~~sie~il~~~~~~k~-~~p~~~~~~~~r~~Fl-------~P~V~-~~~~~~l~w~~pd~e~l~~fL~~e~~Fse~Rv~ 324 (394) T PTZ00217 254 KYGSIEEILEHLDGTKY-PVPENFPYKEARELFL-------NPKVT-PAEEIDIKWSEPDIEGLKKFLVEEKNFNEERVE 324 (394) T ss_pred HHCCHHHHHHHHCCCCC-CCCCCCCHHHHHHHHC-------CCCCC-CCCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHH T ss_conf 95999999987412467-7998799799999962-------99888-976676508999989999997646697989999 Q ss_pred HHHHHHHHH Q ss_conf 999876531 Q gi|254780277|r 283 KLINRVATT 291 (976) Q Consensus 283 ~ll~rL~~~ 291 (976) +-+.||.+. T Consensus 325 ~~~~~l~k~ 333 (394) T PTZ00217 325 KGIERLRKA 333 (394) T ss_pred HHHHHHHHH T ss_conf 999999987 No 38 >TIGR03674 fen_arch flap structure-specific endonuclease. Endonuclease that cleaves the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. Has 5'-endo-/exonuclease and 5'-pseudo-Y-endonuclease activities. Cleaves the junction between single and double-stranded regions of flap DNA Probab=99.86 E-value=8.3e-20 Score=163.04 Aligned_cols=256 Identities=23% Similarity=0.289 Sum_probs=176.9 Q ss_pred CEEEEEECCHHHHHHHHCC-----CCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHH---- Q ss_conf 7289996626999887426-----7544689885500699999999999998485547987999972748871324---- Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYAT-----PLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNE---- 75 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~-----p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~---- 75 (976) .+.+-||++..+||--.++ .+++ ++.|..|+-+.|++..+.++++. . -.-+.|||++-|-.-.+ T Consensus 21 Gk~vAIDas~~lyqfl~~ir~~~g~~l~-~~~G~~Tshl~g~f~R~~~Ll~~-g-----I~PIfVFDGkpp~~K~~t~~~ 93 (338) T TIGR03674 21 GKVVAVDAFNALYQFLSSIRQPDGTPLM-DSRGRITSHLSGLFYRTINLLEN-G-----IKPVYVFDGKPPELKAETLEE 93 (338) T ss_pred CCEEEEECHHHHHHHHHHHHCCCCCCCC-CCCCCEEHHHHHHHHHHHHHHHC-C-----CEEEEEECCCCCCCCHHHHHH T ss_conf 9889997589999999997066898142-67897408999999999999977-9-----989999799985100188999 Q ss_pred -------CCHHHHCCCCCC-CHHH-----------HHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEE Q ss_conf -------086521667987-9758-----------864899999999709979956890166589999999984899499 Q gi|254780277|r 76 -------IYPDYKANRPKI-PEML-----------LPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVT 136 (976) Q Consensus 76 -------~~~~YKa~R~~~-p~~l-----------~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~ 136 (976) ....|+.-...- .++. ...+.-++++|++||||++..|+ |||-..|.|++. |.-=- T Consensus 94 R~~~r~~a~~~~~~~~~~g~~e~~~k~~~r~~~it~~~i~~~k~lL~~~Gi~~i~Ap~-EAeaqca~L~k~----g~vda 168 (338) T TIGR03674 94 RREIREEAEEKWEEALEKGDLEEARKYAQRSSRLTSEIVESSKKLLDLMGIPYVQAPS-EGEAQAAYMAKK----GDVDY 168 (338) T ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCEEECCH-HHHHHHHHHHHC----CCEEE T ss_conf 9999999999999999778999999998774749899999999999976987685672-799999999974----97259 Q ss_pred EECCCCHHHEECCCCEEEEEC--------CCC-------CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 986870121223797599966--------788-------35389999888489887877778751874358998776360 Q gi|254780277|r 137 IISTDKDLMQLVSPTTCLYDT--------VKE-------EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGY 201 (976) Q Consensus 137 i~s~DkD~~Qlv~~~v~~~~~--------~~~-------~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~ 201 (976) ++|-|-|++-.=.+.+.--.. .++ +.++.+.+.+.+|++.+|++|+--|+| +|-.|||||||| T Consensus 169 v~SeD~D~l~fG~~~virnl~~~g~~k~~~~~~~~~~~~e~i~l~~~l~~l~lt~eqfidlcIL~G--cDY~~gI~GIG~ 246 (338) T TIGR03674 169 VGSQDYDSLLFGAPRLVRNLTISGKRKLPGKNIYVEVKPELIELEEVLSELGITREQLIDIAILVG--TDYNEGVKGIGP 246 (338) T ss_pred EECCCHHHHHCCCCEEEHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHC--CCCCCCCCCCCH T ss_conf 961540033228765101112455513566454332457783099888773899999999999708--756789998568 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHH-HHHH Q ss_conf 249989841367899986454200112232200357899976554320356422476355212676768899865-3233 Q gi|254780277|r 202 KTAALLLQEYGNLENILINASRIKQKKRRENILEYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFL-KALE 280 (976) Q Consensus 202 KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~-~~le 280 (976) |||.+|+++|||+|+|+++.+.-.. +.+.+ |+|- .+-|+. +-.++.|..+|.+.|.+|+ ++.+ T Consensus 247 k~A~klIkk~~~ie~i~~~~~~~~~---------~~~~~---r~lF---~~p~v~-d~~~~~~~~pd~e~l~~fL~~e~~ 310 (338) T TIGR03674 247 KTALKLIKEHGDLEKVLKARGEDIE---------NYDEI---REFF---LNPPVT-DDYELKWRKPDKEGIIEFLCDEHD 310 (338) T ss_pred HHHHHHHHHHCCHHHHHHHCCCCCC---------CHHHH---HHHH---CCCCCC-CCCCCCCCCCCHHHHHHHHHHCCC T ss_conf 9999999985999999986568877---------89999---9984---899989-877776799998999999553559 Q ss_pred HH-----HHHHHHHH Q ss_conf 68-----99987653 Q gi|254780277|r 281 FT-----KLINRVAT 290 (976) Q Consensus 281 f~-----~ll~rL~~ 290 (976) |+ +-++||.. T Consensus 311 fse~RV~~~i~kl~k 325 (338) T TIGR03674 311 FSEDRVERALERLEA 325 (338) T ss_pred CCHHHHHHHHHHHHH T ss_conf 799999999999997 No 39 >cd00080 HhH2_motif Helix-hairpin-helix class 2 (Pol1 family) motif. HhH2 domains are found in Rad2 family of prokaryotic and eukaryotic replication and repair nucleases, i.e., DNA polymerase I, Taq DNA polymerase, DNA repair protein Rad2 endonuclease, flap endonuclease, exonuclease I and IX, 5'-3' exonuclease and also bacteriophage Rnase H. These nucleases degrade RNA-DNA or DNA-DNA duplexes, or both and play essential roles in DNA duplication, repair, and recombination. Probab=99.82 E-value=1.2e-20 Score=169.19 Aligned_cols=75 Identities=55% Similarity=0.845 Sum_probs=71.1 Q ss_pred HHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 884898878777787518743589987763602499898413678999864542001122322003578999765543 Q gi|254780277|r 170 KKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETARLSRKLV 247 (976) Q Consensus 170 ~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~ls~~L~ 247 (976) ++|||+|+|++||++|+||+|||||||||||+|||.+||++|||+|+|+++++.++ .+++++++++.+++|++|+ T Consensus 1 e~~g~~p~q~~d~~~L~GD~sDnipGV~GIG~ktA~~ll~~~~~l~~i~~~~~~~~---~~~~~~~~~~~~~~s~~l~ 75 (75) T cd00080 1 EKLGLTPEQFIDLAILVGDKSDNIPGVPGIGPKTALKLLKEYGSLENLLENLDKIK---LREKLLEHKELAKLSKLLA 75 (75) T ss_pred CCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHH---HHHHHHHCHHHHHHHHHHC T ss_conf 98897999999999991965458877586379999999999099999998387776---8999996899999989869 No 40 >cd06147 Rrp6p_like_exo Yeast Rrp6p and its human homolog, the Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100), are exosome-associated proteins involved in the degradation and processing of precursors to stable RNAs. Both proteins contain a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PM/Scl-100, an autoantigen present in the nucleolar compartment of the cell, reacts with autoantibodies produced by about 50% of patients with polymyositis-scleroderma overlap syndrome. Probab=99.78 E-value=1.5e-18 Score=153.89 Aligned_cols=172 Identities=17% Similarity=0.251 Sum_probs=138.7 Q ss_pred CCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCC Q ss_conf 65035479999999999998668523233222433110012442010257766544345502313744333456500178 Q gi|254780277|r 366 DSYTKIVNTKDIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTL 445 (976) Q Consensus 366 ~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~ 445 (976) ..+..|.+.++++.+++.+...+.+++|||+++.+.+...+..+.++..+. .+.++.-. T Consensus 3 ~p~~~Idt~~~L~~~~~~L~~~~~ialDtE~~~~~~~~~~l~LiQiat~~~---------~yliD~l~------------ 61 (192) T cd06147 3 TPLTFVDTEEKLEELVEKLKNCKEIAVDLEHHSYRSYLGFTCLMQISTREE---------DYIVDTLK------------ 61 (192) T ss_pred CCEEEECCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCEEEEEEEECCCC---------CEEEEECC------------ T ss_conf 873998899999999999826996999797378856998279999950998---------87987434------------ Q ss_pred HHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHH-CCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCC Q ss_conf 899997444420222100110032123788986-1886664210023455431687667724667762244422335654 Q gi|254780277|r 446 TKEILSYLKKFFENEHFLKIGHNIKYDKLVLHR-YGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSR 524 (976) Q Consensus 446 ~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~-~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~ 524 (976) ..+.+..|+++|+|+++.||+||+++|+.+|.+ +|+.+.++|||++|+.+++.. ++||..++++|||...+. .. T Consensus 62 l~~~~~~L~~ll~~~~I~KV~H~~~~Dl~~L~~~~gi~~~n~fDT~iaa~~l~~~--~~sL~~Lv~~~lg~~l~K---~~ 136 (192) T cd06147 62 LRDDMHILNEVFTDPNILKVFHGADSDIIWLQRDFGLYVVNLFDTGQAARVLNLP--RHSLAYLLQKYCNVDADK---KY 136 (192) T ss_pred CCCCHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCC--CHHHHHHHHHHHCCCCCC---CC T ss_conf 3003799999975988379985578779999998589876604289999984888--111999999993958873---23 Q ss_pred CCCCCCHHHC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3222100000-012333322211111245677632221000 Q gi|254780277|r 525 KSSIPIDKIS-DSQVQEHAIENSNVILQLWLLLRPKLIVEK 564 (976) Q Consensus 525 k~~~~~~~i~-~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~ 564 (976) ...+|..-| .+.+..||+.||.+.+.||..|..+|.+++ T Consensus 137 -q~SdW~~RPLs~~Ql~YAA~Dv~~Ll~ly~~L~~~L~e~~ 176 (192) T cd06147 137 -QLADWRIRPLPEEMIKYAREDTHYLLYIYDRLRNELLERA 176 (192) T ss_pred -EECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf -1143799989999999999999999999999999999842 No 41 >COG0349 Rnd Ribonuclease D [Translation, ribosomal structure and biogenesis] Probab=99.67 E-value=4.4e-16 Score=135.91 Aligned_cols=175 Identities=19% Similarity=0.157 Sum_probs=139.4 Q ss_pred ECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHH Q ss_conf 47999999999999866852323322243311001244201025776654434550231374433345650017889999 Q gi|254780277|r 371 IVNTKDIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEIL 450 (976) Q Consensus 371 I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl 450 (976) |++.+.+..++..+...+.+|+|||+.++.++...+..|.++..++ ...|+.... .. -. T Consensus 1 i~~~~~l~~~~~~~~~~~~iAIDTEf~r~~t~~p~LcLIQi~~~e~---------~~lIdp~~~-~~-----------d~ 59 (361) T COG0349 1 ITTGDLLAAACALLRGSKAIAIDTEFMRLRTYYPRLCLIQISDGEG---------ASLIDPLAG-IL-----------DL 59 (361) T ss_pred CCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCEEEEEEECCCC---------CEEECCCCC-CC-----------CC T ss_conf 9742679999997428980798322246643688337999844888---------668622655-46-----------66 Q ss_pred HHHHHHHCCCCCHHHHCCCCHHHHHHHHC-CCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 74444202221001100321237889861-88666421002345543168766772466776224442233565432221 Q gi|254780277|r 451 SYLKKFFENEHFLKIGHNIKYDKLVLHRY-GISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIP 529 (976) Q Consensus 451 ~~L~~lle~~~i~kIgHN~KfD~~~L~~~-GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~ 529 (976) ..|..+|.|+.+.||+|.+.||+.+|.+. |+.+.++|||.||+.++.-+. +|||++|+++++|+..+ ++.+.. + T Consensus 60 ~~l~~Ll~d~~v~KIfHaa~~DL~~l~~~~g~~p~plfdTqiAa~l~g~~~-~~gl~~Lv~~ll~v~ld---K~~q~S-D 134 (361) T COG0349 60 PPLVALLADPNVVKIFHAARFDLEVLLNLFGLLPTPLFDTQIAAKLAGFGT-SHGLADLVEELLGVELD---KSEQRS-D 134 (361) T ss_pred CHHHHHHCCCCEEEEECCCCCCHHHHHHHCCCCCCCHHHHHHHHHHCCCCC-CCCHHHHHHHHHCCCCC---CCCCCC-C T ss_conf 369998628760255025635199999861888773468999999809863-23299999999688536---444546-5 Q ss_pred CHHHC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00000-0123333222111112456776322210001345788 Q gi|254780277|r 530 IDKIS-DSQVQEHAIENSNVILQLWLLLRPKLIVEKLLHVYER 571 (976) Q Consensus 530 ~~~i~-~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~~~ 571 (976) |..-| .+.+.+||+.||.+.+.||..|.++|.+.+.+....+ T Consensus 135 W~~RPLs~~Ql~YAa~DV~yL~~l~~~L~~~L~~~~r~~~a~~ 177 (361) T COG0349 135 WLARPLSEAQLEYAAADVEYLLPLYDKLTEELAREGRLEWAED 177 (361) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 5557999999999999999999999999999987485499999 No 42 >PRK10829 ribonuclease D; Provisional Probab=99.64 E-value=2.2e-15 Score=130.81 Aligned_cols=177 Identities=13% Similarity=0.052 Sum_probs=141.7 Q ss_pred CCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCH Q ss_conf 50354799999999999986685232332224331100124420102577665443455023137443334565001788 Q gi|254780277|r 367 SYTKIVNTKDIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLT 446 (976) Q Consensus 367 ~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~ 446 (976) +|..|++.++|..+++.+...+.+|+|||-.+.+.+...+..|.++.... .+.||--.. T Consensus 2 ~y~~I~t~~~L~~~~~~l~~~~~iavDTEf~r~~ty~p~l~LiQi~~~~~---------~~lIDpl~~------------ 60 (373) T PRK10829 2 NYQMITTDDALASLCEAVRAFPAIALDTEFVRTRTYYPQLGLIQLFDGEQ---------LALIDPLGI------------ 60 (373) T ss_pred EEEEECCHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCEEEEEEEEECCC---------EEEEECCCC------------ T ss_conf 15997989999999999707991898577456777898458999851896---------799947877------------ Q ss_pred HHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHH-CCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99997444420222100110032123788986-18866642100234554316876677246677622444223356543 Q gi|254780277|r 447 KEILSYLKKFFENEHFLKIGHNIKYDKLVLHR-YGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRK 525 (976) Q Consensus 447 ~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~-~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k 525 (976) .-++.|.++|.|+.+.||+|-+..|+.+|.+ .|+.+.++|||++|+-++.-+ .+.|+..|+++|+|+..+ ++. T Consensus 61 -~dl~~l~~ll~~~~i~kv~Haa~qDle~l~~~~g~~p~~iFDTQiAa~~lg~~-~~~gy~~Lv~~~l~v~l~----K~~ 134 (373) T PRK10829 61 -TDWSPFKALLRDPSITKFLHAGSEDLEVFLNAFGELPQPLIDTQILAAFCGRP-LSWGFASMVEEYSGVALD----KSE 134 (373) T ss_pred -CCHHHHHHHHCCCCCEEEEECCCCHHHHHHHHCCCCCCHHHHHHHHHHHCCCC-CCHHHHHHHHHHHCCCCC----CCC T ss_conf -77699999970988416520642159999998198874077648899865786-555899999999396656----664 Q ss_pred CCCCCHHHC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 222100000-012333322211111245677632221000134578 Q gi|254780277|r 526 SSIPIDKIS-DSQVQEHAIENSNVILQLWLLLRPKLIVEKLLHVYE 570 (976) Q Consensus 526 ~~~~~~~i~-~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~~ 570 (976) +..++..-| .+.+..||+.||.+.+.||+.|..+|++.+-+..+. T Consensus 135 ~~sDW~~RPLs~~ql~YA~~DV~yL~~l~~~L~~~l~~~~r~~w~~ 180 (373) T PRK10829 135 SRTDWLARPLSERQCEYAAADVWYLLPIAAKLMVETEAAGWLPAAL 180 (373) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 6275788999989999999999999999999999998357558999 No 43 >KOG3657 consensus Probab=99.62 E-value=2.1e-14 Score=123.66 Aligned_cols=211 Identities=25% Similarity=0.375 Sum_probs=123.7 Q ss_pred CCCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEEE--------HHH-----HHHHHHCCCCHHHHHHHHHHCCCHHHCC Q ss_conf 322224302424899489996326544368766510--------377-----7873406988899999898099968899 Q gi|254780277|r 728 DLGQKIRRAFISPPTKKLISADYSQIELRILAHIAK--------ITP-----LYQVFENSLDIHRMVAAEIFGVNIEKVS 794 (976) Q Consensus 728 ~~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~--------D~~-----l~~af~~g~DiH~~tA~~vf~~~~~~vt 794 (976) ..|-+|+...+|++||.||+||--.-||=|.|-++. -.+ |-..-.+|.|+|+.||..+ |+. T Consensus 718 Rigselkamvqappgy~LVgaDvdsqElwiaa~lgda~~~~~hg~ta~gwM~Lag~ks~gtdlhs~ta~~l-giS----- 791 (1075) T KOG3657 718 RIGSELKAMVQAPPGYRLVGADVDSQELWIAALLGDASAEGVHGKTAFGWMTLAGSKSDGTDLHSKTASQL-GIS----- 791 (1075) T ss_pred HHHHHHHHHHCCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHC-CCC----- T ss_conf 55078897610788616850563479999999963154412247522102111376656730767666650-522----- Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHH----C-C-CCHHHHHHHHHHHHHHCHHHH-HHHHHHHHHHHH----------CC Q ss_conf 88886302513564407770356788----1-9-998999999999999888899-999999999998----------49 Q gi|254780277|r 795 PQMRRHAKTINFSIIYGISPFRLATQ----L-R-IPRSEAADYIKRYFHRFPGIH-EYIEKTKNFVRQ----------NG 857 (976) Q Consensus 795 ~~~R~~aK~infgi~YG~~~~~La~~----l-~-is~~eA~~~i~~yf~~~p~v~-~~~~~~~~~a~~----------~g 857 (976) |.-||.+|||-|||.|.. .|.+ . . .+.+||+..-..-|..-.|-+ ..++..+....+ .+ T Consensus 792 ---r~hakv~Ny~riygag~~-fa~~ll~~fnp~l~~~Ea~~~A~~l~~~tkG~~~~rlk~e~~~~e~~~~~~~~~~~~~ 867 (1075) T KOG3657 792 ---RNHAKVFNYARIYGAGQT-FAEKLLMRFNPSLTQSEAKSKASQLFKLTKGDRAKRLKVEVRMVENSLVCKILTIDGI 867 (1075) T ss_pred ---HHHHHHCCHHHHHCCCCC-HHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHHCHHHHHHHH T ss_conf ---766654047877458861-6777688618998667889999999886257511005767777665320004543015 Q ss_pred CE---EC---CCCCEECCCCCCCCCHHH--------------------HH---------HHHHHHHHHHHHHHHHHHHHH Q ss_conf 88---82---778760189875883788--------------------87---------787877013456538999999 Q gi|254780277|r 858 YV---ET---IFGRRIHYDEINSPKSSI--------------------RN---------INERAAINAPIQGSAADITRR 902 (976) Q Consensus 858 yv---~T---~~GRrr~~p~i~s~~~~~--------------------r~---------~aeR~a~N~~iQGsaAdi~k~ 902 (976) |. .+ -.-||...-+-.|.-++. |+ .-.+.-||-.||+||-|-.-+ T Consensus 868 ~~~~~~~~~~~~~~~~w~gg~es~~fn~lesia~~~~prtpvlgc~is~sl~~~~~~~~~f~~srinwvvqssavd~lhl 947 (1075) T KOG3657 868 YLIYSIYENEVEPRRLWVGGTESSMFNKLESIATAHDPRTPVLGCQISRSLEKLPEGEPKFLPSRINWVVQSSAVDFLHL 947 (1075) T ss_pred HHHHHHHHCCCCHHHHCCCCHHHHHHHHHHHHHHCCCCCCCEECCCHHHHHCCCCCCCCCCCCCEEEEEEECCCHHHHHH T ss_conf 55433230543104530575188888888877640688887102413564433666775444432357763340659999 Q ss_pred HHHHHHHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHH------HHHHHHH Q ss_conf 999999999865998189997263899955987999999------9999996 Q gi|254780277|r 903 AMISVHKSLEDHKLATKMLLQIHDELVFEVPEEEITTAS------QIIVHSM 948 (976) Q Consensus 903 am~~i~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~------~~~~~~M 948 (976) -++.+.=-+....+.+|+++-||||+-+-|.|+..-+++ .++.++| T Consensus 948 llvsm~wl~~~y~i~~rfcisihdevrylv~e~d~~raalalqisnl~tr~~ 999 (1075) T KOG3657 948 LLVSMQWLCDTYKIDARFCISIHDEVRYLVKEEDAPRAALALQISNLLTRAM 999 (1075) T ss_pred HHHHHHHHHHHCCCCEEEEEEEHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH T ss_conf 9999999986415442799874176788862664277899999999999999 No 44 >cd06148 Egl_like_exo The Egalitarian (Egl) protein subfamily is composed of Drosophila Egl and similar proteins. Egl is a component of an mRNA-binding complex which is required for oocyte specification. Egl contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The conservation of this subfamily throughout eukaryotes suggests that its members may be part of ancient RNA processing complexes that are likely to participate in the regulated processing of specific mRNAs. Probab=99.60 E-value=4.7e-15 Score=128.41 Aligned_cols=175 Identities=16% Similarity=0.169 Sum_probs=127.5 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 99999986685232332224331100124420102577665443455023137443334565001788999974444202 Q gi|254780277|r 379 QWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFE 458 (976) Q Consensus 379 ~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle 458 (976) +.+..+.+...+|+|||+.++..+ +.+..+.+++..+. .+.+++.... .......|+++|+ T Consensus 2 ~ai~~L~~~~vi~iD~E~~~~~~~-g~l~LiQiat~~~~--------vyl~D~~~~~----------~~~~~~~Lk~iLe 62 (197) T cd06148 2 EAIIHLKKQKVIGLDCEGVNLGRK-GKLCLVQIATRTGQ--------IYLFDILKLG----------SIVFINGLKDILE 62 (197) T ss_pred HHHHHHHHCCEEEEECCCCCCCCC-CCEEEEEEEECCCC--------EEEEECCCCC----------CHHHHHHHHHHHC T ss_conf 589998759999998886788989-98879999648998--------8999616676----------3445689999956 Q ss_pred CCCCHHHHCCCCHHHHHHHH-CCCCCCCCCCHHHHHHHHCCCC-------CCCCHHHHHHHHCCCCCCCCCCCC----CC Q ss_conf 22100110032123788986-1886664210023455431687-------667724667762244422335654----32 Q gi|254780277|r 459 NEHFLKIGHNIKYDKLVLHR-YGISMRGFEDIMLMSYVLDSGR-------SSHDIASIAQKWLSYTRKEILKSR----KS 526 (976) Q Consensus 459 ~~~i~kIgHN~KfD~~~L~~-~GI~l~~i~DTmlaayLLdp~~-------~~~~L~~la~~yL~~~~~e~~~~~----k~ 526 (976) |+++.||+|++.+|..+|++ +||.+.++|||++|+.++.... ...||..++..||+.......... .. T Consensus 63 ~~~I~KV~H~~r~D~~~L~~~~gi~~~nvfDTqiA~~ll~~~~~~~~~~~~~~sl~~l~~~~l~~~~~~~~~~~~~~~~~ 142 (197) T cd06148 63 SKKILKVIHDCRRDSDALYHQYGIKLNNVFDTQVADALLQEQETGGFNPDRVISLVQLLDKYLYISISLKEDVKKLMRED 142 (197) T ss_pred CCCCEEEEECHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHCC T ss_conf 99823999610999999999709970670319999999753131476775641599999999576401778778887506 Q ss_pred CCCCHHHC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 22100000-01233332221111124567763222100013457886 Q gi|254780277|r 527 SIPIDKIS-DSQVQEHAIENSNVILQLWLLLRPKLIVEKLLHVYERL 572 (976) Q Consensus 527 ~~~~~~i~-~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~~~i 572 (976) ...|..-| .+.+..||+.||.+...||..|.++|..+....+|..+ T Consensus 143 ~s~W~~RPLt~~ql~YAa~DV~yL~~L~~~L~~~Li~~~~~~~~~~~ 189 (197) T cd06148 143 PKFWALRPLTEDMIRYAALDVLCLLPLYYAMLDALISKFLKAVFKYL 189 (197) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 55446799999999999998999999999999999999999999998 No 45 >KOG2519 consensus Probab=99.59 E-value=9.6e-14 Score=118.82 Aligned_cols=256 Identities=21% Similarity=0.325 Sum_probs=175.1 Q ss_pred EEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHC--------- Q ss_conf 28999662699988742675446898855006999999999999984855479879999727488713240--------- Q gi|254780277|r 6 HLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEI--------- 76 (976) Q Consensus 6 ~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~--------- 76 (976) +=+-|||+-.+|+-+-+... ..+-.|.+|.-+.|++.-+++++ +.. ...+.|||+..|.+-..- T Consensus 29 kkVAID~s~~lyqfl~~v~~-~~~~~~~~~~HL~g~f~Rt~~l~-~~g-----i~Pv~VfDG~pP~lKs~e~~kR~~rr~ 101 (449) T KOG2519 29 KKVAIDASMWLYQFLIVVRS-CRNEAGEPTSHLMGMFYRTIRLI-ENG-----IKPVYVFDGKPPDLKSQELAKRSERRS 101 (449) T ss_pred CEEEEECCEEHHHHHHHHCC-CCCCCCCCHHHHHHHHHHHHHHH-HCC-----CCEEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 35999453027657432034-55667772399999999999999-859-----957999789898753177777888765 Q ss_pred --CHHHHCCCC-CCCHH---HH--------HHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCC Q ss_conf --865216679-87975---88--------64899999999709979956890166589999999984899499986870 Q gi|254780277|r 77 --YPDYKANRP-KIPEM---LL--------PQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDK 142 (976) Q Consensus 77 --~~~YKa~R~-~~p~~---l~--------~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~Dk 142 (976) .++|+..-. ..+.+ +. ++-.-++++|..||||++..++ ||---=|.|++. |.---++|.|- T Consensus 102 ~a~~~~~~~~e~~~~~~~~k~~~r~vkvtk~~~dEak~LL~lmGIp~i~ap~-EAEAqCA~Lnk~----g~V~~~at~Ds 176 (449) T KOG2519 102 EADKELKPAKEAGAKENMEKFFSRLVKVTKQHNDEAKRLLSLMGIPVLDAPG-EAEAQCAALNKA----GKVYAVATEDS 176 (449) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCEEECCCC-HHHHHHHHHHHC----CCEEEEECCCC T ss_conf 4234320687751488888777777550100068999999972975106896-188999999646----86105644664 Q ss_pred HHHEECCCCEE--EE-ECCCC---CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHH Q ss_conf 12122379759--99-66788---35389999888489887877778751874358998776360249989841367899 Q gi|254780277|r 143 DLMQLVSPTTC--LY-DTVKE---EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLEN 216 (976) Q Consensus 143 D~~Qlv~~~v~--~~-~~~~~---~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~ 216 (976) |.+-.-.+.+. .. ...++ ..|+...+.+.+|..-.+++|+--|.| +|=.|+|.|||+++|.+|+.+|+++++ T Consensus 177 D~l~fg~~~~lr~l~~s~~~~~pv~e~~~~~il~~l~l~~~~fidL~lLlG--CDYc~~I~Gig~~~al~lir~~~~i~~ 254 (449) T KOG2519 177 DALTFGAPVKLRHLIHSLASGLPVSEYDMSRILEGLGLSRESFIDLCLLLG--CDYCPTIRGIGPKKALKLIRQHGDIEN 254 (449) T ss_pred CHHHCCCHHHHHHHCCCHHCCCCEEEEEHHHHHHHHCCCHHHHHHHHHHHC--CCCCCCCCCCCHHHHHHHHHHHCCHHH T ss_conf 224306799998720121038973774199998773432799999999966--845521256373879999997567877 Q ss_pred HHH-HHHHHHHHHHHHHHC-CCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHH-HHHHHH Q ss_conf 986-454200112232200-3578999765543203564224763552126767688998653-233689 Q gi|254780277|r 217 ILI-NASRIKQKKRRENIL-EYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLK-ALEFTK 283 (976) Q Consensus 217 i~~-~~d~i~~~k~~~~l~-~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~-~lef~~ 283 (976) |++ +.|.-+- ++-+... .....+++ --+++.+...-++.|..+|.+.+.+|+. +.+|+- T Consensus 255 ile~~~~~~~~-~ip~~w~~~~~r~~f~-------~p~~~~~~~~~~i~w~~pd~~~li~fl~~~~~f~~ 316 (449) T KOG2519 255 ILEINSDLKEY-PIPEDWSYKLARKLFL-------EPEFPNPESILDLKWKTPDTEGLIQFLVGEKQFNE 316 (449) T ss_pred HHHHCCCHHHC-CCCCCCCHHHHHHHHC-------CCCCCCCCCEEECCCCCCCHHHHHHHHHHHHCCCH T ss_conf 75112413322-8997756777888754-------84448843003413678976888999976642597 No 46 >cd06141 WRN_exo WRN is a unique RecQ DNA helicase exhibiting an exonuclease activity. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Mutations in the WRN gene cause Werner syndrome, an autosomal recessive disorder associated with premature aging and increased susceptibility to cancer and type II diabetes. WRN interacts with key proteins involved in DNA replication, recombination, and repair. It is believed to maintain genomic stability and life span by participating in DNA processes. WRN is stimulated by Ku70/80, an important regulator of genomic stability. Probab=99.45 E-value=5.4e-13 Score=113.38 Aligned_cols=162 Identities=19% Similarity=0.238 Sum_probs=115.8 Q ss_pred CCHHHHHHHHHHHH-HCCCCCCCCCCCCC--CCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHH Q ss_conf 79999999999998-66852323322243--3110012442010257766544345502313744333456500178899 Q gi|254780277|r 372 VNTKDIQQWVQKLE-TIGSASFKIITDTM--DAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKE 448 (976) Q Consensus 372 ~~~~~L~~li~~L~-~~~~ia~DtETt~l--d~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~e 448 (976) .+.....++++.+. ..+.++||+|+... +.....+-.+.+|..+. .+.+++... .. T Consensus 2 ~~~~~~~~~~~~~l~~~~vvG~D~Ew~~~~~~~~~~~~aliQlat~~~---------~~l~d~~~~------------~~ 60 (168) T cd06141 2 DSAQDAEEAVKELLDDEKVVGFDLEWRPSFRKGKRNKVALLQLASESR---------CLLFQLSHM------------DK 60 (168) T ss_pred CCHHHHHHHHHHHHCCCCEEEEECEECCCCCCCCCCCCEEEEEECCCC---------EEEEEECCC------------CH T ss_conf 748899999999856898899966455877778889715999824996---------599974033------------12 Q ss_pred HHHHHHHHHCCCCCHHHHCCCCHHHHHHHH-CCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 997444420222100110032123788986-1886664210023455431687667724667762244422335654322 Q gi|254780277|r 449 ILSYLKKFFENEHFLKIGHNIKYDKLVLHR-YGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSS 527 (976) Q Consensus 449 vl~~L~~lle~~~i~kIgHN~KfD~~~L~~-~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~ 527 (976) ....|+++|+|+++.||||++++|+..|.+ +|+.+.+++|++.++..+.+...++||..++.++||...... +.... T Consensus 61 ~~~~L~~lL~~~~i~Kvg~~~~~D~~~L~~~~g~~~~~~~Dl~~~a~~~~~~~~~~sL~~l~~~~lg~~l~K~--k~~q~ 138 (168) T cd06141 61 LPPSLKQLLEDPSILKVGVGIKGDARKLRRDFGIEVRGVVDLSHLAKRVGPRRKLVSLARLVEEVLGLPLSKD--KKVRC 138 (168) T ss_pred HHHHHHHHHCCCCEEEEEEEEHHHHHHHHHHCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCC--CCCCC T ss_conf 0499999975998079996318899999874450248765699999985777776329999999839621546--65203 Q ss_pred CCCHHHCC-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 21000000-123333222111112456776 Q gi|254780277|r 528 IPIDKISD-SQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 528 ~~~~~i~~-e~~~~Yaa~Da~~t~~L~~~l 556 (976) .+|..-|. +.+..|||.||.+.++||+.| T Consensus 139 SdW~~rpLs~~Qi~YAA~Da~~~~~l~~~L 168 (168) T cd06141 139 SNWEARPLSKEQILYAATDAYASLELYRKL 168 (168) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 888999999999999999999999999739 No 47 >cd06129 RNaseD_like The RNase D-like group is composed of RNase D, WRN, and similar proteins. They contain a DEDDy-type, DnaQ-like, 3'-5' exonuclease domain that contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. RNase D is involved in the 3'-end processing of tRNA precursors. RNase D-like proteins in eukaryotes include yeast Rrp6p, human PM/Scl-100 and Drosophila melanogaster egalitarian (Egl) protein. WRN is a unique DNA helicase possessing exonuclease activity. Mutation in the WRN gene is implicated in Werner syndrome, a disease associated with premature aging and increased predisposition to cancer. Yeast Rrp6p and the human Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100) are exosome-associated proteins involved in the degradation and processing of prec Probab=99.42 E-value=4.4e-13 Score=113.98 Aligned_cols=149 Identities=19% Similarity=0.234 Sum_probs=110.4 Q ss_pred HHHHH---HHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 99999---866852323322243311001244201025776654434550231374433345650017889999744442 Q gi|254780277|r 380 WVQKL---ETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKF 456 (976) Q Consensus 380 li~~L---~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~l 456 (976) +++.| +..-.+++|||+.+...+..++-.+.+|..+. .+.++..... ...+.|+++ T Consensus 5 ~~~~l~~~~~~~vig~DtE~~~~~~~~~~v~LiQlat~~~---------~~lid~~~~~------------~~~~~L~~l 63 (159) T cd06129 5 LIEDLAAKNESIVVALDCEWIPLRLYYGEVTLIQIGTTEH---------AFLFDVLSLS------------AFDGGLKTV 63 (159) T ss_pred HHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEEEECCE---------EEEEECCCCC------------CCHHHHHHH T ss_conf 9998875459808999797689879989789999945996---------8999548887------------677999999 Q ss_pred HCCCCCHHHHCCCCHHHHHHHH-CCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHC- Q ss_conf 0222100110032123788986-18866642100234554316876677246677622444223356543222100000- Q gi|254780277|r 457 FENEHFLKIGHNIKYDKLVLHR-YGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKIS- 534 (976) Q Consensus 457 le~~~i~kIgHN~KfD~~~L~~-~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~- 534 (976) |+|+++.||||++++|...|.+ +|+.+.+++||++++..+.-....+||..++++|+|...+ ++. ...+|..-| T Consensus 64 L~~~~i~Kvgh~~~~D~~~L~~~~g~~~~~~~Dt~~~~~~l~~~~~~~sL~~l~~~~lg~~ld---K~~-q~SnW~~rpL 139 (159) T cd06129 64 LENPSITKVGHGCRGDLAKLLRDFGILLRNLFDTQAAHAILGYPEEYWSLNSLVEQYLGPCLD---KSR-QLSDWAKRPL 139 (159) T ss_pred HHCCCCEEEEECHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCC---CCC-CCCCCCCCCC T ss_conf 829992499956699999996750843243036999999958884316599999998087877---102-4588899999 Q ss_pred CHHHHHHHHHHHHHHHHHH Q ss_conf 0123333222111112456 Q gi|254780277|r 535 DSQVQEHAIENSNVILQLW 553 (976) Q Consensus 535 ~e~~~~Yaa~Da~~t~~L~ 553 (976) .+.+..|||.||.+.+.|| T Consensus 140 t~~Qi~YAA~Da~~ll~ly 158 (159) T cd06129 140 TEEQMLYAAADVYVLLHIY 158 (159) T ss_pred CHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999976 No 48 >cd06146 mut-7_like_exo The mut-7 subfamily is composed of Caenorhabditis elegans mut-7 and similar proteins found in plants and metazoans. Mut-7 is implicated in posttranscriptional gene silencing. It contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs, termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=99.39 E-value=3.7e-12 Score=107.30 Aligned_cols=168 Identities=16% Similarity=0.206 Sum_probs=118.4 Q ss_pred EEECCHHHHHHHHHHH--HHCCCCCCCCCCC--CCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCC Q ss_conf 3547999999999999--8668523233222--43311001244201025776654434550231374433345650017 Q gi|254780277|r 369 TKIVNTKDIQQWVQKL--ETIGSASFKIITD--TMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNT 444 (976) Q Consensus 369 ~~I~~~~~L~~li~~L--~~~~~ia~DtETt--~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~ 444 (976) ++|.+.+++..++..+ +....++||+|+. ........+-.+.+|..+. .+.++.... ... T Consensus 2 ~~v~~~~~l~~~~~~~~l~~~~vvg~D~Ew~~~~~~~~~~~vaLiQlat~~~---------~~liD~~~~-------~~~ 65 (191) T cd06146 2 HIVDSEEELEALLLALSLEAGRVVGIDSEWKPSFLGDSDPRVAILQLATEDE---------VFLLDLLAL-------ENL 65 (191) T ss_pred EEECCHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCEEEEEEECCCC---------CEEEEEECC-------CCC T ss_conf 8978899999999998426898899978757877788889527999952997---------589997045-------654 Q ss_pred CHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHH-CCC------CCCCCCCHHHHHHHHCCC----------CCCCCHHH Q ss_conf 8899997444420222100110032123788986-188------666421002345543168----------76677246 Q gi|254780277|r 445 LTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHR-YGI------SMRGFEDIMLMSYVLDSG----------RSSHDIAS 507 (976) Q Consensus 445 ~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~-~GI------~l~~i~DTmlaayLLdp~----------~~~~~L~~ 507 (976) ........|+++|+++++.||||+++.|+..|.+ +|+ .+.+++|+..++..+... ...+||.+ T Consensus 66 ~~~~~~~~L~~ll~~~~i~Kvg~~~~~D~~~L~~~~~~~~~~~~~~~~~~Dl~~~a~~~~~~~~~~~~~~~~~~~~gL~~ 145 (191) T cd06146 66 ESEDWDRLLKRLFEDPDVLKLGFGFKQDLKALSASYPALKCMFERVQNVLDLQNLAKELQKSDMGRLKGNLPSKTKGLAD 145 (191) T ss_pred CHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHCCCCCCCCCCCCEEEEHHHHHHHHHCCCCCCHHHCCCCCCCCHHH T ss_conf 34778999999947999689994438689999986586432333644035799999997311221011003556756999 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCHHHCC-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 6776224442233565432221000000-123333222111112456776 Q gi|254780277|r 508 IAQKWLSYTRKEILKSRKSSIPIDKISD-SQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 508 la~~yL~~~~~e~~~~~k~~~~~~~i~~-e~~~~Yaa~Da~~t~~L~~~l 556 (976) +++++||...+. + ....+|..-|. ..+..|||.||.+.+.||++| T Consensus 146 L~~~~lg~~l~K---~-~q~SnW~~rPLt~~Qi~YAA~Da~~ll~iy~~L 191 (191) T cd06146 146 LVQAVLGKPLDK---S-EQCSNWERRPLREEQILYAALDAYCLLEVFDKL 191 (191) T ss_pred HHHHHHCCCCCC---C-CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 999990987885---6-513899999899999999999899999999739 No 49 >TIGR01388 rnd ribonuclease D; InterPro: IPR006292 These proteins represent ribonuclease D, a 3-exonuclease shown to act on tRNA both in vitro and when overexpressed in vivo. Members of this family are restricted to the Proteobacteria; Aquifex, Mycobacteria and eukaryotes. Ribonuclease D is not essential in Escherichia coli but is deleterious when overexpressed. The precise biological role of ribonuclease D is still unknown. ; GO: 0004540 ribonuclease activity, 0008033 tRNA processing, 0005737 cytoplasm. Probab=99.35 E-value=8e-12 Score=104.83 Aligned_cols=182 Identities=19% Similarity=0.141 Sum_probs=138.6 Q ss_pred EECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHH Q ss_conf 54799999999999986685232332224331100124420102577665443455023137443334565001788999 Q gi|254780277|r 370 KIVNTKDIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEI 449 (976) Q Consensus 370 ~I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~ev 449 (976) .|++.++|..+|.......++++|||=--...|--.+..|.++..+. ...||--.. +.- T Consensus 1 ~I~~~~~La~~c~~~~~~p~valDTEF~R~~TFwP~LgLiQ~a~g~~---------~~LIDPL~~------------~~~ 59 (374) T TIGR01388 1 WITTDDELATVCEALRTEPFVALDTEFVRERTFWPQLGLIQVADGEQ---------LALIDPLVI------------DID 59 (374) T ss_pred CCCCHHHHHHHHHHHCCCCEEECCCCEECCCCCCHHHCCEECCCCCE---------EEEECCCCC------------CCC T ss_conf 98866889999986147883741554112653202040031137875---------687658887------------523 Q ss_pred HHHHHHHHCCCCCHHHHCCCCHHHHHHHHC-CCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 974444202221001100321237889861-8866642100234554316876677246677622444223356543222 Q gi|254780277|r 450 LSYLKKFFENEHFLKIGHNIKYDKLVLHRY-GISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSI 528 (976) Q Consensus 450 l~~L~~lle~~~i~kIgHN~KfD~~~L~~~-GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~ 528 (976) ++-|..+|+|+.|.||.|-+.-|+-++.+. |.-+.++|||+|+|-++-= ..+.|.+.|+++++++..+ |+..... T Consensus 60 ~~Pl~~lL~d~~v~KvlHAaseDlEvF~~l~g~lP~PLfDTQi~Aa~~G~-g~s~gY~~LV~~~~~v~Ld---Kse~rtD 135 (374) T TIGR01388 60 LSPLLELLADESVVKVLHAASEDLEVFLNLFGVLPKPLFDTQIAAAFLGF-GESMGYDKLVQEVLGVELD---KSETRTD 135 (374) T ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHCC-CCCHHHHHHHHHHHCEECC---CCCCCCH T ss_conf 47999997389815853045047999998705798883058899998367-8521089999997370317---8887222 Q ss_pred CCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH Q ss_conf 1000000123333222111112456776322210001345788607501 Q gi|254780277|r 529 PIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVEKLLHVYERLDKPMI 577 (976) Q Consensus 529 ~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~~~iE~Pl~ 577 (976) .+..--.+.+.+||+.||.+.+.+|..|..+|.+.+=.....+ ||-++ T Consensus 136 W~~RPLtd~Ql~YAaaDV~yL~~~y~~L~~~l~~~~R~~~l~e-e~~~~ 183 (374) T TIGR01388 136 WLARPLTDKQLEYAAADVTYLLPLYEKLTERLAEEGRLAWLEE-ECELL 183 (374) T ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH T ss_conf 1047898778999999999999999999999988645999972-05987 No 50 >smart00279 HhH2 Helix-hairpin-helix class 2 (Pol1 family) motifs. Probab=99.31 E-value=1.1e-12 Score=111.20 Aligned_cols=36 Identities=53% Similarity=0.854 Sum_probs=35.2 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHC Q ss_conf 878777787518743589987763602499898413 Q gi|254780277|r 176 PEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEY 211 (976) Q Consensus 176 p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~ 211 (976) |+|++||++|+||+|||||||||||+|||.+||++| T Consensus 1 p~q~~d~~~L~GD~sDnipGV~GIG~ktA~~ll~~~ 36 (36) T smart00279 1 PEQLIDYAILVGDYSDNIPGVKGIGPKTALKLLREF 36 (36) T ss_pred CHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHC T ss_conf 907889999818764688999974789999999859 No 51 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=99.29 E-value=2.7e-12 Score=108.21 Aligned_cols=190 Identities=18% Similarity=0.261 Sum_probs=138.3 Q ss_pred CCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCC-CCCCCCCCCEECCCCC------ Q ss_conf 555666503547999999999999866852323322243311001244201025776-6544345502313744------ Q gi|254780277|r 361 YPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQK-DDTSFKVETIFIDLSV------ 433 (976) Q Consensus 361 ~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~-~~~~~~~~~~yip~~~------ 433 (976) ..+..+.--+|.+..+-..+--+|..+.++.||+|||||.|..++|+-++.....+. ...- . ...++.... T Consensus 178 ~~lv~D~~Piv~n~~~~~~ldykl~DaTyVVfD~ETTGLsp~~d~iIE~gA~k~~nGk~ii~-~-~~~FikP~~~l~~~~ 255 (1264) T TIGR01405 178 ANLVDDRVPIVYNPKDQKFLDYKLKDATYVVFDIETTGLSPQYDEIIEFGAVKVKNGKRIID-K-FQFFIKPKEPLSAFV 255 (1264) T ss_pred EEEECCCCCEEECCCCCCHHCEEECCCEEEEEEEEECCCCCCCCCEEEEEEEEEECCCEEEE-E-HHHHCCCCCCCCCCE T ss_conf 78604886637666345100024236207996035158888614246787588746857623-1-232328875467542 Q ss_pred ----CCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCC-CCHHHHHHH----HCCCC-CC-CCCCHHHHHHHHCCCCCC Q ss_conf ----33345650017889999744442022210011003-212378898----61886-66-421002345543168766 Q gi|254780277|r 434 ----HTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHN-IKYDKLVLH----RYGIS-MR-GFEDIMLMSYVLDSGRSS 502 (976) Q Consensus 434 ----~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN-~KfD~~~L~----~~GI~-l~-~i~DTmlaayLLdp~~~~ 502 (976) ...++.-......++||..|+.|+.+. ..|+|| |.||+.||. ++++. +. ++-||+-.|+.|+|.-++ T Consensus 256 ~elT~ITq~ml~n~~~~~~vL~k~~~f~~d~--ilVAHNGasFD~~Fl~~~~~k~~~~~~~~p~IDTL~Lar~lnP~y~s 333 (1264) T TIGR01405 256 TELTGITQDMLENAPEIEEVLEKFKEFLKDS--ILVAHNGASFDIGFLNTNFEKVGLEPLENPVIDTLELARALNPEYKS 333 (1264) T ss_pred EEECCCCHHHHCCCCCHHHHHHHHHHHHCCE--EEEECCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCC T ss_conf 6624664797379841789999999976975--88862761036688988987627742336511388999983731134 Q ss_pred CCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHC Q ss_conf 77246677622444223356543222100000012333322211111245677632221000-134578860 Q gi|254780277|r 503 HDIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVEK-LLHVYERLD 573 (976) Q Consensus 503 ~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~-l~~l~~~iE 573 (976) |+|..+|++| +.... .+.+..|.-||.+|-+++..+.+++++.+ ...+ .+|. T Consensus 334 h~Lg~~~~Kl-~v~~~-----------------~e~~HRA~YDa~~t~~vF~~~~~~~~e~gGi~~l-~~ln 386 (1264) T TIGR01405 334 HRLGEIAKKL-GVDLD-----------------DERHHRADYDAEATAKVFIVMVEQLKEKGGITNL-EELN 386 (1264) T ss_pred CCCCHHHHHC-CCCCC-----------------CCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCH-HHCC T ss_conf 4542233320-55215-----------------3301243347999999999999998750372112-3405 No 52 >PRK06309 DNA polymerase III subunit epsilon; Validated Probab=99.18 E-value=9.1e-11 Score=97.13 Aligned_cols=167 Identities=13% Similarity=0.132 Sum_probs=106.9 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCC-CCCEECCCC----CCCCCCCCCCCCCHHHHHHHHHHHHCCCCC Q ss_conf 852323322243311001244201025776654434-550231374----433345650017889999744442022210 Q gi|254780277|r 388 GSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFK-VETIFIDLS----VHTSQDSTGKNTLTKEILSYLKKFFENEHF 462 (976) Q Consensus 388 ~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~-~~~~yip~~----~~~~~d~~~~~~~~~evl~~L~~lle~~~i 462 (976) ..+.||+||||+++..++|+-++.......+.+... .+...||.. |....+.-...-...+++..+.+++.+.. T Consensus 3 ~~V~fD~ETTGl~~~~DrIIEiaav~~~~~e~f~~linP~~pIp~~a~~ihGIt~e~v~~~P~~~ev~~~~~~f~~~~~- 81 (232) T PRK06309 3 ALIFYDTETTGTQIDKDRIIEIAAYNGVTSESFQTYVNPEIPIPAEATKIHGITTSEVASAPKFPEAYQKFREFCGTDN- 81 (232) T ss_pred CEEEEEEECCCCCCCCCEEEEEEEEECCCCCEEEEEECCCCCCCHHHHCCCCCCHHHHHCCCCHHHHHHHHHHHHCCCC- T ss_conf 6899971179959999458999989275676899997998969965621038399998059989999999999857998- Q ss_pred HHHHCCC-CHHHHHHH----HCCCCCCC--CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCC Q ss_conf 0110032-12378898----61886664--21002345543168766772466776224442233565432221000000 Q gi|254780277|r 463 LKIGHNI-KYDKLVLH----RYGISMRG--FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKISD 535 (976) Q Consensus 463 ~kIgHN~-KfD~~~L~----~~GI~l~~--i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~ 535 (976) ..||||+ .||+.+|. ++|+.... +.||+..++.+.|+..+|+|+.|+++| +...+ T Consensus 82 ilVaHN~~~FD~~fL~~e~~r~g~~~~~~~~iDTl~lar~l~P~l~~h~L~~L~~~~-gi~~~----------------- 143 (232) T PRK06309 82 ILVAHNNDGFDFPLLVKECRRHSLEPLTLRTIDSLKWAQKYRPDLPKHNLQYLRQVY-GFAEN----------------- 143 (232) T ss_pred EEEEECCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHH-CCCCC----------------- T ss_conf 899848965679999999998599867740537999999975897768899999980-99998----------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 1233332221111124567763222100013457886075 Q gi|254780277|r 536 SQVQEHAIENSNVILQLWLLLRPKLIVEKLLHVYERLDKP 575 (976) Q Consensus 536 e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~~~iE~P 575 (976) -..-|..||.+|.+++..+...+...++..++..-+.| T Consensus 144 --~aHrAl~Da~at~~vf~~ll~~~~~~~~~~l~~~~~~~ 181 (232) T PRK06309 144 --QAHRALDDVITLHRVFSALVGDLSPQQVLDLLNESCHP 181 (232) T ss_pred --CCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCC T ss_conf --88885999999999999987216999999998450655 No 53 >PRK07740 hypothetical protein; Provisional Probab=99.13 E-value=1.1e-10 Score=96.60 Aligned_cols=158 Identities=15% Similarity=0.153 Sum_probs=108.4 Q ss_pred HHHCCCCCCCCCCCCCCCCC-CCCCCCEEECCCCCCCCCCCCCCEECC------CCC------CCCCCCCCCCCCHHHHH Q ss_conf 98668523233222433110-012442010257766544345502313------744------33345650017889999 Q gi|254780277|r 384 LETIGSASFKIITDTMDAFH-SKPIAIAISTLDQKDDTSFKVETIFID------LSV------HTSQDSTGKNTLTKEIL 450 (976) Q Consensus 384 L~~~~~ia~DtETt~ld~~~-~~ivGisls~~~~~~~~~~~~~~~yip------~~~------~~~~d~~~~~~~~~evl 450 (976) |....+++||+||||++|.. ++|+-|+.....+.... +.++.- ++. ....+.-...-...+|+ T Consensus 54 L~~~~fvV~D~ETTGl~p~~gD~IIeIgAVkv~~g~i~----d~F~tLVnP~~~Ip~~i~~ltGIT~emV~~aP~~~eVl 129 (240) T PRK07740 54 LTDLPFVVFDLETTGFSPDQGDEILSIAAVKTVGGEVT----DLYYSLVKPKKPIPEHILTLTGISAKDVVFAPPLAEVL 129 (240) T ss_pred CCCCEEEEEEEECCCCCCCCCCEEEEEEEEEEECCEEE----EEEEEEECCCCCCCHHHHHHCCCCHHHHHCCCCHHHHH T ss_conf 43383899980589989888987899989999999996----88775508879899889986098999983799999999 Q ss_pred HHHHHHHCCCCCHHHHCCCCHHHHHHHH-----CCCCCCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCC Q ss_conf 7444420222100110032123788986-----1886664-210023455431687667724667762244422335654 Q gi|254780277|r 451 SYLKKFFENEHFLKIGHNIKYDKLVLHR-----YGISMRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSR 524 (976) Q Consensus 451 ~~L~~lle~~~i~kIgHN~KfD~~~L~~-----~GI~l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~ 524 (976) ..|.+++.+.- .||||+.||+.+|.+ .+..+.+ +.||+..+.++.|. ..++|++++++| ++... T Consensus 130 ~~fl~Fi~~~v--lVaHNa~FD~~FL~~~l~r~~~~~~~~~~iDT~~Lar~l~~~-~~~sL~~l~~~f-gI~~~------ 199 (240) T PRK07740 130 RTFYPFINAST--LVGYHIGHDLAFLRHALWRHYRQKFSHRFIDTQFLTEIIAHQ-SFPTLDDALACY-GITCP------ 199 (240) T ss_pred HHHHHHHCCCE--EEEECCHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHCCC-CCCCHHHHHHHC-CCCCC------ T ss_conf 99999838998--999682863999999999845998788763699999986589-987899999976-99999------ Q ss_pred CCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 32221000000123333222111112456776322210001345 Q gi|254780277|r 525 KSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVEKLLHV 568 (976) Q Consensus 525 k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l 568 (976) -..-|..||.+|.+||.+|.+++++.+...+ T Consensus 200 -------------~rHrAl~DA~aTA~lf~~ll~~l~~~Gi~TL 230 (240) T PRK07740 200 -------------RRHTADGDVEMTAKLWAILIIEARQAGIETL 230 (240) T ss_pred -------------CCCCCHHHHHHHHHHHHHHHHHHHHCCCCCH T ss_conf -------------9988299999999999999999997699989 No 54 >PRK06310 DNA polymerase III subunit epsilon; Validated Probab=99.06 E-value=3.8e-10 Score=92.60 Aligned_cols=182 Identities=16% Similarity=0.248 Sum_probs=111.8 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEEC----CCC------CCCCCCCCCCCCCHHHHHHHH Q ss_conf 9866852323322243311001244201025776654434550231----374------433345650017889999744 Q gi|254780277|r 384 LETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFI----DLS------VHTSQDSTGKNTLTKEILSYL 453 (976) Q Consensus 384 L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yi----p~~------~~~~~d~~~~~~~~~evl~~L 453 (976) +....+++||+||||+++..++|+-|+...........-. ..++ |++ |....+.-...-...+++..+ T Consensus 4 l~~~~yvv~D~ETTGl~~~~DrIIEI~~vkv~~~~i~~~~--~~linP~~~I~~~~~~ihGIt~emV~d~P~f~ev~~~~ 81 (250) T PRK06310 4 LKDTEFVCLDCETTGLDVKKDRIIEFAAIRFTFDEVIDSM--EFLINPERVVSAESQRIHHISDAMLRDKPKIAEVFPQI 81 (250) T ss_pred CCCCCEEEEEEECCCCCCCCCEEEEEEEEEEECCEEEEEE--EEEECCCCCCCHHHHHHHCCCHHHHHCCCCHHHHHHHH T ss_conf 6689889997337997999962799999999999886889--99989289799889876154899984699999999999 Q ss_pred HHHHCCCCCHHHHCCCCHHHHHHH----HCCCCCC--C--CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCC Q ss_conf 442022210011003212378898----6188666--4--2100234554316876677246677622444223356543 Q gi|254780277|r 454 KKFFENEHFLKIGHNIKYDKLVLH----RYGISMR--G--FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRK 525 (976) Q Consensus 454 ~~lle~~~i~kIgHN~KfD~~~L~----~~GI~l~--~--i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k 525 (976) .+++.+..+ .|+||+.||+.+|. ++|+.+. . +.||+-.++++. ...+|+|++|+.+| +...+ T Consensus 82 ~~fi~~~~i-lVaHNa~FD~~fL~~e~~r~g~~~~~~~~~~IDTl~lAr~~~-~~~~~~L~~L~~~~-~i~~~------- 151 (250) T PRK06310 82 KGFFKEGDY-IVGHSVGFDLQVLAQEAERIGETFLSKHYYIIDTLRLAKEYG-DSPNNSLEALAVHF-NVPYD------- 151 (250) T ss_pred HHHHCCCCE-EEEECHHHHHHHHHHHHHHCCCCCCCCCCCEECHHHHHHHCC-CCCCCHHHHHHHHC-CCCCC------- T ss_conf 999679999-999384557999999999769986656795784999997567-65456099999970-99998------- Q ss_pred CCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CHHHHHHHCCCCCCCC Q ss_conf 22210000001233332221111124567763222100013457886075-0123322101431005 Q gi|254780277|r 526 SSIPIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVEKLLHVYERLDKP-MIDVVSQMEITGIQID 591 (976) Q Consensus 526 ~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~~~iE~P-l~~vLa~ME~~Gi~vD 591 (976) ...-|..||.+|..++..|.++.+. +..++.-.+-| +..++.-=.+.|..|. T Consensus 152 ------------~aHrAl~Da~aTa~vf~~l~~r~~~--~e~~~~i~~~p~~~~~f~FGKhKGk~v~ 204 (250) T PRK06310 152 ------------GNHRAMKDVEININVFKHLCKRFRT--LEQLKQVLSKPIKMKYMPLGKHKGRLFS 204 (250) T ss_pred ------------CCCCHHHHHHHHHHHHHHHHHHCCC--HHHHHHHHCCCCCCCEECCCCCCCEEHH T ss_conf ------------6668199999999999999985537--9999998619463420057784988899 No 55 >cd06126 DEDDy DEDDy exonucleases, part of the DnaQ-like (or DEDD) exonuclease superfamily, catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. They contain four invariant acidic residues in three conserved sequence motifs termed ExoI, ExoII and ExoIII. DEDDy exonucleases are classified as such because of the presence of a specific YX(3)D pattern at ExoIII. The four conserved acidic residues serve as ligands for the two metal ions required for catalysis. DEDDy exonucleases include the proofreading domains of DNA polymerases (families A and B), as well as RNases such as RNase D and Rrp6p. The Egalitarian (Egl) and Bacillus-like DNA Polymerase I subfamilies do not possess a completely conserved YX(3)D pattern at the ExoIII motif. In addition, Bacillus-like DNA polymerases contain inactive 3'-5' exonuclease domains which do not possess the metal-binding residues necessary for activity. Probab=99.04 E-value=5.7e-10 Score=91.29 Aligned_cols=156 Identities=21% Similarity=0.267 Sum_probs=105.4 Q ss_pred CCCCCCCCCCCCC--CCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHH Q ss_conf 5232332224331--10012442010257766544345502313744333456500178899997444420222100110 Q gi|254780277|r 389 SASFKIITDTMDA--FHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIG 466 (976) Q Consensus 389 ~ia~DtETt~ld~--~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIg 466 (976) .++||+||+.... ..+.++.|++|...+. .+.++..+.... ......-.+++..|..++++.+..++| T Consensus 1 vl~fDiE~~~~~~~~~~~pI~~i~~~~~~g~--------~~~~~~~~~~~~--~~~~~~E~~ll~~f~~~i~~~~dii~G 70 (181) T cd06126 1 VVAFDIETTPLVGGGGKGPIILIQLAVSAGE--------KFLLDVLTLTLD--AVDSADEKELLGELKAILEDYDDIKVG 70 (181) T ss_pred CEEEEEECCCCCCCCCCCCEEEEEEEECCCC--------EEEEEECCCCCC--CCCCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 9289862379899989998899999965997--------899986243467--556789999999999998308998999 Q ss_pred CCC-CHHHHHHHH----CCCCC----------------CC--CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCC Q ss_conf 032-123788986----18866----------------64--21002345543168766772466776224442233565 Q gi|254780277|r 467 HNI-KYDKLVLHR----YGISM----------------RG--FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKS 523 (976) Q Consensus 467 HN~-KfD~~~L~~----~GI~l----------------~~--i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~ 523 (976) ||+ .||+..|.+ +|+.+ .| ..|++...+-......+++|+.+|+.+|+....++.+. T Consensus 71 yN~~~FDlpyL~~R~~~l~~~~~~~~~~~~~~~~~~~~~gr~~~D~~~~~~~~~~~~~~~~L~~va~~~lg~~K~~~~~~ 150 (181) T cd06126 71 YNIDSFDLPYLLNRAGELGIPLRRGRDRSSGSREEINIKGRILFDLLLAAKRLGVALPSYSLDALAEQLLGEGKEKVDQL 150 (181) T ss_pred ECCCCCCHHHHHHHHHHHCCCCHHCCCCCCCCCCEEEECCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCHH T ss_conf 79977879999999999589701022666565504863485232199999984136465359999999749995669999 Q ss_pred CCCCCCCHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 4322210000001233332221111124567 Q gi|254780277|r 524 RKSSIPIDKISDSQVQEHAIENSNVILQLWL 554 (976) Q Consensus 524 ~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~ 554 (976) .........-..+.+.+||+.||.+|++||. T Consensus 151 ~i~~~w~~~~~~~~~~~Y~~~Da~~tl~l~l 181 (181) T cd06126 151 AIKQAWRARPLLERLAEYAAADAYATLQIYL 181 (181) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 9999886798669999999999999999769 No 56 >COG2176 PolC DNA polymerase III, alpha subunit (gram-positive type) [DNA replication, recombination, and repair] Probab=99.02 E-value=3.5e-10 Score=92.81 Aligned_cols=139 Identities=19% Similarity=0.314 Sum_probs=96.2 Q ss_pred CCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCC----C------ Q ss_conf 665035479999999999998668523233222433110012442010257766544345502313744----3------ Q gi|254780277|r 365 NDSYTKIVNTKDIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSV----H------ 434 (976) Q Consensus 365 ~~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~----~------ 434 (976) .+.-.++.++. -..+..+.++.||+|||||.+...+++-++.....+...+ .....++...+ . T Consensus 404 dD~vpiv~N~~-----d~~l~datyVVfDiETTGLs~~~d~iIE~aAvKikng~iI--d~f~~Fi~P~~pl~~~~telTg 476 (1444) T COG2176 404 DDGVPIVYNPD-----DQKLDDATYVVFDIETTGLSPVYDEIIEIAAVKIKNGRII--DKFQFFIKPGRPLSATITELTG 476 (1444) T ss_pred CCCCCEECCCC-----CCCCCCCCEEEEEEECCCCCCCCCHHHHHEEEEEECCCCH--HHHHHHCCCCCCCCHHHHHCCC T ss_conf 57884230754-----4545655089998553776754201562000443388043--7789750899967644542035 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHH----HCCCC-CC-CCCCHHHHHHHHCCCCCCCCHHHH Q ss_conf 3345650017889999744442022210011003212378898----61886-66-421002345543168766772466 Q gi|254780277|r 435 TSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLH----RYGIS-MR-GFEDIMLMSYVLDSGRSSHDIASI 508 (976) Q Consensus 435 ~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~----~~GI~-l~-~i~DTmlaayLLdp~~~~~~L~~l 508 (976) ..++.-......++|+..++.|+.+. ..|+||+.||+.+|. ++|+. +. ++.||+-.|+-|+|.-.+|+|..+ T Consensus 477 ITdeml~~a~~i~~vL~kf~~~~~d~--IlVAHNasFD~gFl~~~~~k~~~~~~~~pvIDTL~lar~L~P~~ksh~Lg~l 554 (1444) T COG2176 477 ITDEMLENAPEIEEVLEKFREFIGDS--ILVAHNASFDMGFLNTNYEKYGLEPLTNPVIDTLELARALNPEFKSHRLGTL 554 (1444) T ss_pred CCHHHHCCCCCHHHHHHHHHHHHCCC--EEEECCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCHHHHHCCHHHH T ss_conf 67998748833999999999984485--8996167556457888899838851027501199999875745432466778 Q ss_pred HHHH Q ss_conf 7762 Q gi|254780277|r 509 AQKW 512 (976) Q Consensus 509 a~~y 512 (976) |+++ T Consensus 555 ~kk~ 558 (1444) T COG2176 555 CKKL 558 (1444) T ss_pred HHHH T ss_conf 8874 No 57 >PRK06807 DNA polymerase III subunit epsilon; Validated Probab=99.01 E-value=7.4e-10 Score=90.46 Aligned_cols=154 Identities=12% Similarity=0.170 Sum_probs=104.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCC-CC---CCCEECCCC----CCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 68523233222433110012442010257766544-34---550231374----43334565001788999974444202 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTS-FK---VETIFIDLS----VHTSQDSTGKNTLTKEILSYLKKFFE 458 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~-~~---~~~~yip~~----~~~~~d~~~~~~~~~evl~~L~~lle 458 (976) ..++++|+||||++|..++++-|+.....+..... +. .+...+|-. +....+.-...-...+++..|.+++. T Consensus 8 ~~fVV~DiETTGl~p~~D~IIEIgAVkv~ng~Ivd~f~tLInP~~~Ip~~It~ItGIT~emV~dAP~f~EVl~~f~~Fi~ 87 (313) T PRK06807 8 LDYVVIDFETTGFNPYNDKIIQVAAVKYRNHELVDQFVSYVNPERPIPDRITSLTGITNYRVSDAPTIEEVLPLFLAFLH 87 (313) T ss_pred CCEEEEEEECCCCCCCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCCHHHHHHCCCCHHHHHCCCCHHHHHHHHHHHCC T ss_conf 87899997889979999878999999999997989999998948979966765339799998269989999999998738 Q ss_pred CCCCHHHHCCCCHHHHHHH----HCCCC-CCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHH Q ss_conf 2210011003212378898----61886-664-21002345543168766772466776224442233565432221000 Q gi|254780277|r 459 NEHFLKIGHNIKYDKLVLH----RYGIS-MRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDK 532 (976) Q Consensus 459 ~~~i~kIgHN~KfD~~~L~----~~GI~-l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~ 532 (976) +. ..|+||+.||+.+|. ++|+. +.+ +.||+-.|..+-|+..+|.|..+|+. |++.. T Consensus 88 d~--iLVAHNA~FD~~FL~~~L~r~gl~~~~n~~IDTL~LAR~l~P~L~NhKLnTVa~~-LgI~l--------------- 149 (313) T PRK06807 88 TN--VIVAHNASFDMRFLKSNVNMLGLPEPKNKVIDTVFLAKKYMKHAPNHKLETLKRM-LGIRL--------------- 149 (313) T ss_pred CC--EEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHH-HCCCC--------------- T ss_conf 99--8999553661999999999749988899877659999986478765667899998-39983--------------- Q ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0001233332221111124567763222100 Q gi|254780277|r 533 ISDSQVQEHAIENSNVILQLWLLLRPKLIVE 563 (976) Q Consensus 533 i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~ 563 (976) -...|..||.+|..++.++...-++. T Consensus 150 -----~HHrAldDA~acAeI~~k~~~~~~~~ 175 (313) T PRK06807 150 -----SSHNAFDDCITCAAVYQKCASIEEEA 175 (313) T ss_pred -----CCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf -----43671778999999999999988616 No 58 >PRK08517 DNA polymerase III subunit epsilon; Provisional Probab=98.99 E-value=6.1e-10 Score=91.07 Aligned_cols=156 Identities=10% Similarity=0.072 Sum_probs=105.0 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCC---CCEECCCC----CCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 98668523233222433110012442010257766544345---50231374----433345650017889999744442 Q gi|254780277|r 384 LETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKV---ETIFIDLS----VHTSQDSTGKNTLTKEILSYLKKF 456 (976) Q Consensus 384 L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~---~~~yip~~----~~~~~d~~~~~~~~~evl~~L~~l 456 (976) +....++++|+||||++|..++|+-|+.....++.....+. ..-.+|-. +....+.-...-...+|+..+..+ T Consensus 67 ~ke~~fvvvDlETTGl~~~~d~IIEIgavkv~~g~iid~f~tlVnp~~Ip~~It~lTGIt~emv~~aP~~~evl~~f~~F 146 (259) T PRK08517 67 IKEQVFCFVDIETNGSKPKKGQIIEIGAVKVKGGEIIDRFESFVKAKEVPEYITELTGITYEDLENAPSLKEVLEDFRLF 146 (259) T ss_pred CCCCCEEEEEEECCCCCCCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCHHHHHHCCCCHHHHHCCCCHHHHHHHHHHH T ss_conf 67883899996589989999824999999999999999874323989899888221389989982799889999999999 Q ss_pred HCCCCCHHHHCCCCHHHHHHH----HCCCC-CC-CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 022210011003212378898----61886-66-4210023455431687667724667762244422335654322210 Q gi|254780277|r 457 FENEHFLKIGHNIKYDKLVLH----RYGIS-MR-GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPI 530 (976) Q Consensus 457 le~~~i~kIgHN~KfD~~~L~----~~GI~-l~-~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~ 530 (976) +.+. ..||||+.||+.+|. ++|+. +. .+.||+-.|..+-|+. +|+|+.||+++ +.... T Consensus 147 i~d~--ilVaHNa~FD~~FL~~~l~r~g~~~l~n~~lDTl~LAR~~~p~~-~ykL~~La~~~-~I~~~------------ 210 (259) T PRK08517 147 LGDS--VFVAHNVNFDYNFISRSLEEIGLGPLLNRKLCTIDLAKRTIESP-RYGLSFLKELL-GIETE------------ 210 (259) T ss_pred HCCC--EEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEEHHHHHHHCCCC-CCCHHHHHHHC-CCCCC------------ T ss_conf 6889--59993727609999999998399976898566089999875877-87889999975-98999------------ Q ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00000123333222111112456776322210 Q gi|254780277|r 531 DKISDSQVQEHAIENSNVILQLWLLLRPKLIV 562 (976) Q Consensus 531 ~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~ 562 (976) ....|..||.+|.+++..+..+|.. T Consensus 211 -------~~HRAl~DA~aTaevF~~~L~~l~~ 235 (259) T PRK08517 211 -------VHHRAYADALAAYEIFEICLLNLPS 235 (259) T ss_pred -------CCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf -------9978458899999999999985665 No 59 >PRK00448 polC DNA polymerase III PolC; Validated Probab=98.99 E-value=1e-09 Score=89.42 Aligned_cols=126 Identities=21% Similarity=0.335 Sum_probs=87.0 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEEC----CCCCCC------CCCCCCCCCCHHHHHHH Q ss_conf 99866852323322243311001244201025776654434550231----374433------34565001788999974 Q gi|254780277|r 383 KLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFI----DLSVHT------SQDSTGKNTLTKEILSY 452 (976) Q Consensus 383 ~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yi----p~~~~~------~~d~~~~~~~~~evl~~ 452 (976) .+....++.||+||||+.+..++|+-++-.....+...- ....++ |++..+ ..+........++++.. T Consensus 414 ~l~~~~yVvFDiETTGls~~~d~IiEigAvki~~g~iid--~f~~fi~P~~~i~~~i~~lT~Itd~mv~~~~~~~evl~~ 491 (1436) T PRK00448 414 DLKDATYVVFDVETTGLSAVYDEIIEIGAVKIKNGEIID--KFEEFIKPGHPLSAFTTELTGITDDMVKDAPSIEEVLPK 491 (1436) T ss_pred CCCCCCEEEEEECCCCCCCCCCCEEEEEEEEEECCEEEE--EEHHHCCCCCCCCHHHHHCCCCCHHHHCCCCCHHHHHHH T ss_conf 467663799860068888666723687579863781754--102103899979754532147884786389878999999 Q ss_pred HHHHHCCCCCHHHHCCCCHHHHHHH----HCCCC-CCC-CCCHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 4442022210011003212378898----61886-664-210023455431687667724667762 Q gi|254780277|r 453 LKKFFENEHFLKIGHNIKYDKLVLH----RYGIS-MRG-FEDIMLMSYVLDSGRSSHDIASIAQKW 512 (976) Q Consensus 453 L~~lle~~~i~kIgHN~KfD~~~L~----~~GI~-l~~-i~DTmlaayLLdp~~~~~~L~~la~~y 512 (976) +.+|+.+. ..|+||+.||..++. ++|+. +.+ +.||+-.|+.|.|+..+|+|+.+|+++ T Consensus 492 f~~f~~d~--vlVAHNa~FD~~Fi~~~~~~~~~~~~~~p~iDTl~lsr~l~p~~k~~~L~~l~k~~ 555 (1436) T PRK00448 492 FKEFCGDS--VLVAHNASFDVGFINTNYEKLGLEKIKNPVIDTLELSRFLYPEYKSHRLNTLAKKF 555 (1436) T ss_pred HHHHHCCC--EEEEECCCCCHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHHH T ss_conf 99982897--79984586568899999998089866797230299898755101114478899884 No 60 >PRK08074 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=98.97 E-value=2e-09 Score=87.34 Aligned_cols=154 Identities=17% Similarity=0.174 Sum_probs=106.3 Q ss_pred CCCCCCCCCCCCCCCCC-CCCCCCEEECCCCCCCCCCCC----CCEECCCCCCCCC---CCCCCCC-CHHHHHHHHHHHH Q ss_conf 68523233222433110-012442010257766544345----5023137443334---5650017-8899997444420 Q gi|254780277|r 387 IGSASFKIITDTMDAFH-SKPIAIAISTLDQKDDTSFKV----ETIFIDLSVHTSQ---DSTGKNT-LTKEILSYLKKFF 457 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~-~~ivGisls~~~~~~~~~~~~----~~~yip~~~~~~~---d~~~~~~-~~~evl~~L~~ll 457 (976) ..++++|+||||+++.. ++|+-|+.....+....-.+. +.--||..+.... +....+. ..++|...+..++ T Consensus 3 ~~~~vvDlETTG~~~~~~d~IIqig~v~i~~~~i~~~f~t~vnP~~~iP~~I~~LTgI~~~~v~~AP~f~eva~~i~~~l 82 (932) T PRK08074 3 KRFVVVDLETTGNSPKKGDKIIQIAAVVVEDGEILERFSTFVNPERPIPPFITELTGISEEMVKNAPLFEDVAPMIVELL 82 (932) T ss_pred CEEEEEEEECCCCCCCCCCEEEEEEEEEEECCEEEEEEECCCCCCCCCCHHHHHHCCCCHHHHCCCCCHHHHHHHHHHHH T ss_conf 70899998888989777988999979999999998998532589998997788606989899805999899999999996 Q ss_pred CCCCCHHHHCCCCHHHHHHHH----CCCC-CCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCH Q ss_conf 222100110032123788986----1886-664-2100234554316876677246677622444223356543222100 Q gi|254780277|r 458 ENEHFLKIGHNIKYDKLVLHR----YGIS-MRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPID 531 (976) Q Consensus 458 e~~~i~kIgHN~KfD~~~L~~----~GI~-l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~ 531 (976) ++. +.|+||+.||+.+|.. .|+. +.+ ++||+-.|.++-|...+|.|.+|+.. |+..-+ T Consensus 83 ~~~--~~VaHNv~FD~~FL~~~l~~~G~~~~~~~~~DTveLa~i~~P~~~sy~L~~L~~~-l~i~~~------------- 146 (932) T PRK08074 83 EGA--YFVAHNVHFDLNFLNEELERAGYTEIHCPKLDTVELARILLPTADSYKLSDLSEE-LELEHD------------- 146 (932) T ss_pred CCC--EEEECCHHHHHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHH-CCCCCC------------- T ss_conf 699--5994780540999999999739986668765199999997788788888999986-499999------------- Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0000123333222111112456776322210 Q gi|254780277|r 532 KISDSQVQEHAIENSNVILQLWLLLRPKLIV 562 (976) Q Consensus 532 ~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~ 562 (976) -..-|-.||.+|.+|+..+..++.. T Consensus 147 ------~~HrA~sDa~aTa~l~~~l~~~l~~ 171 (932) T PRK08074 147 ------QPHRADSDAEVTAELFLQLLHKLER 171 (932) T ss_pred ------CCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf ------9875178899999999999999983 No 61 >PRK09145 DNA polymerase III subunit epsilon; Validated Probab=98.96 E-value=1.5e-09 Score=88.29 Aligned_cols=157 Identities=14% Similarity=0.173 Sum_probs=96.9 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECC----CC------CCCCCCCCCCCCCHHHH Q ss_conf 999998668523233222433110012442010257766544345502313----74------43334565001788999 Q gi|254780277|r 380 WVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFID----LS------VHTSQDSTGKNTLTKEI 449 (976) Q Consensus 380 li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip----~~------~~~~~d~~~~~~~~~ev 449 (976) +.+......+++||+||||+|+..++|+-|+.....+.....-.....++. ++ |....+.........++ T Consensus 22 l~~~~~~~~fVv~D~ETTGL~~~~D~IieIgav~v~~~~i~~~~~f~~lV~P~~~i~~~~~~IhGIt~~~l~~ap~~~ev 101 (203) T PRK09145 22 LFEPPPPDEWVALDCETTGLDPRRAEIVSIAAVKIRGNRILTSERLELLVRPPQSLSAESIKIHRLRHQDLEHGLSEEEA 101 (203) T ss_pred HHCCCCCCCEEEEEEECCCCCCCCCEEEEEEEEEEECCEEECCCEEEEEECCCCCCCHHHHCCCCCCHHHHCCCCCHHHH T ss_conf 51589999899998568999999970699999999999993175289998979959864520138787998139999999 Q ss_pred HHHHHHHHCCCCCHHHHCCCCHHHHHHH----H-CCCCCCC-CCCHHHH--HH---HHCCCCCCCCHHHHHHHHCCCCCC Q ss_conf 9744442022210011003212378898----6-1886664-2100234--55---431687667724667762244422 Q gi|254780277|r 450 LSYLKKFFENEHFLKIGHNIKYDKLVLH----R-YGISMRG-FEDIMLM--SY---VLDSGRSSHDIASIAQKWLSYTRK 518 (976) Q Consensus 450 l~~L~~lle~~~i~kIgHN~KfD~~~L~----~-~GI~l~~-i~DTmla--ay---LLdp~~~~~~L~~la~~yL~~~~~ 518 (976) +..|..++.+.. .||||+.||+.+|. + .|+.+.. +.|++-. .+ .+-+...+++|++++++| |... T Consensus 102 l~~f~~fi~~~~--lVgHNa~FD~~fL~~~~~r~~g~~l~~~~id~~~l~~~~~~~~~~~~~~~~~L~~l~~~~-gi~~- 177 (203) T PRK09145 102 LRQLLAFIGNRP--LVGYYLEFDVAMLNRYVRPLLGIGLPNPLIEVSSLYHDKKERHLPDAYIDLRFDAILKHL-DLPV- 177 (203) T ss_pred HHHHHHHCCCCE--EEEECHHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHHHCCCCCCCCCHHHHHHHC-CCCC- T ss_conf 999998829998--999878999999999999962999999877609879999996676987788999999857-9878- Q ss_pred CCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3356543222100000012333322211111245677632 Q gi|254780277|r 519 EILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRP 558 (976) Q Consensus 519 e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~ 558 (976) . ...-|..||.+|..||.+|.+ T Consensus 178 -----------------~-~~HrAL~DA~aTa~lfl~L~k 199 (203) T PRK09145 178 -----------------L-GRHDALNDAIMTALIYLRLRK 199 (203) T ss_pred -----------------C-CCCCHHHHHHHHHHHHHHHHC T ss_conf -----------------9-997989999999999999867 No 62 >PRK06631 consensus Probab=98.94 E-value=1.7e-09 Score=87.76 Aligned_cols=150 Identities=13% Similarity=0.216 Sum_probs=94.2 Q ss_pred CCCCCCCCCCCCCCCCC-CCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCC----------CCCCCCCCHHHHHHHHHH Q ss_conf 68523233222433110-0124420102577665443455023137443334----------565001788999974444 Q gi|254780277|r 387 IGSASFKIITDTMDAFH-SKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQ----------DSTGKNTLTKEILSYLKK 455 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~-~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~----------d~~~~~~~~~evl~~L~~ 455 (976) ...++|||||||++|.. .+|+-|+.....+.....-. ...|+--...... +.-...-...+|++.|.. T Consensus 4 mr~VvlD~ETTGl~~~~gdrIIEIgav~~~~~~~t~~~-f~~~inP~r~i~~~~~~ihGIt~e~l~~~P~f~ev~~~fl~ 82 (229) T PRK06631 4 LREIILDTETTGLDPQQGHRIVEIGAIEMVNKVLTGRN-FHFYINPERDMPFEAYRIHGISGEFLKDKPLFHTIADDFLE 82 (229) T ss_pred CCEEEEEEECCCCCCCCCCEEEEEEEEEEECCEEECCE-EEEEECCCCCCCHHHHHCCCCCHHHHHHCCCHHHHHHHHHH T ss_conf 88899982579989999987999999999999880437-99998989979761512018678998538779999999999 Q ss_pred HHCCCCCHHHHCCCCHHHHHHHH----CC---CC---CCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCC Q ss_conf 20222100110032123788986----18---86---6642100234554316876677246677622444223356543 Q gi|254780277|r 456 FFENEHFLKIGHNIKYDKLVLHR----YG---IS---MRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRK 525 (976) Q Consensus 456 lle~~~i~kIgHN~KfD~~~L~~----~G---I~---l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k 525 (976) |+.+..+ |+||+.||+.+|.+ .| +. +..+.||+..|+-+.|+. +|+|++||+|| ++... T Consensus 83 Fi~~s~L--VaHNA~FD~~FLn~El~~l~~~~~~~~~~~~viDTL~lAR~~~Pg~-~~sLdaLc~r~-~I~~s------- 151 (229) T PRK06631 83 FISDSKL--IIHNAPFDIKFLNHELSLLKRTEIKLLELANTIDTLVMARSMFPGS-KYNLDALCKRF-KVDNS------- 151 (229) T ss_pred HCCCCEE--EEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCC-CCCHHHHHHHC-CCCCC------- T ss_conf 8299979--9976387598999999982788888432011012699999868998-88999999984-99999------- Q ss_pred CCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 222100000012333322211111245677632 Q gi|254780277|r 526 SSIPIDKISDSQVQEHAIENSNVILQLWLLLRP 558 (976) Q Consensus 526 ~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~ 558 (976) .....-|..||.++..+|-.|.. T Consensus 152 ----------~r~~H~AL~DA~llA~Vyl~l~g 174 (229) T PRK06631 152 ----------GRQLHGALKDAALLAEVYVELTG 174 (229) T ss_pred ----------CCCCCCCHHHHHHHHHHHHHHHC T ss_conf ----------85556868999999999999717 No 63 >KOG2518 consensus Probab=98.94 E-value=1e-07 Score=74.93 Aligned_cols=198 Identities=23% Similarity=0.244 Sum_probs=143.6 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECC-C----C--CHHHHCC Q ss_conf 7289996626999887426754468988550069999999999999848554798799997274-8----8--7132408 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYP-A----V--TFRNEIY 77 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~-~----~--tfR~~~~ 77 (976) ...+-|||+..+||+-.+--.-.- -|.||.--..|+.--.+++..+.= +-++|||+. - . .-|++. T Consensus 24 g~tvavD~y~WLhrg~~~Ca~el~--~~~pT~ryi~y~ik~v~lL~~~gi-----kPilVFDG~~LP~K~~te~~Rr~~- 95 (556) T KOG2518 24 GKTVAVDGYCWLHRGALACAEKLA--KGKPTDRYIQFFIKRVKLLLSYGI-----KPILVFDGDPLPSKKETERKRRER- 95 (556) T ss_pred CCEEEEEHHHHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHHHHHHHCCC-----EEEEEECCCCCCCCCCCCHHHHHH- T ss_conf 836887325588645776799986--289738999999999999985697-----489996699765556543677999- Q ss_pred HHHHCCCCCC------------CHHHHHH-------HHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 6521667987------------9758864-------89999999970997995689016658999999998489949998 Q gi|254780277|r 78 PDYKANRPKI------------PEMLLPQ-------LPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTII 138 (976) Q Consensus 78 ~~YKa~R~~~------------p~~l~~q-------~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~ 138 (976) +|+|+... -+-+..- ...+-++|...||..+-.| ||||-=++.|.+ +|.--.|+ T Consensus 96 --R~~n~~~a~~ll~~G~~~~A~~~fqr~VdIT~~ma~~lI~~~r~~nVe~IVAP-yEADAQlayL~~----~~~i~~II 168 (556) T KOG2518 96 --RKKNLDAAEQLLAEGKESNARECFQRCVDITPEMAHKLIQYLRSQNVEYIVAP-YEADAQLAYLER----EGIVDAII 168 (556) T ss_pred --HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCEEECC-CCCCCHHHHHHH----CCCCEEEE T ss_conf --98709999999976898889999987611769999999999997697558657-623212378876----27642798 Q ss_pred CCCCHHHEECCCCE-EEEE-CCCCCEECHHHHHHHHCCC----HHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCC Q ss_conf 68701212237975-9996-6788353899998884898----8787777875187435899877636024998984136 Q gi|254780277|r 139 STDKDLMQLVSPTT-CLYD-TVKEEKIDIEHVIKKWGIP----PEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYG 212 (976) Q Consensus 139 s~DkD~~Qlv~~~v-~~~~-~~~~~~~~~~~v~~~~gv~----p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~g 212 (976) |-|-||++.=..+| .-++ +.++..++.....+.++.. -+++...-.|-| -|=.+||||||-+||.++|..|. T Consensus 169 TEDSDLl~fGc~~vifK~d~~G~~le~~~~~l~~~~~l~~~~~~ekfr~mciLSG--CDYl~slpGvGl~tA~k~l~k~~ 246 (556) T KOG2518 169 TEDSDLLVFGCKKVIFKMDSFGNGLEINRSKLPECKPLGDKFTEEKFRRMCILSG--CDYLSSLPGVGLATAHKLLSKYN 246 (556) T ss_pred ECCCCCCCCCCHHHEEECCCCCCCCCCCHHHHHHCCCCCCCCCHHHHHHHHHHCC--CCCCCCCCCCCHHHHHHHHHHCC T ss_conf 2365565668524322014788850322765222126544457999999998427--73112476522999999998537 Q ss_pred CHHHHHH Q ss_conf 7899986 Q gi|254780277|r 213 NLENILI 219 (976) Q Consensus 213 sle~i~~ 219 (976) +.|-++. T Consensus 247 ~~d~vi~ 253 (556) T KOG2518 247 TPDRVII 253 (556) T ss_pred CHHHHHH T ss_conf 5888899 No 64 >PRK07942 DNA polymerase III subunit epsilon; Provisional Probab=98.92 E-value=4.4e-09 Score=84.84 Aligned_cols=152 Identities=16% Similarity=0.223 Sum_probs=97.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCC---------------CCCCCCHHHHHH Q ss_conf 68523233222433110012442010257766544345502313744333456---------------500178899997 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDS---------------TGKNTLTKEILS 451 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~---------------~~~~~~~~evl~ 451 (976) ...++||+||||+++..+.||-+++...++.+... ....+++.-........ ....-...++.. T Consensus 6 ~~lvvfD~ETTG~d~~~dRIIeia~V~~~~~g~~~-~~~~~LvnPg~~IP~~a~~vhGIT~e~~~~~G~p~a~v~~eia~ 84 (234) T PRK07942 6 GPLAAFDLETTGVDPETARIVTAAIVVVDAHGEVV-ERREWLADPGVEIPEEASAVHGITTERARAHGRPAAEVLAEIAE 84 (234) T ss_pred CCEEEEEEECCCCCCCCCEEEEEEEEEEECCCCEE-EEEEEEECCCCCCCHHHHHHHCCCHHHHHHCCCCCHHHHHHHHH T ss_conf 97899980169989889838999999998898546-88999869589499889778488999997428983079999999 Q ss_pred HHHHHHCCCCCHHHHCCCCHHHHHHH----HCCCC-C-C-CCCCHHHHHHHHCCCC-CCCCHHHHHHHHCCCCCCCCCCC Q ss_conf 44442022210011003212378898----61886-6-6-4210023455431687-66772466776224442233565 Q gi|254780277|r 452 YLKKFFENEHFLKIGHNIKYDKLVLH----RYGIS-M-R-GFEDIMLMSYVLDSGR-SSHDIASIAQKWLSYTRKEILKS 523 (976) Q Consensus 452 ~L~~lle~~~i~kIgHN~KfD~~~L~----~~GI~-l-~-~i~DTmlaayLLdp~~-~~~~L~~la~~yL~~~~~e~~~~ 523 (976) .|..++.+. ...|+||+.||+.+|. ++|+. + . .+.||++.+..+++-+ ++++|.+||.+| |...+ T Consensus 85 ~l~~~~~~g-~~lVahNA~FDl~fL~~El~R~g~~~l~~~~viDtl~l~r~l~~~r~gkr~L~aL~~~y-gv~l~----- 157 (234) T PRK07942 85 ALRAVWRAG-VPVVVYNAPYDLTVLDRELRRHGLPSLVPGPVIDPYVIDKHVDRYRKGKRTLTALCEHY-GVRLD----- 157 (234) T ss_pred HHHHHHHCC-CEEEEECCHHHHHHHHHHHHHCCCCCCCCCCEEEHHHHHHHHCCCCCCCCCHHHHHHHC-CCCCC----- T ss_conf 999997279-87999670764999999999859996777755619998877514777888899999984-99887----- Q ss_pred CCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4322210000001233332221111124567763222 Q gi|254780277|r 524 RKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPKL 560 (976) Q Consensus 524 ~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L 560 (976) . ..-|..||.+|.+|...+..+. T Consensus 158 -------------~-aH~A~aDA~Ata~l~~~l~~r~ 180 (234) T PRK07942 158 -------------N-AHDATADALAAARLAWALARRF 180 (234) T ss_pred -------------C-CCCCHHHHHHHHHHHHHHHHHH T ss_conf -------------7-5564152899999999999876 No 65 >PRK07246 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=98.91 E-value=3.9e-09 Score=85.20 Aligned_cols=154 Identities=14% Similarity=0.137 Sum_probs=105.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCC----CCEECCCCCCCCC---CCCCC-CCCHHHHHHHHHHHHC Q ss_conf 68523233222433110012442010257766544345----5023137443334---56500-1788999974444202 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKV----ETIFIDLSVHTSQ---DSTGK-NTLTKEILSYLKKFFE 458 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~----~~~yip~~~~~~~---d~~~~-~~~~~evl~~L~~lle 458 (976) ..++++|+||||.++. ++|+-|+.....+......+. +.--||..+.... +.... .-..++|...+..+++ T Consensus 7 ~~~~vvDlETTG~~~~-~~Iiqi~~v~~~~~~i~~~f~t~vnP~~~ip~~I~~LTgI~~~~v~~ap~f~~v~~~~~~~l~ 85 (820) T PRK07246 7 RKYAVVDLEATGAGPN-ASIIQVGIVIIQGNKIIDSYETDVNPHESLDEHIVHLTGITDKQLAQAPDFSQVAHHIYQLIE 85 (820) T ss_pred CCEEEEEEECCCCCCC-CCEEEEEEEEEECCEEEEEEEECCCCCCCCCHHHHHHCCCCHHHHHCCCCHHHHHHHHHHHHC T ss_conf 7189999887898999-877999999999999989987311889979978886469898998529998999999999967 Q ss_pred CCCCHHHHCCCCHHHHHHHH----CCCCCCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHH Q ss_conf 22100110032123788986----1886664-210023455431687667724667762244422335654322210000 Q gi|254780277|r 459 NEHFLKIGHNIKYDKLVLHR----YGISMRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKI 533 (976) Q Consensus 459 ~~~i~kIgHN~KfD~~~L~~----~GI~l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i 533 (976) +. +.|+||+.||+.+|.. .|.++.+ +.||+-.|.++-|...+|.|.+|+.. |+...+ T Consensus 86 ~~--~~VaHNv~FD~~fl~~~~~~~g~~~~~~~~DTv~La~i~~P~~~~y~L~~L~~~-l~~~~~--------------- 147 (820) T PRK07246 86 DC--IFVAHNVKFDANLLAEALFLEGYELRTPRVDTVELAQVFFPRLEKYSLSHLSRQ-LNIDLA--------------- 147 (820) T ss_pred CC--EEEEECCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHH-CCCCCC--------------- T ss_conf 98--599837201699999999975898899826099999997798788888999997-399999--------------- Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 001233332221111124567763222100 Q gi|254780277|r 534 SDSQVQEHAIENSNVILQLWLLLRPKLIVE 563 (976) Q Consensus 534 ~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~ 563 (976) . ..-|-.||.+|..|+..+..++..- T Consensus 148 ---~-~HrA~~Da~ata~l~~~l~~~l~~l 173 (820) T PRK07246 148 ---E-AHTAIADARATAILFLRLLQKIESL 173 (820) T ss_pred ---C-CCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf ---8-8761889999999999999998619 No 66 >PRK07883 hypothetical protein; Validated Probab=98.90 E-value=2.1e-09 Score=87.25 Aligned_cols=165 Identities=10% Similarity=0.039 Sum_probs=110.1 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCC----CCCEECCCC----CCCCCCCCCCCCCHHHHHHH Q ss_conf 9999866852323322243311001244201025776654434----550231374----43334565001788999974 Q gi|254780277|r 381 VQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFK----VETIFIDLS----VHTSQDSTGKNTLTKEILSY 452 (976) Q Consensus 381 i~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~----~~~~yip~~----~~~~~d~~~~~~~~~evl~~ 452 (976) -..|....++.||+||||+++..+.|+-|+.....+......+ .+...||-. +.+..+.-...-...+|+.. T Consensus 18 ~~pL~~~tfVVvDlETTG~s~~~d~IiEIgAVkv~~g~vv~~f~tLVnP~~~Ip~~I~~LTGIt~~mV~~AP~~~~vl~~ 97 (575) T PRK07883 18 GLPLRDTTFVVVDLETTGGSPTGDAITEIGAVKVRGGEVLGEFATLVNPGRSIPPFIVVLTGITTAMVADAPPIDAVLPA 97 (575) T ss_pred CCCCCCCEEEEEEEECCCCCCCCCEEEEEEEEEEECCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCCCHHHHHHH T ss_conf 99356784899995079999889705999999998999999971001938989854400369698998079999999999 Q ss_pred HHHHHCCCCCHHHHCCCCHHHHHHH----HCCCCC-CC-CCCHHHHHHHHCC--CCCCCCHHHHHHHHCCCCCCCCCCCC Q ss_conf 4442022210011003212378898----618866-64-2100234554316--87667724667762244422335654 Q gi|254780277|r 453 LKKFFENEHFLKIGHNIKYDKLVLH----RYGISM-RG-FEDIMLMSYVLDS--GRSSHDIASIAQKWLSYTRKEILKSR 524 (976) Q Consensus 453 L~~lle~~~i~kIgHN~KfD~~~L~----~~GI~l-~~-i~DTmlaayLLdp--~~~~~~L~~la~~yL~~~~~e~~~~~ 524 (976) |.+|+.+. ..|+||+.||+.||. ++|+.. .+ +.||+-.|.-+-| ...+|.|+.|+.++ +.... T Consensus 98 f~~F~~~~--vlVAHNa~FD~~FL~~~~~r~g~~~~~~~~ldTl~LARrl~p~~e~~~~kL~tLa~~~-g~~~~------ 168 (575) T PRK07883 98 FLEFARGS--VLVAHNAPFDIGFLRAAAERCGYPWPQPEVLCTVRLARRVLSRDEAPNVRLSALARLF-GAATT------ 168 (575) T ss_pred HHHHHCCC--EEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCCCHHHHHHHC-CCCCC------ T ss_conf 99996899--8999083452999999999759998789786489999986692114689989999975-99999------ Q ss_pred CCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3222100000012333322211111245677632221000134 Q gi|254780277|r 525 KSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVEKLLH 567 (976) Q Consensus 525 k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~ 567 (976) ...-|..||.+|..+...|..++.+.+... T Consensus 169 -------------~~HRAl~DA~AT~~vl~~l~~~~~~~gv~t 198 (575) T PRK07883 169 -------------PTHRALDDARATVDVLHGLIERLGNQGVHT 198 (575) T ss_pred -------------CCCCHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf -------------986604529999999999999854224220 No 67 >KOG2206 consensus Probab=98.89 E-value=4.8e-09 Score=84.58 Aligned_cols=175 Identities=17% Similarity=0.220 Sum_probs=133.3 Q ss_pred CCCCCCCCEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCC Q ss_conf 55566650354799999999999986685232332224331100124420102577665443455023137443334565 Q gi|254780277|r 361 YPINNDSYTKIVNTKDIQQWVQKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDST 440 (976) Q Consensus 361 ~~~~~~~y~~I~~~~~L~~li~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~ 440 (976) .++....+..|.+..++.++.+.+.....+|+|+|+.+.-.|.+--..+.+++... .+.|+ T Consensus 186 ~~le~T~~~~I~t~~el~~l~~~l~~~~Efavdlehhsyrsf~gltclmqISTr~e---------d~iID---------- 246 (687) T KOG2206 186 SPLEITPKVWICTLGELEALPEILDSVIEFAVDLEHHSYRSFLGLTCLMQISTRTE---------DFIID---------- 246 (687) T ss_pred CCCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHCCCCCHHHHCCCEEEEEEECCCH---------HHEEH---------- T ss_conf 75103686444216879888999855356532036520453337206899530420---------31211---------- Q ss_pred CCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHH-CCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCC Q ss_conf 00178899997444420222100110032123788986-18866642100234554316876677246677622444223 Q gi|254780277|r 441 GKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHR-YGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKE 519 (976) Q Consensus 441 ~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~-~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e 519 (976) -....+.+..|+++|.+|.+.||+|-+--|+-||.+ +||.+-+.|||..|+.+|. ...++|..|...|.+..... T Consensus 247 --t~~l~~~i~~l~e~fsdp~ivkvfhgaD~diiwlqrdfgiyvvnLfdt~~a~r~L~--~~r~sL~~ll~~~~~v~~nk 322 (687) T KOG2206 247 --TFKLRDHIGILNEVFSDPGIVKVFHGADTDIIWLQRDFGIYVVNLFDTIQASRLLG--LPRPSLAYLLECVCGVLTNK 322 (687) T ss_pred --HHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHCCCEEEEEECHHHHHHHHHHC--CCCCCHHHHHHHHHHHHHHH T ss_conf --47888888774221058874789734764034340230489985110588998857--99400999999987453114 Q ss_pred CCCCCCCCCC-CHHHC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3565432221-00000-01233332221111124567763222100 Q gi|254780277|r 520 ILKSRKSSIP-IDKIS-DSQVQEHAIENSNVILQLWLLLRPKLIVE 563 (976) Q Consensus 520 ~~~~~k~~~~-~~~i~-~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~ 563 (976) +-|.. |..-| .+.+..||-+|..+.+-+|..|...+... T Consensus 323 -----~yqladwR~rpLp~~Mv~yar~dthyllyiyD~lr~el~~~ 363 (687) T KOG2206 323 -----KYQLADWRIRPLPEEMVRYAREDTHYLLYIYDVLRKELKRL 363 (687) T ss_pred -----HHHHCHHCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf -----66411000365858898887512315899999999999987 No 68 >cd06131 DNA_pol_III_epsilon_Ecoli_like This subfamily is composed of the epsilon subunit of Escherichia coli DNA polymerase III (polIII) and similar proteins. PolIII is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. It is a holoenzyme complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The polIII epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon subunit of polIII also functions as a stabilizer of the holoenzyme complex. Probab=98.89 E-value=3.7e-09 Score=85.38 Aligned_cols=145 Identities=14% Similarity=0.230 Sum_probs=90.7 Q ss_pred CCCCCCCCCCCCCC-CCCCCCEEECCCCCCCCCCCCCCEECC----CC------CCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 23233222433110-012442010257766544345502313----74------43334565001788999974444202 Q gi|254780277|r 390 ASFKIITDTMDAFH-SKPIAIAISTLDQKDDTSFKVETIFID----LS------VHTSQDSTGKNTLTKEILSYLKKFFE 458 (976) Q Consensus 390 ia~DtETt~ld~~~-~~ivGisls~~~~~~~~~~~~~~~yip----~~------~~~~~d~~~~~~~~~evl~~L~~lle 458 (976) ++||+||||++|.. ++|+-|+........... .....++. ++ |....+.....-...+++..|..++. T Consensus 2 Vv~D~ETTGl~p~~~d~Iieigav~~~~~~~~~-~~~~~lv~P~~~i~~~~~~ihGIt~e~l~~~p~~~ev~~~f~~~l~ 80 (167) T cd06131 2 IVLDTETTGLDPREGHRIIEIGCVELINRRLTG-NTFHVYINPERDIPEEAFKVHGITDEFLADKPKFAEIADEFLDFIR 80 (167) T ss_pred EEEECCCCCCCCCCCCEEEEEEEEEEECCEEEC-CEEEEEECCCCCCCHHHHHHHCCCHHHHCCCCCHHHHHHHHHHHCC T ss_conf 899760698898899837999999999999924-4789998978989988997809685898359758999999986356 Q ss_pred CCCCHHHHCCCCHHHHHHH----HCCCCCC-----CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 2210011003212378898----6188666-----421002345543168766772466776224442233565432221 Q gi|254780277|r 459 NEHFLKIGHNIKYDKLVLH----RYGISMR-----GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIP 529 (976) Q Consensus 459 ~~~i~kIgHN~KfD~~~L~----~~GI~l~-----~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~ 529 (976) +.. .|+||++||+.+|. ++|+... .+.||+..+..+.|. .+++|++++++| |.... T Consensus 81 ~~~--lVaHN~~FD~~fL~~~~~r~~~~~~~~~~~~~~Dtl~l~r~~~p~-~~~~L~~l~~~~-gi~~~----------- 145 (167) T cd06131 81 GAE--LVIHNASFDVGFLNAELSLLGLGKKIIDFCRVIDTLALARKKFPG-KPNSLDALCKRF-GIDNS----------- 145 (167) T ss_pred CCE--EEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCC-CCCCHHHHHHHC-CCCCC----------- T ss_conf 999--998371878999999999818997777655411399999998499-988899999876-97999----------- Q ss_pred CHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 000000123333222111112456776 Q gi|254780277|r 530 IDKISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 530 ~~~i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) .....-|..||.+|.+||..| T Consensus 146 ------~~~~H~Al~Da~~ta~vf~~l 166 (167) T cd06131 146 ------HRTLHGALLDAELLAEVYLEL 166 (167) T ss_pred ------CCCCCCHHHHHHHHHHHHHHH T ss_conf ------998758899999999999976 No 69 >TIGR00600 rad2 DNA excision repair protein (rad2); InterPro: IPR001044 Xeroderma pigmentosum (XP) is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. People's skin cells with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-G is one of the most rare and phenotypically heterogeneous of XP, showing anything from slight to extreme dysfunction in DNA excision repair , . XP-G can be corrected by a 133 Kd nuclear protein, XPGC . XPGC is an acidic protein that confers normal UV resistance in expressing cells . It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms , . XPGC cleaves one strand of the duplex at the border with the single-stranded region . XPG belongs to a family of proteins that includes RAD2 from Saccharomyces cerevisiae (Baker's yeast) and rad13 from Schizosaccharomyces pombe (Fission yeast), which are single-stranded DNA endonucleases , ; mouse and human FEN-1, a structure-specific endonuclease; RAD2 from fission yeast and RAD27 from budding yeast; fission yeast exo1, a 5'-3' double-stranded DNA exonuclease that may act in a pathway that corrects mismatched base pairs; yeast DHS1, and yeast DIN7. Sequence alignment of this family of proteins reveals that similarities are largely confined to two regions. The first is located at the N-terminal extremity (N-region) and corresponds to the first 95 to 105 amino acids. The second region is internal (I-region) and found towards the C-terminus; it spans about 140 residues and contains a highly conserved core of 27 amino acids that includes a conserved pentapeptide (E-A-[DE]-A-[QS]). It is possible that the conserved acidic residues are involved in the catalytic mechanism of DNA excision repair in XPG. The amino acids linking the N- and I-regions are not conserved. This entry represents XPGC, an acidic protein that confers normal UV resistance in expressing cells, can correct XP-G. It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms. XPGC cleaves one strand of the duplex at the border with the single-stranded region , .; GO: 0003697 single-stranded DNA binding, 0004519 endonuclease activity, 0006289 nucleotide-excision repair, 0005634 nucleus. Probab=98.87 E-value=4.6e-08 Score=77.41 Aligned_cols=73 Identities=22% Similarity=0.258 Sum_probs=50.9 Q ss_pred CCEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHH-HHCCCCCCCEEEEEEECCCCCHHHHCCHHHHC Q ss_conf 97289996626999887426754468988550069999999999999-84855479879999727488713240865216 Q gi|254780277|r 4 ENHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQ-NSRKESIASHFAVIFDYPAVTFRNEIYPDYKA 82 (976) Q Consensus 4 ~~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~-~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa 82 (976) .+|+|-||.+--||.+-=|..+-..+ -+--.=|.|||+-|+|||= .-+| |.||||+.|.-.++=..-=|+ T Consensus 23 e~K~LAvD~SIWlyQ~LKaVRD~~GN--~i~n~Hl~tlF~RlCKLLFFrIrP-------vFVFDG~aP~LKrqTl~kR~~ 93 (1127) T TIGR00600 23 EGKRLAVDISIWLYQALKAVRDREGN--AIKNSHLLTLFRRLCKLLFFRIRP-------VFVFDGGAPLLKRQTLAKRRQ 93 (1127) T ss_pred CCCEEEHHHHHHHHHHHHHHCCCCCC--EEECCHHHHHHHHHHHHHHCCCCE-------EEEECCCCCCHHHHHHHHHHH T ss_conf 14340124648999876222003687--230506677899999886168731-------688408887246899999987 Q ss_pred CCC Q ss_conf 679 Q gi|254780277|r 83 NRP 85 (976) Q Consensus 83 ~R~ 85 (976) =|. T Consensus 94 R~~ 96 (1127) T TIGR00600 94 RRD 96 (1127) T ss_pred HHH T ss_conf 653 No 70 >PRK07009 consensus Probab=98.84 E-value=4.5e-09 Score=84.73 Aligned_cols=147 Identities=14% Similarity=0.264 Sum_probs=91.1 Q ss_pred CCCCCCCCCCCCCCC-CCCCCCEEECCCCCCCCCCCCCCEECCCCCCC----------CCCCCCCCCCHHHHHHHHHHHH Q ss_conf 523233222433110-01244201025776654434550231374433----------3456500178899997444420 Q gi|254780277|r 389 SASFKIITDTMDAFH-SKPIAIAISTLDQKDDTSFKVETIFIDLSVHT----------SQDSTGKNTLTKEILSYLKKFF 457 (976) Q Consensus 389 ~ia~DtETt~ld~~~-~~ivGisls~~~~~~~~~~~~~~~yip~~~~~----------~~d~~~~~~~~~evl~~L~~ll 457 (976) .+.|||||||++|.. .+|+-|+..-......+.-.. ..|+.-.... ..+.-...-...+|...|..++ T Consensus 3 ~VvlDtETTGl~~~~gdRIIEIg~v~l~~~~~t~~~~-~~yiNP~r~I~~~a~~IHGIt~e~L~~kP~F~eia~efl~Fi 81 (241) T PRK07009 3 QIILDTETTGLNARTGDRIIEIGCVELLNRRLTGNNL-HFYVNPERDSDPGALAVHGLTTEFLSDKPKFAEVVDQIRDFV 81 (241) T ss_pred EEEEEEECCCCCCCCCCEEEEEEEEEEECCEEECEEE-EEEECCCCCCCHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHH T ss_conf 7999825799799999878999999999997804068-998788997997576512257998725855999999999996 Q ss_pred CCCCCHHHHCCCCHHHHHHH----HCCCCC-C----CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 22210011003212378898----618866-6----42100234554316876677246677622444223356543222 Q gi|254780277|r 458 ENEHFLKIGHNIKYDKLVLH----RYGISM-R----GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSI 528 (976) Q Consensus 458 e~~~i~kIgHN~KfD~~~L~----~~GI~l-~----~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~ 528 (976) .+..+ |+||+.||+.+|. +.|... . .+.||+.+|.-+.|++ +++|++||+||- +... T Consensus 82 ~~a~L--VaHNA~FD~~FLn~EL~r~G~~~~~~~~~~viDTL~lAR~~~Pg~-~~sLdaLc~R~~-I~~~---------- 147 (241) T PRK07009 82 QDAEL--IIHNAPFDLGFLDAEFALLGLPPFTEHCAGVIDTLVQAKQMFPGK-RNSLDALCDRFG-ISNA---------- 147 (241) T ss_pred CCCEE--EEECHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCC-CCCHHHHHHHCC-CCCC---------- T ss_conf 69927--886769999999999998399731024444767999999978999-878899999828-6988---------- Q ss_pred CCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 10000001233332221111124567763 Q gi|254780277|r 529 PIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 529 ~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) .....-|..||.++..+|-.|. T Consensus 148 -------~r~~HgAL~Da~lLa~vyl~mt 169 (241) T PRK07009 148 -------HRTLHGALLDSELLAEVYLAMT 169 (241) T ss_pred -------CCCCCCCHHHHHHHHHHHHHHH T ss_conf -------7330242877999999999983 No 71 >PRK05711 DNA polymerase III subunit epsilon; Provisional Probab=98.83 E-value=8.8e-09 Score=82.63 Aligned_cols=148 Identities=12% Similarity=0.213 Sum_probs=91.0 Q ss_pred CCCCCCCCCCCCCCCC-CCCCCCEEECCCCCCCCCCCCCCEECCCCCCC----------CCCCCCCCCCHHHHHHHHHHH Q ss_conf 8523233222433110-01244201025776654434550231374433----------345650017889999744442 Q gi|254780277|r 388 GSASFKIITDTMDAFH-SKPIAIAISTLDQKDDTSFKVETIFIDLSVHT----------SQDSTGKNTLTKEILSYLKKF 456 (976) Q Consensus 388 ~~ia~DtETt~ld~~~-~~ivGisls~~~~~~~~~~~~~~~yip~~~~~----------~~d~~~~~~~~~evl~~L~~l 456 (976) ..+++||||||++|.. .+|+-|+..-..+.....-. ...|+.-.... ..+.-...-...+++..|..+ T Consensus 5 R~VvlDtETTGl~~~~gdRIIEIgav~~~~~~~t~~~-f~~~inP~r~i~~~a~~IhGIT~e~L~~~P~F~ei~~efl~F 83 (240) T PRK05711 5 RQIVLDTETTGLNQREGHRIIEIGAVELINRRLTGRN-FHVYIKPDRDVDPEALAVHGITDEFLADKPTFAEVADEFLDF 83 (240) T ss_pred CEEEEEEECCCCCCCCCCEEEEEEEEEEECCEECCCE-EEEEECCCCCCCHHHHHHCCCCHHHHCCCCCHHHHHHHHHHH T ss_conf 2899986689969999987899999999999881435-789989689599778754086889964587599999999998 Q ss_pred HCCCCCHHHHCCCCHHHHHHH----HCCCCC---C---CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 022210011003212378898----618866---6---421002345543168766772466776224442233565432 Q gi|254780277|r 457 FENEHFLKIGHNIKYDKLVLH----RYGISM---R---GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKS 526 (976) Q Consensus 457 le~~~i~kIgHN~KfD~~~L~----~~GI~l---~---~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~ 526 (976) +.+..+ |+||+.||+.+|. +.|... . .+.||+..|.-+.|+. +++|++||+||- +... T Consensus 84 i~~s~l--VaHNa~FD~~FLn~El~r~g~~~~~~~~~~~viDTL~lAR~~~Pg~-~~sLdaLc~r~~-I~~s-------- 151 (240) T PRK05711 84 IRGAEL--IIHNAPFDIGFMDYEFALLGRDIPKTTTFCKVTDTLAMARRMFPGK-RNSLDALCKRYG-IDNS-------- 151 (240) T ss_pred HCCCEE--EEECCHHHHHHHHHHHHHHCCCCCCHHHHCCHHHHHHHHHHHCCCC-CCCHHHHHHHCC-CCCC-------- T ss_conf 189979--9966199899999999986899876445212101999999878999-889999999809-8988-------- Q ss_pred CCCCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2210000001233332221111124567763 Q gi|254780277|r 527 SIPIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 527 ~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) .....-|..||.++..+|-.|. T Consensus 152 ---------~R~~HgAL~Da~llA~Vyl~l~ 173 (240) T PRK05711 152 ---------HRTLHGALLDAEILAEVYLAMT 173 (240) T ss_pred ---------CCCCCCHHHHHHHHHHHHHHHC T ss_conf ---------7565764888999999999971 No 72 >PRK06022 consensus Probab=98.83 E-value=6.5e-09 Score=83.62 Aligned_cols=148 Identities=14% Similarity=0.203 Sum_probs=91.3 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCC----------CCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 523233222433110012442010257766544345502313744----------3334565001788999974444202 Q gi|254780277|r 389 SASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSV----------HTSQDSTGKNTLTKEILSYLKKFFE 458 (976) Q Consensus 389 ~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~----------~~~~d~~~~~~~~~evl~~L~~lle 458 (976) .+.|||||||+++..++|+-|+..-..+.....-....+.-|-.. ....+.-...-...+|...|..++. T Consensus 3 ~IvlDtETTGl~~~~dRIIEIg~vel~~~~~t~~~f~~~inP~~r~I~~~a~~IHGIt~e~L~~~P~F~eva~~fl~Fi~ 82 (234) T PRK06022 3 EIIFDTETTGLERREDRVIEIGGVELVNRFPTGRTFHKFINPQGRQVHPDALAVHGISDEQLLDKPVFAEILDEFLEFFD 82 (234) T ss_pred EEEEEEECCCCCCCCCCEEEEEEEEEECCCCCCCEEEEEECCCCCCCCHHHHHHHCCCHHHHCCCCCHHHHHHHHHHHHC T ss_conf 69998227998998970799999999778633657899989999868988998856089987059979999999999818 Q ss_pred CCCCHHHHCCCCHHHHHHH----HCCCCC-C--CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCH Q ss_conf 2210011003212378898----618866-6--42100234554316876677246677622444223356543222100 Q gi|254780277|r 459 NEHFLKIGHNIKYDKLVLH----RYGISM-R--GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPID 531 (976) Q Consensus 459 ~~~i~kIgHN~KfD~~~L~----~~GI~l-~--~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~ 531 (976) +.. .|+||+.||+.+|. +.|... . .+.||+-+|.-+.|+. +++|++||+||- +... T Consensus 83 ~a~--LVaHNA~FD~gFLn~El~r~G~~~~~~~~viDTL~LAR~~~Pg~-~~sLdaLc~r~~-Id~~------------- 145 (234) T PRK06022 83 GAR--LVAHNATFDLGFINAEFARLGQPEITSERVVDTLALARRKHPMG-PNSLDALCRRYG-IDNS------------- 145 (234) T ss_pred CCE--EEEECHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC-CCCHHHHHHHCC-CCCC------------- T ss_conf 985--99955787299999999985998777888746999999868899-889899999839-9977------------- Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00001233332221111124567763 Q gi|254780277|r 532 KISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 532 ~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) .....-|..||.++..+|-.|. T Consensus 146 ----~R~~HgAL~Da~lLa~VyleL~ 167 (234) T PRK06022 146 ----HRTKHGALLDSELLAEVYIELI 167 (234) T ss_pred ----CCCCCCCHHHHHHHHHHHHHHC T ss_conf ----7353762888999999999972 No 73 >PRK06195 DNA polymerase III subunit epsilon; Validated Probab=98.81 E-value=1.3e-08 Score=81.36 Aligned_cols=157 Identities=17% Similarity=0.246 Sum_probs=97.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCC---C---CCEECCCC---CCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 852323322243311001244201025776654434---5---50231374---43334565001788999974444202 Q gi|254780277|r 388 GSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFK---V---ETIFIDLS---VHTSQDSTGKNTLTKEILSYLKKFFE 458 (976) Q Consensus 388 ~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~---~---~~~yip~~---~~~~~d~~~~~~~~~evl~~L~~lle 458 (976) .++++|+||++ +..+.++-|++....+....--+ + +..+.|.. |....+.....-...+|+..+..++. T Consensus 2 ~fvviD~ETt~--~~~d~iieIg~v~v~~g~iv~~~~~Li~P~~~~~~p~~i~ihGIt~~~v~~aP~f~ev~~~~~~fi~ 79 (309) T PRK06195 2 DFVAIDFETAN--EKRNSPCSIGIVVVKDGEIVEKVHYLIKPKEMRFMPINIGIHGIRPHMVQDELEFDKIWEKIKDYFN 79 (309) T ss_pred CEEEEEEECCC--CCCCEEEEEEEEEEECCEEEEEEEEEECCCCCCCCCCEEEECCCCHHHHHCCCCHHHHHHHHHHHHC T ss_conf 29999978999--9998059999999999999999999989999888874464037799999659999999999999857 Q ss_pred CCCCHHHHCCCCHHHHHHH----HCCCCCCC--CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHH Q ss_conf 2210011003212378898----61886664--21002345543168766772466776224442233565432221000 Q gi|254780277|r 459 NEHFLKIGHNIKYDKLVLH----RYGISMRG--FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDK 532 (976) Q Consensus 459 ~~~i~kIgHN~KfD~~~L~----~~GI~l~~--i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~ 532 (976) +. ..|+||+.||+.+|. ++|+.+.. +.||+-.|.-+.|+..+|+|.+||+++ |+..+ T Consensus 80 ~~--vlVaHNa~FD~~fL~~~~~r~gl~~~~~~~~cTl~LAR~~~p~l~~~kL~~La~~~-gi~~~-------------- 142 (309) T PRK06195 80 DN--LVIAHNASFDISVLRKTLELYNIPMPDFEYICTMKLAKNFYSNIPNARLNTVNNFL-GYEFK-------------- 142 (309) T ss_pred CC--EEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHH-CCCCC-------------- T ss_conf 99--79995628889999999998499999997774799999865577657888999985-98801-------------- Q ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0001233332221111124567763222100013457 Q gi|254780277|r 533 ISDSQVQEHAIENSNVILQLWLLLRPKLIVEKLLHVY 569 (976) Q Consensus 533 i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~ 569 (976) ..-|..||.++-.+...+.++.....+.+++ T Consensus 143 ------hH~Al~DA~A~a~I~~~~~~~~~~~~l~el~ 173 (309) T PRK06195 143 ------HHDALEDAMACSNILLNISKELNSKSIEEIS 173 (309) T ss_pred ------CCCCCCCHHHHHHHHHHHHHHHCCCCHHHHH T ss_conf ------1685101999999999999981887789997 No 74 >PRK08816 consensus Probab=98.76 E-value=1.2e-08 Score=81.72 Aligned_cols=146 Identities=12% Similarity=0.202 Sum_probs=89.0 Q ss_pred CCCCCCCCCCCCCCC-CCCCCCEEE-CCCCCCCCCCCCCCEECCCCCCC----------CCCCCCCCCCHHHHHHHHHHH Q ss_conf 523233222433110-012442010-25776654434550231374433----------345650017889999744442 Q gi|254780277|r 389 SASFKIITDTMDAFH-SKPIAIAIS-TLDQKDDTSFKVETIFIDLSVHT----------SQDSTGKNTLTKEILSYLKKF 456 (976) Q Consensus 389 ~ia~DtETt~ld~~~-~~ivGisls-~~~~~~~~~~~~~~~yip~~~~~----------~~d~~~~~~~~~evl~~L~~l 456 (976) .+.+||||||++|.. .+|+-|+.. ..++. ... .....|+.-.... ..+.-...-...+|+..|..+ T Consensus 3 ~VVlDtETTGl~~~~gdRIIEIgaV~~~~~~-~~~-~~f~~~inP~r~I~~~a~~vhGIT~e~L~dkP~F~eV~~efl~F 80 (244) T PRK08816 3 QIILDTETTGLEWRKGNRVVEIGAVELLERR-PSG-NNFHRYLKPDCDFEPGAQEVTGLTLEFLADKPLFGEVVDEFLAY 80 (244) T ss_pred EEEEEEECCCCCCCCCCEEEEEEEEEEECCE-ECC-CEEEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHHHHHHHHHHH T ss_conf 7999722799899899889999999999996-814-37899868899699889875251789984099899999999998 Q ss_pred HCCCCCHHHHCCCCHHHHHHH----HCCCC------CCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 022210011003212378898----61886------66421002345543168766772466776224442233565432 Q gi|254780277|r 457 FENEHFLKIGHNIKYDKLVLH----RYGIS------MRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKS 526 (976) Q Consensus 457 le~~~i~kIgHN~KfD~~~L~----~~GI~------l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~ 526 (976) +.+..+ |+||+.||+.+|. +.|.. ...+.||+.+|.-..|++ +++|++||+||- +.. +. T Consensus 81 i~ds~L--VaHNa~FDi~FLn~El~r~g~~~~~~~~~~~v~DTl~lAr~~~Pg~-~nsLDaLc~r~~-id~-----~~-- 149 (244) T PRK08816 81 IDGAEL--IIHNAAFDLGFLDNELSLLGDQYGRIVDRATVVDTLMMARERYPGQ-RNSLDALCKRLG-VDN-----SH-- 149 (244) T ss_pred HCCCEE--EEECCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHCCCC-CCCHHHHHHHCC-CCC-----CC-- T ss_conf 289989--9966365298999999984887567643101043899999878799-778999999749-788-----77-- Q ss_pred CCCCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2210000001233332221111124567763 Q gi|254780277|r 527 SIPIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 527 ~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) ....-|..||.+...+|-.|. T Consensus 150 ----------R~~HgALlDa~lLa~VYl~mt 170 (244) T PRK08816 150 ----------RQLHGALLDAQILADVYIALT 170 (244) T ss_pred ----------CCCCCHHHHHHHHHHHHHHHC T ss_conf ----------350148887999999999974 No 75 >TIGR01406 dnaQ_proteo DNA polymerase III, epsilon subunit; InterPro: IPR006309 These represent DnaQ, the DNA polymerase III epsilon subunit, as found in most Proteobacteria. It consists largely of an exonuclease region as described in IPR006055 from INTERPRO. In Gram-positive bacteria, closely related regions are found both in the Gram-positive type DNA polymerase III alpha subunit and as an additional N-terminal domain of a DinG-family helicase. Both are not included in this set of sequences, as they are smaller proteins.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=98.75 E-value=3.1e-08 Score=78.68 Aligned_cols=144 Identities=17% Similarity=0.250 Sum_probs=90.5 Q ss_pred CCCCCCCCCCCCCCC-----C-CCCCCE---EECCCCCCCCCCCCCCEECCCCCCCCCC-----------CCCCCCC-HH Q ss_conf 523233222433110-----0-124420---1025776654434550231374433345-----------6500178-89 Q gi|254780277|r 389 SASFKIITDTMDAFH-----S-KPIAIA---ISTLDQKDDTSFKVETIFIDLSVHTSQD-----------STGKNTL-TK 447 (976) Q Consensus 389 ~ia~DtETt~ld~~~-----~-~ivGis---ls~~~~~~~~~~~~~~~yip~~~~~~~d-----------~~~~~~~-~~ 447 (976) .+.+||||||+|+.. + .||=|+ +-... .......+|+.-......+ ..+++.+ -+ T Consensus 2 ~IiLDTETTGl~~~~G~~~~GhRivEIGavE~vnr~----~tG~~fH~Y~~P~Rd~p~eA~kVHGiT~efL~dKnaP~Fk 77 (246) T TIGR01406 2 EIILDTETTGLDPKGGLPYIGHRIVEIGAVELVNRM----LTGKNFHVYVKPERDIPAEAAKVHGITDEFLKDKNAPKFK 77 (246) T ss_pred EEEECCCCCCCCCCCCCCCCCCEEEEEEEEEEECCC----CCCCCEEEEECCCCCCHHHCEECCCCCHHHHCCCCCCCHH T ss_conf 267115556877667768568626888218567350----3887312433688877300210137646884688989525 Q ss_pred HHHHHHHHHHCCCCCHHHHCCCCHHHHHH----HHCC---CC-CC---CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCC Q ss_conf 99974444202221001100321237889----8618---86-66---42100234554316876677246677622444 Q gi|254780277|r 448 EILSYLKKFFENEHFLKIGHNIKYDKLVL----HRYG---IS-MR---GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYT 516 (976) Q Consensus 448 evl~~L~~lle~~~i~kIgHN~KfD~~~L----~~~G---I~-l~---~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~ 516 (976) +|-+.|-.|..+..+ |.|||+||+.|| .+.| +. +. .+.||+.||.=.-||+ +++|++||+||- +. T Consensus 78 ~Ia~~Fl~Fi~~~~L--vIHNA~FD~gFlN~EL~~Lg~~~~~~~~~~~~~~DTL~mAR~~FPG~-~~sLDALCkRf~-vD 153 (246) T TIGR01406 78 EIADEFLDFIKGSEL--VIHNAAFDVGFLNYELERLGLKEIKKISDFCRVIDTLAMARERFPGK-RNSLDALCKRFK-VD 153 (246) T ss_pred HHHHHHHHHHCCCEE--EEECCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCC-CCCHHHHHHHHC-CC T ss_conf 789999998459889--99757021678999998728732100013667865689997237898-664678774304-25 Q ss_pred CCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 22335654322210000001233332221111124567763 Q gi|254780277|r 517 RKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 517 ~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) .+ .....-|.-||.+...+|-.|. T Consensus 154 -----ns------------~R~LHGAL~DA~LLAeVYl~lt 177 (246) T TIGR01406 154 -----NS------------HRQLHGALLDAELLAEVYLALT 177 (246) T ss_pred -----CC------------CCHHHHHHHHHHHHHHHHHHHC T ss_conf -----00------------0213478999999999999862 No 76 >PRK05168 ribonuclease T; Provisional Probab=98.75 E-value=5.5e-08 Score=76.85 Aligned_cols=165 Identities=14% Similarity=0.078 Sum_probs=93.0 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCC---CCCCCCCCCCCEECCC------C------CCCCCCCCCC-CCC Q ss_conf 9998668523233222433110012442010257---7665443455023137------4------4333456500-178 Q gi|254780277|r 382 QKLETIGSASFKIITDTMDAFHSKPIAIAISTLD---QKDDTSFKVETIFIDL------S------VHTSQDSTGK-NTL 445 (976) Q Consensus 382 ~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~---~~~~~~~~~~~~yip~------~------~~~~~d~~~~-~~~ 445 (976) +.....-.++||+||||+++..+.|+-|+..... .+..........|+.- . +....+.... ... T Consensus 12 ~rfr~~~pVV~D~ETTGl~p~~D~IIEIGAV~v~~~e~g~i~~~~~f~~~V~P~~g~~I~~~~~~ltGI~~~~~l~~a~~ 91 (213) T PRK05168 12 DRFRGFLPVVIDVETAGFNAQTDALLEIAAITLKMDEQGWLYPDETLHFHVEPFEGANLEPEALAFNGIDPDNPLRGAVS 91 (213) T ss_pred HHHCCCCCEEEEEECCCCCCCCCCEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCHHHHHHCCCCCCCCCCCCCC T ss_conf 98558887599830799887788558888899886378867247548898788878878866751126786532223666 Q ss_pred HHHHH----HHHHHHHCCC---CCHHHHCCCCHHHHHHH----HCCCCCC-----CCCCHHHHHHHHCCCCCCCCHHHHH Q ss_conf 89999----7444420222---10011003212378898----6188666-----4210023455431687667724667 Q gi|254780277|r 446 TKEIL----SYLKKFFENE---HFLKIGHNIKYDKLVLH----RYGISMR-----GFEDIMLMSYVLDSGRSSHDIASIA 509 (976) Q Consensus 446 ~~evl----~~L~~lle~~---~i~kIgHN~KfD~~~L~----~~GI~l~-----~i~DTmlaayLLdp~~~~~~L~~la 509 (976) ..+++ +.+.+.+... .-..|+||+.||+.+|. +.|+.-. .++||+-.|.++-+ .+.|+.+| T Consensus 92 ~~~al~~~~~~i~~~~~~~~c~~avLVaHNA~FD~~FLn~~~~r~g~~~~P~~~~~viDTl~Lar~~~~---~~~L~~lc 168 (213) T PRK05168 92 EKEALHEIFKPVRKGIKAAGCQRAILVAHNAHFDLGFLNAAAERTGLKRNPFHPFSTFDTATLAGLAYG---QTVLAKAC 168 (213) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHHC---CCCHHHHH T ss_conf 788789999999999875046756377634187499999999984997788887763358998688748---74099999 Q ss_pred HHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7622444223356543222100000012333322211111245677632221000134 Q gi|254780277|r 510 QKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVEKLLH 567 (976) Q Consensus 510 ~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~ 567 (976) ++| ++... .....-|..||.+|.+||-.|..+++..+-+. T Consensus 169 ~~~-gi~~~-----------------~~~~HrAl~DA~~tA~vFl~ll~~l~~~g~~~ 208 (213) T PRK05168 169 QAA-GIEFD-----------------NKEAHSALYDTEKTAELFCEIVNRWKRLGGWP 208 (213) T ss_pred HHC-CCCCC-----------------CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 986-99999-----------------97777738999999999999999999738999 No 77 >PRK08858 consensus Probab=98.75 E-value=1.4e-08 Score=81.14 Aligned_cols=148 Identities=11% Similarity=0.221 Sum_probs=89.4 Q ss_pred CCCCCCCCCCCCCCCC------CCCCCCEEECCCCCCCCCCCCCCEECCCCC----------CCCCCCCCCCCCHHHHHH Q ss_conf 8523233222433110------012442010257766544345502313744----------333456500178899997 Q gi|254780277|r 388 GSASFKIITDTMDAFH------SKPIAIAISTLDQKDDTSFKVETIFIDLSV----------HTSQDSTGKNTLTKEILS 451 (976) Q Consensus 388 ~~ia~DtETt~ld~~~------~~ivGisls~~~~~~~~~~~~~~~yip~~~----------~~~~d~~~~~~~~~evl~ 451 (976) ..+.+||||||+++.. .+|+-|+..-..+.....-.. ..|+.-.. ....+.-...-...+|+. T Consensus 11 R~VVlDtETTGl~~~~~~~~~gdRIIEIgaV~i~~~~~t~~~f-~~~InP~r~Ip~~a~~ihGIT~e~L~daP~F~eI~~ 89 (246) T PRK08858 11 RIVVLDTETTGMNREGGPHYEGHRIIEIGAVEIINRKLTGRHF-HVYLKPDREIQPEAIDVHGITDEFLVDKPEYKDVHQ 89 (246) T ss_pred EEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCEEECCEE-EEEECCCCCCCHHHHHHCCCCHHHHHCCCCHHHHHH T ss_conf 7999985789999765677789879999999999999822147-888897996998899773928999835975899999 Q ss_pred HHHHHHCCCCCHHHHCCCCHHHHHHH----HCCCCC------CCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCC Q ss_conf 44442022210011003212378898----618866------64210023455431687667724667762244422335 Q gi|254780277|r 452 YLKKFFENEHFLKIGHNIKYDKLVLH----RYGISM------RGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEIL 521 (976) Q Consensus 452 ~L~~lle~~~i~kIgHN~KfD~~~L~----~~GI~l------~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~ 521 (976) .|..++.+..+ |+||+.||+.+|. +.|..+ ..+.||+.+|..+.|+. +++|++||+||- +.. T Consensus 90 efl~FI~d~~L--VaHNa~FD~~FLn~El~r~~~~~~~~~~~~~viDTL~lAR~~~Pg~-~~sLdaLc~r~~-Id~---- 161 (246) T PRK08858 90 EFLEFIKGAEL--VAHNAPFDVGFMDYEFEKLNPAIGKTDDYCKVTDTLAMAKKIFPGK-RNNLDVLCERYG-IDN---- 161 (246) T ss_pred HHHHHHCCCEE--EEECCHHHHHHHHHHHHHHCCCCCCHHHHCCCEEHHHHHHHHCCCC-CCCHHHHHHHCC-CCC---- T ss_conf 99999589989--9972288899999999982877675134201002999999868999-888899999819-898---- Q ss_pred CCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 654322210000001233332221111124567763 Q gi|254780277|r 522 KSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 522 ~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) + .....-|.-||.++..+|-.|. T Consensus 162 -s------------~R~~HgALlDa~lLa~Vyl~mt 184 (246) T PRK08858 162 -S------------HRTLHGALLDAEILADVYLLMT 184 (246) T ss_pred -C------------CCCCCCHHHHHHHHHHHHHHHC T ss_conf -8------------8340230888999999999972 No 78 >KOG2520 consensus Probab=98.74 E-value=8.1e-07 Score=68.32 Aligned_cols=27 Identities=19% Similarity=0.204 Sum_probs=10.5 Q ss_pred CCCCCCCCCCHHHHH---HHHHCCCHHHHH Q ss_conf 899877636024998---984136789998 Q gi|254780277|r 192 NIPGIPGIGYKTAAL---LLQEYGNLENIL 218 (976) Q Consensus 192 nipGv~GiG~KtA~~---ll~~~gsle~i~ 218 (976) +-|+|+-.+..+|+. ..+.++..+++. T Consensus 63 ~~~~~~p~~~~~a~~~~~k~~~~~~~e~~d 92 (815) T KOG2520 63 SSPDVTPLELRAAVIDFQKRQNRSPDEKID 92 (815) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCHHHHCC T ss_conf 556788741467788898763787132100 No 79 >TIGR01407 dinG_rel DnaQ family exonuclease/DinG family helicase, putative; InterPro: IPR006310 These are a family of proteins in Gram-positive bacteria. The N-terminal region of about 200 amino acids resembles the epsilon subunit of E. coli DNA polymerase III and the homologous region of the Gram-positive type DNA polymerase III alpha subunit. The epsilon subunit contains an exonuclease domain. The remainder of this protein family resembles a predicted ATP-dependent helicase, the DNA damage-inducible protein DinG of Escherichia coli. . Probab=98.72 E-value=3.3e-07 Score=71.16 Aligned_cols=148 Identities=18% Similarity=0.246 Sum_probs=103.1 Q ss_pred CCCCCCCCCCCCCCC-CCCCCCEEECCCCCCCCCCCCCCEECCCCCCCC-----------CCCCCCCCC-HHHHHHHHHH Q ss_conf 523233222433110-012442010257766544345502313744333-----------456500178-8999974444 Q gi|254780277|r 389 SASFKIITDTMDAFH-SKPIAIAISTLDQKDDTSFKVETIFIDLSVHTS-----------QDSTGKNTL-TKEILSYLKK 455 (976) Q Consensus 389 ~ia~DtETt~ld~~~-~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~-----------~d~~~~~~~-~~evl~~L~~ 455 (976) ++.+|+||||-++-. ++|+-|++........+ ++|--.++.+.. .|....+.+ -.+|...+.. T Consensus 2 YaVVDLEtTg~~~~~~d~IIqiGiV~v~DgeIv----~~f~T~VNP~~~~pp~I~eLTGisd~~l~~AP~F~qVA~~i~~ 77 (944) T TIGR01407 2 YAVVDLETTGTDSSEFDKIIQIGIVLVEDGEIV----DTFATDVNPNEPIPPFIQELTGISDEQLKQAPYFSQVAQEIYD 77 (944) T ss_pred EEEEEEEECCCCCCCCCCEEEEEEEEEECCCEE----EECCCCCCCCCCCCCHHHHHCCCCHHHHCCCCCHHHHHHHHHH T ss_conf 689987416888645576689988998558143----2012677889877611454228776864048888799999998 Q ss_pred HHCCCCCHHHHCCCCHHHHHHHH----CCCCCCC--CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 20222100110032123788986----1886664--21002345543168766772466776224442233565432221 Q gi|254780277|r 456 FFENEHFLKIGHNIKYDKLVLHR----YGISMRG--FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIP 529 (976) Q Consensus 456 lle~~~i~kIgHN~KfD~~~L~~----~GI~l~~--i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~ 529 (976) +++|. .+|+||+.||+.+|.+ .|+.+.. +.||.=.+.+.=|...+|+|..||+. ||..-+ T Consensus 78 ~l~D~--iFVAHNV~FD~nfL~k~l~~~G~~~~~~~~iDTVELa~if~PT~esY~Ls~L~E~-LGl~h~----------- 143 (944) T TIGR01407 78 LLEDG--IFVAHNVHFDLNFLAKELVDEGYEPLEVPRIDTVELARIFFPTEESYKLSELSEE-LGLEHE----------- 143 (944) T ss_pred HHCCC--EEEEECHHCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHH-HCCCCC----------- T ss_conf 74297--2776101506789999998568974343520056799874674435561489997-198714----------- Q ss_pred CHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 000000123333222111112456776322210 Q gi|254780277|r 530 IDKISDSQVQEHAIENSNVILQLWLLLRPKLIV 562 (976) Q Consensus 530 ~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~ 562 (976) . .+-|-.||.+|..|--.+.++++. T Consensus 144 -------n-pH~AdSDA~~TAeLLLl~~~K~~~ 168 (944) T TIGR01407 144 -------N-PHRADSDAQVTAELLLLLEEKMKK 168 (944) T ss_pred -------C-CCCCCHHHHHHHHHHHHHHHHHHH T ss_conf -------6-676105789999999999999875 No 80 >PRK06859 consensus Probab=98.69 E-value=1.9e-08 Score=80.22 Aligned_cols=122 Identities=13% Similarity=0.212 Sum_probs=77.3 Q ss_pred CCCCCCCCCCCCCCCCC-----CCCCCCEEE-CCCCCCCCCCCCCCEECCCCCC----------CCCCCCCCCCCHHHHH Q ss_conf 68523233222433110-----012442010-2577665443455023137443----------3345650017889999 Q gi|254780277|r 387 IGSASFKIITDTMDAFH-----SKPIAIAIS-TLDQKDDTSFKVETIFIDLSVH----------TSQDSTGKNTLTKEIL 450 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~-----~~ivGisls-~~~~~~~~~~~~~~~yip~~~~----------~~~d~~~~~~~~~evl 450 (976) ...+.+||||||++|.. .+|+-|+.. ..++. .+.-. ...|+.-... ...+.-...-...+|. T Consensus 5 ~R~VvlDtETTGl~p~~~~~~g~RIIEIgaVel~~~~-~t~~~-f~~yinP~r~I~~~a~~vhGIT~e~L~~kP~F~eia 82 (253) T PRK06859 5 NRQIVLDTETTGMNQFGAHYEGHCIIEIGAVELINRR-YTGNN-FHIYIKPDRPVDPDAIKVHGITDEMLADKPEFKEVA 82 (253) T ss_pred CCEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEECCE-ECCCE-EEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHHHHH T ss_conf 8379997217999987666689868999999999997-82404-678878699499889987450899983699799999 Q ss_pred HHHHHHHCCCCCHHHHCCCCHHHHHHH----HCCCCC--C---CCCCHHHHHHHHCCCCCCCCHHHHHHHHC Q ss_conf 744442022210011003212378898----618866--6---42100234554316876677246677622 Q gi|254780277|r 451 SYLKKFFENEHFLKIGHNIKYDKLVLH----RYGISM--R---GFEDIMLMSYVLDSGRSSHDIASIAQKWL 513 (976) Q Consensus 451 ~~L~~lle~~~i~kIgHN~KfD~~~L~----~~GI~l--~---~i~DTmlaayLLdp~~~~~~L~~la~~yL 513 (976) ..|..++.+..+ |+||+.||+.+|. +.|... . .+.||+.+|.-+.|++ +++|++||+||- T Consensus 83 ~efl~Fi~~a~L--VaHNa~FD~~FLn~El~r~g~~~~~~~~~~viDTL~lAR~~~Pg~-~~sLdaLc~r~~ 151 (253) T PRK06859 83 QEFIDYIKGAEL--LIHNAPFDVGFMDYEFRKLNLNVKTDDICLVTDTLQMARQMYPGK-RNNLDALCDRLG 151 (253) T ss_pred HHHHHHHCCCEE--EEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHCCCC-CCCHHHHHHHCC T ss_conf 999999679979--995718889999999997499965322001133999999878799-779999999739 No 81 >PRK07982 consensus Probab=98.68 E-value=3.4e-08 Score=78.31 Aligned_cols=148 Identities=9% Similarity=0.166 Sum_probs=88.2 Q ss_pred CCCCCCCCCCCCCC-----CCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCC----------CCCCCCCCCCHHHHHHH Q ss_conf 85232332224331-----1001244201025776654434550231374433----------34565001788999974 Q gi|254780277|r 388 GSASFKIITDTMDA-----FHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHT----------SQDSTGKNTLTKEILSY 452 (976) Q Consensus 388 ~~ia~DtETt~ld~-----~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~----------~~d~~~~~~~~~evl~~ 452 (976) ..+.+||||||++| ...+|+-|+..-........-. ...|+.-.... ..+.-...-...+|... T Consensus 7 R~ivlDtETTGL~p~~~~~~gdRIIEIgaVei~~~~~t~~~-fh~yiNP~r~I~~~a~~VHGIT~e~L~dkP~F~eI~~e 85 (243) T PRK07982 7 RQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVINRRLTGNN-FHVYLKPDRLVDPEAFGVHGIADEFLLDKPTFADVADE 85 (243) T ss_pred EEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEECCEECCCE-EEEEECCCCCCCHHHHHHEEECHHHHHCCCCHHHHHHH T ss_conf 58999711799998776778987999999999999882523-43788889979988985211577887169989999999 Q ss_pred HHHHHCCCCCHHHHCCCCHHHHHHHH----CCCCC---C---CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCC Q ss_conf 44420222100110032123788986----18866---6---42100234554316876677246677622444223356 Q gi|254780277|r 453 LKKFFENEHFLKIGHNIKYDKLVLHR----YGISM---R---GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILK 522 (976) Q Consensus 453 L~~lle~~~i~kIgHN~KfD~~~L~~----~GI~l---~---~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~ 522 (976) |..|+.+..+ |+||+.||+.+|.. .|-.+ . .+.||+.+|.-..|++ +++|++||+||- +.. T Consensus 86 fl~Fi~~a~L--VaHNA~FD~~FLn~El~r~g~~~~~~~~~~~v~DTL~lAR~~~Pg~-~~sLDaLc~R~~-Id~----- 156 (243) T PRK07982 86 FLDYIRGAEL--VIHNAAFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLALARKMFPGK-RNSLDALCDRYE-IDN----- 156 (243) T ss_pred HHHHHCCCEE--EEECCHHHHHHHHHHHHHCCCCCCCCCCCCCEEEHHHHHHHHCCCC-CCCHHHHHHHCC-CCC----- T ss_conf 9999679989--9973388899999999970889886101243134999988878898-689999987548-776----- Q ss_pred CCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 54322210000001233332221111124567763 Q gi|254780277|r 523 SRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 523 ~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) + .....-|.-||.+..++|-.|. T Consensus 157 s------------~R~~HgALlDa~lLAeVYl~lt 179 (243) T PRK07982 157 S------------KRTLHGALLDAQILAEVYLAMT 179 (243) T ss_pred C------------CCCCCCHHHHHHHHHHHHHHHC T ss_conf 5------------6552333774889999999972 No 82 >PRK07247 DNA polymerase III subunit epsilon; Validated Probab=98.68 E-value=4.8e-08 Score=77.25 Aligned_cols=155 Identities=16% Similarity=0.219 Sum_probs=97.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEEC----CCCCCC------CCCCCCCCCCHHHHHHHHHHH Q ss_conf 6852323322243311001244201025776654434550231----374433------345650017889999744442 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFI----DLSVHT------SQDSTGKNTLTKEILSYLKKF 456 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yi----p~~~~~------~~d~~~~~~~~~evl~~L~~l 456 (976) ..++++|+||||+++. ++|+-|+...-.+....... ..++ |++... ..+.-...-...+|+..|..| T Consensus 5 ~~yvv~DlETTg~~~~-~~IIEIgAVk~~~g~ivd~F--~slVnP~~~I~~~i~~lTGIt~~~v~~aP~~~eVl~~f~~F 81 (195) T PRK07247 5 ETYIAFDLEFNTVNDV-SHIIQVSAVKYDHHKEVDSF--DTYVYTDVPLQSFINGLTGITADKIAAAPKVEEVLAAFKNF 81 (195) T ss_pred CEEEEEEEECCCCCCC-CEEEEEEEEEEECCEEEEEE--EEEECCCCCCCHHHHHCCCCCHHHHHCCCCHHHHHHHHHHH T ss_conf 8499999778999999-80699999999899899999--83566989999555413598999983599999999999999 Q ss_pred HCCCCCHHHHCCC-CHHHHHHHHCCCCCC-----CCCCHHHHH-HHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 0222100110032-123788986188666-----421002345-543168766772466776224442233565432221 Q gi|254780277|r 457 FENEHFLKIGHNI-KYDKLVLHRYGISMR-----GFEDIMLMS-YVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIP 529 (976) Q Consensus 457 le~~~i~kIgHN~-KfD~~~L~~~GI~l~-----~i~DTmlaa-yLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~ 529 (976) +.+.- .||||+ +||+.+|...+..+. .++|+...+ .++-|+-.+|+|+.||+. |+... T Consensus 82 igd~~--lVaHNa~~FD~~fL~~~~~~~~~~~~~~~~~~~~~~r~~~~p~l~n~kL~tLa~~-f~I~~------------ 146 (195) T PRK07247 82 VGELP--LIGYNAQKSDLPILAENGLDLRDQYQVDLFDEAFDRRSSDLNGIANLKLQTVATF-LGIKG------------ 146 (195) T ss_pred HCCCE--EEEECCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHH-CCCCC------------ T ss_conf 68991--8990884514999999877415776685699999998860588767778999997-49999------------ Q ss_pred CHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00000012333322211111245677632221000134 Q gi|254780277|r 530 IDKISDSQVQEHAIENSNVILQLWLLLRPKLIVEKLLH 567 (976) Q Consensus 530 ~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~ 567 (976) . ...|..||.+|.++|.+|.+...+.+... T Consensus 147 -------~-~HRAl~DA~aTa~If~klLe~~~~~~~~~ 176 (195) T PRK07247 147 -------R-GHNSLEDARMTARIYESFLETDTNKAYLS 176 (195) T ss_pred -------C-CCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf -------8-90818999999999999999766433677 No 83 >smart00479 EXOIII exonuclease domain in DNA-polymerase alpha and epsilon chain, ribonuclease T and other exonucleases. Probab=98.67 E-value=4.6e-08 Score=77.43 Aligned_cols=150 Identities=17% Similarity=0.224 Sum_probs=99.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEEC-CC---C------CCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 52323322243311001244201025776654434550231-37---4------43334565001788999974444202 Q gi|254780277|r 389 SASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFI-DL---S------VHTSQDSTGKNTLTKEILSYLKKFFE 458 (976) Q Consensus 389 ~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yi-p~---~------~~~~~d~~~~~~~~~evl~~L~~lle 458 (976) .++||+||||+++..++|+.++....++.... .....|+ |. + +....+.........+++..|..|+. T Consensus 2 ~v~~D~ETTGl~~~~~~Iieigav~~~~~~~~--~~~~~~i~P~~~i~~~~~~i~GIt~~~l~~~~~~~~v~~~~~~~l~ 79 (169) T smart00479 2 LVVIDCETTGLDPGKDEIIEIAAVDVDGGRII--VVFDTYVKPDRPITDYATEIHGITPEMLDDAPTFEEVLEELLEFLK 79 (169) T ss_pred EEEEEEECCCCCCCCCEEEEEEEEEEECCEEE--EEEEEEECCCCCCCHHHHHHCCCCCHHCCCCCCHHHHHHHHHHHHC T ss_conf 99999852799999986999999999999887--8889887999999989986318472120458669999999998604 Q ss_pred CCCCHHHHCCCCHHHHHHHH----CCCCC--C-CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCH Q ss_conf 22100110032123788986----18866--6-42100234554316876677246677622444223356543222100 Q gi|254780277|r 459 NEHFLKIGHNIKYDKLVLHR----YGISM--R-GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPID 531 (976) Q Consensus 459 ~~~i~kIgHN~KfD~~~L~~----~GI~l--~-~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~ 531 (976) +.. ..++||++||..+|.+ +|... . .+.||+-.+....+.. +++|+.+++.|.. .. T Consensus 80 ~~~-~v~~~~~~fD~~~L~~~~~~~~~~~~~~~~~~dt~~~~~~~~~~~-~~~L~~l~~~~~i-~~-------------- 142 (169) T smart00479 80 GKI-LVAGNALNFDLRFLKLEHPRLGIKDPPKNPVIDTLKLARALNPGR-KYSLKKLAERLGL-EV-------------- 142 (169) T ss_pred CCC-CHHCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCC-CHHHHHHHHHCCC-CC-------------- T ss_conf 874-510367999999999999981899886652462999999975885-4589999998599-98-------------- Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 000012333322211111245677632221 Q gi|254780277|r 532 KISDSQVQEHAIENSNVILQLWLLLRPKLI 561 (976) Q Consensus 532 ~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~ 561 (976) +....-|..||..|.+++..+.+++. T Consensus 143 ----~~~~H~Al~Da~~t~~v~~~l~~~~~ 168 (169) T smart00479 143 ----IGRAHRALDDARATAKLFKKLVERLL 168 (169) T ss_pred ----CCCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf ----99992979999999999999997724 No 84 >cd06134 RNaseT RNase T is a DEDDh-type DnaQ-like 3'-5' exoribonuclease E implicated in the 3' maturation of small stable RNAs and 23srRNA, and in the end turnover of tRNA. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. RNase T is related to the proofreading domain of DNA polymerase III. Despite its important role, RNase T is mainly found only in gammaproteobacteria. It is speculated that it might have originated from DNA polymerase III at the time the gamma division of proteobacteria diverged from other bacteria. RNase T is a homodimer with the catalytic residues of one monomer contacting a large basic patch on the other monomer to form a functional active site. Probab=98.66 E-value=7.7e-08 Score=75.78 Aligned_cols=151 Identities=15% Similarity=0.145 Sum_probs=84.6 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCC--CCCCC-CCCCCCEEC-CC-----CCC------CCCCC-----CCCCCCHH Q ss_conf 8523233222433110012442010257--76654-434550231-37-----443------33456-----50017889 Q gi|254780277|r 388 GSASFKIITDTMDAFHSKPIAIAISTLD--QKDDT-SFKVETIFI-DL-----SVH------TSQDS-----TGKNTLTK 447 (976) Q Consensus 388 ~~ia~DtETt~ld~~~~~ivGisls~~~--~~~~~-~~~~~~~yi-p~-----~~~------~~~d~-----~~~~~~~~ 447 (976) ..+++|+||||++|..++|+-|+..... ..+.. .......++ |. +.. ...+. .......+ T Consensus 6 ~~VV~D~ETTGL~p~~d~IIEIgav~v~~d~~g~i~~~~~~~~~v~P~~~~~i~~~~~~itGIt~~~~~~~~~~~~~~~~ 85 (189) T cd06134 6 LPVVVDVETGGFNPQTDALLEIAAVTLEMDEQGNLYPDETFHFHILPFEGANLDPAALEFNGIDPFHPFRFAVDEKEALK 85 (189) T ss_pred CCEEEEEECCCCCCCCCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCHHHHHHCCCCCHHHHCCCCCHHHHHH T ss_conf 50799952899998898079999999997687853245168997789889989855665507770011001675778899 Q ss_pred HHHHHHHHHHCCC---CCHHHHCCCCHHHHHHH----HCCCCCC---C--CCCHHHHHHHHCCCCCCCCHHHHHHHHCCC Q ss_conf 9997444420222---10011003212378898----6188666---4--210023455431687667724667762244 Q gi|254780277|r 448 EILSYLKKFFENE---HFLKIGHNIKYDKLVLH----RYGISMR---G--FEDIMLMSYVLDSGRSSHDIASIAQKWLSY 515 (976) Q Consensus 448 evl~~L~~lle~~---~i~kIgHN~KfD~~~L~----~~GI~l~---~--i~DTmlaayLLdp~~~~~~L~~la~~yL~~ 515 (976) +++..+...+.+. .-..|+||+.||+.+|. +.|+... + +.|||-.+.++-+ .++|+.+|+++ ++ T Consensus 86 ~~~~~i~~~~~~~~~~~avlVaHNa~FD~~FL~~~~~r~g~~~~p~~~~~~~DTl~La~~~~~---~~~L~~l~~~~-gi 161 (189) T cd06134 86 EIFKPIRKALKAQGCTRAILVGHNAHFDLGFLNAAVARCKIKRNPFHPFSTFDTATLAGLAYG---QTVLAKACQAA-GI 161 (189) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHHC---CCCHHHHHHHC-CC T ss_conf 999999999875167555687517777299999999984998888877765669999867738---00199999987-99 Q ss_pred CCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 42233565432221000000123333222111112456776322 Q gi|254780277|r 516 TRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPK 559 (976) Q Consensus 516 ~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~ 559 (976) ..+ .....-|..||.+|.+|+.+|..+ T Consensus 162 ~~~-----------------~~~~HrAl~DA~~ta~lf~~l~~k 188 (189) T cd06134 162 EFD-----------------NKEAHSALYDTQKTAELFCKIVNR 188 (189) T ss_pred CCC-----------------CCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 999-----------------987887299999999999999964 No 85 >PRK06063 DNA polymerase III subunit epsilon; Provisional Probab=98.66 E-value=4.5e-08 Score=77.47 Aligned_cols=148 Identities=15% Similarity=0.203 Sum_probs=95.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCC----C---CCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 685232332224331100124420102577665443455023137----4---433345650017889999744442022 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDL----S---VHTSQDSTGKNTLTKEILSYLKKFFEN 459 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~----~---~~~~~d~~~~~~~~~evl~~L~~lle~ 459 (976) .+++++|+||||++|..++|+-|+....+..+...-...++.-|- . |....+.....-...+|+..|.+++.+ T Consensus 15 ~~fvvvD~ETTGl~p~~drIieig~v~~~~~g~i~~~~~tlvnP~~dp~~~~IhGIt~~~v~~aP~f~ev~~~l~~~l~g 94 (313) T PRK06063 15 QGWAVVDVETSGFRPGQARIISVAVLGLDADGNVEQSVVSLLNPGVDPGPTHVHGLTAAMLEGQPQFADIAGELAEVLRG 94 (313) T ss_pred CCEEEEEEECCCCCCCCCEEEEEEEEEEECCCEEEEEEEEEECCCCCCCCEEEECCCHHHHCCCCCHHHHHHHHHHHHCC T ss_conf 99799997689999989706999999998997888999997591999898434187999981799899999999999789 Q ss_pred CCCHHHHCCCCHHHHHHH----HCCCCCC--CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHH Q ss_conf 210011003212378898----6188666--4210023455431687667724667762244422335654322210000 Q gi|254780277|r 460 EHFLKIGHNIKYDKLVLH----RYGISMR--GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKI 533 (976) Q Consensus 460 ~~i~kIgHN~KfD~~~L~----~~GI~l~--~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i 533 (976) . ..|+||+.||+.||. ++|+... .+.||+-.+.-+.+...+|.|..||.+| |+... T Consensus 95 ~--vlVAHNa~FD~~FL~~e~~r~g~~~p~~~~lcTl~lARr~~~~l~~~kL~tla~~~-Gi~~~--------------- 156 (313) T PRK06063 95 R--TLVAHNVAFDYSFLAAEAELAGAELPVDQVMCTVELARRLDLGLPNLRLETLAAHW-GVVQQ--------------- 156 (313) T ss_pred C--EEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHC-CCCCC--------------- T ss_conf 9--79996856529999999997399889996676899886544899997478899980-99988--------------- Q ss_pred CCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00123333222111112456776 Q gi|254780277|r 534 SDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 534 ~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) -..-|..||.++-.+.... T Consensus 157 ----~~H~Al~DAra~a~il~~~ 175 (313) T PRK06063 157 ----RPHDALDDARVLAGILAPA 175 (313) T ss_pred ----CCCCCHHHHHHHHHHHHHH T ss_conf ----7655067789999998999 No 86 >PRK09146 DNA polymerase III subunit epsilon; Validated Probab=98.65 E-value=9.6e-08 Score=75.06 Aligned_cols=153 Identities=11% Similarity=0.094 Sum_probs=90.7 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCC----------CCCCCCCCCCCCHHHHHHHH Q ss_conf 98668523233222433110012442010257766544345502313744----------33345650017889999744 Q gi|254780277|r 384 LETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSV----------HTSQDSTGKNTLTKEILSYL 453 (976) Q Consensus 384 L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~----------~~~~d~~~~~~~~~evl~~L 453 (976) +....++++|+|||||||..++|+.|+.....+.....-....+++--.. ..............+++..| T Consensus 44 l~d~~fvalD~ETTGLdp~~D~IiSIGaV~i~~~~I~l~~a~~~lV~p~~~i~~~s~~IHGIt~~~l~~ap~l~evl~~l 123 (239) T PRK09146 44 LSEVPFVALDFETTGLDPEQDAIVSIGLVPFTLQRIRCRQARHWVVKPRRPLNEESVVIHGITHSDLQDAPDLERILDEL 123 (239) T ss_pred CCCCCEEEEECCCCCCCCCCCCEEEEEEEEEECCEEEECCEEEEEECCCCCCCCCCEEECCCCHHHHHCCCCHHHHHHHH T ss_conf 21087899964157888788854899899987888865341589988999787645110696989985699999999999 Q ss_pred HHHHCCCCCHHHHCCCCHHHHHHHH-----CCCCCCC-CCCHHHHHHHHCCC-------------CCCCCHHHHHHHHCC Q ss_conf 4420222100110032123788986-----1886664-21002345543168-------------766772466776224 Q gi|254780277|r 454 KKFFENEHFLKIGHNIKYDKLVLHR-----YGISMRG-FEDIMLMSYVLDSG-------------RSSHDIASIAQKWLS 514 (976) Q Consensus 454 ~~lle~~~i~kIgHN~KfD~~~L~~-----~GI~l~~-i~DTmlaayLLdp~-------------~~~~~L~~la~~yL~ 514 (976) -+++.+. ..|+||+.||+.||.+ +|..+.. +.|||-.+.-+... ..++.|.++..||-= T Consensus 124 l~~l~g~--VLVaH~a~~D~~FL~~a~~~~~g~~l~~P~IDTl~Le~~~~~~~~~~~~~r~~~~~~~slrL~~~r~rYgL 201 (239) T PRK09146 124 LEALAGK--VVVVHYRRIERDFLDQALRNRIGEGIEFPVIDTMEIEARIQRKQAGGLWNRLKGKKPESIRLADSRLRYGL 201 (239) T ss_pred HHHHCCC--EEEEECCHHHHHHHHHHHHHHHCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCC T ss_conf 9981899--89996839999999999999628898874644899999998854200444313589867638999998198 Q ss_pred CCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 44223356543222100000012333322211111245677632 Q gi|254780277|r 515 YTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRP 558 (976) Q Consensus 515 ~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~ 558 (976) |. -...-|..||..|..|+..+.. T Consensus 202 -------------------P~-y~aH~AL~DAlATAELflAQi~ 225 (239) T PRK09146 202 -------------------PA-YPPHHALTDAIATAELLQAQIA 225 (239) T ss_pred -------------------CC-CCCCCCHHHHHHHHHHHHHHHH T ss_conf -------------------98-8876719999999999999999 No 87 >cd06127 DEDDh DEDDh exonucleases, part of the DnaQ-like (or DEDD) exonuclease superfamily, catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. These proteins contain four invariant acidic residues in three conserved sequence motifs termed ExoI, ExoII and ExoIII. DEDDy exonucleases are classified as such because of the presence of specific Hx(4)D conserved pattern at the ExoIII motif. The four conserved acidic residues are clustered around the active site and serve as ligands for the two metal ions required for catalysis. Most DEDDh exonucleases are the proofreading subunits (epsilon) or domains of bacterial DNA polymerase III, the main replicating enzyme in bacteria, which functions as the chromosomal replicase. Other members include other DNA and RNA exonucleases such as RNase T, Oligoribonuclease, and RNA exonuclease (REX), among others. Probab=98.63 E-value=1.2e-07 Score=74.35 Aligned_cols=142 Identities=15% Similarity=0.152 Sum_probs=88.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEEC-CCC---------CCCCCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 2323322243311001244201025776654434550231-374---------433345650017889999744442022 Q gi|254780277|r 390 ASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFI-DLS---------VHTSQDSTGKNTLTKEILSYLKKFFEN 459 (976) Q Consensus 390 ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yi-p~~---------~~~~~d~~~~~~~~~evl~~L~~lle~ 459 (976) +.+|+||||++|..++|+-++....+......- ....++ |.. +....+.........++...+..++.+ T Consensus 1 v~~D~ETTGl~~~~~~Iieia~v~~~~~~~~~~-~~~~~i~P~~~~~~~~~~i~gi~~~~~~~~~~~~~~~~~~~~~l~~ 79 (159) T cd06127 1 VVLDTETTGLDPKGDRIIEIGAVKVDGGIEIVE-RFETLVNPGRPIPPEATAIHGITDEMLADAPPFEEVLPEFLEFLGG 79 (159) T ss_pred CEEEEECCCCCCCCCCEEEEEEEEEECCEEEEE-EEEEEECCCCCCCHHHHHHHCCCHHHHCCCCCHHHHHHHHHHHCCC T ss_conf 989826799999998469999999999956520-0799988999999889876067789958999689999999873156 Q ss_pred CCCHHHHCCCCHHHHHHHH----CCCCCC--CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHH Q ss_conf 2100110032123788986----188666--4210023455431687667724667762244422335654322210000 Q gi|254780277|r 460 EHFLKIGHNIKYDKLVLHR----YGISMR--GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKI 533 (976) Q Consensus 460 ~~i~kIgHN~KfD~~~L~~----~GI~l~--~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i 533 (976) . ..||||+.||..+|.+ +|.... .+.|||..+.-+.+....+.+..+...+++.... T Consensus 80 ~--~lv~hN~~FD~~~L~~~~~~~~~~~~~~~~iDt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------- 142 (159) T cd06127 80 R--VLVAHNASFDLRFLNRELRRLGGPPLPNPWIDTLRLARRLLPGLRSHRLGLLLAARYGIPLE--------------- 142 (159) T ss_pred C--EEEECCHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHCCCCCCCCHHHHHHHHCCCCCC--------------- T ss_conf 3--46515888999999999998399988996664799999973888899999999997698879--------------- Q ss_pred CCHHHHHHHHHHHHHHHHHH Q ss_conf 00123333222111112456 Q gi|254780277|r 534 SDSQVQEHAIENSNVILQLW 553 (976) Q Consensus 534 ~~e~~~~Yaa~Da~~t~~L~ 553 (976) -..-|..||.+|.+++ T Consensus 143 ----~aH~Al~D~~at~~l~ 158 (159) T cd06127 143 ----GAHRALADALATAELL 158 (159) T ss_pred ----CCCCHHHHHHHHHHHH T ss_conf ----8959899999999996 No 88 >cd06130 DNA_pol_III_epsilon_like This subfamily is composed of uncharacterized bacterial proteins with similarity to the epsilon subunit of DNA polymerase III (polIII), a multisubunit polymerase which is the main DNA replicating enzyme in bacteria, functioning as the chromosomal replicase. The pol III holoenzyme is a complex of ten different subunits, three of which (alpha, epsilon, and theta) compose the catalytic core. The polIII epsilon subunit, encoded by the dnaQ gene, is a DEDDh-type 3'-5' exonuclease which is responsible for the proofreading activity of the polymerase, increasing the fidelity of DNA synthesis. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The epsilon subunit of polIII also functions as a stabilizer of the holoenzyme complex Probab=98.60 E-value=8.6e-08 Score=75.43 Aligned_cols=138 Identities=13% Similarity=0.211 Sum_probs=88.7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEEC-C---CC------CCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 52323322243311001244201025776654434550231-3---74------43334565001788999974444202 Q gi|254780277|r 389 SASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFI-D---LS------VHTSQDSTGKNTLTKEILSYLKKFFE 458 (976) Q Consensus 389 ~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yi-p---~~------~~~~~d~~~~~~~~~evl~~L~~lle 458 (976) ++++|+|||+. ..++++-|+.....+..... ....++ | ++ |....+.....-...+++..|.+++. T Consensus 1 fvv~D~ETT~~--~~d~IieIgav~i~~g~i~~--~f~~lv~P~~~i~~~~~~ihGIt~~~v~~~p~~~~v~~~l~~~l~ 76 (156) T cd06130 1 FVAIDFETANA--DRASACSIGLVKVRDGQIVD--TFYTLIRPPTRFDPFNIAIHGITPEDVADAPTFPEVWPEIKPFLG 76 (156) T ss_pred CEEEEEECCCC--CCCEEEEEEEEEEECCEEEE--EEEEEECCCCCCCHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHCC T ss_conf 98999778989--99969999999999999989--999987789879988932208887998409999999999998558 Q ss_pred CCCCHHHHCCCCHHHHHHHH----CCCCCCC--CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHH Q ss_conf 22100110032123788986----1886664--21002345543168766772466776224442233565432221000 Q gi|254780277|r 459 NEHFLKIGHNIKYDKLVLHR----YGISMRG--FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDK 532 (976) Q Consensus 459 ~~~i~kIgHN~KfD~~~L~~----~GI~l~~--i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~ 532 (976) +. ..||||+.||+.+|.+ +|+.... +.|||-.+..+.|...+|+|+.++++| |... T Consensus 77 ~~--~lVaHn~~FD~~fL~~~~~~~~~~~~~~~~iDtl~l~r~~~~~~~~~~L~~l~~~~-gi~~--------------- 138 (156) T cd06130 77 GS--LVVAHNASFDRSVLRAALEAYGLPPPPYQYLCTVRLARRVWPLLPNHKLNTVAEHL-GIEL--------------- 138 (156) T ss_pred CC--EEEEECHHHHHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHHC-CCCC--------------- T ss_conf 98--89995778899999999998199998985881899999973888889999999985-9998--------------- Q ss_pred HCCHHHHHHHHHHHHHHHHHH Q ss_conf 000123333222111112456 Q gi|254780277|r 533 ISDSQVQEHAIENSNVILQLW 553 (976) Q Consensus 533 i~~e~~~~Yaa~Da~~t~~L~ 553 (976) + ..-|..||.+|.+|+ T Consensus 139 ---~--~H~Al~DA~ata~l~ 154 (156) T cd06130 139 ---N--HHDALEDARACAEIL 154 (156) T ss_pred ---C--CCCCHHHHHHHHHHH T ss_conf ---7--879689999999998 No 89 >cd06136 TREX1_2 Three prime repair exonuclease (TREX) 1 and 2 are closely related DEDDh-type DnaQ-like 3'-5' exonucleases. They contain three conserved sequence motifs known as ExoI, II, and III, with a specific Hx(4)D conserved pattern at ExoIII. These motifs contain four conserved acidic residues that participate in coordination of divalent metal ions required for catalysis. Both proteins play a role in the metabolism and clearance of DNA. TREX1 is the major 3'-5' exonuclease activity detected in mammalian cells. Mutations in the human TREX1 gene can cause Aicardi-Goutieres syndrome (AGS), which is characterized by perturbed innate immunity and presents itself as a severe neurological disease. TREX1 degrades ssDNA generated by aberrant replication intermediates to prevent checkpoint activation and autoimmune disease. There are distinct structural differences between TREX1 and 2 that point to different biological roles for these proteins. The main difference is the presence of about 7 Probab=98.50 E-value=2.1e-07 Score=72.58 Aligned_cols=142 Identities=15% Similarity=0.195 Sum_probs=80.7 Q ss_pred CCCCCCCCCCCCCCC-CCCCCCEEECCCCCCCCCCC---------CC---CEE---CCCCC------CCCCCCC-CCCCC Q ss_conf 523233222433110-01244201025776654434---------55---023---13744------3334565-00178 Q gi|254780277|r 389 SASFKIITDTMDAFH-SKPIAIAISTLDQKDDTSFK---------VE---TIF---IDLSV------HTSQDST-GKNTL 445 (976) Q Consensus 389 ~ia~DtETt~ld~~~-~~ivGisls~~~~~~~~~~~---------~~---~~y---ip~~~------~~~~d~~-~~~~~ 445 (976) +++||+|||||++.. ++|+-|+.....+....... .+ ... .|++. ....+.. ..... T Consensus 1 fvv~D~ETTGL~~~~~~rIiEI~av~v~~~~~~~~~~~~~~~~~i~d~~~~lvnP~~~Ip~~it~itGIt~emv~~~~~~ 80 (177) T cd06136 1 FVFLDLETTGLPKHNRPEITELCLVAVHRDHLLNTSRDKPALPRVLDKLSLCFNPGRAISPGASEITGLSNDLLEHKAPF 80 (177) T ss_pred CEEEEEECCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCHHHHHHCCCCHHHHHCCCCH T ss_conf 98998678997988998558999999977874443111245662142256764879938876887518899998159981 Q ss_pred HHHHHHHHHHHHCC--CCCHHHHCCC-CHHHHHHH----HCCCCCC-C--CCCHHHHHHHHCCCCCCCCHHHHHHHHCCC Q ss_conf 89999744442022--2100110032-12378898----6188666-4--210023455431687667724667762244 Q gi|254780277|r 446 TKEILSYLKKFFEN--EHFLKIGHNI-KYDKLVLH----RYGISMR-G--FEDIMLMSYVLDSGRSSHDIASIAQKWLSY 515 (976) Q Consensus 446 ~~evl~~L~~lle~--~~i~kIgHN~-KfD~~~L~----~~GI~l~-~--i~DTmlaayLLdp~~~~~~L~~la~~yL~~ 515 (976) ..++...+..++.. .....|+||+ +||+.+|. ++|+.+. . +.||+-+++-+ .++|.+++++|++. T Consensus 81 ~~~~~~~~~~F~~~~~~~~~LVAHNa~~FD~~fL~~e~~r~g~~~p~~~~~iDtL~l~r~~-----~~~L~~l~~~~~gi 155 (177) T cd06136 81 DSDTANLIKLFLRRQPKPICLVAHNGNRFDFPILRSELERLGTKLPDDILCVDSLPAFREL-----DQSLGSLYKRLFGQ 155 (177) T ss_pred HHHHHHHHHHHHHHCCCCCEEEECCCHHCCHHHHHHHHHHCCCCCCCCCEEEEEHHHHHHH-----CCCHHHHHHHHCCC T ss_conf 7999999999996267786799648512379999999998699899898899860764550-----69999999997599 Q ss_pred CCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 422335654322210000001233332221111124567 Q gi|254780277|r 516 TRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWL 554 (976) Q Consensus 516 ~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~ 554 (976) ... ...-|..||.+|.+++- T Consensus 156 ~~~-------------------~aHRAl~Da~at~~vfl 175 (177) T cd06136 156 EPK-------------------NSHTAEGDVLALLKCAL 175 (177) T ss_pred CCC-------------------CCCCCHHHHHHHHHHHC T ss_conf 998-------------------77784899999999975 No 90 >pfam00929 Exonuc_X-T Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III.; Probab=98.45 E-value=3.9e-07 Score=70.63 Aligned_cols=144 Identities=14% Similarity=0.176 Sum_probs=89.5 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEEC-C---CC------CCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 52323322243311001244201025776654434550231-3---74------43334565001788999974444202 Q gi|254780277|r 389 SASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFI-D---LS------VHTSQDSTGKNTLTKEILSYLKKFFE 458 (976) Q Consensus 389 ~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yi-p---~~------~~~~~d~~~~~~~~~evl~~L~~lle 458 (976) .+++|+||||+++..++|+-|+....++....... ...|+ | ++ +....+.........+++..+..|+. T Consensus 1 ~v~~D~ETTGl~~~~~~Iieigav~~~~~~~~~~~-~~~~v~P~~~i~~~~~~i~GIt~~~l~~~~~~~~~~~~~~~~l~ 79 (162) T pfam00929 1 LVVIDCETTGLDPEKDRIIEIAAVSIVGGENIGPV-FDTYVKPERLITDEATKFHGITPEMLRNAPSFEEVLEAFLEFLK 79 (162) T ss_pred CEEEEEECCCCCCCCCEEEEEEEEEEECCEEEEEE-EEEEECCCCCCCHHHHHHCCCCHHHHHHCCCHHHHHHHHHHHHC T ss_conf 98999868999899975899999999999776555-67877788888999998739787775415311779999999842 Q ss_pred CCCCHHHHCCCCHHHHHHHH----CCCCC----CCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 22100110032123788986----18866----64210023455431687667724667762244422335654322210 Q gi|254780277|r 459 NEHFLKIGHNIKYDKLVLHR----YGISM----RGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPI 530 (976) Q Consensus 459 ~~~i~kIgHN~KfD~~~L~~----~GI~l----~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~ 530 (976) +..+ .++||+.||..++.+ ++... ..+.||+..+..+-++..+++|+.+++.|..... T Consensus 80 ~~~~-~v~~n~~fD~~~l~~~~~~~~~~~~~~~~~~~dt~~~~~~~~~~~~~~~L~~l~~~~~i~~~------------- 145 (162) T pfam00929 80 KLKI-LVGHNASFDVGFLLYDDLRFLKLPHPKLNDVIDTLILDKATYKGFKRRSLDALAEKLGLEKI------------- 145 (162) T ss_pred CCCE-EECCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCC------------- T ss_conf 5857-77788778999999999985863366333304489999997504467998999988599998------------- Q ss_pred HHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 00000123333222111112456 Q gi|254780277|r 531 DKISDSQVQEHAIENSNVILQLW 553 (976) Q Consensus 531 ~~i~~e~~~~Yaa~Da~~t~~L~ 553 (976) .-..-|..||..|.+|| T Consensus 146 ------~~aH~Al~Da~~t~~lf 162 (162) T pfam00929 146 ------QRAHRALDDARATAELF 162 (162) T ss_pred ------CCCCCHHHHHHHHHHHC T ss_conf ------88678299999999879 No 91 >cd06125 DnaQ_like_exo The DnaQ-like exonuclease superfamily is a structurally conserved group of 3'-5' exonucleases, which catalyze the excision of nucleoside monophosphates at the DNA or RNA termini in the 3'-5' direction. It is also called the DEDD superfamily, after the four invariant acidic residues present in the catalytic site of its members. The superfamily consists of DNA- and RNA-processing enzymes such as the proofreading domains of DNA polymerases, other DNA exonucleases, RNase D, RNase T, Oligoribonuclease and RNA exonucleases (REX). The DnaQ-like exonuclease domain contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, which are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The conservation patterns of the three motifs may vary among different subfamilies. DnaQ-like exonucleases are classified as DEDDy or DEDDh exonucleases depending on the variation of motif Probab=98.36 E-value=1.8e-07 Score=73.04 Aligned_cols=104 Identities=22% Similarity=0.324 Sum_probs=63.4 Q ss_pred CCCCCCCCCCCC-CCCCCCCCEEECCCCCCCCCCCCCC-EECCCC----CCCCCCCCCCCCCHHHHHHHHHHHHCCCCCH Q ss_conf 232332224331-1001244201025776654434550-231374----4333456500178899997444420222100 Q gi|254780277|r 390 ASFKIITDTMDA-FHSKPIAIAISTLDQKDDTSFKVET-IFIDLS----VHTSQDSTGKNTLTKEILSYLKKFFENEHFL 463 (976) Q Consensus 390 ia~DtETt~ld~-~~~~ivGisls~~~~~~~~~~~~~~-~yip~~----~~~~~d~~~~~~~~~evl~~L~~lle~~~i~ 463 (976) ++||+||||+++ ..++|+.|+................ ..++.. +....+.........+++..+.+++++.+.. T Consensus 1 Vv~D~ETTGl~~~~~d~Iiei~~v~~~~~~~~~~~v~p~~~i~~~~~~i~gi~~~~v~~~p~~~~~~~~~~~f~~~~~~i 80 (130) T cd06125 1 VAFDIETTGLNGNARDEIIEIALVVVHIGEKFVFDVKPDDLIDPYAFTISGITDEVVERAATEAELLIELLAFLRDKDDI 80 (130) T ss_pred CEEEEECCCCCCCCCCEEEEEEEEEEECCCEEEEEECCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 98980189969999985899999997189889999799998997794103759999833060899999999997699999 Q ss_pred HHHCCCCHHHHHHH----HCCCCCC----CCCCHHHHH Q ss_conf 11003212378898----6188666----421002345 Q gi|254780277|r 464 KIGHNIKYDKLVLH----RYGISMR----GFEDIMLMS 493 (976) Q Consensus 464 kIgHN~KfD~~~L~----~~GI~l~----~i~DTmlaa 493 (976) +||||++||+.+|. ++|++.. .+.||+..| T Consensus 81 lVgHN~~FD~~fL~~~~~r~g~~~~~~~~~~iDTl~la 118 (130) T cd06125 81 LVGHNISFDLPYLLNRAAELGIPYPILGRIVFDTLALA 118 (130) T ss_pred EEEECHHHHHHHHHHHHHHCCCCCCCCCCCEEECHHHH T ss_conf 99919477499999999984999987899686860984 No 92 >cd06144 REX4_like This subfamily is composed of RNA exonuclease 4 (REX4 or Rex4p), XPMC2, Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20), and similar proteins. REX4 is involved in pre-rRNA processing. It controls the ratio between the two forms of 5.8S rRNA in yeast. XPMC2 is a Xenopus gene which was identified through its ability to correct a mitotic defect in fission yeast. The human homolog of XPMC2 (hPMC2) may be involved in angiotensin II-induced adrenal cell cycle progression and cell proliferation. ISG20 is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. These proteins are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required fo Probab=98.25 E-value=1.3e-06 Score=66.71 Aligned_cols=138 Identities=13% Similarity=0.199 Sum_probs=80.6 Q ss_pred CCCCCCCCCCCCCCCC--CCCCEEECCCCCCCCCCCCCCEECC---CC------CCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 2323322243311001--2442010257766544345502313---74------43334565001788999974444202 Q gi|254780277|r 390 ASFKIITDTMDAFHSK--PIAIAISTLDQKDDTSFKVETIFID---LS------VHTSQDSTGKNTLTKEILSYLKKFFE 458 (976) Q Consensus 390 ia~DtETt~ld~~~~~--ivGisls~~~~~~~~~~~~~~~yip---~~------~~~~~d~~~~~~~~~evl~~L~~lle 458 (976) +|+|+|++|..+.... +.-+++-...+. .-+ +.+.-| +. +....+.-.......++...+..++. T Consensus 1 valDcEm~g~~~~~~~~~larv~vvd~~g~--~v~--d~~V~P~~~V~d~~T~~sGIt~~~l~~a~~~~~v~~~~~~~l~ 76 (152) T cd06144 1 VALDCEMVGVGPDGSESALARVSIVNEDGN--VVY--DTYVKPQEPVTDYRTAVSGIRPEHLKDAPDFEEVQKKVAELLK 76 (152) T ss_pred CEEEEEECCCCCCCCEEEEEEEEEEECCCC--EEE--EEEECCCCCCCCCCEEECCCCHHHHCCCCCHHHHHHHHHHHHC T ss_conf 989876033528998789999999967899--998--8723699767876356248579997389999999999999853 Q ss_pred CCCCHHHHCCCCHHHHHHHHCCCCC--CCCCCHHHHHHHHCC-CCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCC Q ss_conf 2210011003212378898618866--642100234554316-8766772466776224442233565432221000000 Q gi|254780277|r 459 NEHFLKIGHNIKYDKLVLHRYGISM--RGFEDIMLMSYVLDS-GRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKISD 535 (976) Q Consensus 459 ~~~i~kIgHN~KfD~~~L~~~GI~l--~~i~DTmlaayLLdp-~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~ 535 (976) +. ..||||+.||+.+|. +.. ..+.||.....+... ...+++|+.|+++|||....+ + T Consensus 77 ~~--ilVGH~l~~Dl~~L~---l~~~~~~i~DT~~~~~~~~~~~~~~~sLk~L~~~~Lg~~Iq~----~----------- 136 (152) T cd06144 77 GR--ILVGHALKNDLKVLK---LDHPKKLIRDTSKYKPLRKTAKGKSPSLKKLAKQLLGLDIQE----G----------- 136 (152) T ss_pred CC--EEEECCHHHHHHHHH---CCCCCCEEEEEEEEHHHHHCCCCCCCCHHHHHHHHCCCCCCC----C----------- T ss_conf 88--798606464699962---438876068855642245416889927999999985885899----7----------- Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 1233332221111124567 Q gi|254780277|r 536 SQVQEHAIENSNVILQLWL 554 (976) Q Consensus 536 e~~~~Yaa~Da~~t~~L~~ 554 (976) ..-|.+||.+|.+||+ T Consensus 137 ---~H~s~eDA~at~~Lfk 152 (152) T cd06144 137 ---EHSSVEDARAAMRLYR 152 (152) T ss_pred ---CCCCHHHHHHHHHHHC T ss_conf ---9696999999999859 No 93 >PRK09182 DNA polymerase III subunit epsilon; Validated Probab=98.18 E-value=9.4e-06 Score=60.54 Aligned_cols=172 Identities=13% Similarity=0.186 Sum_probs=83.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECC---CCCCCCCC-CCCCEEC----CCCCCC-------CCCCCCCCCCHHHHHHHH Q ss_conf 52323322243311001244201025---77665443-4550231----374433-------345650017889999744 Q gi|254780277|r 389 SASFKIITDTMDAFHSKPIAIAISTL---DQKDDTSF-KVETIFI----DLSVHT-------SQDSTGKNTLTKEILSYL 453 (976) Q Consensus 389 ~ia~DtETt~ld~~~~~ivGisls~~---~~~~~~~~-~~~~~yi----p~~~~~-------~~d~~~~~~~~~evl~~L 453 (976) -+.+|+||||+|+..++|+-+++..- ..+..... ..-..+. |++... ...-.+..++...| T Consensus 39 g~~lD~ETTGl~~~~d~IIElg~v~f~~~~~G~i~~i~~~~~~~~dP~~pIp~~it~lTGItd~mV~Gq~id~~~V---- 114 (293) T PRK09182 39 GVILDTETTGLDPRKDEIIEIGMVAFEYDDDGRIGDVVDTFGGLQQPSRPIPPEITRLTGITDEMVAGQTIDPAEV---- 114 (293) T ss_pred EEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEECCEECCCCCCCCCCHHHHHHCCCCHHHHCCCCCCHHHH---- T ss_conf 9999530578897787799976899998699858776432311038999899879875188889968882799999---- Q ss_pred HHHHCCCCCHHHHCCCCHHHHHHHHCCCCC--CCCCCHHHH-HHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 442022210011003212378898618866--642100234-55431687667724667762244422335654322210 Q gi|254780277|r 454 KKFFENEHFLKIGHNIKYDKLVLHRYGISM--RGFEDIMLM-SYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPI 530 (976) Q Consensus 454 ~~lle~~~i~kIgHN~KfD~~~L~~~GI~l--~~i~DTmla-ayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~ 530 (976) ..++++.+ ..|+||+.||..++.++--.+ ..|.++|-- .|.- -+-.+..|+.|+..+. +-.+ T Consensus 115 ~~~~~~ad-liiAHNA~FDR~F~E~~~p~f~~k~WaCS~~~IdW~~-~Gf~s~kLeyL~~~~G-~F~~------------ 179 (293) T PRK09182 115 DALIAPAD-LIIAHNAGFDRPFLERFSPVFANKPWACSVSEIDWSA-RGFEGTKLGYLAGQAG-FFHD------------ 179 (293) T ss_pred HHHHCCCC-EEEECCCCCCHHHHHHHCCCCCCCCEEEECCCCCHHH-CCCCCCCHHHHHHHCC-CCCC------------ T ss_conf 99864498-9998177546778985095325875053163797032-2877415999998609-4545------------ Q ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCC Q ss_conf 0000012333322211111245677632221000134578860750123322101431005 Q gi|254780277|r 531 DKISDSQVQEHAIENSNVILQLWLLLRPKLIVEKLLHVYERLDKPMIDVVSQMEITGIQID 591 (976) Q Consensus 531 ~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD 591 (976) ...|..||.+++.|-..-.+.-...-+..|+..-..|-.+ +...|-+.| T Consensus 180 --------aHRA~~Dc~All~LL~~~l~~s~~~~l~~Ll~~a~~~~~r----I~A~~aPfd 228 (293) T PRK09182 180 --------GHRAVDDCQALLELLARPLPETGQPPLAELLEASRRPRMR----IWAENSPFE 228 (293) T ss_pred --------CCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCEEE----EEEECCCCH T ss_conf --------6534889999999976778644554999999972788499----997079813 No 94 >cd06138 ExoI_N This subfamily is composed of the N-terminal domain of E. coli exonuclease I (ExoI) and similar proteins. ExoI is a monomeric enzyme that hydrolyzes single stranded DNA in the 3' to 5' direction. It plays a role in DNA recombination and repair. It primarily functions in repairing frameshift mutations. The N-terminal domain of ExoI is a DEDDh-type DnaQ-like 3'-5 exonuclease containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The ExoI structure is unique among DnaQ family enzymes in that there is a large distance between the two metal ions required for catalysis and the catalytic histidine is oriented away from the active site. Probab=98.16 E-value=4.9e-06 Score=62.59 Aligned_cols=144 Identities=14% Similarity=0.140 Sum_probs=82.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCC------CCEECCCC-----CCCC-CCCCCCCCCHHHHHHHHHHHH Q ss_conf 23233222433110012442010257766544345------50231374-----4333-456500178899997444420 Q gi|254780277|r 390 ASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKV------ETIFIDLS-----VHTS-QDSTGKNTLTKEILSYLKKFF 457 (976) Q Consensus 390 ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~------~~~yip~~-----~~~~-~d~~~~~~~~~evl~~L~~ll 457 (976) +.||+||||+++..++|+-++.-..+......-.. ....+|-. +... .+.........+++..+..+| T Consensus 1 i~fD~ETTGl~~~~d~Iiqiaai~~d~~~~~~~~~~~~i~p~~~~~p~p~a~~v~git~~~~~~~~~s~~e~~~~i~~~~ 80 (183) T cd06138 1 LFYDYETFGLNPSFDQILQFAAIRTDENFNEIEPFNIFCRLPPDVLPSPEALIVTGITPQQLLKEGLSEYEFIAKIHRLF 80 (183) T ss_pred CEEEECCCCCCCCCCCEEEEEEEEECCCCCCCCEEEEEECCCCCCCCCHHHHHHCCCCHHHHHHCCCCHHHHHHHHHHHH T ss_conf 98970389979998760699999998998786446887404899998847998718899997531887789999999986 Q ss_pred CCCCCHHHHCC-CCHHHHHHH----HCCCCCC--------CCCC---HHHHHHHHCCC----------CCCCCHHHHHHH Q ss_conf 22210011003-212378898----6188666--------4210---02345543168----------766772466776 Q gi|254780277|r 458 ENEHFLKIGHN-IKYDKLVLH----RYGISMR--------GFED---IMLMSYVLDSG----------RSSHDIASIAQK 511 (976) Q Consensus 458 e~~~i~kIgHN-~KfD~~~L~----~~GI~l~--------~i~D---TmlaayLLdp~----------~~~~~L~~la~~ 511 (976) .......|||| +.||..+|. ++|.... .-+| +.-+++.+.|+ ..++.|++||+. T Consensus 81 ~~~~~i~vg~N~~~FD~~fL~~~~~r~~~~p~~~~~~~~~~~~D~l~~~r~~~~~~p~~i~~~~~~~g~~s~kL~~l~~~ 160 (183) T cd06138 81 NTPGTCIVGYNNIRFDDEFLRFAFYRNLYDPYTWEWKNGNSRWDLLDVVRAYYALRPDGIVWPKNDDGKPSFKLEDLAQA 160 (183) T ss_pred CCCCCEEEEECCHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 16997799845355689999999998278715667527871230788999999978201447655689741129999998 Q ss_pred HCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 224442233565432221000000123333222111112456 Q gi|254780277|r 512 WLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLW 553 (976) Q Consensus 512 yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~ 553 (976) + |...+ -+.-|..||.+|.+|. T Consensus 161 ~-gi~~~-------------------~aH~Al~Dv~aT~~l~ 182 (183) T cd06138 161 N-GIEHS-------------------NAHDALSDVEATIALA 182 (183) T ss_pred C-CCCCC-------------------CCCCCHHHHHHHHHHH T ss_conf 4-99998-------------------8868289999999986 No 95 >PRK07983 exodeoxyribonuclease X; Provisional Probab=98.16 E-value=6.1e-06 Score=61.94 Aligned_cols=85 Identities=13% Similarity=0.158 Sum_probs=54.6 Q ss_pred HHHHHCCCCCHHHHCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHH Q ss_conf 44420222100110032123788986188666421002345543168766772466776224442233565432221000 Q gi|254780277|r 453 LKKFFENEHFLKIGHNIKYDKLVLHRYGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDK 532 (976) Q Consensus 453 L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~ 532 (976) +..++.+. ..|+||+.||+.+|...+.. +.||+-.|..+-|+. +|++.+|... ++...+.- . T Consensus 68 ~~~~~~~~--~lVaHNa~FD~~~L~~~~~~---~IcTl~lAR~l~p~~-~~~l~~Lry~-~~~~~~~~-----~------ 129 (219) T PRK07983 68 IPHYYGSE--WYVAHNASFDRRVLPEMPGE---WICTMKLARRLWPGI-KYSNMALYKS-RKLNVQTP-----P------ 129 (219) T ss_pred HHHHCCCC--EEEECCCCCCHHHHCCCCCC---EEEHHHHHHHHCCCC-CHHHHHHHHH-HCCCCCCC-----C------ T ss_conf 99862898--78753651038664155875---665789999876774-2418999998-42761332-----4------ Q ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00012333322211111245677632 Q gi|254780277|r 533 ISDSQVQEHAIENSNVILQLWLLLRP 558 (976) Q Consensus 533 i~~e~~~~Yaa~Da~~t~~L~~~l~~ 558 (976) ..-+.-|..||.+|..|...+.+ T Consensus 130 ---~~~AHrAl~Dv~v~~~ll~~l~~ 152 (219) T PRK07983 130 ---GLHHHRALYDCYITAALLIDIMR 152 (219) T ss_pred ---CCCCCCHHHHHHHHHHHHHHHHH T ss_conf ---43334307879999999999998 No 96 >COG0847 DnaQ DNA polymerase III, epsilon subunit and related 3'-5' exonucleases [DNA replication, recombination, and repair] Probab=98.06 E-value=3e-05 Score=56.85 Aligned_cols=153 Identities=18% Similarity=0.218 Sum_probs=97.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCC----------CCCCCCCCCCCHHHHHHHHHHH Q ss_conf 685232332224331100124420102577665443455023137443----------3345650017889999744442 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVH----------TSQDSTGKNTLTKEILSYLKKF 456 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~----------~~~d~~~~~~~~~evl~~L~~l 456 (976) ...+++|+||||+++..+.++-|+................+.-| ... ...+.-..+-...++...+..+ T Consensus 13 ~~~vv~D~ETtg~~~~~~~iieIgav~~~~~~i~~~~~~~~v~P-~~~i~~~~~~i~git~e~l~~~p~~~~v~~~~~~~ 91 (243) T COG0847 13 TRFVVIDLETTGLNPKKDRIIEIGAVTLEDGRIVERSFHTLVNP-ERPIPPEIFKIHGITDEMLADAPKFAEVLPEFLDF 91 (243) T ss_pred CCEEEEECCCCCCCCCCCCEEEEEEEEEECCCCCCCCCCEEECC-CCCCCCCCCCCCCCCHHHHHCCCCHHHHHHHHHHH T ss_conf 75799966148988778854899659967784024311146688-98787543114773889974786389999999998 Q ss_pred HCCCCCHHHHCCCCHHHHHHHH----CCCCCCC--CCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 0222100110032123788986----1886664--210023455431687667724667762244422335654322210 Q gi|254780277|r 457 FENEHFLKIGHNIKYDKLVLHR----YGISMRG--FEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPI 530 (976) Q Consensus 457 le~~~i~kIgHN~KfD~~~L~~----~GI~l~~--i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~ 530 (976) +.+. ...|+||+.||+.++.. +++.+.+ +.||+-.+.-..|+...++|+.++.+ +++... . T Consensus 92 i~~~-~~~Vahna~fD~~fl~~~~~~~~~~~~~~~~~~t~~~~r~~~~~~~~~~L~~l~~~-~gi~~~-~---------- 158 (243) T COG0847 92 IGGL-RLLVAHNAAFDVGFLRVESERLGIEIPGDPVLDTLALARRHFPGFDRSSLDALAER-LGIDRN-P---------- 158 (243) T ss_pred HCCC-CEEEEEEHHHCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHH-CCCCCC-C---------- T ss_conf 4788-86999505308899998999708987576420079999987699761348999996-298877-7---------- Q ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00000123333222111112456776322 Q gi|254780277|r 531 DKISDSQVQEHAIENSNVILQLWLLLRPK 559 (976) Q Consensus 531 ~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~ 559 (976) ....-|..|+.++..++..+... T Consensus 159 ------~~~H~Al~Da~~~a~~~~~~~~~ 181 (243) T COG0847 159 ------FHPHRALFDALALAELFLLLQTG 181 (243) T ss_pred ------CCCCCHHHHHHHHHHHHHHHHHH T ss_conf ------77661678899999999998620 No 97 >cd06133 ERI-1_3'hExo_like This subfamily is composed of Caenorhabditis elegans ERI-1, human 3' exonuclease (3'hExo), Drosophila exonuclease snipper (snp), and similar proteins from eukaryotes and bacteria. These are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ERI-1 has been implicated in the degradation of small interfering RNAs (RNAi). 3'hExo participates in the degradation of histone mRNAs. Snp is a non-essential exonuclease that efficiently degrades structured RNA and DNA substrates as long as there is a minimum of 2 nucleotides in the 3' overhang to initiate degradation. Snp is not a functional homolog of either ERI-1 or 3'hExo. Probab=98.02 E-value=1.8e-05 Score=58.57 Aligned_cols=149 Identities=11% Similarity=0.092 Sum_probs=88.7 Q ss_pred CCCCCCCCCCCCCCCC-----CCCCCEEECCCCCCCCCCCCCCEEC-C-----CC------CCCCCCCCCCCCCHHHHHH Q ss_conf 5232332224331100-----1244201025776654434550231-3-----74------4333456500178899997 Q gi|254780277|r 389 SASFKIITDTMDAFHS-----KPIAIAISTLDQKDDTSFKVETIFI-D-----LS------VHTSQDSTGKNTLTKEILS 451 (976) Q Consensus 389 ~ia~DtETt~ld~~~~-----~ivGisls~~~~~~~~~~~~~~~yi-p-----~~------~~~~~d~~~~~~~~~evl~ 451 (976) ++++|+||||+++... +|+-|+...-+............|+ | ++ +....+.-...-...+++. T Consensus 1 yvv~D~EtTg~~~~~~~~~~~eIIeIgav~vd~~~~~i~~~f~~lI~P~~~~~i~~~i~~itGIt~~~l~~ap~~~~v~~ 80 (176) T cd06133 1 YLVIDFEATCWEGNSKPDYPNEIIEIGAVLVDVKTKEIIDTFSSYVKPVINPKLSDFCTELTGITQEDVDNAPSFPEVLK 80 (176) T ss_pred CEEEEEECCCCCCCCCCCCCCCEEEEEEEEEECCCCEEEEEEEEEECCCCCCCCCHHHHHHHCCCHHHHHCCCCHHHHHH T ss_conf 98999726899878898999707999999998799979899999975876887898899773818878707863999999 Q ss_pred HHHHHHCCCCCHHHHCCCCHHHHHHH----HCCCC-CC----CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCC Q ss_conf 44442022210011003212378898----61886-66----42100234554316876677246677622444223356 Q gi|254780277|r 452 YLKKFFENEHFLKIGHNIKYDKLVLH----RYGIS-MR----GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILK 522 (976) Q Consensus 452 ~L~~lle~~~i~kIgHN~KfD~~~L~----~~GI~-l~----~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~ 522 (976) .|.+|+.+.......||++||..++. +++.. .. .+.|++......-+...+++|.+++++| |.... T Consensus 81 ~f~~~i~~~~~~~~~~~~~fD~~~l~~~~~~~~~~~~~~~~~~~iD~~~~~~~~~~~~~~~sL~~l~~~~-gi~~~---- 155 (176) T cd06133 81 EFLEWLGKNGKYAFVTWGDWDLKDLLQNQCKYKIINLPPFFRQWIDLKKEFAKFYGLKKRTGLSKALEYL-GLEFE---- 155 (176) T ss_pred HHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHC-CCCCC---- T ss_conf 9999972698579996060029999999999789988730111204999999981888898999999986-99999---- Q ss_pred CCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5432221000000123333222111112456776 Q gi|254780277|r 523 SRKSSIPIDKISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 523 ~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) .-...|..||..|..++.+| T Consensus 156 --------------~~~H~AL~DA~~ta~v~~~l 175 (176) T cd06133 156 --------------GRHHRGLDDARNIARILKRL 175 (176) T ss_pred --------------CCCCCCHHHHHHHHHHHHHH T ss_conf --------------98858599999999999987 No 98 >cd05780 DNA_polB_Kod1_like_exo The 3'-5' exonuclease domain of archaeal family-B DNA polymerases with similarity to Pyrococcus kodakaraensis Kod1, including polymerases from Desulfurococcus (D. Tok Pol) and Thermococcus gorgonarius (Tgo Pol). Kod1, D. Tok Pol, and Tgo Pol are thermostable enzymes that exhibit both polymerase and 3'-5' exonuclease activities. They are family-B DNA polymerases. Their amino termini harbor a DEDDy-type DnaQ-like 3'-5' exonuclease domain that contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members of this subfamily show similarity to eukaryotic DNA polymerases involved in DNA replication. Some archaea possess multiple family-B D Probab=97.93 E-value=2.9e-05 Score=57.03 Aligned_cols=158 Identities=11% Similarity=0.115 Sum_probs=90.0 Q ss_pred CCCCCCCCCCCC----CCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHC--CC Q ss_conf 685232332224----331100124420102577665443455023137443334565001788999974444202--22 Q gi|254780277|r 387 IGSASFKIITDT----MDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFE--NE 460 (976) Q Consensus 387 ~~~ia~DtETt~----ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle--~~ 460 (976) .+.++||+||.+ .+|..+.++.||++...+.....+... +.. ....-..-.+++..+..++. +| T Consensus 3 lkilsfDIE~~~~~g~P~p~~d~Ii~Is~~~~~~~~vi~~~~~----~~~------~v~~~~tE~eLL~~F~~~i~~~dP 72 (195) T cd05780 3 LKILSFDIEVLNHEGEPNPEKDPIIMISFADEGGNKVITWKKF----DLP------FVEVVKTEKEMIKRFIEIVKEKDP 72 (195) T ss_pred CCEEEEEEEECCCCCCCCCCCCCEEEEEEEECCCCEEEEECCC----CCC------CEEEECCHHHHHHHHHHHHHHCCC T ss_conf 4299999997589999898888289999996899789981798----877------456738999999999999987299 Q ss_pred CCHHHHCCC-CHHHHHHHH----CCCCCC----------------------C--CCCHHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 100110032-123788986----188666----------------------4--21002345543168766772466776 Q gi|254780277|r 461 HFLKIGHNI-KYDKLVLHR----YGISMR----------------------G--FEDIMLMSYVLDSGRSSHDIASIAQK 511 (976) Q Consensus 461 ~i~kIgHN~-KfD~~~L~~----~GI~l~----------------------~--i~DTmlaayLLdp~~~~~~L~~la~~ 511 (976) ++. +|+|. .||+..|.+ +|+.+. | ..|.+-... -+-...+|+|+++|+. T Consensus 73 Dii-~GyN~~~FD~pYl~~R~~~~~~~~~~gr~~~~~~~~~~g~~~~~~i~Gr~~lD~~~~~~-~~~~l~sy~L~~Va~~ 150 (195) T cd05780 73 DVI-YTYNGDNFDFPYLKKRAEKLGIELDLGRDGSEIKIQRGGFNNASEIKGRIHVDLYPVAR-RTLNLTRYTLERVYEE 150 (195) T ss_pred CEE-EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCEEEEECCEEEEEHHHHHH-HHCCCCCCCHHHHHHH T ss_conf 999-95798787689999999995898645788860479977850799876868855499997-5256240159999999 Q ss_pred HCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 224442233565432221000000123333222111112456776 Q gi|254780277|r 512 WLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 512 yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) +||....++....-....-.....+.+.+||.+||.++++|.+++ T Consensus 151 ~Lg~~K~d~~~~~i~~~~~~~~~~~~l~~Yn~~D~~L~~~L~~K~ 195 (195) T cd05780 151 LFGIEKEDVPGEEIAEAWDSGENLERLFRYSMEDAKYTYEIGKEF 195 (195) T ss_pred HHCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 839997889999999999759753888844599899999998529 No 99 >cd05781 DNA_polB_B3_exo The 3'-5' exonuclease domain of archaeal proteins with similarity to Sulfurisphaera ohwakuensis DNA polymerase B3. B3 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B3 exhibits both polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Archaeal proteins that are involved in DNA replication are similar to those from eukaryotes. Some archaea possess multiple family-B DNA polymerases. B3 is mainly found in crenarch Probab=97.77 E-value=8.7e-05 Score=53.49 Aligned_cols=149 Identities=15% Similarity=0.154 Sum_probs=84.0 Q ss_pred CCCCCCCCCCCCC----CCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHC--CC Q ss_conf 6852323322243----31100124420102577665443455023137443334565001788999974444202--22 Q gi|254780277|r 387 IGSASFKIITDTM----DAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFE--NE 460 (976) Q Consensus 387 ~~~ia~DtETt~l----d~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle--~~ 460 (976) .+..+||+||.+. +|..+.++.||++..++... ++-. ....-.+++..+..++. +| T Consensus 3 lk~lsfDIE~~~~~gfp~~~~d~Ii~Is~~~~~g~~~--------~~~~----------~~~~E~~ll~~f~~~i~~~~P 64 (188) T cd05781 3 LKTLAFDIEVYSKYGTPNPRRDPIIVISLATSNGDVE--------FILA----------EGLDDRKIIREFVKYVKEYDP 64 (188) T ss_pred CEEEEEEEEECCCCCCCCCCCCCEEEEEEEECCCCEE--------EEEE----------CCCCHHHHHHHHHHHHHHCCC T ss_conf 4199999887689998998889789999996899769--------9984----------799989999999999987299 Q ss_pred CCHHHHCCC-CHHHHHHH----HCCCCCC--------------------C--CCCHHHHHHHHCCCCCCCCHHHHHHHHC Q ss_conf 100110032-12378898----6188666--------------------4--2100234554316876677246677622 Q gi|254780277|r 461 HFLKIGHNI-KYDKLVLH----RYGISMR--------------------G--FEDIMLMSYVLDSGRSSHDIASIAQKWL 513 (976) Q Consensus 461 ~i~kIgHN~-KfD~~~L~----~~GI~l~--------------------~--i~DTmlaayLLdp~~~~~~L~~la~~yL 513 (976) ++ .+|||. .||+.+|. .+|+.+. | ..|.+-..+ -+-...+++|+.+|+ +| T Consensus 65 di-i~gyN~~~FD~pyl~~Ra~~~~i~l~~gr~~~~~~~~~~~~~~~i~Gr~~iDl~~~~~-~~~~l~sysL~~Va~-~l 141 (188) T cd05781 65 DI-IVGYNSNAFDWPYLVERARVLGVKLDVGRRGGSEPSTGVYGHYSITGRLNVDLYDFAE-EIPEVKVKTLENVAE-YL 141 (188) T ss_pred CE-EEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECHHHHH-HHCCCCCCCHHHHHH-HH T ss_conf 99-9966986757899999999969985015567765122542069987899997789997-624756457999999-85 Q ss_pred CCC-CCCCCCCCCCCCC-CHH--HCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 444-2233565432221-000--000123333222111112456776 Q gi|254780277|r 514 SYT-RKEILKSRKSSIP-IDK--ISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 514 ~~~-~~e~~~~~k~~~~-~~~--i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) +.. ..+..-.....+. ++. -..+.+..||.+||..|++|.++| T Consensus 142 g~~~k~e~~~~~~~~I~~~~~~~~~r~~l~~YnlqDa~L~~~L~~KL 188 (188) T cd05781 142 GVMKKSERVLIEWYRIYEYWDDEKKRDILLKYNRDDARSTYGLAEKL 188 (188) T ss_pred CCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 41224544657999999987084121699999788799999998429 No 100 >cd05779 DNA_polB_epsilon_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase epsilon. DNA polymerase epsilon is a family-B DNA polymerase with a catalytic subunit that contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (alpha and delta are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase epsilon plays a role in elongating the leading strand during DNA replication. It is also involved in DNA repair. The catalytic subunit contains both polymerase and 3'-5' exonuclease activities. The N-terminal exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. DNA polymerase epsilon also carries a unique large C-terminal domain with an unknown function. Phylogenetic analyses indicate that it is ort Probab=97.76 E-value=5.7e-05 Score=54.81 Aligned_cols=162 Identities=17% Similarity=0.203 Sum_probs=86.4 Q ss_pred CCCCCCCCCCC-----CCCCCCCCCCEEECCCCCCCC------CCCCCCE-ECCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 52323322243-----311001244201025776654------4345502-31374433345650017889999744442 Q gi|254780277|r 389 SASFKIITDTM-----DAFHSKPIAIAISTLDQKDDT------SFKVETI-FIDLSVHTSQDSTGKNTLTKEILSYLKKF 456 (976) Q Consensus 389 ~ia~DtETt~l-----d~~~~~ivGisls~~~~~~~~------~~~~~~~-yip~~~~~~~d~~~~~~~~~evl~~L~~l 456 (976) ..|||+||.+. +|..+.++.||+.....+... ......+ |.+.....+.........-.+.+..+..+ T Consensus 4 ilsfDIE~~~~~~~fP~~~~D~Ii~Is~~~~~~g~~i~~~~~~~~~i~~~~~~~~~~~~g~~~v~~~~~E~~ll~~f~~~ 83 (204) T cd05779 4 VLAFDIETTKLPLKFPDAETDQIMMISYMIDGQGYLIVNREIVSEDIEDFEYTPKPEYEGPFKVFNEPDEKALLQRFFEH 83 (204) T ss_pred EEEEEEEEECCCCCCCCCCCCEEEEEEEEEECCCEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHH T ss_conf 99999995079999989888908999999916877987434013533000146785665764898369999999999999 Q ss_pred HC--CCCCHHHHCCC-CHHHHHHH----HCCCCCCC---C----CCHHHHHHHH----------CC--CCCCCCHHHHHH Q ss_conf 02--22100110032-12378898----61886664---2----1002345543----------16--876677246677 Q gi|254780277|r 457 FE--NEHFLKIGHNI-KYDKLVLH----RYGISMRG---F----EDIMLMSYVL----------DS--GRSSHDIASIAQ 510 (976) Q Consensus 457 le--~~~i~kIgHN~-KfD~~~L~----~~GI~l~~---i----~DTmlaayLL----------dp--~~~~~~L~~la~ 510 (976) +. +|++ .+|+|. .||+.+|. .+|+.+.. + .++....+.+ +. ...+++|+++++ T Consensus 84 i~~~~pdi-i~gyN~~~FD~pyl~~Ra~~~~~~~~~~~~~~~~~~~~~~~~~~vhiDly~~v~rd~~l~~~sy~Ln~Vs~ 162 (204) T cd05779 84 IREVKPHI-IVTYNGDFFDWPFVEARAAIHGLSMEEEIGFRKDSEGEYKSRYIIHMDCFRWVKRDSYLPQGSQGLKAVTK 162 (204) T ss_pred HHHCCCCE-EEECCCCCCCHHHHHHHHHHHCCCHHHHCCCCCCCCCCEECCCCEEEECHHHHHHHCCCCCCCCCHHHHHH T ss_conf 98649999-98558767767999999999599825531800067772762672563007765101217556877899999 Q ss_pred HHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 6224442233565432221000000123333222111112456 Q gi|254780277|r 511 KWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLW 553 (976) Q Consensus 511 ~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~ 553 (976) .+||....++....- ..++.-..+..++||.+||.+|+.|| T Consensus 163 ~~Lg~~K~dv~~~~m--~~l~~~~~~~la~Y~~~Da~~Ty~l~ 203 (204) T cd05779 163 AKLGYDPVELDPEDM--VPLAREDPQTLASYSVSDAVATYYLY 203 (204) T ss_pred HHHCCCCCCCCHHHH--HHHHCCCHHHHHHCCHHHHHHHHHHH T ss_conf 996899750899999--99982891555524099999999976 No 101 >PRK05762 DNA polymerase II; Reviewed Probab=97.67 E-value=0.00012 Score=52.39 Aligned_cols=156 Identities=15% Similarity=0.179 Sum_probs=83.6 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHC--CCCCHH Q ss_conf 685232332224331100124420102577665443455023137443334565001788999974444202--221001 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFE--NEHFLK 464 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle--~~~i~k 464 (976) ....+||+||...+ .++.|++...+.... +.-.. .........-..-.+++..|..++. ||++. T Consensus 155 lrvlSfDIEc~~~g----~ii~I~l~~~~~~~v--------~~~g~-~~~~~~v~~~~~E~eLL~~F~~~i~~~DPDII- 220 (785) T PRK05762 155 LKVVSLDIETSNKG----ELYSIGLEGCGQRQV--------YMLGE-ANGDAELEYVADEKALLERFNAWFAEFDPDVI- 220 (785) T ss_pred CEEEEEEEEECCCC----CEEEEEECCCCCCEE--------EEECC-CCCCCEEEEECCHHHHHHHHHHHHHHHCCCEE- T ss_conf 86999998977999----889999517998468--------99668-89985389909999999999999998599989- Q ss_pred HHCCC-CHHHHHHHH----CCCCC-------------------------CC--CCCHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 10032-123788986----18866-------------------------64--210023455431687667724667762 Q gi|254780277|r 465 IGHNI-KYDKLVLHR----YGISM-------------------------RG--FEDIMLMSYVLDSGRSSHDIASIAQKW 512 (976) Q Consensus 465 IgHN~-KfD~~~L~~----~GI~l-------------------------~~--i~DTmlaayLLdp~~~~~~L~~la~~y 512 (976) +|||+ .||+..|.+ +|+.+ .| +.|++-+..=-.-...+|+|+++|..+ T Consensus 221 ~GyNi~~FDlpyL~~Ra~~~gi~l~lgR~~~~~~~~~~~~~~~~~~~~i~GRv~iD~~~~~k~~~~~l~sYsL~~Va~~~ 300 (785) T PRK05762 221 IGWNVVQFDLRLLQERAERYGIPLRLGRGGSELSWRVHGHKRGHFFASVPGRLVLDGIDALKSAFYEFKSFSLENVAQEL 300 (785) T ss_pred EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCEECCCCCCCEEEEEEEEEEEEEEHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 96274787779999999994998546878886533315655660599984289997099998752036766299999998 Q ss_pred CCCCCCCCCCCC-CCCC-CCHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 244422335654-3222-1000000123333222111112456776 Q gi|254780277|r 513 LSYTRKEILKSR-KSSI-PIDKISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 513 L~~~~~e~~~~~-k~~~-~~~~i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) |+.......... ...+ .+..-+...+..||.+||.++++|..++ T Consensus 301 Lge~K~~~~~~~r~~eI~~~~~~~r~~l~~Y~l~Da~L~~~L~~kl 346 (785) T PRK05762 301 LGEGKAIDNPYDRMDEIDRYFADDKPALAKYNLKDCELVTRIFEKT 346 (785) T ss_pred HCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 5445555666666888899885468999999999999999999987 No 102 >KOG0950 consensus Probab=97.65 E-value=1.6e-05 Score=58.91 Aligned_cols=266 Identities=15% Similarity=0.153 Sum_probs=153.1 Q ss_pred HHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHH---CCCCHH-----HHHHHHHHHH-HHHHHH Q ss_conf 9740345577776566679987420788755566545353178877653---047627-----9999999878-889886 Q gi|254780277|r 613 IYDLSGEKFNLSSPKQLGDILFTKLKFPSGAKTKTGQWKTTAQDLEQIN---CGDNPI-----IKNILEWRQL-SKIKST 683 (976) Q Consensus 613 i~~~aG~~FNinSpkQL~~vLf~~LgLp~~kKTktG~~ST~~~~Le~L~---~~~hp~-----~~~iLeyR~l-~KL~sT 683 (976) |..+.---||..|+--.+.+||-+|+.- -..-+|++.. .-.-|+ .-..+.|+.+ .+|- T Consensus 715 ~t~Lg~a~f~~~~~~~~a~~l~~~L~~~-----------~~~~vle~~lh~lylvtP~~~~~~~~dwli~f~i~~~L~-- 781 (1008) T KOG0950 715 ITRLGRACFNAGSDPEVANILFADLKKS-----------LPQLVLESSLHLLYLVTPYLEVMNDIDWLIYFQIYHTLP-- 781 (1008) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHH-----------HHCCCCCCCCCEEEEECCHHHHCCCCCHHHHHHHHHCCC-- T ss_conf 5666565642567856667899999875-----------302433230004544123676325500788889986189-- Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCC-CCCCCCEEEECCCCCEEEEEECCCCCEEEEEEEE Q ss_conf 532001111001233345330022210000013334333455633-2222430242489948999632654436876651 Q gi|254780277|r 684 YADSLPNHINKRTQRVHTFYSLASTMTGRLASLEPNLQNIPIKTD-LGQKIRRAFISPPTKKLISADYSQIELRILAHIA 762 (976) Q Consensus 684 Y~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~PNlQNiP~r~~-~g~~iR~~F~a~~g~~lv~aDySQiElRilA~ls 762 (976) .+...... --|++|..+-- ...| -+.||+||.|++.. ++-.++.-|+...--..|..+|-|-+.|+.||++ T Consensus 782 --~~~~~~~~-~~G~~e~fi~~--~~~g---qs~~~~~~~~~~~r~y~~l~L~~li~espi~~V~~kYk~~rg~lqall~ 853 (1008) T KOG0950 782 --SPEQKLAK-LLGVIESFIEK--CVSG---QSVRNLQNVQKRKRLYVALALQKLINESPIRTVAEKYKVERGRLQALLS 853 (1008) T ss_pred --CHHHHHHH-HHCHHHHHHHH--HHHC---CCCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCHHHHHHHHHH T ss_conf --37899986-60637778887--6405---5345532135788999999999998527278899884751788999875 Q ss_pred EHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHH Q ss_conf 03777873406988899999898099968899888863025135644077703567881999899999999999988889 Q gi|254780277|r 763 KITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGI 842 (976) Q Consensus 763 ~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v 842 (976) ++..+..... .|.-.+..- -++.-+.-.+|.+|+++.+-|..=..++-..+++...-|++.|++| T Consensus 854 ~a~~~a~~It------------~Fce~l~w~---~~~~l~~~~~~rl~~g~~~eL~~Lmrv~~~~~~RAr~lf~Agf~tv 918 (1008) T KOG0950 854 NASSFASLIT------------FFCESIQWF---PLRALLSEFYGRLSFGGHAELIPLMRVPDVKAERARQLFKAGFTSV 918 (1008) T ss_pred CCHHHHHHHH------------HHHHHHHHC---CHHHHHHHHHHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHHCCCH T ss_conf 0046788999------------998776223---1688999998887412125345452574367899999998445335 Q ss_pred HHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 99999999999984988827787601898758837888778787701345653899999999999999986599818999 Q gi|254780277|r 843 HEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKMLL 922 (976) Q Consensus 843 ~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~il 922 (976) ..+...+.....+.=+..+.++++.-+. +... .++|. .+||+.-.+-+.+.+...-+.. T Consensus 919 ~~iA~a~p~klvkel~~si~~~~a~~i~---~s~~----~~l~~----t~~~~~~e~~~~~iv~~es~~~---------- 977 (1008) T KOG0950 919 GSIANATPEKLVKELPISISMKQATQIV---ASAK----DELRK----TGKGLMEEIEATKIVDIESREF---------- 977 (1008) T ss_pred HHHHCCCHHHHHHHHHCCCCHHHHHHHH---HHHH----HHHHH----HHCCCCEECCHHHHHHHHHHHH---------- T ss_conf 8885588277898751445187765577---5555----78887----3004523333378777788875---------- Q ss_pred EECCEEEEECCHHHH Q ss_conf 726389995598799 Q gi|254780277|r 923 QIHDELVFEVPEEEI 937 (976) Q Consensus 923 qVHDElv~ev~~~~~ 937 (976) ||+-..++|+-.. T Consensus 978 --~w~~~~a~~~i~~ 990 (1008) T KOG0950 978 --HWEVLVALPEIHV 990 (1008) T ss_pred --HHHHHHHCCHHHH T ss_conf --4467651323667 No 103 >PTZ00166 DNA polymerase delta catalytic subunit; Provisional Probab=97.61 E-value=0.00054 Score=47.73 Aligned_cols=206 Identities=17% Similarity=0.161 Sum_probs=99.8 Q ss_pred HHCCCCCCCCCCCCCC------CCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCC-CCCCCC-CCHHHHHHHHHHH Q ss_conf 8668523233222433------1100124420102577665443455023137443334-565001-7889999744442 Q gi|254780277|r 385 ETIGSASFKIITDTMD------AFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQ-DSTGKN-TLTKEILSYLKKF 456 (976) Q Consensus 385 ~~~~~ia~DtETt~ld------~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~-d~~~~~-~~~~evl~~L~~l 456 (976) ......+||+|+.|.+ +..+.++.||-..........+.. .+.+.+..-..- +...-. -.-.+.+..+..+ T Consensus 288 aPlrilSfDIEC~~~~G~gFP~~~~DpVIqIs~~~~~~g~~~~~~~-~~~ftl~~C~~I~g~~V~~F~tE~eLL~aF~~f 366 (1081) T PTZ00166 288 PPIRILSFDIECIKLDGKGFPEANNDPVIQISSVLHTHGDPLDSCK-NFIFTLKECASLAGANVLWFDNEKTLLLAWNDF 366 (1081) T ss_pred CCCEEEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEECCCCCCCCC-CEEEECCCCCCCCCCEEEEECCHHHHHHHHHHH T ss_conf 8816999867845888998989999957998579881478887646-446674787788997799709999999999999 Q ss_pred H--CCCCCHHHHCCC-CHHHHHHHH----CCCC----------------------------------CCC--CCCHHHHH Q ss_conf 0--222100110032-123788986----1886----------------------------------664--21002345 Q gi|254780277|r 457 F--ENEHFLKIGHNI-KYDKLVLHR----YGIS----------------------------------MRG--FEDIMLMS 493 (976) Q Consensus 457 l--e~~~i~kIgHN~-KfD~~~L~~----~GI~----------------------------------l~~--i~DTmlaa 493 (976) + .||++. +|||+ .||+..|.. .++. +.| ++|.+... T Consensus 367 i~~~DPDII-TGYNI~nFDlpYLl~Ra~~L~i~~f~~lgRlk~~~~~~~~~~~~s~a~G~~~~k~i~i~GRv~~Dl~~~v 445 (1081) T PTZ00166 367 IIAVDPDFL-TGYNIINFDLPYLLNRATALNLKKFKFLTRIKSVKSTVKDSIFSSNQFGTHENKEINIEGRILFDVYDLI 445 (1081) T ss_pred HHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCEEEEECCEEEEEHHHHH T ss_conf 987099899-9668788888999999999588643323611577652202656655556665506867578996469999 Q ss_pred HHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCC-CCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5431687667724667762244422335654322-210000001233332221111124567763222100013457886 Q gi|254780277|r 494 YVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSS-IPIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVEKLLHVYERL 572 (976) Q Consensus 494 yLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~-~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~~~i 572 (976) . -+-...+|.|++++..+|+...+++.-+.-.. ...+......+..||..|+.++++|..+|.--..--.+.++ - T Consensus 446 ~-r~~kL~SYkLntVs~~fLgeqKeDV~~~~I~~l~~~~~e~r~rla~YClkDa~L~lrL~~KL~~l~n~~EMArV---t 521 (1081) T PTZ00166 446 R-RDYKLKSYSLNYVSFEFLKEQKEDVHYSTINKLQNGNPEDRKRIASYCLKDSILPLRLIDKLLLLYNYIEMARV---T 521 (1081) T ss_pred H-HHCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---H T ss_conf 8-75587857688899998188753379999999981493788999999899899999999999899999999999---7 Q ss_pred CCCCHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 07501233221014310058999986 Q gi|254780277|r 573 DKPMIDVVSQMEITGIQIDQDLLKQV 598 (976) Q Consensus 573 E~Pl~~vLa~ME~~Gi~vD~~~L~~l 598 (976) -.|+..+|.+ -.+++|....|++. T Consensus 522 gVPl~~ll~r--GQQIRV~S~LLr~a 545 (1081) T PTZ00166 522 GTPIKYLLTR--GQQIKVTSQIYRKC 545 (1081) T ss_pred CCCHHHHHCC--CCHHHHHHHHHHHH T ss_conf 8879998106--52374899999998 No 104 >cd05160 DNA_polB_exo The 3'-5' exonuclease domain of family-B DNA polymerases. This domain has a fundamental role in reducing polymerase errors and is involved in proofreading activity. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. The exonuclease domain of family B polymerase also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Members include E. coli DNA polymerase II, some eubacterial phage DNA polymerases, nuclear replicative DNA polymerases (alpha, delta, epsilon and zeta), and eukaryotic viral and Probab=97.60 E-value=0.00014 Score=51.93 Aligned_cols=162 Identities=18% Similarity=0.138 Sum_probs=82.9 Q ss_pred CCCCCCCCCCC----CCCCCCCCCEEECCCCCCCCCCCCCCEECCCCC-CCCCCC-CC-CCCCHHHHHHHHHHHHC--CC Q ss_conf 23233222433----110012442010257766544345502313744-333456-50-01788999974444202--22 Q gi|254780277|r 390 ASFKIITDTMD----AFHSKPIAIAISTLDQKDDTSFKVETIFIDLSV-HTSQDS-TG-KNTLTKEILSYLKKFFE--NE 460 (976) Q Consensus 390 ia~DtETt~ld----~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~-~~~~d~-~~-~~~~~~evl~~L~~lle--~~ 460 (976) ++||+||.+.+ |-...++.||++.........+.....-..... ....+. .. .-..-.+.+..+..++. +| T Consensus 2 lsfDIE~~~~~gfP~p~~d~Ii~Is~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~E~eLL~~F~~~i~~~dP 81 (200) T cd05160 2 LSFDIETLPNVGGPEPSRDPIICISYADSFQGHKVVFLLKVSTHGDDIEAGFIDGIEVEYFDDEKELLKRFFDIIREYDP 81 (200) T ss_pred EEEEEEECCCCCCCCCCCCEEEEEEEEEECCCEEEEEEEECCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHCCC T ss_conf 79998327999978999880899999993599999988643556767654677996999969999999999999997099 Q ss_pred CCHHHHCCC-CHHHHHHH----HCCCCCC---------------------C--CCCHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 100110032-12378898----6188666---------------------4--210023455431687667724667762 Q gi|254780277|r 461 HFLKIGHNI-KYDKLVLH----RYGISMR---------------------G--FEDIMLMSYVLDSGRSSHDIASIAQKW 512 (976) Q Consensus 461 ~i~kIgHN~-KfD~~~L~----~~GI~l~---------------------~--i~DTmlaayLLdp~~~~~~L~~la~~y 512 (976) ++. +|+|. .||+..|. ++|+.+. | ..|.+-... -+....+|+|+.+|+.+ T Consensus 82 Dii-~GyN~~~FD~pyL~~R~~~l~~~~~~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~k-~~~~l~sy~Ld~Va~~~ 159 (200) T cd05160 82 DIL-TGYNIDDFDLPYLLERAKALGIKLDDIGRRSGQEKSSGETERAAVKGRVVFDLLRAVK-RDFKLKSYTLDAVAEEL 159 (200) T ss_pred CEE-EECCCCCCCHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCEEEEEECCEEEEHHHHHHH-HHHCCCCCCHHHHHHHH T ss_conf 999-9726777848899999999698855546734545435750599736778602899999-75324665399999998 Q ss_pred CCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 244422335654322210000001233332221111124567 Q gi|254780277|r 513 LSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWL 554 (976) Q Consensus 513 L~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~ 554 (976) |+....+.....-... +..-..+.+.+|+..||.++++|.+ T Consensus 160 Lg~~K~d~~~~~~~~~-~~~~d~~k~~~Y~~~D~~L~~~L~e 200 (200) T cd05160 160 LGEGKEKVDGEIIEDL-EWEEDLERLLEYNLKDAELTLQILE 200 (200) T ss_pred HCCCCCCCCHHHHHHH-HHCCCCHHHHHEEHHHHHHHHHHHC T ss_conf 4999787898999999-8738914866025999998999759 No 105 >cd05785 DNA_polB_like2_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=97.59 E-value=0.00017 Score=51.36 Aligned_cols=149 Identities=15% Similarity=0.269 Sum_probs=81.2 Q ss_pred HHCCCCCCCCCCCCCCCC--------CCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 866852323322243311--------001244201025776654434550231374433345650017889999744442 Q gi|254780277|r 385 ETIGSASFKIITDTMDAF--------HSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKF 456 (976) Q Consensus 385 ~~~~~ia~DtETt~ld~~--------~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~l 456 (976) .....++||+||++.+.+ ...++.|++....+... .+.. ....-.+++..+..+ T Consensus 7 ~~lr~lsfDIE~~~~~g~~~p~p~~~~d~Ii~I~~~~~~~~~~--------v~~~----------~~~~E~~lL~~F~~~ 68 (207) T cd05785 7 DDLRRLQLDIETYSLPGFFFSNPDRGDDRIIIVALRDNRGWEE--------VLHA----------EDAAEKELLEELVAI 68 (207) T ss_pred HHEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEECCCCEE--------EEEE----------CCCCHHHHHHHHHHH T ss_conf 0405999999747889977999888999789999885899779--------8762----------699999999999999 Q ss_pred HC--CCCCHHHHCCC-CHHHHHHHH----CCCCCC------------------------------C--CCCHHHHHHHH- Q ss_conf 02--22100110032-123788986----188666------------------------------4--21002345543- Q gi|254780277|r 457 FE--NEHFLKIGHNI-KYDKLVLHR----YGISMR------------------------------G--FEDIMLMSYVL- 496 (976) Q Consensus 457 le--~~~i~kIgHN~-KfD~~~L~~----~GI~l~------------------------------~--i~DTmlaayLL- 496 (976) +. +|++. +|+|. .||+..|.+ +|+++. | +.|.+.+..-. T Consensus 69 i~~~dPDII-tGyNi~~FD~pYL~~Ra~~~~i~~~lgR~~~~~~~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~k~~~ 147 (207) T cd05785 69 IRERDPDVI-EGHNIFRFDLPYLRRRCRRHGVPLAIGRDGSIPRQRPSRFRFAERLIDYPRYDIPGRHVIDTYFLVQLFD 147 (207) T ss_pred HHHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCEEECCEEEECCCCCCEEEEEECCEEEEEHHHHHHHHC T ss_conf 987399999-8679877588999999999599722257898744640302122355530499873648865499998642 Q ss_pred --CCCCCCCCHHHHHHHHCCCCCCCCCCC-CCCCCCCHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf --168766772466776224442233565-432221000000123333222111112456 Q gi|254780277|r 497 --DSGRSSHDIASIAQKWLSYTRKEILKS-RKSSIPIDKISDSQVQEHAIENSNVILQLW 553 (976) Q Consensus 497 --dp~~~~~~L~~la~~yL~~~~~e~~~~-~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~ 553 (976) +-...+|+|+.+|+. ++...++.... +..-..+..-+.+.+..|+.+||..|.+|. T Consensus 148 ~~~~~L~SY~L~~Va~~-~~l~~~~k~~~~~~~i~~~~~~d~~~l~~Yni~Dv~~t~~La 206 (207) T cd05785 148 VSSRDLPSYGLKAVAKH-FGLASPDRTYIDGRQIAEVWRSDPARLLAYALDDVRETEGLA 206 (207) T ss_pred CCCCCCCCCCHHHHHHH-HHHCCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC T ss_conf 11146776528999998-731366656388999999995298999998299999999860 No 106 >PRK00558 uvrC excinuclease ABC subunit C; Validated Probab=97.53 E-value=0.0008 Score=46.45 Aligned_cols=32 Identities=19% Similarity=0.116 Sum_probs=18.1 Q ss_pred HHHHHH---CCCHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 356440---77703567881999899999999999 Q gi|254780277|r 805 NFSIIY---GISPFRLATQLRIPRSEAADYIKRYF 836 (976) Q Consensus 805 nfgi~Y---G~~~~~La~~l~is~~eA~~~i~~yf 836 (976) .||=+. -.|..-|++-.||+.+-|+.+.+.++ T Consensus 573 ~Fgs~~~i~~As~eeL~~v~Gi~~~~A~~I~~~l~ 607 (609) T PRK00558 573 HFGSLKAIKEASVEELAKVPGISKKLAEKIYEALH 607 (609) T ss_pred HCCCHHHHHHCCHHHHHHCCCCCHHHHHHHHHHHH T ss_conf 70799999738999996489989999999999970 No 107 >cd05777 DNA_polB_delta_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase delta. DNA polymerase delta is a family-B DNA polymerase with a catalytic subunit that contains a DEDDy-type DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (alpha and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase delta is the enzyme responsible for both elongation and maturation of Okazaki fragments on the lagging strand. It is also implicated in mismatch repair (MMR) and base excision repair (BER). The catalytic subunit displays both polymerase and 3'-5' exonuclease activities. The exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues necessary for metal binding and catalysis. The exonuclease domain of family B polyme Probab=97.50 E-value=0.00015 Score=51.72 Aligned_cols=168 Identities=17% Similarity=0.200 Sum_probs=81.4 Q ss_pred HHCCCCCCCCCCCCC-----CCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCC-CCCCC-CCCCHHHHHHHHHHHH Q ss_conf 866852323322243-----3110012442010257766544345502313744333-45650-0178899997444420 Q gi|254780277|r 385 ETIGSASFKIITDTM-----DAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTS-QDSTG-KNTLTKEILSYLKKFF 457 (976) Q Consensus 385 ~~~~~ia~DtETt~l-----d~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~-~d~~~-~~~~~~evl~~L~~ll 457 (976) ......+||+||.+. +|..+.++.|+...........+. ..++. +..... .+... .-..-.+.+..+..++ T Consensus 5 pplki~sfDIE~~s~~g~fP~p~~D~Ii~I~~~~~~~~~~~~~~-~~v~~-l~~~~~i~~~~V~~~~tE~eLL~~F~~~i 82 (230) T cd05777 5 APLRILSFDIECAGRKGVFPEPEKDPVIQIANVVTRQGEGEPFI-RNIFT-LKTCAPIVGAQVFSFETEEELLLAWRDFV 82 (230) T ss_pred CCCEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEECCCCCCEE-EEEEE-ECCCCCCCCCEEEECCCHHHHHHHHHHHH T ss_conf 98049999999526999998988883489999998279776047-87898-26878889987998499999999999999 Q ss_pred C--CCCCHHHHCCC-CHHHHHHHH----CCCCC----------------------------------CC--CCCHHHHHH Q ss_conf 2--22100110032-123788986----18866----------------------------------64--210023455 Q gi|254780277|r 458 E--NEHFLKIGHNI-KYDKLVLHR----YGISM----------------------------------RG--FEDIMLMSY 494 (976) Q Consensus 458 e--~~~i~kIgHN~-KfD~~~L~~----~GI~l----------------------------------~~--i~DTmlaay 494 (976) . +|++. +|+|. .||+..|.+ +|+.. .| ..|.+-.-. T Consensus 83 ~~~dPDii-tGyN~~~FD~pYL~~Ra~~l~~~~~~~lgr~~~~~~~~~~~~~~s~~~g~~~~~~~~i~Gr~~lD~~~~~r 161 (230) T cd05777 83 QEVDPDII-TGYNICNFDLPYLLERAKALKLNTFPFLGRIKNIKSTIKDTTFSSKQMGTRETKEINIEGRIQFDLLQVIQ 161 (230) T ss_pred HHCCCCEE-EECCCCCCCHHHHHHHHHHHCCCCHHHHCCCCCCCCEEECCCCCCCCCCCCEEEEEEECCEEHHHHHHHHH T ss_conf 98099999-96366678878999999996886145417667876357416522231264002699875875999999999 Q ss_pred HHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHC---CHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4316876677246677622444223356543222100000---012333322211111245677632 Q gi|254780277|r 495 VLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKIS---DSQVQEHAIENSNVILQLWLLLRP 558 (976) Q Consensus 495 LLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~---~e~~~~Yaa~Da~~t~~L~~~l~~ 558 (976) -+-...+|+|+++|+.+||....++.-+. -..+..-. ...+.+||.+||.++++|.++|.- T Consensus 162 -~~~~l~sY~L~~Va~~~Lg~~K~d~~~~~--i~~~~~~~~~~~~~~~~Y~i~D~~L~~~L~~Kl~~ 225 (230) T cd05777 162 -RDYKLRSYSLNSVSAHFLGEQKEDVHYSI--ITDLQNGNPETRRRLAVYCLKDAYLPLRLLDKLMC 225 (230) T ss_pred -HHCCHHHCCHHHHHHHHHCCCCCCCCHHH--HHHHHHCCHHHHHHHHEECHHHHHHHHHHHHHHHH T ss_conf -75016336499999998389988799999--99999479646456006426877999999999877 No 108 >cd06137 DEDDh_RNase This group is composed of eukaryotic exoribonucleases that include PAN2, RNA exonuclease 1 (REX1 or Rex1p), REX3 (Rex3p), REX4 (or Rex4p), ISG20, and similar proteins. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. REX proteins are required for the processing and maturation of many RNA species, and ISG20 is an interferon-induced antiviral exonuclease with a strong preference for single-stranded RNA. Probab=97.47 E-value=0.00021 Score=50.69 Aligned_cols=141 Identities=11% Similarity=0.139 Sum_probs=76.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCC---CCCCCC-------CCC-----CCCCC-HHHHHHHH Q ss_conf 232332224331100124420102577665443455023137---443334-------565-----00178-89999744 Q gi|254780277|r 390 ASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDL---SVHTSQ-------DST-----GKNTL-TKEILSYL 453 (976) Q Consensus 390 ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~---~~~~~~-------d~~-----~~~~~-~~evl~~L 453 (976) +|+|+|..+..+-..++.-+++......... .+.+.-|. ..+... +.. +.... ...+...+ T Consensus 1 VAlDCEMv~~~~~~~el~rvs~Vd~~~g~vl---~d~~VkP~~~V~Dy~T~~SGIt~~~l~~a~~~~~~~~~~~~a~~~l 77 (161) T cd06137 1 VALDCEMVGLADGDSEVVRISAVDVLTGEVL---IDSLVRPSVRVTDWRTRFSGVTPADLEEAAKAGKTIFGWEAARAAL 77 (161) T ss_pred CEEEEEEECCCCCCCEEEEEEEEECCCCEEE---EEECCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCCCHHHHHHHH T ss_conf 9796452332299725899999972799199---9863359843230143325799999854342153333199999999 Q ss_pred HHHHCCCCCHHHHCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHC--C-CCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 44202221001100321237889861886664210023455431--6-87667724667762244422335654322210 Q gi|254780277|r 454 KKFFENEHFLKIGHNIKYDKLVLHRYGISMRGFEDIMLMSYVLD--S-GRSSHDIASIAQKWLSYTRKEILKSRKSSIPI 530 (976) Q Consensus 454 ~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~~i~DTmlaayLLd--p-~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~ 530 (976) ..++... ...|||++.+|+++|.. .-..+.||.+...... | ..++++|+.|+++||+.....- . T Consensus 78 ~~~i~~~-tIlVGH~L~nDL~aL~~---~H~~viDTa~l~~~~~~~~~~~~~~sLk~La~~~L~~~IQ~g----~----- 144 (161) T cd06137 78 WKFIDPD-TILVGHSLQNDLDALRM---IHTRVVDTAILTREAVKGPLAKRQWSLRTLCRDFLGLKIQGG----G----- 144 (161) T ss_pred HHHCCCC-CEEEECCHHHHHHHHCC---CCCCEEECEEECCHHCCCCCCCCCHHHHHHHHHHHCHHHCCC----C----- T ss_conf 9745899-67985101310878627---798354320332001157888888669999999828775289----9----- Q ss_pred HHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 00000123333222111112456 Q gi|254780277|r 531 DKISDSQVQEHAIENSNVILQLW 553 (976) Q Consensus 531 ~~i~~e~~~~Yaa~Da~~t~~L~ 553 (976) ...-..+||.++++|| T Consensus 145 -------~gHds~EDA~aam~L~ 160 (161) T cd06137 145 -------EGHDSLEDALAAREVV 160 (161) T ss_pred -------CCCCCHHHHHHHHHHH T ss_conf -------8849199999999985 No 109 >pfam05991 DUF901 Protein of unknown function (DUF901). This family consists of several hypothetical bacterial proteins as well as some uncharacterized sequences from Arabidopsis thaliana. The function of this family is unknown. Probab=97.40 E-value=0.00058 Score=47.50 Aligned_cols=108 Identities=22% Similarity=0.196 Sum_probs=66.7 Q ss_pred EEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCC-CC-HHHHCCHHHHCCCC Q ss_conf 99966269998874267544689885500699999999999998485547987999972748-87-13240865216679 Q gi|254780277|r 8 FLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPA-VT-FRNEIYPDYKANRP 85 (976) Q Consensus 8 ~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~-~t-fR~~~~~~YKa~R~ 85 (976) |||||+|+| |+.|.+........ +-..=..|...+..+.. +.--.++||||+.+ ++ .+++. T Consensus 1 LiVDGYNvI----~aw~~l~~l~~~~~---le~AR~~Li~~L~~y~~-~~~~~v~vVFDa~~~~~~~~~~~--------- 63 (166) T pfam05991 1 LLVDGYNVI----GAWPELKKLKDRGS---LEAARDKLIEILANYQA-FKGYRVIVVFDAHYVPGSEEEYE--------- 63 (166) T ss_pred CEECCHHHH----CCCHHHHHHCCCCC---HHHHHHHHHHHHHHHHC-CCCCEEEEEEECCCCCCCCCCEE--------- T ss_conf 927533561----73399996534799---99999999999998753-45967999985664899987434--------- Q ss_pred CCCHHHHHHHHHHHHHHHHCCCCE-EEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECC Q ss_conf 879758864899999999709979-9568901665899999999848994999868701212237 Q gi|254780277|r 86 KIPEMLLPQLPLVRLATQAFGIPA-IEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVS 149 (976) Q Consensus 86 ~~p~~l~~q~~~i~~~l~~~gi~~-~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~ 149 (976) + .-||.+ +..+|--||.+|=-++......+.+|+++|+|..--+.+- T Consensus 64 --------~---------~~~i~vvfT~~~etAD~~IEr~v~~~~~~~~~v~VvTSD~~~q~~v~ 111 (166) T pfam05991 64 --------K---------YGGIEVVFTKEGETADSYIEKLVAELNRRKKQVTVVTSDRAEQWTIF 111 (166) T ss_pred --------E---------CCCEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH T ss_conf --------5---------09889998889998789999999996168844999928889998951 No 110 >cd05784 DNA_polB_II_exo The 3'-5' exonuclease domain of Escherichia coli DNA polymerase II (PolII) and similar bacterial proteins. PolII is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain has a fundamental role in the proofreading activity of polII. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. PolII is involved in a variety of cellular activities, such as the repair of DNA damaged by UV irradiation or oxidation. It plays a pivotal role in replication-restart, a process that bypasses DNA damage in a Probab=97.39 E-value=0.00029 Score=49.69 Aligned_cols=156 Identities=15% Similarity=0.149 Sum_probs=79.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHC--CCCCHH Q ss_conf 685232332224331100124420102577665443455023137443334565001788999974444202--221001 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFE--NEHFLK 464 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle--~~~i~k 464 (976) .+.++||+||+.- .+|..|++.....+...++... . ....+..-.-..-.+.+..+..++. +|++. T Consensus 3 lki~s~DIE~~~~----~~i~~I~~~~~~~~~v~~~~~~------~-~~~~~~v~~~~sE~eLL~~F~~~i~~~dPDiI- 70 (193) T cd05784 3 LKVVSLDIETSMD----GELYSIGLYGEGQERVLMVGDP------E-DDAPDNIEWFADEKSLLLALIAWFAQYDPDII- 70 (193) T ss_pred CEEEEEEECCCCC----CCEEEEEEECCCCCEEEEEECC------C-CCCCCEEEEECCHHHHHHHHHHHHHHCCCCEE- T ss_conf 3099999632798----8888999986999989999789------8-88997799989999999999999986299999- Q ss_pred HHCCC-CHHHHHHHH----CCCCC------------------------CC--CCCHHHHHHHHCCCCCCCCHHHHHHHHC Q ss_conf 10032-123788986----18866------------------------64--2100234554316876677246677622 Q gi|254780277|r 465 IGHNI-KYDKLVLHR----YGISM------------------------RG--FEDIMLMSYVLDSGRSSHDIASIAQKWL 513 (976) Q Consensus 465 IgHN~-KfD~~~L~~----~GI~l------------------------~~--i~DTmlaayLLdp~~~~~~L~~la~~yL 513 (976) +|+|. .||+..|.+ +|+.+ .| +.|.|-+-.--.-...+|+|+.+|+.+| T Consensus 71 tG~N~~~FDiPYL~~R~~~~~i~~~~~r~~~~~~~~~~~~~~~~~~~i~Gr~~lD~~~~~~~~~~~l~sy~Ld~Va~~~L 150 (193) T cd05784 71 IGWNVINFDLRLLQRRAEAHGLPLRLGRGGSPLNWRQSGKPGQGFLSLPGRVVLDGIDALKTATYHFESFSLENVAQELL 150 (193) T ss_pred EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCEEEEEECCEEEECHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 97798777678999999985886433657885059876856269999788784172999976073511434888999983 Q ss_pred CCCCCCCCCCC--CCCCCCHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 44422335654--322210000001233332221111124567 Q gi|254780277|r 514 SYTRKEILKSR--KSSIPIDKISDSQVQEHAIENSNVILQLWL 554 (976) Q Consensus 514 ~~~~~e~~~~~--k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~ 554 (976) |....+..... ..-..+..-..+.+.+||.+||.++++|.+ T Consensus 151 Ge~K~~~~~~~~~~~i~~l~~~d~~k~~~Yni~D~~Lv~rLf~ 193 (193) T cd05784 151 GEGKLIHDVDDRGAEIERLFREDKLALARYNLQDCELVWRIFE 193 (193) T ss_pred CCCCCCCCCCCCHHHHHHHHHCCCHHHEEEHHHHHHHHHHHHC T ss_conf 8775445655558889998865903610320277999999749 No 111 >TIGR00573 dnaq exonuclease, DNA polymerase III, epsilon subunit family; InterPro: IPR006054 All proteins in this entry for which functions are known are components of the DNA polymerase III complex (epsilon subunit). There is, however, an outgroup that includes paralogs in some gamma-proteobacteria and the N-terminal region of DinG from some low GC Gram-positive bacteria. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=97.38 E-value=0.00032 Score=49.34 Aligned_cols=155 Identities=15% Similarity=0.125 Sum_probs=91.5 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCC---------CCCCC-CCHHHHHHHHH Q ss_conf 866852323322243311001244201025776654434550231374433345---------65001-78899997444 Q gi|254780277|r 385 ETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQD---------STGKN-TLTKEILSYLK 454 (976) Q Consensus 385 ~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d---------~~~~~-~~~~evl~~L~ 454 (976) ....+.++|+||||+....++++-+.-...-+...+.-....+-.|- -....+ ....+ =...||.+.+. T Consensus 5 ~~~t~t~~d~e~tgl~~~~~~II~IGAv~~i~rr~~~~~~~~~~~P~-R~~~~~~~k~hGItDd~L~dkP~F~eia~dF~ 83 (228) T TIGR00573 5 VLDTFTTGDNETTGLYAKHDEIIEIGAVEIINRRIIGNKFHTYIKPD-RLIDPDAIKIHGITDDMLKDKPDFKEIAEDFA 83 (228) T ss_pred EEEEEEEEECCCCCCCCCCCCEEEEEEEEEECCCEEECCEEEEECCC-CCCCCCEEEECCCCHHHHCCCCCHHHHHHHHH T ss_conf 65426875211067632243045453376644614401005776188-88896425524888288468985137889999 Q ss_pred HHHCCCCCHHHHCCCCHHHHHHH----HCCCCC----CCCCCHHHHHHHHCCC--CCCCCHHHHHHHHCCCCCCCCCCCC Q ss_conf 42022210011003212378898----618866----6421002345543168--7667724667762244422335654 Q gi|254780277|r 455 KFFENEHFLKIGHNIKYDKLVLH----RYGISM----RGFEDIMLMSYVLDSG--RSSHDIASIAQKWLSYTRKEILKSR 524 (976) Q Consensus 455 ~lle~~~i~kIgHN~KfD~~~L~----~~GI~l----~~i~DTmlaayLLdp~--~~~~~L~~la~~yL~~~~~e~~~~~ 524 (976) .|+.+.-+ |+||+.||+-||. +.|... ..+.||.=+...+-|. ...++|+.||.+|- .+ T Consensus 84 ~~I~g~~L--v~HNA~FD~GFl~~e~~~LG~~~~~~~~~v~~~~~~~~~~~~~~p~~~~~Ld~L~~~~~-~~-------- 152 (228) T TIGR00573 84 DYIKGAVL--VIHNASFDVGFLNYEFSKLGLKKEPKTNDVIDTTDTLQAARPEFPGKRNTLDALALRYE-IT-------- 152 (228) T ss_pred HHHCCCEE--EEEHHHCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCC-CC-------- T ss_conf 98379478--75200305778999998514501100114444999999997205884302777787805-44-------- Q ss_pred CCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 322210000001233332221111124567763222 Q gi|254780277|r 525 KSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPKL 560 (976) Q Consensus 525 k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L 560 (976) ....+..-|..||.++.++|..+...- T Consensus 153 ---------~s~R~~H~A~~DA~~l~~~~~~~~~~~ 179 (228) T TIGR00573 153 ---------NSHRALHGALLDAFILAKLYLVMTGKQ 179 (228) T ss_pred ---------HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ---------037777317788999999999998612 No 112 >PRK13766 Hef nuclease; Provisional Probab=97.28 E-value=0.00033 Score=49.30 Aligned_cols=74 Identities=9% Similarity=0.013 Sum_probs=33.4 Q ss_pred EEEEEECC-CCCHHHHCCHH----HHCCC----CCCCHHHHHHHHHHHHHHHHCCCC---EEEECCEEHHHHHHHHHHHH Q ss_conf 99997274-88713240865----21667----987975886489999999970997---99568901665899999999 Q gi|254780277|r 61 FAVIFDYP-AVTFRNEIYPD----YKANR----PKIPEMLLPQLPLVRLATQAFGIP---AIEIQGFEADDIIATYTYIA 128 (976) Q Consensus 61 ~~v~fD~~-~~tfR~~~~~~----YKa~R----~~~p~~l~~q~~~i~~~l~~~gi~---~~~~~~~EADDviatla~~~ 128 (976) .+||.+.+ |+||=--+--. ++.++ .|+-+=-.+|...+++++ ++| +...-| +.-...-...+ T Consensus 32 tiVvLPTG~GKT~IA~lvi~~~l~~~~gKilFLaPT~pLV~Qq~~~~~~~l---~i~~~~i~~ltG---~~~~~~r~~~w 105 (764) T PRK13766 32 TLVVLPTGLGKTAIALLVIAERLQKYGGKVLILAPTKPLVEQHAEFFRKFL---NIDPEKIVVLTG---EISPEKRAALW 105 (764) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHHC---CCCCCEEEEEEC---CCCHHHHHHHH T ss_conf 899959986689999999999997489889998588889999999999970---999552899988---87827689986 Q ss_pred HHCCCCEEEECCCC Q ss_conf 84899499986870 Q gi|254780277|r 129 EKEGFAVTIISTDK 142 (976) Q Consensus 129 ~~~~~~v~i~s~Dk 142 (976) ...+|+++|+.- T Consensus 106 --~~~~Viv~TPQv 117 (764) T PRK13766 106 --EKAKVIVATPQV 117 (764) T ss_pred --CCCCEEEECCHH T ss_conf --079999999089 No 113 >cd06149 ISG20 Interferon (IFN) Stimulated Gene product of 20 kDa (ISG20) is an IFN-induced antiviral exonuclease with a strong preference for single-stranded RNA and minor activity towards single-stranded DNA. It was also independently identified by its response to estrogen and was called HEM45 (human estrogen regulated transcript). ISG20 is a DEDDh-type DnaQ-like 3'-5' exonuclease containing three conserved sequence motifs termed ExoI, ExoII and ExoIII with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ISG20 may be a major effector of innate immunity against pathogens including viruses, bacteria, and parasites. It is located in promyelocytic leukemia (PML) nuclear bodies, sites for oncogenic DNA viral transcription and replication. It may carry out its function by degrading viral RNAs as part of the IFN-regulated antiviral r Probab=97.28 E-value=0.00044 Score=48.37 Aligned_cols=90 Identities=13% Similarity=0.134 Sum_probs=59.5 Q ss_pred CCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCCCCCC--CCCCHHHHHHHH----CCCCCCCCHHHHHHHHCCCCC Q ss_conf 78899997444420222100110032123788986188666--421002345543----168766772466776224442 Q gi|254780277|r 444 TLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYGISMR--GFEDIMLMSYVL----DSGRSSHDIASIAQKWLSYTR 517 (976) Q Consensus 444 ~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~--~i~DTmlaayLL----dp~~~~~~L~~la~~yL~~~~ 517 (976) ....++...+..++.+. ..|||++.+|+++|. +... .+.||-....+. -|..++.+|+.|+++||+... T Consensus 62 ~~~~~~~~~l~~~l~~~--IlVGH~L~nDL~aL~---l~h~~~~i~DT~~~~~~~~~~~~p~~~~~sLk~L~~~~L~~~I 136 (157) T cd06149 62 TPFAVAQKEILKILKGK--VVVGHAIHNDFKALK---YFHPKHMTRDTSTIPLLNRKAGFPENCRVSLKVLAKRLLHRDI 136 (157) T ss_pred CCHHHHHHHHHHHHCCC--EEEEECCHHHHHHHH---CCCCCCCEEECCCCEEHHHHCCCCCCCCHHHHHHHHHHHCCCC T ss_conf 84999999999971698--899826076698860---3389872677711100133248987668759999999848502 Q ss_pred CCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 2335654322210000001233332221111124567 Q gi|254780277|r 518 KEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWL 554 (976) Q Consensus 518 ~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~ 554 (976) ... . ...-+.+||.++++||+ T Consensus 137 Q~~----~------------~gHds~EDA~aa~~L~k 157 (157) T cd06149 137 QVG----R------------QGHSSVEDARATMELYK 157 (157) T ss_pred CCC----C------------CCCCHHHHHHHHHHHHC T ss_conf 599----9------------88080999999999859 No 114 >cd05782 DNA_polB_like1_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=97.17 E-value=0.00087 Score=46.22 Aligned_cols=141 Identities=18% Similarity=0.192 Sum_probs=81.3 Q ss_pred CCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCC-CHHH Q ss_conf 222433110012442010257766544345502313744333456500178899997444420222100110032-1237 Q gi|254780277|r 395 ITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNI-KYDK 473 (976) Q Consensus 395 ETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~-KfD~ 473 (976) .++.+...-.++|.||+...+..+. .+-+... ....-.+++..|-..++...-..|+||. .||+ T Consensus 42 g~~flp~~f~rIV~Is~~~~~~~~~--------~~~~~~~-------~~~~E~~ll~~F~~~i~~~~p~lv~~Ng~~FD~ 106 (208) T cd05782 42 GSDFLPLPFHKVVSISALYRDDDGG--------FLKVRTL-------DGADEKELLEDFFQLIEKKNPRLVSFNGRGFDL 106 (208) T ss_pred CCCCCHHHHCCEEEEEEEEEECCCC--------EEEEEEC-------CCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCH T ss_conf 8975446638627999999955897--------7998417-------999999999999999986599799636875758 Q ss_pred HHHH----HCCCCCCCCC------------------CHHHHHHHHCCC-CCCCCHHHHHHHHCCCC-CCCCCCCCCCCCC Q ss_conf 8898----6188666421------------------002345543168-76677246677622444-2233565432221 Q gi|254780277|r 474 LVLH----RYGISMRGFE------------------DIMLMSYVLDSG-RSSHDIASIAQKWLSYT-RKEILKSRKSSIP 529 (976) Q Consensus 474 ~~L~----~~GI~l~~i~------------------DTmlaayLLdp~-~~~~~L~~la~~yL~~~-~~e~~~~~k~~~~ 529 (976) .+|. .|||.+..++ |||-. +-+-+ +.+.+|+.+|.- +|.. ..++.|+.-.. - T Consensus 107 P~l~~Ral~~gi~~p~~~~~~~~~~~y~~r~~~~H~Dl~d~--l~~~~~~~~~sLd~la~l-lG~pgK~~~~Gs~V~~-~ 182 (208) T cd05782 107 PVLHLRALIHGVSAPAYFDLGNKDWNYRNRYSERHLDLMDL--LAFYGARARASLDLLAKL-LGIPGKMDVDGSQVWE-L 182 (208) T ss_pred HHHHHHHHHCCCCCCHHHHCCCCCCCCCCCCCCCCCCHHHH--HHCCCCCCCCCHHHHHHH-HCCCCCCCCCHHHHHH-H T ss_conf 99999999919897555404887411248887631329999--855487577889999998-3899757888899999-9 Q ss_pred CHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 0000001233332221111124567 Q gi|254780277|r 530 IDKISDSQVQEHAIENSNVILQLWL 554 (976) Q Consensus 530 ~~~i~~e~~~~Yaa~Da~~t~~L~~ 554 (976) +..-..+....||..||..|+.||. T Consensus 183 ~~~g~l~~I~~YCe~DVl~T~~v~l 207 (208) T cd05782 183 YAEGKLDEIAEYCETDVLNTYLLYL 207 (208) T ss_pred HHCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 9819988999999999999999980 No 115 >cd06143 PAN2_exo PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. PAN catalyzes the deadenylation of poly(A) tails, which are initially synthesized to default lengths of 70 to 90, to mRNA-specific lengths of 55 to 71. Pab1p and PAN also play a role in the export and decay of mRNA. PAN2 contains a DEDDh-type DnaQ-like 3'-5' exonuclease domain with three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=97.13 E-value=0.002 Score=43.54 Aligned_cols=87 Identities=16% Similarity=0.169 Sum_probs=60.1 Q ss_pred HHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCC Q ss_conf 89999744442022210011003212378898618866642100234554316876677246677622444223356543 Q gi|254780277|r 446 TKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRK 525 (976) Q Consensus 446 ~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k 525 (976) .+.+...++.++.... ..|||-++.|+++|.-. .+-..+.||...-+ .|..+..+|+.|++.+|+....+- T Consensus 88 ~~~~~~~l~~li~~~~-ILVGHsL~nDL~aL~l~-hP~~~i~DTs~l~~--~~~~r~~sLk~La~~~Lg~~IQ~g----- 158 (174) T cd06143 88 LKSAYLKLRLLVDLGC-IFVGHGLAKDFRVINIQ-VPKEQVIDTVELFH--LPGQRKLSLRFLAWYLLGEKIQSE----- 158 (174) T ss_pred HHHHHHHHHHHCCCCC-EEECCCCHHHHHHHCCC-CCCCCEEECHHHCC--CCCCCCCCHHHHHHHHCCCCCCCC----- T ss_conf 9999999998547996-89789636679885066-99866577788526--899887089999999808104698----- Q ss_pred CCCCCHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 22210000001233332221111124567 Q gi|254780277|r 526 SSIPIDKISDSQVQEHAIENSNVILQLWL 554 (976) Q Consensus 526 ~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~ 554 (976) ..-..+||.++++||. T Consensus 159 -------------~HdSvEDArAam~LYr 174 (174) T cd06143 159 -------------THDSIEDARTALKLYR 174 (174) T ss_pred -------------CCCCHHHHHHHHHHHC T ss_conf -------------9593999999999629 No 116 >cd06145 REX1_like This subfamily is composed of RNA exonuclease 1 (REX1 or Rex1p), REX3 (or Rex3p), and similar eukaryotic proteins. In yeast, REX1 and REX3 are required for 5S rRNA and MRP (mitochondrial RNA processing) RNA maturation, respectively. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. REX1 is the major exonuclease responsible for pre-tRNA trail trimming and may also be involved in nuclear CCA turnover. REX proteins function in the processing and maturation of many RNA species, similar to the function of E.coli RNase T. Probab=96.84 E-value=0.0026 Score=42.70 Aligned_cols=90 Identities=13% Similarity=0.177 Sum_probs=58.4 Q ss_pred CHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCC Q ss_conf 88999974444202221001100321237889861886664210023455431687667724667762244422335654 Q gi|254780277|r 445 LTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSR 524 (976) Q Consensus 445 ~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~ 524 (976) ..+++.+.+..++.... ..|||.+..|+++|. +.-..+.||.+.-..-..-..+.+|+.|++.+|+....... T Consensus 61 ~~~~~~~~l~~li~~~~-IlVGH~L~nDL~aL~---i~H~~v~DTa~l~~~~~~~~~~~sLk~L~~~~L~~~IQ~~~--- 133 (150) T cd06145 61 TLEDVQKKLLSLISPDT-ILVGHSLENDLKALK---LIHPRVIDTAILFPHPRGPPYKPSLKNLAKKYLGRDIQQGE--- 133 (150) T ss_pred CHHHHHHHHHHHCCCCC-EEEECCCCCCHHHHH---CCCCCEEEHHHHHHHCCCCCCCHHHHHHHHHHHCHHHCCCC--- T ss_conf 89999999998759998-899626434777761---55876157166211037999897899999998287743999--- Q ss_pred CCCCCCHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 322210000001233332221111124567 Q gi|254780277|r 525 KSSIPIDKISDSQVQEHAIENSNVILQLWL 554 (976) Q Consensus 525 k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~ 554 (976) ...-+.+||.++.+|+. T Consensus 134 -------------~gHdsvEDA~aam~L~k 150 (150) T cd06145 134 -------------GGHDSVEDARAALELVK 150 (150) T ss_pred -------------CCCCCHHHHHHHHHHHC T ss_conf -------------88183999999999759 No 117 >cd05783 DNA_polB_B1_exo The 3'-5' exonuclease domain of Sulfolobus solfataricus DNA polymerase B1 and similar archaeal proteins. B1 is a family-B DNA polymerase. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. B1displays thermostable polymerase and 3'-5' exonuclease activities. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are clustered around the active site and are involved in metal binding and catalysis. The exonuclease domain of family-B polymerases also contains a beta hairpin structure that plays an important role in active site switching in the event of nucleotide misincorporation. Family-B DNA polymerases from thermophilic archaea are unique in that they are able to recognize the presence of uracil in the template strand, leading to the stalling of DNA synthes Probab=96.80 E-value=0.0019 Score=43.66 Aligned_cols=101 Identities=19% Similarity=0.252 Sum_probs=61.8 Q ss_pred HHHHHHHHHHHHCCCCCHHHHCCC-CHHHHHHHH----CCCCCC---------------CC-CC-------HHHHHHHHC Q ss_conf 899997444420222100110032-123788986----188666---------------42-10-------023455431 Q gi|254780277|r 446 TKEILSYLKKFFENEHFLKIGHNI-KYDKLVLHR----YGISMR---------------GF-ED-------IMLMSYVLD 497 (976) Q Consensus 446 ~~evl~~L~~lle~~~i~kIgHN~-KfD~~~L~~----~GI~l~---------------~i-~D-------TmlaayLLd 497 (976) -.+.+..+-.++.+.. ..+|+|. .||+..|.+ .|+.-. ++ .| +++..|... T Consensus 73 E~~LL~~F~~~i~~~p-~Ii~wN~~~FDiPYL~~R~~~Lg~~~~~~~~~~~~~~~~~~~~~~~dl~~~~~~~~~~~y~~~ 151 (204) T cd05783 73 EKELIREAFKIISEYP-IVLTFNGDNFDLPYLYNRALKLGIPKEEIPIYLKRDYATLKHGIHIDLYKFFSNRAIQVYAFG 151 (204) T ss_pred HHHHHHHHHHHHHCCC-EEEECCCCCCCHHHHHHHHHHHCCCCEEEEEEECCEEEEEEEEEEEEEHHHCCCCHHHHHHCC T ss_conf 9999999999974399-799817867788999999998599820456673436999934499986463573135441202 Q ss_pred CCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 68766772466776224442233565432221000000123333222111112456 Q gi|254780277|r 498 SGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLW 553 (976) Q Consensus 498 p~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~ 553 (976) ....+|.|+.+|+.+||...-+..+ .+...+......|+.+||.++.+|- T Consensus 152 ~k~~sY~Ld~Va~~~LGe~K~d~~~------~~~~~d~~k~i~YNi~D~~Lv~~L~ 201 (204) T cd05783 152 NKYREYTLDAVAKALLGEGKVELEK------NISELNLYELAEYNYRDAELTLELT 201 (204) T ss_pred CCCCCCCHHHHHHHHHCCCCCCCCC------CCCCCCHHHHHEEEHHHHHHHHHHH T ss_conf 4212252889999983888600676------4442586785514267767777862 No 118 >cd06135 Orn Oligoribonuclease (Orn) is a DEDDh-type DnaQ-like 3'-5' exoribonuclease that is responsible for degrading small oligoribonucleotides to mononucleotides. It contains three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Orn is essential for E.coli survival. The human homolog, also called Sfn (small fragment nuclease), is able to hydrolyze short single-stranded RNA and DNA oligomers. It plays a role in cellular nucleotide recycling. Probab=96.66 E-value=0.003 Score=42.31 Aligned_cols=26 Identities=15% Similarity=0.128 Sum_probs=21.0 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCC Q ss_conf 32332224331100124420102577 Q gi|254780277|r 391 SFKIITDTMDAFHSKPIAIAISTLDQ 416 (976) Q Consensus 391 a~DtETt~ld~~~~~ivGisls~~~~ 416 (976) =+|+|||||||..+.|+-|+.-.-++ T Consensus 3 WiDlEmTGLd~~~d~IiEIA~ivTD~ 28 (173) T cd06135 3 WIDLEMTGLDPEKDRILEIACIITDG 28 (173) T ss_pred EEECCCCCCCCCCCEEEEEEEEEECC T ss_conf 99330368888886489999999889 No 119 >pfam00940 RNA_pol DNA-dependent RNA polymerase. This is a family of single chain RNA polymerases. Probab=96.61 E-value=0.023 Score=35.78 Aligned_cols=145 Identities=14% Similarity=0.211 Sum_probs=79.8 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHCC-------CC------HHH----HHHHHHHHHHHCHHH---HHHHHHHHHHHHHCC Q ss_conf 8630251356440777035678819-------99------899----999999999988889---999999999999849 Q gi|254780277|r 798 RRHAKTINFSIIYGISPFRLATQLR-------IP------RSE----AADYIKRYFHRFPGI---HEYIEKTKNFVRQNG 857 (976) Q Consensus 798 R~~aK~infgi~YG~~~~~La~~l~-------is------~~e----A~~~i~~yf~~~p~v---~~~~~~~~~~a~~~g 857 (976) |...|....-+.||.+.+|.++++. .. .+. |+.+.+...+.||+. .+|++.+.+.+.+.| T Consensus 156 Rk~vKr~vMT~~Yg~t~~g~~~qi~~~l~~~~~~~~~~~~~~~~~yla~~i~~si~~v~~~a~~~m~wL~~~a~~~~~~~ 235 (391) T pfam00940 156 RKVVKQTVMTLVYGVTFYGAREQIKDQLKEKGTDESKVLSYKAANYLAKIIFEAIKEVFPGAREIMDWLNECAKLIAKLN 235 (391) T ss_pred HHHHHCCEEEEECCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 86640321664226662009999999998436873145678999999999999999985749999999999999998289 Q ss_pred C-E--ECCCC---------------------CEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8-8--82778---------------------7601898758837888778787701345653899999999999999986 Q gi|254780277|r 858 Y-V--ETIFG---------------------RRIHYDEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLED 913 (976) Q Consensus 858 y-v--~T~~G---------------------Rrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~ 913 (976) - | .|+.| +.+......+.....|...--.+-|..=+==|+-++..+. ...+ T Consensus 236 ~pv~W~TP~Gf~V~Q~Y~k~~~~~V~t~l~~~~~~~~~~~~~~~d~~Kq~~a~~PNfIHSlDAshl~~t~~-----~~~~ 310 (391) T pfam00940 236 IPVVWTTPLGLPVVQPYRKSKKKRVKTSLQGVSRIKVITLTPVSDKRKQKSAFPPNFIHSLDASHLMLTAL-----ACIK 310 (391) T ss_pred CCEEEECCCCCEEEEEECCCCCEEEEEEECCEEEECCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHH-----HHHH T ss_conf 96599999999898641688757999861566886135666656756542367974133344899999999-----9985 Q ss_pred CCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCC Q ss_conf 599818999726389995598799999999999965101 Q gi|254780277|r 914 HKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKAC 952 (976) Q Consensus 914 ~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~ 952 (976) .|+. +..|||... ++...++.+..++++.+-.-+ T Consensus 311 ~gi~---f~~IHDsf~--tha~d~~~l~~~iRe~Fv~ly 344 (391) T pfam00940 311 AGIP---FASVHDSFW--THANDVDKMNKLLREQFVELY 344 (391) T ss_pred CCCC---EEEEEECCC--CCCCCHHHHHHHHHHHHHHHH T ss_conf 6998---788710788--772149999999999999996 No 120 >COG3359 Predicted exonuclease [DNA replication, recombination, and repair] Probab=96.33 E-value=0.018 Score=36.62 Aligned_cols=157 Identities=15% Similarity=0.114 Sum_probs=82.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCC-HHHH Q ss_conf 852323322243311001244201025776654434550231374433345650017889999744442022210-0110 Q gi|254780277|r 388 GSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHF-LKIG 466 (976) Q Consensus 388 ~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i-~kIg 466 (976) ..+-||+||||++...+.+..++.++.... ..++--......+ .+ ...+..++.++.. ..|. T Consensus 99 ~~~FFDiETTGL~~ag~~I~~~g~a~~~~~--------~~~Vrq~~lp~p~--------~E-~avle~fl~~~~~~~lvs 161 (278) T COG3359 99 DVAFFDIETTGLDRAGNTITLVGGARGVDD--------TMHVRQHFLPAPE--------EE-VAVLENFLHDPDFNMLVS 161 (278) T ss_pred CEEEEEEECCCCCCCCCEEEEEEEEECCCC--------EEEEEEECCCCCC--------HH-HHHHHHHHCCCCCCEEEE T ss_conf 258996331466888985999988873575--------6999751279841--------35-689999753898455997 Q ss_pred CCC-CHHHHHHH---HCCCCC--CC-CCCHHHHHHHHCCCC-CCCCHHHHHHHHCCCCCCC-CCCCCCCCC------CCH Q ss_conf 032-12378898---618866--64-210023455431687-6677246677622444223-356543222------100 Q gi|254780277|r 467 HNI-KYDKLVLH---RYGISM--RG-FEDIMLMSYVLDSGR-SSHDIASIAQKWLSYTRKE-ILKSRKSSI------PID 531 (976) Q Consensus 467 HN~-KfD~~~L~---~~GI~l--~~-i~DTmlaayLLdp~~-~~~~L~~la~~yL~~~~~e-~~~~~k~~~------~~~ 531 (976) +|- .||..+++ ++.+++ .. -+|.|-.+.=+.-.. -+-+|+.+-+ +||...++ ..|...-.. .-+ T Consensus 162 fNGkaFD~PfikR~v~~~~el~l~~~H~DL~h~~RRlwk~~l~~c~Lk~VEr-~LGi~R~edtdG~~~p~lyr~~~~~~d 240 (278) T COG3359 162 FNGKAFDIPFIKRMVRDRLELSLEFGHFDLYHPSRRLWKHLLPRCGLKTVER-ILGIRREEDTDGYDGPELYRLYRRYGD 240 (278) T ss_pred ECCCCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHH-HHCCCCCCCCCCCCHHHHHHHHHHCCC T ss_conf 4683267089999875164447666631221034453120287888136788-857550115898501999999987098 Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0000123333222111112456776322210 Q gi|254780277|r 532 KISDSQVQEHAIENSNVILQLWLLLRPKLIV 562 (976) Q Consensus 532 ~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~ 562 (976) .-....+..|.-+|+.-...|+..+++.+.+ T Consensus 241 p~ll~~l~~hN~eDvlnL~~i~~h~~~~i~~ 271 (278) T COG3359 241 PGLLDGLVLHNREDVLNLPTIIKHVSKKILE 271 (278) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH T ss_conf 7777898870699887599999999999998 No 121 >KOG1798 consensus Probab=96.30 E-value=0.0049 Score=40.70 Aligned_cols=36 Identities=17% Similarity=0.283 Sum_probs=14.6 Q ss_pred CEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCC Q ss_conf 353899998884898878777787518743589987 Q gi|254780277|r 161 EKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGI 196 (976) Q Consensus 161 ~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv 196 (976) .++|.....+++|-.-..++..-|..---|.-+|-| T Consensus 1100 ~IiDWdYY~eRLgSaIQKiItIPAAlQ~VsNPVpRv 1135 (2173) T KOG1798 1100 AIIDWDYYRERLGSAIQKIITIPAALQGVSNPVPRV 1135 (2173) T ss_pred EEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 043059999998778888710187773267888777 No 122 >KOG0969 consensus Probab=96.28 E-value=0.033 Score=34.67 Aligned_cols=118 Identities=19% Similarity=0.309 Sum_probs=69.5 Q ss_pred HHHHHHHHHHHC--CCCCHHHHCCC-CHHHHHHHH----CCCC-------CCC----CCCHHHHHHHH------------ Q ss_conf 999974444202--22100110032-123788986----1886-------664----21002345543------------ Q gi|254780277|r 447 KEILSYLKKFFE--NEHFLKIGHNI-KYDKLVLHR----YGIS-------MRG----FEDIMLMSYVL------------ 496 (976) Q Consensus 447 ~evl~~L~~lle--~~~i~kIgHN~-KfD~~~L~~----~GI~-------l~~----i~DTmlaayLL------------ 496 (976) +++++.-+.++- ||++. +|+|+ .||+..+.+ .||. +++ +-||+.-+.-. T Consensus 339 ~elL~~W~~firevDPDvI-~GYNi~nFDiPYll~RA~tL~Ie~Fp~LGRikn~~s~irDttfSSkq~GtRetK~v~I~G 417 (1066) T KOG0969 339 KELLESWRKFIREVDPDVI-IGYNICNFDIPYLLNRAKTLGIENFPYLGRIKNSRSVIRDSTFSSKQYGTRETKEVNIDG 417 (1066) T ss_pred HHHHHHHHHHHHHCCCCEE-ECCCCCCCCCCEECCHHHHCCCCCCCCCCEECCCCEEEECCCCCHHHCCCCCCEEEEECC T ss_conf 8999999999986298867-133544556321307676468066650100035412441032123213751013886656 Q ss_pred ------------CCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCC-CCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf ------------16876677246677622444223356543222-10000001233332221111124567763222100 Q gi|254780277|r 497 ------------DSGRSSHDIASIAQKWLSYTRKEILKSRKSSI-PIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVE 563 (976) Q Consensus 497 ------------dp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~-~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~ 563 (976) |=..++|+|.+++.++|+...+|+--+--..+ +-+...--..+-||..||.+.+||.+.| T Consensus 418 RlqfDllqvi~RdyKLrSytLNaVs~hFL~EQKEDV~~siItdLQng~~~TRRRlA~YCLKDAYLPlRLlekL------- 490 (1066) T KOG0969 418 RLQFDLLQVILRDYKLRSYTLNAVSAHFLGEQKEDVHHSIITDLQNGNEQTRRRLAVYCLKDAYLPLRLLEKL------- 490 (1066) T ss_pred EEEEHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHH------- T ss_conf 5652279998776433111011457876553201565210043314767888899988764410229999999------- Q ss_pred HHHHHHHHHCC Q ss_conf 01345788607 Q gi|254780277|r 564 KLLHVYERLDK 574 (976) Q Consensus 564 ~l~~l~~~iE~ 574 (976) .-++.-+|| T Consensus 491 --M~ivNy~EM 499 (1066) T KOG0969 491 --MVIVNYTEM 499 (1066) T ss_pred --HHHHHHHHH T ss_conf --999867865 No 123 >cd05776 DNA_polB_alpha_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase alpha. DNA polymerase alpha is a family-B DNA polymerase with a catalytic subunit that contains a DnaQ-like 3'-5' exonuclease domain. It is one of the three DNA-dependent type B DNA polymerases (delta and epsilon are the other two) that have been identified as essential for nuclear DNA replication in eukaryotes. DNA polymerase alpha is almost exclusively required for the initiation of DNA replication and the priming of Okazaki fragments during elongation. It associates with DNA primase and is the only enzyme able to start DNA synthesis de novo. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. This explains why in most organisms, that no specific repair r Probab=96.22 E-value=0.0089 Score=38.84 Aligned_cols=109 Identities=24% Similarity=0.317 Sum_probs=62.1 Q ss_pred HHHHHHHHHHH--CCCCCHHHHCCC-CHHHHHHHH----CCCC---------------------------CCC--CCCHH Q ss_conf 99997444420--222100110032-123788986----1886---------------------------664--21002 Q gi|254780277|r 447 KEILSYLKKFF--ENEHFLKIGHNI-KYDKLVLHR----YGIS---------------------------MRG--FEDIM 490 (976) Q Consensus 447 ~evl~~L~~ll--e~~~i~kIgHN~-KfD~~~L~~----~GI~---------------------------l~~--i~DTm 490 (976) .+.+..|..++ .||++. +|||+ .||+.+|.. +++. +.| +.|+. T Consensus 83 r~LL~~f~~~i~~~DPDii-~Ghni~~fdl~~L~~R~~~lki~~wsriGRlkr~~~p~~~~~~~~~~~~~~~GRl~~D~~ 161 (234) T cd05776 83 RALLNFFLAKLQKIDPDVL-VGHDLEGFDLDVLLSRIQELKVPHWSRIGRLKRSVWPKKKGGGKFGERELTAGRLLCDTY 161 (234) T ss_pred HHHHHHHHHHHHHCCCCEE-EEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCEEEEECH T ss_conf 9999999999985099899-850637775999999999809985331355001437644565653442035235887609 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3455431687667724667762244422335654322210000001233332221111124567763 Q gi|254780277|r 491 LMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 491 laayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) ..+.=+. ...+|+|..+++.+|+....++.-..-.+...+.-....+..|+..||..+++|..+|. T Consensus 162 ~~~ke~~-k~~sy~L~~v~~~~L~~~k~dv~~~~i~~~~~~~~~l~~l~~~~~~Da~l~~~L~~~L~ 227 (234) T cd05776 162 LSAKELI-RCKSYDLTELSQQVLGIERQDIDPEEILNMYNDSESLLKLLEHTEKDAYLILQLMFKLN 227 (234) T ss_pred HHHHHHH-CCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999874-66778889999999686766799999999986939999999999999999999999958 No 124 >pfam00867 XPG_I XPG I-region. Probab=96.10 E-value=0.029 Score=35.15 Aligned_cols=79 Identities=29% Similarity=0.484 Sum_probs=61.2 Q ss_pred HCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEE-EECCCC------CEECHHHHHHHHCCCH Q ss_conf 709979956890166589999999984899499986870121223797599-966788------3538999988848988 Q gi|254780277|r 104 AFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCL-YDTVKE------EKIDIEHVIKKWGIPP 176 (976) Q Consensus 104 ~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~-~~~~~~------~~~~~~~v~~~~gv~p 176 (976) .||||.+..| ||||-..|.|++. |.-=.++|.|-|++=.=.+.+.. +...++ ..++...+.+..|++. T Consensus 2 ~~Gi~yi~AP-~EAeaqca~L~~~----g~vd~v~t~DsD~l~~G~~~vi~~~~~~~~~~~~~~~~~~~~~il~~l~lt~ 76 (89) T pfam00867 2 LMGIPYVVAP-YEAEAQCAYLVKT----GLVDAVISEDSDLLLFGAPRVIRNLSDSKAKKKVPVEEIDLDKVLEELGLTR 76 (89) T ss_pred CCCCEEEECC-CCHHHHHHHHHHC----CCCCEEECCCHHEEECCCCEEEEEECCCCCCCEEEEEEEEHHHHHHHCCCCH T ss_conf 0696179879-4099999999977----9920996181331204898999850477871121369989899575629999 Q ss_pred HHHHHHHHHCC Q ss_conf 78777787518 Q gi|254780277|r 177 EKMVCLQALTG 187 (976) Q Consensus 177 ~~~~d~~al~G 187 (976) +|++|+--|.| T Consensus 77 ~~f~~~ciL~G 87 (89) T pfam00867 77 EQFIDLCILLG 87 (89) T ss_pred HHHHHHHEEEC T ss_conf 99999765606 No 125 >pfam09281 Taq-exonuc Taq polymerase, exonuclease. Members of this family are found in prokaryotic Taq DNA polymerase, where they assume a ribonuclease H-like motif. The domain confers 5'-3' exonuclease activity to the polymerase. Probab=95.66 E-value=0.015 Score=37.24 Aligned_cols=68 Identities=21% Similarity=0.316 Sum_probs=51.2 Q ss_pred CHHHHHH-HHCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHH Q ss_conf 1237889-861886664210023455431687667724667762244422335654322210000001233332221111 Q gi|254780277|r 470 KYDKLVL-HRYGISMRGFEDIMLMSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNV 548 (976) Q Consensus 470 KfD~~~L-~~~GI~l~~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~ 548 (976) ..|+.++ .+.|+.+.+-.|.|+.+||+||... +...++.||++....+ -++..+.+ T Consensus 70 AKdLAvla~~eG~~v~PGDDPmLlAYLlDPsNT--~p~~varRY~ggeW~e---------------------Da~~RA~~ 126 (138) T pfam09281 70 AKALAAHLSVRGTVVEPGDDPLLVAYLLDPANT--NMPGVARRYLGGEWPE---------------------DAATRAAI 126 (138) T ss_pred HHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCC--CCHHHHHHHCCCCCCC---------------------CHHHHHHH T ss_conf 999999987026656999860278765085678--8215667623888875---------------------67888999 Q ss_pred HHHHHHHHHHHH Q ss_conf 124567763222 Q gi|254780277|r 549 ILQLWLLLRPKL 560 (976) Q Consensus 549 t~~L~~~l~~~L 560 (976) +.+|++.|.++| T Consensus 127 t~rL~~~L~~rl 138 (138) T pfam09281 127 TERLLRNLPPRL 138 (138) T ss_pred HHHHHHHCCCCC T ss_conf 999998510019 No 126 >KOG2249 consensus Probab=95.56 E-value=0.034 Score=34.59 Aligned_cols=148 Identities=9% Similarity=0.107 Sum_probs=84.1 Q ss_pred CCCCCCCCCCCCCCC--CCCCCCCEEECCCCCCCCCCCCCCEEC-CCC---------CCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 852323322243311--001244201025776654434550231-374---------43334565001788999974444 Q gi|254780277|r 388 GSASFKIITDTMDAF--HSKPIAIAISTLDQKDDTSFKVETIFI-DLS---------VHTSQDSTGKNTLTKEILSYLKK 455 (976) Q Consensus 388 ~~ia~DtETt~ld~~--~~~ivGisls~~~~~~~~~~~~~~~yi-p~~---------~~~~~d~~~~~~~~~evl~~L~~ 455 (976) ..+|+|+|-.|..|. .+.+--+|+...-+.-.+ +. |+ |-. ..+..+......+.+.+-....+ T Consensus 106 r~vAmDCEMVG~Gp~G~~s~lARvSIVN~~G~Vvy----Dk-yVkP~~~VtDyRT~vSGIrpehm~~A~pf~~aQ~ev~k 180 (280) T KOG2249 106 RVVAMDCEMVGVGPDGRESLLARVSIVNYHGHVVY----DK-YVKPTEPVTDYRTRVSGIRPEHMRDAMPFKVAQKEVLK 180 (280) T ss_pred EEEEEEEEEECCCCCCCCEEEEEEEEEECCCCEEE----EE-ECCCCCCCCCCEEEECCCCHHHHCCCCCHHHHHHHHHH T ss_conf 38997404751479976000347888614686765----55-03788663310332236587785157538999999999 Q ss_pred HHCCCCCHHHHCCCCHHHHHHHHCCCCCCC-CCCHHHHHHHHC--CCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHH Q ss_conf 202221001100321237889861886664-210023455431--68766772466776224442233565432221000 Q gi|254780277|r 456 FFENEHFLKIGHNIKYDKLVLHRYGISMRG-FEDIMLMSYVLD--SGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDK 532 (976) Q Consensus 456 lle~~~i~kIgHN~KfD~~~L~~~GI~l~~-i~DTmlaayLLd--p~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~ 532 (976) +|.+ ...|||-+.-|+.+|.-. .+.. +-||.----|.. +.....+|+.|++.+||+.... |. T Consensus 181 lL~g--RIlVGHaLhnDl~~L~l~--hp~s~iRDTs~~~pl~k~~~~~~tpSLK~Lt~~~Lg~~IQ~----Ge------- 245 (280) T KOG2249 181 LLKG--RILVGHALHNDLQALKLE--HPRSMIRDTSKYPPLMKLLSKKATPSLKKLTEALLGKDIQV----GE------- 245 (280) T ss_pred HHHC--CEEECCCCCCHHHHHHHH--CCHHHHCCCCCCCHHHHHHHCCCCCCHHHHHHHHHCHHHHC----CC------- T ss_conf 9708--777512200079998553--74355314312754877752159831999999984525431----56------- Q ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 000123333222111112456776322210 Q gi|254780277|r 533 ISDSQVQEHAIENSNVILQLWLLLRPKLIV 562 (976) Q Consensus 533 i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~ 562 (976) .-..+||.+|.+||...+.+-++ T Consensus 246 -------HsSvEDA~AtM~LY~~vk~qwe~ 268 (280) T KOG2249 246 -------HSSVEDARATMELYKRVKVQWEK 268 (280) T ss_pred -------CCCHHHHHHHHHHHHHHHHHHHH T ss_conf -------68588999999999999999999 No 127 >COG3688 Predicted RNA-binding protein containing a PIN domain [General function prediction only] Probab=95.54 E-value=0.11 Score=30.79 Aligned_cols=106 Identities=24% Similarity=0.316 Sum_probs=63.8 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECC-CCCHHHHCCHHHHCC Q ss_conf 7289996626999887426754468988550069999999999999848554798799997274-887132408652166 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYP-AVTFRNEIYPDYKAN 83 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~-~~tfR~~~~~~YKa~ 83 (976) +.+|||||++.| - +-|.+...++ .|.+--.=..|...+..+.. +.---++||||+- -|+-+. .|+-+ T Consensus 3 ~~iLLVDGYNmI-g---aWp~Lk~lkd---~~~LE~AR~~LIe~l~~Y~s-~~g~~iivVFDA~~v~g~~~----~~~~~ 70 (173) T COG3688 3 MRILLVDGYNMI-G---AWPELKQLKD---RNGLEAARDKLIEALAEYQS-FTGYKIIVVFDAHYVPGVGR----EYKNH 70 (173) T ss_pred CEEEEECCCHHC-C---CCHHHHHHHH---HCCHHHHHHHHHHHHHHHHC-CCCCEEEEEEECCCCCCCCC----CCCCC T ss_conf 328996162311-5---5677898876---24699999999999987403-05852899987434666343----43356 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCC-CEEEECCCCH Q ss_conf 79879758864899999999709979956890166589999999984899-4999868701 Q gi|254780277|r 84 RPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGF-AVTIISTDKD 143 (976) Q Consensus 84 R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~-~v~i~s~DkD 143 (976) |-+ -++..+|--||-.|--+|.......- +|.++|+|.- T Consensus 71 ~vs---------------------vvyT~~~ETADs~IEr~~~el~~~~t~~V~VaTSD~~ 110 (173) T COG3688 71 RVS---------------------VVYTKEGETADSFIERYVAELRNAATHQVIVATSDRA 110 (173) T ss_pred CEE---------------------EEEECCCCCHHHHHHHHHHHHHCCCCCEEEEEECCHH T ss_conf 369---------------------9980587627899999999973046625999967515 No 128 >COG0322 UvrC Nuclease subunit of the excinuclease complex [DNA replication, recombination, and repair] Probab=95.38 E-value=0.23 Score=28.49 Aligned_cols=12 Identities=25% Similarity=0.529 Sum_probs=5.0 Q ss_pred HHHHHHHHHCCC Q ss_conf 997444420222 Q gi|254780277|r 449 ILSYLKKFFENE 460 (976) Q Consensus 449 vl~~L~~lle~~ 460 (976) .++.++.+|.+. T Consensus 188 ~v~~~~~fL~G~ 199 (581) T COG0322 188 IVEEVKGFLKGK 199 (581) T ss_pred HHHHHHHHHCCC T ss_conf 999999997698 No 129 >smart00486 POLBc DNA polymerase type-B family. DNA polymerase alpha, delta, epsilon and zeta chain (eukaryota), DNA polymerases in archaea, DNA polymerase II in e. coli, mitochondrial DNA polymerases and and virus DNA polymerases Probab=95.25 E-value=0.062 Score=32.70 Aligned_cols=165 Identities=18% Similarity=0.170 Sum_probs=70.2 Q ss_pred CCCCCCCCCCCCC-----CC--CCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCC-CCHHHHHHHHHHHHC- Q ss_conf 8523233222433-----11--00124420102577665443455023137443334565001-788999974444202- Q gi|254780277|r 388 GSASFKIITDTMD-----AF--HSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKN-TLTKEILSYLKKFFE- 458 (976) Q Consensus 388 ~~ia~DtETt~ld-----~~--~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~-~~~~evl~~L~~lle- 458 (976) ...+||+||.+.+ +. ...++.+++............. ..+.-.......+..... ....+.+..+..++. T Consensus 4 ~i~s~DIEt~~~~~~~p~~~~~~d~i~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~E~~ll~~f~~~i~~ 82 (471) T smart00486 4 KILSFDIETYTDGGLFPDPLIFEDEIIQISLVINDGDKKGPEER-ICFTLGTCKEIDGVEVYEFNNEKELLKAFLEFIKK 82 (471) T ss_pred EEEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEECCEEEEEEE-EEECCCCCCCCCCCEEEECCCHHHHHHHHHHHHHH T ss_conf 69999989667999999988899849999999821987878877-54303676778998899809999999999999987 Q ss_pred -CCCCHHHHCCC-CHHHHHHHH----CCCCCC---------------------------------C--CCCHHHHHHHHC Q ss_conf -22100110032-123788986----188666---------------------------------4--210023455431 Q gi|254780277|r 459 -NEHFLKIGHNI-KYDKLVLHR----YGISMR---------------------------------G--FEDIMLMSYVLD 497 (976) Q Consensus 459 -~~~i~kIgHN~-KfD~~~L~~----~GI~l~---------------------------------~--i~DTmlaayLLd 497 (976) +|+ ..+|||. .||..++.+ +++... | ..|++....- . T Consensus 83 ~dpd-ii~g~N~~~FD~~yi~~R~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~D~~~~~~~-~ 160 (471) T smart00486 83 YDPD-IIYGHNISNFDLPYIISRLEKLKIKPLSFIGRLKNIIDIKRKKPLFGSKSFGKTIKVKIKGRLVIDLYNLYKN-K 160 (471) T ss_pred CCCC-EEEECCCCCCCHHHHHHHHHHHCCCHHHHHCCCCCCCCCCEECCCCCCCCCCCEEEEEEEEEEEEEHHHHHHH-H T ss_conf 1999-9996588787689999999984896477637357787643203321012566314899855999788999988-6 Q ss_pred CCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHH--HCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 68766772466776224442233565432221000--000123333222111112456776 Q gi|254780277|r 498 SGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDK--ISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 498 p~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~--i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) -...+++|+.+++.+|+....++....-.. .+.. -.......|+..|+.++.+|...+ T Consensus 161 ~~l~sy~L~~v~~~~l~~~k~d~~~~~i~~-~~~~~~~~~~~~~~Y~~~D~~l~~~l~~~~ 220 (471) T smart00486 161 LKLPSYKLDTVAEYLLGKEKDDLPYKDIPE-LYNLNYKLRDELLEYCIQDAVLTLKLFNKL 220 (471) T ss_pred HCCCCCCHHHHHHHHHCCCCCCCCHHHHHH-HHHCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 044668889999999577888899899999-985393679999999999999999999986 No 130 >pfam01550 consensus Probab=95.11 E-value=0.13 Score=30.28 Aligned_cols=84 Identities=15% Similarity=0.144 Sum_probs=60.7 Q ss_pred HHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCCCEECHHHHHHH Q ss_conf 86489999999970997995689016658999999998489949998687012122379759996678835389999888 Q gi|254780277|r 92 LPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKEEKIDIEHVIKK 171 (976) Q Consensus 92 ~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~~~~~~~~v~~~ 171 (976) +----++-.++.+||.|.+...+.||||+-|-|=..- -=.+|.|+|-||+=.=.|-|.=..|.---.+--.+|.+- T Consensus 169 rL~hrlC~~lIr~lGYaYV~a~~mEADd~CANLfHTn----TVAyV~TTDTDLlLMGCDIIlDi~p~FpPtirCrDvL~y 244 (443) T pfam01550 169 RLAHRLCVRLIRFMGYAYVDAGDMEADDVCANLYHTN----TVAYVYTTDTDLILMGCDIILDIAPLFPPTLRCRDVLAY 244 (443) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCC----EEEEEECCCCCCEEECCCCHHCCCCCCCCCEEHHHHHHH T ss_conf 7899999999997087636166652768877640135----047997057520000672101163347751008999999 Q ss_pred HCCCHHHH Q ss_conf 48988787 Q gi|254780277|r 172 WGIPPEKM 179 (976) Q Consensus 172 ~gv~p~~~ 179 (976) +|+.=.|| T Consensus 245 lgmtY~eF 252 (443) T pfam01550 245 LGMTYPEF 252 (443) T ss_pred HCCCHHHH T ss_conf 78899999 No 131 >PRK05601 DNA polymerase III subunit epsilon; Validated Probab=95.11 E-value=0.049 Score=33.42 Aligned_cols=92 Identities=17% Similarity=0.194 Sum_probs=58.8 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCC----------CCCCCCCCCCHHHHHH Q ss_conf 999866852323322243311001244201025776654434550231374433----------3456500178899997 Q gi|254780277|r 382 QKLETIGSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHT----------SQDSTGKNTLTKEILS 451 (976) Q Consensus 382 ~~L~~~~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~----------~~d~~~~~~~~~evl~ 451 (976) +.+....++|+-+-|+|..|.++.++-|....-+..+.. .+.||--++-.. ..+.....-...+|+. T Consensus 41 ~~~~~~p~va~~~~ttgihpst~r~v~~da~~~da~Ge~---~~~f~~~lNPg~DPGP~HiHGLT~e~L~gAP~F~~ia~ 117 (377) T PRK05601 41 EAIEDAPFVAVSIQSSGIHPSTSRMVTIDAVTLSADGEP---VETFHAVLNPKTDPGPFHLHGVTEEEFASAKRFGQILK 117 (377) T ss_pred HHHHCCCEEEEEEEECCCCCCCCEEEEEEEEEECCCCCC---CCEEEEECCCCCCCCCCCCCCCCHHHHCCCCCHHHHHH T ss_conf 123128727999885575777550588878886478972---23457712799999981013789999706842899999 Q ss_pred HHHHHHCCCCCHHHHCCCCHHHHHHHH Q ss_conf 444420222100110032123788986 Q gi|254780277|r 452 YLKKFFENEHFLKIGHNIKYDKLVLHR 478 (976) Q Consensus 452 ~L~~lle~~~i~kIgHN~KfD~~~L~~ 478 (976) .|..+|..- ..|.||+.||+.||.. T Consensus 118 ~l~~lLdGR--tLVaHNa~fdw~Fia~ 142 (377) T PRK05601 118 SLDRLIDGR--TLLLHNAPRTWGFIVS 142 (377) T ss_pred HHHHHHCCC--EEEEECCCCHHHHHHH T ss_conf 999984586--7998356020889999 No 132 >PRK11779 sbcB exonuclease I; Provisional Probab=95.08 E-value=0.084 Score=31.74 Aligned_cols=152 Identities=15% Similarity=0.135 Sum_probs=78.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCCCC-------CCCCCCCCEECCCCCC------CCCCCCCCCCCHHHHHHHHH Q ss_conf 8523233222433110012442010257766-------5443455023137443------33456500178899997444 Q gi|254780277|r 388 GSASFKIITDTMDAFHSKPIAIAISTLDQKD-------DTSFKVETIFIDLSVH------TSQDSTGKNTLTKEILSYLK 454 (976) Q Consensus 388 ~~ia~DtETt~ld~~~~~ivGisls~~~~~~-------~~~~~~~~~yip~~~~------~~~d~~~~~~~~~evl~~L~ 454 (976) .++-.|+||+|.++.-+.|+.++.-.-+... ...+.....++|-..- +..........-.+.+..+. T Consensus 8 ~f~fyD~ETtG~~~~~dqi~Qfa~i~td~~ln~i~~~~~~~c~~~~~~lP~p~A~lvt~itp~~~~~~~~~~~~~~~~i~ 87 (477) T PRK11779 8 TFLWHDYETFGANPALDRPAQFAGIRTDADLNIIGEPLVFYCKPADDYLPSPEAVLITGITPQEALEKGLPEAEFAARIH 87 (477) T ss_pred CEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCEEEECCCCCCCCCCCHHHCCCCCHHHHHHCCCCHHHHHHHHH T ss_conf 57999453689998776501257789877537257741056447999899930431148899999866998799999999 Q ss_pred HHHCCCCCHHHHCC-CCHHHHHHHHC---CC-C--C----CC--CCC---HHHHHHHHCCCCCC----------CCHHHH Q ss_conf 42022210011003-21237889861---88-6--6----64--210---02345543168766----------772466 Q gi|254780277|r 455 KFFENEHFLKIGHN-IKYDKLVLHRY---GI-S--M----RG--FED---IMLMSYVLDSGRSS----------HDIASI 508 (976) Q Consensus 455 ~lle~~~i~kIgHN-~KfD~~~L~~~---GI-~--l----~~--i~D---TmlaayLLdp~~~~----------~~L~~l 508 (976) ..|..|....||+| +.||-.++.+. ++ + + +| =+| .+-++|.+.|+.-. ..|+.| T Consensus 88 ~~~~~~~t~~iGyNsi~FDee~~R~~fy~nl~dPY~~~~~ngNsR~Dll~~~r~~~~~~p~~i~~p~~~~G~~sfKLe~l 167 (477) T PRK11779 88 AEFSQPGTCILGYNNIRFDDEVTRYIFYRNFYDPYAREWQNGNSRWDLLDVVRACYALRPEGINWPENEDGLPSFKLEHL 167 (477) T ss_pred HHHCCCCCEEEECCCCCCCHHHHHHHHHHHCCCHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEHHHH T ss_conf 99668984898227754489999999998613457664248970579999999998748366116737999854449999 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 776224442233565432221000000123333222111112456776322 Q gi|254780277|r 509 AQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPK 559 (976) Q Consensus 509 a~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~ 559 (976) |. ...+..+ .+.-|..||.+|..|...++++ T Consensus 168 a~-------------------aNgi~h~-~AHDAl~Dv~aTi~laklIk~k 198 (477) T PRK11779 168 TK-------------------ANGIEHE-NAHDAMSDVYATIAMAKLIKQK 198 (477) T ss_pred HH-------------------HCCCCCC-CCCHHHHHHHHHHHHHHHHHHH T ss_conf 98-------------------5598765-5410688999999999999876 No 133 >PRK07956 ligA NAD-dependent DNA ligase LigA; Validated Probab=94.93 E-value=0.019 Score=36.52 Aligned_cols=12 Identities=25% Similarity=0.465 Sum_probs=6.4 Q ss_pred HHHHHHHHCCCCC Q ss_conf 6799874207887 Q gi|254780277|r 629 LGDILFTKLKFPS 641 (976) Q Consensus 629 L~~vLf~~LgLp~ 641 (976) |..+|| -||+|- T Consensus 507 l~r~L~-ALGI~~ 518 (668) T PRK07956 507 LARFLY-ALGIRH 518 (668) T ss_pred HHHHHH-HCCCCC T ss_conf 888998-627864 No 134 >PRK08609 hypothetical protein; Provisional Probab=94.69 E-value=0.038 Score=34.21 Aligned_cols=48 Identities=31% Similarity=0.428 Sum_probs=34.3 Q ss_pred CCCCCCCHHHHHHHHHCC--CHHHHHHHHHH-----HH--HHHHHHHHCCCHHHHHH Q ss_conf 877636024998984136--78999864542-----00--11223220035789997 Q gi|254780277|r 195 GIPGIGYKTAALLLQEYG--NLENILINASR-----IK--QKKRRENILEYAETARL 242 (976) Q Consensus 195 Gv~GiG~KtA~~ll~~~g--sle~i~~~~d~-----i~--~~k~~~~l~~~~e~a~l 242 (976) .|||||||+|.+|..+.| |+|.+-+.+.+ ++ |.|..++|.++.+.... T Consensus 92 ~IpGlGPKka~~L~~eLGI~sleeL~~A~~~gri~~L~GfG~Ktee~IL~gI~~~~~ 148 (570) T PRK08609 92 KLPGLGGKKIAKLYKELGVVDKESLKEACENGKVQALAGFGKKTEEKILEALKQLGA 148 (570) T ss_pred CCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 789877899999999829999999999987384876543426899999999999975 No 135 >pfam10108 Exon_PolB Predicted 3'-5' exonuclease related to the exonuclease domain of PolB. This domain is found in various prokaryotic 3'-5' exonucleases and hypothetical proteins. Probab=94.57 E-value=0.097 Score=31.27 Aligned_cols=110 Identities=16% Similarity=0.149 Sum_probs=63.4 Q ss_pred CHHHHHHHHHHHHCCCCCHHHHCCC-CHHHHHHH----HCCCCCCCCCCHH-------------------HHHHHHCC-C Q ss_conf 8899997444420222100110032-12378898----6188666421002-------------------34554316-8 Q gi|254780277|r 445 LTKEILSYLKKFFENEHFLKIGHNI-KYDKLVLH----RYGISMRGFEDIM-------------------LMSYVLDS-G 499 (976) Q Consensus 445 ~~~evl~~L~~lle~~~i~kIgHN~-KfD~~~L~----~~GI~l~~i~DTm-------------------laayLLdp-~ 499 (976) ...++++.|=..++...-..|++|. .||+.+|. .|||....+||+- ++..|-+= . T Consensus 36 ~E~~ll~~F~~~i~~~~P~LVs~NG~gFDlPvL~~RAl~~gi~ap~~~~~~~~~~kw~nY~~Ry~~~H~DLmd~l~~~g~ 115 (211) T pfam10108 36 SEKELIQRFFDGIEKYTPQLVSFNGRGFDLPVLHYRALKHGVSAPRYFDTGDRDFKWNNYFNRYSTRHLDLMDLLSGYGA 115 (211) T ss_pred CHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCC T ss_conf 99999999999998439828865686554899999999818885078756787665444112356544548999873288 Q ss_pred CCCCCHHHHHHHHCCCCC-CCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 766772466776224442-233565432221000000123333222111112456776 Q gi|254780277|r 500 RSSHDIASIAQKWLSYTR-KEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 500 ~~~~~L~~la~~yL~~~~-~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) +..-+|+.+|.- +|... .+..|+..... +..-..++...||-.||.-|+-+|-++ T Consensus 116 ~~~~~Ld~~a~l-~G~PGK~~~~G~~V~~~-~~~G~l~~I~~YCe~DvlnTylv~LR~ 171 (211) T pfam10108 116 RANAPLDEVAAL-LGLPGKMGVDGSQVWEL-YQAGKIEEIRDYCETDVLNTYLVYLRF 171 (211) T ss_pred CCCCCHHHHHHH-CCCCCCCCCCHHHHHHH-HHCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 677898999998-19998787764589999-985988899999887689999999999 No 136 >TIGR01448 recD_rel helicase, RecD/TraA family; InterPro: IPR006345 These sequences represent a family similar to RecD, the exodeoxyribonuclease V alpha chain of IPR006344 from INTERPRO. Members of this family, however, are not found in a context of RecB and RecC and are longer by about 200 amino acids at the amino end. Chlamydia muridarum has both a member of this family and a RecD. . Probab=94.33 E-value=0.03 Score=34.96 Aligned_cols=63 Identities=24% Similarity=0.389 Sum_probs=37.5 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCC--CHHHHHHHHHH---HHH--HHHHHHHCCCHHHHHHHHHHH Q ss_conf 787777875187435899877636024998984136--78999864542---001--122322003578999765543 Q gi|254780277|r 177 EKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYG--NLENILINASR---IKQ--KKRRENILEYAETARLSRKLV 247 (976) Q Consensus 177 ~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~g--sle~i~~~~d~---i~~--~k~~~~l~~~~e~a~ls~~L~ 247 (976) +-++-|+ +|+. |+|||+|||.+++..|| ++|.|.++.++ |.| ++-++++..++.....+++|. T Consensus 87 ~gi~~Yl-----~S~~---~KGvGk~~Aq~Iv~TfGe~~~~~l~~~P~kL~~V~G~s~~~~~~~~~~~~~~~~~~~~l 156 (769) T TIGR01448 87 EGIVAYL-----SSSS---IKGVGKKLAQRIVETFGEAAIDVLDDDPEKLLEVSGISKANLEKIVSQLSKQKDERKLL 156 (769) T ss_pred HHHHHHH-----HCCC---CCCCCHHHHHHHHHHHHHHHHHHHHCCCHHEEEECCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 7876555-----2254---16720577899999876999988740860103307756677899999999755899999 No 137 >cd05778 DNA_polB_zeta_exo The 3'-5' exonuclease domain of eukaryotic DNA polymerase zeta. DNA polymerase zeta is a family-B DNA polymerase which is distantly related to DNA polymerase delta. It plays a major role in translesion replication and the production of either spontaneous or induced mutations. In addition, DNA polymerase zeta also appears to be involved in somatic hypermutability in B lymphocytes, an important element for the production of high affinity antibodies in response to an antigen. The catalytic subunit contains both polymerase and 3'-5' exonuclease domains, but only exhibits polymerase activity. The DnaQ-like 3'-5' exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, without the four conserved acidic residues that are crucial for metal binding and catalysis. Probab=94.32 E-value=0.065 Score=32.56 Aligned_cols=54 Identities=7% Similarity=-0.018 Sum_probs=28.9 Q ss_pred CCCCCHHHHHHHHCCCCCCCCCCCCCCCC--CCHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 76677246677622444223356543222--1000000123333222111112456 Q gi|254780277|r 500 RSSHDIASIAQKWLSYTRKEILKSRKSSI--PIDKISDSQVQEHAIENSNVILQLW 553 (976) Q Consensus 500 ~~~~~L~~la~~yL~~~~~e~~~~~k~~~--~~~~i~~e~~~~Yaa~Da~~t~~L~ 553 (976) ..+|+|++++.++|+...+++.-+.-... +-+.-....+..||..||...++|- T Consensus 175 L~sYsLe~V~~~~L~e~k~~~~~~~l~~~~~~g~~~~r~~v~~Yclkd~~l~l~Ll 230 (231) T cd05778 175 LTNYTLENVVYHVLHQRIPLYSNKTLTEWYKSGSASERWRVLEYYLKRVRLNLEIL 230 (231) T ss_pred CCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHCCCHHHHHHHHEEEHHHHHHHHHHH T ss_conf 44387999999996798888999999999974998785164400075305158652 No 138 >PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed Probab=94.01 E-value=0.17 Score=29.46 Aligned_cols=131 Identities=21% Similarity=0.264 Sum_probs=66.3 Q ss_pred CCCEEEECCCCHHHEE--CCCCEEEEEC--CCCCEECHHHHHHHHCCCHH-HHHHHHHHCCCCCCCCCCCCCCCCHHHHH Q ss_conf 9949998687012122--3797599966--78835389999888489887-87777875187435899877636024998 Q gi|254780277|r 132 GFAVTIISTDKDLMQL--VSPTTCLYDT--VKEEKIDIEHVIKKWGIPPE-KMVCLQALTGDSIDNIPGIPGIGYKTAAL 206 (976) Q Consensus 132 ~~~v~i~s~DkD~~Ql--v~~~v~~~~~--~~~~~~~~~~v~~~~gv~p~-~~~d~~al~GD~sDnipGv~GiG~KtA~~ 206 (976) |+--.|..+.+.+.+| +.+++.+|-. .+... ..-||+.-. .---|.-|. .|.|||||+|.. T Consensus 23 GvGY~v~v~~~~~~~l~~~g~~~~l~~~~~vrEd~------~~LyGF~~~~Er~~F~~Li--------~V~GIGpK~AL~ 88 (198) T PRK00116 23 GVGYEVQVPMRTLYELPEVGEEVKLYTHLVVREDA------QLLYGFLTKEERELFRLLI--------SVSGVGPKLALA 88 (198) T ss_pred CEEEEEEECHHHHHHCCCCCCEEEEEEEEEEECCC------CEEEEECCHHHHHHHHHHH--------CCCCCCHHHHHH T ss_conf 98899991689998624689839999999980688------7578408889999999985--------668857899998 Q ss_pred HHHHCCCHHHHHHHHH--------HHH--HHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHH Q ss_conf 9841367899986454--------200--112232200357899976554320356422476355212676768899865 Q gi|254780277|r 207 LLQEYGNLENILINAS--------RIK--QKKRRENILEYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFL 276 (976) Q Consensus 207 ll~~~gsle~i~~~~d--------~i~--~~k~~~~l~~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~ 276 (976) +|..++ .+.+.+.+. +++ |+|..++|. +-|.-++..+..+...... .-.......+++.+-+ T Consensus 89 ILs~~~-~~~l~~aI~~~D~~~L~~vpGIG~KtA~rIi-----~ELk~K~~~~~~~~~~~~~--~~~~~~~~~~d~~~AL 160 (198) T PRK00116 89 ILSGLS-PEELAQAIANGDIKALTKVPGVGKKTAERIV-----LELKDKLAALADAGAAAAA--AASAANDALEEAVSAL 160 (198) T ss_pred HHCCCC-HHHHHHHHHHCCHHHHCCCCCCCHHHHHHHH-----HHHHHHHHHHCCCCCCCCC--CCCCCCCHHHHHHHHH T ss_conf 870299-9999999985899997068897889999999-----9999888742124554545--6677752599999999 Q ss_pred HHHHHHHH Q ss_conf 32336899 Q gi|254780277|r 277 KALEFTKL 284 (976) Q Consensus 277 ~~lef~~l 284 (976) ..|+|+.- T Consensus 161 ~~LGy~~~ 168 (198) T PRK00116 161 VALGYKPK 168 (198) T ss_pred HHCCCCHH T ss_conf 98599999 No 139 >pfam04857 CAF1 CAF1 family ribonuclease. The major pathways of mRNA turnover in eukaryotes initiate with shortening of the polyA tail. CAF1 encodes a critical component of the major cytoplasmic deadenylase in yeast. Both Caf1p is required for normal mRNA deadenylation in vivo and localizes to the cytoplasm. Caf1p copurifies with a Ccr4p-dependent polyA-specific exonuclease activity. Some members of this family include and inserted RNA binding domain pfam01424. This family of proteins is related to other exonucleases pfam00929 (Bateman A pers. obs.). The crystal structure of Saccharomyces cerevisiae Pop2 has been resolved at 2.3 Angstrom#resolution. Probab=93.74 E-value=0.28 Score=27.95 Aligned_cols=52 Identities=13% Similarity=0.022 Sum_probs=24.6 Q ss_pred CCCHHHHCCCCHHHHHHHHC--CCCC---------------CCCCCHHHHHHHHCCC--CCCCCHHHHHHH Q ss_conf 21001100321237889861--8866---------------6421002345543168--766772466776 Q gi|254780277|r 460 EHFLKIGHNIKYDKLVLHRY--GISM---------------RGFEDIMLMSYVLDSG--RSSHDIASIAQK 511 (976) Q Consensus 460 ~~i~kIgHN~KfD~~~L~~~--GI~l---------------~~i~DTmlaayLLdp~--~~~~~L~~la~~ 511 (976) .++++|+||.-||...|.+. |-++ ..++||..++..+... ..+-||..+|.. T Consensus 139 ~~~~wv~f~g~yD~~yl~k~l~g~~LP~~~~eF~~~l~~~FP~vyD~K~la~~~~~~~~~~~~gL~~lA~~ 209 (235) T pfam04857 139 SSVTWVTFHGLYDFGYLLKLLTGGPLPETLDDFLELLRELFPRVYDTKYLAKFCFELDGGSNGGLQELADL 209 (235) T ss_pred CCCEEEEECCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHH T ss_conf 99879981727789999999749989998999999999977717509999985221358755789999997 No 140 >pfam00752 XPG_N XPG N-terminal domain. Probab=93.55 E-value=0.19 Score=29.08 Aligned_cols=67 Identities=18% Similarity=0.251 Sum_probs=46.3 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCE-ECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHH Q ss_conf 728999662699988742675446898855-006999999999999984855479879999727488713240865 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLP-VNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPD 79 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~-t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~ 79 (976) .+.+-||++..+||.-++...... ++.+ +.-+.||++.+.+++.. . -.-+.|||++.|..-.+-... T Consensus 25 Gk~vAID~s~wl~q~~~~~~~~~~--~~~~~~~~l~~~~~ri~~l~~~-~-----I~PvfVFDG~~p~~K~~t~~~ 92 (100) T pfam00752 25 GKTLAIDASIWLYQFLKAVRDQLG--NALQNTSHLMGFFSRLCRLKDF-G-----IKPIFVFDGGPPPLKAETLQK 92 (100) T ss_pred CCEEEECCHHHHHHHHHHHHHHHC--CCCCCCHHHHHHHHHHHHHHHC-C-----CEEEEEECCCCCHHHHHHHHH T ss_conf 987997719999999999702313--7888828999999999999988-9-----989998789996635999999 No 141 >TIGR02236 recomb_radA DNA repair and recombination protein RadA; InterPro: IPR011938 This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO), and archaeal RadB (IPR011939 from INTERPRO). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=93.47 E-value=0.058 Score=32.89 Aligned_cols=34 Identities=29% Similarity=0.557 Sum_probs=26.1 Q ss_pred CCCCCCCCCCHHHHHHH-HHCCCHHHH-HHHHHHHH Q ss_conf 89987763602499898-413678999-86454200 Q gi|254780277|r 192 NIPGIPGIGYKTAALLL-QEYGNLENI-LINASRIK 225 (976) Q Consensus 192 nipGv~GiG~KtA~~ll-~~~gsle~i-~~~~d~i~ 225 (976) +|--+|||||+||.||- +-|.+++.| .++..+++ T Consensus 1 ~ledLPGVGp~TA~KL~EaGy~t~~~iA~A~~~EL~ 36 (333) T TIGR02236 1 DLEDLPGVGPATAEKLREAGYDTLEAIAVASPKELS 36 (333) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 966578975768899886107889998445857953 No 142 >KOG2207 consensus Probab=93.36 E-value=0.33 Score=27.43 Aligned_cols=175 Identities=18% Similarity=0.164 Sum_probs=83.5 Q ss_pred CCCCCCCEEECCHHHHHHHH-HHHHHC-CCCCCCCCCCCC-CCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCC Q ss_conf 55666503547999999999-999866-852323322243-311001244201025776654434550231374433345 Q gi|254780277|r 362 PINNDSYTKIVNTKDIQQWV-QKLETI-GSASFKIITDTM-DAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQD 438 (976) Q Consensus 362 ~~~~~~y~~I~~~~~L~~li-~~L~~~-~~ia~DtETt~l-d~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d 438 (976) +.....-..|.++++++.++ +.+... -++.+|.|..-. +..+.++..+.++..+. .|.+++... T Consensus 386 ~~~~~~i~~V~~e~El~~l~l~~l~~e~~yVGiDsEwkps~~v~dsk~~IlQif~~~~---------v~Lidc~~l---- 452 (617) T KOG2207 386 PPWVESIGMVGNEKELRDLLLESLSEELRYVGIDSEWKPSKKVSDSKLAILQIFFKDC---------VYLIDCVKL---- 452 (617) T ss_pred CCCCCCEEEECCHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCHHHHHHHHHHHCCE---------EEEEEHHHH---- T ss_conf 9752301433788899999999863167799874544765678803889999870480---------899656876---- Q ss_pred CCCCCCCHHHH-HHHHHHHHCCCCCHHHHCCCCHHHHHHHH------CCCCCC---C-CCCHHHHHHHHCC-------CC Q ss_conf 65001788999-97444420222100110032123788986------188666---4-2100234554316-------87 Q gi|254780277|r 439 STGKNTLTKEI-LSYLKKFFENEHFLKIGHNIKYDKLVLHR------YGISMR---G-FEDIMLMSYVLDS-------GR 500 (976) Q Consensus 439 ~~~~~~~~~ev-l~~L~~lle~~~i~kIgHN~KfD~~~L~~------~GI~l~---~-i~DTmlaayLLdp-------~~ 500 (976) +....++ ...+..+|+++.+.|||...--|+.++.. ....+. . +.++.++..+.+- +. T Consensus 453 ----~~~~se~w~~~~s~if~s~~i~kvGf~~~eDL~~l~~s~pa~~~q~ki~~~~l~~~~~kl~e~~~~~~~~i~n~~~ 528 (617) T KOG2207 453 ----ENLASEIWHLLLSQIFESKSILKVGFSMREDLEVLEASSPALRFQMKIEGLQLVSCVLKLAENVIDLPLSIENLNE 528 (617) T ss_pred ----HHCHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHCC T ss_conf ----5240899999999870688604660335623999986516666313312318889999999988616325542121 Q ss_pred CCCCHHHHHHHHCCCCCC--CCCCCCCCCCCCHHHCCH-HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 667724667762244422--335654322210000001-23333222111112456776322 Q gi|254780277|r 501 SSHDIASIAQKWLSYTRK--EILKSRKSSIPIDKISDS-QVQEHAIENSNVILQLWLLLRPK 559 (976) Q Consensus 501 ~~~~L~~la~~yL~~~~~--e~~~~~k~~~~~~~i~~e-~~~~Yaa~Da~~t~~L~~~l~~~ 559 (976) -.-+|..|+...+|...+ +..+ ++..-|.. ....||+.|+.+...++.+.... T Consensus 529 ~~~~L~~Lt~~llg~~lnKteqcs------nWqcrpLr~nQi~yaalDa~~~~~ifkkv~~v 584 (617) T KOG2207 529 ATKGLADLTDCLLGKKLNKTEQCS------NWQCRPLRRNQIYYAALDAVVLVEIFKKVCSV 584 (617) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCC------HHHCCCCHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 110110235777522225200050------54258730668899884202668999999851 No 143 >smart00485 XPGN Xeroderma pigmentosum G N-region. domain in nucleases Probab=92.95 E-value=0.32 Score=27.45 Aligned_cols=59 Identities=19% Similarity=0.225 Sum_probs=41.4 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEEC-HHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCC Q ss_conf 72899966269998874267544689885500-69999999999999848554798799997274887 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVN-AIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVT 71 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~-ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~t 71 (976) .+-+-||++.++||..++...... ++.+++ -+.||++.+..++.. . -.-+.|||++.+- T Consensus 24 gk~vAIDas~wl~k~~~~~~~~~~--~~~~~~~~l~~~~~ri~~l~~~-~-----I~PIfVFDG~~~~ 83 (99) T smart00485 24 GKTLAIDASIWLYQFLTACREKLG--TPLPNSKHLMGLFYRTCRLLEF-G-----IKPIFVFDGKPPP 83 (99) T ss_pred CCEEEEECHHHHHHHHHHHHCCCC--CCCCCCHHHHHHHHHHHHHHHC-C-----CEEEEEECCCCCH T ss_conf 987999158999999999710015--7777719999999999999987-9-----9899988899976 No 144 >PRK07748 sporulation inhibitor KapD; Provisional Probab=92.77 E-value=0.62 Score=25.41 Aligned_cols=96 Identities=13% Similarity=0.118 Sum_probs=58.4 Q ss_pred CCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHH----CCCCCC---CCCCHHHHHHHHCCCCCCCCHHHHHHHHCCCC Q ss_conf 78899997444420222100110032123788986----188666---42100234554316876677246677622444 Q gi|254780277|r 444 TLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHR----YGISMR---GFEDIMLMSYVLDSGRSSHDIASIAQKWLSYT 516 (976) Q Consensus 444 ~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~----~GI~l~---~i~DTmlaayLLdp~~~~~~L~~la~~yL~~~ 516 (976) ....++++.|..|..+++.. ++++..+|++.|.+ +|+.+. .+.|........-+.+.+.||....+. +|.. T Consensus 77 ~~F~evl~~f~~~~~~~~~~-~~tWG~~D~~~L~~~c~~~~i~~p~~~~~idlk~~f~~~~~~~~~~gL~~aL~~-lgl~ 154 (205) T PRK07748 77 ISFEELVEKLAEYDKRCKPT-IVTWGNMDMKVLKHNCEKAGVPFPFKGQCRDLSLEYKKFFGERNQTGLWKAIEE-YGKE 154 (205) T ss_pred CCHHHHHHHHHHHHCCCCEE-EEEECHHHHHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHCCCCCCCHHHHHHH-CCCC T ss_conf 98899999999981778728-987178899999999999389998531006399999999688887689999997-6998 Q ss_pred CCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2233565432221000000123333222111112456776322 Q gi|254780277|r 517 RKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPK 559 (976) Q Consensus 517 ~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~ 559 (976) . +--..-|..||..|.+++..+... T Consensus 155 ~------------------eG~~H~glDDA~NtAkI~kk~~~d 179 (205) T PRK07748 155 G------------------TGKHHCALDDAMTTYNIFKLVEKD 179 (205) T ss_pred C------------------CCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 8------------------898637589999999999997613 No 145 >PHA00452 T3/T7-like RNA polymerase Probab=92.50 E-value=0.81 Score=24.56 Aligned_cols=142 Identities=15% Similarity=0.258 Sum_probs=77.1 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHC-------CCC-------HHHHHHHH-----HHHHHHCH---HHHHHHHHHHHHHHH Q ss_conf 863025135644077703567881-------999-------89999999-----99999888---899999999999998 Q gi|254780277|r 798 RRHAKTINFSIIYGISPFRLATQL-------RIP-------RSEAADYI-----KRYFHRFP---GIHEYIEKTKNFVRQ 855 (976) Q Consensus 798 R~~aK~infgi~YG~~~~~La~~l-------~is-------~~eA~~~i-----~~yf~~~p---~v~~~~~~~~~~a~~ 855 (976) |...|.-..-.+||.+.+|...++ ++. ...|..|+ +.-.+.|| .+..|+..+.+.+.+ T Consensus 581 Rk~vK~~vMT~~Yg~t~~g~~~qi~~~l~~~~~~~~~~~~~~~~~~~yla~~i~~ai~~~~~~a~~im~wl~~~a~~~~~ 660 (816) T PHA00452 581 RSVTKRPVMTLPYGSTRFGFRDQVREDLVQPAIDEGKGAMFTNQAAGYMAKLIWDAISETVPAAVEAMNWLQQAAKLLAK 660 (816) T ss_pred HHHHCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 76632531675206610008999999997305566743034789999999999999999977099999999999999971 Q ss_pred CC----C-------------------EEC-CCCCEECC-C-CCCCCCHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH Q ss_conf 49----8-------------------882-77876018-9-875883788877878770--1345653899999999999 Q gi|254780277|r 856 NG----Y-------------------VET-IFGRRIHY-D-EINSPKSSIRNINERAAI--NAPIQGSAADITRRAMISV 907 (976) Q Consensus 856 ~g----y-------------------v~T-~~GRrr~~-p-~i~s~~~~~r~~aeR~a~--N~~iQGsaAdi~k~am~~i 907 (976) .| | |+| +.|+.+.- + +.+..+.... ++|+ |..=-=-|+-++..| + T Consensus 661 ~~~~v~W~TP~G~pV~Q~Y~~~~~~~v~t~l~g~~~~~~~~~~~~~~~~kq----~~a~~PNfIHSlDashm~~t~-~-- 733 (816) T PHA00452 661 EVKDLHWTTPTGFPVWQEYRKTETKRVRLRLMGSARIQLTVNTDEIDARKH----ASGIAPNFVHSLDASHLRLTV-V-- 733 (816) T ss_pred CCCCCEEECCCCCEEEEECCCCCCEEEEEEECCEEEEEECCCCCCCCHHHH----HCCCCCHHHHHHHHHHHHHHH-H-- T ss_conf 699728989999989877378884689987415279983378898897787----243794888889999999999-9-- Q ss_pred HHHHHHCCCCEEEEEEECCEEEEECCHHHHHHHHHHHHHHHHHCC Q ss_conf 999986599818999726389995598799999999999965101 Q gi|254780277|r 908 HKSLEDHKLATKMLLQIHDELVFEVPEEEITTASQIIVHSMEKAC 952 (976) Q Consensus 908 ~~~~~~~~~~~~~ilqVHDElv~ev~~~~~~~~~~~~~~~Me~~~ 952 (976) ...+.|+.+ ..-|||+. =++...++.+.+++++.+-.-+ T Consensus 734 --~~~~~gi~~--fa~VHDsf--~tha~dv~~m~~~lRe~Fv~ly 772 (816) T PHA00452 734 --ACAEKGIES--FAVIHDSF--GTHAADADNLFKAVRETMVKMY 772 (816) T ss_pred --HHHHCCCCC--EEEEECCC--CCCCCCHHHHHHHHHHHHHHHH T ss_conf --999839962--78863278--8771589999999999999996 No 146 >COG1948 MUS81 ERCC4-type nuclease [DNA replication, recombination, and repair] Probab=92.33 E-value=0.17 Score=29.48 Aligned_cols=27 Identities=44% Similarity=0.819 Sum_probs=17.0 Q ss_pred CCCCCCCHHHHHHHHHCCCHHHHHHHH Q ss_conf 877636024998984136789998645 Q gi|254780277|r 195 GIPGIGYKTAALLLQEYGNLENILINA 221 (976) Q Consensus 195 Gv~GiG~KtA~~ll~~~gsle~i~~~~ 221 (976) .+||||++.|-.||..|||++++...- T Consensus 186 s~pgig~~~a~~ll~~fgS~~~~~tas 212 (254) T COG1948 186 SIPGIGPKLAERLLKKFGSVEDVLTAS 212 (254) T ss_pred CCCCCCHHHHHHHHHHHCCHHHHHHCC T ss_conf 089964899999999856888776559 No 147 >TIGR00575 dnlj DNA ligase, NAD-dependent; InterPro: IPR001679 DNA ligase (polydeoxyribonucleotide synthase) is the enzyme that joins two DNA fragments by catalyzing the formation of an internucleotide ester bond between phosphate and deoxyribose. It is active during DNA replication, DNA repair and DNA recombination. There are two forms of DNA ligase: one requires ATP (6.5.1.1 from EC), the other NAD (6.5.1.2 from EC). This family is predominantly composed of NAD-dependent bacterial DNA ligases. They are proteins of about 75 to 85 Kd whose sequence is well conserved , . They also show similarity to yicF, an Escherichia coli hypothetical protein of 63 Kd.; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair. Probab=92.27 E-value=0.086 Score=31.65 Aligned_cols=57 Identities=9% Similarity=0.151 Sum_probs=33.1 Q ss_pred EHHHHHHHHHCC-----CCHHHHH--HHHHHCCCH-HHCCH---HHHHH--HHHHHH-HHHHCCCHHHHHH Q ss_conf 037778734069-----8889999--989809996-88998---88863--025135-6440777035678 Q gi|254780277|r 763 KITPLYQVFENS-----LDIHRMV--AAEIFGVNI-EKVSP---QMRRH--AKTINF-SIIYGISPFRLAT 819 (976) Q Consensus 763 ~D~~l~~af~~g-----~DiH~~t--A~~vf~~~~-~~vt~---~~R~~--aK~inf-gi~YG~~~~~La~ 819 (976) |++.-..-|.++ .|+++.+ ...+.+.+. =.-.+ -.++. +|..-| =++||+|-..+++ T Consensus 461 G~~~I~~LfE~~Lv~~~~DLY~L~~k~~~Ll~le~~~g~~sa~nLl~aIe~sK~~pl~RlL~aLGIr~VG~ 531 (706) T TIGR00575 461 GEKVIEQLFEEKLVRSVADLYALKEKLEDLLELEGKFGEKSAQNLLSAIEKSKKKPLARLLFALGIRHVGE 531 (706) T ss_pred CHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 38999999872666785666577889999870640355688999999999852104899998628603579 No 148 >TIGR00084 ruvA Holliday junction DNA helicase RuvA; InterPro: IPR000085 In prokaryotes, RuvA, RuvB, and RuvC process the universal DNA intermediate of homologous recombination, termed Holliday junction. The tetrameric DNA helicase RuvA specifically binds to the Holliday junction and facilitates the isomerization of the junction from the stacked folded configuration to the square-planar structure . In the RuvA tetramer, each subunit consists of three domains, I, II and III, where I and II form the major core that is responsible for Holliday junction binding and base pair rearrangements of Holliday junction executed at the crossover point, whereas domain III regulates branch migration through direct contact with RuvB.; GO: 0003678 DNA helicase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=92.05 E-value=0.17 Score=29.52 Aligned_cols=111 Identities=17% Similarity=0.216 Sum_probs=58.3 Q ss_pred CH-HHHHHHHCCCHHH-HHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCC--CHHHHHHHHH-----HHH--HHHHHHH Q ss_conf 89-9998884898878-7777875187435899877636024998984136--7899986454-----200--1122322 Q gi|254780277|r 164 DI-EHVIKKWGIPPEK-MVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYG--NLENILINAS-----RIK--QKKRREN 232 (976) Q Consensus 164 ~~-~~v~~~~gv~p~~-~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~g--sle~i~~~~d-----~i~--~~k~~~~ 232 (976) |. .-+.+-||+.-.. =.-|.-|+ ||-|||||+|..+|.... ..+.+.++-| +++ |+|..++ T Consensus 62 daNQi~~~LfGF~~~~Er~lF~~Li--------~~nGvGpk~ALaiL~~~~~~~~~~ai~~~~~~~L~k~pGvGKK~A~~ 133 (217) T TIGR00084 62 DANQILHLLFGFNTLEERELFKELI--------KVNGVGPKLALAILSNMSPEEFVQAIETEEVKALVKIPGVGKKTAER 133 (217) T ss_pred CHHHHHHHHHCCCCHHHHHHHHHHH--------HCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 4678999973479877899999985--------14880289999986678875898888641044420458857378999 Q ss_pred HC--CCHHHHHHHHH--HHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 00--35789997655--432035642247635521267676889986532336899 Q gi|254780277|r 233 IL--EYAETARLSRK--LVMLRTDVPITIPLEHLILEDCNGPRLISFLKALEFTKL 284 (976) Q Consensus 233 l~--~~~e~a~ls~~--L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~~l 284 (976) |. +=++...-.+. ..|=..| ++.-+. ...-.....+++..-+..|||+.- T Consensus 134 l~~leL~gk~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~y~E~~~aL~sLGY~~~ 187 (217) T TIGR00084 134 LLALELKGKLKGVKNLEMFTNLAN-HLETDE-KLTAEAARYDELFEALVSLGYKPQ 187 (217) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCC-CCCCCC-HHHHHHHHHHHHHHHHHHCCCCHH T ss_conf 987775454405521124554335-335541-012102118899999997298888 No 149 >PRK10702 endonuclease III; Provisional Probab=91.88 E-value=0.2 Score=28.92 Aligned_cols=26 Identities=15% Similarity=0.103 Sum_probs=16.0 Q ss_pred HHHCCCCCEECCCC--CCCCCCCCCCCC Q ss_conf 33002221000001--333433345563 Q gi|254780277|r 702 FYSLASTMTGRLAS--LEPNLQNIPIKT 727 (976) Q Consensus 702 ~f~q~~t~TGRlSs--~~PNlQNiP~r~ 727 (976) .|||..-.-||.-| .+|+=..-|.+. T Consensus 174 ~~~~~li~~G~~iC~~~~P~C~~Cpl~~ 201 (211) T PRK10702 174 DCHHWLILHGRYTCIARKPRCGSCIIED 201 (211) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 9999999995015069999399998914 No 150 >PRK06722 exonuclease; Provisional Probab=91.66 E-value=0.18 Score=29.35 Aligned_cols=51 Identities=18% Similarity=0.237 Sum_probs=38.7 Q ss_pred HHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHH----CCCCCCCC--CCHH-HHHHHHCC Q ss_conf 899997444420222100110032123788986----18866642--1002-34554316 Q gi|254780277|r 446 TKEILSYLKKFFENEHFLKIGHNIKYDKLVLHR----YGISMRGF--EDIM-LMSYVLDS 498 (976) Q Consensus 446 ~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~----~GI~l~~i--~DTm-laayLLdp 498 (976) ..+|+..|-++..+..+ |+||++||+.+|.+ ||+...++ .|+. ++-+++.+ T Consensus 38 ieeVLp~FleFIGd~~L--VAHNAsFD~gFL~~~C~~hg~e~P~~~~~~~~dl~k~v~~~ 95 (242) T PRK06722 38 FPQIIEKFIQFIGEDSI--FVTWGKEDYRFLSHDCTLHGVECPCMEKERRIDLQKFVFQA 95 (242) T ss_pred HHHHHHHHHHHHCCCEE--EEECCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 89999999998278838--97344022779997752428999998421088899998631 No 151 >pfam10391 DNA_pol_lambd_f Fingers domain of DNA polymerase lambda. DNA polymerases catalyse the addition of dNMPs onto the 3-prime ends of DNA chains. There is a general polymerase fold consisting of three subdomains that have been likened to the fingers, palm, and thumb of a right hand. DNA_pol_lambd_f is the central three-helical region of DNA polymerase lambda referred to as the F and G helices of the fingers domain. Contacts with DNA involve this conserved helix-hairpin-helix motif in the fingers region which interacts with the primer strand. This motif is common to several DNA binding proteins and confers a sequence-independent interaction with the DNA backbone. Probab=91.66 E-value=0.13 Score=30.29 Aligned_cols=32 Identities=28% Similarity=0.359 Sum_probs=25.0 Q ss_pred CCCCCCCCHHHHHHHHH-CCCHHHHHHHHHHHH Q ss_conf 98776360249989841-367899986454200 Q gi|254780277|r 194 PGIPGIGYKTAALLLQE-YGNLENILINASRIK 225 (976) Q Consensus 194 pGv~GiG~KtA~~ll~~-~gsle~i~~~~d~i~ 225 (976) -+|-||||+||.+|..+ +.|||.+-++-+.+. T Consensus 5 ~~I~GvGp~~A~~~~~~G~~ti~dL~~~~~~L~ 37 (52) T pfam10391 5 TNIWGVGPKTARKWYRQGIRTLEDLRENKASLT 37 (52) T ss_pred HHCCCCCHHHHHHHHHHCCCCHHHHHHCCCCCC T ss_conf 635440699999999947888999974631089 No 152 >smart00483 POLXc DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases Probab=91.41 E-value=0.22 Score=28.63 Aligned_cols=31 Identities=29% Similarity=0.334 Sum_probs=21.1 Q ss_pred CCCCCCCCCCHHHHHHHHH-CCCHHHHHHHHH Q ss_conf 8998776360249989841-367899986454 Q gi|254780277|r 192 NIPGIPGIGYKTAALLLQE-YGNLENILINAS 222 (976) Q Consensus 192 nipGv~GiG~KtA~~ll~~-~gsle~i~~~~d 222 (976) .+-+|||||||||.+|..+ ..||+.+-++.+ T Consensus 90 el~~I~GvGpk~a~~l~~~Gi~tl~dL~~a~~ 121 (334) T smart00483 90 LFTNVFGVGPKTAAKWYRKGIRTLEELKKNKE 121 (334) T ss_pred HHHCCCCCCHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 98538887789999999849887999998788 No 153 >cd00141 NT_POLXc Nucleotidyltransferase (NT) domain of family X DNA Polymerases. X family polymerases fill in short gaps during DNA repair. They are relatively inaccurate enzymes and play roles in base excision repair, in non-homologous end joining (NHEJ) which acts mainly to repair damage due to ionizing radiation, and in V(D)J recombination. This family includes eukaryotic Pol beta, Pol lambda, Pol mu, and terminal deoxyribonucleotidyl transferase (TdT). Pol beta and Pol lambda are primarily DNA template-dependent polymerases. TdT is a DNA template-independent polymerase. Pol mu has both template dependent and template independent activities. This subgroup belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal ion mechanism of nucleotide addition. These three carboxylate residues are fairly well conserved in this Probab=91.37 E-value=0.2 Score=29.01 Aligned_cols=29 Identities=34% Similarity=0.471 Sum_probs=20.0 Q ss_pred CCCCCCCCCHHHHHHHHH-CCCHHHHHHHH Q ss_conf 998776360249989841-36789998645 Q gi|254780277|r 193 IPGIPGIGYKTAALLLQE-YGNLENILINA 221 (976) Q Consensus 193 ipGv~GiG~KtA~~ll~~-~gsle~i~~~~ 221 (976) +-.|||||||||.+|... +.||+.+-++. T Consensus 87 l~~I~GiGpk~a~~l~~~Gi~sl~dL~~a~ 116 (307) T cd00141 87 LLRVPGVGPKTARKLYELGIRTLEDLRKAA 116 (307) T ss_pred HHCCCCCCHHHHHHHHHCCCCCHHHHHHHH T ss_conf 964788788999999982999799999750 No 154 >PRK05761 DNA polymerase I; Reviewed Probab=91.30 E-value=1 Score=23.77 Aligned_cols=102 Identities=22% Similarity=0.286 Sum_probs=59.7 Q ss_pred HHHHHHHHHHHHC--CCCCHHHHCCC-CHHHHHHHH----CCCCCCCC-C-------------CHH-------HHHHHHC Q ss_conf 8999974444202--22100110032-123788986----18866642-1-------------002-------3455431 Q gi|254780277|r 446 TKEILSYLKKFFE--NEHFLKIGHNI-KYDKLVLHR----YGISMRGF-E-------------DIM-------LMSYVLD 497 (976) Q Consensus 446 ~~evl~~L~~lle--~~~i~kIgHN~-KfD~~~L~~----~GI~l~~i-~-------------DTm-------laayLLd 497 (976) -.+++..|-.++. ||++ + +|. .||+..|.+ +|+....+ + |.- +-.|-.. T Consensus 198 E~elL~~f~~~i~~~DPDI--i-yN~d~FD~PYL~~Ra~~lgi~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~y~~~ 274 (777) T PRK05761 198 EKELLAELFKIIEDYPPVI--T-FNGDNFDLPYLYNRALKLGIPKEELPFNVKYASLKTGIHIDLYKFFQNRAIRSYAFY 274 (777) T ss_pred HHHHHHHHHHHHHHCCCCE--E-ECCCCCCHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCEEEECCCCCCHHHHHHEEEC T ss_conf 9999999999997469829--9-807876489999999983899000665156654024446504453121135530105 Q ss_pred --CCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf --68766772466776224442233565432221000000123333222111112456776 Q gi|254780277|r 498 --SGRSSHDIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 498 --p~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) -...+++|+.++..+||....+... +......+.+.+||.+||..|++|.... T Consensus 275 ~~~~~~~y~Ld~Va~~lLg~~K~~~~~------~~~~~~~~~l~~Y~l~Da~l~~~L~~~~ 329 (777) T PRK05761 275 GKYRHREAALDAVARALLGESKVELEK------NISELNYWELAEYNYRDAEITLKLTFFN 329 (777) T ss_pred CEECCCCEEHHHHHHHHCCCCCCCCCC------CCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 474247455478899975997410312------4663347889999899999999987530 No 155 >smart00484 XPGI Xeroderma pigmentosum G I-region. domain in nucleases Probab=91.17 E-value=0.77 Score=24.71 Aligned_cols=66 Identities=30% Similarity=0.376 Sum_probs=49.6 Q ss_pred HCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCC------CEECHHHHHHHHCC Q ss_conf 709979956890166589999999984899499986870121223797599966788------35389999888489 Q gi|254780277|r 104 AFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKE------EKIDIEHVIKKWGI 174 (976) Q Consensus 104 ~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~------~~~~~~~v~~~~gv 174 (976) .||||++..|+ |||=..|.|++. |.-=-++|.|-|.+-.=.+.+..--...+ +.++.+.+.++.|+ T Consensus 2 l~Gipyi~Ap~-Eaea~ca~L~~~----g~vd~v~s~D~D~l~~G~~~~ir~~~~~~~~~~~~~~~~l~~il~~L~l 73 (73) T smart00484 2 LMGIPYIVAPY-EAEAQCAYLAKS----GLVDAIITEDSDLLLFGAPRLYRNLFFSGKKKLEFRIIDLESVLKELGL 73 (73) T ss_pred CCCCCEEECCC-CHHHHHHHHHHC----CCEEEEECCCHHHHHHCCCEEEEECCCCCCCCCCEEEEEHHHHHHHCCC T ss_conf 46963697697-899999999988----9889997262664764976402312335884355499889999976489 No 156 >PRK13901 ruvA Holliday junction DNA helicase motor protein; Provisional Probab=90.98 E-value=0.37 Score=27.07 Aligned_cols=121 Identities=19% Similarity=0.235 Sum_probs=56.8 Q ss_pred CCCEEEECCCCHHHEEC-CCCEEEEECC--CCCEECHHHHHHHHCCCHH-HHHHHHHHCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 99499986870121223-7975999667--8835389999888489887-877778751874358998776360249989 Q gi|254780277|r 132 GFAVTIISTDKDLMQLV-SPTTCLYDTV--KEEKIDIEHVIKKWGIPPE-KMVCLQALTGDSIDNIPGIPGIGYKTAALL 207 (976) Q Consensus 132 ~~~v~i~s~DkD~~Qlv-~~~v~~~~~~--~~~~~~~~~v~~~~gv~p~-~~~d~~al~GD~sDnipGv~GiG~KtA~~l 207 (976) ||+|.+ +.+.+.+|- .+.+.+|-.. +.. -..-||+.-. .---|.-| -.|.|||||+|..+ T Consensus 25 GY~V~v--s~~~~~~l~~~~~v~l~t~~~vrEd------~~~LyGF~~~~Er~~F~~L--------isVsGIGpk~Al~i 88 (196) T PRK13901 25 EFELLV--SSFCLAELRLLEDVEILTYLHTRED------ELKLFGFLNSSEREVFEEL--------IGVDGIGPRAALRV 88 (196) T ss_pred EEEEEE--CHHHHHHCCCCCEEEEEEEEEEEEC------CCEEECCCCHHHHHHHHHH--------HCCCCCCHHHHHHH T ss_conf 999996--7899976589980899999999516------7713365988999999998--------76588268999999 Q ss_pred HHHCCCHHHHHHHH---H-----HHH--HHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHH Q ss_conf 84136789998645---4-----200--1122322003578999765543203564224763552126767688998653 Q gi|254780277|r 208 LQEYGNLENILINA---S-----RIK--QKKRRENILEYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLK 277 (976) Q Consensus 208 l~~~gsle~i~~~~---d-----~i~--~~k~~~~l~~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~ 277 (976) |..+. .+.+...+ | +++ |+|..++|. .-|.-++... +...-.....+++..-+. T Consensus 89 Ls~~~-~~~l~~aI~~~D~~~L~~vpGIG~KtA~rIi-----~ELk~Kl~~~----------~~~~~~~~~~~e~~~AL~ 152 (196) T PRK13901 89 LSGIK-YNEFRDAIDREDIELISKVKGIGNKMAGKIF-----LKLRGKLVKN----------DELESSLFKFKELEQSIV 152 (196) T ss_pred HCCCC-HHHHHHHHHHCCHHHHHHCCCCCHHHHHHHH-----HHHHHHHCCC----------CCCCCCCCCHHHHHHHHH T ss_conf 75799-9999999992899998319995899999999-----9997653156----------655655344899999999 Q ss_pred HHHHHHH Q ss_conf 2336899 Q gi|254780277|r 278 ALEFTKL 284 (976) Q Consensus 278 ~lef~~l 284 (976) .|+|++- T Consensus 153 ~LGy~~~ 159 (196) T PRK13901 153 NMGFDRK 159 (196) T ss_pred HCCCCHH T ss_conf 8499989 No 157 >COG0417 PolB DNA polymerase elongation subunit (family B) [DNA replication, recombination, and repair] Probab=90.96 E-value=1.2 Score=23.41 Aligned_cols=162 Identities=15% Similarity=0.141 Sum_probs=80.2 Q ss_pred HCCCCCCCCCCCCCCCC-----CCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHC-- Q ss_conf 66852323322243311-----00124420102577665443455023137443334565001788999974444202-- Q gi|254780277|r 386 TIGSASFKIITDTMDAF-----HSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFE-- 458 (976) Q Consensus 386 ~~~~ia~DtETt~ld~~-----~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle-- 458 (976) .-..++||+||.+.... .+.+.-++.+........ ....+.+..... .-......++++.+..++. T Consensus 153 ~l~~la~DiE~~~~~~~~~~~~~d~~~~i~~~~~~~~~~~---~~~~~~~~~~~~----v~~~~~e~e~l~~~~~~i~~~ 225 (792) T COG0417 153 PLRVLAFDIETLSEPGKFPDGEKDPIIMISYAIEAEGGLI---EVFIYTSGEGFS----VEVVISEAELLERFVELIREY 225 (792) T ss_pred CCCEEEEEEEECCCCCCCCCCCCCCEEEEEEEECCCCCCC---CCCCCCCCCCCC----EEEECCHHHHHHHHHHHHHHC T ss_conf 8754799503105777889877786269999704677764---443223567763----487258799999999998502 Q ss_pred CCCCHHHHCCCC-HHHHHHH----HCCCCCC-------------------C--CCCHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 221001100321-2378898----6188666-------------------4--210023455431687667724667762 Q gi|254780277|r 459 NEHFLKIGHNIK-YDKLVLH----RYGISMR-------------------G--FEDIMLMSYVLDSGRSSHDIASIAQKW 512 (976) Q Consensus 459 ~~~i~kIgHN~K-fD~~~L~----~~GI~l~-------------------~--i~DTmlaayLLdp~~~~~~L~~la~~y 512 (976) ||++. +|||.. ||+..|. .+|+.+. | ..|..-+..--.....+++|.+++..+ T Consensus 226 dPdVI-vgyn~~~fd~pyl~~Ra~~lgi~~~~gr~~~~~~~~~~~~~~~~Gr~~iDl~~~~~~~~~~~~~ysl~~v~~~~ 304 (792) T COG0417 226 DPDVI-VGYNGDNFDWPYLAERAERLGIPLRLGRDGSELRVRKSGFSSQVGRLHIDLYPALRRRPLNLKSYSLEAVSEAL 304 (792) T ss_pred CCCEE-EEECCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCEEEEEHHHHHHHCCCCCCCEEHHHHHHHH T ss_conf 98999-98367777738999999981998511356653200212544557716741066542032355632434677664 Q ss_pred CCCCCCCCCCCCCC--CCCCHHHCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 24442233565432--22100000012333322211111245677 Q gi|254780277|r 513 LSYTRKEILKSRKS--SIPIDKISDSQVQEHAIENSNVILQLWLL 555 (976) Q Consensus 513 L~~~~~e~~~~~k~--~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~ 555 (976) |+............ ..+...-.......|+..|++++.++... T Consensus 305 l~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~l~~~~~~~ 349 (792) T COG0417 305 LGEGKREDIPYDSMEEIWPDWADSKLRLLLYNLSDADLVLRILLK 349 (792) T ss_pred CCCCCHHCCCCCHHHHCCCCCCCCHHHHHHHCCCHHHHHHHHHHH T ss_conf 133522005620133304235510768888341078899999866 No 158 >KOG2248 consensus Probab=90.95 E-value=0.26 Score=28.10 Aligned_cols=98 Identities=13% Similarity=0.158 Sum_probs=62.8 Q ss_pred CCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHCC-CCCCCCHHHHHHHHCCCCCC Q ss_conf 50017889999744442022210011003212378898618866642100234554316-87667724667762244422 Q gi|254780277|r 440 TGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYGISMRGFEDIMLMSYVLDS-GRSSHDIASIAQKWLSYTRK 518 (976) Q Consensus 440 ~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~~i~DTmlaayLLdp-~~~~~~L~~la~~yL~~~~~ 518 (976) .+......+|-+.|++|+....++ |||-+--|+.+|+. .-+.+.||.+.-..-+. ...+.+|+.|++.||+.... T Consensus 274 e~~t~tl~dvq~~l~~~~~~~TIL-VGHSLenDL~aLKl---~H~~ViDTa~lf~~~~g~~~~k~sLk~L~~~~L~~~Iq 349 (380) T KOG2248 274 ENSTITLEDVQKELLELISKNTIL-VGHSLENDLKALKL---DHPSVIDTAVLFKHPTGPYPFKSSLKNLAKSYLGKLIQ 349 (380) T ss_pred HCCCCCHHHHHHHHHHHCCCCCEE-EEECHHHHHHHHHH---HCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 258627999999999646867179-95100307889764---27742200588864889854259999999999889986 Q ss_pred CCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 335654322210000001233332221111124567763 Q gi|254780277|r 519 EILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 519 e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) + +. ...-+.+||..+.+|..... T Consensus 350 ~--~~--------------~~HdS~eDA~acm~Lv~~k~ 372 (380) T KOG2248 350 E--GV--------------GGHDSVEDALACMKLVKLKI 372 (380) T ss_pred C--CC--------------CCCCCHHHHHHHHHHHHHHH T ss_conf 1--67--------------78760888999999999998 No 159 >TIGR03491 RecB family nuclease, putative, TM0106 family. Members of this uncharacterized protein family are found broadly but sporadically among bacteria. The N-terminal region is homologous to the Cas4 protein of CRISPR systems, although this protein family shows no signs of association with CRISPR repeats. Probab=90.87 E-value=0.76 Score=24.76 Aligned_cols=151 Identities=17% Similarity=0.168 Sum_probs=76.6 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCC-CCCHHHH Q ss_conf 852323322243311001244201025776654434550231374433345650017889999744442022-2100110 Q gi|254780277|r 388 GSASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFEN-EHFLKIG 466 (976) Q Consensus 388 ~~ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~-~~i~kIg 466 (976) +.+-||+|..- +.....+.|+.+....+. ...|.|+-.+...+ -..+...|-.++.. |. ..|. T Consensus 285 gdlffDiEg~P-d~~leYL~G~~~~~~~~~-------~~~f~~f~A~~~~~-------E~~~~~~f~~~l~~~p~-~~Iy 348 (457) T TIGR03491 285 GELIFDIESDP-DENLDYLHGFLVVDKGQE-------NEKYRPFLAEDPNT-------EELAWQQFLQLLQSYPD-APIY 348 (457) T ss_pred CCEEEEECCCC-CCCCCEEEEEEEECCCCC-------CCEEEEHHCCCCHH-------HHHHHHHHHHHHHHCCC-CEEE T ss_conf 63798505689-988776899997248887-------75054200489357-------89999999999986899-6599 Q ss_pred CCCCHHHHHHHH----CCCCCC-------CCCCHHH--HHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCC--- Q ss_conf 032123788986----188666-------4210023--455431687667724667762244422335654322210--- Q gi|254780277|r 467 HNIKYDKLVLHR----YGISMR-------GFEDIML--MSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRKSSIPI--- 530 (976) Q Consensus 467 HN~KfD~~~L~~----~GI~l~-------~i~DTml--aayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k~~~~~--- 530 (976) |=..|....|.+ ||+.-. .+.|..- -..+.-| ..+|||+.++. ++|....+-..+|...+.+ T Consensus 349 HY~~yE~~al~rLa~~~~~~e~evd~ll~r~VDL~~vvr~~~~~p-~~sysLK~ve~-~~gf~w~~~~~~g~~si~~y~~ 426 (457) T TIGR03491 349 HYGETEKDSLRRLAKRYGTPEAEIEELLKRFVDIHTIVRRSWILP-IESYSLKSIAR-WLGFEWRQKEASGAKSLLWYRQ 426 (457) T ss_pred EECHHHHHHHHHHHHHCCCCHHHHHHHHHHHEEHHHHHHCEEECC-CCCCCHHHHHH-HCCCCCCCCCCCHHHHHHHHHH T ss_conf 845487999999998719986999999987610699760158837-99875999998-7197445799887999999999 Q ss_pred -----HHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf -----00000123333222111112456776 Q gi|254780277|r 531 -----DKISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 531 -----~~i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) +..-.+....|.-.||..|+.|...| T Consensus 427 w~~t~d~~~l~~I~~YN~dDC~aT~~l~~WL 457 (457) T TIGR03491 427 WKKTGDRRLLERILRYNEDDCRATWIVADWL 457 (457) T ss_pred HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9975889999999998489999999999659 No 160 >COG0272 Lig NAD-dependent DNA ligase (contains BRCT domain type II) [DNA replication, recombination, and repair] Probab=90.59 E-value=0.16 Score=29.68 Aligned_cols=19 Identities=0% Similarity=0.342 Sum_probs=14.3 Q ss_pred EEECCHHHHHHHHHHHHHC Q ss_conf 3547999999999999866 Q gi|254780277|r 369 TKIVNTKDIQQWVQKLETI 387 (976) Q Consensus 369 ~~I~~~~~L~~li~~L~~~ 387 (976) .-.-+.+++..|.+..... T Consensus 86 ~naf~~~el~~f~~ri~~~ 104 (667) T COG0272 86 DNAFDEEELRAFDRRIRKF 104 (667) T ss_pred HHCCCHHHHHHHHHHHHHH T ss_conf 4227989999999999875 No 161 >pfam09293 RNaseH_C T4 RNase H, C terminal. Members of this family are found in T4 RNaseH ribonuclease, and adopt a SAM domain-like fold, consisting of a bundle of four/five helices. These residues may have a role in providing a docking site for other proteins or enzymes in the replication fork. Probab=89.39 E-value=0.58 Score=25.63 Aligned_cols=103 Identities=18% Similarity=0.169 Sum_probs=56.7 Q ss_pred CHHHHHHHHHHCCCCCCCCCCCCC-----------CCCHH-HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 887877778751874358998776-----------36024-998984136789998645420011223220035789997 Q gi|254780277|r 175 PPEKMVCLQALTGDSIDNIPGIPG-----------IGYKT-AALLLQEYGNLENILINASRIKQKKRRENILEYAETARL 242 (976) Q Consensus 175 ~p~~~~d~~al~GD~sDnipGv~G-----------iG~Kt-A~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~l 242 (976) .|.++.--.-|.||+||+||-|-- .-.|- ..|.|..+ ++ .+..+. -.. .+...+-.. T Consensus 2 nP~~yl~ehIlkGD~~DGIPNiLS~Dd~fvt~veG~RQkPiskKkl~~~------~~-~~~~~~-~~~---~e~~rny~R 70 (122) T pfam09293 2 SPEKDLMTKIIKGDKKDGVASIKVRSDYILTRVEGERAPSIRTKELEAI------FD-REDPKV-LLT---EEEYARYDE 70 (122) T ss_pred CHHHHHHHHHHCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHH------HC-CCCCCC-CCC---HHHHHHHHH T ss_conf 8879999882267886887874668862321567656898409999999------86-688443-489---999999975 Q ss_pred HHHHHHCCCCCCCCCC---HHHHC-CCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6554320356422476---35521-26767688998653233689998765 Q gi|254780277|r 243 SRKLVMLRTDVPITIP---LEHLI-LEDCNGPRLISFLKALEFTKLINRVA 289 (976) Q Consensus 243 s~~L~tL~~Dvpl~~~---~~~l~-~~~~d~~~l~~~~~~lef~~ll~rL~ 289 (976) +++|+-|. .+|-++. ++.+. ....+...+..+|-+-.++.|+..+. T Consensus 71 N~~LIDL~-~iP~~i~~~Ii~~y~~~~~~~r~kll~YFi~~rLk~Lie~I~ 120 (122) T pfam09293 71 NRELIDFD-FIPDDIADNIIEAYNTYKVPPRGKIYSYFVKNGLVKLLEKVN 120 (122) T ss_pred HHHHHCCH-HCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 37772811-184999999999986489886215499999999999999871 No 162 >KOG1275 consensus Probab=88.94 E-value=0.41 Score=26.70 Aligned_cols=22 Identities=27% Similarity=0.360 Sum_probs=10.5 Q ss_pred HHHHHHHHHHHCCCCCCCCCCH Q ss_conf 8999765543203564224763 Q gi|254780277|r 238 ETARLSRKLVMLRTDVPITIPL 259 (976) Q Consensus 238 e~a~ls~~L~tL~~Dvpl~~~~ 259 (976) |.-..=+.|-+++-=.|+++.. T Consensus 245 D~FvkVYDLRmmral~PI~~~~ 266 (1118) T KOG1275 245 DPFVKVYDLRMMRALSPIQFPY 266 (1118) T ss_pred CCHHHHHHHHHHHCCCCCCCCC T ss_conf 6313202235553168856566 No 163 >PRK05359 oligoribonuclease; Provisional Probab=88.73 E-value=0.54 Score=25.85 Aligned_cols=29 Identities=17% Similarity=0.122 Sum_probs=22.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCC Q ss_conf 85232332224331100124420102577 Q gi|254780277|r 388 GSASFKIITDTMDAFHSKPIAIAISTLDQ 416 (976) Q Consensus 388 ~~ia~DtETt~ld~~~~~ivGisls~~~~ 416 (976) ..+=+|+|+||||+..+.|+-++...-++ T Consensus 4 ~lvWiDlEMTGLd~~~d~IlEiA~ivTD~ 32 (181) T PRK05359 4 NLIWIDLEMTGLDPERDRIIEIATIVTDA 32 (181) T ss_pred CEEEEECCCCCCCCCCCEEEEEEEEEECC T ss_conf 65899761348987877689999999988 No 164 >KOG4373 consensus Probab=88.66 E-value=0.84 Score=24.45 Aligned_cols=104 Identities=18% Similarity=0.201 Sum_probs=61.7 Q ss_pred HHHHHHHHHCCCCCHHHHCCCCHHHHHHH--HCCCCCCCCCCHHH-HHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99744442022210011003212378898--61886664210023-4554316876677246677622444223356543 Q gi|254780277|r 449 ILSYLKKFFENEHFLKIGHNIKYDKLVLH--RYGISMRGFEDIML-MSYVLDSGRSSHDIASIAQKWLSYTRKEILKSRK 525 (976) Q Consensus 449 vl~~L~~lle~~~i~kIgHN~KfD~~~L~--~~GI~l~~i~DTml-aayLLdp~~~~~~L~~la~~yL~~~~~e~~~~~k 525 (976) |-..|+.+|+|++..+||-+..-|..=|. .|+..+.-..|... ..-.+.++..+.+...+....+++...++..... T Consensus 172 IP~~LR~fl~D~~~~~vgv~~d~D~~KL~r~~hql~I~~~~dlr~~~~d~~g~~~~~~s~e~i~~~~~~~~~~~v~l~~~ 251 (319) T KOG4373 172 IPHELRSFLEDPDHTFVGVWNDQDAGKLERKEHQLEIGELEDLRLLVNDSLGGSMPNDSFEEIVSETLGYYGKDVRLDKE 251 (319) T ss_pred CHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHCCCHHHHHHHHHHCCHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCHH T ss_conf 55999986238770587305553078776655226377664087650333336766761889999874124622025721 Q ss_pred CCC---CCHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 222---1000000123333222111112456 Q gi|254780277|r 526 SSI---PIDKISDSQVQEHAIENSNVILQLW 553 (976) Q Consensus 526 ~~~---~~~~i~~e~~~~Yaa~Da~~t~~L~ 553 (976) -+. .+..++ .....||+-|+.+.+.|+ T Consensus 252 i~msdw~~~~Ls-~~Ql~~asidvy~c~~lg 281 (319) T KOG4373 252 IRMSDWSVYPLS-DDQLLQASIDVYVCHKLG 281 (319) T ss_pred CCCCCCEEEECC-HHHHHHHHHHHHHHHHHH T ss_conf 043565146266-889999876788887888 No 165 >TIGR00630 uvra excinuclease ABC, A subunit; InterPro: IPR004602 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). During the process of Escherichia coli nucleotide excision repair, DNA damage recognition and processing are achieved by the action of the uvrA, uvrB, and uvrC gene products . The UvrC protein contain 4 conserved regions: a central region which interact with UvrB (Uvr domain), a Helix hairpin Helix (HhH) domain important for 5 prime incision of damage DNA and the homology regions 1 and 2 of unknown function. UvrC homology region 2 is specific for UvrC proteins, whereas UvrC homology region 1 is also shared by few other nucleases.; GO: 0009381 excinuclease ABC activity, 0006289 nucleotide-excision repair, 0009380 excinuclease repair complex. Probab=87.37 E-value=0.8 Score=24.59 Aligned_cols=38 Identities=24% Similarity=0.458 Sum_probs=32.0 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEE Q ss_conf 67881999899999999999988889999999999999849888 Q gi|254780277|r 817 LATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVE 860 (976) Q Consensus 817 La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~ 860 (976) .|.-|.||.+||.+| |+.+|.|.++++-..+-. -||++ T Consensus 816 IadVLdMtveEA~eF----F~~~P~I~rKL~~L~dVG--LgY~~ 853 (956) T TIGR00630 816 IADVLDMTVEEAREF----FEAVPKISRKLQTLCDVG--LGYIK 853 (956) T ss_pred EHHHHCCCHHHHHHH----HHCCHHHHHHHHHHHHHC--CCEEE T ss_conf 123315866899887----422605899999987624--21020 No 166 >TIGR01298 RNaseT ribonuclease T; InterPro: IPR005987 Ribonuclease T (3.1.13 from EC) is an enzyme found so far only in gamma-subdivision proteobacteria such as Escherichia coli and Xylella fastidiosa. Ribonuclease T is homologous to the DNA polymerase III alpha chain. It can liberate AMP from the common C-C-A terminus of uncharged tRNA. It appears also to be involved in RNA maturation. It also acts as a 3' to 5' single-strand DNA-specific exonuclease; it is distinctive for its ability to remove residues near a double-stranded stem. Ribonuclease T is a high copy suppressor in E. coli of a UV-repair defect caused by deletion of three other single-stranded DNA exonucleases.; GO: 0004540 ribonuclease activity, 0006396 RNA processing. Probab=86.81 E-value=0.47 Score=26.28 Aligned_cols=151 Identities=15% Similarity=0.192 Sum_probs=70.1 Q ss_pred CCCCCCCCCCCCCCCCCC---CCEEECCCCCCCCCCCCCCEECCCCCCCCCC--------------CC-----CCCCCHH Q ss_conf 232332224331100124---4201025776654434550231374433345--------------65-----0017889 Q gi|254780277|r 390 ASFKIITDTMDAFHSKPI---AIAISTLDQKDDTSFKVETIFIDLSVHTSQD--------------ST-----GKNTLTK 447 (976) Q Consensus 390 ia~DtETt~ld~~~~~iv---Gisls~~~~~~~~~~~~~~~yip~~~~~~~d--------------~~-----~~~~~~~ 447 (976) +.+|+||.|+|..+.-+. .+.+..++.+ +-++...++.-+.-..+.+ .+ ..+.-.. T Consensus 11 VVvDVET~GfN~~~dALLEiAait~~MDe~G--~L~P~~~~~~~~~P~~Gani~P~~L~~~GI~~d~P~R~A~~E~~Al~ 88 (201) T TIGR01298 11 VVVDVETAGFNAKTDALLEIAAITLKMDEQG--YLVPDETLHFHVEPFEGANIEPEALEFTGIDLDHPLRGAVEEKLALE 88 (201) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHCCCCCCE--EECCCCEEEEEEECCCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHH T ss_conf 4886106776601345555542000116730--01478525788522679988712002206544887522466889999 Q ss_pred HHHHHHHHHHCC---CCCHHHHCCCCHHHHHHHH-----CCCC---CCC--CCCHHHHHHHHCCCCCCCCHHHHHHHHCC Q ss_conf 999744442022---2100110032123788986-----1886---664--21002345543168766772466776224 Q gi|254780277|r 448 EILSYLKKFFEN---EHFLKIGHNIKYDKLVLHR-----YGIS---MRG--FEDIMLMSYVLDSGRSSHDIASIAQKWLS 514 (976) Q Consensus 448 evl~~L~~lle~---~~i~kIgHN~KfD~~~L~~-----~GI~---l~~--i~DTmlaayLLdp~~~~~~L~~la~~yL~ 514 (976) ++....|.-+.. ..-..|+||+.||..+|.+ .+++ +.+ .|||.-.+=|.- ++. =|..-++ T Consensus 89 ~iF~~vR~~~K~~gC~RailV~HNa~fD~~Fl~~AA~~R~~~KRnPFH~F~~FDTatLAGl~y-GQT--VL~kA~Q---- 161 (201) T TIGR01298 89 EIFKVVRKALKASGCQRAILVAHNASFDLGFLNRAAVKRTALKRNPFHPFSVFDTATLAGLAY-GQT--VLAKACQ---- 161 (201) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHH-HHH--HHHHHHH---- T ss_conf 999999999975579603676003204666899999998611578987652246899988864-179--9998886---- Q ss_pred CCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4422335654322210000001233332221111124567763222100 Q gi|254780277|r 515 YTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLRPKLIVE 563 (976) Q Consensus 515 ~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~~~L~~~ 563 (976) .-++.|+. ....-|.-|+.-|-.|+..+-.++++- T Consensus 162 ----------~Ag~~fd~----~~AHSA~YDtE~TA~LFC~ivN~wk~~ 196 (201) T TIGR01298 162 ----------AAGLDFDA----KQAHSALYDTEKTAELFCEIVNRWKEL 196 (201) T ss_pred ----------HCCCCCCC----CCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ----------42788675----443101200356899999886304550 No 167 >COG1796 POL4 DNA polymerase IV (family X) [DNA replication, recombination, and repair] Probab=86.64 E-value=0.56 Score=25.72 Aligned_cols=20 Identities=25% Similarity=0.300 Sum_probs=10.0 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 888630251356440777035678 Q gi|254780277|r 796 QMRRHAKTINFSIIYGISPFRLAT 819 (976) Q Consensus 796 ~~R~~aK~infgi~YG~~~~~La~ 819 (976) ..|+.||.=.| -.|.+||.+ T Consensus 270 ~iR~lA~~kg~----klseyGl~~ 289 (326) T COG1796 270 KIRQLAKAKGE----KLSEYGLFR 289 (326) T ss_pred HHHHHHHHHCC----CHHHCCEEC T ss_conf 99999997186----310146432 No 168 >PRK13901 ruvA Holliday junction DNA helicase motor protein; Provisional Probab=86.48 E-value=0.28 Score=27.89 Aligned_cols=36 Identities=25% Similarity=0.365 Sum_probs=19.5 Q ss_pred CCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHC Q ss_conf 98878777787518743589987763602499898413 Q gi|254780277|r 174 IPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEY 211 (976) Q Consensus 174 v~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~ 211 (976) .+|.++.+ |+...-..-+-.|||||+|||.+++-|- T Consensus 92 ~~~~~l~~--aI~~~D~~~L~~vpGIG~KtA~rIi~EL 127 (196) T PRK13901 92 IKYNEFRD--AIDREDIELISKVKGIGNKMAGKIFLKL 127 (196) T ss_pred CCHHHHHH--HHHHCCHHHHHHCCCCCHHHHHHHHHHH T ss_conf 99999999--9992899998319995899999999999 No 169 >PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed Probab=85.55 E-value=0.37 Score=27.03 Aligned_cols=28 Identities=32% Similarity=0.598 Sum_probs=16.2 Q ss_pred HHCCCCCCCCCCCCCCCCHHHHHHHHHC Q ss_conf 7518743589987763602499898413 Q gi|254780277|r 184 ALTGDSIDNIPGIPGIGYKTAALLLQEY 211 (976) Q Consensus 184 al~GD~sDnipGv~GiG~KtA~~ll~~~ 211 (976) |+.-.-..-+--|||||+|||.+++.|. T Consensus 101 aI~~~D~~~L~~vpGIG~KtA~rIi~EL 128 (198) T PRK00116 101 AIANGDIKALTKVPGVGKKTAERIVLEL 128 (198) T ss_pred HHHHCCHHHHCCCCCCCHHHHHHHHHHH T ss_conf 9985899997068897889999999999 No 170 >COG0177 Nth Predicted EndoIII-related endonuclease [DNA replication, recombination, and repair] Probab=85.53 E-value=0.9 Score=24.21 Aligned_cols=16 Identities=25% Similarity=0.578 Sum_probs=7.2 Q ss_pred CEECCCCC--CCCCCCCC Q ss_conf 10000013--33433345 Q gi|254780277|r 709 MTGRLASL--EPNLQNIP 724 (976) Q Consensus 709 ~TGRlSs~--~PNlQNiP 724 (976) .-||.-|. +|.=-.-| T Consensus 181 ~~GR~iC~ar~P~C~~C~ 198 (211) T COG0177 181 LHGRYICKARKPRCEECP 198 (211) T ss_pred HHHHHHCCCCCCCCCCCC T ss_conf 960531168999867564 No 171 >COG0632 RuvA Holliday junction resolvasome, DNA-binding subunit [DNA replication, recombination, and repair] Probab=84.38 E-value=1 Score=23.78 Aligned_cols=49 Identities=22% Similarity=0.264 Sum_probs=31.1 Q ss_pred HHHHHHHHCCCHHHH-HHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHH Q ss_conf 999988848988787-7778751874358998776360249989841367899986454 Q gi|254780277|r 165 IEHVIKKWGIPPEKM-VCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINAS 222 (976) Q Consensus 165 ~~~v~~~~gv~p~~~-~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d 222 (976) .++...-||+.-..= .-|.-| -.|.|||||+|..+|+-. +++.+...++ T Consensus 54 REd~~~LyGF~~~~ER~lF~~L--------isVnGIGpK~ALaiLs~~-~~~~l~~aI~ 103 (201) T COG0632 54 REDAHLLYGFLTEEERELFRLL--------ISVNGIGPKLALAILSNL-DPEELAQAIA 103 (201) T ss_pred HHHHHHHCCCCCHHHHHHHHHH--------HCCCCCCHHHHHHHHCCC-CHHHHHHHHH T ss_conf 2027887089988999999998--------711880589999998489-9999999998 No 172 >TIGR01259 comE comEA protein; InterPro: IPR004787 The comE locus is obligatory for bacterial cell competence - the process of internalizing the exogenous added DNA. comEA and comEC are required for transformability, whereas the products of comEB and of the overlapping comER, which is transcribed in the reverse direction, are dispensable . ComEA has been shown to be an integral membrane protein, as predicted from hydropathy analysis, with its C-terminal domain outside the cytoplasmic membrane. This C-terminal domain possesses a sequence with similarity to those of several proteins known to be involved in nucleic acid transactions including UvrC and a human protein that binds to the replication origin of the Human herpesvirus 4 (Epstein-Barr virus) .. Probab=82.92 E-value=1.1 Score=23.50 Aligned_cols=43 Identities=30% Similarity=0.469 Sum_probs=32.0 Q ss_pred HHHHHHHCCCHHH---HHHHHHHCC--CCCCCCCCCCCCCCHHHHHHH Q ss_conf 9998884898878---777787518--743589987763602499898 Q gi|254780277|r 166 EHVIKKWGIPPEK---MVCLQALTG--DSIDNIPGIPGIGYKTAALLL 208 (976) Q Consensus 166 ~~v~~~~gv~p~~---~~d~~al~G--D~sDnipGv~GiG~KtA~~ll 208 (976) .+.+.--||-|.. |++|.+=-| -|.|.|.=|.|||+|+-.||= T Consensus 72 ~EL~~l~GiGP~kA~aIi~YRe~nG~F~SvddL~kVsGIG~k~~eKLk 119 (124) T TIGR01259 72 EELQALPGIGPAKAKAIIEYREENGAFKSVDDLTKVSGIGEKSLEKLK 119 (124) T ss_pred HHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHHHHCCCCCCHHHHHHHH T ss_conf 998636999813379999999856997775550035788546687420 No 173 >COG2925 SbcB Exonuclease I [DNA replication, recombination, and repair] Probab=82.91 E-value=3.2 Score=20.19 Aligned_cols=113 Identities=15% Similarity=0.087 Sum_probs=58.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCC-------CCCCCCCCCEECCC------CCCCCCCCCCCCCCHHHHHHHH Q ss_conf 6852323322243311001244201025776-------65443455023137------4433345650017889999744 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQK-------DDTSFKVETIFIDL------SVHTSQDSTGKNTLTKEILSYL 453 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~~-------~~~~~~~~~~yip~------~~~~~~d~~~~~~~~~evl~~L 453 (976) ..++-.|.||.|.+|..+.+-.++----+.. ...++....-|+|- ..-+.+......+...+....+ T Consensus 9 ~tF~~yDYETfG~~Pa~DRPaQFAgIRTD~~~NiIgeP~~fyCkpsdDyLP~P~a~LITGITPQ~~~~~Gi~E~~F~~~I 88 (475) T COG2925 9 PTFLFYDYETFGVHPALDRPAQFAGIRTDIEFNIIGEPIVFYCKPADDYLPQPGAVLITGITPQEAREKGINEAAFAARI 88 (475) T ss_pred CCEEEEECHHCCCCCCCCCCHHHHEEECCCCCCCCCCCEEEEECCCCCCCCCCCCEEEECCCHHHHHHCCCCHHHHHHHH T ss_conf 86799750331778566661354112103434324787279854754568998736662578788976489858999999 Q ss_pred HHHHCCCCCHHHHCC-CCHHHHHH----HHCCCCC-----C---C-C--CCHHHHHHHHCCC Q ss_conf 442022210011003-21237889----8618866-----6---4-2--1002345543168 Q gi|254780277|r 454 KKFFENEHFLKIGHN-IKYDKLVL----HRYGISM-----R---G-F--EDIMLMSYVLDSG 499 (976) Q Consensus 454 ~~lle~~~i~kIgHN-~KfD~~~L----~~~GI~l-----~---~-i--~DTmlaayLLdp~ 499 (976) ...|..|....+|+| +.||--+- +++-+++ + + | .|.+-+.|-|.|. T Consensus 89 ~~~ls~P~Tcv~GYNniRfDDEvtRy~fyRNF~DPYa~sWqngNSRWDLLD~~RacyALRPe 150 (475) T COG2925 89 HAELTQPNTCVLGYNNIRFDDEVTRYIFYRNFYDPYAWSWQNGNSRWDLLDVVRACYALRPE 150 (475) T ss_pred HHHHCCCCEEEECCCCCCCCHHHHHHHHHHHCCCHHHHHHCCCCCHHHHHHHHHHHHHCCCC T ss_conf 99857998156235553332688988898722766667613898425799999999752855 No 174 >PRK13636 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=82.54 E-value=2.4 Score=21.07 Aligned_cols=15 Identities=13% Similarity=0.195 Sum_probs=8.5 Q ss_pred CCCHHHHHHHHHHHH Q ss_conf 999899999999999 Q gi|254780277|r 822 RIPRSEAADYIKRYF 836 (976) Q Consensus 822 ~is~~eA~~~i~~yf 836 (976) -+|.+||.+.+..|. T Consensus 265 ~lt~~e~~~~l~~~~ 279 (285) T PRK13636 265 DLTISQARKTLNSWK 279 (285) T ss_pred CCCHHHHHHHHHHHH T ss_conf 777999999999999 No 175 >KOG2841 consensus Probab=81.86 E-value=0.99 Score=23.91 Aligned_cols=10 Identities=30% Similarity=0.663 Sum_probs=6.5 Q ss_pred HHHHHHHHHC Q ss_conf 2333453300 Q gi|254780277|r 696 TQRVHTFYSL 705 (976) Q Consensus 696 dgRIH~~f~q 705 (976) -.|+|..|+| T Consensus 240 ak~l~~~l~~ 249 (254) T KOG2841 240 AKRLHKFLHQ 249 (254) T ss_pred HHHHHHHHHC T ss_conf 9999999845 No 176 >pfam00633 HHH Helix-hairpin-helix motif. The helix-hairpin-helix DNA-binding motif is found to be duplicated in the central domain of RuvA. The HhH domain of DisA, a bacterial checkpoint control protein, is a DNA-binding domain. Probab=81.36 E-value=0.33 Score=27.43 Aligned_cols=25 Identities=36% Similarity=0.579 Sum_probs=18.6 Q ss_pred HCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 5187435899877636024998984 Q gi|254780277|r 185 LTGDSIDNIPGIPGIGYKTAALLLQ 209 (976) Q Consensus 185 l~GD~sDnipGv~GiG~KtA~~ll~ 209 (976) +.=.+-+-+-.|||||+|||..++. T Consensus 5 ~~~as~eeL~~lpGVG~~tA~~I~~ 29 (30) T pfam00633 5 LIPASREELLALPGVGPKTAEAILS 29 (30) T ss_pred CCCCCHHHHHHCCCCCHHHHHHHHC T ss_conf 3523599997288977688998853 No 177 >TIGR00615 recR recombination protein RecR; InterPro: IPR000093 The bacterial protein recR seems to play a role in a recombinational process of DNA repair . It may act with recF and recO. RecR is a protein of about 200 amino acid residues. This protein contains a putative C4-type zinc finger in the N-terminal section.; GO: 0006281 DNA repair, 0006310 DNA recombination. Probab=81.17 E-value=0.53 Score=25.89 Aligned_cols=19 Identities=42% Similarity=0.714 Sum_probs=15.5 Q ss_pred CCCCCCCCCCCCHHHHHHH Q ss_conf 3589987763602499898 Q gi|254780277|r 190 IDNIPGIPGIGYKTAALLL 208 (976) Q Consensus 190 sDnipGv~GiG~KtA~~ll 208 (976) -|.+.=+||||+|||.+|- T Consensus 11 ie~L~kLPgiG~KsA~RlA 29 (205) T TIGR00615 11 IESLKKLPGIGPKSAQRLA 29 (205) T ss_pred HHHHHHCCCCCHHHHHHHH T ss_conf 9986407898714789999 No 178 >smart00278 HhH1 Helix-hairpin-helix DNA-binding motif class 1. Probab=80.57 E-value=0.94 Score=24.07 Aligned_cols=21 Identities=38% Similarity=0.629 Sum_probs=16.9 Q ss_pred CCCCCCCCCCHHHHHHHHHCC Q ss_conf 899877636024998984136 Q gi|254780277|r 192 NIPGIPGIGYKTAALLLQEYG 212 (976) Q Consensus 192 nipGv~GiG~KtA~~ll~~~g 212 (976) -+-.|||||+|+|.+++.... T Consensus 2 ~L~~v~GIG~k~A~~ll~~~~ 22 (26) T smart00278 2 ELLKVPGIGPKTAEKILEAXX 22 (26) T ss_pred CCCCCCCCCCHHHHHHHHHHH T ss_conf 210179988115999999762 No 179 >PRK13646 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=80.33 E-value=3.5 Score=19.95 Aligned_cols=13 Identities=8% Similarity=0.227 Sum_probs=5.1 Q ss_pred CCHHHHHHHHHHH Q ss_conf 9989999999999 Q gi|254780277|r 823 IPRSEAADYIKRY 835 (976) Q Consensus 823 is~~eA~~~i~~y 835 (976) ++.+|+.+++..| T Consensus 270 ~t~e~~~~~~k~~ 282 (286) T PRK13646 270 LTEEAFVSLYKEW 282 (286) T ss_pred CCHHHHHHHHHHH T ss_conf 4299999999997 No 180 >cd00431 cysteine_hydrolases Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle. It also contains isochorismatase, an enzyme that catalyzes the conversion of isochorismate to 2,3-dihydroxybenzoate and pyruvate, via the hydrolysis of the vinyl ether bond, and other related enzymes with unknown function. Probab=79.90 E-value=4 Score=19.47 Aligned_cols=118 Identities=17% Similarity=0.180 Sum_probs=69.5 Q ss_pred EEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCC---------------- Q ss_conf 8999662699988742675446898855006999999999999984855479879999727488---------------- Q gi|254780277|r 7 LFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAV---------------- 70 (976) Q Consensus 7 ~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~---------------- 70 (976) |++||--+-+... .+......-.++.-+.++++..+.... .++.+-+...+ T Consensus 2 LlvID~Q~~f~~~-----------~~~~~~~~~~~i~~i~~l~~~~r~~~~--pvi~~~~~~~~~~~~~~~~~~~~~~~~ 68 (161) T cd00431 2 LLVVDMQNDFVPG-----------GGLLLPGADELVPNINRLLAAARAAGI--PVIFTRDWHPPDDPEFAELLWPPHCVK 68 (161) T ss_pred EEEECCCHHHCCC-----------CCCCCCCHHHHHHHHHHHHHHHHHCCC--CEEEEEECCCCCCCCCCCCCCCCCCCC T ss_conf 8999384442199-----------753456899999999999999998499--489998458887420000257655438 Q ss_pred -CHHHHCCHHHHCC------CCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEEC Q ss_conf -7132408652166------79879758864899999999709979956890166589999999984899499986 Q gi|254780277|r 71 -TFRNEIYPDYKAN------RPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIS 139 (976) Q Consensus 71 -tfR~~~~~~YKa~------R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s 139 (976) +.-.++.|+-+.. +|..+..+.. +-+.++|...||..+..-|.++|=++-+-++.+.+.|++|++++ T Consensus 69 g~~~~~i~~~l~~~~~~~vi~K~~~saF~~--t~L~~~L~~~~i~~lii~G~~T~~CV~~Ta~~a~~~g~~v~vv~ 142 (161) T cd00431 69 GTEGAELVPELAPLPDDLVIEKTRYSAFYG--TDLDELLRERGIDTLVVCGIATDICVLATARDALDLGYRVIVVE 142 (161) T ss_pred CCCCCCCCCCCCCCCCCEEEECCCCCCCCC--CCHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHHHHCCCEEEEEC T ss_conf 997565082107899988985674677578--63899999739986799644453399999999998899899967 No 181 >cd01702 Pol_eta Pol eta is member of the DNA polimerase Y-family. Unlike other Y-family members, Pol eta can efficiently and accurately replicate DNA past UV-induced cis-syn cyclobutane thymine-thumine (T-T) lesions. It synthesizes AA opposite a TT dimer. Pol eta is able to replicate through a variety of other distorting DNA lesion as well. Probab=79.58 E-value=2.7 Score=20.74 Aligned_cols=102 Identities=13% Similarity=0.102 Sum_probs=43.3 Q ss_pred HCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCE-ECCCC Q ss_conf 09996889988886302513564407770356788199989999999999998888999999999999984988-82778 Q gi|254780277|r 786 FGVNIEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYV-ETIFG 864 (976) Q Consensus 786 f~~~~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv-~T~~G 864 (976) .|++..+|.+. ...|.|.-.--| +...-.+..+.+..+..+- .+..+++.+..+.+|.+ +|+.= T Consensus 237 ~Gid~~~V~~~--~~~KSI~~e~TF-------~~~~~~~~~~~~~~l~~l~------~~l~~Rl~~~~~~~~~~~~tvtv 301 (358) T cd01702 237 RGIDDEEVRPR--PLPKSMGSSKNF-------RGKTSSSLEDAQKWLEVLA------GELASRLIELEEENKRRPKTLTL 301 (358) T ss_pred CCCCCCCCCCC--CCCCCCEEEEEC-------CCCCCCCHHHHHHHHHHHH------HHHHHHHHHHHHHCCCEEEEEEE T ss_conf 59899877689--998411576610-------9998886999999999999------99999999999843964689999 Q ss_pred CEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 760189875883788877878770134565389999999999999998 Q gi|254780277|r 865 RRIHYDEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLE 912 (976) Q Consensus 865 Rrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~ 912 (976) ..|+-+. .. .-|+. .+..+. ..-+|++.|+-.+.+-++ T Consensus 302 kir~~~f-~~---~sr~~----~l~~~~--d~~~i~~~A~~ll~~i~~ 339 (358) T cd01702 302 SLRSKGG-GT---RSRQC----ALPIYD--DLEKLVKDAFKLLKQLNR 339 (358) T ss_pred EEECCCC-CE---EEEEE----ECCCCC--CHHHHHHHHHHHHHHHHH T ss_conf 9998999-77---88885----579999--999999999999999986 No 182 >PRK10017 putative pyruvyl transferase; Provisional Probab=79.18 E-value=1.7 Score=22.23 Aligned_cols=105 Identities=12% Similarity=0.228 Sum_probs=63.9 Q ss_pred CCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHH Q ss_conf 22430242489948999632654436876651037778734069888999998980999688998888630251356440 Q gi|254780277|r 731 QKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIY 810 (976) Q Consensus 731 ~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~Y 810 (976) +.+++....++...++.-||+-.|+ ++-+|+..-++ |-=+|+..++.++|+|. .||-| T Consensus 297 ~~i~~~m~~~~~~~il~~~~~~~E~--~~ii~~~dl~I-----G~RLHslIfA~~~gvP~---------------i~IsY 354 (426) T PRK10017 297 LNLRQHISDPARYHVVMDELNDLEM--GKILGACELTV-----GTRLHSAIISMNFATPA---------------IAINY 354 (426) T ss_pred HHHHHHHCCCCCEEEECCCCCHHHH--HHHHHHCHHHH-----HHHHHHHHHHHHCCCCE---------------EEEEE T ss_conf 9999972687636983899998999--99997392299-----88899999999759996---------------98402 Q ss_pred CCCHHHHHHHCCC----------CHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCC Q ss_conf 7770356788199----------9899999999999988889999999999999849 Q gi|254780277|r 811 GISPFRLATQLRI----------PRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNG 857 (976) Q Consensus 811 G~~~~~La~~l~i----------s~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~g 857 (976) .--..|+.+++|+ +.++...-+++.+..++.+++-....+++.|+.| T Consensus 355 d~K~~g~~~~lGl~~~~~di~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 411 (426) T PRK10017 355 EHKSAGIMQQLGLPEMAIDIRHLLDGSLQAMVADTLGQLPALNARLAEAVSRERQTG 411 (426) T ss_pred CHHHHHHHHHCCCCHHEECHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 287899999759930030377669278999999999769999999999999999999 No 183 >pfam04599 Pox_G5 Poxvirus G5 protein. This protein has been predicted to be related to the FEN-1 endonuclease. Probab=79.15 E-value=4.2 Score=19.31 Aligned_cols=94 Identities=21% Similarity=0.270 Sum_probs=39.4 Q ss_pred HHHHHHHC--CCCEEEECCEEHHHHHHHHHHHHH-HCCCCEEEECCCCHHHEECCCCEEEEECCCCCEECHHHHHHHHCC Q ss_conf 99999970--997995689016658999999998-489949998687012122379759996678835389999888489 Q gi|254780277|r 98 VRLATQAF--GIPAIEIQGFEADDIIATYTYIAE-KEGFAVTIISTDKDLMQLVSPTTCLYDTVKEEKIDIEHVIKKWGI 174 (976) Q Consensus 98 i~~~l~~~--gi~~~~~~~~EADDviatla~~~~-~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~~~~~~~~v~~~~gv 174 (976) +.+.|..+ ++.++--+|..|.=|.-+-|+... ..|.=-+++|.|.|.+-+.+-+. ..+- ...+-+-|-+ T Consensus 149 ~~~~L~~~~~~v~I~yCDgvDAEFvMC~~Ak~~a~~~G~WPlliStDQDtllf~s~D~-----~~Ki---I~t~~~~Y~~ 220 (426) T pfam04599 149 LEEILSSLKEDVTIIYCDGIDAEFVMCSRAKKLADKTGNWPLLISTDQDTLLLSSCDT-----LPKI---IKTINQIYKL 220 (426) T ss_pred HHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCEEEEEECCC-----HHHH---HHHHHHHHEE T ss_conf 9999984527856998778576689999999999735987637730666256540685-----8889---9979888521 Q ss_pred CHHHHHHHHH---HCCCCCCCCCCCCCC Q ss_conf 8878777787---518743589987763 Q gi|254780277|r 175 PPEKMVCLQA---LTGDSIDNIPGIPGI 199 (976) Q Consensus 175 ~p~~~~d~~a---l~GD~sDnipGv~Gi 199 (976) -|....-|++ ..---.|=.||.-|+ T Consensus 221 ~P~~~t~YL~kL~~L~NGCDfFpGLyG~ 248 (426) T pfam04599 221 IPCAKTKYLSKLVVLVNGCDFFPGLYGI 248 (426) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCCE T ss_conf 7870478999999998266345763306 No 184 >cd01014 nicotinamidase_related Nicotinamidase_ related amidohydrolases. Cysteine hydrolases of unknown function that share the catalytic triad with other amidohydrolases, like nicotinamidase, which converts nicotinamide to nicotinic acid and ammonia. Probab=79.13 E-value=4.2 Score=19.30 Aligned_cols=97 Identities=16% Similarity=0.143 Sum_probs=63.0 Q ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEE-EECC-C-----CCHHHHCCHHHHCC------CCCCCHHHHHHHHHHHHHHHHC Q ss_conf 99999999999984855479879999-7274-8-----87132408652166------7987975886489999999970 Q gi|254780277|r 39 AGFCNMLWKLLQNSRKESIASHFAVI-FDYP-A-----VTFRNEIYPDYKAN------RPKIPEMLLPQLPLVRLATQAF 105 (976) Q Consensus 39 ~gf~~~l~~~i~~~~~~~~~~~~~v~-fD~~-~-----~tfR~~~~~~YKa~------R~~~p~~l~~q~~~i~~~l~~~ 105 (976) -.++..+.++++.++....| ++.+ +..+ . ++...+++|+-+.. +|..+..|.. +-+..+|... T Consensus 22 ~~~i~~i~~Li~~ar~~g~p--Vi~v~h~~~~~~~~~~g~~g~~l~~~l~~~~~d~vi~K~~~saF~~--t~L~~~L~~~ 97 (155) T cd01014 22 EAALENIAALIAAARAAGIP--VIHVRHIDDEGGSFAPGSEGWEIHPELAPLEGETVIEKTVPNAFYG--TDLEEWLREA 97 (155) T ss_pred HHHHHHHHHHHHHHHHCCCC--EEEEEECCCCCCCCCCCCCCCCCCHHHCCCCCCEEEECCCCCCCCC--CCHHHHHHHC T ss_conf 99999999999999986993--8999746888887799998752274007899998998897786668--7879999975 Q ss_pred CCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEEC Q ss_conf 9979956890166589999999984899499986 Q gi|254780277|r 106 GIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIS 139 (976) Q Consensus 106 gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s 139 (976) ||..+..-|...|=++-+-|+-+.+.|++|++++ T Consensus 98 gi~~lii~Gi~T~~CV~~Ta~da~~~Gy~v~vv~ 131 (155) T cd01014 98 GIDHLVICGAMTEMCVDTTVRSAFDLGYDVTVVA 131 (155) T ss_pred CCCEEEEEEECCCHHHHHHHHHHHHCCCEEEEEC T ss_conf 9970899871016799999999998899799958 No 185 >pfam03104 DNA_pol_B_exo DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold. Probab=77.99 E-value=4.1 Score=19.43 Aligned_cols=91 Identities=15% Similarity=0.150 Sum_probs=44.4 Q ss_pred HCCCCCCCCCCCCCC-----CCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCC-CCC-CCHHHHHHHHHHHHC Q ss_conf 668523233222433-----1100124420102577665443455023137443334565-001-788999974444202 Q gi|254780277|r 386 TIGSASFKIITDTMD-----AFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDST-GKN-TLTKEILSYLKKFFE 458 (976) Q Consensus 386 ~~~~ia~DtETt~ld-----~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~-~~~-~~~~evl~~L~~lle 458 (976) .....+||+||.+.+ +..+.++.||............ ....+.........+.. ... ..-.+++..+..++. T Consensus 85 plrilsfDIEc~~~~g~FP~~~~d~IiqIs~~~~~~g~~~~~-~~~~~tl~~~~~i~~~~~V~~f~~E~~ll~~f~~~i~ 163 (254) T pfam03104 85 PLRVLSFDIECTSLKGKFPENEKDPIIQISCMLYREGEPDPI-EDKLFTLKSCALIEIGGEVYIFPSEKELLRRFFEFIR 163 (254) T ss_pred CCEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEECCCCCCC-CEEEEEECCCCCCCCCCEEEEECCHHHHHHHHHHHHH T ss_conf 853999988958889999987898389999898856789863-2069980787767789779995999999999999987 Q ss_pred --CCCCHHHHCCC-CHHHHHHHH Q ss_conf --22100110032-123788986 Q gi|254780277|r 459 --NEHFLKIGHNI-KYDKLVLHR 478 (976) Q Consensus 459 --~~~i~kIgHN~-KfD~~~L~~ 478 (976) +|++. +|||+ .||+..|.. T Consensus 164 ~~dpdii-~gyNi~~FD~~yl~~ 185 (254) T pfam03104 164 QYDPDII-TGYNGDNFDWPYILN 185 (254) T ss_pred HCCCCEE-EECCCCCCCHHHHHH T ss_conf 5199899-966787788899999 No 186 >pfam00857 Isochorismatase Isochorismatase family. This family are hydrolase enzymes. Probab=77.86 E-value=4.6 Score=19.04 Aligned_cols=116 Identities=16% Similarity=0.216 Sum_probs=67.2 Q ss_pred EEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECC------------------ Q ss_conf 89996626999887426754468988550069999999999999848554798799997274------------------ Q gi|254780277|r 7 LFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDYP------------------ 68 (976) Q Consensus 7 ~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~------------------ 68 (976) |++||=-+-+.+. .+. . .... ..+..+.++++..+.... .++.+-+.. T Consensus 3 LlvID~Q~~f~~~---~~~---~---~~~~---~~i~~i~~li~~ar~~~~--pVi~~~~~~~~~~~~~~~~~~~~~~~~ 68 (172) T pfam00857 3 LLVIDMQNGFFSG---VPK---V---AGIA---AVLENISRLLEAARAAGI--PVIFTRHVHEPDDADDALKDRPSGAFP 68 (172) T ss_pred EEEEECCCHHCCC---CCC---C---CCHH---HHHHHHHHHHHHHHHCCC--EEEEEEECCCCCCCCHHHCCCCCCCCC T ss_conf 9999684101076---767---5---6889---999999999999998699--199987437898621010135766566 Q ss_pred CCCHHHHCCHHHHCCC------CCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEEC Q ss_conf 8871324086521667------9879758864899999999709979956890166589999999984899499986 Q gi|254780277|r 69 AVTFRNEIYPDYKANR------PKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIS 139 (976) Q Consensus 69 ~~tfR~~~~~~YKa~R------~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s 139 (976) .+|+-.++.|+.+..- |..+..+.. -+.++|...||..+..-|.++|=++-+-+..+.+.|++|++++ T Consensus 69 ~gt~~~el~~~l~~~~~d~vi~K~~~saF~~---~L~~~L~~~~i~~lii~G~~t~~CV~~Ta~~a~~~g~~v~vv~ 142 (172) T pfam00857 69 PGTWGAELVPELAPLPGDLVVDKTRFSAFAT---DLDEILRELGIDTLVLAGVATDVCVLSTARDAFDRGYEVVVVS 142 (172) T ss_pred CCCCHHHHHHHHCCCCCCEEEECCCCCCCHH---HHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCEEEEEC T ss_conf 9996777237447889998997786777564---8999999859999999503648699999999998898799957 No 187 >COG1949 Orn Oligoribonuclease (3'-5' exoribonuclease) [RNA processing and modification] Probab=76.59 E-value=3.2 Score=20.15 Aligned_cols=30 Identities=17% Similarity=0.087 Sum_probs=22.9 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCC Q ss_conf 685232332224331100124420102577 Q gi|254780277|r 387 IGSASFKIITDTMDAFHSKPIAIAISTLDQ 416 (976) Q Consensus 387 ~~~ia~DtETt~ld~~~~~ivGisls~~~~ 416 (976) ...+=+|+|-|||||....|+-|+.-.-++ T Consensus 6 ~nLiWIDlEMTGLd~~~drIIEiA~iVTD~ 35 (184) T COG1949 6 NNLIWIDLEMTGLDPERDRIIEIATIVTDA 35 (184) T ss_pred CCEEEEEEEECCCCCCCCEEEEEEEEEECC T ss_conf 862899632026886764278888877547 No 188 >KOG2304 consensus Probab=75.91 E-value=2.2 Score=21.42 Aligned_cols=55 Identities=16% Similarity=0.337 Sum_probs=42.1 Q ss_pred HHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHH-----HHHHHHHHHHCCCEECCCCCEE Q ss_conf 5135644077703567881999899999999999988889999-----9999999998498882778760 Q gi|254780277|r 803 TINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEY-----IEKTKNFVRQNGYVETIFGRRI 867 (976) Q Consensus 803 ~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~-----~~~~~~~a~~~gyv~T~~GRrr 867 (976) ++-||--|-||||-|+.-.|. +-.+-+++.|++.||+...| +++.+.+. -+||++ T Consensus 231 aMklGagyPMGPfEL~DyvGL--Dt~kfvmdgwhe~~pe~~~f~psPll~klVaeg--------klGrKt 290 (298) T KOG2304 231 AMKLGAGYPMGPFELADYVGL--DTCKFVMDGWHEGYPEDSLFAPSPLLDKLVAEG--------KLGRKT 290 (298) T ss_pred HHHCCCCCCCCHHHHHHHHHH--HHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCC--------CCCCCC T ss_conf 985258998870888777437--999999999974287643358874899998635--------446535 No 189 >COG0353 RecR Recombinational DNA repair protein (RecF pathway) [DNA replication, recombination, and repair] Probab=75.75 E-value=1.3 Score=23.07 Aligned_cols=46 Identities=24% Similarity=0.382 Sum_probs=23.7 Q ss_pred CCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 58998776360249989841367899986454200112232200357899976 Q gi|254780277|r 191 DNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETARLS 243 (976) Q Consensus 191 DnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~ls 243 (976) |.+.-+||||+|||.++-=. +++.-++ ...++.++|.+-++.+..+ T Consensus 12 ~~l~kLPGvG~KsA~R~Afh------LL~~~~~-~~~~la~al~~a~~~i~~C 57 (198) T COG0353 12 DALKKLPGVGPKSAQRLAFH------LLQRDRE-DVERLAKALLEAKENIKHC 57 (198) T ss_pred HHHHHCCCCCHHHHHHHHHH------HHCCCHH-HHHHHHHHHHHHHHCCCCC T ss_conf 99976899883279999999------9735998-9999999999999738635 No 190 >cd06167 LabA_like LabA_like proteins. A well conserved group of bacterial proteins with no defined function. LabA, a member from Synechococcus elongatus PCC 7942, has been shown to play a role in cyanobacterial circadian timing. It is required for negative feedback regulation of the autokinase/autophosphatase KaiC, a central component of the circadian clock system. In particular, LabA seems necessary for KaiC-dependent repression of gene expression. Probab=75.31 E-value=5.4 Score=18.57 Aligned_cols=52 Identities=23% Similarity=0.265 Sum_probs=33.0 Q ss_pred HHHHHHHCCCCEEEECCE-------EHHHHHHHHHH-HHHHCCC-CEEEECCCCHHHEECC Q ss_conf 999999709979956890-------16658999999-9984899-4999868701212237 Q gi|254780277|r 98 VRLATQAFGIPAIEIQGF-------EADDIIATYTY-IAEKEGF-AVTIISTDKDLMQLVS 149 (976) Q Consensus 98 i~~~l~~~gi~~~~~~~~-------EADDviatla~-~~~~~~~-~v~i~s~DkD~~Qlv~ 149 (976) ..+.|...|+.....+.. -+|=-|++=+- .+....+ .++++|||.||..++. T Consensus 57 ~~~~L~~~g~~~~~~~~~~~~~~kk~~Dv~la~D~~~~a~~~~~D~~vLvSgD~Df~p~v~ 117 (149) T cd06167 57 FLDALRRLGFEPIQKPLRTRGSGKKGVDVALAIDALELAYKRRIDTIVLVSGDSDFVPLVE 117 (149) T ss_pred HHHHHHHCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHH T ss_conf 9999997598479961130477666420999999999961389998999957772799999 No 191 >PRK08306 dipicolinate synthase subunit A; Reviewed Probab=75.02 E-value=5.4 Score=18.52 Aligned_cols=18 Identities=28% Similarity=0.202 Sum_probs=9.1 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 07887555665453531788776 Q gi|254780277|r 637 LKFPSGAKTKTGQWKTTAQDLEQ 659 (976) Q Consensus 637 LgLp~~kKTktG~~ST~~~~Le~ 659 (976) +|||... .|.|..+.|-+ T Consensus 263 pgLPGk~-----APkTag~Ila~ 280 (296) T PRK08306 263 PGLPGKV-----APKTAGQILAN 280 (296) T ss_pred CCCCCCC-----CHHHHHHHHHH T ss_conf 8999731-----71539999999 No 192 >COG1029 FwdB Formylmethanofuran dehydrogenase subunit B [Energy production and conversion] Probab=74.69 E-value=1.7 Score=22.15 Aligned_cols=49 Identities=24% Similarity=0.438 Sum_probs=35.7 Q ss_pred EEHHHHHHHHH---CCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHH-CCC Q ss_conf 10377787340---69888999998980999688998888630251356440-777 Q gi|254780277|r 762 AKITPLYQVFE---NSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIY-GIS 813 (976) Q Consensus 762 s~D~~l~~af~---~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~Y-G~~ 813 (976) ..|--|+.|++ +|.-.|.. -+++|+|.+++ .+.-..-|..+||.|| ||| T Consensus 203 ~sDyelisAl~~~l~G~~~~~~--eev~gvp~~~i-~e~a~~mKna~Fg~if~GlG 255 (429) T COG1029 203 NSDYELISALRAALHGKEPHRS--EEVAGVPIEEI-EELADMMKNAKFGAIFVGLG 255 (429) T ss_pred CCCHHHHHHHHHHHCCCCCCCC--HHHCCCCHHHH-HHHHHHHHCCCCCEEEEECC T ss_conf 9867999999998558999872--42148798999-99999986398124998302 No 193 >pfam02371 Transposase_20 Transposase IS116/IS110/IS902 family. Transposases are needed for efficient transposition of the insertion sequence or transposon DNA. This family includes transposases for IS116, IS110 and IS902. This region is often found with pfam01548. The exact function of this region is uncertain. This family contains a HHH motif suggesting a DNA-binding function. Probab=73.83 E-value=2 Score=21.62 Aligned_cols=31 Identities=42% Similarity=0.562 Sum_probs=21.9 Q ss_pred CCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 89987763602499898413678999864542 Q gi|254780277|r 192 NIPGIPGIGYKTAALLLQEYGNLENILINASR 223 (976) Q Consensus 192 nipGv~GiG~KtA~~ll~~~gsle~i~~~~d~ 223 (976) .+--|||||+.||+.|+.+.|.++ =+.+.++ T Consensus 3 ~L~sipGiG~~~a~~l~aeigd~~-rF~~~~~ 33 (87) T pfam02371 3 LLLSIPGIGPITAAALLAEIGDIS-RFKSARQ 33 (87) T ss_pred HHHCCCCCCHHHHHHHHHHHCCHH-HCCCHHH T ss_conf 342699952999999999929853-2789999 No 194 >KOG0917 consensus Probab=73.66 E-value=5.8 Score=18.29 Aligned_cols=22 Identities=27% Similarity=0.443 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2799999998788898865320 Q gi|254780277|r 666 PIIKNILEWRQLSKIKSTYADS 687 (976) Q Consensus 666 p~~~~iLeyR~l~KL~sTY~~~ 687 (976) |+..-+|..|++.|+|+|||-. T Consensus 127 el~e~~l~~rKYAkWKat~I~~ 148 (338) T KOG0917 127 ELTEENLKHRKYAKWKATYIHN 148 (338) T ss_pred CCCHHHHHHHHHHHHHHHHHHH T ss_conf 8985887554776778999999 No 195 >PRK13844 recombination protein RecR; Provisional Probab=73.38 E-value=1.6 Score=22.41 Aligned_cols=19 Identities=26% Similarity=0.471 Sum_probs=14.2 Q ss_pred CCCCCCCCCCCHHHHHHHH Q ss_conf 5899877636024998984 Q gi|254780277|r 191 DNIPGIPGIGYKTAALLLQ 209 (976) Q Consensus 191 DnipGv~GiG~KtA~~ll~ 209 (976) +.+.-.||||+|||.++.- T Consensus 15 ~~l~kLPGIG~KsA~Rla~ 33 (200) T PRK13844 15 ESLRKLPTIGKKSSQRLAL 33 (200) T ss_pred HHHHHCCCCCHHHHHHHHH T ss_conf 9981689987889999999 No 196 >PRK00254 ski2-like helicase; Provisional Probab=73.31 E-value=2.5 Score=21.01 Aligned_cols=12 Identities=8% Similarity=0.135 Sum_probs=7.0 Q ss_pred HHHHHHHHHHCC Q ss_conf 999999965101 Q gi|254780277|r 941 SQIIVHSMEKAC 952 (976) Q Consensus 941 ~~~~~~~Me~~~ 952 (976) .++.++++|++. T Consensus 687 ~~~a~~i~~~~~ 698 (717) T PRK00254 687 AKIVEGIFKHLG 698 (717) T ss_pred HHHHHHHHHHHC T ss_conf 899999999838 No 197 >PRK01216 DNA polymerase IV; Validated Probab=73.27 E-value=3.9 Score=19.55 Aligned_cols=12 Identities=42% Similarity=0.714 Sum_probs=4.7 Q ss_pred CCCCCCHHHHHH Q ss_conf 776360249989 Q gi|254780277|r 196 IPGIGYKTAALL 207 (976) Q Consensus 196 v~GiG~KtA~~l 207 (976) +||||+||+.+| T Consensus 183 i~GIG~~t~~~L 194 (351) T PRK01216 183 VPGIGKVLSERL 194 (351) T ss_pred ECCCCHHHHHHH T ss_conf 548579999999 No 198 >PRK08097 ligB NAD-dependent DNA ligase LigB; Reviewed Probab=72.37 E-value=6.2 Score=18.08 Aligned_cols=13 Identities=8% Similarity=0.685 Sum_probs=9.2 Q ss_pred ECCHHHHHHHHHH Q ss_conf 4799999999999 Q gi|254780277|r 371 IVNTKDIQQWVQK 383 (976) Q Consensus 371 I~~~~~L~~li~~ 383 (976) +-+.+++..|+.. T Consensus 105 ~~~~~el~~~~~~ 117 (563) T PRK08097 105 LADKQALARWMRG 117 (563) T ss_pred CCCHHHHHHHHHC T ss_conf 1898999999841 No 199 >COG4987 CydC ABC-type transport system involved in cytochrome bd biosynthesis, fused ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones] Probab=72.36 E-value=6.3 Score=18.08 Aligned_cols=10 Identities=10% Similarity=0.401 Sum_probs=6.2 Q ss_pred HHHCCCCHHH Q ss_conf 1100321237 Q gi|254780277|r 464 KIGHNIKYDK 473 (976) Q Consensus 464 kIgHN~KfD~ 473 (976) .+.||+-|-+ T Consensus 80 lvsH~Atfrv 89 (573) T COG4987 80 LVSHDATFRV 89 (573) T ss_pred HHHHHHHHHH T ss_conf 8634899999 No 200 >TIGR01169 rplA_bact ribosomal protein L1; InterPro: IPR005878 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L1 is the largest protein from the large ribosomal subunit. In Escherichia coli, L1 is known to bind to the 23S rRNA. This model describe s bacterial and chloroplast ribosomal protein L1. Most mitochondrial L1 sequences are sufficiently divergent to be the contained in a different entry (IPR005879 from INTERPRO).; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0006412 translation, 0015934 large ribosomal subunit. Probab=72.20 E-value=1.6 Score=22.30 Aligned_cols=40 Identities=25% Similarity=0.405 Sum_probs=27.9 Q ss_pred HHHHHHHHHHHHCCCCCEECCCCCCCCC-CCC-----------CCCCCCCCCCCEEEECC Q ss_conf 0012333453300222100000133343-334-----------55633222243024248 Q gi|254780277|r 693 NKRTQRVHTFYSLASTMTGRLASLEPNL-QNI-----------PIKTDLGQKIRRAFISP 740 (976) Q Consensus 693 ~~~dgRIH~~f~q~~t~TGRlSs~~PNl-QNi-----------P~r~~~g~~iR~~F~a~ 740 (976) .+++|-||.++ ||.|-++=.| .|+ |+ +.-|+-||++.++. T Consensus 163 ~DK~G~iH~~~-------GK~SF~~~~L~eN~~Al~~~i~k~KP~-~~KG~Yik~~~lSs 214 (227) T TIGR01169 163 ADKAGNIHAPI-------GKVSFDEEKLKENLEALLDAIKKAKPS-GAKGQYIKNIALSS 214 (227) T ss_pred ECCCEEEEEEE-------CCCCCCHHHHHHHHHHHHHHHHHHCCC-CCCCEEEEEEEEEE T ss_conf 15851587775-------266688899999999999999850883-01230788889840 No 201 >PRK03352 DNA polymerase IV; Validated Probab=71.54 E-value=6.5 Score=17.95 Aligned_cols=68 Identities=16% Similarity=0.146 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHCCCE-ECCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 99999999999984988-82778760189875883788877878770134565389999999999999998659981899 Q gi|254780277|r 843 HEYIEKTKNFVRQNGYV-ETIFGRRIHYDEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATKML 921 (976) Q Consensus 843 ~~~~~~~~~~a~~~gyv-~T~~GRrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~~i 921 (976) ..+.+++.+..++.|.. +|+.=..|+ .+..+.. |+ ..+..|.+- ..+|.+.|+-.+ +.+. .+...|++ T Consensus 267 ~~l~~~~~~rlr~~~~~~~~i~v~ir~-~~f~~~t---r~----~tL~~pT~~-~~~i~~~a~~Ll-~~~~-~~~~vRli 335 (345) T PRK03352 267 TELAQRVLEEVVAEGRVVTRVAVTVRT-ATFYTRT---KI----RKLPAPTTD-PDVITAAALDVL-DLFE-LDRPVRLL 335 (345) T ss_pred HHHHHHHHHHHHHCCCCEEEEEEEEEE-CCCCEEE---EE----EECCCCCCC-HHHHHHHHHHHH-HHCC-CCCCEEEE T ss_conf 999999999999759963089999991-8983579---99----988865599-999999999999-8426-79978899 No 202 >cd01012 YcaC_related YcaC related amidohydrolases; E.coli YcaC is an homooctameric hydrolase with unknown specificity. Despite its weak sequence similarity, it is structurally related to other amidohydrolases and shares conserved active site residues with them. Multimerisation interface seems not to be conserved in all members. Probab=71.51 E-value=6.5 Score=17.95 Aligned_cols=114 Identities=19% Similarity=0.193 Sum_probs=66.7 Q ss_pred EEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEC---CCCCHH--HHCCHHHH Q ss_conf 8999662699988742675446898855006999999999999984855479879999727---488713--24086521 Q gi|254780277|r 7 LFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVIFDY---PAVTFR--NEIYPDYK 81 (976) Q Consensus 7 ~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~---~~~tfR--~~~~~~YK 81 (976) |++||=-.-++.. .+ ++ -..+..+.++++..+.-..| ++..-.. .|+++. .+..|+.. T Consensus 2 LlvID~Q~~f~~~---v~------~~------~~iv~ni~~Li~~ar~~~~p--Vi~t~~~~~~~g~~~~el~~~~~~~~ 64 (157) T cd01012 2 LLLVDVQEKLAPA---IK------SF------DELINNTVKLAKAAKLLDVP--VILTEQYPKGLGPTVPELREVFPDAP 64 (157) T ss_pred EEEEECHHHHHCC---CC------CH------HHHHHHHHHHHHHHHHCCCC--EEEEEECCCCCCCCCHHHHHCCCCCE T ss_conf 9999774445313---57------88------99999999999999984999--89998788888885765672499984 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEEC Q ss_conf 6679879758864899999999709979956890166589999999984899499986 Q gi|254780277|r 82 ANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIS 139 (976) Q Consensus 82 a~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s 139 (976) -=.|..+..+.. +-+.+.|...||..+-.-|++.|=++-+-|..+...||+|++++ T Consensus 65 vi~K~~~saf~~--t~L~~~L~~~gi~~lvi~G~~T~~CV~~Ta~da~~~Gy~v~vv~ 120 (157) T cd01012 65 VIEKTSFSCWED--EAFRKALKATGRKQVVLAGLETHVCVLQTALDLLEEGYEVFVVA 120 (157) T ss_pred EEECCCCCCCCC--CHHHHHHHHCCCCEEEEEEEECCHHHHHHHHHHHHCCCEEEEEC T ss_conf 577781478378--13999998629998999998542579997999998699699946 No 203 >PRK13910 DNA glycosylase MutY; Provisional Probab=71.36 E-value=4.9 Score=18.86 Aligned_cols=36 Identities=31% Similarity=0.440 Sum_probs=24.5 Q ss_pred HHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH-HCC Q ss_conf 9998884-8988787777875187435899877636024998984-136 Q gi|254780277|r 166 EHVIKKW-GIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQ-EYG 212 (976) Q Consensus 166 ~~v~~~~-gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~-~~g 212 (976) ..+.++| |.-|..+-+++ -.||||+-||...+. -|+ T Consensus 58 ~~i~~~~~g~~P~~~~~L~-----------~LPGIG~yTA~AI~siaf~ 95 (290) T PRK13910 58 EICVKEHHSQLPNDYQSLL-----------KLPGIGAYTANAILCFGFR 95 (290) T ss_pred HHHHHHHCCCCCCHHHHHH-----------HCCCCCCHHHHHHHHHHCC T ss_conf 9999983898985299997-----------5889982699999998647 No 204 >PRK00076 recR recombination protein RecR; Reviewed Probab=69.97 E-value=2.1 Score=21.49 Aligned_cols=17 Identities=41% Similarity=0.638 Sum_probs=13.2 Q ss_pred CCCCCCCCCCHHHHHHH Q ss_conf 89987763602499898 Q gi|254780277|r 192 NIPGIPGIGYKTAALLL 208 (976) Q Consensus 192 nipGv~GiG~KtA~~ll 208 (976) .+.-.||||+|||..+. T Consensus 12 ~l~kLPGIG~KsA~Rla 28 (197) T PRK00076 12 ALRKLPGIGPKSAQRLA 28 (197) T ss_pred HHHHCCCCCHHHHHHHH T ss_conf 99878999889999999 No 205 >KOG2534 consensus Probab=68.75 E-value=3.2 Score=20.24 Aligned_cols=14 Identities=14% Similarity=0.551 Sum_probs=6.2 Q ss_pred EEEEECCCCCEEEE Q ss_conf 99963265443687 Q gi|254780277|r 745 LISADYSQIELRIL 758 (976) Q Consensus 745 lv~aDySQiElRil 758 (976) |+.+-|+|+-.-+| T Consensus 279 ivv~P~~~~~~alL 292 (353) T KOG2534 279 IVVCPYDEFGFALL 292 (353) T ss_pred EEEECHHHCCEEEE T ss_conf 89966577123456 No 206 >PRK02362 ski2-like helicase; Provisional Probab=68.73 E-value=7.4 Score=17.54 Aligned_cols=11 Identities=36% Similarity=0.471 Sum_probs=4.6 Q ss_pred CCCEEEECCCC Q ss_conf 99499986870 Q gi|254780277|r 132 GFAVTIISTDK 142 (976) Q Consensus 132 ~~~v~i~s~Dk 142 (976) |..|-+.|||- T Consensus 94 gi~V~~~tGd~ 104 (736) T PRK02362 94 GLRVGISTGDY 104 (736) T ss_pred CCEEEEEECCC T ss_conf 99899980898 No 207 >pfam01936 DUF88 Protein of unknown function DUF88. This highly conserved bacterial protein has no known function. The alignment contains many conserved aspartates, suggesting an enzymatic function such as an endonuclease or glycosyl hydrolase (Bateman A pers. obs). Probab=68.43 E-value=7.5 Score=17.49 Aligned_cols=52 Identities=19% Similarity=0.115 Sum_probs=33.9 Q ss_pred HHHHHHHCCCCEEEECC-----EEHHHHHHHHHHHHH--HCCCCEEEECCCCHHHEECC Q ss_conf 99999970997995689-----016658999999998--48994999868701212237 Q gi|254780277|r 98 VRLATQAFGIPAIEIQG-----FEADDIIATYTYIAE--KEGFAVTIISTDKDLMQLVS 149 (976) Q Consensus 98 i~~~l~~~gi~~~~~~~-----~EADDviatla~~~~--~~~~~v~i~s~DkD~~Qlv~ 149 (976) .++.|...||.+...+. --+|--|+.=+-... ..--.++|+|||.||..++. T Consensus 49 ~~~~L~~~Gi~v~~~~~~~~~Kn~~D~~l~vD~~~~~~~~~~d~~ilvsgD~Df~~~~~ 107 (140) T pfam01936 49 FPDALSSTGIPVQHKPLTKSGKNAVDVGLAVDALELAYDNNPDTFVLVSGDGDFAPLLE 107 (140) T ss_pred HHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHH T ss_conf 99999986971899455158765034999999999973389998999926740799999 No 208 >pfam01927 DUF82 Protein of unknown function DUF82. This prokaryotic protein family has no known function. The protein contains four conserved cysteines that may be involved in metal binding or disulphide bridges. Probab=67.57 E-value=7.8 Score=17.37 Aligned_cols=75 Identities=21% Similarity=0.293 Sum_probs=33.0 Q ss_pred HHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECCCCEEEEECCCCCEECHHHHHHHHCCCH Q ss_conf 99999997099799568901665899999999848994999868701212237975999667883538999988848988 Q gi|254780277|r 97 LVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVSPTTCLYDTVKEEKIDIEHVIKKWGIPP 176 (976) Q Consensus 97 ~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~~~~~~~~~~~~~~v~~~~gv~p 176 (976) -+-..|..+|+-..-..+++ |+ ++...|..++- +|+|.|+.|++=....-.++-...+..--..+|.+.||+.| T Consensus 11 ~LAr~LR~lG~Dt~~~~~~~-D~---~l~~~A~~E~R--ilLTrD~~L~~r~~~~~~~ll~~~~~~~Ql~ev~~~~~l~~ 84 (146) T pfam01927 11 KLARWLRLLGYDTLYDNDFE-DD---ELLAIAAKEGR--ILLTRDRGLLKRRLLTGGVLVRSLDPEEQLREVIERFGLKL 84 (146) T ss_pred HHHHHHHHCCCCEEECCCCC-HH---HHHHHHHHCCC--EEEECCHHHHHHCCCCCEEEECCCCHHHHHHHHHHHCCCCC T ss_conf 99999998698870159998-59---99999866498--99967889997332378499789999999999999669974 Q ss_pred H Q ss_conf 7 Q gi|254780277|r 177 E 177 (976) Q Consensus 177 ~ 177 (976) . T Consensus 85 ~ 85 (146) T pfam01927 85 S 85 (146) T ss_pred C T ss_conf 3 No 209 >PRK13639 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=67.49 E-value=7.8 Score=17.36 Aligned_cols=13 Identities=31% Similarity=0.355 Sum_probs=6.5 Q ss_pred CCHHHHHHHHHHH Q ss_conf 9989999999999 Q gi|254780277|r 823 IPRSEAADYIKRY 835 (976) Q Consensus 823 is~~eA~~~i~~y 835 (976) .|.+||++.+.+| T Consensus 260 ~t~~e~~~~~~~~ 272 (275) T PRK13639 260 YTIGEARRNIKEF 272 (275) T ss_pred CCHHHHHHHHHHH T ss_conf 4699999999998 No 210 >COG1936 Predicted nucleotide kinase (related to CMP and AMP kinases) [Nucleotide transport and metabolism] Probab=66.60 E-value=2.9 Score=20.48 Aligned_cols=20 Identities=50% Similarity=0.878 Sum_probs=16.6 Q ss_pred CCCCCCCCCHHHHHHHHHCC Q ss_conf 99877636024998984136 Q gi|254780277|r 193 IPGIPGIGYKTAALLLQEYG 212 (976) Q Consensus 193 ipGv~GiG~KtA~~ll~~~g 212 (976) |.|-||+|+-|++++|.++| T Consensus 5 ITGTPGvGKTT~~~~L~~lg 24 (180) T COG1936 5 ITGTPGVGKTTVCKLLRELG 24 (180) T ss_pred EECCCCCCHHHHHHHHHHHC T ss_conf 93799986687999999829 No 211 >KOG0687 consensus Probab=66.32 E-value=8.2 Score=17.20 Aligned_cols=55 Identities=29% Similarity=0.333 Sum_probs=41.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 56776322210001345788607501233221014310058999986434368999999999740 Q gi|254780277|r 552 LWLLLRPKLIVEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDL 616 (976) Q Consensus 552 L~~~l~~~L~~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~ 616 (976) ..+.+...+.++++..+|.-+ -|.-++.+|.+.|..+.+.=+.+++++.++|.+. T Consensus 40 ~~e~l~~~Ird~~Map~Ye~l----------ce~~~i~~D~~~l~~m~~~neeki~eld~~ieda 94 (393) T KOG0687 40 AREKLLAAIRDEDMAPLYEYL----------CESLVIKLDQDLLNSMKKANEEKIKELDEKIEDA 94 (393) T ss_pred HHHHHHHHHHHCCCCHHHHHH----------HHHCCEECCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999998844640699999----------8543550049999999985288999999999999 No 212 >COG1569 Predicted nucleic acid-binding protein, contains PIN domain [General function prediction only] Probab=66.02 E-value=5.6 Score=18.41 Aligned_cols=31 Identities=16% Similarity=0.329 Sum_probs=20.7 Q ss_pred EEECCCCHHHEECCCCEEEEECCCCCEECHHHHHHHH Q ss_conf 9986870121223797599966788353899998884 Q gi|254780277|r 136 TIISTDKDLMQLVSPTTCLYDTVKEEKIDIEHVIKKW 172 (976) Q Consensus 136 ~i~s~DkD~~Qlv~~~v~~~~~~~~~~~~~~~v~~~~ 172 (976) +|+|||+|++-|=++|- +-++.+|.++.+++ T Consensus 107 ~lvTgD~dLL~lr~~n~------~~~Iltp~eF~~~~ 137 (142) T COG1569 107 YLVTGDQDLLVLRDENK------RVEILTPQEFLEEV 137 (142) T ss_pred EEEECCHHHHEECCCCC------CEEECCHHHHHHHH T ss_conf 89972615501036678------57973799999999 No 213 >pfam11798 IMS_HHH IMS family HHH motif. These proteins are involved in UV protection, eg. Probab=65.94 E-value=3.5 Score=19.94 Aligned_cols=20 Identities=55% Similarity=0.795 Sum_probs=16.3 Q ss_pred CCCCCCCCCCHHHHHHHHHCC Q ss_conf 899877636024998984136 Q gi|254780277|r 192 NIPGIPGIGYKTAALLLQEYG 212 (976) Q Consensus 192 nipGv~GiG~KtA~~ll~~~g 212 (976) .|--+||||.||+.+ |+.+| T Consensus 13 pi~~i~GIG~~~~~~-L~~~g 32 (33) T pfam11798 13 PISKIPGIGRKTAEK-LEALG 32 (33) T ss_pred CCCCCCCCCHHHHHH-HHHCC T ss_conf 622168866678999-99826 No 214 >KOG2144 consensus Probab=64.49 E-value=1.5 Score=22.58 Aligned_cols=54 Identities=22% Similarity=0.448 Sum_probs=29.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCC--C-CCEECCCCCCCCCCCCCCCC---CCCCCCCEEEECC Q ss_conf 53200111100123334533002--2-21000001333433345563---3222243024248 Q gi|254780277|r 684 YADSLPNHINKRTQRVHTFYSLA--S-TMTGRLASLEPNLQNIPIKT---DLGQKIRRAFISP 740 (976) Q Consensus 684 Y~~~l~~~i~~~dgRIH~~f~q~--~-t~TGRlSs~~PNlQNiP~r~---~~g~~iR~~F~a~ 740 (976) |.+.....++.+ .|||-. |.+ | +..+.||||+||-- |-.-+ +.-++||+||++| T Consensus 193 ~A~eylp~l~yk-KrihLm-npMvPGL~q~~KMSsSd~~Sk-IdllD~~~~V~kKI~kAfCeP 252 (360) T KOG2144 193 LAEEYLPDLGYK-KRIHLM-NPMVPGLAQGEKMSSSDPLSK-IDLLDEPADVNKKIKKAFCEP 252 (360) T ss_pred HHHHHHHHHCCC-CCEEEC-CCCCCCCCCCCCCCCCCCCCC-CCCCCCHHHHHHHHHHHCCCC T ss_conf 999861541766-552323-777855344576666785232-311169899999999743898 No 215 >PRK10880 adenine DNA glycosylase; Provisional Probab=64.04 E-value=8.6 Score=17.08 Aligned_cols=34 Identities=32% Similarity=0.499 Sum_probs=19.7 Q ss_pred HHHHHHHH-CCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 99998884-8988787777875187435899877636024998984 Q gi|254780277|r 165 IEHVIKKW-GIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQ 209 (976) Q Consensus 165 ~~~v~~~~-gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~ 209 (976) ...|.++| |.-|..+-++++ .||||+-||..++. T Consensus 93 A~~i~~~~~G~~P~~~~~L~~-----------LPGIG~yTA~AI~s 127 (350) T PRK10880 93 AQQVATLHGGKFPETFEEVAA-----------LPGVGRSTAGAILS 127 (350) T ss_pred HHHHHHHCCCCCCCHHHHHHH-----------CCCCCHHHHHHHHH T ss_conf 999999758989825999862-----------66887279999999 No 216 >cd01701 Pol_zeta Pol_zeta, a member of the Y-family of DNA polymerases. Y-family polymerases can transverse normal replication-blocking DNA lesions, such as cyclobutane pyrimidine dimers resulting from UV damage, at higher error rate. The Y-family has no 3'-5' exonuclease activity. Pol zeta has also been named Rev1; this subfamily is mainly present in eukaryotes. Probab=63.65 E-value=9.2 Score=16.86 Aligned_cols=127 Identities=12% Similarity=0.165 Sum_probs=53.9 Q ss_pred HHHHHHH--CCCCHHHHHHHHHHCCCHHHCCHH--HHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHH Q ss_conf 7787340--698889999989809996889988--886302513564407770356788199989999999999998888 Q gi|254780277|r 766 PLYQVFE--NSLDIHRMVAAEIFGVNIEKVSPQ--MRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPG 841 (976) Q Consensus 766 ~l~~af~--~g~DiH~~tA~~vf~~~~~~vt~~--~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~ 841 (976) .|...|- .|..++.. ..|++...|+++ ..+..+..+||.-+ .+.+|...++ T Consensus 259 ~L~~~fG~k~G~~l~~~----a~GiD~~~V~~~~~~KSIs~e~Tf~~d~------------~~~~ei~~~l--------- 313 (405) T cd01701 259 KLQKVFGKKTGEKLYNY----CRGIDDRPVEGEKERKSVSAEINYGIRF------------TTVEDVEQFL--------- 313 (405) T ss_pred HHHHHHHHHHHHHHHHH----HCCCCCCCCCCCCCCCCCEEEEECCCCC------------CCHHHHHHHH--------- T ss_conf 99999778999999999----6798998777789997315235689888------------9999999999--------- Q ss_pred HHHHHHHHHHHHHHCCCE-ECCC--CCEECCCC--CCCCCHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999984988-8277--87601898--75883788----877878770134565389999999999999998 Q gi|254780277|r 842 IHEYIEKTKNFVRQNGYV-ETIF--GRRIHYDE--INSPKSSI----RNINERAAINAPIQGSAADITRRAMISVHKSLE 912 (976) Q Consensus 842 v~~~~~~~~~~a~~~gyv-~T~~--GRrr~~p~--i~s~~~~~----r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~ 912 (976) .++-+++.+..++.|.. +|+. =++|+ ++ +....+.- ..+.....+..|.+ .+.+|.+.|. .+.+.+. T Consensus 314 -~~la~~v~~rLr~~~~~~~tVtlKi~~R~-~d~~~~~~ky~g~g~~~t~trs~tL~~pT~-~~~~I~~~a~-~Ll~~~~ 389 (405) T cd01701 314 -RRLSQEVCRRLEEIGVKGRQITLKLMVRA-PGAPVEPPKYMGHGICDSFSKSSTLGVATD-DFGVIGKEAK-ALFRDLK 389 (405) T ss_pred -HHHHHHHHHHHHHCCCEEEEEEEEEEEEC-CCCCCCCCCCCCCCCCCEEEEEEECCCCCC-CHHHHHHHHH-HHHHHHC T ss_conf -99999999999984981528999998753-776544310035666662331226787669-9999999999-9999706 Q ss_pred HCCCCEEEE Q ss_conf 659981899 Q gi|254780277|r 913 DHKLATKML 921 (976) Q Consensus 913 ~~~~~~~~i 921 (976) ..+.+.|+| T Consensus 390 ~~~~~iRli 398 (405) T cd01701 390 VPPKDLRGV 398 (405) T ss_pred CCCCCEEEE T ss_conf 689976789 No 217 >pfam07377 DUF1493 Protein of unknown function (DUF1493). This family consists of several bacterial proteins of around 115 residues in length. Members of this family seem to be found exclusively in Salmonella and Yersinia species and several have been described as being putative cytoplasmic proteins. The function of this family is unknown. Probab=63.65 E-value=8 Score=17.28 Aligned_cols=24 Identities=25% Similarity=0.648 Sum_probs=20.4 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHC Q ss_conf 567881999899999999999988 Q gi|254780277|r 816 RLATQLRIPRSEAADYIKRYFHRF 839 (976) Q Consensus 816 ~La~~l~is~~eA~~~i~~yf~~~ 839 (976) .|..+||++-++|.++|+.||++| T Consensus 31 ~L~~Dl~~~gdDa~elm~~ff~~F 54 (110) T pfam07377 31 DLDEDLGIDGDDAEELMEKFFKKF 54 (110) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHC T ss_conf 069875969799999999999982 No 218 >pfam11977 RNase_Zc3h12a Zc3h12a-like Ribonuclease domain. This domain is found in the Zc3h12a protein which has shown to be a ribonuclease that controls the stability of a set of inflammatory genes. It has been suggested that this domain belongs to the PIN domain superfamily. Probab=61.29 E-value=10 Score=16.57 Aligned_cols=104 Identities=19% Similarity=0.227 Sum_probs=55.5 Q ss_pred CEEEEEECCHHHHHHHHCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCE-EEEEEECCCCCHHHHCCHHHHCC Q ss_conf 72899966269998874267544689885500699999999999998485547987-99997274887132408652166 Q gi|254780277|r 5 NHLFLVDGSSFIYRAFYATPLLSRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASH-FAVIFDYPAVTFRNEIYPDYKAN 83 (976) Q Consensus 5 ~~~~liDg~~~~~ra~~a~p~~~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~-~~v~fD~~~~tfR~~~~~~YKa~ 83 (976) -+-++|||++..++ |+.... .. ++.|.-.+.-|.... | .+++|+ |+||.+.++ T Consensus 2 lR~IvIDGsNVA~~--hg~~~~------fs-------~~gI~~av~yf~~rG---h~~i~vf~---p~~r~~~~~----- 55 (155) T pfam11977 2 LRPIVIDGSNVAMS--HGNKEF------FS-------CRGLLLAVDYFRKRG---HEVITVFV---PNWRYKARD----- 55 (155) T ss_pred CCEEEECCHHHHHH--CCCCCC------CC-------HHHHHHHHHHHHHCC---CCEEEEEC---CHHHHHCCC----- T ss_conf 86189876488887--399984------50-------899999999999859---94599989---378731777----- Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEE--------HHHHHHHHHHHHHHCCCCEEEECCCC--HHHE Q ss_conf 798797588648999999997099799568901--------66589999999984899499986870--1212 Q gi|254780277|r 84 RPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFE--------ADDIIATYTYIAEKEGFAVTIISTDK--DLMQ 146 (976) Q Consensus 84 R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~E--------ADDviatla~~~~~~~~~v~i~s~Dk--D~~Q 146 (976) .+....++..|+ ...|+-++.-.+.- +|=.|-.+|.+ . .-+|||.|+ |+.+ T Consensus 56 ~~~~~~~~L~~L-------~~~~~l~~tPs~~~~g~~~~~yDDr~iL~~A~~---~--~g~IVSND~yRD~~~ 116 (155) T pfam11977 56 GKVTDQHELERL-------IRLGLIVFTPSRTLDGRRIVSYDDRFILELAEE---T--DGVIVSNDNFRDLAD 116 (155) T ss_pred CCCCCHHHHHHH-------HHCCCEEECCCCCCCCCCCCCCCHHHHHHHHHH---C--CCEEEECCHHHHHHH T ss_conf 888779999999-------978978974565678854333118999999998---3--999997826889987 No 219 >PRK02406 DNA polymerase IV; Validated Probab=61.09 E-value=9 Score=16.93 Aligned_cols=25 Identities=20% Similarity=0.205 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 389999999999999998659981899 Q gi|254780277|r 895 SAADITRRAMISVHKSLEDHKLATKML 921 (976) Q Consensus 895 saAdi~k~am~~i~~~~~~~~~~~~~i 921 (976) +..+|...|.-.+.+... +.+.|++ T Consensus 312 ~~~~i~~~a~~Ll~~~~~--~~~iRli 336 (355) T PRK02406 312 SAEDLEQLARQLLARRRG--GRGVRLL 336 (355) T ss_pred CHHHHHHHHHHHHHHHHC--CCCEEEE T ss_conf 999999999999974005--9977899 No 220 >PRK13647 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=60.67 E-value=10 Score=16.49 Aligned_cols=15 Identities=13% Similarity=0.372 Sum_probs=8.8 Q ss_pred CCCHHHHHHHHHHHH Q ss_conf 999899999999999 Q gi|254780277|r 822 RIPRSEAADYIKRYF 836 (976) Q Consensus 822 ~is~~eA~~~i~~yf 836 (976) -.+.+||.+++.+++ T Consensus 257 p~~~~~~~~~~~~~~ 271 (273) T PRK13647 257 PTNLKDARQIIRKFL 271 (273) T ss_pred CCCHHHHHHHHHHHH T ss_conf 987999999999985 No 221 >PRK10917 ATP-dependent DNA helicase RecG; Provisional Probab=60.46 E-value=6.4 Score=17.99 Aligned_cols=31 Identities=26% Similarity=0.507 Sum_probs=23.4 Q ss_pred CCCCCCCCCCCCHHHHHHHHHCC--CHHHHHHHH Q ss_conf 35899877636024998984136--789998645 Q gi|254780277|r 190 IDNIPGIPGIGYKTAALLLQEYG--NLENILINA 221 (976) Q Consensus 190 sDnipGv~GiG~KtA~~ll~~~g--sle~i~~~~ 221 (976) .+-|--++|||||+|.+| ++.| |+..++-+. T Consensus 7 ~~pI~~LKGVGpK~a~~L-~kLGI~TV~DLL~~f 39 (677) T PRK10917 7 DAPLTSLKGVGPKTAEKL-AKLGIHTVQDLLLHL 39 (677) T ss_pred CCCHHHCCCCCHHHHHHH-HHCCCCCHHHHHHCC T ss_conf 488755799789999999-976999499998519 No 222 >PRK07945 hypothetical protein; Provisional Probab=59.57 E-value=10 Score=16.52 Aligned_cols=90 Identities=17% Similarity=0.228 Sum_probs=44.2 Q ss_pred CCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHC-CCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCC Q ss_conf 898878777787518743589987763602499898413-6789998645420011223220035789997655432035 Q gi|254780277|r 173 GIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEY-GNLENILINASRIKQKKRRENILEYAETARLSRKLVMLRT 251 (976) Q Consensus 173 gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~-gsle~i~~~~d~i~~~k~~~~l~~~~e~a~ls~~L~tL~~ 251 (976) +..|.++.+-. ..+-+..+||||||||.-+-+-. |.+-.-++.+..-.++--.+.| .+.|+- T Consensus 36 ~~~~~~~~~~~-----~~~~~~~l~gig~~ta~vi~~a~~g~~p~~l~~l~~~~~~~~g~~l------------~~~LRG 98 (335) T PRK07945 36 ALDEAERARHG-----RAGSWQSLPGIGPKTAKVIAQAWAGRVPDYLAELRAAAEDLGGGEL------------RAALRG 98 (335) T ss_pred HCCHHHHHHHH-----HCCCCEECCCCCHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCHHHH------------HHHHHH T ss_conf 37999999987-----4498112788780589999999658982899999974469864899------------998630 Q ss_pred CCCCCCCHHHHCCCCCCHHHHHHHHHHHHHH Q ss_conf 6422476355212676768899865323368 Q gi|254780277|r 252 DVPITIPLEHLILEDCNGPRLISFLKALEFT 282 (976) Q Consensus 252 Dvpl~~~~~~l~~~~~d~~~l~~~~~~lef~ 282 (976) |+-+--+++| +....++..+...++++. T Consensus 99 DLH~HT~wSD---G~~sieeMa~aA~~lGye 126 (335) T PRK07945 99 DLHLHSDWSD---GGSPIEEMMATAAALGHE 126 (335) T ss_pred CCCCCCCCCC---CCCHHHHHHHHHHHCCCE T ss_conf 6533788457---978699999999983982 No 223 >cd01013 isochorismatase Isochorismatase, also known as 2,3 dihydro-2,3 dihydroxybenzoate synthase, catalyses the conversion of isochorismate, in the presence of water, to 2,3-dihydroxybenzoate and pyruvate, via the hydrolysis of a vinyl ether, an uncommon reaction in biological systems. Isochorismatase is part of the phenazine biosynthesis pathway. Phenazines are antimicrobial compounds that provide the competitive advantage for certain bacteria. Probab=59.07 E-value=11 Score=16.30 Aligned_cols=42 Identities=12% Similarity=-0.043 Sum_probs=24.8 Q ss_pred HHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEEC Q ss_conf 999999709979956890166589999999984899499986 Q gi|254780277|r 98 VRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIS 139 (976) Q Consensus 98 i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s 139 (976) +..+|.++||..+-.-|...+-++-+-++.+...||+|++++ T Consensus 133 L~~~Lr~~gi~~Lvv~Gv~T~~CV~~T~rdA~~~gy~v~vv~ 174 (203) T cd01013 133 LLERLKESGRDQLIITGVYAHIGCLSTAVDAFMRDIQPFVVA 174 (203) T ss_pred HHHHHHHCCCCEEEEEEEECCCHHHHHHHHHHHCCCEEEEEE T ss_conf 899999769998999865425139999999998799499931 No 224 >KOG3416 consensus Probab=58.27 E-value=7.8 Score=17.36 Aligned_cols=28 Identities=29% Similarity=0.440 Sum_probs=17.6 Q ss_pred CCCCCCCC-----------CCCCCCCCCCEEEECCC-CCE Q ss_conf 33433345-----------56332222430242489-948 Q gi|254780277|r 717 EPNLQNIP-----------IKTDLGQKIRRAFISPP-TKK 744 (976) Q Consensus 717 ~PNlQNiP-----------~r~~~g~~iR~~F~a~~-g~~ 744 (976) .|+||||- .+|..|+++|.|-++++ |.+ T Consensus 11 ~P~~kN~~v~fIvl~~g~~tkTkdg~~v~~~kVaD~TgsI 50 (134) T KOG3416 11 KPGLKNINVTFIVLEYGRATKTKDGHEVRSCKVADETGSI 50 (134) T ss_pred CHHHHCCEEEEEEEEECEEEECCCCCEEEEEEEECCCCEE T ss_conf 8234162489999861403013689888899980665528 No 225 >pfam10266 Strumpellin Hereditary spastic paraplegia protein strumpellin. This is a family of proteins conserved from plants to humans, in which two closely situated point mutations in the human protein lead to the condition of hereditary spastic paraplegia. Strumpellin contains one known domain called a spectrin repeat that consists of three alpha-helices of a characteristic length wrapped in a left-handed coiled coil. The spectrin proteins have multiple copies of this repeat, which can then form multimers in the cell. Spectrin associates with the cell membrane via spectrin repeats in the ankyrin protein. The spectrin repeat is a structural platform for cytoskeletal protein assemblies. Probab=58.11 E-value=2.6 Score=20.88 Aligned_cols=44 Identities=23% Similarity=0.304 Sum_probs=23.4 Q ss_pred HHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHCCHHHHCCCCCCCHHHHHH Q ss_conf 99999999984855479879999727488713240865216679879758864 Q gi|254780277|r 42 CNMLWKLLQNSRKESIASHFAVIFDYPAVTFRNEIYPDYKANRPKIPEMLLPQ 94 (976) Q Consensus 42 ~~~l~~~i~~~~~~~~~~~~~v~fD~~~~tfR~~~~~~YKa~R~~~p~~l~~q 94 (976) +.-+.++++...... +++++.-|-.-||+===+|-|.|+.+... T Consensus 157 ~d~vc~L~rsTg~~~---------~~~~~~~~p~~YPe~yf~R~p~~~~~i~~ 200 (1080) T pfam10266 157 IDEVCKLCRSTGYGA---------DSAYASKRPPNYPEEYFNRVPLPESFITM 200 (1080) T ss_pred HHHHHHHHHHCCCCC---------CCCCCCCCCCCCHHHHHHHCCCCHHHHHH T ss_conf 999999999738889---------98655579889969998607997999999 No 226 >PRK13634 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=57.20 E-value=12 Score=16.09 Aligned_cols=10 Identities=20% Similarity=0.069 Sum_probs=3.6 Q ss_pred CHHHHHHHHH Q ss_conf 9899999999 Q gi|254780277|r 824 PRSEAADYIK 833 (976) Q Consensus 824 s~~eA~~~i~ 833 (976) |.+|+.+.|- T Consensus 258 t~ee~~~~i~ 267 (276) T PRK13634 258 TLEELAHEVV 267 (276) T ss_pred CHHHHHHHHH T ss_conf 9999999999 No 227 >TIGR00706 SppA_dom signal peptide peptidase SppA, 36K type; InterPro: IPR004635 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidases belong to MEROPS peptidase family S49 (protease IV family, clan S-). The predicted active site serine for members of this family occurs in a transmembrane domain. This group of sequences represent both long and short forms of the bacterial SppA and homologs found in the archaea and plants. Signal peptides of secretory proteins seem to serve at least two important biological functions. First, they are required for protein targeting to and translocation across membranes, such as the eubacterial plasma membrane and the endoplasmic reticular membrane of eukaryotes. Second, in addition to their role as determinants for protein targeting and translocation, certain signal peptides have a signaling function. During or shortly after pre-protein translocation, the signal peptide is removed by signal peptidases. The integral membrane protein, SppA (protease IV), of Escherichia coli was shown experimentally to degrade signal peptides. The member of this family from Bacillus subtilis has only been shown to be required for efficient processing of pre-proteins under conditions of hyper-secretion . ; GO: 0008233 peptidase activity, 0006508 proteolysis. Probab=56.80 E-value=5.9 Score=18.24 Aligned_cols=59 Identities=24% Similarity=0.324 Sum_probs=30.3 Q ss_pred HHHHCCCCCCCCCCCCCCC---------CC--HHHHHHHHCCCC-HHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8742078875556654535---------31--788776530476-27999999987--8889886532001111001233 Q gi|254780277|r 633 LFTKLKFPSGAKTKTGQWK---------TT--AQDLEQINCGDN-PIIKNILEWRQ--LSKIKSTYADSLPNHINKRTQR 698 (976) Q Consensus 633 Lf~~LgLp~~kKTktG~~S---------T~--~~~Le~L~~~~h-p~~~~iLeyR~--l~KL~sTY~~~l~~~i~~~dgR 698 (976) |++++|+... --|||.|. |. ++.|+.+=.+.| .|++.+.+.|. ++. +.+.++ .||| T Consensus 124 L~~k~GI~~~-~iK~G~yKd~~~~~R~lt~eE~~~lQ~~v~~~Y~~F~~~V~~~R~nkl~~------~~vK~~---AdGR 193 (224) T TIGR00706 124 LLEKLGIEFE-AIKSGEYKDIGSPTRELTPEERKILQSLVNESYEQFVQVVAKGRNNKLSV------EDVKKF---ADGR 193 (224) T ss_pred HHHHCCCEEE-EEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH------HHHHHH---HCCC T ss_conf 9986491565-66516656789875776299999999988888757899999841677897------887652---0686 Q ss_pred HHH Q ss_conf 345 Q gi|254780277|r 699 VHT 701 (976) Q Consensus 699 IH~ 701 (976) |++ T Consensus 194 vf~ 196 (224) T TIGR00706 194 VFT 196 (224) T ss_pred EEC T ss_conf 010 No 228 >TIGR03269 met_CoM_red_A2 methyl coenzyme M reductase system, component A2. The enzyme that catalyzes the final step in methanogenesis, methyl coenzyme M reductase, contains alpha, beta, and gamma chains. In older literature, the complex of alpha, beta, and gamma chains was termed component C, while this single chain protein was termed methyl coenzyme M reductase system component A2. Probab=56.39 E-value=12 Score=16.00 Aligned_cols=12 Identities=25% Similarity=0.130 Sum_probs=5.5 Q ss_pred CCEEEEECCHHH Q ss_conf 638999559879 Q gi|254780277|r 925 HDELVFEVPEEE 936 (976) Q Consensus 925 HDElv~ev~~~~ 936 (976) +=+++.+.+.++ T Consensus 503 ~G~iv~~g~~~e 514 (520) T TIGR03269 503 DGKIVKIGDPEE 514 (520) T ss_pred CCEEEEEECHHH T ss_conf 999999987899 No 229 >KOG2213 consensus Probab=56.33 E-value=8.8 Score=16.99 Aligned_cols=18 Identities=17% Similarity=0.172 Sum_probs=7.8 Q ss_pred HHHHHHCCCHHHHHHHHHH Q ss_conf 9898413678999864542 Q gi|254780277|r 205 ALLLQEYGNLENILINASR 223 (976) Q Consensus 205 ~~ll~~~gsle~i~~~~d~ 223 (976) ++||+ -+||-+++.+++. T Consensus 102 ~qLLn-k~sl~~Lf~~~~~ 119 (460) T KOG2213 102 VQLLN-KASLTGLFGQIEV 119 (460) T ss_pred HHHHH-HHHHHHHHHHHHH T ss_conf 99998-8779999754300 No 230 >PRK03348 DNA polymerase IV; Provisional Probab=55.77 E-value=8.8 Score=16.99 Aligned_cols=37 Identities=30% Similarity=0.339 Sum_probs=22.2 Q ss_pred CHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHH Q ss_conf 88787777875187435899877636024998984136789998 Q gi|254780277|r 175 PPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENIL 218 (976) Q Consensus 175 ~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~ 218 (976) .|+.+.++++-. .|--++|||+||+.+| +.+| +..|. T Consensus 168 ~p~~~~~~L~~l-----PV~~lwGIG~~t~~kL-~~~G-I~Tig 204 (456) T PRK03348 168 PPGEERELLAPL-----PVRRLWGIGPVAEEKL-HRLG-IETIG 204 (456) T ss_pred CHHHHHHHHHHC-----CHHHCCCCCHHHHHHH-HHCC-CHHHH T ss_conf 888999998756-----5743387759999999-9869-81468 No 231 >COG2019 AdkA Archaeal adenylate kinase [Nucleotide transport and metabolism] Probab=55.65 E-value=6.8 Score=17.79 Aligned_cols=48 Identities=19% Similarity=0.407 Sum_probs=24.1 Q ss_pred CCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHH-HHHHHHCCCHHHHH Q ss_conf 99877636024998984136789998645420011-22322003578999 Q gi|254780277|r 193 IPGIPGIGYKTAALLLQEYGNLENILINASRIKQK-KRRENILEYAETAR 241 (976) Q Consensus 193 ipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~-k~~~~l~~~~e~a~ 241 (976) |-||||+|.-|-.+.+.+-- ++--+-|-..+.-. ..++.+.+++|+.+ T Consensus 9 itGVpGvGKTTVl~~~~~~l-~~~~ivNyG~~Mle~A~k~glve~rD~~R 57 (189) T COG2019 9 ITGVPGVGKTTVLKIALKEL-VKHKIVNYGDLMLEIAKKKGLVEHRDEMR 57 (189) T ss_pred EECCCCCCHHHHHHHHHHHH-HHCEEEEHHHHHHHHHHHHCCCCCHHHHH T ss_conf 97579887266999999877-52200007699999999838720289885 No 232 >PRK13641 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=55.39 E-value=12 Score=15.88 Aligned_cols=13 Identities=31% Similarity=0.266 Sum_probs=5.5 Q ss_pred CCHHHHHHHHHHH Q ss_conf 9989999999999 Q gi|254780277|r 823 IPRSEAADYIKRY 835 (976) Q Consensus 823 is~~eA~~~i~~y 835 (976) +|.+|+-+.+.++ T Consensus 268 lt~~e~~~~l~~~ 280 (286) T PRK13641 268 LTIDELVDGIKNN 280 (286) T ss_pred CCHHHHHHHHHHH T ss_conf 7999999999999 No 233 >PRK13507 formate--tetrahydrofolate ligase; Provisional Probab=55.24 E-value=9.9 Score=16.62 Aligned_cols=35 Identities=9% Similarity=0.301 Sum_probs=17.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 3689999999997403455777765666799874207 Q gi|254780277|r 602 ISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLK 638 (976) Q Consensus 602 l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~Lg 638 (976) +.++++.+.++||...|-+|...--+||.. |+++| T Consensus 473 i~eKIe~IA~~IYgA~~V~~s~~A~~~l~~--~e~~G 507 (587) T PRK13507 473 LRERIETIAREVYGADGVSYTPEALEKLAR--LESDP 507 (587) T ss_pred HHHHHHHHHHHHCCCCCEEECHHHHHHHHH--HHHCC T ss_conf 999999999971599867789999999999--99748 No 234 >COG1555 ComEA DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair] Probab=55.22 E-value=13 Score=15.87 Aligned_cols=11 Identities=18% Similarity=0.471 Sum_probs=6.2 Q ss_pred HHHHHHHHHHH Q ss_conf 79999999878 Q gi|254780277|r 667 IIKNILEWRQL 677 (976) Q Consensus 667 ~~~~iLeyR~l 677 (976) .++.|.+||+- T Consensus 109 kA~aIi~yRe~ 119 (149) T COG1555 109 KAQAIIDYREE 119 (149) T ss_pred HHHHHHHHHHH T ss_conf 99999999997 No 235 >TIGR00194 uvrC excinuclease ABC, C subunit; InterPro: IPR004791 Excinuclease ABC complex is involved in DNA replication, recombination and repair. This group defines the C subunit.; GO: 0009381 excinuclease ABC activity, 0006281 DNA repair, 0005737 cytoplasm, 0009380 excinuclease repair complex. Probab=55.15 E-value=6.4 Score=18.02 Aligned_cols=10 Identities=30% Similarity=0.614 Sum_probs=4.2 Q ss_pred HHHHHCHHHH Q ss_conf 9999888899 Q gi|254780277|r 834 RYFHRFPGIH 843 (976) Q Consensus 834 ~yf~~~p~v~ 843 (976) .+|..+.+++ T Consensus 586 ~~~~~~~~~~ 595 (601) T TIGR00194 586 KHFGSLDGIK 595 (601) T ss_pred HHHHHHHHHH T ss_conf 8741132333 No 236 >PRK03839 putative kinase; Provisional Probab=55.03 E-value=5.7 Score=18.34 Aligned_cols=19 Identities=47% Similarity=0.737 Sum_probs=15.9 Q ss_pred CCCCCCCCCHHHHHHHHHC Q ss_conf 9987763602499898413 Q gi|254780277|r 193 IPGIPGIGYKTAALLLQEY 211 (976) Q Consensus 193 ipGv~GiG~KtA~~ll~~~ 211 (976) |-|.||+|+-|.+++|++- T Consensus 5 ITGTPGtGKTTva~~La~~ 23 (180) T PRK03839 5 ITGTPGVGKTTISKLLAEK 23 (180) T ss_pred EECCCCCCHHHHHHHHHHH T ss_conf 9789999989999999997 No 237 >PRK06294 coproporphyrinogen III oxidase; Provisional Probab=54.95 E-value=13 Score=15.84 Aligned_cols=21 Identities=5% Similarity=-0.282 Sum_probs=12.1 Q ss_pred HHHHHHHHCCCEECCCCCEEC Q ss_conf 999999984988827787601 Q gi|254780277|r 848 KTKNFVRQNGYVETIFGRRIH 868 (976) Q Consensus 848 ~~~~~a~~~gyv~T~~GRrr~ 868 (976) ...+.....|+++.=-+|-|. T Consensus 334 ~~l~~~~~~Gll~~~~~~l~L 354 (374) T PRK06294 334 ELTMQPSIQGLFTQNDQILSL 354 (374) T ss_pred HHHHHHHHCCCEEEECCEEEE T ss_conf 999999988988984999999 No 238 >TIGR01084 mutY A/G-specific adenine glycosylase; InterPro: IPR005760 The DNA repair enzyme MutY plays an important role in the prevention of DNA mutations resulting from the presence of the oxidatively damaged lesion 7,8-dihydro-8-oxo-2'-deoxyguanosine (8-OxoG). 8-OxoG can mispair with 2'-deoxycytidine 5'-triphosphate or with 2'-deoxyadenosine triphosphate during DNA replication, forming C*8-oxoG and A*8-oxoG mispairs. Human MutY is responsible for recognition and removal of the inappropriately inserted adenine in an A=8-oxoG mispair. If unrepaired, the A=8-oxoG mispairs can result in deleterious C:G to A:T transversions. Multiple forms of mammalian MutY are formed by alternative splicing and locate to the nucleus or mitochondrion, where they have been shown to interact with several other proteins involved in the repair of DNA damage . The HhH-GPD domain within the protein binds the phosphate backbone of the substrate. This family represents bacterial MutY and a limited number of murine homologs. In rat, the Escherichia coli MutY DNA glycosylase homolog (MYH) is induced in response to mitochondrial DNA damage . ; GO: 0019104 DNA N-glycosylase activity, 0006284 base-excision repair, 0005622 intracellular. Probab=54.81 E-value=13 Score=15.87 Aligned_cols=139 Identities=17% Similarity=0.227 Sum_probs=62.5 Q ss_pred HHHHHHHHHHHHHHH-HHHH-HC-CCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHH---HHHHHHHHHHCCCCCCCCCCH Q ss_conf 677632221000134-5788-60-75012332210143100589999864343689---999999997403455777765 Q gi|254780277|r 553 WLLLRPKLIVEKLLH-VYER-LD-KPMIDVVSQMEITGIQIDQDLLKQVSAEISKN---LLSLEENIYDLSGEKFNLSSP 626 (976) Q Consensus 553 ~~~l~~~L~~~~l~~-l~~~-iE-~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~---l~~le~~i~~~aG~~FNinSp 626 (976) -++|..+-+-..... -|.. || .|-+..||.-...-| ...++-+. +-.+ |.+..++|.+.-|-+| | T Consensus 44 SEvMLQQTqV~tV~prYf~rFle~FPTv~~LA~A~~deV---L~lW~GLG--YYaRARNL~kAA~~v~~~fGG~f----P 114 (297) T TIGR01084 44 SEVMLQQTQVATVIPRYFERFLERFPTVQALANAPQDEV---LKLWEGLG--YYARARNLHKAAQEVVEEFGGEF----P 114 (297) T ss_pred EEECCCCCEEEECCCCCCHHHHHHCCCHHHHHCCCHHHH---HHHHHCCC--HHHHHHHHHHHHHHHHHHHCCCC----C T ss_conf 021100110011267100476642788578747796579---99862578--67888999999999998718817----7 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHCCCCHHH-----HHHHHHH---------HHHHHHHHHHHHHHH-H Q ss_conf 666799874207887555665453531788776530476279-----9999998---------788898865320011-1 Q gi|254780277|r 627 KQLGDILFTKLKFPSGAKTKTGQWKTTAQDLEQINCGDNPII-----KNILEWR---------QLSKIKSTYADSLPN-H 691 (976) Q Consensus 627 kQL~~vLf~~LgLp~~kKTktG~~ST~~~~Le~L~~~~hp~~-----~~iLeyR---------~l~KL~sTY~~~l~~-~ 691 (976) +.+.++. .||.+ | +||...+|.-=....+|++ +.+-+++ ++.+-..++++.|.+ . T Consensus 115 ~d~~~~~----~L~GV-----G-~yTAgAils~a~~~~~p~~DGNV~RVLsR~fA~~~~~~~k~~e~~l~~~~~~llpe~ 184 (297) T TIGR01084 115 QDLEDLK----ALPGV-----G-RYTAGAILSFAYNKPVPILDGNVKRVLSRLFAVEGWPGKKKVENRLWELAESLLPEK 184 (297) T ss_pred CCHHHHH----HCCCC-----C-HHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 2379785----17897-----6-217999999872687620154078899999862489887348899999999858865 Q ss_pred HHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 1001233345330022210000013 Q gi|254780277|r 692 INKRTQRVHTFYSLASTMTGRLASL 716 (976) Q Consensus 692 i~~~dgRIH~~f~q~~t~TGRlSs~ 716 (976) .++ ..|||..=.=|=+=|. T Consensus 185 ~~~------~~~nqalmDlGA~iC~ 203 (297) T TIGR01084 185 ADP------EAFNQALMDLGALICT 203 (297) T ss_pred CCH------HHHHHHHHHCCCCCCC T ss_conf 686------5888899862361037 No 239 >TIGR02093 P_ylase glycogen/starch/alpha-glucan phosphorylases; InterPro: IPR011833 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described . This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site . The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Proteins in this entry are glycosyltransferases with phosphorylase activities. Members use phosphate to break alpha 1,4 linkages between pairs of glucose residues at the end of long glucose polymers, releasing alpha-D-glucose 1-phosphate. The nomenclature convention is to preface the name according to the natural substrate, as in glycogen phosphorylase, starch phosphorylase, maltodextrin phosphorylase, etc. The main role of glycogen phosphorylase (GPase) is to provide phosphorylated glucose molecules (G-1-P) . GPase is a highly regulated allosteric enzyme. The net effect of the regulatory site allows the enzyme to operate at a variety of rates; the enzyme is not simply regulated as "on" or "off", but rather it can be thought of being set to operate at an ideal rate based on changing conditions at in the cell. The most important allosteric effector is the phosphate molecule covalently attached to Ser14. This switches GPase from the b (inactive) state to the a (active) state. Upon phosphorylation, GPase attains about 800f its Vmax. When the enzyme is not phosphorylated, GPase activity is practically non-existent at low AMP levels . There is some apparent controversy as to the structure of GPase. All sources agree that the enzyme is multimeric, but there is apparent controversy as to the enzyme being a tetramer or a dimer. Apparently, GPase (in the a form) forms tetramers in the crystal form. The consensus seems to be that 'regardless of the a or b form, GPase functions as a dimer in vivo . The GPase monomer is best described as consisting of two domains, an N-terminal domain and a C-terminal domain . The C-terminal domain is often referred to as the catalytic domain. It consists of a beta-sheet core surrounded by layers of helical segments . The vitamin cofactor pyridoxal phosphate (PLP) is covalently attached to the amino acid backbone. The N-terminal domain also consists of a central beta-sheet core and is surrounded by layers of helical segments. The N-terminal domain contains different allosteric effector sites to regulate the enzyme. Bacterial phosphorylases follow the same catalytic mechanisms as their plant and animal counterparts, but differ considerably in terms of their substrate specificity and regulation. The catalytic domains are highly conserved while the regulatory sites are only poorly conserved. For maltodextrin phosphorylase from Escherichia coli the physiological role of the enzyme in the utilisation of maltidextrins is known in detail; that of all the other bacterial phosphorylases is still unclear. Roles in regulatuon of endogenous glycogen metabolism in periods of starvation, and sporulation, stress response or quick adaptation to changing environments are possible .; GO: 0004645 phosphorylase activity, 0030170 pyridoxal phosphate binding, 0005975 carbohydrate metabolic process. Probab=54.30 E-value=6.9 Score=17.77 Aligned_cols=26 Identities=19% Similarity=0.344 Sum_probs=16.5 Q ss_pred HHHHHHHHHHCC-CCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 666799874207-88755566545353178877653 Q gi|254780277|r 627 KQLGDILFTKLK-FPSGAKTKTGQWKTTAQDLEQIN 661 (976) Q Consensus 627 kQL~~vLf~~Lg-Lp~~kKTktG~~ST~~~~Le~L~ 661 (976) +.|+.+|=+.+| = .+-||-+-|++|. T Consensus 483 P~LS~l~~e~iGt~---------~W~~~l~~L~~L~ 509 (822) T TIGR02093 483 PGLSALLTETIGTD---------EWLTDLDLLKKLE 509 (822) T ss_pred HHHHHHHHHHHCCC---------HHHCCHHHHHHHH T ss_conf 13689998762467---------2221479885111 No 240 >COG1194 MutY A/G-specific DNA glycosylase [DNA replication, recombination, and repair] Probab=54.06 E-value=12 Score=15.91 Aligned_cols=18 Identities=11% Similarity=-0.062 Sum_probs=9.7 Q ss_pred HHHHHCCCCCEECCCCCC Q ss_conf 453300222100000133 Q gi|254780277|r 700 HTFYSLASTMTGRLASLE 717 (976) Q Consensus 700 H~~f~q~~t~TGRlSs~~ 717 (976) -..|||..-.=|.+-|+- T Consensus 182 ~~~fnqammdlGA~ICt~ 199 (342) T COG1194 182 PGDFNQAMMDLGATICTA 199 (342) T ss_pred HHHHHHHHHHHHHHHHCC T ss_conf 679999999863676158 No 241 >PRK13651 cobalt transporter ATP-binding subunit; Provisional Probab=53.66 E-value=13 Score=15.69 Aligned_cols=13 Identities=31% Similarity=0.360 Sum_probs=4.7 Q ss_pred CHHHHHHHHHHHH Q ss_conf 9899999999999 Q gi|254780277|r 824 PRSEAADYIKRYF 836 (976) Q Consensus 824 s~~eA~~~i~~yf 836 (976) |.+|+..-|..|+ T Consensus 284 t~~e~~~~~~~~~ 296 (304) T PRK13651 284 SIDELASEINMYL 296 (304) T ss_pred CHHHHHHHHHHHH T ss_conf 8999999999999 No 242 >PRK11440 hypothetical protein; Provisional Probab=52.78 E-value=14 Score=15.60 Aligned_cols=42 Identities=21% Similarity=0.251 Sum_probs=36.0 Q ss_pred HHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEEC Q ss_conf 999999709979956890166589999999984899499986 Q gi|254780277|r 98 VRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIS 139 (976) Q Consensus 98 i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s 139 (976) +..+|...||..+-.-|+-.+=++-+-++.|.+.||+|++++ T Consensus 112 L~~~Lr~~gi~~lvi~G~~T~~CV~sTardA~~~gy~v~vv~ 153 (188) T PRK11440 112 LELQLRRRGIDTIVLCGISTNIGVESTARNAWELGFNLVIAE 153 (188) T ss_pred HHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHCCCEEEEEC T ss_conf 899998779998999322116569999999998899899810 No 243 >COG2390 DeoR Transcriptional regulator, contains sigma factor-related N-terminal domain [Transcription] Probab=51.52 E-value=14 Score=15.46 Aligned_cols=12 Identities=42% Similarity=0.631 Sum_probs=6.7 Q ss_pred HHCCCEECCCCC Q ss_conf 984988827787 Q gi|254780277|r 854 RQNGYVETIFGR 865 (976) Q Consensus 854 ~~~gyv~T~~GR 865 (976) .+.|-|--++|| T Consensus 239 ~~~gaVGdi~g~ 250 (321) T COG2390 239 LAKGAVGDILGR 250 (321) T ss_pred HHCCCCEECCCC T ss_conf 967930003066 No 244 >PRK04301 radA DNA repair and recombination protein RadA; Validated Probab=51.42 E-value=14 Score=15.45 Aligned_cols=32 Identities=28% Similarity=0.505 Sum_probs=26.5 Q ss_pred CCCCCCCCCCCCCHHHHHHH-HHCCCHHHHHHH Q ss_conf 43589987763602499898-413678999864 Q gi|254780277|r 189 SIDNIPGIPGIGYKTAALLL-QEYGNLENILIN 220 (976) Q Consensus 189 ~sDnipGv~GiG~KtA~~ll-~~~gsle~i~~~ 220 (976) +...+..+|||||.++.||- +.|.|++.+... T Consensus 5 ~~~~l~~lpGv~~~~~~kL~~aG~~tv~~l~~~ 37 (318) T PRK04301 5 KEKDLEDLPGVGPATAEKLREAGYDTVEAIAVA 37 (318) T ss_pred CCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHCC T ss_conf 547745589989999999998699549998748 No 245 >cd01011 nicotinamidase Nicotinamidase/pyrazinamidase (PZase). Nicotinamidase, a ubiquitous enzyme in prokaryotes, converts nicotinamide to nicotinic acid (niacin) and ammonia, which in turn can be recycled to make nicotinamide adenine dinucleotide (NAD). The same enzyme is also called pyrazinamidase, because in converts the tuberculosis drug pyrazinamide (PZA) into its active form pyrazinoic acid (POA). Probab=51.08 E-value=14 Score=15.41 Aligned_cols=43 Identities=21% Similarity=0.307 Sum_probs=27.3 Q ss_pred HHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEEC Q ss_conf 9999999709979956890166589999999984899499986 Q gi|254780277|r 97 LVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIS 139 (976) Q Consensus 97 ~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s 139 (976) -+.++|...||..+..-|+-.|=++-+-++.|...||+|+|++ T Consensus 128 ~L~~~Lr~~gi~~lvl~Gv~T~~CV~~Ta~dA~~~gy~V~vv~ 170 (196) T cd01011 128 GLAEYLRERGIDRVDVVGLATDYCVKATALDALKAGFEVRVLE 170 (196) T ss_pred CHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHCCCEEEEEC T ss_conf 2999999779989999721737799999999998799899966 No 246 >COG4277 Predicted DNA-binding protein with the Helix-hairpin-helix motif [General function prediction only] Probab=50.78 E-value=7.9 Score=17.32 Aligned_cols=19 Identities=16% Similarity=-0.088 Sum_probs=8.9 Q ss_pred CCEEEEEECCCCCEEEEEE Q ss_conf 9489996326544368766 Q gi|254780277|r 742 TKKLISADYSQIELRILAH 760 (976) Q Consensus 742 g~~lv~aDySQiElRilA~ 760 (976) +|.+-.-+|++=|++-..+ T Consensus 281 DwLlrfYgF~~~Ei~~~g~ 299 (404) T COG4277 281 DWLLRFYGFSADEILASGG 299 (404) T ss_pred HHHHHHHCCCHHHHHHCCC T ss_conf 7899970888788874489 No 247 >PRK09057 coproporphyrinogen III oxidase; Provisional Probab=50.30 E-value=15 Score=15.33 Aligned_cols=19 Identities=16% Similarity=0.109 Sum_probs=11.0 Q ss_pred HHHHHHCCCEECC-CCCEEC Q ss_conf 9999984988827-787601 Q gi|254780277|r 850 KNFVRQNGYVETI-FGRRIH 868 (976) Q Consensus 850 ~~~a~~~gyv~T~-~GRrr~ 868 (976) .+...+.|+++.- .||-|. T Consensus 345 l~~l~~~Gll~~~~~~rl~l 364 (381) T PRK09057 345 LALLIEEGLIERDGDSRLRA 364 (381) T ss_pred HHHHHHCCCEEEECCCEEEE T ss_conf 99999789979808986999 No 248 >TIGR02607 antidote_HigA addiction module antidote protein, HigA family; InterPro: IPR013430 Proteins in this entry form a distinct group of helix-turn-helix proteins, which are strictly bacterial and nearly always shorter than 110 amino acids. They include the characterised member HigA, without which the killer protein HigB cannot be cloned. The hig (host inhibition of growth) system is noted to be unusual in that killer protein is encoded by the upstream member of the gene pair.; GO: 0003677 DNA binding. Probab=49.66 E-value=14 Score=15.64 Aligned_cols=23 Identities=30% Similarity=0.413 Sum_probs=14.2 Q ss_pred CCCHHHHHHHCCCCHHHHHHHHH Q ss_conf 77703567881999899999999 Q gi|254780277|r 811 GISPFRLATQLRIPRSEAADYIK 833 (976) Q Consensus 811 G~~~~~La~~l~is~~eA~~~i~ 833 (976) |+|...||+.||||+.-..+++. T Consensus 20 g~s~~~LA~~LgVsr~~~sriv~ 42 (81) T TIGR02607 20 GLSVRALAKALGVSRSTLSRIVN 42 (81) T ss_pred CHHHHHHHHHCCCCHHHHHHHHH T ss_conf 70689999870999788888874 No 249 >pfam07592 Transposase_36 Rhodopirellula transposase. These transposases are found in the planctomycete Rhodopirellula baltica, the cyanobacterium Nostoc, and the Gram-positive bacterium Streptomyces. Probab=48.88 E-value=16 Score=15.18 Aligned_cols=111 Identities=20% Similarity=0.316 Sum_probs=59.1 Q ss_pred CCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHH--HHHHHHHHHHHHHCCCHHHHHH Q ss_conf 94899963265443687665103777873406988899999898099968899888--8630251356440777035678 Q gi|254780277|r 742 TKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQM--RRHAKTINFSIIYGISPFRLAT 819 (976) Q Consensus 742 g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~--R~~aK~infgi~YG~~~~~La~ 819 (976) |.-.+|.| ++-+.|+..|.++++--+-. |.|.+--+-+- ...+|++-||| |-++.-..-= T Consensus 90 g~PvISVD------------tKkKeliG~~~n~G~~w~~~-----g~p~~v~~HDf~~~~~gk~iPyGi-YD~~~n~g~V 151 (311) T pfam07592 90 GQPVISVD------------TKKKELIGEFKNSGKEWTKK-----GAPVEVLDHDFPDKALGTAIPYGI-YDVGANTGWV 151 (311) T ss_pred CCCEEEEE------------CHHHHHHCCCCCCCCEECCC-----CCCCEECCCCCCCCCCCCCCCCEE-EECCCCEEEE T ss_conf 99578874------------30245526778898542307-----995041146676645781156413-6658872799 Q ss_pred HCCCCHHHHH---HHHHHHH-----HHCHH-----------------HHHHHHHHHHHHHHCCCEECCCCCEECCCCCCC Q ss_conf 8199989999---9999999-----98888-----------------999999999999984988827787601898758 Q gi|254780277|r 820 QLRIPRSEAA---DYIKRYF-----HRFPG-----------------IHEYIEKTKNFVRQNGYVETIFGRRIHYDEINS 874 (976) Q Consensus 820 ~l~is~~eA~---~~i~~yf-----~~~p~-----------------v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s 874 (976) ..|+|.+-|. +-|.+|. .+||. .+.|+.+.+++|.+.|-.- |..|+|--.| T Consensus 152 ~vgt~~dTa~FaV~si~~WW~~~g~~~yp~a~~LlItaD~GgsN~~R~r~wk~~Lq~lA~e~gl~I----~v~h~PPgtS 227 (311) T pfam07592 152 SVGTDHDTAEFAVESIRRWWDEMGKKRYPHAKELLITADCGGSNGSRRRLWKKRLQELANETGLPI----QVCHYPPGTS 227 (311) T ss_pred EEEECCCHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCEE----EEEECCCCCC T ss_conf 982066528999999999999738755885207999767887664277899999999999739747----9983599876 No 250 >cd00245 Glm_e Coenzyme B12-dependent glutamate mutase epsilon subunit-like family; contains proteins similar to Clostridium cochlearium glutamate mutase (Glm) and Streptomyces tendae Tu901 NikV. Glm catalyzes a carbon-skeleton rearrangement of L-glutamate to L-threo-3-methylaspartate. The first step in the catalysis is a homolytic cleavage of the Co-C bond of the coenzyme B12 cofactor to generate a 5'-deoxyadenosyl radical. This radical then initiates the rearrangement reaction. C. cochlearium Glm is a sigma2epsilon2 heterotetramer. Glm plays a role in glutamate fermentation in Clostridium sp. and in members of the family Enterobacteriaceae, and in the synthesis of the lipopeptide antibiotic friulimicin in Actinoplanes friuliensis. S. tendae Tu901 glutamate mutase-like proteins NikU and NIkV participate in the synthesis of the peptidyl nucleoside antibiotic nikkomycin. NikU and NikV proteins have sequence similarity to Clostridium Glm sigma and epsilon components respectively, and may Probab=48.53 E-value=15 Score=15.29 Aligned_cols=79 Identities=10% Similarity=0.130 Sum_probs=39.6 Q ss_pred HHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCC---HHHHHHHCCCCHHHHHHHHHHHHHH-CHHHHHHHHHHHHH Q ss_conf 8999998980999688998888630251356440777---0356788199989999999999998-88899999999999 Q gi|254780277|r 777 IHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGIS---PFRLATQLRIPRSEAADYIKRYFHR-FPGIHEYIEKTKNF 852 (976) Q Consensus 777 iH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~~---~~~La~~l~is~~eA~~~i~~yf~~-~p~v~~~~~~~~~~ 852 (976) +-.-|..+-+|+|..+....-=+.+|++ +.++-++. ...+..+-.+-..|++.++|.-|+. -+.|..- .-. T Consensus 278 vIvKT~~EA~gIPt~eaN~~~l~~t~~~-~~~~~~q~~~~~~~~~~E~~~i~~Ev~~i~d~Vle~g~gDl~~~----~v~ 352 (428) T cd00245 278 VIVKTPDEAHGIPTIEANVAGLKATATV-LNMLRGQKFPPSEAIEQEEEIIKAEVKAILDKVFELGDGDVARG----TVK 352 (428) T ss_pred EEECCHHHHCCCCCHHHHHHHHHHHHHH-HHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH----HHH T ss_conf 9972889975899889999999999999-99873455678631035899999999999999984489719999----999 Q ss_pred HHHCCCEE Q ss_conf 99849888 Q gi|254780277|r 853 VRQNGYVE 860 (976) Q Consensus 853 a~~~gyv~ 860 (976) |-+.|+.- T Consensus 353 AF~~GvlD 360 (428) T cd00245 353 AFEAGVLD 360 (428) T ss_pred HHHCCCCC T ss_conf 99758776 No 251 >pfam11731 Cdd1 Pathogenicity locus. Cdd1 is expressed as part of the pathogenicity locus operon in several different orders of bacteria. Many members of the family are annotated as being putative mitomycin resistance proteins but this could not be confirmed. Probab=48.52 E-value=14 Score=15.61 Aligned_cols=13 Identities=46% Similarity=0.544 Sum_probs=5.5 Q ss_pred CCCCCCHHHHHHH Q ss_conf 7763602499898 Q gi|254780277|r 196 IPGIGYKTAALLL 208 (976) Q Consensus 196 v~GiG~KtA~~ll 208 (976) +||||+++|..|. T Consensus 17 lPnIG~a~a~DL~ 29 (92) T pfam11731 17 LPNIGKATAKDLR 29 (92) T ss_pred CCCCCHHHHHHHH T ss_conf 8974699999999 No 252 >TIGR03614 RutB pyrimidine utilization protein B. RL Proc Natl Acad Sci U S A. 2006 Mar 28;103(13):5114-9. Epub 2006 Mar 15. Probab=48.19 E-value=16 Score=15.10 Aligned_cols=42 Identities=10% Similarity=0.088 Sum_probs=19.6 Q ss_pred HHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEEC Q ss_conf 999999709979956890166589999999984899499986 Q gi|254780277|r 98 VRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIS 139 (976) Q Consensus 98 i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s 139 (976) +..+|.++||..+-.-|+=.+=++-+-++.|.+.||+|++++ T Consensus 142 L~~~Lr~~gI~tlvv~Gv~T~~CV~sTardA~~~gy~vvvv~ 183 (226) T TIGR03614 142 LDSMLRARGIRNLVFTGIATNVCVESTLRDGFHLEYFGVVLE 183 (226) T ss_pred HHHHHHHCCCCEEEEEEEEHHHHHHHHHHHHHHCCCEEEEEH T ss_conf 899998689980699422053799999999998899499930 No 253 >PRK08238 hypothetical protein; Validated Probab=47.93 E-value=9.4 Score=16.80 Aligned_cols=20 Identities=20% Similarity=0.232 Sum_probs=7.8 Q ss_pred HHHHHHHHHHCCCCCEECCC Q ss_conf 12333453300222100000 Q gi|254780277|r 695 RTQRVHTFYSLASTMTGRLA 714 (976) Q Consensus 695 ~dgRIH~~f~q~~t~TGRlS 714 (976) +..|.||+=+.---++|++| T Consensus 254 e~DR~Hp~Kr~RP~ASG~ls 273 (481) T PRK08238 254 EADRAHPRKRRRPFASGALP 273 (481) T ss_pred HHHCCCCCCCCCCCCCCCCC T ss_conf 75416862456977778889 No 254 >KOG2405 consensus Probab=47.54 E-value=8.5 Score=17.11 Aligned_cols=47 Identities=21% Similarity=0.339 Sum_probs=34.2 Q ss_pred HHHHHHHHCCCCCHHHHCCCCHHHHHHH-HCCCCCCCCCCHHHHHHHH Q ss_conf 9744442022210011003212378898-6188666421002345543 Q gi|254780277|r 450 LSYLKKFFENEHFLKIGHNIKYDKLVLH-RYGISMRGFEDIMLMSYVL 496 (976) Q Consensus 450 l~~L~~lle~~~i~kIgHN~KfD~~~L~-~~GI~l~~i~DTmlaayLL 496 (976) -..++.++++.++.|+-|++..-...+. .+||.++++|||..+.-+- T Consensus 101 ~n~~q~~~~~k~i~~~~~d~~~~~~~~~~~~~i~~n~v~~~q~~d~~q 148 (458) T KOG2405 101 HNGLQMILEDKRILKVIHDCRWLSDCLSHQYGILLNNVFDTQVADVLQ 148 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHH T ss_conf 989999860466765579999999874030224433203333206655 No 255 >COG1747 Uncharacterized N-terminal domain of the transcription elongation factor GreA [Function unknown] Probab=47.31 E-value=16 Score=15.01 Aligned_cols=11 Identities=45% Similarity=0.401 Sum_probs=4.4 Q ss_pred EHHHHHHHHHH Q ss_conf 16658999999 Q gi|254780277|r 116 EADDIIATYTY 126 (976) Q Consensus 116 EADDviatla~ 126 (976) |-||+|+-.-+ T Consensus 16 eld~~i~~~~~ 26 (711) T COG1747 16 ELDRIIAEAKR 26 (711) T ss_pred HHHHHHHHHHH T ss_conf 89999999988 No 256 >cd04891 ACT_AK-LysC-DapG-like_1 ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII and related proteins. This CD includes the N-terminal of the two ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, as well as, the first and third, of four, ACT domains present in cyanobacteria AK. Also included are the N-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase isoenzyme AKI found in Bacilli (Bacillus subtilis strain 168), Clostridia, and Actinobacteria bacterial species. Members of this CD belong to the superfamily of ACT regulatory domains. Probab=46.93 E-value=17 Score=14.97 Aligned_cols=44 Identities=27% Similarity=0.500 Sum_probs=35.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEC-----CEEEEECCHHHHHHHHHHHH Q ss_conf 53899999999999999986599818999726-----38999559879999999999 Q gi|254780277|r 894 GSAADITRRAMISVHKSLEDHKLATKMLLQIH-----DELVFEVPEEEITTASQIIV 945 (976) Q Consensus 894 GsaAdi~k~am~~i~~~~~~~~~~~~~ilqVH-----DElv~ev~~~~~~~~~~~~~ 945 (976) |.||.| ...|.+.+...-||.|-- -.+.|-||++.++++.++++ T Consensus 13 g~aa~i--------F~~LA~~~InVDmIvQ~~~~~~~~~isFTv~~~dl~~a~~~L~ 61 (61) T cd04891 13 GVAAKI--------FSALAEAGINVDMIVQSVSRGGTTDISFTVPKSDLEKALAILE 61 (61) T ss_pred CHHHHH--------HHHHHHCCCCEEEEEECCCCCCCCEEEEEECHHHHHHHHHHHC T ss_conf 579999--------9999985981799986588899521899824989999999849 No 257 >COG1938 Archaeal enzymes of ATP-grasp superfamily [General function prediction only] Probab=46.91 E-value=17 Score=14.97 Aligned_cols=38 Identities=26% Similarity=0.497 Sum_probs=29.5 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 1233221014310058999986434368999999999740 Q gi|254780277|r 577 IDVVSQMEITGIQIDQDLLKQVSAEISKNLLSLEENIYDL 616 (976) Q Consensus 577 ~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~ 616 (976) +.+|..| .|+.||.+.|.+..++++.+++++.++..+. T Consensus 193 ve~lnk~--~~l~V~td~L~keAe~i~~~lekl~eq~~~~ 230 (244) T COG1938 193 VEALNKM--LGLNVDTDKLEKEAEEIEEQLEKLAEQLEKE 230 (244) T ss_pred HHHHHHH--HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999988--5686678999999999999999999999998 No 258 >PRK13851 type IV secretion system protein VirB11; Provisional Probab=46.82 E-value=13 Score=15.67 Aligned_cols=43 Identities=12% Similarity=0.167 Sum_probs=30.6 Q ss_pred CCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHH--HHHHHHCCCC-----HHHHHHHHHH Q ss_conf 24302424899489996326544368766510377--7873406988-----8999998980 Q gi|254780277|r 732 KIRRAFISPPTKKLISADYSQIELRILAHIAKITP--LYQVFENSLD-----IHRMVAAEIF 786 (976) Q Consensus 732 ~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~--l~~af~~g~D-----iH~~tA~~vf 786 (976) -+|.|+--+|+.++| +.+-|+++ |++|.+.|.+ +|+..|...| T Consensus 228 Ll~~aLRmrPDRIiv------------GEvRG~EA~d~L~A~nTGH~GsitTiHAnSa~~a~ 277 (343) T PRK13851 228 LLQASLRMRPDRILL------------GEMRDDAAWAYLSEVVSGHPGSISTIHGANPVQGF 277 (343) T ss_pred HHHHHHCCCCCEEEE------------EEECCHHHHHHHHHHHCCCCCCCEEECCCCHHHHH T ss_conf 999986079982899------------85266999999999846999830335679999999 No 259 >cd01015 CSHase N-carbamoylsarcosine amidohydrolase (CSHase) hydrolyzes N-carbamoylsarcosine to sarcosine, carbon dioxide and ammonia. CSHase is involved in one of the two alternative pathways for creatinine degradation to glycine in microorganisms.This CSHase-containing pathway degrades creatinine via N-methylhydantoin N-carbamoylsarcosine and sarcosine to glycine. Enzymes of this pathway are used in the diagnosis for renal disfunction, for determining creatinine levels in urine and serum. Probab=46.62 E-value=17 Score=14.94 Aligned_cols=44 Identities=16% Similarity=0.243 Sum_probs=37.6 Q ss_pred HHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEEC Q ss_conf 99999999709979956890166589999999984899499986 Q gi|254780277|r 96 PLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIS 139 (976) Q Consensus 96 ~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s 139 (976) .-+.++|...||..+-.-|...|=++-+-++-|...||+|++++ T Consensus 103 T~L~~~Lr~~gI~~lvi~G~~T~~CV~~Ta~dA~~~gy~v~vv~ 146 (179) T cd01015 103 TSLAATLTARGVDTLIVAGCSTSGCIRATAVDAMQHGFRPIVVR 146 (179) T ss_pred CCHHHHHHHCCCCEEEEEEEEHHHHHHHHHHHHHHCCCEEEEEC T ss_conf 76899998679977999840013789999999998899799945 No 260 >cd01703 Pol_iota Pol iota is member of the DNA polymerase Y-family, and has also been called Rad30 homolog B. Unlike classic DNA polymerases,Y-family polymerases are induced by DNA damage. They can transverse normal replication-blocking DNA lesions. Unlike Pol eta, Pol iota is unable to replicate through a cis-syn T-T dimer. In human Pol iota, the base-pairing mode in the active site at the replicative end mat bee Hoogsteen instead of Watson-Click. Human Pol iota can incorporate the correct nucleotide opposite a purine much more efficiently than opposite a pyrimidine. Pol iota prefers to insert Guanosine instead of Adenosine opposite Thymidine. Probab=46.51 E-value=8.8 Score=16.99 Aligned_cols=67 Identities=13% Similarity=0.138 Sum_probs=29.5 Q ss_pred HHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCE-ECCC Q ss_conf 809996889988886302513564407770356788199989999999999998888999999999999984988-8277 Q gi|254780277|r 785 IFGVNIEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYV-ETIF 863 (976) Q Consensus 785 vf~~~~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv-~T~~ 863 (976) ..|++..+|++ |...|.|.-.--|+ .+. +..++..-+... +....+++.+..+++|.. +|+. T Consensus 262 a~GiD~~~V~~--~~~~KSis~e~tF~--------~~~-~~~~~~~~l~~~------~~~L~~Rl~~~l~~~~~~~~Tv~ 324 (394) T cd01703 262 AHGRDDTPVPA--SGPPQSISIEDSYK--------KCS-LVTEAKAELEEL------LRSLLERLRKDLAGDGRRPHTLR 324 (394) T ss_pred HCCCCCCCCCC--CCCCCEEECCCCCC--------CCC-CHHHHHHHHHHH------HHHHHHHHHHHHHHCCCCCEEEE T ss_conf 56978998888--99986112343788--------888-528899999999------99999999999985798654999 Q ss_pred CCEEC Q ss_conf 87601 Q gi|254780277|r 864 GRRIH 868 (976) Q Consensus 864 GRrr~ 868 (976) =..|+ T Consensus 325 l~iR~ 329 (394) T cd01703 325 LTVRK 329 (394) T ss_pred EEEEC T ss_conf 99977 No 261 >COG1412 Uncharacterized proteins of PilT N-term./Vapc superfamily [General function prediction only] Probab=46.40 E-value=17 Score=14.91 Aligned_cols=54 Identities=17% Similarity=0.155 Sum_probs=39.5 Q ss_pred HCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCHHHEECC-CCEEEEECCCCCE Q ss_conf 7099799568901665899999999848994999868701212237-9759996678835 Q gi|254780277|r 104 AFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKDLMQLVS-PTTCLYDTVKEEK 162 (976) Q Consensus 104 ~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD~~Qlv~-~~v~~~~~~~~~~ 162 (976) +.+.+++.. +=.|||+|-.+|.+. + .++++|.|+.|-|.+. ..|-+...+.++. T Consensus 74 ~er~~~~~~-~~~aDe~i~~~a~~~---~-~~iVaTnD~eLk~rlr~~GIPvi~lr~r~~ 128 (136) T COG1412 74 AERLECIHK-GRYADECLLEAALKH---G-RYIVATNDKELKRRLRENGIPVITLRQRKL 128 (136) T ss_pred HHCCCCCCC-CCCHHHHHHHHHHHC---C-CEEEEECCHHHHHHHHHCCCCEEEEECCEE T ss_conf 651472346-777689999999980---9-889994899999999981997799837828 No 262 >TIGR02788 VirB11 P-type DNA transfer ATPase VirB11; InterPro: IPR014155 This entry contains VirB11, a protein that is found in the vir locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for DNA transfer . VirB11 is believed to be an ATPase and is a homologue of the P-like conjugation system TrbB protein and the Flp pilus system protein TadA .. Probab=46.13 E-value=4.2 Score=19.34 Aligned_cols=72 Identities=14% Similarity=0.212 Sum_probs=37.1 Q ss_pred HHHHHHHHHHHHHHHHHHH------------H-HHHHCCCCCEE-CCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEE Q ss_conf 8865320011110012333------------4-53300222100-00013334333455633222243024248994899 Q gi|254780277|r 681 KSTYADSLPNHINKRTQRV------------H-TFYSLASTMTG-RLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKLI 746 (976) Q Consensus 681 ~sTY~~~l~~~i~~~dgRI------------H-~~f~q~~t~TG-RlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~lv 746 (976) |+||..+|.++| |.+-|| | |+.-+.-..-| +-++++=+.|. -+.+|.-=+|+++ T Consensus 171 KTTf~kal~~~I-P~~ER~iTIED~~E~~~~hhpN~V~L~ysk~v~~g~~~vt~~~---------Ll~scLRMrPDRI-- 238 (328) T TIGR02788 171 KTTFLKALVKEI-PKDERLITIEDTRELFLPHHPNKVHLFYSKGVGQGSAKVTPKD---------LLESCLRMRPDRI-- 238 (328) T ss_pred HHHHHHHHHHCC-CCCCCEEEEEEEECCCCCCCCCEEEEEECCCCCCCCCCCCHHH---------HHHHHHCCCCCHH-- T ss_conf 899999997327-6225278885201147888986456553464234435689899---------9999711774057-- Q ss_pred EEECCCCCEEEEEEEEEHHH--HHHHHHCC Q ss_conf 96326544368766510377--78734069 Q gi|254780277|r 747 SADYSQIELRILAHIAKITP--LYQVFENS 774 (976) Q Consensus 747 ~aDySQiElRilA~ls~D~~--l~~af~~g 774 (976) +||.|-||++ .+++-++| T Consensus 239 ----------~LgELRG~Eaf~F~~~~nsG 258 (328) T TIGR02788 239 ----------LLGELRGDEAFDFIRAVNSG 258 (328) T ss_pred ----------HHHHHCCHHHHHHHHHHCCC T ss_conf ----------67430332578888752059 No 263 >PRK13631 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=45.97 E-value=17 Score=14.87 Aligned_cols=11 Identities=18% Similarity=0.416 Sum_probs=5.1 Q ss_pred HHHHHHHHHHC Q ss_conf 02513564407 Q gi|254780277|r 801 AKTINFSIIYG 811 (976) Q Consensus 801 aK~infgi~YG 811 (976) .|-++|.-+|- T Consensus 289 ~~g~~~~~~~~ 299 (320) T PRK13631 289 KKDPKYKKLYQ 299 (320) T ss_pred HCCCCCHHHCC T ss_conf 73998333047 No 264 >PRK01810 DNA polymerase IV; Validated Probab=45.06 E-value=18 Score=14.77 Aligned_cols=15 Identities=47% Similarity=0.616 Sum_probs=6.1 Q ss_pred CCCCCCCCCHHHHHH Q ss_conf 998776360249989 Q gi|254780277|r 193 IPGIPGIGYKTAALL 207 (976) Q Consensus 193 ipGv~GiG~KtA~~l 207 (976) |-=|+|||++|+.+| T Consensus 181 v~~iwGIG~~~~~~L 195 (410) T PRK01810 181 VEEMHGIGEKTAEKL 195 (410) T ss_pred CCCCCCCCHHHHHHH T ss_conf 512058677899999 No 265 >TIGR00758 UDG_fam4 uracil-DNA glycosylase, family 4; InterPro: IPR005273 This well-conserved family of proteins is about 200 residues in length and homologous to the N-terminus of the DNA polymerase of phage SPO1 of Bacillus subtilis. The function of these proteins is unknown. . Probab=44.86 E-value=18 Score=14.75 Aligned_cols=24 Identities=21% Similarity=0.231 Sum_probs=9.7 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 679879758864899999999709 Q gi|254780277|r 83 NRPKIPEMLLPQLPLVRLATQAFG 106 (976) Q Consensus 83 ~R~~~p~~l~~q~~~i~~~l~~~g 106 (976) ||.|||+|...=.|++++-++... T Consensus 79 NR~PT~eE~~~C~pyL~~~i~lI~ 102 (185) T TIGR00758 79 NRDPTPEEVEACAPYLVKQIELIR 102 (185) T ss_pred CCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 223898999851578899997509 No 266 >KOG1931 consensus Probab=44.69 E-value=13 Score=15.65 Aligned_cols=76 Identities=18% Similarity=0.130 Sum_probs=50.5 Q ss_pred HHHHHHHHHHHCCC-CCEECCCCCCCCCCCCCCCCCCCCCCCEEEECC--CCCEEEEEECCCCCEEEEEEEE--EHHHHH Q ss_conf 01233345330022-210000013334333455633222243024248--9948999632654436876651--037778 Q gi|254780277|r 694 KRTQRVHTFYSLAS-TMTGRLASLEPNLQNIPIKTDLGQKIRRAFISP--PTKKLISADYSQIELRILAHIA--KITPLY 768 (976) Q Consensus 694 ~~dgRIH~~f~q~~-t~TGRlSs~~PNlQNiP~r~~~g~~iR~~F~a~--~g~~lv~aDySQiElRilA~ls--~D~~l~ 768 (976) +.|-++..++++++ +.++-+-++--++-+||++.. +.||..+ +-++=+.+-+||.|++|.+|+. .|-+-. T Consensus 836 ~~d~~~~~s~~~~p~~~~~d~e~~~~~~~~ip~l~~-----~d~~ss~h~~~kvsin~~~s~~e~~IA~~f~~~~~vt~~ 910 (1156) T KOG1931 836 RNDKNIFESLLCTPKLIFFDFEESFISLYEIPVLNG-----FDCFSSDHFNLKVSINLPFSQLELFIADAFLIPFDVTWR 910 (1156) T ss_pred CCCCCCHHHHCCCCCEEEECCHHCCCCCCCCCCCCC-----CCCCCCCCCCCEEEECCCHHHHHEEHHHHEECCCCEEEE T ss_conf 147630101114564687210110044200135687-----544555554540674363045300023222155322675 Q ss_pred HHHHCC Q ss_conf 734069 Q gi|254780277|r 769 QVFENS 774 (976) Q Consensus 769 ~af~~g 774 (976) .+|..| T Consensus 911 l~~~~~ 916 (1156) T KOG1931 911 LLFECG 916 (1156) T ss_pred EEHHCC T ss_conf 421047 No 267 >pfam02882 THF_DHG_CYH_C Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain. Probab=44.46 E-value=14 Score=15.41 Aligned_cols=47 Identities=19% Similarity=0.202 Sum_probs=20.1 Q ss_pred HHHHHHHHCCCCEEEE-CCCCHHHEECC-CCEEEEECCCCCEECHHHHH Q ss_conf 9999998489949998-68701212237-97599966788353899998 Q gi|254780277|r 123 TYTYIAEKEGFAVTII-STDKDLMQLVS-PTTCLYDTVKEEKIDIEHVI 169 (976) Q Consensus 123 tla~~~~~~~~~v~i~-s~DkD~~Qlv~-~~v~~~~~~~~~~~~~~~v~ 169 (976) -++.....++..|+++ |.-+|+-+.+. -++.+--..+...++.+.|. T Consensus 51 Pla~lL~~~~atVtichs~T~nl~~~~~~ADIvI~A~G~p~~i~~~~ik 99 (159) T pfam02882 51 PLALLLLNANATVTVCHSKTKDLAEITREADIVVVAVGKPGLIKADWVK 99 (159) T ss_pred HHHHHHHHCCCEEEEECCCCCCHHHHHHCCCEEEEECCCCCCCCHHHCC T ss_conf 9999998779989998189999789630034442315885505698858 No 268 >TIGR02313 HpaI-NOT-DapA 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase; InterPro: IPR012691 This entry represents a subset of the DapA (dihydrodipicolinate synthase) family which has apparently evolved a separate function. The product of DapA, dihydrodipicolinate, results from the non-enzymatic cyclization and dehydration of 6-amino-2,4-dihydroxyhept-2-ene-1,7-dioic acid, which is different from the substrate of this reaction only in the presence of the amino group. In the absence of this amino group, and running the reaction in the opposite direction, the reaction corresponds to the HpaI aldolase component of the 4-hydroxyphenylacetic acid catabolism pathway (see IPR012689 from INTERPRO). At present, this variant of DapA is found only in Oceanobacillus iheyensis HTE831 and Thermus thermophilus HB27. In both of these cases, one or more other DapA genes can be found and the members of this family are found in the operon for 4-hydroxyphenylacetic acid catabolism.; GO: 0008840 dihydrodipicolinate synthase activity, 0019877 diaminopimelate biosynthetic process, 0005737 cytoplasm. Probab=43.56 E-value=17 Score=14.86 Aligned_cols=13 Identities=15% Similarity=0.317 Sum_probs=5.9 Q ss_pred CCCCHHHHHHHHH Q ss_conf 6767688998653 Q gi|254780277|r 265 EDCNGPRLISFLK 277 (976) Q Consensus 265 ~~~d~~~l~~~~~ 277 (976) +.+|.+.++++.+ T Consensus 16 G~~D~~~~~~L~E 28 (294) T TIGR02313 16 GDLDEKKLRRLVE 28 (294) T ss_pred CCCCHHHHHHHHH T ss_conf 7521888876754 No 269 >TIGR00761 argB acetylglutamate kinase; InterPro: IPR004662 N -Acetylglutamate (NAG) fulfils distinct biological roles in lower and higher organisms. In prokaryotes, lower eukaryotes and plants it is the first intermediate in the biosynthesis of arginine, whereas in ureotelic (excreting nitrogen mostly in the form of urea) vertebrates, it is an essential allosteric cofactor for carbamyl phosphate synthetase I (CPSI), the first enzyme of the urea cycle. The pathway that leads from glutamate to arginine in lower organisms employs eight steps, starting with the acetylation of glutamate to form NAG. In these species, NAG can be produced by two enzymatic reactions: one catalysed by NAG synthase (NAGS) and the other by ornithine acetyltransferase (OAT). In ureotelic species, NAG is produced exclusively by NAGS. In lower organisms, NAGS is feedback-inhibited by L-arginine, whereas mammalian NAGS activity is significantly enhanced by this amino acid. The NAGS genes of bacteria, fungi and mammals are more diverse than other arginine-biosynthesis and urea-cycle genes. The evolutionary relationship between the distinctly different roles of NAG and its metabolism in lower and higher organisms remains to be determined . The pathway from glutamate to arginine is: NAGS; N-acetylglutamate synthase (2.3.1.1 from EC) (glutamate to N-acetylglutamate) NAGK; N-acetylglutamate kinase (2.7.2.8 from EC) (N-acetylglutamate to N-acetylglutamate-5P) NAGSA; N-acetyl-gamma-glutamyl-phosphate reductase (1.2.1.38 from EC) (N-acetylglutamate-5P to N-acetylglumate semialdehyde) Acetylornithine aminotransferase (2.6.1.11 from EC) (N-acetylglumate semialdehyde to N-acetylornithine) Acetylornithine deacetylase (3.5.1.16 from EC) (N-acetylornithine to ornithine) Arginase (3.5.3.1 from EC) (ornithine to arginine) This entry describes N-acetylglutamate kinases (ArgB) of many prokaryotes and the N-acetylglutamate kinase domains of multifunctional proteins from yeasts as well as some, as yet, uncharacterised proteins.; GO: 0003991 acetylglutamate kinase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=43.38 E-value=19 Score=14.60 Aligned_cols=54 Identities=20% Similarity=0.270 Sum_probs=33.2 Q ss_pred HHHHHHHHHHHHHHHHHHH-CC---CCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC-----CCCCHHHHHH Q ss_conf 8643436899999999974-03---45577776566679987420788755566545-----3531788776 Q gi|254780277|r 597 QVSAEISKNLLSLEENIYD-LS---GEKFNLSSPKQLGDILFTKLKFPSGAKTKTGQ-----WKTTAQDLEQ 659 (976) Q Consensus 597 ~l~~~l~~~l~~le~~i~~-~a---G~~FNinSpkQL~~vLf~~LgLp~~kKTktG~-----~ST~~~~Le~ 659 (976) ++.+++-.++..+...=++ .. | +|| |+-+.| ++||+|+-+ ++|- .-||+++|+= T Consensus 15 ~~~~~~~~~~~~l~~~g~~PpviVHG-----GGp-~in~~~-~~lgi~~~f--~~Gl~G~~~RvTd~~tl~v 77 (254) T TIGR00761 15 ELLEEFFSDIAFLRAVGIKPPVIVHG-----GGP-EINELL-EALGIEPEF--KNGLSGVVFRVTDKETLEV 77 (254) T ss_pred HHHHHHHHHHHHHHHCCCCCCEEEEC-----CCH-HHHHHH-HHCCCCCEE--ECCCCCEEEECCCHHHHHH T ss_conf 24699999999999729855389837-----852-688899-832898367--6688720375278899999 No 270 >TIGR03556 photolyase_8HDF deoxyribodipyrimidine photo-lyase, 8-HDF type. This model describes a narrow clade of cyanobacterial deoxyribodipyrimidine photo-lyase. This group, in contrast to several closely related proteins, uses a chromophore that, in other lineages is modified further to become coenzyme F420. This chromophore is called 8-HDF in most articles on the DNA photolyase and FO in most literature on coenzyme F420. Probab=43.28 E-value=19 Score=14.59 Aligned_cols=37 Identities=19% Similarity=0.206 Sum_probs=28.5 Q ss_pred HHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHH Q ss_conf 88648999999997099799568901665899999999 Q gi|254780277|r 91 LLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIA 128 (976) Q Consensus 91 l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~ 128 (976) |.+-+.-+++-|..+|++++...| ++-++|..|+.+. T Consensus 53 l~~sL~~L~~~L~~lg~~L~v~~G-~~~~vl~~l~~~~ 89 (471) T TIGR03556 53 LIGCLQELQQRYQQAGSQLLILQG-DPVQLIPQLAQQL 89 (471) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEC-CHHHHHHHHHHHC T ss_conf 999999999999987995699980-7899999999973 No 271 >PRK09536 btuD corrinoid ABC transporter ATPase; Reviewed Probab=43.11 E-value=19 Score=14.57 Aligned_cols=11 Identities=9% Similarity=0.063 Sum_probs=4.0 Q ss_pred CHHHHHHHCCC Q ss_conf 70356788199 Q gi|254780277|r 813 SPFRLATQLRI 823 (976) Q Consensus 813 ~~~~La~~l~i 823 (976) +.+-.|+.|++ T Consensus 300 ~d~~~A~~l~~ 310 (409) T PRK09536 300 TAAETARRVGC 310 (409) T ss_pred CHHHHHHHCCC T ss_conf 24899997198 No 272 >KOG0577 consensus Probab=42.91 E-value=17 Score=14.93 Aligned_cols=30 Identities=23% Similarity=0.333 Sum_probs=20.1 Q ss_pred HHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 3221014310058999986434368999999 Q gi|254780277|r 580 VSQMEITGIQIDQDLLKQVSAEISKNLLSLE 610 (976) Q Consensus 580 La~ME~~Gi~vD~~~L~~l~~~l~~~l~~le 610 (976) -++||..-+++|++ |..+...+..+++.+. T Consensus 511 k~e~eehr~~ldrE-le~~~~~f~~e~ekl~ 540 (948) T KOG0577 511 KGEREEHRARLDRE-LETLRANFSAELEKLA 540 (948) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH T ss_conf 77599999998889-9999853058899999 No 273 >COG0178 UvrA Excinuclease ATPase subunit [DNA replication, recombination, and repair] Probab=42.72 E-value=19 Score=14.53 Aligned_cols=39 Identities=23% Similarity=0.456 Sum_probs=30.8 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEE Q ss_conf 567881999899999999999988889999999999999849888 Q gi|254780277|r 816 RLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVE 860 (976) Q Consensus 816 ~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~ 860 (976) ..|.-|.+|++||.+| |+..|.|++-++..++-. -||++ T Consensus 777 ~IadVL~MTveEA~~F----F~~~p~I~rkLqtL~dVG--LgYi~ 815 (935) T COG0178 777 NIADVLDMTVEEALEF----FEAIPKIARKLQTLVDVG--LGYIK 815 (935) T ss_pred CHHHHHHCCHHHHHHH----HHCCHHHHHHHHHHHHCC--CCEEE T ss_conf 5999974619999998----760658999999999807--55475 No 274 >PRK02515 psbU photosystem II complex extrinsic protein precursor U; Provisional Probab=42.18 E-value=17 Score=14.94 Aligned_cols=11 Identities=9% Similarity=0.284 Sum_probs=4.8 Q ss_pred CCCCCCHHHHH Q ss_conf 45353178877 Q gi|254780277|r 648 GQWKTTAQDLE 658 (976) Q Consensus 648 G~~ST~~~~Le 658 (976) |-|.+-+|+|+ T Consensus 93 ~PY~sVeDvl~ 103 (144) T PRK02515 93 APYDSVEDVLN 103 (144) T ss_pred CCCCCHHHHHH T ss_conf 99787999971 No 275 >PRK05686 fliG flagellar motor switch protein G; Validated Probab=41.96 E-value=20 Score=14.45 Aligned_cols=19 Identities=32% Similarity=0.429 Sum_probs=6.5 Q ss_pred HHHHHHHCCCCH-HHHHHHH Q ss_conf 887765304762-7999999 Q gi|254780277|r 655 QDLEQINCGDNP-IIKNILE 673 (976) Q Consensus 655 ~~Le~L~~~~hp-~~~~iLe 673 (976) +.|-.+...+|| .+..||- T Consensus 121 ~~La~~L~~EhPQtiAlILs 140 (337) T PRK05686 121 QQLANFIRNEHPQTIALILS 140 (337) T ss_pred HHHHHHHHCCCHHHHHHHHH T ss_conf 99999987248589999998 No 276 >COG1897 MetA Homoserine trans-succinylase [Amino acid transport and metabolism] Probab=41.95 E-value=20 Score=14.44 Aligned_cols=75 Identities=15% Similarity=0.287 Sum_probs=52.6 Q ss_pred CEEEECCCCHHHEECCC-CEEEEECCCCCEECHHHHHHHHCCCHHHH-HHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHC Q ss_conf 49998687012122379-75999667883538999988848988787-77787518743589987763602499898413 Q gi|254780277|r 134 AVTIISTDKDLMQLVSP-TTCLYDTVKEEKIDIEHVIKKWGIPPEKM-VCLQALTGDSIDNIPGIPGIGYKTAALLLQEY 211 (976) Q Consensus 134 ~v~i~s~DkD~~Qlv~~-~v~~~~~~~~~~~~~~~v~~~~gv~p~~~-~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~ 211 (976) .|.+++..+-.-|=+.+ +|.+++.+.+++.++..+.--.|-.|-|+ +.++...+-.|-|-| +-.|+.-| T Consensus 19 nifvM~e~rA~~QdIRPL~IlilNLMP~Ki~TE~Q~lRLL~nsPLQV~itll~~~sh~~KnTp---------~eHl~~FY 89 (307) T COG1897 19 NIFVMTESRAKHQDIRPLKILILNLMPKKIETETQILRLLGNSPLQVDITLLRIDSHESKNTP---------AEHLNSFY 89 (307) T ss_pred CEEEECHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCCCC---------HHHHHHHH T ss_conf 818853014540577531035432473557779999998569975378899983376778880---------89999986 Q ss_pred CCHHHH Q ss_conf 678999 Q gi|254780277|r 212 GNLENI 217 (976) Q Consensus 212 gsle~i 217 (976) .+.|+| T Consensus 90 ~tfeeV 95 (307) T COG1897 90 CTFEEV 95 (307) T ss_pred HCHHHH T ss_conf 069988 No 277 >PRK06761 hypothetical protein; Provisional Probab=41.47 E-value=12 Score=16.06 Aligned_cols=18 Identities=50% Similarity=0.883 Sum_probs=15.2 Q ss_pred CCCCCCCCCHHHHHHHHH Q ss_conf 998776360249989841 Q gi|254780277|r 193 IPGIPGIGYKTAALLLQE 210 (976) Q Consensus 193 ipGv~GiG~KtA~~ll~~ 210 (976) |-|+||.|+-|.+..|.+ T Consensus 7 IEGlPGsGKSTta~~l~d 24 (281) T PRK06761 7 IEGLPGFGKSTTAHLLND 24 (281) T ss_pred EECCCCCCHHHHHHHHHH T ss_conf 966899980149999999 No 278 >PRK06582 coproporphyrinogen III oxidase; Provisional Probab=41.21 E-value=20 Score=14.37 Aligned_cols=17 Identities=0% Similarity=-0.170 Sum_probs=10.1 Q ss_pred HHHHHHHCCCEECCCCCE Q ss_conf 999999849888277876 Q gi|254780277|r 849 TKNFVRQNGYVETIFGRR 866 (976) Q Consensus 849 ~~~~a~~~gyv~T~~GRr 866 (976) ..+...+.|+++. -+|. T Consensus 355 ~l~~l~~~gLi~~-d~~i 371 (390) T PRK06582 355 NLKHYQALDLIRL-DENI 371 (390) T ss_pred HHHHHHHCCCEEE-CCEE T ss_conf 9999997979898-9989 No 279 >KOG0304 consensus Probab=41.20 E-value=20 Score=14.36 Aligned_cols=23 Identities=0% Similarity=-0.107 Sum_probs=11.5 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 99999999986685232332224 Q gi|254780277|r 376 DIQQWVQKLETIGSASFKIITDT 398 (976) Q Consensus 376 ~L~~li~~L~~~~~ia~DtETt~ 398 (976) ++...-+-+....++|+|||--| T Consensus 13 Em~~Ir~~v~~y~~IamDTEFPG 35 (239) T KOG0304 13 EMALIRECVKDYPYIAMDTEFPG 35 (239) T ss_pred HHHHHHHHHHHCCEEEECCCCCC T ss_conf 99999999975991475376785 No 280 >PRK07379 coproporphyrinogen III oxidase; Provisional Probab=41.07 E-value=20 Score=14.35 Aligned_cols=21 Identities=29% Similarity=0.112 Sum_probs=12.5 Q ss_pred HHHHHHHCCCEECCCCCEECC Q ss_conf 999999849888277876018 Q gi|254780277|r 849 TKNFVRQNGYVETIFGRRIHY 869 (976) Q Consensus 849 ~~~~a~~~gyv~T~~GRrr~~ 869 (976) ..+...+.|+++---+|.+.- T Consensus 356 ~l~~l~~~Gll~~~~~~i~LT 376 (399) T PRK07379 356 CLQPYFQQGWVELTGGRLRLT 376 (399) T ss_pred HHHHHHHCCCEEEECCEEEEE T ss_conf 999999679889979999994 No 281 >pfam09992 DUF2233 Predicted periplasmic protein (DUF2233). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=40.76 E-value=19 Score=14.51 Aligned_cols=13 Identities=15% Similarity=0.148 Sum_probs=8.3 Q ss_pred HHHHHHHHHCCCC Q ss_conf 2333453300222 Q gi|254780277|r 696 TQRVHTFYSLAST 708 (976) Q Consensus 696 dgRIH~~f~q~~t 708 (976) ||+|||.|+..+| T Consensus 126 dG~ihp~f~~~s~ 138 (212) T pfam09992 126 DGALHPRFLPDSD 138 (212) T ss_pred CCCCCCCCCCCCC T ss_conf 8932755469997 No 282 >cd01700 Pol_V Pol V was discovered in Escherichia coli as Umuc and UmuD proteins induced by UV. This branch of DNA polymerases is mostly found in bacteria. Pol V enables DNA replication to bypass covalently linked cys-sin T-T photo-dimers and 6-4 T-T or T-C photoproducts, which would otherwise stall the DNA replication fork. Probab=40.75 E-value=13 Score=15.86 Aligned_cols=74 Identities=11% Similarity=0.096 Sum_probs=34.2 Q ss_pred HHHHHHHHHHHHHHCCC-EECCCCCEECCCCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE Q ss_conf 99999999999998498-88277876018987-58837888778787701345653899999999999999986599818 Q gi|254780277|r 842 IHEYIEKTKNFVRQNGY-VETIFGRRIHYDEI-NSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLEDHKLATK 919 (976) Q Consensus 842 v~~~~~~~~~~a~~~gy-v~T~~GRrr~~p~i-~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~~~~~~~~ 919 (976) +..+.+++.+..+++|. .+|+.=..|+=+.- .+.....|+ .-++.|.+ .+.+|.+.|+-.+.+.++ .+...| T Consensus 263 l~~l~~~v~~rLr~~~~~~rtvtl~ir~s~F~~~~~~~~~~~----~~L~~pT~-~~~~I~~~a~~Ll~~~~~-~~~~~R 336 (344) T cd01700 263 LATYAARAAEKLRKQKSVASALTVFIRTSPFRPDPPYYSRSA----TILPTPTS-DTRDLIKAALRALEAIYR-PGYRYK 336 (344) T ss_pred HHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCCEEEEEE----EECCCCCC-CHHHHHHHHHHHHHHHHC-CCCCEE T ss_conf 999999999999981986037999998078878884453346----86898788-999999999999999826-999679 Q ss_pred EE Q ss_conf 99 Q gi|254780277|r 920 ML 921 (976) Q Consensus 920 ~i 921 (976) ++ T Consensus 337 li 338 (344) T cd01700 337 KA 338 (344) T ss_pred EE T ss_conf 99 No 283 >COG5290 IkappaB kinase complex, IKAP component [Transcription] Probab=40.56 E-value=6.3 Score=18.07 Aligned_cols=12 Identities=17% Similarity=0.385 Sum_probs=7.5 Q ss_pred EEECCHHHHHHH Q ss_conf 995598799999 Q gi|254780277|r 929 VFEVPEEEITTA 940 (976) Q Consensus 929 v~ev~~~~~~~~ 940 (976) ||+|++.-.+.| T Consensus 1210 I~sv~~~~~e~~ 1221 (1243) T COG5290 1210 IFSVERRLLERV 1221 (1243) T ss_pred EEEEEHHHHHHH T ss_conf 056308887664 No 284 >TIGR02784 addA_alphas double-strand break repair helicase AddA; InterPro: IPR014151 AddAB, also called RexAB, substitutes for RecBCD in several bacterial lineages. These DNA recombination proteins act before synapse and are particularly important for DNA repair of double-stranded breaks by homologous recombination. The term AddAB is used broadly, with AddA homologues between the alphaproteobacteria and the Firmicutes, while the partner AddB proteins show no strong homology across the two groups of species.. Probab=40.34 E-value=21 Score=14.27 Aligned_cols=91 Identities=24% Similarity=0.318 Sum_probs=48.0 Q ss_pred CCCCCEECHHHHHHHHHHHHHHHHCCCCC-CCEEEEEEECCCCCHHHHCCHHHHCCCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 89885500699999999999998485547-98799997274887132408652166798797588648999999997099 Q gi|254780277|r 29 KHDGLPVNAIAGFCNMLWKLLQNSRKESI-ASHFAVIFDYPAVTFRNEIYPDYKANRPKIPEMLLPQLPLVRLATQAFGI 107 (976) Q Consensus 29 ~~~G~~t~ai~gf~~~l~~~i~~~~~~~~-~~~~~v~fD~~~~tfR~~~~~~YKa~R~~~p~~l~~q~~~i~~~l~~~gi 107 (976) |+-|.-+..|+-||..| ++.|-=+-+ +.|+-+ .| | +.|=.+.++....+=- T Consensus 110 TPGGLKIQTIHAFCe~L---L~~FPLEAnv~~~F~~-~D-----------------------D-r~~A~l~~~arr~ll~ 161 (1190) T TIGR02784 110 TPGGLKIQTIHAFCEAL---LHQFPLEANVAGHFSV-ID-----------------------D-RAAAALLEEARRELLA 161 (1190) T ss_pred CCCCCHHHHHHHHHHHH---HHHCCCCCCCCCCCCE-EH-----------------------H-HHHHHHHHHHHHHHHH T ss_conf 78776157779999988---5207751378988533-02-----------------------6-8999999999999972 Q ss_pred CEEEECCEEHHHHHHHHHHHHHHCCCCEEE--ECCCCHHHEE Q ss_conf 799568901665899999999848994999--8687012122 Q gi|254780277|r 108 PAIEIQGFEADDIIATYTYIAEKEGFAVTI--ISTDKDLMQL 147 (976) Q Consensus 108 ~~~~~~~~EADDviatla~~~~~~~~~v~i--~s~DkD~~Ql 147 (976) ..-..+.-+-++.++++...+.+.|++-+| ++..||-++- T Consensus 162 ~~~~~~d~~~~~a~a~~l~~~~e~gl~~ll~ei~~~R~~~~~ 203 (1190) T TIGR02784 162 GAAAPPDPALAEALATVLEAAGETGLEALLAEIVAKRDALSA 203 (1190) T ss_pred HHCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 303788405899999999871655878999999961688878 No 285 >cd07922 CarBa CarBa is the A subunit of 2-aminophenol 1,6-dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 2-aminophenyl-2,3-diol. CarBa is the A subunit of 2-aminophenol 1,6-dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 2-aminophenyl-2,3-diol. 2-aminophenol 1,6-dioxygenase is a key enzyme in the carbazole degradation pathway isolated from bacterial strains with carbazole degradation ability. The enzyme is a heterotetramer composed of two A and two B subunits. CarB belongs to the class III extradiol dioxygenase family, composed of enzymes which use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Although the enzyme was originally isolated as a meta-cleavage enzyme for 2'-aminobiphenyl-2,3-diol involved in carbazole degradation, the enzyme has also shown high specificity for 2,3-dihydroxybiphenyl. Probab=40.25 E-value=13 Score=15.79 Aligned_cols=52 Identities=17% Similarity=0.332 Sum_probs=33.0 Q ss_pred EEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 8766510377787340698889999989809996889988886302513564407770356 Q gi|254780277|r 757 ILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGISPFRL 817 (976) Q Consensus 757 ilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~~~~~L 817 (976) ++-.|..||.+++.|+++. ++.- +=|+ .|+++|+.--.-.||.+|+.|..+. T Consensus 9 LiqeL~r~p~~~e~~~~dp--~av~--e~y~-----Lt~~E~~AL~eg~~gaL~~iGVhp~ 60 (81) T cd07922 9 LIQELFKDPGLIERFQDDP--SAVF--EEYG-----LTPAERAALREGTFGALTSIGVHPI 60 (81) T ss_pred HHHHHHCCHHHHHHHHHCH--HHHH--HHCC-----CCHHHHHHHHCCCHHHHHHCCCCHH T ss_conf 9999936988899987498--9987--7528-----9989999986388646773673699 No 286 >COG2251 Predicted nuclease (RecB family) [General function prediction only] Probab=40.19 E-value=20 Score=14.38 Aligned_cols=16 Identities=25% Similarity=0.605 Sum_probs=7.1 Q ss_pred CHHHHHHHHCCCCCCC Q ss_conf 1237889861886664 Q gi|254780277|r 470 KYDKLVLHRYGISMRG 485 (976) Q Consensus 470 KfD~~~L~~~GI~l~~ 485 (976) .|-...|.++|..+.+ T Consensus 142 a~ya~~La~~g~~v~~ 157 (474) T COG2251 142 AFYALLLAKSGVPVAG 157 (474) T ss_pred HHHHHHHHHCCCCCCC T ss_conf 9999999851786678 No 287 >TIGR02778 ligD_pol DNA polymerase LigD, polymerase domain; InterPro: IPR014145 DNA repair of double-stranded breaks by non-homologous end joining (NHEJ) is accomplished by a two-protein system that is present in a minority of prokaryotes. One component is the Ku protein (see IPR009187 from INTERPRO), which binds DNA ends. The other is a DNA ligase, a protein that is a multidomain polypeptide in most of those bacteria that have NheJ, a permuted polypeptide in Mycobacterium tuberculosis and a few other species, and the product of tandem genes in some other bacteria. This region represents the C-terminal polymerase domain.. Probab=40.12 E-value=21 Score=14.25 Aligned_cols=16 Identities=38% Similarity=0.418 Sum_probs=11.1 Q ss_pred HHHHCCCCCCCCCCCC Q ss_conf 8742078875556654 Q gi|254780277|r 633 LFTKLKFPSGAKTKTG 648 (976) Q Consensus 633 Lf~~LgLp~~kKTktG 648 (976) +++.|||+.--||--| T Consensus 146 ~L~~lgL~sFvKTSGG 161 (251) T TIGR02778 146 LLDELGLESFVKTSGG 161 (251) T ss_pred HHHHHCCCCCCEECCC T ss_conf 9987123564411589 No 288 >PRK01172 ski2-like helicase; Provisional Probab=39.83 E-value=21 Score=14.22 Aligned_cols=14 Identities=36% Similarity=0.359 Sum_probs=8.2 Q ss_pred HCCCCEEEECCCCH Q ss_conf 48994999868701 Q gi|254780277|r 130 KEGFAVTIISTDKD 143 (976) Q Consensus 130 ~~~~~v~i~s~DkD 143 (976) +-|+.|.+.+||.| T Consensus 90 ~~g~~v~~~tGd~~ 103 (674) T PRK01172 90 SLGMRVKISIGDYD 103 (674) T ss_pred CCCCEEEEEECCCC T ss_conf 37982778853888 No 289 >COG1335 PncA Amidases related to nicotinamidase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=39.63 E-value=21 Score=14.20 Aligned_cols=45 Identities=18% Similarity=0.227 Sum_probs=36.7 Q ss_pred HHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECC Q ss_conf 999999997099799568901665899999999848994999868 Q gi|254780277|r 96 PLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIST 140 (976) Q Consensus 96 ~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~ 140 (976) +-+..+|...||..+..-|..+|-++-+-++.+.+.|++|++++. T Consensus 122 T~L~~~Lr~~~i~~l~v~G~~Td~CV~~T~~~A~~~gy~v~v~~d 166 (205) T COG1335 122 TDLDDILRNLGIDTVVVCGIATDICVLATARDAFDLGYQVTLVED 166 (205) T ss_pred CCHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHCCCEEEEEHH T ss_conf 639999987793199997776029999999999967986999868 No 290 >TIGR02074 PBP_1a_fam penicillin-binding protein, 1A family; InterPro: IPR011816 Bacterial that synthesize a cell wall of peptidoglycan (murein) generally have several transglycosylases and transpeptidases for the task. This family consists of bifunctional transglycosylase/transpeptidase penicillin-binding proteins (PBP). In the Proteobacteria, this family includes PBP 1A but not the paralogous PBP 1B (IPR011813 from INTERPRO). This family also includes related proteins, often designated PBP 1A, from other bacterial lineages.; GO: 0008233 peptidase activity, 0016763 transferase activity transferring pentosyl groups, 0009252 peptidoglycan biosynthetic process, 0046677 response to antibiotic. Probab=39.60 E-value=19 Score=14.49 Aligned_cols=22 Identities=14% Similarity=0.354 Sum_probs=17.2 Q ss_pred CCCCCCHHHHHHHHCCCCCCCC Q ss_conf 8766772466776224442233 Q gi|254780277|r 499 GRSSHDIASIAQKWLSYTRKEI 520 (976) Q Consensus 499 ~~~~~~L~~la~~yL~~~~~e~ 520 (976) +.+.||..+-|+.|+|....++ T Consensus 112 G~G~YG~~~AA~~YFgK~v~~L 133 (700) T TIGR02074 112 GNGAYGVEAAAQVYFGKSVNDL 133 (700) T ss_pred CCCCHHHHHHHHHHCCCCHHHC T ss_conf 6884158899976069776527 No 291 >cd03586 Pol_IV_kappa Pol_IV_kappa, a member of the Y-family of DNA polymerases. Pol_Y's can transverse replication-blocking DNA lesions, such as cyclobutane pyrimidine dimers resulting from UV damage, at the cost of an elevated error rate. The Y-family has no 3'-5' exonuclease activity. In addition to possessing a topology akin to a right hand, with "thumb", "fingers" and "palm" motifs, like polymerases from the A-, B-, C- and X-families, the Y-family has a unique "little finger" motif. Expression of Y-family polymerases is often induced by DNA damage. These polymerases are phylogenetically unrelated to classical DNA polymerases. Originally called the DinB family, they belong to the recently described Y-family of DNA polymerases. Pol IV is mostly found in bacteria and archaea. Although the structure of Pol IV is similar to that of Pol eta, it shows markedly differenct efficiencies and fidelities in bypassing various DNA lesions. All Pol IV-like polymerases studied to date are able to b Probab=39.60 E-value=21 Score=14.20 Aligned_cols=27 Identities=19% Similarity=0.255 Sum_probs=14.2 Q ss_pred HHHHHHHHHCCCCCC--CCCCHHHHHHHH Q ss_conf 999999974034557--777656667998 Q gi|254780277|r 607 LSLEENIYDLSGEKF--NLSSPKQLGDIL 633 (976) Q Consensus 607 ~~le~~i~~~aG~~F--NinSpkQL~~vL 633 (976) ..+.++|++..|-+. -++..|.|+++- T Consensus 120 ~~ir~~I~~~tgl~~SiGIa~nk~lAKiA 148 (337) T cd03586 120 KEIRARIREETGLTCSVGIAPNKFLAKIA 148 (337) T ss_pred HHHHHHHHHHHCCCEEEEEECCHHHHHHH T ss_conf 99999999987997489995739999999 No 292 >COG0122 AlkA 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair] Probab=39.46 E-value=15 Score=15.30 Aligned_cols=17 Identities=24% Similarity=0.341 Sum_probs=6.6 Q ss_pred HHHHCCCCCCCCHHHHHH Q ss_conf 322101431005899998 Q gi|254780277|r 580 VSQMEITGIQIDQDLLKQ 597 (976) Q Consensus 580 La~ME~~Gi~vD~~~L~~ 597 (976) +|++..+|. +|.+.+.. T Consensus 174 ~A~~~~~g~-~~~~~l~~ 190 (285) T COG0122 174 LARAAAEGE-LDLSELKP 190 (285) T ss_pred HHHHHHCCC-CCHHHHCC T ss_conf 999998599-65676625 No 293 >TIGR01455 glmM phosphoglucosamine mutase; InterPro: IPR006352 This family describes GlmM, phosphoglucosamine mutase, also designated MrsA and YhbF in Escherichia coli , UreC in Helicobacter pylori , and femR315 or FemD in Staphlococcus aureus . It converts glucosamine-6-phosphate to glucosamine-1-phosphate as part of the pathway toward UDP-N-acetylglucosamine for peptidoglycan and lipopolysaccharides.. Probab=39.44 E-value=17 Score=14.96 Aligned_cols=19 Identities=21% Similarity=0.233 Sum_probs=10.7 Q ss_pred EEEHHHHHHHHHCCCCHHH Q ss_conf 5103777873406988899 Q gi|254780277|r 761 IAKITPLYQVFENSLDIHR 779 (976) Q Consensus 761 ls~D~~l~~af~~g~DiH~ 779 (976) -||-|.||..--+|.|-+. T Consensus 418 pSGTEPliRVMVEa~d~~~ 436 (450) T TIGR01455 418 PSGTEPLIRVMVEAADEEL 436 (450) T ss_pred CCCCCCCEEEEEECCCHHH T ss_conf 7899783168864488799 No 294 >TIGR01282 nifD nitrogenase molybdenum-iron protein alpha chain; InterPro: IPR005972 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene , . Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes . It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I .; GO: 0016163 nitrogenase activity, 0009399 nitrogen fixation, 0016612 molybdenum-iron nitrogenase complex. Probab=39.31 E-value=21 Score=14.16 Aligned_cols=16 Identities=25% Similarity=0.279 Sum_probs=5.9 Q ss_pred CEEEEEEEEEHHHHHH Q ss_conf 4368766510377787 Q gi|254780277|r 754 ELRILAHIAKITPLYQ 769 (976) Q Consensus 754 ElRilA~ls~D~~l~~ 769 (976) =|||.|.+|||-++-+ T Consensus 236 GlrvvA~~sGDgT~~~ 251 (521) T TIGR01282 236 GLRVVAQWSGDGTLAE 251 (521) T ss_pred CCEEEEEECCCHHHHH T ss_conf 9489985026268999 No 295 >TIGR00426 TIGR00426 competence protein ComEA helix-hairpin-helix repeat region; InterPro: IPR004509 This domain is found in competence protein ComEA and closely related proteins from a number of species that exhibit competence for transformation by exongenous DNA, including Streptococcus pneumoniae, Bacillus subtilis, Neisseria meningitidis, and Haemophilus influenzae. This domain represents a region of two tandem copies of a helix-hairpin-helix domain, each about 30 residues in length. Limited sequence similarity can be found among some members of this family N-terminal to this domain.. Probab=39.15 E-value=19 Score=14.64 Aligned_cols=17 Identities=24% Similarity=0.544 Sum_probs=11.1 Q ss_pred CCC-CCCCCCHHHHHHHH Q ss_conf 998-77636024998984 Q gi|254780277|r 193 IPG-IPGIGYKTAALLLQ 209 (976) Q Consensus 193 ipG-v~GiG~KtA~~ll~ 209 (976) |.+ +.|||+|-|...++ T Consensus 19 lq~~~~GvG~kKAeAIv~ 36 (70) T TIGR00426 19 LQKALSGVGAKKAEAIVA 36 (70) T ss_pred HHHHHCCCCHHHHHHHHH T ss_conf 887642887237899988 No 296 >PRK03858 DNA polymerase IV; Validated Probab=38.68 E-value=22 Score=14.10 Aligned_cols=20 Identities=35% Similarity=0.428 Sum_probs=11.1 Q ss_pred CCCCCCCCCCHHHHHHHHHCC Q ss_conf 899877636024998984136 Q gi|254780277|r 192 NIPGIPGIGYKTAALLLQEYG 212 (976) Q Consensus 192 nipGv~GiG~KtA~~ll~~~g 212 (976) .|--|+|||++|+.+| +++| T Consensus 174 pv~~iwGIG~~~~~~L-~~~G 193 (398) T PRK03858 174 PVRRLWGVGAVTAAKL-RAHG 193 (398) T ss_pred CCCCCCCCCHHHHHHH-HHCC T ss_conf 6012158687999999-9849 No 297 >PTZ00112 origin recognition complex 1 protein; Provisional Probab=38.28 E-value=11 Score=16.43 Aligned_cols=25 Identities=12% Similarity=0.038 Sum_probs=17.2 Q ss_pred CCCEEEEEECCCCCEEEEEEEEEHH Q ss_conf 9948999632654436876651037 Q gi|254780277|r 741 PTKKLISADYSQIELRILAHIAKIT 765 (976) Q Consensus 741 ~g~~lv~aDySQiElRilA~ls~D~ 765 (976) .|...+.-|-=|+=-|=.|.+|||- T Consensus 458 ~~~~~f~~dAIQl~ARKVAavSGDA 482 (650) T PTZ00112 458 ENCEIIDHTALQLCARRVTNYSGDM 482 (650) T ss_pred CCCCCCCHHHHHHHHHHHHHHCCHH T ss_conf 2677788789999988887503148 No 298 >PRK00285 ihfA integration host factor subunit alpha; Reviewed Probab=38.27 E-value=22 Score=14.05 Aligned_cols=22 Identities=18% Similarity=0.461 Sum_probs=18.2 Q ss_pred HHHHHCCCCHHHHHHHHHHHHH Q ss_conf 5678819998999999999999 Q gi|254780277|r 816 RLATQLRIPRSEAADYIKRYFH 837 (976) Q Consensus 816 ~La~~l~is~~eA~~~i~~yf~ 837 (976) .||+++|+|..+|..++|.||+ T Consensus 12 ~ia~~~g~s~~~a~~~v~~~~~ 33 (100) T PRK00285 12 LLFDKVGLSKREAKELVELFFE 33 (100) T ss_pred HHHHHHCCCHHHHHHHHHHHHH T ss_conf 9999869599999999999999 No 299 >PRK03103 DNA polymerase IV; Reviewed Probab=38.24 E-value=22 Score=14.05 Aligned_cols=15 Identities=27% Similarity=0.406 Sum_probs=8.1 Q ss_pred CCCCCCCCCHHHHHH Q ss_conf 998776360249989 Q gi|254780277|r 193 IPGIPGIGYKTAALL 207 (976) Q Consensus 193 ipGv~GiG~KtA~~l 207 (976) |.=|+|||++|+.+| T Consensus 184 v~~iwGIG~~~~~kL 198 (410) T PRK03103 184 IGKLFGVGRRMEHHL 198 (410) T ss_pred HHHCCCCCHHHHHHH T ss_conf 133068788999999 No 300 >COG1875 NYN ribonuclease and ATPase of PhoH family domains [General function prediction only] Probab=38.16 E-value=18 Score=14.78 Aligned_cols=34 Identities=38% Similarity=0.692 Sum_probs=15.5 Q ss_pred HHHHHHCCCHH---HHHHHHHHCCCCCC--CCCCCCCCC Q ss_conf 99888489887---87777875187435--899877636 Q gi|254780277|r 167 HVIKKWGIPPE---KMVCLQALTGDSID--NIPGIPGIG 200 (976) Q Consensus 167 ~v~~~~gv~p~---~~~d~~al~GD~sD--nipGv~GiG 200 (976) .-++-|||.|. |-.-+-+|.-|..| ++-|-.|-| T Consensus 219 ~~~~vwGi~prn~eQ~~ALdlLld~dI~lV~L~G~AGtG 257 (436) T COG1875 219 EDQEVWGIRPRNAEQRVALDLLLDDDIDLVSLGGKAGTG 257 (436) T ss_pred CCHHHHCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCC T ss_conf 750200657455888888998638777848863357766 No 301 >PRK03695 vitamin B12-transporter ATPase; Provisional Probab=38.14 E-value=22 Score=14.04 Aligned_cols=16 Identities=19% Similarity=0.439 Sum_probs=6.3 Q ss_pred CCCCCCEEEECCCCCEEE Q ss_conf 222243024248994899 Q gi|254780277|r 729 LGQKIRRAFISPPTKKLI 746 (976) Q Consensus 729 ~g~~iR~~F~a~~g~~lv 746 (976) +|..+|.. .-.|+-++ T Consensus 227 y~~~~~~l--~~~g~~~~ 242 (245) T PRK03695 227 FGTDFRRL--DVEGHPML 242 (245) T ss_pred HCCCEEEE--EECCEEEE T ss_conf 58965999--98998899 No 302 >TIGR02061 aprA adenylylsulfate reductase, alpha subunit; InterPro: IPR011803 During dissimilatory sulphate reduction or sulphur oxidation, adenylylsulphate (APS) reductase catalyzes reversibly the two-electron reduction of APS to sulphite and AMP. Found in several bacterial lineages and in Archaeoglobales, APS reductase is a heterodimer composed of an alpha subunit containing a noncovalently bound FAD, and a beta subunit containing two [4Fe-4S] clusters. This entry describes the alpha subunit of APS reductase, which shares a common evolutionary origin with fumarate reductase/succinate dehydrogenase flavoproteins.. Probab=37.58 E-value=19 Score=14.53 Aligned_cols=17 Identities=24% Similarity=0.368 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999986 Q gi|254780277|r 897 ADITRRAMISVHKSLED 913 (976) Q Consensus 897 Adi~k~am~~i~~~~~~ 913 (976) =|+.|+|-=+||+.|+. T Consensus 567 EDleKLAArdlHELlRa 583 (651) T TIGR02061 567 EDLEKLAARDLHELLRA 583 (651) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 11778878778899988 No 303 >pfam04748 Polysacc_deac_2 Divergent polysaccharide deacetylase. This family is divergently related to pfam01522 (personal obs:Yeats C). Probab=37.48 E-value=8.5 Score=17.08 Aligned_cols=49 Identities=18% Similarity=0.381 Sum_probs=28.6 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCC Q ss_conf 356788199989999999999998888999999999999984988827787 Q gi|254780277|r 815 FRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGR 865 (976) Q Consensus 815 ~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GR 865 (976) ..+|+.+|++..+..-|+|.- .....|++.+++..+.|+++|++--+ |+ T Consensus 134 ~~~A~~~gvp~~~rdvfLD~~-~~~~~I~~ql~~~~~~A~~~G~aI~I-gh 182 (213) T pfam04748 134 PKLAKELGVPAAKRDVFLDNE-QTEAAIRRQLDQAAALARKEGSAIAI-GH 182 (213) T ss_pred HHHHHHCCCCEEEEEEECCCC-CCHHHHHHHHHHHHHHHHHCCCEEEE-EC T ss_conf 999998399867610314799-99999999999999999863957999-77 No 304 >PHA00666 putative protease Probab=37.07 E-value=23 Score=13.92 Aligned_cols=14 Identities=21% Similarity=0.427 Sum_probs=9.2 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 86532336899987 Q gi|254780277|r 274 SFLKALEFTKLINR 287 (976) Q Consensus 274 ~~~~~lef~~ll~r 287 (976) ++|++|=|++|... T Consensus 2 nlf~rl~~rrl~ne 15 (233) T PHA00666 2 NLFERLLMRRLCNE 15 (233) T ss_pred CHHHHHHHHHHHCC T ss_conf 58999999998536 No 305 >PRK13506 formate--tetrahydrofolate ligase; Provisional Probab=37.00 E-value=15 Score=15.36 Aligned_cols=52 Identities=25% Similarity=0.385 Sum_probs=29.5 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCC---CCC-CCCCCCCCCCCHHH Q ss_conf 3436899999999974034557777656667998742078---875-55665453531788 Q gi|254780277|r 600 AEISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLKF---PSG-AKTKTGQWKTTAQD 656 (976) Q Consensus 600 ~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~LgL---p~~-kKTktG~~ST~~~~ 656 (976) ..+.++++.+.++||...|-+|...--+||.. |+++|+ |+- -|| +||.+.+- T Consensus 463 ~~i~eKIe~IA~eIYGA~~V~~s~~A~~qL~~--~e~~G~~~lPVCmAKT---qySfSdDp 518 (577) T PRK13506 463 MSLEAKLMTLAEVGYGAAGVSLSDKAKQQLAE--LTALGYDHLPVCMAKT---PLSISHDP 518 (577) T ss_pred CCHHHHHHHHHHHCCCCCCEEECHHHHHHHHH--HHHCCCCCCCEEEECC---CCCCCCCH T ss_conf 99999999999862399857889999999999--9975999888768747---87757792 No 306 >pfam09507 CDC27 DNA polymerase subunit Cdc27. This protein forms the C subunit of DNA polymerase delta. It carries the essential residues for binding to the Pol1 subunit of polymerase alpha, from residues 293-332, which are characterized by the motif D--G--VT, referred to as the DPIM motif. The first 160 residues of the protein form the minimal domain for binding to the B subunit, Cdc1, of polymerase delta, the final 10 C-terminal residues, 362-372, being the DNA sliding clamp, PCNA, binding motif. Probab=36.81 E-value=23 Score=13.90 Aligned_cols=14 Identities=14% Similarity=0.266 Sum_probs=5.8 Q ss_pred EEECCHHHHHHHHH Q ss_conf 35479999999999 Q gi|254780277|r 369 TKIVNTKDIQQWVQ 382 (976) Q Consensus 369 ~~I~~~~~L~~li~ 382 (976) ..+...++|..... T Consensus 58 i~Lv~Ee~Le~~k~ 71 (418) T pfam09507 58 VVLVREDDLEETKS 71 (418) T ss_pred EEEECHHHHHHHHH T ss_conf 99845336777774 No 307 >pfam01312 Bac_export_2 FlhB HrpN YscU SpaS Family. This family includes the following members: FlhB, HrpN, YscU, SpaS, HrcU SsaU and YopU. All of these proteins export peptides using the type III secretion system. The peptides exported are quite diverse. Probab=36.66 E-value=24 Score=13.88 Aligned_cols=38 Identities=24% Similarity=0.357 Sum_probs=23.0 Q ss_pred CCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCC---EEEEEEEEEHH Q ss_conf 33345563322224302424899489996326544---36876651037 Q gi|254780277|r 720 LQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIE---LRILAHIAKIT 765 (976) Q Consensus 720 lQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiE---lRilA~ls~D~ 765 (976) ++++|.- ...+..|-|.-|...|..=+ ++|+|--.+.- T Consensus 252 ~~~V~~A--------~vVItNPTH~AVAL~Y~~~~~~aP~vvAKG~d~~ 292 (343) T pfam01312 252 MQDVPKA--------DVVVTNPTHYAVALYYDPEEMPAPRIVAKGEDEQ 292 (343) T ss_pred HHCCCCC--------CEEEECCCCEEEEEEECCCCCCCCEEEEECCCHH T ss_conf 7258989--------8899999862789887889999998888548699 No 308 >PRK10982 galactose/methyl galaxtoside transporter ATP-binding protein; Provisional Probab=35.21 E-value=25 Score=13.72 Aligned_cols=20 Identities=5% Similarity=0.139 Sum_probs=9.3 Q ss_pred HHHHHHHHHCCCCEEEEEEECC Q ss_conf 9999999865998189997263 Q gi|254780277|r 905 ISVHKSLEDHKLATKMLLQIHD 926 (976) Q Consensus 905 ~~i~~~~~~~~~~~~~ilqVHD 926 (976) .++-+.+++.|. -+|+--|| T Consensus 431 ~~ll~~l~~~G~--~il~iSHD 450 (491) T PRK10982 431 YQLIAELAKKDK--GIIIISSE 450 (491) T ss_pred HHHHHHHHHCCC--EEEEEECC T ss_conf 999999996899--99999585 No 309 >PRK02794 DNA polymerase IV; Provisional Probab=35.00 E-value=17 Score=14.85 Aligned_cols=63 Identities=22% Similarity=0.192 Sum_probs=29.2 Q ss_pred HHHHHHHHHHHHHHHCCCE-ECCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8999999999999984988-82778760189875883788877878770134565389999999999999998 Q gi|254780277|r 841 GIHEYIEKTKNFVRQNGYV-ETIFGRRIHYDEINSPKSSIRNINERAAINAPIQGSAADITRRAMISVHKSLE 912 (976) Q Consensus 841 ~v~~~~~~~~~~a~~~gyv-~T~~GRrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~k~am~~i~~~~~ 912 (976) .+..+.+++.+..++.|+. +|+.=..|+=+. .+.. | ...+..|.+- ...|++.|+-.+.+... T Consensus 295 ~l~~l~~~va~rLR~~~~~a~~i~v~ir~~df-~~~t---r----~~tL~~pT~~-~~~i~~~a~~Ll~~~~~ 358 (417) T PRK02794 295 HLWRLSEKVSRRLKAAGLAGRTVTLKLKTADF-RLRT---R----RRTLGDPTQL-ADRIFRTARELLEKETD 358 (417) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEEEEEECCC-CCCC---E----EEECCCCCCC-HHHHHHHHHHHHHHHHC T ss_conf 99999999999999759846899999994899-6431---6----8872886689-99999999999998656 No 310 >cd04913 ACT_AKii-LysC-BS-like_1 ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168 and related proteins. This CD includes the N-terminal of the two ACT domains of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis (BS) strain 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related sequences. In B. subtilis 168, the regulation of the diaminopimelate (Dap)-lysine biosynthetic pathway involves dual control by Dap and lysine, effected through separate Dap- and lysine-sensitive aspartokinase isoenzymes. The B. subtilis 168 AKII is induced by methionine and repressed and inhibited by lysine. Although Corynebacterium glutamicum is known to contain a single aspartokinase, both the succinylase and dehydrogenase variant pathways of DAP-lysine synthesis operate simultaneously in this organism. In corynebacteria and other various Gram-positive bacteria, the DAP-lysine pathway is fee Probab=34.85 E-value=25 Score=13.68 Aligned_cols=51 Identities=22% Similarity=0.401 Sum_probs=41.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCE-----EEEECCHHHHHHHHHHHHHHHHHC Q ss_conf 65389999999999999998659981899972638-----999559879999999999996510 Q gi|254780277|r 893 QGSAADITRRAMISVHKSLEDHKLATKMLLQIHDE-----LVFEVPEEEITTASQIIVHSMEKA 951 (976) Q Consensus 893 QGsaAdi~k~am~~i~~~~~~~~~~~~~ilqVHDE-----lv~ev~~~~~~~~~~~~~~~Me~~ 951 (976) -|.||.| ...|.+.+...-||.|---+ +-|-||+++++++..++++.++.. T Consensus 13 PG~aa~i--------F~~La~~~InVDmIvQ~~~~~~~~disFTv~~~dl~~a~~vl~~~~~~l 68 (75) T cd04913 13 PGVAAKI--------FGALAEANINVDMIVQNVSRDGTTDISFTVPKSDLKKALAVLEKLKKEL 68 (75) T ss_pred CCHHHHH--------HHHHHHCCCCEEEEEECCCCCCCEEEEEEECHHHHHHHHHHHHHHHHHC T ss_conf 9669999--------9999985984799986167899458999904999999999999999864 No 311 >TIGR03631 bact_S13 30S ribosomal protein S13. This model describes bacterial ribosomal protein S13, to the exclusion of the homologous archaeal S13P and eukaryotic ribosomal protein S18. This model identifies some (but not all) instances of chloroplast and mitochondrial S13, which is of bacterial type. Probab=34.74 E-value=19 Score=14.50 Aligned_cols=15 Identities=40% Similarity=0.569 Sum_probs=5.4 Q ss_pred CCCCCCHHHHHHHHH Q ss_conf 776360249989841 Q gi|254780277|r 196 IPGIGYKTAALLLQE 210 (976) Q Consensus 196 v~GiG~KtA~~ll~~ 210 (976) |.|||+++|..+... T Consensus 20 I~GIG~~~A~~Ic~~ 34 (113) T TIGR03631 20 IYGIGRTRARKILEK 34 (113) T ss_pred EECCCHHHHHHHHHH T ss_conf 027589999999999 No 312 >KOG1921 consensus Probab=34.56 E-value=14 Score=15.57 Aligned_cols=10 Identities=40% Similarity=0.720 Sum_probs=4.6 Q ss_pred CCCCCCCCHH Q ss_conf 0143100589 Q gi|254780277|r 584 EITGIQIDQD 593 (976) Q Consensus 584 E~~Gi~vD~~ 593 (976) ...||.||.. T Consensus 182 ~i~GI~VDtH 191 (286) T KOG1921 182 KIVGICVDTH 191 (286) T ss_pred CCEEEEEEHH T ss_conf 6405786328 No 313 >TIGR00922 nusG transcription termination/antitermination factor NusG; InterPro: IPR001062 Bacterial transcription antitermination protein, nusG, is a component of the transcription complex and interacts with the termination factor Rho and RNA polymerase , . NusG is a bacterial transcriptional elongation factor involved in transcription termination and anti-termination .; GO: 0003711 transcription elongation regulator activity, 0006355 regulation of transcription DNA-dependent. Probab=34.32 E-value=24 Score=13.78 Aligned_cols=13 Identities=38% Similarity=0.705 Sum_probs=6.9 Q ss_pred CEEEECCCCCEEEEEE Q ss_conf 3024248994899963 Q gi|254780277|r 734 RRAFISPPTKKLISAD 749 (976) Q Consensus 734 R~~F~a~~g~~lv~aD 749 (976) |++| |||+||-+| T Consensus 58 rK~f---PGYvLv~m~ 70 (193) T TIGR00922 58 RKIF---PGYVLVKMD 70 (193) T ss_pred EEEC---CCEEEEEEE T ss_conf 1154---864899997 No 314 >KOG1521 consensus Probab=33.80 E-value=9.2 Score=16.85 Aligned_cols=40 Identities=20% Similarity=0.341 Sum_probs=27.9 Q ss_pred HHHHHHHHHHCCCCCEECCCCCCCCC-CCCCCCCCCCCCCCEEE Q ss_conf 12333453300222100000133343-33455633222243024 Q gi|254780277|r 695 RTQRVHTFYSLASTMTGRLASLEPNL-QNIPIKTDLGQKIRRAF 737 (976) Q Consensus 695 ~dgRIH~~f~q~~t~TGRlSs~~PNl-QNiP~r~~~g~~iR~~F 737 (976) .=|+-|.-|..+.|+|-||= |-. -+-|+.++...++-+|| T Consensus 214 GIGkdHAKfSPVaTAsYRLl---P~I~l~~ei~ge~A~~lqkcf 254 (338) T KOG1521 214 GIGKDHAKFSPVATASYRLL---PTIVLLAEIEGEDAERLQKCF 254 (338) T ss_pred CCCCCCCCCCCCCCCEEECC---CEEEECCCCCCHHHHHHHHHC T ss_conf 56755443057522020005---305734745646599998518 No 315 >TIGR01862 N2-ase-Ialpha nitrogenase component I, alpha chain; InterPro: IPR010143 This entry represents all three varieties (Fe-Fe, Mo-Fe and V-Fe) of the component I alpha chain of nitrogenase. Nitrogenase (1.18.6.1 from EC) is the enzyme system responsible for biological nitrogen fixation. Nitrogenase is an oligomeric complex which consists of two components: component 2 is an homodimer of an iron-sulphur protein, while component 1 which contains the active site for the reduction of nitrogen to ammonia exists in three different forms: the molybdenum-iron containing protein (MoFe) is a hetero-tetramer consisting of two pairs of alpha (nifD) and beta (nifK) subunits; the vanadium-iron containing protein (VFe) is a hexamer of two pairs each of alpha (vnfD), beta (vnfK), and delta (vnfG) subunits; the third form seems to only contain iron and is a hexamer composed of alpha (anfD), beta (anfK), and delta (anfG) subunits. The alpha and beta chains of the three types of component 1 are evolutionary related and they are also related to proteins nifE and nifN, which are most probably involved in the iron-molybdenum cofactor biosynthesis .; GO: 0016163 nitrogenase activity, 0051536 iron-sulfur cluster binding, 0009399 nitrogen fixation. Probab=33.43 E-value=16 Score=15.12 Aligned_cols=15 Identities=13% Similarity=-0.144 Sum_probs=8.8 Q ss_pred HHHHHHHHHCCCCCC Q ss_conf 667998742078875 Q gi|254780277|r 628 QLGDILFTKLKFPSG 642 (976) Q Consensus 628 QL~~vLf~~LgLp~~ 642 (976) .+...++++||+.+. T Consensus 345 H~~~~~~~~lG~~VV 359 (510) T TIGR01862 345 HWIKSLEEDLGVEVV 359 (510) T ss_pred HHHHHHHHHCCCEEE T ss_conf 111257776394599 No 316 >KOG2404 consensus Probab=33.13 E-value=3.6 Score=19.79 Aligned_cols=38 Identities=32% Similarity=0.467 Sum_probs=19.5 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCCEEC Q ss_conf 356788199989999999999998888999999999999984988827787601 Q gi|254780277|r 815 FRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYVETIFGRRIH 868 (976) Q Consensus 815 ~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~ 868 (976) .-|+++++|+.+|-+.-.+.|-.. ..|-.+-.|||+.. T Consensus 349 ~el~s~ln~t~sel~ttl~eY~~~----------------~~g~~~D~fgrk~f 386 (477) T KOG2404 349 AELASALNITESELKTTLEEYSKS----------------FTGKSEDPFGRKVF 386 (477) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHH----------------HCCCCCCCCCCCCC T ss_conf 999998698899999999998775----------------33888886787112 No 317 >COG1031 Uncharacterized Fe-S oxidoreductase [Energy production and conversion] Probab=32.84 E-value=17 Score=14.98 Aligned_cols=31 Identities=26% Similarity=0.238 Sum_probs=18.2 Q ss_pred HHHHCCCCCEECCCCCCCCCCCCCCCCCCCC Q ss_conf 5330022210000013334333455633222 Q gi|254780277|r 701 TFYSLASTMTGRLASLEPNLQNIPIKTDLGQ 731 (976) Q Consensus 701 ~~f~q~~t~TGRlSs~~PNlQNiP~r~~~g~ 731 (976) ....+.+..-||---|=|=|--||.|-+.|+ T Consensus 457 vE~~~~~~tfgRQ~GSYPllvgi~~~~~~g~ 487 (560) T COG1031 457 VEVYEGGLTFGRQLGSYPLLVGIPGRLELGR 487 (560) T ss_pred EEEECCCCEEECCCCCCCEEEECCCCCCCCC T ss_conf 9994287302113577644763035455785 No 318 >TIGR02005 PTS-IIBC-alpha PTS system, alpha-glucoside-specific IIBC component; InterPro: IPR010975 This entry represents the fused PTS enzyme II B and C domains. A gene from Clostridium has been partially characterised as a maltose transporter, while genes from Fusobacterium and Klebsiella , have been proposed to transport the five non-standard isomers of sucrose.; GO: 0008982 protein-N(PI)-phosphohistidine-sugar phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system. Probab=32.78 E-value=27 Score=13.45 Aligned_cols=22 Identities=27% Similarity=0.347 Sum_probs=9.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 6768899865323368999876 Q gi|254780277|r 267 CNGPRLISFLKALEFTKLINRV 288 (976) Q Consensus 267 ~d~~~l~~~~~~lef~~ll~rL 288 (976) .-..-|..+..-|-||.+++-+ T Consensus 83 ~grAcLa~L~~YLtfNyFInAi 104 (533) T TIGR02005 83 QGRACLAVLLSYLTFNYFINAI 104 (533) T ss_pred CHHHHHHHHHHHHHHHHHHHHH T ss_conf 5458999999987899999998 No 319 >PRK11145 pflA pyruvate formate lyase-activating enzyme 1; Provisional Probab=32.72 E-value=27 Score=13.44 Aligned_cols=29 Identities=17% Similarity=0.224 Sum_probs=17.1 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 67998742078875556654535317887765 Q gi|254780277|r 629 LGDILFTKLKFPSGAKTKTGQWKTTAQDLEQI 660 (976) Q Consensus 629 L~~vLf~~LgLp~~kKTktG~~ST~~~~Le~L 660 (976) +|.-=|+.||.+...+ +-++++.+.|+++ T Consensus 205 ~G~~Ky~~lg~~Y~~~---~~~~~~~e~l~~~ 233 (246) T PRK11145 205 LGKHKWEAMGEEYKLD---GVKPPSKETMERI 233 (246) T ss_pred CCHHHHHHCCCCCCCC---CCCCCCHHHHHHH T ss_conf 6654799839998888---9899799999999 No 320 >TIGR01057 topA_arch DNA topoisomerase I; InterPro: IPR005739 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA. This entry describes topoisomerase I from archaea, which is more closely related to bacterial than to eukaryotic topoisomerase I . Topoisomerase I is the major enzyme for relaxing negatively supercoiled DNA, and its presence is balanced by reverse gyrase, which can introduce negative supercoils. More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003916 DNA topoisomerase activity, 0006265 DNA topological change, 0006268 DNA unwinding during replication, 0005694 chromosome. Probab=32.61 E-value=27 Score=13.43 Aligned_cols=101 Identities=17% Similarity=0.317 Sum_probs=42.8 Q ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCC--CCCC-HHHHHHHHCCCCHHHHHHHHHHHHHH-HHHHH Q ss_conf 99999740345577776566679987420788755566545--3531-78877653047627999999987888-98865 Q gi|254780277|r 609 LEENIYDLSGEKFNLSSPKQLGDILFTKLKFPSGAKTKTGQ--WKTT-AQDLEQINCGDNPIIKNILEWRQLSK-IKSTY 684 (976) Q Consensus 609 le~~i~~~aG~~FNinSpkQL~~vLf~~LgLp~~kKTktG~--~ST~-~~~Le~L~~~~hp~~~~iLeyR~l~K-L~sTY 684 (976) |+.++|..-| |.|--++.+++=||++ +|-.-.+|-+=+ .|-+ ...|.+| ++.+|. ||...- |+.+ T Consensus 287 LQ~EAy~~fG--~sPk~Tq~IAq~LY~~-al~SYPRT~SQKLP~~i~Y~~il~nL-akn~P~------yr~~~e~L~~~- 355 (637) T TIGR01057 287 LQREAYRVFG--FSPKKTQEIAQELYEE-ALISYPRTSSQKLPKSINYRKILKNL-AKNNPL------YREYAERLLEK- 355 (637) T ss_pred HHHHHHHHCC--CCCHHHHHHHHHHHHC-CCCCCCCCCCCCCCCCCCHHHHHHHH-HHCCCC------CHHHHHHHHHC- T ss_conf 5899998607--9964468999998642-35158878766788877879999999-724898------46899998716- Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC--CEECCCCCCCCC Q ss_conf 320011110012333453300222--100000133343 Q gi|254780277|r 685 ADSLPNHINKRTQRVHTFYSLAST--MTGRLASLEPNL 720 (976) Q Consensus 685 ~~~l~~~i~~~dgRIH~~f~q~~t--~TGRlSs~~PNl 720 (976) ...|...-..++.==||-.+.||- .-|+||-.+-.+ T Consensus 356 G~~LkPveGkKeDpAHPAIhptG~~P~~G~LSk~e~~v 393 (637) T TIGR01057 356 GGELKPVEGKKEDPAHPAIHPTGEIPSEGELSKDEKKV 393 (637) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH T ss_conf 78878857877966788600567888888777788999 No 321 >PRK06746 peptide chain release factor 2; Provisional Probab=32.54 E-value=27 Score=13.42 Aligned_cols=71 Identities=18% Similarity=0.296 Sum_probs=40.6 Q ss_pred CHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 02499898413678999864542001122322003578999765543203564224763552126767688998653233 Q gi|254780277|r 201 YKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETARLSRKLVMLRTDVPITIPLEHLILEDCNGPRLISFLKALE 280 (976) Q Consensus 201 ~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~~~~~d~~~l~~~~~~le 280 (976) ++-|.++++++.+|+.+++..+++.. . .+.| ....+|..-..|-.+. +++ ......+...++++| T Consensus 10 ~~~A~~i~kE~s~l~~~v~~~~~l~~-~-~~dl-------~~l~eL~~ee~D~el~---ee~---~~el~~l~~~i~~lE 74 (326) T PRK06746 10 QQGAQAVINEANALKDMVGKFRQLDE-T-FENL-------EITHELLKEEYDEDLH---EEL---ESEVKGLIQEMNEYE 74 (326) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-H-HHHH-------HHHHHHHHCCCCHHHH---HHH---HHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999-9-9979-------9999998548998999---999---999999999999999 Q ss_pred HHHHHH Q ss_conf 689998 Q gi|254780277|r 281 FTKLIN 286 (976) Q Consensus 281 f~~ll~ 286 (976) ++.|+. T Consensus 75 ~~~lls 80 (326) T PRK06746 75 LQLLLS 80 (326) T ss_pred HHHCCC T ss_conf 985449 No 322 >PRK04040 adenylate kinase; Provisional Probab=32.49 E-value=21 Score=14.26 Aligned_cols=17 Identities=29% Similarity=0.556 Sum_probs=11.7 Q ss_pred CCCCCCCCCHHHHHHHH Q ss_conf 99877636024998984 Q gi|254780277|r 193 IPGIPGIGYKTAALLLQ 209 (976) Q Consensus 193 ipGv~GiG~KtA~~ll~ 209 (976) |-||||+|.-|..+.+. T Consensus 7 vtGiPGvGKTTv~~~~~ 23 (189) T PRK04040 7 VTGVPGVGKTTVLNKAL 23 (189) T ss_pred EECCCCCCHHHHHHHHH T ss_conf 97589887899999999 No 323 >PRK11609 nicotinamidase/pyrazinamidase; Provisional Probab=32.29 E-value=27 Score=13.40 Aligned_cols=44 Identities=16% Similarity=0.187 Sum_probs=25.4 Q ss_pred HHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEEC Q ss_conf 99999999709979956890166589999999984899499986 Q gi|254780277|r 96 PLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIIS 139 (976) Q Consensus 96 ~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s 139 (976) .-+.++|...||..+-+-|.=.|=++-+=++.|...||+|++++ T Consensus 132 T~L~~~Lr~~gI~~lvv~Gv~Td~CV~~TardA~~~Gy~v~vv~ 175 (213) T PRK11609 132 TALDDWLREHEIDELIVMGLATDYCVKFTVLDALQLGYKVNVIT 175 (213) T ss_pred CHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHCCCEEEEEC T ss_conf 50999999669987999856726799999999997799899952 No 324 >PRK03968 DNA primase large subunit; Validated Probab=31.96 E-value=28 Score=13.36 Aligned_cols=24 Identities=21% Similarity=0.440 Sum_probs=20.2 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 249989841367899986454200 Q gi|254780277|r 202 KTAALLLQEYGNLENILINASRIK 225 (976) Q Consensus 202 KtA~~ll~~~gsle~i~~~~d~i~ 225 (976) +-|..|+.+||+|+++++..+.+- T Consensus 12 ~~AkeivkefGsin~~l~~ip~lv 35 (388) T PRK03968 12 EEAKELVKEFGSINELLDAIPSLV 35 (388) T ss_pred HHHHHHHHHHCCHHHHHHCCCCHH T ss_conf 899999998435988875072123 No 325 >TIGR00977 LeuA_rel 2-isopropylmalate synthase/homocitrate synthase family protein; InterPro: IPR005675 This model represents largely uncharacterised proteins related to 2-isopropylmalate synthases and homocitrate synthases but phylogenetically distinct. ; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process. Probab=31.85 E-value=28 Score=13.35 Aligned_cols=95 Identities=26% Similarity=0.422 Sum_probs=56.5 Q ss_pred HHHHHHHHCCCCCEEC--CCCCCCCCC---------------------------CCCCCCCCCCCCCEEEECCCCCEEEE Q ss_conf 3334533002221000--001333433---------------------------34556332222430242489948999 Q gi|254780277|r 697 QRVHTFYSLASTMTGR--LASLEPNLQ---------------------------NIPIKTDLGQKIRRAFISPPTKKLIS 747 (976) Q Consensus 697 gRIH~~f~q~~t~TGR--lSs~~PNlQ---------------------------NiP~r~~~g~~iR~~F~a~~g~~lv~ 747 (976) -.||+|.|-.|-.||- |-|-=|||| |.|.....=.-=+.|| |-+|.+-+| T Consensus 232 ~~V~GTING~GER~GNANLCs~IPNL~LKl~~dvI~~E~L~~L~~~a~lvaEi~n~~~~~~~PyVG~~AF-AHKGGvHvS 310 (543) T TIGR00977 232 TQVQGTINGLGERCGNANLCSVIPNLQLKLGYDVIPKENLKKLTEVARLVAEIVNLPLDENMPYVGESAF-AHKGGVHVS 310 (543) T ss_pred CEECCEECCCCCCCCCCCHHHHCCCHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC-CCCCCEEEE T ss_conf 1542204255556576411111001021026645686667889989999999854885346782366322-025865776 Q ss_pred E------ECCCCCEEEE--------EEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHHHH Q ss_conf 6------3265443687--------6651037778734069888999998980999688998888630251 Q gi|254780277|r 748 A------DYSQIELRIL--------AHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAKTI 804 (976) Q Consensus 748 a------DySQiElRil--------A~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~i 804 (976) | -|-.|.+-+. ..+||-.++++- .+.||+.+|.-+|+.|..-+.+ T Consensus 311 Av~r~p~TYEHI~P~LVGN~R~I~vSe~aG~SNvl~K------------~~~~G~~~D~~~P~~r~il~~~ 369 (543) T TIGR00977 311 AVKRNPKTYEHIDPELVGNKRKIVVSELAGKSNVLEK------------LKEFGIEIDEKSPKVRKILKKI 369 (543) T ss_pred EEECCCCCCCCCCCCCCCCEEEEEEECCCCCHHHHHH------------HHHHCHHCCCCCCHHHHHHHHH T ss_conf 5505887423458643177137998235573148999------------9870401057880689999999 No 326 >KOG0523 consensus Probab=31.75 E-value=16 Score=15.10 Aligned_cols=15 Identities=7% Similarity=0.257 Sum_probs=7.8 Q ss_pred HHHHHHHHCCCCEEE Q ss_conf 999999865998189 Q gi|254780277|r 906 SVHKSLEDHKLATKM 920 (976) Q Consensus 906 ~i~~~~~~~~~~~~~ 920 (976) .-.+.|.+.++++++ T Consensus 522 ~AA~~L~e~gi~vrV 536 (632) T KOG0523 522 EAAELLSEDGIKVRV 536 (632) T ss_pred HHHHHHHHCCCEEEE T ss_conf 999999864963899 No 327 >pfam03753 HHV6-IE Human herpesvirus 6 immediate early protein. The proteins in this family are poorly characterized, but an investigation has indicated that the immediate early protein is required the down-regulation of MHC class I expression in dendritic cells. Human herpesvirus 6 immediate early protein is also referred to as U90. Probab=31.55 E-value=26 Score=13.55 Aligned_cols=187 Identities=18% Similarity=0.205 Sum_probs=80.3 Q ss_pred HHHHHHHH-HHHCCCCCEECCCCC-CCC-CCCCCCCCCCC-CCCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHHHHHH Q ss_conf 01233345-330022210000013-334-33345563322-224302424899489996326544368766510377787 Q gi|254780277|r 694 KRTQRVHT-FYSLASTMTGRLASL-EPN-LQNIPIKTDLG-QKIRRAFISPPTKKLISADYSQIELRILAHIAKITPLYQ 769 (976) Q Consensus 694 ~~dgRIH~-~f~q~~t~TGRlSs~-~PN-lQNiP~r~~~g-~~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~ 769 (976) ..|+-+|- .|+-+ +.-|..-|. -|- .=.||+|+-.| +.||.-|||...|.|.-----++.--..-...+|.. T Consensus 733 sedn~lhlSdydkv-iNNg~C~s~GFPspVFTIPIRSm~GT~gir~KfvPKKnwlWfmrKTHkVdNcpih~s~k~~~--- 808 (993) T pfam03753 733 SEDNELHLSDYDKV-INNGNCESKGFPSPVFTIPIRSMRGTHGIRVKFVPKKNWLWFMRKTHKVDNCPIHSSKKDNN--- 808 (993) T ss_pred CCCCCCCHHHHHHH-HCCCCCCCCCCCCCEEEEEEECCCCCCCCEEEEECCCCEEEEEEHHCCCCCCCCCCCCCCCC--- T ss_conf 53331121116776-42798865798874388762025677874023310544455420003356666655322354--- Q ss_pred HHHCCCCHHHHHHHHHHC---CCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCC---CHHHHHH-----HHHHHHHH Q ss_conf 340698889999989809---996889988886302513564407770356788199---9899999-----99999998 Q gi|254780277|r 770 VFENSLDIHRMVAAEIFG---VNIEKVSPQMRRHAKTINFSIIYGISPFRLATQLRI---PRSEAAD-----YIKRYFHR 838 (976) Q Consensus 770 af~~g~DiH~~tA~~vf~---~~~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~i---s~~eA~~-----~i~~yf~~ 838 (976) ++..|. .-|+..|. +|+..-+.+-|+. |-.-+|- .+ +.-.+ +..+.+. ++|.|.+- T Consensus 809 --K~dsd~--teA~hCF~NHFvPIktdDEeY~ke----NVSYiy~----Ki-Q~skid~~~I~pTK~lit~miMdNFMdl 875 (993) T pfam03753 809 --KEDSDG--TEASHCFNNHFVPIKTDDEEYRKE----NVSYIYP----KI-QESKIDVESITPTKTLITNMIMDNFMDL 875 (993) T ss_pred --CCCCCC--CHHHHHHHCCCCCCCCCHHHHCCC----CCEEECH----HH-HHCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf --335664--134566650412432674776115----5102415----45-4244553444636899999999877769 Q ss_pred CHHHHHHHHHHHHHHHHCCCEECCCCCEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88899999999999998498882778760189875883788877878770134565389999 Q gi|254780277|r 839 FPGIHEYIEKTKNFVRQNGYVETIFGRRIHYDEINSPKSSIRNINERAAINAPIQGSAADIT 900 (976) Q Consensus 839 ~p~v~~~~~~~~~~a~~~gyv~T~~GRrr~~p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi~ 900 (976) ---++.-+.+--+....+=-|-|..--. .+++..|..--.-++-|-+|-++-|--.-|+ T Consensus 876 tdiIK~GI~kHcqdL~~kY~vvt~t~ce---~~ln~~nsq~~~t~~tqvfdp~vtgnNS~IL 934 (993) T pfam03753 876 TDIIKNGINKHCQDLKKKYAVVTPTTCE---KDLNVTNSQTVATTATQVFDPNVTGNNSFIL 934 (993) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCC---CCCCCCCCHHHHHHHHCCCCCCCCCCCCHHH T ss_conf 9999872788999997642005664202---2345543201100001003888777760456 No 328 >pfam04614 Pex19 Pex19 protein family. Probab=31.52 E-value=26 Score=13.54 Aligned_cols=10 Identities=30% Similarity=0.737 Sum_probs=4.3 Q ss_pred HHHHHHHHHH Q ss_conf 3689998765 Q gi|254780277|r 280 EFTKLINRVA 289 (976) Q Consensus 280 ef~~ll~rL~ 289 (976) .|..+++.+. T Consensus 34 qfe~~~kel~ 43 (245) T pfam04614 34 QFEKLLKELG 43 (245) T ss_pred HHHHHHHHHH T ss_conf 9999999997 No 329 >COG0776 HimA Bacterial nucleoid DNA-binding protein [DNA replication, recombination, and repair] Probab=31.47 E-value=28 Score=13.30 Aligned_cols=22 Identities=9% Similarity=0.394 Sum_probs=14.0 Q ss_pred HHHHHCCCCHHHHHHHHHHHHH Q ss_conf 5678819998999999999999 Q gi|254780277|r 816 RLATQLRIPRSEAADYIKRYFH 837 (976) Q Consensus 816 ~La~~l~is~~eA~~~i~~yf~ 837 (976) .+|...+++..+|+.+++.+|+ T Consensus 10 ~ia~~~~l~k~~a~~~v~~~~~ 31 (94) T COG0776 10 AIAEKAGLSKKDAEEAVDAFLE 31 (94) T ss_pred HHHHHHCCCHHHHHHHHHHHHH T ss_conf 9999709879999999999999 No 330 >TIGR01418 PEP_synth phosphoenolpyruvate synthase; InterPro: IPR006319 This family represents phosphoenolpyruvate synthase; also called pyruvate,water dikinase and PEP synthase. The member from Methanococcus jannaschii contains a large intein. This enzyme generates phosphoenolpyruvate (PEP) from pyruvate, hydrolyzing ATP to AMP and releasing inorganic phosphate in the process. The enzyme shows extensive homology to other enzymes that use PEP as substrate or product. This enzyme may provide PEP for gluconeogenesis, for PTS-type carbohydrate transport systems, or for other processes. ; GO: 0008986 pyruvate water dikinase activity, 0006094 gluconeogenesis. Probab=31.23 E-value=28 Score=13.28 Aligned_cols=42 Identities=24% Similarity=0.437 Sum_probs=25.0 Q ss_pred CCCCHHHHHHHHHHH--H-HHCHHHHHHHHHHHHHHHHCCCEECCCC Q ss_conf 199989999999999--9-9888899999999999998498882778 Q gi|254780277|r 821 LRIPRSEAADYIKRY--F-HRFPGIHEYIEKTKNFVRQNGYVETIFG 864 (976) Q Consensus 821 l~is~~eA~~~i~~y--f-~~~p~v~~~~~~~~~~a~~~gyv~T~~G 864 (976) ||+.++ .+++..- | ++-|.|++.++++++-|+++|-...|.| T Consensus 793 LgvDRD--SelVA~lGYFDErn~AVL~li~~~I~aAk~~G~~vgICG 837 (877) T TIGR01418 793 LGVDRD--SELVAHLGYFDERNPAVLRLIEMAIKAAKEHGKKVGICG 837 (877) T ss_pred HHHHCC--CHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEC T ss_conf 420104--314330377757687999999999999987399698706 No 331 >PRK08561 rps15p 30S ribosomal protein S15P; Reviewed Probab=31.04 E-value=29 Score=13.25 Aligned_cols=35 Identities=26% Similarity=0.397 Sum_probs=26.1 Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCC Q ss_conf 99999999740345577776566679987420788755 Q gi|254780277|r 606 LLSLEENIYDLSGEKFNLSSPKQLGDILFTKLKFPSGA 643 (976) Q Consensus 606 l~~le~~i~~~aG~~FNinSpkQL~~vLf~~LgLp~~k 643 (976) -+++++.|.++|.+.. ||-|+|.+|=|..|+|-.| T Consensus 30 ~~eve~~I~klaKkG~---spS~IG~~LRD~~gip~Vk 64 (151) T PRK08561 30 PEEIEELVVELAKQGY---SPSMIGIILRDQYGIPDVK 64 (151) T ss_pred HHHHHHHHHHHHHCCC---CHHHHHCEECCCCCCCCCH T ss_conf 9999999999996799---8878610122325665201 No 332 >TIGR00737 nifR3_yhdG putative TIM-barrel protein, nifR3 family; InterPro: IPR004652 This family represents one branch of COG0042 from COG (Predicted TIM-barrel enzymes, possibly dehydrogenases, nifR3 family) and includes NifR3 itself, from Rhodobacter capsulatus. Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes , and also in some archaea. Most dihydrouridines are found in the D loop of t-RNAs. The role of dihydrouridine in tRNA is currently unknown, but may increase conformational flexibility of the tRNA. ; GO: 0016491 oxidoreductase activity, 0050660 FAD binding, 0008033 tRNA processing. Probab=30.81 E-value=29 Score=13.23 Aligned_cols=10 Identities=10% Similarity=0.375 Sum_probs=4.1 Q ss_pred HHHHCCCHHH Q ss_conf 6440777035 Q gi|254780277|r 807 SIIYGISPFR 816 (976) Q Consensus 807 gi~YG~~~~~ 816 (976) |++-|-|+.| T Consensus 234 Gvm~gRG~lG 243 (336) T TIGR00737 234 GVMIGRGALG 243 (336) T ss_pred EEEEECHHCC T ss_conf 8985002227 No 333 >cd03183 GST_C_Theta GST_C family, Class Theta subfamily; composed of eukaryotic class Theta GSTs and bacterial dichloromethane (DCM) dehalogenase. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Mammalian class Theta GSTs show poor GSH conjugating activity towards the standard substrates, CDNB and ethacrynic acid, differentiating them from other mammalian GSTs. GSTT1-1 shows similar cataytic activity as bacterial DCM dehalogenase, catalyzing the GSH-dependent hydrolytic dehalogenatio Probab=30.59 E-value=25 Score=13.72 Aligned_cols=27 Identities=19% Similarity=0.310 Sum_probs=19.3 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 431005899998643436899999999 Q gi|254780277|r 586 TGIQIDQDLLKQVSAEISKNLLSLEEN 612 (976) Q Consensus 586 ~Gi~vD~~~L~~l~~~l~~~l~~le~~ 612 (976) .|-+.|.+.+++..+.+..-++.+|+. T Consensus 34 ~~~~~~~~~v~~~~~~l~~~l~~le~~ 60 (126) T cd03183 34 GGKPVSPEKVKKAEENLEESLDLLENY 60 (126) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 489999999999999999999999998 No 334 >PRK10664 transcriptional regulator HU subunit beta; Provisional Probab=30.29 E-value=29 Score=13.17 Aligned_cols=22 Identities=14% Similarity=0.256 Sum_probs=17.1 Q ss_pred HHHHHCCCCHHHHHHHHHHHHH Q ss_conf 5678819998999999999999 Q gi|254780277|r 816 RLATQLRIPRSEAADYIKRYFH 837 (976) Q Consensus 816 ~La~~l~is~~eA~~~i~~yf~ 837 (976) .||+++|+|..+|+.+++.|++ T Consensus 9 ~ia~~~~ltk~~a~~~v~~~~~ 30 (90) T PRK10664 9 KIAAGADISKAAAGRALDAIIA 30 (90) T ss_pred HHHHHCCCCHHHHHHHHHHHHH T ss_conf 9998709999999999999999 No 335 >pfam07367 FB_lectin Fungal fruit body lectin. This family consists of several fungal fruit body lectin proteins. Fruit body lectins are thought to have insecticidal activity and may also function in capturing nematodes. Probab=30.04 E-value=19 Score=14.60 Aligned_cols=45 Identities=22% Similarity=0.363 Sum_probs=26.1 Q ss_pred HHHHHHHCCCCCEECCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEE Q ss_conf 334533002221000001333433345563322224302424899489 Q gi|254780277|r 698 RVHTFYSLASTMTGRLASLEPNLQNIPIKTDLGQKIRRAFISPPTKKL 745 (976) Q Consensus 698 RIH~~f~q~~t~TGRlSs~~PNlQNiP~r~~~g~~iR~~F~a~~g~~l 745 (976) ++||.|=- .-||+++..-.+-..-+++..||.|+--|...+|+.| T Consensus 90 ~~~peYY~---ng~R~~~r~~q~s~~~~~~~~Gr~v~v~y~~~eGn~L 134 (139) T pfam07367 90 KLHPEYYN---NGGRLSARWAQLSSYEVKTSKGRTVKVKYTVAEGNNL 134 (139) T ss_pred EECHHHCC---CCHHHHHHHHHHHHEEEECCCCCEEEEEEEECCCCEE T ss_conf 84840417---8316778875442315534689789999998548708 No 336 >pfam08972 DUF1902 Domain of unknown function (DUF1902). Members of this family of prokaryotic proteins adopt a fold consisting of one alpha-helix and four beta-strands. Their function has not, as yet, been elucidated. Probab=30.01 E-value=30 Score=13.13 Aligned_cols=16 Identities=31% Similarity=0.453 Sum_probs=8.3 Q ss_pred HHHHCCCHHHHHHHHH Q ss_conf 9841367899986454 Q gi|254780277|r 207 LLQEYGNLENILINAS 222 (976) Q Consensus 207 ll~~~gsle~i~~~~d 222 (976) |..|=+|+|.+++.+- T Consensus 24 LvtEA~Tle~L~~Kl~ 39 (54) T pfam08972 24 LVTEADTLEELLAKLR 39 (54) T ss_pred HHHCCCHHHHHHHHHH T ss_conf 4325632999999999 No 337 >PRK13900 type IV secretion system ATPase VirB11; Provisional Probab=29.94 E-value=16 Score=15.05 Aligned_cols=43 Identities=16% Similarity=0.257 Sum_probs=29.8 Q ss_pred CCCEEEECCCCCEEEEEECCCCCEEEEEEEEEHH--HHHHHHHCCCC-----HHHHHHHHHH Q ss_conf 2430242489948999632654436876651037--77873406988-----8999998980 Q gi|254780277|r 732 KIRRAFISPPTKKLISADYSQIELRILAHIAKIT--PLYQVFENSLD-----IHRMVAAEIF 786 (976) Q Consensus 732 ~iR~~F~a~~g~~lv~aDySQiElRilA~ls~D~--~l~~af~~g~D-----iH~~tA~~vf 786 (976) -+|.|.--+|++++|+ |+| |++ .|++|.+.|.+ ||+..|...+ T Consensus 227 Ll~~aLR~rPDRIivG------EvR------G~EA~~~l~A~nTGH~Gs~tTiHA~sa~~a~ 276 (332) T PRK13900 227 LIEACLRLRPDRIIVG------ELR------GAEAFSFLRAINTGHPGSISTLHADSPAMAI 276 (332) T ss_pred HHHHHHCCCCCEEEEC------CCC------HHHHHHHHHHHHCCCCCCEECCCCCCHHHHH T ss_conf 9999756899758445------557------1999999999976999711462789999999 No 338 >TIGR01703 hybrid_clust hydroxylamine reductase; InterPro: IPR010048 Hybrid cluster proteins (HCP, or Prismane) have been identified in bacteria, archaea and eukaryotic protozoa. No specific function has yet been assigned to these proteins, but it may involve oxidoreductase enzymatic activity. These proteins contain one 4Fe-4S cluster, and one hybrid 4Fe-2O-2S cluster, the latter being similar to the Ni-Fe-S cluster found in carbon monoxide dehydrogenase enzymes (IPR010047 from INTERPRO) , . This subfamily is heterogeneous with respect to the presence or absence of a region of about 100 amino acids not far from the N terminus of the protein. Members have been described as monomeric. ; GO: 0016661 oxidoreductase activity acting on other nitrogenous compounds as donors, 0051536 iron-sulfur cluster binding, 0006118 electron transport, 0005737 cytoplasm. Probab=29.90 E-value=18 Score=14.70 Aligned_cols=83 Identities=16% Similarity=0.313 Sum_probs=50.1 Q ss_pred CCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHH-----HHHHCCC-HHHCC-----HHHHHHHHHHHHHHH Q ss_conf 994899963265443687665103777873406988899999-----8980999-68899-----888863025135644 Q gi|254780277|r 741 PTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVA-----AEIFGVN-IEKVS-----PQMRRHAKTINFSII 809 (976) Q Consensus 741 ~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA-----~~vf~~~-~~~vt-----~~~R~~aK~infgi~ 809 (976) --|.|=..||.-|| |=|+|+++=+-+ |-|+..- +++|+.. +.+-. +=.=|+|=+|-+.++ T Consensus 450 GKYRfNk~dlGdIe--------GiPRllD~GQCN-D~YSai~~AlkL~e~f~~~DvN~LPLs~~LSW~EQKA~~ILL~LL 520 (567) T TIGR01703 450 GKYRFNKLDLGDIE--------GIPRLLDLGQCN-DAYSAIVIALKLAEVFGCDDVNELPLSIVLSWYEQKAVAILLALL 520 (567) T ss_pred HHHHHHHHHCCCCC--------CCHHHHCCCCCH-HHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 21223112137778--------744341445650-699999999999987288983633589998778899999999998 Q ss_pred HCCCHHHHHHHCC------CCHHHHHHHHHHH Q ss_conf 0777035678819------9989999999999 Q gi|254780277|r 810 YGISPFRLATQLR------IPRSEAADYIKRY 835 (976) Q Consensus 810 YG~~~~~La~~l~------is~~eA~~~i~~y 835 (976) | .|-.+. .+| .|..-++-+.++| T Consensus 521 ~-LG~KnI--~~GP~lP~FLtp~v~~~L~~nF 549 (567) T TIGR01703 521 Y-LGVKNI--YIGPTLPAFLTPNVVKILVDNF 549 (567) T ss_pred H-CCCCCC--EECCCCCCCCCHHHHHHHHHHH T ss_conf 4-576761--0078781112668999999971 No 339 >cd01748 GATase1_IGP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). IGPS incorporates ammonia derived from glutamine into N1-[(5'-phosphoribulosyl)-formimino]-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR) to form 5'-(5-aminoimidazole-4-carboxamide) ribonucleotide (AICAR) and imidazole glycerol phosphate (IGP). The glutamine amidotransferase domain generates the ammonia nucleophile which is channeled from the glutaminase active site to the PRFAR active site. IGPS belong to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site. Probab=29.78 E-value=30 Score=13.11 Aligned_cols=11 Identities=36% Similarity=0.655 Sum_probs=4.3 Q ss_pred CCCEEEEEECC Q ss_conf 99489996326 Q gi|254780277|r 741 PTKKLISADYS 751 (976) Q Consensus 741 ~g~~lv~aDyS 751 (976) +.+++-.++|. T Consensus 154 ~~~~~a~~~~~ 164 (198) T cd01748 154 PDYILATTDYG 164 (198) T ss_pred CCEEEEEECCC T ss_conf 75068897599 No 340 >PRK12402 replication factor C small subunit 2; Reviewed Probab=29.68 E-value=30 Score=13.16 Aligned_cols=21 Identities=33% Similarity=0.480 Sum_probs=17.9 Q ss_pred CCCCC-----CCCCCCCHHHHHHHHH Q ss_conf 35899-----8776360249989841 Q gi|254780277|r 190 IDNIP-----GIPGIGYKTAALLLQE 210 (976) Q Consensus 190 sDnip-----Gv~GiG~KtA~~ll~~ 210 (976) +.++| |-||+|+.|++..+.+ T Consensus 33 ~~~~phlLf~GPpG~GKTt~A~~lA~ 58 (337) T PRK12402 33 SGNLPHLVVYGPSGSGKTAAVRALAR 58 (337) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHH T ss_conf 79987698889298489999999999 No 341 >COG1168 MalY Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism] Probab=29.64 E-value=25 Score=13.70 Aligned_cols=67 Identities=15% Similarity=0.198 Sum_probs=43.7 Q ss_pred EECCCCCEEEEEECCCCCEEEEEEEEEHHHHHHHHHCCCCHHHHHHHHHHCCCHHHCCHHHHHHHH-HHHHHHHHCCCHH Q ss_conf 424899489996326544368766510377787340698889999989809996889988886302-5135644077703 Q gi|254780277|r 737 FISPPTKKLISADYSQIELRILAHIAKITPLYQVFENSLDIHRMVAAEIFGVNIEKVSPQMRRHAK-TINFSIIYGISPF 815 (976) Q Consensus 737 F~a~~g~~lv~aDySQiElRilA~ls~D~~l~~af~~g~DiH~~tA~~vf~~~~~~vt~~~R~~aK-~infgi~YG~~~~ 815 (976) +..++|..|+=.|++-+-+ |. ++...-.+..|| ++|.|..||-+.. T Consensus 313 v~~p~gTYL~WLD~r~l~l--------~d-------------------------~~l~~~ll~~akVal~~G~~FG~~g~ 359 (388) T COG1168 313 VTEPQGTYLAWLDCRELGL--------DD-------------------------SELAEFLLEEAKVALSPGSTFGEEGS 359 (388) T ss_pred EECCCCCEEEEEECCCCCC--------CH-------------------------HHHHHHHHHHHCEECCCCCCCCCCCC T ss_conf 8668875666532002389--------85-------------------------78999999751571357875676787 Q ss_pred HHHH-HCCCCHHHHHHHHHHHH Q ss_conf 5678-81999899999999999 Q gi|254780277|r 816 RLAT-QLRIPRSEAADYIKRYF 836 (976) Q Consensus 816 ~La~-~l~is~~eA~~~i~~yf 836 (976) |..+ .+++|++--++-++|-- T Consensus 360 gf~RlN~acpr~~L~eal~ri~ 381 (388) T COG1168 360 GFVRLNFACPRAILEEALERLK 381 (388) T ss_pred CEEEEECCCCHHHHHHHHHHHH T ss_conf 4588863798899999999999 No 342 >TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321 These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport. Probab=29.55 E-value=13 Score=15.75 Aligned_cols=10 Identities=30% Similarity=0.551 Sum_probs=7.4 Q ss_pred HHHCCCC-CCC Q ss_conf 7420788-755 Q gi|254780277|r 634 FTKLKFP-SGA 643 (976) Q Consensus 634 f~~LgLp-~~k 643 (976) |++|||| ++- T Consensus 109 fe~LGLP~~v~ 119 (350) T TIGR01420 109 FEELGLPRPVL 119 (350) T ss_pred HHHCCCCHHHH T ss_conf 66637987899 No 343 >PRK05179 rpsM 30S ribosomal protein S13; Validated Probab=29.31 E-value=27 Score=13.50 Aligned_cols=12 Identities=42% Similarity=0.506 Sum_probs=3.9 Q ss_pred CCCCCHHHHHHH Q ss_conf 763602499898 Q gi|254780277|r 197 PGIGYKTAALLL 208 (976) Q Consensus 197 ~GiG~KtA~~ll 208 (976) .|||+++|..++ T Consensus 23 ~GIG~~~A~~Ic 34 (122) T PRK05179 23 YGIGRTRAKEIL 34 (122) T ss_pred CCCCHHHHHHHH T ss_conf 275899999999 No 344 >pfam00809 Pterin_bind Pterin binding enzyme. This family includes a variety of pterin binding enzymes that all adopt a TIM barrel fold. The family includes dihydropteroate synthase EC:2.5.1.15 as well as a group methyltransferase enzymes including methyltetrahydrofolate, corrinoid iron-sulfur protein methyltransferase (MeTr) that catalyses a key step in the Wood-Ljungdahl pathway of carbon dioxide fixation. It transfers the N5-methyl group from methyltetrahydrofolate (CH3-H4folate) to a cob(I)amide centre in another protein, the corrinoid iron-sulfur protein. MeTr is a member of a family of proteins that includes methionine synthase and methanogenic enzymes that activate the methyl group of methyltetra-hydromethano(or -sarcino)pterin. Probab=29.20 E-value=31 Score=13.04 Aligned_cols=15 Identities=7% Similarity=0.153 Sum_probs=5.5 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 998643436899999 Q gi|254780277|r 595 LKQVSAEISKNLLSL 609 (976) Q Consensus 595 L~~l~~~l~~~l~~l 609 (976) +.++...|..+++.+ T Consensus 140 ~~~i~~~~~~~i~~~ 154 (208) T pfam00809 140 VEELLRFLTERLEAA 154 (208) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999999999 No 345 >pfam02022 Integrase_Zn Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catalytic domain pfam00665. The carboxyl terminal domain is a DNA binding domain pfam00552. Probab=29.12 E-value=31 Score=13.03 Aligned_cols=25 Identities=24% Similarity=0.368 Sum_probs=20.1 Q ss_pred HCCCHHHHHHHCCCCHHHHHHHHHH Q ss_conf 0777035678819998999999999 Q gi|254780277|r 810 YGISPFRLATQLRIPRSEAADYIKR 834 (976) Q Consensus 810 YG~~~~~La~~l~is~~eA~~~i~~ 834 (976) |=..+..|+.+.|+|..+|++++.. T Consensus 8 ~H~n~~~L~~~F~ip~~vAr~IV~~ 32 (40) T pfam02022 8 YHSNAKALRHKFGIPRLVAKEIVNS 32 (40) T ss_pred HCCCHHHHHHHHCCCHHHHHHHHHH T ss_conf 8569999998929899999999997 No 346 >TIGR02624 rhamnu_1P_ald rhamnulose-1-phosphate aldolase; InterPro: IPR013447 Proteins in this entry match the enzyme RhaD, rhamnulose-1-phosphate aldolase (4.1.2.19 from EC).; GO: 0008994 rhamnulose-1-phosphate aldolase activity. Probab=29.01 E-value=22 Score=14.10 Aligned_cols=34 Identities=15% Similarity=0.331 Sum_probs=14.7 Q ss_pred CCCCHHHHHHHHHHHHHHHHH--HHHHHHHCCCCCC Q ss_conf 100589999864343689999--9999974034557 Q gi|254780277|r 588 IQIDQDLLKQVSAEISKNLLS--LEENIYDLSGEKF 621 (976) Q Consensus 588 i~vD~~~L~~l~~~l~~~l~~--le~~i~~~aG~~F 621 (976) .+++.+....+..++.+-++. +...+.++||.-| T Consensus 35 l~L~~eEv~~Y~~~~~q~~R~Ipl~fd~~~LAg~yF 70 (273) T TIGR02624 35 LRLKEEEVEPYLDDFKQVLRKIPLKFDAPELAGKYF 70 (273) T ss_pred ECCCHHHHHHHHCCCCCCEEEEECCCCHHHHCCCCE T ss_conf 015767898861462411036731468688568706 No 347 >pfam00735 Septin Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerize to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis. Probab=28.94 E-value=31 Score=13.01 Aligned_cols=38 Identities=8% Similarity=0.288 Sum_probs=18.6 Q ss_pred CHHHHHHHHCCCCCCCCCCHHHHH--HHHCCCCCCCCHHHHH Q ss_conf 123788986188666421002345--5431687667724667 Q gi|254780277|r 470 KYDKLVLHRYGISMRGFEDIMLMS--YVLDSGRSSHDIASIA 509 (976) Q Consensus 470 KfD~~~L~~~GI~l~~i~DTmlaa--yLLdp~~~~~~L~~la 509 (976) +|+...-....+.=..+.|+.+-+ |++.|. +|+|..+- T Consensus 92 qf~~yl~eE~~i~R~~~~D~RVH~cLYFI~Pt--gh~L~~lD 131 (280) T pfam00735 92 QFEQYLRDESRLNRTSIVDNRVHCCLYFISPT--GHGLKPLD 131 (280) T ss_pred HHHHHHHHHHCCCCCCCCCCCEEEEEEEECCC--CCCCCHHH T ss_conf 99999998647566777888437999975688--88886889 No 348 >PRK09490 metH B12-dependent methionine synthase; Provisional Probab=28.73 E-value=31 Score=12.98 Aligned_cols=27 Identities=11% Similarity=0.107 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHHH--CCCCEEEECCE-EHH Q ss_conf 8648999999997--09979956890-166 Q gi|254780277|r 92 LPQLPLVRLATQA--FGIPAIEIQGF-EAD 118 (976) Q Consensus 92 ~~q~~~i~~~l~~--~gi~~~~~~~~-EAD 118 (976) +.-+--++++++. ..+|++..--+ |.+ T Consensus 194 KAAv~A~~e~~~~~~~~lPv~vs~T~~e~~ 223 (1229) T PRK09490 194 KAAIFAVEEVFEELGVRLPVMISGTITDAS 223 (1229) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEEEECCC T ss_conf 999999999998519998589999998799 No 349 >KOG3218 consensus Probab=28.69 E-value=29 Score=13.23 Aligned_cols=20 Identities=25% Similarity=0.398 Sum_probs=9.2 Q ss_pred CCCCCCCCHHHHHHHHHCCC Q ss_conf 98776360249989841367 Q gi|254780277|r 194 PGIPGIGYKTAALLLQEYGN 213 (976) Q Consensus 194 pGv~GiG~KtA~~ll~~~gs 213 (976) ||=|-||-||-..+..++.+ T Consensus 75 ~~~~kvgvk~~k~~~~~~~~ 94 (208) T KOG3218 75 PEEPKVGVKTMKTYVIQMQS 94 (208) T ss_pred CCCCCCCHHHHHHHHHHHHH T ss_conf 89984257899999999874 No 350 >COG4332 Uncharacterized protein conserved in bacteria [Function unknown] Probab=28.31 E-value=11 Score=16.37 Aligned_cols=14 Identities=0% Similarity=-0.209 Sum_probs=7.1 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 33332221111124 Q gi|254780277|r 538 VQEHAIENSNVILQ 551 (976) Q Consensus 538 ~~~Yaa~Da~~t~~ 551 (976) ..-|-|-+++.||. T Consensus 47 WlIYkC~~Cd~tWN 60 (203) T COG4332 47 WLIYKCTHCDYTWN 60 (203) T ss_pred EEEEEEECCCCCCC T ss_conf 99998504677256 No 351 >pfam04396 consensus Probab=28.31 E-value=32 Score=12.93 Aligned_cols=62 Identities=18% Similarity=0.178 Sum_probs=38.0 Q ss_pred HHHHHHHCCCCEEEECC---EEHHH-HHHHHHHHHHHCC--CCEEEECCC-CHHHEECC-------CCEEEEECCC Q ss_conf 99999970997995689---01665-8999999998489--949998687-01212237-------9759996678 Q gi|254780277|r 98 VRLATQAFGIPAIEIQG---FEADD-IIATYTYIAEKEG--FAVTIISTD-KDLMQLVS-------PTTCLYDTVK 159 (976) Q Consensus 98 i~~~l~~~gi~~~~~~~---~EADD-viatla~~~~~~~--~~v~i~s~D-kD~~Qlv~-------~~v~~~~~~~ 159 (976) +++.|.+.||.+...+. .-||+ ++..+-..+.+.. ..++++||| +||.-.++ -||.+..|.+ T Consensus 55 ~~~~L~stGI~l~hvp~g~k~aad~~m~~d~~~~a~~np~Pati~LISgd~~dfa~~l~~L~~~r~Y~vlLa~p~~ 130 (149) T pfam04396 55 ILRALSSTGISLAHVPAGAKDARDKKMLVDILLWALDNPPPANLMLISGDDRDFASALHDLLRKRGYNILLAQPEK 130 (149) T ss_pred HHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 9999997698688578877534788999999999970999707999957707679999999871285599974897 No 352 >cd00477 FTHFS Formyltetrahydrofolate synthetase (FTHFS) catalyzes the ATP-dependent activation of formate ion via its addition to the N10 position of tetrahydrofolate. FTHFS is a highly expressed key enzyme in both the Wood-Ljungdahl pathway of autotrophic CO2 fixation (acetogenesis) and the glycine synthase/reductase pathways of purinolysis. The key physiological role of this enzyme in acetogens is to catalyze the formylation of tetrahydrofolate, an initial step in the reduction of carbon dioxide and other one-carbon precursors to acetate. In purinolytic organisms, the enzymatic reaction is reversed, liberating formate from 10-formyltetrahydrofolate with concurrent production of ATP. Probab=28.25 E-value=32 Score=12.93 Aligned_cols=35 Identities=17% Similarity=0.414 Sum_probs=16.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 3689999999997403455777765666799874207 Q gi|254780277|r 602 ISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLK 638 (976) Q Consensus 602 l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~Lg 638 (976) +.++++.+.++||...|-+|....-+||.+ |+++| T Consensus 428 l~eKI~~IA~~IYgA~~V~~s~~A~~~L~~--~e~~G 462 (524) T cd00477 428 LEDKIETIAKKIYGADGVELSPKAKKKLAR--YEKQG 462 (524) T ss_pred HHHHHHHHHHHCCCCCCEEECHHHHHHHHH--HHHCC T ss_conf 999999999973598858789999999999--99769 No 353 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=28.19 E-value=22 Score=14.06 Aligned_cols=23 Identities=43% Similarity=0.618 Sum_probs=17.7 Q ss_pred CCEEEECCCCCEEEEEECCCCCE Q ss_conf 43024248994899963265443 Q gi|254780277|r 733 IRRAFISPPTKKLISADYSQIEL 755 (976) Q Consensus 733 iR~~F~a~~g~~lv~aDySQiEl 755 (976) ||+.=.-|....|+-.||||+|| T Consensus 339 i~~~~~~P~~~rL~igd~S~v~L 361 (361) T TIGR01208 339 IKGNRRRPKALRLIIGDYSQVEL 361 (361) T ss_pred EECCCCCCCCCCCEEECCCEEEC T ss_conf 84375784102214705405729 No 354 >cd01080 NAD_bind_m-THF_DH_Cyclohyd NADP binding domain of methylene-tetrahydrofolate dehydrogenase/cyclohydrolase. NADP binding domain of the Methylene-Tetrahydrofolate Dehydrogenase/cyclohydrolase (m-THF DH/cyclohydrolase) bifunctional enzyme. Tetrahydrofolate is a versatile carrier of activated one-carbon units. The major one-carbon folate donors are N-5 methyltetrahydrofolate, N5,N10-m-THF, and N10-formayltetrahydrofolate. The oxidation of metabolic intermediate m-THF to m-THF requires the enzyme m-THF DH. In addition, most DHs also have an associated cyclohydrolase activity which catalyzes its hydrolysis to N10-formyltetrahydrofolate. m-THF DH is typically found as part of a multifunctional protein in eukaryotes. NADP-dependent m-THF DH in mammals, birds and yeast are components of a trifunctional enzyme with DH, cyclohydrolase, and synthetase activities. Certain eukaryotic cells also contain homodimeric bifunctional DH/cyclodrolase form. In bacteria, monofucntional DH, as well a Probab=28.12 E-value=32 Score=12.91 Aligned_cols=14 Identities=43% Similarity=0.918 Sum_probs=7.9 Q ss_pred CCC-CCCHHHHHHHH Q ss_conf 776-36024998984 Q gi|254780277|r 196 IPG-IGYKTAALLLQ 209 (976) Q Consensus 196 v~G-iG~KtA~~ll~ 209 (976) ||| |||-|-+.|+. T Consensus 147 VPGGVGpmTva~L~~ 161 (168) T cd01080 147 VPGGVGPMTVAMLMK 161 (168) T ss_pred CCCCCCHHHHHHHHH T ss_conf 999608999999999 No 355 >PRK07194 fliG flagellar motor switch protein G; Reviewed Probab=28.09 E-value=32 Score=12.91 Aligned_cols=21 Identities=14% Similarity=0.295 Sum_probs=8.2 Q ss_pred HHHHHHHHCCCCH-HHHHHHHH Q ss_conf 7887765304762-79999999 Q gi|254780277|r 654 AQDLEQINCGDNP-IIKNILEW 674 (976) Q Consensus 654 ~~~Le~L~~~~hp-~~~~iLey 674 (976) .+.|-.+...+|| ++..||-| T Consensus 117 p~~La~~L~~EhPQtiAlILs~ 138 (334) T PRK07194 117 PQQLARLIANEHLQMQAVFLAF 138 (334) T ss_pred HHHHHHHHHCCCHHHHHHHHHH T ss_conf 9999999873485799999980 No 356 >PTZ00205 DNA polymerase kappa; Provisional Probab=27.84 E-value=32 Score=12.88 Aligned_cols=41 Identities=7% Similarity=0.013 Sum_probs=16.9 Q ss_pred HHHHHHHCCCCCCC-CCCHHHHHHHHCCCCCCCCHHHHHHHHCCC Q ss_conf 37889861886664-210023455431687667724667762244 Q gi|254780277|r 472 DKLVLHRYGISMRG-FEDIMLMSYVLDSGRSSHDIASIAQKWLSY 515 (976) Q Consensus 472 D~~~L~~~GI~l~~-i~DTmlaayLLdp~~~~~~L~~la~~yL~~ 515 (976) --+.|...||...+ +++-...-+.|-. ...+..+..-|||. T Consensus 321 te~~L~alGI~TCgDiy~~R~~L~~lf~---ekt~~Fll~~~lGl 362 (571) T PTZ00205 321 TEALLKGLGITTLSDIYNRRVELCYILH---NNLFRFLLGASIGI 362 (571) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHCC T ss_conf 8999987597509999986779999843---89999999987552 No 357 >TIGR01083 nth endonuclease III; InterPro: IPR005759 The spectrum of DNA damage caused by reactive oxygen species includes a wide variety of modifications of purine and pyrimidine bases. Among these modified bases, 7,8-dihydro-8-oxoguanine (8-oxoG) is an important mutagenic lesion. Base excision repair is a critical mechanism for preventing mutations by removing the oxidative lesion from the DNA. Escherichia coli Nth protein (endonuclease III) has an 8-oxoG DNA glycosylase/AP lyase activity which removes 8-oxoG preferentially from 8-oxoG/G mispairs. Human hNTH1 protein, a homolog of E. coli Nth protein, has similar DNA glycosylase/AP lyase activity that removes 8-oxoG from 8-oxoG/G mispairs . ; GO: 0003906 DNA-(apurinic or apyrimidinic site) lyase activity, 0006284 base-excision repair, 0005622 intracellular. Probab=27.70 E-value=22 Score=14.04 Aligned_cols=17 Identities=35% Similarity=0.755 Sum_probs=10.4 Q ss_pred HHHHHHHHHHHCCCCC-CCCCC Q ss_conf 5666799874207887-55566 Q gi|254780277|r 626 PKQLGDILFTKLKFPS-GAKTK 646 (976) Q Consensus 626 pkQL~~vLf~~LgLp~-~kKTk 646 (976) |..+.+++ .||. |+||. T Consensus 103 P~~~~eL~----~LPGVGRKTA 120 (192) T TIGR01083 103 PEDREELV----KLPGVGRKTA 120 (192) T ss_pred CCCHHHHH----CCCCCCCHHH T ss_conf 75537661----7899871145 No 358 >TIGR01286 nifK nitrogenase molybdenum-iron protein beta chain; InterPro: IPR005976 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene , . Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes . It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I .; GO: 0016163 nitrogenase activity, 0009399 nitrogen fixation, 0016612 molybdenum-iron nitrogenase complex. Probab=27.62 E-value=33 Score=12.85 Aligned_cols=24 Identities=13% Similarity=0.189 Sum_probs=14.1 Q ss_pred CEEECCHHHHHHHHHHHHHCCCCC Q ss_conf 035479999999999998668523 Q gi|254780277|r 368 YTKIVNTKDIQQWVQKLETIGSAS 391 (976) Q Consensus 368 y~~I~~~~~L~~li~~L~~~~~ia 391 (976) +.-=.-.++|.++++..++.+.+- T Consensus 152 CMAEVIGdDL~AFi~nAk~~g~ip 175 (526) T TIGR01286 152 CMAEVIGDDLRAFIGNAKEEGVIP 175 (526) T ss_pred CCCHHHHHHHHHHHHHHHCCCCCC T ss_conf 440110135899998753068888 No 359 >PRK01395 V-type ATP synthase subunit F; Provisional Probab=27.59 E-value=18 Score=14.70 Aligned_cols=53 Identities=19% Similarity=0.333 Sum_probs=31.1 Q ss_pred EEEHHHHHHHHHC-CCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHH Q ss_conf 5103777873406-988899999898099968899888863025135644077703567881999899999999 Q gi|254780277|r 761 IAKITPLYQVFEN-SLDIHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIK 833 (976) Q Consensus 761 ls~D~~l~~af~~-g~DiH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~ 833 (976) .-||+..+-.|+. |.|.+-. .+.++..+..|+.|+. +||++| +|..-|+.+-+ T Consensus 8 VvGd~DsvlgF~~lG~dvfpv-------~~~eEa~~~l~~La~~-~y~II~------------ITE~~a~~i~~ 61 (104) T PRK01395 8 VIGDKDSILPFKALGFDVFPV-------IEEQEARNTLRKLAME-DYAIIY------------ITEDIAKDIPE 61 (104) T ss_pred EECCHHHHHHHHHCCEEEEEC-------CCHHHHHHHHHHHHHC-CEEEEE------------EEHHHHHHHHH T ss_conf 986857878888658047854-------8879999999999867-927999------------72999845699 No 360 >PRK03609 umuC DNA polymerase V subunit UmuC; Reviewed Probab=27.45 E-value=21 Score=14.29 Aligned_cols=17 Identities=18% Similarity=0.270 Sum_probs=7.8 Q ss_pred HHHHHHHHHHHHHHCCC Q ss_conf 99999999999998498 Q gi|254780277|r 842 IHEYIEKTKNFVRQNGY 858 (976) Q Consensus 842 v~~~~~~~~~~a~~~gy 858 (976) +..+-+++.+..|++|+ T Consensus 267 l~~~~~~va~rLR~~~~ 283 (422) T PRK03609 267 ICSYAARAAEKLRGEHQ 283 (422) T ss_pred HHHHHHHHHHHHHHHCC T ss_conf 99999999999998087 No 361 >COG2759 MIS1 Formyltetrahydrofolate synthetase [Nucleotide transport and metabolism] Probab=27.34 E-value=31 Score=12.96 Aligned_cols=36 Identities=28% Similarity=0.517 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCC Q ss_conf 36899999999974034557777656667998742078 Q gi|254780277|r 602 ISKNLLSLEENIYDLSGEKFNLSSPKQLGDILFTKLKF 639 (976) Q Consensus 602 l~~~l~~le~~i~~~aG~~FNinSpkQL~~vLf~~LgL 639 (976) ++.+++.+.+++|.-.|-+|...--+||.. |++.|+ T Consensus 442 i~~Ki~~I~~~iYga~~v~~s~~A~~ql~~--~~~~g~ 477 (554) T COG2759 442 IEEKIEKIAKEIYGADGVEFSPKAKEQLKT--FEKQGF 477 (554) T ss_pred HHHHHHHHHHHHCCCCCEEECHHHHHHHHH--HHHHCC T ss_conf 899999999985187634568999999999--998188 No 362 >pfam05503 Pox_G7 Poxvirus G7-like. Probab=27.27 E-value=33 Score=12.81 Aligned_cols=34 Identities=12% Similarity=0.020 Sum_probs=18.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 322211111245677632221000134578860750123 Q gi|254780277|r 541 HAIENSNVILQLWLLLRPKLIVEKLLHVYERLDKPMIDV 579 (976) Q Consensus 541 Yaa~Da~~t~~L~~~l~~~L~~~~l~~l~~~iE~Pl~~v 579 (976) |+|.-.+ -|+||..|..--+. .. |+..-||.+.+ T Consensus 84 yV~n~~e-F~rLynsL~RfTH~---~S-FF~tC~pTI~a 117 (363) T pfam05503 84 YVCNSTE-FWRLYNSLYRFTHC---KS-FFNTCMPTIVA 117 (363) T ss_pred EECCHHH-HHHHHHHHHHHHHH---HH-HHHHHHHHHHH T ss_conf 2258389-99999999887502---87-99880899999 No 363 >KOG0334 consensus Probab=27.23 E-value=33 Score=12.80 Aligned_cols=10 Identities=20% Similarity=0.547 Sum_probs=5.5 Q ss_pred HHHCHHHHHH Q ss_conf 9988889999 Q gi|254780277|r 836 FHRFPGIHEY 845 (976) Q Consensus 836 f~~~p~v~~~ 845 (976) +..||.-.+| T Consensus 904 inD~Pq~~r~ 913 (997) T KOG0334 904 INDFPQNARW 913 (997) T ss_pred CCCCCHHCCE T ss_conf 1356143362 No 364 >PRK08599 coproporphyrinogen III oxidase; Provisional Probab=27.03 E-value=33 Score=12.78 Aligned_cols=19 Identities=16% Similarity=-0.015 Sum_probs=10.6 Q ss_pred HHHHHHHCCCEECCCCCEE Q ss_conf 9999998498882778760 Q gi|254780277|r 849 TKNFVRQNGYVETIFGRRI 867 (976) Q Consensus 849 ~~~~a~~~gyv~T~~GRrr 867 (976) ..+...+.|+++---+|.+ T Consensus 339 ~l~~l~~~Gll~~~~~~i~ 357 (377) T PRK08599 339 TIQELQEQGLLEEDDDFVR 357 (377) T ss_pred HHHHHHHCCCEEECCCEEE T ss_conf 9999997798998099999 No 365 >COG3083 Predicted hydrolase of alkaline phosphatase superfamily [General function prediction only] Probab=27.02 E-value=33 Score=12.78 Aligned_cols=42 Identities=24% Similarity=0.369 Sum_probs=19.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCC-HHHHCCHHHHCCCC Q ss_conf 69999999999999848554798799997274887-13240865216679 Q gi|254780277|r 37 AIAGFCNMLWKLLQNSRKESIASHFAVIFDYPAVT-FRNEIYPDYKANRP 85 (976) Q Consensus 37 ai~gf~~~l~~~i~~~~~~~~~~~~~v~fD~~~~t-fR~~~~~~YKa~R~ 85 (976) --|-|+|.+.-++-..+ -+-.+|-+... =|-=+|+.|=+++. T Consensus 20 hwfa~fNi~~aivlG~r-------Yl~~~dwp~tl~g~lY~~~s~lGhFs 62 (600) T COG3083 20 HWFAFFNILLAIVLGSR-------YLFIMDWPSTLAGRLYLYVSILGHFS 62 (600) T ss_pred HHHHHHHHHHHHHHCCC-------CEEECCCCHHHHHHHHHHHHHHHHHH T ss_conf 99999999999984773-------00441571776789999999999999 No 366 >pfam03221 Transposase_Tc5 Tc5 transposase DNA-binding domain. Probab=26.77 E-value=34 Score=12.75 Aligned_cols=36 Identities=19% Similarity=0.454 Sum_probs=24.6 Q ss_pred HHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 860750123322101431005899998643436899 Q gi|254780277|r 571 RLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNL 606 (976) Q Consensus 571 ~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l 606 (976) ++|-.|..=+.++...|++|+.+.+++...++..++ T Consensus 3 ~le~~L~~W~~~~~~~~~~vs~~~i~~kA~~~~~~l 38 (62) T pfam03221 3 DLEKALYEWIRQARAQGIPISGPIIREKAKRFAREL 38 (62) T ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 889999999999998799998899999999999983 No 367 >COG1737 RpiR Transcriptional regulators [Transcription] Probab=26.75 E-value=34 Score=12.74 Aligned_cols=58 Identities=19% Similarity=0.323 Sum_probs=32.1 Q ss_pred HHHHHHHHHHHHH--HHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCHHHHC-CCCCCHHHHHHHHHHHHHHHHH Q ss_conf 8999864542001--1223220035789997655432035642247635521-2676768899865323368999 Q gi|254780277|r 214 LENILINASRIKQ--KKRRENILEYAETARLSRKLVMLRTDVPITIPLEHLI-LEDCNGPRLISFLKALEFTKLI 285 (976) Q Consensus 214 le~i~~~~d~i~~--~k~~~~l~~~~e~a~ls~~L~tL~~Dvpl~~~~~~l~-~~~~d~~~l~~~~~~lef~~ll 285 (976) ++-|.++.+++.. .++.+-|.+|.+.+.. .++.++. ........+..|+++|||..+- T Consensus 6 ~~~I~~~~~~Lt~~er~iA~yil~~~~~~~~--------------~si~elA~~a~VS~aTv~Rf~~kLGf~Gf~ 66 (281) T COG1737 6 LERIRERYDSLTKSERKIADYILANPDEVAL--------------LSIAELAERAGVSPATVVRFARKLGFEGFS 66 (281) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHCHHHHHH--------------CCHHHHHHHHCCCHHHHHHHHHHCCCCCHH T ss_conf 9999998852599999999999939678856--------------149999988199888999999985999889 No 368 >cd01966 Nitrogenase_NifN_1 Nitrogenase_nifN1: A subgroup of the NifN subunit of the NifEN complex: NifN forms an alpha2beta2 tetramer with NifE. NifN and nifE are structurally homologous to nitrogenase MoFe protein beta and alpha subunits respectively. NifEN participates in the synthesis of the iron-molybdenum cofactor (FeMoco) of the MoFe protein. NifB-co (an iron and sulfur containing precursor of the FeMoco) from NifB is transferred to the NifEN complex where it is further processed to FeMoco. The nifEN bound precursor of FeMoco has been identified as a molybdenum-free, iron- and sulfur- containing analog of FeMoco. It has been suggested that this nifEN bound precursor also acts as a cofactor precursor in nitrogenase systems which require a cofactor other than FeMoco: i.e. iron-vanadium cofactor (FeVco) or iron only cofactor (FeFeco). Probab=26.71 E-value=34 Score=12.74 Aligned_cols=45 Identities=16% Similarity=0.127 Sum_probs=17.0 Q ss_pred HHHHHHHCCCCCCCC-CCCCCCCCCHHHHHHHHC-CCCHHHHHHHHHH Q ss_conf 799874207887555-665453531788776530-4762799999998 Q gi|254780277|r 630 GDILFTKLKFPSGAK-TKTGQWKTTAQDLEQINC-GDNPIIKNILEWR 675 (976) Q Consensus 630 ~~vLf~~LgLp~~kK-TktG~~ST~~~~Le~L~~-~~hp~~~~iLeyR 675 (976) ++.|=++.|.|...- ..-|--.|| +.|+.+.. ...|++..+.+.| T Consensus 238 a~~L~~~~giP~~~~~~p~G~~~Td-~fl~~l~~~~G~~vpe~~~~er 284 (417) T cd01966 238 AEALEERTGVPYYVFPSLTGLEAVD-ALIATLAKLSGRPVPEKIRRQR 284 (417) T ss_pred HHHHHHHHCCCEEECCCCCCHHHHH-HHHHHHHHHHCCCCCHHHHHHH T ss_conf 9999999799838307861558789-9999999982899849999999 No 369 >PRK06298 type III secretion system protein; Validated Probab=26.69 E-value=34 Score=12.74 Aligned_cols=25 Identities=24% Similarity=0.321 Sum_probs=14.6 Q ss_pred EEECCCCCEEEEEECCCCC---EEEEEE Q ss_conf 2424899489996326544---368766 Q gi|254780277|r 736 AFISPPTKKLISADYSQIE---LRILAH 760 (976) Q Consensus 736 ~F~a~~g~~lv~aDySQiE---lRilA~ 760 (976) ..+..|-|.=|...|..=+ .+|+|- T Consensus 259 vVitNPTH~AVAL~Y~~~~~~aP~vvAK 286 (360) T PRK06298 259 AVVSNPKDIAVAIGYMPEKYKAPWIIAM 286 (360) T ss_pred EEEECCCEEEEEEEECCCCCCCCEEEEE T ss_conf 8998886178886666899999988885 No 370 >TIGR01591 Fdh-alpha formate dehydrogenase, alpha subunit; InterPro: IPR006478 This group of sequences describe a subset of formate dehydrogenase alpha chains found mainly in the archaea but also in alpha and gamma proteobacteria and a small number of Gram-positive bacteria. The alpha chain contains domains for molybdopterin dinucleotide binding and molybdopterin oxidoreductase. The holo-enzyme also contains beta and gamma subunits. The enzyme catalyzes the oxidation of formate (produced from pyruvate during anaerobic growth) to carbon dioxide with the concomitant release of two electrons and two protons. The enzyme's purpose is to allow growth on formate in some circumstances and, in the case of FdhH in gamma proteobacteria, to pass electrons to hydrogenase (by which process acid is neutralized) . The alpha subunit of a version of nitrate reductase is closely related.; GO: 0008863 formate dehydrogenase activity, 0015942 formate metabolic process, 0009326 formate dehydrogenase complex. Probab=26.53 E-value=15 Score=15.26 Aligned_cols=54 Identities=20% Similarity=0.179 Sum_probs=28.4 Q ss_pred HHHHHHCCCCCCCCHHHHHHHH-------HHHHHHHHHHHHHHHHCCCCC---CCCCCHHHHHHH Q ss_conf 2332210143100589999864-------343689999999997403455---777765666799 Q gi|254780277|r 578 DVVSQMEITGIQIDQDLLKQVS-------AEISKNLLSLEENIYDLSGEK---FNLSSPKQLGDI 632 (976) Q Consensus 578 ~vLa~ME~~Gi~vD~~~L~~l~-------~~l~~~l~~le~~i~~~aG~~---FNinSpkQL~~v 632 (976) |.=+-.|.-|..++.|+==++. -+...+++-+..-+.++ |-+ ||-++|+|+=+. T Consensus 448 Paa~w~EkeGtftN~eRriq~~~Ka~eP~gesk~DWeI~~~lA~~l-G~~gemf~Y~~~~~i~~E 511 (694) T TIGR01591 448 PAAAWAEKEGTFTNAERRIQRFFKAVEPKGESKPDWEIIQELANAL-GLDGEMFNYNHPKEIFEE 511 (694) T ss_pred CCCCCCCCCCEEEECCCEEEEEEEEECCCCCCCHHHHHHHHHHHHC-CCCCCCCCCCCHHHHHHH T ss_conf 4422113476277057617778875356746542389999999972-878875677888899999 No 371 >KOG0923 consensus Probab=26.42 E-value=34 Score=12.70 Aligned_cols=47 Identities=21% Similarity=0.218 Sum_probs=31.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC--CCCCHHHHHHHHHH Q ss_conf 31005899998643436899999999974034557--77765666799874 Q gi|254780277|r 587 GIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKF--NLSSPKQLGDILFT 635 (976) Q Consensus 587 Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~F--NinSpkQL~~vLf~ 635 (976) ++..-.+........+...+..|..+|.++.=.++ |+-|.-| .-.|+ T Consensus 478 VFltGQeEIEt~~e~l~~~~~~LGski~eliv~PiYaNLPselQ--akIFe 526 (902) T KOG0923 478 VFLTGQEEIETVKENLKERCRRLGSKIRELIVLPIYANLPSELQ--AKIFE 526 (902) T ss_pred EEECCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHH--HHHCC T ss_conf 99446789999999999999985336550798400046986888--76358 No 372 >KOG4302 consensus Probab=26.41 E-value=34 Score=12.70 Aligned_cols=89 Identities=17% Similarity=0.154 Sum_probs=50.4 Q ss_pred HHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 99988848988787777875187435899877636024998984136789998645420011223220035789997655 Q gi|254780277|r 166 EHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQKKRRENILEYAETARLSRK 245 (976) Q Consensus 166 ~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~~k~~~~l~~~~e~a~ls~~ 245 (976) +......++--+.+.++...+||.++..+ +-+|.-..|..++..|.-.++.+-..+..+.++--.=......+|-. T Consensus 64 ~~L~~~ia~~eael~~l~s~l~~~~~~~~----~~~k~e~tLke~l~~l~~~le~lr~qk~eR~~ef~el~~qie~l~~~ 139 (660) T KOG4302 64 ARLLQEIAVIEAELNDLCSALGEPSIIGE----ISDKIEGTLKEQLESLKPYLEGLRKQKDERRAEFKELYHQIEKLCEE 139 (660) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCC----CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999998477522244----32446765999999999989999999999999999999999999999 Q ss_pred HHHCCCCCCCCCCH Q ss_conf 43203564224763 Q gi|254780277|r 246 LVMLRTDVPITIPL 259 (976) Q Consensus 246 L~tL~~Dvpl~~~~ 259 (976) |.-- .++|..+.. T Consensus 140 l~g~-~~~~~~~~~ 152 (660) T KOG4302 140 LGGP-EDLPSFLIA 152 (660) T ss_pred HCCC-CCCCCCCCC T ss_conf 6687-557763336 No 373 >pfam01268 FTHFS Formate--tetrahydrofolate ligase. Probab=26.30 E-value=34 Score=12.69 Aligned_cols=28 Identities=25% Similarity=0.489 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 6899999999974034557777656667 Q gi|254780277|r 603 SKNLLSLEENIYDLSGEKFNLSSPKQLG 630 (976) Q Consensus 603 ~~~l~~le~~i~~~aG~~FNinSpkQL~ 630 (976) .++++.+.++||...|-+|....-+||. T Consensus 444 ~eKIe~IA~~IYGA~~V~~s~~A~~~L~ 471 (555) T pfam01268 444 EEKIETIATEIYGADGVEFSPKAKKQLA 471 (555) T ss_pred HHHHHHHHHHCCCCCCEEECHHHHHHHH T ss_conf 9999999997158885778999999999 No 374 >KOG4379 consensus Probab=26.11 E-value=35 Score=12.66 Aligned_cols=18 Identities=28% Similarity=0.218 Sum_probs=10.7 Q ss_pred HHHHHCCCHHHHHHHHHH Q ss_conf 898413678999864542 Q gi|254780277|r 206 LLLQEYGNLENILINASR 223 (976) Q Consensus 206 ~ll~~~gsle~i~~~~d~ 223 (976) -|++|=.+++.+..+.++ T Consensus 182 ~l~~E~~~i~~~~~~ve~ 199 (596) T KOG4379 182 DLLQERKQISLVAGHVER 199 (596) T ss_pred HHHCCCCHHHHHHHHHHH T ss_conf 654134125567766666 No 375 >TIGR01487 SPP-like SPP-like hydrolase, Archaeal; InterPro: IPR006382 This group of Archaeal sequences is most closely related to the sucrose-phosphate phosphatases (SPP) from plants and cyanobacteria. If it should be shown that the proteins in this family possess sucrose-phosphatase activity then SPP would represent a broad family of functionally related proteins.. Probab=25.99 E-value=31 Score=13.01 Aligned_cols=17 Identities=18% Similarity=0.372 Sum_probs=7.6 Q ss_pred CCHHHHHHHHCCCCCCCC Q ss_conf 772466776224442233 Q gi|254780277|r 503 HDIASIAQKWLSYTRKEI 520 (976) Q Consensus 503 ~~L~~la~~yL~~~~~e~ 520 (976) .+++-++++ ||...+++ T Consensus 158 ~gv~~~~El-lGlk~~e~ 174 (223) T TIGR01487 158 VGVKKLKEL-LGLKKEEI 174 (223) T ss_pred HHHHHHHHH-HCCCCCCE T ss_conf 899999986-08881018 No 376 >PRK00106 hypothetical protein; Provisional Probab=25.59 E-value=35 Score=12.66 Aligned_cols=14 Identities=29% Similarity=0.102 Sum_probs=8.2 Q ss_pred HHHHCCCCHHHHHH Q ss_conf 67881999899999 Q gi|254780277|r 817 LATQLRIPRSEAAD 830 (976) Q Consensus 817 La~~l~is~~eA~~ 830 (976) +|.+||....-|++ T Consensus 365 mA~ElG~d~~~AkR 378 (535) T PRK00106 365 LAGELGENVALARR 378 (535) T ss_pred HHHHHCCCHHHHHH T ss_conf 99983999999999 No 377 >PRK13505 formate--tetrahydrofolate ligase; Provisional Probab=25.51 E-value=35 Score=12.59 Aligned_cols=29 Identities=24% Similarity=0.463 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 68999999999740345577776566679 Q gi|254780277|r 603 SKNLLSLEENIYDLSGEKFNLSSPKQLGD 631 (976) Q Consensus 603 ~~~l~~le~~i~~~aG~~FNinSpkQL~~ 631 (976) .++++.+.++||...|-+|...--+||.. T Consensus 445 ~eKIe~IA~~IYgA~~V~~s~~A~~qL~~ 473 (556) T PRK13505 445 EEKIEKIATKIYGAKGVEFSPKAKKQLKE 473 (556) T ss_pred HHHHHHHHHHCCCCCCEEECHHHHHHHHH T ss_conf 99999999873688867789999999999 No 378 >cd00424 Pol_Y Y-family of DNA polymerases. Pol_Y's can transverse replication-blocking DNA lesions, such as cyclobutane pyrimidine dimers resulting from UV damage, at the cost of an elevated error rate. The Y-family has no 3'-5' exonuclease activity. In addition to possessing a topology akin to a right hand, with "thumb", "fingers" and "palm" motifs, like polymerases from the A-, B-, C- and X-families, the Y-family has a unique "little finger" motif. Expression of Y-family polymerases is often induced by DNA damage. These polymerases are phylogenetically unrelated to classical DNA polymerases. Probab=25.39 E-value=30 Score=13.10 Aligned_cols=27 Identities=19% Similarity=0.347 Sum_probs=14.7 Q ss_pred HHHHHHHHHCCCCC--CCCCCHHHHHHHH Q ss_conf 99999997403455--7777656667998 Q gi|254780277|r 607 LSLEENIYDLSGEK--FNLSSPKQLGDIL 633 (976) Q Consensus 607 ~~le~~i~~~aG~~--FNinSpkQL~~vL 633 (976) .++.+.|++..|-+ +-+++.|.|+++. T Consensus 120 ~~ir~~I~~~~gl~~SiGIa~nk~lAKlA 148 (341) T cd00424 120 LEIKQIIAEETGITASIGIAANKLLAKLA 148 (341) T ss_pred HHHHHHHHHHHCCCEEEEECCCHHHHHHH T ss_conf 99999999984996699646519999999 No 379 >KOG0970 consensus Probab=25.24 E-value=36 Score=12.55 Aligned_cols=97 Identities=28% Similarity=0.331 Sum_probs=47.8 Q ss_pred CCCCCHHHHCC-CCHHHHHHHH----CCCC---------------------------CCC--CCCHHHHHHHHCCCCCCC Q ss_conf 22210011003-2123788986----1886---------------------------664--210023455431687667 Q gi|254780277|r 458 ENEHFLKIGHN-IKYDKLVLHR----YGIS---------------------------MRG--FEDIMLMSYVLDSGRSSH 503 (976) Q Consensus 458 e~~~i~kIgHN-~KfD~~~L~~----~GI~---------------------------l~~--i~DTmlaayLLdp~~~~~ 503 (976) +++++ .|||| ..|++-+|.. ..|. +.| +-|+-+++.=|-. ..++ T Consensus 621 ~dpD~-iVgHn~~~~~l~VLl~R~~~~Kip~WS~IgRLrrS~~~kfg~~s~~~e~~~~aGRl~CD~~~~a~~lik-~~S~ 698 (1429) T KOG0970 621 EDPDV-IVGHNIQGFYLDVLLSRLHALKIPNWSSIGRLRRSWPPKFGRSSSFGEFFIIAGRLMCDLNLAARELIK-AQSY 698 (1429) T ss_pred CCCCE-EEEECCCCCHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHC-CCCC T ss_conf 29987-997141102099999999873486443432221046400177665442110211578505777776451-3566 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 72466776224442233565432221000000123333222111112456776 Q gi|254780277|r 504 DIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLL 556 (976) Q Consensus 504 ~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l 556 (976) +|..|++..|+....++......+..-+..+.-.+......|+.+.+++...+ T Consensus 699 ~LseL~q~~l~~eR~~i~~~~i~~~y~~s~~L~~ll~~~~~d~~~~l~i~~~l 751 (1429) T KOG0970 699 SLSELSQQILKEERKEINANEIPKMYEDSKSLTYLLEHTITDAELILQIMFRL 751 (1429) T ss_pred CHHHHHHHHHHHHCCCCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 59999999986541557875754523670779999998767799999999995 No 380 >pfam09249 tRNA_NucTransf2 tRNA nucleotidyltransferase, second domain. Members of this family adopt a structure consisting of a five helical bundle core. They are predominantly found in Archaeal tRNA nucleotidyltransferase, following the catalytic nucleotidyltransferase domain. Probab=25.21 E-value=36 Score=12.55 Aligned_cols=24 Identities=38% Similarity=0.589 Sum_probs=17.7 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 989841367899986454200112 Q gi|254780277|r 205 ALLLQEYGNLENILINASRIKQKK 228 (976) Q Consensus 205 ~~ll~~~gsle~i~~~~d~i~~~k 228 (976) --|+-+|||.+++++++.++++++ T Consensus 31 ElLii~YGsF~~~lk~a~~wk~~~ 54 (114) T pfam09249 31 ELLVIHYGGFENVLKAASKWRPGV 54 (114) T ss_pred HHHHHHHCCHHHHHHHHHHCCCCE T ss_conf 999999745999999998428985 No 381 >KOG1282 consensus Probab=25.20 E-value=36 Score=12.55 Aligned_cols=119 Identities=19% Similarity=0.259 Sum_probs=55.5 Q ss_pred CCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCCC Q ss_conf 10012442010257766544345502313744333456500178899997444420222100110032123788986188 Q gi|254780277|r 402 FHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYGI 481 (976) Q Consensus 402 ~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~GI 481 (976) +.+.++|+.+|......+ |. .+.....++....|+.||+. .+..-+| =++-.|= T Consensus 122 fLd~PvGvGFSYs~~~~~--------~~----------~~D~~~A~d~~~FL~~wf~k--fPey~~~------~fyI~GE 175 (454) T KOG1282 122 FLDQPVGVGFSYSNTSSD--------YK----------TGDDGTAKDNYEFLQKWFEK--FPEYKSN------DFYIAGE 175 (454) T ss_pred EEECCCCCCCCCCCCCCC--------CC----------CCCHHHHHHHHHHHHHHHHH--CHHHCCC------CEEEECC T ss_conf 972687677633478776--------76----------78488899999999999996--9543079------8689614 Q ss_pred CCCCCCCHHHHHHHHCCCCC----CCCHHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 66642100234554316876----67724667762244422335654322210000001233332221111124567763 Q gi|254780277|r 482 SMRGFEDIMLMSYVLDSGRS----SHDIASIAQKWLSYTRKEILKSRKSSIPIDKISDSQVQEHAIENSNVILQLWLLLR 557 (976) Q Consensus 482 ~l~~i~DTmlaayLLdp~~~----~~~L~~la~~yL~~~~~e~~~~~k~~~~~~~i~~e~~~~Yaa~Da~~t~~L~~~l~ 557 (976) ...|.+=++||..+++.+.. ..+|+.++ +|++- .-..........|+..-+.+.=.+|+.++ T Consensus 176 SYAG~YVP~La~~I~~~N~~~~~~~iNLkG~~-----------IGNg~---td~~~~~~~~~~~a~~h~liSde~~~~l~ 241 (454) T KOG1282 176 SYAGHYVPALAQEILKGNKKCCKPNINLKGYA-----------IGNGL---TDPEIDYNGRIPFAWGHGLISDELYESLK 241 (454) T ss_pred CCCCEEHHHHHHHHHHCCCCCCCCCCCCEEEE-----------ECCCC---CCCCCCCCCHHHHHHHCCCCCHHHHHHHH T ss_conf 65664409999999855644668841405789-----------53755---67000324123234542647899999999 Q ss_pred HHH Q ss_conf 222 Q gi|254780277|r 558 PKL 560 (976) Q Consensus 558 ~~L 560 (976) +.. T Consensus 242 ~~C 244 (454) T KOG1282 242 RAC 244 (454) T ss_pred HHH T ss_conf 985 No 382 >KOG0276 consensus Probab=25.18 E-value=34 Score=12.67 Aligned_cols=11 Identities=9% Similarity=0.039 Sum_probs=5.5 Q ss_pred CCCEEEEECCC Q ss_conf 79759996678 Q gi|254780277|r 149 SPTTCLYDTVK 159 (976) Q Consensus 149 ~~~v~~~~~~~ 159 (976) +..|+||.... T Consensus 162 DrTVKVWslgs 172 (794) T KOG0276 162 DRTVKVWSLGS 172 (794) T ss_pred CCCEEEEECCC T ss_conf 55078887589 No 383 >TIGR02877 spore_yhbH sporulation protein YhbH; InterPro: IPR014230 Proteins in this entry, typified by YhbH from Bacillus subtilis, are found in the genomes of nearly every endospore-forming bacterium, and in no other genomes. The gene in Bacillus subtilis was shown to be a member of the sigma-E regulon, with mutation leading to a sporulation defect .. Probab=25.10 E-value=36 Score=12.54 Aligned_cols=16 Identities=19% Similarity=0.443 Sum_probs=8.2 Q ss_pred CCCEEECCHHHHHHHH Q ss_conf 6503547999999999 Q gi|254780277|r 366 DSYTKIVNTKDIQQWV 381 (976) Q Consensus 366 ~~y~~I~~~~~L~~li 381 (976) +-|..=.+.++|.+++ T Consensus 110 DyyEaEVtleel~e~l 125 (392) T TIGR02877 110 DYYEAEVTLEELEELL 125 (392) T ss_pred CEEECCCCHHHHHHHH T ss_conf 3464156689999999 No 384 >KOG3666 consensus Probab=24.96 E-value=35 Score=12.64 Aligned_cols=61 Identities=16% Similarity=0.177 Sum_probs=34.7 Q ss_pred HHHHHHHHHHHH--HHHHHHHHHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHH-HHHHHHHHHHH Q ss_conf 245677632221--00013457886075012332210143100589999864343-68999999999 Q gi|254780277|r 550 LQLWLLLRPKLI--VEKLLHVYERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEI-SKNLLSLEENI 613 (976) Q Consensus 550 ~~L~~~l~~~L~--~~~l~~l~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l-~~~l~~le~~i 613 (976) ..|...|++-|+ .+.+..+..+|=||...+.-.. --+|..++|+++++-- ..++.++..-| T Consensus 602 ~eL~~~Lr~VLQiVPe~mF~Il~kiI~l~t~~iKeF---pt~iEK~~LrdyaqF~~r~~va~lT~~I 665 (1141) T KOG3666 602 GELVSYLRKVLQIVPESMFTILLKIIKLQTHDIKEF---PTRLEKDKLRDYAQFGPRYEVAKLTHAI 665 (1141) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCHHHHC---CCHHHHHHHHHHHHHCHHHHHHHHHHHH T ss_conf 999999999999650999999999996430211116---6511088888999732078999999999 No 385 >COG0505 CarA Carbamoylphosphate synthase small subunit [Amino acid transport and metabolism / Nucleotide transport and metabolism] Probab=24.69 E-value=29 Score=13.26 Aligned_cols=30 Identities=27% Similarity=0.586 Sum_probs=15.3 Q ss_pred CCCCCCCCCCHHHHHHHHHCCCHHHHHHHH Q ss_conf 899877636024998984136789998645 Q gi|254780277|r 192 NIPGIPGIGYKTAALLLQEYGNLENILINA 221 (976) Q Consensus 192 nipGv~GiG~KtA~~ll~~~gsle~i~~~~ 221 (976) |||||-||--.--++.|.++|++.+.+..- T Consensus 105 gipgI~GIDTRaLtr~iR~~G~m~~~I~~~ 134 (368) T COG0505 105 GIPGIAGIDTRALTRKIREKGAMKGVIATG 134 (368) T ss_pred CCCCEECCCHHHHHHHHHHCCCCCEEEECC T ss_conf 997750551899999998669731376137 No 386 >TIGR02397 dnaX_nterm DNA polymerase III, subunits gamma and tau; InterPro: IPR012763 This entry represents the well-conserved first N-terminal domain of DnaX, approx. 365 aa. The full-length product of the dnaX gene in Escherichia coli encodes the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the extreme thermophile Thermus thermophilis.; GO: 0003887 DNA-directed DNA polymerase activity, 0005524 ATP binding, 0006260 DNA replication, 0009360 DNA polymerase III complex. Probab=24.66 E-value=37 Score=12.48 Aligned_cols=74 Identities=19% Similarity=0.258 Sum_probs=36.3 Q ss_pred ECCHHHHHHHHHHHHHCCC-CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCHHHH Q ss_conf 4799999999999986685-232332224331100124420102577665443455023137443334565001788999 Q gi|254780277|r 371 IVNTKDIQQWVQKLETIGS-ASFKIITDTMDAFHSKPIAIAISTLDQKDDTSFKVETIFIDLSVHTSQDSTGKNTLTKEI 449 (976) Q Consensus 371 I~~~~~L~~li~~L~~~~~-ia~DtETt~ld~~~~~ivGisls~~~~~~~~~~~~~~~yip~~~~~~~d~~~~~~~~~ev 449 (976) -.+...+.++++.|+.=.. +.|=+=||.+.-....| +|=| + - ++ ...+..+++ T Consensus 127 MLS~~AFNALLKTLEEPP~hV~FIlATTE~~KiP~TI--lSRC---Q---------r--F~----------Fk~i~~~~i 180 (363) T TIGR02397 127 MLSKSAFNALLKTLEEPPEHVVFILATTEPHKIPATI--LSRC---Q---------R--FD----------FKRIPLEDI 180 (363) T ss_pred CCCHHHHHHHHHHHCCCCCCEEEEEECCCHHHCCCCC--EEEC---E---------E--EC----------CCCCCHHHH T ss_conf 2865689998765227987628887348711205540--2100---0---------3--12----------678998999 Q ss_pred HHHHHHHHCCCCCHHHHCCCCHHHHHHH Q ss_conf 9744442022210011003212378898 Q gi|254780277|r 450 LSYLKKFFENEHFLKIGHNIKYDKLVLH 477 (976) Q Consensus 450 l~~L~~lle~~~i~kIgHN~KfD~~~L~ 477 (976) .+.|+.+++.+.+ ++|-..|. T Consensus 181 ~~~L~~I~~~E~I-------~~e~~AL~ 201 (363) T TIGR02397 181 VERLKKILDKEGI-------KIEDEALE 201 (363) T ss_pred HHHHHHHHHHHCC-------CCCHHHHH T ss_conf 9999999987088-------31778999 No 387 >smart00411 BHL bacterial (prokaryotic) histone like domain. Probab=24.56 E-value=37 Score=12.47 Aligned_cols=22 Identities=9% Similarity=0.342 Sum_probs=13.9 Q ss_pred HHHHHCCCCHHHHHHHHHHHHH Q ss_conf 5678819998999999999999 Q gi|254780277|r 816 RLATQLRIPRSEAADYIKRYFH 837 (976) Q Consensus 816 ~La~~l~is~~eA~~~i~~yf~ 837 (976) .||+++|+|..+|+.+++.+|+ T Consensus 9 ~ia~~~~~s~~~~~~~v~~~~~ 30 (90) T smart00411 9 AIAEKAGLSKKDAKAAVDAFLE 30 (90) T ss_pred HHHHHHCCCHHHHHHHHHHHHH T ss_conf 9999869699999999999999 No 388 >PRK00024 radC DNA repair protein RadC; Reviewed Probab=24.55 E-value=31 Score=13.05 Aligned_cols=15 Identities=40% Similarity=0.687 Sum_probs=7.2 Q ss_pred CCCHHHHHHHHHHHC Q ss_conf 776566679987420 Q gi|254780277|r 623 LSSPKQLGDILFTKL 637 (976) Q Consensus 623 inSpkQL~~vLf~~L 637 (976) ++||+++.+++-..| T Consensus 103 l~s~~~v~~~~~~~l 117 (224) T PRK00024 103 LTSPEDVADYLQAEL 117 (224) T ss_pred CCCHHHHHHHHHHHH T ss_conf 489999999999986 No 389 >TIGR02506 NrdE_NrdA ribonucleoside-diphosphate reductase, alpha subunit; InterPro: IPR013346 This entry represents the alpha (large) chain of the class I ribonucleotide reductase (RNR). RNR's are responsible for the conversion of the ribose sugar of RNA into the deoxyribose sugar of DNA. This is the rate-limiting step of DNA biosynthesis. Class I RNR's generate the required radical (on tyrosine) via a "non-heme" iron cofactor which resides in the beta (small) subunit. The alpha subunit contains the catalytic and allosteric regulatory sites. The mechanism of this enzyme requires molecular oxygen . E. coli contains two versions of this enzyme which are regulated independently (NrdAB and NrdEF, where NrdA and NrdE are the large chains , ). Most organisms contain only one, but the application of the gene symbols NrdA and NrdE are somewhat arbitrary. This model identifies RNR's in diverse clades of bacteria, eukaryotes as well as numerous DNA viruses and phage.; GO: 0005971 ribonucleoside-diphosphate reductase complex. Probab=24.26 E-value=37 Score=12.43 Aligned_cols=45 Identities=22% Similarity=0.332 Sum_probs=32.9 Q ss_pred CCCHHHHHH---HCCC--CHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCC Q ss_conf 777035678---8199--9899999999999988889999999999999849 Q gi|254780277|r 811 GISPFRLAT---QLRI--PRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNG 857 (976) Q Consensus 811 G~~~~~La~---~l~i--s~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~g 857 (976) |.|+.||+. .+++ +-.||.++.++.|+..- -.-++...+.|+++| T Consensus 420 GiGv~Gl~d~l~~~~~~Y~S~ea~~~~~~ife~~~--Y~a~~~S~~LAKE~G 469 (685) T TIGR02506 420 GIGVMGLHDVLAKLGLPYDSEEARALNDRIFEAIY--YYAIKASAELAKEKG 469 (685) T ss_pred CCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH--HHHHHHHHHHHHHCC T ss_conf 41330179999965889889889999999999999--999999999998678 No 390 >COG2805 PilT Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion] Probab=24.04 E-value=38 Score=12.40 Aligned_cols=23 Identities=26% Similarity=0.271 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 701345653899999999999999986 Q gi|254780277|r 887 AINAPIQGSAADITRRAMISVHKSLED 913 (976) Q Consensus 887 a~N~~iQGsaAdi~k~am~~i~~~~~~ 913 (976) -+.+.||-.+. ..|..+++.+.. T Consensus 303 qi~s~iqtG~~----~GM~T~dq~L~~ 325 (353) T COG2805 303 QIPSLIQTGQQ----LGMQTFDQSLAQ 325 (353) T ss_pred HHHHHHHHHHH----HHHHHHHHHHHH T ss_conf 89999996466----406579999999 No 391 >PTZ00126 tyrosyl-tRNA synthetase; Provisional Probab=23.90 E-value=28 Score=13.29 Aligned_cols=30 Identities=23% Similarity=0.587 Sum_probs=19.2 Q ss_pred EECCCCCCCCCCCCCCC---CCCCCCCCEEEECC Q ss_conf 00000133343334556---33222243024248 Q gi|254780277|r 710 TGRLASLEPNLQNIPIK---TDLGQKIRRAFISP 740 (976) Q Consensus 710 TGRlSs~~PNlQNiP~r---~~~g~~iR~~F~a~ 740 (976) -+-||+|+||-- |=+- .+.-++||+||.|+ T Consensus 271 ~~KMSkSdp~sa-I~m~Ds~e~V~kKIkkAyCpp 303 (399) T PTZ00126 271 QEKMSKSDPNSA-IFMEDSAEEVNRKIKKAYCPP 303 (399) T ss_pred CCCCCCCCCCCE-EEECCCHHHHHHHHHHCCCCC T ss_conf 765666898772-120189999999998456999 No 392 >PRK07394 hypothetical protein; Provisional Probab=23.85 E-value=38 Score=12.37 Aligned_cols=37 Identities=11% Similarity=0.050 Sum_probs=16.1 Q ss_pred CCCCCCHHHHHHHHHHHCCCCCCCCC------CCCCCCCCHHHHHHHH Q ss_conf 57777656667998742078875556------6545353178877653 Q gi|254780277|r 620 KFNLSSPKQLGDILFTKLKFPSGAKT------KTGQWKTTAQDLEQIN 661 (976) Q Consensus 620 ~FNinSpkQL~~vLf~~LgLp~~kKT------ktG~~ST~~~~Le~L~ 661 (976) .||++.. .-++--..|+|+.|.. |+| ||++|+||.|. T Consensus 97 t~NiST~---aa~v~Aa~G~~VaKHGnrsvssk~G--~~saDvLe~lG 139 (342) T PRK07394 97 TAPIYPL---TALLLAAAGQPVVLHGGDRMPTKYG--VPLIELWQGLG 139 (342) T ss_pred CCCCHHH---HHHHHHHCCCEEEEECCCCCCCCCC--CCHHHHHHHCC T ss_conf 6422899---9999996899599888998876567--42799999759 No 393 >pfam00450 Peptidase_S10 Serine carboxypeptidase. Probab=23.42 E-value=39 Score=12.32 Aligned_cols=47 Identities=26% Similarity=0.287 Sum_probs=24.2 Q ss_pred HHHHHHHHHHHHCCCCCHHHHCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHCCCC Q ss_conf 8999974444202221001100321237889861886664210023455431687 Q gi|254780277|r 446 TKEILSYLKKFFENEHFLKIGHNIKYDKLVLHRYGISMRGFEDIMLMSYVLDSGR 500 (976) Q Consensus 446 ~~evl~~L~~lle~~~i~kIgHN~KfD~~~L~~~GI~l~~i~DTmlaayLLdp~~ 500 (976) ..++...|+.||+.- +. .++- + ++-.|-...|.+=+.+|+++++.+. T Consensus 115 A~~~~~fL~~Ff~~f--p~-~~~~--~---~~i~GESYgG~YvP~ia~~i~~~n~ 161 (415) T pfam00450 115 AEDNYEFLQKFFEKF--PE-YKNN--P---FYIAGESYAGHYVPALAQEILDGNK 161 (415) T ss_pred HHHHHHHHHHHHHHC--HH-HCCC--C---EEEEECCCCCEEHHHHHHHHHHHCC T ss_conf 999999999999969--67-5079--6---3898233354338999999997320 No 394 >COG2256 MGS1 ATPase related to the helicase subunit of the Holliday junction resolvase [DNA replication, recombination, and repair] Probab=23.27 E-value=16 Score=15.09 Aligned_cols=60 Identities=27% Similarity=0.257 Sum_probs=31.7 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHH--HHHHHHHCCCHHHHHHHHHHHHCC Q ss_conf 43589987763602499898413678999864542001--122322003578999765543203 Q gi|254780277|r 189 SIDNIPGIPGIGYKTAALLLQEYGNLENILINASRIKQ--KKRRENILEYAETARLSRKLVMLR 250 (976) Q Consensus 189 ~sDnipGv~GiG~KtA~~ll~~~gsle~i~~~~d~i~~--~k~~~~l~~~~e~a~ls~~L~tL~ 250 (976) +|==.=|=||+|+-|-+.+|.++-+.+ +..+.-+.. +-+++.+++.+......|+.+-.. T Consensus 49 ~SmIl~GPPG~GKTTlA~liA~~~~~~--f~~~sAv~~gvkdlr~i~e~a~~~~~~gr~tiLfl 110 (436) T COG2256 49 HSMILWGPPGTGKTTLARLIAGTTNAA--FEALSAVTSGVKDLREIIEEARKNRLLGRRTILFL 110 (436) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHCCC--EEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 605777899988889999998761776--69951523467999999999999872588349987 No 395 >KOG2379 consensus Probab=23.16 E-value=39 Score=12.28 Aligned_cols=24 Identities=8% Similarity=-0.054 Sum_probs=13.3 Q ss_pred CCCEEEECCEEHHHHHHHHHHHHH Q ss_conf 997995689016658999999998 Q gi|254780277|r 106 GIPAIEIQGFEADDIIATYTYIAE 129 (976) Q Consensus 106 gi~~~~~~~~EADDviatla~~~~ 129 (976) -+|.+....-=+++.|+-+...+. T Consensus 55 tl~dl~~ikG~G~~~~~k~~~~~~ 78 (501) T KOG2379 55 TLPDLSSIKGFGKKWIVKLMEAYS 78 (501) T ss_pred CHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 112100105765036888876777 No 396 >PRK09483 response regulator; Provisional Probab=23.15 E-value=39 Score=12.28 Aligned_cols=62 Identities=16% Similarity=0.221 Sum_probs=36.4 Q ss_pred HCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCE Q ss_conf 89988886302513564407770356788199989999999999998888999999999999984988 Q gi|254780277|r 792 KVSPQMRRHAKTINFSIIYGISPFRLATQLRIPRSEAADYIKRYFHRFPGIHEYIEKTKNFVRQNGYV 859 (976) Q Consensus 792 ~vt~~~R~~aK~infgi~YG~~~~~La~~l~is~~eA~~~i~~yf~~~p~v~~~~~~~~~~a~~~gyv 859 (976) ..|+-+++..+. +.-|++-.-.|++|++|..-.+.++.+-|... +|+. ....+..|.++|.+ T Consensus 148 ~LT~RE~eVl~l----l~~G~snkeIA~~L~iS~~TV~~h~~~i~~KL-~v~~-r~el~~~A~~~gli 209 (216) T PRK09483 148 SLSERELQIMLM----ITKGQKVNEISEQLNLSPKTVNSYRYRMFSKL-NISG-DVELTHLAIRHGLL 209 (216) T ss_pred CCCHHHHHHHHH----HHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHH-CCCC-HHHHHHHHHHCCCC T ss_conf 689999999999----98799999999996989999999999999980-9999-99999999995991 No 397 >pfam06368 Met_asp_mut_E Methylaspartate mutase E chain (MutE). This family consists of several methylaspartate mutase E chain proteins (EC:5.4.99.1). Glutamate mutase catalyses the first step in the fermentation of glutamate by Clostridium tetanomorphum. This is an unusual isomerisation in which L-glutamate is converted to threo-beta-methyl L-aspartate. Probab=23.15 E-value=39 Score=12.28 Aligned_cols=79 Identities=9% Similarity=0.080 Sum_probs=36.3 Q ss_pred HHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHCCCH---HHHHHHCCCCHHHHHHHHHHHHHH-CHHHHHHHHHHHHH Q ss_conf 89999989809996889988886302513564407770---356788199989999999999998-88899999999999 Q gi|254780277|r 777 IHRMVAAEIFGVNIEKVSPQMRRHAKTINFSIIYGISP---FRLATQLRIPRSEAADYIKRYFHR-FPGIHEYIEKTKNF 852 (976) Q Consensus 777 iH~~tA~~vf~~~~~~vt~~~R~~aK~infgi~YG~~~---~~La~~l~is~~eA~~~i~~yf~~-~p~v~~~~~~~~~~ 852 (976) +-.-|..+-+|+|..+....-=+.+|++ +.++-++.- .++..+-..-..|++.++|.-|+. -+.+.. -.-. T Consensus 282 vIvKT~~EA~gIPt~eaN~~~l~~t~~~-~~~~~~q~~~~~~~~~~E~~~I~~Ev~aI~d~Vle~g~gDl~~----g~v~ 356 (441) T pfam06368 282 VIVKTPHEASGIPTAEANAAGLRATKQA-LNMLERQKIPMSEEVEIEDALIKEETRAILDKVFELGDGDLAK----GTVR 356 (441) T ss_pred EEECCHHHHCCCCCHHHHHHHHHHHHHH-HHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH----HHHH T ss_conf 9873889975899889999999999999-9986346677740146789999999999999998458983999----9999 Q ss_pred HHHCCCEE Q ss_conf 99849888 Q gi|254780277|r 853 VRQNGYVE 860 (976) Q Consensus 853 a~~~gyv~ 860 (976) |-+.|+.- T Consensus 357 AF~~GvLD 364 (441) T pfam06368 357 AFEAGVLD 364 (441) T ss_pred HHHCCCCC T ss_conf 99758776 No 398 >PRK12285 tryptophanyl-tRNA synthetase; Reviewed Probab=23.02 E-value=31 Score=12.97 Aligned_cols=23 Identities=17% Similarity=0.212 Sum_probs=17.0 Q ss_pred HHHHHHHHCCCCCEECCCCCCCCC Q ss_conf 333453300222100000133343 Q gi|254780277|r 697 QRVHTFYSLASTMTGRLASLEPNL 720 (976) Q Consensus 697 gRIH~~f~q~~t~TGRlSs~~PNl 720 (976) .-||..|-...+- |-||||+||= T Consensus 239 a~ih~~flpgL~G-~KMSsS~p~s 261 (369) T PRK12285 239 ASTYHKFMTGLTG-GKMSSSKPES 261 (369) T ss_pred HHHHCCCCCCCCC-CCCCCCCCCC T ss_conf 7775114567788-9876899985 No 399 >PRK13824 replication initiation protein RepC; Provisional Probab=22.75 E-value=26 Score=13.55 Aligned_cols=16 Identities=38% Similarity=0.596 Sum_probs=7.5 Q ss_pred CCCCCCHHHHHHHHHH Q ss_conf 8758837888778787 Q gi|254780277|r 871 EINSPKSSIRNINERA 886 (976) Q Consensus 871 ~i~s~~~~~r~~aeR~ 886 (976) +|+|..++.|...+|. T Consensus 363 ~I~SPGGYLR~Lt~rA 378 (404) T PRK13824 363 HINSAGGYLRDLTRRA 378 (404) T ss_pred CCCCCCHHHHHHHHHH T ss_conf 2789527999999999 No 400 >KOG2093 consensus Probab=22.68 E-value=40 Score=12.22 Aligned_cols=18 Identities=22% Similarity=0.407 Sum_probs=12.4 Q ss_pred CCCC-CCCCCHHHHHHHHC Q ss_conf 8866-64210023455431 Q gi|254780277|r 480 GISM-RGFEDIMLMSYVLD 497 (976) Q Consensus 480 GI~l-~~i~DTmlaayLLd 497 (976) |-.. .|+.+|||+|+|.. T Consensus 508 gC~aS~Gig~t~LLARlAT 526 (1016) T KOG2093 508 GCPASAGIGGTMLLARLAT 526 (1016) T ss_pred CCCEEECCCHHHHHHHHHH T ss_conf 9862313144799999987 No 401 >PRK08156 surface presentation of antigens protein SpaS; Validated Probab=22.64 E-value=40 Score=12.21 Aligned_cols=16 Identities=19% Similarity=0.382 Sum_probs=10.0 Q ss_pred EECCCCCEEEEEECCC Q ss_conf 4248994899963265 Q gi|254780277|r 737 FISPPTKKLISADYSQ 752 (976) Q Consensus 737 F~a~~g~~lv~aDySQ 752 (976) .|..|-|.=|..-|.. T Consensus 254 VItNPTH~AVALkY~~ 269 (367) T PRK08156 254 IVANPTHIAIGIYFNP 269 (367) T ss_pred EEECCCHHHEEEEECC T ss_conf 9988562204655468 No 402 >TIGR01116 ATPase-IIA1_Ca calcium-translocating P-type ATPase, SERCA-type; InterPro: IPR005782 This model describes Ca2+ ATPases (3.6.3.8 from EC) from eukaryotes and includes significant representatives from plants. Plasma membrane Ca2+ ATPases are primarily responsible for extrusion of calcium from cytoplasmic milieu to outside, coupled to the hydrolyis of ATP. Besides Ca2+ ATPases, Na+/Ca2+ exchangers are also involved in the maintenance of Ca2+ homestasis in the cell. ; GO: 0005388 calcium-transporting ATPase activity, 0006816 calcium ion transport, 0016021 integral to membrane. Probab=22.61 E-value=40 Score=12.21 Aligned_cols=22 Identities=23% Similarity=0.291 Sum_probs=15.8 Q ss_pred CCCCCCHHHHHHHHHHHHHHHH Q ss_conf 1267676889986532336899 Q gi|254780277|r 263 ILEDCNGPRLISFLKALEFTKL 284 (976) Q Consensus 263 ~~~~~d~~~l~~~~~~lef~~l 284 (976) .|++.|.|+..+-+++++.+.. T Consensus 45 VWQE~nAE~AIeALK~~~~~~a 66 (800) T TIGR01116 45 VWQERNAEKAIEALKEYESEHA 66 (800) T ss_pred EEECHHHHHHHHHHHHCCCCCC T ss_conf 1202245689998741263233 No 403 >pfam09091 consensus Probab=22.51 E-value=40 Score=12.19 Aligned_cols=38 Identities=29% Similarity=0.575 Sum_probs=25.7 Q ss_pred HHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 78860750123322101431005899998643436899 Q gi|254780277|r 569 YERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNL 606 (976) Q Consensus 569 ~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l 606 (976) |.+||..|..=+.++...|++|..+.+++-..++..++ T Consensus 7 ~~~le~al~~W~~~~~~~~~pvs~~~i~~kA~~~a~~l 44 (64) T pfam09091 7 YPDLEKALIEWIRQQRAKGIPISGPIIQEKAKEIARQL 44 (64) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 48999999999999998799988899999999999983 No 404 >cd04780 HTH_MerR-like_sg5 Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 5), N-terminal domain. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates. Probab=22.41 E-value=40 Score=12.18 Aligned_cols=28 Identities=14% Similarity=0.355 Sum_probs=22.4 Q ss_pred HHHHCCC-CCCCCCCCCCCCCCHHHHHHH Q ss_conf 8742078-875556654535317887765 Q gi|254780277|r 633 LFTKLKF-PSGAKTKTGQWKTTAQDLEQI 660 (976) Q Consensus 633 Lf~~LgL-p~~kKTktG~~ST~~~~Le~L 660 (976) .|++.|| |++.++..|..--+++.++.| T Consensus 19 yYe~~GLL~p~~R~~~g~r~Y~~~~v~rL 47 (95) T cd04780 19 YYLREGLLPEGRRLAPNQAEYSEAHVERL 47 (95) T ss_pred HHHHCCCCCCCCCCCCCCEECCHHHHHHH T ss_conf 99997899987769998715499999999 No 405 >PRK10070 glycine betaine transporter ATP-binding subunit; Provisional Probab=22.38 E-value=40 Score=12.18 Aligned_cols=20 Identities=15% Similarity=0.200 Sum_probs=10.2 Q ss_pred CCCCHHHHHHHHCCCCHHHHHHH Q ss_conf 35317887765304762799999 Q gi|254780277|r 650 WKTTAQDLEQINCGDNPIIKNIL 672 (976) Q Consensus 650 ~ST~~~~Le~L~~~~hp~~~~iL 672 (976) .-|.++++.. ..||++.... T Consensus 246 ~GtpeeI~~~---Pa~~yV~~F~ 265 (400) T PRK10070 246 VGTPDEILNN---PANDYVRTFF 265 (400) T ss_pred ECCHHHHHHC---CCCHHHHHHH T ss_conf 7288999867---9986899875 No 406 >pfam03796 DnaB_C DnaB-like helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the Escherichia coli chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=22.37 E-value=40 Score=12.17 Aligned_cols=14 Identities=7% Similarity=0.195 Sum_probs=6.5 Q ss_pred HHHHHCCCCHHHHH Q ss_conf 57886075012332 Q gi|254780277|r 568 VYERLDKPMIDVVS 581 (976) Q Consensus 568 l~~~iE~Pl~~vLa 581 (976) +|..+||+--.+.. T Consensus 52 l~~slEm~~~~~~~ 65 (186) T pfam03796 52 LFFSLEMSAEQLAE 65 (186) T ss_pred EEECCCCCHHHHHH T ss_conf 87547552999999 No 407 >pfam04318 DUF468 Protein of unknown function (DUF468). These conserved ORFs probably are probably not translated into protein [Personal communication, Val Wood]. Probab=22.37 E-value=40 Score=12.22 Aligned_cols=39 Identities=33% Similarity=0.399 Sum_probs=26.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCC Q ss_conf 98865320011110012333453300222100000133343334 Q gi|254780277|r 680 IKSTYADSLPNHINKRTQRVHTFYSLASTMTGRLASLEPNLQNI 723 (976) Q Consensus 680 L~sTY~~~l~~~i~~~dgRIH~~f~q~~t~TGRlSs~~PNlQNi 723 (976) -.+||+++|...|. .+.-|||-+|. |-.--+..|+--|| T Consensus 39 alNt~viPllh~il-ttq~i~Ty~ni----t~ksp~kspkHkNi 77 (84) T pfam04318 39 ALNTYVIPLLHHIL-TTQFIYTYVNI----TEKSPSKSPKHKNI 77 (84) T ss_pred HHHHHHHHHHHHHH-HHHHHHEEECC----CCCCCCCCCCCCCE T ss_conf 53542799999999-87753124424----34689889764538 No 408 >PRK04195 replication factor C large subunit; Provisional Probab=22.29 E-value=40 Score=12.16 Aligned_cols=18 Identities=33% Similarity=0.554 Sum_probs=13.8 Q ss_pred CCCCCCCCCHHHHHHHHH Q ss_conf 998776360249989841 Q gi|254780277|r 193 IPGIPGIGYKTAALLLQE 210 (976) Q Consensus 193 ipGv~GiG~KtA~~ll~~ 210 (976) +-|-||||+-|++.+|.+ T Consensus 45 L~GPpGvGKTT~a~~lAk 62 (403) T PRK04195 45 LYGPPGVGKTSLAHALAN 62 (403) T ss_pred EECCCCCCHHHHHHHHHH T ss_conf 889399879999999999 No 409 >TIGR03624 conserved hypothetical protein. Members of this protein family have a phylogenetic distribution skewed toward the Actinobacteria (high GC Gram-positive bacteria), but with a few members occuring in the archaea. The function is unknown. Probab=22.19 E-value=41 Score=12.15 Aligned_cols=16 Identities=25% Similarity=0.401 Sum_probs=9.0 Q ss_pred CCCCCCCHHHHHHHHH Q ss_conf 1431005899998643 Q gi|254780277|r 585 ITGIQIDQDLLKQVSA 600 (976) Q Consensus 585 ~~Gi~vD~~~L~~l~~ 600 (976) ..|+.+|.+.+.+.-. T Consensus 188 ~~~~~~d~~~i~e~~~ 203 (345) T TIGR03624 188 AAGLAIDTSALEERLR 203 (345) T ss_pred HHHCCCCHHHHHHHHH T ss_conf 8745688799999999 No 410 >TIGR01279 DPOR_bchN light-independent protochlorophyllide reductase, N subunit; InterPro: IPR005970 Synonym: dark protochlorophyllide reductase Protochlorophyllide reductase catalyzes the reductive formation of chlorophyllide from protochlorophyllide during biosynthesis of chlorophylls and bacteriochlorophylls. The light-independent (dark) form of protochlorophyllide reductase plays a key role in the ability of gymnosperms, algae, and photosynthetic bacteria to form chlorophyll in the dark. Genetic and sequence analyses have indicated that dark protochlorophyllide reductase consists of three protein subunits that exhibit significant sequence similarity to the three subunits of nitrogenase, which catalyzes the reductive formation of ammonia from dinitrogen. Dark protochlorophyllide reductase activity was shown to be dependent on the presence of all three subunits, ATP, and the reductant dithionite. This enzyme describes the N subunit of the dark form protochlorophyllide reductase, a nitrogenase-like enzyme . ; GO: 0016730 oxidoreductase activity acting on iron-sulfur proteins as donors, 0015995 chlorophyll biosynthetic process, 0019685 photosynthesis dark reaction. Probab=22.05 E-value=30 Score=13.15 Aligned_cols=79 Identities=16% Similarity=0.220 Sum_probs=49.7 Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHCCCCHHHHHH-HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC---- Q ss_conf 45677632221000134578----860750123322-101431005899998643436899999999974034557---- Q gi|254780277|r 551 QLWLLLRPKLIVEKLLHVYE----RLDKPMIDVVSQ-MEITGIQIDQDLLKQVSAEISKNLLSLEENIYDLSGEKF---- 621 (976) Q Consensus 551 ~L~~~l~~~L~~~~l~~l~~----~iE~Pl~~vLa~-ME~~Gi~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~~F---- 621 (976) +.|..|++++.--+-.++|. -+|.||.+-|.. |-..=+-+-..|+.+ +..+.||+-|++.-- ..|-+. T Consensus 298 ~aW~~l~~~~~lLrGkkifF~~D~llElPLARFL~~e~G~~vVE~gtPYi~~--~~~a~ELalL~~~~s-d~GvPlv~~v 374 (458) T TIGR01279 298 QAWRALEPHTQLLRGKKIFFLGDNLLELPLARFLKRECGMEVVEVGTPYIHK--KFHAAELALLEGTDS-DMGVPLVRIV 374 (458) T ss_pred HHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHHHHHCCCEEEECCCCCCCH--HHHHHHHHHCCCCCC-CCCCCCCCCC T ss_conf 9999988888877387578547614566689987875684613016731023--789999985205776-4666665332 Q ss_pred -CCCCHHHHHHH Q ss_conf -77765666799 Q gi|254780277|r 622 -NLSSPKQLGDI 632 (976) Q Consensus 622 -NinSpkQL~~v 632 (976) .+|-.+||..+ T Consensus 375 E~~d~~rQL~ri 386 (458) T TIGR01279 375 EQPDFHRQLQRI 386 (458) T ss_pred CCCCHHHHHHHH T ss_conf 472257899999 No 411 >PRK12704 phosphodiesterase; Provisional Probab=22.05 E-value=41 Score=12.13 Aligned_cols=13 Identities=23% Similarity=0.238 Sum_probs=5.8 Q ss_pred HHHCCCCHHHHHH Q ss_conf 7881999899999 Q gi|254780277|r 818 ATQLRIPRSEAAD 830 (976) Q Consensus 818 a~~l~is~~eA~~ 830 (976) |.+||....-|++ T Consensus 286 A~elg~d~~~Akr 298 (455) T PRK12704 286 AAELGLDEKLAKR 298 (455) T ss_pred HHHHCCCHHHHHH T ss_conf 9982999999999 No 412 >COG4619 ABC-type uncharacterized transport system, ATPase component [General function prediction only] Probab=21.78 E-value=32 Score=12.94 Aligned_cols=30 Identities=27% Similarity=0.256 Sum_probs=12.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 589999864343689999999997403455 Q gi|254780277|r 591 DQDLLKQVSAEISKNLLSLEENIYDLSGEK 620 (976) Q Consensus 591 D~~~L~~l~~~l~~~l~~le~~i~~~aG~~ 620 (976) |+.....+-..+.-.-..|.+.|-++.|.| T Consensus 109 dr~aa~~llar~~l~~~~L~k~it~lSGGE 138 (223) T COG4619 109 DRAAALDLLARFALPDSILTKNITELSGGE 138 (223) T ss_pred CHHHHHHHHHHCCCCHHHHCCHHHHCCCHH T ss_conf 867999999870796466414023316607 No 413 >smart00674 CENPB Putative DNA-binding domain in centromere protein B, mouse jerky and transposases. Probab=21.65 E-value=42 Score=12.07 Aligned_cols=38 Identities=26% Similarity=0.463 Sum_probs=24.9 Q ss_pred HHHHCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 78860750123322101431005899998643436899 Q gi|254780277|r 569 YERLDKPMIDVVSQMEITGIQIDQDLLKQVSAEISKNL 606 (976) Q Consensus 569 ~~~iE~Pl~~vLa~ME~~Gi~vD~~~L~~l~~~l~~~l 606 (976) |.++|--|..=+.++...|++|+.+.+++...++..++ T Consensus 4 ~~~le~~L~~Wi~~~~~~g~~is~~~i~~kA~~~~~~~ 41 (66) T smart00674 4 YALLEKALYEWILRQEALGIPISGEQIREKALEILQRL 41 (66) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 89999999999999999699998899999999999981 No 414 >TIGR01113 mtrE tetrahydromethanopterin S-methyltransferase, subunit E; InterPro: IPR005780 This model describes N5-methyltetrahydromethanopterin: coenzyme M methyltransferase subunit E in methanogenic archaea. This methyltranferase is a membrane-associated enzyme complex that uses methyl-transfer reaction to drive sodium-ion pump.5-methyl-5,6,7,8-tetrahydromethanopterin + 2-mercaptoethanesulphonate = 5,6,7,8-tetrahydromethanopterin + 2-(methylthio)ethanesulphonate. Archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase (encoded by subunit A) is involved in the transfer of 'methyl' group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of methyl-coenzyme M by another enzyme, methyl-coenzyme M reductase.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0006814 sodium ion transport, 0005737 cytoplasm, 0012506 vesicle membrane. Probab=21.63 E-value=42 Score=12.07 Aligned_cols=29 Identities=31% Similarity=0.362 Sum_probs=19.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 434368999999999740345577776566679 Q gi|254780277|r 599 SAEISKNLLSLEENIYDLSGEKFNLSSPKQLGD 631 (976) Q Consensus 599 ~~~l~~~l~~le~~i~~~aG~~FNinSpkQL~~ 631 (976) ...+....+++|.+| |..=||||.-||+- T Consensus 10 ~AtIAGa~EDlESDv----GSQSNPNSQVQLAP 38 (298) T TIGR01113 10 AATIAGAAEDLESDV----GSQSNPNSQVQLAP 38 (298) T ss_pred HHHHHHHHHHHHHCC----CCCCCCCHHHCCCC T ss_conf 999975676555157----76468860212530 No 415 >KOG1759 consensus Probab=21.31 E-value=42 Score=12.03 Aligned_cols=18 Identities=6% Similarity=0.305 Sum_probs=6.1 Q ss_pred CCCHHHHHHHCCCCCCCC Q ss_conf 750123322101431005 Q gi|254780277|r 574 KPMIDVVSQMEITGIQID 591 (976) Q Consensus 574 ~Pl~~vLa~ME~~Gi~vD 591 (976) ||.+++-+..-..++..+ T Consensus 1 MP~l~i~TNv~~~~V~~~ 18 (115) T KOG1759 1 MPVLRIQTNVPVDKVPDG 18 (115) T ss_pred CCEEEEECCCCCCCCCCC T ss_conf 975998546775667833 No 416 >pfam00416 Ribosomal_S13 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes. Probab=21.28 E-value=42 Score=12.02 Aligned_cols=10 Identities=10% Similarity=0.099 Sum_probs=4.4 Q ss_pred HHCCCCCCCC Q ss_conf 4207887555 Q gi|254780277|r 635 TKLKFPSGAK 644 (976) Q Consensus 635 ~~LgLp~~kK 644 (976) .++|+.+.++ T Consensus 33 ~~lgi~~~~k 42 (106) T pfam00416 33 KKAGVDKDKR 42 (106) T ss_pred HHCCCCCCCC T ss_conf 9919597757 No 417 >TIGR02320 PEP_mutase phosphoenolpyruvate phosphomutase; InterPro: IPR012698 Phosphonates are a class of organophosphorus compounds, characterised by a stable C-P bond, which are found in a variety of biologically produced molecules including antiobiotics, lipids, proteins and polysaccharides . The functions of these molecules include phosphorus storage, cell communication, host recognition and chemical warfare. Phosphoenolpyruvate (PEP) phosphomutase catalyses the reversible conversion of PEP to 3-phosphonopyruvate, forming a stable C-P bond, which is the entry point to all known phosphonate biosynthetic pathways . This entry consists of characterised and predicted PEP phosphomutases found in bacterial and eukayotic species. A closely related enzyme, phosphonopyruvate hydrolase from Variovorax sp. Pal2, is excluded from this entry, and the functional identification of some more distantly related sequences, such as from Bacteroides fragilis, Treponema denticola, and Clostridium tetani E88 is unknown. PEP phosphomutase forms a modified TIM barrel fold where the eighth alpha helix adopts a different conformation than in the classical TIM barrel fold . The substrate binds in the central channel of the barrel and is anchored to the active site by the Mg(2+) cofactor . In the absence of substrate the active site is acessible to the solvent, while substrate-binding causes a conformational change where a large loop shields the site from solvent . This shielding appears to be required for catalysis to occur.; GO: 0050188 phosphoenolpyruvate mutase activity. Probab=21.24 E-value=42 Score=12.02 Aligned_cols=48 Identities=19% Similarity=0.362 Sum_probs=20.5 Q ss_pred HHHHHCCCCCC-----CCCCHHHHHHHHHHHCC--CCCCCCCCCCCCCCCHHHHHHH Q ss_conf 99974034557-----77765666799874207--8875556654535317887765 Q gi|254780277|r 611 ENIYDLSGEKF-----NLSSPKQLGDILFTKLK--FPSGAKTKTGQWKTTAQDLEQI 660 (976) Q Consensus 611 ~~i~~~aG~~F-----NinSpkQL~~vLf~~Lg--Lp~~kKTktG~~ST~~~~Le~L 660 (976) .++|..||-+= --+.|.|+.... +.+. +|+. -.||.-|.|..+.-..+ T Consensus 168 A~AYaeAGADAiliHSrk~~p~qI~aF~-~aW~n~~PVv-IvPTkYy~tpvd~fra~ 222 (272) T TIGR02320 168 AEAYAEAGADAILIHSRKADPEQIAAFM-KAWENSLPVV-IVPTKYYKTPVDDFRAL 222 (272) T ss_pred HHHHHHHCHHHHHHHHCCCCHHHHHHHH-HHHCCCCCEE-EECCCCCCCCHHHHHHC T ss_conf 9998851312455631348887999999-9642898778-83476789976778637 No 418 >PRK05702 flhB flagellar biosynthesis protein FlhB; Reviewed Probab=21.21 E-value=42 Score=12.01 Aligned_cols=40 Identities=15% Similarity=0.231 Sum_probs=24.7 Q ss_pred CCCCCCCCCCCCCCCCEEEECCCCCEEEEEECCCCC---EEEEEEEEEHHH Q ss_conf 433345563322224302424899489996326544---368766510377 Q gi|254780277|r 719 NLQNIPIKTDLGQKIRRAFISPPTKKLISADYSQIE---LRILAHIAKITP 766 (976) Q Consensus 719 NlQNiP~r~~~g~~iR~~F~a~~g~~lv~aDySQiE---lRilA~ls~D~~ 766 (976) -+||+|.- ...+..|-|.=|..-|..=+ ++|+|--.++-+ T Consensus 256 m~~~V~~A--------~vVItNPTH~AVAL~Y~~~~~~aP~VvAKG~d~~A 298 (352) T PRK05702 256 MMAAVPKA--------DVVITNPTHYAVALKYDRGKMAAPVVVAKGVDEVA 298 (352) T ss_pred HHHCCCCC--------CEEEECCCCEEEEEEECCCCCCCCEEEEECCCHHH T ss_conf 87338879--------78999998628998878899999988774586999 No 419 >PRK12721 secretion system apparatus protein SsaU; Reviewed Probab=21.13 E-value=43 Score=12.00 Aligned_cols=15 Identities=20% Similarity=0.421 Sum_probs=8.1 Q ss_pred ECCCCCEEEEEECCC Q ss_conf 248994899963265 Q gi|254780277|r 738 ISPPTKKLISADYSQ 752 (976) Q Consensus 738 ~a~~g~~lv~aDySQ 752 (976) +..|-|.-|..-|.. T Consensus 260 itNPTH~AVAL~Y~~ 274 (349) T PRK12721 260 VRNPTHIAVCLGYHP 274 (349) T ss_pred EECCCHHHHHHHCCC T ss_conf 988652764000268 No 420 >PTZ00072 40S ribosomal protein S13; Provisional Probab=21.05 E-value=43 Score=11.99 Aligned_cols=35 Identities=20% Similarity=0.296 Sum_probs=25.7 Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCC Q ss_conf 99999999740345577776566679987420788755 Q gi|254780277|r 606 LLSLEENIYDLSGEKFNLSSPKQLGDILFTKLKFPSGA 643 (976) Q Consensus 606 l~~le~~i~~~aG~~FNinSpkQL~~vLf~~LgLp~~k 643 (976) -+++++.|.++|.+. -||-|+|.+|=|.-|+|-.+ T Consensus 27 ~~eve~~I~klakkG---~t~SqIG~~LRD~~gip~Vk 61 (148) T PTZ00072 27 SRDVVDQICKLAKKG---LTPSQIGVYLRDSMGIPQVK 61 (148) T ss_pred HHHHHHHHHHHHHCC---CCHHHCCCEECCCCCCCCEE T ss_conf 999999999999779---99877120133457755321 No 421 >PRK11886 biotin--protein ligase; Provisional Probab=21.03 E-value=43 Score=11.99 Aligned_cols=15 Identities=20% Similarity=0.206 Sum_probs=8.7 Q ss_pred ECCCCCEEEEEECCCC Q ss_conf 2489948999632654 Q gi|254780277|r 738 ISPPTKKLISADYSQI 753 (976) Q Consensus 738 ~a~~g~~lv~aDySQi 753 (976) +.++|..+|..| .++ T Consensus 291 Id~~G~Liv~~~-~g~ 305 (319) T PRK11886 291 IDEQGALLLETD-GGE 305 (319) T ss_pred ECCCCEEEEEEC-CCE T ss_conf 999980999989-968 No 422 >cd07361 MEMO_like Memo (mediator of ErbB2-driven cell motility) is co-precipitated with the C terminus of ErbB2, a protein involved in cell motility. This subfamily is composed of Memo (mediator of ErbB2-driven cell motility) and similar proteins. Memo is a protein that is co-precipitated with the C terminus of ErbB2, a protein involved in cell motility. It is required for the ErbB2-driven cell mobility and is found in protein complexes with cofilin, ErbB2 and PLCgamma1. However, Memo is not homologous to any known signaling proteins, and its function in ErbB2 signaling is not known. Structural studies show that Memo binds directly to a specific ErbB2-derived phosphopeptide. Memo is homologous to class III nonheme iron-dependent extradiol dioxygenases, however, no metal binding or enzymatic activity can be detected for Memo. This subfamily also contains a few members containing a C-terminal AMMECR1-like domain. The AMMECR1 protein was proposed to be a regulatory factor that is potentia Probab=20.99 E-value=43 Score=11.98 Aligned_cols=15 Identities=0% Similarity=0.275 Sum_probs=7.8 Q ss_pred CCHHHHHHHHHHHHH Q ss_conf 799999999999986 Q gi|254780277|r 372 VNTKDIQQWVQKLET 386 (976) Q Consensus 372 ~~~~~L~~li~~L~~ 386 (976) .+.++|+.+++.... T Consensus 11 ~d~~~L~~~i~~~~~ 25 (266) T cd07361 11 ADPEELRRQLEAFLA 25 (266) T ss_pred CCHHHHHHHHHHHHH T ss_conf 999999999999996 No 423 >smart00670 PINc Large family of predicted nucleotide-binding domains. From similarities to 5'-exonucleases, these domains are predicted to be RNases. PINc domains in nematode SMG-5 and yeast NMD4p are predicted to be involved in RNAi. Probab=20.98 E-value=43 Score=11.98 Aligned_cols=25 Identities=24% Similarity=0.393 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHCCCCEEEECCCCHHHE Q ss_conf 65899999999848994999868701212 Q gi|254780277|r 118 DDIIATYTYIAEKEGFAVTIISTDKDLMQ 146 (976) Q Consensus 118 DDviatla~~~~~~~~~v~i~s~DkD~~Q 146 (976) |..|...|.... .++++|.|+|+.+ T Consensus 87 D~~il~~a~~~~----~~~LvT~D~~l~~ 111 (111) T smart00670 87 DALILATAKELG----NVVLVTNDRDLRR 111 (111) T ss_pred HHHHHHHHHHCC----CCEEEECCHHHCC T ss_conf 799999999749----8789746401049 No 424 >cd03008 TryX_like_RdCVF Tryparedoxin (TryX)-like family, Rod-derived cone viability factor (RdCVF) subfamily; RdCVF is a thioredoxin (TRX)-like protein specifically expressed in photoreceptors. RdCVF was isolated and identified as a factor that supports cone survival in retinal cultures. Cone photoreceptor loss is responsible for the visual handicap resulting from the inherited disease, retinitis pigmentosa. RdCVF shows 33% similarity to TRX but does not exhibit any detectable thiol oxidoreductase activity. Probab=20.95 E-value=37 Score=12.44 Aligned_cols=72 Identities=14% Similarity=0.130 Sum_probs=0.0 Q ss_pred HHHHHHCCCCEEEECCCCHHHEECCCCEEE---EECCCCCEECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 999984899499986870121223797599---96678835389999888489887877778751874358998776360 Q gi|254780277|r 125 TYIAEKEGFAVTIISTDKDLMQLVSPTTCL---YDTVKEEKIDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGY 201 (976) Q Consensus 125 a~~~~~~~~~v~i~s~DkD~~Qlv~~~v~~---~~~~~~~~~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~ 201 (976) +.....+.++|+-||+|+|--+.-.---.+ |....-..-.......+||| -.||-+--|++ T Consensus 58 ~~~~~~~~fEIVfVS~Drse~~f~~y~~~MP~~WlAlPf~d~~~~~L~~~f~V----------------~gIPtLVil~~ 121 (146) T cd03008 58 FYVDRSAQLALVYVSMDQSEQQQESFLKDMPKKWLFLPFEDEFRRELEAQFSV----------------EELPTVVVLKP 121 (146) T ss_pred HHHCCCCCEEEEEEECCCCHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHCCC----------------CCCCEEEEECC T ss_conf 54403787799998368999999999997798726657696899999998599----------------87888999989 Q ss_pred HH------HHHHHHHCC Q ss_conf 24------998984136 Q gi|254780277|r 202 KT------AALLLQEYG 212 (976) Q Consensus 202 Kt------A~~ll~~~g 212 (976) .+ |...+.+|| T Consensus 122 dG~vit~~gr~~I~~~G 138 (146) T cd03008 122 DGDVLAANAVDEILRLG 138 (146) T ss_pred CCCEECHHHHHHHHHHH T ss_conf 99897376689999976 No 425 >PRK00440 rfc replication factor C small subunit; Reviewed Probab=20.90 E-value=43 Score=11.97 Aligned_cols=16 Identities=56% Similarity=0.760 Sum_probs=0.0 Q ss_pred CCCCCCCHHHHHHHHH Q ss_conf 8776360249989841 Q gi|254780277|r 195 GIPGIGYKTAALLLQE 210 (976) Q Consensus 195 Gv~GiG~KtA~~ll~~ 210 (976) |-||+|+.|++..+.. T Consensus 44 GppG~GKTt~a~~la~ 59 (318) T PRK00440 44 GPPGTGKTTAALALAR 59 (318) T ss_pred CCCCCCHHHHHHHHHH T ss_conf 9599889999999999 No 426 >PRK10563 6-phosphogluconate phosphatase; Provisional Probab=20.83 E-value=34 Score=12.74 Aligned_cols=69 Identities=10% Similarity=0.113 Sum_probs=0.0 Q ss_pred HHHCCHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCC Q ss_conf 324086521667987975886489999999970997995689016658999999998489949998687 Q gi|254780277|r 73 RNEIYPDYKANRPKIPEMLLPQLPLVRLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTD 141 (976) Q Consensus 73 R~~~~~~YKa~R~~~p~~l~~q~~~i~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~D 141 (976) ..++...|++.....=+.-.+.+|-+.++|+.+++++....+--...+...|.+..-..-+.-.|+|+| T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~pga~elL~~l~~~~aivS~~~~~~~~~~L~~~gl~~~F~d~ivs~d 136 (221) T PRK10563 68 KAELEHVYRAEVARLFDTELEAIAGANALLESITVPMCVVSNGPVSKMQHSLGKLGMLHYFPDKLFSGY 136 (221) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHCCCCCEEEECCHHHHHHHHHHHCCCHHHCCCEEECHH T ss_conf 999999999999998763587071399999966899789849879999999997598251687897110 No 427 >TIGR02176 pyruv_ox_red pyruvate:ferredoxin (flavodoxin) oxidoreductase; InterPro: IPR011895 The oxidative decarboxylation of pyruvate to acetyl-CoA, a central step in energy metabolism, can occur by two different mechanisms . In mitochondria and aerobic bacteria this reaction is catalysed by the multienzyme complex pyruvate dehydrogenase using NAD as electron acceptor. In anaerobic organisms, however, this reaction is reversibly catalysed by a single enzyme using either ferrodoxin or flavodoxin as the electron acceptor. Pyruvate:ferrodoxin/flavodoxin reductases (PFORs) in this entry occur in both obligately and facultatively anaerobic bacteria and also some eukaryotic microorganisms. These proteins are single-chain enzymes containing a thiamin pyrophosphate cofactor for the cleavage of carbon-carbon bonds next to a carbonyl group, and iron-sulphur clusters for electron transfer. The Desulfovibrio africanus enzyme is currently the only PFOR whose three dimensional structure is known , . It is a homodimer where each subunit contains one thiamin pyrophosphate cofactor and two ferrodoxin-like 4Fe-S clusters and an atypical 4Fe-S cluster. Each monomer is composed of seven domains - domains I, II and VI make intersubunit contacts, while domains III, IV and V are located at the suface of the dimer, and domain VII forms a long arm extending over the other subunit. The cofactor is bound at the interface of domains I and VI and is proximal to the atypical 4Fe-S bound by domain VI, while the ferrodoxin-like 4Fe-S clusters are bound by domain V. Comparison of this enzyme with the multi-chain PFORs shows a correspondance between the domains in this enzyme and the subunits of the multi-chain enzymes.; GO: 0005506 iron ion binding, 0016903 oxidoreductase activity acting on the aldehyde or oxo group of donors, 0006118 electron transport. Probab=20.73 E-value=43 Score=11.94 Aligned_cols=70 Identities=23% Similarity=0.433 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHCHHHH-HHHHHHHHHHHHCCCEECCCCCEECC------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999888899-99999999999849888277876018------987588378887787877013456538999 Q gi|254780277|r 827 EAADYIKRYFHRFPGIH-EYIEKTKNFVRQNGYVETIFGRRIHY------DEINSPKSSIRNINERAAINAPIQGSAADI 899 (976) Q Consensus 827 eA~~~i~~yf~~~p~v~-~~~~~~~~~a~~~gyv~T~~GRrr~~------p~i~s~~~~~r~~aeR~a~N~~iQGsaAdi 899 (976) ++.|-+++|+..-|+|- .||++.- +++||.-+| || |||.=| .=||.||= T Consensus 223 Q~REa~N~YY~a~P~IVq~yM~~~a----------~lTGR~Y~LfdYYGapd-----------AE~vIi---~MGSvaeT 278 (1194) T TIGR02176 223 QGREAVNKYYEAVPEIVQKYMDKIA----------KLTGRSYHLFDYYGAPD-----------AERVII---AMGSVAET 278 (1194) T ss_pred CCCCCCCCHHHCCCHHHHHHHHHHH----------HCCCCCCCCCCCCCCCC-----------CCCEEE---ECCCHHHH T ss_conf 4541437121013358999998876----------30576567523378702-----------031044---05856678 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEEEC Q ss_conf 99999999999986599818999726 Q gi|254780277|r 900 TRRAMISVHKSLEDHKLATKMLLQIH 925 (976) Q Consensus 900 ~k~am~~i~~~~~~~~~~~~~ilqVH 925 (976) + -.+=+.|+.+|-+.=+|- || T Consensus 279 i----eEtvdyL~~kG~KvGllk-Vr 299 (1194) T TIGR02176 279 I----EETVDYLRSKGEKVGLLK-VR 299 (1194) T ss_pred H----HHHHHHHHHCCCEEEEEE-EE T ss_conf 9----999999972893276888-75 No 428 >TIGR03333 salvage_mtnX 2-hydroxy-3-keto-5-methylthiopentenyl-1-phosphate phosphatase. Members of this family are the methionine salvage enzyme MnxX, a member of the HAD-superfamily hydrolases, subfamily IB (see TIGR01488). Members are found in Bacillus subtilis and related species, paired with MtnW (TIGR03332). In most species that recycle methionine from methylthioadenosine, the single protein MtnC replaces the MtnW/MtnX pair. In B. subtilis, mtnX was first known as ykrX. Probab=20.71 E-value=43 Score=11.94 Aligned_cols=71 Identities=14% Similarity=0.117 Sum_probs=0.0 Q ss_pred CCCHHHHCCHHHHCCCCCCCHHHHHHHHHH-----HHHHHHCCCCEEEECCEEHHHHHHHHHHHHHHCCCCEEEECCCCH Q ss_conf 887132408652166798797588648999-----999997099799568901665899999999848994999868701 Q gi|254780277|r 69 AVTFRNEIYPDYKANRPKIPEMLLPQLPLV-----RLATQAFGIPAIEIQGFEADDIIATYTYIAEKEGFAVTIISTDKD 143 (976) Q Consensus 69 ~~tfR~~~~~~YKa~R~~~p~~l~~q~~~i-----~~~l~~~gi~~~~~~~~EADDviatla~~~~~~~~~v~i~s~DkD 143 (976) +|--|.++.+++++++-..-+.+..|+..+ .++++.+ .+..+=|-=..-+...+..+|++++|+|+--| T Consensus 24 a~~~~~~i~~~~~~g~is~re~~~~~~~~l~~~~~~e~~~~l------~~~~~idpgF~eF~~~~~~~~ip~~IvS~G~d 97 (214) T TIGR03333 24 APPEWEALKDGVLSKTLSIQEGVGRMFGLLPSSLKEEITSFV------LETAEIREGFREFVAFINEHGIPFYVISGGMD 97 (214) T ss_pred CCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHH------HHCCCCCCCHHHHHHHHHHCCCCEEEECCCCH T ss_conf 848899999999819865999999999837887279999999------85598685799999999985997899869933 Q ss_pred HH Q ss_conf 21 Q gi|254780277|r 144 LM 145 (976) Q Consensus 144 ~~ 145 (976) ++ T Consensus 98 ~~ 99 (214) T TIGR03333 98 FF 99 (214) T ss_pred HH T ss_conf 89 No 429 >pfam00216 Bac_DNA_binding Bacterial DNA-binding protein. Probab=20.59 E-value=44 Score=11.92 Aligned_cols=21 Identities=10% Similarity=0.396 Sum_probs=0.0 Q ss_pred HHHHCCCCHHHHHHHHHHHHH Q ss_conf 678819998999999999999 Q gi|254780277|r 817 LATQLRIPRSEAADYIKRYFH 837 (976) Q Consensus 817 La~~l~is~~eA~~~i~~yf~ 837 (976) +|++.|+|..+|+.+++.+|+ T Consensus 10 ia~~~~ls~~~~~~~v~~~~~ 30 (90) T pfam00216 10 IAEKTGLSKKDAEAIVDAFLD 30 (90) T ss_pred HHHHHCCCHHHHHHHHHHHHH T ss_conf 998649699999999999999 No 430 >pfam05918 API5 Apoptosis inhibitory protein 5 (API5). This family consists of apoptosis inhibitory protein 5 (API5) sequences from several organisms. Apoptosis or programmed cell death is a physiological form of cell death that occurs in embryonic development and organ formation. It is characterized by biochemical and morphological changes such as DNA fragmentation and cell volume shrinkage. API5 is an anti apoptosis gene located in human chromosome 11, whose expression prevents the programmed cell death that occurs upon the deprivation of growth factors. Probab=20.48 E-value=44 Score=11.91 Aligned_cols=17 Identities=41% Similarity=0.815 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHCHHHHH Q ss_conf 99999999998888999 Q gi|254780277|r 828 AADYIKRYFHRFPGIHE 844 (976) Q Consensus 828 A~~~i~~yf~~~p~v~~ 844 (976) |.+||-+||.-||.+.. T Consensus 42 AsQfIprFFK~FP~La~ 58 (543) T pfam05918 42 ASQLIPRYFKFFPSLAT 58 (543) T ss_pred HHHHHHHHHHHCHHHHH T ss_conf 99899999865744689 No 431 >pfam04025 DUF370 Domain of unknown function (DUF370). Bacterial domain of unknown function. Probab=20.44 E-value=44 Score=11.90 Aligned_cols=20 Identities=30% Similarity=0.486 Sum_probs=0.0 Q ss_pred CCCCCCEEEECCCCCEEEEE Q ss_conf 22224302424899489996 Q gi|254780277|r 729 LGQKIRRAFISPPTKKLISA 748 (976) Q Consensus 729 ~g~~iR~~F~a~~g~~lv~a 748 (976) +||+-|...+-+.||+++|| T Consensus 44 ~GrkTRavii~DS~HviLSa 63 (73) T pfam04025 44 YGRKTRAVIITDSGHVILSA 63 (73) T ss_pred CCCEEEEEEEECCCCEEEEE T ss_conf 69606799997089389970 No 432 >CHL00137 rps13 ribosomal protein S13; Validated Probab=20.41 E-value=44 Score=11.90 Aligned_cols=20 Identities=30% Similarity=0.441 Sum_probs=0.0 Q ss_pred CCCCCCCCCHHHHHHHHHCC Q ss_conf 99877636024998984136 Q gi|254780277|r 193 IPGIPGIGYKTAALLLQEYG 212 (976) Q Consensus 193 ipGv~GiG~KtA~~ll~~~g 212 (976) +..|.|||+.+|..+....| T Consensus 19 Lt~I~GIG~~~A~~Ic~~lg 38 (122) T CHL00137 19 LTYIYGIGLTSAKKILEKAN 38 (122) T ss_pred HHHHCCCCHHHHHHHHHHCC T ss_conf 11000618999999999849 No 433 >KOG1968 consensus Probab=20.40 E-value=40 Score=12.19 Aligned_cols=14 Identities=57% Similarity=0.869 Sum_probs=0.0 Q ss_pred CCCCCCCCCCHHHHH Q ss_conf 899877636024998 Q gi|254780277|r 192 NIPGIPGIGYKTAAL 206 (976) Q Consensus 192 nipGv~GiG~KtA~~ 206 (976) ++-|.|||| ||+++ T Consensus 361 l~~G~pGig-KT~~~ 374 (871) T KOG1968 361 LLSGPPGIG-KTTAA 374 (871) T ss_pred HHCCCCCCC-CHHHH T ss_conf 731788777-20567 No 434 >KOG2875 consensus Probab=20.27 E-value=44 Score=11.88 Aligned_cols=13 Identities=38% Similarity=0.736 Sum_probs=0.0 Q ss_pred CCCCCCHHHHHHH Q ss_conf 7763602499898 Q gi|254780277|r 196 IPGIGYKTAALLL 208 (976) Q Consensus 196 v~GiG~KtA~~ll 208 (976) .||||+|-|..++ T Consensus 223 lpGVG~KVADCI~ 235 (323) T KOG2875 223 LPGVGPKVADCIC 235 (323) T ss_pred CCCCCCHHHHHHH T ss_conf 8887614756223 No 435 >COG1415 Uncharacterized conserved protein [Function unknown] Probab=20.24 E-value=44 Score=11.87 Aligned_cols=34 Identities=24% Similarity=0.470 Sum_probs=0.0 Q ss_pred ECHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 389999888489887877778751874358998776360249989 Q gi|254780277|r 163 IDIEHVIKKWGIPPEKMVCLQALTGDSIDNIPGIPGIGYKTAALL 207 (976) Q Consensus 163 ~~~~~v~~~~gv~p~~~~d~~al~GD~sDnipGv~GiG~KtA~~l 207 (976) ++...+..-|...|.-+-+++- +|||||+|...| T Consensus 261 i~~~~l~~~ye~~p~Df~elLl-----------~~GiGpstvRAL 294 (373) T COG1415 261 LNWRRLRRLYELNPDDFEELLL-----------VPGIGPSTVRAL 294 (373) T ss_pred CCHHHHHHHHHCCCCCHHHHHH-----------CCCCCHHHHHHH T ss_conf 7699997664468324999874-----------068788999999 No 436 >cd00635 PLPDE_III_YBL036c_like Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzymes, YBL036c-like proteins. This family contains mostly uncharacterized proteins, widely distributed among eukaryotes, bacteria and archaea, that bear similarity to the yeast hypothetical protein YBL036c, which is homologous to a Pseudomonas aeruginosa gene that is co-transcribed with a known proline biosynthetic gene. YBL036c is a single domain monomeric protein with a typical TIM barrel fold. It binds the PLP cofactor and has been shown to exhibit amino acid racemase activity. The YBL036c structure is similar to the N-terminal domain of the fold type III PLP-dependent enzymes, bacterial alanine racemase and eukaryotic ornithine decarboxylase, which are two-domain dimeric proteins. The lack of a second domain in YBL036c may explain limited D- to L-alanine racemase or non-specific racemase activity. Probab=20.21 E-value=44 Score=11.87 Aligned_cols=51 Identities=20% Similarity=0.164 Sum_probs=0.0 Q ss_pred HHCCCHHHHHHHHHHCCC-CCCCCCCCCCCCCHHH--HHHHHHCCCHHHHHHHH Q ss_conf 848988787777875187-4358998776360249--98984136789998645 Q gi|254780277|r 171 KWGIPPEKMVCLQALTGD-SIDNIPGIPGIGYKTA--ALLLQEYGNLENILINA 221 (976) Q Consensus 171 ~~gv~p~~~~d~~al~GD-~sDnipGv~GiG~KtA--~~ll~~~gsle~i~~~~ 221 (976) |.||.|+.+.++..-+-+ +.=++-|+..|||.+. ...-..|..+-.+.+.+ T Consensus 131 K~G~~~~e~~~~~~~~~~~~~l~~~GLM~i~p~~~d~~~~~~~F~~l~~l~~~l 184 (222) T cd00635 131 KSGVAPEELEELLEEIAALPNLRIRGLMTIAPLTEDPEEVRPYFRELRELRDEL 184 (222) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 568899999999999996689975103653699999799999999999999998 No 437 >TIGR01362 KDO8P_synth 3-deoxy-8-phosphooctulonate synthase; InterPro: IPR006269 These sequences describe 2-dehydro-3-deoxyphosphooctonate aldolase. Alternate names include 3-deoxy-d-manno-octulosonic acid 8-phosphate and KDO-8 phosphate synthetase. It catalyzes the aldol condensation of phosphoenolpyruvate with D-arabinose 5-phosphate. phosphoenolpyruvate + D-arabinose 5-phosphate + H_2O = 2-dehydro-3-deoxy-D-octonate 8-phosphate + phosphate In Gram-negative bacteria, this is the first step in the biosynthesis of 3-deoxy-D-manno-octulosonate, part of the oligosaccharide core of lipopolysaccharide. ; GO: 0008676 3-deoxy-8-phosphooctulonate synthase activity, 0008152 metabolic process, 0005737 cytoplasm. Probab=20.11 E-value=45 Score=11.85 Aligned_cols=30 Identities=33% Similarity=0.542 Sum_probs=0.0 Q ss_pred HCCCCCCC----HHHHHHHHHHHHHHHHCCCCEE Q ss_conf 16679879----7588648999999997099799 Q gi|254780277|r 81 KANRPKIP----EMLLPQLPLVRLATQAFGIPAI 110 (976) Q Consensus 81 Ka~R~~~p----~~l~~q~~~i~~~l~~~gi~~~ 110 (976) ||||...- .-|..-+.+++++=+.+|+|++ T Consensus 45 KANRtS~~SfRGpGle~GL~iL~kvK~~fG~pi~ 78 (279) T TIGR01362 45 KANRTSINSFRGPGLEEGLKILQKVKEEFGVPIL 78 (279) T ss_pred HHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCC T ss_conf 1232110243688878999999986565289702 No 438 >PRK13331 pantothenate kinase; Reviewed Probab=20.04 E-value=45 Score=11.84 Aligned_cols=38 Identities=5% Similarity=0.016 Sum_probs=0.0 Q ss_pred CCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 46898855006999999999999984855479879999 Q gi|254780277|r 27 SRKHDGLPVNAIAGFCNMLWKLLQNSRKESIASHFAVI 64 (976) Q Consensus 27 ~~~~~G~~t~ai~gf~~~l~~~i~~~~~~~~~~~~~v~ 64 (976) .+|.+.+..+.+||++.++..+|++.+.++....++++ T Consensus 175 ~nT~~ai~sG~~~G~~~~I~~~i~~~~~e~~~~~vIlT 212 (251) T PRK13331 175 LNTQEAIQSGVIYTLLAGLRDFIEDWLSLFPDGPIVLT 212 (251) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 89899999999999999999999999997799949997 No 439 >TIGR00375 TIGR00375 conserved hypothetical protein TIGR00375; InterPro: IPR005287 This family of conserved hypothetical proteins has no known function. . Probab=20.02 E-value=44 Score=11.89 Aligned_cols=42 Identities=19% Similarity=0.347 Sum_probs=0.0 Q ss_pred CCCCCCHH----------HHHHHHHCCC-----HHHHHHHHHHHHHHHHHHHHCCCH Q ss_conf 77636024----------9989841367-----899986454200112232200357 Q gi|254780277|r 196 IPGIGYKT----------AALLLQEYGN-----LENILINASRIKQKKRRENILEYA 237 (976) Q Consensus 196 v~GiG~Kt----------A~~ll~~~gs-----le~i~~~~d~i~~~k~~~~l~~~~ 237 (976) |+|+++++ =-+|+.+||| +|.-++.+..|.++|+.+.+...+ T Consensus 311 ~ig~~~~kG~~TK~VQ~~wekl~k~fGtEi~VL~~A~~edla~~~pPkvA~~i~~fR 367 (384) T TIGR00375 311 VIGLKIDKGVFTKAVQSLWEKLVKKFGTEIEVLLEAALEDLAKVDPPKVAALIEKFR 367 (384) T ss_pred ECCCCCCCCEEEHHHHHHHHHHHHHHCCHHHHHHCCCHHHHHCCCCCHHHHHHHHHH T ss_conf 035345873130346778999864104156676427845873358835889999863 Done!