Query gi|254780814|ref|YP_003065227.1| DNA gyrase subunit B [Candidatus Liberibacter asiaticus str. psy62] Match_columns 803 No_of_seqs 234 out of 2361 Neff 6.6 Searched_HMMs 39220 Date Sun May 29 18:16:44 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780814.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01059 gyrB DNA gyrase, B s 100.0 0 0 2598.9 57.0 789 10-803 1-818 (818) 2 PRK05644 gyrB DNA gyrase subun 100.0 0 0 2108.0 62.5 723 1-803 2-725 (725) 3 COG0187 GyrB Type IIA topoisom 100.0 0 0 1967.4 55.0 631 7-803 3-635 (635) 4 PTZ00109 DNA gyrase subunit b; 100.0 0 0 1959.2 57.7 729 7-797 53-938 (941) 5 TIGR01058 parE_Gpos DNA topois 100.0 0 0 1850.1 41.1 627 1-796 1-639 (655) 6 PRK05559 DNA topoisomerase IV 100.0 0 0 1804.1 53.5 627 5-800 3-632 (633) 7 smart00433 TOP2c Topoisomerase 100.0 0 0 1677.1 53.3 590 40-797 1-594 (594) 8 TIGR01055 parE_Gneg DNA topois 100.0 0 0 1487.7 39.1 627 8-798 2-642 (647) 9 PTZ00108 DNA topoisomerase II; 100.0 0 0 1175.5 38.3 577 1-596 1-680 (1506) 10 KOG0355 consensus 100.0 0 0 867.7 27.1 597 12-796 5-636 (842) 11 cd00822 TopoII_Trans_DNA_gyras 100.0 0 0 391.6 15.0 172 227-398 1-172 (172) 12 cd03366 TOPRIM_TopoIIA_GyrB TO 100.0 0 0 391.9 10.8 114 425-539 1-114 (114) 13 cd01030 TOPRIM_TopoIIA_like TO 100.0 0 0 384.8 10.7 115 425-539 1-115 (115) 14 pfam00204 DNA_gyraseB DNA gyra 100.0 0 0 376.1 14.8 170 228-398 1-173 (173) 15 cd03365 TOPRIM_TopoIIA TOPRIM_ 100.0 2.9E-43 0 355.0 8.3 112 425-536 1-117 (120) 16 pfam00986 DNA_gyraseB_C DNA gy 99.9 3.6E-28 9.2E-33 237.8 4.2 64 730-793 2-65 (65) 17 cd03481 TopoIIA_Trans_ScTopoII 99.9 3.8E-22 9.7E-27 191.1 11.9 130 246-386 22-151 (153) 18 PRK04184 DNA topoisomerase VI 99.4 1.6E-10 4.2E-15 100.7 19.2 187 12-213 8-211 (533) 19 COG1389 DNA topoisomerase VI, 99.3 1.9E-10 4.8E-15 100.3 16.5 185 15-213 12-213 (538) 20 TIGR01052 top6b DNA topoisomer 99.0 1.3E-09 3.4E-14 93.7 8.5 168 33-213 21-216 (662) 21 PRK00095 mutL DNA mismatch rep 98.9 2E-07 5E-12 76.8 14.3 229 40-302 22-269 (612) 22 cd00329 TopoII_MutL_Trans MutL 98.5 1.8E-07 4.5E-12 77.2 6.1 99 230-347 2-106 (107) 23 COG0323 MutL DNA mismatch repa 98.4 3.3E-06 8.5E-11 67.3 10.4 257 41-339 24-298 (638) 24 TIGR00585 mutl DNA mismatch re 98.4 3.5E-06 8.8E-11 67.2 8.9 182 40-237 22-235 (367) 25 cd00075 HATPase_c Histidine ki 97.7 4.2E-05 1.1E-09 58.7 3.7 80 41-135 1-83 (103) 26 pfam02518 HATPase_c Histidine 97.6 1.7E-05 4.2E-10 61.9 1.1 87 38-139 3-92 (111) 27 smart00387 HATPase_c Histidine 97.6 2.4E-05 6.2E-10 60.6 1.6 88 38-140 3-93 (111) 28 KOG1978 consensus 97.5 0.00059 1.5E-08 49.8 7.3 170 38-228 18-205 (672) 29 PRK05218 heat shock protein 90 97.2 0.0062 1.6E-07 41.9 10.0 152 46-212 31-209 (612) 30 COG0326 HtpG Molecular chapero 97.0 0.0014 3.6E-08 46.9 5.4 127 46-181 33-182 (623) 31 PTZ00272 heat shock protein 83 97.0 0.0045 1.1E-07 43.0 7.5 150 45-211 30-206 (701) 32 PRK13560 hypothetical protein; 96.9 0.0011 2.8E-08 47.8 3.9 23 192-214 195-223 (807) 33 KOG1979 consensus 96.7 0.0063 1.6E-07 41.9 6.6 151 43-212 30-196 (694) 34 COG3920 Signal transduction hi 96.1 0.0083 2.1E-07 40.9 4.3 94 30-153 113-216 (221) 35 PRK10935 nitrate/nitrite senso 96.1 0.019 4.8E-07 38.1 5.9 11 662-672 507-517 (568) 36 COG3850 NarQ Signal transducti 96.0 0.015 3.9E-07 38.9 5.3 17 656-672 507-523 (574) 37 PTZ00130 heat shock protein 90 95.9 0.034 8.7E-07 36.2 6.6 149 46-213 102-281 (824) 38 COG2205 KdpD Osmosensitive K+ 95.8 0.011 2.8E-07 40.0 3.6 29 25-57 20-48 (890) 39 pfam01751 Toprim Toprim domain 95.8 0.0067 1.7E-07 41.6 2.5 79 426-527 2-82 (89) 40 PRK10604 sensor protein RstB; 95.6 0.0055 1.4E-07 42.3 1.7 22 573-594 352-373 (433) 41 PRK10337 sensor protein QseC; 95.4 0.026 6.5E-07 37.1 4.4 18 575-592 382-399 (446) 42 PRK10549 signal transduction h 95.4 0.0059 1.5E-07 42.1 1.1 13 577-589 392-404 (467) 43 PRK09835 sensor kinase CusS; P 95.3 0.02 5.1E-07 38.0 3.7 20 573-592 410-429 (482) 44 PRK10600 nitrate/nitrite senso 95.3 0.038 9.6E-07 35.8 5.0 14 659-672 498-511 (569) 45 PRK11073 glnL nitrogen regulat 95.3 0.042 1.1E-06 35.5 5.2 11 68-78 23-33 (348) 46 PRK03660 anti-sigma F factor; 95.2 0.063 1.6E-06 34.1 6.0 88 43-149 42-135 (146) 47 PRK10815 sensor protein PhoQ; 95.2 0.013 3.2E-07 39.5 2.3 21 572-592 409-429 (484) 48 PRK10490 sensor protein KdpD; 95.2 0.0094 2.4E-07 40.5 1.6 38 16-54 70-137 (895) 49 cd00188 TOPRIM Topoisomerase-p 94.9 0.064 1.6E-06 34.0 5.2 73 426-524 2-74 (83) 50 PTZ00108 DNA topoisomerase II; 94.8 0.098 2.5E-06 32.6 5.9 12 57-69 229-240 (1506) 51 PRK11644 sensory histidine kin 94.6 0.07 1.8E-06 33.8 4.8 15 658-672 440-454 (497) 52 COG4585 Signal transduction hi 94.6 0.061 1.6E-06 34.2 4.5 11 472-482 223-233 (365) 53 PRK09467 envZ osmolarity senso 94.5 0.071 1.8E-06 33.7 4.7 22 573-594 366-387 (437) 54 PRK10755 sensor protein BasS/P 94.5 0.1 2.6E-06 32.5 5.4 23 573-595 287-309 (355) 55 KOG1977 consensus 94.3 0.028 7.1E-07 36.8 2.3 161 40-214 21-195 (1142) 56 TIGR02938 nifL_nitrog nitrogen 94.3 0.053 1.4E-06 34.7 3.6 27 140-176 83-109 (496) 57 PRK11100 sensory histidine kin 94.1 0.084 2.1E-06 33.1 4.3 12 578-589 408-419 (475) 58 PRK09303 adaptive-response sen 94.1 0.1 2.6E-06 32.4 4.7 19 574-592 307-325 (378) 59 KOG0787 consensus 94.0 0.13 3.4E-06 31.6 5.2 49 495-544 238-292 (414) 60 PRK10364 sensor protein ZraS; 93.8 0.084 2.2E-06 33.1 3.9 23 191-214 76-99 (455) 61 PRK11006 phoR phosphate regulo 93.4 0.14 3.6E-06 31.4 4.4 22 573-594 352-373 (431) 62 PRK04069 serine-protein kinase 92.8 0.43 1.1E-05 27.6 6.2 92 43-150 45-141 (158) 63 COG4191 Signal transduction hi 92.3 0.19 4.7E-06 30.4 3.8 11 62-73 101-111 (603) 64 PRK11107 hybrid sensory histid 92.3 0.16 4.2E-06 30.9 3.5 107 456-594 670-785 (920) 65 COG2172 RsbW Anti-sigma regula 91.0 1.3 3.3E-05 23.9 7.8 83 41-143 41-130 (146) 66 COG5000 NtrY Signal transducti 90.8 0.46 1.2E-05 27.4 4.5 33 569-601 545-577 (712) 67 PRK11466 hybrid sensory histid 90.1 0.45 1.2E-05 27.4 4.0 54 28-87 552-608 (912) 68 PRK13837 two-component VirA-li 88.9 0.94 2.4E-05 25.0 4.9 73 509-592 560-632 (831) 69 PRK11091 aerobic respiration c 88.7 0.18 4.5E-06 30.6 1.1 114 451-596 521-644 (779) 70 TIGR02966 phoR_proteo phosphat 88.3 0.69 1.8E-05 26.0 3.9 29 570-598 267-295 (339) 71 PRK10547 chemotaxis protein Ch 87.8 0.5 1.3E-05 27.1 2.9 21 705-725 631-651 (662) 72 PRK10841 hybrid sensory kinase 87.4 0.64 1.6E-05 26.3 3.3 25 62-87 143-167 (947) 73 PRK04031 DNA primase; Provisio 87.1 2.3 5.9E-05 22.0 6.6 156 354-603 115-271 (420) 74 COG3852 NtrB Signal transducti 85.6 1.5 3.8E-05 23.4 4.4 10 473-482 249-258 (363) 75 COG4564 Signal transduction hi 84.3 2.5 6.4E-05 21.7 5.0 40 266-305 259-307 (459) 76 PRK13557 histidine kinase; Pro 83.7 2.1 5.2E-05 22.3 4.4 11 70-80 49-59 (538) 77 TIGR02956 TMAO_torS TMAO reduc 79.4 0.92 2.3E-05 25.1 1.3 113 28-157 613-742 (1052) 78 TIGR02916 PEP_his_kin putative 77.9 4.9 0.00013 19.4 5.0 10 338-347 288-297 (696) 79 cd01027 TOPRIM_RNase_M5_like T 77.6 0.59 1.5E-05 26.6 -0.1 26 496-521 44-69 (81) 80 COG5002 VicK Signal transducti 77.2 3.2 8.2E-05 20.8 3.6 26 575-600 379-404 (459) 81 smart00493 TOPRIM topoisomeras 75.2 5.1 0.00013 19.3 4.2 70 426-521 2-71 (76) 82 COG1658 Small primase-like pro 74.8 2.6 6.7E-05 21.5 2.6 27 495-521 53-79 (127) 83 TIGR02604 Piru_Ver_Nterm putat 74.4 1.7 4.4E-05 22.9 1.6 15 170-184 198-212 (459) 84 COG0643 CheA Chemotaxis protei 72.5 4.1 0.0001 20.0 3.1 25 704-728 687-711 (716) 85 PRK11558 hypothetical protein; 69.9 5.5 0.00014 19.0 3.3 57 26-87 26-83 (97) 86 KOG0019 consensus 69.3 5.6 0.00014 19.0 3.2 122 44-177 61-204 (656) 87 cd01026 TOPRIM_OLD TOPRIM_OLD: 67.5 8.3 0.00021 17.6 3.9 59 426-512 5-70 (97) 88 TIGR01925 spIIAB anti-sigma F 66.0 4 0.0001 20.1 2.0 81 43-141 42-126 (137) 89 TIGR00032 argG argininosuccina 64.8 0.81 2.1E-05 25.5 -1.8 34 275-310 231-273 (420) 90 cd01028 TOPRIM_TopoIA TOPRIM_T 63.3 4.9 0.00012 19.4 2.0 27 496-522 90-116 (142) 91 pfam09707 Cas_Cas2CT1978 CRISP 61.5 10 0.00026 16.9 3.3 28 26-55 24-51 (86) 92 pfam09239 Topo-VIb_trans Topoi 61.5 11 0.00027 16.8 8.6 80 256-349 41-125 (160) 93 PTZ00233 variable surface prot 60.7 3.5 9E-05 20.5 0.9 55 495-567 429-484 (509) 94 pfam07994 NAD_binding_5 Myo-in 60.1 4.2 0.00011 19.9 1.2 75 228-334 157-231 (389) 95 COG2972 Predicted signal trans 56.0 13 0.00033 16.2 4.5 80 489-593 323-409 (456) 96 TIGR01320 mal_quin_oxido malat 55.8 6.8 0.00017 18.3 1.6 33 495-531 406-451 (487) 97 cd03363 TOPRIM_TopoIA_TopoI TO 55.8 8.5 0.00022 17.6 2.1 25 497-521 73-97 (123) 98 cd00823 TopoIIB_Trans TopoIIB_ 54.2 14 0.00035 16.0 8.6 80 256-349 32-116 (151) 99 TIGR01290 nifB nitrogenase cof 51.6 9.6 0.00024 17.1 1.9 10 274-283 189-198 (461) 100 KOG2672 consensus 50.7 5 0.00013 19.4 0.3 34 520-553 310-344 (360) 101 PRK13790 phosphoribosylamine-- 50.5 15 0.00039 15.5 4.8 10 444-453 288-297 (415) 102 TIGR01924 rsbW_low_gc anti-sig 50.3 11 0.00027 16.8 1.9 88 44-148 47-141 (161) 103 PRK00885 phosphoribosylamine-- 49.2 16 0.00041 15.4 3.7 10 444-453 290-299 (424) 104 TIGR00644 recJ single-stranded 49.1 16 0.00041 15.4 3.1 112 9-136 63-193 (705) 105 PRK13529 malate dehydrogenase; 48.7 13 0.00034 16.0 2.2 46 29-86 124-174 (563) 106 KOG0308 consensus 47.3 17 0.00044 15.2 4.3 30 53-83 77-106 (735) 107 PRK04017 hypothetical protein; 47.3 17 0.00044 15.2 3.0 28 497-524 65-92 (132) 108 PTZ00306 NADH-dependent fumara 46.7 11 0.00028 16.7 1.5 46 717-766 1077-1122(1167) 109 TIGR01420 pilT_fam twitching m 44.7 19 0.00048 14.9 3.8 190 45-302 2-199 (350) 110 PRK05279 N-acetylglutamate syn 43.9 6.1 0.00015 18.7 -0.1 73 453-530 149-235 (441) 111 TIGR02343 chap_CCT_epsi T-comp 43.1 14 0.00035 15.9 1.6 26 66-91 55-80 (541) 112 TIGR01873 cas_CT1978 CRISPR-as 42.9 15 0.00037 15.7 1.7 24 26-51 24-48 (91) 113 pfam07182 DUF1402 Protein of u 42.0 11 0.00027 16.8 0.8 12 336-348 173-184 (304) 114 TIGR01836 PHA_synth_III_C poly 41.7 21 0.00052 14.6 3.0 46 454-503 302-347 (367) 115 PRK13789 phosphoribosylamine-- 40.0 22 0.00055 14.4 4.0 12 449-460 378-389 (426) 116 KOG1332 consensus 39.7 22 0.00056 14.4 7.6 107 59-178 21-160 (299) 117 TIGR02749 prenyl_cyano solanes 39.3 22 0.00057 14.3 2.5 35 368-402 250-293 (325) 118 pfam09284 RhgB_N Rhamnogalactu 38.1 23 0.00059 14.2 6.0 87 97-184 17-116 (249) 119 TIGR02080 O_succ_thio_ly O-suc 37.4 14 0.00036 15.8 0.9 16 297-312 259-274 (383) 120 pfam10340 DUF2424 Protein of u 37.2 19 0.00047 14.9 1.5 15 425-439 194-208 (374) 121 COG0049 RpsG Ribosomal protein 37.1 24 0.0006 14.1 2.0 57 334-390 12-72 (148) 122 KOG0020 consensus 36.7 24 0.00062 14.0 2.7 155 46-208 101-280 (785) 123 COG4822 CbiK Cobalamin biosynt 35.8 25 0.00064 13.9 3.6 10 426-435 200-209 (265) 124 TIGR01781 Trep_dent_lipo Trepo 35.7 23 0.00059 14.2 1.8 25 266-290 267-293 (463) 125 TIGR01992 PTS-IIBC-Tre PTS sys 35.3 16 0.0004 15.5 0.9 16 198-213 19-34 (489) 126 COG5257 GCD11 Translation init 34.8 26 0.00066 13.8 3.6 33 317-351 104-136 (415) 127 pfam05914 RIB43A RIB43A. This 34.7 16 0.00041 15.4 0.8 40 317-370 127-166 (379) 128 LOAD_Toprim consensus 34.2 26 0.00067 13.7 2.6 27 496-522 45-71 (98) 129 TIGR00018 panC pantoate--beta- 34.1 26 0.00066 13.8 1.8 25 176-200 151-176 (310) 130 KOG3938 consensus 33.9 27 0.00068 13.7 3.1 16 275-290 160-175 (334) 131 cd00223 TOPRIM_TopoIIB_SPO TOP 33.9 21 0.00053 14.5 1.3 34 229-266 39-72 (160) 132 PRK09918 putative periplasmic 33.7 25 0.00064 13.9 1.7 12 72-84 42-53 (230) 133 KOG2479 consensus 33.4 8.1 0.00021 17.7 -0.9 27 171-203 162-188 (549) 134 PRK11070 ssDNA exonuclease Rec 33.1 27 0.0007 13.6 3.1 15 369-383 306-320 (574) 135 pfam10691 DUF2497 Protein of u 32.1 28 0.00072 13.5 3.0 32 352-383 34-66 (72) 136 PRK05776 DNA topoisomerase III 32.0 28 0.00072 13.5 1.9 11 293-303 163-173 (675) 137 KOG0318 consensus 30.9 30 0.00075 13.4 8.2 13 172-184 195-207 (603) 138 TIGR01245 trpD anthranilate ph 30.8 24 0.00061 14.1 1.2 37 760-796 254-291 (331) 139 pfam05055 DUF677 Protein of un 30.2 30 0.00077 13.3 4.6 10 275-284 71-80 (336) 140 PHA02516 baseplate wedge subun 30.1 30 0.00076 13.3 1.6 65 16-80 7-92 (103) 141 cd03362 TOPRIM_TopoIA_TopoIII 30.1 30 0.00078 13.3 2.1 24 498-521 100-123 (151) 142 TIGR01990 bPGM beta-phosphoglu 29.1 18 0.00047 14.9 0.4 16 315-330 86-101 (190) 143 PRK03353 ribB 3,4-dihydroxy-2- 28.4 32 0.00082 13.1 2.5 22 189-213 13-34 (217) 144 KOG1845 consensus 27.3 22 0.00057 14.3 0.5 28 45-72 151-180 (775) 145 TIGR00010 TIGR00010 hydrolase, 27.2 34 0.00086 12.9 2.8 28 425-463 210-240 (269) 146 PRK11385 hypothetical protein; 27.1 34 0.00087 12.9 1.7 51 29-81 5-62 (245) 147 TIGR01880 Ac-peptdase-euk N-ac 26.7 15 0.00038 15.6 -0.4 46 136-186 98-156 (433) 148 pfam09052 SipA Salmonella inva 26.1 20 0.0005 14.7 0.1 30 257-292 300-329 (674) 149 pfam06154 YagB_YeeU_YfjZ YagB/ 26.1 35 0.0009 12.8 2.2 10 503-512 83-92 (104) 150 KOG0526 consensus 26.1 34 0.00086 12.9 1.3 52 169-226 115-168 (615) 151 PRK09522 bifunctional anthrani 25.9 36 0.00091 12.7 1.7 43 754-796 444-492 (531) 152 COG4251 Bacteriophytochrome (l 25.5 36 0.00092 12.7 3.1 24 720-743 683-706 (750) 153 pfam08166 NUC202 NUC202 domain 25.2 33 0.00085 12.9 1.1 36 749-788 14-49 (61) 154 TIGR02288 PaaN_2 phenylacetic 24.9 36 0.00092 12.7 1.2 71 271-341 348-427 (560) 155 PTZ00317 malic enzyme; Provisi 24.7 37 0.00095 12.6 2.6 47 28-86 122-173 (570) 156 KOG0180 consensus 24.5 27 0.00069 13.7 0.5 15 183-197 52-68 (204) 157 COG2831 FhaC Hemolysin activat 24.4 38 0.00096 12.5 1.6 11 121-131 242-252 (554) 158 pfam01525 Rota_NS26 Rotavirus 24.3 25 0.00064 13.9 0.4 27 432-460 178-204 (212) 159 cd03361 TOPRIM_TopoIA_RevGyr T 24.2 38 0.00097 12.5 2.3 25 497-521 119-143 (170) 160 KOG1676 consensus 23.7 28 0.0007 13.6 0.5 40 134-180 231-270 (600) 161 COG4942 Membrane-bound metallo 23.2 40 0.001 12.4 4.4 68 436-513 291-365 (420) 162 PTZ00260 glycosyl transferase 23.0 40 0.001 12.3 1.5 21 424-444 116-136 (336) 163 cd04237 AAK_NAGS-ABP AAK_NAGS- 22.9 18 0.00045 15.1 -0.7 63 453-516 142-218 (280) 164 TIGR01497 kdpB K+-transporting 22.6 41 0.001 12.3 4.5 55 70-145 125-196 (675) 165 pfam08812 YtxC YtxC-like famil 22.2 41 0.0011 12.2 4.5 10 294-303 102-111 (220) 166 TIGR01311 glycerol_kin glycero 22.0 31 0.0008 13.1 0.5 50 114-170 175-233 (518) 167 smart00462 PTB Phosphotyrosine 21.3 43 0.0011 12.1 5.2 37 47-84 23-65 (134) 168 COG1497 Predicted transcriptio 21.2 22 0.00056 14.4 -0.5 16 520-535 237-252 (260) 169 KOG3651 consensus 21.2 43 0.0011 12.1 2.2 28 526-553 253-281 (429) 170 PRK10162 acetyl esterase; Prov 21.2 35 0.0009 12.8 0.6 19 428-446 159-182 (319) 171 TIGR02257 cobalto_cobN cobalto 21.1 43 0.0011 12.1 4.9 16 489-504 513-528 (1310) 172 pfam08304 consensus 21.1 27 0.00068 13.7 -0.0 29 449-477 341-369 (378) 173 pfam11243 DUF3045 Protein of u 21.0 43 0.0011 12.1 1.6 37 525-564 40-78 (90) 174 pfam00926 DHBP_synthase 3,4-di 20.8 44 0.0011 12.0 2.2 10 204-213 10-19 (193) 175 COG3013 Uncharacterized conser 20.8 33 0.00083 13.0 0.3 35 446-480 130-168 (168) 176 KOG1777 consensus 20.6 44 0.0011 12.0 1.6 22 15-36 7-28 (625) 177 COG4309 Uncharacterized conser 20.3 45 0.0011 11.9 2.2 36 506-548 41-78 (98) 178 cd04252 AAK_NAGK-fArgBP AAK_NA 20.3 31 0.0008 13.2 0.2 83 454-540 112-208 (248) 179 pfam06317 Arena_RNA_pol Arenav 20.3 45 0.0011 11.9 6.6 56 144-199 530-597 (2206) 180 PRK05823 consensus 20.2 45 0.0011 11.9 2.2 13 16-28 26-38 (691) 181 pfam12535 Nudix_N Hydrolase of 20.1 26 0.00066 13.8 -0.3 16 482-499 18-33 (58) 182 pfam11692 DUF3289 Protein of u 20.1 45 0.0012 11.9 5.1 21 491-511 211-231 (277) 183 pfam08876 DUF1836 Domain of un 20.1 32 0.00081 13.1 0.2 37 269-311 12-48 (102) No 1 >TIGR01059 gyrB DNA gyrase, B subunit; InterPro: IPR011557 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . Topoisomerase II (called gyrase in bacteria) primarily introduces negative supercoils into DNA. In bacteria, topoisomerase II consists of two polypeptide subunits, gyrA and gyrB, which form a heterotetramer: (BA)2. In most eukaryotes, topoisomerase II consists of a single polypeptide, where the N- and C-terminal regions correspond to gyrB and gyrA, respectively. This entry represents the B subunit (gyrB) as found predominantly in bacteria, but does not include the topoisomerase II enzymes composed of a single polypeptide, as are found in most eukaryotes. GyrB has two functional domains: an N-terminal ATPase and a C-terminal responsible for subunit interactions, the latter differing between subunit B and single polypeptide topoisomerase II . More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0005694 chromosome. Probab=100.00 E-value=0 Score=2598.85 Aligned_cols=789 Identities=52% Similarity=0.851 Sum_probs=756.8 Q ss_pred CCHHHCEECCCCCHHCCCCCCEECCCCCC-CHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCCCC Q ss_conf 78655123135420004587154327996-32544565675557886860779757999838983999976844536407 Q gi|254780814|r 10 YDADSIQILKGLDAVKKRPSMYIGDTDGE-SGLHHMIYEVLDNAIDESLEGYADIITVTLNMDGSCTVSDNGRGIPTDLH 88 (803) Q Consensus 10 Y~~~~i~~L~gle~vrkRP~mYiG~t~~~-~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgsisV~dnGrGIPv~~h 88 (803) |+|+|||||+||||||||||||||||+ . +||||||||||||||||||||||+.|.|+|+.||||+|.||||||||||| T Consensus 1 Y~A~sIkVLeGLEAVRKRPGMYIGdTg-~G~GLHHlvyEVVDNaIDEAlAGyC~~I~v~i~~dgsv~V~DnGRGIPVdIH 79 (818) T TIGR01059 1 YDASSIKVLEGLEAVRKRPGMYIGDTG-EGTGLHHLVYEVVDNAIDEALAGYCDTINVTINDDGSVTVEDNGRGIPVDIH 79 (818) T ss_pred CCCCCEEEEECCCCCCCCCCCEECCCC-CCCCCEEEEEEEECCHHHHHHCCCCCEEEEEEECCCEEEEEECCCCCCCCCC T ss_conf 983300433147775378883011248-7897238999984340442006612414899962876999858858874306 Q ss_pred CCCC------CCEEEEEEEEECCCCCCCCCCCEEECCCCCCEEEEEEECCCEEEEEEEECCEEEEEEEECCEECCCCEEE Q ss_conf 7788------5001256425011323289860483116752037786326449999997895999998898364631387 Q gi|254780814|r 89 KEEG------ISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGVSVVNALSSWLKLRIKREGNIYEMSFINGILDNPLVVT 162 (803) Q Consensus 89 ~~~~------~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~g~~~~~l~~~ 162 (803) |++| +||+|||||+||||||||+++|||||||||||||||||||+||+|+|+||||.|+|+|++|+|.+||+++ T Consensus 80 p~eGakqGRe~SA~EvVlT~LHAGGKFD~~sYKVSGGLHGVGvSVVNALS~~L~v~v~RdGk~y~q~F~~G~P~~~L~~~ 159 (818) T TIGR01059 80 PEEGAKQGREISAVEVVLTVLHAGGKFDKDSYKVSGGLHGVGVSVVNALSEWLEVTVKRDGKIYRQEFERGIPVGPLEVV 159 (818) T ss_pred CCCCCCCCCCCCCEEEEEEEECCCEEECCCCEEEECCEEECCEEEEEECCCEEEEEEEECCEEEEEEECCCCEECCEEEE T ss_conf 76878788856600012200057500158804786756503279996117289999998692999998487220530681 Q ss_pred CCCCCCCCEEEEEEECHHHHH-HHHCCHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHCCCCC Q ss_conf 327788870899987788854-2100089999999973114641699999635864210100126883577653003565 Q gi|254780814|r 163 GSAGNDTGTEVTFLPSSDIFS-VQDFNYDTLQHRLRELSFLNSSVQISLIDKRSPEPREITMFYKGGIEAFVSYLDRHKK 241 (803) Q Consensus 163 ~~~~~~~GT~I~F~PD~~iF~-~~~~~~d~l~~R~~~~A~Ln~gl~i~l~Der~~~~~~~~f~~~~Gi~~fv~~l~~~~~ 241 (803) |.+.+++||+|+||||++||+ +++||+++|.+|||||||||+||+|.|.|+|..+.++.+|||+|||++||+|||.+|+ T Consensus 160 G~~~~~~GT~V~F~PD~~iF~~~~efd~~~l~~RLrELAFLN~Gv~I~l~D~R~~~~k~~~F~YeGGI~~fV~YLn~~K~ 239 (818) T TIGR01059 160 GEETKKTGTTVRFVPDPEIFEETTEFDFDILAKRLRELAFLNSGVKISLEDERLGKVKKVEFHYEGGIKSFVKYLNRNKE 239 (818) T ss_pred CCCCCCCCCEEEEEECHHHHCCCCEECHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCEEEEEECCCHHHHHHHHCCCCC T ss_conf 34656667678996071321443033278899999888871897269998732567520378735618999999537882 Q ss_pred CCCCCCEE----EEEEEE----EEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCC-C Q ss_conf 56885203----642003----69997334531775412122224621666772103568889999999998624200-0 Q gi|254780814|r 242 PLMQSPIR----IQGSRD----NISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSK-K 312 (803) Q Consensus 242 ~~~~~~i~----~~~~~~----~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~K-k 312 (803) |+|++||| ++++.+ ++.||||||||++|+|+|+||||||+|.+||||+.|||+||||+||+|+++++++| + T Consensus 240 ~l~~~ii~ektt~~g~~~~~~~~i~VEvALQwnd~Y~e~i~sF~NNI~T~eGGTHl~GFr~aLTr~~n~y~~~~~l~Ki~ 319 (818) T TIGR01059 240 PLHEEIIYEKTTISGEKEDDDGGIEVEVALQWNDGYSENILSFVNNINTREGGTHLEGFRSALTRVINSYAKKNKLLKIE 319 (818) T ss_pred CCCCCCCCCCCEEEEEECCCCCEEEEEEEEEEECCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 24788642343157775167760799999876268551037885675567886117788657899999999736875324 Q ss_pred CCCCCCHHHHHCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 12357613430572156665213763357210020261111032111026899998740035799987789998877554 Q gi|254780814|r 313 DKRSVIGDDCREGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLDASMVRDAA 392 (803) Q Consensus 313 ~k~~~~~~Diregl~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~~ar~~a 392 (803) .+.+++|||||||||||||||||||||+||||+||||||||++|+++|+++|..||++||.+||.|++|++.||+||+|| T Consensus 320 ~~~~l~GeD~REGLtAViSvKvP~PqFEGQTK~KLGNsEVr~~V~~~v~e~l~~f~eeNP~~Ak~Iv~K~~~AA~AReAA 399 (818) T TIGR01059 320 SKPNLTGEDIREGLTAVISVKVPDPQFEGQTKTKLGNSEVRSIVESLVYEKLTEFFEENPTEAKAIVEKAIEAARAREAA 399 (818) T ss_pred CCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 45567657710223899998148789788322346464167778888799999997529068999999999999998875 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCC-CCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCC Q ss_conf 553211001100124676676666555555521799982557543-2233465443023300562033443058766323 Q gi|254780814|r 393 RRARDLTRRKGVLDIASLPGKLADCSERDPKKSELFLVEGDSAGG-SAKQGRSRENQAILPLRGKILNVERARFDKMLSS 471 (803) Q Consensus 393 ~kar~~~rrK~~~~~~~lpgKL~Dc~~k~~~~~eL~ivEGDSAgg-sak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n 471 (803) |||||+|||||+|+.++||||||||++|||++||||||||||||| ||||||||.|||||||||||||||||+++|||+| T Consensus 400 rkARELtRRKSaLd~~~LPGKLADC~~kDP~~sElYiVEGDSAGGcSAKqGRDR~fQAILPLrGKILNVEKaR~dK~l~N 479 (818) T TIGR01059 400 RKARELTRRKSALDSGGLPGKLADCSSKDPSKSELYIVEGDSAGGCSAKQGRDRKFQAILPLRGKILNVEKARLDKILSN 479 (818) T ss_pred HHHHHHCCCCCCCCCCCCCCCEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEECEECCCHHHHHHH T ss_conf 66443001667423245886302578888123126687345277765444678654342332455210001204553212 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEEEECCCEE Q ss_conf 66788887625787753321235673738998508888532899999999963077874895999438337887267179 Q gi|254780814|r 472 QEIGTLITALGTGIGQDSFDINKLRYHKIIIMTDADVDGAHIRTLLLTFFFRQMPSLIHNGFLYIIKPPLYGITRGKSLQ 551 (803) Q Consensus 472 ~Ei~~l~~~lG~~~g~~~~~~~~lrY~kiiimtDaD~DG~HI~~Llltff~~~~~~li~~g~~yia~ppl~~~~~~K~E~ 551 (803) +||++||+|||||||+|+||++|||||||||||||||||||||||||||||||||+|||+|||||||||||||++||+++ T Consensus 480 ~Ei~~~ItALG~GI~~d~Fdl~KLRYHKIIIMTDADVDGSHIRTLLLTFFYR~M~~LIE~GyvYIAqPPLYKvk~gk~~~ 559 (818) T TIGR01059 480 DEIQAIITALGCGIGKDEFDLEKLRYHKIIIMTDADVDGSHIRTLLLTFFYRYMRPLIENGYVYIAQPPLYKVKKGKKER 559 (818) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHCCCCCC T ss_conf 53388899736886766787232464436884067878368999999988740158874587265078602120465511 Q ss_pred EEECHHHHHHHHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHCCCCCHH--H Q ss_conf 981457898999850011000014577654354689999999999999---99864344579999988742113511--2 Q gi|254780814|r 552 YVKDEESLEDYLINQSLSEEIELFCDSKGVISKNDLRNFIDDALKIDK---LIKDFYPNHDKTFIEQAVISGVFNCE--S 626 (803) Q Consensus 552 YikDD~EL~~~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~~~~~~~---~l~~l~~~~~~~lle~l~~~~~l~~~--~ 626 (803) |||||.||++|||+.++++..+. .....|.++.|+.++..+..+++ ++.++.+++......-.++...+... . T Consensus 560 Yikdd~e~d~yll~~~~~~~~l~--~~~~~L~~~~Le~~~~~~~~~~k~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 637 (818) T TIGR01059 560 YIKDDKELDHYLLNLGINEAELV--ITKRELVGELLEDLVNAYLELEKRKNLINRLERKAIRFSEELLIYQDLLEKELLE 637 (818) T ss_pred CCCCHHHHHHHHHHHHCCCEEEE--ECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCHHHHC T ss_conf 22277899889887532420265--3478999999999999999886314479999874043557888755530222200 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHC-----CCCEECCCCCCCEEEEEEECCCCEEEEECHHHHCCCCHHHHHHHHHHHHHHC Q ss_conf 35778999999998876531000-----0110024456705999995185302664667741642789999888877511 Q gi|254780814|r 627 DQEINVSLASKLANRLNFIAEKS-----EKSWHGSINNSDILVKRIARGVQEQFVLEGSSIFSFGSKYKRIFSNRIEENY 701 (803) Q Consensus 627 d~~~~~~~~~~l~~~l~~~~~~~-----~~~~~~~~~~~~i~~~~~~~Gv~~~~~l~~~~~~s~~~~~~~~~~~~l~~~~ 701 (803) .......|........+...... +...+.....+...+ .+ ||+...+.+...++...++.........++..+ T Consensus 638 ~~~~~~~~~~~~~y~~~~~E~~~~~~~~~~~~~~~~~~f~~~v-vR-~G~~~~~~L~~~q~~~~e~~~~~~~~~~~~~~~ 715 (818) T TIGR01059 638 YESAVKYWLQGALYIYSDKEKEEAKTQLPVHKNAEKELFESSV-VR-HGLDTDYILEKEQLELFEYGEIISLELSLQGLF 715 (818) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHE-EE-CCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 1468888999999887556578888742235677887541211-44-377632001666643323454344211034554 Q ss_pred CCCEEEEECCCEEEECCHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHCCCCHHHCEEEEEEHHHHHHHHHHHHHHC Q ss_conf 45502765582587399899999999863067628603777788955722303896134021236798898887865641 Q gi|254780814|r 702 LTILSLICNGDKIKITGPCSLLDTIFSIGRKGISMQRYKGLGEMNASQLWETTLNPDTRSLLRVKITDASQADDLFSRLM 781 (803) Q Consensus 702 ~~~~~l~~~~~~~~i~s~~el~~~i~~~gkKG~sIQRYKGLGEMNPeQLWETTMDP~tRtLLqV~IeDa~~Ad~iF~~LM 781 (803) .....+..+++.+.|.++.++++|+++.+++|++||||||||||||+|||||||||++|||+||+|+||++||++||+|| T Consensus 716 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~iQRYKGLGEMna~QLWETTMdP~~R~L~~V~~~DA~~AD~~F~~LM 795 (818) T TIGR01059 716 ETRAKLIRKNKKFEINSLEEALDKLVELGRKGIEIQRYKGLGEMNADQLWETTMDPESRTLLKVTIEDAVEADRIFSTLM 795 (818) T ss_pred HHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHCCCCCCCEEEEEEHHHHHHHHHHCCCCC T ss_conf 43432100122310025899999998407898604543166667822203342884644487745799988654320006 Q ss_pred CCCCCCHHHHHHHHHHHC-CCCC Q ss_conf 899876289988524412-3669 Q gi|254780814|r 782 GDEVEPRREFIQENSLPS-ALDI 803 (803) Q Consensus 782 GD~VEPRReFIe~NAL~v-NLDI 803 (803) ||+|||||+|||.|||+| |||| T Consensus 796 Gd~VePRR~FIE~nAl~v~nLDv 818 (818) T TIGR01059 796 GDEVEPRREFIEANALDVKNLDV 818 (818) T ss_pred CCCCCHHHHHHHHHHHHHHHCCC T ss_conf 89865307899999998852489 No 2 >PRK05644 gyrB DNA gyrase subunit B; Validated Probab=100.00 E-value=0 Score=2108.03 Aligned_cols=723 Identities=58% Similarity=0.949 Sum_probs=686.9 Q ss_pred CCCCCCCCCCCHHHCEECCCCCHHCCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECC Q ss_conf 98764223678655123135420004587154327996325445656755578868607797579998389839999768 Q gi|254780814|r 1 MSDNIQNSSYDADSIQILKGLDAVKKRPSMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDGSCTVSDNG 80 (803) Q Consensus 1 m~~~~~~~~Y~~~~i~~L~gle~vrkRP~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgsisV~dnG 80 (803) +..+|.+++|+|+||++|+||||||||||||||||+ .+|||||||||||||||||+||||+.|+|+||+||||||+||| T Consensus 2 ~~~~~~~~~Y~~~~I~vL~gle~VRkRPgMYiG~t~-~~glhhlv~EvvDNsiDEa~aG~~~~I~V~i~~DgsitV~DnG 80 (725) T PRK05644 2 ENKEMMENEYDASSIKVLEGLEAVRKRPGMYIGDTG-GRGLHHLVYEVVDNSIDEALAGYCDEIEVTIHADGSVSVTDNG 80 (725) T ss_pred CHHHHHHCCCCHHHCEECCCCHHHCCCCCCEECCCC-CCCCEEEEEEEEHHHHHHHHCCCCCEEEEEECCCCEEEEEECC T ss_conf 155475667896448450375755059864656699-9876688986421478798638996799999689829999899 Q ss_pred CCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEECCCCCCEEEEEEECCCEEEEEEEECCEEEEEEEECCEECCCCE Q ss_conf 44536407778850012564250113232898604831167520377863264499999978959999988983646313 Q gi|254780814|r 81 RGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGVSVVNALSSWLKLRIKREGNIYEMSFINGILDNPLV 160 (803) Q Consensus 81 rGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~g~~~~~l~ 160 (803) ||||||+||++|+|+||+|||+||||||||+++|++||||||||||||||||+||+|+|+|||+.|+|+|++|++..|++ T Consensus 81 RGIPvd~h~~~~~~~~E~I~t~LhAGgKFd~~~YkvSGGLHGVGasvVNALS~~~~VeV~RdGk~y~q~f~~G~~~~~l~ 160 (725) T PRK05644 81 RGIPVDIHPKEGVSAAEVVMTVLHAGGKFDGNSYKVSGGLHGVGVSVVNALSEWLELTVRRDGKIHRQEFERGVPVAPLK 160 (725) T ss_pred CCCCCCCCCCCCCCHHHHEECCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCEEEEEEEECCEEEEEEEECCCCCCCCE T ss_conf 76776345777972261300003556689999635316888802032160547089999989999999972897588852 Q ss_pred EECCCCCCCCEEEEEEECHHHHHHHHCCHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHCCCC Q ss_conf 87327788870899987788854210008999999997311464169999963586421010012688357765300356 Q gi|254780814|r 161 VTGSAGNDTGTEVTFLPSSDIFSVQDFNYDTLQHRLRELSFLNSSVQISLIDKRSPEPREITMFYKGGIEAFVSYLDRHK 240 (803) Q Consensus 161 ~~~~~~~~~GT~I~F~PD~~iF~~~~~~~d~l~~R~~~~A~Ln~gl~i~l~Der~~~~~~~~f~~~~Gi~~fv~~l~~~~ 240 (803) +++++ +++||+|+||||++||++..|++++|.+|++++|||||||+|+|+|+|...+++.+|+|+|||++|++|++.++ T Consensus 161 ~ig~~-~k~GT~ItF~PD~~iF~~~~fd~d~L~~RlrelA~Ln~GL~I~l~Der~~~~~~~~f~y~gGi~dyv~~l~~~k 239 (725) T PRK05644 161 VIGET-DKTGTTVRFWPDPEIFTTTEFDYETLAKRLRELAFLNSGVKITLTDEREGEGKEETFHYEGGIKSFVEYLNRNK 239 (725) T ss_pred ECCCC-CCCCEEEEEEECHHHCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEECCCHHHHHHHHHCCC T ss_conf 46788-88965999997756617744089999999999972179847998645778776148971476788898862578 Q ss_pred CCCCCCCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 55688520364200369997334531775412122224621666772103568889999999998624200012357613 Q gi|254780814|r 241 KPLMQSPIRIQGSRDNISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSVIGD 320 (803) Q Consensus 241 ~~~~~~~i~~~~~~~~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~~~~ 320 (803) ++++++||+++++.+++.||||+||+++|+|+++||||+|+|++|||||+|||+||||+||+|++++++.|+.+.+++++ T Consensus 240 ~~l~~~~i~~~~e~~~i~vEvAlqw~d~~~e~i~SFvN~I~T~eGGTHv~Gfr~aLtr~In~ya~~~~l~Kk~~~~itgd 319 (725) T PRK05644 240 TPLHPEIIYFEGEKDGIGVEVAMQWNDSYSENVLCFTNNIPTRDGGTHLAGFRAALTRVINDYARKNGLLKKAKVNLTGD 319 (725) T ss_pred CCCCCCCEEEEEEECCEEEEEEEEECCCCCCEEEEEECCEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHH T ss_conf 76788855899864780699999854776631455421435588981889999999999999999837543344566567 Q ss_pred HHHCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 43057215666521376335721002026111103211102689999874003579998778999887755455321100 Q gi|254780814|r 321 DCREGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLDASMVRDAARRARDLTR 400 (803) Q Consensus 321 Diregl~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~~ar~~a~kar~~~r 400 (803) ||||||++||||+|+||||+||||+||+|++|+++|+++++++|..||++||..|+.|++|++.|++||++|||||+++| T Consensus 320 DIregL~~vISvki~nPqFegQTKeKLgn~ev~~~V~~~v~e~l~~~l~~np~~ak~Ii~k~~~aa~aR~aa~kare~~r 399 (725) T PRK05644 320 DVREGLTAVISVKVPDPQFEGQTKTKLGNSEVRPIVESLVNEALSEFLEENPNEAKIIVGKAIEAARAREAARKARELTR 399 (725) T ss_pred HHHHCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 77603079999861698747731030158546788999999889999887979899999999999999999997687652 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHH Q ss_conf 11001246766766665555555217999825575432233465443023300562033443058766323667888876 Q gi|254780814|r 401 RKGVLDIASLPGKLADCSERDPKKSELFLVEGDSAGGSAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLITA 480 (803) Q Consensus 401 rK~~~~~~~lpgKL~Dc~~k~~~~~eL~ivEGDSAggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~ 480 (803) ||++++.++|||||+||+++||++|||||||||||||||||||||+|||||||||||||||+|+.+||++|+||++|++| T Consensus 400 rK~~~~~~~LPGKL~Dc~skd~~~~ELfiVEGDSAgGsAkqgRdr~~QAIlPLRGKiLNv~ka~~~ki~~N~EI~~li~a 479 (725) T PRK05644 400 RKSALDSAGLPGKLADCQSKDPAKSELYIVEGDSAGGSAKQGRDRKFQAILPLRGKILNVEKARFDKILSSEEIGTLITA 479 (725) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHCCHHHHHHHHH T ss_conf 02212357898755465568977767999825788863103356574688641672200200679888618799999999 Q ss_pred HCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEEEECCCEEEEECHHHHH Q ss_conf 25787753321235673738998508888532899999999963077874895999438337887267179981457898 Q gi|254780814|r 481 LGTGIGQDSFDINKLRYHKIIIMTDADVDGAHIRTLLLTFFFRQMPSLIHNGFLYIIKPPLYGITRGKSLQYVKDEESLE 560 (803) Q Consensus 481 lG~~~g~~~~~~~~lrY~kiiimtDaD~DG~HI~~Llltff~~~~~~li~~g~~yia~ppl~~~~~~K~E~YikDD~EL~ 560 (803) ||||+|++ ||+++||||||||||||||||||||||||||||||||+||++|||||||||||||++||++.|++||+||+ T Consensus 480 lG~g~g~~-fd~~~LRY~kIiIMTDADvDGsHIrtLLlTfF~r~m~~LIe~G~vYiAqPPLyki~~gk~~~Y~~~d~e~~ 558 (725) T PRK05644 480 LGTGIGRD-FDIDKLRYHKIIIMTDADVDGAHIRTLLLTFFYRQMPELIERGYVYIAQPPLYKVKKGKQEQYLKDDRELD 558 (725) T ss_pred HCCCCCCC-CCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEECCCEEEEECCHHHHH T ss_conf 67788865-77112774708999789987689999999999999999997593899259615985298148837999999 Q ss_pred HHHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99985001100001457765435468999999999999999864344579999988742113511235778999999998 Q gi|254780814|r 561 DYLINQSLSEEIELFCDSKGVISKNDLRNFIDDALKIDKLIKDFYPNHDKTFIEQAVISGVFNCESDQEINVSLASKLAN 640 (803) Q Consensus 561 ~~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~~~~~~~~l~~l~~~~~~~lle~l~~~~~l~~~~d~~~~~~~~~~l~~ 640 (803) ++|.+.+.++..+ ..+++++|.+++..+..+++++.++.++++...++..+........... T Consensus 559 ~~l~~~~~~~~~~------~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------ 620 (725) T PRK05644 559 EYLIELALEGATL------HAIAGEALEKLVKEYRAVRKVIDRLSRRYPRALLEALATNAERELFELR------------ 620 (725) T ss_pred HHHHHHCCCCCEE------ECCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCH------------ T ss_conf 9999732056255------2046077899999999999999876430446788887521101223200------------ Q ss_pred HHHHHHHHCCCCEECCCCCCCEEEEEEECCCCEEEEECHHHHCCCCHHHHHHHHHHHHHHCCCCEEEEECCCEEEECCHH Q ss_conf 87653100001100244567059999951853026646677416427899998888775114550276558258739989 Q gi|254780814|r 641 RLNFIAEKSEKSWHGSINNSDILVKRIARGVQEQFVLEGSSIFSFGSKYKRIFSNRIEENYLTILSLICNGDKIKITGPC 720 (803) Q Consensus 641 ~l~~~~~~~~~~~~~~~~~~~i~~~~~~~Gv~~~~~l~~~~~~s~~~~~~~~~~~~l~~~~~~~~~l~~~~~~~~i~s~~ 720 (803) ..+..+ .........++.++. T Consensus 621 -----------------------~~~~~~------------------------------------~~~~~~~~~~~~~~~ 641 (725) T PRK05644 621 -----------------------VERRTH------------------------------------VYVLDHDEQPVSSFE 641 (725) T ss_pred -----------------------HHHHCC------------------------------------CCCCCCCCCCCCCHH T ss_conf -----------------------121101------------------------------------122455531247889 Q ss_pred HHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHCCCCHHHCEEEEEEHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHC- Q ss_conf 9999999863067628603777788955722303896134021236798898887865641899876289988524412- Q gi|254780814|r 721 SLLDTIFSIGRKGISMQRYKGLGEMNASQLWETTLNPDTRSLLRVKITDASQADDLFSRLMGDEVEPRREFIQENSLPS- 799 (803) Q Consensus 721 el~~~i~~~gkKG~sIQRYKGLGEMNPeQLWETTMDP~tRtLLqV~IeDa~~Ad~iF~~LMGD~VEPRReFIe~NAL~v- 799 (803) ++++.+.+.+++|++||||||||||||+|||||||||++|+|+||+|+||.+||++|++||||+|||||+|||+||++| T Consensus 642 ~~~~~~~~~~~k~~~IqRyKGLGEMnp~qLweTTMdP~~R~L~~V~i~da~~ad~~f~~LMG~~v~~Rr~fI~~na~~v~ 721 (725) T PRK05644 642 EALDWLLAEGRKGLTIQRYKGLGEMNPEQLWETTMDPEVRRLLQVTIEDAIAADEIFTTLMGDEVEPRREFIEENALDVR 721 (725) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 99999997305788022276067669789986365830135999870239999999998659995688999999851303 Q ss_pred CCCC Q ss_conf 3669 Q gi|254780814|r 800 ALDI 803 (803) Q Consensus 800 NLDI 803 (803) |||| T Consensus 722 ~lDi 725 (725) T PRK05644 722 NLDI 725 (725) T ss_pred HCCC T ss_conf 3489 No 3 >COG0187 GyrB Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=1967.41 Aligned_cols=631 Identities=61% Similarity=1.009 Sum_probs=611.1 Q ss_pred CCCCCHHHCEECCCCCHHCCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCC Q ss_conf 23678655123135420004587154327996325445656755578868607797579998389839999768445364 Q gi|254780814|r 7 NSSYDADSIQILKGLDAVKKRPSMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDGSCTVSDNGRGIPTD 86 (803) Q Consensus 7 ~~~Y~~~~i~~L~gle~vrkRP~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgsisV~dnGrGIPv~ 86 (803) ...|+|++|++|+||||||||||||||||++.+|||||||||||||||||+||||+.|+|+||+||||||.||||||||| T Consensus 3 ~~~Y~a~~I~vL~GLEaVRkRPGMYIGst~~~~GLhHlv~EVvDNsiDEalaG~~~~I~V~l~~d~sisV~DnGRGIPvd 82 (635) T COG0187 3 TNNYDASSIQVLEGLEAVRKRPGMYIGSTGDGRGLHHLVWEVVDNSIDEALAGYADRIDVTLHEDGSISVEDNGRGIPVD 82 (635) T ss_pred CCCCCHHHCEECCCCHHHHCCCCCEECCCCCCCCCEEEEEEEEECCHHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCC T ss_conf 57677767333027477645998553368999965046867631307478638685789999479808998789877654 Q ss_pred CCCCCCCCEEEEEEEEECCCCCCCCCCCEEECCCCCCEEEEEEECCCEEEEEEEECCEEEEEEEECCEECCCCEEECC-C Q ss_conf 077788500125642501132328986048311675203778632644999999789599999889836463138732-7 Q gi|254780814|r 87 LHKEEGISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGVSVVNALSSWLKLRIKREGNIYEMSFINGILDNPLVVTGS-A 165 (803) Q Consensus 87 ~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~g~~~~~l~~~~~-~ 165 (803) +||++++|++|+|||+||||||||+++|++||||||||+|||||||+||+|+|+|||+.|+|.|++|++++++++++. . T Consensus 83 iH~~~~~s~vEvI~T~LHAGGKFd~~~YkvSGGLHGVG~SVVNALS~~l~v~v~r~gk~y~q~f~~G~~~~~l~~ig~~~ 162 (635) T COG0187 83 IHPKEKVSAVEVIFTVLHAGGKFDNDSYKVSGGLHGVGVSVVNALSTWLEVEVKRDGKIYRQRFERGVPVTPLEVIGSTD 162 (635) T ss_pred CCCCCCCCCEEEEEEEECCCCCCCCCCCEEECCCCCCCEEEEECCCCEEEEEEEECCEEEEEEEECCCCCCCCEECCCCC T ss_conf 27778987117888761257666999527606777504167761654699999989879999871898577740125677 Q ss_pred CCCCCEEEEEEECHHHHHHHHCCHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHCCCCCCCCC Q ss_conf 78887089998778885421000899999999731146416999996358642101001268835776530035655688 Q gi|254780814|r 166 GNDTGTEVTFLPSSDIFSVQDFNYDTLQHRLRELSFLNSSVQISLIDKRSPEPREITMFYKGGIEAFVSYLDRHKKPLMQ 245 (803) Q Consensus 166 ~~~~GT~I~F~PD~~iF~~~~~~~d~l~~R~~~~A~Ln~gl~i~l~Der~~~~~~~~f~~~~Gi~~fv~~l~~~~~~~~~ 245 (803) .+++||+|+||||++||++++|++++|++|+|++||||+||+|.|+|+|...... .|+|+|||++|++|+|.+++++++ T Consensus 163 ~~~~GT~V~F~PD~~iF~~~~f~~~~l~~RlrelA~L~~gl~I~l~d~r~~~~~~-~~~ye~Gl~~yv~~l~~~k~~l~~ 241 (635) T COG0187 163 TKKTGTKVRFKPDPEIFGETEFDYEILKRRLRELAFLNKGVKITLTDERTGEEKK-EFHYEGGLKDYVEYLNKGKTPLHE 241 (635) T ss_pred CCCCCCEEEEECCHHHCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCC-EEECCCHHHHHHHHHHCCCCCCCC T ss_conf 7899528999748686277666799999999998606799779997246775543-243460799999997258864565 Q ss_pred CCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCC Q ss_conf 52036420036999733453177541212222462166677210356888999999999862420001235761343057 Q gi|254780814|r 246 SPIRIQGSRDNISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSVIGDDCREG 325 (803) Q Consensus 246 ~~i~~~~~~~~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~~~~Direg 325 (803) .|+++.++..++.||||+||+++|+++++||||+|||++||||++|||+||||+||+|+++++++|+.+ ++++||||| T Consensus 242 ~~~~~~~~~~~~~vEvA~q~~d~~~e~~~SFvNnI~T~eGGTH~~Gfr~altr~in~y~~~~~~~k~~~--l~g~Direg 319 (635) T COG0187 242 EIFYFNGEKDGIAVEVALQWNDGYSENILSFVNNIPTREGGTHEAGFRSALTRAINEYAKKKNLLKEGD--LTGDDIREG 319 (635) T ss_pred CCEECCCCCCCEEEEEEEEEECCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCC--CCHHHHHHC T ss_conf 756513676663799999980688447998645855899860899999999999999999838665357--777787503 Q ss_pred CEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 21566652137633572100202611110321110268999987400357999877899988775545532110011001 Q gi|254780814|r 326 FTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLDASMVRDAARRARDLTRRKGVL 405 (803) Q Consensus 326 l~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~~ar~~a~kar~~~rrK~~~ 405 (803) |+||||||||||||+||||+||+|++++++|++++++.|..||++||..|+.|++|++.||+||++|||||+++|||+++ T Consensus 320 l~aviSvki~~PqFegQTK~KL~n~e~~~~V~~~v~~~~~~~l~enp~~a~~i~~k~i~aa~aR~aarkare~~R~k~~~ 399 (635) T COG0187 320 LTAVISVKIPDPQFEGQTKEKLGNSEVRSIVEKLVSEAFSLFLEENPQEAKKIVEKAIAAAKAREAARKARELTRRKSAL 399 (635) T ss_pred CEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 38999997799885754103342688899999999999999999692999999999999999999989889988651134 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCC Q ss_conf 24676676666555555521799982557543223346544302330056203344305876632366788887625787 Q gi|254780814|r 406 DIASLPGKLADCSERDPKKSELFLVEGDSAGGSAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTGI 485 (803) Q Consensus 406 ~~~~lpgKL~Dc~~k~~~~~eL~ivEGDSAggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~ 485 (803) +.+.|||||+||+++||++|||||||||||||||||||||.|||||||||||||||+|+.+||++|+||++|++|||||+ T Consensus 400 ~~~~LpGKLadC~skd~~~~ELfiVEGDSAGGSAKqgRdR~~QAILPLRGKiLNVeka~~~kil~N~EI~~ii~AlG~g~ 479 (635) T COG0187 400 DIPGLPGKLADCTSKDPEKSELFLVEGDSAGGSAKQGRDREFQAILPLRGKILNVEKARLDKILKNEEIQTIITALGTGI 479 (635) T ss_pred CCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHCCCCCCEEEEECCCCCHHHHHCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 55789975756556882415599980477633244216866557720467311232235765543388999999958788 Q ss_pred CCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEEEECCCEEEEECHHHHHHHHHH Q ss_conf 75332123567373899850888853289999999996307787489599943833788726717998145789899985 Q gi|254780814|r 486 GQDSFDINKLRYHKIIIMTDADVDGAHIRTLLLTFFFRQMPSLIHNGFLYIIKPPLYGITRGKSLQYVKDEESLEDYLIN 565 (803) Q Consensus 486 g~~~~~~~~lrY~kiiimtDaD~DG~HI~~Llltff~~~~~~li~~g~~yia~ppl~~~~~~K~E~YikDD~EL~~~Ll~ 565 (803) |++ ||+++||||||||||||||||+|||||||||||||||+||++|||||||||||||++||++.|++||.|+...+-+ T Consensus 480 ~~~-fd~~~LRY~kIiIMTDADvDGaHIrtLLlTfFyr~m~~LIe~G~vyiA~PPLYkv~~~k~~~Y~~~d~E~~~~~~~ 558 (635) T COG0187 480 GKD-FDLEKLRYHKIIIMTDADVDGAHIRTLLLTFFYRYMPPLIENGHVYIAQPPLYKVKKGKKTFYAYDDEELEKLLER 558 (635) T ss_pred CCC-CCHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEECCCCEEEECCHHHHHHHHHH T ss_conf 987-7855576472799965887728899999999999749999769489974855899758742672799999999986 Q ss_pred HCCCCHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00110000145776543546899999999999999986434457999998874211351123577899999999887653 Q gi|254780814|r 566 QSLSEEIELFCDSKGVISKNDLRNFIDDALKIDKLIKDFYPNHDKTFIEQAVISGVFNCESDQEINVSLASKLANRLNFI 645 (803) Q Consensus 566 ~gl~~~~l~~~~~~~~l~ge~L~~li~~~~~~~~~l~~l~~~~~~~lle~l~~~~~l~~~~d~~~~~~~~~~l~~~l~~~ 645 (803) .+- T Consensus 559 ~~~----------------------------------------------------------------------------- 561 (635) T COG0187 559 LGK----------------------------------------------------------------------------- 561 (635) T ss_pred HCC----------------------------------------------------------------------------- T ss_conf 342----------------------------------------------------------------------------- Q ss_pred HHHCCCCEECCCCCCCEEEEEEECCCCEEEEECHHHHCCCCHHHHHHHHHHHHHHCCCCEEEEECCCEEEECCHHHHHHH Q ss_conf 10000110024456705999995185302664667741642789999888877511455027655825873998999999 Q gi|254780814|r 646 AEKSEKSWHGSINNSDILVKRIARGVQEQFVLEGSSIFSFGSKYKRIFSNRIEENYLTILSLICNGDKIKITGPCSLLDT 725 (803) Q Consensus 646 ~~~~~~~~~~~~~~~~i~~~~~~~Gv~~~~~l~~~~~~s~~~~~~~~~~~~l~~~~~~~~~l~~~~~~~~i~s~~el~~~ 725 (803) T Consensus 562 -------------------------------------------------------------------------------- 561 (635) T COG0187 562 -------------------------------------------------------------------------------- 561 (635) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHHHCCCEEEECCCCCCCCHHHHHHCCCCHHHCEEEEEEHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHC-CCCC Q ss_conf 99863067628603777788955722303896134021236798898887865641899876289988524412-3669 Q gi|254780814|r 726 IFSIGRKGISMQRYKGLGEMNASQLWETTLNPDTRSLLRVKITDASQADDLFSRLMGDEVEPRREFIQENSLPS-ALDI 803 (803) Q Consensus 726 i~~~gkKG~sIQRYKGLGEMNPeQLWETTMDP~tRtLLqV~IeDa~~Ad~iF~~LMGD~VEPRReFIe~NAL~v-NLDI 803 (803) + +|++||||||||||||+|||||||||++|+|+||+|+||++||++|++||||+|||||+|||+||+.| |||| T Consensus 562 ----~-~~~~IqRyKGLGEMnp~QLwETTmdP~~R~L~~V~i~da~~ad~~f~~LMGd~ve~Rr~fIe~na~~~~~~di 635 (635) T COG0187 562 ----K-KGYEIQRYKGLGEMNPDQLWETTMDPETRRLLQVTIEDADEADEIFSTLMGDKVEPRRNFIEENALFVENLDI 635 (635) T ss_pred ----C-CCCEEEEECCCCCCCHHHHHHHCCCCCCEEEEEEECCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf ----6-8840576016577898898775359041258988622289999999987077856889999987533332369 No 4 >PTZ00109 DNA gyrase subunit b; Provisional Probab=100.00 E-value=0 Score=1959.22 Aligned_cols=729 Identities=38% Similarity=0.599 Sum_probs=610.6 Q ss_pred CCCCCHHHCEECCCCCHHCCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCC Q ss_conf 23678655123135420004587154327996325445656755578868607797579998389839999768445364 Q gi|254780814|r 7 NSSYDADSIQILKGLDAVKKRPSMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDGSCTVSDNGRGIPTD 86 (803) Q Consensus 7 ~~~Y~~~~i~~L~gle~vrkRP~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgsisV~dnGrGIPv~ 86 (803) -++|+|++|+||+||||||||||||||+|| .+|||||||||||||||||+||||+.|+|+||+||||||.||||||||| T Consensus 53 ~n~Y~A~dI~VLeGLEaVRKRPGMYIG~Td-~~GlHHlv~EIvDNSvDEalaG~c~~I~V~l~~DgSvtV~DnGRGIPvD 131 (941) T PTZ00109 53 CNDYDAKDIVILEGLEAVRKRPGMYIGNTD-VKGLHQILFEIIDNSVDEYNNFECNEIKIVIHKDDSVTIEDNGRGIPCD 131 (941) T ss_pred CCCCCCCCCEEECCCHHHCCCCCCEECCCC-CCCCCCEEEEEHHCHHHHHHCCCCCEEEEEEECCCCEEEEECCCCCCCC T ss_conf 788771012361285321369974456899-9863212210012568798618996689999479978998899767888 Q ss_pred CCCCCCCCEEEEEEEEECCCCCCCCC------------------------------------------------------ Q ss_conf 07778850012564250113232898------------------------------------------------------ Q gi|254780814|r 87 LHKEEGISAAEVIMTRLHAGGKFNQK------------------------------------------------------ 112 (803) Q Consensus 87 ~h~~~~~~~~E~i~t~L~ag~kfd~~------------------------------------------------------ 112 (803) +||++|+|++|+|||+||||||||++ T Consensus 132 ~hpk~~~salEvVlT~LHAGGKF~dd~~~~~~d~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~g~~~~~~ 211 (941) T PTZ00109 132 VHEKTKKSALETVLTVLHSGAKFFDDEADAEIDGGMNNGGLSDGGEEDNRDAIGEHKGEHKGEKKAKANVAERGAEKLDE 211 (941) T ss_pred CCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 78888985028888541557565776443212233344444444332222222223322122322211122334433222 Q ss_pred ------------------CCEEECCCCCCEEEEEEECCCEEEEEEEECCEEEEEEEECCEECCCCEEECCCCCCCCEEEE Q ss_conf ------------------60483116752037786326449999997895999998898364631387327788870899 Q gi|254780814|r 113 ------------------SYKVSGGLHGVGVSVVNALSSWLKLRIKREGNIYEMSFINGILDNPLVVTGSAGNDTGTEVT 174 (803) Q Consensus 113 ------------------~yk~sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~ 174 (803) .|++||||||||+|||||||+||+|+|+|||+.|.|+|++|.+++||.+.+++.+++||+|+ T Consensus 212 ~~~~~~~~~~a~~k~~~~~Yk~SGGLHGVGvSVVNALS~~l~v~V~Rdgk~y~~~f~~G~~~~~L~v~~~~~~k~GT~I~ 291 (941) T PTZ00109 212 KKSPQKTPKEAPQKEPAQKYKFSSGLHGVGLSVANALSSFMKVKVFRNGKIHSIELEKGKVTKELSIRDCPINKRGTQIH 291 (941) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCEEEEEEEECCEEEEEEECCCEECCCCEECCCCCCCCCCEEE T ss_conf 33333343333345568762246667866668670365758999999999999996899088853540788899975799 Q ss_pred EEECHHHHH-HHHCCHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCE--------------------------------- Q ss_conf 987788854-2100089999999973114641699999635864210--------------------------------- Q gi|254780814|r 175 FLPSSDIFS-VQDFNYDTLQHRLRELSFLNSSVQISLIDKRSPEPRE--------------------------------- 220 (803) Q Consensus 175 F~PD~~iF~-~~~~~~d~l~~R~~~~A~Ln~gl~i~l~Der~~~~~~--------------------------------- 220 (803) ||||++||+ .+.|++++|..|++++|||||||+|.|.|+|...... T Consensus 292 F~PD~~IF~~~~~f~~~~l~~RlrelAfLn~gL~I~~~Der~~~~~~~~~~~~~k~~~~~~~~~~~~e~~~~~~~~fy~~ 371 (941) T PTZ00109 292 YKPDPSIFKNSIKHNADLIKNRIHQLAYLNDKLSFYFYDERAAKKAAAEAEKAAKAADIADLADHAGEIADHNNLDFYNY 371 (941) T ss_pred EECCHHHHCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 97786885887656899999999997026788779999706765454100245443221000012334333344443433 Q ss_pred EEECCCCCCHHHHHHHCCCCCCCCC---CCEEEEEEEEEEEEEEEEEECCC-CCCEEEEEECCCCCCCCCCEEHHHHHHH Q ss_conf 1001268835776530035655688---52036420036999733453177-5412122224621666772103568889 Q gi|254780814|r 221 ITMFYKGGIEAFVSYLDRHKKPLMQ---SPIRIQGSRDNISIDLAMRWNDG-YHENVLCFTNNIPQKDGGTHLSGLRSAL 296 (803) Q Consensus 221 ~~f~~~~Gi~~fv~~l~~~~~~~~~---~~i~~~~~~~~~~veval~~~~~-~~e~i~SfvN~I~T~~GGtHv~gf~~al 296 (803) +.|+|+|||.+|+++++.+++++++ .++++.++.+++.||||+||+++ |.|+++||||+|+|++||||++|||+|| T Consensus 372 e~~~~egGl~efv~~l~~~k~~l~~d~~~~i~~~~~~~~i~ve~al~w~~~~y~e~i~SFvNnI~T~dGGTH~~GFrsAL 451 (941) T PTZ00109 372 EIIKHEGGLDEYIENITKNKANLFKDNDKIISISCHHKNIHIDLRLKWLQGEYHEHIISFVNNVNTTDGGTHVDALKYAI 451 (941) T ss_pred CCCCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCEEEEEEEEECCCCCCCEEEEEECCCCCCCCCCHHHHHHHHH T ss_conf 11126886999998870588745478884279998506757999999657887505887743666899973799999999 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCHHHHHCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 99999999862420001235761343057215666521376335721002026111103211102689999874003579 Q gi|254780814|r 297 TRQITSYVEKSFGSKKDKRSVIGDDCREGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAK 376 (803) Q Consensus 297 ~~~i~~y~~~~~~~Kk~k~~~~~~Diregl~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak 376 (803) ||+||.|++++.++|+ +.+++++||||||+|||||+++||||+||||+||+|++|+++|+++|.+.|..||++||..|+ T Consensus 452 traIn~y~~k~~l~K~-~~~i~geDIrEGLtaVISVkv~dPqFeGQTK~KLgn~ev~~~Ve~iv~e~L~~~leenp~~ak 530 (941) T PTZ00109 452 SRCVNFNIKKNEAIKN-FVNIPGEYIREGLTAILSIKMNNPEFEGQTKTKLGSHFLKPILESAIFEKLSEIFDFEPNLLN 530 (941) T ss_pred HHHHHHHHHHCCCCCC-CCCCCHHHHHHCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHH T ss_conf 9999999997163555-566888899733789999976997718740144368889999999999999999997979999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEECCC Q ss_conf 998778999887755455321100110-0124676676666555555521799982557543223346544302330056 Q gi|254780814|r 377 IIVKKVLDASMVRDAARRARDLTRRKG-VLDIASLPGKLADCSERDPKKSELFLVEGDSAGGSAKQGRSRENQAILPLRG 455 (803) Q Consensus 377 ~i~~k~~~a~~ar~~a~kar~~~rrK~-~~~~~~lpgKL~Dc~~k~~~~~eL~ivEGDSAggsak~gRdr~~qai~PLrG 455 (803) .|+++++.|++||++|||||+++|||+ .+....|||||+||+++||++|||||||||||||||||||||+||||||||| T Consensus 531 ~Ii~ka~~Aa~AreaAkkare~~rrK~~~~~~~~LPGKLaDCsskdpe~sELFIVEGDSAGGSAKQgRDR~fQAILPLRG 610 (941) T PTZ00109 531 SIYFKALQAKASDEEAKAARDLIRSKNNQCSATILPGKLVDCISDDINRNEIFIVEGDSAAGSAKQARNREIQAILPLKG 610 (941) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHCEEEEEECCCCCCCHHCCCCCCCEEEEECCC T ss_conf 99999999999999999988988631034556789975546667980035489995477777410145756567631277 Q ss_pred CCCCHHHCCHHH-HHCCHHHHHHHHHHCCCCCCCCCCCCC---------------------------------------- Q ss_conf 203344305876-632366788887625787753321235---------------------------------------- Q gi|254780814|r 456 KILNVERARFDK-MLSSQEIGTLITALGTGIGQDSFDINK---------------------------------------- 494 (803) Q Consensus 456 KilNv~~a~~~~-i~~n~Ei~~l~~~lG~~~g~~~~~~~~---------------------------------------- 494 (803) ||||||+|+.++ |++|+||++||+|||||+|.+.|++++ T Consensus 611 KILNVEKa~~~kril~N~EI~~LItAlG~Gig~d~~~~~k~~~~~~~~~~~~~~~~~Dl~~~~~~~~~~~~~~~~~~~~~ 690 (941) T PTZ00109 611 KILNVEKIKNNKRIFENSELKSLITAIGLNVNCDNKKIAKGKILINNKKGKDLKKKSDLKNGKFENAQNNNAILNKKKDI 690 (941) T ss_pred CCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 30104442003444404999999998586867553322112333333222332211111122211111121123432233 Q ss_pred -----CCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEEEECCCEEEEECHHHHHHHHHHHCCC Q ss_conf -----673738998508888532899999999963077874895999438337887267179981457898999850011 Q gi|254780814|r 495 -----LRYHKIIIMTDADVDGAHIRTLLLTFFFRQMPSLIHNGFLYIIKPPLYGITRGKSLQYVKDEESLEDYLINQSLS 569 (803) Q Consensus 495 -----lrY~kiiimtDaD~DG~HI~~Llltff~~~~~~li~~g~~yia~ppl~~~~~~K~E~YikDD~EL~~~Ll~~gl~ 569 (803) ||||||||||||||||+|||||||||||||||+||++|||||||||||||++||.+.|..+|..|.+|++...-. T Consensus 691 ~~~~~LRY~KIIIMTDADVDGaHIrTLLLTFFyR~m~~LIe~G~VYIAqPPLYkI~~~k~~~~~~~d~~~~~~~~~~~~~ 770 (941) T PTZ00109 691 LFDNPLRYGKIIIMTDADVDGEHIRILLLTFLYRFQKEIIENGNVFVACPPLYKITYNKFFDASIKDIVMKQFNVSTKNS 770 (941) T ss_pred CCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEEECCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 43322540528999568876789999999999987499997486999569715863573266555478898776653146 Q ss_pred CHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 00001457765435468999999999999999864344579999988742113511235778999999998876531000 Q gi|254780814|r 570 EEIELFCDSKGVISKNDLRNFIDDALKIDKLIKDFYPNHDKTFIEQAVISGVFNCESDQEINVSLASKLANRLNFIAEKS 649 (803) Q Consensus 570 ~~~l~~~~~~~~l~ge~L~~li~~~~~~~~~l~~l~~~~~~~lle~l~~~~~l~~~~d~~~~~~~~~~l~~~l~~~~~~~ 649 (803) .... ...+.++|..++..+.+-..... .. ....+........-....++. ..... .+.. T Consensus 771 ~~~~------~~~~d~el~~~l~~l~~~~~~~~--~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~--------- 829 (941) T PTZ00109 771 KFII------HTYSDDELNNLLGLLDKDKIAHA--EH--NMANLGRGKPTRGGNLPSDDR-GSGLA-GIAD--------- 829 (941) T ss_pred CEEE------EECCHHHHHHHHHHHHHHHHHHH--HH--HHHHHHCCCCCCCCCCCCHHH-CCCHH-HHHH--------- T ss_conf 4267------60799999999987656556678--87--776640133211233320000-11011-1100--------- Q ss_pred CCCEECCCCCCCEEEEEEECCCCEEEEECHHHHCCCCHHHHHHHHHHHHHHCCCCEEEEECCCEEEECCHHHHHHHHHHH Q ss_conf 01100244567059999951853026646677416427899998888775114550276558258739989999999986 Q gi|254780814|r 650 EKSWHGSINNSDILVKRIARGVQEQFVLEGSSIFSFGSKYKRIFSNRIEENYLTILSLICNGDKIKITGPCSLLDTIFSI 729 (803) Q Consensus 650 ~~~~~~~~~~~~i~~~~~~~Gv~~~~~l~~~~~~s~~~~~~~~~~~~l~~~~~~~~~l~~~~~~~~i~s~~el~~~i~~~ 729 (803) .. ..............+ ......+...+...+. +.+..+-. T Consensus 830 -----------------~~--------------~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~-----~~~~~~~~ 870 (941) T PTZ00109 830 -----------------KN--------------FGSNGADESEKEEEL---KTDDIFVAEEENADPF-----AADNAFFN 870 (941) T ss_pred -----------------HC--------------CCCCCCHHHHHHHHH---CCCCHHHHCCCCCCCC-----HHHHHHHH T ss_conf -----------------00--------------122320234333220---2320232112334420-----13456765 Q ss_pred HHCCCEEEECCCCCCCCHHHHHHCCCCHHHCEEEEEEHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 30676286037777889557223038961340212367988988878656418998762899885244 Q gi|254780814|r 730 GRKGISMQRYKGLGEMNASQLWETTLNPDTRSLLRVKITDASQADDLFSRLMGDEVEPRREFIQENSL 797 (803) Q Consensus 730 gkKG~sIQRYKGLGEMNPeQLWETTMDP~tRtLLqV~IeDa~~Ad~iF~~LMGD~VEPRReFIe~NAL 797 (803) ..|+++||||||||||||+|||||||||++|+|+||+|+||++||+||++||||+|||||+||++||- T Consensus 871 ~~kk~~IQRyKGLGEMnpeQLWETTMdP~~R~LlrV~i~Da~~A~~if~~LMGd~VepRr~FI~eNA~ 938 (941) T PTZ00109 871 FSKKYEIQRFKGLGEMMADQLWNTTMDPRVRKLIRVTVNDAIRANDLIFSLMGEDSKLRKNFILENSN 938 (941) T ss_pred HCCCCEEECCCCCCCCCHHHHHHHCCCCCCCEEEEEECHHHHHHHHHHHHHCCCCCHHHHHHHHHCCC T ss_conf 25774766286246679789866277811444999742179999989997479994287999997278 No 5 >TIGR01058 parE_Gpos DNA topoisomerase IV, B subunit; InterPro: IPR005740 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . This entry represents subunit B (parE) of topoisomerase IV from Gram positive bacteria. Topoisomerase IV primarily decatenates DNA and relaxes positive supercoils, which is important in bacteria, where the circular chromosome becomes catenated, or linked, during replication . Topoisomerase IV consists of two polypeptide subunits, parC (subunit A), which is homologous to gyrA of topoisomerase II, and parE (subunit B), which is homologous to gyrB of topoisomerase II. More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0005694 chromosome. Probab=100.00 E-value=0 Score=1850.09 Aligned_cols=627 Identities=49% Similarity=0.841 Sum_probs=599.7 Q ss_pred CCCCC-CCCCCCHHHCEECCCCCHHCCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEEC Q ss_conf 98764-22367865512313542000458715432799632544565675557886860779757999838983999976 Q gi|254780814|r 1 MSDNI-QNSSYDADSIQILKGLDAVKKRPSMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDGSCTVSDN 79 (803) Q Consensus 1 m~~~~-~~~~Y~~~~i~~L~gle~vrkRP~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgsisV~dn 79 (803) |+.+. .-.+|+++.||+|+||||||||||||||||| .+||||||||||||||||+|+|||++|.|+||.||||||.|+ T Consensus 1 M~~k~~~~~~Y~~~aI~iLeGL~AVrKRPGMYIGSTd-s~GLHHl~wEIvdNsvDEvl~g~a~~I~V~l~~dnsi~V~D~ 79 (655) T TIGR01058 1 MAKKEINIFDYNEDAIKILEGLDAVRKRPGMYIGSTD-SKGLHHLVWEIVDNSVDEVLAGYADNITVTLHKDNSISVQDD 79 (655) T ss_pred CCCCCCCCCCCCCHHHHHHHCCHHHCCCCCCEECCCC-CCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCEEEEEEC T ss_conf 9745552003550246775200000158984122568-785055678788446888982269747999827870788757 Q ss_pred CCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEECCCCCCEEEEEEECCCEEEEEEEECCEEEEEEEECC-EECCC Q ss_conf 84453640777885001256425011323289860483116752037786326449999997895999998898-36463 Q gi|254780814|r 80 GRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGVSVVNALSSWLKLRIKREGNIYEMSFING-ILDNP 158 (803) Q Consensus 80 GrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~g-~~~~~ 158 (803) |||||++||...++|++|+|||.||||||||+.+||+||||||||||||||||+||+|+|.|||++|+|.|.+| .+.+. T Consensus 80 GRG~P~~~h~~~~isT~e~vlT~LHAGGKFd~~~YKtaGGLHGVGaSVVNALS~wL~v~v~Rdg~iYq~~f~nGGk~~~~ 159 (655) T TIGR01058 80 GRGIPTGIHKQGKISTVETVLTVLHAGGKFDQGGYKTAGGLHGVGASVVNALSSWLEVEVKRDGKIYQQRFENGGKIVQS 159 (655) T ss_pred CCCCCCCHHCCCCCCCCCEEEEEEECCCCCCCCCEEECCCCCCCHHHHHHHCCCCEEEEEEECCHHHHHHHHCCCCEEEE T ss_conf 85376012216898851004544307730378650105732230367764022633899987873445787447913000 Q ss_pred CEEECCC-CCCCCEEEEEEECHHHHHHHHCCHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHC Q ss_conf 1387327-788870899987788854210008999999997311464169999963586421010012688357765300 Q gi|254780814|r 159 LVVTGSA-GNDTGTEVTFLPSSDIFSVQDFNYDTLQHRLRELSFLNSSVQISLIDKRSPEPREITMFYKGGIEAFVSYLD 237 (803) Q Consensus 159 l~~~~~~-~~~~GT~I~F~PD~~iF~~~~~~~d~l~~R~~~~A~Ln~gl~i~l~Der~~~~~~~~f~~~~Gi~~fv~~l~ 237 (803) +++||.| ++++||.|+|.|||.+|..+.|++++|..||+|.|||..+|+|.|.|+|+. +...|+|+.||.+||+|+| T Consensus 160 lkkIG~tpk~ktGT~v~F~pDp~~F~~~~F~~n~i~eRL~Es~FLlk~lkl~l~D~~~~--~~~~F~~~~Gl~~F~~~~N 237 (655) T TIGR01058 160 LKKIGTTPKKKTGTLVHFLPDPTIFKTTQFNSNIIKERLKESAFLLKKLKLTLTDKRTD--KKTVFFYENGLVDFVKYIN 237 (655) T ss_pred EEECCCCCCCCCCCEEEECCCCHHHCCCCCCHHHHHHHHHHHHHHHHCCEEEEEEECCC--CEEEEEECCCHHHHHHHHC T ss_conf 10204788877773477556713330245672678767668888771473899970267--2177730044789999853 Q ss_pred CCCCCCCCCCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 35655688520364200369997334531775412122224621666772103568889999999998624200012357 Q gi|254780814|r 238 RHKKPLMQSPIRIQGSRDNISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSV 317 (803) Q Consensus 238 ~~~~~~~~~~i~~~~~~~~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~ 317 (803) +.+..+ .++++|+++.+++.|||||||||+++|.++||||+|.|.+||||++||+.|||+++|+|+++.+++|.++.++ T Consensus 238 e~k~~l-~~v~~f~ge~~~i~Ve~afQfnd~~~e~ilSF~N~vkT~eGG~HE~g~k~A~t~v~N~YaRK~~LLKEKDKNl 316 (655) T TIGR01058 238 EEKETL-SDVIYFEGEKNGIEVEVAFQFNDGDSENILSFANSVKTKEGGTHENGFKLAITKVINEYARKVNLLKEKDKNL 316 (655) T ss_pred CCCEEC-CCEEEEEECCCCEEEEEEEEECCCCCCEEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 550001-2345887647625899999742788753798640034068872078899999999865543200111001478 Q ss_pred CHHHHHCCCEEEEEEECCCC--CCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 61343057215666521376--3357210020261111032111026899998740035799987789998877554553 Q gi|254780814|r 318 IGDDCREGFTSVLSIKMPDP--RFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLDASMVRDAARRA 395 (803) Q Consensus 318 ~~~Diregl~~visvki~nP--~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~~ar~~a~ka 395 (803) .++||||||.|||+|+||+- +||||||.||+|++|+.+|..+|.++|.-||++|...|..|++|+|.|..||+||||| T Consensus 317 eg~diReGL~avi~v~~PE~~~~fEGQTK~Klf~~ea~~~V~~~v~~~~~~fLeeN~~~A~~lv~kai~Ar~akeaak~a 396 (655) T TIGR01058 317 EGSDIREGLSAVISVRIPEELIQFEGQTKSKLFSPEAREVVDEIVEDKLSFFLEENKEDAKLLVKKAIKARDAKEAAKKA 396 (655) T ss_pred CCCHHHHHHHHEEEEECCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 71034532021155325754524576057667876689999999988889999743578999999998889999999874 Q ss_pred HHHCCC--CCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCHHHCC-HHHHHCCH Q ss_conf 211001--100124676676666555555521799982557543223346544302330056203344305-87663236 Q gi|254780814|r 396 RDLTRR--KGVLDIASLPGKLADCSERDPKKSELFLVEGDSAGGSAKQGRSRENQAILPLRGKILNVERAR-FDKMLSSQ 472 (803) Q Consensus 396 r~~~rr--K~~~~~~~lpgKL~Dc~~k~~~~~eL~ivEGDSAggsak~gRdr~~qai~PLrGKilNv~~a~-~~~i~~n~ 472 (803) ||..|+ |..-+.+-|.|||++|+||+|.+.||||||||||||||||||||.||||||||||+|||+||+ ...|++|| T Consensus 397 Ree~k~~kk~kk~~g~l~GKLtPaqsk~p~~nELflVEGDSAGGsAK~GRdr~fQAiLPLRGKVlNv~Ka~n~~d~lkNE 476 (655) T TIGR01058 397 REEKKSGKKRKKEKGILSGKLTPAQSKNPAKNELFLVEGDSAGGSAKLGRDRKFQAILPLRGKVLNVEKAKNLADILKNE 476 (655) T ss_pred HHHHHCCCCCCHHCCCCCCCCCCCCCCCCHHHCEEEEECCCCCCCCCCCCCCCEEEECCCCCCEEHHHHHHHHHHHHCCC T ss_conf 56651356420111110111377677561110525785136887533367853034226675010377887688874030 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEEE--ECCC- Q ss_conf 6788887625787753321235673738998508888532899999999963077874895999438337887--2671- Q gi|254780814|r 473 EIGTLITALGTGIGQDSFDINKLRYHKIIIMTDADVDGAHIRTLLLTFFFRQMPSLIHNGFLYIIKPPLYGIT--RGKS- 549 (803) Q Consensus 473 Ei~~l~~~lG~~~g~~~~~~~~lrY~kiiimtDaD~DG~HI~~Llltff~~~~~~li~~g~~yia~ppl~~~~--~~K~- 549 (803) ||.+||.+||+|+|.+ |++++|+|+|||||||||.|||||+.||||||||||+||||.||||||.|||||+. .+|+ T Consensus 477 EI~t~i~~iG~Giga~-F~i~d~~Y~KIIIMTDADtDGaHIQ~LLLTfFYRyM~PLiE~G~vYiALPPLYKl~~k~~KkE 555 (655) T TIGR01058 477 EINTIIFCIGTGIGAD-FSIKDLKYDKIIIMTDADTDGAHIQVLLLTFFYRYMRPLIELGHVYIALPPLYKLSKKDGKKE 555 (655) T ss_pred HHHHHHHHHCCCCCCC-CCHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHCCCEEEECCCCCCCCCCCCCEE T ss_conf 4889999816651256-436541678679963589883189999999887614522046833783687200227998547 Q ss_pred -EEEEECHHHHHHHHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf -7998145789899985001100001457765435468999999999999999864344579999988742113511235 Q gi|254780814|r 550 -LQYVKDEESLEDYLINQSLSEEIELFCDSKGVISKNDLRNFIDDALKIDKLIKDFYPNHDKTFIEQAVISGVFNCESDQ 628 (803) Q Consensus 550 -E~YikDD~EL~~~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~~~~~~~~l~~l~~~~~~~lle~l~~~~~l~~~~d~ 628 (803) ..|+.+|.||++...+.| T Consensus 556 ~~~YaW~D~EL~~~~k~~G------------------------------------------------------------- 574 (655) T TIGR01058 556 IVKYAWSDLELEELKKKLG------------------------------------------------------------- 574 (655) T ss_pred EEEEECCCHHHHHHHHHHC------------------------------------------------------------- T ss_conf 8887127212899999737------------------------------------------------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCCEEEEEEECCCCEEEEECHHHHCCCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 77899999999887653100001100244567059999951853026646677416427899998888775114550276 Q gi|254780814|r 629 EINVSLASKLANRLNFIAEKSEKSWHGSINNSDILVKRIARGVQEQFVLEGSSIFSFGSKYKRIFSNRIEENYLTILSLI 708 (803) Q Consensus 629 ~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~~i~~~~~~~Gv~~~~~l~~~~~~s~~~~~~~~~~~~l~~~~~~~~~l~ 708 (803) T Consensus 575 -------------------------------------------------------------------------------- 574 (655) T TIGR01058 575 -------------------------------------------------------------------------------- 574 (655) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred ECCCEEEECCHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHCCCCHHHCEEEEEEHHHHHHHHHHHHHHCCCCCCCH Q ss_conf 55825873998999999998630676286037777889557223038961340212367988988878656418998762 Q gi|254780814|r 709 CNGDKIKITGPCSLLDTIFSIGRKGISMQRYKGLGEMNASQLWETTLNPDTRSLLRVKITDASQADDLFSRLMGDEVEPR 788 (803) Q Consensus 709 ~~~~~~~i~s~~el~~~i~~~gkKG~sIQRYKGLGEMNPeQLWETTMDP~tRtLLqV~IeDa~~Ad~iF~~LMGD~VEPR 788 (803) ||.+|||||||||||+||||||||||++|+|+||+|+|++.|++-.++||||+|+|| T Consensus 575 -----------------------kn~tlQRYKGLGEMNADQLweTTMnP~~R~Lv~v~i~D~~~aEr~in~lMG~kv~~R 631 (655) T TIGR01058 575 -----------------------KNYTLQRYKGLGEMNADQLWETTMNPETRTLVRVKIDDLARAERQINTLMGDKVELR 631 (655) T ss_pred -----------------------CCCEEEEECCCCCCCHHHHCCCCCCCCCCEEEEEEHHHHHHHCCEEEEEECCCCCHH T ss_conf -----------------------981677543776434023101468898641888726788621330333217873033 Q ss_pred HHHHHHHH Q ss_conf 89988524 Q gi|254780814|r 789 REFIQENS 796 (803) Q Consensus 789 ReFIe~NA 796 (803) |.|||.|- T Consensus 632 k~WIe~Ni 639 (655) T TIGR01058 632 KKWIEANI 639 (655) T ss_pred HHHHHHHC T ss_conf 67465300 No 6 >PRK05559 DNA topoisomerase IV subunit B; Reviewed Probab=100.00 E-value=0 Score=1804.10 Aligned_cols=627 Identities=49% Similarity=0.823 Sum_probs=590.5 Q ss_pred CCCCCCCHHHCEECCCCCHHCCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCC Q ss_conf 42236786551231354200045871543279963254456567555788686077975799983898399997684453 Q gi|254780814|r 5 IQNSSYDADSIQILKGLDAVKKRPSMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDGSCTVSDNGRGIP 84 (803) Q Consensus 5 ~~~~~Y~~~~i~~L~gle~vrkRP~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgsisV~dnGrGIP 84 (803) +.+++|+|++|++|+||||||||||||||||+ .+||||||||||||||||+++|||+.|+|+||+||+|||+||||||| T Consensus 3 ~~~~~Y~~~~i~~L~glE~Vr~RP~mYIGst~-~~Gl~hi~~EIldNavDe~~~g~~~~I~V~i~~dgsIsV~dnGrGIP 81 (633) T PRK05559 3 MMTNNYNADSIEVLEGLEPVRKRPGMYIGSTD-TRGLHHLVQEVIDNSVDEALAGHGKKIEVTLHADGSVSVRDNGRGIP 81 (633) T ss_pred CCCCCCCHHHCEEECCCHHHHCCCCCEECCCC-CCCCEEEEEEEEECHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCC T ss_conf 20057797778660574676449986656899-98562888776551142765078976999997898499998996545 Q ss_pred CCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEECCCCCCEEEEEEECCCEEEEEEEECCEEEEEEEECCEECCCCEEECC Q ss_conf 64077788500125642501132328986048311675203778632644999999789599999889836463138732 Q gi|254780814|r 85 TDLHKEEGISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGVSVVNALSSWLKLRIKREGNIYEMSFINGILDNPLVVTGS 164 (803) Q Consensus 85 v~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~g~~~~~l~~~~~ 164 (803) |++||++|+|+||||||+||||||||+++||+||||||||||||||||++|+|+|+|||+.|+|+|++|++..++++++. T Consensus 82 v~~h~~~~~~~~ElIft~LhaGgkfd~~~ykvSGGlhGvGasvvNalS~~f~Vev~r~gk~y~q~f~~g~~~~~~~~i~~ 161 (633) T PRK05559 82 VGIHPEEGKSGVEVILTKLHAGGKFSNKAYKFSGGLHGVGVSVVNALSERLEVEVKRDGKVYRQRFERGEPVTPLEVVGG 161 (633) T ss_pred CCCCCCCCCCCHHHHEEECCCCCCCCCCCCEEECCCCCCCEEEEEEEECCEEEEEEECCEEEEEEECCCEECCCCCEECC T ss_conf 51756579761566100204578889985155078776212670873055899999899999999549909888513045 Q ss_pred CCC--CCCEEEEEEECHHHHHHHHCCHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHCCCCCC Q ss_conf 778--887089998778885421000899999999731146416999996358642101001268835776530035655 Q gi|254780814|r 165 AGN--DTGTEVTFLPSSDIFSVQDFNYDTLQHRLRELSFLNSSVQISLIDKRSPEPREITMFYKGGIEAFVSYLDRHKKP 242 (803) Q Consensus 165 ~~~--~~GT~I~F~PD~~iF~~~~~~~d~l~~R~~~~A~Ln~gl~i~l~Der~~~~~~~~f~~~~Gi~~fv~~l~~~~~~ 242 (803) +.+ .+||+|+|+||++||++..|++++|.+|++++|||||||+|+|+|+|.. ..+|+|++||++|+++++.++.+ T Consensus 162 ~~~~~~~GT~ItF~PD~~iF~~~~~~~d~l~~rl~~~A~l~~gl~I~lnder~~---~~~F~~e~Gi~dyv~~~~~~~~~ 238 (633) T PRK05559 162 TKGKRKTGTRVRFWPDPKIFDSAKFSPERLKERLRSKAFLCPGLEITLNDERTG---EETFHYENGLKDYLAELNEGKEL 238 (633) T ss_pred CCCCCCCCEEEEEEECHHHHCCCEECHHHHHHHHHHHHHCCCCCEEEEEECCCC---CEEEECCCCHHHHHHHHHCCCCC T ss_conf 678788834999998857837725428999999999863279977999824677---34466476199999986179975 Q ss_pred CCCCCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHH Q ss_conf 68852036420036999733453177541212222462166677210356888999999999862420001235761343 Q gi|254780814|r 243 LMQSPIRIQGSRDNISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSVIGDDC 322 (803) Q Consensus 243 ~~~~~i~~~~~~~~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~~~~Di 322 (803) +++.+++ ++ .+..||||+||+++++++++||||+|+|++|||||+||++||+++||+|+++.++.|+. .+++++|| T Consensus 239 i~~~~~~--~~-~~~~veval~~~~~~~~~~~SFVN~I~T~~GGTHv~g~~~al~k~i~~y~~k~~~~Kkk-~~i~~~dI 314 (633) T PRK05559 239 LPVEPFE--GD-EGEAVEWALAWLEGGSENIESYVNLIPTPDGGTHENGFREGLLRALREFAEKRNLLKRG-VKLTAEDV 314 (633) T ss_pred CCCCEEE--CC-CCCEEEEEEEECCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC-CCCCHHHH T ss_conf 7864287--55-79759999994478884476568870579998767889989999999999982745566-65788898 Q ss_pred HCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 05721566652137633572100202611110321110268999987400357999877899988775545532110011 Q gi|254780814|r 323 REGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLDASMVRDAARRARDLTRRK 402 (803) Q Consensus 323 regl~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~~ar~~a~kar~~~rrK 402 (803) |+||++||||+|+||+|+||||++|+|++++..|+.+++++|..||++||..|+.|+++++.++++|++|+|+|+ ++|| T Consensus 315 regL~~~Vsv~i~nP~FesQTKekL~s~~~~~~v~~~v~~~l~~~l~~n~~~a~~I~~k~~~~a~~r~~a~k~~~-~~rk 393 (633) T PRK05559 315 REGLAAVLSVKIPEPQFEGQTKEKLGSREARRFVSGVVKDAFDLWLNQNPELAEKLAEKAIKAAQARLRARKKKE-VRRK 393 (633) T ss_pred HCCCEEEEEECCCCCCCCCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCC T ss_conf 465326866310498406733200259657767778888999999997989999999999999999999972132-0134 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHC Q ss_conf 00124676676666555555521799982557543223346544302330056203344305876632366788887625 Q gi|254780814|r 403 GVLDIASLPGKLADCSERDPKKSELFLVEGDSAGGSAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALG 482 (803) Q Consensus 403 ~~~~~~~lpgKL~Dc~~k~~~~~eL~ivEGDSAggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG 482 (803) +++...+|||||+||+++++++|||||||||||||||||||||+|||||||||||||||+|+.+||++|+||++|++||| T Consensus 394 ~~~~~~~lpgKL~Dc~~~~~~~~eL~lvEGDSAgGsak~gRDr~~qailPLRGKiLNv~~a~~~ki~~N~Ei~~l~~alG 473 (633) T PRK05559 394 KATKKPALPGKLADCTSQDPERTELFLVEGDSAGGSAKQARDREFQAILPLRGKILNTWEASLDDVLANEEIHDIIVAIG 473 (633) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEECCCCCEEEEECCCEECCHHHCCHHHHHHCHHHHHHHHHHC T ss_conf 53346789866613023896744699983467776335221546265300266222353178888762689999999857 Q ss_pred CCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEEEECC-CEEEEECHHHHHH Q ss_conf 787753321235673738998508888532899999999963077874895999438337887267-1799814578989 Q gi|254780814|r 483 TGIGQDSFDINKLRYHKIIIMTDADVDGAHIRTLLLTFFFRQMPSLIHNGFLYIIKPPLYGITRGK-SLQYVKDEESLED 561 (803) Q Consensus 483 ~~~g~~~~~~~~lrY~kiiimtDaD~DG~HI~~Llltff~~~~~~li~~g~~yia~ppl~~~~~~K-~E~YikDD~EL~~ 561 (803) ||+|++ ||+++||||||||||||||||||||||||||||||||+||++||||||+||||||++|| +..|..+|.|++. T Consensus 474 ~~~g~~-~d~~~LRYgkIiImTDaD~DGsHI~~Llltff~~~~p~Li~~G~vy~~~tPl~kv~~gk~~~~y~~~e~e~~~ 552 (633) T PRK05559 474 IGPGDS-FDLEDLRYGKIIIMTDADVDGAHIATLLLTFFYRHFPPLVEAGHVYIALPPLYRVDVGKGKTFYALDEEEKEE 552 (633) T ss_pred CCCCCC-CCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCCCEEEECCHHHHHH T ss_conf 787876-8701067671799977999768899999999998769988659589973988999938943698588999999 Q ss_pred HHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 99850011000014577654354689999999999999998643445799999887421135112357789999999988 Q gi|254780814|r 562 YLINQSLSEEIELFCDSKGVISKNDLRNFIDDALKIDKLIKDFYPNHDKTFIEQAVISGVFNCESDQEINVSLASKLANR 641 (803) Q Consensus 562 ~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~~~~~~~~l~~l~~~~~~~lle~l~~~~~l~~~~d~~~~~~~~~~l~~~ 641 (803) ++-+.+-. T Consensus 553 ~~~~~~~~------------------------------------------------------------------------ 560 (633) T PRK05559 553 LLKKLGKG------------------------------------------------------------------------ 560 (633) T ss_pred HHHHHHHC------------------------------------------------------------------------ T ss_conf 99986411------------------------------------------------------------------------ Q ss_pred HHHHHHHCCCCEECCCCCCCEEEEEEECCCCEEEEECHHHHCCCCHHHHHHHHHHHHHHCCCCEEEEECCCEEEECCHHH Q ss_conf 76531000011002445670599999518530266466774164278999988887751145502765582587399899 Q gi|254780814|r 642 LNFIAEKSEKSWHGSINNSDILVKRIARGVQEQFVLEGSSIFSFGSKYKRIFSNRIEENYLTILSLICNGDKIKITGPCS 721 (803) Q Consensus 642 l~~~~~~~~~~~~~~~~~~~i~~~~~~~Gv~~~~~l~~~~~~s~~~~~~~~~~~~l~~~~~~~~~l~~~~~~~~i~s~~e 721 (803) T Consensus 561 -------------------------------------------------------------------------------- 560 (633) T PRK05559 561 -------------------------------------------------------------------------------- 560 (633) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHHHHHHHCCCEEEECCCCCCCCHHHHHHCCCCHHHCEEEEEEHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCC Q ss_conf 9999998630676286037777889557223038961340212367988988878656418998762899885244123 Q gi|254780814|r 722 LLDTIFSIGRKGISMQRYKGLGEMNASQLWETTLNPDTRSLLRVKITDASQADDLFSRLMGDEVEPRREFIQENSLPSA 800 (803) Q Consensus 722 l~~~i~~~gkKG~sIQRYKGLGEMNPeQLWETTMDP~tRtLLqV~IeDa~~Ad~iF~~LMGD~VEPRReFIe~NAL~vN 800 (803) ..+..++||||||||||||+|||||||||++|+|+||+++|+..||++|++|||++|+|||+||++||..|. T Consensus 561 -------~~~~k~~i~ryKGLGem~~~ql~et~mdp~~r~l~~v~~~d~~~~~~~~~~lmg~~~~~Rk~~i~~~a~~~~ 632 (633) T PRK05559 561 -------GKKGKVEIQRFKGLGEMNPDQLWETTMDPETRRLLRVTIDDAAETEELVDMLMGKKAEPRRNWIEENGDELD 632 (633) T ss_pred -------CCCCCCEEEEEEECCCCCHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC T ss_conf -------577870368764067799899998725822146999864868899999999828995799999996034146 No 7 >smart00433 TOP2c TopoisomeraseII. Eukaryotic DNA topoisomerase II, GyrB, ParE Probab=100.00 E-value=0 Score=1677.08 Aligned_cols=590 Identities=54% Similarity=0.881 Sum_probs=555.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEECC Q ss_conf 25445656755578868607797579998389839999768445364077788500125642501132328986048311 Q gi|254780814|r 40 GLHHMIYEVLDNAIDESLEGYADIITVTLNMDGSCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYKVSGG 119 (803) Q Consensus 40 Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgsisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~sgG 119 (803) ||||||||||||||||++||||+.|+|+||+||||||+||||||||++||++++|+||+|||+||||||||+++|++||| T Consensus 1 GL~hl~~EIldNsvDe~~~g~~~~I~V~i~~dgsisV~DnGrGIPv~~h~~~~~~~~E~i~t~LhaGgkFd~~~yk~sGG 80 (594) T smart00433 1 GLHHLVDEIVDNAADEALAGYMDTIKVTIDKDNSISVEDNGRGIPVEIHPKEKKYAPEVIFTVLHAGGKFDDDAYKVSGG 80 (594) T ss_pred CCEEEEEEEEECHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCEEECC T ss_conf 96799845667377688428997799999279849999899765423667679712014761322436889985167067 Q ss_pred CCCCEEEEEEECCCEEEEEEEECCEEEEEEEEC-CEECCCCEEECCCCCCCCEEEEEEECHHHHHH-HHCCHHHHHHHHH Q ss_conf 675203778632644999999789599999889-83646313873277888708999877888542-1000899999999 Q gi|254780814|r 120 LHGVGVSVVNALSSWLKLRIKREGNIYEMSFIN-GILDNPLVVTGSAGNDTGTEVTFLPSSDIFSV-QDFNYDTLQHRLR 197 (803) Q Consensus 120 lhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~-g~~~~~l~~~~~~~~~~GT~I~F~PD~~iF~~-~~~~~d~l~~R~~ 197 (803) |||||||||||||+||+|+|+|+|+.|.|+|++ |.+.+++++++++ +++||+|+|+||++||++ ..|++++|.+|++ T Consensus 81 lhGvGasvvNalS~~f~Vev~r~gk~y~q~f~~~g~~~~~~~~~~~~-~k~GT~I~F~PD~~iF~~~~~~~~~~l~~Rl~ 159 (594) T smart00433 81 LHGVGASVVNALSTEFEVEVARDGKEYKQSFSNNGKPLSEPKIIGDT-KKDGTKVTFKPDLEIFGMTTDDDFELLKRRLR 159 (594) T ss_pred CCCCCEEEEEECCCEEEEEEEECCEEEEEEECCCCEECCCCEECCCC-CCCCEEEEEEECHHHHCCCCCCCHHHHHHHHH T ss_conf 88711476584127399999989999999990888489885554788-99951999997878839975466999999999 Q ss_pred HHHCCCCEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHCCCCCCCCCCCEEEEEEEEEEEEEEEEEECCCCCCEEEEEE Q ss_conf 73114641699999635864210100126883577653003565568852036420036999733453177541212222 Q gi|254780814|r 198 ELSFLNSSVQISLIDKRSPEPREITMFYKGGIEAFVSYLDRHKKPLMQSPIRIQGSRDNISIDLAMRWNDGYHENVLCFT 277 (803) Q Consensus 198 ~~A~Ln~gl~i~l~Der~~~~~~~~f~~~~Gi~~fv~~l~~~~~~~~~~~i~~~~~~~~~~veval~~~~~~~e~i~Sfv 277 (803) ++|||||||+|+|+|+|.. .+.+|+|++|+++|+++++.++.++++.+++..++.+++.||||+||+++|.++++||| T Consensus 160 ~~A~Ln~gl~I~lnder~~--~~~~f~~~~Gl~dyv~~~~~~~~~~~~~~~~~~~~~~~~~~eva~~~s~~~~~~~~SFv 237 (594) T smart00433 160 ELAFLNKGVKITLNDERSD--EEETFLFEGGIKDYVELLNKNKELLSPEPTYIEGEKDNIRVEVAFQYTDGYSENIVSFV 237 (594) T ss_pred HHHCCCCCCEEEEEEECCC--CEEEEECCCCHHHHHHHHCCCCCCCCCCCEEEEEEECCCEEEEEEEECCCCCCCCCEEE T ss_conf 9842388838999984378--45789717879999998538987678874587787168269999996598872100023 Q ss_pred CCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHH Q ss_conf 46216667721035688899999999986242000123576134305721566652137633572100202611110321 Q gi|254780814|r 278 NNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSVIGDDCREGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVE 357 (803) Q Consensus 278 N~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~~~~Diregl~~visvki~nP~F~gQTK~kL~s~~v~~~v~ 357 (803) |+|+|++|||||+||++||+++||+|++++++.|+ .+++++||||||++||||+|+||+|+||||+||+|++++..|+ T Consensus 238 N~I~T~~GGTHv~gf~~al~~~i~~~~~k~~~~K~--~~i~~~DIregL~~vIsvki~nPqFegQTKekL~~~~~~~~v~ 315 (594) T smart00433 238 NNIATTEGGTHENGFKDALTRVINEYAKKKKKLKE--KNIKGEDVREGLTAFISVKIPEPQFEGQTKEKLGTSEVRFGVE 315 (594) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCC--CCCCHHHHHCCCEEEEEEEECCCCCCCCHHHHHCCHHHHHHHH T ss_conf 14788999848899999999999999998474244--6799999852336999998248730664213435977765689 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCC Q ss_conf 11026899998740035799987789998877554553211001100124676676666555555521799982557543 Q gi|254780814|r 358 SFVNDGLSAWLEEHPLEAKIIVKKVLDASMVRDAARRARDLTRRKGVLDIASLPGKLADCSERDPKKSELFLVEGDSAGG 437 (803) Q Consensus 358 ~~v~~~l~~~l~~n~~~ak~i~~k~~~a~~ar~~a~kar~~~rrK~~~~~~~lpgKL~Dc~~k~~~~~eL~ivEGDSAgg 437 (803) ..+.+.|..||++||..|+.|+++++.+++||++||++|+..||| ++....|||||+||+++++++||||||||||||| T Consensus 316 ~~v~~~l~~~l~~n~~~a~~Ii~k~~~~a~ar~~ak~~k~~~rk~-~~~~~~lpgKL~Dc~~~~~~~~eLfivEGDSA~g 394 (594) T smart00433 316 KIVSECLLSFLEENPVEASKIVEKVLLAAKARAAAKKARELTRKK-KLSKISLPGKLADASSAGPKKCELFLVEGDSAGG 394 (594) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCC T ss_conf 999998866654398988999999999999999999768775654-2112578866724356798862899973478874 Q ss_pred CCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHH Q ss_conf 22334654430233005620334430587663236678888762578775332123567373899850888853289999 Q gi|254780814|r 438 SAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDINKLRYHKIIIMTDADVDGAHIRTLL 517 (803) Q Consensus 438 sak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~~lrY~kiiimtDaD~DG~HI~~Ll 517 (803) ||||||||+|||||||||||||||+|+.+||++|+||++|++|||||+|++ ||+++||||||||||||||||||||||| T Consensus 395 sAk~grdr~~qailPLRGKiLNv~~a~~~ki~~N~Ei~~li~alG~~~g~~-~d~~~LRY~kIiImTDADvDG~HI~~LL 473 (594) T smart00433 395 SAKSGRDRDFQAILPLRGKLLNVEKASLDKILKNEEIQALITALGLGIGKD-FDIEKLRYGKIIIMTDADVDGSHIKGLL 473 (594) T ss_pred CEEEEECCCCEEEECCCCCCCCHHHCCHHHHHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCEEEEEECCCCCHHHHHHHH T ss_conf 337751255155531476015355468888751078999999977787876-6711067782899977898757899999 Q ss_pred HHHHHHHHHHHHHCCEEEEECCCEEEEEECCCEEEEE--CHHHHHHHHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 9999963077874895999438337887267179981--45789899985001100001457765435468999999999 Q gi|254780814|r 518 LTFFFRQMPSLIHNGFLYIIKPPLYGITRGKSLQYVK--DEESLEDYLINQSLSEEIELFCDSKGVISKNDLRNFIDDAL 595 (803) Q Consensus 518 ltff~~~~~~li~~g~~yia~ppl~~~~~~K~E~Yik--DD~EL~~~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~~~ 595 (803) |||||||||+||++|||||||||||||++||+..|+. .++|.+.+.... T Consensus 474 ltff~r~~p~Li~~G~vy~a~pPL~kv~~~~~~~~~~~~~~~e~~~~~~~~----------------------------- 524 (594) T smart00433 474 LTFFYRYMPPLIEAGFVYIAIPPLYKVTKGKKKYVYSFYSLDEYEKWLETK----------------------------- 524 (594) T ss_pred HHHHHHHHHHHHHCCEEEEEECCEEEEEECCCCEEEEECCCHHHHHHHHHH----------------------------- T ss_conf 999998769988679699980988999948940799843747999999862----------------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCCEEEEEEECCCCEEE Q ss_conf 99999986434457999998874211351123577899999999887653100001100244567059999951853026 Q gi|254780814|r 596 KIDKLIKDFYPNHDKTFIEQAVISGVFNCESDQEINVSLASKLANRLNFIAEKSEKSWHGSINNSDILVKRIARGVQEQF 675 (803) Q Consensus 596 ~~~~~l~~l~~~~~~~lle~l~~~~~l~~~~d~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~~i~~~~~~~Gv~~~~ 675 (803) T Consensus 525 -------------------------------------------------------------------------------- 524 (594) T smart00433 525 -------------------------------------------------------------------------------- 524 (594) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred EECHHHHCCCCHHHHHHHHHHHHHHCCCCEEEEECCCEEEECCHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHCCC Q ss_conf 64667741642789999888877511455027655825873998999999998630676286037777889557223038 Q gi|254780814|r 676 VLEGSSIFSFGSKYKRIFSNRIEENYLTILSLICNGDKIKITGPCSLLDTIFSIGRKGISMQRYKGLGEMNASQLWETTL 755 (803) Q Consensus 676 ~l~~~~~~s~~~~~~~~~~~~l~~~~~~~~~l~~~~~~~~i~s~~el~~~i~~~gkKG~sIQRYKGLGEMNPeQLWETTM 755 (803) ...+++++||||||||||||+||||||| T Consensus 525 ----------------------------------------------------~~~~~~~~i~ryKGLGem~~~ql~ettm 552 (594) T smart00433 525 ----------------------------------------------------GGNKSKYEIQRYKGLGEMNADQLWETTM 552 (594) T ss_pred ----------------------------------------------------HCCCCCCEEEEEECCCCCCHHHHHHHHH T ss_conf ----------------------------------------------------0678983069631267899899998720 Q ss_pred CHHHCEEEEEEHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 961340212367988988878656418998762899885244 Q gi|254780814|r 756 NPDTRSLLRVKITDASQADDLFSRLMGDEVEPRREFIQENSL 797 (803) Q Consensus 756 DP~tRtLLqV~IeDa~~Ad~iF~~LMGD~VEPRReFIe~NAL 797 (803) ||++|+|+||+++|+.+||++|++||||+|+|||+||++||. T Consensus 553 ~p~~r~l~~v~~~d~~~~~~~~~~lmg~~~~~Rk~~i~~~a~ 594 (594) T smart00433 553 DPERRTLLFVTLDDADEADLIFSALMGDKVEPRKEWIEENAP 594 (594) T ss_pred CHHHEEEEEEECCCHHHHHHHHHHHCCCCCHHHHHHHHHCCC T ss_conf 963636999976866779999998769996899999997298 No 8 >TIGR01055 parE_Gneg DNA topoisomerase IV, B subunit; InterPro: IPR005737 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . This entry represents subunit B (parE) of topoisomerase IV from Gram negative bacteria. Topoisomerase IV primarily decatenates DNA and relaxes positive supercoils, which is important in bacteria, where the circular chromosome becomes catenated, or linked, during replication . Topoisomerase IV consists of two polypeptide subunits, parC (subunit A), which is homologous to gyrA of topoisomerase II, and parE (subunit B), which is homologous to gyrB of topoisomerase II. More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0005694 chromosome. Probab=100.00 E-value=0 Score=1487.68 Aligned_cols=627 Identities=42% Similarity=0.688 Sum_probs=581.2 Q ss_pred CCCCHHHCEECCCCCHHCCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCCC Q ss_conf 36786551231354200045871543279963254456567555788686077975799983898399997684453640 Q gi|254780814|r 8 SSYDADSIQILKGLDAVKKRPSMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDGSCTVSDNGRGIPTDL 87 (803) Q Consensus 8 ~~Y~~~~i~~L~gle~vrkRP~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgsisV~dnGrGIPv~~ 87 (803) ++|.|++|+||+||||||+|||||||+|| .+-.|||+.||||||||||+||||+.|.|.+|.|.||+|.|||||||||+ T Consensus 2 ~~YsA~~I~VL~gLEPVr~RPGMY~GGtD-~~~~~HL~~EV~DN~~DEA~AGfA~~i~v~lH~D~Si~v~D~GRG~PvD~ 80 (647) T TIGR01055 2 TNYSAKDIEVLDGLEPVRKRPGMYTGGTD-TTRPNHLVQEVIDNSVDEALAGFASKIEVILHQDQSIEVIDNGRGMPVDI 80 (647) T ss_pred CCCCCCHHHHHCCCCCCCCCCCCEECCCC-CCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEECCCCCCCCC T ss_conf 87461013220168773428872017865-65301345554433355554065307889982896179988883034427 Q ss_pred CCCC-CCCEEEEEEEEECCCCCCCCCCCEEECCCCCCEEEEEEECCCEEEEEEEECCEEEEEEEECCEECCCCEEECCCC Q ss_conf 7778-850012564250113232898604831167520377863264499999978959999988983646313873277 Q gi|254780814|r 88 HKEE-GISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGVSVVNALSSWLKLRIKREGNIYEMSFINGILDNPLVVTGSAG 166 (803) Q Consensus 88 h~~~-~~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~g~~~~~l~~~~~~~ 166 (803) ||++ ++|++|+|+|.|||||||+++.|.+||||||||+|||||||++|+|.|+|+|++|.|.|++|.....|..++.++ T Consensus 81 HP~~P~~sa~EvILt~LHaG~KFSN~~Y~~sGGLHGVGISVVNALS~~~~i~V~~~G~~Y~~af~~G~K~~dL~~~~~~G 160 (647) T TIGR01055 81 HPKEPGVSAVEVILTKLHAGGKFSNKNYEFSGGLHGVGISVVNALSKRVKIKVKRQGKLYSIAFENGAKVTDLEVVGTCG 160 (647) T ss_pred CCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCEECCHHHHEEECCCEEEEEEEECCEEEEHHHHCCCCHHHHHHHHCCC T ss_conf 88888886235564300578521144401368600214320220132367899746704320221575110233211267 Q ss_pred -CCCCEEEEEEECHHHHHH-HHCCHHHHHHHHHHHHCCCCEEEEEEEECCC---CCCCEEEECCCCCCHHHHHHHCCCCC Q ss_conf -888708999877888542-1000899999999731146416999996358---64210100126883577653003565 Q gi|254780814|r 167 -NDTGTEVTFLPSSDIFSV-QDFNYDTLQHRLRELSFLNSSVQISLIDKRS---PEPREITMFYKGGIEAFVSYLDRHKK 241 (803) Q Consensus 167 -~~~GT~I~F~PD~~iF~~-~~~~~d~l~~R~~~~A~Ln~gl~i~l~Der~---~~~~~~~f~~~~Gi~~fv~~l~~~~~ 241 (803) ..+||.|.|+||+++|.. ..|+...|.+-+|.+|+|++||.|.|.|+-. +.+++..++|++||+||+..--...+ T Consensus 161 kR~~GT~v~F~Pd~~~F~~k~~F~~~~l~~~~raKA~L~~Gv~I~f~~~v~~PsdiP~~~~~~y~~GL~DYL~~~v~~~~ 240 (647) T TIGR01055 161 KRLTGTSVHFTPDPEIFDSKLKFSVSRLYHILRAKAVLCRGVEIEFEDEVNNPSDIPTKALWNYEDGLKDYLSEAVNGLN 240 (647) T ss_pred CCCCCCEEEECCCCCHHCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHCCCCHHHHHHHHHCCCC T ss_conf 76556357731682010312255789999998632420267207897510686557545543110331368887631232 Q ss_pred CCCCCCEEE--EEEEEEEEEEEEEEECCCCCCEE-EEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 568852036--42003699973345317754121-222246216667721035688899999999986242000123576 Q gi|254780814|r 242 PLMQSPIRI--QGSRDNISIDLAMRWNDGYHENV-LCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSVI 318 (803) Q Consensus 242 ~~~~~~i~~--~~~~~~~~veval~~~~~~~e~i-~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~~ 318 (803) .+.++||.. ++.-++..|+||+.|-..-.+.+ .||||.|||++||||++|+|.||.++++.|++-.++.++. ++++ T Consensus 241 ~~~~~~F~G~~~~tvd~~~~~WAl~W~~~~~~~~~~SY~N~iPT~QGGTH~~GL~~~lL~a~~~f~E~~~~~P~~-~~Lt 319 (647) T TIGR01055 241 TLPKKPFVGNFEGTVDDEAVEWALLWLPEGGELLMESYVNLIPTIQGGTHVNGLRQGLLDALREFCEMRNLLPRG-VKLT 319 (647) T ss_pred CCCCCCEECCCCCCHHHHHCCEEEEECCCCCHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCE-EECC T ss_conf 234553002321101111002446632788400320411015789887104578999999999988875188640-2210 Q ss_pred HHHHHCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 13430572156665213763357210020261111032111026899998740035799987789998877554553211 Q gi|254780814|r 319 GDDCREGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLDASMVRDAARRARDL 398 (803) Q Consensus 319 ~~Diregl~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~~ar~~a~kar~~ 398 (803) .+||++-...|+|++|.+|+|.|||||||+|..+..+|+.+++|+|..||..|...|..+++.+|..|+.|...|.+|.. T Consensus 320 ~eDI~~~~s~vLS~~~~~p~F~GQTKE~L~S~~~a~~V~~v~KD~F~~~L~~~~~~A~~L~E~~I~~A~~Rrn~~~~~~v 399 (647) T TIGR01055 320 AEDILDRCSYVLSIKMQDPQFAGQTKERLSSRQVAKLVSGVIKDAFDLWLNQNVQLAEKLAEIAISSAQRRRNKRAAKKV 399 (647) T ss_pred HHHHHHHHHEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 88887311101455665566577531233131357887777877888874110788999999998888877654553142 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHH Q ss_conf 00110012467667666655555552179998255754322334654430233005620334430587663236678888 Q gi|254780814|r 399 TRRKGVLDIASLPGKLADCSERDPKKSELFLVEGDSAGGSAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLI 478 (803) Q Consensus 399 ~rrK~~~~~~~lpgKL~Dc~~k~~~~~eL~ivEGDSAggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~ 478 (803) + ||+......||||||||+..+.+++||||||||||||||||||||++||||||||||||||.++.++|+.|+||+||- T Consensus 400 ~-RK~~~~~~~LPGKLADCt~~~~e~TELFlVEGDSAGGSAKQAR~RE~QAilPL~GKiLN~~~~~~~~vl~~~~i~D~~ 478 (647) T TIGR01055 400 V-RKKLTSGPALPGKLADCTRQDLEKTELFLVEGDSAGGSAKQARDREYQAILPLKGKILNTWEVSLDKVLNSQEIHDIE 478 (647) T ss_pred C-HHHHCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHCCCCCEEECCCCCCEEEEECCCHHHHHCCCCCCEEE T ss_conf 0-012104678897211334311126427898325677521343053201214456647763214644553124200021 Q ss_pred HHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEEEECCCEEEEECHHH Q ss_conf 76257877533212356737389985088885328999999999630778748959994383378872671799814578 Q gi|254780814|r 479 TALGTGIGQDSFDINKLRYHKIIIMTDADVDGAHIRTLLLTFFFRQMPSLIHNGFLYIIKPPLYGITRGKSLQYVKDEES 558 (803) Q Consensus 479 ~~lG~~~g~~~~~~~~lrY~kiiimtDaD~DG~HI~~Llltff~~~~~~li~~g~~yia~ppl~~~~~~K~E~YikDD~E 558 (803) -||||+..++ +..+.|||+|||||||||+||.||+|||+|||+++||.||+.||||+|.|||||++.+|+..|..||+| T Consensus 479 vAl~~~~~~~-~k~~~LRY~Ki~Il~DAD~DG~HIA~LL~~lF~~~~p~LV~~GH~Y~A~PPLYR~~~~~~vyYA~dEeE 557 (647) T TIGR01055 479 VALGIDPDSN-DKLSQLRYGKIIILADADSDGLHIATLLCALFLKHFPKLVEEGHVYVAKPPLYRIDLSKEVYYALDEEE 557 (647) T ss_pred EEEECCCCCH-HHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEECCCEEEEECCHHH T ss_conf 0222487200-036426722288984178880789999999999988888753907871787202106870688528466 Q ss_pred HHHHHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH Q ss_conf 98999850011000014577654354689999999999999998643445799999887421135112357789999999 Q gi|254780814|r 559 LEDYLINQSLSEEIELFCDSKGVISKNDLRNFIDDALKIDKLIKDFYPNHDKTFIEQAVISGVFNCESDQEINVSLASKL 638 (803) Q Consensus 559 L~~~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~~~~~~~~l~~l~~~~~~~lle~l~~~~~l~~~~d~~~~~~~~~~l 638 (803) -...+-.+ T Consensus 558 K~~~~~~L------------------------------------------------------------------------ 565 (647) T TIGR01055 558 KEKLLEKL------------------------------------------------------------------------ 565 (647) T ss_pred HHHHHHHH------------------------------------------------------------------------ T ss_conf 89999999------------------------------------------------------------------------ Q ss_pred HHHHHHHHHHCCCCEECCCCCCCEEEEEEECCCCEEEEECHHHHCCCCHHHHHHHHHHHHHHCCCCEEEEECCCEEEECC Q ss_conf 98876531000011002445670599999518530266466774164278999988887751145502765582587399 Q gi|254780814|r 639 ANRLNFIAEKSEKSWHGSINNSDILVKRIARGVQEQFVLEGSSIFSFGSKYKRIFSNRIEENYLTILSLICNGDKIKITG 718 (803) Q Consensus 639 ~~~l~~~~~~~~~~~~~~~~~~~i~~~~~~~Gv~~~~~l~~~~~~s~~~~~~~~~~~~l~~~~~~~~~l~~~~~~~~i~s 718 (803) T Consensus 566 -------------------------------------------------------------------------------- 565 (647) T TIGR01055 566 -------------------------------------------------------------------------------- 565 (647) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHCCCCHHHCEEEEEEHHHHHH--HH--HHHHHHCCCC-CCCHHHHHH Q ss_conf 899999999863067628603777788955722303896134021236798898--88--7865641899-876289988 Q gi|254780814|r 719 PCSLLDTIFSIGRKGISMQRYKGLGEMNASQLWETTLNPDTRSLLRVKITDASQ--AD--DLFSRLMGDE-VEPRREFIQ 793 (803) Q Consensus 719 ~~el~~~i~~~gkKG~sIQRYKGLGEMNPeQLWETTMDP~tRtLLqV~IeDa~~--Ad--~iF~~LMGD~-VEPRReFIe 793 (803) ...+|-.+.+|||||||||||.||.||||||++|+|+|||++|... .| +|..+|||.+ .|.|++||+ T Consensus 566 --------~~~sK~K~~V~RFKGLGEM~P~QL~ETT~~P~~R~L~~~T~~~~~~q~~D~l~~~D~LlaKKR~E~R~~~l~ 637 (647) T TIGR01055 566 --------KKKSKGKPNVQRFKGLGEMNPAQLRETTMDPNTRRLVQLTLDDVQDQRVDTLKLVDMLLAKKRSEDRKNWLQ 637 (647) T ss_pred --------HHHCCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHCCCCCHHHHHHHH T ss_conf --------763068854411025456782012103588876137887631255655327999998750688446899999 Q ss_pred HHHHH Q ss_conf 52441 Q gi|254780814|r 794 ENSLP 798 (803) Q Consensus 794 ~NAL~ 798 (803) +.||. T Consensus 638 ~qalv 642 (647) T TIGR01055 638 EQALV 642 (647) T ss_pred HHHHH T ss_conf 87754 No 9 >PTZ00108 DNA topoisomerase II; Provisional Probab=100.00 E-value=0 Score=1175.46 Aligned_cols=577 Identities=23% Similarity=0.378 Sum_probs=473.4 Q ss_pred CCCCC-CCCCCCHHHCEECCCCCHHCCCCCCEECCCCC----------------------CCHHHHHHHHHHHHHHHHHH Q ss_conf 98764-22367865512313542000458715432799----------------------63254456567555788686 Q gi|254780814|r 1 MSDNI-QNSSYDADSIQILKGLDAVKKRPSMYIGDTDG----------------------ESGLHHMIYEVLDNAIDESL 57 (803) Q Consensus 1 m~~~~-~~~~Y~~~~i~~L~gle~vrkRP~mYiG~t~~----------------------~~Gl~hlv~EiidNsvDe~~ 57 (803) |+.+. .++.| |+++.||||.+||+|||||+.. ..||+++++|||.||+|... T Consensus 1 m~k~ktiee~Y-----qKkTqlEHILlRPDtYIGSvE~~~~~mWV~d~e~~~m~~k~v~yvPGLyKIFDEILVNAaDNk~ 75 (1506) T PTZ00108 1 MAKNKSIEERY-----QKKSQIEHILLRPDTYIGSVEMHTQLLWVWNKEKNRMVQKNITYVPGLYKIFDEIIVNAADVKA 75 (1506) T ss_pred CCCCCCHHHHH-----HCCCHHHHHHCCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEEECCHHHHHHHHHHCCCHHHHH T ss_conf 98766689977-----5068888986289877567646510388997676838888866665347544456415144432 Q ss_pred H------CCCCEEEEEECCC-CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEECCCCCCEEEEEEE Q ss_conf 0------7797579998389-83999976844536407778850012564250113232898604831167520377863 Q gi|254780814|r 58 E------GYADIITVTLNMD-GSCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGVSVVNA 130 (803) Q Consensus 58 ~------g~~~~I~V~i~~d-gsisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~s~vNa 130 (803) . ..|+.|+|+|++| |+|||+||||||||++||++++|+||||||+||||||||+++||+|||||||||||||| T Consensus 76 Rd~~~~~~pmt~IKVtID~eng~ISV~NNGRGIPVeiH~kekvyvPELIFGhLlTGsNFDDdeyKVSGGrNGvGAKLvNI 155 (1506) T PTZ00108 76 REKEKSENPMTCIKIEINKDNKKISVYNDGEGIPVDIHKEMNIYVPHMIFGELLTSDNYDDAEDRITGGRNGFGAKLTNI 155 (1506) T ss_pred CCCCCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCEECCCCCCCCEEEEEE T ss_conf 07543568766599999678997999979995243687767974726865207676787899563448888724578786 Q ss_pred CCCEEEEEEEEC--CEEEEEEEECCEECCCCEEECCCCCCCCEEEEEEECHHHHHHHHCCHHH---HHHHHHHHHCCCCE Q ss_conf 264499999978--9599999889836463138732778887089998778885421000899---99999973114641 Q gi|254780814|r 131 LSSWLKLRIKRE--GNIYEMSFINGILDNPLVVTGSAGNDTGTEVTFLPSSDIFSVQDFNYDT---LQHRLRELSFLNSS 205 (803) Q Consensus 131 lS~~~~v~v~rd--gk~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~F~PD~~iF~~~~~~~d~---l~~R~~~~A~Ln~g 205 (803) ||++|+|+++++ ++.|+|+|++||.....+.++....++||+|+|+||+++|++..|+.|+ |.+|++|+|.++ + T Consensus 156 FSteF~VEt~D~~~kk~ykQtw~nNM~k~~~PkIk~~~gk~yTkITFkPD~~iFgmt~ld~Di~sLL~KRvyDLAgt~-s 234 (1506) T PTZ00108 156 FSKEFIVQCGDSSRKKEFKMTWSDNMSKFSEPHIKNYNGKDYVKVTFKPDLAKFGMTEMDDDIESLLCKRVYDLAGTC-S 234 (1506) T ss_pred CCCEEEEEEEECCCEEEEEEEEECCCCCCCCCEEECCCCCCCEEEEEECCHHHCCCCCCCHHHHHHHHHHHHHHHCCC-C T ss_conf 268189999847953799999708971378874743689981699998488980983037789999999999973677-8 Q ss_pred EEEEEEECCCCCCC--------------------EEEECCCCCCHHHH----------HHHCCCC---------CCCC-- Q ss_conf 69999963586421--------------------01001268835776----------5300356---------5568-- Q gi|254780814|r 206 VQISLIDKRSPEPR--------------------EITMFYKGGIEAFV----------SYLDRHK---------KPLM-- 244 (803) Q Consensus 206 l~i~l~Der~~~~~--------------------~~~f~~~~Gi~~fv----------~~l~~~~---------~~~~-- 244 (803) ++|+|++++..-.. ...+++.+|...+. +.++... .|.. T Consensus 235 VkVyLNG~rI~Ik~Fk~Yv~Lylkd~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~de~~n~~~d~~dvs~~~eP~~~~ 314 (1506) T PTZ00108 235 VRVYLNGNRLAIKDFKSYVDLYLKDNAGDNKGNGGQSGSGSGGGSGNDNGNGGGNGGDEEANNAADNLDVSASNEPADAT 314 (1506) T ss_pred EEEEECCCEEECCCHHHHHEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCC T ss_conf 06997585100155344202675012233333333222223332111122344433201210111233210013544465 Q ss_pred ---------------CCCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCC Q ss_conf ---------------85203642003699973345317754121222246216667721035688899999999986242 Q gi|254780814|r 245 ---------------QSPIRIQGSRDNISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFG 309 (803) Q Consensus 245 ---------------~~~i~~~~~~~~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~ 309 (803) .+++.+... ..+||||++++++++++++||||||+|++|||||++|..+|++.+.+++.+ T Consensus 315 p~~~~~~~~~~~~d~~eii~I~e~--~~RWEVav~~Sd~~~FqqVSFVNnI~T~kGGTHVdyi~dqIvn~L~e~~~k--- 389 (1506) T PTZ00108 315 PSKNNAANGANNNDEDEIVKIHEK--QHRWEIVVSKSDGSQFQQVSFVNSICTTKGGSHVNYIVEQLLNSLSKKANA--- 389 (1506) T ss_pred CCCCCCCCCCCCCCCCCEEEEECC--CCCEEEEEEECCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHH--- T ss_conf 432222223345665531565136--774799999778996578877676158999808999999999999999865--- Q ss_pred CCCCCCCCCHHHHHCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00012357613430572156665213763357210020261111032111026899998740035799987789998877 Q gi|254780814|r 310 SKKDKRSVIGDDCREGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLDASMVR 389 (803) Q Consensus 310 ~Kk~k~~~~~~Diregl~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~~ar 389 (803) .+|....++++|||+||++||||+|+||+|+||||++|++++++..+...+.++|..++.++| |+++++..+++| T Consensus 390 KkK~~~~Ik~~dIKegL~aFVn~kI~NPqFdSQTKEkLtT~~~kFGSk~~LsekfikkllKs~-----IIekIl~~a~aK 464 (1506) T PTZ00108 390 KNKGGMEIKSGHIKNHLWVFVNCLIVNPTFDSQTKETLTTKPVKFGSKCILSDKTINNVLKSP-----ILSNILLWAQAK 464 (1506) T ss_pred HCCCCCCCCHHHHHHCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCH-----HHHHHHHHHHHH T ss_conf 255777679899860579999997779976761015403886456355347899999998695-----999999999999 Q ss_pred HHHH--HHHHHCCCCCCCCCCCCCCCCCCCC---CCCCCCEEEEEEECCCC-----CCCCCCCCCCCCEEEEECCCCCCC Q ss_conf 5545--5321100110012467667666655---55555217999825575-----432233465443023300562033 Q gi|254780814|r 390 DAAR--RARDLTRRKGVLDIASLPGKLADCS---ERDPKKSELFLVEGDSA-----GGSAKQGRSRENQAILPLRGKILN 459 (803) Q Consensus 390 ~~a~--kar~~~rrK~~~~~~~lpgKL~Dc~---~k~~~~~eL~ivEGDSA-----ggsak~gRdr~~qai~PLrGKilN 459 (803) ++++ |++....+|......++| ||.||+ ++++++||||||||||| +|++++|||+ |||||||||||| T Consensus 465 a~~~LkKk~Ka~~~KkR~ri~gIp-KL~DAN~AGtk~s~~CTLiLtEGDSAK~lAvaGlsvvGRD~--~GVfPLRGKiLN 541 (1506) T PTZ00108 465 AQVELKKKMKAGSSKARERIIGIP-KLEDANDAGSKYSQECTLILTEGDSAKTSCLAGLSIVGRDK--YGVFPLKGKLLN 541 (1506) T ss_pred HHHHHHHHHHHHCCCCHHHCCCCC-CCCCCCCCCCCCCCCEEEEEECCCCHHHHHHCCCCCCCCCC--EEEEECCCEEEE T ss_conf 999999986541144101114898-77463002799863328999637516877750402136542--234420561310 Q ss_pred HHHCCHHHHHCCHHHHHHHHHHCCCCC-CCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH-CCEEEEE Q ss_conf 443058766323667888876257877-53321235673738998508888532899999999963077874-8959994 Q gi|254780814|r 460 VERARFDKMLSSQEIGTLITALGTGIG-QDSFDINKLRYHKIIIMTDADVDGAHIRTLLLTFFFRQMPSLIH-NGFLYII 537 (803) Q Consensus 460 v~~a~~~~i~~n~Ei~~l~~~lG~~~g-~~~~~~~~lrY~kiiimtDaD~DG~HI~~Llltff~~~~~~li~-~g~~yia 537 (803) ||+|+.+||++|+||++|++||||+++ +.++|+++|||||||||||||||||||+||||+|||.+||+|++ .|+|--- T Consensus 542 Vr~As~~kI~~N~EI~nIikiLGL~i~~k~~~dik~LRYGkImIMTDqDvDGSHIKGLLiNf~h~~wPsLLk~~gFl~~f 621 (1506) T PTZ00108 542 VRDASFKQLMDNKEIQNIFKIMGLDITDKNKDDIKGLRYGSLMIMTDQDYDGSHIKGLLINMIHKFWPSLLKHKGFLSEF 621 (1506) T ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHH T ss_conf 13465988861388999999977786776556553445561699834889860138899999998628761876788873 Q ss_pred CCCEEEEEECCCEEEEECHHHHHHHHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 38337887267179981457898999850011000014577654354689999999999 Q gi|254780814|r 538 KPPLYGITRGKSLQYVKDEESLEDYLINQSLSEEIELFCDSKGVISKNDLRNFIDDALK 596 (803) Q Consensus 538 ~ppl~~~~~~K~E~YikDD~EL~~~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~~~~ 596 (803) .+|+.|+++|+++..+.+.+|.++|.-+....+..+.+..+-...+..+.+++.....+ T Consensus 622 iTPIvK~~kg~~~~~Fyt~~eye~Wk~~~~~~~w~iKYYKGLGTSt~~EakeYF~~l~~ 680 (1506) T PTZ00108 622 VTPIVKVQKGNQEHSFFTIAEYEQWKENTNLLGWKIKYYKGLGTSTDKEFKQYFSDIKN 680 (1506) T ss_pred CEEEEEECCCCCCCCCCCHHHHHHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 01035532697401127889999998638988740577643688880899999987875 No 10 >KOG0355 consensus Probab=100.00 E-value=0 Score=867.70 Aligned_cols=597 Identities=26% Similarity=0.380 Sum_probs=493.1 Q ss_pred HHHCEECCCCCHHCCCCCCEECCCC--------------------CCCHHHHHHHHHHHHHHHHHHHC-CCCEEEEEECC Q ss_conf 6551231354200045871543279--------------------96325445656755578868607-79757999838 Q gi|254780814|r 12 ADSIQILKGLDAVKKRPSMYIGDTD--------------------GESGLHHMIYEVLDNAIDESLEG-YADIITVTLNM 70 (803) Q Consensus 12 ~~~i~~L~gle~vrkRP~mYiG~t~--------------------~~~Gl~hlv~EiidNsvDe~~~g-~~~~I~V~i~~ 70 (803) ++.++.-+.+||+..||..|||++. +.+||||+++||++||+| -.++ -.+.|.|+||+ T Consensus 5 ~~~yqkks~~ehiL~rPdtyig~ve~~~~~~wv~~~e~~k~~~~t~~pGl~ki~dEilvNaad-k~rd~~m~~i~v~i~~ 83 (842) T KOG0355 5 EDIYQKKSQLEHILLRPDTYIGSVEPTEQLMWVYDMEKRKMVQRTYVPGLYKIFDEILVNAAD-KQRDPKMNTIKVTIDK 83 (842) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEECCCCCCEEEEECCCCHHHHHHHHHHCCCC-CCCCCCCCEEEEEECC T ss_conf 777765211133314876423554552247754033448546763487178888998603323-2248776526888716 Q ss_pred -CCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEECCCCCCEEEEEEECCCEEEEEE--EECCEEEE Q ss_conf -983999976844536407778850012564250113232898604831167520377863264499999--97895999 Q gi|254780814|r 71 -DGSCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGVSVVNALSSWLKLRI--KREGNIYE 147 (803) Q Consensus 71 -dgsisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~s~vNalS~~~~v~v--~rdgk~~~ 147 (803) +++|||+|||+||||++||++++|.||||||+||||||||+++.|++||+|||||+|+|+||..|.+|+ ++.++.|+ T Consensus 84 e~~~isv~nnGkGIPv~~H~~ek~yvpelifg~Lltssny~d~ekK~tggrngygakLcniFs~~f~~Et~d~~~~~~~k 163 (842) T KOG0355 84 EKNEISVYNNGKGIPVTIHKVEKVYVPELIFGNLLTSSNYDDDEKKVTGGRNGYGAKLCNIFSTEFTVETADREYKMAFK 163 (842) T ss_pred CCCEEEEEECCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 77679999579721312234222453478876644115678875432367775020343211454036530067777877 Q ss_pred EEEECCEECCCCEEECCCCCCCCEEEEEEECHHHHHHHHCCHHHH---HHHHHHHHCCCCEEEEEEEECCCCCCCEEEEC Q ss_conf 998898364631387327788870899987788854210008999---99999731146416999996358642101001 Q gi|254780814|r 148 MSFINGILDNPLVVTGSAGNDTGTEVTFLPSSDIFSVQDFNYDTL---QHRLRELSFLNSSVQISLIDKRSPEPREITMF 224 (803) Q Consensus 148 q~f~~g~~~~~l~~~~~~~~~~GT~I~F~PD~~iF~~~~~~~d~l---~~R~~~~A~Ln~gl~i~l~Der~~~~~~~~f~ 224 (803) |+|.+||+.+....+....+..+|+|||+||.+.|.+.++|.|++ .+|++++|-...++++.+++.+... T Consensus 164 Q~w~~nm~~~~~~~i~~~~~~~yTkitF~PDl~~F~m~eLD~Div~l~~rr~~d~a~~~~~vkv~ln~~~~~~------- 236 (842) T KOG0355 164 QTWINNMTRDEEPKIVPSTDEDYTKITFSPDLEKFKMKELDDDIVALMARRAYDLAGSVKSVKVELNGKNIPV------- 236 (842) T ss_pred HHHHCCCCCCCCCEEECCCCCCCCEEEECCCHHHCCCHHHCCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCC------- T ss_conf 7652687656786552258999536996847675272232231887677666652344212156516887752------- Q ss_pred CCCCCHHHHHHHCCCCCCC--CCCCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHH Q ss_conf 2688357765300356556--88520364200369997334531775412122224621666772103568889999999 Q gi|254780814|r 225 YKGGIEAFVSYLDRHKKPL--MQSPIRIQGSRDNISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITS 302 (803) Q Consensus 225 ~~~Gi~~fv~~l~~~~~~~--~~~~i~~~~~~~~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~ 302 (803) +|-.+|++.+.. ...+ ...+++ +.-+-.||++++-.+...++ +||||+|+|.+|||||++++..+++.++. T Consensus 237 --~~f~~yv~~~~~-~~~~~~~~~~ih---e~~~~RwEv~~a~s~~~fqq-isfvNsI~T~kGGThVd~V~D~iv~~l~~ 309 (842) T KOG0355 237 --KGFYDYVKMYLE-VLWLNDDLKPLH---EVLNFRWEVALALSDVGFQQ-VSFVNSIATTKGGTHVDYVVDQIVAKLID 309 (842) T ss_pred --HHHHHHHHHCCC-CCCCCCCCHHHH---CCCCHHHHHHHHHCCCCCCE-EEEECCCEECCCCCCCCHHHHHHHHHHHH T ss_conf --017788762353-222467522220---04324888865504466533-54200101047986544166557778888 Q ss_pred HHHHCCCCCCCCCCCCHHHHHCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99862420001235761343057215666521376335721002026111103211102689999874003579998778 Q gi|254780814|r 303 YVEKSFGSKKDKRSVIGDDCREGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKV 382 (803) Q Consensus 303 y~~~~~~~Kk~k~~~~~~Diregl~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~ 382 (803) .+.+.... .++++++..|+++|+++|+|.|+||.|+||||++|+++...-.+...+++++...+.+++ ....|++++ T Consensus 310 vvkKk~~~--~~~svk~f~vkn~L~vfVN~lIENPtFesQTke~lt~~~~~fgs~~elsek~~k~~~~~~-i~esils~s 386 (842) T KOG0355 310 VVKKKKVK--KDISVKPFQVKNHLWVFVNCLIENPTFESQTKERLTLRPKRFGSDCELSEKFTKAIELKG-VVESILSKS 386 (842) T ss_pred HHCCCCCC--CCCCCCHHHHHCCEEEEEEEEECCCCCCCCCHHHCCCCCCCCCCEEEEHHHHHHHHHHCC-HHHHHHHHH T ss_conf 74211346--654446453303569999887427663443310126663454326763167799998676-499999988 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC---CCCCCCEEEEEEECCCC---CCCCCCCCCCCCEEEEECCCC Q ss_conf 999887755455321100110012467667666655---55555217999825575---432233465443023300562 Q gi|254780814|r 383 LDASMVRDAARRARDLTRRKGVLDIASLPGKLADCS---ERDPKKSELFLVEGDSA---GGSAKQGRSRENQAILPLRGK 456 (803) Q Consensus 383 ~~a~~ar~~a~kar~~~rrK~~~~~~~lpgKL~Dc~---~k~~~~~eL~ivEGDSA---ggsak~gRdr~~qai~PLrGK 456 (803) ..++..-.+.+.++...+.+ ++| ||.|+. .+...+|+|+++||||| +.|.++..+|.+++||||||| T Consensus 387 ~~~~~~dL~kk~~~~~s~~e------~I~-KleDa~~ag~k~s~kctlIltEgdsaksLavsGl~vvgRd~~gVfplrgk 459 (842) T KOG0355 387 LNKYKDDLAKKRGKLTSRSK------GIP-KLEDANDAGTKTSEKCTLILTEGDSAKSLAVSGLSVVGRDYYGVFPLRGK 459 (842) T ss_pred HHHHHHHHHHHHHHHHCCCC------CHH-HHHHCCCCCCCCCCCEEEEEECCCCCHHHEECCCCCCCEEEEEECCCCCC T ss_conf 88898765444344433201------306-76523324434554316888615520332013555466046755011231 Q ss_pred CCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEE Q ss_conf 03344305876632366788887625787753321235673738998508888532899999999963077874895999 Q gi|254780814|r 457 ILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDINKLRYHKIIIMTDADVDGAHIRTLLLTFFFRQMPSLIHNGFLYI 536 (803) Q Consensus 457 ilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~~lrY~kiiimtDaD~DG~HI~~Llltff~~~~~~li~~g~~yi 536 (803) +|||++++.++|++|+||++|+.+||++..+.||+.+.||||++|||||||+|||||++||++|||++||.|+..++|+- T Consensus 460 lLNv~e~s~kqilkn~EI~aIikIlgl~~~~~y~~~eslryg~~mimtDQd~dgshikgllInf~~~~~psl~~~~flee 539 (842) T KOG0355 460 LLNVREASHKQILKNAEINAIIKILGLQYKKKYFDVESLRYGKLMIMTDQDHDGSHIKGLLINFIHHFWPSLLQIPFLEE 539 (842) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHH T ss_conf 34457777988877999999999998877533133122242224665123456434777889889876588635874210 Q ss_pred ECCCEEEEEECCCEEEEECHHHHHHHHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 43833788726717998145789899985001100001457765435468999999999999999864344579999988 Q gi|254780814|r 537 IKPPLYGITRGKSLQYVKDEESLEDYLINQSLSEEIELFCDSKGVISKNDLRNFIDDALKIDKLIKDFYPNHDKTFIEQA 616 (803) Q Consensus 537 a~ppl~~~~~~K~E~YikDD~EL~~~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~~~~~~~~l~~l~~~~~~~lle~l 616 (803) --||++|+++||+++|+.+++|++++--+.. T Consensus 540 fitpi~k~~~~~~~~~fys~pey~kw~~~~~------------------------------------------------- 570 (842) T KOG0355 540 FITPIVKATKGKQEVSFYSLPEYEKWKKNTD------------------------------------------------- 570 (842) T ss_pred CCCCCEEECCCCCCEEEECCHHHHHHHHCCC------------------------------------------------- T ss_conf 4776457404774314544565788875484------------------------------------------------- Q ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCCEEEEEEECCCCEEEEECHHHHCCCCHHHHHHHHHH Q ss_conf 74211351123577899999999887653100001100244567059999951853026646677416427899998888 Q gi|254780814|r 617 VISGVFNCESDQEINVSLASKLANRLNFIAEKSEKSWHGSINNSDILVKRIARGVQEQFVLEGSSIFSFGSKYKRIFSNR 696 (803) Q Consensus 617 ~~~~~l~~~~d~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~~i~~~~~~~Gv~~~~~l~~~~~~s~~~~~~~~~~~~ 696 (803) T Consensus 571 -------------------------------------------------------------------------------- 570 (842) T KOG0355 571 -------------------------------------------------------------------------------- 570 (842) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHCCCCEEEEECCCEEEECCHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHCCCCHHHCEEEEEEHHHHHHHHHH Q ss_conf 77511455027655825873998999999998630676286037777889557223038961340212367988988878 Q gi|254780814|r 697 IEENYLTILSLICNGDKIKITGPCSLLDTIFSIGRKGISMQRYKGLGEMNASQLWETTLNPDTRSLLRVKITDASQADDL 776 (803) Q Consensus 697 l~~~~~~~~~l~~~~~~~~i~s~~el~~~i~~~gkKG~sIQRYKGLGEMNPeQLWETTMDP~tRtLLqV~IeDa~~Ad~i 776 (803) +.++..|++|||||+|.|+|+||++||++.++.+.....+...+..- T Consensus 571 ---------------------------------~~~~~~ikyykglgts~~~e~keyf~~~~~h~~~F~~~~~~d~~~i~ 617 (842) T KOG0355 571 ---------------------------------NWKSWKIKYYKGLGTSTSKEAKEYFSDLDRHRILFKYDGGIDDEAID 617 (842) T ss_pred ---------------------------------CCCCCCEEEEECCCCCCHHHHHHHHHCHHHHEEEEECCCCCHHHHHH T ss_conf ---------------------------------42442237751477454789999861600123787525750477886 Q ss_pred HHHHCCCCCCCHHHHHHHHH Q ss_conf 65641899876289988524 Q gi|254780814|r 777 FSRLMGDEVEPRREFIQENS 796 (803) Q Consensus 777 F~~LMGD~VEPRReFIe~NA 796 (803) +...|+ ..+.||+++-+++ T Consensus 618 ~af~k~-k~~~rk~wL~~~~ 636 (842) T KOG0355 618 MAFSKK-KIDARKEWLRKYM 636 (842) T ss_pred HHCCCC-CHHHHHHHHHHHH T ss_conf 422310-0666776899865 No 11 >cd00822 TopoII_Trans_DNA_gyrase TopoIIA_Trans_DNA_gyrase: Transducer domain, having a ribosomal S5 domain 2-like fold, of the type found in proteins of the type IIA family of DNA topoisomerases similar to the B subunits of E. coli DNA gyrase and E. coli Topoisomerase IV which are heterodimers composed of two subunits. The type IIA enzymes are the predominant form of topoisomerase and are found in some bacteriophages, viruses and archaea, and in all bacteria and eukaryotes. All type IIA topoisomerases are related to each other at amino acid sequence level, though their oligomeric organization sometimes differs. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. TopoIIA enzymes also catenate/ decatenate duplex rings. E.coli DNA gyrase is a Probab=100.00 E-value=0 Score=391.64 Aligned_cols=172 Identities=50% Similarity=0.866 Sum_probs=168.9 Q ss_pred CCCHHHHHHHCCCCCCCCCCCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHH Q ss_conf 88357765300356556885203642003699973345317754121222246216667721035688899999999986 Q gi|254780814|r 227 GGIEAFVSYLDRHKKPLMQSPIRIQGSRDNISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEK 306 (803) Q Consensus 227 ~Gi~~fv~~l~~~~~~~~~~~i~~~~~~~~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~ 306 (803) |||++|+++++.++++++++||+++++.+++.|||||||+++|+|.++||||+|+|++||||++|||+||||+||+|++. T Consensus 1 gGi~dfv~~l~~~k~~l~~~~i~~~~~~~~~~vEvA~qw~~~~~e~i~sf~N~I~T~~GGTH~~Gfk~altr~i~~y~~~ 80 (172) T cd00822 1 GGLKDFVEELNKDKEPLHEEPIYIEGEKDGVEVEVALQWTDSYSENILSFVNNIPTPEGGTHETGFRAALTRAINDYAKK 80 (172) T ss_pred CCHHHHHHHHHCCCCCCCCCCEEEEEEECCEEEEEEEEECCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 98899999984699858899979998748989999998358888607860668467999852789999999999999998 Q ss_pred CCCCCCCCCCCCHHHHHCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 24200012357613430572156665213763357210020261111032111026899998740035799987789998 Q gi|254780814|r 307 SFGSKKDKRSVIGDDCREGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLDAS 386 (803) Q Consensus 307 ~~~~Kk~k~~~~~~Diregl~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~ 386 (803) ++++|+.+.+++++||||||+|||||+|+||||+||||+||+|++|+++|+++|.++|..||++||..|+.|+++++.++ T Consensus 81 ~~l~k~~~~~~~~dDi~egl~~iiSvki~~PqFegQTK~KL~s~ev~~~v~~~v~~~l~~~l~~np~~a~~Ii~k~i~~a 160 (172) T cd00822 81 NNLLKKKDVKLTGDDIREGLTAVISVKVPEPQFEGQTKDKLGNSEVRSIVESAVREALEEWLEENPEEAKKILEKAILAA 160 (172) T ss_pred HCCCCCCCCCCCHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHH T ss_conf 25023354789999996436488552536882266220521788899999999999999999989999999999999999 Q ss_pred HHHHHHHHHHHH Q ss_conf 877554553211 Q gi|254780814|r 387 MVRDAARRARDL 398 (803) Q Consensus 387 ~ar~~a~kar~~ 398 (803) ++|++|+||||+ T Consensus 161 ~aR~~akkare~ 172 (172) T cd00822 161 KAREAARKAREL 172 (172) T ss_pred HHHHHHHHHCCC T ss_conf 999999973479 No 12 >cd03366 TOPRIM_TopoIIA_GyrB TOPRIM_TopoIIA_GyrB: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in proteins of the type IIA family of DNA topoisomerases similar to the Escherichia coli GyrB subunit. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. These proteins also catenate/ decatenate duplex rings. DNA gyrase is more effective at relaxing supercoils than decatentating DNA. DNA gyrase in addition inserts negative supercoils in the presence of ATP. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). The conserved glutamate may act as a general base in strand joining and as a general acid in strand cleava Probab=100.00 E-value=0 Score=391.90 Aligned_cols=114 Identities=79% Similarity=1.209 Sum_probs=112.0 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 17999825575432233465443023300562033443058766323667888876257877533212356737389985 Q gi|254780814|r 425 SELFLVEGDSAGGSAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDINKLRYHKIIIMT 504 (803) Q Consensus 425 ~eL~ivEGDSAggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~~lrY~kiiimt 504 (803) +||||||||||+|||||||||+||||||||||+|||++++.+|+++|+||++|++||||++|++ ||+++|||||||||| T Consensus 1 seL~l~EGDSA~g~a~~~Rd~~~qgv~PLrGK~lNv~~~~~~ki~~N~Ei~~l~~~lGl~~~~~-~d~~~LRYgkIiImt 79 (114) T cd03366 1 SELYIVEGDSAGGSAKQGRDRRFQAILPLRGKILNVEKARLDKILKNEEIRALITALGTGIGED-FDLEKLRYHKIIIMT 79 (114) T ss_pred CEEEEECCCCCHHHHHCCCCCCCEEEEECCCCCHHHHHCCHHHHHCCHHHHHHHHHHCCCCCCC-CCHHHCCCCEEEEEE T ss_conf 9289972767405565217878689987078500064287878862967999999968787864-540007878699996 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECC Q ss_conf 08888532899999999963077874895999438 Q gi|254780814|r 505 DADVDGAHIRTLLLTFFFRQMPSLIHNGFLYIIKP 539 (803) Q Consensus 505 DaD~DG~HI~~Llltff~~~~~~li~~g~~yia~p 539 (803) |||+||+||++|||||||||||+|+++||||||+| T Consensus 80 DaD~DG~HI~~Lli~ff~~~~p~Li~~G~vyia~P 114 (114) T cd03366 80 DADVDGAHIRTLLLTFFFRYMRPLIENGHVYIAQP 114 (114) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECC T ss_conf 89988068999999999998499998694788369 No 13 >cd01030 TOPRIM_TopoIIA_like TOPRIM_TopoIIA_like: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in proteins of the type IIA family of DNA topoisomerases similar to Saccharomyces cerevisiae Topoisomerase II. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. These proteins also catenate/ decatenate duplex rings. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). The conserved glutamate may act as a general base in strand joining and as a general acid in strand cleavage by topisomerases. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=100.00 E-value=0 Score=384.82 Aligned_cols=115 Identities=70% Similarity=1.108 Sum_probs=113.9 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 17999825575432233465443023300562033443058766323667888876257877533212356737389985 Q gi|254780814|r 425 SELFLVEGDSAGGSAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDINKLRYHKIIIMT 504 (803) Q Consensus 425 ~eL~ivEGDSAggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~~lrY~kiiimt 504 (803) |||||||||||+|||++||||.+|||||||||+|||++++.+++++|+||.+|++||||++|++++|+++|||||||||| T Consensus 1 c~L~l~EGdSA~g~~~~~rd~~~~gv~pLrGK~lNv~~~~~~ki~~N~Ei~~l~~~LGl~~~~~~~~~~~LRYgkIii~t 80 (115) T cd01030 1 CELILVEGDSAGGSAKQGRDRVFQAVFPLRGKILNVEKASLKKILKNEEIQNIIKALGLGIGKDDFDLDKLRYGKIIIMT 80 (115) T ss_pred CEEEEECCCCCHHHHHHHCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCHHHCCCCCEEEEE T ss_conf 97999647453455542048675899852781466666889898709789999999677989653536348627448987 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECC Q ss_conf 08888532899999999963077874895999438 Q gi|254780814|r 505 DADVDGAHIRTLLLTFFFRQMPSLIHNGFLYIIKP 539 (803) Q Consensus 505 DaD~DG~HI~~Llltff~~~~~~li~~g~~yia~p 539 (803) |||+||+||++|||||||||||+|+++|||||||| T Consensus 81 DaD~DG~HI~~Lll~ff~~~~P~L~~~G~vyi~~T 115 (115) T cd01030 81 DADVDGSHIRTLLLTFFYRFWPSLLENGFLYIAQT 115 (115) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEEEEC T ss_conf 89997579999999999998199998791799859 No 14 >pfam00204 DNA_gyraseB DNA gyrase B. This family represents the second domain of DNA gyrase B which has a ribosomal S5 domain 2-like fold. This family is structurally related to PF01119. Probab=100.00 E-value=0 Score=376.12 Aligned_cols=170 Identities=47% Similarity=0.781 Sum_probs=164.3 Q ss_pred CCHHHHHHHCCCCCCCCCCCEEEEEEEE---EEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHH Q ss_conf 8357765300356556885203642003---6999733453177541212222462166677210356888999999999 Q gi|254780814|r 228 GIEAFVSYLDRHKKPLMQSPIRIQGSRD---NISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYV 304 (803) Q Consensus 228 Gi~~fv~~l~~~~~~~~~~~i~~~~~~~---~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~ 304 (803) ||++|+++++.+++++|++|+++.++.+ ++.||||+||+++|+|+++||||+|+|++|||||+|||+||+|+||+|+ T Consensus 1 Gi~~fv~~l~~~k~~l~~~~~~~~~~~~~~~~~~VevA~q~~~~~~e~~~sfvN~I~T~~GGTH~~gf~~altr~i~~~~ 80 (173) T pfam00204 1 GLKDYVKELNKDKTPLHPEVIYFEGEEAFDDRIEVEVALQWNDSYSENIVSFVNNIPTPDGGTHVDGFRAALTRALNEYA 80 (173) T ss_pred CHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCEEEEEEEECCCCCCEEEEEECCEECCCCCCHHHHHHHHHHHHHHHHH T ss_conf 98999999856999788988489856527899389999995288875388768886999999789999999999999999 Q ss_pred HHCCCCCCCCCCCCHHHHHCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 86242000123576134305721566652137633572100202611110321110268999987400357999877899 Q gi|254780814|r 305 EKSFGSKKDKRSVIGDDCREGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLD 384 (803) Q Consensus 305 ~~~~~~Kk~k~~~~~~Diregl~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~ 384 (803) ++++++|+.. +++++||||||++||||+|+||+|+||||+||+|++++++|++++++.|..||++||..++.|+++++. T Consensus 81 ~k~~~~k~~~-~i~~~diregl~~visvki~~P~FegQTK~kL~~~ev~~~v~~~v~~~l~~~l~~np~~a~~Ii~k~~~ 159 (173) T pfam00204 81 KKKGLLKKKD-KITGDDIREGLTAVVSVKIPNPQFEGQTKEKLGNSEVRSAVESVVSEKFAKFLEKNPEIAKKILEKALA 159 (173) T ss_pred HHHCCCCCCC-CCCHHHHHCCEEEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHH T ss_conf 9808764567-668999970628999995069963573026427988999999999999999999899999999999999 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 98877554553211 Q gi|254780814|r 385 ASMVRDAARRARDL 398 (803) Q Consensus 385 a~~ar~~a~kar~~ 398 (803) +++||++|||||++ T Consensus 160 ~~~aR~aakkare~ 173 (173) T pfam00204 160 AAKARLAARKAREL 173 (173) T ss_pred HHHHHHHHHHHCCC T ss_conf 99999999985379 No 15 >cd03365 TOPRIM_TopoIIA TOPRIM_TopoIIA: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in proteins of the type IIA family of DNA topoisomerases similar to Saccharomyces cerevisiae Topoisomerase II. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. These proteins also catenate/ decatenate duplex rings. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). This glutamate and two aspartates, cluster together to form a highly acid surface patch. The conserved glutamate may act as a general base in strand joining and as a general acid in strand cleavage by topisomerases. The DXD motif may co-ordinate Mg2+, a cofact Probab=100.00 E-value=2.9e-43 Score=355.03 Aligned_cols=112 Identities=40% Similarity=0.661 Sum_probs=106.8 Q ss_pred EEEEEEECCCCCCCCCCCC---CCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCC-CCCCCCCCCCE Q ss_conf 1799982557543223346---5443023300562033443058766323667888876257877533-21235673738 Q gi|254780814|r 425 SELFLVEGDSAGGSAKQGR---SRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDS-FDINKLRYHKI 500 (803) Q Consensus 425 ~eL~ivEGDSAggsak~gR---dr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~-~~~~~lrY~ki 500 (803) |||||||||||+||||||| ||.+|||||||||+||||+++.+++++|+||++|++||||++|+++ +|+++|||||| T Consensus 1 CtL~l~EGDSA~g~a~~g~~~~~r~~~gv~PLrGK~lNv~~~~~~ki~~N~Ei~~l~~~LGl~~g~~~~~d~~~LRYgkI 80 (120) T cd03365 1 CTLILTEGDSAKALAVAGLSVVGRDYYGVFPLRGKLLNVREASHKQIMENAEIQNIKKILGLQHGKSDYESTKSLRYGRL 80 (120) T ss_pred CEEEEECCCCCHHHHHHCCCCCCCCEEEEEECCCEECCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCHHHCCCCEE T ss_conf 97999768474778872246558751789741887611522768777528889999999856767665565212788858 Q ss_pred EEEECCCCCCCCHHHHHHHHHHHHHHHHHH-CCEEEE Q ss_conf 998508888532899999999963077874-895999 Q gi|254780814|r 501 IIMTDADVDGAHIRTLLLTFFFRQMPSLIH-NGFLYI 536 (803) Q Consensus 501 iimtDaD~DG~HI~~Llltff~~~~~~li~-~g~~yi 536 (803) +||||||+||+||++|||||||||||+||+ .|+|.. T Consensus 81 iI~tDaDvDG~HI~~LlltfF~~~~P~Li~~~gFi~~ 117 (120) T cd03365 81 MIMTDQDHDGSHIKGLLINFIHSFWPSLLKIPGFLVE 117 (120) T ss_pred EEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 9987898865789999999999981998655685665 No 16 >pfam00986 DNA_gyraseB_C DNA gyrase B subunit, carboxyl terminus. The amino terminus of eukaryotic and prokaryotic DNA topoisomerase II are similar, but they have a different carboxyl terminus. The amino-terminal portion of the DNA gyrase B protein is thought to catalyse the ATP-dependent super-coiling of DNA. See pfam00204. The carboxyl-terminal end supports the complexation with the DNA gyrase A protein and the ATP-independent relaxation. This family also contains Topoisomerase IV. This is a bacterial enzyme that is closely related to DNA gyrase,. Probab=99.94 E-value=3.6e-28 Score=237.81 Aligned_cols=64 Identities=64% Similarity=1.132 Sum_probs=62.6 Q ss_pred HHCCCEEEECCCCCCCCHHHHHHCCCCHHHCEEEEEEHHHHHHHHHHHHHHCCCCCCCHHHHHH Q ss_conf 3067628603777788955722303896134021236798898887865641899876289988 Q gi|254780814|r 730 GRKGISMQRYKGLGEMNASQLWETTLNPDTRSLLRVKITDASQADDLFSRLMGDEVEPRREFIQ 793 (803) Q Consensus 730 gkKG~sIQRYKGLGEMNPeQLWETTMDP~tRtLLqV~IeDa~~Ad~iF~~LMGD~VEPRReFIe 793 (803) ++++++||||||||||||+|||||||||++|+|+||+++|+.+|+++|++|||++|+|||+||| T Consensus 2 ~~~k~~I~RfKGLGEM~p~qL~eTTmdp~~R~L~~V~~~d~~~~~~~~~~LMG~~~~~R~~~I~ 65 (65) T pfam00986 2 GNKKVEIQRFKGLGEMNPEQLWETTMDPETRRLLQVTIEDAEEADEMFSTLMGKDVEPRREFIE 65 (65) T ss_pred CCCCCEEEECCCCCCCCHHHHHHHCCCHHCCEEEEEECCCHHHHHHHHHHHCCCCCHHHHHHCC T ss_conf 9887535753576779999999873383106789886388899999999983999689897639 No 17 >cd03481 TopoIIA_Trans_ScTopoIIA TopoIIA_Trans_ScTopoIIA: Transducer domain, having a ribosomal S5 domain 2-like fold, of the type found in proteins of the type IIA family of DNA topoisomerases similar to Saccharomyces cerevisiae Topo IIA. S. cerevisiae Topo IIA is a homodimer encoded by a single gene. The type IIA enzymes are the predominant form of topoisomerase and are found in some bacteriophages, viruses and archaea, and in all bacteria and eukaryotes. All type IIA topoisomerases are related to each other at amino acid sequence level, though their oligomeric organization sometimes differs. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. TopoIIA enzymes also catenate/ decatenate duplex rings. This transducer domain is homologous to t Probab=99.88 E-value=3.8e-22 Score=191.06 Aligned_cols=130 Identities=22% Similarity=0.287 Sum_probs=96.9 Q ss_pred CCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCC Q ss_conf 52036420036999733453177541212222462166677210356888999999999862420001235761343057 Q gi|254780814|r 246 SPIRIQGSRDNISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSVIGDDCREG 325 (803) Q Consensus 246 ~~i~~~~~~~~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~~~~Direg 325 (803) .++.++... -+||||++.+++..+++ ||||||+|.+|||||+++..+|++.+.+++.+. +|.++++++++||++ T Consensus 22 ~~~~~e~~~--~rWeV~v~~s~~~f~qv-SFVNgI~T~~GGtHVd~i~~~I~~~l~~~i~kk---~K~~~~ik~~~Ik~~ 95 (153) T cd03481 22 PPVVYEPVN--DRWEVAVALSDGQFQQV-SFVNSIATTKGGTHVDYVADQIVKKLDEVVKKK---NKGGINVKPFQVKNH 95 (153) T ss_pred CCEEEECCC--CCEEEEEEECCCCCEEE-EEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHH---CCCCCCCCHHHHHHH T ss_conf 977980379--98899999879982798-886660069998666999999999999999985---678877799999844 Q ss_pred CEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2156665213763357210020261111032111026899998740035799987789998 Q gi|254780814|r 326 FTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLDAS 386 (803) Q Consensus 326 l~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~ 386 (803) |++||+|.|+||+|+||||++|+++..+=-....+.++|.+.+... .|+++++..| T Consensus 96 l~ifv~~~I~NP~FdSQTKe~Ltt~~~~FGsk~~ls~~fi~k~~k~-----~i~~~v~~~a 151 (153) T cd03481 96 LWIFVNCLIENPSFDSQTKETLTTKPKSFGSKCELSEKFLKKAVKS-----GIVESVLSWA 151 (153) T ss_pred CEEEEEEEECCCCCCCHHHHHHCCCHHHCCCCCCCCHHHHHHHHHC-----HHHHHHHHHH T ss_conf 1799999826998662123242485565698766799999999834-----1999999886 No 18 >PRK04184 DNA topoisomerase VI subunit B; Validated Probab=99.40 E-value=1.6e-10 Score=100.75 Aligned_cols=187 Identities=21% Similarity=0.298 Sum_probs=126.3 Q ss_pred HHHCEECCCCCHHCCCCCCEECCCCCCCHHHHHHHHHHHHHHHHH-HHCCCCEEEEEECC-C---CEEEEEECCCCCCCC Q ss_conf 655123135420004587154327996325445656755578868-60779757999838-9---839999768445364 Q gi|254780814|r 12 ADSIQILKGLDAVKKRPSMYIGDTDGESGLHHMIYEVLDNAIDES-LEGYADIITVTLNM-D---GSCTVSDNGRGIPTD 86 (803) Q Consensus 12 ~~~i~~L~gle~vrkRP~mYiG~t~~~~Gl~hlv~EiidNsvDe~-~~g~~~~I~V~i~~-d---gsisV~dnGrGIPv~ 86 (803) +++.+.++--|=-+|-..|- |=.+..++|...|.|.||||.|-+ -+|---.|.|+|.. | -.|+|.|||.|||-+ T Consensus 8 ~~~~~~iS~aEFF~kNk~~l-GFd~~~ral~ttvkE~VdNsLDAcEe~gILPdI~v~I~~~~~~~y~v~VeDNGpGI~~~ 86 (533) T PRK04184 8 AEKFREISVAEFFEKNKHML-GFDSPARALYTTVKELVDNSLDACEEAGILPDIKVEIRRVGEDYYRVIVEDNGPGIPPE 86 (533) T ss_pred HHHHHCCCHHHHHHHCHHHC-CCCCCCHHHHHHHHHHHHCCHHHHHHHCCCCCEEEEEEECCCCEEEEEEECCCCCCCHH T ss_conf 98744179999988575641-87985317999999997331766886088986599999837856999994488998867 Q ss_pred CCCCCCCCEEEEEEEEECCCCCCCCCCCEEECCCCCCEEEEEEECCCEE-----EEEEEECCEE--EEEEE----ECCEE Q ss_conf 0777885001256425011323289860483116752037786326449-----9999978959--99998----89836 Q gi|254780814|r 87 LHKEEGISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGVSVVNALSSWL-----KLRIKREGNI--YEMSF----INGIL 155 (803) Q Consensus 87 ~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~s~vNalS~~~-----~v~v~rdgk~--~~q~f----~~g~~ 155 (803) . +| .|||.|++||||..- +-|=|.-|.|+|.+-.+|..- .|.+...+.. |..+. .+|.| T Consensus 87 ~-----ip---~vFGk~L~GSKf~~~--rQsRGqqGiGis~avlysQlTTGkP~~V~S~~~~~~~a~~~~l~iD~~kNep 156 (533) T PRK04184 87 Q-----IP---KVFGKLLYGSKFHRL--RQSRGQQGIGISAAVLYAQMTTGKPVRVISSTGGSKKAYEFELKIDTEKNEP 156 (533) T ss_pred H-----HH---HHHHHHHCCCCCCHH--HHCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCEEEEEEEEEECCCCCC T ss_conf 8-----88---999888502532402--1147753524788877763126886599964799852699999967577873 Q ss_pred CCCC-EEECCCCCCCCEEEEEEECHHHHHHHHCCHHHHHHHHHHHHCCCCEEEEEEEEC Q ss_conf 4631-387327788870899987788854210008999999997311464169999963 Q gi|254780814|r 156 DNPL-VVTGSAGNDTGTEVTFLPSSDIFSVQDFNYDTLQHRLRELSFLNSSVQISLIDK 213 (803) Q Consensus 156 ~~~l-~~~~~~~~~~GT~I~F~PD~~iF~~~~~~~d~l~~R~~~~A~Ln~gl~i~l~De 213 (803) .--. .........+||+|++.-.-.+-.. -.-+.+.++..|..||-.+|+|.|- T Consensus 157 ~i~~~~~~~~~~~~hGt~Ve~~ieg~~~~~----r~~v~eYlr~TaIvnPhA~I~f~~P 211 (533) T PRK04184 157 EILEREERDNWKRWHGTRVELELEGNWYRA----KQGIYEYLKRTAIVNPHARITLKDP 211 (533) T ss_pred EEECCCEECCCCCCCCEEEEEEEEEEHHHH----HCCHHHHHHHHHCCCCCEEEEEECC T ss_conf 453341204777888439999997641544----2369999976510378449999889 No 19 >COG1389 DNA topoisomerase VI, subunit B [DNA replication, recombination, and repair] Probab=99.34 E-value=1.9e-10 Score=100.30 Aligned_cols=185 Identities=22% Similarity=0.293 Sum_probs=128.9 Q ss_pred CEECCCCCHHCCCCCCEECCCCCCCHHHHHHHHHHHHHHHHH-HHCCCCEEEEEECCC----CEEEEEECCCCCCCCCCC Q ss_conf 123135420004587154327996325445656755578868-607797579998389----839999768445364077 Q gi|254780814|r 15 IQILKGLDAVKKRPSMYIGDTDGESGLHHMIYEVLDNAIDES-LEGYADIITVTLNMD----GSCTVSDNGRGIPTDLHK 89 (803) Q Consensus 15 i~~L~gle~vrkRP~mYiG~t~~~~Gl~hlv~EiidNsvDe~-~~g~~~~I~V~i~~d----gsisV~dnGrGIPv~~h~ 89 (803) .+.++--|--||.+.|- |=++..++|...|.|.|+||.|-+ .+|---.|.|.|..- -.|.|.|||.|||...-| T Consensus 12 ~re~SvAEFF~kNk~ml-Gf~~p~RsL~~tv~ElV~NSLDA~eeaGILPdI~v~I~~~~~d~y~v~veDNGpGIP~e~IP 90 (538) T COG1389 12 FRELSVAEFFRKNKEML-GFDGPIRSLTTTVHELVTNSLDACEEAGILPDIKVEIERIGKDHYKVIVEDNGPGIPEEQIP 90 (538) T ss_pred HHHCCHHHHHHHCHHHC-CCCCCHHHHHHHHHHHHHCCHHHHHHCCCCCCEEEEEEECCCCEEEEEEECCCCCCCHHHHH T ss_conf 33269999987488861-89973068999999998531656876088876599998547755899993489999856757 Q ss_pred CCCCCEEEEEEEEECCCCCCCCCCCEEECCCCCCEEEEEEECCCEE-----EEEEEEC--CEEEEEEEECCEECCCCEEE Q ss_conf 7885001256425011323289860483116752037786326449-----9999978--95999998898364631387 Q gi|254780814|r 90 EEGISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGVSVVNALSSWL-----KLRIKRE--GNIYEMSFINGILDNPLVVT 162 (803) Q Consensus 90 ~~~~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~s~vNalS~~~-----~v~v~rd--gk~~~q~f~~g~~~~~l~~~ 162 (803) .|||.+++|+||+- -+.|-|.-|.|+|.+--+|-.- .|.+... ++.|...-.=........++ T Consensus 91 --------kvFGk~LygSKfh~--~~QsRGqqGiGis~avLysQmTtGkPv~V~s~T~~s~~~~~~~l~id~~kNEp~Iv 160 (538) T COG1389 91 --------KVFGKMLYGSKFHR--NIQSRGQQGIGISAAVLYSQMTTGKPVRVISSTGDSGTAYEYELKIDVQKNEPEIV 160 (538) T ss_pred --------HHHHHHHCCCHHHH--HHHCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCEEEEEEEECCCCCCCHHE T ss_conf --------89988860232323--44325532424789999987517986699953788862489999951788876200 Q ss_pred -----CCCCCCCCEEEEEEECHHHHHHHHCCHHHHHHHHHHHHCCCCEEEEEEEEC Q ss_conf -----327788870899987788854210008999999997311464169999963 Q gi|254780814|r 163 -----GSAGNDTGTEVTFLPSSDIFSVQDFNYDTLQHRLRELSFLNSSVQISLIDK 213 (803) Q Consensus 163 -----~~~~~~~GT~I~F~PD~~iF~~~~~~~d~l~~R~~~~A~Ln~gl~i~l~De 213 (803) .....-+||+|.-.-+-........ -++..++..|..||--+|.|.|- T Consensus 161 ~r~~~~~~~~~hGT~Vel~~~~~~~~~~~q---gi~eYlkrTaiinPhA~I~l~dP 213 (538) T COG1389 161 ERGEVENPGGWHGTRVELELKGVWYRAKRQ---GIYEYLKRTAIINPHARIVLKDP 213 (538) T ss_pred ECCCCCCCCCCCCEEEEEEECCCCHHHCCC---CHHHHHHHHHHCCCCEEEEEECC T ss_conf 002224788877507999955530542224---79999988753387417999889 No 20 >TIGR01052 top6b DNA topoisomerase VI, B subunit; InterPro: IPR005734 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . This entry represents subunit B of topoisomerase VI, a type IIB topoisomerase found predominantly in archaea, but also in a few eukayotes, such as the plant Arabidopsis thaliana . This enzyme assembles as a heterotetramer, consisting of two A subunits required for DNA cleavage and two B subunits required for ATP hydrolysis. The B subunit is structurally similar to the ATPase domain of type IIA topoisomerases, but the A subunit is distinct, and instead shares homology with the Spo11 protein that mediates double-strand DNA breaks during meiotic recombination in eukaryotes . Therefore, though related to type IIA topoisomerases, topoisomerase VI may have a distinctive mechanism of action. More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0006265 DNA topological change, 0006268 DNA unwinding during replication, 0005694 chromosome. Probab=99.04 E-value=1.3e-09 Score=93.66 Aligned_cols=168 Identities=21% Similarity=0.312 Sum_probs=113.1 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHH-HHCCCCEEEEEECC-CC----------EEEEEECCCCCCCCCCCCCCCCEEEEEE Q ss_conf 327996325445656755578868-60779757999838-98----------3999976844536407778850012564 Q gi|254780814|r 33 GDTDGESGLHHMIYEVLDNAIDES-LEGYADIITVTLNM-DG----------SCTVSDNGRGIPTDLHKEEGISAAEVIM 100 (803) Q Consensus 33 G~t~~~~Gl~hlv~EiidNsvDe~-~~g~~~~I~V~i~~-dg----------sisV~dnGrGIPv~~h~~~~~~~~E~i~ 100 (803) |=|.-.+.|..+|.|.|+||.|-+ -||===.|.|.|.+ |+ .|.|.|||.|||.+.-| .|| T Consensus 21 Gy~gkiRsL~~~ihElVtNSLDAcEEAGILP~I~v~I~~~g~seGrGGr~HY~v~VEDNGPGIP~~~IP--------kVF 92 (662) T TIGR01052 21 GYSGKIRSLTTVIHELVTNSLDACEEAGILPDIKVEIEKIGKSEGRGGRDHYKVVVEDNGPGIPEEYIP--------KVF 92 (662) T ss_pred CCCCHHHHHHHHHHHHHHCHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCCEEEEEEECCCCCCCCCCC--------CHH T ss_conf 773606889999999971424676643799840589898527778999962079984078899700013--------132 Q ss_pred EEECCCCCCCCCCCEEECCCCCCEEEEEEECCCEEE-----EEEEE--CCEEE--EEEEECCEECCCCEEEC-----CCC Q ss_conf 250113232898604831167520377863264499-----99997--89599--99988983646313873-----277 Q gi|254780814|r 101 TRLHAGGKFNQKSYKVSGGLHGVGVSVVNALSSWLK-----LRIKR--EGNIY--EMSFINGILDNPLVVTG-----SAG 166 (803) Q Consensus 101 t~L~ag~kfd~~~yk~sgGlhGvG~s~vNalS~~~~-----v~v~r--dgk~~--~q~f~~g~~~~~l~~~~-----~~~ 166 (803) |.++|||||-.. +-|=|.-|.|+|.+=-||---+ |-+.. ||..| .+.-.=+.......++. ... T Consensus 93 GKmLaGSK~h~~--iQSRGqQGiG~sgavL~~Q~TTGKP~~VIs~T~~nGe~~~~~~~l~idV~KNEg~Ivek~eee~~~ 170 (662) T TIGR01052 93 GKMLAGSKFHRI--IQSRGQQGIGISGAVLYAQITTGKPVKVISSTGENGEIYAKKVKLKIDVKKNEGEIVEKGEEEWEK 170 (662) T ss_pred HHHCCCCEEEEE--EECCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCEEEEEEEEEEEEEECCCEEEECCCEEECC T ss_conf 021114400033--206785156799999987614798458880568675615899999985332587343012101417 Q ss_pred CC--CCEEEEEEECHHHHHHHHCCHHHHHHHHHHHHCCCCEEEEEEEEC Q ss_conf 88--870899987788854210008999999997311464169999963 Q gi|254780814|r 167 ND--TGTEVTFLPSSDIFSVQDFNYDTLQHRLRELSFLNSSVQISLIDK 213 (803) Q Consensus 167 ~~--~GT~I~F~PD~~iF~~~~~~~d~l~~R~~~~A~Ln~gl~i~l~De 213 (803) .+ +||+|+.--.-......++ -++..+|..|.-||--+|.|.|- T Consensus 171 ~~~W~GTriEle~~~v~y~~~~~---g~yEYlrrt~~anPHA~I~l~~P 216 (662) T TIGR01052 171 PGSWRGTRIELEVKDVSYRRSKQ---GVYEYLRRTAVANPHAKIVLVEP 216 (662) T ss_pred CCCCEEEEEEEEEECCEEECCCC---CHHHHHHHHHHHCCCCEEEEECC T ss_conf 99813689999982225531113---44589899875186408998869 No 21 >PRK00095 mutL DNA mismatch repair protein; Reviewed Probab=98.86 E-value=2e-07 Score=76.82 Aligned_cols=229 Identities=17% Similarity=0.281 Sum_probs=135.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCC--CE Q ss_conf 254456567555788686077975799983898--39999768445364077788500125642501132328986--04 Q gi|254780814|r 40 GLHHMIYEVLDNAIDESLEGYADIITVTLNMDG--SCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKS--YK 115 (803) Q Consensus 40 Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dg--sisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~--yk 115 (803) -.-..|.|+|.||+|- -|+.|+|.|...| .|.|.|||.||+-+- +++.|.. ||-||-.+-+ +. T Consensus 22 rpasvVKELvENsiDA----gAt~I~V~i~~~G~~~I~V~DnG~Gi~~~d--------~~l~~~r-haTSKi~~~~DL~~ 88 (612) T PRK00095 22 RPASVVKELVENALDA----GATRIDIEIEEGGLKLIRVRDNGCGIPKED--------LPLALSR-HATSKIASLDDLEA 88 (612) T ss_pred CHHHHHHHHHHHHHCC----CCCEEEEEEECCCCCEEEEEECCCCCCHHH--------HHHHHHC-CCCCCCCCHHHHHH T ss_conf 8899999998413317----998799999749831799997999989899--------9999757-27767898456611 Q ss_pred E-ECCCCCCEEEEEEECCCEEEEEEEECC--EEEEEEEECCEECCCCEEECCCCCCCCEEEE----EEECHHHHH---HH Q ss_conf 8-311675203778632644999999789--5999998898364631387327788870899----987788854---21 Q gi|254780814|r 116 V-SGGLHGVGVSVVNALSSWLKLRIKREG--NIYEMSFINGILDNPLVVTGSAGNDTGTEVT----FLPSSDIFS---VQ 185 (803) Q Consensus 116 ~-sgGlhGvG~s~vNalS~~~~v~v~rdg--k~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~----F~PD~~iF~---~~ 185 (803) + |-|--|=-.+...+.| .+++.+++.+ .-|...++.|..... ..++...||+|+ |.-=|.|-. .. T Consensus 89 i~T~GFRGEAL~SIa~vs-~~~i~s~~~~~~~g~~i~~~~g~~~~~----~~~~~~~GT~V~V~~LF~N~PaRrKfLKs~ 163 (612) T PRK00095 89 IRTLGFRGEALPSIASVS-RLTLTSRTADAAEGWQIVVEGGEIVEV----KPAAHPVGTTIEVRDLFFNTPARRKFLKSE 163 (612) T ss_pred CCCCCCCHHHHHHHHHHE-EEEEEECCCCCCCEEEEEEECCEECCE----EECCCCCCCEEEECHHHCCCHHHHHHHCCC T ss_conf 767876188899877326-789996789976728999949967532----333689984999604525576889875798 Q ss_pred HCCHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCC--CCHHHHHHHCCCCCCCCCCCEEEEEEEEEEEEEEEE Q ss_conf 000899999999731146416999996358642101001268--835776530035655688520364200369997334 Q gi|254780814|r 186 DFNYDTLQHRLRELSFLNSSVQISLIDKRSPEPREITMFYKG--GIEAFVSYLDRHKKPLMQSPIRIQGSRDNISIDLAM 263 (803) Q Consensus 186 ~~~~d~l~~R~~~~A~Ln~gl~i~l~Der~~~~~~~~f~~~~--Gi~~fv~~l~~~~~~~~~~~i~~~~~~~~~~veval 263 (803) .-....+.+.+..+|..+|.+.+.|... .+..|.+.+ .+.+-+..+-+.. +....+.++.+.+++.|...+ T Consensus 164 ~~E~~~I~~~v~~~AL~~P~I~f~l~~~-----~k~~~~~~~~~~l~~ri~~i~G~~--~~~~li~v~~~~~~~~i~G~i 236 (612) T PRK00095 164 KTELGHIDDVVRRLALAHPDVAFTLTHN-----GKLVLQTRANGDLLQRLAAILGRD--FAENALPIDAEHGDLRLSGYV 236 (612) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEEEC-----CEEEECCCCCCCHHHHHHHHHCHH--HHHHCEEEEEECCCEEEEEEE T ss_conf 1799999999999975179758999989-----999723588878999999984989--896277986016976999999 Q ss_pred EE---CCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHH Q ss_conf 53---1775412122224621666772103568889999999 Q gi|254780814|r 264 RW---NDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITS 302 (803) Q Consensus 264 ~~---~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~ 302 (803) .- +-+....++-|||+-+-.+ ..|.++|++ T Consensus 237 s~P~~~r~~r~~q~~FVNgR~V~~---------~~l~~aI~~ 269 (612) T PRK00095 237 GLPTLSRANRDYQYLFVNGRYVRD---------KLLNHAIRQ 269 (612) T ss_pred ECCHHCCCCCCCEEEEECCCCCCC---------HHHHHHHHH T ss_conf 583120367762799987840478---------999999999 No 22 >cd00329 TopoII_MutL_Trans MutL_Trans: transducer domain, having a ribosomal S5 domain 2-like fold, conserved in the C-terminal domain of type II DNA topoisomerases (Topo II) and DNA mismatch repair (MutL/MLH1/PMS2) proteins. This transducer domain is homologous to the second domain of the DNA gyrase B subunit, which is known to be important in nucleotide hydrolysis and the transduction of structural signals from ATP-binding site to the DNA breakage/reunion regions of the enzymes. The GyrB dimerizes in response to ATP binding, and is homologous to the N-terminal half of eukaryotic Topo II and the ATPase fragment of MutL. Type II DNA topoisomerases catalyze the ATP-dependent transport of one DNA duplex through another, in the process generating transient double strand breaks via covalent attachments to both DNA strands at the 5' positions. Included in this group are proteins similar to human MLH1 and PMS2. MLH1 forms a heterodimer with PMS2 which functions in meiosis and in DNA mismatch Probab=98.55 E-value=1.8e-07 Score=77.17 Aligned_cols=99 Identities=29% Similarity=0.492 Sum_probs=78.0 Q ss_pred HHHHHHHCCCCCCCCCCCEEEEEEEEEEEEEEEEEECCC---CCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHH Q ss_conf 577653003565568852036420036999733453177---54121222246216667721035688899999999986 Q gi|254780814|r 230 EAFVSYLDRHKKPLMQSPIRIQGSRDNISIDLAMRWNDG---YHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEK 306 (803) Q Consensus 230 ~~fv~~l~~~~~~~~~~~i~~~~~~~~~~veval~~~~~---~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~ 306 (803) ++++..+.+.+. .+..+.++.+.+++.|+..++|.+. ....+++|||++++.+||+|..+++.|+.+.++ T Consensus 2 ~~~i~~i~g~~~--~~~l~~v~~~~~~~~i~g~i~~~~~~~~~~~~q~~fVN~R~v~~~~~~~~ai~~~y~~~~~----- 74 (107) T cd00329 2 KDRLAEILGDKV--ADKLIYVEGESDGFRVEGAISYPDSGRSSKDRQFSFVNGRPVREGGTHVKAVREAYTRALN----- 74 (107) T ss_pred HHHHHHHHCHHH--HHHCEEEEEECCCEEEEEEEECCCCCCCCCCCEEEEECCEEECCCHHHHHHHHHHHHHHHH----- T ss_conf 779998819898--9717799875499899999957853567785479999784860776999999999999861----- Q ss_pred CCCCCCCCCCCCHHHHHCCCEEEEEEECCC--CCCC-CCCHHHH Q ss_conf 242000123576134305721566652137--6335-7210020 Q gi|254780814|r 307 SFGSKKDKRSVIGDDCREGFTSVLSIKMPD--PRFS-SQTKEKL 347 (803) Q Consensus 307 ~~~~Kk~k~~~~~~Diregl~~visvki~n--P~F~-gQTK~kL 347 (803) +.+.+.+..+++++.++. |.|. +|||+++ T Consensus 75 ------------~~~~~~~p~~vl~i~~~~~~vDvNvhP~K~ev 106 (107) T cd00329 75 ------------GDDVRRYPVAVLSLKIPPSLVDVNVHPTKEEV 106 (107) T ss_pred ------------CCCCCCCCEEEEEEEECCHHEEECCCCCCCCC T ss_conf ------------46789887899999848153872749983413 No 23 >COG0323 MutL DNA mismatch repair enzyme (predicted ATPase) [DNA replication, recombination, and repair] Probab=98.44 E-value=3.3e-06 Score=67.29 Aligned_cols=257 Identities=17% Similarity=0.280 Sum_probs=130.7 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCC--EE Q ss_conf 54456567555788686077975799983898--399997684453640777885001256425011323289860--48 Q gi|254780814|r 41 LHHMIYEVLDNAIDESLEGYADIITVTLNMDG--SCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSY--KV 116 (803) Q Consensus 41 l~hlv~EiidNsvDe~~~g~~~~I~V~i~~dg--sisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~y--k~ 116 (803) .--.|.|+|+||+| || |+.|+|.+...| .|.|.|||.||+-+-- ++ ...=||-||-.+.+. ++ T Consensus 24 PaSVVKELVENSlD---AG-At~I~I~ve~gG~~~I~V~DNG~Gi~~~Dl--------~l-a~~rHaTSKI~~~~DL~~I 90 (638) T COG0323 24 PASVVKELVENSLD---AG-ATRIDIEVEGGGLKLIRVRDNGSGIDKEDL--------PL-ALLRHATSKIASLEDLFRI 90 (638) T ss_pred HHHHHHHHHHCCCC---CC-CCEEEEEEECCCEEEEEEEECCCCCCHHHH--------HH-HHHCCCCCCCCCHHHHHHH T ss_conf 88999999861030---49-988999993598018999889999998999--------99-9865146457865568774 Q ss_pred -ECCCCCCEEEEEEECCCEEEEEEEE-CCEEEEEEEECCEECCCCEEECCCCCCCCEEEEE----E--ECHHHH-HHHHC Q ss_conf -3116752037786326449999997-8959999988983646313873277888708999----8--778885-42100 Q gi|254780814|r 117 -SGGLHGVGVSVVNALSSWLKLRIKR-EGNIYEMSFINGILDNPLVVTGSAGNDTGTEVTF----L--PSSDIF-SVQDF 187 (803) Q Consensus 117 -sgGlhGvG~s~vNalS~~~~v~v~r-dgk~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~F----~--PD~~iF-~~~~~ 187 (803) |-|--|=-.+..-.. +++++++++ +...+.|-+..|..... .+.++....||+|.- . |=-..| ....- T Consensus 91 ~TlGFRGEAL~SIasV-srlti~Srt~~~~~~~~~~~~g~~~~~--~~~p~a~~~GTtVeV~dLF~NtPaRrKflks~~~ 167 (638) T COG0323 91 RTLGFRGEALASIASV-SRLTITSRTAEASEGTQIYAEGGGMEV--TVKPAAHPVGTTVEVRDLFYNTPARRKFLKSEKT 167 (638) T ss_pred HCCCCCHHHHHHHHHH-HEEEEEECCCCCCCEEEEEECCCCCCC--CCCCCCCCCCCEEEEHHHHCCCHHHHHHHCCCHH T ss_conf 3567338779998875-444899416776763899962884112--3347779999899845712558476654145389 Q ss_pred CHHHHHHHHHHHHCCCCEEEEEEEECCCCCCC-EEEECCCCC-CHHHHHHHCCCCCCCCCCCEEEEEEEEEEEEEEEEE- Q ss_conf 08999999997311464169999963586421-010012688-357765300356556885203642003699973345- Q gi|254780814|r 188 NYDTLQHRLRELSFLNSSVQISLIDKRSPEPR-EITMFYKGG-IEAFVSYLDRHKKPLMQSPIRIQGSRDNISIDLAMR- 264 (803) Q Consensus 188 ~~d~l~~R~~~~A~Ln~gl~i~l~Der~~~~~-~~~f~~~~G-i~~fv~~l~~~~~~~~~~~i~~~~~~~~~~veval~- 264 (803) .+..+..-++.+|..+|.+.++|.-. ++.. ...-....| ...-+..+-+. ......+.++.+.+++.+..-+. T Consensus 168 E~~~i~~vv~r~ALahp~I~F~l~~~--gk~~~~~~~~~~~~~~~~ri~~i~G~--~~~~~~l~i~~~~~~~~l~G~v~~ 243 (638) T COG0323 168 EFGHITELINRYALAHPDISFSLSHN--GKLRIELLKLPGTGDLEERIAAVYGT--EFLKNALPIENEHEDLRLSGYVSL 243 (638) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEEEC--CEEEEEEEECCCCCCHHHHHHHHHCH--HHHHHHCCCCCCCCCEEEEEEEEC T ss_conf 99999999999873079727999988--90414788657787578999998686--988752034567786699999825 Q ss_pred --ECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCEEEEEEECCCCCC Q ss_conf --317754121222246216667721035688899999999986242000123576134305721566652137633 Q gi|254780814|r 265 --WNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSVIGDDCREGFTSVLSIKMPDPRF 339 (803) Q Consensus 265 --~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~~~~Diregl~~visvki~nP~F 339 (803) .+-+-....+.|||+-+-.+= .|.++|.+-... ++... ..-++|+.+.+ +|.. T Consensus 244 P~~~r~~~~~q~~fVNgR~V~~~---------~l~~Ai~~aY~~--~L~~~----------r~P~~vL~l~l-~p~~ 298 (638) T COG0323 244 PEFTRASRDYQYLFVNGRPVRDK---------LLNHALREAYAD--YLPRG----------RYPVFVLFLEL-DPEL 298 (638) T ss_pred CEECCCCCCCEEEEECCCEECCH---------HHHHHHHHHHHH--CCCCC----------CCCEEEEEEEE-CHHH T ss_conf 60135787744999889881158---------999999999861--06688----------87679999987-7123 No 24 >TIGR00585 mutl DNA mismatch repair protein MutL; InterPro: IPR014763 This entry represents DNA mismatch repair proteins, such as MutL. The dimeric MutL protein has a key function in communicating mismatch recognition by MutS to downstream repair processes. Mismatch repair contributes to the overall fidelity of DNA replication by targeting mispaired bases that arise through replication errors during homologous recombination and as a result of DNA damage. It involves the correction of mismatched base pairs that have been missed by the proofreading element of the DNA polymerase complex .. Probab=98.36 E-value=3.5e-06 Score=67.17 Aligned_cols=182 Identities=19% Similarity=0.252 Sum_probs=108.3 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEE- Q ss_conf 254456567555788686077975799983898--39999768445364077788500125642501132328986048- Q gi|254780814|r 40 GLHHMIYEVLDNAIDESLEGYADIITVTLNMDG--SCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYKV- 116 (803) Q Consensus 40 Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dg--sisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~- 116 (803) ..--.|-|.|+||+| || |+.|+|.+...| .|.|.|||.||+.+=- .+++.=||=||..+-+.=. T Consensus 22 ~p~~vVKELvENSLD---AG-At~I~v~~~~gG~~~I~V~DNG~Gi~~~d~---------~~~~~~haTSKI~~~~DL~~ 88 (367) T TIGR00585 22 RPASVVKELVENSLD---AG-ATKIEVEIEEGGLKLIEVSDNGSGIDKEDL---------ELACERHATSKIQSFEDLER 88 (367) T ss_pred HHHHHHHHHHHHHHC---CC-CCEEEEEEEECCEEEEEEEECCCCCCHHHH---------HHHHCCCCCCCCCCHHHHHH T ss_conf 279999998873121---48-858999996265358999977856777779---------98612357763224789998 Q ss_pred --ECCCCCCEEEEEEECCCEEEEEE------------EECCEEEEEEEECCEECCCCEEECCCCCCCCEEEEEE------ Q ss_conf --31167520377863264499999------------9789599999889836463138732778887089998------ Q gi|254780814|r 117 --SGGLHGVGVSVVNALSSWLKLRI------------KREGNIYEMSFINGILDNPLVVTGSAGNDTGTEVTFL------ 176 (803) Q Consensus 117 --sgGlhGvG~s~vNalS~~~~v~v------------~rdgk~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~F~------ 176 (803) |=|==|==.+---.. +.+++.+ ..+|..|...|..|.... ..+.+++...||+|+=. T Consensus 89 i~TlGFRGEAL~Si~~v-s~~~i~t~~~~~~~W~yQ~~~~~~~~~~~~~~G~~~~--~~~~p~~~~~GT~V~V~~LF~N~ 165 (367) T TIGR00585 89 IRTLGFRGEALASISSV-SRLTITTKTSGPTAWFYQGAADGLALQVLYAEGKNII--EEIKPAPRPVGTTVEVRDLFYNL 165 (367) T ss_pred HHCCCCHHHHHHHHHHH-HCCEEEEEECCCCCCEECCCCCCEEEEEEECCCCEEE--ECCCCCCCCCCCEEEEECCCCCC T ss_conf 64048346899999864-3334787425898612212356404676620687342--00256688989789973031585 Q ss_pred ECHHH-H--HHHHCCHHHHHHHHHHHHCCCCEEEEEEEEC--CCCCCCEEE-ECC--CCCCHHH-HHHHC Q ss_conf 77888-5--4210008999999997311464169999963--586421010-012--6883577-65300 Q gi|254780814|r 177 PSSDI-F--SVQDFNYDTLQHRLRELSFLNSSVQISLIDK--RSPEPREIT-MFY--KGGIEAF-VSYLD 237 (803) Q Consensus 177 PD~~i-F--~~~~~~~d~l~~R~~~~A~Ln~gl~i~l~De--r~~~~~~~~-f~~--~~Gi~~f-v~~l~ 237 (803) |=-.. | +..+-.+..+.+.+...|..+|+|++.|..+ -..+++... +.. ++++++- +..+- T Consensus 166 P~R~k~~s~k~~~~e~~~~~~~l~~~~l~~~~v~f~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~v~ 235 (367) T TIGR00585 166 PVRRKEFSLKSPKKELRKILDLLERYALIHPDVSFSLTSDESFQTNGKKVLSLSTKPNQSLKERRIRSVF 235 (367) T ss_pred CHHHHHHHCCCHHHHHHHHHHHHHHHEEECCCCEEEEEECCEECCCCCEEEEECCCCCCHHHHHHHHHHH T ss_conf 2026877557806889999999864401337935899971166477410033524887307899999887 No 25 >cd00075 HATPase_c Histidine kinase-like ATPases; This family includes several ATP-binding proteins for example: histidine kinase, DNA gyrase B, topoisomerases, heat shock protein HSP90, phytochrome-like ATPases and DNA mismatch repair proteins Probab=97.67 E-value=4.2e-05 Score=58.71 Aligned_cols=80 Identities=34% Similarity=0.539 Sum_probs=55.0 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC---EEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEE Q ss_conf 54456567555788686077975799983898---399997684453640777885001256425011323289860483 Q gi|254780814|r 41 LHHMIYEVLDNAIDESLEGYADIITVTLNMDG---SCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYKVS 117 (803) Q Consensus 41 l~hlv~EiidNsvDe~~~g~~~~I~V~i~~dg---sisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~s 117 (803) |++++.++|+||++-+-. ....|.|....++ .++|.|||+|||-+..+. +|... ........ T Consensus 1 l~~vl~nLl~NAik~~~~-~~~~I~i~~~~~~~~~~i~V~D~G~Gi~~~~~~~--------if~~f------~~~~~~~~ 65 (103) T cd00075 1 LQQVLLNLLSNAIKHTPE-GGGRITISVERDGDHLEIRVEDNGPGIPEEDLER--------IFERF------SDGSRSRK 65 (103) T ss_pred CHHHHHHHHHHHHHHCCC-CCCEEEEEEEECCCEEEEEEEECCCCCCHHHHHH--------CCCCC------CCCCCCCC T ss_conf 969999999999986789-9948999999839999999997799408899976--------06975------01788877 Q ss_pred CCCCCCEEEEEEECCCEE Q ss_conf 116752037786326449 Q gi|254780814|r 118 GGLHGVGVSVVNALSSWL 135 (803) Q Consensus 118 gGlhGvG~s~vNalS~~~ 135 (803) .+-.|+|.+++..+.+.+ T Consensus 66 ~~g~GlGL~i~~~i~~~~ 83 (103) T cd00075 66 GGGTGLGLSIVKKLVELH 83 (103) T ss_pred CCCCCCHHHHHHHHHHHC T ss_conf 898650599999999988 No 26 >pfam02518 HATPase_c Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90. Probab=97.63 E-value=1.7e-05 Score=61.87 Aligned_cols=87 Identities=39% Similarity=0.511 Sum_probs=60.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCC---CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCC Q ss_conf 6325445656755578868607797579998389---8399997684453640777885001256425011323289860 Q gi|254780814|r 38 ESGLHHMIYEVLDNAIDESLEGYADIITVTLNMD---GSCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSY 114 (803) Q Consensus 38 ~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~d---gsisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~y 114 (803) ...|.+++.|+|+||++-+-.+ ..|.|.+..+ -.++|.|||.|||-+.. +-+|.....+.+ ... T Consensus 3 ~~~l~~i~~nll~NAik~~~~~--~~I~i~~~~~~~~~~i~V~D~G~Gi~~~~~--------~~if~~f~~~~~---~~~ 69 (111) T pfam02518 3 EDRLRQVLSNLLDNAIKHAPAG--GEIEVTLERDGGRLRITVEDNGIGIPPEDL--------PKIFEPFFRTDK---DSR 69 (111) T ss_pred HHHHHHHHHHHHHHHHHHCCCC--CEEEEEEEEECCEEEEEEEECCCCCCHHHH--------HHHCCCCEECCC---CCC T ss_conf 8999999999999999977999--759999999599999999989999598999--------887499177799---997 Q ss_pred EEECCCCCCEEEEEEECCCEEEEEE Q ss_conf 4831167520377863264499999 Q gi|254780814|r 115 KVSGGLHGVGVSVVNALSSWLKLRI 139 (803) Q Consensus 115 k~sgGlhGvG~s~vNalS~~~~v~v 139 (803) .. +| +|+|.++++.+.+.+..++ T Consensus 70 ~~-~g-~GlGL~i~~~~~~~~~g~i 92 (111) T pfam02518 70 KV-GG-TGLGLSIVRKLVELHGGTI 92 (111) T ss_pred CC-CC-CCHHHHHHHHHHHHCCCEE T ss_conf 77-97-7647999999999879989 No 27 >smart00387 HATPase_c Histidine kinase-like ATPases. Histidine kinase-, DNA gyrase B-, phytochrome-like ATPases. Probab=97.59 E-value=2.4e-05 Score=60.59 Aligned_cols=88 Identities=34% Similarity=0.409 Sum_probs=60.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCC---CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCC Q ss_conf 6325445656755578868607797579998389---8399997684453640777885001256425011323289860 Q gi|254780814|r 38 ESGLHHMIYEVLDNAIDESLEGYADIITVTLNMD---GSCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSY 114 (803) Q Consensus 38 ~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~d---gsisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~y 114 (803) ..-|++++.++++||++-+-. ...|.|++..+ -.|+|.|||.|||-+.. +.+|....++.+ . T Consensus 3 ~~~l~~il~Nli~NAik~~~~--~~~I~i~~~~~~~~~~i~V~D~G~Gi~~~~~--------~~if~~f~~~~~---~-- 67 (111) T smart00387 3 PDRLRQVLSNLLDNAIKYTPE--GGRITVTLERDGDHLEITVEDNGPGIPPEDL--------EKIFEPFFRTDG---R-- 67 (111) T ss_pred HHHHHHHHHHHHHHHHHHCCC--CCEEEEEEEECCCEEEEEEEECCCCCCHHHH--------HHCCCCCEECCC---C-- T ss_conf 899999999999999996779--9789999998399999999988997398999--------642699477799---9-- Q ss_pred EEECCCCCCEEEEEEECCCEEEEEEE Q ss_conf 48311675203778632644999999 Q gi|254780814|r 115 KVSGGLHGVGVSVVNALSSWLKLRIK 140 (803) Q Consensus 115 k~sgGlhGvG~s~vNalS~~~~v~v~ 140 (803) ....+-.|+|.++++-+.+.+..+++ T Consensus 68 ~~~~~g~GlGL~i~~~i~~~~~G~i~ 93 (111) T smart00387 68 SRKIGGTGLGLSIVKKLVELHGGEIS 93 (111) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCEEE T ss_conf 98879766489999999998799899 No 28 >KOG1978 consensus Probab=97.47 E-value=0.00059 Score=49.84 Aligned_cols=170 Identities=21% Similarity=0.322 Sum_probs=102.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCC---CCCC Q ss_conf 63254456567555788686077975799983898--399997684453640777885001256425011323---2898 Q gi|254780814|r 38 ESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDG--SCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGK---FNQK 112 (803) Q Consensus 38 ~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dg--sisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~k---fd~~ 112 (803) ..++.--|.|+|+||+|. -|+.|+|.+..-| +|+|.|||-||--.--+ + |+.-|.-+| |.|= T Consensus 18 I~sl~sAVKELvENSiDA----GAT~I~I~~kdyG~d~IEV~DNG~GI~~~n~~--~-------l~lkh~TSKi~~f~Dl 84 (672) T KOG1978 18 ITSLVSAVKELVENSIDA----GATAIDIKVKDYGSDSIEVSDNGSGISATDFE--G-------LALKHTTSKIVSFADL 84 (672) T ss_pred ECCHHHHHHHHHHCCCCC----CCCEEEEECCCCCCCEEEEECCCCCCCCCCHH--H-------HHHHHHHHCCCCHHHH T ss_conf 215899999998637034----77525575378774328980488887733103--4-------4434544133012324 Q ss_pred CCEEECCCCCCEEEEEEECCCEEEEEEEECC-EEEEEEEEC-CEECCCCEEECCCCCCCCEEEEEEECHHHHHH------ Q ss_conf 6048311675203778632644999999789-599999889-83646313873277888708999877888542------ Q gi|254780814|r 113 SYKVSGGLHGVGVSVVNALSSWLKLRIKREG-NIYEMSFIN-GILDNPLVVTGSAGNDTGTEVTFLPSSDIFSV------ 184 (803) Q Consensus 113 ~yk~sgGlhGvG~s~vNalS~~~~v~v~rdg-k~~~q~f~~-g~~~~~l~~~~~~~~~~GT~I~F~PD~~iF~~------ 184 (803) +..+|=|--|--.|.-.|+|.-...+...+- .-.+-.|.+ |.. ....++....||+|.-. +.|++ T Consensus 85 ~~l~T~GFRGEALSsLCa~~dv~I~Trt~~~~vgt~l~~Dh~G~I----~~k~~~ar~~GTTV~v~---~LF~tLPVR~k 157 (672) T KOG1978 85 AVLFTLGFRGEALSSLCALGDVMISTRSHSAKVGTRLVYDHDGHI----IQKKPVARGRGTTVMVR---QLFSTLPVRRK 157 (672) T ss_pred HHHHCHHHHHHHHHHHHHCCCEEEEEEECCCCCCEEEEECCCCCE----EEEECCCCCCCCEEEHH---HHCCCCCCCHH T ss_conf 444231246777763664165279986226754226887157863----44301257788778786---53146998668 Q ss_pred -----HHCCHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCCC Q ss_conf -----10008999999997311464169999963586421010012688 Q gi|254780814|r 185 -----QDFNYDTLQHRLRELSFLNSSVQISLIDKRSPEPREITMFYKGG 228 (803) Q Consensus 185 -----~~~~~d~l~~R~~~~A~Ln~gl~i~l~Der~~~~~~~~f~~~~G 228 (803) .+=.++.+...+...|-+++++++.+..--...++.+ -.-.+| T Consensus 158 ef~r~~Kref~k~i~li~~y~li~~~ir~~~~n~t~~~~k~i-il~t~~ 205 (672) T KOG1978 158 EFQRNIKRKFVKLISLIQAYALISTAIKFLVSNSTLAGKKNI-ILKTGG 205 (672) T ss_pred HHCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCEE-EEECCC T ss_conf 750114555666666677787631211013440134577405-773388 No 29 >PRK05218 heat shock protein 90; Provisional Probab=97.21 E-value=0.0062 Score=41.91 Aligned_cols=152 Identities=20% Similarity=0.231 Sum_probs=82.3 Q ss_pred HHHHHHHHHHH-------HH-------CCCCEEEEEECCC-CEEEEEECCCCCCCC-CCCCCCCCEEEEEEEEECCCCC- Q ss_conf 56755578868-------60-------7797579998389-839999768445364-0777885001256425011323- Q gi|254780814|r 46 YEVLDNAIDES-------LE-------GYADIITVTLNMD-GSCTVSDNGRGIPTD-LHKEEGISAAEVIMTRLHAGGK- 108 (803) Q Consensus 46 ~EiidNsvDe~-------~~-------g~~~~I~V~i~~d-gsisV~dnGrGIPv~-~h~~~~~~~~E~i~t~L~ag~k- 108 (803) =|+|.||+|-. +. +.--.|.|+.+++ ++++|.|||=|+--+ .|. -+=|.-++|++ T Consensus 31 RELisNAsDAl~K~r~~~l~~~~~~~~~~~~~I~I~~d~~~~tl~I~D~GIGMt~~El~~--------~LgTIa~Sgt~~ 102 (612) T PRK05218 31 RELISNASDAIDKLRFEALSDEALYEGDGELRIRISFDKEARTLTISDNGIGMTREEVIE--------NLGTIAKSGTKE 102 (612) T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCEEEEEECCCCCEEEEEECCCCCCHHHHHH--------HHHHHCCHHHHH T ss_conf 888741999999999987259032178888569999759899899995388889999998--------611102011899 Q ss_pred ----CCCCCCE--EECCCCCCEEEEEEECCCEEEEEEEE--CCEEEEEEEE-CCEECCCCEEECCCCCCCCEEEEEEECH Q ss_conf ----2898604--83116752037786326449999997--8959999988-9836463138732778887089998778 Q gi|254780814|r 109 ----FNQKSYK--VSGGLHGVGVSVVNALSSWLKLRIKR--EGNIYEMSFI-NGILDNPLVVTGSAGNDTGTEVTFLPSS 179 (803) Q Consensus 109 ----fd~~~yk--~sgGlhGvG~s~vNalS~~~~v~v~r--dgk~~~q~f~-~g~~~~~l~~~~~~~~~~GT~I~F~PD~ 179 (803) +...... ---|.-|||---+=..|.+.+|.++. ++-. .-.|+ +|...-.+.. .....+||+|+..+-. T Consensus 103 F~~~~~~~~~~~~~lIGqFGvGFyS~FmVad~V~V~srs~~~~~~-~~~W~s~g~g~y~i~~--~~~~~~GT~I~L~lk~ 179 (612) T PRK05218 103 FLEKLKGDQKKDSQLIGQFGVGFYSAFMVADKVTVITRSAGPASE-AVRWESDGEGEYTIEE--IEKEERGTEITLHLKE 179 (612) T ss_pred HHHHHHHCCCCCCCHHHHCCCCCEEEEEECCEEEEEECCCCCCCC-CEEEEECCCCCEEEEE--CCCCCCCCEEEEEECC T ss_conf 999865213554002321352420465650479999767789975-4599977999767610--5678998379999886 Q ss_pred HHHHHHHCCHHHHHHHHHHHH-CCCCEEEEEEEE Q ss_conf 885421000899999999731-146416999996 Q gi|254780814|r 180 DIFSVQDFNYDTLQHRLRELS-FLNSSVQISLID 212 (803) Q Consensus 180 ~iF~~~~~~~d~l~~R~~~~A-~Ln~gl~i~l~D 212 (803) +-=+ =++...|+..++..+ |+. ..|.+.+ T Consensus 180 d~~e--fl~~~~i~~li~kYs~fi~--~PI~l~~ 209 (612) T PRK05218 180 DEDE--FLDEWRIRSIIKKYSDHIP--VPIELEK 209 (612) T ss_pred HHHH--HCCHHHHHHHHHHHCCCCC--CCEEECC T ss_conf 0676--4478899999997404655--3349714 No 30 >COG0326 HtpG Molecular chaperone, HSP90 family [Posttranslational modification, protein turnover, chaperones] Probab=97.04 E-value=0.0014 Score=46.87 Aligned_cols=127 Identities=22% Similarity=0.283 Sum_probs=73.9 Q ss_pred HHHHHHHHHHH-------HHC------C-CCEEEEEECCC-CEEEEEECCCCCCCC-CCCCC---CCCEEEEEEEEECCC Q ss_conf 56755578868-------607------7-97579998389-839999768445364-07778---850012564250113 Q gi|254780814|r 46 YEVLDNAIDES-------LEG------Y-ADIITVTLNMD-GSCTVSDNGRGIPTD-LHKEE---GISAAEVIMTRLHAG 106 (803) Q Consensus 46 ~EiidNsvDe~-------~~g------~-~~~I~V~i~~d-gsisV~dnGrGIPv~-~h~~~---~~~~~E~i~t~L~ag 106 (803) -|.|.||.|-+ +.+ . --.|+|..++| +++||.|||-|+-=+ .|.-- .++.-.--+..|+.+ T Consensus 33 RELISNAsDAidKlr~~al~~~~~~~~~~~~~I~i~~Dk~~kTLtI~DNGIGMT~~Ev~~~LgTIAkSgT~~F~~~l~~~ 112 (623) T COG0326 33 RELISNASDAIDKLRFEALSDPELGEGDSDLRIRISFDKDNKTLTISDNGIGMTKDEVIENLGTIAKSGTKEFLESLSED 112 (623) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCHHHHHHHHHCC T ss_conf 99985068999998877526821147888844899763667879997278788899999987775100279999873035 Q ss_pred CCCCCCCCEEECCCCCCEEEEEEECCCEEEEEEEECC--EEEEEEEE-CCEECCCCEEECCCCCC-CCEEEEEEECHHH Q ss_conf 2328986048311675203778632644999999789--59999988-98364631387327788-8708999877888 Q gi|254780814|r 107 GKFNQKSYKVSGGLHGVGVSVVNALSSWLKLRIKREG--NIYEMSFI-NGILDNPLVVTGSAGND-TGTEVTFLPSSDI 181 (803) Q Consensus 107 ~kfd~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdg--k~~~q~f~-~g~~~~~l~~~~~~~~~-~GT~I~F~PD~~i 181 (803) . .+ + ---|.-|||.--.=+.+.+.+|.+++.| ..| .|+ .|...-.+..+. ... +||+|+-.+-++- T Consensus 113 ~--~~-~--~lIGQFGVGFYSaFmVAdkV~V~T~~~~~~~~~--~W~S~g~g~ytv~~~~--~~~~~GT~I~L~Lk~~e 182 (623) T COG0326 113 Q--KD-S--DLIGQFGVGFYSAFMVADKVTVITRSAGEDEAY--HWESDGEGEYTVEDID--KEPRRGTEITLHLKEEE 182 (623) T ss_pred C--CC-C--CCCCCCCCHHHHEEEEEEEEEEEECCCCCCCCC--EEEECCCCCEEEEECC--CCCCCCCEEEEEECCCH T ss_conf 4--45-4--400245500342025511689996468998660--6877698746776566--77878857999988766 No 31 >PTZ00272 heat shock protein 83 kDa (Hsp83); Provisional Probab=96.98 E-value=0.0045 Score=43.01 Aligned_cols=150 Identities=23% Similarity=0.331 Sum_probs=79.0 Q ss_pred HHHHHHHHHHH-------H-----HHCCCC--EEEEEECCC-CEEEEEECCCCCCCCCCCCCCCCEEEEE--E-EEECCC Q ss_conf 65675557886-------8-----607797--579998389-8399997684453640777885001256--4-250113 Q gi|254780814|r 45 IYEVLDNAIDE-------S-----LEGYAD--IITVTLNMD-GSCTVSDNGRGIPTDLHKEEGISAAEVI--M-TRLHAG 106 (803) Q Consensus 45 v~EiidNsvDe-------~-----~~g~~~--~I~V~i~~d-gsisV~dnGrGIPv~~h~~~~~~~~E~i--~-t~L~ag 106 (803) +-|+|.||.|- + +.|... .|.|..+++ +++||.|||-|+--+ ||| + |--++| T Consensus 30 LRELISNASDAldKlr~~~lt~~~~~~~~~~l~I~I~~D~~~~tltI~D~GIGMt~~----------el~~nLGTIA~SG 99 (701) T PTZ00272 30 LRELISNASDACDKIRYQSLTDPSVLGESPRLCIRVVPDKENKTLTVEDNGIGMTKA----------DLVNNLGTIARSG 99 (701) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCEEEEEECCCCCEEEEEECCCCCCHH----------HHHHHCCCCCCHH T ss_conf 787776279999999999736934407887735999975878979999438888999----------9997521011141 Q ss_pred CC-C----CCCCCEEECCCCCCEEEEEEECCCEEEEEEEECC-EEEEEEEE-CCEECCCCEEECCCCCCCCEEEEEEECH Q ss_conf 23-2----8986048311675203778632644999999789-59999988-9836463138732778887089998778 Q gi|254780814|r 107 GK-F----NQKSYKVSGGLHGVGVSVVNALSSWLKLRIKREG-NIYEMSFI-NGILDNPLVVTGSAGNDTGTEVTFLPSS 179 (803) Q Consensus 107 ~k-f----d~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdg-k~~~q~f~-~g~~~~~l~~~~~~~~~~GT~I~F~PD~ 179 (803) ++ | +.....---|.-|||---+=..+.+.+|..+..+ +.| .|+ +|...-.+.......-.+||+|+...-. T Consensus 100 t~~Fl~~l~~~~d~~lIGQFGVGFYSaFmVAdkV~V~sk~~~~~~~--~W~S~g~g~y~i~~~~~~~~~rGT~I~L~Lke 177 (701) T PTZ00272 100 TKAFMEALEAGGDMSMIGQFGVGFYSAYLVADRVTVTSKNNSDESY--VWESSAGGTFTITSTPESDMKRGTRITLHLKE 177 (701) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCHHHEEEECCEEEEEECCCCCCCE--EEEEECCCCEEEEECCCCCCCCCCEEEEEECH T ss_conf 8999999864797352346766543201205679999646998755--89980589658885777777898579999880 Q ss_pred HHHHHHHC-CHHHHHHHHHHHH-CCCCEEEEEEE Q ss_conf 88542100-0899999999731-14641699999 Q gi|254780814|r 180 DIFSVQDF-NYDTLQHRLRELS-FLNSSVQISLI 211 (803) Q Consensus 180 ~iF~~~~~-~~d~l~~R~~~~A-~Ln~gl~i~l~ 211 (803) +- .+| +...|...++..+ |++ ..|.+. T Consensus 178 d~---~efl~~~~lk~likKYS~fI~--~PI~l~ 206 (701) T PTZ00272 178 DQ---MEYLEPRRLKELIKKHSEFIG--YDIELM 206 (701) T ss_pred HH---HHHHHHHHHHHHHHHHCCCCC--CCEEEC T ss_conf 48---877547889999997246756--443622 No 32 >PRK13560 hypothetical protein; Provisional Probab=96.91 E-value=0.0011 Score=47.77 Aligned_cols=23 Identities=30% Similarity=0.374 Sum_probs=11.0 Q ss_pred HHHHHHH-HHCC-----CCEEEEEEEECC Q ss_conf 9999997-3114-----641699999635 Q gi|254780814|r 192 LQHRLRE-LSFL-----NSSVQISLIDKR 214 (803) Q Consensus 192 l~~R~~~-~A~L-----n~gl~i~l~Der 214 (803) ++.|+++ ++|| |-...|.+.|.. T Consensus 195 aE~rl~~ql~flq~LlDnipdpvF~kD~~ 223 (807) T PRK13560 195 AEERIDEALHFLQQLLDNIADPAFWKDED 223 (807) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEECCC T ss_conf 99999999999999985488746997799 No 33 >KOG1979 consensus Probab=96.72 E-value=0.0063 Score=41.87 Aligned_cols=151 Identities=20% Similarity=0.309 Sum_probs=87.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEEECCCCE--EEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEE---E Q ss_conf 4565675557886860779757999838983--9999768445364077788500125642501132328986048---3 Q gi|254780814|r 43 HMIYEVLDNAIDESLEGYADIITVTLNMDGS--CTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYKV---S 117 (803) Q Consensus 43 hlv~EiidNsvDe~~~g~~~~I~V~i~~dgs--isV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~---s 117 (803) ..+.|.+.||.|- -.+.|.|.+..+|- +-|.|||.||--+ ..+ |+++=+|-+|-++-+.-. | T Consensus 30 NAlKEliENSLDA----~ST~I~V~vk~GGLKLlQisDnG~GI~re-----Dl~----ilCeRftTSKL~kFEDL~~lsT 96 (694) T KOG1979 30 NALKELIENSLDA----NSTSIDVLVKDGGLKLLQISDNGSGIRRE-----DLP----ILCERFTTSKLTKFEDLFSLST 96 (694) T ss_pred HHHHHHHHCCCCC----CCCEEEEEEECCCEEEEEEECCCCCCCHH-----HHH----HHHHHHHHHHCCHHHHHHHHHH T ss_conf 8999998614157----98569999815976899983289861522-----468----9987762120000677776664 Q ss_pred CCCCCCEEEEEEECCCEEEEEE---EECCE-EEEEEEECCEECCCCEEECCCCCCCCEEEEE----EECHHHHH---HHH Q ss_conf 1167520377863264499999---97895-9999988983646313873277888708999----87788854---210 Q gi|254780814|r 118 GGLHGVGVSVVNALSSWLKLRI---KREGN-IYEMSFINGILDNPLVVTGSAGNDTGTEVTF----LPSSDIFS---VQD 186 (803) Q Consensus 118 gGlhGvG~s~vNalS~~~~v~v---~rdgk-~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~F----~PD~~iF~---~~~ 186 (803) =|--|--. --.|.--.|+| .++++ -|+.+|.+|.-..+++ ++..+.||.|+- +--+.+=. +.. T Consensus 97 yGFRGEAL---ASiShVA~VtV~TK~~~~~cayrasY~DGkm~~~pK---pcAgk~GT~I~vedLFYN~~~Rrkal~~~~ 170 (694) T KOG1979 97 YGFRGEAL---ASISHVAHVTVTTKTAEGKCAYRASYRDGKMIATPK---PCAGKQGTIITVEDLFYNMPTRRKALRNHA 170 (694) T ss_pred CCCCHHHH---HHHHHEEEEEEEEEECCCEEEEEEEEECCCCCCCCC---CCCCCCCEEEEEHHHHCCCHHHHHHHCCCH T ss_conf 17447887---643323689999862376445678751450146997---755777639986684403788898751738 Q ss_pred CCHHHHHHHHHHHHCCCCEEEEEEEE Q ss_conf 00899999999731146416999996 Q gi|254780814|r 187 FNYDTLQHRLRELSFLNSSVQISLID 212 (803) Q Consensus 187 ~~~d~l~~R~~~~A~Ln~gl~i~l~D 212 (803) -.+..|...+-..|.-||++.+.+.- T Consensus 171 EE~~ki~dlv~ryAIHn~~VsFs~rk 196 (694) T KOG1979 171 EEYRKIMDLVGRYAIHNPRVSFSLRK 196 (694) T ss_pred HHHHHHHHHHHHHHEECCCCCEEEEE T ss_conf 89999999998871017886369851 No 34 >COG3920 Signal transduction histidine kinase [Signal transduction mechanisms] Probab=96.12 E-value=0.0083 Score=40.91 Aligned_cols=94 Identities=26% Similarity=0.373 Sum_probs=61.0 Q ss_pred CEECCCCCCCHHHHHHHHHHHHHHHHH-HHCCCCEEEEEECC--CC---EEEEEECCCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 154327996325445656755578868-60779757999838--98---3999976844536407778850012564250 Q gi|254780814|r 30 MYIGDTDGESGLHHMIYEVLDNAIDES-LEGYADIITVTLNM--DG---SCTVSDNGRGIPTDLHKEEGISAAEVIMTRL 103 (803) Q Consensus 30 mYiG~t~~~~Gl~hlv~EiidNsvDe~-~~g~~~~I~V~i~~--dg---sisV~dnGrGIPv~~h~~~~~~~~E~i~t~L 103 (803) .+++. |....|-=+|.|.+.||+--+ ..+-...|.|++.. ++ ..+|||||.|+|++.-+ T Consensus 113 ~~l~~-d~A~~Lgliv~EL~tNa~Khaf~~~~~G~I~I~~~~~~~~~~~~l~v~deg~G~~~~~~~-------------- 177 (221) T COG3920 113 VFLDP-DTAVPLGLIVHELVTNALKHAFLSRPGGEIRITLSREGDGGRFLLTVWDEGGGPPVEAPL-------------- 177 (221) T ss_pred EEECH-HHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCCCCEEEEEEEECCCCCCCCCCC-------------- T ss_conf 22673-678899999999999889861788899779999998479971799998789898888887-------------- Q ss_pred CCCCCCCCCCCEEECCCCCCEEEEEEECC-CEEEE--EEE-ECCEEEEEEEECC Q ss_conf 11323289860483116752037786326-44999--999-7895999998898 Q gi|254780814|r 104 HAGGKFNQKSYKVSGGLHGVGVSVVNALS-SWLKL--RIK-REGNIYEMSFING 153 (803) Q Consensus 104 ~ag~kfd~~~yk~sgGlhGvG~s~vNalS-~~~~v--~v~-rdgk~~~q~f~~g 153 (803) ...|+|..+++.+- ..+.- +.+ .+|-.|..+|... T Consensus 178 ---------------~~~g~G~~Lv~~lv~~q~~g~~~~~~~~Gt~~~i~~~~~ 216 (221) T COG3920 178 ---------------SRGGFGLQLVERLVPEQLGGELEDERPDGTEFRLRFPLS 216 (221) T ss_pred ---------------CCCCCHHHHHHHHHHHHCCCEEEEECCCCEEEEEEEECC T ss_conf ---------------899837999999999967972799749977999998864 No 35 >PRK10935 nitrate/nitrite sensor protein NarQ; Provisional Probab=96.07 E-value=0.019 Score=38.15 Aligned_cols=11 Identities=27% Similarity=0.271 Sum_probs=4.5 Q ss_pred EEEEEEECCCC Q ss_conf 59999951853 Q gi|254780814|r 662 ILVKRIARGVQ 672 (803) Q Consensus 662 i~~~~~~~Gv~ 672 (803) +.+....+|+. T Consensus 507 v~l~V~DnG~G 517 (568) T PRK10935 507 VTVSIRDDGVG 517 (568) T ss_pred EEEEEEECCCC T ss_conf 99999779947 No 36 >COG3850 NarQ Signal transduction histidine kinase, nitrate/nitrite-specific [Signal transduction mechanisms] Probab=96.04 E-value=0.015 Score=38.90 Aligned_cols=17 Identities=24% Similarity=0.307 Sum_probs=8.1 Q ss_pred CCCCCCEEEEEEECCCC Q ss_conf 44567059999951853 Q gi|254780814|r 656 SINNSDILVKRIARGVQ 672 (803) Q Consensus 656 ~~~~~~i~~~~~~~Gv~ 672 (803) .....+..+....+|+. T Consensus 507 ~~~~g~~~~~VeDnG~G 523 (574) T COG3850 507 SQNDGQVTLTVEDNGVG 523 (574) T ss_pred EECCCEEEEEEEECCCC T ss_conf 86596489999428837 No 37 >PTZ00130 heat shock protein 90; Provisional Probab=95.91 E-value=0.034 Score=36.16 Aligned_cols=149 Identities=21% Similarity=0.303 Sum_probs=78.1 Q ss_pred HHHHHHHHHH-------HH-----HCCCC--EEEEEECCC-CEEEEEECCCCCCCCCCCCCCCCEEEEE--E-EEECCCC Q ss_conf 5675557886-------86-----07797--579998389-8399997684453640777885001256--4-2501132 Q gi|254780814|r 46 YEVLDNAIDE-------SL-----EGYAD--IITVTLNMD-GSCTVSDNGRGIPTDLHKEEGISAAEVI--M-TRLHAGG 107 (803) Q Consensus 46 ~EiidNsvDe-------~~-----~g~~~--~I~V~i~~d-gsisV~dnGrGIPv~~h~~~~~~~~E~i--~-t~L~ag~ 107 (803) -|+|.||-|- ++ .+... .|.|..+++ +++||.|+|-|+--+ ||| + |.-++|+ T Consensus 102 RELISNASDALDKlRf~sLtd~~~~~~~~~l~I~I~~Dke~ktLtI~DtGIGMTk~----------eLi~NLGTIAkSGT 171 (824) T PTZ00130 102 RELISNAADALEKIRFLSLSDEKVLGEEKKLEIRISANKEKNILSITDTGIGMTKE----------DLINNLGTIAKSGT 171 (824) T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCEEEEEECCCCCEEEEEECCCCCCHH----------HHHHHCCCCEEHHH T ss_conf 88876489999999999717924308888856998975889989998448888899----------99985691333528 Q ss_pred C-C-----CCCCCEEECCCCCCEEEEEEECCCEEEEEEEECC-EEEEEEEE-CCEECCCCEEE--CCCCCCCCEEEEEEE Q ss_conf 3-2-----8986048311675203778632644999999789-59999988-98364631387--327788870899987 Q gi|254780814|r 108 K-F-----NQKSYKVSGGLHGVGVSVVNALSSWLKLRIKREG-NIYEMSFI-NGILDNPLVVT--GSAGNDTGTEVTFLP 177 (803) Q Consensus 108 k-f-----d~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdg-k~~~q~f~-~g~~~~~l~~~--~~~~~~~GT~I~F~P 177 (803) + | +....----|.-|||---+-..+.+.+|.++.++ ..|. |+ .|...-.+... +.+ -.+||+|+..- T Consensus 172 k~Fle~l~~~~~d~~LIGQFGVGFYSaFmVAdkV~V~SK~~~de~~~--WeS~g~gsftI~~~~~~~~-l~RGT~I~LhL 248 (824) T PTZ00130 172 SNFLEAISKSGGDMSLIGQFGVGFYSAFLVADKVIVYTKNNDDEQYI--WESTADAKFSIYKDPRGAT-LKRGTRISLHL 248 (824) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCEEEEEECCCCCCCEE--EEEECCCCEEEEECCCCCC-CCCCEEEEEEE T ss_conf 99999987358970102677752544431067689995789996408--9980678558886777777-78970699985 Q ss_pred --CHHHHHHHHCCHHHHHHHHHHHH-CCCCEEEEEEEEC Q ss_conf --78885421000899999999731-1464169999963 Q gi|254780814|r 178 --SSDIFSVQDFNYDTLQHRLRELS-FLNSSVQISLIDK 213 (803) Q Consensus 178 --D~~iF~~~~~~~d~l~~R~~~~A-~Ln~gl~i~l~De 213 (803) |..=|- +...|...++..+ |.+ ..|.+..+ T Consensus 249 Ked~~Efl----ee~rik~lVkKYS~FI~--fPI~l~~e 281 (824) T PTZ00130 249 KDDATNLL----NDKKLMDLISKYSQFIQ--FPIYLLHE 281 (824) T ss_pred CCCHHHHH----HHHHHHHHHHHHCCCCC--CCEEEECC T ss_conf 67567665----47889999998513665--22587203 No 38 >COG2205 KdpD Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms] Probab=95.77 E-value=0.011 Score=40.01 Aligned_cols=29 Identities=17% Similarity=0.376 Sum_probs=14.3 Q ss_pred CCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 045871543279963254456567555788686 Q gi|254780814|r 25 KKRPSMYIGDTDGESGLHHMIYEVLDNAIDESL 57 (803) Q Consensus 25 rkRP~mYiG~t~~~~Gl~hlv~EiidNsvDe~~ 57 (803) |-|--+|+|--- |.- =-|+-+.++.+--- T Consensus 20 RGklkIf~G~ap---GVG-KTyaML~~a~~~~~ 48 (890) T COG2205 20 RGKLKIFLGAAP---GVG-KTYAMLSEAQRLLA 48 (890) T ss_pred CCCEEEEEECCC---CCC-HHHHHHHHHHHHHH T ss_conf 774689962688---862-79999999999997 No 39 >pfam01751 Toprim Toprim domain. This is a conserved region from DNA primase. This corresponds to the Toprim domain common to DnaG primases, topoisomerases, OLD family nucleases and RecR proteins. Both DnaG motifs IV and V are present in the alignment, the DxD (V) motif may be involved in Mg2+ binding and mutations to the conserved glutamate (IV) completely abolish DnaG type primase activity. DNA primase EC:2.7.7.6 is a nucleotidyltransferase it synthesizes the oligoribonucleotide primers required for DNA replication on the lagging strand of the replication fork; it can also prime the leading stand and has been implicated in cell division. This family also includes the atypical archaeal A subunit from type II DNA topoisomerases. Type II DNA topoisomerases catalyse the relaxation of DNA supercoiling by causing transient double strand breaks. Probab=95.76 E-value=0.0067 Score=41.64 Aligned_cols=79 Identities=24% Similarity=0.310 Sum_probs=48.8 Q ss_pred EEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCC--EEEE Q ss_conf 79998255754322334654430233005620334430587663236678888762578775332123567373--8998 Q gi|254780814|r 426 ELFLVEGDSAGGSAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDINKLRYHK--IIIM 503 (803) Q Consensus 426 eL~ivEGDSAggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~~lrY~k--iiim 503 (803) +|+||||.|-.-+..+..... .-+..+.|.+..... ...+.+ +.-+.++.+ |||+ T Consensus 2 ~l~IVE~~~d~~~i~~~~~~~-~~~~~~~G~~~~l~~------~~~~~~----------------~~l~~~~~~~eViia 58 (89) T pfam01751 2 VLIIVEGPSDAIALAKAGGYK-GNVVALLGHLSDVIP------LTKEQL----------------KLLKKLAKKDEVILA 58 (89) T ss_pred EEEEECCHHHHHHHHHHCCCC-EEEEEEECEECCCCC------CCHHHH----------------HHHHHHHCCCEEEEE T ss_conf 899989889999999624888-799984157057655------424679----------------999986237749998 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 508888532899999999963077 Q gi|254780814|r 504 TDADVDGAHIRTLLLTFFFRQMPS 527 (803) Q Consensus 504 tDaD~DG~HI~~Llltff~~~~~~ 527 (803) ||+|++|.+|+..++..+-.+.+. T Consensus 59 tD~D~eGe~~a~~l~~~l~~~~~~ 82 (89) T pfam01751 59 TDPDREGEAIAWKLLELLKPLGKK 82 (89) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 798948999999999983122981 No 40 >PRK10604 sensor protein RstB; Provisional Probab=95.64 E-value=0.0055 Score=42.32 Aligned_cols=22 Identities=9% Similarity=0.072 Sum_probs=11.7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHH Q ss_conf 0145776543546899999999 Q gi|254780814|r 573 ELFCDSKGVISKNDLRNFIDDA 594 (803) Q Consensus 573 l~~~~~~~~l~ge~L~~li~~~ 594 (803) +...++|+.+..+++.++.+-. T Consensus 352 i~V~D~G~GI~~e~~~~IFepF 373 (433) T PRK10604 352 LIVEDDGPGIAPEERERIFEPF 373 (433) T ss_pred EEEEECCCCCCHHHHHHHCCCC T ss_conf 9999778788989999865794 No 41 >PRK10337 sensor protein QseC; Provisional Probab=95.41 E-value=0.026 Score=37.13 Aligned_cols=18 Identities=11% Similarity=0.095 Sum_probs=8.1 Q ss_pred CCCCCCCCCHHHHHHHHH Q ss_conf 457765435468999999 Q gi|254780814|r 575 FCDSKGVISKNDLRNFID 592 (803) Q Consensus 575 ~~~~~~~l~ge~L~~li~ 592 (803) ..++|+.+..+++.++.+ T Consensus 382 V~D~G~GIp~e~~~~iFe 399 (446) T PRK10337 382 VRDNGPGVTPEALARIGE 399 (446) T ss_pred EEECCCCCCHHHHHHHCC T ss_conf 997699999999987427 No 42 >PRK10549 signal transduction histidine-protein kinase BaeS; Provisional Probab=95.40 E-value=0.0059 Score=42.10 Aligned_cols=13 Identities=31% Similarity=0.399 Sum_probs=4.8 Q ss_pred CCCCCCCHHHHHH Q ss_conf 7765435468999 Q gi|254780814|r 577 DSKGVISKNDLRN 589 (803) Q Consensus 577 ~~~~~l~ge~L~~ 589 (803) ++|+.+..+++.+ T Consensus 392 D~G~GI~~e~l~~ 404 (467) T PRK10549 392 DSAPGVSDEQLQK 404 (467) T ss_pred EECCCCCHHHHHH T ss_conf 8188879899998 No 43 >PRK09835 sensor kinase CusS; Provisional Probab=95.34 E-value=0.02 Score=37.98 Aligned_cols=20 Identities=15% Similarity=0.210 Sum_probs=10.1 Q ss_pred CCCCCCCCCCCHHHHHHHHH Q ss_conf 01457765435468999999 Q gi|254780814|r 573 ELFCDSKGVISKNDLRNFID 592 (803) Q Consensus 573 l~~~~~~~~l~ge~L~~li~ 592 (803) +...+.|..+..+++.++.+ T Consensus 410 i~V~D~G~GI~~e~~~~iFe 429 (482) T PRK09835 410 VVVENPGTPIAPEHLPRLFD 429 (482) T ss_pred EEEEECCCCCCHHHHHHHCC T ss_conf 99996587889899987667 No 44 >PRK10600 nitrate/nitrite sensor protein NarX; Provisional Probab=95.31 E-value=0.038 Score=35.81 Aligned_cols=14 Identities=7% Similarity=0.207 Sum_probs=6.1 Q ss_pred CCCEEEEEEECCCC Q ss_conf 67059999951853 Q gi|254780814|r 659 NSDILVKRIARGVQ 672 (803) Q Consensus 659 ~~~i~~~~~~~Gv~ 672 (803) ...+.+....+|+. T Consensus 498 ~~~~~l~I~DdG~G 511 (569) T PRK10600 498 DNQVKLSVQDNGCG 511 (569) T ss_pred CCEEEEEEEECCCC T ss_conf 99899999889989 No 45 >PRK11073 glnL nitrogen regulation protein NR(II); Provisional Probab=95.29 E-value=0.042 Score=35.50 Aligned_cols=11 Identities=9% Similarity=0.090 Sum_probs=4.6 Q ss_pred ECCCCEEEEEE Q ss_conf 83898399997 Q gi|254780814|r 68 LNMDGSCTVSD 78 (803) Q Consensus 68 i~~dgsisV~d 78 (803) ++.||.|.-+| T Consensus 23 ~D~~g~I~~~N 33 (348) T PRK11073 23 LDDNLAIHYAN 33 (348) T ss_pred ECCCCCEEEEC T ss_conf 98979701283 No 46 >PRK03660 anti-sigma F factor; Provisional Probab=95.23 E-value=0.063 Score=34.11 Aligned_cols=88 Identities=19% Similarity=0.330 Sum_probs=52.1 Q ss_pred HHHHHHHHHHHHHHHHCCC-CEEEEEEC--CCC-EEEEEECCCCCC-CCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEE Q ss_conf 4565675557886860779-75799983--898-399997684453-640777885001256425011323289860483 Q gi|254780814|r 43 HMIYEVLDNAIDESLEGYA-DIITVTLN--MDG-SCTVSDNGRGIP-TDLHKEEGISAAEVIMTRLHAGGKFNQKSYKVS 117 (803) Q Consensus 43 hlv~EiidNsvDe~~~g~~-~~I~V~i~--~dg-sisV~dnGrGIP-v~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~s 117 (803) --|.|.+.|||.-+..+.. ..|.|.+. .+. .|+|.|+|.||| +....+ | .++. +. . T Consensus 42 lAV~EA~tNaI~H~y~~~~~g~I~I~~~~~~~~l~I~V~D~G~Gid~~~~~~~-----P--~~t~-------~~-----~ 102 (146) T PRK03660 42 TAVSEAVTNAIIHGYENNPDGIVYIEAEIEDEELEITVRDEGKGIEDVEEARQ-----P--LFTT-------KP-----E 102 (146) T ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCCHHHCCC-----C--CCCC-------CC-----C T ss_conf 99999999999860367999279999998099999999973789473766438-----8--7766-------87-----6 Q ss_pred CCCCCCEEEEEEECCCEEEEEEEEC-CEEEEEE Q ss_conf 1167520377863264499999978-9599999 Q gi|254780814|r 118 GGLHGVGVSVVNALSSWLKLRIKRE-GNIYEMS 149 (803) Q Consensus 118 gGlhGvG~s~vNalS~~~~v~v~rd-gk~~~q~ 149 (803) +...|.|..+.+.|...++++...+ |....|+ T Consensus 103 ~~~~GlGl~li~~LmDeVei~s~~g~GTtV~m~ 135 (146) T PRK03660 103 LERSGMGFTFMESFMDEVEVESEPGKGTTIRMK 135 (146) T ss_pred CCCCCHHHHHHHHHCCEEEEEECCCCCEEEEEE T ss_conf 675640589999838869999559997899999 No 47 >PRK10815 sensor protein PhoQ; Provisional Probab=95.19 E-value=0.013 Score=39.51 Aligned_cols=21 Identities=14% Similarity=0.036 Sum_probs=12.6 Q ss_pred HCCCCCCCCCCCHHHHHHHHH Q ss_conf 001457765435468999999 Q gi|254780814|r 572 IELFCDSKGVISKNDLRNFID 592 (803) Q Consensus 572 ~l~~~~~~~~l~ge~L~~li~ 592 (803) .+...++|+.+..+++..+.+ T Consensus 409 ~I~V~D~GpGIp~e~~~~IFe 429 (484) T PRK10815 409 HIVVEDDGPGIPESKRELIFD 429 (484) T ss_pred EEEEEECCCCCCHHHHHHHCC T ss_conf 999997399999899986246 No 48 >PRK10490 sensor protein KdpD; Provisional Probab=95.18 E-value=0.0094 Score=40.52 Aligned_cols=38 Identities=29% Similarity=0.464 Sum_probs=22.0 Q ss_pred EECCCCCHHCCCCCCEECCC-------------------------CCCCHHHH-----HHHHHHHHHHH Q ss_conf 23135420004587154327-------------------------99632544-----56567555788 Q gi|254780814|r 16 QILKGLDAVKKRPSMYIGDT-------------------------DGESGLHH-----MIYEVLDNAID 54 (803) Q Consensus 16 ~~L~gle~vrkRP~mYiG~t-------------------------~~~~Gl~h-----lv~EiidNsvD 54 (803) ..|+|||-+-.|.--|=|-+ + ..|..| =|.|++|..|| T Consensus 70 ~l~~gl~~~p~~~~~y~g~~~~e~dlda~l~r~P~~~lvDelahtn-~pgsrh~kR~qdv~ell~aGi~ 137 (895) T PRK10490 70 ALLEGLTVLPPKRLAHRGRQISEFDLDAALARRPALILMDELAHSN-APGSRHPKRWQDIEELLEAGID 137 (895) T ss_pred HHHCCCCCCCCCEEEECCCCCCCCCHHHHHHCCCCEEEEECCCCCC-CCCCCCCCHHHHHHHHHHCCCE T ss_conf 9874866588711412783234168989985399889973345669-9988776407649999977983 No 49 >cd00188 TOPRIM Topoisomerase-primase domain. This is a nucleotidyl transferase/hydrolase domain found in type IA, type IIA and type IIB topoisomerases, bacterial DnaG-type primases, small primase-like proteins from bacteria and archaea, OLD family nucleases from bacterial and archaea, and bacterial DNA repair proteins of the RecR/M family. This domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). This glutamate and two aspartates, cluster together to form a highly acid surface patch. The conserved glutamate may act as a general base in nucleotide polymerization by primases and in strand joining in topoisomerases and, as a general acid in strand cleavage by topisomerases and nucleases. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=94.88 E-value=0.064 Score=34.03 Aligned_cols=73 Identities=33% Similarity=0.399 Sum_probs=48.6 Q ss_pred EEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEC Q ss_conf 79998255754322334654430233005620334430587663236678888762578775332123567373899850 Q gi|254780814|r 426 ELFLVEGDSAGGSAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDINKLRYHKIIIMTD 505 (803) Q Consensus 426 eL~ivEGDSAggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~~lrY~kiiimtD 505 (803) .++||||++..-+.-+.... ...++++.|+..+ .....+...++ . +.+|+|+|| T Consensus 2 ~viivEG~~d~~~l~~~~~~-~~~~~~~~G~~~~------------~~~~~~~~~~~------------~-~~~vii~~D 55 (83) T cd00188 2 KLIIVEGPSDALALAQAGGY-GGAVVALGGHALN------------KTRELLKRLLG------------E-AKEVIIATD 55 (83) T ss_pred EEEEECCHHHHHHHHHHCCC-CEEEEEEEEEECC------------CHHHHHHHHHC------------C-CCEEEEECC T ss_conf 79998888999999985699-9799997557384------------38999999876------------8-999999749 Q ss_pred CCCCCCCHHHHHHHHHHHH Q ss_conf 8888532899999999963 Q gi|254780814|r 506 ADVDGAHIRTLLLTFFFRQ 524 (803) Q Consensus 506 aD~DG~HI~~Llltff~~~ 524 (803) .|..|.++........... T Consensus 56 ~D~~g~~~~~~~~~~~~~~ 74 (83) T cd00188 56 ADREGEAIALRLLELLKSL 74 (83) T ss_pred CCHHHHHHHHHHHHHHHHC T ss_conf 8937999999999973234 No 50 >PTZ00108 DNA topoisomerase II; Provisional Probab=94.75 E-value=0.098 Score=32.62 Aligned_cols=12 Identities=42% Similarity=0.545 Sum_probs=6.6 Q ss_pred HHCCCCEEEEEEC Q ss_conf 6077975799983 Q gi|254780814|r 57 LEGYADIITVTLN 69 (803) Q Consensus 57 ~~g~~~~I~V~i~ 69 (803) +||-| .|+|.++ T Consensus 229 LAgt~-sVkVyLN 240 (1506) T PTZ00108 229 LAGTC-SVRVYLN 240 (1506) T ss_pred HHCCC-CEEEEEC T ss_conf 73677-8069975 No 51 >PRK11644 sensory histidine kinase UhpB; Provisional Probab=94.60 E-value=0.07 Score=33.76 Aligned_cols=15 Identities=7% Similarity=0.180 Sum_probs=7.4 Q ss_pred CCCCEEEEEEECCCC Q ss_conf 567059999951853 Q gi|254780814|r 658 NNSDILVKRIARGVQ 672 (803) Q Consensus 658 ~~~~i~~~~~~~Gv~ 672 (803) ....+.+....+|+. T Consensus 440 ~~~~l~L~I~DDG~G 454 (497) T PRK11644 440 QDERLMLVIEDNGSG 454 (497) T ss_pred CCCEEEEEEEECCCC T ss_conf 699899999889989 No 52 >COG4585 Signal transduction histidine kinase [Signal transduction mechanisms] Probab=94.59 E-value=0.061 Score=34.19 Aligned_cols=11 Identities=36% Similarity=0.564 Sum_probs=6.8 Q ss_pred HHHHHHHHHHC Q ss_conf 66788887625 Q gi|254780814|r 472 QEIGTLITALG 482 (803) Q Consensus 472 ~Ei~~l~~~lG 482 (803) +|++.++..|- T Consensus 223 ~evR~~v~~Lr 233 (365) T COG4585 223 QEVRALVRDLR 233 (365) T ss_pred HHHHHHHHHCC T ss_conf 99999999719 No 53 >PRK09467 envZ osmolarity sensor protein; Provisional Probab=94.50 E-value=0.071 Score=33.68 Aligned_cols=22 Identities=9% Similarity=0.096 Sum_probs=11.4 Q ss_pred CCCCCCCCCCCHHHHHHHHHHH Q ss_conf 0145776543546899999999 Q gi|254780814|r 573 ELFCDSKGVISKNDLRNFIDDA 594 (803) Q Consensus 573 l~~~~~~~~l~ge~L~~li~~~ 594 (803) +...++|+.+..+++.++.+-. T Consensus 366 i~V~D~GpGI~~e~l~~if~pF 387 (437) T PRK09467 366 FQVEDNGPGIPEEQIKHLFQPF 387 (437) T ss_pred EEEEECCCCCCHHHHHHHCCCC T ss_conf 9999759999989998761685 No 54 >PRK10755 sensor protein BasS/PmrB; Provisional Probab=94.46 E-value=0.1 Score=32.50 Aligned_cols=23 Identities=9% Similarity=-0.023 Sum_probs=12.9 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 01457765435468999999999 Q gi|254780814|r 573 ELFCDSKGVISKNDLRNFIDDAL 595 (803) Q Consensus 573 l~~~~~~~~l~ge~L~~li~~~~ 595 (803) +...++|+.+..+++.++.+... T Consensus 287 i~V~D~G~GI~~e~~~~iFerFy 309 (355) T PRK10755 287 MAVEDEGPGIDESKCGELSKAFV 309 (355) T ss_pred EEEEECCCCCCHHHHHHHCCCCC T ss_conf 99998799989799988558861 No 55 >KOG1977 consensus Probab=94.34 E-value=0.028 Score=36.83 Aligned_cols=161 Identities=21% Similarity=0.248 Sum_probs=87.3 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEEECCC-CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEEC Q ss_conf 25445656755578868607797579998389-83999976844536407778850012564250113232898604831 Q gi|254780814|r 40 GLHHMIYEVLDNAIDESLEGYADIITVTLNMD-GSCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYKVSG 118 (803) Q Consensus 40 Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~d-gsisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~sg 118 (803) -|-+.|-|++=||+|- +|+.|.|-++-. =||-|.|||-|+--|-- .++++-.+-+||-....-+.- T Consensus 21 sla~~VeElv~NSiDA----~At~V~v~V~~~t~sv~ViDdG~G~~rdDl---------~~lg~ry~TSK~h~~ndl~~~ 87 (1142) T KOG1977 21 SLAQCVEELVLNSIDA----EATCVAVRVNMETFSVQVIDDGFGMGRDDL---------EKLGNRYFTSKCHSVNDLENP 87 (1142) T ss_pred HHHHHHHHHHHHCCCC----CCEEEEEEECCCEEEEEEEECCCCCCHHHH---------HHHHHHHHHHHCEECCCCCCC T ss_conf 7999999998612255----752799996575067999856877667889---------998765544320000001362 Q ss_pred CCCCC-EEEEEEECC-CEEEEEEE---ECCEEEEEEEECCEECCCCEEECCCCCCCCEEEEEE------ECHHHHH--HH Q ss_conf 16752-037786326-44999999---789599999889836463138732778887089998------7788854--21 Q gi|254780814|r 119 GLHGV-GVSVVNALS-SWLKLRIK---REGNIYEMSFINGILDNPLVVTGSAGNDTGTEVTFL------PSSDIFS--VQ 185 (803) Q Consensus 119 GlhGv-G~s~vNalS-~~~~v~v~---rdgk~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~F~------PD~~iF~--~~ 185 (803) -.-|+ |-.+.++-- +.+.|... |..+.+.-.|-.|.....+. +-.+....||+|+-. |=..++. .. T Consensus 88 ~tyGfRGeALasIsd~s~l~v~skkk~r~~~~~~kk~~~gs~~~~l~-iD~~R~~sGTtVtV~dlfY~lPVRRr~k~~~P 166 (1142) T KOG1977 88 RTYGFRGEALASISDMSSLVVISKKKNRTMKTFVKKFQSGSALKALE-IDVTRASSGTTVTVYDLFYQLPVRRRLKCMDP 166 (1142) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCEECC-CCCCCCCCCCEEEEHHHHHCCHHHHHHHCCCH T ss_conf 00254346664243104464555413885068999874122110366-55332468858984784403345665502887 Q ss_pred HCCHHHHHHHHHHHHCCCCEEEEEEEECC Q ss_conf 00089999999973114641699999635 Q gi|254780814|r 186 DFNYDTLQHRLRELSFLNSSVQISLIDKR 214 (803) Q Consensus 186 ~~~~d~l~~R~~~~A~Ln~gl~i~l~Der 214 (803) .-.++.|.+|+..+|.+.|.+.+.+.|.- T Consensus 167 ~k~fe~Ik~~i~~i~lmHp~iSfsv~~~~ 195 (1142) T KOG1977 167 RKEFEKIKQRIEAISLMHPSISFSVRNDV 195 (1142) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEEEECC T ss_conf 88999999999999752664047887526 No 56 >TIGR02938 nifL_nitrog nitrogen fixation negative regulator NifL; InterPro: IPR014285 NifL modulates the activity of the nitrogen fixation positive regulator protein NifA, acting as a negative regulator by binding NifA. NifA and NifL are encoded by adjacent genes.. Probab=94.30 E-value=0.053 Score=34.66 Aligned_cols=27 Identities=26% Similarity=0.601 Sum_probs=15.8 Q ss_pred EECCEEEEEEEECCEECCCCEEECCCCCCCCEEEEEE Q ss_conf 9789599999889836463138732778887089998 Q gi|254780814|r 140 KREGNIYEMSFINGILDNPLVVTGSAGNDTGTEVTFL 176 (803) Q Consensus 140 ~rdgk~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~F~ 176 (803) +|||..|--+-+ +.| --+.+||++.|. T Consensus 83 rkDg~lYLAeLt----vaP------vlneaGeT~hfl 109 (496) T TIGR02938 83 RKDGELYLAELT----VAP------VLNEAGETTHFL 109 (496) T ss_pred CCCCCHHHHHHC----CCC------HHCCCCCCEEEE T ss_conf 116530022202----020------001468815873 No 57 >PRK11100 sensory histidine kinase CreC; Provisional Probab=94.10 E-value=0.084 Score=33.13 Aligned_cols=12 Identities=25% Similarity=0.080 Sum_probs=4.2 Q ss_pred CCCCCCHHHHHH Q ss_conf 765435468999 Q gi|254780814|r 578 SKGVISKNDLRN 589 (803) Q Consensus 578 ~~~~l~ge~L~~ 589 (803) .|+.+..+++.+ T Consensus 408 ~G~GIp~e~l~~ 419 (475) T PRK11100 408 SGPGIPDYALPR 419 (475) T ss_pred CCCCCCHHHHHH T ss_conf 488878899988 No 58 >PRK09303 adaptive-response sensory kinase; Validated Probab=94.06 E-value=0.1 Score=32.45 Aligned_cols=19 Identities=16% Similarity=0.415 Sum_probs=9.2 Q ss_pred CCCCCCCCCCHHHHHHHHH Q ss_conf 1457765435468999999 Q gi|254780814|r 574 LFCDSKGVISKNDLRNFID 592 (803) Q Consensus 574 ~~~~~~~~l~ge~L~~li~ 592 (803) ...+.|..|..+++.++.. T Consensus 307 sV~DtG~GIp~e~~~rIFe 325 (378) T PRK09303 307 SICDTGPGIPEEEQERIFE 325 (378) T ss_pred EEEEECCCCCHHHHHHHCC T ss_conf 9998078889899987563 No 59 >KOG0787 consensus Probab=94.04 E-value=0.13 Score=31.59 Aligned_cols=49 Identities=18% Similarity=0.243 Sum_probs=26.4 Q ss_pred CCCCCEEEEECCCCC-----CCCHHHHHHHHHHHHHHHHHHCCEEEEE-CCCEEEE Q ss_conf 673738998508888-----5328999999999630778748959994-3833788 Q gi|254780814|r 495 LRYHKIIIMTDADVD-----GAHIRTLLLTFFFRQMPSLIHNGFLYII-KPPLYGI 544 (803) Q Consensus 495 lrY~kiiimtDaD~D-----G~HI~~Llltff~~~~~~li~~g~~yia-~ppl~~~ 544 (803) ++=-.++|-.|.+.- -+|..-+|.-.|-.-|+--+|.+-.--- .||+ +| T Consensus 238 ~~sPel~i~~~~a~~~~vyvPshL~ymlfElfKNamrATve~h~~~~~~~ppI-~V 292 (414) T KOG0787 238 LNSPELIIEGHNALSFTVYVPSHLYYMLFELFKNAMRATVEHHGDDGDELPPI-KV 292 (414) T ss_pred CCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCE-EE T ss_conf 37970675376667676045618999999999999999999744488889985-99 No 60 >PRK10364 sensor protein ZraS; Provisional Probab=93.84 E-value=0.084 Score=33.10 Aligned_cols=23 Identities=22% Similarity=0.258 Sum_probs=13.0 Q ss_pred HHHHHHHHHHCCCCEE-EEEEEECC Q ss_conf 9999999731146416-99999635 Q gi|254780814|r 191 TLQHRLRELSFLNSSV-QISLIDKR 214 (803) Q Consensus 191 ~l~~R~~~~A~Ln~gl-~i~l~Der 214 (803) .++..+.|+|. .||+ -+.++|+. T Consensus 76 ~~q~ll~e~a~-q~~v~~~~v~d~~ 99 (455) T PRK10364 76 QQQALLEEMAG-QPGVLWFAVTDEQ 99 (455) T ss_pred HHHHHHHHHHC-CCCEEEEEEECCC T ss_conf 99999999715-9986999998899 No 61 >PRK11006 phoR phosphate regulon sensor protein; Provisional Probab=93.36 E-value=0.14 Score=31.41 Aligned_cols=22 Identities=9% Similarity=0.006 Sum_probs=12.0 Q ss_pred CCCCCCCCCCCHHHHHHHHHHH Q ss_conf 0145776543546899999999 Q gi|254780814|r 573 ELFCDSKGVISKNDLRNFIDDA 594 (803) Q Consensus 573 l~~~~~~~~l~ge~L~~li~~~ 594 (803) +...++|..+..+.+.++.+.. T Consensus 352 i~V~D~G~GI~~e~~~riFerF 373 (431) T PRK11006 352 FSVEDNGPGIAPEHIPRLTERF 373 (431) T ss_pred EEEEECCCCCCHHHHHHHCCCC T ss_conf 9999838785989999862688 No 62 >PRK04069 serine-protein kinase RsbW; Provisional Probab=92.82 E-value=0.43 Score=27.64 Aligned_cols=92 Identities=16% Similarity=0.156 Sum_probs=56.2 Q ss_pred HHHHHHHHHHHHHHHHCC-CCEEEEEE--CCCC-EEEEEECCCCCCCCCCCCCCCCE-EEEEEEEECCCCCCCCCCCEEE Q ss_conf 456567555788686077-97579998--3898-39999768445364077788500-1256425011323289860483 Q gi|254780814|r 43 HMIYEVLDNAIDESLEGY-ADIITVTL--NMDG-SCTVSDNGRGIPTDLHKEEGISA-AEVIMTRLHAGGKFNQKSYKVS 117 (803) Q Consensus 43 hlv~EiidNsvDe~~~g~-~~~I~V~i--~~dg-sisV~dnGrGIPv~~h~~~~~~~-~E~i~t~L~ag~kfd~~~yk~s 117 (803) =-|-|.+.|+|.-+..+. ...|.|.+ ..|. .|.|+|.|.|.+.+..+..--|. +.--.. + T Consensus 45 LAV~EA~tNaI~Hay~~~~~~~i~i~~~~~~~~l~i~V~D~G~gfD~~~~~~~~~P~~~~~~l~---------~------ 109 (158) T PRK04069 45 IAVSEACTNAVQHAYKEEEVGEINIRFEIYEDRLEIVVADNGDSFDYETTKSKIGPYDPSEPID---------D------ 109 (158) T ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCHHHCCCCCCCCCCCCCHH---------H------ T ss_conf 9999999999997515699947999999959999999999174879567432458888888611---------1------ Q ss_pred CCCCCCEEEEEEECCCEEEEEEEECCEEEEEEE Q ss_conf 116752037786326449999997895999998 Q gi|254780814|r 118 GGLHGVGVSVVNALSSWLKLRIKREGNIYEMSF 150 (803) Q Consensus 118 gGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f 150 (803) -...|.|..+...|....+++. .+|....++- T Consensus 110 ~~~gGlGl~lI~~lmDeV~~~~-~~Gt~v~m~K 141 (158) T PRK04069 110 LREGGLGLFLIETLMDDVTVYK-DSGVTVSMTK 141 (158) T ss_pred CCCCCCHHHHHHHHCCEEEEEE-CCCEEEEEEE T ss_conf 3678740999997525589990-8982999999 No 63 >COG4191 Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms] Probab=92.32 E-value=0.19 Score=30.45 Aligned_cols=11 Identities=18% Similarity=0.326 Sum_probs=6.0 Q ss_pred CEEEEEECCCCE Q ss_conf 757999838983 Q gi|254780814|r 62 DIITVTLNMDGS 73 (803) Q Consensus 62 ~~I~V~i~~dgs 73 (803) +.|.| |+.+|. T Consensus 101 s~iYl-id~~G~ 111 (603) T COG4191 101 SAIYL-IDPTGL 111 (603) T ss_pred CEEEE-ECCCCC T ss_conf 73999-979876 No 64 >PRK11107 hybrid sensory histidine kinase BarA; Provisional Probab=92.29 E-value=0.16 Score=30.86 Aligned_cols=107 Identities=16% Similarity=0.313 Sum_probs=47.2 Q ss_pred CCCCHHHCCHHHHHCCHHHHHHHHHHCCCCC-----CCCCC-CCCCCCCCEEEEEC---CCCCCCCHHHHHHHHHHHHHH Q ss_conf 2033443058766323667888876257877-----53321-23567373899850---888853289999999996307 Q gi|254780814|r 456 KILNVERARFDKMLSSQEIGTLITALGTGIG-----QDSFD-INKLRYHKIIIMTD---ADVDGAHIRTLLLTFFFRQMP 526 (803) Q Consensus 456 KilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g-----~~~~~-~~~lrY~kiiimtD---aD~DG~HI~~Llltff~~~~~ 526 (803) +||=|++-... =+-+..+..-+|+.+- .+-.+ ...-+|. + |++| ..-||.- . |=--|.+| T Consensus 670 rVLvVEDN~iN----~~vi~~lL~~lG~~v~~A~nG~eAle~~~~~~fD-L-ILMDiqMP~MDG~e-A----Tr~IR~~~ 738 (920) T PRK11107 670 TVMAVDDNPAN----LKLIGALLEEQVEHVVLCDSGHQAVELAKQRPFD-L-IFMDIQMPGMDGIR-A----CELIRQLP 738 (920) T ss_pred EEEEECCCHHH----HHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCCC-E-EEECCCCCCCCHHH-H----HHHHHHCC T ss_conf 49998488999----9999999998498899989999999997479997-8-99889999998999-9----99998186 Q ss_pred HHHHCCEEEEECCCEEEEEECCCEEEEECHHHHHHHHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 78748959994383378872671799814578989998500110000145776543546899999999 Q gi|254780814|r 527 SLIHNGFLYIIKPPLYGITRGKSLQYVKDEESLEDYLINQSLSEEIELFCDSKGVISKNDLRNFIDDA 594 (803) Q Consensus 527 ~li~~g~~yia~ppl~~~~~~K~E~YikDD~EL~~~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~~ 594 (803) . . .+.|..-++-.- -.+=.+-.++.|+++.. .+++..+.|...+..+ T Consensus 739 ~---~-----~~~PIIAlTA~a-------~~~dre~cl~aGMddyL------tKPI~~~~L~~~L~~~ 785 (920) T PRK11107 739 H---N-----QNTPIIAVTAHA-------MAGERERLLGAGMDDYL------AKPIDEAALKQVLLRW 785 (920) T ss_pred C---C-----CCCEEEEEECCC-------CHHHHHHHHHCCCCEEE------CCCCCHHHHHHHHHHH T ss_conf 7---8-----989599997998-------98999999977999387------2879999999999986 No 65 >COG2172 RsbW Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms] Probab=90.99 E-value=1.3 Score=23.92 Aligned_cols=83 Identities=23% Similarity=0.256 Sum_probs=53.7 Q ss_pred HHHHHHHHHHHHHHHHHHCCC----CEEEEEECCCC-EEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCE Q ss_conf 544565675557886860779----75799983898-3999976844536407778850012564250113232898604 Q gi|254780814|r 41 LHHMIYEVLDNAIDESLEGYA----DIITVTLNMDG-SCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYK 115 (803) Q Consensus 41 l~hlv~EiidNsvDe~~~g~~----~~I~V~i~~dg-sisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk 115 (803) +..-+.|.+.|+|.-+-.+-+ -.|.+++..+. .|.|+|.|.||+.- +..+... |. T Consensus 41 l~~av~E~~~N~v~Ha~~~~~~~g~I~i~~~~~~~~~~i~i~D~G~~~~~~----------~~~~~~~----------~~ 100 (146) T COG2172 41 LAIAVSEALTNAVKHAYKLDPSEGEIRIEVSLDDGKLEIRIWDQGPGIEDL----------EESLGPG----------DT 100 (146) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCEEEEEEEECCCCCCCH----------HHHCCCC----------CC T ss_conf 999999999899997630389986599999970993999999579898788----------8853888----------88 Q ss_pred EECCC--CCCEEEEEEECCCEEEEEEEECC Q ss_conf 83116--75203778632644999999789 Q gi|254780814|r 116 VSGGL--HGVGVSVVNALSSWLKLRIKREG 143 (803) Q Consensus 116 ~sgGl--hGvG~s~vNalS~~~~v~v~rdg 143 (803) ..+++ .|+|.-+..-+...|.++...++ T Consensus 101 ~~~~~~~~G~Gl~l~~~~~D~~~~~~~~~~ 130 (146) T COG2172 101 TAEGLQEGGLGLFLAKRLMDEFSYERSEDG 130 (146) T ss_pred CCCCCCCCCCCHHHHHHHHEEEEEEEECCC T ss_conf 887532473137888621107999960698 No 66 >COG5000 NtrY Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation [Signal transduction mechanisms] Probab=90.80 E-value=0.46 Score=27.37 Aligned_cols=33 Identities=15% Similarity=0.117 Sum_probs=15.2 Q ss_pred CCHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 100001457765435468999999999999999 Q gi|254780814|r 569 SEEIELFCDSKGVISKNDLRNFIDDALKIDKLI 601 (803) Q Consensus 569 ~~~~l~~~~~~~~l~ge~L~~li~~~~~~~~~l 601 (803) +++..+..-..+.+...+|..++........+- T Consensus 545 deF~afARmP~p~~e~~dL~~ll~e~~~L~e~~ 577 (712) T COG5000 545 DEFRAFARMPAPKLEKSDLRALLKEVSFLYEIG 577 (712) T ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 999987148998777422999999999999626 No 67 >PRK11466 hybrid sensory histidine kinase TorS; Provisional Probab=90.09 E-value=0.45 Score=27.43 Aligned_cols=54 Identities=33% Similarity=0.500 Sum_probs=31.9 Q ss_pred CCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC---EEEEEECCCCCCCCC Q ss_conf 871543279963254456567555788686077975799983898---399997684453640 Q gi|254780814|r 28 PSMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDG---SCTVSDNGRGIPTDL 87 (803) Q Consensus 28 P~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dg---sisV~dnGrGIPv~~ 87 (803) |..+.|| ..-|.++++-+++|||----.| .|.+....++ .++|+|.|.|||-+. T Consensus 552 p~~v~gD---~~RLrQIL~NLlsNAiKFT~~G---~V~l~~~~~~~~l~~~V~DTGiGI~~e~ 608 (912) T PRK11466 552 PSALMGD---PRRIRQVITNLLSNALRFTDEG---SIVLRSRTDGEQWLVEVEDSGCGIDPAK 608 (912) T ss_pred CCEEEEC---HHHHHHHHHHHHHHHCCCCCCC---EEEEEEEECCCEEEEEEEECCCCCCHHH T ss_conf 8608635---8899999999999870618996---7999999869789999850899999999 No 68 >PRK13837 two-component VirA-like sensor kinase; Provisional Probab=88.89 E-value=0.94 Score=24.99 Aligned_cols=73 Identities=12% Similarity=0.131 Sum_probs=35.5 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEEEECCCEEEEECHHHHHHHHHHHCCCCHHCCCCCCCCCCCHHHHH Q ss_conf 85328999999999630778748959994383378872671799814578989998500110000145776543546899 Q gi|254780814|r 509 DGAHIRTLLLTFFFRQMPSLIHNGFLYIIKPPLYGITRGKSLQYVKDEESLEDYLINQSLSEEIELFCDSKGVISKNDLR 588 (803) Q Consensus 509 DG~HI~~Llltff~~~~~~li~~g~~yia~ppl~~~~~~K~E~YikDD~EL~~~Ll~~gl~~~~l~~~~~~~~l~ge~L~ 588 (803) |-++|.-.|+|+.-.=.--+-+.|.|-|...... + ..+..+..--+.-| +-..+...++|..+..+.+. T Consensus 560 D~~QL~QVllNL~~NA~dAM~~gG~l~I~~~~~~-~---------~~~~~~~~~~~~~G-~yv~L~VsDtG~GI~~e~l~ 628 (831) T PRK13837 560 NPAQLQQVILNLCKNAAQAMDENGRVDIRLSRVD-L---------RKPKVLAHGTAPPG-RYVLLRVSDTGRGIDEAVLP 628 (831) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEE-C---------CCHHHHCCCCCCCC-CEEEEEEEECCCCCCHHHHH T ss_conf 7999999999999999997679986999985553-3---------63232115778998-98999998679998989996 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780814|r 589 NFID 592 (803) Q Consensus 589 ~li~ 592 (803) ++.+ T Consensus 629 RIFE 632 (831) T PRK13837 629 HIFE 632 (831) T ss_pred HHCC T ss_conf 6048 No 69 >PRK11091 aerobic respiration control sensor protein ArcB; Provisional Probab=88.69 E-value=0.18 Score=30.61 Aligned_cols=114 Identities=22% Similarity=0.427 Sum_probs=61.4 Q ss_pred EECCC-CCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCC-----CCCC-CCCCCCCCEEEEEC---CCCCCCCHHHHHHHH Q ss_conf 30056-20334430587663236678888762578775-----3321-23567373899850---888853289999999 Q gi|254780814|r 451 LPLRG-KILNVERARFDKMLSSQEIGTLITALGTGIGQ-----DSFD-INKLRYHKIIIMTD---ADVDGAHIRTLLLTF 520 (803) Q Consensus 451 ~PLrG-KilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~-----~~~~-~~~lrY~kiiimtD---aD~DG~HI~~Llltf 520 (803) .|+.| +||=|++-...+ .-+..+..-+|+.+.. +-.+ .+.-+|. +|++| .+-||.-..-. T Consensus 521 ~~~~~lrILvVEDn~iN~----~V~~~~L~~lG~~v~~A~nG~eALe~~~~~~fD--LILMDiqMP~MDG~Eatr~---- 590 (779) T PRK11091 521 MPLPALNILLVEDIELNV----IVARSLLEKLGNSVDVAMTGKDALEMFKPGEYD--LVLLDIQLPDMTGLDVARE---- 590 (779) T ss_pred CCCCCCEEEEECCCHHHH----HHHHHHHHHCCCEEEEECCHHHHHHHHCCCCCC--EEEEECCCCCCCHHHHHHH---- T ss_conf 688888189986889999----999999997599999989999999985069999--8998289999978999999---- Q ss_pred HHHHHHHHHHCCEEEEECCCEEEEEECCCEEEEECHHHHHHHHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 9963077874895999438337887267179981457898999850011000014577654354689999999999 Q gi|254780814|r 521 FFRQMPSLIHNGFLYIIKPPLYGITRGKSLQYVKDEESLEDYLINQSLSEEIELFCDSKGVISKNDLRNFIDDALK 596 (803) Q Consensus 521 f~~~~~~li~~g~~yia~ppl~~~~~~K~E~YikDD~EL~~~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~~~~ 596 (803) .|. ..-+-..||+.-++..- ++|. +++ ++.|+++.. .++++-+.|...+..+.. T Consensus 591 -IR~-------~~~~~~~~PIIALTAna----~~d~---~~~-l~aGMdd~L------~KPi~~~~L~~~l~~~~~ 644 (779) T PRK11091 591 -LRE-------RYGREDLPPLVALTANV----LKDK---QEY-LDAGMDDVL------SKPLSVPALTAMIKKFWD 644 (779) T ss_pred -HHH-------CCCCCCCCCEEEEECCC----CHHH---HHH-HHCCCCCEE------ECCCCHHHHHHHHHHHHC T ss_conf -984-------25768999279881874----1769---999-976998475------188989999999999846 No 70 >TIGR02966 phoR_proteo phosphate regulon sensor kinase PhoR; InterPro: IPR014310 Members of this protein are the regulatory histidine kinase PhoR associated with the phosphate ABC transporter found in most Proteobacteria, and related proteins from Gram-positive organisms are excluded. The phoR gene usually is adjacent to the response regulator phoB gene (IPR011879 from INTERPRO).. Probab=88.30 E-value=0.69 Score=26.01 Aligned_cols=29 Identities=10% Similarity=0.087 Sum_probs=13.9 Q ss_pred CHHCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 00001457765435468999999999999 Q gi|254780814|r 570 EEIELFCDSKGVISKNDLRNFIDDALKID 598 (803) Q Consensus 570 ~~~l~~~~~~~~l~ge~L~~li~~~~~~~ 598 (803) ++.+-..++|..|..+.|-.|.+...+++ T Consensus 267 ga~fsV~DtG~GI~~eHipRLTERFYRVD 295 (339) T TIGR02966 267 GAEFSVTDTGIGIAPEHIPRLTERFYRVD 295 (339) T ss_pred EEEEEEEECCCCCCHHHCCCCCCEEEEEC T ss_conf 03999987798987313775420012305 No 71 >PRK10547 chemotaxis protein CheA; Provisional Probab=87.82 E-value=0.5 Score=27.13 Aligned_cols=21 Identities=14% Similarity=0.207 Sum_probs=10.4 Q ss_pred EEEEECCCEEEECCHHHHHHH Q ss_conf 027655825873998999999 Q gi|254780814|r 705 LSLICNGDKIKITGPCSLLDT 725 (803) Q Consensus 705 ~~l~~~~~~~~i~s~~el~~~ 725 (803) +.+..+++-..+-++.+|+.. T Consensus 631 atiLgdG~vvlILd~~~L~~~ 651 (662) T PRK10547 631 ATILGDGSVALIVDVSALQAL 651 (662) T ss_pred EEECCCCCEEEEECHHHHHHH T ss_conf 999679719999849999999 No 72 >PRK10841 hybrid sensory kinase in two-component regulatory system with RcsB and YojN; Provisional Probab=87.40 E-value=0.64 Score=26.25 Aligned_cols=25 Identities=16% Similarity=0.349 Sum_probs=15.4 Q ss_pred CEEEEEECCCCEEEEEECCCCCCCCC Q ss_conf 75799983898399997684453640 Q gi|254780814|r 62 DIITVTLNMDGSCTVSDNGRGIPTDL 87 (803) Q Consensus 62 ~~I~V~i~~dgsisV~dnGrGIPv~~ 87 (803) +.|-+ |..+..+.+-=.+|..|++. T Consensus 143 ~~~~~-~~~~~~~~~~~~~~~~~~~~ 167 (947) T PRK10841 143 NRVFL-IGSDNLCMANFGLRDMPVER 167 (947) T ss_pred CEEEE-ECCCCEEEEECCCCCCCCCH T ss_conf 45899-73884067632434676870 No 73 >PRK04031 DNA primase; Provisional Probab=87.14 E-value=2.3 Score=21.95 Aligned_cols=156 Identities=21% Similarity=0.436 Sum_probs=89.4 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECC Q ss_conf 03211102689999874003579998778999887755455321100110012467667666655555552179998255 Q gi|254780814|r 354 SVVESFVNDGLSAWLEEHPLEAKIIVKKVLDASMVRDAARRARDLTRRKGVLDIASLPGKLADCSERDPKKSELFLVEGD 433 (803) Q Consensus 354 ~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~~ar~~a~kar~~~rrK~~~~~~~lpgKL~Dc~~k~~~~~eL~ivEGD 433 (803) .+++.+ ++-|..|+++.....+.|++.+-++.+.- +.+. ....+||. ...=..--++++|||- T Consensus 115 ~i~~RA-kell~~~~~~~~~~~~~i~~ev~~~~~~~-------~~~~----yg~~~lpa-----gp~v~~sd~iIvVEGR 177 (420) T PRK04031 115 KIIERA-KEILKKWFDEKIPDSKEIIEEVREAVRVE-------EIIE----YGPEKLPA-----GPNVDDSDAIIVVEGR 177 (420) T ss_pred HHHHHH-HHHHHHHHHCCCCCHHHHHHHHHHHHHHH-------HHHE----ECCCCCCC-----CCCCCCCCEEEEEECH T ss_conf 999999-99999997636821999999999886036-------4411----16567899-----9897888808999565 Q ss_pred CCCCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCH Q ss_conf 75432233465443023300562033443058766323667888876257877533212356737389985088885328 Q gi|254780814|r 434 SAGGSAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDINKLRYHKIIIMTDADVDGAHI 513 (803) Q Consensus 434 SAggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~~lrY~kiiimtDaD~DG~HI 513 (803) .- +||..+. -|.|-+..-|+.+-+.--++++-+ .+...+|-|.-|-- T Consensus 178 aD---------------------V~nLLk~---------GikN~ia~~G~~ip~~i~~L~~~k--~vtaF~DGDrGG~l- 224 (420) T PRK04031 178 AD---------------------VLNLLRY---------GIKNVIAVEGTSVPETIIELSKKK--TVTAFLDGDRGGEL- 224 (420) T ss_pred HH---------------------HHHHHHH---------CCCCEEEECCCCCCHHHHHHHCCC--EEEEEECCCCCHHH- T ss_conf 88---------------------9999873---------633278626888888999986386--58999538854378- Q ss_pred HHHHHHHHHHHHHHHHHCCEE-EEECCCEEEEEECCCEEEEECHHHHHHHHHHHCCCCHHCCCCCCCCCCCHHHHHHHHH Q ss_conf 999999999630778748959-9943833788726717998145789899985001100001457765435468999999 Q gi|254780814|r 514 RTLLLTFFFRQMPSLIHNGFL-YIIKPPLYGITRGKSLQYVKDEESLEDYLINQSLSEEIELFCDSKGVISKNDLRNFID 592 (803) Q Consensus 514 ~~Llltff~~~~~~li~~g~~-yia~ppl~~~~~~K~E~YikDD~EL~~~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~ 592 (803) +| ++|++.+.| |+|+.|- ||+ -+.|++.++.+.|. T Consensus 225 ---iL-------KELlqVadIDYVArAP~-----GKE-----------------------------VEeLT~KEI~KALR 260 (420) T PRK04031 225 ---IL-------KELLQVADIDYVARAPP-----GKE-----------------------------VEELTKKEIAKALR 260 (420) T ss_pred ---HH-------HHHHHHCCEEEEEECCC-----CCC-----------------------------HHHCCHHHHHHHHH T ss_conf ---99-------99985467359997989-----963-----------------------------53268879999987 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999986 Q gi|254780814|r 593 DALKIDKLIKD 603 (803) Q Consensus 593 ~~~~~~~~l~~ 603 (803) +-.-++.++.. T Consensus 261 ~KVP~eq~~~~ 271 (420) T PRK04031 261 NKVPVEQYLEE 271 (420) T ss_pred CCCCHHHHHHH T ss_conf 24888999876 No 74 >COG3852 NtrB Signal transduction histidine kinase, nitrogen specific [Signal transduction mechanisms] Probab=85.64 E-value=1.5 Score=23.40 Aligned_cols=10 Identities=30% Similarity=0.361 Sum_probs=5.0 Q ss_pred HHHHHHHHHC Q ss_conf 6788887625 Q gi|254780814|r 473 EIGTLITALG 482 (803) Q Consensus 473 Ei~~l~~~lG 482 (803) -++|-.+|+| T Consensus 249 lVrNAaqA~~ 258 (363) T COG3852 249 LVRNAAQALG 258 (363) T ss_pred HHHHHHHHHC T ss_conf 9999999716 No 75 >COG4564 Signal transduction histidine kinase [Signal transduction mechanisms] Probab=84.27 E-value=2.5 Score=21.66 Aligned_cols=40 Identities=23% Similarity=0.413 Sum_probs=23.5 Q ss_pred CCCCCCEEEE-------EECCCCCCCCCCE--EHHHHHHHHHHHHHHHH Q ss_conf 1775412122-------2246216667721--03568889999999998 Q gi|254780814|r 266 NDGYHENVLC-------FTNNIPQKDGGTH--LSGLRSALTRQITSYVE 305 (803) Q Consensus 266 ~~~~~e~i~S-------fvN~I~T~~GGtH--v~gf~~al~~~i~~y~~ 305 (803) .||..++.+| -.|-+.++.||.| .+--..+|.-+|++.-+ T Consensus 259 HDGIsQ~LVs~k~~lela~~ql~~p~~~a~~aieKaa~aL~~Ai~EVRR 307 (459) T COG4564 259 HDGISQNLVSVKCALELAARQLNPPKGGAHPAIEKAADALNGAIKEVRR 307 (459) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 5007777799999999996267799888750156678999989999998 No 76 >PRK13557 histidine kinase; Provisional Probab=83.70 E-value=2.1 Score=22.34 Aligned_cols=11 Identities=9% Similarity=0.190 Sum_probs=7.5 Q ss_pred CCCEEEEEECC Q ss_conf 89839999768 Q gi|254780814|r 70 MDGSCTVSDNG 80 (803) Q Consensus 70 ~dgsisV~dnG 80 (803) .||.|.-+|.+ T Consensus 49 ~Dg~I~~vN~a 59 (538) T PRK13557 49 PDNPIVFANRA 59 (538) T ss_pred CCCEEEEECHH T ss_conf 99879998399 No 77 >TIGR02956 TMAO_torS TMAO reductase sytem sensor TorS; InterPro: IPR014302 This protein, TorS, is part of a regulatory system for the torCAD operon that encodes the pterin molybdenum cofactor-containing enzyme trimethylamine-N-oxide (TMAO) reductase (TorA), a cognate chaperone (TorD), and a penta-haem cytochrome (TorC). TorS works together with the inducer-binding protein TorT and the response regulator TorR. TorS contains histidine kinase ATPase (IPR003594 from INTERPRO), HAMP (IPR003660 from INTERPRO), phosphoacceptor (IPR003661 from INTERPRO), and phosphotransfer (IPR008207 from INTERPRO) domains and a response regulator receiver domain (IPR001789 from INTERPRO).. Probab=79.36 E-value=0.92 Score=25.06 Aligned_cols=113 Identities=26% Similarity=0.428 Sum_probs=66.1 Q ss_pred CCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC-------EEEEEECCCCCCCCCCCCCCCCEEEEEE Q ss_conf 871543279963254456567555788686077975799983898-------3999976844536407778850012564 Q gi|254780814|r 28 PSMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDG-------SCTVSDNGRGIPTDLHKEEGISAAEVIM 100 (803) Q Consensus 28 P~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dg-------sisV~dnGrGIPv~~h~~~~~~~~E~i~ 100 (803) |..+-||. .=+++++.=.|.|||=---.|- =.|.|-++.|. .+.|.|-|-|||= +..-.+--.| T Consensus 613 ~~~~~gD~---~riRQvL~NLvgNaIKFT~~Gs-v~l~~~l~~~~~~gdsel~F~V~DtG~GIae-----~~~~~lF~aF 683 (1052) T TIGR02956 613 PNWWQGDL---RRIRQVLINLVGNAIKFTDRGS-VVLRVSLNDDSSSGDSELLFEVEDTGVGIAE-----EEQATLFDAF 683 (1052) T ss_pred HHHHHCCC---CHHHHHHHHHHHCCCEECCCCE-EEEEEEECCCCCCCCCEEEEEEEECCCCCCH-----HHHHHHCCCC T ss_conf 78875366---1344767654413200434526-9999885588898663478888624779987-----9998543310 Q ss_pred EEECCCCCCCCCCCEEECCCCCCEEE----EEEECCCE---EEEEEEEC-CE--EEEEEEECCEECC Q ss_conf 25011323289860483116752037----78632644---99999978-95--9999988983646 Q gi|254780814|r 101 TRLHAGGKFNQKSYKVSGGLHGVGVS----VVNALSSW---LKLRIKRE-GN--IYEMSFINGILDN 157 (803) Q Consensus 101 t~L~ag~kfd~~~yk~sgGlhGvG~s----~vNalS~~---~~v~v~rd-gk--~~~q~f~~g~~~~ 157 (803) +++-+| -|.+|| -|.|.. +|.|---. +.|+..-+ |- .|...+.+|.|.. T Consensus 684 ~Qa~eg-------~~~~gG-TGLGLAIs~~Lv~AM~G~GrGl~v~S~~~~GScF~F~lpl~~~~~~~ 742 (1052) T TIGR02956 684 TQADEG-------RRKSGG-TGLGLAISRRLVEAMDGEGRGLGVESELGQGSCFWFTLPLARGKPAE 742 (1052) T ss_pred CCCCCC-------CCCCCC-CCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCEEEEECCCCCCCHHC T ss_conf 121013-------731377-65068999999997088876501666458750221242313375000 No 78 >TIGR02916 PEP_his_kin putative PEP-CTERM system histidine kinase; InterPro: IPR014265 Proteins in this entry are putative periplasmic sensor signal transduction histidine kinases. They all contain a GAF domain that is present in phytochromes and cGMP-specific phosphodiesterases, and which has been experimentally proven to be involved in protein:protein interactions. They also contain a C-terminal histidine kinase domain, which is composed of a dimerisation sub-domain and an ATP/ADP-binding phosphotransfer, or catalytic, sub-domain. The proteins in this entry are found strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria, where members of IPR014264 from INTERPRO and IPR014266 from INTERPRO also occur.. Probab=77.94 E-value=4.9 Score=19.39 Aligned_cols=10 Identities=30% Similarity=0.517 Sum_probs=5.4 Q ss_pred CCCCCCHHHH Q ss_conf 3357210020 Q gi|254780814|r 338 RFSSQTKEKL 347 (803) Q Consensus 338 ~F~gQTK~kL 347 (803) =|+|+++.|| T Consensus 288 L~SG~lRa~l 297 (696) T TIGR02916 288 LFSGTLRARL 297 (696) T ss_pred HHHHHHHHHH T ss_conf 9602341333 No 79 >cd01027 TOPRIM_RNase_M5_like TOPRIM_ RNase M5_like: The topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain found in Ribonuclease M5: (RNase M5) and other small primase-like proteins from bacteria and archaea. RNase M5 catalyzes the maturation of 5S rRNA in low G+C Gram-positive bacteria. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). The conserved glutamate may act as a general base in nucleotide polymerization by primases. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=77.57 E-value=0.59 Score=26.57 Aligned_cols=26 Identities=38% Similarity=0.694 Sum_probs=22.7 Q ss_pred CCCCEEEEECCCCCCCCHHHHHHHHH Q ss_conf 73738998508888532899999999 Q gi|254780814|r 496 RYHKIIIMTDADVDGAHIRTLLLTFF 521 (803) Q Consensus 496 rY~kiiimtDaD~DG~HI~~Llltff 521 (803) .|..||||||.|--|-.||--|.-.| T Consensus 44 ~~~~VIIlTD~D~~Gekirk~i~~~l 69 (81) T cd01027 44 AYRGVIILTDPDRKGEKIRKKLSEYL 69 (81) T ss_pred HCCCEEEEECCCCCHHHHHHHHHHHH T ss_conf 55985999689967079999999982 No 80 >COG5002 VicK Signal transduction histidine kinase [Signal transduction mechanisms] Probab=77.16 E-value=3.2 Score=20.82 Aligned_cols=26 Identities=31% Similarity=0.414 Sum_probs=15.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 45776543546899999999999999 Q gi|254780814|r 575 FCDSKGVISKNDLRNFIDDALKIDKL 600 (803) Q Consensus 575 ~~~~~~~l~ge~L~~li~~~~~~~~~ 600 (803) ..+.|..+.++++.++.+...++.+. T Consensus 379 I~D~G~gIPk~d~~~iFdrfyRvdkA 404 (459) T COG5002 379 ISDQGLGIPKEDLEKIFDRFYRVDKA 404 (459) T ss_pred ECCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 74688899840689999988522366 No 81 >smart00493 TOPRIM topoisomerases, DnaG-type primases, OLD family nucleases and RecR proteins. Probab=75.18 E-value=5.1 Score=19.26 Aligned_cols=70 Identities=26% Similarity=0.392 Sum_probs=38.3 Q ss_pred EEEEEECCCCCCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEC Q ss_conf 79998255754322334654430233005620334430587663236678888762578775332123567373899850 Q gi|254780814|r 426 ELFLVEGDSAGGSAKQGRSRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDINKLRYHKIIIMTD 505 (803) Q Consensus 426 eL~ivEGDSAggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~~lrY~kiiimtD 505 (803) +|+||||-+-.-+..+.....+ =+.++.|-+++ .++..++. ...+ + ..|||+|| T Consensus 2 ~liIVEg~~d~~~i~~~~~~~~-~~~~~~G~~~~------------~~~i~~l~--------~~~~--~---~eViiatD 55 (76) T smart00493 2 VLIIVEGPADAIALEKAGGFGG-NVVALGGHLLK------------KEIIKLLK--------RLAK--K---KEVILATD 55 (76) T ss_pred EEEEECCHHHHHHHHHHCCCCE-EEEECCCCCCC------------HHHHHHHH--------HHHC--C---CEEEEECC T ss_conf 7999878889999998569997-99982152350------------44899999--------8633--8---97999769 Q ss_pred CCCCCCCHHHHHHHHH Q ss_conf 8888532899999999 Q gi|254780814|r 506 ADVDGAHIRTLLLTFF 521 (803) Q Consensus 506 aD~DG~HI~~Llltff 521 (803) .|..|--|+--+.-.+ T Consensus 56 ~D~eGe~~a~~i~~~l 71 (76) T smart00493 56 PDREGEAIAWKLAELL 71 (76) T ss_pred CCCCHHHHHHHHHHHH T ss_conf 8944699999999987 No 82 >COG1658 Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair] Probab=74.85 E-value=2.6 Score=21.49 Aligned_cols=27 Identities=37% Similarity=0.689 Sum_probs=22.4 Q ss_pred CCCCCEEEEECCCCCCCCHHHHHHHHH Q ss_conf 673738998508888532899999999 Q gi|254780814|r 495 LRYHKIIIMTDADVDGAHIRTLLLTFF 521 (803) Q Consensus 495 lrY~kiiimtDaD~DG~HI~~Llltff 521 (803) ++|..|||+||.|.-|.-||.-|.-.| T Consensus 53 ~~~k~VIILTD~D~~Ge~Irk~l~~~l 79 (127) T COG1658 53 QKYKGVIILTDPDRKGERIRKKLKEYL 79 (127) T ss_pred HCCCCEEEEECCCCCHHHHHHHHHHHC T ss_conf 425877998689856289999999971 No 83 >TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain; InterPro: IPR013428 Proteins containing this domain also contain several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (IPR013427 from INTERPRO) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) (Q44091 from SWISSPROT) shows some homolgy to this domain but lacks additional sequence regions found in this entry and belongs to a different clade of the larger family of homologues. . Probab=74.42 E-value=1.7 Score=22.91 Aligned_cols=15 Identities=20% Similarity=0.375 Sum_probs=7.5 Q ss_pred CEEEEEEECHHHHHH Q ss_conf 708999877888542 Q gi|254780814|r 170 GTEVTFLPSSDIFSV 184 (803) Q Consensus 170 GT~I~F~PD~~iF~~ 184 (803) |+--+|-|+-..|+. T Consensus 198 g~~~~~~P~~~~l~~ 212 (459) T TIGR02604 198 GGLFRYNPDGGKLEV 212 (459) T ss_pred CEEEEECCCCCEEEE T ss_conf 315787444857899 No 84 >COG0643 CheA Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms] Probab=72.52 E-value=4.1 Score=20.03 Aligned_cols=25 Identities=16% Similarity=0.177 Sum_probs=15.3 Q ss_pred CEEEEECCCEEEECCHHHHHHHHHH Q ss_conf 5027655825873998999999998 Q gi|254780814|r 704 ILSLICNGDKIKITGPCSLLDTIFS 728 (803) Q Consensus 704 ~~~l~~~~~~~~i~s~~el~~~i~~ 728 (803) .+.+...++-..+-++..|+.+... T Consensus 687 GATILGDG~V~lIlDv~~li~~~~~ 711 (716) T COG0643 687 GATILGDGSVVLILDVAALLRLARK 711 (716) T ss_pred EEEEECCCEEEEEECHHHHHHHHHH T ss_conf 6688269629999708999999763 No 85 >PRK11558 hypothetical protein; Provisional Probab=69.90 E-value=5.5 Score=19.04 Aligned_cols=57 Identities=26% Similarity=0.458 Sum_probs=32.3 Q ss_pred CCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC-EEEEEECCCCCCCCC Q ss_conf 45871543279963254456567555788686077975799983898-399997684453640 Q gi|254780814|r 26 KRPSMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDG-SCTVSDNGRGIPTDL 87 (803) Q Consensus 26 kRP~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dg-sisV~dnGrGIPv~~ 87 (803) -|||.|+|+.+ ......+||.+...+.+ |.|.-+.-+=++-| .+-..-+-|-.|||. T Consensus 26 v~~GVyVG~~S--~rVRd~lW~~v~~~~~~---G~avmv~~~~nEqG~~~~t~G~~~r~~vD~ 83 (97) T PRK11558 26 VRAGVYVGDVS--KRIREMIWQQVTQLAEE---GNVVMAWATNTESGFEFQTFGENRRIPVDL 83 (97) T ss_pred CCCCCEECCCC--HHHHHHHHHHHHHHCCC---CCEEEEECCCCCCCEEEEEECCCCCCEEEC T ss_conf 16875783878--79999999999966779---858999737998855899817998615856 No 86 >KOG0019 consensus Probab=69.27 E-value=5.6 Score=18.98 Aligned_cols=122 Identities=19% Similarity=0.272 Sum_probs=65.1 Q ss_pred HHHHHHHHHHHHH-------H-----HCCCCEEEEEECCC-CEEEEEECCCCCCCCCCCCCCCCEEEEE--EEE-ECCCC Q ss_conf 5656755578868-------6-----07797579998389-8399997684453640777885001256--425-01132 Q gi|254780814|r 44 MIYEVLDNAIDES-------L-----EGYADIITVTLNMD-GSCTVSDNGRGIPTDLHKEEGISAAEVI--MTR-LHAGG 107 (803) Q Consensus 44 lv~EiidNsvDe~-------~-----~g~~~~I~V~i~~d-gsisV~dnGrGIPv~~h~~~~~~~~E~i--~t~-L~ag~ 107 (803) .+.|.+.||-|-. + .|---.|.++.+++ +.+|+.|.|-|+--+ |+| +|+ ..+|+ T Consensus 61 FlRELISNaSDAldKiRy~~lt~~~~~~~~l~I~i~~nk~~~tlti~DtGIGMTk~----------dLvnnLGTIAkSGt 130 (656) T KOG0019 61 FLRELISNASDALEKLRYLELKGDEKALPELEIRIITNKDKRTITIQDTGIGMTKE----------DLVNNLGTIAKSGS 130 (656) T ss_pred HHHHHHCCCCCHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCEEEEEECCCCCCHH----------HHHHHHHHHHHCCC T ss_conf 88765314640477788776247433566326896158876558998447776889----------99865515643465 Q ss_pred C-CC-----CCCCEEECCCCCCEEEEEEECCCEEEEEEEECCEEEEEEEECCEECCCCEEECCCCCCCCEEEEEEE Q ss_conf 3-28-----9860483116752037786326449999997895999998898364631387327788870899987 Q gi|254780814|r 108 K-FN-----QKSYKVSGGLHGVGVSVVNALSSWLKLRIKREGNIYEMSFINGILDNPLVVTGSAGNDTGTEVTFLP 177 (803) Q Consensus 108 k-fd-----~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~F~P 177 (803) | |- ..+..---|.-|||.--.....++..|....+.-. .-.|+.+..-+- .+...+.-.+||+|.-.- T Consensus 131 K~Fmealkea~ad~~~IGQFGvGFYSaylVAdkV~V~tk~~~~e-~y~Wes~~~gs~-~v~~~~~~~rGTki~l~l 204 (656) T KOG0019 131 KAFLEALKEAEAESNLIGQFGVGFYSAFMVADRVVVTTRHPADE-GLQWTSNGRGSY-EIAEASGLRTGTKIVIHL 204 (656) T ss_pred HHHHHHHHHCCCCHHHHHHCCCCHHHHHHHHHEEEEEECCCCCC-CEEEECCCCCCE-EEEECCCCCCCCEEEEEE T ss_conf 89999887634411122110100456665123268861268874-104313788726-876236755530477650 No 87 >cd01026 TOPRIM_OLD TOPRIM_OLD: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in bacterial and archaeal nucleases of the OLD (overcome lysogenization defect) family. The bacteriophage P2 OLD protein, which has DNase as well as RNase activity, consists of an N-terminal ABC-type ATPase domain and a C-terminal Toprim domain; the nuclease activity of OLD is stimulated by ATP, though the ATPase activity is not DNA-dependent. Functional details on OLD are scant and further experimentation is required to define the relationship between the ATPase and Toprim nuclease domains. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). The conserved glutamate may act as a general acid in strand cleavage by nucleases. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=67.47 E-value=8.3 Score=17.63 Aligned_cols=59 Identities=36% Similarity=0.529 Sum_probs=34.3 Q ss_pred EEEEEECCCCCC----CC-CCCC--CCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 799982557543----22-3346--5443023300562033443058766323667888876257877533212356737 Q gi|254780814|r 426 ELFLVEGDSAGG----SA-KQGR--SRENQAILPLRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDINKLRYH 498 (803) Q Consensus 426 eL~ivEGDSAgg----sa-k~gR--dr~~qai~PLrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~~lrY~ 498 (803) -.+||||+|=-. -| +.|. +...-.|.|++|| |... ...|+..+|. T Consensus 5 ~viLVEG~tE~~~l~~~~~~~~~~l~~~~i~ii~v~G~--~~~~-----------~~~l~~~l~i--------------- 56 (97) T cd01026 5 KVILVEGDSEEILLPALAKKLGLDLDEAGISIIPVGGK--NFKP-----------FIKLLNALGI--------------- 56 (97) T ss_pred EEEEEECHHHHHHHHHHHHHHCCCHHHCCCEEEEECCC--CHHH-----------HHHHHHHCCC--------------- T ss_conf 79999441399999999999697977879789985892--7499-----------9999997699--------------- Q ss_pred CEEEEECCCCCCCC Q ss_conf 38998508888532 Q gi|254780814|r 499 KIIIMTDADVDGAH 512 (803) Q Consensus 499 kiiimtDaD~DG~H 512 (803) +..|.||.|....+ T Consensus 57 ~~~vi~D~D~~~~~ 70 (97) T cd01026 57 PVAVLTDLDAKRNE 70 (97) T ss_pred CEEEEECCCCCCCC T ss_conf 38999718987465 No 88 >TIGR01925 spIIAB anti-sigma F factor; InterPro: IPR010194 This entry describes the SpoIIAB anti-sigma F factor. Sigma F regulates spore development in B subtilis. SpoIIAB binds to sigma F, preventing formation of the transcription complex at the promoter . SpoIIAA (anti-anti-sigma F factor) binds to SpoIIAB to inhibit association with sigma F, however SpoIIAB can phosphorylate SpoIIAA, causing disassociation of the SpoIIAA/B complex . The SpoIIE phosphatase dephosphorylates SpoIIAA.; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0016989 sigma factor antagonist activity, 0006468 protein amino acid phosphorylation, 0030435 sporulation, 0045892 negative regulation of transcription DNA-dependent. Probab=66.04 E-value=4 Score=20.09 Aligned_cols=81 Identities=30% Similarity=0.344 Sum_probs=45.9 Q ss_pred HHHHHHHHHHHHH---HHHCCCCEEEEEECCC-CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEEC Q ss_conf 4565675557886---8607797579998389-83999976844536407778850012564250113232898604831 Q gi|254780814|r 43 HMIYEVLDNAIDE---SLEGYADIITVTLNMD-GSCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKSYKVSG 118 (803) Q Consensus 43 hlv~EiidNsvDe---~~~g~~~~I~V~i~~d-gsisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~yk~sg 118 (803) -.|-|.|.|||=. .-.+-.=.|.++|..+ =+|+|.|+|-||-= +- + |-|=.|| +|-+= T Consensus 42 ~~VSEAVTNAIIHGYE~~~~~~V~I~~~~~d~~~~~~v~D~G~GI~~-lE-~----A~~PLyT-----skPeL------- 103 (137) T TIGR01925 42 TAVSEAVTNAIIHGYEENKEGVVIISATIEDDEVSITVRDEGIGIEN-LE-E----AREPLYT-----SKPEL------- 103 (137) T ss_pred HHHHHHHHCCEEECEEECCCCEEEEEEEEECCEEEEEEEECCCCHHH-HH-H----HCCCCCC-----CCCCC------- T ss_conf 33322321205314563799778999996054899998646757233-78-5----3266457-----99872------- Q ss_pred CCCCCEEEEEEECCCEEEEEEEE Q ss_conf 16752037786326449999997 Q gi|254780814|r 119 GLHGVGVSVVNALSSWLKLRIKR 141 (803) Q Consensus 119 GlhGvG~s~vNalS~~~~v~v~r 141 (803) -|-|.|-.+---|=..+.|+..+ T Consensus 104 ERSGMGFTvME~FMD~~~v~S~~ 126 (137) T TIGR01925 104 ERSGMGFTVMESFMDDVEVDSEK 126 (137) T ss_pred CCCCCCEEEECCCCCCEEEEECC T ss_conf 20678601211124512686238 No 89 >TIGR00032 argG argininosuccinate synthase; InterPro: IPR001518 Argininosuccinate synthase (6.3.4.5 from EC) (AS) is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate , . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterised by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity.; GO: 0004055 argininosuccinate synthase activity, 0005524 ATP binding, 0006526 arginine biosynthetic process. Probab=64.76 E-value=0.81 Score=25.48 Aligned_cols=34 Identities=18% Similarity=0.169 Sum_probs=22.0 Q ss_pred EEECCCCCCCC--------CCEEH-HHHHHHHHHHHHHHHHCCCC Q ss_conf 22246216667--------72103-56888999999999862420 Q gi|254780814|r 275 CFTNNIPQKDG--------GTHLS-GLRSALTRQITSYVEKSFGS 310 (803) Q Consensus 275 SfvN~I~T~~G--------GtHv~-gf~~al~~~i~~y~~~~~~~ 310 (803) .|-.|+|-.-- |.+++ .+ =|....|+.+.+|++= T Consensus 231 ~Fe~GvPVaLnytdnktskg~~~~~~v--eLi~~aNeiaG~HGvG 273 (420) T TIGR00032 231 DFEKGVPVALNYTDNKTSKGVSLEKPV--ELILKANEIAGKHGVG 273 (420) T ss_pred EECCCCCEEECCCCCCCCCCCCCCCHH--HHHHHHHHHHCCCCCC T ss_conf 623785323015546443773306378--9999999851007886 No 90 >cd01028 TOPRIM_TopoIA TOPRIM_TopoIA: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in the type IA family of DNA topoisomerases (TopoIA). This subgroup contains proteins similar to the Type I DNA topoisomerases: E. coli topisomerases I and III, eukaryotic topoisomerase III and, ATP-dependent reverse gyrase found in archaea and thermophilic bacteria. Type IA DNA topoisomerases remove (relax) negative supercoils in the DNA. These enzymes cleave one strand of the DNA duplex, covalently link to the 5' phosphoryl end of the DNA break and allow the other strand of the duplex to pass through the gap. Reverse gyrase is also able to insert positive supercoils in the presence of ATP and negative supercoils in the presence of AMPPNP. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). For topoisomerases the conserved glutamate is believed to act as a general b Probab=63.26 E-value=4.9 Score=19.41 Aligned_cols=27 Identities=26% Similarity=0.390 Sum_probs=23.0 Q ss_pred CCCCEEEEECCCCCCCCHHHHHHHHHH Q ss_conf 737389985088885328999999999 Q gi|254780814|r 496 RYHKIIIMTDADVDGAHIRTLLLTFFF 522 (803) Q Consensus 496 rY~kiiimtDaD~DG~HI~~Llltff~ 522 (803) +...|||+||+|--|-+|.-.++..+- T Consensus 90 ~~d~iiiAtD~DrEGE~I~~~i~~~~~ 116 (142) T cd01028 90 KADEIVLATDPDREGELIAWEILEVLK 116 (142) T ss_pred HCCEEEECCCCCCCHHHHHHHHHHHHC T ss_conf 199999888998227999999999957 No 91 >pfam09707 Cas_Cas2CT1978 CRISPR-associated protein (Cas_Cas2CT1978). This entry represents a minor branch of the Cas2 family of CRISPR-associated protein which are found in IPR003799. Cas proteins are found adjacent to a characteristic short, palindromic repeat cluster termed CRISPR, a probable mobile DNA element. Probab=61.53 E-value=10 Score=16.95 Aligned_cols=28 Identities=21% Similarity=0.572 Sum_probs=22.2 Q ss_pred CCCCCEECCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 458715432799632544565675557886 Q gi|254780814|r 26 KRPSMYIGDTDGESGLHHMIYEVLDNAIDE 55 (803) Q Consensus 26 kRP~mYiG~t~~~~Gl~hlv~EiidNsvDe 55 (803) -|||.|+|+.+ ......+||.+...+.+ T Consensus 24 v~~GVyVg~~S--~rVRd~lW~~v~~~~~~ 51 (86) T pfam09707 24 VRAGVYVGDVS--ARVRERLWEKVTEGIGD 51 (86) T ss_pred CCCCCEECCCC--HHHHHHHHHHHHHHCCC T ss_conf 27876788988--78999999999966789 No 92 >pfam09239 Topo-VIb_trans Topoisomerase VI B subunit, transducer. Members of this family adopt a structure consisting of a four-stranded beta-sheet backed by three alpha-helices, the last of which is over 50 amino acids long and extends from the body of the protein by several turns. This domain has been proposed to mediate intersubunit communication by structurally transducing signals from the ATP binding and hydrolysis domains to the DNA binding and cleavage domains of the gyrase holoenzyme. Probab=61.47 E-value=11 Score=16.83 Aligned_cols=80 Identities=18% Similarity=0.332 Sum_probs=50.1 Q ss_pred EEEEEEEEEECCCC----CCEEEEEECCCCC-CCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCEEEE Q ss_conf 69997334531775----4121222246216-667721035688899999999986242000123576134305721566 Q gi|254780814|r 256 NISIDLAMRWNDGY----HENVLCFTNNIPQ-KDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSVIGDDCREGFTSVL 330 (803) Q Consensus 256 ~~~veval~~~~~~----~e~i~SfvN~I~T-~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~~~~Diregl~~vi 330 (803) ...||+++.|...- .-+++=|+|-||- -+.|.- ++|+++.+. . .+..++...++ .-++.+| T Consensus 41 PF~VEvgiayGG~~~~~~~~~vlRFANRVPLly~~g~c------~itkav~~i-~----W~~Ygl~q~~~---~P~~i~V 106 (160) T pfam09239 41 PFIVEVGIAYGGDIPPEGKPELLRFANRVPLLYDQGAC------VITKAVESI-N----WKRYGLEQPGG---GPLVILV 106 (160) T ss_pred CEEEEEEEECCCCCCCCCCEEEEEEECCCCEEEECCCC------HHHHHHHHC-C----HHHCCCCCCCC---CCEEEEE T ss_conf 87999860016878888972488860567645515743------899999876-7----33126568999---9889999 Q ss_pred EEECCCCCCCCCCHHHHCC Q ss_conf 6521376335721002026 Q gi|254780814|r 331 SIKMPDPRFSSQTKEKLVS 349 (803) Q Consensus 331 svki~nP~F~gQTK~kL~s 349 (803) ++-.++-.|.|-.|+-+.+ T Consensus 107 hvaSt~VPfts~sKeaIa~ 125 (160) T pfam09239 107 HVASTKVPFTSEGKEAIAD 125 (160) T ss_pred EEEECCCCCCCCCHHHHCC T ss_conf 9610477856620544137 No 93 >PTZ00233 variable surface protein Vir18; Provisional Probab=60.66 E-value=3.5 Score=20.51 Aligned_cols=55 Identities=29% Similarity=0.417 Sum_probs=33.5 Q ss_pred CCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEE-EECCCEEEEECHHHHHHHHHHHC Q ss_conf 67373899850888853289999999996307787489599943833788-72671799814578989998500 Q gi|254780814|r 495 LRYHKIIIMTDADVDGAHIRTLLLTFFFRQMPSLIHNGFLYIIKPPLYGI-TRGKSLQYVKDEESLEDYLINQS 567 (803) Q Consensus 495 lrY~kiiimtDaD~DG~HI~~Llltff~~~~~~li~~g~~yia~ppl~~~-~~~K~E~YikDD~EL~~~Ll~~g 567 (803) +-+.+=||+|=|=. -| -|||+|+|+| +||.|| +++|+-.=..=-+||+.-|.+-- T Consensus 429 Iqnn~~IIktSaPm---GI-VLLLgLLFKy--------------TPLWrvLTKknRKk~a~inEeLnsVLQePs 484 (509) T PTZ00233 429 IQSNKNIIKTSMPI---GI-ALLLGLLFKY--------------TPLWRVLTKKNRKKGAGINEELNSVLQEPS 484 (509) T ss_pred HCCCCCEEECCCHH---HH-HHHHHHHHCC--------------CHHHHHHHHCCCCCCCCCCHHHHHHHHCCC T ss_conf 24662000015018---99-9999988622--------------623665511143555670488887760676 No 94 >pfam07994 NAD_binding_5 Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play an important role in signal transduction. Probab=60.11 E-value=4.2 Score=19.95 Aligned_cols=75 Identities=20% Similarity=0.299 Sum_probs=43.9 Q ss_pred CCHHHHHHHCCCCCCCCCCCEEEEEEEEEEEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHC Q ss_conf 83577653003565568852036420036999733453177541212222462166677210356888999999999862 Q gi|254780814|r 228 GIEAFVSYLDRHKKPLMQSPIRIQGSRDNISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKS 307 (803) Q Consensus 228 Gi~~fv~~l~~~~~~~~~~~i~~~~~~~~~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~ 307 (803) .+..|..-+..+...+.+..+|-. .|+. +. .+|||++|.. +..+-++ .+. T Consensus 157 t~~~~~~~~~~~~~~i~aS~~YA~---------AAl~------~G-~~fvN~~P~~--~a~~pal--------~el---- 206 (389) T pfam07994 157 TAENLLKAIKNNHEEESASTLYAY---------AALL------AG-CAFINGSPQN--TASVPGL--------IEL---- 206 (389) T ss_pred CHHHHHHHHHCCCCCCCHHHHHHH---------HHHH------CC-CCEECCCCCC--CCCCHHH--------HHH---- T ss_conf 199999876548865666899999---------9997------49-9745265531--4685999--------999---- Q ss_pred CCCCCCCCCCCHHHHHCCCEEEEEEEC Q ss_conf 420001235761343057215666521 Q gi|254780814|r 308 FGSKKDKRSVIGDDCREGFTSVLSIKM 334 (803) Q Consensus 308 ~~~Kk~k~~~~~~Diregl~~visvki 334 (803) .++..+-+.|+|++-|.++..++.. T Consensus 207 --a~~~gvpi~GdD~KTg~t~~ks~la 231 (389) T pfam07994 207 --AEEKGLPIAGDDFKSGQTKTKSVLA 231 (389) T ss_pred --HHHCCCCCCCCCCCCCCCEEEHHHH T ss_conf --9965998216401158750002456 No 95 >COG2972 Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms] Probab=55.98 E-value=13 Score=16.17 Aligned_cols=80 Identities=19% Similarity=0.151 Sum_probs=33.9 Q ss_pred CCCCCCCCCCCEEEEEC-CCCCC--CCHHHHHHHHHHHHHHHHHHCCEEEE---ECCC-EEEEEECCCEEEEECHHHHHH Q ss_conf 32123567373899850-88885--32899999999963077874895999---4383-378872671799814578989 Q gi|254780814|r 489 SFDINKLRYHKIIIMTD-ADVDG--AHIRTLLLTFFFRQMPSLIHNGFLYI---IKPP-LYGITRGKSLQYVKDEESLED 561 (803) Q Consensus 489 ~~~~~~lrY~kiiimtD-aD~DG--~HI~~Llltff~~~~~~li~~g~~yi---a~pp-l~~~~~~K~E~YikDD~EL~~ 561 (803) +..+.++||+--+=-.+ =|.+- ..|=.++| -||+|+-..+- -+|+ .-.+...++..++ T Consensus 323 yl~iq~~r~~~~le~~~~i~~~~~~l~~p~l~l-------qpLvENAi~hgi~~~~~~~~I~i~~~~~~~~i-------- 387 (456) T COG2972 323 YLEIQKLRIGDRLEVPLPIDEELEPLIDPKLVL-------QPLVENAIEHGIEPKRPGGSIAISAKKQDDVI-------- 387 (456) T ss_pred HHHHHHCCCCEEEEEEECCCCCCCCCCCHHHHH-------HHHHHHHHHHHHCCCCCCCEEEEEEECCCCEE-------- T ss_conf 999998199879999720375332023278898-------88799999986204699957999994058799-------- Q ss_pred HHHHHCCCCHHCCCCCCCCCCCHHHHHHHHHH Q ss_conf 99850011000014577654354689999999 Q gi|254780814|r 562 YLINQSLSEEIELFCDSKGVISKNDLRNFIDD 593 (803) Q Consensus 562 ~Ll~~gl~~~~l~~~~~~~~l~ge~L~~li~~ 593 (803) .+...+++..+..+.+..+... T Consensus 388 ----------~i~i~Dng~g~~~~~~~~~~~~ 409 (456) T COG2972 388 ----------QISISDNGPGIDEEKLEGLSTK 409 (456) T ss_pred ----------EEEECCCCCCCCHHHHHHHHHC T ss_conf ----------9997769999587888667641 No 96 >TIGR01320 mal_quin_oxido malate:quinone-oxidoreductase; InterPro: IPR006231 The membrane-associated enzyme, malate:quinone-oxidoreductase, is an alternative to the better-known NAD-dependent malate dehydrogenase as part of the TCA cycle. The reduction of a quinone rather than NAD+ makes the reaction essentially irreversible in the direction of malate oxidation to oxaloacetate. Both forms of malate dehydrogenase are active in E. coli; disruption of this form causes less phenotypic change. In some bacteria, this form is the only or the more important malate dehydrogenase . ; GO: 0008924 malate dehydrogenase (acceptor) activity, 0006099 tricarboxylic acid cycle. Probab=55.83 E-value=6.8 Score=18.32 Aligned_cols=33 Identities=33% Similarity=0.578 Sum_probs=24.1 Q ss_pred CCCCCEEEEECCCCCCCCHHHHH-------------HHHHHHHHHHHHHC Q ss_conf 67373899850888853289999-------------99999630778748 Q gi|254780814|r 495 LRYHKIIIMTDADVDGAHIRTLL-------------LTFFFRQMPSLIHN 531 (803) Q Consensus 495 lrY~kiiimtDaD~DG~HI~~Ll-------------ltff~~~~~~li~~ 531 (803) |-+|--+| ||.||| |++|| |-+..|.||+=... T Consensus 406 L~FGT~~v---~~~dGs-iAaLLGASPGAS~Avs~Ml~ll~~CF~~r~~s 451 (487) T TIGR01320 406 LEFGTELV---ADADGS-IAALLGASPGASTAVSIMLELLERCFPERARS 451 (487) T ss_pred CCCHHEEE---CCCCCH-HHHHHCCCCCHHHHHHHHHHHHHHCCCCCCCC T ss_conf 12001113---056732-78873679842345899999997505466878 No 97 >cd03363 TOPRIM_TopoIA_TopoI TOPRIM_TopoIA_TopoI: The topoisomerase-primase (TORPIM) domain found in members of the type IA family of DNA topoisomerases (Topo IA) similar to Escherichia coli DNA topoisomerase I. Type IA DNA topoisomerases remove (relax) negative supercoils in the DNA by: cleaving one strand of the DNA duplex, covalently linking to the 5' phosphoryl end of the DNA break and, allowing the other strand of the duplex to pass through the gap. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). For topoisomerases the conserved glutamate is believed to act as a general base in strand joining and, as a general acid in strand cleavage. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=55.82 E-value=8.5 Score=17.56 Aligned_cols=25 Identities=28% Similarity=0.352 Sum_probs=21.0 Q ss_pred CCCEEEEECCCCCCCCHHHHHHHHH Q ss_conf 3738998508888532899999999 Q gi|254780814|r 497 YHKIIIMTDADVDGAHIRTLLLTFF 521 (803) Q Consensus 497 Y~kiiimtDaD~DG~HI~~Llltff 521 (803) -..|+|.||+|--|..|.-.++.++ T Consensus 73 ad~viiAtD~DrEGE~I~~~i~~~~ 97 (123) T cd03363 73 ADEIYLATDPDREGEAIAWHLAEVL 97 (123) T ss_pred CCEEEECCCCCCCHHHHHHHHHHHH T ss_conf 9989984899824799999999995 No 98 >cd00823 TopoIIB_Trans TopoIIB_Trans: Transducer domain, having a ribosomal S5 domain 2-like fold, of the type found in proteins of the type IIB family of DNA topoisomerases similar to Sulfolobus shibatae topoisomerase VI (topoVI). The sole representative of the Type IIB family is topo VI. Topo VI enzymes are heterotetramers found in archaea and plants. S. shibatae topoVI relaxes both positive and negative supercoils, and in addition has a strong decatenase activity. This transducer domain is homologous to the second domain of the DNA gyrase B subunit, which is known to be important in nucleotide hydrolysis and the transduction of structural signals from ATP-binding site to the DNA breakage/reunion regions of the enzymes. Probab=54.25 E-value=14 Score=15.96 Aligned_cols=80 Identities=18% Similarity=0.325 Sum_probs=49.3 Q ss_pred EEEEEEEEEECCCC----CCEEEEEECCCCC-CCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHCCCEEEE Q ss_conf 69997334531775----4121222246216-667721035688899999999986242000123576134305721566 Q gi|254780814|r 256 NISIDLAMRWNDGY----HENVLCFTNNIPQ-KDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSVIGDDCREGFTSVL 330 (803) Q Consensus 256 ~~~veval~~~~~~----~e~i~SfvN~I~T-~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~~~~Diregl~~vi 330 (803) ...||+++.|...- .-+++=|+|-||- -+.|.- ++|+++.+. + .+..+++..+ ..-++.+| T Consensus 32 PF~VEvgiayGG~~~~~~~~~vlRFANRVPLly~~g~c------~itkav~~i----n-W~~Ygl~q~~---~~p~~i~V 97 (151) T cd00823 32 PFIVEVGIAYGGDIPADEKVELLRFANRVPLLYDAGAC------VITKAVESI----N-WKRYGLEQPG---QGPLVVLV 97 (151) T ss_pred CEEEEEEEECCCCCCCCCCCEEEEEECCCCEEEECCCC------HHHHHHHHC----C-CHHCCCCCCC---CCCEEEEE T ss_conf 87999860006878888871588860567646515752------689998856----8-1104657899---88879999 Q ss_pred EEECCCCCCCCCCHHHHCC Q ss_conf 6521376335721002026 Q gi|254780814|r 331 SIKMPDPRFSSQTKEKLVS 349 (803) Q Consensus 331 svki~nP~F~gQTK~kL~s 349 (803) ++-.++-.|.|-.|+-+.+ T Consensus 98 hvaSt~VPfts~sKeaIa~ 116 (151) T cd00823 98 HVASTKVPFTSEGKEAIAD 116 (151) T ss_pred EEEECCCCCCCCCHHHHCC T ss_conf 9600467856620544137 No 99 >TIGR01290 nifB nitrogenase cofactor biosynthesis protein NifB; InterPro: IPR005980 NifB is a protein required for the biosynthesis of the iron-molybdenum (or iron-vanadium) cofactor used by the nitrogen-fixing enzyme nitrogenase. Archaeal homologs lack most of the C-terminal region and are excluded from this model.; GO: 0009108 coenzyme biosynthetic process, 0009399 nitrogen fixation. Probab=51.62 E-value=9.6 Score=17.15 Aligned_cols=10 Identities=20% Similarity=0.125 Sum_probs=4.3 Q ss_pred EEEECCCCCC Q ss_conf 2222462166 Q gi|254780814|r 274 LCFTNNIPQK 283 (803) Q Consensus 274 ~SfvN~I~T~ 283 (803) +.=||+|--| T Consensus 189 L~KvNSvlIP 198 (461) T TIGR01290 189 LVKVNSVLIP 198 (461) T ss_pred EEEECCEECC T ss_conf 4888006438 No 100 >KOG2672 consensus Probab=50.73 E-value=5 Score=19.37 Aligned_cols=34 Identities=21% Similarity=0.252 Sum_probs=24.5 Q ss_pred HHHHHHHHHHH-CCEEEEECCCEEEEEECCCEEEE Q ss_conf 99963077874-89599943833788726717998 Q gi|254780814|r 520 FFFRQMPSLIH-NGFLYIIKPPLYGITRGKSLQYV 553 (803) Q Consensus 520 ff~~~~~~li~-~g~~yia~ppl~~~~~~K~E~Yi 553 (803) =+|.||.+.=+ -|++|.|.+||.+-.++-.|.|+ T Consensus 310 ekf~~w~~~~~~lgf~y~AsgplvrSsykage~~i 344 (360) T KOG2672 310 EKFDYWKEYGEELGFLYVASGPLVRSSYKAGEYFI 344 (360) T ss_pred HHHHHHHHHHHHCCEEEECCCCEEECHHHHHHHHH T ss_conf 88789999755145489636854422111037899 No 101 >PRK13790 phosphoribosylamine--glycine ligase; Provisional Probab=50.50 E-value=15 Score=15.54 Aligned_cols=10 Identities=30% Similarity=0.358 Sum_probs=7.1 Q ss_pred CCCCEEEEEC Q ss_conf 5443023300 Q gi|254780814|r 444 SRENQAILPL 453 (803) Q Consensus 444 dr~~qai~PL 453 (803) |++.|.|||+ T Consensus 288 DPE~q~iLp~ 297 (415) T PRK13790 288 DPEAQVLLSR 297 (415) T ss_pred CCHHHHHHHH T ss_conf 9228889876 No 102 >TIGR01924 rsbW_low_gc anti-sigma B factor; InterPro: IPR010193 This entry describes the anti-sigma B factor also known as serine-protein kinase RsbW. Sigma B controls the general stress regulon in B subtilis and is activated by cell stresses such as stationary phase and heat shock. RsbW binds to sigma B and prevents formation of the transcription complex at the promoter . RsbV (anti-anti-sigma factor) binds to RsbW to inhibit association with sigma B, however RsbW can phosphorylate RsbV, causing disassociation of the RsbV/RsbW complex . Low ATP level or environmental stress causes the dephosphorylation of RsbV .; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0016989 sigma factor antagonist activity, 0006468 protein amino acid phosphorylation, 0045892 negative regulation of transcription DNA-dependent. Probab=50.29 E-value=11 Score=16.79 Aligned_cols=88 Identities=22% Similarity=0.339 Sum_probs=48.4 Q ss_pred HHHHHHHHHHHHHHHCCC--CEEEEEEC--CCC-EEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCC--CEE Q ss_conf 565675557886860779--75799983--898-39999768445364077788500125642501132328986--048 Q gi|254780814|r 44 MIYEVLDNAIDESLEGYA--DIITVTLN--MDG-SCTVSDNGRGIPTDLHKEEGISAAEVIMTRLHAGGKFNQKS--YKV 116 (803) Q Consensus 44 lv~EiidNsvDe~~~g~~--~~I~V~i~--~dg-sisV~dnGrGIPv~~h~~~~~~~~E~i~t~L~ag~kfd~~~--yk~ 116 (803) =|-|.+.|||+.|--+-- ..|.|..+ .|. .|-|.|+|.--..|-.+..= |=||.+. ... T Consensus 47 AVSEA~TN~V~HAYke~~nvG~~~i~F~iyEDklev~vsD~G~SFD~d~~k~~l--------------GPyd~~~~i~~L 112 (161) T TIGR01924 47 AVSEACTNAVKHAYKEEENVGEISIEFEIYEDKLEVIVSDEGDSFDLDTKKQEL--------------GPYDKSEDIDQL 112 (161) T ss_pred HHHHHHCCCEEEEECCCCCEEEEEEEEEEEEEEEEEEEECCCCCCCHHHHHHHC--------------CCCCCCCCHHHH T ss_conf 665553033112230898367899888788757899982078641110025325--------------897957875432 Q ss_pred ECCCCCCEEEEEEECCCEEEEEEEECCEEEEE Q ss_conf 31167520377863264499999978959999 Q gi|254780814|r 117 SGGLHGVGVSVVNALSSWLKLRIKREGNIYEM 148 (803) Q Consensus 117 sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q 148 (803) +- =|+|.=|.|-|=...+|.-. .|-.-.| T Consensus 113 ~e--GGLGLFLietLMD~V~~~~~-sGV~v~M 141 (161) T TIGR01924 113 RE--GGLGLFLIETLMDEVKVYED-SGVTVAM 141 (161) T ss_pred HC--CCCHHHHHHCCCCEEEEEEC-CCEEEEE T ss_conf 01--77136645103761699832-8608988 No 103 >PRK00885 phosphoribosylamine--glycine ligase; Provisional Probab=49.18 E-value=16 Score=15.40 Aligned_cols=10 Identities=40% Similarity=0.634 Sum_probs=5.5 Q ss_pred CCCCEEEEEC Q ss_conf 5443023300 Q gi|254780814|r 444 SRENQAILPL 453 (803) Q Consensus 444 dr~~qai~PL 453 (803) |++.|.|||| T Consensus 290 DPE~qvllp~ 299 (424) T PRK00885 290 DPETQVVLPR 299 (424) T ss_pred CHHHHHHHHH T ss_conf 8588899875 No 104 >TIGR00644 recJ single-stranded-DNA-specific exonuclease RecJ; InterPro: IPR004610 All proteins in this family are 5'-3' single-strand DNA exonucleases. These proteins are used in some aspects of mismatch repair, recombination, and recombinational repair. RecJ is a single-stranded DNA-specific endonuclease which is required for many types of recombination events, although the stringency of the requirement for RecJ appears to vary with the type of recombinational event monitored, and the other recombination gene products which are available.; GO: 0008409 5'-3' exonuclease activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=49.13 E-value=16 Score=15.39 Aligned_cols=112 Identities=27% Similarity=0.358 Sum_probs=56.0 Q ss_pred CCCHHHCEE----CCCCCHHC--CCCCCEECCC-CCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEE-----E Q ss_conf 678655123----13542000--4587154327-9963254456567555788686077975799983898399-----9 Q gi|254780814|r 9 SYDADSIQI----LKGLDAVK--KRPSMYIGDT-DGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDGSCT-----V 76 (803) Q Consensus 9 ~Y~~~~i~~----L~gle~vr--kRP~mYiG~t-~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgsis-----V 76 (803) +||+|=|.. -+.|++.= ..+..||=.. ..+.|+-+ |++++..+ .+..+-||+ |+.|+ - T Consensus 63 DyDvDGiTSt~~l~~~L~~lGp~~~~~~~IPnR~~egYG~~~---~~~~~~~~-----~G~~LiiTV--D~Gi~a~~e~~ 132 (705) T TIGR00644 63 DYDVDGITSTAILVEFLRELGPYVQVDYYIPNRFTEGYGLSP---EALRELKE-----NGVSLIITV--DNGISAHEEIE 132 (705) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCH---HHHHHHHH-----CCCEEEEEC--CCCHHHHHHHH T ss_conf 175226899999999999509973246662663577778898---99999986-----698399982--68742699999 Q ss_pred EECCCCCCC---CCC--CCCCCCEEEEEEEEECCCCCCCCCCCEEECCCCCCEE--EEEEECCCEEE Q ss_conf 976844536---407--7788500125642501132328986048311675203--77863264499 Q gi|254780814|r 77 SDNGRGIPT---DLH--KEEGISAAEVIMTRLHAGGKFNQKSYKVSGGLHGVGV--SVVNALSSWLK 136 (803) Q Consensus 77 ~dnGrGIPv---~~h--~~~~~~~~E~i~t~L~ag~kfd~~~yk~sgGlhGvG~--s~vNalS~~~~ 136 (803) .=+=+||+| |=| +.+..|....|---=+-+.+|-.+. |=|||+ +++-||++.+. T Consensus 133 ~a~~~G~dVIVtDHHC~~~~~lP~a~AivNP~~~~~~yP~k~------LaGvGVaFkl~~AL~~~~~ 193 (705) T TIGR00644 133 YAKELGIDVIVTDHHCLPPETLPEAAAIVNPKRPDCNYPFKE------LAGVGVAFKLVTALLEELR 193 (705) T ss_pred HHHHCCCEEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCHH------HHHHHHHHHHHHHHHHHCC T ss_conf 998669819997256798489986505771675889988314------6788999999999997333 No 105 >PRK13529 malate dehydrogenase; Provisional Probab=48.67 E-value=13 Score=15.99 Aligned_cols=46 Identities=28% Similarity=0.484 Sum_probs=21.9 Q ss_pred CCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCE-E-EEEE---CCCCCCCC Q ss_conf 715432799632544565675557886860779757999838983-9-9997---68445364 Q gi|254780814|r 29 SMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDGS-C-TVSD---NGRGIPTD 86 (803) Q Consensus 29 ~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgs-i-sV~d---nGrGIPv~ 86 (803) |||| |.++ .| | +.|++.|.-. ..|++.+-.||+ | .+-| ||=||||+ T Consensus 124 Glyi-s~~d-~g--~-i~~il~nwp~-------~~V~~IVVTDGerILGlGDlG~~GmgI~iG 174 (563) T PRK13529 124 GLFI-SYPD-RD--R-IDDILRNAPN-------RDIKLIVVTDGERILGLGDQGIGGMGIPIG 174 (563) T ss_pred EEEE-ECCC-HH--H-HHHHHHHCCC-------CCCEEEEEECCCCCCCCCCCCCCCCCCCHH T ss_conf 4896-0266-66--8-9999970885-------670499984673002457656787765143 No 106 >KOG0308 consensus Probab=47.34 E-value=17 Score=15.19 Aligned_cols=30 Identities=17% Similarity=0.129 Sum_probs=19.2 Q ss_pred HHHHHHCCCCEEEEEECCCCEEEEEECCCCC Q ss_conf 8868607797579998389839999768445 Q gi|254780814|r 53 IDESLEGYADIITVTLNMDGSCTVSDNGRGI 83 (803) Q Consensus 53 vDe~~~g~~~~I~V~i~~dgsisV~dnGrGI 83 (803) =|-++.|..+ .-|.-..|-+|-|||--++- T Consensus 77 NDiiL~~~~~-tlIS~SsDtTVK~W~~~~~~ 106 (735) T KOG0308 77 NDIILCGNGK-TLISASSDTTVKVWNAHKDN 106 (735) T ss_pred HHHHHHCCCC-CEEEECCCCEEEEEECCCCC T ss_conf 6677545888-56984178558996335586 No 107 >PRK04017 hypothetical protein; Provisional Probab=47.26 E-value=17 Score=15.18 Aligned_cols=28 Identities=25% Similarity=0.388 Sum_probs=23.8 Q ss_pred CCCEEEEECCCCCCCCHHHHHHHHHHHH Q ss_conf 3738998508888532899999999963 Q gi|254780814|r 497 YHKIIIMTDADVDGAHIRTLLLTFFFRQ 524 (803) Q Consensus 497 Y~kiiimtDaD~DG~HI~~Llltff~~~ 524 (803) |++||||||=|--|.-++.-|...|-.+ T Consensus 65 ~~~vIlL~DwDr~G~~L~~kl~~~L~~~ 92 (132) T PRK04017 65 KKEVIILTDFDRKGEELAKKLSEYLQGY 92 (132) T ss_pred CCEEEEEEECCCCHHHHHHHHHHHHHHC T ss_conf 9839999713632179999999999868 No 108 >PTZ00306 NADH-dependent fumarate reductase; Provisional Probab=46.70 E-value=11 Score=16.74 Aligned_cols=46 Identities=15% Similarity=0.091 Sum_probs=18.3 Q ss_pred CCHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHCCCCHHHCEEEEEE Q ss_conf 99899999999863067628603777788955722303896134021236 Q gi|254780814|r 717 TGPCSLLDTIFSIGRKGISMQRYKGLGEMNASQLWETTLNPDTRSLLRVK 766 (803) Q Consensus 717 ~s~~el~~~i~~~gkKG~sIQRYKGLGEMNPeQLWETTMDP~tRtLLqV~ 766 (803) -..+++++....++...+.- ..==-||..+|---.+--.|.+||.. T Consensus 1077 ~~~r~~l~~~~~~~~~~f~~----~f~~~~p~~~w~~~v~~~~~~~~~~~ 1122 (1167) T PTZ00306 1077 LTYRELLESYRKENPGKFKC----HFVLNNPPEGWTDGVGFVDRALLQSA 1122 (1167) T ss_pred CHHHHHHHHHHHHCCCCEEE----EEEECCCCCCCCCCCCCHHHHHHHHC T ss_conf 20999999988758876478----99806998554365440006878624 No 109 >TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321 These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport. Probab=44.67 E-value=19 Score=14.90 Aligned_cols=190 Identities=16% Similarity=0.246 Sum_probs=90.4 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEECCCCE-EEEEECCCCC-CCCCCCCCCCCEEEE---EEEEECCC--CCCC-CCCCEE Q ss_conf 65675557886860779757999838983-9999768445-364077788500125---64250113--2328-986048 Q gi|254780814|r 45 IYEVLDNAIDESLEGYADIITVTLNMDGS-CTVSDNGRGI-PTDLHKEEGISAAEV---IMTRLHAG--GKFN-QKSYKV 116 (803) Q Consensus 45 v~EiidNsvDe~~~g~~~~I~V~i~~dgs-isV~dnGrGI-Pv~~h~~~~~~~~E~---i~t~L~ag--~kfd-~~~yk~ 116 (803) +.|+|--|++. -++-|-+.. |+ --++.||.=- |.+.-|-..-..-++ ||++-|.- -.|. +.+.-+ T Consensus 2 l~~ll~~~~~~----~aSD~HL~a---G~PP~~R~~G~~~~~~~~~plt~~~~~~l~~~~l~~th~~~~~~f~~~~E~Df 74 (350) T TIGR01420 2 LEELLREAVKK----GASDIHLSA---GLPPAIRIDGDLRTRIGFEPLTPEDTQKLLREILSSTHEKQREEFEENGELDF 74 (350) T ss_pred HHHHHHHHHHC----CCCEEEEEC---CCCEEEEECCCEECCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCE T ss_conf 67889999848----997488503---77403754783103446789898999999998638456577750565066444 Q ss_pred ECCCCCCEEEEEEECCCEEEEEEEECCEEEEEEEECCEECCCCEEECCCCCCCCEEEEEEECHHHHHHHHCCHHHHHHHH Q ss_conf 31167520377863264499999978959999988983646313873277888708999877888542100089999999 Q gi|254780814|r 117 SGGLHGVGVSVVNALSSWLKLRIKREGNIYEMSFINGILDNPLVVTGSAGNDTGTEVTFLPSSDIFSVQDFNYDTLQHRL 196 (803) Q Consensus 117 sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~F~PD~~iF~~~~~~~d~l~~R~ 196 (803) |=++-|+| +|.|.++....-+-+-| ..+.. =+|-++-.+.. -++ + T Consensus 75 s~~~~~~~---------RfRvN~f~QRg~~a~vl---------R~ip~----------~Ip~fe~LGLP---~~v----~ 119 (350) T TIGR01420 75 SFSLPGLA---------RFRVNAFKQRGGVALVL---------RLIPS----------KIPTFEELGLP---RPV----L 119 (350) T ss_pred EEEECCCC---------EEEEHHHHHCCHHHHHH---------HHCCC----------CCCCHHHCCCC---HHH----H T ss_conf 66306732---------21220323500064232---------31153----------46216663798---789----9 Q ss_pred HHHHCCCCEEEEEEEECCCCCCCEEEECCCCCCHHHHHHHCCCCCCCCCCCEEEEEEEEEEEEEEEEEECCCCCCEEEEE Q ss_conf 97311464169999963586421010012688357765300356556885203642003699973345317754121222 Q gi|254780814|r 197 RELSFLNSSVQISLIDKRSPEPREITMFYKGGIEAFVSYLDRHKKPLMQSPIRIQGSRDNISIDLAMRWNDGYHENVLCF 276 (803) Q Consensus 197 ~~~A~Ln~gl~i~l~Der~~~~~~~~f~~~~Gi~~fv~~l~~~~~~~~~~~i~~~~~~~~~~veval~~~~~~~e~i~Sf 276 (803) +++|---.||=+ ++.- ++.+|..+ |++.++|+|+++.. .++.++.. +|+-+ .+..|+ T Consensus 120 ~~~a~~~~GLiL-VTGP-TGSGKSTT------lAsmIDyIN~~~~~---HIiTIEDP-----IEyvh-------~~~~sl 176 (350) T TIGR01420 120 RELAERPRGLIL-VTGP-TGSGKSTT------LASMIDYINKNKAG---HIITIEDP-----IEYVH-------KNKRSL 176 (350) T ss_pred HHHHHCCCCCEE-EECC-CCCCHHHH------HHHHHHHHHCCCCC---CCEEEECC-----EEEEE-------CCCEEE T ss_conf 999836699389-8768-89867899------99999787403888---82563177-----31410-------477024 Q ss_pred ECCCCCCCCCCEEHHHHHHHHHHHHH Q ss_conf 24621666772103568889999999 Q gi|254780814|r 277 TNNIPQKDGGTHLSGLRSALTRQITS 302 (803) Q Consensus 277 vN~I~T~~GGtHv~gf~~al~~~i~~ 302 (803) +| +++=|.|-.-|-+||-.|+++ T Consensus 177 i~---QREvG~DT~sF~~ALraALRe 199 (350) T TIGR01420 177 IN---QREVGLDTLSFANALRAALRE 199 (350) T ss_pred EE---CCCCCCCHHHHHHHHHHHHCC T ss_conf 54---362467545799997684102 No 110 >PRK05279 N-acetylglutamate synthase; Validated Probab=43.93 E-value=6.1 Score=18.69 Aligned_cols=73 Identities=25% Similarity=0.455 Sum_probs=52.0 Q ss_pred CCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCC----------CCCCCCEEEEECCC----CCCCCHHHHHH Q ss_conf 05620334430587663236678888762578775332123----------56737389985088----88532899999 Q gi|254780814|r 453 LRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDIN----------KLRYHKIIIMTDAD----VDGAHIRTLLL 518 (803) Q Consensus 453 LrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~----------~lrY~kiiimtDaD----~DG~HI~~Lll 518 (803) +-|++-.|-.....+.+++..|- ++.-||.+..-+.||+. .|+=.|+|.|||.+ .+|.-|+.|-. T Consensus 149 ~vG~V~~Vd~~~I~~~L~~g~Ip-visplG~s~~Ge~fNinad~vA~~iA~aL~A~KLI~ltd~~Gi~d~~g~li~~lt~ 227 (441) T PRK05279 149 HTGEVRRIDTEAIRRQLDNGAIV-LLSPLGYSPTGESFNLTMEEVATQVAIALKADKLIFFTESQGILDEDGELIRELSP 227 (441) T ss_pred CCCEEEEECHHHHHHHHHCCCEE-EECCEEECCCCCEECCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCH T ss_conf 54158797389999999789829-98664638999986669999999999976998899852887642899987530799 Q ss_pred HHHHHHHHHHHH Q ss_conf 999963077874 Q gi|254780814|r 519 TFFFRQMPSLIH 530 (803) Q Consensus 519 tff~~~~~~li~ 530 (803) . ...+|++ T Consensus 228 ~----ea~~ll~ 235 (441) T PRK05279 228 N----EAQALLE 235 (441) T ss_pred H----HHHHHHH T ss_conf 9----9999997 No 111 >TIGR02343 chap_CCT_epsi T-complex protein 1, epsilon subunit; InterPro: IPR012718 Members of this eukaryotic family are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT epsilon chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding. Probab=43.10 E-value=14 Score=15.91 Aligned_cols=26 Identities=23% Similarity=0.470 Sum_probs=18.8 Q ss_pred EEECCCCEEEEEECCCCCCCCCCCCC Q ss_conf 99838983999976844536407778 Q gi|254780814|r 66 VTLNMDGSCTVSDNGRGIPTDLHKEE 91 (803) Q Consensus 66 V~i~~dgsisV~dnGrGIPv~~h~~~ 91 (803) +-+.+||.|||+|||--|=-.|.-.+ T Consensus 55 ~l~s~Dgdi~vTNDGATIl~~Md~~~ 80 (541) T TIGR02343 55 MLISPDGDITVTNDGATILEQMDVDN 80 (541) T ss_pred CEECCCCCEEEECCHHHHHHHCCCCC T ss_conf 00178987788526145764307771 No 112 >TIGR01873 cas_CT1978 CRISPR-associated protein Cas2; InterPro: IPR010152 This entry represents a minor branch of the Cas2 family of CRISPR-associated protein which are found in IPR003799 from INTERPRO. Cas proteins are found adjacent to a characteristic short, palindromic repeat cluster termed CRISPR, a probable mobile DNA element.. Probab=42.91 E-value=15 Score=15.72 Aligned_cols=24 Identities=21% Similarity=0.520 Sum_probs=18.6 Q ss_pred CCCCCEEC-CCCCCCHHHHHHHHHHHH Q ss_conf 45871543-279963254456567555 Q gi|254780814|r 26 KRPSMYIG-DTDGESGLHHMIYEVLDN 51 (803) Q Consensus 26 kRP~mYiG-~t~~~~Gl~hlv~EiidN 51 (803) .|||.|+| ..+ .++...|||.|.- T Consensus 24 ~rAgVYv~P~~~--~~vr~~iwd~l~q 48 (91) T TIGR01873 24 VRAGVYVGPRVS--AKVRERIWDELAQ 48 (91) T ss_pred HCCCCEECCCCC--HHHHHHHHHHHCC T ss_conf 178822357626--3678888886512 No 113 >pfam07182 DUF1402 Protein of unknown function (DUF1402). This family consists of several hypothetical bacterial proteins of around 310 residues in length. Members of this family seem to be found exclusively in Agrobacterium, Rhizobium and Brucella species. The function of this family is unknown. Probab=42.02 E-value=11 Score=16.83 Aligned_cols=12 Identities=33% Similarity=0.382 Sum_probs=9.0 Q ss_pred CCCCCCCCHHHHC Q ss_conf 7633572100202 Q gi|254780814|r 336 DPRFSSQTKEKLV 348 (803) Q Consensus 336 nP~F~gQTK~kL~ 348 (803) .|-|.|||=. || T Consensus 173 QPfyAGQtFG-LG 184 (304) T pfam07182 173 QPFYAGQTFG-LG 184 (304) T ss_pred CCCCCCCCCC-CC T ss_conf 6100457667-76 No 114 >TIGR01836 PHA_synth_III_C poly(R)-hydroxyalkanoic acid synthase, class III, PhaC subunit; InterPro: IPR010125 This entry represents the PhaC subunit of a heterodimeric form of polyhydroxyalkanoic acid (PHA) synthase. Excepting the PhaC of Bacillus megaterium (which needs PhaR), all members require PhaE (IPR010123 from INTERPRO) for activity and are designated class III. This enzyme builds ester polymers for carbon and energy storage that accumulate in inclusions, and both this enzyme and the depolymerase associate with the inclusions. Class III enzymes polymerise short-chain-length hydroxyalkanoates.; GO: 0008415 acyltransferase activity, 0042619 poly-hydroxybutyrate biosynthetic process. Probab=41.72 E-value=21 Score=14.58 Aligned_cols=46 Identities=15% Similarity=0.177 Sum_probs=29.3 Q ss_pred CCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 56203344305876632366788887625787753321235673738998 Q gi|254780814|r 454 RGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDINKLRYHKIIIM 503 (803) Q Consensus 454 rGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~~lrY~kiiim 503 (803) +==|||+. |..|-+--..--.+|.-++|-. + .+..-.++=|||-|. T Consensus 302 ~~PvLNiy-A~~DHLV~P~~S~aL~~~v~sn--~-Dyt~~~F~~GHiG~y 347 (367) T TIGR01836 302 KMPVLNIY-AERDHLVPPDASKALNDLVSSN--E-DYTELSFPGGHIGIY 347 (367) T ss_pred CCCHHHHH-HCCCCCCCCHHHHHHCCCCCCC--C-CCEECCCCCCEEEEE T ss_conf 65512564-3347755516999745675433--6-621004798438999 No 115 >PRK13789 phosphoribosylamine--glycine ligase; Provisional Probab=39.98 E-value=22 Score=14.39 Aligned_cols=12 Identities=25% Similarity=0.634 Sum_probs=6.3 Q ss_pred EEEECCCCCCCH Q ss_conf 233005620334 Q gi|254780814|r 449 AILPLRGKILNV 460 (803) Q Consensus 449 ai~PLrGKilNv 460 (803) .++---|.+|+| T Consensus 378 ~l~t~GGRVL~v 389 (426) T PRK13789 378 KVFSSGGRILGI 389 (426) T ss_pred EEEECCCCEEEE T ss_conf 899735938999 No 116 >KOG1332 consensus Probab=39.74 E-value=22 Score=14.36 Aligned_cols=107 Identities=19% Similarity=0.237 Sum_probs=63.8 Q ss_pred CCCCEEEEEECCCCEEE---EEECCCCCCCCCCCCCCCCEEEEE-----EEEECCCCCCCCCC----------------- Q ss_conf 77975799983898399---997684453640777885001256-----42501132328986----------------- Q gi|254780814|r 59 GYADIITVTLNMDGSCT---VSDNGRGIPTDLHKEEGISAAEVI-----MTRLHAGGKFNQKS----------------- 113 (803) Q Consensus 59 g~~~~I~V~i~~dgsis---V~dnGrGIPv~~h~~~~~~~~E~i-----~t~L~ag~kfd~~~----------------- 113 (803) =|+++| .|-..|++|- |++||.-.+..--..+--|++++. ||.++|+.-||.+- T Consensus 21 yygkrl-ATcsSD~tVkIf~v~~n~~s~ll~~L~Gh~GPVwqv~wahPk~G~iLAScsYDgkVIiWke~~g~w~k~~e~~ 99 (299) T KOG1332 21 YYGKRL-ATCSSDGTVKIFEVRNNGQSKLLAELTGHSGPVWKVAWAHPKFGTILASCSYDGKVIIWKEENGRWTKAYEHA 99 (299) T ss_pred HHCCEE-EEECCCCCEEEEEECCCCCCEEEEEECCCCCCEEEEEECCCCCCCEEEEEECCCEEEEEECCCCCHHHHHHHH T ss_conf 310435-6632787189999748887355467427778705776416554737677623872999956898456654334 Q ss_pred ------CEEECCCCCCEEEEEEECCCEEEEEEEECCEEEEEEEECC-EECCCCEEECCCCCCCC-EEEEEEEC Q ss_conf ------0483116752037786326449999997895999998898-36463138732778887-08999877 Q gi|254780814|r 114 ------YKVSGGLHGVGVSVVNALSSWLKLRIKREGNIYEMSFING-ILDNPLVVTGSAGNDTG-TEVTFLPS 178 (803) Q Consensus 114 ------yk~sgGlhGvG~s~vNalS~~~~v~v~rdgk~~~q~f~~g-~~~~~l~~~~~~~~~~G-T~I~F~PD 178 (803) --++.--|++|..|..|-| ||++-..+|... ...++ ++-.+ ..-| +.|+|-|- T Consensus 100 ~h~~SVNsV~wapheygl~LacasS---------DG~vsvl~~~~~g~w~t~--ki~~a-H~~GvnsVswapa 160 (299) T KOG1332 100 AHSASVNSVAWAPHEYGLLLACASS---------DGKVSVLTYDSSGGWTTS--KIVFA-HEIGVNSVSWAPA 160 (299) T ss_pred HHCCCCCEECCCCCCCCEEEEEEEC---------CCCEEEEEECCCCCCCCH--HHHCC-CCCCCCEEEECCC T ss_conf 2015300020355566427999607---------996789997478885420--10103-1255533564576 No 117 >TIGR02749 prenyl_cyano solanesyl diphosphate synthase; InterPro: IPR014120 This entry contains solanesyl diphosphate synthases from cyanobacteria or plastid-containing eukaryotes. A member from Arabidopsis (where both plastoquinone and ubiquinone contain the C(45) prenyl moiety) was characterised by heterologous expression as a solanesyl diphosphate synthase.. Probab=39.26 E-value=22 Score=14.31 Aligned_cols=35 Identities=29% Similarity=0.525 Sum_probs=21.8 Q ss_pred HHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 87400357999---------877899988775545532110011 Q gi|254780814|r 368 LEEHPLEAKII---------VKKVLDASMVRDAARRARDLTRRK 402 (803) Q Consensus 368 l~~n~~~ak~i---------~~k~~~a~~ar~~a~kar~~~rrK 402 (803) |+++|...+-| +++++.--+.+.+.+|+||+...+ T Consensus 250 LE~~p~L~~LIer~F~~~~DLe~AL~lv~~~~aI~k~rELA~~~ 293 (325) T TIGR02749 250 LEEEPKLSELIEREFSQKGDLEQALELVRKSGAIKKARELAKEQ 293 (325) T ss_pred HCCCCCCCHHHCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 23788720120111378145799999998237215789999999 No 118 >pfam09284 RhgB_N Rhamnogalacturonase B, N-terminal. Members of this family are found in prokaryotic Rhamnogalacturonase B, and adopt a structure consisting of a beta supersandwich, with eighteen strands in two beta-sheets. The exact function of the domain is unknown, but a putative role includes carbohydrate-binding. Probab=38.07 E-value=23 Score=14.18 Aligned_cols=87 Identities=15% Similarity=0.157 Sum_probs=49.5 Q ss_pred EEEEEEEC-----CCCCCCCCCCEEEC-CCC---CCEEEEEEECC--CEEEEEEEECCEEEEEEEECCEECCCCEEECCC Q ss_conf 25642501-----13232898604831-167---52037786326--449999997895999998898364631387327 Q gi|254780814|r 97 EVIMTRLH-----AGGKFNQKSYKVSG-GLH---GVGVSVVNALS--SWLKLRIKREGNIYEMSFINGILDNPLVVTGSA 165 (803) Q Consensus 97 E~i~t~L~-----ag~kfd~~~yk~sg-Glh---GvG~s~vNalS--~~~~v~v~rdgk~~~q~f~~g~~~~~l~~~~~~ 165 (803) .|||+.=+ ++=||...+|++++ |-| |+|++.|-|-. ..-.|++....-.|..-+.+|.+.--+..- .+ T Consensus 17 ~Lvf~V~~s~gDi~Si~Y~g~E~Qy~~k~SHI~SGLGsatVs~~~~~~~IkVt~~~~tLthYyv~r~g~~~IYMaT~-~~ 95 (249) T pfam09284 17 ALVFKVNKSSGDITSIVYRGTELQYSGKGSHIGSGLGSATVTAEQIGGTIKVTCKTSTLTHYYIARSGDNTIYMATY-IT 95 (249) T ss_pred CEEEEEECCCCCEEEEEECCEEEECCCCCCEECCCCCCCEEEEEEECCEEEEEEECCCCEEEEEEECCCCEEEEEEC-CC T ss_conf 75999944888757888888874327765313156675357776308879999982895288997179844898733-56 Q ss_pred CCCCCEEEEEEE--CHHHHHH Q ss_conf 788870899987--7888542 Q gi|254780814|r 166 GNDTGTEVTFLP--SSDIFSV 184 (803) Q Consensus 166 ~~~~GT~I~F~P--D~~iF~~ 184 (803) .+..=...+|+. +..+|++ T Consensus 96 ae~sigelRfIaRL~~~~lPn 116 (249) T pfam09284 96 AEPSIGELRFIARLNVSLLPN 116 (249) T ss_pred CCCCCCCEEEEEECCCCCCCC T ss_conf 788854078899815453889 No 119 >TIGR02080 O_succ_thio_ly O-succinylhomoserine (thiol)-lyase; InterPro: IPR011821 Proteins in this entry are important in the interconversion of the sulphur-containing amino acids L-cysteine and L-methionine in bacteria. In bacteria, inorganic sulphur is reduced to sulphide, which is then used to displace the acetyl group of O-acetyl-L-serine to form the essential amino acid L-cysteine. Biosynthesis of L-methionine then occurs via the transsulphuration pathway which involves the sequential action of two pyridoxal 5'-phosphate (PLP) enzymes, cystathionine gamma-synthase (CGS) and cystathionine beta-lyase (CBL). Firstly CGS catalyses the formation of the intermediate cystathione from a homoserine ester and L-cysteine. This intermediate is then converted by CBL to L-homoserine, which is subsequently methylated by methionine synthase to form L-methionine. For more information see . The CGS enzyme (P00935 from SWISSPROT) from Escherichia coli is, like other CGS enzymes, a homotetramer composed of two catalytically active dimers . The active site of each dimer is found at the interface between the monomers and involves residues from each monomer. Catalysis occurs by alpha,gamma-elimination/replacement reactions.. Probab=37.37 E-value=14 Score=15.82 Aligned_cols=16 Identities=25% Similarity=0.316 Sum_probs=9.6 Q ss_pred HHHHHHHHHHCCCCCC Q ss_conf 9999999986242000 Q gi|254780814|r 297 TRQITSYVEKSFGSKK 312 (803) Q Consensus 297 ~~~i~~y~~~~~~~Kk 312 (803) ++.|-+|..++-..|| T Consensus 259 A~~Iv~yL~~~P~V~k 274 (383) T TIGR02080 259 AQTIVEYLQTQPLVKK 274 (383) T ss_pred HHHHHHHHHCCCCEEE T ss_conf 9999999844894026 No 120 >pfam10340 DUF2424 Protein of unknown function (DUF2424). This is a family of proteins conserved in yeasts. The function is not known. Probab=37.17 E-value=19 Score=14.92 Aligned_cols=15 Identities=40% Similarity=0.572 Sum_probs=7.8 Q ss_pred EEEEEEECCCCCCCC Q ss_conf 179998255754322 Q gi|254780814|r 425 SELFLVEGDSAGGSA 439 (803) Q Consensus 425 ~eL~ivEGDSAggsa 439 (803) |+=+++=||||||-+ T Consensus 194 ~kni~LmGDSAGGNL 208 (374) T pfam10340 194 CKNVTLMGDSAGGNL 208 (374) T ss_pred CCEEEEEECCCCCHH T ss_conf 616999723776237 No 121 >COG0049 RpsG Ribosomal protein S7 [Translation, ribosomal structure and biogenesis] Probab=37.12 E-value=24 Score=14.12 Aligned_cols=57 Identities=19% Similarity=0.371 Sum_probs=28.9 Q ss_pred CCCCCCCCCCHHHHCCCC----CCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 137633572100202611----110321110268999987400357999877899988775 Q gi|254780814|r 334 MPDPRFSSQTKEKLVSSE----VRSVVESFVNDGLSAWLEEHPLEAKIIVKKVLDASMVRD 390 (803) Q Consensus 334 i~nP~F~gQTK~kL~s~~----v~~~v~~~v~~~l~~~l~~n~~~ak~i~~k~~~a~~ar~ 390 (803) .|+|.|.+.+=++|.|.- =+...+.+|+..|...-++..+..-.+..+++++..-|+ T Consensus 12 ~~dpk~~~~iv~rliN~iM~~GKK~~A~~Ivy~A~~ii~~kTg~~Piqvf~~Ai~N~~P~~ 72 (148) T COG0049 12 LPDPKYNSVIVERLINKIMRDGKKSLAEKIVYGAFDIIEKKTGQNPLQVFEKAIENVKPRE 72 (148) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCHH T ss_conf 8986412499999999999805389999999999999999978986999999998438665 No 122 >KOG0020 consensus Probab=36.70 E-value=24 Score=14.02 Aligned_cols=155 Identities=20% Similarity=0.299 Sum_probs=73.1 Q ss_pred HHHHHHHHHH-------------HHHCCCCE--EEEEECCC-CEEEEEECCCCCCCC-CCCCCC---CCEEEEEEEEECC Q ss_conf 5675557886-------------86077975--79998389-839999768445364-077788---5001256425011 Q gi|254780814|r 46 YEVLDNAIDE-------------SLEGYADI--ITVTLNMD-GSCTVSDNGRGIPTD-LHKEEG---ISAAEVIMTRLHA 105 (803) Q Consensus 46 ~EiidNsvDe-------------~~~g~~~~--I~V~i~~d-gsisV~dnGrGIPv~-~h~~~~---~~~~E~i~t~L~a 105 (803) -|+|.||-|- .+.+ .-. |+|..++. .-..|.|-|-|+.-+ .-..-| .++.---+..+.. T Consensus 101 RELISNASDAlDKIRllaLtd~~~L~~-~~el~ikIK~Dke~klLhi~DtGiGMT~edLi~NLGTIAkSGTs~Fl~Km~~ 179 (785) T KOG0020 101 RELISNASDALDKIRLLALTDKDVLGE-TEELEIKIKADKEKKLLHITDTGIGMTREDLIKNLGTIAKSGTSEFLEKMQD 179 (785) T ss_pred HHHHHHHHHHHHHEEEEECCCHHHHCC-CCCEEEEEEECHHHCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 999731366655510111168657275-7126999862201073467516678529999976655511457999998536 Q ss_pred CCCCCCCCCEEECCCCCCEEEEEEECCCEEEEEEEECC-EEEEEEEEC--C-EECCCCEEECCCCCCCCEEEEEEECHHH Q ss_conf 32328986048311675203778632644999999789-599999889--8-3646313873277888708999877888 Q gi|254780814|r 106 GGKFNQKSYKVSGGLHGVGVSVVNALSSWLKLRIKREG-NIYEMSFIN--G-ILDNPLVVTGSAGNDTGTEVTFLPSSDI 181 (803) Q Consensus 106 g~kfd~~~yk~sgGlhGvG~s~vNalS~~~~v~v~rdg-k~~~q~f~~--g-~~~~~l~~~~~~~~~~GT~I~F~PD~~i 181 (803) .|. +.....---|.-|||.-..-....+..|+...|. ..|. |+. + -.++. ...|.+ -++||.|+...-.+- T Consensus 180 ~~~-~~~~~~dlIGQFGVGFYsAfLVAD~vvVtsKhNdD~Qyi--WESdan~Fsvse-Dprg~t-L~RGt~ItL~LkeEA 254 (785) T KOG0020 180 SGD-SEGLMNDLIGQFGVGFYSAFLVADRVVVTSKHNDDSQYI--WESDANSFSVSE-DPRGNT-LGRGTEITLYLKEEA 254 (785) T ss_pred CCC-CHHHHHHHHHHCCHHHHHHHHHCCEEEEEECCCCCCCEE--EECCCCCEEEEC-CCCCCC-CCCCCEEEEEEHHHH T ss_conf 432-123688887750512565222001699973257864013--404676135531-788885-568647999865545 Q ss_pred HHHHHCCHHHHHHHHHHHH-CCCCEEEE Q ss_conf 5421000899999999731-14641699 Q gi|254780814|r 182 FSVQDFNYDTLQHRLRELS-FLNSSVQI 208 (803) Q Consensus 182 F~~~~~~~d~l~~R~~~~A-~Ln~gl~i 208 (803) =. -+..|.|...++..+ |.|=-+.+ T Consensus 255 ~d--yLE~dtlkeLvkkYSqFINFpI~l 280 (785) T KOG0020 255 GD--YLEEDTLKELVKKYSQFINFPISL 280 (785) T ss_pred HH--HCCHHHHHHHHHHHHHHCCCCEEE T ss_conf 54--306668999999988750774010 No 123 >COG4822 CbiK Cobalamin biosynthesis protein CbiK, Co2+ chelatase [Coenzyme metabolism] Probab=35.81 E-value=25 Score=13.92 Aligned_cols=10 Identities=60% Similarity=0.866 Sum_probs=7.4 Q ss_pred EEEEEECCCC Q ss_conf 7999825575 Q gi|254780814|r 426 ELFLVEGDSA 435 (803) Q Consensus 426 eL~ivEGDSA 435 (803) -|.+|-||-| T Consensus 200 PlMlvAG~Ha 209 (265) T COG4822 200 PLMLVAGDHA 209 (265) T ss_pred EEEEEECHHH T ss_conf 5688604022 No 124 >TIGR01781 Trep_dent_lipo Treponema denticola clustered lipoprotein; InterPro: IPR010103 This entry represents a family of six predicted lipoproteins from a region of about 20 tandemly arranged genes in the Treponema denticola genome. Two other neighbouring genes share the lipoprotein signal peptide region but do not show more extensive homology. The function of this locus is unknown.. Probab=35.73 E-value=23 Score=14.17 Aligned_cols=25 Identities=32% Similarity=0.595 Sum_probs=17.2 Q ss_pred CCCCCCEEEEEECCCCC--CCCCCEEH Q ss_conf 17754121222246216--66772103 Q gi|254780814|r 266 NDGYHENVLCFTNNIPQ--KDGGTHLS 290 (803) Q Consensus 266 ~~~~~e~i~SfvN~I~T--~~GGtHv~ 290 (803) +.||.|-++=|.|--+- .+-|.|.- T Consensus 267 nSGyDEYiV~Ft~~~~k~~~edG~~~~ 293 (463) T TIGR01781 267 NSGYDEYIVLFTEDSRKEELEDGIFLL 293 (463) T ss_pred CCCCEEEEEEECCCCCCCHHHCCCCCH T ss_conf 887304799840799961656471001 No 125 >TIGR01992 PTS-IIBC-Tre PTS system, trehalose-specific IIBC component; InterPro: IPR011296 This entry represents the fused enzyme II B and C components of the trehalose-specific PTS sugar transporter system . Trehalose is converted to trehalose-6-phosphate in the process of translocation into the cell. These transporters lack their own IIA domains and instead use the glucose IIA protein (IIAglc or Crr) . The exceptions to this rule are Staphylococci and Streptococci which contain their own A domain as a C-terminal fusion. This family is closely related to the sucrose transporting PTS IIBC enzymes described by the IIB component and the IIC component (IPR004719 from INTERPRO), respectively. In Escherichia coli, Bacillus subtilis and Pseudomonas fluorescens the presence of this gene is associated with the presence of trehalase which degrades T6P to glucose and glucose-6-P. Trehalose may also be transported (in Salmonella) via the mannose PTS or galactose permease systems , or (in Sinorhizobium, Thermococcus and Sulfolobus, for instance) by ABC transporters , , .; GO: 0015574 trehalose transmembrane transporter activity, 0015771 trehalose transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=35.32 E-value=16 Score=15.46 Aligned_cols=16 Identities=13% Similarity=0.150 Sum_probs=7.8 Q ss_pred HHHCCCCEEEEEEEEC Q ss_conf 7311464169999963 Q gi|254780814|r 198 ELSFLNSSVQISLIDK 213 (803) Q Consensus 198 ~~A~Ln~gl~i~l~De 213 (803) ..+.|---|++.|+|+ T Consensus 19 ~atHCvTRLRfvL~D~ 34 (489) T TIGR01992 19 AATHCVTRLRFVLKDP 34 (489) T ss_pred HHHHHHHHHHHHHCCC T ss_conf 8756888888861681 No 126 >COG5257 GCD11 Translation initiation factor 2, gamma subunit (eIF-2gamma; GTPase) [Translation, ribosomal structure and biogenesis] Probab=34.81 E-value=26 Score=13.81 Aligned_cols=33 Identities=27% Similarity=0.377 Sum_probs=18.8 Q ss_pred CCHHHHHCCCEEEEEEECCCCCCCCCCHHHHCCCC Q ss_conf 76134305721566652137633572100202611 Q gi|254780814|r 317 VIGDDCREGFTSVLSIKMPDPRFSSQTKEKLVSSE 351 (803) Q Consensus 317 ~~~~Diregl~~visvki~nP~F~gQTK~kL~s~~ 351 (803) ++|.-+.+|-..+|.. .+|-=+-||+|-|.--+ T Consensus 104 LsGAAlMDgAlLvIaA--NEpcPQPQT~EHl~Ale 136 (415) T COG5257 104 LSGAALMDGALLVIAA--NEPCPQPQTREHLMALE 136 (415) T ss_pred HCCHHHHCCEEEEEEC--CCCCCCCCHHHHHHHHH T ss_conf 6023442153899953--89898973187788776 No 127 >pfam05914 RIB43A RIB43A. This family consists of several RIB43A-like eukaryotic proteins. Ciliary and flagellar microtubules contain a specialized set of protofilaments, termed ribbons, that are composed of tubulin and several associated proteins. RIB43A was first characterized in the unicellular biflagellate, Chlamydomonas reinhardtii although highly related sequences are present in several higher eukaryotes including humans. The function of this protein is unknown although the structure of RIB43A and its association with the specialized protofilament ribbons and with basal bodies is relevant to the proposed role of ribbons in forming and stabilising doublet and triplet microtubules and in organising their three-dimensional structure. Human RIB43A homologues could represent a structural requirement in centriole replication in dividing cells. Probab=34.69 E-value=16 Score=15.39 Aligned_cols=40 Identities=30% Similarity=0.494 Sum_probs=16.9 Q ss_pred CCHHHHHCCCEEEEEEECCCCCCCCCCHHHHCCCCCCHHHHCCCHHHHHHHHHH Q ss_conf 761343057215666521376335721002026111103211102689999874 Q gi|254780814|r 317 VIGDDCREGFTSVLSIKMPDPRFSSQTKEKLVSSEVRSVVESFVNDGLSAWLEE 370 (803) Q Consensus 317 ~~~~Diregl~~visvki~nP~F~gQTK~kL~s~~v~~~v~~~v~~~l~~~l~~ 370 (803) ++-+|.|-|.. |+ ..|+| +=|+..+=+ .+-.+.+-.|+.. T Consensus 127 v~DdDpr~g~S---S~----Q~F~G---EDL~~~eR~----k~Q~eQ~r~W~~q 166 (379) T pfam05914 127 VGDDDPRLGPS---SL----QKFLG---EDLNEADRK----KMQQEQQREWLEQ 166 (379) T ss_pred CCCCCCCCCHH---HH----HHCCC---CCCCHHHHH----HHHHHHHHHHHHH T ss_conf 89888877725---41----31257---757859899----9999999999999 No 128 >LOAD_Toprim consensus Probab=34.20 E-value=26 Score=13.74 Aligned_cols=27 Identities=33% Similarity=0.530 Sum_probs=21.6 Q ss_pred CCCCEEEEECCCCCCCCHHHHHHHHHH Q ss_conf 737389985088885328999999999 Q gi|254780814|r 496 RYHKIIIMTDADVDGAHIRTLLLTFFF 522 (803) Q Consensus 496 rY~kiiimtDaD~DG~HI~~Llltff~ 522 (803) ....|+|+||.|.-|--|+-=|...+- T Consensus 45 ~~~~VilATDpDrEGEaIa~hl~~~l~ 71 (98) T LOAD_Toprim 45 KAKEVIIATDPDREGEKIAKKLLELLR 71 (98) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHC T ss_conf 699299946969652899999999836 No 129 >TIGR00018 panC pantoate--beta-alanine ligase; InterPro: IPR003721 D-Pantothenate is synthesized via four enzymes from ketoisovalerate, which is an intermediate of branched-chain amino acid synthesis . Pantoate-beta-alanine ligase, also know as pantothenate synthase, (6.3.2.1 from EC) catalyzes the formation of pantothenate from pantoate and alanine in the pantothenate biosynthesis pathway .; GO: 0004592 pantoate-beta-alanine ligase activity, 0015940 pantothenate biosynthetic process. Probab=34.07 E-value=26 Score=13.78 Aligned_cols=25 Identities=16% Similarity=0.336 Sum_probs=14.7 Q ss_pred EECHHHHHHHHCC-HHHHHHHHHHHH Q ss_conf 8778885421000-899999999731 Q gi|254780814|r 176 LPSSDIFSVQDFN-YDTLQHRLRELS 200 (803) Q Consensus 176 ~PD~~iF~~~~~~-~d~l~~R~~~~A 200 (803) .||-++||..++. .-+|.+.+.|+- T Consensus 151 ~Pd~AYFG~KD~QQla~i~~lV~Dl~ 176 (310) T TIGR00018 151 KPDVAYFGKKDAQQLAVIRKLVKDLF 176 (310) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 74510047743889999999998734 No 130 >KOG3938 consensus Probab=33.95 E-value=27 Score=13.71 Aligned_cols=16 Identities=19% Similarity=0.260 Sum_probs=7.9 Q ss_pred EEECCCCCCCCCCEEH Q ss_conf 2224621666772103 Q gi|254780814|r 275 CFTNNIPQKDGGTHLS 290 (803) Q Consensus 275 SfvN~I~T~~GGtHv~ 290 (803) |-.+.|.+..=|-|.. T Consensus 160 svidri~~i~VGd~IE 175 (334) T KOG3938 160 SVIDRIEAICVGDHIE 175 (334) T ss_pred CHHHHHHHEEHHHHHH T ss_conf 4210312000776787 No 131 >cd00223 TOPRIM_TopoIIB_SPO TOPRIM_TopoIIB_SPO: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in the type IIB family of DNA topoisomerases and Spo11. This subgroup contains proteins similar to Sulfolobus shibatae topoisomerase VI (TopoVI) and Saccharomyces cerevisiae meiotic recombination factor: Spo11. Type II DNA topoisomerases catalyze the ATP-dependent transport of one DNA duplex through another, in the process generating transient double strand breaks via covalent attachments to both DNA strands at the 5' positions. TopoVI enzymes are heterotetramers found in archaea and plants. Spo11 plays a role in generating the double strand breaks that initiate homologous recombination during meiosis. S. shibatae TopoVI relaxes both positive and negative supercoils, and in addition has a strong decatenase activity. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspart Probab=33.87 E-value=21 Score=14.52 Aligned_cols=34 Identities=18% Similarity=0.320 Sum_probs=17.2 Q ss_pred CHHHHHHHCCCCCCCCCCCEEEEEEEEEEEEEEEEEEC Q ss_conf 35776530035655688520364200369997334531 Q gi|254780814|r 229 IEAFVSYLDRHKKPLMQSPIRIQGSRDNISIDLAMRWN 266 (803) Q Consensus 229 i~~fv~~l~~~~~~~~~~~i~~~~~~~~~~veval~~~ 266 (803) .+.|+..|.... .-|++.-...|..+++|+.+|- T Consensus 39 TR~fl~~L~~~~----~lpv~~l~D~DP~Gi~I~~~Yk 72 (160) T cd00223 39 TRRFLRRLHEEL----DLPVYILVDGDPYGISILLTYK 72 (160) T ss_pred HHHHHHHHHHHH----CCCEEEEECCCCCCEEEEEEEE T ss_conf 999999999987----8988999758866106778886 No 132 >PRK09918 putative periplasmic chaperone protein; Provisional Probab=33.73 E-value=25 Score=13.91 Aligned_cols=12 Identities=33% Similarity=0.620 Sum_probs=8.5 Q ss_pred CEEEEEECCCCCC Q ss_conf 8399997684453 Q gi|254780814|r 72 GSCTVSDNGRGIP 84 (803) Q Consensus 72 gsisV~dnGrGIP 84 (803) .+|+|.|+|.. | T Consensus 42 ~sv~v~N~~~~-p 53 (230) T PRK09918 42 GSINVKNTDSN-P 53 (230) T ss_pred EEEEEEECCCC-C T ss_conf 89999929999-4 No 133 >KOG2479 consensus Probab=33.37 E-value=8.1 Score=17.71 Aligned_cols=27 Identities=33% Similarity=0.486 Sum_probs=14.3 Q ss_pred EEEEEEECHHHHHHHHCCHHHHHHHHHHHHCCC Q ss_conf 089998778885421000899999999731146 Q gi|254780814|r 171 TEVTFLPSSDIFSVQDFNYDTLQHRLRELSFLN 203 (803) Q Consensus 171 T~I~F~PD~~iF~~~~~~~d~l~~R~~~~A~Ln 203 (803) ..|.-.||+...+...|+ ++-.|++.| T Consensus 162 sSV~V~~dW~~lEEm~fs------~lsKL~l~n 188 (549) T KOG2479 162 SSVDVRPDWVVLEEMDFS------RLSKLSLPN 188 (549) T ss_pred CCCEECCCCEEECCCCHH------HHHHHCCCC T ss_conf 762305442010136626------655412667 No 134 >PRK11070 ssDNA exonuclease RecJ; Provisional Probab=33.12 E-value=27 Score=13.61 Aligned_cols=15 Identities=13% Similarity=0.189 Sum_probs=5.5 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 740035799987789 Q gi|254780814|r 369 EEHPLEAKIIVKKVL 383 (803) Q Consensus 369 ~~n~~~ak~i~~k~~ 383 (803) .+++..|..+..+.. T Consensus 306 ~~d~~~A~~la~~L~ 320 (574) T PRK11070 306 CDNIGEARVLANELD 320 (574) T ss_pred CCCHHHHHHHHHHHH T ss_conf 899999999999999 No 135 >pfam10691 DUF2497 Protein of unknown function (DUF2497). Members of this family belong to the Alphaproteobacteria. The function of the family is not known. Probab=32.10 E-value=28 Score=13.49 Aligned_cols=32 Identities=19% Similarity=0.507 Sum_probs=20.8 Q ss_pred CCHHHHCCCHHHHHHHHHHH-HHHHHHHHHHHH Q ss_conf 11032111026899998740-035799987789 Q gi|254780814|r 352 VRSVVESFVNDGLSAWLEEH-PLEAKIIVKKVL 383 (803) Q Consensus 352 v~~~v~~~v~~~l~~~l~~n-~~~ak~i~~k~~ 383 (803) +..+|.+.+..-|..||+.| |...+.++.+-| T Consensus 34 lE~lv~emLrPmL~~WLd~nLp~lve~~V~~EI 66 (72) T pfam10691 34 LEDLVREMLRPMLKEWLDNNLPALVERLVREEI 66 (72) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999998999999998613799999999999 No 136 >PRK05776 DNA topoisomerase III; Provisional Probab=32.02 E-value=28 Score=13.48 Aligned_cols=11 Identities=18% Similarity=0.395 Sum_probs=4.2 Q ss_pred HHHHHHHHHHH Q ss_conf 88899999999 Q gi|254780814|r 293 RSALTRQITSY 303 (803) Q Consensus 293 ~~al~~~i~~y 303 (803) .+|+.|.+-++ T Consensus 163 ~A~~aR~~~D~ 173 (675) T PRK05776 163 NAGLARHELDW 173 (675) T ss_pred HHHHHHHHHHH T ss_conf 99999999988 No 137 >KOG0318 consensus Probab=30.90 E-value=30 Score=13.35 Aligned_cols=13 Identities=23% Similarity=0.460 Sum_probs=5.5 Q ss_pred EEEEEECHHHHHH Q ss_conf 8999877888542 Q gi|254780814|r 172 EVTFLPSSDIFSV 184 (803) Q Consensus 172 ~I~F~PD~~iF~~ 184 (803) +|+|-||-++|-. T Consensus 195 ~VRysPDG~~Fat 207 (603) T KOG0318 195 CVRYSPDGSRFAT 207 (603) T ss_pred EEEECCCCCEEEE T ss_conf 4898899875998 No 138 >TIGR01245 trpD anthranilate phosphoribosyltransferase; InterPro: IPR005940 In many widely different species, including Escherichia coli, Thermotoga maritima, and Archaeoglobus fulgidus, this enzyme is a C-terminal part of a multifunctional protein together with glutamine amidotransferase; the fusion protein is designated anthranilate synthase component II (4.1.3.27 from EC).; GO: 0004048 anthranilate phosphoribosyltransferase activity, 0000162 tryptophan biosynthetic process. Probab=30.75 E-value=24 Score=14.07 Aligned_cols=37 Identities=19% Similarity=0.234 Sum_probs=20.2 Q ss_pred CEEEEEEHHHHHH-HHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 4021236798898-887865641899876289988524 Q gi|254780814|r 760 RSLLRVKITDASQ-ADDLFSRLMGDEVEPRREFIQENS 796 (803) Q Consensus 760 RtLLqV~IeDa~~-Ad~iF~~LMGD~VEPRReFIe~NA 796 (803) ..|-++.+.|+.+ |+.+=..|=|....|.|+|+-=|| T Consensus 254 ~~l~~l~g~~p~Ena~~~~~~l~G~~~~~~~d~V~lNA 291 (331) T TIGR01245 254 APLEELRGGSPEENAEILREILEGKGSGAKRDIVALNA 291 (331) T ss_pred CCCHHCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 53011046898999999999971888717899999989 No 139 >pfam05055 DUF677 Protein of unknown function (DUF677). This family consists of AT14A like proteins from Arabidopsis thaliana. At14a has a small domain that has sequence similarities to integrins from fungi, insects and humans. Transcripts of At14a are found in all Arabidopsis tissues and localizes partly to the plasma membrane. Probab=30.23 E-value=30 Score=13.27 Aligned_cols=10 Identities=20% Similarity=0.258 Sum_probs=3.8 Q ss_pred EEECCCCCCC Q ss_conf 2224621666 Q gi|254780814|r 275 CFTNNIPQKD 284 (803) Q Consensus 275 SfvN~I~T~~ 284 (803) .|-|+..|.+ T Consensus 71 YFesS~ktld 80 (336) T pfam05055 71 YFESTKKTLD 80 (336) T ss_pred HHHHHHHHHH T ss_conf 9981099999 No 140 >PHA02516 baseplate wedge subunit gpW; Provisional Probab=30.08 E-value=30 Score=13.33 Aligned_cols=65 Identities=18% Similarity=0.189 Sum_probs=30.0 Q ss_pred EECCCCCHHCCCCCCEECC--------CCCCCHHHHHHHHHHH--------HHHHHHHHCCC-----CEEEEEECCCCEE Q ss_conf 2313542000458715432--------7996325445656755--------57886860779-----7579998389839 Q gi|254780814|r 16 QILKGLDAVKKRPSMYIGD--------TDGESGLHHMIYEVLD--------NAIDESLEGYA-----DIITVTLNMDGSC 74 (803) Q Consensus 16 ~~L~gle~vrkRP~mYiG~--------t~~~~Gl~hlv~Eiid--------NsvDe~~~g~~-----~~I~V~i~~dgsi 74 (803) +.++|.+||+--=.+.... -+-+.+++.+++|-++ .+|-||+.-+= +.|+|+-..||.+ T Consensus 7 ~~l~g~~~I~QSI~~IL~T~~GeRvmr~~fGs~l~~lld~p~~~~~~~~i~~~i~~Al~rwEPRi~l~~V~v~~~~~G~~ 86 (103) T PHA02516 7 RALSDLEHISQSIGDILLTPVGSRVMRREYGSLLPDLIDQPQNPALRLQIYAACAMALMRWEPRITLTRVQIERAADGRM 86 (103) T ss_pred CCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCE T ss_conf 63368899999999997699967034665355708882797998999999999999999549964899999997579819 Q ss_pred EEEECC Q ss_conf 999768 Q gi|254780814|r 75 TVSDNG 80 (803) Q Consensus 75 sV~dnG 80 (803) .+.=.| T Consensus 87 ~l~l~g 92 (103) T PHA02516 87 TVDITG 92 (103) T ss_pred EEEEEE T ss_conf 999999 No 141 >cd03362 TOPRIM_TopoIA_TopoIII TOPRIM_TopoIA_TopoIII: The topoisomerase-primase (TORPIM) domain found in members of the type IA family of DNA topoisomerases (Topo IA) similar to topoisomerase III. Type IA DNA topoisomerases remove (relax) negative supercoils in the DNA by: cleaving one strand of the DNA duplex, covalently linking to the 5' phosphoryl end of the DNA break and, allowing the other strand of the duplex to pass through the gap. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). For topoisomerases the conserved glutamate is believed to act as a general base in strand joining and, as a general acid in strand cleavage. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=30.05 E-value=30 Score=13.25 Aligned_cols=24 Identities=38% Similarity=0.573 Sum_probs=20.6 Q ss_pred CCEEEEECCCCCCCCHHHHHHHHH Q ss_conf 738998508888532899999999 Q gi|254780814|r 498 HKIIIMTDADVDGAHIRTLLLTFF 521 (803) Q Consensus 498 ~kiiimtDaD~DG~HI~~Llltff 521 (803) ..||+.||+|--|-.|.-.++..+ T Consensus 100 d~ii~atD~DrEGE~I~~ei~~~~ 123 (151) T cd03362 100 DEIVIATDADREGELIGREILEYA 123 (151) T ss_pred CEEEECCCCCCCHHHHHHHHHHHH T ss_conf 989987788720459999999984 No 142 >TIGR01990 bPGM beta-phosphoglucomutase; InterPro: IPR010972 This model represents the beta-phosphoglucomutase enzyme which catalyses the interconversion of beta-D-glucose-1-phosphate and beta-D-glucose-6-phosphate. The 6-phosphate is capable of non-enzymatic anomerization (alpha to beta or vice versa) while the 1-phosphate is not. A separate enzyme is responsible for the isomerisation of the alpha anomers. Beta-D-glucose-1-phosphate results from the phosphorolysis of maltose (2.4.1.8 from EC), trehalose (2.4.1.64 from EC) or trehalose-6-phosphate (2.4.1.216 from EC). Alternatively, these reactions can be run in the synthetic direction to create the disaccharides. All sequenced genomes which contain a member of this family also appear to contain at least one putative maltose or trehalose phosphorylase. Three species, Lactococcus, Enterococcus and Neisseria appear to contain a pair of paralogous beta-PGM's. Beta-phosphoglucomutase is a member of the haloacid dehalogenase superfamily of hydrolase enzymes. These enzymes are characterised by a series of three catalytic motifs positioned within an alpha-beta (Rossman) fold . Beta-PGM contains an inserted alpha helical domain in between the first and second conserved motifs and thus is a member of subfamily IA of the superfamily , . The third catalytic motif comes in three variants, the third of which, containing a conserved DD or ED, is the only one found here as well as in several other related enzymes. The enzyme from Lactococcus lactis has been extensively characterised including a remarkable crystal structure which traps the pentacoordinate transition state . . Probab=29.06 E-value=18 Score=14.93 Aligned_cols=16 Identities=25% Similarity=0.478 Sum_probs=9.2 Q ss_pred CCCCHHHHHCCCEEEE Q ss_conf 3576134305721566 Q gi|254780814|r 315 RSVIGDDCREGFTSVL 330 (803) Q Consensus 315 ~~~~~~Diregl~~vi 330 (803) ..+|+.||.-|..-++ T Consensus 86 ~~lTp~d~LPGi~~lL 101 (190) T TIGR01990 86 KELTPEDVLPGIKSLL 101 (190) T ss_pred CCCCHHHHCCCHHHHH T ss_conf 0689866040189999 No 143 >PRK03353 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional Probab=28.43 E-value=32 Score=13.05 Aligned_cols=22 Identities=14% Similarity=0.389 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHCCCCEEEEEEEEC Q ss_conf 8999999997311464169999963 Q gi|254780814|r 189 YDTLQHRLRELSFLNSSVQISLIDK 213 (803) Q Consensus 189 ~d~l~~R~~~~A~Ln~gl~i~l~De 213 (803) ++.+.+-+.+++ .|--|.+.|. T Consensus 13 f~~Ie~Ai~al~---~G~~Viv~Dd 34 (217) T PRK03353 13 FERVEAALDALR---EGRGVLVLDD 34 (217) T ss_pred CCHHHHHHHHHH---CCCEEEEEEC T ss_conf 215999999998---7995999838 No 144 >KOG1845 consensus Probab=27.27 E-value=22 Score=14.29 Aligned_cols=28 Identities=36% Similarity=0.417 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHH--HCCCCEEEEEECCCC Q ss_conf 6567555788686--077975799983898 Q gi|254780814|r 45 IYEVLDNAIDESL--EGYADIITVTLNMDG 72 (803) Q Consensus 45 v~EiidNsvDe~~--~g~~~~I~V~i~~dg 72 (803) +.|++|||+||.. ++|...+.+..-.|+ T Consensus 151 ~aeLldnalDEi~~~~tf~~vd~I~p~~d~ 180 (775) T KOG1845 151 IAELLDNALDEITNGATFVRVDYINPVMDI 180 (775) T ss_pred HHHHCCCCCCCCCCCCCEEEEEEECCCCCC T ss_conf 666504554123465315776300521015 No 145 >TIGR00010 TIGR00010 hydrolase, TatD family; InterPro: IPR015991 Escherichia coli TatD (synonym MttC) has been shown experimentally to be a DNase . On the basis of operon structure, it was originally thought to be involved in the Sec-independent export of protein precursors bearing twin arginine signal peptides by the Tat system . This alternative membrane targeting and translocation system (Mtt), also called twin arginine translocase (Tat) (MttA1A2BC or TatABCD), has been shown to transport fully folded proteins to, or across, the membrane , . However, it was later determined that TatD is not required for this translocation process , . Furthermore, it was noted that no exact correlation exists between the ability of an organism to synthesise a TatD-like protein and whether that organism has a Tat protein transport pathway . Also, with the exception of Escherichia coli , the genetic linkage between TatD and other Tat genes (operon structure) is not maintained. Members of this group may be distantly related to a large 3D fold-based domain superfamily of metalloenzymes . The description of this fold superfamily was based on an analysis of conservation patterns in three dimensions, and the discovery that the same active-site architecture occurs in a large set of enzymes involved primarily in nucleotide metabolism. The group is thought to include urease, dihydroorotase, allantoinase, hydantoinase, AMP-, adenine- and cytosine- deaminases, imidazolonepropionase, aryldialkylphosphatase, chlorohydrolase, formylmethanofuran dehydrogenase, and other enzymes .. Probab=27.19 E-value=34 Score=12.90 Aligned_cols=28 Identities=43% Similarity=0.409 Sum_probs=22.6 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCEEEEECCCC---CCCHHHC Q ss_conf 17999825575432233465443023300562---0334430 Q gi|254780814|r 425 SELFLVEGDSAGGSAKQGRSRENQAILPLRGK---ILNVERA 463 (803) Q Consensus 425 ~eL~ivEGDSAggsak~gRdr~~qai~PLrGK---ilNv~~a 463 (803) .|-.|||=||= |=|=-|.||| +.+|+-. T Consensus 210 ~e~LL~ETDSP-----------yLaP~P~RGK~N~P~~v~~v 240 (269) T TIGR00010 210 LERLLVETDSP-----------YLAPVPYRGKRNEPAFVRYV 240 (269) T ss_pred CCCCEEEECCC-----------CCCCCCCCCCCCCCCHHHHH T ss_conf 64323201674-----------45788988985674059999 No 146 >PRK11385 hypothetical protein; Provisional Probab=27.06 E-value=34 Score=12.88 Aligned_cols=51 Identities=20% Similarity=0.227 Sum_probs=22.5 Q ss_pred CCEECCCCCCCHHHHHHHHHHHHHH-HHHHHCC---CCEEEEEECCC---CEEEEEECCC Q ss_conf 7154327996325445656755578-8686077---97579998389---8399997684 Q gi|254780814|r 29 SMYIGDTDGESGLHHMIYEVLDNAI-DESLEGY---ADIITVTLNMD---GSCTVSDNGR 81 (803) Q Consensus 29 ~mYiG~t~~~~Gl~hlv~EiidNsv-De~~~g~---~~~I~V~i~~d---gsisV~dnGr 81 (803) -||.=.++.-+-+.-+++=++-=.+ -.|.||- +|+ |..+.| -+|+|.|+|. T Consensus 5 ~~~~~~~~~~~r~~~~i~~~~~ll~s~~a~Agv~l~~TR--vIy~~~~~~~sv~v~N~~~ 62 (245) T PRK11385 5 HLYFIMTNTWNRLALLIFAVLSLLVAGELQAGVVVGGTR--FIFPADRESISILLTNTSQ 62 (245) T ss_pred EEEEEEEHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCE--EEECCCCCEEEEEEEECCC T ss_conf 689987006899999999999999865400308978758--9975999478999995899 No 147 >TIGR01880 Ac-peptdase-euk N-acyl-L-amino-acid amidohydrolase; InterPro: IPR010159 This entry represents a family of eukaryotic N-acyl-L-amino-acid amidohydrolase (3.5.1.14 from EC) is a homodimeric zinc-binding mammalian enzyme that catalyzes the hydrolysis of N-alpha-acylated amino acids except L-aspartic acid. , . These enzymes are listed as being members of MEROPS peptidase family S20A (clan MH).; GO: 0004046 aminoacylase activity, 0006520 amino acid metabolic process, 0005737 cytoplasm. Probab=26.67 E-value=15 Score=15.62 Aligned_cols=46 Identities=22% Similarity=0.296 Sum_probs=29.4 Q ss_pred EEEEEECCEEEEEEEECCE------------ECCCCEEEC-CCCCCCCEEEEEEECHHHHHHHH Q ss_conf 9999978959999988983------------646313873-27788870899987788854210 Q gi|254780814|r 136 KLRIKREGNIYEMSFINGI------------LDNPLVVTG-SAGNDTGTEVTFLPSSDIFSVQD 186 (803) Q Consensus 136 ~v~v~rdgk~~~q~f~~g~------------~~~~l~~~~-~~~~~~GT~I~F~PD~~iF~~~~ 186 (803) ..-...||.+ |.||- .+-.|+.-| ... +|-.-|+|.||.||||... T Consensus 98 ~A~~D~~G~I----yARGaQDMKCVG~QyLEA~R~Lk~~Gi~~~-~RTIHlsfVPDEEiGG~~G 156 (433) T TIGR01880 98 SAFKDEEGNI----YARGAQDMKCVGVQYLEAVRNLKASGIFKF-KRTIHLSFVPDEEIGGHDG 156 (433) T ss_pred CCCCCCCCCE----EECCCCCCCHHHHHHHHHHHHHHHCCCCCC-CCEEEEEECCCCCCCCHHH T ss_conf 0355788887----731567641265899999999986573478-8758998328622366121 No 148 >pfam09052 SipA Salmonella invasion protein A. Salmonella invasion protein A is an actin-binding protein that contributes to host cytoskeletal rearrangements by stimulating actin polymerisation and counteracting F-actin destabilising proteins. Members of this family possess an all-helical fold consisting of eight alpha-helices arranged so that six long, amphipathic helices form a compact fold that surrounds a final, predominantly hydrophobic helix in the middle of the molecule. Probab=26.13 E-value=20 Score=14.71 Aligned_cols=30 Identities=20% Similarity=0.385 Sum_probs=20.7 Q ss_pred EEEEEEEEECCCCCCEEEEEECCCCCCCCCCEEHHH Q ss_conf 999733453177541212222462166677210356 Q gi|254780814|r 257 ISIDLAMRWNDGYHENVLCFTNNIPQKDGGTHLSGL 292 (803) Q Consensus 257 ~~veval~~~~~~~e~i~SfvN~I~T~~GGtHv~gf 292 (803) |.+.|-+.|++ .||-|..+-..|-||+++= T Consensus 300 IhyhIni~~~n------~s~DNr~~~n~~~t~~~~~ 329 (674) T pfam09052 300 IHYHINIHNDN------RSYDNRVFDNRGDTYLGGA 329 (674) T ss_pred EEEEECCCCCC------CCCCCHHHCCCCCCCCCCC T ss_conf 68872256666------6532123304654323543 No 149 >pfam06154 YagB_YeeU_YfjZ YagB/YeeU/YfjZ family. This family of proteins includes three proteins from E. coli YagB, YeeU and YfjZ. The function of these proteins is unknown. They are about 120 amino acids in length. Probab=26.12 E-value=35 Score=12.76 Aligned_cols=10 Identities=40% Similarity=0.614 Sum_probs=5.2 Q ss_pred EECCCCCCCC Q ss_conf 8508888532 Q gi|254780814|r 503 MTDADVDGAH 512 (803) Q Consensus 503 mtDaD~DG~H 512 (803) -++||.=||| T Consensus 83 tCeADtlGS~ 92 (104) T pfam06154 83 TCEADTLGSH 92 (104) T ss_pred EEECCCCCCC T ss_conf 8874678775 No 150 >KOG0526 consensus Probab=26.05 E-value=34 Score=12.92 Aligned_cols=52 Identities=17% Similarity=0.153 Sum_probs=24.4 Q ss_pred CC-EEEEEEECHHHHHHHHCCHHHHHHHHHHHHCCC-CEEEEEEEECCCCCCCEEEECCC Q ss_conf 87-089998778885421000899999999731146-41699999635864210100126 Q gi|254780814|r 169 TG-TEVTFLPSSDIFSVQDFNYDTLQHRLRELSFLN-SSVQISLIDKRSPEPREITMFYK 226 (803) Q Consensus 169 ~G-T~I~F~PD~~iF~~~~~~~d~l~~R~~~~A~Ln-~gl~i~l~Der~~~~~~~~f~~~ 226 (803) .| +-+-++-+..+|+..- .-+..- ++.-| -+|.++.+|......-+..||-+ T Consensus 115 ~G~~l~F~~~skpiFEIP~---s~Vsn~---l~gKNEv~LEFh~nDda~~~LmEmRF~iP 168 (615) T KOG0526 115 KGQELVFDVNSKPIFEIPL---SDVSNT---LTGKNEVTLEFHQNDDAPVGLMEMRFHIP 168 (615) T ss_pred CCCEEEEEECCCEEEEEEH---HHHHHH---HCCCCEEEEEEECCCCCCCCEEEEEEECC T ss_conf 5737899607962688666---564556---43775179998616777743389999548 No 151 >PRK09522 bifunctional anthranilate synthase/anthranilate phosphoribosyltransferase; Provisional Probab=25.85 E-value=36 Score=12.72 Aligned_cols=43 Identities=16% Similarity=0.131 Sum_probs=32.6 Q ss_pred CCCHHHCEEEEEEHHH------HHHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 3896134021236798------898887865641899876289988524 Q gi|254780814|r 754 TLNPDTRSLLRVKITD------ASQADDLFSRLMGDEVEPRREFIQENS 796 (803) Q Consensus 754 TMDP~tRtLLqV~IeD------a~~Ad~iF~~LMGD~VEPRReFIe~NA 796 (803) |.+|+-=-|-++.++| +..|+-+..+|-|+.-.|||+++--|| T Consensus 444 ~i~P~d~Gl~~~~~~~l~gg~~~~na~i~~~iL~G~~~~~~~d~VllNA 492 (531) T PRK09522 444 QLTAEDFGLTPYHQEQLAGGTPEENRDILTRLLQGKGDAAHEAAVAANV 492 (531) T ss_pred EECHHHCCCCCCCHHHCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 9888997999899899378999999999999978999559999999999 No 152 >COG4251 Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms] Probab=25.54 E-value=36 Score=12.68 Aligned_cols=24 Identities=29% Similarity=0.394 Sum_probs=21.7 Q ss_pred HHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 999999998630676286037777 Q gi|254780814|r 720 CSLLDTIFSIGRKGISMQRYKGLG 743 (803) Q Consensus 720 ~el~~~i~~~gkKG~sIQRYKGLG 743 (803) .+.++.++..-++.-+.-+|+|.| T Consensus 683 ~a~~~riF~iFqRl~s~~~y~gtG 706 (750) T COG4251 683 PAYFERIFVIFQRLHSRDEYLGTG 706 (750) T ss_pred HHHHHHHHHHHHHCCCHHHHCCCC T ss_conf 899988899887037565514777 No 153 >pfam08166 NUC202 NUC202 domain. This domain is found in a novel family of nucleolar proteins. Probab=25.15 E-value=33 Score=12.94 Aligned_cols=36 Identities=25% Similarity=0.432 Sum_probs=21.8 Q ss_pred HHHHCCCCHHHCEEEEEEHHHHHHHHHHHHHHCCCCCCCH Q ss_conf 7223038961340212367988988878656418998762 Q gi|254780814|r 749 QLWETTLNPDTRSLLRVKITDASQADDLFSRLMGDEVEPR 788 (803) Q Consensus 749 QLWETTMDP~tRtLLqV~IeDa~~Ad~iF~~LMGD~VEPR 788 (803) .||-+|- -..--+||+..-- .+.+|+.||+|-|.|- T Consensus 14 e~wv~tc-Ga~~g~LQgs~~~---sEaL~t~Ll~di~Pp~ 49 (61) T pfam08166 14 ELWVLTC-GASEGSLQGSAFS---SEALLTHLLSDIVPPA 49 (61) T ss_pred HHHHHHH-CCCCCCCCCCCCH---HHHHHHHHHHCCCCCH T ss_conf 9999982-3120000478510---6899999870368944 No 154 >TIGR02288 PaaN_2 phenylacetic acid degradation protein paaN; InterPro: IPR011975 This enzyme is proposed to act in the ring-opening step of phenylacetic acid degradation which follows ligation of the acid with coenzyme A (by PaaF) and hydroxylation by a multicomponent non-heme iron hydroxylase complex (PaaGHIJK). Gene symbols have been standardized in . This enzyme is related to aldehyde dehydrogenase family. This family includes sequences from Burkholderia, Bordetella, Streptomyces. Other PaaN enzymes are represented by a separate entry, IPR011966 from INTERPRO.. Probab=24.87 E-value=36 Score=12.66 Aligned_cols=71 Identities=17% Similarity=0.248 Sum_probs=47.9 Q ss_pred CEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHH--------CCCEEEE-EEECCCCCCCC Q ss_conf 12122224621666772103568889999999998624200012357613430--------5721566-65213763357 Q gi|254780814|r 271 ENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSKKDKRSVIGDDCR--------EGFTSVL-SIKMPDPRFSS 341 (803) Q Consensus 271 e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~Kk~k~~~~~~Dir--------egl~~vi-svki~nP~F~g 341 (803) ++|+==--||.|-+|=.=-|-+-..|.++|++++-+....-..---|..+|+. -||-.+. |-++.+|||.+ T Consensus 348 QailvPrDGI~t~~g~~SyDeVaa~LA~AidkLl~D~a~A~allGAiq~~~t~ARi~~A~~lGl~V~~~S~~~~hp~Fp~ 427 (560) T TIGR02288 348 QAILVPRDGIRTDQGRKSYDEVAADLAAAIDKLLGDPARAVALLGAIQSPDTVARIAEARALGLEVLLDSKKVEHPEFPG 427 (560) T ss_pred HHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCC T ss_conf 32103677741776666557899999998776517803655541461287899999985634628887436640779868 No 155 >PTZ00317 malic enzyme; Provisional Probab=24.71 E-value=37 Score=12.57 Aligned_cols=47 Identities=28% Similarity=0.487 Sum_probs=24.4 Q ss_pred CCCEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCE--EEEEE---CCCCCCCC Q ss_conf 8715432799632544565675557886860779757999838983--99997---68445364 Q gi|254780814|r 28 PSMYIGDTDGESGLHHMIYEVLDNAIDESLEGYADIITVTLNMDGS--CTVSD---NGRGIPTD 86 (803) Q Consensus 28 P~mYiG~t~~~~Gl~hlv~EiidNsvDe~~~g~~~~I~V~i~~dgs--isV~d---nGrGIPv~ 86 (803) -||||- .++ .| -+.|++.|-.. ..|++.+-.||+ .-+-| ||=||||+ T Consensus 122 rGlyis-~~d-~g---~i~~~l~nwp~-------~~v~~IVVTDGerILGlGDlG~~GmgI~iG 173 (570) T PTZ00317 122 HGLYLS-MQE-KG---NIRQIMKNLRK-------ENVDVIVITDGSRILGLGDLGANGIGISIG 173 (570) T ss_pred CEEEEE-CCC-HH---HHHHHHHHCCC-------CCCEEEEEECCCCEEECCCCCCCCCCCHHH T ss_conf 647960-066-66---89999973784-------670499994572112026666785440088 No 156 >KOG0180 consensus Probab=24.52 E-value=27 Score=13.67 Aligned_cols=15 Identities=40% Similarity=0.567 Sum_probs=6.2 Q ss_pred HHHHCCHH--HHHHHHH Q ss_conf 42100089--9999999 Q gi|254780814|r 183 SVQDFNYD--TLQHRLR 197 (803) Q Consensus 183 ~~~~~~~d--~l~~R~~ 197 (803) |.+.|..| +|.+|++ T Consensus 52 GL~glatDvqtl~~~~~ 68 (204) T KOG0180 52 GLTGLATDVQTLLERLR 68 (204) T ss_pred ECCCCCHHHHHHHHHHH T ss_conf 04223230899999999 No 157 >COG2831 FhaC Hemolysin activation/secretion protein [Intracellular trafficking and secretion] Probab=24.44 E-value=38 Score=12.54 Aligned_cols=11 Identities=55% Similarity=0.779 Sum_probs=4.5 Q ss_pred CCCEEEEEEEC Q ss_conf 75203778632 Q gi|254780814|r 121 HGVGVSVVNAL 131 (803) Q Consensus 121 hGvG~s~vNal 131 (803) -|++.++-|.+ T Consensus 242 ~~~~l~~dn~l 252 (554) T COG2831 242 LGVGLSLDNPL 252 (554) T ss_pred EEEEEEECCCC T ss_conf 67689726976 No 158 >pfam01525 Rota_NS26 Rotavirus NS26. Gene 11 product is a non-structural phosphoprotein designated as NS26. Probab=24.34 E-value=25 Score=13.88 Aligned_cols=27 Identities=30% Similarity=0.396 Sum_probs=11.2 Q ss_pred CCCCCCCCCCCCCCCCEEEEECCCCCCCH Q ss_conf 55754322334654430233005620334 Q gi|254780814|r 432 GDSAGGSAKQGRSRENQAILPLRGKILNV 460 (803) Q Consensus 432 GDSAggsak~gRdr~~qai~PLrGKilNv 460 (803) -||-+|-.|.---++ --|.||-++-|| T Consensus 178 ~Dsdd~~CkNC~Yk~--KYf~LR~rMK~V 204 (212) T pfam01525 178 EDIDGCTCKNCPYKE--KYFKLRKRMKNV 204 (212) T ss_pred CCCCCCEECCCCHHH--HHHHHHHHHHHH T ss_conf 557887243881189--999999999999 No 159 >cd03361 TOPRIM_TopoIA_RevGyr TopoIA_RevGyr : The topoisomerase-primase (TORPIM) domain found in members of the type IA family of DNA topoisomerases (Topo IA) similar to the ATP-dependent reverse gyrase found in archaea and thermophilic bacteria. Type IA DNA topoisomerases remove (relax) negative supercoils in the DNA by: cleaving one strand of the DNA duplex, covalently linking to the 5' phosphoryl end of the DNA break and, allowing the other strand of the duplex to pass through the gap. Reverse gyrase is also able to insert positive supercoils in the presence of ATP and negative supercoils in the presence of AMPPNP. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). For topoisomerases the conserved glutamate is believed to act as a general base in strand joining and, as a general acid in strand cleavage. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=24.23 E-value=38 Score=12.51 Aligned_cols=25 Identities=24% Similarity=0.373 Sum_probs=20.5 Q ss_pred CCCEEEEECCCCCCCCHHHHHHHHH Q ss_conf 3738998508888532899999999 Q gi|254780814|r 497 YHKIIIMTDADVDGAHIRTLLLTFF 521 (803) Q Consensus 497 Y~kiiimtDaD~DG~HI~~Llltff 521 (803) -..|+|.||.|--|-.|.=-++.++ T Consensus 119 ad~v~lATD~DREGEaIa~hi~~~l 143 (170) T cd03361 119 VDEVLIATDPDTEGEKIAWDVYLAL 143 (170) T ss_pred CCEEEECCCCCCCHHHHHHHHHHHH T ss_conf 8979985799822229999999995 No 160 >KOG1676 consensus Probab=23.69 E-value=28 Score=13.59 Aligned_cols=40 Identities=20% Similarity=0.319 Sum_probs=22.3 Q ss_pred EEEEEEEECCEEEEEEEECCEECCCCEEECCCCCCCCEEEEEEECHH Q ss_conf 49999997895999998898364631387327788870899987788 Q gi|254780814|r 134 WLKLRIKREGNIYEMSFINGILDNPLVVTGSAGNDTGTEVTFLPSSD 180 (803) Q Consensus 134 ~~~v~v~rdgk~~~q~f~~g~~~~~l~~~~~~~~~~GT~I~F~PD~~ 180 (803) .++|.|=|.. +=..-=+.|.. |++-...+||+|-|+||.. T Consensus 231 ~~~V~VPr~~-VG~IIGkgGE~------IKklq~etG~KIQfkpDd~ 270 (600) T KOG1676 231 TREVKVPRSK-VGIIIGKGGEM------IKKLQNETGAKIQFKPDDD 270 (600) T ss_pred EEEEECCCCC-EEEEEECCCHH------HHHHHHCCCCEEEEECCCC T ss_conf 3688545652-34687047167------8887633574268623888 No 161 >COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Probab=23.16 E-value=40 Score=12.36 Aligned_cols=68 Identities=25% Similarity=0.376 Sum_probs=33.6 Q ss_pred CCCCCCCCCCCCEEEEECCCCCCCHHHCCHHHHHCC------HHHHHHHHHHCCCCCCCCCCCCCCC-CCCEEEEECCCC Q ss_conf 432233465443023300562033443058766323------6678888762578775332123567-373899850888 Q gi|254780814|r 436 GGSAKQGRSRENQAILPLRGKILNVERARFDKMLSS------QEIGTLITALGTGIGQDSFDINKLR-YHKIIIMTDADV 508 (803) Q Consensus 436 ggsak~gRdr~~qai~PLrGKilNv~~a~~~~i~~n------~Ei~~l~~~lG~~~g~~~~~~~~lr-Y~kiiimtDaD~ 508 (803) |-++.+|+ ..+|+.|.|+.--.....--... ..-.+.++|.--| ..-| -+.|| ||.++|.---+ T Consensus 291 ~~~~~~G~-----l~~PV~G~il~rFG~~~~gg~~wkG~vi~a~~Ga~V~A~AdG--~Vvy-A~~l~GYG~vvIldhG~- 361 (420) T COG4942 291 GFGALRGQ-----LAWPVTGRILRRFGQADGGGLRWKGMVIGASAGATVKAIADG--RVVY-ADWLRGYGLVVILDHGG- 361 (420) T ss_pred CCCCCCCC-----CCCCCCCHHHHHHCCCCCCCCCCCCEEEECCCCCEEEECCCC--EEEE-CHHHCCCCEEEEEECCC- T ss_conf 32002577-----688878707877345678876106758735999825620695--6995-43325675699997488- Q ss_pred CCCCH Q ss_conf 85328 Q gi|254780814|r 509 DGAHI 513 (803) Q Consensus 509 DG~HI 513 (803) |+|+ T Consensus 362 -gy~s 365 (420) T COG4942 362 -GYHS 365 (420) T ss_pred -CCEE T ss_conf -5378 No 162 >PTZ00260 glycosyl transferase group 2; Provisional Probab=22.95 E-value=40 Score=12.33 Aligned_cols=21 Identities=14% Similarity=0.264 Sum_probs=14.5 Q ss_pred CEEEEEEECCCCCCCCCCCCC Q ss_conf 217999825575432233465 Q gi|254780814|r 424 KSELFLVEGDSAGGSAKQGRS 444 (803) Q Consensus 424 ~~eL~ivEGDSAggsak~gRd 444 (803) .+|+++|.--|--+++.-+++ T Consensus 116 ~yEIIVVDDGStD~T~eVa~~ 136 (336) T PTZ00260 116 IYEIIIINDGSKDKTLKVAKK 136 (336) T ss_pred CEEEEEECCCCCCCHHHHHHH T ss_conf 679999816999886999999 No 163 >cd04237 AAK_NAGS-ABP AAK_NAGS-ABP: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the arginine-biosynthesis pathway (ABP) found in gamma- and beta-proteobacteria and higher plant chloroplasts. Domain architecture of these NAGS consisted of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal NAG synthase, acetyltransferase (ArgA) domain. Both bacterial and plant sequences in this CD have a conserved N-terminal extension; a similar sequence in the NAG kinases of the cyclic arginine-biosynthesis pathway has been implicated in feedback inhibition sensing. Plant sequences also have an N-terminal chloroplast transit peptide and an insert (approx. 70 residues) in the C-terminal region of ArgB. Members of this CD belong to the Amino Acid Kinase Superfamily (AAK). Probab=22.87 E-value=18 Score=15.11 Aligned_cols=63 Identities=29% Similarity=0.524 Sum_probs=37.7 Q ss_pred CCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCC----------CCCCCCEEEEECC----CCCCCCHHHH Q ss_conf 05620334430587663236678888762578775332123----------5673738998508----8885328999 Q gi|254780814|r 453 LRGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDIN----------KLRYHKIIIMTDA----DVDGAHIRTL 516 (803) Q Consensus 453 LrGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~----------~lrY~kiiimtDa----D~DG~HI~~L 516 (803) +=|.+-.|...-+..++++..| -++.-||.+..-+.||+. .|.-.|+++|||. |.+|.=|+.| T Consensus 142 ~vG~v~~Vn~~~l~~ll~~~~i-PVIspig~~~~G~~~NvNAD~~A~~iA~aL~A~kLi~ltdv~Gvl~~~~~~i~~l 218 (280) T cd04237 142 HTGEVRRIDADAIRRQLDQGSI-VLLSPLGYSPTGEVFNLSMEDVATAVAIALKADKLIFLTDGPGLLDDDGELIREL 218 (280) T ss_pred CCCCCCEEEHHHHHHHHHCCCE-EEEECCEECCCCCEEEECHHHHHHHHHHHHHHCEEEEECCCCCEECCCCCCCCCC T ss_conf 7983026649999868758965-8960431889998870089999999999974470998718883278999886504 No 164 >TIGR01497 kdpB K+-transporting ATPase, B subunit; InterPro: IPR006391 These sequences describe the P-type ATPase subunit of the complex responsible for translocating potassium ions across biological membranes in microbes. In Escherichia coli and other species, this complex consists of the proteins KdpA, KdpB, KdpC and KdpF. KdpB is the ATPase subunit, while KdpA is the potassium-ion translocating subunit . The function of KdpC is unclear, although it has been suggested to couple the ATPase subunit to the ion-translocating subunit , while KdpF serves to stabilize the complex . The potassium P-type ATPases have been characterised as Type IA based on a phylogenetic analysis which places this clade closest to the heavy-metal translocating ATPases (Type IB) . Others place this clade closer to the Na+/K+ antiporter type (Type IIC) based on physical characteristics . ; GO: 0008556 potassium-transporting ATPase activity, 0006813 potassium ion transport, 0016021 integral to membrane. Probab=22.64 E-value=41 Score=12.29 Aligned_cols=55 Identities=35% Similarity=0.660 Sum_probs=36.4 Q ss_pred CCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEE---------------EC-CCCCCCCCCCEEECCCCCCEEEEEEECCC Q ss_conf 898399997684453640777885001256425---------------01-13232898604831167520377863264 Q gi|254780814|r 70 MDGSCTVSDNGRGIPTDLHKEEGISAAEVIMTR---------------LH-AGGKFNQKSYKVSGGLHGVGVSVVNALSS 133 (803) Q Consensus 70 ~dgsisV~dnGrGIPv~~h~~~~~~~~E~i~t~---------------L~-ag~kfd~~~yk~sgGlhGvG~s~vNalS~ 133 (803) +-|=|-...+|.=||.|= |+|.|. |. +||-|-. |||| |-++|. T Consensus 125 kkGD~VlVeaGDvIP~DG---------EVi~GvASVDESAITGESAPV~kEsGgDfAS----VTGG--------Tri~SD 183 (675) T TIGR01497 125 KKGDVVLVEAGDVIPADG---------EVIEGVASVDESAITGESAPVLKESGGDFAS----VTGG--------TRIVSD 183 (675) T ss_pred CCCCEEEEECCCEECCCC---------CEECCCCCCCCCCCCCCCCCCCCCCCCCEEE----ECCC--------CEEEEE T ss_conf 578889996383725997---------4764511211221025788721078896546----4388--------458855 Q ss_pred EEEEEEEEC-CEE Q ss_conf 499999978-959 Q gi|254780814|r 134 WLKLRIKRE-GNI 145 (803) Q Consensus 134 ~~~v~v~rd-gk~ 145 (803) ||.|||.-| |+- T Consensus 184 wL~ve~ta~PGe~ 196 (675) T TIGR01497 184 WLVVEVTADPGES 196 (675) T ss_pred EEEEEEEECCCCC T ss_conf 6899998269974 No 165 >pfam08812 YtxC YtxC-like family. This family includes proteins similar to B. subtilis YtxC an uncharacterized protein. Probab=22.18 E-value=41 Score=12.22 Aligned_cols=10 Identities=10% Similarity=0.491 Sum_probs=3.6 Q ss_pred HHHHHHHHHH Q ss_conf 8899999999 Q gi|254780814|r 294 SALTRQITSY 303 (803) Q Consensus 294 ~al~~~i~~y 303 (803) .++-+||.+| T Consensus 102 ~~ve~aIdEy 111 (220) T pfam08812 102 EIVEKAVDEY 111 (220) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 166 >TIGR01311 glycerol_kin glycerol kinase; InterPro: IPR005999 Glycerol kinase (2.7.1.30 from EC) is a bacterial sugar kinase which catalyzes the Mg-ATP-dependent phosphorylation of glycerol to yield glycerol 3-phosphate. The enzyme from Escherichia coli is an allosteric regulatory enzyme whose activity is inhibited by fructose 1,6-bisphosphate (FBP) and the glucose-specific phosphocarrier of the phosphoenolpyruvate:glycose phosphotransferase system, IIA(Glc), structural studies suggest a nucleophilic in-line transfer mechanism for the ATP-dependent phosphorylation of glycerol by glycerol kinase .; GO: 0004370 glycerol kinase activity, 0006072 glycerol-3-phosphate metabolic process. Probab=22.01 E-value=31 Score=13.14 Aligned_cols=50 Identities=22% Similarity=0.378 Sum_probs=25.6 Q ss_pred CEEECCCCCCEEEEEEEC--CCEEEEEEEECCEEEEEEEEC------CEECCCC-EEECCCCCCCC Q ss_conf 048311675203778632--644999999789599999889------8364631-38732778887 Q gi|254780814|r 114 YKVSGGLHGVGVSVVNAL--SSWLKLRIKREGNIYEMSFIN------GILDNPL-VVTGSAGNDTG 170 (803) Q Consensus 114 yk~sgGlhGvG~s~vNal--S~~~~v~v~rdgk~~~q~f~~------g~~~~~l-~~~~~~~~~~G 170 (803) |+.|||-|| |+=||-+- |.-|--. +++|.|-. +.|..=| ++|.++.+-=| T Consensus 175 WnLTGG~nG-~~HvTDvtNASRT~LfN------l~tl~WD~eLL~~f~IP~~iLmPEv~sSse~yG 233 (518) T TIGR01311 175 WNLTGGVNG-KVHVTDVTNASRTMLFN------LHTLEWDDELLELFGIPREILMPEVRSSSEVYG 233 (518) T ss_pred HHHCCCCCC-CEEECCCCCHHHHHHCC------CHHHHHHHHHHHHHCCCHHHCCCCCCCCCHHCC T ss_conf 531589788-65533786146876414------012100089998722647762886587601447 No 167 >smart00462 PTB Phosphotyrosine-binding domain, phosphotyrosine-interaction (PI) domain. PTB/PI domain structure similar to those of pleckstrin homology (PH) and IRS-1-like PTB domains. Probab=21.31 E-value=43 Score=12.10 Aligned_cols=37 Identities=11% Similarity=0.156 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHH------CCCCEEEEEECCCCEEEEEECCCCCC Q ss_conf 675557886860------77975799983898399997684453 Q gi|254780814|r 47 EVLDNAIDESLE------GYADIITVTLNMDGSCTVSDNGRGIP 84 (803) Q Consensus 47 EiidNsvDe~~~------g~~~~I~V~i~~dgsisV~dnGrGIP 84 (803) +++.+|+.+..+ .....+.++|..+ .|.|.|...+-+ T Consensus 23 ~~~~~ai~~i~~~~~~~~~~~~~v~l~Vs~~-gi~v~d~~~~~~ 65 (134) T smart00462 23 QVVQEAIRKLRAAQGSEKKEPQKVILSISSR-GVKLIDEDTKAV 65 (134) T ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEEEECC-EEEEEECCCCEE T ss_conf 8999999999975020578896899999858-899992788806 No 168 >COG1497 Predicted transcriptional regulator [Transcription] Probab=21.23 E-value=22 Score=14.38 Aligned_cols=16 Identities=19% Similarity=0.318 Sum_probs=11.3 Q ss_pred HHHHHHHHHHHCCEEE Q ss_conf 9996307787489599 Q gi|254780814|r 520 FFFRQMPSLIHNGFLY 535 (803) Q Consensus 520 ff~~~~~~li~~g~~y 535 (803) +++..+.+|.+.|--| T Consensus 237 ml~~~~~~l~~~~~eY 252 (260) T COG1497 237 MLRYLLRKLEEEGLEY 252 (260) T ss_pred HHHHHHHHHHHCCCCC T ss_conf 7999998887627852 No 169 >KOG3651 consensus Probab=21.19 E-value=43 Score=12.08 Aligned_cols=28 Identities=14% Similarity=0.323 Sum_probs=20.8 Q ss_pred HHHHHCCEEEEE-CCCEEEEEECCCEEEE Q ss_conf 778748959994-3833788726717998 Q gi|254780814|r 526 PSLIHNGFLYII-KPPLYGITRGKSLQYV 553 (803) Q Consensus 526 ~~li~~g~~yia-~ppl~~~~~~K~E~Yi 553 (803) .++=.+-+-|+| |-|||||..|+-|.-+ T Consensus 253 KEMDDEE~~f~AlqEPLYRVeTGNYEYRl 281 (429) T KOG3651 253 KEMDDEEVEFVALQEPLYRVETGNYEYRL 281 (429) T ss_pred HHCCCHHHCEEEECCCEEEEECCCEEEEH T ss_conf 33262020433406843676238725531 No 170 >PRK10162 acetyl esterase; Provisional Probab=21.18 E-value=35 Score=12.77 Aligned_cols=19 Identities=37% Similarity=0.372 Sum_probs=11.9 Q ss_pred EEEECCCCCCC-----CCCCCCCC Q ss_conf 99825575432-----23346544 Q gi|254780814|r 428 FLVEGDSAGGS-----AKQGRSRE 446 (803) Q Consensus 428 ~ivEGDSAggs-----ak~gRdr~ 446 (803) +.+=||||||. +-..||+. T Consensus 159 I~v~GDSAGG~LA~a~al~lrd~~ 182 (319) T PRK10162 159 IGFAGDSAGAMLALASALWLRDKQ 182 (319) T ss_pred CEEEECCCHHHHHHHHHHHHHHCC T ss_conf 456634812689999999866446 No 171 >TIGR02257 cobalto_cobN cobaltochelatase, CobN subunit; InterPro: IPR011953 This family of proteins catalyzes the insertion of cobalt into the corrin ring of hydrogenobyrinic acid a,c-diamide. This aerobic branch of corrin ring synthesis is part of the adenosylcobalamin biosynthetic pathway .. Probab=21.11 E-value=43 Score=12.07 Aligned_cols=16 Identities=31% Similarity=0.683 Sum_probs=8.9 Q ss_pred CCCCCCCCCCCEEEEE Q ss_conf 3212356737389985 Q gi|254780814|r 489 SFDINKLRYHKIIIMT 504 (803) Q Consensus 489 ~~~~~~lrY~kiiimt 504 (803) .|-+..+|||+|++.= T Consensus 513 ~f~~~G~rfG~i~V~i 528 (1310) T TIGR02257 513 GFPIAGLRFGNIFVLI 528 (1310) T ss_pred CCCCHHHHHCEEEEEE T ss_conf 4642022101010251 No 172 >pfam08304 consensus Probab=21.08 E-value=27 Score=13.70 Aligned_cols=29 Identities=21% Similarity=0.218 Sum_probs=14.3 Q ss_pred EEEECCCCCCCHHHCCHHHHHCCHHHHHH Q ss_conf 23300562033443058766323667888 Q gi|254780814|r 449 AILPLRGKILNVERARFDKMLSSQEIGTL 477 (803) Q Consensus 449 ai~PLrGKilNv~~a~~~~i~~n~Ei~~l 477 (803) ||.-||-..|--+..+-..|+.-+.+.+| T Consensus 341 GII~lR~~FL~~~gl~G~ei~~~e~~~el 369 (378) T pfam08304 341 GIIKLRNLFLQDYGMSGLEILNHDKLKEL 369 (378) T ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 72678999999759957777425666666 No 173 >pfam11243 DUF3045 Protein of unknown function (DUF3045). Members in this family of proteins are annotated as gene protein 30.1. Currently no function is known. Probab=21.01 E-value=43 Score=12.05 Aligned_cols=37 Identities=30% Similarity=0.436 Sum_probs=18.3 Q ss_pred HHHHHHCCEEEEECCCEEEEE-ECCC-EEEEECHHHHHHHHH Q ss_conf 077874895999438337887-2671-799814578989998 Q gi|254780814|r 525 MPSLIHNGFLYIIKPPLYGIT-RGKS-LQYVKDEESLEDYLI 564 (803) Q Consensus 525 ~~~li~~g~~yia~ppl~~~~-~~K~-E~YikDD~EL~~~Ll 564 (803) |.+-+++|++||.. |-+. .||. ..|-|.-.||.+-+. T Consensus 40 Fk~CVeqGFiYvse---~~~~g~~kRittYhktL~eLlDE~a 78 (90) T pfam11243 40 FKECVEQGFIYISE---HWMEGLGKRITTYHKSLKELLDELA 78 (90) T ss_pred HHHHHHCCEEEEEH---HHHCCCCEEEEEHHHHHHHHHHHHH T ss_conf 99999705388657---6641577465539878999999998 No 174 >pfam00926 DHBP_synthase 3,4-dihydroxy-2-butanone 4-phosphate synthase. 3,4-Dihydroxy-2-butanone 4-phosphate is biosynthesized from ribulose 5-phosphate and serves as the biosynthetic precursor for the xylene ring of riboflavin. Sometimes found as a bifunctional enzyme with pfam00925. Probab=20.81 E-value=44 Score=12.02 Aligned_cols=10 Identities=10% Similarity=0.368 Sum_probs=5.1 Q ss_pred CEEEEEEEEC Q ss_conf 4169999963 Q gi|254780814|r 204 SSVQISLIDK 213 (803) Q Consensus 204 ~gl~i~l~De 213 (803) .|--|.+.|. T Consensus 10 ~G~~Viv~D~ 19 (193) T pfam00926 10 AGKPVIVVDD 19 (193) T ss_pred CCCEEEEEEC T ss_conf 7995999948 No 175 >COG3013 Uncharacterized conserved protein [Function unknown] Probab=20.80 E-value=33 Score=13.03 Aligned_cols=35 Identities=31% Similarity=0.370 Sum_probs=27.4 Q ss_pred CCEEEEECCCC---CCCHHHCCHHHH-HCCHHHHHHHHH Q ss_conf 43023300562---033443058766-323667888876 Q gi|254780814|r 446 ENQAILPLRGK---ILNVERARFDKM-LSSQEIGTLITA 480 (803) Q Consensus 446 ~~qai~PLrGK---ilNv~~a~~~~i-~~n~Ei~~l~~~ 480 (803) .|-+-+|.--| .||||.|.+.+- ++-+||.+|+.| T Consensus 130 ~fnSq~PM~dKY~rMl~~W~~cprqYhLs~~EI~~IlNA 168 (168) T COG3013 130 GFNSQTPMWDKYQRMLNVWHACPRQYHLSANEINQILNA 168 (168) T ss_pred CCCCCCHHHHHHHHHHHHHHHCCHHCCCCHHHHHHHHCC T ss_conf 876668269999999999981821215659999998639 No 176 >KOG1777 consensus Probab=20.63 E-value=44 Score=12.00 Aligned_cols=22 Identities=23% Similarity=0.350 Sum_probs=16.7 Q ss_pred CEECCCCCHHCCCCCCEECCCC Q ss_conf 1231354200045871543279 Q gi|254780814|r 15 IQILKGLDAVKKRPSMYIGDTD 36 (803) Q Consensus 15 i~~L~gle~vrkRP~mYiG~t~ 36 (803) .+-|.-=-||+|-|..|.|+++ T Consensus 7 ~~Ql~rGvHV~~~~~~~~~~~~ 28 (625) T KOG1777 7 FRQLYRGVHVMKNPEQFVGAAN 28 (625) T ss_pred HHHHHCCCEECCCHHHHHHHHH T ss_conf 9987366234468778520333 No 177 >COG4309 Uncharacterized conserved protein [Function unknown] Probab=20.33 E-value=45 Score=11.95 Aligned_cols=36 Identities=25% Similarity=0.380 Sum_probs=15.4 Q ss_pred CCCCCCCHHHHHHH-HHHHHHHHHHHCCEEEEECCC-EEEEEECC Q ss_conf 88885328999999-999630778748959994383-37887267 Q gi|254780814|r 506 ADVDGAHIRTLLLT-FFFRQMPSLIHNGFLYIIKPP-LYGITRGK 548 (803) Q Consensus 506 aD~DG~HI~~Lllt-ff~~~~~~li~~g~~yia~pp-l~~~~~~K 548 (803) +|+|-.|.+-+|=+ |--.| |-.|.++-| +.||..+| T Consensus 41 sDHdP~pL~~~L~~~~pg~f-------~wey~e~Gp~vwRv~i~r 78 (98) T COG4309 41 SDHDPRPLRYQLSTEFPGKF-------GWEYLENGPEVWRVEIKR 78 (98) T ss_pred CCCCCHHHHHHHHHCCCCCC-------EEEEECCCCEEEEEEEEE T ss_conf 27896789998620587553-------048814798379988541 No 178 >cd04252 AAK_NAGK-fArgBP AAK_NAGK-fArgBP: N-Acetyl-L-glutamate kinase (NAGK) of the fungal arginine-biosynthetic pathway (fArgBP). The nuclear-encoded, mitochondrial polyprotein precursor with an N-terminal NAGK (ArgB) domain (this CD), a central DUF619 domain, and a C-terminal reductase domain (ArgC, N-Acetylglutamate Phosphate Reductase, NAGPR). The precursor is cleaved in the mitochondria into two distinct enzymes (NAGK-DUF619 and NAGPR). Native molecular weights of these proteins indicate that the kinase is an octamer whereas the reductase is a dimer. This CD also includes some gamma-proteobacteria (Xanthomonas and Xylella) NAG kinases with an N-terminal NAGK (ArgB) domain (this CD) and a C-terminal DUF619 domain. The DUF619 domain is described as a putative distant homolog of the acetyltransferase, ArgA, predicted to function in NAG synthase association in fungi. Eukaryotic sequences have an N-terminal mitochondrial transit peptide. Members of this NAG kinase domain CD belong to th Probab=20.31 E-value=31 Score=13.16 Aligned_cols=83 Identities=19% Similarity=0.320 Sum_probs=50.1 Q ss_pred CCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCC----------CCCCCCEEEEECC----CCCCCCHHHHHHH Q ss_conf 5620334430587663236678888762578775332123----------5673738998508----8885328999999 Q gi|254780814|r 454 RGKILNVERARFDKMLSSQEIGTLITALGTGIGQDSFDIN----------KLRYHKIIIMTDA----DVDGAHIRTLLLT 519 (803) Q Consensus 454 rGKilNv~~a~~~~i~~n~Ei~~l~~~lG~~~g~~~~~~~----------~lrY~kiiimtDa----D~DG~HI~~Lllt 519 (803) -|++--|....+..++++..| -++.-||....-+.||++ .|+=.|+|+|||. |.||.-|+.+-+. T Consensus 112 vG~v~~Vd~~~i~~lL~~g~I-pvispig~~~~G~~~NvnaD~~A~~iA~aL~A~kLi~ltdv~Gv~d~~g~~i~~l~~~ 190 (248) T cd04252 112 VGKITGVNKAPIEAAIRAGYL-PILTSLAETPSGQLLNVNADVAAGELARVLEPLKIVFLNETGGLLDGTGKKISAINLD 190 (248) T ss_pred CCCEEEECHHHHHHHHHCCCC-EEEEEEEECCCCCEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCHH T ss_conf 322589829999999858994-4885147889999898799999999999759998999958830179999997427889 Q ss_pred HHHHHHHHHHHCCEEEEECCC Q ss_conf 999630778748959994383 Q gi|254780814|r 520 FFFRQMPSLIHNGFLYIIKPP 540 (803) Q Consensus 520 ff~~~~~~li~~g~~yia~pp 540 (803) +....|+++|.+.-.+-| T Consensus 191 ---~~~~~l~~~~~i~~gm~~ 208 (248) T cd04252 191 ---EEYDDLMKQPWVKYGTKL 208 (248) T ss_pred ---HHHHHHHHCCCCCCCCHH T ss_conf ---999999877997786364 No 179 >pfam06317 Arena_RNA_pol Arenavirus RNA polymerase. This family consists of several Arenavirus RNA polymerase proteins (EC:2.7.7.48). Probab=20.28 E-value=45 Score=11.94 Aligned_cols=56 Identities=23% Similarity=0.296 Sum_probs=34.2 Q ss_pred EEEEEEEECCEECCCC--EEECCCC----------CCCCEEEEEEECHHHHHHHHCCHHHHHHHHHHH Q ss_conf 5999998898364631--3873277----------888708999877888542100089999999973 Q gi|254780814|r 144 NIYEMSFINGILDNPL--VVTGSAG----------NDTGTEVTFLPSSDIFSVQDFNYDTLQHRLREL 199 (803) Q Consensus 144 k~~~q~f~~g~~~~~l--~~~~~~~----------~~~GT~I~F~PD~~iF~~~~~~~d~l~~R~~~~ 199 (803) ..|.|+|..+....-| .++|..+ ..-|+.++|+.||++|=..-|+.++|.+.+.++ T Consensus 530 Ecy~Q~f~~~~~~~~LlYQKTGE~SRCYsi~~~~~g~~~~~~SFYaDPKRfFLPIfS~~Vl~~mc~eM 597 (2206) T pfam06317 530 ECYVQRFLLEGLEGLLLYQKTGERSRCYSINLSDKGVVGELCSFYADPKRFFLPIFSDEVLQEMCDEM 597 (2206) T ss_pred EEEEEEEECCCCEEEEEEECCCCCCCCEEEECCCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHH T ss_conf 88887740588259999971587665247751688851026678538510155213299999999999 No 180 >PRK05823 consensus Probab=20.18 E-value=45 Score=11.93 Aligned_cols=13 Identities=8% Similarity=0.261 Sum_probs=5.3 Q ss_pred EECCCCCHHCCCC Q ss_conf 2313542000458 Q gi|254780814|r 16 QILKGLDAVKKRP 28 (803) Q Consensus 16 ~~L~gle~vrkRP 28 (803) .|....=|||-=| T Consensus 26 ~V~~s~GHl~dL~ 38 (691) T PRK05823 26 EIVATVGHIRELV 38 (691) T ss_pred EEEECCCCHHHCC T ss_conf 9997234086578 No 181 >pfam12535 Nudix_N Hydrolase of X-linked nucleoside diphosphate N terminal. This family of proteins is found in eukaryotes. Proteins in this family are typically between 847 and 5344 amino acids in length. These enzymes hydrolyse the molecular motif of a nucleoside diphosphate linked to some other moiety, X. Probab=20.13 E-value=26 Score=13.81 Aligned_cols=16 Identities=31% Similarity=0.657 Sum_probs=7.4 Q ss_pred CCCCCCCCCCCCCCCCCC Q ss_conf 578775332123567373 Q gi|254780814|r 482 GTGIGQDSFDINKLRYHK 499 (803) Q Consensus 482 G~~~g~~~~~~~~lrY~k 499 (803) |+-.++|.||.+ ||.+ T Consensus 18 GL~YskD~yD~E--Rye~ 33 (58) T pfam12535 18 GLTYSKDVYDRE--RYEE 33 (58) T ss_pred CCCCCCCHHHHH--HHHH T ss_conf 501078763699--9999 No 182 >pfam11692 DUF3289 Protein of unknown function (DUF3289). This family of proteins with unknown function appears to be restricted to Proteobacteria. Probab=20.11 E-value=45 Score=11.92 Aligned_cols=21 Identities=24% Similarity=0.249 Sum_probs=8.4 Q ss_pred CCCCCCCCCEEEEECCCCCCC Q ss_conf 123567373899850888853 Q gi|254780814|r 491 DINKLRYHKIIIMTDADVDGA 511 (803) Q Consensus 491 ~~~~lrY~kiiimtDaD~DG~ 511 (803) ++.+-+|.-++=-.=||+=|- T Consensus 211 ~i~~~~~~A~v~y~~QDHFGL 231 (277) T pfam11692 211 QIYNDSYSAIVHYKGQDHFGL 231 (277) T ss_pred EEECCEEEEEEEEEECCCCCC T ss_conf 997788899999982254688 No 183 >pfam08876 DUF1836 Domain of unknown function (DUF1836). This family of proteins are functionally uncharacterized. Probab=20.05 E-value=32 Score=13.10 Aligned_cols=37 Identities=22% Similarity=0.376 Sum_probs=25.2 Q ss_pred CCCEEEEEECCCCCCCCCCEEHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 5412122224621666772103568889999999998624200 Q gi|254780814|r 269 YHENVLCFTNNIPQKDGGTHLSGLRSALTRQITSYVEKSFGSK 311 (803) Q Consensus 269 ~~e~i~SfvN~I~T~~GGtHv~gf~~al~~~i~~y~~~~~~~K 311 (803) |-++|.+|+|..-.+-+.++++ ...||+|++..-+.+ T Consensus 12 YmdQVi~~~n~~l~~~~~~~lT------~tMInNYvK~~li~~ 48 (102) T pfam08876 12 YMDQVIQLVNDKLEPLGDKILT------KTMINNYVKRGLIPP 48 (102) T ss_pred HHHHHHHHHHHHHCCCCCCCCC------HHHHHHHHCCCCCCC T ss_conf 7999999999871346766252------999999881377789 Done!