Query gi|254780222|ref|YP_003064635.1| DNA topoisomerase IV subunit B [Candidatus Liberibacter asiaticus str. psy62] Match_columns 686 No_of_seqs 238 out of 3153 Neff 6.9 Searched_HMMs 39220 Date Mon May 23 19:59:25 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780222.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01059 gyrB DNA gyrase, B s 100.0 0 0 1996.3 46.0 632 44-685 1-817 (818) 2 PRK05644 gyrB DNA gyrase subun 100.0 0 0 1757.7 55.7 633 38-685 5-724 (725) 3 PRK05559 DNA topoisomerase IV 100.0 0 0 1726.4 53.1 624 39-682 3-632 (633) 4 COG0187 GyrB Type IIA topoisom 100.0 0 0 1707.2 53.8 629 41-686 3-635 (635) 5 PTZ00109 DNA gyrase subunit b; 100.0 0 0 1652.4 52.9 631 42-680 54-939 (941) 6 TIGR01058 parE_Gpos DNA topois 100.0 0 0 1600.1 40.5 619 41-680 8-641 (655) 7 smart00433 TOP2c Topoisomerase 100.0 0 0 1568.7 52.0 591 73-679 1-594 (594) 8 PTZ00108 DNA topoisomerase II; 100.0 0 0 1411.2 41.2 627 36-682 3-722 (1506) 9 TIGR01055 parE_Gneg DNA topois 100.0 0 0 1400.7 39.4 634 42-678 2-640 (647) 10 KOG0355 consensus 100.0 0 0 1094.0 28.3 597 47-679 6-637 (842) 11 cd03366 TOPRIM_TopoIIA_GyrB TO 100.0 0 0 386.4 11.1 114 464-577 1-114 (114) 12 cd01030 TOPRIM_TopoIIA_like TO 100.0 0 0 380.9 11.0 114 464-577 1-115 (115) 13 cd03365 TOPRIM_TopoIIA TOPRIM_ 100.0 0 0 363.7 9.3 114 464-577 1-120 (120) 14 pfam00204 DNA_gyraseB DNA gyra 100.0 5.4E-43 0 314.4 12.5 171 268-438 1-173 (173) 15 cd00822 TopoII_Trans_DNA_gyras 100.0 1.4E-42 0 311.6 12.2 169 267-438 1-172 (172) 16 cd03481 TopoIIA_Trans_ScTopoII 100.0 5.9E-32 1.5E-36 238.6 10.7 147 269-426 3-151 (153) 17 pfam00986 DNA_gyraseB_C DNA gy 99.8 5.7E-21 1.5E-25 163.2 6.8 64 608-675 2-65 (65) 18 PRK04184 DNA topoisomerase VI 99.5 3.5E-12 9E-17 102.9 19.4 315 50-394 12-440 (533) 19 TIGR01052 top6b DNA topoisomer 99.3 5.3E-11 1.3E-15 94.8 10.5 163 71-248 26-216 (662) 20 COG1389 DNA topoisomerase VI, 99.2 1.4E-10 3.5E-15 91.9 9.2 317 49-394 12-445 (538) 21 PRK00095 mutL DNA mismatch rep 99.0 3.3E-08 8.5E-13 75.6 14.9 221 72-326 21-260 (612) 22 COG0323 MutL DNA mismatch repa 98.5 4.4E-06 1.1E-10 61.0 12.5 160 73-250 23-195 (638) 23 TIGR00585 mutl DNA mismatch re 98.4 5.1E-06 1.3E-10 60.6 9.7 189 72-279 21-237 (367) 24 cd00329 TopoII_MutL_Trans MutL 98.3 9.2E-07 2.3E-11 65.7 5.3 74 299-387 26-106 (107) 25 cd00075 HATPase_c Histidine ki 98.2 5.9E-06 1.5E-10 60.2 6.9 80 74-169 1-83 (103) 26 smart00387 HATPase_c Histidine 98.2 3.3E-06 8.5E-11 61.9 5.2 88 70-173 2-92 (111) 27 pfam02518 HATPase_c Histidine 98.1 4.7E-06 1.2E-10 60.9 4.9 88 70-173 2-92 (111) 28 PRK13560 hypothetical protein; 97.6 0.00022 5.5E-09 49.4 6.9 15 99-113 218-232 (807) 29 COG2205 KdpD Osmosensitive K+ 97.6 7.5E-05 1.9E-09 52.6 4.5 89 50-143 68-213 (890) 30 PRK10337 sensor protein QseC; 97.6 0.00011 2.8E-09 51.5 5.2 12 379-390 237-248 (446) 31 COG0326 HtpG Molecular chapero 97.5 0.00035 9E-09 48.0 6.8 157 78-249 32-212 (623) 32 KOG1978 consensus 97.3 0.002 5E-08 42.8 8.7 233 71-347 18-269 (672) 33 PRK05218 heat shock protein 90 97.3 0.0017 4.3E-08 43.3 7.8 155 78-248 30-210 (612) 34 PRK10755 sensor protein BasS/P 97.3 0.00055 1.4E-08 46.6 5.1 78 73-167 249-330 (355) 35 PRK10815 sensor protein PhoQ; 97.2 0.001 2.6E-08 44.8 5.8 35 592-630 419-453 (484) 36 COG3920 Signal transduction hi 97.1 0.003 7.8E-08 41.6 7.3 94 64-187 113-216 (221) 37 PTZ00272 heat shock protein 83 97.0 0.0043 1.1E-07 40.5 7.6 157 78-249 30-209 (701) 38 COG3850 NarQ Signal transducti 96.9 0.002 5.1E-08 42.8 5.4 36 463-501 298-340 (574) 39 PRK10490 sensor protein KdpD; 96.9 0.0017 4.2E-08 43.4 4.9 41 47-87 67-137 (895) 40 PTZ00130 heat shock protein 90 96.9 0.0081 2.1E-07 38.7 8.4 156 78-249 101-282 (824) 41 KOG1979 consensus 96.8 0.0074 1.9E-07 38.9 7.6 153 74-247 28-196 (694) 42 PRK10604 sensor protein RstB; 96.8 0.0029 7.3E-08 41.7 5.2 13 592-604 361-373 (433) 43 PRK03660 anti-sigma F factor; 96.7 0.016 4E-07 36.6 8.5 90 74-183 40-135 (146) 44 PRK09470 cpxA two-component se 96.7 0.0026 6.7E-08 42.0 4.4 11 653-663 398-408 (461) 45 COG3290 CitA Signal transducti 96.6 0.0044 1.1E-07 40.5 5.1 19 316-334 293-311 (537) 46 PRK09303 adaptive-response sen 96.6 0.0064 1.6E-07 39.3 5.8 10 550-559 270-279 (378) 47 PRK10549 signal transduction h 96.5 0.0028 7E-08 41.9 3.7 11 592-602 397-407 (467) 48 TIGR02938 nifL_nitrog nitrogen 96.5 0.0024 6.2E-08 42.2 3.3 50 71-121 386-441 (496) 49 PRK11100 sensory histidine kin 96.5 0.0041 1.1E-07 40.6 4.3 13 592-604 412-424 (475) 50 PRK09467 envZ osmolarity senso 96.5 0.0062 1.6E-07 39.5 5.1 13 592-604 375-387 (437) 51 PRK10600 nitrate/nitrite senso 96.3 0.0099 2.5E-07 38.1 5.4 13 610-622 536-548 (569) 52 cd00188 TOPRIM Topoisomerase-p 96.3 0.011 2.7E-07 37.8 5.5 74 464-562 1-74 (83) 53 PRK11360 sensory histidine kin 96.1 0.0072 1.8E-07 39.0 4.0 28 547-574 495-522 (607) 54 PRK11644 sensory histidine kin 96.1 0.016 4E-07 36.6 5.5 43 73-118 412-457 (497) 55 PRK09835 sensor kinase CusS; P 96.1 0.035 9E-07 34.3 7.3 11 337-347 243-253 (482) 56 pfam01751 Toprim Toprim domain 96.0 0.0074 1.9E-07 38.9 3.6 78 465-565 2-82 (89) 57 COG4585 Signal transduction hi 95.8 0.019 5E-07 36.0 5.1 48 71-121 277-327 (365) 58 PRK10364 sensor protein ZraS; 95.8 0.011 2.9E-07 37.6 3.9 23 224-247 74-96 (455) 59 COG5000 NtrY Signal transducti 95.4 0.03 7.6E-07 34.8 4.8 41 641-681 640-689 (712) 60 KOG1977 consensus 95.1 0.014 3.7E-07 36.9 2.5 172 59-250 12-196 (1142) 61 COG4191 Signal transduction hi 95.1 0.037 9.5E-07 34.1 4.5 45 501-546 495-539 (603) 62 KOG0787 consensus 94.8 0.049 1.2E-06 33.3 4.4 21 549-569 259-279 (414) 63 COG2172 RsbW Anti-sigma regula 94.8 0.26 6.6E-06 28.3 8.1 93 73-186 40-141 (146) 64 PRK04069 serine-protein kinase 94.8 0.25 6.3E-06 28.4 8.0 94 74-183 43-140 (158) 65 COG0642 BaeS Signal transducti 94.2 0.16 4E-06 29.8 6.0 47 71-120 226-275 (336) 66 PRK11107 hybrid sensory histid 93.7 0.57 1.5E-05 26.0 8.3 92 62-169 399-496 (920) 67 TIGR02966 phoR_proteo phosphat 93.6 0.19 5E-06 29.2 5.4 100 71-184 229-339 (339) 68 PRK11466 hybrid sensory histid 93.4 0.11 2.9E-06 30.7 4.0 101 62-184 552-660 (912) 69 TIGR01386 cztS_silS_copS heavy 93.4 0.16 4E-06 29.8 4.7 92 69-173 369-470 (483) 70 PRK13557 histidine kinase; Pro 92.9 0.13 3.2E-06 30.4 3.6 44 65-113 13-59 (538) 71 PRK11091 aerobic respiration c 92.8 0.15 3.9E-06 29.9 3.9 107 61-184 388-503 (779) 72 COG3851 UhpB Signal transducti 92.7 0.2 5.1E-06 29.1 4.4 18 506-523 369-386 (497) 73 PRK10841 hybrid sensory kinase 92.5 0.051 1.3E-06 33.2 1.2 38 513-551 722-759 (947) 74 PRK10547 chemotaxis protein Ch 92.5 0.14 3.5E-06 30.2 3.4 19 229-247 346-365 (662) 75 PRK13837 two-component VirA-li 92.5 0.18 4.7E-06 29.3 4.0 13 548-560 561-573 (831) 76 cd01027 TOPRIM_RNase_M5_like T 90.7 1.2 3E-05 23.8 6.5 69 464-560 2-70 (81) 77 PRK09959 hybrid sensory histid 88.1 0.43 1.1E-05 26.8 2.7 17 52-68 195-212 (1197) 78 COG3852 NtrB Signal transducti 87.8 2.1 5.3E-05 22.1 6.7 16 570-585 281-296 (363) 79 COG4564 Signal transduction hi 87.5 1.1 2.8E-05 24.0 4.5 30 319-348 277-308 (459) 80 cd01028 TOPRIM_TopoIA TOPRIM_T 85.8 1.6 4.1E-05 22.9 4.6 50 534-601 90-139 (142) 81 TIGR02956 TMAO_torS TMAO reduc 85.7 0.85 2.2E-05 24.8 3.2 106 70-190 619-741 (1052) 82 COG4192 Signal transduction hi 84.8 1.6 4.1E-05 22.8 4.2 55 533-594 544-606 (673) 83 COG0643 CheA Chemotaxis protei 83.8 1.3 3.4E-05 23.4 3.4 19 508-526 511-529 (716) 84 smart00493 TOPRIM topoisomeras 80.5 4.2 0.00011 20.0 5.2 70 465-559 2-71 (76) 85 cd01026 TOPRIM_OLD TOPRIM_OLD: 79.7 3.2 8.2E-05 20.8 4.2 59 465-550 5-70 (97) 86 COG4251 Bacteriophytochrome (l 79.2 2.1 5.3E-05 22.1 3.1 19 156-182 263-281 (750) 87 TIGR01286 nifK nitrogenase mol 78.1 1.5 3.9E-05 23.1 2.1 65 267-346 125-207 (526) 88 COG1658 Small primase-like pro 74.9 3.3 8.4E-05 20.7 3.1 102 464-599 10-115 (127) 89 COG2865 Predicted transcriptio 72.2 7 0.00018 18.5 4.4 13 620-632 414-426 (467) 90 COG3275 LytS Putative regulato 72.1 3.9 9.9E-05 20.2 2.9 57 510-574 420-484 (557) 91 TIGR02916 PEP_his_kin putative 66.5 8.6 0.00022 17.9 3.7 16 156-171 331-346 (696) 92 KOG0019 consensus 65.8 9.4 0.00024 17.6 4.3 155 77-249 61-238 (656) 93 LOAD_Toprim consensus 65.2 9.6 0.00025 17.5 5.3 70 465-560 2-71 (98) 94 cd03363 TOPRIM_TopoIA_TopoI TO 60.2 8.3 0.00021 18.0 2.7 25 535-559 73-97 (123) 95 TIGR01925 spIIAB anti-sigma F 58.4 7.3 0.00019 18.3 2.2 79 76-176 42-127 (137) 96 KOG0020 consensus 58.3 13 0.00032 16.7 6.8 155 78-238 100-275 (785) 97 COG2972 Predicted signal trans 57.1 13 0.00034 16.6 4.9 48 529-583 325-379 (456) 98 PRK04031 DNA primase; Provisio 55.9 14 0.00035 16.5 5.9 124 401-582 121-246 (420) 99 PRK04017 hypothetical protein; 53.4 15 0.00038 16.2 3.3 28 535-562 65-92 (132) 100 COG2516 Biotin synthase-relate 51.8 13 0.00034 16.6 2.6 112 408-526 134-258 (339) 101 TIGR02108 PQQ_syn_pqqB coenzym 49.0 8.3 0.00021 18.0 1.2 24 120-145 69-93 (314) 102 TIGR01924 rsbW_low_gc anti-sig 48.9 18 0.00045 15.7 3.7 83 78-178 48-137 (161) 103 TIGR00032 argG argininosuccina 47.1 1.9 4.9E-05 22.3 -2.3 117 209-350 138-271 (420) 104 pfam09239 Topo-VIb_trans Topoi 46.3 19 0.00049 15.5 9.9 78 299-388 41-124 (160) 105 cd00823 TopoIIB_Trans TopoIIB_ 44.9 20 0.00052 15.3 9.7 79 299-389 32-116 (151) 106 TIGR01369 CPSaseII_lrg carbamo 44.6 19 0.00047 15.6 2.4 66 57-129 164-247 (1089) 107 KOG1332 consensus 41.7 23 0.00058 15.0 5.7 112 89-212 17-160 (299) 108 PRK05645 lipid A biosynthesis 41.1 18 0.00047 15.6 2.0 36 533-570 179-221 (295) 109 TIGR02343 chap_CCT_epsi T-comp 40.1 16 0.00042 15.9 1.6 37 82-120 35-76 (541) 110 pfam07994 NAD_binding_5 Myo-in 38.7 13 0.00034 16.6 0.9 47 315-375 186-232 (389) 111 KOG2731 consensus 36.8 19 0.0005 15.4 1.5 20 524-543 263-282 (378) 112 KOG1777 consensus 35.9 20 0.00051 15.3 1.5 31 47-77 5-35 (625) 113 PRK06009 flgD flagellar basal 34.8 29 0.00073 14.3 2.8 23 97-119 107-129 (135) 114 pfam06882 DUF1263 Protein of u 33.6 21 0.00053 15.2 1.3 19 54-72 55-73 (108) 115 PRK05157 pyrroloquinoline quin 33.1 19 0.00048 15.5 1.0 11 223-233 11-21 (242) 116 KOG1845 consensus 32.9 16 0.00041 16.0 0.6 16 77-92 150-165 (775) 117 cd03362 TOPRIM_TopoIA_TopoIII 31.6 32 0.00082 13.9 2.5 24 536-559 100-123 (151) 118 KOG1413 consensus 30.8 33 0.00083 13.9 1.9 32 205-238 134-165 (411) 119 PRK05776 DNA topoisomerase III 30.3 34 0.00086 13.8 2.4 12 591-602 479-490 (675) 120 pfam11871 DUF3391 Domain of un 29.8 22 0.00056 15.1 0.9 11 60-70 10-20 (127) 121 PRK00885 phosphoribosylamine-- 29.7 35 0.00088 13.7 4.3 15 487-501 370-384 (424) 122 PRK11558 hypothetical protein; 29.6 35 0.00088 13.7 3.0 57 60-120 26-83 (97) 123 PRK05279 N-acetylglutamate syn 29.3 15 0.00038 16.2 -0.1 19 228-246 43-62 (441) 124 TIGR02604 Piru_Ver_Nterm putat 29.0 32 0.00081 14.0 1.6 15 203-217 197-211 (459) 125 pfam08874 DUF1835 Domain of un 29.0 21 0.00054 15.2 0.7 27 468-494 3-35 (115) 126 PTZ00306 NADH-dependent fumara 28.9 33 0.00084 13.9 1.6 50 595-644 1077-1135(1167) 127 KOG2333 consensus 28.8 28 0.00072 14.3 1.3 28 555-582 543-572 (614) 128 COG0550 TopA Topoisomerase IA 28.8 16 0.0004 16.1 -0.0 13 591-603 452-464 (570) 129 TIGR00396 leuS_bact leucyl-tRN 28.5 19 0.00048 15.6 0.3 12 486-497 854-865 (916) 130 pfam11626 Rap1_C Rap1 - C term 27.8 26 0.00067 14.5 1.0 17 111-131 21-37 (68) 131 KOG4518 consensus 27.2 38 0.00097 13.4 3.3 39 62-100 68-106 (264) 132 KOG3938 consensus 27.1 38 0.00097 13.4 3.2 29 90-118 123-152 (334) 133 cd00223 TOPRIM_TopoIIB_SPO TOP 27.0 28 0.00072 14.3 1.0 10 269-278 39-48 (160) 134 COG4942 Membrane-bound metallo 26.9 36 0.00093 13.6 1.6 83 486-582 297-390 (420) 135 TIGR01497 kdpB K+-transporting 26.8 39 0.00099 13.4 1.7 62 103-181 125-198 (675) 136 pfam10340 DUF2424 Protein of u 26.6 31 0.0008 14.0 1.2 25 314-343 123-147 (374) 137 PTZ00233 variable surface prot 25.8 25 0.00064 14.7 0.6 30 535-582 431-460 (509) 138 cd01213 tensin Tensin Phosphot 25.0 42 0.0011 13.2 4.7 51 64-116 11-64 (138) 139 COG0137 ArgG Argininosuccinate 25.0 11 0.00027 17.2 -1.5 46 302-349 204-253 (403) 140 TIGR01992 PTS-IIBC-Tre PTS sys 24.6 30 0.00077 14.1 0.8 14 235-248 21-34 (489) 141 PRK07198 hypothetical protein; 23.8 41 0.0011 13.2 1.4 76 316-405 174-255 (418) 142 TIGR02442 Cob-chelat-sub cobal 23.7 23 0.0006 14.9 0.1 27 222-248 275-302 (688) 143 cd03361 TOPRIM_TopoIA_RevGyr T 22.0 48 0.0012 12.8 2.6 26 535-560 119-144 (170) 144 TIGR00022 TIGR00022 conserved 22.0 37 0.00094 13.5 0.8 11 73-83 14-24 (156) 145 PRK08294 phenol 2-monooxygenas 21.8 23 0.00058 15.0 -0.3 17 389-405 353-369 (634) 146 PRK11597 heat shock chaperone 20.7 50 0.0013 12.6 2.0 67 299-368 44-116 (142) 147 KOG1185 consensus 20.5 51 0.0013 12.6 1.4 14 464-477 449-462 (571) 148 KOG2672 consensus 20.5 51 0.0013 12.6 2.4 30 558-587 310-340 (360) 149 COG5257 GCD11 Translation init 20.3 45 0.0011 12.9 1.0 32 357-390 104-135 (415) 150 KOG0956 consensus 20.1 47 0.0012 12.8 1.0 42 94-140 60-103 (900) No 1 >TIGR01059 gyrB DNA gyrase, B subunit; InterPro: IPR011557 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . Topoisomerase II (called gyrase in bacteria) primarily introduces negative supercoils into DNA. In bacteria, topoisomerase II consists of two polypeptide subunits, gyrA and gyrB, which form a heterotetramer: (BA)2. In most eukaryotes, topoisomerase II consists of a single polypeptide, where the N- and C-terminal regions correspond to gyrB and gyrA, respectively. This entry represents the B subunit (gyrB) as found predominantly in bacteria, but does not include the topoisomerase II enzymes composed of a single polypeptide, as are found in most eukaryotes. GyrB has two functional domains: an N-terminal ATPase and a C-terminal responsible for subunit interactions, the latter differing between subunit B and single polypeptide topoisomerase II . More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0005694 chromosome. Probab=100.00 E-value=0 Score=1996.34 Aligned_cols=632 Identities=46% Similarity=0.740 Sum_probs=602.2 Q ss_pred CCHHHEEECCCCHHHHCCCCEEECCCCC-CCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCCCCCCC Q ss_conf 6975602311210343269815467775-502202443666678777618997699998289839999669843444176 Q gi|254780222|r 44 YDASSIRILEGLEPVRMRPGMYIGGTDE-KALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNGFLTVVDNGRGIPVENHP 122 (686) Q Consensus 44 Y~~~~i~~L~glE~VrkRP~mYIGst~~-~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg~IsV~ddGrGIPv~~h~ 122 (686) |+|+|||||+||||||||||||||||++ .|||||+|||||||||||+||||++|.|+|++||||+|.|||||||||+|| T Consensus 1 Y~A~sIkVLeGLEAVRKRPGMYIGdTg~G~GLHHlvyEVVDNaIDEAlAGyC~~I~v~i~~dgsv~V~DnGRGIPVdIHp 80 (818) T TIGR01059 1 YDASSIKVLEGLEAVRKRPGMYIGDTGEGTGLHHLVYEVVDNAIDEALAGYCDTINVTINDDGSVTVEDNGRGIPVDIHP 80 (818) T ss_pred CCCCCEEEEECCCCCCCCCCCEECCCCCCCCCEEEEEEEECCHHHHHHCCCCCEEEEEEECCCEEEEEECCCCCCCCCCC T ss_conf 98330043314777537888301124878972389999843404420066124148999628769998588588743067 Q ss_pred CCCC------CEEEEEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHEEEEEEEEEECCCEEEEEEEECCCEECCCEEC Q ss_conf 6578------7012345452002477777741563144538898762032126999941883899997279451131121 Q gi|254780222|r 123 KFPN------KSTLEIILTTLHAGGKFDNSTYEISGGLHGVGISVVNALSDELHVTVARQNTIFSQKFSRGIPLSPLEKK 196 (686) Q Consensus 123 k~~~------~~~~Elif~~L~aggnfdd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~gk~y~q~f~~g~~~~~~~~i 196 (686) + +| +|++|||||.||||||||+++|||||||||||+|||||||+||+|+|+|+||.|+|+|++|+|..||.++ T Consensus 81 ~-eGakqGRe~SA~EvVlT~LHAGGKFD~~sYKVSGGLHGVGvSVVNALS~~L~v~v~RdGk~y~q~F~~G~P~~~L~~~ 159 (818) T TIGR01059 81 E-EGAKQGREISAVEVVLTVLHAGGKFDKDSYKVSGGLHGVGVSVVNALSEWLEVTVKRDGKIYRQEFERGIPVGPLEVV 159 (818) T ss_pred C-CCCCCCCCCCCEEEEEEEECCCEEECCCCEEEECCEEECCEEEEEECCCEEEEEEEECCEEEEEEECCCCEECCEEEE T ss_conf 6-878788856600012200057500158804786756503279996117289999998692999998487220530681 Q ss_pred CCCCCCCCEEEEEEECHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEECCCCHHHHHCC Q ss_conf 67677885399999842432001222289999988887530577359995164532224554112343158612221002 Q gi|254780222|r 197 EKVRNKRGTRITFRPDPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAEKYGIPEKEEFYFPGGLKTYLQTK 276 (686) Q Consensus 197 ~~~~~k~GT~ItF~PD~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~~~~~~~~~~f~~~~Gi~dyv~~~ 276 (686) +.+.+++||+|+|+||++||.++++||+++|.+|||++||||+||+|.|.|+|. ...+..+|||+|||++||+|+ T Consensus 160 G~~~~~~GT~V~F~PD~~iF~~~~efd~~~l~~RLrELAFLN~Gv~I~l~D~R~-----~~~k~~~F~YeGGI~~fV~YL 234 (818) T TIGR01059 160 GEETKKTGTTVRFVPDPEIFEETTEFDFDILAKRLRELAFLNSGVKISLEDERL-----GKVKKVEFHYEGGIKSFVKYL 234 (818) T ss_pred CCCCCCCCCEEEEEECHHHHCCCCEECHHHHHHHHHHHHHHCCCCEEEEEECCC-----CCCCEEEEEECCCHHHHHHHH T ss_conf 346566676789960713214430332788999998888718972699987325-----675203787356189999995 Q ss_pred CCCCCCCCCCCEE----EEEEEC-CCEEEEEEEEEEECCCCCC-EEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCC Q ss_conf 4677666775203----333214-8015589998541467774-278883000655752413046789999998763018 Q gi|254780222|r 277 LENRSLISSEIFT----GKTEKK-GTHRGTVEWAIAWCEENPE-ITSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQ 350 (686) Q Consensus 277 ~~~~~~~~~~~~~----~~~~~~-~~~~~~veval~~~~~~~~-~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~ 350 (686) +.+|++++++|++ ..+..+ ......|||||||++++.+ +.||||+|+|.+||||+.||++||||+||.|++..+ T Consensus 235 n~~K~~l~~~ii~ektt~~g~~~~~~~~i~VEvALQwnd~Y~e~i~sF~NNI~T~eGGTHl~GFr~aLTr~~n~y~~~~~ 314 (818) T TIGR01059 235 NRNKEPLHEEIIYEKTTISGEKEDDDGGIEVEVALQWNDGYSENILSFVNNINTREGGTHLEGFRSALTRVINSYAKKNK 314 (818) T ss_pred CCCCCCCCCCCCCCCCEEEEEECCCCCEEEEEEEEEEECCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 37882247886423431577751677607999998762685510378856755678861177886578999999997368 Q ss_pred --C-CCCCCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf --8-3322233222236426999977634560764466202523200001210158899977401356889999999999 Q gi|254780222|r 351 --N-KRAISIISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKLLEWVIERSE 427 (686) Q Consensus 351 --~-kk~~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~~~a~ 427 (686) | +.+.+++++||||||+|||||+|||||||||||+||+|.+|+..|+.++++.|..||++||.+|+.|++|++.||+ T Consensus 315 l~Ki~~~~~l~GeD~REGLtAViSvKvP~PqFEGQTK~KLGNsEVr~~V~~~v~e~l~~f~eeNP~~Ak~Iv~K~~~AA~ 394 (818) T TIGR01059 315 LLKIESKPNLTGEDIREGLTAVISVKVPDPQFEGQTKTKLGNSEVRSIVESLVYEKLTEFFEENPTEAKAIVEKAIEAAR 394 (818) T ss_pred CCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 75324455676577102238999981487897883223464641677788887999999975290689999999999999 Q ss_pred HHHHHHHHHHH-HHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCC-CHHHCCCCCCEEEEEECCEECCHHHHHHH Q ss_conf 65454433444-22110024565333567765655403799732767642-10010112220122112313144541087 Q gi|254780222|r 428 ERLRRRKQREV-NRKTAIRKLRLPGKLADCSQNIAHGTELFIVEGDSAGG-SAKQARNRSNQAVLPLRGKILNVASAGSE 505 (686) Q Consensus 428 ~r~~~~k~k~~-~rk~~~~~~~lpgKL~Da~~~~~~~~eL~lvEGDSA~g-sAk~grdr~~qgi~PLRGKiLNv~~as~~ 505 (686) ||+|||||||+ +||+++...+|||||+||++|||.+||||||||||||| ||||||||.||||||||||||||+||+.+ T Consensus 395 AReAArkARELtRRKSaLd~~~LPGKLADC~~kDP~~sElYiVEGDSAGGcSAKqGRDR~fQAILPLrGKILNVEKaR~d 474 (818) T TIGR01059 395 AREAARKARELTRRKSALDSGGLPGKLADCSSKDPSKSELYIVEGDSAGGCSAKQGRDRKFQAILPLRGKILNVEKARLD 474 (818) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEECEECCCHH T ss_conf 98875664430016674232458863025788881231266873452777654446786543423324552100012045 Q ss_pred HHHCCHHHHHHHHHHCCCCCCC-CCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEE Q ss_conf 7632288999998825665433-333465525189997798771468899999999870888633938993099699974 Q gi|254780222|r 506 KIRNNQQIMDLVQALGCRTRSQ-YREEDLRYEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISPPLFRITQ 584 (686) Q Consensus 506 ki~~N~Ei~~l~~~lGl~~g~~-~~~~~LRYgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv~~ 584 (686) ||++|+||++||+|||||+|++ ||+++|||+||||||||||||||||||||||||||||+|||+|||||||||||||++ T Consensus 475 K~l~N~Ei~~~ItALG~GI~~d~Fdl~KLRYHKIIIMTDADVDGSHIRTLLLTFFYR~M~~LIE~GyvYIAqPPLYKvk~ 554 (818) T TIGR01059 475 KILSNDEIQAIITALGCGIGKDEFDLEKLRYHKIIIMTDADVDGSHIRTLLLTFFYRYMRPLIENGYVYIAQPPLYKVKK 554 (818) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHC T ss_conf 53212533888997368867667872324644368840678783689999999887401588745872650786021204 Q ss_pred CCEEEEECCHHHHHHHH--------------------------------------------------------------- Q ss_conf 89799937899999999--------------------------------------------------------------- Q gi|254780222|r 585 GTKSVYAHDEEHKQKIL--------------------------------------------------------------- 601 (686) Q Consensus 585 g~~~~y~~~e~e~~~~~--------------------------------------------------------------- 601 (686) ||+..|.++|.|++.++ T Consensus 555 gk~~~Yikdd~e~d~yll~~~~~~~~l~~~~~~L~~~~Le~~~~~~~~~~k~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 634 (818) T TIGR01059 555 GKKERYIKDDKELDHYLLNLGINEAELVITKRELVGELLEDLVNAYLELEKRKNLINRLERKAIRFSEELLIYQDLLEKE 634 (818) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCHH T ss_conf 65511222778998898875324202653478999999999999999886314479999874043557888755530222 Q ss_pred ----------------------------HH-------------------------------------------------- Q ss_conf ----------------------------98-------------------------------------------------- Q gi|254780222|r 602 ----------------------------EE-------------------------------------------------- 603 (686) Q Consensus 602 ----------------------------~~-------------------------------------------------- 603 (686) .. T Consensus 635 ~~~~~~~~~~~~~~~~y~~~~~E~~~~~~~~~~~~~~~~~~f~~~vvR~G~~~~~~L~~~q~~~~e~~~~~~~~~~~~~~ 714 (818) T TIGR01059 635 LLEYESAVKYWLQGALYIYSDKEKEEAKTQLPVHKNAEKELFESSVVRHGLDTDYILEKEQLELFEYGEIISLELSLQGL 714 (818) T ss_pred HHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHH T ss_conf 20014688889999998875565788887422356778875412114437763200166664332345434421103455 Q ss_pred -------------------------HHHCCCCCEEEEEEECCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHH Q ss_conf -------------------------7741788607868506778898899886039400079999866410267989999 Q gi|254780222|r 604 -------------------------FKKHGKGKIEISRFKGLGEMLASQLKETAMDANKRTLLRVEINKDLNSLKDTQDS 658 (686) Q Consensus 604 -------------------------~~~~~~~k~~i~ryKGLGe~~~~e~~et~~~p~~r~l~~v~~~~~~~~~~~~~~~ 658 (686) +.+.++.+++||||||||||||+|||||||||+.|+|+||+++| +.+||.+ T Consensus 715 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~~iQRYKGLGEMna~QLWETTMdP~~R~L~~V~~~D----A~~AD~~ 790 (818) T TIGR01059 715 FETRAKLIRKNKKFEINSLEEALDKLVELGRKGIEIQRYKGLGEMNADQLWETTMDPESRTLLKVTIED----AVEADRI 790 (818) T ss_pred HHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHCCCCCCCEEEEEEHHH----HHHHHHH T ss_conf 443432100122310025899999998407898604543166667822203342884644487745799----9886543 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 998807981689999997524677517 Q gi|254780222|r 659 INKLMGTKADERFKFIQERASFIDDVG 685 (686) Q Consensus 659 f~~lmG~~~~~RK~wi~~~~~~~~~ld 685 (686) |++||||+|+|||+||++||..|.||| T Consensus 791 F~~LMGd~VePRR~FIE~nAl~v~nLD 817 (818) T TIGR01059 791 FSTLMGDEVEPRREFIEANALDVKNLD 817 (818) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 200068986530789999999885248 No 2 >PRK05644 gyrB DNA gyrase subunit B; Validated Probab=100.00 E-value=0 Score=1757.68 Aligned_cols=633 Identities=44% Similarity=0.731 Sum_probs=602.1 Q ss_pred CCCCCCCCHHHEEECCCCHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCC Q ss_conf 76312369756023112103432698154677755022024436666787776189976999982898399996698434 Q gi|254780222|r 38 QSASDNYDASSIRILEGLEPVRMRPGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNGFLTVVDNGRGIP 117 (686) Q Consensus 38 ~~~~~~Y~~~~i~~L~glE~VrkRP~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg~IsV~ddGrGIP 117 (686) .-++++|++++|++|+||||||||||||||||+.+|||||+||||||||||++||||++|+|+|++||+|||+||||||| T Consensus 5 ~~~~~~Y~~~~I~vL~gle~VRkRPgMYiG~t~~~glhhlv~EvvDNsiDEa~aG~~~~I~V~i~~DgsitV~DnGRGIP 84 (725) T PRK05644 5 EMMENEYDASSIKVLEGLEAVRKRPGMYIGDTGGRGLHHLVYEVVDNSIDEALAGYCDEIEVTIHADGSVSVTDNGRGIP 84 (725) T ss_pred HHHHCCCCHHHCEECCCCHHHCCCCCCEECCCCCCCCEEEEEEEEHHHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCC T ss_conf 47566789644845037575505986465669998766889864214787986389967999996898299998997677 Q ss_pred CCCCCCCCCCEEEEEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHEEEEEEEEEECCCEEEEEEEECCCEECCCEECC Q ss_conf 44176657870123454520024777777415631445388987620321269999418838999972794511311216 Q gi|254780222|r 118 VENHPKFPNKSTLEIILTTLHAGGKFDNSTYEISGGLHGVGISVVNALSDELHVTVARQNTIFSQKFSRGIPLSPLEKKE 197 (686) Q Consensus 118 v~~h~k~~~~~~~Elif~~L~aggnfdd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~gk~y~q~f~~g~~~~~~~~i~ 197 (686) ||+||+ +++|+||+|||+||||||||+++|++||||||||||||||||+||+|+|+|+|+.|+|+|++|++..++++++ T Consensus 85 vd~h~~-~~~~~~E~I~t~LhAGgKFd~~~YkvSGGLHGVGasvVNALS~~~~VeV~RdGk~y~q~f~~G~~~~~l~~ig 163 (725) T PRK05644 85 VDIHPK-EGVSAAEVVMTVLHAGGKFDGNSYKVSGGLHGVGVSVVNALSEWLELTVRRDGKIHRQEFERGVPVAPLKVIG 163 (725) T ss_pred CCCCCC-CCCCHHHHEECCCCCCCCCCCCCCEEECCCCCCCCEEEEECCCEEEEEEEECCEEEEEEEECCCCCCCCEECC T ss_conf 634577-7972261300003556689999635316888802032160547089999989999999972897588852467 Q ss_pred CCCCCCCEEEEEEECHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEECCCCHHHHHCCC Q ss_conf 76778853999998424320012222899999888875305773599951645322245541123431586122210024 Q gi|254780222|r 198 KVRNKRGTRITFRPDPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAEKYGIPEKEEFYFPGGLKTYLQTKL 277 (686) Q Consensus 198 ~~~~k~GT~ItF~PD~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~~~~~~~~~~f~~~~Gi~dyv~~~~ 277 (686) ++ +++||+|+|+||++||++ +.|++++|.+|++++|||||||+|.|+|++. ..+..++|||++||++|+++++ T Consensus 164 ~~-~k~GT~ItF~PD~~iF~~-~~fd~d~L~~RlrelA~Ln~GL~I~l~Der~-----~~~~~~~f~y~gGi~dyv~~l~ 236 (725) T PRK05644 164 ET-DKTGTTVRFWPDPEIFTT-TEFDYETLAKRLRELAFLNSGVKITLTDERE-----GEGKEETFHYEGGIKSFVEYLN 236 (725) T ss_pred CC-CCCCEEEEEEECHHHCCC-CCCCHHHHHHHHHHHHHCCCCCEEEEEECCC-----CCCCEEEEEECCCHHHHHHHHH T ss_conf 88-889659999977566177-4408999999999997217984799864577-----8776148971476788898862 Q ss_pred CCCCCCCCCCEEEEEEECCCEEEEEEEEEEECCCCCC-EEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCCC--CCC Q ss_conf 6776667752033332148015589998541467774-2788830006557524130467899999987630188--332 Q gi|254780222|r 278 ENRSLISSEIFTGKTEKKGTHRGTVEWAIAWCEENPE-ITSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQN--KRA 354 (686) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~veval~~~~~~~~-~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~~--kk~ 354 (686) .++.+++++++++..+.+. +.|||||+|++++.+ ++||||+|+|++|||||+||++||+++|++|++..+. +++ T Consensus 237 ~~k~~l~~~~i~~~~e~~~---i~vEvAlqw~d~~~e~i~SFvN~I~T~eGGTHv~Gfr~aLtr~In~ya~~~~l~Kk~~ 313 (725) T PRK05644 237 RNKTPLHPEIIYFEGEKDG---IGVEVAMQWNDSYSENVLCFTNNIPTRDGGTHLAGFRAALTRVINDYARKNGLLKKAK 313 (725) T ss_pred CCCCCCCCCCEEEEEEECC---EEEEEEEEECCCCCCEEEEEECCEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 5787678885589986478---0699999854776631455421435588981889999999999999999837543344 Q ss_pred CCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 22332222364269999776345607644662025232000012101588999774013568899999999996545443 Q gi|254780222|r 355 ISIISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKLLEWVIERSEERLRRRK 434 (686) Q Consensus 355 ~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~~~a~~r~~~~k 434 (686) .+++++|||+||++||||+|+||+|+||||++|+|++++..|+..+++.|..||++||..++.|+++++.++++|+++++ T Consensus 314 ~~itgdDIregL~~vISvki~nPqFegQTKeKLgn~ev~~~V~~~v~e~l~~~l~~np~~ak~Ii~k~~~aa~aR~aa~k 393 (725) T PRK05644 314 VNLTGDDVREGLTAVISVKVPDPQFEGQTKTKLGNSEVRPIVESLVNEALSEFLEENPNEAKIIVGKAIEAARAREAARK 393 (725) T ss_pred CCCCHHHHHHCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 56656777603079999861698747731030158546788999999889999887979899999999999999999997 Q ss_pred HHHH-HHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHH Q ss_conf 3444-221100245653335677656554037997327676421001011222012211231314454108776322889 Q gi|254780222|r 435 QREV-NRKTAIRKLRLPGKLADCSQNIAHGTELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQI 513 (686) Q Consensus 435 ~k~~-~rk~~~~~~~lpgKL~Da~~~~~~~~eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei 513 (686) +|+. +||+.....+|||||+||+++++++|||||||||||+|||||||||+||||||||||||||++|+.+||++|+|| T Consensus 394 are~~rrK~~~~~~~LPGKL~Dc~skd~~~~ELfiVEGDSAgGsAkqgRdr~~QAIlPLRGKiLNv~ka~~~ki~~N~EI 473 (725) T PRK05644 394 ARELTRRKSALDSAGLPGKLADCQSKDPAKSELYIVEGDSAGGSAKQGRDRKFQAILPLRGKILNVEKARFDKILSSEEI 473 (725) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHCCHHH T ss_conf 68765202212357898755465568977767999825788863103356574688641672200200679888618799 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCEEEEECC Q ss_conf 99998825665433333465525189997798771468899999999870888633938993099699974897999378 Q gi|254780222|r 514 MDLVQALGCRTRSQYREEDLRYEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISPPLFRITQGTKSVYAHD 593 (686) Q Consensus 514 ~~l~~~lGl~~g~~~~~~~LRYgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv~~g~~~~y~~~ 593 (686) ++|++|||||+|++||+++||||||||||||||||||||+|||||||||||+||++||||||+||||||++|++.+|+|+ T Consensus 474 ~~li~alG~g~g~~fd~~~LRY~kIiIMTDADvDGsHIrtLLlTfF~r~m~~LIe~G~vYiAqPPLyki~~gk~~~Y~~~ 553 (725) T PRK05644 474 GTLITALGTGIGRDFDIDKLRYHKIIIMTDADVDGAHIRTLLLTFFYRQMPELIERGYVYIAQPPLYKVKKGKQEQYLKD 553 (725) T ss_pred HHHHHHHCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEECCCEEEEECC T ss_conf 99999967788865771127747089997899876899999999999999999975938992596159852981488379 Q ss_pred HHHHHHHHHHHH-------------------------------------------------------------------- Q ss_conf 999999999877-------------------------------------------------------------------- Q gi|254780222|r 594 EEHKQKILEEFK-------------------------------------------------------------------- 605 (686) Q Consensus 594 e~e~~~~~~~~~-------------------------------------------------------------------- 605 (686) |.|+++++.+.. T Consensus 554 d~e~~~~l~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 633 (725) T PRK05644 554 DRELDEYLIELALEGATLHAIAGEALEKLVKEYRAVRKVIDRLSRRYPRALLEALATNAERELFELRVERRTHVYVLDHD 633 (725) T ss_pred HHHHHHHHHHHCCCCCEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCC T ss_conf 99999999973205625520460778999999999999998764304467888875211012232001211011224555 Q ss_pred ---------------HCCCCCEEEEEEECCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHCCCCHHHH Q ss_conf ---------------41788607868506778898899886039400079999866410267989999998807981689 Q gi|254780222|r 606 ---------------KHGKGKIEISRFKGLGEMLASQLKETAMDANKRTLLRVEINKDLNSLKDTQDSINKLMGTKADER 670 (686) Q Consensus 606 ---------------~~~~~k~~i~ryKGLGe~~~~e~~et~~~p~~r~l~~v~~~~~~~~~~~~~~~f~~lmG~~~~~R 670 (686) +.++.+++||||||||||||+|||||||||++|+|+||+++| +..++++|++|||++|++| T Consensus 634 ~~~~~~~~~~~~~~~~~~~k~~~IqRyKGLGEMnp~qLweTTMdP~~R~L~~V~i~d----a~~ad~~f~~LMG~~v~~R 709 (725) T PRK05644 634 EQPVSSFEEALDWLLAEGRKGLTIQRYKGLGEMNPEQLWETTMDPEVRRLLQVTIED----AIAADEIFTTLMGDEVEPR 709 (725) T ss_pred CCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHCCCCCCCEEEEEECHH----HHHHHHHHHHHCCCCCHHH T ss_conf 312478899999999730578802227606766978998636583013599987023----9999999998659995688 Q ss_pred HHHHHHHHHHHHHCC Q ss_conf 999997524677517 Q gi|254780222|r 671 FKFIQERASFIDDVG 685 (686) Q Consensus 671 K~wi~~~~~~~~~ld 685 (686) |+||++||.+|+||| T Consensus 710 r~fI~~na~~v~~lD 724 (725) T PRK05644 710 REFIEENALDVRNLD 724 (725) T ss_pred HHHHHHHHHHHHHCC T ss_conf 999999851303348 No 3 >PRK05559 DNA topoisomerase IV subunit B; Reviewed Probab=100.00 E-value=0 Score=1726.42 Aligned_cols=624 Identities=50% Similarity=0.810 Sum_probs=591.7 Q ss_pred CCCCCCCHHHEEECCCCHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCCC Q ss_conf 63123697560231121034326981546777550220244366667877761899769999828983999966984344 Q gi|254780222|r 39 SASDNYDASSIRILEGLEPVRMRPGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNGFLTVVDNGRGIPV 118 (686) Q Consensus 39 ~~~~~Y~~~~i~~L~glE~VrkRP~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg~IsV~ddGrGIPv 118 (686) .++++|+|++||+|+||||||+|||||||||+.+||||||||||||||||+++|||++|+|+|++||+|||+|||||||| T Consensus 3 ~~~~~Y~~~~i~~L~glE~Vr~RP~mYIGst~~~Gl~hi~~EIldNavDe~~~g~~~~I~V~i~~dgsIsV~dnGrGIPv 82 (633) T PRK05559 3 MMTNNYNADSIEVLEGLEPVRKRPGMYIGSTDTRGLHHLVQEVIDNSVDEALAGHGKKIEVTLHADGSVSVRDNGRGIPV 82 (633) T ss_pred CCCCCCCHHHCEEECCCHHHHCCCCCEECCCCCCCCEEEEEEEEECHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCCC T ss_conf 20057797778660574676449986656899985628887765511427650789769999978984999989965455 Q ss_pred CCCCCCCCCEEEEEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHEEEEEEEEEECCCEEEEEEEECCCEECCCEECCC Q ss_conf 41766578701234545200247777774156314453889876203212699994188389999727945113112167 Q gi|254780222|r 119 ENHPKFPNKSTLEIILTTLHAGGKFDNSTYEISGGLHGVGISVVNALSDELHVTVARQNTIFSQKFSRGIPLSPLEKKEK 198 (686) Q Consensus 119 ~~h~k~~~~~~~Elif~~L~aggnfdd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~gk~y~q~f~~g~~~~~~~~i~~ 198 (686) ++||+ +++|+||||||+||||||||+++||+||||||||||||||||++|+|+|+|+|++|+|+|++||+..+++++++ T Consensus 83 ~~h~~-~~~~~~ElIft~LhaGgkfd~~~ykvSGGlhGvGasvvNalS~~f~Vev~r~gk~y~q~f~~g~~~~~~~~i~~ 161 (633) T PRK05559 83 GIHPE-EGKSGVEVILTKLHAGGKFSNKAYKFSGGLHGVGVSVVNALSERLEVEVKRDGKVYRQRFERGEPVTPLEVVGG 161 (633) T ss_pred CCCCC-CCCCCHHHHEEECCCCCCCCCCCCEEECCCCCCCEEEEEEEECCEEEEEEECCEEEEEEECCCEECCCCCEECC T ss_conf 17565-79761566100204578889985155078776212670873055899999899999999549909888513045 Q ss_pred CCC--CCCEEEEEEECHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEECCCCHHHHHCC Q ss_conf 677--885399999842432001222289999988887530577359995164532224554112343158612221002 Q gi|254780222|r 199 VRN--KRGTRITFRPDPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAEKYGIPEKEEFYFPGGLKTYLQTK 276 (686) Q Consensus 199 ~~~--k~GT~ItF~PD~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~~~~~~~~~~f~~~~Gi~dyv~~~ 276 (686) +.+ .+||+|+|+||++||++ ..|+.+.|.+|++++|||||||+|.|++++.. .++|||++||++|++++ T Consensus 162 ~~~~~~~GT~ItF~PD~~iF~~-~~~~~d~l~~rl~~~A~l~~gl~I~lnder~~--------~~~F~~e~Gi~dyv~~~ 232 (633) T PRK05559 162 TKGKRKTGTRVRFWPDPKIFDS-AKFSPERLKERLRSKAFLCPGLEITLNDERTG--------EETFHYENGLKDYLAEL 232 (633) T ss_pred CCCCCCCCEEEEEEECHHHHCC-CEECHHHHHHHHHHHHHCCCCCEEEEEECCCC--------CEEEECCCCHHHHHHHH T ss_conf 6787888349999988578377-25428999999999863279977999824677--------34466476199999986 Q ss_pred CCCCCCCCCCCEEEEEEECCCEEEEEEEEEEECCCCC-CEEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCC-CCCC Q ss_conf 4677666775203333214801558999854146777-4278883000655752413046789999998763018-8332 Q gi|254780222|r 277 LENRSLISSEIFTGKTEKKGTHRGTVEWAIAWCEENP-EITSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQ-NKRA 354 (686) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~~veval~~~~~~~-~~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~-~kk~ 354 (686) +.++.++++.+|... ..+.||||++|++++. +++||||+|+|++|||||+||++||+++|++|+++.+ .|++ T Consensus 233 ~~~~~~i~~~~~~~~------~~~~veval~~~~~~~~~~~SFVN~I~T~~GGTHv~g~~~al~k~i~~y~~k~~~~Kkk 306 (633) T PRK05559 233 NEGKELLPVEPFEGD------EGEAVEWALAWLEGGSENIESYVNLIPTPDGGTHENGFREGLLRALREFAEKRNLLKRG 306 (633) T ss_pred HCCCCCCCCCEEECC------CCCEEEEEEEECCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 179975786428755------79759999994478884476568870579998767889989999999999982745566 Q ss_pred CCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 22332222364269999776345607644662025232000012101588999774013568899999999996545443 Q gi|254780222|r 355 ISIISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKLLEWVIERSEERLRRRK 434 (686) Q Consensus 355 ~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~~~a~~r~~~~k 434 (686) .+++++|||+||++||||+|+||+|+||||++|+|+++++.|++.+++.|..||++||..++.|+++++.++++|+++++ T Consensus 307 ~~i~~~dIregL~~~Vsv~i~nP~FesQTKekL~s~~~~~~v~~~v~~~l~~~l~~n~~~a~~I~~k~~~~a~~r~~a~k 386 (633) T PRK05559 307 VKLTAEDVREGLAAVLSVKIPEPQFEGQTKEKLGSREARRFVSGVVKDAFDLWLNQNPELAEKLAEKAIKAAQARLRARK 386 (633) T ss_pred CCCCHHHHHCCCEEEEEECCCCCCCCCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 65788898465326866310498406733200259657767778888999999997989999999999999999999972 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHH Q ss_conf 34442211002456533356776565540379973276764210010112220122112313144541087763228899 Q gi|254780222|r 435 QREVNRKTAIRKLRLPGKLADCSQNIAHGTELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIM 514 (686) Q Consensus 435 ~k~~~rk~~~~~~~lpgKL~Da~~~~~~~~eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~ 514 (686) +++..|++..++..|||||+||+++++++|||||||||||+|||+|||||.|||||||||||||||+|+.+||++|+||+ T Consensus 387 ~~~~~rk~~~~~~~lpgKL~Dc~~~~~~~~eL~lvEGDSAgGsak~gRDr~~qailPLRGKiLNv~~a~~~ki~~N~Ei~ 466 (633) T PRK05559 387 KKEVRRKKATKKPALPGKLADCTSQDPERTELFLVEGDSAGGSAKQARDREFQAILPLRGKILNTWEASLDDVLANEEIH 466 (633) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCEEEECCCCCEEEEECCCEECCHHHCCHHHHHHCHHHH T ss_conf 13201345334678986661302389674469998346777633522154626530026622235317888876268999 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECC-EEEEECC Q ss_conf 999882566543333346552518999779877146889999999987088863393899309969997489-7999378 Q gi|254780222|r 515 DLVQALGCRTRSQYREEDLRYEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISPPLFRITQGT-KSVYAHD 593 (686) Q Consensus 515 ~l~~~lGl~~g~~~~~~~LRYgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv~~g~-~~~y~~~ 593 (686) +|++||||++|++||+++||||||||||||||||||||||||||||+|||+||+.||||+|+||||||++|+ +++|+|+ T Consensus 467 ~l~~alG~~~g~~~d~~~LRYgkIiImTDaD~DGsHI~~Llltff~~~~p~Li~~G~vy~~~tPl~kv~~gk~~~~y~~~ 546 (633) T PRK05559 467 DIIVAIGIGPGDSFDLEDLRYGKIIIMTDADVDGAHIATLLLTFFYRHFPPLVEAGHVYIALPPLYRVDVGKGKTFYALD 546 (633) T ss_pred HHHHHHCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCCCEEEECC T ss_conf 99998577878768701067671799977999768899999999998769988659589973988999938943698588 Q ss_pred HHHHHHHHHHHHHC-CCCCEEEEEEECCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHH Q ss_conf 99999999987741-78860786850677889889988603940007999986641026798999999880798168999 Q gi|254780222|r 594 EEHKQKILEEFKKH-GKGKIEISRFKGLGEMLASQLKETAMDANKRTLLRVEINKDLNSLKDTQDSINKLMGTKADERFK 672 (686) Q Consensus 594 e~e~~~~~~~~~~~-~~~k~~i~ryKGLGe~~~~e~~et~~~p~~r~l~~v~~~~~~~~~~~~~~~f~~lmG~~~~~RK~ 672 (686) |+|+++|++++.+. ..+||+||||||||||+|+|||||||||++|+|+||+++| ++.++++|++|||++|++||+ T Consensus 547 e~e~~~~~~~~~~~~~~~k~~i~ryKGLGem~~~ql~et~mdp~~r~l~~v~~~d----~~~~~~~~~~lmg~~~~~Rk~ 622 (633) T PRK05559 547 EEEKEELLKKLGKGGKKGKVEIQRFKGLGEMNPDQLWETTMDPETRRLLRVTIDD----AAETEELVDMLMGKKAEPRRN 622 (633) T ss_pred HHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHCCCCCCEEEEEECCC----HHHHHHHHHHHHCCCCHHHHH T ss_conf 9999999998641157787036876406779989999872582214699986486----889999999982899579999 Q ss_pred HHHHHHHHHH Q ss_conf 9997524677 Q gi|254780222|r 673 FIQERASFID 682 (686) Q Consensus 673 wi~~~~~~~~ 682 (686) ||++||.+|| T Consensus 623 ~i~~~a~~~~ 632 (633) T PRK05559 623 WIEENGDELD 632 (633) T ss_pred HHHHHHHHCC T ss_conf 9996034146 No 4 >COG0187 GyrB Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=1707.18 Aligned_cols=629 Identities=48% Similarity=0.778 Sum_probs=595.5 Q ss_pred CCCCCHHHEEECCCCHHHHCCCCEEECCCCC-CCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCCCC Q ss_conf 1236975602311210343269815467775-502202443666678777618997699998289839999669843444 Q gi|254780222|r 41 SDNYDASSIRILEGLEPVRMRPGMYIGGTDE-KALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNGFLTVVDNGRGIPVE 119 (686) Q Consensus 41 ~~~Y~~~~i~~L~glE~VrkRP~mYIGst~~-~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg~IsV~ddGrGIPv~ 119 (686) .++|++++||||+||||||||||||||||+. +||||||||||||||||++||||+.|.|+||+||+|||.||||||||| T Consensus 3 ~~~Y~a~~I~vL~GLEaVRkRPGMYIGst~~~~GLhHlv~EVvDNsiDEalaG~~~~I~V~l~~d~sisV~DnGRGIPvd 82 (635) T COG0187 3 TNNYDASSIQVLEGLEAVRKRPGMYIGSTGDGRGLHHLVWEVVDNSIDEALAGYADRIDVTLHEDGSISVEDNGRGIPVD 82 (635) T ss_pred CCCCCHHHCEECCCCHHHHCCCCCEECCCCCCCCCEEEEEEEEECCHHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCC T ss_conf 57677767333027477645998553368999965046867631307478638685789999479808998789877654 Q ss_pred CCCCCCCCEEEEEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHEEEEEEEEEECCCEEEEEEEECCCEECCCEECCCC Q ss_conf 17665787012345452002477777741563144538898762032126999941883899997279451131121676 Q gi|254780222|r 120 NHPKFPNKSTLEIILTTLHAGGKFDNSTYEISGGLHGVGISVVNALSDELHVTVARQNTIFSQKFSRGIPLSPLEKKEKV 199 (686) Q Consensus 120 ~h~k~~~~~~~Elif~~L~aggnfdd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~gk~y~q~f~~g~~~~~~~~i~~~ 199 (686) +||+ +++|++|+|||+||||||||+++||+||||||||+|||||||+||+|+|+|+|+.|+|+|++|++..++++++.+ T Consensus 83 iH~~-~~~s~vEvI~T~LHAGGKFd~~~YkvSGGLHGVG~SVVNALS~~l~v~v~r~gk~y~q~f~~G~~~~~l~~ig~~ 161 (635) T COG0187 83 IHPK-EKVSAVEVIFTVLHAGGKFDNDSYKVSGGLHGVGVSVVNALSTWLEVEVKRDGKIYRQRFERGVPVTPLEVIGST 161 (635) T ss_pred CCCC-CCCCCEEEEEEEECCCCCCCCCCCEEECCCCCCCEEEEECCCCEEEEEEEECCEEEEEEEECCCCCCCCEECCCC T ss_conf 2777-898711788876125766699952760677750416776165469999998987999987189857774012567 Q ss_pred -CCCCCEEEEEEECHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEECCCCHHHHHCCCC Q ss_conf -7788539999984243200122228999998888753057735999516453222455411234315861222100246 Q gi|254780222|r 200 -RNKRGTRITFRPDPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAEKYGIPEKEEFYFPGGLKTYLQTKLE 278 (686) Q Consensus 200 -~~k~GT~ItF~PD~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~~~~~~~~~~f~~~~Gi~dyv~~~~~ 278 (686) .+++||+|+|+||+++|++ +.|+++.|.+|+|++||||+||+|.|+|++.... .. .|+|++||.+|+++++. T Consensus 162 ~~~~~GT~V~F~PD~~iF~~-~~f~~~~l~~RlrelA~L~~gl~I~l~d~r~~~~-----~~-~~~ye~Gl~~yv~~l~~ 234 (635) T COG0187 162 DTKKTGTKVRFKPDPEIFGE-TEFDYEILKRRLRELAFLNKGVKITLTDERTGEE-----KK-EFHYEGGLKDYVEYLNK 234 (635) T ss_pred CCCCCCCEEEEECCHHHCCC-CCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCC-----CC-EEECCCHHHHHHHHHHC T ss_conf 77899528999748686277-6667999999999986067997799972467755-----43-24346079999999725 Q ss_pred CCCCCCCCCEEEEEEECCCEEEEEEEEEEECCCCCC-EEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 776667752033332148015589998541467774-2788830006557524130467899999987630188332223 Q gi|254780222|r 279 NRSLISSEIFTGKTEKKGTHRGTVEWAIAWCEENPE-ITSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQNKRAISI 357 (686) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~veval~~~~~~~~-~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~~kk~~~i 357 (686) ++.+++++++.+..+.. ...|||||+|++++.+ ++||||+|+|++||||++||+.+|+++||+|+++.+..|...+ T Consensus 235 ~k~~l~~~~~~~~~~~~---~~~vEvA~q~~d~~~e~~~SFvNnI~T~eGGTH~~Gfr~altr~in~y~~~~~~~k~~~l 311 (635) T COG0187 235 GKTPLHEEIFYFNGEKD---GIAVEVALQWNDGYSENILSFVNNIPTREGGTHEAGFRSALTRAINEYAKKKNLLKEGDL 311 (635) T ss_pred CCCCCCCCCEECCCCCC---CEEEEEEEEEECCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 88645657565136766---637999999806884479986458558998608999999999999999998386653577 Q ss_pred CCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 32222364269999776345607644662025232000012101588999774013568899999999996545443344 Q gi|254780222|r 358 ISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKLLEWVIERSEERLRRRKQRE 437 (686) Q Consensus 358 ~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~~~a~~r~~~~k~k~ 437 (686) +++|||+||+++|||+|+||+|+||||+||+|+++++.|++.+++.|..||++||..|+.|+++++.++++|++++++|+ T Consensus 312 ~g~Diregl~aviSvki~~PqFegQTK~KL~n~e~~~~V~~~v~~~~~~~l~enp~~a~~i~~k~i~aa~aR~aarkare 391 (635) T COG0187 312 TGDDIREGLTAVISVKIPDPQFEGQTKEKLGNSEVRSIVEKLVSEAFSLFLEENPQEAKKIVEKAIAAAKAREAARKARE 391 (635) T ss_pred CHHHHHHCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 77787503389999977998857541033426888999999999999999996929999999999999999999898899 Q ss_pred HHH-HHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHH Q ss_conf 422-1100245653335677656554037997327676421001011222012211231314454108776322889999 Q gi|254780222|r 438 VNR-KTAIRKLRLPGKLADCSQNIAHGTELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDL 516 (686) Q Consensus 438 ~~r-k~~~~~~~lpgKL~Da~~~~~~~~eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l 516 (686) ..| |+.++...|||||+||+++++++|||||||||||||||||||||+||||||||||||||++|+.+||++|+||++| T Consensus 392 ~~R~k~~~~~~~LpGKLadC~skd~~~~ELfiVEGDSAGGSAKqgRdR~~QAILPLRGKiLNVeka~~~kil~N~EI~~i 471 (635) T COG0187 392 LTRRKSALDIPGLPGKLADCTSKDPEKSELFLVEGDSAGGSAKQGRDREFQAILPLRGKILNVEKARLDKILKNEEIQTI 471 (635) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHCCCCCCEEEEECCCCCHHHHHCCHHHHHHHHHHHHH T ss_conf 88651134557899757565568824155999804776332442168665577204673112322357655433889999 Q ss_pred HHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCEEEEECCHHH Q ss_conf 98825665433333465525189997798771468899999999870888633938993099699974897999378999 Q gi|254780222|r 517 VQALGCRTRSQYREEDLRYEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISPPLFRITQGTKSVYAHDEEH 596 (686) Q Consensus 517 ~~~lGl~~g~~~~~~~LRYgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv~~g~~~~y~~~e~e 596 (686) ++|||||++++||+++||||||||||||||||+|||+|||||||||||+||++||||+|+||||||++|++.+|+|+|.| T Consensus 472 i~AlG~g~~~~fd~~~LRY~kIiIMTDADvDGaHIrtLLlTfFyr~m~~LIe~G~vyiA~PPLYkv~~~k~~~Y~~~d~E 551 (635) T COG0187 472 ITALGTGIGKDFDLEKLRYHKIIIMTDADVDGAHIRTLLLTFFYRYMPPLIENGHVYIAQPPLYKVKKGKKTFYAYDDEE 551 (635) T ss_pred HHHHCCCCCCCCCHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEECCCCEEEECCHHH T ss_conf 99958788987785557647279996588772889999999999974999976948997485589975874267279999 Q ss_pred HHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 99999987741788607868506778898899886039400079999866410267989999998807981689999997 Q gi|254780222|r 597 KQKILEEFKKHGKGKIEISRFKGLGEMLASQLKETAMDANKRTLLRVEINKDLNSLKDTQDSINKLMGTKADERFKFIQE 676 (686) Q Consensus 597 ~~~~~~~~~~~~~~k~~i~ryKGLGe~~~~e~~et~~~p~~r~l~~v~~~~~~~~~~~~~~~f~~lmG~~~~~RK~wi~~ 676 (686) +++.++++.+. ++|+||||||||||||+|||||||||++|+|+||+++| +..++++|++|||+++++||+||++ T Consensus 552 ~~~~~~~~~~~--~~~~IqRyKGLGEMnp~QLwETTmdP~~R~L~~V~i~d----a~~ad~~f~~LMGd~ve~Rr~fIe~ 625 (635) T COG0187 552 LEKLLERLGKK--KGYEIQRYKGLGEMNPDQLWETTMDPETRRLLQVTIED----ADEADEIFSTLMGDKVEPRRNFIEE 625 (635) T ss_pred HHHHHHHHCCC--CCCEEEEECCCCCCCHHHHHHHCCCCCCEEEEEEECCC----HHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 99999863426--88405760165778988987753590412589886222----8999999998707785688999998 Q ss_pred HHHHHHHCCC Q ss_conf 5246775179 Q gi|254780222|r 677 RASFIDDVGT 686 (686) Q Consensus 677 ~~~~~~~ld~ 686 (686) ||.++.++|- T Consensus 626 na~~~~~~di 635 (635) T COG0187 626 NALFVENLDI 635 (635) T ss_pred HHHHHHHCCC T ss_conf 7533332369 No 5 >PTZ00109 DNA gyrase subunit b; Provisional Probab=100.00 E-value=0 Score=1652.36 Aligned_cols=631 Identities=37% Similarity=0.564 Sum_probs=570.9 Q ss_pred CCCCHHHEEECCCCHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCCCCCC Q ss_conf 23697560231121034326981546777550220244366667877761899769999828983999966984344417 Q gi|254780222|r 42 DNYDASSIRILEGLEPVRMRPGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNGFLTVVDNGRGIPVENH 121 (686) Q Consensus 42 ~~Y~~~~i~~L~glE~VrkRP~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg~IsV~ddGrGIPv~~h 121 (686) ++|+|++|+|||||||||||||||||||+.+|||||+||||||||||+++|||++|+|+||+||||||.|||||||||+| T Consensus 54 n~Y~A~dI~VLeGLEaVRKRPGMYIG~Td~~GlHHlv~EIvDNSvDEalaG~c~~I~V~l~~DgSvtV~DnGRGIPvD~h 133 (941) T PTZ00109 54 NDYDAKDIVILEGLEAVRKRPGMYIGNTDVKGLHQILFEIIDNSVDEYNNFECNEIKIVIHKDDSVTIEDNGRGIPCDVH 133 (941) T ss_pred CCCCCCCCEEECCCHHHCCCCCCEECCCCCCCCCCEEEEEHHCHHHHHHCCCCCEEEEEEECCCCEEEEECCCCCCCCCC T ss_conf 88771012361285321369974456899986321221001256879861899668999947997899889976788878 Q ss_pred CCCCCCEEEEEEEEEEECCCCCCCC------------------------------------------------------- Q ss_conf 6657870123454520024777777------------------------------------------------------- Q gi|254780222|r 122 PKFPNKSTLEIILTTLHAGGKFDNS------------------------------------------------------- 146 (686) Q Consensus 122 ~k~~~~~~~Elif~~L~aggnfdd~------------------------------------------------------- 146 (686) |+ ++++++|+|||+|||||||||+ T Consensus 134 pk-~~~salEvVlT~LHAGGKF~dd~~~~~~d~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~g~~~~~~~ 212 (941) T PTZ00109 134 EK-TKKSALETVLTVLHSGAKFFDDEADAEIDGGMNNGGLSDGGEEDNRDAIGEHKGEHKGEKKAKANVAERGAEKLDEK 212 (941) T ss_pred CC-CCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 88-89850288885415575657764432122333444444443322222222233221223222111223344332223 Q ss_pred -----------------CEEEECCCCCHHHHHHHHEEEEEEEEEECCCEEEEEEEECCCEECCCEECCCCCCCCCEEEEE Q ss_conf -----------------415631445388987620321269999418838999972794511311216767788539999 Q gi|254780222|r 147 -----------------TYEISGGLHGVGISVVNALSDELHVTVARQNTIFSQKFSRGIPLSPLEKKEKVRNKRGTRITF 209 (686) Q Consensus 147 -----------------~yk~sgGlnGvGaklvNalS~~f~Vev~r~gk~y~q~f~~g~~~~~~~~i~~~~~k~GT~ItF 209 (686) .|++||||||||+|||||||+||+|+|+|+|++|.|+|++|.++.++.+.++...++||+|+| T Consensus 213 ~~~~~~~~~a~~k~~~~~Yk~SGGLHGVGvSVVNALS~~l~v~V~Rdgk~y~~~f~~G~~~~~L~v~~~~~~k~GT~I~F 292 (941) T PTZ00109 213 KSPQKTPKEAPQKEPAQKYKFSSGLHGVGLSVANALSSFMKVKVFRNGKIHSIELEKGKVTKELSIRDCPINKRGTQIHY 292 (941) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCEEEEEEEECCEEEEEEECCCEECCCCEECCCCCCCCCCEEEE T ss_conf 33333433333455687622466678666686703657589999999999999968990888535407888999757999 Q ss_pred EECHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCC----------------------------CCCCEE Q ss_conf 9842432001222289999988887530577359995164532224----------------------------554112 Q gi|254780222|r 210 RPDPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAEKY----------------------------GIPEKE 261 (686) Q Consensus 210 ~PD~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~~~----------------------------~~~~~~ 261 (686) |||++||++.+.|++++|..|++++|||||||+|.|.|+|...... ..+..+ T Consensus 293 ~PD~~IF~~~~~f~~~~l~~RlrelAfLn~gL~I~~~Der~~~~~~~~~~~~~k~~~~~~~~~~~~e~~~~~~~~fy~~e 372 (941) T PTZ00109 293 KPDPSIFKNSIKHNADLIKNRIHQLAYLNDKLSFYFYDERAAKKAAAEAEKAAKAADIADLADHAGEIADHNNLDFYNYE 372 (941) T ss_pred ECCHHHHCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 77868858876568999999999970267887799997067654541002454432210000123343333444434331 Q ss_pred EEEECCCCHHHHHCCCCCCCCCC---CCCEEEEEEECCCEEEEEEEEEEECCC-CC-CEEEEEEECCCCCCCEEECCHHH Q ss_conf 34315861222100246776667---752033332148015589998541467-77-42788830006557524130467 Q gi|254780222|r 262 EFYFPGGLKTYLQTKLENRSLIS---SEIFTGKTEKKGTHRGTVEWAIAWCEE-NP-EITSYCNTILTDEGGTHESGLRV 336 (686) Q Consensus 262 ~f~~~~Gi~dyv~~~~~~~~~~~---~~~~~~~~~~~~~~~~~veval~~~~~-~~-~~~SFVN~I~T~~GGTHv~g~~~ 336 (686) +|+|++||.+|+++++.++..+. ..++......+ ...+|||++|+++ +. .+.||||+|+|++||||++||+. T Consensus 373 ~~~~egGl~efv~~l~~~k~~l~~d~~~~i~~~~~~~---~i~ve~al~w~~~~y~e~i~SFvNnI~T~dGGTH~~GFrs 449 (941) T PTZ00109 373 IIKHEGGLDEYIENITKNKANLFKDNDKIISISCHHK---NIHIDLRLKWLQGEYHEHIISFVNNVNTTDGGTHVDALKY 449 (941) T ss_pred CCCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEEEEEC---CEEEEEEEEECCCCCCCEEEEEECCCCCCCCCCHHHHHHH T ss_conf 1126886999998870588745478884279998506---7579999996578875058877436668999737999999 Q ss_pred HHHHHHHHHHHHCC-CCCCCCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 89999998763018-83322233222236426999977634560764466202523200001210158899977401356 Q gi|254780222|r 337 ALTRGIKKYAELTQ-NKRAISIISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEA 415 (686) Q Consensus 337 ai~k~i~~~~~~~~-~kk~~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a 415 (686) ||+++|+.|++... .+++.+++++|||+||+++|||+|++|+|+||||++|+|++++..|+.++.+.|..||++||..+ T Consensus 450 ALtraIn~y~~k~~l~K~~~~i~geDIrEGLtaVISVkv~dPqFeGQTK~KLgn~ev~~~Ve~iv~e~L~~~leenp~~a 529 (941) T PTZ00109 450 AISRCVNFNIKKNEAIKNFVNIPGEYIREGLTAILSIKMNNPEFEGQTKTKLGSHFLKPILESAIFEKLSEIFDFEPNLL 529 (941) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHH T ss_conf 99999999999716355556688889973378999997699771874014436888999999999999999999797999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HH-HHCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCHHHCCCCCCEEEEEEC Q ss_conf 889999999999654544334442-21-1002456533356776565540379973276764210010112220122112 Q gi|254780222|r 416 TKLLEWVIERSEERLRRRKQREVN-RK-TAIRKLRLPGKLADCSQNIAHGTELFIVEGDSAGGSAKQARNRSNQAVLPLR 493 (686) Q Consensus 416 ~~I~~~~~~~a~~r~~~~k~k~~~-rk-~~~~~~~lpgKL~Da~~~~~~~~eL~lvEGDSA~gsAk~grdr~~qgi~PLR 493 (686) +.|+++++.++++|++++++|+.. || .......|||||+||+++++++|||||||||||||||||||||+|||||||| T Consensus 530 k~Ii~ka~~Aa~AreaAkkare~~rrK~~~~~~~~LPGKLaDCsskdpe~sELFIVEGDSAGGSAKQgRDR~fQAILPLR 609 (941) T PTZ00109 530 NSIYFKALQAKASDEEAKAARDLIRSKNNQCSATILPGKLVDCISDDINRNEIFIVEGDSAAGSAKQARNREIQAILPLK 609 (941) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHCEEEEEECCCCCCCHHCCCCCCCEEEEECC T ss_conf 99999999999999999998898863103455678997554666798003548999547777741014575656763127 Q ss_pred CEECCHHHHHHHH-HHCCHHHHHHHHHHCCCCCC----------------------CCCC-------------------- Q ss_conf 3131445410877-63228899999882566543----------------------3333-------------------- Q gi|254780222|r 494 GKILNVASAGSEK-IRNNQQIMDLVQALGCRTRS----------------------QYRE-------------------- 530 (686) Q Consensus 494 GKiLNv~~as~~k-i~~N~Ei~~l~~~lGl~~g~----------------------~~~~-------------------- 530 (686) ||||||++++..+ |++|+||++||+|||||+|. .||+ T Consensus 610 GKILNVEKa~~~kril~N~EI~~LItAlG~Gig~d~~~~~k~~~~~~~~~~~~~~~~~Dl~~~~~~~~~~~~~~~~~~~~ 689 (941) T PTZ00109 610 GKILNVEKIKNNKRIFENSELKSLITAIGLNVNCDNKKIAKGKILINNKKGKDLKKKSDLKNGKFENAQNNNAILNKKKD 689 (941) T ss_pred CCCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 73010444200344440499999999858686755332211233333322233221111112221111112112343223 Q ss_pred ----CCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECC-------------------- Q ss_conf ----46552518999779877146889999999987088863393899309969997489-------------------- Q gi|254780222|r 531 ----EDLRYEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISPPLFRITQGT-------------------- 586 (686) Q Consensus 531 ----~~LRYgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv~~g~-------------------- 586 (686) ++||||||||||||||||+|||+|||||||||||+||+.|||||||||||||++|+ T Consensus 690 ~~~~~~LRY~KIIIMTDADVDGaHIrTLLLTFFyR~m~~LIe~G~VYIAqPPLYkI~~~k~~~~~~~d~~~~~~~~~~~~ 769 (941) T PTZ00109 690 ILFDNPLRYGKIIIMTDADVDGEHIRILLLTFLYRFQKEIIENGNVFVACPPLYKITYNKFFDASIKDIVMKQFNVSTKN 769 (941) T ss_pred CCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEEECCCEEEEECCCHHHHHHHHHHHHHHHHHHCC T ss_conf 34332254052899956887678999999999998749999748699956971586357326655547889877665314 Q ss_pred --EEEEECCHHHHHHHHHHHHH---------------------------------------------------------- Q ss_conf --79993789999999998774---------------------------------------------------------- Q gi|254780222|r 587 --KSVYAHDEEHKQKILEEFKK---------------------------------------------------------- 606 (686) Q Consensus 587 --~~~y~~~e~e~~~~~~~~~~---------------------------------------------------------- 606 (686) ..+|+|++.|++.++..+.+ T Consensus 770 ~~~~~~~~~d~el~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 849 (941) T PTZ00109 770 SKFIIHTYSDDELNNLLGLLDKDKIAHAEHNMANLGRGKPTRGGNLPSDDRGSGLAGIADKNFGSNGADESEKEEELKTD 849 (941) T ss_pred CCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHCCCHHHHHHHCCCCCCCHHHHHHHHHCCC T ss_conf 64267607999999999876565566788777664013321123332000011011110000122320234333220232 Q ss_pred --------------------CCCCCEEEEEEECCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHCCCC Q ss_conf --------------------178860786850677889889988603940007999986641026798999999880798 Q gi|254780222|r 607 --------------------HGKGKIEISRFKGLGEMLASQLKETAMDANKRTLLRVEINKDLNSLKDTQDSINKLMGTK 666 (686) Q Consensus 607 --------------------~~~~k~~i~ryKGLGe~~~~e~~et~~~p~~r~l~~v~~~~~~~~~~~~~~~f~~lmG~~ 666 (686) ....+|+||||||||||||+|||||||||++|+|+||+++| +..++++|++|||++ T Consensus 850 ~~~~~~~~~~~~~~~~~~~~~~~kk~~IQRyKGLGEMnpeQLWETTMdP~~R~LlrV~i~D----a~~A~~if~~LMGd~ 925 (941) T PTZ00109 850 DIFVAEEENADPFAADNAFFNFSKKYEIQRFKGLGEMMADQLWNTTMDPRVRKLIRVTVND----AIRANDLIFSLMGED 925 (941) T ss_pred CHHHHCCCCCCCCHHHHHHHHHCCCCEEECCCCCCCCCHHHHHHHCCCCCCCEEEEEECHH----HHHHHHHHHHHCCCC T ss_conf 0232112334420134567652577476628624667978986627781144499974217----999998999747999 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 16899999975246 Q gi|254780222|r 667 ADERFKFIQERASF 680 (686) Q Consensus 667 ~~~RK~wi~~~~~~ 680 (686) |++||+||++||.- T Consensus 926 VepRr~FI~eNA~~ 939 (941) T PTZ00109 926 SKLRKNFILENSNA 939 (941) T ss_pred CHHHHHHHHHCCCC T ss_conf 42879999972788 No 6 >TIGR01058 parE_Gpos DNA topoisomerase IV, B subunit; InterPro: IPR005740 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . This entry represents subunit B (parE) of topoisomerase IV from Gram positive bacteria. Topoisomerase IV primarily decatenates DNA and relaxes positive supercoils, which is important in bacteria, where the circular chromosome becomes catenated, or linked, during replication . Topoisomerase IV consists of two polypeptide subunits, parC (subunit A), which is homologous to gyrA of topoisomerase II, and parE (subunit B), which is homologous to gyrB of topoisomerase II. More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0005694 chromosome. Probab=100.00 E-value=0 Score=1600.13 Aligned_cols=619 Identities=40% Similarity=0.655 Sum_probs=583.0 Q ss_pred CCCCCHHHEEECCCCHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCCCCC Q ss_conf 12369756023112103432698154677755022024436666787776189976999982898399996698434441 Q gi|254780222|r 41 SDNYDASSIRILEGLEPVRMRPGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNGFLTVVDNGRGIPVEN 120 (686) Q Consensus 41 ~~~Y~~~~i~~L~glE~VrkRP~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg~IsV~ddGrGIPv~~ 120 (686) -.+|++++||+|+|||+||||||||||||++.|||||+||||||||||+++||+++|.|+||+||||||.|+|||||+++ T Consensus 8 ~~~Y~~~aI~iLeGL~AVrKRPGMYIGSTds~GLHHl~wEIvdNsvDEvl~g~a~~I~V~l~~dnsi~V~D~GRG~P~~~ 87 (655) T TIGR01058 8 IFDYNEDAIKILEGLDAVRKRPGMYIGSTDSKGLHHLVWEIVDNSVDEVLAGYADNITVTLHKDNSISVQDDGRGIPTGI 87 (655) T ss_pred CCCCCCHHHHHHHCCHHHCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCEEEEEECCCCCCCCH T ss_conf 00355024677520000015898412256878505567878844688898226974799982787078875785376012 Q ss_pred CCCCCCCEEEEEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHEEEEEEEEEECCCEEEEEEEECC-CEECCCEECCCC Q ss_conf 7665787012345452002477777741563144538898762032126999941883899997279-451131121676 Q gi|254780222|r 121 HPKFPNKSTLEIILTTLHAGGKFDNSTYEISGGLHGVGISVVNALSDELHVTVARQNTIFSQKFSRG-IPLSPLEKKEKV 199 (686) Q Consensus 121 h~k~~~~~~~Elif~~L~aggnfdd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~gk~y~q~f~~g-~~~~~~~~i~~~ 199 (686) |.. .+++.+|+|||.||||||||+.+||+||||||||||||||||+|++|+|.|||++|+|+|.+| .++..+++|+.+ T Consensus 88 h~~-~~isT~e~vlT~LHAGGKFd~~~YKtaGGLHGVGaSVVNALS~wL~v~v~Rdg~iYq~~f~nGGk~~~~lkkIG~t 166 (655) T TIGR01058 88 HKQ-GKISTVETVLTVLHAGGKFDQGGYKTAGGLHGVGASVVNALSSWLEVEVKRDGKIYQQRFENGGKIVQSLKKIGTT 166 (655) T ss_pred HCC-CCCCCCCEEEEEEECCCCCCCCCEEECCCCCCCHHHHHHHCCCCEEEEEEECCHHHHHHHHCCCCEEEEEEECCCC T ss_conf 216-8988510045443077303786501057322303677640226338999878734457874479130001020478 Q ss_pred -CCCCCEEEEEEECHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEECCCCHHHHHCCCC Q ss_conf -7788539999984243200122228999998888753057735999516453222455411234315861222100246 Q gi|254780222|r 200 -RNKRGTRITFRPDPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAEKYGIPEKEEFYFPGGLKTYLQTKLE 278 (686) Q Consensus 200 -~~k~GT~ItF~PD~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~~~~~~~~~~f~~~~Gi~dyv~~~~~ 278 (686) +.++||.|+|.||+.+|. ++.|++++|..||++.|||.++|++.|.|++. .+..+|++++||.+|++++|+ T Consensus 167 pk~ktGT~v~F~pDp~~F~-~~~F~~n~i~eRL~Es~FLlk~lkl~l~D~~~-------~~~~~F~~~~Gl~~F~~~~Ne 238 (655) T TIGR01058 167 PKKKTGTLVHFLPDPTIFK-TTQFNSNIIKERLKESAFLLKKLKLTLTDKRT-------DKKTVFFYENGLVDFVKYINE 238 (655) T ss_pred CCCCCCCEEEECCCCHHHC-CCCCCHHHHHHHHHHHHHHHHCCEEEEEEECC-------CCEEEEEECCCHHHHHHHHCC T ss_conf 8877773477556713330-24567267876766888877147389997026-------721777300447899998535 Q ss_pred CCCCCCCCCEEEEEEECCCEEEEEEEEEEECCCCCC-EEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCC--CCCCC Q ss_conf 776667752033332148015589998541467774-278883000655752413046789999998763018--83322 Q gi|254780222|r 279 NRSLISSEIFTGKTEKKGTHRGTVEWAIAWCEENPE-ITSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQ--NKRAI 355 (686) Q Consensus 279 ~~~~~~~~~~~~~~~~~~~~~~~veval~~~~~~~~-~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~--~kk~~ 355 (686) .+..+.+ +++++++.+. ..|||||||++++.+ ++||||+|.|.+||||++||+.||++++|+||++.+ |.|+. T Consensus 239 ~k~~l~~-v~~f~ge~~~---i~Ve~afQfnd~~~e~ilSF~N~vkT~eGG~HE~g~k~A~t~v~N~YaRK~~LLKEKDK 314 (655) T TIGR01058 239 EKETLSD-VIYFEGEKNG---IEVEVAFQFNDGDSENILSFANSVKTKEGGTHENGFKLAITKVINEYARKVNLLKEKDK 314 (655) T ss_pred CCEECCC-EEEEEECCCC---EEEEEEEEECCCCCCEEEEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 5000123-4588764762---58999997427887537986400340688720788999999998655432001110014 Q ss_pred CCCCCCCCCCEEEEEEEEEEHH--HCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2332222364269999776345--60764466202523200001210158899977401356889999999999654544 Q gi|254780222|r 356 SIISDDLMISAVGILSVFIREP--EFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKLLEWVIERSEERLRRR 433 (686) Q Consensus 356 ~i~~~dIk~~L~~~Isv~I~nP--~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~~~a~~r~~~~ 433 (686) ++.++|||+||.++|++.||+- +||||||.||.||+++..|..+|.++|.-||++|...|..|+++++.|.++|++++ T Consensus 315 Nleg~diReGL~avi~v~~PE~~~~fEGQTK~Klf~~ea~~~V~~~v~~~~~~fLeeN~~~A~~lv~kai~Ar~akeaak 394 (655) T TIGR01058 315 NLEGSDIREGLSAVISVRIPEELIQFEGQTKSKLFSPEAREVVDEIVEDKLSFFLEENKEDAKLLVKKAIKARDAKEAAK 394 (655) T ss_pred CCCCCHHHHHHHHEEEEECCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78710345320211553257545245760576678766899999999888899997435789999999988899999998 Q ss_pred HHHHHHHHHH---CCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHH-HHHHHC Q ss_conf 3344422110---024565333567765655403799732767642100101122201221123131445410-877632 Q gi|254780222|r 434 KQREVNRKTA---IRKLRLPGKLADCSQNIAHGTELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAG-SEKIRN 509 (686) Q Consensus 434 k~k~~~rk~~---~~~~~lpgKL~Da~~~~~~~~eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as-~~ki~~ 509 (686) ++|+..|+.. .++..|.|||++|+|++|.+.|||||||||||||||+||||.||||||||||+|||.||. ...|++ T Consensus 395 ~aRee~k~~kk~kk~~g~l~GKLtPaqsk~p~~nELflVEGDSAGGsAK~GRdr~fQAiLPLRGKVlNv~Ka~n~~d~lk 474 (655) T TIGR01058 395 KAREEKKSGKKRKKEKGILSGKLTPAQSKNPAKNELFLVEGDSAGGSAKLGRDRKFQAILPLRGKVLNVEKAKNLADILK 474 (655) T ss_pred HHHHHHHCCCCCCHHCCCCCCCCCCCCCCCCHHHCEEEEECCCCCCCCCCCCCCCEEEECCCCCCEEHHHHHHHHHHHHC T ss_conf 74566513564201111101113776775611105257851368875333678530342266750103778876888740 Q ss_pred CHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEEC--CE Q ss_conf 2889999988256654333334655251899977987714688999999998708886339389930996999748--97 Q gi|254780222|r 510 NQQIMDLVQALGCRTRSQYREEDLRYEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISPPLFRITQG--TK 587 (686) Q Consensus 510 N~Ei~~l~~~lGl~~g~~~~~~~LRYgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv~~g--~~ 587 (686) ||||..|+.+||.|+|-+|++++|+|+|||||||||-||+||..||||||||||+||||.|+||||.|||||+++| |+ T Consensus 475 NEEI~t~i~~iG~Giga~F~i~d~~Y~KIIIMTDADtDGaHIQ~LLLTfFYRyM~PLiE~G~vYiALPPLYKl~~k~~Kk 554 (655) T TIGR01058 475 NEEINTIIFCIGTGIGADFSIKDLKYDKIIIMTDADTDGAHIQVLLLTFFYRYMRPLIELGHVYIALPPLYKLSKKDGKK 554 (655) T ss_pred CCHHHHHHHHHCCCCCCCCCHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCHHHCCCEEEECCCCCCCCCCCCCE T ss_conf 30488999981665125643654167867996358988318999999988761452204683378368720022799854 Q ss_pred --EEEECCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHCCC Q ss_conf --999378999999999877417886078685067788988998860394000799998664102679899999988079 Q gi|254780222|r 588 --SVYAHDEEHKQKILEEFKKHGKGKIEISRFKGLGEMLASQLKETAMDANKRTLLRVEINKDLNSLKDTQDSINKLMGT 665 (686) Q Consensus 588 --~~y~~~e~e~~~~~~~~~~~~~~k~~i~ryKGLGe~~~~e~~et~~~p~~r~l~~v~~~~~~~~~~~~~~~f~~lmG~ 665 (686) ..|+|+|.|+++...++. ..|+||||||||||||||||||||||++|+|+||+++| ...+++-.+.|||+ T Consensus 555 E~~~YaW~D~EL~~~~k~~G----kn~tlQRYKGLGEMNADQLweTTMnP~~R~Lv~v~i~D----~~~aEr~in~lMG~ 626 (655) T TIGR01058 555 EIVKYAWSDLELEELKKKLG----KNYTLQRYKGLGEMNADQLWETTMNPETRTLVRVKIDD----LARAERQINTLMGD 626 (655) T ss_pred EEEEEECCCHHHHHHHHHHC----CCCEEEEECCCCCCCHHHHCCCCCCCCCCEEEEEEHHH----HHHHCCEEEEEECC T ss_conf 78887127212899999737----98167754377643402310146889864188872678----86213303332178 Q ss_pred CHHHHHHHHHHHHHH Q ss_conf 816899999975246 Q gi|254780222|r 666 KADERFKFIQERASF 680 (686) Q Consensus 666 ~~~~RK~wi~~~~~~ 680 (686) +|++||.||+.|..| T Consensus 627 kv~~Rk~WIe~Ni~F 641 (655) T TIGR01058 627 KVELRKKWIEANIDF 641 (655) T ss_pred CCCHHHHHHHHHCCC T ss_conf 730336746530032 No 7 >smart00433 TOP2c TopoisomeraseII. Eukaryotic DNA topoisomerase II, GyrB, ParE Probab=100.00 E-value=0 Score=1568.72 Aligned_cols=591 Identities=44% Similarity=0.659 Sum_probs=556.3 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCEEEEC Q ss_conf 02202443666678777618997699998289839999669843444176657870123454520024777777415631 Q gi|254780222|r 73 ALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNGFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNSTYEISG 152 (686) Q Consensus 73 GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~yk~sg 152 (686) ||||||||||||||||++||||++|+|+|++||+|||+|||||||||+||+ +++|+||||||+||||||||+++|++|| T Consensus 1 GL~hl~~EIldNsvDe~~~g~~~~I~V~i~~dgsisV~DnGrGIPv~~h~~-~~~~~~E~i~t~LhaGgkFd~~~yk~sG 79 (594) T smart00433 1 GLHHLVDEIVDNAADEALAGYMDTIKVTIDKDNSISVEDNGRGIPVEIHPK-EKKYAPEVIFTVLHAGGKFDDDAYKVSG 79 (594) T ss_pred CCEEEEEEEEECHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCCCCCCCC-CCCCCEEEEEEECCCCCCCCCCCCEEEC T ss_conf 967998456673776884289977999992798499998997654236676-7971201476132243688998516706 Q ss_pred CCCCHHHHHHHHEEEEEEEEEECCCEEEEEEEECCCEECCCEECCCCCCCCCEEEEEEECHHHEECCCCCCHHHHHHHHH Q ss_conf 44538898762032126999941883899997279451131121676778853999998424320012222899999888 Q gi|254780222|r 153 GLHGVGISVVNALSDELHVTVARQNTIFSQKFSRGIPLSPLEKKEKVRNKRGTRITFRPDPKIFGDKAGFDAARLLKMTQ 232 (686) Q Consensus 153 GlnGvGaklvNalS~~f~Vev~r~gk~y~q~f~~g~~~~~~~~i~~~~~k~GT~ItF~PD~~iFg~~~~~d~~~l~~rl~ 232 (686) ||||||||||||||++|+|+|+|+|++|+|+|++||.....+++..+++++||+|+|+||++||++.+.|+++.|.+|++ T Consensus 80 GlhGvGasvvNalS~~f~Vev~r~gk~y~q~f~~~g~~~~~~~~~~~~~k~GT~I~F~PD~~iF~~~~~~~~~~l~~Rl~ 159 (594) T smart00433 80 GLHGVGASVVNALSTEFEVEVARDGKEYKQSFSNNGKPLSEPKIIGDTKKDGTKVTFKPDLEIFGMTTDDDFELLKRRLR 159 (594) T ss_pred CCCCCCEEEEEECCCEEEEEEEECCEEEEEEECCCCEECCCCEECCCCCCCCEEEEEEECHHHHCCCCCCCHHHHHHHHH T ss_conf 78871147658412739999998999999999088848988555478899951999997878839975466999999999 Q ss_pred HHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEECCCCHHHHHCCCCCCCCCCCCCEEEEEEECCCEEEEEEEEEEECCCC Q ss_conf 87530577359995164532224554112343158612221002467766677520333321480155899985414677 Q gi|254780222|r 233 SKAYLSGRVKTCWSCDKEIAEKYGIPEKEEFYFPGGLKTYLQTKLENRSLISSEIFTGKTEKKGTHRGTVEWAIAWCEEN 312 (686) Q Consensus 233 ~~A~L~~gl~i~~~~~~~~~~~~~~~~~~~f~~~~Gi~dyv~~~~~~~~~~~~~~~~~~~~~~~~~~~~veval~~~~~~ 312 (686) ++|||||||+|.|+|++. +..++|+|++||.+|+++++.++.+++++++....+... ..||||++|++++ T Consensus 160 ~~A~Ln~gl~I~lnder~-------~~~~~f~~~~Gl~dyv~~~~~~~~~~~~~~~~~~~~~~~---~~~eva~~~s~~~ 229 (594) T smart00433 160 ELAFLNKGVKITLNDERS-------DEEETFLFEGGIKDYVELLNKNKELLSPEPTYIEGEKDN---IRVEVAFQYTDGY 229 (594) T ss_pred HHHCCCCCCEEEEEEECC-------CCEEEEECCCCHHHHHHHHCCCCCCCCCCCEEEEEEECC---CEEEEEEEECCCC T ss_conf 984238883899998437-------845789717879999998538987678874587787168---2699999965988 Q ss_pred -CCEEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHH Q ss_conf -7427888300065575241304678999999876301883322233222236426999977634560764466202523 Q gi|254780222|r 313 -PEITSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQNKRAISIISDDLMISAVGILSVFIREPEFAGQTKTKLVSLD 391 (686) Q Consensus 313 -~~~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~~kk~~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~ 391 (686) ++++||||+|+|++|||||+||++||+++|++|++++++.|+.+++++|||++|++||||+|+||+|+||||++|+|++ T Consensus 230 ~~~~~SFvN~I~T~~GGTHv~gf~~al~~~i~~~~~k~~~~K~~~i~~~DIregL~~vIsvki~nPqFegQTKekL~~~~ 309 (594) T smart00433 230 SENIVSFVNNIATTEGGTHENGFKDALTRVINEYAKKKKKLKEKNIKGEDVREGLTAFISVKIPEPQFEGQTKEKLGTSE 309 (594) T ss_pred CCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHCCCEEEEEEEECCCCCCCCHHHHHCCHH T ss_conf 72100023147889998488999999999999999984742446799999852336999998248730664213435977 Q ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECC Q ss_conf 20000121015889997740135688999999999965454433444221100245653335677656554037997327 Q gi|254780222|r 392 AQRTVENALRDPFDHYLVQNPIEATKLLEWVIERSEERLRRRKQREVNRKTAIRKLRLPGKLADCSQNIAHGTELFIVEG 471 (686) Q Consensus 392 ~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~~~a~~r~~~~k~k~~~rk~~~~~~~lpgKL~Da~~~~~~~~eL~lvEG 471 (686) ++++|++.+++.|..||.+||..++.|+++++.++++|++++++++..|+....+..|||||+||+++++++|||||||| T Consensus 310 ~~~~v~~~v~~~l~~~l~~n~~~a~~Ii~k~~~~a~ar~~ak~~k~~~rk~~~~~~~lpgKL~Dc~~~~~~~~eLfivEG 389 (594) T smart00433 310 VRFGVEKIVSECLLSFLEENPVEASKIVEKVLLAAKARAAAKKARELTRKKKLSKISLPGKLADASSAGPKKCELFLVEG 389 (594) T ss_pred HHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEC T ss_conf 76568999999886665439898899999999999999999976877565421125788667243567988628999734 Q ss_pred CCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCHHH Q ss_conf 67642100101122201221123131445410877632288999998825665433333465525189997798771468 Q gi|254780222|r 472 DSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQYREEDLRYEKVIIMTDADVDGAHI 551 (686) Q Consensus 472 DSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~~~~~LRYgkIiImTDaD~DGsHI 551 (686) |||+|||+|||||+||||||||||||||++|+.+||++|+||++|++||||++|++||+++||||||||||||||||+|| T Consensus 390 DSA~gsAk~grdr~~qailPLRGKiLNv~~a~~~ki~~N~Ei~~li~alG~~~g~~~d~~~LRY~kIiImTDADvDG~HI 469 (594) T smart00433 390 DSAGGSAKSGRDRDFQAILPLRGKLLNVEKASLDKILKNEEIQALITALGLGIGKDFDIEKLRYGKIIIMTDADVDGSHI 469 (594) T ss_pred CCCCCCEEEEECCCCEEEECCCCCCCCHHHCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEECCCCCHHHH T ss_conf 78874337751255155531476015355468888751078999999977787876671106778289997789875789 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCEE--EEECCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHH Q ss_conf 8999999998708886339389930996999748979--99378999999999877417886078685067788988998 Q gi|254780222|r 552 ASLLLTFFYQEMYDLIEQKHLFLISPPLFRITQGTKS--VYAHDEEHKQKILEEFKKHGKGKIEISRFKGLGEMLASQLK 629 (686) Q Consensus 552 ~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv~~g~~~--~y~~~e~e~~~~~~~~~~~~~~k~~i~ryKGLGe~~~~e~~ 629 (686) +||||||||||||+||++||||+|+||||||++|++. .++|++.|+++|..... .+..+|+||||||||||+|+||| T Consensus 470 ~~LLltff~r~~p~Li~~G~vy~a~pPL~kv~~~~~~~~~~~~~~~e~~~~~~~~~-~~~~~~~i~ryKGLGem~~~ql~ 548 (594) T smart00433 470 KGLLLTFFYRYMPPLIEAGFVYIAIPPLYKVTKGKKKYVYSFYSLDEYEKWLETKG-GNKSKYEIQRYKGLGEMNADQLW 548 (594) T ss_pred HHHHHHHHHHHHHHHHHCCEEEEEECCEEEEEECCCCEEEEECCCHHHHHHHHHHH-CCCCCCEEEEEECCCCCCHHHHH T ss_conf 99999999987699886796999809889999489407998437479999998620-67898306963126789989999 Q ss_pred HHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 86039400079999866410267989999998807981689999997524 Q gi|254780222|r 630 ETAMDANKRTLLRVEINKDLNSLKDTQDSINKLMGTKADERFKFIQERAS 679 (686) Q Consensus 630 et~~~p~~r~l~~v~~~~~~~~~~~~~~~f~~lmG~~~~~RK~wi~~~~~ 679 (686) ||||||++|+|+||++++ ++.++.+|+.|||++|++||+||++||. T Consensus 549 ettm~p~~r~l~~v~~~d----~~~~~~~~~~lmg~~~~~Rk~~i~~~a~ 594 (594) T smart00433 549 ETTMDPERRTLLFVTLDD----ADEADLIFSALMGDKVEPRKEWIEENAP 594 (594) T ss_pred HHHHCHHHEEEEEEECCC----HHHHHHHHHHHCCCCCHHHHHHHHHCCC T ss_conf 872096363699997686----6779999998769996899999997298 No 8 >PTZ00108 DNA topoisomerase II; Provisional Probab=100.00 E-value=0 Score=1411.21 Aligned_cols=627 Identities=22% Similarity=0.338 Sum_probs=514.7 Q ss_pred CCCCCCCCCCHHHEEECCCCHHHHCCCCEEECCCC-----------------------CCCHHHHHHHHHHHHHHHHHCC Q ss_conf 67763123697560231121034326981546777-----------------------5502202443666678777618 Q gi|254780222|r 36 ANQSASDNYDASSIRILEGLEPVRMRPGMYIGGTD-----------------------EKALHHLFSEVIDNAMDEVIAG 92 (686) Q Consensus 36 ~~~~~~~~Y~~~~i~~L~glE~VrkRP~mYIGst~-----------------------~~GL~~i~~EIldNavDe~~~g 92 (686) ..+++++.| |+|++||||++||+|||||++ .|||+|||||||+||+|+.+|+ T Consensus 3 k~ktiee~Y-----qKkTqlEHILlRPDtYIGSvE~~~~~mWV~d~e~~~m~~k~v~yvPGLyKIFDEILVNAaDNk~Rd 77 (1506) T PTZ00108 3 KNKSIEERY-----QKKSQIEHILLRPDTYIGSVEMHTQLLWVWNKEKNRMVQKNITYVPGLYKIFDEIIVNAADVKARE 77 (1506) T ss_pred CCCCHHHHH-----HCCCHHHHHHCCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEEECCHHHHHHHHHHCCCHHHHHCC T ss_conf 766689977-----506888898628987756764651038899767683888886666534754445641514443207 Q ss_pred C------CCEEEEEECCC-CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHE Q ss_conf 9------97699998289-8399996698434441766578701234545200247777774156314453889876203 Q gi|254780222|r 93 Y------ANLIEVSLDGN-GFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNSTYEISGGLHGVGISVVNAL 165 (686) Q Consensus 93 ~------~~~I~V~i~~d-g~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~yk~sgGlnGvGaklvNal 165 (686) + |+.|+|+|+.+ |+|||+||||||||++|++ +++|+||||||||||||||||++|||||||||||||||||| T Consensus 78 ~~~~~~pmt~IKVtID~eng~ISV~NNGRGIPVeiH~k-ekvyvPELIFGhLlTGsNFDDdeyKVSGGrNGvGAKLvNIF 156 (1506) T PTZ00108 78 KEKSENPMTCIKIEINKDNKKISVYNDGEGIPVDIHKE-MNIYVPHMIFGELLTSDNYDDAEDRITGGRNGFGAKLTNIF 156 (1506) T ss_pred CCCCCCCCCEEEEEEECCCCEEEEEECCCCCCCCCCCC-CCCCHHHHHHCCCCCCCCCCCCCCEECCCCCCCCEEEEEEC T ss_conf 54356876659999967899799997999524368776-79747268652076767878995634488887245787862 Q ss_pred EEEEEEEEECC--CEEEEEEEECCCEECCCEECCCCCCCCCEEEEEEECHHHEECCCCCCHHH---HHHHHHHHHHCCCC Q ss_conf 21269999418--83899997279451131121676778853999998424320012222899---99988887530577 Q gi|254780222|r 166 SDELHVTVARQ--NTIFSQKFSRGIPLSPLEKKEKVRNKRGTRITFRPDPKIFGDKAGFDAAR---LLKMTQSKAYLSGR 240 (686) Q Consensus 166 S~~f~Vev~r~--gk~y~q~f~~g~~~~~~~~i~~~~~k~GT~ItF~PD~~iFg~~~~~d~~~---l~~rl~~~A~L~~g 240 (686) |++|+|||+++ ++.|+|+|++||.....|+|+.+++++||+|||+||+++|+++ .++.++ |.+|++++|.++ + T Consensus 157 SteF~VEt~D~~~kk~ykQtw~nNM~k~~~PkIk~~~gk~yTkITFkPD~~iFgmt-~ld~Di~sLL~KRvyDLAgt~-s 234 (1506) T PTZ00108 157 SKEFIVQCGDSSRKKEFKMTWSDNMSKFSEPHIKNYNGKDYVKVTFKPDLAKFGMT-EMDDDIESLLCKRVYDLAGTC-S 234 (1506) T ss_pred CCEEEEEEEECCCEEEEEEEEECCCCCCCCCEEECCCCCCCEEEEEECCHHHCCCC-CCCHHHHHHHHHHHHHHHCCC-C T ss_conf 68189999847953799999708971378874743689981699998488980983-037789999999999973677-8 Q ss_pred CEEEEECCCCCCCCCCCC---------------CEEEEEECCCCHHHHHC----------CCCCCCCCC----CCCE--- Q ss_conf 359995164532224554---------------11234315861222100----------246776667----7520--- Q gi|254780222|r 241 VKTCWSCDKEIAEKYGIP---------------EKEEFYFPGGLKTYLQT----------KLENRSLIS----SEIF--- 288 (686) Q Consensus 241 l~i~~~~~~~~~~~~~~~---------------~~~~f~~~~Gi~dyv~~----------~~~~~~~~~----~~~~--- 288 (686) ++|.||+++.....+... ....+++.+|...+... .+.....+. ..+. T Consensus 235 VkVyLNG~rI~Ik~Fk~Yv~Lylkd~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~de~~n~~~d~~dvs~~~eP~~~~ 314 (1506) T PTZ00108 235 VRVYLNGNRLAIKDFKSYVDLYLKDNAGDNKGNGGQSGSGSGGGSGNDNGNGGGNGGDEEANNAADNLDVSASNEPADAT 314 (1506) T ss_pred EEEEECCCEEECCCHHHHHEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCC T ss_conf 06997585100155344202675012233333333222223332111122344433201210111233210013544465 Q ss_pred -----EEEE--EEC-------CCEEEEEEEEEEECCCC-CCEEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCCCCC Q ss_conf -----3333--214-------80155899985414677-74278883000655752413046789999998763018833 Q gi|254780222|r 289 -----TGKT--EKK-------GTHRGTVEWAIAWCEEN-PEITSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQNKR 353 (686) Q Consensus 289 -----~~~~--~~~-------~~~~~~veval~~~~~~-~~~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~~kk 353 (686) .... ..+ ......||||+++++++ ++|+||||||+|++|||||+||.++|++.+.++++.+ +|+ T Consensus 315 p~~~~~~~~~~~~d~~eii~I~e~~~RWEVav~~Sd~~~FqqVSFVNnI~T~kGGTHVdyi~dqIvn~L~e~~~kK-kK~ 393 (1506) T PTZ00108 315 PSKNNAANGANNNDEDEIVKIHEKQHRWEIVVSKSDGSQFQQVSFVNSICTTKGGSHVNYIVEQLLNSLSKKANAK-NKG 393 (1506) T ss_pred CCCCCCCCCCCCCCCCCEEEEECCCCCEEEEEEECCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHH-CCC T ss_conf 4322222233456655315651367747999997789965788776761589998089999999999999998652-557 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 22233222236426999977634560764466202523200001210158899977401356889999999999654544 Q gi|254780222|r 354 AISIISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKLLEWVIERSEERLRRR 433 (686) Q Consensus 354 ~~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~~~a~~r~~~~ 433 (686) +.+++++|||+||++||||+|+||+|+||||++|+|++++|+++|.++++|.+++.++|. ...|+.++..++++++++ T Consensus 394 ~~~Ik~~dIKegL~aFVn~kI~NPqFdSQTKEkLtT~~~kFGSk~~LsekfikkllKs~I-IekIl~~a~aKa~~~LkK- 471 (1506) T PTZ00108 394 GMEIKSGHIKNHLWVFVNCLIVNPTFDSQTKETLTTKPVKFGSKCILSDKTINNVLKSPI-LSNILLWAQAKAQVELKK- 471 (1506) T ss_pred CCCCCHHHHHHCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCHH-HHHHHHHHHHHHHHHHHH- T ss_conf 776798998605799999977799767610154038864563553478999999986959-999999999999999999- Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCC---CCCCCEEEEEEECCCCCCCCHHHCCC---CCCEEEEEECCEECCHHHHHHHHH Q ss_conf 334442211002456533356776---56554037997327676421001011---222012211231314454108776 Q gi|254780222|r 434 KQREVNRKTAIRKLRLPGKLADCS---QNIAHGTELFIVEGDSAGGSAKQARN---RSNQAVLPLRGKILNVASAGSEKI 507 (686) Q Consensus 434 k~k~~~rk~~~~~~~lpgKL~Da~---~~~~~~~eL~lvEGDSA~gsAk~grd---r~~qgi~PLRGKiLNv~~as~~ki 507 (686) +.+...++...+..++| ||+||+ ++++++|||||||||||+|+|++|+. |++||||||||||||||+|+.+|| T Consensus 472 k~Ka~~~KkR~ri~gIp-KL~DAN~AGtk~s~~CTLiLtEGDSAK~lAvaGlsvvGRD~~GVfPLRGKiLNVr~As~~kI 550 (1506) T PTZ00108 472 KMKAGSSKARERIIGIP-KLEDANDAGSKYSQECTLILTEGDSAKTSCLAGLSIVGRDKYGVFPLKGKLLNVRDASFKQL 550 (1506) T ss_pred HHHHHCCCCHHHCCCCC-CCCCCCCCCCCCCCCEEEEEECCCCHHHHHHCCCCCCCCCCEEEEECCCEEEECCCCCHHHH T ss_conf 86541144101114898-77463002799863328999637516877750402136542234420561310134659888 Q ss_pred HCCHHHHHHHHHHCCCCCC-CC-CCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHC-CCEEEECCCEEEEEE Q ss_conf 3228899999882566543-33-33465525189997798771468899999999870888633-938993099699974 Q gi|254780222|r 508 RNNQQIMDLVQALGCRTRS-QY-REEDLRYEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQ-KHLFLISPPLFRITQ 584 (686) Q Consensus 508 ~~N~Ei~~l~~~lGl~~g~-~~-~~~~LRYgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~-g~v~~~~tPl~kv~~ 584 (686) ++|+||++|++||||+++. .| |+++||||||||||||||||||||||||||||+|||+||+. |||..|+|||+||++ T Consensus 551 ~~N~EI~nIikiLGL~i~~k~~~dik~LRYGkImIMTDqDvDGSHIKGLLiNf~h~~wPsLLk~~gFl~~fiTPIvK~~k 630 (1506) T PTZ00108 551 MDNKEIQNIFKIMGLDITDKNKDDIKGLRYGSLMIMTDQDYDGSHIKGLLINMIHKFWPSLLKHKGFLSEFVTPIVKVQK 630 (1506) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHCEEEEEECC T ss_conf 61388999999977786776556553445561699834889860138899999998628761876788873010355326 Q ss_pred CCEEEEECCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHCC Q ss_conf 89799937899999999987741788607868506778898899886039400079999866410267989999998807 Q gi|254780222|r 585 GTKSVYAHDEEHKQKILEEFKKHGKGKIEISRFKGLGEMLASQLKETAMDANKRTLLRVEINKDLNSLKDTQDSINKLMG 664 (686) Q Consensus 585 g~~~~y~~~e~e~~~~~~~~~~~~~~k~~i~ryKGLGe~~~~e~~et~~~p~~r~l~~v~~~~~~~~~~~~~~~f~~lmG 664 (686) |++...||+.+||++|.+. ....+|+|+||||||||++.|++|||.|.++|++..... ...+.+.++.+|+. T Consensus 631 g~~~~~Fyt~~eye~Wk~~---~~~~~w~iKYYKGLGTSt~~EakeYF~~l~~h~~~f~~~--~~~d~~~i~laF~k--- 702 (1506) T PTZ00108 631 GNQEHSFFTIAEYEQWKEN---TNLLGWKIKYYKGLGTSTDKEFKQYFSDIKNHKIMFLWT--GDRDGDSIDMAFSK--- 702 (1506) T ss_pred CCCCCCCCCHHHHHHHHHC---CCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCEEEEEEC--CCCCHHHHHHHHCC--- T ss_conf 9740112788999999863---898874057764368888089999998787557888957--98646679877287--- Q ss_pred CCHHHHHHHHHHH--HHHHH Q ss_conf 9816899999975--24677 Q gi|254780222|r 665 TKADERFKFIQER--ASFID 682 (686) Q Consensus 665 ~~~~~RK~wi~~~--~~~~~ 682 (686) ++|++||+||+++ ++++| T Consensus 703 k~ad~RK~WL~~~~~~~~ld 722 (1506) T PTZ00108 703 KRIEDRKLWLQNFIIGSYVD 722 (1506) T ss_pred CCHHHHHHHHHHCCCCCCCC T ss_conf 76889999998378776335 No 9 >TIGR01055 parE_Gneg DNA topoisomerase IV, B subunit; InterPro: IPR005737 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . This entry represents subunit B (parE) of topoisomerase IV from Gram negative bacteria. Topoisomerase IV primarily decatenates DNA and relaxes positive supercoils, which is important in bacteria, where the circular chromosome becomes catenated, or linked, during replication . Topoisomerase IV consists of two polypeptide subunits, parC (subunit A), which is homologous to gyrA of topoisomerase II, and parE (subunit B), which is homologous to gyrB of topoisomerase II. More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0005694 chromosome. Probab=100.00 E-value=0 Score=1400.72 Aligned_cols=634 Identities=43% Similarity=0.712 Sum_probs=604.6 Q ss_pred CCCCHHHEEECCCCHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCEEEEEECCCCCCCCCC Q ss_conf 23697560231121034326981546777550220244366667877761899769999828983999966984344417 Q gi|254780222|r 42 DNYDASSIRILEGLEPVRMRPGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNGFLTVVDNGRGIPVENH 121 (686) Q Consensus 42 ~~Y~~~~i~~L~glE~VrkRP~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg~IsV~ddGrGIPv~~h 121 (686) .+|+|+.|+||+||||||+|||||+|+||+...|||..||||||+|||+|||++.|.|.+|.|.||+|.|||||||||+| T Consensus 2 ~~YsA~~I~VL~gLEPVr~RPGMY~GGtD~~~~~HL~~EV~DN~~DEA~AGfA~~i~v~lH~D~Si~v~D~GRG~PvD~H 81 (647) T TIGR01055 2 TNYSAKDIEVLDGLEPVRKRPGMYTGGTDTTRPNHLVQEVIDNSVDEALAGFASKIEVILHQDQSIEVIDNGRGMPVDIH 81 (647) T ss_pred CCCCCCHHHHHCCCCCCCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCEEEEEECCCCCCCCCC T ss_conf 87461013220168773428872017865653013455544333555540653078899828961799888830344278 Q ss_pred CCCCCCEEEEEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHEEEEEEEEEECCCEEEEEEEECCCEECCCEECCCC-C Q ss_conf 665787012345452002477777741563144538898762032126999941883899997279451131121676-7 Q gi|254780222|r 122 PKFPNKSTLEIILTTLHAGGKFDNSTYEISGGLHGVGISVVNALSDELHVTVARQNTIFSQKFSRGIPLSPLEKKEKV-R 200 (686) Q Consensus 122 ~k~~~~~~~Elif~~L~aggnfdd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~gk~y~q~f~~g~~~~~~~~i~~~-~ 200 (686) |+.++++++|+|+|.|||||||.++.|.+||||||||+|||||||++|.|.|+|+|+.|.|.|++|....++..++.| + T Consensus 82 P~~P~~sa~EvILt~LHaG~KFSN~~Y~~sGGLHGVGISVVNALS~~~~i~V~~~G~~Y~~af~~G~K~~dL~~~~~~Gk 161 (647) T TIGR01055 82 PKEPGVSAVEVILTKLHAGGKFSNKNYEFSGGLHGVGISVVNALSKRVKIKVKRQGKLYSIAFENGAKVTDLEVVGTCGK 161 (647) T ss_pred CCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCEECCHHHHEEECCCEEEEEEEECCEEEEHHHHCCCCHHHHHHHHCCCC T ss_conf 88888862355643005785211444013686002143202201323678997467043202215751102332112677 Q ss_pred CCCCEEEEEEECHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEECCCCHHHHHCCCCCC Q ss_conf 78853999998424320012222899999888875305773599951645322245541123431586122210024677 Q gi|254780222|r 201 NKRGTRITFRPDPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAEKYGIPEKEEFYFPGGLKTYLQTKLENR 280 (686) Q Consensus 201 ~k~GT~ItF~PD~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~~~~~~~~~~f~~~~Gi~dyv~~~~~~~ 280 (686) ..+||.|.|+||+++|.+...|++..|.+.+|.+|||++||.|.|.|+-.. ....|....++|++||+||+....... T Consensus 162 R~~GT~v~F~Pd~~~F~~k~~F~~~~l~~~~raKA~L~~Gv~I~f~~~v~~--PsdiP~~~~~~y~~GL~DYL~~~v~~~ 239 (647) T TIGR01055 162 RLTGTSVHFTPDPEIFDSKLKFSVSRLYHILRAKAVLCRGVEIEFEDEVNN--PSDIPTKALWNYEDGLKDYLSEAVNGL 239 (647) T ss_pred CCCCCEEEECCCCCHHCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEEECC--CCCCCHHHHHHCCCCHHHHHHHHHCCC T ss_conf 655635773168201031225578999999863242026720789751068--655754554311033136888763123 Q ss_pred CCCCCCCEEEEEEECCCEEEEEEEEEEECCCCCC--EEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCCC-CCCCCC Q ss_conf 6667752033332148015589998541467774--2788830006557524130467899999987630188-332223 Q gi|254780222|r 281 SLISSEIFTGKTEKKGTHRGTVEWAIAWCEENPE--ITSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQN-KRAISI 357 (686) Q Consensus 281 ~~~~~~~~~~~~~~~~~~~~~veval~~~~~~~~--~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~~-kk~~~i 357 (686) +++.+++|.+..+-. .....++||+.|.+++.+ ..||||.|||++||||++|++++|.++|++|++..+. ++++++ T Consensus 240 ~~~~~~~F~G~~~~t-vd~~~~~WAl~W~~~~~~~~~~SY~N~iPT~QGGTH~~GL~~~lL~a~~~f~E~~~~~P~~~~L 318 (647) T TIGR01055 240 NTLPKKPFVGNFEGT-VDDEAVEWALLWLPEGGELLMESYVNLIPTIQGGTHVNGLRQGLLDALREFCEMRNLLPRGVKL 318 (647) T ss_pred CCCCCCCEECCCCCC-HHHHHCCEEEEECCCCCHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEC T ss_conf 223455300232110-1111002446632788400320411015789887104578999999999988875188640221 Q ss_pred CCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 32222364269999776345607644662025232000012101588999774013568899999999996545443344 Q gi|254780222|r 358 ISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKLLEWVIERSEERLRRRKQRE 437 (686) Q Consensus 358 ~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~~~a~~r~~~~k~k~ 437 (686) +.+||.+...+++|++|.+|+|.|||||+|.|..+...|+.+++|.|..||..|...|..++|.++..|+.|...+.+|. T Consensus 319 t~eDI~~~~s~vLS~~~~~p~F~GQTKE~L~S~~~a~~V~~v~KD~F~~~L~~~~~~A~~L~E~~I~~A~~Rrn~~~~~~ 398 (647) T TIGR01055 319 TAEDILDRCSYVLSIKMQDPQFAGQTKERLSSRQVAKLVSGVIKDAFDLWLNQNVQLAEKLAEIAISSAQRRRNKRAAKK 398 (647) T ss_pred CHHHHHHHHHEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 08888731110145566556657753123313135788777787788887411078899999999888887765455314 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHH Q ss_conf 42211002456533356776565540379973276764210010112220122112313144541087763228899999 Q gi|254780222|r 438 VNRKTAIRKLRLPGKLADCSQNIAHGTELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLV 517 (686) Q Consensus 438 ~~rk~~~~~~~lpgKL~Da~~~~~~~~eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~ 517 (686) +.||+...+..|||||+||+..+.+++||||||||||||||||||||++|||||||||||||+..+.++|+.|+||++|- T Consensus 399 v~RK~~~~~~~LPGKLADCt~~~~e~TELFlVEGDSAGGSAKQAR~RE~QAilPL~GKiLN~~~~~~~~vl~~~~i~D~~ 478 (647) T TIGR01055 399 VVRKKLTSGPALPGKLADCTRQDLEKTELFLVEGDSAGGSAKQARDREYQAILPLKGKILNTWEVSLDKVLNSQEIHDIE 478 (647) T ss_pred CCHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHCCCCCEEECCCCCCEEEEECCCHHHHHCCCCCCEEE T ss_conf 20012104678897211334311126427898325677521343053201214456647763214644553124200021 Q ss_pred HHHCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCEEEEECCHHHH Q ss_conf 88256654333334655251899977987714688999999998708886339389930996999748979993789999 Q gi|254780222|r 518 QALGCRTRSQYREEDLRYEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISPPLFRITQGTKSVYAHDEEHK 597 (686) Q Consensus 518 ~~lGl~~g~~~~~~~LRYgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv~~g~~~~y~~~e~e~ 597 (686) -||||+..++|..++||||||+||||||+||.||.+||++||.+.||.|++.||||+|.|||||++.+++.+|+.+|+|+ T Consensus 479 vAl~~~~~~~~k~~~LRY~Ki~Il~DAD~DG~HIA~LL~~lF~~~~p~LV~~GH~Y~A~PPLYR~~~~~~vyYA~dEeEK 558 (647) T TIGR01055 479 VALGIDPDSNDKLSQLRYGKIIILADADSDGLHIATLLCALFLKHFPKLVEEGHVYVAKPPLYRIDLSKEVYYALDEEEK 558 (647) T ss_pred EEEECCCCCHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCEEEECCCEEEEECCHHHH T ss_conf 02224872000364267222889841788807899999999999888887539078717872021068706885284668 Q ss_pred HHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHCCCC-HHHHHHHHHH Q ss_conf 999998774178860786850677889889988603940007999986641026798999999880798-1689999997 Q gi|254780222|r 598 QKILEEFKKHGKGKIEISRFKGLGEMLASQLKETAMDANKRTLLRVEINKDLNSLKDTQDSINKLMGTK-ADERFKFIQE 676 (686) Q Consensus 598 ~~~~~~~~~~~~~k~~i~ryKGLGe~~~~e~~et~~~p~~r~l~~v~~~~~~~~~~~~~~~f~~lmG~~-~~~RK~wi~~ 676 (686) .+.++.+++..++|.++|||||||||+|.||+||||+|++|.|+|||+++..+..-++-+++++||||+ ++.|++||++ T Consensus 559 ~~~~~~L~~~sK~K~~V~RFKGLGEM~P~QL~ETT~~P~~R~L~~~T~~~~~~q~~D~l~~~D~LlaKKR~E~R~~~l~~ 638 (647) T TIGR01055 559 EKLLEKLKKKSKGKPNVQRFKGLGEMNPAQLRETTMDPNTRRLVQLTLDDVQDQRVDTLKLVDMLLAKKRSEDRKNWLQE 638 (647) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 99999997630688544110254567820121035888761378876312556553279999987506884468999998 Q ss_pred HH Q ss_conf 52 Q gi|254780222|r 677 RA 678 (686) Q Consensus 677 ~~ 678 (686) .| T Consensus 639 qa 640 (647) T TIGR01055 639 QA 640 (647) T ss_pred HH T ss_conf 77 No 10 >KOG0355 consensus Probab=100.00 E-value=0 Score=1094.04 Aligned_cols=597 Identities=25% Similarity=0.342 Sum_probs=522.4 Q ss_pred HHEEECCCCHHHHCCCCEEECCCC---------------------CCCHHHHHHHHHHHHHHHHHCCC-CCEEEEEECC- Q ss_conf 560231121034326981546777---------------------55022024436666787776189-9769999828- Q gi|254780222|r 47 SSIRILEGLEPVRMRPGMYIGGTD---------------------EKALHHLFSEVIDNAMDEVIAGY-ANLIEVSLDG- 103 (686) Q Consensus 47 ~~i~~L~glE~VrkRP~mYIGst~---------------------~~GL~~i~~EIldNavDe~~~g~-~~~I~V~i~~- 103 (686) +++|+.+++||+++||++||||++ .|||||||||||+||+| .++++ |+.|.|+||+ T Consensus 6 ~~yqkks~~ehiL~rPdtyig~ve~~~~~~wv~~~e~~k~~~~t~~pGl~ki~dEilvNaad-k~rd~~m~~i~v~i~~e 84 (842) T KOG0355 6 DIYQKKSQLEHILLRPDTYIGSVEPTEQLMWVYDMEKRKMVQRTYVPGLYKIFDEILVNAAD-KQRDPKMNTIKVTIDKE 84 (842) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEECCCCCCEEEEECCCCHHHHHHHHHHCCCC-CCCCCCCCEEEEEECCC T ss_conf 77765211133314876423554552247754033448546763487178888998603323-22487765268887167 Q ss_pred CCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHEEEEEEEEEE--CCCEEEE Q ss_conf 98399996698434441766578701234545200247777774156314453889876203212699994--1883899 Q gi|254780222|r 104 NGFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNSTYEISGGLHGVGISVVNALSDELHVTVA--RQNTIFS 181 (686) Q Consensus 104 dg~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~yk~sgGlnGvGaklvNalS~~f~Vev~--r~gk~y~ 181 (686) +++|||+|||+||||++||. +++|+||||||+|||||||||++.|++||+|||||+|||+||++|++||+ +.++.|+ T Consensus 85 ~~~isv~nnGkGIPv~~H~~-ek~yvpelifg~Lltssny~d~ekK~tggrngygakLcniFs~~f~~Et~d~~~~~~~k 163 (842) T KOG0355 85 KNEISVYNNGKGIPVTIHKV-EKVYVPELIFGNLLTSSNYDDDEKKVTGGRNGYGAKLCNIFSTEFTVETADREYKMAFK 163 (842) T ss_pred CCEEEEEECCCCCEEEECCC-CCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEECCCCCEEEEEEHHHHHHHH T ss_conf 76799995797213122342-22453478876644115678875432367775020343211454036530067777877 Q ss_pred EEEECCCEECCCEECCCCCCCCCEEEEEEECHHHEECCCCCCHHH---HHHHHHHHHHCCCCCEEEEECCCCCCCCCCCC Q ss_conf 997279451131121676778853999998424320012222899---99988887530577359995164532224554 Q gi|254780222|r 182 QKFSRGIPLSPLEKKEKVRNKRGTRITFRPDPKIFGDKAGFDAAR---LLKMTQSKAYLSGRVKTCWSCDKEIAEKYGIP 258 (686) Q Consensus 182 q~f~~g~~~~~~~~i~~~~~k~GT~ItF~PD~~iFg~~~~~d~~~---l~~rl~~~A~L~~gl~i~~~~~~~~~~~~~~~ 258 (686) |+|.+||+.+..+.+..+.+.+||+|||+||.+.|++. ++|-|+ +.+|+.++|....++++.+|+.+... T Consensus 164 Q~w~~nm~~~~~~~i~~~~~~~yTkitF~PDl~~F~m~-eLD~Div~l~~rr~~d~a~~~~~vkv~ln~~~~~~------ 236 (842) T KOG0355 164 QTWINNMTRDEEPKIVPSTDEDYTKITFSPDLEKFKMK-ELDDDIVALMARRAYDLAGSVKSVKVELNGKNIPV------ 236 (842) T ss_pred HHHHCCCCCCCCCEEECCCCCCCCEEEECCCHHHCCCH-HHCCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCC------ T ss_conf 76526876567865522589995369968476752722-32231887677666652344212156516887752------ Q ss_pred CEEEEEECCCCHHHHHCCCCCCCCCCCCCEEEEEEECCCEEEEEEEEEEECCCCCCEEEEEEECCCCCCCEEECCHHHHH Q ss_conf 11234315861222100246776667752033332148015589998541467774278883000655752413046789 Q gi|254780222|r 259 EKEEFYFPGGLKTYLQTKLENRSLISSEIFTGKTEKKGTHRGTVEWAIAWCEENPEITSYCNTILTDEGGTHESGLRVAL 338 (686) Q Consensus 259 ~~~~f~~~~Gi~dyv~~~~~~~~~~~~~~~~~~~~~~~~~~~~veval~~~~~~~~~~SFVN~I~T~~GGTHv~g~~~ai 338 (686) +|..+|++.+.. ..... .+...- .+ .....||+|++-++..++++||||+|+|.+|||||+++++++ T Consensus 237 --------~~f~~yv~~~~~-~~~~~-~~~~~i-he--~~~~RwEv~~a~s~~~fqqisfvNsI~T~kGGThVd~V~D~i 303 (842) T KOG0355 237 --------KGFYDYVKMYLE-VLWLN-DDLKPL-HE--VLNFRWEVALALSDVGFQQVSFVNSIATTKGGTHVDYVVDQI 303 (842) T ss_pred --------HHHHHHHHHCCC-CCCCC-CCCHHH-HC--CCCHHHHHHHHHCCCCCCEEEEECCCEECCCCCCCCHHHHHH T ss_conf --------017788762353-22246-752222-00--432488886550446653354200101047986544166557 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 99999876301883322233222236426999977634560764466202523200001210158899977401356889 Q gi|254780222|r 339 TRGIKKYAELTQNKRAISIISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKL 418 (686) Q Consensus 339 ~k~i~~~~~~~~~kk~~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I 418 (686) ++.|...+++.+.++...+++..|+++|++||+|.|+||+|+||||++|+++..++++.|.++++|.+.+.+++ +...| T Consensus 304 v~~l~~vvkKk~~~~~~svk~f~vkn~L~vfVN~lIENPtFesQTke~lt~~~~~fgs~~elsek~~k~~~~~~-i~esi 382 (842) T KOG0355 304 VAKLIDVVKKKKVKKDISVKPFQVKNHLWVFVNCLIENPTFESQTKERLTLRPKRFGSDCELSEKFTKAIELKG-VVESI 382 (842) T ss_pred HHHHHHHHCCCCCCCCCCCCHHHHHCCEEEEEEEEECCCCCCCCCHHHCCCCCCCCCCEEEEHHHHHHHHHHCC-HHHHH T ss_conf 77888874211346654446453303569999887427663443310126663454326763167799998676-49999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC---CCCCCEEEEEEECCCCCCCCHHHCC---CCCCEEEEEE Q ss_conf 999999999654544334442211002456533356776---5655403799732767642100101---1222012211 Q gi|254780222|r 419 LEWVIERSEERLRRRKQREVNRKTAIRKLRLPGKLADCS---QNIAHGTELFIVEGDSAGGSAKQAR---NRSNQAVLPL 492 (686) Q Consensus 419 ~~~~~~~a~~r~~~~k~k~~~rk~~~~~~~lpgKL~Da~---~~~~~~~eL~lvEGDSA~gsAk~gr---dr~~qgi~PL 492 (686) +.++..+....++.+..++..+. -.+| ||.|++ ++.+.+|+|+++|||||+.+|..|+ +|++|||||| T Consensus 383 ls~s~~~~~~dL~kk~~~~~s~~-----e~I~-KleDa~~ag~k~s~kctlIltEgdsaksLavsGl~vvgRd~~gVfpl 456 (842) T KOG0355 383 LSKSLNKYKDDLAKKRGKLTSRS-----KGIP-KLEDANDAGTKTSEKCTLILTEGDSAKSLAVSGLSVVGRDYYGVFPL 456 (842) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC-----CCHH-HHHHCCCCCCCCCCCEEEEEECCCCCHHHEECCCCCCCEEEEEECCC T ss_conf 99888889876544434443320-----1306-76523324434554316888615520332013555466046755011 Q ss_pred CCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCC-CCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 231314454108776322889999988256654333-3346552518999779877146889999999987088863393 Q gi|254780222|r 493 RGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQY-REEDLRYEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQKH 571 (686) Q Consensus 493 RGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~-~~~~LRYgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~g~ 571 (686) |||+|||++++.++|++|+||++|++|||+++.+.| +.++|||||+|||||||+|||||+||||||||+|||+|+..+| T Consensus 457 rgklLNv~e~s~kqilkn~EI~aIikIlgl~~~~~y~~~eslryg~~mimtDQd~dgshikgllInf~~~~~psl~~~~f 536 (842) T KOG0355 457 RGKLLNVREASHKQILKNAEINAIIKILGLQYKKKYFDVESLRYGKLMIMTDQDHDGSHIKGLLINFIHHFWPSLLQIPF 536 (842) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCC T ss_conf 23134457777988877999999999998877533133122242224665123456434777889889876588635874 Q ss_pred EEEECCCEEEEEECCEEEEECCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCC Q ss_conf 89930996999748979993789999999998774178860786850677889889988603940007999986641026 Q gi|254780222|r 572 LFLISPPLFRITQGTKSVYAHDEEHKQKILEEFKKHGKGKIEISRFKGLGEMLASQLKETAMDANKRTLLRVEINKDLNS 651 (686) Q Consensus 572 v~~~~tPl~kv~~g~~~~y~~~e~e~~~~~~~~~~~~~~k~~i~ryKGLGe~~~~e~~et~~~p~~r~l~~v~~~~~~~~ 651 (686) |+.|+|||+|+++|++..+||+++||++|.+.... .++|+||||||||||+++|+||||+|+++|++.+.. +...+ T Consensus 537 leefitpi~k~~~~~~~~~fys~pey~kw~~~~~~--~~~~~ikyykglgts~~~e~keyf~~~~~h~~~F~~--~~~~d 612 (842) T KOG0355 537 LEEFITPIVKATKGKQEVSFYSLPEYEKWKKNTDN--WKSWKIKYYKGLGTSTSKEAKEYFSDLDRHRILFKY--DGGID 612 (842) T ss_pred HHHCCCCCEEECCCCCCEEEECCHHHHHHHHCCCC--CCCCCEEEEECCCCCCHHHHHHHHHCHHHHEEEEEC--CCCCH T ss_conf 21047764574047743145445657888754844--244223775147745478999986160012378752--57504 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 7989999998807981689999997524 Q gi|254780222|r 652 LKDTQDSINKLMGTKADERFKFIQERAS 679 (686) Q Consensus 652 ~~~~~~~f~~lmG~~~~~RK~wi~~~~~ 679 (686) .+.++.+|.+ ++.++||+||.+++. T Consensus 613 ~~~i~~af~k---~k~~~rk~wL~~~~~ 637 (842) T KOG0355 613 DEAIDMAFSK---KKIDARKEWLRKYMP 637 (842) T ss_pred HHHHHHHCCC---CCHHHHHHHHHHHHH T ss_conf 7788642231---006667768998656 No 11 >cd03366 TOPRIM_TopoIIA_GyrB TOPRIM_TopoIIA_GyrB: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in proteins of the type IIA family of DNA topoisomerases similar to the Escherichia coli GyrB subunit. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. These proteins also catenate/ decatenate duplex rings. DNA gyrase is more effective at relaxing supercoils than decatentating DNA. DNA gyrase in addition inserts negative supercoils in the presence of ATP. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). The conserved glutamate may act as a general base in strand joining and as a general acid in strand cleava Probab=100.00 E-value=0 Score=386.37 Aligned_cols=114 Identities=60% Similarity=0.918 Sum_probs=112.8 Q ss_pred EEEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEC Q ss_conf 37997327676421001011222012211231314454108776322889999988256654333334655251899977 Q gi|254780222|r 464 TELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQYREEDLRYEKVIIMTD 543 (686) Q Consensus 464 ~eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~~~~~LRYgkIiImTD 543 (686) +||||||||||+|+|+|||||++|||||||||+|||++++.+++++|+||++|++||||+++++||+++||||||||||| T Consensus 1 seL~l~EGDSA~g~a~~~Rd~~~qgv~PLrGK~lNv~~~~~~ki~~N~Ei~~l~~~lGl~~~~~~d~~~LRYgkIiImtD 80 (114) T cd03366 1 SELYIVEGDSAGGSAKQGRDRRFQAILPLRGKILNVEKARLDKILKNEEIRALITALGTGIGEDFDLEKLRYHKIIIMTD 80 (114) T ss_pred CEEEEECCCCCHHHHHCCCCCCCEEEEECCCCCHHHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCHHHCCCCEEEEEEC T ss_conf 92899727674055652178786899870785000642878788629679999999687878645400078786999968 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECC Q ss_conf 9877146889999999987088863393899309 Q gi|254780222|r 544 ADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISP 577 (686) Q Consensus 544 aD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~t 577 (686) ||+||+||++|||||||+|||+|+++||||+++| T Consensus 81 aD~DG~HI~~Lli~ff~~~~p~Li~~G~vyia~P 114 (114) T cd03366 81 ADVDGAHIRTLLLTFFFRYMRPLIENGHVYIAQP 114 (114) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECC T ss_conf 9988068999999999998499998694788369 No 12 >cd01030 TOPRIM_TopoIIA_like TOPRIM_TopoIIA_like: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in proteins of the type IIA family of DNA topoisomerases similar to Saccharomyces cerevisiae Topoisomerase II. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. These proteins also catenate/ decatenate duplex rings. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). The conserved glutamate may act as a general base in strand joining and as a general acid in strand cleavage by topisomerases. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=100.00 E-value=0 Score=380.91 Aligned_cols=114 Identities=54% Similarity=0.816 Sum_probs=112.5 Q ss_pred EEEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCC-CCCCCCCCCEEEEE Q ss_conf 37997327676421001011222012211231314454108776322889999988256654333-33465525189997 Q gi|254780222|r 464 TELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQY-REEDLRYEKVIIMT 542 (686) Q Consensus 464 ~eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~-~~~~LRYgkIiImT 542 (686) |||||||||||+|+|++||||++|||||||||+|||++++.+++++|+||++|++||||+++++| ++++|||||||||| T Consensus 1 c~L~l~EGdSA~g~~~~~rd~~~~gv~pLrGK~lNv~~~~~~ki~~N~Ei~~l~~~LGl~~~~~~~~~~~LRYgkIii~t 80 (115) T cd01030 1 CELILVEGDSAGGSAKQGRDRVFQAVFPLRGKILNVEKASLKKILKNEEIQNIIKALGLGIGKDDFDLDKLRYGKIIIMT 80 (115) T ss_pred CEEEEECCCCCHHHHHHHCCCCCEEEEECCCCCCCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCHHHCCCCCEEEEE T ss_conf 97999647453455542048675899852781466666889898709789999999677989653536348627448987 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECC Q ss_conf 79877146889999999987088863393899309 Q gi|254780222|r 543 DADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISP 577 (686) Q Consensus 543 DaD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~t 577 (686) |||+||+||++|||||||+|||+|+++||||+++| T Consensus 81 DaD~DG~HI~~Lll~ff~~~~P~L~~~G~vyi~~T 115 (115) T cd01030 81 DADVDGSHIRTLLLTFFYRFWPSLLENGFLYIAQT 115 (115) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEEEEC T ss_conf 89997579999999999998199998791799859 No 13 >cd03365 TOPRIM_TopoIIA TOPRIM_TopoIIA: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in proteins of the type IIA family of DNA topoisomerases similar to Saccharomyces cerevisiae Topoisomerase II. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. These proteins also catenate/ decatenate duplex rings. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). This glutamate and two aspartates, cluster together to form a highly acid surface patch. The conserved glutamate may act as a general base in strand joining and as a general acid in strand cleavage by topisomerases. The DXD motif may co-ordinate Mg2+, a cofact Probab=100.00 E-value=0 Score=363.68 Aligned_cols=114 Identities=35% Similarity=0.522 Sum_probs=110.0 Q ss_pred EEEEEECCCCCCCCHHHCC---CCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCC--CCCCCCCCCE Q ss_conf 3799732767642100101---1222012211231314454108776322889999988256654333--3346552518 Q gi|254780222|r 464 TELFIVEGDSAGGSAKQAR---NRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQY--REEDLRYEKV 538 (686) Q Consensus 464 ~eL~lvEGDSA~gsAk~gr---dr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~--~~~~LRYgkI 538 (686) |||||||||||+|+|++|| +|++|||||||||+|||++++.+++++|+||++|++||||++|++| ++++|||||| T Consensus 1 CtL~l~EGDSA~g~a~~g~~~~~r~~~gv~PLrGK~lNv~~~~~~ki~~N~Ei~~l~~~LGl~~g~~~~~d~~~LRYgkI 80 (120) T cd03365 1 CTLILTEGDSAKALAVAGLSVVGRDYYGVFPLRGKLLNVREASHKQIMENAEIQNIKKILGLQHGKSDYESTKSLRYGRL 80 (120) T ss_pred CEEEEECCCCCHHHHHHCCCCCCCCEEEEEECCCEECCHHHCCHHHHHCCHHHHHHHHHHCCCCCCCCCCCHHHCCCCEE T ss_conf 97999768474778872246558751789741887611522768777528889999999856767665565212788858 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHHHHC-CCEEEECC Q ss_conf 9997798771468899999999870888633-93899309 Q gi|254780222|r 539 IIMTDADVDGAHIASLLLTFFYQEMYDLIEQ-KHLFLISP 577 (686) Q Consensus 539 iImTDaD~DGsHI~~Lll~ff~~~~p~Li~~-g~v~~~~t 577 (686) +||||||+||+||++|||||||+|||+|++. |||..+.| T Consensus 81 iI~tDaDvDG~HI~~LlltfF~~~~P~Li~~~gFi~~~~T 120 (120) T cd03365 81 MIMTDQDHDGSHIKGLLINFIHSFWPSLLKIPGFLVEFIT 120 (120) T ss_pred EEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCC T ss_conf 9987898865789999999999981998655685665139 No 14 >pfam00204 DNA_gyraseB DNA gyrase B. This family represents the second domain of DNA gyrase B which has a ribosomal S5 domain 2-like fold. This family is structurally related to PF01119. Probab=100.00 E-value=5.4e-43 Score=314.36 Aligned_cols=171 Identities=35% Similarity=0.536 Sum_probs=159.0 Q ss_pred CCHHHHHCCCCCCCCCCCCCEEEEEEECCCEEEEEEEEEEECCCCCC-EEEEEEECCCCCCCEEECCHHHHHHHHHHHHH Q ss_conf 61222100246776667752033332148015589998541467774-27888300065575241304678999999876 Q gi|254780222|r 268 GLKTYLQTKLENRSLISSEIFTGKTEKKGTHRGTVEWAIAWCEENPE-ITSYCNTILTDEGGTHESGLRVALTRGIKKYA 346 (686) Q Consensus 268 Gi~dyv~~~~~~~~~~~~~~~~~~~~~~~~~~~~veval~~~~~~~~-~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~ 346 (686) ||++|+++++.++.++++++++...+.+...++.+||||+|++++.+ ++||||+|+|++|||||+||+++|+++|++|+ T Consensus 1 Gi~~fv~~l~~~k~~l~~~~~~~~~~~~~~~~~~VevA~q~~~~~~e~~~sfvN~I~T~~GGTH~~gf~~altr~i~~~~ 80 (173) T pfam00204 1 GLKDYVKELNKDKTPLHPEVIYFEGEEAFDDRIEVEVALQWNDSYSENIVSFVNNIPTPDGGTHVDGFRAALTRALNEYA 80 (173) T ss_pred CHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCEEEEEEEECCCCCCEEEEEECCEECCCCCCHHHHHHHHHHHHHHHHH T ss_conf 98999999856999788988489856527899389999995288875388768886999999789999999999999999 Q ss_pred HHCC-CCCCCCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3018-833222332222364269999776345607644662025232000012101588999774013568899999999 Q gi|254780222|r 347 ELTQ-NKRAISIISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKLLEWVIER 425 (686) Q Consensus 347 ~~~~-~kk~~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~~~ 425 (686) ++.+ .+++.+|+++|||+||+++|||+|+||+|+||||++|+|++++..|+.++++.|..||++||..++.|+++++.+ T Consensus 81 ~k~~~~k~~~~i~~~diregl~~visvki~~P~FegQTK~kL~~~ev~~~v~~~v~~~l~~~l~~np~~a~~Ii~k~~~~ 160 (173) T pfam00204 81 KKKGLLKKKDKITGDDIREGLTAVVSVKIPNPQFEGQTKEKLGNSEVRSAVESVVSEKFAKFLEKNPEIAKKILEKALAA 160 (173) T ss_pred HHHCCCCCCCCCCHHHHHCCEEEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHH T ss_conf 98087645676689999706289999950699635730264279889999999999999999998999999999999999 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9965454433444 Q gi|254780222|r 426 SEERLRRRKQREV 438 (686) Q Consensus 426 a~~r~~~~k~k~~ 438 (686) +++|++++++|+. T Consensus 161 ~~aR~aakkare~ 173 (173) T pfam00204 161 AKARLAARKAREL 173 (173) T ss_pred HHHHHHHHHHCCC T ss_conf 9999999985379 No 15 >cd00822 TopoII_Trans_DNA_gyrase TopoIIA_Trans_DNA_gyrase: Transducer domain, having a ribosomal S5 domain 2-like fold, of the type found in proteins of the type IIA family of DNA topoisomerases similar to the B subunits of E. coli DNA gyrase and E. coli Topoisomerase IV which are heterodimers composed of two subunits. The type IIA enzymes are the predominant form of topoisomerase and are found in some bacteriophages, viruses and archaea, and in all bacteria and eukaryotes. All type IIA topoisomerases are related to each other at amino acid sequence level, though their oligomeric organization sometimes differs. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. TopoIIA enzymes also catenate/ decatenate duplex rings. E.coli DNA gyrase is a Probab=100.00 E-value=1.4e-42 Score=311.57 Aligned_cols=169 Identities=38% Similarity=0.599 Sum_probs=157.6 Q ss_pred CCCHHHHHCCCCCCCCCCCCCEEEEEEECCCEEEEEEEEEEECCCCCC-EEEEEEECCCCCCCEEECCHHHHHHHHHHHH Q ss_conf 861222100246776667752033332148015589998541467774-2788830006557524130467899999987 Q gi|254780222|r 267 GGLKTYLQTKLENRSLISSEIFTGKTEKKGTHRGTVEWAIAWCEENPE-ITSYCNTILTDEGGTHESGLRVALTRGIKKY 345 (686) Q Consensus 267 ~Gi~dyv~~~~~~~~~~~~~~~~~~~~~~~~~~~~veval~~~~~~~~-~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~ 345 (686) |||.||+++++.+++++++++|+++.+.++ ..+||||+|++++.+ +.||||+|+|++||||++||+.||+++|++| T Consensus 1 gGi~dfv~~l~~~k~~l~~~~i~~~~~~~~---~~vEvA~qw~~~~~e~i~sf~N~I~T~~GGTH~~Gfk~altr~i~~y 77 (172) T cd00822 1 GGLKDFVEELNKDKEPLHEEPIYIEGEKDG---VEVEVALQWTDSYSENILSFVNNIPTPEGGTHETGFRAALTRAINDY 77 (172) T ss_pred CCHHHHHHHHHCCCCCCCCCCEEEEEEECC---EEEEEEEEECCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 988999999846998588999799987489---89999998358888607860668467999852789999999999999 Q ss_pred HHHCCC--CCCCCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 630188--332223322223642699997763456076446620252320000121015889997740135688999999 Q gi|254780222|r 346 AELTQN--KRAISIISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKLLEWVI 423 (686) Q Consensus 346 ~~~~~~--kk~~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~ 423 (686) ++..+. +++.+++++|||+||+++||++|+||+|+||||++|+|++++..|++++++.|..||++||..++.|+++++ T Consensus 78 ~~~~~l~k~~~~~~~~dDi~egl~~iiSvki~~PqFegQTK~KL~s~ev~~~v~~~v~~~l~~~l~~np~~a~~Ii~k~i 157 (172) T cd00822 78 AKKNNLLKKKDVKLTGDDIREGLTAVISVKVPEPQFEGQTKDKLGNSEVRSIVESAVREALEEWLEENPEEAKKILEKAI 157 (172) T ss_pred HHHHCCCCCCCCCCCHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHH T ss_conf 99825023354789999996436488552536882266220521788899999999999999999989999999999999 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999965454433444 Q gi|254780222|r 424 ERSEERLRRRKQREV 438 (686) Q Consensus 424 ~~a~~r~~~~k~k~~ 438 (686) .++++|++++++|++ T Consensus 158 ~~a~aR~~akkare~ 172 (172) T cd00822 158 LAAKAREAARKAREL 172 (172) T ss_pred HHHHHHHHHHHHCCC T ss_conf 999999999973479 No 16 >cd03481 TopoIIA_Trans_ScTopoIIA TopoIIA_Trans_ScTopoIIA: Transducer domain, having a ribosomal S5 domain 2-like fold, of the type found in proteins of the type IIA family of DNA topoisomerases similar to Saccharomyces cerevisiae Topo IIA. S. cerevisiae Topo IIA is a homodimer encoded by a single gene. The type IIA enzymes are the predominant form of topoisomerase and are found in some bacteriophages, viruses and archaea, and in all bacteria and eukaryotes. All type IIA topoisomerases are related to each other at amino acid sequence level, though their oligomeric organization sometimes differs. TopoIIA enzymes cut both strands of the duplex DNA to remove (relax) both positive and negative supercoils in DNA. These enzymes covalently attach to the 5' ends of the cut DNA, separate the free ends of the cleaved strands, pass another region of the duplex through this gap, then rejoin the ends. TopoIIA enzymes also catenate/ decatenate duplex rings. This transducer domain is homologous to t Probab=99.97 E-value=5.9e-32 Score=238.62 Aligned_cols=147 Identities=19% Similarity=0.221 Sum_probs=119.0 Q ss_pred CHHHHHCCCCCCCCC--CCCCEEEEEEECCCEEEEEEEEEEECCCCCCEEEEEEECCCCCCCEEECCHHHHHHHHHHHHH Q ss_conf 122210024677666--775203333214801558999854146777427888300065575241304678999999876 Q gi|254780222|r 269 LKTYLQTKLENRSLI--SSEIFTGKTEKKGTHRGTVEWAIAWCEENPEITSYCNTILTDEGGTHESGLRVALTRGIKKYA 346 (686) Q Consensus 269 i~dyv~~~~~~~~~~--~~~~~~~~~~~~~~~~~~veval~~~~~~~~~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~ 346 (686) +.+|++++..+.... .++++..+ ... .+||||++.+++.++++||||||+|.+|||||+|+.++|++.|.+++ T Consensus 3 F~~Yv~LY~~~~~~~~~~~~~~~~e---~~~--~rWeV~v~~s~~~f~qvSFVNgI~T~~GGtHVd~i~~~I~~~l~~~i 77 (153) T cd03481 3 FKDYVKLYLKDANKEDGPPPPVVYE---PVN--DRWEVAVALSDGQFQQVSFVNSIATTKGGTHVDYVADQIVKKLDEVV 77 (153) T ss_pred HHHHHHHHCCCCCCCCCCCCCEEEE---CCC--CCEEEEEEECCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 7899998678764556889977980---379--98899999879982798886660069998666999999999999999 Q ss_pred HHCCCCCCCCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 30188332223322223642699997763456076446620252320000121015889997740135688999999999 Q gi|254780222|r 347 ELTQNKRAISIISDDLMISAVGILSVFIREPEFAGQTKTKLVSLDAQRTVENALRDPFDHYLVQNPIEATKLLEWVIERS 426 (686) Q Consensus 347 ~~~~~kk~~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~~~~~~v~~~i~~~l~~~l~~n~~~a~~I~~~~~~~a 426 (686) ++++ ++.+++++++||++|++||+|.|+||+|+|||||+|+||..+|+++|.++++|.+.+.+. . |++++++.+ T Consensus 78 ~kk~-K~~~~ik~~~Ik~~l~ifv~~~I~NP~FdSQTKe~Ltt~~~~FGsk~~ls~~fi~k~~k~-~----i~~~v~~~a 151 (153) T cd03481 78 KKKN-KGGINVKPFQVKNHLWIFVNCLIENPSFDSQTKETLTTKPKSFGSKCELSEKFLKKAVKS-G----IVESVLSWA 151 (153) T ss_pred HHHC-CCCCCCCHHHHHHHCEEEEEEEECCCCCCCHHHHHHCCCHHHCCCCCCCCHHHHHHHHHC-H----HHHHHHHHH T ss_conf 9856-788777999998441799999826998662123242485565698766799999999834-1----999999886 No 17 >pfam00986 DNA_gyraseB_C DNA gyrase B subunit, carboxyl terminus. The amino terminus of eukaryotic and prokaryotic DNA topoisomerase II are similar, but they have a different carboxyl terminus. The amino-terminal portion of the DNA gyrase B protein is thought to catalyse the ATP-dependent super-coiling of DNA. See pfam00204. The carboxyl-terminal end supports the complexation with the DNA gyrase A protein and the ATP-independent relaxation. This family also contains Topoisomerase IV. This is a bacterial enzyme that is closely related to DNA gyrase,. Probab=99.83 E-value=5.7e-21 Score=163.22 Aligned_cols=64 Identities=45% Similarity=0.737 Sum_probs=59.2 Q ss_pred CCCCEEEEEEECCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 78860786850677889889988603940007999986641026798999999880798168999999 Q gi|254780222|r 608 GKGKIEISRFKGLGEMLASQLKETAMDANKRTLLRVEINKDLNSLKDTQDSINKLMGTKADERFKFIQ 675 (686) Q Consensus 608 ~~~k~~i~ryKGLGe~~~~e~~et~~~p~~r~l~~v~~~~~~~~~~~~~~~f~~lmG~~~~~RK~wi~ 675 (686) ++++++||||||||||+|+|||||||||++|+|+||++++ +..++.+|+.|||+++++||+||+ T Consensus 2 ~~~k~~I~RfKGLGEM~p~qL~eTTmdp~~R~L~~V~~~d----~~~~~~~~~~LMG~~~~~R~~~I~ 65 (65) T pfam00986 2 GNKKVEIQRFKGLGEMNPEQLWETTMDPETRRLLQVTIED----AEEADEMFSTLMGKDVEPRREFIE 65 (65) T ss_pred CCCCCEEEECCCCCCCCHHHHHHHCCCHHCCEEEEEECCC----HHHHHHHHHHHCCCCCHHHHHHCC T ss_conf 9887535753576779999999873383106789886388----899999999983999689897639 No 18 >PRK04184 DNA topoisomerase VI subunit B; Validated Probab=99.54 E-value=3.5e-12 Score=102.88 Aligned_cols=315 Identities=20% Similarity=0.265 Sum_probs=184.7 Q ss_pred EECCCCHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHHHH-HCCCCCEEEEEECC-C---CEEEEEECCCCCCCCCCCCC Q ss_conf 2311210343269815467775502202443666678777-61899769999828-9---83999966984344417665 Q gi|254780222|r 50 RILEGLEPVRMRPGMYIGGTDEKALHHLFSEVIDNAMDEV-IAGYANLIEVSLDG-N---GFLTVVDNGRGIPVENHPKF 124 (686) Q Consensus 50 ~~L~glE~VrkRP~mYIGst~~~GL~~i~~EIldNavDe~-~~g~~~~I~V~i~~-d---g~IsV~ddGrGIPv~~h~k~ 124 (686) +.++-.|=-++-..|-==+.-.++|+..+.|.||||.|-. .+|---.|.|.|.. + =.|+|.|||+|||-+. T Consensus 12 ~~iS~aEFF~kNk~~lGFd~~~ral~ttvkE~VdNsLDAcEe~gILPdI~v~I~~~~~~~y~v~VeDNGpGI~~~~---- 87 (533) T PRK04184 12 REISVAEFFEKNKHMLGFDSPARALYTTVKELVDNSLDACEEAGILPDIKVEIRRVGEDYYRVIVEDNGPGIPPEQ---- 87 (533) T ss_pred HCCCHHHHHHHCHHHCCCCCCCHHHHHHHHHHHHCCHHHHHHHCCCCCEEEEEEECCCCEEEEEEECCCCCCCHHH---- T ss_conf 4179999988575641879853179999999973317668860889865999998378569999944889988678---- Q ss_pred CCCEEEEEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHEEEEE-----EEEEECCCEEEEEEEE------CCCEECCC Q ss_conf 787012345452002477777741563144538898762032126-----9999418838999972------79451131 Q gi|254780222|r 125 PNKSTLEIILTTLHAGGKFDNSTYEISGGLHGVGISVVNALSDEL-----HVTVARQNTIFSQKFS------RGIPLSPL 193 (686) Q Consensus 125 ~~~~~~Elif~~L~aggnfdd~~yk~sgGlnGvGaklvNalS~~f-----~Vev~r~gk~y~q~f~------~g~~~~~~ 193 (686) ...|||.|++||||.-- +-|=|.-|.|+|.+-.+|.-- .|.+...+......|+ .|.|..-. T Consensus 88 -----ip~vFGk~L~GSKf~~~--rQsRGqqGiGis~avlysQlTTGkP~~V~S~~~~~~~a~~~~l~iD~~kNep~i~~ 160 (533) T PRK04184 88 -----IPKVFGKLLYGSKFHRL--RQSRGQQGIGISAAVLYAQMTTGKPVRVISSTGGSKKAYEFELKIDTEKNEPEILE 160 (533) T ss_pred -----HHHHHHHHHCCCCCCHH--HHCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCEEEEEEEEEECCCCCCEEEC T ss_conf -----88999888502532402--11477535247888777631268865999647998526999999675778734533 Q ss_pred -EECCCCCCCCCEEEEEEECHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCC-------CCCCCCCEEEEEE Q ss_conf -12167677885399999842432001222289999988887530577359995164532-------2245541123431 Q gi|254780222|r 194 -EKKEKVRNKRGTRITFRPDPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIA-------EKYGIPEKEEFYF 265 (686) Q Consensus 194 -~~i~~~~~k~GT~ItF~PD~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~-------~~~~~~~~~~f~~ 265 (686) .........+||+|++.-.-..-. . -..+...++..|..||-.+|.|.+-.... +....+..+.-.+ T Consensus 161 ~~~~~~~~~~hGt~Ve~~ieg~~~~--~---r~~v~eYlr~TaIvnPhA~I~f~~Pdg~~~~f~R~t~~lP~~P~eikPH 235 (533) T PRK04184 161 REERDNWKRWHGTRVELELEGNWYR--A---KQGIYEYLKRTAIVNPHARITLKDPDGEELVFERATDELPKPPKEIKPH 235 (533) T ss_pred CCEECCCCCCCCEEEEEEEEEEHHH--H---HCCHHHHHHHHHCCCCCEEEEEECCCCCEEEEEECCCCCCCCCCCCCCC T ss_conf 4120477788843999999764154--4---2369999976510378449999889998799840456689898767899 Q ss_pred CCCCH--HHHHCCCCC---------------------------------------------------------------- Q ss_conf 58612--221002467---------------------------------------------------------------- Q gi|254780222|r 266 PGGLK--TYLQTKLEN---------------------------------------------------------------- 279 (686) Q Consensus 266 ~~Gi~--dyv~~~~~~---------------------------------------------------------------- 279 (686) |-|+. .+..++... T Consensus 236 P~Gvelg~L~~ml~~t~~~tl~~FL~~eFsrVg~~~A~ei~~~agl~~~~~p~~l~~~e~~~L~~a~~~~k~~~Ppt~cL 315 (533) T PRK04184 236 PHGVDLGTLKRMAARTDRSTLSEFLVSEFSRVGEKTAKEILEYAGLDPNKKPKELTWEEAERLVEAMKKMKFMAPPTDCL 315 (533) T ss_pred CCCCCHHHHHHHHHHCCCCCHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 86265889999998759977899998543556989999999972899999900189999999999998477889998745 Q ss_pred ------------CCCCCCCCEEEEEEE----CCCEEEEEEEEEEECCC-----CCCEEEEEEECCCC-CCCEEECCHHHH Q ss_conf ------------766677520333321----48015589998541467-----77427888300065-575241304678 Q gi|254780222|r 280 ------------RSLISSEIFTGKTEK----KGTHRGTVEWAIAWCEE-----NPEITSYCNTILTD-EGGTHESGLRVA 337 (686) Q Consensus 280 ------------~~~~~~~~~~~~~~~----~~~~~~~veval~~~~~-----~~~~~SFVN~I~T~-~GGTHv~g~~~a 337 (686) +..+.++.+ ..... -..+...||++++|..+ ..++.=|+|-||-. +.|. .+ T Consensus 316 SPIGe~~i~~Gl~k~~~peF~-aavTR~p~vy~G~PF~VEvgiAyGG~~~~~~~~~vlRFANRVPLly~~g~------ca 388 (533) T PRK04184 316 SPIGEELIEKGLKKILQPEFV-AAVTRKPKVYSGHPFIVEVGIAYGGEIPPSGKPELLRFANRVPLLYDEGA------CV 388 (533) T ss_pred CCCCHHHHHHHHHHHCCCCEE-EEECCCCCCCCCCCEEEEEEEEECCCCCCCCCCEEEEEECCCCEEECCCC------CH T ss_conf 767889999999864394168-98545884038987799997310577788897159997056871212664------17 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCC-EEEEEEEEEEHHHCCCCCCCCCC-CHHHCC Q ss_conf 9999998763018833222332222364-26999977634560764466202-523200 Q gi|254780222|r 338 LTRGIKKYAELTQNKRAISIISDDLMIS-AVGILSVFIREPEFAGQTKTKLV-SLDAQR 394 (686) Q Consensus 338 i~k~i~~~~~~~~~kk~~~i~~~dIk~~-L~~~Isv~I~nP~FesQTKekL~-s~~~~~ 394 (686) ++++|... ..+ ..+++-. -.+ +.++|++-.++--|.|-.||.+. .|++.. T Consensus 389 itkav~~i-nWr--~Ygl~q~----p~gP~~v~VhvaSt~VPftsesKEaIAd~pEI~~ 440 (533) T PRK04184 389 ITKAVESI-NWK--RYGLSQP----PDAPLVVLVHVASTKVPYTSEGKEAIADVPEIEK 440 (533) T ss_pred HHHHHHHC-CHH--HCCCCCC----CCCCEEEEEEEEECCCCCCCCCHHHHCCCHHHHH T ss_conf 88887747-856--5187899----9987899999620688967743554337789999 No 19 >TIGR01052 top6b DNA topoisomerase VI, B subunit; InterPro: IPR005734 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . This entry represents subunit B of topoisomerase VI, a type IIB topoisomerase found predominantly in archaea, but also in a few eukayotes, such as the plant Arabidopsis thaliana . This enzyme assembles as a heterotetramer, consisting of two A subunits required for DNA cleavage and two B subunits required for ATP hydrolysis. The B subunit is structurally similar to the ATPase domain of type IIA topoisomerases, but the A subunit is distinct, and instead shares homology with the Spo11 protein that mediates double-strand DNA breaks during meiotic recombination in eukaryotes . Therefore, though related to type IIA topoisomerases, topoisomerase VI may have a distinctive mechanism of action. More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0006265 DNA topological change, 0006268 DNA unwinding during replication, 0005694 chromosome. Probab=99.28 E-value=5.3e-11 Score=94.81 Aligned_cols=163 Identities=25% Similarity=0.306 Sum_probs=113.9 Q ss_pred CCCHHHHHHHHHHHHHHHHH-CCCCCEEEEEEC---C-------CC-EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEE Q ss_conf 55022024436666787776-189976999982---8-------98-399996698434441766578701234545200 Q gi|254780222|r 71 EKALHHLFSEVIDNAMDEVI-AGYANLIEVSLD---G-------NG-FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLH 138 (686) Q Consensus 71 ~~GL~~i~~EIldNavDe~~-~g~~~~I~V~i~---~-------dg-~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~ 138 (686) .+.|..+|-|.|+||-|-.= ||=-=.|.|.|. + |. .|.|+|||+|||.+.-| .|||.|+ T Consensus 26 iRsL~~~ihElVtNSLDAcEEAGILP~I~v~I~~~g~seGrGGr~HY~v~VEDNGPGIP~~~IP---------kVFGKmL 96 (662) T TIGR01052 26 IRSLTTVIHELVTNSLDACEEAGILPDIKVEIEKIGKSEGRGGRDHYKVVVEDNGPGIPEEYIP---------KVFGKML 96 (662) T ss_pred HHHHHHHHHHHHHCHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCCEEEEEEECCCCCCCCCCC---------CHHHHHC T ss_conf 6889999999971424676643799840589898527778999962079984078899700013---------1320211 Q ss_pred CCCCCCCCCEEEECCCCCHHHHHHHHEEEEEE-----EEEEC--CCEEE--EEEEECCCEECCCEECC-------CCCCC Q ss_conf 24777777415631445388987620321269-----99941--88389--99972794511311216-------76778 Q gi|254780222|r 139 AGGKFDNSTYEISGGLHGVGISVVNALSDELH-----VTVAR--QNTIF--SQKFSRGIPLSPLEKKE-------KVRNK 202 (686) Q Consensus 139 aggnfdd~~yk~sgGlnGvGaklvNalS~~f~-----Vev~r--~gk~y--~q~f~~g~~~~~~~~i~-------~~~~k 202 (686) |||||-.. +-|=|.-|+|+|.|=-||.--+ |-+.. +|..| .+.-.=+..+....++. ..++. T Consensus 97 aGSK~h~~--iQSRGqQGiG~sgavL~~Q~TTGKP~~VIs~T~~nGe~~~~~~~l~idV~KNEg~Ivek~eee~~~~~~W 174 (662) T TIGR01052 97 AGSKFHRI--IQSRGQQGIGISGAVLYAQITTGKPVKVISSTGENGEIYAKKVKLKIDVKKNEGEIVEKGEEEWEKPGSW 174 (662) T ss_pred CCCEEEEE--EECCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCEEEEEEEEEEEEEECCCEEEECCCEEECCCCCC T ss_conf 14400033--2067851567999999876147984588805686756158999999853325873430121014179981 Q ss_pred CCEEEEEEECHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECC Q ss_conf 8539999984243200122228999998888753057735999516 Q gi|254780222|r 203 RGTRITFRPDPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCD 248 (686) Q Consensus 203 ~GT~ItF~PD~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~ 248 (686) +||+|+.--.-... .-.-.-++.-||+.|..||--+|.|++- T Consensus 175 ~GTriEle~~~v~y----~~~~~g~yEYlrrt~~anPHA~I~l~~P 216 (662) T TIGR01052 175 RGTRIELEVKDVSY----RRSKQGVYEYLRRTAVANPHAKIVLVEP 216 (662) T ss_pred EEEEEEEEEECCEE----ECCCCCHHHHHHHHHHHCCCCEEEEECC T ss_conf 36899999822255----3111344589899875186408998869 No 20 >COG1389 DNA topoisomerase VI, subunit B [DNA replication, recombination, and repair] Probab=99.19 E-value=1.4e-10 Score=91.95 Aligned_cols=317 Identities=20% Similarity=0.252 Sum_probs=187.4 Q ss_pred EEECCCCHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHHHH-HCCCCCEEEEEECC---C-CEEEEEECCCCCCCCCCCC Q ss_conf 02311210343269815467775502202443666678777-61899769999828---9-8399996698434441766 Q gi|254780222|r 49 IRILEGLEPVRMRPGMYIGGTDEKALHHLFSEVIDNAMDEV-IAGYANLIEVSLDG---N-GFLTVVDNGRGIPVENHPK 123 (686) Q Consensus 49 i~~L~glE~VrkRP~mYIGst~~~GL~~i~~EIldNavDe~-~~g~~~~I~V~i~~---d-g~IsV~ddGrGIPv~~h~k 123 (686) .+.|+-.|--|+-+-|-==+-..++|+..+-|.++||.|-. .+|---.|.|.|.. | =.|.|.|||.|||.+.-| T Consensus 12 ~re~SvAEFF~kNk~mlGf~~p~RsL~~tv~ElV~NSLDA~eeaGILPdI~v~I~~~~~d~y~v~veDNGpGIP~e~IP- 90 (538) T COG1389 12 FRELSVAEFFRKNKEMLGFDGPIRSLTTTVHELVTNSLDACEEAGILPDIKVEIERIGKDHYKVIVEDNGPGIPEEQIP- 90 (538) T ss_pred HHHCCHHHHHHHCHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHCCCCCCEEEEEEECCCCEEEEEEECCCCCCCHHHHH- T ss_conf 3326999998748886189973068999999998531656876088876599998547755899993489999856757- Q ss_pred CCCCEEEEEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHEEEE-----EEEEEECC--CEEEEEEEECCCEECCCEEC Q ss_conf 578701234545200247777774156314453889876203212-----69999418--83899997279451131121 Q gi|254780222|r 124 FPNKSTLEIILTTLHAGGKFDNSTYEISGGLHGVGISVVNALSDE-----LHVTVARQ--NTIFSQKFSRGIPLSPLEKK 196 (686) Q Consensus 124 ~~~~~~~Elif~~L~aggnfdd~~yk~sgGlnGvGaklvNalS~~-----f~Vev~r~--gk~y~q~f~~g~~~~~~~~i 196 (686) .|||.+++||||+-. +-|-|.-|.|+|.+--+|.- ..|.+... ++.|...-.=+..+....++ T Consensus 91 --------kvFGk~LygSKfh~~--~QsRGqqGiGis~avLysQmTtGkPv~V~s~T~~s~~~~~~~l~id~~kNEp~Iv 160 (538) T COG1389 91 --------KVFGKMLYGSKFHRN--IQSRGQQGIGISAAVLYSQMTTGKPVRVISSTGDSGTAYEYELKIDVQKNEPEIV 160 (538) T ss_pred --------HHHHHHHCCCHHHHH--HHCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCEEEEEEEECCCCCCCHHE T ss_conf --------899888602323234--4325532424789999987517986699953788862489999951788876200 Q ss_pred -----CCCCCCCCEEEEEEECHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCC-------CCCCCCEEEEE Q ss_conf -----676778853999998424320012222899999888875305773599951645322-------24554112343 Q gi|254780222|r 197 -----EKVRNKRGTRITFRPDPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAE-------KYGIPEKEEFY 264 (686) Q Consensus 197 -----~~~~~k~GT~ItF~PD~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~-------~~~~~~~~~f~ 264 (686) ....+.+||+|.-.-+-.... .. -.-++..++..|..||--+|.|.+-.-... +...+..+..- T Consensus 161 ~r~~~~~~~~~hGT~Vel~~~~~~~~-~~---~qgi~eYlkrTaiinPhA~I~l~dPdG~~~vf~r~t~~lP~pP~E~kP 236 (538) T COG1389 161 ERGEVENPGGWHGTRVELELKGVWYR-AK---RQGIYEYLKRTAIINPHARIVLKDPDGNLVVFPRSTDKLPKPPKEIKP 236 (538) T ss_pred ECCCCCCCCCCCCEEEEEEECCCCHH-HC---CCCHHHHHHHHHHCCCCEEEEEECCCCCEEEECCCHHHCCCCCCCCCC T ss_conf 00222478887750799995553054-22---247999998875338741799988998678713314117999300489 Q ss_pred ECCCC---------------------------------HHHHHCCCCC-------------------------------- Q ss_conf 15861---------------------------------2221002467-------------------------------- Q gi|254780222|r 265 FPGGL---------------------------------KTYLQTKLEN-------------------------------- 279 (686) Q Consensus 265 ~~~Gi---------------------------------~dyv~~~~~~-------------------------------- 279 (686) ++.|+ .+++++..-+ T Consensus 237 HP~gvd~~~L~~M~~~T~~~tv~~fL~sef~rig~~ta~e~~e~~g~~~~~~p~~L~~~~~~eea~~lv~a~~~~~fm~P 316 (538) T COG1389 237 HPHGVDLDTLKKMAHRTRRSTVREFLVSEFSRIGEKTADELLEYAGFDPDKKPRELTKKKTREEAEKLVEAFKKMKFMAP 316 (538) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCHHHHHHHHHHHHHCCCCCC T ss_conf 97645788999999875343599999999877626679999998467965487886034578999999999985866589 Q ss_pred -----------------CCCCCCCCEEEEEEE----CCCEEEEEEEEEEECCC---C--CCEEEEEEECCCC-CCCEEEC Q ss_conf -----------------766677520333321----48015589998541467---7--7427888300065-5752413 Q gi|254780222|r 280 -----------------RSLISSEIFTGKTEK----KGTHRGTVEWAIAWCEE---N--PEITSYCNTILTD-EGGTHES 332 (686) Q Consensus 280 -----------------~~~~~~~~~~~~~~~----~~~~~~~veval~~~~~---~--~~~~SFVN~I~T~-~GGTHv~ 332 (686) +..+.++ |...... -..+...||++++|..+ . .++.-|+|-||-. +-|. T Consensus 317 ptd~LsPIGee~ie~gLk~~~~pe-fv~avTRkPkvy~G~pFiVEvgiAYGG~i~~~e~~~llRyANrvPLlyd~g~--- 392 (538) T COG1389 317 PTDGLSPIGEELIEKGLKSILNPE-FVAAVTRKPKVYRGHPFIVEVGIAYGGNIPGGEKPLLLRYANRVPLLYDAGS--- 392 (538) T ss_pred CCCCCCCCCHHHHHHHHHHHCCCH-HEEEECCCCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHCCCCCEEEECCC--- T ss_conf 854548455999999998751814-3043125850017986599987511688787752466643045764651576--- Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCC-CHHHCC Q ss_conf 04678999999876301883322233222236426999977634560764466202-523200 Q gi|254780222|r 333 GLRVALTRGIKKYAELTQNKRAISIISDDLMISAVGILSVFIREPEFAGQTKTKLV-SLDAQR 394 (686) Q Consensus 333 g~~~ai~k~i~~~~~~~~~kk~~~i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~-s~~~~~ 394 (686) .++++++.+.- . +.. .+...+ .-+.++|++-..|--|.|--|+... .|++.. T Consensus 393 ---cv~tk~V~si~-W--krY--gl~~~~--aplvvlVhv~St~VPyts~gKqsIa~vpeIe~ 445 (538) T COG1389 393 ---CVITKVVESID-W--KRY--GLDQPD--APLVVLVHVASTNVPYTSAGKQSIADVPEIEN 445 (538) T ss_pred ---CHHHHHHHHCC-C--EEC--CCCCCC--CCEEEEEEECCCCCCCCCCCHHHCCCCHHHHH T ss_conf ---04667644056-2--013--768999--86799998514688867754012036688999 No 21 >PRK00095 mutL DNA mismatch repair protein; Reviewed Probab=99.01 E-value=3.3e-08 Score=75.60 Aligned_cols=221 Identities=19% Similarity=0.231 Sum_probs=131.6 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCC-- Q ss_conf 5022024436666787776189976999982898--399996698434441766578701234545200247777774-- Q gi|254780222|r 72 KALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNG--FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNST-- 147 (686) Q Consensus 72 ~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg--~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~-- 147 (686) ..+.-.+.|+|-||+|- .+++|.|.|...| .|.|.|||.||+-+ . ++++|.. ||-||..+-+ T Consensus 21 erpasvVKELvENsiDA----gAt~I~V~i~~~G~~~I~V~DnG~Gi~~~------d---~~l~~~r-haTSKi~~~~DL 86 (612) T PRK00095 21 ERPASVVKELVENALDA----GATRIDIEIEEGGLKLIRVRDNGCGIPKE------D---LPLALSR-HATSKIASLDDL 86 (612) T ss_pred CCHHHHHHHHHHHHHCC----CCCEEEEEEECCCCCEEEEEECCCCCCHH------H---HHHHHHC-CCCCCCCCHHHH T ss_conf 58899999998413317----99879999974983179999799998989------9---9999757-277678984566 Q ss_pred EE-EECCCCCHHHHHHHHEEEEEEEEEECC--CEEEEEEEECCCEECCCEECCCCCCCCCEEEE----EE--ECHHHEEC Q ss_conf 15-631445388987620321269999418--83899997279451131121676778853999----99--84243200 Q gi|254780222|r 148 YE-ISGGLHGVGISVVNALSDELHVTVARQ--NTIFSQKFSRGIPLSPLEKKEKVRNKRGTRIT----FR--PDPKIFGD 218 (686) Q Consensus 148 yk-~sgGlnGvGaklvNalS~~f~Vev~r~--gk~y~q~f~~g~~~~~~~~i~~~~~k~GT~It----F~--PD~~iFg~ 218 (686) +. .|-|=-|=-.+...+.| ++++.++.. ..-|+..++.|.... .+.+....||+|+ |. |=..+|-. T Consensus 87 ~~i~T~GFRGEAL~SIa~vs-~~~i~s~~~~~~~g~~i~~~~g~~~~----~~~~~~~~GT~V~V~~LF~N~PaRrKfLK 161 (612) T PRK00095 87 EAIRTLGFRGEALPSIASVS-RLTLTSRTADAAEGWQIVVEGGEIVE----VKPAAHPVGTTIEVRDLFFNTPARRKFLK 161 (612) T ss_pred HHCCCCCCCHHHHHHHHHHE-EEEEEECCCCCCCEEEEEEECCEECC----EEECCCCCCCEEEECHHHCCCHHHHHHHC T ss_conf 11767876188899877326-78999678997672899994996753----23336899849996045255768898757 Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEEC--CCCHHHHHCCCCCCCCCCCCCEEEEEEECC Q ss_conf 122228999998888753057735999516453222455411234315--861222100246776667752033332148 Q gi|254780222|r 219 KAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAEKYGIPEKEEFYFP--GGLKTYLQTKLENRSLISSEIFTGKTEKKG 296 (686) Q Consensus 219 ~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~~~~~~~~~~f~~~--~Gi~dyv~~~~~~~~~~~~~~~~~~~~~~~ 296 (686) +..-....|.+.++.+|..+|.+.+.+..+.+ ..|..+ +.+.+-+..+.+..- ....+....+. T Consensus 162 s~~~E~~~I~~~v~~~AL~~P~I~f~l~~~~k----------~~~~~~~~~~l~~ri~~i~G~~~--~~~li~v~~~~-- 227 (612) T PRK00095 162 SEKTELGHIDDVVRRLALAHPDVAFTLTHNGK----------LVLQTRANGDLLQRLAAILGRDF--AENALPIDAEH-- 227 (612) T ss_pred CCCHHHHHHHHHHHHHHHHCCCEEEEEEECCE----------EEECCCCCCCHHHHHHHHHCHHH--HHHCEEEEEEC-- T ss_conf 98179999999999997517975899998999----------97235888789999999849898--96277986016-- Q ss_pred CEEEEEEEEEEE---CCCC-CCEEEEEEECCCCC Q ss_conf 015589998541---4677-74278883000655 Q gi|254780222|r 297 THRGTVEWAIAW---CEEN-PEITSYCNTILTDE 326 (686) Q Consensus 297 ~~~~~veval~~---~~~~-~~~~SFVN~I~T~~ 326 (686) ....+.-.++. +... ..|.-|||+-+-.+ T Consensus 228 -~~~~i~G~is~P~~~r~~r~~q~~FVNgR~V~~ 260 (612) T PRK00095 228 -GDLRLSGYVGLPTLSRANRDYQYLFVNGRYVRD 260 (612) T ss_pred -CCEEEEEEEECCHHCCCCCCCEEEEECCCCCCC T ss_conf -976999999583120367762799987840478 No 22 >COG0323 MutL DNA mismatch repair enzyme (predicted ATPase) [DNA replication, recombination, and repair] Probab=98.51 E-value=4.4e-06 Score=61.02 Aligned_cols=160 Identities=19% Similarity=0.285 Sum_probs=101.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCE-- Q ss_conf 022024436666787776189976999982898--3999966984344417665787012345452002477777741-- Q gi|254780222|r 73 ALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNG--FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNSTY-- 148 (686) Q Consensus 73 GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg--~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~y-- 148 (686) ...-.+.|+|.||+| | -+++|+|.+...| .|.|.|||.||+-+ . ++++ ..=||-||-.+.+. T Consensus 23 rPaSVVKELVENSlD---A-GAt~I~I~ve~gG~~~I~V~DNG~Gi~~~------D---l~la-~~rHaTSKI~~~~DL~ 88 (638) T COG0323 23 RPASVVKELVENSLD---A-GATRIDIEVEGGGLKLIRVRDNGSGIDKE------D---LPLA-LLRHATSKIASLEDLF 88 (638) T ss_pred CHHHHHHHHHHCCCC---C-CCCEEEEEEECCCEEEEEEEECCCCCCHH------H---HHHH-HHCCCCCCCCCHHHHH T ss_conf 588999999861030---4-99889999935980189998899999989------9---9999-8651464578655687 Q ss_pred -EEECCCCCHHHHHHHHEEEEEEEEEEC-CCEEEEEEE-ECCCEECCCEECCCCCCCCCEEEEE------EECHHHEECC Q ss_conf -563144538898762032126999941-883899997-2794511311216767788539999------9842432001 Q gi|254780222|r 149 -EISGGLHGVGISVVNALSDELHVTVAR-QNTIFSQKF-SRGIPLSPLEKKEKVRNKRGTRITF------RPDPKIFGDK 219 (686) Q Consensus 149 -k~sgGlnGvGaklvNalS~~f~Vev~r-~gk~y~q~f-~~g~~~~~~~~i~~~~~k~GT~ItF------~PD~~iFg~~ 219 (686) -.|-|=-|=-....-.. .++++.+.. +.....|-+ +.|.... .+..+....||+|.- .|=..+|-.+ T Consensus 89 ~I~TlGFRGEAL~SIasV-srlti~Srt~~~~~~~~~~~~g~~~~~---~~~p~a~~~GTtVeV~dLF~NtPaRrKflks 164 (638) T COG0323 89 RIRTLGFRGEALASIASV-SRLTITSRTAEASEGTQIYAEGGGMEV---TVKPAAHPVGTTVEVRDLFYNTPARRKFLKS 164 (638) T ss_pred HHHCCCCCHHHHHHHHHH-HEEEEEECCCCCCCEEEEEECCCCCCC---CCCCCCCCCCCEEEEHHHHCCCHHHHHHHCC T ss_conf 743567338779998875-444899416776763899962884112---3347779999899845712558476654145 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCEEEEECCCC Q ss_conf 2222899999888875305773599951645 Q gi|254780222|r 220 AGFDAARLLKMTQSKAYLSGRVKTCWSCDKE 250 (686) Q Consensus 220 ~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~ 250 (686) ..-.+..|...++++|..+|.+.+.+..+.+ T Consensus 165 ~~~E~~~i~~vv~r~ALahp~I~F~l~~~gk 195 (638) T COG0323 165 EKTEFGHITELINRYALAHPDISFSLSHNGK 195 (638) T ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEEEECCE T ss_conf 3899999999999987307972799998890 No 23 >TIGR00585 mutl DNA mismatch repair protein MutL; InterPro: IPR014763 This entry represents DNA mismatch repair proteins, such as MutL. The dimeric MutL protein has a key function in communicating mismatch recognition by MutS to downstream repair processes. Mismatch repair contributes to the overall fidelity of DNA replication by targeting mispaired bases that arise through replication errors during homologous recombination and as a result of DNA damage. It involves the correction of mismatched base pairs that have been missed by the proofreading element of the DNA polymerase complex .. Probab=98.36 E-value=5.1e-06 Score=60.64 Aligned_cols=189 Identities=20% Similarity=0.300 Sum_probs=116.1 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCEE Q ss_conf 5022024436666787776189976999982898--39999669843444176657870123454520024777777415 Q gi|254780222|r 72 KALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNG--FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNSTYE 149 (686) Q Consensus 72 ~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg--~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~yk 149 (686) ....-.+.|.|+||.| || +++|.|.+...| .|.|.|||.||.-+ .. .+.+.=||=||..+-+.- T Consensus 21 ~~p~~vVKELvENSLD---AG-At~I~v~~~~gG~~~I~V~DNG~Gi~~~------d~----~~~~~~haTSKI~~~~DL 86 (367) T TIGR00585 21 ERPASVVKELVENSLD---AG-ATKIEVEIEEGGLKLIEVSDNGSGIDKE------DL----ELACERHATSKIQSFEDL 86 (367) T ss_pred HHHHHHHHHHHHHHHC---CC-CCEEEEEEEECCEEEEEEEECCCCCCHH------HH----HHHHCCCCCCCCCCHHHH T ss_conf 3279999998873121---48-8589999962653589999778567777------79----986123577632247899 Q ss_pred ---EECCCCCHHHHHHHHEEEEEEEEE------------ECCCEEEEEEEECCCEECCCEECCCCCCCCCEEEEE----- Q ss_conf ---631445388987620321269999------------418838999972794511311216767788539999----- Q gi|254780222|r 150 ---ISGGLHGVGISVVNALSDELHVTV------------ARQNTIFSQKFSRGIPLSPLEKKEKVRNKRGTRITF----- 209 (686) Q Consensus 150 ---~sgGlnGvGaklvNalS~~f~Vev------------~r~gk~y~q~f~~g~~~~~~~~i~~~~~k~GT~ItF----- 209 (686) .|-|==|==++.--..| +++|.+ ..++..|...+.+|.... ..+.++....||+|+= T Consensus 87 ~~i~TlGFRGEAL~Si~~vs-~~~i~t~~~~~~~W~yQ~~~~~~~~~~~~~~G~~~~--~~~~p~~~~~GT~V~V~~LF~ 163 (367) T TIGR00585 87 ERIRTLGFRGEALASISSVS-RLTITTKTSGPTAWFYQGAADGLALQVLYAEGKNII--EEIKPAPRPVGTTVEVRDLFY 163 (367) T ss_pred HHHHCCCCHHHHHHHHHHHH-CCEEEEEECCCCCCEECCCCCCEEEEEEECCCCEEE--ECCCCCCCCCCCEEEEECCCC T ss_conf 98640483468999998643-334787425898612212356404676620687342--002566889897899730315 Q ss_pred -EECHHH-E-ECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEE--CCCCHHH-HHCCCCC Q ss_conf -984243-2-0012222899999888875305773599951645322245541123431--5861222-1002467 Q gi|254780222|r 210 -RPDPKI-F-GDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAEKYGIPEKEEFYF--PGGLKTY-LQTKLEN 279 (686) Q Consensus 210 -~PD~~i-F-g~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~~~~~~~~~~f~~--~~Gi~dy-v~~~~~~ 279 (686) .|=..+ | -.+..-....+...+.+.|-.+|+|.+.|..+... .......-.+.. ..++++- +..+... T Consensus 164 N~P~R~k~~s~k~~~~e~~~~~~~l~~~~l~~~~v~f~~~~~~~~--~~~g~~~~~~~~~~~~~~~~~~~~~v~g~ 237 (367) T TIGR00585 164 NLPVRRKEFSLKSPKKELRKILDLLERYALIHPDVSFSLTSDESF--QTNGKKVLSLSTKPNQSLKERRIRSVFGT 237 (367) T ss_pred CCCHHHHHHHCCCHHHHHHHHHHHHHHHEEECCCCEEEEEECCEE--CCCCCEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 852026877557806889999999864401337935899971166--47741003352488730789999988744 No 24 >cd00329 TopoII_MutL_Trans MutL_Trans: transducer domain, having a ribosomal S5 domain 2-like fold, conserved in the C-terminal domain of type II DNA topoisomerases (Topo II) and DNA mismatch repair (MutL/MLH1/PMS2) proteins. This transducer domain is homologous to the second domain of the DNA gyrase B subunit, which is known to be important in nucleotide hydrolysis and the transduction of structural signals from ATP-binding site to the DNA breakage/reunion regions of the enzymes. The GyrB dimerizes in response to ATP binding, and is homologous to the N-terminal half of eukaryotic Topo II and the ATPase fragment of MutL. Type II DNA topoisomerases catalyze the ATP-dependent transport of one DNA duplex through another, in the process generating transient double strand breaks via covalent attachments to both DNA strands at the 5' positions. Included in this group are proteins similar to human MLH1 and PMS2. MLH1 forms a heterodimer with PMS2 which functions in meiosis and in DNA mismatch Probab=98.33 E-value=9.2e-07 Score=65.72 Aligned_cols=74 Identities=31% Similarity=0.360 Sum_probs=63.3 Q ss_pred EEEEEEEEEECCC----CCCEEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 5589998541467----774278883000655752413046789999998763018833222332222364269999776 Q gi|254780222|r 299 RGTVEWAIAWCEE----NPEITSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQNKRAISIISDDLMISAVGILSVFI 374 (686) Q Consensus 299 ~~~veval~~~~~----~~~~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~~kk~~~i~~~dIk~~L~~~Isv~I 374 (686) ...++..++|.+. ...|.+|||++++.+||+|..++.+++.+.++ ..+.+.+.++++++.+ T Consensus 26 ~~~i~g~i~~~~~~~~~~~~q~~fVN~R~v~~~~~~~~ai~~~y~~~~~---------------~~~~~~~p~~vl~i~~ 90 (107) T cd00329 26 GFRVEGAISYPDSGRSSKDRQFSFVNGRPVREGGTHVKAVREAYTRALN---------------GDDVRRYPVAVLSLKI 90 (107) T ss_pred CEEEEEEEECCCCCCCCCCCEEEEECCEEECCCHHHHHHHHHHHHHHHH---------------CCCCCCCCEEEEEEEE T ss_conf 9899999957853567785479999784860776999999999999861---------------4678988789999984 Q ss_pred EH--HHCC-CCCCCCC Q ss_conf 34--5607-6446620 Q gi|254780222|r 375 RE--PEFA-GQTKTKL 387 (686) Q Consensus 375 ~n--P~Fe-sQTKekL 387 (686) +. |.|. +|||+++ T Consensus 91 ~~~~vDvNvhP~K~ev 106 (107) T cd00329 91 PPSLVDVNVHPTKEEV 106 (107) T ss_pred CCHHEEECCCCCCCCC T ss_conf 8153872749983413 No 25 >cd00075 HATPase_c Histidine kinase-like ATPases; This family includes several ATP-binding proteins for example: histidine kinase, DNA gyrase B, topoisomerases, heat shock protein HSP90, phytochrome-like ATPases and DNA mismatch repair proteins Probab=98.19 E-value=5.9e-06 Score=60.17 Aligned_cols=80 Identities=26% Similarity=0.438 Sum_probs=54.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEEECCC---CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCEEE Q ss_conf 2202443666678777618997699998289---8399996698434441766578701234545200247777774156 Q gi|254780222|r 74 LHHLFSEVIDNAMDEVIAGYANLIEVSLDGN---GFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNSTYEI 150 (686) Q Consensus 74 L~~i~~EIldNavDe~~~g~~~~I~V~i~~d---g~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~yk~ 150 (686) |.+++.++|+||++.... ....|.|++..+ -.|+|.|||+|||-+..++ + |.....+ .... T Consensus 1 l~~vl~nLl~NAik~~~~-~~~~I~i~~~~~~~~~~i~V~D~G~Gi~~~~~~~---i------f~~f~~~------~~~~ 64 (103) T cd00075 1 LQQVLLNLLSNAIKHTPE-GGGRITISVERDGDHLEIRVEDNGPGIPEEDLER---I------FERFSDG------SRSR 64 (103) T ss_pred CHHHHHHHHHHHHHHCCC-CCCEEEEEEEECCCEEEEEEEECCCCCCHHHHHH---C------CCCCCCC------CCCC T ss_conf 969999999999986789-9948999999839999999997799408899976---0------6975017------8887 Q ss_pred ECCCCCHHHHHHHHEEEEE Q ss_conf 3144538898762032126 Q gi|254780222|r 151 SGGLHGVGISVVNALSDEL 169 (686) Q Consensus 151 sgGlnGvGaklvNalS~~f 169 (686) ..+-.|+|.++|..+.+.+ T Consensus 65 ~~~g~GlGL~i~~~i~~~~ 83 (103) T cd00075 65 KGGGTGLGLSIVKKLVELH 83 (103) T ss_pred CCCCCCCHHHHHHHHHHHC T ss_conf 7898650599999999988 No 26 >smart00387 HATPase_c Histidine kinase-like ATPases. Histidine kinase-, DNA gyrase B-, phytochrome-like ATPases. Probab=98.17 E-value=3.3e-06 Score=61.88 Aligned_cols=88 Identities=30% Similarity=0.404 Sum_probs=60.1 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCC---CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCC Q ss_conf 75502202443666678777618997699998289---839999669843444176657870123454520024777777 Q gi|254780222|r 70 DEKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGN---GFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNS 146 (686) Q Consensus 70 ~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~d---g~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~ 146 (686) +...|.+++.++|+||++.... ...|.|++..+ -.|+|.|||+|||-+.. +.+|....++.+ + T Consensus 2 d~~~l~~il~Nli~NAik~~~~--~~~I~i~~~~~~~~~~i~V~D~G~Gi~~~~~---------~~if~~f~~~~~---~ 67 (111) T smart00387 2 DPDRLRQVLSNLLDNAIKYTPE--GGRITVTLERDGDHLEITVEDNGPGIPPEDL---------EKIFEPFFRTDG---R 67 (111) T ss_pred CHHHHHHHHHHHHHHHHHHCCC--CCEEEEEEEECCCEEEEEEEECCCCCCHHHH---------HHCCCCCEECCC---C T ss_conf 8899999999999999996779--9789999998399999999988997398999---------642699477799---9 Q ss_pred CEEEECCCCCHHHHHHHHEEEEEEEEE Q ss_conf 415631445388987620321269999 Q gi|254780222|r 147 TYEISGGLHGVGISVVNALSDELHVTV 173 (686) Q Consensus 147 ~yk~sgGlnGvGaklvNalS~~f~Vev 173 (686) ....+-.|+|.++|.-+.+.+..++ T Consensus 68 --~~~~~g~GlGL~i~~~i~~~~~G~i 92 (111) T smart00387 68 --SRKIGGTGLGLSIVKKLVELHGGEI 92 (111) T ss_pred --CCCCCCCCHHHHHHHHHHHHCCCEE T ss_conf --9887976648999999999879989 No 27 >pfam02518 HATPase_c Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90. Probab=98.10 E-value=4.7e-06 Score=60.88 Aligned_cols=88 Identities=36% Similarity=0.550 Sum_probs=61.4 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCC---CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCC Q ss_conf 75502202443666678777618997699998289---839999669843444176657870123454520024777777 Q gi|254780222|r 70 DEKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGN---GFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNS 146 (686) Q Consensus 70 ~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~d---g~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~ 146 (686) +...|.+++.|+|+||++....+ ..|.|.+..+ -.|+|.|||.|||-+.. +.+|....++.+ . T Consensus 2 d~~~l~~i~~nll~NAik~~~~~--~~I~i~~~~~~~~~~i~V~D~G~Gi~~~~~---------~~if~~f~~~~~---~ 67 (111) T pfam02518 2 DEDRLRQVLSNLLDNAIKHAPAG--GEIEVTLERDGGRLRITVEDNGIGIPPEDL---------PKIFEPFFRTDK---D 67 (111) T ss_pred CHHHHHHHHHHHHHHHHHHCCCC--CEEEEEEEEECCEEEEEEEECCCCCCHHHH---------HHHCCCCEECCC---C T ss_conf 88999999999999999977999--759999999599999999989999598999---------887499177799---9 Q ss_pred CEEEECCCCCHHHHHHHHEEEEEEEEE Q ss_conf 415631445388987620321269999 Q gi|254780222|r 147 TYEISGGLHGVGISVVNALSDELHVTV 173 (686) Q Consensus 147 ~yk~sgGlnGvGaklvNalS~~f~Vev 173 (686) ... .+| +|+|.++|+.+.+.+..++ T Consensus 68 ~~~-~~g-~GlGL~i~~~~~~~~~g~i 92 (111) T pfam02518 68 SRK-VGG-TGLGLSIVRKLVELHGGTI 92 (111) T ss_pred CCC-CCC-CCHHHHHHHHHHHHCCCEE T ss_conf 977-797-7647999999999879989 No 28 >PRK13560 hypothetical protein; Provisional Probab=97.63 E-value=0.00022 Score=49.43 Aligned_cols=15 Identities=7% Similarity=-0.015 Sum_probs=7.5 Q ss_pred EEECCCCEEEEEECC Q ss_conf 998289839999669 Q gi|254780222|r 99 VSLDGNGFLTVVDNG 113 (686) Q Consensus 99 V~i~~dg~IsV~ddG 113 (686) ...|.+|++--.|.+ T Consensus 218 F~kD~~G~ylgcN~a 232 (807) T PRK13560 218 FWKDEDAKVFGCNDA 232 (807) T ss_pred EEECCCCCEEEECHH T ss_conf 997799518962099 No 29 >COG2205 KdpD Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms] Probab=97.63 E-value=7.5e-05 Score=52.60 Aligned_cols=89 Identities=22% Similarity=0.290 Sum_probs=43.8 Q ss_pred EECCCCHHHHCCCCEEECC-------------------------CCCCCHH-----HHHHHHHHHHHH----------HH Q ss_conf 2311210343269815467-------------------------7755022-----024436666787----------77 Q gi|254780222|r 50 RILEGLEPVRMRPGMYIGG-------------------------TDEKALH-----HLFSEVIDNAMD----------EV 89 (686) Q Consensus 50 ~~L~glE~VrkRP~mYIGs-------------------------t~~~GL~-----~i~~EIldNavD----------e~ 89 (686) +.|+|||.|-.|+=-|=|- |..+|.. +=+.|||+-.|| |. T Consensus 68 al~eGl~~ip~r~i~~~g~~l~E~Dlda~L~r~P~lvLvDELAHTNa~gsrh~KR~qDiEelL~aGIDV~TT~NvQHlES 147 (890) T COG2205 68 ALLEGLEVIPRRRIEYRGRTLSEFDLDAALARRPALVLVDELAHTNAPGSRHPKRWQDIEELLAAGIDVYTTVNVQHLES 147 (890) T ss_pred HHHCCCCCCCCCCCCCCCEEEEECCHHHHHHCCCCEEEEEHHHCCCCCCCCCCCHHHHHHHHHHCCCEEEEEEHHHHHHH T ss_conf 99738766886411217823011257789735997898514330489888883035409999976980677400555767 Q ss_pred HCC---CCCEEEEEEC-CC------CEEEEEECCCCCCCC----CCCCCCCCEEEEE---EEEEEECCCCC Q ss_conf 618---9976999982-89------839999669843444----1766578701234---54520024777 Q gi|254780222|r 90 IAG---YANLIEVSLD-GN------GFLTVVDNGRGIPVE----NHPKFPNKSTLEI---ILTTLHAGGKF 143 (686) Q Consensus 90 ~~g---~~~~I~V~i~-~d------g~IsV~ddGrGIPv~----~h~k~~~~~~~El---if~~L~aggnf 143 (686) +++ ..+.|.|.=. +| ..|.+.| ||=| --.+ -++|.||- .|.+..+-+|. T Consensus 148 LnDvV~~iTgv~VrE~VPD~v~d~AD~v~lVD----i~P~~L~~RL~~-GKVY~~~~a~~A~~nFF~~~NL 213 (890) T COG2205 148 LNDVVAGITGVRVRETVPDRVFDEADEVELVD----IPPDDLRERLKE-GKVYAPEQAERALENFFRKGNL 213 (890) T ss_pred HHHHHHHCCCEEEEEECCHHHHCCCCEEEEEC----CCHHHHHHHHHC-CCCCCHHHHHHHHHHHHCCCCH T ss_conf 88899863670774027806756457179961----887999999975-8823778999999976320307 No 30 >PRK10337 sensor protein QseC; Provisional Probab=97.62 E-value=0.00011 Score=51.49 Aligned_cols=12 Identities=25% Similarity=0.332 Sum_probs=4.6 Q ss_pred CCCCCCCCCCCH Q ss_conf 076446620252 Q gi|254780222|r 379 FAGQTKTKLVSL 390 (686) Q Consensus 379 FesQTKekL~s~ 390 (686) |.+..=..|-|| T Consensus 237 F~adaAHELRTP 248 (446) T PRK10337 237 FTSDAAHELRSP 248 (446) T ss_pred HHHHHHHHHCCH T ss_conf 999788875168 No 31 >COG0326 HtpG Molecular chaperone, HSP90 family [Posttranslational modification, protein turnover, chaperones] Probab=97.54 E-value=0.00035 Score=47.98 Aligned_cols=157 Identities=24% Similarity=0.279 Sum_probs=89.5 Q ss_pred HHHHHHHHHHHH-------HCCC-------CCEEEEEECCC-CEEEEEECCCCCCCC-CCCC--CCCCEEEEEEEEEEEC Q ss_conf 443666678777-------6189-------97699998289-839999669843444-1766--5787012345452002 Q gi|254780222|r 78 FSEVIDNAMDEV-------IAGY-------ANLIEVSLDGN-GFLTVVDNGRGIPVE-NHPK--FPNKSTLEIILTTLHA 139 (686) Q Consensus 78 ~~EIldNavDe~-------~~g~-------~~~I~V~i~~d-g~IsV~ddGrGIPv~-~h~k--~~~~~~~Elif~~L~a 139 (686) +.|.|.||.|-. +.++ --+|.|.+|+| .+++|.|||-|+-=+ +|.- .-..+.-.--+..|+. T Consensus 32 LRELISNAsDAidKlr~~al~~~~~~~~~~~~~I~i~~Dk~~kTLtI~DNGIGMT~~Ev~~~LgTIAkSgT~~F~~~l~~ 111 (623) T COG0326 32 LRELISNASDAIDKLRFEALSDPELGEGDSDLRIRISFDKDNKTLTISDNGIGMTKDEVIENLGTIAKSGTKEFLESLSE 111 (623) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCHHHHHHHHHC T ss_conf 99998506899999887752682114788884489976366787999727878889999998777510027999987303 Q ss_pred CCCCCCCCEEEECCCCCHHHHHHHHEEEEEEEEEECCC--EEEEEEEE-CCCEECCCEECCCCCCC-CCEEEEEEEC--H Q ss_conf 47777774156314453889876203212699994188--38999972-79451131121676778-8539999984--2 Q gi|254780222|r 140 GGKFDNSTYEISGGLHGVGISVVNALSDELHVTVARQN--TIFSQKFS-RGIPLSPLEKKEKVRNK-RGTRITFRPD--P 213 (686) Q Consensus 140 ggnfdd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~g--k~y~q~f~-~g~~~~~~~~i~~~~~k-~GT~ItF~PD--~ 213 (686) +. .|+. -.|.-|||.--+=+.+.+.+|.+...| ..| .|+ +|...-.+..+.+ .. .||+|+-.+- . T Consensus 112 ~~--~~~~---lIGQFGVGFYSaFmVAdkV~V~T~~~~~~~~~--~W~S~g~g~ytv~~~~~--~~~~GT~I~L~Lk~~e 182 (623) T COG0326 112 DQ--KDSD---LIGQFGVGFYSAFMVADKVTVITRSAGEDEAY--HWESDGEGEYTVEDIDK--EPRRGTEITLHLKEEE 182 (623) T ss_pred CC--CCCC---CCCCCCCHHHHEEEEEEEEEEEECCCCCCCCC--EEEECCCCCEEEEECCC--CCCCCCEEEEEECCCH T ss_conf 54--4544---00245500342025511689996468998660--68776987467765667--7878857999988766 Q ss_pred HHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCC Q ss_conf 432001222289999988887530577359995164 Q gi|254780222|r 214 KIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDK 249 (686) Q Consensus 214 ~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~ 249 (686) +-|- +...|...+...+-. =.+.|.+..++ T Consensus 183 ~efl-----~~~rl~~ivkkYSd~-i~~PI~~~~~~ 212 (623) T COG0326 183 DEFL-----EEWRLREIVKKYSDH-IAYPIYIEGEK 212 (623) T ss_pred HHHH-----HHHHHHHHHHHHHCC-CCCCEEEEEEC T ss_conf 7774-----066799999987325-43166996422 No 32 >KOG1978 consensus Probab=97.34 E-value=0.002 Score=42.84 Aligned_cols=233 Identities=21% Similarity=0.219 Sum_probs=128.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECC---CCCCC Q ss_conf 55022024436666787776189976999982898--39999669843444176657870123454520024---77777 Q gi|254780222|r 71 EKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNG--FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAG---GKFDN 145 (686) Q Consensus 71 ~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg--~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~ag---gnfdd 145 (686) ..+|.--+.|+|+||+|. -|+.|+|.+..-| +|+|.|||.||--. . .++ ++--|+- ..|.| T Consensus 18 I~sl~sAVKELvENSiDA----GAT~I~I~~kdyG~d~IEV~DNG~GI~~~-n--~~~-------l~lkh~TSKi~~f~D 83 (672) T KOG1978 18 ITSLVSAVKELVENSIDA----GATAIDIKVKDYGSDSIEVSDNGSGISAT-D--FEG-------LALKHTTSKIVSFAD 83 (672) T ss_pred ECCHHHHHHHHHHCCCCC----CCCEEEEECCCCCCCEEEEECCCCCCCCC-C--HHH-------HHHHHHHHCCCCHHH T ss_conf 215899999998637034----77525575378774328980488887733-1--034-------443454413301232 Q ss_pred CCEEEECCCCCHHHHHHHHEEEEEEEEEECC-CEEEEEEEEC-CCEECCCEECCCCCCCCCEEEEEE----ECH---HHE Q ss_conf 7415631445388987620321269999418-8389999727-945113112167677885399999----842---432 Q gi|254780222|r 146 STYEISGGLHGVGISVVNALSDELHVTVARQ-NTIFSQKFSR-GIPLSPLEKKEKVRNKRGTRITFR----PDP---KIF 216 (686) Q Consensus 146 ~~yk~sgGlnGvGaklvNalS~~f~Vev~r~-gk~y~q~f~~-g~~~~~~~~i~~~~~k~GT~ItF~----PD~---~iF 216 (686) =....|-|--|=-.+.-.++|.-...+...+ ..-.+-.|.. |.. ....++....||+|+-. -=+ +-| T Consensus 84 l~~l~T~GFRGEALSsLCa~~dv~I~Trt~~~~vgt~l~~Dh~G~I----~~k~~~ar~~GTTV~v~~LF~tLPVR~kef 159 (672) T KOG1978 84 LAVLFTLGFRGEALSSLCALGDVMISTRSHSAKVGTRLVYDHDGHI----IQKKPVARGRGTTVMVRQLFSTLPVRRKEF 159 (672) T ss_pred HHHHHCHHHHHHHHHHHHHCCCEEEEEEECCCCCCEEEEECCCCCE----EEEECCCCCCCCEEEHHHHCCCCCCCHHHH T ss_conf 4444231246777763664165279986226754226887157863----443012577887787865314699866875 Q ss_pred ECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCEEEEEECCCC---HHHHHCCCCCCCCCCCCCEEEEEE Q ss_conf 00122228999998888753057735999516453222455411234315861---222100246776667752033332 Q gi|254780222|r 217 GDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKEIAEKYGIPEKEEFYFPGGL---KTYLQTKLENRSLISSEIFTGKTE 293 (686) Q Consensus 217 g~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~~~~~~~~~~~~~f~~~~Gi---~dyv~~~~~~~~~~~~~~~~~~~~ 293 (686) .....=.++.+...+...|-+++++++....-.. ......-+..+|- ++++....+......-.++-+ T Consensus 160 ~r~~Kref~k~i~li~~y~li~~~ir~~~~n~t~------~~~k~iil~t~~~~~~k~~i~svfg~~~~~~l~p~~~--- 230 (672) T KOG1978 160 QRNIKRKFVKLISLIQAYALISTAIKFLVSNSTL------AGKKNIILKTGGYGSDKINISSNFGSVEEENLEPLIF--- 230 (672) T ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCC------CCCCEEEEECCCCCHHHHHHHHHCCHHHHHCCCCCCC--- T ss_conf 0114555666666677787631211013440134------5774057733886217788875110354431466555--- Q ss_pred ECCCEEEEEEEE--EEECCCCCCEEEEEEECCCCCCCEEECCHHHHHHHHHHHHHH Q ss_conf 148015589998--541467774278883000655752413046789999998763 Q gi|254780222|r 294 KKGTHRGTVEWA--IAWCEENPEITSYCNTILTDEGGTHESGLRVALTRGIKKYAE 347 (686) Q Consensus 294 ~~~~~~~~veva--l~~~~~~~~~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~ 347 (686) +.-+ .+.......|+-|+|.-+... ..|.+.+|++.+ T Consensus 231 --------is~~~~g~~r~s~drqf~fIn~Rpv~~---------~~i~~~inevy~ 269 (672) T KOG1978 231 --------ISSCHHGCGRSSEDRQFIFINRRPVFP---------SDICRVINEVYK 269 (672) T ss_pred --------CCCCCCCCCCCCCCCEEEEECCCCCCH---------HHHHHHHHHHHH T ss_conf --------231122465567652146643765887---------999999999875 No 33 >PRK05218 heat shock protein 90; Provisional Probab=97.29 E-value=0.0017 Score=43.34 Aligned_cols=155 Identities=23% Similarity=0.236 Sum_probs=86.0 Q ss_pred HHHHHHHHHHHH--HC-----C-------CCCEEEEEECCC-CEEEEEECCCCCCCC-CCCCCCCCEEEEEEEEEEECCC Q ss_conf 443666678777--61-----8-------997699998289-839999669843444-1766578701234545200247 Q gi|254780222|r 78 FSEVIDNAMDEV--IA-----G-------YANLIEVSLDGN-GFLTVVDNGRGIPVE-NHPKFPNKSTLEIILTTLHAGG 141 (686) Q Consensus 78 ~~EIldNavDe~--~~-----g-------~~~~I~V~i~~d-g~IsV~ddGrGIPv~-~h~k~~~~~~~Elif~~L~agg 141 (686) +.|+|.||+|-- ++ + ..-.|.|++|++ ++++|.|||=|+--+ +| +-+=|.-++|+ T Consensus 30 lRELisNAsDAl~K~r~~~l~~~~~~~~~~~~~I~I~~d~~~~tl~I~D~GIGMt~~El~---------~~LgTIa~Sgt 100 (612) T PRK05218 30 LRELISNASDAIDKLRFEALSDEALYEGDGELRIRISFDKEARTLTISDNGIGMTREEVI---------ENLGTIAKSGT 100 (612) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCEEEEEECCCCCEEEEEECCCCCCHHHHH---------HHHHHHCCHHH T ss_conf 888874199999999998725903217888856999975989989999538888999999---------86111020118 Q ss_pred C-----CCCCCE--EEECCCCCHHHHHHHHEEEEEEEEEEC--CC-EEEEEEEECCCEECCCEECCCCCCCCCEEEEEEE Q ss_conf 7-----777741--563144538898762032126999941--88-3899997279451131121676778853999998 Q gi|254780222|r 142 K-----FDNSTY--EISGGLHGVGISVVNALSDELHVTVAR--QN-TIFSQKFSRGIPLSPLEKKEKVRNKRGTRITFRP 211 (686) Q Consensus 142 n-----fdd~~y--k~sgGlnGvGaklvNalS~~f~Vev~r--~g-k~y~q~f~~g~~~~~~~~i~~~~~k~GT~ItF~P 211 (686) . +.++.. .--.|.-|||.=.|=..|.+.+|.+.. ++ +.|..+.. |-..-.+.. ..+..+||+|+..+ T Consensus 101 ~~F~~~~~~~~~~~~~lIGqFGvGFyS~FmVad~V~V~srs~~~~~~~~~W~s~-g~g~y~i~~--~~~~~~GT~I~L~l 177 (612) T PRK05218 101 KEFLEKLKGDQKKDSQLIGQFGVGFYSAFMVADKVTVITRSAGPASEAVRWESD-GEGEYTIEE--IEKEERGTEITLHL 177 (612) T ss_pred HHHHHHHHHCCCCCCCHHHHCCCCCEEEEEECCEEEEEECCCCCCCCCEEEEEC-CCCCEEEEE--CCCCCCCCEEEEEE T ss_conf 999998652135540023213524204656504799997677899754599977-999767610--56789983799998 Q ss_pred CHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECC Q ss_conf 4243200122228999998888753057735999516 Q gi|254780222|r 212 DPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCD 248 (686) Q Consensus 212 D~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~ 248 (686) -.+- .+ -.+...|...++..+-.. ...|.++++ T Consensus 178 k~d~-~e--fl~~~~i~~li~kYs~fi-~~PI~l~~~ 210 (612) T PRK05218 178 KEDE-DE--FLDEWRIRSIIKKYSDHI-PVPIELEKE 210 (612) T ss_pred CCHH-HH--HCCHHHHHHHHHHHCCCC-CCCEEECCC T ss_conf 8606-76--447889999999740465-533497145 No 34 >PRK10755 sensor protein BasS/PmrB; Provisional Probab=97.25 E-value=0.00055 Score=46.64 Aligned_cols=78 Identities=22% Similarity=0.346 Sum_probs=49.9 Q ss_pred CHHHHHHHHHHHHHHHHHC--CCCCEEEEEECCCC--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCE Q ss_conf 0220244366667877761--89976999982898--3999966984344417665787012345452002477777741 Q gi|254780222|r 73 ALHHLFSEVIDNAMDEVIA--GYANLIEVSLDGNG--FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNSTY 148 (686) Q Consensus 73 GL~~i~~EIldNavDe~~~--g~~~~I~V~i~~dg--~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~y 148 (686) |=...+..++.|-+||+++ ..+..|.|.+..++ .|+|.|+|.|||-+-. +-+|...+-+. . T Consensus 249 ~D~~lL~~vl~NLi~NAikys~~g~~I~I~l~~~~~~~i~V~D~G~GI~~e~~---------~~iFerFyR~d----~-- 313 (355) T PRK10755 249 GDATLLRLLLRNLVENAHRYSPEGSNITIKLQEDGGAVMAVEDEGPGIDESKC---------GELSKAFVRMD----S-- 313 (355) T ss_pred ECHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCEEEEEEECCCCCCHHHH---------HHHCCCCCCCC----C-- T ss_conf 68999999999999989974899976999999779889999987999897999---------88558861389----9-- Q ss_pred EEECCCCCHHHHHHHHEEE Q ss_conf 5631445388987620321 Q gi|254780222|r 149 EISGGLHGVGISVVNALSD 167 (686) Q Consensus 149 k~sgGlnGvGaklvNalS~ 167 (686) -+|| .|+|.+.|...-+ T Consensus 314 -~~gG-sGLGLaIvk~Iv~ 330 (355) T PRK10755 314 -RYGG-IGLGLSIVSRITQ 330 (355) T ss_pred -CCCC-CCHHHHHHHHHHH T ss_conf -9998-7589999999999 No 35 >PRK10815 sensor protein PhoQ; Provisional Probab=97.18 E-value=0.001 Score=44.81 Aligned_cols=35 Identities=17% Similarity=0.247 Sum_probs=14.5 Q ss_pred CCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHH Q ss_conf 789999999998774178860786850677889889988 Q gi|254780222|r 592 HDEEHKQKILEEFKKHGKGKIEISRFKGLGEMLASQLKE 630 (686) Q Consensus 592 ~~e~e~~~~~~~~~~~~~~k~~i~ryKGLGe~~~~e~~e 630 (686) .++++.++..+...+.....- =-|||=+-..+.-+ T Consensus 419 Ip~e~~~~IFeRF~R~d~~r~----GsGLGLaIvk~Iv~ 453 (484) T PRK10815 419 IPESKRELIFDRGQRVDTLRP----GQGVGLAVAREITE 453 (484) T ss_pred CCHHHHHHHCCCCCCCCCCCC----CCCHHHHHHHHHHH T ss_conf 998999862468877999999----85767999999999 No 36 >COG3920 Signal transduction histidine kinase [Signal transduction mechanisms] Probab=97.07 E-value=0.003 Score=41.55 Aligned_cols=94 Identities=27% Similarity=0.459 Sum_probs=65.9 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHHHHCC-CCCEEEEEEC--CCC---EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEE Q ss_conf 15467775502202443666678777618-9976999982--898---39999669843444176657870123454520 Q gi|254780222|r 64 MYIGGTDEKALHHLFSEVIDNAMDEVIAG-YANLIEVSLD--GNG---FLTVVDNGRGIPVENHPKFPNKSTLEIILTTL 137 (686) Q Consensus 64 mYIGst~~~GL~~i~~EIldNavDe~~~g-~~~~I~V~i~--~dg---~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L 137 (686) .+++.--.-.|-=+|.|.+.||+-.+--+ ....|.|++. .++ ...|||||.|+|++.-+ T Consensus 113 ~~l~~d~A~~Lgliv~EL~tNa~Khaf~~~~~G~I~I~~~~~~~~~~~~l~v~deg~G~~~~~~~--------------- 177 (221) T COG3920 113 VFLDPDTAVPLGLIVHELVTNALKHAFLSRPGGEIRITLSREGDGGRFLLTVWDEGGGPPVEAPL--------------- 177 (221) T ss_pred EEECHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCCCCEEEEEEEECCCCCCCCCCC--------------- T ss_conf 22673678899999999999889861788899779999998479971799998789898888887--------------- Q ss_pred ECCCCCCCCCEEEECCCCCHHHHHHHHEE-EEEE--EEEE-CCCEEEEEEEECC Q ss_conf 02477777741563144538898762032-1269--9994-1883899997279 Q gi|254780222|r 138 HAGGKFDNSTYEISGGLHGVGISVVNALS-DELH--VTVA-RQNTIFSQKFSRG 187 (686) Q Consensus 138 ~aggnfdd~~yk~sgGlnGvGaklvNalS-~~f~--Vev~-r~gk~y~q~f~~g 187 (686) .+.|+|..+++.+- .... ++.+ .+|..|..+|... T Consensus 178 ---------------~~~g~G~~Lv~~lv~~q~~g~~~~~~~~Gt~~~i~~~~~ 216 (221) T COG3920 178 ---------------SRGGFGLQLVERLVPEQLGGELEDERPDGTEFRLRFPLS 216 (221) T ss_pred ---------------CCCCCHHHHHHHHHHHHCCCEEEEECCCCEEEEEEEECC T ss_conf ---------------899837999999999967972799749977999998864 No 37 >PTZ00272 heat shock protein 83 kDa (Hsp83); Provisional Probab=97.00 E-value=0.0043 Score=40.50 Aligned_cols=157 Identities=25% Similarity=0.255 Sum_probs=83.4 Q ss_pred HHHHHHHHHHH--HHC-----C-----CC--CEEEEEECCC-CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCC Q ss_conf 44366667877--761-----8-----99--7699998289-83999966984344417665787012345452002477 Q gi|254780222|r 78 FSEVIDNAMDE--VIA-----G-----YA--NLIEVSLDGN-GFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGK 142 (686) Q Consensus 78 ~~EIldNavDe--~~~-----g-----~~--~~I~V~i~~d-g~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggn 142 (686) +.|+|.||.|- .++ + .. =.|.|.+|++ ++++|.|||=|+--+ +=+ +-+=|..++|++ T Consensus 30 LRELISNASDAldKlr~~~lt~~~~~~~~~~l~I~I~~D~~~~tltI~D~GIGMt~~-----el~---~nLGTIA~SGt~ 101 (701) T PTZ00272 30 LRELISNASDACDKIRYQSLTDPSVLGESPRLCIRVVPDKENKTLTVEDNGIGMTKA-----DLV---NNLGTIARSGTK 101 (701) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCEEEEEECCCCCEEEEEECCCCCCHH-----HHH---HHCCCCCCHHHH T ss_conf 787776279999999999736934407887735999975878979999438888999-----999---752101114189 Q ss_pred -C----CCCCEEEECCCCCHHHHHHHHEEEEEEEEEECC-CEEEEEEEECCCEECCCEECCCCCCCCCEEEEEEE--CHH Q ss_conf -7----777415631445388987620321269999418-83899997279451131121676778853999998--424 Q gi|254780222|r 143 -F----DNSTYEISGGLHGVGISVVNALSDELHVTVARQ-NTIFSQKFSRGIPLSPLEKKEKVRNKRGTRITFRP--DPK 214 (686) Q Consensus 143 -f----dd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~-gk~y~q~f~~g~~~~~~~~i~~~~~k~GT~ItF~P--D~~ 214 (686) | +.....--.|.-|||--.+=..+.+.+|.+..+ ...|..+.. |...-.+.......-.+||+|+... |.. T Consensus 102 ~Fl~~l~~~~d~~lIGQFGVGFYSaFmVAdkV~V~sk~~~~~~~~W~S~-g~g~y~i~~~~~~~~~rGT~I~L~Lked~~ 180 (701) T PTZ00272 102 AFMEALEAGGDMSMIGQFGVGFYSAYLVADRVTVTSKNNSDESYVWESS-AGGTFTITSTPESDMKRGTRITLHLKEDQM 180 (701) T ss_pred HHHHHHHCCCCCCCCCCCCCCHHHEEEECCEEEEEECCCCCCCEEEEEE-CCCCEEEEECCCCCCCCCCEEEEEECHHHH T ss_conf 9999986479735234676654320120567999964699875589980-589658885777777898579999880488 Q ss_pred HEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCC Q ss_conf 32001222289999988887530577359995164 Q gi|254780222|r 215 IFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDK 249 (686) Q Consensus 215 iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~ 249 (686) -| .+...|...++..+-- =...|.+..++ T Consensus 181 ef-----l~~~~lk~likKYS~f-I~~PI~l~~ek 209 (701) T PTZ00272 181 EY-----LEPRRLKELIKKHSEF-IGYDIELMVEK 209 (701) T ss_pred HH-----HHHHHHHHHHHHHCCC-CCCCEEECCCC T ss_conf 77-----5478899999972467-56443622234 No 38 >COG3850 NarQ Signal transduction histidine kinase, nitrate/nitrite-specific [Signal transduction mechanisms] Probab=96.94 E-value=0.002 Score=42.79 Aligned_cols=36 Identities=22% Similarity=0.185 Sum_probs=17.8 Q ss_pred EEEEEEECCCCCCCCHHHC-------CCCCCEEEEEECCEECCHHH Q ss_conf 0379973276764210010-------11222012211231314454 Q gi|254780222|r 463 GTELFIVEGDSAGGSAKQA-------RNRSNQAVLPLRGKILNVAS 501 (686) Q Consensus 463 ~~eL~lvEGDSA~gsAk~g-------rdr~~qgi~PLRGKiLNv~~ 501 (686) .|-.-+++|| +.++-.- -.-...|.||+ |--|+..+ T Consensus 298 ~~~~di~~~d--~~~~~~~~~~~~l~~~g~~Lg~l~~-~~~l~~~d 340 (574) T COG3850 298 AEQWDISEGD--QPSGLKWPQEDPLTQQGHLLGTLPW-QRSLPEDD 340 (574) T ss_pred CCCCCEECCC--CCCCCCHHHHCCHHHHHHHHEEEEC-CCCCCCCH T ss_conf 4674342488--8766301343306553046614103-68999813 No 39 >PRK10490 sensor protein KdpD; Provisional Probab=96.93 E-value=0.0017 Score=43.38 Aligned_cols=41 Identities=20% Similarity=0.334 Sum_probs=27.5 Q ss_pred HHEEECCCCHHHHCCCCEEECC-------------------------CCCCCHHHH-----HHHHHHHHHH Q ss_conf 5602311210343269815467-------------------------775502202-----4436666787 Q gi|254780222|r 47 SSIRILEGLEPVRMRPGMYIGG-------------------------TDEKALHHL-----FSEVIDNAMD 87 (686) Q Consensus 47 ~~i~~L~glE~VrkRP~mYIGs-------------------------t~~~GL~~i-----~~EIldNavD 87 (686) +.-..|+|||-+-.|.--|=|- |..+|..|- +.|+|+..|| T Consensus 67 et~~l~~gl~~~p~~~~~y~g~~~~e~dlda~l~r~P~~~lvDelahtn~pgsrh~kR~qdv~ell~aGi~ 137 (895) T PRK10490 67 ETAALLEGLTVLPPKRLAHRGRQISEFDLDAALARRPALILMDELAHSNAPGSRHPKRWQDIEELLEAGID 137 (895) T ss_pred HHHHHHCCCCCCCCCEEEECCCCCCCCCHHHHHHCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCE T ss_conf 99998748665887114127832341689899853998899733456699988776407649999977983 No 40 >PTZ00130 heat shock protein 90; Provisional Probab=96.93 E-value=0.0081 Score=38.66 Aligned_cols=156 Identities=20% Similarity=0.246 Sum_probs=84.6 Q ss_pred HHHHHHHHHHH--HH-----CCC-------CCEEEEEECCC-CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCC Q ss_conf 44366667877--76-----189-------97699998289-83999966984344417665787012345452002477 Q gi|254780222|r 78 FSEVIDNAMDE--VI-----AGY-------ANLIEVSLDGN-GFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGK 142 (686) Q Consensus 78 ~~EIldNavDe--~~-----~g~-------~~~I~V~i~~d-g~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggn 142 (686) +.|+|.||.|- .+ .++ -=.|.|.+|++ ++++|.|+|-|+--+ +=+.++ =|..++|++ T Consensus 101 LRELISNASDALDKlRf~sLtd~~~~~~~~~l~I~I~~Dke~ktLtI~DtGIGMTk~-----eLi~NL---GTIAkSGTk 172 (824) T PTZ00130 101 LRELISNAADALEKIRFLSLSDEKVLGEEKKLEIRISANKEKNILSITDTGIGMTKE-----DLINNL---GTIAKSGTS 172 (824) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCEEEEEECCCCCEEEEEECCCCCCHH-----HHHHHC---CCCEEHHHH T ss_conf 888876489999999999717924308888856998975889989998448888899-----999856---913335289 Q ss_pred -C-----CCCCEEEECCCCCHHHHHHHHEEEEEEEEEECC-CEEEEEEEEC-C-CEECCCEECCCCCCCCCEEEEEE--E Q ss_conf -7-----777415631445388987620321269999418-8389999727-9-45113112167677885399999--8 Q gi|254780222|r 143 -F-----DNSTYEISGGLHGVGISVVNALSDELHVTVARQ-NTIFSQKFSR-G-IPLSPLEKKEKVRNKRGTRITFR--P 211 (686) Q Consensus 143 -f-----dd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~-gk~y~q~f~~-g-~~~~~~~~i~~~~~k~GT~ItF~--P 211 (686) | +....---.|.-|||.-.+-..+.+.+|.+..+ ...|..+..- | ..+.+.++- ..-.+||+|+.. + T Consensus 173 ~Fle~l~~~~~d~~LIGQFGVGFYSaFmVAdkV~V~SK~~~de~~~WeS~g~gsftI~~~~~~--~~l~RGT~I~LhLKe 250 (824) T PTZ00130 173 NFLEAISKSGGDMSLIGQFGVGFYSAFLVADKVIVYTKNNDDEQYIWESTADAKFSIYKDPRG--ATLKRGTRISLHLKD 250 (824) T ss_pred HHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCEEEEEECCCCCCCEEEEEECCCCEEEEECCCC--CCCCCCEEEEEEECC T ss_conf 999998735897010267775254443106768999578999640899806785588867777--777897069998567 Q ss_pred CHHHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCC Q ss_conf 42432001222289999988887530577359995164 Q gi|254780222|r 212 DPKIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDK 249 (686) Q Consensus 212 D~~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~ 249 (686) |..-|- +...|...++..+-- =+..|.+..++ T Consensus 251 d~~Efl-----ee~rik~lVkKYS~F-I~fPI~l~~ek 282 (824) T PTZ00130 251 DATNLL-----NDKKLMDLISKYSQF-IQFPIYLLHEN 282 (824) T ss_pred CHHHHH-----HHHHHHHHHHHHCCC-CCCCEEEECCC T ss_conf 567665-----478899999985136-65225872034 No 41 >KOG1979 consensus Probab=96.83 E-value=0.0074 Score=38.90 Aligned_cols=153 Identities=20% Similarity=0.306 Sum_probs=91.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCEEE- Q ss_conf 22024436666787776189976999982898--399996698434441766578701234545200247777774156- Q gi|254780222|r 74 LHHLFSEVIDNAMDEVIAGYANLIEVSLDGNG--FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNSTYEI- 150 (686) Q Consensus 74 L~~i~~EIldNavDe~~~g~~~~I~V~i~~dg--~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~yk~- 150 (686) -.-.+.|.+-||.|- -.+.|.|.+..+| -+-|.|||.||-- +... |+++=+|-||...=+.-. T Consensus 28 P~NAlKEliENSLDA----~ST~I~V~vk~GGLKLlQisDnG~GI~r------eDl~----ilCeRftTSKL~kFEDL~~ 93 (694) T KOG1979 28 PVNALKELIENSLDA----NSTSIDVLVKDGGLKLLQISDNGSGIRR------EDLP----ILCERFTTSKLTKFEDLFS 93 (694) T ss_pred HHHHHHHHHHCCCCC----CCCEEEEEEECCCEEEEEEECCCCCCCH------HHHH----HHHHHHHHHHCCHHHHHHH T ss_conf 578999998614157----9856999981597689998328986152------2468----9987762120000677776 Q ss_pred --ECCCCCHHHHHHHHEEE--EEEEEEEC--CCEEEEEEEECCCEECCCEECCCCCCCCCEEEEE----EECHHH---EE Q ss_conf --31445388987620321--26999941--8838999972794511311216767788539999----984243---20 Q gi|254780222|r 151 --SGGLHGVGISVVNALSD--ELHVTVAR--QNTIFSQKFSRGIPLSPLEKKEKVRNKRGTRITF----RPDPKI---FG 217 (686) Q Consensus 151 --sgGlnGvGaklvNalS~--~f~Vev~r--~gk~y~q~f~~g~~~~~~~~i~~~~~k~GT~ItF----~PD~~i---Fg 217 (686) |=|--|- +|+- .|. +.+|.+.+ +..-|+..|.+|.-.. ..+.|.++.||.|+- .--+.| +. T Consensus 94 lsTyGFRGE--ALAS-iShVA~VtV~TK~~~~~cayrasY~DGkm~~---~pKpcAgk~GT~I~vedLFYN~~~Rrkal~ 167 (694) T KOG1979 94 LSTYGFRGE--ALAS-ISHVAHVTVTTKTAEGKCAYRASYRDGKMIA---TPKPCAGKQGTIITVEDLFYNMPTRRKALR 167 (694) T ss_pred HHHCCCCHH--HHHH-HHHEEEEEEEEEECCCEEEEEEEEECCCCCC---CCCCCCCCCCEEEEEHHHHCCCHHHHHHHC T ss_conf 664174478--8764-3323689999862376445678751450146---997755777639986684403788898751 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCCEEEEEC Q ss_conf 012222899999888875305773599951 Q gi|254780222|r 218 DKAGFDAARLLKMTQSKAYLSGRVKTCWSC 247 (686) Q Consensus 218 ~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~ 247 (686) + ....+..|...+-..|.-|+++.+.+.- T Consensus 168 ~-~~EE~~ki~dlv~ryAIHn~~VsFs~rk 196 (694) T KOG1979 168 N-HAEEYRKIMDLVGRYAIHNPRVSFSLRK 196 (694) T ss_pred C-CHHHHHHHHHHHHHHHEECCCCCEEEEE T ss_conf 7-3889999999998871017886369851 No 42 >PRK10604 sensor protein RstB; Provisional Probab=96.78 E-value=0.0029 Score=41.73 Aligned_cols=13 Identities=31% Similarity=0.621 Sum_probs=5.9 Q ss_pred CCHHHHHHHHHHH Q ss_conf 7899999999987 Q gi|254780222|r 592 HDEEHKQKILEEF 604 (686) Q Consensus 592 ~~e~e~~~~~~~~ 604 (686) .++++.++..+.+ T Consensus 361 I~~e~~~~IFepF 373 (433) T PRK10604 361 IAPEERERIFEPF 373 (433) T ss_pred CCHHHHHHHCCCC T ss_conf 8989999865794 No 43 >PRK03660 anti-sigma F factor; Provisional Probab=96.71 E-value=0.016 Score=36.64 Aligned_cols=90 Identities=23% Similarity=0.337 Sum_probs=57.1 Q ss_pred HHHHHHHHHHHHHHHHHCC-CCCEEEEEEC--CCC-EEEEEECCCCCC-CCCCCCCCCCEEEEEEEEEEECCCCCCCCCE Q ss_conf 2202443666678777618-9976999982--898-399996698434-4417665787012345452002477777741 Q gi|254780222|r 74 LHHLFSEVIDNAMDEVIAG-YANLIEVSLD--GNG-FLTVVDNGRGIP-VENHPKFPNKSTLEIILTTLHAGGKFDNSTY 148 (686) Q Consensus 74 L~~i~~EIldNavDe~~~g-~~~~I~V~i~--~dg-~IsV~ddGrGIP-v~~h~k~~~~~~~Elif~~L~aggnfdd~~y 148 (686) +---+.|.+.|||.++-.+ +...|.|.+. .+. .|+|.|+|+||| ++. ..-| .|+. +. T Consensus 40 i~lAV~EA~tNaI~H~y~~~~~g~I~I~~~~~~~~l~I~V~D~G~Gid~~~~------~~~P--~~t~-------~~--- 101 (146) T PRK03660 40 IKTAVSEAVTNAIIHGYENNPDGIVYIEAEIEDEELEITVRDEGKGIEDVEE------ARQP--LFTT-------KP--- 101 (146) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCCHHH------CCCC--CCCC-------CC--- T ss_conf 9999999999999860367999279999998099999999973789473766------4388--7766-------87--- Q ss_pred EEECCCCCHHHHHHHHEEEEEEEEEECC-CEEEEEE Q ss_conf 5631445388987620321269999418-8389999 Q gi|254780222|r 149 EISGGLHGVGISVVNALSDELHVTVARQ-NTIFSQK 183 (686) Q Consensus 149 k~sgGlnGvGaklvNalS~~f~Vev~r~-gk~y~q~ 183 (686) .+...|.|..+.+.|..+++++...+ |...+++ T Consensus 102 --~~~~~GlGl~li~~LmDeVei~s~~g~GTtV~m~ 135 (146) T PRK03660 102 --ELERSGMGFTFMESFMDEVEVESEPGKGTTIRMK 135 (146) T ss_pred --CCCCCCHHHHHHHHHCCEEEEEECCCCCEEEEEE T ss_conf --6675640589999838869999559997899999 No 44 >PRK09470 cpxA two-component sensor protein; Provisional Probab=96.67 E-value=0.0026 Score=41.99 Aligned_cols=11 Identities=0% Similarity=0.214 Sum_probs=4.5 Q ss_pred HHHHHHHHHHC Q ss_conf 98999999880 Q gi|254780222|r 653 KDTQDSINKLM 663 (686) Q Consensus 653 ~~~~~~f~~lm 663 (686) ++.+.+|+-+. T Consensus 398 e~l~~iFepFy 408 (461) T PRK09470 398 EEREQIFRPFY 408 (461) T ss_pred HHHHHHCCCCC T ss_conf 99987627972 No 45 >COG3290 CitA Signal transduction histidine kinase regulating citrate/malate metabolism [Signal transduction mechanisms] Probab=96.60 E-value=0.0044 Score=40.47 Aligned_cols=19 Identities=16% Similarity=0.221 Sum_probs=12.9 Q ss_pred EEEEEECCCCCCCEEECCH Q ss_conf 7888300065575241304 Q gi|254780222|r 316 TSYCNTILTDEGGTHESGL 334 (686) Q Consensus 316 ~SFVN~I~T~~GGTHv~g~ 334 (686) .-++|-+|-..||-++-++ T Consensus 293 ~~i~nr~pI~~~~~~~GaI 311 (537) T COG3290 293 LLVANRVPIRSGGQIVGAI 311 (537) T ss_pred EEEEEECCEEECCEEEEEE T ss_conf 9999852378889776899 No 46 >PRK09303 adaptive-response sensory kinase; Validated Probab=96.57 E-value=0.0064 Score=39.33 Aligned_cols=10 Identities=30% Similarity=0.617 Sum_probs=3.5 Q ss_pred HHHHHHHHHH Q ss_conf 6889999999 Q gi|254780222|r 550 HIASLLLTFF 559 (686) Q Consensus 550 HI~~Lll~ff 559 (686) .|+-.|.|++ T Consensus 270 rL~QVl~NLL 279 (378) T PRK09303 270 RIRQVLLNLL 279 (378) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 47 >PRK10549 signal transduction histidine-protein kinase BaeS; Provisional Probab=96.54 E-value=0.0028 Score=41.85 Aligned_cols=11 Identities=36% Similarity=0.552 Sum_probs=4.4 Q ss_pred CCHHHHHHHHH Q ss_conf 78999999999 Q gi|254780222|r 592 HDEEHKQKILE 602 (686) Q Consensus 592 ~~e~e~~~~~~ 602 (686) .++++..++.+ T Consensus 397 I~~e~l~~IFe 407 (467) T PRK10549 397 VSDEQLQKLFE 407 (467) T ss_pred CCHHHHHHHCC T ss_conf 79899998668 No 48 >TIGR02938 nifL_nitrog nitrogen fixation negative regulator NifL; InterPro: IPR014285 NifL modulates the activity of the nitrogen fixation positive regulator protein NifA, acting as a negative regulator by binding NifA. NifA and NifL are encoded by adjacent genes.. Probab=96.52 E-value=0.0024 Score=42.24 Aligned_cols=50 Identities=26% Similarity=0.492 Sum_probs=23.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCC-CCE--EEEEEC-CCC--EEEEEECCCCCCCCCC Q ss_conf 55022024436666787776189-976--999982-898--3999966984344417 Q gi|254780222|r 71 EKALHHLFSEVIDNAMDEVIAGY-ANL--IEVSLD-GNG--FLTVVDNGRGIPVENH 121 (686) Q Consensus 71 ~~GL~~i~~EIldNavDe~~~g~-~~~--I~V~i~-~dg--~IsV~ddGrGIPv~~h 121 (686) +.-|.-||..+|||||| +++-. .++ +.++-- .|. .|+|.|-|+|||-|.- T Consensus 386 e~~lRslfK~LvdNAIe-aM~~~~~~RREL~i~t~~~~~li~~~i~DsGPGIP~dlr 441 (496) T TIGR02938 386 ESRLRSLFKALVDNAIE-AMAQKGVKRRELSILTAVEDDLIRVEIEDSGPGIPADLR 441 (496) T ss_pred CHHHHHHHHHHHHHHHH-HHHHCCCCCHHHHHHHHHCCCEEEEEEEECCCCCCCCCC T ss_conf 52578999999999999-853078410231010223085689999735879872442 No 49 >PRK11100 sensory histidine kinase CreC; Provisional Probab=96.48 E-value=0.0041 Score=40.65 Aligned_cols=13 Identities=23% Similarity=0.353 Sum_probs=6.1 Q ss_pred CCHHHHHHHHHHH Q ss_conf 7899999999987 Q gi|254780222|r 592 HDEEHKQKILEEF 604 (686) Q Consensus 592 ~~e~e~~~~~~~~ 604 (686) .++++..+..+.+ T Consensus 412 Ip~e~l~~IFerF 424 (475) T PRK11100 412 IPDYALPRIFERF 424 (475) T ss_pred CCHHHHHHHCCCC T ss_conf 7889998861678 No 50 >PRK09467 envZ osmolarity sensor protein; Provisional Probab=96.45 E-value=0.0062 Score=39.45 Aligned_cols=13 Identities=23% Similarity=0.475 Sum_probs=7.1 Q ss_pred CCHHHHHHHHHHH Q ss_conf 7899999999987 Q gi|254780222|r 592 HDEEHKQKILEEF 604 (686) Q Consensus 592 ~~e~e~~~~~~~~ 604 (686) .++++.++..+-+ T Consensus 375 I~~e~l~~if~pF 387 (437) T PRK09467 375 IPEEQIKHLFQPF 387 (437) T ss_pred CCHHHHHHHCCCC T ss_conf 9989998761685 No 51 >PRK10600 nitrate/nitrite sensor protein NarX; Provisional Probab=96.29 E-value=0.0099 Score=38.05 Aligned_cols=13 Identities=23% Similarity=0.368 Sum_probs=7.6 Q ss_pred CCEEEEEEECCCC Q ss_conf 8607868506778 Q gi|254780222|r 610 GKIEISRFKGLGE 622 (686) Q Consensus 610 ~k~~i~ryKGLGe 622 (686) +...|.--.|=|| T Consensus 536 G~l~i~s~p~~GT 548 (569) T PRK10600 536 GDCSVRRRESGGT 548 (569) T ss_pred CEEEEEECCCCCE T ss_conf 9899986699958 No 52 >cd00188 TOPRIM Topoisomerase-primase domain. This is a nucleotidyl transferase/hydrolase domain found in type IA, type IIA and type IIB topoisomerases, bacterial DnaG-type primases, small primase-like proteins from bacteria and archaea, OLD family nucleases from bacterial and archaea, and bacterial DNA repair proteins of the RecR/M family. This domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). This glutamate and two aspartates, cluster together to form a highly acid surface patch. The conserved glutamate may act as a general base in nucleotide polymerization by primases and in strand joining in topoisomerases and, as a general acid in strand cleavage by topisomerases and nucleases. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=96.27 E-value=0.011 Score=37.80 Aligned_cols=74 Identities=39% Similarity=0.416 Sum_probs=52.7 Q ss_pred EEEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEC Q ss_conf 37997327676421001011222012211231314454108776322889999988256654333334655251899977 Q gi|254780222|r 464 TELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQYREEDLRYEKVIIMTD 543 (686) Q Consensus 464 ~eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~~~~~LRYgkIiImTD 543 (686) +.++||||++..-+..+.... ...++++.|+..+ .....+...++. +.+|+|++| T Consensus 1 ~~viivEG~~d~~~l~~~~~~-~~~~~~~~G~~~~------------~~~~~~~~~~~~------------~~~vii~~D 55 (83) T cd00188 1 KKLIIVEGPSDALALAQAGGY-GGAVVALGGHALN------------KTRELLKRLLGE------------AKEVIIATD 55 (83) T ss_pred CEEEEECCHHHHHHHHHHCCC-CEEEEEEEEEECC------------CHHHHHHHHHCC------------CCEEEEECC T ss_conf 979998888999999985699-9799997557384------------389999998768------------999999749 Q ss_pred CCCCCHHHHHHHHHHHHHH Q ss_conf 9877146889999999987 Q gi|254780222|r 544 ADVDGAHIASLLLTFFYQE 562 (686) Q Consensus 544 aD~DGsHI~~Lll~ff~~~ 562 (686) .|..|.++...+....... T Consensus 56 ~D~~g~~~~~~~~~~~~~~ 74 (83) T cd00188 56 ADREGEAIALRLLELLKSL 74 (83) T ss_pred CCHHHHHHHHHHHHHHHHC T ss_conf 8937999999999973234 No 53 >PRK11360 sensory histidine kinase AtoS; Provisional Probab=96.14 E-value=0.0072 Score=39.01 Aligned_cols=28 Identities=11% Similarity=0.260 Sum_probs=15.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 7146889999999987088863393899 Q gi|254780222|r 547 DGAHIASLLLTFFYQEMYDLIEQKHLFL 574 (686) Q Consensus 547 DGsHI~~Lll~ff~~~~p~Li~~g~v~~ 574 (686) |-.+++-.|+|++.-=+.-.=+.|.+.+ T Consensus 495 D~~~l~qvl~NLi~NAi~a~~~~g~I~i 522 (607) T PRK11360 495 DRELLKQVLLNILINAVQAISARGKIRI 522 (607) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 8899999999999996286289967999 No 54 >PRK11644 sensory histidine kinase UhpB; Provisional Probab=96.07 E-value=0.016 Score=36.64 Aligned_cols=43 Identities=23% Similarity=0.393 Sum_probs=16.4 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCEEEEEECC-CC--EEEEEECCCCCCC Q ss_conf 0220244366667877761899769999828-98--3999966984344 Q gi|254780222|r 73 ALHHLFSEVIDNAMDEVIAGYANLIEVSLDG-NG--FLTVVDNGRGIPV 118 (686) Q Consensus 73 GL~~i~~EIldNavDe~~~g~~~~I~V~i~~-dg--~IsV~ddGrGIPv 118 (686) -+|.++.|-|.|++-++ .+++|.|.+.. ++ .+.|.|||.|+|. T Consensus 412 tlYRIvQEaLTNi~KHA---~A~~V~I~L~~~~~~l~L~I~DDG~Gf~~ 457 (497) T PRK11644 412 TLFRVCQEGLNNIVKHA---DASAVTIQGWQQDERLMLVIEDNGSGLPP 457 (497) T ss_pred HHHHHHHHHHHHHHHCC---CCCEEEEEEEECCCEEEEEEEECCCCCCC T ss_conf 99999999999999737---88769999997699899999889989898 No 55 >PRK09835 sensor kinase CusS; Provisional Probab=96.07 E-value=0.035 Score=34.26 Aligned_cols=11 Identities=18% Similarity=0.214 Sum_probs=5.2 Q ss_pred HHHHHHHHHHH Q ss_conf 89999998763 Q gi|254780222|r 337 ALTRGIKKYAE 347 (686) Q Consensus 337 ai~k~i~~~~~ 347 (686) .|...+|.+++ T Consensus 243 ~La~~fN~M~~ 253 (482) T PRK09835 243 QLVLSFNHMIE 253 (482) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 56 >pfam01751 Toprim Toprim domain. This is a conserved region from DNA primase. This corresponds to the Toprim domain common to DnaG primases, topoisomerases, OLD family nucleases and RecR proteins. Both DnaG motifs IV and V are present in the alignment, the DxD (V) motif may be involved in Mg2+ binding and mutations to the conserved glutamate (IV) completely abolish DnaG type primase activity. DNA primase EC:2.7.7.6 is a nucleotidyltransferase it synthesizes the oligoribonucleotide primers required for DNA replication on the lagging strand of the replication fork; it can also prime the leading stand and has been implicated in cell division. This family also includes the atypical archaeal A subunit from type II DNA topoisomerases. Type II DNA topoisomerases catalyse the relaxation of DNA supercoiling by causing transient double strand breaks. Probab=96.02 E-value=0.0074 Score=38.90 Aligned_cols=78 Identities=33% Similarity=0.317 Sum_probs=49.7 Q ss_pred EEEEECCCC-CCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCC--EEEE Q ss_conf 799732767-6421001011222012211231314454108776322889999988256654333334655251--8999 Q gi|254780222|r 465 ELFIVEGDS-AGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQYREEDLRYEK--VIIM 541 (686) Q Consensus 465 eL~lvEGDS-A~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~~~~~LRYgk--IiIm 541 (686) +|+||||.| |....+.+-. ..-+..+.|-+..... ...+.+..|. .++.+ |||+ T Consensus 2 ~l~IVE~~~d~~~i~~~~~~--~~~~~~~~G~~~~l~~------~~~~~~~~l~---------------~~~~~~eViia 58 (89) T pfam01751 2 VLIIVEGPSDAIALAKAGGY--KGNVVALLGHLSDVIP------LTKEQLKLLK---------------KLAKKDEVILA 58 (89) T ss_pred EEEEECCHHHHHHHHHHCCC--CEEEEEEECEECCCCC------CCHHHHHHHH---------------HHHCCCEEEEE T ss_conf 89998988999999962488--8799984157057655------4246799999---------------86237749998 Q ss_pred ECCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 779877146889999999987088 Q gi|254780222|r 542 TDADVDGAHIASLLLTFFYQEMYD 565 (686) Q Consensus 542 TDaD~DGsHI~~Lll~ff~~~~p~ 565 (686) ||.|..|.+|.-.++.++-.+.+. T Consensus 59 tD~D~eGe~~a~~l~~~l~~~~~~ 82 (89) T pfam01751 59 TDPDREGEAIAWKLLELLKPLGKK 82 (89) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 798948999999999983122981 No 57 >COG4585 Signal transduction histidine kinase [Signal transduction mechanisms] Probab=95.83 E-value=0.019 Score=36.03 Aligned_cols=48 Identities=31% Similarity=0.512 Sum_probs=37.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECC-CC--EEEEEECCCCCCCCCC Q ss_conf 550220244366667877761899769999828-98--3999966984344417 Q gi|254780222|r 71 EKALHHLFSEVIDNAMDEVIAGYANLIEVSLDG-NG--FLTVVDNGRGIPVENH 121 (686) Q Consensus 71 ~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~-dg--~IsV~ddGrGIPv~~h 121 (686) +..|+.++.|-|.|++=++ .++++.|++.. ++ .++|.|||+|.+.+.. T Consensus 277 e~~l~rivQEaltN~~rHa---~A~~v~V~l~~~~~~l~l~V~DnG~Gf~~~~~ 327 (365) T COG4585 277 EDALFRIVQEALTNAIRHA---QATEVRVTLERTDDELRLEVIDNGVGFDPDKE 327 (365) T ss_pred HHHHHHHHHHHHHHHHHCC---CCCEEEEEEEECCCEEEEEEEECCCCCCCCCC T ss_conf 9999999999998998757---84569999998298899999989978897645 No 58 >PRK10364 sensor protein ZraS; Provisional Probab=95.82 E-value=0.011 Score=37.62 Aligned_cols=23 Identities=13% Similarity=-0.150 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHCCCCCEEEEEC Q ss_conf 899999888875305773599951 Q gi|254780222|r 224 AARLLKMTQSKAYLSGRVKTCWSC 247 (686) Q Consensus 224 ~~~l~~rl~~~A~L~~gl~i~~~~ 247 (686) ...+...+.+.|. .||+.....- T Consensus 74 ~~~~q~ll~e~a~-q~~v~~~~v~ 96 (455) T PRK10364 74 HAQQQALLEEMAG-QPGVLWFAVT 96 (455) T ss_pred HHHHHHHHHHHHC-CCCEEEEEEE T ss_conf 4799999999715-9986999998 No 59 >COG5000 NtrY Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation [Signal transduction mechanisms] Probab=95.43 E-value=0.03 Score=34.76 Aligned_cols=41 Identities=10% Similarity=0.155 Sum_probs=23.5 Q ss_pred EEEEECCCC--CCHHHHHHHHHHHCCCC-------HHHHHHHHHHHHHHH Q ss_conf 999866410--26798999999880798-------168999999752467 Q gi|254780222|r 641 LRVEINKDL--NSLKDTQDSINKLMGTK-------ADERFKFIQERASFI 681 (686) Q Consensus 641 ~~v~~~~~~--~~~~~~~~~f~~lmG~~-------~~~RK~wi~~~~~~~ 681 (686) +.|++-|+. -..+..+.+|+-+|-.+ -+-=|.-|++|+-.+ T Consensus 640 i~v~V~DNGkG~p~e~r~r~~EPYvTtr~KGTGLGLAiVKkIvEeHGG~l 689 (712) T COG5000 640 IVVDVIDNGKGFPRENRHRALEPYVTTREKGTGLGLAIVKKIVEEHGGRL 689 (712) T ss_pred EEEEEECCCCCCCHHHHHHHCCCCEECCCCCCCCCHHHHHHHHHHCCCEE T ss_conf 99998108987996786432367142165666623999999999638827 No 60 >KOG1977 consensus Probab=95.15 E-value=0.014 Score=36.92 Aligned_cols=172 Identities=19% Similarity=0.237 Sum_probs=95.4 Q ss_pred HCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEC-CCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEE Q ss_conf 32698154677755022024436666787776189976999982-89839999669843444176657870123454520 Q gi|254780222|r 59 RMRPGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLD-GNGFLTVVDNGRGIPVENHPKFPNKSTLEIILTTL 137 (686) Q Consensus 59 rkRP~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~-~dg~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L 137 (686) ++|-|.-|-| |-.++.|++-||+|. +++.|.|-++ ..-++-|.|||-|+--|-- ..+|+- T Consensus 12 ~lrSg~~~~s-----la~~VeElv~NSiDA----~At~V~v~V~~~t~sv~ViDdG~G~~rdDl----------~~lg~r 72 (1142) T KOG1977 12 KLRSGLAISS-----LAQCVEELVLNSIDA----EATCVAVRVNMETFSVQVIDDGFGMGRDDL----------EKLGNR 72 (1142) T ss_pred HHHCCCHHHH-----HHHHHHHHHHHCCCC----CCEEEEEEECCCEEEEEEEECCCCCCHHHH----------HHHHHH T ss_conf 8763604787-----999999998612255----752799996575067999856877667889----------998765 Q ss_pred ECCCCCCCCCEEEECCCCCH-HHHHHHHE--EEEEEEEEE--CCCEEEEEEEECCCEECCCEECCCCCCCCCEEEEEE-- Q ss_conf 02477777741563144538-89876203--212699994--188389999727945113112167677885399999-- Q gi|254780222|r 138 HAGGKFDNSTYEISGGLHGV-GISVVNAL--SDELHVTVA--RQNTIFSQKFSRGIPLSPLEKKEKVRNKRGTRITFR-- 210 (686) Q Consensus 138 ~aggnfdd~~yk~sgGlnGv-GaklvNal--S~~f~Vev~--r~gk~y~q~f~~g~~~~~~~~i~~~~~k~GT~ItF~-- 210 (686) ++-+||-..+.-+.--.-|+ |-+|++|- |.-+.+.-. |..+.+...|-.|-...-++ |..+....||.||-. T Consensus 73 y~TSK~h~~ndl~~~~tyGfRGeALasIsd~s~l~v~skkk~r~~~~~~kk~~~gs~~~~l~-iD~~R~~sGTtVtV~dl 151 (1142) T KOG1977 73 YFTSKCHSVNDLENPRTYGFRGEALASISDMSSLVVISKKKNRTMKTFVKKFQSGSALKALE-IDVTRASSGTTVTVYDL 151 (1142) T ss_pred HHHHHCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCEECC-CCCCCCCCCCEEEEHHH T ss_conf 54432000000136200254346664243104464555413885068999874122110366-55332468858984784 Q ss_pred ----ECHHHEE-CCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCC Q ss_conf ----8424320-012222899999888875305773599951645 Q gi|254780222|r 211 ----PDPKIFG-DKAGFDAARLLKMTQSKAYLSGRVKTCWSCDKE 250 (686) Q Consensus 211 ----PD~~iFg-~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~~ 250 (686) |=..+.. +...-.++.|++|+..+|.+.|.+.+.+.+... T Consensus 152 fY~lPVRRr~k~~~P~k~fe~Ik~~i~~i~lmHp~iSfsv~~~~s 196 (1142) T KOG1977 152 FYQLPVRRRLKCMDPRKEFEKIKQRIEAISLMHPSISFSVRNDVS 196 (1142) T ss_pred HHCCHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCC T ss_conf 403345665502887889999999999997526640478875267 No 61 >COG4191 Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms] Probab=95.10 E-value=0.037 Score=34.10 Aligned_cols=45 Identities=13% Similarity=0.140 Sum_probs=18.0 Q ss_pred HHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCC Q ss_conf 4108776322889999988256654333334655251899977987 Q gi|254780222|r 501 SAGSEKIRNNQQIMDLVQALGCRTRSQYREEDLRYEKVIIMTDADV 546 (686) Q Consensus 501 ~as~~ki~~N~Ei~~l~~~lGl~~g~~~~~~~LRYgkIiImTDaD~ 546 (686) .....||+-| =|+|=.-|+.-.....-.+.--+=+.-+++|=.|. T Consensus 495 ~iRLeQVLvN-Ll~NALDA~~~~~~~~i~i~~~~~~~~v~l~VrDn 539 (603) T COG4191 495 EIRLEQVLVN-LLQNALDAMAGQEDRRLSIRAQREGGQVVLTVRDN 539 (603) T ss_pred HHHHHHHHHH-HHHHHHHHHCCCCCCEEEEEEEECCCEEEEEECCC T ss_conf 2209999999-99989998458878716999871598699998269 No 62 >KOG0787 consensus Probab=94.82 E-value=0.049 Score=33.30 Aligned_cols=21 Identities=24% Similarity=0.491 Sum_probs=10.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHC Q ss_conf 468899999999870888633 Q gi|254780222|r 549 AHIASLLLTFFYQEMYDLIEQ 569 (686) Q Consensus 549 sHI~~Lll~ff~~~~p~Li~~ 569 (686) +|..-+|.-.|-.-|+--+|. T Consensus 259 shL~ymlfElfKNamrATve~ 279 (414) T KOG0787 259 SHLYYMLFELFKNAMRATVEH 279 (414) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 189999999999999999997 No 63 >COG2172 RsbW Anti-sigma regulatory factor (Ser/Thr protein kinase) [Signal transduction mechanisms] Probab=94.80 E-value=0.26 Score=28.33 Aligned_cols=93 Identities=26% Similarity=0.275 Sum_probs=59.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC----CEEEEEECCCC-EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCC Q ss_conf 0220244366667877761899----76999982898-399996698434441766578701234545200247777774 Q gi|254780222|r 73 ALHHLFSEVIDNAMDEVIAGYA----NLIEVSLDGNG-FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNST 147 (686) Q Consensus 73 GL~~i~~EIldNavDe~~~g~~----~~I~V~i~~dg-~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~ 147 (686) -+-..+.|.+.|+|-.+-++-. -.|.++++.+. .|.|+|.|.||+. . +..+... T Consensus 40 ~l~~av~E~~~N~v~Ha~~~~~~~g~I~i~~~~~~~~~~i~i~D~G~~~~~-~----------~~~~~~~---------- 98 (146) T COG2172 40 DLAIAVSEALTNAVKHAYKLDPSEGEIRIEVSLDDGKLEIRIWDQGPGIED-L----------EESLGPG---------- 98 (146) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCEEEEEEEECCCCCCC-H----------HHHCCCC---------- T ss_conf 999999999989999763038998659999997099399999957989878-8----------8853888---------- Q ss_pred EEEECCC--CCHHHHHHHHEEEEEEEEEECCC--EEEEEEEEC Q ss_conf 1563144--53889876203212699994188--389999727 Q gi|254780222|r 148 YEISGGL--HGVGISVVNALSDELHVTVARQN--TIFSQKFSR 186 (686) Q Consensus 148 yk~sgGl--nGvGaklvNalS~~f~Vev~r~g--k~y~q~f~~ 186 (686) +...+++ .|+|.-+.+-+...|.++...++ ......+.+ T Consensus 99 ~~~~~~~~~~G~Gl~l~~~~~D~~~~~~~~~~~~~~~~~~~~~ 141 (146) T COG2172 99 DTTAEGLQEGGLGLFLAKRLMDEFSYERSEDGRNRLTKITLRK 141 (146) T ss_pred CCCCCCCCCCCCCHHHHHHHHEEEEEEEECCCCEEEEEEEECC T ss_conf 8888753247313788862110799996069855899999624 No 64 >PRK04069 serine-protein kinase RsbW; Provisional Probab=94.78 E-value=0.25 Score=28.44 Aligned_cols=94 Identities=18% Similarity=0.156 Sum_probs=58.4 Q ss_pred HHHHHHHHHHHHHHHHHCC-CCCEEEEEEC--CCC-EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCEE Q ss_conf 2202443666678777618-9976999982--898-39999669843444176657870123454520024777777415 Q gi|254780222|r 74 LHHLFSEVIDNAMDEVIAG-YANLIEVSLD--GNG-FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNSTYE 149 (686) Q Consensus 74 L~~i~~EIldNavDe~~~g-~~~~I~V~i~--~dg-~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~~yk 149 (686) +-=-+.|.+.|||..+-++ +...|.|.+. .+. .|.|.|.|.|...+..+.. ..|. --+ . .-+. T Consensus 43 i~LAV~EA~tNaI~Hay~~~~~~~i~i~~~~~~~~l~i~V~D~G~gfD~~~~~~~---~~P~-~~~-----~---~l~~- 109 (158) T PRK04069 43 LKIAVSEACTNAVQHAYKEEEVGEINIRFEIYEDRLEIVVADNGDSFDYETTKSK---IGPY-DPS-----E---PIDD- 109 (158) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEECCEEEEEEEECCCCCCHHHCCCC---CCCC-CCC-----C---CHHH- T ss_conf 9999999999999975156999479999999599999999991748795674324---5888-888-----8---6111- Q ss_pred EECCCCCHHHHHHHHEEEEEEEEEECCCEEEEEE Q ss_conf 6314453889876203212699994188389999 Q gi|254780222|r 150 ISGGLHGVGISVVNALSDELHVTVARQNTIFSQK 183 (686) Q Consensus 150 ~sgGlnGvGaklvNalS~~f~Vev~r~gk~y~q~ 183 (686) -...|.|..+...|....+++. ++|...+++ T Consensus 110 --~~~gGlGl~lI~~lmDeV~~~~-~~Gt~v~m~ 140 (158) T PRK04069 110 --LREGGLGLFLIETLMDDVTVYK-DSGVTVSMT 140 (158) T ss_pred --CCCCCCHHHHHHHHCCEEEEEE-CCCEEEEEE T ss_conf --3678740999997525589990-898299999 No 65 >COG0642 BaeS Signal transduction histidine kinase [Signal transduction mechanisms] Probab=94.23 E-value=0.16 Score=29.79 Aligned_cols=47 Identities=26% Similarity=0.455 Sum_probs=33.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC---EEEEEECCCCCCCCC Q ss_conf 55022024436666787776189976999982898---399996698434441 Q gi|254780222|r 71 EKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNG---FLTVVDNGRGIPVEN 120 (686) Q Consensus 71 ~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg---~IsV~ddGrGIPv~~ 120 (686) ..-|-+++..+++||++-.. ...|.|.+..++ .++|.|||.|||.+. T Consensus 226 ~~~l~~vl~nLi~NAi~~~~---~~~i~i~~~~~~~~i~i~V~D~G~Gi~~~~ 275 (336) T COG0642 226 PERLRQVLVNLLSNAIKYTP---GGEITISVRQDDEQVTISVEDTGPGIPEEE 275 (336) T ss_pred HHHHHHHHHHHHHHHHHHCC---CCEEEEEEEECCCEEEEEEECCCCCCCHHH T ss_conf 88999999999998998668---985999999638779999980789979889 No 66 >PRK11107 hybrid sensory histidine kinase BarA; Provisional Probab=93.71 E-value=0.57 Score=25.96 Aligned_cols=92 Identities=25% Similarity=0.386 Sum_probs=44.0 Q ss_pred CCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEC--CCC----EEEEEECCCCCCCCCCCCCCCCEEEEEEEE Q ss_conf 98154677755022024436666787776189976999982--898----399996698434441766578701234545 Q gi|254780222|r 62 PGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLD--GNG----FLTVVDNGRGIPVENHPKFPNKSTLEIILT 135 (686) Q Consensus 62 P~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~--~dg----~IsV~ddGrGIPv~~h~k~~~~~~~Elif~ 135 (686) |...+| |..-|.+++.-+|.||+----.|. =.|.|.+. .++ .++|.|.|.|||-+..+ .+.- -|. T Consensus 399 p~~v~g--D~~Rl~QIl~NLl~NAiKfT~~G~-I~v~v~~~~~~~~~~~l~~~V~DTGiGI~~~~~~---~iF~---~F~ 469 (920) T PRK11107 399 PDNVIG--DPLRLQQIITNLVGNAIKFTESGN-IDILVELRALSDEKVQLEVQIRDTGIGISERQQS---QLFQ---AFR 469 (920) T ss_pred CCCEEE--CHHHHHHHHHHHHHHHHHCCCCCC-EEEEEEEEECCCCEEEEEEEEEEECCCCCHHHHH---HHHC---CCC T ss_conf 730653--899999999999999997188993-7999999964798489999999970586999999---7854---201 Q ss_pred EEECCCCCCCCCEEEECCCCCHHHHHHHHEEEEE Q ss_conf 2002477777741563144538898762032126 Q gi|254780222|r 136 TLHAGGKFDNSTYEISGGLHGVGISVVNALSDEL 169 (686) Q Consensus 136 ~L~aggnfdd~~yk~sgGlnGvGaklvNalS~~f 169 (686) ...+ +.-+-.||. |+|.+.|.-+-+.+ T Consensus 470 q~d~------s~~r~~gGt-GLGLaI~k~Lv~~m 496 (920) T PRK11107 470 QADA------SISRRHGGT-GLGLVITQKLVNEM 496 (920) T ss_pred CCCC------CCCCCCCCC-CCHHHHHHHHHHHC T ss_conf 5676------544567987-74299999999980 No 67 >TIGR02966 phoR_proteo phosphate regulon sensor kinase PhoR; InterPro: IPR014310 Members of this protein are the regulatory histidine kinase PhoR associated with the phosphate ABC transporter found in most Proteobacteria, and related proteins from Gram-positive organisms are excluded. The phoR gene usually is adjacent to the response regulator phoB gene (IPR011879 from INTERPRO).. Probab=93.56 E-value=0.19 Score=29.16 Aligned_cols=100 Identities=24% Similarity=0.333 Sum_probs=58.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHC--CCCCEEEEEECCC--C-EEEEEECCCCCCCCCCCCCCCCEEE-EEEEEEEECCCCCC Q ss_conf 550220244366667877761--8997699998289--8-3999966984344417665787012-34545200247777 Q gi|254780222|r 71 EKALHHLFSEVIDNAMDEVIA--GYANLIEVSLDGN--G-FLTVVDNGRGIPVENHPKFPNKSTL-EIILTTLHAGGKFD 144 (686) Q Consensus 71 ~~GL~~i~~EIldNavDe~~~--g~~~~I~V~i~~d--g-~IsV~ddGrGIPv~~h~k~~~~~~~-Elif~~L~aggnfd 144 (686) ..|=-.=+-.++.|=|.||++ .+..+|+|.-..+ | ..+|.|+|.|||-+ .++=+ |=-+=+ | T Consensus 229 ~~G~e~~L~SAf~NLv~NAikYTp~gg~I~v~W~~~~~ga~fsV~DtG~GI~~e------HipRLTERFYRV-------D 295 (339) T TIGR02966 229 LLGNEDELRSAFSNLVSNAIKYTPEGGTITVRWRRDEGGAEFSVTDTGIGIAPE------HIPRLTERFYRV-------D 295 (339) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEECCEEEEEEEECCCCCCHH------HCCCCCCEEEEE-------C T ss_conf 014477799999999887530089988799999985780399998779898731------377542001230-------5 Q ss_pred CCCEEEECCCCCHHHHHH----HHEEEEEEEEEECC-CEEEEEEE Q ss_conf 774156314453889876----20321269999418-83899997 Q gi|254780222|r 145 NSTYEISGGLHGVGISVV----NALSDELHVTVARQ-NTIFSQKF 184 (686) Q Consensus 145 d~~yk~sgGlnGvGaklv----NalS~~f~Vev~r~-gk~y~q~f 184 (686) .+-=+-||| .|+|.+-| +-.-.+++|++.-+ |..|+-.| T Consensus 296 ~~RSR~tGG-TGLGLAIVKH~L~rH~a~L~I~Se~G~GS~F~~~F 339 (339) T TIGR02966 296 KSRSRDTGG-TGLGLAIVKHVLSRHHARLEIESELGKGSTFSFIF 339 (339) T ss_pred HHHCCCCCC-CCCHHHHHHHHHHCCCCEEEEEEEECCCEEEEEEC T ss_conf 334157889-84336899998741487899998853760688759 No 68 >PRK11466 hybrid sensory histidine kinase TorS; Provisional Probab=93.42 E-value=0.11 Score=30.75 Aligned_cols=101 Identities=21% Similarity=0.405 Sum_probs=58.9 Q ss_pred CCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCC---CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEE Q ss_conf 9815467775502202443666678777618997699998289---8399996698434441766578701234545200 Q gi|254780222|r 62 PGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGN---GFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLH 138 (686) Q Consensus 62 P~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~d---g~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~ 138 (686) |....| |..-|.+++.-+|.||+----.| .|.+....+ -.|+|+|.|.|||-+..+ .+..| |... T Consensus 552 p~~v~g--D~~RLrQIL~NLlsNAiKFT~~G---~V~l~~~~~~~~l~~~V~DTGiGI~~e~~~---~IF~~---F~Q~- 619 (912) T PRK11466 552 PSALMG--DPRRIRQVITNLLSNALRFTDEG---SIVLRSRTDGEQWLVEVEDSGCGIDPAKLA---EIFQP---FVQV- 619 (912) T ss_pred CCEEEE--CHHHHHHHHHHHHHHHCCCCCCC---EEEEEEEECCCEEEEEEEECCCCCCHHHHH---HHHCC---CCCC- T ss_conf 860863--58899999999999870618996---799999986978999985089999999999---88447---6158- Q ss_pred CCCCCCCCCEEEECCCCCHHHHHHHHEEEEE----EEEEEC-CCEEEEEEE Q ss_conf 2477777741563144538898762032126----999941-883899997 Q gi|254780222|r 139 AGGKFDNSTYEISGGLHGVGISVVNALSDEL----HVTVAR-QNTIFSQKF 184 (686) Q Consensus 139 aggnfdd~~yk~sgGlnGvGaklvNalS~~f----~Vev~r-~gk~y~q~f 184 (686) +++| || -|+|.+.|--+-+-+ .|+..- .|..|..+. T Consensus 620 -~~~~--------gG-tGLGLaI~k~LvelMGG~I~V~S~~G~GS~F~~~L 660 (912) T PRK11466 620 -SGKR--------GG-TGLGLTISSRLAQAMGGELSATSTPEVGSCFCLRL 660 (912) T ss_pred -CCCC--------CC-CCHHHHHHHHHHHHCCCEEEEECCCCCCEEEEEEE T ss_conf -8998--------99-26689999999998799799971699872899999 No 69 >TIGR01386 cztS_silS_copS heavy metal sensor kinase; InterPro: IPR006290 Members of this family contain a sensor histidine kinase domain (IPR003661 from INTERPRO) and a domain found in bacterial signal proteins (IPR003660 from INTERPRO). This group is separated phylogenetically from related proteins with similar architecture and contains a number of proteins associated with heavy metal resistance efflux systems for copper, silver, cadmium, and/or zinc. ; GO: 0000155 two-component sensor activity, 0016775 phosphotransferase activity nitrogenous group as acceptor, 0000160 two-component signal transduction system (phosphorelay), 0007600 sensory perception. Probab=93.35 E-value=0.16 Score=29.78 Aligned_cols=92 Identities=16% Similarity=0.299 Sum_probs=56.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHCC--CCCEEEEEECCC---CEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCC Q ss_conf 775502202443666678777618--997699998289---839999669843444176657870123454520024777 Q gi|254780222|r 69 TDEKALHHLFSEVIDNAMDEVIAG--YANLIEVSLDGN---GFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKF 143 (686) Q Consensus 69 t~~~GL~~i~~EIldNavDe~~~g--~~~~I~V~i~~d---g~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnf 143 (686) -...|=-.||.=.|.|=++||++- ..+.|.|.+..+ -.|+|+|.|+|||= + . .+-+|=..+= T Consensus 369 ~~~~gD~~lfrRA~sNLLsNA~rhtp~g~~I~v~~~~~~d~~~~~v~N~G~~Ip~----e--~---L~rLFdRFYR---- 435 (483) T TIGR01386 369 AEVRGDALLFRRALSNLLSNALRHTPAGSTIDVRVERRADEVRVSVENTGDGIPP----E--H---LSRLFDRFYR---- 435 (483) T ss_pred CEEECCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCEEEEEEECCCCCCCC----C--C---CCCCCCCCCC---- T ss_conf 0560247888999999999998608998858999961477568998358799985----2--3---5655530026---- Q ss_pred CCCCEEE-ECCCCCHHHHHHHHEE----EEEEEEE Q ss_conf 7774156-3144538898762032----1269999 Q gi|254780222|r 144 DNSTYEI-SGGLHGVGISVVNALS----DELHVTV 173 (686) Q Consensus 144 dd~~yk~-sgGlnGvGaklvNalS----~~f~Vev 173 (686) .|.+-.- ++==+|+|.+-|-.-= =...||. T Consensus 436 ~D~aR~~G~~~g~GLGLAIV~SI~~~HGG~~~aeS 470 (483) T TIGR01386 436 VDPARSNGSQEGTGLGLAIVRSIMEAHGGRAEAES 470 (483) T ss_pred CCHHHCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE T ss_conf 76333276645676048999999982398689986 No 70 >PRK13557 histidine kinase; Provisional Probab=92.88 E-value=0.13 Score=30.43 Aligned_cols=44 Identities=7% Similarity=0.095 Sum_probs=24.1 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEC---CCCEEEEEECC Q ss_conf 54677755022024436666787776189976999982---89839999669 Q gi|254780222|r 65 YIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLD---GNGFLTVVDNG 113 (686) Q Consensus 65 YIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~---~dg~IsV~ddG 113 (686) -++..+..++.|-..|+.=-+++. .+ .-| |..| .||.|.-+|.+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~v~~-~~---~~~-~~~d~~~~Dg~I~~vN~a 59 (538) T PRK13557 13 PVDESAAGDVSDHRSDIFFAAVET-TR---MPM-IVTDPRQPDNPIVFANRA 59 (538) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHH-CC---CCE-EEECCCCCCCEEEEECHH T ss_conf 999886410567889999999960-79---738-993589999879998399 No 71 >PRK11091 aerobic respiration control sensor protein ArcB; Provisional Probab=92.80 E-value=0.15 Score=29.89 Aligned_cols=107 Identities=21% Similarity=0.329 Sum_probs=59.6 Q ss_pred CCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEC-CC-C--EEEEEECCCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 698154677755022024436666787776189976999982-89-8--3999966984344417665787012345452 Q gi|254780222|r 61 RPGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLD-GN-G--FLTVVDNGRGIPVENHPKFPNKSTLEIILTT 136 (686) Q Consensus 61 RP~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~-~d-g--~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~ 136 (686) -|...+| |..-|.+|++-+|.||+---..| .|.|.+. .+ . .++|.|.|.|||-+..+ .+.. -|.. T Consensus 388 lP~~v~g--D~~RLrQIL~NLlsNAIKFT~~G---~V~l~v~~~~~~~l~f~V~DTGiGI~~e~l~---~IF~---~F~Q 456 (779) T PRK11091 388 LPHKVIT--DGTRLRQILWNLISNAVKFTQQG---QVTVRVRYEDGDMLHFEVEDSGIGIPEDELD---KIFA---MYYQ 456 (779) T ss_pred CCCEEEC--CHHHHHHHHHHHHHHHHHCCCCC---CEEEEEEECCCCEEEEEEEECCCCCCHHHHH---HHHC---CCEE T ss_conf 9862863--89999999999999997738999---7799999827988999999579999999999---8705---7560 Q ss_pred EECCCCCCCCCEEEECCCCCHHHHHHHHEEEEEE----EEEEC-CCEEEEEEE Q ss_conf 0024777777415631445388987620321269----99941-883899997 Q gi|254780222|r 137 LHAGGKFDNSTYEISGGLHGVGISVVNALSDELH----VTVAR-QNTIFSQKF 184 (686) Q Consensus 137 L~aggnfdd~~yk~sgGlnGvGaklvNalS~~f~----Vev~r-~gk~y~q~f 184 (686) ... +..+-.+|=-|+|.++|.-|-+.+- |++.- .|..|.-+. T Consensus 457 ~~~------s~~~r~~~GTGLGLaI~k~LvelMGG~I~V~S~~G~GStFt~~L 503 (779) T PRK11091 457 VKD------SHGGKPATGTGIGLAVSRRLAQNMGGDITVTSEEGKGSTFTLTI 503 (779) T ss_pred CCC------CCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCEEEEEE T ss_conf 667------87776778876079999999998499799995599871799999 No 72 >COG3851 UhpB Signal transduction histidine kinase, glucose-6-phosphate specific [Signal transduction mechanisms] Probab=92.68 E-value=0.2 Score=29.10 Aligned_cols=18 Identities=17% Similarity=0.270 Sum_probs=9.8 Q ss_pred HHHCCHHHHHHHHHHCCC Q ss_conf 763228899999882566 Q gi|254780222|r 506 KIRNNQQIMDLVQALGCR 523 (686) Q Consensus 506 ki~~N~Ei~~l~~~lGl~ 523 (686) .+.=-+-|++++.-+++. T Consensus 369 dL~l~qai~~l~~Em~~~ 386 (497) T COG3851 369 DLTLEQAIRSLLREMELE 386 (497) T ss_pred CCCHHHHHHHHHHHHHHH T ss_conf 257899999999985254 No 73 >PRK10841 hybrid sensory kinase in two-component regulatory system with RcsB and YojN; Provisional Probab=92.53 E-value=0.051 Score=33.18 Aligned_cols=38 Identities=21% Similarity=0.239 Sum_probs=14.9 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCEEEEECCCCCCHHH Q ss_conf 999998825665433333465525189997798771468 Q gi|254780222|r 513 IMDLVQALGCRTRSQYREEDLRYEKVIIMTDADVDGAHI 551 (686) Q Consensus 513 i~~l~~~lGl~~g~~~~~~~LRYgkIiImTDaD~DGsHI 551 (686) +..+..-.|+..-. +......-..++|..|.+..+... T Consensus 722 l~~~L~~~Gi~v~~-~~~~~~~~~dvlI~d~~~~~~~~~ 759 (947) T PRK10841 722 LETLLQRSGIHVQR-YEGQEPTPEDVLITDDPVQKKWQG 759 (947) T ss_pred HHHHHHHCCCEEEE-ECCCCCCCCCEEEECCCCCCCCCH T ss_conf 99999865953786-123456744326632533444312 No 74 >PRK10547 chemotaxis protein CheA; Provisional Probab=92.53 E-value=0.14 Score=30.21 Aligned_cols=19 Identities=16% Similarity=0.137 Sum_probs=6.5 Q ss_pred HHHHHHHH-CCCCCEEEEEC Q ss_conf 98888753-05773599951 Q gi|254780222|r 229 KMTQSKAY-LSGRVKTCWSC 247 (686) Q Consensus 229 ~rl~~~A~-L~~gl~i~~~~ 247 (686) +.+|++|- +++.+.+.+.+ T Consensus 346 R~VRdlar~~gK~V~L~i~G 365 (662) T PRK10547 346 RLVRDLAGKLGKQVELTLVG 365 (662) T ss_pred HHHHHHHHHHCCEEEEEEEC T ss_conf 99999999869825999966 No 75 >PRK13837 two-component VirA-like sensor kinase; Provisional Probab=92.51 E-value=0.18 Score=29.32 Aligned_cols=13 Identities=15% Similarity=0.335 Sum_probs=5.4 Q ss_pred CHHHHHHHHHHHH Q ss_conf 1468899999999 Q gi|254780222|r 548 GAHIASLLLTFFY 560 (686) Q Consensus 548 GsHI~~Lll~ff~ 560 (686) -.+|.-.|+|+.- T Consensus 561 ~~QL~QVllNL~~ 573 (831) T PRK13837 561 PAQLQQVILNLCK 573 (831) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999999 No 76 >cd01027 TOPRIM_RNase_M5_like TOPRIM_ RNase M5_like: The topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain found in Ribonuclease M5: (RNase M5) and other small primase-like proteins from bacteria and archaea. RNase M5 catalyzes the maturation of 5S rRNA in low G+C Gram-positive bacteria. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). The conserved glutamate may act as a general base in nucleotide polymerization by primases. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=90.70 E-value=1.2 Score=23.80 Aligned_cols=69 Identities=30% Similarity=0.506 Sum_probs=45.1 Q ss_pred EEEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEC Q ss_conf 37997327676421001011222012211231314454108776322889999988256654333334655251899977 Q gi|254780222|r 464 TELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQYREEDLRYEKVIIMTD 543 (686) Q Consensus 464 ~eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~~~~~LRYgkIiImTD 543 (686) .+.|+|||-+=..+.++.-+. --+.-..|-.++- +..+++.- .|..|||||| T Consensus 2 ~evIvVEGk~D~~~l~~~~~~--~~ii~t~g~~~~~------------~~~~~i~~--------------~~~~VIIlTD 53 (81) T cd01027 2 GEVIIVEGKNDTESLKKLGIE--AEIIETNGSIINK------------ETIELIKK--------------AYRGVIILTD 53 (81) T ss_pred CEEEEEECCCHHHHHHHHCCC--EEEEEECCCCCCH------------HHHHHHHH--------------HCCCEEEEEC T ss_conf 769999556189999986378--1399989903459------------99999998--------------5598599968 Q ss_pred CCCCCHHHHHHHHHHHH Q ss_conf 98771468899999999 Q gi|254780222|r 544 ADVDGAHIASLLLTFFY 560 (686) Q Consensus 544 aD~DGsHI~~Lll~ff~ 560 (686) .|--|.-||-.|...|- T Consensus 54 ~D~~Gekirk~i~~~l~ 70 (81) T cd01027 54 PDRKGEKIRKKLSEYLS 70 (81) T ss_pred CCCCHHHHHHHHHHHHC T ss_conf 99670799999999822 No 77 >PRK09959 hybrid sensory histidine kinase in two-component regulatory system with EvgA; Provisional Probab=88.11 E-value=0.43 Score=26.78 Aligned_cols=17 Identities=24% Similarity=0.505 Sum_probs=10.4 Q ss_pred CCCCHHHHC-CCCEEECC Q ss_conf 112103432-69815467 Q gi|254780222|r 52 LEGLEPVRM-RPGMYIGG 68 (686) Q Consensus 52 L~glE~Vrk-RP~mYIGs 68 (686) +.+|++|.. .-+.|||. T Consensus 195 ~~Al~AVa~G~aD~~igd 212 (1197) T PRK09959 195 YQALASVSAGQNDYFIGS 212 (1197) T ss_pred HHHHHHHHCCCCCEEEEC T ss_conf 999999977998889602 No 78 >COG3852 NtrB Signal transduction histidine kinase, nitrogen specific [Signal transduction mechanisms] Probab=87.78 E-value=2.1 Score=22.10 Aligned_cols=16 Identities=25% Similarity=0.374 Sum_probs=7.0 Q ss_pred CCEEEECCCEEEEEEC Q ss_conf 9389930996999748 Q gi|254780222|r 570 KHLFLISPPLFRITQG 585 (686) Q Consensus 570 g~v~~~~tPl~kv~~g 585 (686) |..|-..-||-.+..| T Consensus 281 g~r~rl~l~leViDNG 296 (363) T COG3852 281 GTRYRLALPLEVIDNG 296 (363) T ss_pred CCEEEEEEEEEEECCC T ss_conf 7424663116986189 No 79 >COG4564 Signal transduction histidine kinase [Signal transduction mechanisms] Probab=87.55 E-value=1.1 Score=24.00 Aligned_cols=30 Identities=23% Similarity=0.324 Sum_probs=20.8 Q ss_pred EEECCCCCCCEE--ECCHHHHHHHHHHHHHHH Q ss_conf 830006557524--130467899999987630 Q gi|254780222|r 319 CNTILTDEGGTH--ESGLRVALTRGIKKYAEL 348 (686) Q Consensus 319 VN~I~T~~GGTH--v~g~~~ai~k~i~~~~~~ 348 (686) .|-++++.||.| .+--.+++..+|++.-+. T Consensus 277 ~~ql~~p~~~a~~aieKaa~aL~~Ai~EVRRi 308 (459) T COG4564 277 ARQLNPPKGGAHPAIEKAADALNGAIKEVRRI 308 (459) T ss_pred HCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 62677998887501566789999899999985 No 80 >cd01028 TOPRIM_TopoIA TOPRIM_TopoIA: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in the type IA family of DNA topoisomerases (TopoIA). This subgroup contains proteins similar to the Type I DNA topoisomerases: E. coli topisomerases I and III, eukaryotic topoisomerase III and, ATP-dependent reverse gyrase found in archaea and thermophilic bacteria. Type IA DNA topoisomerases remove (relax) negative supercoils in the DNA. These enzymes cleave one strand of the DNA duplex, covalently link to the 5' phosphoryl end of the DNA break and allow the other strand of the duplex to pass through the gap. Reverse gyrase is also able to insert positive supercoils in the presence of ATP and negative supercoils in the presence of AMPPNP. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). For topoisomerases the conserved glutamate is believed to act as a general b Probab=85.77 E-value=1.6 Score=22.91 Aligned_cols=50 Identities=18% Similarity=0.239 Sum_probs=33.6 Q ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCEEEEECCHHHHHHHH Q ss_conf 52518999779877146889999999987088863393899309969997489799937899999999 Q gi|254780222|r 534 RYEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISPPLFRITQGTKSVYAHDEEHKQKIL 601 (686) Q Consensus 534 RYgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv~~g~~~~y~~~e~e~~~~~ 601 (686) +...|+|+||+|--|-+|.-.++..+.. ..-|+.|++ |..+|+.+-.+.+ T Consensus 90 ~~d~iiiAtD~DrEGE~I~~~i~~~~~~-------------~~~~v~R~~-----fsslT~~~I~~A~ 139 (142) T cd01028 90 KADEIVLATDPDREGELIAWEILEVLKC-------------DNKPVKRAW-----FSEITPKAIREAF 139 (142) T ss_pred HCCEEEECCCCCCCHHHHHHHHHHHHCC-------------CCCCCEEEE-----EECCCHHHHHHHH T ss_conf 1999998889982279999999999578-------------899927899-----8357999999999 No 81 >TIGR02956 TMAO_torS TMAO reductase sytem sensor TorS; InterPro: IPR014302 This protein, TorS, is part of a regulatory system for the torCAD operon that encodes the pterin molybdenum cofactor-containing enzyme trimethylamine-N-oxide (TMAO) reductase (TorA), a cognate chaperone (TorD), and a penta-haem cytochrome (TorC). TorS works together with the inducer-binding protein TorT and the response regulator TorR. TorS contains histidine kinase ATPase (IPR003594 from INTERPRO), HAMP (IPR003660 from INTERPRO), phosphoacceptor (IPR003661 from INTERPRO), and phosphotransfer (IPR008207 from INTERPRO) domains and a response regulator receiver domain (IPR001789 from INTERPRO).. Probab=85.75 E-value=0.85 Score=24.76 Aligned_cols=106 Identities=28% Similarity=0.449 Sum_probs=65.5 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC-------EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCC Q ss_conf 755022024436666787776189976999982898-------3999966984344417665787012345452002477 Q gi|254780222|r 70 DEKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNG-------FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGK 142 (686) Q Consensus 70 ~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg-------~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggn 142 (686) |.+-+.+++.=.|-|||==-.+|- =.|.|.++.|. .+.|.|-|-|||=+ ..-.+--+|+.+-+ T Consensus 619 D~~riRQvL~NLvgNaIKFT~~Gs-v~l~~~l~~~~~~gdsel~F~V~DtG~GIae~------~~~~lF~aF~Qa~e--- 688 (1052) T TIGR02956 619 DLRRIRQVLINLVGNAIKFTDRGS-VVLRVSLNDDSSSGDSELLFEVEDTGVGIAEE------EQATLFDAFTQADE--- 688 (1052) T ss_pred CCCHHHHHHHHHHHCCCEECCCCE-EEEEEEECCCCCCCCCEEEEEEEECCCCCCHH------HHHHHCCCCCCCCC--- T ss_conf 661344767654413200434526-99998855888986634788886247799879------99854331012101--- Q ss_pred CCCCCEEEECCCCCHHH----HHHHHEEEE---EEEEEECC-CEE--EEEEEECCCEE Q ss_conf 77774156314453889----876203212---69999418-838--99997279451 Q gi|254780222|r 143 FDNSTYEISGGLHGVGI----SVVNALSDE---LHVTVARQ-NTI--FSQKFSRGIPL 190 (686) Q Consensus 143 fdd~~yk~sgGlnGvGa----klvNalS~~---f~Vev~r~-gk~--y~q~f~~g~~~ 190 (686) +.|.+||. |+|. +||.|.--. +.|++--+ |.. |+.-+.+|.+. T Consensus 689 ----g~~~~gGT-GLGLAIs~~Lv~AM~G~GrGl~v~S~~~~GScF~F~lpl~~~~~~ 741 (1052) T TIGR02956 689 ----GRRKSGGT-GLGLAISRRLVEAMDGEGRGLGVESELGQGSCFWFTLPLARGKPA 741 (1052) T ss_pred ----CCCCCCCC-CCHHHHHHHHHHHHCCCCCCCEEEEECCCCCEEEEECCCCCCCHH T ss_conf ----37313776-506899999999708887650166645875022124231337500 No 82 >COG4192 Signal transduction histidine kinase regulating phosphoglycerate transport system [Signal transduction mechanisms] Probab=84.79 E-value=1.6 Score=22.85 Aligned_cols=55 Identities=24% Similarity=0.274 Sum_probs=24.8 Q ss_pred CCCCCEEEEECCC---CCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEE-C----CEEEEECCH Q ss_conf 5525189997798---771468899999999870888633938993099699974-8----979993789 Q gi|254780222|r 533 LRYEKVIIMTDAD---VDGAHIASLLLTFFYQEMYDLIEQKHLFLISPPLFRITQ-G----TKSVYAHDE 594 (686) Q Consensus 533 LRYgkIiImTDaD---~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv~~-g----~~~~y~~~e 594 (686) -|-+++|--||-= -|---|.-.++|.|-. .|=..-| -||-+++.. | .-.+|..+. T Consensus 544 ~rQ~~Li~ptD~~~V~gd~v~ieQVlvNl~~N---aldA~~h----~~p~i~~~~~~~~~e~l~i~i~Dn 606 (673) T COG4192 544 RRQIKLINPTDDLMVMGDAVSIEQVLVNLIVN---ALDASTH----FAPWIKLIALGTEQEMLRIAIIDN 606 (673) T ss_pred HCCCCCCCCCCCCEECCHHHHHHHHHHHHHHH---HHHHHCC----CCCEEEEEEECCCCCCEEEEEECC T ss_conf 00244237765300000156499999999998---8866415----786279986427642158998448 No 83 >COG0643 CheA Chemotaxis protein histidine kinase and related kinases [Cell motility and secretion / Signal transduction mechanisms] Probab=83.82 E-value=1.3 Score=23.44 Aligned_cols=19 Identities=26% Similarity=0.450 Sum_probs=14.3 Q ss_pred HCCHHHHHHHHHHCCCCCC Q ss_conf 3228899999882566543 Q gi|254780222|r 508 RNNQQIMDLVQALGCRTRS 526 (686) Q Consensus 508 ~~N~Ei~~l~~~lGl~~g~ 526 (686) +..+||-+||-.=|--... T Consensus 511 lSd~Ei~~LIF~PGFSTa~ 529 (716) T COG0643 511 LSDEEILNLIFAPGFSTAE 529 (716) T ss_pred CCHHHHHHHHHCCCCCCCH T ss_conf 7999999887369987341 No 84 >smart00493 TOPRIM topoisomerases, DnaG-type primases, OLD family nucleases and RecR proteins. Probab=80.45 E-value=4.2 Score=19.98 Aligned_cols=70 Identities=30% Similarity=0.455 Sum_probs=40.5 Q ss_pred EEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEECC Q ss_conf 79973276764210010112220122112313144541087763228899999882566543333346552518999779 Q gi|254780222|r 465 ELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQYREEDLRYEKVIIMTDA 544 (686) Q Consensus 465 eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~~~~~LRYgkIiImTDa 544 (686) +|+||||-+-.-...+.... -+=+.++.|-+++ .|+.++++ +... + ..|||+||. T Consensus 2 ~liIVEg~~d~~~i~~~~~~-~~~~~~~~G~~~~------------~~~i~~l~-------~~~~--~---~eViiatD~ 56 (76) T smart00493 2 VLIIVEGPADAIALEKAGGF-GGNVVALGGHLLK------------KEIIKLLK-------RLAK--K---KEVILATDP 56 (76) T ss_pred EEEEECCHHHHHHHHHHCCC-CEEEEECCCCCCC------------HHHHHHHH-------HHHC--C---CEEEEECCC T ss_conf 79998788899999985699-9799982152350------------44899999-------8633--8---979997698 Q ss_pred CCCCHHHHHHHHHHH Q ss_conf 877146889999999 Q gi|254780222|r 545 DVDGAHIASLLLTFF 559 (686) Q Consensus 545 D~DGsHI~~Lll~ff 559 (686) |..|--|.--+...+ T Consensus 57 D~eGe~~a~~i~~~l 71 (76) T smart00493 57 DREGEAIAWKLAELL 71 (76) T ss_pred CCCHHHHHHHHHHHH T ss_conf 944699999999987 No 85 >cd01026 TOPRIM_OLD TOPRIM_OLD: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in bacterial and archaeal nucleases of the OLD (overcome lysogenization defect) family. The bacteriophage P2 OLD protein, which has DNase as well as RNase activity, consists of an N-terminal ABC-type ATPase domain and a C-terminal Toprim domain; the nuclease activity of OLD is stimulated by ATP, though the ATPase activity is not DNA-dependent. Functional details on OLD are scant and further experimentation is required to define the relationship between the ATPase and Toprim nuclease domains. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). The conserved glutamate may act as a general acid in strand cleavage by nucleases. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=79.73 E-value=3.2 Score=20.79 Aligned_cols=59 Identities=32% Similarity=0.533 Sum_probs=39.0 Q ss_pred EEEEECCCCCCCC----HH-HC--CCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 7997327676421----00-10--11222012211231314454108776322889999988256654333334655251 Q gi|254780222|r 465 ELFIVEGDSAGGS----AK-QA--RNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQYREEDLRYEK 537 (686) Q Consensus 465 eL~lvEGDSA~gs----Ak-~g--rdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~~~~~LRYgk 537 (686) ..+||||+|=... |+ .+ .+...-.|.|++|| |... +..|...||. + T Consensus 5 ~viLVEG~tE~~~l~~~~~~~~~~l~~~~i~ii~v~G~--~~~~-----------~~~l~~~l~i--------------~ 57 (97) T cd01026 5 KVILVEGDSEEILLPALAKKLGLDLDEAGISIIPVGGK--NFKP-----------FIKLLNALGI--------------P 57 (97) T ss_pred EEEEEECHHHHHHHHHHHHHHCCCHHHCCCEEEEECCC--CHHH-----------HHHHHHHCCC--------------C T ss_conf 79999441399999999999697977879789985892--7499-----------9999997699--------------3 Q ss_pred EEEEECCCCCCHH Q ss_conf 8999779877146 Q gi|254780222|r 538 VIIMTDADVDGAH 550 (686) Q Consensus 538 IiImTDaD~DGsH 550 (686) +.|+||.|....+ T Consensus 58 ~~vi~D~D~~~~~ 70 (97) T cd01026 58 VAVLTDLDAKRNE 70 (97) T ss_pred EEEEECCCCCCCC T ss_conf 8999718987465 No 86 >COG4251 Bacteriophytochrome (light-regulated signal transduction histidine kinase) [Signal transduction mechanisms] Probab=79.21 E-value=2.1 Score=22.13 Aligned_cols=19 Identities=21% Similarity=0.559 Sum_probs=13.5 Q ss_pred CHHHHHHHHEEEEEEEEEECCCEEEEE Q ss_conf 388987620321269999418838999 Q gi|254780222|r 156 GVGISVVNALSDELHVTVARQNTIFSQ 182 (686) Q Consensus 156 GvGaklvNalS~~f~Vev~r~gk~y~q 182 (686) |||||+ .|-+..+|+.|-- T Consensus 263 GV~ASm--------SISivv~g~LWGL 281 (750) T COG4251 263 GVGASM--------SISIVVDGKLWGL 281 (750) T ss_pred CCCEEE--------EEEEEECCEEEEE T ss_conf 754036--------9999988855776 No 87 >TIGR01286 nifK nitrogenase molybdenum-iron protein beta chain; InterPro: IPR005976 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene , . Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes . It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I .; GO: 0016163 nitrogenase activity, 0009399 nitrogen fixation, 0016612 molybdenum-iron nitrogenase complex. Probab=78.13 E-value=1.5 Score=23.05 Aligned_cols=65 Identities=22% Similarity=0.337 Sum_probs=32.3 Q ss_pred CCCHHHHHCCCCCCCCCCCCCEEEEEE------------------ECCCEEEEEEEEEEECCCCCCEEEEEEECCCCCCC Q ss_conf 861222100246776667752033332------------------14801558999854146777427888300065575 Q gi|254780222|r 267 GGLKTYLQTKLENRSLISSEIFTGKTE------------------KKGTHRGTVEWAIAWCEENPEITSYCNTILTDEGG 328 (686) Q Consensus 267 ~Gi~dyv~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~veval~~~~~~~~~~SFVN~I~T~~GG 328 (686) |||+..++=+.+...+..|+.|..++. .++.-...+.|.+++++ ||| | T Consensus 125 GGl~NmvdGL~N~~alYkPkmIAVSTTCMAEVIGdDL~AFi~nAk~~g~ip~dfPVpFAHTP------sFV--------G 190 (526) T TIGR01286 125 GGLKNMVDGLQNIYALYKPKMIAVSTTCMAEVIGDDLRAFIGNAKEEGVIPDDFPVPFAHTP------SFV--------G 190 (526) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC------CCC--------C T ss_conf 25677999997404325896466304544011013589999875306888888767865477------732--------6 Q ss_pred EEECCHHHHHHHHHHHHH Q ss_conf 241304678999999876 Q gi|254780222|r 329 THESGLRVALTRGIKKYA 346 (686) Q Consensus 329 THv~g~~~ai~k~i~~~~ 346 (686) +||+|+-+ +.+.|-++. T Consensus 191 SHitGYD~-M~kGil~~l 207 (526) T TIGR01286 191 SHITGYDS-MLKGILETL 207 (526) T ss_pred CEECCCCH-HHHHHHHHH T ss_conf 50025322-689999985 No 88 >COG1658 Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair] Probab=74.89 E-value=3.3 Score=20.73 Aligned_cols=102 Identities=26% Similarity=0.337 Sum_probs=57.9 Q ss_pred EEEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEC Q ss_conf 37997327676421001011222012211231314454108776322889999988256654333334655251899977 Q gi|254780222|r 464 TELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQYREEDLRYEKVIIMTD 543 (686) Q Consensus 464 ~eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~~~~~LRYgkIiImTD 543 (686) -+.|+|||=+-..+.++.-+ -+++-..|.-+|- .+.+..|. +.++|-.|||+|| T Consensus 10 ~~vIVVEGK~D~~~l~~~~~---~~~i~~~g~~i~~----------~~~ie~i~-------------~~~~~k~VIILTD 63 (127) T COG1658 10 KEVIVVEGKDDTASLKRLGD---AGVIITNGSAINS----------LETIELIK-------------KAQKYKGVIILTD 63 (127) T ss_pred CCEEEEECCCHHHHHHHHCC---CCEEEECCCCCCH----------HHHHHHHH-------------HHHCCCCEEEEEC T ss_conf 96699967767999998546---8638975865427----------88999999-------------8642587799868 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHCCCEEE---ECCCEEEEEE-CCEEEEECCHHHHHH Q ss_conf 9877146889999999987088863393899---3099699974-897999378999999 Q gi|254780222|r 544 ADVDGAHIASLLLTFFYQEMYDLIEQKHLFL---ISPPLFRITQ-GTKSVYAHDEEHKQK 599 (686) Q Consensus 544 aD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~---~~tPl~kv~~-g~~~~y~~~e~e~~~ 599 (686) .|.-|.-||--|.- ++|+ .-+.|+ ...|+ +... ...+-.++++.+... T Consensus 64 ~D~~Ge~Irk~l~~----~l~~---~~~~~id~~~~~~~-~~~~~i~gVE~~~~~~~~~~ 115 (127) T COG1658 64 PDRKGERIRKKLKE----YLPG---AKGAFIDREIRNKL-KINGKIIGVEEASSEALRKA 115 (127) T ss_pred CCCCHHHHHHHHHH----HCCC---CCCCCCCHHHHHHC-CCCCCCCCCEECCHHHHHHH T ss_conf 98562899999999----7152---54654148985130-36755236110784889999 No 89 >COG2865 Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription] Probab=72.24 E-value=7 Score=18.51 Aligned_cols=13 Identities=31% Similarity=0.450 Sum_probs=5.8 Q ss_pred CCCCCHHHHHHHC Q ss_conf 7788988998860 Q gi|254780222|r 620 LGEMLASQLKETA 632 (686) Q Consensus 620 LGe~~~~e~~et~ 632 (686) +|-+++.++++.+ T Consensus 414 n~~~T~~~L~~~l 426 (467) T COG2865 414 NGKVTARELREIL 426 (467) T ss_pred CCCCCHHHHHHHH T ss_conf 6655889999885 No 90 >COG3275 LytS Putative regulator of cell autolysis [Signal transduction mechanisms] Probab=72.15 E-value=3.9 Score=20.23 Aligned_cols=57 Identities=14% Similarity=0.257 Sum_probs=28.5 Q ss_pred CHHHHHHHHHHCCCCCCCCCCCCCCCC---CEEEEECCCCCCHH-----HHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 288999998825665433333465525---18999779877146-----889999999987088863393899 Q gi|254780222|r 510 NQQIMDLVQALGCRTRSQYREEDLRYE---KVIIMTDADVDGAH-----IASLLLTFFYQEMYDLIEQKHLFL 574 (686) Q Consensus 510 N~Ei~~l~~~lGl~~g~~~~~~~LRYg---kIiImTDaD~DGsH-----I~~Lll~ff~~~~p~Li~~g~v~~ 574 (686) .+|+..+..-| .++++||+ .|.|--|.-.--.+ |..|+=|-+-+=.+++.+.|.|-+ T Consensus 420 ~kEl~~v~AYl--------~IEkARF~~rL~v~i~id~~l~~~~iP~filQPLVENAIKHG~~~~~~~g~V~I 484 (557) T COG3275 420 SKELEHVNAYL--------SIEKARFGDRLDVVIDIDEELRQVQIPSFILQPLVENAIKHGISQLKDTGRVTI 484 (557) T ss_pred HHHHHHHHHHH--------HHHHHHCCCCEEEEEECCHHHHHCCCCHHHHHHHHHHHHHHCCCCHHCCCCEEE T ss_conf 99999999999--------899864488447999468777533686166778888888753644420881799 No 91 >TIGR02916 PEP_his_kin putative PEP-CTERM system histidine kinase; InterPro: IPR014265 Proteins in this entry are putative periplasmic sensor signal transduction histidine kinases. They all contain a GAF domain that is present in phytochromes and cGMP-specific phosphodiesterases, and which has been experimentally proven to be involved in protein:protein interactions. They also contain a C-terminal histidine kinase domain, which is composed of a dimerisation sub-domain and an ATP/ADP-binding phosphotransfer, or catalytic, sub-domain. The proteins in this entry are found strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria, where members of IPR014264 from INTERPRO and IPR014266 from INTERPRO also occur.. Probab=66.50 E-value=8.6 Score=17.87 Aligned_cols=16 Identities=25% Similarity=0.412 Sum_probs=8.0 Q ss_pred CHHHHHHHHEEEEEEE Q ss_conf 3889876203212699 Q gi|254780222|r 156 GVGISVVNALSDELHV 171 (686) Q Consensus 156 GvGaklvNalS~~f~V 171 (686) +.|-.++.++|.-++. T Consensus 331 ~L~~~v~~~La~~ves 346 (696) T TIGR02916 331 DLGERVIKALAQLVES 346 (696) T ss_pred HHHHHHHHHHHHHCCC T ss_conf 7899999999863026 No 92 >KOG0019 consensus Probab=65.83 E-value=9.4 Score=17.62 Aligned_cols=155 Identities=18% Similarity=0.181 Sum_probs=81.8 Q ss_pred HHHHHHHHHHHH--HH-----C-----CCCCEEEEEECCC-CEEEEEECCCCCCCCCCCCCCCCEEEEEEEE-EEECCCC Q ss_conf 244366667877--76-----1-----8997699998289-8399996698434441766578701234545-2002477 Q gi|254780222|r 77 LFSEVIDNAMDE--VI-----A-----GYANLIEVSLDGN-GFLTVVDNGRGIPVENHPKFPNKSTLEIILT-TLHAGGK 142 (686) Q Consensus 77 i~~EIldNavDe--~~-----~-----g~~~~I~V~i~~d-g~IsV~ddGrGIPv~~h~k~~~~~~~Elif~-~L~aggn 142 (686) .+.|.|.||-|. .+ . ++--.|.++.+++ +.+++.|.|-|+--+ +=+.. +| ...+|+| T Consensus 61 FlRELISNaSDAldKiRy~~lt~~~~~~~~l~I~i~~nk~~~tlti~DtGIGMTk~-----dLvnn----LGTIAkSGtK 131 (656) T KOG0019 61 FLRELISNASDALEKLRYLELKGDEKALPELEIRIITNKDKRTITIQDTGIGMTKE-----DLVNN----LGTIAKSGSK 131 (656) T ss_pred HHHHHHCCCCCHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCEEEEEECCCCCCHH-----HHHHH----HHHHHHCCCH T ss_conf 88765314640477788776247433566326896158876558998447776889-----99865----5156434658 Q ss_pred C------CCCCEEEECCCCCHHHHHHHHEEEEEEEEEECC-CEEEEEEEECCCEECCCEECCCCCCCCCEEEEEEE--CH Q ss_conf 7------777415631445388987620321269999418-83899997279451131121676778853999998--42 Q gi|254780222|r 143 F------DNSTYEISGGLHGVGISVVNALSDELHVTVARQ-NTIFSQKFSRGIPLSPLEKKEKVRNKRGTRITFRP--DP 213 (686) Q Consensus 143 f------dd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~-gk~y~q~f~~g~~~~~~~~i~~~~~k~GT~ItF~P--D~ 213 (686) = +..+..--.|.-|||.--+.....+..|.+..+ ...|..++..|-.- .......-.+||+|.-.- |. T Consensus 132 ~Fmealkea~ad~~~IGQFGvGFYSaylVAdkV~V~tk~~~~e~y~Wes~~~gs~---~v~~~~~~~rGTki~l~lKe~~ 208 (656) T KOG0019 132 AFLEALKEAEAESNLIGQFGVGFYSAFMVADRVVVTTRHPADEGLQWTSNGRGSY---EIAEASGLRTGTKIVIHLKEGD 208 (656) T ss_pred HHHHHHHHCCCCHHHHHHCCCCHHHHHHHHHEEEEEECCCCCCCEEEECCCCCCE---EEEECCCCCCCCEEEEEEHHHH T ss_conf 9999887634411122110100456665123268861268874104313788726---8762367555304776502103 Q ss_pred HHEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCC Q ss_conf 432001222289999988887530577359995164 Q gi|254780222|r 214 KIFGDKAGFDAARLLKMTQSKAYLSGRVKTCWSCDK 249 (686) Q Consensus 214 ~iFg~~~~~d~~~l~~rl~~~A~L~~gl~i~~~~~~ 249 (686) .-| .+.++|...+..-+. .-+..|.++.++ T Consensus 209 ~ey-----~ee~rikeiVKK~S~-Fv~yPI~l~~ek 238 (656) T KOG0019 209 CEF-----LEEKRIKEVVKKYSN-FVSYPIYLNGER 238 (656) T ss_pred HHH-----CCHHHHHHHHHHCCC-CCCCCCHHHHHH T ss_conf 432-----427678888752033-224410103555 No 93 >LOAD_Toprim consensus Probab=65.20 E-value=9.6 Score=17.54 Aligned_cols=70 Identities=26% Similarity=0.497 Sum_probs=41.3 Q ss_pred EEEEECCCCCCCCHHHCCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEECC Q ss_conf 79973276764210010112220122112313144541087763228899999882566543333346552518999779 Q gi|254780222|r 465 ELFIVEGDSAGGSAKQARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQYREEDLRYEKVIIMTDA 544 (686) Q Consensus 465 eL~lvEGDSA~gsAk~grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~~~~~LRYgkIiImTDa 544 (686) .|+||||-+=.-...+-++ .++ |+..-|-+.+ ..+..|.... .....|+|+||. T Consensus 2 ~liIVEsp~k~k~I~~~l~-~~~-vias~GHi~~------------~~~~~~~~~~------------~~~~~VilATDp 55 (98) T LOAD_Toprim 2 KIIVVEGPADAKAIERFGG-DKN-VIASKGHARK------------KSVLELIKLA------------LKAKEVIIATDP 55 (98) T ss_pred EEEEEECHHHHHHHHHHCC-CCE-EEECCCCCCC------------CHHHHHHHHH------------HCCCEEEEECCC T ss_conf 4999928488999999859-987-9979588325------------1699999998------------469929994696 Q ss_pred CCCCHHHHHHHHHHHH Q ss_conf 8771468899999999 Q gi|254780222|r 545 DVDGAHIASLLLTFFY 560 (686) Q Consensus 545 D~DGsHI~~Lll~ff~ 560 (686) |.-|--|+-=|...|- T Consensus 56 DrEGEaIa~hl~~~l~ 71 (98) T LOAD_Toprim 56 DREGEKIAKKLLELLR 71 (98) T ss_pred CCCHHHHHHHHHHHHC T ss_conf 9652899999999836 No 94 >cd03363 TOPRIM_TopoIA_TopoI TOPRIM_TopoIA_TopoI: The topoisomerase-primase (TORPIM) domain found in members of the type IA family of DNA topoisomerases (Topo IA) similar to Escherichia coli DNA topoisomerase I. Type IA DNA topoisomerases remove (relax) negative supercoils in the DNA by: cleaving one strand of the DNA duplex, covalently linking to the 5' phosphoryl end of the DNA break and, allowing the other strand of the duplex to pass through the gap. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). For topoisomerases the conserved glutamate is believed to act as a general base in strand joining and, as a general acid in strand cleavage. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=60.19 E-value=8.3 Score=17.98 Aligned_cols=25 Identities=28% Similarity=0.413 Sum_probs=21.5 Q ss_pred CCCEEEEECCCCCCHHHHHHHHHHH Q ss_conf 2518999779877146889999999 Q gi|254780222|r 535 YEKVIIMTDADVDGAHIASLLLTFF 559 (686) Q Consensus 535 YgkIiImTDaD~DGsHI~~Lll~ff 559 (686) -..|+|.||+|--|..|.-.++.++ T Consensus 73 ad~viiAtD~DrEGE~I~~~i~~~~ 97 (123) T cd03363 73 ADEIYLATDPDREGEAIAWHLAEVL 97 (123) T ss_pred CCEEEECCCCCCCHHHHHHHHHHHH T ss_conf 9989984899824799999999995 No 95 >TIGR01925 spIIAB anti-sigma F factor; InterPro: IPR010194 This entry describes the SpoIIAB anti-sigma F factor. Sigma F regulates spore development in B subtilis. SpoIIAB binds to sigma F, preventing formation of the transcription complex at the promoter . SpoIIAA (anti-anti-sigma F factor) binds to SpoIIAB to inhibit association with sigma F, however SpoIIAB can phosphorylate SpoIIAA, causing disassociation of the SpoIIAA/B complex . The SpoIIE phosphatase dephosphorylates SpoIIAA.; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0016989 sigma factor antagonist activity, 0006468 protein amino acid phosphorylation, 0030435 sporulation, 0045892 negative regulation of transcription DNA-dependent. Probab=58.41 E-value=7.3 Score=18.34 Aligned_cols=79 Identities=25% Similarity=0.410 Sum_probs=44.4 Q ss_pred HHHHHHHHHHHHHHHCCCC---CEEEEEECCC-CEEEEEECCCCCC-CC--CCCCCCCCEEEEEEEEEEECCCCCCCCCE Q ss_conf 0244366667877761899---7699998289-8399996698434-44--17665787012345452002477777741 Q gi|254780222|r 76 HLFSEVIDNAMDEVIAGYA---NLIEVSLDGN-GFLTVVDNGRGIP-VE--NHPKFPNKSTLEIILTTLHAGGKFDNSTY 148 (686) Q Consensus 76 ~i~~EIldNavDe~~~g~~---~~I~V~i~~d-g~IsV~ddGrGIP-v~--~h~k~~~~~~~Elif~~L~aggnfdd~~y 148 (686) ..+-|-+.||+=+.-...+ =.|.++|..+ =+|+|.|+|-||- ++ ..|=++-. || T Consensus 42 ~~VSEAVTNAIIHGYE~~~~~~V~I~~~~~d~~~~~~v~D~G~GI~~lE~A~~PLyTsk--Pe----------------- 102 (137) T TIGR01925 42 TAVSEAVTNAIIHGYEENKEGVVIISATIEDDEVSITVRDEGIGIENLEEAREPLYTSK--PE----------------- 102 (137) T ss_pred HHHHHHHHCCEEECEEECCCCEEEEEEEEECCEEEEEEEECCCCHHHHHHHCCCCCCCC--CC----------------- T ss_conf 33322321205314563799778999996054899998646757233785326645799--87----------------- Q ss_pred EEECCCCCHHHHHHHHEEEEEEEEEECC Q ss_conf 5631445388987620321269999418 Q gi|254780222|r 149 EISGGLHGVGISVVNALSDELHVTVARQ 176 (686) Q Consensus 149 k~sgGlnGvGaklvNalS~~f~Vev~r~ 176 (686) .-|-|.|-.+---|=....|+...+ T Consensus 103 ---LERSGMGFTvME~FMD~~~v~S~~~ 127 (137) T TIGR01925 103 ---LERSGMGFTVMESFMDDVEVDSEKE 127 (137) T ss_pred ---CCCCCCCEEEECCCCCCEEEEECCC T ss_conf ---2206786012111245126862389 No 96 >KOG0020 consensus Probab=58.28 E-value=13 Score=16.73 Aligned_cols=155 Identities=22% Similarity=0.288 Sum_probs=80.3 Q ss_pred HHHHHHHHHHHH--HC-----------C-CCCEEEEEECCC-CEEEEEECCCCCCCCCCCCCCC---CEEEEEEEEEEEC Q ss_conf 443666678777--61-----------8-997699998289-8399996698434441766578---7012345452002 Q gi|254780222|r 78 FSEVIDNAMDEV--IA-----------G-YANLIEVSLDGN-GFLTVVDNGRGIPVENHPKFPN---KSTLEIILTTLHA 139 (686) Q Consensus 78 ~~EIldNavDe~--~~-----------g-~~~~I~V~i~~d-g~IsV~ddGrGIPv~~h~k~~~---~~~~Elif~~L~a 139 (686) +.|+|.||.|.. +| + .--.|+|..|+. .-.+|.|-|-|+.-+---+.-| .+.---.+..+.. T Consensus 100 LRELISNASDAlDKIRllaLtd~~~L~~~~el~ikIK~Dke~klLhi~DtGiGMT~edLi~NLGTIAkSGTs~Fl~Km~~ 179 (785) T KOG0020 100 LRELISNASDALDKIRLLALTDKDVLGETEELEIKIKADKEKKLLHITDTGIGMTREDLIKNLGTIAKSGTSEFLEKMQD 179 (785) T ss_pred HHHHHHHHHHHHHHEEEEECCCHHHHCCCCCEEEEEEECHHHCEEEEECCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 99997313666555101111686572757126999862201073467516678529999976655511457999998536 Q ss_pred CCCCCCCCEEEECCCCCHHHHHHHHEEEEEEEEEECC-CEEEEEEEECC-CEECCCEECCCCCCCCCEEEEEEECHHHEE Q ss_conf 4777777415631445388987620321269999418-83899997279-451131121676778853999998424320 Q gi|254780222|r 140 GGKFDNSTYEISGGLHGVGISVVNALSDELHVTVARQ-NTIFSQKFSRG-IPLSPLEKKEKVRNKRGTRITFRPDPKIFG 217 (686) Q Consensus 140 ggnfdd~~yk~sgGlnGvGaklvNalS~~f~Vev~r~-gk~y~q~f~~g-~~~~~~~~i~~~~~k~GT~ItF~PD~~iFg 217 (686) ++.-.+ ...--.|.-|||.-.+=....+..|.+..| ...|..+-.-| -.+..-|.- . .-++||.|+...--+- + T Consensus 180 ~~~~~~-~~~dlIGQFGVGFYsAfLVAD~vvVtsKhNdD~QyiWESdan~FsvseDprg-~-tL~RGt~ItL~LkeEA-~ 255 (785) T KOG0020 180 SGDSEG-LMNDLIGQFGVGFYSAFLVADRVVVTSKHNDDSQYIWESDANSFSVSEDPRG-N-TLGRGTEITLYLKEEA-G 255 (785) T ss_pred CCCCHH-HHHHHHHHCCHHHHHHHHHCCEEEEEECCCCCCCEEEECCCCCEEEECCCCC-C-CCCCCCEEEEEEHHHH-H T ss_conf 432123-6888877505125652220016999732578640134046761355317888-8-5568647999865545-5 Q ss_pred CCCCCCHHHHHHHHHHHH-HCC Q ss_conf 012222899999888875-305 Q gi|254780222|r 218 DKAGFDAARLLKMTQSKA-YLS 238 (686) Q Consensus 218 ~~~~~d~~~l~~rl~~~A-~L~ 238 (686) + -+..+.|+..++..+ |.| T Consensus 256 d--yLE~dtlkeLvkkYSqFIN 275 (785) T KOG0020 256 D--YLEEDTLKELVKKYSQFIN 275 (785) T ss_pred H--HCCHHHHHHHHHHHHHHCC T ss_conf 4--3066689999999887507 No 97 >COG2972 Predicted signal transduction protein with a C-terminal ATPase domain [Signal transduction mechanisms] Probab=57.14 E-value=13 Score=16.60 Aligned_cols=48 Identities=19% Similarity=0.160 Sum_probs=21.1 Q ss_pred CCCCCCCCCEEEEE-CCCCCCHH--HHHHHHHHHHHHHHHHHHCCC---EEEECCC-EEEEE Q ss_conf 33465525189997-79877146--889999999987088863393---8993099-69997 Q gi|254780222|r 529 REEDLRYEKVIIMT-DADVDGAH--IASLLLTFFYQEMYDLIEQKH---LFLISPP-LFRIT 583 (686) Q Consensus 529 ~~~~LRYgkIiImT-DaD~DGsH--I~~Lll~ff~~~~p~Li~~g~---v~~~~tP-l~kv~ 583 (686) .++++|||--+=-. +=|.+-.| |=.|+| -+|+|+-. +..-+|+ .+.+. T Consensus 325 ~iq~~r~~~~le~~~~i~~~~~~l~~p~l~l-------qpLvENAi~hgi~~~~~~~~I~i~ 379 (456) T COG2972 325 EIQKLRIGDRLEVPLPIDEELEPLIDPKLVL-------QPLVENAIEHGIEPKRPGGSIAIS 379 (456) T ss_pred HHHHCCCCEEEEEEECCCCCCCCCCCHHHHH-------HHHHHHHHHHHHCCCCCCCEEEEE T ss_conf 9998199879999720375332023278898-------887999999862046999579999 No 98 >PRK04031 DNA primase; Provisional Probab=55.89 E-value=14 Score=16.46 Aligned_cols=124 Identities=17% Similarity=0.329 Sum_probs=62.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCHHH Q ss_conf 15889997740135688999999999965454433444221100245653335677656554037997327676421001 Q gi|254780222|r 401 RDPFDHYLVQNPIEATKLLEWVIERSEERLRRRKQREVNRKTAIRKLRLPGKLADCSQNIAHGTELFIVEGDSAGGSAKQ 480 (686) Q Consensus 401 ~~~l~~~l~~n~~~a~~I~~~~~~~a~~r~~~~k~k~~~rk~~~~~~~lpgKL~Da~~~~~~~~eL~lvEGDSA~gsAk~ 480 (686) ++-+..|+.+.....+.|++.+.+..+. . +. .-...-+|| |+-.-..--++|+|||-. T Consensus 121 kell~~~~~~~~~~~~~i~~ev~~~~~~------~-~~---~~yg~~~lp-----agp~v~~sd~iIvVEGRa------- 178 (420) T PRK04031 121 KEILKKWFDEKIPDSKEIIEEVREAVRV------E-EI---IEYGPEKLP-----AGPNVDDSDAIIVVEGRA------- 178 (420) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHH------H-HH---HEECCCCCC-----CCCCCCCCCEEEEEECHH------- T ss_conf 9999999763682199999999988603------6-44---111656789-----998978888089995658------- Q ss_pred CCCCCCEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCC-CCCCCCCCCEEEEECCCCCCHHHHHHHHHHH Q ss_conf 011222012211231314454108776322889999988256654333-3346552518999779877146889999999 Q gi|254780222|r 481 ARNRSNQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQY-REEDLRYEKVIIMTDADVDGAHIASLLLTFF 559 (686) Q Consensus 481 grdr~~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~-~~~~LRYgkIiImTDaD~DGsHI~~Lll~ff 559 (686) -+||..+. =|.|-+.+-|..+-+.- ++.+-| .+...+|-|.-|--| T Consensus 179 --------------DV~nLLk~---------GikN~ia~~G~~ip~~i~~L~~~k--~vtaF~DGDrGG~li-------- 225 (420) T PRK04031 179 --------------DVLNLLRY---------GIKNVIAVEGTSVPETIIELSKKK--TVTAFLDGDRGGELI-------- 225 (420) T ss_pred --------------HHHHHHHH---------CCCCEEEECCCCCCHHHHHHHCCC--EEEEEECCCCCHHHH-------- T ss_conf --------------89999873---------633278626888888999986386--589995388543789-------- Q ss_pred HHHHHHHHHCCCE-EEECCCEEEE Q ss_conf 9870888633938-9930996999 Q gi|254780222|r 560 YQEMYDLIEQKHL-FLISPPLFRI 582 (686) Q Consensus 560 ~~~~p~Li~~g~v-~~~~tPl~kv 582 (686) ..+|+..+.| |+|+.|-=|- T Consensus 226 ---LKELlqVadIDYVArAP~GKE 246 (420) T PRK04031 226 ---LKELLQVADIDYVARAPPGKE 246 (420) T ss_pred ---HHHHHHHCCEEEEEECCCCCC T ss_conf ---999985467359997989963 No 99 >PRK04017 hypothetical protein; Provisional Probab=53.40 E-value=15 Score=16.20 Aligned_cols=28 Identities=32% Similarity=0.433 Sum_probs=24.5 Q ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHH Q ss_conf 2518999779877146889999999987 Q gi|254780222|r 535 YEKVIIMTDADVDGAHIASLLLTFFYQE 562 (686) Q Consensus 535 YgkIiImTDaD~DGsHI~~Lll~ff~~~ 562 (686) |++||||||=|--|.-+..-|...|..+ T Consensus 65 ~~~vIlL~DwDr~G~~L~~kl~~~L~~~ 92 (132) T PRK04017 65 KKEVIILTDFDRKGEELAKKLSEYLQGY 92 (132) T ss_pred CCEEEEEEECCCCHHHHHHHHHHHHHHC T ss_conf 9839999713632179999999999868 No 100 >COG2516 Biotin synthase-related enzyme [General function prediction only] Probab=51.75 E-value=13 Score=16.61 Aligned_cols=112 Identities=21% Similarity=0.167 Sum_probs=62.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC--CCC---CCCCCCCCCCCCEEEEEEECCCCCCCCH---H Q ss_conf 7740135688999999999965454433444221100245--653---3356776565540379973276764210---0 Q gi|254780222|r 408 LVQNPIEATKLLEWVIERSEERLRRRKQREVNRKTAIRKL--RLP---GKLADCSQNIAHGTELFIVEGDSAGGSA---K 479 (686) Q Consensus 408 l~~n~~~a~~I~~~~~~~a~~r~~~~k~k~~~rk~~~~~~--~lp---gKL~Da~~~~~~~~eL~lvEGDSA~gsA---k 479 (686) +.+........++-+++++.+.. -++++|+++...+ +.- -|++++=.+..-.--|++-||.|-++.+ + T Consensus 134 l~e~~klg~d~l~V~~daa~~~~----~e~v~~~s~s~~S~e~~~~~l~~~~~~~~k~rv~ihliVglGesD~~~ve~~~ 209 (339) T COG2516 134 LEEYRKLGADYLGVAEDAANEEL----FEKVRKTSGSPHSWERYWEFLEKVAEAFGKGRVGIHLIVGLGESDKDIVETIK 209 (339) T ss_pred HHHHHHCCHHHHHHHHHHCCHHH----HHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHH T ss_conf 89988542355357887507778----99987415898718889999999999854677451579604875689999999 Q ss_pred HCC----CCCCEEEEEECCEE-CCHHHHHHHHHHCCHHHHHHHHHHCCCCCC Q ss_conf 101----12220122112313-144541087763228899999882566543 Q gi|254780222|r 480 QAR----NRSNQAVLPLRGKI-LNVASAGSEKIRNNQQIMDLVQALGCRTRS 526 (686) Q Consensus 480 ~gr----dr~~qgi~PLRGKi-LNv~~as~~ki~~N~Ei~~l~~~lGl~~g~ 526 (686) .++ ---.+|+.||+|++ .|=...+...-.++++.. -++|.|.-+ T Consensus 210 ~v~~~g~~v~Lfaf~P~~gt~me~r~~~pve~Yrk~q~a~---yli~~G~v~ 258 (339) T COG2516 210 RVRKRGGIVSLFAFTPLKGTQMENRKPPPVERYRKIQVAR---YLIGNGEVD 258 (339) T ss_pred HHHHCCCEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHH---HHHHCCCCC T ss_conf 9985586489998646556644578998689999999999---997348655 No 101 >TIGR02108 PQQ_syn_pqqB coenzyme PQQ biosynthesis protein B; InterPro: IPR011842 This entry describes coenzyme PQQ biosynthesis protein B, a gene required for the biosynthesis of pyrrolo-quinoline-quinone (coenzyme PQQ). PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases. Note that this gene appears to be required for PQQ in biosynthesis in Methylobacterium extorquens (under the name pqqG) and in Klebsiella pneumoniae but that the equivalent pqqV in Acinetobacter calcoaceticus is not necessary for heterologous expression of PQQ biosynthesis in Escherichia coli. Based on this latter finding, it is suggested that PqqB might be a transporter or a PQQ-dependent enzyme rather than a PQQ biosynthesis enzyme .; GO: 0006810 transport, 0018189 pyrroloquinoline quinone biosynthetic process. Probab=49.03 E-value=8.3 Score=17.97 Aligned_cols=24 Identities=21% Similarity=0.220 Sum_probs=11.5 Q ss_pred CCCCCCC-CEEEEEEEEEEECCCCCCC Q ss_conf 1766578-7012345452002477777 Q gi|254780222|r 120 NHPKFPN-KSTLEIILTTLHAGGKFDN 145 (686) Q Consensus 120 ~h~k~~~-~~~~Elif~~L~aggnfdd 145 (686) +||+. + + =-.-|-+++++.|--|. T Consensus 69 L~P~~-~aL-R~TpI~~V~Lt~g~iDH 93 (314) T TIGR02108 69 LHPQR-GAL-RDTPIEGVVLTDGEIDH 93 (314) T ss_pred HCCCC-CCC-CCCCEEEEEECCCCHHH T ss_conf 15777-888-65760478820662245 No 102 >TIGR01924 rsbW_low_gc anti-sigma B factor; InterPro: IPR010193 This entry describes the anti-sigma B factor also known as serine-protein kinase RsbW. Sigma B controls the general stress regulon in B subtilis and is activated by cell stresses such as stationary phase and heat shock. RsbW binds to sigma B and prevents formation of the transcription complex at the promoter . RsbV (anti-anti-sigma factor) binds to RsbW to inhibit association with sigma B, however RsbW can phosphorylate RsbV, causing disassociation of the RsbV/RsbW complex . Low ATP level or environmental stress causes the dephosphorylation of RsbV .; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0016989 sigma factor antagonist activity, 0006468 protein amino acid phosphorylation, 0045892 negative regulation of transcription DNA-dependent. Probab=48.89 E-value=18 Score=15.73 Aligned_cols=83 Identities=22% Similarity=0.391 Sum_probs=55.0 Q ss_pred HHHHHHHHHHHHHCCC--CCEEEEEEC--CCC-EEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCC--CEEE Q ss_conf 4436666787776189--976999982--898-39999669843444176657870123454520024777777--4156 Q gi|254780222|r 78 FSEVIDNAMDEVIAGY--ANLIEVSLD--GNG-FLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAGGKFDNS--TYEI 150 (686) Q Consensus 78 ~~EIldNavDe~~~g~--~~~I~V~i~--~dg-~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~aggnfdd~--~yk~ 150 (686) +-|-+.|||++|-.+- .-.|.|.++ +|- .|-|.|+|...-.|-..+ .+ |=||.+ -... T Consensus 48 VSEA~TN~V~HAYke~~nvG~~~i~F~iyEDklev~vsD~G~SFD~d~~k~--~l-------------GPyd~~~~i~~L 112 (161) T TIGR01924 48 VSEACTNAVKHAYKEEENVGEISIEFEIYEDKLEVIVSDEGDSFDLDTKKQ--EL-------------GPYDKSEDIDQL 112 (161) T ss_pred HHHHHCCCEEEEECCCCCEEEEEEEEEEEEEEEEEEEECCCCCCCHHHHHH--HC-------------CCCCCCCCHHHH T ss_conf 655530331122308983678998887887578999820786411100253--25-------------897957875432 Q ss_pred ECCCCCHHHHHHHHEEEEEEEEEECCCE Q ss_conf 3144538898762032126999941883 Q gi|254780222|r 151 SGGLHGVGISVVNALSDELHVTVARQNT 178 (686) Q Consensus 151 sgGlnGvGaklvNalS~~f~Vev~r~gk 178 (686) .-| |+|+=|.|-|=...+|.-. .|- T Consensus 113 ~eG--GLGLFLietLMD~V~~~~~-sGV 137 (161) T TIGR01924 113 REG--GLGLFLIETLMDEVKVYED-SGV 137 (161) T ss_pred HCC--CCHHHHHHCCCCEEEEEEC-CCE T ss_conf 017--7136645103761699832-860 No 103 >TIGR00032 argG argininosuccinate synthase; InterPro: IPR001518 Argininosuccinate synthase (6.3.4.5 from EC) (AS) is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate , . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterised by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity.; GO: 0004055 argininosuccinate synthase activity, 0005524 ATP binding, 0006526 arginine biosynthetic process. Probab=47.09 E-value=1.9 Score=22.35 Aligned_cols=117 Identities=15% Similarity=0.156 Sum_probs=54.7 Q ss_pred EEECHHHEECCCCCC-HHHHHHHHHHHHHC-CCCCEEEEECCCCCCCCCCCCCEE-EEEECCCCHHHHHCCCCCCCCCCC Q ss_conf 998424320012222-89999988887530-577359995164532224554112-343158612221002467766677 Q gi|254780222|r 209 FRPDPKIFGDKAGFD-AARLLKMTQSKAYL-SGRVKTCWSCDKEIAEKYGIPEKE-EFYFPGGLKTYLQTKLENRSLISS 285 (686) Q Consensus 209 F~PD~~iFg~~~~~d-~~~l~~rl~~~A~L-~~gl~i~~~~~~~~~~~~~~~~~~-~f~~~~Gi~dyv~~~~~~~~~~~~ 285 (686) -.||+++...--+.+ .+.+--|-..+.|. ..|+.+-..-++. +++.+.. --.+|+|.-+... ....+. T Consensus 138 ~~p~LkviaP~R~~~~~e~lgGR~e~~eYa~~~Gip~p~~~~K~----YSiD~Nl~grs~Ea~~LEdP~-----~~~Ppe 208 (420) T TIGR00032 138 LNPDLKVIAPWRDLNLTEELGGREEEIEYAAQKGIPVPMTKEKP----YSIDENLWGRSIEAGILEDPS-----DTEPPE 208 (420) T ss_pred HCCCCEEECCCCCHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCC----CCHHHHHHHHHHHCCCCCCCC-----CCCCCH T ss_conf 26898587473464677640594889999996688876546777----762366775555055357887-----888814 Q ss_pred CCEEEEEEECCCEEEEEEEEEEECC---CCCCE--EEEEEECCCCCC--------CEEEC-CHHHHHHHHHHHHHHHCC Q ss_conf 5203333214801558999854146---77742--788830006557--------52413-046789999998763018 Q gi|254780222|r 286 EIFTGKTEKKGTHRGTVEWAIAWCE---ENPEI--TSYCNTILTDEG--------GTHES-GLRVALTRGIKKYAELTQ 350 (686) Q Consensus 286 ~~~~~~~~~~~~~~~~veval~~~~---~~~~~--~SFVN~I~T~~G--------GTHv~-g~~~ai~k~i~~~~~~~~ 350 (686) ++ ++|.-.|-+ +.+|. +.|=.|+|..-- |.+++ .+ -|...+|+.+-.+. T Consensus 209 ~~--------------y~~t~~p~~~~~d~PE~v~I~Fe~GvPVaLnytdnktskg~~~~~~v--eLi~~aNeiaG~HG 271 (420) T TIGR00032 209 DI--------------YELTKFPIEATPDEPEVVEIDFEKGVPVALNYTDNKTSKGVSLEKPV--ELILKANEIAGKHG 271 (420) T ss_pred HH--------------HHHHCCHHHCCCCCCCEEEEEECCCCCEEECCCCCCCCCCCCCCCHH--HHHHHHHHHHCCCC T ss_conf 57--------------88630733467999716888623785323015546443773306378--99999998510078 No 104 >pfam09239 Topo-VIb_trans Topoisomerase VI B subunit, transducer. Members of this family adopt a structure consisting of a four-stranded beta-sheet backed by three alpha-helices, the last of which is over 50 amino acids long and extends from the body of the protein by several turns. This domain has been proposed to mediate intersubunit communication by structurally transducing signals from the ATP binding and hydrolysis domains to the DNA binding and cleavage domains of the gyrase holoenzyme. Probab=46.25 E-value=19 Score=15.46 Aligned_cols=78 Identities=17% Similarity=0.252 Sum_probs=50.9 Q ss_pred EEEEEEEEEECCC-----CCCEEEEEEECCCC-CCCEEECCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 5589998541467-----77427888300065-57524130467899999987630188332223322223642699997 Q gi|254780222|r 299 RGTVEWAIAWCEE-----NPEITSYCNTILTD-EGGTHESGLRVALTRGIKKYAELTQNKRAISIISDDLMISAVGILSV 372 (686) Q Consensus 299 ~~~veval~~~~~-----~~~~~SFVN~I~T~-~GGTHv~g~~~ai~k~i~~~~~~~~~kk~~~i~~~dIk~~L~~~Isv 372 (686) ...||++++|..+ ..++.=|+|-||-. +.|. .+|+++++. +..+ ..+++-..+ .-+.++|++ T Consensus 41 PF~VEvgiayGG~~~~~~~~~vlRFANRVPLly~~g~------c~itkav~~-i~W~--~Ygl~q~~~---~P~~i~Vhv 108 (160) T pfam09239 41 PFIVEVGIAYGGDIPPEGKPELLRFANRVPLLYDQGA------CVITKAVES-INWK--RYGLEQPGG---GPLVILVHV 108 (160) T ss_pred CEEEEEEEECCCCCCCCCCEEEEEEECCCCEEEECCC------CHHHHHHHH-CCHH--HCCCCCCCC---CCEEEEEEE T ss_conf 8799986001687888897248886056764551574------389999987-6733--126568999---988999996 Q ss_pred EEEHHHCCCCCCCCCC Q ss_conf 7634560764466202 Q gi|254780222|r 373 FIREPEFAGQTKTKLV 388 (686) Q Consensus 373 ~I~nP~FesQTKekL~ 388 (686) -.++--|.|-.||.+. T Consensus 109 aSt~VPfts~sKeaIa 124 (160) T pfam09239 109 ASTKVPFTSEGKEAIA 124 (160) T ss_pred EECCCCCCCCCHHHHC T ss_conf 1047785662054413 No 105 >cd00823 TopoIIB_Trans TopoIIB_Trans: Transducer domain, having a ribosomal S5 domain 2-like fold, of the type found in proteins of the type IIB family of DNA topoisomerases similar to Sulfolobus shibatae topoisomerase VI (topoVI). The sole representative of the Type IIB family is topo VI. Topo VI enzymes are heterotetramers found in archaea and plants. S. shibatae topoVI relaxes both positive and negative supercoils, and in addition has a strong decatenase activity. This transducer domain is homologous to the second domain of the DNA gyrase B subunit, which is known to be important in nucleotide hydrolysis and the transduction of structural signals from ATP-binding site to the DNA breakage/reunion regions of the enzymes. Probab=44.87 E-value=20 Score=15.32 Aligned_cols=79 Identities=15% Similarity=0.219 Sum_probs=51.2 Q ss_pred EEEEEEEEEECCC-----CCCEEEEEEECCCC-CCCEEECCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 5589998541467-----77427888300065-57524130467899999987630188332223322223642699997 Q gi|254780222|r 299 RGTVEWAIAWCEE-----NPEITSYCNTILTD-EGGTHESGLRVALTRGIKKYAELTQNKRAISIISDDLMISAVGILSV 372 (686) Q Consensus 299 ~~~veval~~~~~-----~~~~~SFVN~I~T~-~GGTHv~g~~~ai~k~i~~~~~~~~~kk~~~i~~~dIk~~L~~~Isv 372 (686) ...||++++|..+ ..++.=|+|-||-. +.|. .++++++... ..+ ..+++-.. ..-+.++|++ T Consensus 32 PF~VEvgiayGG~~~~~~~~~vlRFANRVPLly~~g~------c~itkav~~i-nW~--~Ygl~q~~---~~p~~i~Vhv 99 (151) T cd00823 32 PFIVEVGIAYGGDIPADEKVELLRFANRVPLLYDAGA------CVITKAVESI-NWK--RYGLEQPG---QGPLVVLVHV 99 (151) T ss_pred CEEEEEEEECCCCCCCCCCCEEEEEECCCCEEEECCC------CHHHHHHHHC-CCH--HCCCCCCC---CCCEEEEEEE T ss_conf 8799986000687888887158886056764651575------2689998856-811--04657899---8887999996 Q ss_pred EEEHHHCCCCCCCCCCC Q ss_conf 76345607644662025 Q gi|254780222|r 373 FIREPEFAGQTKTKLVS 389 (686) Q Consensus 373 ~I~nP~FesQTKekL~s 389 (686) -.++-.|.|-.||.+.. T Consensus 100 aSt~VPfts~sKeaIa~ 116 (151) T cd00823 100 ASTKVPFTSEGKEAIAD 116 (151) T ss_pred EECCCCCCCCCHHHHCC T ss_conf 00467856620544137 No 106 >TIGR01369 CPSaseII_lrg carbamoyl-phosphate synthase, large subunit; InterPro: IPR006275 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains . This entry represents glutamine-dependent CPSase (6.3.5.5 from EC) from prokaryotes and eukaryotes (CPSase II). ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process. Probab=44.63 E-value=19 Score=15.58 Aligned_cols=66 Identities=32% Similarity=0.491 Sum_probs=30.3 Q ss_pred HHHCCCCEEECCCCC--------------CCHHHH-HHHHHHHHHHHHHCCCCCEEE--EEEC-CCCEEEEEECCCCCCC Q ss_conf 343269815467775--------------502202-443666678777618997699--9982-8983999966984344 Q gi|254780222|r 57 PVRMRPGMYIGGTDE--------------KALHHL-FSEVIDNAMDEVIAGYANLIE--VSLD-GNGFLTVVDNGRGIPV 118 (686) Q Consensus 57 ~VrkRP~mYIGst~~--------------~GL~~i-~~EIldNavDe~~~g~~~~I~--V~i~-~dg~IsV~ddGrGIPv 118 (686) ||--||.==.|++|. +||-+- +.+||. +|.++|. +.|+ |--| .|++|+|.|==.==|+ T Consensus 164 PviVRpAftLGG~GgGiA~n~eEL~~~~~~aL~~SpI~qvL~---EkSl~GW-KE~EYEVmRD~~dNCItVCNmEN~DPm 239 (1089) T TIGR01369 164 PVIVRPAFTLGGTGGGIASNEEELKEIVERALSASPINQVLV---EKSLAGW-KEIEYEVMRDSKDNCITVCNMENFDPM 239 (1089) T ss_pred CEEECCCHHCCCCCCCEECCHHHHHHHHHHHHHHCCCCEEEE---EEECCCC-EEEEEEEEEECCCCEEEEECCCCCCCC T ss_conf 689810032388998522577899999998863088855750---1100331-788988986268788999745476898 Q ss_pred CCCCCCCCCEE Q ss_conf 41766578701 Q gi|254780222|r 119 ENHPKFPNKST 129 (686) Q Consensus 119 ~~h~k~~~~~~ 129 (686) .+| +|=|+ T Consensus 240 GVH---TGdSI 247 (1089) T TIGR01369 240 GVH---TGDSI 247 (1089) T ss_pred CEE---ECCCE T ss_conf 457---07833 No 107 >KOG1332 consensus Probab=41.71 E-value=23 Score=15.00 Aligned_cols=112 Identities=13% Similarity=0.160 Sum_probs=64.0 Q ss_pred HHCCCCCEEEEEECCCCEEE---EEECCCCCCCCCCCCCCCCEEEEEE-----EEEEECCCCCCCCC------------- Q ss_conf 76189976999982898399---9966984344417665787012345-----45200247777774------------- Q gi|254780222|r 89 VIAGYANLIEVSLDGNGFLT---VVDNGRGIPVENHPKFPNKSTLEII-----LTTLHAGGKFDNST------------- 147 (686) Q Consensus 89 ~~~g~~~~I~V~i~~dg~Is---V~ddGrGIPv~~h~k~~~~~~~Eli-----f~~L~aggnfdd~~------------- 147 (686) ++.++--+--+|--.|++|. |.+||.-.++.--. ...-++++++ ||.++||.-||..- T Consensus 17 a~lDyygkrlATcsSD~tVkIf~v~~n~~s~ll~~L~-Gh~GPVwqv~wahPk~G~iLAScsYDgkVIiWke~~g~w~k~ 95 (299) T KOG1332 17 AQLDYYGKRLATCSSDGTVKIFEVRNNGQSKLLAELT-GHSGPVWKVAWAHPKFGTILASCSYDGKVIIWKEENGRWTKA 95 (299) T ss_pred HHHHHHCCEEEEECCCCCEEEEEECCCCCCEEEEEEC-CCCCCEEEEEECCCCCCCEEEEEECCCEEEEEECCCCCHHHH T ss_conf 6653310435663278718999974888735546742-777870577641655473767762387299995689845665 Q ss_pred ----------EEEECCCCCHHHHHHHHEEEEEEEEEECCCEEEEEEEECCCEECCCEECCCCCCCCC-EEEEEEEC Q ss_conf ----------156314453889876203212699994188389999727945113112167677885-39999984 Q gi|254780222|r 148 ----------YEISGGLHGVGISVVNALSDELHVTVARQNTIFSQKFSRGIPLSPLEKKEKVRNKRG-TRITFRPD 212 (686) Q Consensus 148 ----------yk~sgGlnGvGaklvNalS~~f~Vev~r~gk~y~q~f~~g~~~~~~~~i~~~~~k~G-T~ItF~PD 212 (686) -.++.--|+||+.|..|-| ||++-..+|... ..-..++|..+- +-| +.|+|-|- T Consensus 96 ~e~~~h~~SVNsV~wapheygl~LacasS---------DG~vsvl~~~~~-g~w~t~ki~~aH-~~GvnsVswapa 160 (299) T KOG1332 96 YEHAAHSASVNSVAWAPHEYGLLLACASS---------DGKVSVLTYDSS-GGWTTSKIVFAH-EIGVNSVSWAPA 160 (299) T ss_pred HHHHHHCCCCCEECCCCCCCCEEEEEEEC---------CCCEEEEEECCC-CCCCCHHHHCCC-CCCCCEEEECCC T ss_conf 43342015300020355566427999607---------996789997478-885420101031-255533564576 No 108 >PRK05645 lipid A biosynthesis lauroyl acyltransferase; Provisional Probab=41.07 E-value=18 Score=15.62 Aligned_cols=36 Identities=17% Similarity=0.302 Sum_probs=21.3 Q ss_pred CCC-CCEEEEECCCCCCHHHHHHHHHHHHHH------HHHHHHCC Q ss_conf 552-518999779877146889999999987------08886339 Q gi|254780222|r 533 LRY-EKVIIMTDADVDGAHIASLLLTFFYQE------MYDLIEQK 570 (686) Q Consensus 533 LRY-gkIiImTDaD~DGsHI~~Lll~ff~~~------~p~Li~~g 570 (686) ||= +.|.|+.|||+-.. .|-...||-+. -|.|+... T Consensus 179 lk~g~~v~~l~DQ~~~~~--~gi~v~FFg~~a~t~~~~~~l~~~~ 221 (295) T PRK05645 179 VRKGGQVGIPADPEPAES--AGIFVPFLGTQALTSKFVPNMLAGG 221 (295) T ss_pred HHCCCCEEECCCCCCCCC--CCEEEHHCCCCCCHHHHHHHHHHHC T ss_conf 743881364578776678--8765210498110665799999717 No 109 >TIGR02343 chap_CCT_epsi T-complex protein 1, epsilon subunit; InterPro: IPR012718 Members of this eukaryotic family are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT epsilon chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding. Probab=40.07 E-value=16 Score=15.95 Aligned_cols=37 Identities=16% Similarity=0.302 Sum_probs=26.1 Q ss_pred HHHHHHHHHC---CC--CCEEEEEECCCCEEEEEECCCCCCCCC Q ss_conf 6667877761---89--976999982898399996698434441 Q gi|254780222|r 82 IDNAMDEVIA---GY--ANLIEVSLDGNGFLTVVDNGRGIPVEN 120 (686) Q Consensus 82 ldNavDe~~~---g~--~~~I~V~i~~dg~IsV~ddGrGIPv~~ 120 (686) ..-||=..++ || +++|-+ .+||-|+|+|||--|=-.| T Consensus 35 aak~vA~~l~tSLGP~G~DK~l~--s~Dgdi~vTNDGATIl~~M 76 (541) T TIGR02343 35 AAKSVASILRTSLGPKGLDKMLI--SPDGDITVTNDGATILEQM 76 (541) T ss_pred HHHHHHHHHHHCCCCCCCCCCEE--CCCCCEEEECCHHHHHHHC T ss_conf 99999999975068876620001--7898778852614576430 No 110 >pfam07994 NAD_binding_5 Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play an important role in signal transduction. Probab=38.66 E-value=13 Score=16.56 Aligned_cols=47 Identities=21% Similarity=0.249 Sum_probs=33.1 Q ss_pred EEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEEEE Q ss_conf 2788830006557524130467899999987630188332223322223642699997763 Q gi|254780222|r 315 ITSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQNKRAISIISDDLMISAVGILSVFIR 375 (686) Q Consensus 315 ~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~~kk~~~i~~~dIk~~L~~~Isv~I~ 375 (686) -++|||++|.. +..+-+| .+.++ .+++-+-++|+|.|.+.+++++.+ T Consensus 186 G~~fvN~~P~~--~a~~pal--------~ela~----~~gvpi~GdD~KTg~t~~ks~la~ 232 (389) T pfam07994 186 GCAFINGSPQN--TASVPGL--------IELAE----EKGLPIAGDDFKSGQTKTKSVLAL 232 (389) T ss_pred CCCEECCCCCC--CCCCHHH--------HHHHH----HCCCCCCCCCCCCCCCEEEHHHHH T ss_conf 99745265531--4685999--------99999----659982164011587500024568 No 111 >KOG2731 consensus Probab=36.76 E-value=19 Score=15.44 Aligned_cols=20 Identities=25% Similarity=0.237 Sum_probs=16.1 Q ss_pred CCCCCCCCCCCCCCEEEEEC Q ss_conf 54333334655251899977 Q gi|254780222|r 524 TRSQYREEDLRYEKVIIMTD 543 (686) Q Consensus 524 ~g~~~~~~~LRYgkIiImTD 543 (686) .++..+..-||||.|+||+- T Consensus 263 l~e~p~p~~lrsGdv~im~G 282 (378) T KOG2731 263 LGENPDPMTLRSGDVVIMDG 282 (378) T ss_pred CCCCCCCCCCCCCCEEEECC T ss_conf 68899830000585476334 No 112 >KOG1777 consensus Probab=35.89 E-value=20 Score=15.35 Aligned_cols=31 Identities=23% Similarity=0.368 Sum_probs=21.4 Q ss_pred HHEEECCCCHHHHCCCCEEECCCCCCCHHHH Q ss_conf 5602311210343269815467775502202 Q gi|254780222|r 47 SSIRILEGLEPVRMRPGMYIGGTDEKALHHL 77 (686) Q Consensus 47 ~~i~~L~glE~VrkRP~mYIGst~~~GL~~i 77 (686) +|.+.|.-=-||++-|..|+|++...-+-|| T Consensus 5 Es~~Ql~rGvHV~~~~~~~~~~~~~~~fD~i 35 (625) T KOG1777 5 ESFRQLYRGVHVMKNPEQFVGAANIQCFDHI 35 (625) T ss_pred HHHHHHHCCCEECCCHHHHHHHHHHHHHHHH T ss_conf 8899873662344687785203336755349 No 113 >PRK06009 flgD flagellar basal body rod modification protein; Reviewed Probab=34.80 E-value=29 Score=14.28 Aligned_cols=23 Identities=13% Similarity=0.484 Sum_probs=18.9 Q ss_pred EEEEECCCCEEEEEECCCCCCCC Q ss_conf 99998289839999669843444 Q gi|254780222|r 97 IEVSLDGNGFLTVVDNGRGIPVE 119 (686) Q Consensus 97 I~V~i~~dg~IsV~ddGrGIPv~ 119 (686) =.|.+..||.|.+.+||+-||+. T Consensus 107 ~sV~~~~dg~~a~l~dG~~v~~~ 129 (135) T PRK06009 107 KSVTIYSDGIIATLEDGKKVLLG 129 (135) T ss_pred EEEEEECCCEEEEEECCCEEEEC T ss_conf 99999389449998389889837 No 114 >pfam06882 DUF1263 Protein of unknown function (DUF1263). This family represents a conserved region located towards the C-terminus of a number proteins of unknown function that seem to be specific to Oryza sativa. Probab=33.60 E-value=21 Score=15.22 Aligned_cols=19 Identities=47% Similarity=0.766 Sum_probs=13.8 Q ss_pred CCHHHHCCCCEEECCCCCC Q ss_conf 2103432698154677755 Q gi|254780222|r 54 GLEPVRMRPGMYIGGTDEK 72 (686) Q Consensus 54 glE~VrkRP~mYIGst~~~ 72 (686) -|--|+..||||..+|+.+ T Consensus 55 ~l~mv~mkpg~~l~gt~vp 73 (108) T pfam06882 55 FLAMVAMKPGIYLTGTDVP 73 (108) T ss_pred HHHHHHCCCCEEECCCCCC T ss_conf 9998742775242267788 No 115 >PRK05157 pyrroloquinoline quinone biosynthesis protein PqqC; Provisional Probab=33.13 E-value=19 Score=15.52 Aligned_cols=11 Identities=0% Similarity=0.021 Sum_probs=4.1 Q ss_pred CHHHHHHHHHH Q ss_conf 28999998888 Q gi|254780222|r 223 DAARLLKMTQS 233 (686) Q Consensus 223 d~~~l~~rl~~ 233 (686) +.+....+|++ T Consensus 11 ~~~efe~~Lr~ 21 (242) T PRK05157 11 SPEEFEAALRA 21 (242) T ss_pred CHHHHHHHHHH T ss_conf 99999999999 No 116 >KOG1845 consensus Probab=32.91 E-value=16 Score=16.02 Aligned_cols=16 Identities=44% Similarity=0.831 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHHHCC Q ss_conf 2443666678777618 Q gi|254780222|r 77 LFSEVIDNAMDEVIAG 92 (686) Q Consensus 77 i~~EIldNavDe~~~g 92 (686) -++|+||||+||...+ T Consensus 150 a~aeLldnalDEi~~~ 165 (775) T KOG1845 150 AIAELLDNALDEITNG 165 (775) T ss_pred HHHHHCCCCCCCCCCC T ss_conf 3666504554123465 No 117 >cd03362 TOPRIM_TopoIA_TopoIII TOPRIM_TopoIA_TopoIII: The topoisomerase-primase (TORPIM) domain found in members of the type IA family of DNA topoisomerases (Topo IA) similar to topoisomerase III. Type IA DNA topoisomerases remove (relax) negative supercoils in the DNA by: cleaving one strand of the DNA duplex, covalently linking to the 5' phosphoryl end of the DNA break and, allowing the other strand of the duplex to pass through the gap. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). For topoisomerases the conserved glutamate is believed to act as a general base in strand joining and, as a general acid in strand cleavage. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=31.58 E-value=32 Score=13.93 Aligned_cols=24 Identities=33% Similarity=0.613 Sum_probs=20.9 Q ss_pred CCEEEEECCCCCCHHHHHHHHHHH Q ss_conf 518999779877146889999999 Q gi|254780222|r 536 EKVIIMTDADVDGAHIASLLLTFF 559 (686) Q Consensus 536 gkIiImTDaD~DGsHI~~Lll~ff 559 (686) ..||+.||+|--|-.|.-.++..+ T Consensus 100 d~ii~atD~DrEGE~I~~ei~~~~ 123 (151) T cd03362 100 DEIVIATDADREGELIGREILEYA 123 (151) T ss_pred CEEEECCCCCCCHHHHHHHHHHHH T ss_conf 989987788720459999999984 No 118 >KOG1413 consensus Probab=30.84 E-value=33 Score=13.89 Aligned_cols=32 Identities=22% Similarity=0.384 Sum_probs=18.4 Q ss_pred EEEEEEECHHHEECCCCCCHHHHHHHHHHHHHCC Q ss_conf 3999998424320012222899999888875305 Q gi|254780222|r 205 TRITFRPDPKIFGDKAGFDAARLLKMTQSKAYLS 238 (686) Q Consensus 205 T~ItF~PD~~iFg~~~~~d~~~l~~rl~~~A~L~ 238 (686) +.|+--|+...|... +....=++.+...+|.. T Consensus 134 ~ei~v~~~~~k~~~Y--y~IarHYkwAL~q~F~~ 165 (411) T KOG1413 134 DEISVPPRHKKFNAY--YKIARHYKWALNQLFIV 165 (411) T ss_pred CCCCCCCCCCCCCHH--HHHHHHHHHHHHHHHHH T ss_conf 541457750002316--88999999998647765 No 119 >PRK05776 DNA topoisomerase III; Provisional Probab=30.30 E-value=34 Score=13.79 Aligned_cols=12 Identities=17% Similarity=0.066 Sum_probs=6.8 Q ss_pred ECCHHHHHHHHH Q ss_conf 378999999999 Q gi|254780222|r 591 AHDEEHKQKILE 602 (686) Q Consensus 591 ~~~e~e~~~~~~ 602 (686) -|||..+-++++ T Consensus 479 ryTEatLi~~ME 490 (675) T PRK05776 479 RYTKASLLKWME 490 (675) T ss_pred CCCHHHHHHHHH T ss_conf 989999999986 No 120 >pfam11871 DUF3391 Domain of unknown function (DUF3391). This domain is functionally uncharacterized. This domain is found in bacteria. This presumed domain is typically between 122 to 139 amino acids in length. This domain is found associated with pfam01966. Probab=29.78 E-value=22 Score=15.06 Aligned_cols=11 Identities=36% Similarity=0.718 Sum_probs=4.9 Q ss_pred CCCCEEECCCC Q ss_conf 26981546777 Q gi|254780222|r 60 MRPGMYIGGTD 70 (686) Q Consensus 60 kRP~mYIGst~ 70 (686) +||||||=..+ T Consensus 10 L~vGMyV~~l~ 20 (127) T pfam11871 10 LQVGMYVSLLD 20 (127) T ss_pred CCCCCEEEECC T ss_conf 57761898678 No 121 >PRK00885 phosphoribosylamine--glycine ligase; Provisional Probab=29.72 E-value=35 Score=13.72 Aligned_cols=15 Identities=20% Similarity=0.401 Sum_probs=8.7 Q ss_pred EEEEEECCEECCHHH Q ss_conf 012211231314454 Q gi|254780222|r 487 QAVLPLRGKILNVAS 501 (686) Q Consensus 487 qgi~PLRGKiLNv~~ 501 (686) -.++.--|.+|+|-. T Consensus 370 g~l~t~GGRVl~vva 384 (424) T PRK00885 370 GKLVTNGGRVLCVTA 384 (424) T ss_pred CEEEECCCEEEEEEE T ss_conf 989977894899999 No 122 >PRK11558 hypothetical protein; Provisional Probab=29.64 E-value=35 Score=13.71 Aligned_cols=57 Identities=23% Similarity=0.486 Sum_probs=31.5 Q ss_pred CCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCE-EEEEECCCCCCCCC Q ss_conf 26981546777550220244366667877761899769999828983-99996698434441 Q gi|254780222|r 60 MRPGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVSLDGNGF-LTVVDNGRGIPVEN 120 (686) Q Consensus 60 kRP~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~i~~dg~-IsV~ddGrGIPv~~ 120 (686) .|||.|+|.. +......+||.+...+.+ |.+.-+.-+-++-|. +....+-|-.|||. T Consensus 26 v~~GVyVG~~-S~rVRd~lW~~v~~~~~~---G~avmv~~~~nEqG~~~~t~G~~~r~~vD~ 83 (97) T PRK11558 26 VRAGVYVGDV-SKRIREMIWQQVTQLAEE---GNVVMAWATNTESGFEFQTFGENRRIPVDL 83 (97) T ss_pred CCCCCEECCC-CHHHHHHHHHHHHHHCCC---CCEEEEECCCCCCCEEEEEECCCCCCEEEC T ss_conf 1687578387-879999999999966779---858999737998855899817998615856 No 123 >PRK05279 N-acetylglutamate synthase; Validated Probab=29.26 E-value=15 Score=16.20 Aligned_cols=19 Identities=11% Similarity=0.155 Sum_probs=9.1 Q ss_pred HHHHHHHHHCCC-CCEEEEE Q ss_conf 998888753057-7359995 Q gi|254780222|r 228 LKMTQSKAYLSG-RVKTCWS 246 (686) Q Consensus 228 ~~rl~~~A~L~~-gl~i~~~ 246 (686) ...++++|.|.. |+++.+. T Consensus 43 ~~~~~DIaLL~slGIr~VlV 62 (441) T PRK05279 43 SNIVNDIALLHSLGIRLVLV 62 (441) T ss_pred HHHHHHHHHHHHCCCEEEEE T ss_conf 99999999998689769998 No 124 >TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain; InterPro: IPR013428 Proteins containing this domain also contain several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (IPR013427 from INTERPRO) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) (Q44091 from SWISSPROT) shows some homolgy to this domain but lacks additional sequence regions found in this entry and belongs to a different clade of the larger family of homologues. . Probab=28.96 E-value=32 Score=13.99 Aligned_cols=15 Identities=20% Similarity=0.383 Sum_probs=8.7 Q ss_pred CCEEEEEEECHHHEE Q ss_conf 853999998424320 Q gi|254780222|r 203 RGTRITFRPDPKIFG 217 (686) Q Consensus 203 ~GT~ItF~PD~~iFg 217 (686) .|+--+|-|+=.+|. T Consensus 197 gg~~~~~~P~~~~l~ 211 (459) T TIGR02604 197 GGGLFRYNPDGGKLE 211 (459) T ss_pred CCEEEEECCCCCEEE T ss_conf 731578744485789 No 125 >pfam08874 DUF1835 Domain of unknown function (DUF1835). This family of proteins are functionally uncharacterized. Probab=28.95 E-value=21 Score=15.17 Aligned_cols=27 Identities=44% Similarity=0.492 Sum_probs=15.3 Q ss_pred EECCCCCCCCHHHCCC------CCCEEEEEECC Q ss_conf 7327676421001011------22201221123 Q gi|254780222|r 468 IVEGDSAGGSAKQARN------RSNQAVLPLRG 494 (686) Q Consensus 468 lvEGDSA~gsAk~grd------r~~qgi~PLRG 494 (686) ||=||||+|+.|++.- ++..+|=||.+ T Consensus 3 I~fg~saagsLk~al~q~Vi~~~DdlSiGPl~~ 35 (115) T pfam08874 3 ITFGDSAAGSLKQALKQKVISLPDDLSIGPLYR 35 (115) T ss_pred EECCCCHHHHHHHHHCCCEEEECCCCCCCCCCC T ss_conf 746875057999982167599546752378888 No 126 >PTZ00306 NADH-dependent fumarate reductase; Provisional Probab=28.86 E-value=33 Score=13.87 Aligned_cols=50 Identities=20% Similarity=0.307 Sum_probs=32.5 Q ss_pred HHHHHHHHHHHHCCCCCEEEE---------EEECCCCCCHHHHHHHCCCCCCEEEEEEE Q ss_conf 999999998774178860786---------85067788988998860394000799998 Q gi|254780222|r 595 EHKQKILEEFKKHGKGKIEIS---------RFKGLGEMLASQLKETAMDANKRTLLRVE 644 (686) Q Consensus 595 ~e~~~~~~~~~~~~~~k~~i~---------ryKGLGe~~~~e~~et~~~p~~r~l~~v~ 644 (686) --|.+.++.+.+.+..|+... .--|.|=++..-+...+..|..-.|+.+- T Consensus 1077 ~~~r~~l~~~~~~~~~~f~~~f~~~~p~~~w~~~v~~~~~~~~~~~~~~p~~~~~~~~~ 1135 (1167) T PTZ00306 1077 LTYRELLESYRKENPGKFKCHFVLNNPPEGWTDGVGFVDRALLQSALQPPSKDLLVAIC 1135 (1167) T ss_pred CHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHCCCCCCCCEEEEEE T ss_conf 20999999988758876478998069985543654400068786246998867179983 No 127 >KOG2333 consensus Probab=28.83 E-value=28 Score=14.33 Aligned_cols=28 Identities=29% Similarity=0.405 Sum_probs=19.4 Q ss_pred HHHHHHHHHHH-HHHCCCEEE-ECCCEEEE Q ss_conf 99999987088-863393899-30996999 Q gi|254780222|r 555 LLTFFYQEMYD-LIEQKHLFL-ISPPLFRI 582 (686) Q Consensus 555 ll~ff~~~~p~-Li~~g~v~~-~~tPl~kv 582 (686) .|.|||||-|- |++.-.-.| .+||+|.- T Consensus 543 ~lSF~~RYiPv~l~e~lpqriN~RPp~y~g 572 (614) T KOG2333 543 FLSFFHRYIPVGLLEVLPQRINDRPPLYTG 572 (614) T ss_pred HHHHHHHHCHHHHHHCCCHHHCCCCCCCCC T ss_conf 998877523088764072533368865426 No 128 >COG0550 TopA Topoisomerase IA [DNA replication, recombination, and repair] Probab=28.79 E-value=16 Score=16.07 Aligned_cols=13 Identities=23% Similarity=0.181 Sum_probs=9.2 Q ss_pred ECCHHHHHHHHHH Q ss_conf 3789999999998 Q gi|254780222|r 591 AHDEEHKQKILEE 603 (686) Q Consensus 591 ~~~e~e~~~~~~~ 603 (686) -|||..+-+++++ T Consensus 452 rytEasLvk~mE~ 464 (570) T COG0550 452 RYTEASLVKAMEK 464 (570) T ss_pred CCCHHHHHHHHHH T ss_conf 9898999999984 No 129 >TIGR00396 leuS_bact leucyl-tRNA synthetase; InterPro: IPR002302 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Leucyl tRNA synthetase (6.1.1.4 from EC) is an alpha monomer that belongs to class Ia. There are two different families of leucyl-tRNA synthetases. This family includes the eubacterial and mitochondrial synthetases. The crystal structure of leucyl-tRNA synthetase from the hyperthermophile Thermus thermophilus has an overall architecture that is similar to that of isoleucyl-tRNA synthetase, except that the putative editing domain is inserted at a different position in the primary structure. This feature is unique to prokaryote-like leucyl-tRNA synthetases, as is the presence of a novel additional flexibly inserted domain . ; GO: 0000166 nucleotide binding, 0004823 leucine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006429 leucyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=28.49 E-value=19 Score=15.55 Aligned_cols=12 Identities=25% Similarity=0.598 Sum_probs=6.9 Q ss_pred CEEEEEECCEEC Q ss_conf 201221123131 Q gi|254780222|r 486 NQAVLPLRGKIL 497 (686) Q Consensus 486 ~qgi~PLRGKiL 497 (686) .+=|+-++||++ T Consensus 854 ~~iv~~~nGK~m 865 (916) T TIGR00396 854 VEIVVQVNGKVM 865 (916) T ss_pred EEEEEEEEEEEE T ss_conf 589999742689 No 130 >pfam11626 Rap1_C Rap1 - C terminal domain. This family of proteins represents the C-terminal domain of the protein Rap-1, which plays a distinct role in silencing at the silent mating-type loci and telomeres. The Rap-1 C terminus adopts an all-helical fold. Rap1 carries out its function by recruiting the Sir3 and Sir4 proteins to chromatin via its C terminal domain. Probab=27.77 E-value=26 Score=14.53 Aligned_cols=17 Identities=41% Similarity=0.624 Sum_probs=8.6 Q ss_pred ECCCCCCCCCCCCCCCCEEEE Q ss_conf 669843444176657870123 Q gi|254780222|r 111 DNGRGIPVENHPKFPNKSTLE 131 (686) Q Consensus 111 ddGrGIPv~~h~k~~~~~~~E 131 (686) .+|+|||.++ +|+|..| T Consensus 21 ~~g~GIP~n~----pGIWT~e 37 (68) T pfam11626 21 RDGKGIPDNV----PGIWTKE 37 (68) T ss_pred HCCCCCCCCC----CCCCCCC T ss_conf 8079999999----9867523 No 131 >KOG4518 consensus Probab=27.21 E-value=38 Score=13.43 Aligned_cols=39 Identities=15% Similarity=0.090 Sum_probs=29.9 Q ss_pred CCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEE Q ss_conf 981546777550220244366667877761899769999 Q gi|254780222|r 62 PGMYIGGTDEKALHHLFSEVIDNAMDEVIAGYANLIEVS 100 (686) Q Consensus 62 P~mYIGst~~~GL~~i~~EIldNavDe~~~g~~~~I~V~ 100 (686) -+|..|++..+|--+.|.+=||-|||=+.+=-|-+|-+. T Consensus 68 d~~~~G~~svpg~~k~FR~~Ld~ai~yAkalgC~rIHlm 106 (264) T KOG4518 68 DDGFRGLASVPGAKKEFRKSLDTAIEYAKALGCCRIHLM 106 (264) T ss_pred HHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 442347434874489999889999999997099637875 No 132 >KOG3938 consensus Probab=27.15 E-value=38 Score=13.42 Aligned_cols=29 Identities=34% Similarity=0.491 Sum_probs=17.1 Q ss_pred HCCCCCEEEEEECCCC-EEEEEECCCCCCC Q ss_conf 6189976999982898-3999966984344 Q gi|254780222|r 90 IAGYANLIEVSLDGNG-FLTVVDNGRGIPV 118 (686) Q Consensus 90 ~~g~~~~I~V~i~~dg-~IsV~ddGrGIPv 118 (686) ++|..+.++|.-++|. -+||+|||-|--. T Consensus 123 vkGq~kEv~v~KsedalGlTITDNG~GyAF 152 (334) T KOG3938 123 VKGQAKEVEVVKSEDALGLTITDNGAGYAF 152 (334) T ss_pred HCCCCEEEEEEECCCCCCEEEEECCCCEEE T ss_conf 247630688985054430378507865044 No 133 >cd00223 TOPRIM_TopoIIB_SPO TOPRIM_TopoIIB_SPO: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in the type IIB family of DNA topoisomerases and Spo11. This subgroup contains proteins similar to Sulfolobus shibatae topoisomerase VI (TopoVI) and Saccharomyces cerevisiae meiotic recombination factor: Spo11. Type II DNA topoisomerases catalyze the ATP-dependent transport of one DNA duplex through another, in the process generating transient double strand breaks via covalent attachments to both DNA strands at the 5' positions. TopoVI enzymes are heterotetramers found in archaea and plants. Spo11 plays a role in generating the double strand breaks that initiate homologous recombination during meiosis. S. shibatae TopoVI relaxes both positive and negative supercoils, and in addition has a strong decatenase activity. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspart Probab=27.04 E-value=28 Score=14.32 Aligned_cols=10 Identities=20% Similarity=0.375 Sum_probs=5.2 Q ss_pred CHHHHHCCCC Q ss_conf 1222100246 Q gi|254780222|r 269 LKTYLQTKLE 278 (686) Q Consensus 269 i~dyv~~~~~ 278 (686) ...|+..+.. T Consensus 39 TR~fl~~L~~ 48 (160) T cd00223 39 TRRFLRRLHE 48 (160) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 134 >COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Probab=26.86 E-value=36 Score=13.58 Aligned_cols=83 Identities=18% Similarity=0.293 Sum_probs=43.8 Q ss_pred CEEEEEECCEECCHHHHHHHHHHCCHHHHHHHHHHCCCCCCCC----C-----CCCCC-CCCEEEEECCCCCCHHHHHHH Q ss_conf 2012211231314454108776322889999988256654333----3-----34655-251899977987714688999 Q gi|254780222|r 486 NQAVLPLRGKILNVASAGSEKIRNNQQIMDLVQALGCRTRSQY----R-----EEDLR-YEKVIIMTDADVDGAHIASLL 555 (686) Q Consensus 486 ~qgi~PLRGKiLNv~~as~~ki~~N~Ei~~l~~~lGl~~g~~~----~-----~~~LR-YgkIiImTDaD~DGsHI~~Ll 555 (686) -+-.+|+.|.||.-.+....--...+ =-.++...|.+- | -+.|| ||.++|+--- +|+|+ T Consensus 297 G~l~~PV~G~il~rFG~~~~gg~~wk-----G~vi~a~~Ga~V~A~AdG~VvyA~~l~GYG~vvIldhG--~gy~s---- 365 (420) T COG4942 297 GQLAWPVTGRILRRFGQADGGGLRWK-----GMVIGASAGATVKAIADGRVVYADWLRGYGLVVILDHG--GGYHS---- 365 (420) T ss_pred CCCCCCCCCHHHHHHCCCCCCCCCCC-----CEEEECCCCCEEEECCCCEEEECHHHCCCCEEEEEECC--CCCEE---- T ss_conf 77688878707877345678876106-----75873599982562069569954332567569999748--85378---- Q ss_pred HHHHHHHHHHHH-HCCCEEEECCCEEEE Q ss_conf 999998708886-339389930996999 Q gi|254780222|r 556 LTFFYQEMYDLI-EQKHLFLISPPLFRI 582 (686) Q Consensus 556 l~ff~~~~p~Li-~~g~v~~~~tPl~kv 582 (686) +|-++-++. ..|....+=-||=.+ T Consensus 366 ---lyg~~~~i~v~~G~~V~AGepIa~~ 390 (420) T COG4942 366 ---LYGGNQSILVNPGQFVKAGEPIALV 390 (420) T ss_pred ---EECCCCEEEECCCCEEECCCCHHHC T ss_conf ---8616642210689775569715322 No 135 >TIGR01497 kdpB K+-transporting ATPase, B subunit; InterPro: IPR006391 These sequences describe the P-type ATPase subunit of the complex responsible for translocating potassium ions across biological membranes in microbes. In Escherichia coli and other species, this complex consists of the proteins KdpA, KdpB, KdpC and KdpF. KdpB is the ATPase subunit, while KdpA is the potassium-ion translocating subunit . The function of KdpC is unclear, although it has been suggested to couple the ATPase subunit to the ion-translocating subunit , while KdpF serves to stabilize the complex . The potassium P-type ATPases have been characterised as Type IA based on a phylogenetic analysis which places this clade closest to the heavy-metal translocating ATPases (Type IB) . Others place this clade closer to the Na+/K+ antiporter type (Type IIC) based on physical characteristics . ; GO: 0008556 potassium-transporting ATPase activity, 0006813 potassium ion transport, 0016021 integral to membrane. Probab=26.78 E-value=39 Score=13.38 Aligned_cols=62 Identities=29% Similarity=0.490 Sum_probs=40.0 Q ss_pred CCCEEEEEECCCCCCCCCC----------CCCCCCEEEEEEEEEEE-CCCCCCCCCEEEECCCCCHHHHHHHHEEEEEEE Q ss_conf 8983999966984344417----------66578701234545200-247777774156314453889876203212699 Q gi|254780222|r 103 GNGFLTVVDNGRGIPVENH----------PKFPNKSTLEIILTTLH-AGGKFDNSTYEISGGLHGVGISVVNALSDELHV 171 (686) Q Consensus 103 ~dg~IsV~ddGrGIPv~~h----------~k~~~~~~~Elif~~L~-aggnfdd~~yk~sgGlnGvGaklvNalS~~f~V 171 (686) +-|=|--.-+|.=||.|-. .=-+|-|+| +|. +||-| =.|||| |-++|.|+.| T Consensus 125 kkGD~VlVeaGDvIP~DGEVi~GvASVDESAITGESAP-----V~kEsGgDf----ASVTGG--------Tri~SDwL~v 187 (675) T TIGR01497 125 KKGDVVLVEAGDVIPADGEVIEGVASVDESAITGESAP-----VLKESGGDF----ASVTGG--------TRIVSDWLVV 187 (675) T ss_pred CCCCEEEEECCCEECCCCCEECCCCCCCCCCCCCCCCC-----CCCCCCCCE----EEECCC--------CEEEEEEEEE T ss_conf 57888999638372599747645112112210257887-----210788965----464388--------4588556899 Q ss_pred EEE-CCCEEEE Q ss_conf 994-1883899 Q gi|254780222|r 172 TVA-RQNTIFS 181 (686) Q Consensus 172 ev~-r~gk~y~ 181 (686) |+. +.|+-|- T Consensus 188 e~ta~PGe~Fl 198 (675) T TIGR01497 188 EVTADPGESFL 198 (675) T ss_pred EEEECCCCCHH T ss_conf 99826997477 No 136 >pfam10340 DUF2424 Protein of unknown function (DUF2424). This is a family of proteins conserved in yeasts. The function is not known. Probab=26.62 E-value=31 Score=14.00 Aligned_cols=25 Identities=24% Similarity=0.141 Sum_probs=15.3 Q ss_pred CEEEEEEECCCCCCCEEECCHHHHHHHHHH Q ss_conf 427888300065575241304678999999 Q gi|254780222|r 314 EITSYCNTILTDEGGTHESGLRVALTRGIK 343 (686) Q Consensus 314 ~~~SFVN~I~T~~GGTHv~g~~~ai~k~i~ 343 (686) -+.-|+ .-||--++-+..++.-..+ T Consensus 123 pVLiY~-----HGGGy~L~~~~~~lv~l~~ 147 (374) T pfam10340 123 PILLYY-----HGGGFALKLIPVTLVFLNN 147 (374) T ss_pred CEEEEE-----ECCEEEECCCHHHHHHHHH T ss_conf 689998-----2775576132256569999 No 137 >PTZ00233 variable surface protein Vir18; Provisional Probab=25.81 E-value=25 Score=14.67 Aligned_cols=30 Identities=37% Similarity=0.632 Sum_probs=16.0 Q ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEE Q ss_conf 251899977987714688999999998708886339389930996999 Q gi|254780222|r 535 YEKVIIMTDADVDGAHIASLLLTFFYQEMYDLIEQKHLFLISPPLFRI 582 (686) Q Consensus 535 YgkIiImTDaD~DGsHI~~Lll~ff~~~~p~Li~~g~v~~~~tPl~kv 582 (686) +.+=||+|=|=+ .| .|||.+++.| |||.|| T Consensus 431 nn~~IIktSaPm---GI-VLLLgLLFKy--------------TPLWrv 460 (509) T PTZ00233 431 SNKNIIKTSMPI---GI-ALLLGLLFKY--------------TPLWRV 460 (509) T ss_pred CCCCEEECCCHH---HH-HHHHHHHHCC--------------CHHHHH T ss_conf 662000015018---99-9999988622--------------623665 No 138 >cd01213 tensin Tensin Phosphotyrosine-binding (PTB) domain. Tensin is a a focal adhesion protein, which contains a C-terminal SH2 domain followed by a PTB domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether. Probab=24.98 E-value=42 Score=13.16 Aligned_cols=51 Identities=25% Similarity=0.399 Sum_probs=21.2 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHHHHCCC--CCEEEEEEC-CCCEEEEEECCCCC Q ss_conf 154677755022024436666787776189--976999982-89839999669843 Q gi|254780222|r 64 MYIGGTDEKALHHLFSEVIDNAMDEVIAGY--ANLIEVSLD-GNGFLTVVDNGRGI 116 (686) Q Consensus 64 mYIGst~~~GL~~i~~EIldNavDe~~~g~--~~~I~V~i~-~dg~IsV~ddGrGI 116 (686) ||+||++.+-|-- .+.|--|+.+-+... .....|.+. .+.-||..||-|-. T Consensus 11 lYL~Svd~EsLTG--~~AV~kAv~~~~~~~~~P~~t~VhfKVS~QGITLTDn~Rk~ 64 (138) T cd01213 11 LYLGSVDTESLTG--NEAIKKAIAQCSGQAPDPQATEVHFKVSSQGITLTDNTRKK 64 (138) T ss_pred EEEEEEEEHHCCC--HHHHHHHHHHHHCCCCCCCCEEEEEEECCCCEEEEECCHHH T ss_conf 9980076100548--78999999999708999876689999757755788433013 No 139 >COG0137 ArgG Argininosuccinate synthase [Amino acid transport and metabolism] Probab=24.97 E-value=11 Score=17.23 Aligned_cols=46 Identities=17% Similarity=0.279 Sum_probs=27.6 Q ss_pred EEEEEEECC--CCCC--EEEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHC Q ss_conf 999854146--7774--27888300065575241304678999999876301 Q gi|254780222|r 302 VEWAIAWCE--ENPE--ITSYCNTILTDEGGTHESGLRVALTRGIKKYAELT 349 (686) Q Consensus 302 veval~~~~--~~~~--~~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~ 349 (686) ++|...+-+ ...+ -+.|.+|+|+.--|+-.+.. .+...+|+.+-.+ T Consensus 204 ~~~tv~p~~ap~~~e~v~I~Fe~G~PValnG~~~~~~--~li~~lN~iag~h 253 (403) T COG0137 204 YEWTVSPEDAPDEPEEVEIGFEKGVPVALNGEKLSPV--ELILELNEIAGRH 253 (403) T ss_pred HHHCCCHHHCCCCCEEEEEEEECCEEEEECCEECCHH--HHHHHHHHHHHHC T ss_conf 7613785358999728999975773788759748899--9999999987445 No 140 >TIGR01992 PTS-IIBC-Tre PTS system, trehalose-specific IIBC component; InterPro: IPR011296 This entry represents the fused enzyme II B and C components of the trehalose-specific PTS sugar transporter system . Trehalose is converted to trehalose-6-phosphate in the process of translocation into the cell. These transporters lack their own IIA domains and instead use the glucose IIA protein (IIAglc or Crr) . The exceptions to this rule are Staphylococci and Streptococci which contain their own A domain as a C-terminal fusion. This family is closely related to the sucrose transporting PTS IIBC enzymes described by the IIB component and the IIC component (IPR004719 from INTERPRO), respectively. In Escherichia coli, Bacillus subtilis and Pseudomonas fluorescens the presence of this gene is associated with the presence of trehalase which degrades T6P to glucose and glucose-6-P. Trehalose may also be transported (in Salmonella) via the mannose PTS or galactose permease systems , or (in Sinorhizobium, Thermococcus and Sulfolobus, for instance) by ABC transporters , , .; GO: 0015574 trehalose transmembrane transporter activity, 0015771 trehalose transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=24.57 E-value=30 Score=14.14 Aligned_cols=14 Identities=7% Similarity=-0.062 Sum_probs=5.4 Q ss_pred HHCCCCCEEEEECC Q ss_conf 53057735999516 Q gi|254780222|r 235 AYLSGRVKTCWSCD 248 (686) Q Consensus 235 A~L~~gl~i~~~~~ 248 (686) +.|.-.+++.++|+ T Consensus 21 tHCvTRLRfvL~D~ 34 (489) T TIGR01992 21 THCVTRLRFVLKDP 34 (489) T ss_pred HHHHHHHHHHHCCC T ss_conf 56888888861681 No 141 >PRK07198 hypothetical protein; Validated Probab=23.83 E-value=41 Score=13.20 Aligned_cols=76 Identities=17% Similarity=0.114 Sum_probs=35.5 Q ss_pred EEEEEECCCCCCCEEECCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC------CEEEEEEEEEEHHHCCCCCCCCCCC Q ss_conf 7888300065575241304678999999876301883322233222236------4269999776345607644662025 Q gi|254780222|r 316 TSYCNTILTDEGGTHESGLRVALTRGIKKYAELTQNKRAISIISDDLMI------SAVGILSVFIREPEFAGQTKTKLVS 389 (686) Q Consensus 316 ~SFVN~I~T~~GGTHv~g~~~ai~k~i~~~~~~~~~kk~~~i~~~dIk~------~L~~~Isv~I~nP~FesQTKekL~s 389 (686) +=|.-||.-.- |.-+.-++.+|.. +.....-. -++..|++- ||++.| .-+|+.-|--..+|+- T Consensus 174 VWYLPGvA~Rf-g~~E~~LRr~LFe------~TgGmyPE-LvTRpDlkvFLPPIGGlTvYi---fGdp~~~sdp~~~La~ 242 (418) T PRK07198 174 VWYLPGVAERF-GVSETDLRRTLFE------QTGGMFPE-LVTRPDLKVFLPPIGGQTVYI---FGDVTDLADPETELTC 242 (418) T ss_pred CCCCHHHHHHC-CCCHHHHHHHHHH------HCCCCCCE-EECCCCCEEECCCCCCEEEEE---ECCHHHCCCCCCEEEE T ss_conf 22462259780-9978999999998------72798841-031577517548888828999---6697767898752578 Q ss_pred HHHCCCCCCCCHHHHH Q ss_conf 2320000121015889 Q gi|254780222|r 390 LDAQRTVENALRDPFD 405 (686) Q Consensus 390 ~~~~~~v~~~i~~~l~ 405 (686) + -.-+|.=+|-|. T Consensus 243 R---vHDECNGSDVFg 255 (418) T PRK07198 243 R---VHDECNGSDVFG 255 (418) T ss_pred E---ECCCCCCCCCCC T ss_conf 8---746356763456 No 142 >TIGR02442 Cob-chelat-sub cobaltochelatase subunit; InterPro: IPR012804 Cobalamin (vitamin B12) can be complexed with metal via ATP-dependent reactions (aerobic pathway) (e.g., in Pseudomonas denitrificans) or via ATP-independent reactions (anaerobic pathway) (e.g., in Salmonella typhimurium) , . The corresponding cobalt chelatases are not homologous. Cobaltochelatase is responsible for the insertion of cobalt into the corrin ring of coenzyme B12 during its biosynthesis. Two versions have been well described. CbiK/CbiX is a monomeric, anaerobic version which acts early in the biosynthesis (IPR010388 from INTERPRO). CobNST is a trimeric, ATP-dependent, aerobic version which acts late in the biosynthesis, (IPR011953 from INTERPRO, IPR006537 from INTERPRO, IPR006538 from INTERPRO) . The two pathways differ in the point of cobalt insertion during corrin ring formation . There are apparently a number of variations on these two pathways, where the major differences seem to be concerned with the process of ring contraction . Cobaltochelatase shows similarities with magnesium chelatase, which is also a complex ATP-dependent enzyme made up of two separable components. However, unlike the situation in cobaltochelatase, one of these two components is membrane bound in magnesium chelatase . . Probab=23.66 E-value=23 Score=14.89 Aligned_cols=27 Identities=19% Similarity=0.122 Sum_probs=13.1 Q ss_pred CCHHHH-HHHHHHHHHCCCCCEEEEECC Q ss_conf 228999-998888753057735999516 Q gi|254780222|r 222 FDAARL-LKMTQSKAYLSGRVKTCWSCD 248 (686) Q Consensus 222 ~d~~~l-~~rl~~~A~L~~gl~i~~~~~ 248 (686) +-.|++ .+-.+.+|.|++--.|+-.|- T Consensus 275 hRAdi~~~raArAlAAl~GR~~V~~eDv 302 (688) T TIGR02442 275 HRADIVMARAARALAALDGRRRVTAEDV 302 (688) T ss_pred CHHHHHHHHHHHHHHHHCCCCEECHHHH T ss_conf 2599999999999987728853537789 No 143 >cd03361 TOPRIM_TopoIA_RevGyr TopoIA_RevGyr : The topoisomerase-primase (TORPIM) domain found in members of the type IA family of DNA topoisomerases (Topo IA) similar to the ATP-dependent reverse gyrase found in archaea and thermophilic bacteria. Type IA DNA topoisomerases remove (relax) negative supercoils in the DNA by: cleaving one strand of the DNA duplex, covalently linking to the 5' phosphoryl end of the DNA break and, allowing the other strand of the duplex to pass through the gap. Reverse gyrase is also able to insert positive supercoils in the presence of ATP and negative supercoils in the presence of AMPPNP. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD). For topoisomerases the conserved glutamate is believed to act as a general base in strand joining and, as a general acid in strand cleavage. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function. Probab=21.95 E-value=48 Score=12.77 Aligned_cols=26 Identities=31% Similarity=0.410 Sum_probs=21.1 Q ss_pred CCCEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 25189997798771468899999999 Q gi|254780222|r 535 YEKVIIMTDADVDGAHIASLLLTFFY 560 (686) Q Consensus 535 YgkIiImTDaD~DGsHI~~Lll~ff~ 560 (686) -..|+|.||.|--|-.|.=-++.++- T Consensus 119 ad~v~lATD~DREGEaIa~hi~~~l~ 144 (170) T cd03361 119 VDEVLIATDPDTEGEKIAWDVYLALR 144 (170) T ss_pred CCEEEECCCCCCCHHHHHHHHHHHHC T ss_conf 89799857998222299999999955 No 144 >TIGR00022 TIGR00022 conserved hypothetical protein; InterPro: IPR004375 This family consists of conserved hypothetical proteins, about 150 amino acids in length, with no known function. The family is restricted to the bacteria. It includes three members in Escherichia coli K12 and three in Streptococcus pneumoniae.. Probab=21.95 E-value=37 Score=13.52 Aligned_cols=11 Identities=27% Similarity=0.431 Sum_probs=4.7 Q ss_pred CHHHHHHHHHH Q ss_conf 02202443666 Q gi|254780222|r 73 ALHHLFSEVID 83 (686) Q Consensus 73 GL~~i~~EIld 83 (686) ||+.-|.++|| T Consensus 14 GLp~~~~~a~d 24 (156) T TIGR00022 14 GLPPALAKAID 24 (156) T ss_pred CCHHHHHHHHH T ss_conf 75389999999 No 145 >PRK08294 phenol 2-monooxygenase; Provisional Probab=21.84 E-value=23 Score=14.95 Aligned_cols=17 Identities=24% Similarity=0.419 Sum_probs=11.1 Q ss_pred CHHHCCCCCCCCHHHHH Q ss_conf 52320000121015889 Q gi|254780222|r 389 SLDAQRTVENALRDPFD 405 (686) Q Consensus 389 s~~~~~~v~~~i~~~l~ 405 (686) +|..-.++..-+.|.+. T Consensus 353 sP~gGqGMNtgiqDA~N 369 (634) T PRK08294 353 SAKAGQGMNVSMQDGFN 369 (634) T ss_pred CCCCCCCCCCCCCCHHH T ss_conf 86433441342200555 No 146 >PRK11597 heat shock chaperone IbpB; Provisional Probab=20.74 E-value=50 Score=12.61 Aligned_cols=67 Identities=13% Similarity=0.119 Sum_probs=30.1 Q ss_pred EEEEEEEEE-ECCCCCCEEEEEEECCCCCCCEE-----ECCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEE Q ss_conf 558999854-14677742788830006557524-----13046789999998763018833222332222364269 Q gi|254780222|r 299 RGTVEWAIA-WCEENPEITSYCNTILTDEGGTH-----ESGLRVALTRGIKKYAELTQNKRAISIISDDLMISAVG 368 (686) Q Consensus 299 ~~~veval~-~~~~~~~~~SFVN~I~T~~GGTH-----v~g~~~ai~k~i~~~~~~~~~kk~~~i~~~dIk~~L~~ 368 (686) .+.+++|++ |+.++- -++.-+++-|..|-.. .+|+..+|.. +.|.+.-.....+.+..+.+.+||-. T Consensus 44 ~y~IeiAvaGf~k~di-~I~~~~~~LtI~g~~~~~~~~~~ylhrGIa~--r~F~r~F~L~e~veV~~A~l~nGlL~ 116 (142) T PRK11597 44 HYRITLALAGFRQEDL-DIQLEGTRLTVKGTPEQPEKEKKWLHQGLVN--QPFSLSFTLAENMEVSGATFVNGLLH 116 (142) T ss_pred EEEEEEEECCCCHHHE-EEEEECCEEEEECCCCCCCCCCEEEEECCCC--CCEEEEEECCCCEEECCEEEECCEEE T ss_conf 2999999688775778-9999789999971578876411267741203--72699999888739965497178899 No 147 >KOG1185 consensus Probab=20.54 E-value=51 Score=12.58 Aligned_cols=14 Identities=50% Similarity=0.555 Sum_probs=9.0 Q ss_pred EEEEEECCCCCCCC Q ss_conf 37997327676421 Q gi|254780222|r 464 TELFIVEGDSAGGS 477 (686) Q Consensus 464 ~eL~lvEGDSA~gs 477 (686) ...+++|||||=|. T Consensus 449 ~~V~~veGDsaFGf 462 (571) T KOG1185 449 RKVVCVEGDSAFGF 462 (571) T ss_pred CEEEEEECCCCCCC T ss_conf 73899964765571 No 148 >KOG2672 consensus Probab=20.53 E-value=51 Score=12.58 Aligned_cols=30 Identities=20% Similarity=0.174 Sum_probs=22.8 Q ss_pred HHHHHHHHHHH-CCCEEEECCCEEEEEECCE Q ss_conf 99987088863-3938993099699974897 Q gi|254780222|r 558 FFYQEMYDLIE-QKHLFLISPPLFRITQGTK 587 (686) Q Consensus 558 ff~~~~p~Li~-~g~v~~~~tPl~kv~~g~~ 587 (686) =++.||...=+ .||+|.+..||++-...-. T Consensus 310 ekf~~w~~~~~~lgf~y~AsgplvrSsykag 340 (360) T KOG2672 310 EKFDYWKEYGEELGFLYVASGPLVRSSYKAG 340 (360) T ss_pred HHHHHHHHHHHHCCEEEECCCCEEECHHHHH T ss_conf 8878999975514548963685442211103 No 149 >COG5257 GCD11 Translation initiation factor 2, gamma subunit (eIF-2gamma; GTPase) [Translation, ribosomal structure and biogenesis] Probab=20.31 E-value=45 Score=12.94 Aligned_cols=32 Identities=28% Similarity=0.326 Sum_probs=17.3 Q ss_pred CCCCCCCCCEEEEEEEEEEHHHCCCCCCCCCCCH Q ss_conf 3322223642699997763456076446620252 Q gi|254780222|r 357 IISDDLMISAVGILSVFIREPEFAGQTKTKLVSL 390 (686) Q Consensus 357 i~~~dIk~~L~~~Isv~I~nP~FesQTKekL~s~ 390 (686) ++++-+.++..++|.+. +|-=+-||+|-|.-- T Consensus 104 LsGAAlMDgAlLvIaAN--EpcPQPQT~EHl~Al 135 (415) T COG5257 104 LSGAALMDGALLVIAAN--EPCPQPQTREHLMAL 135 (415) T ss_pred HCCHHHHCCEEEEEECC--CCCCCCCHHHHHHHH T ss_conf 60234421538999538--989897318778877 No 150 >KOG0956 consensus Probab=20.08 E-value=47 Score=12.83 Aligned_cols=42 Identities=24% Similarity=0.194 Sum_probs=29.4 Q ss_pred CCEEEEEEC--CCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEECC Q ss_conf 976999982--89839999669843444176657870123454520024 Q gi|254780222|r 94 ANLIEVSLD--GNGFLTVVDNGRGIPVENHPKFPNKSTLEIILTTLHAG 140 (686) Q Consensus 94 ~~~I~V~i~--~dg~IsV~ddGrGIPv~~h~k~~~~~~~Elif~~L~ag 140 (686) +.++...+. +||-.--.|||-.--| ..-+|+||+-|++.||= T Consensus 60 aarvrCeLCP~kdGALKkTDn~GWAHV-----VCALYIPEVrFgNV~TM 103 (900) T KOG0956 60 AARVRCELCPHKDGALKKTDNGGWAHV-----VCALYIPEVRFGNVHTM 103 (900) T ss_pred HCCCEEECCCCCCCCEECCCCCCCEEE-----EEEEECCCEEECCCCCC T ss_conf 102013216675663020468872478-----88763141233243233 Done!