Query gi|254780229|ref|YP_003064642.1| hypothetical protein CLIBASIA_00570 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 1775 No_of_seqs 820 out of 1265 Neff 10.5 Searched_HMMs 39220 Date Tue May 24 01:04:06 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780229.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG1112 Superfamily I DNA and 100.0 1.7E-34 4.4E-39 374.6 19.1 287 1323-1648 451-754 (767) 2 KOG1803 consensus 100.0 1.2E-26 3.1E-31 292.8 17.7 247 1360-1651 357-629 (649) 3 KOG1805 consensus 99.9 4.5E-23 1.2E-27 255.5 15.5 242 1341-1621 777-1055(1100) 4 KOG1802 consensus 99.9 1.9E-21 4.9E-26 238.5 15.4 245 1360-1647 567-833 (935) 5 KOG1801 consensus 99.7 1.5E-16 3.9E-21 187.5 12.0 235 1359-1623 534-790 (827) 6 KOG1807 consensus 99.6 6.6E-13 1.7E-17 149.5 20.5 241 1363-1640 722-973 (1025) 7 TIGR00376 TIGR00376 DNA helica 99.6 1.2E-13 3E-18 157.4 13.7 251 1358-1648 411-698 (709) 8 KOG1804 consensus 99.4 1.8E-13 4.6E-18 155.4 6.1 230 1360-1616 441-694 (775) 9 TIGR02169 SMC_prok_A chromosom 99.4 1E-06 2.7E-11 84.9 76.1 113 409-555 148-262 (1202) 10 KOG1803 consensus 99.1 1.3E-09 3.4E-14 115.1 12.0 73 399-471 180-253 (649) 11 COG1196 Smc Chromosome segrega 99.1 1.1E-05 2.9E-10 74.1 78.9 23 423-446 28-50 (1163) 12 TIGR02168 SMC_prok_B chromosom 99.1 1.6E-05 4E-10 72.7 88.3 117 420-537 23-215 (1191) 13 PRK10246 exonuclease subunit S 99.1 2.1E-05 5.3E-10 71.3 78.2 28 419-448 30-57 (1047) 14 PRK03918 chromosome segregatio 98.9 7.5E-05 1.9E-09 65.5 67.5 26 420-446 24-49 (882) 15 PRK02224 chromosome segregatio 98.8 0.00011 2.7E-09 64.0 69.8 24 422-446 26-49 (880) 16 cd01038 Endonuclease_DUF559 Do 98.7 1.3E-07 3.3E-12 94.3 9.3 92 1662-1762 13-107 (108) 17 PRK11773 uvrD DNA-dependent he 98.7 9.7E-05 2.5E-09 64.4 23.6 64 405-470 10-77 (722) 18 pfam04480 DUF559 Protein of un 98.5 7.1E-07 1.8E-11 86.6 9.2 92 1663-1763 15-109 (109) 19 COG0419 SbcC ATPase involved i 98.5 0.00059 1.5E-08 56.2 69.8 31 419-451 25-55 (908) 20 KOG1805 consensus 98.5 1E-06 2.6E-11 85.0 8.3 74 406-481 671-745 (1100) 21 cd00221 Vsr Very Short Patch R 98.5 1.2E-06 3.1E-11 84.3 8.6 82 1660-1747 15-115 (115) 22 TIGR02168 SMC_prok_B chromosom 98.5 0.00072 1.8E-08 55.3 81.7 53 479-531 170-223 (1191) 23 PRK01156 chromosome segregatio 98.5 0.00074 1.9E-08 55.1 76.3 27 420-448 24-50 (895) 24 KOG1806 consensus 98.4 1.1E-06 2.8E-11 84.7 8.1 232 1358-1621 988-1244(1320) 25 KOG1802 consensus 98.4 7.1E-07 1.8E-11 86.6 7.0 71 404-474 410-481 (935) 26 KOG0161 consensus 98.4 0.0011 2.8E-08 53.3 90.1 26 144-169 364-389 (1930) 27 PRK09767 hypothetical protein; 98.3 5.7E-06 1.5E-10 77.2 9.0 94 1663-1765 16-112 (116) 28 COG2852 Very-short-patch-repai 98.3 8E-06 2.1E-10 75.6 9.6 98 1662-1768 26-126 (129) 29 KOG1807 consensus 98.2 1.1E-05 2.9E-10 74.1 9.2 141 304-469 303-449 (1025) 30 TIGR01448 recD_rel helicase, R 98.2 1.8E-05 4.7E-10 71.9 9.5 231 1358-1628 452-754 (769) 31 TIGR00618 sbcc exonuclease Sbc 98.2 0.0023 5.9E-08 50.0 82.8 138 419-563 30-176 (1063) 32 PRK11054 helD DNA helicase IV; 98.2 0.00011 2.7E-09 63.9 13.2 207 1361-1620 430-663 (684) 33 TIGR00376 TIGR00376 DNA helica 98.2 5E-05 1.3E-09 67.3 11.3 86 401-486 199-288 (709) 34 PRK10875 recD exonuclease V su 98.2 4.8E-06 1.2E-10 77.9 5.9 54 1553-1620 528-581 (607) 35 pfam00580 UvrD-helicase UvrD/R 98.1 0.00012 3.2E-09 63.2 12.7 150 1361-1535 212-372 (494) 36 pfam00580 UvrD-helicase UvrD/R 97.9 5.5E-05 1.4E-09 66.9 6.7 63 406-470 2-68 (494) 37 PRK12377 putative replication 97.8 7.3E-05 1.9E-09 65.6 6.2 39 416-454 98-136 (248) 38 PRK10919 ATP-dependent DNA hel 97.7 0.00014 3.5E-09 62.8 6.8 63 406-470 4-70 (672) 39 PRK11054 helD DNA helicase IV; 97.6 0.00024 6.2E-09 60.2 7.2 64 404-469 196-263 (684) 40 pfam04851 ResIII Type III rest 97.6 0.00037 9.4E-09 58.3 7.7 61 407-467 6-66 (103) 41 pfam09848 DUF2075 Uncharacteri 97.6 0.0012 3E-08 53.1 10.2 262 1305-1619 39-347 (348) 42 smart00487 DEXDc DEAD-like hel 97.6 0.00071 1.8E-08 55.3 8.9 77 406-482 10-92 (201) 43 PRK10919 ATP-dependent DNA hel 97.6 0.0042 1.1E-07 47.3 12.8 60 1555-1620 551-613 (672) 44 PRK08181 transposase; Validate 97.6 0.00015 3.8E-09 62.5 5.2 46 408-454 91-141 (269) 45 PRK07952 DNA replication prote 97.6 0.00028 7.1E-09 59.6 6.6 48 407-454 76-131 (242) 46 PRK09401 reverse gyrase; Revie 97.6 0.0016 4E-08 51.8 10.4 87 407-494 81-174 (1176) 47 PRK06526 transposase; Provisio 97.6 0.00015 3.8E-09 62.4 5.2 47 408-454 87-133 (254) 48 PRK08116 hypothetical protein; 97.5 0.00032 8.1E-09 59.0 6.7 38 417-455 106-144 (262) 49 COG1110 Reverse gyrase [DNA re 97.5 0.0013 3.4E-08 52.5 9.8 87 407-495 85-179 (1187) 50 PRK09183 transposase/IS protei 97.5 0.00018 4.7E-09 61.5 5.3 45 409-454 91-136 (258) 51 pfam01695 IstB IstB-like ATP b 97.5 0.00014 3.5E-09 62.8 4.6 47 408-455 36-83 (178) 52 PRK13341 recombination factor 97.5 0.00061 1.5E-08 56.1 7.8 37 408-445 41-77 (726) 53 PRK13889 conjugal transfer rel 97.5 0.00093 2.4E-08 54.1 8.8 149 1361-1542 433-586 (992) 54 pfam00270 DEAD DEAD/DEAH box h 97.5 0.00069 1.8E-08 55.5 8.0 63 408-470 3-68 (167) 55 PRK10876 recB exonuclease V su 97.5 0.0053 1.3E-07 46.3 12.5 51 418-469 16-78 (1181) 56 KOG0994 consensus 97.5 0.019 4.8E-07 40.5 51.0 15 508-522 1168-1182(1758) 57 COG1484 DnaC DNA replication p 97.5 0.00021 5.4E-09 60.8 5.0 38 418-456 104-142 (254) 58 PRK13342 recombination factor 97.5 0.00063 1.6E-08 55.9 7.4 34 410-444 28-61 (417) 59 COG0210 UvrD Superfamily I DNA 97.5 0.00069 1.8E-08 55.4 7.5 66 405-472 3-72 (655) 60 CHL00181 cbbX CbbX; Provisiona 97.5 0.0011 2.9E-08 53.3 8.5 28 418-446 58-85 (287) 61 PRK05580 primosome assembly pr 97.4 0.0021 5.4E-08 50.4 9.7 82 405-487 169-253 (699) 62 COG3973 Superfamily I DNA and 97.4 0.0094 2.4E-07 43.7 12.8 214 1360-1619 527-745 (747) 63 KOG0996 consensus 97.4 0.024 6.1E-07 39.4 77.3 15 225-239 160-174 (1293) 64 PRK13909 putative recombinatio 97.3 0.015 3.9E-07 41.4 13.3 49 422-470 1-52 (911) 65 COG1198 PriA Primosomal protei 97.3 0.0014 3.7E-08 52.2 8.0 80 405-485 199-282 (730) 66 PRK10875 recD exonuclease V su 97.3 0.00078 2E-08 54.9 6.1 44 1359-1402 258-304 (607) 67 KOG0994 consensus 97.3 0.031 7.8E-07 38.3 56.6 20 474-493 1157-1176(1758) 68 TIGR01447 recD exodeoxyribonuc 97.3 0.00046 1.2E-08 57.3 4.7 101 1358-1458 346-475 (753) 69 PRK11773 uvrD DNA-dependent he 97.2 0.00055 1.4E-08 56.5 5.0 62 1553-1620 551-616 (722) 70 PRK10869 recombination and rep 97.2 0.034 8.5E-07 37.9 28.5 26 419-445 21-47 (553) 71 TIGR02169 SMC_prok_A chromosom 97.2 0.034 8.6E-07 37.8 76.1 77 419-496 23-152 (1202) 72 cd00009 AAA The AAA+ (ATPases 97.2 0.00074 1.9E-08 55.2 5.4 40 414-454 14-54 (151) 73 TIGR01448 recD_rel helicase, R 97.2 0.0012 3.1E-08 53.0 6.2 43 402-445 348-390 (769) 74 pfam10881 DUF2726 Protein of u 97.2 0.0023 5.9E-08 50.0 7.6 88 1660-1749 3-115 (127) 75 PRK06835 DNA replication prote 97.1 0.00094 2.4E-08 54.1 5.0 37 417-454 181-218 (330) 76 COG2255 RuvB Holliday junction 97.1 0.0017 4.4E-08 51.4 6.3 39 408-447 36-79 (332) 77 cd03115 SRP The signal recogni 97.1 0.0026 6.6E-08 49.5 7.1 52 422-473 3-57 (173) 78 cd00046 DEXDc DEAD-like helica 97.1 0.0043 1.1E-07 47.2 8.0 63 420-482 1-66 (144) 79 COG2256 MGS1 ATPase related to 97.1 0.0019 4.9E-08 50.9 6.2 38 414-454 43-80 (436) 80 COG1074 RecB ATP-dependent exo 97.0 0.029 7.5E-07 38.5 12.2 56 415-470 12-72 (1139) 81 PRK00440 rfc replication facto 97.0 0.0015 3.8E-08 52.0 5.6 26 419-445 37-62 (318) 82 PRK09361 radB DNA repair and r 97.0 0.00092 2.3E-08 54.2 4.5 39 418-456 22-60 (224) 83 PRK09302 circadian clock prote 97.0 0.015 3.9E-07 41.4 10.6 83 416-499 263-354 (501) 84 pfam00448 SRP54 SRP54-type pro 97.0 0.0028 7.1E-08 49.2 6.8 54 422-475 4-60 (196) 85 KOG0979 consensus 97.0 0.049 1.3E-06 36.1 59.4 36 416-453 38-74 (1072) 86 smart00489 DEXDc3 DEAD-like he 97.0 0.0047 1.2E-07 46.7 7.7 63 404-469 8-83 (289) 87 smart00488 DEXDc2 DEAD-like he 97.0 0.0047 1.2E-07 46.7 7.7 63 404-469 8-83 (289) 88 PRK00254 ski2-like helicase; P 97.0 0.008 2E-07 44.4 8.8 61 406-467 25-88 (717) 89 TIGR02173 cyt_kin_arch cytidyl 97.0 0.00051 1.3E-08 56.9 2.6 23 422-445 3-25 (173) 90 COG0210 UvrD Superfamily I DNA 97.0 0.044 1.1E-06 36.6 12.6 151 1359-1535 211-372 (655) 91 PRK11331 5-methylcytosine-spec 97.0 0.0016 4E-08 51.8 5.1 49 402-453 179-230 (459) 92 cd01037 Restriction_endonuclea 97.0 0.0027 6.8E-08 49.4 6.2 74 1664-1744 1-80 (80) 93 TIGR00635 ruvB Holliday juncti 96.9 0.0022 5.6E-08 50.2 5.7 39 408-447 14-57 (305) 94 PRK08939 primosomal protein Dn 96.9 0.003 7.6E-08 48.9 6.2 36 418-454 156-192 (306) 95 PRK09435 arginine/ornithine tr 96.9 0.0034 8.6E-08 48.3 6.5 39 417-455 45-85 (325) 96 PRK09302 circadian clock prote 96.9 0.0025 6.4E-08 49.6 5.8 21 1602-1622 347-367 (501) 97 cd01124 KaiC KaiC is a circadi 96.9 0.0022 5.5E-08 50.3 5.5 50 421-472 1-51 (187) 98 COG0714 MoxR-like ATPases [Gen 96.9 0.0024 6.2E-08 49.8 5.4 55 408-464 32-87 (329) 99 cd01122 GP4d_helicase GP4d_hel 96.9 0.0067 1.7E-07 45.2 7.6 54 416-469 27-81 (271) 100 smart00382 AAA ATPases associa 96.9 0.0015 3.7E-08 52.1 4.2 43 418-461 1-44 (148) 101 TIGR01447 recD exodeoxyribonuc 96.8 0.00094 2.4E-08 54.1 3.1 39 408-446 231-269 (753) 102 pfam03308 ArgK ArgK protein. T 96.8 0.0043 1.1E-07 47.2 6.4 41 415-455 23-65 (267) 103 PRK06731 flhF flagellar biosyn 96.8 0.0056 1.4E-07 46.0 7.0 54 421-474 77-133 (270) 104 PRK08084 DNA replication initi 96.8 0.052 1.3E-06 35.9 11.8 57 419-478 45-104 (235) 105 PRK05703 flhF flagellar biosyn 96.8 0.0068 1.7E-07 45.1 7.3 56 418-473 209-269 (412) 106 PRK04328 hypothetical protein; 96.8 0.0031 7.9E-08 48.6 5.5 55 417-472 22-76 (250) 107 cd01127 TrwB Bacterial conjuga 96.8 0.0023 6E-08 49.9 4.9 40 419-458 42-81 (410) 108 COG0467 RAD55 RecA-superfamily 96.8 0.0035 9E-08 48.1 5.7 51 416-466 20-70 (260) 109 PRK11057 ATP-dependent DNA hel 96.8 0.021 5.3E-07 40.0 9.6 85 408-495 29-113 (607) 110 TIGR01420 pilT_fam twitching m 96.8 0.0012 3E-08 53.1 3.1 32 415-446 122-154 (350) 111 cd01120 RecA-like_NTPases RecA 96.8 0.0037 9.5E-08 47.8 5.7 44 422-465 2-45 (165) 112 PRK01172 ski2-like helicase; P 96.8 0.0049 1.3E-07 46.6 6.3 54 404-458 22-76 (674) 113 PRK06067 flagellar accessory p 96.8 0.0027 6.8E-08 49.4 4.9 53 418-471 31-83 (241) 114 pfam05496 RuvB_N Holliday junc 96.8 0.0011 2.7E-08 53.5 2.8 24 421-445 52-75 (234) 115 KOG0250 consensus 96.8 0.072 1.8E-06 34.4 59.5 27 417-445 61-87 (1074) 116 cd01130 VirB11-like_ATPase Typ 96.7 0.0024 6.2E-08 49.8 4.6 46 414-461 20-65 (186) 117 pfam06068 TIP49 TIP49 C-termin 96.7 0.0028 7.2E-08 49.1 4.9 58 418-497 49-107 (395) 118 cd01394 radB RadB. The archaea 96.7 0.0067 1.7E-07 45.2 6.9 39 418-456 18-56 (218) 119 pfam06745 KaiC KaiC. This fami 96.7 0.0033 8.4E-08 48.4 5.3 54 417-472 17-72 (231) 120 PRK13946 shikimate kinase; Pro 96.7 0.0026 6.7E-08 49.4 4.6 39 407-446 8-46 (195) 121 PRK13531 regulatory ATPase Rav 96.7 0.002 5E-08 50.7 4.0 39 409-448 29-67 (498) 122 pfam03796 DnaB_C DnaB-like hel 96.7 0.0043 1.1E-07 47.1 5.7 52 416-467 16-68 (186) 123 PRK02362 ski2-like helicase; P 96.7 0.011 2.8E-07 43.0 7.7 62 406-468 25-88 (736) 124 TIGR03499 FlhF flagellar biosy 96.7 0.0078 2E-07 44.5 6.9 56 418-473 193-253 (282) 125 PRK13826 Dtr system oriT relax 96.7 0.011 2.7E-07 43.0 7.6 60 405-464 382-442 (1102) 126 PRK12726 flagellar biosynthesi 96.7 0.0024 6.2E-08 49.8 4.3 60 413-472 200-262 (407) 127 TIGR00750 lao LAO/AO transport 96.6 0.0066 1.7E-07 45.3 6.2 49 406-454 23-73 (333) 128 cd03279 ABC_sbcCD SbcCD and ot 96.6 0.0017 4.4E-08 51.3 3.2 27 421-449 30-56 (213) 129 cd00268 DEADc DEAD-box helicas 96.6 0.035 9E-07 37.6 9.8 76 408-483 25-107 (203) 130 pfam05970 DUF889 PIF1 helicase 96.6 0.0022 5.6E-08 50.3 3.5 40 1361-1400 74-127 (418) 131 COG1703 ArgK Putative periplas 96.6 0.0073 1.9E-07 44.8 6.1 47 408-454 38-86 (323) 132 TIGR00631 uvrb excinuclease AB 96.6 0.0046 1.2E-07 46.9 5.0 44 409-455 14-62 (667) 133 COG0497 RecN ATPase involved i 96.5 0.095 2.4E-06 33.2 30.0 55 419-478 21-92 (557) 134 pfam01078 Mg_chelatase Magnesi 96.5 0.0019 4.9E-08 50.9 2.9 47 410-464 13-59 (207) 135 PRK11176 lipid transporter ATP 96.5 0.038 9.7E-07 37.3 9.5 14 1380-1393 363-376 (581) 136 PRK06921 hypothetical protein; 96.5 0.005 1.3E-07 46.5 4.9 36 418-454 115-152 (265) 137 COG1061 SSL2 DNA or RNA helica 96.5 0.018 4.5E-07 40.8 7.7 49 1343-1392 125-178 (442) 138 PRK04195 replication factor C 96.5 0.0022 5.7E-08 50.1 3.0 24 420-444 41-64 (403) 139 KOG0989 consensus 96.5 0.0038 9.7E-08 47.8 4.1 41 402-442 34-80 (346) 140 pfam10412 TrwB_AAD_bind Type I 96.5 0.0049 1.2E-07 46.6 4.7 40 417-456 13-52 (386) 141 PRK06696 uridine kinase; Valid 96.5 0.0057 1.5E-07 45.9 5.0 41 413-454 18-61 (227) 142 PRK06995 flhF flagellar biosyn 96.5 0.005 1.3E-07 46.5 4.7 48 418-465 175-227 (404) 143 PRK00411 cdc6 cell division co 96.4 0.0069 1.8E-07 45.0 5.3 36 410-446 46-81 (394) 144 cd00984 DnaB_C DnaB helicase C 96.4 0.008 2E-07 44.4 5.6 53 416-468 10-63 (242) 145 PRK12402 replication factor C 96.4 0.0044 1.1E-07 47.0 4.2 36 409-445 26-61 (337) 146 TIGR03158 cas3_cyano CRISPR-as 96.4 0.018 4.6E-07 40.7 7.3 11 1362-1372 146-156 (357) 147 PRK11823 DNA repair protein Ra 96.4 0.02 5.2E-07 40.2 7.6 73 419-494 90-162 (454) 148 PRK11664 ATP-dependent RNA hel 96.4 0.016 4E-07 41.4 7.0 62 405-467 6-68 (812) 149 PRK13407 bchI magnesium chelat 96.4 0.0042 1.1E-07 47.3 4.1 44 402-446 10-55 (334) 150 TIGR01054 rgy reverse gyrase; 96.4 0.024 6.2E-07 39.3 7.8 86 408-494 88-183 (1843) 151 COG4581 Superfamily II RNA hel 96.4 0.035 9E-07 37.7 8.6 67 401-468 116-183 (1041) 152 pfam02562 PhoH PhoH-like prote 96.4 0.015 3.8E-07 41.6 6.7 34 1364-1397 122-158 (205) 153 pfam00437 GSPII_E Type II/IV s 96.4 0.0055 1.4E-07 46.1 4.5 45 408-453 127-173 (283) 154 PRK13909 putative recombinatio 96.4 0.0078 2E-07 44.5 5.2 19 1343-1367 602-620 (911) 155 TIGR02237 recomb_radB DNA repa 96.4 0.0044 1.1E-07 47.1 3.9 29 424-453 17-46 (223) 156 PRK03992 proteasome-activating 96.4 0.003 7.6E-08 48.8 3.0 28 416-444 163-190 (390) 157 KOG0991 consensus 96.4 0.0049 1.3E-07 46.6 4.2 60 408-469 37-101 (333) 158 COG3973 Superfamily I DNA and 96.4 0.019 4.8E-07 40.5 7.1 68 407-476 215-289 (747) 159 PRK12724 flagellar biosynthesi 96.4 0.0067 1.7E-07 45.1 4.8 53 420-472 224-280 (432) 160 TIGR00368 TIGR00368 Mg chelata 96.4 0.0017 4.4E-08 51.3 1.8 161 191-433 58-227 (505) 161 PRK06620 hypothetical protein; 96.3 0.12 3E-06 32.3 11.1 22 420-442 45-66 (214) 162 pfam07726 AAA_3 ATPase family 96.3 0.0029 7.3E-08 49.0 2.9 25 421-446 1-25 (131) 163 pfam00308 Bac_DnaA Bacterial d 96.3 0.013 3.4E-07 42.1 6.2 55 400-455 10-72 (219) 164 pfam00004 AAA ATPase family as 96.3 0.0029 7.4E-08 49.0 2.8 28 422-452 1-28 (131) 165 PRK11889 flhF flagellar biosyn 96.3 0.0064 1.6E-07 45.4 4.5 46 420-465 242-290 (436) 166 PRK09270 frcK putative fructos 96.3 0.01 2.7E-07 43.2 5.5 31 416-447 29-61 (230) 167 PRK00080 ruvB Holliday junctio 96.3 0.0073 1.9E-07 44.8 4.7 40 408-448 35-79 (328) 168 PRK04863 mukB cell division pr 96.3 0.13 3.2E-06 31.9 87.2 51 221-283 82-133 (1486) 169 PRK13765 ATP-dependent proteas 96.3 0.0076 1.9E-07 44.6 4.7 43 402-445 33-75 (637) 170 TIGR01241 FtsH_fam ATP-depende 96.3 0.0023 5.9E-08 50.0 2.1 62 414-496 87-150 (505) 171 COG0606 Predicted ATPase with 96.3 0.003 7.6E-08 48.9 2.6 39 402-441 181-219 (490) 172 COG0514 RecQ Superfamily II DN 96.2 0.042 1.1E-06 36.8 8.4 84 409-495 22-105 (590) 173 TIGR03015 pepcterm_ATPase puta 96.2 0.038 9.6E-07 37.4 8.1 34 410-444 32-67 (269) 174 COG1224 TIP49 DNA helicase TIP 96.2 0.0092 2.3E-07 43.7 5.0 59 417-497 63-122 (450) 175 pfam07728 AAA_5 AAA domain (dy 96.2 0.0031 7.9E-08 48.6 2.6 31 421-453 1-31 (139) 176 PRK00131 aroK shikimate kinase 96.2 0.0045 1.1E-07 47.0 3.4 28 417-445 2-29 (175) 177 PRK13851 type IV secretion sys 96.2 0.0061 1.6E-07 45.6 4.0 45 414-460 157-201 (343) 178 PRK13767 ATP-dependent helicas 96.2 0.065 1.7E-06 34.9 9.2 60 408-467 36-104 (878) 179 cd01129 PulE-GspE PulE/GspE Th 96.2 0.015 3.8E-07 41.6 5.9 49 406-454 65-115 (264) 180 PRK08074 bifunctional ATP-depe 96.2 0.026 6.7E-07 39.0 7.2 41 415-457 273-316 (932) 181 PRK12727 flagellar biosynthesi 96.2 0.0094 2.4E-07 43.6 4.8 50 416-465 345-399 (557) 182 cd03114 ArgK-like The function 96.2 0.0073 1.9E-07 44.8 4.1 32 424-455 4-35 (148) 183 COG1132 MdlB ABC-type multidru 96.1 0.046 1.2E-06 36.5 8.2 11 1382-1392 352-362 (567) 184 PRK08533 flagellar accessory p 96.1 0.017 4.3E-07 41.0 5.9 47 418-464 23-69 (230) 185 TIGR02768 TraA_Ti Ti-type conj 96.1 0.0033 8.3E-08 48.4 2.2 40 1361-1400 501-544 (888) 186 PRK09862 putative ATP-dependen 96.1 0.005 1.3E-07 46.5 3.1 47 408-462 199-245 (506) 187 PRK13900 type IV secretion sys 96.1 0.0063 1.6E-07 45.5 3.7 45 414-460 155-199 (332) 188 PRK08727 hypothetical protein; 96.1 0.025 6.5E-07 39.1 6.8 54 406-459 25-81 (233) 189 PRK11747 dinG ATP-dependent DN 96.1 0.016 4.1E-07 41.2 5.8 47 411-460 35-88 (697) 190 PRK06761 hypothetical protein; 96.1 0.0087 2.2E-07 44.0 4.3 37 419-456 2-39 (281) 191 KOG0743 consensus 96.1 0.0033 8.5E-08 48.3 2.1 25 419-444 235-259 (457) 192 PRK13833 conjugal transfer pro 96.1 0.011 2.8E-07 42.9 4.8 47 414-462 139-188 (323) 193 smart00763 AAA_PrkA PrkA AAA d 96.1 0.0053 1.3E-07 46.3 3.1 25 420-445 79-103 (361) 194 PRK03839 putative kinase; Prov 96.1 0.0061 1.5E-07 45.6 3.4 30 422-454 3-32 (180) 195 PRK11448 hsdR type I restricti 96.1 0.043 1.1E-06 36.8 7.7 60 408-467 420-486 (1126) 196 PRK10590 ATP-dependent RNA hel 96.1 0.1 2.5E-06 33.0 9.6 34 1341-1374 126-162 (457) 197 PRK00698 tmk thymidylate kinas 96.1 0.011 2.8E-07 43.0 4.6 39 417-456 1-40 (204) 198 cd01121 Sms Sms (bacterial rad 96.0 0.047 1.2E-06 36.4 7.8 74 418-494 81-155 (372) 199 cd01672 TMPK Thymidine monopho 96.0 0.011 2.8E-07 42.9 4.6 34 422-456 3-37 (200) 200 COG2805 PilT Tfp pilus assembl 96.0 0.007 1.8E-07 45.0 3.5 35 418-452 123-158 (353) 201 cd01126 TraG_VirD4 The TraG/Tr 96.0 0.038 9.8E-07 37.3 7.3 57 421-479 1-57 (384) 202 PRK13898 type IV secretion sys 96.0 0.12 3E-06 32.3 9.7 73 419-496 447-529 (800) 203 COG1201 Lhr Lhr-like helicases 96.0 0.024 6.1E-07 39.4 6.2 62 409-470 27-96 (814) 204 TIGR03420 DnaA_homol_Hda DnaA 96.0 0.019 4.9E-07 40.4 5.6 40 418-458 37-77 (226) 205 cd01125 repA Hexameric Replica 96.0 0.017 4.5E-07 40.8 5.3 47 422-470 4-63 (239) 206 COG0552 FtsY Signal recognitio 95.9 0.013 3.4E-07 42.1 4.6 72 423-497 143-218 (340) 207 PRK13894 conjugal transfer ATP 95.9 0.012 3E-07 42.6 4.4 46 414-461 144-192 (320) 208 PRK08903 hypothetical protein; 95.9 0.04 1E-06 37.1 7.1 37 419-456 42-79 (227) 209 PRK12723 flagellar biosynthesi 95.9 0.013 3.2E-07 42.3 4.4 55 418-472 173-234 (388) 210 TIGR03185 DNA_S_dndD DNA sulfu 95.9 0.17 4.4E-06 30.4 41.1 32 418-451 26-58 (650) 211 COG1199 DinG Rad3-related DNA 95.9 0.046 1.2E-06 36.5 7.2 63 405-469 16-85 (654) 212 COG1223 Predicted ATPase (AAA+ 95.9 0.032 8.2E-07 38.1 6.4 95 418-535 150-247 (368) 213 TIGR00634 recN DNA repair prot 95.9 0.18 4.5E-06 30.3 23.1 30 415-445 17-47 (605) 214 COG1419 FlhF Flagellar GTP-bin 95.9 0.012 3E-07 42.7 4.1 57 406-463 191-252 (407) 215 PRK13853 type IV secretion sys 95.9 0.11 2.8E-06 32.5 9.1 44 419-465 427-475 (789) 216 PRK10436 hypothetical protein; 95.9 0.025 6.3E-07 39.3 5.7 48 406-453 200-249 (461) 217 pfam12128 DUF3584 Protein of u 95.8 0.18 4.6E-06 30.2 62.9 16 431-446 196-211 (1192) 218 COG1936 Predicted nucleotide k 95.8 0.013 3.4E-07 42.1 4.2 57 421-481 2-76 (180) 219 cd01882 BMS1 Bms1. Bms1 is an 95.8 0.015 3.7E-07 41.7 4.4 60 422-481 42-112 (225) 220 TIGR00609 recB exodeoxyribonuc 95.8 0.18 4.7E-06 30.2 15.7 61 420-481 10-82 (1324) 221 CHL00081 chlI Mg-protoporyphyr 95.8 0.014 3.5E-07 42.0 4.3 48 402-450 14-63 (347) 222 PRK09087 hypothetical protein; 95.8 0.19 4.7E-06 30.1 13.6 23 420-443 45-67 (226) 223 PRK05298 excinuclease ABC subu 95.8 0.041 1E-06 37.0 6.7 44 409-455 18-66 (657) 224 PRK05642 DNA replication initi 95.8 0.069 1.8E-06 34.6 7.8 54 422-478 48-104 (234) 225 TIGR02782 TrbB_P P-type conjug 95.8 0.015 3.8E-07 41.5 4.4 55 402-460 123-182 (315) 226 TIGR03600 phage_DnaB phage rep 95.8 0.031 7.9E-07 38.2 6.0 55 416-470 191-246 (421) 227 TIGR01242 26Sp45 26S proteasom 95.8 0.0058 1.5E-07 45.8 2.2 23 419-442 156-178 (364) 228 PRK07667 uridine kinase; Provi 95.8 0.028 7.2E-07 38.6 5.7 45 409-454 2-49 (190) 229 PRK09825 idnK D-gluconate kina 95.8 0.0093 2.4E-07 43.7 3.2 28 417-445 1-28 (176) 230 PRK05057 aroK shikimate kinase 95.8 0.01 2.6E-07 43.4 3.4 28 417-445 2-29 (172) 231 PRK13766 Hef nuclease; Provisi 95.8 0.033 8.4E-07 37.9 6.1 81 402-483 13-96 (764) 232 TIGR00632 vsr DNA mismatch end 95.8 0.0087 2.2E-07 44.0 3.1 77 1663-1746 41-143 (143) 233 PRK05595 replicative DNA helic 95.8 0.036 9.1E-07 37.6 6.2 55 416-470 198-253 (444) 234 cd01428 ADK Adenylate kinase ( 95.7 0.0093 2.4E-07 43.7 3.1 22 422-444 2-23 (194) 235 PRK13830 conjugal transfer pro 95.7 0.17 4.4E-06 30.5 9.6 46 419-467 457-504 (818) 236 PRK10522 multidrug transporter 95.7 0.0073 1.9E-07 44.8 2.5 13 1381-1393 345-357 (547) 237 pfam01443 Viral_helicase1 Vira 95.7 0.015 3.8E-07 41.6 4.1 40 1361-1400 59-100 (226) 238 TIGR01074 rep ATP-dependent DN 95.7 0.056 1.4E-06 35.6 7.0 63 406-470 5-71 (677) 239 PRK13973 thymidylate kinase; P 95.7 0.017 4.5E-07 40.8 4.5 39 417-456 1-40 (216) 240 KOG0744 consensus 95.7 0.0084 2.2E-07 44.1 2.8 25 420-445 178-202 (423) 241 pfam02374 ArsA_ATPase Anion-tr 95.7 0.017 4.4E-07 40.9 4.4 36 422-458 4-40 (304) 242 COG4988 CydD ABC-type transpor 95.7 0.2 5.2E-06 29.7 10.3 45 1569-1626 475-519 (559) 243 KOG1806 consensus 95.7 0.033 8.3E-07 38.0 5.8 16 1665-1680 1142-1157(1320) 244 PRK11192 ATP-dependent RNA hel 95.7 0.078 2E-06 34.0 7.7 34 1341-1374 126-162 (417) 245 PRK11174 cysteine/glutathione 95.6 0.012 3E-07 42.6 3.4 11 1382-1392 373-383 (588) 246 cd02025 PanK Pantothenate kina 95.6 0.016 4E-07 41.3 4.0 32 423-455 3-37 (220) 247 COG0470 HolB ATPase involved i 95.6 0.051 1.3E-06 36.0 6.6 73 409-487 13-87 (325) 248 COG3727 Vsr DNA G:T-mismatch r 95.6 0.21 5.3E-06 29.6 9.7 95 1661-1764 20-134 (150) 249 PRK13768 GTPase; Provisional 95.6 0.02 5.2E-07 40.2 4.6 39 421-459 4-42 (253) 250 PRK13889 conjugal transfer rel 95.6 0.08 2E-06 34.0 7.5 60 405-464 347-407 (992) 251 PRK11819 putative ABC transpor 95.6 0.14 3.5E-06 31.5 8.7 28 415-443 29-56 (556) 252 COG1643 HrpA HrpA-like helicas 95.6 0.099 2.5E-06 33.0 7.9 76 403-478 49-128 (845) 253 KOG0733 consensus 95.6 0.0084 2.1E-07 44.1 2.4 40 418-464 222-261 (802) 254 PRK02496 adk adenylate kinase; 95.6 0.011 2.8E-07 42.9 3.0 24 420-444 2-25 (185) 255 PRK06893 DNA replication initi 95.5 0.027 6.9E-07 38.8 4.9 57 419-478 39-98 (229) 256 TIGR00041 DTMP_kinase thymidyl 95.5 0.021 5.3E-07 40.1 4.3 37 419-456 2-40 (211) 257 KOG1533 consensus 95.5 0.014 3.5E-07 41.9 3.4 34 422-456 5-39 (290) 258 PRK01297 ATP-dependent RNA hel 95.5 0.22 5.6E-06 29.4 9.5 25 409-433 111-135 (472) 259 CHL00195 ycf46 Ycf46; Provisio 95.5 0.011 2.8E-07 43.0 2.8 28 417-445 257-284 (491) 260 TIGR03029 EpsG chain length de 95.5 0.04 1E-06 37.1 5.6 58 419-476 103-163 (274) 261 cd02028 UMPK_like Uridine mono 95.5 0.023 5.9E-07 39.5 4.4 31 423-454 3-34 (179) 262 COG1204 Superfamily II helicas 95.5 0.097 2.5E-06 33.1 7.6 78 404-482 31-112 (766) 263 pfam10236 DAP3 Mitochondrial r 95.5 0.028 7.1E-07 38.7 4.8 39 417-455 21-59 (274) 264 CHL00176 ftsH cell division pr 95.5 0.013 3.4E-07 42.1 3.1 40 414-458 205-244 (631) 265 TIGR02788 VirB11 P-type DNA tr 95.4 0.015 3.8E-07 41.5 3.3 37 414-451 153-189 (328) 266 COG1222 RPT1 ATP-dependent 26S 95.4 0.011 2.7E-07 43.1 2.6 46 415-467 181-226 (406) 267 PRK11776 ATP-dependent RNA hel 95.4 0.19 4.7E-06 30.1 8.9 33 1341-1373 124-159 (459) 268 PRK07246 bifunctional ATP-depe 95.4 0.052 1.3E-06 35.9 6.1 51 406-460 247-303 (820) 269 cd02023 UMPK Uridine monophosp 95.4 0.023 5.8E-07 39.7 4.2 31 422-454 2-32 (198) 270 PRK11160 cysteine/glutathione 95.4 0.011 2.7E-07 43.0 2.6 21 1660-1680 506-528 (575) 271 pfam05729 NACHT NACHT domain. 95.4 0.017 4.4E-07 40.9 3.6 28 421-449 2-29 (165) 272 KOG0057 consensus 95.4 0.022 5.7E-07 39.7 4.2 16 1660-1675 517-532 (591) 273 PRK10536 hypothetical protein; 95.4 0.077 2E-06 34.1 6.9 34 1364-1397 179-215 (262) 274 PRK00149 dnaA chromosomal repl 95.4 0.061 1.6E-06 35.1 6.4 58 400-458 121-186 (447) 275 PRK10787 DNA-binding ATP-depen 95.4 0.014 3.7E-07 41.7 3.2 22 438-459 38-59 (784) 276 PRK13947 shikimate kinase; Pro 95.4 0.015 3.9E-07 41.4 3.3 25 420-445 2-26 (171) 277 COG0703 AroK Shikimate kinase 95.4 0.014 3.6E-07 41.8 3.1 26 419-445 2-27 (172) 278 cd03248 ABCC_TAP TAP, the Tran 95.4 0.013 3.4E-07 42.0 2.9 52 415-467 36-90 (226) 279 PRK10416 cell division protein 95.4 0.028 7.2E-07 38.7 4.5 50 423-472 299-351 (499) 280 PRK08006 replicative DNA helic 95.4 0.054 1.4E-06 35.7 6.0 57 415-471 220-277 (471) 281 TIGR03018 pepcterm_TyrKin exop 95.4 0.028 7E-07 38.8 4.5 36 418-454 34-72 (207) 282 cd03289 ABCC_CFTR2 The CFTR su 95.4 0.018 4.5E-07 40.8 3.5 38 415-453 26-63 (275) 283 KOG0738 consensus 95.4 0.012 3E-07 42.6 2.6 23 419-442 245-267 (491) 284 PRK05541 adenylylsulfate kinas 95.4 0.026 6.5E-07 39.1 4.3 33 420-453 6-41 (176) 285 COG1221 PspF Transcriptional r 95.3 0.026 6.7E-07 39.0 4.3 48 409-460 93-141 (403) 286 pfam02456 Adeno_IVa2 Adenoviru 95.3 0.014 3.6E-07 41.8 3.0 29 1587-1619 241-269 (370) 287 PRK13700 conjugal transfer pro 95.3 0.027 7E-07 38.8 4.4 36 419-454 185-220 (732) 288 TIGR01192 chvA glucan exporter 95.3 0.01 2.7E-07 43.2 2.2 35 405-440 345-381 (592) 289 pfam01580 FtsK_SpoIIIE FtsK/Sp 95.3 0.031 7.9E-07 38.2 4.7 46 418-463 37-86 (202) 290 cd01123 Rad51_DMC1_radA Rad51_ 95.3 0.028 7.1E-07 38.7 4.4 38 419-456 19-62 (235) 291 COG1074 RecB ATP-dependent exo 95.3 0.06 1.5E-06 35.2 6.1 63 437-499 6-75 (1139) 292 cd02021 GntK Gluconate kinase 95.3 0.015 3.7E-07 41.6 2.9 23 422-445 2-24 (150) 293 TIGR02746 TraC-F-type type-IV 95.3 0.033 8.3E-07 38.0 4.7 132 304-453 375-523 (900) 294 PRK10789 putative multidrug tr 95.3 0.013 3.4E-07 42.1 2.7 14 1380-1393 336-349 (569) 295 cd03277 ABC_SMC5_euk Eukaryoti 95.3 0.013 3.4E-07 42.0 2.7 34 418-452 21-55 (213) 296 COG0563 Adk Adenylate kinase a 95.3 0.014 3.6E-07 41.8 2.8 23 420-443 1-23 (178) 297 pfam07724 AAA_2 AAA domain (Cd 95.3 0.023 5.9E-07 39.5 3.9 34 420-454 4-37 (168) 298 cd03291 ABCC_CFTR1 The CFTR su 95.3 0.012 3.2E-07 42.4 2.5 31 414-445 58-88 (282) 299 cd02019 NK Nucleoside/nucleoti 95.3 0.029 7.4E-07 38.5 4.4 32 422-455 2-33 (69) 300 COG4181 Predicted ABC-type tra 95.3 0.047 1.2E-06 36.4 5.4 49 415-464 32-110 (228) 301 PRK01184 hypothetical protein; 95.3 0.061 1.5E-06 35.2 6.0 41 422-472 4-44 (183) 302 PRK03731 aroL shikimate kinase 95.3 0.018 4.6E-07 40.7 3.3 26 419-445 2-27 (172) 303 TIGR01073 pcrA ATP-dependent D 95.3 0.064 1.6E-06 35.0 6.1 185 1513-1713 585-786 (811) 304 cd03298 ABC_ThiQ_thiamine_tran 95.3 0.012 3.2E-07 42.4 2.4 49 415-464 20-69 (211) 305 KOG1969 consensus 95.2 0.11 2.7E-06 32.7 7.2 59 419-480 326-394 (877) 306 cd00550 ArsA_ATPase Oxyanion-t 95.2 0.031 7.8E-07 38.3 4.4 36 422-458 3-39 (254) 307 PRK13657 cyclic beta-1,2-gluca 95.2 0.014 3.5E-07 42.0 2.6 12 1381-1392 357-368 (585) 308 COG0513 SrmB Superfamily II DN 95.2 0.16 4E-06 30.9 8.0 24 409-432 56-79 (513) 309 cd01393 recA_like RecA is a b 95.2 0.031 7.8E-07 38.3 4.4 37 418-454 18-60 (226) 310 cd03253 ABCC_ATM1_transporter 95.2 0.014 3.5E-07 42.0 2.6 39 415-454 23-62 (236) 311 cd03260 ABC_PstB_phosphate_tra 95.2 0.023 5.9E-07 39.6 3.7 40 415-455 22-67 (227) 312 cd02035 ArsA ArsA ATPase funct 95.2 0.034 8.7E-07 37.8 4.6 36 422-458 2-38 (217) 313 cd02020 CMPK Cytidine monophos 95.2 0.016 4.2E-07 41.1 2.9 23 422-445 2-24 (147) 314 PRK11634 ATP-dependent RNA hel 95.2 0.23 6E-06 29.1 8.8 50 409-458 33-85 (629) 315 pfam07652 Flavi_DEAD Flaviviru 95.2 0.072 1.8E-06 34.4 6.2 54 418-471 1-55 (146) 316 KOG0736 consensus 95.2 0.034 8.7E-07 37.8 4.5 30 415-445 426-456 (953) 317 PRK07263 consensus 95.2 0.069 1.8E-06 34.6 6.0 55 416-470 200-255 (453) 318 PRK00279 adk adenylate kinase; 95.2 0.018 4.7E-07 40.6 3.0 23 421-444 2-24 (215) 319 cd01131 PilT Pilus retraction 95.2 0.028 7E-07 38.8 3.9 37 422-458 4-40 (198) 320 pfam06309 Torsin Torsin. This 95.2 0.06 1.5E-06 35.3 5.7 27 420-447 52-80 (127) 321 KOG0058 consensus 95.1 0.032 8.1E-07 38.1 4.2 12 1381-1392 490-501 (716) 322 TIGR00064 ftsY signal recognit 95.1 0.03 7.6E-07 38.4 4.1 73 417-492 78-156 (284) 323 PRK13826 Dtr system oriT relax 95.1 0.14 3.5E-06 31.5 7.4 29 473-501 397-425 (1102) 324 TIGR01277 thiQ thiamine ABC tr 95.1 0.015 3.8E-07 41.6 2.5 32 414-446 19-50 (213) 325 COG0556 UvrB Helicase subunit 95.1 0.057 1.5E-06 35.4 5.5 44 409-455 17-65 (663) 326 PRK13949 shikimate kinase; Pro 95.1 0.021 5.3E-07 40.1 3.2 25 420-445 2-26 (169) 327 PTZ00088 adenylate kinase 1; P 95.1 0.019 4.9E-07 40.4 3.0 24 421-445 2-25 (225) 328 PRK10790 putative multidrug tr 95.1 0.017 4.2E-07 41.1 2.6 14 1380-1393 362-375 (593) 329 TIGR01351 adk adenylate kinase 95.1 0.017 4.4E-07 40.9 2.7 24 421-445 1-24 (232) 330 pfam00910 RNA_helicase RNA hel 95.1 0.017 4.3E-07 41.0 2.7 24 422-446 1-24 (105) 331 cd03300 ABC_PotA_N PotA is an 95.1 0.016 4E-07 41.4 2.5 46 415-461 22-68 (232) 332 cd03249 ABC_MTABC3_MDL1_MDL2 M 95.1 0.016 4.1E-07 41.2 2.5 44 415-459 25-69 (238) 333 COG3839 MalK ABC-type sugar tr 95.1 0.054 1.4E-06 35.7 5.2 77 415-499 25-102 (338) 334 PRK06851 hypothetical protein; 95.1 0.037 9.4E-07 37.4 4.4 34 420-453 216-249 (368) 335 cd03296 ABC_CysA_sulfate_impor 95.1 0.012 3.2E-07 42.4 1.9 46 414-460 23-69 (239) 336 PRK04863 mukB cell division pr 95.0 0.3 7.6E-06 28.0 70.0 58 402-470 188-248 (1486) 337 PRK05563 DNA polymerase III su 95.0 0.021 5.5E-07 39.9 3.1 32 413-445 32-63 (541) 338 PRK06645 DNA polymerase III su 95.0 0.025 6.3E-07 39.2 3.4 33 412-445 36-68 (507) 339 PRK10733 hflB ATP-dependent me 95.0 0.025 6.4E-07 39.2 3.4 41 413-458 179-219 (644) 340 KOG0976 consensus 95.0 0.31 7.8E-06 27.9 55.4 25 1690-1714 1186-1213(1265) 341 TIGR02928 TIGR02928 orc1/cdc6 95.0 0.023 5.9E-07 39.6 3.2 29 416-445 40-68 (383) 342 COG1474 CDC6 Cdc6-related prot 95.0 0.032 8.2E-07 38.0 3.9 29 415-443 38-66 (366) 343 COG1136 SalX ABC-type antimicr 95.0 0.018 4.5E-07 40.7 2.6 38 415-453 27-65 (226) 344 PRK05748 replicative DNA helic 95.0 0.081 2.1E-06 33.9 5.9 54 416-469 200-254 (448) 345 PRK08840 replicative DNA helic 95.0 0.08 2E-06 34.0 5.9 55 416-470 214-269 (464) 346 PRK13721 conjugal transfer ATP 95.0 0.051 1.3E-06 36.0 4.9 37 418-454 473-509 (864) 347 PRK04040 adenylate kinase; Pro 95.0 0.049 1.2E-06 36.2 4.8 45 419-466 2-46 (189) 348 COG1205 Distinct helicase fami 95.0 0.16 4.1E-06 30.7 7.4 57 401-457 67-125 (851) 349 PRK13873 conjugal transfer ATP 95.0 0.31 8E-06 27.8 9.7 44 419-465 442-487 (815) 350 PRK08506 replicative DNA helic 94.9 0.087 2.2E-06 33.5 6.0 54 418-471 192-245 (473) 351 KOG2028 consensus 94.9 0.02 5.2E-07 40.1 2.8 36 406-443 150-185 (554) 352 PRK13948 shikimate kinase; Pro 94.9 0.025 6.4E-07 39.2 3.2 26 419-445 10-35 (182) 353 COG0125 Tmk Thymidylate kinase 94.9 0.049 1.2E-06 36.2 4.7 40 418-458 2-42 (208) 354 cd00464 SK Shikimate kinase (S 94.9 0.025 6.4E-07 39.2 3.2 24 421-445 1-24 (154) 355 TIGR00455 apsK adenylylsulfate 94.9 0.022 5.7E-07 39.7 2.9 29 417-446 15-45 (187) 356 KOG0731 consensus 94.9 0.02 5.1E-07 40.2 2.7 36 414-454 339-374 (774) 357 PRK08694 consensus 94.9 0.082 2.1E-06 33.8 5.8 55 416-470 215-270 (468) 358 TIGR02203 MsbA_lipidA lipid A 94.9 0.014 3.6E-07 41.8 1.9 27 414-441 383-409 (603) 359 COG0507 RecD ATP-dependent exo 94.9 0.031 8E-07 38.2 3.6 45 1356-1400 402-449 (696) 360 pfam08298 AAA_PrkA PrkA AAA do 94.9 0.024 6E-07 39.5 3.0 26 421-447 87-112 (358) 361 cd03224 ABC_TM1139_LivF_branch 94.9 0.061 1.6E-06 35.2 5.1 40 415-455 22-62 (222) 362 PRK03695 vitamin B12-transport 94.9 0.02 5.1E-07 40.2 2.6 46 415-461 19-64 (245) 363 cd03299 ABC_ModC_like Archeal 94.9 0.018 4.7E-07 40.6 2.4 51 415-466 21-72 (235) 364 COG3598 RepA RecA-family ATPas 94.9 0.092 2.3E-06 33.3 5.9 59 416-475 86-158 (402) 365 KOG1942 consensus 94.9 0.027 6.9E-07 38.9 3.2 56 415-492 60-115 (456) 366 PRK00771 signal recognition pa 94.8 0.13 3.3E-06 31.7 6.7 54 422-475 100-156 (433) 367 KOG3928 consensus 94.8 0.037 9.4E-07 37.5 3.9 36 419-454 179-214 (461) 368 COG0466 Lon ATP-dependent Lon 94.8 0.027 6.8E-07 38.9 3.1 30 1344-1375 570-599 (782) 369 cd03234 ABCG_White The White s 94.8 0.017 4.4E-07 40.9 2.2 42 415-457 29-74 (226) 370 KOG3347 consensus 94.8 0.029 7.4E-07 38.5 3.3 28 417-445 5-32 (176) 371 PRK09165 replicative DNA helic 94.8 0.096 2.5E-06 33.1 5.9 54 417-470 203-271 (484) 372 COG0630 VirB11 Type IV secreto 94.8 0.044 1.1E-06 36.6 4.2 51 407-459 130-181 (312) 373 PRK04537 ATP-dependent RNA hel 94.8 0.3 7.7E-06 27.9 8.5 27 409-435 36-62 (574) 374 cd03254 ABCC_Glucan_exporter_l 94.8 0.021 5.3E-07 40.1 2.5 52 415-467 25-79 (229) 375 TIGR03117 cas_csf4 CRISPR-asso 94.8 0.14 3.5E-06 31.5 6.7 44 413-458 9-57 (636) 376 PRK13891 conjugal transfer pro 94.8 0.34 8.7E-06 27.4 10.0 36 419-457 489-526 (852) 377 PRK00300 gmk guanylate kinase; 94.8 0.026 6.6E-07 39.1 2.9 27 416-443 4-30 (208) 378 PRK08154 anaerobic benzoate ca 94.8 0.057 1.4E-06 35.5 4.7 42 403-445 106-158 (304) 379 pfam03029 ATP_bind_1 Conserved 94.8 0.032 8.2E-07 38.1 3.4 36 424-459 1-36 (234) 380 COG0593 DnaA ATPase involved i 94.7 0.35 8.9E-06 27.3 8.9 64 400-464 89-160 (408) 381 TIGR03608 L_ocin_972_ABC putat 94.7 0.022 5.7E-07 39.8 2.5 40 415-455 20-60 (206) 382 PRK06904 replicative DNA helic 94.7 0.1 2.6E-06 32.9 5.9 55 416-470 218-273 (472) 383 PRK13695 putative NTPase; Prov 94.7 0.041 1E-06 37.0 3.8 29 421-450 5-34 (174) 384 TIGR01978 sufC FeS assembly AT 94.7 0.018 4.5E-07 40.8 2.0 48 414-462 21-71 (248) 385 KOG1804 consensus 94.7 0.14 3.6E-06 31.4 6.6 227 1361-1619 265-516 (775) 386 cd03246 ABCC_Protease_Secretio 94.7 0.023 5.8E-07 39.7 2.5 38 415-453 24-62 (173) 387 COG3854 SpoIIIAA ncharacterize 94.7 0.052 1.3E-06 35.9 4.3 40 416-456 133-179 (308) 388 cd03292 ABC_FtsE_transporter F 94.7 0.022 5.6E-07 39.8 2.4 38 415-453 23-61 (214) 389 pfam03266 DUF265 Protein of un 94.7 0.044 1.1E-06 36.7 3.9 32 422-454 2-35 (168) 390 PRK13644 cbiO cobalt transport 94.7 0.027 6.9E-07 38.9 2.9 40 414-454 23-63 (274) 391 PRK03846 adenylylsulfate kinas 94.7 0.054 1.4E-06 35.8 4.4 34 418-452 23-57 (198) 392 PRK09580 sufC cysteine desulfu 94.7 0.024 6E-07 39.5 2.5 45 415-460 23-70 (248) 393 cd03276 ABC_SMC6_euk Eukaryoti 94.7 0.025 6.5E-07 39.1 2.7 28 419-449 22-49 (198) 394 cd03222 ABC_RNaseL_inhibitor T 94.6 0.025 6.3E-07 39.3 2.6 37 415-452 21-58 (177) 395 PRK13764 ATPase; Provisional 94.6 0.035 8.9E-07 37.7 3.3 44 418-462 258-301 (605) 396 COG0464 SpoVK ATPases of the A 94.6 0.026 6.7E-07 39.0 2.7 37 417-458 274-310 (494) 397 PRK00625 shikimate kinase; Pro 94.6 0.031 7.8E-07 38.3 3.0 24 421-445 2-25 (173) 398 PRK13540 cytochrome c biogenes 94.6 0.027 6.9E-07 38.8 2.7 41 415-456 23-64 (200) 399 COG0468 RecA RecA/RadA recombi 94.6 0.044 1.1E-06 36.7 3.8 43 419-462 60-102 (279) 400 TIGR03574 selen_PSTK L-seryl-t 94.6 0.054 1.4E-06 35.8 4.2 33 422-455 2-35 (249) 401 cd03245 ABCC_bacteriocin_expor 94.6 0.024 6.1E-07 39.4 2.4 46 415-461 26-72 (220) 402 pfam01583 APS_kinase Adenylyls 94.6 0.056 1.4E-06 35.6 4.3 30 422-452 5-35 (157) 403 pfam06414 Zeta_toxin Zeta toxi 94.6 0.041 1.1E-06 36.9 3.6 37 417-454 10-46 (191) 404 TIGR02533 type_II_gspE general 94.6 0.039 1E-06 37.1 3.5 30 422-451 248-277 (495) 405 TIGR00958 3a01208 antigen pept 94.6 0.023 6E-07 39.5 2.3 25 1378-1402 451-475 (770) 406 KOG0734 consensus 94.6 0.025 6.5E-07 39.2 2.5 42 418-464 336-377 (752) 407 TIGR01587 cas3_core CRISPR-ass 94.6 0.25 6.4E-06 28.8 7.6 72 422-495 2-85 (424) 408 PRK13546 teichoic acids export 94.6 0.025 6.5E-07 39.1 2.5 38 414-452 45-83 (264) 409 pfam02399 Herpes_ori_bp Origin 94.5 0.22 5.5E-06 29.5 7.2 22 533-554 358-379 (829) 410 pfam05970 DUF889 PIF1 helicase 94.5 0.058 1.5E-06 35.4 4.3 35 426-460 1-35 (418) 411 cd03278 ABC_SMC_barmotin Barmo 94.5 0.036 9.1E-07 37.6 3.2 29 415-445 19-47 (197) 412 PRK11545 gntK gluconate kinase 94.5 0.032 8.2E-07 38.1 2.9 25 420-445 9-33 (177) 413 COG1618 Predicted nucleotide k 94.5 0.05 1.3E-06 36.1 3.9 27 419-446 5-31 (179) 414 PRK08082 consensus 94.5 0.13 3.3E-06 31.8 6.0 55 416-470 200-255 (453) 415 cd01136 ATPase_flagellum-secre 94.5 0.24 6.1E-06 29.0 7.3 69 410-479 60-129 (326) 416 PRK05636 replicative DNA helic 94.5 0.13 3.4E-06 31.7 6.0 56 416-471 264-320 (507) 417 cd03301 ABC_MalK_N The N-termi 94.5 0.027 7E-07 38.8 2.5 40 414-454 21-61 (213) 418 CHL00131 ycf16 sulfate ABC tra 94.5 0.025 6.4E-07 39.2 2.3 47 414-461 27-76 (252) 419 PRK13976 thymidylate kinase; P 94.5 0.059 1.5E-06 35.3 4.2 33 422-455 3-38 (202) 420 PRK08760 replicative DNA helic 94.5 0.13 3.4E-06 31.7 6.0 55 416-470 226-281 (476) 421 cd03235 ABC_Metallic_Cations A 94.5 0.029 7.3E-07 38.6 2.6 38 415-453 21-59 (213) 422 cd03213 ABCG_EPDR ABCG transpo 94.5 0.025 6.2E-07 39.3 2.2 39 415-454 31-72 (194) 423 pfam05707 Zot Zonular occluden 94.5 0.055 1.4E-06 35.7 4.0 10 1364-1373 73-82 (183) 424 COG1102 Cmk Cytidylate kinase 94.5 0.032 8.2E-07 38.1 2.8 23 422-445 3-25 (179) 425 PRK06749 replicative DNA helic 94.5 0.14 3.6E-06 31.4 6.1 54 416-469 183-236 (428) 426 PRK05506 bifunctional sulfate 94.4 0.05 1.3E-06 36.0 3.8 36 415-451 439-475 (613) 427 COG1066 Sms Predicted ATP-depe 94.4 0.063 1.6E-06 35.0 4.3 70 408-479 77-151 (456) 428 cd01128 rho_factor Transcripti 94.4 0.13 3.4E-06 31.7 5.9 52 415-467 12-68 (249) 429 PRK10771 thiQ thiamine transpo 94.4 0.028 7.2E-07 38.6 2.5 39 414-453 20-59 (233) 430 cd03369 ABCC_NFT1 Domain 2 of 94.4 0.033 8.4E-07 38.0 2.8 44 415-459 30-74 (207) 431 pfam02223 Thymidylate_kin Thym 94.4 0.045 1.2E-06 36.5 3.5 32 424-456 1-33 (186) 432 PRK13635 cbiO cobalt transport 94.4 0.03 7.7E-07 38.4 2.6 39 414-453 28-67 (279) 433 KOG2859 consensus 94.4 0.074 1.9E-06 34.3 4.6 37 418-454 37-79 (293) 434 cd03252 ABCC_Hemolysin The ABC 94.4 0.029 7.3E-07 38.6 2.4 38 415-453 24-62 (237) 435 PRK07004 replicative DNA helic 94.4 0.14 3.5E-06 31.6 5.9 54 417-470 211-265 (460) 436 KOG0354 consensus 94.4 0.17 4.4E-06 30.5 6.4 71 401-473 59-132 (746) 437 COG0529 CysC Adenylylsulfate k 94.4 0.088 2.3E-06 33.5 4.9 48 415-468 16-67 (197) 438 PRK05480 uridine kinase; Provi 94.4 0.055 1.4E-06 35.6 3.9 33 420-454 5-39 (209) 439 PRK10867 signal recognition pa 94.4 0.2 5.2E-06 29.7 6.7 57 419-475 99-160 (453) 440 cd03293 ABC_NrtD_SsuB_transpor 94.4 0.028 7.2E-07 38.7 2.3 40 414-454 25-65 (220) 441 PRK09473 oppD oligopeptide tra 94.3 0.13 3.4E-06 31.6 5.8 41 414-455 37-81 (330) 442 cd03217 ABC_FeS_Assembly ABC-t 94.3 0.024 6.2E-07 39.3 2.0 47 414-461 21-70 (200) 443 cd03251 ABCC_MsbA MsbA is an e 94.3 0.03 7.8E-07 38.3 2.5 44 415-459 24-68 (234) 444 cd03261 ABC_Org_Solvent_Resist 94.3 0.026 6.6E-07 39.1 2.1 39 414-453 21-60 (235) 445 cd03247 ABCC_cytochrome_bd The 94.3 0.08 2E-06 34.0 4.6 46 415-461 24-70 (178) 446 PRK11701 phnK phosphonates tra 94.3 0.028 7.2E-07 38.7 2.3 39 414-453 27-66 (258) 447 PRK10247 putative ABC transpor 94.3 0.033 8.4E-07 38.0 2.6 42 415-457 29-71 (225) 448 COG0396 sufC Cysteine desulfur 94.3 0.031 7.9E-07 38.2 2.4 47 415-462 26-75 (251) 449 pfam10443 RNA12 RNA12 protein. 94.3 0.051 1.3E-06 35.9 3.6 34 419-456 16-51 (428) 450 cd03237 ABC_RNaseL_inhibitor_d 94.3 0.032 8.1E-07 38.1 2.5 37 415-452 21-58 (246) 451 pfam07088 GvpD GvpD gas vesicl 94.3 0.011 2.7E-07 43.0 0.1 14 1700-1713 459-472 (484) 452 COG1123 ATPase components of v 94.3 0.15 3.9E-06 31.0 6.0 42 414-456 312-354 (539) 453 PRK10419 nikE nickel transport 94.3 0.21 5.2E-06 29.7 6.6 40 414-454 33-73 (266) 454 PRK08118 topology modulation p 94.3 0.042 1.1E-06 36.9 3.1 24 421-445 3-26 (167) 455 PRK13539 cytochrome c biogenes 94.3 0.033 8.4E-07 37.9 2.5 38 415-453 24-62 (206) 456 COG4525 TauB ABC-type taurine 94.3 0.028 7.2E-07 38.7 2.2 32 414-446 26-57 (259) 457 cd03229 ABC_Class3 This class 94.3 0.033 8.3E-07 38.0 2.5 38 415-453 22-60 (178) 458 cd03262 ABC_HisP_GlnQ_permease 94.3 0.032 8.1E-07 38.1 2.4 51 414-465 21-76 (213) 459 KOG0741 consensus 94.3 0.038 9.7E-07 37.3 2.8 27 418-445 255-281 (744) 460 PRK13632 cbiO cobalt transport 94.2 0.035 9E-07 37.7 2.6 39 415-454 32-71 (273) 461 cd03244 ABCC_MRP_domain2 Domai 94.2 0.035 8.9E-07 37.7 2.6 38 415-453 26-64 (221) 462 PRK07165 F0F1 ATP synthase sub 94.2 0.37 9.5E-06 27.0 7.9 72 410-482 134-208 (507) 463 pfam03215 Rad17 Rad17 cell cyc 94.2 0.042 1.1E-06 36.9 3.0 25 420-445 46-70 (490) 464 PRK11248 tauB taurine transpor 94.2 0.032 8.1E-07 38.1 2.4 39 414-453 22-61 (255) 465 PRK11022 dppD dipeptide transp 94.2 0.14 3.5E-06 31.4 5.6 41 414-455 28-73 (327) 466 TIGR03263 guanyl_kin guanylate 94.2 0.038 9.6E-07 37.4 2.7 24 419-443 1-24 (180) 467 cd03233 ABC_PDR_domain1 The pl 94.2 0.028 7.2E-07 38.7 2.1 39 415-454 29-71 (202) 468 cd03290 ABCC_SUR1_N The SUR do 94.2 0.034 8.6E-07 37.8 2.5 38 415-453 23-61 (218) 469 PRK13640 cbiO cobalt transport 94.2 0.037 9.3E-07 37.5 2.7 31 415-446 30-60 (283) 470 cd03232 ABC_PDR_domain2 The pl 94.2 0.031 8E-07 38.2 2.3 27 415-442 29-55 (192) 471 COG3840 ThiQ ABC-type thiamine 94.2 0.036 9.2E-07 37.5 2.6 43 415-458 21-64 (231) 472 cd03231 ABC_CcmA_heme_exporter 94.2 0.036 9.1E-07 37.6 2.6 37 415-452 22-59 (201) 473 PRK13548 hmuV hemin importer A 94.2 0.04 1E-06 37.0 2.8 39 415-454 24-63 (257) 474 pfam02534 TraG TraG/TraD famil 94.2 0.19 4.8E-06 30.0 6.2 62 420-483 45-106 (468) 475 pfam00158 Sigma54_activat Sigm 94.2 0.08 2E-06 33.9 4.3 31 414-445 17-47 (168) 476 PRK10851 sulfate/thiosulfate t 94.2 0.027 7E-07 38.8 1.9 37 415-452 24-61 (352) 477 PRK13538 cytochrome c biogenes 94.2 0.037 9.3E-07 37.5 2.6 38 415-453 23-61 (204) 478 PRK13543 cytochrome c biogenes 94.1 0.037 9.5E-07 37.4 2.6 38 415-453 33-71 (214) 479 cd03250 ABCC_MRP_domain1 Domai 94.1 0.034 8.6E-07 37.9 2.4 42 415-457 27-75 (204) 480 cd03257 ABC_NikE_OppD_transpor 94.1 0.034 8.8E-07 37.8 2.4 40 414-454 26-66 (228) 481 cd03294 ABC_Pro_Gly_Bertaine T 94.1 0.033 8.3E-07 38.0 2.3 40 414-454 45-85 (269) 482 PRK11650 ugpC glycerol-3-phosp 94.1 0.033 8.5E-07 37.9 2.3 39 415-454 26-65 (358) 483 PRK00023 cmk cytidylate kinase 94.1 0.044 1.1E-06 36.6 2.9 24 421-445 6-29 (225) 484 TIGR03375 type_I_sec_LssB type 94.1 0.031 7.8E-07 38.3 2.1 36 1359-1394 463-500 (694) 485 cd03221 ABCF_EF-3 ABCF_EF-3 E 94.1 0.037 9.4E-07 37.5 2.5 32 415-447 22-53 (144) 486 PRK12422 chromosomal replicati 94.1 0.21 5.2E-06 29.7 6.3 41 421-462 143-184 (455) 487 PRK10070 glycine betaine trans 94.1 0.029 7.5E-07 38.5 2.0 41 414-455 49-90 (400) 488 COG4096 HsdR Type I site-speci 94.1 0.14 3.5E-06 31.5 5.4 56 409-464 170-232 (875) 489 pfam00693 Herpes_TK Thymidine 94.1 0.06 1.5E-06 35.2 3.6 38 426-464 1-38 (279) 490 cd03256 ABC_PhnC_transporter A 94.1 0.036 9.1E-07 37.6 2.4 37 415-452 23-60 (241) 491 PRK06793 fliI flagellum-specif 94.1 0.27 6.9E-06 28.5 6.9 78 402-480 139-218 (432) 492 COG1120 FepC ABC-type cobalami 94.1 0.045 1.1E-06 36.6 2.9 37 415-452 24-61 (258) 493 cd03228 ABCC_MRP_Like The MRP 94.1 0.041 1.1E-06 36.9 2.7 40 415-455 24-64 (171) 494 cd00267 ABC_ATPase ABC (ATP-bi 94.1 0.04 1E-06 37.1 2.6 38 415-453 21-59 (157) 495 cd03295 ABC_OpuCA_Osmoprotecti 94.1 0.036 9.1E-07 37.6 2.4 40 414-454 22-62 (242) 496 cd03239 ABC_SMC_head The struc 94.1 0.047 1.2E-06 36.3 3.0 25 421-446 24-48 (178) 497 PRK11000 maltose/maltodextrin 94.1 0.035 9E-07 37.7 2.3 27 416-443 26-52 (369) 498 CHL00095 clpC Clp protease ATP 94.1 0.093 2.4E-06 33.3 4.5 40 408-448 189-228 (823) 499 PRK13646 cbiO cobalt transport 94.0 0.038 9.8E-07 37.3 2.5 25 416-441 30-54 (286) 500 PRK13637 cbiO cobalt transport 94.0 0.033 8.4E-07 37.9 2.2 36 416-452 30-66 (287) No 1 >COG1112 Superfamily I DNA and RNA helicases and helicase subunits [DNA replication, recombination, and repair] Probab=100.00 E-value=1.7e-34 Score=374.59 Aligned_cols=287 Identities=28% Similarity=0.342 Sum_probs=221.6 Q ss_pred CCCHHHHHHHHHHHHHHHCCEEEECHHHHHHHCCCCCCCCCEEEECCHHHCCHHHHHHHHHHCCEEEEEECCCCCCCCHH Q ss_conf 86689999987898774067899257799872585225410788606323698998898861182899607786686201 Q gi|254780229|r 1323 NIPIRQLISRAKNSLLQLKPCFMMSPMSVANYLEPKDVKFDLVIMDESSQIKPEDALGVIARGKQIVVVGDPKQLPPTRF 1402 (1775) Q Consensus 1323 ~~~~R~l~~~a~~~~~~l~P~~~msp~sva~~l~~~~~~FD~VI~DEASQ~~~~~al~al~rakqvvvvGD~kQlpPt~f 1402 (1775) ++..+.+.......+....+ .+|+|.+++.+-......||+||||||||+.++.+++++.||+++|+||||||||||.| T Consensus 451 ~~~~~~~~~~~~~~i~~~~~-~~~~~~~~a~~~~~~~~~fd~viiDEAsQ~~~~~~~~~l~~~~~~il~GD~kQL~p~~~ 529 (767) T COG1112 451 LKELRRLKKKAVTKILEAAD-VVLSTLSIAGFSILKKYEFDYVIIDEASQATEPSALIALSRAKKVILVGDHKQLPPTVF 529 (767) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHCCCHHHHHHHHHCCCCCEEEEECCCHHCCHHHHHHHHHHCEEEEECCHHHCCCCCC T ss_conf 34433555666888888776-54041014443221003577899944110111233323534255899146310787200 Q ss_pred HHHCCCCCCCCCCHHHCCCCCHHHHHHHHH--CCCCEECCCCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCC------- Q ss_conf 100013444432011014400079999975--58752053055799789999985036980476468776666------- Q gi|254780229|r 1403 FDHDNDQEDYDEEVAAVSQTESILDALLPL--FSMRRLQWHYRSLNENLIACSNYYFYDNSLIVFPSPYTSVD------- 1473 (1775) Q Consensus 1403 f~~~~~~~~~~~~~~~~~~~eSild~~~~~--~p~~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~psp~~~~~------- 1473 (1775) |... ...+ -..|.+...... ....+|+|||| ||++||+|||+.||+|+|...+....... T Consensus 530 ~~~~-----~~~~-----~~~slf~~~~~~~~~~~~~L~~qYR-m~~~i~~f~s~~fY~~~l~~~~~~~~~~~~~~~~~~ 598 (767) T COG1112 530 FKES-----SPEG-----LSASLFERLIDNGPEVVYLLRVQYR-MHPDIIAFSSKVFYNGRLEVHTSFLAFTLLDGEIPE 598 (767) T ss_pred HHHC-----CCHH-----HHHHHHHHHHHHCCCHHHHHHHHHC-CCHHHHHCCHHCEECCCCEECCCHHHHHHCCCCCCC T ss_conf 2112-----6301-----2267899997404313367787617-788898331322557763024105544302455434 Q ss_pred ---CCCEEEEECCCCCC----CCCCCHHHHHHHHHHHHHHHHCCC-CCEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf ---68848997289721----788898999999999999953185-7718998199899999999999993129768999 Q gi|254780229|r 1474 ---RYGVGFTHVKNGVV----VDQGNPEEARVIALAVKDHALRYP-EESLGVIAMNAKQRDLIESTINKLCRKDSEVEKA 1545 (1775) Q Consensus 1474 ---~~~v~~~~v~~g~~----~~~~N~~EA~~vv~~v~~~~~~~~-~~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~ 1545 (1775) ..++.+..+-+... ....|..||..|...+..++.... ...|||||++.+|..+|...+.... T Consensus 599 ~~~~~~~~~~~~~~~~~~~~~~s~~n~~ea~~v~~~~~~~~~~~~~~~~igvispy~~q~~~i~~~~~~~~--------- 669 (767) T COG1112 599 VVISNPLEFYDTLGAEEFFESKSKLNELEAEIVKVIVDELLKDGLEENDIGVISPYRAQVSLIRRLLNEAG--------- 669 (767) T ss_pred CCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHCEEECCCHHHHHHHHHHHHHCC--------- T ss_conf 46664246776423655665311222544659999999999834524303154453889999999876415--------- Q ss_pred HHHHCCCCCCEEECCHHHCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEEEECCCHHHC Q ss_conf 76411478985854256808666788999966507888876200114777656246899999832142698870387887 Q gi|254780229|r 1546 ISKLRMHADPFFIKNLENVQGDERDRIFISFTYGPHEPGGRVFQRFGPINSDIGWRRLNVLFTRSRRRIEVFSTMRYLDV 1625 (1775) Q Consensus 1546 ~~~~~~~~e~~fvknle~vQGdERDvI~~S~~yg~~~~~g~~~~~fGpln~~gg~rRlNVa~sRAr~~~~v~~s~~~~~i 1625 (1775) .+++|+|+++|||+||||||+|+||+.+. . |+++..|+.||||||+||||++|+||. ...++ T Consensus 670 --------~~v~v~tVd~fQG~EkdvIi~S~v~s~~~-~-------~~i~~l~d~rRLNVAlTRAk~~livvg--~~~~l 731 (767) T COG1112 670 --------KGVEVGTVDGFQGREKDVIILSLVRSNDD-K-------GEIGFLGDPRRLNVALTRAKRKLIVVG--SSSTL 731 (767) T ss_pred --------CCCEECCCCCCCCCCCCEEEEEEEECCCC-C-------CCCCCCCCCCCEEEHHHHCCCCEEEEE--CHHHC T ss_conf --------67257887661798665799986541787-7-------753213687521146444026459996--60331 Q ss_pred CHHHCCCHHHHHHHHHHHHHHCC Q ss_conf 96653543489999999997406 Q gi|254780229|r 1626 VVDIDSKLGIRVMRDFLHFAETG 1648 (1775) Q Consensus 1626 ~~~~~~~~g~~~lk~~l~~a~~g 1648 (1775) ........++..++.+.-|...+ T Consensus 732 ~~~~~~~~~~~~~~~~~~~~~~~ 754 (767) T COG1112 732 ESDPLYKRLINDLKRKGLLAELN 754 (767) T ss_pred CCCHHHHHHHHHHHHHHHHHHCC T ss_conf 45613789999998733422023 No 2 >KOG1803 consensus Probab=99.95 E-value=1.2e-26 Score=292.82 Aligned_cols=247 Identities=26% Similarity=0.344 Sum_probs=179.8 Q ss_pred CCCCEEEECCHHHCCHHHHHHHHHHCCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHHCC---CC Q ss_conf 541078860632369899889886118289960778668620110001344443201101440007999997558---75 Q gi|254780229|r 1360 VKFDLVIMDESSQIKPEDALGVIARGKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPLFS---MR 1436 (1775) Q Consensus 1360 ~~FD~VI~DEASQ~~~~~al~al~rakqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~~p---~~ 1436 (1775) ..||+||||||+|+--+...++++.+|++|++||||||||+-+-.... ...-.-|+|+-....|+ .+ T Consensus 357 ~~fD~vIIDEaaQamE~~cWipvlk~kk~ILaGDp~QLpP~v~S~~a~----------~~gl~~Sl~erlae~~~~~~~~ 426 (649) T KOG1803 357 RTFDLVIIDEAAQAMEPQCWIPVLKGKKFILAGDPKQLPPTVLSDKAK----------RGGLQVSLLERLAEKFGNLSKI 426 (649) T ss_pred CCCCEEEEEHHHHHCCCHHHHHHHCCCCEEEECCCCCCCCCCCCCHHH----------HCCCHHHHHHHHHHHCCCCHHH T ss_conf 678889981254323501306776277369928800178600242334----------2451234999999872650132 Q ss_pred EECCCCCCCCHHHHHHHHHHCCCCEEEECCC-----------CCCCC-CCCCEEEEECCCCC----------CCCCCCHH Q ss_conf 2053055799789999985036980476468-----------77666-66884899728972----------17888989 Q gi|254780229|r 1437 RLQWHYRSLNENLIACSNYYFYDNSLIVFPS-----------PYTSV-DRYGVGFTHVKNGV----------VVDQGNPE 1494 (1775) Q Consensus 1437 ~L~~HyRs~~e~lI~fSN~~fY~~~L~~~ps-----------p~~~~-~~~~v~~~~v~~g~----------~~~~~N~~ 1494 (1775) +|..+|| ||+.|.+|||+.||+|+|.+-++ +.... -.-++-++.-.+-+ +.+..|.. T Consensus 427 ~Ln~QYR-Mn~~Im~wsn~~fY~~qlka~~~v~~~lL~dl~~v~~t~~t~~PlvlvDT~~~~~~e~~~e~~~~~S~~N~g 505 (649) T KOG1803 427 LLNEQYR-MNEKIMNWSNEVFYNGQLKAASSVASHLLRDLPNVLATESTKSPLVLVDTQGEKDEEKRGEEEELGSKYNEG 505 (649) T ss_pred HHHHHHC-CHHHHHHCCHHHHCCCEEEECCHHHHHHHHCCCCCCCCCCCCCCEEEEECCCCHHHHHCCCHHHCCCCCCHH T ss_conf 3455640-368886075765367742005123323554056777764458967999656605444042033324547778 Q ss_pred HHHHHHHHHHHHHHCCC-CCEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCHHHCCCCCCCEEE Q ss_conf 99999999999953185-77189981998999999999999931297689997641147898585425680866678899 Q gi|254780229|r 1495 EARVIALAVKDHALRYP-EESLGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKNLENVQGDERDRIF 1573 (1775) Q Consensus 1495 EA~~vv~~v~~~~~~~~-~~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvknle~vQGdERDvI~ 1573 (1775) ||+-|...|.+++..-- ...|||||+|.+|..++.... ...| -..=|+++..|||.|+|||| T Consensus 506 Ea~Iv~~Hv~~L~~~gV~p~dIaVIsPY~aQv~llR~~~---------~~~~--------~~veV~TVD~fQGrEkdvVI 568 (649) T KOG1803 506 EAKIVMEHVKRLLEAGVQPSDIAVISPYNAQVSLLREED---------EEDF--------RDVEVGTVDGFQGREKDVVI 568 (649) T ss_pred HHHHHHHHHHHHHHCCCCHHHEEEECCCHHHHHHHHHCC---------CCCC--------CCCEEECCCCCCCCEEEEEE T ss_conf 899999999999984998567378655168888875303---------5667--------66346021253552101899 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEEEECCCHHHCCHHHCCCHHHHHHHHHHHHHHCCCCC Q ss_conf 996650788887620011477765624689999983214269887038788796653543489999999997406422 Q gi|254780229|r 1574 ISFTYGPHEPGGRVFQRFGPINSDIGWRRLNVLFTRSRRRIEVFSTMRYLDVVVDIDSKLGIRVMRDFLHFAETGYME 1651 (1775) Q Consensus 1574 ~S~~yg~~~~~g~~~~~fGpln~~gg~rRlNVa~sRAr~~~~v~~s~~~~~i~~~~~~~~g~~~lk~~l~~a~~g~~~ 1651 (1775) ||||-..+ -|. +++-|..|||||||||||+++.|++. +++-..|-..||.|.+|......- T Consensus 569 fsmVRSN~--k~e-------vGFL~e~RRLNVAiTRaRRh~~vIgd--------s~tl~~~~~~l~k~~~f~~~~~~~ 629 (649) T KOG1803 569 FSLVRSND--KGE-------VGFLGETRRLNVAITRARRHFVVIGD--------SRTLKEGNEFLKKLVEFLEENKLV 629 (649) T ss_pred EEEEEECC--CCC-------CCCCCCCCEEEEEEEECCCEEEEECC--------CHHHHHHHHHHHHHHHHHHHCCEE T ss_conf 99996367--654-------44347764356788732413899818--------088876689999999986322643 No 3 >KOG1805 consensus Probab=99.91 E-value=4.5e-23 Score=255.50 Aligned_cols=242 Identities=25% Similarity=0.376 Sum_probs=180.9 Q ss_pred CCEEEECHHHHHHHCCCCCCCCCEEEECCHHHCCHHHHHHHHHHCCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCC Q ss_conf 67899257799872585225410788606323698998898861182899607786686201100013444432011014 Q gi|254780229|r 1341 KPCFMMSPMSVANYLEPKDVKFDLVIMDESSQIKPEDALGVIARGKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVS 1420 (1775) Q Consensus 1341 ~P~~~msp~sva~~l~~~~~~FD~VI~DEASQ~~~~~al~al~rakqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~ 1420 (1775) .+++..+++.|..-|--.. .||++||||||||..+-+||++.-+++.|+|||+.||||-. +.... .+.. T Consensus 777 ~~IVa~TClgi~~plf~~R-~FD~cIiDEASQI~lP~~LgPL~~s~kFVLVGDh~QLpPLV---~s~ea-------r~~G 845 (1100) T KOG1805 777 TSIVACTCLGINHPLFVNR-QFDYCIIDEASQILLPLCLGPLSFSNKFVLVGDHYQLPPLV---RSSEA-------RQEG 845 (1100) T ss_pred CCEEEEECCCCCCHHHHCC-CCCEEEECCCCCCCCCHHHHHHHHCCEEEEECCCCCCCCCC---CCHHH-------HHCC T ss_conf 7679997157886555214-26789986511114342121021123589964633378630---36455-------5567 Q ss_pred CCCHHHHHHHHHCCC--CEECCCCCCCCHHHHHHHHHHCCCCEEEECCCCCC---------------------------C Q ss_conf 400079999975587--52053055799789999985036980476468776---------------------------6 Q gi|254780229|r 1421 QTESILDALLPLFSM--RRLQWHYRSLNENLIACSNYYFYDNSLIVFPSPYT---------------------------S 1471 (1775) Q Consensus 1421 ~~eSild~~~~~~p~--~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~psp~~---------------------------~ 1471 (1775) -.+|+|......-|+ ..|+-+|| |...|...||..||+|+|.---.++. . T Consensus 846 l~~SLFkrL~e~hpeaV~~Lt~QYR-Mn~~I~~LSN~L~Yg~~L~Cgs~eVs~~~~~~~~~~~~~~~~~s~s~~wl~~v~ 924 (1100) T KOG1805 846 LSESLFKRLSEKHPEAVSSLTLQYR-MNREIMRLSNKLIYGNRLKCGSKEVSRASELDRKGALSVYMDDSSSDHWLQAVL 924 (1100) T ss_pred CCHHHHHHHHHHCCHHHHHHHHHHH-HCCHHHHHHHHHEECCEEEECCHHHHHHHCCCCCHHHHHHCCCCCCHHHHHHHH T ss_conf 3459999986509417776888876-321687640563688764556714422221464034544314442248999750 Q ss_pred CCCCCEEEEECCCC--CC---C--CCCCHHHHHHHHHHHHHHHHC-CCCCEEEEEECCHHHHHHHHHHHHHHHHCCHHHH Q ss_conf 66688489972897--21---7--888989999999999999531-8577189981998999999999999931297689 Q gi|254780229|r 1472 VDRYGVGFTHVKNG--VV---V--DQGNPEEARVIALAVKDHALR-YPEESLGVIAMNAKQRDLIESTINKLCRKDSEVE 1543 (1775) Q Consensus 1472 ~~~~~v~~~~v~~g--~~---~--~~~N~~EA~~vv~~v~~~~~~-~~~~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~ 1543 (1775) .|..+|.|+.+.+- +- + +-+|..||.-|++++..++.. .+-..|||+|.+.+|.++|...+-.. + T Consensus 925 ~p~~~v~f~~~D~~~~ie~~~e~~~i~N~~EA~li~~~~~~fv~sGv~~~dIGIis~YraQv~Li~~~l~~~----~--- 997 (1100) T KOG1805 925 EPTRDVCFVNTDTCSTIESQGEKGGITNHGEAKLISELVEDFVKSGVKPSDIGIISPYRAQVELIRKILSSA----V--- 997 (1100) T ss_pred CCCCCCEEEECCCCCCHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHEEEEEHHHHHHHHHHHHCCCC----C--- T ss_conf 678661688447654223321135767616789999999999982997888026516788999998660366----2--- Q ss_pred HHHHHHCCCCCCEEECCHHHCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEEEECCC Q ss_conf 997641147898585425680866678899996650788887620011477765624689999983214269887038 Q gi|254780229|r 1544 KAISKLRMHADPFFIKNLENVQGDERDRIFISFTYGPHEPGGRVFQRFGPINSDIGWRRLNVLFTRSRRRIEVFSTMR 1621 (1775) Q Consensus 1544 ~~~~~~~~~~e~~fvknle~vQGdERDvI~~S~~yg~~~~~g~~~~~fGpln~~gg~rRlNVa~sRAr~~~~v~~s~~ 1621 (1775) +=|-++..+||.+.|+||+|++-. +..++ -|-|= +.|||||||+||||.++++|.|.. T Consensus 998 ------------lEinTVD~yQGRDKd~IivSfvrs--n~~~~----~~eLL--kD~rRlNVAlTRAK~KLIlvGs~s 1055 (1100) T KOG1805 998 ------------LEINTVDRYQGRDKDCIIVSFVRS--NKKSK----VGELL--KDWRRLNVALTRAKKKLILVGSKS 1055 (1100) T ss_pred ------------EEEEEHHHHCCCCCCEEEEEEEEC--CCCCC----HHHHH--HHHHHHHHHHHHHHCEEEEEECCC T ss_conf ------------455302432377787799999714--88662----77787--766777889876304089993311 No 4 >KOG1802 consensus Probab=99.88 E-value=1.9e-21 Score=238.51 Aligned_cols=245 Identities=23% Similarity=0.306 Sum_probs=174.2 Q ss_pred CCCCEEEECCHHHCCHHH-HHHHHHHCCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHH-CCCCE Q ss_conf 541078860632369899-8898861182899607786686201100013444432011014400079999975-58752 Q gi|254780229|r 1360 VKFDLVIMDESSQIKPED-ALGVIARGKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPL-FSMRR 1437 (1775) Q Consensus 1360 ~~FD~VI~DEASQ~~~~~-al~al~rakqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~-~p~~~ 1437 (1775) ..|-.|+||||.|.+-+. .+|...-+||+|.|||++||.|+....+.... .-..|+++-.... .-..| T Consensus 567 ~kfr~VLiDEaTQatEpe~LiPlvlG~kq~VlVGDh~QLgpvi~~kK~a~A----------gl~qsLferli~lg~~P~~ 636 (935) T KOG1802 567 FKFRTVLIDEATQATEPECLIPLVLGAKQLVLVGDHKQLGPVIMCKKAATA----------GLSQSLFERLISLGIKPIR 636 (935) T ss_pred CCCCEEEEECCCCCCCCCHHHHHHHCCEEEEEECCCCCCCCEEEEHHHHHH----------HHHHHHHHHHHHCCCCCEE T ss_conf 114379971343357861123345363158995154213741200888886----------7789999998860677438 Q ss_pred ECCCCCCCCHHHHHHHHHHCCCCEEE-------------ECCCCCCCCCCCCEEEEECCCCCCC---CC---CCHHHHHH Q ss_conf 05305579978999998503698047-------------6468776666688489972897217---88---89899999 Q gi|254780229|r 1438 LQWHYRSLNENLIACSNYYFYDNSLI-------------VFPSPYTSVDRYGVGFTHVKNGVVV---DQ---GNPEEARV 1498 (1775) Q Consensus 1438 L~~HyRs~~e~lI~fSN~~fY~~~L~-------------~~psp~~~~~~~~v~~~~v~~g~~~---~~---~N~~EA~~ 1498 (1775) |..+|| |||.|-+|.+..||+|.|. .||.|.... ++-|... -|..+ ++ .|+.||-- T Consensus 637 L~vQYR-mhP~lSefpsn~fY~G~LqnGVT~~~R~~~g~~~pwp~p~~---pl~fy~~-~g~eeisasGtSf~Nr~Ea~~ 711 (935) T KOG1802 637 LQVQYR-MHPALSEFPSNMFYEGELQNGVTEIERSPLGVDFPWPQPDK---PLFFYVC-YGQEEISASGTSFLNRTEAAN 711 (935) T ss_pred EEEEEE-ECHHHHHCCHHHHCCCHHHCCCCHHHHCCCCCCCCCCCCCC---CCCEEEE-CCCEEEECCCCCEECHHHHHH T ss_conf 887510-08056536245422340104762443166788777778898---6114774-260014036654002788887 Q ss_pred HHHHHHHHHHCC-CCCEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCHHHCCCCCCCEEEEEEE Q ss_conf 999999995318-5771899819989999999999999312976899976411478985854256808666788999966 Q gi|254780229|r 1499 IALAVKDHALRY-PEESLGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKNLENVQGDERDRIFISFT 1577 (1775) Q Consensus 1499 vv~~v~~~~~~~-~~~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvknle~vQGdERDvI~~S~~ 1577 (1775) +..+|..+++.. +..-|||||++..|+.+|-+.+...-..++.+= .-+=|-++..|||.|.|.||||+| T Consensus 712 ~ekii~~l~~~gv~~~qIGVITpYegQr~~i~~ym~~~gsl~~~ly----------~~veVasVDaFQGrEKdfIIlSCV 781 (935) T KOG1802 712 CEKIITKLLKSGVKPSQIGVITPYEGQRSYIVNYMQTNGSLHKDLY----------KEVEVASVDAFQGREKDFIILSCV 781 (935) T ss_pred HHHHHHHHHHCCCCHHHEEEECCCCHHHHHHHHHHHHCCCCCCCHH----------HEEEEEEECCCCCCCCCEEEEEEE T ss_conf 9999999998399889923421340169999999876176450401----------236887520226764545999986 Q ss_pred EECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEEEECCCHHHCCHHHCCCHHHHHHHHHHHHHHC Q ss_conf 5078888762001147776562468999998321426988703878879665354348999999999740 Q gi|254780229|r 1578 YGPHEPGGRVFQRFGPINSDIGWRRLNVLFTRSRRRIEVFSTMRYLDVVVDIDSKLGIRVMRDFLHFAET 1647 (1775) Q Consensus 1578 yg~~~~~g~~~~~fGpln~~gg~rRlNVa~sRAr~~~~v~~s~~~~~i~~~~~~~~g~~~lk~~l~~a~~ 1647 (1775) -... .. -.|-| +..||||||+||||+.++|+..- .-++.. -+-.+|+.|.+. T Consensus 782 Rsn~-~q-----gIGFl---~d~RRlNVaLTRaK~glvivGN~-------~~L~k~--~LW~~li~h~~e 833 (935) T KOG1802 782 RSNE-HQ-----GIGFL---NDPRRLNVALTRAKYGLVIVGNP-------KVLRKH--PLWGHLITHYKE 833 (935) T ss_pred ECCC-CC-----CCCCC---CCCHHHHHHHHHCCCCEEEECCH-------HHHHHC--HHHHHHHHHHHC T ss_conf 1365-44-----43211---68014332220214316996688-------885324--478999998630 No 5 >KOG1801 consensus Probab=99.71 E-value=1.5e-16 Score=187.47 Aligned_cols=235 Identities=23% Similarity=0.216 Sum_probs=168.4 Q ss_pred CCCCCEEEECCHHHCCHHHHHHHHHH--CCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHH-CCC Q ss_conf 25410788606323698998898861--182899607786686201100013444432011014400079999975-587 Q gi|254780229|r 1359 DVKFDLVIMDESSQIKPEDALGVIAR--GKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPL-FSM 1435 (1775) Q Consensus 1359 ~~~FD~VI~DEASQ~~~~~al~al~r--akqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~-~p~ 1435 (1775) ...||.||||||+|+.-...+++|.. +...|+|||+.|||+|-. +... . . ..-..|+|....-. .+. T Consensus 534 ~~p~~~vviDeaaq~~e~~s~~PL~l~g~~~~~lvgd~~qlP~~V~--s~~~-~--~-----~k~~~slf~rl~l~~~~~ 603 (827) T KOG1801 534 GPPLDTVVIDEAAQKYEPSSLEPLQLAGYQHCILVGDLAQLPATVH--SSPA-G--C-----FKYMTSLFERLELAGHKT 603 (827) T ss_pred CCCCEEEEECHHHHHHCCCCCCCHHHCCCCCEEEECCHHHCCCCEE--CCCC-C--C-----CCHHHHHHHHHHHCCCCC T ss_conf 8884024405877753525511134414674477346442741000--1555-6--6-----415788999999736842 Q ss_pred CEECCCCCCCCHHHHHHHHHHCCCCEEEECCC---------CCCCCCCCCEEEEECCCCCC-----CCCCCHHHHHHHHH Q ss_conf 52053055799789999985036980476468---------77666668848997289721-----78889899999999 Q gi|254780229|r 1436 RRLQWHYRSLNENLIACSNYYFYDNSLIVFPS---------PYTSVDRYGVGFTHVKNGVV-----VDQGNPEEARVIAL 1501 (1775) Q Consensus 1436 ~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~ps---------p~~~~~~~~v~~~~v~~g~~-----~~~~N~~EA~~vv~ 1501 (1775) .+|.-.|| |||++..|.|.+||+++|.-.+. +.......+..|..|..|.. .+..|..|+..++. T Consensus 604 ~~L~vqyr-mhp~Is~fP~~~fy~~~i~d~~~vs~~~~~~~~~~~~~~~~y~f~~v~~g~e~~~~~~s~~n~~E~~~~~~ 682 (827) T KOG1801 604 LLLTVQYR-MHPEISRFPSKEFYGGRLKDVNNVSESNTVKLWHSGETFGPYPFFNVHYGKERAGGGKSPVNNEEVRFVGA 682 (827) T ss_pred HHCCEEEE-CCCCCCCCCHHHHCCCCEECCCCCCHHCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 10030320-16854667345433573001665420012354556776576489981015332278988512799999999 Q ss_pred HHHHHHHCCC-----CCEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCHHHCCCCCCCEEEEEE Q ss_conf 9999953185-----77189981998999999999999931297689997641147898585425680866678899996 Q gi|254780229|r 1502 AVKDHALRYP-----EESLGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKNLENVQGDERDRIFISF 1576 (1775) Q Consensus 1502 ~v~~~~~~~~-----~~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvknle~vQGdERDvI~~S~ 1576 (1775) .+..+.+..+ ..++||||++..|...+.+..-.....- ..+ .-..-|-.++.|||-|+|||++|+ T Consensus 683 ~~~~l~~~~~~~~~~~~~vGvisPY~~q~~~l~~~~~~~~~~~---~~~-------~~~i~v~tvD~fqg~e~diii~s~ 752 (827) T KOG1801 683 IYSRLYKVSQPQVSVPGSVGVISPYKNQVKALRERFPEAYSLL---LAN-------NVDLSVSTVDSFQGGERDIIIIST 752 (827) T ss_pred HHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH---HCC-------CCCEEEEECCCCCCCCCCEEEEEC T ss_conf 9987840015556666644631610789999999726653013---315-------641244453654368766378732 Q ss_pred EEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEEEECCCHH Q ss_conf 65078888762001147776562468999998321426988703878 Q gi|254780229|r 1577 TYGPHEPGGRVFQRFGPINSDIGWRRLNVLFTRSRRRIEVFSTMRYL 1623 (1775) Q Consensus 1577 ~yg~~~~~g~~~~~fGpln~~gg~rRlNVa~sRAr~~~~v~~s~~~~ 1623 (1775) |-..++ |++...+..||+|||+||||.-+||+.+..+- T Consensus 753 vrs~~~---------g~igf~~~~~RlnvALtra~~~l~v~Gne~~L 790 (827) T KOG1801 753 VRSIDE---------GSIGFECNLRRLNVALTRARTCFWLVGNEITL 790 (827) T ss_pred CEECCC---------CCEEEECCHHHHHHHHCCCCCCEEEECCCCCC T ss_conf 301477---------76014311778888603131215770563201 No 6 >KOG1807 consensus Probab=99.61 E-value=6.6e-13 Score=149.50 Aligned_cols=241 Identities=18% Similarity=0.299 Sum_probs=168.9 Q ss_pred CEEEECCHHHCCHHHHHHHHHHC-CEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHH-CCCCEECC Q ss_conf 07886063236989988988611-82899607786686201100013444432011014400079999975-58752053 Q gi|254780229|r 1363 DLVIMDESSQIKPEDALGVIARG-KQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPL-FSMRRLQW 1440 (1775) Q Consensus 1363 D~VI~DEASQ~~~~~al~al~ra-kqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~-~p~~~L~~ 1440 (1775) -.|||.||.-+--...|+++.-. .+||++||+|||-|.+ ........ +.-.-|+|+-...+ +|=.+|+- T Consensus 722 kivivEEAAEVlEahiIaal~p~~EhviLIGDHKQLrP~~---~vy~L~q~------fnL~iSlFERLVe~glpfsrLn~ 792 (1025) T KOG1807 722 KIVIVEEAAEVLEAHIIAALTPHTEHVILIGDHKQLRPFS---GVYKLPQI------FNLSISLFERLVEAGLPFSRLNL 792 (1025) T ss_pred CEEEEHHHHHHHHCCHHHHHCCCCEEEEEECCHHHCCCCC---CHHHHHHH------CCHHHHHHHHHHHCCCCHHHHHH T ss_conf 5899745767754122665322300589963532238995---35567675------26138999999984997255557 Q ss_pred CCCCCCHHHHHHHHHHCCCCEEEECCC--CCCCCCCCCEEEEECCCCCC------CCCCCHHHHHHHHHHHHHHHHC-CC Q ss_conf 055799789999985036980476468--77666668848997289721------7888989999999999999531-85 Q gi|254780229|r 1441 HYRSLNENLIACSNYYFYDNSLIVFPS--PYTSVDRYGVGFTHVKNGVV------VDQGNPEEARVIALAVKDHALR-YP 1511 (1775) Q Consensus 1441 HyRs~~e~lI~fSN~~fY~~~L~~~ps--p~~~~~~~~v~~~~v~~g~~------~~~~N~~EA~~vv~~v~~~~~~-~~ 1511 (1775) +|| ||+.|-.---..||++ |.-.|+ -+...++++--+.+|..-.. .+-.|.-||..+|+++.-++.. |. T Consensus 793 QhR-M~p~IsrllvpsiYdd-l~d~esvk~yedI~gms~nlfFv~hnspee~~de~S~~NlhEa~mlv~l~kyli~q~y~ 870 (1025) T KOG1807 793 QHR-MRPCISRLLVPSIYDD-LLDSESVKEYEDIRGMSKNLFFVQHNSPEECMDEMSIGNLHEAGMLVKLTKYLIQQQYK 870 (1025) T ss_pred HHH-HCHHHHHHHHHHHHHH-HHCCHHHCCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 765-2558997700777654-20444431355454510004677638800073222024477889999999999965787 Q ss_pred CCEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCHHHCCCCCCCEEEEEEEEECCCCCCCCCCCC Q ss_conf 77189981998999999999999931297689997641147898585425680866678899996650788887620011 Q gi|254780229|r 1512 EESLGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKNLENVQGDERDRIFISFTYGPHEPGGRVFQRF 1591 (1775) Q Consensus 1512 ~~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvknle~vQGdERDvI~~S~~yg~~~~~g~~~~~f 1591 (1775) .--|-|.|++.+|-.+|..+|-. .|++. +-|-.+..+||-|+|||++|+|-.. .+|+ . T Consensus 871 psdIviLttY~gQk~ci~rllp~---------~~~st-------v~VatVDsfQGeEndIVLlSLVRsn--~~gr----i 928 (1025) T KOG1807 871 PSDIVILTTYNGQKECIKRLLPQ---------NYRST-------VQVATVDSFQGEENDIVLLSLVRSN--ISGR----I 928 (1025) T ss_pred CCCEEEEEECHHHHHHHHHHHHH---------HHCCC-------CEEEEECCCCCCCCCEEEEEEEECC--CCCE----E T ss_conf 66569995233379999998578---------74676-------4489730446751657999987516--7751----3 Q ss_pred CCCCCCCHHHHHHHHHHHHCCEEEEEECCCHHHCCHHHCCCHHHHHHHH Q ss_conf 4777656246899999832142698870387887966535434899999 Q gi|254780229|r 1592 GPINSDIGWRRLNVLFTRSRRRIEVFSTMRYLDVVVDIDSKLGIRVMRD 1640 (1775) Q Consensus 1592 Gpln~~gg~rRlNVa~sRAr~~~~v~~s~~~~~i~~~~~~~~g~~~lk~ 1640 (1775) |-|..+ .|.-||.||||+-|++|..+.--. +...+=+.-|.-|+. T Consensus 929 GFL~~a---nRvCVALSRAr~glyiiGN~q~la-~~~pLWnkivntLre 973 (1025) T KOG1807 929 GFLRQA---NRVCVALSRARWGLYIIGNVQILA-DTPPLWNKIVNTLRE 973 (1025) T ss_pred EEEECC---CHHHHHHHHHHCCEEEECCEEECC-CCCHHHHHHHHHHHH T ss_conf 334225---514422125123349963400103-585257999998874 No 7 >TIGR00376 TIGR00376 DNA helicase, putative; InterPro: IPR004483 Eukaryotic members of this family have been characterised as binding certain single-stranded G-rich DNA sequences (GGGGT and GGGCT). A number of related proteins are characterised as helicases.; GO: 0003677 DNA binding, 0005524 ATP binding. Probab=99.57 E-value=1.2e-13 Score=157.36 Aligned_cols=251 Identities=25% Similarity=0.399 Sum_probs=182.4 Q ss_pred CCCCCCEEEECCHHHCCHHHHHHHHHHCC----EEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHHC Q ss_conf 22541078860632369899889886118----28996077866862011000134444320110144000799999755 Q gi|254780229|r 1358 KDVKFDLVIMDESSQIKPEDALGVIARGK----QIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPLF 1433 (1775) Q Consensus 1358 ~~~~FD~VI~DEASQ~~~~~al~al~rak----qvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~~ 1433 (1775) ....||++|+||+||...+..+..+.+++ +.|+.||++|+|||....... .....-..++++.....+ T Consensus 411 ~g~~~d~~~~d~~~~~~~p~~~~p~~~~~~~~~~~~~~gd~~~lpp~~~~~~~~--------~~~~~l~~~l~~~~~~~~ 482 (709) T TIGR00376 411 DGWYFDVLVIDEASQALEPSCWIPLLKARNQFHKLILAGDHKQLPPTVLSEDDK--------NALEELELTLFERLIKLY 482 (709) T ss_pred CCCHHEEEEECCHHHHHHHHHHHHHHHCCCHHHHEEECCCCCCCCCCHHCCHHH--------HHHHHHHHHHHHHHHHHC T ss_conf 021000001000012321145544341310121100205643367400011045--------676666788999999851 Q ss_pred CCC----EECCCCCCCCHHHHHHHHHHCCCCEEEECCC---------------C-CC--CCCCCCEEEEECCCCC--C-- Q ss_conf 875----2053055799789999985036980476468---------------7-76--6666884899728972--1-- Q gi|254780229|r 1434 SMR----RLQWHYRSLNENLIACSNYYFYDNSLIVFPS---------------P-YT--SVDRYGVGFTHVKNGV--V-- 1487 (1775) Q Consensus 1434 p~~----~L~~HyRs~~e~lI~fSN~~fY~~~L~~~ps---------------p-~~--~~~~~~v~~~~v~~g~--~-- 1487 (1775) |.+ .|...|| +|+.++.|.|+.||+|++..-++ + .. .....++.|+...+-- + T Consensus 483 p~~~~~~~l~~~~~-~~~~~~~~p~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~g~~~~~~~ 561 (709) T TIGR00376 483 PERDVSRLLNVQYR-LNEKILEFPSREFYNGKLLAHESVANILLRDLPKVEATPDLEDDLDDGIPLLFIDTSGCELRWEG 561 (709) T ss_pred CCCHHHHHHHHHHH-HHHHHHHHCCHHHHCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHCCCCCEEEEECCCCCCCHHH T ss_conf 64002455343434-21467640101220452011035667777653200011023443057664578632456520000 Q ss_pred -----C-CCCCHHHHHHHHHHHHHHHHC-CCCCEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECC Q ss_conf -----7-888989999999999999531-857718998199899999999999993129768999764114789858542 Q gi|254780229|r 1488 -----V-DQGNPEEARVIALAVKDHALR-YPEESLGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKN 1560 (1775) Q Consensus 1488 -----~-~~~N~~EA~~vv~~v~~~~~~-~~~~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvkn 1560 (1775) . ...|+.|+..+...+.....- -|..++|||+.+..|.+++...+..... .-.+-|++ T Consensus 562 ~~~~~~g~~~~~~~~~~~~~~~~~l~~~~~~~~~~g~~~p~~~~~~~~~~~~~~~~~---------------~~~~~~~~ 626 (709) T TIGR00376 562 KEKDSTGSKYNPGEAELVLEHVESLLKFGVPPNDIGVITPYDDQVDLLRKLLHDELK---------------LIDLEVSS 626 (709) T ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHCEEECCCHHHHHHHHHHHHHHHH---------------HCCCEEEC T ss_conf 011024543443678999999999986078700310203531578999987655553---------------02320001 Q ss_pred HHHCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEEEECCCHHHCCHHHCCCHHHHHHHH Q ss_conf 56808666788999966507888876200114777656246899999832142698870387887966535434899999 Q gi|254780229|r 1561 LENVQGDERDRIFISFTYGPHEPGGRVFQRFGPINSDIGWRRLNVLFTRSRRRIEVFSTMRYLDVVVDIDSKLGIRVMRD 1640 (1775) Q Consensus 1561 le~vQGdERDvI~~S~~yg~~~~~g~~~~~fGpln~~gg~rRlNVa~sRAr~~~~v~~s~~~~~i~~~~~~~~g~~~lk~ 1640 (1775) ++.+||.|.|+|++|++-... -|.++.-...||+||++||+|+++.++.....-. .-+-..++. T Consensus 627 ~dg~~g~~~~~~~~~~~~~~~---------~~~~g~l~~~~~~~~~~~~~~~~~~~~g~~~~~~-------~c~~~~~~~ 690 (709) T TIGR00376 627 VDGFQGREKEVIILSFVRSNR---------KGEVGFLKDLRRLNVALTRARRKLILVGDSETLS-------RCGNKFYKR 690 (709) T ss_pred CCCCCCCCCCEEEEEEECCCC---------CCCCCHHHHHHHHHHHHHHCCCCEEEEECHHHHH-------HHHHHHHHH T ss_conf 345456543334542100355---------5432202244332233210112056640436776-------530057889 Q ss_pred HHHHHHCC Q ss_conf 99997406 Q gi|254780229|r 1641 FLHFAETG 1648 (1775) Q Consensus 1641 ~l~~a~~g 1648 (1775) +.+|+... T Consensus 691 ~~~~~~~~ 698 (709) T TIGR00376 691 LVEWCKEN 698 (709) T ss_pred HHHHHHHC T ss_conf 99987531 No 8 >KOG1804 consensus Probab=99.44 E-value=1.8e-13 Score=155.38 Aligned_cols=230 Identities=23% Similarity=0.148 Sum_probs=149.3 Q ss_pred CCCCEEEECCHHHCCHHHHHHHHHHCC---EEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHHCC-- Q ss_conf 541078860632369899889886118---289960778668620110001344443201101440007999997558-- Q gi|254780229|r 1360 VKFDLVIMDESSQIKPEDALGVIARGK---QIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPLFS-- 1434 (1775) Q Consensus 1360 ~~FD~VI~DEASQ~~~~~al~al~rak---qvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~~p-- 1434 (1775) +.|=-+.+|||.|++-...+.++.-=+ |||+.|||+||+|..--.++..-.-+ . .-++..|.-....++ T Consensus 441 ~~f~hil~DeAg~stEpe~lv~i~~~~~~~~vvLsgdh~Qlgpv~~s~~A~~~gl~---r---sLler~l~r~~~~~~~~ 514 (775) T KOG1804 441 GHFRHILVDEAGVSTEPELLVPGKQFRQPFQVVLSGDHTQLGPVSKSARAEELGLD---R---SLLERALTRAQSLVAVV 514 (775) T ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHCCC---H---HHHHHHHHHHHHCCCCC T ss_conf 31454411334664574102440233102479980475311655643225664422---8---88888878886256667 Q ss_pred -------CCEECCCCCCCCHHHHHHHHHHCCCCEEEECCCCCCC---CCCCC-EEEEECCCCCCCC------CCCHHHHH Q ss_conf -------7520530557997899999850369804764687766---66688-4899728972178------88989999 Q gi|254780229|r 1435 -------MRRLQWHYRSLNENLIACSNYYFYDNSLIVFPSPYTS---VDRYG-VGFTHVKNGVVVD------QGNPEEAR 1497 (1775) Q Consensus 1435 -------~~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~psp~~~---~~~~~-v~~~~v~~g~~~~------~~N~~EA~ 1497 (1775) +..|.--||| |+.|+.--|+.||++.|..--+.... ....+ +-|+-+ .|..++ --|+.||. T Consensus 515 g~~~~l~~t~l~rnyrs-hp~il~l~~~l~y~~eL~~~~~~~~v~~~~~w~~liif~g~-~G~~~r~~~s~S~~n~~Ea~ 592 (775) T KOG1804 515 GDYNALCSTGLCRNYRS-HPIILCLENRLYYLGELTAEASEVDVRGLELWSGLILFYGA-PGFTERAGNSPSWLNLEEAA 592 (775) T ss_pred CCCCCCCCHHHHHHHHH-HHHHHHCCCCCCCCCCEEEECCHHHHHHHHHCCCCEECCCC-CCCCCCCCCCHHHCCHHHHH T ss_conf 88332004246778764-10766434445456661352308889998744553122445-54210256981323688899 Q ss_pred HHHHHHHHHHHCCC-C-CEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCHHHCCCCCCCEEEEE Q ss_conf 99999999953185-7-718998199899999999999993129768999764114789858542568086667889999 Q gi|254780229|r 1498 VIALAVKDHALRYP-E-ESLGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKNLENVQGDERDRIFIS 1575 (1775) Q Consensus 1498 ~vv~~v~~~~~~~~-~-~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvknle~vQGdERDvI~~S 1575 (1775) .|+..+........ + ..|||||.+.+|...|..+|...- .-+.-|..+|.+||-||.||++| T Consensus 593 ~V~~~~k~l~~~~~~~~~DIgvitpy~aq~~~i~~~l~~~~----------------~~~~~vgsVe~fqGqE~~viiiS 656 (775) T KOG1804 593 VVVRMTKALPLGEVAQPQDIGVITPYTAQVSEIRKALRRLG----------------VPGVKVGSVEEFQGQEPWVILGS 656 (775) T ss_pred HHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHHHHCCCC----------------CCCCCCCCEEEECCCCCEEEEEE T ss_conf 99988733678875550234045768999899998752257----------------77876552434066453213762 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEE Q ss_conf 66507888876200114777656246899999832142698 Q gi|254780229|r 1576 FTYGPHEPGGRVFQRFGPINSDIGWRRLNVLFTRSRRRIEV 1616 (1775) Q Consensus 1576 ~~yg~~~~~g~~~~~fGpln~~gg~rRlNVa~sRAr~~~~v 1616 (1775) +|-.-.. ...+-.+||- .-++.+||||||+||+-=++. T Consensus 657 tVrS~~~-~~~~~~~~~~--fls~pk~l~v~V~rp~~l~i~ 694 (775) T KOG1804 657 TVRSFAL-PLLDDRYFGL--FLSRPKRLLVAVGRPRALLIN 694 (775) T ss_pred CCCCCCC-CCCCCCCCCE--EECCCCCCEEECCCCCCCCCC T ss_conf 3456677-7510133441--422760000000176431203 No 9 >TIGR02169 SMC_prok_A chromosome segregation protein SMC; InterPro: IPR011891 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by IPR011890 from INTERPRO. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent .. Probab=99.38 E-value=1e-06 Score=84.90 Aligned_cols=113 Identities=20% Similarity=0.292 Sum_probs=63.9 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHH--HHHHHHHHCCCCCEEEECCCCCHHH Q ss_conf 89999985148955994287877038999999999967982899601058999--9998644358441365402661108 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIE--VVRHRLEKAGLGEFCLELHSHKVHK 486 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALd--VV~~RL~~~gLg~~cleLHs~ka~k 486 (1775) ++++..=.-.|-|||+||- ||.+|. |.|.+ -.-+++ +|+++|--.. ..- T Consensus 148 ~L~~~gI~p~gYNvVlQGD--------vt~~I~---------------Ms~~ERRkiIdei--AGvaeFD~kk----ekA 198 (1202) T TIGR02169 148 FLAAAGIYPEGYNVVLQGD--------VTKIIS---------------MSAVERRKIIDEI--AGVAEFDRKK----EKA 198 (1202) T ss_pred HHHHCCCCCCCCEEEEECC--------HHHHCC---------------CCHHHHHHHHHHH--CCHHHHHHHH----HHH T ss_conf 9986176889870674354--------123103---------------8843567888885--4814888999----999 Q ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC Q ss_conf 999999999986311247158999999999999999999999999788540257899999999988512 Q gi|254780229|r 487 RAILDDLRKRIDNRNIKTVPQEIDVEITRYEELKGQLNQYAQEINQIWKNTELSIHQILMGAARYRHNL 555 (1775) Q Consensus 487 ~~vl~~L~~~le~~~~~~~~~e~e~~~~~L~~~r~~L~~~~~~L~~~~~~~~~s~~~~~~~~~~~~~~l 555 (1775) ++-|+....+++ +......+...++.+|...|...-.|.+...+....- .+........+...+ T Consensus 199 ~eeL~~V~~~~~--~~~~~i~E~~~qL~rL~~Er~~aerY~~L~~~k~e~e---~~~ll~~~~~~~~~~ 262 (1202) T TIGR02169 199 LEELEEVEERIE--RVDLVIDEVRQQLDRLEKEREKAERYQALKKEKREYE---GYLLLKEKEALEKQK 262 (1202) T ss_pred HHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHH T ss_conf 999999999999--9999999999888864999988988999999988899---999999999999999 No 10 >KOG1803 consensus Probab=99.14 E-value=1.3e-09 Score=115.06 Aligned_cols=73 Identities=37% Similarity=0.582 Sum_probs=66.6 Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCEE-EECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCC Q ss_conf 2236677346899999851489559-942878770389999999999679828996010589999998644358 Q gi|254780229|r 399 CPLIDDSDSSQHSAVIDVINGKNLI-IEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAG 471 (1775) Q Consensus 399 ~~~~~~aDsSQ~~av~~~~~g~~~v-i~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~g 471 (1775) .|+.-..|+||.+||..+..-+++. ||||||||||.|++.||+.++.+||+||+.+-.+.|+|....||...| T Consensus 180 ~~~~~~ln~SQk~Av~~~~~~k~l~~I~GPPGTGKT~TlvEiI~qlvk~~k~VLVcaPSn~AVdNiverl~~~~ 253 (649) T KOG1803 180 TFFNKNLNSSQKAAVSFAINNKDLLIIHGPPGTGKTRTLVEIISQLVKQKKRVLVCAPSNVAVDNIVERLTHLK 253 (649) T ss_pred CCCCCCCCHHHHHHHHHHHCCCCCEEEECCCCCCCEEEHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHCCCC T ss_conf 36774323779999999735688357557998884043999999999728859997673678999998750035 No 11 >COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] Probab=99.13 E-value=1.1e-05 Score=74.06 Aligned_cols=23 Identities=39% Similarity=0.589 Sum_probs=18.9 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 994287877038999999999967 Q gi|254780229|r 423 IIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 423 vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) +|.||.|+||| .|+++|+.+|.. T Consensus 28 ~IvGPNGSGKS-NI~DAi~fVLG~ 50 (1163) T COG1196 28 AIVGPNGSGKS-NIVDAIRFVLGE 50 (1163) T ss_pred EEECCCCCCHH-HHHHHHHHHCCC T ss_conf 68899988668-899999998389 No 12 >TIGR02168 SMC_prok_B chromosome segregation protein SMC; InterPro: IPR011890 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . The smc gene is often associated with scpB (IPR005234 from INTERPRO) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle , , .; GO: 0005515 protein binding, 0005524 ATP binding, 0005694 chromosome. Probab=99.10 E-value=1.6e-05 Score=72.66 Aligned_cols=117 Identities=19% Similarity=0.385 Sum_probs=69.6 Q ss_pred CCEE-EECCCCCCCHHHHHHHHHHHHHC-------CC---EEEE-EECCHHHHH-------------------------H Q ss_conf 9559-94287877038999999999967-------98---2899-601058999-------------------------9 Q gi|254780229|r 420 KNLI-IEGPPGTGKSQTITNIIAAAMLH-------GK---KILF-CAQKMAAIE-------------------------V 462 (1775) Q Consensus 420 ~~~v-i~GPPGTGKSQTIaNlIa~~la~-------Gk---~VLf-VaeK~aALd-------------------------V 462 (1775) ++++ |.||-|+||| =|+++|-=.|.. |. =|=| -|++++++. + T Consensus 23 ~g~TgiVGPNGcGKS-Ni~DAiRWVLGEqSak~LRg~~M~DVIF~Gs~~Rkp~~~AeV~L~fdN~d~~l~~~y~~y~Ei~ 101 (1191) T TIGR02168 23 KGITGIVGPNGCGKS-NIVDAIRWVLGEQSAKALRGGKMEDVIFNGSETRKPLSLAEVELVFDNSDGLLPGQYADYSEIS 101 (1191) T ss_pred CCEEEEECCCCCCCC-CHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCCCCCCCEEEEEEEEECCCCCCHHHHCCCCEEE T ss_conf 852798627998700-1899999860033477734689995563488777657700689888558883512113677178 Q ss_pred HHHHHHH-CCCCCEEE----------------------------------ECCCCCHHHHHHHHHH----HHHHHHHCCC Q ss_conf 9986443-58441365----------------------------------4026611089999999----9998631124 Q gi|254780229|r 463 VRHRLEK-AGLGEFCL----------------------------------ELHSHKVHKRAILDDL----RKRIDNRNIK 503 (1775) Q Consensus 463 V~~RL~~-~gLg~~cl----------------------------------eLHs~ka~k~~vl~~L----~~~le~~~~~ 503 (1775) |.|||.. -|=++|-+ =++|-.-.+|.++++- +..-....+. T Consensus 102 itRr~~Rg~GeSeY~iN~~~cRLkDI~dLF~dtGlG~~sysII~QG~I~~i~~aKPeerR~i~EEAAGisKYK~Rk~Et~ 181 (1191) T TIGR02168 102 ITRRLYRGDGESEYFINGQPCRLKDIQDLFLDTGLGKRSYSIIEQGKISEIIEAKPEERRAIFEEAAGISKYKERKKETE 181 (1191) T ss_pred EEEEEEECCCCCEEEECCCCCHHHHHHHHHHHCCCCCCCCEEECCCCHHHHHCCCHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 99888745864404557750137779988732278865432303430556524996899999887213578887799999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 7158999999999999999999999999788540 Q gi|254780229|r 504 TVPQEIDVEITRYEELKGQLNQYAQEINQIWKNT 537 (1775) Q Consensus 504 ~~~~e~e~~~~~L~~~r~~L~~~~~~L~~~~~~~ 537 (1775) .........+.||++...+|...+..|+.+...+ T Consensus 182 ~kL~~t~~NL~Rl~Dil~El~~ql~~L~~QA~~A 215 (1191) T TIGR02168 182 RKLERTRENLERLEDILKELERQLESLERQAEKA 215 (1191) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9899999859999999999884227789999999 No 13 >PRK10246 exonuclease subunit SbcC; Provisional Probab=99.06 E-value=2.1e-05 Score=71.30 Aligned_cols=28 Identities=32% Similarity=0.571 Sum_probs=25.2 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 895599428787703899999999996798 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGK 448 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk 448 (1775) +.=|+|.||+|+||| ||-++||.||| |+ T Consensus 30 ~gLFlI~G~TGAGKS-TIlDAItfALY-G~ 57 (1047) T PRK10246 30 NGLFAITGPTGAGKT-TLLDAICLALY-HE 57 (1047) T ss_pred CCEEEEECCCCCCHH-HHHHHHHHHHC-CC T ss_conf 887888899999889-99999999958-99 No 14 >PRK03918 chromosome segregation protein; Provisional Probab=98.87 E-value=7.5e-05 Score=65.52 Aligned_cols=26 Identities=27% Similarity=0.419 Sum_probs=23.2 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 955994287877038999999999967 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) .=|+|.||.|+||| ||.++|+.||+. T Consensus 24 Gi~~I~G~nGsGKS-tIlDAI~~aL~g 49 (882) T PRK03918 24 GINLIIGQNGSGKS-SLLDAILVGLYW 49 (882) T ss_pred CCEEEECCCCCCHH-HHHHHHHHHHCC T ss_conf 92798899999889-999999999818 No 15 >PRK02224 chromosome segregation protein; Provisional Probab=98.82 E-value=0.00011 Score=63.98 Aligned_cols=24 Identities=29% Similarity=0.430 Sum_probs=22.0 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 5994287877038999999999967 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) ++|.||.|+||| ||..+|+.+|+. T Consensus 26 ~~I~G~NGsGKS-sIldAI~~aL~g 49 (880) T PRK02224 26 TVIHGLNGSGKS-SLLEACFFALYG 49 (880) T ss_pred EEEECCCCCCHH-HHHHHHHHHHHC T ss_conf 899899999889-999999999808 No 16 >cd01038 Endonuclease_DUF559 Domain of unknown function, appears to be related to a diverse group of endonucleases. Probab=98.71 E-value=1.3e-07 Score=94.33 Aligned_cols=92 Identities=25% Similarity=0.391 Sum_probs=81.1 Q ss_pred CCHHHHHHHHHHHHC---CCEEEECCCCCCEECCEEEECCCCCCCEEEEEECCCHHHCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 998999999999978---99897065757710125733689888317999745802368324867774448999956990 Q gi|254780229|r 1662 DSDFAVSVVNELEKT---GFACDSQLGDMGFSIDVVVRDPNNPGHYLMGIECDGAMYNSAKSARDRDRLRQEVLERMGWR 1738 (1775) Q Consensus 1662 ds~fe~~v~~~L~~~---G~~v~~qvg~~~~riDl~v~~p~~pg~~~l~iecDG~~yh~~~~ardrD~~Rq~vLe~~GW~ 1738 (1775) .++=|.-+...|+.+ ||....|+.+++|-+|++... ..|+||+||. ||.... ++|+.|+..|+.+||+ T Consensus 13 ~T~aE~~LW~~Lr~~~l~g~kFrRQ~pIg~yIvDF~c~~------~rLvIElDG~-~H~~~~--~~D~~Rd~~L~~~G~~ 83 (108) T cd01038 13 QTDAERLLWQELRRRRLNGFKFRRQAPIGRYIVDFACPE------AKLVVELDGG-QHDEQI--AYDAERDAWLEARGFR 83 (108) T ss_pred CCHHHHHHHHHHHHCCCCCCCEECCCEECCEEEEEEEHH------HCCEEEECCC-CCCCHH--HHHHHHHHHHHHCCCE T ss_conf 999999999999855877983455572679778886042------1826685685-422005--6799999999979998 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHH Q ss_conf 999322164117899999999999 Q gi|254780229|r 1739 IRRIWSVDWFSNPDEVIEPIIREL 1762 (1775) Q Consensus 1739 ~~Riws~dw~~~p~~~~~~l~~~l 1762 (1775) +.|+|..|.+.|++++++.|.++| T Consensus 84 VlRf~n~eV~~~~~~Vl~~I~~~l 107 (108) T cd01038 84 VLRFWNNEVLRNIEAVLEAILAAL 107 (108) T ss_pred EEEEEHHHHHHCHHHHHHHHHHHH T ss_conf 999888998868999999999975 No 17 >PRK11773 uvrD DNA-dependent helicase II; Provisional Probab=98.70 E-value=9.7e-05 Score=64.35 Aligned_cols=64 Identities=28% Similarity=0.315 Sum_probs=52.1 Q ss_pred CCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-C---CEEEEEECCHHHHHHHHHHHHHC Q ss_conf 734689999985148955994287877038999999999967-9---82899601058999999864435 Q gi|254780229|r 405 SDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-G---KKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 405 aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-G---k~VLfVaeK~aALdVV~~RL~~~ 470 (1775) -+..|..||. ...|.-+ |...||||||.||+.-|++++.. | .++|.+..+++|..-++.||... T Consensus 10 LN~~Q~~AV~-~~~g~~l-V~AGaGSGKT~vL~~Ria~Li~~~gv~p~~ILalTFTnkAA~Emk~Rl~~~ 77 (722) T PRK11773 10 LNDKQREAVA-APLGNML-VLAGAGSGKTRVLVHRIAWLMQVENASPYSILAVTFTNKAAAEMRHRIEQL 77 (722) T ss_pred CCHHHHHHHC-CCCCCEE-EEECCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 6999999980-9999879-997387159999999999999829998788289845899999999999987 No 18 >pfam04480 DUF559 Protein of unknown function (DUF559). Probab=98.55 E-value=7.1e-07 Score=86.61 Aligned_cols=92 Identities=26% Similarity=0.413 Sum_probs=78.9 Q ss_pred CHHHHHHHHHHHHC---CCEEEECCCCCCEECCEEEECCCCCCCEEEEEECCCHHHCCCCCHHHHHHHHHHHHHHCCCCE Q ss_conf 98999999999978---998970657577101257336898883179997458023683248677744489999569909 Q gi|254780229|r 1663 SDFAVSVVNELEKT---GFACDSQLGDMGFSIDVVVRDPNNPGHYLMGIECDGAMYNSAKSARDRDRLRQEVLERMGWRI 1739 (1775) Q Consensus 1663 s~fe~~v~~~L~~~---G~~v~~qvg~~~~riDl~v~~p~~pg~~~l~iecDG~~yh~~~~ardrD~~Rq~vLe~~GW~~ 1739 (1775) ++=|.-+-..|+.+ ||.-.-|+.+++|-+|++... ..|.||+||. +|.... +.|+.|+..|+.+||++ T Consensus 15 T~aE~~LW~~Lr~~~l~g~kFrRQ~pIg~yIvDF~C~~------~rLvIElDG~-~H~~~~--~~D~~R~~~L~~~G~~V 85 (109) T pfam04480 15 TDAERKLWQLLRNRRLNGFKFRRQKPIGSYIVDFVCEK------AKLIVELDGG-QHDEQE--EYDARRTAWLESQGFTV 85 (109) T ss_pred CHHHHHHHHHHHHCCCCCCCEEECCEECCEEEEEECHH------HCEEEEECCC-CCCCCH--HHHHHHHHHHHHCCCEE T ss_conf 99999999999854878982570360369878888342------3968883085-211025--78889999999799989 Q ss_pred EEEECHHHHCCHHHHHHHHHHHHH Q ss_conf 993221641178999999999999 Q gi|254780229|r 1740 RRIWSVDWFSNPDEVIEPIIRELR 1763 (1775) Q Consensus 1740 ~Riws~dw~~~p~~~~~~l~~~l~ 1763 (1775) .|+|..|.+.|++++++.|..+|+ T Consensus 86 lRf~n~dV~~~~~~Vl~~I~~~le 109 (109) T pfam04480 86 LRFWNDEVLKNPEGVLEEILLELE 109 (109) T ss_pred EEEEHHHHHHCHHHHHHHHHHHHC T ss_conf 998789988799999999999859 No 19 >COG0419 SbcC ATPase involved in DNA repair [DNA replication, recombination, and repair] Probab=98.50 E-value=0.00059 Score=56.21 Aligned_cols=31 Identities=39% Similarity=0.582 Sum_probs=26.1 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEE Q ss_conf 895599428787703899999999996798289 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKIL 451 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VL 451 (1775) +.=|+|.||.|+||| ||.++|+.||+ |++-. T Consensus 25 ~gi~lI~G~nGsGKS-SIldAI~~ALy-G~~~~ 55 (908) T COG0419 25 SGIFLIVGPNGAGKS-SILDAITFALY-GKTPR 55 (908) T ss_pred CCEEEEECCCCCCHH-HHHHHHHHHHC-CCCCC T ss_conf 883799899999788-99999999982-89877 No 20 >KOG1805 consensus Probab=98.47 E-value=1e-06 Score=85.04 Aligned_cols=74 Identities=30% Similarity=0.475 Sum_probs=66.0 Q ss_pred CHHHHHHHHHHHCCCCEE-EECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 346899999851489559-9428787703899999999996798289960105899999986443584413654026 Q gi|254780229|r 406 DSSQHSAVIDVINGKNLI-IEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHS 481 (1775) Q Consensus 406 DsSQ~~av~~~~~g~~~v-i~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs 481 (1775) .--|..|+..+.+.+|+. |.|=||||||+||+-+|--++|.||+||.+|.+.-|+|..--+|...|++ .+.|=+ T Consensus 671 N~dQr~A~~k~L~aedy~LI~GMPGTGKTTtI~~LIkiL~~~gkkVLLtsyThsAVDNILiKL~~~~i~--~lRLG~ 745 (1100) T KOG1805 671 NNDQRQALLKALAAEDYALILGMPGTGKTTTISLLIKILVALGKKVLLTSYTHSAVDNILIKLKGFGIY--ILRLGS 745 (1100) T ss_pred CHHHHHHHHHHHHCCCHHEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHHHHHHCCCCC--EEECCC T ss_conf 889999999987303322032699898122599999999973881899850567889999987506711--034487 No 21 >cd00221 Vsr Very Short Patch Repair (Vsr) Endonuclease. Endonucleases in DNA repair that recognize damaged DNA and cleave the phosphodiester backbone. Vsr endonucleases have a common endonuclease topology that has been tailored for recognition of TG mismatches. Probab=98.46 E-value=1.2e-06 Score=84.26 Aligned_cols=82 Identities=30% Similarity=0.525 Sum_probs=72.6 Q ss_pred CCCCHHHHHHHHHHHHCCCEEEECCCCCCEECCEEEECCCCCCCEEEEEECCCHHHCCCC-------------------C Q ss_conf 689989999999999789989706575771012573368988831799974580236832-------------------4 Q gi|254780229|r 1660 KQDSDFAVSVVNELEKTGFACDSQLGDMGFSIDVVVRDPNNPGHYLMGIECDGAMYNSAK-------------------S 1720 (1775) Q Consensus 1660 ~~ds~fe~~v~~~L~~~G~~v~~qvg~~~~riDl~v~~p~~pg~~~l~iecDG~~yh~~~-------------------~ 1720 (1775) .-|+.+|..|...|.+.||....++..-..+-|+|+. +|.+||.|||.-||+-. . T Consensus 15 sk~TkpE~~vrk~L~~~G~Ryr~~~k~lpG~PDiv~~------~~k~aIFvdGCFWHgh~~~~~~~pk~n~~fW~~Ki~~ 88 (115) T cd00221 15 SKDTKPERAVRSALWELGYRFRKQDKDLPGKPDIVVP------GYRLAIFVDGCFWHGHPCHKRKPPPKNTEFWLEKIER 88 (115) T ss_pred CCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCCEEEC------CCCEEEEECCEEECCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 7998699999999998899899668877999999735------7628999324043388875788898538999999999 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECHHH Q ss_conf 867774448999956990999322164 Q gi|254780229|r 1721 ARDRDRLRQEVLERMGWRIRRIWSVDW 1747 (1775) Q Consensus 1721 ardrD~~Rq~vLe~~GW~~~Riws~dw 1747 (1775) -++||+...+.|+.+||+++|||..++ T Consensus 89 N~~RD~~~~~~L~~~GW~v~~vWEcel 115 (115) T cd00221 89 NVERDRRVQAALERLGWRVLRVWECEL 115 (115) T ss_pred HHHHHHHHHHHHHHCCCEEEEEEECCC T ss_conf 999899999999988998999983059 No 22 >TIGR02168 SMC_prok_B chromosome segregation protein SMC; InterPro: IPR011890 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . The smc gene is often associated with scpB (IPR005234 from INTERPRO) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle , , .; GO: 0005515 protein binding, 0005524 ATP binding, 0005694 chromosome. Probab=98.46 E-value=0.00072 Score=55.28 Aligned_cols=53 Identities=11% Similarity=0.232 Sum_probs=42.7 Q ss_pred CCCCCHHHHHHHHHHHHHHH-HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 02661108999999999986-311247158999999999999999999999999 Q gi|254780229|r 479 LHSHKVHKRAILDDLRKRID-NRNIKTVPQEIDVEITRYEELKGQLNQYAQEIN 531 (1775) Q Consensus 479 LHs~ka~k~~vl~~L~~~le-~~~~~~~~~e~e~~~~~L~~~r~~L~~~~~~L~ 531 (1775) +--+|.+|++....|.++-+ +.+..+...|+...+..|+.+...-..|.+.-. T Consensus 170 isKYK~Rk~Et~~kL~~t~~NL~Rl~Dil~El~~ql~~L~~QA~~A~~Y~~l~~ 223 (1191) T TIGR02168 170 ISKYKERKKETERKLERTRENLERLEDILKELERQLESLERQAEKAERYKELKE 223 (1191) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 578887799999989999985999999999988422778999999999999999 No 23 >PRK01156 chromosome segregation protein; Provisional Probab=98.45 E-value=0.00074 Score=55.15 Aligned_cols=27 Identities=37% Similarity=0.493 Sum_probs=23.6 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 95599428787703899999999996798 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAMLHGK 448 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la~Gk 448 (1775) .=|+|.||.|+||| ||.++|+.||+ |. T Consensus 24 Gi~~I~G~NGaGKS-tIldAI~~aL~-g~ 50 (895) T PRK01156 24 GINIITGKNGAGKS-SIVDAIRFALF-TD 50 (895) T ss_pred CCEEEECCCCCCHH-HHHHHHHHHHC-CC T ss_conf 92788899999878-99999999977-89 No 24 >KOG1806 consensus Probab=98.45 E-value=1.1e-06 Score=84.67 Aligned_cols=232 Identities=22% Similarity=0.345 Sum_probs=157.6 Q ss_pred CCCCCCEEEECCHHHCCHHHHHHHH---------HHCCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHH Q ss_conf 2254107886063236989988988---------6118289960778668620110001344443201101440007999 Q gi|254780229|r 1358 KDVKFDLVIMDESSQIKPEDALGVI---------ARGKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDA 1428 (1775) Q Consensus 1358 ~~~~FD~VI~DEASQ~~~~~al~al---------~rakqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~ 1428 (1775) ....||=+++-|+.|++-......+ +|=|+.|..||..|+||.. +..+. -.+ ....-|++.- T Consensus 988 lgf~ydnl~mEesaqile~etfiplLlq~p~dg~~rlkr~iligdhhqlPPv~---~n~af----qky--snm~qslf~r 1058 (1320) T KOG1806 988 LGFKYDNLLMEESAQILEIETFIPLLLQNPQDGHNRLKRWILIGDHHQLPPVV---KNQAF----QKY--SNMEQSLFTR 1058 (1320) T ss_pred HCEEECHHHHHHCCCCCCCCCCCHHHHCCCCCHHHHHHHEEECCCCCCCCCCC---CCHHH----HHH--HCCHHHHHHC T ss_conf 32022212465226776444122777629850666764126634410169754---42478----877--2113654210 Q ss_pred HHHH-CCCCEECCCCCCCCHHHHHHHHHHCCCCEEEECCC--CCC--CCCCCCEEEE--ECCC--CCCCC------CCCH Q ss_conf 9975-58752053055799789999985036980476468--776--6666884899--7289--72178------8898 Q gi|254780229|r 1429 LLPL-FSMRRLQWHYRSLNENLIACSNYYFYDNSLIVFPS--PYT--SVDRYGVGFT--HVKN--GVVVD------QGNP 1493 (1775) Q Consensus 1429 ~~~~-~p~~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~ps--p~~--~~~~~~v~~~--~v~~--g~~~~------~~N~ 1493 (1775) .... .|.+.|.-.||- .++|.+.-|..|=.-.+.+--+ |.- ..++.++.+. .|++ |..+. -.|. T Consensus 1059 ~vRl~ip~i~lnaqgra-r~sI~~Ly~wry~lLg~l~~v~~lp~f~~aNagf~~~~qlinv~Df~g~gEt~p~p~fyQnl 1137 (1320) T KOG1806 1059 LVRLGVPIIDLNAQGRA-RASIASLYNWRYPLLGNLPHVSPLPRFQYANAGFAYEFQFINVPDFKGSGETEPSPGFYQNL 1137 (1320) T ss_pred CEECCCCEECCHHHHHH-HHHHHHHHHHHHCCCCCCCCCCCCHHHHCCCCCCEEEEEEECCHHHCCCCCCCCCCCCCCCC T ss_conf 00104613410145568-99999988766123356767765534430255744667995423214656457886302577 Q ss_pred HHHHHHHHHHHH-HHHCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCHHHCCCCCCCEE Q ss_conf 999999999999-9531857718998199899999999999993129768999764114789858542568086667889 Q gi|254780229|r 1494 EEARVIALAVKD-HALRYPEESLGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKNLENVQGDERDRI 1572 (1775) Q Consensus 1494 ~EA~~vv~~v~~-~~~~~~~~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvknle~vQGdERDvI 1572 (1775) .||+.+|..-.= ++.-||-.-|-|-|.+..|..+|++.+.+++..+|-+ +-|-+|..++-+||---|-| T Consensus 1138 geaey~vAly~YMr~Lgypa~Kisilttyngq~~lirdii~rrc~~nPfi----------g~pAkv~tvdk~qgqqndfi 1207 (1320) T KOG1806 1138 GEAEYAVALFQYMRLLGYPANKISILTTYNGQKSLIRDIINRRCSHNPFI----------GQPAKVTTVDKFQGQQNDFI 1207 (1320) T ss_pred CHHHHHHHHHHHHHHHCCCHHHEEEEEEECCHHHHHHHHHHHHCCCCCCC----------CCCCCCCCCCCCCCCCCCEE T ss_conf 62365899999999957856673678764426889999999755688766----------87555775000255343357 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEEEECCC Q ss_conf 9996650788887620011477765624689999983214269887038 Q gi|254780229|r 1573 FISFTYGPHEPGGRVFQRFGPINSDIGWRRLNVLFTRSRRRIEVFSTMR 1621 (1775) Q Consensus 1573 ~~S~~yg~~~~~g~~~~~fGpln~~gg~rRlNVa~sRAr~~~~v~~s~~ 1621 (1775) |+|+|.-+. -|.+ + .-|||-||+||||--++|++-.. T Consensus 1208 Ilslv~tr~--~gh~-------r---dvrrlvva~srarlglyv~~r~~ 1244 (1320) T KOG1806 1208 ILSLVRTRE--VGHL-------R---DVRRLVVAMSRARLGLYVLCRRS 1244 (1320) T ss_pred EEEEHHHHH--HHHH-------C---CHHHHHHHHHHHHCCCHHHHHHH T ss_conf 765224554--0012-------0---39999999988620414678899 No 25 >KOG1802 consensus Probab=98.44 E-value=7.1e-07 Score=86.62 Aligned_cols=71 Identities=31% Similarity=0.491 Sum_probs=64.4 Q ss_pred CCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CEEEEEECCHHHHHHHHHHHHHCCCCC Q ss_conf 77346899999851489559942878770389999999999679-828996010589999998644358441 Q gi|254780229|r 404 DSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KKILFCAQKMAAIEVVRHRLEKAGLGE 474 (1775) Q Consensus 404 ~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~VLfVaeK~aALdVV~~RL~~~gLg~ 474 (1775) .-.+||..||..|....=-+||||||||||-|++-||.|..-++ -.||++|-.+-|.|-+-.++...||.- T Consensus 410 kLN~SQ~~AV~~VL~rplsLIQGPPGTGKTvtsa~IVyhl~~~~~~~VLvcApSNiAVDqLaeKIh~tgLKV 481 (935) T KOG1802 410 KLNASQSNAVKHVLQRPLSLIQGPPGTGKTVTSATIVYHLARQHAGPVLVCAPSNIAVDQLAEKIHKTGLKV 481 (935) T ss_pred HHCHHHHHHHHHHHCCCCEEEECCCCCCCEEHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHCCCEE T ss_conf 224678999999975985155469998833116899999998528956998165002899999998618547 No 26 >KOG0161 consensus Probab=98.36 E-value=0.0011 Score=53.33 Aligned_cols=26 Identities=15% Similarity=-0.097 Sum_probs=12.3 Q ss_pred CCCCCCCCCHHHHHHHHCCCCCCCCH Q ss_conf 23443210002111221045663200 Q gi|254780229|r 144 EKKIEISLQKLTSIIQNYGYADIEDL 169 (1775) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 169 (1775) ...........-..++.+|++...-. T Consensus 364 ~qa~~~~~~~a~ka~~llg~~~~~~~ 389 (1930) T KOG0161 364 EQAEFDNTEVADKACHLLGINVEEFL 389 (1930) T ss_pred CCCCCCCCHHHHHHHHHCCCCHHHHH T ss_conf 23467773699999998399999999 No 27 >PRK09767 hypothetical protein; Provisional Probab=98.31 E-value=5.7e-06 Score=77.20 Aligned_cols=94 Identities=23% Similarity=0.306 Sum_probs=81.0 Q ss_pred CHHHHHHHHHHHHC---CCEEEECCCCCCEECCEEEECCCCCCCEEEEEECCCHHHCCCCCHHHHHHHHHHHHHHCCCCE Q ss_conf 98999999999978---998970657577101257336898883179997458023683248677744489999569909 Q gi|254780229|r 1663 SDFAVSVVNELEKT---GFACDSQLGDMGFSIDVVVRDPNNPGHYLMGIECDGAMYNSAKSARDRDRLRQEVLERMGWRI 1739 (1775) Q Consensus 1663 s~fe~~v~~~L~~~---G~~v~~qvg~~~~riDl~v~~p~~pg~~~l~iecDG~~yh~~~~ardrD~~Rq~vLe~~GW~~ 1739 (1775) ++=|.-+-..|+.+ ||+-.-|+.++.|-+|++-.. ..|.||.||.. |.. ..++|..|...|+.+||++ T Consensus 16 T~aE~~LW~~LR~~~l~g~kFrRQ~pIg~yIvDF~C~~------~rLvIElDG~~-H~~--~~~~D~~R~~~L~~~G~~V 86 (116) T PRK09767 16 TLQERKLWRYLRSRRFGDFKFRRQHPVGSYILDFACCS------ARVVVELDGGQ-HDL--AVAYDTRRTSWLESQGWTV 86 (116) T ss_pred CHHHHHHHHHHHHCCCCCCEEEECCCCCCEEHHHHHHH------HCEEEEECCCC-CCH--HHHHHHHHHHHHHHCCCEE T ss_conf 99999999999846768980573163158602022233------39799923864-214--6688899999999889979 Q ss_pred EEEECHHHHCCHHHHHHHHHHHHHHH Q ss_conf 99322164117899999999999999 Q gi|254780229|r 1740 RRIWSVDWFSNPDEVIEPIIRELREL 1765 (1775) Q Consensus 1740 ~Riws~dw~~~p~~~~~~l~~~l~~~ 1765 (1775) .|+|..|.+.|.+++++.|..+|... T Consensus 87 lRF~N~dV~~~~~~Vl~~I~~~l~~r 112 (116) T PRK09767 87 LRFWNNEIDCNEEAVLEIILQELNRR 112 (116) T ss_pred EEEEHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99768998878999999999998722 No 28 >COG2852 Very-short-patch-repair endonuclease [Replication, recombination, and repair] Probab=98.30 E-value=8e-06 Score=75.63 Aligned_cols=98 Identities=26% Similarity=0.402 Sum_probs=82.4 Q ss_pred CCHHHHHHHHHHHH---CCCEEEECCCCCCEECCEEEECCCCCCCEEEEEECCCHHHCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 99899999999997---899897065757710125733689888317999745802368324867774448999956990 Q gi|254780229|r 1662 DSDFAVSVVNELEK---TGFACDSQLGDMGFSIDVVVRDPNNPGHYLMGIECDGAMYNSAKSARDRDRLRQEVLERMGWR 1738 (1775) Q Consensus 1662 ds~fe~~v~~~L~~---~G~~v~~qvg~~~~riDl~v~~p~~pg~~~l~iecDG~~yh~~~~ardrD~~Rq~vLe~~GW~ 1738 (1775) .|++|...-..|+. .||.+..|+.+++|.+|++-.++ .|=||.||.. | +-..+.|-.|-..|+.+||+ T Consensus 26 ~t~aE~~LW~~lr~~rL~g~KFrRQ~~ig~yivDF~c~~~------klIvElDG~q-H--~~~~~~Da~Rd~~L~~~G~~ 96 (129) T COG2852 26 QTDAEKALWQHLRNRRLNGFKFRRQQPIGRYIVDFACRDA------KLIVELDGGQ-H--EEREEYDAERDAFLESQGFT 96 (129) T ss_pred CCHHHHHHHHHHHHHHHCCEEEEEEEECCCEEEEEECCCC------CEEEEECCCC-C--HHHHHHHHHHHHHHHHCCCE T ss_conf 8798999999999855157147874001577888982674------0899815752-2--56652148889999968957 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 999322164117899999999999999710 Q gi|254780229|r 1739 IRRIWSVDWFSNPDEVIEPIIRELRELISQ 1768 (1775) Q Consensus 1739 ~~Riws~dw~~~p~~~~~~l~~~l~~~~~~ 1768 (1775) +.|+|-.+-..||+.+++.|...|++.... T Consensus 97 VLRf~N~ev~~n~~~vl~~I~~~l~~~~~~ 126 (129) T COG2852 97 VLRFWNDEVLQNIDRVLEEILAVLEERRPP 126 (129) T ss_pred EEEECCHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 999654888867898999999999842465 No 29 >KOG1807 consensus Probab=98.24 E-value=1.1e-05 Score=74.13 Aligned_cols=141 Identities=18% Similarity=0.234 Sum_probs=90.8 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCHHHHHHHHHHHHCCCCCCHHHHHCCHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 98999999999986128980886545566225147778643100013221014320878788504555556655666433 Q gi|254780229|r 304 WPEDYFLQIQKIIEESKSHWAVRRYGVLGLLNFSKMLMYLDLDHLRWPEGEDNILNHDIIQRLFIAQSCDHDKDNASSNQ 383 (1775) Q Consensus 304 ~~~~~~~~~~~~i~~~~~~w~v~~~~~lg~fsf~k~~m~~DL~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 383 (1775) ....||++.++.++..+ +-...-+|+|-+|+.- -+.+...| +++..++.....++...... T Consensus 303 ~t~~Y~eayrhVLr~lq------r~s~~~~vpf~rylvh-----------c~s~~~~p--r~L~~~~rytinp~~~n~s~ 363 (1025) T KOG1807 303 KTVKYVEAYRHVLRELQ------RASLVEFVPFLRYLVH-----------CDSLKQQP--RLLWSDVRYTINPQFANASR 363 (1025) T ss_pred HHHHHHHHHHHHHHHHH------HCCCCCCCCHHHHHCC-----------CHHHHHCH--HHHHCCCCEECCCCCCCCHH T ss_conf 38889999999999766------3321345416663030-----------14433151--77650873411712368066 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC------CEEEEEECCH Q ss_conf 4335688211330002236677346899999851489559942878770389999999999679------8289960105 Q gi|254780229|r 384 NIEYKIDEIEDIHQRCPLIDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG------KKILFCAQKM 457 (1775) Q Consensus 384 ~~~~~~~~~~~~~~~~~~~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G------k~VLfVaeK~ 457 (1775) .+....+| +.+ -++=-||||..|-....+-+--+||||||||||.+-.-+|-..|-.- -.+|+||..+ T Consensus 364 ~~~n~leP-p~~-----g~~ildsSq~~A~qs~ltyelsliqgppGTgkt~vtlkav~tLL~n~s~~~~~epIlvvC~Tn 437 (1025) T KOG1807 364 HIVNALEP-PGP-----GLVILDSSQQFAKQSKLTYELSLIQGPPGTGKTLVTLKAVDTLLLNSSGYTEPEPILVVCLTN 437 (1025) T ss_pred HHHHHCCC-CCC-----CCEEECHHHHHHHHHHHHHHHHEEECCCCCCCEEEHHHHHHHHHHCCCCCCCCCCEEEEEHHH T ss_conf 66640389-998-----705504889999987765632113259987722034999999996265545556446642235 Q ss_pred HHHHHHHHHHHH Q ss_conf 899999986443 Q gi|254780229|r 458 AAIEVVRHRLEK 469 (1775) Q Consensus 458 aALdVV~~RL~~ 469 (1775) .|+|-.-.|+-. T Consensus 438 havdq~ligiy~ 449 (1025) T KOG1807 438 HAVDQYLIGIYY 449 (1025) T ss_pred HHHHHHHHHHHH T ss_conf 789999999873 No 30 >TIGR01448 recD_rel helicase, RecD/TraA family; InterPro: IPR006345 These sequences represent a family similar to RecD, the exodeoxyribonuclease V alpha chain of IPR006344 from INTERPRO. Members of this family, however, are not found in a context of RecB and RecC and are longer by about 200 amino acids at the amino end. Chlamydia muridarum has both a member of this family and a RecD. . Probab=98.19 E-value=1.8e-05 Score=71.91 Aligned_cols=231 Identities=18% Similarity=0.259 Sum_probs=142.5 Q ss_pred CCCCCCEEEECCHHHCCHHHHHHHHH---HCCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHH-C Q ss_conf 22541078860632369899889886---1182899607786686201100013444432011014400079999975-5 Q gi|254780229|r 1358 KDVKFDLVIMDESSQIKPEDALGVIA---RGKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPL-F 1433 (1775) Q Consensus 1358 ~~~~FD~VI~DEASQ~~~~~al~al~---rakqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~-~ 1433 (1775) .....|+|||||.|=|..|-+.-.|+ -.=++++|||.-||||.+ +-.-+.|+..+. . T Consensus 452 ~~~~~DL~IvDE~SM~Dt~L~~~lL~a~P~~a~lllVGD~DQLPSV~-------------------pG~VL~DLi~s~~i 512 (769) T TIGR01448 452 DPIDADLLIVDESSMVDTWLASSLLAAVPDHARLLLVGDADQLPSVG-------------------PGQVLKDLIQSKVI 512 (769) T ss_pred CCCCCCEEEEECCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCC-------------------CCHHHHHHHHCCCC T ss_conf 34787769981462188999999986179777798883768889886-------------------44089999846886 Q ss_pred CCCEECCCCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHC--CC Q ss_conf 8752053055799789999985036980476468776666688489972897217888989999999999999531--85 Q gi|254780229|r 1434 SMRRLQWHYRSLNENLIACSNYYFYDNSLIVFPSPYTSVDRYGVGFTHVKNGVVVDQGNPEEARVIALAVKDHALR--YP 1511 (1775) Q Consensus 1434 p~~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~psp~~~~~~~~v~~~~v~~g~~~~~~N~~EA~~vv~~v~~~~~~--~~ 1511 (1775) |+..|+.=||=...|=|.-.=+..-.|..-++-|... ..-..+..+++-..+...+ -+..+|+.|...++. +| T Consensus 513 P~~~LT~vyRQ~~~S~Ii~~Ah~~~~G~~Pvlnss~~---~~Df~~~~~~~s~~e~~~~--~i~~~v~~~~~~a~~~Gi~ 587 (769) T TIGR01448 513 PVTRLTKVYRQAEGSSIITLAHKILQGEAPVLNSSSA---KKDFKFLKLDRSEPEEAAK--HIPLMVEKIVGAAKKKGIP 587 (769) T ss_pred CEEECCCEEEECCCCHHHHHHHHHHCCCCCCCCCHHH---HHHHHHHHHHHCCHHHHHH--HHHHHHHHHHHHHHHCCCC T ss_conf 6121211124113664678889873178875232446---7667777640010106667--8999998877898717986 Q ss_pred CCEEEEEEC-C--HHHHHHHHHHHHHHHHCCH--H-HHHHHHHH----CCC----------------------------- Q ss_conf 771899819-9--8999999999999931297--6-89997641----147----------------------------- Q gi|254780229|r 1512 EESLGVIAM-N--AKQRDLIESTINKLCRKDS--E-VEKAISKL----RMH----------------------------- 1552 (1775) Q Consensus 1512 ~~siGVvt~-~--~~Q~~~I~~~l~~~~~~~~--~-~~~~~~~~----~~~----------------------------- 1552 (1775) .+-|=|+|+ . ..=-+.|...|.... || . -+.+++-. +.+ T Consensus 588 ~~dIQVLaPM~kG~~Gi~~LN~~Lq~~L--NP~qkG~~~~~~~~~~~~R~GDrV~Qt~Ndyn~e~FNGD~G~I~~i~~a~ 665 (769) T TIGR01448 588 GADIQVLAPMYKGELGIKALNQELQDLL--NPKQKGDQEVIELKEVEYRKGDRVMQTKNDYNKEVFNGDLGVIVEIEKAK 665 (769) T ss_pred CCEEEECCCCCCCCHHHHHHHHHHHHHC--CCCCCCCCCEEEECCEEECCCCEEEEEECCCCCCCCCCCCEEEEEEECHH T ss_conf 0000152589898401788889999970--88323785206515554468872465200765553366630445640022 Q ss_pred ---------------------------CCCEEECCHHHCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf ---------------------------89858542568086667889999665078888762001147776562468999 Q gi|254780229|r 1553 ---------------------------ADPFFIKNLENVQGDERDRIFISFTYGPHEPGGRVFQRFGPINSDIGWRRLNV 1605 (1775) Q Consensus 1553 ---------------------------~e~~fvknle~vQGdERDvI~~S~~yg~~~~~g~~~~~fGpln~~gg~rRlNV 1605 (1775) .-|=+--+++--||-|--+||+-++ ++|++|..| +-|== T Consensus 666 ~~~~k~~~~~~dFdG~~~~~~~~el~~l~~aYA~siHKsQGSEf~~Vi~Pi~----~ah~~ML~R----------NL~YT 731 (769) T TIGR01448 666 EGELKKDLVVVDFDGNEVLYTREELFNLTLAYATSIHKSQGSEFPVVILPIV----TAHYRMLYR----------NLLYT 731 (769) T ss_pred CCCCCCCEEEEEECCCEEEECHHHHHCCCCCHHEEEECCCCCCCCEEEEEEE----ECCHHHCCC----------CHHHH T ss_conf 0323363589987597677355756206320101232104774773666322----122000124----------02555 Q ss_pred HHHHHCCEEEEEECCCHHHCCHH Q ss_conf 99832142698870387887966 Q gi|254780229|r 1606 LFTRSRRRIEVFSTMRYLDVVVD 1628 (1775) Q Consensus 1606 a~sRAr~~~~v~~s~~~~~i~~~ 1628 (1775) |+||||..++++.|..+=+|=.. T Consensus 732 a~TRAk~~~~L~G~~~A~~iA~~ 754 (769) T TIGR01448 732 AITRAKKKLVLVGSAEAFDIAVK 754 (769) T ss_pred HHHHHCCCEEECCCHHHHHHHHH T ss_conf 34320221364236899999986 No 31 >TIGR00618 sbcc exonuclease SbcC; InterPro: IPR004592 All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. SbcC may have nuclease activity that is functionally related to one of the nuclease activities of the RecBCD enzyme (IPR004586 from INTERPRO).; GO: 0004527 exonuclease activity, 0006259 DNA metabolic process. Probab=98.18 E-value=0.0023 Score=50.03 Aligned_cols=138 Identities=17% Similarity=0.177 Sum_probs=68.4 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHHHH Q ss_conf 89559942878770389999999999679828996010589999998644358441365402661108999999999986 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHKRAILDDLRKRID 498 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k~~vl~~L~~~le 498 (1775) -+-|||+||+|+||| ||.++||.||| ||.|=--|+..+-..+.+.+=....+.+|-+.+ +..-..++..++.+.. T Consensus 30 l~~f~i~G~tGAGKt-sLldAI~yALY-GkP~~~~s~~~~~~~~~~~~~~~~~~~~lef~~---~~~~yR~~r~~~~~~~ 104 (1063) T TIGR00618 30 LKLFVICGKTGAGKT-SLLDAITYALY-GKPVPRESEVVASKNSLSRADSEEALAELEFAL---DTKLYRVFRSLRVERR 104 (1063) T ss_pred CCEEEEECCCCCCHH-HHHHHHHHHHC-CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHEE---CCCHHHHHHHHHHHHC T ss_conf 573677788998354-59999999872-888775146788875420010787765464013---5601146776654513 Q ss_pred HHCCCC-C--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHC Q ss_conf 311247-1--------5899999999999999999999999978854025789999999998851222123310 Q gi|254780229|r 499 NRNIKT-V--------PQEIDVEITRYEELKGQLNQYAQEINQIWKNTELSIHQILMGAARYRHNLPLDPVQLH 563 (1775) Q Consensus 499 ~~~~~~-~--------~~e~e~~~~~L~~~r~~L~~~~~~L~~~~~~~~~s~~~~~~~~~~~~~~l~~~~~~l~ 563 (1775) ...... . ...|+.....- .++...--...+.=.+..+..+++--.|........-+.....+- T Consensus 105 ~R~~~~~~q~~~~~~~~~GWe~i~a~~--~~~~~~~~~~~~kLdy~~F~~~~lL~QG~~~~FL~~~s~ekk~~L 176 (1063) T TIGR00618 105 QRESEEEVQVKVEEEKARGWEKILAKK--LKETEELIAKLLKLDYKTFTRVVLLPQGEFDEFLKAKSKEKKELL 176 (1063) T ss_pred CCCCCCCCCHHHHHHCCCCHHHHHHHH--HHHHHHHHHHHHCCCHHHHHHHEECCCHHHHHHHHCCCHHHHHHH T ss_conf 789876410353430356079998633--358999999984464214120000474158998611836799998 No 32 >PRK11054 helD DNA helicase IV; Provisional Probab=98.18 E-value=0.00011 Score=63.90 Aligned_cols=207 Identities=18% Similarity=0.245 Sum_probs=109.9 Q ss_pred CCCEEEECCHHHCCHHH--HHHHHHH---CCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHHCCC Q ss_conf 41078860632369899--8898861---182899607786686201100013444432011014400079999975587 Q gi|254780229|r 1361 KFDLVIMDESSQIKPED--ALGVIAR---GKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPLFSM 1435 (1775) Q Consensus 1361 ~FD~VI~DEASQ~~~~~--al~al~r---akqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~~p~ 1435 (1775) .|+.|+|||==-+.|.- .|-++.. ...+.+||||-|=== =|.++. -+++-.+...||. T Consensus 430 ~~ryILVDEfQDiS~~Q~~Ll~aL~~~~~~~~LfaVGDD~QSIY--rFrGAd---------------i~~~~~F~~~F~~ 492 (684) T PRK11054 430 PWKHILVDEFQDISPQRAALLEALRKQNSQTTLFAVGDDWQAIY--RFSGAD---------------VSLTTAFHERFGE 492 (684) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHEE--EECCCC---------------HHHHHHHHHHCCC T ss_conf 37689987633699999999999873399975999806742101--126888---------------4899989876799 Q ss_pred ---CEECCCCCCCCHHHHHHHHHHCCCCEEEECCCC---CCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf ---520530557997899999850369804764687---76666688489972897217888989999999999999531 Q gi|254780229|r 1436 ---RRLQWHYRSLNENLIACSNYYFYDNSLIVFPSP---YTSVDRYGVGFTHVKNGVVVDQGNPEEARVIALAVKDHALR 1509 (1775) Q Consensus 1436 ---~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~psp---~~~~~~~~v~~~~v~~g~~~~~~N~~EA~~vv~~v~~~~~~ 1509 (1775) ..|.--|||-++ |+..+|+..=.|.-. ++-+ ......++|.+. + .+ ..+++.+.+... . T Consensus 493 a~~~~L~~nYRs~~~-I~~~An~fI~~Np~Q-l~K~l~s~~~~~~~aV~~~--~-------~~--~le~~l~~l~~~--~ 557 (684) T PRK11054 493 GTICHLDTTYRFNSR-IGEVANRFIQQNPHQ-LKKPLNSLTKGKKKAVTLL--P-------ES--QLEALLDKLSGY--A 557 (684) T ss_pred CEEEECCCCCCCCHH-HHHHHHHHHHHCHHH-CCCCCCCCCCCCCCCEEEE--C-------HH--HHHHHHHHHHHC--C T ss_conf 679978788888659-999999999849574-1877565557898628973--4-------67--899999999732--6 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCHHHCCCCCCCEEEE-EEEEECCCCCCC-- Q ss_conf 85771899819989999999999999312976899976411478985854256808666788999-966507888876-- Q gi|254780229|r 1510 YPEESLGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKNLENVQGDERDRIFI-SFTYGPHEPGGR-- 1586 (1775) Q Consensus 1510 ~~~~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvknle~vQGdERDvI~~-S~~yg~~~~~g~-- 1586 (1775) .++.+|-|..-+..+.- +.+.......|.+ .+-..+++.-=|-|-|+||+ .+.- |. T Consensus 558 ~~~~~vllL~R~~~~~p---~~l~~~~~r~p~L------------~i~~~TiHaSKGLEad~V~ilgl~e------G~~g 616 (684) T PRK11054 558 KPEERILLLARYHHLKP---ASLEKAATRWPKL------------QIDFMTIHASKGQQADYVIIVGLQE------GQDG 616 (684) T ss_pred CCCCEEEEEHHHHCCCH---HHHHHHHHHCCCC------------CEEEEEHHHHCCCCCCEEEEEECCC------CCCC T ss_conf 99875998300001380---7999999748688------------7699961331176588899960788------9878 Q ss_pred CCC--CCCCCC-----CC-----CHHHH-HHHHHHHHCCEEEEEECC Q ss_conf 200--114777-----65-----62468-999998321426988703 Q gi|254780229|r 1587 VFQ--RFGPIN-----SD-----IGWRR-LNVLFTRSRRRIEVFSTM 1620 (1775) Q Consensus 1587 ~~~--~fGpln-----~~-----gg~rR-lNVa~sRAr~~~~v~~s~ 1620 (1775) ++. +--||. .. .-||| |=||+||||++++|.++. T Consensus 617 FP~~~~~~~l~~~llp~~e~~~~aEERRLFYVAlTRAK~~v~l~~~~ 663 (684) T PRK11054 617 FPAPARESIMEDALLPPVEDFPDAEERRLFYVALTRAKHRVWLLFNK 663 (684) T ss_pred CCCCCCCCHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 98766553566742587766765888999898474102058999779 No 33 >TIGR00376 TIGR00376 DNA helicase, putative; InterPro: IPR004483 Eukaryotic members of this family have been characterised as binding certain single-stranded G-rich DNA sequences (GGGGT and GGGCT). A number of related proteins are characterised as helicases.; GO: 0003677 DNA binding, 0005524 ATP binding. Probab=98.17 E-value=5e-05 Score=67.32 Aligned_cols=86 Identities=31% Similarity=0.446 Sum_probs=71.5 Q ss_pred CCCCCCHHHHHHHHHHHCCCCE-EEECCCCCCCHHHHHHHHHHHHHCCC--EEEEEECCHHHHHHHHHHHHHCCCCCEEE Q ss_conf 3667734689999985148955-99428787703899999999996798--28996010589999998644358441365 Q gi|254780229|r 401 LIDDSDSSQHSAVIDVINGKNL-IIEGPPGTGKSQTITNIIAAAMLHGK--KILFCAQKMAAIEVVRHRLEKAGLGEFCL 477 (1775) Q Consensus 401 ~~~~aDsSQ~~av~~~~~g~~~-vi~GPPGTGKSQTIaNlIa~~la~Gk--~VLfVaeK~aALdVV~~RL~~~gLg~~cl 477 (1775) +-...+.||..+|..+...++| +|+|||||||+.|+..+|...+..|. +||..+....+.|.+..+|...--...|+ T Consensus 199 ~~~~l~~~~~~~~~~~~~~~~~~~~~gp~g~g~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~d~~~~~~~~~~p~~~~~ 278 (709) T TIGR00376 199 LNPNLNESQKEAVGFALSSKDLFLIHGPPGTGKTRTLVELIRQLVKRGPEERVLVTAPSNIAVDNLLERLAPLYPDLKLL 278 (709) T ss_pred HCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHHHHHHHCCCCEEE T ss_conf 22232234677765431234416872677776215689999999852853306762353103778888876414541022 Q ss_pred EC-CCCCHHH Q ss_conf 40-2661108 Q gi|254780229|r 478 EL-HSHKVHK 486 (1775) Q Consensus 478 eL-Hs~ka~k 486 (1775) .+ |.....+ T Consensus 279 ~~g~p~~~~~ 288 (709) T TIGR00376 279 RLGHPARLLK 288 (709) T ss_pred ECCCHHHHHH T ss_conf 2266046777 No 34 >PRK10875 recD exonuclease V subunit alpha; Provisional Probab=98.16 E-value=4.8e-06 Score=77.93 Aligned_cols=54 Identities=20% Similarity=0.382 Sum_probs=39.6 Q ss_pred CCCEEECCHHHCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEEEECC Q ss_conf 89858542568086667889999665078888762001147776562468999998321426988703 Q gi|254780229|r 1553 ADPFFIKNLENVQGDERDRIFISFTYGPHEPGGRVFQRFGPINSDIGWRRLNVLFTRSRRRIEVFSTM 1620 (1775) Q Consensus 1553 ~e~~fvknle~vQGdERDvI~~S~~yg~~~~~g~~~~~fGpln~~gg~rRlNVa~sRAr~~~~v~~s~ 1620 (1775) .|+-|.=+++--||-|-|.+++-+ |+. ..+| |++ .-|=-||||||++++||++- T Consensus 528 ~e~AYAmTVHKSQGSEF~~V~lvL---P~~-~s~l------LtR----eLlYTAITRAk~~l~lvg~~ 581 (607) T PRK10875 528 HETAWAMTVHKSQGSEFDHAALIL---PSQ-RTPV------VTR----ELVYTAITRARRRLSLYADE 581 (607) T ss_pred CCEEEEEEEECCCCCCCCCEEEEE---CCC-CCCC------CCH----HHHHHHHHHHHCEEEEEECH T ss_conf 577988555535211779679990---587-6752------529----89999987665068999899 No 35 >pfam00580 UvrD-helicase UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Bacillus subtilis addA and Escherichia coli exodeoxyribonuclease V beta have large insertions near to the carboxy-terminus relative to other members of the family. Probab=98.13 E-value=0.00012 Score=63.24 Aligned_cols=150 Identities=23% Similarity=0.258 Sum_probs=95.6 Q ss_pred CCCEEEECCHHHCCHH--HHHHHHHHC-CEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHHCC--- Q ss_conf 4107886063236989--988988611-8289960778668620110001344443201101440007999997558--- Q gi|254780229|r 1361 KFDLVIMDESSQIKPE--DALGVIARG-KQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPLFS--- 1434 (1775) Q Consensus 1361 ~FD~VI~DEASQ~~~~--~al~al~ra-kqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~~p--- 1434 (1775) .|+.|+|||.=-+.+. ..|-.|+.+ +.+.|||||+|-== + |.++ + -+++..+...|| T Consensus 212 r~~~ilVDEfQDtn~~Q~~il~~L~~~~~~l~~VGD~~QSIY-~-FRGA--------------d-~~~~~~f~~~~~~~~ 274 (494) T pfam00580 212 RFKYILVDEFQDTNPIQYELLKLLAGGHPNLFLVGDPDQSIY-G-FRGA--------------D-IENILKFEKDFPAAK 274 (494) T ss_pred HCCEEECHHHCCCHHHHHHHHHHHHCCCCEEEEEECCCEEEE-E-CCCC--------------C-HHHHHHHHHHCCCCC T ss_conf 072654034302649999999998658983999817971442-0-1799--------------9-699999998666566 Q ss_pred CCEECCCCCCCCHHHHHHHHHHCCCCEEEECCCCCCC--CCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCC-- Q ss_conf 7520530557997899999850369804764687766--666884899728972178889899999999999995318-- Q gi|254780229|r 1435 MRRLQWHYRSLNENLIACSNYYFYDNSLIVFPSPYTS--VDRYGVGFTHVKNGVVVDQGNPEEARVIALAVKDHALRY-- 1510 (1775) Q Consensus 1435 ~~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~psp~~~--~~~~~v~~~~v~~g~~~~~~N~~EA~~vv~~v~~~~~~~-- 1510 (1775) ...|..-|||.++ |+.+.|..|-.+.-......... .....+.+ +...+...||+.|+..|.+..... T Consensus 275 ~i~L~~NyRS~~~-Il~~aN~l~~~~~~~~~~~~l~~~~~~~~~v~~-------~~~~~~~~Ea~~ia~~I~~l~~~~~~ 346 (494) T pfam00580 275 VITLEQNYRSTPE-ILNAANKLISNNTARKEKRLLWSQKGSGEKVTV-------YEAETEQEEAEFIADEIKALHKSGAA 346 (494) T ss_pred EEECCCCCCCCHH-HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEE-------EECCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 7258899898878-999999999726022445435455788996079-------83799999999999999999877999 Q ss_pred -CCCEEEEEECCHHHHHHHHHHHHHH Q ss_conf -5771899819989999999999999 Q gi|254780229|r 1511 -PEESLGVIAMNAKQRDLIESTINKL 1535 (1775) Q Consensus 1511 -~~~siGVvt~~~~Q~~~I~~~l~~~ 1535 (1775) +-..|.|.+=+..|...|++.|.+. T Consensus 347 ~~~~DiAIL~R~~~~~~~l~~~L~~~ 372 (494) T pfam00580 347 IPFGDIAVLVRSNSQSRAIEEALLKA 372 (494) T ss_pred CCCCCEEEEEECCHHHHHHHHHHHHC T ss_conf 88565689982523189999999977 No 36 >pfam00580 UvrD-helicase UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Bacillus subtilis addA and Escherichia coli exodeoxyribonuclease V beta have large insertions near to the carboxy-terminus relative to other members of the family. Probab=97.86 E-value=5.5e-05 Score=66.90 Aligned_cols=63 Identities=30% Similarity=0.409 Sum_probs=55.2 Q ss_pred CHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC----CEEEEEECCHHHHHHHHHHHHHC Q ss_conf 346899999851489559942878770389999999999679----82899601058999999864435 Q gi|254780229|r 406 DSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG----KKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 406 DsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G----k~VLfVaeK~aALdVV~~RL~~~ 470 (1775) +..|..||.. .+.+++|.+.||||||.||+.-|++.+..+ .++|.|+.+++|..-++.|+... T Consensus 2 n~~Q~~av~~--~~~~llV~AgAGSGKT~~L~~Ri~~li~~~~~~p~~IL~lTFT~kAA~Em~~Ri~~~ 68 (494) T pfam00580 2 NPEQRKAVTH--LGGPLLVLAGAGTGKTRVLTERIAYLILEGGIDPEEILAVTFTNKAAREMKERILKL 68 (494) T ss_pred CHHHHHHHCC--CCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCCHHHEEEEECHHHHHHHHHHHHHHH T ss_conf 9889999809--999979997187068999999999999818999747876702899999999999987 No 37 >PRK12377 putative replication protein; Provisional Probab=97.77 E-value=7.3e-05 Score=65.63 Aligned_cols=39 Identities=23% Similarity=0.497 Sum_probs=32.0 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 514895599428787703899999999996798289960 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) ...+.|||+-|||||||||.-+.|--.++.+|++|+|+. T Consensus 98 ~~~~~NlIf~G~pGtGKTHLA~AIg~~a~~~G~sVlF~t 136 (248) T PRK12377 98 MTGCTNFVFSGKPGTGKNHLAAAIGNRLLAKGRSVIVVT 136 (248) T ss_pred HCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 318860899899998788999999999998799699988 No 38 >PRK10919 ATP-dependent DNA helicase Rep; Provisional Probab=97.71 E-value=0.00014 Score=62.80 Aligned_cols=63 Identities=27% Similarity=0.390 Sum_probs=53.2 Q ss_pred CHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-C---CEEEEEECCHHHHHHHHHHHHHC Q ss_conf 34689999985148955994287877038999999999967-9---82899601058999999864435 Q gi|254780229|r 406 DSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-G---KKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 406 DsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-G---k~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ++.|..||.. ..|.-+|+ ..||||||.||++-|++++.. | .+||.++.++.|..-++.||... T Consensus 4 n~~Q~~AV~~-~~gp~lVl-AGaGSGKT~tLt~Ria~Li~~~~v~P~~ILalTFT~kAA~Emk~Rl~~~ 70 (672) T PRK10919 4 NPGQQQAVEF-VTGPCLVL-AGAGSGKTRVITNKIAHLIRGCGYQARHIAAVTFTNKAAREMKERVGQT 70 (672) T ss_pred CHHHHHHHCC-CCCCEEEE-ECCCHHHHHHHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHHHHHHH T ss_conf 9899999839-99998999-7385587999999999998668989333854421799999999999977 No 39 >PRK11054 helD DNA helicase IV; Provisional Probab=97.63 E-value=0.00024 Score=60.17 Aligned_cols=64 Identities=25% Similarity=0.370 Sum_probs=55.6 Q ss_pred CCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCC----EEEEEECCHHHHHHHHHHHHH Q ss_conf 773468999998514895599428787703899999999996798----289960105899999986443 Q gi|254780229|r 404 DSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGK----KILFCAQKMAAIEVVRHRLEK 469 (1775) Q Consensus 404 ~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk----~VLfVaeK~aALdVV~~RL~~ 469 (1775) |-+.+|..||+. -.+.++||-|| |||||.||+.=|++++..|. +||.++..+.|..-++.||.. T Consensus 196 PLn~~Qr~Avi~-~ed~~LVLAGA-GSGKT~vLt~RiayLI~~g~~~P~~ILaLTFT~kAA~EMreRl~~ 263 (684) T PRK11054 196 PLNPSQARAVVN-GEDSLLVLAGA-GSGKTSVLVARAGWLLARGQAQPEQILLLAFGRKAAEEMDERIRE 263 (684) T ss_pred CCCHHHHHHEEE-CCCCEEEEECC-CCCCHHHHHHHHHHHHHCCCCCHHHEEEEECCHHHHHHHHHHHHH T ss_conf 999899957274-79964898338-997077999999999975999866778686349999999999997 No 40 >pfam04851 ResIII Type III restriction enzyme, res subunit. Probab=97.60 E-value=0.00037 Score=58.32 Aligned_cols=61 Identities=25% Similarity=0.333 Sum_probs=53.4 Q ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHH Q ss_conf 4689999985148955994287877038999999999967982899601058999999864 Q gi|254780229|r 407 SSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRL 467 (1775) Q Consensus 407 sSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL 467 (1775) +=|..||.++...++.+|..|+|||||-+.+.+|......|++|||++-...=++-..+++ T Consensus 6 ~yQ~~a~~~~~~~~~~~i~~pTGsGKT~~~~~~i~~~~~~~~~~lvlvp~~~L~~Q~~~~~ 66 (103) T pfam04851 6 PYQIEAIRNLLEKKRGLIVMATGSGKTLTAAKLIARLLKGKKKVLFLVPRKDLLEQALEEF 66 (103) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHH T ss_conf 9999999999963986999589998799999999999846992999908299999999965 No 41 >pfam09848 DUF2075 Uncharacterized conserved protein (DUF2075). This domain, found in various prokaryotic proteins (including putative ATP/GTP binding proteins), has no known function. Probab=97.59 E-value=0.0012 Score=53.10 Aligned_cols=262 Identities=17% Similarity=0.180 Sum_probs=129.0 Q ss_pred CCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCEEEECHHHHHHHCCCCCCCCCEEEECCHHHCC---------- Q ss_conf 3213899998752026788668999998789877406789925779987258522541078860632369---------- Q gi|254780229|r 1305 SDYTELSLIRSELGKKNRNIPIRQLISRAKNSLLQLKPCFMMSPMSVANYLEPKDVKFDLVIMDESSQIK---------- 1374 (1775) Q Consensus 1305 ~~~~e~~~L~~e~~k~~~~~~~R~l~~~a~~~~~~l~P~~~msp~sva~~l~~~~~~FD~VI~DEASQ~~---------- 1374 (1775) .+..++..|++.+.+..+.. . ++ -.+.|-+.-+++......+|+||||||=-+. T Consensus 39 ~N~~~~~~l~~~l~~~~~~~------------~---~~-~~~~~~~fi~~~~~~~~~~dvvivDEAhRl~~k~~~~~~~~ 102 (348) T pfam09848 39 GNHPLVLVLYEALAGDLKVR------------K---KK-LFRKPTSFINNLHKAPPHEDVVIVDEAHRLWTKSDLYFNFS 102 (348) T ss_pred CCHHHHHHHHHHHHHHCCHH------------H---CC-CCCCCHHHHCCCCCCCCCCCEEEEEHHHHHHHCCCCCCCCC T ss_conf 78669999999986041200------------1---02-00072523165235798677899831786654336556778 Q ss_pred -HHHHHHHHHHCCEEEEEECCCCC-CCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHH---CCCCEECCCCCC-CCHH Q ss_conf -89988988611828996077866-86201100013444432011014400079999975---587520530557-9978 Q gi|254780229|r 1375 -PEDALGVIARGKQIVVVGDPKQL-PPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPL---FSMRRLQWHYRS-LNEN 1448 (1775) Q Consensus 1375 -~~~al~al~rakqvvvvGD~kQl-pPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~---~p~~~L~~HyRs-~~e~ 1448 (1775) +....-.|-|+|-+|++=|+.|. -|.... ..+.|...+... +....|.-.||| ..+. T Consensus 103 ~~~ql~~i~~~a~v~V~~~D~~Q~i~~~e~g-----------------~~~~l~~~~~~~~~~~~~~~L~~qfR~~~s~~ 165 (348) T pfam09848 103 GPNQLDEIMKRAKVVVFFIDEGQEINTGEIG-----------------TIEELKKIAAKWPAEIYELHLSSQFRSGGSDD 165 (348) T ss_pred CHHHHHHHHHHCCEEEEEECCCCEECCCCCC-----------------CHHHHHHHHHHCCCEEEEEECCCCCCCCCCHH T ss_conf 5799999997528599998798488111376-----------------89999999986897047986550104578679 Q ss_pred HHHHHHHHCCCCEEEECCCCCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEE-CCHHHH-H Q ss_conf 99999850369804764687766666884899728972178889899999999999995318577189981-998999-9 Q gi|254780229|r 1449 LIACSNYYFYDNSLIVFPSPYTSVDRYGVGFTHVKNGVVVDQGNPEEARVIALAVKDHALRYPEESLGVIA-MNAKQR-D 1526 (1775) Q Consensus 1449 lI~fSN~~fY~~~L~~~psp~~~~~~~~v~~~~v~~g~~~~~~N~~EA~~vv~~v~~~~~~~~~~siGVvt-~~~~Q~-~ 1526 (1775) .+.|-+...+.+.....|.. ...+.+. -. -++.|+..-+. +... .-++ -|+|+ +.-.=. . T Consensus 166 ~~~wv~~lL~~~~~~~~~~~---~~~yd~~--if--------~~~~~l~~~i~---~k~~-~~~~-~rlvA~~~~~~~~~ 227 (348) T pfam09848 166 YLAWVDALLDNRDIAAKPLS---DKDYDFK--VF--------DDPEEMRAAIK---EKNR-GGGR-ARLVAGYDWPWLSK 227 (348) T ss_pred HHHHHHHHHCCCCCCCCCCC---CCCCCEE--EE--------CCHHHHHHHHH---HHHC-CCCC-EEEEEECCCCCCCC T ss_conf 99999998547766676545---7898689--98--------99999999999---9626-6885-69999756500266 Q ss_pred -HHHHHHHHHHH-CCHHHHHHHH----HHCCCCCCE-EECCHHHCCCCCCCEEEEEEE----EECCCCCCCC------CC Q ss_conf -99999999931-2976899976----411478985-854256808666788999966----5078888762------00 Q gi|254780229|r 1527 -LIESTINKLCR-KDSEVEKAIS----KLRMHADPF-FIKNLENVQGDERDRIFISFT----YGPHEPGGRV------FQ 1589 (1775) Q Consensus 1527 -~I~~~l~~~~~-~~~~~~~~~~----~~~~~~e~~-fvknle~vQGdERDvI~~S~~----yg~~~~~g~~------~~ 1589 (1775) ...+..+-... .+=..-++.. .|+....++ -|...-++||-|=|-+.+-+| |-+. ++.. +. T Consensus 228 ~~~~~~~d~~v~~g~~~~~Wn~~~~~~~w~~~~~~l~evgsi~t~QG~Eldy~GVi~G~Dl~~~~~--~~~~~~~~~~~~ 305 (348) T pfam09848 228 EKDPSAKDIYIGEGDFKLPWNLDNKPLPWAESPDTIEEVGSIYTCQGLELDYVGVIWGPDLVYRPG--TDRIVFDGTKWE 305 (348) T ss_pred CCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHEEEEEEECCCCCCEEEEEECCCEEEECC--CEEEEECCCCCC T ss_conf 678875776970697355677899987510066432110577852032232159998785478779--606996753103 Q ss_pred CCCCCCCCC------------HHHHHHHHHHHHCCEEEEEEC Q ss_conf 114777656------------246899999832142698870 Q gi|254780229|r 1590 RFGPINSDI------------GWRRLNVLFTRSRRRIEVFST 1619 (1775) Q Consensus 1590 ~fGpln~~g------------g~rRlNVa~sRAr~~~~v~~s 1619 (1775) +...-.... =.+.+||++||+++-|+||+. T Consensus 306 d~~~~~~~~~~~~~~~~~~~~i~N~YrVLLTRG~~G~yiy~~ 347 (348) T pfam09848 306 DKSAFQKRKDLKEDPEDADRLIKNTYRVLLTRGRKGLYIYAP 347 (348) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 710013455444587899999999999988643764899954 No 42 >smart00487 DEXDc DEAD-like helicases superfamily. Probab=97.58 E-value=0.00071 Score=55.31 Aligned_cols=77 Identities=21% Similarity=0.305 Sum_probs=62.1 Q ss_pred CHHHHHHHHHHHCC-CCEEEECCCCCCCHHHHHHHHHHHHHCC--CEEEEEECCHHHHHHHHHHHHHCCCC---CEEEEC Q ss_conf 34689999985148-9559942878770389999999999679--82899601058999999864435844---136540 Q gi|254780229|r 406 DSSQHSAVIDVING-KNLIIEGPPGTGKSQTITNIIAAAMLHG--KKILFCAQKMAAIEVVRHRLEKAGLG---EFCLEL 479 (1775) Q Consensus 406 DsSQ~~av~~~~~g-~~~vi~GPPGTGKSQTIaNlIa~~la~G--k~VLfVaeK~aALdVV~~RL~~~gLg---~~cleL 479 (1775) .+-|..++..+..+ ++++|.+|||||||-+...++...+..+ ++|||++-..+...-+.+++...+-. ..|.-+ T Consensus 10 ~~~Q~~~~~~~~~~~~~~~i~~~tGsGKT~~~~~~~~~~~~~~~~~~~li~~P~~~l~~q~~~~~~~~~~~~~~~~~~~~ 89 (201) T smart00487 10 RPYQKEAIEALLSGLRDVILAAPTGSGKTLAALLPALEALKRGKGKRVLVLVPTRELAEQWAEELKKLGPSLGLKVVGLY 89 (201) T ss_pred CHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEEEEEE T ss_conf 98899999999838998899899996099999999999863389975999908599999999886010210204455652 Q ss_pred CCC Q ss_conf 266 Q gi|254780229|r 480 HSH 482 (1775) Q Consensus 480 Hs~ 482 (1775) ++. T Consensus 90 ~~~ 92 (201) T smart00487 90 GGD 92 (201) T ss_pred CCC T ss_conf 477 No 43 >PRK10919 ATP-dependent DNA helicase Rep; Provisional Probab=97.57 E-value=0.0042 Score=47.26 Aligned_cols=60 Identities=20% Similarity=0.199 Sum_probs=23.9 Q ss_pred CEEECCHHHCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCC--CHHHHH-HHHHHHHCCEEEEEECC Q ss_conf 8585425680866678899996650788887620011477765--624689-99998321426988703 Q gi|254780229|r 1555 PFFIKNLENVQGDERDRIFISFTYGPHEPGGRVFQRFGPINSD--IGWRRL-NVLFTRSRRRIEVFSTM 1620 (1775) Q Consensus 1555 ~~fvknle~vQGdERDvI~~S~~yg~~~~~g~~~~~fGpln~~--gg~rRl-NVa~sRAr~~~~v~~s~ 1620 (1775) .+-+-+++.--|-|-|+||+-=+ ..|.+++. +.++.. .-|||| =|||||||+++++-++- T Consensus 551 ~V~LmTiH~SKGLEf~~Vfl~gl-----~eg~~P~~-~~~~~~~~eEERRLfYVa~TRAk~~L~ls~~~ 613 (672) T PRK10919 551 QVQLMTLHASKGLEFPYVYMVGM-----EEGFLPHQ-SSIDEDNIDEERRLAYVGITRAQKELTFTLCK 613 (672) T ss_pred EEEEEEHHHHHCCCCCEEEEECC-----CCCCCCCC-CCCCCCHHHHHHHHHHHHHHHHHHHHEEEECC T ss_conf 08996488623103687999678-----38999876-56998608799999999999987766550045 No 44 >PRK08181 transposase; Validated Probab=97.57 E-value=0.00015 Score=62.46 Aligned_cols=46 Identities=37% Similarity=0.604 Sum_probs=34.9 Q ss_pred HHHHHH----HHHHCCCCEEEECCCCCCCHHHHHHHHH-HHHHCCCEEEEEE Q ss_conf 689999----9851489559942878770389999999-9996798289960 Q gi|254780229|r 408 SQHSAV----IDVINGKNLIIEGPPGTGKSQTITNIIA-AAMLHGKKILFCA 454 (1775) Q Consensus 408 SQ~~av----~~~~~g~~~vi~GPPGTGKSQTIaNlIa-~~la~Gk~VLfVa 454 (1775) .|...+ .=+.+|.|+++.||||||||+ ++..|+ .|+-+|.+|+|+. T Consensus 91 ~~i~~L~~~~~fi~~~~Nvil~Gp~GtGKTh-LA~Alg~~A~~~G~~V~f~~ 141 (269) T PRK08181 91 AQVMALAAGDSWLAKGANLLLFGPPGGGKSH-LAAAIGLALIENGWRVLFTR 141 (269) T ss_pred HHHHHHHCCCHHHHCCCEEEEECCCCCCHHH-HHHHHHHHHHHCCCEEEEEE T ss_conf 9999996567588648708998999987889-99999999998799399978 No 45 >PRK07952 DNA replication protein DnaC; Validated Probab=97.56 E-value=0.00028 Score=59.60 Aligned_cols=48 Identities=31% Similarity=0.484 Sum_probs=34.5 Q ss_pred HHHHHHHHHHH--------CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 46899999851--------4895599428787703899999999996798289960 Q gi|254780229|r 407 SSQHSAVIDVI--------NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 407 sSQ~~av~~~~--------~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) .+|..|...|+ ...+|+..||||||||+.-+-|.-+++.+|++|+|+. T Consensus 76 ~~q~~al~~a~~y~enf~~~~~gLlF~G~~GTGKThLA~aIan~Li~~G~sVlf~t 131 (242) T PRK07952 76 EGQMNALSKARQYVEEFDGNIASFIFSGKPGTGKNHLAAAICNELLLRGKSVLIIT 131 (242) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 77899999999999865438871799789999789999999999998799499977 No 46 >PRK09401 reverse gyrase; Reviewed Probab=97.56 E-value=0.0016 Score=51.80 Aligned_cols=87 Identities=28% Similarity=0.334 Sum_probs=68.9 Q ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHHHHHHHHHHHC----CCCCEEEECCC Q ss_conf 468999998514895599428787703899999999996-7982899601058999999864435----84413654026 Q gi|254780229|r 407 SSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAIEVVRHRLEKA----GLGEFCLELHS 481 (1775) Q Consensus 407 sSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aALdVV~~RL~~~----gLg~~cleLHs 481 (1775) |-|..=.-.+..|+||+|.-|||.||| |-.=+.+-.+| .||+++++--+..-..-|++||... |+..-.+-.|| T Consensus 81 ~~Qr~WakR~~~g~SFaiiAPTG~GKT-tfgl~~sly~a~kgkks~~i~PT~~Lv~Q~~~kl~~~~~~~~~~~~~~~y~~ 159 (1176) T PRK09401 81 SLQRTWAKRLLLGESFAIIAPTGVGKT-TFGLVMALYLAKKGKKSYIIFPTRLLVEQVVEKLRKLAEKVGVKVRLLYYHS 159 (1176) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 889999999866897489888998888-9999999999865983999968889999999999999997099840899856 Q ss_pred CC--HHHHHHHHHHH Q ss_conf 61--10899999999 Q gi|254780229|r 482 HK--VHKRAILDDLR 494 (1775) Q Consensus 482 ~k--a~k~~vl~~L~ 494 (1775) .. ..|+++++.++ T Consensus 160 ~~~~~~kee~~~~~~ 174 (1176) T PRK09401 160 SLKKKEKEEFLERLE 174 (1176) T ss_pred CCCHHHHHHHHHHHH T ss_conf 776667899998865 No 47 >PRK06526 transposase; Provisional Probab=97.55 E-value=0.00015 Score=62.41 Aligned_cols=47 Identities=28% Similarity=0.478 Sum_probs=37.9 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 68999998514895599428787703899999999996798289960 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) .+++...=+..|.|+++.||||||||+.-.-+.-.|+-.|.+|+|+. T Consensus 87 ~~La~~~fi~~~~Nvil~G~~GtGKThLA~Alg~~A~~~G~~v~f~~ 133 (254) T PRK06526 87 AHLGTLDFVTGKENVVFLGPPGTGKTHLAIGLGIRACQAGHRVLFAT 133 (254) T ss_pred HHHHCCCHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 99863717765887899899998689999999999998699679987 No 48 >PRK08116 hypothetical protein; Validated Probab=97.55 E-value=0.00032 Score=59.01 Aligned_cols=38 Identities=26% Similarity=0.509 Sum_probs=30.3 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEEC Q ss_conf 1489559942878770389999999999-67982899601 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCAQ 455 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVae 455 (1775) ..|.++++.||||||||+ ++..||+.| .+|++|+|+.- T Consensus 106 ~~~~GLll~G~~GtGKTh-La~aIa~~l~~~g~~V~~~~~ 144 (262) T PRK08116 106 KESVGLLLWGSPGNGKTY-LAAAIANELIEKGVPVVFVNV 144 (262) T ss_pred CCCCEEEEECCCCCCHHH-HHHHHHHHHHHCCCEEEEEEH T ss_conf 468618998989998999-999999999987993999889 No 49 >COG1110 Reverse gyrase [DNA replication, recombination, and repair] Probab=97.54 E-value=0.0013 Score=52.52 Aligned_cols=87 Identities=24% Similarity=0.396 Sum_probs=66.6 Q ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHHHHHHHHHHHCC-----CCCEEEECC Q ss_conf 468999998514895599428787703899999999996-79828996010589999998644358-----441365402 Q gi|254780229|r 407 SSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAIEVVRHRLEKAG-----LGEFCLELH 480 (1775) Q Consensus 407 sSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aALdVV~~RL~~~g-----Lg~~cleLH 480 (1775) |-|..=.-....|+||.|.-|||+||| |-.-+++..+| .||+|+++--+..-..-|++||...+ +.-..+ .| T Consensus 85 s~QR~WakR~~rg~SFaiiAPTGvGKT-Tfg~~~sl~~a~kgkr~yii~PT~~Lv~Q~~~kl~~~~e~~~~~~~~~~-yh 162 (1187) T COG1110 85 SAQRVWAKRLVRGKSFAIIAPTGVGKT-TFGLLMSLYLAKKGKRVYIIVPTTTLVRQVYERLKKFAEDAGSLDVLVV-YH 162 (1187) T ss_pred HHHHHHHHHHHCCCCEEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCCCEEEE-EC T ss_conf 788999999873784489827887654-7999999998755874999966789999999999998865378524665-31 Q ss_pred CC--CHHHHHHHHHHHH Q ss_conf 66--1108999999999 Q gi|254780229|r 481 SH--KVHKRAILDDLRK 495 (1775) Q Consensus 481 s~--ka~k~~vl~~L~~ 495 (1775) +. +..|++.++++.. T Consensus 163 ~~l~~~ekee~le~i~~ 179 (1187) T COG1110 163 SALPTKEKEEALERIES 179 (1187) T ss_pred CCCCHHHHHHHHHHHHC T ss_conf 23665779999999865 No 50 >PRK09183 transposase/IS protein; Provisional Probab=97.52 E-value=0.00018 Score=61.45 Aligned_cols=45 Identities=22% Similarity=0.420 Sum_probs=35.7 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHH-HHHCCCEEEEEE Q ss_conf 8999998514895599428787703899999999-996798289960 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAA-AMLHGKKILFCA 454 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~-~la~Gk~VLfVa 454 (1775) +++...=+..+.|+++.|||||||| -++..|++ ++-.|.+|+|+. T Consensus 91 ~La~~~fi~~~~Nvil~G~~GtGKT-hLA~Alg~~A~~~G~~v~f~~ 136 (258) T PRK09183 91 SLRSLSFIERNENIVLLGPSGVGKT-HLAIALGYEAVRAGIKVRFTT 136 (258) T ss_pred HHHCCCHHHCCCCEEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEE T ss_conf 8825816655886799899998689-999999999998799399978 No 51 >pfam01695 IstB IstB-like ATP binding protein. This protein contains an ATP/GTP binding P-loop motif. It is found associated with IS21 family insertion sequences. The function of this protein is unknown, but it may perform a transposase function. Probab=97.52 E-value=0.00014 Score=62.78 Aligned_cols=47 Identities=23% Similarity=0.472 Sum_probs=37.0 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEEC Q ss_conf 689999985148955994287877038999999999-967982899601 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQ 455 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVae 455 (1775) .|++...=+..|.|+++.||||||||+ ++..|++. +-.|++|+|+.- T Consensus 36 ~~L~~~~~i~~~~Nlll~G~~GtGKTh-LA~Ai~~~~~~~g~~v~f~~~ 83 (178) T pfam01695 36 AELAGLDWIEQAENLLLLGPPGVGKTH-LACALGHQACRAGYSVLFTRT 83 (178) T ss_pred HHHHCCCCHHCCCCEEEECCCCCCHHH-HHHHHHHHHHHCCCEEEEEEC T ss_conf 988559742158768998999987899-999999999986985999961 No 52 >PRK13341 recombination factor protein RarA/unknown domain fusion protein; Reviewed Probab=97.51 E-value=0.00061 Score=56.05 Aligned_cols=37 Identities=35% Similarity=0.485 Sum_probs=29.0 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 68999998514895599428787703899999999996 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +=+.-++....=.|+++-|||||||| ||+++||...- T Consensus 41 ~~Lrr~i~~~~~~S~Il~GPPGtGKT-TLA~iIA~~t~ 77 (726) T PRK13341 41 RLLRRAIKADRVGSLILYGPPGVGKT-TLARIIANHTR 77 (726) T ss_pred CHHHHHHHCCCCCEEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 28999997699982788897999999-99999988748 No 53 >PRK13889 conjugal transfer relaxase TraA; Provisional Probab=97.51 E-value=0.00093 Score=54.13 Aligned_cols=149 Identities=14% Similarity=0.175 Sum_probs=71.3 Q ss_pred CCCEEEECCHHHCCHHHHHHHHH----HCCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHHCCCC Q ss_conf 41078860632369899889886----11828996077866862011000134444320110144000799999755875 Q gi|254780229|r 1361 KFDLVIMDESSQIKPEDALGVIA----RGKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPLFSMR 1436 (1775) Q Consensus 1361 ~FD~VI~DEASQ~~~~~al~al~----rakqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~~p~~ 1436 (1775) .=||+|||||+-+..-.---.+- .|-+||+||||+||+|..- -..|......+... T Consensus 433 ~~dVlVVDEAGMVgSRqMarll~~Ae~AGAKVVLVGD~~QLq~IeA--------------------GaaFral~er~G~a 492 (992) T PRK13889 433 ARDVLVIDEAGMVGTRQLERVLSHAADAGAKVVLVGDPQQLQAIEA--------------------GAAFRSIHERHGGA 492 (992) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEECCHHHCCCCCC--------------------CHHHHHHHHHCCCE T ss_conf 9858999676557749999999999984998999488777188646--------------------88999999851967 Q ss_pred EECCCCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCEEEEECCCCCCCCCCCHHHHH-HHHHHHHHHHHCCCCCEE Q ss_conf 2053055799789999985036980476468776666688489972897217888989999-999999999531857718 Q gi|254780229|r 1437 RLQWHYRSLNENLIACSNYYFYDNSLIVFPSPYTSVDRYGVGFTHVKNGVVVDQGNPEEAR-VIALAVKDHALRYPEESL 1515 (1775) Q Consensus 1437 ~L~~HyRs~~e~lI~fSN~~fY~~~L~~~psp~~~~~~~~v~~~~v~~g~~~~~~N~~EA~-~vv~~v~~~~~~~~~~si 1515 (1775) .|..-+|-+++..-.-|- .|=.|+. ..+|.++ -..|......+..++. .++.....--...|++|= T Consensus 493 eLteIrRQr~~w~ReAS~-~lA~G~~-----------~~AL~~Y-~~~G~V~~~~treea~~~lv~~W~~Dr~~~p~~s~ 559 (992) T PRK13889 493 EIGEVRRQREDWQRDATR-DLATGRT-----------GEALDAY-DAHGMVHAAETREQAREDLIDRWDRDRQASPDRSR 559 (992) T ss_pred EEEHEECCCCHHHHHHHH-HHHCCCH-----------HHHHHHH-HHCCCEECCCCHHHHHHHHHHHHHHHHHCCCCCCE T ss_conf 510222148799999999-9875888-----------8999999-97799745777999999999999988751887635 Q ss_pred EEEECCHHHHHHHHHHHHHHHHCCHHH Q ss_conf 998199899999999999993129768 Q gi|254780229|r 1516 GVIAMNAKQRDLIESTINKLCRKDSEV 1542 (1775) Q Consensus 1516 GVvt~~~~Q~~~I~~~l~~~~~~~~~~ 1542 (1775) =|+|...+....|.+.+...+.....+ T Consensus 560 iiLA~tN~dVraLN~~aR~~l~~~G~L 586 (992) T PRK13889 560 IILTHTNDEVRALNEAARERMRAAGDL 586 (992) T ss_pred EEEECCHHHHHHHHHHHHHHHHHCCCC T ss_conf 999464589999999999999975987 No 54 >pfam00270 DEAD DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression. Probab=97.51 E-value=0.00069 Score=55.45 Aligned_cols=63 Identities=24% Similarity=0.424 Sum_probs=48.0 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHH-HHHHHHHH--CCCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 689999985148955994287877038999-99999996--7982899601058999999864435 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTIT-NIIAAAML--HGKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIa-NlIa~~la--~Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) =|..|+..+.+|+|++|.+|+|||||-+.. -+|..... .|++++|++-..+-..-+++++... T Consensus 3 ~Q~~~i~~~~~g~~~iv~~pTGsGKT~~~~~~~l~~~~~~~~~~~~v~l~Pt~aL~~q~~~~~~~~ 68 (167) T pfam00270 3 IQAEAIPAILEGKDVLVQAPTGSGKTLAFLLPALQALLKNPDGPQALVLAPTRELAEQIYEELKKL 68 (167) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHH T ss_conf 599999999769978998899975899999999999874778987999906088888999886432 No 55 >PRK10876 recB exonuclease V subunit beta; Provisional Probab=97.50 E-value=0.0053 Score=46.25 Aligned_cols=51 Identities=33% Similarity=0.481 Sum_probs=43.1 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHC---------C---CEEEEEECCHHHHHHHHHHHHH Q ss_conf 48955994287877038999999999967---------9---8289960105899999986443 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLH---------G---KKILFCAQKMAAIEVVRHRLEK 469 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~---------G---k~VLfVaeK~aALdVV~~RL~~ 469 (1775) .|.++ |+-.=|||||.||++++-..|.. + .++|+|..+.||-.-++.|+.. T Consensus 16 ~G~~L-VeASAGTGKT~~La~l~lRLll~~~~~~~~~~pl~id~ILvVTFT~aAA~EMk~RIr~ 78 (1181) T PRK10876 16 QGERL-IEASAGTGKTFTIAALYLRLLLGLGGSAAFPRPLTVEELLVVTFTEAATEELRGRIRS 78 (1181) T ss_pred CCCEE-EEECCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECHHHHHHHHHHHHHHH T ss_conf 99979-8871536678999999999986244554567999828198965678999999999999 No 56 >KOG0994 consensus Probab=97.49 E-value=0.019 Score=40.46 Aligned_cols=15 Identities=7% Similarity=-0.028 Sum_probs=5.2 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999999 Q gi|254780229|r 508 EIDVEITRYEELKGQ 522 (1775) Q Consensus 508 e~e~~~~~L~~~r~~ 522 (1775) .|+..+..|...-.. T Consensus 1168 ~WD~il~~L~~rt~r 1182 (1758) T KOG0994 1168 TWDAILQELALRTHR 1182 (1758) T ss_pred HHHHHHHHHHHHHHH T ss_conf 899999999999999 No 57 >COG1484 DnaC DNA replication protein [DNA replication, recombination, and repair] Probab=97.47 E-value=0.00021 Score=60.79 Aligned_cols=38 Identities=37% Similarity=0.765 Sum_probs=31.6 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECC Q ss_conf 48955994287877038999999999967-9828996010 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQK 456 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK 456 (1775) +|+|+++-|||||||| -++-+|++.+.. |.+|+|+.-. T Consensus 104 ~~~nl~l~G~~G~GKt-hLa~Ai~~~l~~~g~sv~f~~~~ 142 (254) T COG1484 104 RGENLVLLGPPGVGKT-HLAIAIGNELLKAGISVLFITAP 142 (254) T ss_pred CCCCEEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEEHH T ss_conf 5882899899998799-99999999999839849998859 No 58 >PRK13342 recombination factor protein RarA; Reviewed Probab=97.46 E-value=0.00063 Score=55.88 Aligned_cols=34 Identities=41% Similarity=0.633 Sum_probs=27.3 Q ss_pred HHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 99999851489559942878770389999999999 Q gi|254780229|r 410 HSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAM 444 (1775) Q Consensus 410 ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~l 444 (1775) +.-+.....=.|+++-|||||||| |++++||..+ T Consensus 28 L~~~i~~~~~~s~Il~GPPG~GKT-TlA~iiA~~~ 61 (417) T PRK13342 28 LRRMIEAGRLSSMILWGPPGTGKT-TLARIIAGAT 61 (417) T ss_pred HHHHHHCCCCCEEEEECCCCCCHH-HHHHHHHHHH T ss_conf 999997699975998896999899-9999999986 No 59 >COG0210 UvrD Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair] Probab=97.46 E-value=0.00069 Score=55.45 Aligned_cols=66 Identities=29% Similarity=0.349 Sum_probs=54.6 Q ss_pred CCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC----CEEEEEECCHHHHHHHHHHHHHCCC Q ss_conf 7346899999851489559942878770389999999999679----8289960105899999986443584 Q gi|254780229|r 405 SDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG----KKILFCAQKMAAIEVVRHRLEKAGL 472 (1775) Q Consensus 405 aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G----k~VLfVaeK~aALdVV~~RL~~~gL 472 (1775) ..+.|..||... .|..+|+ ..||||||.||++-|++++..| ..||.|..++.|..-++.|+..... T Consensus 3 Ln~~Q~~av~~~-~gp~lV~-AGaGsGKT~vlt~Ria~li~~~~v~p~~Il~vTFTnkAA~em~~Rl~~~~~ 72 (655) T COG0210 3 LNPEQREAVLHP-DGPLLVL-AGAGSGKTRVLTERIAYLIAAGGVDPEQILAITFTNKAAAEMRERLLKLLG 72 (655) T ss_pred CCHHHHHHHHCC-CCCEEEE-ECCCCCCHHHHHHHHHHHHHCCCCCHHHEEEEEEEHHHHHHHHHHHHHHHC T ss_conf 998999986188-9986999-579986189999999999873895757717789676999999999998738 No 60 >CHL00181 cbbX CbbX; Provisional Probab=97.45 E-value=0.0011 Score=53.25 Aligned_cols=28 Identities=32% Similarity=0.596 Sum_probs=25.0 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 48955994287877038999999999967 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) .+.++|..|||||||| |+|-++|..++. T Consensus 58 ~s~h~vF~GnPGTGKT-TVARl~a~il~~ 85 (287) T CHL00181 58 PGLHMSFTGSPGTGKT-TVALKMADILYR 85 (287) T ss_pred CCCEEEEECCCCCCHH-HHHHHHHHHHHH T ss_conf 7653888789986799-999999999998 No 61 >PRK05580 primosome assembly protein PriA; Validated Probab=97.43 E-value=0.0021 Score=50.40 Aligned_cols=82 Identities=22% Similarity=0.316 Sum_probs=65.4 Q ss_pred CCHHHHHHHHHHHC---CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 73468999998514---895599428787703899999999996798289960105899999986443584413654026 Q gi|254780229|r 405 SDSSQHSAVIDVIN---GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHS 481 (1775) Q Consensus 405 aDsSQ~~av~~~~~---g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs 481 (1775) -...|..|+..... .+.++++|.||+|||..-..+|..+|..||.|||+-=-.+-..-+..||... +|.-...+|| T Consensus 169 L~~eQ~~a~~~i~~~~~~~~~LL~GvTGSGKTevYl~li~~~l~~GkqvLiLvPEI~lt~q~~~rl~~~-fg~~v~v~HS 247 (699) T PRK05580 169 LNEEQAAALAAIRAAGGFSAFLLDGVTGSGKTEVYLQAIAEALAQGKQALVLVPEIALTPQLLARFRAR-FGARVAVLHS 247 (699) T ss_pred CCHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHH-HCCCEEEECC T ss_conf 899999999999855888717874789860799999999999973997899917678789999999987-0995799648 Q ss_pred CCHHHH Q ss_conf 611089 Q gi|254780229|r 482 HKVHKR 487 (1775) Q Consensus 482 ~ka~k~ 487 (1775) .-+.+. T Consensus 248 ~ls~~e 253 (699) T PRK05580 248 GLSDGE 253 (699) T ss_pred CCCHHH T ss_conf 898579 No 62 >COG3973 Superfamily I DNA and RNA helicases [General function prediction only] Probab=97.41 E-value=0.0094 Score=43.66 Aligned_cols=214 Identities=18% Similarity=0.192 Sum_probs=124.7 Q ss_pred CCCCEEEECCHHHCCHH--HHHHHHHHCCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHH--CCC Q ss_conf 54107886063236989--98898861182899607786686201100013444432011014400079999975--587 Q gi|254780229|r 1360 VKFDLVIMDESSQIKPE--DALGVIARGKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPL--FSM 1435 (1775) Q Consensus 1360 ~~FD~VI~DEASQ~~~~--~al~al~rakqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~--~p~ 1435 (1775) ..|--+|||||--..+. ..+-.++-+-+.-||||-.|.== . ..+..+-.+.-|.|... |.- T Consensus 527 ~~~kh~vIDeaqdys~~q~~~~r~l~~~as~tivgd~gq~i~----~-----------~~~e~~~~e~~~~~fed~~~e~ 591 (747) T COG3973 527 RRLKHTVIDEAQDYSRFQFTDNRTLAERASMTIVGDYGQVIY----D-----------EAQELSPMERMDVFFEDPSFEY 591 (747) T ss_pred CCCCCEEECHHHHCCHHHHHHHHHHHHHCCCEEECCCCCEEH----H-----------HHCCCCHHHHHHHHHHCCCCHH T ss_conf 203322551344334024588764432036447624785532----2-----------0122687999888872897054 Q ss_pred CEECCCCCCCCHHHHHHHHHHCCCCEEEECC-CCCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 5205305579978999998503698047646-877666668848997289721788898999999999999953185771 Q gi|254780229|r 1436 RRLQWHYRSLNENLIACSNYYFYDNSLIVFP-SPYTSVDRYGVGFTHVKNGVVVDQGNPEEARVIALAVKDHALRYPEES 1514 (1775) Q Consensus 1436 ~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~p-sp~~~~~~~~v~~~~v~~g~~~~~~N~~EA~~vv~~v~~~~~~~~~~s 1514 (1775) ..|.-.|||--+ |+.|-|...-++ || -|.+.....+..... -+|..=.+... .++..|+.....+ T Consensus 592 v~l~~syrSt~e-I~efan~~l~d~----~~~~p~~rsge~p~~i~~--------~~ne~l~qr~~-~ii~~mkk~~~et 657 (747) T COG3973 592 VGLIASYRSTAE-IDEFANSLLPDR----FRIHPLTRSGEKPAVIMS--------VANEELVQRNP-DIIPRMKKRGSET 657 (747) T ss_pred HHHHHHHCCHHH-HHHHHHHHCCCC----CCCCHHHCCCCCCEEEEC--------CCHHHHHHHHH-HHHHHHHHCCCCC T ss_conf 324565258588-999999863277----766612117887622111--------13398987507-7888887527871 Q ss_pred EEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCHHHCCCCCCCEEEEEEEEECCCCCCCCCCCCCCC Q ss_conf 89981998999999999999931297689997641147898585425680866678899996650788887620011477 Q gi|254780229|r 1515 LGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKNLENVQGDERDRIFISFTYGPHEPGGRVFQRFGPI 1594 (1775) Q Consensus 1515 iGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvknle~vQGdERDvI~~S~~yg~~~~~g~~~~~fGpl 1594 (1775) ||||+-+..+-..+.+.|.+.-... .++.--. ..+.+.-| =-.--.-|-|-|.+| +|.|.. -- T Consensus 658 iaVi~kt~~d~~~~~d~lre~~~~r-~I~k~nq-~f~~~~~v--ipvy~aKGlEFD~vi---v~d~s~----------~e 720 (747) T COG3973 658 IAVICKTDHDCKAVMDSLREKDSQR-TIAKENQ-RFHHGSDV--IPVYDAKGLEFDHVI---VVDPSI----------VE 720 (747) T ss_pred EEEECCCHHHHHHHHHHHHHCCHHH-HHHHHCC-CCCCCCEE--EEEEECCCCEEEEEE---EECCHH----------HC T ss_conf 6997776789999999874201145-7876156-66478248--882202563000489---964113----------02 Q ss_pred CCCCHHHHHHHHHHHHCCEEEEEEC Q ss_conf 7656246899999832142698870 Q gi|254780229|r 1595 NSDIGWRRLNVLFTRSRRRIEVFST 1619 (1775) Q Consensus 1595 n~~gg~rRlNVa~sRAr~~~~v~~s 1619 (1775) +.+.|.|-|-||+|||=+.+.+|+- T Consensus 721 ~te~~~r~LYva~TRAlh~l~if~~ 745 (747) T COG3973 721 ETEQDLRDLYVAVTRALHSLYIFGE 745 (747) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 5633305689999999989888506 No 63 >KOG0996 consensus Probab=97.39 E-value=0.024 Score=39.42 Aligned_cols=15 Identities=7% Similarity=0.053 Sum_probs=8.2 Q ss_pred EEEEEEEEEEECCCC Q ss_conf 567764357633888 Q gi|254780229|r 225 LYIVLGFLEWYEIDD 239 (1775) Q Consensus 225 Lyla~G~L~W~~~~~ 239 (1775) +.++++|=+-.+.++ T Consensus 160 CsV~vhFq~iiD~~~ 174 (1293) T KOG0996 160 CSVEVHFQKIIDKPG 174 (1293) T ss_pred EEEEEEEEEEECCCC T ss_conf 048876433231689 No 64 >PRK13909 putative recombination protein RecB; Provisional Probab=97.35 E-value=0.015 Score=41.44 Aligned_cols=49 Identities=20% Similarity=0.348 Sum_probs=44.9 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCC---EEEEEECCHHHHHHHHHHHHHC Q ss_conf 599428787703899999999996798---2899601058999999864435 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGK---KILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk---~VLfVaeK~aALdVV~~RL~~~ 470 (1775) +||.-++|||||.|++..|.++|..|. .+|.|..+++|..-.+.|+... T Consensus 1 l~i~AsAGSGKT~~L~~R~l~Ll~~g~~p~~ILavTFT~kAA~EMk~RI~~~ 52 (911) T PRK13909 1 LALKASAGSGKTFALSVRFLALLFKGANPNEILALTFTKKAANEMKERIIDT 52 (911) T ss_pred CEEEECCCHHHHHHHHHHHHHHHCCCCCCCCEEEECHHHHHHHHHHHHHHHH T ss_conf 9799767044999999999999738979244878556889999999999999 No 65 >COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair] Probab=97.35 E-value=0.0014 Score=52.16 Aligned_cols=80 Identities=23% Similarity=0.381 Sum_probs=52.8 Q ss_pred CCHHHHHHHHHHHCC----CCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECC Q ss_conf 734689999985148----9559942878770389999999999679828996010589999998644358441365402 Q gi|254780229|r 405 SDSSQHSAVIDVING----KNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELH 480 (1775) Q Consensus 405 aDsSQ~~av~~~~~g----~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLH 480 (1775) ...-|..|+..+... +.|+++|.||+|||-.-.++|+.+|++||.|||.-=-.+=+.-+-.|+... .|.-.-.+| T Consensus 199 Ln~~Q~~a~~~i~~~~~~~~~~Ll~GvTGSGKTEvYl~~i~~~L~~GkqvLvLVPEI~Ltpq~~~rf~~r-Fg~~v~vlH 277 (730) T COG1198 199 LNQEQQAAVEAILSSLGGFAPFLLDGVTGSGKTEVYLEAIAKVLAQGKQVLVLVPEIALTPQLLARFKAR-FGAKVAVLH 277 (730) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHH-HCCCHHHHC T ss_conf 3889999999999750566653676777885899999999999975987999956534569999999998-678745314 Q ss_pred CCCHH Q ss_conf 66110 Q gi|254780229|r 481 SHKVH 485 (1775) Q Consensus 481 s~ka~ 485 (1775) |.-++ T Consensus 278 S~Ls~ 282 (730) T COG1198 278 SGLSP 282 (730) T ss_pred CCCCH T ss_conf 65792 No 66 >PRK10875 recD exonuclease V subunit alpha; Provisional Probab=97.28 E-value=0.00078 Score=54.89 Aligned_cols=44 Identities=20% Similarity=0.416 Sum_probs=33.4 Q ss_pred CCCCCEEEECCHHHCCHHH---HHHHHHHCCEEEEEECCCCCCCCHH Q ss_conf 2541078860632369899---8898861182899607786686201 Q gi|254780229|r 1359 DVKFDLVIMDESSQIKPED---ALGVIARGKQIVVVGDPKQLPPTRF 1402 (1775) Q Consensus 1359 ~~~FD~VI~DEASQ~~~~~---al~al~rakqvvvvGD~kQlpPt~f 1402 (1775) +..+|+|||||||=++.+- .|-||--.-++|+|||+.||||..- T Consensus 258 PL~~DvlIVDEASMVDl~Lm~~LL~Alp~~aRLILvGD~dQLpSVga 304 (607) T PRK10875 258 PLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQLASVEA 304 (607) T ss_pred CCCCCEEEEECCHHHHHHHHHHHHHHCCCCCEEEEECCHHHCCCCCC T ss_conf 99889899907336659999999982899988999656232478888 No 67 >KOG0994 consensus Probab=97.27 E-value=0.031 Score=38.30 Aligned_cols=20 Identities=30% Similarity=0.416 Sum_probs=6.7 Q ss_pred CEEEECCCCCHHHHHHHHHH Q ss_conf 13654026611089999999 Q gi|254780229|r 474 EFCLELHSHKVHKRAILDDL 493 (1775) Q Consensus 474 ~~cleLHs~ka~k~~vl~~L 493 (1775) +.|.+-|---.....++..| T Consensus 1157 P~C~PCh~CF~~WD~il~~L 1176 (1758) T KOG0994 1157 PVCVPCHECFQTWDAILQEL 1176 (1758) T ss_pred CCCCCHHHHHHHHHHHHHHH T ss_conf 87744488887899999999 No 68 >TIGR01447 recD exodeoxyribonuclease V, alpha subunit; InterPro: IPR006344 This family describes the exodeoxyribonuclease V alpha subunit, RecD. RecD is part of a RecBCD complex. C-terminal part of the protein matches a domain found in viral RNA helicase, superfamily 1, IPR000606 from INTERPRO. ; GO: 0008854 exodeoxyribonuclease V activity, 0009338 exodeoxyribonuclease V complex. Probab=97.25 E-value=0.00046 Score=57.28 Aligned_cols=101 Identities=16% Similarity=0.152 Sum_probs=59.3 Q ss_pred CCCCCCEEEECCHHHCCHHH---HHHHHHHCCE--------EEEEECCCCCCCCHHHHHCCCCCC---------CCCCHH Q ss_conf 22541078860632369899---8898861182--------899607786686201100013444---------432011 Q gi|254780229|r 1358 KDVKFDLVIMDESSQIKPED---ALGVIARGKQ--------IVVVGDPKQLPPTRFFDHDNDQED---------YDEEVA 1417 (1775) Q Consensus 1358 ~~~~FD~VI~DEASQ~~~~~---al~al~rakq--------vvvvGD~kQlpPt~ff~~~~~~~~---------~~~~~~ 1417 (1775) .+..|||+||||||=++..- .|-|+--.-| .|++||+.||||..-+.-...-.+ ..+... T Consensus 346 N~L~~DVLvvDEaSMVdl~lm~kL~~A~~~~~k~~KLy~~~LIllGD~nQL~sveaG~Vl~~l~~~~~~g~~~y~~~~~a 425 (753) T TIGR01447 346 NPLPLDVLVVDEASMVDLPLMAKLLKALPPNTKDKKLYADRLILLGDKNQLPSVEAGAVLGDLCELANEGESLYAAEKSA 425 (753) T ss_pred CCCCCCEEEECCCHHCCHHHHHHHHHHCCCCCCCCCHHHCCCCEECCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 88985527870600226799999997226300132010102001226788887543555799999975375002454888 Q ss_pred ----HCCCCCHHHHHHHHH-CC----CCEECCCCCCCCHHHHHHHHHHCC Q ss_conf ----014400079999975-58----752053055799789999985036 Q gi|254780229|r 1418 ----AVSQTESILDALLPL-FS----MRRLQWHYRSLNENLIACSNYYFY 1458 (1775) Q Consensus 1418 ----~~~~~eSild~~~~~-~p----~~~L~~HyRs~~e~lI~fSN~~fY 1458 (1775) .....-.+...|... -| ..+|+.-||+-+.|=|+---...| T Consensus 426 ~l~~~~~~~~~~~~~~~~~~~~~~d~~~~L~~ShRf~~~S~i~~La~Ai~ 475 (753) T TIGR01447 426 LLAELLGEVAAVKKIASKTRNPLSDNVCMLKTSHRFGKDSGIGQLAKAIN 475 (753) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCHHHHHHHHHH T ss_conf 99985478899998531211523544765301220487506789999997 No 69 >PRK11773 uvrD DNA-dependent helicase II; Provisional Probab=97.25 E-value=0.00055 Score=56.54 Aligned_cols=62 Identities=21% Similarity=0.231 Sum_probs=42.9 Q ss_pred CCCEEECCHHHCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCC---HHHHH-HHHHHHHCCEEEEEECC Q ss_conf 8985854256808666788999966507888876200114777656---24689-99998321426988703 Q gi|254780229|r 1553 ADPFFIKNLENVQGDERDRIFISFTYGPHEPGGRVFQRFGPINSDI---GWRRL-NVLFTRSRRRIEVFSTM 1620 (1775) Q Consensus 1553 ~e~~fvknle~vQGdERDvI~~S~~yg~~~~~g~~~~~fGpln~~g---g~rRl-NVa~sRAr~~~~v~~s~ 1620 (1775) .+.+-+-+++.--|-|-||||+- |=. .|.++.. -.++..+ -|||| =|||||||+++++-++- T Consensus 551 ~d~V~LmTiH~AKGLEFp~Vfl~---gl~--eg~fP~~-~s~~~~~~leEERRLfYVaiTRAk~~L~ls~~~ 616 (722) T PRK11773 551 EDAVQLMTLHSAKGLEFPLVFLV---GME--EGLFPSQ-MSLEEGGRLEEERRLAYVGITRAMQKLTLTYAE 616 (722) T ss_pred CCEEEEEEEECCCCCCCCEEEEE---CCC--CCCCCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHEEEEECC T ss_conf 88289996120268667768980---661--7988863-346897510899999999898874524885512 No 70 >PRK10869 recombination and repair protein; Provisional Probab=97.23 E-value=0.034 Score=37.88 Aligned_cols=26 Identities=23% Similarity=0.451 Sum_probs=21.8 Q ss_pred CCCE-EEECCCCCCCHHHHHHHHHHHHH Q ss_conf 8955-99428787703899999999996 Q gi|254780229|r 419 GKNL-IIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 419 g~~~-vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +..| ||+|.+|+||| -|.+.|..++. T Consensus 21 ~~Gl~viTGETGAGKS-ill~al~lllG 47 (553) T PRK10869 21 QSGMTVITGETGAGKS-IAIDALGLCLG 47 (553) T ss_pred CCCCEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 9998687789998799-99999999848 No 71 >TIGR02169 SMC_prok_A chromosome segregation protein SMC; InterPro: IPR011891 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by IPR011890 from INTERPRO. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent .. Probab=97.22 E-value=0.034 Score=37.84 Aligned_cols=77 Identities=30% Similarity=0.402 Sum_probs=45.5 Q ss_pred CCCE-EEECCCCCCCHHHHHHHHHHHHH-----------------CC-----C----------EEEEEECCHH-H----- Q ss_conf 8955-99428787703899999999996-----------------79-----8----------2899601058-9----- Q gi|254780229|r 419 GKNL-IIEGPPGTGKSQTITNIIAAAML-----------------HG-----K----------KILFCAQKMA-A----- 459 (1775) Q Consensus 419 g~~~-vi~GPPGTGKSQTIaNlIa~~la-----------------~G-----k----------~VLfVaeK~a-A----- 459 (1775) +++| ||-||.|+||| =|-++|-.||. +| | +|.|-..-.+ . T Consensus 23 ~~GFtvIsGPNGSGKS-NI~DaiLFaLGl~s~k~~RAe~LsDLi~~gahe~~~~~~~~~ea~V~v~F~nd~~~~prsqvt 101 (1202) T TIGR02169 23 LKGFTVISGPNGSGKS-NIIDAILFALGLSSSKAMRAERLSDLIFNGAHEAAKNAKSADEAEVTVTFKNDDGKFPRSQVT 101 (1202) T ss_pred CCCCEEEECCCCCCHH-HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCC T ss_conf 8888478888888645-599999997664102342221052224487422235688996898988872278758731000 Q ss_pred ----------H--HHHHHHHHHCCCCC-E-EEECCCCCHHHHHHHHHHHHH Q ss_conf ----------9--99998644358441-3-654026611089999999999 Q gi|254780229|r 460 ----------I--EVVRHRLEKAGLGE-F-CLELHSHKVHKRAILDDLRKR 496 (1775) Q Consensus 460 ----------L--dVV~~RL~~~gLg~-~-cleLHs~ka~k~~vl~~L~~~ 496 (1775) + .+|.+||.-.+=|. + ..-|-+...+..++-+.|.+. T Consensus 102 taaGsenv~~vde~~v~Rr~kv~~~~~yySyY~lNG~~~~l~ei~d~L~~~ 152 (1202) T TIGR02169 102 TAAGSENVLEVDELEVSRRLKVTDDGKYYSYYYLNGKSVRLSEIHDFLAAA 152 (1202) T ss_pred CCCCCCCCCCCCCEEEEEEEEECCCCCEEEEEEECCCCCCHHHHHHHHHHC T ss_conf 022444444224358998888737984688888708203576689999861 No 72 >cd00009 AAA The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases. Probab=97.21 E-value=0.00074 Score=55.16 Aligned_cols=40 Identities=38% Similarity=0.742 Sum_probs=30.3 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEE Q ss_conf 985148955994287877038999999999967-98289960 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCA 454 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVa 454 (1775) .....+++++|.|||||||| |++..||..+.. |..++.+. T Consensus 14 ~~~~~~~~ill~GppGtGKT-~la~~ia~~~~~~~~~~~~~~ 54 (151) T cd00009 14 LELPPPKNLLLYGPPGTGKT-TLARAIANELFRPGAPFLYLN 54 (151) T ss_pred HHCCCCCEEEEECCCCCCHH-HHHHHHHHHHCCCCCCEEEEE T ss_conf 81879980899899998865-999999997121379827854 No 73 >TIGR01448 recD_rel helicase, RecD/TraA family; InterPro: IPR006345 These sequences represent a family similar to RecD, the exodeoxyribonuclease V alpha chain of IPR006344 from INTERPRO. Members of this family, however, are not found in a context of RecB and RecC and are longer by about 200 amino acids at the amino end. Chlamydia muridarum has both a member of this family and a RecD. . Probab=97.18 E-value=0.0012 Score=52.96 Aligned_cols=43 Identities=35% Similarity=0.419 Sum_probs=37.2 Q ss_pred CCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 66773468999998514895599428787703899999999996 Q gi|254780229|r 402 IDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 402 ~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) -+.-+.+|..|+..+..-|=+||.|=|||||| ||++.|...-+ T Consensus 348 ~~~l~~~Qk~AL~~~~~~Kv~iLTGGPGTGKt-T~t~~i~~~~~ 390 (769) T TIGR01448 348 RLKLSEEQKEALKTAIQDKVVILTGGPGTGKT-TITKAIIELYE 390 (769) T ss_pred CCCHHHHHHHHHHHHHCCCEEEEECCCCCCHH-HHHHHHHHHHH T ss_conf 67706889999999860948998577888616-89999999998 No 74 >pfam10881 DUF2726 Protein of unknown function (DUF2726). This bacterial family of proteins has no known function. Probab=97.18 E-value=0.0023 Score=49.98 Aligned_cols=88 Identities=20% Similarity=0.239 Sum_probs=68.5 Q ss_pred CCCCHHHHHHHHHHHH---CCCEEEECCCCC----------------------CEECCEEEECCCCCCCEEEEEECCCHH Q ss_conf 6899899999999997---899897065757----------------------710125733689888317999745802 Q gi|254780229|r 1660 KQDSDFAVSVVNELEK---TGFACDSQLGDM----------------------GFSIDVVVRDPNNPGHYLMGIECDGAM 1714 (1775) Q Consensus 1660 ~~ds~fe~~v~~~L~~---~G~~v~~qvg~~----------------------~~riDl~v~~p~~pg~~~l~iecDG~~ 1714 (1775) +.=|+-|..++..|.+ ..|.|.|||-.| +.++|++|.|+.. +..+++||-||.. T Consensus 3 ~llt~~E~~f~~~L~~al~~~~~V~~qV~l~dil~~~~~~~~~~~~~~~~~i~~k~~DFvi~d~~~-~~~v~~IEld~~~ 81 (127) T pfam10881 3 PLLNPAERAFLNALEEAVGDRYRVLAQVSLSDVLTVDPGVSKREWQRAFNRISAKHVDFVLCDKSD-LRPVAAIELDDAS 81 (127) T ss_pred CCCCHHHHHHHHHHHHHHCCCCEEEEEEEHHHHCCCCCCCCHHHHHHHHHHHHCCEEEEEEEECCC-CCEEEEEEECCCC T ss_conf 888999999999999983878599976688895066677576789999998655066699997799-9489999946866 Q ss_pred HCCCCCHHHHHHHHHHHHHHCCCCEEEEECHHHHC Q ss_conf 36832486777444899995699099932216411 Q gi|254780229|r 1715 YNSAKSARDRDRLRQEVLERMGWRIRRIWSVDWFS 1749 (1775) Q Consensus 1715 yh~~~~ardrD~~Rq~vLe~~GW~~~Riws~dw~~ 1749 (1775) |..+..+.||.....+|+..|-.++|+-...-|. T Consensus 82 -h~~~~~~~rD~~k~~~~~~agipllR~~~~~~y~ 115 (127) T pfam10881 82 -HKQGKAAKRDAFKRAALESAGIPLLRVKAKASYT 115 (127) T ss_pred -CCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCC T ss_conf -5655278889999999998799789854777999 No 75 >PRK06835 DNA replication protein DnaC; Validated Probab=97.11 E-value=0.00094 Score=54.08 Aligned_cols=37 Identities=35% Similarity=0.612 Sum_probs=31.6 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEE Q ss_conf 1489559942878770389999999999-6798289960 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCA 454 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVa 454 (1775) ....|+++.|||||||| -.+|.||+.| -+|.+|+|++ T Consensus 181 ~~~~nLlf~G~~G~GKT-fLa~~IA~ell~~g~sViy~t 218 (330) T PRK06835 181 KNNENLLFYGNTGTGKT-FLSNCIAKELLDRGKTVIYRT 218 (330) T ss_pred CCCCCEEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEE T ss_conf 88886698899999889-999999999998799499962 No 76 >COG2255 RuvB Holliday junction resolvasome, helicase subunit [DNA replication, recombination, and repair] Probab=97.10 E-value=0.0017 Score=51.37 Aligned_cols=39 Identities=31% Similarity=0.539 Sum_probs=32.0 Q ss_pred HHHHHHHHHHCCCC-----EEEECCCCCCCHHHHHHHHHHHHHCC Q ss_conf 68999998514895-----59942878770389999999999679 Q gi|254780229|r 408 SQHSAVIDVINGKN-----LIIEGPPGTGKSQTITNIIAAAMLHG 447 (1775) Q Consensus 408 SQ~~av~~~~~g~~-----~vi~GPPGTGKSQTIaNlIa~~la~G 447 (1775) +|+.....|+++++ +.+-||||.||| |.++|||.-|.-+ T Consensus 36 ~~L~ifI~AAk~r~e~lDHvLl~GPPGlGKT-TLA~IIA~Emgvn 79 (332) T COG2255 36 EQLQIFIKAAKKRGEALDHVLLFGPPGLGKT-TLAHIIANELGVN 79 (332) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECCCCCCHH-HHHHHHHHHHCCC T ss_conf 9999999999844987674786479987688-8999999985677 No 77 >cd03115 SRP The signal recognition particle (SRP) mediates the transport to or across the plasma membrane in bacteria and the endoplasmic reticulum in eukaryotes. SRP recognizes N-terminal sighnal sequences of newly synthesized polypeptides at the ribosome. The SRP-polypeptide complex is then targeted to the membrane by an interaction between SRP and its cognated receptor (SR). In mammals, SRP consists of six protein subunits and a 7SL RNA. One of these subunits is a 54 kd protein (SRP54), which is a GTP-binding protein that interacts with the signal sequence when it emerges from the ribosome. SRP54 is a multidomain protein that consists of an N-terminal domain, followed by a central G (GTPase) domain and a C-terminal M domain. Probab=97.09 E-value=0.0026 Score=49.47 Aligned_cols=52 Identities=33% Similarity=0.450 Sum_probs=43.6 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE---CCHHHHHHHHHHHHHCCCC Q ss_conf 599428787703899999999996798289960---1058999999864435844 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA---QKMAAIEVVRHRLEKAGLG 473 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa---eK~aALdVV~~RL~~~gLg 473 (1775) +++.||||.|||.||+-|-+.+.-+|++|.+++ ...+|.|-++.--+-.|+. T Consensus 3 i~lvGptGvGKTTTiaKLA~~~~~~~~kV~lit~Dt~R~gA~eQL~~~a~~l~v~ 57 (173) T cd03115 3 ILLVGLQGVGKTTTAAKLALYLKKKGKKVLLVAADTYRPAAIEQLRVLGEQVGVP 57 (173) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHCCCE T ss_conf 9998999998899999999999976992899974887577999999999974985 No 78 >cd00046 DEXDc DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region. Probab=97.06 E-value=0.0043 Score=47.18 Aligned_cols=63 Identities=21% Similarity=0.217 Sum_probs=48.4 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHH--CCCEEEEEECCHHHHHHHHHHHHHCCC-CCEEEECCCC Q ss_conf 95599428787703899999999996--798289960105899999986443584-4136540266 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAML--HGKKILFCAQKMAAIEVVRHRLEKAGL-GEFCLELHSH 482 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la--~Gk~VLfVaeK~aALdVV~~RL~~~gL-g~~cleLHs~ 482 (1775) +|++|++|+|+|||-+..-.|.+.+. .+++|||++-..+...-+++++...+- +..+...++. T Consensus 1 ~~~lv~~ptGsGKT~~~~~~~~~~~~~~~~~~~lil~Pt~~L~~q~~~~~~~~~~~~~~~~~~~~~ 66 (144) T cd00046 1 RDVLLAAPTGSGKTLAALLPILELLDSLKGGQVLVLAPTRELANQVAERLKELFGEGIKVGYLIGG 66 (144) T ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHHCCCCCEEEEEECC T ss_conf 999998899717999999999999975689769997467999999999999974887179999613 No 79 >COG2256 MGS1 ATPase related to the helicase subunit of the Holliday junction resolvase [DNA replication, recombination, and repair] Probab=97.05 E-value=0.0019 Score=50.86 Aligned_cols=38 Identities=29% Similarity=0.511 Sum_probs=27.1 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 98514895599428787703899999999996798289960 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) +.+..=.|.++-|||||||| ||+.+||... +..+...+ T Consensus 43 v~~~~l~SmIl~GPPG~GKT-TlA~liA~~~--~~~f~~~s 80 (436) T COG2256 43 VEAGHLHSMILWGPPGTGKT-TLARLIAGTT--NAAFEALS 80 (436) T ss_pred HHCCCCCEEEEECCCCCCHH-HHHHHHHHHH--CCCEEEEC T ss_conf 96499860577789998888-9999998761--77669951 No 80 >COG1074 RecB ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) [DNA replication, recombination, and repair] Probab=97.05 E-value=0.029 Score=38.47 Aligned_cols=56 Identities=23% Similarity=0.426 Sum_probs=49.8 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-----CEEEEEECCHHHHHHHHHHHHHC Q ss_conf 851489559942878770389999999999679-----82899601058999999864435 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-----KKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-----k~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ..-.|.+++|.---|||||.||+..|-.+|..| ..+|+|..+.||-..++.|+.+. T Consensus 12 ~~~~~~~~lveASAGSGKT~vL~~r~lrlLl~~~~~~v~~ILvvTFT~aAa~Emk~RI~~~ 72 (1139) T COG1074 12 ASPPGQSVLVEASAGTGKTFVLAERVLRLLLEGGPLDVDEILVVTFTKAAAAEMKERIRDR 72 (1139) T ss_pred HCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCCHHHEEEECCCHHHHHHHHHHHHHH T ss_conf 3586666899966889842169999999997258998557234406388899999999999 No 81 >PRK00440 rfc replication factor C small subunit; Reviewed Probab=97.05 E-value=0.0015 Score=52.02 Aligned_cols=26 Identities=38% Similarity=0.680 Sum_probs=22.8 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 895599428787703899999999996 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) =.++++.|||||||| |++.++|..+. T Consensus 37 ~phlLf~GppG~GKT-t~a~~la~~l~ 62 (318) T PRK00440 37 MPHLLFAGPPGTGKT-TAALALARELY 62 (318) T ss_pred CCEEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 866988895998899-99999999976 No 82 >PRK09361 radB DNA repair and recombination protein RadB; Provisional Probab=97.05 E-value=0.00092 Score=54.16 Aligned_cols=39 Identities=31% Similarity=0.592 Sum_probs=33.8 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 489559942878770389999999999679828996010 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQK 456 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK 456 (1775) .|.-..|-|||||||||.-..+.+++...|++|||++-- T Consensus 22 ~G~itei~G~pG~GKTtl~lq~a~~~~~~g~~vlYidtE 60 (224) T PRK09361 22 RGTITQIYGPPGSGKTNICIQLAVEAARQGKKVIYIDTE 60 (224) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 887999989999859999999999999749909996787 No 83 >PRK09302 circadian clock protein KaiC; Reviewed Probab=97.03 E-value=0.015 Score=41.41 Aligned_cols=83 Identities=19% Similarity=0.359 Sum_probs=51.8 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCC--CE----EEECC---CCCHHH Q ss_conf 5148955994287877038999999999967982899601058999999864435844--13----65402---661108 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLG--EF----CLELH---SHKVHK 486 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg--~~----cleLH---s~ka~k 486 (1775) .-+|.+.+|-||||||||..-+.-+++++.+|.+++|++---..-.+ .+|....|+. ++ .|.++ ....+- T Consensus 263 l~~GsstLi~Gp~GtGKTtla~qFl~~~a~~GE~~l~~~FeE~~~~l-~~~a~~~G~dl~~~~~~G~l~i~~~~p~~~~~ 341 (501) T PRK09302 263 FFRGSIILVSGATGTGKTLLVSKFAEAACRRGERCLLFAFEESRAQL-VRNATSWGIDLEEMERKGLLKIICARPESTGL 341 (501) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCHHHH-HHHHHHCCCCHHHHHHCCCEEEEEECCCCCCH T ss_conf 75894699988999888999999999998659908999996799999-99999739984888748947999837000598 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999863 Q gi|254780229|r 487 RAILDDLRKRIDN 499 (1775) Q Consensus 487 ~~vl~~L~~~le~ 499 (1775) .+.+..+++.++. T Consensus 342 ~e~~~~i~~~v~~ 354 (501) T PRK09302 342 EDHLQIIKREIEE 354 (501) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999997 No 84 >pfam00448 SRP54 SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins. Probab=97.02 E-value=0.0028 Score=49.16 Aligned_cols=54 Identities=35% Similarity=0.560 Sum_probs=45.3 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE---CCHHHHHHHHHHHHHCCCCCE Q ss_conf 599428787703899999999996798289960---105899999986443584413 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA---QKMAAIEVVRHRLEKAGLGEF 475 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa---eK~aALdVV~~RL~~~gLg~~ 475 (1775) +.+.||+|.|||.||+-|-+++.-+|++|.+++ ...+|.+.++.--...|+.-+ T Consensus 4 i~lvGptGvGKTTTiaKLAa~~~~~~~~V~lit~Dt~R~gA~eQL~~ya~~l~v~~~ 60 (196) T pfam00448 4 ILLVGLQGSGKTTTIAKLAAYLKKQGKKVLLVAADTFRAAAIEQLKQLAERLGVPVF 60 (196) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEE T ss_conf 999899999889999999999997799289997587768899999999986398178 No 85 >KOG0979 consensus Probab=97.01 E-value=0.049 Score=36.14 Aligned_cols=36 Identities=39% Similarity=0.614 Sum_probs=27.2 Q ss_pred HHCCCCE-EEECCCCCCCHHHHHHHHHHHHHCCCEEEEE Q ss_conf 5148955-9942878770389999999999679828996 Q gi|254780229|r 416 VINGKNL-IIEGPPGTGKSQTITNIIAAAMLHGKKILFC 453 (1775) Q Consensus 416 ~~~g~~~-vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfV 453 (1775) ..-|.+| .|.||.||||| ||+++||-+|+ |+.-+.- T Consensus 38 ~~pgpsLNmIiGpNGSGKS-SiVcAIcLglg-G~Pk~lG 74 (1072) T KOG0979 38 FLPGPSLNMIIGPNGSGKS-SIVCAICLGLG-GKPKLLG 74 (1072) T ss_pred ECCCCCEEEEECCCCCCCH-HHHHHHHHHCC-CCHHHCC T ss_conf 4378861268778989704-88999999727-9744314 No 86 >smart00489 DEXDc3 DEAD-like helicases superfamily. Probab=96.99 E-value=0.0047 Score=46.75 Aligned_cols=63 Identities=24% Similarity=0.350 Sum_probs=39.4 Q ss_pred CCCHHHHHHHH----HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH---------CCCEEEEEECCHHHHHHHHHHHHH Q ss_conf 77346899999----8514895599428787703899999999996---------798289960105899999986443 Q gi|254780229|r 404 DSDSSQHSAVI----DVINGKNLIIEGPPGTGKSQTITNIIAAAML---------HGKKILFCAQKMAAIEVVRHRLEK 469 (1775) Q Consensus 404 ~aDsSQ~~av~----~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la---------~Gk~VLfVaeK~aALdVV~~RL~~ 469 (1775) +--+.|...+. ....|+++|++.|+|||||- + +++.+|+ .+++|.+.+.+..-++-+.+.|.. T Consensus 8 ~py~~Q~e~m~~v~~~l~~~~~~llEaPTGtGKTl--a-lL~~al~~~~~~~~~~~~~ki~~~t~t~~~~~~~~~elr~ 83 (289) T smart00489 8 EPYPIQYEFMEELKRVLDRGKIGILESPTGTGKTL--S-LLCLTLTWLRSFPERIQKIKLIYLSRTVSEIEKRLEELRK 83 (289) T ss_pred CCCHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHH--H-HHHHHHHHHHHCHHCCCCCEEEEEECCHHHHHHHHHHHHH T ss_conf 99989999999999999749979998999651899--9-9999999999670102456168761537889999999986 No 87 >smart00488 DEXDc2 DEAD-like helicases superfamily. Probab=96.99 E-value=0.0047 Score=46.75 Aligned_cols=63 Identities=24% Similarity=0.350 Sum_probs=39.4 Q ss_pred CCCHHHHHHHH----HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH---------CCCEEEEEECCHHHHHHHHHHHHH Q ss_conf 77346899999----8514895599428787703899999999996---------798289960105899999986443 Q gi|254780229|r 404 DSDSSQHSAVI----DVINGKNLIIEGPPGTGKSQTITNIIAAAML---------HGKKILFCAQKMAAIEVVRHRLEK 469 (1775) Q Consensus 404 ~aDsSQ~~av~----~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la---------~Gk~VLfVaeK~aALdVV~~RL~~ 469 (1775) +--+.|...+. ....|+++|++.|+|||||- + +++.+|+ .+++|.+.+.+..-++-+.+.|.. T Consensus 8 ~py~~Q~e~m~~v~~~l~~~~~~llEaPTGtGKTl--a-lL~~al~~~~~~~~~~~~~ki~~~t~t~~~~~~~~~elr~ 83 (289) T smart00488 8 EPYPIQYEFMEELKRVLDRGKIGILESPTGTGKTL--S-LLCLTLTWLRSFPERIQKIKLIYLSRTVSEIEKRLEELRK 83 (289) T ss_pred CCCHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHH--H-HHHHHHHHHHHCHHCCCCCEEEEEECCHHHHHHHHHHHHH T ss_conf 99989999999999999749979998999651899--9-9999999999670102456168761537889999999986 No 88 >PRK00254 ski2-like helicase; Provisional Probab=96.97 E-value=0.008 Score=44.36 Aligned_cols=61 Identities=28% Similarity=0.374 Sum_probs=45.9 Q ss_pred CHHHHHHHHH-HHCCCCEEEECCCCCCCHHHHHHHHH--HHHHCCCEEEEEECCHHHHHHHHHHH Q ss_conf 3468999998-51489559942878770389999999--99967982899601058999999864 Q gi|254780229|r 406 DSSQHSAVID-VINGKNLIIEGPPGTGKSQTITNIIA--AAMLHGKKILFCAQKMAAIEVVRHRL 467 (1775) Q Consensus 406 DsSQ~~av~~-~~~g~~~vi~GPPGTGKSQTIaNlIa--~~la~Gk~VLfVaeK~aALdVV~~RL 467 (1775) -+-|..|+.. +..|+|+||--|+|+||| .|+.+.. .++..|++++|++=-.|=..--++.+ T Consensus 25 ~p~Q~e~l~~g~~~g~NllvsaPT~sGKT-lvAElail~~~l~~~~k~iyi~P~kALa~EK~~~f 88 (717) T PRK00254 25 YPPQAEALTSGVLEGKNLLIAIPTASGKT-LIAEIAMVNKLLREGGKAVYLVPLKALAEEKFREF 88 (717) T ss_pred CHHHHHHHHHHHCCCCCEEEECCCCCCHH-HHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHH T ss_conf 89999998742336981899899887489-99999999999852992999926799999999999 No 89 >TIGR02173 cyt_kin_arch cytidylate kinase, putative; InterPro: IPR011892 Proteins in this family are believed to be cytidylate kinase. Members of this family are found in the archaea and in spirochaetes, and differ considerably from the common bacterial form of cytidylate kinase described by IPR003136 from INTERPRO.; GO: 0004127 cytidylate kinase activity, 0005524 ATP binding, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=96.97 E-value=0.00051 Score=56.88 Aligned_cols=23 Identities=43% Similarity=0.928 Sum_probs=20.9 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 599428787703899999999996 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) .+|=|||||||| |||-+||.-|. T Consensus 3 I~ISGpPGSGkt-TvA~~lA~~Ls 25 (173) T TIGR02173 3 ITISGPPGSGKT-TVAKILAEKLS 25 (173) T ss_pred EEEECCCCCCHH-HHHHHHHHHCC T ss_conf 887358968647-89999998639 No 90 >COG0210 UvrD Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair] Probab=96.97 E-value=0.044 Score=36.62 Aligned_cols=151 Identities=22% Similarity=0.270 Sum_probs=94.9 Q ss_pred CCCCCEEEECCHHHCCHHH--HHHHHHH-CCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHH--C Q ss_conf 2541078860632369899--8898861-182899607786686201100013444432011014400079999975--5 Q gi|254780229|r 1359 DVKFDLVIMDESSQIKPED--ALGVIAR-GKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALLPL--F 1433 (1775) Q Consensus 1359 ~~~FD~VI~DEASQ~~~~~--al~al~r-akqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~~~--~ 1433 (1775) ...|+.|.|||.--..+.. .+-.|+- +..+.||||+.|-==. |.+ .+.+.|++..... . T Consensus 211 ~~rf~~iLvDE~QDtn~~Q~~ll~~la~~~~~l~~VGD~dQsIY~--frG--------------A~~~ni~~f~~df~~~ 274 (655) T COG0210 211 QARFRYILVDEFQDTNPLQYELLKLLAGNAANLFVVGDDDQSIYG--FRG--------------ADPENILDFEKDFPAA 274 (655) T ss_pred HHCCCEEEEECCCCCCHHHHHHHHHHHCCCCCEEEECCCCCCEEE--ECC--------------CCCHHHHHHHHHCCCC T ss_conf 956898987877785187999999984487757998589873676--778--------------7836789787635655 Q ss_pred CCCEECCCCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCEEEE-ECCC---CCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 87520530557997899999850369804764687766666884899-7289---7217888989999999999999531 Q gi|254780229|r 1434 SMRRLQWHYRSLNENLIACSNYYFYDNSLIVFPSPYTSVDRYGVGFT-HVKN---GVVVDQGNPEEARVIALAVKDHALR 1509 (1775) Q Consensus 1434 p~~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~psp~~~~~~~~v~~~-~v~~---g~~~~~~N~~EA~~vv~~v~~~~~~ 1509 (1775) +...|.+-|||.+ .|+.+.|+..=.|+-. ++ ..+... .-.+ .++.......||..|+..|.++... T Consensus 275 ~~i~Le~nyRSt~-~Il~~An~~i~~n~~r-~~--------k~l~~~~~~~~~~~~~~~~~~~~~ea~~i~~~I~~l~~~ 344 (655) T COG0210 275 KVIKLEQNYRSTP-NILAAANKVIANNKKR-QA--------KTLRTEVEGSGEKVVLLLANDEEDEARWIASEIDALIEI 344 (655) T ss_pred CEEECCCCCCCHH-HHHHHHHHHHHHCCCC-CC--------CCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHH T ss_conf 4676042448889-9999999998718652-33--------421102356778856986788177899999999999982 Q ss_pred C--CCCEEEEEECCHHHHHHHHHHHHHH Q ss_conf 8--5771899819989999999999999 Q gi|254780229|r 1510 Y--PEESLGVIAMNAKQRDLIESTINKL 1535 (1775) Q Consensus 1510 ~--~~~siGVvt~~~~Q~~~I~~~l~~~ 1535 (1775) . +-..+.|..=+..|...|+..|... T Consensus 345 ~~~~~~d~aiL~R~n~~s~~~e~~l~~~ 372 (655) T COG0210 345 GKVNYSDIAILYRTNAQSRLIEEALRAA 372 (655) T ss_pred CCCCCCCEEEEEECCHHHHHHHHHHHHC T ss_conf 6888030788861566778899998662 No 91 >PRK11331 5-methylcytosine-specific restriction enzyme subunit McrB; Provisional Probab=96.97 E-value=0.0016 Score=51.76 Aligned_cols=49 Identities=24% Similarity=0.453 Sum_probs=35.8 Q ss_pred CCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCC---EEEEE Q ss_conf 66773468999998514895599428787703899999999996798---28996 Q gi|254780229|r 402 IDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGK---KILFC 453 (1775) Q Consensus 402 ~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk---~VLfV 453 (1775) +.+-.+.++ +...+..+|++++|||||||| -+|-.+|.+|..++ +|-+| T Consensus 179 ~~~~~i~~~--~~sLktKknvIL~G~pGtGKT-~lAk~lA~~l~g~~~~~rv~~V 230 (459) T PRK11331 179 IPETTIETI--LKRLTIKKNIILQGPPGVGKT-FVARRLAYLLTGEKAPQRVNMV 230 (459) T ss_pred CCCHHHHHH--HHHHCCCCCEEEECCCCCCHH-HHHHHHHHHHHCCCCCCCEEEE T ss_conf 561359999--998545882796589998878-9999999997078877846899 No 92 >cd01037 Restriction_endonuclease_like Superfamily of nucleases including Short Patch Repair (Vsr) Endonucleases, archaeal Holliday junction resolvases, MutH methy-directed DNA mismatch-repair endonucleases, and catalytic domains of many restriction endonucleases, such as EcoRI, BamHI, and FokI Probab=96.96 E-value=0.0027 Score=49.35 Aligned_cols=74 Identities=22% Similarity=0.226 Sum_probs=61.2 Q ss_pred HHHHHHHHHHHHCCCEEEE---CCCCCCEECCEEEECCCCCCCEEEEEECCCHHHCCCCCHHHHH---HHHHHHHHHCCC Q ss_conf 8999999999978998970---6575771012573368988831799974580236832486777---444899995699 Q gi|254780229|r 1664 DFAVSVVNELEKTGFACDS---QLGDMGFSIDVVVRDPNNPGHYLMGIECDGAMYNSAKSARDRD---RLRQEVLERMGW 1737 (1775) Q Consensus 1664 ~fe~~v~~~L~~~G~~v~~---qvg~~~~riDl~v~~p~~pg~~~l~iecDG~~yh~~~~ardrD---~~Rq~vLe~~GW 1737 (1775) .+|..+...|+..|+.... |+..++|..|+++. .+.++|||||..+|...-. .++ ..|...|+..|| T Consensus 1 ~~E~~l~~~l~~~~~~~~~~~~~~~~~~~~pDf~~~------~~~l~iE~k~~~~~~~~~~-~~~~~~~~~~~~~~~~g~ 73 (80) T cd01037 1 AAERLLWKLLRAKGLAVRRFRRQVPIGSYIPDFVCP------SAKLVIELKGTFHDGLLRK-LRTSEKQERIAFLEADGK 73 (80) T ss_pred CHHHHHHHHHHHCCCCCCEEEECCCCCCEEEEEECC------CCCEEEEECCCHHCCCHHH-HHHHHHHHHHHHHHHCCC T ss_conf 979999999985587887057137507887757725------4487999145121531355-544117789999998899 Q ss_pred CEEEEEC Q ss_conf 0999322 Q gi|254780229|r 1738 RIRRIWS 1744 (1775) Q Consensus 1738 ~~~Riws 1744 (1775) .++|+|. T Consensus 74 ~~~~~~~ 80 (80) T cd01037 74 KVLRFWN 80 (80) T ss_pred EEEEECC T ss_conf 8997439 No 93 >TIGR00635 ruvB Holliday junction DNA helicase RuvB; InterPro: IPR004605 All proteins in this family for which functions are known are 5'-3' DNA helicases that, as part of a complex with RuvA homologs serve as a 5'-3' Holliday junction helicase. RuvA specifically binds Holliday junctions as a sandwich of two tetramers and maintains the configuration of the junction. It forms a complex with two hexameric rings of RuvB, the subunit that contains helicase activity. The complex drives ATP-dependent branch migration of the Holliday junction recombination intermediate. The endonuclease RuvC resolves junctions.; GO: 0003677 DNA binding, 0005524 ATP binding, 0009378 Holliday junction helicase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=96.95 E-value=0.0022 Score=50.22 Aligned_cols=39 Identities=36% Similarity=0.578 Sum_probs=32.3 Q ss_pred HHHHHHHHHHCCCC-----EEEECCCCCCCHHHHHHHHHHHHHCC Q ss_conf 68999998514895-----59942878770389999999999679 Q gi|254780229|r 408 SQHSAVIDVINGKN-----LIIEGPPGTGKSQTITNIIAAAMLHG 447 (1775) Q Consensus 408 SQ~~av~~~~~g~~-----~vi~GPPGTGKSQTIaNlIa~~la~G 447 (1775) +|+.....|+++++ +.+-||||=||| |+|+|||+=|--. T Consensus 14 ~~L~l~I~AAk~R~e~LDH~LL~GPPGLGKT-TLA~IiA~Emg~~ 57 (305) T TIGR00635 14 EQLQLFIEAAKMRQEALDHLLLYGPPGLGKT-TLAHIIANEMGVN 57 (305) T ss_pred HHHHHHHHHHHHCCCCCCCEEEECCCCCCHH-HHHHHHHHHHCCC T ss_conf 9999999999824897341663175687467-8999999983893 No 94 >PRK08939 primosomal protein DnaI; Reviewed Probab=96.92 E-value=0.003 Score=48.87 Aligned_cols=36 Identities=25% Similarity=0.320 Sum_probs=31.8 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEE Q ss_conf 4895599428787703899999999996-798289960 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCA 454 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVa 454 (1775) .++.|-|.|||||||| -++..||+.|+ .|.+|+||. T Consensus 156 ~~kGlyl~G~~G~GKT-yL~~aian~La~~g~~v~~v~ 192 (306) T PRK08939 156 KVKGLYLYGDFGVGKT-YLLAAIANELAKKGVSSTLVH 192 (306) T ss_pred CCCEEEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEE T ss_conf 8877889899999899-999999999998699299987 No 95 >PRK09435 arginine/ornithine transport system ATPase; Provisional Probab=96.92 E-value=0.0034 Score=48.26 Aligned_cols=39 Identities=33% Similarity=0.442 Sum_probs=34.3 Q ss_pred HCCCCEE--EECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 1489559--94287877038999999999967982899601 Q gi|254780229|r 417 INGKNLI--IEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ 455 (1775) Q Consensus 417 ~~g~~~v--i~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae 455 (1775) ..|+.++ |.||||.|||..|-.++.++...|++|-++|- T Consensus 45 ~~g~a~~iGiTG~pG~GKStli~~l~~~~~~~g~~v~vlav 85 (325) T PRK09435 45 HTGNALRIGITGVPGVGKSTFIEALGMHLIEQGHKVAVLAV 85 (325) T ss_pred CCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 17982599742799986889999999999967985899997 No 96 >PRK09302 circadian clock protein KaiC; Reviewed Probab=96.92 E-value=0.0025 Score=49.63 Aligned_cols=21 Identities=0% Similarity=-0.102 Sum_probs=9.8 Q ss_pred HHHHHHHHHCCEEEEEECCCH Q ss_conf 899999832142698870387 Q gi|254780229|r 1602 RLNVLFTRSRRRIEVFSTMRY 1622 (1775) Q Consensus 1602 RlNVa~sRAr~~~~v~~s~~~ 1622 (1775) ++--++-.-+-+.+|+=|+.. T Consensus 347 ~i~~~v~~~~~~rVvIDsls~ 367 (501) T PRK09302 347 IIKREIEEFKPSRVAVDPLSA 367 (501) T ss_pred HHHHHHHHCCCCEEEECCHHH T ss_conf 999999972998999958068 No 97 >cd01124 KaiC KaiC is a circadian clock protein primarily found in cyanobacteria KaiC is a RecA-like ATPase, having both Walker A and Walker B motifs. A related protein is found in archaea. Probab=96.91 E-value=0.0022 Score=50.27 Aligned_cols=50 Identities=26% Similarity=0.452 Sum_probs=36.1 Q ss_pred CEEEECCCCCCCHHHH-HHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCC Q ss_conf 5599428787703899-999999996798289960105899999986443584 Q gi|254780229|r 421 NLIIEGPPGTGKSQTI-TNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGL 472 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTI-aNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gL 472 (1775) +.+|.|||||||| |+ ...+.+.+..|.+|||++--...-+ +.++....|+ T Consensus 1 stLi~G~pGsGKT-~~a~qfl~~~a~~ge~~lyis~eE~~~~-l~~~~~~~g~ 51 (187) T cd01124 1 STLLSGGPGTGKT-TFALQFLYAGLARGEPGLYVTLEESPEE-LIENAESLGW 51 (187) T ss_pred CEEEEECCCCCHH-HHHHHHHHHHHHCCCCEEEEEECCCHHH-HHHHHHHCCC T ss_conf 9158768999999-9999999999876997899995079999-9999998399 No 98 >COG0714 MoxR-like ATPases [General function prediction only] Probab=96.88 E-value=0.0024 Score=49.79 Aligned_cols=55 Identities=22% Similarity=0.462 Sum_probs=40.5 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CEEEEEECCHHHHHHHH Q ss_conf 6899999851489559942878770389999999999679-82899601058999999 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KKILFCAQKMAAIEVVR 464 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~VLfVaeK~aALdVV~ 464 (1775) -=..+..+...|..+.+.|||||||| +++..+|.++... .+|.|.+. +.+-|++. T Consensus 32 ~~~~~l~a~~~~~~vll~G~PG~gKT-~la~~lA~~l~~~~~~i~~t~~-l~p~d~~G 87 (329) T COG0714 32 VIELALLALLAGGHVLLEGPPGVGKT-LLARALARALGLPFVRIQCTPD-LLPSDLLG 87 (329) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCHH-HHHHHHHHHHCCCCEEEECCCC-CCHHHHCC T ss_conf 99999999985997787798987779-9999999983898189956899-88888205 No 99 >cd01122 GP4d_helicase GP4d_helicase is a homohexameric 5'-3' helicases. Helicases couple NTP hydrolysis to the unwinding of nucleic acid duplexes into their component strands. Probab=96.86 E-value=0.0067 Score=45.18 Aligned_cols=54 Identities=17% Similarity=0.305 Sum_probs=43.8 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHHHHHHHHHHH Q ss_conf 514895599428787703899999999996-798289960105899999986443 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAIEVVRHRLEK 469 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aALdVV~~RL~~ 469 (1775) ...|.=.||-|+||.|||.-+.|++++++. .|++|||.|--|..-++..+-+.. T Consensus 27 l~~GeL~viaarpg~GKT~f~~~~a~~~~~~~g~~vl~~SlEm~~~~~~~Rlls~ 81 (271) T cd01122 27 LRKGELIILTAGTGVGKTTFLREYALDLITQHGVRVGTISLEEPVVRTARRLLGQ 81 (271) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHH T ss_conf 9998089999689986999999999999997699089997049999999999999 No 100 >smart00382 AAA ATPases associated with a variety of cellular activities. AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment. Probab=96.85 E-value=0.0015 Score=52.05 Aligned_cols=43 Identities=26% Similarity=0.488 Sum_probs=31.7 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCC-EEEEEECCHHHHH Q ss_conf 4895599428787703899999999996798-2899601058999 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGK-KILFCAQKMAAIE 461 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk-~VLfVaeK~aALd 461 (1775) .+..+.|.|||||||| |++..||..+.... .|.+++-....-. T Consensus 1 ~~~~ill~G~~GsGKT-tl~~~la~~~~~~~~~v~~~~~~~~~~~ 44 (148) T smart00382 1 PGEVILIVGPPGSGKT-TLARALARELGPPGGGVIYIDGEDILEE 44 (148) T ss_pred CCCEEEEECCCCHHHH-HHHHHHHHHHCCCCCCEEEEEHHHHHHH T ss_conf 9978999999970299-9999999872668996899875998988 No 101 >TIGR01447 recD exodeoxyribonuclease V, alpha subunit; InterPro: IPR006344 This family describes the exodeoxyribonuclease V alpha subunit, RecD. RecD is part of a RecBCD complex. C-terminal part of the protein matches a domain found in viral RNA helicase, superfamily 1, IPR000606 from INTERPRO. ; GO: 0008854 exodeoxyribonuclease V activity, 0009338 exodeoxyribonuclease V complex. Probab=96.84 E-value=0.00094 Score=54.07 Aligned_cols=39 Identities=31% Similarity=0.455 Sum_probs=35.8 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 689999985148955994287877038999999999967 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) -|..|+.-|.++.=++|.|=||||||.||+-||...+.. T Consensus 231 ~Q~~a~~~aL~~~f~li~GGPGTGKTTTv~~LL~al~~~ 269 (753) T TIGR01447 231 WQKVAVALALKSNFSLITGGPGTGKTTTVARLLLALIKQ 269 (753) T ss_pred HHHHHHHHHHHCCEEEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 999999998608768998798897789999999999998 No 102 >pfam03308 ArgK ArgK protein. The ArgK protein acts as an ATPase enzyme and as a kinase, and phosphorylates periplasmic binding proteins involved in the LAO (lysine, arginine, ornithine)/AO transport systems. Probab=96.83 E-value=0.0043 Score=47.17 Aligned_cols=41 Identities=27% Similarity=0.370 Sum_probs=34.9 Q ss_pred HHHCCCCEE--EECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 851489559--94287877038999999999967982899601 Q gi|254780229|r 415 DVINGKNLI--IEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ 455 (1775) Q Consensus 415 ~~~~g~~~v--i~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae 455 (1775) -...|++++ |.||||.|||..|-.+|.+.+..|++|-++|- T Consensus 23 ~~~~g~a~~iGiTG~PGaGKStli~~l~~~~~~~g~~vaVlAv 65 (267) T pfam03308 23 MPLTGRAHRVGITGVPGAGKSTLIEALGMELRRRGHRVAVLAV 65 (267) T ss_pred HHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 7435995599876899887999999999999968986899997 No 103 >PRK06731 flhF flagellar biosynthesis regulator FlhF; Validated Probab=96.83 E-value=0.0056 Score=45.96 Aligned_cols=54 Identities=17% Similarity=0.367 Sum_probs=43.6 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE---CCHHHHHHHHHHHHHCCCCC Q ss_conf 5599428787703899999999996798289960---10589999998644358441 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA---QKMAAIEVVRHRLEKAGLGE 474 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa---eK~aALdVV~~RL~~~gLg~ 474 (1775) .+++.||.|+|||.||+-|-+.+...|++|++|| ...+|.|-++.--...|+.- T Consensus 77 vI~lvG~~G~GKTTT~AKLA~~~~~~~~kV~lia~DtyR~aA~eQLk~~a~~l~v~~ 133 (270) T PRK06731 77 TIALIGPTGVGKTTTLAKMAWQFHGKKKTVGFITTDHSRIGTVQQLQDYVKTIGFEV 133 (270) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHHHCCCE T ss_conf 899988898988999999999998679908999838888889999999999819953 No 104 >PRK08084 DNA replication initiation factor; Provisional Probab=96.81 E-value=0.052 Score=35.94 Aligned_cols=57 Identities=19% Similarity=0.335 Sum_probs=36.1 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHH-HHHCCCEEEEEECCHHHH--HHHHHHHHHCCCCCEEEE Q ss_conf 895599428787703899999999-996798289960105899--999986443584413654 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAA-AMLHGKKILFCAQKMAAI--EVVRHRLEKAGLGEFCLE 478 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~-~la~Gk~VLfVaeK~aAL--dVV~~RL~~~gLg~~cle 478 (1775) |.-+.|.||+|+|||| +.++||+ +...|++|.+++-+..+- .-+...+.+ +.-.|++ T Consensus 45 ~~~l~l~G~~G~GKTH-LLqA~~~~~~~~~~~~~yl~~~~~~~~~~~~l~~l~~--~dll~iD 104 (235) T PRK08084 45 SGYIYLWGREGAGRSH-LLHAACAELSQRGDAVGYVPLDKRAWFVPEVLEGMEQ--LSLVCID 104 (235) T ss_pred CCEEEEECCCCCCHHH-HHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHHHHH--CCEEEEE T ss_conf 8769998999988899-9999999997079857998779866517999987641--8989982 No 105 >PRK05703 flhF flagellar biosynthesis regulator FlhF; Validated Probab=96.81 E-value=0.0068 Score=45.08 Aligned_cols=56 Identities=25% Similarity=0.490 Sum_probs=40.0 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHH-HHCCC-EEEEEE---CCHHHHHHHHHHHHHCCCC Q ss_conf 48955994287877038999999999-96798-289960---1058999999864435844 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAA-MLHGK-KILFCA---QKMAAIEVVRHRLEKAGLG 473 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk-~VLfVa---eK~aALdVV~~RL~~~gLg 473 (1775) .|+-+.+.||+|.|||.||+-|-|.+ +.+|| +|-+++ ...+|.+-++.==+-.|+. T Consensus 209 ~~~vvalVGPTGVGKTTTiAKLAA~~~l~~~~~kV~lIT~DtyRigA~eQLk~Ya~ilgvp 269 (412) T PRK05703 209 QGGVVALVGPTGVGKTTTLAKLAARYALEEGKDKVALITLDTYRIGAVEQLKTYAKIMGIP 269 (412) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCE T ss_conf 6736999888887567699999999999729981799983767777999999999971973 No 106 >PRK04328 hypothetical protein; Provisional Probab=96.80 E-value=0.0031 Score=48.65 Aligned_cols=55 Identities=20% Similarity=0.349 Sum_probs=42.4 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCC Q ss_conf 14895599428787703899999999996798289960105899999986443584 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGL 472 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gL 472 (1775) -.|...+|.||||||||---...+++.+.+|.++||++--...-+++ +.....|. T Consensus 22 p~gs~~Lv~G~pGtGKT~la~qFl~~g~~~GE~~lyis~eE~~~~l~-~~~~~~G~ 76 (250) T PRK04328 22 PERNVVLLSGGPGTGKSIFSQQFLWNGLQMGEPGIYVALEEHPVQVR-RNMAQFGW 76 (250) T ss_pred CCCEEEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCHHHHH-HHHHHCCC T ss_conf 79969999828999989999999999987699779999727999999-99998099 No 107 >cd01127 TrwB Bacterial conjugation protein TrwB, ATP binding domain. TrwB is a homohexamer encoded by conjugative plasmids in Gram-negative bacteria. TrwB also has an all alpha domain which has been hypothesized to be responsible for DNA binding. TrwB is a component of Type IV secretion and is responsible for the horizontal transfer of DNA between bacteria. Probab=96.80 E-value=0.0023 Score=49.95 Aligned_cols=40 Identities=23% Similarity=0.418 Sum_probs=35.0 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHH Q ss_conf 8955994287877038999999999967982899601058 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMA 458 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~a 458 (1775) -+++.|.|+||||||++|.++|.+++++|.++.++=-|-- T Consensus 42 ~~H~lv~G~tGsGKT~~i~~li~~~~~rg~~~II~DpkGe 81 (410) T cd01127 42 EAHTMIIGTTGTGKTTQIRELLASIRARGDRAIIYDPNGG 81 (410) T ss_pred HCCEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCH T ss_conf 2747998899998899999999999986990999958854 No 108 >COG0467 RAD55 RecA-superfamily ATPases implicated in signal transduction [Signal transduction mechanisms] Probab=96.79 E-value=0.0035 Score=48.07 Aligned_cols=51 Identities=24% Similarity=0.494 Sum_probs=42.6 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHH Q ss_conf 514895599428787703899999999996798289960105899999986 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHR 466 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~R 466 (1775) .-.|..++|.||||||||--..+.+...+-.|.+|+||+-....-++.+.- T Consensus 20 ~p~g~~~lI~G~pGsGKT~f~~qfl~~~~~~ge~vlyvs~~e~~~~l~~~~ 70 (260) T COG0467 20 LPRGSVVLITGPPGTGKTIFALQFLYEGAREGEPVLYVSTEESPEELLENA 70 (260) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEEEECCHHHHHHHH T ss_conf 889978999938998689999999997762698589999206989999999 No 109 >PRK11057 ATP-dependent DNA helicase RecQ; Provisional Probab=96.78 E-value=0.021 Score=40.05 Aligned_cols=85 Identities=20% Similarity=0.343 Sum_probs=58.4 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHH Q ss_conf 68999998514895599428787703899999999996798289960105899999986443584413654026611089 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHKR 487 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k~ 487 (1775) -|..+|.++..|+|.++.=|+|.||| +.--+-.++- +..+++||=-.+=-.---..|...|+.-.|+.=..++..+. T Consensus 29 ~Q~e~i~~~l~g~D~l~~mpTG~GKS--lcyQlPal~~-~g~tiVisPLisLm~DQv~~L~~~gi~a~~l~s~~~~~e~~ 105 (607) T PRK11057 29 GQEEIIDTVLSGRDCLVVMPTGGGKS--LCYQIPALVL-DGLTLVVSPLISLMKDQVDQLLANGVAAACLNSTQTREQQL 105 (607) T ss_pred HHHHHHHHHHCCCCEEEECCCCCHHH--HHHHHHHHHC-CCCEEEECCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHH T ss_conf 69999999986998899878995597--9999999877-99889986879999999999997899299956999999999 Q ss_pred HHHHHHHH Q ss_conf 99999999 Q gi|254780229|r 488 AILDDLRK 495 (1775) Q Consensus 488 ~vl~~L~~ 495 (1775) .++..+.. T Consensus 106 ~~~~~~~~ 113 (607) T PRK11057 106 EVMAGCRT 113 (607) T ss_pred HHHHHHHC T ss_conf 99999975 No 110 >TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321 These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport. Probab=96.77 E-value=0.0012 Score=53.10 Aligned_cols=32 Identities=31% Similarity=0.496 Sum_probs=25.8 Q ss_pred HHHCCCCEE-EECCCCCCCHHHHHHHHHHHHHC Q ss_conf 851489559-94287877038999999999967 Q gi|254780229|r 415 DVINGKNLI-IEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 415 ~~~~g~~~v-i~GPPGTGKSQTIaNlIa~~la~ 446 (1775) -+.+-+.+| |.||+|||||.|||-||-+.=-+ T Consensus 122 ~a~~~~GLiLVTGPTGSGKSTTlAsmIDyIN~~ 154 (350) T TIGR01420 122 LAERPRGLILVTGPTGSGKSTTLASMIDYINKN 154 (350) T ss_pred HHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCC T ss_conf 983669938987688986789999999787403 No 111 >cd01120 RecA-like_NTPases RecA-like NTPases. This family includes the NTP binding domain of F1 and V1 H+ATPases, DnaB and related helicases as well as bacterial RecA and related eukaryotic and archaeal recombinases. This group also includes bacterial conjugation proteins and related DNA transfer proteins involved in type II and type IV secretion. Probab=96.77 E-value=0.0037 Score=47.81 Aligned_cols=44 Identities=18% Similarity=0.400 Sum_probs=35.1 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHH Q ss_conf 59942878770389999999999679828996010589999998 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRH 465 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~ 465 (1775) ..|-||||||||+...+++.++...|.+|+||+-....-++..+ T Consensus 2 ~li~g~~g~GKttl~~~~~~~~~~~~~~~~~~~~ee~~~q~~~~ 45 (165) T cd01120 2 ILVFGPTGSGKTTLALQLALNIATKGGKVVYVDIEEEIEELTER 45 (165) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHH T ss_conf 89998999989999999999987639979999866644899999 No 112 >PRK01172 ski2-like helicase; Provisional Probab=96.77 E-value=0.0049 Score=46.56 Aligned_cols=54 Identities=24% Similarity=0.369 Sum_probs=45.0 Q ss_pred CCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHH-HHHHHHHCCCEEEEEECCHH Q ss_conf 77346899999851489559942878770389999-99999967982899601058 Q gi|254780229|r 404 DSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITN-IIAAAMLHGKKILFCAQKMA 458 (1775) Q Consensus 404 ~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaN-lIa~~la~Gk~VLfVaeK~a 458 (1775) +-.+.|..|+.++..|+|+|+--|+|+||| .|+. +|..++..|+|++|++=-.| T Consensus 22 ~l~p~Q~ea~~~~~~gkNllvsaPTgsGKT-lvAe~ai~~~l~~~~k~iyi~P~kA 76 (674) T PRK01172 22 ELYDHQRMAIEQLRKGENVIVSVPTAAGKT-LIAYSAIYETFLAGLKSIYIVPLRS 76 (674) T ss_pred CCCHHHHHHHHHHHCCCCEEEECCCCCCHH-HHHHHHHHHHHHHCCCEEEECCHHH T ss_conf 889899999999977995999789998699-9999999999985897999877899 No 113 >PRK06067 flagellar accessory protein FlaH; Validated Probab=96.76 E-value=0.0027 Score=49.36 Aligned_cols=53 Identities=25% Similarity=0.285 Sum_probs=38.2 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCC Q ss_conf 489559942878770389999999999679828996010589999998644358 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAG 471 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~g 471 (1775) .|...+|.||||||||---+..++.++.+|++|+|++--...-+. .++....| T Consensus 31 ~g~~~li~G~~G~GKt~~~~~f~~~~~~~g~~~~~~~~ee~~~~~-~~~~~~~g 83 (241) T PRK06067 31 FGSLILIEGENDTGKSVLSQQFVWGALNQGKRGLAITTENTSKSY-LKQMESLK 83 (241) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCHHHH-HHHHHHCC T ss_conf 990899980799887999999999998679829999942899999-99999839 No 114 >pfam05496 RuvB_N Holliday junction DNA helicase ruvB N-terminus. The RuvB protein makes up part of the RuvABC revolvasome which catalyses the resolution of Holliday junctions that arise during genetic recombination and DNA repair. Branch migration is catalysed by the RuvB protein that is targeted to the Holliday junction by the structure specific RuvA protein. This family contains the N-terminal region of the protein. Probab=96.76 E-value=0.0011 Score=53.52 Aligned_cols=24 Identities=50% Similarity=0.888 Sum_probs=21.3 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 5599428787703899999999996 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ++++-|||||||| |+|.|||..+- T Consensus 52 h~lf~GPPG~GKT-TlAriiAk~~~ 75 (234) T pfam05496 52 HVLLYGPPGLGKT-TLANIIANEMG 75 (234) T ss_pred EEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 2788789999888-99999998408 No 115 >KOG0250 consensus Probab=96.75 E-value=0.072 Score=34.41 Aligned_cols=27 Identities=30% Similarity=0.357 Sum_probs=18.0 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 14895599428787703899999999996 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ...-||| .||.|+||| -|.-+|+-+|. T Consensus 61 g~~vNfI-~G~NGSGKS-AIltAl~lglG 87 (1074) T KOG0250 61 GPRVNFI-VGNNGSGKS-AILTALTLGLG 87 (1074) T ss_pred CCCCEEE-ECCCCCCHH-HHHHHHHHHHC T ss_conf 8871376-468987489-99999998626 No 116 >cd01130 VirB11-like_ATPase Type IV secretory pathway component VirB11, and related ATPases. The homohexamer, VirB11 is one of eleven Vir proteins, which are required for T-pilus biogenesis and virulence in the transfer of T-DNA from the Ti (tumor-inducing) plasmid of bacterial to plant cells. The pilus is a fibrous cell surface organelle, which mediates adhesion between bacteria during conjugative transfer or between bacteria and host eukaryotic cells during infection. VirB11- related ATPases include the archaeal flagella biosynthesis protein and the pilus assembly proteins CpaF/TadA and TrbB. This alignment contains the C-terminal domain, which is the ATPase. Probab=96.74 E-value=0.0024 Score=49.76 Aligned_cols=46 Identities=28% Similarity=0.501 Sum_probs=38.2 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHH Q ss_conf 985148955994287877038999999999967982899601058999 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIE 461 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALd 461 (1775) ..++.++|++|-||||+||| |..|.++..+-...++. +-|...-+. T Consensus 20 ~~v~~~~nIlIsG~tGSGKT-Tll~al~~~i~~~~riv-tiEd~~El~ 65 (186) T cd01130 20 LAVEARKNILISGGTGSGKT-TLLNALLAFIPPDERII-TIEDTAELQ 65 (186) T ss_pred HHHHCCCCEEEECCCCCCHH-HHHHHHHHHCCCCCCEE-EECCCHHHC T ss_conf 99985998999899999899-99999996133456459-841535404 No 117 >pfam06068 TIP49 TIP49 C-terminus. This family consists of the C-terminal region of several eukaryotic and archaeal RuvB-like 1 (Pontin or TIP49a) and RuvB-like 2 (Reptin or TIP49b) proteins. The N-terminal domain contains the pfam00004 domain. In zebrafish, the liebeskummer (lik) mutation, causes development of hyperplastic embryonic hearts. lik encodes Reptin, a component of a DNA-stimulated ATPase complex. Beta-catenin and Pontin, a DNA-stimulated ATPase that is often part of complexes with Reptin, are in the same genetic pathways. The Reptin/Pontin ratio serves to regulate heart growth during development, at least in part via the beta-catenin pathway. TBP-interacting protein 49 (TIP49) was originally identified as a TBP-binding protein, and two related proteins are encoded by individual genes, tip49a and b. Although the function of this gene family has not been elucidated, they are supposed to play a critical role in nuclear events because they interact with various kinds of nuclear Probab=96.74 E-value=0.0028 Score=49.08 Aligned_cols=58 Identities=34% Similarity=0.695 Sum_probs=41.2 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHH-HHHH Q ss_conf 489559942878770389999999999679828996010589999998644358441365402661108999999-9999 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHKRAILDD-LRKR 496 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k~~vl~~-L~~~ 496 (1775) .|+.+++-|||||||| -|+=.||..| |..|-|++ +.+ .|+-|..-+|.++|.+ ++++ T Consensus 49 aGraiLlaGppGTGKT-AlA~aiakeL--G~~vPF~~--i~g-----------------SEvyS~E~kKTE~L~qafRrs 106 (395) T pfam06068 49 AGRAVLIAGPPGTGKT-ALAIAISKEL--GEDTPFCP--ISG-----------------SEVYSLEMKKTEALTQAFRKA 106 (395) T ss_pred CCCEEEEECCCCCCHH-HHHHHHHHHH--CCCCCEEE--EEH-----------------HEEECCCCCHHHHHHHHHHHH T ss_conf 7738998779998889-9999999974--87997345--001-----------------112125654889999999887 Q ss_pred H Q ss_conf 8 Q gi|254780229|r 497 I 497 (1775) Q Consensus 497 l 497 (1775) + T Consensus 107 I 107 (395) T pfam06068 107 I 107 (395) T ss_pred H T ss_conf 5 No 118 >cd01394 radB RadB. The archaeal protein radB shares similarity radA, the archaeal functional homologue to the bacterial RecA. The precise function of radB is unclear. Probab=96.74 E-value=0.0067 Score=45.19 Aligned_cols=39 Identities=28% Similarity=0.532 Sum_probs=31.6 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 489559942878770389999999999679828996010 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQK 456 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK 456 (1775) .|.-..|-||||+||||--..+.+++...|++|+|++-- T Consensus 18 ~G~it~i~G~pG~GKStl~lq~a~~~~~~g~~v~YidtE 56 (218) T cd01394 18 RGTVTQVYGPPGTGKTNIAIQLAVETAGQGKKVAYIDTE 56 (218) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 887999989999849999999999986369869999665 No 119 >pfam06745 KaiC KaiC. This family represents a conserved region within bacterial and archaeal proteins, most of which are hypothetical. More than one copy is sometimes found in each protein. This family includes KaiC, which is one of the Kai proteins among which direct protein-protein association may be a critical process in the generation of circadian rhythms in cyanobacteria. Probab=96.74 E-value=0.0033 Score=48.41 Aligned_cols=54 Identities=20% Similarity=0.386 Sum_probs=38.3 Q ss_pred HCCCCEEEECCCCCCCHHHHH-HHHHHHH-HCCCEEEEEECCHHHHHHHHHHHHHCCC Q ss_conf 148955994287877038999-9999999-6798289960105899999986443584 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTIT-NIIAAAM-LHGKKILFCAQKMAAIEVVRHRLEKAGL 472 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIa-NlIa~~l-a~Gk~VLfVaeK~aALdVV~~RL~~~gL 472 (1775) -.|...+|.|||||||| |++ ..+++.+ -.|++||||+-....-++ .++....|. T Consensus 17 ~~gs~~LI~G~pGsGKT-~la~qfl~~ga~~~ge~~lYis~ee~~~~l-~~~~~~~g~ 72 (231) T pfam06745 17 PEGRVVLITGGPGTGKT-IFGLQFLYNGALEYGEPGVYVTLEEPPEDL-RENAKSFGW 72 (231) T ss_pred CCCEEEEEECCCCCCHH-HHHHHHHHHHHHHCCCCEEEEEECCCHHHH-HHHHHHCCC T ss_conf 29969999858972599-999999999998658968999813799999-999998299 No 120 >PRK13946 shikimate kinase; Provisional Probab=96.71 E-value=0.0026 Score=49.39 Aligned_cols=39 Identities=23% Similarity=0.370 Sum_probs=35.7 Q ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 4689999985148955994287877038999999999967 Q gi|254780229|r 407 SSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 407 sSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) .||..-|.+.+.++|+|+.|+||+||| ||...+|..|.- T Consensus 8 ~~~~~~~~~~l~kknIvLIG~mGsGKS-tvGk~LA~~L~~ 46 (195) T PRK13946 8 ASQEEQIRAALGKRTVVLVGLMGAGKS-TVGRRLATMLGL 46 (195) T ss_pred HCHHHHHHHHHCCCCEEEECCCCCCHH-HHHHHHHHHHCC T ss_conf 455999999858995899899999889-999999999797 No 121 >PRK13531 regulatory ATPase RavA; Provisional Probab=96.71 E-value=0.002 Score=50.71 Aligned_cols=39 Identities=26% Similarity=0.542 Sum_probs=32.8 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 8999998514895599428787703899999999996798 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGK 448 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk 448 (1775) =..++.++..|.++++-|||||||| .|+.-++.++-.|+ T Consensus 29 i~l~lLaalagehvlllGPPGtAKS-~larrl~~~~~~a~ 67 (498) T PRK13531 29 IRLCLLAALSGESVFLLGPPGIAKS-LIARRLKFAFQHAR 67 (498) T ss_pred HHHHHHHHHCCCCEEEECCCCHHHH-HHHHHHHHHHCCCH T ss_conf 9999999972894698889951388-99999999855740 No 122 >pfam03796 DnaB_C DnaB-like helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the Escherichia coli chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=96.70 E-value=0.0043 Score=47.15 Aligned_cols=52 Identities=29% Similarity=0.437 Sum_probs=42.1 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEECCHHHHHHHHHHH Q ss_conf 5148955994287877038999999999-967982899601058999999864 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQKMAAIEVVRHRL 467 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVaeK~aALdVV~~RL 467 (1775) ...|.=.||-|+||.|||....|+..++ +..|++|||+|--|..-++..+-+ T Consensus 16 l~~G~l~vi~g~pg~GKS~~~~~~a~~~a~~~g~~Vl~~slEm~~~~~~~R~~ 68 (186) T pfam03796 16 LQKGDLIIIAARPSMGKTAFALNIARNAALKQDKPVLFFSLEMSAEQLAERLL 68 (186) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHH T ss_conf 88881799996799987999999999999970996687547552999999999 No 123 >PRK02362 ski2-like helicase; Provisional Probab=96.70 E-value=0.011 Score=42.95 Aligned_cols=62 Identities=24% Similarity=0.346 Sum_probs=45.8 Q ss_pred CHHHHHHHHH-HHCCCCEEEECCCCCCCHHHHHHH-HHHHHHCCCEEEEEECCHHHHHHHHHHHH Q ss_conf 3468999998-514895599428787703899999-99999679828996010589999998644 Q gi|254780229|r 406 DSSQHSAVID-VINGKNLIIEGPPGTGKSQTITNI-IAAAMLHGKKILFCAQKMAAIEVVRHRLE 468 (1775) Q Consensus 406 DsSQ~~av~~-~~~g~~~vi~GPPGTGKSQTIaNl-Ia~~la~Gk~VLfVaeK~aALdVV~~RL~ 468 (1775) -+-|..||.. +..|+|+|+.-|+|+||| .|+.+ |..++..|+||+|++=-.|=..-.++.+. T Consensus 25 yp~Q~eal~~gl~~g~NlvvsaPTgsGKT-lvAElail~~l~~g~k~vYi~P~kALa~EK~~~~~ 88 (736) T PRK02362 25 YPPQAEAVEAGLLEGKNLLAAIPTASGKT-LLAELAMLKAIAEGGKALYIVPLRALASEKFEEFS 88 (736) T ss_pred CHHHHHHHHHHHCCCCCEEEECCCCCCHH-HHHHHHHHHHHHHCCEEEEECCHHHHHHHHHHHHH T ss_conf 89999999864356981899799998589-99999999999839979998587999999999999 No 124 >TIGR03499 FlhF flagellar biosynthetic protein FlhF. Probab=96.70 E-value=0.0078 Score=44.51 Aligned_cols=56 Identities=27% Similarity=0.520 Sum_probs=40.2 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHC-C-CEEEEEE---CCHHHHHHHHHHHHHCCCC Q ss_conf 48955994287877038999999999967-9-8289960---1058999999864435844 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLH-G-KKILFCA---QKMAAIEVVRHRLEKAGLG 473 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~-G-k~VLfVa---eK~aALdVV~~RL~~~gLg 473 (1775) .|+-+++.||+|.|||.||+-|-|++... | ++|-|++ ...+|.+-.+.--+-.|+. T Consensus 193 ~~~vi~lvGPTGVGKTTTiAKLAa~~~l~~~~~~V~lIT~DtyRigA~eQLk~ya~il~vp 253 (282) T TIGR03499 193 QGGVIALVGPTGVGKTTTLAKLAARFVLEHGKKKVALITTDTYRIGAVEQLKTYAKILGVP 253 (282) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHHCCE T ss_conf 6727999778887578899999999999738996799980777678999999999995974 No 125 >PRK13826 Dtr system oriT relaxase; Provisional Probab=96.70 E-value=0.011 Score=43.03 Aligned_cols=60 Identities=20% Similarity=0.212 Sum_probs=46.4 Q ss_pred CCHHHHHHHHHHHCCCCE-EEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHH Q ss_conf 734689999985148955-994287877038999999999967982899601058999999 Q gi|254780229|r 405 SDSSQHSAVIDVINGKNL-IIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVR 464 (1775) Q Consensus 405 aDsSQ~~av~~~~~g~~~-vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~ 464 (1775) -+..|..||..+..+..+ +|+|..|||||..+.-+=-..=+.|-+|.=.|-.-.|.+.+. T Consensus 382 Ls~EQ~~Av~hiT~~~~Ia~VvG~AGaGKStmL~aAReawEa~GyrV~GaALsGkAAegLe 442 (1102) T PRK13826 382 LSDEQKTAIEHVAGAERIAAVIGRAGAGKTTMMKAAREAWEAAGYRVVGGALAGKAAEGLE 442 (1102) T ss_pred CCHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHH T ss_conf 8999999999853788668998428887889999999999977977980150078999775 No 126 >PRK12726 flagellar biosynthesis regulator FlhF; Provisional Probab=96.69 E-value=0.0024 Score=49.75 Aligned_cols=60 Identities=18% Similarity=0.309 Sum_probs=46.0 Q ss_pred HHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC---CHHHHHHHHHHHHHCCC Q ss_conf 9985148955994287877038999999999967982899601---05899999986443584 Q gi|254780229|r 413 VIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ---KMAAIEVVRHRLEKAGL 472 (1775) Q Consensus 413 v~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae---K~aALdVV~~RL~~~gL 472 (1775) ..+...|+=+.+.||.|+|||.||+-|-+.+..+|++|++||- ..||.|-++-==...|+ T Consensus 200 ~~~l~~g~VIaLVGvnGvGKTTTiAKLA~~l~~~gkkV~LVAaDTFRaAAiEQLk~~g~rlgV 262 (407) T PRK12726 200 SFDLSNHRIISLIGQTGVGKTTTLVKLGWQLLKQNRTVGFITTDTFRSGAVEQFQGYADKLDV 262 (407) T ss_pred EEEECCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCC T ss_conf 023036908999899989789999999999997799179997066778899999999999796 No 127 >TIGR00750 lao LAO/AO transport system ATPase; InterPro: IPR005129 Bacterial periplasmic transport systems require the function of a specific substrate-binding protein, located in the periplasm, and several cytoplasmic membrane transport components. In Escherichia coli K-12, the arginine-ornithine transport system requires an arginine-ornithine-binding protein and the lysine-arginine-ornithine (LAO) transport system includes a LAO-binding protein. Both periplasmic proteins can be phosphorylated by a single kinase, ArgK resulting in reduced levels of transport activity of the periplasmic transport systems that include each of the binding proteins. The ArgK protein acts as an ATPase enzyme and as a kinase.. Probab=96.64 E-value=0.0066 Score=45.28 Aligned_cols=49 Identities=27% Similarity=0.331 Sum_probs=43.6 Q ss_pred CHHHHHHHHHHHCCCCEE--EECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 346899999851489559--9428787703899999999996798289960 Q gi|254780229|r 406 DSSQHSAVIDVINGKNLI--IEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 406 DsSQ~~av~~~~~g~~~v--i~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) -|.|+.-=.-...|+-.| |+|+||-|||.-|--+|.++.-+|++|=++| T Consensus 23 ~a~~ll~~i~p~~GnA~~vG~TG~PGaGKSTl~~~l~~~lrRrG~~VaViA 73 (333) T TIGR00750 23 LAKELLERILPKTGNAHVVGITGVPGAGKSTLVEKLIMELRRRGLKVAVIA 73 (333) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 999999986243279078766468888577799999899976597689998 No 128 >cd03279 ABC_sbcCD SbcCD and other Mre11/Rad50 (MR) complexes are implicated in the metabolism of DNA ends. They cleave ends sealed by hairpin structures and are thought to play a role in removing protein bound to DNA termini. Probab=96.64 E-value=0.0017 Score=51.29 Aligned_cols=27 Identities=48% Similarity=0.746 Sum_probs=24.4 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHHCCCE Q ss_conf 55994287877038999999999967982 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAMLHGKK 449 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la~Gk~ 449 (1775) =|+|.||.|+||| ||.|+|+-||| ||+ T Consensus 30 lflI~G~nGsGKS-TIlDAI~~aLY-Gk~ 56 (213) T cd03279 30 LFLICGPTGAGKS-TILDAITYALY-GKT 56 (213) T ss_pred EEEEECCCCCCHH-HHHHHHHHHHC-CCC T ss_conf 8999889999788-99999999983-888 No 129 >cd00268 DEADc DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region. Probab=96.61 E-value=0.035 Score=37.64 Aligned_cols=76 Identities=16% Similarity=0.167 Sum_probs=49.7 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHH-H----CCCEEEEEECCHHHHHHHHHHHHHC--CCCCEEEECC Q ss_conf 6899999851489559942878770389999999999-6----7982899601058999999864435--8441365402 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAM-L----HGKKILFCAQKMAAIEVVRHRLEKA--GLGEFCLELH 480 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~l-a----~Gk~VLfVaeK~aALdVV~~RL~~~--gLg~~cleLH 480 (1775) =|..||-.+.+|+|++++.|+|||||-+-+=-|-+.+ . .|-++|+++-.+.-..-+++.+... +++--|.-++ T Consensus 25 IQ~~~ip~il~g~dvi~~a~TGSGKTlay~lpil~~l~~~~~~~~~~alil~PTrELa~Qi~~~~~~l~~~~~i~~~~~~ 104 (203) T cd00268 25 IQARAIPPLLSGRDVIGQAQTGSGKTAAFLIPILEKLDPSPKKDGPQALILAPTRELALQIAEVARKLGKHTNLKVVVIY 104 (203) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCEEEEEEE T ss_conf 99999999977998899757997222888869999986166768966999968799999999999985057983899983 Q ss_pred CCC Q ss_conf 661 Q gi|254780229|r 481 SHK 483 (1775) Q Consensus 481 s~k 483 (1775) ++. T Consensus 105 gg~ 107 (203) T cd00268 105 GGT 107 (203) T ss_pred CCC T ss_conf 898 No 130 >pfam05970 DUF889 PIF1 helicase. The PIF1 helicase inhibits telomerase activity and is cell cycle regulated. Probab=96.59 E-value=0.0022 Score=50.26 Aligned_cols=40 Identities=38% Similarity=0.548 Sum_probs=31.1 Q ss_pred CCCEEEECCHHHCCHH--HHHHHHHH------------CCEEEEEECCCCCCCC Q ss_conf 4107886063236989--98898861------------1828996077866862 Q gi|254780229|r 1361 KFDLVIMDESSQIKPE--DALGVIAR------------GKQIVVVGDPKQLPPT 1400 (1775) Q Consensus 1361 ~FD~VI~DEASQ~~~~--~al~al~r------------akqvvvvGD~kQlpPt 1400 (1775) .-|++||||.|-+... +.|-.++| |.|||++||--||||. T Consensus 74 ~~~vLIIDEiSMv~~~lfd~id~~lr~i~~~~~~~PFGGiqvIl~GDf~QLPPV 127 (418) T pfam05970 74 KASLIIWDEAPMTHRHCFEALDRTLRDILKNTDDKPFGGKTVVLGGDFRQILPV 127 (418) T ss_pred HCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCHHHCCCC T ss_conf 087999854113578999999999999871278767797479982447655875 No 131 >COG1703 ArgK Putative periplasmic protein kinase ArgK and related GTPases of G3E family [Amino acid transport and metabolism] Probab=96.57 E-value=0.0073 Score=44.77 Aligned_cols=47 Identities=23% Similarity=0.319 Sum_probs=39.2 Q ss_pred HHHHHHHHHHCCCCEE--EECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 6899999851489559--9428787703899999999996798289960 Q gi|254780229|r 408 SQHSAVIDVINGKNLI--IEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 408 SQ~~av~~~~~g~~~v--i~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) -|+.--.....|+..| |.||||.|||..|--+|.++.-+|+||-+.| T Consensus 38 ~~ll~~l~p~tG~a~viGITG~PGaGKSTli~~L~~~l~~~G~rVaVlA 86 (323) T COG1703 38 RELLRALYPRTGNAHVIGITGVPGAGKSTLIEALGRELRERGHRVAVLA 86 (323) T ss_pred HHHHHHHHHCCCCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 9999997311799837873179988668899999999997796789999 No 132 >TIGR00631 uvrb excinuclease ABC, B subunit; InterPro: IPR004807 All proteins in this family for which functions are known are DNA helicases that function in the nucleotide excision repair and are endonucleases that make the 3' incision next to DNA damage. They are part of a pathway requiring UvrA, UvrB, UvrC, and UvrD homologs.; GO: 0009381 excinuclease ABC activity, 0006289 nucleotide-excision repair, 0005737 cytoplasm, 0009380 excinuclease repair complex. Probab=96.56 E-value=0.0046 Score=46.87 Aligned_cols=44 Identities=30% Similarity=0.441 Sum_probs=35.4 Q ss_pred HHHHHHH----HHCCCCE-EEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 8999998----5148955-994287877038999999999967982899601 Q gi|254780229|r 409 QHSAVID----VINGKNL-IIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ 455 (1775) Q Consensus 409 Q~~av~~----~~~g~~~-vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae 455 (1775) |=.||.. ...|.-+ ++=|=+|||||-||||+|+.. ||-.|++|. T Consensus 14 QP~AI~~L~~~l~~G~~~QtLLGvTGsGKTFT~AnVIa~~---~rPTLV~aH 62 (667) T TIGR00631 14 QPKAIAKLVEGLEAGEKEQTLLGVTGSGKTFTMANVIAQV---QRPTLVLAH 62 (667) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHH---CCCEEEECC T ss_conf 1899999999985688714785321486278898999984---798499857 No 133 >COG0497 RecN ATPase involved in DNA repair [DNA replication, recombination, and repair] Probab=96.53 E-value=0.095 Score=33.15 Aligned_cols=55 Identities=25% Similarity=0.402 Sum_probs=36.1 Q ss_pred CCCE-EEECCCCCCCHHHHHHHHHHHHHC---------CC-----EEEEEECCHHHHHHHHHHHHHCCCCC--EEEE Q ss_conf 8955-994287877038999999999967---------98-----28996010589999998644358441--3654 Q gi|254780229|r 419 GKNL-IIEGPPGTGKSQTITNIIAAAMLH---------GK-----KILFCAQKMAAIEVVRHRLEKAGLGE--FCLE 478 (1775) Q Consensus 419 g~~~-vi~GPPGTGKSQTIaNlIa~~la~---------Gk-----~VLfVaeK~aALdVV~~RL~~~gLg~--~cle 478 (1775) ++.| ||.|-+|+||| -|.+.|..|+.. |. ...|.-. ...++.-|.+.|+.+ .|+- T Consensus 21 ~~GltVlTGETGAGKS-IiidAl~lllG~ra~~~~VR~G~~~a~v~a~F~~~----~~~~~~~L~e~gie~~~~iil 92 (557) T COG0497 21 EKGLTVLTGETGAGKS-IIIDALGLLLGGRADASLVRHGAKRAEVEAIFDLD----NPPARAWLEENGIEDDEEVIL 92 (557) T ss_pred CCCCEEEECCCCCCHH-HHHHHHHHHHCCCCCCCHHCCCCCEEEEEEEECCC----CHHHHHHHHHCCCCCCCCEEE T ss_conf 6895686568887566-79989999727877811211788606899985479----668899999759997675799 No 134 >pfam01078 Mg_chelatase Magnesium chelatase, subunit ChlI. Magnesium-chelatase is a three-component enzyme that catalyses the insertion of Mg2+ into protoporphyrin IX. This is the first unique step in the synthesis of (bacterio)chlorophyll. Due to this, it is thought that Mg-chelatase has an important role in channelling inter- mediates into the (bacterio)chlorophyll branch in response to conditions suitable for photosynthetic growth. ChlI and BchD have molecular weight between 38-42 kDa. Probab=96.53 E-value=0.0019 Score=50.87 Aligned_cols=47 Identities=28% Similarity=0.531 Sum_probs=33.8 Q ss_pred HHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHH Q ss_conf 9999985148955994287877038999999999967982899601058999999 Q gi|254780229|r 410 HSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVR 464 (1775) Q Consensus 410 ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~ 464 (1775) .+..++|+.|.|+++.|||||||| +++..|...|=. -....+++|.+ T Consensus 13 rAl~iAaaG~H~lLl~GpPG~GKT-mlA~rl~~iLP~-------l~~~e~le~~~ 59 (207) T pfam01078 13 RALEIAAAGGHNLLMIGPPGSGKT-MLAKRLPGILPP-------LTEQEALEVTA 59 (207) T ss_pred HHHHHHHCCCCCEEEECCCCCCHH-HHHHHHHCCCCC-------CCHHHHHHHHH T ss_conf 999998547875897889980299-999763014899-------87899887776 No 135 >PRK11176 lipid transporter ATP-binding/permease protein; Provisional Probab=96.52 E-value=0.038 Score=37.30 Aligned_cols=14 Identities=36% Similarity=0.508 Sum_probs=9.0 Q ss_pred HHHHHCCEEEEEEC Q ss_conf 98861182899607 Q gi|254780229|r 1380 GVIARGKQIVVVGD 1393 (1775) Q Consensus 1380 ~al~rakqvvvvGD 1393 (1775) --|-.|+.+-+||. T Consensus 363 l~I~~G~~vaiVG~ 376 (581) T PRK11176 363 FKIPAGKTVALVGR 376 (581) T ss_pred CCCCCCCEEECCCC T ss_conf 35799944312289 No 136 >PRK06921 hypothetical protein; Provisional Probab=96.50 E-value=0.005 Score=46.48 Aligned_cols=36 Identities=25% Similarity=0.544 Sum_probs=27.4 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHH-HHC-CCEEEEEE Q ss_conf 48955994287877038999999999-967-98289960 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAA-MLH-GKKILFCA 454 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~-la~-Gk~VLfVa 454 (1775) ...++++.||||||||+ +++.||.. |-+ |.+|||+. T Consensus 115 ~~~~l~f~G~~G~GKTh-La~aIa~~Ll~~~~~~Vly~~ 152 (265) T PRK06921 115 RKNSIALLGQPGSGKTH-LLTAAANELMRKKGVPVLYFP 152 (265) T ss_pred CCCCEEEECCCCCCHHH-HHHHHHHHHHHHCCCEEEEEE T ss_conf 76627997289898899-999999999996297199988 No 137 >COG1061 SSL2 DNA or RNA helicases of superfamily II [Transcription / DNA replication, recombination, and repair] Probab=96.50 E-value=0.018 Score=40.80 Aligned_cols=49 Identities=16% Similarity=0.217 Sum_probs=28.3 Q ss_pred EEEECHHHHHHH--CC-CCCCCCCEEEECCHHHCCHHH--HHHHHHHCCEEEEEE Q ss_conf 899257799872--58-522541078860632369899--889886118289960 Q gi|254780229|r 1343 CFMMSPMSVANY--LE-PKDVKFDLVIMDESSQIKPED--ALGVIARGKQIVVVG 1392 (1775) Q Consensus 1343 ~~~msp~sva~~--l~-~~~~~FD~VI~DEASQ~~~~~--al~al~rakqvvvvG 1392 (1775) +-+.|-.+++.. +. .....||+|||||+=+++-.. .+..+..+..- +.| T Consensus 125 i~vat~qtl~~~~~l~~~~~~~~~liI~DE~Hh~~a~~~~~~~~~~~~~~~-~LG 178 (442) T COG1061 125 VTVATVQTLARRQLLDEFLGNEFGLIIFDEVHHLPAPSYRRILELLSAAYP-RLG 178 (442) T ss_pred EEEEEEHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCE-EEE T ss_conf 899983897641555540356667599975245784779999997510310-467 No 138 >PRK04195 replication factor C large subunit; Provisional Probab=96.48 E-value=0.0022 Score=50.14 Aligned_cols=24 Identities=38% Similarity=0.822 Sum_probs=21.1 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 9559942878770389999999999 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAM 444 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~l 444 (1775) +.+++.|||||||| |++.+||.-+ T Consensus 41 k~lLL~GPpGvGKT-T~a~~lAk~~ 64 (403) T PRK04195 41 KALLLYGPPGVGKT-SLAHALANDY 64 (403) T ss_pred CEEEEECCCCCCHH-HHHHHHHHHH T ss_conf 46998893998799-9999999984 No 139 >KOG0989 consensus Probab=96.48 E-value=0.0038 Score=47.76 Aligned_cols=41 Identities=29% Similarity=0.305 Sum_probs=30.6 Q ss_pred CCCCCHHHHHHHHHH------HCCCCEEEECCCCCCCHHHHHHHHHH Q ss_conf 667734689999985------14895599428787703899999999 Q gi|254780229|r 402 IDDSDSSQHSAVIDV------INGKNLIIEGPPGTGKSQTITNIIAA 442 (1775) Q Consensus 402 ~~~aDsSQ~~av~~~------~~g~~~vi~GPPGTGKSQTIaNlIa~ 442 (1775) -+|....|..+|... +.+.+++.+||||||||-||--.-.+ T Consensus 34 t~de~~gQe~vV~~L~~a~~~~~lp~~LFyGPpGTGKTStalafar~ 80 (346) T KOG0989 34 TFDELAGQEHVVQVLKNALLRRILPHYLFYGPPGTGKTSTALAFARA 80 (346) T ss_pred CHHHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHH T ss_conf 37765015999999999986068860786689998676899999998 No 140 >pfam10412 TrwB_AAD_bind Type IV secretion-system coupling protein DNA-binding domain. The plasmid conjugative coupling protein TrwB forms hexamers from six structurally very similar protomers. This hexamer contains a central channel running from the cytosolic pole (made up by the AADs) to the membrane pole ending at the transmembrane pore shaped by 12 transmembrane helices, rendering an overall mushroom-like structure. The TrwB_AAD (all-alpha domain) domain appears to be the DNA-binding domain of the structure. TrwB, a basic integral inner-membrane nucleoside-triphosphate-binding protein, is the structural prototype for the type IV secretion system coupling proteins, a family of proteins essential for macromolecular transport between cells and export. Probab=96.48 E-value=0.0049 Score=46.63 Aligned_cols=40 Identities=20% Similarity=0.470 Sum_probs=34.0 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 1489559942878770389999999999679828996010 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQK 456 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK 456 (1775) ..-+++.|.||||+|||..|.++|..++..|.++.++=-| T Consensus 13 ~~~~H~lviG~tGsGKT~~i~~~i~~~~~~~~s~iv~DpK 52 (386) T pfam10412 13 SETQHILIVGTTGTGKTQALRELLDQIRARGDRAIIYDPT 52 (386) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 7767589988999988879999999999779919999587 No 141 >PRK06696 uridine kinase; Validated Probab=96.47 E-value=0.0057 Score=45.90 Aligned_cols=41 Identities=22% Similarity=0.430 Sum_probs=32.2 Q ss_pred HHHHHCCCCEE--EECCCCCCCHHHHHHHHHHHHHC-CCEEEEEE Q ss_conf 99851489559--94287877038999999999967-98289960 Q gi|254780229|r 413 VIDVINGKNLI--IEGPPGTGKSQTITNIIAAAMLH-GKKILFCA 454 (1775) Q Consensus 413 v~~~~~g~~~v--i~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVa 454 (1775) |....-|+-|+ |-||||+||| |++|.|+..|.. |..|+.++ T Consensus 18 i~~~~p~rpl~VgIdG~~gSGKT-TlA~~La~~L~~~G~~V~~v~ 61 (227) T PRK06696 18 ILTLNLTRPLRVAIDGITASGKT-TFANELAEEIKKRGRPVIRAS 61 (227) T ss_pred HHHCCCCCCEEEEEECCCCCCHH-HHHHHHHHHHHHCCCCEEEEC T ss_conf 98359998689997789987879-999999999974699489971 No 142 >PRK06995 flhF flagellar biosynthesis regulator FlhF; Validated Probab=96.47 E-value=0.005 Score=46.46 Aligned_cols=48 Identities=27% Similarity=0.535 Sum_probs=37.5 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHH-CCC-EEEEEE---CCHHHHHHHHH Q ss_conf 4895599428787703899999999996-798-289960---10589999998 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAML-HGK-KILFCA---QKMAAIEVVRH 465 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk-~VLfVa---eK~aALdVV~~ 465 (1775) .|+=|.+.||+|-|||.||+-|-|.+.. +|+ +|-+++ ...+|.+-++. T Consensus 175 ~ggV~alVGPTGVGKTTTiAKLAAr~~l~~g~~kVaLIT~DTYRIgAvEQLkt 227 (404) T PRK06995 175 RGGVFALVGPTGVGKTTTTAKLAARCVMRHGASKVALLTTDSYRIGGHEQLRI 227 (404) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHH T ss_conf 47558986688876375899999999998389837999768754789999999 No 143 >PRK00411 cdc6 cell division control protein 6; Reviewed Probab=96.45 E-value=0.0069 Score=45.04 Aligned_cols=36 Identities=28% Similarity=0.446 Sum_probs=24.6 Q ss_pred HHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 9999985148955994287877038999999999967 Q gi|254780229|r 410 HSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 410 ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) +..+..-..+.|+.|-|||||||| +.+.-++.-+-. T Consensus 46 l~~~l~g~~~~n~~I~G~pGTGKT-~~vk~v~~~l~~ 81 (394) T PRK00411 46 LRPALRGSRPSNVLILGPPGTGKT-TTVKKVFEELEE 81 (394) T ss_pred HHHHHCCCCCCCEEEECCCCCCHH-HHHHHHHHHHHH T ss_conf 999975999984799889999899-999999999997 No 144 >cd00984 DnaB_C DnaB helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=96.44 E-value=0.008 Score=44.38 Aligned_cols=53 Identities=26% Similarity=0.421 Sum_probs=42.0 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEECCHHHHHHHHHHHH Q ss_conf 51489559942878770389999999999-679828996010589999998644 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCAQKMAAIEVVRHRLE 468 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVaeK~aALdVV~~RL~ 468 (1775) ...|.=.||-|+||.|||....|++.++. -.|++|+|.|--|..-+++.+-+. T Consensus 10 ~~~G~L~vi~a~~g~GKS~~~~~la~~~a~~~g~~V~~~SlEm~~~~~~~R~~s 63 (242) T cd00984 10 LQPGDLIIIAARPSMGKTAFALNIAENIAKKQGKPVLFFSLEMSKEQLLQRLLA 63 (242) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHH T ss_conf 999818999968999999999999999999779959999333538899999999 No 145 >PRK12402 replication factor C small subunit 2; Reviewed Probab=96.43 E-value=0.0044 Score=47.04 Aligned_cols=36 Identities=28% Similarity=0.439 Sum_probs=27.7 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 8999998514895599428787703899999999996 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ++........=.++++.|||||||| |.+-++|..|. T Consensus 26 ~L~~~~~~~~~phlLf~GPpG~GKT-t~A~~lA~~l~ 61 (337) T PRK12402 26 HLSALAASGNLPHLVVYGPSGSGKT-AAVRALARELY 61 (337) T ss_pred HHHHHHHCCCCCEEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 9999997799876988892984899-99999999967 No 146 >TIGR03158 cas3_cyano CRISPR-associated helicase, Cyano-type. subtype of CRISPR/Cas locus, found in several species of Cyanobacteria and several archaeal species. It contains helicase motifs and appears to represent the Cas3 protein of the Cyano subtype of CRISPR/Cas system. Probab=96.43 E-value=0.018 Score=40.68 Aligned_cols=11 Identities=45% Similarity=0.573 Sum_probs=5.5 Q ss_pred CCEEEECCHHH Q ss_conf 10788606323 Q gi|254780229|r 1362 FDLVIMDESSQ 1372 (1775) Q Consensus 1362 FD~VI~DEASQ 1372 (1775) |+.|||||.=- T Consensus 146 l~~VViDEiH~ 156 (357) T TIGR03158 146 FSTVIFDEFHL 156 (357) T ss_pred CCEEEEEEEHC T ss_conf 67477562000 No 147 >PRK11823 DNA repair protein RadA; Provisional Probab=96.42 E-value=0.02 Score=40.17 Aligned_cols=73 Identities=32% Similarity=0.477 Sum_probs=43.8 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHH Q ss_conf 8955994287877038999999999967982899601058999999864435844136540266110899999999 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHKRAILDDLR 494 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k~~vl~~L~ 494 (1775) |.-.+|-|.||-||| |+.=-||..++.|++||+||---.+-. |+.|-+..|+..--+.|-+. ++-..+++.+. T Consensus 90 GS~iLlgGePGIGKS-TLlLQ~a~~la~~~~vLYvSGEES~~Q-ik~RA~RLg~~~~~l~l~~e-t~l~~Il~~i~ 162 (454) T PRK11823 90 GSVVLIGGDPGIGKS-TLLLQVAAALAAGGKVLYVSGEESLQQ-IKLRAERLGLPSDNLYLLAE-TNLEDILATIE 162 (454) T ss_pred CEEEEECCCCCCCHH-HHHHHHHHHHHCCCCEEEECCCHHHHH-HHHHHHHHCCCCCCCEEEEC-CCHHHHHHHHH T ss_conf 648995079988899-999999999855995799815015789-99999975888887378853-68999999998 No 148 >PRK11664 ATP-dependent RNA helicase HrpB; Provisional Probab=96.42 E-value=0.016 Score=41.37 Aligned_cols=62 Identities=19% Similarity=0.366 Sum_probs=34.1 Q ss_pred CCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHH-HHHHHHHHCCCEEEEEECCHHHHHHHHHHH Q ss_conf 734689999985148955994287877038999-999999967982899601058999999864 Q gi|254780229|r 405 SDSSQHSAVIDVINGKNLIIEGPPGTGKSQTIT-NIIAAAMLHGKKILFCAQKMAAIEVVRHRL 467 (1775) Q Consensus 405 aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIa-NlIa~~la~Gk~VLfVaeK~aALdVV~~RL 467 (1775) .++-=...+.+..++..+||+||||+|||.-|- .+..+....| +|+.+-=.+-|.--+-+|+ T Consensus 6 i~~~~~~i~~~l~~~~~~vl~a~tGsGKtTqvP~~ll~~~~~~g-~I~~~qPRR~AA~s~A~Rv 68 (812) T PRK11664 6 VAAVLPELLTALKTAPQVLLKAPTGAGKSTWLPLQLLQQGGING-KIIMLEPRRLAARNVAQRL 68 (812) T ss_pred HHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHCCCCCC-CEEEECCHHHHHHHHHHHH T ss_conf 47899999999997997999908999989999999996468899-3899388399999999999 No 149 >PRK13407 bchI magnesium chelatase subunit I; Provisional Probab=96.41 E-value=0.0042 Score=47.26 Aligned_cols=44 Identities=25% Similarity=0.312 Sum_probs=32.1 Q ss_pred CCCCCHHHHHHHHHH--HCCCCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 667734689999985--148955994287877038999999999967 Q gi|254780229|r 402 IDDSDSSQHSAVIDV--INGKNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 402 ~~~aDsSQ~~av~~~--~~g~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) |+-=+.--.+-++++ ..+.+++|.|||||||| |++-.++..|-. T Consensus 10 IvGQe~~K~AL~laav~p~~ggvLi~G~~GtgKS-tlaR~l~~iLP~ 55 (334) T PRK13407 10 IVGQEEMKQAMVLTAIDPGIGGVLVFGDRGTGKS-TAVRALAALLPL 55 (334) T ss_pred HCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCHH-HHHHHHHHHCCC T ss_conf 6493999999999772789860899789986599-999999972899 No 150 >TIGR01054 rgy reverse gyrase; InterPro: IPR005736 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA. Reverse gyrase is a type IA topoisomerase that is unique among these enzymes in its requirement for ATP. Reverse gyrase is a hyperthermophile-specific enzyme that acts as a renaturase by positively supercoiling DNA, and by annealing complementary single-strand circles . Hyperthermophilic organisms must protect themselves against heat-induced degradation, and reverse gyrase acts to reduce the rate of double-strand DNA breakage, a function that does not require ATP hydrolysis and which is independent of its positive supercoiling abilities. Reverse gyrase achieves this by recognising nicked DNA and recruiting a protein coat to the site of damage . More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003916 DNA topoisomerase activity, 0006265 DNA topological change, 0006268 DNA unwinding during replication, 0005694 chromosome. Probab=96.39 E-value=0.024 Score=39.35 Aligned_cols=86 Identities=21% Similarity=0.307 Sum_probs=67.9 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC--CCEEEEEECCHHHHHHHHHHHHHCCC----CCE--EEEC Q ss_conf 689999985148955994287877038999999999967--98289960105899999986443584----413--6540 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH--GKKILFCAQKMAAIEVVRHRLEKAGL----GEF--CLEL 479 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~--Gk~VLfVaeK~aALdVV~~RL~~~gL----g~~--cleL 479 (1775) -|.+=--.+..|+||.|.-|+|=||| |--=+.|..||. ||||+++==+..=..-|+.||.+.+- +-- ..-. T Consensus 88 ~Qk~WAKRv~~~~SFai~APTGVGKt-tFG~~mslflA~kKGkR~y~ilPT~lLv~Qv~~kl~~~~~k~g~~~~~l~~~y 166 (1843) T TIGR01054 88 IQKSWAKRVLKGDSFAIVAPTGVGKT-TFGLVMSLFLAKKKGKRSYIILPTTLLVKQVAEKLSSLAEKEGVGIVRLLGAY 166 (1843) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCHH-HHHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCEEEEEEE T ss_conf 67999999641796489805887677-99999999986542987899947078899999998752002575000022210 Q ss_pred CCCC--HHHHHHHHHHH Q ss_conf 2661--10899999999 Q gi|254780229|r 480 HSHK--VHKRAILDDLR 494 (1775) Q Consensus 480 Hs~k--a~k~~vl~~L~ 494 (1775) ||-- ..|+++.+++. T Consensus 167 hS~L~~~~kke~~Eri~ 183 (1843) T TIGR01054 167 HSKLSTKEKKEVKERIE 183 (1843) T ss_pred ECCCCHHHHHHHHHHHH T ss_conf 11265456788999873 No 151 >COG4581 Superfamily II RNA helicase [DNA replication, recombination, and repair] Probab=96.39 E-value=0.035 Score=37.66 Aligned_cols=67 Identities=19% Similarity=0.303 Sum_probs=55.5 Q ss_pred CCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHH-HHHHHHHHCCCEEEEEECCHHHHHHHHHHHH Q ss_conf 3667734689999985148955994287877038999-9999999679828996010589999998644 Q gi|254780229|r 401 LIDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTIT-NIIAAAMLHGKKILFCAQKMAAIEVVRHRLE 468 (1775) Q Consensus 401 ~~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIa-NlIa~~la~Gk~VLfVaeK~aALdVV~~RL~ 468 (1775) .+.+.|.-|..|+....+|.+++|.-|+|+||| .++ -+|+.++++|++|-+++--.|=-.--|+.|- T Consensus 116 ~~F~LD~fQ~~a~~~Ler~esVlV~ApTssGKT-vVaeyAi~~al~~~qrviYTsPIKALsNQKyrdl~ 183 (1041) T COG4581 116 YPFELDPFQQEAIAILERGESVLVCAPTSSGKT-VVAEYAIALALRDGQRVIYTSPIKALSNQKYRDLL 183 (1041) T ss_pred CCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCC-HHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHH T ss_conf 898967899999999847995799733789855-59999999998718944861630664206799999 No 152 >pfam02562 PhoH PhoH-like protein. PhoH is a cytoplasmic protein and predicted ATPase that is induced by phosphate starvation. Probab=96.39 E-value=0.015 Score=41.58 Aligned_cols=34 Identities=32% Similarity=0.554 Sum_probs=19.4 Q ss_pred EEEECCHHHCCHHHHHHHHHHC---CEEEEEECCCCC Q ss_conf 7886063236989988988611---828996077866 Q gi|254780229|r 1364 LVIMDESSQIKPEDALGVIARG---KQIVVVGDPKQL 1397 (1775) Q Consensus 1364 ~VI~DEASQ~~~~~al~al~ra---kqvvvvGD~kQl 1397 (1775) +||+|||--+++....-.|.|. -++|+.||++|. T Consensus 122 ~iIvDEaQN~t~~~lk~ilTRiG~~SK~vi~GD~~Q~ 158 (205) T pfam02562 122 FIILDEAQNTTPEQMKMFLTRIGFNSKMVVTGDITQI 158 (205) T ss_pred EEEEECHHCCCHHHHHHHHHHCCCCCEEEEECCHHHC T ss_conf 8999722139999999998421799689994786651 No 153 >pfam00437 GSPII_E Type II/IV secretion system protein. This family contains both type II and type IV pathway secretion proteins from bacteria. VirB11 ATPase is a subunit of the Agrobacterium tumefaciens transfer DNA (T-DNA) transfer system, a type IV secretion pathway required for delivery of T-DNA and effector proteins to plant cells during infection. Probab=96.38 E-value=0.0055 Score=46.06 Aligned_cols=45 Identities=18% Similarity=0.297 Sum_probs=32.0 Q ss_pred HHHHHHH-HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE Q ss_conf 6899999-85148955994287877038999999999967-9828996 Q gi|254780229|r 408 SQHSAVI-DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFC 453 (1775) Q Consensus 408 SQ~~av~-~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfV 453 (1775) .+...+. .+..++|++|-||+||||| |..|.+...+-. +++|.-+ T Consensus 127 ~~~~~L~~~v~~~~~ilIsG~TGSGKT-T~l~all~~i~~~~~riiti 173 (283) T pfam00437 127 DIAEFLRQAVQARGNILVSGGTGSGKT-TLLYALLNEINTDDERIVTI 173 (283) T ss_pred HHHHHHHHHHHCCCEEEEECCCCCCHH-HHHHHHHHHCCCCCCCEEEE T ss_conf 999999999981975999889999889-99999998408777627873 No 154 >PRK13909 putative recombination protein RecB; Provisional Probab=96.37 E-value=0.0078 Score=44.47 Aligned_cols=19 Identities=26% Similarity=0.401 Sum_probs=11.7 Q ss_pred EEEECHHHHHHHCCCCCCCCCEEEE Q ss_conf 8992577998725852254107886 Q gi|254780229|r 1343 CFMMSPMSVANYLEPKDVKFDLVIM 1367 (1775) Q Consensus 1343 ~~~msp~sva~~l~~~~~~FD~VI~ 1367 (1775) |=+||.+. .....|++||+ T Consensus 602 V~iMTIHk------SKGLEFpvVil 620 (911) T PRK13909 602 VQIMTVHK------SKGLEFPHVIV 620 (911) T ss_pred EEEEEEEE------CCCCCCCEEEE T ss_conf 07776320------25766887996 No 155 >TIGR02237 recomb_radB DNA repair and recombination protein RadB; InterPro: IPR011939 This family consists exclusively of archaeal RadB protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO) and DMC1 (IPR011940 from INTERPRO), and archaeal RadA (IPR011938 from INTERPRO) ,, .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=96.37 E-value=0.0044 Score=47.12 Aligned_cols=29 Identities=41% Similarity=0.783 Sum_probs=23.2 Q ss_pred EECCCCCCCHHHHH-HHHHHHHHCCCEEEEE Q ss_conf 94287877038999-9999999679828996 Q gi|254780229|r 424 IEGPPGTGKSQTIT-NIIAAAMLHGKKILFC 453 (1775) Q Consensus 424 i~GPPGTGKSQTIa-NlIa~~la~Gk~VLfV 453 (1775) |-|||||||| ||+ -+=+.++..||+|-|| T Consensus 17 iYGp~G~GKT-n~c~~~a~~a~~~Gk~v~Yi 46 (223) T TIGR02237 17 IYGPPGSGKT-NICLILAVNAARQGKKVVYI 46 (223) T ss_pred EECCCCCCHH-HHHHHHHHHHHHCCCCEEEE T ss_conf 8758998678-99999999998618958999 No 156 >PRK03992 proteasome-activating nucleotidase; Provisional Probab=96.36 E-value=0.003 Score=48.84 Aligned_cols=28 Identities=32% Similarity=0.650 Sum_probs=23.0 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 51489559942878770389999999999 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAM 444 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~l 444 (1775) +..-+.+++.|||||||| .+|.+||+-+ T Consensus 163 i~pPkGvLLyGPPGtGKT-llAkAvA~e~ 190 (390) T PRK03992 163 IEPPKGVLLYGPPGTGKT-LLAKAVAHET 190 (390) T ss_pred CCCCCEEEEECCCCCCHH-HHHHHHHHHH T ss_conf 999972786898999789-9999999874 No 157 >KOG0991 consensus Probab=96.36 E-value=0.0049 Score=46.56 Aligned_cols=60 Identities=38% Similarity=0.605 Sum_probs=38.3 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCC----EEE-EEECCHHHHHHHHHHHHH Q ss_conf 68999998514895599428787703899999999996798----289-960105899999986443 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGK----KIL-FCAQKMAAIEVVRHRLEK 469 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk----~VL-fVaeK~aALdVV~~RL~~ 469 (1775) +-++.+..-.+-.|++|-||||||||.+| -..|+.|- |. -|| .-|---...|||++++.. T Consensus 37 ~rl~via~~gnmP~liisGpPG~GKTTsi-~~LAr~LL-G~~~ke~vLELNASdeRGIDvVRn~IK~ 101 (333) T KOG0991 37 ERLSVIAKEGNMPNLIISGPPGTGKTTSI-LCLARELL-GDSYKEAVLELNASDERGIDVVRNKIKM 101 (333) T ss_pred HHHHHHHHCCCCCCEEEECCCCCCHHHHH-HHHHHHHH-CHHHHHHHHHCCCCCCCCCHHHHHHHHH T ss_conf 99999997289986675279998616489-99999983-8066657632057655460899999999 No 158 >COG3973 Superfamily I DNA and RNA helicases [General function prediction only] Probab=96.36 E-value=0.019 Score=40.48 Aligned_cols=68 Identities=22% Similarity=0.316 Sum_probs=51.9 Q ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCHHHH-HHHHHHHHHC------CCEEEEEECCHHHHHHHHHHHHHCCCCCEE Q ss_conf 468999998514895599428787703899-9999999967------982899601058999999864435844136 Q gi|254780229|r 407 SSQHSAVIDVINGKNLIIEGPPGTGKSQTI-TNIIAAAMLH------GKKILFCAQKMAAIEVVRHRLEKAGLGEFC 476 (1775) Q Consensus 407 sSQ~~av~~~~~g~~~vi~GPPGTGKSQTI-aNlIa~~la~------Gk~VLfVaeK~aALdVV~~RL~~~gLg~~c 476 (1775) +-|-.+|..- +++-+||||-|||||| || ..-+|.+|+. +|-||+++--..=++-+.+=|++.|-+... T Consensus 215 kEQneIIR~e-k~~ilVVQGaAGSGKT-tiALHRvAyLlY~~R~~l~~k~vlvl~PN~vFleYis~VLPeLGe~~V~ 289 (747) T COG3973 215 KEQNEIIRFE-KNKILVVQGAAGSGKT-TIALHRVAYLLYGYRGPLQAKPVLVLGPNRVFLEYISRVLPELGEEGVV 289 (747) T ss_pred HHHHHHHHCC-CCCEEEEECCCCCCCH-HHHHHHHHHHHHCCCCCCCCCCEEEECCCHHHHHHHHHHCHHHCCCCEE T ss_conf 7677787555-7874899558888713-5889999999853566246686599828389999998755441668636 No 159 >PRK12724 flagellar biosynthesis regulator FlhF; Provisional Probab=96.36 E-value=0.0067 Score=45.15 Aligned_cols=53 Identities=28% Similarity=0.533 Sum_probs=39.1 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEE---CCHHHHHHHHHHHHHCCC Q ss_conf 9559942878770389999999999-6798289960---105899999986443584 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCA---QKMAAIEVVRHRLEKAGL 472 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVa---eK~aALdVV~~RL~~~gL 472 (1775) +-+++.||+|-|||.|||-|-|.+. -.||+|-++. -..+|.+-++.-=+=.|+ T Consensus 224 kvi~lVGPTGVGKTTTiAKLAA~~~l~~~kkVaLIT~DTYRIgAvEQLktYa~Il~i 280 (432) T PRK12724 224 KVVFFVGPTGSGKTTSIAKLAAKYFLHMGKSVSLYTTDNYRIAAIEQLKRYADTMGM 280 (432) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHCC T ss_conf 299998999988899999999999997499279995266537799999999998599 No 160 >TIGR00368 TIGR00368 Mg chelatase homolog; InterPro: IPR004482 This family of bacterial proteins are variously described as 'hypothetical protein yifB', 'competence protein', 'hypothetical protein' or 'Mg chelatase-related protein'. These proteins are a subset of the magnesium chelatase, ChlI subunit family and either belong to or show significant homology to the non-peptidase homologs of the MEROPS peptidase family S16 (lon protease family, clan SF), IPR001984 from INTERPRO. . Probab=96.35 E-value=0.0017 Score=51.34 Aligned_cols=161 Identities=20% Similarity=0.263 Sum_probs=81.3 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEEEECCCCCCCCEECCEEEEEEEEEEECCCCCCCCEEEE Q ss_conf 13452147388999999999999977765196265677643576338887773146278987999971267776520689 Q gi|254780229|r 191 NQIQTAYFQDKLESILRLINEKSQISIKETGTSILYIVLGFLEWYEIDDYDNPRLAPLFTIPVALEKRSSTSQPGLTQYH 270 (1775) Q Consensus 191 ~~~~~~l~~~~l~~~l~~l~~~ar~~~eE~G~n~Lyla~G~L~W~~~~~~~~~~~aPLll~Pv~l~r~~~~~~~~~~~f~ 270 (1775) ++|...|.|.+|-| |||.=-|=.|+|+|.=.|.-+..+ .- .|. T Consensus 58 ~rI~iNLAPAdl~K--------------eG~~FDLpIAI~ilaaseq~da~~-L~----------------------~yl 100 (505) T TIGR00368 58 KRITINLAPADLPK--------------EGGRFDLPIAIGILAASEQLDAKK-LD----------------------EYL 100 (505) T ss_pred CEEEEECCCCCCCC--------------CCCCCCHHHHHHHHHHHHHHHHHC-CC----------------------CEE T ss_conf 40166538888766--------------788863389999999863330431-44----------------------011 Q ss_pred EEECCCCCEECHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEE---EEECCHHHHHHHHHHHH Q ss_conf 996187302339999986676198487644568989999999999861289808865455---66225147778643100 Q gi|254780229|r 271 LRYTGEEIILNLSLKEKLQSDFGIILPAIAEGMWPEDYFLQIQKIIEESKSHWAVRRYGV---LGLLNFSKMLMYLDLDH 347 (1775) Q Consensus 271 l~~~~~e~~~N~~L~~~L~~~~~i~Lp~~~~~~~~~~~~~~~~~~i~~~~~~w~v~~~~~---lg~fsf~k~~m~~DL~~ 347 (1775) +. + -++|--.|| .|.+++..+..|.+. +-.+-|++.-. .|+-+=.+....+||- T Consensus 101 ~l--G-----EL~LdG~lR--------------~i~gvlP~~~~A~k~-~~~~~iVp~~N~~EaSlv~G~~~y~~~~L~- 157 (505) T TIGR00368 101 FL--G-----ELALDGKLR--------------GIKGVLPAIALAQKS-GRKFIIVPKENAEEASLVDGLNVYGADHLK- 157 (505) T ss_pred EH--H-----HHHHCCCCC--------------HHHHHHHHHHHHHHC-CCCEEEECCCCCCCEEEECCCCEEHHHHHH- T ss_conf 00--1-----133247511--------------045689999999865-876776216675520266387402047489- Q ss_pred CCCCCCHHHHHCCHHHHHHHC--CCCCCCCCCCCC--CCCCCCCC--CCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 013221014320878788504--555556655666--43343356--882113300022366773468999998514895 Q gi|254780229|r 348 LRWPEGEDNILNHDIIQRLFI--AQSCDHDKDNAS--SNQNIEYK--IDEIEDIHQRCPLIDDSDSSQHSAVIDVINGKN 421 (1775) Q Consensus 348 ~~~~~~~~~~~~~~~v~~~~~--~~~~~~~~~~~~--~~~~~~~~--~~~~~~~~~~~~~~~~aDsSQ~~av~~~~~g~~ 421 (1775) .+|. +++ ++..-....... ...+.++. .|-.+ |.-=.-...+-=.+++-|+| T Consensus 158 -------------~vv~-fl~ql~~~~L~~~t~~~~~~~~~~s~~~~~DL~d--------v~GQ~~akRAleIAaAGGHN 215 (505) T TIGR00368 158 -------------EVVK-FLEQLGSEKLPPRTNTKAKSIINVSLIIDLDLKD--------VKGQQHAKRALEIAAAGGHN 215 (505) T ss_pred -------------HHHH-HHHHHCCCCCCCCCCCCCCHHHHHCCHHHCCCCC--------CCCHHHCCHHHHHHHHCCCC T ss_conf -------------9999-9976231017775455531034413201044322--------54510110267775313564 Q ss_pred EEEECCCCCCCH Q ss_conf 599428787703 Q gi|254780229|r 422 LIIEGPPGTGKS 433 (1775) Q Consensus 422 ~vi~GPPGTGKS 433 (1775) +++-|||||||| T Consensus 216 lll~GPPGsGKT 227 (505) T TIGR00368 216 LLLLGPPGSGKT 227 (505) T ss_pred EEEECCCCCCHH T ss_conf 376782496268 No 161 >PRK06620 hypothetical protein; Validated Probab=96.35 E-value=0.12 Score=32.26 Aligned_cols=22 Identities=36% Similarity=0.653 Sum_probs=17.6 Q ss_pred CCEEEECCCCCCCHHHHHHHHHH Q ss_conf 95599428787703899999999 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAA 442 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~ 442 (1775) +.++|.||+|+|||| ++++.+. T Consensus 45 ~~l~I~Gp~gSGKTH-L~~i~~~ 66 (214) T PRK06620 45 FTLLIKGPSSSGKTY-LTKIWQN 66 (214) T ss_pred CEEEEECCCCCCHHH-HHHHHHH T ss_conf 559998799998899-9999999 No 162 >pfam07726 AAA_3 ATPase family associated with various cellular activities (AAA). This Pfam entry includes some of the AAA proteins not detected by the pfam00004 model. Probab=96.35 E-value=0.0029 Score=49.01 Aligned_cols=25 Identities=24% Similarity=0.522 Sum_probs=21.4 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 55994287877038999999999967 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) +++|+|||||||| |++..||.++.. T Consensus 1 hVLL~GppG~GKT-~l~~~lA~~~~~ 25 (131) T pfam07726 1 HVLLEGVPGLAKT-LLARTLARSLGL 25 (131) T ss_pred CEEEECCCCCHHH-HHHHHHHHHHCC T ss_conf 9878989987699-999999999599 No 163 >pfam00308 Bac_DnaA Bacterial dnaA protein. Probab=96.33 E-value=0.013 Score=42.11 Aligned_cols=55 Identities=27% Similarity=0.451 Sum_probs=34.5 Q ss_pred CCCCCCCHHHHHHHHHHHC--CCC---EEEECCCCCCCHHHHHHHHHHHHHC---CCEEEEEEC Q ss_conf 2366773468999998514--895---5994287877038999999999967---982899601 Q gi|254780229|r 400 PLIDDSDSSQHSAVIDVIN--GKN---LIIEGPPGTGKSQTITNIIAAAMLH---GKKILFCAQ 455 (1775) Q Consensus 400 ~~~~~aDsSQ~~av~~~~~--g~~---~vi~GPPGTGKSQTIaNlIa~~la~---Gk~VLfVae 455 (1775) |.|-+...-=..++..+.. |.. +.|.||+|+|||| +.++||+.... |++|.+++- T Consensus 10 F~vg~~N~~a~~~~~~i~~~~~~~~npl~i~G~~G~GKTH-LLqA~~~~~~~~~~~~~v~yl~~ 72 (219) T pfam00308 10 FVIGSSNRFAHAAALAVAEAPGKAYNPLFIYGGVGLGKTH-LLHAIGNYALRNFPNLRVVYLTS 72 (219) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCHHH-HHHHHHHHHHHHCCCCEEEEEEH T ss_conf 0236838999999999996758767826998899998889-99999999998499982888439 No 164 >pfam00004 AAA ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes. Probab=96.32 E-value=0.0029 Score=48.99 Aligned_cols=28 Identities=39% Similarity=0.814 Sum_probs=20.9 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEE Q ss_conf 5994287877038999999999967982899 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKILF 452 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLf 452 (1775) +++.|||||||| +++..||+.+ |..++. T Consensus 1 iLl~GppGtGKT-~~a~~la~~~--~~~~~~ 28 (131) T pfam00004 1 LLLYGPPGTGKT-TLAKAVAKEL--GAPFIE 28 (131) T ss_pred CEEECCCCCCHH-HHHHHHHHHH--CCCCEE T ss_conf 987899999999-9999999997--898533 No 165 >PRK11889 flhF flagellar biosynthesis regulator FlhF; Provisional Probab=96.32 E-value=0.0064 Score=45.39 Aligned_cols=46 Identities=17% Similarity=0.395 Sum_probs=36.8 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE---CCHHHHHHHHH Q ss_conf 95599428787703899999999996798289960---10589999998 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA---QKMAAIEVVRH 465 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa---eK~aALdVV~~ 465 (1775) +.+++.||+|-|||.|||-|-|.+...||+|-|+. -..+|.|-++. T Consensus 242 q~IALVGPTGVGKTTTIAKLAArf~~~~KkVALITtDTYRIGAVEQLKT 290 (436) T PRK11889 242 QTIALIGPTGVGKTTTLAKMAWQFHGKKKTVGFITTDHSRIGTVQQLQD 290 (436) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHH T ss_conf 1799989999888999999999986169808999806634769999999 No 166 >PRK09270 frcK putative fructose transport system kinase; Reviewed Probab=96.29 E-value=0.01 Score=43.19 Aligned_cols=31 Identities=32% Similarity=0.581 Sum_probs=24.1 Q ss_pred HHCCCCEE--EECCCCCCCHHHHHHHHHHHHHCC Q ss_conf 51489559--942878770389999999999679 Q gi|254780229|r 416 VINGKNLI--IEGPPGTGKSQTITNIIAAAMLHG 447 (1775) Q Consensus 416 ~~~g~~~v--i~GPPGTGKSQTIaNlIa~~la~G 447 (1775) +..++-++ |-||||+||| |+++.++..|... T Consensus 29 ~~~~rR~lIgIaG~pGSGKS-TlA~~l~~~L~~~ 61 (230) T PRK09270 29 AEPQRRTVVGIAGPPGAGKS-TLAETLWEALSQQ 61 (230) T ss_pred CCCCCEEEEEEECCCCCCHH-HHHHHHHHHHHHC T ss_conf 59997189999899988999-9999999998623 No 167 >PRK00080 ruvB Holliday junction DNA helicase RuvB; Reviewed Probab=96.28 E-value=0.0073 Score=44.81 Aligned_cols=40 Identities=33% Similarity=0.553 Sum_probs=28.9 Q ss_pred HHHHHHHHHHCCCC-----EEEECCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 68999998514895-----599428787703899999999996798 Q gi|254780229|r 408 SQHSAVIDVINGKN-----LIIEGPPGTGKSQTITNIIAAAMLHGK 448 (1775) Q Consensus 408 SQ~~av~~~~~g~~-----~vi~GPPGTGKSQTIaNlIa~~la~Gk 448 (1775) .++.....|+..++ +.+.||||.||| |.+++||+-|..+= T Consensus 35 ~~L~v~i~Aak~r~e~ldH~Ll~GPPGlGKT-TLA~iiA~E~~~~~ 79 (328) T PRK00080 35 ENLKIFIEAAKKRGEALDHVLLYGPPGLGKT-TLANIIANEMGVNI 79 (328) T ss_pred HHHHHHHHHHHHCCCCCCCEEEECCCCCCHH-HHHHHHHHHHCCCC T ss_conf 9999999999964999880576588998899-99999999868881 No 168 >PRK04863 mukB cell division protein MukB; Provisional Probab=96.28 E-value=0.13 Score=31.91 Aligned_cols=51 Identities=8% Similarity=0.133 Sum_probs=29.8 Q ss_pred CCCEEEEEEEEEEEECCCCCCCCEECCEEEEEEEEEEECCC-CCCCCEEEEEEECCCCCEECHH Q ss_conf 96265677643576338887773146278987999971267-7765206899961873023399 Q gi|254780229|r 221 GTSILYIVLGFLEWYEIDDYDNPRLAPLFTIPVALEKRSST-SQPGLTQYHLRYTGEEIILNLS 283 (1775) Q Consensus 221 G~n~Lyla~G~L~W~~~~~~~~~~~aPLll~Pv~l~r~~~~-~~~~~~~f~l~~~~~e~~~N~~ 283 (1775) |.+++|.++-+..= . .-=+|+-|.|+++..+ +.+....|.|..-. .-+||| T Consensus 82 ~~g~cya~Ldv~ns---~-------~qR~l~gVrlqqvaGrd~kVdikpF~i~gLp--~~v~Pt 133 (1486) T PRK04863 82 KAGVCYAALDVVNS---R-------HQRVVVGVRLQQVAGRDKKVDIKPFAIQGLP--DSVQPT 133 (1486) T ss_pred CCCCEEEEEEEECC---C-------CCEEEEEEEEEECCCCCCCCCCCEEEEECCC--CCCCHH T ss_conf 88850687755247---7-------7568999876540578886676401441686--555779 No 169 >PRK13765 ATP-dependent protease Lon; Provisional Probab=96.27 E-value=0.0076 Score=44.64 Aligned_cols=43 Identities=19% Similarity=0.349 Sum_probs=31.4 Q ss_pred CCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 66773468999998514895599428787703899999999996 Q gi|254780229|r 402 IDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 402 ~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) |+-=|-.=..+--+|.+-+++++.|||||||| -++.+.+..|- T Consensus 33 VIGQe~Av~~i~~Aa~qrrhvlliG~PGtGKS-mlakam~elLp 75 (637) T PRK13765 33 VIGQEHAVEVIKKAAKQRRHVMLIGSPGTGKS-MLAKAMAELLP 75 (637) T ss_pred CCCHHHHHHHHHHHHHHHCCEEEECCCCCCHH-HHHHHHHHHCC T ss_conf 45719999999999984373898689998799-99999997579 No 170 >TIGR01241 FtsH_fam ATP-dependent metallopeptidase HflB; InterPro: IPR005936 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. FtsH is a membrane-anchored ATP-dependent protease that degrades misfolded or misassembled membrane proteins as well as a subset of cytoplasmic regulatory proteins. FtsH is a 647-residue protein of 70 kDa, with two putative transmembrane segments towards its N terminus which anchor the protein to the membrane, giving rise to a periplasmic domain of 70 residues and a cytoplasmic segment of 520 residues containing the ATPase and protease domains . ; GO: 0004222 metalloendopeptidase activity, 0030163 protein catabolic process, 0016020 membrane. Probab=96.27 E-value=0.0023 Score=49.97 Aligned_cols=62 Identities=24% Similarity=0.491 Sum_probs=36.3 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCC-EEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHH-HHHHH Q ss_conf 98514895599428787703899999999996798-28996010589999998644358441365402661108-99999 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGK-KILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHK-RAILD 491 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk-~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k-~~vl~ 491 (1775) +=++--|.+++.|||||||| |+|=|+| |- .|=|=+ ++-.+|+=-.=.--|+| |++++ T Consensus 87 LGaKIPKGVLLvGPPGTGKT-----LLAKAvA-GEA~VPFF~---------------iSGSdFVEMFVGVGASRVRDLFe 145 (505) T TIGR01241 87 LGAKIPKGVLLVGPPGTGKT-----LLAKAVA-GEAGVPFFS---------------ISGSDFVEMFVGVGASRVRDLFE 145 (505) T ss_pred CCCCCCCCEEEECCCCCCHH-----HHHHHHH-CCCCCCCEE---------------ECCCCEEECCCCCCCEEHHHHHH T ss_conf 27889871473178784246-----7887520-258896247---------------40761011120564000144579 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780229|r 492 DLRKR 496 (1775) Q Consensus 492 ~L~~~ 496 (1775) +-++. T Consensus 146 qAK~n 150 (505) T TIGR01241 146 QAKKN 150 (505) T ss_pred HHHHH T ss_conf 99971 No 171 >COG0606 Predicted ATPase with chaperone activity [Posttranslational modification, protein turnover, chaperones] Probab=96.27 E-value=0.003 Score=48.85 Aligned_cols=39 Identities=28% Similarity=0.656 Sum_probs=28.2 Q ss_pred CCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHH Q ss_conf 6677346899999851489559942878770389999999 Q gi|254780229|r 402 IDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIA 441 (1775) Q Consensus 402 ~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa 441 (1775) |.---..-.+..++|+.|+|+++.|||||||| -|+.-+. T Consensus 181 V~GQ~~AKrAleiAAAGgHnLl~~GpPGtGKT-mla~Rl~ 219 (490) T COG0606 181 VKGQEQAKRALEIAAAGGHNLLLVGPPGTGKT-MLASRLP 219 (490) T ss_pred HCCCHHHHHHHHHHHHCCCCEEEECCCCCCHH-HHHHHHH T ss_conf 43849999999999843886787569988656-7642310 No 172 >COG0514 RecQ Superfamily II DNA helicase [DNA replication, recombination, and repair] Probab=96.25 E-value=0.042 Score=36.83 Aligned_cols=84 Identities=24% Similarity=0.406 Sum_probs=61.2 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHH Q ss_conf 89999985148955994287877038999999999967982899601058999999864435844136540266110899 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHKRA 488 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k~~ 488 (1775) |..+|..+..|+|.++.=|+|.||| |.--|...+-.| -+|+||==.+=-.-=-..|...|+.-.|+.=.-++..+.. T Consensus 22 Q~evI~~~l~g~d~lvvmPTGgGKS--lCyQiPAll~~G-~TLVVSPLiSLM~DQV~~l~~~Gi~A~~lnS~l~~~e~~~ 98 (590) T COG0514 22 QQEIIDALLSGKDTLVVMPTGGGKS--LCYQIPALLLEG-LTLVVSPLISLMKDQVDQLEAAGIRAAYLNSTLSREERQQ 98 (590) T ss_pred HHHHHHHHHCCCCEEEECCCCCCCC--HHHHHHHHHCCC-CEEEECCHHHHHHHHHHHHHHCCCEEEHHHCCCCHHHHHH T ss_conf 8999999965886799853898710--674367886599-7899785688899999999975965204424367877999 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254780229|r 489 ILDDLRK 495 (1775) Q Consensus 489 vl~~L~~ 495 (1775) ++..+.. T Consensus 99 v~~~l~~ 105 (590) T COG0514 99 VLNQLKS 105 (590) T ss_pred HHHHHHC T ss_conf 9999864 No 173 >TIGR03015 pepcterm_ATPase putative secretion ATPase, PEP-CTERM locus subfamily. Members of this protein are marked as probable ATPases by the nucleotide binding P-loop motif GXXGXGKTT, a motif DEAQ similar to the DEAD/H box of helicases, and extensive homology to ATPases of MSHA-type pilus systems and to GspA proteins associated with type II protein secretion systems. Probab=96.24 E-value=0.038 Score=37.37 Aligned_cols=34 Identities=18% Similarity=0.318 Sum_probs=21.6 Q ss_pred HHHHHH-HHCCCCE-EEECCCCCCCHHHHHHHHHHHH Q ss_conf 999998-5148955-9942878770389999999999 Q gi|254780229|r 410 HSAVID-VINGKNL-IIEGPPGTGKSQTITNIIAAAM 444 (1775) Q Consensus 410 ~~av~~-~~~g~~~-vi~GPPGTGKSQTIaNlIa~~l 444 (1775) ++.+.- +..++.| ||.|+|||||| |+.......+ T Consensus 32 l~~L~~~l~~~~g~~lltGe~GtGKT-tllr~l~~~l 67 (269) T TIGR03015 32 MAYLEYGLSQREGFILITGEVGAGKT-TLIRNLLKRL 67 (269) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCHH-HHHHHHHHHC T ss_conf 99999999648965999729989889-9999999845 No 174 >COG1224 TIP49 DNA helicase TIP49, TBP-interacting protein [Transcription] Probab=96.24 E-value=0.0092 Score=43.74 Aligned_cols=59 Identities=37% Similarity=0.633 Sum_probs=41.4 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHH-HHH Q ss_conf 1489559942878770389999999999679828996010589999998644358441365402661108999999-999 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHKRAILDD-LRK 495 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k~~vl~~-L~~ 495 (1775) ..|+.+.|-|||||||| -||=.||.-| |+.|=||+=. ..|+-|...+|.++|.+ +++ T Consensus 63 ~aGrgiLi~GppgTGKT-AlA~gIa~eL--G~dvPF~~is-------------------gsEiYS~E~kKTE~L~qa~Rr 120 (450) T COG1224 63 MAGRGILIVGPPGTGKT-ALAMGIAREL--GEDVPFVAIS-------------------GSEIYSLEVKKTEALTQALRR 120 (450) T ss_pred CCCCEEEEECCCCCCHH-HHHHHHHHHH--CCCCCCEEEC-------------------CCEEEEECCCHHHHHHHHHHH T ss_conf 66617999789997688-9999999985--8999821501-------------------332233100088999999998 Q ss_pred HH Q ss_conf 98 Q gi|254780229|r 496 RI 497 (1775) Q Consensus 496 ~l 497 (1775) ++ T Consensus 121 aI 122 (450) T COG1224 121 AI 122 (450) T ss_pred HH T ss_conf 64 No 175 >pfam07728 AAA_5 AAA domain (dynein-related subfamily). This Pfam entry includes some of the AAA proteins not detected by the pfam00004 model. Probab=96.23 E-value=0.0031 Score=48.65 Aligned_cols=31 Identities=35% Similarity=0.795 Sum_probs=24.4 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEE Q ss_conf 559942878770389999999999679828996 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAMLHGKKILFC 453 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfV 453 (1775) +++++|||||||| +++..||..+. |..+..+ T Consensus 1 ~vll~Gp~G~GKT-~la~~la~~l~-~~~~~~i 31 (139) T pfam07728 1 GVLLVGPPGTGKS-ELAERLAAALS-NRPVFYV 31 (139) T ss_pred CEEEECCCCCHHH-HHHHHHHHHCC-CCCCHHH T ss_conf 9899989975699-99999999807-9831112 No 176 >PRK00131 aroK shikimate kinase; Reviewed Probab=96.23 E-value=0.0045 Score=46.97 Aligned_cols=28 Identities=36% Similarity=0.613 Sum_probs=25.7 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 14895599428787703899999999996 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ..|+|.++.|+||+||| ||..++|..|. T Consensus 2 ~~~~nI~liG~~GsGKT-tvgk~LA~~L~ 29 (175) T PRK00131 2 LKGPNIVLIGMMGAGKS-TIGRLLAKRLG 29 (175) T ss_pred CCCCEEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 99980898889999989-99999999959 No 177 >PRK13851 type IV secretion system protein VirB11; Provisional Probab=96.21 E-value=0.0061 Score=45.60 Aligned_cols=45 Identities=18% Similarity=0.440 Sum_probs=38.1 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHH Q ss_conf 98514895599428787703899999999996798289960105899 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAI 460 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aAL 460 (1775) .++..++|++|-|++||||| |..|.+...+-...|+ ++-|-..-| T Consensus 157 ~aV~~r~NIlIsGgTGSGKT-TllnALl~~IP~~eRI-vtIEDt~EL 201 (343) T PRK13851 157 ACVVGRLTMLLCGPTGSGKT-TMSKTLISAIPPQERL-ITIEDTLEL 201 (343) T ss_pred HHHHCCCCEEEECCCCCCHH-HHHHHHHHCCCCCCCE-EEECCCCCC T ss_conf 99976988999888986199-9999999628965527-996115002 No 178 >PRK13767 ATP-dependent helicase; Provisional Probab=96.20 E-value=0.065 Score=34.90 Aligned_cols=60 Identities=23% Similarity=0.343 Sum_probs=40.4 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHH-HHHHHHHHH--------CCCEEEEEECCHHHHHHHHHHH Q ss_conf 68999998514895599428787703899-999999996--------7982899601058999999864 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTI-TNIIAAAML--------HGKKILFCAQKMAAIEVVRHRL 467 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTI-aNlIa~~la--------~Gk~VLfVaeK~aALdVV~~RL 467 (1775) -|..|+-.+..|+|.+|.-|+|||||-.- .-+|..++. .|=+||+++=-+|=..-+.++| T Consensus 36 ~Q~~a~~~i~~G~~~Li~ApTGsGKTlAaflp~l~~l~~~~~~~~~~~~~~~LyIsPLkAL~~D~~r~L 104 (878) T PRK13767 36 PQRYAIPLIHEGKNVLISSPTGSGKTLAAFLGIIDELFRLAEEGELEDSVYCIYVSPLRALNNDIHRNL 104 (878) T ss_pred HHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCHHHHHHHHHHHH T ss_conf 999999999679988998999813999999999999985000367788728999684798899999988 No 179 >cd01129 PulE-GspE PulE/GspE The type II secretory pathway is the main terminal branch of the general secretory pathway (GSP). It is responsible for the export the majority of Gram-negative bacterial exoenzymes and toxins. PulE is a cytoplasmic protein of the GSP, which contains an ATP binding site and a tetracysteine motif. This subgroup also includes PillB and HofB. Probab=96.20 E-value=0.015 Score=41.59 Aligned_cols=49 Identities=16% Similarity=0.365 Sum_probs=36.7 Q ss_pred CHHHHHHHHHHHCCC-CE-EEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 346899999851489-55-99428787703899999999996798289960 Q gi|254780229|r 406 DSSQHSAVIDVINGK-NL-IIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 406 DsSQ~~av~~~~~g~-~~-vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) ...|...+..+.... .+ +|.||+|+|||.|+..++.+.-..+++|+-+- T Consensus 65 ~~~~~~~l~~~~~~~~GlilitGptGSGKtTtl~a~l~~~~~~~~~i~tiE 115 (264) T cd01129 65 KPENLEIFRKLLEKPHGIILVTGPTGSGKTTTLYSALSELNTPEKNIITVE 115 (264) T ss_pred CHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 999999999997089988999789999779999999986436885089986 No 180 >PRK08074 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=96.20 E-value=0.026 Score=38.96 Aligned_cols=41 Identities=20% Similarity=0.419 Sum_probs=30.1 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHH---HHCCCEEEEEECCH Q ss_conf 85148955994287877038999999999---96798289960105 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAA---MLHGKKILFCAQKM 457 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~---la~Gk~VLfVaeK~ 457 (1775) +...|+.++|+-|+||||| ++-++..+ ...||+|.+...+. T Consensus 273 al~~~~~l~iEApTGtGKT--laYLlPai~~A~~~~~~vvIST~T~ 316 (932) T PRK08074 273 ALRDSRHALIEAGTGTGKT--LAYLLPAAYFAKKKEEPVIISTYTV 316 (932) T ss_pred HHHCCCCEEEECCCCCCCH--HHHHHHHHHHHHHCCCCEEEEECCH T ss_conf 9853884799868888713--6879999999997599099991628 No 181 >PRK12727 flagellar biosynthesis regulator FlhF; Provisional Probab=96.19 E-value=0.0094 Score=43.63 Aligned_cols=50 Identities=22% Similarity=0.450 Sum_probs=39.7 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CC-CEEEEEE---CCHHHHHHHHH Q ss_conf 514895599428787703899999999996-79-8289960---10589999998 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAML-HG-KKILFCA---QKMAAIEVVRH 465 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~G-k~VLfVa---eK~aALdVV~~ 465 (1775) ...|-=|++.||+|-|||.||+-|-|+|.. +| ++|-+|. ....|.+-++. T Consensus 345 ~~~gGv~AlvGpTGvGKTTT~aKlAa~~~~~~g~~~valit~DtyRiga~eQL~~ 399 (557) T PRK12727 345 LERGGVIALVGPTGAGKTTTIAKLAQRFAAQHAPRDVALVTTDTQRVGGREQLHS 399 (557) T ss_pred HHCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHH T ss_conf 5407647874377767311799999999997399818999726640879999999 No 182 >cd03114 ArgK-like The function of this protein family is unkown. The protein sequences are similar to the ArgK protein in E. coli. ArgK protein is a membrane ATPase which is required for transporting arginine, ornithine and lysine into the cells by the arginine and ornithine (AO system) and lysine, arginine and ornithine (LAO) transport systems. Probab=96.16 E-value=0.0073 Score=44.79 Aligned_cols=32 Identities=41% Similarity=0.602 Sum_probs=28.4 Q ss_pred EECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 94287877038999999999967982899601 Q gi|254780229|r 424 IEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ 455 (1775) Q Consensus 424 i~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae 455 (1775) |.||||.|||..|-.++.....+|++|-++|. T Consensus 4 itG~pGaGKStLi~~l~~~~~~~g~~VaVlav 35 (148) T cd03114 4 ITGVPGAGKSTLIDALITALRARGKRVAVLAI 35 (148) T ss_pred ECCCCCCCHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 25899787899999999999978983799996 No 183 >COG1132 MdlB ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms] Probab=96.15 E-value=0.046 Score=36.48 Aligned_cols=11 Identities=36% Similarity=0.869 Sum_probs=6.3 Q ss_pred HHHCCEEEEEE Q ss_conf 86118289960 Q gi|254780229|r 1382 IARGKQIVVVG 1392 (1775) Q Consensus 1382 l~rakqvvvvG 1392 (1775) +-.|..+-+|| T Consensus 352 i~~Ge~vaiVG 362 (567) T COG1132 352 IEPGEKVAIVG 362 (567) T ss_pred ECCCCEEEEEC T ss_conf 54898788855 No 184 >PRK08533 flagellar accessory protein FlaH; Reviewed Probab=96.13 E-value=0.017 Score=40.96 Aligned_cols=47 Identities=21% Similarity=0.401 Sum_probs=37.2 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHH Q ss_conf 48955994287877038999999999967982899601058999999 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVR 464 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~ 464 (1775) .|.-++|+|.||||||---..++..++..|++|+|++-....-+.++ T Consensus 23 ~gs~~li~G~~GtGKsi~~~~~~~~~l~~g~~~~yis~e~t~~~~i~ 69 (230) T PRK08533 23 FGSIILIEGDESTGKSILSQRLAYGFLQNGYSVSYVSSQLTTTEFIK 69 (230) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHH T ss_conf 98489998689987899999999999878986999994389999999 No 185 >TIGR02768 TraA_Ti Ti-type conjugative transfer relaxase TraA; InterPro: IPR014136 This entry represents the Ti-type conjugative transfer relaxase TraA. TraA contains domains distinctive of a single strand exonuclease (N-terminus, MobA/MobL, IPR005053 from INTERPRO) as well as a helicase domain (central region, homologous to the corresponding region of the F-type relaxase TraI, IPR014129 from INTERPRO). This protein likely fills the same role as TraI(F), nicking (at the oriT site) and unwinding the coiled plasmid prior to conjugative transfer .. Probab=96.13 E-value=0.0033 Score=48.42 Aligned_cols=40 Identities=30% Similarity=0.563 Sum_probs=28.9 Q ss_pred CCCEEEECCHHHCCH---HHHHH-HHHHCCEEEEEECCCCCCCC Q ss_conf 410788606323698---99889-88611828996077866862 Q gi|254780229|r 1361 KFDLVIMDESSQIKP---EDALG-VIARGKQIVVVGDPKQLPPT 1400 (1775) Q Consensus 1361 ~FD~VI~DEASQ~~~---~~al~-al~rakqvvvvGD~kQlpPt 1400 (1775) .=||+|||||.=+.- .-.|. |--+|=|||.||||+||-|. T Consensus 501 ~~dvLviDEAGMV~S~Qm~r~l~~A~~AGaKvVLvGD~~QLqaI 544 (888) T TIGR02768 501 DKDVLVIDEAGMVGSRQMARVLKEAEEAGAKVVLVGDPEQLQAI 544 (888) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHHCCCCEEEECCHHHHCHH T ss_conf 77668985154414677889999998727605984885782442 No 186 >PRK09862 putative ATP-dependent protease; Provisional Probab=96.12 E-value=0.005 Score=46.53 Aligned_cols=47 Identities=32% Similarity=0.565 Sum_probs=33.7 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHH Q ss_conf 6899999851489559942878770389999999999679828996010589999 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEV 462 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdV 462 (1775) --.+..++|+-|.|+++.|||||||| .++.-|...|-. | ....+|+| T Consensus 199 akraleIAAAGgHnlLl~GpPG~GKT-MlA~rlp~ILPp----L---t~~e~lEv 245 (506) T PRK09862 199 GKRGLEITAAGGHNLLLIGPPGTGKT-MLASRINGLLPD----L---SNEEALES 245 (506) T ss_pred HHHHHHHHHCCCCCEEEECCCCCCHH-HHHHHHHHCCCC----C---CHHHHHHH T ss_conf 99999997446886598769994598-999775123899----8---98999999 No 187 >PRK13900 type IV secretion system ATPase VirB11; Provisional Probab=96.12 E-value=0.0063 Score=45.45 Aligned_cols=45 Identities=31% Similarity=0.413 Sum_probs=38.3 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHH Q ss_conf 98514895599428787703899999999996798289960105899 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAI 460 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aAL 460 (1775) .++..++|++|-||+||||| |..|.++..+-...||.. -|-..-| T Consensus 155 ~aV~~r~NilI~G~TgSGKT-Tll~aL~~~ip~~eRiit-IEDt~EL 199 (332) T PRK13900 155 HAVISKKNIIISGGTSTGKT-TFTNAALREIPAIERLIT-VEDAREI 199 (332) T ss_pred HHHHCCCCEEEECCCCCCHH-HHHHHHHHCCCCCCCEEE-ECCCHHH T ss_conf 99864871999888898899-999999835895353566-3140663 No 188 >PRK08727 hypothetical protein; Validated Probab=96.12 E-value=0.025 Score=39.15 Aligned_cols=54 Identities=26% Similarity=0.406 Sum_probs=35.3 Q ss_pred CHHHHHHHHHHHCCCC---EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHH Q ss_conf 3468999998514895---59942878770389999999999679828996010589 Q gi|254780229|r 406 DSSQHSAVIDVINGKN---LIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAA 459 (1775) Q Consensus 406 DsSQ~~av~~~~~g~~---~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aA 459 (1775) .+...+.+.....|.+ +.|.||+|+||||-.--+...+...|+++.++.-+.+. T Consensus 25 ~n~~~a~l~~~~~~~~~~~lyl~G~~GsGKTHLl~a~~~~~~~~~~~~~yl~l~~~~ 81 (233) T PRK08727 25 PDGLLAQLQALAAGQSSDWLYLSGPAGTGKTHLALSLCAAAEQAGRSSAYLPLQAAA 81 (233) T ss_pred CHHHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEHHHHH T ss_conf 559999999874388889899989999988999999999998279972884478853 No 189 >PRK11747 dinG ATP-dependent DNA helicase DinG; Provisional Probab=96.12 E-value=0.016 Score=41.19 Aligned_cols=47 Identities=28% Similarity=0.434 Sum_probs=32.2 Q ss_pred HHHHHHHC----CCCEEEECCCCCCCHHHHHHHHH---HHHHCCCEEEEEECCHHHH Q ss_conf 99998514----89559942878770389999999---9996798289960105899 Q gi|254780229|r 411 SAVIDVIN----GKNLIIEGPPGTGKSQTITNIIA---AAMLHGKKILFCAQKMAAI 460 (1775) Q Consensus 411 ~av~~~~~----g~~~vi~GPPGTGKSQTIaNlIa---~~la~Gk~VLfVaeK~aAL 460 (1775) .+|..+.. ++.+||+-|+||||| ++-+|. .+...||+|.+... +.+| T Consensus 35 ~~V~~al~~~~~~~~l~iEAgTGtGKT--laYL~Pai~~a~~~~~~vvIST~-T~~L 88 (697) T PRK11747 35 AEVAKTLAGEYGRHILVIEAGTGVGKS--LAYLLAGIPIARAEKKKLVISTA-TVAL 88 (697) T ss_pred HHHHHHHHCCCCCCEEEEECCCCCHHH--HHHHHHHHHHHHHCCCCEEEECC-CHHH T ss_conf 999999616667866999899972089--99999999999982997999889-8899 No 190 >PRK06761 hypothetical protein; Provisional Probab=96.11 E-value=0.0087 Score=44.00 Aligned_cols=37 Identities=30% Similarity=0.471 Sum_probs=31.3 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECC Q ss_conf 8955994287877038999999999967-9828996010 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQK 456 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK 456 (1775) +|=++|+|=||+||| |.+.+++..|.. |-.|-+..|- T Consensus 2 ~kLIiIEGlPGsGKS-Tta~~l~d~L~~~g~~v~~~~Eg 39 (281) T PRK06761 2 TKLIIIEGLPGFGKS-TTAHLLNDKLSQLKIEVELFVEG 39 (281) T ss_pred CCEEEEECCCCCCHH-HHHHHHHHHHHHCCCCEEEEEEC T ss_conf 617999668999801-49999999998669853899507 No 191 >KOG0743 consensus Probab=96.09 E-value=0.0033 Score=48.33 Aligned_cols=25 Identities=36% Similarity=0.717 Sum_probs=17.9 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 89559942878770389999999999 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAM 444 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~l 444 (1775) -+...+-|||||||| |..-+||..| T Consensus 235 KRGYLLYGPPGTGKS-S~IaAmAn~L 259 (457) T KOG0743 235 KRGYLLYGPPGTGKS-SFIAAMANYL 259 (457) T ss_pred CCCCEEECCCCCCHH-HHHHHHHHHC T ss_conf 004120479999888-9999997205 No 192 >PRK13833 conjugal transfer protein TrbB; Provisional Probab=96.09 E-value=0.011 Score=42.92 Aligned_cols=47 Identities=26% Similarity=0.480 Sum_probs=36.9 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC---CEEEEEECCHHHHHH Q ss_conf 9851489559942878770389999999999679---828996010589999 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG---KKILFCAQKMAAIEV 462 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G---k~VLfVaeK~aALdV 462 (1775) .++..++|++|-|++||||| |..|.+...+... .| +++-|-..-|.. T Consensus 139 ~aV~~r~nilVsGgTGSGKT-TllnaL~~~i~~~~p~eR-ivtIEDt~EL~~ 188 (323) T PRK13833 139 SAISSRLNIVISGGTGSGKT-TLANAVIAEIVASAPEDR-LVILEDTAEIQC 188 (323) T ss_pred HHHHCCCCEEEECCCCCCHH-HHHHHHHHHHCCCCCHHE-EEEECCCCCCCC T ss_conf 99981896899917777568-999999986402893233-999457501146 No 193 >smart00763 AAA_PrkA PrkA AAA domain. This is a family of PrkA bacterial and archaeal serine kinases approximately 630 residues long. This is the N-terminal AAA domain. Probab=96.09 E-value=0.0053 Score=46.28 Aligned_cols=25 Identities=28% Similarity=0.511 Sum_probs=22.0 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 95599428787703899999999996 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +=+++.||||+||| ||++.+...|= T Consensus 79 ~IllL~GPvGsGKS-tl~~~Lk~~lE 103 (361) T smart00763 79 QILYLLGPVGGGKS-SLVECLKRGLE 103 (361) T ss_pred EEEEEECCCCCCHH-HHHHHHHHHHH T ss_conf 69999889988779-99999999999 No 194 >PRK03839 putative kinase; Provisional Probab=96.07 E-value=0.0061 Score=45.63 Aligned_cols=30 Identities=33% Similarity=0.655 Sum_probs=23.7 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 599428787703899999999996798289960 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) .+|.|.|||||| |++.++|.-+ |=+++=|+ T Consensus 3 I~ITGTPGtGKT-Tva~~La~~l--g~~~i~v~ 32 (180) T PRK03839 3 IAITGTPGVGKT-TISKLLAEKL--GYEYVNLR 32 (180) T ss_pred EEEECCCCCCHH-HHHHHHHHHH--CCEEEEHH T ss_conf 999789999989-9999999976--98798799 No 195 >PRK11448 hsdR type I restriction enzyme EcoKI subunit R; Provisional Probab=96.07 E-value=0.043 Score=36.77 Aligned_cols=60 Identities=25% Similarity=0.335 Sum_probs=45.2 Q ss_pred HHHHHHHHHH----C-CCCEEEECCCCCCCHHHHHHHHHHHHHCC--CEEEEEECCHHHHHHHHHHH Q ss_conf 6899999851----4-89559942878770389999999999679--82899601058999999864 Q gi|254780229|r 408 SQHSAVIDVI----N-GKNLIIEGPPGTGKSQTITNIIAAAMLHG--KKILFCAQKMAAIEVVRHRL 467 (1775) Q Consensus 408 SQ~~av~~~~----~-g~~~vi~GPPGTGKSQTIaNlIa~~la~G--k~VLfVaeK~aALdVV~~RL 467 (1775) -|..||.++- . -+...|+=.+|||||.|...+|.-+|-.| |||||.+-..+=.+-.++-. T Consensus 420 YQ~~AI~~v~~a~~~~~rraLl~MATGTGKTrtaial~~rLlk~~~~kRILFLvDR~~L~~QA~~~F 486 (1126) T PRK11448 420 YQEDAIQAVEKALANGQREILLAMATGTGKTRTAIALMYRLLKAKRFKRILFLVDRRALGEQALDAF 486 (1126) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHH T ss_conf 8999999999999809854688724888589899999999996587672579856589999999987 No 196 >PRK10590 ATP-dependent RNA helicase RhlE; Provisional Probab=96.06 E-value=0.1 Score=32.96 Aligned_cols=34 Identities=12% Similarity=0.360 Sum_probs=23.5 Q ss_pred CCEEEECHHHHHHHCCCCCC---CCCEEEECCHHHCC Q ss_conf 67899257799872585225---41078860632369 Q gi|254780229|r 1341 KPCFMMSPMSVANYLEPKDV---KFDLVIMDESSQIK 1374 (1775) Q Consensus 1341 ~P~~~msp~sva~~l~~~~~---~FD~VI~DEASQ~~ 1374 (1775) .-+.+.+|-.+...+..... ...++|+|||=.|- T Consensus 126 ~dIlVaTPGRLldl~~~~~~~l~~v~~lVlDEAD~mL 162 (457) T PRK10590 126 VDVLVATPGRLLDLEHQNAVKLDQVEILVLDEADRML 162 (457) T ss_pred CCEEEECCHHHHHHHHCCCCCCCCCEEEEEECCHHHH T ss_conf 9989989288898886267765752399983705651 No 197 >PRK00698 tmk thymidylate kinase; Validated Probab=96.05 E-value=0.011 Score=43.02 Aligned_cols=39 Identities=31% Similarity=0.585 Sum_probs=32.3 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEECC Q ss_conf 1489559942878770389999999999-679828996010 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCAQK 456 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVaeK 456 (1775) ++|+=.||+|+.|+||| |.+++++..| +.|..|.|+.|- T Consensus 1 mkG~fIviEGiDGsGKs-Tq~~~L~~~L~~~g~~v~~t~eP 40 (204) T PRK00698 1 MRGMFITIEGIDGAGKS-TQIELLAERLEEQGRDVVFTREP 40 (204) T ss_pred CCCEEEEEECCCCCCHH-HHHHHHHHHHHHCCCCEEEEECC T ss_conf 98319999889999899-99999999999679978998699 No 198 >cd01121 Sms Sms (bacterial radA) DNA repair protein. This protein is not related to archael radA any more than is to other RecA-like NTPases. Sms has a role in recombination and recombinational repair and is responsible for the stabilization or processing of branched DNA molecules. Probab=96.05 E-value=0.047 Score=36.35 Aligned_cols=74 Identities=27% Similarity=0.474 Sum_probs=46.1 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHH Q ss_conf 4895599428787703899999999996-7982899601058999999864435844136540266110899999999 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHKRAILDDLR 494 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k~~vl~~L~ 494 (1775) .|.-++|=|.||-||| |+.=-||..|+ .|++||+||---.+-. |+.|-+..|+..--+.|-+ .++-..+++.+. T Consensus 81 pGSvvLlgGePGIGKS-TLLLQia~~la~~~~~vLYvSGEES~~Q-Ik~RA~RLg~~~~~l~l~s-et~le~Il~~i~ 155 (372) T cd01121 81 PGSVILIGGDPGIGKS-TLLLQVAARLAKRGGKVLYVSGEESPEQ-IKLRADRLGISTENLYLLA-ETNLEDILASIE 155 (372) T ss_pred CCEEEEEECCCCCCHH-HHHHHHHHHHHHCCCCEEEEECHHHHHH-HHHHHHHHCCCCCCCEEEE-CCCHHHHHHHHH T ss_conf 6717998259988688-9999999999863993899824567899-9989998587887727884-356999999999 No 199 >cd01672 TMPK Thymidine monophosphate kinase (TMPK), also known as thymidylate kinase, catalyzes the phosphorylation of thymidine monophosphate (TMP) to thymidine diphosphate (TDP) utilizing ATP as its preferred phophoryl donor. TMPK represents the rate-limiting step in either de novo or salvage biosynthesis of thymidine triphosphate (TTP). Probab=96.04 E-value=0.011 Score=42.93 Aligned_cols=34 Identities=24% Similarity=0.590 Sum_probs=28.6 Q ss_pred EEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEECC Q ss_conf 59942878770389999999999-679828996010 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCAQK 456 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVaeK 456 (1775) .||+||.|+||| |.+++++..| +.|..|.++.+- T Consensus 3 IviEG~dGsGKs-T~~~~L~~~L~~~g~~v~~~~eP 37 (200) T cd01672 3 IVFEGIDGAGKT-TLIELLAERLEARGYEVVLTREP 37 (200) T ss_pred EEEECCCCCCHH-HHHHHHHHHHHHCCCCEEEEECC T ss_conf 999899899999-99999999999779938998699 No 200 >COG2805 PilT Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion] Probab=96.02 E-value=0.007 Score=44.98 Aligned_cols=35 Identities=34% Similarity=0.597 Sum_probs=25.9 Q ss_pred CCCCEE-EECCCCCCCHHHHHHHHHHHHHCCCEEEE Q ss_conf 489559-94287877038999999999967982899 Q gi|254780229|r 418 NGKNLI-IEGPPGTGKSQTITNIIAAAMLHGKKILF 452 (1775) Q Consensus 418 ~g~~~v-i~GPPGTGKSQTIaNlIa~~la~Gk~VLf 452 (1775) .-+.+| |.||+|+|||.|+++||-+.=-+-.+.-. T Consensus 123 ~~~GLILVTGpTGSGKSTTlAamId~iN~~~~~HIl 158 (353) T COG2805 123 SPRGLILVTGPTGSGKSTTLAAMIDYINKHKAKHIL 158 (353) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCEE T ss_conf 879669986799996787999999998414775168 No 201 >cd01126 TraG_VirD4 The TraG/TraD/VirD4 family are bacterial conjugation proteins involved in type IV secretion. These proteins aid the transfer of DNA from the plasmid into the host bacterial chromosome. They contain an ATP binding domain. VirD4 is involved in DNA transfer to plant cells and is required for virulence. Probab=96.02 E-value=0.038 Score=37.29 Aligned_cols=57 Identities=14% Similarity=0.255 Sum_probs=37.1 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 55994287877038999999999967982899601058999999864435844136540 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLEL 479 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleL 479 (1775) +++|.||||+|||.++ +|-+++..+.+|.++.-|.--.+.....+...|=.-+|+.. T Consensus 1 H~lvig~tGsGKt~~~--vip~ll~~~~s~vv~D~Kgel~~~t~~~~~~~G~~v~v~np 57 (384) T cd01126 1 HVLVFAPTRSGKGVGF--VIPNLLTWPGSVVVLDPKGENFELTSEHRRALGRKVFVFDP 57 (384) T ss_pred CEEEECCCCCCHHHHH--HHHHHHHCCCCEEEEECCHHHHHHHHHHHHHCCCEEEEEEC T ss_conf 9799889999731899--99999818998899948789999999999987996899818 No 202 >PRK13898 type IV secretion system ATPase VirB4; Provisional Probab=96.02 E-value=0.12 Score=32.26 Aligned_cols=73 Identities=23% Similarity=0.376 Sum_probs=40.0 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHH-------HHH---HCCCCCEEEECCCCCHHHHH Q ss_conf 89559942878770389999999999679828996010589999998-------644---35844136540266110899 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRH-------RLE---KAGLGEFCLELHSHKVHKRA 488 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~-------RL~---~~gLg~~cleLHs~ka~k~~ 488 (1775) | +..|.||+|+||| |..|.++...-+=..-.|+--|....++.-+ +|. ..|+-||.+ ..+..++ . T Consensus 447 G-HtlI~G~TGsGKT-tl~~fL~aq~~ky~~~~f~fDkd~~~~i~~~a~GG~Y~~l~~g~~tg~nP~~l--~dt~~n~-~ 521 (800) T PRK13898 447 G-HTLIIGPTGAGKT-VLMNFLCAQAMKFSPRMFFFDKDRGAEIFIRALNGVYTVIEPRLKCNFNPLQL--DDTPENR-T 521 (800) T ss_pred C-CEEEECCCCCCHH-HHHHHHHHHHHHHCCEEEEEECCCCHHHHHHHHCCEEEECCCCCCCCCCCCCC--CCCHHHH-H T ss_conf 7-5699899999899-99999999987548879999799986999998298887437998568687779--9986788-9 Q ss_pred HHHHHHHH Q ss_conf 99999999 Q gi|254780229|r 489 ILDDLRKR 496 (1775) Q Consensus 489 vl~~L~~~ 496 (1775) ++.++-.. T Consensus 522 fl~~~l~~ 529 (800) T PRK13898 522 FLMEWLKV 529 (800) T ss_pred HHHHHHHH T ss_conf 99999999 No 203 >COG1201 Lhr Lhr-like helicases [General function prediction only] Probab=96.01 E-value=0.024 Score=39.40 Aligned_cols=62 Identities=27% Similarity=0.448 Sum_probs=43.7 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHH-HHHHHHHHHC-------CCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 8999998514895599428787703899-9999999967-------982899601058999999864435 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGKSQTI-TNIIAAAMLH-------GKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGKSQTI-aNlIa~~la~-------Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) |..|+..+..|+|+.|.-|+|||||-+= .-+|...+-. |=.+|++|==+|=..-+.+||... T Consensus 27 Q~~a~~~i~~G~nvLiiAPTGsGKTeAAfLpil~~l~~~~~~~~~~~i~~lYIsPLkALn~Di~~rL~~~ 96 (814) T COG1201 27 QRYAIPEIHSGENVLIIAPTGSGKTEAAFLPVINELLSLGKGKLEDGIYALYISPLKALNNDIRRRLEEP 96 (814) T ss_pred HHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHH T ss_conf 9999999858984699868999737999999999998606888888569999570788878999999999 No 204 >TIGR03420 DnaA_homol_Hda DnaA regulatory inactivator Hda. Members of this protein family are Hda (Homologous to DnaA). These proteins are about half the length of DnaA and homologous over length of Hda. In the model species Escherichia coli, the initiation of DNA replication requires DnaA bound to ATP rather than ADP; Hda helps facilitate the conversion of DnaA-ATP to DnaA-ADP. Probab=95.99 E-value=0.019 Score=40.40 Aligned_cols=40 Identities=25% Similarity=0.339 Sum_probs=29.5 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEECCHH Q ss_conf 48955994287877038999999999-967982899601058 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQKMA 458 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVaeK~a 458 (1775) .++-+.|.||||+||| .+.++||+. ...|++|.|++-+.. T Consensus 37 ~~~~l~i~G~~GsGKT-HLl~a~~~~~~~~~~~~~yl~~~~~ 77 (226) T TIGR03420 37 GDRFLYLWGESGSGKS-HLLQAACAAAEERGKSAIYLPLAEL 77 (226) T ss_pred CCCEEEEECCCCCCHH-HHHHHHHHHHHCCCCCEEEECHHHH T ss_conf 8886999899999889-9999999998626995799529998 No 205 >cd01125 repA Hexameric Replicative Helicase RepA. RepA is encoded by a plasmid, which is found in most Gram negative bacteria. RepA is a 5'-3' DNA helicase which can utilize ATP, GTP and CTP to a lesser extent. Probab=95.95 E-value=0.017 Score=40.83 Aligned_cols=47 Identities=32% Similarity=0.507 Sum_probs=32.1 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCC-------------EEEEEECCHHHHHHHHHHHHHC Q ss_conf 599428787703899999999996798-------------2899601058999999864435 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGK-------------KILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk-------------~VLfVaeK~aALdVV~~RL~~~ 470 (1775) =+|.|||||||| .++=-+|.++|.|+ +|||++---- .+.+++||... T Consensus 4 ~~l~g~gG~GKS-~lal~lAl~vA~G~~~~g~~~~~~~~G~Vly~~~Ed~-~~~l~rRl~a~ 63 (239) T cd01125 4 SALVAPGGTGKS-SLLLVLALAMALGKNLFGGGLKVTEPGRVVYLSAEDP-REEIHRRLEAI 63 (239) T ss_pred EEEEECCCCCHH-HHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEECCCC-HHHHHHHHHHH T ss_conf 999808998889-9999999999759965689854687761999978899-89999999999 No 206 >COG0552 FtsY Signal recognition particle GTPase [Intracellular trafficking and secretion] Probab=95.94 E-value=0.013 Score=42.07 Aligned_cols=72 Identities=25% Similarity=0.338 Sum_probs=45.6 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC---CHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHH-HHHHHHHHH Q ss_conf 994287877038999999999967982899601---058999999864435844136540266110899-999999998 Q gi|254780229|r 423 IIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ---KMAAIEVVRHRLEKAGLGEFCLELHSHKVHKRA-ILDDLRKRI 497 (1775) Q Consensus 423 vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae---K~aALdVV~~RL~~~gLg~~cleLHs~ka~k~~-vl~~L~~~l 497 (1775) ++.|..|||||.||+-+-..+...||+|+++|- ..+|.+-+.--=...|. -+..-+ ..++... +++-+++.- T Consensus 143 l~vGVNG~GKTTTIaKLA~~l~~~g~~VllaA~DTFRAaAiEQL~~w~er~gv--~vI~~~-~G~DpAaVafDAi~~Ak 218 (340) T COG0552 143 LFVGVNGVGKTTTIAKLAKYLKQQGKSVLLAAGDTFRAAAIEQLEVWGERLGV--PVISGK-EGADPAAVAFDAIQAAK 218 (340) T ss_pred EEEECCCCCHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHHCC--EEECCC-CCCCCHHHHHHHHHHHH T ss_conf 99934888637179999999997898699982334789999999999999599--278259-99980899999999999 No 207 >PRK13894 conjugal transfer ATPase TrbB; Provisional Probab=95.94 E-value=0.012 Score=42.58 Aligned_cols=46 Identities=26% Similarity=0.492 Sum_probs=35.8 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC---CCEEEEEECCHHHHH Q ss_conf 985148955994287877038999999999967---982899601058999 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH---GKKILFCAQKMAAIE 461 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~---Gk~VLfVaeK~aALd 461 (1775) .++..++|++|-|++||||| |..|.+...+.. +.| +++-|...-|. T Consensus 144 ~~V~~r~nilI~G~TgsGKT-Tll~all~~i~~~~p~eR-ivtIED~~EL~ 192 (320) T PRK13894 144 AAVRAHRNILVIGGTGSGKT-TLVNAIINEMVIQDPTER-VFIIEDTGEIQ 192 (320) T ss_pred HHHHCCCCEEEECCCCCCHH-HHHHHHHHHHCCCCCCCC-EEEECCHHHHC T ss_conf 99972875899858886568-999999863202695201-77525878807 No 208 >PRK08903 hypothetical protein; Validated Probab=95.93 E-value=0.04 Score=37.05 Aligned_cols=37 Identities=22% Similarity=0.366 Sum_probs=27.5 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHH-HHHCCCEEEEEECC Q ss_conf 895599428787703899999999-99679828996010 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAA-AMLHGKKILFCAQK 456 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~-~la~Gk~VLfVaeK 456 (1775) +..+.|.||+|+|||| +.++||+ +...|+++.|+.-. T Consensus 42 ~~~l~i~G~~G~GKTH-Ll~a~~~~~~~~~~~~~yl~~~ 79 (227) T PRK08903 42 DRFFYLWGEAGSGRSH-LLQALVAAASEGGKNARYLDPA 79 (227) T ss_pred CCEEEEECCCCCCHHH-HHHHHHHHHHHCCCCEEEECCC T ss_conf 8669998999998889-9999999998069974996511 No 209 >PRK12723 flagellar biosynthesis regulator FlhF; Provisional Probab=95.91 E-value=0.013 Score=42.29 Aligned_cols=55 Identities=22% Similarity=0.389 Sum_probs=38.2 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCC-C---EEEEEE---CCHHHHHHHHHHHHHCCC Q ss_conf 489559942878770389999999999679-8---289960---105899999986443584 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHG-K---KILFCA---QKMAAIEVVRHRLEKAGL 472 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k---~VLfVa---eK~aALdVV~~RL~~~gL 472 (1775) .++=+++.||+|.|||.||+-|-|.+...+ | +|-|++ ...+|.+-++.=-+-.|+ T Consensus 173 k~~vi~lVGPTGvGKTTTiAKLAa~~~l~~~~k~~~V~lit~DtyRigAveQLktya~il~v 234 (388) T PRK12723 173 KKRIFILVGPTGVGKTTTIAKLAAIYGINSDDKSLNIKIITIDNYRIGAKKQIQTYGDIMGI 234 (388) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHCC T ss_conf 76289998998875787999999999986267677379998078758899999999999788 No 210 >TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria. Probab=95.91 E-value=0.17 Score=30.44 Aligned_cols=32 Identities=34% Similarity=0.589 Sum_probs=27.3 Q ss_pred CCCCEE-EECCCCCCCHHHHHHHHHHHHHCCCEEE Q ss_conf 489559-9428787703899999999996798289 Q gi|254780229|r 418 NGKNLI-IEGPPGTGKSQTITNIIAAAMLHGKKIL 451 (1775) Q Consensus 418 ~g~~~v-i~GPPGTGKSQTIaNlIa~~la~Gk~VL 451 (1775) .++|++ |.|.-|.||| |+.|+|.-||| |+.-. T Consensus 26 ~~k~i~li~G~NG~GKT-Tll~Ai~~~LY-G~~~~ 58 (650) T TIGR03185 26 SPKPIILIGGLNGAGKT-TLLDAIQLGLY-GKRAY 58 (650) T ss_pred CCCCEEEEECCCCCCHH-HHHHHHHHHHC-CCCCC T ss_conf 99987999779999789-99999999956-95100 No 211 >COG1199 DinG Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair] Probab=95.89 E-value=0.046 Score=36.45 Aligned_cols=63 Identities=27% Similarity=0.412 Sum_probs=41.1 Q ss_pred CCHHHHHHH---H-HHHCCCCEEEECCCCCCCHHHHHHHHHH---HHHCCCEEEEEECCHHHHHHHHHHHHH Q ss_conf 734689999---9-8514895599428787703899999999---996798289960105899999986443 Q gi|254780229|r 405 SDSSQHSAV---I-DVINGKNLIIEGPPGTGKSQTITNIIAA---AMLHGKKILFCAQKMAAIEVVRHRLEK 469 (1775) Q Consensus 405 aDsSQ~~av---~-~~~~g~~~vi~GPPGTGKSQTIaNlIa~---~la~Gk~VLfVaeK~aALdVV~~RL~~ 469 (1775) -...|...+ . +...|+.++||+|+||||| ++.+++. +...||+|.+.+.+.+..+-+.++... T Consensus 16 ~r~~Q~~~~~~v~~a~~~~~~~~iEapTGtGKT--l~yL~~al~~~~~~~~~viist~t~~lq~q~~~~~~~ 85 (654) T COG1199 16 PRPEQREMAEAVAEALKGGEGLLIEAPTGTGKT--LAYLLPALAYAREEGKKVIISTRTKALQEQLLEEDLP 85 (654) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEEECCCCCCHH--HHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHH T ss_conf 898999999999999816871899889985175--9999999986775297599989957688898762134 No 212 >COG1223 Predicted ATPase (AAA+ superfamily) [General function prediction only] Probab=95.88 E-value=0.032 Score=38.09 Aligned_cols=95 Identities=25% Similarity=0.413 Sum_probs=53.0 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 48955994287877038999999999967982899601058999999864435844136540266110899999999998 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHKRAILDDLRKRI 497 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k~~vl~~L~~~l 497 (1775) +-+|++..|||||||| |.|-+||.--+|=|..-|.+-| +|+++ +.+. .-+.+|-++- T Consensus 150 APknVLFyGppGTGKT-----m~Akalane~kvp~l~vkat~l-----------iGehV-----Gdga--r~Ihely~rA 206 (368) T COG1223 150 APKNVLFYGPPGTGKT-----MMAKALANEAKVPLLLVKATEL-----------IGEHV-----GDGA--RRIHELYERA 206 (368) T ss_pred CCCEEEEECCCCCCHH-----HHHHHHHCCCCCCEEEECHHHH-----------HHHHH-----HHHH--HHHHHHHHHH T ss_conf 7541687789996487-----9999872545785487116888-----------88774-----3598--9999999988 Q ss_pred HHHCCC-CCHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 631124-7158999--9999999999999999999997885 Q gi|254780229|r 498 DNRNIK-TVPQEID--VEITRYEELKGQLNQYAQEINQIWK 535 (1775) Q Consensus 498 e~~~~~-~~~~e~e--~~~~~L~~~r~~L~~~~~~L~~~~~ 535 (1775) ....+- .-..|++ ..-.+++++|...++.+.+|-+... T Consensus 207 ~~~aPcivFiDE~DAiaLdRryQelRGDVsEiVNALLTelD 247 (368) T COG1223 207 RKAAPCIVFIDELDAIALDRRYQELRGDVSEIVNALLTELD 247 (368) T ss_pred HHCCCEEEEEHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCC T ss_conf 75198499840024555304578864549999999998501 No 213 >TIGR00634 recN DNA repair protein RecN; InterPro: IPR004604 DNA repair protein RecN is thought to be DNA damage inducible and involved in recombinational processes. The N-terminal region of most of the bacterial RecN proteins sequenced to date contains an ATP/GTP binding domain within an SMC-like motif. SMC-like domains are involved in chromosomal scaffolding and segregation. It is possible that the function of RecN in homologous recombination is either structural or enzymatic or both. RecN may be involved in the proper positioning of the recombining segments of DNA, ensuring normal recombination. The observation that inactivation of this gene leads to a decreased transformation efficiency, as well as increased sensitivity to DNA-damaging agents, may be due to some defect in chromosomal partitioning or positioning during these recombination-dependent processes . The protein may function presynaptically to process double-stranded breaks to produce 3 single-stranded DNA intermediates during recombination. ; GO: 0005524 ATP binding, 0006281 DNA repair, 0006310 DNA recombination. Probab=95.87 E-value=0.18 Score=30.32 Aligned_cols=30 Identities=27% Similarity=0.437 Sum_probs=23.3 Q ss_pred HHHCCCCE-EEECCCCCCCHHHHHHHHHHHHH Q ss_conf 85148955-99428787703899999999996 Q gi|254780229|r 415 DVINGKNL-IIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 415 ~~~~g~~~-vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ++-=|+.| ||.|=+||||| -|...|..++. T Consensus 17 ~~eF~rGltVLTGETGaGKS-m~i~Al~LL~G 47 (605) T TIGR00634 17 TVEFERGLTVLTGETGAGKS-MIIDALSLLLG 47 (605) T ss_pred HHHHHCCCCEEECCCCCCHH-HHHHHHHHCCC T ss_conf 77662687143046455078-87766987058 No 214 >COG1419 FlhF Flagellar GTP-binding protein [Cell motility and secretion] Probab=95.87 E-value=0.012 Score=42.68 Aligned_cols=57 Identities=23% Similarity=0.327 Sum_probs=41.4 Q ss_pred CHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHH--HCCCEEEEEE---CCHHHHHHH Q ss_conf 346899999851489559942878770389999999999--6798289960---105899999 Q gi|254780229|r 406 DSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAM--LHGKKILFCA---QKMAAIEVV 463 (1775) Q Consensus 406 DsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~l--a~Gk~VLfVa---eK~aALdVV 463 (1775) +.+....+ .+..|+.+++.||+|-|||.|||-|-|.+. ..-++|-|++ -...|.+-. T Consensus 191 ~~~~~~~~-~~~~~~vi~LVGPTGVGKTTTlAKLAar~~~~~~~~kVaiITtDtYRIGA~EQL 252 (407) T COG1419 191 LLSLIENL-IVEQKRVIALVGPTGVGKTTTLAKLAARYVMLKKKKKVAIITTDTYRIGAVEQL 252 (407) T ss_pred HHHHCCCC-CCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHH T ss_conf 87644111-124685799989988758879999999997532576068997144115289999 No 215 >PRK13853 type IV secretion system protein VirB4; Provisional Probab=95.86 E-value=0.11 Score=32.49 Aligned_cols=44 Identities=30% Similarity=0.429 Sum_probs=29.1 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHH-----CCCEEEEEECCHHHHHHHHH Q ss_conf 895599428787703899999999996-----79828996010589999998 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAML-----HGKKILFCAQKMAAIEVVRH 465 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la-----~Gk~VLfVaeK~aALdVV~~ 465 (1775) |.. .|.||+|+||| |+.|.+...+. ++.+|.| --|....++.-+ T Consensus 427 GHt-~I~G~TGsGKT-tll~fL~aq~~ky~~~~~~~~~~-fDkd~s~~i~~~ 475 (789) T PRK13853 427 GMT-AIFGPIGRGKT-TLMTFILAMLEQSMVDRAGAVVF-FDKDRGGELLVR 475 (789) T ss_pred CCE-EEECCCCCCHH-HHHHHHHHHHHHHHCCCCCEEEE-EECCCCHHHHHH T ss_conf 744-88789999889-99999999999742235770899-958863899999 No 216 >PRK10436 hypothetical protein; Provisional Probab=95.86 E-value=0.025 Score=39.25 Aligned_cols=48 Identities=19% Similarity=0.325 Sum_probs=32.6 Q ss_pred CHHHHHHHHHHHC-CCCEE-EECCCCCCCHHHHHHHHHHHHHCCCEEEEE Q ss_conf 3468999998514-89559-942878770389999999999679828996 Q gi|254780229|r 406 DSSQHSAVIDVIN-GKNLI-IEGPPGTGKSQTITNIIAAAMLHGKKILFC 453 (1775) Q Consensus 406 DsSQ~~av~~~~~-g~~~v-i~GPPGTGKSQTIaNlIa~~la~Gk~VLfV 453 (1775) +..|...+..+.+ ...+| |.||+|+|||.|+--++.+.-..+++|+-+ T Consensus 200 ~~~~~~~~~~~~~~p~GliLvtGPTGSGKTTTLya~L~~l~~~~~~I~Ti 249 (461) T PRK10436 200 TPAQLAQFRQALQQPQGLVLVTGPTGSGKTVTLYSALQTLNTAQINICSV 249 (461) T ss_pred CHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEEE T ss_conf 99999999999838997799978999956999999997434677169996 No 217 >pfam12128 DUF3584 Protein of unknown function (DUF3584). This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. There are two conserved sequence motifs: GKT and YLP. Probab=95.85 E-value=0.18 Score=30.24 Aligned_cols=16 Identities=38% Similarity=0.534 Sum_probs=12.2 Q ss_pred CCHHHHHHHHHHHHHC Q ss_conf 7038999999999967 Q gi|254780229|r 431 GKSQTITNIIAAAMLH 446 (1775) Q Consensus 431 GKSQTIaNlIa~~la~ 446 (1775) |+--||=+||+..+.. T Consensus 196 l~~~~iK~mIi~~l~e 211 (1192) T pfam12128 196 GKFETVKRMIAAILEE 211 (1192) T ss_pred CCHHHHHHHHHHHHHC T ss_conf 4489999999999843 No 218 >COG1936 Predicted nucleotide kinase (related to CMP and AMP kinases) [Nucleotide transport and metabolism] Probab=95.83 E-value=0.013 Score=42.12 Aligned_cols=57 Identities=26% Similarity=0.498 Sum_probs=33.6 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHHCCCEEEE-------------EE----CCHHHHHHHHHHHHHC-CCCCEEEECCC Q ss_conf 55994287877038999999999967982899-------------60----1058999999864435-84413654026 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAMLHGKKILF-------------CA----QKMAAIEVVRHRLEKA-GLGEFCLELHS 481 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLf-------------Va----eK~aALdVV~~RL~~~-gLg~~cleLHs 481 (1775) ...|.|+|||||| |+++.++ .+ |=.+.- +. +.-...|.|+.+++.. .-+...++=|. T Consensus 2 ~I~ITGTPGvGKT-T~~~~L~-~l--g~~~i~l~el~~e~~~~~~~de~r~s~~vD~d~~~~~le~~~~~~~~Ivd~H~ 76 (180) T COG1936 2 LIAITGTPGVGKT-TVCKLLR-EL--GYKVIELNELAKENGLYTEYDELRKSVIVDVDKLRKRLEELLREGSGIVDSHL 76 (180) T ss_pred EEEEECCCCCCHH-HHHHHHH-HH--CCCEEEHHHHHHHCCCEECCCCCCCEEEEEHHHHHHHHHHHHCCCCEEEECHH T ss_conf 3799379998668-7999999-82--98466199999866971143776615886679999888887505884761266 No 219 >cd01882 BMS1 Bms1. Bms1 is an essential, evolutionarily conserved, nucleolar protein. Its depletion interferes with processing of the 35S pre-rRNA at sites A0, A1, and A2, and the formation of 40S subunits. Bms1, the putative endonuclease Rc11, and the essential U3 small nucleolar RNA form a stable subcomplex that is believed to control an early step in the formation of the 40S subumit. The C-terminal domain of Bms1 contains a GTPase-activating protein (GAP) that functions intramolecularly. It is believed that Rc11 activates Bms1 by acting as a guanine-nucleotide exchange factor (GEF) to promote GDP/GTP exchange, and that activated (GTP-bound) Bms1 delivers Rc11 to the preribosomes. Probab=95.83 E-value=0.015 Score=41.67 Aligned_cols=60 Identities=20% Similarity=0.366 Sum_probs=36.8 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCC-----CEEEEEECCHHHH------HHHHHHHHHCCCCCEEEECCC Q ss_conf 59942878770389999999999679-----8289960105899------999986443584413654026 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHG-----KKILFCAQKMAAI------EVVRHRLEKAGLGEFCLELHS 481 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~G-----k~VLfVaeK~aAL------dVV~~RL~~~gLg~~cleLHs 481 (1775) ++|+||||+|||.-|-.+|-+..-+. ..|-+|+-|..-+ +-...-++-+-+.+.||-+-+ T Consensus 42 VavvGPpgvGKtTLiksLvk~ytk~~l~~i~GPiTvvs~K~rRiTfiEc~nDi~smiD~AKvADlVLl~iD 112 (225) T cd01882 42 VAVVGPPGVGKTTLIKSLVKNYTKQNISDIKGPITVVTGKKRRLTFIECPNDINAMIDIAKVADLVLLLID 112 (225) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCEEEEEEECCCHHHHHHHHHHHHHEEEEEEC T ss_conf 99989899778899999999985443755788879994684268999748609999878876433688861 No 220 >TIGR00609 recB exodeoxyribonuclease V, beta subunit; InterPro: IPR004586 Exodeoxyribonuclease V, or RecBCD holoenzyme, (3.1.11.5 from EC) is a multifunctional nuclease with potent ATP-dependent exodeoxyribonuclease activity. Ejection of RecD, as occurs at chi recombinational hotspots, cripples exonuclease activity in favor of recombinagenic helicase activity. All proteins in this family for which functions are known are DNA-DNA helicases that are used as part of an exonuclease-helicase complex (made up of RecBCD homologs) that function to generate substrates for the initiation of recombination and recombinational repair. The complex catalyses exonucleolytic cleavage in either the 5' to 3' or 3' to 5' direction to yield 5-phosphooligonucleotides in the presence of ATP. This is the beta subunit .; GO: 0005524 ATP binding, 0008854 exodeoxyribonuclease V activity, 0006281 DNA repair. Probab=95.83 E-value=0.18 Score=30.16 Aligned_cols=61 Identities=30% Similarity=0.427 Sum_probs=50.7 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHHCC--------C----EEEEEECCHHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 9559942878770389999999999679--------8----289960105899999986443584413654026 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAMLHG--------K----KILFCAQKMAAIEVVRHRLEKAGLGEFCLELHS 481 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la~G--------k----~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs 481 (1775) ..++|+-==|||||.||++++--+|..+ . .||+|..+.||-.-+|-|+.. +|++....+.. T Consensus 10 g~~LiEASAGTGKTFtI~~l~lRlll~~gg~~~F~~~~~v~eiLVvTFT~AAT~ELk~RIr~-~i~~a~~~~~~ 82 (1324) T TIGR00609 10 GTFLIEASAGTGKTFTIAALYLRLLLEGGGENAFKRPLTVEEILVVTFTNAATEELKTRIRE-RIHEALRALKA 82 (1324) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCEEEEHHHHHHHHHHHHHHHH-HHHHHHHHHHH T ss_conf 40688741761489999999999998515122024434612121112227888998899888-89999999864 No 221 >CHL00081 chlI Mg-protoporyphyrin IX chelatase Probab=95.82 E-value=0.014 Score=41.96 Aligned_cols=48 Identities=23% Similarity=0.349 Sum_probs=36.4 Q ss_pred CCCCCHHHHHHHHHHHCC--CCEEEECCCCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 667734689999985148--9559942878770389999999999679828 Q gi|254780229|r 402 IDDSDSSQHSAVIDVING--KNLIIEGPPGTGKSQTITNIIAAAMLHGKKI 450 (1775) Q Consensus 402 ~~~aDsSQ~~av~~~~~g--~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~V 450 (1775) |+-=|.-..+-+..+... ..++|.|||||||| |++-.++..|-.-+.| T Consensus 14 IvGQe~~k~aLll~av~p~iGgVLi~G~~GtgKS-tlvRala~lLP~i~~v 63 (347) T CHL00081 14 IVGQEEMKLALLLNVIDPKIGGVMIMGDRGTGKS-TTIRALVDLLPEIEVV 63 (347) T ss_pred HCCHHHHHHHHHHHHCCCCCCEEEEECCCCCCHH-HHHHHHHHHCCCCEEE T ss_conf 5384999999999825788786998789987499-9999999857874220 No 222 >PRK09087 hypothetical protein; Validated Probab=95.82 E-value=0.19 Score=30.13 Aligned_cols=23 Identities=26% Similarity=0.629 Sum_probs=17.2 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHH Q ss_conf 955994287877038999999999 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAA 443 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~ 443 (1775) +.++|.||+|+|||| .+++.|.- T Consensus 45 ~~~~L~Gp~gsGKTH-L~~~~~~~ 67 (226) T PRK09087 45 PVVVLAGPVGSGKTH-LASIWREK 67 (226) T ss_pred CEEEEECCCCCCHHH-HHHHHHHH T ss_conf 758998999998869-99999998 No 223 >PRK05298 excinuclease ABC subunit B; Provisional Probab=95.81 E-value=0.041 Score=36.95 Aligned_cols=44 Identities=30% Similarity=0.462 Sum_probs=29.2 Q ss_pred HHHHHH----HHHCCCCE-EEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 899999----85148955-994287877038999999999967982899601 Q gi|254780229|r 409 QHSAVI----DVINGKNL-IIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ 455 (1775) Q Consensus 409 Q~~av~----~~~~g~~~-vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae 455 (1775) |-.||. ...+|+.+ ++-|-+|||||-||||+|+.. +|..|+.|. T Consensus 18 QP~AI~~L~~gi~~g~~~Q~LlGvTGSGKTfTmAnvI~~~---~rPtLVlah 66 (657) T PRK05298 18 QPQAIAELVEGIESGEKHQTLLGVTGSGKTFTMANVIARV---QRPTLVLAH 66 (657) T ss_pred CHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHH---CCCEEEECC T ss_conf 7999999998997499636997036785668999999986---897599766 No 224 >PRK05642 DNA replication initiation factor; Validated Probab=95.80 E-value=0.069 Score=34.63 Aligned_cols=54 Identities=15% Similarity=0.271 Sum_probs=33.4 Q ss_pred EEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEECCHHHH-H-HHHHHHHHCCCCCEEEE Q ss_conf 5994287877038999999999-96798289960105899-9-99986443584413654 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQKMAAI-E-VVRHRLEKAGLGEFCLE 478 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVaeK~aAL-d-VV~~RL~~~gLg~~cle 478 (1775) +.|.||+|+|||| +.++||+. ...|+++.++.-+...- . -+.+.+.. ..-.|++ T Consensus 48 l~i~G~~G~GKTH-LL~A~~~~~~~~~~~~~yl~~~~~~~~~~~~~~~l~~--~d~l~ID 104 (234) T PRK05642 48 IYLWGKDGVGRSH-LLQAACLRFEQRGEPAVYLPLAELLDRGPELLDNLEQ--YELVCID 104 (234) T ss_pred EEEECCCCCCHHH-HHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHH--CCEEEEE T ss_conf 8998899998899-9999999998079967997899987544999862422--7989893 No 225 >TIGR02782 TrbB_P P-type conjugative transfer ATPase TrbB; InterPro: IPR014149 This entry represents TrbB, a protein, which is encoded in the trb locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for plasmid conjugative transfer . TrbB is a homologue of the vir system VirB11 ATPase , and the Flp pilus system ATPase TadA .. Probab=95.80 E-value=0.015 Score=41.54 Aligned_cols=55 Identities=24% Similarity=0.439 Sum_probs=38.7 Q ss_pred CCCCCHHHHHHHH-HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-C---CEEEEEECCHHHH Q ss_conf 6677346899999-85148955994287877038999999999967-9---8289960105899 Q gi|254780229|r 402 IDDSDSSQHSAVI-DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-G---KKILFCAQKMAAI 460 (1775) Q Consensus 402 ~~~aDsSQ~~av~-~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-G---k~VLfVaeK~aAL 460 (1775) |+-| -|..++. ++.+-+|.||-|=+||||| |.+|++=+-++. - -|| |.=|=++=| T Consensus 123 imta--aQ~d~l~~Av~ar~NIlv~GGTGSGKT-TLaNAlla~I~~l~~P~dR~-vIiEDT~El 182 (315) T TIGR02782 123 IMTA--AQRDVLREAVAARKNILVVGGTGSGKT-TLANALLAEIAKLNDPDDRV-VIIEDTAEL 182 (315) T ss_pred CCCH--HHHHHHHHHHHHCCCEEEECCCCCCHH-HHHHHHHHHHHHCCCCCCEE-EEEECCHHH T ss_conf 4557--899999999971298899814588579-99999999885216999618-998547132 No 226 >TIGR03600 phage_DnaB phage replicative helicase, DnaB family, HK022 subfamily. Members of this family are phage (or prophage-region) homologs of the bacterial homohexameric replicative helicase DnaB. Some phage may rely on host DnaB, while others encode their own verions. This model describes the largest phage-specific clade among the close homologs of DnaB, but there are, or course, other DnaB homologs from phage that fall outside the scope of this model. Probab=95.79 E-value=0.031 Score=38.22 Aligned_cols=55 Identities=27% Similarity=0.455 Sum_probs=44.8 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHH-HHHCCCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 514895599428787703899999999-9967982899601058999999864435 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAA-AMLHGKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~-~la~Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ...|.=.||-|.||.|||..-.|+..+ |+..|+.|||-|--|.+-+++.|-+... T Consensus 191 l~~g~LiIiaARPsmGKTafalnia~n~A~~~g~~Vl~fSLEMs~eql~~R~la~~ 246 (421) T TIGR03600 191 LVKGDLIVIGARPSMGKTTLALNIAENVALREGKPVLFFSLEMSAEQLGERLLASK 246 (421) T ss_pred CCCCCEEEEEEECCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHH T ss_conf 99886899985467874599999999999866983899925799999999999985 No 227 >TIGR01242 26Sp45 26S proteasome subunit P45 family; InterPro: IPR005937 Intracellular proteins, including short-lived proteins such as cyclin, Mos, Myc, p53, NF-kappaB, and IkappaB, are degraded by the ubiquitin-proteasome system. The 26S proteasome (a 2 MDa complex) is made up of two subcomplexes: the 20S proteasome and the regulatory complex. The former is a 700 kDa cylindrical protease complex consisting of four stacks of heptameric rings with 28 subunits (i.e., 7777) with molecular masses of about 20-35 kDa, whereas the latter is a 700-1000 kDa complex consisting of at least 18 subunits with molecular masses of 28-110 kDa, including 6 putative ATPases (Rpt1-Rpt6) and 12 non-ATPase subunits (Rpn1-12). Members of the 26S proteasome subunit P45 family: ATPase p45/Sug1/Rpt6 may be phosphorylated within the proteasome. This phosphorylation event may play a key role in ATP-dependent proteolysis because a good correlation exists between the inhibition pattern of protein kinase inhibitors against the phosphorylation of p45 and that against the ATP-dependent proteolytic activity , . More information about these protein can be found at Protein of the Month: AAA ATPases .; GO: 0016787 hydrolase activity, 0030163 protein catabolic process, 0005634 nucleus, 0005737 cytoplasm. Probab=95.77 E-value=0.0058 Score=45.83 Aligned_cols=23 Identities=39% Similarity=0.807 Sum_probs=16.8 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHH Q ss_conf 895599428787703899999999 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAA 442 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~ 442 (1775) -|.+++.|||||||| -+|-++|+ T Consensus 156 PKGvLLyGPPGtGKT-LlAKAvA~ 178 (364) T TIGR01242 156 PKGVLLYGPPGTGKT-LLAKAVAH 178 (364) T ss_pred CCCEEEECCCCCCHH-HHHHHHHC T ss_conf 986570075797688-99998631 No 228 >PRK07667 uridine kinase; Provisional Probab=95.77 E-value=0.028 Score=38.63 Aligned_cols=45 Identities=20% Similarity=0.283 Sum_probs=31.1 Q ss_pred HHHHHHHHHCCCCEE--EECCCCCCCHHHHHHHHHHHHH-CCCEEEEEE Q ss_conf 899999851489559--9428787703899999999996-798289960 Q gi|254780229|r 409 QHSAVIDVINGKNLI--IEGPPGTGKSQTITNIIAAAML-HGKKILFCA 454 (1775) Q Consensus 409 Q~~av~~~~~g~~~v--i~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVa 454 (1775) |+.-|....++.-|| |-|++|+||| |+||-|+..|. .|..|..+. T Consensus 2 ~~~~~~~~~~~~r~iIgIaG~sgSGKT-Tla~~L~~~l~~~~~~v~v~~ 49 (190) T PRK07667 2 ELINVMKKHKENRFILGIDGLSRSGKT-TFVANLKENMKQEGIPFHIFH 49 (190) T ss_pred HHHHHHHHCCCCEEEEEEECCCCCCHH-HHHHHHHHHHHHCCCCEEEEE T ss_conf 377899857598699997798978899-999999999866598379996 No 229 >PRK09825 idnK D-gluconate kinase; Provisional Probab=95.77 E-value=0.0093 Score=43.68 Aligned_cols=28 Identities=39% Similarity=0.578 Sum_probs=25.8 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 14895599428787703899999999996 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) .+|+.|||-||-|+||| ||+.++|..|. T Consensus 1 ~~~~a~VVmGVsGsGKS-Tvg~~LA~~L~ 28 (176) T PRK09825 1 MAGESYILMGVSGSGKS-LIGSKIAALFS 28 (176) T ss_pred CCCCEEEEEECCCCCHH-HHHHHHHHHHC T ss_conf 99857999828989989-99999999959 No 230 >PRK05057 aroK shikimate kinase I; Reviewed Probab=95.76 E-value=0.01 Score=43.36 Aligned_cols=28 Identities=36% Similarity=0.591 Sum_probs=24.7 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 14895599428787703899999999996 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ...+|+|+.|+||+||| ||..++|..|. T Consensus 2 ~~kknI~LiG~mGsGKs-tvgk~LA~~l~ 29 (172) T PRK05057 2 AEKRNIFLVGPMGAGKS-TIGRQLAQQLN 29 (172) T ss_pred CCCCCEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 98882898899999889-99999999969 No 231 >PRK13766 Hef nuclease; Provisional Probab=95.76 E-value=0.033 Score=37.94 Aligned_cols=81 Identities=21% Similarity=0.235 Sum_probs=52.3 Q ss_pred CCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHHHHHHHHHHH-CCC-CCEEEE Q ss_conf 66773468999998514895599428787703899999999996-798289960105899999986443-584-413654 Q gi|254780229|r 402 IDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAIEVVRHRLEK-AGL-GEFCLE 478 (1775) Q Consensus 402 ~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aALdVV~~RL~~-~gL-g~~cle 478 (1775) .+.+=.=|...+..|..+ |.+|.=|+|+|||..-+-+|++.+. .|++|||.+-+..=.+--++-+.. .|+ +.-+.. T Consensus 13 ~ie~R~YQ~el~~~Al~~-NtiVvLPTG~GKT~IA~lvi~~~l~~~~gKilFLaPT~pLV~Qq~~~~~~~l~i~~~~i~~ 91 (764) T PRK13766 13 TIEARLYQQLLAAKALKG-NTLVVLPTGLGKTAIALLVIAERLQKYGGKVLILAPTKPLVEQHAEFFRKFLNIDPEKIVV 91 (764) T ss_pred CCCCCHHHHHHHHHHHHC-CEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 776538799999999858-9899959986689999999999997489889998588889999999999970999552899 Q ss_pred CCCCC Q ss_conf 02661 Q gi|254780229|r 479 LHSHK 483 (1775) Q Consensus 479 LHs~k 483 (1775) +++.. T Consensus 92 ltG~~ 96 (764) T PRK13766 92 LTGEI 96 (764) T ss_pred EECCC T ss_conf 98887 No 232 >TIGR00632 vsr DNA mismatch endonuclease Vsr; InterPro: IPR004603 This entry represents VSR (very short patch repair) endonucleases, which occur in a variety of bacteria. VSR recognises a TG mismatched base pair, generated after spontaneous deamination of methylated cytosines, and cleaves the phosphate backbone on the 5' side of the thymine . GT mismatches can lead to C-to-T transition mutations if not repaired. VSR repairs the mismatches in favour of the G-containing strand. In Escherichia coli, this endonuclease nicks double-stranded DNA within the sequence CT(AT)GN or NT(AT)GG next to the thymidine residue, which is mismatched to 2'-deoxyguanosine . The incision is mismatch-dependent and strand specific. The structure of VSR is similar to the core structure of restriction endonucleases, which have a 3-layer alpha/beta/alpha topology . ; GO: 0004519 endonuclease activity, 0006298 mismatch repair. Probab=95.75 E-value=0.0087 Score=44.00 Aligned_cols=77 Identities=25% Similarity=0.453 Sum_probs=56.8 Q ss_pred CHHHHHHHHHHHHCCCEEE----EC--CCCCCEECCEEEECCCCCCCEEEEEECCCHHHCCCC----------------- Q ss_conf 9899999999997899897----06--575771012573368988831799974580236832----------------- Q gi|254780229|r 1663 SDFAVSVVNELEKTGFACD----SQ--LGDMGFSIDVVVRDPNNPGHYLMGIECDGAMYNSAK----------------- 1719 (1775) Q Consensus 1663 s~fe~~v~~~L~~~G~~v~----~q--vg~~~~riDl~v~~p~~pg~~~l~iecDG~~yh~~~----------------- 1719 (1775) .--|..|+..|...||.=. -| =|+-| +=|||+ ..|.++|-.||--||+-. T Consensus 41 T~pE~~l~s~L~~lGlryRkdfl~~~~~G~~g-~PD~vf------~~~k~~iFidGCFWHg~~C~~~~~~p~s~~~fW~~ 113 (143) T TIGR00632 41 TKPEKALASLLFALGLRYRKDFLLRDKDGLPG-TPDIVF------KEYKVVIFIDGCFWHGHDCDILGKVPKSRKDFWSP 113 (143) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC-CCCEEE------CCCEEEEEECCCEECCCCCCCCCCCCCCCHHCCHH T ss_conf 81789999999998899888987523689854-675443------47358886027300587674014788664230368 Q ss_pred ---CHHHHHHHHHHHHHHCCCCEEEEECHH Q ss_conf ---486777444899995699099932216 Q gi|254780229|r 1720 ---SARDRDRLRQEVLERMGWRIRRIWSVD 1746 (1775) Q Consensus 1720 ---~ardrD~~Rq~vLe~~GW~~~Riws~d 1746 (1775) .=++||+--.+-|...||.|.+||.-| T Consensus 114 Ki~~N~~RD~~~~~~L~~~GWrv~~vW~~~ 143 (143) T TIGR00632 114 KIEKNVERDRRVNDALAEEGWRVLRVWECE 143 (143) T ss_pred HHHHHHHCCCCHHHHHHHCCCEEEEEECCC T ss_conf 887425303117899984599898961379 No 233 >PRK05595 replicative DNA helicase; Provisional Probab=95.75 E-value=0.036 Score=37.61 Aligned_cols=55 Identities=22% Similarity=0.335 Sum_probs=45.6 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHH-HHHCCCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 514895599428787703899999999-9967982899601058999999864435 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAA-AMLHGKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~-~la~Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ...|.=.||-|.||.|||-.-.||+.+ |+-.|++|+|.|--|.+-+++.|-|... T Consensus 198 l~~GdLiiiaaRP~mGKTa~alnia~~~a~~~g~~V~~fSlEMs~~ql~~R~ls~~ 253 (444) T PRK05595 198 FQKGDMILIAARPSMGKTTFALNIAEYAALREGKSVVIFSLEMSKEQLAYKLLCSE 253 (444) T ss_pred CCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHH T ss_conf 98577799985798980799999999999866993799958899999999999964 No 234 >cd01428 ADK Adenylate kinase (ADK) catalyzes the reversible phosphoryl transfer from adenosine triphosphates (ATP) to adenosine monophosphates (AMP) and to yield adenosine diphosphates (ADP). This enzyme is required for the biosynthesis of ADP and is essential for homeostasis of adenosine phosphates. Probab=95.73 E-value=0.0093 Score=43.71 Aligned_cols=22 Identities=36% Similarity=0.751 Sum_probs=19.9 Q ss_pred EEEECCCCCCCHHHHHHHHHHHH Q ss_conf 59942878770389999999999 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAM 444 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~l 444 (1775) ++|-||||+||+ |++..||... T Consensus 2 i~l~G~PGsGKg-Tqa~~La~~~ 23 (194) T cd01428 2 ILLLGPPGSGKG-TQAERLAKKY 23 (194) T ss_pred EEEECCCCCCHH-HHHHHHHHHH T ss_conf 899899999879-9999999997 No 235 >PRK13830 conjugal transfer protein TrbE; Provisional Probab=95.72 E-value=0.17 Score=30.49 Aligned_cols=46 Identities=35% Similarity=0.528 Sum_probs=31.2 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHC--CCEEEEEECCHHHHHHHHHHH Q ss_conf 8955994287877038999999999967--982899601058999999864 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLH--GKKILFCAQKMAAIEVVRHRL 467 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~--Gk~VLfVaeK~aALdVV~~RL 467 (1775) | +..|.||+|+||| |..|.++.-.-+ |-+| |+--|....++.-+-+ T Consensus 457 G-HtlIiGpTGsGKT-vll~fl~aq~~ky~~~~v-f~FDKd~s~~i~~~a~ 504 (818) T PRK13830 457 G-HTLIFGPTGSGKS-TLLALIAAQFRRYAGAQI-FAFDKGRSMLPLTLAA 504 (818) T ss_pred C-EEEEECCCCCCHH-HHHHHHHHHHHHCCCCEE-EEECCCCCHHHHHHHH T ss_conf 5-0589899999889-999999999864279838-9974887689999980 No 236 >PRK10522 multidrug transporter membrane component/ATP-binding component; Provisional Probab=95.72 E-value=0.0073 Score=44.80 Aligned_cols=13 Identities=31% Similarity=0.744 Sum_probs=7.4 Q ss_pred HHHHCCEEEEEEC Q ss_conf 8861182899607 Q gi|254780229|r 1381 VIARGKQIVVVGD 1393 (1775) Q Consensus 1381 al~rakqvvvvGD 1393 (1775) .+-.|+.+-+||- T Consensus 345 ~i~~Ge~vaiVG~ 357 (547) T PRK10522 345 TIKRGELLFLIGG 357 (547) T ss_pred EECCCCEEEEECC T ss_conf 9859988999899 No 237 >pfam01443 Viral_helicase1 Viral (Superfamily 1) RNA helicase. Helicase activity for this family has been demonstrated and NTPase activity. This helicase has multiple roles at different stages of viral RNA replication, as dissected by mutational analysis. Probab=95.71 E-value=0.015 Score=41.61 Aligned_cols=40 Identities=30% Similarity=0.504 Sum_probs=33.1 Q ss_pred CCCEEEECCHHHCCH--HHHHHHHHHCCEEEEEECCCCCCCC Q ss_conf 410788606323698--9988988611828996077866862 Q gi|254780229|r 1361 KFDLVIMDESSQIKP--EDALGVIARGKQIVVVGDPKQLPPT 1400 (1775) Q Consensus 1361 ~FD~VI~DEASQ~~~--~~al~al~rakqvvvvGD~kQlpPt 1400 (1775) .++++||||...+++ .+.+..+..++.++++|||.|.|.- T Consensus 59 ~~~~~iiDE~~~~~~g~l~~~~~~~~~~~~~l~GDp~Q~~~~ 100 (226) T pfam01443 59 TGKILILDEYTLLPPGYILLLAAISGAKLVILFGDPLQIPYH 100 (226) T ss_pred CCCEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCCCCC T ss_conf 778899896456681689998760688679998287616477 No 238 >TIGR01074 rep ATP-dependent DNA helicase Rep; InterPro: IPR005752 RepA hexameric DNA helicase contain ATP-binding domains similar to those seen in monomeric helicases but which are arranged in a ring. There is compelling evidence to suggest that a single ssDNA molecule passes through the centre of the hexameric ring. Activity of the enzyme is based upon two separate but coupled activities, ssDNA translocation and duplex destabilisation, and is driven by energy derived from the continuous ATP-binding and hydrolysis events that take place in the active-site cleft. The resulting conformational changes that accompany these events underpin the coupling process and allow the helicase to translocate along the DNA, destabilizing the duplex and separating the two strands in an active process ; GO: 0004003 ATP-dependent DNA helicase activity, 0006268 DNA unwinding during replication, 0005737 cytoplasm. Probab=95.71 E-value=0.056 Score=35.58 Aligned_cols=63 Identities=30% Similarity=0.435 Sum_probs=54.5 Q ss_pred CHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC----CCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 34689999985148955994287877038999999999967----982899601058999999864435 Q gi|254780229|r 406 DSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH----GKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 406 DsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~----Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) .+.|..||. =..|.=+|+=|- |||||..|+|=|+|++-. -++|--|..++.|..-++.|+... T Consensus 5 Np~Q~~AV~-Y~~GPlLVLAGA-GSGKT~VI~~KIayLi~~cgY~a~~IaAvTFTNKAA~EMkERVA~~ 71 (677) T TIGR01074 5 NPQQQEAVE-YVGGPLLVLAGA-GSGKTRVITNKIAYLIQNCGYKAKNIAAVTFTNKAAREMKERVAKT 71 (677) T ss_pred CHHHHHHHH-HHCCCCEEECCC-CCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCHHHHHHHHHHHHHH T ss_conf 743799998-615871465177-7786357888999987515878761689735237779999999852 No 239 >PRK13973 thymidylate kinase; Provisional Probab=95.71 E-value=0.017 Score=40.84 Aligned_cols=39 Identities=28% Similarity=0.515 Sum_probs=33.3 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEECC Q ss_conf 1489559942878770389999999999-679828996010 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCAQK 456 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVaeK 456 (1775) ++|+=.|++|.-|+||| |.+++++..| ..|.+|.++.|- T Consensus 1 MkG~fIv~EGiDGsGKs-Tq~~~L~~~L~~~g~~v~~treP 40 (216) T PRK13973 1 MRGRFITFEGGEGAGKS-TQIRLLAERLRAAGYDVLVTREP 40 (216) T ss_pred CCCEEEEEECCCCCCHH-HHHHHHHHHHHHCCCCEEEEECC T ss_conf 98249998899999899-99999999999779957994098 No 240 >KOG0744 consensus Probab=95.71 E-value=0.0084 Score=44.13 Aligned_cols=25 Identities=32% Similarity=0.797 Sum_probs=17.7 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 95599428787703899999999996 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +=+.++|||||||| +..-..|+-|. T Consensus 178 RliLlhGPPGTGKT-SLCKaLaQkLS 202 (423) T KOG0744 178 RLILLHGPPGTGKT-SLCKALAQKLS 202 (423) T ss_pred EEEEEECCCCCCHH-HHHHHHHHHHE T ss_conf 48998579998822-79999987514 No 241 >pfam02374 ArsA_ATPase Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell. Probab=95.70 E-value=0.017 Score=40.94 Aligned_cols=36 Identities=31% Similarity=0.528 Sum_probs=32.6 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECCHH Q ss_conf 5994287877038999999999967-982899601058 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQKMA 458 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK~a 458 (1775) +...|.=|+||| ||+-.+|-.+|+ |+|||+||--.| T Consensus 4 i~~~GKGGVGKT-T~AaA~A~~~A~~G~rvLlvStDPA 40 (304) T pfam02374 4 IFFGGKGGVGKT-TVSCATAVRLSEQGKKVLLVSTDPA 40 (304) T ss_pred EEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 999579857489-9999999999958992999946972 No 242 >COG4988 CydD ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones] Probab=95.69 E-value=0.2 Score=29.71 Aligned_cols=45 Identities=16% Similarity=0.256 Sum_probs=32.5 Q ss_pred CCEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEEEECCCHHHCC Q ss_conf 7889999665078888762001147776562468999998321426988703878879 Q gi|254780229|r 1569 RDRIFISFTYGPHEPGGRVFQRFGPINSDIGWRRLNVLFTRSRRRIEVFSTMRYLDVV 1626 (1775) Q Consensus 1569 RDvI~~S~~yg~~~~~g~~~~~fGpln~~gg~rRlNVa~sRAr~~~~v~~s~~~~~i~ 1626 (1775) +++++| |+|+.. |--+.-+--+|+....||.+.++|-|=+..++. T Consensus 475 ~~l~ll------DEpTA~-------LD~etE~~i~~~l~~l~~~ktvl~itHrl~~~~ 519 (559) T COG4988 475 ASLLLL------DEPTAH-------LDAETEQIILQALQELAKQKTVLVITHRLEDAA 519 (559) T ss_pred CCEEEE------CCCCCC-------CCHHHHHHHHHHHHHHHHCCEEEEEECCHHHHH T ss_conf 888985------487567-------987679999999999972786999976737785 No 243 >KOG1806 consensus Probab=95.68 E-value=0.033 Score=38.01 Aligned_cols=16 Identities=19% Similarity=0.505 Sum_probs=10.0 Q ss_pred HHHHHHHHHHHCCCEE Q ss_conf 9999999999789989 Q gi|254780229|r 1665 FAVSVVNELEKTGFAC 1680 (1775) Q Consensus 1665 fe~~v~~~L~~~G~~v 1680 (1775) .+.++|..++-.||.. T Consensus 1142 y~vAly~YMr~Lgypa 1157 (1320) T KOG1806 1142 YAVALFQYMRLLGYPA 1157 (1320) T ss_pred HHHHHHHHHHHHCCCH T ss_conf 5899999999957856 No 244 >PRK11192 ATP-dependent RNA helicase SrmB; Provisional Probab=95.68 E-value=0.078 Score=34.04 Aligned_cols=34 Identities=15% Similarity=0.429 Sum_probs=25.2 Q ss_pred CCEEEECHHHHHHHCCCCCC---CCCEEEECCHHHCC Q ss_conf 67899257799872585225---41078860632369 Q gi|254780229|r 1341 KPCFMMSPMSVANYLEPKDV---KFDLVIMDESSQIK 1374 (1775) Q Consensus 1341 ~P~~~msp~sva~~l~~~~~---~FD~VI~DEASQ~~ 1374 (1775) ..+.+.+|-++.+++..+.. ...+||||||=.|- T Consensus 126 ~dIlV~TPgRL~~~l~~~~~~l~~l~~lVlDEAD~ml 162 (417) T PRK11192 126 QDIVVATPGRLLQYIKEENFDCRAVETLILDEADRML 162 (417) T ss_pred CCEEEECCCHHHHHHHHCCCCCCCCCEEEEECCCCCC T ss_conf 9989978607777886367010457489996755211 No 245 >PRK11174 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed Probab=95.64 E-value=0.012 Score=42.58 Aligned_cols=11 Identities=27% Similarity=0.821 Sum_probs=6.1 Q ss_pred HHHCCEEEEEE Q ss_conf 86118289960 Q gi|254780229|r 1382 IARGKQIVVVG 1392 (1775) Q Consensus 1382 l~rakqvvvvG 1392 (1775) +-.|+.+=||| T Consensus 373 i~~Ge~vaIVG 383 (588) T PRK11174 373 LPAGQRVALVG 383 (588) T ss_pred ECCCCEEEEEC T ss_conf 74997899989 No 246 >cd02025 PanK Pantothenate kinase (PanK) catalyzes the phosphorylation of pantothenic acid to form 4'-phosphopantothenic, which is the first of five steps in coenzyme A (CoA) biosynthetic pathway. The reaction carried out by this enzyme is a key regulatory point in CoA biosynthesis. Probab=95.64 E-value=0.016 Score=41.34 Aligned_cols=32 Identities=22% Similarity=0.299 Sum_probs=26.5 Q ss_pred EEECCCCCCCHHHHHHHHHHHHH---CCCEEEEEEC Q ss_conf 99428787703899999999996---7982899601 Q gi|254780229|r 423 IIEGPPGTGKSQTITNIIAAAML---HGKKILFCAQ 455 (1775) Q Consensus 423 vi~GPPGTGKSQTIaNlIa~~la---~Gk~VLfVae 455 (1775) -|-||||+||| |+++.|+..+. .+.+|-.++. T Consensus 3 GIaG~sgSGKS-T~a~~l~~~l~~~~~~~~v~ii~~ 37 (220) T cd02025 3 GIAGSVAVGKS-TTARVLQALLSRWPDHPNVELITT 37 (220) T ss_pred EEECCCCCCHH-HHHHHHHHHHHHCCCCCCEEEEEC T ss_conf 97889987799-999999998600269994899978 No 247 >COG0470 HolB ATPase involved in DNA replication [DNA replication, recombination, and repair] Probab=95.64 E-value=0.051 Score=36.00 Aligned_cols=73 Identities=25% Similarity=0.276 Sum_probs=41.6 Q ss_pred HHHHHHHHHCCCC-EEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHH Q ss_conf 8999998514895-599428787703899999999996-79828996010589999998644358441365402661108 Q gi|254780229|r 409 QHSAVIDVINGKN-LIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHK 486 (1775) Q Consensus 409 Q~~av~~~~~g~~-~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k 486 (1775) +...+.+..++.. +++.|||||||| |.+..+|..+. .+......+..- -..++...|-.+-.+++.....++ T Consensus 13 l~~~~~~~~~~~halL~~Gp~G~Gkt-t~a~~lA~~l~~~~~~~~~~~~~~-----~~~~~~~~~~~~d~lel~~s~~~~ 86 (325) T COG0470 13 LLVQALESGRLPHALLFYGPPGVGKT-TAALALAKELLCENPTGLLPCGHC-----RSCKLIPAGNHPDFLELNPSDLRK 86 (325) T ss_pred HHHHHHHCCCCCCCEEEECCCCCCHH-HHHHHHHHHHCCCCCCCCCCCCHH-----HHHHHHCCCCCCCEEEECCCCCCC T ss_conf 99999865888761003799999789-999999999658664334552002-----244432025688659977321333 Q ss_pred H Q ss_conf 9 Q gi|254780229|r 487 R 487 (1775) Q Consensus 487 ~ 487 (1775) . T Consensus 87 ~ 87 (325) T COG0470 87 I 87 (325) T ss_pred C T ss_conf 3 No 248 >COG3727 Vsr DNA G:T-mismatch repair endonuclease [DNA replication, recombination, and repair] Probab=95.63 E-value=0.21 Score=29.61 Aligned_cols=95 Identities=21% Similarity=0.351 Sum_probs=61.5 Q ss_pred CCCHHHHHHHHHHHHCCCEEEECCCCCCEECCEEEECCCCCCCEEEEEECCCHHHCCC--------------------CC Q ss_conf 8998999999999978998970657577101257336898883179997458023683--------------------24 Q gi|254780229|r 1661 QDSDFAVSVVNELEKTGFACDSQLGDMGFSIDVVVRDPNNPGHYLMGIECDGAMYNSA--------------------KS 1720 (1775) Q Consensus 1661 ~ds~fe~~v~~~L~~~G~~v~~qvg~~~~riDl~v~~p~~pg~~~l~iecDG~~yh~~--------------------~~ 1720 (1775) -|+.-|..+...|...|++-..|=---.=+=|+|+ ..|...|-.-|--||+- +. T Consensus 20 ~dTkpE~~lr~~L~~~G~rfR~~~~~lpGkPDiVl------~~y~~viFvHGCFWh~H~c~~a~vPksnt~fWleKi~kN 93 (150) T COG3727 20 RDTKPEKRLRSLLTGQGLRFRVQDKDLPGKPDIVL------PKYRCVIFVHGCFWHGHHCYLAKVPKSNTEFWLEKIGKN 93 (150) T ss_pred CCCCHHHHHHHHHHHCCEEEEECCCCCCCCCCEEE------CCCEEEEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 47668999999985366278853788999998852------373389998510025774444568874238899998666 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECHHHHCCHHHHHHHHHHHHHH Q ss_conf 86777444899995699099932216411789999999999999 Q gi|254780229|r 1721 ARDRDRLRQEVLERMGWRIRRIWSVDWFSNPDEVIEPIIRELRE 1764 (1775) Q Consensus 1721 ardrD~~Rq~vLe~~GW~~~Riws~dw~~~p~~~~~~l~~~l~~ 1764 (1775) . +||+---..|+.+||++.-||.-+. ++. ....-...+|.+ T Consensus 94 v-eRD~r~~~~L~~~GwrvlvVWEC~~-r~k-as~a~~l~rl~~ 134 (150) T COG3727 94 V-ERDERDIKRLQQLGWRVLVVWECAL-RKK-ASDAARLERLEE 134 (150) T ss_pred H-HHHHHHHHHHHHCCCEEEEEEEEEC-HHH-HHHHHHHHHHHH T ss_conf 6-6548999999975975999986402-277-758999999999 No 249 >PRK13768 GTPase; Provisional Probab=95.63 E-value=0.02 Score=40.17 Aligned_cols=39 Identities=15% Similarity=0.206 Sum_probs=27.4 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHH Q ss_conf 559942878770389999999999679828996010589 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAA 459 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aA 459 (1775) -.+|.||||+|||.-...+-.++-..|++|-+|-=-.|+ T Consensus 4 ~~~ViGpaGSGKsT~~~~l~~~l~~~~r~~~vvNLDPA~ 42 (253) T PRK13768 4 IVFFLGTAGSGKTTLVGALSDWLEEQGYDVAIVNLDPAV 42 (253) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 899989999988999999999999769975999789866 No 250 >PRK13889 conjugal transfer relaxase TraA; Provisional Probab=95.60 E-value=0.08 Score=33.96 Aligned_cols=60 Identities=27% Similarity=0.273 Sum_probs=48.0 Q ss_pred CCHHHHHHHHHHHCCCCE-EEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHH Q ss_conf 734689999985148955-994287877038999999999967982899601058999999 Q gi|254780229|r 405 SDSSQHSAVIDVINGKNL-IIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVR 464 (1775) Q Consensus 405 aDsSQ~~av~~~~~g~~~-vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~ 464 (1775) -+..|..||..+..+..+ ||.|..|||||.++.-+=-..=+.|.+|.=.|-.-.|.+.+. T Consensus 347 Ls~EQ~~A~~hiT~~~~iavVvG~AGtGKStmL~aAReawEa~GyrV~GaALsGkAAegLe 407 (992) T PRK13889 347 LSGEQADALAHVTDGRDLGVVVGYAGTGKSAMLGVAREAWEAAGYEVRGAALSGIAAENLE 407 (992) T ss_pred CCHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHH T ss_conf 7999999999864789758998338887889999999999977988981150068999765 No 251 >PRK11819 putative ABC transporter ATP-binding protein; Reviewed Probab=95.59 E-value=0.14 Score=31.47 Aligned_cols=28 Identities=32% Similarity=0.509 Sum_probs=24.4 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHH Q ss_conf 85148955994287877038999999999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAA 443 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~ 443 (1775) .+..|.-+.|.||-|+||| |+.++|+-. T Consensus 29 ~i~~Ge~vgLvG~NGaGKS-TLlriLaG~ 56 (556) T PRK11819 29 SFFPGAKIGVLGLNGAGKS-TLLRIMAGV 56 (556) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCC T ss_conf 9968989999999997199-999998479 No 252 >COG1643 HrpA HrpA-like helicases [DNA replication, recombination, and repair] Probab=95.59 E-value=0.099 Score=32.99 Aligned_cols=76 Identities=22% Similarity=0.280 Sum_probs=43.0 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC--CHHHHHHHHHHHHHCC--CCCEEEE Q ss_conf 67734689999985148955994287877038999999999967982899601--0589999998644358--4413654 Q gi|254780229|r 403 DDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ--KMAAIEVVRHRLEKAG--LGEFCLE 478 (1775) Q Consensus 403 ~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae--K~aALdVV~~RL~~~g--Lg~~cle 478 (1775) +|.....-..+.++.+.+-+||.||||+|||.-|--.+-++-..|+....+.| ..||-.|-.+==.+.| +|++|-- T Consensus 49 LPv~~~~~~i~~ai~~~~vvii~getGsGKTTqlP~~lle~g~~~~g~I~~tQPRRlAArsvA~RvAeel~~~~G~~VGY 128 (845) T COG1643 49 LPVTAVRDEILKAIEQNQVVIIVGETGSGKTTQLPQFLLEEGLGIAGKIGCTQPRRLAARSVAERVAEELGEKLGETVGY 128 (845) T ss_pred CCCHHHHHHHHHHHHHCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 98188899999999869789986799887587889999960016687599658438999999999999838986765437 No 253 >KOG0733 consensus Probab=95.57 E-value=0.0084 Score=44.14 Aligned_cols=40 Identities=33% Similarity=0.670 Sum_probs=28.0 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHH Q ss_conf 48955994287877038999999999967982899601058999999 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVR 464 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~ 464 (1775) --+.++++||||+||| -++|.||.-|. |=| -+.+|-++|- T Consensus 222 PprGvLlHGPPGCGKT-~lA~AiAgel~----vPf--~~isApeivS 261 (802) T KOG0733 222 PPRGVLLHGPPGCGKT-SLANAIAGELG----VPF--LSISAPEIVS 261 (802) T ss_pred CCCCEEEECCCCCCHH-HHHHHHHHHCC----CCE--EEECCHHHHC T ss_conf 9975164489986478-99999752128----854--8514146531 No 254 >PRK02496 adk adenylate kinase; Provisional Probab=95.56 E-value=0.011 Score=42.90 Aligned_cols=24 Identities=42% Similarity=0.758 Sum_probs=20.6 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 9559942878770389999999999 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAM 444 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~l 444 (1775) +.+||-||||+||+ |.+.+|+... T Consensus 2 ~riillG~PGSGKg-Tqa~~L~~~~ 25 (185) T PRK02496 2 ARLIFLGPPGAGKG-TQAVVLAEHL 25 (185) T ss_pred EEEEEECCCCCCHH-HHHHHHHHHH T ss_conf 18999799999989-9999999996 No 255 >PRK06893 DNA replication initiation factor; Validated Probab=95.55 E-value=0.027 Score=38.85 Aligned_cols=57 Identities=19% Similarity=0.345 Sum_probs=36.1 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHH-HHHCCCEEEEEECCHHH-H-HHHHHHHHHCCCCCEEEE Q ss_conf 895599428787703899999999-99679828996010589-9-999986443584413654 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAA-AMLHGKKILFCAQKMAA-I-EVVRHRLEKAGLGEFCLE 478 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~-~la~Gk~VLfVaeK~aA-L-dVV~~RL~~~gLg~~cle 478 (1775) +.-|.|.||+|+|||| +.++||+ +...|+++.|+.-+.+- + ..+...+.+ ..-+|++ T Consensus 39 ~~~l~i~G~~gsGKTH-LLqa~~~~~~~~~~~~~yi~~~~~~~~~~~~l~~l~~--~d~l~iD 98 (229) T PRK06893 39 QPFFYIWGGKSSGKSH-LLKAVSNHYLLNQRTAIYIPLSKSQYFSPAVLENLEQ--QDLVCLD 98 (229) T ss_pred CCEEEEECCCCCCHHH-HHHHHHHHHHHHCCCEEEEEHHHHHHCCHHHHHHHHH--CCEEEEE T ss_conf 9879998999998899-9999999999718985999737756406999987654--7979996 No 256 >TIGR00041 DTMP_kinase thymidylate kinase; InterPro: IPR000062 Thymidylate kinase (2.7.4.9 from EC; dTMP kinase) catalyzes the phosphorylation of thymidine 5'-monophosphate (dTMP) to form thymidine 5'-diphosphate (dTDP) in the presence of ATP and magnesium:ATP + thymidine 5'-phosphate = ADP + thymidine 5'-diphosphate Thymidylate kinase is an ubiquitous enzyme of about 25 Kd and is important in the dTTP synthesis pathway for DNA synthesis. The function of dTMP kinase in eukaryotes comes from the study of a cell cycle mutant, cdc8, in Saccharomyces cerevisiae Saccharomyces cerevisiae. Structural and functional analyses suggest that the cDNA codes for authentic human dTMP kinase. The mRNA levels and enzyme activities corresponded to cell cycle progression and cell growth stages. ; GO: 0004798 thymidylate kinase activity, 0005524 ATP binding, 0006233 dTDP biosynthetic process, 0006235 dTTP biosynthetic process. Probab=95.53 E-value=0.021 Score=40.06 Aligned_cols=37 Identities=35% Similarity=0.604 Sum_probs=31.7 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHC--CCEEEEEECC Q ss_conf 8955994287877038999999999967--9828996010 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLH--GKKILFCAQK 456 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~--Gk~VLfVaeK 456 (1775) |.=.||+|+=|+||| |++|+++..+-. |-.|+|+-|- T Consensus 2 ~~fiviEGiDGaGKT-T~~~~l~~~l~~l~g~~~~~t~EP 40 (211) T TIGR00041 2 GMFIVIEGIDGAGKT-TQLNLLKKLLKELEGYKVLFTREP 40 (211) T ss_pred CCEEEEECCCCCCHH-HHHHHHHHHHHHHCCCEEEEEECC T ss_conf 745888511587589-999999999775138347887189 No 257 >KOG1533 consensus Probab=95.52 E-value=0.014 Score=41.86 Aligned_cols=34 Identities=29% Similarity=0.532 Sum_probs=25.9 Q ss_pred EEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEECC Q ss_conf 59942878770389999999999-679828996010 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCAQK 456 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVaeK 456 (1775) -||.||||+||| |-.|=..+.+ +.|++|..|--- T Consensus 5 qvVIGPPgSGKs-TYc~g~~~fls~~gr~~~vVNLD 39 (290) T KOG1533 5 QVVIGPPGSGKS-TYCNGMSQFLSAIGRPVAVVNLD 39 (290) T ss_pred EEEECCCCCCCC-CHHHHHHHHHHHHCCCEEEEECC T ss_conf 688769999853-11320999999748962799568 No 258 >PRK01297 ATP-dependent RNA helicase RhlB; Provisional Probab=95.50 E-value=0.22 Score=29.38 Aligned_cols=25 Identities=28% Similarity=0.376 Sum_probs=11.9 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCH Q ss_conf 8999998514895599428787703 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGKS 433 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGKS 433 (1775) |..||-.+..|+|+|++.|+||||| T Consensus 111 Q~~aIP~iL~GkDvi~~A~TGSGKT 135 (472) T PRK01297 111 QAQVLGYTLAGHDAIGRAQTGTGKT 135 (472) T ss_pred HHHHHHHHHCCCCEEEECCCCCHHH T ss_conf 9999999976998899899986799 No 259 >CHL00195 ycf46 Ycf46; Provisional Probab=95.50 E-value=0.011 Score=42.97 Aligned_cols=28 Identities=32% Similarity=0.392 Sum_probs=21.7 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 14895599428787703899999999996 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ..-|.+++.|||||||| .++-+||.-.. T Consensus 257 ~~PkGvLL~GpPG~GKt-l~AKAvA~e~~ 284 (491) T CHL00195 257 PTPRGLLLVGIQGTGKS-LTAKAIANEWN 284 (491) T ss_pred CCCCEEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 99987999799998789-99999986638 No 260 >TIGR03029 EpsG chain length determinant protein tyrosine kinase EpsG. The proteins in this family are homologs of the EpsG protein found in Methylobacillus strain 12S and are generally found in operons with other Eps homologs. The protein is believed to function as the protein tyrosine kinase component of the chain length regulator (along with the transmembrane component EpsF). Probab=95.49 E-value=0.04 Score=37.09 Aligned_cols=58 Identities=21% Similarity=0.205 Sum_probs=37.0 Q ss_pred CCCEEEECC-CCCCCHHHHHHHHHHHHHCCCEEEEEEC--CHHHHHHHHHHHHHCCCCCEE Q ss_conf 895599428-7877038999999999967982899601--058999999864435844136 Q gi|254780229|r 419 GKNLIIEGP-PGTGKSQTITNIIAAAMLHGKKILFCAQ--KMAAIEVVRHRLEKAGLGEFC 476 (1775) Q Consensus 419 g~~~vi~GP-PGTGKSQTIaNlIa~~la~Gk~VLfVae--K~aALdVV~~RL~~~gLg~~c 476 (1775) .+.++|.+| ||-|||.|.+|+-..+--.|||||.|=- .+-.+.-+.+.=...||.++. T Consensus 103 ~~~LaItS~~pGEGKS~vAaNLA~~~Aq~G~RvLLVDaDLRrP~lh~~f~l~~~~GLs~vL 163 (274) T TIGR03029 103 RKALAVVSAKSGEGCSYIAANLAIVFSQLGEKTLLIDANLRDPVQHRNFKLSEQRGLSDIL 163 (274) T ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHCCCCCCCHHHHH T ss_conf 8389996899999899999999999996799199995888884477975999976878884 No 261 >cd02028 UMPK_like Uridine monophosphate kinase_like (UMPK_like) is a family of proteins highly similar to the uridine monophosphate kinase (UMPK, EC 2.7.1.48), also known as uridine kinase or uridine-cytidine kinase (UCK). Probab=95.48 E-value=0.023 Score=39.53 Aligned_cols=31 Identities=26% Similarity=0.476 Sum_probs=24.9 Q ss_pred EEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEE Q ss_conf 99428787703899999999996-798289960 Q gi|254780229|r 423 IIEGPPGTGKSQTITNIIAAAML-HGKKILFCA 454 (1775) Q Consensus 423 vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVa 454 (1775) -|-||.|+||| |++|.|+..|. .|.+|..++ T Consensus 3 gIaG~SgSGKT-T~a~~L~~~l~~~~~~~~vis 34 (179) T cd02028 3 GIAGPSGSGKT-TFAKKLSNQLRVNGIGPVVIS 34 (179) T ss_pred EEECCCCCCHH-HHHHHHHHHHHHCCCCEEEEE T ss_conf 99898977899-999999999846488539995 No 262 >COG1204 Superfamily II helicase [General function prediction only] Probab=95.48 E-value=0.097 Score=33.08 Aligned_cols=78 Identities=23% Similarity=0.287 Sum_probs=54.3 Q ss_pred CCCHHHHHHHHHHHCC-CCEEEECCCCCCCHHHHHHH-HHHHHHC-CCEEEEEECCHHHHHHHHHHHH-HCCCCCEEEEC Q ss_conf 7734689999985148-95599428787703899999-9999967-9828996010589999998644-35844136540 Q gi|254780229|r 404 DSDSSQHSAVIDVING-KNLIIEGPPGTGKSQTITNI-IAAAMLH-GKKILFCAQKMAAIEVVRHRLE-KAGLGEFCLEL 479 (1775) Q Consensus 404 ~aDsSQ~~av~~~~~g-~~~vi~GPPGTGKSQTIaNl-Ia~~la~-Gk~VLfVaeK~aALdVV~~RL~-~~gLg~~cleL 479 (1775) ..-.+|.+||....-+ +|++|--|+|+||| -|+-| |...+.+ |.+|+|++=-+|=..-.++.+. -.++|--.-++ T Consensus 31 el~~~qq~av~~~~~~~~N~li~aPTgsGKT-lIA~lai~~~l~~~~~k~vYivPlkALa~Ek~~~~~~~~~~GirV~~~ 109 (766) T COG1204 31 ELFNPQQEAVEKGLLSDENVLISAPTGSGKT-LIALLAILSTLLEGGGKVVYIVPLKALAEEKYEEFSRLEELGIRVGIS 109 (766) T ss_pred HHHHHHHHHHHCCCCCCCCEEEECCCCCCHH-HHHHHHHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 7557899874111257986799767888669-999999999998559838999075999999999866688659779996 Q ss_pred CCC Q ss_conf 266 Q gi|254780229|r 480 HSH 482 (1775) Q Consensus 480 Hs~ 482 (1775) ++. T Consensus 110 TgD 112 (766) T COG1204 110 TGD 112 (766) T ss_pred CCC T ss_conf 488 No 263 >pfam10236 DAP3 Mitochondrial ribosomal death-associated protein 3. This is a family of conserved proteins which were originally described as death-associated-protein-3 (DAP-3). The proteins carry a P-loop DNA-binding motif, and induce apoptosis. DAP3 has been shown to be a pro-apoptotic factor in the mitochondrial matrix and to be crucial for mitochondrial biogenesis and so has also been designated as MRP-S29 (mitochondrial ribosomal protein subunit 29). Probab=95.47 E-value=0.028 Score=38.72 Aligned_cols=39 Identities=23% Similarity=0.393 Sum_probs=33.3 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 148955994287877038999999999967982899601 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ 455 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae 455 (1775) ....-+||.||+|||||-|.+.+++.|..+|==|+.|.. T Consensus 21 ~~~~r~vL~G~~GsGKS~~L~q~v~~A~~~~wiVl~vP~ 59 (274) T pfam10236 21 KKVVRFVLTGERGSGKSVLLAQAMAYALTQGWVVLHVPE 59 (274) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHCCEEEEECCC T ss_conf 995189988979977999999999999859989998498 No 264 >CHL00176 ftsH cell division protein; Validated Probab=95.46 E-value=0.013 Score=42.12 Aligned_cols=40 Identities=28% Similarity=0.517 Sum_probs=27.9 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHH Q ss_conf 985148955994287877038999999999967982899601058 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMA 458 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~a 458 (1775) .-++--|.+++.|||||||| ++|-|+|.--.|=|.+-.-. T Consensus 205 ~Gak~PkGvLL~GpPGTGKT-----lLAkAvAgEa~vpF~~~sgs 244 (631) T CHL00176 205 VGAKIPKGVLLVGPPGTGKT-----LLAKAIAGEAEVPFFSISGS 244 (631) T ss_pred CCCCCCCEEEEECCCCCCHH-----HHHHHHHCCCCCCEEEEEHH T ss_conf 49968965898898998788-----99999856558846998837 No 265 >TIGR02788 VirB11 P-type DNA transfer ATPase VirB11; InterPro: IPR014155 This entry contains VirB11, a protein that is found in the vir locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for DNA transfer . VirB11 is believed to be an ATPase and is a homologue of the P-like conjugation system TrbB protein and the Flp pilus system protein TadA .. Probab=95.45 E-value=0.015 Score=41.54 Aligned_cols=37 Identities=24% Similarity=0.562 Sum_probs=33.2 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEE Q ss_conf 98514895599428787703899999999996798289 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKIL 451 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VL 451 (1775) .++..+||+||-|=|||||| |+.|+|-+.+=.-.|+. T Consensus 153 ~Ai~~~knIii~GGTgSGKT-Tf~kal~~~IP~~ER~i 189 (328) T TIGR02788 153 LAIASRKNIIISGGTGSGKT-TFLKALVKEIPKDERLI 189 (328) T ss_pred HHHHCCCEEEEEECCCCHHH-HHHHHHHHCCCCCCCEE T ss_conf 99873891999906897189-99999973276225278 No 266 >COG1222 RPT1 ATP-dependent 26S proteasome regulatory subunit [Posttranslational modification, protein turnover, chaperones] Probab=95.45 E-value=0.011 Score=43.05 Aligned_cols=46 Identities=28% Similarity=0.556 Sum_probs=27.0 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHH Q ss_conf 85148955994287877038999999999967982899601058999999864 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRL 467 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL 467 (1775) -+..-|.+++.|||||||| |.|-|+|+--..=| =+..+-+-|.+=+ T Consensus 181 GI~PPKGVLLYGPPGTGKT-----LLAkAVA~~T~AtF--IrvvgSElVqKYi 226 (406) T COG1222 181 GIDPPKGVLLYGPPGTGKT-----LLAKAVANQTDATF--IRVVGSELVQKYI 226 (406) T ss_pred CCCCCCCEEEECCCCCCHH-----HHHHHHHHCCCCEE--EEECCHHHHHHHH T ss_conf 9999971276689997588-----99999872058669--9942199999983 No 267 >PRK11776 ATP-dependent RNA helicase DbpA; Provisional Probab=95.44 E-value=0.19 Score=30.15 Aligned_cols=33 Identities=12% Similarity=0.338 Sum_probs=24.2 Q ss_pred CCEEEECHHHHHHHCCCCCCC---CCEEEECCHHHC Q ss_conf 678992577998725852254---107886063236 Q gi|254780229|r 1341 KPCFMMSPMSVANYLEPKDVK---FDLVIMDESSQI 1373 (1775) Q Consensus 1341 ~P~~~msp~sva~~l~~~~~~---FD~VI~DEASQ~ 1373 (1775) .-+.+.+|-.+..++..+... ...+|+|||=.| T Consensus 124 ~~IvV~TPGRl~d~l~~~~l~l~~v~~lVlDEAD~m 159 (459) T PRK11776 124 AHIIVGTPGRILDHLRKGTLDLDALNTLVLDEADRM 159 (459) T ss_pred CCEEEECCHHHHHHHHCCCCCCCCEEEEEEECCHHH T ss_conf 999998958999887516632231038997062454 No 268 >PRK07246 bifunctional ATP-dependent DNA helicase/DNA polymerase III subunit epsilon; Validated Probab=95.44 E-value=0.052 Score=35.91 Aligned_cols=51 Identities=18% Similarity=0.289 Sum_probs=32.8 Q ss_pred CHHHHH---HHHH-HHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC--CEEEEEECCHHHH Q ss_conf 346899---9998-51489559942878770389999999999679--8289960105899 Q gi|254780229|r 406 DSSQHS---AVID-VINGKNLIIEGPPGTGKSQTITNIIAAAMLHG--KKILFCAQKMAAI 460 (1775) Q Consensus 406 DsSQ~~---av~~-~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G--k~VLfVaeK~aAL 460 (1775) =..|.. +|.. ...|+.++|+-|+||||| ++-+|.. ++.+ ++| +||-.+.+| T Consensus 247 R~~Q~~Ma~~V~~al~~~~~l~IEAgTGtGKT--laYLlPa-ia~~~~~~v-VIST~T~~L 303 (820) T PRK07246 247 RPKQEQFAQLVDEDYHDGVASFIEAQTGIGKT--YGYLLPL-LAQSDQNQI-IVSVPTKLL 303 (820) T ss_pred CHHHHHHHHHHHHHHCCCCCEEEECCCCCCHH--HHHHHHH-HHHCCCCEE-EEEECCHHH T ss_conf 88999999999999805883899889996479--9999999-984379839-999086999 No 269 >cd02023 UMPK Uridine monophosphate kinase (UMPK, EC 2.7.1.48), also known as uridine kinase or uridine-cytidine kinase (UCK), catalyzes the reversible phosphoryl transfer from ATP to uridine or cytidine to yield UMP or CMP. In the primidine nucleotide-salvage pathway, this enzyme combined with nucleoside diphosphate kinases further phosphorylates UMP and CMP to form UTP and CTP. This kinase also catalyzes the phosphorylation of several cytotoxic ribonucleoside analogs such as 5-flurrouridine and cyclopentenyl-cytidine. Probab=95.43 E-value=0.023 Score=39.66 Aligned_cols=31 Identities=26% Similarity=0.450 Sum_probs=25.2 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 599428787703899999999996798289960 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) +.|-||+|+||| |+++.|+..|- +.+|..++ T Consensus 2 IgI~G~sgsGKT-T~a~~L~~~l~-~~~v~~i~ 32 (198) T cd02023 2 IGIAGGSGSGKT-TVAEEIIEQLG-NPKVVIIS 32 (198) T ss_pred EEEECCCCCCHH-HHHHHHHHHHC-CCCEEEEE T ss_conf 898899988599-99999999809-99858997 No 270 >PRK11160 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed Probab=95.43 E-value=0.011 Score=43.04 Aligned_cols=21 Identities=10% Similarity=0.063 Sum_probs=10.8 Q ss_pred CCCCHHHHHHHHHHHH--CCCEE Q ss_conf 6899899999999997--89989 Q gi|254780229|r 1660 KQDSDFAVSVVNELEK--TGFAC 1680 (1775) Q Consensus 1660 ~~ds~fe~~v~~~L~~--~G~~v 1680 (1775) .-|+.-|..|.+.|.+ +|-+| T Consensus 506 ~LD~~te~~i~~~l~~~~~~rT~ 528 (575) T PRK11160 506 GLDAETERQILELLFEHAQNKTV 528 (575) T ss_pred CCCHHHHHHHHHHHHHHCCCCEE T ss_conf 67999999999999986699999 No 271 >pfam05729 NACHT NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931. Probab=95.42 E-value=0.017 Score=40.93 Aligned_cols=28 Identities=32% Similarity=0.580 Sum_probs=24.4 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHHCCCE Q ss_conf 55994287877038999999999967982 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAMLHGKK 449 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la~Gk~ 449 (1775) -++|+|+||.||| |.+--|+...+.|+. T Consensus 2 ~i~i~G~aG~GKT-tll~kl~~~wa~g~~ 29 (165) T pfam05729 2 TVILQGEAGSGKT-TLLQKLALLWAQGKL 29 (165) T ss_pred EEEEECCCCCCHH-HHHHHHHHHHHCCCC T ss_conf 8999827989899-999999999986984 No 272 >KOG0057 consensus Probab=95.41 E-value=0.022 Score=39.70 Aligned_cols=16 Identities=13% Similarity=0.210 Sum_probs=8.7 Q ss_pred CCCCHHHHHHHHHHHH Q ss_conf 6899899999999997 Q gi|254780229|r 1660 KQDSDFAVSVVNELEK 1675 (1775) Q Consensus 1660 ~~ds~fe~~v~~~L~~ 1675 (1775) ..|+.=|.++...+.. T Consensus 517 ~LD~~TE~~i~~~i~~ 532 (591) T KOG0057 517 ALDSETEREILDMIMD 532 (591) T ss_pred CCCHHHHHHHHHHHHH T ss_conf 2365669999999987 No 273 >PRK10536 hypothetical protein; Provisional Probab=95.41 E-value=0.077 Score=34.11 Aligned_cols=34 Identities=26% Similarity=0.439 Sum_probs=19.3 Q ss_pred EEEECCHHHCCHHHHHHHHHH---CCEEEEEECCCCC Q ss_conf 788606323698998898861---1828996077866 Q gi|254780229|r 1364 LVIMDESSQIKPEDALGVIAR---GKQIVVVGDPKQL 1397 (1775) Q Consensus 1364 ~VI~DEASQ~~~~~al~al~r---akqvvvvGD~kQl 1397 (1775) .||+|||-.+++...--.|.| +-++||.||+.|. T Consensus 179 ~IIvDEaQN~T~~qmk~iLTRiG~~SKiVi~GD~~Q~ 215 (262) T PRK10536 179 VVILDEAQNVTAAQMKMFLTRLGENVTVIVNGDITQC 215 (262) T ss_pred EEEEEHHHCCCHHHHHHHHHHCCCCCEEEEECCCCCC T ss_conf 8998412128999998898542599689996882022 No 274 >PRK00149 dnaA chromosomal replication initiation protein; Reviewed Probab=95.41 E-value=0.061 Score=35.15 Aligned_cols=58 Identities=26% Similarity=0.454 Sum_probs=39.4 Q ss_pred CCCCCCCHHHHHHHHHHHC--CCC---EEEECCCCCCCHHHHHHHHHHHHHC---CCEEEEEECCHH Q ss_conf 2366773468999998514--895---5994287877038999999999967---982899601058 Q gi|254780229|r 400 PLIDDSDSSQHSAVIDVIN--GKN---LIIEGPPGTGKSQTITNIIAAAMLH---GKKILFCAQKMA 458 (1775) Q Consensus 400 ~~~~~aDsSQ~~av~~~~~--g~~---~vi~GPPGTGKSQTIaNlIa~~la~---Gk~VLfVaeK~a 458 (1775) |.|-++..-=.+|..++.. |.. |.|.||+|.|||| +.++|++.+.. +++|++++--.= T Consensus 121 FVvG~sN~lA~aAA~~Va~~pg~~yNPLfIyG~~GlGKTH-Ll~AIgn~~~~~~p~~~v~Y~tae~F 186 (447) T PRK00149 121 FVVGKSNRLAHAAALAVAENPGKAYNPLFIYGGVGLGKTH-LLHAIGNYILEKNPNAKVVYVSSEKF 186 (447) T ss_pred CEECCCCHHHHHHHHHHHHCCCCCCCCEEEECCCCCCHHH-HHHHHHHHHHHHCCCCEEEEEEHHHH T ss_conf 2226985999999999983767677855897799887889-99999999998589972899549999 No 275 >PRK10787 DNA-binding ATP-dependent protease La; Provisional Probab=95.41 E-value=0.014 Score=41.70 Aligned_cols=22 Identities=50% Similarity=0.579 Sum_probs=11.8 Q ss_pred HHHHHHHHCCCEEEEEECCHHH Q ss_conf 9999999679828996010589 Q gi|254780229|r 438 NIIAAAMLHGKKILFCAQKMAA 459 (1775) Q Consensus 438 NlIa~~la~Gk~VLfVaeK~aA 459 (1775) ++|-.++..++.|++|++|..- T Consensus 38 ~AlE~A~~~d~~I~LVaQKD~~ 59 (784) T PRK10787 38 RCLEAAMDHDKKIMLVAQKEAS 59 (784) T ss_pred HHHHHHHHCCCEEEEEECCCCC T ss_conf 9999999649979999756888 No 276 >PRK13947 shikimate kinase; Provisional Probab=95.40 E-value=0.015 Score=41.45 Aligned_cols=25 Identities=36% Similarity=0.592 Sum_probs=23.0 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 95599428787703899999999996 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +|+++.|+||+||| ||...+|..|. T Consensus 2 knI~LiG~mGsGKT-tiGk~La~~L~ 26 (171) T PRK13947 2 KNIVLIGFMGTGKT-TVGKKVATTLS 26 (171) T ss_pred CCEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 85899799999889-99999999979 No 277 >COG0703 AroK Shikimate kinase [Amino acid transport and metabolism] Probab=95.40 E-value=0.014 Score=41.81 Aligned_cols=26 Identities=38% Similarity=0.517 Sum_probs=23.5 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 895599428787703899999999996 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) .+++|+.||||+||| ||.-.+|..|. T Consensus 2 ~~~IvLiG~mGaGKS-TIGr~LAk~L~ 27 (172) T COG0703 2 NMNIVLIGFMGAGKS-TIGRALAKALN 27 (172) T ss_pred CCCEEEECCCCCCHH-HHHHHHHHHCC T ss_conf 961899717999776-89999999819 No 278 >cd03248 ABCC_TAP TAP, the Transporter Associated with Antigen Processing; TAP is essential for peptide delivery from the cytosol into the lumen of the endoplasmic reticulum (ER), where these peptides are loaded on major histocompatibility complex (MHC) I molecules. Loaded MHC I leave the ER and display their antigenic cargo on the cell surface to cytotoxic T cells. Subsequently, virus-infected or malignantly transformed cells can be eliminated. TAP belongs to the large family of ATP-binding cassette (ABC) transporters, which translocate a vast variety of solutes across membranes. Probab=95.39 E-value=0.013 Score=42.01 Aligned_cols=52 Identities=19% Similarity=0.445 Sum_probs=38.2 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHH--HHHHHHH Q ss_conf 8514895599428787703899999999996-798289960105899--9999864 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAI--EVVRHRL 467 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aAL--dVV~~RL 467 (1775) .+..|+-++|.||.|+||| |++++|+-.+. ...+|+|--.....+ +.+++++ T Consensus 36 ~i~~Ge~vaIvG~sGsGKS-TL~~ll~gl~~p~~G~I~idg~~i~~~~~~~lr~~i 90 (226) T cd03248 36 TLHPGEVTALVGPSGSGKS-TVVALLENFYQPQGGQVLLDGKPISQYEHKYLHSKV 90 (226) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEECCEEHHHCCHHHHHHCE T ss_conf 9829999999999998499-999999645467887899999993448999997326 No 279 >PRK10416 cell division protein FtsY; Provisional Probab=95.37 E-value=0.028 Score=38.68 Aligned_cols=50 Identities=24% Similarity=0.416 Sum_probs=38.8 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC---CHHHHHHHHHHHHHCCC Q ss_conf 994287877038999999999967982899601---05899999986443584 Q gi|254780229|r 423 IIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ---KMAAIEVVRHRLEKAGL 472 (1775) Q Consensus 423 vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae---K~aALdVV~~RL~~~gL 472 (1775) .+.|--|+|||.||.-|...+...||+||++|- ..||.+-++---...|+ T Consensus 299 l~vGvNG~GKTTTigKLA~~~~~~gkkVllaA~DTfRaAAieQL~~w~~r~~v 351 (499) T PRK10416 299 LMVGVNGVGKTTTIGKLARQFEQQGKSVMLAAGDTFRAAAVEQLQVWGQRNNI 351 (499) T ss_pred EEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHCCCCC T ss_conf 99747878789899999999997799537884066756899999998424573 No 280 >PRK08006 replicative DNA helicase; Provisional Probab=95.37 E-value=0.054 Score=35.72 Aligned_cols=57 Identities=21% Similarity=0.345 Sum_probs=46.3 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHH-HHHCCCEEEEEECCHHHHHHHHHHHHHCC Q ss_conf 8514895599428787703899999999-99679828996010589999998644358 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAA-AMLHGKKILFCAQKMAAIEVVRHRLEKAG 471 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~-~la~Gk~VLfVaeK~aALdVV~~RL~~~g 471 (1775) -...|.=+||=|-||-|||..-.||+.+ ++..|++|+|.|--|.+-+++.|-|...+ T Consensus 220 Gl~~G~LiviaaRPsmGKTalalnia~~~a~~~~~~V~~fSlEMs~~ql~~Rlla~~s 277 (471) T PRK08006 220 GLQPSDLIIVAARPSMGKTTFAMNLCENAAMLQDKPVLIFSLEMPGEQIMMRMLASLS 277 (471) T ss_pred CCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHC T ss_conf 8821738999946998769999999999998669957998167999999999999744 No 281 >TIGR03018 pepcterm_TyrKin exopolysaccharide/PEPCTERM locus tyrosine autokinase. Members of this protein family are related to a known protein-tyrosine autokinase and to numerous homologs from exopolysaccharide biosynthesis region proteins, many of which are designated as chain length determinants. Most members of this family contain a short region, immediately C-terminal to the region modeled here, with an abundance of Tyr residues. These C-terminal tyrosine residues are likely to be autophosphorylation sites. Some members of this family are fusion proteins. Probab=95.36 E-value=0.028 Score=38.77 Aligned_cols=36 Identities=25% Similarity=0.421 Sum_probs=25.2 Q ss_pred CCCCEEEECC-CCCCCHHHHHHHHHHHHH--CCCEEEEEE Q ss_conf 4895599428-787703899999999996--798289960 Q gi|254780229|r 418 NGKNLIIEGP-PGTGKSQTITNIIAAAML--HGKKILFCA 454 (1775) Q Consensus 418 ~g~~~vi~GP-PGTGKSQTIaNlIa~~la--~Gk~VLfVa 454 (1775) .+|-++|..| ||-||| ||+--+|.++| -|||||+|= T Consensus 34 ~~kvi~VTS~~pgeGKT-tva~nLA~~lA~~~~~~VLLVD 72 (207) T TIGR03018 34 NNNLIMVTSSLPGEGKS-FTAINLAISLAQEYDKTVLLID 72 (207) T ss_pred CCEEEEEECCCCCCCHH-HHHHHHHHHHHHHCCCEEEEEE T ss_conf 88099997899999889-9999999999972498599995 No 282 >cd03289 ABCC_CFTR2 The CFTR subfamily domain 2. The cystic fibrosis transmembrane regulator (CFTR), the product of the gene mutated in patients with cystic fibrosis, has adapted the ABC transporter structural motif to form a tightly regulated anion channel at the apical surface of many epithelia. Use of the term assembly of a functional ion channel implies the coming together of subunits or at least smaller not-yet functional components of the active whole. In fact, on the basis of current knowledge only the CFTR polypeptide itself is required to form an ATP- and protein kinase A-dependent low-conductance chloride channel of the type present in the apical membrane of many epithelial cells. CFTR displays the typical organization (IM-ABC)2 and carries a characteristic hydrophilic R-domain that separates IM1-ABC1 from IM2-ABC2. Probab=95.36 E-value=0.018 Score=40.76 Aligned_cols=38 Identities=21% Similarity=0.290 Sum_probs=30.4 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEE Q ss_conf 851489559942878770389999999999679828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfV 453 (1775) .+..|.-+.|.||.|+||| |+.++|.-.+..-.+|.+- T Consensus 26 ~I~~Ge~vaIvG~sGsGKS-TLl~lL~gl~~~~G~I~id 63 (275) T cd03289 26 SISPGQRVGLLGRTGSGKS-TLLSAFLRLLNTEGDIQID 63 (275) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHHHCCCCCEEEEC T ss_conf 9879999999999999799-9999996035789539999 No 283 >KOG0738 consensus Probab=95.36 E-value=0.012 Score=42.63 Aligned_cols=23 Identities=39% Similarity=0.800 Sum_probs=17.6 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHH Q ss_conf 895599428787703899999999 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAA 442 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~ 442 (1775) =|.+++.|||||||| -++-+||. T Consensus 245 WkgvLm~GPPGTGKT-lLAKAvAT 267 (491) T KOG0738 245 WKGVLMVGPPGTGKT-LLAKAVAT 267 (491) T ss_pred CCEEEEECCCCCCHH-HHHHHHHH T ss_conf 300055679997478-99999988 No 284 >PRK05541 adenylylsulfate kinase; Provisional Probab=95.36 E-value=0.026 Score=39.09 Aligned_cols=33 Identities=24% Similarity=0.537 Sum_probs=25.3 Q ss_pred CCEE--EECCCCCCCHHHHHHHHHHHH-HCCCEEEEE Q ss_conf 9559--942878770389999999999-679828996 Q gi|254780229|r 420 KNLI--IEGPPGTGKSQTITNIIAAAM-LHGKKILFC 453 (1775) Q Consensus 420 ~~~v--i~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfV 453 (1775) +.+| +.|.||+||| ||++.+...| ..|..|.+. T Consensus 6 kg~viW~TGLsGSGKT-TiA~~l~~~L~~~g~~~~~L 41 (176) T PRK05541 6 NGYVIWITGLAGSGKT-TIAKALYERLKLKYSNVIYL 41 (176) T ss_pred CCEEEEEECCCCCCHH-HHHHHHHHHHHHCCCCEEEE T ss_conf 8679997899999899-99999999999759977998 No 285 >COG1221 PspF Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain [Transcription / Signal transduction mechanisms] Probab=95.35 E-value=0.026 Score=38.98 Aligned_cols=48 Identities=31% Similarity=0.340 Sum_probs=36.7 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CEEEEEECCHHHH Q ss_conf 899999851489559942878770389999999999679-8289960105899 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KKILFCAQKMAAI 460 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~VLfVaeK~aAL 460 (1775) |..+ .+-.|.++.|.|+|||||| .|+++|. ++... +.--||+--=+|+ T Consensus 93 qik~--~ap~~~~vLi~GetGtGKe-l~A~~iH-~~s~r~~~~PFI~~NCa~~ 141 (403) T COG1221 93 QIKA--YAPSGLPVLIIGETGTGKE-LFARLIH-ALSARRAEAPFIAFNCAAY 141 (403) T ss_pred HHHH--HCCCCCCEEEECCCCCCHH-HHHHHHH-HHHHCCCCCCEEEEEHHHH T ss_conf 9986--1899984798668875388-9999999-8612135898799777773 No 286 >pfam02456 Adeno_IVa2 Adenovirus IVa2 protein. IVa2 protein can interact with the adenoviral packaging signal and that this interaction involves DNA sequences that have previously been demonstrated to be required for packaging. During the course of lytic infection, the adenovirus major late promoter (MLP) is induced to high levels after replication of viral DNA has started. IVa2 is a transcriptional activator of the major late promoter. Probab=95.35 E-value=0.014 Score=41.78 Aligned_cols=29 Identities=17% Similarity=0.354 Sum_probs=14.1 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHCCEEEEEEC Q ss_conf 200114777656246899999832142698870 Q gi|254780229|r 1587 VFQRFGPINSDIGWRRLNVLFTRSRRRIEVFST 1619 (1775) Q Consensus 1587 ~~~~fGpln~~gg~rRlNVa~sRAr~~~~v~~s 1619 (1775) |.+|-.|-...|| |++-=--.-+|++.|+ T Consensus 241 VLHNmNPR~d~~G----NI~~LKIQaK~HIiSp 269 (370) T pfam02456 241 VLHNMNPRTDIGG----NIATLKIQAKLHILSP 269 (370) T ss_pred EEECCCCCCCCCC----CCCCHHHHHEEEEECC T ss_conf 9605786666679----8341102011345168 No 287 >PRK13700 conjugal transfer protein TraD; Provisional Probab=95.34 E-value=0.027 Score=38.80 Aligned_cols=36 Identities=25% Similarity=0.416 Sum_probs=32.8 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 895599428787703899999999996798289960 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) -++|.|.|-||||||+.|-.|+...-++|.++.++= T Consensus 185 ~qH~li~GTtGtGKS~~ir~LL~qIR~RGdrAIIyD 220 (732) T PRK13700 185 IQNFCLHGTVGAGKSEVIRRLANYARQRGDMVVIYD 220 (732) T ss_pred HHEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEC T ss_conf 312677468888899999999999997299589993 No 288 >TIGR01192 chvA glucan exporter ATP-binding protein; InterPro: IPR005896 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains glucan exporter ATP binding protein in bacteria. It belongs to the larger ABC transporter superfamily with the characteristic ATP binding motif. In general, this protein is in some ways implicated in osmo-regulation and is suggested to participate in the export of glucan from the cytoplasm to periplasm. The cyclic beta-1,2-glucan in the bacterial periplasmic space is suggested to confer the property of high osmolority. It has also been demonstrated that mutants in this loci have lost functions of virulence and motility. It is unclear as to how virulence and osmo-adaptaion are related. ; GO: 0005215 transporter activity, 0006810 transport. Probab=95.34 E-value=0.01 Score=43.17 Aligned_cols=35 Identities=34% Similarity=0.666 Sum_probs=28.2 Q ss_pred CCHHH--HHHHHHHHCCCCEEEECCCCCCCHHHHHHHH Q ss_conf 73468--9999985148955994287877038999999 Q gi|254780229|r 405 SDSSQ--HSAVIDVINGKNLIIEGPPGTGKSQTITNII 440 (1775) Q Consensus 405 aDsSQ--~~av~~~~~g~~~vi~GPPGTGKSQTIaNlI 440 (1775) |.|+| ...--.+++|+.+.|.||+|.||| |+.|+. T Consensus 345 aNs~QGV~dvSF~~KAG~TvAIVGPTGAGKT-TLiNLL 381 (592) T TIGR01192 345 ANSSQGVFDVSFEAKAGQTVAIVGPTGAGKT-TLINLL 381 (592) T ss_pred CCCCCCCEEEEEEEECCCEEEEECCCCCCHH-HHHHHH T ss_conf 8766883000366524756898778997178-998877 No 289 >pfam01580 FtsK_SpoIIIE FtsK/SpoIIIE family. FtsK has extensive sequence similarity to wide variety of proteins from prokaryotes and plasmids, termed the FtsK/SpoIIIE family. This domain contains a putative ATP binding P-loop motif. It is found in the FtsK cell division protein from E. coli and the stage III sporulation protein E SpoIIIE which has roles in regulation of prespore specific gene expression in B. subtilis. A mutation in FtsK causes a temperature sensitive block in cell division and it is involved in peptidoglycan synthesis or modification. The SpoIIIE protein is implicated in intercellular chromosomal DNA transfer. Probab=95.33 E-value=0.031 Score=38.23 Aligned_cols=46 Identities=20% Similarity=0.372 Sum_probs=33.3 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCC----EEEEEECCHHHHHHH Q ss_conf 4895599428787703899999999996798----289960105899999 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGK----KILFCAQKMAAIEVV 463 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk----~VLfVaeK~aALdVV 463 (1775) ....++|-|-||+|||.++-++|++++.... ++.++=-|...+... T Consensus 37 ~~pH~Lv~G~tGsGKS~~l~~li~sl~~~~~p~~v~l~liD~K~~~~~~~ 86 (202) T pfam01580 37 KMPHLLIAGATGSGKSTFLNTLILSLAARHSPEEVRLYLIDPKGGELAAL 86 (202) T ss_pred CCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHH T ss_conf 68868996589998009999999999873796206999974896126767 No 290 >cd01123 Rad51_DMC1_radA Rad51_DMC1_radA,B. This group of recombinases includes the eukaryotic proteins RAD51, RAD55/57 and the meiosis-specific protein DMC1, and the archaeal proteins radA and radB. They are closely related to the bacterial RecA group. Rad51 proteins catalyze a similiar recombination reaction as RecA, using ATP-dependent DNA binding activity and a DNA-dependent ATPase. However, this reaction is less efficient and requires accessory proteins such as RAD55/57 . Probab=95.33 E-value=0.028 Score=38.73 Aligned_cols=38 Identities=24% Similarity=0.295 Sum_probs=26.2 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHH------CCCEEEEEECC Q ss_conf 895599428787703899999999996------79828996010 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAML------HGKKILFCAQK 456 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la------~Gk~VLfVaeK 456 (1775) |.=.-|.|||||||||--..+.+++.. .|++|+|+.-. T Consensus 19 G~itEi~G~~GsGKTql~lqla~~~~~~~~~~g~~~~vvyIdtE 62 (235) T cd01123 19 GSITEIFGEFGSGKTQLCHQLAVTVQLPIELGGLEGKAVYIDTE 62 (235) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECC T ss_conf 87999999999849999999999984247536789629999536 No 291 >COG1074 RecB ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) [DNA replication, recombination, and repair] Probab=95.32 E-value=0.06 Score=35.23 Aligned_cols=63 Identities=19% Similarity=0.172 Sum_probs=35.3 Q ss_pred HHHHHHHHHCCCEEEEEE----CCHHHHHHHHHHHHHC--CCC-CEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 999999996798289960----1058999999864435--844-13654026611089999999999863 Q gi|254780229|r 437 TNIIAAAMLHGKKILFCA----QKMAAIEVVRHRLEKA--GLG-EFCLELHSHKVHKRAILDDLRKRIDN 499 (1775) Q Consensus 437 aNlIa~~la~Gk~VLfVa----eK~aALdVV~~RL~~~--gLg-~~cleLHs~ka~k~~vl~~L~~~le~ 499 (1775) .+....+.-.|+.+|+-| -|+..|-..+=||--. |+. +-.|-|.-+++...+.-.++..++.. T Consensus 6 ~~~~~~~~~~~~~~lveASAGSGKT~vL~~r~lrlLl~~~~~~v~~ILvvTFT~aAa~Emk~RI~~~L~~ 75 (1139) T COG1074 6 EALQPLASPPGQSVLVEASAGTGKTFVLAERVLRLLLEGGPLDVDEILVVTFTKAAAAEMKERIRDRLKE 75 (1139) T ss_pred HHHCHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCCHHHEEEECCCHHHHHHHHHHHHHHHHH T ss_conf 4425553586666899966889842169999999997258998557234406388899999999999999 No 292 >cd02021 GntK Gluconate kinase (GntK) catalyzes the phosphoryl transfer from ATP to gluconate. The resulting product gluconate-6-phoshate is an important precursor of gluconate metabolism. GntK acts as a dimmer composed of two identical subunits. Probab=95.31 E-value=0.015 Score=41.63 Aligned_cols=23 Identities=30% Similarity=0.577 Sum_probs=20.2 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 599428787703899999999996 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +||.||.||||| ||+..+|..+. T Consensus 2 iiv~GvsGsGKS-Tia~~La~~lg 24 (150) T cd02021 2 IVVMGVSGSGKS-TVGKALAERLG 24 (150) T ss_pred EEEECCCCCCHH-HHHHHHHHHHC T ss_conf 899918999999-99999999719 No 293 >TIGR02746 TraC-F-type type-IV secretion system protein TraC; InterPro: IPR014117 The proteins in this entry are found in the F, P and I-like type IV secretion systems. Gene symbols include TraC (F-type), TrbE/VirB4 (P-type) and TraU (I-type). The proteins contain the Walker A and B motifs and so are putative nucleotide triphosphatases.. Probab=95.30 E-value=0.033 Score=38.00 Aligned_cols=132 Identities=17% Similarity=0.163 Sum_probs=69.2 Q ss_pred CHHHHHHHHHHHHHH-CCCCEEEEEE------E-EEEECCHHHHH-HHHHHHHCCCCCCHHHHHCCHHHHHHHCCCCCCC Q ss_conf 989999999999861-2898088654------5-56622514777-8643100013221014320878788504555556 Q gi|254780229|r 304 WPEDYFLQIQKIIEE-SKSHWAVRRY------G-VLGLLNFSKML-MYLDLDHLRWPEGEDNILNHDIIQRLFIAQSCDH 374 (1775) Q Consensus 304 ~~~~~~~~~~~~i~~-~~~~w~v~~~------~-~lg~fsf~k~~-m~~DL~~~~~~~~~~~~~~~~~v~~~~~~~~~~~ 374 (1775) +.+.+-+.+..++.- .+.||.+.++ . .|+..||.--- ||+|+.-. .+...+....++.-+-. .+++. T Consensus 375 ~~~~~~~~~~~a~~~Fr~~gF~L~~~~~~~~~~vfL~~LPf~~~~glf~~~~~~---~~~~~~~~~N~~~l~Pl-~adw~ 450 (900) T TIGR02746 375 NPDNLRRDVEAAKSSFRKNGFDLAEDKRIQLQAVFLAALPFALTEGLFKDLKKA---GRVRTLSSANAVNLLPL-IADWK 450 (900) T ss_pred CHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HCCCCCHHHHHHHCCCE-EEECC T ss_conf 888999999999999985882003054212378999730356314788987652---21343045538661742-14026 Q ss_pred C-------CCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 6-------55666433433568821133000-223667734689999985148955994287877038999999999967 Q gi|254780229|r 375 D-------KDNASSNQNIEYKIDEIEDIHQR-CPLIDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 375 ~-------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) + .+..+.- + -+..-++. .+-|.+.|+. -..|-|+||.|+||.|||=++=.+|-..|+. T Consensus 451 Gnnmedrl~~~~~g~--l-----~ptrR~Ql~~~d~F~~~~~-------~i~NYN~~~~g~sG~GKSf~~Q~~~~~~L~~ 516 (900) T TIGR02746 451 GNNMEDRLTLKGSGL--L-----LPTRRGQLSAFDPFDSDLL-------NITNYNIAVVGGSGAGKSFFMQELIVSVLSR 516 (900) T ss_pred CCCHHHHHCCCCCEE--E-----EEECCCEEEEEEECCCCCC-------CCCCCEEEEECCCCCCHHHHHHHHHHHHHHC T ss_conf 885012204787727--7-----5213535877830337866-------7677148997288987168798999998517 Q ss_pred CCEEEEE Q ss_conf 9828996 Q gi|254780229|r 447 GKKILFC 453 (1775) Q Consensus 447 Gk~VLfV 453 (1775) |.+|=++ T Consensus 517 G~kv~vi 523 (900) T TIGR02746 517 GGKVWVI 523 (900) T ss_pred CCEEEEE T ss_conf 9848999 No 294 >PRK10789 putative multidrug transporter membrane\ATP-binding components; Provisional Probab=95.29 E-value=0.013 Score=42.09 Aligned_cols=14 Identities=14% Similarity=0.285 Sum_probs=9.7 Q ss_pred HHHHHCCEEEEEEC Q ss_conf 98861182899607 Q gi|254780229|r 1380 GVIARGKQIVVVGD 1393 (1775) Q Consensus 1380 ~al~rakqvvvvGD 1393 (1775) --+-.|+.+.|||. T Consensus 336 l~I~~Ge~vaIVG~ 349 (569) T PRK10789 336 FTLKPGQMLGICGP 349 (569) T ss_pred CEECCCCEEEEECC T ss_conf 68889978998799 No 295 >cd03277 ABC_SMC5_euk Eukaryotic SMC5 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (for Probab=95.29 E-value=0.013 Score=42.03 Aligned_cols=34 Identities=38% Similarity=0.530 Sum_probs=25.6 Q ss_pred CCCCE-EEECCCCCCCHHHHHHHHHHHHHCCCEEEE Q ss_conf 48955-994287877038999999999967982899 Q gi|254780229|r 418 NGKNL-IIEGPPGTGKSQTITNIIAAAMLHGKKILF 452 (1775) Q Consensus 418 ~g~~~-vi~GPPGTGKSQTIaNlIa~~la~Gk~VLf 452 (1775) -|.++ +|.||.||||| ||+.+||-+|+-.-++|= T Consensus 21 ~gp~lN~IiGpNGSGKS-sIv~AI~lgLGG~p~~lg 55 (213) T cd03277 21 PGPSLNMIIGPNGSGKS-SIVCAICLGLGGKPKLLG 55 (213) T ss_pred CCCCEEEEECCCCCCHH-HHHHHHHHHHCCCHHHCC T ss_conf 49975799889988789-999999988189800045 No 296 >COG0563 Adk Adenylate kinase and related kinases [Nucleotide transport and metabolism] Probab=95.28 E-value=0.014 Score=41.81 Aligned_cols=23 Identities=43% Similarity=0.807 Sum_probs=20.5 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHH Q ss_conf 955994287877038999999999 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAA 443 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~ 443 (1775) ..++|-||||+||| |.+..|+.. T Consensus 1 ~riiilG~pGaGK~-T~A~~La~~ 23 (178) T COG0563 1 MRILILGPPGAGKS-TLAKKLAKK 23 (178) T ss_pred CEEEEECCCCCCHH-HHHHHHHHH T ss_conf 97999899999889-999999997 No 297 >pfam07724 AAA_2 AAA domain (Cdc48 subfamily). This Pfam entry includes some of the AAA proteins not detected by the pfam00004 model. Probab=95.28 E-value=0.023 Score=39.53 Aligned_cols=34 Identities=21% Similarity=0.500 Sum_probs=28.3 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 95599428787703899999999996798289960 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) -+|++.|||||||| .++-++|.++..+++-+.+. T Consensus 4 ~~~l~~GPsGvGKT-~lAk~la~~l~~~~~~~i~~ 37 (168) T pfam07724 4 GSFLFLGPTGVGKT-ELAKALAELLFGDERALIRI 37 (168) T ss_pred EEEEEECCCCCCHH-HHHHHHHHHHCCCCCCEEEE T ss_conf 79998898998999-99999999967985344885 No 298 >cd03291 ABCC_CFTR1 The CFTR subfamily domain 1. The cystic fibrosis transmembrane regulator (CFTR), the product of the gene mutated in patients with cystic fibrosis, has adapted the ABC transporter structural motif to form a tightly regulated anion channel at the apical surface of many epithelia. Use of the term assembly of a functional ion channel implies the coming together of subunits, or at least smaller not-yet functional components of the active whole. In fact, on the basis of current knowledge only the CFTR polypeptide itself is required to form an ATP- and protein kinase A-dependent low-conductance chloride channel of the type present in the apical membrane of many epithelial cells. CFTR displays the typical organization (IM-ABC)2 and carries a characteristic hydrophilic R-domain that separates IM1-ABC1 from IM2-ABC2. Probab=95.28 E-value=0.012 Score=42.36 Aligned_cols=31 Identities=26% Similarity=0.545 Sum_probs=27.0 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 98514895599428787703899999999996 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ..+..|.-+.|.||.|+||| |+.++|+-.+- T Consensus 58 f~I~~Ge~vaIVG~sGSGKS-TLl~lL~gl~~ 88 (282) T cd03291 58 LKIEKGEMLAITGSTGSGKT-SLLMLILGELE 88 (282) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCC T ss_conf 99849999999999998199-99999957872 No 299 >cd02019 NK Nucleoside/nucleotide kinase (NK) is a protein superfamily consisting of multiple families of enzymes that share structural similarity and are functionally related to the catalysis of the reversible phosphate group transfer from nucleoside triphosphates to nucleosides/nucleotides, nucleoside monophosphates, or sugars. Members of this family play a wide variety of essential roles in nucleotide metabolism, the biosynthesis of coenzymes and aromatic compounds, as well as the metabolism of sugar and sulfate. Probab=95.28 E-value=0.029 Score=38.52 Aligned_cols=32 Identities=28% Similarity=0.594 Sum_probs=27.5 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 5994287877038999999999967982899601 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ 455 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae 455 (1775) .+|.||.|+||| |++|.+++.+ .+..+=+..+ T Consensus 2 I~I~G~~gsGKs-T~a~~l~~~l-~~~~~~~~~~ 33 (69) T cd02019 2 IAITGGSGSGKS-TVAKKLAEQL-GGRSVVVLDE 33 (69) T ss_pred EEEECCCCCCHH-HHHHHHHHHH-CCCEEEEEEH T ss_conf 788368888725-8999999995-8954999631 No 300 >COG4181 Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, ATPase component [Secondary metabolites biosynthesis, transport, and catabolism] Probab=95.27 E-value=0.047 Score=36.38 Aligned_cols=49 Identities=37% Similarity=0.593 Sum_probs=33.0 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHH------------------------HHHCCCEEEEEEC------CHHHHHHHH Q ss_conf 8514895599428787703899999999------------------------9967982899601------058999999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAA------------------------AMLHGKKILFCAQ------KMAAIEVVR 464 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~------------------------~la~Gk~VLfVae------K~aALdVV~ 464 (1775) .+..|..+.|.||-|+||| |..-+.|- +..+|+.|-||-+ .|.||+.|. T Consensus 32 ~v~~Ge~vaivG~SGSGKS-TLl~vlAGLd~~ssGeV~l~G~~L~~ldEd~rA~~R~~~vGfVFQSF~Lip~ltAlENV~ 110 (228) T COG4181 32 VVKRGETVAIVGPSGSGKS-TLLAVLAGLDDPSSGEVRLLGQPLHKLDEDARAALRARHVGFVFQSFHLIPNLTALENVA 110 (228) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEECCCCHHCCCHHHHHHHHCCCEEEEEEEEECCCCCHHHHHCC T ss_conf 8627865888778887577-699999567789985489847501006978898763463359987533036520666512 No 301 >PRK01184 hypothetical protein; Provisional Probab=95.27 E-value=0.061 Score=35.20 Aligned_cols=41 Identities=32% Similarity=0.643 Sum_probs=27.5 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCC Q ss_conf 599428787703899999999996798289960105899999986443584 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGL 472 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gL 472 (1775) +.|.|+|||||| |++++...+ |=.|.- +-|+|++-+...|+ T Consensus 4 IGlTG~iGSGKs-tva~i~~e~---G~~vi~------~~Divr~~v~~~g~ 44 (183) T PRK01184 4 IIVTGMPGSGKG-EFSKIAREL---GIPVVV------MGDVIREEVKKRGL 44 (183) T ss_pred EEEECCCCCCHH-HHHHHHHHC---CCEEEE------CCHHHHHHHHHCCC T ss_conf 999689988789-999999977---993998------60778999998389 No 302 >PRK03731 aroL shikimate kinase II; Reviewed Probab=95.26 E-value=0.018 Score=40.72 Aligned_cols=26 Identities=35% Similarity=0.557 Sum_probs=23.0 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 895599428787703899999999996 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) -+++++.|+||+||| ||..++|..|. T Consensus 2 ~~~I~LiG~mGsGKs-tiGk~LA~~L~ 27 (172) T PRK03731 2 TQPLFLVGPRGCGKT-TVGMALAQALG 27 (172) T ss_pred CCCEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 998899889999889-99999999859 No 303 >TIGR01073 pcrA ATP-dependent DNA helicase PcrA; InterPro: IPR005751 The DNA helicase PcrA is found mainly in Gram-positive bacteria and belongs to a superfamily of helicases, together with Rep and UvrD helicases from Escherichia coli. The action of the 3'5' bacterial DNA helicase is based upon two separate but coupled activities, ssDNA translocation and duplex destabilisation, and is driven by energy derived from the continuous ATP-binding and hydrolysis events that take place in the active-site cleft. The resulting conformational changes that accompany these events underpin the coupling process and allow the helicase to translocate along the DNA, destabilising the duplex and separating the two strands in an active process .; GO: 0004003 ATP-dependent DNA helicase activity, 0006268 DNA unwinding during replication, 0005737 cytoplasm. Probab=95.25 E-value=0.064 Score=34.97 Aligned_cols=185 Identities=16% Similarity=0.131 Sum_probs=100.4 Q ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCHHHCCCCCCCEEEEEEEEECCCCCCCCCCCCC Q ss_conf 71899819989999999999999312976899976411478985854256808666788999966507888876200114 Q gi|254780229|r 1513 ESLGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKNLENVQGDERDRIFISFTYGPHEPGGRVFQRFG 1592 (1775) Q Consensus 1513 ~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvknle~vQGdERDvI~~S~~yg~~~~~g~~~~~fG 1592 (1775) .|++=++=.-+...||-|. |.. ++-| ....|.+.+-.=+|+..=|-|-+|||| || = ..|-+++. = T Consensus 585 es~~~L~dFL~~~aL~~D~-D~l----~~~E----~~~~Gdg~VtLMTLHaAKGLEFPVVFL-~G--l--EeG~FPh~-R 649 (811) T TIGR01073 585 ESIKSLVDFLTDLALVSDL-DDL----EEEE----EEAEGDGAVTLMTLHAAKGLEFPVVFL-IG--L--EEGVFPHS-R 649 (811) T ss_pred CHHHHHHHHHHHHHHHHHH-HHC----CCCH----HHCCCCCEEEEHHHHCCCCCCCCEEEE-CC--C--CCCCCHHH-H T ss_conf 0033799999878765424-433----6540----113799648631343025677762565-48--7--65567213-4 Q ss_pred CCC--CCCH---HHHHH-HHHHHHCCEEEEEECCCHH---HCCHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 777--6562---46899-9998321426988703878---8796653543489999999997406422345578666899 Q gi|254780229|r 1593 PIN--SDIG---WRRLN-VLFTRSRRRIEVFSTMRYL---DVVVDIDSKLGIRVMRDFLHFAETGYMEHSLKTKTNKQDS 1663 (1775) Q Consensus 1593 pln--~~gg---~rRlN-Va~sRAr~~~~v~~s~~~~---~i~~~~~~~~g~~~lk~~l~~a~~g~~~~~~~~~~~~~ds 1663 (1775) .|+ .+.+ ||||- |+||||++.+++=|+..-. .+..+.-+..---.--..|+-+.+| ...+.......+.. T Consensus 650 sL~~~d~~~lEEERRLAYVGITRA~~~LyLT~A~~R~lfG~~~~~~PSRFl~EIP~~lLe~~~~g-~~~~~~a~~~~~~~ 728 (811) T TIGR01073 650 SLMEDDEKELEEERRLAYVGITRAEEELYLTHATSRTLFGRIQMNPPSRFLNEIPDELLETASTG-RRTATSAPALDVKA 728 (811) T ss_pred HCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCHHCCCHHHHHHCCCC-CCCCCCCCCCCCCC T ss_conf 04767811356788888666554431000100334253223368987542330897772010047-87665443346567 Q ss_pred HHHH----HHHHHHHHCCCEEEECCCCCCEECCEEEEC----CCCCCCEEEEEECCCH Q ss_conf 8999----999999978998970657577101257336----8988831799974580 Q gi|254780229|r 1664 DFAV----SVVNELEKTGFACDSQLGDMGFSIDVVVRD----PNNPGHYLMGIECDGA 1713 (1775) Q Consensus 1664 ~fe~----~v~~~L~~~G~~v~~qvg~~~~riDl~v~~----p~~pg~~~l~iecDG~ 1713 (1775) |+.. .=....+..|+.-...-|.+|-.+-+.|.| ..+.-+-.++|--.|+ T Consensus 729 p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~vGDrv~H~~fG~GTVv~v~G~G~ 786 (811) T TIGR01073 729 PLSRQKQRSRPTTSQATGVAKIRSAVAGGDTLSWAVGDRVSHKKFGVGTVVSVKGKGD 786 (811) T ss_pred CCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEECCC T ss_conf 7222124322221056553100235666563244058610343000058988850555 No 304 >cd03298 ABC_ThiQ_thiamine_transporter ABC-type thiamine tranport system; part of the binding-protein-dependent transport system tbpA-thiPQ for thiamine and TPP. Probably responsible for the translocation of thiamine across the membrane. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=95.25 E-value=0.012 Score=42.40 Aligned_cols=49 Identities=29% Similarity=0.397 Sum_probs=35.5 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CEEEEEECCHHHHHHHH Q ss_conf 851489559942878770389999999999679-82899601058999999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KKILFCAQKMAAIEVVR 464 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~VLfVaeK~aALdVV~ 464 (1775) .+..|.-++|.||.|+||| |+.|+|+-.+..- .+|+|--.....++..+ T Consensus 20 ~i~~Ge~~~ilGpSGsGKS-TLl~li~Gl~~p~sG~I~i~G~di~~~~~~~ 69 (211) T cd03298 20 TFAQGEITAIVGPSGSGKS-TLLNLIAGFETPQSGRVLINGVDVTAAPPAD 69 (211) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEECCEECCCCCHHH T ss_conf 9889989999999995599-9999997699988529999999999999889 No 305 >KOG1969 consensus Probab=95.25 E-value=0.11 Score=32.68 Aligned_cols=59 Identities=31% Similarity=0.552 Sum_probs=37.6 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEE-ECCHHHHHHHHHHHHH---------CCCCCEEEECC Q ss_conf 89559942878770389999999999679828996-0105899999986443---------58441365402 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFC-AQKMAAIEVVRHRLEK---------AGLGEFCLELH 480 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfV-aeK~aALdVV~~RL~~---------~gLg~~cleLH 480 (1775) .|=+.+.||||-||| |.|.+||+- .|=+|+=+ |---.+..+|+.|+.. ++-.+-||-+- T Consensus 326 kKilLL~GppGlGKT-TLAHViAkq--aGYsVvEINASDeRt~~~v~~kI~~avq~~s~l~adsrP~CLViD 394 (877) T KOG1969 326 KKILLLCGPPGLGKT-TLAHVIAKQ--AGYSVVEINASDERTAPMVKEKIENAVQNHSVLDADSRPVCLVID 394 (877) T ss_pred CCEEEEECCCCCCHH-HHHHHHHHH--CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEEE T ss_conf 006875368878724-799999986--285488732555434788999999988641122568886359984 No 306 >cd00550 ArsA_ATPase Oxyanion-translocating ATPase (ArsA). This ATPase is involved in transport of arsenite, antimonite or other oxyanions across biological membranes in all three kingdoms of life. ArsA contains a highly conserved AAA motif present in the AAA+ ATPase superfamily associated with a variety of cellular activities. To form a functional ATP-driven pump, ArsA interacts with the permease ArsB, which is a channel-forming integral membrane protein. One of the most interesting features of ArsA is the allosteric activation by its transport substrates. A divalent cation, typically Mg2+, is required for its enzymatic activity. Probab=95.24 E-value=0.031 Score=38.30 Aligned_cols=36 Identities=33% Similarity=0.558 Sum_probs=30.8 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECCHH Q ss_conf 5994287877038999999999967-982899601058 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQKMA 458 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK~a 458 (1775) +.+.|-=|.||| |++-.+|-++|. |||||+|+--.| T Consensus 3 i~~~GKGGVGKT-T~AaalA~~lA~~G~kVLlvstDPa 39 (254) T cd00550 3 IFFGGKGGVGKT-TISAATAVRLAEQGKKVLLVSTDPA 39 (254) T ss_pred EEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 999689855489-9999999999968994999958986 No 307 >PRK13657 cyclic beta-1,2-glucan ABC transporter; Provisional Probab=95.24 E-value=0.014 Score=41.99 Aligned_cols=12 Identities=25% Similarity=0.567 Sum_probs=8.1 Q ss_pred HHHHCCEEEEEE Q ss_conf 886118289960 Q gi|254780229|r 1381 VIARGKQIVVVG 1392 (1775) Q Consensus 1381 al~rakqvvvvG 1392 (1775) -+-.|+.+-||| T Consensus 357 ~i~~Ge~vaiVG 368 (585) T PRK13657 357 EAKPGQTVAIVG 368 (585) T ss_pred EECCCCEEEEEC T ss_conf 975998899988 No 308 >COG0513 SrmB Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis] Probab=95.24 E-value=0.16 Score=30.91 Aligned_cols=24 Identities=29% Similarity=0.542 Sum_probs=12.2 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCC Q ss_conf 899999851489559942878770 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGK 432 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGK 432 (1775) |..+|-.+..|+|++++.++|||| T Consensus 56 Q~~~iP~~l~g~Dvi~~A~TGsGK 79 (513) T COG0513 56 QLAAIPLILAGRDVLGQAQTGTGK 79 (513) T ss_pred HHHHCHHHHCCCCEEEECCCCCHH T ss_conf 996587763699979986898717 No 309 >cd01393 recA_like RecA is a bacterial enzyme which has roles in homologous recombination, DNA repair, and the induction of the SOS response. RecA couples ATP hydrolysis to DNA strand exchange. While prokaryotes have a single RecA protein, eukaryotes have multiple RecA homologs such as Rad51, DMC1 and Rad55/57. Archaea have the RecA-like homologs radA and radB. Probab=95.24 E-value=0.031 Score=38.31 Aligned_cols=37 Identities=30% Similarity=0.435 Sum_probs=28.8 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCC------CEEEEEE Q ss_conf 489559942878770389999999999679------8289960 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHG------KKILFCA 454 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~G------k~VLfVa 454 (1775) .|+=.-|-|||||||||.-.++++++...| .+|+|+. T Consensus 18 ~G~ItEi~G~~gsGKT~l~lqla~~~q~~~~~~~~~g~vvyID 60 (226) T cd01393 18 TGRITEIFGEFGSGKTQLCLQLAVEAQLPGELGGLEGKVVYID 60 (226) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCEEEEEE T ss_conf 8839999999999899999999999854221169996199995 No 310 >cd03253 ABCC_ATM1_transporter ATM1 is an ABC transporter that is expressed in the mitochondria. Although the specific function of ATM1 is unknown, its disruption results in the accumulation of excess mitochondrial iron, loss of mitochondrial cytochromes, oxidative damage to mitochondrial DNA, and decreased levels of cytosolic heme proteins. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=95.23 E-value=0.014 Score=41.95 Aligned_cols=39 Identities=33% Similarity=0.527 Sum_probs=31.0 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEE Q ss_conf 851489559942878770389999999999-6798289960 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCA 454 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVa 454 (1775) .+..|.-++|.||.|+||| |++++|.-.+ ....+|+|-- T Consensus 23 ~i~~Ge~v~ivG~sGsGKS-TLl~ll~gl~~p~~G~I~idg 62 (236) T cd03253 23 TIPAGKKVAIVGPSGSGKS-TILRLLFRFYDVSSGSILIDG 62 (236) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHCCCCCCCCCEEEECC T ss_conf 9869999999999999899-999997438548874899999 No 311 >cd03260 ABC_PstB_phosphate_transporter Phosphate uptake is of fundamental importance in the cell physiology of bacteria because phosphate is required as a nutrient. The Pst system of E. coli comprises four distinct subunits encoded by the pstS, pstA, pstB, and pstC genes. The PstS protein is a phosphate-binding protein located in the periplasmic space. P stA and PstC are hydrophobic and they form the transmembrane portion of the Pst system. PstB is the catalytic subunit, which couples the energy of ATP hydrolysis to the import of phosphate across cellular membranes through the Pst system, often referred as ABC-protein. PstB belongs to one of the largest superfamilies of proteins characterized by a highly conserved adenosine triphosphate (ATP) binding cassette (ABC), which is also a nucleotide binding domain (NBD). Probab=95.22 E-value=0.023 Score=39.59 Aligned_cols=40 Identities=25% Similarity=0.406 Sum_probs=32.0 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC------CCEEEEEEC Q ss_conf 85148955994287877038999999999967------982899601 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH------GKKILFCAQ 455 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~------Gk~VLfVae 455 (1775) .+..|+-++|.||-|+||| |+.++|+-.+.. ...|+|--+ T Consensus 22 ~i~~Ge~~~iiG~SGsGKS-Tll~~i~gL~~~~~~~p~~G~I~~~g~ 67 (227) T cd03260 22 DIPKGEITALIGPSGCGKS-TLLRLLNRLNDLIPGAPDEGEVLLDGK 67 (227) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHHHHHCCCCCCCCEEEEECCE T ss_conf 8879989999999998199-999999744502689981469999999 No 312 >cd02035 ArsA ArsA ATPase functionas as an efflux pump located on the inner membrane of the cell. This ATP-driven oxyanion pump catalyzes the extrusion of arsenite, antimonite and arsenate. Maintenance of a low intracellular concentration of oxyanion produces resistance to the toxic agents. The pump is composed of two subunits, the catalytic ArsA subunit and the membrane subunit ArsB, which are encoded by arsA and arsB genes respectively. Arsenic efflux in bacteria is catalyzed by either ArsB alone or by ArsAB complex. The ATP-coupled pump, however, is more efficient. ArsA is composed of two homologous halves, A1 and A2, connected by a short linker sequence. Probab=95.22 E-value=0.034 Score=37.80 Aligned_cols=36 Identities=33% Similarity=0.556 Sum_probs=28.0 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHH Q ss_conf 599428787703899999999996-7982899601058 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMA 458 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~a 458 (1775) +++.|--|.||| |++-.+|.++| .|||||+|+--.| T Consensus 2 ~~~sGKGGVGKT-TvAaalA~~lA~~G~rvLlvs~DPa 38 (217) T cd02035 2 IFFTGKGGVGKT-TIAAATAVRLAEEGKKVLLVSTDPA 38 (217) T ss_pred EEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 899789966199-9999999999968994999958987 No 313 >cd02020 CMPK Cytidine monophosphate kinase (CMPK) catalyzes the reversible phosphorylation of cytidine monophosphate (CMP) to produce cytidine diphosphate (CDP), using ATP as the preferred phosphoryl donor. Probab=95.22 E-value=0.016 Score=41.15 Aligned_cols=23 Identities=39% Similarity=0.810 Sum_probs=20.5 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 599428787703899999999996 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ..|.||||+||| |++-+||.-+. T Consensus 2 IaIdGpagsGKs-T~ak~lA~~l~ 24 (147) T cd02020 2 IAIDGPAGSGKS-TVAKLLAKKLG 24 (147) T ss_pred EEEECCCCCCHH-HHHHHHHHHHC T ss_conf 888689978989-99999999909 No 314 >PRK11634 ATP-dependent RNA helicase DeaD; Provisional Probab=95.19 E-value=0.23 Score=29.07 Aligned_cols=50 Identities=18% Similarity=0.304 Sum_probs=33.5 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC---CCEEEEEECCHH Q ss_conf 89999985148955994287877038999999999967---982899601058 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH---GKKILFCAQKMA 458 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~---Gk~VLfVaeK~a 458 (1775) |..||-.+..|+|++++-|+|||||-.-+=-|-+.|-. +-.+|+++=++= T Consensus 33 Q~~aIP~iL~GkDvi~~AqTGSGKTlAFlLPiL~~l~~~~~~pqaLIL~PTRE 85 (629) T PRK11634 33 QAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLHNLDPELKAPQILVLAPTRE 85 (629) T ss_pred HHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCEEEECCCHH T ss_conf 99999999679988997888478999999999998662368986899789989 No 315 >pfam07652 Flavi_DEAD Flavivirus DEAD domain. Probab=95.19 E-value=0.072 Score=34.40 Aligned_cols=54 Identities=15% Similarity=0.300 Sum_probs=46.3 Q ss_pred CCCCEEEECCCCCCCHHH-HHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCC Q ss_conf 489559942878770389-999999999679828996010589999998644358 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQT-ITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAG 471 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQT-IaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~g 471 (1775) +|+--||.=-||+|||.. |-++|-.|+.+++|+|+.+-.+..++-|+.=|...+ T Consensus 1 kg~~t~ld~HPGaGKTr~vLP~~v~~~i~~~lRtlVLaPTRVV~~Em~eAL~g~~ 55 (146) T pfam07652 1 KGTLTVLDLHPGAGKTRKVLPELVRECIDRRLRTLVLAPTRVVLAEMEEALRGLP 55 (146) T ss_pred CCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHCCCC T ss_conf 9832798538999970224899999999728618997727999999999975899 No 316 >KOG0736 consensus Probab=95.19 E-value=0.034 Score=37.79 Aligned_cols=30 Identities=27% Similarity=0.680 Sum_probs=21.8 Q ss_pred HHHCCC-CEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 851489-5599428787703899999999996 Q gi|254780229|r 415 DVINGK-NLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 415 ~~~~g~-~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ...++. .|.++||||+||+ |.+-..|..+. T Consensus 426 ~~~~~~~~vLLhG~~g~GK~-t~V~~vas~lg 456 (953) T KOG0736 426 ALLTLNPSVLLHGPPGSGKT-TVVRAVASELG 456 (953) T ss_pred HCCCCCEEEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 01335537998679998757-99999999838 No 317 >PRK07263 consensus Probab=95.17 E-value=0.069 Score=34.60 Aligned_cols=55 Identities=20% Similarity=0.289 Sum_probs=43.6 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 5148955994287877038999999999-967982899601058999999864435 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ...|.=.||-|.||-|||-.-.|++.++ +-.|++|+|.|--|.+-+++.+-|... T Consensus 200 l~~GdLiviaaRPsmGKTa~alnia~~iA~~~~~~V~~fSlEMs~~ql~~R~la~~ 255 (453) T PRK07263 200 LHPDQLIILAARPAVGKTAFVLNIAQNVGTKQKKTVAIFSLEMGAESLVDRMLAAE 255 (453) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHH T ss_conf 99786899972788847899999999999855982899924699899999999986 No 318 >PRK00279 adk adenylate kinase; Reviewed Probab=95.16 E-value=0.018 Score=40.64 Aligned_cols=23 Identities=48% Similarity=0.816 Sum_probs=20.5 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 559942878770389999999999 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAM 444 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~l 444 (1775) ++||-||||+||+ |.+..||... T Consensus 2 ~iillG~PGsGKg-Tqa~~la~~~ 24 (215) T PRK00279 2 RLILLGPPGAGKG-TQAKFIAEKY 24 (215) T ss_pred EEEEECCCCCCHH-HHHHHHHHHH T ss_conf 7999899999879-9999999986 No 319 >cd01131 PilT Pilus retraction ATPase PilT. PilT is a nucleotide binding protein responsible for the retraction of type IV pili, likely by pili disassembly. This retraction provides the force required for travel of bacteria in low water environments by a mechanism known as twitching motility. Probab=95.16 E-value=0.028 Score=38.78 Aligned_cols=37 Identities=22% Similarity=0.430 Sum_probs=27.0 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHH Q ss_conf 5994287877038999999999967982899601058 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMA 458 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~a 458 (1775) ++|.||+|+|||.|+..++.+.....++..++-|... T Consensus 4 iLitG~TGSGKTTtl~all~~i~~~~~~~IiTiEDPi 40 (198) T cd01131 4 VLVTGPTGSGKSTTLAAMIDYINKNKTHHILTIEDPI 40 (198) T ss_pred EEEECCCCCCHHHHHHHHHHHHCCCCCCEEEEECCCH T ss_conf 9998999997999999999853637883699964737 No 320 >pfam06309 Torsin Torsin. This family consists of several eukaryotic torsin proteins. Torsion dystonia is an autosomal dominant movement disorder characterized by involuntary, repetitive muscle contractions and twisted postures. The most severe early-onset form of dystonia has been linked to mutations in the human DYT1 (TOR1A) gene encoding a protein termed torsinA. While causative genetic alterations have been identified, the function of torsin proteins and the molecular mechanism underlying dystonia remain unknown. Phylogenetic analysis of the torsin protein family indicates these proteins share distant sequence similarity with the large and diverse family of (pfam00004) proteins. It has been suggested that torsins play a role in effectively managing protein folding and that possible breakdown in a neuroprotective mechanism that is, in part, mediated by torsins may be responsible for the neuronal dysfunction associated with dystonia. Probab=95.15 E-value=0.06 Score=35.26 Aligned_cols=27 Identities=37% Similarity=0.593 Sum_probs=22.6 Q ss_pred CCEE--EECCCCCCCHHHHHHHHHHHHHCC Q ss_conf 9559--942878770389999999999679 Q gi|254780229|r 420 KNLI--IEGPPGTGKSQTITNIIAAAMLHG 447 (1775) Q Consensus 420 ~~~v--i~GPPGTGKSQTIaNlIa~~la~G 447 (1775) |-+| .+|||||||| =++.|||..|+.. T Consensus 52 KpLVlSfHG~tGtGKn-~vs~liA~~Ly~~ 80 (127) T pfam06309 52 KPLVLSFHGWTGTGKN-FVAEIIADNLYRD 80 (127) T ss_pred CCEEEEECCCCCCCHH-HHHHHHHHHHHHC T ss_conf 9748870189998798-9999999998754 No 321 >KOG0058 consensus Probab=95.15 E-value=0.032 Score=38.11 Aligned_cols=12 Identities=33% Similarity=0.661 Sum_probs=6.8 Q ss_pred HHHHCCEEEEEE Q ss_conf 886118289960 Q gi|254780229|r 1381 VIARGKQIVVVG 1392 (1775) Q Consensus 1381 al~rakqvvvvG 1392 (1775) .|--|+-|-+|| T Consensus 490 ti~pGe~vALVG 501 (716) T KOG0058 490 TIRPGEVVALVG 501 (716) T ss_pred EECCCCEEEEEC T ss_conf 407997799988 No 322 >TIGR00064 ftsY signal recognition particle-docking protein FtsY; InterPro: IPR004390 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family includes the cell division ABC transporter and the periplasmic substrate-binding protein FtsY. There is a weak division between FtsY and SRP54; both are GTPases. In Escherichia coli, ftsY is an essential gene located in an operon with cell division genes ftsE and ftsX, but its apparent function is as the signal recognition particle docking protein.; GO: 0005525 GTP binding. Probab=95.14 E-value=0.03 Score=38.41 Aligned_cols=73 Identities=25% Similarity=0.332 Sum_probs=47.8 Q ss_pred HCCCCEE--EECCCCCCCHHHHHHHHHHHHHCCCEEEEEE---CCHHHHHHHHHHHHHCCCCCEEEECCCCC-HHHHHHH Q ss_conf 1489559--9428787703899999999996798289960---10589999998644358441365402661-1089999 Q gi|254780229|r 417 INGKNLI--IEGPPGTGKSQTITNIIAAAMLHGKKILFCA---QKMAAIEVVRHRLEKAGLGEFCLELHSHK-VHKRAIL 490 (1775) Q Consensus 417 ~~g~~~v--i~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa---eK~aALdVV~~RL~~~gLg~~cleLHs~k-a~k~~vl 490 (1775) ..++=+| +.|=-|||||.||+=|-..+...||+||.+| ...||.+=+.-=-...|.. +=-|... ++..+|. T Consensus 78 ~~~kp~Vil~VGVNG~GKTTTIaKLA~~l~~~Gk~V~laAgDTFRAAA~EQL~~Wa~R~gv~---vi~~~~gn~DPAaV~ 154 (284) T TIGR00064 78 EEKKPNVILFVGVNGVGKTTTIAKLANKLKKQGKSVLLAAGDTFRAAAIEQLEVWAKRLGVD---VIKQKEGNADPAAVI 154 (284) T ss_pred CCCCCEEEEEEEEECCCHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHHHHHCCE---EEECCCCCCCCHHHH T ss_conf 47897799998440886010288999999874990899827524799999999989883875---540788988717899 Q ss_pred HH Q ss_conf 99 Q gi|254780229|r 491 DD 492 (1775) Q Consensus 491 ~~ 492 (1775) =+ T Consensus 155 fD 156 (284) T TIGR00064 155 FD 156 (284) T ss_pred HH T ss_conf 99 No 323 >PRK13826 Dtr system oriT relaxase; Provisional Probab=95.14 E-value=0.14 Score=31.54 Aligned_cols=29 Identities=7% Similarity=0.113 Sum_probs=14.0 Q ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 41365402661108999999999986311 Q gi|254780229|r 473 GEFCLELHSHKVHKRAILDDLRKRIDNRN 501 (1775) Q Consensus 473 g~~cleLHs~ka~k~~vl~~L~~~le~~~ 501 (1775) +.+|+-+=-.-+-|..+|...++.||... T Consensus 397 ~~Ia~VvG~AGaGKStmL~aAReawEa~G 425 (1102) T PRK13826 397 ERIAAVIGRAGAGKTTMMKAAREAWEAAG 425 (1102) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHCC T ss_conf 86689984288878899999999999779 No 324 >TIGR01277 thiQ thiamine ABC transporter, ATP-binding protein; InterPro: IPR005968 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Thiamine pyrophosphate (TPP) is a required cofactor synthesized de novo in Salmonella typhimurium. The primary role for TPP is in central metabolism as an electron carrier and nucleophile for such enzymes as pyruvate dehydrogenase (1.2.4.1 from EC), acetolactate synthase (4.1.3.18 from EC), and alpha-ketoglutarate dehydrogenase (1.2.4.2 from EC). Despite its importance in cellular physiology, neither the de novo biosynthetic pathway nor the salvage systems for thiamine are fully understood in any organism. The model describes thiamine ABC transporter, ATP-binding protein, believed to be involved in the specific translocation of thiamine and its phosphoesters across the inner membrane The protein belongs to the larger ABC transport system which consists of at least three components: the inner membrane permease; thiamine binding protein and an ATP-binding subunit. This protein is found so far only in Proteobacteria, and is found in complete genomes only if the ThiB and ThiP subunits are also found. It has been experimentally demonstrated that mutants in the various steps in the de novo synthesis of thiamine and its biologically active form, namely thiamine pyrophosphate can be exogenously supplemented with thiamine, thiamine monophosphate or thiamine pyrophosphate. ; GO: 0005524 ATP binding, 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=95.13 E-value=0.015 Score=41.56 Aligned_cols=32 Identities=41% Similarity=0.671 Sum_probs=29.3 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 985148955994287877038999999999967 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) ..|..|.-++|-||=|.||| |..|+||=.+.- T Consensus 19 L~V~~Ge~VAi~GpSGAGKS-TLLnLiAGF~~P 50 (213) T TIGR01277 19 LSVEDGERVAILGPSGAGKS-TLLNLIAGFLEP 50 (213) T ss_pred CCCCCCCEEEEECCCCCCHH-HHHHHHHHHCCC T ss_conf 41301776888758986278-898778640477 No 325 >COG0556 UvrB Helicase subunit of the DNA excision repair complex [DNA replication, recombination, and repair] Probab=95.12 E-value=0.057 Score=35.44 Aligned_cols=44 Identities=30% Similarity=0.497 Sum_probs=33.0 Q ss_pred HHHHHHHHHC----CC-CEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 8999998514----89-55994287877038999999999967982899601 Q gi|254780229|r 409 QHSAVIDVIN----GK-NLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ 455 (1775) Q Consensus 409 Q~~av~~~~~----g~-~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae 455 (1775) |-.||..... |. .-++-|-+|||||-||+|.|+.. +|..|+.|. T Consensus 17 QP~AI~~Lv~gi~~g~~~QtLLGvTGSGKTfT~AnVI~~~---~rPtLV~Ah 65 (663) T COG0556 17 QPEAIAELVEGIENGLKHQTLLGVTGSGKTFTMANVIAKV---QRPTLVLAH 65 (663) T ss_pred CHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHH---CCCEEEEEC T ss_conf 6799999998886386025886203688310799999986---897199825 No 326 >PRK13949 shikimate kinase; Provisional Probab=95.12 E-value=0.021 Score=40.08 Aligned_cols=25 Identities=24% Similarity=0.420 Sum_probs=22.8 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 95599428787703899999999996 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +++++.|+||+||| ||..++|..|. T Consensus 2 k~I~LiG~mGsGKs-tiGk~La~~l~ 26 (169) T PRK13949 2 ARIFLVGYMGAGKT-TLGKALARELG 26 (169) T ss_pred CCEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 83899799999889-99999999959 No 327 >PTZ00088 adenylate kinase 1; Provisional Probab=95.11 E-value=0.019 Score=40.38 Aligned_cols=24 Identities=29% Similarity=0.624 Sum_probs=20.7 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 5599428787703899999999996 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ++|+-||||+||+ |++.+|+.... T Consensus 2 ~iillGpPGsGKg-T~a~~l~~~~~ 25 (225) T PTZ00088 2 KIVLFGAPGVGKG-TFAEILSKKEK 25 (225) T ss_pred EEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 7999899999879-99999999879 No 328 >PRK10790 putative multidrug transporter membrane\ATP-binding components; Provisional Probab=95.08 E-value=0.017 Score=41.09 Aligned_cols=14 Identities=14% Similarity=0.249 Sum_probs=9.5 Q ss_pred HHHHHCCEEEEEEC Q ss_conf 98861182899607 Q gi|254780229|r 1380 GVIARGKQIVVVGD 1393 (1775) Q Consensus 1380 ~al~rakqvvvvGD 1393 (1775) --+-.|..+-|||- T Consensus 362 ~~I~~Ge~vaIVG~ 375 (593) T PRK10790 362 LSVPSRNFVALVGH 375 (593) T ss_pred CCCCCCCEEEEECC T ss_conf 04489978998799 No 329 >TIGR01351 adk adenylate kinases; InterPro: IPR006259 Most members of this family are known or believed to be adenylate kinase. Adenylate kinase (ADK) 2.7.4.3 from EC converts ATP + AMP to ADP + ADP, that is, it uses ATP as a phosphate donor for AMP. Two ADK isozymes have been identified in mammalian cells. These specifically bind AMP and favour binding to ATP over other nucleotide triphosphates (AK1 is cytosolic and AK2 is located in the mitochondria). However, some members accept other nucleotide triphosphates as donors, and may be unable to use ATP, and may fail to complement adenylate kinase mutants. An example of a nucleoside-triphosphate--adenylate kinase (2.7.4.10 from EC) is Q9UIJ7 from SWISSPROT, a GTP:AMP phosphotransferase that has been identified in bovine heart and human cells and derived from mitrochondrail GTP AMP that is specific for the phosphorylation of AMP, but can only use GTP or ITP as a substrate .This family is designated subfamily rather than equivalog for this reason. ADK has also been identified in different bacterial species and in yeast . Two further enzymes are known to be related to the ADK family, i.e. yeast uridine monophosphokinase and slime mold UMP-CMP kinase. Within the ADK family there are several conserved regions, including the ATP-binding domains. One of the most conserved areas includes an Arg residue, whose modification inactivates the enzyme, together with an Asp that resides in the catalytic cleft of the enzyme and participates in a salt bridge. ; GO: 0005524 ATP binding, 0016776 phosphotransferase activity phosphate group as acceptor, 0019201 nucleotide kinase activity, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=95.08 E-value=0.017 Score=40.93 Aligned_cols=24 Identities=46% Similarity=0.728 Sum_probs=21.0 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 5599428787703899999999996 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +||+=||||+||- |++-.|+.-+. T Consensus 1 ~~~~lGpPGsGKG-TQa~~i~~~~g 24 (232) T TIGR01351 1 RLILLGPPGSGKG-TQAKRIAEKLG 24 (232) T ss_pred CEEEECCCCCCHH-HHHHHHHHHCC T ss_conf 9467559898766-79999998608 No 330 >pfam00910 RNA_helicase RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses. Probab=95.08 E-value=0.017 Score=41.01 Aligned_cols=24 Identities=46% Similarity=0.850 Sum_probs=20.9 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 5994287877038999999999967 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) +.|.||||+||| |.++.+|..+++ T Consensus 1 i~l~G~~G~GKS-~~a~~la~~~~~ 24 (105) T pfam00910 1 IWLYGPPGCGKS-TLAKYLARALLD 24 (105) T ss_pred CEEECCCCCCHH-HHHHHHHHHHHH T ss_conf 989799989889-999999999999 No 331 >cd03300 ABC_PotA_N PotA is an ABC-type transporter and the ATPase component of the spermidine/putrescine-preferential uptake system consisting of PotA, -B, -C, and -D. PotA has two domains with the N-terminal domain containing the ATPase activity and the residues required for homodimerization with PotA and heterdimerization with PotB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=95.08 E-value=0.016 Score=41.37 Aligned_cols=46 Identities=26% Similarity=0.486 Sum_probs=34.7 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECCHHHHH Q ss_conf 85148955994287877038999999999967-982899601058999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQKMAAIE 461 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK~aALd 461 (1775) .+..|.-++|-||.|+||| |+.++|+-..-. ..+|+|--+....+. T Consensus 22 ~v~~Ge~~~iiGpSGsGKS-Tllr~i~Gl~~p~~G~I~~~g~~v~~~~ 68 (232) T cd03300 22 DIKEGEFFTLLGPSGCGKT-TLLRLIAGFETPTSGEILLDGKDITNLP 68 (232) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEECCEECCCCC T ss_conf 8879989999999998399-9999997799998539999999999999 No 332 >cd03249 ABC_MTABC3_MDL1_MDL2 MTABC3 (also known as ABCB6) is a mitochondrial ATP-binding cassette protein involved in iron homeostasis and one of four ABC transporters expressed in the mitochondrial inner membrane, the other three being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast MDL1 (multidrug resistance-like protein 1) and MDL2 (multidrug resistance-like protein 2) transporters are also included in this CD. MDL1 is an ATP-dependent permease that acts as a high-copy suppressor of ATM1 and is thought to have a role in resistance to oxidative stress. Interestingly, subfamily B is more closely related to the carboxyl-terminal component of subfamily C than the two halves of ABCC molecules are with one another. Probab=95.07 E-value=0.016 Score=41.19 Aligned_cols=44 Identities=23% Similarity=0.377 Sum_probs=32.8 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECCHHH Q ss_conf 85148955994287877038999999999967-9828996010589 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQKMAA 459 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK~aA 459 (1775) .+..|+-++|.||.|+||| |++++|+-.+-. ..+|++--..... T Consensus 25 ~i~~G~~iaIvG~sGsGKS-TLl~ll~gl~~p~~G~I~idg~~i~~ 69 (238) T cd03249 25 TIPPGKTVALVGSSGCGKS-TVVSLLERFYDPTSGEILLDGVDIRD 69 (238) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHHCCCCCCCEEEECCEECCC T ss_conf 9769999999999999899-99999823861885189999999231 No 333 >COG3839 MalK ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism] Probab=95.06 E-value=0.054 Score=35.72 Aligned_cols=77 Identities=19% Similarity=0.367 Sum_probs=53.8 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHH Q ss_conf 8514895599428787703899999999996-798289960105899999986443584413654026611089999999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHKRAILDDL 493 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k~~vl~~L 493 (1775) .+..|.-+||-||-|.||| |..+|||-+.. .+.+|+|--+-..-+.--+|.+. .-...|+|-=|-+ |++.+ T Consensus 25 ~i~~Gef~vllGPSGcGKS-TlLr~IAGLe~p~~G~I~i~g~~vt~l~P~~R~ia-mVFQ~yALyPhMt------V~~Ni 96 (338) T COG3839 25 DIEDGEFVVLLGPSGCGKS-TLLRMIAGLEEPTSGEILIDGRDVTDLPPEKRGIA-MVFQNYALYPHMT------VYENI 96 (338) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEECCEECCCCCHHHCCEE-EEECCCCCCCCCC------HHHHH T ss_conf 9747979999899988889-99999968877887159999999998995578889-9937830157876------99973 Q ss_pred HHHHHH Q ss_conf 999863 Q gi|254780229|r 494 RKRIDN 499 (1775) Q Consensus 494 ~~~le~ 499 (1775) .--+.. T Consensus 97 af~Lk~ 102 (338) T COG3839 97 AFGLKL 102 (338) T ss_pred HHHHHH T ss_conf 416644 No 334 >PRK06851 hypothetical protein; Provisional Probab=95.06 E-value=0.037 Score=37.44 Aligned_cols=34 Identities=29% Similarity=0.501 Sum_probs=24.4 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEE Q ss_conf 9559942878770389999999999679828996 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFC 453 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfV 453 (1775) +=++|-|.||||||.-+..+-..|+.+|-.|-+. T Consensus 216 ~~y~ikG~pGtGKstlL~~i~~~A~~~G~dvevy 249 (368) T PRK06851 216 NRYFLKGRPGTGKSTMLKKVAKAAEERGLDVEVY 249 (368) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEE T ss_conf 6999818999877999999999999859828998 No 335 >cd03296 ABC_CysA_sulfate_importer Part of the ABC transporter complex cysAWTP involved in sulfate import. Responsible for energy coupling to the transport system. The complex is composed of two ATP-binding proteins (cysA), two transmembrane proteins (cysT and cysW), and a solute-binding protein (cysP). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=95.06 E-value=0.012 Score=42.38 Aligned_cols=46 Identities=28% Similarity=0.537 Sum_probs=33.9 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECCHHHH Q ss_conf 985148955994287877038999999999967-98289960105899 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQKMAAI 460 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK~aAL 460 (1775) ..+..|.-++|.||.|+||| |+.++|+-..-. ..+|.|--+....+ T Consensus 23 ~~v~~Ge~~~iiGpSGsGKS-Tll~~i~Gl~~p~~G~I~~~g~~i~~~ 69 (239) T cd03296 23 LDIPSGELVALLGPSGSGKT-TLLRLIAGLERPDSGTILFGGEDATDV 69 (239) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEECCEECCCC T ss_conf 69889989999999997799-999999769999863999999999999 No 336 >PRK04863 mukB cell division protein MukB; Provisional Probab=95.05 E-value=0.3 Score=27.97 Aligned_cols=58 Identities=26% Similarity=0.236 Sum_probs=28.6 Q ss_pred CCCCCHHHHHHH-HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCE-EEEEECC-HHHHHHHHHHHHHC Q ss_conf 667734689999-985148955994287877038999999999967982-8996010-58999999864435 Q gi|254780229|r 402 IDDSDSSQHSAV-IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKK-ILFCAQK-MAAIEVVRHRLEKA 470 (1775) Q Consensus 402 ~~~aDsSQ~~av-~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~-VLfVaeK-~aALdVV~~RL~~~ 470 (1775) --+.|=|.+.=+ ++++.| |=|..|+-=...+|..-.+ |==+-+- .+||+.++.-|..+ T Consensus 188 rs~~DRskfyrlieaSlyG-----------gIssaItrsLrdYLLpen~GVrkafqdMe~alrenR~Tleai 248 (1486) T PRK04863 188 RSSSDRSKFYRLIEASLYG-----------GISSAITRSLRDYLLPENSGVRKAFQDMEAALRENRMTLEAI 248 (1486) T ss_pred CCHHHHHHHHHHHHHHHCC-----------CHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 6866689999999999706-----------706888889999848444338889999999999999779999 No 337 >PRK05563 DNA polymerase III subunits gamma and tau; Validated Probab=95.03 E-value=0.021 Score=39.90 Aligned_cols=32 Identities=28% Similarity=0.418 Sum_probs=24.7 Q ss_pred HHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 998514895599428787703899999999996 Q gi|254780229|r 413 VIDVINGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 413 v~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) |..-+-+.-++..|||||||| |.|-++|-+|- T Consensus 32 i~~~~i~hayLf~GprG~GKT-s~Ari~akaln 63 (541) T PRK05563 32 IINNRIAHAYLFCGTRGTGKT-STAKIFAKAVN 63 (541) T ss_pred HHCCCCCEEEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 984993204530387995899-99999999957 No 338 >PRK06645 DNA polymerase III subunits gamma and tau; Validated Probab=95.02 E-value=0.025 Score=39.24 Aligned_cols=33 Identities=27% Similarity=0.398 Sum_probs=26.7 Q ss_pred HHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 9998514895599428787703899999999996 Q gi|254780229|r 412 AVIDVINGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 412 av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ||..-+-...++..||+||||| |+|-++|.+|- T Consensus 36 ~i~~~~~~~aylf~G~rG~GKT-t~Ari~ak~ln 68 (507) T PRK06645 36 TILNDRLAGGYLLTGIRGVGKT-TSARIIAKAVN 68 (507) T ss_pred HHHCCCCCCEEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 9973996634774587997889-99999999967 No 339 >PRK10733 hflB ATP-dependent metalloprotease; Reviewed Probab=95.01 E-value=0.025 Score=39.18 Aligned_cols=41 Identities=29% Similarity=0.470 Sum_probs=28.0 Q ss_pred HHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHH Q ss_conf 9985148955994287877038999999999967982899601058 Q gi|254780229|r 413 VIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMA 458 (1775) Q Consensus 413 v~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~a 458 (1775) -.-++--|.+++.|||||||| ++|-|+|.--.|=|.+-.-. T Consensus 179 ~~Gak~PkGvLL~GPPGtGKT-----lLAkAvAgEa~vpF~~~sgs 219 (644) T PRK10733 179 KLGGKIPKGVLMVGPPGTGKT-----LLAKAIAGEAKVPFFTISGS 219 (644) T ss_pred HCCCCCCCCEEEECCCCCCHH-----HHHHHHHCCCCCEEEEEEHH T ss_conf 749979985177798998778-----99999864559808997847 No 340 >KOG0976 consensus Probab=95.01 E-value=0.31 Score=27.88 Aligned_cols=25 Identities=28% Similarity=0.485 Sum_probs=15.8 Q ss_pred ECCEEEECCCCCCCEEEE---EECCCHH Q ss_conf 012573368988831799---9745802 Q gi|254780229|r 1690 SIDVVVRDPNNPGHYLMG---IECDGAM 1714 (1775) Q Consensus 1690 riDl~v~~p~~pg~~~l~---iecDG~~ 1714 (1775) -|.+.-+-|..|.+|+.| +.||-+. T Consensus 1186 siei~~is~ss~~tvlaeRt~l~cvnah 1213 (1265) T KOG0976 1186 SIEIHPISPSSPHTVLAERTELRCVNAH 1213 (1265) T ss_pred CCCCCCCCCCCCHHHHHHHHHEEEECCC T ss_conf 7664256899830265420320000578 No 341 >TIGR02928 TIGR02928 orc1/cdc6 family replication initiation protein; InterPro: IPR014277 This set of DNA binding proteins shows homology to the origin recognition complex subunit 1/cell division control protein 6 family in eukaryotes. The proteins in this entry are found exclusively in the archaea. Several members may be found in a genome and interact with each other.. Probab=95.00 E-value=0.023 Score=39.57 Aligned_cols=29 Identities=28% Similarity=0.431 Sum_probs=23.6 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 514895599428787703899999999996 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) =..-.|+.|=|||||||| .+|+-++..|- T Consensus 40 G~~P~Ni~iYGkTGtGKT-~vt~~v~~~l~ 68 (383) T TIGR02928 40 GSRPSNIFIYGKTGTGKT-AVTKYVMKELE 68 (383) T ss_pred CCCCCCEEEECCCCCCHH-HHHHHHHHHHH T ss_conf 489872588788898788-99999999999 No 342 >COG1474 CDC6 Cdc6-related protein, AAA superfamily ATPase [DNA replication, recombination, and repair / Posttranslational modification, protein turnover, chaperones] Probab=95.00 E-value=0.032 Score=38.05 Aligned_cols=29 Identities=34% Similarity=0.528 Sum_probs=19.8 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHH Q ss_conf 85148955994287877038999999999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAA 443 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~ 443 (1775) .--...|++|-||||||||.|+=.+.-.. T Consensus 38 ~~~~p~n~~iyG~~GTGKT~~~~~v~~~l 66 (366) T COG1474 38 RGERPSNIIIYGPTGTGKTATVKFVMEEL 66 (366) T ss_pred CCCCCCCEEEECCCCCCHHHHHHHHHHHH T ss_conf 58998607998899987328999999999 No 343 >COG1136 SalX ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms] Probab=95.00 E-value=0.018 Score=40.74 Aligned_cols=38 Identities=29% Similarity=0.519 Sum_probs=30.2 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE Q ss_conf 85148955994287877038999999999967-9828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfV 453 (1775) .+..|.-++|.||-|+||| |..|+|+-.... +.+|.|- T Consensus 27 ~i~~Ge~vaI~GpSGSGKS-TLLniig~ld~pt~G~v~i~ 65 (226) T COG1136 27 EIEAGEFVAIVGPSGSGKS-TLLNLLGGLDKPTSGEVLIN 65 (226) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 8749989999899999899-99999964667888469999 No 344 >PRK05748 replicative DNA helicase; Provisional Probab=94.99 E-value=0.081 Score=33.89 Aligned_cols=54 Identities=20% Similarity=0.245 Sum_probs=42.4 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEECCHHHHHHHHHHHHH Q ss_conf 5148955994287877038999999999-96798289960105899999986443 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQKMAAIEVVRHRLEK 469 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVaeK~aALdVV~~RL~~ 469 (1775) ...|.=.||=|.||-|||-.-.|++.++ +..|++|+|.|--|.+-+++.|-+.. T Consensus 200 ~~~g~LiviaaRP~mGKTa~alnia~~~a~~~~~~v~~fSlEM~~~~l~~R~la~ 254 (448) T PRK05748 200 LQPNDLIIVAARPSVGKTAFALNIAQNVATKTDKNVAIFSLEMGAESLVMRMLCA 254 (448) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHH T ss_conf 8867379998479987689999999999985698089981778888999999999 No 345 >PRK08840 replicative DNA helicase; Provisional Probab=94.98 E-value=0.08 Score=33.95 Aligned_cols=55 Identities=20% Similarity=0.287 Sum_probs=42.0 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 51489559942878770389999999999-67982899601058999999864435 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ...|.=.||-|.||.|||..-.|++.++. -.|++|+|.|--|.+-+++.|-|... T Consensus 214 l~~G~LiviaaRPsmGKTalalnia~n~a~~~~~~v~~fSlEMs~~ql~~Rlls~~ 269 (464) T PRK08840 214 LQGSDLIIVAARPSMGKTTFAMNLCENAAMDQDKPVLIFSLEMPAEQLMMRMLASL 269 (464) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHH T ss_conf 87576799983798736899999999999965996799767799899999999985 No 346 >PRK13721 conjugal transfer ATP-binding protein TraC; Provisional Probab=94.97 E-value=0.051 Score=36.00 Aligned_cols=37 Identities=22% Similarity=0.355 Sum_probs=33.3 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 4895599428787703899999999996798289960 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) ...|++|.||+|+|||-++-.||.+.++.|-+|-++= T Consensus 473 tN~N~~I~g~SGsGKSf~~q~l~~~~~~~G~~V~IID 509 (864) T PRK13721 473 TNYNMAVCGTSGAGKTGLIQPLIRSVLDSGGFAVVFD 509 (864) T ss_pred CCCCEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 8700699828998689999999999985799799996 No 347 >PRK04040 adenylate kinase; Provisional Probab=94.97 E-value=0.049 Score=36.17 Aligned_cols=45 Identities=22% Similarity=0.331 Sum_probs=30.2 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHH Q ss_conf 895599428787703899999999996798289960105899999986 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHR 466 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~R 466 (1775) ++-+||.|-||+||| ||+|.+..-+-.+ .-+|+.-.--+++.+++ T Consensus 2 ~k~VvvtGiPGvGKT-Tv~~~~~~~l~~~--~~~vn~G~~M~e~A~~~ 46 (189) T PRK04040 2 MKVVVVTGVPGVGKT-TVLNKALEKLKED--YKIVNFGDVMLEVAKEE 46 (189) T ss_pred CEEEEEECCCCCCHH-HHHHHHHHHHCCC--CEEEEHHHHHHHHHHHC T ss_conf 418999758988789-9999999972358--75986779999999981 No 348 >COG1205 Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster [General function prediction only] Probab=94.96 E-value=0.16 Score=30.73 Aligned_cols=57 Identities=19% Similarity=0.249 Sum_probs=40.9 Q ss_pred CCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHH-HHHHHHHHHCCCE-EEEEECCH Q ss_conf 366773468999998514895599428787703899-9999999967982-89960105 Q gi|254780229|r 401 LIDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTI-TNIIAAAMLHGKK-ILFCAQKM 457 (1775) Q Consensus 401 ~~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTI-aNlIa~~la~Gk~-VLfVaeK~ 457 (1775) .+...-+=|..|...+.+|+|+||.-|+|+|||-+= .-|+-+++-.++. =||+-=.+ T Consensus 67 g~~~lY~HQ~~A~~~~~~G~~vvVtTgTgSGKTe~FllPIld~~l~~~~a~AL~lYPtn 125 (851) T COG1205 67 GIERLYSHQVDALRLIREGRNVVVTTGTGSGKTESFLLPILDHLLRDPSARALLLYPTN 125 (851) T ss_pred CCCCCCHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECHH T ss_conf 34420077999999997799889978998854589899999998308665089980437 No 349 >PRK13873 conjugal transfer ATPase TrbE; Provisional Probab=94.95 E-value=0.31 Score=27.75 Aligned_cols=44 Identities=25% Similarity=0.499 Sum_probs=29.9 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHC--CCEEEEEECCHHHHHHHHH Q ss_conf 8955994287877038999999999967--9828996010589999998 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLH--GKKILFCAQKMAAIEVVRH 465 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~--Gk~VLfVaeK~aALdVV~~ 465 (1775) | +..|.||+|+||| |..|+++.-.-+ |-+| |+--|-...++.-+ T Consensus 442 G-HtlI~GpTGsGKT-vll~~l~~q~~ry~~~~v-f~FDkd~s~~i~~~ 487 (815) T PRK13873 442 G-HTLVVGPTGAGKS-VLLALMALQFRRYPGSQV-FAFDFGGSIRAAAL 487 (815) T ss_pred C-CEEEECCCCCCHH-HHHHHHHHHHHHCCCCCE-EEEECCCCHHHHHH T ss_conf 6-4389788999899-999999999864489848-99978987899999 No 350 >PRK08506 replicative DNA helicase; Provisional Probab=94.95 E-value=0.087 Score=33.55 Aligned_cols=54 Identities=26% Similarity=0.443 Sum_probs=43.5 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCC Q ss_conf 489559942878770389999999999679828996010589999998644358 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAG 471 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~g 471 (1775) .|.=.||=|-||-|||-.-.||+.+++..|+.|+|-|--|.+-+++.|-|...+ T Consensus 192 ~gdLiIIAARPsmGKTAfAlniA~~~a~~~~~V~~FSLEMs~~ql~~Rlls~~s 245 (473) T PRK08506 192 KGDLIIIAARPSMGKTTLVLNMVLKALNQGKGVAFFSLEMPAEQLMLRMLSAKT 245 (473) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHC T ss_conf 562799950799867899999999999659965898224799999999999728 No 351 >KOG2028 consensus Probab=94.94 E-value=0.02 Score=40.15 Aligned_cols=36 Identities=39% Similarity=0.640 Sum_probs=26.5 Q ss_pred CHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHH Q ss_conf 34689999985148955994287877038999999999 Q gi|254780229|r 406 DSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAA 443 (1775) Q Consensus 406 DsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~ 443 (1775) |+-=.+ .+.--.=.++|+=|||||||| ||+-+|+.- T Consensus 150 ~gllrs-~ieq~~ipSmIlWGppG~GKT-tlArlia~t 185 (554) T KOG2028 150 DGLLRS-LIEQNRIPSMILWGPPGTGKT-TLARLIAST 185 (554) T ss_pred CHHHHH-HHHCCCCCCEEEECCCCCCHH-HHHHHHHHH T ss_conf 268999-987088870588669987658-899999860 No 352 >PRK13948 shikimate kinase; Provisional Probab=94.93 E-value=0.025 Score=39.18 Aligned_cols=26 Identities=31% Similarity=0.341 Sum_probs=22.8 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 895599428787703899999999996 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) -.+.|+.|+||+||| ||..++|..|. T Consensus 10 ~~~IvLIG~mGsGKS-tiGk~LA~~l~ 35 (182) T PRK13948 10 ATFVALAGFMGTGKS-RIGWELSRALA 35 (182) T ss_pred CCCEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 981898899999889-99999999969 No 353 >COG0125 Tmk Thymidylate kinase [Nucleotide transport and metabolism] Probab=94.92 E-value=0.049 Score=36.18 Aligned_cols=40 Identities=25% Similarity=0.477 Sum_probs=33.0 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHH Q ss_conf 4895599428787703899999999996-7982899601058 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMA 458 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~a 458 (1775) .|.=.||+|+=|+||| |.+++++..|- .|..|.|+.|-.- T Consensus 2 ~g~fI~iEGiDGaGKT-T~~~~L~~~l~~~g~~v~~trEP~~ 42 (208) T COG0125 2 KGMFIVIEGIDGAGKT-TQAELLKERLEERGIKVVLTREPGG 42 (208) T ss_pred CCEEEEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 9629999788889889-9999999999982980799868999 No 354 >cd00464 SK Shikimate kinase (SK) is the fifth enzyme in the shikimate pathway, a seven-step biosynthetic pathway which converts erythrose-4-phosphate to chorismic acid, found in bacteria, fungi and plants. Chorismic acid is a important intermediate in the synthesis of aromatic compounds, such as aromatic amino acids, p-aminobenzoic acid, folate and ubiquinone. Shikimate kinase catalyses the phosphorylation of the 3-hydroxyl group of shikimic acid using ATP. Probab=94.91 E-value=0.025 Score=39.19 Aligned_cols=24 Identities=33% Similarity=0.602 Sum_probs=21.8 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 5599428787703899999999996 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) |+++.|+||+||| ||.-++|..|. T Consensus 1 nI~LiG~~G~GKs-tigk~la~~l~ 24 (154) T cd00464 1 NIVLIGMMGAGKT-TVGRLLAKALG 24 (154) T ss_pred CEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 9899889999889-99999999979 No 355 >TIGR00455 apsK adenylylsulfate kinase; InterPro: IPR002891 Enzyme that catalyses the phosphorylation of adenylylsulphate to 3'-phosphoadenylylsulphate. This domain contains an ATP binding P-loop motif .; GO: 0005524 ATP binding, 0016301 kinase activity, 0016772 transferase activity transferring phosphorus-containing groups, 0000103 sulfate assimilation. Probab=94.91 E-value=0.022 Score=39.74 Aligned_cols=29 Identities=28% Similarity=0.516 Sum_probs=24.3 Q ss_pred HCCCCEE--EECCCCCCCHHHHHHHHHHHHHC Q ss_conf 1489559--94287877038999999999967 Q gi|254780229|r 417 INGKNLI--IEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 417 ~~g~~~v--i~GPPGTGKSQTIaNlIa~~la~ 446 (1775) ..-+++| ++||=|+||| ||||++...|.. T Consensus 15 ~~h~g~vlWlTGLSGsGKs-TiA~Al~~~L~~ 45 (187) T TIGR00455 15 NKHRGVVLWLTGLSGSGKS-TIANALEKKLEK 45 (187) T ss_pred CCCCEEEEEEECCCCCHHH-HHHHHHHHHHHH T ss_conf 2797389851168856357-999999999996 No 356 >KOG0731 consensus Probab=94.91 E-value=0.02 Score=40.24 Aligned_cols=36 Identities=31% Similarity=0.588 Sum_probs=24.8 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 98514895599428787703899999999996798289960 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) .-|+--+..+|.|||||||| +.|-|+|--=.|=|++ T Consensus 339 lGAKiPkGvLL~GPPGTGKT-----LLAKAiAGEAgVPF~s 374 (774) T KOG0731 339 LGAKIPKGVLLVGPPGTGKT-----LLAKAIAGEAGVPFFS 374 (774) T ss_pred CCCCCCCCEEEECCCCCCHH-----HHHHHHHCCCCCCEEE T ss_conf 47767675178789998678-----9999885305896464 No 357 >PRK08694 consensus Probab=94.91 E-value=0.082 Score=33.84 Aligned_cols=55 Identities=24% Similarity=0.390 Sum_probs=44.0 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCC-EEEEEECCHHHHHHHHHHHHHC Q ss_conf 514895599428787703899999999996798-2899601058999999864435 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAMLHGK-KILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk-~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ...|.=+||=|-||-|||-.-.|++.++...|+ +|+|.|--|.+-+++.|-|... T Consensus 215 l~~G~LiVIaaRPsmGKTalalnia~~~a~~~~~~V~~fSLEMs~~~l~~Rlla~~ 270 (468) T PRK08694 215 LQPGDLIIVAGRPSMGKTAFSINIAEHVAVEGKLPVAVFSMEMGGAQLVMRMLGSV 270 (468) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHH T ss_conf 88784799961786537899999999999847984799778899999999999972 No 358 >TIGR02203 MsbA_lipidA lipid A export permease/ATP-binding protein MsbA; InterPro: IPR011917 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family consists of a single polypeptide chain transporter in the ATP-binding cassette (ABC) transporter family, MsbA, which exports lipid A. It may also act in multidrug resistance. Lipid A, a part of lipopolysaccharide, is found in the outer leaflet of the outer membrane of most Gram-negative bacteria. Members of this family are restricted to the Proteobacteria (although lipid A is more broadly distributed) and often are clustered with lipid A biosynthesis genes .; GO: 0005524 ATP binding, 0006869 lipid transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=94.90 E-value=0.014 Score=41.82 Aligned_cols=27 Identities=33% Similarity=0.631 Sum_probs=20.1 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHH Q ss_conf 9851489559942878770389999999 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIA 441 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa 441 (1775) ..+..|+-+.|.||=|+||| ||+|||- T Consensus 383 l~~~~G~~vALVGRSGSGKs-Tlv~LlP 409 (603) T TIGR02203 383 LVVEPGETVALVGRSGSGKS-TLVNLLP 409 (603) T ss_pred CEECCCCEEEEECCCCCHHH-HHHHHCC T ss_conf 51158735998706885389-9985523 No 359 >COG0507 RecD ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member [DNA replication, recombination, and repair] Probab=94.89 E-value=0.031 Score=38.20 Aligned_cols=45 Identities=33% Similarity=0.566 Sum_probs=36.4 Q ss_pred CCCCCCCCEEEECCHHHCCHHHHHH---HHHHCCEEEEEECCCCCCCC Q ss_conf 8522541078860632369899889---88611828996077866862 Q gi|254780229|r 1356 EPKDVKFDLVIMDESSQIKPEDALG---VIARGKQIVVVGDPKQLPPT 1400 (1775) Q Consensus 1356 ~~~~~~FD~VI~DEASQ~~~~~al~---al~rakqvvvvGD~kQlpPt 1400 (1775) .......|++||||+|=+..+.+.. ++..+.++|+|||..|+||- T Consensus 402 ~~~~~~~d~~iiDe~~ml~~~~~~~l~~~i~~~a~~i~vGD~~ql~~v 449 (696) T COG0507 402 NEEPLDGDLLIIDEASMLDTSLAFGLLSAIGKLAKVILVGDVDQLPSV 449 (696) T ss_pred CCCCCCCCEEEEECHHHHHHHHHHHHHCCCCCCCEEEEECCHHHCCCC T ss_conf 545543552577304330899876665035446768885263014500 No 360 >pfam08298 AAA_PrkA PrkA AAA domain. This is a family of PrkA bacterial and archaeal serine kinases approximately 630 residues long. This is the N-terminal AAA domain. Probab=94.87 E-value=0.024 Score=39.47 Aligned_cols=26 Identities=27% Similarity=0.378 Sum_probs=21.7 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHHCC Q ss_conf 559942878770389999999999679 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAMLHG 447 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la~G 447 (1775) =+++.||+|+||| ||+..+...|=.- T Consensus 87 IllL~GPVGsGKS-sl~e~LK~glE~y 112 (358) T pfam08298 87 ILYLLGPVGGGKS-SLAERLKKLLELV 112 (358) T ss_pred EEEEECCCCCCHH-HHHHHHHHHHCCC T ss_conf 8999778987758-9999999872058 No 361 >cd03224 ABC_TM1139_LivF_branched LivF (TM1139) is part of the LIV-I bacterial ABC-type two-component transport system that imports neutral, branched-chain amino acids. The E. coli branched-chain amino acid transporter comprises a heterodimer of ABC transporters (LivF and LivG), a heterodimer of six-helix TM domains (LivM and LivH), and one of two alternative soluble periplasmic substrate binding proteins (LivK or LivJ). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. Probab=94.87 E-value=0.061 Score=35.16 Aligned_cols=40 Identities=25% Similarity=0.384 Sum_probs=32.3 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEEC Q ss_conf 8514895599428787703899999999996-7982899601 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQ 455 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVae 455 (1775) .+..|.-+.|-||.|+||| |+.++|+-.+- ...+|.|--. T Consensus 22 ~v~~Gei~~liG~nGaGKS-TLl~~i~Gl~~p~~G~I~~~G~ 62 (222) T cd03224 22 TVPEGEIVALLGRNGAGKT-TLLKTIMGLLPPRSGSIRFDGR 62 (222) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCEEEEECCE T ss_conf 9889989999999998599-9999997798899609999999 No 362 >PRK03695 vitamin B12-transporter ATPase; Provisional Probab=94.86 E-value=0.02 Score=40.20 Aligned_cols=46 Identities=33% Similarity=0.472 Sum_probs=35.3 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHH Q ss_conf 85148955994287877038999999999967982899601058999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIE 461 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALd 461 (1775) .+..|.-+.|-||.|+||| |+.++|+-.+-...+|.|--.....+. T Consensus 19 ~v~~Ge~v~iiGpNGaGKS-TLlk~i~Gl~p~~G~I~i~g~~i~~~~ 64 (245) T PRK03695 19 EVRAGEILHLVGPNGAGKS-TLLARMAGLLSGSGEIQFAGQPLEAWS 64 (245) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEECCEECCCCC T ss_conf 9959989999978994199-999998466888965999999735389 No 363 >cd03299 ABC_ModC_like Archeal protein closely related to ModC. ModC is an ABC-type transporter and the ATPase component of a molybdate transport system that also includes the periplasmic binding protein ModA and the membrane protein ModB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.86 E-value=0.018 Score=40.61 Aligned_cols=51 Identities=27% Similarity=0.481 Sum_probs=36.2 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECCHHHHHHHHHH Q ss_conf 85148955994287877038999999999967-98289960105899999986 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQKMAAIEVVRHR 466 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK~aALdVV~~R 466 (1775) .+..|.-++|-||.|+||| |+.++|+-.+-. ..+|+|--+....+..-+++ T Consensus 21 ~v~~Ge~~~iiGpSGsGKS-TLlr~i~Gl~~p~~G~I~~~G~di~~~~~~~r~ 72 (235) T cd03299 21 EVERGDYFVILGPTGSGKS-VLLETIAGFIKPDSGKILLNGKDITNLPPEKRD 72 (235) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEECCEECCCCCHHHCC T ss_conf 9889989999999963599-999999749999965999999999999976789 No 364 >COG3598 RepA RecA-family ATPase [DNA replication, recombination, and repair] Probab=94.86 E-value=0.092 Score=33.31 Aligned_cols=59 Identities=25% Similarity=0.390 Sum_probs=39.9 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCC-----------EEEEEEC---CHHHHHHHHHHHHHCCCCCE Q ss_conf 514895599428787703899999999996798-----------2899601---05899999986443584413 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAMLHGK-----------KILFCAQ---KMAAIEVVRHRLEKAGLGEF 475 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk-----------~VLfVae---K~aALdVV~~RL~~~gLg~~ 475 (1775) -+.|-..+|-||-|+||| |+.=-.|-+|+.|| +||+|+- +..+|.-.+--+...||.+. T Consensus 86 fr~g~~~~~~gdsg~GKt-tllL~l~IalaaG~~lfG~~v~epGkvlyvslEl~re~~L~Rl~~v~a~mgLsPa 158 (402) T COG3598 86 FRKGYVSILYGDSGVGKT-TLLLYLCIALAAGKNLFGNKVKEPGKVLYVSLELYREDILERLEPVRARMGLSPA 158 (402) T ss_pred HHCCEEEEEECCCCCCHH-HHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHCCCCHH T ss_conf 651705898448862376-8999999999864777453355888079998226868899999999987099857 No 365 >KOG1942 consensus Probab=94.85 E-value=0.027 Score=38.87 Aligned_cols=56 Identities=30% Similarity=0.625 Sum_probs=38.8 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHH Q ss_conf 851489559942878770389999999999679828996010589999998644358441365402661108999999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHKVHKRAILDD 492 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~ka~k~~vl~~ 492 (1775) .-..|+-+.+-|||||||+ .|+=.|++-|..| |=|+. |-+ .|+-|+.-.|.++|.+ T Consensus 60 KkmaGravLlaGppgtGKT-AlAlaisqELG~k--vPFcp--mvg-----------------SEvyS~EvKKTEvLme 115 (456) T KOG1942 60 KKMAGRAVLLAGPPGTGKT-ALALAISQELGPK--VPFCP--MVG-----------------SEVYSNEVKKTEVLME 115 (456) T ss_pred HHCCCCEEEEECCCCCCHH-HHHHHHHHHHCCC--CCCCC--CCC-----------------HHHHHHHHHHHHHHHH T ss_conf 6326727997369987656-8999999974799--98566--641-----------------0555434557899999 No 366 >PRK00771 signal recognition particle protein Srp54; Provisional Probab=94.84 E-value=0.13 Score=31.73 Aligned_cols=54 Identities=20% Similarity=0.295 Sum_probs=41.7 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEC---CHHHHHHHHHHHHHCCCCCE Q ss_conf 5994287877038999999999967982899601---05899999986443584413 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQ---KMAAIEVVRHRLEKAGLGEF 475 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVae---K~aALdVV~~RL~~~gLg~~ 475 (1775) +.+.|.-|+|||.|++-+-..+--+|++|+.||- ..||.|-++---...|+.-| T Consensus 100 im~vGlqGsGKTTT~aKLA~~~kk~g~kv~lvaaDt~RpaA~eQL~~la~~~~v~~~ 156 (433) T PRK00771 100 ILLVGLQGSGKTTTAAKLARYFQKKGLKVGVICADTWRPGAYEQLKQLCEKINVPFY 156 (433) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCC T ss_conf 999737889789999999999997799467850678836899999999986388731 No 367 >KOG3928 consensus Probab=94.84 E-value=0.037 Score=37.47 Aligned_cols=36 Identities=31% Similarity=0.436 Sum_probs=30.7 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 895599428787703899999999996798289960 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) -+-|||.|+||||||=|.+-++.+|.-.+-=||-+. T Consensus 179 ~~r~vL~Ge~GtGKSiaL~qa~h~a~~~~wlIlhip 214 (461) T KOG3928 179 VKRFVLDGEPGTGKSIALAQAVHYAADQKWLILHIP 214 (461) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEECC T ss_conf 358998579987620599999998731785999778 No 368 >COG0466 Lon ATP-dependent Lon protease, bacterial type [Posttranslational modification, protein turnover, chaperones] Probab=94.83 E-value=0.027 Score=38.94 Aligned_cols=30 Identities=17% Similarity=0.214 Sum_probs=19.4 Q ss_pred EEECHHHHHHHCCCCCCCCCEEEECCHHHCCH Q ss_conf 99257799872585225410788606323698 Q gi|254780229|r 1344 FMMSPMSVANYLEPKDVKFDLVIMDESSQIKP 1375 (1775) Q Consensus 1344 ~~msp~sva~~l~~~~~~FD~VI~DEASQ~~~ 1375 (1775) +.+++-.+-.||-... |+-=-+++..|+.. T Consensus 570 ~~i~~~~l~~yLG~~~--f~~~~~~~~~~vGv 599 (782) T COG0466 570 VKIDEKNLKKYLGVPV--FRYGKAEEEDQVGV 599 (782) T ss_pred EEECHHHHHHHHCCCC--CCCCCCCCCCCCEE T ss_conf 4427889999739863--47531124788705 No 369 >cd03234 ABCG_White The White subfamily represents ABC transporters homologous to the Drosophila white gene, which acts as a dimeric importer for eye pigment precursors. The eye pigmentation of Drosophila is developed from the synthesis and deposition in the cells of red pigments, which are synthesized from guanine, and brown pigments, which are synthesized from tryptophan. The pigment precursors are encoded by the white, brown, and scarlet genes, respectively. Evidence from genetic and biochemical studies suggest that the White and Brown proteins function as heterodimers to import guanine, while the White and Scarlet proteins function to import tryptophan. However, a recent study also suggests that White may be involved in the transport of a metabolite, such as 3-hydroxykynurenine, across intracellular membranes. Mammalian ABC transporters belonging to the White subfamily (ABCG1, ABCG5, and ABCG8) have been shown to be involved in the regulation of lipid-trafficking mechanisms in Probab=94.83 E-value=0.017 Score=40.90 Aligned_cols=42 Identities=33% Similarity=0.531 Sum_probs=32.6 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC----CEEEEEECCH Q ss_conf 851489559942878770389999999999679----8289960105 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG----KKILFCAQKM 457 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G----k~VLfVaeK~ 457 (1775) .+..|.-+.|-||.|+||| |+.|+|+-.+-.+ ..|+|--... T Consensus 29 ~i~~Ge~~~ilGpnGsGKS-TLl~~i~G~~~~~~~~~G~I~~~g~~~ 74 (226) T cd03234 29 HVESGQVMAILGSSGSGKT-TLLDAISGRVEGGGTTSGQILFNGQPR 74 (226) T ss_pred EEECCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCCEEEEEECCEEC T ss_conf 9918809999989996099-999999678978984145999999999 No 370 >KOG3347 consensus Probab=94.81 E-value=0.029 Score=38.53 Aligned_cols=28 Identities=36% Similarity=0.583 Sum_probs=23.7 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 14895599428787703899999999996 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +.+.|.+|.|-|||||| |.+.-||.... T Consensus 5 r~~PNILvtGTPG~GKs-tl~~~lae~~~ 32 (176) T KOG3347 5 RERPNILVTGTPGTGKS-TLAERLAEKTG 32 (176) T ss_pred HCCCCEEEECCCCCCCH-HHHHHHHHHHC T ss_conf 13788798679998802-59999999739 No 371 >PRK09165 replicative DNA helicase; Provisional Probab=94.81 E-value=0.096 Score=33.11 Aligned_cols=54 Identities=26% Similarity=0.368 Sum_probs=37.7 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHH---------------HCCCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 1489559942878770389999999999---------------67982899601058999999864435 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAM---------------LHGKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~l---------------a~Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ..|.=+||=|-||.|||---.||.-++. -.|++|+|.|--|.+-+++.|-|... T Consensus 203 ~~GdLiIIAARPsmGKTafaLniA~n~A~~~~~~~~~~~~~~~~~g~~V~~FSLEMs~~ql~~Rlls~~ 271 (484) T PRK09165 203 HPSDLIILAGRPSMGKTALATNIAFNAAKAYLREAQPDGSKKAVNGGVVGFFSLEMSAEQLATRILSEQ 271 (484) T ss_pred CCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHH T ss_conf 877379996079997789999999999987410222233211368984899947799999999999997 No 372 >COG0630 VirB11 Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis [Cell motility and secretion / Intracellular trafficking and secretion] Probab=94.80 E-value=0.044 Score=36.63 Aligned_cols=51 Identities=20% Similarity=0.304 Sum_probs=37.8 Q ss_pred HHHHH-HHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHH Q ss_conf 46899-999851489559942878770389999999999679828996010589 Q gi|254780229|r 407 SSQHS-AVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAA 459 (1775) Q Consensus 407 sSQ~~-av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aA 459 (1775) ..|.+ -..++-.++|.+|.|||||||+ |+.|.|..++=.-.|+. +.|...= T Consensus 130 ~~~~ayL~~~ie~~~siii~G~t~sGKT-t~lnall~~Ip~~~riv-tIEdt~E 181 (312) T COG0630 130 PEQAAYLWLAIEARKSIIICGGTASGKT-TLLNALLDFIPPEERIV-TIEDTPE 181 (312) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCHH-HHHHHHHHHCCCCCCEE-EECCCCE T ss_conf 8899999999976994999888888649-59999986378522189-9525554 No 373 >PRK04537 ATP-dependent RNA helicase RhlB; Provisional Probab=94.80 E-value=0.3 Score=27.95 Aligned_cols=27 Identities=22% Similarity=0.303 Sum_probs=25.3 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHH Q ss_conf 899999851489559942878770389 Q gi|254780229|r 409 QHSAVIDVINGKNLIIEGPPGTGKSQT 435 (1775) Q Consensus 409 Q~~av~~~~~g~~~vi~GPPGTGKSQT 435 (1775) |..||-.+..|+|++++-|+|||||-. T Consensus 36 Q~~aIP~iL~GkDvi~~A~TGSGKTLA 62 (574) T PRK04537 36 QALTLPVALPGGDVAGQAQTGTGKTLA 62 (574) T ss_pred HHHHHHHHHCCCCEEEECCCCCHHHHH T ss_conf 999999995799889984898889999 No 374 >cd03254 ABCC_Glucan_exporter_like Glucan exporter ATP-binding protein. In A. tumefaciens cyclic beta-1, 2-glucan must be transported into the periplasmic space to exert its action as a virluence factor. This subfamily belongs to the MRP-like family and is involved in drug, peptide, and lipid export. The MRP-like family, similar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains each composed of six transmembrane (TM) helices and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.80 E-value=0.021 Score=40.05 Aligned_cols=52 Identities=23% Similarity=0.416 Sum_probs=35.0 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHH--HHHHHHHHH Q ss_conf 8514895599428787703899999999996-7982899601058--999999864 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMA--AIEVVRHRL 467 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~a--ALdVV~~RL 467 (1775) .+..|+-++|.||.|+||| |+.++|+-.+. .--+|+|--.-.. ..+.+++++ T Consensus 25 ~i~~Ge~vaivG~sGsGKS-TLl~ll~gl~~p~~G~I~idg~~i~~~~~~~~r~~i 79 (229) T cd03254 25 SIKPGETVAIVGPTGAGKT-TLINLLMRFYDPQKGQILIDGIDIRDISRKSLRSMI 79 (229) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHCE T ss_conf 9879999999999998099-999999668667873899999995418999996328 No 375 >TIGR03117 cas_csf4 CRISPR-associated DEAD/DEAH-box helicase Csf4. Members of this family show up near CRISPR repeats in Acidithiobacillus ferrooxidans ATCC 23270, Azoarcus sp. EbN1, and Rhodoferax ferrireducens DSM 15236. In the latter two species, the CRISPR/cas locus is found on a plasmid. This family is one of several characteristic of a type of CRISPR-associated (cas) gene cluster we designate Aferr after A. ferrooxidans, where it is both chromosomal and the only type of cas gene cluster found. The gene is designated csf4 (CRISPR/cas Subtype as in A. ferrooxidans protein 1), as it lies farthest (fourth closest) from the repeats in the A. ferrooxidans genome. Probab=94.79 E-value=0.14 Score=31.53 Aligned_cols=44 Identities=23% Similarity=0.366 Sum_probs=29.0 Q ss_pred HHHHHC-CCCEEEECCCCCCCHHHHHHHHHHHH---H-CCCEEEEEECCHH Q ss_conf 998514-89559942878770389999999999---6-7982899601058 Q gi|254780229|r 413 VIDVIN-GKNLIIEGPPGTGKSQTITNIIAAAM---L-HGKKILFCAQKMA 458 (1775) Q Consensus 413 v~~~~~-g~~~vi~GPPGTGKSQTIaNlIa~~l---a-~Gk~VLfVaeK~a 458 (1775) +..|.+ ++-.|||.++||||| ++-||.... . .||+|.+-..+.+ T Consensus 9 ~~~a~~~~~v~lVEAGTGTGKS--LAYLvPAl~~a~~~~g~rVVISTaT~a 57 (636) T TIGR03117 9 CLTSLRQKRIGMLEASTGVGKT--LAMIMAALTMLKERPDQKIAIAVPTLA 57 (636) T ss_pred HHHHHHCCCEEEEECCCCCCHH--HHHHHHHHHHHHHCCCCCEEEECCCHH T ss_conf 9998603848999899971088--999999999998579998899798799 No 376 >PRK13891 conjugal transfer protein TrbE; Provisional Probab=94.78 E-value=0.34 Score=27.37 Aligned_cols=36 Identities=42% Similarity=0.559 Sum_probs=23.2 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHC--CCEEEEEECCH Q ss_conf 8955994287877038999999999967--98289960105 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLH--GKKILFCAQKM 457 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~--Gk~VLfVaeK~ 457 (1775) |. ..|.||+|+||| |..|+++.-.-+ |.+|. +--|. T Consensus 489 GH-TlIiGpTGaGKT-vll~fL~aQ~~rY~~~~vf-~FDK~ 526 (852) T PRK13891 489 GH-TFMFGPTGAGKS-THLGIIAAQLRRYAGMSIY-AFDKG 526 (852) T ss_pred CC-EEEECCCCCCHH-HHHHHHHHHHHHCCCCCEE-EECCC T ss_conf 64-078789999889-9999999997441898189-87898 No 377 >PRK00300 gmk guanylate kinase; Provisional Probab=94.77 E-value=0.026 Score=39.07 Aligned_cols=27 Identities=26% Similarity=0.537 Sum_probs=22.7 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHH Q ss_conf 5148955994287877038999999999 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAA 443 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~ 443 (1775) ...|+=+||-||+|+||| ||.+.+..- T Consensus 4 ~~~g~livisGPSG~GK~-tl~~~L~~~ 30 (208) T PRK00300 4 MRRGLLIVLSAPSGAGKS-TLVRALLER 30 (208) T ss_pred CCCCCEEEEECCCCCCHH-HHHHHHHHH T ss_conf 418838999999988999-999999972 No 378 >PRK08154 anaerobic benzoate catabolism transcriptional regulator; Reviewed Probab=94.76 E-value=0.057 Score=35.52 Aligned_cols=42 Identities=24% Similarity=0.394 Sum_probs=31.7 Q ss_pred CCCCHHHHHHHHHHHC-----------CCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 6773468999998514-----------895599428787703899999999996 Q gi|254780229|r 403 DDSDSSQHSAVIDVIN-----------GKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 403 ~~aDsSQ~~av~~~~~-----------g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) -.+++.|+.-|..+.. .+-+++.|.+|+||| ||..+.|..|. T Consensus 106 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~rIaLIGlmGaGKS-TvGr~LA~~Lg 158 (304) T PRK08154 106 EQATPAQLARARDALSGMLGTGGGAARRPRIALIGLRGAGKS-TLGRMLAARLG 158 (304) T ss_pred HCCCHHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 518999999999998763023766677784798899999888-99999999959 No 379 >pfam03029 ATP_bind_1 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesized ATP binding activity. Probab=94.76 E-value=0.032 Score=38.08 Aligned_cols=36 Identities=22% Similarity=0.373 Sum_probs=25.1 Q ss_pred EECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHH Q ss_conf 942878770389999999999679828996010589 Q gi|254780229|r 424 IEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAA 459 (1775) Q Consensus 424 i~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aA 459 (1775) |.||||+|||.-...+..++-..|++|-+|-=-.|+ T Consensus 1 ViGpaGSGKTT~~~~l~~~l~~~~r~~~vvNLDPA~ 36 (234) T pfam03029 1 VVGGAGSGKTTFVGALSEILPLRGRSVYVVNLDPAA 36 (234) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 989898988999999999999779975999789866 No 380 >COG0593 DnaA ATPase involved in DNA replication initiation [DNA replication, recombination, and repair] Probab=94.74 E-value=0.35 Score=27.29 Aligned_cols=64 Identities=22% Similarity=0.344 Sum_probs=35.7 Q ss_pred CCCCCCCHHHHHHHHHHHCC----CC-EEEECCCCCCCHHHHHHHHHH-HHHCCC--EEEEEECCHHHHHHHH Q ss_conf 23667734689999985148----95-599428787703899999999-996798--2899601058999999 Q gi|254780229|r 400 PLIDDSDSSQHSAVIDVING----KN-LIIEGPPGTGKSQTITNIIAA-AMLHGK--KILFCAQKMAAIEVVR 464 (1775) Q Consensus 400 ~~~~~aDsSQ~~av~~~~~g----~~-~vi~GPPGTGKSQTIaNlIa~-~la~Gk--~VLfVaeK~aALdVV~ 464 (1775) |.+-+...-=.+|+..+... .| +.|-||+|.|||| +.++|++ ++..|. +|++++--.-..+.|. T Consensus 89 Fv~g~~N~~A~aa~~~va~~~g~~~nplfi~G~~GlGKTH-Ll~Aign~~~~~~~~a~v~y~~se~f~~~~v~ 160 (408) T COG0593 89 FVVGPSNRLAYAAAKAVAENPGGAYNPLFIYGGVGLGKTH-LLQAIGNEALANGPNARVVYLTSEDFTNDFVK 160 (408) T ss_pred EEECCCHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCHHH-HHHHHHHHHHHHCCCCEEEECCHHHHHHHHHH T ss_conf 2047744999999999875668868957998799997899-99999999986299864885048998999999 No 381 >TIGR03608 L_ocin_972_ABC putative bacteriocin export ABC transporter, lactococcin 972 group. A gene pair with a fairly wide distribution consists of a polypeptide related to the lactococcin 972 (see TIGR01653) and multiple-membrane-spanning putative immunity protein (see TIGR01654). This model represents a small clade within the ABC transporters that regularly are found adjacent to these bacteriocin system gene pairs and are likely serve as export proteins. Probab=94.73 E-value=0.022 Score=39.76 Aligned_cols=40 Identities=33% Similarity=0.468 Sum_probs=31.1 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEEC Q ss_conf 8514895599428787703899999999996-7982899601 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQ 455 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVae 455 (1775) .+..|.-++|.||-|+||| |+.|+|+-..- ...+|.|--+ T Consensus 20 ~i~~Ge~~~i~GpSGsGKS-TLL~~i~gl~~p~sG~i~~~g~ 60 (206) T TIGR03608 20 TIEKGKMVAIVGESGSGKS-TLLNIIGLLEKPDSGQVYLNGQ 60 (206) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCEEEEECCE T ss_conf 9869989999879997099-9999997599989759999999 No 382 >PRK06904 replicative DNA helicase; Validated Probab=94.73 E-value=0.1 Score=32.87 Aligned_cols=55 Identities=22% Similarity=0.315 Sum_probs=42.2 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 5148955994287877038999999999-967982899601058999999864435 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ...|.=+||=|.||.|||.--.||+.++ +..|+.|+|.|--|.+-+++.|-|... T Consensus 218 l~~g~LiViAaRPsmGKTa~alnia~n~A~~~~~~V~~fSLEM~~~~l~~R~ls~~ 273 (472) T PRK06904 218 LQPSDLIIVAARPSMGKTTFAMNLCENAAMASEKPVLVFSLEMPAEQIMMRMLASL 273 (472) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHH T ss_conf 87575799973798756899999999999955995799778799999999999986 No 383 >PRK13695 putative NTPase; Provisional Probab=94.72 E-value=0.041 Score=37.01 Aligned_cols=29 Identities=38% Similarity=0.568 Sum_probs=20.8 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHH-HCCCEE Q ss_conf 559942878770389999999999-679828 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAM-LHGKKI 450 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~V 450 (1775) .+.|.||||.||| |+..=|++.| ..|.+| T Consensus 5 kI~iTG~PGvGKT-Tli~Kv~~~L~~~g~~v 34 (174) T PRK13695 5 RIGITGMPGVGKT-TLVLKIAELLAREGYKV 34 (174) T ss_pred EEEEECCCCCCHH-HHHHHHHHHHHHCCCEE T ss_conf 9998789998899-99999999986369617 No 384 >TIGR01978 sufC FeS assembly ATPase SufC; InterPro: IPR010230 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufC, which acts as an ATPase in the SUF system. SufC belongs to the ATP-binding cassette transporter family (IPR003439 from INTERPRO) but is no longer thought to be part of a transporter. The complex is reported as cytosolic or associated with the membrane.; GO: 0005524 ATP binding, 0006810 transport. Probab=94.72 E-value=0.018 Score=40.82 Aligned_cols=48 Identities=35% Similarity=0.524 Sum_probs=34.3 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHH---HHHCCCEEEEEECCHHHHHH Q ss_conf 98514895599428787703899999999---99679828996010589999 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAA---AMLHGKKILFCAQKMAAIEV 462 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~---~la~Gk~VLfVaeK~aALdV 462 (1775) ..+.+|.=.+|=||-|+||| |.++.|+. +-.-+-+|.|-=+=-.+|+| T Consensus 21 L~v~~GE~HAiMGPNGsGKS-TL~~~iaGhp~y~vt~G~I~f~G~Dll~l~~ 71 (248) T TIGR01978 21 LTVKKGEIHAIMGPNGSGKS-TLSKTIAGHPKYEVTSGTILFKGQDLLELEP 71 (248) T ss_pred CCCCCCEEEEEECCCCCCHH-HHHHHHHCCCCEEEEEEEEEECCEECCCCCH T ss_conf 62168517998688998478-8877761799337842089877652001896 No 385 >KOG1804 consensus Probab=94.71 E-value=0.14 Score=31.37 Aligned_cols=227 Identities=19% Similarity=0.196 Sum_probs=99.2 Q ss_pred CCCEEEECCHHHCCHHHHHHHHH---HCCEEEEEECCCCCCCCHHHHHCCCCCCCCCCHHHCCCCCHHHHHHH---HHC- Q ss_conf 41078860632369899889886---11828996077866862011000134444320110144000799999---755- Q gi|254780229|r 1361 KFDLVIMDESSQIKPEDALGVIA---RGKQIVVVGDPKQLPPTRFFDHDNDQEDYDEEVAAVSQTESILDALL---PLF- 1433 (1775) Q Consensus 1361 ~FD~VI~DEASQ~~~~~al~al~---rakqvvvvGD~kQlpPt~ff~~~~~~~~~~~~~~~~~~~eSild~~~---~~~- 1433 (1775) .|--.+.|||-|+.+..+++.|+ =++.+|..||.-|+.|.-+ +.... .. .. -++++-.. ..| T Consensus 265 ~~t~~~~~eaae~~~~~~l~P~~~~~~~~~~~L~~~~~ql~~~l~-s~~~~--~~-----~~---~~~~~~~~~~y~~~~ 333 (775) T KOG1804 265 FFTHILLDEAAQAMECELLMPLALPSSGTRIVLAGPHLQLTPFLN-SVARE--EQ-----AL---HLLLCRLPEPYIVFG 333 (775) T ss_pred CEEEEEHHHHHHCCCCEEECCCCCCCCCCEEEECCCCCCCCCCHH-HHHHH--HH-----HH---HHCCCCCCCCCCCCC T ss_conf 513432887875688313313334778853462365301342023-33335--66-----66---541566666532456 Q ss_pred C-CCEECCCCCCCCHHHHHHHHHHCCCCEEEECCCCCC---CCCCCCEEEEECCCCC--CCCC---CCHHHHHHHHHHHH Q ss_conf 8-752053055799789999985036980476468776---6666884899728972--1788---89899999999999 Q gi|254780229|r 1434 S-MRRLQWHYRSLNENLIACSNYYFYDNSLIVFPSPYT---SVDRYGVGFTHVKNGV--VVDQ---GNPEEARVIALAVK 1504 (1775) Q Consensus 1434 p-~~~L~~HyRs~~e~lI~fSN~~fY~~~L~~~psp~~---~~~~~~v~~~~v~~g~--~~~~---~N~~EA~~vv~~v~ 1504 (1775) | -.-.+.-||| |..++.|--++||...+-+.-+... .....++.|.+..+-- -.+. .|.+|...++..+. T Consensus 334 p~~~g~~~n~~~-a~~~v~~~~~~~~il~~~p~~a~~k~~~~rl~~p~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~e 412 (775) T KOG1804 334 PPGTGKTENYRE-AIAIVSFTSPHFYILVCAPSNASGKQPAHRLHYPLTFSTARGEDVRAKSSTAWYNNAEVSEVVEKVE 412 (775) T ss_pred CCCCCCCCCHHH-HHHHHHHCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 887677641687-7788875256777630353323346665122343000024566320011007765478899999999 Q ss_pred HHHHCCC-------CCEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCHHHCCCCCCCEEEEEEE Q ss_conf 9953185-------771899819989999999999999312976899976411478985854256808666788999966 Q gi|254780229|r 1505 DHALRYP-------EESLGVIAMNAKQRDLIESTINKLCRKDSEVEKAISKLRMHADPFFIKNLENVQGDERDRIFISFT 1577 (1775) Q Consensus 1505 ~~~~~~~-------~~siGVvt~~~~Q~~~I~~~l~~~~~~~~~~~~~~~~~~~~~e~~fvknle~vQGdERDvI~~S~~ 1577 (1775) ...+..+ .-|+|++|-...|...+...|.. +...+.||+ .+|+|--=-.++.++. T Consensus 413 ~~~~~~~~~i~i~t~~sag~~~~~g~~v~~f~hil~D-------------eAg~stEpe-----~lv~i~~~~~~~~vvL 474 (775) T KOG1804 413 ELRKVWPYRWGITTCTSAGCVTSYGFQVGHFRHILVD-------------EAGVSTEPE-----LLVPGKQFRQPFQVVL 474 (775) T ss_pred HHHHCCCEEEEEEECCCEEEEECCCCCCCCEEEEEEC-------------CCCCCCCCC-----CCCCCCCCCCEEEEEE T ss_conf 8751362378874033100331255421314544113-------------346645741-----0244023310247998 Q ss_pred EECCCCCCCCCCCCCCCCCCC--HHHHHHHHHHHHCCEEEEEEC Q ss_conf 507888876200114777656--246899999832142698870 Q gi|254780229|r 1578 YGPHEPGGRVFQRFGPINSDI--GWRRLNVLFTRSRRRIEVFST 1619 (1775) Q Consensus 1578 yg~~~~~g~~~~~fGpln~~g--g~rRlNVa~sRAr~~~~v~~s 1619 (1775) -|....-|.++.. .+-..- |..-||=+++||-....++.+ T Consensus 475 sgdh~Qlgpv~~s--~~A~~~gl~rsLler~l~r~~~~~~~~g~ 516 (775) T KOG1804 475 SGDHTQLGPVSKS--ARAEELGLDRSLLERALTRAQSLVAVVGD 516 (775) T ss_pred CCCCCCCCCCCCC--HHHHHHCCCHHHHHHHHHHHHHCCCCCCC T ss_conf 0475311655643--22566442288888887888625666788 No 386 >cd03246 ABCC_Protease_Secretion This family represents the ABC component of the protease secretion system PrtD, a 60-kDa integral membrane protein sharing 37% identity with HlyB, the ABC component of the alpha-hemolysin secretion pathway, in the C-terminal domain. They export degradative enzymes by using a type I protein secretion system and lack an N-terminal signal peptide, but contain a C-terminal secretion signal. The Type I secretion apparatus is made up of three components, an ABC transporter, a membrane fusion protein (MFP), and an outer membrane protein (OMP). For the HlyA transporter complex, HlyB (ABC transporter) and HlyD (MFP) reside in the inner membrane of E. coli. The OMP component is TolC, which is thought to interact with the MFP to form a continuous channel across the periplasm from the cytoplasm to the exterior. HlyB belongs to the family of ABC transporters, which are ubiquitous, ATP-dependent transmembrane pumps or channels. The spectrum of transport substra Probab=94.70 E-value=0.023 Score=39.66 Aligned_cols=38 Identities=29% Similarity=0.516 Sum_probs=30.6 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CEEEEE Q ss_conf 851489559942878770389999999999679-828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~VLfV 453 (1775) .+..|.-++|.||.|+||| |+.++|+-.+-.- .+|++- T Consensus 24 ~i~~G~~vaIvG~sGsGKS-TLl~ll~gl~~p~~G~i~i~ 62 (173) T cd03246 24 SIEPGESLAIIGPSGSGKS-TLARLILGLLRPTSGRVRLD 62 (173) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 9859999999999998099-99999966666799989999 No 387 >COG3854 SpoIIIAA ncharacterized protein conserved in bacteria [Function unknown] Probab=94.70 E-value=0.052 Score=35.94 Aligned_cols=40 Identities=35% Similarity=0.482 Sum_probs=32.7 Q ss_pred HHCCCC-EEEECCCCCCCHHHHHHHHHHHHHCC------CEEEEEECC Q ss_conf 514895-59942878770389999999999679------828996010 Q gi|254780229|r 416 VINGKN-LIIEGPPGTGKSQTITNIIAAAMLHG------KKILFCAQK 456 (1775) Q Consensus 416 ~~~g~~-~vi~GPPGTGKSQTIaNlIa~~la~G------k~VLfVaeK 456 (1775) .-+|.. .+|.||||+||| |..-=||-.++.| |+|-++-|. T Consensus 133 y~~g~lntLiigpP~~GKT-TlLRdiaR~~s~g~~~~l~kkv~IiDer 179 (308) T COG3854 133 YQNGWLNTLIIGPPQVGKT-TLLRDIARLLSDGINQFLPKKVGIIDER 179 (308) T ss_pred HHCCCEEEEEECCCCCCHH-HHHHHHHHHHHCCCCCCCCCEEEEEECC T ss_conf 8437224699659988707-7999999986315112677328997150 No 388 >cd03292 ABC_FtsE_transporter FtsE is a hydrophilic nucleotide-binding protein that binds FtsX to form a heterodimeric ATP-binding cassette (ABC)-type transporter that associates with the bacterial inner membrane. The FtsE/X transporter is thought to be involved in cell division and is important for assembly or stability of the septal ring. Probab=94.70 E-value=0.022 Score=39.80 Aligned_cols=38 Identities=29% Similarity=0.454 Sum_probs=30.0 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CEEEEE Q ss_conf 851489559942878770389999999999679-828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~VLfV 453 (1775) .+..|.-++|.||-|+||| |+.|+|+-....- .+|++- T Consensus 23 ~i~~Ge~v~i~GpSGsGKS-TLl~~i~gl~~p~sG~i~i~ 61 (214) T cd03292 23 SISAGEFVFLVGPSGAGKS-TLLKLIYKEELPTSGTIRVN 61 (214) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEEC T ss_conf 9859989999979995399-99999962989886499999 No 389 >pfam03266 DUF265 Protein of unknown function, DUF265. Probab=94.69 E-value=0.044 Score=36.68 Aligned_cols=32 Identities=34% Similarity=0.703 Sum_probs=20.9 Q ss_pred EEEECCCCCCCHHHHHHHHHHHH-HCCCEE-EEEE Q ss_conf 59942878770389999999999-679828-9960 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAM-LHGKKI-LFCA 454 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~V-LfVa 454 (1775) +.|.||||.||| |+..-|+..| ..|-+| =|.+ T Consensus 2 i~ITG~pGvGKT-Tli~kv~~~l~~~~~~v~GF~T 35 (168) T pfam03266 2 IFITGPPGVGKT-TLVKKVIELLKSEGVKVGGFYT 35 (168) T ss_pred EEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEEE T ss_conf 899789998899-9999999999867970748993 No 390 >PRK13644 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=94.68 E-value=0.027 Score=38.87 Aligned_cols=40 Identities=25% Similarity=0.365 Sum_probs=31.9 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEE Q ss_conf 985148955994287877038999999999967-98289960 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCA 454 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVa 454 (1775) ..+..|.-+.|.||.|+||| |++++|+-.+.. ..+|+|-- T Consensus 23 l~i~~Ge~vaiiG~nGsGKS-TL~~~l~Gll~P~~G~I~v~G 63 (274) T PRK13644 23 LVIKKGEYIGIIGKNGSGKS-TLALHLNGLLRPQKGKVLVSG 63 (274) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEECC T ss_conf 79848999999999998099-999999706858887299999 No 391 >PRK03846 adenylylsulfate kinase; Provisional Probab=94.67 E-value=0.054 Score=35.76 Aligned_cols=34 Identities=29% Similarity=0.474 Sum_probs=25.1 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEE Q ss_conf 4895599428787703899999999996-7982899 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILF 452 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLf 452 (1775) +|.-+-+.|.||+||| ||++.+...|- .|..|.+ T Consensus 23 kg~viWlTGLSGSGKT-TlA~~L~~~L~~~~~~~~~ 57 (198) T PRK03846 23 KGVVLWFTGLSGSGKS-TVAGALEEALHELGVHTYL 57 (198) T ss_pred CCEEEEEECCCCCCHH-HHHHHHHHHHHHCCCCEEE T ss_conf 9869998799999889-9999999999975997599 No 392 >PRK09580 sufC cysteine desulfurase ATPase component; Reviewed Probab=94.66 E-value=0.024 Score=39.46 Aligned_cols=45 Identities=29% Similarity=0.520 Sum_probs=32.9 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHH---HHCCCEEEEEECCHHHH Q ss_conf 85148955994287877038999999999---96798289960105899 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAA---MLHGKKILFCAQKMAAI 460 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~---la~Gk~VLfVaeK~aAL 460 (1775) .+..|.-++|.||.|+||| |+.++|+-. -..+-+|+|--+....+ T Consensus 23 ~i~~Gei~~iiG~nGaGKS-TLl~~i~G~~~~~~~~G~I~~~G~~i~~~ 70 (248) T PRK09580 23 EVRPGEVHAIMGPNGSGKS-TLSATLAGREDYEVTGGTVTFKGKDLLEL 70 (248) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCCCEEEECCEECCCC T ss_conf 9849979999999999999-99999837755687523599999985648 No 393 >cd03276 ABC_SMC6_euk Eukaryotic SMC6 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (for Probab=94.66 E-value=0.025 Score=39.14 Aligned_cols=28 Identities=36% Similarity=0.513 Sum_probs=22.3 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCE Q ss_conf 8955994287877038999999999967982 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKK 449 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~ 449 (1775) |-| +|.||.||||| +|..+|+-+|. ||. T Consensus 22 ~lN-~IiG~NGsGKS-sIl~AI~lgLG-gk~ 49 (198) T cd03276 22 RVN-FIVGNNGSGKS-AILTALTIGLG-GKA 49 (198) T ss_pred CEE-EEECCCCCCHH-HHHHHHHHHCC-CCC T ss_conf 828-99889999889-99999998638-881 No 394 >cd03222 ABC_RNaseL_inhibitor The ABC ATPase RNase L inhibitor (RLI) is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI's are not transport proteins, and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLI's have an N-terminal Fe-S domain and two nucleotide-binding domains, which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology. Probab=94.64 E-value=0.025 Score=39.26 Aligned_cols=37 Identities=35% Similarity=0.545 Sum_probs=29.0 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEE Q ss_conf 85148955994287877038999999999967-982899 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILF 452 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLf 452 (1775) .+..|.-+.|-||.|+||| |..++|+-.+-. +..|.+ T Consensus 21 ~v~~GEiv~ilGpNGaGKS-Tllk~i~G~l~p~~G~i~~ 58 (177) T cd03222 21 VVKEGEVIGIVGPNGTGKT-TAVKILAGQLIPNGDNDEW 58 (177) T ss_pred CCCCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCCEEE T ss_conf 5589989999899999999-9999996886788994666 No 395 >PRK13764 ATPase; Provisional Probab=94.64 E-value=0.035 Score=37.68 Aligned_cols=44 Identities=27% Similarity=0.440 Sum_probs=31.5 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHH Q ss_conf 489559942878770389999999999679828996010589999 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEV 462 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdV 462 (1775) ......|-||||.||| |.+-.||..++.-.+|-=+-|+..-|.| T Consensus 258 ~a~GilIaG~PGaGKs-TfaqalA~~~~~~g~iVKTmEsPRDl~v 301 (605) T PRK13764 258 RAEGILIAGAPGAGKS-TFAQALAEFYADMGKIVKTMESPRDLQV 301 (605) T ss_pred CCCEEEEECCCCCCHH-HHHHHHHHHHHHCCCEEEECCCCCCCCC T ss_conf 3664999779999778-9999999999847978983248623668 No 396 >COG0464 SpoVK ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones] Probab=94.63 E-value=0.026 Score=39.01 Aligned_cols=37 Identities=30% Similarity=0.515 Sum_probs=23.2 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHH Q ss_conf 148955994287877038999999999967982899601058 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMA 458 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~a 458 (1775) ...+.+.+.|||||||| +++-.||.- =..-|++-+.. T Consensus 274 ~~~~giLl~GpPGtGKT-~lAkava~~----~~~~fi~v~~~ 310 (494) T COG0464 274 RPPKGVLLYGPPGTGKT-LLAKAVALE----SRSRFISVKGS 310 (494) T ss_pred CCCCEEEEECCCCCCHH-HHHHHHHHH----CCCCEEEECCH T ss_conf 88836999889997589-999998754----49824884335 No 397 >PRK00625 shikimate kinase; Provisional Probab=94.63 E-value=0.031 Score=38.28 Aligned_cols=24 Identities=21% Similarity=0.432 Sum_probs=21.8 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 5599428787703899999999996 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) |+++.|+||+||| ||..++|..|. T Consensus 2 nI~LIG~mGsGKS-tiGk~LA~~l~ 25 (173) T PRK00625 2 QIFLCGLPTVGKT-SFGKALAKFLS 25 (173) T ss_pred EEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 2999899999889-99999999939 No 398 >PRK13540 cytochrome c biogenesis protein CcmA; Provisional Probab=94.61 E-value=0.027 Score=38.82 Aligned_cols=41 Identities=32% Similarity=0.480 Sum_probs=33.1 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECC Q ss_conf 8514895599428787703899999999996-79828996010 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQK 456 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK 456 (1775) .+..|.-+.|.||-|+||| |..++|+-.+. ...+|+|--+. T Consensus 23 ~i~~G~i~~l~G~NGaGKS-TLlkli~Gl~~p~~G~I~~~~~~ 64 (200) T PRK13540 23 HLPAGGLLHLKGSNGAGKT-TLLKLIAGLLNPEKGEILFERQS 64 (200) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEECCEE T ss_conf 9879979999889998799-99999977858898569999986 No 399 >COG0468 RecA RecA/RadA recombinase [DNA replication, recombination, and repair] Probab=94.61 E-value=0.044 Score=36.66 Aligned_cols=43 Identities=28% Similarity=0.387 Sum_probs=32.1 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHH Q ss_conf 89559942878770389999999999679828996010589999 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEV 462 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdV 462 (1775) |.=.=|-|||||||||-...+++.+...|.+|.|+-- --+|+. T Consensus 60 g~ItEiyG~~gsGKT~lal~~~~~aq~~g~~a~fIDt-E~~l~p 102 (279) T COG0468 60 GRITEIYGPESSGKTTLALQLVANAQKPGGKAAFIDT-EHALDP 102 (279) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEEEC-CCCCCH T ss_conf 3589984688765466899998886537980899958-999899 No 400 >TIGR03574 selen_PSTK L-seryl-tRNA(Sec) kinase, archaeal. Members of this protein are L-seryl-tRNA(Sec) kinase. This enzyme is part of a two-step pathway in Eukaryota and Archaea for performing selenocysteine biosynthesis by changing serine misacylated on selenocysteine-tRNA to selenocysteine. This enzyme performs the first step, phosphorylation of the OH group of the serine side chain. This family represents archaeal proteins with this activity. Probab=94.61 E-value=0.054 Score=35.76 Aligned_cols=33 Identities=24% Similarity=0.469 Sum_probs=26.0 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEEC Q ss_conf 599428787703899999999996-7982899601 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQ 455 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVae 455 (1775) +||.|-||+||| |.++-+...+. .|..|.+++. T Consensus 2 ivl~GlP~SGKS-t~a~~L~~~l~~~~~~~i~~~~ 35 (249) T TIGR03574 2 IILTGLPGVGKS-TFSKELSKKLSEKNIDNIILGT 35 (249) T ss_pred EEEECCCCCCHH-HHHHHHHHHHHHHCCCEEEECC T ss_conf 789678999899-9999999999982996599655 No 401 >cd03245 ABCC_bacteriocin_exporters ABC-type bacteriocin exporters. Many non-lantibiotic bacteriocins of lactic acid bacteria are produced as precursors which have N-terminal leader peptides that share similarities in amino acid sequence and contain a conserved processing site of two glycine residues in positions -1 and -2. A dedicated ATP-binding cassette (ABC) transporter is responsible for the proteolytic cleavage of the leader peptides and subsequent translocation of the bacteriocins across the cytoplasmic membrane. Probab=94.60 E-value=0.024 Score=39.40 Aligned_cols=46 Identities=22% Similarity=0.418 Sum_probs=33.8 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECCHHHHH Q ss_conf 85148955994287877038999999999967-982899601058999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQKMAAIE 461 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK~aALd 461 (1775) .+..|.-++|.||.|+||| |+.++|+-.+.. --+|.+--.-...++ T Consensus 26 ~i~~G~~v~ivG~sGsGKS-TLl~ll~gl~~p~~G~I~i~g~~~~~~~ 72 (220) T cd03245 26 TIRAGEKVAIIGRVGSGKS-TLLKLLAGLYKPTSGSVLLDGTDIRQLD 72 (220) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEECCEEHHHHC T ss_conf 9879999999999998599-9999996725478658999999957725 No 402 >pfam01583 APS_kinase Adenylylsulphate kinase. Enzyme that catalyses the phosphorylation of adenylylsulphate to 3'-phosphoadenylylsulfate. This domain contains an ATP binding P-loop motif. Probab=94.59 E-value=0.056 Score=35.60 Aligned_cols=30 Identities=30% Similarity=0.536 Sum_probs=23.2 Q ss_pred EEEECCCCCCCHHHHHHHHHHHH-HCCCEEEE Q ss_conf 59942878770389999999999-67982899 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAM-LHGKKILF 452 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLf 452 (1775) +=+.|.||+||| ||++.+..-| ..|..|.+ T Consensus 5 iW~TGLsGsGKT-TlA~~l~~~L~~~~~~~~~ 35 (157) T pfam01583 5 VWFTGLSGSGKS-TIANALERKLFAQGISVYV 35 (157) T ss_pred EEEECCCCCCHH-HHHHHHHHHHHHCCCCEEE T ss_conf 998898999999-9999999999975997799 No 403 >pfam06414 Zeta_toxin Zeta toxin. This family consists of several bacterial zeta toxin proteins. Zeta toxin is thought to be part of a postregulational killing system in bacteria. It relies on antitoxin/toxin systems that secure stable inheritance of low and medium copy number plasmids during cell division and kill cells that have lost the plasmid. Probab=94.59 E-value=0.041 Score=36.93 Aligned_cols=37 Identities=16% Similarity=0.354 Sum_probs=27.5 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 14895599428787703899999999996798289960 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) .+-+-++|-|+||+||| |+++.+...+..|.-|.+-+ T Consensus 10 ~~Pkai~laG~pGAGKS-~~~~~~~~~~~~~~~v~In~ 46 (191) T pfam06414 10 ERPVAVLLGGQPGAGKT-ELARALLEELGGGNVVRIDP 46 (191) T ss_pred CCCEEEEEECCCCCCHH-HHHHHHHHHCCCCCCEEECC T ss_conf 69879999579988889-99999987537899389713 No 404 >TIGR02533 type_II_gspE general secretory pathway protein E; InterPro: IPR013369 GspE, the E protein of the type II secretion system, is also referred to as the main terminal branch of the general secretion pathway. ; GO: 0005524 ATP binding, 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=94.58 E-value=0.039 Score=37.15 Aligned_cols=30 Identities=27% Similarity=0.587 Sum_probs=20.3 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCEEE Q ss_conf 599428787703899999999996798289 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLHGKKIL 451 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~Gk~VL 451 (1775) +.|+||+|||||.|.=-..+-+=..++.+| T Consensus 248 iLVTGPTGSGKtTTLYaaL~~LN~~~~NIl 277 (495) T TIGR02533 248 ILVTGPTGSGKTTTLYAALSRLNTPERNIL 277 (495) T ss_pred EEECCCCCCCHHHHHHHHHHHHCCCCCCEE T ss_conf 884177898525889999986358997156 No 405 >TIGR00958 3a01208 antigen peptide transporter 2; InterPro: IPR005293 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Proteins of this family are involved in the transport of antigens from the cytoplasm to a membrane-bound compartment for association with MHC class I molecules.; GO: 0005215 transporter activity, 0006810 transport. Probab=94.58 E-value=0.023 Score=39.50 Aligned_cols=25 Identities=24% Similarity=0.242 Sum_probs=14.1 Q ss_pred HHHHHHHCCEEEEEECCCCCCCCHH Q ss_conf 8898861182899607786686201 Q gi|254780229|r 1378 ALGVIARGKQIVVVGDPKQLPPTRF 1402 (1775) Q Consensus 1378 al~al~rakqvvvvGD~kQlpPt~f 1402 (1775) -++.|+-|-|.|+-|.=++=-=++| T Consensus 451 ~v~vL~~Gg~LV~~g~v~~g~LvsF 475 (770) T TIGR00958 451 LVLVLYYGGQLVLTGKVSSGLLVSF 475 (770) T ss_pred HHHHHHHHHHHHCCCEECCCCHHHH T ss_conf 9999972255310010046733567 No 406 >KOG0734 consensus Probab=94.56 E-value=0.025 Score=39.15 Aligned_cols=42 Identities=26% Similarity=0.443 Sum_probs=25.0 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHH Q ss_conf 48955994287877038999999999967982899601058999999 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVR 464 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~ 464 (1775) --|.+.+.|||||||| |.|-|+|---.|=|-.-.-.--|-|+ T Consensus 336 LPKGVLLvGPPGTGKT-----lLARAvAGEA~VPFF~~sGSEFdEm~ 377 (752) T KOG0734 336 LPKGVLLVGPPGTGKT-----LLARAVAGEAGVPFFYASGSEFDEMF 377 (752) T ss_pred CCCCEEEECCCCCCHH-----HHHHHHHCCCCCCEEECCCCCHHHHH T ss_conf 8885387689997556-----99998605568974741662044542 No 407 >TIGR01587 cas3_core CRISPR-associated helicase Cas3; InterPro: IPR006474 This entry represents a highly conserved core region of the Cas3 sequences. The proteins are found in association with CRISPR repeat elements in a broad range of bacteria and Archaea . Cas3 appears to be a helicase, containing a DEAD/DEAH box region and conserved C-terminal domain. Some but not all members have an N-terminal HD domain region (IPR006674 from INTERPRO), these sequences are not included within this group.. Probab=94.56 E-value=0.25 Score=28.77 Aligned_cols=72 Identities=31% Similarity=0.493 Sum_probs=36.3 Q ss_pred EEEECCCCCCCHHHHHHHHHHH---HHC-C--CEEEEEECCHHHHHHHHHHHHH----CCCCC--EEEECCCCCHHHHHH Q ss_conf 5994287877038999999999---967-9--8289960105899999986443----58441--365402661108999 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAA---MLH-G--KKILFCAQKMAAIEVVRHRLEK----AGLGE--FCLELHSHKVHKRAI 489 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~---la~-G--k~VLfVaeK~aALdVV~~RL~~----~gLg~--~cleLHs~ka~k~~v 489 (1775) |||+-|+|.||| |++=++|.- +-+ + -||-|+==.++-.+-.++|+.. + +|+ ..+-.||+.-+-... T Consensus 2 ~v~~APTG~GKT-e~aL~~A~~sah~~k~~~~~~~I~alP~r~~~na~~~r~~~sash~-Fg~P~~~~~~~ssrfnh~~~ 79 (424) T TIGR01587 2 LVIEAPTGYGKT-EAALLWALKSAHLIKKQKADKVIFALPTRVTINAMYRRLKESASHL-FGSPELVLLLHSSRFNHLAS 79 (424) T ss_pred EEEECCCCCCHH-HHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHHHHHHHHHH-CCCCCCCCCCCCCHHHHHHH T ss_conf 688617899878-9999999863666422444010122026888999999999867754-17854323345522678999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254780229|r 490 LDDLRK 495 (1775) Q Consensus 490 l~~L~~ 495 (1775) ...++. T Consensus 80 ~ik~~~ 85 (424) T TIGR01587 80 SIKLRK 85 (424) T ss_pred HHHHHH T ss_conf 999877 No 408 >PRK13546 teichoic acids export protein ATP-binding subunit; Provisional Probab=94.55 E-value=0.025 Score=39.15 Aligned_cols=38 Identities=29% Similarity=0.404 Sum_probs=30.0 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEE Q ss_conf 985148955994287877038999999999967-982899 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILF 452 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLf 452 (1775) ..+..|.-+.|-||.|+||| |+.|+|+-.+-. ..+|.+ T Consensus 45 f~i~~GEivgllG~NGaGKS-TLlk~I~Gl~~P~~G~I~~ 83 (264) T PRK13546 45 LKAYEGDVIGLVGINGSGKS-TLSNIIGGSLSPTVGKVDR 83 (264) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEE T ss_conf 88859989999989986199-9999996798888747999 No 409 >pfam02399 Herpes_ori_bp Origin of replication binding protein. This Pfam family represents the herpesvirus origin of replication binding protein, probably involved in DNA replication. Probab=94.54 E-value=0.22 Score=29.46 Aligned_cols=22 Identities=23% Similarity=0.057 Sum_probs=15.6 Q ss_pred HHHHCCCCHHHHHHHHHHHHHH Q ss_conf 8854025789999999998851 Q gi|254780229|r 533 IWKNTELSIHQILMGAARYRHN 554 (1775) Q Consensus 533 ~~~~~~~s~~~~~~~~~~~~~~ 554 (1775) .+++--.|+||.+|....+..+ T Consensus 358 ~~GPDMvSVYQSlGRVR~L~~n 379 (829) T pfam02399 358 KGGPDMVSVYQSLGRVRSLRRN 379 (829) T ss_pred CCCCCEEHHHHHHHHHHHHHHC T ss_conf 6899720135540057766307 No 410 >pfam05970 DUF889 PIF1 helicase. The PIF1 helicase inhibits telomerase activity and is cell cycle regulated. Probab=94.54 E-value=0.058 Score=35.43 Aligned_cols=35 Identities=31% Similarity=0.262 Sum_probs=22.3 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHH Q ss_conf 28787703899999999996798289960105899 Q gi|254780229|r 426 GPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAI 460 (1775) Q Consensus 426 GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aAL 460 (1775) ||=|||||..|--+|..+-..||.|+++|-+--|. T Consensus 1 G~AGTGKS~ll~~i~~~l~~~~~~v~vtA~TGiAA 35 (418) T pfam05970 1 GYGGTGKTFLWNALSARIRSRGKIVLNVASSGIAA 35 (418) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCEEEEECCHHHHH T ss_conf 97988799999999999976898899989689998 No 411 >cd03278 ABC_SMC_barmotin Barmotin is a tight junction-associated protein expressed in rat epithelial cells which is thought to have an important regulatory role in tight junction barrier function. Barmotin belongs to the SMC protein family. SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, w Probab=94.54 E-value=0.036 Score=37.61 Aligned_cols=29 Identities=34% Similarity=0.421 Sum_probs=23.2 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 8514895599428787703899999999996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ..-.| =++|.||-|+||| |+.++|+-.+. T Consensus 19 ~ip~G-itaIvGpsGsGKS-TLl~~i~~~lg 47 (197) T cd03278 19 PFPPG-LTAIVGPNGSGKS-NIIDAIRWVLG 47 (197) T ss_pred ECCCC-EEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 73898-2899999999889-99999998747 No 412 >PRK11545 gntK gluconate kinase 1; Provisional Probab=94.53 E-value=0.032 Score=38.07 Aligned_cols=25 Identities=24% Similarity=0.492 Sum_probs=22.0 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 95599428787703899999999996 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +=+||-||-|+||| ||++++|..|. T Consensus 9 ~iiVVMGVsGsGKS-Tig~~LA~~l~ 33 (177) T PRK11545 9 HIYVLMGVSGSGKS-AVASAVAHQLH 33 (177) T ss_pred EEEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 59999847989999-99999999819 No 413 >COG1618 Predicted nucleotide kinase [Nucleotide transport and metabolism] Probab=94.52 E-value=0.05 Score=36.05 Aligned_cols=27 Identities=33% Similarity=0.568 Sum_probs=21.6 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 8955994287877038999999999967 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) --.+.|.||||.||| |++-=|+..|.. T Consensus 5 ~mki~ITG~PGvGKt-Tl~~ki~e~L~~ 31 (179) T COG1618 5 AMKIFITGRPGVGKT-TLVLKIAEKLRE 31 (179) T ss_pred CEEEEEECCCCCCHH-HHHHHHHHHHHH T ss_conf 459998679984589-999999999985 No 414 >PRK08082 consensus Probab=94.52 E-value=0.13 Score=31.77 Aligned_cols=55 Identities=18% Similarity=0.228 Sum_probs=41.0 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 5148955994287877038999999999-967982899601058999999864435 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ...|.=+||-|-||-|||---.||+.++ +-.|++|+|.|--|.+-+++.|-|... T Consensus 200 ~~~g~LiviaaRPsmGKTa~alnia~~~a~~~~~~V~~fSlEM~~~~l~~R~la~~ 255 (453) T PRK08082 200 FQRNDLIIVAARPSVGKTAFALNIAQNVATKTDENVAIFSLEMGADQLVMRMLCAE 255 (453) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHCC T ss_conf 77585799986788757899999999999855994899731389899999999715 No 415 >cd01136 ATPase_flagellum-secretory_path_III Flagellum-specific ATPase/type III secretory pathway virulence-related protein. This group of ATPases are responsible for the export of flagellum and virulence-related proteins. The bacterial flagellar motor is similar to the F0F1-ATPase, in that they both are proton driven rotary molecular devices. However, the main function of the bacterial flagellar motor is to rotate the flagellar filament for cell motility. Intracellular pathogens such as Salmonella and Chlamydia also have proteins which are similar to the flagellar-specific ATPase, but function in the secretion of virulence-related proteins via the type III secretory pathway. Probab=94.50 E-value=0.24 Score=29.00 Aligned_cols=69 Identities=16% Similarity=0.152 Sum_probs=42.0 Q ss_pred HHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEE-EEECCHHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 999998514895599428787703899999999996798289-9601058999999864435844136540 Q gi|254780229|r 410 HSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKIL-FCAQKMAAIEVVRHRLEKAGLGEFCLEL 479 (1775) Q Consensus 410 ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VL-fVaeK~aALdVV~~RL~~~gLg~~cleL 479 (1775) .-++.-...|+-..|-||+||||| ||.++|+........|. .+-|+..-..-+.+.+...+.-+.|.-+ T Consensus 60 ID~l~pigrGQR~~Ifg~~g~GKt-~Ll~~i~~~~~~~v~V~~~IGer~rev~e~~~~~~~~~~l~~tvvv 129 (326) T cd01136 60 IDGLLTVGKGQRLGIFAGSGVGKS-TLLGMIARGTTADVNVIALIGERGREVREFIEKDLGEEGLKRSVVV 129 (326) T ss_pred HCCCCCCCCCCEEEEECCCCCCCC-HHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 200266247987874369999846-6787686414677599996053388999999986245653222899 No 416 >PRK05636 replicative DNA helicase; Provisional Probab=94.49 E-value=0.13 Score=31.66 Aligned_cols=56 Identities=23% Similarity=0.402 Sum_probs=45.7 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEECCHHHHHHHHHHHHHCC Q ss_conf 5148955994287877038999999999-9679828996010589999998644358 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQKMAAIEVVRHRLEKAG 471 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVaeK~aALdVV~~RL~~~g 471 (1775) ...|.=+||-|-||-|||---.||..++ +..|++|+|.|--|.+-+++.+-|...+ T Consensus 264 l~~G~LiIiAARPsmGKTalAlnia~n~A~~~g~~v~~fSLEMs~~ql~~Rlla~~s 320 (507) T PRK05636 264 LRGGQMIIVAARPGVGKSTIALDFMRSASIKNNKASVIFSLEMSKSEIVMRLLSAEA 320 (507) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHC T ss_conf 883567999737878668999999999998769937997156998999999999847 No 417 >cd03301 ABC_MalK_N The N-terminal ATPase domain of the maltose transporter, MalK. ATP binding cassette (ABC) proteins function from bacteria to human, mediating the translocation of substances into and out of cells or organelles. ABC transporters contain two transmembrane-spanning domains (TMDs) or subunits and two nucleotide binding domains (NBDs) or subunits that couple transport to the hydrolysis of ATP. In the maltose transport system, the periplasmic maltose binding protein (MBP) stimulates the ATPase activity of the membrane-associated transporter, which consists of two transmembrane subunits, MalF and MalG, and two copies of the ATP binding subunit, MalK, and becomes tightly bound to the transporter in the catalytic transition state, ensuring that maltose is passed to the transporter as ATP is hydrolyzed. Probab=94.49 E-value=0.027 Score=38.81 Aligned_cols=40 Identities=28% Similarity=0.508 Sum_probs=30.6 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CEEEEEE Q ss_conf 9851489559942878770389999999999679-8289960 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KKILFCA 454 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~VLfVa 454 (1775) ..+..|.-++|-||-|+||| |+.++|+-..-.- ..|+|-- T Consensus 21 l~v~~Ge~~~i~GpSG~GKS-TlLr~iaGl~~p~~G~I~~~g 61 (213) T cd03301 21 LDIADGEFVVLLGPSGCGKT-TTLRMIAGLEEPTSGRIYIGG 61 (213) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEECC T ss_conf 79869989999999988099-999999769999863999999 No 418 >CHL00131 ycf16 sulfate ABC transporter protein; Validated Probab=94.48 E-value=0.025 Score=39.18 Aligned_cols=47 Identities=32% Similarity=0.540 Sum_probs=33.6 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHH-HH--CCCEEEEEECCHHHHH Q ss_conf 985148955994287877038999999999-96--7982899601058999 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAA-ML--HGKKILFCAQKMAAIE 461 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~-la--~Gk~VLfVaeK~aALd 461 (1775) ..+..|.-+.|.||.|+||| |+.|+|+-. ++ ...+|+|--+....+. T Consensus 27 l~i~~Gei~aiiG~nGsGKS-TL~~~i~G~~~~~~~~G~I~~~G~~i~~~~ 76 (252) T CHL00131 27 LSINAGEIHAIMGPNGSGKS-TLSKVIAGHPAYTVLEGDILFKGESILELE 76 (252) T ss_pred EEECCCCEEEEECCCCCCHH-HHHHHHCCCCCCCCCCCEEEECCEECCCCC T ss_conf 78879989999999999999-999997278766766425998772768599 No 419 >PRK13976 thymidylate kinase; Provisional Probab=94.48 E-value=0.059 Score=35.31 Aligned_cols=33 Identities=21% Similarity=0.485 Sum_probs=28.2 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHC---CCEEEEEEC Q ss_conf 5994287877038999999999967---982899601 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAMLH---GKKILFCAQ 455 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la~---Gk~VLfVae 455 (1775) .|++|+=|+||| |.+++++..|.. +++|.++-| T Consensus 3 IvfEGiDGsGKs-Tq~~~L~~~L~~~~~~~~~~~trE 38 (202) T PRK13976 3 ITFEGIDGSGKT-TQSRLLAEYLSQIYGENNVVLTRE 38 (202) T ss_pred EEEECCCCCCHH-HHHHHHHHHHHHHCCCCCEEEECC T ss_conf 999899999899-999999999986069986898379 No 420 >PRK08760 replicative DNA helicase; Provisional Probab=94.48 E-value=0.13 Score=31.68 Aligned_cols=55 Identities=18% Similarity=0.239 Sum_probs=43.8 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 5148955994287877038999999999-967982899601058999999864435 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ...|.=+||=|.||.|||..-.||..++ +-.|+.|+|.|--|.+-+++.+-|... T Consensus 226 l~~G~LiViaaRPsmGKTalalnia~~~A~~~~~~V~~fSLEMs~~ql~~Rlls~~ 281 (476) T PRK08760 226 LQPTDLIILAARPAMGKTTFALNIAEYAAIKSKKGVAVFSMEMSASQLAMRLISSN 281 (476) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHH T ss_conf 98777799987788747899999999999837997899703699999999999983 No 421 >cd03235 ABC_Metallic_Cations ABC component of the metal-type transporters. This family includes transporters involved in the uptake of various metallic cations such as iron, manganese, and zinc. The ATPases of this group of transporters are very similar to members of iron-siderophore uptake family suggesting that they share a common ancestor. The best characterized metal-type ABC transporters are the YfeABCD system of Y. pestis, the SitABCD system of Salmonella enterica serovar Typhimurium, and the SitABCD transporter of Shigella flexneri. Moreover other uncharacterized homologs of these metal-type transporters are mainly found in pathogens like Haemophilus or enteroinvasive E. coli isolates. Probab=94.47 E-value=0.029 Score=38.57 Aligned_cols=38 Identities=34% Similarity=0.462 Sum_probs=31.0 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEE Q ss_conf 8514895599428787703899999999996-79828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfV 453 (1775) .+..|.-+.|-||.|+||| |+.++|+-.+- ..-+|.|- T Consensus 21 ~i~~Ge~~~liGpNGaGKS-Tllk~i~Gl~~p~~G~i~i~ 59 (213) T cd03235 21 EVKPGEFLAIVGPNGAGKS-TLLKAILGLLKPTSGSIRVF 59 (213) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 9859989999999998699-99999976878897589999 No 422 >cd03213 ABCG_EPDR ABCG transporters are involved in eye pigment (EP) precursor transport, regulation of lipid-trafficking mechanisms, and pleiotropic drug resistance (DR). DR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. Compared to other members of the ABC transporter subfamilies, the ABCG transporter family is composed of proteins that have an ATP-binding cassette domain at the N-terminus and a TM (transmembrane) domain at the C-terminus. Probab=94.47 E-value=0.025 Score=39.30 Aligned_cols=39 Identities=31% Similarity=0.438 Sum_probs=29.8 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC---CCEEEEEE Q ss_conf 85148955994287877038999999999967---98289960 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH---GKKILFCA 454 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~---Gk~VLfVa 454 (1775) .+..|.-++|-||+|+||| |+.++|+-.+.. .-+|++-- T Consensus 31 ~v~~Gei~~ilGpnGaGKS-TLl~~l~Gl~~~~~~~G~i~~~g 72 (194) T cd03213 31 KAKPGELTAIMGPSGAGKS-TLLNALAGRRTGLGVSGEVLING 72 (194) T ss_pred EEECCEEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEEEECC T ss_conf 9908819999989995199-99999857777899628999999 No 423 >pfam05707 Zot Zonular occludens toxin (Zot). This family consists of bacterial and viral proteins which are very similar to the Zonular occludens toxin (Zot). Zot is elaborated by bacteriophages present in toxigenic strains of Vibrio cholerae. Zot is a single polypeptide chain of 44.8 kDa, with the ability to reversibly alter intestinal epithelial tight junctions, allowing the passage of macromolecules through mucosal barriers Probab=94.47 E-value=0.055 Score=35.67 Aligned_cols=10 Identities=30% Similarity=0.640 Sum_probs=5.1 Q ss_pred EEEECCHHHC Q ss_conf 7886063236 Q gi|254780229|r 1364 LVIMDESSQI 1373 (1775) Q Consensus 1364 ~VI~DEASQ~ 1373 (1775) |+|||||-.. T Consensus 73 liViDE~~~~ 82 (183) T pfam05707 73 LLVIDEAQTW 82 (183) T ss_pred EEEEECCHHH T ss_conf 9999897655 No 424 >COG1102 Cmk Cytidylate kinase [Nucleotide transport and metabolism] Probab=94.45 E-value=0.032 Score=38.06 Aligned_cols=23 Identities=35% Similarity=0.642 Sum_probs=20.2 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 599428787703899999999996 Q gi|254780229|r 422 LIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 422 ~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ..|-|||||||| |++-+||.-+. T Consensus 3 ItIsG~pGsG~T-Tva~~lAe~~g 25 (179) T COG1102 3 ITISGLPGSGKT-TVARELAEHLG 25 (179) T ss_pred EEECCCCCCCCH-HHHHHHHHHHC T ss_conf 996179999702-79999999829 No 425 >PRK06749 replicative DNA helicase; Provisional Probab=94.45 E-value=0.14 Score=31.39 Aligned_cols=54 Identities=15% Similarity=0.244 Sum_probs=39.9 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHH Q ss_conf 514895599428787703899999999996798289960105899999986443 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEK 469 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~ 469 (1775) ...|.=+||-|-||-|||---.|+..++.-.|++|+|.|--|.+-+++.|-|.. T Consensus 183 l~~g~LiviaaRPsmGKTa~alnia~~~a~~g~~v~~fSlEMs~~~l~~R~ls~ 236 (428) T PRK06749 183 LQEGDFVVLGARPSMGKTAFALNVGLHAAKSGAAVGLFSLEMSSKQLLKRMASC 236 (428) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHH T ss_conf 998868999627989768999999999996499279983789999999999997 No 426 >PRK05506 bifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein; Provisional Probab=94.44 E-value=0.05 Score=36.05 Aligned_cols=36 Identities=25% Similarity=0.421 Sum_probs=26.5 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEE Q ss_conf 85148955994287877038999999999967-98289 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKIL 451 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VL 451 (1775) .--+|.-+.+.|+||+||| |||+.+...|.. |..+- T Consensus 439 ~g~~~~~iw~tGlsgsGKs-tiA~~le~~L~~~g~~~~ 475 (613) T PRK05506 439 KGQKPATVWFTGLSGSGKS-TIANLVERRLHALGRHTY 475 (613) T ss_pred HCCCCEEEEEECCCCCCHH-HHHHHHHHHHHHCCCCEE T ss_conf 4899769999778989747-999999999997799879 No 427 >COG1066 Sms Predicted ATP-dependent serine protease [Posttranslational modification, protein turnover, chaperones] Probab=94.44 E-value=0.063 Score=35.05 Aligned_cols=70 Identities=27% Similarity=0.427 Sum_probs=38.8 Q ss_pred HHHHHHHHHHCC-----CCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 689999985148-----955994287877038999999999967982899601058999999864435844136540 Q gi|254780229|r 408 SQHSAVIDVING-----KNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLEL 479 (1775) Q Consensus 408 SQ~~av~~~~~g-----~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleL 479 (1775) +.+.-...+.-| .=+.|-|-||-||| |+.==|++.+|..++||+||---..-. ++-|-+..|+..--+.| T Consensus 77 tg~~EldRVLGGG~V~Gs~iLIgGdPGIGKS-TLLLQva~~lA~~~~vLYVsGEES~~Q-iklRA~RL~~~~~~l~l 151 (456) T COG1066 77 TGIEELDRVLGGGLVPGSVILIGGDPGIGKS-TLLLQVAARLAKRGKVLYVSGEESLQQ-IKLRADRLGLPTNNLYL 151 (456) T ss_pred CCHHHHHHHHCCCCCCCCEEEECCCCCCCHH-HHHHHHHHHHHHCCCEEEEECCCCHHH-HHHHHHHHCCCCCCEEE T ss_conf 7818887653677236617997368987798-999999999870595799967767899-99999982899645577 No 428 >cd01128 rho_factor Transcription termination factor rho is a bacterial ATP-dependent RNA/DNA helicase. It is a homohexamer. Each monomer consists of an N-terminal domain of the OB fold, which is responsible for binding to cysteine rich nucleotides. This alignment is of the C-terminal ATP binding domain. Probab=94.43 E-value=0.13 Score=31.68 Aligned_cols=52 Identities=21% Similarity=0.383 Sum_probs=34.8 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CE-E---EEEECCHHHHHHHHHHH Q ss_conf 851489559942878770389999999999679-82-8---99601058999999864 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KK-I---LFCAQKMAAIEVVRHRL 467 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~-V---LfVaeK~aALdVV~~RL 467 (1775) -..+|+-..|-||+||||| |+..+|++.+... +. + ..+-|+.....-..+.+ T Consensus 12 pigkGQR~gI~g~~gvGKT-~Ll~~i~~~~~~~~~~~~~v~~lIGER~rEv~e~~~~~ 68 (249) T cd01128 12 PIGKGQRGLIVAPPKAGKT-TLLQSIANAITKNHPEVYLIVLLIDERPEEVTDMQRSV 68 (249) T ss_pred CCCCCCEEEEECCCCCCHH-HHHHHHHHHHHHHCCCEEEEEEEEECCHHHHHHHHHHH T ss_conf 6167867788789998899-99999999999858984999999716579999999971 No 429 >PRK10771 thiQ thiamine transporter ATP-binding subunit; Provisional Probab=94.43 E-value=0.028 Score=38.64 Aligned_cols=39 Identities=36% Similarity=0.594 Sum_probs=31.9 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CEEEEE Q ss_conf 9851489559942878770389999999999679-828996 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KKILFC 453 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~VLfV 453 (1775) ..+..|.-++|-||-|+||| |+.|+|+-.+..- .+|.|- T Consensus 20 l~i~~GE~v~iiG~nGaGKS-TLl~~i~Gll~p~sG~I~i~ 59 (233) T PRK10771 20 LTVERGEQVAILGPSGAGKS-TLLNLIAGFLTPASGSLLIN 59 (233) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEEC T ss_conf 89889989999999998199-99999965999985599999 No 430 >cd03369 ABCC_NFT1 Domain 2 of NFT1 (New full-length MRP-type transporter 1). NFT1 belongs to the MRP (mulrtidrug resisitance-associated protein) family of ABC transporters. Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions such as glutathione, glucuronate, and sulfate. Probab=94.42 E-value=0.033 Score=37.97 Aligned_cols=44 Identities=25% Similarity=0.280 Sum_probs=34.3 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECCHHH Q ss_conf 85148955994287877038999999999967-9828996010589 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQKMAA 459 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK~aA 459 (1775) .+..|.-+.|.||.|+||| |+.++|+-.+-. .-+|++--....- T Consensus 30 ~i~~Ge~v~ivG~sGsGKS-TLl~ll~g~~~p~~G~I~idg~di~~ 74 (207) T cd03369 30 KVKAGEKIGIVGRTGAGKS-TLILALFRFLEAEEGKIEIDGIDIST 74 (207) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHHHCCCCCCEEEECCEECCC T ss_conf 9869999999999998799-99999998728888789999999540 No 431 >pfam02223 Thymidylate_kin Thymidylate kinase. Probab=94.41 E-value=0.045 Score=36.50 Aligned_cols=32 Identities=28% Similarity=0.544 Sum_probs=26.9 Q ss_pred EECCCCCCCHHHHHHHHHHHH-HCCCEEEEEECC Q ss_conf 942878770389999999999-679828996010 Q gi|254780229|r 424 IEGPPGTGKSQTITNIIAAAM-LHGKKILFCAQK 456 (1775) Q Consensus 424 i~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVaeK 456 (1775) |+|+-|+||| |.+++++..| +.|..|.++-|- T Consensus 1 ~EGiDGsGKs-Tq~~~L~~~L~~~g~~v~~~~ep 33 (186) T pfam02223 1 IEGLDGAGKT-TQAELLKERLKEQGIKVVLTREP 33 (186) T ss_pred CCCCCCCCHH-HHHHHHHHHHHHCCCEEEEEECC T ss_conf 9799998999-99999999999879908997499 No 432 >PRK13635 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=94.41 E-value=0.03 Score=38.37 Aligned_cols=39 Identities=28% Similarity=0.466 Sum_probs=31.2 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE Q ss_conf 985148955994287877038999999999967-9828996 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFC 453 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfV 453 (1775) ..+..|.-++|.||.|+||| |++++|+-.+.. ..+|.|- T Consensus 28 l~i~~GE~vaivG~nGsGKS-TL~k~l~Gl~~p~~G~I~i~ 67 (279) T PRK13635 28 FSVYEGEWVAIVGHNGSGKS-TLAKLLNGLLLPEAGTITVG 67 (279) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 68879989999999996599-99999972888889649999 No 433 >KOG2859 consensus Probab=94.40 E-value=0.074 Score=34.28 Aligned_cols=37 Identities=27% Similarity=0.506 Sum_probs=30.6 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHHC------CCEEEEEE Q ss_conf 48955994287877038999999999967------98289960 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAMLH------GKKILFCA 454 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la~------Gk~VLfVa 454 (1775) .|.=+-|.||-|||||+..-.+.|+|+-. |+.|||.- T Consensus 37 aG~~vEi~Gp~~sgKt~vL~ql~a~CilPk~~GGl~~~VLfid 79 (293) T KOG2859 37 AGTLVEISGPGNSGKTLVLQQLVAHCILPKKFGGLQWSVLFID 79 (293) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHEECCCCCCCCEEEEEEEE T ss_conf 6867998689986188999999888354534488506999996 No 434 >cd03252 ABCC_Hemolysin The ABC-transporter hemolysin B is a central component of the secretion machinery that translocates the toxin, hemolysin A, in a Sec-independent fashion across both membranes of E. coli. The hemolysin A (HlyA) transport machinery is composed of the ATP-binding cassette (ABC) transporter HlyB located in the inner membrane, hemolysin D (HlyD), also anchored in the inner membrane, and TolC, which resides in the outer membrane. HlyD apparently forms a continuous channel that bridges the entire periplasm, interacting with TolC and HlyB. This arrangement prevents the appearance of periplasmic intermediates of HlyA during substrate transport. Little is known about the molecular details of HlyA transport, but it is evident that ATP-hydrolysis by the ABC-transporter HlyB is a necessary source of energy. Probab=94.39 E-value=0.029 Score=38.62 Aligned_cols=38 Identities=29% Similarity=0.451 Sum_probs=28.6 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEE Q ss_conf 8514895599428787703899999999996-79828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfV 453 (1775) .+..|+-+.|.||.|+||| |+.++|.-.+- ...+|++- T Consensus 24 ~i~~G~~vaivG~sGsGKS-Tll~ll~gl~~p~~G~I~i~ 62 (237) T cd03252 24 RIKPGEVVGIVGRSGSGKS-TLTKLIQRFYVPENGRVLVD 62 (237) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 9879999999999998599-99999967765798789999 No 435 >PRK07004 replicative DNA helicase; Provisional Probab=94.39 E-value=0.14 Score=31.55 Aligned_cols=54 Identities=20% Similarity=0.282 Sum_probs=43.1 Q ss_pred HCCCCEEEECCCCCCCHHHHHHHHHHH-HHCCCEEEEEECCHHHHHHHHHHHHHC Q ss_conf 148955994287877038999999999-967982899601058999999864435 Q gi|254780229|r 417 INGKNLIIEGPPGTGKSQTITNIIAAA-MLHGKKILFCAQKMAAIEVVRHRLEKA 470 (1775) Q Consensus 417 ~~g~~~vi~GPPGTGKSQTIaNlIa~~-la~Gk~VLfVaeK~aALdVV~~RL~~~ 470 (1775) ..|.=.||=|-||-|||-.-.||+.++ +-.|++|+|.|--|.+-+++.|-|... T Consensus 211 ~~gdLiIIAARPsmGKTafAlniA~n~A~~~g~~V~~FSLEMs~eql~~Rlls~~ 265 (460) T PRK07004 211 HGGELIIVAGRPSMGKTAFSMNIGEYVAVEYGLPVAVFSMEMPGTQLAMRMLGSV 265 (460) T ss_pred CCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHC T ss_conf 8775799973687642699999999998725886699847799999999999860 No 436 >KOG0354 consensus Probab=94.38 E-value=0.17 Score=30.47 Aligned_cols=71 Identities=23% Similarity=0.224 Sum_probs=46.7 Q ss_pred CCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC---CEEEEEECCHHHHHHHHHHHHHCCCC Q ss_conf 36677346899999851489559942878770389999999999679---82899601058999999864435844 Q gi|254780229|r 401 LIDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG---KKILFCAQKMAAIEVVRHRLEKAGLG 473 (1775) Q Consensus 401 ~~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G---k~VLfVaeK~aALdVV~~RL~~~gLg 473 (1775) ..+.-=+=|...+.-|. |+|.+|.-|+|.||| -||-.|+---.+. -+|.|.+-+.--+.--..+..-.|.+ T Consensus 59 ~~~~lR~YQ~eivq~AL-gkNtii~lPTG~GKT-fIAa~Vm~nh~rw~p~~KiVF~aP~~pLv~QQ~a~~~~~~~~ 132 (746) T KOG0354 59 TNLELRNYQEELVQPAL-GKNTIIALPTGSGKT-FIAAVIMKNHFEWRPKGKVVFLAPTRPLVNQQIACFSIYLIP 132 (746) T ss_pred CCCCCCHHHHHHHHHHH-CCCEEEEEECCCCCC-CHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHCCCC T ss_conf 76656078999867862-687699953599861-047999999972377643899607711788889887620676 No 437 >COG0529 CysC Adenylylsulfate kinase and related kinases [Inorganic ion transport and metabolism] Probab=94.38 E-value=0.088 Score=33.50 Aligned_cols=48 Identities=29% Similarity=0.576 Sum_probs=30.8 Q ss_pred HHHCCC-CEE--EECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHHHHHHHHHH Q ss_conf 851489-559--9428787703899999999996-79828996010589999998644 Q gi|254780229|r 415 DVINGK-NLI--IEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAIEVVRHRLE 468 (1775) Q Consensus 415 ~~~~g~-~~v--i~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aALdVV~~RL~ 468 (1775) ..+.|+ .+| +.|++|+||| ||+|+....|. .|+.|-+.- =|+|+.-|. T Consensus 16 ~~~~~~~~~viW~TGLSGsGKS-TiA~ale~~L~~~G~~~y~LD-----GDnvR~gL~ 67 (197) T COG0529 16 EALKGQKGAVIWFTGLSGSGKS-TIANALEEKLFAKGYHVYLLD-----GDNVRHGLN 67 (197) T ss_pred HHHHCCCCEEEEEECCCCCCHH-HHHHHHHHHHHHCCCEEEEEC-----CHHHHHCCC T ss_conf 8872799859996468888787-999999999997597589855-----746765005 No 438 >PRK05480 uridine kinase; Provisional Probab=94.37 E-value=0.055 Score=35.63 Aligned_cols=33 Identities=27% Similarity=0.481 Sum_probs=24.8 Q ss_pred CCEE--EECCCCCCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 9559--9428787703899999999996798289960 Q gi|254780229|r 420 KNLI--IEGPPGTGKSQTITNIIAAAMLHGKKILFCA 454 (1775) Q Consensus 420 ~~~v--i~GPPGTGKSQTIaNlIa~~la~Gk~VLfVa 454 (1775) +-|| |.||+|+||| |+++.|+..+.. +.|.+++ T Consensus 5 ~P~iIgIaG~SgSGKT-T~a~~L~~~l~~-~~v~vi~ 39 (209) T PRK05480 5 QPIIIGIAGGSGSGKT-TVASTIYEELGD-ESIAVIS 39 (209) T ss_pred CCEEEEEECCCCCCHH-HHHHHHHHHCCC-CCEEEEE T ss_conf 9889999899977899-999999998086-8759995 No 439 >PRK10867 signal recognition particle protein; Provisional Probab=94.36 E-value=0.2 Score=29.74 Aligned_cols=57 Identities=25% Similarity=0.372 Sum_probs=41.7 Q ss_pred CCCE-EEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEE---CCHHHHHHHHHHHHHCCCCCE Q ss_conf 8955-99428787703899999999996-798289960---105899999986443584413 Q gi|254780229|r 419 GKNL-IIEGPPGTGKSQTITNIIAAAML-HGKKILFCA---QKMAAIEVVRHRLEKAGLGEF 475 (1775) Q Consensus 419 g~~~-vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVa---eK~aALdVV~~RL~~~gLg~~ 475 (1775) +.++ .+.|.-|+|||.||+-+-..+-- .||+||.|| ...||.|-++.--..+|+.-| T Consensus 99 ~p~VIm~vGLqGsGKTTT~aKLA~~lk~k~~k~vllvaaDt~RpaA~eQL~~la~~~~v~~~ 160 (453) T PRK10867 99 PPAVVLMAGLQGAGKTTSVGKLGKFLREKHKKKVLVVSADVYRPAAIKQLETLAEQVGVDFF 160 (453) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCEE T ss_conf 99699997468885185899999999973898379855887705899999999985198043 No 440 >cd03293 ABC_NrtD_SsuB_transporters NrtD and SsuB are the ATP-binding subunits of the bacterial ABC-type nitrate and sulfonate transport systems, respectively. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.36 E-value=0.028 Score=38.67 Aligned_cols=40 Identities=33% Similarity=0.483 Sum_probs=31.9 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEE Q ss_conf 985148955994287877038999999999967-98289960 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCA 454 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVa 454 (1775) ..+..|.-++|.||-|+||| |+.++|+-.+-. .-+|.|-- T Consensus 25 l~i~~Ge~~~iiGpsGsGKS-TLl~~i~Gl~~p~~G~I~~~G 65 (220) T cd03293 25 LSVEEGEFVALVGPSGCGKS-TLLRIIAGLERPTSGEVLVDG 65 (220) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEECC T ss_conf 89879989999999995799-999999759998873899999 No 441 >PRK09473 oppD oligopeptide transporter ATP-binding component; Provisional Probab=94.35 E-value=0.13 Score=31.61 Aligned_cols=41 Identities=22% Similarity=0.326 Sum_probs=33.1 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC----CEEEEEEC Q ss_conf 9851489559942878770389999999999679----82899601 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG----KKILFCAQ 455 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G----k~VLfVae 455 (1775) ..+..|.-+.|.||-|+||| |++++|.-.+..+ .+|.|--+ T Consensus 37 f~i~~GEilgivGeSGsGKS-Tl~~~i~gll~~~~~~sG~I~~~G~ 81 (330) T PRK09473 37 FSLRAGETLGIVGESGSGKS-QTAFALMGLLAANGRIGGSATFNGR 81 (330) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEEEEECCE T ss_conf 68889989999868987799-9999997688888833589999999 No 442 >cd03217 ABC_FeS_Assembly ABC-type transport system involved in Fe-S cluster assembly, ATPase component. Biosynthesis of iron-sulfur clusters (Fe-S) depends on multiprotein systems. The SUF system of E. coli and Erwinia chrysanthemi is important for Fe-S biogenesis under stressful conditions. The SUF system is made of six proteins: SufC is an atypical cytoplasmic ABC-ATPase, which forms a complex with SufB and SufD; SufA plays the role of a scaffold protein for assembly of iron-sulfur clusters and delivery to target proteins; SufS is a cysteine desulfurase which mobilizes the sulfur atom from cysteine and provides it to the cluster; SufE has no associated function yet. Probab=94.34 E-value=0.024 Score=39.31 Aligned_cols=47 Identities=26% Similarity=0.492 Sum_probs=34.6 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHH---HCCCEEEEEECCHHHHH Q ss_conf 9851489559942878770389999999999---67982899601058999 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAM---LHGKKILFCAQKMAAIE 461 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~l---a~Gk~VLfVaeK~aALd 461 (1775) ..+..|.-+.|-||.|+||| |+.|+|+-.. ....+|+|--+....+. T Consensus 21 l~v~~Gei~~iiGpnGaGKS-TLl~~i~G~~~~~~~~G~I~~~g~~i~~~~ 70 (200) T cd03217 21 LTIKKGEVHALMGPNGSGKS-TLAKTIMGHPKYEVTEGEILFKGEDITDLP 70 (200) T ss_pred EEECCCCEEEEECCCCCCHH-HHHHHHCCCCCCCCCCCEEEECCEECCCCC T ss_conf 68879989999968999999-999997077777852007999999988699 No 443 >cd03251 ABCC_MsbA MsbA is an essential ABC transporter, closely related to eukaryotic MDR proteins. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.34 E-value=0.03 Score=38.31 Aligned_cols=44 Identities=30% Similarity=0.511 Sum_probs=31.1 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CEEEEEECCHHH Q ss_conf 851489559942878770389999999999679-828996010589 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KKILFCAQKMAA 459 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~VLfVaeK~aA 459 (1775) .+..|+-++|.||.|+||| |+.++|.-.+..= ..|++--.-... T Consensus 24 ~i~~Ge~i~IvG~sGsGKS-TLl~ll~gl~~p~~G~I~i~g~~i~~ 68 (234) T cd03251 24 DIPAGETVALVGPSGSGKS-TLVNLIPRFYDVDSGRILIDGHDVRD 68 (234) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEECCEECCC T ss_conf 9879999999989998299-99999966766788689999999660 No 444 >cd03261 ABC_Org_Solvent_Resistant ABC (ATP-binding cassette) transport system involved in resistant to organic solvents; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.34 E-value=0.026 Score=39.07 Aligned_cols=39 Identities=36% Similarity=0.585 Sum_probs=31.0 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE Q ss_conf 985148955994287877038999999999967-9828996 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFC 453 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfV 453 (1775) ..+..|.-++|.||-|+||| |+.++|+-.+-. ..+|+|- T Consensus 21 l~i~~Ge~~~iiG~SGsGKS-Tll~~i~gL~~p~~G~I~~~ 60 (235) T cd03261 21 LDVRRGEILAIIGPSGSGKS-TLLRLIVGLLRPDSGEVLID 60 (235) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 48879989999999997299-99999975999898589999 No 445 >cd03247 ABCC_cytochrome_bd The CYD subfamily implicated in cytochrome bd biogenesis. The CydC and CydD proteins are important for the formation of cytochrome bd terminal oxidase of E. coli and it has been proposed that they were necessary for biosynthesis of the cytochrome bd quinol oxidase and for periplasmic c-type cytochromes. CydCD were proposed to determine a heterooligomeric complex important for heme export into the periplasm or to be involved in the maintenance of the proper redox state of the periplasmic space. In Bacillus subtilius, the absence of CydCD does not affect the presence of halo-cytochrome c in the membrane and this observation suggests that CydCD proteins are not involved in the export of heme in this organism. Probab=94.33 E-value=0.08 Score=33.97 Aligned_cols=46 Identities=20% Similarity=0.441 Sum_probs=35.1 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHHH Q ss_conf 8514895599428787703899999999996-7982899601058999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAIE 461 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aALd 461 (1775) .+..|.-+.|.||.|+||| |+.++|+..+. ..-+|++--.....+. T Consensus 24 ~i~~Ge~~aivG~sGsGKS-TLl~~l~G~~~p~~G~i~i~g~~i~~~~ 70 (178) T cd03247 24 ELKQGEKIALLGRSGSGKS-TLLQLLTGDLKPQQGEITLDGVPVSDLE 70 (178) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHHCCCCCCCEEEECCEEHHHHH T ss_conf 9869999999999987599-9999998617667886999999889978 No 446 >PRK11701 phnK phosphonates transport ATP-binding protein; Provisional Probab=94.32 E-value=0.028 Score=38.68 Aligned_cols=39 Identities=28% Similarity=0.500 Sum_probs=32.4 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEE Q ss_conf 98514895599428787703899999999996-79828996 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFC 453 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfV 453 (1775) ..+..|.-+.|.||.|+||| |+.++|+-.+. ...+|.|- T Consensus 27 ~~v~~GEi~~iiG~nGaGKS-TLl~~i~G~~~p~~G~I~~~ 66 (258) T PRK11701 27 FDLYPGEVLGIVGESGSGKT-TLLNALSARLAPDAGEVHYR 66 (258) T ss_pred EEECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 78879979999888998899-99999856788888739974 No 447 >PRK10247 putative ABC transporter ATP-binding protein YbbL; Provisional Probab=94.31 E-value=0.033 Score=37.99 Aligned_cols=42 Identities=31% Similarity=0.560 Sum_probs=33.2 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEECCH Q ss_conf 85148955994287877038999999999967-98289960105 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQKM 457 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVaeK~ 457 (1775) .+..|.-++|.||-|+||| |+.++|+-.+.. ..+|+|--+.. T Consensus 29 ~i~~Ge~~~i~G~sGsGKS-TLlk~i~gl~~p~~G~I~~~g~~i 71 (225) T PRK10247 29 SLRAGEFKLITGPSGCGKS-TLLKIVASLISPTSGTLLFEGEDI 71 (225) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEECCEEC T ss_conf 9859969999999999999-999999646688876599999997 No 448 >COG0396 sufC Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones] Probab=94.30 E-value=0.031 Score=38.24 Aligned_cols=47 Identities=30% Similarity=0.446 Sum_probs=33.4 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHH---HCCCEEEEEECCHHHHHH Q ss_conf 851489559942878770389999999999---679828996010589999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAM---LHGKKILFCAQKMAAIEV 462 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~l---a~Gk~VLfVaeK~aALdV 462 (1775) .+..|.-.+|-||.|+||| |++++|+-.- --.-+|+|--+--..|.+ T Consensus 26 ~v~~GEvhaiMGPNGsGKS-TLa~~i~G~p~Y~Vt~G~I~~~GedI~~l~~ 75 (251) T COG0396 26 TVKEGEVHAIMGPNGSGKS-TLAYTIMGHPKYEVTEGEILFDGEDILELSP 75 (251) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCEEECCEEEECCCCCCCCCH T ss_conf 6759828999889987889-9999972899746755569987854255998 No 449 >pfam10443 RNA12 RNA12 protein. This family includes RNA12 from S. cerevisiae. That protein contains an RRM domain. This region is C-terminal to that and includes a P-loop motif suggesting this region binds to NTP. The RNA12 proteins is involved in pre-rRNA maturation. Probab=94.30 E-value=0.051 Score=35.94 Aligned_cols=34 Identities=21% Similarity=0.536 Sum_probs=27.3 Q ss_pred CCCE-EEECCCCCCCHHHHHHHH-HHHHHCCCEEEEEECC Q ss_conf 8955-994287877038999999-9999679828996010 Q gi|254780229|r 419 GKNL-IIEGPPGTGKSQTITNII-AAAMLHGKKILFCAQK 456 (1775) Q Consensus 419 g~~~-vi~GPPGTGKSQTIaNlI-a~~la~Gk~VLfVaeK 456 (1775) ...| ||+||.|+||+ .+| -+.|...|.||+.=.| T Consensus 16 ~nTFIvV~GPrGSGK~----eLV~d~~L~~r~~vL~IDC~ 51 (428) T pfam10443 16 PNTFIVVQGPRGSGKR----ELVMDRVLKDRKNVLVIDCD 51 (428) T ss_pred CCEEEEEECCCCCCCC----HHHHHHHHCCCCCEEEEECH T ss_conf 9858999799998843----34658876368887999786 No 450 >cd03237 ABC_RNaseL_inhibitor_domain2 The ATPase domain 2 of RNase L inhibitor. The ABC ATPase, RNase L inhibitor (RLI), is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI's are not transport proteins and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLI's have an N-terminal Fe-S domain and two nucleotide-binding domains which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity of more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology. Probab=94.29 E-value=0.032 Score=38.12 Aligned_cols=37 Identities=24% Similarity=0.416 Sum_probs=29.4 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEE Q ss_conf 8514895599428787703899999999996-7982899 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILF 452 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLf 452 (1775) .+..|.-++|.||-|+||| |+.++|+-.+- .+..|.+ T Consensus 21 ti~~GEiv~liGpNGaGKS-TLlk~l~Gll~p~~G~I~~ 58 (246) T cd03237 21 SISESEVIGILGPNGIGKT-TFIKMLAGVLKPDEGDIEI 58 (246) T ss_pred CCCCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEE T ss_conf 4657989999979997699-9999997787888607998 No 451 >pfam07088 GvpD GvpD gas vesicle protein. This family consists of several archaeal GvpD gas vesicle proteins. GvpD is thought to be involved in the regulation of gas vesicle formation. Probab=94.29 E-value=0.011 Score=43.05 Aligned_cols=14 Identities=43% Similarity=0.622 Sum_probs=6.9 Q ss_pred CCCCEEEEEECCCH Q ss_conf 88831799974580 Q gi|254780229|r 1700 NPGHYLMGIECDGA 1713 (1775) Q Consensus 1700 ~pg~~~l~iecDG~ 1713 (1775) +|-.-+++|..|+. T Consensus 459 ~P~T~~l~i~~~~~ 472 (484) T pfam07088 459 NPLTPLLGIGVDDS 472 (484) T ss_pred CCCCCEEEECCCCC T ss_conf 77752378733556 No 452 >COG1123 ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only] Probab=94.29 E-value=0.15 Score=30.99 Aligned_cols=42 Identities=33% Similarity=0.556 Sum_probs=36.0 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECC Q ss_conf 98514895599428787703899999999996-79828996010 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQK 456 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK 456 (1775) .+++.|..|-|.|+-|+||| |++-+|+-++- .|.++.|.-.. T Consensus 312 f~l~~GE~lglVGeSGsGKS-Tlar~i~gL~~P~~G~i~~~g~~ 354 (539) T COG1123 312 FDLREGETLGLVGESGSGKS-TLARILAGLLPPSSGSIIFDGQD 354 (539) T ss_pred EEECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEEECCC T ss_conf 57638878999889999989-99999948778888449992765 No 453 >PRK10419 nikE nickel transporter ATP-binding protein; Provisional Probab=94.28 E-value=0.21 Score=29.68 Aligned_cols=40 Identities=20% Similarity=0.380 Sum_probs=30.7 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEE Q ss_conf 985148955994287877038999999999967-98289960 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCA 454 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVa 454 (1775) ..+..|..+.|.||-|+||| |++++|+-.... ..+|.|-- T Consensus 33 ~~i~~GE~l~ivGeSGsGKS-TL~r~i~gl~~p~sG~I~~~g 73 (266) T PRK10419 33 LTLKSGETVALLGRSGCGKS-TLARLLVGLESPSQGNISWRG 73 (266) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEECC T ss_conf 58889989999999997799-999999669999962998899 No 454 >PRK08118 topology modulation protein; Reviewed Probab=94.27 E-value=0.042 Score=36.86 Aligned_cols=24 Identities=29% Similarity=0.559 Sum_probs=20.1 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 5599428787703899999999996 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) -+.|.|+||+||| |.+.-++..+. T Consensus 3 rI~IiG~~GsGKS-TlAr~L~~~~~ 26 (167) T PRK08118 3 KIILIGSGGSGKS-TLARQLGEKLN 26 (167) T ss_pred EEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 7999889998799-99999999889 No 455 >PRK13539 cytochrome c biogenesis protein CcmA; Provisional Probab=94.26 E-value=0.033 Score=37.95 Aligned_cols=38 Identities=26% Similarity=0.550 Sum_probs=30.6 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCC-EEEEE Q ss_conf 8514895599428787703899999999996798-28996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGK-KILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk-~VLfV 453 (1775) .+..|.-+.|.||.|+||| |+.++|+-.+..-. .|.+- T Consensus 24 ~i~~Gei~~l~G~NGaGKT-TLlk~i~Gl~~p~~G~I~~~ 62 (206) T PRK13539 24 TLAAGEALVLTGPNGSGKT-TLLRLLAGLLPPAAGTIKLD 62 (206) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCEEEEEC T ss_conf 9869949999899999899-99999958878885189999 No 456 >COG4525 TauB ABC-type taurine transport system, ATPase component [Inorganic ion transport and metabolism] Probab=94.26 E-value=0.028 Score=38.69 Aligned_cols=32 Identities=31% Similarity=0.650 Sum_probs=28.3 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 985148955994287877038999999999967 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) ...+.|..+|+-||-|.||| ||.|+||-.+.. T Consensus 26 L~ia~ge~vv~lGpSGcGKT-TLLnl~AGf~~P 57 (259) T COG4525 26 LTIASGELVVVLGPSGCGKT-TLLNLIAGFVTP 57 (259) T ss_pred EEECCCCEEEEECCCCCCHH-HHHHHHHCCCCC T ss_conf 02358978999768886578-899998627585 No 457 >cd03229 ABC_Class3 This class is comprised of all BPD (Binding Protein Dependent) systems that are largely represented in archaea and eubacteria and are primarily involved in scavenging solutes from the environment. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.26 E-value=0.033 Score=38.00 Aligned_cols=38 Identities=32% Similarity=0.450 Sum_probs=29.8 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE Q ss_conf 85148955994287877038999999999967-9828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfV 453 (1775) .+..|.-++|.||-|+||| |+.++|+-.+-. .-+|.|- T Consensus 22 ~i~~Ge~~~ivGpSG~GKS-Tllr~i~Gl~~p~~G~I~~~ 60 (178) T cd03229 22 NIEAGEIVALLGPSGSGKS-TLLRCIAGLEEPDSGSILID 60 (178) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEEC T ss_conf 9889989999999998399-99999985999996399999 No 458 >cd03262 ABC_HisP_GlnQ_permeases HisP and GlnQ are the ATP-binding components of the bacterial periplasmic histidine and glutamine permeases, repectively. Histidine permease is a multisubunit complex containing the HisQ and HisM integral membrane subunits and two copies of HisP. HisP has properties intermediate between those of integral and peripheral membrane proteins and is accessible from both sides of the membrane, presumably by its interaction with HisQ and HisM. The two HisP subunits form a homodimer within the complex. The domain structure of the amino acid uptake systems is typical for prokaryote extracellular solute binding protein-dependent uptake systems. All of the amino acid uptake systems also have at least one, and in a few cases, two extracellular solute binding proteins located in the periplasm of Gram-negative bacteria, or attached to the cell membrane of Gram-positive bacteria. The best-studied member of the PAAT (polar amino acid transport) family is the HisJQM Probab=94.25 E-value=0.032 Score=38.12 Aligned_cols=51 Identities=27% Similarity=0.423 Sum_probs=34.5 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE----ECCHHHHHHHHH Q ss_conf 985148955994287877038999999999967-9828996----010589999998 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFC----AQKMAAIEVVRH 465 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfV----aeK~aALdVV~~ 465 (1775) ..+..|.-++|.||-|+||| |+.++|+-..-. ..+|+|- ..+...+.-.++ T Consensus 21 l~i~~Ge~~~ivGpSGsGKS-TLL~~i~gL~~p~~G~i~i~g~~i~~~~~~~~~~rr 76 (213) T cd03262 21 LTVKKGEVVVIIGPSGSGKS-TLLRCINLLEEPDSGTIIIDGLKLTDDKKNINELRQ 76 (213) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEECCEECCCCHHHHHHHHC T ss_conf 59889989999999984499-999999819999864999999999998156999867 No 459 >KOG0741 consensus Probab=94.25 E-value=0.038 Score=37.31 Aligned_cols=27 Identities=37% Similarity=0.671 Sum_probs=22.6 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 4895599428787703899999999996 Q gi|254780229|r 418 NGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 418 ~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) .=|..++-|||||||| .||-=|...|- T Consensus 255 HVKGiLLyGPPGTGKT-LiARqIGkMLN 281 (744) T KOG0741 255 HVKGILLYGPPGTGKT-LIARQIGKMLN 281 (744) T ss_pred CEEEEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 1235788779998701-89999878745 No 460 >PRK13632 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=94.24 E-value=0.035 Score=37.67 Aligned_cols=39 Identities=28% Similarity=0.469 Sum_probs=31.5 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHH-HCCCEEEEEE Q ss_conf 851489559942878770389999999999-6798289960 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAM-LHGKKILFCA 454 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLfVa 454 (1775) .+..|.-+.|.||-|+||| |+.++|+-.+ ..+.+|++-- T Consensus 32 ~i~~Ge~vaivG~nGsGKS-TLlk~l~Gll~p~~G~I~v~G 71 (273) T PRK13632 32 TINEGEYVAILGHNGSGKS-TISKILTGLLKPQSGEIKIFG 71 (273) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEECC T ss_conf 9849989999999998699-999999738778887599999 No 461 >cd03244 ABCC_MRP_domain2 Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate. Probab=94.24 E-value=0.035 Score=37.72 Aligned_cols=38 Identities=24% Similarity=0.304 Sum_probs=30.5 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE Q ss_conf 85148955994287877038999999999967-9828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfV 453 (1775) .+..|.-++|.||.|+||| |+.++|+-.+.. .-+|+|- T Consensus 26 ~i~~Ge~v~ivG~sGsGKS-TLl~ll~gl~~p~~G~I~i~ 64 (221) T cd03244 26 SIKPGEKVGIVGRTGSGKS-SLLLALFRLVELSSGSILID 64 (221) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 9869989999999999899-99999967971898489999 No 462 >PRK07165 F0F1 ATP synthase subunit alpha; Validated Probab=94.24 E-value=0.37 Score=26.97 Aligned_cols=72 Identities=21% Similarity=0.243 Sum_probs=55.9 Q ss_pred HHHHHHHHCCCCEEEECCCCCCCHHHH-HHHHHHHHHCCCEEEEEE--CCHHHHHHHHHHHHHCCCCCEEEECCCC Q ss_conf 999998514895599428787703899-999999996798289960--1058999999864435844136540266 Q gi|254780229|r 410 HSAVIDVINGKNLIIEGPPGTGKSQTI-TNIIAAAMLHGKKILFCA--QKMAAIEVVRHRLEKAGLGEFCLELHSH 482 (1775) Q Consensus 410 ~~av~~~~~g~~~vi~GPPGTGKSQTI-aNlIa~~la~Gk~VLfVa--eK~aALdVV~~RL~~~gLg~~cleLHs~ 482 (1775) .-+..-+.+|+--.|.|.-||||| || .++|-+--..+-.+-+|| +|.....-+.++|.+.|.=+++.-+.+. T Consensus 134 ID~miPIGRGQRELIiGDrqTGKT-aIAldtIiNQk~~~V~cIYVAIGQK~s~Va~iv~~L~~~gAm~yTiVV~A~ 208 (507) T PRK07165 134 IDLLIPIGKGQRELIIGDRQTGKT-HIALNTIINQKNRNVKCIYVAIGQKRENLSRIYETLKEHDALKNTIIIDAP 208 (507) T ss_pred HHCCCCCCCCCEEEEECCCCCCHH-HHHHHHHHHCCCCCCEEEEEEECCCHHHHHHHHHHHHHCCCHHHEEEEECC T ss_conf 761355567616777357887788-999999984546895899998310177999999999877975652576315 No 463 >pfam03215 Rad17 Rad17 cell cycle checkpoint protein. Probab=94.23 E-value=0.042 Score=36.90 Aligned_cols=25 Identities=32% Similarity=0.608 Sum_probs=17.8 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 95599428787703899999999996 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) +=+|+.||||.|||.| +.++|.-|. T Consensus 46 ~iLlLtGPaG~GKTTT-I~lLAkeLG 70 (490) T pfam03215 46 LILLLTGPSGCGKSTT-VKVLSKELG 70 (490) T ss_pred EEEEEECCCCCCHHHH-HHHHHHHHC T ss_conf 1899879899889999-999999759 No 464 >PRK11248 tauB taurine transporter ATP-binding subunit; Provisional Probab=94.22 E-value=0.032 Score=38.15 Aligned_cols=39 Identities=31% Similarity=0.509 Sum_probs=31.8 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE Q ss_conf 985148955994287877038999999999967-9828996 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFC 453 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfV 453 (1775) ..+..|.-+.|-||.|+||| |+.|+|+-.+-. ..+|.|- T Consensus 22 l~i~~Ge~~~lvGpnGaGKS-TLl~~i~Gl~~p~~G~I~~~ 61 (255) T PRK11248 22 LTLESGELLVVLGPSGCGKT-TLLNLIAGFVPPQHGSITLD 61 (255) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 79869989999999984699-99999975998899718579 No 465 >PRK11022 dppD dipeptide transporter ATP-binding subunit; Provisional Probab=94.22 E-value=0.14 Score=31.44 Aligned_cols=41 Identities=32% Similarity=0.388 Sum_probs=31.2 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-C----CEEEEEEC Q ss_conf 985148955994287877038999999999967-9----82899601 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-G----KKILFCAQ 455 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-G----k~VLfVae 455 (1775) ..+..|.-+.|.||-|+||| |++++|.-++-. | .+|.|--+ T Consensus 28 f~i~~GEilgivGeSGsGKS-Tl~~~ilgll~~~~~~~~g~i~~~g~ 73 (327) T PRK11022 28 YSVKQGEVVGIVGESGSGKS-VSSLAIMGLIDYPGRVMAEKLEFNGQ 73 (327) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEEEEEECCE T ss_conf 79889999999999987899-99999974889899765427999999 No 466 >TIGR03263 guanyl_kin guanylate kinase. Members of this family are the enzyme guanylate kinase, also called GMP kinase. This enzyme transfers a phosphate from ATP to GMP, yielding ADP and GDP. Probab=94.22 E-value=0.038 Score=37.37 Aligned_cols=24 Identities=38% Similarity=0.718 Sum_probs=19.9 Q ss_pred CCCEEEECCCCCCCHHHHHHHHHHH Q ss_conf 8955994287877038999999999 Q gi|254780229|r 419 GKNLIIEGPPGTGKSQTITNIIAAA 443 (1775) Q Consensus 419 g~~~vi~GPPGTGKSQTIaNlIa~~ 443 (1775) |+=+||.||+|+||+ ||.|.++.- T Consensus 1 G~livl~GpsG~GK~-tl~~~l~~~ 24 (180) T TIGR03263 1 GLLIVISGPSGVGKS-TLVKALLEE 24 (180) T ss_pred CEEEEEECCCCCCHH-HHHHHHHHH T ss_conf 939999899988999-999999976 No 467 >cd03233 ABC_PDR_domain1 The pleiotropic drug resistance (PDR) family of ATP-binding cassette (ABC) transporters. PDR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. This PDR subfamily represents domain I of its (ABC-IM)2 organization. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.21 E-value=0.028 Score=38.67 Aligned_cols=39 Identities=26% Similarity=0.372 Sum_probs=30.8 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC----CCEEEEEE Q ss_conf 85148955994287877038999999999967----98289960 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH----GKKILFCA 454 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~----Gk~VLfVa 454 (1775) .+..|.-++|-||.|+||| |+.|+|+-.+.. ...|+|-- T Consensus 29 ~i~~Gei~~llG~nGsGKS-TLl~~l~G~~~~~~~~~G~i~~~g 71 (202) T cd03233 29 VVKPGEMVLVLGRPGSGCS-TLLKALANRTEGNVSVEGDIHYNG 71 (202) T ss_pred EECCCEEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEEEEECC T ss_conf 9809849999989999889-999998378789875137999999 No 468 >cd03290 ABCC_SUR1_N The SUR domain 1. The sulfonylurea receptor SUR is an ATP transporter of the ABCC/MRP family with tandem ATPase binding domains. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=94.21 E-value=0.034 Score=37.84 Aligned_cols=38 Identities=26% Similarity=0.378 Sum_probs=30.3 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE Q ss_conf 85148955994287877038999999999967-9828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfV 453 (1775) .+..|+-++|.||.|+||| |+.++|+-.+.. ..+|+|- T Consensus 23 ~i~~Ge~~~IvG~sGsGKS-TLl~~l~g~~~~~~G~I~~~ 61 (218) T cd03290 23 RIPTGQLTMIVGQVGCGKS-SLLLAILGEMQTLEGKVHWS 61 (218) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 9869999999999998099-99999855565677649989 No 469 >PRK13640 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=94.21 E-value=0.037 Score=37.50 Aligned_cols=31 Identities=32% Similarity=0.511 Sum_probs=27.7 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 85148955994287877038999999999967 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) .+..|.-+.|.||.|+||| |++.+|+-.+.. T Consensus 30 ~I~~Ge~vaiiG~nGsGKS-TL~~~l~Gll~P 60 (283) T PRK13640 30 SIPRGSWTALIGHNGSGKS-TISKLINGLLLP 60 (283) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCC T ss_conf 9989999999999998799-999999640378 No 470 >cd03232 ABC_PDR_domain2 The pleiotropic drug resistance-like (PDR) family of ATP-binding cassette (ABC) transporters. PDR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. This PDR subfamily represents domain I of its (ABC-IM)2 organization. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.21 E-value=0.031 Score=38.20 Aligned_cols=27 Identities=30% Similarity=0.495 Sum_probs=24.3 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHH Q ss_conf 8514895599428787703899999999 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAA 442 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~ 442 (1775) .+..|.-+.|-||.|+||| |+.++|+- T Consensus 29 ~i~~Ge~~~llGpnGaGKS-TLl~~l~g 55 (192) T cd03232 29 YVKPGTLTALMGESGAGKT-TLLDVLAG 55 (192) T ss_pred EEECCEEEEEECCCCCCHH-HHHHHHHC T ss_conf 9928839999999999889-99999837 No 471 >COG3840 ThiQ ABC-type thiamine transport system, ATPase component [Coenzyme metabolism] Probab=94.20 E-value=0.036 Score=37.54 Aligned_cols=43 Identities=35% Similarity=0.467 Sum_probs=32.9 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCC-EEEEEECCHH Q ss_conf 8514895599428787703899999999996798-2899601058 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGK-KILFCAQKMA 458 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk-~VLfVaeK~a 458 (1775) .+-.|.-+.|-||-|+||| |..|+||-....+. .|++--+-+. T Consensus 21 ~v~~ge~vAi~GpSGaGKS-TLLnLIAGF~~P~~G~i~i~g~d~t 64 (231) T COG3840 21 TVPAGEIVAILGPSGAGKS-TLLNLIAGFETPASGEILINGVDHT 64 (231) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHHCCCCCCCEEEECCEECC T ss_conf 6067857999778886578-8999987424778745898572147 No 472 >cd03231 ABC_CcmA_heme_exporter CcmA, the ATP-binding component of the bacterial CcmAB transporter. The CCM family is involved in bacterial cytochrome c biogenesis. Cytochrome c maturation in E. coli requires the ccm operon, which encodes eight membrane proteins (CcmABCDEFGH). CcmE is a periplasmic heme chaperone that binds heme covalently and transfers it onto apocytochrome c in the presence of CcmF, CcmG, and CcmH. The CcmAB proteins represent an ABC transporter and the CcmCD proteins participate in heme transfer to CcmE. Probab=94.18 E-value=0.036 Score=37.59 Aligned_cols=37 Identities=30% Similarity=0.588 Sum_probs=31.0 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEE Q ss_conf 85148955994287877038999999999967-982899 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILF 452 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLf 452 (1775) .+..|.-++|.||-|+||| |..++|+-.+-. .-+|+| T Consensus 22 ~i~~Ge~~~l~G~NGsGKS-TLlk~i~Gl~~p~sG~i~~ 59 (201) T cd03231 22 TLAAGEALQVTGPNGSGKT-TLLRILAGLSPPLAGRVLL 59 (201) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCEEEEE T ss_conf 8879959999999999999-9999996677888529999 No 473 >PRK13548 hmuV hemin importer ATP-binding subunit; Provisional Probab=94.17 E-value=0.04 Score=37.04 Aligned_cols=39 Identities=23% Similarity=0.429 Sum_probs=30.9 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEE Q ss_conf 8514895599428787703899999999996-798289960 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCA 454 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVa 454 (1775) .+..|.-+.|-||.|+||| |..++|+-.+. ..-+|.|-- T Consensus 24 ~i~~Gei~~liGpNGaGKS-TLlk~i~Gl~~p~~G~I~~~g 63 (257) T PRK13548 24 TLRPGEVVAILGPNGAGKS-TLLRALSGELPPSSGEVRLFG 63 (257) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEEEC T ss_conf 9869989999999998799-999998567577875699936 No 474 >pfam02534 TraG TraG/TraD family. These proteins contain a P-loop and walker-B site for nucleotide binding. TraG is essential for DNA transfer in bacterial conjugation. These proteins are thought to mediate interactions between the DNA-processing (Dtr) and the mating pair formation (Mpf) systems. Probab=94.17 E-value=0.19 Score=30.04 Aligned_cols=62 Identities=23% Similarity=0.289 Sum_probs=46.7 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCC Q ss_conf 9559942878770389999999999679828996010589999998644358441365402661 Q gi|254780229|r 420 KNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVRHRLEKAGLGEFCLELHSHK 483 (1775) Q Consensus 420 ~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~~RL~~~gLg~~cleLHs~k 483 (1775) ..+.|.||||||||.++ +|-.+|..+-++.++-=|-=-...-..-+...|=.-+|+...+-+ T Consensus 45 ~H~lviaptgsGKt~~~--ViPnLl~~~~S~VV~DpKGEl~~~Ta~~r~~~G~~V~v~np~~~~ 106 (468) T pfam02534 45 SHVLVFAGTGSGKGVGV--VIPNLLVWKGSIVVLDPKGELWELTSGIREKQGRKVFVLEPFNPK 106 (468) T ss_pred CEEEEECCCCCCCCCCC--HHHHHHHCCCCEEEEECCHHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 67999848999723424--298885189978999484889999999999779969999797999 No 475 >pfam00158 Sigma54_activat Sigma-54 interaction domain. Probab=94.16 E-value=0.08 Score=33.94 Aligned_cols=31 Identities=23% Similarity=0.269 Sum_probs=24.6 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 98514895599428787703899999999996 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) ..+....+++|+|+|||||+ ++|..|-..=. T Consensus 17 ~~a~~~~pVLI~GE~GtGK~-~lAr~IH~~S~ 47 (168) T pfam00158 17 RVAPTDATVLITGESGTGKE-LFARAIHQLSP 47 (168) T ss_pred HHHCCCCCEEEECCCCCCHH-HHHHHHHHHCC T ss_conf 99588998899899988889-99999998524 No 476 >PRK10851 sulfate/thiosulfate transporter subunit; Provisional Probab=94.16 E-value=0.027 Score=38.82 Aligned_cols=37 Identities=35% Similarity=0.604 Sum_probs=20.6 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEE Q ss_conf 85148955994287877038999999999967-982899 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILF 452 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLf 452 (1775) .+..|.-++|-||-|+||| |+.++|+-+... .-+|+| T Consensus 24 ~i~~GE~~~llGpSGsGKS-TLlr~iaGL~~p~sG~I~~ 61 (352) T PRK10851 24 DIPSGQMVALLGPSGSGKT-TLLRIIAGLEHQTSGHIRF 61 (352) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEE T ss_conf 9999989999999984699-9999997699999569999 No 477 >PRK13538 cytochrome c biogenesis protein CcmA; Provisional Probab=94.16 E-value=0.037 Score=37.48 Aligned_cols=38 Identities=29% Similarity=0.536 Sum_probs=30.6 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEE Q ss_conf 8514895599428787703899999999996-79828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfV 453 (1775) .+..|.-+.|.||-|+||| |+.++|+-.+- ...+|.|- T Consensus 23 ~i~~Ge~~~l~G~NGaGKS-TLl~~l~Gl~~p~~G~i~~~ 61 (204) T PRK13538 23 TLNAGELVQIEGPNGAGKT-SLLRILAGLARPDAGEVLWQ 61 (204) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 9879989999999998599-99999976888887379999 No 478 >PRK13543 cytochrome c biogenesis protein CcmA; Provisional Probab=94.14 E-value=0.037 Score=37.40 Aligned_cols=38 Identities=26% Similarity=0.476 Sum_probs=31.3 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE Q ss_conf 85148955994287877038999999999967-9828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfV 453 (1775) .+..|.-++|.||.|+||| |..++|+-.+.. .-+|+|- T Consensus 33 ~v~~Ge~~~l~GpNGaGKT-TLlr~l~Gl~~p~~G~I~~~ 71 (214) T PRK13543 33 HVDAGEALLVQGDNGAGKT-TLLRVLAGLLHVESGQIQID 71 (214) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCEEEEEC T ss_conf 9818989999999998799-99999976977884199999 No 479 >cd03250 ABCC_MRP_domain1 Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate. Probab=94.14 E-value=0.034 Score=37.88 Aligned_cols=42 Identities=29% Similarity=0.523 Sum_probs=32.3 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH--CC-----CEEEEEECCH Q ss_conf 8514895599428787703899999999996--79-----8289960105 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML--HG-----KKILFCAQKM 457 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la--~G-----k~VLfVaeK~ 457 (1775) .+..|.-+.|.||.|+||| |+.++|+-.+- .| ++|-||.+.. T Consensus 27 ~i~~Ge~~~IvG~sGsGKS-TLl~~i~G~~~~~~G~I~~~g~i~~v~Q~~ 75 (204) T cd03250 27 EVPKGELVAIVGPVGSGKS-SLLSALLGELEKLSGSVSVPGSIAYVSQEP 75 (204) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHCCCCCCCCCCCCCCCCEEEEECCC T ss_conf 9769989999999998589-999998189525689522589889995877 No 480 >cd03257 ABC_NikE_OppD_transporters The ABC transporter subfamily specific for the transport of dipeptides, oligopeptides (OppD), and nickel (NikDE). The NikABCDE system of E. coli belongs to this family and is composed of the periplasmic binding protein NikA, two integral membrane components (NikB and NikC), and two ATPase (NikD and NikE). The NikABCDE transporter is synthesized under anaerobic conditions to meet the increased demand for nickel resulting from hydrogenase synthesis. The molecular mechanism of nickel uptake in many bacteria and most archaea is not known. Many other members of this ABC family are also involved in the uptake of dipeptides and oligopeptides. The oligopeptide transport system (Opp) is a five-component ABC transport composed of a membrane-anchored substrate binding proteins (SRP), OppA, two transmembrane proteins, OppB and OppC, and two ATP-binding domains, OppD and OppF. Probab=94.14 E-value=0.034 Score=37.78 Aligned_cols=40 Identities=28% Similarity=0.476 Sum_probs=31.5 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEE Q ss_conf 98514895599428787703899999999996-798289960 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCA 454 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVa 454 (1775) ..+..|.-+.|-||-|+||| |+.++|+-.+- ...+|.|-- T Consensus 26 l~i~~Ge~~~iiG~sGsGKS-TLl~~i~Gl~~p~~G~I~~~g 66 (228) T cd03257 26 FSIKKGETLGLVGESGSGKS-TLARAILGLLKPTSGSIIFDG 66 (228) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEECC T ss_conf 89869989999999998699-999999728987886699899 No 481 >cd03294 ABC_Pro_Gly_Bertaine This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.13 E-value=0.033 Score=38.01 Aligned_cols=40 Identities=33% Similarity=0.490 Sum_probs=32.1 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEE Q ss_conf 98514895599428787703899999999996-798289960 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCA 454 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVa 454 (1775) ..+..|.-++|-||-|+||| |+.++|+-.+- ...+|+|-- T Consensus 45 l~i~~GE~~~ivG~SGsGKS-TLLr~i~GL~~p~~G~I~~~G 85 (269) T cd03294 45 LDVREGEIFVIMGLSGSGKS-TLLRCINRLIEPTSGKVLIDG 85 (269) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEECC T ss_conf 58889999999989984899-999999759999975999999 No 482 >PRK11650 ugpC glycerol-3-phosphate transporter ATP-binding subunit; Provisional Probab=94.13 E-value=0.033 Score=37.90 Aligned_cols=39 Identities=31% Similarity=0.532 Sum_probs=23.7 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEE Q ss_conf 85148955994287877038999999999967-98289960 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCA 454 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVa 454 (1775) .+..|.-|+|-||-|+||| |+.++|+-..-. ..+|+|-- T Consensus 26 ~i~~Ge~~~llGpsG~GKT-Tllr~iaGl~~p~~G~I~~~g 65 (358) T PRK11650 26 DVADGEFIVLVGPSGCGKS-TLLRMVAGLERITSGEIWIGG 65 (358) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEECC T ss_conf 9889989999999863699-999999769998862999999 No 483 >PRK00023 cmk cytidylate kinase; Provisional Probab=94.12 E-value=0.044 Score=36.62 Aligned_cols=24 Identities=38% Similarity=0.738 Sum_probs=21.1 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 5599428787703899999999996 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAML 445 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la 445 (1775) -..|.||.||||| |++-.||.-|. T Consensus 6 IIaIDGpagSGKS-T~ak~lA~~L~ 29 (225) T PRK00023 6 VIAIDGPAGSGKG-TVAKILAKKLG 29 (225) T ss_pred EEEEECCCCCCHH-HHHHHHHHHHC T ss_conf 8996589867878-99999999939 No 484 >TIGR03375 type_I_sec_LssB type I secretion system ATPase, LssB family. Type I protein secretion is a system in some Gram-negative bacteria to export proteins (often proteases) across both inner and outer membranes to the extracellular medium. This is one of three proteins of the type I secretion apparatus. Targeted proteins are not cleaved at the N-terminus, but rather carry signals located toward the extreme C-terminus to direct type I secretion. This model is related to models TIGR01842 and TIGR01846, and to bacteriocin ABC transporters that cleave their substrates during export. Probab=94.12 E-value=0.031 Score=38.27 Aligned_cols=36 Identities=14% Similarity=0.291 Sum_probs=18.8 Q ss_pred CCCCCEEEEC--CHHHCCHHHHHHHHHHCCEEEEEECC Q ss_conf 2541078860--63236989988988611828996077 Q gi|254780229|r 1359 DVKFDLVIMD--ESSQIKPEDALGVIARGKQIVVVGDP 1394 (1775) Q Consensus 1359 ~~~FD~VI~D--EASQ~~~~~al~al~rakqvvvvGD~ 1394 (1775) ...|+=|-|- ..+.--..+.=-.|-.|..|.|||.. T Consensus 463 ~i~~~~vsf~y~~~~~~vl~~vsl~i~~Ge~vaIvG~s 500 (694) T TIGR03375 463 EIEFRNVSFAYPGQETPALDNVSLTIRPGEKVAIIGRI 500 (694) T ss_pred EEEEEEEEEECCCCCCHHHCCHHHEECCCCEEEEEECC T ss_conf 49999999987988922213631188799789998058 No 485 >cd03221 ABCF_EF-3 ABCF_EF-3 Elongation factor 3 (EF-3) is a cytosolic protein required by fungal ribosomes for in vitro protein synthesis and for in vivo growth. EF-3 stimulates the binding of the EF-1: GTP: aa-tRNA ternary complex to the ribosomal A site by facilitated release of the deacylated tRNA from the E site. The reaction requires ATP hydrolysis. EF-3 contains two ATP nucleotide binding sequence (NBS) motifs. NBSI is sufficient for the intrinsic ATPase activity. NBSII is essential for the ribosome-stimulated functions. Probab=94.11 E-value=0.037 Score=37.46 Aligned_cols=32 Identities=28% Similarity=0.454 Sum_probs=27.7 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC Q ss_conf 851489559942878770389999999999679 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG 447 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G 447 (1775) .+..|+-+.|.||.|+||| |+.++|+-.+... T Consensus 22 ~i~~ge~~~l~G~NGsGKT-Tl~~~l~G~~~~~ 53 (144) T cd03221 22 TINPGDRIGLVGRNGAGKS-TLLKLIAGELEPD 53 (144) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCC T ss_conf 9879999999989998499-9999984898898 No 486 >PRK12422 chromosomal replication initiation protein; Provisional Probab=94.11 E-value=0.21 Score=29.68 Aligned_cols=41 Identities=24% Similarity=0.526 Sum_probs=28.9 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEECCHHHHHH Q ss_conf 5599428787703899999999996-79828996010589999 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQKMAAIEV 462 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVaeK~aALdV 462 (1775) -|.|-||+|.|||| ..++|++.+. .+++|++|+--.=.-+. T Consensus 143 PLfIyG~~GlGKTH-LL~AIgn~i~~~~~kV~Yvtae~F~~~~ 184 (455) T PRK12422 143 PIYLFGPEGSGKTH-LMQAAVSALRESGGKILYVSSELFTEHL 184 (455) T ss_pred CEEEECCCCCCHHH-HHHHHHHHHHCCCCEEEEECHHHHHHHH T ss_conf 75887899997899-9999999853799869997499999999 No 487 >PRK10070 glycine betaine transporter ATP-binding subunit; Provisional Probab=94.10 E-value=0.029 Score=38.46 Aligned_cols=41 Identities=22% Similarity=0.419 Sum_probs=32.6 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEEC Q ss_conf 985148955994287877038999999999967-982899601 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCAQ 455 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVae 455 (1775) ..+..|.-|+|-||-|+||| |+.++|+-++-. ..+|+|--+ T Consensus 49 l~I~~GEi~~ivG~SGsGKS-TLlr~i~gL~~Pt~G~I~i~G~ 90 (400) T PRK10070 49 LAIEEGEIFVIMGLSGSGKS-TMVRLLNRLIEPTRGQVLIDGV 90 (400) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEECCE T ss_conf 68879999999999984699-9999997599989818999999 No 488 >COG4096 HsdR Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [Defense mechanisms] Probab=94.09 E-value=0.14 Score=31.52 Aligned_cols=56 Identities=30% Similarity=0.409 Sum_probs=44.5 Q ss_pred HHHHH----HHHHCCCCEE-EECCCCCCCHHHHHHHHHHHHHCC--CEEEEEECCHHHHHHHH Q ss_conf 89999----9851489559-942878770389999999999679--82899601058999999 Q gi|254780229|r 409 QHSAV----IDVINGKNLI-IEGPPGTGKSQTITNIIAAAMLHG--KKILFCAQKMAAIEVVR 464 (1775) Q Consensus 409 Q~~av----~~~~~g~~~v-i~GPPGTGKSQTIaNlIa~~la~G--k~VLfVaeK~aALdVV~ 464 (1775) |+.|| .+..+|++=+ |.=-||||||.|---||--++-.| |||||.+-.++-.+--+ T Consensus 170 Q~~AI~rv~Eaf~~g~~raLlvMATGTGKTrTAiaii~rL~r~~~~KRVLFLaDR~~Lv~QA~ 232 (875) T COG4096 170 QIIAIRRVIEAFSKGQNRALLVMATGTGKTRTAIAIIDRLIKSGWVKRVLFLADRNALVDQAY 232 (875) T ss_pred HHHHHHHHHHHHHCCCCEEEEEEECCCCCCEEHHHHHHHHHHCCHHHEEEEEECHHHHHHHHH T ss_conf 899999999998668744899970588852319999999996141430567612678999999 No 489 >pfam00693 Herpes_TK Thymidine kinase from herpesvirus. Probab=94.08 E-value=0.06 Score=35.24 Aligned_cols=38 Identities=34% Similarity=0.456 Sum_probs=35.1 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHH Q ss_conf 287877038999999999967982899601058999999 Q gi|254780229|r 426 GPPGTGKSQTITNIIAAAMLHGKKILFCAQKMAAIEVVR 464 (1775) Q Consensus 426 GPPGTGKSQTIaNlIa~~la~Gk~VLfVaeK~aALdVV~ 464 (1775) ||=|.||| |.++.++.+.+.|-.||||-|-|+....+. T Consensus 1 G~~GiGKT-Tt~~~l~~~~~~~~~~lyvpEPMaYWR~~f 38 (279) T pfam00693 1 GAHGIGKT-TTARALAAASSAGGPILYVPEPMAYWRTLF 38 (279) T ss_pred CCCCCCHH-HHHHHHHHHCCCCCCEEECCCHHHHHHHHC T ss_conf 99763577-799999861367897576276289999875 No 490 >cd03256 ABC_PhnC_transporter ABC-type phosphate/phosphonate transport system. Phosphonates are a class of organophosphorus compounds characterized by a chemically stable carbon-to-phosphorus (C-P) bond. Phosphonates are widespread among naturally occurring compounds in all kingdoms of wildlife, but only procaryotic microorganisms are able to cleave this bond. Certain bacteria such as E. coli can use alkylphosphonates as a phosphorus source. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.08 E-value=0.036 Score=37.58 Aligned_cols=37 Identities=24% Similarity=0.435 Sum_probs=29.9 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCC-CEEEE Q ss_conf 851489559942878770389999999999679-82899 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLHG-KKILF 452 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~G-k~VLf 452 (1775) .+..|.-+.|-||-|+||| |+.++|+-.+-.- .+|.| T Consensus 23 ~i~~Ge~~~iiGpsGsGKS-TLl~~i~gl~~p~~G~I~~ 60 (241) T cd03256 23 SINPGEFVALIGPSGAGKS-TLLRCLNGLVEPTSGSVLI 60 (241) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEE T ss_conf 9999989999999983399-9999997499998559999 No 491 >PRK06793 fliI flagellum-specific ATP synthase; Validated Probab=94.08 E-value=0.27 Score=28.45 Aligned_cols=78 Identities=15% Similarity=0.272 Sum_probs=53.6 Q ss_pred CCCCCHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEE-ECCHHHH-HHHHHHHHHCCCCCEEEEC Q ss_conf 6677346899999851489559942878770389999999999679828996-0105899-9999864435844136540 Q gi|254780229|r 402 IDDSDSSQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGKKILFC-AQKMAAI-EVVRHRLEKAGLGEFCLEL 479 (1775) Q Consensus 402 ~~~aDsSQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk~VLfV-aeK~aAL-dVV~~RL~~~gLg~~cleL 479 (1775) +++---.=.-+..-..+|+-.-|-|+.|+||| ||.++|+..-..+-.|... -|+..-. +.+.+-|.+.|+..-++-+ T Consensus 139 ~l~TGIraID~l~pigrGQR~gIfg~sGvGKs-tLl~~i~~~~~adv~Vi~lIGeRgrEv~efi~~~l~~~gl~~tvvV~ 217 (432) T PRK06793 139 VFETGIKSIDSMLTIGIGQKIGIFAGSGVGKS-TLLGMIAKNAKADINVISLVGERGREVKDFIRKELGEEGMRKSVVVV 217 (432) T ss_pred CCCCCCCHHHHCCCCCCCCEEEEECCCCCCHH-HHHHHHHHHCCCCCEEEEEEEEECHHHHHHHHHHHHCCCCCCEEEEE T ss_conf 40048731000156124626666326898789-99999987245786579998886279999999986446752127887 Q ss_pred C Q ss_conf 2 Q gi|254780229|r 480 H 480 (1775) Q Consensus 480 H 480 (1775) . T Consensus 218 a 218 (432) T PRK06793 218 A 218 (432) T ss_pred E T ss_conf 2 No 492 >COG1120 FepC ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism] Probab=94.07 E-value=0.045 Score=36.58 Aligned_cols=37 Identities=30% Similarity=0.478 Sum_probs=28.6 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEE Q ss_conf 85148955994287877038999999999967-982899 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILF 452 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLf 452 (1775) .+..|+-++|-||.|+||| |+...|+-.+.. +..|++ T Consensus 24 ~i~~G~i~~iiGpNG~GKS-TLLk~l~~~l~p~~G~V~l 61 (258) T COG1120 24 SIPKGEITGILGPNGSGKS-TLLKCLAGLLKPKSGEVLL 61 (258) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEE T ss_conf 8659979999899888999-9999986567888877999 No 493 >cd03228 ABCC_MRP_Like The MRP (Mutidrug Resistance Protein)-like transporters are involved in drug, peptide, and lipid export. They belong to the subfamily C of the ATP-binding cassette (ABC) superfamily of transport proteins. The ABCC subfamily contains transporters with a diverse functional spectrum that includes ion transport, cell surface receptor, and toxin secretion activities. The MRP-like family, simlar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains, each composed of six transmembrane (TM) helices, and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.07 E-value=0.041 Score=36.95 Aligned_cols=40 Identities=28% Similarity=0.457 Sum_probs=32.1 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEEEC Q ss_conf 8514895599428787703899999999996-7982899601 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFCAQ 455 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfVae 455 (1775) .+..|.-+.|.||.|+||| |+.++|+-.+. ...+|++--. T Consensus 24 ~i~~Ge~i~ivG~sGsGKS-TLl~ll~gl~~p~~G~I~i~g~ 64 (171) T cd03228 24 TIKPGEKVAIVGPSGSGKS-TLLKLLLRLYDPTSGEILIDGV 64 (171) T ss_pred EECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEECCE T ss_conf 9859989999999998399-9999997677589748999999 No 494 >cd00267 ABC_ATPase ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.07 E-value=0.04 Score=37.05 Aligned_cols=38 Identities=32% Similarity=0.489 Sum_probs=30.2 Q ss_pred HHHCCCCEEEECCCCCCCHHHHHHHHHHHHH-CCCEEEEE Q ss_conf 8514895599428787703899999999996-79828996 Q gi|254780229|r 415 DVINGKNLIIEGPPGTGKSQTITNIIAAAML-HGKKILFC 453 (1775) Q Consensus 415 ~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la-~Gk~VLfV 453 (1775) .+..|.-+.|.||.|+||| |+.++|+-.+. ..-+|.|- T Consensus 21 ~i~~Ge~~~i~G~nGaGKS-TLl~~l~gl~~~~~G~i~~~ 59 (157) T cd00267 21 TLKAGEIVALVGPNGSGKS-TLLRAIAGLLKPTSGEILID 59 (157) T ss_pred EECCCCEEEEECCCCCCHH-HHHHHHHCCCCCCCCEEEEC T ss_conf 9879979999878899989-99999958847996289999 No 495 >cd03295 ABC_OpuCA_Osmoprotection OpuCA is a the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment. ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition, to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=94.06 E-value=0.036 Score=37.59 Aligned_cols=40 Identities=25% Similarity=0.537 Sum_probs=31.8 Q ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEEE Q ss_conf 985148955994287877038999999999967-98289960 Q gi|254780229|r 414 IDVINGKNLIIEGPPGTGKSQTITNIIAAAMLH-GKKILFCA 454 (1775) Q Consensus 414 ~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~-Gk~VLfVa 454 (1775) ..+..|.-++|-||-|+||| |+.++|+-.+-. ..+|+|-- T Consensus 22 l~i~~Ge~~~ilGpSG~GKS-Tllr~i~gl~~p~~G~I~i~g 62 (242) T cd03295 22 LEIAKGEFLVLIGPSGSGKT-TTMKMINRLIEPTSGEIFIDG 62 (242) T ss_pred EEECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCEEEEECC T ss_conf 68869989999999995699-999999759999815999999 No 496 >cd03239 ABC_SMC_head The structural maintenance of chromosomes (SMC) proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and each has five distinct domains: amino- and carboxy-terminal globular domains, which contain sequences characteristic of ATPases, two coiled-coil regions separating the terminal domains , and a central flexible hinge. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair, and epigenetic silencing of gene expression. Probab=94.06 E-value=0.047 Score=36.34 Aligned_cols=25 Identities=36% Similarity=0.452 Sum_probs=21.5 Q ss_pred CEEEECCCCCCCHHHHHHHHHHHHHC Q ss_conf 55994287877038999999999967 Q gi|254780229|r 421 NLIIEGPPGTGKSQTITNIIAAAMLH 446 (1775) Q Consensus 421 ~~vi~GPPGTGKSQTIaNlIa~~la~ 446 (1775) -.+|.||.|+||| +|.++|+..|.. T Consensus 24 ~~~ivG~nGsGKS-ni~~ai~~~~g~ 48 (178) T cd03239 24 FNAIVGPNGSGKS-NIVDAICFVLGG 48 (178) T ss_pred EEEEECCCCCCHH-HHHHHHHHHHCC T ss_conf 1799899988778-999999999866 No 497 >PRK11000 maltose/maltodextrin transporter ATP-binding protein; Provisional Probab=94.06 E-value=0.035 Score=37.66 Aligned_cols=27 Identities=37% Similarity=0.688 Sum_probs=12.6 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHH Q ss_conf 5148955994287877038999999999 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAA 443 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~ 443 (1775) +..|.-++|-||.|+||| |+.++|+-. T Consensus 26 i~~Ge~~~llGpsG~GKS-Tllr~i~Gl 52 (369) T PRK11000 26 IHEGEFVVFVGPSGCGKS-TLLRMIAGL 52 (369) T ss_pred ECCCCEEEEECCCCCHHH-HHHHHHHCC T ss_conf 879989999999973699-999999779 No 498 >CHL00095 clpC Clp protease ATP binding subunit Probab=94.06 E-value=0.093 Score=33.29 Aligned_cols=40 Identities=23% Similarity=0.374 Sum_probs=34.9 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 68999998514895599428787703899999999996798 Q gi|254780229|r 408 SQHSAVIDVINGKNLIIEGPPGTGKSQTITNIIAAAMLHGK 448 (1775) Q Consensus 408 SQ~~av~~~~~g~~~vi~GPPGTGKSQTIaNlIa~~la~Gk 448 (1775) -+..-|..=+.-.|-|+.|.||.||| .|+.=+|..++.|+ T Consensus 189 ~r~i~IL~RR~KNNpiLvGepGVGKT-AIvEGLA~rI~~g~ 228 (823) T CHL00095 189 ERVIQILGRRTKNNPILIGEPGVGKT-AIAEGLAQRIANRD 228 (823) T ss_pred HHHHHHHHHHCCCCCEEECCCCCCHH-HHHHHHHHHHHCCC T ss_conf 99999997732488502379998799-99999999760889 No 499 >PRK13646 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=94.05 E-value=0.038 Score=37.29 Aligned_cols=25 Identities=40% Similarity=0.543 Sum_probs=0.0 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHH Q ss_conf 51489559942878770389999999 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIA 441 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa 441 (1775) +..|.-+.|.||.|+||| |+.++|+ T Consensus 30 I~~Ge~~aiiG~nGsGKS-TLl~~l~ 54 (286) T PRK13646 30 FEQGKYYAIVGQTGSGKS-TLIQNIN 54 (286) T ss_pred ECCCCEEEEECCCCCHHH-HHHHHHH T ss_conf 869989999999998199-9999997 No 500 >PRK13637 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=94.04 E-value=0.033 Score=37.94 Aligned_cols=36 Identities=25% Similarity=0.460 Sum_probs=0.0 Q ss_pred HHCCCCEEEECCCCCCCHHHHHHHHHHHH-HCCCEEEE Q ss_conf 51489559942878770389999999999-67982899 Q gi|254780229|r 416 VINGKNLIIEGPPGTGKSQTITNIIAAAM-LHGKKILF 452 (1775) Q Consensus 416 ~~~g~~~vi~GPPGTGKSQTIaNlIa~~l-a~Gk~VLf 452 (1775) +..|.-++|.||.|+||| |+.++|+-.+ ....+|+| T Consensus 30 i~~GE~v~iiG~nGsGKS-TLl~~l~GLl~p~~G~V~i 66 (287) T PRK13637 30 IEDGEFVALIGHTGSGKS-TLIQHLNGLLKPTSGKIII 66 (287) T ss_pred ECCCCEEEEECCCCCHHH-HHHHHHHCCCCCCCCEEEE T ss_conf 879989999999993999-9999997399888726999 Done!