Query gi|254780617|ref|YP_003065030.1| F0F1 ATP synthase subunit alpha [Candidatus Liberibacter asiaticus str. psy62] Match_columns 509 No_of_seqs 237 out of 2567 Neff 6.0 Searched_HMMs 39220 Date Mon May 30 02:11:50 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780617.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00962 atpA ATP synthase F1 100.0 0 0 1553.0 34.5 508 2-509 1-519 (520) 2 CHL00059 atpA ATP synthase CF1 100.0 0 0 1257.3 51.4 500 1-508 2-501 (501) 3 PRK09281 F0F1 ATP synthase sub 100.0 0 0 1254.3 52.3 501 1-509 1-501 (502) 4 TIGR03324 alt_F1F0_F1_al alter 100.0 0 0 1229.4 51.0 492 5-504 5-496 (497) 5 COG0056 AtpA F0F1-type ATP syn 100.0 0 0 1234.3 44.7 501 1-509 1-501 (504) 6 PRK13343 F0F1 ATP synthase sub 100.0 0 0 1216.7 49.8 501 1-509 1-501 (502) 7 PRK07165 F0F1 ATP synthase sub 100.0 0 0 1091.0 41.4 464 29-506 3-475 (507) 8 TIGR02546 III_secr_ATP type II 100.0 0 0 1010.3 31.8 410 25-449 3-429 (430) 9 TIGR02545 ATP_syn_fliI flagell 100.0 0 0 979.2 36.5 413 26-449 2-438 (439) 10 PRK06002 fliI flagellum-specif 100.0 0 0 968.9 42.1 437 3-454 1-446 (450) 11 PRK05688 fliI flagellum-specif 100.0 0 0 964.2 40.9 435 1-448 1-445 (451) 12 PRK09099 type III secretion sy 100.0 0 0 944.0 41.3 431 5-450 2-440 (441) 13 PRK06820 type III secretion sy 100.0 0 0 939.4 41.4 429 4-448 5-443 (445) 14 TIGR01026 fliI_yscN ATPase Fli 100.0 0 0 950.6 32.3 430 8-448 4-453 (455) 15 PTZ00185 ATPase alpha subunit; 100.0 0 0 948.8 28.5 439 26-469 38-491 (574) 16 PRK08972 fliI flagellum-specif 100.0 0 0 924.3 42.7 431 5-449 2-438 (440) 17 PRK06936 type III secretion sy 100.0 0 0 922.7 41.9 430 3-450 2-439 (439) 18 PRK07594 type III secretion sy 100.0 0 0 923.5 41.2 425 10-450 3-432 (433) 19 PRK08472 fliI flagellum-specif 100.0 0 0 922.7 41.6 420 12-449 6-434 (435) 20 PRK08927 fliI flagellum-specif 100.0 0 0 914.3 41.7 427 10-452 3-438 (441) 21 TIGR03496 FliI_clade1 flagella 100.0 0 0 915.8 38.4 404 29-445 1-411 (411) 22 PRK07721 fliI flagellum-specif 100.0 0 0 909.8 41.0 423 14-451 3-434 (435) 23 TIGR03498 FliI_clade3 flagella 100.0 0 0 908.0 40.6 408 29-448 1-417 (418) 24 PRK07196 fliI flagellum-specif 100.0 0 0 906.4 41.8 423 12-448 2-431 (434) 25 PRK12597 F0F1 ATP synthase sub 100.0 0 0 904.6 43.0 422 27-460 2-442 (459) 26 PRK04196 V-type ATP synthase s 100.0 0 0 906.5 41.2 442 26-477 2-457 (460) 27 PRK06793 fliI flagellum-specif 100.0 0 0 905.1 40.3 419 14-448 8-431 (432) 28 PRK07960 fliI flagellum-specif 100.0 0 0 902.6 41.6 435 1-448 1-452 (455) 29 PRK06315 type III secretion sy 100.0 0 0 902.7 40.7 430 6-449 2-441 (442) 30 TIGR03305 alt_F1F0_F1_bet alte 100.0 0 0 906.7 36.5 434 29-483 1-443 (449) 31 TIGR03497 FliI_clade2 flagella 100.0 0 0 901.0 38.9 405 29-448 1-412 (413) 32 PRK08149 ATP synthase SpaL; Va 100.0 0 0 896.5 39.3 410 25-449 4-426 (427) 33 PRK09280 F0F1 ATP synthase sub 100.0 0 0 891.0 39.9 420 26-457 3-443 (466) 34 CHL00060 atpB ATP synthase CF1 100.0 0 0 884.3 38.2 442 22-484 4-468 (480) 35 PRK05922 type III secretion sy 100.0 0 0 880.7 39.2 421 11-451 6-433 (434) 36 COG1157 FliI Flagellar biosynt 100.0 0 0 865.2 39.5 419 16-449 13-439 (441) 37 PRK02118 V-type ATP synthase s 100.0 0 0 796.5 39.7 400 26-450 3-411 (432) 38 cd01136 ATPase_flagellum-secre 100.0 0 0 797.3 28.0 323 94-430 1-326 (326) 39 cd01135 V_A-ATPase_B V/A-type 100.0 0 0 738.0 21.5 275 94-374 1-276 (276) 40 cd01132 F1_ATPase_alpha F1 ATP 100.0 0 0 728.9 20.2 274 94-375 1-274 (274) 41 PRK04192 V-type ATP synthase s 100.0 0 0 692.9 39.2 426 27-463 3-547 (585) 42 cd01133 F1-ATPase_beta F1 ATP 100.0 0 0 693.9 22.3 272 94-376 1-273 (274) 43 COG0055 AtpD F0F1-type ATP syn 100.0 0 0 637.6 29.0 438 27-485 2-454 (468) 44 cd01134 V_A-ATPase_A V/A-type 100.0 0 0 637.4 18.3 333 29-374 9-369 (369) 45 KOG1353 consensus 100.0 0 0 628.9 13.9 340 8-448 1-340 (340) 46 COG1156 NtpB Archaeal/vacuolar 100.0 0 0 604.3 29.5 435 25-469 4-452 (463) 47 TIGR01043 ATP_syn_A_arch ATP s 100.0 0 0 534.2 30.3 460 28-509 2-584 (584) 48 pfam00006 ATP-synt_ab ATP synt 100.0 0 0 543.1 19.6 213 148-372 1-213 (213) 49 KOG1351 consensus 100.0 0 0 525.0 30.8 433 16-452 13-465 (489) 50 KOG1350 consensus 100.0 0 0 533.7 21.1 426 25-471 49-490 (521) 51 cd01128 rho_factor Transcripti 100.0 0 0 523.9 22.1 243 147-406 1-244 (249) 52 COG1155 NtpA Archaeal/vacuolar 100.0 0 0 503.7 27.8 410 28-448 2-524 (588) 53 TIGR01041 ATP_syn_B_arch ATP s 100.0 0 0 501.8 28.7 434 28-471 2-449 (458) 54 TIGR01040 V-ATPase_V1_B V-type 100.0 0 0 479.4 27.2 422 28-452 2-440 (464) 55 TIGR01039 atpD ATP synthase F1 100.0 0 0 485.6 21.0 435 28-483 2-447 (460) 56 KOG1352 consensus 100.0 0 0 471.3 21.7 476 19-508 12-613 (618) 57 TIGR01042 V-ATPase_V1_A V-type 100.0 0 0 407.1 23.3 470 27-508 1-595 (596) 58 PRK12608 transcription termina 100.0 6.1E-40 1.5E-44 309.0 21.0 293 79-403 55-357 (379) 59 PRK09376 rho transcription ter 100.0 5.6E-33 1.4E-37 258.4 15.5 293 80-403 90-394 (416) 60 PRK12678 transcription termina 100.0 3.2E-29 8.3E-34 231.1 16.2 241 145-402 394-635 (667) 61 COG1158 Rho Transcription term 100.0 3.9E-29 1E-33 230.5 15.5 236 150-402 161-397 (422) 62 cd01120 RecA-like_NTPases RecA 99.9 2.6E-25 6.7E-30 202.8 17.2 160 165-354 2-165 (165) 63 TIGR00767 rho transcription te 99.7 2.2E-16 5.5E-21 138.1 14.8 303 78-404 89-399 (420) 64 pfam00306 ATP-synt_ab_C ATP sy 99.4 2E-12 5.1E-17 109.3 11.2 89 384-472 1-98 (110) 65 pfam06745 KaiC KaiC. This fami 98.5 1.4E-06 3.5E-11 66.9 10.3 189 146-367 1-202 (231) 66 PRK09302 circadian clock prote 98.5 6.8E-06 1.7E-10 61.9 13.6 194 138-371 240-444 (501) 67 pfam02874 ATP-synt_ab_N ATP sy 98.5 5.4E-07 1.4E-11 69.8 7.2 68 25-92 2-69 (69) 68 PRK06067 flagellar accessory p 98.4 4.3E-06 1.1E-10 63.3 10.4 190 143-369 11-209 (241) 69 PRK04328 hypothetical protein; 98.4 9.5E-06 2.4E-10 60.8 11.0 186 144-367 4-210 (250) 70 PRK09302 circadian clock prote 98.3 2.1E-05 5.2E-10 58.4 11.9 70 144-220 4-75 (501) 71 COG0467 RAD55 RecA-superfamily 98.3 1.3E-05 3.2E-10 59.9 10.6 189 144-367 3-207 (260) 72 cd01124 KaiC KaiC is a circadi 98.0 0.00013 3.3E-09 52.5 10.7 162 165-361 2-171 (187) 73 smart00382 AAA ATPases associa 97.8 0.00017 4.4E-09 51.7 8.7 146 161-354 1-146 (148) 74 PRK09361 radB DNA repair and r 97.8 0.00017 4.4E-09 51.7 8.2 117 142-276 1-121 (224) 75 cd01123 Rad51_DMC1_radA Rad51_ 97.8 0.00043 1.1E-08 48.8 10.2 123 146-277 1-130 (235) 76 cd01394 radB RadB. The archaea 97.8 0.00069 1.8E-08 47.3 11.0 113 146-276 1-117 (218) 77 PRK08533 flagellar accessory p 97.5 0.0064 1.6E-07 40.3 13.4 178 137-361 8-192 (230) 78 cd01121 Sms Sms (bacterial rad 97.4 0.006 1.5E-07 40.5 11.3 168 144-355 62-234 (372) 79 cd01393 recA_like RecA is a b 97.2 0.0033 8.4E-08 42.4 8.9 121 146-275 1-127 (226) 80 pfam00154 RecA recA bacterial 97.1 0.004 1E-07 41.7 8.2 109 144-275 31-143 (322) 81 PRK11823 DNA repair protein Ra 97.0 0.017 4.4E-07 37.2 10.6 167 143-354 69-240 (454) 82 COG0468 RecA RecA/RadA recombi 96.9 0.016 4E-07 37.4 9.7 118 142-278 38-157 (279) 83 cd00983 recA RecA is a bacter 96.8 0.009 2.3E-07 39.2 7.9 135 144-311 34-172 (325) 84 PRK04301 radA DNA repair and r 96.8 0.0076 1.9E-07 39.7 7.2 131 137-277 76-214 (318) 85 TIGR02236 recomb_radA DNA repa 96.8 0.0023 5.8E-08 43.5 4.5 143 133-292 65-236 (333) 86 PRK12422 chromosomal replicati 96.6 0.073 1.9E-06 32.6 11.7 239 165-485 144-420 (455) 87 pfam08423 Rad51 Rad51. Rad51 i 96.6 0.0073 1.9E-07 39.8 6.0 132 136-277 15-153 (261) 88 TIGR02237 recomb_radB DNA repa 96.4 0.042 1.1E-06 34.4 9.0 173 166-364 16-199 (223) 89 TIGR00635 ruvB Holliday juncti 96.1 0.0086 2.2E-07 39.3 4.4 210 164-457 32-290 (305) 90 PRK13341 recombination factor 96.1 0.053 1.3E-06 33.6 8.4 84 156-283 46-130 (726) 91 TIGR02782 TrbB_P P-type conjug 96.0 0.029 7.5E-07 35.5 6.7 205 65-315 17-263 (315) 92 PRK09354 recA recombinase A; P 96.0 0.049 1.2E-06 33.9 7.7 135 144-311 39-177 (350) 93 pfam00931 NB-ARC NB-ARC domain 95.9 0.032 8E-07 35.2 6.5 92 164-274 21-112 (285) 94 PTZ00035 Rad51; Provisional 95.9 0.077 2E-06 32.4 8.4 132 136-277 102-240 (350) 95 PRK13342 recombination factor 95.8 0.041 1E-06 34.4 6.8 33 153-186 28-60 (417) 96 PRK03992 proteasome-activating 95.7 0.047 1.2E-06 34.0 6.5 107 73-189 82-192 (390) 97 PRK00411 cdc6 cell division co 95.5 0.083 2.1E-06 32.2 7.4 268 160-487 53-353 (394) 98 PRK13631 cbiO cobalt transport 95.4 0.1 2.6E-06 31.6 7.6 31 150-180 39-70 (320) 99 COG1066 Sms Predicted ATP-depe 95.4 0.19 4.8E-06 29.6 8.9 106 144-274 73-180 (456) 100 pfam01695 IstB IstB-like ATP b 95.4 0.026 6.6E-07 35.8 4.5 54 155-223 40-93 (178) 101 cd03238 ABC_UvrA The excision 95.4 0.073 1.9E-06 32.6 6.8 36 150-185 8-44 (176) 102 COG2256 MGS1 ATPase related to 95.0 0.08 2E-06 32.3 6.0 79 161-283 47-125 (436) 103 COG1474 CDC6 Cdc6-related prot 95.0 0.13 3.2E-06 30.9 7.0 99 156-276 33-137 (366) 104 TIGR02928 TIGR02928 orc1/cdc6 95.0 0.032 8.3E-07 35.2 3.9 126 155-307 32-170 (383) 105 pfam03796 DnaB_C DnaB-like hel 94.9 0.086 2.2E-06 32.1 5.9 42 146-187 2-44 (186) 106 pfam01637 Arch_ATPase Archaeal 94.6 0.21 5.3E-06 29.3 7.2 97 160-275 18-122 (223) 107 COG1124 DppF ABC-type dipeptid 94.5 0.024 6.1E-07 36.1 2.2 30 151-180 21-51 (252) 108 COG4172 ABC-type uncharacteriz 94.5 0.058 1.5E-06 33.3 4.1 31 150-180 300-331 (534) 109 COG2255 RuvB Holliday junction 94.2 0.083 2.1E-06 32.2 4.4 52 164-224 54-105 (332) 110 PRK09519 recA recombinase A; R 94.1 0.43 1.1E-05 27.0 7.9 20 480-499 747-766 (790) 111 pfam05496 RuvB_N Holliday junc 94.1 0.13 3.4E-06 30.7 5.3 28 254-281 93-120 (234) 112 TIGR01447 recD exodeoxyribonuc 94.0 0.094 2.4E-06 31.8 4.5 109 157-269 237-356 (753) 113 cd03291 ABCC_CFTR1 The CFTR su 93.9 0.042 1.1E-06 34.4 2.5 29 155-184 56-84 (282) 114 PRK05595 replicative DNA helic 93.8 0.15 3.7E-06 30.4 5.1 43 145-187 183-226 (444) 115 TIGR02168 SMC_prok_B chromosom 93.8 0.19 4.9E-06 29.6 5.7 23 162-185 23-45 (1191) 116 cd00544 CobU Adenosylcobinamid 93.7 0.58 1.5E-05 26.1 8.0 86 165-277 2-88 (169) 117 PRK10636 putative ABC transpor 93.7 0.04 1E-06 34.5 2.0 30 155-185 331-360 (638) 118 PRK08506 replicative DNA helic 93.6 0.15 3.7E-06 30.4 4.8 41 146-186 176-217 (473) 119 cd03253 ABCC_ATM1_transporter 93.6 0.048 1.2E-06 33.9 2.3 29 155-184 20-48 (236) 120 PRK09165 replicative DNA helic 93.5 0.18 4.6E-06 29.7 5.2 45 145-189 187-232 (484) 121 PRK09473 oppD oligopeptide tra 93.5 0.047 1.2E-06 34.0 2.1 54 131-185 7-64 (330) 122 PRK11174 cysteine/glutathione 93.5 0.05 1.3E-06 33.8 2.3 20 486-505 513-532 (588) 123 COG1222 RPT1 ATP-dependent 26S 93.4 0.19 4.7E-06 29.7 5.1 109 71-189 99-211 (406) 124 TIGR03600 phage_DnaB phage rep 93.4 0.19 4.8E-06 29.6 5.1 43 145-187 176-219 (421) 125 PRK11147 ABC transporter ATPas 93.4 0.05 1.3E-06 33.8 2.1 32 154-186 337-368 (632) 126 PRK08082 consensus 93.3 0.21 5.5E-06 29.2 5.3 43 145-187 185-228 (453) 127 PRK10733 hflB ATP-dependent me 93.3 0.086 2.2E-06 32.1 3.2 63 165-233 188-250 (644) 128 TIGR02868 CydC ABC transporter 93.3 0.055 1.4E-06 33.5 2.2 39 139-183 368-407 (566) 129 pfam07088 GvpD GvpD gas vesicl 93.3 0.27 6.9E-06 28.5 5.7 161 155-345 3-172 (484) 130 CHL00176 ftsH cell division pr 93.2 0.047 1.2E-06 34.0 1.8 22 165-187 213-234 (631) 131 PRK11819 putative ABC transpor 93.2 0.057 1.4E-06 33.4 2.2 25 155-179 26-50 (556) 132 PRK06749 replicative DNA helic 93.2 0.2 5E-06 29.5 4.9 43 145-187 168-211 (428) 133 PRK08006 replicative DNA helic 93.2 0.21 5.3E-06 29.3 5.0 43 145-187 206-249 (471) 134 PRK05748 replicative DNA helic 93.1 0.093 2.4E-06 31.8 3.2 42 146-187 186-228 (448) 135 PRK06904 replicative DNA helic 93.1 0.25 6.5E-06 28.7 5.4 43 146-188 204-247 (472) 136 pfam02283 CobU Cobinamide kina 93.1 0.46 1.2E-05 26.8 6.7 86 165-277 1-87 (166) 137 TIGR02858 spore_III_AA stage I 93.1 0.14 3.5E-06 30.6 4.0 152 149-325 109-280 (282) 138 cd03251 ABCC_MsbA MsbA is an e 93.1 0.06 1.5E-06 33.2 2.2 29 155-184 21-49 (234) 139 pfam04665 Pox_A32 Poxvirus A32 93.1 0.72 1.8E-05 25.4 8.0 144 162-334 13-162 (241) 140 cd03223 ABCD_peroxisomal_ALDP 93.1 0.063 1.6E-06 33.1 2.2 29 154-183 19-47 (166) 141 COG1116 TauB ABC-type nitrate/ 93.0 0.065 1.7E-06 32.9 2.3 33 150-183 16-49 (248) 142 cd03228 ABCC_MRP_Like The MRP 93.0 0.061 1.6E-06 33.2 2.1 28 156-184 22-49 (171) 143 CHL00195 ycf46 Ycf46; Provisio 93.0 0.2 5.1E-06 29.4 4.7 24 164-188 261-284 (491) 144 cd03250 ABCC_MRP_domain1 Domai 93.0 0.066 1.7E-06 32.9 2.3 29 155-184 24-52 (204) 145 PRK13648 cbiO cobalt transport 92.9 0.068 1.7E-06 32.8 2.3 28 155-183 28-55 (269) 146 cd03289 ABCC_CFTR2 The CFTR su 92.9 0.071 1.8E-06 32.7 2.3 30 154-184 22-51 (275) 147 KOG4658 consensus 92.9 0.77 2E-05 25.2 8.0 101 152-273 170-272 (889) 148 TIGR00972 3a0107s01c2 phosphat 92.9 0.068 1.7E-06 32.8 2.2 24 156-179 21-44 (248) 149 cd03248 ABCC_TAP TAP, the Tran 92.8 0.067 1.7E-06 32.9 2.1 30 154-184 32-61 (226) 150 COG1126 GlnQ ABC-type polar am 92.8 0.11 2.9E-06 31.2 3.2 170 157-353 23-217 (240) 151 cd03369 ABCC_NFT1 Domain 2 of 92.8 0.073 1.9E-06 32.6 2.2 31 155-186 27-57 (207) 152 PRK10522 multidrug transporter 92.7 0.076 1.9E-06 32.5 2.3 21 158-178 345-365 (547) 153 PRK10419 nikE nickel transport 92.7 0.07 1.8E-06 32.7 2.1 33 150-183 25-58 (266) 154 PRK13657 cyclic beta-1,2-gluca 92.7 0.075 1.9E-06 32.5 2.2 23 157-179 356-378 (585) 155 PRK10636 putative ABC transpor 92.7 0.072 1.8E-06 32.6 2.1 29 154-183 19-47 (638) 156 pfam05673 DUF815 Protein of un 92.7 0.65 1.6E-05 25.7 7.0 24 162-186 53-76 (248) 157 pfam00004 AAA ATPase family as 92.7 0.25 6.5E-06 28.7 4.9 22 165-187 1-22 (131) 158 cd04147 Ras_dva Ras-dva subfam 92.6 0.13 3.3E-06 30.8 3.3 20 164-183 1-20 (198) 159 cd03290 ABCC_SUR1_N The SUR do 92.6 0.078 2E-06 32.4 2.2 29 155-184 20-48 (218) 160 PRK13633 cobalt transporter AT 92.6 0.076 1.9E-06 32.5 2.1 29 152-180 26-55 (281) 161 PRK11308 dppF dipeptide transp 92.6 0.053 1.4E-06 33.6 1.3 31 150-180 28-59 (327) 162 cd03249 ABC_MTABC3_MDL1_MDL2 M 92.6 0.083 2.1E-06 32.2 2.3 30 154-184 21-50 (238) 163 PRK08694 consensus 92.6 0.16 4.2E-06 30.1 3.8 44 145-188 200-244 (468) 164 cd03257 ABC_NikE_OppD_transpor 92.5 0.077 2E-06 32.4 2.1 32 151-183 19-51 (228) 165 cd03270 ABC_UvrA_I The excisio 92.5 0.09 2.3E-06 31.9 2.4 30 155-184 14-43 (226) 166 PRK11160 cysteine/glutathione 92.5 0.083 2.1E-06 32.2 2.2 20 486-505 504-523 (575) 167 cd03246 ABCC_Protease_Secretio 92.5 0.083 2.1E-06 32.2 2.2 30 154-184 20-49 (173) 168 cd03298 ABC_ThiQ_thiamine_tran 92.5 0.069 1.8E-06 32.8 1.8 31 153-184 15-45 (211) 169 PRK08840 replicative DNA helic 92.4 0.14 3.6E-06 30.5 3.4 43 145-187 199-242 (464) 170 KOG1353 consensus 92.4 0.17 4.3E-06 29.9 3.8 35 228-262 79-113 (340) 171 KOG0733 consensus 92.4 0.87 2.2E-05 24.8 7.5 141 165-311 226-409 (802) 172 PRK13642 cbiO cobalt transport 92.4 0.079 2E-06 32.4 2.0 33 148-180 18-51 (277) 173 PRK11022 dppD dipeptide transp 92.4 0.081 2.1E-06 32.3 2.1 35 150-185 20-55 (327) 174 PRK00080 ruvB Holliday junctio 92.4 0.25 6.3E-06 28.8 4.5 51 164-223 53-103 (328) 175 PRK13635 cbiO cobalt transport 92.4 0.082 2.1E-06 32.2 2.1 27 154-180 25-51 (279) 176 cd03247 ABCC_cytochrome_bd The 92.4 0.083 2.1E-06 32.2 2.1 30 155-185 21-50 (178) 177 cd03245 ABCC_bacteriocin_expor 92.3 0.086 2.2E-06 32.1 2.1 29 155-184 23-51 (220) 178 PRK07263 consensus 92.3 0.15 3.8E-06 30.3 3.4 42 146-187 186-228 (453) 179 TIGR03608 L_ocin_972_ABC putat 92.3 0.089 2.3E-06 32.0 2.2 28 155-183 17-44 (206) 180 PRK11176 lipid transporter ATP 92.3 0.077 2E-06 32.4 1.9 26 157-183 363-388 (581) 181 cd00009 AAA The AAA+ (ATPases 92.2 0.39 9.9E-06 27.3 5.4 25 161-186 18-42 (151) 182 PRK10247 putative ABC transpor 92.2 0.097 2.5E-06 31.7 2.3 30 154-184 25-54 (225) 183 cd03288 ABCC_SUR2 The SUR doma 92.2 0.095 2.4E-06 31.8 2.2 29 155-184 40-68 (257) 184 cd03254 ABCC_Glucan_exporter_l 92.2 0.095 2.4E-06 31.8 2.2 30 155-185 22-51 (229) 185 cd03294 ABC_Pro_Gly_Bertaine T 92.1 0.089 2.3E-06 32.0 2.0 29 154-183 42-70 (269) 186 PRK13632 cbiO cobalt transport 92.1 0.092 2.4E-06 31.9 2.1 29 152-180 25-54 (273) 187 PRK10789 putative multidrug tr 92.1 0.11 2.7E-06 31.4 2.4 27 157-184 336-362 (569) 188 TIGR01277 thiQ thiamine ABC tr 92.1 0.091 2.3E-06 31.9 2.0 27 156-183 18-44 (213) 189 PRK05800 cobU adenosylcobinami 92.1 0.63 1.6E-05 25.8 6.3 89 162-277 1-91 (170) 190 PRK08760 replicative DNA helic 92.1 0.16 4.2E-06 30.0 3.3 43 145-187 211-254 (476) 191 PRK10744 phosphate transporter 92.1 0.1 2.6E-06 31.5 2.3 49 133-183 7-56 (257) 192 PRK11147 ABC transporter ATPas 92.1 0.099 2.5E-06 31.6 2.2 23 157-179 24-46 (632) 193 CHL00181 cbbX CbbX; Provisiona 92.0 0.58 1.5E-05 26.1 6.1 62 155-218 52-119 (287) 194 PRK10584 putative ABC transpor 92.0 0.093 2.4E-06 31.8 2.0 48 136-184 6-57 (228) 195 TIGR03015 pepcterm_ATPase puta 92.0 0.98 2.5E-05 24.4 8.7 124 164-310 45-172 (269) 196 PRK13640 cbiO cobalt transport 92.0 0.098 2.5E-06 31.7 2.1 29 152-180 23-52 (283) 197 cd03262 ABC_HisP_GlnQ_permease 92.0 0.1 2.5E-06 31.6 2.1 28 155-183 19-46 (213) 198 PRK07004 replicative DNA helic 92.0 0.18 4.5E-06 29.8 3.4 42 146-187 196-238 (460) 199 COG0444 DppD ABC-type dipeptid 91.9 0.11 2.9E-06 31.2 2.4 33 150-182 18-51 (316) 200 PRK06321 replicative DNA helic 91.9 0.35 9E-06 27.6 4.9 44 145-188 208-252 (472) 201 PRK13650 cbiO cobalt transport 91.9 0.1 2.6E-06 31.5 2.1 29 154-183 22-50 (276) 202 PRK05636 replicative DNA helic 91.9 0.32 8.2E-06 27.9 4.6 57 274-334 204-260 (507) 203 KOG0057 consensus 91.9 0.11 2.8E-06 31.3 2.2 27 157-184 373-399 (591) 204 PRK13637 cbiO cobalt transport 91.8 0.097 2.5E-06 31.7 1.9 31 150-180 20-51 (287) 205 cd03295 ABC_OpuCA_Osmoprotecti 91.8 0.1 2.7E-06 31.5 2.0 33 150-183 14-47 (242) 206 cd03229 ABC_Class3 This class 91.8 0.11 2.9E-06 31.3 2.2 32 151-183 14-46 (178) 207 cd01122 GP4d_helicase GP4d_hel 91.8 0.72 1.8E-05 25.4 6.3 65 144-215 11-76 (271) 208 cd04125 RabA_like RabA-like su 91.8 0.3 7.7E-06 28.1 4.4 19 164-182 2-20 (188) 209 pfam07475 Hpr_kinase_C HPr Ser 91.7 0.42 1.1E-05 27.0 5.1 61 160-226 16-81 (171) 210 PRK13634 cbiO cobalt transport 91.7 0.1 2.7E-06 31.5 2.0 30 151-180 8-38 (276) 211 cd03261 ABC_Org_Solvent_Resist 91.7 0.12 3E-06 31.1 2.2 32 151-183 14-46 (235) 212 cd03300 ABC_PotA_N PotA is an 91.7 0.12 3E-06 31.1 2.2 33 150-183 13-46 (232) 213 PRK10790 putative multidrug tr 91.7 0.13 3.2E-06 30.9 2.3 23 157-179 362-384 (593) 214 TIGR02769 nickel_nikE nickel i 91.6 0.1 2.6E-06 31.6 1.8 97 140-258 21-117 (267) 215 cd03293 ABC_NrtD_SsuB_transpor 91.6 0.11 2.9E-06 31.2 2.1 33 150-183 17-50 (220) 216 KOG0060 consensus 91.6 0.11 2.7E-06 31.4 1.9 25 409-436 570-594 (659) 217 TIGR03420 DnaA_homol_Hda DnaA 91.6 0.41 1.1E-05 27.1 4.9 47 161-215 37-83 (226) 218 PRK06526 transposase; Provisio 91.6 0.18 4.7E-06 29.7 3.1 54 155-223 91-144 (254) 219 COG3638 ABC-type phosphate/pho 91.6 0.47 1.2E-05 26.7 5.2 62 149-219 19-82 (258) 220 TIGR02746 TraC-F-type type-IV 91.6 0.32 8.3E-06 27.9 4.4 61 150-221 474-537 (900) 221 cd03244 ABCC_MRP_domain2 Domai 91.6 0.12 3.2E-06 30.9 2.2 29 155-184 23-51 (221) 222 pfam07693 KAP_NTPase KAP famil 91.6 0.79 2E-05 25.1 6.3 26 162-188 20-45 (301) 223 PRK08084 DNA replication initi 91.5 0.3 7.5E-06 28.2 4.1 51 161-219 44-94 (235) 224 COG2274 SunT ABC-type bacterio 91.5 0.13 3.4E-06 30.7 2.3 41 342-382 464-513 (709) 225 PRK11701 phnK phosphonates tra 91.5 0.13 3.3E-06 30.8 2.3 35 149-184 18-53 (258) 226 TIGR03375 type_I_sec_LssB type 91.4 0.13 3.2E-06 30.9 2.2 22 485-506 628-649 (694) 227 cd03226 ABC_cobalt_CbiO_domain 91.4 0.14 3.5E-06 30.6 2.3 34 150-184 13-47 (205) 228 PRK10575 iron-hydroxamate tran 91.3 0.13 3.4E-06 30.7 2.2 31 152-183 26-57 (265) 229 PRK08118 topology modulation p 91.3 0.12 3E-06 31.1 1.9 99 163-284 2-100 (167) 230 PRK08181 transposase; Validate 91.3 0.21 5.4E-06 29.2 3.2 50 159-223 103-152 (269) 231 PRK11248 tauB taurine transpor 91.3 0.13 3.2E-06 30.9 2.1 33 150-183 14-47 (255) 232 PRK10771 thiQ thiamine transpo 91.3 0.13 3.4E-06 30.7 2.2 29 155-184 18-46 (233) 233 TIGR03598 GTPase_YsxC ribosome 91.3 1.1 2.9E-05 23.9 7.2 30 157-187 13-42 (179) 234 TIGR02211 LolD_lipo_ex lipopro 91.3 0.12 3.2E-06 30.9 2.0 38 142-179 7-48 (221) 235 cd03252 ABCC_Hemolysin The ABC 91.3 0.13 3.4E-06 30.7 2.1 28 156-184 22-49 (237) 236 cd01865 Rab3 Rab3 subfamily. 91.2 0.4 1E-05 27.2 4.6 18 164-181 3-20 (165) 237 PRK09183 transposase/IS protei 91.2 0.21 5.5E-06 29.2 3.2 51 158-223 97-147 (258) 238 PRK11264 putative amino-acid A 91.2 0.14 3.6E-06 30.5 2.2 27 156-183 21-47 (248) 239 cd03255 ABC_MJ0796_Lo1CDE_FtsE 91.2 0.14 3.5E-06 30.6 2.1 29 154-183 22-50 (218) 240 cd03258 ABC_MetN_methionine_tr 91.2 0.13 3.2E-06 30.9 1.9 31 150-180 18-49 (233) 241 PRK13646 cbiO cobalt transport 91.2 0.14 3.6E-06 30.5 2.2 30 151-180 21-51 (286) 242 PRK10070 glycine betaine trans 91.1 0.14 3.5E-06 30.6 2.1 35 148-183 38-74 (400) 243 TIGR01846 type_I_sec_HlyB type 91.1 0.15 3.8E-06 30.4 2.2 27 154-180 483-509 (703) 244 PRK13543 cytochrome c biogenes 91.1 0.15 3.8E-06 30.4 2.2 33 150-183 24-57 (214) 245 TIGR03265 PhnT2 putative 2-ami 91.1 0.14 3.6E-06 30.5 2.1 34 149-183 16-50 (353) 246 PRK10938 putative molybdenum t 91.1 0.14 3.5E-06 30.6 2.1 23 157-179 24-46 (490) 247 cd03256 ABC_PhnC_transporter A 91.0 0.15 3.7E-06 30.4 2.1 35 148-183 12-47 (241) 248 TIGR02203 MsbA_lipidA lipid A 91.0 0.18 4.5E-06 29.8 2.5 32 148-179 373-405 (603) 249 cd03231 ABC_CcmA_heme_exporter 91.0 0.19 4.9E-06 29.6 2.7 31 155-186 19-49 (201) 250 PRK04195 replication factor C 90.9 0.77 2E-05 25.2 5.8 110 165-286 43-157 (403) 251 PRK11124 artP arginine transpo 90.9 0.15 3.9E-06 30.2 2.2 27 154-180 20-46 (242) 252 PRK09452 potA putrescine/sperm 90.9 0.15 3.8E-06 30.4 2.1 34 149-183 29-63 (378) 253 cd03292 ABC_FtsE_transporter F 90.9 0.098 2.5E-06 31.7 1.2 37 146-183 10-47 (214) 254 TIGR02785 addA_Gpos recombinat 90.9 0.2 5.1E-06 29.4 2.8 245 160-448 12-296 (1295) 255 cd03222 ABC_RNaseL_inhibitor T 90.9 0.14 3.5E-06 30.6 1.9 32 152-184 15-46 (177) 256 PRK07773 replicative DNA helic 90.9 0.69 1.8E-05 25.5 5.5 54 146-205 186-240 (868) 257 TIGR03258 PhnT 2-aminoethylpho 90.9 0.15 3.9E-06 30.3 2.1 32 151-183 19-51 (362) 258 PRK13542 consensus 90.9 0.16 4.2E-06 30.1 2.3 34 149-183 30-64 (224) 259 cd03260 ABC_PstB_phosphate_tra 90.8 0.16 4.2E-06 30.1 2.3 28 155-183 19-46 (227) 260 PRK11819 putative ABC transpor 90.8 0.16 4E-06 30.2 2.1 32 454-485 474-510 (556) 261 cd04144 Ras2 Ras2 subfamily. 90.8 0.58 1.5E-05 26.0 5.1 20 164-183 1-20 (190) 262 PRK13651 cobalt transporter AT 90.8 0.17 4.3E-06 30.0 2.3 31 150-180 20-51 (304) 263 PRK09544 znuC high-affinity zi 90.8 0.17 4.3E-06 29.9 2.3 30 155-185 23-52 (251) 264 PRK13652 cbiO cobalt transport 90.8 0.15 3.9E-06 30.3 2.0 35 148-183 15-50 (277) 265 cd03271 ABC_UvrA_II The excisi 90.8 0.17 4.3E-06 30.0 2.3 24 157-180 16-39 (261) 266 cd03235 ABC_Metallic_Cations A 90.7 0.17 4.3E-06 30.0 2.2 34 151-185 13-47 (213) 267 PRK13644 cbiO cobalt transport 90.7 0.16 4.2E-06 30.1 2.1 32 149-180 14-46 (274) 268 cd03299 ABC_ModC_like Archeal 90.7 0.16 4.1E-06 30.1 2.1 30 154-184 17-46 (235) 269 PRK11831 putative ABC transpor 90.7 0.16 4.1E-06 30.1 2.1 29 155-184 27-55 (269) 270 PRK13649 cbiO cobalt transport 90.7 0.16 4E-06 30.2 2.1 27 154-180 25-51 (280) 271 PRK11629 lolD lipoprotein tran 90.7 0.17 4.3E-06 30.0 2.2 29 154-183 27-55 (233) 272 PRK13647 cbiO cobalt transport 90.7 0.16 4.1E-06 30.1 2.1 34 149-183 17-51 (273) 273 pfam09378 HAS-barrel HAS barre 90.7 1.2 3E-05 23.8 6.5 43 27-70 4-49 (88) 274 PRK11432 fbpC ferric transport 90.7 0.15 3.8E-06 30.3 1.9 32 151-183 20-52 (351) 275 cd03234 ABCG_White The White s 90.6 0.16 4.2E-06 30.0 2.1 35 150-185 20-55 (226) 276 TIGR01842 type_I_sec_PrtD type 90.6 0.2 5E-06 29.5 2.5 34 142-180 341-374 (556) 277 TIGR02315 ABC_phnC phosphonate 90.6 0.16 4.1E-06 30.1 2.0 182 148-355 13-239 (253) 278 PRK00635 excinuclease ABC subu 90.6 0.49 1.3E-05 26.6 4.5 37 148-186 949-985 (1809) 279 PRK12288 ribosome-associated G 90.5 1.1 2.7E-05 24.1 6.2 27 153-179 198-224 (344) 280 PRK11650 ugpC glycerol-3-phosp 90.5 0.16 4.2E-06 30.0 2.1 33 150-183 17-50 (358) 281 TIGR03415 ABC_choXWV_ATP choli 90.5 0.17 4.4E-06 29.9 2.1 36 147-183 33-70 (382) 282 PRK13538 cytochrome c biogenes 90.5 0.19 4.8E-06 29.6 2.3 29 155-184 20-48 (204) 283 cd04148 RGK RGK subfamily. Th 90.5 0.88 2.2E-05 24.8 5.7 18 164-181 2-19 (221) 284 TIGR00630 uvra excinuclease AB 90.4 0.22 5.6E-06 29.1 2.6 123 147-322 652-781 (956) 285 cd04124 RabL2 RabL2 subfamily. 90.4 0.53 1.3E-05 26.4 4.6 19 164-182 2-20 (161) 286 PRK07261 topology modulation p 90.4 1.4 3.5E-05 23.4 7.6 110 164-296 2-117 (171) 287 CHL00131 ycf16 sulfate ABC tra 90.4 0.19 4.9E-06 29.6 2.3 33 151-184 20-53 (252) 288 cd03296 ABC_CysA_sulfate_impor 90.4 0.18 4.5E-06 29.8 2.1 34 149-183 14-48 (239) 289 COG4987 CydC ABC-type transpor 90.4 0.19 4.9E-06 29.6 2.3 28 155-183 357-384 (573) 290 cd03225 ABC_cobalt_CbiO_domain 90.4 0.19 4.8E-06 29.6 2.2 30 154-184 19-48 (211) 291 PRK11231 fecE iron-dicitrate t 90.3 0.19 4.7E-06 29.7 2.2 34 151-185 16-50 (255) 292 PRK13546 teichoic acids export 90.3 0.19 4.7E-06 29.6 2.2 34 150-184 37-71 (264) 293 PRK09493 glnQ glutamine ABC tr 90.3 0.19 4.9E-06 29.5 2.2 31 154-185 19-49 (240) 294 cd03301 ABC_MalK_N The N-termi 90.2 0.2 5E-06 29.5 2.2 33 150-183 13-46 (213) 295 PRK10418 nikD nickel transport 90.2 0.24 6.1E-06 28.9 2.6 33 152-185 18-51 (254) 296 TIGR03411 urea_trans_UrtD urea 90.2 0.17 4.3E-06 29.9 1.9 36 148-184 13-49 (242) 297 TIGR03269 met_CoM_red_A2 methy 90.2 0.18 4.5E-06 29.8 2.0 23 157-179 21-43 (520) 298 PRK13540 cytochrome c biogenes 90.1 0.21 5.3E-06 29.3 2.3 29 155-184 20-48 (200) 299 cd04112 Rab26 Rab26 subfamily. 90.1 0.67 1.7E-05 25.6 4.9 19 164-182 2-20 (191) 300 PRK10619 histidine/lysine/argi 90.1 0.2 5.1E-06 29.4 2.2 28 155-183 24-51 (257) 301 TIGR02640 gas_vesic_GvpN gas v 90.1 0.18 4.6E-06 29.8 1.9 222 162-460 21-252 (265) 302 cd03237 ABC_RNaseL_inhibitor_d 90.1 0.19 4.8E-06 29.6 2.1 26 158-184 21-46 (246) 303 TIGR00750 lao LAO/AO transport 90.1 0.67 1.7E-05 25.6 4.9 27 153-179 28-55 (333) 304 PRK11247 ssuB aliphatic sulfon 90.1 0.19 4.9E-06 29.6 2.0 29 155-184 31-59 (257) 305 PRK10982 galactose/methyl gala 90.1 0.17 4.4E-06 29.9 1.8 11 262-272 247-257 (491) 306 cd03217 ABC_FeS_Assembly ABC-t 90.0 0.21 5.4E-06 29.2 2.3 34 150-184 13-47 (200) 307 PRK13643 cbiO cobalt transport 90.0 0.19 4.8E-06 29.6 2.0 32 149-180 18-50 (288) 308 PRK04841 transcriptional regul 90.0 1.5 3.8E-05 23.1 7.8 129 134-289 14-150 (903) 309 COG0488 Uup ATPase components 89.9 0.26 6.5E-06 28.6 2.6 32 154-186 21-52 (530) 310 COG4608 AppF ABC-type oligopep 89.9 0.17 4.4E-06 29.9 1.8 31 150-180 26-57 (268) 311 cd03220 ABC_KpsT_Wzt ABC_KpsT_ 89.9 0.21 5.4E-06 29.3 2.2 35 149-184 34-69 (224) 312 PRK03695 vitamin B12-transport 89.9 0.21 5.4E-06 29.3 2.1 30 154-184 15-44 (245) 313 cd03221 ABCF_EF-3 ABCF_EF-3 E 89.8 0.22 5.7E-06 29.1 2.2 29 156-185 20-48 (144) 314 PRK09580 sufC cysteine desulfu 89.8 0.22 5.7E-06 29.1 2.3 27 157-184 22-48 (248) 315 PRK10851 sulfate/thiosulfate t 89.8 0.21 5.4E-06 29.3 2.1 33 151-184 16-49 (352) 316 TIGR02142 modC_ABC molybdate A 89.8 0.18 4.6E-06 29.7 1.8 29 151-179 12-40 (361) 317 PRK11000 maltose/maltodextrin 89.8 0.21 5.3E-06 29.3 2.1 33 150-183 16-49 (369) 318 PRK09984 phosphonate/organopho 89.8 0.22 5.6E-06 29.1 2.2 28 155-183 23-50 (262) 319 PRK13547 hmuV hemin importer A 89.8 0.21 5.5E-06 29.2 2.1 29 155-184 20-48 (273) 320 PRK13645 cbiO cobalt transport 89.8 0.21 5.4E-06 29.2 2.1 32 149-180 23-55 (289) 321 COG4178 ABC-type uncharacteriz 89.7 0.23 5.9E-06 28.9 2.3 20 159-178 416-435 (604) 322 cd04120 Rab12 Rab12 subfamily. 89.7 0.64 1.6E-05 25.8 4.5 20 164-183 2-21 (202) 323 TIGR03263 guanyl_kin guanylate 89.7 1.4 3.6E-05 23.3 6.3 85 162-269 1-99 (180) 324 PRK10253 iron-enterobactin tra 89.7 0.2 5E-06 29.5 1.9 35 150-185 20-55 (265) 325 PRK13639 cbiO cobalt transport 89.7 0.22 5.6E-06 29.1 2.1 32 149-180 14-46 (275) 326 PRK13539 cytochrome c biogenes 89.7 0.23 6E-06 28.9 2.3 31 155-186 21-51 (206) 327 PRK10261 glutathione transport 89.6 0.29 7.3E-06 28.3 2.7 31 150-180 29-60 (623) 328 PRK11607 potG putrescine trans 89.6 0.22 5.5E-06 29.2 2.0 46 137-184 20-66 (377) 329 cd00267 ABC_ATPase ABC (ATP-bi 89.6 0.25 6.2E-06 28.8 2.3 30 155-185 18-47 (157) 330 cd03297 ABC_ModC_molybdenum_tr 89.6 0.21 5.5E-06 29.2 2.0 28 154-183 16-43 (214) 331 pfam00580 UvrD-helicase UvrD/R 89.5 0.88 2.2E-05 24.8 5.1 28 161-188 12-39 (494) 332 cd01895 EngA2 EngA2 subfamily. 89.5 1.1 2.9E-05 23.9 5.7 23 163-186 3-25 (174) 333 COG0488 Uup ATPase components 89.5 0.26 6.6E-06 28.6 2.4 31 454-484 468-503 (530) 334 cd03236 ABC_RNaseL_inhibitor_d 89.4 0.25 6.3E-06 28.8 2.2 24 160-184 24-47 (255) 335 cd03224 ABC_TM1139_LivF_branch 89.3 0.26 6.7E-06 28.6 2.3 34 150-184 13-47 (222) 336 PRK00349 uvrA excinuclease ABC 89.3 0.36 9.1E-06 27.6 3.0 29 206-234 520-548 (944) 337 COG1127 Ttg2A ABC-type transpo 89.3 0.32 8.2E-06 27.9 2.7 32 155-187 27-58 (263) 338 COG3842 PotA ABC-type spermidi 89.3 0.26 6.7E-06 28.6 2.3 45 137-183 6-51 (352) 339 KOG0651 consensus 89.2 0.53 1.4E-05 26.3 3.8 136 71-222 81-223 (388) 340 TIGR00929 VirB4_CagE type IV s 89.2 0.32 8.2E-06 27.9 2.7 52 164-222 518-569 (931) 341 cd01918 HprK_C HprK/P, the bif 89.1 0.57 1.5E-05 26.1 3.9 25 161-185 13-37 (149) 342 PRK11153 metN DL-methionine tr 89.1 0.26 6.6E-06 28.6 2.1 33 150-183 18-51 (343) 343 COG3839 MalK ABC-type sugar tr 89.1 0.36 9.3E-06 27.5 2.9 28 155-183 22-49 (338) 344 COG4619 ABC-type uncharacteriz 89.1 0.35 8.9E-06 27.7 2.8 36 148-184 14-50 (223) 345 PRK11300 livG leucine/isoleuci 89.1 0.27 7E-06 28.4 2.2 36 148-184 16-52 (255) 346 COG1132 MdlB ABC-type multidru 89.1 0.27 7E-06 28.4 2.2 29 155-184 348-376 (567) 347 cd01862 Rab7 Rab7 subfamily. 89.1 1.7 4.4E-05 22.6 6.8 20 164-183 2-21 (172) 348 PRK13548 hmuV hemin importer A 89.1 0.28 7.2E-06 28.3 2.3 34 150-184 15-49 (257) 349 TIGR00954 3a01203 Peroxysomal 89.0 0.28 7E-06 28.4 2.2 36 144-179 536-573 (788) 350 cd01864 Rab19 Rab19 subfamily. 88.9 0.74 1.9E-05 25.3 4.4 19 164-182 5-23 (165) 351 PRK13636 cbiO cobalt transport 88.9 0.28 7.1E-06 28.4 2.2 35 148-183 17-52 (285) 352 TIGR01184 ntrCD nitrate ABC tr 88.9 0.32 8.1E-06 28.0 2.5 137 156-325 5-170 (230) 353 cd03215 ABC_Carb_Monos_II This 88.9 0.29 7.3E-06 28.3 2.2 31 152-183 15-46 (182) 354 TIGR01193 bacteriocin_ABC ABC- 88.9 0.27 6.9E-06 28.5 2.1 103 150-265 487-606 (710) 355 TIGR00416 sms DNA repair prote 88.9 0.59 1.5E-05 26.0 3.8 145 145-314 84-238 (481) 356 PRK07952 DNA replication prote 88.9 0.67 1.7E-05 25.6 4.1 42 144-189 80-122 (242) 357 KOG0080 consensus 88.8 0.51 1.3E-05 26.4 3.5 23 164-186 13-35 (209) 358 PRK10908 cell division protein 88.8 0.27 6.8E-06 28.5 2.0 33 150-183 15-48 (222) 359 COG1134 TagH ABC-type polysacc 88.7 0.3 7.5E-06 28.2 2.2 26 154-179 45-70 (249) 360 COG1136 SalX ABC-type antimicr 88.7 0.3 7.7E-06 28.1 2.3 30 150-179 18-48 (226) 361 TIGR02770 nickel_nikD nickel i 88.7 0.45 1.1E-05 26.9 3.1 172 153-356 3-217 (239) 362 KOG1433 consensus 88.6 1.8 4.7E-05 22.4 7.2 121 138-274 85-212 (326) 363 PRK11614 livF leucine/isoleuci 88.6 0.3 7.8E-06 28.1 2.2 35 150-185 18-53 (237) 364 KOG0730 consensus 88.6 0.25 6.3E-06 28.7 1.8 69 162-235 218-286 (693) 365 cd03214 ABC_Iron-Siderophores_ 88.4 0.33 8.4E-06 27.8 2.3 30 155-185 18-47 (180) 366 PRK10895 putative ABC transpor 88.3 0.32 8.2E-06 27.9 2.2 34 150-184 16-50 (241) 367 PTZ00265 multidrug resistance 88.3 0.35 9E-06 27.6 2.4 33 150-183 398-431 (1467) 368 cd00984 DnaB_C DnaB helicase C 88.3 1.8 4.5E-05 22.6 5.9 30 159-188 10-39 (242) 369 TIGR02528 EutP ethanolamine ut 88.2 0.56 1.4E-05 26.2 3.4 71 164-240 2-77 (144) 370 COG1123 ATPase components of v 88.2 0.34 8.6E-06 27.8 2.2 30 154-184 27-56 (539) 371 TIGR02857 CydD ABC transporter 88.2 0.31 8E-06 28.0 2.1 83 102-187 310-402 (570) 372 cd04102 RabL3 RabL3 (Rab-like3 88.2 0.27 7E-06 28.4 1.7 20 164-183 2-21 (202) 373 TIGR02324 CP_lyasePhnL phospho 88.1 0.4 1E-05 27.2 2.6 117 155-300 27-151 (224) 374 PRK00300 gmk guanylate kinase; 88.1 2 5E-05 22.2 7.3 89 159-269 4-106 (208) 375 COG4988 CydD ABC-type transpor 88.0 0.36 9.2E-06 27.5 2.3 33 154-187 339-371 (559) 376 PRK09536 btuD corrinoid ABC tr 88.0 0.32 8.1E-06 28.0 2.0 36 150-186 15-51 (409) 377 PRK13641 cbiO cobalt transport 88.0 0.3 7.7E-06 28.1 1.9 30 151-180 21-51 (286) 378 COG0003 ArsA Predicted ATPase 87.9 2 5.2E-05 22.1 8.4 22 162-183 1-23 (322) 379 COG1484 DnaC DNA replication p 87.9 1.3 3.3E-05 23.5 5.1 48 161-223 104-151 (254) 380 TIGR01448 recD_rel helicase, R 87.9 1.3 3.4E-05 23.4 5.1 75 35-114 33-116 (769) 381 cd03232 ABC_PDR_domain2 The pl 87.8 0.36 9.2E-06 27.5 2.2 30 154-184 25-54 (192) 382 pfam05729 NACHT NACHT domain. 87.8 0.49 1.3E-05 26.6 2.9 22 163-184 1-22 (165) 383 COG0563 Adk Adenylate kinase a 87.8 2 5.2E-05 22.1 9.6 101 163-287 1-102 (178) 384 COG1135 AbcC ABC-type metal io 87.7 0.44 1.1E-05 27.0 2.6 72 148-227 17-96 (339) 385 PRK13409 putative ATPase RIL; 87.7 0.36 9.2E-06 27.6 2.1 20 160-179 97-116 (590) 386 PRK05642 DNA replication initi 87.7 0.55 1.4E-05 26.2 3.1 23 165-188 48-70 (234) 387 PRK10938 putative molybdenum t 87.6 0.37 9.5E-06 27.5 2.2 33 82-117 26-58 (490) 388 COG2607 Predicted ATPase (AAA+ 87.5 2.1 5.4E-05 22.0 6.5 73 164-282 87-161 (287) 389 COG1122 CbiO ABC-type cobalt t 87.5 0.41 1E-05 27.2 2.3 27 154-180 22-48 (235) 390 PRK06217 hypothetical protein; 87.5 0.34 8.7E-06 27.7 1.9 150 163-346 2-174 (185) 391 KOG0743 consensus 87.5 0.83 2.1E-05 24.9 3.9 87 99-186 148-258 (457) 392 cd03233 ABC_PDR_domain1 The pl 87.5 0.4 1E-05 27.3 2.2 29 155-184 26-54 (202) 393 PRK10246 exonuclease subunit S 87.5 0.19 4.8E-06 29.6 0.5 31 152-183 19-50 (1047) 394 PRK13638 cbiO cobalt transport 87.5 0.39 9.9E-06 27.3 2.2 28 156-184 21-48 (271) 395 cd03269 ABC_putative_ATPase Th 87.5 0.54 1.4E-05 26.3 2.9 36 150-186 13-49 (210) 396 pfam06068 TIP49 TIP49 C-termin 87.4 0.46 1.2E-05 26.8 2.5 21 162-182 50-70 (395) 397 cd03259 ABC_Carb_Solutes_like 87.4 0.4 1E-05 27.3 2.2 33 150-183 13-46 (213) 398 PRK00454 engB GTPase EngB; Rev 87.4 2.2 5.5E-05 21.9 8.1 40 146-187 9-48 (196) 399 TIGR02397 dnaX_nterm DNA polym 87.4 0.26 6.6E-06 28.6 1.2 104 166-291 40-146 (363) 400 COG1493 HprK Serine kinase of 87.4 0.63 1.6E-05 25.8 3.2 29 155-184 139-167 (308) 401 PRK10535 macrolide transporter 87.4 0.39 1E-05 27.3 2.2 22 157-178 29-50 (648) 402 TIGR01073 pcrA ATP-dependent D 87.3 1.3 3.4E-05 23.5 4.8 26 165-190 20-45 (811) 403 PRK09518 bifunctional cytidyla 87.1 1.6 4.1E-05 22.8 5.2 56 251-309 377-435 (714) 404 TIGR02982 heterocyst_DevA ABC 87.1 0.44 1.1E-05 27.0 2.3 173 154-350 23-219 (220) 405 PRK13541 cytochrome c biogenes 87.1 0.56 1.4E-05 26.2 2.8 32 153-185 17-48 (195) 406 TIGR03410 urea_trans_UrtE urea 87.1 0.43 1.1E-05 27.0 2.2 28 155-183 19-46 (230) 407 cd03218 ABC_YhbG The ABC trans 87.0 0.55 1.4E-05 26.2 2.7 34 150-184 13-47 (232) 408 PRK06893 DNA replication initi 87.0 1.6 4.1E-05 22.9 5.1 27 161-188 38-64 (229) 409 COG4181 Predicted ABC-type tra 87.0 0.46 1.2E-05 26.8 2.3 27 157-185 31-57 (228) 410 PRK13544 consensus 87.0 0.44 1.1E-05 26.9 2.2 29 155-184 20-48 (208) 411 PTZ00243 ABC transporter; Prov 86.8 0.46 1.2E-05 26.8 2.3 34 150-186 676-709 (1560) 412 PRK10261 glutathione transport 86.8 0.46 1.2E-05 26.8 2.2 20 488-507 493-512 (623) 413 cd04134 Rho3 Rho3 subfamily. 86.8 0.38 9.7E-06 27.4 1.8 22 163-184 1-22 (189) 414 cd03263 ABC_subfamily_A The AB 86.7 0.46 1.2E-05 26.8 2.2 33 152-185 17-50 (220) 415 PRK00635 excinuclease ABC subu 86.7 0.67 1.7E-05 25.6 3.0 37 148-186 609-645 (1809) 416 cd01134 V_A-ATPase_A V/A-type 86.7 0.13 3.3E-06 30.8 -0.6 35 94-128 1-35 (369) 417 COG3840 ThiQ ABC-type thiamine 86.7 0.46 1.2E-05 26.8 2.2 29 155-184 18-46 (231) 418 cd03213 ABCG_EPDR ABCG transpo 86.7 0.43 1.1E-05 27.0 2.1 37 148-185 20-57 (194) 419 PRK00149 dnaA chromosomal repl 86.7 0.66 1.7E-05 25.6 3.0 103 165-310 148-267 (447) 420 TIGR01187 potA polyamine ABC t 86.7 0.27 6.9E-06 28.5 1.0 14 167-180 1-14 (331) 421 COG1201 Lhr Lhr-like helicases 86.6 1.5 3.8E-05 23.1 4.8 19 265-283 576-594 (814) 422 cd03219 ABC_Mj1267_LivG_branch 86.5 0.48 1.2E-05 26.6 2.2 35 149-184 12-47 (236) 423 cd03216 ABC_Carb_Monos_I This 86.5 0.5 1.3E-05 26.5 2.3 34 150-184 13-47 (163) 424 pfam08477 Miro Miro-like prote 86.4 0.39 1E-05 27.3 1.7 21 164-184 1-21 (118) 425 cd01875 RhoG RhoG subfamily. 86.3 0.43 1.1E-05 27.0 1.9 21 164-184 5-25 (191) 426 PRK11054 helD DNA helicase IV; 86.3 1.8 4.6E-05 22.5 5.0 29 162-190 209-237 (684) 427 cd03278 ABC_SMC_barmotin Barmo 86.2 0.64 1.6E-05 25.8 2.7 26 157-184 18-43 (197) 428 cd01871 Rac1_like Rac1-like su 86.2 0.44 1.1E-05 26.9 1.9 22 164-185 3-24 (174) 429 TIGR02204 MsbA_rel ABC transpo 86.2 0.94 2.4E-05 24.5 3.6 113 143-265 346-469 (576) 430 COG3451 VirB4 Type IV secretor 86.1 1.8 4.5E-05 22.5 4.9 49 164-222 438-486 (796) 431 COG2874 FlaH Predicted ATPases 86.0 2.6 6.5E-05 21.4 9.1 157 161-357 27-193 (235) 432 TIGR01166 cbiO cobalt ABC tran 85.9 0.54 1.4E-05 26.3 2.2 31 150-180 5-36 (190) 433 PRK00409 recombination and DNA 85.9 1.1 2.9E-05 23.9 3.9 115 145-265 307-442 (780) 434 cd01919 PEPCK Phosphoenolpyruv 85.7 0.45 1.2E-05 26.8 1.8 275 148-509 202-501 (515) 435 cd01854 YjeQ_engC YjeQ/EngC. 85.7 0.46 1.2E-05 26.8 1.8 27 153-179 152-178 (287) 436 COG4525 TauB ABC-type taurine 85.7 0.58 1.5E-05 26.1 2.3 26 154-179 23-48 (259) 437 cd04101 RabL4 RabL4 (Rab-like4 85.7 0.48 1.2E-05 26.7 1.8 17 164-180 2-18 (164) 438 pfam03193 DUF258 Protein of un 85.6 0.44 1.1E-05 26.9 1.6 28 152-179 25-52 (161) 439 KOG2028 consensus 85.5 2.6 6.5E-05 21.4 5.5 19 163-181 163-181 (554) 440 TIGR02673 FtsE cell division A 85.5 0.54 1.4E-05 26.3 2.0 70 142-218 7-79 (215) 441 cd01861 Rab6 Rab6 subfamily. 85.5 0.51 1.3E-05 26.5 1.9 20 163-182 1-20 (161) 442 PRK13545 tagH teichoic acids e 85.4 0.61 1.5E-05 25.9 2.3 30 154-184 42-71 (549) 443 COG1120 FepC ABC-type cobalami 85.4 0.6 1.5E-05 26.0 2.3 28 155-183 21-48 (258) 444 COG4136 ABC-type uncharacteriz 85.4 0.78 2E-05 25.1 2.8 95 155-258 21-129 (213) 445 TIGR00763 lon ATP-dependent pr 85.3 0.5 1.3E-05 26.5 1.8 28 155-182 442-470 (941) 446 cd04175 Rap1 Rap1 subgroup. T 85.3 0.54 1.4E-05 26.3 2.0 20 164-183 3-22 (164) 447 cd04138 H_N_K_Ras_like H-Ras/N 85.3 0.57 1.4E-05 26.1 2.1 20 164-183 3-22 (162) 448 cd04127 Rab27A Rab27a subfamil 85.3 0.52 1.3E-05 26.4 1.9 20 164-183 6-25 (180) 449 PRK09700 D-allose transporter 85.3 0.6 1.5E-05 26.0 2.2 187 161-369 162-360 (510) 450 cd04137 RheB Rheb (Ras Homolog 85.1 0.54 1.4E-05 26.3 1.9 21 163-183 2-22 (180) 451 cd03267 ABC_NatA_like Similar 85.1 0.6 1.5E-05 26.0 2.1 35 149-184 33-68 (236) 452 KOG1564 consensus 85.1 2.7 7E-05 21.2 5.5 126 144-280 82-225 (351) 453 cd04136 Rap_like Rap-like subf 85.1 0.56 1.4E-05 26.2 2.0 20 164-183 3-22 (163) 454 cd03230 ABC_DR_subfamily_A Thi 85.0 0.64 1.6E-05 25.7 2.3 30 155-185 19-48 (173) 455 cd01870 RhoA_like RhoA-like su 85.0 0.51 1.3E-05 26.4 1.8 21 163-183 2-22 (175) 456 cd04130 Wrch_1 Wrch-1 subfamil 85.0 0.54 1.4E-05 26.3 1.9 21 164-184 2-22 (173) 457 PRK05428 HPr kinase/phosphoryl 85.0 1.8 4.5E-05 22.5 4.5 24 161-184 145-168 (308) 458 PRK13536 nodulation factor exp 84.9 0.85 2.2E-05 24.9 2.8 35 150-185 20-55 (306) 459 cd01893 Miro1 Miro1 subfamily. 84.9 0.59 1.5E-05 26.0 2.0 20 164-183 2-21 (166) 460 PRK11288 araG L-arabinose tran 84.9 0.6 1.5E-05 25.9 2.1 23 157-179 274-296 (501) 461 pfam00308 Bac_DnaA Bacterial d 84.9 0.88 2.2E-05 24.7 2.9 35 165-205 37-71 (219) 462 cd04176 Rap2 Rap2 subgroup. T 84.8 0.56 1.4E-05 26.2 1.9 20 164-183 3-22 (163) 463 PRK08116 hypothetical protein; 84.7 2.5 6.3E-05 21.5 5.1 63 145-222 90-153 (262) 464 KOG0731 consensus 84.7 0.68 1.7E-05 25.6 2.3 73 165-243 347-419 (774) 465 cd04140 ARHI_like ARHI subfami 84.7 0.6 1.5E-05 26.0 2.0 21 164-184 3-23 (165) 466 PRK13549 xylose transporter AT 84.7 0.66 1.7E-05 25.7 2.2 26 154-179 280-305 (513) 467 COG0194 Gmk Guanylate kinase [ 84.7 2.9 7.4E-05 21.0 9.0 89 161-270 3-102 (191) 468 cd01874 Cdc42 Cdc42 subfamily. 84.6 0.6 1.5E-05 26.0 1.9 21 164-184 3-23 (175) 469 PRK11144 modC molybdate transp 84.6 0.54 1.4E-05 26.3 1.7 31 152-183 14-44 (352) 470 TIGR02788 VirB11 P-type DNA tr 84.5 0.71 1.8E-05 25.4 2.3 231 29-356 59-314 (328) 471 cd04139 RalA_RalB RalA/RalB su 84.5 0.6 1.5E-05 26.0 1.9 20 164-183 2-21 (164) 472 TIGR00968 3a0106s01 sulfate AB 84.5 0.64 1.6E-05 25.8 2.1 23 157-179 21-43 (241) 473 KOG1707 consensus 84.5 0.77 2E-05 25.2 2.5 82 162-246 9-100 (625) 474 cd04106 Rab23_lke Rab23-like s 84.5 0.64 1.6E-05 25.8 2.0 20 164-183 2-21 (162) 475 cd04135 Tc10 TC10 subfamily. 84.5 0.6 1.5E-05 26.0 1.9 20 164-183 2-21 (174) 476 cd00157 Rho Rho (Ras homology) 84.4 0.54 1.4E-05 26.3 1.7 20 164-183 2-21 (171) 477 KOG3883 consensus 84.4 0.99 2.5E-05 24.4 3.0 83 158-243 5-102 (198) 478 PRK10787 DNA-binding ATP-depen 84.3 0.93 2.4E-05 24.6 2.8 31 152-182 339-369 (784) 479 cd01873 RhoBTB RhoBTB subfamil 84.3 0.71 1.8E-05 25.4 2.2 22 164-185 4-25 (195) 480 COG0411 LivG ABC-type branched 84.3 0.39 9.9E-06 27.3 0.9 32 148-179 15-47 (250) 481 cd01863 Rab18 Rab18 subfamily. 84.1 0.61 1.6E-05 25.9 1.8 18 164-181 2-19 (161) 482 cd04107 Rab32_Rab38 Rab38/Rab3 84.1 0.64 1.6E-05 25.8 1.9 20 164-183 2-21 (201) 483 cd04119 RJL RJL (RabJ-Like) su 84.0 0.62 1.6E-05 25.9 1.8 19 164-182 2-20 (168) 484 cd04110 Rab35 Rab35 subfamily. 84.0 0.61 1.5E-05 25.9 1.8 20 164-183 8-27 (199) 485 cd04129 Rho2 Rho2 subfamily. 84.0 0.66 1.7E-05 25.7 1.9 21 163-183 2-22 (187) 486 cd00876 Ras Ras family. The R 83.9 0.57 1.5E-05 26.1 1.6 21 164-185 1-21 (160) 487 PRK00098 ribosome-associated G 83.9 0.58 1.5E-05 26.0 1.6 15 28-42 2-16 (298) 488 cd04114 Rab30 Rab30 subfamily. 83.9 0.68 1.7E-05 25.6 2.0 20 164-183 9-28 (169) 489 cd01860 Rab5_related Rab5-rela 83.8 0.65 1.6E-05 25.7 1.8 20 164-183 3-22 (163) 490 KOG1434 consensus 83.8 0.57 1.5E-05 26.1 1.6 123 145-276 96-225 (335) 491 cd04122 Rab14 Rab14 subfamily. 83.8 0.65 1.7E-05 25.7 1.8 20 164-183 4-23 (166) 492 cd04146 RERG_RasL11_like RERG/ 83.7 0.63 1.6E-05 25.8 1.8 20 164-183 1-20 (165) 493 COG1125 OpuBA ABC-type proline 83.7 0.94 2.4E-05 24.5 2.7 34 149-183 13-47 (309) 494 TIGR02903 spore_lon_C ATP-depe 83.7 0.58 1.5E-05 26.0 1.6 24 212-235 308-331 (616) 495 KOG0079 consensus 83.7 0.36 9.2E-06 27.6 0.5 17 164-180 10-26 (198) 496 COG4167 SapF ABC-type antimicr 83.7 0.75 1.9E-05 25.3 2.1 29 155-184 32-60 (267) 497 cd04177 RSR1 RSR1 subgroup. R 83.7 0.71 1.8E-05 25.4 2.0 23 164-186 3-25 (168) 498 COG4618 ArpD ABC-type protease 83.6 0.77 2E-05 25.2 2.2 23 157-179 357-379 (580) 499 cd04116 Rab9 Rab9 subfamily. 83.6 0.69 1.8E-05 25.5 1.9 23 164-186 7-29 (170) 500 PRK08939 primosomal protein Dn 83.6 2.2 5.6E-05 21.9 4.5 46 153-204 148-193 (306) No 1 >TIGR00962 atpA ATP synthase F1, alpha subunit; InterPro: IPR005294 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis . These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents the alpha subunit found in the F1 complex of F-ATPases. In F-ATPases, there are three copies each of the alpha and beta subunits that form the catalytic core of the F1 complex, while the remaining F1 subunits (gamma, delta, epsilon) form part of the stalks. There is a substrate-binding site on each of the alpha and beta subunits, those on the beta subunits being catalytic, while those on the alpha subunits are regulatory. The alpha-subunit contains a highly conserved adenine-specific non-catalytic nucleotide-binding domain, with a conserved amino acid sequence of Gly-X-X-X-X-Gly-Lys. The alpha and beta subunits form a cylinder that is attached to the central stalk. The alpha/beta subunits undergo a sequence of conformational changes leading to the formation of ATP from ADP, which are induced by the rotation of the gamma subunit, itself is driven by the movement of protons through the F0 complex C subunit . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0016820 hydrolase activity acting on acid anhydrides catalyzing transmembrane movement of substances, 0015986 ATP synthesis coupled proton transport, 0016469 proton-transporting two-sector ATPase complex. Probab=100.00 E-value=0 Score=1553.04 Aligned_cols=508 Identities=65% Similarity=1.041 Sum_probs=505.7 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECCCEEEEEEEECCCEEEEEEECCCCC Q ss_conf 88889999999999842455542010358999827589997588877374799579818999920598599998058668 Q gi|254780617|r 2 DIHAAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSHGVYGMALNLEVDNVGVVILGSYKE 81 (509) Q Consensus 2 ~~~~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~g~~G~Vi~l~~d~v~i~~l~~~~g 81 (509) +|+++||+++|+++|++|+.....+++|+|++|+|||++|+||++|+.+|+++|++|..|+++|||+|+||+++||+..+ T Consensus 1 ~l~~~Ei~~~I~~~I~~f~~~~~~~evG~V~sVgDGiArv~GL~~vm~~El~eF~~G~~GiAlNLEedsVG~vi~Gd~~~ 80 (520) T TIGR00962 1 QLKLEEISELIKQQIKNFEKDIEVEEVGTVVSVGDGIARVYGLENVMSGELVEFEGGVQGIALNLEEDSVGAVILGDYSN 80 (520) T ss_pred CCCHHHHHHHHHHHHHCCCCEEEEECCCEEEEECCCEEEEEECHHHHHCCEEECCCCCEEEEEEECCCCEEEEEECCCCC T ss_conf 98777999999998626251013303627998338789996011014010025578958999971358624899788666 Q ss_pred CCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCC Q ss_conf 67898889779878866794698300030023144664445444100101357731126600144416023310111156 Q gi|254780617|r 82 ISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGR 161 (509) Q Consensus 82 I~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigr 161 (509) |++|+.|++||++++||||++|||||||+||+||||||||..+++++||.+||++++|++|+|||+|||||||+|+|||| T Consensus 81 i~EG~~VKRTg~il~vPVG~~llGRvVn~LG~PiDGkGpI~~~~~~~~E~~APGv~~RkSV~ePlQTGIkAIDAliPIGR 160 (520) T TIGR00962 81 IREGSTVKRTGRILKVPVGDGLLGRVVNALGQPIDGKGPIESDEFRPIEKIAPGVIERKSVHEPLQTGIKAIDALIPIGR 160 (520) T ss_pred CCCCCCCCCCCHHCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCHHHHHHHHCCCCCCC T ss_conf 53265032032000324575688326845778657777677666343214588552146678730000557641468987 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHH Q ss_conf 83365524778861158999999988630246678760699995167767899999851146742115899728878889 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAP 241 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~ 241 (509) |||+||+|||+||||++|+|||+|||.++...+++++|+||||+||||.++|+.+++.|+++|||+|||||+|+|+||+. T Consensus 161 GQRELIIGDRqTGKTavAIDtIiNQk~~~~~~~~~~dv~CvYVAIGQK~StvA~vv~~LE~~GAM~YTiVV~A~ASdsA~ 240 (520) T TIGR00962 161 GQRELIIGDRQTGKTAVAIDTIINQKDSGELGDEKKDVYCVYVAIGQKASTVAQVVRKLEEHGAMDYTIVVAASASDSAS 240 (520) T ss_pred CCEEEEECCCCCCCCHHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHH T ss_conf 62223423266673056557777305776567865564899962075356899999997534871110336741677212 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC------ Q ss_conf 99998876666667542046884799616478999998754530378754333752001246788887504544------ Q gi|254780617|r 242 MQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDA------ 315 (509) Q Consensus 242 ~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~------ 315 (509) ++|+|||+||||||||||+||||||+||||||||.|||+||||||||||||||||||||+||||||||+|+++. T Consensus 241 lqYLAPY~G~tmaEyF~d~GkhaLIiYDDLSKqA~AYR~iSLLLRRPPGREAyPGDVFYLHSRLLERAAKLsdeYve~Ft 320 (520) T TIGR00962 241 LQYLAPYTGCTMAEYFRDNGKHALIIYDDLSKQAVAYRQISLLLRRPPGREAYPGDVFYLHSRLLERAAKLSDEYVEKFT 320 (520) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 45420688999999998659737999747407899999999871886587169974014545689998640788885430 Q ss_pred ----CCC-CCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHH Q ss_conf ----588-735675310036766443114668854061699615012210266543344412203434389887755678 Q gi|254780617|r 316 ----LGA-GSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAV 390 (509) Q Consensus 316 ----~g~-GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~l 390 (509) +|| |||||||+|||++||+|+||||||||||||||||+++|||+|+|||||||.|||||||+||.|+||++||+| T Consensus 321 kGevkGgtGSlTALPIIETQAGDvSAYIPTNVISITDGQIFLesdLFn~GiRPAINvGlSVSRVGgAAQiKAmK~vaG~L 400 (520) T TIGR00962 321 KGEVKGGTGSLTALPIIETQAGDVSAYIPTNVISITDGQIFLESDLFNSGIRPAINVGLSVSRVGGAAQIKAMKQVAGSL 400 (520) T ss_pred CCCCCCCCCCCCCCCCEEECCCCEEEECCCCEEECCCCHHHHHHHHHCCCCCCCEECCEEEEECCHHHHHHHHHHHCCCH T ss_conf 56324665352367722521487866307871352455112102011068987141756674420357766555423522 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 88799887678798763389989999999999999996699988748899999999986698066798999999999999 Q gi|254780617|r 391 KGELAQYREMSSFSKFSSDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLS 470 (509) Q Consensus 391 r~~laqy~Ele~f~~fgsdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~ 470 (509) |++||||||||+|+|||||||++||++|+||+|++|+|||++|+|+|+++|+++|||+++||||+||+++|++||++|++ T Consensus 401 rLeLAQYRELeAFsQFaSDLD~ATk~qLerG~R~vElLKQ~~~~P~pV~~Q~v~~yA~~~GyLd~iPv~~v~~Fe~~l~~ 480 (520) T TIGR00962 401 RLELAQYRELEAFSQFASDLDEATKAQLERGKRLVELLKQPQYKPLPVEEQVVILYAGTKGYLDDIPVDKVRKFEEELLD 480 (520) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH T ss_conf 45575488998886401223489999986067234200068865885999999999986145464344679989999999 Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 997415999998873279898999999999999985159 Q gi|254780617|r 471 HMRVSSQDILEDIRKQKVLTDDIRSKLINEIKVFLEDFN 509 (509) Q Consensus 471 ~l~~~~~ei~~~I~~~~~l~de~~~~L~~~i~~~~~~Fs 509 (509) |++.+|++++++|+++++|+||+|++||++|++||++|. T Consensus 481 y~~~~~~~~l~eI~~~k~l~ee~e~~lk~~l~~Fk~~f~ 519 (520) T TIGR00962 481 YLDANHPDILEEINTKKKLTEELEAKLKEALKEFKQTFA 519 (520) T ss_pred HHHCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHC T ss_conf 983351489999974078888899999999998887515 No 2 >CHL00059 atpA ATP synthase CF1 alpha subunit Probab=100.00 E-value=0 Score=1257.27 Aligned_cols=500 Identities=58% Similarity=0.921 Sum_probs=496.1 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECCCEEEEEEEECCCEEEEEEECCCC Q ss_conf 98888999999999984245554201035899982758999758887737479957981899992059859999805866 Q gi|254780617|r 1 MDIHAAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSHGVYGMALNLEVDNVGVVILGSYK 80 (509) Q Consensus 1 m~~~~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~g~~G~Vi~l~~d~v~i~~l~~~~ 80 (509) |+|+++||+++|+++|++|+......++|+|.+|+|||++|+||++|+.||+++|++|+.|+|+++++|+|++++|++.. T Consensus 2 ~~~~~~ei~~~i~~~i~~~~~~~~~~e~G~V~~v~DGia~v~GL~~~~~~Elv~f~~g~~G~vlnLe~~~vg~vllg~~~ 81 (501) T CHL00059 2 VTIRPDEISNIIRERIEQYNREVKIVNTGTVLQVGDGIARIYGLDEVMAGELVEFEEGTIGIALNLESNNVGVVLMGDGL 81 (501) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCEEEEEEEEECCCEEEEECCCCCCCCCEEEECCCCEEEEEECCCCCEEEEEECCCC T ss_conf 97899999999999998358777520347999970627998267312558649968998799984378806999976987 Q ss_pred CCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCC Q ss_conf 86789888977987886679469830003002314466444544410010135773112660014441602331011115 Q gi|254780617|r 81 EISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIG 160 (509) Q Consensus 81 gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pig 160 (509) +|++|++|++||++++||||++|||||+|+||+||||+|++.+++++|++++||++++|+++++||+||||+||+|+||| T Consensus 82 ~i~eG~~V~~TG~~l~VPVG~~lLGRVvd~LG~PiDgkg~i~~~~~~pie~~aP~~~~R~~V~epL~TGIkaID~l~PiG 161 (501) T CHL00059 82 MIQEGSSVKATGKIAQIPVSEAYLGRVVNALAKPIDGKGEISASESRLIESPAPGIISRRSVYEPLQTGLIAIDSMIPIG 161 (501) T ss_pred CCCCCCEEEECCCEEEEECCHHHCCCEECCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCC T ss_conf 76899988857972477457523147406877544799988987501043799785044666742323421110036766 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCH Q ss_conf 68336552477886115899999998863024667876069999516776789999985114674211589972887888 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPA 240 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~ 240 (509) ||||+|||||+|||||+|++|+|+||+.. +++|||++||||+||+++|++.+++.|+|+||++|+++||+|| T Consensus 162 rGQR~lI~Gd~gvGKTtl~~d~Iinqk~~--------~vicVyvaIGeR~reV~~~i~~l~~~gam~~TvvV~Atadepa 233 (501) T CHL00059 162 RGQRELIIGDRQTGKTAVATDTILNQKGQ--------NVICVYVAIGQKASSVAQVVTTLQERGAMEYTIVVAETADSPA 233 (501) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHCCC--------CCEEEEEEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCH T ss_conf 77566775578844779999999852468--------8489999855627789999999975697103699961799999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 99999887666666754204688479961647899999875453037875433375200124678888750454458873 Q gi|254780617|r 241 PMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGS 320 (509) Q Consensus 241 ~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GS 320 (509) ++||+|||+||||||||||+|+|||++||||||||+|||||||++||||||+||||||||+||||+||||++++.+|||| T Consensus 234 ~lr~~Apyaa~aIAEyFrdqGkdVLlv~DdLTr~A~A~REisLllgrPPgregYPgdVFylhsrLlERa~kl~~~~ggGS 313 (501) T CHL00059 234 TLQYLAPYTGAALAEYFMYRERHTLIIYDDLSKQAQAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKLSSQLGEGS 313 (501) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEEEEEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 99999999999999999977982899986826988999998986489997667896288785789877650578999884 Q ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 56753100367664431146688540616996150122102665433444122034343898877556788879988767 Q gi|254780617|r 321 LTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREM 400 (509) Q Consensus 321 iT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~El 400 (509) ||+||+|+|++||+|+|||||++||||||||||++||++|+||||||+.||||||+++|+++||++|+++|++||||+|| T Consensus 314 iTalpivet~~~D~s~yIptnviSItDGqI~L~~~Lf~~G~~PAIdvg~SVSRVG~~aq~k~~k~va~~lrl~laq~~el 393 (501) T CHL00059 314 MTALPIVETQSGDVSAYIPTNVISITDGQIFLSADLFNAGIRPAINVGISVSRVGSAAQIKAMKQVAGKLKLELAQFAEL 393 (501) T ss_pred EEEEEEEEECCCCCCCCCCHHHHHHCCCEEEEEHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78888998238864555541044405742998565552777743454555112561767899999877879999989999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87987633899899999999999999966999887488999999999866980667989999999999999974159999 Q gi|254780617|r 401 SSFSKFSSDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQDIL 480 (509) Q Consensus 401 e~f~~fgsdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ei~ 480 (509) ++|++||+|||++|+.+|+||+|++|+|||++|+|+++++|+++|||+++||||++|+++|++|++.+++|+++++++++ T Consensus 394 ~~f~~f~~~ld~~t~~~l~rg~r~~e~lkQ~~~~p~~~~~q~~~l~a~~~g~ld~~~~~~i~~~~~~~~~~~~~~~~~~~ 473 (501) T CHL00059 394 EAFAQFASDLDKATQNQLARGQRLRELLKQSQSAPLTVEEQVATIYTGTNGYLDSLEIGQVRKFLVELRTYLKTNKPQFQ 473 (501) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998623359999999998999999867888999699999999999854986778989999999999999987659999 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 9887327989899999999999998515 Q gi|254780617|r 481 EDIRKQKVLTDDIRSKLINEIKVFLEDF 508 (509) Q Consensus 481 ~~I~~~~~l~de~~~~L~~~i~~~~~~F 508 (509) ++|.++++|+|+.+++|++++++|+++| T Consensus 474 ~~i~~~~~l~~~~~~~l~~~i~~F~~~f 501 (501) T CHL00059 474 EIISSTKTFTEEAEALLKEAIQEQKERF 501 (501) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 9997548999999999999999998519 No 3 >PRK09281 F0F1 ATP synthase subunit alpha; Validated Probab=100.00 E-value=0 Score=1254.29 Aligned_cols=501 Identities=66% Similarity=1.078 Sum_probs=496.9 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECCCEEEEEEEECCCEEEEEEECCCC Q ss_conf 98888999999999984245554201035899982758999758887737479957981899992059859999805866 Q gi|254780617|r 1 MDIHAAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSHGVYGMALNLEVDNVGVVILGSYK 80 (509) Q Consensus 1 m~~~~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~g~~G~Vi~l~~d~v~i~~l~~~~ 80 (509) |+++++||+++|+++|++|+.....+++|+|++|+|||++|+||++|+.||+|+|++|+.|+|+|+++|+|++++||++. T Consensus 1 ~~~~~~ei~~~i~~~i~~~~~~~~~~~~G~V~~v~dgia~v~GL~~~~~~elv~f~~g~~g~~~nle~d~vg~v~lg~~~ 80 (502) T PRK09281 1 MQINPDEISSIIKQQIENFDAEAEVREVGTVLSVGDGIARVYGLDNVMAGELLEFPGGVYGIALNLEEDSVGAVILGDYE 80 (502) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCEEEEEEEEECCCEEEEECCCCCCCCCEEEECCCCEEEEEECCCCCEEEEEECCCC T ss_conf 99888999999999998248877412446999981738999577535648869968998799982478827999946855 Q ss_pred CCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCC Q ss_conf 86789888977987886679469830003002314466444544410010135773112660014441602331011115 Q gi|254780617|r 81 EISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIG 160 (509) Q Consensus 81 gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pig 160 (509) +|++|+.|++||++++||||++|||||+|++|+||||+||+.+.+++|++++||++++|+++++||+||||+||+|+||| T Consensus 81 ~i~~G~~V~~TG~~~~VPVG~~lLGRVvd~lG~PiDGkgpi~~~~~~pie~~aP~i~~R~~V~epL~TGIkaID~l~PiG 160 (502) T PRK09281 81 DIKEGDTVKRTGRILEVPVGEALLGRVVNPLGQPIDGKGPINATETRPVERKAPGVIDRKSVHEPLQTGIKAIDAMIPIG 160 (502) T ss_pred CCCCCCEEEECCCCEEEECCHHHHCCEECCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCC T ss_conf 56899989857982577567523033206774526899988886031355889685135776752331135554125656 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCH Q ss_conf 68336552477886115899999998863024667876069999516776789999985114674211589972887888 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPA 240 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~ 240 (509) ||||+|||||+|||||+|++|||+||+. .+|+|||++||||+||+++|++.|++.|+|+||++|+++|++|| T Consensus 161 rGQR~lIiGd~gtGKT~iaidtIinqk~--------~~vicVyvaIGer~seV~~~v~~L~~~gal~~TvvV~ata~dpa 232 (502) T PRK09281 161 RGQRELIIGDRQTGKTAIAIDTIINQKG--------KDVICIYVAIGQKASTVANVVRKLEEHGAMEYTIVVAATASDPA 232 (502) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHCC--------CCCEEEEEEECCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCH T ss_conf 8756675247873388999999985325--------88369999851221789999998754486411379971799999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 99999887666666754204688479961647899999875453037875433375200124678888750454458873 Q gi|254780617|r 241 PMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGS 320 (509) Q Consensus 241 ~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GS 320 (509) ++||+|||+||||||||||+|+|||++||||||||+|||||||++||||||+||||||||+||||+||||++++.+|||| T Consensus 233 ~lr~~Apyaa~aIAEyFrd~G~dVLlv~DdLtr~A~A~REisLll~rPPgreaYPgdVFylhsrLlERa~~l~~~~ggGS 312 (502) T PRK09281 233 PLQYLAPYAGCAMGEYFMDNGKDALIVYDDLSKQAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKLSDELGGGS 312 (502) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCE T ss_conf 99999999999999999967994899985568999999888986279987667894077786799887651478899840 Q ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 56753100367664431146688540616996150122102665433444122034343898877556788879988767 Q gi|254780617|r 321 LTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREM 400 (509) Q Consensus 321 iT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~El 400 (509) ||+||+|+|++||+|+|||||++|||||||||||+||++|+||||||+.||||||+++|.++||++|+++|++||||+|| T Consensus 313 iTalpivet~a~DvsayIptnvisItDGqI~L~~~L~~~G~~PAIdvg~SVSRVG~~aQ~k~~k~vag~lrl~laqy~el 392 (502) T PRK09281 313 LTALPIIETQAGDVSAYIPTNVISITDGQIFLESDLFNSGIRPAINVGISVSRVGGAAQIKAMKKVAGTLRLDLAQYREL 392 (502) T ss_pred EEEEEEEEECCCCCCCCCCCCCEEEECCEEEEEHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 67676666048865565440202341446998642766898750144553334675024499999999999999999999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87987633899899999999999999966999887488999999999866980667989999999999999974159999 Q gi|254780617|r 401 SSFSKFSSDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQDIL 480 (509) Q Consensus 401 e~f~~fgsdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ei~ 480 (509) ++|++||+|||++|+.+|+||+|++|+|||++|+|+++++|+++|||+++||||++|+++|++||+.+++|+++++++++ T Consensus 393 ~~fa~fgsdld~~t~~~l~rg~r~~e~LkQ~~~~p~~~~~qv~~l~a~~~g~ld~~~~~~v~~~~~~~~~~~~~~~~~~~ 472 (502) T PRK09281 393 EAFAQFGSDLDEATRAQLERGQRLVELLKQPQYSPLPVEEQVVILYAGTNGYLDDVPVEKVRRFEAELLAYLRSNHADLL 472 (502) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99997535669999999999999999867888999699999999999964986678989999999999999987649999 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 98873279898999999999999985159 Q gi|254780617|r 481 EDIRKQKVLTDDIRSKLINEIKVFLEDFN 509 (509) Q Consensus 481 ~~I~~~~~l~de~~~~L~~~i~~~~~~Fs 509 (509) ++|.++++|+|+.+++|++++++|+++|+ T Consensus 473 ~~i~~~~~l~~~~~~~l~~~~~~~~~~f~ 501 (502) T PRK09281 473 EEIRETKKLSDDIEAKLKAAIEEFKKTFA 501 (502) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 99986489999999999999999985248 No 4 >TIGR03324 alt_F1F0_F1_al alternate F1F0 ATPase, F1 subunit alpha. A small number of taxonomically diverse prokaryotic species, including Methanosarcina barkeri, have what appears to be a second ATP synthase, in addition to the normal F1F0 ATPase in bacteria and A1A0 ATPase in archaea. These enzymes use ion gradients to synthesize ATP, and in principle may run in either direction. This model represents the F1 alpha subunit of this apparent second ATP synthase. Probab=100.00 E-value=0 Score=1229.44 Aligned_cols=492 Identities=48% Similarity=0.809 Sum_probs=485.6 Q ss_pred HHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECCCEEEEEEEECCCEEEEEEECCCCCCCC Q ss_conf 89999999999842455542010358999827589997588877374799579818999920598599998058668678 Q gi|254780617|r 5 AAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSHGVYGMALNLEVDNVGVVILGSYKEISE 84 (509) Q Consensus 5 ~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~g~~G~Vi~l~~d~v~i~~l~~~~gI~~ 84 (509) .++..+.|+++|++|+......++|+|++|+|||++|+||++++.||+|+|+++++|+|+|+++|+|++++||+..+|++ T Consensus 5 ~~~~~~~~~~~i~~~~~~~~~~e~G~v~~vgdgia~v~Gl~~~~~~el~~f~~~~~g~~~nle~~~vg~v~lg~~~~i~~ 84 (497) T TIGR03324 5 LDKAFQQLDQARESFQPQLTVQEVGTVESVSTGIARVHGLPGVGFEELLRFPGGLLGIAFNVDEDEVGVVLLGEYSHLQA 84 (497) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEEEEEEEECCCEEEEECCCCCCCCCEEEECCCCEEEEEECCCCCEEEEEECCCCCCCC T ss_conf 99999999999972488752466679999748569997850104486388089988999963788379999569877689 Q ss_pred CCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCE Q ss_conf 98889779878866794698300030023144664445444100101357731126600144416023310111156833 Q gi|254780617|r 85 GDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQR 164 (509) Q Consensus 85 G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR 164 (509) |++|++||++++||||++|||||+|++|+||||+||+.+++++|++++||++++|+++++||+||||+||+|+||||||| T Consensus 85 G~~V~~TG~~l~VPVG~~lLGRVvd~LG~PiDGkgpi~~~~~~pie~~aP~i~~R~~V~epL~TGIkaID~liPIGrGQR 164 (497) T TIGR03324 85 GDEVERTGRVMDVPVGDGLLGRVVDPLGRPLDGGGPLASSPRLPIERPAPPIMDRAPVTVPLQTGLKVIDALIPIGRGQR 164 (497) T ss_pred CCEEEECCCCEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCE T ss_conf 99998589822874488871764588855678999988764102457898963478888710024145542465567746 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHH Q ss_conf 65524778861158999999988630246678760699995167767899999851146742115899728878889999 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQL 244 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~ 244 (509) +|||||+|||||+|++|+|+||+.. +++||||+||||+|||++|++.+++.|+|+||++|+|+|++||.+|| T Consensus 165 elIigdrgtGKTtla~dtIinqk~~--------~vicVyvaIGqr~seV~~~v~~l~~~gal~~TvVV~Ata~dpa~lr~ 236 (497) T TIGR03324 165 ELILGDRQTGKTAIAIDTILNQKGR--------NVLCIYCAIGQRASAVAKVVANLREHGAMDYTIVVVTEGNDPPGLQY 236 (497) T ss_pred EEEECCCCCCHHHHHHHHHHHCCCC--------CEEEEEEEEEEEHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHH T ss_conf 7764588888008899999704566--------71899997325078799999998748752341799956889989998 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 98876666667542046884799616478999998754530378754333752001246788887504544588735675 Q gi|254780617|r 245 LAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTAL 324 (509) Q Consensus 245 ~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~ 324 (509) +|||+||||||||||+|+|||++||||||||+|||||||++||||||+||||||||+||||+||||++++.+||||||+| T Consensus 237 ~Apyaa~aiAEyFrd~G~dVLlv~DdLTr~A~A~REisLll~rpPgreaYPgdvFylhsrLLERA~~l~~~~ggGSiTal 316 (497) T TIGR03324 237 IAPYAATSIGEHFMEQGRDVLIVYDDLTQHARAYRELSLLLRRPPGREAFPGDIFYVHSRLLERSTHLNEELGGGSLTAL 316 (497) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEE T ss_conf 87899999999999689748999978069999999999864899987788935987707888732477899999772324 Q ss_pred EEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 31003676644311466885406169961501221026654334441220343438988775567888799887678798 Q gi|254780617|r 325 PVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFS 404 (509) Q Consensus 325 ~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~ 404 (509) |+|+|++||+|+|||||++|||||||||||+||++|+||||||+.||||||+++|+++||++|++||++||||+||++|+ T Consensus 317 pivet~~~D~sayIptnvisItDGqI~L~~~Lf~~G~~PAIdvg~SVSRVG~~aq~~~~k~vag~lrl~laq~~El~~f~ 396 (497) T TIGR03324 317 PIIETEAQNISAYIPTNLISITDGQIYLSPTLFELGVLPAVDVGKSVSRVGGKAQLAAYRAVAGDLKLAYAQFEELETFA 396 (497) T ss_pred EEEECCCCCCCCCHHHHHHHHCCCEEEECCCHHHCCCCCCCCCCCCEECCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 79980288656644542220108679972407657997643568773136724305899987445699999899999998 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 76338998999999999999999669998874889999999998669806679899999999999999741599999887 Q gi|254780617|r 405 KFSSDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQDILEDIR 484 (509) Q Consensus 405 ~fgsdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ei~~~I~ 484 (509) |||+|||++|+++|+||+|++|+|||++|+|+++++|+++|||+++||||++|+++|++||+.+++|++++++++++.|. T Consensus 397 qfgs~ld~~t~~~l~rG~r~~e~lkQ~~~~p~~~~~qv~~l~a~~~g~ld~~~~~~i~~~~~~~~~~~~~~~~~~~~~i~ 476 (497) T TIGR03324 397 RFGARLDENTRKTIEHGRRIRACLKQTQSSPLTVPQQIAILLALTNGLFDGVDLDAMPEAESAIRAAVTSLPADLRERLQ 476 (497) T ss_pred HHCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 76244489999999887999998667778996999999999998539866688899999999999999864389999998 Q ss_pred HCCCCCHHHHHHHHHHHHHH Q ss_conf 32798989999999999999 Q gi|254780617|r 485 KQKVLTDDIRSKLINEIKVF 504 (509) Q Consensus 485 ~~~~l~de~~~~L~~~i~~~ 504 (509) ++++|+|+.+++|++++++. T Consensus 477 ~~~~l~~~~~~~l~~~~~~a 496 (497) T TIGR03324 477 SGKKLSDEDREQILDIARGA 496 (497) T ss_pred HCCCCCHHHHHHHHHHHHHH T ss_conf 63999989999999999851 No 5 >COG0056 AtpA F0F1-type ATP synthase, alpha subunit [Energy production and conversion] Probab=100.00 E-value=0 Score=1234.29 Aligned_cols=501 Identities=66% Similarity=1.061 Sum_probs=497.0 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECCCEEEEEEEECCCEEEEEEECCCC Q ss_conf 98888999999999984245554201035899982758999758887737479957981899992059859999805866 Q gi|254780617|r 1 MDIHAAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSHGVYGMALNLEVDNVGVVILGSYK 80 (509) Q Consensus 1 m~~~~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~g~~G~Vi~l~~d~v~i~~l~~~~ 80 (509) |+++++||+++++++|++|+......++|+|++|+|||++|+||++++.||+++|+++++|+++|+++|+|+++++|+.. T Consensus 1 ~~~~~~ei~~~i~~~i~~~~~~~~~~~~g~V~sv~DgIa~v~Gl~~~~~~E~~ef~~~v~G~alnle~d~VG~vi~g~~~ 80 (504) T COG0056 1 MQLNPTEISSLIKQQIENFDVEAEVKEVGTVISVGDGIARVSGLENVMAGELVEFPGGVKGMALNLEEDSVGAVILGDYS 80 (504) T ss_pred CCCCHHHHHHHHHHHHHHCCHHHHHHCCCEEEEEECCEEEEECCCHHHCCCEEEECCCCEEEEEECCCCCEEEEEECCCC T ss_conf 98867889999999997125344321164499972546999447435328459916993799985032424699966976 Q ss_pred CCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCC Q ss_conf 86789888977987886679469830003002314466444544410010135773112660014441602331011115 Q gi|254780617|r 81 EISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIG 160 (509) Q Consensus 81 gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pig 160 (509) .|++|++|++||+.+.||||++|+|||||++|+|+||+|++....++|++..+|++++|++++|||+|||++||+|+||| T Consensus 81 ~i~eG~~v~~Tg~i~~Vpvg~~llGRVVn~lG~pidgkg~i~~~~~~~~e~~Apgv~~RksV~ePlqTGikaIDamiPIG 160 (504) T COG0056 81 DIKEGDEVKRTGRILEVPVGEELLGRVVDALGNPIDGKGPIDATKTRPVEKKAPGVMDRKSVNEPLQTGIKAIDALIPIG 160 (504) T ss_pred CCCCCCEEEEECCEEEEECCHHHCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCHHHHHHHHHHHHCCCC T ss_conf 64578678850756887635665140566889826898975423357330207850304423833442057776405657 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCH Q ss_conf 68336552477886115899999998863024667876069999516776789999985114674211589972887888 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPA 240 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~ 240 (509) ||||++|+||++||||++|+|+|+||+. .+|+||||+||||.++++.++++|++.|+|+||+||+|+|+||+ T Consensus 161 RGQRELIIGDRQTGKTaIAidtIiNQk~--------~~v~CIYVAIGQK~stva~vv~tL~e~gAm~yTiVV~AsASd~a 232 (504) T COG0056 161 RGQRELIIGDRQTGKTAIAIDTIINQKG--------SGVKCIYVAIGQKRSTVANVVRTLEEHGAMDYTIVVAASASDSA 232 (504) T ss_pred CCCEEEEECCCCCCCCHHHHHHHHHCCC--------CCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCH T ss_conf 8833787515767831102788983556--------89479999815437789999999997597513399981477623 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 99999887666666754204688479961647899999875453037875433375200124678888750454458873 Q gi|254780617|r 241 PMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGS 320 (509) Q Consensus 241 ~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GS 320 (509) .+||+|||+|||||||||++|+|||++|||||+||.||||||||++||||||+|||||||+||||||||+|+++++|||| T Consensus 233 ~lqYLaPy~g~a~aE~f~~~G~dvLIVyDDLsKhA~AYReiSLLlrRPPGREAyPGDVFYlHSrLLERAakl~~e~g~GS 312 (504) T COG0056 233 PLQYLAPYAGCAMAEYFRDNGKDVLIVYDDLSKHAVAYREISLLLRRPPGREAYPGDVFYLHSRLLERAAKLSDELGGGS 312 (504) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHHHHHCCCCCCCC T ss_conf 55553022555889999865974899960741889999999998369998657888640011799999886202458872 Q ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 56753100367664431146688540616996150122102665433444122034343898877556788879988767 Q gi|254780617|r 321 LTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREM 400 (509) Q Consensus 321 iT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~El 400 (509) |||||+++|+++|+|+|||||++|||||||||+++||++|++||||++.||||||++||.++||++|+.+|++||||+|+ T Consensus 313 iTALPIIETqagDvSAyIpTNVISITDGQIfl~t~LFn~G~rPAInvGlSVSRvGssAQ~kamkkvag~lrl~laqYrel 392 (504) T COG0056 313 ITALPIIETQAGDVSAYIPTNVISITDGQIFLETDLFNAGIRPAINVGLSVSRVGSAAQIKAMKKVAGSLRLILAQYREL 392 (504) T ss_pred EEEEEEEEECCCCEEEECCCCEEEECCCCEEEEHHHHHCCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 47511688616866234035458840770785064664688853257842651651889999999987899999999989 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87987633899899999999999999966999887488999999999866980667989999999999999974159999 Q gi|254780617|r 401 SSFSKFSSDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQDIL 480 (509) Q Consensus 401 e~f~~fgsdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ei~ 480 (509) ++|+||+||||++|+++|+||+|++|+|||++|+|+++++|++++||+.|||||++|++++..||..++.|++.++++++ T Consensus 393 ~afsqf~sdLd~~T~~~l~~G~r~~ellkQ~~~~p~sv~~qv~il~a~~~g~ld~v~v~~v~~fe~~l~~~~~~~~~~~~ 472 (504) T COG0056 393 EAFSQFGSDLDKATRKQLERGKRLTELLKQPQYSPLSVEEQVLILYAGTNGYLDDVPVEKVADFEKELLAYLRSDHKELL 472 (504) T ss_pred HHHHHHHHHHCHHHHHHHHCCHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99875400217989999971379999966778887538899999999873765678567999999999999876479999 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 98873279898999999999999985159 Q gi|254780617|r 481 EDIRKQKVLTDDIRSKLINEIKVFLEDFN 509 (509) Q Consensus 481 ~~I~~~~~l~de~~~~L~~~i~~~~~~Fs 509 (509) ++|.++++|++++++.|++++++|++.|+ T Consensus 473 ~~I~~~~~l~~~~e~~l~~~i~~f~~~f~ 501 (504) T COG0056 473 EEIRTTKELDDEIEAKLKAAIKEFKKTFA 501 (504) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 99865047887899999999999997313 No 6 >PRK13343 F0F1 ATP synthase subunit alpha; Provisional Probab=100.00 E-value=0 Score=1216.67 Aligned_cols=501 Identities=54% Similarity=0.889 Sum_probs=496.4 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECCCEEEEEEEECCCEEEEEEECCCC Q ss_conf 98888999999999984245554201035899982758999758887737479957981899992059859999805866 Q gi|254780617|r 1 MDIHAAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSHGVYGMALNLEVDNVGVVILGSYK 80 (509) Q Consensus 1 m~~~~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~g~~G~Vi~l~~d~v~i~~l~~~~ 80 (509) |+++++||++.++++|++|+..+..+++|+|++|+|||++|+||++|++||+|+|+++..|+|++|++|.|++++|+++. T Consensus 1 m~~~~~~~~~~~~~~i~~~~~~~~~~~~G~V~~v~dgi~~v~GL~~~~~gel~~f~~~~~g~v~~l~~d~v~~v~lg~~~ 80 (502) T PRK13343 1 MASNSDSWLARIRQRIARYELAPDAQEIGRVESVGDGIAFVSGLPDAALDELLRFDGGSRGFAFNLEEELVGAVLLDDTA 80 (502) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEEEECCCEEEEECCCCCCCCCEEEECCCCEEEEEECCCCEEEEEEECCCC T ss_conf 99898999999999996268877236767999982867999267414238889978997799994149839999955987 Q ss_pred CCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCC Q ss_conf 86789888977987886679469830003002314466444544410010135773112660014441602331011115 Q gi|254780617|r 81 EISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIG 160 (509) Q Consensus 81 gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pig 160 (509) +|++|++|++||++++||||++|||||+|++|+||||+|++.+++++|++++||++++|+++++||+||||+||+|+||| T Consensus 81 ~i~~G~~V~~TG~~l~VpVG~~lLGRVvd~lG~PiDg~g~i~~~~~~pi~~~ap~~~~R~~v~epL~TGIkaID~l~pig 160 (502) T PRK13343 81 DLRAGTEVKRTGRVLEVPVGDALLGRVIDPLGRPLDGGGPLEATATRPIERPAPAIIERDSVTEPLQTGLKVIDALIPIG 160 (502) T ss_pred CCCCCCEEEECCCCEEEECCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCEEEEECCCCCC T ss_conf 88999999968995488757310258737883443899998876432146889783321666754312606740556646 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCH Q ss_conf 68336552477886115899999998863024667876069999516776789999985114674211589972887888 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPA 240 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~ 240 (509) ||||+|||||+|||||+|++|+|+||+.. +++||||+||||+||+++|++.+++.++|+||++|++++++|| T Consensus 161 rGQR~gIfg~~GvGKT~L~~~~I~nq~~~--------~vi~Vyv~IGer~~ev~~~v~~l~~~gal~~tvvV~ata~~p~ 232 (502) T PRK13343 161 RGQRELIIGDRQTGKTAIAIDAIINQKDK--------DVICVYVAIGQKASTVARVIDTLREHGALEYTTVVVAEASDPP 232 (502) T ss_pred CCCEEECCCCCCCCCCHHHHHHHHHHCCC--------CCEEEEEEEEEEEHHHHHHHHHHHHCCCCHHHEEEECCCCCCH T ss_conf 78577513678888048999999972578--------8749999960451659899999876485131224523778986 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 99999887666666754204688479961647899999875453037875433375200124678888750454458873 Q gi|254780617|r 241 PMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGS 320 (509) Q Consensus 241 ~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GS 320 (509) ++||+|||+||||||||||+|+|||++||||||||+|||||||++||||||+|||||+||+||+|+||||++++.+|||| T Consensus 233 ~~r~~ap~~a~aiAEyFrd~GkdVLlv~Ddltr~A~A~REislllgrpPgr~gYPg~vf~~~srLlERa~~~~~~~ggGS 312 (502) T PRK13343 233 GLQYLAPFAGCAIAEYFRDRGRDALIVYDDLSKHAAAYRELSLLLRRPPGREAYPGDIFYLHSRLLERATKLSPELGGGS 312 (502) T ss_pred HHHHHHHHHHCCHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 88755566520257999966996799995668999999899997489987667694588772689988875268899846 Q ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 56753100367664431146688540616996150122102665433444122034343898877556788879988767 Q gi|254780617|r 321 LTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREM 400 (509) Q Consensus 321 iT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~El 400 (509) ||+||+|+|++||+|||||||++|||||||||||+||++|+||||||+.|+||||+++|+++||++|+++|++||||+|+ T Consensus 313 ITalp~V~~~~dDis~pIp~~v~sItDGqIvLsr~La~~G~~PAIdv~~SvSRVg~~aq~~~~k~va~~lr~~laqy~el 392 (502) T PRK13343 313 LTALPIIETQAGDLAAYIPTNLISITDGQIFLDSDLFASGQRPAVDVGKSVSRVGGKAQHPAIRKESGRLRLDYAQFLEL 392 (502) T ss_pred EEEEEEEECCCCCCCCCHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 64516997248875663456565222616997245322683785178776135685778899999999999999989999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87987633899899999999999999966999887488999999999866980667989999999999999974159999 Q gi|254780617|r 401 SSFSKFSSDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQDIL 480 (509) Q Consensus 401 e~f~~fgsdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ei~ 480 (509) +.|+|||+|||++|+.+|+||+|+++||||++|+|+++++|++++||+++||||++|+++|+.|++.+++|+++++++++ T Consensus 393 ~~f~~fg~~ld~~~~~~l~rg~r~~~~lkQ~~~~p~~~~~qv~~l~a~~~g~~d~~~~~~i~~~~~~~~~~~~~~~~~~~ 472 (502) T PRK13343 393 EAFTRFGARLDAGTRKQITRGRRLRELLKQPRFSPLSVEEQIALLYALNNGLLDDVPLDNIPPFEEELLAKLDARPSALL 472 (502) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCHHHHH T ss_conf 99998743589999999999999999857888999789999998877448873666889999999999999986669999 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 98873279898999999999999985159 Q gi|254780617|r 481 EDIRKQKVLTDDIRSKLINEIKVFLEDFN 509 (509) Q Consensus 481 ~~I~~~~~l~de~~~~L~~~i~~~~~~Fs 509 (509) ++|.++++|+|+.+++|++++++|+++|+ T Consensus 473 ~~i~~~~~l~~~~~~~l~~~~~~~~~~~~ 501 (502) T PRK13343 473 FALAPTELLDDADREALLEALKEALETFA 501 (502) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 99974279899999999999999998616 No 7 >PRK07165 F0F1 ATP synthase subunit alpha; Validated Probab=100.00 E-value=0 Score=1090.98 Aligned_cols=464 Identities=36% Similarity=0.533 Sum_probs=427.4 Q ss_pred EEEEEEECCEEEEECCCCCCCCCEEEECC--CEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHHHHHC Q ss_conf 58999827589997588877374799579--8189999205985999980586686789888977987886679469830 Q gi|254780617|r 29 GRVLSIGDGIARVYGLNNIRAGEMVQFSH--GVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLELLGR 106 (509) Q Consensus 29 G~V~~V~~giv~v~GL~~a~~GElv~~~~--g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lLGR 106 (509) =||++|+|||++|+||++++.||+++|++ +++|+|+++++|+|+++++++...|++|++|++||+.++||||++|||| T Consensus 3 pkV~sV~DgIa~V~GL~n~~~~El~~f~~~~~v~g~~lnleed~vg~vi~g~~~~i~eGd~V~~Tg~i~~VpVG~~lLGR 82 (507) T PRK07165 3 PKIKSIFDYIVEVKGEFDYQQQQVFTLKNNPNVKLFVISATEDTAYLLINNEKAKIKIGDEIIELNNENKVLTSKEYFGK 82 (507) T ss_pred CEEEEEECEEEEEECCHHHCCCCEEEECCCCCCEEEEEECCCCCEEEEEECCCCCCCCCCEEEECCCEEEEECCCCCCCC T ss_conf 70899814489996770224486699648996289997255361799998798770489999845985788757664463 Q ss_pred CCCHHHCCCCCCCC-----CCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHH Q ss_conf 00300231446644-----4544410010135773112660014441602331011115683365524778861158999 Q gi|254780617|r 107 VVDALGNPIDGKGP-----IKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILD 181 (509) Q Consensus 107 VvD~lG~PlDg~g~-----i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~ 181 (509) |+|++|+|||+++. +......|+++.||++++|++++|||+|||++||+|+|||||||++||||+|||||++++| T Consensus 83 VVD~LG~pidp~~~~~~~~~~~~~~~pie~~Apgi~~R~~V~ePLqTGIkaID~miPIGRGQRELIiGDrqTGKTaIAld 162 (507) T PRK07165 83 IIDIDGNIIYPKAANPLKSEKAYQTSSAFNLAHGLMTVQTLNEQLYTGIVAIDLLIPIGKGQRELIIGDRQTGKTHIALN 162 (507) T ss_pred EECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHHHH T ss_conf 68899986789898887765433347745579985236767861343467876135556761677735788778899999 Q ss_pred HHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99998863024667876069999516776789999985114674211589972887888999998876666667542046 Q gi|254780617|r 182 TFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNG 261 (509) Q Consensus 182 ~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G 261 (509) ||+||+.. +|+||||+||||+++++++++.|++.|+|+||++|+|++++ |.+||+|||+||||||||+ +| T Consensus 163 tIiNQk~~--------~V~cIYVAIGQK~s~Va~iv~~L~~~gAm~yTiVV~A~sd~-p~~qylAPy~g~aiAEyf~-~~ 232 (507) T PRK07165 163 TIINQKNR--------NVKCIYVAIGQKRENLSRIYETLKEHDALKNTIIIDAPSTS-SYEQYLAPYVAMAHAENIS-YN 232 (507) T ss_pred HHHHCCCC--------CCEEEEEEECCCHHHHHHHHHHHHHCCCHHHEEEEECCCCC-CHHHHHHHHHHHHHHHHHH-CC T ss_conf 99845468--------95899998310177999999999877975652576315778-2777778888887889886-46 Q ss_pred CCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHH Q ss_conf 88479961647899999875453037875433375200124678888750454458873567531003676644311466 Q gi|254780617|r 262 YHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTN 341 (509) Q Consensus 262 ~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~ 341 (509) +|||++||||||||+|||||||+++|||||||||||+||+||||+||||+++ |+|||||||+++|++||+|+||||| T Consensus 233 ~dvLiiyDDLTkhA~AyReisLll~rPPGReayPgdvFy~hsrLLERa~kl~---g~gSiTaLPIieT~~gDisayIpTN 309 (507) T PRK07165 233 DDVLIVFDDLTKHANIYREIALLTNKPVGKEAFPGDIFFAHSKLLERAGKFK---NRKSITALPILQTVDNDITSLISSN 309 (507) T ss_pred CCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCC---CCCEEEEEEEEEECCCCCCCCCCCC T ss_conf 8569996476899999998898727999977899605766168888876126---9961576017870278622435664 Q ss_pred HHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 88540616996150122102665433444122034343898877556788879988767879876338998999999999 Q gi|254780617|r 342 VISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFSSDLDSSTQKFLSKG 421 (509) Q Consensus 342 ~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fgsdlD~~t~~~l~rg 421 (509) ++|||||||||+++||++|+|||||+++||||||++||.++||++|+.+|++|+||+|+++|+|||+|||++|+.+|+|| T Consensus 310 viSITDGQI~l~~~Lf~~G~rPAInvglSVSRvG~~aQ~k~~k~vag~l~l~~aqy~el~~fa~f~sdld~~t~~~l~rG 389 (507) T PRK07165 310 IISITDGQIVTSSELFAQGKLPAIDIDLSVSRTGSSVQSKNIAKIAKEISKIYRAYKRQLKLSSLDYDLNKETSDLLSKG 389 (507) T ss_pred EEEECCCEEEECHHHHHCCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 47741762885176774698865235700651576888999999863656789999999999865454189999999732 Q ss_pred HHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH-HHHHHH Q ss_conf 99999966999887488999999999866980667989-9999999999999741599999887327989899-999999 Q gi|254780617|r 422 ERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVS-QVRKFETNFLSHMRVSSQDILEDIRKQKVLTDDI-RSKLIN 499 (509) Q Consensus 422 ~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~-~I~~fe~~l~~~l~~~~~ei~~~I~~~~~l~de~-~~~L~~ 499 (509) +|++|+|||++|+|+++++|++++|+++|||||+|+.+ ..-.|...++++ +...+.+++.|.++.+.+|+. ++-+.. T Consensus 390 ~r~~ellkQ~~~~p~~~~~qv~~~~~~~~g~lddv~~~~~al~fi~~~~~~-~~~~k~~~~~~~~~~~~d~~~~k~~f~~ 468 (507) T PRK07165 390 KMIEKLFIQKGFSLYSYRFILLISKLISWGLLKDVKDEQKALDFIDYLIEN-DPDAKKIFNKIKNNQNVDDELMKNYFAF 468 (507) T ss_pred HHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHC-CHHHHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 899998667788986899999999987546544687779999999999860-8889999998734663557899999999 Q ss_pred HHHHHHH Q ss_conf 9999985 Q gi|254780617|r 500 EIKVFLE 506 (509) Q Consensus 500 ~i~~~~~ 506 (509) ++++|.+ T Consensus 469 ~l~~~~~ 475 (507) T PRK07165 469 LLNQYSD 475 (507) T ss_pred HHHHHHH T ss_conf 9999998 No 8 >TIGR02546 III_secr_ATP type III secretion apparatus H+-transporting two-sector ATPase; InterPro: IPR013380 Proteins in this entry are found in a variety of bacteria, and are predicted to be ATPases involved in type III secretion systems. One example is YscN (P40290 from SWISSPROT) from Yersinia enterocolitica, which is thought to energise the YOP (Yersinia outer protein) secretion system .; GO: 0046961 hydrogen ion transporting ATPase activity rotational mechanism, 0030254 protein secretion by the type III secretion system. Probab=100.00 E-value=0 Score=1010.31 Aligned_cols=410 Identities=31% Similarity=0.474 Sum_probs=392.2 Q ss_pred EEEEEEEEEEECCEEEEECCCCCCCCCEEEECC-------CEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEE Q ss_conf 010358999827589997588877374799579-------8189999205985999980586686789888977987886 Q gi|254780617|r 25 FSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSH-------GVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDV 97 (509) Q Consensus 25 ~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~-------g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~v 97 (509) +...|||..|.|.++++. ||+|++||||++.+ ...+||++|.+|++.++||++..||..|++|.+||++++| T Consensus 3 ~~~~Grv~~v~G~Ll~A~-lP~a~vGELC~i~~nW~~~~~~~~AEVVGf~~~~alLsPlg~~~G~~~g~~V~p~g~~~~i 81 (430) T TIGR02546 3 VRVRGRVTEVSGTLLKAV-LPGARVGELCLIKRNWPRDPSQLLAEVVGFTGDEALLSPLGELSGISPGSEVIPTGRPLSI 81 (430) T ss_pred CEEEEEEEEECCEEEEEE-CCCEEEEEEEEEECCCCCCCCCCEEEEEEEECCEEEEECCCCCCCCCCCCEEEECCCCEEE T ss_conf 314766775213389998-1760674034552237898566137899872786776026653325656678625842022 Q ss_pred ECCHHHHHCCCCHHHCCCCCCCCCCCCC-------CCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECC Q ss_conf 6794698300030023144664445444-------100101357731126600144416023310111156833655247 Q gi|254780617|r 98 PVGLELLGRVVDALGNPIDGKGPIKCEQ-------RSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGD 170 (509) Q Consensus 98 pvG~~lLGRVvD~lG~PlDg~g~i~~~~-------~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~ 170 (509) +||++|||||+|+||+||||++++.... .||+.++|||||.|++|.+||-||||+||+|+|||+|||+||||+ T Consensus 82 ~vGe~LLGRVlDG~GrPlD~~~~~~~~~~~Qvt~~~~p~~a~pP~P~~R~~I~~PL~tGvRaiDGlLTcG~GQRiGIFA~ 161 (430) T TIGR02546 82 RVGEALLGRVLDGFGRPLDGKGELPAGEIRQVTGETRPLDADPPPPMSRQPIDQPLPTGVRAIDGLLTCGEGQRIGIFAG 161 (430) T ss_pred EECCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEEC T ss_conf 10600032004565770378887875564302464576567810487675334651014667543440366530578708 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 78861158999999988630246678760699995167767899999851146742115899728878889999988766 Q gi|254780617|r 171 RKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAG 250 (509) Q Consensus 171 ~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a 250 (509) +||||||| +.||+|+. +.++||+++||||||||+||++.......|+|||+|++|||.|+.+|.+|+|+| T Consensus 162 aG~GKSTL-L~~i~~g~---------~ADv~V~ALIGERGREVREFiE~~lg~e~~~RsVlVvsTSDrss~eR~~AAy~A 231 (430) T TIGR02546 162 AGVGKSTL-LGMIARGA---------SADVNVIALIGERGREVREFIEHLLGEEGRKRSVLVVSTSDRSSLERLKAAYTA 231 (430) T ss_pred CCCCHHHH-HHHHHCCC---------CCCEEEEEECCCCCHHHHHHHHHCCCHHHCCCEEEEEECCCCCHHHHHHHHHHH T ss_conf 88616689-99986189---------887899860278760478887520783052424899607986679999998788 Q ss_pred HHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECC Q ss_conf 66667542046884799616478999998754530378754333752001246788887504544588735675310036 Q gi|254780617|r 251 CAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQ 330 (509) Q Consensus 251 ~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~ 330 (509) ||||||||||||+||||||||||||+|+|||+|+.||||+|.|||||||+.+|||+||||+..+ |||||||||+++ T Consensus 232 TaIAEYFRDQGk~VlLmmDSlTRfARA~REiGLAaGEP~aR~GyPPSVF~~LPRLLERaG~~e~----GSITA~YTVLvE 307 (430) T TIGR02546 232 TAIAEYFRDQGKRVLLMMDSLTRFARALREIGLAAGEPPARRGYPPSVFSSLPRLLERAGNSEK----GSITALYTVLVE 307 (430) T ss_pred HHHHHHHHHCCCEEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHCCHHHCCCCCCCC----CCEEEEEEEEEC T ss_conf 7999999973990798840277999998778875378400257787366507501227861129----625345678762 Q ss_pred CCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC-- Q ss_conf 766443114668854061699615012210266543344412203434389887755678887998876787987633-- Q gi|254780617|r 331 VNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFSS-- 408 (509) Q Consensus 331 ~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fgs-- 408 (509) +||++|||+|+++||+||||||||+||+++|||||||+.|+||||+++..+.|++.|+++|.+||+|+|+|.++++|. T Consensus 308 gDd~~dP~ADEvRSILDGHIvLsR~LA~~~HyPAIDVLaS~SRvm~~vv~~eH~~aA~~lR~LLA~Y~e~e~LI~lGEY~ 387 (430) T TIGR02546 308 GDDMNDPIADEVRSILDGHIVLSRKLAERNHYPAIDVLASLSRVMSQVVSKEHRRAAGKLRRLLAKYKEVELLIRLGEYQ 387 (430) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 77799843665544542368998999741688635665236642367788789999999999999999998898744888 Q ss_pred -CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf -89989999999999999996699988748899999999986 Q gi|254780617|r 409 -DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGI 449 (509) Q Consensus 409 -dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~ 449 (509) +-|+++|+++++-..|+.||+|+..+..++++.+..|+... T Consensus 388 ~G~D~~~D~A~~~~~~i~~FL~Q~~~E~~~~e~tl~~L~~~~ 429 (430) T TIGR02546 388 PGSDPETDKAIDKIDAIRAFLRQSTDEKSSYEETLEQLHALV 429 (430) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH T ss_conf 998988999997577889871488877789899999999981 No 9 >TIGR02545 ATP_syn_fliI flagellar protein export ATPase FliI; InterPro: IPR013379 FliI proteins are involved in bacterial flagellum systems, acting to drive protein export for flagellar biosynthesis. The most closely related proteins are the YscN proteins of bacterial type III secretion systems.; GO: 0016887 ATPase activity, 0001539 ciliary or flagellar motility, 0009296 flagellum biogenesis, 0009288 flagellin-based flagellum. Probab=100.00 E-value=0 Score=979.24 Aligned_cols=413 Identities=30% Similarity=0.454 Sum_probs=388.9 Q ss_pred EEEEEEEEEECCEEEEECCCCCCC---CCEEEEC---C----CEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECC--- Q ss_conf 103589998275899975888773---7479957---9----81899992059859999805866867898889779--- Q gi|254780617|r 26 SEIGRVLSIGDGIARVYGLNNIRA---GEMVQFS---H----GVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTG--- 92 (509) Q Consensus 26 ~~~G~V~~V~~giv~v~GL~~a~~---GElv~~~---~----g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg--- 92 (509) +.+|+|++|.++++++.|+ .+.+ |++|++. + .+.+||++|+++.+.+|||++.+||++||+|...+ T Consensus 2 ~~~G~V~~v~G~~~e~~Gp-~~~~~~~G~~c~i~~~~~g~~~~~~aEVvGF~~~~~~lmP~~~~~Gi~~G~~V~~~~~~a 80 (439) T TIGR02545 2 RIVGRVTKVAGLTVEVAGP-KAAVARLGDLCEIEPQEGGEEKHVLAEVVGFEGDRVILMPYEPLEGIRPGDRVFLLGDIA 80 (439) T ss_pred CEEEEEEEEECCEEEEECC-CCCEEECCCEEEEECCCCCCCCEEEEEEEEECCCEEEEEECCCCCCCCCCCEEEECCCCC T ss_conf 1588999988438898544-213021362789965888521015689998738836986446555624576578634314 Q ss_pred ----CEEEEECCHHHHHCCCCHHHCCCC----CCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCE Q ss_conf ----878866794698300030023144----664445444100101357731126600144416023310111156833 Q gi|254780617|r 93 ----RIVDVPVGLELLGRVVDALGNPID----GKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQR 164 (509) Q Consensus 93 ----~~~~vpvG~~lLGRVvD~lG~PlD----g~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR 164 (509) +.++||||++|||||||+||+||| +++.+....++|+.+++|||++|++|.+||.|||||||+|+||||||| T Consensus 81 d~~~~~~~~~~G~~LLGRv~D~lG~PlDg~G~~~~~~~~~~~~~l~~~~pnpl~R~rI~~~l~tGVRaId~lLT~GrGQR 160 (439) T TIGR02545 81 DAGGRSLSIPVGDELLGRVIDALGRPLDGKGAGGGLIDATVYRPLRREPPNPLDRRRIEEVLDTGVRAIDALLTIGRGQR 160 (439) T ss_pred CCCCCCCEECCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCE T ss_conf 65456642027744403238678786578778887665655556567830746787578611212310111036556410 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHH Q ss_conf 65524778861158999999988630246678760699995167767899999851146742115899728878889999 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQL 244 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~ 244 (509) +||||||||||||| +.||. ++.+..+.|+++||||||||.||+|.-...+.|+|||||+||||+||.+|+ T Consensus 161 ~GIFAGSGVGKSTL-LGMiA---------r~t~ADV~VIALIGERGREV~EFiE~~LG~eGl~kSVVVVATSD~spl~R~ 230 (439) T TIGR02545 161 LGIFAGSGVGKSTL-LGMIA---------RYTEADVNVIALIGERGREVKEFIEDDLGEEGLKKSVVVVATSDESPLMRI 230 (439) T ss_pred EEEECCCCHHHHHH-HHHHH---------CCCCCCEEEEEEECCCCCCCHHHHHHHCCCCCCCCEEEEEECCCCCHHHHH T ss_conf 26633774447889-88875---------066588789984446566431355430351102540799827998689998 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 98876666667542046884799616478999998754530378754333752001246788887504544588735675 Q gi|254780617|r 245 LAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTAL 324 (509) Q Consensus 245 ~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~ 324 (509) +|+|+|++|||||||||||||++|||+||||+|+|||+|++||||..+|||||||..+++|+||||-++...++||||+| T Consensus 231 ~aA~~A~~iAEYFRDqGk~VLL~~DSlTRFAmAqREigLa~GEPP~tkGYpPSVF~~Lp~L~ERaG~~p~~~~~GsITa~ 310 (439) T TIGR02545 231 RAAYAATAIAEYFRDQGKDVLLLMDSLTRFAMAQREIGLAAGEPPTTKGYPPSVFSELPRLLERAGIVPGAKGGGSITAF 310 (439) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCEEEEE T ss_conf 88899999999998649834776211788998988999871787666789704899999999870864698887305689 Q ss_pred EEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 31003676644311466885406169961501221026654334441220343438988775567888799887678798 Q gi|254780617|r 325 PVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFS 404 (509) Q Consensus 325 ~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~ 404 (509) |||+++|||++|||+|.++||+||||||||+||++||||||||+.|+||+|+++.+++|++.+.++|.++|+|++.|++. T Consensus 311 yTVLVeGDD~~ePiAD~~RgiLDGHIvLsR~lA~~G~YPaIDvl~SiSR~m~~i~~~e~~~~a~~~r~l~a~Y~~~eDLi 390 (439) T TIGR02545 311 YTVLVEGDDMNEPIADAVRGILDGHIVLSRKLAERGHYPAIDVLRSISRLMPDIVSPEHNKAARKLRKLLATYKDAEDLI 390 (439) T ss_pred EEEEEECCCCCCHHHHHHHHHCCCEEEECHHHHHCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99987368998737877500102207732267645889875722227887776279889999999999999888889999 Q ss_pred HHH---CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf 763---389989999999999999996699988748899999999986 Q gi|254780617|r 405 KFS---SDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGI 449 (509) Q Consensus 405 ~fg---sdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~ 449 (509) +.| ++-||..|++|+..+.|.+||+|+..+-.++++....|-.+. T Consensus 391 ~iGaY~~GSdp~~D~AI~~~p~i~~FL~Q~~~E~~~~~~~~~~L~~i~ 438 (439) T TIGR02545 391 RIGAYKKGSDPEVDKAIKLYPKIEAFLRQDIEEASSFEESLEALKEIL 438 (439) T ss_pred HHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH T ss_conf 817864688878999851002589984679754557789999999982 No 10 >PRK06002 fliI flagellum-specific ATP synthase; Validated Probab=100.00 E-value=0 Score=968.94 Aligned_cols=437 Identities=26% Similarity=0.394 Sum_probs=409.4 Q ss_pred CCHHHHHHHHHHHHHCCCC-CCCEEEEEEEEEEECCEEEEECCC-CCCCCCEEEECCC---EEEEEEEECCCEEEEEEEC Q ss_conf 8889999999999842455-542010358999827589997588-8773747995798---1899992059859999805 Q gi|254780617|r 3 IHAAEISDILRKRIKDFGN-DSEFSEIGRVLSIGDGIARVYGLN-NIRAGEMVQFSHG---VYGMALNLEVDNVGVVILG 77 (509) Q Consensus 3 ~~~~eis~~l~~~i~~~~~-~~~~~~~G~V~~V~~giv~v~GL~-~a~~GElv~~~~g---~~G~Vi~l~~d~v~i~~l~ 77 (509) +.|+|+...|+.++++|.+ .+.+++.|+|++|.|++++|+||. .+++||+|+|+++ +.|||++|+++.+.+++|+ T Consensus 1 ~~~~~~~~~l~~~~~~~~a~~p~~r~~G~V~~V~g~i~~v~GL~~~~~~Ge~~~i~~~~~~~~aeVv~~~~~~v~l~p~g 80 (450) T PRK06002 1 MTPDNALARLAAFVQQYAAPEPLVRIGGTVSEVTPSHYRVRGLSRFVRLGDRVAIRSDGGTQLGEVVRVDEDGVLVKPFE 80 (450) T ss_pred CCHHHHHHHHHHHHHHCCCCCCEEEEEEEEEEEECCEEEEEECCCCCCCCCEEEEECCCCCEEEEEEEEECCCEEEEECC T ss_conf 97647999999999850799982345689999938189999886667779889992799717999998637818999876 Q ss_pred CCCCCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCC-CCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCC Q ss_conf 8668678988897798788667946983000300231446644454-441001013577311266001444160233101 Q gi|254780617|r 78 SYKEISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKC-EQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSL 156 (509) Q Consensus 78 ~~~gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~-~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l 156 (509) ++.||++|++|.+|| .+++++|++|||||+|++|+||||+||+.. .+++|+++.|||+++|+++++||+||||+||+| T Consensus 81 ~~~gi~~G~~V~~tg-~l~v~vg~~lLGRVvD~lG~PlDgkgpl~~~~~~~pi~~~~p~~~~R~~i~epL~TGIkaID~l 159 (450) T PRK06002 81 PRIPAGLGAAVFRKG-PLRIRPDPSWKGRVINALGEPIDGKGPLAPGTRPMSIDATAPPAMRRARVDTPLRTGVRVIDIF 159 (450) T ss_pred CCCCCCCCCEEEECC-CEEECCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCHHHHHHH T ss_conf 866688999898479-8044468000478737888657899998877613515689968121278887663103133421 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECC Q ss_conf 11156833655247788611589999999886302466787606999951677678999998511467421158997288 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASA 236 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a 236 (509) +|||||||++||||+|||||+|+ .||.++.. ..+|||++||||+||+.+|++.... +.|+||++|++++ T Consensus 160 ~pigrGQR~gIfgg~GvGKTtLl-~~i~~~~~---------~d~vV~~lIGeRgrEv~ef~e~~l~-~~l~~svvV~ats 228 (450) T PRK06002 160 TPLCAGQRIGIFAGSGVGKSTLL-AMLARADA---------FDTVVVALVGERGREVREFLEDTLA-DHLSKAVAVVATG 228 (450) T ss_pred CCCCCCCEEEECCCCCCCHHHHH-HHHHHHCC---------CCEEEEEEEEEEHHHHHHHHHHHHH-CCCCEEEEEEECC T ss_conf 46014877774268999875999-99986258---------9879999864748999999998764-2200059997058 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC Q ss_conf 78889999988766666675420468847996164789999987545303787543337520012467888875045445 Q gi|254780617|r 237 SDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDAL 316 (509) Q Consensus 237 ~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~ 316 (509) |+||.+||+|||+||||||||||+|+|||++|||+||||+|||||||++||||+|+|||||+||.||+|+||||+.++ T Consensus 229 d~~p~~r~~a~~~a~aiAEyfrd~G~~VLli~DslTr~A~A~REisl~~gepP~r~gYp~~vf~~l~~LlERag~~~~-- 306 (450) T PRK06002 229 DESPMMRRLAPKTATAIAEYFRDRGENVLLIVDSVTRFAHAAREVALAAGEPPVARGYPPSVFSELPRLLERAGPGAE-- 306 (450) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC-- T ss_conf 898288765565544379999967996799982617888899999987589997567783277660688876225777-- Q ss_pred CCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 88735675310036766443114668854061699615012210266543344412203434389887755678887998 Q gi|254780617|r 317 GAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQ 396 (509) Q Consensus 317 g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laq 396 (509) |+||||+||+|++++||++||||||++|||||||||||+||++|+|||||++.|+||||+++|+++|+++|+++|++|++ T Consensus 307 ~~GSITal~~V~~~~dD~~~pI~d~~~silDGhIvLsr~la~~G~~PAIDv~~SvSRv~~~~~~~~~~~~a~~~r~~la~ 386 (450) T PRK06002 307 GGGSITGIFSVLVDGDDHNDPVADSIRGTLDGHIVLDRAIAEQGRYPAVDPLASISRLAKHVWTPEQRKLVRRLKSMIAR 386 (450) T ss_pred CCCCEEEEEEEEECCCCCCCCHHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHH T ss_conf 87306888888713887677578878677354999848898679999728873420177122429999999999999997 Q ss_pred HHHHHHHHHHH---CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCC Q ss_conf 87678798763---38998999999999999999669998874889999999998669806 Q gi|254780617|r 397 YREMSSFSKFS---SDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLD 454 (509) Q Consensus 397 y~Ele~f~~fg---sdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld 454 (509) |+|++++.+|| +++|+.++++++++++|.+||||++++|.+ ++|++.||+..+|--. T Consensus 387 y~e~~dli~~g~y~~g~d~~~d~ai~~~~~i~~fL~Q~~~ep~s-~d~~~~l~~~~~g~~~ 446 (450) T PRK06002 387 FEETRDLRLMGGYQPGADPELDQAVDLVPRIYEALQQSPGDPPS-DDPFADLAAALKGAAQ 446 (450) T ss_pred HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCC-HHHHHHHHHHHHHHHH T ss_conf 77789999975946799999999998269999983899899987-7899999998532130 No 11 >PRK05688 fliI flagellum-specific ATP synthase; Validated Probab=100.00 E-value=0 Score=964.15 Aligned_cols=435 Identities=26% Similarity=0.381 Sum_probs=413.4 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECC--C-----EEEEEEEECCCEEEE Q ss_conf 988889999999999842455542010358999827589997588877374799579--8-----189999205985999 Q gi|254780617|r 1 MDIHAAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSH--G-----VYGMALNLEVDNVGV 73 (509) Q Consensus 1 m~~~~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~--g-----~~G~Vi~l~~d~v~i 73 (509) |+++++++++.|+.++++++...+....|||++|.|.+++++|++ +.+|++|.+.+ + +.+||++|+++.+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~p~~~GrV~~v~G~~ie~~G~~-~~iG~~c~i~~~~~~~~~~v~aEVVgf~~~~v~l 79 (451) T PRK05688 1 MRLERTSFAKRLEGYAEAISLPAQPVVEGRLLRMVGLTLEAEGLR-AAVGSRCLVINDDSYHPVQVEAEVMGFSGDKVYL 79 (451) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEECEEEEEEECC-CCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEE T ss_conf 987789999999999972577677700369999967499998258-8747847998689887754689992254997999 Q ss_pred EEECCCCCCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHH Q ss_conf 98058668678988897798788667946983000300231446644454441001013577311266001444160233 Q gi|254780617|r 74 VILGSYKEISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAI 153 (509) Q Consensus 74 ~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~I 153 (509) |+|+++.||++|++|.++|++++||||++|||||+|++|+||||+|++..++++|++.++|||++|++|++||+||||+| T Consensus 80 ~p~g~~~Gi~~G~~V~~~g~~~~VpVG~~lLGRVvd~lG~PlDg~g~i~~~~~~p~~~~~p~pl~R~~i~epl~TGIraI 159 (451) T PRK05688 80 MPVGSVAGIAPGARVVPLADTGRLPMGMSMLGRVLDGAGRALDGKGGMKAEDWVPMDGPTINPLNRDPISEPLDVGIRSI 159 (451) T ss_pred EECCCCCCCCCCCEEEECCCCCEEECCCCCCCCEECCCCCCCCCCCCCCCCCEEECCCCCCCHHHCCCCCCCEECCCEEE T ss_conf 98878778899999996899877775825358888877642689999898734556689988355178786200384688 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEE Q ss_conf 10111156833655247788611589999999886302466787606999951677678999998511467421158997 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVV 233 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~ 233 (509) |+|+|||||||+|||||+|+|||+| +.||.++.. ..+|||++||||+||+++|++++.+.++|+||++|+ T Consensus 160 D~l~pigrGQR~gIfggsGvGKS~L-l~~i~r~~~---------adv~Vi~lIGERgrEv~efi~~~l~~~~~~~svvV~ 229 (451) T PRK05688 160 NGLLTVGRGQRLGLFAGTGVGKSVL-LGMMTRFTE---------ADIIVVGLIGERGREVKEFIEHILGEEGLKRSVVVA 229 (451) T ss_pred ECCCCCCCCCEEEECCCCCCCHHHH-HHHHHHHHC---------CCEEEEEEEEEEHHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 4546724675642104889868999-999988527---------986999961277599999999985156743058983 Q ss_pred ECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 28878889999988766666675420468847996164789999987545303787543337520012467888875045 Q gi|254780617|r 234 ASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMS 313 (509) Q Consensus 234 a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~ 313 (509) +|||+||.+|+++||+|+||||||||+|+|||+++||+||||+|||||||++||||+++|||||+||.||+|+||||+. T Consensus 230 atsd~~p~~r~~a~~~a~aiAEyfrd~Gk~VLl~~Dsltr~A~A~REisl~~gepP~~~GYppsvf~~l~~LlERag~~- 308 (451) T PRK05688 230 SPADDAPLMRLRAAMYCTRIAEYFRDKGKNVLLLMDSLTRYAQAQREIALAIGEPPATKGYPPSVFAKLPKLVERAGNA- 308 (451) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCC- T ss_conf 3888987999999987655899998679825999705577788888899875899876687844887737888641379- Q ss_pred CCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHH Q ss_conf 44588735675310036766443114668854061699615012210266543344412203434389887755678887 Q gi|254780617|r 314 DALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGE 393 (509) Q Consensus 314 ~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~ 393 (509) ..|+||||+||+|++++||++||||||++|||||||||||+||++||||||||+.|+||+|+.++.+.|+++|+++|.+ T Consensus 309 -~~g~GSITal~tVlv~gdD~~dPi~d~~~silDGhIvLsr~la~~G~yPAIDvl~SvSRvm~~i~~~~h~~~a~~~r~~ 387 (451) T PRK05688 309 -EPGGGSITAFYTVLSEGDDQQDPIADSARGVLDGHIVLSRRLAEEGHYPAIDIEASISRVMPQVVDPEHLRRAQRFKQL 387 (451) T ss_pred -CCCCCCEEEEEEEECCCCCCCCCCHHHHHHHCCCEEEECHHHHHCCCCCCCCCCCCHHHCCHHHCCHHHHHHHHHHHHH T ss_conf -9999220164677503887677733445655062599848788669999828752665423333129999999999999 Q ss_pred HHHHHHHHHHHHHH---CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 99887678798763---38998999999999999999669998874889999999998 Q gi|254780617|r 394 LAQYREMSSFSKFS---SDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAG 448 (509) Q Consensus 394 laqy~Ele~f~~fg---sdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~ 448 (509) |++|+|++++++|| ++.|++++.+++++++|++||+|+.+++.++++.+..|..+ T Consensus 388 la~Y~e~~dLi~~g~y~~g~d~~~d~ai~~~~~i~~fL~Q~~~e~~~~~~~~~~l~~l 445 (451) T PRK05688 388 WSRYQQSRDLISVGAYVAGGDPETDLAIARQPHLVQFLRQGLRENVSLAESREQLAAL 445 (451) T ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHH T ss_conf 9986888999998584569999999999869999998189999995999999999998 No 12 >PRK09099 type III secretion system ATPase; Provisional Probab=100.00 E-value=0 Score=943.95 Aligned_cols=431 Identities=28% Similarity=0.447 Sum_probs=408.6 Q ss_pred HHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECC--C---EEEEEEEECCCEEEEEEECCC Q ss_conf 89999999999842455542010358999827589997588877374799579--8---189999205985999980586 Q gi|254780617|r 5 AAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSH--G---VYGMALNLEVDNVGVVILGSY 79 (509) Q Consensus 5 ~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~--g---~~G~Vi~l~~d~v~i~~l~~~ 79 (509) .+|||.+++....+++..+.++++|+|++|.|++++++|++ +.+||+|+|++ + ..|+|++|+++.+.+++|+++ T Consensus 2 ~~~~~~~~~~~~~~l~~~~~~~~~G~V~~v~g~~i~~~G~~-a~iGelc~i~~~~g~~~~~aEVvgf~~~~~~l~p~~~~ 80 (441) T PRK09099 2 LAELSRLADAMERELAALAAVRRTGKVLEVIGTLLRVSGLD-VTLGELCELRQRDGTLLQRAEVVGFSRDVALLSPFGEL 80 (441) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEEEEEECCEEEEEECC-CCCCCEEEEECCCCCCCCEEEEEEEECCEEEEEECCCC T ss_conf 66899999999987504775304669999978199998178-88788699984899711025899986898999988786 Q ss_pred CCCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCC Q ss_conf 68678988897798788667946983000300231446644454441001013577311266001444160233101111 Q gi|254780617|r 80 KEISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPI 159 (509) Q Consensus 80 ~gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pi 159 (509) .||++|++|.+||++++||||++|||||+|++|+||||+|++...+++|+++.||||++|+++++||+||||+||+|+|| T Consensus 81 ~gi~~G~~V~~tg~~~~VpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~pi~~~~p~p~~R~~i~epl~TGIraID~l~pi 160 (441) T PRK09099 81 GGLSRGTRVIGLGRPLSVPVGPALLGRVIDGLGEPIDGRGPLDCDTLVPVIAAPPDPMSRRMIEAPLPTGVRIVDGLMTL 160 (441) T ss_pred CCCCCCCEEEECCCCCEEECCCCCCCCEECCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCEEECCCCCC T ss_conf 67889999997899637874704148885677655689978887658875167988200257787412785622366780 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCC Q ss_conf 56833655247788611589999999886302466787606999951677678999998511467421158997288788 Q gi|254780617|r 160 GRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDP 239 (509) Q Consensus 160 grGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~ 239 (509) |||||++|||++|+|||+|+ .||..+. +..+|||++||||+||+++|++.+.+.++|+||++|++|+|+| T Consensus 161 grGQR~gIf~gsGvGKstLl-~~iar~~---------~~dv~V~~lIGeRgrEv~efi~~~~~~~~l~~svvv~atsd~~ 230 (441) T PRK09099 161 GEGQRMGIFAPAGVGKSTLM-GMFARGT---------QCDVNVIALIGERGREVREFIELILGEDGMARSVVVCATSDRS 230 (441) T ss_pred CCCCEEEEECCCCCCHHHHH-HHHHHHH---------CCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCC T ss_conf 34763300158988789999-9898751---------3686168863721389999999976245632017986158899 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCC Q ss_conf 89999988766666675420468847996164789999987545303787543337520012467888875045445887 Q gi|254780617|r 240 APMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAG 319 (509) Q Consensus 240 ~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~G 319 (509) |.+|+++||+|||+||||||+|+|||++||||||||+|||||||++||||+++|||||+||.||+|+||||+.. +| T Consensus 231 ~~~R~~a~~~a~aiAEyfrd~Gk~VLl~~DslTr~A~A~REisl~~gepP~~~gYp~~vf~~l~~l~ERag~~~----~G 306 (441) T PRK09099 231 SIERAKAAYAATAIAEYFRDRGKRVLLMMDSLTRFARAQREIGLAAGEPPARRGFPPSVFAELPRLLERAGMGE----HG 306 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCC----CC T ss_conf 88999987750157889986698189970767799999999998657999856677209988789887403789----98 Q ss_pred CEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 35675310036766443114668854061699615012210266543344412203434389887755678887998876 Q gi|254780617|r 320 SLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYRE 399 (509) Q Consensus 320 SiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~E 399 (509) |||+||+|++++||++||||||++|||||||||||+||++||||||||+.|+||+|++++.+.|+++|+++|.+|++|+| T Consensus 307 SITai~tVl~~~dd~~~pi~d~~~si~DGhivLsr~la~~g~yPAIdvl~SvSR~~~~~~~~~~~~~a~~~r~~la~y~e 386 (441) T PRK09099 307 SITALYTVLAEDDSGSDPIAEEVRGILDGHMILSREIAARNQYPAIDVLGSLSRVMPQVVPREHVQAAGRLRRLLAKHRE 386 (441) T ss_pred CEEEEEEEECCCCCCCCCHHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 55344445506887778767877767063899858788669998717741554244344159999999999999998999 Q ss_pred HHHHHHHH---CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 78798763---3899899999999999999966999887488999999999866 Q gi|254780617|r 400 MSSFSKFS---SDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGIS 450 (509) Q Consensus 400 le~f~~fg---sdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~ 450 (509) +++|+++| ++.|+.++++++++++|++||+|+..++.++++++..|+.+.. T Consensus 387 ~e~li~~G~y~~G~d~~~d~ai~~~~~i~~fL~Q~~~e~~~~~~~~~~l~~lv~ 440 (441) T PRK09099 387 VETLLQVGEYRAGSDPVADEAIAKIDAIRDFLSQRTDELSDPDATLAALAELVD 440 (441) T ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 999997368668989789999987899999838998899899999999999851 No 13 >PRK06820 type III secretion system ATPase; Validated Probab=100.00 E-value=0 Score=939.41 Aligned_cols=429 Identities=26% Similarity=0.396 Sum_probs=403.6 Q ss_pred CHHHHHHHHHHHHHCCCC-CCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEEC-CCEEEEEEEECCCEEEEEEECCCCC Q ss_conf 889999999999842455-54201035899982758999758887737479957-9818999920598599998058668 Q gi|254780617|r 4 HAAEISDILRKRIKDFGN-DSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFS-HGVYGMALNLEVDNVGVVILGSYKE 81 (509) Q Consensus 4 ~~~eis~~l~~~i~~~~~-~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~-~g~~G~Vi~l~~d~v~i~~l~~~~g 81 (509) ...++++.+++++..+.. ...+++.|+|++|+|+++++. ++++++||+|+|+ +++.|+|++|+++++.+++|+++.| T Consensus 5 ~~~~~~~~~~~~~~~~~~p~~~vr~~G~V~~V~g~iv~~~-l~~~~~Gelc~i~~~g~~aeVv~~~~~~v~l~~~~~~~G 83 (445) T PRK06820 5 DIARLTPRLQQQLTRPSAPPEGLRYRGPIVEIGPTLLRAS-LPNVAQGELCRIEPQGMLAEVVSIEQEMALLSPFASSDG 83 (445) T ss_pred CHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECEEEEEE-CCCCCCCCEEEEECCCEEEEEEEEECCEEEEEECCCCCC T ss_conf 3678999999874167999998466769999927699996-588784898999279989999998789799998878878 Q ss_pred CCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCC Q ss_conf 67898889779878866794698300030023144664445444100101357731126600144416023310111156 Q gi|254780617|r 82 ISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGR 161 (509) Q Consensus 82 I~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigr 161 (509) |++|++|.+||++++||||++|||||+|++|+||||++ +...+++|+++++|||++|+++++||+||||+||+|+|||| T Consensus 84 i~~G~~V~~tG~~l~V~VG~~lLGRVvD~lG~PiDgg~-~~~~~~~pi~~~~p~p~~R~~i~~pL~TGIraID~l~pigr 162 (445) T PRK06820 84 LRCGQWVTPLGHMHRVQVGADLAGRILDGLGAPIDGGP-PLTGQWRELDCPPPSPLTRQPVEQMLTTGIRAIDGILSCGE 162 (445) T ss_pred CCCCCEEEECCCCEEEECCCCCCCCEECCCCCCCCCCC-CCCCCEEECCCCCCCCHHCCCCCCCCCCCCEEEECCCCCCC T ss_conf 89999999799960887282402787068875318998-87774320358996832026878511268546625677447 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHH Q ss_conf 83365524778861158999999988630246678760699995167767899999851146742115899728878889 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAP 241 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~ 241 (509) |||++|||++|+|||+| +.||.+|.. ..+|||++||||+||+++|++++.+.++++||++|+++||+||. T Consensus 163 GQR~gIfgg~GvGKT~L-l~~i~~~~~---------adv~V~~~IGeRgrEv~efie~~~~~~~~~rtvvV~atsd~p~~ 232 (445) T PRK06820 163 GQRIGIFAAAGVGKSTL-LSMLCADSA---------ADVMVLALIGERGREVREFLEQVLTPEARARTVVVVATSDRPAL 232 (445) T ss_pred CCCCEEECCCCCCCCHH-HHHHHHHCC---------CCEEEEEEEEEECHHHHHHHHHHHCCCCCCCEEEEECCCCCCHH T ss_conf 87421106899981223-445653067---------88799998744128999999986423761106999416768989 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCE Q ss_conf 99998876666667542046884799616478999998754530378754333752001246788887504544588735 Q gi|254780617|r 242 MQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSL 321 (509) Q Consensus 242 ~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSi 321 (509) +|++|||+|||+||||||+|+|||+++||+||||+|||||||+++|||+++|||||+||.||+|+||||+.. .||| T Consensus 233 ~R~~a~~~a~aiAEyfrd~Gk~VLli~DslTr~A~A~REisl~~gepP~~~gYp~~vf~~l~~L~ERag~~~----~GSI 308 (445) T PRK06820 233 ERLKGLSTATTVAEYFRERGLKVLLLADSLTRYARAAREIGLAAGEPPAAGSFPPSVFANLPRLLERTGNSD----RGSI 308 (445) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCC----CCCE T ss_conf 999998761148899986798489984465788889989998647999767778549988888887304788----9756 Q ss_pred EEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 67531003676644311466885406169961501221026654334441220343438988775567888799887678 Q gi|254780617|r 322 TALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMS 401 (509) Q Consensus 322 T~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele 401 (509) |+||+|++++||+|||||||++|||||||||||+||++||||||||+.|+||||++++.+.|+++|+++|..||+|+|+| T Consensus 309 Tal~~V~~~~dD~tdpi~d~~~silDG~ivLsr~La~~g~yPAIDvl~SvSRv~~~~~~~~h~~~a~~~r~~la~y~e~e 388 (445) T PRK06820 309 TAFYTVLVEGDDMNEPVADEVRSLLDGHIVLSRRLAGAGHYPAIDIAASVSRIMPQIVSAGQLAMAQKLRRMLACYQEIE 388 (445) T ss_pred EEEEEEEECCCCCCCCCHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 77888970588767771676756515389984879866999981874056454622202999999999999999999999 Q ss_pred HHHHHH---CCCCHHHHHHHHHHHHHHHHHCCCCC-----CCCCHHHHHHHHHHH Q ss_conf 798763---38998999999999999999669998-----874889999999998 Q gi|254780617|r 402 SFSKFS---SDLDSSTQKFLSKGERLTELLKQPQF-----SPLAMEEQVVMIFAG 448 (509) Q Consensus 402 ~f~~fg---sdlD~~t~~~l~rg~ri~e~LkQ~~~-----~p~~~~~qv~~l~a~ 448 (509) +|++|| .+.|+.++++++++++|.+||+|+.. ++.++++.+-.|..+ T Consensus 389 ~li~~G~y~~g~d~~~d~ai~~~~~i~~fL~Q~~~~~~~~~~~~~~~~~~~l~~~ 443 (445) T PRK06820 389 LLVQIGEYQAGEDLQADEALQRYPAICAFLQQDHSLTRDPDTAHLDTTLEHLAQV 443 (445) T ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 9998478767999899999987899999858999888887777989999999986 No 14 >TIGR01026 fliI_yscN ATPase FliI/YscN family; InterPro: IPR005714 Proteins in this entry show extensive homology to the ATP synthase F1 beta subunit, and are involved in type III protein secretion. They fall into the two separate functional groups outlined below. The first group, exemplified by the Salmonella typhimurium FliI protein (P26465 from SWISSPROT), is needed for flagellar assembly. Most structural components of the bacterial flagellum are translocated through the central channel of the growing flagellar structure by the type III flagellar protein-export apparatus in an ATPase-driven manner, to be assembled at the growing end. FliI is the ATPase that couples ATP hydrolysis to the translocation reaction , . The second group couples ATP hydrolysis to protein translocation in non-flagellar type III secretion systems. Often these systems are involved in virulence and pathogenicity. YscN (P40290 from SWISSPROT) from pathogenic Yersinia species, for example, energises the injection of antihost factors directly into eukaryotic cells, thus overcoming host defences .; GO: 0016887 ATPase activity, 0009058 biosynthetic process, 0015031 protein transport, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=950.56 Aligned_cols=430 Identities=27% Similarity=0.414 Sum_probs=396.9 Q ss_pred HHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEEC----CC--EEEEEEEECCCEEEEEEECCCCC Q ss_conf 9999999984245554201035899982758999758887737479957----98--18999920598599998058668 Q gi|254780617|r 8 ISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFS----HG--VYGMALNLEVDNVGVVILGSYKE 81 (509) Q Consensus 8 is~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~----~g--~~G~Vi~l~~d~v~i~~l~~~~g 81 (509) ....++++..+......++.+|+|.+|.|.++++.| |++.+||+|.+. ++ +.|||++|.++.+.+|||++..| T Consensus 4 ~~~~~~~~~~~~~d~~~~~~~G~v~~v~Gl~~ea~g-p~~~vG~~c~I~~~g~~~~~~~~EVVGf~~~~v~LmPy~~~~G 82 (455) T TIGR01026 4 NLTTLYNRLAQEADLEAVKRVGRVTKVKGLLIEAVG-PQASVGDLCLIERKGSEGKEVVAEVVGFNGEKVLLMPYEEVEG 82 (455) T ss_pred CHHHHHHHHHHHHHCCCEEEEEEEEEEEEEEEEEEC-CCCCCCCEEEEEEECCCCCEEEEEEEEEECCEEEECCCCCCCC T ss_conf 013488888753211630257899998526898524-7766777789997378987799998852067567523654443 Q ss_pred CCCCCEEEECC----CEEEEE---CCHHHHHCCCCHHHCCCCCCC--CCCCCCCC-HHCCCCCCCCCCCCCHHHHHCCHH Q ss_conf 67898889779----878866---794698300030023144664--44544410-010135773112660014441602 Q gi|254780617|r 82 ISEGDIVKRTG----RIVDVP---VGLELLGRVVDALGNPIDGKG--PIKCEQRS-CTEADAPGIIQRQSVCEPLSTGIK 151 (509) Q Consensus 82 I~~G~~V~~tg----~~~~vp---vG~~lLGRVvD~lG~PlDg~g--~i~~~~~~-~i~~~~p~~~~R~~i~~~l~TGI~ 151 (509) |.+|++|..++ +.+++| +|++|||||+||||+|||+++ .+...+.. ++..++||||+|.+|++.|.|||| T Consensus 83 ~~~G~~V~~~~isae~~L~~~qlP~G~~LLGRVld~~G~PiD~~~~~~~~~~~~~~~l~~~~~nPl~R~~i~~~~~tGVR 162 (455) T TIGR01026 83 VEPGSKVLAKNISAEEGLSIKQLPVGDGLLGRVLDGLGKPIDGKGKGFLDNVETEPALITAPINPLKREPIREILSTGVR 162 (455) T ss_pred CCCCCEEEEECCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHEECCCE T ss_conf 35345233204330025455743368764024485468711377776665430125524689886676300021003645 Q ss_pred HHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 33101111568336552477886115899999998863024667876069999516776789999985114674211589 Q gi|254780617|r 152 AIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIV 231 (509) Q Consensus 152 ~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvv 231 (509) +||+|+|+|||||+|||+|+||||||| ++||.. | ...+.+.|+++||||||||.||+++-...+.|+|||| T Consensus 163 ~iD~LLTvGkGQR~GIFAGSGVGKStL-lGMIAR----n----~~~ADv~ViALIGERGREV~EFIE~~LG~EGLkrSV~ 233 (455) T TIGR01026 163 SIDGLLTVGKGQRIGIFAGSGVGKSTL-LGMIAR----N----NTEADVNVIALIGERGREVKEFIEKDLGEEGLKRSVV 233 (455) T ss_pred EECCCCCCCCCCCEEEEECCCCCHHHH-HHHHHH----C----CCCCCEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEE T ss_conf 310103555665013550376003445-667630----0----6789827998643777741578863136566601179 Q ss_pred EEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 97288788899999887666666754204688479961647899999875453037875433375200124678888750 Q gi|254780617|r 232 VVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAK 311 (509) Q Consensus 232 v~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~ 311 (509) |+||||+||.+|+.++|+|++|||||||||||||++|||+||||+|+|||+|++||||.++|||||||..+|+|+||||+ T Consensus 234 VVaTSD~SPl~R~~GAy~At~iAEYFrdqGk~VlLlmDSvTRfA~AqREiGLA~GEPP~~kGYtPSVF~~LP~LlERaG~ 313 (455) T TIGR01026 234 VVATSDQSPLLRLKGAYVATAIAEYFRDQGKDVLLLMDSVTRFAMAQREIGLAAGEPPATKGYTPSVFSLLPRLLERAGK 313 (455) T ss_pred EEECCCCCHHHHHHHHHEEHHHHHHHHHCCCEEEEEHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCC T ss_conf 98368863888873264002543546521870562020278998898899874377777777785488998999976068 Q ss_pred CCCCCCCCCEEEEEEEECCCCCC-CCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHH Q ss_conf 45445887356753100367664-43114668854061699615012210266543344412203434389887755678 Q gi|254780617|r 312 MSDALGAGSLTALPVIETQVNDV-SAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAV 390 (509) Q Consensus 312 ~~~~~g~GSiT~~~~v~~~~~D~-~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~l 390 (509) -... +||||||||||+++|||+ +|||+|+++||+||||+|||+||++||||||||+.|+||++++..++.|++.++++ T Consensus 314 G~~~-~GGSITAFYTVLVeGDD~s~ePiAD~~r~ILDGHIVLSR~LA~~g~YPAIdvl~S~SR~~~~i~s~~~~~~~~~f 392 (455) T TIGR01026 314 GASG-KGGSITAFYTVLVEGDDLSNEPIADSVRGILDGHIVLSRALADRGHYPAIDVLASISRLMTAIVSEEHKKAARKF 392 (455) T ss_pred CCCC-CCCEEEEEEEEEECCCCCCCCCCCCCCCEEECCCEEECHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 7357-795798888886415678877724310225515355245886178978637555688988740748899999999 Q ss_pred HHHHHHHHHHHHHHHHHC---CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 887998876787987633---8998999999999999999669998874889999999998 Q gi|254780617|r 391 KGELAQYREMSSFSKFSS---DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAG 448 (509) Q Consensus 391 r~~laqy~Ele~f~~fgs---dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~ 448 (509) |..||.|++.|++++.|. +-|++.|++|++-+++..||+|+.+++..+|+.+-.|-.+ T Consensus 393 R~lLs~y~~~EdLiriG~Y~~GSd~e~D~AI~~~~~le~FL~Qg~~E~~~fE~s~~~L~~I 453 (455) T TIGR01026 393 RELLSKYKENEDLIRIGAYKKGSDKELDKAIAKYPKLERFLKQGINEKVNFEESLQQLEEI 453 (455) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH T ss_conf 9999988761323122025587886689998579998887547943463089999999873 No 15 >PTZ00185 ATPase alpha subunit; Provisional Probab=100.00 E-value=0 Score=948.82 Aligned_cols=439 Identities=49% Similarity=0.774 Sum_probs=405.3 Q ss_pred EEEEEEEEEECCEEEEE---CCCCCCCCCEEEEC--CC--EEEEEEEECCC-EEEEEEECCCCCCCCCCEEEECCCEEEE Q ss_conf 10358999827589997---58887737479957--98--18999920598-5999980586686789888977987886 Q gi|254780617|r 26 SEIGRVLSIGDGIARVY---GLNNIRAGEMVQFS--HG--VYGMALNLEVD-NVGVVILGSYKEISEGDIVKRTGRIVDV 97 (509) Q Consensus 26 ~~~G~V~~V~~giv~v~---GL~~a~~GElv~~~--~g--~~G~Vi~l~~d-~v~i~~l~~~~gI~~G~~V~~tg~~~~v 97 (509) ..+|.|-++.|.|+..- |-|++.++.++.+. .. ..|+++||++| .+++++||+..+|.+|+.|.+||+.++| T Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~GlalNLe~D~~VGvVllg~~~~I~eG~~V~rTG~il~v 117 (574) T PTZ00185 38 EMIGYVHSIDGTIATLIPAPGNPGVAYNTIIMIQVSPTTFAAGLVFNLEKDGRIGIILMDNITEVQSGQKVMATGKLLYI 117 (574) T ss_pred HHHEEEEECCCEEEEEECCCCCCCCEEEEEEEEEECCHHHHHHHEECCCCCCCEEEEEECCCCCCCCCCEEEEECCEEEE T ss_conf 54102453064368862079998736536899996641224230241356781349996684532478888751547887 Q ss_pred ECCHHHHHCCCCHHHCCCCCC------CCCCCCCC-CHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECC Q ss_conf 679469830003002314466------44454441-00101357731126600144416023310111156833655247 Q gi|254780617|r 98 PVGLELLGRVVDALGNPIDGK------GPIKCEQR-SCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGD 170 (509) Q Consensus 98 pvG~~lLGRVvD~lG~PlDg~------g~i~~~~~-~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~ 170 (509) |||+++|||||||||+|||++ +++..+.. +|++.++|++++|+|+++||+|||++||+|+|||||||++|+|| T Consensus 118 PVG~~~LGRVVdpLG~Pi~~~~~~~~r~~i~~~~~~~~iE~~APgI~~R~pV~ePl~TGikaIDamIPIGRGQRELIIGD 197 (574) T PTZ00185 118 PVGAGVLGKVVNPLGHEVPVGLLTRSRALLESEQTLGKVDAGAPNIVSRSPVNYNLLTGFKAVDTMIPIGRGQRELIVGD 197 (574) T ss_pred ECCCHHHCCEECCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCEECCCCCCHHHHHHHHHHHCCCCCCCCCEEEECC T ss_conf 52700203222567897777643344443200110353236799601057678455655678864366678852035557 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 78861158999999988630246678760699995167767899999851146742115899728878889999988766 Q gi|254780617|r 171 RKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAG 250 (509) Q Consensus 171 ~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a 250 (509) ++||||++++|+|+||+..|.....+.+|+||||+||||.++++++++.|++.|+|+||++|++++++|+.+||+|||+| T Consensus 198 RQTGKTaIAiDTIINQk~~n~~~~k~~~V~CIYVAIGQK~StVA~iv~~L~e~gAm~yTiIVaAtAsd~a~lQYLAPYaG 277 (574) T PTZ00185 198 RQTGKTSIAVSTIINQVRINQQILSKNAVISIYVSIGQRCSNVARIHRLLRSYGALRYTTVMAATAAEPAGLQYLAPYSG 277 (574) T ss_pred CCCCHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHH T ss_conf 76556899999998376434465778966999998541289999999999976972030899924777514423211012 Q ss_pred HHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECC Q ss_conf 66667542046884799616478999998754530378754333752001246788887504544588735675310036 Q gi|254780617|r 251 CAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQ 330 (509) Q Consensus 251 ~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~ 330 (509) ||||||||++|+|||++|||||+||.|||||||+++||||||+|||||||+||||+|||+|+++.+||||||+||+++|+ T Consensus 278 ~AmaEyFm~~G~dvLIVyDDLSKhA~AYRqmSLLLRRPPGREAYPGDVFYlHSRLLERAaKls~~~GgGSiTALPIIETq 357 (574) T PTZ00185 278 VTMGEYFMNRGRHCLCVYDDLSKQAVAYRQISLLLRRPPGREAYPGDVFYLHSRLLERAAMLSPGKGGGSVTALPIVETL 357 (574) T ss_pred HHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEECHHHHHHHHHCCCCCCCEEEEECEEEEEC T ss_conf 57778988569847999658078899999987206899998798876611101788888752445698004642069960 Q ss_pred CCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 76644311466885406169961501221026654334441220343438988775567888799887678798763389 Q gi|254780617|r 331 VNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFSSDL 410 (509) Q Consensus 331 ~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fgsdl 410 (509) ++|+|+|||||++|||||||||+++||++|++||||+++||||||+++|.++||++|+++|++||||||+.+-+--||.. T Consensus 358 aGDvSAYIPTNVISITDGQIfLes~LF~~GiRPAInVGlSVSRVGsaAQ~KAmK~VAG~Lkl~LAQYRElAA~~~~~~~~ 437 (574) T PTZ00185 358 SNDVTAYIVTNVISITDGQIYLDTKLFTGGQRPAVNIGLSVSRVGSSAQNVAMKAVAGKLKGILAEYRKLAADSVGGSQV 437 (574) T ss_pred CCCEEEEEECCEEEECCCEEEECHHHHHCCCCCCCCCCCCEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 78551010012587027718851767737988541468535416768899999997414458899999986332476324 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 98999999999999999669998874889999999998669806679899999999999 Q gi|254780617|r 411 DSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFL 469 (509) Q Consensus 411 D~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~ 469 (509) .|-. .-||.|+..++.|+ .|.-+....+.||++.|||||++.++..+-+|--+. T Consensus 438 --~~~~-~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (574) T PTZ00185 438 --QTVP-MIRGARFVALFNQK--NPSFFMNALVSLYACLNGYLDDVKVNYAKLYEYLLV 491 (574) T ss_pred --EEEC-CCCCCHHHHHHCCC--CCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf --5520-01232010114157--845999999999998615501542338989778851 No 16 >PRK08972 fliI flagellum-specific ATP synthase; Validated Probab=100.00 E-value=0 Score=924.30 Aligned_cols=431 Identities=25% Similarity=0.385 Sum_probs=404.8 Q ss_pred HHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECC---CEEEEEEEECCCEEEEEEECCCCC Q ss_conf 89999999999842455542010358999827589997588877374799579---818999920598599998058668 Q gi|254780617|r 5 AAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSH---GVYGMALNLEVDNVGVVILGSYKE 81 (509) Q Consensus 5 ~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~---g~~G~Vi~l~~d~v~i~~l~~~~g 81 (509) -.++.++++++-.++.+...+ ..|||++|.|.++++.|+ .+.+||+|.++. ...++|++|+++.+.+|+|+++.| T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~-~sGrv~~v~G~~ie~~g~-~~~iG~~c~i~~~~g~~~aEVvgf~~~~~~l~p~~~~~G 79 (440) T PRK08972 2 QHQLLNRLKQYQQKVPPFRAV-ASGQLVRVVGLTLEATGC-RAPVGSLCSIETMAGELEAEVVGFDGDLLYLMPIEELRG 79 (440) T ss_pred HHHHHHHHHHHHHCCCCCCCC-CEEEEEEEEEEEEEEEEC-CCCCCCEEEEECCCCCEEEEEEEECCCEEEEEECCCCCC T ss_conf 668999999997526778986-604899998258999816-898788789984998289999998299799998888888 Q ss_pred CCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCC Q ss_conf 67898889779878866794698300030023144664445444100101357731126600144416023310111156 Q gi|254780617|r 82 ISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGR 161 (509) Q Consensus 82 I~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigr 161 (509) |++|++|.+||++++||||++|||||+|++|+||||+|++....++|+++++|||++|++|++||+||||+||+|+|||| T Consensus 80 i~~G~~V~~tg~~~~VpVG~~lLGRVvD~lG~PiDg~g~~~~~~~~~i~~~~~~pl~R~~i~epl~TGIkaID~l~pigr 159 (440) T PRK08972 80 VLPGARVTPLGEQSGLPVGMSLLGRVLDGNGNPLDGLGPIYTDQRAPRHSPPINPLSRRPITEPLDVGVRAINAMLTVGK 159 (440) T ss_pred CCCCCEEEECCCCCEEECCCCCCCCEECCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCC T ss_conf 89999999789986788481311887578866358999999985213137997821037778610158345634367124 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHH Q ss_conf 83365524778861158999999988630246678760699995167767899999851146742115899728878889 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAP 241 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~ 241 (509) |||++||||+|+|||+| ++||.++.. ..+|||++||||+||+++|++++.+.+.|+||++|+++||+||. T Consensus 160 GQRigIf~gsGvGKS~L-l~~i~r~~~---------advvVi~lIGERgrEv~efie~~l~~~~~~rsvvV~atsd~~p~ 229 (440) T PRK08972 160 GQRMGLFAGSGVGKSVL-LGMMTRGTT---------ADVIVVGLVGERGREVKEFIEEILGEEGRARSVVVAAPADTSPL 229 (440) T ss_pred CCEEEECCCCCCCHHHH-HHHHHHCCC---------CCEEEEEEECEEHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHH T ss_conf 64666315789758999-988875147---------87899986231369999999998504774478999704888868 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCE Q ss_conf 99998876666667542046884799616478999998754530378754333752001246788887504544588735 Q gi|254780617|r 242 MQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSL 321 (509) Q Consensus 242 ~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSi 321 (509) +|+++||+|+||||||||+|+|||+++||+||||+|||||||++||||+++|||||+||.||+|+||||+.+. ++||| T Consensus 230 ~R~~a~~~A~tiAEyFrd~G~~VLl~~DslTR~A~A~REisl~~gepP~~~GYppsvf~~l~~LlERag~~~~--~~GSI 307 (440) T PRK08972 230 MRLKGCETATRIAEYFRDQGLNVLLLMDSLTRYAQAQREIALAVGEPPATKGYPPSVFAKLPALVERAGNGGP--GQGSI 307 (440) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC--CCEEE T ss_conf 8999999999889999967997799963556777888789987489987667764277773688875226899--97658 Q ss_pred EEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 67531003676644311466885406169961501221026654334441220343438988775567888799887678 Q gi|254780617|r 322 TALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMS 401 (509) Q Consensus 322 T~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele 401 (509) |+||+|++++||+|||||||++|||||||||||+||++||||||||+.|+||+|+.++.+.|+++|.++|.+|++|+|++ T Consensus 308 Ta~~tVlv~gdD~~dPi~d~~~silDGhIvLsr~La~~g~yPAIDvl~SvSRvm~~i~~~~h~~~a~~~r~~la~y~e~~ 387 (440) T PRK08972 308 TAFYTVLTEGDDLQDPIADASRAILDGHIVLSRELADSGHYPAIDIEASISRVMPMVISEEHLEAMRRVKQVYSLYQQNR 387 (440) T ss_pred EEEEEEECCCCCCCCCCHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 88566631588667872454465535079984879866999875875066552741068999999999999999888899 Q ss_pred HHHHHH---CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf 798763---389989999999999999996699988748899999999986 Q gi|254780617|r 402 SFSKFS---SDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGI 449 (509) Q Consensus 402 ~f~~fg---sdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~ 449 (509) +++++| ++.|+.+++++++.++|.+||+|++++|.++++++..|..+. T Consensus 388 dli~~g~y~~g~d~~~D~ai~~~~~i~~fL~Q~~~e~~~~~~~~~~l~~l~ 438 (440) T PRK08972 388 DLISIGAYAQGSDPRIDNAIRLQPAMNAFLQQTMKEAVPYDDSVNMLKQLA 438 (440) T ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH T ss_conf 999975966899989999998689999983899999979999999999986 No 17 >PRK06936 type III secretion system ATPase; Provisional Probab=100.00 E-value=0 Score=922.66 Aligned_cols=430 Identities=27% Similarity=0.402 Sum_probs=407.4 Q ss_pred CCHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECC-----CEEEEEEEECCCEEEEEEEC Q ss_conf 8889999999999842455542010358999827589997588877374799579-----81899992059859999805 Q gi|254780617|r 3 IHAAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSH-----GVYGMALNLEVDNVGVVILG 77 (509) Q Consensus 3 ~~~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~-----g~~G~Vi~l~~d~v~i~~l~ 77 (509) +..+-|.+.|++++.+.++ ++..|||++|.|.++++.| +++.+||+|++.+ .+.|+|++|+++.+.+++|+ T Consensus 2 ~~~~~~~~~l~~~~~~~~~---~~~~Grv~~v~G~~iea~~-~~~~iG~~c~i~~~~~~~~~~aEVVgf~~~~~~l~p~~ 77 (439) T PRK06936 2 LSLDYIPHHLRHAIVGSRL---IQIRGRVTQVTGTILKAVV-PGVRIGELCYLRNPDNSLSLQAEVIGFAQHQALLTPLG 77 (439) T ss_pred CCHHHHHHHHHHHHHCCCE---EEEEEEEEEEEEEEEEEEE-CCCCCCCEEEEECCCCCCEEEEEEEEEECCEEEEEECC T ss_conf 4277889999999826985---7788799999965899984-79997886899828998348999998838989999677 Q ss_pred CCCCCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCC Q ss_conf 86686789888977987886679469830003002314466444544410010135773112660014441602331011 Q gi|254780617|r 78 SYKEISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLI 157 (509) Q Consensus 78 ~~~gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~ 157 (509) ++.||++|++|.++|++++||||++|||||+|++|+||||++++...+++|++++||||++|+++++||+||||+||+|+ T Consensus 78 ~~~Gi~~G~~V~~~g~~~~vpvG~~lLGRV~D~~G~PlDg~~~~~~~~~~pi~~~~p~p~~R~~i~epl~TGIraID~ll 157 (439) T PRK06936 78 EMYGISSNTEVSPTGTMHQVGVGEHLLGRVLDGLGQPFDGGHPPEPAAWYPVYADAPAPMSRRLIETPLSLGVRVIDGLL 157 (439) T ss_pred CCCCCCCCCEEEECCCCCEEECCCCCCCCEECCCCCCCCCCCCCCCCCEEECCCCCCCCHHCCCCCCCCCCCCEEEECCC T ss_conf 86678999999978998677757042288888888755899999878762045889784202687874226855552468 Q ss_pred CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCC Q ss_conf 11568336552477886115899999998863024667876069999516776789999985114674211589972887 Q gi|254780617|r 158 PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASAS 237 (509) Q Consensus 158 pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~ 237 (509) |||||||++|||++|+|||+|+ .||++++. ..+|||++||||+||+++|++...+.+.|+||++|++||| T Consensus 158 t~g~GQR~gIfagsGvGKs~Ll-~~i~r~~~---------adv~V~alIGERgrEv~efie~~l~~~gl~rsvvV~atsd 227 (439) T PRK06936 158 TCGEGQRMGIFAAAGGGKSTLL-ASLIRSAE---------VDVTVLALIGERGREVREFIESDLGEEGLRKAVLVVATSD 227 (439) T ss_pred CHHCCCEEECCCCCCCCHHHHH-HHHHHHHC---------CCCEEEEEECCCHHHHHHHHHHHHHCCCCCEEEEEECCCC T ss_conf 6310570102369999889999-99976432---------6957999807308999999998641254310699961898 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCC Q ss_conf 88899999887666666754204688479961647899999875453037875433375200124678888750454458 Q gi|254780617|r 238 DPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALG 317 (509) Q Consensus 238 ~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g 317 (509) +||.+|+++||+|+||||||||+|+|||+++||+||||+|+|||||++||||+++|||||+|+.||+|+||||+.. T Consensus 228 ~p~~~R~~aa~~a~aiAEyFrd~G~~VLl~~DslTR~A~A~REI~l~~gepP~~~GYppsvf~~l~~l~ERag~~~---- 303 (439) T PRK06936 228 RPSMERAKAGFVATSIAEYFRDQGKRVLLLMDSVTRFARAQREIGLAAGEPPTRRGYPPSVFAALPRLMERAGQSD---- 303 (439) T ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCC---- T ss_conf 9989999999887788999985898489993567899999989998668999876779509988788887255899---- Q ss_pred CCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 87356753100367664431146688540616996150122102665433444122034343898877556788879988 Q gi|254780617|r 318 AGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQY 397 (509) Q Consensus 318 ~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy 397 (509) .||||+||+|++++||++|||||+++|||||||||||+||++||||||||+.|+||+|+.++.+.|+++|+++|.+|++| T Consensus 304 ~GSITa~~tVl~~gdD~~dPi~d~~~silDGhivLsR~la~~g~yPAIDvl~S~SRvm~~v~~~~h~~~a~~~r~~la~y 383 (439) T PRK06936 304 KGSITALYTVLVEGDDMTEPVADETRSILDGHIILSRKLAAANHYPAIDVLASASRVMNQIVSKEHKTWAGRLRELLAKY 383 (439) T ss_pred CCCEEEEEEEEECCCCCCCCCHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCCHHHCCHHHHHHHHHHHHHHHHH T ss_conf 97457654455048877777156566662606998487886699998187404544452122699999999999999989 Q ss_pred HHHHHHHHHH---CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 7678798763---3899899999999999999966999887488999999999866 Q gi|254780617|r 398 REMSSFSKFS---SDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGIS 450 (509) Q Consensus 398 ~Ele~f~~fg---sdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~ 450 (509) +|+|+|+++| .+.|+.+++++++.++|.+||+|+.+++.++++++..|+.+.+ T Consensus 384 ~e~edli~iG~y~~G~d~~~D~ai~~~~~i~~FL~Q~~~e~~~~~et~~~L~~l~~ 439 (439) T PRK06936 384 EEVELLLQIGEYQKGQDKEADQAIERIGAIRGFLRQGTHELSHFNETLNLLETLTQ 439 (439) T ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHC T ss_conf 99999998548657999889999986899999808999998499999999999749 No 18 >PRK07594 type III secretion system ATPase; Validated Probab=100.00 E-value=0 Score=923.45 Aligned_cols=425 Identities=26% Similarity=0.373 Sum_probs=398.5 Q ss_pred HHHHHHHH-CCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEEC-CCEEEEEEEECCCEEEEEEECCCCCCCCCCE Q ss_conf 99999984-245554201035899982758999758887737479957-9818999920598599998058668678988 Q gi|254780617|r 10 DILRKRIK-DFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFS-HGVYGMALNLEVDNVGVVILGSYKEISEGDI 87 (509) Q Consensus 10 ~~l~~~i~-~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~-~g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~ 87 (509) +-|.+++. ++.+-..++.+|||.+|.|.++++.| |++.+||+|++. ++..|||++|+++++.+++|+++.||++|++ T Consensus 3 ~~~~~~~~~~~~~~~g~~~~G~v~~v~G~~iea~~-~~~~~Gelc~i~~~~~~aEVvgf~~~~~~l~~~~~~~Gi~~G~~ 81 (433) T PRK07594 3 NELMQRLRLKYPPPDGYCRWGRIQDVSATLLNAWL-PGVFMGELCCIKPGEELAEVVGINGSKALLSPFTSTIGLHCGQQ 81 (433) T ss_pred HHHHHHHCCCCCCCCCCEEEEEEEEEECCEEEEEE-CCCCCCCEEEEEECCEEEEEEEEECCEEEEEECCCCCCCCCCCE T ss_conf 37889731668999985579899999663899986-89886887999758907999999799799997888777899999 Q ss_pred EEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEE Q ss_conf 89779878866794698300030023144664445444100101357731126600144416023310111156833655 Q gi|254780617|r 88 VKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELI 167 (509) Q Consensus 88 V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I 167 (509) |.+||++++||||++|||||+|++|+||||++. ....+++++..+|||++|+++++||+||||+||+|+|||||||++| T Consensus 82 V~~~g~~~~V~vG~~lLGRVvd~lG~PiDg~~~-~~~~~~~~~~~pp~p~~R~~i~epl~TGIkaID~l~pigrGQR~gI 160 (433) T PRK07594 82 VMALRRRHQVPVGEALLGRVIDGFGRPLDGREL-PDVCWKDYDAMPPPAMVRQPITQPLMTGIRAIDSVATCGEGQRVGI 160 (433) T ss_pred EEECCCCCEEECCHHHCCCEECCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEHHCCCCCCCCCEEEE T ss_conf 996899747872873327875788763589998-8876352567894940026778614367400013577124787420 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHH Q ss_conf 24778861158999999988630246678760699995167767899999851146742115899728878889999988 Q gi|254780617|r 168 IGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAP 247 (509) Q Consensus 168 ~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap 247 (509) |||+|||||+| +.||.||+. ..+|||++||||+||+++|++.+.+.++++||++|++|+|+||.+|+++| T Consensus 161 fgg~GvGKTtL-l~~i~~~~~---------adv~V~~lIGERgrEv~efie~~~~~~~~~rsvvV~atsd~p~~~R~~a~ 230 (433) T PRK07594 161 FSAPGVGKSTL-LAMLCNAPD---------ADSNVLVLIGERGREVREFIDFTLSEETRKRCVIVVATSDRPALERVRAL 230 (433) T ss_pred CCCCCCCHHHH-HHHHHHCCC---------CCCEEEEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHH T ss_conf 47899985589-999984247---------98159999410048899999986533662038999955889999999998 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEE Q ss_conf 76666667542046884799616478999998754530378754333752001246788887504544588735675310 Q gi|254780617|r 248 FAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVI 327 (509) Q Consensus 248 ~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v 327 (509) |+|+||||||||+|+|||+++|||||||+|+|||||++||||+++|||||+||.||+|+||||+.+ .||||+||+| T Consensus 231 ~~a~aiAEyFrd~G~~VLl~~DslTR~A~A~REisl~~gepP~~~gYppsvf~~l~~LlERag~~~----~GSITa~~tV 306 (433) T PRK07594 231 FVATTIAEFFRDNGKRVVLLADSLTRYARAAREIALAAGETAVSGEYPPGVFSALPRLLERTGMGE----KGSITAFYTV 306 (433) T ss_pred HHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCC----CCCEEEEEEE T ss_conf 776678999986696489973438889999878998647999878889549888788887513799----9756566556 Q ss_pred ECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 03676644311466885406169961501221026654334441220343438988775567888799887678798763 Q gi|254780617|r 328 ETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFS 407 (509) Q Consensus 328 ~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fg 407 (509) ++++||++||||||++|||||||||||+||++||||||||+.|+||+|+.++.++|+++|+++|.+|++|+|+|+|+++| T Consensus 307 lv~gdD~~dPi~d~~~silDGhIvLsr~La~~g~yPAIDvl~SvSRv~~~i~~~~h~~~a~~~r~~la~y~e~edli~~G 386 (433) T PRK07594 307 LVEGDDMNEPLADEVRSLLDGHIVLSRRLAERGHYPAIDVLATLSRVFPVVTSHEHRQLAAILRRCLALYQEVELLIRIG 386 (433) T ss_pred ECCCCCCCCCHHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 53588666862787776614599983899967999980886056444255462999999999999999999999999833 Q ss_pred C---CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 3---899899999999999999966999887488999999999866 Q gi|254780617|r 408 S---DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGIS 450 (509) Q Consensus 408 s---dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~ 450 (509) . +.|+++++++++.++|.+||+|+..++.++++++..|+.+.. T Consensus 387 aY~~G~d~~~D~ai~~~~~i~~fL~Q~~~e~~~~e~~~~~L~~ll~ 432 (433) T PRK07594 387 EYQRGVDTDTDKAIDTYPDICTFLRQSKDEVCGPELLIEKLHQILT 432 (433) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHC T ss_conf 9778999899999971489999847998999899999999999853 No 19 >PRK08472 fliI flagellum-specific ATP synthase; Validated Probab=100.00 E-value=0 Score=922.73 Aligned_cols=420 Identities=29% Similarity=0.446 Sum_probs=394.9 Q ss_pred HHHHH-HCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEEC---CC--EEEEEEEECCCEEEEEEECCCCCCCCC Q ss_conf 99998-4245554201035899982758999758887737479957---98--189999205985999980586686789 Q gi|254780617|r 12 LRKRI-KDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFS---HG--VYGMALNLEVDNVGVVILGSYKEISEG 85 (509) Q Consensus 12 l~~~i-~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~---~g--~~G~Vi~l~~d~v~i~~l~~~~gI~~G 85 (509) +++.+ .+|+.+++ +|+|++|.|+++++.|++ +.+||+|++. ++ ..|+|++|+++.+.+++|+++.||+.| T Consensus 6 ~~~~~~~~~~~s~~---yG~V~~V~G~li~v~G~~-~~iGe~~~I~~~~~g~~~~geVvg~~~~~v~l~~~~~~~Gi~~G 81 (435) T PRK08472 6 LKNKLNQHFSLSPR---FGSITKISANIIEARGLK-PSVGDIVKIVEENDGKECLGMVVVIEKEQFGISPFSFIEGFKIG 81 (435) T ss_pred HHHHHHHCCCCCCC---CCEEEEEECEEEEEEECC-CCCCCEEEEEECCCCCEEEEEEEEEECCEEEEEECCCCCCCCCC T ss_conf 99998735676677---868999957399999458-87678799997699967789999885998999983688789999 Q ss_pred CEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEE Q ss_conf 88897798788667946983000300231446644454441001013577311266001444160233101111568336 Q gi|254780617|r 86 DIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRE 165 (509) Q Consensus 86 ~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~ 165 (509) ++|.+||++++||||++|||||+|++|+||||+|++...+++|++++||||++|.++++||+||||+||+|+|||||||+ T Consensus 82 ~~V~~tg~~~~vpvg~~lLGRVvD~lG~PlDg~g~i~~~~~~pi~~~~~~p~~R~~i~epl~TGIraID~l~pigrGQR~ 161 (435) T PRK08472 82 DKVFISDEGLNIPVGRNLLGRVVDPLGRPIDGKGAIDYTEYAPIMRAPIAAMKRGLIDEVFSVGVKSIDGLLTCGKGQKL 161 (435) T ss_pred CEEEECCCCCEEECCCCCCCCEECCCCCCCCCCCCCCCCCEEECCCCCCCHHHHCCCCCCCCCCCEEEECCCCCCCCCEE T ss_conf 99998999737875834027887778864589999887623011478988045168887411585577432662477665 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHH Q ss_conf 55247788611589999999886302466787606999951677678999998511467421158997288788899999 Q gi|254780617|r 166 LIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLL 245 (509) Q Consensus 166 ~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ 245 (509) +||||+|+|||+|+ .||.++.. ..+|||++||||+||+++|++... .+.++||++|+++||+||.+|++ T Consensus 162 gIf~g~GvGKt~Ll-~~i~~~~~---------~dv~V~~lIGeR~rEv~efie~~~-~~~l~~tvvvvatsd~p~~~r~~ 230 (435) T PRK08472 162 GIFAGSGVGKSTLM-GMIVKGCL---------APIKVVALIGERGREIPEFIEKNL-GGDLSNTVIVVATSDDSPLMRKY 230 (435) T ss_pred ECCCCCCCCCCHHH-HHHHHHHC---------CCEEEEEEECEEHHHHHHHHHHHH-HHCCCCEEEEEECCCCCHHHHHH T ss_conf 12358999833578-88877615---------998899761312278999999874-11322259998468899899988 Q ss_pred HHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEE Q ss_conf 88766666675420468847996164789999987545303787543337520012467888875045445887356753 Q gi|254780617|r 246 APFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALP 325 (509) Q Consensus 246 ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~ 325 (509) |||+|||+||||||||+|||+++|||||||+||||||++++|||+++|||||+||.||+|+||||+. .|+||||+|| T Consensus 231 a~~~a~aiAEyfrd~G~~VLl~~Dsltr~A~A~REisl~~ge~P~~~gYp~~vf~~l~~l~ERag~~---~~~GSIT~i~ 307 (435) T PRK08472 231 GAFCAMSVAEYFKNQGLDVLFIMDSVTRFAMAQREIGLALGEPPTSKGYPPSVLSLLPQLMERAGKE---EGKGSITAFF 307 (435) T ss_pred HHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCC---CCCCCEEEEE T ss_conf 8889999999998679848998436278899999999865899975563835776657888864258---8985148887 Q ss_pred EEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 10036766443114668854061699615012210266543344412203434389887755678887998876787987 Q gi|254780617|r 326 VIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSK 405 (509) Q Consensus 326 ~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~ 405 (509) +|++++||+|||||||++|||||||||||+||++||||||||+.|+||+|++++.+.|+++|+++|.+|++|+|++.+++ T Consensus 308 ~v~~~gdD~~~pi~d~~~si~DG~ivLsr~La~~g~yPAIdvl~S~SR~~~~~~~~~h~~~a~~~r~~~a~y~e~e~li~ 387 (435) T PRK08472 308 TVLVEGDDMSDPIADQSRSILDGHIVLSRELTDFGIYPPINILNSASRVMKDIISPEHNLWARKFKRLYSLLKENEVLIR 387 (435) T ss_pred EEEECCCCCCCCCHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78504887677745555454060799978798669999848631454414001369999999999999999898999999 Q ss_pred HHC---CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf 633---89989999999999999996699988748899999999986 Q gi|254780617|r 406 FSS---DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGI 449 (509) Q Consensus 406 fgs---dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~ 449 (509) +|. +.|++++.++.+++++++||+|+++++.++++++..|+.+. T Consensus 388 ig~y~~g~d~~~d~ai~~~~~i~~fL~Q~~~e~~~~eet~~~l~~il 434 (435) T PRK08472 388 IGAYQKGSDKELDEAISKKELMEQFLKQNENELFPFEQSFELLEEIL 434 (435) T ss_pred HHCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHH T ss_conf 64865799989999998479999977999999979999999999973 No 20 >PRK08927 fliI flagellum-specific ATP synthase; Validated Probab=100.00 E-value=0 Score=914.34 Aligned_cols=427 Identities=26% Similarity=0.382 Sum_probs=400.8 Q ss_pred HHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCC-CCCCCCEEEECC--C--EEEEEEEECCCEEEEEEECCCCCCCC Q ss_conf 999999842455542010358999827589997588-877374799579--8--18999920598599998058668678 Q gi|254780617|r 10 DILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLN-NIRAGEMVQFSH--G--VYGMALNLEVDNVGVVILGSYKEISE 84 (509) Q Consensus 10 ~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~-~a~~GElv~~~~--g--~~G~Vi~l~~d~v~i~~l~~~~gI~~ 84 (509) +.++.++.+.+. ++.+|||++|.|.++++.|+. .+.+|++|++.. + +.|+|++|+++++.+++|+++.||+. T Consensus 3 ~~~~~~~~~~~~---~~~~GrV~~V~G~~vev~g~~~~~~iG~~c~I~~~~g~~i~aEVvgf~~~~~~l~~~~~t~Gi~~ 79 (441) T PRK08927 3 KALAAQIGDIDG---VNIYGRVVGVRGLLVEVAGPIHAMSVGARIVVETGDGREIPCEVIGFRGDRALLMPFGPLEGVRR 79 (441) T ss_pred HHHHHHHHCCCC---EEEEEEEEEEECEEEEEEECCCCCCCCCEEEEECCCCCEEEEEEEEECCCEEEEEECCCCCCCCC T ss_conf 899999844897---15778999997708999805777775888999908998899999988598799998888778899 Q ss_pred CCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCC-CCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCC Q ss_conf 98889779878866794698300030023144664445-44410010135773112660014441602331011115683 Q gi|254780617|r 85 GDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIK-CEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQ 163 (509) Q Consensus 85 G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~-~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQ 163 (509) |++|.+||++++||||++|||||+|++|+||||+|++. ...++|+++++|||++|.++++||+||||+||+|+|||||| T Consensus 80 G~~V~~tg~~~~vpvg~~lLGRVid~lG~PiDg~g~~~~~~~~~pi~~~~p~p~~R~~i~e~L~TGIraID~l~pigrGQ 159 (441) T PRK08927 80 GCRAVIANAAAQVRPSAAWLGRVVNALGEPIDGKGPLPQGPVPYPLRAPPPPAHSRARVGPPLDLGVRALNTFLTCCRGQ 159 (441) T ss_pred CCEEEECCCCCEEECCCCCCCCEECCCCCCCCCCCCCCCCCEECCCCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHCCCC T ss_conf 99999899998777786642878776711158999888876004024789684661777763103601565330020364 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHH Q ss_conf 36552477886115899999998863024667876069999516776789999985114674211589972887888999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQ 243 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r 243 (509) |++|||++|+|||+|+ .||.++.. ..+|||++||||+||+++|+++....+.++||++|++|||+||++| T Consensus 160 RigIfagsGvGKs~Ll-~~i~r~~~---------adv~VialIGeRgrEv~efi~~~l~~~~l~rsvvV~atsd~p~~~R 229 (441) T PRK08927 160 RLGIFAGSGVGKSVLL-SMLARNTD---------ADVSVIGLIGERGREVQEFLQDDLGPEGLARSVVVVATSDEPALMR 229 (441) T ss_pred EEEECCCCCCCHHHHH-HHHHHHCC---------CCCEEEEECCCCHHHHHHHHHHHHCCCCCCCEEEEEECCCCCHHHH T ss_conf 6663168999878999-99986416---------7737875226518999999998720245224589996378987887 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEE Q ss_conf 99887666666754204688479961647899999875453037875433375200124678888750454458873567 Q gi|254780617|r 244 LLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTA 323 (509) Q Consensus 244 ~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~ 323 (509) +++||+|+||||||||||||||+++|||||||+|+||||++++|||+++|||||+|+.|++|+||||+. .++||||+ T Consensus 230 ~~a~~~A~aiAEyfrd~G~~VLl~~Dsltr~A~A~REisl~~ge~P~~~gYppsvf~~l~~l~ERag~~---~~~GSIT~ 306 (441) T PRK08927 230 RQAAYLTLAIAEYFRDQGKDVLCLMDSVTRFAMAQREIGLSAGEPPTTKGYTPTVFAELPRLLERAGPG---LGEGTITG 306 (441) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCC---CCCCEEEC T ss_conf 999987558999999779967998346028889999999755899987787861787717888761357---89950303 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 53100367664431146688540616996150122102665433444122034343898877556788879988767879 Q gi|254780617|r 324 LPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSF 403 (509) Q Consensus 324 ~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f 403 (509) +|+|++++||+||||||+++|||||||||||+||++||||||||+.|+||+|++++.+.|+++|+++|.+|++|+|+++| T Consensus 307 ~~tVl~~gdD~~dPi~d~~~~ilDGhIvLsr~La~~G~yPAIDvl~S~SR~~~~~~~~~h~~~a~~~r~~la~y~e~edl 386 (441) T PRK08927 307 LFTVLVDGDDHNEPVADAVRGILDGHIVMERAIAERGRYPAINVLKSVSRTMPGCNDPAELPLVRRARQLMATYADMEEL 386 (441) T ss_pred EEEEEECCCCCCCCCHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 03576127876787245446563647999687986699998087625644240002799999999999999988899999 Q ss_pred HHHHC---CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCC Q ss_conf 87633---89989999999999999996699988748899999999986698 Q gi|254780617|r 404 SKFSS---DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGC 452 (509) Q Consensus 404 ~~fgs---dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ 452 (509) +++|. +.|+.++++++++++|.+||+|+.+++.++++++..|..+..|. T Consensus 387 i~~G~y~~g~d~~~D~ai~~~~~i~~fL~Q~~~e~~~~~e~~~~L~~~~~g~ 438 (441) T PRK08927 387 IRLGAYRAGSDPEVDEAIRLHPPLEAFLRQGKDEATSLAEGYAQLAQILGGP 438 (441) T ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCC T ss_conf 9973864799989999998679999984899889989999999999986787 No 21 >TIGR03496 FliI_clade1 flagellar protein export ATPase FliI. Members of this protein family are the FliI protein of bacterial flagellum systems. This protein acts to drive protein export for flagellar biosynthesis. The most closely related family is the YscN family of bacterial type III secretion systems. This model represents one (of three) segment of the FliI family tree. These have been modeled separately in order to exclude the type III secretion ATPases more effectively. Probab=100.00 E-value=0 Score=915.75 Aligned_cols=404 Identities=30% Similarity=0.453 Sum_probs=386.0 Q ss_pred EEEEEEECCEEEEECCCCCCCCCEEEEC--CC--EEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHHHH Q ss_conf 5899982758999758887737479957--98--1899992059859999805866867898889779878866794698 Q gi|254780617|r 29 GRVLSIGDGIARVYGLNNIRAGEMVQFS--HG--VYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLELL 104 (509) Q Consensus 29 G~V~~V~~giv~v~GL~~a~~GElv~~~--~g--~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lL 104 (509) |||.+|.|.+++++|+ ++.+||+|++. ++ +.|||++|+++.+.+++|+++.||+.|++|.+|+++++||||++|| T Consensus 1 GrV~~V~G~vIea~g~-~~~IGe~c~I~~~~g~~i~aEVVgf~~~~v~l~~~~~~~Gi~~G~~V~~tg~~~~v~vg~~lL 79 (411) T TIGR03496 1 GRLTRVVGLVLEAVGL-RAPVGSRCEIESADGDPIEAEVVGFSGDRVLLMPLEDVEGLRPGARVFPLEGPLRLPVGDSLL 79 (411) T ss_pred CEEEEEEEEEEEEECC-CCCCCCEEEEEECCCCEEEEEEEEECCCEEEEEECCCCCCCCCCCEEEECCCCCEEECCHHHC T ss_conf 9799998018999737-999788899993899789999999729979999866887888899999789966676387653 Q ss_pred HCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 30003002314466444544410010135773112660014441602331011115683365524778861158999999 Q gi|254780617|r 105 GRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 105 GRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |||+|++|+||||++++...++||++++||||++|.++++||+||||+||+|+|||||||++|||++|+|||+| +.||. T Consensus 80 GRVid~lG~PlDg~~~~~~~~~~pi~~~~p~p~~R~~i~epl~TGIraID~l~pig~GQR~gIf~g~GvGKs~L-l~~ia 158 (411) T TIGR03496 80 GRVIDGLGRPLDGKGPLDAGERVPLYAPPINPLKRAPIDEPLDVGVRAINGLLTVGRGQRIGIFAGSGVGKSVL-LGMMA 158 (411) T ss_pred CCEECCCCCCCCCCCCCCCCCEEEEECCCCCHHHCCCCCCCCCCCCEEEECCCCCCCCCEEEECCCCCCCHHHH-HHHHH T ss_conf 78857888765777888878456500799681431787762115834663446614686313227999867799-99997 Q ss_pred HHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 98863024667876069999516776789999985114674211589972887888999998876666667542046884 Q gi|254780617|r 185 NQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHA 264 (509) Q Consensus 185 nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~V 264 (509) ++.. ..++||++||||+||+.+|++++...++|+||++|++|||+||.+|+++||+||||||||||+|+|| T Consensus 159 ~~~~---------adv~VialIGeR~rev~efie~~~~~~~~~~tvvv~atsd~p~~~r~~a~~~a~aiAEyfrd~G~~V 229 (411) T TIGR03496 159 RYTE---------ADVVVVGLIGERGREVKEFIEEILGEEGLARSVVVAATADESPLMRLRAAFYATAIAEYFRDQGKDV 229 (411) T ss_pred HCCC---------CCEEEEECCCCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCE T ss_conf 5348---------9889995325305899999998622256521699994388998999999999999999999669946 Q ss_pred EEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHH Q ss_conf 79961647899999875453037875433375200124678888750454458873567531003676644311466885 Q gi|254780617|r 265 LIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVIS 344 (509) Q Consensus 265 Li~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~s 344 (509) |+++|||||||+|+|||||++||||+++|||||+||.||+|+||||+... ++||||+||+|++++||+|||||||++| T Consensus 230 Ll~~Dsltr~A~A~REisl~~ge~P~~~gyp~~vf~~l~~l~ERag~~~~--~~GSIT~~~~V~~~~dD~~dpi~d~~~~ 307 (411) T TIGR03496 230 LLLMDSLTRFAMAQREIALAIGEPPATKGYPPSVFAKLPRLVERAGNGEE--GKGSITAFYTVLVEGDDQQDPIADAARA 307 (411) T ss_pred EEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCC--CCEEEEEEEEEECCCCCCCCCHHHHHHH T ss_conf 99974688999999899987379998667885388887788874368999--8986877358975798756645776565 Q ss_pred HHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH---CCCCHHHHHHHHHH Q ss_conf 406169961501221026654334441220343438988775567888799887678798763---38998999999999 Q gi|254780617|r 345 ITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFS---SDLDSSTQKFLSKG 421 (509) Q Consensus 345 i~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fg---sdlD~~t~~~l~rg 421 (509) ||||||||||+||++||||||||+.|+||+|++++.+.|+++|+++|.+|++|+|++++.++| ++.|+++++++.++ T Consensus 308 ilDG~ivLsr~la~~g~yPAIdvl~S~SR~~~~~~~~~~~~~a~~~r~~la~y~e~~~li~~g~y~~g~d~~~d~ai~~~ 387 (411) T TIGR03496 308 ILDGHIVLSRELAEQGHYPAIDILASISRVMPDVVSPEHLQLARRFKQLLSRYQENRDLISIGAYQPGSDPELDQAIALY 387 (411) T ss_pred HEEEEEEECHHHHHCCCCCCCCCCCCHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 14048999398996799997186405644233216899999999999999998999999997472369998999999979 Q ss_pred HHHHHHHCCCCCCCCCHHHHHHHH Q ss_conf 999999669998874889999999 Q gi|254780617|r 422 ERLTELLKQPQFSPLAMEEQVVMI 445 (509) Q Consensus 422 ~ri~e~LkQ~~~~p~~~~~qv~~l 445 (509) ++|++||+|+.+++.++++++..| T Consensus 388 ~~i~~fL~Q~~~e~~~~~~~~~~l 411 (411) T TIGR03496 388 PRIEAFLQQGMRERASFEESLEAL 411 (411) T ss_pred HHHHHHCCCCCCCCCCHHHHHHHC T ss_conf 999997289999997999999649 No 22 >PRK07721 fliI flagellum-specific ATP synthase; Validated Probab=100.00 E-value=0 Score=909.80 Aligned_cols=423 Identities=26% Similarity=0.407 Sum_probs=398.2 Q ss_pred HHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEEC--C--C--EEEEEEEECCCEEEEEEECCCCCCCCCCE Q ss_conf 9984245554201035899982758999758887737479957--9--8--18999920598599998058668678988 Q gi|254780617|r 14 KRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFS--H--G--VYGMALNLEVDNVGVVILGSYKEISEGDI 87 (509) Q Consensus 14 ~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~--~--g--~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~ 87 (509) +.+..++....++.+|||++|.|.++++.|. ++.+||+|.+. + + ..|+|++|+++.+.+++|+++.||+.|++ T Consensus 3 ~~~~~~~~~~~~~~~GrV~~I~G~lIea~g~-~~~iGelc~I~~~~~~g~~i~aEVVgf~~~~v~l~p~~~~~GI~~G~~ 81 (435) T PRK07721 3 QLIDLIETLDPYKRYGKVKRVIGLMIESKGP-ESSIGDVCYIHTKGKGGKKIKAEVVGFKDENILLMPYLEAANIAPGSL 81 (435) T ss_pred HHHHHHHCCCCCCCCEEEEEEECEEEEEEEC-CCCCCCEEEEEECCCCCCEEEEEEEEECCCEEEEEECCCCCCCCCCCE T ss_conf 7999986279752312899998638999957-888434179996479997899999987698899998768889999999 Q ss_pred EEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEE Q ss_conf 89779878866794698300030023144664445444100101357731126600144416023310111156833655 Q gi|254780617|r 88 VKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELI 167 (509) Q Consensus 88 V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I 167 (509) |..+|+.++||||++|||||+|++|+||||++++...+++|+++++|||++|.++++||+||||+||+|+|||||||++| T Consensus 82 V~~~g~~~~vpvg~~lLGRV~d~lG~PiDg~~~~~~~~~~~i~~~~pnp~~R~~i~e~l~TGIraID~l~tigkGQRigI 161 (435) T PRK07721 82 VEATGEPLRVKVGSGLIGQVVDAFGEPLDGSALPKGLAPVSTDQSPPNPMKRPPIREKMEVGVRAIDSLLTVGKGQRVGI 161 (435) T ss_pred EEECCCCCEEECCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEE T ss_conf 99589976677275322451045654358988888765352558997850147988630378565226476035754100 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHH Q ss_conf 24778861158999999988630246678760699995167767899999851146742115899728878889999988 Q gi|254780617|r 168 IGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAP 247 (509) Q Consensus 168 ~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap 247 (509) ||++|+|||+| +.||.+|.. ..+|||++||||+||+.+|++.....+.++||++|++|||+||++|+++| T Consensus 162 f~gsGvGKs~L-l~~iar~~~---------adv~V~~liGeR~rEv~efie~~l~~~~l~rsvvv~atsd~p~~~R~~a~ 231 (435) T PRK07721 162 FAGSGVGKSTL-MGMIARQTS---------ADLNVIALIGERGREVREFIERDLGEEGLKRSIVVVATSDQPALMRIKGA 231 (435) T ss_pred CCCCCCCHHHH-HHHHHHHHC---------CCEEEEEEECCCHHHHHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHH T ss_conf 26899878999-999988744---------88258999332028999999975221555323999943889999999999 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEE Q ss_conf 76666667542046884799616478999998754530378754333752001246788887504544588735675310 Q gi|254780617|r 248 FAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVI 327 (509) Q Consensus 248 ~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v 327 (509) |+||||||||||+|+|||+++|||||||+|+||||+++||||+++|||||+|+.|++|+||||+. +.||||+||+| T Consensus 232 ~~a~tiAEyfrd~G~~VLl~~Dsltr~A~A~REisl~~ge~P~~~gYppsvf~~l~~l~ERag~~----~~GSIT~i~~V 307 (435) T PRK07721 232 YTATAIAEYFRDKGQNVMLMMDSVTRVAMAQREIGLAVGEPPTTKGYTPSVFAILPKLLERTGTN----ESGTITAFYTV 307 (435) T ss_pred HHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCC----CCCCEEEEEEE T ss_conf 99888999999679978995163889999999999854899986677845998878888851379----99745777545 Q ss_pred ECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 03676644311466885406169961501221026654334441220343438988775567888799887678798763 Q gi|254780617|r 328 ETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFS 407 (509) Q Consensus 328 ~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fg 407 (509) ++++||+||||||+++|||||||||||+||++||||||||+.|+||+|+.++.+.|+++|+++|.+|++|+|+++++++| T Consensus 308 ~~~gdD~~dPi~d~~~~ilDG~ivLsr~la~~g~yPAIDvl~S~SR~~~~i~~~~h~~~a~~~r~~la~y~e~~~li~~g 387 (435) T PRK07721 308 LVDGDDMNEPIADTVRGILDGHFVLDRALANKGQFPAVNVLKSISRVMSNIATKEHKEAANRFRELLSTYQNSEDLINIG 387 (435) T ss_pred ECCCCCCCCCCHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 04798666771687877716179986889866999980886166543222302999999999999999889999999974 Q ss_pred C---CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 3---8998999999999999999669998874889999999998669 Q gi|254780617|r 408 S---DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISG 451 (509) Q Consensus 408 s---dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G 451 (509) . +.|+.+++++++..+|.+||+|+.+++.++++++..|+.+.++ T Consensus 388 ~y~~g~d~~~D~ai~~~~~i~~fL~Q~~~e~~~~~~~~~~l~~l~~~ 434 (435) T PRK07721 388 AYKRGSSREIDEAIQFYPQLISFLKQEVDEKATLEESIQALKQLTGR 434 (435) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCC T ss_conf 87679998899999868999998489999998999999999998677 No 23 >TIGR03498 FliI_clade3 flagellar protein export ATPase FliI. Members of this protein family are the FliI protein of bacterial flagellum systems. This protein acts to drive protein export for flagellar biosynthesis. The most closely related family is the YscN family of bacterial type III secretion systems. This model represents one (of three) segment of the FliI family tree. These have been modeled separately in order to exclude the type III secretion ATPases more effectively. Probab=100.00 E-value=0 Score=908.04 Aligned_cols=408 Identities=28% Similarity=0.443 Sum_probs=388.4 Q ss_pred EEEEEEECCEEEEECCCC-CCCCCEEEECC--C--EEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHHH Q ss_conf 589998275899975888-77374799579--8--189999205985999980586686789888977987886679469 Q gi|254780617|r 29 GRVLSIGDGIARVYGLNN-IRAGEMVQFSH--G--VYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLEL 103 (509) Q Consensus 29 G~V~~V~~giv~v~GL~~-a~~GElv~~~~--g--~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~l 103 (509) |||++|.|++++|.|++. +.+||+|++.. + ..|+|++|+++.+.+++|+++.||++|++|.+||++++||||++| T Consensus 1 GrV~~I~G~viev~g~~~~~~iGe~c~I~~~~g~~~~aEVvg~~~~~v~l~~~~~t~Gi~~G~~V~~tg~~~~v~vg~~l 80 (418) T TIGR03498 1 GRVTAVTGLLIEVRGLSRAVRLGDRCAIRARDGRPVLAEVVGFNGDRVLLMPFEPLEGVGLGCAVFAREGPLAVRPHPSW 80 (418) T ss_pred CEEEEEECCEEEEEECCCCCCCCCEEEEECCCCCEEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCCCEEECCHHH T ss_conf 98999978399998156667768889999199988999999981998999990798898999999968997488817422 Q ss_pred HHCCCCHHHCCCCCCCCCCC-CCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHH Q ss_conf 83000300231446644454-44100101357731126600144416023310111156833655247788611589999 Q gi|254780617|r 104 LGRVVDALGNPIDGKGPIKC-EQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDT 182 (509) Q Consensus 104 LGRVvD~lG~PlDg~g~i~~-~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~ 182 (509) ||||+|++|+||||+|++.. ..++|+++++|||++|.++++||+||||+||+|+|||||||++|||++|+|||+| +.| T Consensus 81 LGRVid~~G~PlDg~g~~~~~~~~~~i~~~~p~p~~R~~i~e~l~TGIraID~l~pig~GQRigIfgg~GvGKt~L-l~~ 159 (418) T TIGR03498 81 LGRVINALGEPIDGKGPLPQGERRYPLRASPPPAMSRKRVGEPLDTGVRVIDTFLPLCRGQRLGIFAGSGVGKSTL-LSM 159 (418) T ss_pred CCCEECCCCCCCCCCCCCCCCCEEEEEECCCCCCHHCCCCCCCCCCCCCHHHHCCCCCCCCEEECCCCCCCCHHHH-HHH T ss_conf 0517544676357899987775687651689982210688874315760212015502576641137899988899-999 Q ss_pred HHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99988630246678760699995167767899999851146742115899728878889999988766666675420468 Q gi|254780617|r 183 FLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGY 262 (509) Q Consensus 183 I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~ 262 (509) |.++.. ..+|||++||||+||+++|++++...++|+||++|++|+|+||++|+++||+|||+||||||+|+ T Consensus 160 i~~~~~---------adv~V~aliGeRgrEv~efi~~~~~~~~l~~tvvv~atsd~p~~~R~~a~~~a~aiAEyFrd~G~ 230 (418) T TIGR03498 160 LARNTD---------ADVVVIALVGERGREVREFLEDDLGEEGLKRSVVVVATSDESPLMRRQAAYTATAIAEYFRDQGK 230 (418) T ss_pred HHHHCC---------CCEEEEEEEEEEHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 987507---------98689988302689999999986531564426999944889878888888787789999986798 Q ss_pred CEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHH Q ss_conf 84799616478999998754530378754333752001246788887504544588735675310036766443114668 Q gi|254780617|r 263 HALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNV 342 (509) Q Consensus 263 ~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~ 342 (509) |||+++|||||||+||||||++++|||+++|||||+||.||+|+||||+.+. ++||||+||+|++++||+||||||++ T Consensus 231 ~VLl~~Dsltr~A~A~REisl~~ge~P~~~gyp~~~f~~l~~l~ERag~~~~--~~GSIT~~~~v~~~~dD~~dpi~d~~ 308 (418) T TIGR03498 231 DVLLLMDSVTRFAMAQREIGLAAGEPPVARGYTPSVFSELPRLLERAGPGAE--GKGSITGIFTVLVDGDDHNEPVADAV 308 (418) T ss_pred CEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC--CCEEEEEEEEEECCCCCCCCCCHHHH T ss_conf 3899955725889999999986589998677884177772687865136899--98569999999633888677617777 Q ss_pred HHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC---CCCHHHHHHHH Q ss_conf 854061699615012210266543344412203434389887755678887998876787987633---89989999999 Q gi|254780617|r 343 ISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFSS---DLDSSTQKFLS 419 (509) Q Consensus 343 ~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fgs---dlD~~t~~~l~ 419 (509) +|||||||||||+||++||||||||+.|+||+|++++.+.|+++|+++|.+|++|+|+++++++|. +.|++++.+++ T Consensus 309 ~~i~Dg~ivLsr~la~~g~yPAIdvl~S~SR~~~~~~~~~~~~~a~~~r~~la~y~e~~dli~~g~y~~g~d~~~d~ai~ 388 (418) T TIGR03498 309 RGILDGHIVLDRAIAERGRYPAINVLASVSRLAPRVWSPEERKLVRRLRALLARYEETEDLIRLGAYRKGSDPELDEAIR 388 (418) T ss_pred HHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHH T ss_conf 67626389984889867999970885266444854356999999999999999889999999974866799988999999 Q ss_pred HHHHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 99999999669998874889999999998 Q gi|254780617|r 420 KGERLTELLKQPQFSPLAMEEQVVMIFAG 448 (509) Q Consensus 420 rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~ 448 (509) ++++|++||+|+.+++.++++++..|+++ T Consensus 389 ~~~~i~~fL~Q~~~e~~~~~~~~~~l~~i 417 (418) T TIGR03498 389 LVPKIYEFLTQGPDEPTSLQDPFADLAAI 417 (418) T ss_pred HHHHHHHHCCCCCCCCCCHHHHHHHHHHH T ss_conf 79999997089998998999999999976 No 24 >PRK07196 fliI flagellum-specific ATP synthase; Validated Probab=100.00 E-value=0 Score=906.40 Aligned_cols=423 Identities=24% Similarity=0.349 Sum_probs=398.2 Q ss_pred HHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECC--C--EEEEEEEECCCEEEEEEECCCCCCCCCCE Q ss_conf 9999842455542010358999827589997588877374799579--8--18999920598599998058668678988 Q gi|254780617|r 12 LRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSH--G--VYGMALNLEVDNVGVVILGSYKEISEGDI 87 (509) Q Consensus 12 l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~--g--~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~ 87 (509) |++.+..+......+..|||++|.|.++++.|++ +.+||+|++.. + +.|||++|+++.+.+++|+++.||..|++ T Consensus 2 l~~~~~~~~~~~~~r~~GrV~~i~G~~ie~~g~~-~~iG~~c~I~~~~g~~v~aEVVgf~~~~~~L~p~~~~~Gi~~G~~ 80 (434) T PRK07196 2 LDHALKSIENIHLARVAGRLVRVTGLLLESVGCR-LAIGQRCRIESVDETFIEAQVVGFDRDITYLMPFKHPGGVLGGAR 80 (434) T ss_pred HHHHHHHHHCCCCCEEEEEEEEEECCEEEEECCC-CCCCCEEEEEECCCCEEEEEEEEECCCEEEEEECCCCCCCCCCCE T ss_conf 5899999755986248889999971089998069-898998999808998889999998199699998888778899999 Q ss_pred EEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEE Q ss_conf 89779878866794698300030023144664445444100101357731126600144416023310111156833655 Q gi|254780617|r 88 VKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELI 167 (509) Q Consensus 88 V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I 167 (509) |+++++.++||||++|||||+|++|+||||+|++..+..++++.+++||++|+++++||+||||+||+|+|||||||++| T Consensus 81 V~~~g~~~~v~vg~~lLGRVvD~~G~PlDg~g~~~~~~~~~~~~~~~nPl~R~~i~e~l~TGIraID~l~pigrGQRigI 160 (434) T PRK07196 81 VFPSEQDGELLIGDSWLGRVINGLGEPLDGKGQLGGSTPLQQQLPQIHPLQRRAVDTPLDVGVNAINGLLTIGKGQRVGL 160 (434) T ss_pred EEECCCCCEEECCHHHCCCEECCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEE T ss_conf 99889987677686224757767886268998888884655668999830137888721268624415577024524210 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHH Q ss_conf 24778861158999999988630246678760699995167767899999851146742115899728878889999988 Q gi|254780617|r 168 IGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAP 247 (509) Q Consensus 168 ~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap 247 (509) ||++|+|||+| +.||.+|... .+|||++||||+||+.+|++++.+.+.|+||++|++|+|+||.+|+++| T Consensus 161 fggsGvGKs~L-l~~I~r~~~a---------dvvVi~lIGeRgrEv~efi~~~~~~~~l~~svvv~atsd~~p~~r~~a~ 230 (434) T PRK07196 161 MAGSGVGKSVL-LGMITRYTQA---------DVVVVGLIGERGREVKEFIEHSLQAAGMAKSVVVAAPADESPLMRIKAT 230 (434) T ss_pred CCCCCCCHHHH-HHHHHHHHCC---------CEEEEEECCEEHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHH T ss_conf 14789978999-9999897379---------9899974222349999999987424674326999845889878888999 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEE Q ss_conf 76666667542046884799616478999998754530378754333752001246788887504544588735675310 Q gi|254780617|r 248 FAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVI 327 (509) Q Consensus 248 ~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v 327 (509) |+|+|+||||||+|+|||+++|||||||+|+|||||+++|||+++|||||+||.||+|+||||+. .|+||||+||+| T Consensus 231 ~~a~aiAEyfrd~G~~VLl~~Dsltr~A~A~REisl~~ge~P~~~gyp~svf~~l~~l~ERaG~~---~~~GSITa~~tv 307 (434) T PRK07196 231 ELCHAIATYYRDKGHDVLLLVDSLTRYAMAQREIALSLGEPPATKGYPPSAFSIIPRLAESAGNS---SGNGTMTAIYTV 307 (434) T ss_pred HHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCC---CCCEEEEEEEEE T ss_conf 88998999998679856999547468899999999756899986775843888858999852157---886248888888 Q ss_pred ECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 03676644311466885406169961501221026654334441220343438988775567888799887678798763 Q gi|254780617|r 328 ETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFS 407 (509) Q Consensus 328 ~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fg 407 (509) ++++||++||||||++|||||||||||+||++|+||||||+.|+||+|+.++.+.|+++|+++|.+|++|+|+|+|+++| T Consensus 308 ~~~~dD~~dpi~d~~~silDGhivLsr~la~~G~yPaIdvl~S~SR~m~~i~~~~~~~~a~~~r~~la~y~e~e~li~~G 387 (434) T PRK07196 308 LAEGDDQQDPIVDCARAVLDGHIVLSRKLAEAGHYPAIDISQSISRCMSQVIGSQQAKAASLLKQCYADYMAIKPLIPLG 387 (434) T ss_pred ECCCCCCCCCCHHHHHHHCCCEEEECHHHHHCCCCCCCCCCCCHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 62688878870666575507549984768866999973886264225510166999999999999999999999999852 Q ss_pred C---CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 3---8998999999999999999669998874889999999998 Q gi|254780617|r 408 S---DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAG 448 (509) Q Consensus 408 s---dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~ 448 (509) . +.|+.+++++++.++|.+||+|+.+++.++++.+-.|..+ T Consensus 388 ~Y~~G~d~~~D~ai~~~~~i~~fL~Q~~~e~~~~~~s~~~l~~l 431 (434) T PRK07196 388 GYVAGADPMADQAVHYYPAITQFLRQEVGHPALFSASVEQLTGM 431 (434) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 85579998999999858999998289999994999999999987 No 25 >PRK12597 F0F1 ATP synthase subunit beta; Provisional Probab=100.00 E-value=0 Score=904.60 Aligned_cols=422 Identities=25% Similarity=0.394 Sum_probs=398.1 Q ss_pred EEEEEEEEECCEEEEECCCCCCCCCEEEEC----CCEEEEEEE-ECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCH Q ss_conf 035899982758999758887737479957----981899992-059859999805866867898889779878866794 Q gi|254780617|r 27 EIGRVLSIGDGIARVYGLNNIRAGEMVQFS----HGVYGMALN-LEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGL 101 (509) Q Consensus 27 ~~G~V~~V~~giv~v~GL~~a~~GElv~~~----~g~~G~Vi~-l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~ 101 (509) .+|||++|.|.+++|.+ +++.++++|++. +...++|++ ++++.|.+++|+++.||++|++|.+||++++||||+ T Consensus 2 ~~GrV~~V~G~vVev~~-~~~~l~~i~~i~~~~~~~~~~EVv~~l~~~~v~~i~l~~t~Gl~~G~~V~~tg~~l~VpVG~ 80 (459) T PRK12597 2 MIGRIIKIRGAVVDVEF-EEGALPAINQALTTHDGPTLLEVKQHLDETTVRAIALGSTAGLARGDEVRNTGAPIEVPVGE 80 (459) T ss_pred CCEEEEEEECCEEEEEE-CCCCCCCCEEEEEECCCCEEEEEEEEECCCEEEEEECCCCCCCCCCCEEEECCCCEEEEECH T ss_conf 83799999842899995-78887770589992799589999798479879999813877888999999689945987086 Q ss_pred HHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHH Q ss_conf 69830003002314466444544410010135773112660014441602331011115683365524778861158999 Q gi|254780617|r 102 ELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILD 181 (509) Q Consensus 102 ~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~ 181 (509) +|||||+|++|+||||++++..+++||+++++|++++|.++++||+||||+||+|+|||||||+||||++|+|||+|+.+ T Consensus 81 ~lLGRV~d~lG~PiDg~~~i~~~~~~pi~~~~p~~~~r~~~~e~l~TGIkaID~l~pigrGqrigIfggaGvGKTvL~~~ 160 (459) T PRK12597 81 AVLGRLLDVLGEPLDGGPPLPAEERRPIHSTIPPLAEQDTSTELLETGIKVIDLLCPIAKGGKTGLFGGAGVGKTVLMME 160 (459) T ss_pred HHCCCEECCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCEEEEECCCCCCCCHHHHH T ss_conf 33368767888626888888877455546789984335777872003805554126635787777436899982157899 Q ss_pred HHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHC-C Q ss_conf 999988630246678760699995167767899999851146742115899728878889999988766666675420-4 Q gi|254780617|r 182 TFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRD-N 260 (509) Q Consensus 182 ~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~-~ 260 (509) +|.|+... ++++|||++||||+||+++|++++++.|+|+||++|+++||+||++|+++||+|+|||||||| + T Consensus 161 ii~n~~~~-------~~~v~V~~~iGER~rEv~e~~~el~~~g~l~~tvvV~~~~depp~~R~~~~~~a~tiAEyFrD~~ 233 (459) T PRK12597 161 LIFNISKR-------HQGSSVFAGVGERSREGHELYLEMKESGVLDKTVMVYGQMNEPPGARMRVVLTGLTAAEYFRDEE 233 (459) T ss_pred HHHHHHHH-------CCCCEEEEEECCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99987755-------17836888624546889999999863687524899980688999999877778889999998637 Q ss_pred CCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHH Q ss_conf 68847996164789999987545303787543337520012467888875045445887356753100367664431146 Q gi|254780617|r 261 GYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPT 340 (509) Q Consensus 261 G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~ 340 (509) |+|||+++||+||||+|+||||+++||||+++||||++|+.+++|+||+++ .++||||+||+|++|+||+|||||| T Consensus 234 g~dVLl~~D~ltR~A~A~rEisl~lge~P~~~GY~psl~~~l~~l~ER~~~----~~~GSIT~~~~V~v~~dD~tdPi~~ 309 (459) T PRK12597 234 KEDVLLFIDNIFRFVQAGSEVSGLLGRMPSRVGYQPTLASEVAALQERIAS----TKNGSITSIQAVYVPADDLTDPAAV 309 (459) T ss_pred CCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCC----CCCCCEEEEEEEEECCCCCCCCCHH T ss_conf 997899955768999999999986379998667695178889999987516----8997512678998148886775255 Q ss_pred HHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHH-HHCC-CCHHHHHH Q ss_conf 688540616996150122102665433444122034-34389887755678887998876787987-6338-99899999 Q gi|254780617|r 341 NVISITDGQIFLETELFYQGIRPAINIGLSVSRVGS-AAQVKAMKQVSGAVKGELAQYREMSSFSK-FSSD-LDSSTQKF 417 (509) Q Consensus 341 ~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~-~~~~~~~~~~a~~lr~~laqy~Ele~f~~-fgsd-lD~~t~~~ 417 (509) +++|||||||||||+||++|+||||||+.|+||+|. .+..+.|+++|+++|..|++|+|++++++ +|.| ||+++|.+ T Consensus 310 ~~~silDg~ivLsr~la~~g~yPAID~l~S~SR~m~~~ivg~~H~~~a~~~~~~l~~y~eledii~i~G~deLs~~d~~~ 389 (459) T PRK12597 310 AIFSHLDSSVVLSRDQAAKGIYPAIDPLASSSRLLDPLVVGERHYDAAIEVKRILQRYKELEDVIAILGIDELSAEDKII 389 (459) T ss_pred HHEEECCEEEEECHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 43110342899825587779997667666532247876699999999999999999988999999975851079999999 Q ss_pred HHHHHHHHHHHCCCCCCC----------CCHHHHHHHHHHHHCCCCCCCCHHH Q ss_conf 999999999966999887----------4889999999998669806679899 Q gi|254780617|r 418 LSKGERLTELLKQPQFSP----------LAMEEQVVMIFAGISGCLDEVAVSQ 460 (509) Q Consensus 418 l~rg~ri~e~LkQ~~~~p----------~~~~~qv~~l~a~~~G~ld~i~v~~ 460 (509) ++||+|+++||+||+|.+ +++++++...+.+.+|.+|++|.+. T Consensus 390 v~rar~l~~fl~Qpf~v~e~ftg~~g~~v~~~~t~~~~~~il~g~~d~~~e~~ 442 (459) T PRK12597 390 VKRARQLQRFLTQPFFVTEAFTGQPGVSVPLEETLASCERILNGEYDDWSESS 442 (459) T ss_pred HHHHHHHHHHHCCCCHHHHHHCCCCCEEEEHHHHHHHHHHHHCCCCCCCCHHH T ss_conf 99999999986587113344148997085199999999999679877889799 No 26 >PRK04196 V-type ATP synthase subunit B; Provisional Probab=100.00 E-value=0 Score=906.52 Aligned_cols=442 Identities=26% Similarity=0.417 Sum_probs=414.6 Q ss_pred EEEEEEEEEECCEEEEECCCCCCCCCEEEEC--CC--EEEEEEEECCCEEEEEEECCCCCCCC-CCEEEECCCEEEEECC Q ss_conf 1035899982758999758887737479957--98--18999920598599998058668678-9888977987886679 Q gi|254780617|r 26 SEIGRVLSIGDGIARVYGLNNIRAGEMVQFS--HG--VYGMALNLEVDNVGVVILGSYKEISE-GDIVKRTGRIVDVPVG 100 (509) Q Consensus 26 ~~~G~V~~V~~giv~v~GL~~a~~GElv~~~--~g--~~G~Vi~l~~d~v~i~~l~~~~gI~~-G~~V~~tg~~~~vpvG 100 (509) +++|+|.+|.|+++.+.|++++.+||+|++. +| ..|+|++|++|.+.+++|+++.||++ |++|.+||++++|||| T Consensus 2 ~~Yg~V~~I~Gplv~~~g~~~~~~gElv~I~~~~g~~~~GeVi~~~~d~~~iqv~e~t~Gl~~~g~~V~~tG~plsV~vG 81 (460) T PRK04196 2 KEYRTVSEIVGPLMFVEGVEGVAYGELVEIELPNGEKRRGQVLEVSGDKAVVQVFEGTTGLNLKGTKVRFTGETLELPVS 81 (460) T ss_pred CCCCEEEEEECCEEEEECCCCCCCCCEEEEECCCCCEEEEEEEEEECCEEEEEECCCCCCCCCCCCEEEECCCCEEEEEC T ss_conf 42326999988689992588898787899983999888899999869979999915988878599789947995288718 Q ss_pred HHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 46983000300231446644454441001013577311266001444160233101111568336552477886115899 Q gi|254780617|r 101 LELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 101 ~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) ++|||||+|++|+||||+|++..++++|+++.||||++|.++++||+|||++||+|+||+||||++|||++|+|||+|+. T Consensus 82 ~~lLGRV~DglGrPlDg~~~i~~~~~~~i~g~~~np~~R~~~~e~l~TGI~~ID~l~pl~rGQr~~If~gsGvgks~l~~ 161 (460) T PRK04196 82 EDMLGRIFDGLGRPIDGGPEIIPEKRLDINGAPINPVAREYPEEFIQTGISAIDGLNTLVRGQKLPIFSGSGLPHNELAA 161 (460) T ss_pred HHHHCCEECCCCCCCCCCCCCCCCCEEECCCCCCCCHHHCCCCCEECCCCEEEECCCCEECCCEEEEECCCCCCCCHHHH T ss_conf 77727984778863688998887734125889989255268884613796796164332247576553389988116999 Q ss_pred HHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH-C Q ss_conf 999998863024667876069999516776789999985114674211589972887888999998876666667542-0 Q gi|254780617|r 181 DTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFR-D 259 (509) Q Consensus 181 ~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr-~ 259 (509) +|++|+... ...++++|||++||||+||+++|++++++.|+|+||++|++|||+|+++|+++||+|+|+||||| | T Consensus 162 -~i~~qa~~~---~~~~~~~vV~~~iGeR~~E~~e~~~~~~~~g~m~rtvlv~~ts~~p~~~R~~~~~~a~tiAEyfr~d 237 (460) T PRK04196 162 -QIARQAKVL---GSEEKFAVVFAAMGITFEEANFFMDEFEETGALERSVLFLNLADDPAIERILTPRMALTTAEYLAFE 237 (460) T ss_pred -HHHHHHHHC---CCCCCCEEEEEECCEECHHHHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf -999987750---5788815999732076068999999887448632004751358898899888876665789999985 Q ss_pred CCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHH Q ss_conf 46884799616478999998754530378754333752001246788887504544588735675310036766443114 Q gi|254780617|r 260 NGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIP 339 (509) Q Consensus 260 ~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~ 339 (509) +|+|||++|||+||||+|+||||++++|||+++||||++|+.|++|+||||+++.. +||||++|+|++|+||+||||| T Consensus 238 ~G~~VL~~~dd~tr~A~A~REis~~lge~P~~~GYP~~l~s~la~l~ERAG~~~~~--~GSIT~i~~V~~pgdD~t~Pi~ 315 (460) T PRK04196 238 KGMHVLVILTDMTNYCEALREISAAREEVPGRRGYPGYMYTDLATLYERAGRIKGK--KGSITQIPILTMPEDDITHPIP 315 (460) T ss_pred CCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCC--CCCEEEEEEEECCCCCCCCCHH T ss_conf 69957999758078888766888754799987776862688868888723555789--8326999999767987666467 Q ss_pred HHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCC-----CHHHHHHHHHHHHHHHHHHHHHHHHHH-HCC-CCH Q ss_conf 6688540616996150122102665433444122034343-----898877556788879988767879876-338-998 Q gi|254780617|r 340 TNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQ-----VKAMKQVSGAVKGELAQYREMSSFSKF-SSD-LDS 412 (509) Q Consensus 340 ~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~-----~~~~~~~a~~lr~~laqy~Ele~f~~f-gsd-lD~ 412 (509) ||++|||||||||||+||++||||||||+.|+||+|..+. .+.|++++++++..|++|+||++|+++ |.| |++ T Consensus 316 ~~t~~i~dg~i~L~r~La~~g~yPaId~l~S~SR~~~~~ig~~~~~~~h~~va~~~~~~la~~~el~~i~~lvG~d~Ls~ 395 (460) T PRK04196 316 DLTGYITEGQIVLSRELHRKGIYPPIDVLPSLSRLMKDGIGEGKTREDHADVSNQLYAAYARGKDLRELAAIVGEEALSE 395 (460) T ss_pred HHHHHHCCEEEEECHHHHHCCCCCCCCCCCCCCHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH T ss_conf 76764404389983899967998878876550110203356552166899999999999997888999999849601898 Q ss_pred HHHHHHHHHHHHH-HHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999-9966999887488999999999866980667989999999999999974159 Q gi|254780617|r 413 STQKFLSKGERLT-ELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQ 477 (509) Q Consensus 413 ~t~~~l~rg~ri~-e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ 477 (509) .+|..|+++++|+ +||+|++|+++++++|+...|. +|+.+|.+.+..|+.+++++...+|. T Consensus 396 ~dk~~l~~a~~~~~~fL~Q~~~~~~~~~~~l~~~~~----~l~~~p~~~~~~~~~~~~~~~~~~~~ 457 (460) T PRK04196 396 RDRKYLKFADAFEREFVNQGFYENRSIEETLDLGWE----LLSILPESELKRIKDEYIDKYHPKYR 457 (460) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHH----HHHHCCHHHHHCCCHHHHHHHHHHHC T ss_conf 899999999899998547999998489999999999----99867899874368999999856542 No 27 >PRK06793 fliI flagellum-specific ATP synthase; Validated Probab=100.00 E-value=0 Score=905.07 Aligned_cols=419 Identities=23% Similarity=0.334 Sum_probs=395.4 Q ss_pred HHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEEC-CCEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECC Q ss_conf 9984245554201035899982758999758887737479957-981899992059859999805866867898889779 Q gi|254780617|r 14 KRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFS-HGVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTG 92 (509) Q Consensus 14 ~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~-~g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg 92 (509) ++.......+..+++|+|.+|.|++++++| +++.+||+|.+. +++.|||++|+++.+.+++|++..||++|++|.+++ T Consensus 8 ~~~~~~~~~p~~~~~GrV~~V~g~~i~a~G-~~~~iGe~c~i~~~~~~aeVvgf~~~~v~l~p~~~~~gi~~G~~V~~~g 86 (432) T PRK06793 8 QKWNTFIETPFYTKVGKVHSVQEQFFVAKG-PKAKIGDVCFVGEHNVLCEVIAIEKENNMLLPFEQTEKVCYGDSVTLIA 86 (432) T ss_pred HHHCCCCCCCCCEEEEEEEEEECEEEEEEE-CCCCCCCEEEECCCCEEEEEEEEECCEEEEEECCCCCCCCCCCEEEECC T ss_conf 554341368874478899999351999993-7987589899868981899999858939999887877889999999799 Q ss_pred CEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCC Q ss_conf 87886679469830003002314466444544410010135773112660014441602331011115683365524778 Q gi|254780617|r 93 RIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRK 172 (509) Q Consensus 93 ~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g 172 (509) ++++||||++|||||+|++|+||||++++...+++|+++++|||++|.++++||+||||+||+|+|||||||++||||+| T Consensus 87 ~~~~vpVg~~lLGRViD~lG~PlD~~~~~~~~~~~~~~~~~~np~~R~~i~e~l~TGIraID~l~pigrGQR~gIfg~sG 166 (432) T PRK06793 87 EDVVIPRGNHLLGKVLSANGEVLNEEAENIPLQKIKLDAPPIHAFEREEITDVFETGIKSIDSMLTIGIGQKIGIFAGSG 166 (432) T ss_pred CCCEEECCCCCCCCEECCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCEEEEECCCC T ss_conf 97578889563287757896615899988877434432899883013777874004873100015612462666632689 Q ss_pred CCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHH Q ss_conf 86115899999998863024667876069999516776789999985114674211589972887888999998876666 Q gi|254780617|r 173 TGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCA 252 (509) Q Consensus 173 ~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~a 252 (509) +|||+|+ .||.++.. ..+|||++||||+||+.+|+++....+.|+||++|++|||+||.+|+++||+||| T Consensus 167 vGKstLl-~~i~~~~~---------adv~Vi~lIGeRgrEv~efi~~~l~~~gl~~tvvV~atsd~p~~~r~~a~~~a~a 236 (432) T PRK06793 167 VGKSTLL-GMIAKNAK---------ADINVISLVGERGREVKDFIRKELGEEGMRKSVVVVATSDESHLMQLRAAKLATS 236 (432) T ss_pred CCHHHHH-HHHHHHCC---------CCCEEEEEEEEECHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 8789999-99987245---------7865799988862799999999864467521278872588998999988755123 Q ss_pred HHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCC Q ss_conf 66754204688479961647899999875453037875433375200124678888750454458873567531003676 Q gi|254780617|r 253 MGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVN 332 (509) Q Consensus 253 iAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~ 332 (509) |||||||+|+|||++||||||||+|||||||++||||. .|||+++|+.+++|+||||+.. +||||+||+|++++| T Consensus 237 iAEyfrd~G~~VLli~DslTr~A~A~REisl~~gepP~-~Gy~p~~~s~l~~LlERag~~~----~GSITa~~tVl~~gd 311 (432) T PRK06793 237 IAEYFRDQGNNVLLMMDSVTRFADARRSVDIAVKELPI-GGKTLLMESYMKKLLERSGKTQ----KGSITGIYTVLVDGD 311 (432) T ss_pred HHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCC-CCCCCCHHHHHHHHHHHHHCCC----CCCEEEEEEEEECCC T ss_conf 78899866996899945678888887477885379998-9888645878999997540368----863578889983288 Q ss_pred CCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC---C Q ss_conf 6443114668854061699615012210266543344412203434389887755678887998876787987633---8 Q gi|254780617|r 333 DVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFSS---D 409 (509) Q Consensus 333 D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fgs---d 409 (509) |++||||||++|||||||||||+||++||||||||+.|+||+|++++.+.|+++|+++|.+||+|+|+|+|+++|. + T Consensus 312 D~~~pI~d~~~silDGhIvLsr~La~~g~yPAIDvl~SvSRv~~~v~~~~h~~~a~~~r~~la~y~e~edli~iG~Y~~g 391 (432) T PRK06793 312 DLNGPVPDLARGILDGHIVLKRELATLSHYPAISVLDSVSRIMEEIVSPNHWQLANEMRKILSIYKENELYFKLGTIQEN 391 (432) T ss_pred CCCCCCHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 66787157677662537999675886699997288515422625450399999999999999999999999984797669 Q ss_pred CC-HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 99-8999999999999999669998874889999999998 Q gi|254780617|r 410 LD-SSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAG 448 (509) Q Consensus 410 lD-~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~ 448 (509) -| +..++++++.++|.+||+|+..++.++++++..|+.+ T Consensus 392 ~d~~~~d~a~~~~~~i~~fL~Q~~~e~~~~~~~~~~l~~l 431 (432) T PRK06793 392 AENAYIFECKNKVEGINTFLKQGRSDSFQFDDIVEAMHHI 431 (432) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 9858899999868999998289988997999999999956 No 28 >PRK07960 fliI flagellum-specific ATP synthase; Validated Probab=100.00 E-value=0 Score=902.65 Aligned_cols=435 Identities=24% Similarity=0.322 Sum_probs=408.9 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEEC--CC-----EEEEEEEECCCEEEE Q ss_conf 98888999999999984245554201035899982758999758887737479957--98-----189999205985999 Q gi|254780617|r 1 MDIHAAEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFS--HG-----VYGMALNLEVDNVGV 73 (509) Q Consensus 1 m~~~~~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~--~g-----~~G~Vi~l~~d~v~i 73 (509) |+.+..+-.+.+....+++...+.++.+|||++|.|.+++++|+ ++.+||+|+++ ++ +.+||++|+++.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~GrV~~v~G~~iea~G~-~a~iG~~C~I~~~~g~~~~~v~AEVVGF~~~~~~L 79 (455) T PRK07960 1 MTTRLTRWLTTLDNFEAKMAQLPAVRRYGRLTRATGLVLEATGL-QLPLGATCVIERQNGSETHEVESEVVGFNGQRLFL 79 (455) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEEEEEEEECC-CCCCCCEEEEEECCCCCCCCEEEEEEEECCCEEEE T ss_conf 95379999999999999983699761466899997359999677-88979989997079987771679999876995899 Q ss_pred EEECCCCCCCCCCEEEE-------CCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHH Q ss_conf 98058668678988897-------79878866794698300030023144664445444100101357731126600144 Q gi|254780617|r 74 VILGSYKEISEGDIVKR-------TGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPL 146 (509) Q Consensus 74 ~~l~~~~gI~~G~~V~~-------tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l 146 (509) |||++..||.+|++|+. +++...||||++|||||+|++|+||||++++...+++|++++||||++|.++++|| T Consensus 80 mp~~~~~Gi~~G~~V~~~~~~~~~~~~~~~vpVg~~lLGRVvd~lG~PlDg~~~~~~~~~~pi~~~~pnp~~R~~i~epL 159 (455) T PRK07960 80 MPLEEVEGILPGARVYARNISAEGLQSGKQLPLGPALLGRVLDGSGKPLDGLPAPDTTETGALITPPFNPLQRTPIEHVL 159 (455) T ss_pred EECCCCCCCCCCCEEEECCCCCCCCCCCCEECCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCC T ss_conf 64797665799999972467655578888720384630635189998268999999876153558997822117878723 Q ss_pred HCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCC Q ss_conf 41602331011115683365524778861158999999988630246678760699995167767899999851146742 Q gi|254780617|r 147 STGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGAL 226 (509) Q Consensus 147 ~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~ 226 (509) +||||+||+|+|||||||++|||++|+|||+| +.||.+++.. .+|||++||||+||+.+|++.+.+.+.+ T Consensus 160 ~TGIraID~llpigrGQRigIfagsGvGKs~L-l~~iar~~~a---------dv~Vi~lIGERgrEv~efi~~~l~~~~~ 229 (455) T PRK07960 160 DTGVRAINALLTVGRGQRMGLFAGSGVGKSVL-LGMMARYTRA---------DVIVVGLIGERGREVKDFIENILGAEGR 229 (455) T ss_pred CCCCEEEECCCCCCCCCEEECCCCCCCCHHHH-HHHHHHHHCC---------CEEEEEEECEEHHHHHHHHHHHHCCCCC T ss_conf 26844440336714787763248998849999-9999886089---------8589972040628999999997514774 Q ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 11589972887888999998876666667542046884799616478999998754530378754333752001246788 Q gi|254780617|r 227 SYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLL 306 (509) Q Consensus 227 ~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ 306 (509) +||++|++|+|+||.+|+++||+|+||||||||+|+|||+++||+||||+|+|||||++||||+++|||||+|+.||+|+ T Consensus 230 ~rsvvv~atsd~~p~~r~~aa~~a~aiAEyFrd~Gk~VLl~~DslTr~A~A~REi~l~~gepP~~~GYppsvf~~l~~L~ 309 (455) T PRK07960 230 ARSVVIAAPADVSPLLRMQGAAYATRIAEDFRDRGQHVLLIMDSLTRYAMAQREIALAIGEPPATKGYPPSVFAKLPALV 309 (455) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHH T ss_conf 21279984266684888878999999999998558737999736478888888888763799865577830677737899 Q ss_pred HHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHH Q ss_conf 88750454458873567531003676644311466885406169961501221026654334441220343438988775 Q gi|254780617|r 307 ERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQV 386 (509) Q Consensus 307 ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~ 386 (509) ||||+. ..++||||+||+|++++||++|||||+++|||||||||||+||++||||||||+.|+||+|+.++.+.|+++ T Consensus 310 ERaG~~--~~~~GSITa~~tVl~~gdD~~dPi~d~~~silDGhivLsr~la~~g~yPAIDvl~S~SR~m~~i~~~~h~~~ 387 (455) T PRK07960 310 ERAGNG--ISGGGSITAFYTVLTEGDDQQDPIADSARAILDGHIVLSRRLAEAGHYPAIDIEASISRAMTALIDEQHYAR 387 (455) T ss_pred HHCCCC--CCCCCCEEEEEEEECCCCCCCCCCHHHHHHHEEEEEEECHHHHHCCCCCCCCCCCCHHHCCCCCCCHHHHHH T ss_conf 850478--999811378889963688867871676454212599982899967999970865047550400079999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHC---CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 5678887998876787987633---8998999999999999999669998874889999999998 Q gi|254780617|r 387 SGAVKGELAQYREMSSFSKFSS---DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAG 448 (509) Q Consensus 387 a~~lr~~laqy~Ele~f~~fgs---dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~ 448 (509) |+++|.+|++|+|+++++++|. +.|+.+++++++.++|.+||+|+.+++.++++.+..|..+ T Consensus 388 a~~~r~~la~y~~~~dli~iG~Y~~G~d~~~D~ai~~~~~i~~fL~Q~~~e~~~~~~t~~~L~~l 452 (455) T PRK07960 388 VRTFKQLLSSFQRNRDLVSVGAYAKGSDPMLDKAIALWPQLEAFLQQGIFERADWEDSLQGLELI 452 (455) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 99999999999999999986186589986799999868999998479988986999999999987 No 29 >PRK06315 type III secretion system ATPase; Provisional Probab=100.00 E-value=0 Score=902.66 Aligned_cols=430 Identities=25% Similarity=0.390 Sum_probs=400.5 Q ss_pred HHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECC----CEEEEEEEECCCEEEEEEECCCCC Q ss_conf 9999999999842455542010358999827589997588877374799579----818999920598599998058668 Q gi|254780617|r 6 AEISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSH----GVYGMALNLEVDNVGVVILGSYKE 81 (509) Q Consensus 6 ~eis~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~----g~~G~Vi~l~~d~v~i~~l~~~~g 81 (509) +|++.-+.+.+.++......+.+|||++|.|.++++.| |++.+||+|.+.. ...|+|++|+++.+.+++|+++.| T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~GrV~~v~G~~ie~~g-~~~~iGelc~I~~~~~~~~~aEVVgf~~~~~~l~p~~~~~G 80 (442) T PRK06315 2 DELTTDFDTLMSQLPDVQLTTVVGRITEVVGMLIKAVV-PDVRVGEVCLVKRHGMEPLVTEVVGFTQNFVFLSPLGELTG 80 (442) T ss_pred CHHHHHHHHHHHHHCCCCCCEEEEEEEEEEEEEEEEEE-CCCCCCCEEEEEECCCCEEEEEEEEECCCEEEEEECCCCCC T ss_conf 24566499999986138862135799999978999986-78986786899918997788999998499799998778767 Q ss_pred CCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCC--CCCC-CCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCC Q ss_conf 678988897798788667946983000300231446--6444-5444100101357731126600144416023310111 Q gi|254780617|r 82 ISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDG--KGPI-KCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIP 158 (509) Q Consensus 82 I~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg--~g~i-~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~p 158 (509) |.+|++|.++|++++||||++|||||+|++|+|||| +||+ ..+++||+++.||||++|.++++||+||||+||+|+| T Consensus 81 i~~G~~V~~~g~~~~vpvg~~lLGRViD~~G~PiDg~~~gp~~~~~~~~pi~~~~p~p~~R~~i~epl~TGIraID~llp 160 (442) T PRK06315 81 VSPSSEVIPTGLPLHIRAGNGLLGRVLNGLGEPIDTETKGPLENVDETYPIFRAPPDPLHRAKLRTILSTGVRCIDGMLT 160 (442) T ss_pred CCCCCEEEECCCCCEEECCCCCCCCEECCCCCCCCCCCCCCCCCCCCEEECCCCCCCCHHHCCCCCCCCCCCEEECCCCC T ss_conf 89999999689987787271513788888877667877788767763113247896934307878753368530011277 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCC Q ss_conf 15683365524778861158999999988630246678760699995167767899999851146742115899728878 Q gi|254780617|r 159 IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASD 238 (509) Q Consensus 159 igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~ 238 (509) ||||||++|||++|+|||+|+ .||.++.. ...+|||++||||+||+.+|++.+.+.++|+||++|++|||+ T Consensus 161 ig~GQR~gIfagsGvGKs~Ll-~~i~~~~~--------~adv~V~~lIGeRgrEv~efie~~~~~~~l~rsvvV~atsd~ 231 (442) T PRK06315 161 VARGQRIGIFAGAGVGKSSLL-GMIARNAE--------EADVNVIALIGERGREVREFIEGDLGEEGMKRSVIVVSTSDQ 231 (442) T ss_pred CCCCCEEECCCCCCCCHHHHH-HHHHHHHC--------CCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEECCCCC T ss_conf 023777640389999888999-99997440--------389349999551137999999975201452133999617999 Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 88999998876666667542046884799616478999998754530378754333752001246788887504544588 Q gi|254780617|r 239 PAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGA 318 (509) Q Consensus 239 ~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~ 318 (509) ||.+|++|||+|+||||||||+|+|||+++||+||||+|+|||||++||||+++|||||+|+.|++|+||||+.. + T Consensus 232 p~~~R~~a~~~A~aiAEyFrd~G~~VLl~~DslTR~A~A~REi~l~~gepP~~~GYppsvf~~l~~l~ERag~~~----~ 307 (442) T PRK06315 232 SSQLRLNAAYVGTAIAEYFRDQGKTVVLMMDSVTRFARALREVGLAAGEPPARAGYTPSVFSTLPKLLERAGASD----K 307 (442) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCC----C T ss_conf 878875558877789999986899489994370288889999997558999877868339988788887224889----9 Q ss_pred CCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 73567531003676644311466885406169961501221026654334441220343438988775567888799887 Q gi|254780617|r 319 GSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYR 398 (509) Q Consensus 319 GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~ 398 (509) ||||+||+|++++||++||||||++|||||||||||+||++||||||||+.|+||+|+.++.+.|+++|+++|.+|++|+ T Consensus 308 GSITa~~tVl~~gdD~~dPi~d~~~~ilDGhivLsr~la~~g~yPAIDvl~S~SR~~~~v~~~~h~~~a~~~r~~la~y~ 387 (442) T PRK06315 308 GTITAFYTVLVAGDDMNEPVADEVKSILDGHIVLSNALAQAYHYPAIDVLASISRLLTAIVPEEQRRIIGKAREVLAKYK 387 (442) T ss_pred CCEEEEEEEECCCCCCCCCCHHHHHHHCCCEEEECHHHHHCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 74565334651588767873677787606169985879866999971875045336611057999999999999999889 Q ss_pred HHHHHHHHHC---CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf 6787987633---89989999999999999996699988748899999999986 Q gi|254780617|r 399 EMSSFSKFSS---DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGI 449 (509) Q Consensus 399 Ele~f~~fgs---dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~ 449 (509) |+++++++|. +.|++++.++++.++|++||+|+.+++.++++.+..|..+. T Consensus 388 e~edli~iG~y~~g~d~~~d~ai~~~~~i~~fL~Q~~~e~~~~~~~~~~l~~i~ 441 (442) T PRK06315 388 ANEMLIRIGEYRRGSDREVDFAIDHIDKLNRFLKQDIHEKTNYEEAAQQLRAIF 441 (442) T ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH T ss_conf 999999962987899988999999799999982899889969999999999975 No 30 >TIGR03305 alt_F1F0_F1_bet alternate F1F0 ATPase, F1 subunit beta. A small number of taxonomically diverse prokaryotic species have what appears to be a second ATP synthase, in addition to the normal F1F0 ATPase in bacteria and A1A0 ATPase in archaea. These enzymes use ion gradients to synthesize ATP, and in principle may run in either direction. This model represents the F1 beta subunit of this apparent second ATP synthase. Probab=100.00 E-value=0 Score=906.71 Aligned_cols=434 Identities=24% Similarity=0.368 Sum_probs=393.8 Q ss_pred EEEEEEECCEEEEECCCCC--CCCCEEEECC-C--EEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHHH Q ss_conf 5899982758999758887--7374799579-8--189999205985999980586686789888977987886679469 Q gi|254780617|r 29 GRVLSIGDGIARVYGLNNI--RAGEMVQFSH-G--VYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLEL 103 (509) Q Consensus 29 G~V~~V~~giv~v~GL~~a--~~GElv~~~~-g--~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~l 103 (509) |+|++|.|.++.|.+ |+. .+++.++..+ + +.+++.+|+++.|++++|++++||++|++|..||++++||||++| T Consensus 1 G~V~~V~G~VVdv~f-~~~~p~i~~~l~~~~~~~~~~ev~~~l~~~~V~~i~l~~t~Gl~~G~~V~~tg~~l~VpVG~~l 79 (449) T TIGR03305 1 GHVVAVRGSIVDVRF-DGELPAIHSVLRAGREGEVVVEVLSQLDAHHVRGIALTPTQGLARGMPVRDSGGPLKAPVGKPT 79 (449) T ss_pred CEEEEEECCEEEEEE-CCCCCCHHCEEEECCCCCEEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCCEEEECCCCC T ss_conf 969999764799990-6877502018895799959998701227988999990487788899999928995488717210 Q ss_pred HHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 83000300231446644454441001013577311266001444160233101111568336552477886115899999 Q gi|254780617|r 104 LGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 104 LGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ||||+|++|+||||+|++...+++|+++.||++++|.++++||+||||+||+|+|||||||+|||||+|+|||+|+.++| T Consensus 80 LGRV~d~lG~PiDg~g~i~~~~~~~i~~~~p~~~~R~~~~e~l~TGIkaID~l~pigrGqr~gIfggaGvGKT~Ll~e~i 159 (449) T TIGR03305 80 LSRMFDVFGNTIDRREPPKDVEWRSVHQAPPTLTRRSSKSEVFETGIKAIDVLVPLERGGKAGLFGGAGVGKTVLLTEMI 159 (449) T ss_pred CCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEHHHCCCCCCCCEEEEECCCCCCCCCHHHHHH T ss_conf 67888056654678999887641323578998555677786400284402302452457565665279998410189999 Q ss_pred HHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH-CCCC Q ss_conf 998863024667876069999516776789999985114674211589972887888999998876666667542-0468 Q gi|254780617|r 184 LNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFR-DNGY 262 (509) Q Consensus 184 ~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr-~~G~ 262 (509) .||+..+ ..+|||++||||+||+++|++++++.|+|+||++|+++||+||++|+++||+||||||||| ++|+ T Consensus 160 ~n~~~~~-------~~v~V~~~IGER~rE~~e~~~el~~~g~l~~tv~V~a~~depp~~R~~~~~~a~tiAEyfrd~~g~ 232 (449) T TIGR03305 160 HNMVGQH-------QGVSIFCGIGERCREGEELYREMKEAGVLDNTVMVFGQMNEPPGARFRVGHTALTMAEYFRDDEKQ 232 (449) T ss_pred HHHHHHC-------CCEEEEEEEEEEHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 8656414-------886899997452167999999987536654269998369898799999999877699998861798 Q ss_pred CEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHH Q ss_conf 84799616478999998754530378754333752001246788887504544588735675310036766443114668 Q gi|254780617|r 263 HALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNV 342 (509) Q Consensus 263 ~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~ 342 (509) |||++|||+||||+||||||+++||||+|+|||||+||.|++|+||+++.. +||||++|+|++|+||+|||||||+ T Consensus 233 dVLl~~D~ltr~a~A~rEvsl~lg~~P~~~GYpp~vf~~l~~L~ER~~~~~----~GSITa~~~V~~~~dD~tdPi~~~~ 308 (449) T TIGR03305 233 DVLLLIDNIFRFIQAGSEVSGLLGQMPSRLGYQPTLGTELAELEERIATTS----DGAITSIQAVYVPADDFTDPAAVHT 308 (449) T ss_pred CEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCCC----CCCEEEEEEEECCCCCCCCCHHHHH T ss_conf 089999676899999989998638999877879719999899998524899----9866788789713887667555543 Q ss_pred HHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHH-H-CCCCHHHHHHHH Q ss_conf 85406169961501221026654334441220343-43898877556788879988767879876-3-389989999999 Q gi|254780617|r 343 ISITDGQIFLETELFYQGIRPAINIGLSVSRVGSA-AQVKAMKQVSGAVKGELAQYREMSSFSKF-S-SDLDSSTQKFLS 419 (509) Q Consensus 343 ~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~-~~~~~~~~~a~~lr~~laqy~Ele~f~~f-g-sdlD~~t~~~l~ 419 (509) +|||||||||||+||++|+||||||+.|+||+|+. ...++|+++|+++|..|++|+||++++++ | .+||+++|.+++ T Consensus 309 ~silDg~ivLsr~la~~G~yPAId~l~S~SRv~~~~iv~~~H~~va~~~~~~la~y~el~diiai~G~~eL~~~dk~~v~ 388 (449) T TIGR03305 309 FSHLSASLVLSRKRASEGLYPAIDPLQSTSKMATPGIVGERHYDLAREVRQTLAQYEELKDIIAMLGLEQLSREDRRVVN 388 (449) T ss_pred EEECCCEEEEEHHHHHCCCCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 10127027870768967999971776335534881006799999999999999988999999987485106999999999 Q ss_pred HHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999966999887488999999999866980667989999999999999974159999988 Q gi|254780617|r 420 KGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQDILEDI 483 (509) Q Consensus 420 rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ei~~~I 483 (509) |++|+++||+||+|. +..|.+..|.+..++ +.+..|++.+-...+...++.+..+ T Consensus 389 ra~rl~~fL~Qpf~v--------ae~ftg~~G~~v~l~-~t~~~~~~il~g~~d~~~e~~~~~~ 443 (449) T TIGR03305 389 RARRLERFLTQPFFT--------TEQFTGMKGKTVSLE-DALDGCERILNDEFQDYPERDLYMI 443 (449) T ss_pred HHHHHHHHHCCCCCH--------HHHHCCCCCEEECHH-HHHHHHHHHHCCCCCCCCHHHHHCC T ss_conf 999999985798511--------232259895186199-9999999984898677998997146 No 31 >TIGR03497 FliI_clade2 flagellar protein export ATPase FliI. Members of this protein family are the FliI protein of bacterial flagellum systems. This protein acts to drive protein export for flagellar biosynthesis. The most closely related family is the YscN family of bacterial type III secretion systems. This model represents one (of three) segment of the FliI family tree. These have been modeled separately in order to exclude the type III secretion ATPases more effectively. Probab=100.00 E-value=0 Score=900.97 Aligned_cols=405 Identities=31% Similarity=0.479 Sum_probs=388.4 Q ss_pred EEEEEEECCEEEEECCCCCCCCCEEEEC--C--CEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHHHH Q ss_conf 5899982758999758887737479957--9--81899992059859999805866867898889779878866794698 Q gi|254780617|r 29 GRVLSIGDGIARVYGLNNIRAGEMVQFS--H--GVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLELL 104 (509) Q Consensus 29 G~V~~V~~giv~v~GL~~a~~GElv~~~--~--g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lL 104 (509) |||++|.|.+++++|+ ++.+||+|++. + .+.|+|++|+++.+.+++|+++.||+.|++|.+|+++++||||++|| T Consensus 1 GrV~~i~G~~iev~g~-~~~iGe~c~I~~~~g~~i~aEVv~~~~~~~~l~~~~~t~Gi~~G~~V~~tg~~~~vpvg~~lL 79 (413) T TIGR03497 1 GKVTRVIGLTIESKGP-KAKIGELCSILTKGGKPVLAEVVGFKEENVLLMPLGEVEGIGPGSLVIATGRPLAIKVGKGLL 79 (413) T ss_pred CEEEEEEEEEEEEEEC-CCCCCCEEEEEECCCCEEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCCCEEECCCCCC T ss_conf 9899998279999807-998567599994899889999999829979999936987889999999899974788684305 Q ss_pred HCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 30003002314466444544410010135773112660014441602331011115683365524778861158999999 Q gi|254780617|r 105 GRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 105 GRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |||+|++|+||||++++...+++|+++++|||++|.++++||+||||+||+|+|||||||++|||++|+|||+| +.||. T Consensus 80 GRV~d~~G~PlDg~~~~~~~~~~~i~~~~p~p~~R~~i~e~l~TGIraID~l~pigrGQRigIfag~GvGKt~L-l~~ia 158 (413) T TIGR03497 80 GRVLDGLGRPLDGEGPIIGEEPYPLDNPPPNPLKRPRIRDPLETGIKAIDGLLTIGKGQRVGIFAGSGVGKSTL-LGMIA 158 (413) T ss_pred CCEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCEEECCCEEECCCCCCCCCCEEEECCCCCCCHHHH-HHHHH T ss_conf 68877676655898677776114444689963440678834203865421657611375442105899877699-99999 Q ss_pred HHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 98863024667876069999516776789999985114674211589972887888999998876666667542046884 Q gi|254780617|r 185 NQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHA 264 (509) Q Consensus 185 nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~V 264 (509) +|.. ..+|||++||||+||+++|+++....++++||++|++|||+||++|+++||+|+|+||||||+|||| T Consensus 159 ~~~~---------adv~Vi~liGERgrEv~ef~~~~l~~~~l~rtvvv~atsd~p~~~r~~a~~~a~aiAEyfrd~Gk~V 229 (413) T TIGR03497 159 RNAK---------ADINVIALIGERGREVRDFIEKDLGEEGLKRSVVVVATSDQPALMRLKAAFTATAIAEYFRDQGKDV 229 (413) T ss_pred HHHC---------CCCCEEEEEEEECHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCE T ss_conf 8743---------7852388863322899999998631256421599984587887999887746327899998679968 Q ss_pred EEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHH Q ss_conf 79961647899999875453037875433375200124678888750454458873567531003676644311466885 Q gi|254780617|r 265 LIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVIS 344 (509) Q Consensus 265 Li~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~s 344 (509) |+++|||||||+|+||||++++|||+++|||||+|+.||+|+||||+ .++||||+||+|++++||+||||||+++| T Consensus 230 Ll~~Dsltr~A~A~REisl~~ge~P~~~gyp~~~~s~l~~l~ERag~----~~~GSIT~~~~v~~~~dD~tdPi~d~~~s 305 (413) T TIGR03497 230 LLMMDSVTRFAMAQREIGLAVGEPPTTRGYTPSVFSLLPKLLERSGN----SQKGSITGFYTVLVDGDDMNEPIADAVRG 305 (413) T ss_pred EEEEECCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCC----CCCCCEEEEEEEEECCCCCCCCCHHHHHH T ss_conf 99973712988999999986589998878685499887898886138----99985466778971288767771666757 Q ss_pred HHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC---CCCHHHHHHHHHH Q ss_conf 4061699615012210266543344412203434389887755678887998876787987633---8998999999999 Q gi|254780617|r 345 ITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFSS---DLDSSTQKFLSKG 421 (509) Q Consensus 345 i~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fgs---dlD~~t~~~l~rg 421 (509) ||||||||||+||++||||||||+.|+||+|++++.+.|+++|+++|.+|++|+|+++|+++|. +.|+.++.+++++ T Consensus 306 i~DG~ivLsr~la~~g~yPaIdvl~S~SR~~~~~~~~~h~~~a~~~r~~~a~y~e~~~li~~g~y~~g~d~~~d~ai~~~ 385 (413) T TIGR03497 306 ILDGHIVLSRELAAKNHYPAIDVLASVSRVMNEIVSEEHKELAGKLRELLAVYKEAEDLINIGAYKRGSNPKIDEAIRYI 385 (413) T ss_pred HCCEEEEECHHHHHCCCCCCCCCCCCHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 50618998688986699998186424655343115699999999999999988999999996286679998899999989 Q ss_pred HHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 999999669998874889999999998 Q gi|254780617|r 422 ERLTELLKQPQFSPLAMEEQVVMIFAG 448 (509) Q Consensus 422 ~ri~e~LkQ~~~~p~~~~~qv~~l~a~ 448 (509) ++|++||+|+.+++.++++++..|+.+ T Consensus 386 ~~i~~fL~Q~~~e~~~~~~t~~~l~~l 412 (413) T TIGR03497 386 EKINSFLKQGIDEKFTFEETVQLLKTL 412 (413) T ss_pred HHHHHHCCCCCCCCCCHHHHHHHHHHH T ss_conf 999997389999997999999999986 No 32 >PRK08149 ATP synthase SpaL; Validated Probab=100.00 E-value=0 Score=896.49 Aligned_cols=410 Identities=25% Similarity=0.372 Sum_probs=388.7 Q ss_pred EEEEEEEEEEECCEEEEECCCCCCCCCEEEECC--C-----EEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEE Q ss_conf 010358999827589997588877374799579--8-----189999205985999980586686789888977987886 Q gi|254780617|r 25 FSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSH--G-----VYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDV 97 (509) Q Consensus 25 ~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~--g-----~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~v 97 (509) ++..|+|++|.|.++++.| |++.+||+|++.. + ..|+|++|+++++.+++|+++.||+.|++|.+||++++| T Consensus 4 ~~~~g~v~rI~G~lIe~~~-p~~~iGelc~I~~~~~~~~~~~~aEVvgf~~~~~~l~~~~~t~Gi~~g~~V~~tg~~~~V 82 (427) T PRK08149 4 LQRLAHPQRIQGPIIEAEL-PDVAIGELCEIRAGWHSKEVIARAQVVGFQRERTILSLIGNAQGLSRQVVLYPTGRALSI 82 (427) T ss_pred CCCCCEEEEEEEEEEEEEE-CCCCCCCEEEEEECCCCCCEEEEEEEEEECCCEEEEEECCCCCCCCCCCEEEECCCCCEE T ss_conf 1106758999956999997-899978878998068986146788897314977999978798888999999978995188 Q ss_pred ECCHHHHHCCCCHHHCCCCCCCCC---CCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCC Q ss_conf 679469830003002314466444---54441001013577311266001444160233101111568336552477886 Q gi|254780617|r 98 PVGLELLGRVVDALGNPIDGKGPI---KCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTG 174 (509) Q Consensus 98 pvG~~lLGRVvD~lG~PlDg~g~i---~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~G 174 (509) |||++|||||+|++|+|||++++. ...+++|++++||||++|.++++||+||||+||+|+|||||||++|||++|+| T Consensus 83 ~VG~~lLGRV~d~lG~piD~l~~~~~~~~~~~~~i~~~pp~p~~R~~i~e~l~TGIraID~l~pigkGQR~gIf~gsGvG 162 (427) T PRK08149 83 WVGPALLGAVLDPTGKIVERFAPPTVAPISELRPIDVAPPSYAERRPIREPLITGVRAIDGLLTCGVGQRMGIFASAGCG 162 (427) T ss_pred ECCHHHCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCCCEECCCEEECCCCCCCCCCEEECCCCCCCC T ss_conf 70751207977778888668898777765320304589968256078885000386431044532467540002789986 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHH Q ss_conf 11589999999886302466787606999951677678999998511467421158997288788899999887666666 Q gi|254780617|r 175 KTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMG 254 (509) Q Consensus 175 Kt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiA 254 (509) ||+| ++||.||+.. .+|||++||||+||+.+|++++++.+.|+||++|++|||+||.+|+++||+|+||| T Consensus 163 Ks~L-l~~i~~~~~a---------dv~Via~IGeRgrEv~efi~~~~~~~~l~rsvvv~atsd~p~~~R~~a~~~a~tiA 232 (427) T PRK08149 163 KTML-MNMLIEHTEA---------DVFVIGLIGERGREVTEFVEMLRASHKKEKCVLVYATSDFSSVDRCNAALVATTVA 232 (427) T ss_pred HHHH-HHHHHHHCCC---------CEEEEEEECCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHH T ss_conf 7799-9988863589---------98999970233789999999750368520368997158899899999998877899 Q ss_pred HHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCC Q ss_conf 75420468847996164789999987545303787543337520012467888875045445887356753100367664 Q gi|254780617|r 255 EYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDV 334 (509) Q Consensus 255 Eyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~ 334 (509) |||||+|+|||+++||+||||+|+||||+++||||+++|||||+|+.||+|+||||++. +||||++|+|++++||+ T Consensus 233 Eyfrd~G~~VLl~~Dsltr~A~A~REisl~~ge~P~~~GYp~svf~~l~~l~ERag~~~----~GSIT~~~tV~~~~dD~ 308 (427) T PRK08149 233 EYFRDQGKRVVLFIDSMTRYARALRDVALAAGELPARRGYPASVFDSLPRLLERPGATS----AGSITAFYTVLLESEEE 308 (427) T ss_pred HHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCC----CCCEEEEEEEEECCCCC T ss_conf 99986798379961757888889989998647999876878529988789876234678----98554675687248887 Q ss_pred CCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC---CCC Q ss_conf 43114668854061699615012210266543344412203434389887755678887998876787987633---899 Q gi|254780617|r 335 SAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFSS---DLD 411 (509) Q Consensus 335 ~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fgs---dlD 411 (509) ||||||+++|||||||||||+||++||||||||+.|+||+|+.++.+.|+++|+++|..|++|+|+++++++|. +.| T Consensus 309 tdPi~d~~~~ilDGhivLsr~la~~g~yPAIDvl~S~SR~~~~i~~~~h~~~a~~~r~~la~y~e~~~li~iG~y~~g~~ 388 (427) T PRK08149 309 ADPIGDEIRSILDGHIYLSRKLAGQGHYPAIDVLKSVSRVFGQVTDPKHRELAAAFRKLLTRLEELQLFIDLGEYRPGEN 388 (427) T ss_pred CCCHHHHHHHHCCCEEEECHHHHHCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 87646655550063699967688659999818751554364444259999999999999999999999998478767999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf 89999999999999996699988748899999999986 Q gi|254780617|r 412 SSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGI 449 (509) Q Consensus 412 ~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~ 449 (509) ++++.+++++++|++||+|+.+++.++++.+-.|..+. T Consensus 389 ~~~d~ai~~~~~i~~fL~Q~~~e~~~~~~t~~~l~~la 426 (427) T PRK08149 389 ADNDRAMDKRDSLEAFLKQPVDEKSSFSDTLERMNEFA 426 (427) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHH T ss_conf 88999999799999974899988849899999999864 No 33 >PRK09280 F0F1 ATP synthase subunit beta; Validated Probab=100.00 E-value=0 Score=890.98 Aligned_cols=420 Identities=24% Similarity=0.380 Sum_probs=382.9 Q ss_pred EEEEEEEEEECCEEEEECCCCCC---CCCEEEECC---CEEEEEEE-ECCCEEEEEEECCCCCCCCCCEEEECCCEEEEE Q ss_conf 10358999827589997588877---374799579---81899992-059859999805866867898889779878866 Q gi|254780617|r 26 SEIGRVLSIGDGIARVYGLNNIR---AGEMVQFSH---GVYGMALN-LEVDNVGVVILGSYKEISEGDIVKRTGRIVDVP 98 (509) Q Consensus 26 ~~~G~V~~V~~giv~v~GL~~a~---~GElv~~~~---g~~G~Vi~-l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vp 98 (509) ...|||++|.|.++++.+. ... +++.|++.+ ...++|++ ++++.|.+++++++.||++|++|.+||++++|| T Consensus 3 ~~~G~V~~V~G~vV~v~~~-~~~lp~i~~~l~~~~~~~~~~~EV~~~~g~~~v~~i~l~~t~Gl~~G~~V~~tg~~l~Vp 81 (466) T PRK09280 3 MNTGKIVQVIGPVVDVEFP-EGELPKIYNALEVENGGKKLVLEVAQHLGDGVVRTIAMGSTDGLVRGMEVKDTGAPISVP 81 (466) T ss_pred CCCEEEEEEECCEEEEEEC-CCCCCCHHEEEEECCCCCEEEEEEEEEECCCEEEEEECCCCCCCCCCCEEEECCCCEEEE T ss_conf 8743899998228999827-877541213999758995699998899669869999746867878999999589954987 Q ss_pred CCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHH Q ss_conf 79469830003002314466444544410010135773112660014441602331011115683365524778861158 Q gi|254780617|r 99 VGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSI 178 (509) Q Consensus 99 vG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l 178 (509) ||++|||||+|++|+||||+|++....+||+++++|++++|.++++||+||||+||+|+|||||||+|||||+|+|||+| T Consensus 82 VG~~lLGRV~d~lG~PiDg~g~i~~~~~~pi~~~~p~~~~r~~~~e~l~TGIkaID~l~pigkGqrigIfggaGvGKTvL 161 (466) T PRK09280 82 VGKATLGRIFNVLGEPIDEAGPIGAEERWPIHRKAPSFEELSTKTEILETGIKVIDLLAPYAKGGKIGLFGGAGVGKTVL 161 (466) T ss_pred ECHHHCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCEEEEECCCCCCCHHH T ss_conf 08644278868777526789987676545656789863235677863001813222316713774798557999980089 Q ss_pred HHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 99999998863024667876069999516776789999985114674211589972887888999998876666667542 Q gi|254780617|r 179 ILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFR 258 (509) Q Consensus 179 ~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr 258 (509) +.++|.|.+..+ ..+|||++||||+||+++|++++++.|+|+||++|+++||+||++|+++||+||||||||| T Consensus 162 l~eli~n~a~~~-------~~v~V~a~IGER~rE~~e~~~e~~~~g~l~~tvlV~~~~depp~~R~~~~~~a~tiAEyFR 234 (466) T PRK09280 162 IMELINNIAKEH-------GGYSVFAGVGERTREGNDLYHEMKESGVLDKTALVFGQMNEPPGARLRVALTGLTMAEYFR 234 (466) T ss_pred HHHHHHHHHHHC-------CCCEEEEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 999999999865-------9909999714236889999997664287634899996588996899999999999999988 Q ss_pred C-CCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCC Q ss_conf 0-468847996164789999987545303787543337520012467888875045445887356753100367664431 Q gi|254780617|r 259 D-NGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAY 337 (509) Q Consensus 259 ~-~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~ 337 (509) | +|+|||+++||+||||+|+||||+++||||+++||||++|+.|++|+||+++.+ +||||++|+|++|+||+||| T Consensus 235 D~~g~dVLl~~D~ltRfa~A~rEvslllge~Ps~~GYppsl~~~l~~L~ER~~~~~----~GSIT~i~~V~vp~DD~tdP 310 (466) T PRK09280 235 DVEGQDVLLFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLATEMGALQERITSTK----KGSITSVQAVYVPADDLTDP 310 (466) T ss_pred HCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCCC----CCCEEEEEEEEECCCCCCCC T ss_conf 61798289997266899999999998627999877889428989899998524689----98613789999548877787 Q ss_pred HHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-HCC-CCHHH Q ss_conf 146688540616996150122102665433444122034-343898877556788879988767879876-338-99899 Q gi|254780617|r 338 IPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGS-AAQVKAMKQVSGAVKGELAQYREMSSFSKF-SSD-LDSST 414 (509) Q Consensus 338 i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~-~~~~~~~~~~a~~lr~~laqy~Ele~f~~f-gsd-lD~~t 414 (509) ||++++|||||||||||+||++|+||||||+.|+||+|. ....+.|+++|+++|..|++|+|+++++++ |.| |++++ T Consensus 311 i~~~~~silDg~ivLsR~La~~G~yPAID~l~S~SRvm~p~ivg~~H~~~a~~vr~~L~~Y~elediIailG~deLs~~d 390 (466) T PRK09280 311 APATTFAHLDATTVLSRQIAELGIYPAVDPLDSTSRILDPLVVGEEHYDVAREVQQILQRYKELQDIIAILGMDELSEED 390 (466) T ss_pred CHHHHHHHCCEEEEEEHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHH T ss_conf 17765231340799726587679999868665401257832189999999999999999888899999974841079989 Q ss_pred HHHHHHHHHHHHHHCCCCCCC----------CCHHHHHHHHHHHHCCCCCCCC Q ss_conf 999999999999966999887----------4889999999998669806679 Q gi|254780617|r 415 QKFLSKGERLTELLKQPQFSP----------LAMEEQVVMIFAGISGCLDEVA 457 (509) Q Consensus 415 ~~~l~rg~ri~e~LkQ~~~~p----------~~~~~qv~~l~a~~~G~ld~i~ 457 (509) |..++|++||+.||+||+|.. .++++++.....+.+|-+|++| T Consensus 391 k~~v~rar~l~~fL~Qpf~vae~ftg~~g~~v~l~~t~~~~~~il~g~~d~~~ 443 (466) T PRK09280 391 KLTVARARKIQRFLSQPFFVAEVFTGSPGKYVPLKDTIRGFKGILDGEYDHLP 443 (466) T ss_pred HHHHHHHHHHHHHHCCCCEEHHCCCCCCCEEEEHHHHHHHHHHHHCCCCCCCC T ss_conf 99999999999986598641000269895285299999999998489877789 No 34 >CHL00060 atpB ATP synthase CF1 beta subunit Probab=100.00 E-value=0 Score=884.33 Aligned_cols=442 Identities=23% Similarity=0.333 Sum_probs=398.9 Q ss_pred CCCEEEEEEEEEEECCEEEEECCCCCCCCCE---EEECC--------CEEEEEEE-ECCCEEEEEEECCCCCCCCCCEEE Q ss_conf 5420103589998275899975888773747---99579--------81899992-059859999805866867898889 Q gi|254780617|r 22 DSEFSEIGRVLSIGDGIARVYGLNNIRAGEM---VQFSH--------GVYGMALN-LEVDNVGVVILGSYKEISEGDIVK 89 (509) Q Consensus 22 ~~~~~~~G~V~~V~~giv~v~GL~~a~~GEl---v~~~~--------g~~G~Vi~-l~~d~v~i~~l~~~~gI~~G~~V~ 89 (509) ..+.+++|+|++|.|.++.|.+ |...++++ |.+.+ ...+||.+ ++++.|++++|+++.||.+|++|. T Consensus 4 ~~~~k~~GrV~~V~G~VVdv~f-p~~~lp~i~~~l~~~~~~~~g~~~~~~~EV~~~lg~~~Vr~i~l~~t~GL~~G~~V~ 82 (480) T CHL00060 4 TLEKKNLGRITQIIGPVLDVAF-PPGKMPNIYNALVVKGRDTAGQEINVTCEVQQLLGNNRVRAVAMSATDGLMRGMEVI 82 (480) T ss_pred CCCCCCCCEEEEEEEEEEEEEE-CCCCCCHHHHEEEECCCCCCCCCCEEEEEEEEEECCCEEEEEECCCCCCCCCCCEEE T ss_conf 6654676089999932699993-787742144432424778777644189999999469979999656877889989999 Q ss_pred ECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEEC Q ss_conf 77987886679469830003002314466444544410010135773112660014441602331011115683365524 Q gi|254780617|r 90 RTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIG 169 (509) Q Consensus 90 ~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g 169 (509) +||++++||||++|||||+|++|+||||+||+.++++||+++++|++++|.++++||+||||+||+|+|||||||+|||| T Consensus 83 ~tg~~l~VPVG~~lLGRV~d~lG~PiDg~gpi~~~~~~pi~~~aP~~~~r~~~~e~leTGIkaID~l~pigkGQRigIfg 162 (480) T CHL00060 83 DTGAPLSVPVGGATLGRIFNVLGEPVDNLGPVDTRTTSPIHRSAPAFIQLDTKLSIFETGIKVVDLLAPYRRGGKIGLFG 162 (480) T ss_pred ECCCCEEEEECHHHCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCHHHHHCCCCCCCCCEEEEEC T ss_conf 58995498728533168788888835788998766503256789985555677861102704443125400366887656 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCC-------EEEEEEECCCCCHHH Q ss_conf 7788611589999999886302466787606999951677678999998511467421-------158997288788899 Q gi|254780617|r 170 DRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALS-------YSIVVVASASDPAPM 242 (509) Q Consensus 170 ~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~-------~tvvv~a~a~~~~~~ 242 (509) |+|+|||+|++++|.|++.. +..+|||++||||+||+.+|++++++.++++ ++++|++++|+||++ T Consensus 163 gaGvGKTvLl~eli~niak~-------~~~v~V~a~IGER~RE~~e~~~e~~e~gvl~~~~~~~s~~vlV~~~~depp~~ 235 (480) T CHL00060 163 GAGVGKTVLIMELINNIAKA-------HGGVSVFGGVGERTREGNDLYMEMKESGVINEQNIAESKVALVYGQMNEPPGA 235 (480) T ss_pred CCCCCHHHHHHHHHHHHHCC-------CCCEEEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCCCCHHH T ss_conf 89988789999999612003-------79889999966773679999999987185533554430489997356787688 Q ss_pred HHHHHHHHHHHHHHHHCC-CCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCE Q ss_conf 999887666666754204-6884799616478999998754530378754333752001246788887504544588735 Q gi|254780617|r 243 QLLAPFAGCAMGEYFRDN-GYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSL 321 (509) Q Consensus 243 r~~ap~~a~aiAEyfr~~-G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSi 321 (509) |+++||+|+||||||||+ |+|||++|||+||||+|+||||+++||||+++|||||+|+.||+|+||+++.+ .||| T Consensus 236 R~~~~~~a~tiAEyFRd~~~~dVLl~~D~ltR~A~A~REIslllge~P~~~GYppslf~~l~~L~ER~~~~~----~GSI 311 (480) T CHL00060 236 RMRVGLTALTMAEYFRDVNKQDVLLFIDNIFRFVQAGSEVSALLGRMPSAVGYQPTLSTEMGSLQERITSTK----EGSI 311 (480) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCCC----CCCE T ss_conf 999999988899999874897089995462899988878998627999866789648888778888632788----8741 Q ss_pred EEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCC-CCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 6753100367664431146688540616996150122102665433444122034-343898877556788879988767 Q gi|254780617|r 322 TALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGS-AAQVKAMKQVSGAVKGELAQYREM 400 (509) Q Consensus 322 T~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~-~~~~~~~~~~a~~lr~~laqy~El 400 (509) |++|+|++|+||+|||||||++|||||||||||+||++|+||||||+.|+||+|. ....++|+++|.++|..|++|+|+ T Consensus 312 Ta~~~V~v~~DD~tdPip~~~~silDg~ivLsr~La~~g~yPAID~l~S~Sr~m~p~iv~~~H~~~a~~v~~~L~~Y~el 391 (480) T CHL00060 312 TSIQAVYVPADDLTDPAPATTFAHLDATTVLSRGLAAKGIYPAVDPLDSTSTMLQPRIVGEEHYETAQRVKQTLQRYKEL 391 (480) T ss_pred EEEEEEEECCCCCCCCCHHHHHHHCCCEEEECHHHHHCCCCCCCCCCCCHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 56778970377666761455522125149983658878998973752010004797648999999999999999998999 Q ss_pred HHHHHH-HCC-CCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 879876-338-998999999999999999669998874889999999998669806679899999999999999741599 Q gi|254780617|r 401 SSFSKF-SSD-LDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQD 478 (509) Q Consensus 401 e~f~~f-gsd-lD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~e 478 (509) ++++++ |.| ||+++|.+++|+++|+.||+||+| ++-.|.+..|.+..++ +.+..|++.+-..++....+ T Consensus 392 ~diIailG~deLs~~dk~~v~rar~l~~fl~Qpf~--------~~e~ftg~~g~~v~l~-~t~~~~~~il~g~~d~~~e~ 462 (480) T CHL00060 392 QDIIAILGLDELSEEDRLTVARARKIERFLSQPFF--------VAEVFTGSPGKYVGLA-ETIRGFQLILSGELDGLPEQ 462 (480) T ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCH--------HHHHHCCCCCEEEEHH-HHHHHHHHHHCCCCCCCCHH T ss_conf 99999758642798899999999999999658522--------3454159995386099-99999999848987779978 Q ss_pred HHHHHH Q ss_conf 999887 Q gi|254780617|r 479 ILEDIR 484 (509) Q Consensus 479 i~~~I~ 484 (509) .+..+. T Consensus 463 ~~~~~g 468 (480) T CHL00060 463 AFYLVG 468 (480) T ss_pred HHHCCC T ss_conf 963046 No 35 >PRK05922 type III secretion system ATPase; Validated Probab=100.00 E-value=0 Score=880.72 Aligned_cols=421 Identities=24% Similarity=0.343 Sum_probs=396.1 Q ss_pred HHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECC----CEEEEEEEECCCEEEEEEECCCCCCCCCC Q ss_conf 99999842455542010358999827589997588877374799579----81899992059859999805866867898 Q gi|254780617|r 11 ILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSH----GVYGMALNLEVDNVGVVILGSYKEISEGD 86 (509) Q Consensus 11 ~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~----g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~ 86 (509) -.+..++++++ ++.+|+|++|.|++++++|+ ++.+||+|+|+. .+.|+|++|+++.+.+++|+++.||+.|+ T Consensus 6 ~~~~~~~~~~p---~r~~GrV~~V~G~~ie~~g~-~~~iGelc~I~~~~~~~i~aeVvgf~~~~~~l~p~~~~~Gi~~G~ 81 (434) T PRK05922 6 EEKLLIHQWQP---YRECGLLSRVSGNLLEAQGL-SACLGELCQISLPKSPPILAEVIGFHNQTTLLMSLSPIHYVALGA 81 (434) T ss_pred HHHHHHHHCCC---CEEEEEEEEEECEEEEEEEC-CCCCCCEEEEEECCCCEEEEEEEEEECCEEEEEECCCCCCCCCCC T ss_conf 78989872798---54467999996649999806-878798599981899878999998729979999777766789999 Q ss_pred EEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEE Q ss_conf 88977987886679469830003002314466444544410010135773112660014441602331011115683365 Q gi|254780617|r 87 IVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQREL 166 (509) Q Consensus 87 ~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~ 166 (509) +|.+|+++++||||++|||||+|++|+||||++++...+++|+++.||||++|.++++||+||||+||+|+|||||||+| T Consensus 82 ~V~~~g~~~~v~vg~~lLGRVld~~G~PiDg~~~~~~~~~~~i~~~~p~p~~R~~i~e~L~TGIraID~l~pigrGQR~g 161 (434) T PRK05922 82 EVLPLRRPPSLHLSDHLLGRVLDGFGNPLDGKEQLPKTHLKPLFSPPPSPMSRQPIQEIFPTGIKAIDAFLTLGKGQRIG 161 (434) T ss_pred EEEECCCCCEEECCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCEEEECCCCCCCCCEEE T ss_conf 99989998767768061465726884644899999987605245899482221688875547855673567724676777 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHH Q ss_conf 52477886115899999998863024667876069999516776789999985114674211589972887888999998 Q gi|254780617|r 167 IIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLA 246 (509) Q Consensus 167 I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~a 246 (509) ||||+|+|||+|+ .+|.+| .+.++|||++||||+||+++|++.+.+.++++||++|++++|+||++|+++ T Consensus 162 If~g~GvGKt~Ll-~~ia~~---------~~~~v~V~alIGeR~rEv~efie~~~~~~~~~~tvvv~atsd~p~~~r~~a 231 (434) T PRK05922 162 VFSEPGSGKSSLL-STIAKG---------SKSTINVIALIGERGREVREYIEQHKEGLKAQRTIIIASPAHETAPTKVIA 231 (434) T ss_pred ECCCCCCCHHHHH-HHHHHC---------CCCCEEEEEEEEEEHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHH T ss_conf 3079999789999-867653---------678759998455138999999998661345210378750377887888999 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEE Q ss_conf 87666666754204688479961647899999875453037875433375200124678888750454458873567531 Q gi|254780617|r 247 PFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPV 326 (509) Q Consensus 247 p~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~ 326 (509) ||+|||+||||||+|+|||+++|||||||+|||||||++||||+|+|||||+||.||+|+||||+.+ .||||+||+ T Consensus 232 ~~~a~aiAEyfrd~G~~VLl~~Dsltr~A~A~REisl~~gepP~~~gYp~svf~~l~~L~ERag~~~----~GSITa~~t 307 (434) T PRK05922 232 GRAAMTIAEYFRDQGHRVLFIMDSLSRWIAALQEVALARGETLSAHHYAASVFHHVSEFTERAGNND----KGSITALYA 307 (434) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCC----CCCCCEEEE T ss_conf 9887679999997799679995478899999989998648999876889449998788886043799----965564989 Q ss_pred EECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00367664431146688540616996150122102665433444122034343898877556788879988767879876 Q gi|254780617|r 327 IETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKF 406 (509) Q Consensus 327 v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~f 406 (509) |++++|| +|||+|+++|||||||||||+ +++++||||||+.|+||+|++++.+.|+++|+++|.+|++|+|+++|+++ T Consensus 308 Vl~~~~~-~dpi~d~~~silDGhIvLsr~-~~~~~~PAIDvl~S~SR~~~~~~~~~h~~~a~~~r~~la~y~e~~dli~~ 385 (434) T PRK05922 308 ILHYPNH-PDIFTDYLKSLLDGHFFLTPQ-GKALASPPIDILTSLSRSARQLALPHHYAAAEELRSLLKAYHEALDIIQL 385 (434) T ss_pred EEECCCC-CCCCCCCCHHHEEEEEEECHH-HHHCCCCCCCCCCCHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9853898-765677333022224675354-66157897565667645142335699999999999999999999999985 Q ss_pred HC---CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 33---8998999999999999999669998874889999999998669 Q gi|254780617|r 407 SS---DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISG 451 (509) Q Consensus 407 gs---dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G 451 (509) |+ +.|+.+++++++.++|+.||+|+..+..++++.+..|-.+.++ T Consensus 386 G~y~~g~d~~~D~ai~~~~~i~~fL~Q~~~e~~~~~~t~~~l~~l~~~ 433 (434) T PRK05922 386 GAYTPGQDAHLDRAVKLLPSIKQFLSQPLSSYCALHNTLKQLEALLKH 433 (434) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCC T ss_conf 297678876699999999999997189999973999999999998748 No 36 >COG1157 FliI Flagellar biosynthesis/type III secretory pathway ATPase [Cell motility and secretion / Intracellular trafficking and secretion] Probab=100.00 E-value=0 Score=865.19 Aligned_cols=419 Identities=30% Similarity=0.455 Sum_probs=392.8 Q ss_pred HHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECCC-----EEEEEEEECCCEEEEEEECCCCCCCCCCEEEE Q ss_conf 8424555420103589998275899975888773747995798-----18999920598599998058668678988897 Q gi|254780617|r 16 IKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSHG-----VYGMALNLEVDNVGVVILGSYKEISEGDIVKR 90 (509) Q Consensus 16 i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~g-----~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~ 90 (509) ...+.........|+|++|.+.++++.| |.+.+||+|.++.. ..+||++|+++.+.++||++..|+.+|++|.+ T Consensus 13 ~~~~~~~~~~~~~G~v~~v~G~~lea~g-~~~~iGelc~i~~~~~~~~~~aEVvgf~~~~~~L~p~~~~~gv~~g~~V~~ 91 (441) T COG1157 13 ELALRNTDPYKRRGRLTRVTGLLLEAVG-PQARIGELCKIERSRGSEKVLAEVVGFNEERVLLMPFEPVEGVSPGAEVVP 91 (441) T ss_pred HHHHCCCCCCEEEEEEEEEEEEEEEEEC-CCCCCCCEEEEEECCCCCCEEEEEEEECCCEEEEECCCCCCCCCCCCEEEE T ss_conf 4310247761698789998524999844-887623569999658887414799997287689962676556778878974 Q ss_pred CCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECC Q ss_conf 79878866794698300030023144664445444100101357731126600144416023310111156833655247 Q gi|254780617|r 91 TGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGD 170 (509) Q Consensus 91 tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~ 170 (509) +++.+++|+|++|||||+|++|+||||++++....++|++.+||||+.|++|++||.||||+||+|+|||+|||+|||++ T Consensus 92 ~~~~~~v~~g~~lLGRVld~~G~plDg~~~~~~~~~~~l~~~pp~pm~R~~I~~~l~tGVRaIDgllT~G~GQRiGIFAg 171 (441) T COG1157 92 TGRPLSVPVGDALLGRVLDGLGRPLDGGGLPDGTERRPLDAPPPNPLKRRPIEEPLDTGVRAIDGLLTCGKGQRIGIFAG 171 (441) T ss_pred CCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCEEEECCCCCCCCCEEEEEEC T ss_conf 59754236686665410066888577899998763231457997821026546645456144401002246746788716 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 78861158999999988630246678760699995167767899999851146742115899728878889999988766 Q gi|254780617|r 171 RKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAG 250 (509) Q Consensus 171 ~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a 250 (509) +|+|||+| +.||.+.. +..+.|+++||||||||+||++.......|+|||+|+||||+||.+|..|+++| T Consensus 172 sGVGKStL-LgMiar~t---------~aDv~ViaLIGERGREVrEFIE~~Lg~egl~rsViVvATSD~s~l~R~~aa~~A 241 (441) T COG1157 172 SGVGKSTL-LGMIARNT---------EADVNVIALIGERGREVREFIEKDLGEEGLKRSVVVVATSDESALMRLKAAFTA 241 (441) T ss_pred CCCCHHHH-HHHHHCCC---------CCCEEEEEEEECCCHHHHHHHHHHCCHHHCCCEEEEEECCCCCHHHHHHHHHHH T ss_conf 99868999-99996246---------698799997606640599999986023303526999978988989998889999 Q ss_pred HHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECC Q ss_conf 66667542046884799616478999998754530378754333752001246788887504544588735675310036 Q gi|254780617|r 251 CAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQ 330 (509) Q Consensus 251 ~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~ 330 (509) ++|||||||||++||++|||+||||+|+|||+|+.||||.++|||||||+.+|+|+||||+. +.||||+||||+++ T Consensus 242 t~IAEyFRDqG~~VLL~mDSlTRfA~AqREI~LA~GEpP~~kGYppSVF~~LP~LlERaG~~----~~GsITafYTVLve 317 (441) T COG1157 242 TTIAEYFRDQGKRVLLIMDSLTRFAMAQREIGLAAGEPPATKGYPPSVFSELPRLLERAGNG----DKGSITAFYTVLVE 317 (441) T ss_pred HHHHHHHHHCCCEEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCC----CCCCEEEEEEEEEE T ss_conf 99999998679859999613889999998888752899866898955988748988604899----99817899999850 Q ss_pred CCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC-- Q ss_conf 766443114668854061699615012210266543344412203434389887755678887998876787987633-- Q gi|254780617|r 331 VNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFSS-- 408 (509) Q Consensus 331 ~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fgs-- 408 (509) +||++|||+|+++||+||||||||+||++||||||||+.|+||||+.+.++.|++.|.++|.+|+.|+|.+++.++|. T Consensus 318 GDD~~dPiaD~~RsILDGHIvLsR~LA~~ghyPaIdvl~SiSRvm~~i~~~~h~~~a~~~r~lls~y~e~edLi~iGaY~ 397 (441) T COG1157 318 GDDMNDPIADEVRSILDGHIVLSRALAEAGHYPAIDVLASISRVMPQIVSEEHRKAARRLRQLLSRYEENEDLIRIGAYQ 397 (441) T ss_pred CCCCCCCHHHHHHHHCCCEEEEEHHHHHCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 68888850456664226528973868855999974467778777664089999999999999999988778999863856 Q ss_pred -CCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf -89989999999999999996699988748899999999986 Q gi|254780617|r 409 -DLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGI 449 (509) Q Consensus 409 -dlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~ 449 (509) +.|+..++++++-..|.+||+|+..+..++++.+-.|.... T Consensus 398 ~G~D~~~D~Ai~~~p~i~~fL~Q~~~e~~~~~~t~~~L~~~~ 439 (441) T COG1157 398 KGSDPELDKAIKLYPKIEQFLKQGIDEKSSFEETLEQLEAIL 439 (441) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH T ss_conf 789878899997509999997189544579999999999875 No 37 >PRK02118 V-type ATP synthase subunit B; Provisional Probab=100.00 E-value=0 Score=796.47 Aligned_cols=400 Identities=23% Similarity=0.318 Sum_probs=371.7 Q ss_pred EEEEEEEEEECCEEEEECCCCCCCCCEEEECC---CEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHH Q ss_conf 10358999827589997588877374799579---818999920598599998058668678988897798788667946 Q gi|254780617|r 26 SEIGRVLSIGDGIARVYGLNNIRAGEMVQFSH---GVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLE 102 (509) Q Consensus 26 ~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~---g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~ 102 (509) +++|+|.+|.|.++.+.+ +++.+||+|++.. ...|+|+++++|.+.+++|+++.||++|++|.+||++++||||++ T Consensus 3 k~yg~V~~I~GplI~v~~-~~~~~gElv~I~~~~~~~~gEVI~~~~d~v~iqvfe~T~Gi~~G~~V~~tG~~l~V~vg~~ 81 (432) T PRK02118 3 KIYTKITSIKGNLITVEA-EGVGYGELATVERKGRSSLASVLKLDGDKVTLQVFGGTSGISTGDEVVFLGRPMQVTFSDN 81 (432) T ss_pred CCCCEEEEEECCEEEEEE-CCCCCCCEEEEECCCCEEEEEEEEEECCEEEEEECCCCCCCCCCCEEEECCCCCEEEECHH T ss_conf 221469999883899997-8898787899984997799999998199899998469878999999996899767882865 Q ss_pred HHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHH Q ss_conf 98300030023144664445444100101357731126600144416023310111156833655247788611589999 Q gi|254780617|r 103 LLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDT 182 (509) Q Consensus 103 lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~ 182 (509) |||||+|++|+||||++++.. +.+++.++++||+.|.+++++|+|||++||+|+||+||||++|||++|+|||+|+. + T Consensus 82 lLGRV~DglGrPiDggp~~~~-~~~~i~~~~inP~~R~~p~e~l~TGI~aID~l~~l~rGQKi~IFsgsG~gks~L~~-~ 159 (432) T PRK02118 82 LLGRRFNGTGKPIDGGPELEG-EPIEIGGPSVNPVKRIVPREMIRTGIPMIDVFNTLVKSQKIPIFSSSGEPYNALLA-R 159 (432) T ss_pred HCCCEECCCCCCCCCCCCCCC-CEEECCCCCCCCHHHCCCCCCCCCCCEEEECCCCCCCCCEEEEECCCCCCHHHHHH-H T ss_conf 437864476761589987889-88613699989642068876778896785423674058569874479997639999-9 Q ss_pred HHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH-CCC Q ss_conf 9998863024667876069999516776789999985114674211589972887888999998876666667542-046 Q gi|254780617|r 183 FLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFR-DNG 261 (509) Q Consensus 183 I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr-~~G 261 (509) |.+|+. ..+|||++||||++|+.+|++.+++.++|+||++|+++||+||++|+++||+|+|+||||| ++| T Consensus 160 Ia~~a~---------~dvvV~~~iGer~~e~~~f~~~~~~~~~l~rtvlv~~ts~~p~~~R~~~~~~altiAEyfr~d~G 230 (432) T PRK02118 160 IALQAE---------ADIIILGGMGLTFDDYLFFKDEFEKAGALDRTVMFVHTAVDPPVECLLVPDMALAVAEKFAVEEG 230 (432) T ss_pred HHHHCC---------CCEEEEECCEEECHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 988538---------99896403324412589999987506877707998347778989987788888889999987459 Q ss_pred CCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHH Q ss_conf 88479961647899999875453037875433375200124678888750454458873567531003676644311466 Q gi|254780617|r 262 YHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTN 341 (509) Q Consensus 262 ~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~ 341 (509) +|||+++|||||||+|+||||++++|||+|+||||++|+.|++|+||||++.+ +||||+||+|++|+||+||||||| T Consensus 231 ~~VLli~DdlTr~A~A~REIs~~~ge~P~r~GYP~~l~s~La~l~ERAg~~~~---~GSIT~i~~v~~p~dD~t~Pi~d~ 307 (432) T PRK02118 231 KKVLVLLTDMTNFADALKEISITMDQIPSNRGYPGSLYSDLAKRYEKAVDFAD---GGSITIIAVTTMPGDDITHPVPDN 307 (432) T ss_pred CEEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCC---CEEEEEEEEEECCCCCCCCCHHHH T ss_conf 72999976678877788789875489998778695399999999983468999---802689999975787777756889 Q ss_pred HHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCC----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 885406169961501221026654334441220343438----9887755678887998876787987633899899999 Q gi|254780617|r 342 VISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQV----KAMKQVSGAVKGELAQYREMSSFSKFSSDLDSSTQKF 417 (509) Q Consensus 342 ~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~----~~~~~~a~~lr~~laqy~Ele~f~~fgsdlD~~t~~~ 417 (509) ++|||||||||||+ +|||+.|+||+|..+.. +.|+++|++++..||+|++++++..++.+|++.+++. T Consensus 308 t~~ItdGqIvLsr~--------~id~l~SlSRl~~~~ig~~tr~dh~~~~~~l~~~ya~~~~~~~~~~~~~~ls~~d~~~ 379 (432) T PRK02118 308 TGYITEGQFYLKRG--------RIDPFGSLSRLKQLVIGKKTREDHGDLMNAMIRLYADSRKAKEKMAMGFKLSNWDEKL 379 (432) T ss_pred HHHHHCCEEEEECC--------CCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 99985776885068--------7576443777765656542499999999999999986557999999976459878999 Q ss_pred HHHHHHHHH-HHCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 999999999-966999887488999999999866 Q gi|254780617|r 418 LSKGERLTE-LLKQPQFSPLAMEEQVVMIFAGIS 450 (509) Q Consensus 418 l~rg~ri~e-~LkQ~~~~p~~~~~qv~~l~a~~~ 450 (509) |+++++++. |+.|+ +.++++++.-+.|-+.. T Consensus 380 l~f~~~fe~~f~~~~--~~~~~~~~ld~~w~~l~ 411 (432) T PRK02118 380 LAFSELFESRLMDLE--VNIPLEEALDLGWKILA 411 (432) T ss_pred HHHHHHHHHHHHCCC--CCCCHHHHHHHHHHHHH T ss_conf 999999999973578--99658999999999998 No 38 >cd01136 ATPase_flagellum-secretory_path_III Flagellum-specific ATPase/type III secretory pathway virulence-related protein. This group of ATPases are responsible for the export of flagellum and virulence-related proteins. The bacterial flagellar motor is similar to the F0F1-ATPase, in that they both are proton driven rotary molecular devices. However, the main function of the bacterial flagellar motor is to rotate the flagellar filament for cell motility. Intracellular pathogens such as Salmonella and Chlamydia also have proteins which are similar to the flagellar-specific ATPase, but function in the secretion of virulence-related proteins via the type III secretory pathway. Probab=100.00 E-value=0 Score=797.26 Aligned_cols=323 Identities=33% Similarity=0.522 Sum_probs=311.8 Q ss_pred EEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCC Q ss_conf 78866794698300030023144664445444100101357731126600144416023310111156833655247788 Q gi|254780617|r 94 IVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKT 173 (509) Q Consensus 94 ~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~ 173 (509) +++||||++|||||+|++|+||||+|++...+++|++++||||++|+++++||+||||+||+|+|||||||++||||+|+ T Consensus 1 Pl~vpVg~~lLGRVid~~G~PiDg~~~i~~~~~~pi~~~ap~~~~R~~v~e~l~TGI~~ID~l~pigrGQR~~Ifg~~g~ 80 (326) T cd01136 1 PLSVPVGDALLGRVLDAFGEPLDGKGPLGKEVRYPLLRTPPNPLKRRPIDEVLPTGVRAIDGLLTVGKGQRLGIFAGSGV 80 (326) T ss_pred CCEEECCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHCCCCCCCCCCEEEEECCCCC T ss_conf 96535576651747678834157999999986152568995803307888751157704200266247987874369999 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 61158999999988630246678760699995167767899999851146742115899728878889999988766666 Q gi|254780617|r 174 GKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAM 253 (509) Q Consensus 174 GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~ai 253 (509) |||+| +.+|.||.. ..+|||++||||++|+++|++++.+.++|+||++|++++|+||.+|+++||+|+++ T Consensus 81 GKt~L-l~~i~~~~~---------~~v~V~~~IGer~rev~e~~~~~~~~~~l~~tvvv~atad~~~~~r~~a~~~a~~~ 150 (326) T cd01136 81 GKSTL-LGMIARGTT---------ADVNVIALIGERGREVREFIEKDLGEEGLKRSVVVVATSDESPLLRVKAAYTATAI 150 (326) T ss_pred CCCHH-HHHHHHCCC---------CCEEEEEEECCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHH T ss_conf 84667-876864146---------77599996053388999999986245653222899606889989999988877777 Q ss_pred HHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCC Q ss_conf 67542046884799616478999998754530378754333752001246788887504544588735675310036766 Q gi|254780617|r 254 GEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVND 333 (509) Q Consensus 254 AEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D 333 (509) ||||||+|+|||+++|||||||+||||+|+++||||+++||||++||.|++|+||||+.. +||||+||+|++++|| T Consensus 151 AEyfrd~G~dVL~~~Dsltr~A~A~rEisl~~g~~P~~~gyp~~~f~~~~~l~ERag~~~----~GSIT~i~~v~~~~dd 226 (326) T cd01136 151 AEYFRDQGKDVLLLMDSLTRFAMAQREIGLAAGEPPTTKGYPPSVFALLPRLLERAGNSD----KGSITAFYTVLVEGDD 226 (326) T ss_pred HHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCC----CCCEEEEEEEECCCCC T ss_conf 889997587337875246899999999888648998756677668888789888641799----9852444578705988 Q ss_pred CCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC---CC Q ss_conf 443114668854061699615012210266543344412203434389887755678887998876787987633---89 Q gi|254780617|r 334 VSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFSS---DL 410 (509) Q Consensus 334 ~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fgs---dl 410 (509) +|||||||++|||||||||||+||++|+||||||+.|+||+|++++.+.|+++|+++|..|++|+|+++|++||. +. T Consensus 227 ~tdpi~~~~~~i~Dg~ivLsr~La~~g~~PAIDv~~S~SR~~~~~~~~~~~~~a~~~r~~l~~y~e~~~li~~g~y~~g~ 306 (326) T cd01136 227 LNEPIADAVRSILDGHIVLSRALAAAGHYPAIDVLKSISRLMNAVVTPEHKEAARKLRELLSAYQEVEDLIRIGAYKKGS 306 (326) T ss_pred CCCCCHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 78862788987547389985889967999974886275336600067999999999999999899999999854976798 Q ss_pred CHHHHHHHHHHHHHHHHHCC Q ss_conf 98999999999999999669 Q gi|254780617|r 411 DSSTQKFLSKGERLTELLKQ 430 (509) Q Consensus 411 D~~t~~~l~rg~ri~e~LkQ 430 (509) |++++++++|+++|++||+| T Consensus 307 d~~~d~~i~~~~~i~~fL~Q 326 (326) T cd01136 307 DPEVDEAIKLLPKIEAFLKQ 326 (326) T ss_pred CHHHHHHHHHHHHHHHHHCC T ss_conf 98999999989999998586 No 39 >cd01135 V_A-ATPase_B V/A-type ATP synthase (non-catalytic) subunit B. These ATPases couple ATP hydrolysis to the build up of a H+ gradient, but V-type ATPases do not catalyze the reverse reaction. The Vacuolar (V-type) ATPase is found in the membranes of vacuoles, the golgi apparatus and in other coated vesicles in eukaryotes. Archaea have a protein which is similar in sequence to V-ATPases, but functions like an F-ATPase (called A-ATPase). A similar protein is also found in a few bacteria. This subfamily consists of the non-catalytic beta subunit. Probab=100.00 E-value=0 Score=737.99 Aligned_cols=275 Identities=28% Similarity=0.458 Sum_probs=262.3 Q ss_pred EEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCC Q ss_conf 78866794698300030023144664445444100101357731126600144416023310111156833655247788 Q gi|254780617|r 94 IVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKT 173 (509) Q Consensus 94 ~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~ 173 (509) .++||||++|||||+|++|+||||+|++...+++|+++++|||++|+++++||+||||+||+|+|||||||+|||||+|+ T Consensus 1 sl~VpVG~~lLGRVvd~lG~PiDg~~~i~~~~~~~i~~~ap~p~~R~~v~e~l~TGIkaID~l~pig~GQR~gIfgg~Gv 80 (276) T cd01135 1 TLKVPVSEDMLGRIFNGSGKPIDGGPEILAEEYLDINGPPINPVARIYPEEMIQTGISAIDGMNTLVRGQKIPIFSGSGL 80 (276) T ss_pred CEEEECCCCCCCCEECCCCCCCCCCCCCCCCCEEECCCCCCCCHHCCCCCCCCCCCCEEEECCCCCCCCCEEEECCCCCC T ss_conf 96987477733747458833127999999986402458997804406778632258535405467236766332057886 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 61158999999988630246678760699995167767899999851146742115899728878889999988766666 Q gi|254780617|r 174 GKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAM 253 (509) Q Consensus 174 GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~ai 253 (509) |||+|+.+++.|++. + ...++++|||++||||+||+++|++++++.++|+||++|+++||+||++|++|||+|||| T Consensus 81 GKs~L~~~i~~~~~~-~---~~~~~~v~V~~~IGer~rev~e~~~~l~~~~~l~~tvvv~ata~~~p~~r~~a~~~a~ai 156 (276) T cd01135 81 PHNELAAQIARQAGV-V---GEEENFAVVFAAMGITMEDARFFKDDFEETGALERVVLFLNLANDPTIERIITPRMALTT 156 (276) T ss_pred CHHHHHHHHHHHHHH-C---CCCCCCEEEEEECCEEHHHHHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHH T ss_conf 367899999998775-1---368873599961555325799999998716651210146634889768887888887789 Q ss_pred HHHHH-CCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCC Q ss_conf 67542-04688479961647899999875453037875433375200124678888750454458873567531003676 Q gi|254780617|r 254 GEYFR-DNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVN 332 (509) Q Consensus 254 AEyfr-~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~ 332 (509) ||||| ++|||||++||||||||+|||||||+++|||+|+|||||+||.||+|+||||+++++ +||||+||+|++++| T Consensus 157 AEyFr~~~Gk~VLl~~D~ltr~A~A~REisl~~g~~P~~~gYp~~vf~~l~~l~ERag~~~~~--~GSITa~~~v~~~~d 234 (276) T cd01135 157 AEYLAYEKGKHVLVILTDMTNYAEALREISAAREEVPGRRGYPGYMYTDLATIYERAGRVEGR--NGSITQIPILTMPND 234 (276) T ss_pred HHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCC--CEEEEEEEEEECCCC T ss_conf 999887369977999456889999999999864899987888850998867888722467999--801899989944798 Q ss_pred CCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHC Q ss_conf 644311466885406169961501221026654334441220 Q gi|254780617|r 333 DVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRV 374 (509) Q Consensus 333 D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv 374 (509) |+|||||||++|||||||||||+||++|+|||||++.|+||| T Consensus 235 D~~~pi~~~~~si~DG~i~Lsr~la~~G~~PAId~~~S~SRv 276 (276) T cd01135 235 DITHPIPDLTGYITEGQIVLDRQLHNRGIYPPINVLPSLSRL 276 (276) T ss_pred CCCCCHHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCCCCC T ss_conf 867766777765604599997999967999970875576679 No 40 >cd01132 F1_ATPase_alpha F1 ATP synthase alpha, central domain. The F-ATPase is found in bacterial plasma membranes, mitochondrial inner membranes and in chloroplast thylakoid membranes. It has also been found in the archaea Methanosarcina barkeri. It uses a proton gradient to drive ATP synthesis and hydrolyzes ATP to build the proton gradient. The extrinisic membrane domain, F1, is composed of alpha, beta, gamma, delta and epsilon subunits with a stoichiometry of 3:3:1:1:1. The alpha subunit of the F1 ATP synthase can bind nucleotides, but is non-catalytic. Probab=100.00 E-value=0 Score=728.87 Aligned_cols=274 Identities=78% Similarity=1.230 Sum_probs=268.6 Q ss_pred EEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCC Q ss_conf 78866794698300030023144664445444100101357731126600144416023310111156833655247788 Q gi|254780617|r 94 IVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKT 173 (509) Q Consensus 94 ~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~ 173 (509) +++||||++|||||+|++|+||||++++..+++||+++++|||++|.++++||+||||+||+|+|||||||++|||++|+ T Consensus 1 v~~VpVg~~lLGRVid~~G~PiDg~~~~~~~~~~pi~~~~p~p~~R~~i~e~l~TGI~aID~l~pig~GQr~~If~~~g~ 80 (274) T cd01132 1 IADVPVGEALLGRVVDALGNPIDGKGPIETKERRPIESKAPGIIPRKSVNEPLQTGIKAIDAMIPIGRGQRELIIGDRQT 80 (274) T ss_pred CEEEECCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCEEEECCCCCCCCCEEECCCCCCC T ss_conf 96867474636737568833027999999985020258895852126878611038345413466247867515588875 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 61158999999988630246678760699995167767899999851146742115899728878889999988766666 Q gi|254780617|r 174 GKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAM 253 (509) Q Consensus 174 GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~ai 253 (509) |||+|++++|+||+. ++++|||++||||++|+.+|++++++.++++||++|++++|+||.+|+++||+||++ T Consensus 81 GKt~l~~~~i~~~~~--------~~~~~V~~~IGer~rEv~ef~~~~~~~~~l~~tv~v~~t~~~p~~~r~~a~~~a~~i 152 (274) T cd01132 81 GKTAIAIDTIINQKG--------KKVYCIYVAIGQKASTVAQVVKTLEEHGAMEYTIVVAATASDPAPLQYLAPYTGCAM 152 (274) T ss_pred CHHHHHHHHHHHHHC--------CCCEEEEEEECCCHHHHHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCH T ss_conf 578899999997413--------696599997324522699999987605762011477404778758777654412226 Q ss_pred HHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCC Q ss_conf 67542046884799616478999998754530378754333752001246788887504544588735675310036766 Q gi|254780617|r 254 GEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVND 333 (509) Q Consensus 254 AEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D 333 (509) ||||||+|||||+++|||||||+||||||++++|||+++|||||+||+||+|+||||++++.+|+||||++|+|++++|| T Consensus 153 AEyfrd~Gk~VLll~DslTr~A~A~rEisl~~ge~P~~~gyp~~vf~~~~~l~ERag~~~~~~~~GSiT~~~~v~~~~dD 232 (274) T cd01132 153 GEYFMDNGKHALIIYDDLSKQAVAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKLNDELGGGSLTALPIIETQAGD 232 (274) T ss_pred HHHHHHCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEEEEEECCCCC T ss_conf 78998779947999978899999999999972799977777962787768999863215699988142177899735877 Q ss_pred CCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCC Q ss_conf 443114668854061699615012210266543344412203 Q gi|254780617|r 334 VSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVG 375 (509) Q Consensus 334 ~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~ 375 (509) +|||||||++|||||||||||+||++||||||||+.|+|||| T Consensus 233 ~t~pi~d~~~~i~dg~ivLsr~la~~g~yPAIdvl~S~SRvg 274 (274) T cd01132 233 VSAYIPTNVISITDGQIFLETDLFNKGIRPAINVGLSVSRVG 274 (274) T ss_pred CCCCHHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCCCCCC T ss_conf 788516667222045999979999679999808775767789 No 41 >PRK04192 V-type ATP synthase subunit A; Provisional Probab=100.00 E-value=0 Score=692.90 Aligned_cols=426 Identities=23% Similarity=0.371 Sum_probs=371.7 Q ss_pred EEEEEEEEECCEEEEECCCCCCCCCEEEECC-CEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHHHHH Q ss_conf 0358999827589997588877374799579-818999920598599998058668678988897798788667946983 Q gi|254780617|r 27 EIGRVLSIGDGIARVYGLNNIRAGEMVQFSH-GVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLELLG 105 (509) Q Consensus 27 ~~G~V~~V~~giv~v~GL~~a~~GElv~~~~-g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lLG 105 (509) ..|+|.+|.|+++.+.|++++.++|+|++.+ +..|+|+.+++|.+.+++|+++.||++|++|.+||++++|++|++||| T Consensus 3 ~~G~I~~I~GPlV~~e~~~~~~~~EvV~VG~~~L~GEVI~i~gd~a~iQVyE~T~Gl~~G~~V~~TG~pLsV~LGpgLLG 82 (585) T PRK04192 3 TKGKIVRVSGPLVVAEGMGGARMYEVVKVGEEGLIGEIIRVRGDEASIQVYEETSGIKPGEPVEFTGEPLSVELGPGLLG 82 (585) T ss_pred CCCEEEEEECCEEEEEECCCCCCCCEEEECCCCEEEEEEEEECCEEEEEECCCCCCCCCCCEEEECCCCEEEEECHHHHH T ss_conf 67369999888899952788864667998898557999999499899996668889999898884799449886762552 Q ss_pred CCCCHHHCCCCCCC--------------CC-------------------------------------------------- Q ss_conf 00030023144664--------------44-------------------------------------------------- Q gi|254780617|r 106 RVVDALGNPIDGKG--------------PI-------------------------------------------------- 121 (509) Q Consensus 106 RVvD~lG~PlDg~g--------------~i-------------------------------------------------- 121 (509) ||+|++|||||+.+ ++ T Consensus 83 rIfDGiqrPLd~i~~~~g~fi~rG~~~~~Ldr~~~w~f~P~~~~Gd~v~~Gdilg~V~Et~~i~H~imvPp~~~g~v~~i 162 (585) T PRK04192 83 SIYDGIQRPLDELAEKSGDFLKRGVYVPALDREKKWHFKPTVKVGDKVVAGDVLGTVQETGSIEHKIMVPPGVSGTIKEI 162 (585) T ss_pred CCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCEECCCCCCCEEEEE T ss_conf 62475778633455432665447887888774444531111355874157865788504665334110698887338997 Q ss_pred -------------------------CCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCH Q ss_conf -------------------------5444100101357731126600144416023310111156833655247788611 Q gi|254780617|r 122 -------------------------KCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKT 176 (509) Q Consensus 122 -------------------------~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt 176 (509) ..-+.||+.++.| +.+|.++++||.||+|+||+|+||+|||+.+|+|++|+||| T Consensus 163 ~~~G~ytv~d~ia~v~~~~G~~~~~~m~q~WPVR~prP-~~~r~~p~~PL~TG~RvID~lfpi~rGgt~~IpGgfG~GKT 241 (585) T PRK04192 163 ASEGDYTVDDTIAVVEDEDGKGKELTMMQKWPVRRPRP-YKEKLPPVEPLITGQRVIDTFFPVAKGGTAAIPGPFGSGKT 241 (585) T ss_pred ECCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCC-HHHCCCCCCCCCCCCEEECCCCCCCCCCEEECCCCCCCCHH T ss_conf 05885043016999862678743433110276668896-23305866541037333002332026862321266666646 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC-------CCCCCEEEEEEECCCCCHHHHHHHHHH Q ss_conf 5899999998863024667876069999516776789999985114-------674211589972887888999998876 Q gi|254780617|r 177 SIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED-------RGALSYSIVVVASASDPAPMQLLAPFA 249 (509) Q Consensus 177 ~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~-------~~~~~~tvvv~a~a~~~~~~r~~ap~~ 249 (509) ++ +.+|..+ ++..++||++||||++|+++|++++.+ ...|+|||+|+||||+|+++|..++|+ T Consensus 242 vl-~~~lak~---------s~aDivVyvgcGERgnEm~evl~eFpel~Dp~tG~~lm~RTVliaNTSnmPvaaReaSiyt 311 (585) T PRK04192 242 VT-QHQLAKW---------ADADIVIYVGCGERGNEMTEVLEEFPELKDPKTGRPLMERTVLIANTSNMPVAAREASIYT 311 (585) T ss_pred HH-HHHHHHC---------CCCCEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCHHHCCEEEEEECCCCCHHHHHHHHHH T ss_conf 67-9999742---------6899899997325408699999987760375467012030899999999857887651567 Q ss_pred HHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCC-CCCEEEEEEEE Q ss_conf 66666754204688479961647899999875453037875433375200124678888750454458-87356753100 Q gi|254780617|r 250 GCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALG-AGSLTALPVIE 328 (509) Q Consensus 250 a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g-~GSiT~~~~v~ 328 (509) |+|+||||||+|+|||+++||+||||+|+||||.+++|+||++|||+.+++.++.+|||||+.....+ .||+|.+.+|. T Consensus 312 giTiAEYfRDmG~~VllmaDStSRwAeAlREIS~rleE~Pg~eGYPaYL~SrLA~fYERAGrV~~~~~~~GSvT~igaVS 391 (585) T PRK04192 312 GITIAEYYRDMGYDVLLMADSTSRWAEALREISGRLEEMPGEEGYPAYLASRLAEFYERAGRVKTLGGSEGSVTIIGAVS 391 (585) T ss_pred HHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCEEECCCCCCCCEEEEEEEC T ss_conf 88899999976970899975668999998898786436985346782389899999874245550589976279988865 Q ss_pred CCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCC-------CHHHHHHHHHHHHHHHHHHHHH Q ss_conf 367664431146688540616996150122102665433444122034343-------8988775567888799887678 Q gi|254780617|r 329 TQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQ-------VKAMKQVSGAVKGELAQYREMS 401 (509) Q Consensus 329 ~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~-------~~~~~~~a~~lr~~laqy~Ele 401 (509) .||||+|+|+..++..++....-||++||+++|||||||+.|.|+...... .+...++..+.+.+|++..||+ T Consensus 392 ppGgDfsePVT~~Tl~~v~~fw~Ld~~lA~~rhfPainwl~SyS~y~~~~~~~~~~~~~~~~~~~r~~~~~iL~~e~~l~ 471 (585) T PRK04192 392 PPGGDFSEPVTQNTLRIVKVFWALDADLAYRRHYPAINWLTSYSLYLDDVADWWEENVSPDWRELRDEAMSLLQREAELQ 471 (585) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 89988677357888988878875137677556688667024367788889999987418159999999999987268999 Q ss_pred HHHHH-HCC-CCHHHHHHHHHHHHHHH-HHCCCCCCCC----CHHHHHHHHHHHHC-------CCCCCCCHHHHHH Q ss_conf 79876-338-99899999999999999-9669998874----88999999999866-------9806679899999 Q gi|254780617|r 402 SFSKF-SSD-LDSSTQKFLSKGERLTE-LLKQPQFSPL----AMEEQVVMIFAGIS-------GCLDEVAVSQVRK 463 (509) Q Consensus 402 ~f~~f-gsd-lD~~t~~~l~rg~ri~e-~LkQ~~~~p~----~~~~qv~~l~a~~~-------G~ld~i~v~~I~~ 463 (509) .+.++ |.| |.+..+-.++.++.|++ ||+|+.|+++ |+.+|..+|-.+.. -.-.++++++|.+ T Consensus 472 eiv~lvG~d~l~~~~~l~l~~a~~i~~~fLqQnaf~~~D~~~~~~kq~~ml~~i~~~~~~~~~~~~~g~~~~~i~~ 547 (585) T PRK04192 472 EIVRLVGEDALSEEDRLILEVARLIREDFLQQNAFDPVDTYCPPEKQYEMLKLILTFYDEAFKALEAGVPLSEIFE 547 (585) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHC T ss_conf 9998647755898897899999886550301478995024899899999999999999999999986998999856 No 42 >cd01133 F1-ATPase_beta F1 ATP synthase beta subunit, nucleotide-binding domain. The F-ATPase is found in bacterial plasma membranes, mitochondrial inner membranes and in chloroplast thylakoid membranes. It has also been found in the archaea Methanosarcina barkeri. It uses a proton gradient to drive ATP synthesis and hydrolyzes ATP to build the proton gradient. The extrinisic membrane domain, F1, is composed of alpha, beta, gamma, delta and epsilon subunits with a stoichiometry of 3:3:1:1:1. The beta subunit of ATP synthase is catalytic. Probab=100.00 E-value=0 Score=693.91 Aligned_cols=272 Identities=28% Similarity=0.465 Sum_probs=261.7 Q ss_pred EEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCC Q ss_conf 78866794698300030023144664445444100101357731126600144416023310111156833655247788 Q gi|254780617|r 94 IVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKT 173 (509) Q Consensus 94 ~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~ 173 (509) +++||||++|||||+|++|+||||++++...++||+++++||+++|+++++||+||||+||+|+|||||||++|||++|+ T Consensus 1 Pl~ipVg~~lLGRVvd~~G~PiDg~~~i~~~~~~pi~~~~p~~~~R~~~~e~l~TGIr~ID~l~pigkGQR~~I~~~~g~ 80 (274) T cd01133 1 PISVPVGPETLGRIFNVLGEPIDERGPIKTKKTWPIHREAPEFVEQSTKTEILETGIKVIDLLAPYAKGGKIGLFGGAGV 80 (274) T ss_pred CCEEECCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCEEEECCCCCCCCCEEEEECCCCC T ss_conf 96577680421656368965248999999986042568996845607878731158666644466147857787579999 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 61158999999988630246678760699995167767899999851146742115899728878889999988766666 Q gi|254780617|r 174 GKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAM 253 (509) Q Consensus 174 GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~ai 253 (509) |||+|+.++|.|+... ++.+|||++||||++|+++|++++++.++|+||++|++++|+||.+|+++||+||++ T Consensus 81 GKt~ll~~ii~~~~~~-------~~~v~V~~~IGer~~ev~~~~~~~~~~~~l~~tvvv~~tad~~~~~r~~~~~~a~ai 153 (274) T cd01133 81 GKTVLIMELINNIAKA-------HGGYSVFAGVGERTREGNDLYHEMKESGVLSKTALVYGQMNEPPGARARVALTGLTM 153 (274) T ss_pred CCCHHHHHHHHHHHHC-------CCCEEEEEEECCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHH T ss_conf 8236899999999850-------898799998425548899999972035665337999834555405789999999999 Q ss_pred HHHHHC-CCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCC Q ss_conf 675420-4688479961647899999875453037875433375200124678888750454458873567531003676 Q gi|254780617|r 254 GEYFRD-NGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVN 332 (509) Q Consensus 254 AEyfr~-~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~ 332 (509) |||||| +|+|||+++|||||||+||||+|++++|||+++||||++|+.|++|+||+++. ++||||+||+|++++| T Consensus 154 AE~frd~~G~dVLll~DslTr~A~A~reis~~~g~~P~~~gyp~~l~~~~~~l~ER~~~~----~~GSiT~i~tv~~~~d 229 (274) T cd01133 154 AEYFRDEEGQDVLLFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLATEMGALQERITST----KKGSITSVQAVYVPAD 229 (274) T ss_pred HHHHHHCCCCEEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCC----CCCCEEEEEEEECCCC T ss_conf 999997289859999718689999999888862899986666830788999999974488----8987468889974588 Q ss_pred CCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCC Q ss_conf 64431146688540616996150122102665433444122034 Q gi|254780617|r 333 DVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGS 376 (509) Q Consensus 333 D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~ 376 (509) |++||||||++|||||||+|||+||++|+||||||+.|+||+|. T Consensus 230 D~~dpi~~~~~~i~dG~ivLsr~la~~g~yPAIdv~~S~SR~m~ 273 (274) T cd01133 230 DLTDPAPATTFAHLDATTVLSRAIAELGIYPAVDPLDSTSRILD 273 (274) T ss_pred CCCCCHHHHHHHHHCEEEEECHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 87661778887740729998588996799996377653101258 No 43 >COG0055 AtpD F0F1-type ATP synthase, beta subunit [Energy production and conversion] Probab=100.00 E-value=0 Score=637.56 Aligned_cols=438 Identities=25% Similarity=0.400 Sum_probs=397.2 Q ss_pred EEEEEEEEECCEEEEECCCC----CCCCCEEEECCC-----EEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEE Q ss_conf 03589998275899975888----773747995798-----189999205985999980586686789888977987886 Q gi|254780617|r 27 EIGRVLSIGDGIARVYGLNN----IRAGEMVQFSHG-----VYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDV 97 (509) Q Consensus 27 ~~G~V~~V~~giv~v~GL~~----a~~GElv~~~~g-----~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~v 97 (509) ..|+|++|.+.++.|. +++ ..++..++..++ +.-....+.++.|.++.++.++|+..|+.|..||++++| T Consensus 2 ~~G~vvqv~g~VvdV~-F~~~~~lP~I~naL~~~~~~~~~~~leV~q~lg~~~VR~Iam~~t~gl~rg~~v~dtg~pi~V 80 (468) T COG0055 2 NKGKVVQVIGPVVDVE-FPEEDELPEIYNALEVQNGNQGTLVLEVAQHLGDNVVRTIAMGSTDGLVRGLEVIDTGKPISV 80 (468) T ss_pred CCCEEEEEEEEEEEEE-ECCCCCCCHHHHHHEECCCCCCEEEEEHHHHHCCCEEEEEEECCCCCCCCCCEEECCCCCEEE T ss_conf 8756999982179998-058666720434526435775257530687738883899996375671037477507996687 Q ss_pred ECCHHHHHCCCCHHHCCCCCCCCCCCC--CCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCC Q ss_conf 679469830003002314466444544--410010135773112660014441602331011115683365524778861 Q gi|254780617|r 98 PVGLELLGRVVDALGNPIDGKGPIKCE--QRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGK 175 (509) Q Consensus 98 pvG~~lLGRVvD~lG~PlDg~g~i~~~--~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GK 175 (509) |||++.|||++|.+|+|||+.+|+... ++||+++++|...+...-+|.|+||||+||.|+|+-||.|+|+|||+|+|| T Consensus 81 PVG~~~lgri~nvlG~~iD~~~~~~~~~~~~~~Ih~~~p~~~e~~~~~EIleTGIKVIDll~P~~kGgKiGLFGGAGVGK 160 (468) T COG0055 81 PVGKGTLGRIFNVLGEPIDEKGPIKAEDFEKWPIHRKAPSFEELSTKTEILETGIKVIDLLAPYAKGGKIGLFGGAGVGK 160 (468) T ss_pred ECCHHHCCCCHHCCCCCCCCCCCCCCCCCEEEECCCCCCCHHHCCCCCHHHHHCCEEEEEECCCCCCCEEEEECCCCCCC T ss_conf 52632212310204773466688875555020125799835540773044330736899841114686444423677562 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 15899999998863024667876069999516776789999985114674211589972887888999998876666667 Q gi|254780617|r 176 TSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGE 255 (509) Q Consensus 176 t~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAE 255 (509) |++++++|.|-+..+. .++||+++|||.||..+++.++++.+.+++|++|+.++|+||+.|++.+.+|+++|| T Consensus 161 TVl~~ELI~Nia~~h~-------g~SVFaGvGERtREGndLy~Em~es~vl~ktalv~gQMNEpPGaR~RValtGlT~AE 233 (468) T COG0055 161 TVLIQELINNIAKEHG-------GYSVFAGVGERTREGNDLYHEMKESGVLDKTALVFGQMNEPPGARMRVALTGLTMAE 233 (468) T ss_pred EEEHHHHHHHHHHHCC-------CEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEEEEECCCCCCCEEEEHHHHHHHHH T ss_conf 0109999999999749-------868997135321115778999986388773269996057998641455112331999 Q ss_pred HHHCC-CCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCC Q ss_conf 54204-68847996164789999987545303787543337520012467888875045445887356753100367664 Q gi|254780617|r 256 YFRDN-GYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDV 334 (509) Q Consensus 256 yfr~~-G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~ 334 (509) ||||+ |+|||+++|+++||.+|..|+|.+|||.|+..||+|++-..++.|.||....++ ||||++++|++|+||+ T Consensus 234 yfRD~~gqdVLlFIDNIfRftQAGsEVSalLGr~PSavGYQpTLatemg~lQERItstk~----GSITSiQavyvPaDDl 309 (468) T COG0055 234 YFRDEEGQDVLLFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLATEMGQLQERITSTKK----GSITSVQAVYVPADDL 309 (468) T ss_pred HHHCCCCCEEEEEEHHHHHHHHCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCC----CCEEEEEEEEECCCCC T ss_conf 864003875899842556776231389998456964145674067788889999753788----8668899998613237 Q ss_pred CCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-HCC-CC Q ss_conf 43114668854061699615012210266543344412203-4343898877556788879988767879876-338-99 Q gi|254780617|r 335 SAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVG-SAAQVKAMKQVSGAVKGELAQYREMSSFSKF-SSD-LD 411 (509) Q Consensus 335 ~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~-~~~~~~~~~~~a~~lr~~laqy~Ele~f~~f-gsd-lD 411 (509) |||.|..+.+++|..++|||++|+.|+||||||+.|.||.. +....+.|..+|.+++..|++|+||++++.+ |-| |+ T Consensus 310 TDPapattFaHLDat~vLsR~iaa~GIyPAvDPL~StSr~l~p~ivGe~Hy~va~~vq~iLqrYkeLqDIIaILGmdELs 389 (468) T COG0055 310 TDPAPATTFAHLDATTVLSRQIAALGIYPAVDPLDSTSRALDPKIVGEEHYEVAREVQSILQRYKELQDIIAILGMDELS 389 (468) T ss_pred CCCCHHHHHHHCCCCEEEEHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCC T ss_conf 89525556551344135637678668876658443422015864145899999999999999989999999983832128 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89999999999999996699988748899999999986698066798999999999999997415999998873 Q gi|254780617|r 412 SSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQDILEDIRK 485 (509) Q Consensus 412 ~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ei~~~I~~ 485 (509) +++|..+.|+++|+.||.||+| ++.-|.+..|.+..++ +.|+.|+..+-...+..+...+..+.+ T Consensus 390 eedk~~V~rArki~~FlSQpF~--------vAE~FTg~pG~~V~l~-dti~~fk~Il~G~yd~~pE~aF~~vG~ 454 (468) T COG0055 390 EEDKLTVARARKIQRFLSQPFF--------VAEVFTGSPGKYVPLK-DTIRGFKRILEGKYDHLPEQAFYMVGS 454 (468) T ss_pred HHHHHHHHHHHHHHHHHCCCCH--------HHHHCCCCCCEEEEHH-HHHHHHHHHHCCCCCCCCHHHHHHCCC T ss_conf 6678999999999998558620--------1210048996164489-999999998478766698889731174 No 44 >cd01134 V_A-ATPase_A V/A-type ATP synthase catalytic subunit A. These ATPases couple ATP hydrolysis to the build up of a H+ gradient, but V-type ATPases do not catalyze the reverse reaction. The Vacuolar (V-type) ATPase is found in the membranes of vacuoles, the golgi apparatus and in other coated vesicles in eukaryotes. Archaea have a protein which is similar in sequence to V-ATPases, but functions like an F-ATPase (called A-ATPase). A similar protein is also found in a few bacteria. Probab=100.00 E-value=0 Score=637.42 Aligned_cols=333 Identities=24% Similarity=0.338 Sum_probs=281.9 Q ss_pred EEEEEEECCEEE-EECCCCCCCCCEEEECCCEEEEEEEECCCEEEEEEECCCCCCCCCCE---EEEC----CCEEEEECC Q ss_conf 589998275899-97588877374799579818999920598599998058668678988---8977----987886679 Q gi|254780617|r 29 GRVLSIGDGIAR-VYGLNNIRAGEMVQFSHGVYGMALNLEVDNVGVVILGSYKEISEGDI---VKRT----GRIVDVPVG 100 (509) Q Consensus 29 G~V~~V~~giv~-v~GL~~a~~GElv~~~~g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~---V~~t----g~~~~vpvG 100 (509) |-+.+|.|||.+ ..+|.++..|+.+.......++-.+.+-+.+.++-.|+ .|..||. |.-+ .+++..|.. T Consensus 9 Gll~~I~DGIqRPL~~l~e~~~g~fi~rG~~~~~Ld~~~~w~f~p~v~~Gd--~V~~GDi~G~V~E~~~i~HkImvpp~~ 86 (369) T cd01134 9 GLLGSIYDGIQRPLDKIAEITKGIFIPRGVNVPALDRDKKWDFKPLVKVGD--HVTGGDILGTVPENSLIEHKIMVPPRV 86 (369) T ss_pred CEECCCCCCCCCCHHHHHHHHCCCEECCCCCCCCCCCCCCCCCCCCCCCCC--EECCCCEEEEEECCCCEEEEEECCCCC T ss_conf 557120313566466887764795017999885767634015500267799--962786799985578367778648987 Q ss_pred HHHHHCCCCHHHCCCCC----------CCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECC Q ss_conf 46983000300231446----------64445444100101357731126600144416023310111156833655247 Q gi|254780617|r 101 LELLGRVVDALGNPIDG----------KGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGD 170 (509) Q Consensus 101 ~~lLGRVvD~lG~PlDg----------~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~ 170 (509) .+.+++|...-.-++|. +.++...++||+..+.| +.+|.++++||+||||+||+|+|||||||++|||| T Consensus 87 ~G~v~~i~~~g~y~v~~~~~~~~~~g~~~~~~m~q~WPVr~prP-v~er~~~~~PL~TGir~ID~l~Pig~Gqr~~I~g~ 165 (369) T cd01134 87 RGTVTYIAPAGDYTVDDVILEVEFDGKKEEITMVQKWPVRQPRP-VKEKLPPNEPLLTGQRVLDTLFPVVKGGTAAIPGP 165 (369) T ss_pred CEEEEEEECCCCCEEEEEEEEECCCCCEEEEEEEEECCCCCCCC-CCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEECC T ss_conf 37999991589852465699970689846877887077566775-11138999862653568874253214764677668 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH-------CCCCCCCEEEEEEECCCCCHHHH Q ss_conf 78861158999999988630246678760699995167767899999851-------14674211589972887888999 Q gi|254780617|r 171 RKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL-------EDRGALSYSIVVVASASDPAPMQ 243 (509) Q Consensus 171 ~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l-------~~~~~~~~tvvv~a~a~~~~~~r 243 (509) +|||||+|+ ++|.+|+ +..+|||++||||++|+++|+++| ...+.|+||++|++|||+|+++| T Consensus 166 ~g~GKT~l~-~~i~k~~---------~~dv~Vyv~iGeRg~ev~e~l~~f~el~~~~~g~~~~~rtvvVa~ts~~p~~~r 235 (369) T cd01134 166 FGCGKTVIQ-QSLSKYS---------NSDIVIYVGCGERGNEMTEVLEEFPELTDPVTGEPLMKRTVLIANTSNMPVAAR 235 (369) T ss_pred CCCCHHHHH-HHHHHCC---------CCCEEEEEEEECCHHHHHHHHHHHHHHCCCCCCCCHHHCEEEEECCCCCCHHHH T ss_conf 776899999-9998537---------998899999711418999999986774133346641020368742777998899 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCC---CCCCCCC Q ss_conf 9988766666675420468847996164789999987545303787543337520012467888875045---4458873 Q gi|254780617|r 244 LLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMS---DALGAGS 320 (509) Q Consensus 244 ~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~---~~~g~GS 320 (509) ++++|+||||||||||+|+|||+++||+||||+||||||++++|||+++||||++|+.|++|+||||++. +..++|| T Consensus 236 ~~s~y~g~tiAEyfRd~G~~Vll~~DslsR~A~A~REisl~l~e~P~~egYP~~l~s~l~~l~ERag~~~~~~~~~~~GS 315 (369) T cd01134 236 EASIYTGITIAEYFRDMGYNVALMADSTSRWAEALREISGRLEEMPGEEGYPAYLGARLASFYERAGRVKCLGSPGREGS 315 (369) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCE T ss_conf 88898999999999968988799807589999998899886389987657193199888899872144323689999811 Q ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHC Q ss_conf 567531003676644311466885406169961501221026654334441220 Q gi|254780617|r 321 LTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRV 374 (509) Q Consensus 321 iT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv 374 (509) ||++|+|++++||+|+||++|++|||||||+||++||++||||||||+.|+||. T Consensus 316 iT~~~~V~~~ggD~s~pI~~~~~si~d~~i~L~~~La~~g~~PAId~~~S~Sry 369 (369) T cd01134 316 VTIVGAVSPPGGDFSEPVTQATLRIVQVFWGLDKKLAQRRHFPSINWLISYSKY 369 (369) T ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHCEEEEECHHHHHCCCCCCCCCCCCCCCC T ss_conf 778989975698867757888876621588985889867999971874366588 No 45 >KOG1353 consensus Probab=100.00 E-value=0 Score=628.89 Aligned_cols=340 Identities=66% Similarity=0.992 Sum_probs=333.8 Q ss_pred HHHHHHHHHHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEECCCEEEEEEEECCCEEEEEEECCCCCCCCCCE Q ss_conf 99999999842455542010358999827589997588877374799579818999920598599998058668678988 Q gi|254780617|r 8 ISDILRKRIKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSHGVYGMALNLEVDNVGVVILGSYKEISEGDI 87 (509) Q Consensus 8 is~~l~~~i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~ 87 (509) ||++++++|.+++....+.+.|+|.+|+|||++|+||.++..+|+++|++|.+|+.++++.++|++++|++..-+++|+. T Consensus 1 ~SsI~EErI~~~~~~a~leEtg~VLsIGdGIArV~GL~nvQAeE~vEFssGlKgmalnle~~~vg~v~~g~d~~ikeg~~ 80 (340) T KOG1353 1 MSSIFEERIVGDNTSADLEETGRVLSIGDGIARVYGLTNVQAEEMVEFSSGLKGMALNLEGENVGVVVFGEDSLIKEGDT 80 (340) T ss_pred CCHHHHHHHHCCCCCCCHHHCCCEEEECCCEEEEECCCCCCHHHHHHHHCCCCCHHCCCCCCCEEEEEECCCCEECCCCE T ss_conf 93377898635566555444363678747340221533003677885632432100256677158999757402214755 Q ss_pred EEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEE Q ss_conf 89779878866794698300030023144664445444100101357731126600144416023310111156833655 Q gi|254780617|r 88 VKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELI 167 (509) Q Consensus 88 V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I 167 (509) |.+|+....||+|+.|||||+|++|+|+||+||+...+++ ++.|..+.+|.+||++++|+++|||||||++| T Consensus 81 VkrTgaIvDVpvg~~LlgrvvdAlGn~idgkG~i~~~~~~--------ii~r~sv~epmqtg~KAvdslVpigRgqrELi 152 (340) T KOG1353 81 VKRTAAISDVPPLKALLGRVGCALGEPIDGNGKISAKERR--------IIPRASVDEPMQTGLKAVDSLVPIGRGQRELI 152 (340) T ss_pred EEEEEEEECCCCHHHHHHHHHHHHCCEECCCCCCCCCCCC--------CCCCEEEECHHHHHHHHHHCEEEECCCCEEEE T ss_conf 8966665126716777522045306743588875632123--------24531240523666668660210335856787 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHH Q ss_conf 24778861158999999988630246678760699995167767899999851146742115899728878889999988 Q gi|254780617|r 168 IGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAP 247 (509) Q Consensus 168 ~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap 247 (509) +|++++|||+|+.|+|+||+..|...+++++.+|||++||++.+.|+.++..|.+.++|+||++|++|++ T Consensus 153 IgdRqTGkTsla~dTI~nqk~~N~~~~ekkKiyCvyvaigqkrStvaqlv~~l~~~~a~~y~ivv~atas---------- 222 (340) T KOG1353 153 IGDRQTGKTSLAIDTILNQKRGNECLDEKKKIYCVYVAIGQKRSTVAQLVQRLEEADAMEYSIVVAATAS---------- 222 (340) T ss_pred ECCCCCCCEEEEEHHHHHHHHHCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEEEEECC---------- T ss_conf 3156668545410444454641234343350799999606104089999988875587227999985013---------- Q ss_pred HHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEE Q ss_conf 76666667542046884799616478999998754530378754333752001246788887504544588735675310 Q gi|254780617|r 248 FAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVI 327 (509) Q Consensus 248 ~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v 327 (509) T Consensus 223 -------------------------------------------------------------------------------- 222 (340) T KOG1353 223 -------------------------------------------------------------------------------- 222 (340) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred ECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 03676644311466885406169961501221026654334441220343438988775567888799887678798763 Q gi|254780617|r 328 ETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQYREMSSFSKFS 407 (509) Q Consensus 328 ~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laqy~Ele~f~~fg 407 (509) +++|+++|||+|++|| ||||||+.+||.+|++||||++.|+||||+++|+++||++|+++|+.||||||+++|+||| T Consensus 223 --q~gdvsayiptnvisi-dgqi~l~t~lfy~girpainvg~svsrvgsaaq~kamkqvag~~klelaq~revaafaqfg 299 (340) T KOG1353 223 --QAGDVSAYIPTNVISI-DGQIFLETELFYKGIRPAINVGLSVSRVGSAAQTKAMKQVAGSLKLELAQYREVAAFAQFG 299 (340) T ss_pred --CCCCEEEECCCCEEEE-CCHHHHHHHHHHHCCCHHHEEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf --5666246625432455-3256777888771563342420574411546778899987304223787788899998743 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH Q ss_conf 38998999999999999999669998874889999999998 Q gi|254780617|r 408 SDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAG 448 (509) Q Consensus 408 sdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~ 448 (509) ||||+.|+..|.||.|++|+|||+||.|+.+++|+..+|++ T Consensus 300 sdlda~tq~~l~rg~rltellkq~qy~p~~~e~qv~~iy~g 340 (340) T KOG1353 300 SDLDAATQQLLNRGVRLTELLKQGQYAPLAIEEQVAVIYAG 340 (340) T ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHEEEEECC T ss_conf 53047899999760679999845788982123320247449 No 46 >COG1156 NtpB Archaeal/vacuolar-type H+-ATPase subunit B [Energy production and conversion] Probab=100.00 E-value=0 Score=604.30 Aligned_cols=435 Identities=25% Similarity=0.396 Sum_probs=394.5 Q ss_pred EEEEEEEEEEECCEEEEECCCCCCCCCEEEEC--CC--EEEEEEEECCCEEEEEEECCCCCCCCCCE-EEECCCEEEEEC Q ss_conf 01035899982758999758887737479957--98--18999920598599998058668678988-897798788667 Q gi|254780617|r 25 FSEIGRVLSIGDGIARVYGLNNIRAGEMVQFS--HG--VYGMALNLEVDNVGVVILGSYKEISEGDI-VKRTGRIVDVPV 99 (509) Q Consensus 25 ~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~--~g--~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~-V~~tg~~~~vpv 99 (509) ..++-+|.+|.|.++.|+|-.++.+||++++. +| ..|.|+.++.+.+.+++|+.+.|+...+. |.++|+.+.+|| T Consensus 4 ~~~Y~~i~~i~Gplv~ve~~eg~~y~E~v~i~~~~G~~r~gqVle~~~~~a~vQVfegT~Gl~~~~t~vrF~g~~l~i~v 83 (463) T COG1156 4 VKEYTTISEIKGPLIIVEGVEGASYGELVEIETPDGEVRRGQVLEVRGDKAVVQVFEGTSGLDTKGTTVRFTGETLKIPV 83 (463) T ss_pred CCEEEEEEEECCCEEEEECCCCCCCCEEEEEECCCCCEEEEEEEECCCCEEEEEEEECCCCCCCCCCEEEEECCEEEEEE T ss_conf 42023467731544999645677764379998799876434575315864899995044577888855899486378763 Q ss_pred CHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 94698300030023144664445444100101357731126600144416023310111156833655247788611589 Q gi|254780617|r 100 GLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 100 G~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +++||||++|++|+||||.+.+..++...++++|.||..|..+.++++|||++||+|.++.||||.+||.++|..+..|+ T Consensus 84 s~dllGRifnG~G~PiDggp~i~~e~~~dI~g~~~NP~aR~yP~efIqTgIsaIDg~NtLvrgQKlPIFSgSGlphN~La 163 (463) T COG1156 84 SEDLLGRIFNGSGKPIDGGPEIVPEDRLDINGAPINPYARIYPEEFIQTGISAIDGMNTLVRGQKLPIFSGSGLPHNELA 163 (463) T ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHH T ss_conf 28766544057888478998678775533578878915654726676529608764346533654660127998437999 Q ss_pred HHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH- Q ss_conf 9999998863024667876069999516776789999985114674211589972887888999998876666667542- Q gi|254780617|r 180 LDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFR- 258 (509) Q Consensus 180 ~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr- 258 (509) . .|..|+... ......-.||+++|-+..|...|.+.+++.|++++++++.+.|++|+.+|.++|..|++.||||. T Consensus 164 a-qIarQA~v~---~~~e~favVfaamGit~eea~fF~~~fe~tGal~r~vlflnlA~dp~vEri~tPr~aLt~AEylA~ 239 (463) T COG1156 164 A-QIARQATVD---GEEEEFAVVFAAMGITHEEALFFMDEFEETGALDRAVLFLNLADDPAVERIITPRMALTVAEYLAF 239 (463) T ss_pred H-HHHHHCCCC---CCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHHHC T ss_conf 9-999753667---875430588740473189999999888762246666765423678743476131689999999852 Q ss_pred CCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCH Q ss_conf 04688479961647899999875453037875433375200124678888750454458873567531003676644311 Q gi|254780617|r 259 DNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYI 338 (509) Q Consensus 259 ~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i 338 (509) .+++|||+++.|||+||+|+||||.+.++.|||+||||++|+.++.+|||||+.... +||||.+|+.++|+||+|||| T Consensus 240 e~~~hVLVilTDMTnyceALREIsaareeVPgrRGYPGymYTdLatiYErAg~i~g~--~GSiTqipIlTMP~DDITHPI 317 (463) T COG1156 240 EKDMHVLVILTDMTNYCEALREISAAREEVPGRRGYPGYMYTDLATIYERAGRIRGR--KGSITQIPILTMPGDDITHPI 317 (463) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCEECCC--CCCEEEEEEEECCCCCCCCCC T ss_conf 578669999701467899999987641437776788616677889998765033057--872698776526998767877 Q ss_pred HHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCC-----HHHHHHHHHHHHHHHHHHHHHHHHHHHC--CCC Q ss_conf 466885406169961501221026654334441220343438-----9887755678887998876787987633--899 Q gi|254780617|r 339 PTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQV-----KAMKQVSGAVKGELAQYREMSSFSKFSS--DLD 411 (509) Q Consensus 339 ~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~-----~~~~~~a~~lr~~laqy~Ele~f~~fgs--dlD 411 (509) ||++++||+|||||+|+|.++|+||+|||+.|+||+|..+.. +.|+.+++++...||+.+++..+...+. .|+ T Consensus 318 PDlTGYITEGQivl~r~l~~~gIyPpi~vlpSLSRL~~~giG~g~TReDH~~~snql~a~YA~g~d~r~l~avvge~aLs 397 (463) T COG1156 318 PDLTGYITEGQIVLSRDLHRKGIYPPINVLPSLSRLMKDGIGEGKTREDHGDVSNQLYAAYAEGRDLRELVAVVGEEALS 397 (463) T ss_pred CCCCCEECCCEEEEEHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCC T ss_conf 76555331655999710024886797233612777765144788664410788999999985024089999875244303 Q ss_pred HHHHHHHHHHHHHHH-HHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 899999999999999-9669998874889999999998669806679899999999999 Q gi|254780617|r 412 SSTQKFLSKGERLTE-LLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFL 469 (509) Q Consensus 412 ~~t~~~l~rg~ri~e-~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~ 469 (509) +.+++.|+.+..++. |++|..++-.|+++...+- |.+|.-.|-...+....+++ T Consensus 398 ~~D~~~l~F~d~FE~~fi~qg~~enrsieetLdlg----W~lL~~lp~~el~ri~~~~I 452 (463) T COG1156 398 ERDRKYLKFADLFEQRFIKQGRYENRSIEETLDLG----WELLSILPESELTRIKPDLI 452 (463) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHH----HHHHHHCCHHHHCCCCHHHH T ss_conf 35799999999999999863432477889998875----89987487877615778899 No 47 >TIGR01043 ATP_syn_A_arch ATP synthase archaeal, A subunit; InterPro: IPR005726 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. A-ATPases (or A1A0-ATPase) (3.6.3.14 from EC) are found exclusively in Archaea and display a close resemblance in structure and subunit composition with V-ATPases, although their function in both ATP synthesis and ATP hydrolysis is closer to that of F-ATPases . A-ATPases are composed of two linked complexes: the A1 complex consisting of seven subunits contains the catalytic core that synthesizes/hydrolyses ATP, while the A0 complex consisting of at least two subunits forms the membrane-spanning pore . The rotary motor in A-ATPases is composed of only two subunits, the stator subunit I and the rotor subunit C . A-ATPases may have arisen as an adaptation to the different cellular needs and the more extreme environmental conditions faced by Archaeal species. This entry represents the alpha subunit from the A1 complex of A-ATPases. The A1 complex contains three copies each of subunits alpha (or A) and beta (or B) (IPR005724 from INTERPRO), both of which form the headpiece that participates in nucleotide binding. However, only the alpha subunit is catalytic, the beta subunit being regulatory in function . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0016820 hydrolase activity acting on acid anhydrides catalyzing transmembrane movement of substances, 0015986 ATP synthesis coupled proton transport, 0016021 integral to membrane, 0016469 proton-transporting two-sector ATPase complex. Probab=100.00 E-value=0 Score=534.20 Aligned_cols=460 Identities=23% Similarity=0.330 Sum_probs=378.6 Q ss_pred EEEEEEEECCEEEEECCCCCCCCCEEEEC-CCEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHHHHHC Q ss_conf 35899982758999758887737479957-98189999205985999980586686789888977987886679469830 Q gi|254780617|r 28 IGRVLSIGDGIARVYGLNNIRAGEMVQFS-HGVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLELLGR 106 (509) Q Consensus 28 ~G~V~~V~~giv~v~GL~~a~~GElv~~~-~g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lLGR 106 (509) .|+|.+|.|+++.+.||.+|.++|+|... .+..|||+.+.+|.+.++++++|.||++|++|..||.+++|-+|++||+. T Consensus 2 ~G~i~~~~GP~v~A~g~~~a~My~~v~Vg~~~L~GEi~~~~gd~~~IQVYE~T~G~~Pge~v~~tG~pLsveLGPGll~~ 81 (584) T TIGR01043 2 KGEIVRVSGPLVVAEGLKGAQMYEVVKVGEEGLIGEIIRIEGDKAVIQVYEETSGIKPGEPVERTGAPLSVELGPGLLGS 81 (584) T ss_pred CCEEEEEECCEEEECCCCCCCEEEEEEECCCCCEEEEEEEECCEEEEEEEECCCCCCCCCCHHCCCCEEEEECCCHHHHH T ss_conf 85389975788782455766324335670785036899982776899986326888987400028970277435004766 Q ss_pred CCCHHHCCCCCC-------------------------------------------------------------------- Q ss_conf 003002314466-------------------------------------------------------------------- Q gi|254780617|r 107 VVDALGNPIDGK-------------------------------------------------------------------- 118 (509) Q Consensus 107 VvD~lG~PlDg~-------------------------------------------------------------------- 118 (509) ++||+.|||+-. T Consensus 82 iyDG~QRPL~~l~~~~~~~Fi~RGv~~p~l~~~~~w~F~P~v~~Gd~V~~Gd~~G~V~ET~~i~h~ilvPp~~~gg~~~~ 161 (584) T TIGR01043 82 IYDGVQRPLDVLKEKTGSDFIERGVDAPGLDRDKKWEFKPTVKEGDKVEGGDIIGVVQETSLIEHKILVPPNVEGGEIVE 161 (584) T ss_pred HCCCCCCCHHHHHHHHCCHHHHCCCCCCCCCCCCCCEECCEECCCCEECCCCEEEEECCCCCEEEEEEECCCCCCCEEEE T ss_conf 40122463689998620137665851688875345002221226864347615877637650334777357457875778 Q ss_pred ---------------------CCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHH Q ss_conf ---------------------44454441001013577311266001444160233101111568336552477886115 Q gi|254780617|r 119 ---------------------GPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTS 177 (509) Q Consensus 119 ---------------------g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~ 177 (509) ..+..-+.||+..+.| ..+...+..||.||.|++|+|+|+.||....|.|+.|+|||+ T Consensus 162 ~~~G~ftV~d~i~~~~~~G~~~~~~m~~~WPVR~~RP-~~~K~~P~~PL~tG~R~~Dt~F~~AKGGTAAiPGPFGsGKTV 240 (584) T TIGR01043 162 IAEGDFTVEDTIAVVEKDGEEEEIKMYQKWPVRKPRP-YKEKLEPEEPLVTGQRILDTLFPIAKGGTAAIPGPFGSGKTV 240 (584) T ss_pred EECCCEEEEEEEEEEECCCCCCEEEEEEECCCCCCCC-CCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCHH T ss_conf 7267314655689982278811024644077888887-435788898612150341010242467754578788885046 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC-------CCCCCEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 899999998863024667876069999516776789999985114-------6742115899728878889999988766 Q gi|254780617|r 178 IILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED-------RGALSYSIVVVASASDPAPMQLLAPFAG 250 (509) Q Consensus 178 l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~-------~~~~~~tvvv~a~a~~~~~~r~~ap~~a 250 (509) -. ++.++.++..++||++|||||+|+.+.++.|-+ ...|+||++|++|||.|.+.|-.+-|+| T Consensus 241 T~----------~~LAkws~a~ivvYiGCGERGNEMt~vl~~FP~l~dp~TG~pLm~RT~LiANTSNMPVAAREas~Y~G 310 (584) T TIGR01043 241 TQ----------QQLAKWSDADIVVYIGCGERGNEMTEVLEEFPELEDPKTGKPLMERTVLIANTSNMPVAAREASIYTG 310 (584) T ss_pred HH----------HHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCCCCCCCHHHHHCCEEEECCCCCCHHHHHHHHHHC T ss_conf 77----------77754304737999616887525899998487868863531565420533526787534401123211 Q ss_pred HHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC---CCCCCCEEEEEEE Q ss_conf 6666754204688479961647899999875453037875433375200124678888750454---4588735675310 Q gi|254780617|r 251 CAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSD---ALGAGSLTALPVI 327 (509) Q Consensus 251 ~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~---~~g~GSiT~~~~v 327 (509) +||||||||+|+||+|+.||.+|||+|+||||-.|.|+||.||||..+-+-++..|||||++.. ....||+|.+-.| T Consensus 311 iTiAEYfRD~GYdv~L~ADSTSRWAEA~ReisgRlEEMPgEEGyPaYLasrLaefYERAG~~~~l~~~~~~gsv~v~gAV 390 (584) T TIGR01043 311 ITIAEYFRDMGYDVALMADSTSRWAEALREISGRLEEMPGEEGYPAYLASRLAEFYERAGRVKTLGGEEREGSVTVVGAV 390 (584) T ss_pred CHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCEEEECCCCCCEEEEEEEECC T ss_conf 02533441168516887247521789998752021027787886246888887766514602231688744789999614 Q ss_pred ECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHH-------HHHHHHHHHHHHH Q ss_conf 036766443114668854061699615012210266543344412203434389887755-------6788879988767 Q gi|254780617|r 328 ETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVS-------GAVKGELAQYREM 400 (509) Q Consensus 328 ~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a-------~~lr~~laqy~El 400 (509) ..||||+|+|+..|+.-|.-=-.-||.+|+++.|||||||+.|.|-....++...+..+. ...-..|.+-.|| T Consensus 391 SPPGGDfsEPVt~~TlR~vkvFWaLD~~La~~RHfPaInW~~sySlY~d~~~~W~~~Nv~~~w~~~R~~a~~~L~~E~eL 470 (584) T TIGR01043 391 SPPGGDFSEPVTQNTLRIVKVFWALDAKLAQRRHFPAINWLQSYSLYVDSVQDWWAENVDPDWRELRDEAMDLLQKEAEL 470 (584) T ss_pred CCCCCCCCCCCCCCCEEEEEEECCCCHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 77557878862227447898730011788743789730201145776888999987400277999999999997515567 Q ss_pred HHHHHH-HCC-CCHHHHHHHHHHHHHHH-HHCCCCCCCC----CHHHHHHHHHHHHC-------CCCCCCCHHHHHHHHH Q ss_conf 879876-338-99899999999999999-9669998874----88999999999866-------9806679899999999 Q gi|254780617|r 401 SSFSKF-SSD-LDSSTQKFLSKGERLTE-LLKQPQFSPL----AMEEQVVMIFAGIS-------GCLDEVAVSQVRKFET 466 (509) Q Consensus 401 e~f~~f-gsd-lD~~t~~~l~rg~ri~e-~LkQ~~~~p~----~~~~qv~~l~a~~~-------G~ld~i~v~~I~~fe~ 466 (509) +.+.|+ |+| |++..+..|+-.+.|+| ||+|+.|++. +..+|.-+|-++.. -.-.++++++|...+. T Consensus 471 ~eIV~LVG~d~L~~~~~~~Le~ar~~re~fL~q~Af~~vD~y~~~~kq~~~l~~I~~~~~~~~~~~~~G~~~~~I~~~~g 550 (584) T TIGR01043 471 QEIVQLVGPDALPEKEKLILEVARLIREAFLQQNAFDEVDTYCSPQKQYRILKAIMKFNDEALEALEKGVPVKEILKLEG 550 (584) T ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 77875317756760245789999999998766520430103588688999999999999999999873886789863210 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC-C-HHHHHHHHHHHHHHHHHCC Q ss_conf 99999974159999988732798-9-8999999999999985159 Q gi|254780617|r 467 NFLSHMRVSSQDILEDIRKQKVL-T-DDIRSKLINEIKVFLEDFN 509 (509) Q Consensus 467 ~l~~~l~~~~~ei~~~I~~~~~l-~-de~~~~L~~~i~~~~~~Fs 509 (509) ... +.|...+.- + +..++++++.-+++.+.|. T Consensus 551 ~~~-----------~~~~r~k~~~~~~~~~~~~~~~~~~~~~~~~ 584 (584) T TIGR01043 551 RVK-----------EEIARVKYEEDNEKEEAKIDEIKEKIKKELK 584 (584) T ss_pred HHH-----------HHHHHHCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 357-----------7888722551045788999999999998619 No 48 >pfam00006 ATP-synt_ab ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits, the ATP synthase associated with flagella and the termination factor Rho. Probab=100.00 E-value=0 Score=543.06 Aligned_cols=213 Identities=49% Similarity=0.753 Sum_probs=207.3 Q ss_pred CCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCC Q ss_conf 16023310111156833655247788611589999999886302466787606999951677678999998511467421 Q gi|254780617|r 148 TGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALS 227 (509) Q Consensus 148 TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~ 227 (509) ||||+||+|+|||||||++|||++|+|||+|+.+++.||+. +++|||++||||++|+.+|++++++.++++ T Consensus 1 TGi~~ID~l~pig~GQr~~I~g~~g~GKt~l~~~i~~~~~~---------~~~~V~~~iGer~~ev~~~~~~~~~~~~~~ 71 (213) T pfam00006 1 TGIRAIDLLLPIGKGQRIGIFGGSGTGKTVLLGMIARNAKA---------DVVEVYVLIGERGREVAEFIEELLGEGALK 71 (213) T ss_pred CCCCEEECCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHH---------CCCEEEEECCCCHHHHHHHHHHHHCCCCCC T ss_conf 98400020166457888778789999889999999985661---------893599813777799999999752137665 Q ss_pred EEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 15899728878889999988766666675420468847996164789999987545303787543337520012467888 Q gi|254780617|r 228 YSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLE 307 (509) Q Consensus 228 ~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~E 307 (509) ||++|++++|+||.+|+++||+||++|||||++|+|||+++|||||||+||||||+++||||+++||||++||.|++|+| T Consensus 72 ~t~vv~~~~d~~~~~r~~~~~~a~~~AEyf~~~G~dVlvi~Dsltr~A~A~reis~~~g~~p~~~gyp~~~~~~~~~l~E 151 (213) T pfam00006 72 RTVVVAATSDEPPAERYLAPYTALTIAEYFRDQGKDVLLLLDSLTRFARALREISLLLGEPPGREGYPGSLFSDLARLLE 151 (213) T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 06999846889878999999999999999997699689983780599999876500147998545639178888799998 Q ss_pred HHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHH Q ss_conf 87504544588735675310036766443114668854061699615012210266543344412 Q gi|254780617|r 308 RAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVS 372 (509) Q Consensus 308 Ra~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~S 372 (509) |||+++ ++||||+||++++++||++||||++++||+||||+|||+||++|+||||||+.|+| T Consensus 152 Rag~~~---~~GSiT~l~tv~~~~~d~~dpi~~~~~~~~dg~ivLsr~la~~g~~PAId~~~S~S 213 (213) T pfam00006 152 RAGKVE---GGGSITALPTVLVPGGDITDPIPDNTISITDGQIVLSRELAERGIYPAIDILLSVS 213 (213) T ss_pred HHHCCC---CCCCEEEEEEEEECCCCCCCCHHHHHHHHCCEEEEECHHHHHCCCCCCCCCCCCCC T ss_conf 753468---88634788899804888675156765211355999968999679999728755789 No 49 >KOG1351 consensus Probab=100.00 E-value=0 Score=525.01 Aligned_cols=433 Identities=22% Similarity=0.342 Sum_probs=390.4 Q ss_pred HHCCCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEE--CCC--EEEEEEEECCCEEEEEEECCCCCCCC-CCEEEE Q ss_conf 8424555420103589998275899975888773747995--798--18999920598599998058668678-988897 Q gi|254780617|r 16 IKDFGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQF--SHG--VYGMALNLEVDNVGVVILGSYKEISE-GDIVKR 90 (509) Q Consensus 16 i~~~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~--~~g--~~G~Vi~l~~d~v~i~~l~~~~gI~~-G~~V~~ 90 (509) +.+|...+.+ .+-+|..|.|.++...-..-.++.|+|.+ ++| ..|.|+...++.+.+++|+.++||-. -+.|.+ T Consensus 13 ~~~~~~~prl-~y~tv~gvngplvild~vkfp~y~eiv~ltlpdgt~r~gqvlev~g~kavvqvfegtsgid~k~t~~ef 91 (489) T KOG1351 13 IENYTTEPRL-TYRTVSGVNGPLVILDKVKFPKYAEIVNLTLPDGSVRSGQVLEVSGEKAVVQVFEGTSGIDAKKTTVEF 91 (489) T ss_pred HHCCCCCCCE-EEEEECCCCCCEEEEECCCCCCHHHHEEEECCCCCEECCEEEEECCCEEEEEEECCCCCCCCCCCEEEE T ss_conf 6324567742-676531567756997525651204306776689871133179960873589995066665544226887 Q ss_pred CCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECC Q ss_conf 79878866794698300030023144664445444100101357731126600144416023310111156833655247 Q gi|254780617|r 91 TGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGD 170 (509) Q Consensus 91 tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~ 170 (509) ||+.+..||++++||||+|+.|+|||+++++-++++..+.+.|.||..|-.+.|.++|||.+||.|..|.||||++||+. T Consensus 92 tg~~lr~pvsedmlgrifngsgkpid~gp~vl~edyldi~gqpinp~~riypeemiqtgis~idvmnsiargqkipifsa 171 (489) T KOG1351 92 TGEILRTPVSEDMLGRIFNGSGKPIDKGPPVLAEDYLDINGQPINPYARIYPEEMIQTGISAIDVMNSIARGQKIPIFSA 171 (489) T ss_pred ECCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHHCCCCCCEEEC T ss_conf 43410365568776544268998268899757177611479808940001869998727327765667742674532333 Q ss_pred CCCCCHHHHHHHHHHHHHHCC------CCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHH Q ss_conf 788611589999999886302------46678760699995167767899999851146742115899728878889999 Q gi|254780617|r 171 RKTGKTSIILDTFLNQKSSHD------KGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQL 244 (509) Q Consensus 171 ~g~GKt~l~~~~I~nq~~~~~------~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~ 244 (509) +|..+..++. .|+.|++--. .....+..-.||+++|-.-.+.+.|.+.+++.|.|++++++.+.|+||..+|. T Consensus 172 aglphneiaa-qicrqaglvk~~~k~~~d~~~dnfaivfaamgvnmetarffk~dfeengsm~~v~lflnlandptieri 250 (489) T KOG1351 172 AGLPHNEIAA-QICRQAGLVKRPEKDVHDGHEDNFAIVFAAMGVNMETARFFKQDFEENGSMERVCLFLNLANDPTIERI 250 (489) T ss_pred CCCCHHHHHH-HHHHHCCCCCCCCCCCCCCCCCCEEEEEHHHCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHH T ss_conf 7898058999-998853711168866444667654776003156588889988668755870005778741789744320 Q ss_pred HHHHHHHHHHHHHHCC-CCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEE Q ss_conf 9887666666754204-688479961647899999875453037875433375200124678888750454458873567 Q gi|254780617|r 245 LAPFAGCAMGEYFRDN-GYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTA 323 (509) Q Consensus 245 ~ap~~a~aiAEyfr~~-G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~ 323 (509) +.|..|++.|||+.+| +||||+++.||+.||+|+||+|.+..|.|||+||||.+|..++.++||||++...+ ||||. T Consensus 251 itprlalt~aeflayq~ekhvlviltdmssya~alrevsaareevpgrrg~pgymytdlatiyeragrvegr~--gsitq 328 (489) T KOG1351 251 ITPRLALTTAEFLAYQCEKHVLVILTDMSSYADALREVSAAREEVPGRRGYPGYMYTDLATIYERAGRVEGRN--GSITQ 328 (489) T ss_pred CCHHHHHHHHHHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCEEHHHHHHHHHHHCCCCCCC--CCEEE T ss_conf 2403566689999876347389998330778999999777664278867888610024799998733333678--85464 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCC-----HHHHHHHHHHHHHHHHHH Q ss_conf 531003676644311466885406169961501221026654334441220343438-----988775567888799887 Q gi|254780617|r 324 LPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQV-----KAMKQVSGAVKGELAQYR 398 (509) Q Consensus 324 ~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~-----~~~~~~a~~lr~~laqy~ 398 (509) +|+.++|.||+||||||.+.+||+||||.||.|+++.+||+|||+.|+||+|.++.. +.|-.++++|..-||--+ T Consensus 329 ipiltmpnddithpipdltgyitegqiyvdrqlhnr~iyppinvlpslsrlmksaigegmtr~dh~dvsnqlya~yaigk 408 (489) T KOG1351 329 IPILTMPNDDITHPIPDLTGYITEGQIYVDRQLHNRQIYPPINVLPSLSRLMKSAIGEGMTRKDHSDVSNQLYACYAIGK 408 (489) T ss_pred EEEEECCCCCCCCCCCCCCCEEECCEEEEEHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 13672488766787876455231345887232332567897011666899999876578643532146778899988431 Q ss_pred HHHHHHHH-HCC-CCHHHHHHHHHHHHHHH-HHCCCCCCCCCHHHHHHHHHHHHCCC Q ss_conf 67879876-338-99899999999999999-96699988748899999999986698 Q gi|254780617|r 399 EMSSFSKF-SSD-LDSSTQKFLSKGERLTE-LLKQPQFSPLAMEEQVVMIFAGISGC 452 (509) Q Consensus 399 Ele~f~~f-gsd-lD~~t~~~l~rg~ri~e-~LkQ~~~~p~~~~~qv~~l~a~~~G~ 452 (509) +++++... |.+ |+.++.-.|+.-.+++. |..|..|+..++.+..-+-|.+.+-| T Consensus 409 dvqamkavvgeeals~ed~l~lefl~kfek~fi~qg~yenrtvfesldi~w~llrif 465 (489) T KOG1351 409 DVQAMKAVVGEEALSSEDLLYLEFLDKFEKNFISQGAYENRTVFESLDIAWQLLRIF 465 (489) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 189999874410068224389999999998776327633203777777789999873 No 50 >KOG1350 consensus Probab=100.00 E-value=0 Score=533.70 Aligned_cols=426 Identities=24% Similarity=0.402 Sum_probs=383.3 Q ss_pred EEEEEEEEEEECCEEEEE---CCCCCCCCCEE-EECCC--EEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEE Q ss_conf 010358999827589997---58887737479-95798--1899992059859999805866867898889779878866 Q gi|254780617|r 25 FSEIGRVLSIGDGIARVY---GLNNIRAGEMV-QFSHG--VYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVP 98 (509) Q Consensus 25 ~~~~G~V~~V~~giv~v~---GL~~a~~GElv-~~~~g--~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vp 98 (509) ....|++..|.|.++.|. ++| ..++.+. .-.+. +.-...++.++.|.++.++.++|+..|.+|..||.+++|| T Consensus 49 ~~~~G~i~avIGavvDv~F~~~~P-~ilNaLev~~~~~~lvlEV~qhlG~n~VR~iAMdgTEGLvRG~~VlDtG~Pi~ip 127 (521) T KOG1350 49 KKNKGRIVAVIGAVVDVQFEEGLP-PILNALEVKGRDTRLVLEVAQHLGENTVRTIAMDGTEGLVRGQKVLDTGYPISIP 127 (521) T ss_pred CCCCCCEEEEEEEEEEEECCCCCC-CHHHCEEECCCCCEEEEHHHHHHCCCEEEEEEECCCHHHHCCCCCCCCCCCEEEE T ss_conf 246772799870057875388885-3321046437886054178777376727899843731122385202479963563 Q ss_pred CCHHHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHH Q ss_conf 79469830003002314466444544410010135773112660014441602331011115683365524778861158 Q gi|254780617|r 99 VGLELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSI 178 (509) Q Consensus 99 vG~~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l 178 (509) ||++.|||++|.+|+|+|..||+.+..+.|++..+|...+-..-.|.|.||||++|.|.|..||.++++||++|+|||++ T Consensus 128 VG~~tLGRI~NViGePiDerGpi~s~~~~~IHaeaP~f~e~s~~~eIl~TGIKVvDLLAPYakGGKIGLFGGAGVGKTVl 207 (521) T KOG1350 128 VGPETLGRIMNVIGEPIDERGPIKSKKYSPIHAEAPEFVEMSVEQEILVTGIKVVDLLAPYAKGGKIGLFGGAGVGKTVL 207 (521) T ss_pred CCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCHHHHHHCCEEEEEECCCCCCCEEEEECCCCCCCEEE T ss_conf 17888766887417855445885533356422477067651431788762434344213202487344422577661332 Q ss_pred HHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCC------CEEEEEEECCCCCHHHHHHHHHHHHH Q ss_conf 999999988630246678760699995167767899999851146742------11589972887888999998876666 Q gi|254780617|r 179 ILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGAL------SYSIVVVASASDPAPMQLLAPFAGCA 252 (509) Q Consensus 179 ~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~------~~tvvv~a~a~~~~~~r~~ap~~a~a 252 (509) .++.|-|-+..+.+ ++||+++|||.||..+++.++.+.|.. .+..+|+.++++||+.|.+...+|++ T Consensus 208 ImELINNiAKaHGG-------ySVF~GvGERTREGNDLY~EM~E~gVI~l~~~~SKvaLV~GQMNePPGARaRV~LTgLT 280 (521) T KOG1350 208 IMELINNIAKAHGG-------YSVFAGVGERTREGNDLYHEMIESGVINLEGETSKVALVYGQMNEPPGARARVALTGLT 280 (521) T ss_pred HHHHHHHHHHHCCC-------EEEEECCCCCCCCCCHHHHHHHHCCEEECCCCCCEEEEEEECCCCCCCCEEEEEEECCC T ss_conf 99998788886288-------67863145333443078999986381212577633789862157998730145431115 Q ss_pred HHHHHHC-CCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCC Q ss_conf 6675420-468847996164789999987545303787543337520012467888875045445887356753100367 Q gi|254780617|r 253 MGEYFRD-NGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQV 331 (509) Q Consensus 253 iAEyfr~-~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~ 331 (509) +|||||| +|+|||+++|+++||.+|..|+|.+|||.|+..||+|++-..+..+.||....++ ||||+++.|++|+ T Consensus 281 vAEYFRD~egQDVLLFIDNIFRFtQAGSEVSALLGRiPSAVGYQPTLaTdMG~mQERITtTkk----GSiTSvQAvYVPA 356 (521) T KOG1350 281 VAEYFRDQEGQDVLLFIDNIFRFTQAGSEVSALLGRIPSAVGYQPTLATDMGTMQERITTTKK----GSITSVQAVYVPA 356 (521) T ss_pred HHHHHHCCCCCEEEEEEHHHHHHHCCCHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHCCCC----CCEEEEEEEEEEH T ss_conf 999863446666898612346663152489998635841025676320111356676300456----7556788998510 Q ss_pred CCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-HCC Q ss_conf 66443114668854061699615012210266543344412203-4343898877556788879988767879876-338 Q gi|254780617|r 332 NDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVG-SAAQVKAMKQVSGAVKGELAQYREMSSFSKF-SSD 409 (509) Q Consensus 332 ~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~-~~~~~~~~~~~a~~lr~~laqy~Ele~f~~f-gsd 409 (509) ||+|||.|..+++++|..-+|||.+++.|+|||+||+.|.||+| +.+....|..+|..+...|+.|+.|++++.+ |-| T Consensus 357 DDLtDPaPattFaHLDAttVLSR~iaelgIYPAVDPLDStSrimdp~ivG~eHY~vA~~Vqk~LQ~YKsLQDIIAILGmD 436 (521) T KOG1350 357 DDLTDPAPATTFAHLDATTVLSRGIAELGIYPAVDPLDSTSRIMDPNIVGEEHYNVARGVQKTLQDYKSLQDIIAILGMD 436 (521) T ss_pred HCCCCCCCCCEEECCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 10589875420000332345433367607766658766534236844145677789999999999888898899995730 Q ss_pred -CCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf -99899999999999999966999887488999999999866980667989999999999999 Q gi|254780617|r 410 -LDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSH 471 (509) Q Consensus 410 -lD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~ 471 (509) |+++++-.++|+++|..||.|| .|++-.|.+..|.|..++ +.|+.|...|-.. T Consensus 437 ELSEeDkLTV~RARKiqRFLSQP--------F~VAEvFTG~~GklV~l~-~ti~gF~~iL~Ge 490 (521) T KOG1350 437 ELSEEDKLTVARARKIQRFLSQP--------FQVAEVFTGHPGKLVPLE-ETIRGFKAILEGE 490 (521) T ss_pred HHCHHHHHHHHHHHHHHHHHCCC--------HHHHHHHCCCCCCEECHH-HHHHHHHHHHCCC T ss_conf 10301235499999999987385--------024542138887243299-9999999986576 No 51 >cd01128 rho_factor Transcription termination factor rho is a bacterial ATP-dependent RNA/DNA helicase. It is a homohexamer. Each monomer consists of an N-terminal domain of the OB fold, which is responsible for binding to cysteine rich nucleotides. This alignment is of the C-terminal ATP binding domain. Probab=100.00 E-value=0 Score=523.92 Aligned_cols=243 Identities=19% Similarity=0.261 Sum_probs=225.9 Q ss_pred HCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCC Q ss_conf 41602331011115683365524778861158999999988630246678760699995167767899999851146742 Q gi|254780617|r 147 STGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGAL 226 (509) Q Consensus 147 ~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~ 226 (509) +.|+|+||+|+|||||||+|||||+|||||+| +.+|.|+..++ +.+++|||++||||+|||.+|++.++ T Consensus 1 d~~~r~ID~l~pigkGQR~gI~g~~gvGKT~L-l~~i~~~~~~~-----~~~~~~v~~lIGER~rEv~e~~~~~~----- 69 (249) T cd01128 1 ELSTRVVDLFAPIGKGQRGLIVAPPKAGKTTL-LQSIANAITKN-----HPEVYLIVLLIDERPEEVTDMQRSVK----- 69 (249) T ss_pred CCCCEEEEEECCCCCCCEEEEECCCCCCHHHH-HHHHHHHHHHH-----CCCEEEEEEEEECCHHHHHHHHHHHC----- T ss_conf 97410154324616786778878999889999-99999999985-----89849999997165799999999716----- Q ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 11589972887888999998876666667542046884799616478999998754530378754333752001246788 Q gi|254780617|r 227 SYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLL 306 (509) Q Consensus 227 ~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ 306 (509) +.+|++|+|+||.+|++++|+|+++|||||++|+|||++||||||||+||||+++..|||++ .|||+++||.+++++ T Consensus 70 --~~vv~st~d~~p~~r~~~a~~a~~iAEyfr~~G~dVLl~~DslTR~A~A~rEis~~~ge~~~-~g~~~~~~~~~~~~~ 146 (249) T cd01128 70 --GEVIASTFDEPPERHVQVAEMVLEKAKRLVEHGKDVVILLDSITRLARAYNTVVPPSGKILS-GGVDANALHKPKRFF 146 (249) T ss_pred --EEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHCCCCCCCCC-CCCCHHHHHHHHHHH T ss_conf --16999569999899999999999999999987997799951488999998663024698788-997913545548999 Q ss_pred HHHHHCCCCCCCCCEEEEEEEECC-CCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHH Q ss_conf 887504544588735675310036-7664431146688540616996150122102665433444122034343898877 Q gi|254780617|r 307 ERAAKMSDALGAGSLTALPVIETQ-VNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQ 385 (509) Q Consensus 307 ERa~~~~~~~g~GSiT~~~~v~~~-~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~ 385 (509) ||++++.+ +||||++++|+++ +||+++||++|++||+||||||||+||++|+||||||+.|+||++.....+.|++ T Consensus 147 ~~a~~~e~---~GSiT~~~tvlve~~d~~d~~I~~~~~~i~Dg~IvLsr~la~~g~yPAIDv~~S~SR~~~~i~~~~~~~ 223 (249) T cd01128 147 GAARNIEE---GGSLTIIATALVDTGSRMDDVIFEEFKGTGNMELVLDRRLAERRIFPAIDILKSGTRKEELLLDPEELQ 223 (249) T ss_pred HHCCCCCC---CCEEEEEEEEEECCCCCCCCHHHHHHHHCCCEEEEECHHHHHCCCCCCCCCCCCCCCCCHHHCCHHHHH T ss_conf 85267889---953799888876377666726899986517749996399997599986463346210340109999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 556788879988767879876 Q gi|254780617|r 386 VSGAVKGELAQYREMSSFSKF 406 (509) Q Consensus 386 ~a~~lr~~laqy~Ele~f~~f 406 (509) .+..+|..|+.|+|+|.+-.+ T Consensus 224 ~a~~~R~~l~~y~~~e~m~~l 244 (249) T cd01128 224 RMWLLRRVLSDMDPIEAMEFL 244 (249) T ss_pred HHHHHHHHHHCCCHHHHHHHH T ss_conf 999999999668778999999 No 52 >COG1155 NtpA Archaeal/vacuolar-type H+-ATPase subunit A [Energy production and conversion] Probab=100.00 E-value=0 Score=503.74 Aligned_cols=410 Identities=21% Similarity=0.332 Sum_probs=351.5 Q ss_pred EEEEEEEECCEEEEECCCCCCCCCEEEEC-CCEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHHHHHC Q ss_conf 35899982758999758887737479957-98189999205985999980586686789888977987886679469830 Q gi|254780617|r 28 IGRVLSIGDGIARVYGLNNIRAGEMVQFS-HGVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLELLGR 106 (509) Q Consensus 28 ~G~V~~V~~giv~v~GL~~a~~GElv~~~-~g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lLGR 106 (509) .|+|.+|.|.+|.+.|+.++..+|+|... .+..|+|+.+++|.+.+++|+++.||++|++|.+||++++|-.|++||+. T Consensus 2 ~G~i~~isGp~V~a~gm~~~~my~~v~Vg~~~L~gEiI~i~gd~a~iQVyE~T~Gi~~Ge~V~~Tg~pLsvELGPGll~~ 81 (588) T COG1155 2 MGKIIRISGPVVVAEGMEGAKMYDVVKVGEMGLIGEIIRIEGNRATIQVYEDTAGIRPGEKVENTGRPLSVELGPGLLKS 81 (588) T ss_pred CCEEEEEECCEEEEECCCCCCEEEEEEECCCCEEEEEEEEECCEEEEEEEEECCCCCCCCEEECCCCCEEEEECCCHHHH T ss_conf 74699987779998057677417899975775058999985876899999605899999832127982278747547764 Q ss_pred CCCHHHCCCCCC--------------CCC--------------------------------------------------- Q ss_conf 003002314466--------------444--------------------------------------------------- Q gi|254780617|r 107 VVDALGNPIDGK--------------GPI--------------------------------------------------- 121 (509) Q Consensus 107 VvD~lG~PlDg~--------------g~i--------------------------------------------------- 121 (509) ++|++.||||-. +++ T Consensus 82 IyDGiQrPL~~i~e~sg~Fi~RGv~~p~Ldr~~kW~F~P~~~~Gd~V~~GdvlGtV~Et~~i~~imvpp~~~~~~v~~i~ 161 (588) T COG1155 82 IYDGIQRPLDVIKETSGDFIARGLNPPALDRKKKWDFVPAVKKGDTVYPGDVLGTVQETSLITHRIMVPPGVSGKVTWIA 161 (588) T ss_pred HHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEEECCCCCEEEEEECCCCCCEEEEEEE T ss_conf 44311474799998740175457889988855520003466669875268548986067735799837887762899995 Q ss_pred -----------------------CCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHH Q ss_conf -----------------------544410010135773112660014441602331011115683365524778861158 Q gi|254780617|r 122 -----------------------KCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSI 178 (509) Q Consensus 122 -----------------------~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l 178 (509) ..-.+||+..+.| .-++-++.+||.||.|+||+|+|+.||....|.|+.|+|||++ T Consensus 162 ~~G~ytv~d~ia~v~~~~g~~~~~m~~~WPVR~~rp-~~eKl~p~~Pl~TGqRviDt~FPvaKGGTaaiPGpFGsGKTV~ 240 (588) T COG1155 162 EEGEYTVEDVIATVSTEGGEVDVQMMTTWPVRKARP-VKRKLPPEIPLVTGQRVIDTLFPVAKGGTAAVPGPFGSGKTVS 240 (588) T ss_pred CCCCCEEEEEEEEEECCCCEEEEEEEEECCCCCCCC-CCCCCCCCCCCCCCCEEEHHHCCCCCCCCCCCCCCCCCCCEEH T ss_conf 588715667999995279748788887266467764-1024898775212542310102111576512668888983752 Q ss_pred HHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC-------CCCCCEEEEEEECCCCCHHHHHHHHHHHH Q ss_conf 99999998863024667876069999516776789999985114-------67421158997288788899999887666 Q gi|254780617|r 179 ILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED-------RGALSYSIVVVASASDPAPMQLLAPFAGC 251 (509) Q Consensus 179 ~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~-------~~~~~~tvvv~a~a~~~~~~r~~ap~~a~ 251 (509) - ++.++.++..++||++||||++|+.++..++-+ .-.|++|+++++|++.|.+.|+.+.|+|. T Consensus 241 q----------h~laK~sdadiVVyigCGERGNEmtevL~eFPeL~Dp~tg~~lm~RT~liaNTSnMPVAAREasIYtGi 310 (588) T COG1155 241 Q----------HTLSKLADGDIVIYVGCGERGNEMTEVLQEFPELKDPNTGQPLMDRTVLIANTSNMPVAAREASIYTGI 310 (588) T ss_pred H----------HHHHHHCCCCEEEEEECCCCCCHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 4----------356564068889998157765238999986866668999971200036762576661888664353223 Q ss_pred HHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCC-C--CCEEEEEEEE Q ss_conf 666754204688479961647899999875453037875433375200124678888750454458-8--7356753100 Q gi|254780617|r 252 AMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALG-A--GSLTALPVIE 328 (509) Q Consensus 252 aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g-~--GSiT~~~~v~ 328 (509) ++||||||+|+||+++.||.+|||+|+||||..++|+|+.+|||..+-.-+++.+||||+.....+ . ||+|.++.|. T Consensus 311 TiaEY~RDmGy~v~lmADSTSRWAEAlREisgRleEmPgeegyPaYL~srlA~fYERaG~v~~~~~~~r~GsvtV~gaVS 390 (588) T COG1155 311 TIAEYYRDMGYDVALMADSTSRWAEALREISGRLEEMPGEEGYPAYLGSRLAEFYERAGRVRLVSPEERFGSITVIGAVS 390 (588) T ss_pred HHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEEECCCCCCEEEEEEECCCC T ss_conf 19998876302567750138999999998733500288644441678999999998547234227886258999825778 Q ss_pred CCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHH-------HHHHHHHHHHHHHHHHHH Q ss_conf 3676644311466885406169961501221026654334441220343438988-------775567888799887678 Q gi|254780617|r 329 TQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAM-------KQVSGAVKGELAQYREMS 401 (509) Q Consensus 329 ~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~-------~~~a~~lr~~laqy~Ele 401 (509) .+++|+|+|+..|+..++---.-||++||++.||||||++.|.|..-........ .++...+...|.+-.|++ T Consensus 391 PpGGdfSEPVtq~Tlriv~vFw~Ld~~la~~rhfPaInwl~syS~Y~~~~~~~~~~~v~~~~~~~r~~a~~~Lq~e~elq 470 (588) T COG1155 391 PPGGDFSEPVTQNTLRVVRVFWALDAALANRRHFPSINWLNSYSLYTEDLRSWYDENVSPEWGALRDQAMEILQRESELQ 470 (588) T ss_pred CCCCCCCCCCCHHEEEEEEEECCCCHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99998476521100355664200144453232376567377799989888877640258238999999999988779999 Q ss_pred HHHHH-HCC-CCHHHHHHHHHHHHHHH-HHCCCCCCCCC----HHHHHHHHHHH Q ss_conf 79876-338-99899999999999999-96699988748----89999999998 Q gi|254780617|r 402 SFSKF-SSD-LDSSTQKFLSKGERLTE-LLKQPQFSPLA----MEEQVVMIFAG 448 (509) Q Consensus 402 ~f~~f-gsd-lD~~t~~~l~rg~ri~e-~LkQ~~~~p~~----~~~qv~~l~a~ 448 (509) .+.+. |.| +++..+.++..++.|++ ||+|+.|++.+ +..|..++-+. T Consensus 471 eiv~lVG~eal~e~~~~il~va~~ire~fLqQnafd~vD~~~~~~kq~~mm~~i 524 (588) T COG1155 471 EIVQLVGYDALPEKEKSILDVARIIREDFLQQNAFDEIDAYCSLRKQYLMLKAI 524 (588) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHH T ss_conf 999985742366288899999999998887651567310138889999999999 No 53 >TIGR01041 ATP_syn_B_arch ATP synthase archaeal, B subunit; InterPro: IPR005724 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. A-ATPases (or A1A0-ATPase) (3.6.3.14 from EC) are found exclusively in Archaea and display a close resemblance in structure and subunit composition with V-ATPases, although their function in both ATP synthesis and ATP hydrolysis is closer to that of F-ATPases . A-ATPases are composed of two linked complexes: the A1 complex consisting of seven subunits contains the catalytic core that synthesizes/hydrolyses ATP, while the A0 complex consisting of at least two subunits forms the membrane-spanning pore . The rotary motor in A-ATPases is composed of only two subunits, the stator subunit I and the rotor subunit C . A-ATPases may have arisen as an adaptation to the different cellular needs and the more extreme environmental conditions faced by Archaeal species. This entry represents the beta subunit from the A1 complex of A-ATPases. The A1 complex contains three copies each of subunits alpha (or A) (IPR005726 from INTERPRO), and beta (or B), both of which form the headpiece that participates in nucleotide binding. However, only the alpha subunit is catalytic, the beta subunit being regulatory in function . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0046933 hydrogen ion transporting ATP synthase activity rotational mechanism, 0046961 hydrogen ion transporting ATPase activity rotational mechanism, 0015986 ATP synthesis coupled proton transport, 0016021 integral to membrane, 0016469 proton-transporting two-sector ATPase complex. Probab=100.00 E-value=0 Score=501.81 Aligned_cols=434 Identities=25% Similarity=0.388 Sum_probs=396.1 Q ss_pred EEEEEEEECCEEEEECCCCCCCCCEEEEC--CC--EEEEEEEECCCEEEEEEECCCCCCCCCC-EEEECCCEEEEECCHH Q ss_conf 35899982758999758887737479957--98--1899992059859999805866867898-8897798788667946 Q gi|254780617|r 28 IGRVLSIGDGIARVYGLNNIRAGEMVQFS--HG--VYGMALNLEVDNVGVVILGSYKEISEGD-IVKRTGRIVDVPVGLE 102 (509) Q Consensus 28 ~G~V~~V~~giv~v~GL~~a~~GElv~~~--~g--~~G~Vi~l~~d~v~i~~l~~~~gI~~G~-~V~~tg~~~~vpvG~~ 102 (509) +-.|..|.|+++.|.+...+.+.|+|++. +| ..|.|+.-..+.+.+.+|..+.++.... .|..+|+.++++++.+ T Consensus 2 y~~~~~~~GPl~~v~~~~~~~y~e~v~~~~~~G~~~~G~vl~~~~~~~~~~~~~g~~~~~~~~~~v~~~G~~~~~~~~~~ 81 (458) T TIGR01041 2 YKTITEIKGPLVIVEGVEPVAYDEIVEIETPDGEKRRGQVLDSSEGLAVVQVFEGTTGLDKEATKVRFLGETLKLPVSED 81 (458) T ss_pred CCCCCCCCCCEEEEECCCCCCCEEEEEEECCCCCCCCCCEEECCCCEEEEEEECCHHCCCCCCCEEEEECCEEEEHHHHH T ss_conf 65410103756898424321210015664057874234035303765798852111011223417898413024201078 Q ss_pred HHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHH Q ss_conf 98300030023144664445444100101357731126600144416023310111156833655247788611589999 Q gi|254780617|r 103 LLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDT 182 (509) Q Consensus 103 lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~ 182 (509) +||||+++.|+|+||++.+..++.+.+.+.+.||..|..+.+.++|||.+||++.++.||||+.||+++|..+..|+. . T Consensus 82 ~lGr~~~g~G~~~d~GP~~~~~~~~~~~G~~~nP~~r~~P~~fiqtG~s~idG~n~l~rG~kl~~fs~sGlP~~~la~-q 160 (458) T TIGR01041 82 LLGRILNGSGEPIDGGPEIVPDERRDINGAALNPAAREYPEEFIQTGISAIDGMNTLVRGQKLPIFSGSGLPHNELAA-Q 160 (458) T ss_pred HHHHHHHCCCCCCCCCCCCCCCCHHCCCCCCCCCHHHCCCHHHHHHCHHHHHHHHHHHCCCCCCEECCCCCCHHHHHH-H T ss_conf 888885026775567765243200023565458432204157875110243212454325421100036887778999-9 Q ss_pred HHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH-CCC Q ss_conf 9998863024667876069999516776789999985114674211589972887888999998876666667542-046 Q gi|254780617|r 183 FLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFR-DNG 261 (509) Q Consensus 183 I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr-~~G 261 (509) |..|+... ......-+||+++|-+..|...|.+.+++.|+++|.+++.+.+++|+..|.++|..|++.|||+. ++. T Consensus 161 ia~qa~v~---~~~~~favvf~a~G~~~~~~~~f~~~f~~tGal~r~~~~~~la~~P~~~~~~tP~~alt~aeyla~~~d 237 (458) T TIGR01041 161 IARQAKVR---GEESEFAVVFAAMGITYEEANFFMKDFERTGALERAVVFLNLADDPAVERIVTPRMALTLAEYLAFEKD 237 (458) T ss_pred HHHHHCCC---CCCCCEEEEEHHHCCCHHHHHHHHHHHHHCCCHHHHEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 88631012---355321433010224368899999988640432110023320267311233103678889998763067 Q ss_pred CCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHH Q ss_conf 88479961647899999875453037875433375200124678888750454458873567531003676644311466 Q gi|254780617|r 262 YHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTN 341 (509) Q Consensus 262 ~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~ 341 (509) .|||+++.|||+|++|+|||+.+..|.|||+||||.+|..++.++||||+..... ||||.+|++.+|.||+||||||. T Consensus 238 ~hvl~il~d~tny~ealr~~~aar~evPGr~GyPGy~y~dla~~yeraG~~~G~~--G~~tq~Pil~mP~dd~thPiPdl 315 (458) T TIGR01041 238 MHVLVILTDMTNYCEALREISAAREEVPGRRGYPGYMYTDLATLYERAGRVKGKK--GSITQIPILTMPGDDITHPIPDL 315 (458) T ss_pred CEEEEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCEEECCC--CCEEEEEEEECCCCCCCCCCCCC T ss_conf 6466405557789999987643444078888987237888999987506021476--53453003425887765777643 Q ss_pred HHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCC-----CHHHHHHHHHHHHHHHHHHHHHHHHHH-HCC-CCHHH Q ss_conf 88540616996150122102665433444122034343-----898877556788879988767879876-338-99899 Q gi|254780617|r 342 VISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQ-----VKAMKQVSGAVKGELAQYREMSSFSKF-SSD-LDSST 414 (509) Q Consensus 342 ~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~-----~~~~~~~a~~lr~~laqy~Ele~f~~f-gsd-lD~~t 414 (509) +.+||+|||+|+|+|+.+|+||+||++.|+||++.... .+.|++++.++...|++-+++..+... |.+ |++.+ T Consensus 316 ~GyiteGqi~~~r~l~~~Gi~PP~~~l~slsrl~~~G~G~G~tr~dh~~~~~q~ya~ya~~~~~r~l~~~vG~~~l~~~d 395 (458) T TIGR01041 316 TGYITEGQIVLSRELHRKGIYPPINVLPSLSRLMKDGIGEGKTREDHKDVSDQLYAAYAEGRDLRDLVAIVGEEALSERD 395 (458) T ss_pred CCEEECCEEEEEEHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 33011353467302651789881678889999974367888742458999988887675245688899986024555778 Q ss_pred HHHHHHHHHHHH-HHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 999999999999-966999887488999999999866980667989999999999999 Q gi|254780617|r 415 QKFLSKGERLTE-LLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSH 471 (509) Q Consensus 415 ~~~l~rg~ri~e-~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~ 471 (509) ++.|+.+..++. |++|...+..++++.+-+. |..|.-+|-+.+....+.++.. T Consensus 396 ~~~l~fa~~fe~~f~~qg~~~~r~i~~tl~~~----W~~l~~lP~~~l~~~~~~~i~~ 449 (458) T TIGR01041 396 RKYLKFADEFERKFVKQGRKEDRSIEETLDLG----WELLSVLPESELKRVKEELIEK 449 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHH----HHHHHHCCHHHHHHHHHHHHHH T ss_conf 99999999999999863332010178899888----9999851288998999999988 No 54 >TIGR01040 V-ATPase_V1_B V-type ATPase, B subunit; InterPro: IPR005723 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release . V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c, c d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins . This entry represents subunit B from the V1 complex of V-ATPases. There are three copies each of subunits A (IPR005725 from INTERPRO) and B, both of which participate in nucleotide binding. However, only subunit A is catalytic for ATP hydrolysis, subunit B being noncatalytic , . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0008553 hydrogen-exporting ATPase activity phosphorylative mechanism, 0015988 energy coupled proton transport against electrochemical gradient, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=479.43 Aligned_cols=422 Identities=23% Similarity=0.337 Sum_probs=384.6 Q ss_pred EEEEEEEECCEEEEECCCCCCCCCEEEEC--CC--EEEEEEEECCCEEEEEEECCCCCCCC-CCEEEECCCEEEEECCHH Q ss_conf 35899982758999758887737479957--98--18999920598599998058668678-988897798788667946 Q gi|254780617|r 28 IGRVLSIGDGIARVYGLNNIRAGEMVQFS--HG--VYGMALNLEVDNVGVVILGSYKEISE-GDIVKRTGRIVDVPVGLE 102 (509) Q Consensus 28 ~G~V~~V~~giv~v~GL~~a~~GElv~~~--~g--~~G~Vi~l~~d~v~i~~l~~~~gI~~-G~~V~~tg~~~~vpvG~~ 102 (509) +-.|..|.|+++.+.-.......|+|.+. ++ ..|.|+....+...+++|+.+.||-. .+.+..||+.+++||+++ T Consensus 2 y~~~~~v~GPlv~l~~v~~~~~~eiv~~~l~~g~~r~Gqvle~~~~~a~vqvfeGt~G~d~~~~~~~~~G~~l~~~~s~~ 81 (464) T TIGR01040 2 YKTVSGVNGPLVILENVKFPKFAEIVNLTLPDGTVRKGQVLEVSGEKAVVQVFEGTSGIDAKKTTVEFTGDILRLPVSED 81 (464) T ss_pred CCEECCCCCCEEEEHHCCCCCCEEEEEEEECCCCCCCCEEEEEECCCEEEEEECCCCCCCCCCCEEEECCHHHHHHHHHH T ss_conf 62000046644230010551200046676167861233057760772089982476441222314554210231122045 Q ss_pred HHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHH Q ss_conf 98300030023144664445444100101357731126600144416023310111156833655247788611589999 Q gi|254780617|r 103 LLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDT 182 (509) Q Consensus 103 lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~ 182 (509) +||||+|+.|.|||..+++-++++..+.+.+.||..|..+.|.++|||.+||.|..|.||||++||+.+|..+..++. . T Consensus 82 mlGr~f~GsG~~id~gP~~l~~~y~d~~G~~inP~~r~yP~em~~tG~s~id~~~si~rGqkiP~fsa~GlPhne~aa-q 160 (464) T TIGR01040 82 MLGRVFNGSGKPIDKGPKVLAEDYLDINGQPINPQARIYPEEMIQTGISAIDVMNSIARGQKIPLFSAAGLPHNEIAA-Q 160 (464) T ss_pred HHHHHCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHHCCCCCCEEECCCCCHHHHHH-H T ss_conf 310110367752235653103544114786458621147388985020256656665436533123047886356788-8 Q ss_pred HHHHHHHC---CCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 99988630---246678760699995167767899999851146742115899728878889999988766666675420 Q gi|254780617|r 183 FLNQKSSH---DKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRD 259 (509) Q Consensus 183 I~nq~~~~---~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~ 259 (509) |+.|++-- ......++...||+++|-.-...+.|...+++.|.|++++++.+.++||..+|.++|..|++.|||+.+ T Consensus 161 i~rq~~lv~~~~~~~~~~~f~~vf~amGvn~e~a~ff~~df~~~G~~~~~~lflnlandPt~eri~tPr~al~~ae~lay 240 (464) T TIGR01040 161 ICRQAGLVKKDVKDGHEDNFAIVFAAMGVNLETARFFKQDFEENGSMERVTLFLNLANDPTIERIITPRLALTTAEYLAY 240 (464) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEEEHHCCHHHHHHHHHHHHHHCCCHHHEEEEEEECCCCCCHHHHCHHHHHHHHHHHHH T ss_conf 88877888887640223431565200101057889998867534771013566541478530122000367777877765 Q ss_pred -CCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCH Q ss_conf -4688479961647899999875453037875433375200124678888750454458873567531003676644311 Q gi|254780617|r 260 -NGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYI 338 (509) Q Consensus 260 -~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i 338 (509) .++|||+++.|++.||+|+||+|.+..|.|||+||||.+|..++.++||||+..... ||||.+|++++|.||+|||| T Consensus 241 ~~~~hvl~iltd~ssya~alrevsaareevPgrrG~PGymy~dl~~iyeraGrv~Gr~--Gsitq~PiltmPnddithPi 318 (464) T TIGR01040 241 ECEKHVLVILTDMSSYADALREVSAAREEVPGRRGYPGYMYTDLATIYERAGRVEGRN--GSITQIPILTMPNDDITHPI 318 (464) T ss_pred HCCCEEEEEEECHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCC--CCEEEEEEEECCCCCCCCCC T ss_conf 2476079985033579999987765554067778887325777899876304211676--64242002425764456776 Q ss_pred HHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCC-----HHHHHHHHHHHHHHHHHHHHHHHHHH-HCC-CC Q ss_conf 466885406169961501221026654334441220343438-----98877556788879988767879876-338-99 Q gi|254780617|r 339 PTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQV-----KAMKQVSGAVKGELAQYREMSSFSKF-SSD-LD 411 (509) Q Consensus 339 ~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~-----~~~~~~a~~lr~~laqy~Ele~f~~f-gsd-lD 411 (509) ||.+.+||+||||+||.|+++.+||+||++.|+||+|.++.. +.|..+++++...|+--+++.++... |.+ |+ T Consensus 319 PdltGyitegqiy~dr~l~~~~iyPPi~vlPslsrlmk~aiG~gmtr~dh~~vs~qly~~~~~~~d~~amkavvG~eal~ 398 (464) T TIGR01040 319 PDLTGYITEGQIYVDRQLHNRQIYPPINVLPSLSRLMKSAIGEGMTRKDHSDVSNQLYAKYAIGKDVAAMKAVVGEEALS 398 (464) T ss_pred CCCCCEEECCEEEEEHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 53211022242455101430557883235566899988752353101247889999998888887899998874022214 Q ss_pred HHHHHHHHHHHHHHH-HHCCCCCCCCCHHHHHHHHHHHHCCC Q ss_conf 899999999999999-96699988748899999999986698 Q gi|254780617|r 412 SSTQKFLSKGERLTE-LLKQPQFSPLAMEEQVVMIFAGISGC 452 (509) Q Consensus 412 ~~t~~~l~rg~ri~e-~LkQ~~~~p~~~~~qv~~l~a~~~G~ 452 (509) .++.-.|+.-.+++. |..|..|+...+.+.+-.-|.+.+-| T Consensus 399 ~~d~l~l~f~~~fe~~f~~qg~y~~r~~~~~ld~~W~llr~f 440 (464) T TIGR01040 399 SEDLLYLEFLEKFEKKFIAQGAYENRTIFESLDLAWELLRIF 440 (464) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 131124578889988775126521026898898889999986 No 55 >TIGR01039 atpD ATP synthase F1, beta subunit; InterPro: IPR005722 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis . These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents the beta subunit found in the F1 complex of F-ATPases. In F-ATPases, there are three copies each of the alpha and beta subunits that form the catalytic core of the F1 complex, while the remaining F1 subunits (gamma, delta, epsilon) form part of the stalks. There is a substrate-binding site on each of the alpha and beta subunits, those on the beta subunits being catalytic, while those on the alpha subunits are regulatory. The alpha and beta subunits form a cylinder that is attached to the central stalk. The alpha/beta subunits undergo a sequence of conformational changes leading to the formation of ATP from ADP, which are induced by the rotation of the gamma subunit, itself is driven by the movement of protons through the F0 complex C subunit , . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0008553 hydrogen-exporting ATPase activity phosphorylative mechanism, 0046933 hydrogen ion transporting ATP synthase activity rotational mechanism, 0006754 ATP biosynthetic process, 0015986 ATP synthesis coupled proton transport, 0016021 integral to membrane. Probab=100.00 E-value=0 Score=485.64 Aligned_cols=435 Identities=24% Similarity=0.394 Sum_probs=388.6 Q ss_pred EEEEEEEECCEEEEECCCCCCCCCE---EEEC--CCE-EEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCH Q ss_conf 3589998275899975888773747---9957--981-899992059859999805866867898889779878866794 Q gi|254780617|r 28 IGRVLSIGDGIARVYGLNNIRAGEM---VQFS--HGV-YGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGL 101 (509) Q Consensus 28 ~G~V~~V~~giv~v~GL~~a~~GEl---v~~~--~g~-~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~ 101 (509) .|+|..|-+.++.+. ++...+-++ +.+. ... .-....+.++.|.++.++.++|+..|..|..++...+||||. T Consensus 2 ~G~~~~v~g~vvdv~-f~~~~~P~~~~~l~~~~~~~l~~e~~~~~G~~~vr~ia~~~~~Gl~r~~~~~~~~~~i~vPvG~ 80 (460) T TIGR01039 2 KGKVVQVIGPVVDVE-FEESKLPEIYDALKVELEKELVLEVAQHLGDDTVRAIALDSTDGLVRGLEVVDTGKPIEVPVGK 80 (460) T ss_pred CCEEEEEEHHHEEEE-ECHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHCCCEEEECCCC T ss_conf 860444010020111-1400026799887554102556344765067605678731433302213555438725511474 Q ss_pred HHHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHH Q ss_conf 69830003002314466444544410010135773112660014441602331011115683365524778861158999 Q gi|254780617|r 102 ELLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILD 181 (509) Q Consensus 102 ~lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~ 181 (509) ..+||++|.+|+|+|.++++.....+++++.+|...+...-.+.|+||||+||.|.|..||.|+++|||+|+|||.++.+ T Consensus 81 ~~~G~~~nv~G~~~d~~~~~~~~~~~~~~~~~P~~~~~~~~~~~l~tG~kv~dll~P~~~GGk~glfGGaGvGktv~~~e 160 (460) T TIGR01039 81 EVLGRIFNVLGEPIDEKGEVKKKEKLPIHRKAPSFEEQSTKVEILETGIKVIDLLAPYAKGGKIGLFGGAGVGKTVLIQE 160 (460) T ss_pred CHHHHHHHHHCHHCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHH T ss_conf 04557876621000341110023420122068513443346789873368887751543288067731766224566899 Q ss_pred HHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCC-EEEEEEECCCCCHHHHHHHHHHHHHHHHHHHC- Q ss_conf 9999886302466787606999951677678999998511467421-15899728878889999988766666675420- Q gi|254780617|r 182 TFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALS-YSIVVVASASDPAPMQLLAPFAGCAMGEYFRD- 259 (509) Q Consensus 182 ~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~-~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~- 259 (509) +|.|-+..+.+ ++||+++|||.||..+++.++.+.+.++ ++.+|...+++||+.|.+.+++++++|||||| T Consensus 161 li~~~a~~~~G-------~sv~~GvGertreGndly~e~~~~~v~~~~~~lv~Gqm~ePPG~r~rva~~~lt~ae~frd~ 233 (460) T TIGR01039 161 LINNIAKEHGG-------LSVFAGVGERTREGNDLYLEMKESGVINSKVALVYGQMNEPPGARLRVALTGLTMAEYFRDE 233 (460) T ss_pred HHHHHHHHHCC-------EEEEEECCCCCCCHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 99999986288-------07985066533325689999873241012326887213689874110345677888875112 Q ss_pred CCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHH Q ss_conf 46884799616478999998754530378754333752001246788887504544588735675310036766443114 Q gi|254780617|r 260 NGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIP 339 (509) Q Consensus 260 ~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~ 339 (509) +++|||+++|++.||.+|..|+|.+|||.|+.-||.|++-..+..|.||....+. ||||++..|++|+||+|||.| T Consensus 234 ~~~dvl~f~dnifrf~~aG~e~s~llGr~PsavGyqPtl~~~~G~lqeri~st~~----~s~ts~qa~yvPadd~tdPaP 309 (460) T TIGR01039 234 EGQDVLLFIDNIFRFTQAGSEVSALLGRLPSAVGYQPTLATEMGELQERITSTKK----GSITSVQAVYVPADDLTDPAP 309 (460) T ss_pred CCCEEEEEEHHHHHHHHHHHHHHHHHHCCCHHCCCCCCHHHHHHHHHHHHHHCCC----CCEEEEEEEEECCCCCCCCCC T ss_conf 4663788514467776310478988610110005663035667678777652135----532567678633202578863 Q ss_pred HHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-HC-CCCHHHHH Q ss_conf 668854061699615012210266543344412203-4343898877556788879988767879876-33-89989999 Q gi|254780617|r 340 TNVISITDGQIFLETELFYQGIRPAINIGLSVSRVG-SAAQVKAMKQVSGAVKGELAQYREMSSFSKF-SS-DLDSSTQK 416 (509) Q Consensus 340 ~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~-~~~~~~~~~~~a~~lr~~laqy~Ele~f~~f-gs-dlD~~t~~ 416 (509) ..+..++|....|+|.+++.|+||||||+.|.||.. +......|.++|..++..|++|++|+++..+ |- +|+++++. T Consensus 310 ~~~fahlda~~~l~r~~~~~GiyPav~Pl~s~s~~l~~~~vG~~hy~~a~~v~~~lq~y~~l~diiailG~del~~~d~~ 389 (460) T TIGR01039 310 ATTFAHLDATTVLSRKIAELGIYPAVDPLDSTSRLLDPEVVGEEHYEVAREVQSILQRYKELQDIIAILGLDELSEEDKL 389 (460) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHH T ss_conf 03444400024545668751776554651013444305652167899999999999988878889888431441144689 Q ss_pred HHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999966999887488999999999866980667989999999999999974159999988 Q gi|254780617|r 417 FLSKGERLTELLKQPQFSPLAMEEQVVMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQDILEDI 483 (509) Q Consensus 417 ~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ei~~~I 483 (509) .++|+++|..||.|+.+ ++-.|.+..|.+..+. +.|..|...+....+.-+...+..+ T Consensus 390 ~v~rar~i~~flsq~~~--------vae~f~G~~G~~v~l~-~t~~~f~~~~~G~~d~~Pe~af~~v 447 (460) T TIGR01039 390 VVERARKIERFLSQPFF--------VAEVFTGSPGKYVKLK-DTIEGFKEILEGKYDHLPEQAFYLV 447 (460) T ss_pred HHHHHHHHHHHHCCCHH--------HHHHCCCCCCCEEEHH-HHHHHHHHHHCCCCCCCCHHHHHHH T ss_conf 99999999987313200--------0232057887546478-8999999984684113524577664 No 56 >KOG1352 consensus Probab=100.00 E-value=0 Score=471.33 Aligned_cols=476 Identities=22% Similarity=0.306 Sum_probs=373.8 Q ss_pred CCCCCCEEEEEEEEEEECCEEEEECCCCCCCCCEEEEC-CCEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEE Q ss_conf 45554201035899982758999758887737479957-98189999205985999980586686789888977987886 Q gi|254780617|r 19 FGNDSEFSEIGRVLSIGDGIARVYGLNNIRAGEMVQFS-HGVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDV 97 (509) Q Consensus 19 ~~~~~~~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~-~g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~v 97 (509) ......-.++|+|-+|.|+++.+..+-++.++|+|+.. ++..|+++.++.|.+.++++++++|+..||+|.+||++++| T Consensus 12 ~~~e~~es~~G~v~~VSGPVV~a~~M~G~aMYELVrVGh~~LvGEiIrlegD~aTIQVYEeTsG~tVgDpvlrTgkPLsv 91 (618) T KOG1352 12 SLDEEEESEYGRVYSVSGPVVVAENMAGCAMYELVRVGHDELVGEIIRLEGDMATIQVYEETSGLTVGDPVLRTGKPLSV 91 (618) T ss_pred CCCCHHHHCCCEEEECCCCEEEECCCHHHHHHHHHHCCHHHHCEEEEEECCCEEEEEEEECCCCCCCCCCHHHCCCCCEE T ss_conf 34432331051488525765750011106799999713475300005861760379998124885148704315885157 Q ss_pred ECCHHHHHCCCCHHHCCCCCCCCCCCC----------------------------------------------------- Q ss_conf 679469830003002314466444544----------------------------------------------------- Q gi|254780617|r 98 PVGLELLGRVVDALGNPIDGKGPIKCE----------------------------------------------------- 124 (509) Q Consensus 98 pvG~~lLGRVvD~lG~PlDg~g~i~~~----------------------------------------------------- 124 (509) ..|++++|.++|+..+||..-..+... T Consensus 92 ELGPGimgsIfDGIQRPLk~I~~~s~siyiPkGv~~~aL~r~~~weF~p~k~~vg~hitGGDiyg~V~ENsli~hki~lp 171 (618) T KOG1352 92 ELGPGIMGSIFDGIQRPLKDISELSQSIYIPKGVNTPALDREIKWEFTPGKLRVGDHITGGDIYGSVFENSLIKHKILLP 171 (618) T ss_pred EECCCHHHHHHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCCCEEECCCCEEEECCEEECCCEEEEEECCCHHHCEEECC T ss_conf 62753676565654334899987508675347877544670000201455044345210571478863120433202148 Q ss_pred -------------------------------------CCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEE Q ss_conf -------------------------------------4100101357731126600144416023310111156833655 Q gi|254780617|r 125 -------------------------------------QRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELI 167 (509) Q Consensus 125 -------------------------------------~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I 167 (509) +.||+..+. ++.+..+-+.||.||-|++|+|+||..|....| T Consensus 172 Pr~~Gtvt~iAp~G~Y~~~d~vlE~Ef~g~k~~~tmlq~WPVR~pR-Pv~ekl~an~PLltGQRvLDalfPcVqGGTtaI 250 (618) T KOG1352 172 PRARGTVTYIAPAGNYTLDDVVLELEFDGEKTKFTMLQTWPVRQPR-PVTEKLPANHPLLTGQRVLDALFPCVQGGTTAI 250 (618) T ss_pred CCCCCEEEEEECCCCCCCCCEEEEEEECCCCEEEEEEEECCCCCCC-CHHHCCCCCCCCCCCCHHHHHHCCHHCCCCCCC T ss_conf 7667648998068865211078998736841157789861457776-411116789964133068875442123886116 Q ss_pred ECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC---------CCCCCEEEEEEECCCC Q ss_conf 2477886115899999998863024667876069999516776789999985114---------6742115899728878 Q gi|254780617|r 168 IGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED---------RGALSYSIVVVASASD 238 (509) Q Consensus 168 ~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~---------~~~~~~tvvv~a~a~~ 238 (509) .|..|||||+|- |.- .+.++....|||++|||++|+.|+...+-+ ...|+||++|++|++. T Consensus 251 PGAFGCGKTVIS------QsL----SKYSNSD~iiYVGCGERGNEMsEVL~dFPeLt~ev~G~~esiMKRT~LVANTSNM 320 (618) T KOG1352 251 PGAFGCGKTVIS------QSL----SKYSNSDAIIYVGCGERGNEMSEVLMDFPELTMEVDGKTESIMKRTALVANTSNM 320 (618) T ss_pred CCCCCCCHHHHH------HHH----HHCCCCCEEEEECCCCCCHHHHHHHHHCHHHEEECCCCCHHHHHHHHHHHCCCCC T ss_conf 764565547777------777----5405787699973666521589998737122863488421455441221047888 Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC---C Q ss_conf 8899999887666666754204688479961647899999875453037875433375200124678888750454---4 Q gi|254780617|r 239 PAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSD---A 315 (509) Q Consensus 239 ~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~---~ 315 (509) |.+.|-.+.|+|.+++|||||+|+||-.+.||.+|||+|+||||-.|.|+|..-|||..+-.-++.++||||+... . T Consensus 321 PVAAREASIYTGITlsEYfRDmG~nVsMmADStSRWAEALREISGRLaEMPADsGyPAYLgArLAsFYERAG~vkcLGsP 400 (618) T KOG1352 321 PVAAREASIYTGITLSEYFRDMGYNVSMMADSTSRWAEALREISGRLAEMPADSGYPAYLGARLASFYERAGRVKCLGSP 400 (618) T ss_pred CHHHHHHHHHHCEEHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEECCCC T ss_conf 50443202331506999998627644554154468999999863056418676797278888999999864702543798 Q ss_pred CCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEE--EECCHHHHCCCCCCCCCCCHHHCCCCCCCH------HHHHHH Q ss_conf 588735675310036766443114668854061699--615012210266543344412203434389------887755 Q gi|254780617|r 316 LGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIF--LETELFYQGIRPAINIGLSVSRVGSAAQVK------AMKQVS 387 (509) Q Consensus 316 ~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~--l~~~l~~~g~~Paid~~~S~SRv~~~~~~~------~~~~~a 387 (509) ...||+|.+..|..||+|+|||+.+.+.+|+ |+| ||.+||++.|||.||++.|.|..+.+...- ..--+- T Consensus 401 ~ReGsVsIVgAVSPpGGDFsDPVTsATLgIv--QVFWGLDKKLAQRKHFPSiNwliSYSkY~~aL~~~Ye~~~peF~~lR 478 (618) T KOG1352 401 DREGSVSIVGAVSPPGGDFSDPVTSATLGIV--QVFWGLDKKLAQRKHFPSINWLISYSKYMRALDPFYEKNYPEFVVLR 478 (618) T ss_pred CCCCEEEEEEEECCCCCCCCCCCHHHHHHEE--EEHHCCCHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHH T ss_conf 7676069998626999886773002112001--00010247777530388620345189999877777874288764068 Q ss_pred HHHHHHHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHHHHH-HHCCCCCCCC----CHHHHHHHHHHHHCCCCCCCC--H Q ss_conf 67888799887678798763--3899899999999999999-9669998874----889999999998669806679--8 Q gi|254780617|r 388 GAVKGELAQYREMSSFSKFS--SDLDSSTQKFLSKGERLTE-LLKQPQFSPL----AMEEQVVMIFAGISGCLDEVA--V 458 (509) Q Consensus 388 ~~lr~~laqy~Ele~f~~fg--sdlD~~t~~~l~rg~ri~e-~LkQ~~~~p~----~~~~qv~~l~a~~~G~ld~i~--v 458 (509) .+.|..|++..+|..+.|+. +.|.+.+|..|+-++.|.+ ||+|+.|+|| |+.+.+-+|-.... |+|.-. + T Consensus 479 tk~keilq~eedl~eivQLVGK~aLaetdKitLevaklikdDfLqQN~ys~YD~~CPfyKt~~Mlrn~i~-fyd~a~~av 557 (618) T KOG1352 479 TKAKEILQEEEDLSEIVQLVGKSALAETDKITLEVAKLIKDDFLQQNGYSPYDRFCPFYKTVGMLRNIIA-FYDLARQAV 557 (618) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHHHHHH-HHHHHHHHH T ss_conf 8999997535569999998535542455523123887878888875599824304805777999999999-999999999 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHCCCC-----CHHHHHHHHHHHHHHHHHC Q ss_conf 9999999-999999974159999988732798-----9899999999999998515 Q gi|254780617|r 459 SQVRKFE-TNFLSHMRVSSQDILEDIRKQKVL-----TDDIRSKLINEIKVFLEDF 508 (509) Q Consensus 459 ~~I~~fe-~~l~~~l~~~~~ei~~~I~~~~~l-----~de~~~~L~~~i~~~~~~F 508 (509) +.-..-+ +.-+..++....+++.++.+.+.. +++++.+++++.+.....| T Consensus 558 e~~a~~~~kitw~~ir~~~gdi~y~lssmKF~dp~~ge~~i~~~~~~L~~~~~~~F 613 (618) T KOG1352 558 ENTAQSGNKITWSKIRESTGDIFYQLSSMKFKDPKDGEKEIKEDFEKLHEDIQQAF 613 (618) T ss_pred HHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 74214577345899887753488887542065832016889999999999999999 No 57 >TIGR01042 V-ATPase_V1_A V-type ATPase, A subunit; InterPro: IPR005725 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release . V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c, c d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins . This entry represents subunit A from the V1 complex of V-ATPases. There are three copies each of subunits A and B (IPR005723 from INTERPRO), both of which participate in nucleotide binding. However, only subunit A is catalytic, functioning in ATP hydrolysis to drive the rotation of the D and F subunits of V1, as well as the V0 complex c-ring rotor subunit for proton translocation , . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0046933 hydrogen ion transporting ATP synthase activity rotational mechanism, 0046961 hydrogen ion transporting ATPase activity rotational mechanism, 0015986 ATP synthesis coupled proton transport, 0016469 proton-transporting two-sector ATPase complex. Probab=100.00 E-value=0 Score=407.15 Aligned_cols=470 Identities=21% Similarity=0.298 Sum_probs=381.5 Q ss_pred EEEEEEEEECCEEEEECCCCCCCCCEEEECC-CEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHHHHH Q ss_conf 0358999827589997588877374799579-818999920598599998058668678988897798788667946983 Q gi|254780617|r 27 EIGRVLSIGDGIARVYGLNNIRAGEMVQFSH-GVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLELLG 105 (509) Q Consensus 27 ~~G~V~~V~~giv~v~GL~~a~~GElv~~~~-g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lLG 105 (509) ..|.|.+|.|+++.+..+.++.++|+|+... ...|+++.++.|...+++++++.|+..||.|.+|+++++|.+|+++++ T Consensus 1 ~~G~~~~vsGPvv~a~~~~G~~~yelv~vG~~~l~Ge~irl~~d~atiqvyeet~G~~v~d~v~rt~~Plsv~lGPG~~~ 80 (596) T TIGR01042 1 EYGSIKKVSGPVVVAENMAGAAMYELVRVGHEELVGEIIRLEGDKATIQVYEETSGLTVGDPVLRTGKPLSVELGPGLLG 80 (596) T ss_pred CCCEEEECCCCEEEECCCCCCHHHHHHHHCCHHHCEEEEEECCCEEEEEEEECCCCEEECCCHHCCCCCCEEECCCHHHH T ss_conf 96326642575234011134024455550410001254676077058988641455044451000378514640620477 Q ss_pred CCCCHHHCCCCCCCC----------------------------------------------------------------- Q ss_conf 000300231446644----------------------------------------------------------------- Q gi|254780617|r 106 RVVDALGNPIDGKGP----------------------------------------------------------------- 120 (509) Q Consensus 106 RVvD~lG~PlDg~g~----------------------------------------------------------------- 120 (509) .++|++.+||..-.. T Consensus 81 ~i~dGiqrPl~~i~~~~~~~yiP~G~~~~~l~~~~~W~f~~~~~~~~G~~~~GGd~~~~v~en~l~~~~~~l~P~~~G~~ 160 (596) T TIGR01042 81 NIFDGIQRPLKAIAEASQSIYIPRGVNVPALDREKKWEFTPSKKLRVGDHITGGDIYGLVEENSLLKHKILLPPRARGTI 160 (596) T ss_pred HHHHHHHHHHHHHHHHHCCEECCCCCCHHHHCHHHHCCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHCEEEECCCCCCEE T ss_conf 77656545699998621425224676403211112101121010013750026504665410012101023355776207 Q ss_pred --------------------------CCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCC Q ss_conf --------------------------454441001013577311266001444160233101111568336552477886 Q gi|254780617|r 121 --------------------------IKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTG 174 (509) Q Consensus 121 --------------------------i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~G 174 (509) ......||+..+.| ..++..-+.||.||-|++|+|+|+.+|....|.|..||| T Consensus 161 ~~~a~~G~y~~~~~~le~ef~G~~~~~~~~~~WPvr~~rP-~~~~~~a~~Pll~Gqrvld~lfP~v~GGt~~ipGafGCG 239 (596) T TIGR01042 161 TYIAPAGEYTVEEKVLEVEFDGKKKKYSMLQVWPVRSPRP-VAEKLAANTPLLTGQRVLDALFPSVQGGTTAIPGAFGCG 239 (596) T ss_pred EEEECCCCEEEEEEEEEEEECCCCCEEEEEEECCCCCCCC-HHHHHCCCCCCEEHHHHHHHHCCCCCCCCEEECCCCCCC T ss_conf 9973587601100357654247300024544326666564-034313578611014677754333267741203565653 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH----------CCCCCCCEEEEEEECCCCCHHHHH Q ss_conf 1158999999988630246678760699995167767899999851----------146742115899728878889999 Q gi|254780617|r 175 KTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL----------EDRGALSYSIVVVASASDPAPMQL 244 (509) Q Consensus 175 Kt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l----------~~~~~~~~tvvv~a~a~~~~~~r~ 244 (509) ||++.. .+ .+.++...+||+++||||+|++|+...+ +....|+||++|++|++.|...|- T Consensus 240 ktvisq-~l---------skysnsd~i~yvGCGerGnem~evl~~fP~l~~~~~~G~~~~imkrt~lvantsnmPvaare 309 (596) T TIGR01042 240 KTVISQ-SL---------SKYSNSDVIVYVGCGERGNEMAEVLRDFPELTIEVPDGREESIMKRTTLVANTSNMPVAARE 309 (596) T ss_pred HHHHHH-HH---------HHHCCCCEEEEEECCCCCHHHHHHHHHCHHHEEEECCCCCCHHHHHHHHHHCCCCCCHHHHH T ss_conf 257665-55---------53214761799606875126899986320305740478742023333221002467502211 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC---CCCCCCE Q ss_conf 9887666666754204688479961647899999875453037875433375200124678888750454---4588735 Q gi|254780617|r 245 LAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSD---ALGAGSL 321 (509) Q Consensus 245 ~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~---~~g~GSi 321 (509) .+.|+|.++||||||+|++|-.+.|+-+|||+|+||||-.|.|.|...|||..+-.-++.++||+|+... ....||+ T Consensus 310 asiytGitl~eyfrd~G~~~~m~ads~srWaealreisGrl~emPad~GyPayl~a~lasfye~~G~~~~~G~P~r~G~v 389 (596) T TIGR01042 310 ASIYTGITLAEYFRDMGYNVSMMADSTSRWAEALREISGRLAEMPADSGYPAYLGARLASFYERAGRVKCLGSPEREGSV 389 (596) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCEEECCCCCCCCEE T ss_conf 12330025767776328726775344036899998763133216667785067887888776305714540488865106 Q ss_pred EEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCC------HHHHHHHHHHHHHHH Q ss_conf 67531003676644311466885406169961501221026654334441220343438------988775567888799 Q gi|254780617|r 322 TALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQV------KAMKQVSGAVKGELA 395 (509) Q Consensus 322 T~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~------~~~~~~a~~lr~~la 395 (509) |.+..|..+|+|++||+...+.+|..=..-||.+|+++.|||.||+..|.|+....... .....+-.++|..|+ T Consensus 390 ~iv~avsP~GGdfsdPvt~atl~i~qvfWGldkklaqrkh~Ps~n~~~sys~y~~~l~~~y~~~~~~f~~lr~~~~~~l~ 469 (596) T TIGR01042 390 SIVGAVSPPGGDFSDPVTSATLGIVQVFWGLDKKLAQRKHFPSVNWLISYSKYLRALEEFYEKNYPEFVSLRTKIKEILQ 469 (596) T ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 89973068988756521244544435433123566642157604453327899999998753103037999999999974 Q ss_pred HHHHHHHHHHHH-C-CCCHHHHHHHHHHHHHHH-HHCCCCCCCC----CHHHHHHHHHHHHCCCCCCCC--HHH----HH Q ss_conf 887678798763-3-899899999999999999-9669998874----889999999998669806679--899----99 Q gi|254780617|r 396 QYREMSSFSKFS-S-DLDSSTQKFLSKGERLTE-LLKQPQFSPL----AMEEQVVMIFAGISGCLDEVA--VSQ----VR 462 (509) Q Consensus 396 qy~Ele~f~~fg-s-dlD~~t~~~l~rg~ri~e-~LkQ~~~~p~----~~~~qv~~l~a~~~G~ld~i~--v~~----I~ 462 (509) +..+|..+.|+. - .|.+.++-.|+.+..|.+ ||+|+.|+.| |+.+.+.++-.... |++.-. ++. -. T Consensus 470 ~~~~l~~~vqlvGk~~l~~~dk~~l~~a~~~~~d~l~qn~~~~yd~~CP~yk~~~m~~~~~~-f~~~~~~~~~~~~~~~~ 548 (596) T TIGR01042 470 EEEELEEIVQLVGKSALAETDKITLEVAKLIKEDFLQQNGYSKYDKFCPFYKTVGMLRNLIA-FYDLARKAVEKTAQGSS 548 (596) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCHHCCCCHHHHHHHHHHHHH-HHHHHHHHHHHHHHCCC T ss_conf 03458999998411320134510246899999876531365301223624889999999999-99999999874320010 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCH-HHHHHHHHHHHHHHHHC Q ss_conf 99999999997415999998873279898-99999999999998515 Q gi|254780617|r 463 KFETNFLSHMRVSSQDILEDIRKQKVLTD-DIRSKLINEIKVFLEDF 508 (509) Q Consensus 463 ~fe~~l~~~l~~~~~ei~~~I~~~~~l~d-e~~~~L~~~i~~~~~~F 508 (509) .-.+..+..++....++...+.+.+.-+. +.+.+++..++++.+.. T Consensus 549 ~~~k~~~~~~~~~~~~~~~~~~s~kf~~P~~G~~~~~~~~~~l~~~~ 595 (596) T TIGR01042 549 DDNKITWSIIKESLGDLLYKLSSMKFEDPSDGEEKIKKEYEKLAEDL 595 (596) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHC T ss_conf 01257789999999999999887530370025899999999999861 No 58 >PRK12608 transcription termination factor Rho; Provisional Probab=100.00 E-value=6.1e-40 Score=309.01 Aligned_cols=293 Identities=19% Similarity=0.214 Sum_probs=226.4 Q ss_pred CCCCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCC---------CCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCC Q ss_conf 668678988897798788667946983000300231446---------64445444100101357731126600144416 Q gi|254780617|r 79 YKEISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDG---------KGPIKCEQRSCTEADAPGIIQRQSVCEPLSTG 149 (509) Q Consensus 79 ~~gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg---------~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TG 149 (509) -.+++.||.|...-++ -.-|+ -|-+|-...|.+.+. .-|+..+++..++.. +=..- T Consensus 55 rf~LR~GD~V~G~iR~--p~~ge-aL~~V~~VNg~~pe~~~~R~~F~~LTPi~P~erl~LE~~------------~~~~s 119 (379) T PRK12608 55 RFGLRTGDLVEGVIRA--PREKE-TLVRIDSVNGTDPEKLQRRKHFDDLTPLHPEERIVLETG------------SDDLS 119 (379) T ss_pred HCCCCCCCEEEEEECC--CCCCH-HHEEEHHCCCCCHHHHCCCCCCCCCCCCCCCCEEECCCC------------CCCCC T ss_conf 5499999989998638--99874-204145238949899557467556877799650440148------------87644 Q ss_pred HHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEE Q ss_conf 02331011115683365524778861158999999988630246678760699995167767899999851146742115 Q gi|254780617|r 150 IKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYS 229 (509) Q Consensus 150 I~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~t 229 (509) -|+||.+.|||||||.+|++++++|||+|+. .|.| ....+++++.++..+|+||.+||.+|.+..+. T Consensus 120 ~RiiDL~aPIGkGQRgLIVAPPkaGKT~LLq-~IA~-----aI~~NhPev~livLLIDERPEEVTdm~r~v~g------- 186 (379) T PRK12608 120 MRVLDLVAPIGKGQRGLIVAPPRAGKTILLQ-QIAQ-----AVAANHPDIHLMVLLIDERPEEVTDMKRSVKG------- 186 (379) T ss_pred CCEEEEECCCCCCCCCEEECCCCCCHHHHHH-HHHH-----HHHHCCCCCEEEEEECCCCCHHHHHHHHHCCE------- T ss_conf 1103100463457401274589865789999-9999-----99857998489999816893588888862370------- Q ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 89972887888999998876666667542046884799616478999998754530378754333752001246788887 Q gi|254780617|r 230 IVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERA 309 (509) Q Consensus 230 vvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa 309 (509) -||++|.|+||-.+..+++.++..|++++++|+||++++||+||+|+||+.+....|+..+ .|+++...+ .++.+..| T Consensus 187 EVvaSTfD~~~~~h~~vAel~lerAkrlvE~G~dVvillDSiTRlaRAyn~~~~~sGr~ls-gg~d~~al~-~pk~~Fga 264 (379) T PRK12608 187 EVYASTFDRPYDRHIRVAELVLERAKRLVEEGKDVVILLDSLTRLARAYNSEVDSSGRTLS-GGVDARALE-RPKRLFGA 264 (379) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHCCCCCCCCCCC-CCCCHHHHH-HHHHHHHH T ss_conf 7999779899899999999999999999876996899965177889987533688886157-786956750-06898531 Q ss_pred HHCCCCCCCCCEEEEEEEECC-CCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHH Q ss_conf 504544588735675310036-7664431146688540616996150122102665433444122034343898877556 Q gi|254780617|r 310 AKMSDALGAGSLTALPVIETQ-VNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSG 388 (509) Q Consensus 310 ~~~~~~~g~GSiT~~~~v~~~-~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~ 388 (509) ++- ..+|||+|++.|++++ |.+|+++|.+++.++.+++|+|||+|+++++|||||+..|.+|=----..+...+... T Consensus 265 ar~--ie~gGSlTiiaTaLveTgs~mD~~i~eefkgtgn~el~Ldr~la~~r~fPAIdi~~SgTR~EelLl~~~e~~~~~ 342 (379) T PRK12608 265 ARK--IEEGGSLTILATALVDTGSRMDEVIFEEFKGTGNMEIVLDRELADKRVFPAIDIAKSGTRREELLLDSKELEKVR 342 (379) T ss_pred CCC--CCCCCCCEEEEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCCCCCCCCCCCCCHHHHCCCHHHHHHHH T ss_conf 257--778866325666632266643458899963268608997255776788751155457762266548999999999 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 788879988767879 Q gi|254780617|r 389 AVKGELAQYREMSSF 403 (509) Q Consensus 389 ~lr~~laqy~Ele~f 403 (509) .+|+.|+.-...+++ T Consensus 343 ~lRr~l~~~~~~ea~ 357 (379) T PRK12608 343 LLRRALASLKPVEAM 357 (379) T ss_pred HHHHHHCCCCHHHHH T ss_conf 999986179979999 No 59 >PRK09376 rho transcription termination factor Rho; Provisional Probab=100.00 E-value=5.6e-33 Score=258.45 Aligned_cols=293 Identities=20% Similarity=0.231 Sum_probs=214.6 Q ss_pred CCCCCCCEEEECCCEEEEECCHHH--HHCCCCHHHCCC---------CCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHC Q ss_conf 686789888977987886679469--830003002314---------466444544410010135773112660014441 Q gi|254780617|r 80 KEISEGDIVKRTGRIVDVPVGLEL--LGRVVDALGNPI---------DGKGPIKCEQRSCTEADAPGIIQRQSVCEPLST 148 (509) Q Consensus 80 ~gI~~G~~V~~tg~~~~vpvG~~l--LGRVvD~lG~Pl---------Dg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~T 148 (509) .+++.||.|...-++. .-|+.+ |=+|-...|.+- |..-|+..+++..++. .|-.. T Consensus 90 f~LR~GD~V~G~vR~p--ke~Ery~aLl~V~~VNg~~pe~~~~r~~F~~LTPi~P~erl~LE~------------~~~~~ 155 (416) T PRK09376 90 FNLRTGDTVEGKIRPP--KEGERYFALLKVETVNGEDPEKARNRILFENLTPLYPNERLKLET------------GTEDL 155 (416) T ss_pred HCCCCCCEEEEEEECC--CCCCCCCCEEEEEECCCCCHHHHCCCCCCCCCCCCCCHHHHCCCC------------CCCCC T ss_conf 0998888899988689--888877433776322893989965735401487779722311456------------88765 Q ss_pred CHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCE Q ss_conf 60233101111568336552477886115899999998863024667876069999516776789999985114674211 Q gi|254780617|r 149 GIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSY 228 (509) Q Consensus 149 GI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~ 228 (509) --|+||.+.|||||||-+|.+++.+|||+|+. .|.| ....+++++.++..+|+||..||.+|.+..+. T Consensus 156 s~RiiDL~aPIGkGQRgLIVAPPkaGKT~lLq-~IA~-----aI~~N~Pe~~liVLLIDERPEEVTdm~r~v~~------ 223 (416) T PRK09376 156 STRVIDLVAPIGKGQRGLIVAPPKAGKTVLLQ-NIAN-----SITTNHPEVHLIVLLIDERPEEVTDMQRSVKG------ 223 (416) T ss_pred CCCCEEEECCCCCCCCCEEECCCCCCHHHHHH-HHHH-----HHHHCCCCEEEEEEEECCCCHHHHHHHHHCCE------ T ss_conf 40021100541358500375699875479999-9999-----99856997199999904893477787750461------ Q ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 58997288788899999887666666754204688479961647899999875453037875433375200124678888 Q gi|254780617|r 229 SIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLER 308 (509) Q Consensus 229 tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ER 308 (509) -||++|.|+||-.+...+..++--|..+.++|+||++++||+||.|+||.......|+.-+ .|.-+...|.-.+++-- T Consensus 224 -eV~aStfD~~~~~H~~vae~~lerAkRlvE~G~DVvillDSiTRLaRAyN~~~~~sGr~ls-GG~D~~Al~~PKrfFGA 301 (416) T PRK09376 224 -EVVASTFDEPAERHVQVAEMVIEKAKRLVEHGKDVVILLDSITRLARAYNTVVPSSGKVLS-GGVDANALHRPKRFFGA 301 (416) T ss_pred -EEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHCCCCCCCCCCC-CCCCHHHHCCCHHHHHH T ss_conf -8999779998789999999999999999876997899973157888886234699877544-76577776055676410 Q ss_pred HHHCCCCCCCCCEEEEEEEECC-CCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHH Q ss_conf 7504544588735675310036-766443114668854061699615012210266543344412203434389887755 Q gi|254780617|r 309 AAKMSDALGAGSLTALPVIETQ-VNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVS 387 (509) Q Consensus 309 a~~~~~~~g~GSiT~~~~v~~~-~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a 387 (509) |- +..+|||+|.+.|.+++ |.-|++.|-.+..+.-..+++|||+|+++++|||||+..|-+|=----..+...+.. T Consensus 302 AR---nie~GGSLTIiaTaLveTgSrMDdvIfeEfkgTgNmEl~Ldr~la~~RifPAIdi~~SgTRkEelLl~~~e~~~~ 378 (416) T PRK09376 302 AR---NIEEGGSLTIIATALIDTGSRMDEVIFEEFKGTGNMELHLDRKLAEKRIFPAIDINRSGTRKEELLLSPEELQKV 378 (416) T ss_pred HC---CCCCCCCHHHHHHHHHCCCCCHHHHHHHHHCCCCCEEEEECHHHHHCCCCCCCCCCCCCCCHHHHCCCHHHHHHH T ss_conf 02---677675177877777526862538999985157845999876677578864013335777257653799999999 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 6788879988767879 Q gi|254780617|r 388 GAVKGELAQYREMSSF 403 (509) Q Consensus 388 ~~lr~~laqy~Ele~f 403 (509) +.+|+.|+.....+++ T Consensus 379 ~~lRr~l~~~~~~ea~ 394 (416) T PRK09376 379 WILRKILSPMDEVEAM 394 (416) T ss_pred HHHHHHHHCCCHHHHH T ss_conf 9999987159979999 No 60 >PRK12678 transcription termination factor Rho; Provisional Probab=99.97 E-value=3.2e-29 Score=231.15 Aligned_cols=241 Identities=17% Similarity=0.242 Sum_probs=191.3 Q ss_pred HHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCC Q ss_conf 44416023310111156833655247788611589999999886302466787606999951677678999998511467 Q gi|254780617|r 145 PLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRG 224 (509) Q Consensus 145 ~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~ 224 (509) |-..--++||.+.|||||||-+|.+++.+|||+|+. .|.| ....+++.+..++++|.||..||.+|...++. T Consensus 394 ~~~~t~RiiDl~~PiGkGQRgLIVapPkaGKT~ll~-~ia~-----ai~~N~pe~~l~vlLiDERPEEVTdm~r~v~g-- 465 (667) T PRK12678 394 PNKLTTRVIDLIMPIGKGQRGLIVSPPKAGKTTILQ-DIAN-----AITTNNPECHLMVVLVDERPEEVTDMQRSVKG-- 465 (667) T ss_pred CCCCCHHHHEEECCCCCCCEEEEECCCCCCCHHHHH-HHHH-----HHHHCCCCEEEEEEECCCCCHHHHHHHHCCCE-- T ss_conf 765322230111356788454675799787259999-9999-----99856997289999737885156676640460-- Q ss_pred CCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH Q ss_conf 42115899728878889999988766666675420468847996164789999987545303787543337520012467 Q gi|254780617|r 225 ALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSR 304 (509) Q Consensus 225 ~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~ 304 (509) -||++|.|+||-.....+-.++--|..+.+.|+||+|++|||||.|+||.-.+-..|+.-+ .|.-+...|.--+ T Consensus 466 -----eViaStfD~~~~~H~~vael~iErAkRlvE~g~DVvillDSiTRLaRAyN~~~p~sGr~ls-GGvD~~Al~~PKr 539 (667) T PRK12678 466 -----EVIASTFDRPPSDHTTVAELAIERAKRLVELGHDVVVLLDSITRLGRAYNLAAPASGRILS-GGVDSTALYPPKR 539 (667) T ss_pred -----EEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHHHCCCCCCCCCCC-CCCCHHHHHCHHH T ss_conf -----6998668898888999999999999998756997799964066888876044699877346-7648676601077 Q ss_pred HHHHHHHCCCCCCCCCEEEEEEEECC-CCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHH Q ss_conf 88887504544588735675310036-76644311466885406169961501221026654334441220343438988 Q gi|254780617|r 305 LLERAAKMSDALGAGSLTALPVIETQ-VNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAM 383 (509) Q Consensus 305 l~ERa~~~~~~~g~GSiT~~~~v~~~-~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~ 383 (509) ++--|-+ ..+|||+|.|.+.+++ |.-|.+-|-.++.+.-.-.++|||+|+++.+|||||+..|-+|----...+.. T Consensus 540 ffGAARn---ie~GGSLTIlaTaLveTGSrmDevIfeEFKGTGNmEl~LdR~la~~RifPAidi~~SgTR~eelLl~~~e 616 (667) T PRK12678 540 FFGAARN---IENGGSLTIIATALVETGSTMDTVIFEEFKGTGNAELKLDRKIADKRVFPAVDVNPSGTRKEELLLSPDE 616 (667) T ss_pred HHHHHHC---CCCCCCHHHHHHHHHCCCCHHHHHHHHHHCCCCCEEEEECHHHHHCCCCCCCCCCCCCCCHHHHCCCHHH T ss_conf 7677752---4767447675445530465067889988537664288862335645565311344577506765379999 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 7755678887998876787 Q gi|254780617|r 384 KQVSGAVKGELAQYREMSS 402 (509) Q Consensus 384 ~~~a~~lr~~laqy~Ele~ 402 (509) .+....||..|+.....++ T Consensus 617 ~~~~~~lRr~l~~~~~~~a 635 (667) T PRK12678 617 LAIVHKLRRVLSGLDSQQA 635 (667) T ss_pred HHHHHHHHHHHHCCCHHHH T ss_conf 9999999998725998999 No 61 >COG1158 Rho Transcription termination factor [Transcription] Probab=99.97 E-value=3.9e-29 Score=230.53 Aligned_cols=236 Identities=19% Similarity=0.251 Sum_probs=189.4 Q ss_pred HHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEE Q ss_conf 02331011115683365524778861158999999988630246678760699995167767899999851146742115 Q gi|254780617|r 150 IKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYS 229 (509) Q Consensus 150 I~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~t 229 (509) -|+||.+.|||||||.+|.+++.+|||+|+. -| ++....++.++..++.+|.||..||.++.+..+.. T Consensus 161 ~RviDL~~PIGkGQR~LIVAPPkaGKT~lLq-~I-----A~aIt~N~Pe~~LiVLLIDERPEEVTdmqrsV~ge------ 228 (422) T COG1158 161 TRVIDLISPIGKGQRGLIVAPPKAGKTTLLQ-NI-----ANAITTNHPECELIVLLIDERPEEVTDMQRSVKGE------ 228 (422) T ss_pred HHHHHHHCCCCCCCEEEEECCCCCCCHHHHH-HH-----HHHHHCCCCCEEEEEEEECCCCHHHHHHHHHHCCE------ T ss_conf 6676652656788465686698787338999-99-----99986379964999999347806777788752406------ Q ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 89972887888999998876666667542046884799616478999998754530378754333752001246788887 Q gi|254780617|r 230 IVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERA 309 (509) Q Consensus 230 vvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa 309 (509) ||++|-|+||......+....--|....++|+||++++||+||.|+||..+.-..|+.-+ .|.-+..+|---|++--| T Consensus 229 -ViaSTFDepp~~HvqVAE~viEkAKRlVE~~kDVVILLDSITRLaRAYN~v~P~SGkvLs-GGvD~nAL~~PKrFFGAA 306 (422) T COG1158 229 -VVASTFDEPPSRHVQVAEMVIEKAKRLVEHGKDVVILLDSITRLARAYNTVVPSSGKVLS-GGVDANALHRPKRFFGAA 306 (422) T ss_pred -EEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHCCCCCCCCCEEC-CCCCHHHHCCCHHHHHHH T ss_conf -986448886054689999999999988871786899965677899885366799777404-873856604835542122 Q ss_pred HHCCCCCCCCCEEEEEEEECC-CCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHH Q ss_conf 504544588735675310036-7664431146688540616996150122102665433444122034343898877556 Q gi|254780617|r 310 AKMSDALGAGSLTALPVIETQ-VNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSG 388 (509) Q Consensus 310 ~~~~~~~g~GSiT~~~~v~~~-~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~ 388 (509) -+ ...|||+|.+.|.+++ |.-|++-|-.+..+.-.-.++|||+|+++.+|||||+.+|-+|--.-...+.+.+-.+ T Consensus 307 RN---IEeGGSLTIiATALVdTGSrMDeVIfEEFKGTGNmEl~LdR~laerRifPAIdi~kSGTRkEeLLl~~~~l~k~w 383 (422) T COG1158 307 RN---IEEGGSLTIIATALVDTGSRMDEVIFEEFKGTGNMELHLDRKLAERRIFPAIDINKSGTRKEELLLSPDELQKMW 383 (422) T ss_pred HC---CCCCCCHHHHHHHHHHCCCCCCHHHHHHHCCCCCEEEEEHHHHHHCCCCCCEECCCCCCCHHHHCCCHHHHHHHH T ss_conf 05---545863411111244147751120253425777507886013444124530203567761576548999999999 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 78887998876787 Q gi|254780617|r 389 AVKGELAQYREMSS 402 (509) Q Consensus 389 ~lr~~laqy~Ele~ 402 (509) .+|..+..-.+.++ T Consensus 384 ~lRr~l~~md~~~a 397 (422) T COG1158 384 VLRRILSPMDEIDA 397 (422) T ss_pred HHHHHHCCCCHHHH T ss_conf 99997367874899 No 62 >cd01120 RecA-like_NTPases RecA-like NTPases. This family includes the NTP binding domain of F1 and V1 H+ATPases, DnaB and related helicases as well as bacterial RecA and related eukaryotic and archaeal recombinases. This group also includes bacterial conjugation proteins and related DNA transfer proteins involved in type II and type IV secretion. Probab=99.94 E-value=2.6e-25 Score=202.76 Aligned_cols=160 Identities=32% Similarity=0.440 Sum_probs=148.4 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHH Q ss_conf 65524778861158999999988630246678760699995167767899999851146742115899728878889999 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQL 244 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~ 244 (509) ++|+|++|+|||+|+++.+.|+... +..|+|+.++|+.+++.+++..+.....++++.++.+++++++.+++ T Consensus 2 ~li~g~~g~GKttl~~~~~~~~~~~--------~~~~~~~~~ee~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (165) T cd01120 2 ILVFGPTGSGKTTLALQLALNIATK--------GGKVVYVDIEEEIEELTERLIGESLKGALDNLIIVFATADDPAAARL 73 (165) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHC--------CCEEEEEECCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHH T ss_conf 8999899998999999999998763--------99799998666448999999986224671307999359997699999 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 98876666667542046884799616478999998754530378754333752001246788887504544588735675 Q gi|254780617|r 245 LAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTAL 324 (509) Q Consensus 245 ~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~ 324 (509) + +++++|++++++||+++||+|+|++|+||+ ++|||+.+++.+++|+|||++ |++|.+ T Consensus 74 ~------~~~~~~~~~~~~vliiiDSit~~~~a~~e~---------~~g~~~~v~~~~~~L~~~Ak~-------~~itvi 131 (165) T cd01120 74 L------SKAERLRERGGDDLIILDELTRLVRALREI---------REGYPGELDEELRELLERARK-------GGVTVI 131 (165) T ss_pred H------HHHHHHHHCCCCEEEEEECHHHHHHHHHHC---------CCCCHHHHHHHHHHHHHHHHC-------CCCEEE T ss_conf 9------999999986997799992889988774001---------588678999999999999977-------982899 Q ss_pred EEEECCCCCCCCC----HHHHHHHHHCCEEEEEC Q ss_conf 3100367664431----14668854061699615 Q gi|254780617|r 325 PVIETQVNDVSAY----IPTNVISITDGQIFLET 354 (509) Q Consensus 325 ~~v~~~~~D~~~~----i~~~~~si~DG~i~l~~ 354 (509) ++..+..||.++| +++++.+++|+||+|+| T Consensus 132 ~i~~v~~d~~~~~~~~~g~~~l~~~~d~~i~L~r 165 (165) T cd01120 132 FTLQVPSGDKGDPRLTRGAQNLEDIADTVIVLSR 165 (165) T ss_pred EEEEEECCCCCCCCCCCHHHHHHHHCCEEEEEEC T ss_conf 9998433778997725388888364266999839 No 63 >TIGR00767 rho transcription termination factor Rho; InterPro: IPR004665 Members of this family have a related but highly variable long, highly charged insert near the amino end. The proteins differ in the specificity of RNA binding.; GO: 0003715 transcription termination factor activity, 0005524 ATP binding, 0006353 transcription termination. Probab=99.73 E-value=2.2e-16 Score=138.07 Aligned_cols=303 Identities=18% Similarity=0.237 Sum_probs=217.9 Q ss_pred CCCCCCCCCEEEECCCEEEEECCHHH--HHCCCCHHHCCCCCCCCCCCCCCCHHCCCCC-CCCCCCCC----HHHHHCCH Q ss_conf 86686789888977987886679469--8300030023144664445444100101357-73112660----01444160 Q gi|254780617|r 78 SYKEISEGDIVKRTGRIVDVPVGLEL--LGRVVDALGNPIDGKGPIKCEQRSCTEADAP-GIIQRQSV----CEPLSTGI 150 (509) Q Consensus 78 ~~~gI~~G~~V~~tg~~~~vpvG~~l--LGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p-~~~~R~~i----~~~l~TGI 150 (509) ...+++.|+.+...-+. -.-|+.+ +=++-. +.+..+-....+.+.+.-.| .|..|-.. .++-...- T Consensus 89 ~~~~~~~gd~~~g~~~~--p~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~l~pl~p~~~~~~~~~~~~~~~~~~ 161 (420) T TIGR00767 89 RRFGLRTGDTVEGKIRS--PKEGEKYFALLKVES-----VNGDDPDKAKERPLFDNLTPLYPNERLKLERGNTDPEDLST 161 (420) T ss_pred HHHCCCCCCEEEEEECC--CCCCCHHHHHEEEEC-----CCCCCCHHHHHHCCCCCCCCCCCCHHHEECCCCCCCHHHHH T ss_conf 65204457513311017--632211211001100-----26765012332023333454454102100146775114677 Q ss_pred HHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEE Q ss_conf 23310111156833655247788611589999999886302466787606999951677678999998511467421158 Q gi|254780617|r 151 KAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSI 230 (509) Q Consensus 151 ~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tv 230 (509) +++|.+.|+|+|||.+|++++.+|||.++. -+. +....+..++..+..++.+|..++.++....+.. T Consensus 162 ~~~~~~~p~g~g~~~l~~~pp~~g~~~~~~-~~~-----~~~~~~~p~~~l~~~l~d~~p~~~~~~~~~~~g~------- 228 (420) T TIGR00767 162 RVLDLFAPIGKGQRGLIVAPPKAGKTVLLQ-KIA-----NAITRNHPEVELIVLLIDERPEEVTDLQRSVKGE------- 228 (420) T ss_pred HHHHHHHCCCCCCCEEEECCCCCCHHHHHH-HHH-----HHHHCCCCCEEEEEEEECCCCHHHHHHHHHHHHH------- T ss_conf 888765103677530365177544268888-888-----7763268743799886336622567777764212------- Q ss_pred EEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 99728878889999988766666675420468847996164789999987545303787543337520012467888875 Q gi|254780617|r 231 VVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAA 310 (509) Q Consensus 231 vv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~ 310 (509) ++.++.|+++......+...+.-|....++++||++++|++||.++||..+.-..|+.-. .|.....++--.+++..+- T Consensus 229 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~d~~~~~~~~~~~~~p~~g~~l~-gg~~~~~~~~p~~~~g~~~ 307 (420) T TIGR00767 229 VVASTFDEPPSRHVQVAELVLEKAKRLVEHKKDVVILLDSITRLARAYNTVTPPSGKVLS-GGVDANALHRPKRFFGAAR 307 (420) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHCCCCCCCCEEE-CCCCHHHHCCCHHHHHHHC T ss_conf 321001344101246889999988887632563477402356766654103666663221-5645012014145530010 Q ss_pred HCCCCCCCCCEEEEEEEECCCC-CCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHH Q ss_conf 0454458873567531003676-644311466885406169961501221026654334441220343438988775567 Q gi|254780617|r 311 KMSDALGAGSLTALPVIETQVN-DVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGA 389 (509) Q Consensus 311 ~~~~~~g~GSiT~~~~v~~~~~-D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~ 389 (509) . ...|||+|.+.+..++.+ .+.+.+-....+.-.-.+.|++.++.+.+|||+|+..|-+|--.-...+.+..-... T Consensus 308 ~---~~~gg~l~~~~~~~~~~g~~~d~~~~~~~~g~g~~~~~l~~~~~~~~~~p~~~~~~~g~~~~~~l~~~~~~~~~~~ 384 (420) T TIGR00767 308 N---IEEGGSLTIIATALVDTGSRMDEVIFEEFKGTGNLELHLDRKLADRRLFPAIDIKKSGTRKEELLLSPEELQKLWL 384 (420) T ss_pred C---CCCCCCEEEEEHHHHHCCCCHHHHHHHHHCCCCCCEEEEHHHHHHHHCCCCCCCCCCCCCHHHHCCCHHHHHHHHH T ss_conf 2---2466522221001231365101344443227775011100133321102210123456521232037788889999 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 888799887678798 Q gi|254780617|r 390 VKGELAQYREMSSFS 404 (509) Q Consensus 390 lr~~laqy~Ele~f~ 404 (509) +|..+....+.+.+. T Consensus 385 ~~~~~~~~~~~~~~~ 399 (420) T TIGR00767 385 LRKVLSPLDDVEALE 399 (420) T ss_pred HHHHHCCCCHHHHHH T ss_conf 988740445057899 No 64 >pfam00306 ATP-synt_ab_C ATP synthase alpha/beta chain, C terminal domain. Probab=99.43 E-value=2e-12 Score=109.29 Aligned_cols=89 Identities=34% Similarity=0.532 Sum_probs=79.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HCC-CCHHHHHHHHHHHHHHHHHCCCCCCCC-------CHHHHHHHHHHHHCCCCC Q ss_conf 77556788879988767879876-338-998999999999999999669998874-------889999999998669806 Q gi|254780617|r 384 KQVSGAVKGELAQYREMSSFSKF-SSD-LDSSTQKFLSKGERLTELLKQPQFSPL-------AMEEQVVMIFAGISGCLD 454 (509) Q Consensus 384 ~~~a~~lr~~laqy~Ele~f~~f-gsd-lD~~t~~~l~rg~ri~e~LkQ~~~~p~-------~~~~qv~~l~a~~~G~ld 454 (509) +++++++|+.|+||+||++|++| |++ ||+.++.+++||+++++||+|++|+|+ ++++++..++++.+|.+| T Consensus 1 ~~v~~~~~~~Laqy~eL~~~~~~~G~d~L~~~~k~~l~~~~~~~~~L~Q~~~~p~~~~~q~v~~~~~~~~~~~i~~g~~d 80 (110) T pfam00306 1 KQVAGQLKLELAQYRELQAIVQLVGEDALSEEDKLTLERARRIEEFLKQNQYSPEPVEKQYVPVEETIDLFYALLRGKFD 80 (110) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEHHHHHHHHHHHHCCCCC T ss_conf 92479999999989999999987183104899999999999999998274468877787542399999999999878777 Q ss_pred CCCHHHHHHHHHHHHHHH Q ss_conf 679899999999999999 Q gi|254780617|r 455 EVAVSQVRKFETNFLSHM 472 (509) Q Consensus 455 ~i~v~~I~~fe~~l~~~l 472 (509) ++|.+....+.....++. T Consensus 81 ~~p~~~~~~ig~~~~~~~ 98 (110) T pfam00306 81 DLPEDALYRIGTIDLAKY 98 (110) T ss_pred CCCHHHHHHCCCHHHHHH T ss_conf 788999886286999999 No 65 >pfam06745 KaiC KaiC. This family represents a conserved region within bacterial and archaeal proteins, most of which are hypothetical. More than one copy is sometimes found in each protein. This family includes KaiC, which is one of the Kai proteins among which direct protein-protein association may be a critical process in the generation of circadian rhythms in cyanobacteria. Probab=98.53 E-value=1.4e-06 Score=66.92 Aligned_cols=189 Identities=20% Similarity=0.172 Sum_probs=104.0 Q ss_pred HHCCHHHHHCCCC--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH--- Q ss_conf 4416023310111--15683365524778861158999999988630246678760699995167767899999851--- Q gi|254780617|r 146 LSTGIKAIDSLIP--IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL--- 220 (509) Q Consensus 146 l~TGI~~ID~l~p--igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l--- 220 (509) +.|||..+|.++. +-+|.=.+|.|++|+|||++++..+.+.+... .--|+|+...|...++.+-.+.+ T Consensus 1 i~TGi~~LD~~l~GGi~~gs~~LI~G~pGsGKT~la~qfl~~ga~~~-------ge~~lYis~ee~~~~l~~~~~~~g~~ 73 (231) T pfam06745 1 VKTGIPGLDEILKGGIPEGRVVLITGGPGTGKTIFGLQFLYNGALEY-------GEPGVYVTLEEPPEDLRENAKSFGWD 73 (231) T ss_pred CCCCCHHHHHHCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHC-------CCCEEEEEECCCHHHHHHHHHHCCCC T ss_conf 98895568863169982996999985897259999999999999865-------89689998137999999999982998 Q ss_pred ----CCCCCCCEEEEEEECCCCCHHHHHH----HHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCC Q ss_conf ----1467421158997288788899999----88766666675420468847996164789999987545303787543 Q gi|254780617|r 221 ----EDRGALSYSIVVVASASDPAPMQLL----APFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGRE 292 (509) Q Consensus 221 ----~~~~~~~~tvvv~a~a~~~~~~r~~----ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~ 292 (509) .+.+. -.++-...+........ ..-..-.+.+...+.+-+ ++++||+|-+...+.+ . T Consensus 74 ~~~~~~~g~---l~i~d~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~-~vVIDsit~l~~~~~~---------~-- 138 (231) T pfam06745 74 LEKLEEEGK---LAIIDASTSGIGIAEVKGRFDLEELIERLREAIREIGAK-RVVIDSITTLFYLLKP---------A-- 138 (231) T ss_pred HHHHHHCCC---EEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC-EEEEECCHHHHHCCCH---------H-- T ss_conf 589864696---789862544222100112279999999999999971998-8999764164005889---------9-- Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCC Q ss_conf 337520012467888875045445887356753100367664431146688540616996150122102665433 Q gi|254780617|r 293 AYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINI 367 (509) Q Consensus 293 gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~ 367 (509) .+...+..|....- +. -+|.+.+.++..++-...-+...-++.||-|.|+..-.+....=.+.+ T Consensus 139 ----~~r~~l~~l~~~lk---~~----g~t~l~t~e~~~~~~~~~~~~~~~~l~D~vI~L~~~~~~~~~~R~l~I 202 (231) T pfam06745 139 ----MAREILRRLKRVLK---KL----GVTAIFTSEKPSGEGGIGGYGVEEFVVDGVIRLDLKEIEGELVRTIEI 202 (231) T ss_pred ----HHHHHHHHHHHHHH---HC----CCEEEEEEEECCCCCCCCCCCHHHHEEEEEEEEEEECCCCEEEEEEEE T ss_conf ----99999999999999---76----991999982125777656763011100369999988249989999999 No 66 >PRK09302 circadian clock protein KaiC; Reviewed Probab=98.52 E-value=6.8e-06 Score=61.87 Aligned_cols=194 Identities=20% Similarity=0.186 Sum_probs=117.8 Q ss_pred CCCCCHHHHHCCHHHHHCCCC--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHH Q ss_conf 126600144416023310111--156833655247788611589999999886302466787606999951677678999 Q gi|254780617|r 138 QRQSVCEPLSTGIKAIDSLIP--IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVAR 215 (509) Q Consensus 138 ~R~~i~~~l~TGI~~ID~l~p--igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~ 215 (509) .....++.+.|||+-.|.|+- +-+|--.+|.|++|+|||++++.-+.+-+.. .-.|+|+.--|+...+.+ T Consensus 240 ~~~~~~~rv~tGi~~LD~llgGGl~~GsstLi~Gp~GtGKTtla~qFl~~~a~~--------GE~~l~~~FeE~~~~l~~ 311 (501) T PRK09302 240 TQRSSNERISSGVPDLDEMCGGGFFRGSIILVSGATGTGKTLLVSKFAEAACRR--------GERCLLFAFEESRAQLVR 311 (501) T ss_pred CCCCCCCCCCCCCHHHHHHHCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHC--------CCCEEEEEEECCHHHHHH T ss_conf 566652000479732788725997589469998899988899999999999865--------990899999679999999 Q ss_pred HHHHH-------CCCCCCCEEEEE-EECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCC Q ss_conf 99851-------146742115899-7288788899999887666666754204688479961647899999875453037 Q gi|254780617|r 216 FVKAL-------EDRGALSYSIVV-VASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRR 287 (509) Q Consensus 216 ~~~~l-------~~~~~~~~tvvv-~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~ 287 (509) -.+.+ .+.|.+ .++ ..+..-++.+.. ..+=+--.+.+-+ .|++|+++-+..+..+-. T Consensus 312 ~a~~~G~dl~~~~~~G~l---~i~~~~p~~~~~~e~~------~~i~~~v~~~~~~-rVvIDsls~~~~~~~~~~----- 376 (501) T PRK09302 312 NATSWGIDLEEMERKGLL---KIICARPESTGLEDHL------QIIKREIEEFKPS-RVAVDPLSALARGGSLNE----- 376 (501) T ss_pred HHHHCCCCHHHHHHCCCE---EEEEECCCCCCHHHHH------HHHHHHHHHCCCC-EEEECCHHHHHHHCCHHH----- T ss_conf 999739984888748947---9998370005989999------9999999972998-999958068765268599----- Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCC-CCCHHHHHHHHHCCEEEEECCHHHHCCCCCCC Q ss_conf 87543337520012467888875045445887356753100367664-43114668854061699615012210266543 Q gi|254780617|r 288 PPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDV-SAYIPTNVISITDGQIFLETELFYQGIRPAIN 366 (509) Q Consensus 288 pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~-~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid 366 (509) |-.-++ +|+. .++. -.+|.+.+.+++..-- ..+-+.++-++.|+-|+|.---.+...+=+|. T Consensus 377 ------~r~~l~----~L~~---~Lk~----~gvT~l~t~~~~~~~g~~~~t~~~iS~l~D~ii~Lry~E~~g~l~R~i~ 439 (501) T PRK09302 377 ------FRQFVI----RLTD---YLKQ----EEITGLFTNLTPDFMGSHSITESHISSLTDTWILLQYVEINGEMNRALN 439 (501) T ss_pred ------HHHHHH----HHHH---HHHH----CCCEEEEEEECCCCCCCCCCCCCCHHHHCCEEEEEEEEEECCEEEEEEE T ss_conf ------999999----9999---9976----8978999761235567664476660011124789998713899989899 Q ss_pred CCCCH Q ss_conf 34441 Q gi|254780617|r 367 IGLSV 371 (509) Q Consensus 367 ~~~S~ 371 (509) |+++- T Consensus 440 VlK~R 444 (501) T PRK09302 440 VLKMR 444 (501) T ss_pred EEEEC T ss_conf 99936 No 67 >pfam02874 ATP-synt_ab_N ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella. Probab=98.48 E-value=5.4e-07 Score=69.82 Aligned_cols=68 Identities=44% Similarity=0.655 Sum_probs=64.1 Q ss_pred EEEEEEEEEEECCEEEEECCCCCCCCCEEEECCCEEEEEEEECCCEEEEEEECCCCCCCCCCEEEECC Q ss_conf 01035899982758999758887737479957981899992059859999805866867898889779 Q gi|254780617|r 25 FSEIGRVLSIGDGIARVYGLNNIRAGEMVQFSHGVYGMALNLEVDNVGVVILGSYKEISEGDIVKRTG 92 (509) Q Consensus 25 ~~~~G~V~~V~~giv~v~GL~~a~~GElv~~~~g~~G~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg 92 (509) .+..|.|+.|.++...+.|+.++...+++.|++...|+|++|+++.|++++|++++||++|++|++|| T Consensus 2 ~~v~G~Vv~v~~~~~~~~~i~~~l~~~~~~~~~~~~~~v~~l~~~~V~~i~l~~t~Gl~~G~~V~~TG 69 (69) T pfam02874 2 VQVIGPVVDVEFGIGRLPGLYNALEVELVEFGNGLLGEVLNLGGDKVRVVVMGGTDGLSRGDEVKRTG 69 (69) T ss_pred EEEEEEEEEEECCCCCCCCCCCCEEEEEECCCCCEEEEEEECCCCEEEEEEECCCCCCCCCCEEECCC T ss_conf 79998999969695237885001597300378837998752489839999976867753899998295 No 68 >PRK06067 flagellar accessory protein FlaH; Validated Probab=98.42 E-value=4.3e-06 Score=63.29 Aligned_cols=190 Identities=19% Similarity=0.136 Sum_probs=102.6 Q ss_pred HHHHHCCHHHHHCCC--CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHH- Q ss_conf 014441602331011--11568336552477886115899999998863024667876069999516776789999985- Q gi|254780617|r 143 CEPLSTGIKAIDSLI--PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKA- 219 (509) Q Consensus 143 ~~~l~TGI~~ID~l~--pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~- 219 (509) +|.+.|||..+|.++ =+-+|.-.+|.|++|+|||++++..+.+.... ...|+|+..-|..++..+-.+. T Consensus 11 ~e~i~tGi~~lD~~l~GG~p~g~~~li~G~~G~GKt~~~~~f~~~~~~~--------g~~~~~~~~ee~~~~~~~~~~~~ 82 (241) T PRK06067 11 KEIISTGNEEIDRKLGGGIPFGSLILIEGENDTGKSVLSQQFVWGALNQ--------GKRGLAITTENTSKSYLKQMESL 82 (241) T ss_pred CCCCCCCCHHHHHHCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHC--------CCEEEEEEECCCHHHHHHHHHHC T ss_conf 4525668755786506997799089998079988799999999999867--------98299999428999999999983 Q ss_pred ------HCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf ------11467421158997288788899999887666666754204688479961647899999875453037875433 Q gi|254780617|r 220 ------LEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREA 293 (509) Q Consensus 220 ------l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~g 293 (509) +...|.+.-.-+.....+ .... .....--.+.+.-...+.++ +++||+|-++....+- T Consensus 83 g~dl~~~~~~G~L~i~~~~~~~~~--~~~~-~~~~ll~~l~~~v~~~~~~~-vVIDSls~l~~~~~~~------------ 146 (241) T PRK06067 83 KLDISDFFIWGYLRIFPLNTEGFE--WNSE-LAEKLLDLIIEFIKRRREEV-IIIDSLTIFATYASED------------ 146 (241) T ss_pred CCCHHHHHHCCCCEEEECCCCCCC--CCHH-HHHHHHHHHHHHHHHHCCCE-EEECCHHHHHCCCCHH------------ T ss_conf 998599986697057832411134--2155-68999999999999719989-9992801754138889------------ Q ss_pred CCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCC Q ss_conf 3752001246788887504544588735675310036766443114668854061699615012210266543344 Q gi|254780617|r 294 YPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGL 369 (509) Q Consensus 294 yp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~ 369 (509) ++. +++.....+... | .|.+.++... -+.......+.++.||-|.|+..-.+....=.+.|.+ T Consensus 147 ---~~~----~~l~~l~~l~~~---g-~tvllt~~~~--~~~~~~~~~i~~vaD~vI~Lr~~~~~g~~~R~L~V~K 209 (241) T PRK06067 147 ---DVL----NFFTECKNLCDN---G-KTILITLHPY--AFSEDTLVRIRSICDVHLKLRKEQVGDRYVKVLEVVK 209 (241) T ss_pred ---HHH----HHHHHHHHHHHC---C-CEEEEEECCC--CCCCCCCCCEEEEEEEEEEEEEEEECCEEEEEEEEEE T ss_conf ---999----999999999968---9-8899990567--6476643124899899999587843999999999999 No 69 >PRK04328 hypothetical protein; Provisional Probab=98.36 E-value=9.5e-06 Score=60.79 Aligned_cols=186 Identities=19% Similarity=0.238 Sum_probs=105.1 Q ss_pred HHHHCCHHHHHCCCC--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH- Q ss_conf 144416023310111--15683365524778861158999999988630246678760699995167767899999851- Q gi|254780617|r 144 EPLSTGIKAIDSLIP--IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL- 220 (509) Q Consensus 144 ~~l~TGI~~ID~l~p--igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l- 220 (509) |.+.|||+.+|.++. +-+|.-.+|.|++|+|||++++..+.+.+..+ --|+|+..-|...++.+-.+.+ T Consensus 4 eRv~TGI~gLD~lL~GGlp~gs~~Lv~G~pGtGKT~la~qFl~~g~~~G--------E~~lyis~eE~~~~l~~~~~~~G 75 (250) T PRK04328 4 KRVKTGIPGMDEILYGGIPERNVVLLSGGPGTGKSIFSQQFLWNGLQMG--------EPGIYVALEEHPVQVRRNMAQFG 75 (250) T ss_pred CEECCCCHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHCC--------CCEEEEEEECCHHHHHHHHHHCC T ss_conf 1103585667875159987996999982899998999999999998769--------97799997279999999999809 Q ss_pred ------CCCCCCCEEEEEEECCC----CCHHHHHHHH------HHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf ------14674211589972887----8889999988------7666666754204688479961647899999875453 Q gi|254780617|r 221 ------EDRGALSYSIVVVASAS----DPAPMQLLAP------FAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLL 284 (509) Q Consensus 221 ------~~~~~~~~tvvv~a~a~----~~~~~r~~ap------~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~ 284 (509) .+.|.+ .++-+.+. .....++... ...-.+.+..++.+-+ .+++||+|.+.. T Consensus 76 ~d~~~~~~~g~l---~iid~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~-rvVIDSit~l~~-------- 143 (250) T PRK04328 76 WDVRKYEEEGKF---AIVDAFTGGIGEAAEREKYVVKDPDDVRELIDVLRQAIRDIGAK-RVVIDSVSTLYL-------- 143 (250) T ss_pred CCHHHHHHCCCE---EEEECCCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHCCC-EEEEECCHHHHH-------- T ss_conf 986898656977---99851233334200000101368535999999999999851898-899937078774-------- Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCC--CCCCHHHHHHHHHCCEEEEECCHHHHCCC Q ss_conf 0378754333752001246788887504544588735675310036766--44311466885406169961501221026 Q gi|254780617|r 285 LRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVND--VSAYIPTNVISITDGQIFLETELFYQGIR 362 (509) Q Consensus 285 ~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D--~~~~i~~~~~si~DG~i~l~~~l~~~g~~ 362 (509) ..|--.+ +..+.+.+.+-+.+ +|.+.+.++..++ ++.| .+-++.||-|.|...-.+.... T Consensus 144 -~~~~~~r----~~l~~l~~~l~~~g----------~Ttll~~e~~~~~~~~~~~---~ve~~~DgvI~L~~~~~~~~~~ 205 (250) T PRK04328 144 -TKPAVAR----SIVMQLKRVLAGLG----------CTSIFVSQVSVGERGFGGP---GVEHAVDGIIRLDLDEIDGELK 205 (250) T ss_pred -CCHHHHH----HHHHHHHHHHHHCC----------CEEEEEECCCCCCCCCCCC---CEEEEEEEEEEEEEECCCCEEE T ss_conf -5858899----99999999998689----------8699997100365666778---6589998999988871399799 Q ss_pred CCCCC Q ss_conf 65433 Q gi|254780617|r 363 PAINI 367 (509) Q Consensus 363 Paid~ 367 (509) -.+.| T Consensus 206 R~l~I 210 (250) T PRK04328 206 RSLIV 210 (250) T ss_pred EEEEE T ss_conf 99999 No 70 >PRK09302 circadian clock protein KaiC; Reviewed Probab=98.32 E-value=2.1e-05 Score=58.37 Aligned_cols=70 Identities=27% Similarity=0.405 Sum_probs=56.5 Q ss_pred HHHHCCHHHHHCCC--CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 14441602331011--115683365524778861158999999988630246678760699995167767899999851 Q gi|254780617|r 144 EPLSTGIKAIDSLI--PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL 220 (509) Q Consensus 144 ~~l~TGI~~ID~l~--pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l 220 (509) +.+.|||.-+|.++ =+-+|.-.+|-|++|+|||+++++.+.|.+... .--|+|+...|...++.+..+.+ T Consensus 4 eRi~TGI~GLD~iL~GGlp~g~~~LV~G~pGsGKTtla~QfL~~Ga~~~-------GE~~lyitl~E~~~~l~~~~~~~ 75 (501) T PRK09302 4 EKLPTGIEGFDDITNGGLPKGRPTLVSGTAGTGKTLFALQFLHNGIKSF-------GEPGVFVTFEESPEDIIRNVASF 75 (501) T ss_pred CCCCCCCCCHHHHHCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHC-------CCCEEEEECCCCHHHHHHHHHHC T ss_conf 4214487115778548988997799983899999999999999998855-------99789998579999999999984 No 71 >COG0467 RAD55 RecA-superfamily ATPases implicated in signal transduction [Signal transduction mechanisms] Probab=98.30 E-value=1.3e-05 Score=59.89 Aligned_cols=189 Identities=21% Similarity=0.257 Sum_probs=113.1 Q ss_pred HHHHCCHHHHHCCCC--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH- Q ss_conf 144416023310111--15683365524778861158999999988630246678760699995167767899999851- Q gi|254780617|r 144 EPLSTGIKAIDSLIP--IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL- 220 (509) Q Consensus 144 ~~l~TGI~~ID~l~p--igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l- 220 (509) +.+.|||.-+|-++- +-+|.-.+|.|++|+|||++++..+.+++.. ...|+|+..-|...++.+-...+ T Consensus 3 ~~~~TGI~glD~~l~GG~p~g~~~lI~G~pGsGKT~f~~qfl~~~~~~--------ge~vlyvs~~e~~~~l~~~~~~~g 74 (260) T COG0467 3 ERIPTGIPGLDEILGGGLPRGSVVLITGPPGTGKTIFALQFLYEGARE--------GEPVLYVSTEESPEELLENARSFG 74 (260) T ss_pred CCCCCCCCCHHHHHCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHC--------CCEEEEEEEECCHHHHHHHHHHCC T ss_conf 656899741888836898899789999389986899999999977626--------985899992069899999998809 Q ss_pred ------CCCCCCCEEEEEEECCCCCHHHH------HHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCC Q ss_conf ------14674211589972887888999------998876666667542046884799616478999998754530378 Q gi|254780617|r 221 ------EDRGALSYSIVVVASASDPAPMQ------LLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRP 288 (509) Q Consensus 221 ------~~~~~~~~tvvv~a~a~~~~~~r------~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~p 288 (509) .+.+. -.++-+......... .......-.+-+.+.+.+ ..-+++||+|.+....-. T Consensus 75 ~d~~~~~~~g~---l~i~d~~~~~~~~~~~~~~~~~~~~~l~~~I~~~~~~~~-~~~~ViDsi~~~~~~~~~-------- 142 (260) T COG0467 75 WDLEVYIEKGK---LAILDAFLSEKGLVSIVVGDPLDLEELLDRIREIVEKEG-ADRVVIDSITELTLYLND-------- 142 (260) T ss_pred CCHHHHHHCCE---EEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC-CCEEEEECCHHHHHHCCC-------- T ss_conf 97789754440---687631211125420104665228999999999998628-988999663077665278-------- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCC-CCCC Q ss_conf 754333752001246788887504544588735675310036766443114668854061699615012210266-5433 Q gi|254780617|r 289 PGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRP-AINI 367 (509) Q Consensus 289 p~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~P-aid~ 367 (509) |.. . -+.+.+.-+.... .| .|++.+...+....... ...-++.||.|.|+......+.+- .+.+ T Consensus 143 ~~~------~----r~~~~~l~~~~~~--~~-~t~~~~~~~~~~~~~~~--~~~~~~vdgvI~l~~~~~~~~~~r~~~~i 207 (260) T COG0467 143 PAL------V----RRILLLLKRFLKK--LG-VTSLLTTEAPVEERGES--GVEEYIVDGVIRLDLKEIEGGGDRRYLRI 207 (260) T ss_pred HHH------H----HHHHHHHHHHHHH--CC-EEEEEEECCCCCCCCCC--CCEEEEEEEEEEEEEEECCCCEEEEEEEE T ss_conf 257------8----9999999987650--68-48999974433466656--61421689999997772257248889999 No 72 >cd01124 KaiC KaiC is a circadian clock protein primarily found in cyanobacteria KaiC is a RecA-like ATPase, having both Walker A and Walker B motifs. A related protein is found in archaea. Probab=98.00 E-value=0.00013 Score=52.54 Aligned_cols=162 Identities=17% Similarity=0.111 Sum_probs=84.1 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHC-------CCCCCCEEEEEEEC-C Q ss_conf 655247788611589999999886302466787606999951677678999998511-------46742115899728-8 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALE-------DRGALSYSIVVVAS-A 236 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~-------~~~~~~~tvvv~a~-a 236 (509) .+|-|++|+|||++++..+.+.+..+ .-|+|+..-|...++.+-.+.+. +.+. -.++-.. . T Consensus 2 tLi~G~pGsGKT~~a~qfl~~~a~~g--------e~~lyis~eE~~~~l~~~~~~~g~d~~~~~~~g~---l~i~~~~~~ 70 (187) T cd01124 2 TLLSGGPGTGKTTFALQFLYAGLARG--------EPGLYVTLEESPEELIENAESLGWDLERLEDEGL---LAIVDADPD 70 (187) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHCC--------CCEEEEEECCCHHHHHHHHHHCCCCHHHHHHCCC---EEEEECCCC T ss_conf 15876899999999999999998769--------9789999507999999999983998589864585---689962620 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC Q ss_conf 78889999988766666675420468847996164789999987545303787543337520012467888875045445 Q gi|254780617|r 237 SDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDAL 316 (509) Q Consensus 237 ~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~ 316 (509) ...+..........-.+.+.-...+-++ +|+||++-+.....+ ...-.++ +++.... .. T Consensus 71 ~~~~~~~~~~~~l~~~i~~~i~~~~~~~-vVIDSi~~l~~~~~~------------~~~~~~~----~l~~~l~---~~- 129 (187) T cd01124 71 EIGPAESSLRLELIQRLKDAIEEFKAKR-VVIDSVSGLLLMEQS------------TARLEIR----RLLFALK---RF- 129 (187) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCCCE-EEECCHHHHHHCCHH------------HHHHHHH----HHHHHHH---HC- T ss_conf 0220333236789999999999849989-999486887525666------------6899999----9999999---76- Q ss_pred CCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCC Q ss_conf 887356753100367664431146688540616996150122102 Q gi|254780617|r 317 GAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGI 361 (509) Q Consensus 317 g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~ 361 (509) -+|.+.+.+...++-....+...-++.||-|.|...--+... T Consensus 130 ---~~t~ll~~e~~~~~~~~~~~~~~~~l~DgiI~L~~~~~~~~~ 171 (187) T cd01124 130 ---GVTTLLTSEQSGLEGTGFGGGDVEYLVDGVIRLRLDEEGGRL 171 (187) T ss_pred ---CCCEEEEEEECCCCCCCCCCCCEEEEEEEEEEEEEEECCCEE T ss_conf ---996899997425667667888477997899999788049979 No 73 >smart00382 AAA ATPases associated with a variety of cellular activities. AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment. Probab=97.82 E-value=0.00017 Score=51.65 Aligned_cols=146 Identities=18% Similarity=0.098 Sum_probs=83.1 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCH Q ss_conf 68336552477886115899999998863024667876069999516776789999985114674211589972887888 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPA 240 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~ 240 (509) +++.++|+|++|+|||+++...+ ++..... ..++|+... ...+....... ......+... T Consensus 1 ~~~~ill~G~~GsGKTtl~~~la-~~~~~~~-------~~v~~~~~~-~~~~~~~~~~~-----------~~~~~~~~~~ 60 (148) T smart00382 1 PGEVILIVGPPGSGKTTLARALA-RELGPPG-------GGVIYIDGE-DILEEVLDQLL-----------LIIVGGKKAS 60 (148) T ss_pred CCCEEEEECCCCHHHHHHHHHHH-HHHCCCC-------CCEEEEEHH-HHHHHHHHHHH-----------HHHHCCCCCC T ss_conf 99789999999702999999999-8726689-------968998759-98988898765-----------3000112210 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 99999887666666754204688479961647899999875453037875433375200124678888750454458873 Q gi|254780617|r 241 PMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGS 320 (509) Q Consensus 241 ~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GS 320 (509) ..+...+......|++.. ..++++|++.++.............. .+.. ... .. T Consensus 61 ~~~~~~~~~~~~~~~~~~----~~viiiDei~~~~~~~~~~~~~~~~~------------------~~~~--~~~---~~ 113 (148) T smart00382 61 GSGELRLRLALALARKLK----PDVLILDEITSLLDAEQEALLLLLEE------------------LRLL--LLL---KS 113 (148) T ss_pred CCHHHHHHHHHHHHHHCC----CCEEEEECCHHHCCCCCHHHHHHHHH------------------HHHH--CCC---CC T ss_conf 519999999999998449----98999827502147620799999999------------------9985--176---57 Q ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEEC Q ss_conf 5675310036766443114668854061699615 Q gi|254780617|r 321 LTALPVIETQVNDVSAYIPTNVISITDGQIFLET 354 (509) Q Consensus 321 iT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~ 354 (509) .+.+.++.+-.. ...+++.+.+...|.++.+.+ T Consensus 114 ~~~~~vi~~~n~-~~~~~~~~~~~~~~~~~~~~~ 146 (148) T smart00382 114 EKNLTVILTTND-EKDLGPALLRRRFDRRIVLLL 146 (148) T ss_pred CCCEEEEEEECC-CCCCCHHHHCCCCCEEEEEEC T ss_conf 899899995699-522498770744787999828 No 74 >PRK09361 radB DNA repair and recombination protein RadB; Provisional Probab=97.78 E-value=0.00017 Score=51.66 Aligned_cols=117 Identities=21% Similarity=0.254 Sum_probs=66.7 Q ss_pred CHHHHHCCHHHHHCCCC--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCC--CHHHHHHHH Q ss_conf 00144416023310111--15683365524778861158999999988630246678760699995167--767899999 Q gi|254780617|r 142 VCEPLSTGIKAIDSLIP--IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQ--KRSSVARFV 217 (509) Q Consensus 142 i~~~l~TGI~~ID~l~p--igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGe--r~~ev~~~~ 217 (509) +.|.+.||+..+|.++= +-+|+=..|.|++|+|||+||+....+-...+ -.|+|+-- | +.....++. T Consensus 1 ~~erisTG~~~lD~~LgGGi~~G~itei~G~pG~GKTtl~lq~a~~~~~~g--------~~vlYidt-E~~~~er~~qi~ 71 (224) T PRK09361 1 IEELLPTGCKSLDELLGGGIERGTITQIYGPPGSGKTNICIQLAVEAARQG--------KKVIYIDT-EGLSPERFKQIA 71 (224) T ss_pred CCCEECCCCHHHHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCC--------CCEEEECC-CCCCHHHHHHHH T ss_conf 985303797899986269988887999989999859999999999999749--------90999678-767889999985 Q ss_pred HHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHH Q ss_conf 85114674211589972887888999998876666667542046884799616478999 Q gi|254780617|r 218 KALEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAV 276 (509) Q Consensus 218 ~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~ 276 (509) .. .....+++- .+..+.+-..... .+..++.+..+ +--|+++||++.+.. T Consensus 72 ~~-~~~~~~~~i-~~~~~~~~~~~~~------~i~~~~~~~~~-~~~lvVIDSi~~~~~ 121 (224) T PRK09361 72 GE-DFEELLSNI-IIFEPSSFEEQRE------AIQKAEKIAKE-NVGLIVLDSATSLYR 121 (224) T ss_pred CC-CHHHHHHCC-EEECCCCHHHHHH------HHHHHHHHHHC-CCCEEEEECCHHHEE T ss_conf 65-734542061-4724798899999------99999987505-873899962301000 No 75 >cd01123 Rad51_DMC1_radA Rad51_DMC1_radA,B. This group of recombinases includes the eukaryotic proteins RAD51, RAD55/57 and the meiosis-specific protein DMC1, and the archaeal proteins radA and radB. They are closely related to the bacterial RecA group. Rad51 proteins catalyze a similiar recombination reaction as RecA, using ATP-dependent DNA binding activity and a DNA-dependent ATPase. However, this reaction is less efficient and requires accessory proteins such as RAD55/57 . Probab=97.78 E-value=0.00043 Score=48.81 Aligned_cols=123 Identities=23% Similarity=0.289 Sum_probs=69.5 Q ss_pred HHCCHHHHHCCCC--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCC---HHHHHHHHHH- Q ss_conf 4416023310111--156833655247788611589999999886302466787606999951677---6789999985- Q gi|254780617|r 146 LSTGIKAIDSLIP--IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQK---RSSVARFVKA- 219 (509) Q Consensus 146 l~TGI~~ID~l~p--igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer---~~ev~~~~~~- 219 (509) +.||++.+|.++- +-.|+=..|.|++|+|||.+|+...+|-........ ..-.|||.-- |. .+.+.++.+. T Consensus 1 isTG~~~lD~~L~GGi~~G~itEi~G~~GsGKTql~lqla~~~~~~~~~~g--~~~~vvyIdt-E~~f~~~Rl~qia~~~ 77 (235) T cd01123 1 LTTGSKALDELLGGGIETGSITEIFGEFGSGKTQLCHQLAVTVQLPIELGG--LEGKAVYIDT-EGTFRPERLVQIAERF 77 (235) T ss_pred CCCCCHHHHHHCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHCC--CCCEEEEEEC-CCCCCHHHHHHHHHHH T ss_conf 988857899650799647879999999998499999999999842475367--8962999953-6775889999999971 Q ss_pred -HCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHH Q ss_conf -1146742115899728878889999988766666675420468847996164789999 Q gi|254780617|r 220 -LEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVA 277 (509) Q Consensus 220 -l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a 277 (509) ......+++..++-....+ .....+ ...-+.+..+..--|+|+||++-.... T Consensus 78 ~~~~~~~l~~i~~~~~~~~~-~l~~~l-----~~l~~~l~~~~~v~LvVIDSia~l~r~ 130 (235) T cd01123 78 GLDPEEVLDNIYVARAYNSD-HQLQLL-----EELEAILIESSRIKLVIVDSVTALFRA 130 (235) T ss_pred CCCHHHHHCCEEEECCCCHH-HHHHHH-----HHHHHHHHCCCCCEEEEEEECHHHHHH T ss_conf 34724542254796379999-999999-----999998730377239999610455566 No 76 >cd01394 radB RadB. The archaeal protein radB shares similarity radA, the archaeal functional homologue to the bacterial RecA. The precise function of radB is unclear. Probab=97.76 E-value=0.00069 Score=47.27 Aligned_cols=113 Identities=20% Similarity=0.240 Sum_probs=64.8 Q ss_pred HHCCHHHHHCCCC--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCC--CHHHHHHHHHHHC Q ss_conf 4416023310111--15683365524778861158999999988630246678760699995167--7678999998511 Q gi|254780617|r 146 LSTGIKAIDSLIP--IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQ--KRSSVARFVKALE 221 (509) Q Consensus 146 l~TGI~~ID~l~p--igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGe--r~~ev~~~~~~l~ 221 (509) +.||+..+|-++= +.+|.=.+|.|++|+|||+|++....|-...+ -.++|+- +| -...+.++...- T Consensus 1 IsTG~~~lD~vLgGGi~~G~it~i~G~pG~GKStl~lq~a~~~~~~g--------~~v~Yid-tE~~~~er~~qi~~~~- 70 (218) T cd01394 1 LPTGCKGLDELLGGGVERGTVTQVYGPPGTGKTNIAIQLAVETAGQG--------KKVAYID-TEGLSSERFRQIAGDR- 70 (218) T ss_pred CCCCCHHHHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCC--------CEEEEEE-CCCCCCHHHHHHHHHH- T ss_conf 99880789998569987887999989999849999999999986369--------8699996-6556769999998753- Q ss_pred CCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHH Q ss_conf 4674211589972887888999998876666667542046884799616478999 Q gi|254780617|r 222 DRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAV 276 (509) Q Consensus 222 ~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~ 276 (509) .+.+...+.+....+-..... .....+.+.++ +--|+++||++-+.. T Consensus 71 -~~~~~~~i~v~~~~~~~~~~~------~i~~~~~~~~~-~~~lvViDSi~tl~~ 117 (218) T cd01394 71 -PERAASSIIVFEPMDFNEQGR------AIQETETFADE-KVDLVVVDSATALYR 117 (218) T ss_pred -HHHHHCCCEEECCCCHHHHHH------HHHHHHHHHHC-CCCEEEEECCHHHHH T ss_conf -666530514626787688999------99999976414-772999914045545 No 77 >PRK08533 flagellar accessory protein FlaH; Reviewed Probab=97.54 E-value=0.0064 Score=40.25 Aligned_cols=178 Identities=16% Similarity=0.156 Sum_probs=99.4 Q ss_pred CCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHH Q ss_conf 11266001444160233101111568336552477886115899999998863024667876069999516776789999 Q gi|254780617|r 137 IQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARF 216 (509) Q Consensus 137 ~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~ 216 (509) ++|...++.|.-|+. +|.-.+|-|++|+|||.+++..+..-.. ...-|+|+..-+..++..+. T Consensus 8 ~~~d~ld~~lggGip---------~gs~~li~G~~GtGKsi~~~~~~~~~l~--------~g~~~~yis~e~t~~~~i~q 70 (230) T PRK08533 8 LDGDELHKRLGGGIP---------FGSIILIEGDESTGKSILSQRLAYGFLQ--------NGYSVSYVSSQLTTTEFIKQ 70 (230) T ss_pred CCCHHHHHHHCCCCC---------CCCEEEEECCCCCCHHHHHHHHHHHHHH--------CCCEEEEEEECCCHHHHHHH T ss_conf 571357887178988---------9848999868998789999999999987--------89869999943899999999 Q ss_pred HHHH-------CCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCC Q ss_conf 9851-------146742115899728878889999988766666675420468847996164789999987545303787 Q gi|254780617|r 217 VKAL-------EDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPP 289 (509) Q Consensus 217 ~~~l-------~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp 289 (509) .+.+ ...|.+.+..+.-.-++......++. -+-+..+..++|| +++||+|-..... T Consensus 71 m~s~g~di~~~~~~G~l~~i~~~~~~~~~~~~~~~L~-----~ll~~~~~~~~dv-IIIDSlS~l~~~~----------- 133 (230) T PRK08533 71 MMSLGYDINKKLISGKLLYIPVYPLLSGNSEKRDFLD-----KLMNTRRFYEKDV-VIIDSLSSLVSRD----------- 133 (230) T ss_pred HHHCCCCCHHHHHCCCEEEEECCCCCCCHHHHHHHHH-----HHHHHHHHCCCCE-EEEECCHHHHHCC----------- T ss_conf 9986998179975796799961343354045789999-----9973266437989-9990531885167----------- Q ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCC Q ss_conf 543337520012467888875045445887356753100367664431146688540616996150122102 Q gi|254780617|r 290 GREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGI 361 (509) Q Consensus 290 ~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~ 361 (509) ..|..++.+.+ .+.+.... |- |.+ ++...+.+++-.-+.++|+.|++|.|+.....-.. T Consensus 134 ---~~~~~~~~~~~-~lk~l~s~------gk-tIi--lTv~p~~~~e~~l~~lrs~aDv~i~L~~~~vGg~~ 192 (230) T PRK08533 134 ---ASEVQIRDLMA-FFKRISSL------NK-VII--LTANPKELPESVLLILRTASTILIRLEVKVFGGDL 192 (230) T ss_pred ---CCHHHHHHHHH-HHHHHHHC------CC-EEE--EEECCCCCCHHHHHHHHEEEEEEEEEEEEECCCEE T ss_conf ---77899999999-99999858------98-899--99563313624544204104899998736109889 No 78 >cd01121 Sms Sms (bacterial radA) DNA repair protein. This protein is not related to archael radA any more than is to other RecA-like NTPases. Sms has a role in recombination and recombinational repair and is responsible for the stabilization or processing of branched DNA molecules. Probab=97.36 E-value=0.006 Score=40.48 Aligned_cols=168 Identities=19% Similarity=0.236 Sum_probs=96.9 Q ss_pred HHHHCCHHHHHCCC--CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHC Q ss_conf 14441602331011--1156833655247788611589999999886302466787606999951677678999998511 Q gi|254780617|r 144 EPLSTGIKAIDSLI--PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALE 221 (509) Q Consensus 144 ~~l~TGI~~ID~l~--pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~ 221 (509) ..+.||+.-+|-.+ =+..|.=++|=|++|.|||||++++..+-+..+ ..|.|+.==|....++.=.+.| T Consensus 62 ~R~~Tgi~ElDRVLGGGiVpGSvvLlgGePGIGKSTLLLQia~~la~~~--------~~vLYvSGEES~~QIk~RA~RL- 132 (372) T cd01121 62 ERIPTGIEELDRVLGGGLVPGSVILIGGDPGIGKSTLLLQVAARLAKRG--------GKVLYVSGEESPEQIKLRADRL- 132 (372) T ss_pred CCCCCCCHHHHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHCC--------CCEEEEECHHHHHHHHHHHHHH- T ss_conf 5444783666540057730671799825998868899999999998639--------9389982456789999899985- Q ss_pred CCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 46742115899728878889999988766666675420468847996164789999987545303787543337520012 Q gi|254780617|r 222 DRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYL 301 (509) Q Consensus 222 ~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~ 301 (509) +.....+.+.+..+-...+. .. ++-+--++++||+-.. .-|.-.+.||++-.. T Consensus 133 --g~~~~~l~l~set~le~Il~------------~i-~~~kP~~lIIDSIQT~------------~~~~~~s~pGsvsQV 185 (372) T cd01121 133 --GISTENLYLLAETNLEDILA------------SI-EELKPDLVIIDSIQTV------------YSSELTSAPGSVSQV 185 (372) T ss_pred --CCCCCCCEEEECCCHHHHHH------------HH-HHHCCCEEEEECHHHC------------CCCCCCCCCCCHHHH T ss_conf --87887727884356999999------------99-9719988999562202------------037767799878999 Q ss_pred H--HHHHHHHHHCCCCCCCCCEEEEEE-EECCCCCCCCCHHHHHHHHHCCEEEEECC Q ss_conf 4--678888750454458873567531-00367664431146688540616996150 Q gi|254780617|r 302 H--SRLLERAAKMSDALGAGSLTALPV-IETQVNDVSAYIPTNVISITDGQIFLETE 355 (509) Q Consensus 302 ~--s~l~ERa~~~~~~~g~GSiT~~~~-v~~~~~D~~~~i~~~~~si~DG~i~l~~~ 355 (509) - +.-+-|.+|- -.|+.|-+ --|..|++..|=.- --+.|--+||+-+ T Consensus 186 Reca~~L~~~AK~------~~i~~~lVGHVTK~G~iAGPkvL--EHmVDtVl~fEgd 234 (372) T cd01121 186 RECTAELMRFAKE------RNIPIFIVGHVTKEGSIAGPKVL--EHMVDTVLYFEGD 234 (372) T ss_pred HHHHHHHHHHHHH------CCCEEEEEEEECCCCCCCCCHHH--EEEEEEEEEECCC T ss_conf 9999999999986------19739999876268863774031--0021368875157 No 79 >cd01393 recA_like RecA is a bacterial enzyme which has roles in homologous recombination, DNA repair, and the induction of the SOS response. RecA couples ATP hydrolysis to DNA strand exchange. While prokaryotes have a single RecA protein, eukaryotes have multiple RecA homologs such as Rad51, DMC1 and Rad55/57. Archaea have the RecA-like homologs radA and radB. Probab=97.23 E-value=0.0033 Score=42.35 Aligned_cols=121 Identities=20% Similarity=0.207 Sum_probs=63.9 Q ss_pred HHCCHHHHHCCCC--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCC--HHHHHHHHHHH- Q ss_conf 4416023310111--156833655247788611589999999886302466787606999951677--67899999851- Q gi|254780617|r 146 LSTGIKAIDSLIP--IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQK--RSSVARFVKAL- 220 (509) Q Consensus 146 l~TGI~~ID~l~p--igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer--~~ev~~~~~~l- 220 (509) +.||++.+|.++= |-.|+=.-|+|.+|+|||.+++.+..|-.... .....+-.|||.-.--. .+-+.++.... T Consensus 1 isTG~~~lD~~LgGGip~G~ItEi~G~~gsGKT~l~lqla~~~q~~~--~~~~~~g~vvyIDtE~~f~~~rl~~i~~~~~ 78 (226) T cd01393 1 ISTGSKALDELLGGGIPTGRITEIFGEFGSGKTQLCLQLAVEAQLPG--ELGGLEGKVVYIDTEGAFRPERLVQLAVRFG 78 (226) T ss_pred CCCCCHHHHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCC--CCCCCCCEEEEEECCCCCCHHHHHHHHHHHC T ss_conf 98874789988579988883999999999989999999999985422--1169996199995577531999999987603 Q ss_pred -CCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHH Q ss_conf -1467421158997288788899999887666666754204688479961647899 Q gi|254780617|r 221 -EDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHA 275 (509) Q Consensus 221 -~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A 275 (509) .....+++..++-.. + ....+. .+.-.+.+..+++--|+|+||++.+. T Consensus 79 ~~~~~~l~~i~~~~~~-~--~e~~~~----~~~~l~~~~~~~~v~liViDSi~al~ 127 (226) T cd01393 79 LDPEEVLDNIYVARPY-N--GEQQLE----IVEELERIMSSGRVDLVVVDSVAALF 127 (226) T ss_pred CCHHHHHHHEEEECCC-C--HHHHHH----HHHHHHHHHHCCCCEEEEEECCCEEC T ss_conf 2667764333684379-9--999999----99999987524784289993220011 No 80 >pfam00154 RecA recA bacterial DNA recombination protein. RecA is a DNA-dependent ATPase and functions in DNA repair systems. RecA protein catalyses an ATP-dependent DNA strand-exchange reaction that is the central step in the repair of dsDNA breaks by homologous recombination. Probab=97.10 E-value=0.004 Score=41.75 Aligned_cols=109 Identities=19% Similarity=0.315 Sum_probs=71.7 Q ss_pred HHHHCCHHHHHCCCC---CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 144416023310111---15683365524778861158999999988630246678760699995167767899999851 Q gi|254780617|r 144 EPLSTGIKAIDSLIP---IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL 220 (509) Q Consensus 144 ~~l~TGI~~ID~l~p---igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l 220 (509) +.+.||..++|..+- +-+|.=.-|+|.+++|||++++.+|.|- +. ..-+|+|.-. |..-+ .++.+.+ T Consensus 31 ~~i~tgs~~lD~aLg~GGlP~GRi~ei~G~essGKTtlal~~ia~a--Qk------~gg~~~~iD~-E~a~d-~~~a~~l 100 (322) T pfam00154 31 EVISTGSLGLDIALGIGGLPKGRIIEIYGPESSGKTTLALHAIAEA--QK------AGGTAAFIDA-EHALD-PVYAKKL 100 (322) T ss_pred CEEECCHHHHHHHHCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--HH------CCCEEEEEEH-HHHCC-HHHHHHC T ss_conf 4461580899998758997787089998898777899999999999--73------4993899853-66059-8899980 Q ss_pred CCCCC-CCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHH Q ss_conf 14674-21158997288788899999887666666754204688479961647899 Q gi|254780617|r 221 EDRGA-LSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHA 275 (509) Q Consensus 221 ~~~~~-~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A 275 (509) |. +++ +++++.+ .+++ ++.++|++...|...|+++||+.... T Consensus 101 ---GVD~~~--l~~~qpd--~~Eq------al~i~~~li~~~~~~liViDSvaal~ 143 (322) T pfam00154 101 ---GVDIDN--LLVSQPD--TGEQ------ALEIADMLVRSGAVDLIVVDSVAALV 143 (322) T ss_pred ---CCCHHH--EEEECCC--HHHH------HHHHHHHHHCCCCCCEEEEECCCCCC T ss_conf ---988025--3897788--3999------99999998537997659982534567 No 81 >PRK11823 DNA repair protein RadA; Provisional Probab=96.99 E-value=0.017 Score=37.16 Aligned_cols=167 Identities=22% Similarity=0.265 Sum_probs=91.5 Q ss_pred HHHHHCCHHHHHCCC--CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 014441602331011--115683365524778861158999999988630246678760699995167767899999851 Q gi|254780617|r 143 CEPLSTGIKAIDSLI--PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL 220 (509) Q Consensus 143 ~~~l~TGI~~ID~l~--pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l 220 (509) .+.+.||+.-+|-.+ =+..|+=++|=|++|.|||||++++..+-+ . . -.+.|+.==|..+.++.=.+.| T Consensus 69 ~~R~~Tgi~ElDRVLGGGlV~GS~iLlgGePGIGKSTLlLQ~a~~la-~------~--~~vLYvSGEES~~Qik~RA~RL 139 (454) T PRK11823 69 EPRISTGIGELDRVLGGGLVPGSVVLIGGDPGIGKSTLLLQVAAALA-A------G--GKVLYVSGEESLQQIKLRAERL 139 (454) T ss_pred CCCCCCCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHH-C------C--CCEEEECCCHHHHHHHHHHHHH T ss_conf 55323785566522158720664899507998889999999999985-5------9--9579981501578999999975 Q ss_pred CCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH Q ss_conf 14674211589972887888999998876666667542046884799616478999998754530378754333752001 Q gi|254780617|r 221 EDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFY 300 (509) Q Consensus 221 ~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~ 300 (509) +.....+.+.+..|-...+.. . ++-+--++++||+-.. ..|.-++.||++-. T Consensus 140 ---g~~~~~l~l~~et~l~~Il~~------------i-~~~~P~~lIIDSIQT~------------~~~~~~s~pGsvsQ 191 (454) T PRK11823 140 ---GLPSDNLYLLAETNLEDILAT------------I-EEEKPDLVVIDSIQTM------------YSPELESAPGSVSQ 191 (454) T ss_pred ---CCCCCCCEEEECCCHHHHHHH------------H-HHHCCCEEEEECHHEE------------EECCCCCCCCCHHH T ss_conf ---888887378853689999999------------9-8609988999431115------------41566778997899 Q ss_pred HH--HHHHHHHHHCCCCCCCCCEEEEEE-EECCCCCCCCCHHHHHHHHHCCEEEEEC Q ss_conf 24--678888750454458873567531-0036766443114668854061699615 Q gi|254780617|r 301 LH--SRLLERAAKMSDALGAGSLTALPV-IETQVNDVSAYIPTNVISITDGQIFLET 354 (509) Q Consensus 301 ~~--s~l~ERa~~~~~~~g~GSiT~~~~-v~~~~~D~~~~i~~~~~si~DG~i~l~~ 354 (509) .. +..+-|.+|-++ |+.|-+ --|..|++..|=.- --+.|--++|+- T Consensus 192 Vre~a~~L~~~AK~~~------i~~~lVGHVTKdG~iAGPkvL--EHmVDtVl~fEG 240 (454) T PRK11823 192 VRECAAELTRLAKQSG------IAVFLVGHVTKEGAIAGPRVL--EHMVDTVLYFEG 240 (454) T ss_pred HHHHHHHHHHHHHHCC------CCEEEEEEECCCCCCCCCHHH--HHHHCEEEEECC T ss_conf 9999999999997449------828999977267764661452--220104687515 No 82 >COG0468 RecA RecA/RadA recombinase [DNA replication, recombination, and repair] Probab=96.88 E-value=0.016 Score=37.42 Aligned_cols=118 Identities=20% Similarity=0.196 Sum_probs=67.0 Q ss_pred CHHHHHCCHHHHHCCCC--CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 00144416023310111--1568336552477886115899999998863024667876069999516776789999985 Q gi|254780617|r 142 VCEPLSTGIKAIDSLIP--IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKA 219 (509) Q Consensus 142 i~~~l~TGI~~ID~l~p--igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~ 219 (509) -.+.+.||.+.+|.++- +-+|.-.=|||+.++|||++++..+.|--.. .-.|+|.-- |.+-...++..- T Consensus 38 ~~~~i~TGs~~LD~~LGGGl~~g~ItEiyG~~gsGKT~lal~~~~~aq~~--------g~~a~fIDt-E~~l~p~r~~~l 108 (279) T COG0468 38 DIEAISTGSLALDEALGGGLPRGRITEIYGPESSGKTTLALQLVANAQKP--------GGKAAFIDT-EHALDPERAKQL 108 (279) T ss_pred CCCCCCCCCHHHHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCC--------CCEEEEEEC-CCCCCHHHHHHH T ss_conf 36601655055787746886553589984688765466899998886537--------980899958-999899999998 Q ss_pred HCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHH Q ss_conf 11467421158997288788899999887666666754204688479961647899999 Q gi|254780617|r 220 LEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAY 278 (509) Q Consensus 220 l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~ 278 (509) .... +++ ++++.++..--...++-...-..+| .==|+++||++.|-.+. T Consensus 109 ~~~~--~d~--l~v~~~~~~e~q~~i~~~~~~~~~~------~i~LvVVDSvaa~~r~~ 157 (279) T COG0468 109 GVDL--LDN--LLVSQPDTGEQQLEIAEKLARSGAE------KIDLLVVDSVAALVRAE 157 (279) T ss_pred HHHH--HCC--EEEECCCCHHHHHHHHHHHHHHCCC------CCCEEEEECCCCCCHHH T ss_conf 8754--215--3686689779999999999875468------87889982574346365 No 83 >cd00983 recA RecA is a bacterial enzyme which has roles in homologous recombination, DNA repair, and the induction of the SOS response. RecA couples ATP hydrolysis to DNA strand exchange. Probab=96.81 E-value=0.009 Score=39.19 Aligned_cols=135 Identities=19% Similarity=0.287 Sum_probs=82.3 Q ss_pred HHHHCCHHHHHCCCC---CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 144416023310111---15683365524778861158999999988630246678760699995167767899999851 Q gi|254780617|r 144 EPLSTGIKAIDSLIP---IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL 220 (509) Q Consensus 144 ~~l~TGI~~ID~l~p---igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l 220 (509) +.+.||...+|..+- +-+|.=.-|+|..++|||++++.++.+- +. ..-+|+|.-. |..- -.++.+.+ T Consensus 34 ~~i~TGs~~lD~alg~GGiP~GRivei~G~essGKTtlal~~ia~a--Qk------~gg~~~~iDa-E~a~-d~~~a~~l 103 (325) T cd00983 34 EVIPTGSLSLDIALGIGGYPKGRIIEIYGPESSGKTTLALHAIAEA--QK------LGGTVAFIDA-EHAL-DPVYAKKL 103 (325) T ss_pred CEECCCCHHHHHHHCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH--HH------CCCEEEEEEH-HHCC-CHHHHHHC T ss_conf 3462681999999758996688089998898777999999999998--73------5983999962-5425-98999980 Q ss_pred CCCCC-CCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCH Q ss_conf 14674-21158997288788899999887666666754204688479961647899999875453037875433375200 Q gi|254780617|r 221 EDRGA-LSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVF 299 (509) Q Consensus 221 ~~~~~-~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf 299 (509) |. .++ +++...+ .+++ ++.++|.+...|.-.|+|+||+....-. .|+---++ +...| T Consensus 104 ---GVD~~~--l~~~qp~--~~Eq------~l~i~~~li~s~~~dliViDSvaal~p~-~E~e~~~~-----d~~vg--- 161 (325) T cd00983 104 ---GVDLDN--LLISQPD--TGEQ------ALEIADSLVRSGAVDLIVVDSVAALVPK-AEIEGEMG-----DSHVG--- 161 (325) T ss_pred ---CCCHHH--EEEECCC--HHHH------HHHHHHHHHCCCCCCEEEECCCCCCCCH-HHHHHCCC-----CCHHH--- T ss_conf ---998467--5896663--8999------9999999751588767998151123657-88760113-----21143--- Q ss_pred HHHHHHHHHHHH Q ss_conf 124678888750 Q gi|254780617|r 300 YLHSRLLERAAK 311 (509) Q Consensus 300 ~~~s~l~ERa~~ 311 (509) +++|++-++-| T Consensus 162 -~~ArlmskalR 172 (325) T cd00983 162 -LQARLMSQALR 172 (325) T ss_pred -HHHHHHHHHHH T ss_conf -89999999999 No 84 >PRK04301 radA DNA repair and recombination protein RadA; Validated Probab=96.76 E-value=0.0076 Score=39.72 Aligned_cols=131 Identities=22% Similarity=0.298 Sum_probs=70.8 Q ss_pred CCCCCCHHHHHCCHHHHHCCCCCC--CCCEEEEECCCCCCCHHHHHHHHHHH-HHHCCCCCCCCCCEEEEEEC--CCCHH Q ss_conf 112660014441602331011115--68336552477886115899999998-86302466787606999951--67767 Q gi|254780617|r 137 IQRQSVCEPLSTGIKAIDSLIPIG--RGQRELIIGDRKTGKTSIILDTFLNQ-KSSHDKGSEKDKVYCIYVAI--GQKRS 211 (509) Q Consensus 137 ~~R~~i~~~l~TGI~~ID~l~pig--rGQR~~I~g~~g~GKt~l~~~~I~nq-~~~~~~~~~~~~v~~V~~~I--Ger~~ 211 (509) ++++.--..+.||.+.+|.++-=| .|.=.-|+|.+|+|||.+|+..-+|- .-...+... . -+||.-- +=+.+ T Consensus 76 ~~~r~~~~~isTg~~~lD~lLgGGi~~g~ITEi~Ge~gsGKTQlc~qLav~~qlp~~~GGl~-g--~vvYIDTEgtF~pe 152 (318) T PRK04301 76 LERRKSIGKITTGSKELDELLGGGIETQSITEFYGEFGSGKTQICHQLAVNVQLPEEEGGLE-G--KAVYIDTENTFRPE 152 (318) T ss_pred HHHHHCCCCCCCCCHHHHHHCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHEECCHHHCCCC-C--CEEEEECCCCCCHH T ss_conf 99863478247888788805479833670788866887870356677676533767778988-6--37999568986979 Q ss_pred HHHHHHHHH--CCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCC-EEEEEECHHHHHHH Q ss_conf 899999851--1467421158997288788899999887666666754204688-47996164789999 Q gi|254780617|r 212 SVARFVKAL--EDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYH-ALIAYDDLQKHAVA 277 (509) Q Consensus 212 ev~~~~~~l--~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~-VLi~~Ddltr~A~a 277 (509) -+.++.+.+ .....+++..+.-+.. ..+++.... .+.+++ .++++ -|+|+||++.+.++ T Consensus 153 Ri~qia~~~g~d~~~~L~nI~v~r~~~---~~~q~~~~~---~~~~~~-~~~~~v~LvVvDSi~alfR~ 214 (318) T PRK04301 153 RIEQMAEGLGLDPDEVLDNIHVARAYN---SDHQMLLAE---KAEELI-KEGNNIKLVIVDSLTAHFRA 214 (318) T ss_pred HHHHHHHHHCCCHHHHHHCEEEEECCC---HHHHHHHHH---HHHHHH-HHCCCCEEEEEECCHHHEEE T ss_conf 999999984999789864026861399---899999999---999999-62788049999434232121 No 85 >TIGR02236 recomb_radA DNA repair and recombination protein RadA; InterPro: IPR011938 This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO), and archaeal RadB (IPR011939 from INTERPRO). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=96.76 E-value=0.0023 Score=43.53 Aligned_cols=143 Identities=22% Similarity=0.296 Sum_probs=95.3 Q ss_pred CCCCCCCCCCHHHHHCCHHHHHCCCCCCCC-CEEE-EECCCCCCCHHHHHHHHHHHHHHCCCCCC---------CC---C Q ss_conf 577311266001444160233101111568-3365-52477886115899999998863024667---------87---6 Q gi|254780617|r 133 APGIIQRQSVCEPLSTGIKAIDSLIPIGRG-QREL-IIGDRKTGKTSIILDTFLNQKSSHDKGSE---------KD---K 198 (509) Q Consensus 133 ~p~~~~R~~i~~~l~TGI~~ID~l~pigrG-QR~~-I~g~~g~GKt~l~~~~I~nq~~~~~~~~~---------~~---~ 198 (509) +...++|+.-=.-|.||++.+|-|+==|-= |-+= +||-+|+|||-| .+|.+.|..... .. . T Consensus 65 a~~vl~rR~~v~kitTgs~~LDeLLGGGiETqaiTE~~GEFGSGKTQ~-----~HqLAVnVQlP~flfydeeave~GGL~ 139 (333) T TIGR02236 65 ADDVLERRKSVGKITTGSKELDELLGGGIETQAITEVFGEFGSGKTQI-----CHQLAVNVQLPEFLFYDEEAVEKGGLE 139 (333) T ss_pred HHHHHHHHHCCCEEECCCHHHHHHCCCCCCCEEEEEEEEEECCCHHHH-----HHHHHHHHCCCCHHHCCCHHHCCCCCC T ss_conf 899999872075044486367656079600005888771007865787-----667655322871121011000147767 Q ss_pred C-EEEEEEC-----CCCHHHHHH-HHHH-------HCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 0-6999951-----677678999-9985-------114674211589972887888999998876666667542046884 Q gi|254780617|r 199 V-YCIYVAI-----GQKRSSVAR-FVKA-------LEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHA 264 (509) Q Consensus 199 v-~~V~~~I-----Ger~~ev~~-~~~~-------l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~V 264 (509) - -.||.=- =||-.++++ +-.. |.....|++.- +|-|-.| -+|++-..-+-.+++--.+.|++| T Consensus 140 gp~av~IDTEnTFRPERI~qmA~GL~~g~l~~~melD~~evL~nI~--vARAyNS-~HQmllve~~~~li~e~~~~~~pv 216 (333) T TIGR02236 140 GPKAVYIDTENTFRPERIEQMAKGLARGTLQAAMELDPDEVLKNIY--VARAYNS-NHQMLLVEKAKELIKELKNEDKPV 216 (333) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCEE--EEEEECC-HHHHHHHHHHHHHHHHCCCCCCCE T ss_conf 8768998559798625799999863111366653278777720304--7785263-147999999999997301579946 Q ss_pred -EEEEECHHHHHHHHHHHHHHCCCCCCCC Q ss_conf -7996164789999987545303787543 Q gi|254780617|r 265 -LIAYDDLQKHAVAYRQLSLLLRRPPGRE 292 (509) Q Consensus 265 -Li~~Ddltr~A~a~reisl~~~~pp~~~ 292 (509) ||+.||||.|-+| |=|||+ T Consensus 217 ~L~~VDSLtsHFRa---------EY~GRg 236 (333) T TIGR02236 217 RLLIVDSLTSHFRA---------EYVGRG 236 (333) T ss_pred EEEEECCCCCCCCC---------CCCCHH T ss_conf 89998121002236---------764502 No 86 >PRK12422 chromosomal replication initiation protein; Provisional Probab=96.59 E-value=0.073 Score=32.59 Aligned_cols=239 Identities=14% Similarity=0.199 Sum_probs=107.3 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHH Q ss_conf 65524778861158999999988630246678760699995167767899999851146742115899728878889999 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQL 244 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~ 244 (509) .-|.|++|.|||.| +.+|-|.... ... .++|+-- |.. ..+|+..++... ...+ T Consensus 144 LfIyG~~GlGKTHL-L~AIgn~i~~-----~~~--kV~Yvta-e~F--~~~~v~ai~~~~----------------~~~F 196 (455) T PRK12422 144 IYLFGPEGSGKTHL-MQAAVSALRE-----SGG--KILYVSS-ELF--TEHLVSAIRSGE----------------MQRF 196 (455) T ss_pred EEEECCCCCCHHHH-HHHHHHHHHC-----CCC--EEEEECH-HHH--HHHHHHHHHCCC----------------HHHH T ss_conf 58878999978999-9999998537-----998--6999749-999--999999997588----------------9999 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECHHHHH---HHHHH--------------HHHHCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 9887666666754204688479961647899---99987--------------545303787543337520012467888 Q gi|254780617|r 245 LAPFAGCAMGEYFRDNGYHALIAYDDLQKHA---VAYRQ--------------LSLLLRRPPGREAYPGDVFYLHSRLLE 307 (509) Q Consensus 245 ~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A---~a~re--------------isl~~~~pp~~~gyp~~vf~~~s~l~E 307 (509) -++| +.-||| ++||+--++ ..+.| |-+...+||. ++-.+..||.- T Consensus 197 ---------r~~y--r~~DvL-LIDDIQfl~gK~~tqeEff~tfN~L~~~~KQIVitsDr~P~------el~~l~~RL~S 258 (455) T PRK12422 197 ---------RSFY--RNVDAL-FIEDIEVFSGKGATQEEFFHTFNSLHTEGKLIVISSSYAPG------DLKAMEERLIS 258 (455) T ss_pred ---------HHHH--HCCCEE-EEEHHHHHHCCHHHHHHHHHHHHHHHHCCCEEEEECCCCHH------HHHHHHHHHHH T ss_conf ---------9999--638877-63147887284889999999999999859969996898957------65126899998 Q ss_pred HHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHH Q ss_conf 87504544588735675310036766443114668854061699615012210266543344412203434389887755 Q gi|254780617|r 308 RAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVS 387 (509) Q Consensus 308 Ra~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a 387 (509) |-. +|=++ |+..|=.+.-+.|+ .++....| +++...+-.....-.....+.+- T Consensus 259 Rf~-------~GL~v----------~I~~Pd~etr~~Il------~~k~~~~~----~~l~~ev~~~iA~~i~~niReLe 311 (455) T PRK12422 259 RFE-------WGIAI----------PIHPLTREGLRSFL------MRQAEQLS----IRIEETALDFLIQALSSNVKTLL 311 (455) T ss_pred HHH-------CCEEE----------CCCCCCHHHHHHHH------HHHHHHCC----CCCCHHHHHHHHHHHHHHHHHHH T ss_conf 863-------76132----------16899989999999------99998718----88844689999999755179999 Q ss_pred HHHHHHH--HHHHHH-------HHHHHHHC---------CCCH-HHHHHHHH--HHHHHHHHCCCCCCCCCHHHHHHHHH Q ss_conf 6788879--988767-------87987633---------8998-99999999--99999996699988748899999999 Q gi|254780617|r 388 GAVKGEL--AQYREM-------SSFSKFSS---------DLDS-STQKFLSK--GERLTELLKQPQFSPLAMEEQVVMIF 446 (509) Q Consensus 388 ~~lr~~l--aqy~El-------e~f~~fgs---------dlD~-~t~~~l~r--g~ri~e~LkQ~~~~p~~~~~qv~~l~ 446 (509) |.+..+. ..|..+ +.....-. .++. .-.+.+.+ +-...+++............|+++ | T Consensus 312 Gal~~l~~~~~~~~~~~~~i~~~~~~~~l~d~~~~~~~~~it~~~I~~~Va~~f~i~~~dl~sk~R~k~iv~aRqiAM-Y 390 (455) T PRK12422 312 HALTLLAKRVAYKKLSHQLLYEDDIKALLHDVLEAAESVRLTPSGIVRAVAQYYGVSPESILGRSQSREYVLPRQVAM-Y 390 (455) T ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCHHHHCCCCCCCCCCHHHHHHH-H T ss_conf 999999999998715688364999999999874224576589999999999872976999607888845557999999-9 Q ss_pred HHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 986698066798999999999999997415999998873 Q gi|254780617|r 447 AGISGCLDEVAVSQVRKFETNFLSHMRVSSQDILEDIRK 485 (509) Q Consensus 447 a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ei~~~I~~ 485 (509) .+.+ +-+....+|...- ...|.-++..+++ T Consensus 391 L~R~--lt~~Sl~~IG~~F-------gRdHSTVihaikk 420 (455) T PRK12422 391 LCRQ--KLSLSYVKIGDVF-------SRDHSTVISSIRA 420 (455) T ss_pred HHHH--HHCCCHHHHHHHH-------CCCCCHHHHHHHH T ss_conf 9998--8368899998875-------9985099999999 No 87 >pfam08423 Rad51 Rad51. Rad51 is a DNA repair and recombination protein and is a homologue of the bacterial ATPase RecA protein. Probab=96.55 E-value=0.0073 Score=39.84 Aligned_cols=132 Identities=17% Similarity=0.198 Sum_probs=71.1 Q ss_pred CCCCCCCHHHHHCCHHHHHCCCC--CCCCCEEEEECCCCCCCHHHHHHHHHHH-HHHCCCCCCCCCCEEEEEEC--CCCH Q ss_conf 31126600144416023310111--1568336552477886115899999998-86302466787606999951--6776 Q gi|254780617|r 136 IIQRQSVCEPLSTGIKAIDSLIP--IGRGQRELIIGDRKTGKTSIILDTFLNQ-KSSHDKGSEKDKVYCIYVAI--GQKR 210 (509) Q Consensus 136 ~~~R~~i~~~l~TGI~~ID~l~p--igrGQR~~I~g~~g~GKt~l~~~~I~nq-~~~~~~~~~~~~v~~V~~~I--Ger~ 210 (509) .++++.--..+.||.+.+|.++- +-+|.=.-|+|.+|+|||.+|+...+|- ....... .. -.+||.-- +-+. T Consensus 15 ~~~~r~~~~~isTg~~~LD~lLgGGi~~g~ITEi~G~~gsGKTQlc~qlav~~qlp~~~gg-~~--g~vvyIDTEg~f~~ 91 (261) T pfam08423 15 LHQRRSEVIRITTGSKELDKLLGGGIETGSITEVFGEFRTGKTQLCHTLCVTCQLPLEMGG-GE--GKALYIDTEGTFRP 91 (261) T ss_pred HHHHHCCCCEECCCCHHHHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCHHHHCC-CC--CEEEEEECCCCCCH T ss_conf 9997548735778987899873798667729999899888789999999999407096569-99--72899936888698 Q ss_pred HHHHHHHHHHC--CCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHH Q ss_conf 78999998511--46742115899728878889999988766666675420468847996164789999 Q gi|254780617|r 211 SSVARFVKALE--DRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVA 277 (509) Q Consensus 211 ~ev~~~~~~l~--~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a 277 (509) +-+.++.+.+. ....+++..+ ... .+. ..++..- ..+.+.+ .+.+--|+++||++.+.++ T Consensus 92 eRl~qia~~~~~~~~~~L~~I~v-~r~-~~~-~~~~~~l---~~~~~~~-~~~~v~LvVvDSiaalfR~ 153 (261) T pfam08423 92 ERIVAIAERFGLDPEEVLDNIAY-ARA-YNT-EHQMQLL---LQAAAMM-SESRFALLIVDSATALYRT 153 (261) T ss_pred HHHHHHHHHHCCCHHHHHHHCEE-CCC-CCH-HHHHHHH---HHHHHHH-HCCCCEEEEEECCCEEEEE T ss_conf 99999999829997898753314-168-998-9999999---9999987-3178349998324002333 No 88 >TIGR02237 recomb_radB DNA repair and recombination protein RadB; InterPro: IPR011939 This family consists exclusively of archaeal RadB protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO) and DMC1 (IPR011940 from INTERPRO), and archaeal RadA (IPR011938 from INTERPRO) ,, .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=96.39 E-value=0.042 Score=34.37 Aligned_cols=173 Identities=22% Similarity=0.173 Sum_probs=85.5 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC----CCCCCEEEEEEECCCCCHH Q ss_conf 552477886115899999998863024667876069999516776789999985114----6742115899728878889 Q gi|254780617|r 166 LIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED----RGALSYSIVVVASASDPAP 241 (509) Q Consensus 166 ~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~----~~~~~~tvvv~a~a~~~~~ 241 (509) =|+|++|+|||++|+-+-+|-..+. ..| ||+=- |=|=...+|..-++. .+...-.++|+...+ -- T Consensus 16 QiYGp~G~GKTn~c~~~a~~a~~~G------k~v--~YiDT-EGGLS~ER~~q~~~~~~~D~e~~~~~~iv~~~~~--f~ 84 (223) T TIGR02237 16 QIYGPPGSGKTNICLILAVNAARQG------KKV--VYIDT-EGGLSPERFKQIAEDRALDPERVLSNVIVFEVFD--FD 84 (223) T ss_pred EEECCCCCCHHHHHHHHHHHHHHCC------CCE--EEEEC-CCCCHHHHHHHHHHCCCCCHHHHHCCEEEECCCC--HH T ss_conf 9875899867899999999998618------958--99962-8983289999986305889888841535523535--67 Q ss_pred HHHHHHHHHHHHHHHHHCC-C-CCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9999887666666754204-6-8847996164789999987545303787543337520012467888875045445887 Q gi|254780617|r 242 MQLLAPFAGCAMGEYFRDN-G-YHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAG 319 (509) Q Consensus 242 ~r~~ap~~a~aiAEyfr~~-G-~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~G 319 (509) +|-.| ..=++-|.++ | +--|||+||+|.|-++-++=.- -..-..--++-..++-|+.=|.|.+=. = T Consensus 85 eQ~~a----i~~~~~~~~~~G~~~~LvVvDs~t~~YRle~~~d~-----nk~~~~~~~l~~Ql~~Ll~lArk~~~A---V 152 (223) T TIGR02237 85 EQEVA----IQKTSKLIDRDGDKADLVVVDSFTALYRLERSDDR-----NKQISLNRELARQLTLLLSLARKKDLA---V 152 (223) T ss_pred HHHHH----HHHHHHHHHCCCCEEEEEEEECCHHHHHHCCCCCC-----HHHHHHHHHHHHHHHHHHHHHHHCCCC---E T ss_conf 89999----99999998606883314888153345420257860-----256799999999999999998764997---8 Q ss_pred CEEEEEEEECCCC---CCCCCHHHHHHHHHCCEEE-EECCHHHH-CCCCC Q ss_conf 3567531003676---6443114668854061699-61501221-02665 Q gi|254780617|r 320 SLTALPVIETQVN---DVSAYIPTNVISITDGQIF-LETELFYQ-GIRPA 364 (509) Q Consensus 320 SiT~~~~v~~~~~---D~~~~i~~~~~si~DG~i~-l~~~l~~~-g~~Pa 364 (509) =|| =++-+.-++ |.-.|+.-.+..+.-.-|+ |++ +++ |.|-| T Consensus 153 viT-NQvy~d~~~gG~~~~~P~GG~~L~h~~K~i~rLE~--~~r~G~R~a 199 (223) T TIGR02237 153 VIT-NQVYTDVNNGGSETLRPLGGHLLEHWSKVILRLEK--LNRVGRRKA 199 (223) T ss_pred EEE-EEEEEEECCCCCCEEEECCCHHHHHHHCEEEEEEC--CCCCCCEEE T ss_conf 997-11588637797320100331134311021335644--789874245 No 89 >TIGR00635 ruvB Holliday junction DNA helicase RuvB; InterPro: IPR004605 All proteins in this family for which functions are known are 5'-3' DNA helicases that, as part of a complex with RuvA homologs serve as a 5'-3' Holliday junction helicase. RuvA specifically binds Holliday junctions as a sandwich of two tetramers and maintains the configuration of the junction. It forms a complex with two hexameric rings of RuvB, the subunit that contains helicase activity. The complex drives ATP-dependent branch migration of the Holliday junction recombination intermediate. The endonuclease RuvC resolves junctions.; GO: 0003677 DNA binding, 0005524 ATP binding, 0009378 Holliday junction helicase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=96.13 E-value=0.0086 Score=39.33 Aligned_cols=210 Identities=20% Similarity=0.236 Sum_probs=107.1 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHH Q ss_conf 36552477886115899999998863024667876069999516776789999985114674211589972887888999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQ 243 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r 243 (509) -+++||+||.||||||. .|.|--+.+... .=+-.=||.++.+-+...|++. T Consensus 32 H~LL~GPPGLGKTTLA~-IiA~Emg~~l~i--------TsGP~L~kPgDlaaiLt~L~~g-------------------- 82 (305) T TIGR00635 32 HLLLYGPPGLGKTTLAH-IIANEMGVNLKI--------TSGPALEKPGDLAAILTNLEEG-------------------- 82 (305) T ss_pred CEEEECCCCCCHHHHHH-HHHHHHCCCCEE--------EECCCCCCHHHHHHHHHHCCCC-------------------- T ss_conf 16631756874678999-999983893267--------4067554757899999705689-------------------- Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCC--- Q ss_conf 99887666666754204688479961647899999875453037875433375200124678888750454458873--- Q gi|254780617|r 244 LLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGS--- 320 (509) Q Consensus 244 ~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GS--- 320 (509) ||| ++|=+-|.+.+-.|+ ==|..|=|.=|+-= |+-.+ .=| T Consensus 83 -------------------DVL-FIDEIHRL~p~~EE~-----LYpAMEDF~lDi~I---------G~Gp~---Ar~v~l 125 (305) T TIGR00635 83 -------------------DVL-FIDEIHRLSPAIEEL-----LYPAMEDFRLDIVI---------GKGPS---ARSVRL 125 (305) T ss_pred -------------------CEE-ECCHHHHCCHHHHHH-----CCCCCCCEEEEEEE---------ECCCC---CCEEEE T ss_conf -------------------631-012565048334531-----05300121787787---------12898---525760 Q ss_pred ----EEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCC---CC-CCCHHH--------- Q ss_conf ----5675310036766443114668854061699615012210266543344412203---43-438988--------- Q gi|254780617|r 321 ----LTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVG---SA-AQVKAM--------- 383 (509) Q Consensus 321 ----iT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~---~~-~~~~~~--------- 383 (509) =|-+- .+|..|-+|.|.-|=..=+.-=..|-..+|+.= |||-. +- ....+- T Consensus 126 dLpPFTLvG-ATTR~G~lt~PLrdRFG~~~rl~fY~~~EL~~I-----------v~R~A~~L~~ei~~~~a~~IArrSRG 193 (305) T TIGR00635 126 DLPPFTLVG-ATTRAGMLTSPLRDRFGIILRLEFYTPEELAEI-----------VSRSAGLLNIEIEQEAALEIARRSRG 193 (305) T ss_pred CCCCCCCCC-CCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHH-----------HHHHHHHCCCCCCHHHHHHHHHHCCC T ss_conf 686944200-003477410313345447454026898789999-----------87533441430077899999875478 Q ss_pred -HHHHHHHHH---HHHHHHHH---------HHHHHHH---CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH-- Q ss_conf -775567888---79988767---------8798763---38998999999999999999669998874889999999-- Q gi|254780617|r 384 -KQVSGAVKG---ELAQYREM---------SSFSKFS---SDLDSSTQKFLSKGERLTELLKQPQFSPLAMEEQVVMI-- 445 (509) Q Consensus 384 -~~~a~~lr~---~laqy~El---------e~f~~fg---sdlD~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~qv~~l-- 445 (509) -++|++|-+ |+||-++- +++..+. -+||..+++.| ..+..|.+.=|+-+..-.+++ T Consensus 194 TPRIAnRLLRRVRDfA~V~~~~~I~~~i~~~AL~~L~iD~~GLd~~D~~~L------~~li~~f~GGPVGl~tlA~a~GE 267 (305) T TIGR00635 194 TPRIANRLLRRVRDFAQVRGQGVITRDIALKALEMLDIDELGLDEIDRKLL------SVLIEKFQGGPVGLDTLAAALGE 267 (305) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHH------HHHHHHCCCCCHHHHHHHHHHCC T ss_conf 637888776766448887267873889999998862533300579989999------99998628985238989988557 Q ss_pred -----------HHHHCCCCCCCC Q ss_conf -----------998669806679 Q gi|254780617|r 446 -----------FAGISGCLDEVA 457 (509) Q Consensus 446 -----------~a~~~G~ld~i~ 457 (509) |++.+||+..-+ T Consensus 268 d~~TiEdv~EPyLl~~G~~~RT~ 290 (305) T TIGR00635 268 DADTIEDVYEPYLLQQGFLQRTP 290 (305) T ss_pred CHHHHHCCCCHHHHHHHHHHCCC T ss_conf 51114203465899876650067 No 90 >PRK13341 recombination factor protein RarA/unknown domain fusion protein; Reviewed Probab=96.12 E-value=0.053 Score=33.63 Aligned_cols=84 Identities=17% Similarity=0.202 Sum_probs=47.2 Q ss_pred CCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEEC Q ss_conf 11115683365524778861158999999988630246678760699995167767899999851146742115899728 Q gi|254780617|r 156 LIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVAS 235 (509) Q Consensus 156 l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~ 235 (509) ++--++=.-+.++|++|||||||+- .|.|....+ ++ -+-++--.-.+++++++ T Consensus 46 ~i~~~~~~S~Il~GPPGtGKTTLA~-iIA~~t~~~--------F~-~lsAv~sgvkdlr~ii~----------------- 98 (726) T PRK13341 46 AIKADRVGSLILYGPPGVGKTTLAR-IIANHTRAH--------FS-SLNAVLAGVKDLRAEVD----------------- 98 (726) T ss_pred HHHCCCCCEEEEECCCCCCHHHHHH-HHHHHHCCC--------EE-EEECCCCCHHHHHHHHH----------------- T ss_conf 9976999827888979999999999-998874886--------79-98562037799999999----------------- Q ss_pred CCCCHHHHHHHHHHHHHHHHH-HHCCCCCEEEEEECHHHHHHHHHHHHH Q ss_conf 878889999988766666675-420468847996164789999987545 Q gi|254780617|r 236 ASDPAPMQLLAPFAGCAMGEY-FRDNGYHALIAYDDLQKHAVAYRQLSL 283 (509) Q Consensus 236 a~~~~~~r~~ap~~a~aiAEy-fr~~G~~VLi~~Ddltr~A~a~reisl 283 (509) -|+. +...|+.+++++|-+-||-.++.+.=| T Consensus 99 -----------------~A~~~~~~~g~~tILFIDEIHRfNK~QQD~LL 130 (726) T PRK13341 99 -----------------AAKERLERHGKRTILFIDEVHRFNKAQQDALL 130 (726) T ss_pred -----------------HHHHHHHHCCCCEEEEEECHHHCCHHHHHHHH T ss_conf -----------------99999874599659998625425887899878 No 91 >TIGR02782 TrbB_P P-type conjugative transfer ATPase TrbB; InterPro: IPR014149 This entry represents TrbB, a protein, which is encoded in the trb locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for plasmid conjugative transfer . TrbB is a homologue of the vir system VirB11 ATPase , and the Flp pilus system ATPase TadA .. Probab=96.01 E-value=0.029 Score=35.47 Aligned_cols=205 Identities=20% Similarity=0.321 Sum_probs=117.7 Q ss_pred EECCCEEEEEEECCCCCC---CCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCH-------HCCCCC Q ss_conf 205985999980586686---7898889779878866794698300030023144664445444100-------101357 Q gi|254780617|r 65 NLEVDNVGVVILGSYKEI---SEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSC-------TEADAP 134 (509) Q Consensus 65 ~l~~d~v~i~~l~~~~gI---~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~-------i~~~~p 134 (509) -|+.+.|.=+++++...+ +-|..+.++|+.++.--++.+++-|=|.+|.-.|..-|+-..+ .| +++..| T Consensus 17 ~LdD~~vvEIMLNpDG~Lwve~Lg~G~~~~G~t~~~~~a~~Ii~~vA~~l~~~V~~~~PivegE-LPldflGsRFeGl~P 95 (315) T TIGR02782 17 ALDDPKVVEIMLNPDGKLWVERLGEGMEPLGKTVSPADAERIIGLVADYLGTEVDRDKPIVEGE-LPLDFLGSRFEGLLP 95 (315) T ss_pred HHCCCCEEEEEECCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCEEECCCCEEEEC-CCCCHHHCCCCCCCC T ss_conf 7379883899866987010500679730016611789999999999876446043578626610-751112011004687 Q ss_pred CCCC------CCC------CHHHHHCCHH---HHHCCCCCCC-CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 7311------266------0014441602---3310111156-8336552477886115899999998863024667876 Q gi|254780617|r 135 GIIQ------RQS------VCEPLSTGIK---AIDSLIPIGR-GQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDK 198 (509) Q Consensus 135 ~~~~------R~~------i~~~l~TGI~---~ID~l~pigr-GQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~ 198 (509) |+.+ |++ ..+-.+-||- ..|.|.-... =|.++|.||-|+|||||+ .+++|+-..- ....+- T Consensus 96 PVV~~p~F~IRkkA~~vfTLDdYV~~gimtaaQ~d~l~~Av~ar~NIlv~GGTGSGKTTLa-NAlla~I~~l--~~P~dR 172 (315) T TIGR02782 96 PVVEAPSFAIRKKAVRVFTLDDYVEAGIMTAAQRDVLREAVAARKNILVVGGTGSGKTTLA-NALLAEIAKL--NDPDDR 172 (315) T ss_pred CCCCCCCCHHCCCCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHCCCEEEECCCCCCHHHHH-HHHHHHHHHC--CCCCCE T ss_conf 7556551011102241047077764044557899999999971298899814588579999-9999988521--699961 Q ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCC--HHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHH Q ss_conf 06999951677678999998511467421158997288788--8999998876666667542046884799616478999 Q gi|254780617|r 199 VYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDP--APMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAV 276 (509) Q Consensus 199 v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~--~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~ 276 (509) ++. | |-.+|.+ |-.+-.|..-|+|+- ...+.+=. | +|.+=-. T Consensus 173 ~vI----i-EDT~ElQ----------C~A~N~V~lrT~d~~Gi~M~~LLk~----T----LRlRPDR------------- 216 (315) T TIGR02782 173 VVI----I-EDTAELQ----------CAAENLVALRTSDDVGISMTRLLKA----T----LRLRPDR------------- 216 (315) T ss_pred EEE----E-ECCHHHH----------HCCCCEEEEECCCCCCCCHHHHHHH----H----HCCCCCC------------- T ss_conf 899----8-5471320----------1378706874489866147888887----5----0058873------------- Q ss_pred HHHHHHHHCCCCCC----------CCCCCCCCHHHHHHH----HHHHHHCCCC Q ss_conf 99875453037875----------433375200124678----8887504544 Q gi|254780617|r 277 AYRQLSLLLRRPPG----------REAYPGDVFYLHSRL----LERAAKMSDA 315 (509) Q Consensus 277 a~reisl~~~~pp~----------~~gyp~~vf~~~s~l----~ERa~~~~~~ 315 (509) .+.||.=| .-|.||-+-.+|+.- |-|-.++-.+ T Consensus 217 ------I~VGEVRg~eAL~LLKAWNTGHPGGi~TiHAn~a~~aL~RLeQLi~E 263 (315) T TIGR02782 217 ------IIVGEVRGGEALDLLKAWNTGHPGGIATIHANNAKAALRRLEQLIAE 263 (315) T ss_pred ------EEEEEECCHHHHHHHHHHCCCCCCCCEEECCCCHHHHHHHHHHHHHH T ss_conf ------35630143899999864058995302033148866899999999985 No 92 >PRK09354 recA recombinase A; Provisional Probab=96.00 E-value=0.049 Score=33.86 Aligned_cols=135 Identities=20% Similarity=0.313 Sum_probs=78.0 Q ss_pred HHHHCCHHHHHCCCC---CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 144416023310111---15683365524778861158999999988630246678760699995167767899999851 Q gi|254780617|r 144 EPLSTGIKAIDSLIP---IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL 220 (509) Q Consensus 144 ~~l~TGI~~ID~l~p---igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l 220 (509) +.+.||...+|.-+- +-+|.=.-|+|+..+|||+|++.+|.+ . +. ..-+|+|.-. |..- -.++.+.+ T Consensus 39 ~~isTGsl~LD~aLGiGG~P~GRivEi~G~esSGKTtlal~~iae-a-Qk------~Gg~~a~iDa-E~al-d~~~a~~l 108 (350) T PRK09354 39 ETISTGSLALDIALGIGGLPKGRIVEIYGPESSGKTTLALHAIAE-A-QK------AGGTAAFIDA-EHAL-DPVYAKKL 108 (350) T ss_pred CEECCCCHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHHHHHHHH-H-HH------CCCCEEEEEE-CCCC-CHHHHHHC T ss_conf 654477178999875899678708999889877799999999999-9-97------5994799960-0027-98899984 Q ss_pred CCCCC-CCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCH Q ss_conf 14674-21158997288788899999887666666754204688479961647899999875453037875433375200 Q gi|254780617|r 221 EDRGA-LSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVF 299 (509) Q Consensus 221 ~~~~~-~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf 299 (509) |. +++ ++++..| .+++ ++.++|.+...|.-.++|+||+....- ..|+---++ +...| T Consensus 109 ---GVd~d~--llv~qpd--~~Eq------al~i~e~Lvrsg~vd~IVvDSVaAL~p-k~Eieg~mg-----d~~vG--- 166 (350) T PRK09354 109 ---GVDIDN--LLVSQPD--TGEQ------ALEIADALVRSGAVDLIVVDSVAALVP-KAEIEGEMG-----DSHVG--- 166 (350) T ss_pred ---CCCHHH--EEEECCC--HHHH------HHHHHHHHHHCCCCCEEEEECCCCCCC-HHHHHCCCC-----CCCHH--- T ss_conf ---977157--1785686--7999------999999998548841899825334576-888731335-----42263--- Q ss_pred HHHHHHHHHHHH Q ss_conf 124678888750 Q gi|254780617|r 300 YLHSRLLERAAK 311 (509) Q Consensus 300 ~~~s~l~ERa~~ 311 (509) +++||+-+|-| T Consensus 167 -~qARlmSqalR 177 (350) T PRK09354 167 -LQARLMSQALR 177 (350) T ss_pred -HHHHHHHHHHH T ss_conf -89999999999 No 93 >pfam00931 NB-ARC NB-ARC domain. Probab=95.94 E-value=0.032 Score=35.24 Aligned_cols=92 Identities=17% Similarity=0.126 Sum_probs=46.9 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHH Q ss_conf 36552477886115899999998863024667876069999516776789999985114674211589972887888999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQ 243 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r 243 (509) -++|.|-.|+||||||. .+.|..... ..- ..++++-+++..+ ..++.+.+.+.-.... ....... .. T Consensus 21 vI~I~G~gGiGKTtLA~-~v~~~~~i~----~~F-~~~~wv~vs~~~~-~~~i~~~i~~~l~~~~----~~~~~~~--~~ 87 (285) T pfam00931 21 VVGIVGMGGVGKTTLAK-QIYNDDSVG----GHF-DSVAWVVVSKTYT-EFRLQKDILQELGLDD----SDWVEKN--ES 87 (285) T ss_pred EEEEECCCCCCHHHHHH-HHHCCHHHH----HCC-CEEEEEEECCCCC-HHHHHHHHHHHHCCCC----CCCCCCC--HH T ss_conf 99988999563999999-997165565----059-8389999797666-8999999999856665----4555578--99 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECHHHH Q ss_conf 9988766666675420468847996164789 Q gi|254780617|r 244 LLAPFAGCAMGEYFRDNGYHALIAYDDLQKH 274 (509) Q Consensus 244 ~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~ 274 (509) .+ ...+.+++ +++..|+|+||+... T Consensus 88 ~l----~~~l~~~L--~~kr~LiVLDDVw~~ 112 (285) T pfam00931 88 EL----AVKIKEAL--LRKRFLLVLDDVWEK 112 (285) T ss_pred HH----HHHHHHHH--CCCCEEEEECCCCCH T ss_conf 99----99999997--279669996388878 No 94 >PTZ00035 Rad51; Provisional Probab=95.89 E-value=0.077 Score=32.44 Aligned_cols=132 Identities=17% Similarity=0.198 Sum_probs=70.8 Q ss_pred CCCCCCCHHHHHCCHHHHHCCCCCC--CCCEEEEECCCCCCCHHHHHHHHHHH-HHHCCCCCCCCCCEEEEEEC-C-CCH Q ss_conf 3112660014441602331011115--68336552477886115899999998-86302466787606999951-6-776 Q gi|254780617|r 136 IIQRQSVCEPLSTGIKAIDSLIPIG--RGQRELIIGDRKTGKTSIILDTFLNQ-KSSHDKGSEKDKVYCIYVAI-G-QKR 210 (509) Q Consensus 136 ~~~R~~i~~~l~TGI~~ID~l~pig--rGQR~~I~g~~g~GKt~l~~~~I~nq-~~~~~~~~~~~~v~~V~~~I-G-er~ 210 (509) .++|+.---.+.||-+.+|.|+-=| -|.-.=|||.+|+|||-||...-+|- .-...+... --|||.-- | =|. T Consensus 102 ~~~~R~~~~~itTGs~~LD~LLGGGiet~sITEi~Ge~gsGKTQlChqLaV~~QLP~~~GG~~---GkvvYIDTEgtFrp 178 (350) T PTZ00035 102 YHDARQNLIKFTTGSKQLDRLLKGGIETGGITELFGEFRTGKSQLCHTLAITCQLPIEQSGGE---GKCLWIDTEGTFRP 178 (350) T ss_pred HHHHHHHCEEECCCCHHHHHHHCCCEECCCEEEEEECCCCCHHHHHHHHHHHCCCCHHHCCCC---CCEEEEECCCCCCH T ss_conf 999986460357585788876278941385878972798978999999999904857777988---62799968899878 Q ss_pred HHHHHHHHH--HCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHH Q ss_conf 789999985--1146742115899728878889999988766666675420468847996164789999 Q gi|254780617|r 211 SSVARFVKA--LEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVA 277 (509) Q Consensus 211 ~ev~~~~~~--l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a 277 (509) +.+.++.+. +.....+++..+.-+-.. -.++..- .-++-++.+++=-||++||+|.|.++ T Consensus 179 eRi~qIA~~~gld~~~vL~nI~~ara~n~---ehq~~ll----~~~~~~~~e~~vrLlIVDSitalFR~ 240 (350) T PTZ00035 179 ERIVAIAKRYGLHPTDCLNNIAYAKAYNC---DHQTELL----IDASAMMADARFALLIVDSATALYRS 240 (350) T ss_pred HHHHHHHHHCCCCHHHHHHHEEEEEECCH---HHHHHHH----HHHHHHHHCCCEEEEEEHHHHHHHHH T ss_conf 99999998709997998533223220687---8899999----99999851167589985445667665 No 95 >PRK13342 recombination factor protein RarA; Reviewed Probab=95.84 E-value=0.041 Score=34.43 Aligned_cols=33 Identities=24% Similarity=0.364 Sum_probs=22.0 Q ss_pred HHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 3101111568336552477886115899999998 Q gi|254780617|r 153 IDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 153 ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) +..++--++=.-+.++|++||||||++- .|.++ T Consensus 28 L~~~i~~~~~~s~Il~GPPG~GKTTlA~-iiA~~ 60 (417) T PRK13342 28 LRRMIEAGRLSSMILWGPPGTGKTTLAR-IIAGA 60 (417) T ss_pred HHHHHHCCCCCEEEEECCCCCCHHHHHH-HHHHH T ss_conf 9999976999759988969998999999-99998 No 96 >PRK03992 proteasome-activating nucleotidase; Provisional Probab=95.66 E-value=0.047 Score=33.99 Aligned_cols=107 Identities=17% Similarity=0.163 Sum_probs=47.0 Q ss_pred EEEECCCCCCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCC--CCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCH Q ss_conf 9980586686789888977987886679469830003002314--46644454441001013577311266001444160 Q gi|254780617|r 73 VVILGSYKEISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPI--DGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGI 150 (509) Q Consensus 73 i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~Pl--Dg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI 150 (509) +...-+...+++|++|..-.+.++|- ..|..-+|++=.-+ +.+|.+.-++ |-+-- -..+.+.|.++-=+ T Consensus 82 ~~~~~~~~~l~pg~~V~l~~~~~~i~---~~l~~~~d~~v~~m~v~e~P~v~~~d---IGGl~---~~k~el~E~velPl 152 (390) T PRK03992 82 VSSFVDRDKLKPGARVALNQQTLAIV---EVLPSEKDPRVRAMEVIESPDVTYED---IGGLD---EQIREVREAVELPL 152 (390) T ss_pred CCCCCCHHHCCCCCEEEECCCCCEEE---EECCCCCCCCCCCCCCCCCCCCCHHH---HCCHH---HHHHHHHHHHHHHH T ss_conf 57876888879999899853530305---64688878621104214799998466---14989---99999999999986 Q ss_pred HHHHCCCCCCC--CCEEEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 23310111156--8336552477886115899999998863 Q gi|254780617|r 151 KAIDSLIPIGR--GQRELIIGDRKTGKTSIILDTFLNQKSS 189 (509) Q Consensus 151 ~~ID~l~pigr--GQR~~I~g~~g~GKt~l~~~~I~nq~~~ 189 (509) +-=+.+--+|- ---++++|++|||||.+| -++.|+.+. T Consensus 153 ~~pe~f~~~Gi~pPkGvLLyGPPGtGKTllA-kAvA~e~~~ 192 (390) T PRK03992 153 KNPELFEEVGIEPPKGVLLYGPPGTGKTLLA-KAVAHETNA 192 (390) T ss_pred HCHHHHHHCCCCCCCEEEEECCCCCCHHHHH-HHHHHHHCC T ss_conf 5989999769999972786898999789999-999987488 No 97 >PRK00411 cdc6 cell division control protein 6; Reviewed Probab=95.52 E-value=0.083 Score=32.19 Aligned_cols=268 Identities=17% Similarity=0.229 Sum_probs=117.5 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCC---CHHHHHHHHHHHCCCCCCCEEEEEEECC Q ss_conf 5683365524778861158999999988630246678760699995167---7678999998511467421158997288 Q gi|254780617|r 160 GRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQ---KRSSVARFVKALEDRGALSYSIVVVASA 236 (509) Q Consensus 160 grGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGe---r~~ev~~~~~~l~~~~~~~~tvvv~a~a 236 (509) .++..+.|.|.+|||||+.+. .+.++.... ..++..||+-+=. ..+-+.++++.|.... + ... T Consensus 53 ~~~~n~~I~G~pGTGKT~~vk-~v~~~l~~~-----~~~~~~vyINc~~~~t~~~i~~~i~~~L~~~~-------~-p~~ 118 (394) T PRK00411 53 SRPSNVLILGPPGTGKTTTVK-KVFEELEEA-----ALKVVYVYINCQIDRTRYAILSEIARSLFGHP-------P-PSS 118 (394) T ss_pred CCCCCEEEECCCCCCHHHHHH-HHHHHHHHH-----CCCCEEEEEECCCCCCHHHHHHHHHHHHCCCC-------C-CCC T ss_conf 999847998899998999999-999999974-----68965999969668989999999999956998-------9-877 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC Q ss_conf 78889999988766666675420468847996164789999987545303787543337520012467888875045445 Q gi|254780617|r 237 SDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDAL 316 (509) Q Consensus 237 ~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~ 316 (509) .-|. ..+. -.+-+++...++.++|++|.+.... .+ -+.++.|.++|+.|.-. T Consensus 119 G~s~-~~~~-----~~l~~~l~~~~~~~ivvLDEiD~L~--------------~~--~~~~vLY~L~r~~~~~~------ 170 (394) T PRK00411 119 GLSF-DELF-----DKIAEYLDERDRVLIVALDDINYLV--------------EK--EGNDVLYSLLRAHEEYP------ 170 (394) T ss_pred CCCH-HHHH-----HHHHHHHHCCCCEEEEEEECCCCCC--------------CC--CCCHHHHHHHHCCCCCC------ T ss_conf 8789-9999-----9999986166975899996554020--------------36--65089999985402268------ Q ss_pred CCCC-EEEEEEEECCCCCCCCCHHHHHHHHH-CCEEEEECCHH-----------HHCCCCCCCCCCCHHHCCCCCCCHHH Q ss_conf 8873-56753100367664431146688540-61699615012-----------21026654334441220343438988 Q gi|254780617|r 317 GAGS-LTALPVIETQVNDVSAYIPTNVISIT-DGQIFLETELF-----------YQGIRPAINIGLSVSRVGSAAQVKAM 383 (509) Q Consensus 317 g~GS-iT~~~~v~~~~~D~~~~i~~~~~si~-DG~i~l~~~l~-----------~~g~~Paid~~~S~SRv~~~~~~~~~ 383 (509) +| ++.+.+. .. -|+.+-...-+.|.+ .-.|+++..-+ ..+.+|-+- ...+....- T Consensus 171 --~~~~~vI~Is-N~-~~~~~~Ldprv~S~l~~~~i~F~PY~~~qL~~IL~~R~~~af~~gv~--------~~~~i~~~A 238 (394) T PRK00411 171 --GARIGVIGIS-SD-LTFLYILDPRVKSVFGPEEIVFPPYTADEIFDILKDRVEEGFYPGVV--------SDEVLELIA 238 (394) T ss_pred --CCCEEEEEEE-CC-CCHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHHCCCCCCC--------CHHHHHHHH T ss_conf --8738999997-68-71776640777502786289858999899999999999841455678--------978999999 Q ss_pred HHHH---HHHHHHHHHHHHHHHHHHH-HCC--CCHHHHHHHHHHH--HHHHHHCCCCCCCCCHHHHHHHHHHHHC---CC Q ss_conf 7755---6788879988767879876-338--9989999999999--9999966999887488999999999866---98 Q gi|254780617|r 384 KQVS---GAVKGELAQYREMSSFSKF-SSD--LDSSTQKFLSKGE--RLTELLKQPQFSPLAMEEQVVMIFAGIS---GC 452 (509) Q Consensus 384 ~~~a---~~lr~~laqy~Ele~f~~f-gsd--lD~~t~~~l~rg~--ri~e~LkQ~~~~p~~~~~qv~~l~a~~~---G~ 452 (509) .++| |..|..+--.+....++.- |++ ..+..+++.++.. ++.+.++. +|..+++ .|+|+.. .- T Consensus 239 ~~~a~~~GDaR~Aldllr~A~e~Ae~~g~~~Vt~~hV~~A~~~~~~~~~~~~i~~-----L~~~~kl-vL~ai~~~~~~~ 312 (394) T PRK00411 239 DLTGREHGDARVAIDLLRRAGEIAEREGSRKVTEEDVRKAYEKSEPVHLEEVLRT-----LPLHEKL-LLLAIVRLLEKG 312 (394) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHC-----CCHHHHH-HHHHHHHHHCCC T ss_conf 9985504758999999999999999718996589999999998600008899873-----8998999-999999985178 Q ss_pred CCCCCHHHHHHHHHHHHHHH------HHHHHHHHHHHHHCC Q ss_conf 06679899999999999999------741599999887327 Q gi|254780617|r 453 LDEVAVSQVRKFETNFLSHM------RVSSQDILEDIRKQK 487 (509) Q Consensus 453 ld~i~v~~I~~fe~~l~~~l------~~~~~ei~~~I~~~~ 487 (509) -..+...++.+....+.+.. .+...+++.++...+ T Consensus 313 ~~~~~~g~vy~~Y~~lc~~~~~~~ls~~~~~~~l~~L~~~g 353 (394) T PRK00411 313 APYVTTGEVYEEYKELCEELGYEPRSHTRFYEYLNKLDMLG 353 (394) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCC T ss_conf 87663899999999999973998887999999999998679 No 98 >PRK13631 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=95.45 E-value=0.1 Score=31.56 Aligned_cols=31 Identities=19% Similarity=0.432 Sum_probs=26.5 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 02331-01111568336552477886115899 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) ++|+| .=+.+-+|+..+|+|.+|+||||++. T Consensus 39 ~~AL~~Is~~i~~Ge~vaIIG~nGsGKSTL~~ 70 (320) T PRK13631 39 LVALNNISYTFEKNKIYFIIGNSGSGKSTLVT 70 (320) T ss_pred EEEEECCEEEECCCCEEEEECCCCCCHHHHHH T ss_conf 79861645588599899999499984999999 No 99 >COG1066 Sms Predicted ATP-dependent serine protease [Posttranslational modification, protein turnover, chaperones] Probab=95.42 E-value=0.19 Score=29.59 Aligned_cols=106 Identities=20% Similarity=0.270 Sum_probs=63.7 Q ss_pred HHHHCCHHHHHCCC--CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHC Q ss_conf 14441602331011--1156833655247788611589999999886302466787606999951677678999998511 Q gi|254780617|r 144 EPLSTGIKAIDSLI--PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALE 221 (509) Q Consensus 144 ~~l~TGI~~ID~l~--pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~ 221 (509) ..+.||+.-.|-.+ =+.+|+-++|=|++|.|||||+++...+ .+.. . .|.|+.=-|....++---+.|. T Consensus 73 ~Ri~tg~~EldRVLGGG~V~Gs~iLIgGdPGIGKSTLLLQva~~-lA~~------~--~vLYVsGEES~~QiklRA~RL~ 143 (456) T COG1066 73 PRISTGIEELDRVLGGGLVPGSVILIGGDPGIGKSTLLLQVAAR-LAKR------G--KVLYVSGEESLQQIKLRADRLG 143 (456) T ss_pred CCCCCCHHHHHHHHCCCCCCCCEEEECCCCCCCHHHHHHHHHHH-HHHC------C--CEEEEECCCCHHHHHHHHHHHC T ss_conf 42357818887653677236617997368987798999999999-8705------9--5799967767899999999828 Q ss_pred CCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHH Q ss_conf 46742115899728878889999988766666675420468847996164789 Q gi|254780617|r 222 DRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKH 274 (509) Q Consensus 222 ~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~ 274 (509) -. ...+.+.+-.+.....+.+ ++-+--++++||+-.. T Consensus 144 ~~---~~~l~l~aEt~~e~I~~~l-------------~~~~p~lvVIDSIQT~ 180 (456) T COG1066 144 LP---TNNLYLLAETNLEDIIAEL-------------EQEKPDLVVIDSIQTL 180 (456) T ss_pred CC---CCCEEEEHHCCHHHHHHHH-------------HHCCCCEEEEECCCEE T ss_conf 99---6455774112899999999-------------8549978999654123 No 100 >pfam01695 IstB IstB-like ATP binding protein. This protein contains an ATP/GTP binding P-loop motif. It is found associated with IS21 family insertion sequences. The function of this protein is unknown, but it may perform a transposase function. Probab=95.41 E-value=0.026 Score=35.85 Aligned_cols=54 Identities=19% Similarity=0.261 Sum_probs=35.9 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 011115683365524778861158999999988630246678760699995167767899999851146 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDR 223 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~ 223 (509) ++-=+.+||.+.++|++|+|||-|+. .|.+++..+. .-|.|.- +.++++.++.. T Consensus 40 ~~~~i~~~~Nlll~G~~GtGKThLA~-Ai~~~~~~~g-------~~v~f~~-------~~~L~~~l~~~ 93 (178) T pfam01695 40 GLDWIEQAENLLLLGPPGVGKTHLAC-ALGHQACRAG-------YSVLFTR-------TPDLVEQLKRA 93 (178) T ss_pred CCCCHHCCCCEEEECCCCCCHHHHHH-HHHHHHHHCC-------CEEEEEE-------CHHHHHHHHHH T ss_conf 59742158768998999987899999-9999999869-------8599996-------16799999987 No 101 >cd03238 ABC_UvrA The excision repair protein UvrA; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=95.41 E-value=0.073 Score=32.59 Aligned_cols=36 Identities=11% Similarity=0.249 Sum_probs=27.1 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 02331-0111156833655247788611589999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) +++++ .=+.|-+|+..+|+|.+|+|||||+.+.+.. T Consensus 8 ~~aL~~isl~i~~Ge~~~iiG~nGsGKSTLl~~~~~~ 44 (176) T cd03238 8 VHNLQNLDVSIPLNVLVVVTGVSGSGKSTLVNEGLYA 44 (176) T ss_pred EECCCCEEEEECCCCEEEEECCCCCCHHHHHHHHHHH T ss_conf 3546754878889989999999999899999988876 No 102 >COG2256 MGS1 ATPase related to the helicase subunit of the Holliday junction resolvase [DNA replication, recombination, and repair] Probab=95.03 E-value=0.08 Score=32.33 Aligned_cols=79 Identities=18% Similarity=0.181 Sum_probs=46.7 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCH Q ss_conf 68336552477886115899999998863024667876069999516776789999985114674211589972887888 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPA 240 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~ 240 (509) +=--+-+.|++|||||||+ .++.|... ..+ .. +.|+.+- T Consensus 47 ~l~SmIl~GPPG~GKTTlA--~liA~~~~-------~~f----~~--------------------------~sAv~~g-- 85 (436) T COG2256 47 HLHSMILWGPPGTGKTTLA--RLIAGTTN-------AAF----EA--------------------------LSAVTSG-- 85 (436) T ss_pred CCCEEEEECCCCCCHHHHH--HHHHHHHC-------CCE----EE--------------------------ECCCCCC-- T ss_conf 9860577789998888999--99987617-------766----99--------------------------5152346-- Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHH Q ss_conf 9999988766666675420468847996164789999987545 Q gi|254780617|r 241 PMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSL 283 (509) Q Consensus 241 ~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl 283 (509) ....+ -...-|+-++..|+.+++++|-+-||-.++.+.-| T Consensus 86 vkdlr---~i~e~a~~~~~~gr~tiLflDEIHRfnK~QQD~lL 125 (436) T COG2256 86 VKDLR---EIIEEARKNRLLGRRTILFLDEIHRFNKAQQDALL 125 (436) T ss_pred HHHHH---HHHHHHHHHHHCCCCEEEEEEHHHHCCHHHHHHHH T ss_conf 79999---99999999872588349987225333744565510 No 103 >COG1474 CDC6 Cdc6-related protein, AAA superfamily ATPase [DNA replication, recombination, and repair / Posttranslational modification, protein turnover, chaperones] Probab=95.00 E-value=0.13 Score=30.87 Aligned_cols=99 Identities=15% Similarity=0.211 Sum_probs=54.5 Q ss_pred CCCCCCCC---EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEC---CCCHHHHHHHHHHHCCCCCCCEE Q ss_conf 11115683---3655247788611589999999886302466787606999951---67767899999851146742115 Q gi|254780617|r 156 LIPIGRGQ---RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAI---GQKRSSVARFVKALEDRGALSYS 229 (509) Q Consensus 156 l~pigrGQ---R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~I---Ger~~ev~~~~~~l~~~~~~~~t 229 (509) |.|.-+|- -..|.|.+|||||+.+ ..+..|..... ..+.++|+-+ ..+..-..++++.+. T Consensus 33 l~~~~~~~~p~n~~iyG~~GTGKT~~~-~~v~~~l~~~~-----~~~~~~yINc~~~~t~~~i~~~i~~~~~-------- 98 (366) T COG1474 33 LAPALRGERPSNIIIYGPTGTGKTATV-KFVMEELEESS-----ANVEVVYINCLELRTPYQVLSKILNKLG-------- 98 (366) T ss_pred HHHHHCCCCCCCEEEECCCCCCHHHHH-HHHHHHHHHHC-----CCCCEEEEEEEECCCHHHHHHHHHHHHC-------- T ss_conf 999855899860799889998732899-99999997331-----5675799951307878799999999826-------- Q ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHH Q ss_conf 89972887888999998876666667542046884799616478999 Q gi|254780617|r 230 IVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAV 276 (509) Q Consensus 230 vvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~ 276 (509) ..|...+..-..=..+-+++..+++.+++++|.+..... T Consensus 99 --------~~p~~g~~~~~~~~~l~~~~~~~~~~~IvvLDEid~L~~ 137 (366) T COG1474 99 --------KVPLTGDSSLEILKRLYDNLSKKGKTVIVILDEVDALVD 137 (366) T ss_pred --------CCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCHHHHHC T ss_conf --------899767632689999999777418759999764765415 No 104 >TIGR02928 TIGR02928 orc1/cdc6 family replication initiation protein; InterPro: IPR014277 This set of DNA binding proteins shows homology to the origin recognition complex subunit 1/cell division control protein 6 family in eukaryotes. The proteins in this entry are found exclusively in the archaea. Several members may be found in a genome and interact with each other.. Probab=95.00 E-value=0.032 Score=35.16 Aligned_cols=126 Identities=25% Similarity=0.378 Sum_probs=73.9 Q ss_pred CCCCCCC-CCE---EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEE Q ss_conf 0111156-833---655247788611589999999886302466787606999951677678999998511467421158 Q gi|254780617|r 155 SLIPIGR-GQR---ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSI 230 (509) Q Consensus 155 ~l~pigr-GQR---~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tv 230 (509) .|-|+-+ |.+ +.|-|..|||||++ +.-+++|...........++..||+=|-.-.+...-+++-..+ T Consensus 32 ~L~~~l~PG~~P~Ni~iYGkTGtGKT~v-t~~v~~~l~~~~~~~d~~D~~~~~~NC~~~~T~y~~~~~L~~~-------- 102 (383) T TIGR02928 32 ALRPILRPGSRPSNIFIYGKTGTGKTAV-TKYVMKELEEAAEDRDVRDVSTVYINCQILDTSYQVLVELANQ-------- 102 (383) T ss_pred HHHHHHCCCCCCCCEEEECCCCCCHHHH-HHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHH-------- T ss_conf 9887506748987258878889878899-9999999999862269971589997785468469999999998-------- Q ss_pred EEEECCCCCHHHHHHHHHHHHHHHHHHH--------CCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCC-CCCCCCHHH Q ss_conf 9972887888999998876666667542--------0468847996164789999987545303787543-337520012 Q gi|254780617|r 231 VVVASASDPAPMQLLAPFAGCAMGEYFR--------DNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGRE-AYPGDVFYL 301 (509) Q Consensus 231 vv~a~a~~~~~~r~~ap~~a~aiAEyfr--------~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~-gyp~~vf~~ 301 (509) -+ .+..-.-.|++|.+-.|+|+ .++.-++|++|.+-++ . +-.+-+ +|.- +.|. T Consensus 103 -----ln-~~~~~~~vP~tG~s~~~~~~~l~~~l~~~~~~~~~ivLDEiD~L----------v-~~~~d~PAyS~-~LY~ 164 (383) T TIGR02928 103 -----LN-RRGSGEEVPTTGLSTSEVFRELYKELNRERGDSLIIVLDEIDKL----------V-RKDDDDPAYSK-LLYQ 164 (383) T ss_pred -----HC-CCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCHH----------H-CCCCCCHHHHH-HHHH T ss_conf -----51-57788889887787899999999998320188799986231022----------1-58888807878-8534 Q ss_pred HHHHHH Q ss_conf 467888 Q gi|254780617|r 302 HSRLLE 307 (509) Q Consensus 302 ~s~l~E 307 (509) +||.-+ T Consensus 165 L~Ra~~ 170 (383) T TIGR02928 165 LSRARE 170 (383) T ss_pred HHHHHH T ss_conf 331000 No 105 >pfam03796 DnaB_C DnaB-like helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the Escherichia coli chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=94.92 E-value=0.086 Score=32.07 Aligned_cols=42 Identities=29% Similarity=0.426 Sum_probs=35.2 Q ss_pred HHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 4416023310111-15683365524778861158999999988 Q gi|254780617|r 146 LSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 146 l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) +.||++.+|-++- +.+|+=..|-|.+|+|||++++.+..|-+ T Consensus 2 IpTG~~~lD~~~gGl~~G~l~vi~g~pg~GKS~~~~~~a~~~a 44 (186) T pfam03796 2 LPTGFTDLDQLTGGLQKGDLIIIAARPSMGKTAFALNIARNAA 44 (186) T ss_pred CCCCCHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 3888077886426988881799996799987999999999999 No 106 >pfam01637 Arch_ATPase Archaeal ATPase. This family contain a conserved P-loop motif that is involved in binding ATP. This family is almost exclusively found in archaebacteria and particularly in Methanococcus jannaschii that encodes sixteen members of this family. Probab=94.59 E-value=0.21 Score=29.32 Aligned_cols=97 Identities=23% Similarity=0.273 Sum_probs=46.4 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCH--HHHHHHH------HHHCCCCCCCEEEE Q ss_conf 568336552477886115899999998863024667876069999516776--7899999------85114674211589 Q gi|254780617|r 160 GRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKR--SSVARFV------KALEDRGALSYSIV 231 (509) Q Consensus 160 grGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~--~ev~~~~------~~l~~~~~~~~tvv 231 (509) +..+=+.|.|.+++|||+|+...+ +... .....+||...-+.. ..+.++. +.+.. ....... T Consensus 18 ~~~~~ivi~G~RR~GKTsLi~~~~-~~~~-------~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~--~~~~~~~ 87 (223) T pfam01637 18 GTYPIIVVYGPRRCGKTALLREFL-EELR-------ELGYRVIYYDPLRREFEEKLDRFEEAKRLAEALGD--ALPKIGI 87 (223) T ss_pred CCCCEEEEECCCCCCHHHHHHHHH-HHCC-------CCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHHH--HCCCCCC T ss_conf 997189998688787999999999-8633-------46852899951444379999988888999999876--5123322 Q ss_pred EEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHH Q ss_conf 97288788899999887666666754204688479961647899 Q gi|254780617|r 232 VVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHA 275 (509) Q Consensus 232 v~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A 275 (509) ..+. ....+--.+-++..+.|++++|++|.....+ T Consensus 88 ------~~~~---~~~~~l~~~~~~l~~~~~~~iiviDEfq~l~ 122 (223) T pfam01637 88 ------AKSK---LAFLSLTLLFELLKRKGKKIAIIIDEVQYAI 122 (223) T ss_pred ------CCCC---CHHHHHHHHHHHHHHCCCCEEEEEECHHHHH T ss_conf ------2112---0788999999999855996599970167764 No 107 >COG1124 DppF ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism] Probab=94.50 E-value=0.024 Score=36.10 Aligned_cols=30 Identities=27% Similarity=0.503 Sum_probs=24.8 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 2331-01111568336552477886115899 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) ++++ .=+.|.+|...+|+|.+|+|||||+- T Consensus 21 ~~l~~VS~~i~~Ge~lgivGeSGsGKSTL~r 51 (252) T COG1124 21 HALNNVSLEIERGETLGIVGESGSGKSTLAR 51 (252) T ss_pred HHHCCEEEEECCCCEEEEECCCCCCHHHHHH T ss_conf 4414325996489789998489898889999 No 108 >COG4172 ABC-type uncharacterized transport system, duplicated ATPase component [General function prediction only] Probab=94.45 E-value=0.058 Score=33.33 Aligned_cols=31 Identities=29% Similarity=0.653 Sum_probs=17.1 Q ss_pred HHHHHCC-CCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 0233101-111568336552477886115899 Q gi|254780617|r 150 IKAIDSL-IPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 150 I~~ID~l-~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) ++++|.+ +++-+||..||.|.+|+||||+.. T Consensus 300 ~~AVd~isl~L~~gqTlGlVGESGSGKsTlG~ 331 (534) T COG4172 300 LRAVDGISLTLRRGQTLGLVGESGSGKSTLGL 331 (534) T ss_pred EEEECCCEEEECCCCEEEEEECCCCCCCHHHH T ss_conf 07752543675389767777058898115999 No 109 >COG2255 RuvB Holliday junction resolvasome, helicase subunit [DNA replication, recombination, and repair] Probab=94.15 E-value=0.083 Score=32.18 Aligned_cols=52 Identities=21% Similarity=0.212 Sum_probs=32.1 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCC Q ss_conf 3655247788611589999999886302466787606999951677678999998511467 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRG 224 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~ 224 (509) -+++||++|+|||||+. .|.|--+.+.... =+-.=||..+.+.+...|++.+ T Consensus 54 HvLl~GPPGlGKTTLA~-IIA~Emgvn~k~t--------sGp~leK~gDlaaiLt~Le~~D 105 (332) T COG2255 54 HVLLFGPPGLGKTTLAH-IIANELGVNLKIT--------SGPALEKPGDLAAILTNLEEGD 105 (332) T ss_pred EEEEECCCCCCHHHHHH-HHHHHHCCCEEEC--------CCCCCCCHHHHHHHHHCCCCCC T ss_conf 47864799876888999-9999856773763--------6620157265999986398677 No 110 >PRK09519 recA recombinase A; Reviewed Probab=94.06 E-value=0.43 Score=27.02 Aligned_cols=20 Identities=15% Similarity=0.213 Sum_probs=8.9 Q ss_pred HHHHHHCCCCCHHHHHHHHH Q ss_conf 99887327989899999999 Q gi|254780617|r 480 LEDIRKQKVLTDDIRSKLIN 499 (509) Q Consensus 480 ~~~I~~~~~l~de~~~~L~~ 499 (509) .+.+.++.+.-+|+|+++|+ T Consensus 747 rnfL~~N~d~AdEIEKkike 766 (790) T PRK09519 747 RNFLVENADVADEIEKKIKE 766 (790) T ss_pred HHHHHHCHHHHHHHHHHHHH T ss_conf 88877490377999999998 No 111 >pfam05496 RuvB_N Holliday junction DNA helicase ruvB N-terminus. The RuvB protein makes up part of the RuvABC revolvasome which catalyses the resolution of Holliday junctions that arise during genetic recombination and DNA repair. Branch migration is catalysed by the RuvB protein that is targeted to the Holliday junction by the structure specific RuvA protein. This family contains the N-terminal region of the protein. Probab=94.06 E-value=0.13 Score=30.66 Aligned_cols=28 Identities=7% Similarity=0.156 Sum_probs=19.1 Q ss_pred HHHHHCCCCCEEEEEECHHHHHHHHHHH Q ss_conf 6754204688479961647899999875 Q gi|254780617|r 254 GEYFRDNGYHALIAYDDLQKHAVAYRQL 281 (509) Q Consensus 254 AEyfr~~G~~VLi~~Ddltr~A~a~rei 281 (509) ++.+..-++..++++|.+-|+-.++.+. T Consensus 93 ~~~l~~~~~~~ILFIDEIHr~nK~qqd~ 120 (234) T pfam05496 93 AAILTNLEPGDVLFIDEIHRLNRAVEEI 120 (234) T ss_pred HHHHHHCCCCCEEEEECHHHCCHHHHHH T ss_conf 9999845899889996654358768874 No 112 >TIGR01447 recD exodeoxyribonuclease V, alpha subunit; InterPro: IPR006344 This family describes the exodeoxyribonuclease V alpha subunit, RecD. RecD is part of a RecBCD complex. C-terminal part of the protein matches a domain found in viral RNA helicase, superfamily 1, IPR000606 from INTERPRO. ; GO: 0008854 exodeoxyribonuclease V activity, 0009338 exodeoxyribonuclease V complex. Probab=94.03 E-value=0.094 Score=31.80 Aligned_cols=109 Identities=17% Similarity=0.177 Sum_probs=68.9 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHC-----CCCCCCEEEE Q ss_conf 11156833655247788611589999999886302466787606999951677678999998511-----4674211589 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALE-----DRGALSYSIV 231 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~-----~~~~~~~tvv 231 (509) -..-++.=.+|.|||||||||.+.-+|..-..+-........-|-+.+--|.=..-.+|.+..-. +.++.+.-++ T Consensus 237 ~~aL~~~f~li~GGPGTGKTTTv~~LL~al~~~~~~~G~~~~~I~l~APTGKAa~RL~esl~~~~~~L~~~~~aid~~~~ 316 (753) T TIGR01447 237 ALALKSNFSLITGGPGTGKTTTVARLLLALIKQSPKQGKPRLRIALAAPTGKAAARLAESLRKAVKKLAAENMAIDEDLI 316 (753) T ss_pred HHHHHCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH T ss_conf 99860876899879889778999999999999898649974047886684479999999999886322342366587985 Q ss_pred EEECCCCCHHHHHHHHHHHHH-HHHHHHC-C----CCCEEEEEE Q ss_conf 972887888999998876666-6675420-4----688479961 Q gi|254780617|r 232 VVASASDPAPMQLLAPFAGCA-MGEYFRD-N----GYHALIAYD 269 (509) Q Consensus 232 v~a~a~~~~~~r~~ap~~a~a-iAEyfr~-~----G~~VLi~~D 269 (509) ..-++.....+|++..+ . --+.||+ + -.||| |+| T Consensus 317 ~~~~~~~~TiHrLLG~~---~I~~~~fr~h~~N~L~~DVL-vvD 356 (753) T TIGR01447 317 AALPSEATTIHRLLGIK---PIDTKRFRHHERNPLPLDVL-VVD 356 (753) T ss_pred CCCCHHHHHHHHHHCCC---CCCCCCCCCCCCCCCCCCEE-EEC T ss_conf 48720456888861661---47876776777788985527-870 No 113 >cd03291 ABCC_CFTR1 The CFTR subfamily domain 1. The cystic fibrosis transmembrane regulator (CFTR), the product of the gene mutated in patients with cystic fibrosis, has adapted the ABC transporter structural motif to form a tightly regulated anion channel at the apical surface of many epithelia. Use of the term assembly of a functional ion channel implies the coming together of subunits, or at least smaller not-yet functional components of the active whole. In fact, on the basis of current knowledge only the CFTR polypeptide itself is required to form an ATP- and protein kinase A-dependent low-conductance chloride channel of the type present in the apical membrane of many epithelial cells. CFTR displays the typical organization (IM-ABC)2 and carries a characteristic hydrophilic R-domain that separates IM1-ABC1 from IM2-ABC2. Probab=93.90 E-value=0.042 Score=34.37 Aligned_cols=29 Identities=31% Similarity=0.431 Sum_probs=24.1 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.|-+||+++|+|.+|+|||||+ ..|. T Consensus 56 Isf~I~~Ge~vaIVG~sGSGKSTLl-~lL~ 84 (282) T cd03291 56 INLKIEKGEMLAITGSTGSGKTSLL-MLIL 84 (282) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 4899849999999999998199999-9995 No 114 >PRK05595 replicative DNA helicase; Provisional Probab=93.80 E-value=0.15 Score=30.39 Aligned_cols=43 Identities=21% Similarity=0.340 Sum_probs=35.8 Q ss_pred HHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 44416023310111-15683365524778861158999999988 Q gi|254780617|r 145 PLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 145 ~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) -+.||++.+|.++- +.+|+=+.|-|-+|.|||++++.+..|-+ T Consensus 183 Gi~TGf~~LD~~t~Gl~~GdLiiiaaRP~mGKTa~alnia~~~a 226 (444) T PRK05595 183 GVASGFRELDAKTSGFQKGDMILIAARPSMGKTTFALNIAEYAA 226 (444) T ss_pred CCCCCHHHHHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH T ss_conf 76588476998745998577799985798980799999999999 No 115 >TIGR02168 SMC_prok_B chromosome segregation protein SMC; InterPro: IPR011890 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . The smc gene is often associated with scpB (IPR005234 from INTERPRO) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle , , .; GO: 0005515 protein binding, 0005524 ATP binding, 0005694 chromosome. Probab=93.78 E-value=0.19 Score=29.56 Aligned_cols=23 Identities=26% Similarity=0.309 Sum_probs=19.4 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 833655247788611589999999 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~n 185 (509) +.=.+|+||-|||||-| +|+|.| T Consensus 23 ~g~TgiVGPNGcGKSNi-~DAiRW 45 (1191) T TIGR02168 23 KGITGIVGPNGCGKSNI-VDAIRW 45 (1191) T ss_pred CCEEEEECCCCCCCCCH-HHHHHH T ss_conf 85279862799870018-999998 No 116 >cd00544 CobU Adenosylcobinamide kinase / adenosylcobinamide phosphate guanyltransferase (CobU). CobU is bifunctional cobalbumin biosynthesis enzymes which display adenosylcobinamide kinase and adenosylcobinamide phosphate guanyltransferase activity. This enzyme is a homotrimer with a propeller-like shape. Probab=93.71 E-value=0.58 Score=26.09 Aligned_cols=86 Identities=13% Similarity=0.163 Sum_probs=49.0 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCH-HHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHH Q ss_conf 6552477886115899999998863024667876069999516776-789999985114674211589972887888999 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKR-SSVARFVKALEDRGALSYSIVVVASASDPAPMQ 243 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~-~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r 243 (509) ++|.||+..|||..|-..+..+ ...+ +|++-++.. .|+++=++..++.-.-...++=. +- T Consensus 2 iLVtGG~rSGKS~~AE~la~~~---------~~~~--~YiAT~~~~D~Em~~RI~~Hr~~R~~~w~TiE~-~~------- 62 (169) T cd00544 2 ILVTGGARSGKSRFAERLAAEL---------GGPV--TYIATAEAFDDEMAERIARHRKRRPAHWRTIET-PR------- 62 (169) T ss_pred EEEECCCCCCHHHHHHHHHHHC---------CCCC--EEEECCCCCCHHHHHHHHHHHHCCCCCCEEEEC-CC------- T ss_conf 8997786636899999999845---------9981--999788988989999999999668999669963-44------- Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHH Q ss_conf 9988766666675420468847996164789999 Q gi|254780617|r 244 LLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVA 277 (509) Q Consensus 244 ~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a 277 (509) .+++.++....+-.+++|+||.|..- T Consensus 63 --------~l~~~l~~~~~~~~vLiDclt~wl~N 88 (169) T cd00544 63 --------DLVSALKELDPGDVVLIDCLTLWVTN 88 (169) T ss_pred --------CHHHHHHHCCCCCEEEEECHHHHHHH T ss_conf --------39999985598885998607899999 No 117 >PRK10636 putative ABC transporter ATP-binding protein; Provisional Probab=93.68 E-value=0.04 Score=34.49 Aligned_cols=30 Identities=20% Similarity=0.365 Sum_probs=24.9 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 0111156833655247788611589999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .=+.|.+|+|++|+|+.|+|||||+ .+|.. T Consensus 331 vsl~i~~GeriaIvG~NGsGKSTLl-k~L~G 360 (638) T PRK10636 331 IKLNLVPGSRIGLLGRNGAGKSTLI-KLLAG 360 (638) T ss_pred CCCEECCCCEEEEECCCCCCHHHHH-HHHCC T ss_conf 7505637847999747871388999-99728 No 118 >PRK08506 replicative DNA helicase; Provisional Probab=93.62 E-value=0.15 Score=30.41 Aligned_cols=41 Identities=27% Similarity=0.536 Sum_probs=33.3 Q ss_pred HHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 4416023310111-1568336552477886115899999998 Q gi|254780617|r 146 LSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 146 l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) +.||++.+|.++. +-+|+=+.|-|-+++|||++++.+..|- T Consensus 176 i~TGf~~LD~~t~Gl~~gdLiIIAARPsmGKTAfAlniA~~~ 217 (473) T PRK08506 176 LDTGFKQLNKMTKGFNKGDLIIIAARPSMGKTTLVLNMVLKA 217 (473) T ss_pred CCCCHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHH T ss_conf 778808788872699856279995079986789999999999 No 119 >cd03253 ABCC_ATM1_transporter ATM1 is an ABC transporter that is expressed in the mitochondria. Although the specific function of ATM1 is unknown, its disruption results in the accumulation of excess mitochondrial iron, loss of mitochondrial cytochromes, oxidative damage to mitochondrial DNA, and decreased levels of cytosolic heme proteins. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=93.56 E-value=0.048 Score=33.92 Aligned_cols=29 Identities=24% Similarity=0.492 Sum_probs=23.7 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+-+||+.+|+|.+|+|||||+ ..+. T Consensus 20 isl~i~~Ge~v~ivG~sGsGKSTLl-~ll~ 48 (236) T cd03253 20 VSFTIPAGKKVAIVGPSGSGKSTIL-RLLF 48 (236) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHC T ss_conf 6899869999999999999899999-9974 No 120 >PRK09165 replicative DNA helicase; Provisional Probab=93.53 E-value=0.18 Score=29.71 Aligned_cols=45 Identities=22% Similarity=0.393 Sum_probs=35.9 Q ss_pred HHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 44416023310111-1568336552477886115899999998863 Q gi|254780617|r 145 PLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQKSS 189 (509) Q Consensus 145 ~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~ 189 (509) -+.||++.+|.++- +-+|+=+-|=|-+++|||++++.+..|-+.. T Consensus 187 Gi~TGf~~LD~~t~G~~~GdLiIIAARPsmGKTafaLniA~n~A~~ 232 (484) T PRK09165 187 GISTGLRDLDSKLGGLHPSDLIILAGRPSMGKTALATNIAFNAAKA 232 (484) T ss_pred CEECCCHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHH T ss_conf 3315844599871588877379996079997789999999999987 No 121 >PRK09473 oppD oligopeptide transporter ATP-binding component; Provisional Probab=93.49 E-value=0.047 Score=33.98 Aligned_cols=54 Identities=19% Similarity=0.375 Sum_probs=38.4 Q ss_pred CCCCCCCCCCCCHHHHHC---CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 135773112660014441---602331-0111156833655247788611589999999 Q gi|254780617|r 131 ADAPGIIQRQSVCEPLST---GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 131 ~~~p~~~~R~~i~~~l~T---GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) ...|++++-+-++--+.| .++|+| .=+.+.+|+..+|+|.+|+|||+++. +|+. T Consensus 7 ~~~pplLeV~nL~v~f~~~~g~v~av~~Vsf~i~~GEilgivGeSGsGKSTl~~-~i~g 64 (330) T PRK09473 7 ATVPLLLDVKDLRVTFSTPDGDVTAVNDLNFSLRAGETLGIVGESGSGKSQTAF-ALMG 64 (330) T ss_pred CCCCCEEEEECEEEEECCCCCCEEEEECCEEEECCCCEEEEECCCCCHHHHHHH-HHHC T ss_conf 998847999565999648995289866747688899899998689877999999-9976 No 122 >PRK11174 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed Probab=93.49 E-value=0.05 Score=33.78 Aligned_cols=20 Identities=5% Similarity=0.064 Sum_probs=10.1 Q ss_pred CCCCCHHHHHHHHHHHHHHH Q ss_conf 27989899999999999998 Q gi|254780617|r 486 QKVLTDDIRSKLINEIKVFL 505 (509) Q Consensus 486 ~~~l~de~~~~L~~~i~~~~ 505 (509) +-.++.+.|..+.+.+++.. T Consensus 513 TSaLD~~tE~~i~~~L~~~~ 532 (588) T PRK11174 513 TASLDAHSEQLVMQALNAAS 532 (588) T ss_pred CCCCCHHHHHHHHHHHHHHC T ss_conf 77989999999999999867 No 123 >COG1222 RPT1 ATP-dependent 26S proteasome regulatory subunit [Posttranslational modification, protein turnover, chaperones] Probab=93.44 E-value=0.19 Score=29.65 Aligned_cols=109 Identities=19% Similarity=0.181 Sum_probs=50.9 Q ss_pred EEEEEECCCCCCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCC--CCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHC Q ss_conf 999980586686789888977987886679469830003002314--466444544410010135773112660014441 Q gi|254780617|r 71 VGVVILGSYKEISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPI--DGKGPIKCEQRSCTEADAPGIIQRQSVCEPLST 148 (509) Q Consensus 71 v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~Pl--Dg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~T 148 (509) |.++.+-+...|++|++|-.-.+.++|- ..|-+=.||+=.-+ |.+|.+.-.+ +-+-- .+-+-+.|.++- T Consensus 99 V~i~~~vd~~~L~pG~rVal~~~s~~Iv---~vLp~~~Dp~V~~M~v~e~PdvtY~d---IGGL~---~Qi~EirE~VEL 169 (406) T COG1222 99 VNILSFVDRDLLEPGMRVALNRDSYSIV---RVLPPEVDPRVSVMEVEEKPDVTYED---IGGLD---EQIQEIREVVEL 169 (406) T ss_pred EECCCCCCHHHCCCCCEEEECCCCCEEE---EECCCCCCCHHHEEEECCCCCCCHHH---CCCHH---HHHHHHHHHHCC T ss_conf 7346876987869988899857761456---61787557620200112589878653---35889---999999998403 Q ss_pred CHHHHHCCCCCCCC--CEEEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 60233101111568--336552477886115899999998863 Q gi|254780617|r 149 GIKAIDSLIPIGRG--QRELIIGDRKTGKTSIILDTFLNQKSS 189 (509) Q Consensus 149 GI~~ID~l~pigrG--QR~~I~g~~g~GKt~l~~~~I~nq~~~ 189 (509) =++-=+-|-.+|-- .-+++.|++|||||-| .-++.||..+ T Consensus 170 PL~~PElF~~~GI~PPKGVLLYGPPGTGKTLL-AkAVA~~T~A 211 (406) T COG1222 170 PLKNPELFEELGIDPPKGVLLYGPPGTGKTLL-AKAVANQTDA 211 (406) T ss_pred CCCCHHHHHHCCCCCCCCEEEECCCCCCHHHH-HHHHHHCCCC T ss_conf 36688899974999997127668999758899-9998720586 No 124 >TIGR03600 phage_DnaB phage replicative helicase, DnaB family, HK022 subfamily. Members of this family are phage (or prophage-region) homologs of the bacterial homohexameric replicative helicase DnaB. Some phage may rely on host DnaB, while others encode their own verions. This model describes the largest phage-specific clade among the close homologs of DnaB, but there are, or course, other DnaB homologs from phage that fall outside the scope of this model. Probab=93.41 E-value=0.19 Score=29.62 Aligned_cols=43 Identities=30% Similarity=0.376 Sum_probs=34.5 Q ss_pred HHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 44416023310111-15683365524778861158999999988 Q gi|254780617|r 145 PLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 145 ~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) -+.||++.+|-++- +.+|+=+-|-|.+|+|||++++.+..|-+ T Consensus 176 Gi~TG~~~LD~~~~Gl~~g~LiIiaARPsmGKTafalnia~n~A 219 (421) T TIGR03600 176 GLSTGLPKLDRLTNGLVKGDLIVIGARPSMGKTTLALNIAENVA 219 (421) T ss_pred CCCCCCHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHHHH T ss_conf 78788078999836999886899985467874599999999999 No 125 >PRK11147 ABC transporter ATPase component; Reviewed Probab=93.37 E-value=0.05 Score=33.80 Aligned_cols=32 Identities=31% Similarity=0.528 Sum_probs=25.9 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 101111568336552477886115899999998 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) |.=+.|-+|+|++|+|..|+|||||+ .+|... T Consensus 337 ~vsl~i~~Ge~ialvG~NGsGKSTLl-k~l~G~ 368 (632) T PRK11147 337 DFSAQVQRGDKIALIGPNGCGKTTLL-KLMLGQ 368 (632) T ss_pred EECCCCCCCCEEEEECCCCCCHHHHH-HHHHCC T ss_conf 53333578877999889884277999-986066 No 126 >PRK08082 consensus Probab=93.33 E-value=0.21 Score=29.21 Aligned_cols=43 Identities=21% Similarity=0.346 Sum_probs=35.6 Q ss_pred HHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 44416023310111-15683365524778861158999999988 Q gi|254780617|r 145 PLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 145 ~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) -+.||++.+|.++. +.+|+=+.|-|-+++|||++++.+..|-+ T Consensus 185 Gi~TGf~~LD~lt~G~~~g~LiviaaRPsmGKTa~alnia~~~a 228 (453) T PRK08082 185 GIPTGFTELDRMTAGFQRNDLIIVAARPSVGKTAFALNIAQNVA 228 (453) T ss_pred CCCCCHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 55488488886414777585799986788757899999999999 No 127 >PRK10733 hflB ATP-dependent metalloprotease; Reviewed Probab=93.30 E-value=0.086 Score=32.07 Aligned_cols=63 Identities=17% Similarity=0.275 Sum_probs=33.4 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEE Q ss_conf 655247788611589999999886302466787606999951677678999998511467421158997 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVV 233 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~ 233 (509) +++.|++|||||-|+ -++.+.++..-..-...+++..|++.|.+. |+++.+.-++. .-++++. T Consensus 188 vLL~GPPGtGKTlLA-kAvAgEa~vpF~~~sgsef~e~~vGvga~r--VR~lF~~Ar~~---aP~IIFI 250 (644) T PRK10733 188 VLMVGPPGTGKTLLA-KAIAGEAKVPFFTISGSDFVEMFVGVGASR--VRDMFEQAKKA---APCIIFI 250 (644) T ss_pred EEEECCCCCCHHHHH-HHHHCCCCCEEEEEEHHHHHHEEEECCHHH--HHHHHHHHHHC---CCEEEEE T ss_conf 177798998778999-998645598089978477302225306899--99999999966---9979999 No 128 >TIGR02868 CydC ABC transporter, CydDC cysteine exporter (CydDC-E) family, permease/ATP-binding protein CydC; InterPro: IPR014223 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents CydC, a member of a heterodimeric ATP-binding cassette-type transporter (ABC transporter). It is involved in the export of glutathione from the cytoplasm to the periplasm and is required for the assembly of both cytochrome c and cytochrome bd , , .; GO: 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0016021 integral to membrane. Probab=93.28 E-value=0.055 Score=33.51 Aligned_cols=39 Identities=23% Similarity=0.500 Sum_probs=26.4 Q ss_pred CCCCHHH-HHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 2660014-44160233101111568336552477886115899999 Q gi|254780617|r 139 RQSVCEP-LSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 139 R~~i~~~-l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) +.|-+++ -..||. +.+--|.|+.|.|.+|+|||||+. ++ T Consensus 368 ~~pg~~~~vl~~V~-----L~l~~G~r~Ai~G~SG~GKsTLL~-~L 407 (566) T TIGR02868 368 GYPGAPPNVLDGVS-----LDLPPGERVAILGPSGSGKSTLLA-TL 407 (566) T ss_pred ECCCCCHHHHCCCC-----CCCCCCCCEEEECCCCCCHHHHHH-HH T ss_conf 26987346542786-----411388608986688765789999-99 No 129 >pfam07088 GvpD GvpD gas vesicle protein. This family consists of several archaeal GvpD gas vesicle proteins. GvpD is thought to be involved in the regulation of gas vesicle formation. Probab=93.25 E-value=0.27 Score=28.49 Aligned_cols=161 Identities=22% Similarity=0.309 Sum_probs=85.3 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEE Q ss_conf 01111568336552477886115899999998863024667876069999516776789999985114674211589972 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVA 234 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a 234 (509) -|+...-||-.+|=|-+|||||.+.++.. +-. ....+| .|+.----...|.+.+ +..++.++++-++-. T Consensus 3 rFF~~~~G~tLlIkG~PGTGKTlfTl~~L-d~l------~~~~~v--lYvStRvdqd~V~e~y--f~~~~~~~~t~ilD~ 71 (484) T pfam07088 3 RFFDGDFGKTLLINGAPGTGKTLFTIRGL-DVL------RRHHDV--LYVSTRVDQETVHEMY--FEGHGSLDKTAILDL 71 (484) T ss_pred CCCCCCCCCEEEEECCCCCCCEEEEEEHH-HHH------HCCCCE--EEEEECCCHHHHHHHH--HHCCCCCCHHHHHHH T ss_conf 00158998479993699987238985737-788------516976--9996323778887740--200255575665453 Q ss_pred CCCCCHHHHH---HHHHHHH---HHHHHHH---CCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH Q ss_conf 8878889999---9887666---6667542---04688479961647899999875453037875433375200124678 Q gi|254780617|r 235 SASDPAPMQL---LAPFAGC---AMGEYFR---DNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRL 305 (509) Q Consensus 235 ~a~~~~~~r~---~ap~~a~---aiAEyfr---~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l 305 (509) + .+ +.+.. ..|+--+ ++-|+.- .-++.-.+++||-....++.-+ |.--|+|.+.+.-+| T Consensus 72 ~-qD-~f~l~~~~~vP~~~l~l~sl~e~vd~i~a~~er~~Ia~DSW~~i~eyLa~----------r~d~pe~f~~~~n~l 139 (484) T pfam07088 72 L-QD-PFILPTDVDVPFEKLNLDSLLEWVDAINAAGERLTIAFDSWELIYEYLAE----------RHDIPPDILTVTNQL 139 (484) T ss_pred C-CC-CHHCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHH----------HHCCCHHHHHHHHHH T ss_conf 0-26-01063111366542355579999987641245747998328999998777----------616846675666567 Q ss_pred HHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHH Q ss_conf 8887504544588735675310036766443114668854 Q gi|254780617|r 306 LERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISI 345 (509) Q Consensus 306 ~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si 345 (509) ++=|-. . | +-.+-+.|+..++-=+||+|.+.+. T Consensus 140 v~lar~----~--g-~~Li~v~E~~~~~~LeYivDGVVtL 172 (484) T pfam07088 140 VALARG----S--G-ARLVLVLETAQNSRLEYIVDGVVTL 172 (484) T ss_pred HHHHHH----C--C-CEEEEEEECCCCCCCHHEEEEEEEE T ss_conf 776652----5--9-4599998236677512123038999 No 130 >CHL00176 ftsH cell division protein; Validated Probab=93.23 E-value=0.047 Score=33.99 Aligned_cols=22 Identities=27% Similarity=0.444 Sum_probs=16.3 Q ss_pred EEEECCCCCCCHHHHHHHHHHHH Q ss_conf 65524778861158999999988 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) +++.|++|||||-|| -++.+.+ T Consensus 213 vLL~GpPGTGKTlLA-kAvAgEa 234 (631) T CHL00176 213 VLLVGPPGTGKTLLA-KAIAGEA 234 (631) T ss_pred EEEECCCCCCHHHHH-HHHHCCC T ss_conf 898898998788999-9985655 No 131 >PRK11819 putative ABC transporter ATP-binding protein; Reviewed Probab=93.20 E-value=0.057 Score=33.38 Aligned_cols=25 Identities=16% Similarity=0.307 Sum_probs=18.2 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 0111156833655247788611589 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) .=+.|.+|+|.+|+|..|+|||||+ T Consensus 26 vsl~i~~Ge~vgLvG~NGaGKSTLl 50 (556) T PRK11819 26 ISLSFFPGAKIGVLGLNGAGKSTLL 50 (556) T ss_pred CEEEECCCCEEEEECCCCCHHHHHH T ss_conf 1899968989999999997199999 No 132 >PRK06749 replicative DNA helicase; Provisional Probab=93.19 E-value=0.2 Score=29.46 Aligned_cols=43 Identities=19% Similarity=0.295 Sum_probs=34.8 Q ss_pred HHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 44416023310111-15683365524778861158999999988 Q gi|254780617|r 145 PLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 145 ~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) -+.||++.+|-++. +-+|+=+-|=|-++.|||++++.+..|-+ T Consensus 168 Gi~TGf~~LD~lt~Gl~~g~LiviaaRPsmGKTa~alnia~~~a 211 (428) T PRK06749 168 GIETGYTSLNKMTCGLQEGDFVVLGARPSMGKTAFALNVGLHAA 211 (428) T ss_pred CCCCCHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 68778388987524999886899962798976899999999999 No 133 >PRK08006 replicative DNA helicase; Provisional Probab=93.17 E-value=0.21 Score=29.30 Aligned_cols=43 Identities=14% Similarity=0.241 Sum_probs=34.9 Q ss_pred HHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 44416023310111-15683365524778861158999999988 Q gi|254780617|r 145 PLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 145 ~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) -+.||++.+|.++. +-+|+=+.|=|-++.|||++++.+..|-+ T Consensus 206 Gi~TGf~~LD~~t~Gl~~G~LiviaaRPsmGKTalalnia~~~a 249 (471) T PRK08006 206 GVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTFAMNLCENAA 249 (471) T ss_pred CCCCCHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 36688388986416882173899994699876999999999999 No 134 >PRK05748 replicative DNA helicase; Provisional Probab=93.13 E-value=0.093 Score=31.82 Aligned_cols=42 Identities=19% Similarity=0.358 Sum_probs=35.7 Q ss_pred HHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 4416023310111-15683365524778861158999999988 Q gi|254780617|r 146 LSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 146 l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) +.||++.+|.++. +.+|+=+.|-|-++.|||++++.+..|-+ T Consensus 186 i~TG~~~LD~~~~G~~~g~LiviaaRP~mGKTa~alnia~~~a 228 (448) T PRK05748 186 IPTGFRDLDKMTSGLQPNDLIIVAARPSVGKTAFALNIAQNVA 228 (448) T ss_pred EECCHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 0578278999827988673799984799876899999999999 No 135 >PRK06904 replicative DNA helicase; Validated Probab=93.12 E-value=0.25 Score=28.67 Aligned_cols=43 Identities=16% Similarity=0.277 Sum_probs=35.4 Q ss_pred HHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 4416023310111-156833655247788611589999999886 Q gi|254780617|r 146 LSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQKS 188 (509) Q Consensus 146 l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~~ 188 (509) +.||++.+|.++. +-+|+=+.|=|-++.|||++++.+..|-+. T Consensus 204 i~TG~~~LD~~t~Gl~~g~LiViAaRPsmGKTa~alnia~n~A~ 247 (472) T PRK06904 204 VTTGFTDLDKKTAGLQPSDLIIVAARPSMGKTTFAMNLCENAAM 247 (472) T ss_pred ECCCHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHH T ss_conf 22897999744158875757999737987568999999999999 No 136 >pfam02283 CobU Cobinamide kinase / cobinamide phosphate guanyltransferase. This family is composed of a group of bifunctional cobalamin biosynthesis enzymes which display cobinamide kinase and cobinamide phosphate guanyltransferase activity. The crystal structure of the enzyme reveals the molecule to be a trimer with a propeller-like shape. Probab=93.10 E-value=0.46 Score=26.76 Aligned_cols=86 Identities=15% Similarity=0.172 Sum_probs=48.9 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCH-HHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHH Q ss_conf 6552477886115899999998863024667876069999516776-789999985114674211589972887888999 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKR-SSVARFVKALEDRGALSYSIVVVASASDPAPMQ 243 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~-~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r 243 (509) ++|.||+..|||..|-.... +. ... .+|++-++.. .|+.+=++..++.-.-...++-. + T Consensus 1 iLVtGG~rSGKS~~AE~la~-~~--------~~~--~~YiAT~~~~D~Em~~RI~~Hr~~R~~~w~tiE~-~-------- 60 (166) T pfam02283 1 ILVTGGARSGKSRFAERLAL-AS--------GGP--VVYIATAQAFDDEMAERIARHRARRPAGWTTIEE-P-------- 60 (166) T ss_pred CEEECCCCCCHHHHHHHHHH-HC--------CCC--EEEEECCCCCCHHHHHHHHHHHHCCCCCCEEEEC-C-------- T ss_conf 98868877338999999998-55--------998--1999769888889999999999718999679977-4-------- Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHH Q ss_conf 9988766666675420468847996164789999 Q gi|254780617|r 244 LLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVA 277 (509) Q Consensus 244 ~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a 277 (509) ..+++.++.-..+-.+++|+||.|..- T Consensus 61 -------~~l~~~l~~~~~~~~vLiDclt~wl~N 87 (166) T pfam02283 61 -------LDLAEALARLPGGDVVLVDCLTLWLTN 87 (166) T ss_pred -------CCHHHHHHHCCCCCEEEEECHHHHHHH T ss_conf -------459999984698986999717789999 No 137 >TIGR02858 spore_III_AA stage III sporulation protein AA; InterPro: IPR014217 Proteins in this entry include the stage III sporulation protein AA that is encoded by one of several genes in the spoIIIA locus. This protein is only found in species that are capable of endospore formation.. Probab=93.09 E-value=0.14 Score=30.63 Aligned_cols=152 Identities=24% Similarity=0.246 Sum_probs=78.9 Q ss_pred CHHHHHCCC-CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCC-CCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCC Q ss_conf 602331011-1156833655247788611589999999886302-46678760699995167767899999851146742 Q gi|254780617|r 149 GIKAIDSLI-PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHD-KGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGAL 226 (509) Q Consensus 149 GI~~ID~l~-pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~-~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~ 226 (509) -.+.++-|+ ==++=+..+|+|+|++|||||+=|...+-. ... .......-+ +.|.|| ||++--++-.-+.+.= T Consensus 109 A~~~~~yL~d~~~~~~NTLiIsPPq~GKTTlLRDlaR~~S-tG~~~~~~~g~KV---givDER-SEIAgC~~GvPQ~~vG 183 (282) T TIGR02858 109 ADKILPYLVDRNGRVLNTLIISPPQCGKTTLLRDLARILS-TGISKLGLKGKKV---GIVDER-SEIAGCVNGVPQLDVG 183 (282) T ss_pred CHHHHHHHHCCCCCEEEEEEECCCCCCCCCHHHHHHHHHH-CCCCCCCCCCCCE---EEEECC-HHHHHHCCCCCCCCCC T ss_conf 5666887730589446788886889885104888988860-7854246899746---998432-4656545882414467 Q ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHH------HHHH------------HHHHHHCCCC Q ss_conf 115899728878889999988766666675420468847996164789------9999------------8754530378 Q gi|254780617|r 227 SYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKH------AVAY------------RQLSLLLRRP 288 (509) Q Consensus 227 ~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~------A~a~------------reisl~~~~p 288 (509) -||=|.=+= |=++ ||-|+= |..-=+|+ +.|=+=|. -+|+ +.++=+.+|| T Consensus 184 ~RtDVLD~C---PKAE-------GmMM~i--RSMSP~Vi-v~DEIGr~ED~~Al~eA~naGV~~I~TaHg~~~~Dl~kRP 250 (282) T TIGR02858 184 IRTDVLDGC---PKAE-------GMMMLI--RSMSPDVI-VVDEIGREEDVEALLEALNAGVSVIATAHGRDLEDLKKRP 250 (282) T ss_pred CCEEECCCC---CHHH-------HHHHHH--HCCCCCEE-EEECCCCHHHHHHHHHHHCCCCEEEEEECCCCHHHHHCCH T ss_conf 606751788---5378-------999999--70698579-9814889533899999861675688764048812665076 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEE Q ss_conf 7543337520012467888875045445887356753 Q gi|254780617|r 289 PGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALP 325 (509) Q Consensus 289 p~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~ 325 (509) -=++- +-.+.+||+==|+...|-|+|-.++ T Consensus 251 ~fk~l-------~e~~~Fer~v~Ls~~~G~GTve~vy 280 (282) T TIGR02858 251 VFKEL-------LEQKAFERYVVLSRRKGPGTVEAVY 280 (282) T ss_pred HHHHH-------HHHCCCEEEEEECCCCCCCEEEEEC T ss_conf 67999-------9724314899836889883244531 No 138 >cd03251 ABCC_MsbA MsbA is an essential ABC transporter, closely related to eukaryotic MDR proteins. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=93.09 E-value=0.06 Score=33.21 Aligned_cols=29 Identities=17% Similarity=0.407 Sum_probs=23.9 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.|.+||+++|.|.+|+|||||+ ..+. T Consensus 21 inl~i~~Ge~i~IvG~sGsGKSTLl-~ll~ 49 (234) T cd03251 21 ISLDIPAGETVALVGPSGSGKSTLV-NLIP 49 (234) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 0899879999999989998299999-9996 No 139 >pfam04665 Pox_A32 Poxvirus A32 protein. The A32 protein is thought to be involved in viral DNA packaging. Probab=93.08 E-value=0.72 Score=25.37 Aligned_cols=144 Identities=18% Similarity=0.197 Sum_probs=69.6 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHH Q ss_conf 83365524778861158999999988630246678760699995167767899999851146742115899728878889 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAP 241 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~ 241 (509) -=|+.|.|++|.|||+.++....+-.... .-||..---+.+++..++ ...-|.-+++.++ T Consensus 13 pFrmaivGgSGSGKT~yLlsLf~tlv~ky---------khIfLfTpv~N~~Yd~YV--------wPdHV~~vtt~ee--- 72 (241) T pfam04665 13 PFRMAIVGGSGSGKTTYLLSLLRTLVRKF---------KHIFLFTPVYNNAYDGYV--------WPDHIFKVTTNEE--- 72 (241) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHHH---------EEEEEECCCCCCCCCCCC--------CCCCEEEECCHHH--- T ss_conf 73599981588756699999999997741---------589996244673236525--------7773256257235--- Q ss_pred HHHHHHHHHHHHHHHHH--CCCC----CEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 99998876666667542--0468----84799616478999998754530378754333752001246788887504544 Q gi|254780617|r 242 MQLLAPFAGCAMGEYFR--DNGY----HALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDA 315 (509) Q Consensus 242 ~r~~ap~~a~aiAEyfr--~~G~----~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~ 315 (509) +-|.-. +...-.|.|- .+|+ +.|+++||+-.-----+.++-.+.. ||...-.-+| |..+--.+ +. T Consensus 73 leY~L~-~~k~kIek~~~~~~~~k~~~~fLiIlDD~Gd~q~rs~~L~~f~N~--gRH~n~Sii~-----Lcqty~Hv-p~ 143 (241) T pfam04665 73 LEYALS-KTKEKIEKFSKKASNQKENFRFLIILDDLGDMQTRSKTLLNFTNH--GRHLNISIIL-----LCQTYKHV-PV 143 (241) T ss_pred HHHHHH-HHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHCC--CCCHHHEEEE-----EEEEEECC-CC T ss_conf 778999-999999999986026664104899962555323311577766401--6101011134-----44221104-99 Q ss_pred CCCCCEEEEEEEECCCCCC Q ss_conf 5887356753100367664 Q gi|254780617|r 316 LGAGSLTALPVIETQVNDV 334 (509) Q Consensus 316 ~g~GSiT~~~~v~~~~~D~ 334 (509) +|--|||-+--.-+...|+ T Consensus 144 n~R~Sith~cccNvs~sD~ 162 (241) T pfam04665 144 NGRDSITHFCCCNVSDSDL 162 (241) T ss_pred CCCCCCEEEEEECCCHHHH T ss_conf 8743204899831546668 No 140 >cd03223 ABCD_peroxisomal_ALDP Peroxisomal ATP-binding cassette transporter (Pat) is involved in the import of very long-chain fatty acids (VLCFA) into the peroxisome. The peroxisomal membrane forms a permeability barrier for a wide variety of metabolites required for and formed during fatty acid beta-oxidation. To communicate with the cytoplasm and mitochondria, peroxisomes need dedicated proteins to transport such hydrophilic molecules across their membranes. X-linked adrenoleukodystrophy (X-ALD) is caused by mutations in the ALD gene, which encodes ALDP (adrenoleukodystrophy protein ), a peroxisomal integral membrane protein that is a member of the ATP-binding cassette (ABC) transporter protein family. The disease is characterized by a striking and unpredictable variation in phenotypic expression. Phenotypes include the rapidly progressive childhood cerebral form (CCALD), the milder adult form, adrenomyeloneuropathy (AMN), and variants without neurologic involvement (i.e. asympt Probab=93.07 E-value=0.063 Score=33.06 Aligned_cols=29 Identities=38% Similarity=0.535 Sum_probs=24.2 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 101111568336552477886115899999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |.=+.+.+|++.+|+|.+|+|||||+ .+| T Consensus 19 ~isl~i~~Ge~v~i~G~sGsGKSTLl-~~l 47 (166) T cd03223 19 DLSFEIKPGDRLLITGPSGTGKSSLF-RAL 47 (166) T ss_pred EEEEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 45889889999999958999889999-998 No 141 >COG1116 TauB ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=93.04 E-value=0.065 Score=32.95 Aligned_cols=33 Identities=15% Similarity=0.446 Sum_probs=25.4 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .++++ .=+.+.+|+-..|+|+||||||||+ .+| T Consensus 16 ~~vl~~i~L~v~~GEfvsilGpSGcGKSTLL-rii 49 (248) T COG1116 16 VEVLEDINLSVEKGEFVAILGPSGCGKSTLL-RLI 49 (248) T ss_pred EEEECCCEEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 6876050358779979999899978899999-999 No 142 >cd03228 ABCC_MRP_Like The MRP (Mutidrug Resistance Protein)-like transporters are involved in drug, peptide, and lipid export. They belong to the subfamily C of the ATP-binding cassette (ABC) superfamily of transport proteins. The ABCC subfamily contains transporters with a diverse functional spectrum that includes ion transport, cell surface receptor, and toxin secretion activities. The MRP-like family, simlar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains, each composed of six transmembrane (TM) helices, and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=93.03 E-value=0.061 Score=33.15 Aligned_cols=28 Identities=25% Similarity=0.566 Sum_probs=23.4 Q ss_pred CCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 11115683365524778861158999999 Q gi|254780617|r 156 LIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 156 l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) =+.+-+|++.+|+|.+|+|||||+ .+|. T Consensus 22 sl~i~~Ge~i~ivG~sGsGKSTLl-~ll~ 49 (171) T cd03228 22 SLTIKPGEKVAIVGPSGSGKSTLL-KLLL 49 (171) T ss_pred EEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 899859989999999998399999-9997 No 143 >CHL00195 ycf46 Ycf46; Provisional Probab=92.99 E-value=0.2 Score=29.42 Aligned_cols=24 Identities=25% Similarity=0.327 Sum_probs=16.9 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 3655247788611589999999886 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKS 188 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~ 188 (509) -++++|.+|||||-++ -+|.+..+ T Consensus 261 GvLL~GpPG~GKtl~A-KAvA~e~~ 284 (491) T CHL00195 261 GLLLVGIQGTGKSLTA-KAIANEWN 284 (491) T ss_pred EEEEECCCCCCHHHHH-HHHHHHHC T ss_conf 7999799998789999-99986638 No 144 >cd03250 ABCC_MRP_domain1 Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate. Probab=92.99 E-value=0.066 Score=32.89 Aligned_cols=29 Identities=24% Similarity=0.487 Sum_probs=23.8 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.|-+|+..+|+|.+|+|||||+ .+|. T Consensus 24 isl~i~~Ge~~~IvG~sGsGKSTLl-~~i~ 52 (204) T cd03250 24 INLEVPKGELVAIVGPVGSGKSSLL-SALL 52 (204) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHC T ss_conf 4899769989999999998589999-9981 No 145 >PRK13648 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=92.93 E-value=0.068 Score=32.81 Aligned_cols=28 Identities=29% Similarity=0.437 Sum_probs=23.2 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 01111568336552477886115899999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .=+.|-+|+..+|+|.+|+|||||+. +| T Consensus 28 is~~i~~Ge~vaiiG~sGsGKSTLl~-ll 55 (269) T PRK13648 28 VSFNIPKGQWTSIVGHNGSGKSTIAK-LM 55 (269) T ss_pred EEEEECCCCEEEEECCCCCCHHHHHH-HH T ss_conf 58998599899999999997999999-99 No 146 >cd03289 ABCC_CFTR2 The CFTR subfamily domain 2. The cystic fibrosis transmembrane regulator (CFTR), the product of the gene mutated in patients with cystic fibrosis, has adapted the ABC transporter structural motif to form a tightly regulated anion channel at the apical surface of many epithelia. Use of the term assembly of a functional ion channel implies the coming together of subunits or at least smaller not-yet functional components of the active whole. In fact, on the basis of current knowledge only the CFTR polypeptide itself is required to form an ATP- and protein kinase A-dependent low-conductance chloride channel of the type present in the apical membrane of many epithelial cells. CFTR displays the typical organization (IM-ABC)2 and carries a characteristic hydrophilic R-domain that separates IM1-ABC1 from IM2-ABC2. Probab=92.90 E-value=0.071 Score=32.70 Aligned_cols=30 Identities=30% Similarity=0.595 Sum_probs=24.8 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.|-+|++.+|+|.+|+|||||+ .++. T Consensus 22 ~isf~I~~Ge~vaIvG~sGsGKSTLl-~lL~ 51 (275) T cd03289 22 NISFSISPGQRVGLLGRTGSGKSTLL-SAFL 51 (275) T ss_pred CEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 50799879999999999999799999-9996 No 147 >KOG4658 consensus Probab=92.89 E-value=0.77 Score=25.18 Aligned_cols=101 Identities=21% Similarity=0.249 Sum_probs=50.6 Q ss_pred HHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHH-HCCCCCCCEEE Q ss_conf 33101111568336552477886115899999998863024667876069999516776789999985-11467421158 Q gi|254780617|r 152 AIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKA-LEDRGALSYSI 230 (509) Q Consensus 152 ~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~-l~~~~~~~~tv 230 (509) +.+.|...+- |-++|+|=.|+|||||+. .|.|.... ....-+.+ +.+.+++- -.+.++.+. ++.-+..+.. T Consensus 170 l~~~L~~d~~-~ivgi~GMGGvGKTTL~~-qi~N~~~~--v~~~Fd~~--iWV~VSk~-f~~~~iq~~Il~~l~~~~~~- 241 (889) T KOG4658 170 LWNRLMEDDV-GIVGIYGMGGVGKTTLAR-QIFNKFDE--VGNHFDGV--IWVVVSKE-FTTRKIQQTILERLGLLDEE- 241 (889) T ss_pred HHHHHCCCCC-CEEEEECCCCCCHHHHHH-HHHHCCHH--HCCCCCEE--EEEEECCC-CCHHHHHHHHHHHHCCCCCC- T ss_conf 9998404799-689998897034999999-98413312--23578749--99997763-10888999999984468710- Q ss_pred EEEECCCCCHHHHHHHHHHHHHHHHHHHC-CCCCEEEEEECHHH Q ss_conf 99728878889999988766666675420-46884799616478 Q gi|254780617|r 231 VVVASASDPAPMQLLAPFAGCAMGEYFRD-NGYHALIAYDDLQK 273 (509) Q Consensus 231 vv~a~a~~~~~~r~~ap~~a~aiAEyfr~-~G~~VLi~~Ddltr 273 (509) -++... .- -+. +.++. +++.-|+++||+=. T Consensus 242 -----~~~~~~-~~----~~~---~i~~~L~~krfllvLDDIW~ 272 (889) T KOG4658 242 -----WEDKEE-DE----LAS---KLLNLLEGKRFLLVLDDIWE 272 (889) T ss_pred -----CCCCCH-HH----HHH---HHHHHHCCCCEEEEEECCCC T ss_conf -----001028-89----999---99998556855999825675 No 148 >TIGR00972 3a0107s01c2 phosphate ABC transporter, ATP-binding protein; InterPro: IPR005670 This is a family of phosphate transport system permease proteins.; GO: 0005315 inorganic phosphate transmembrane transporter activity, 0015114 phosphate transmembrane transporter activity, 0006817 phosphate transport, 0016020 membrane. Probab=92.86 E-value=0.068 Score=32.83 Aligned_cols=24 Identities=25% Similarity=0.464 Sum_probs=20.7 Q ss_pred CCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 111156833655247788611589 Q gi|254780617|r 156 LIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 156 l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) =+.|-++|=..|||+|||||||++ T Consensus 21 ~~~I~~n~vTAlIGPSGCGKSTlL 44 (248) T TIGR00972 21 NLDIPKNQVTALIGPSGCGKSTLL 44 (248) T ss_pred CCEECCCEEEEEECCCCCCHHHHH T ss_conf 620037705898778898678999 No 149 >cd03248 ABCC_TAP TAP, the Transporter Associated with Antigen Processing; TAP is essential for peptide delivery from the cytosol into the lumen of the endoplasmic reticulum (ER), where these peptides are loaded on major histocompatibility complex (MHC) I molecules. Loaded MHC I leave the ER and display their antigenic cargo on the cell surface to cytotoxic T cells. Subsequently, virus-infected or malignantly transformed cells can be eliminated. TAP belongs to the large family of ATP-binding cassette (ABC) transporters, which translocate a vast variety of solutes across membranes. Probab=92.83 E-value=0.067 Score=32.87 Aligned_cols=30 Identities=17% Similarity=0.410 Sum_probs=24.6 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.|-+|++.+|.|.+|+|||||+ .+|. T Consensus 32 ~is~~i~~Ge~vaIvG~sGsGKSTL~-~ll~ 61 (226) T cd03248 32 DVSFTLHPGEVTALVGPSGSGKSTVV-ALLE 61 (226) T ss_pred CEEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 53899829999999999998499999-9996 No 150 >COG1126 GlnQ ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism] Probab=92.79 E-value=0.11 Score=31.20 Aligned_cols=170 Identities=18% Similarity=0.286 Sum_probs=79.7 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEE-ECCCCHHHHHHHHHH----HCCCCCCCEEEE Q ss_conf 111568336552477886115899999998863024667876069999-516776789999985----114674211589 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYV-AIGQKRSSVARFVKA----LEDRGALSYSIV 231 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~-~IGer~~ev~~~~~~----l~~~~~~~~tvv 231 (509) +.+-+|+...|+|+||+||||++- .+|- .-..+...|.|-. -++.+. ++.++.+. +.......+-++ T Consensus 23 l~v~~Gevv~iiGpSGSGKSTlLR--clN~-----LE~~~~G~I~i~g~~~~~~~-~~~~~R~~vGmVFQ~fnLFPH~Tv 94 (240) T COG1126 23 LSVEKGEVVVIIGPSGSGKSTLLR--CLNG-----LEEPDSGSITVDGEDVGDKK-DILKLRRKVGMVFQQFNLFPHLTV 94 (240) T ss_pred EEECCCCEEEEECCCCCCHHHHHH--HHHC-----CCCCCCCEEEECCEECCCHH-HHHHHHHHCCEECCCCCCCCCCHH T ss_conf 167389789998999998889999--9977-----86887864999987225454-699999855766246654655329 Q ss_pred EEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHH---------------HHH-----HHHCCCCCCC Q ss_conf 97288788899999887666666754204688479961647899999---------------875-----4530378754 Q gi|254780617|r 232 VVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAY---------------RQL-----SLLLRRPPGR 291 (509) Q Consensus 232 v~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~---------------rei-----sl~~~~pp~~ 291 (509) .-+-.-.|--.+-+...-|-..|..+.++ =.|.-+|++| |.+ -++..||-+. T Consensus 95 leNv~lap~~v~~~~k~eA~~~A~~lL~~--------VGL~dka~~yP~qLSGGQqQRVAIARALaM~P~vmLFDEPTSA 166 (240) T COG1126 95 LENVTLAPVKVKKLSKAEAREKALELLEK--------VGLADKADAYPAQLSGGQQQRVAIARALAMDPKVMLFDEPTSA 166 (240) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHH--------CCCHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCC T ss_conf 88877753997298999999999999998--------6955666539510480788999999987179988863697543 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEE Q ss_conf 33375200124678888750454458873567531003676644311466885406169961 Q gi|254780617|r 292 EAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLE 353 (509) Q Consensus 292 ~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~ 353 (509) .-|.+-...-.++...+ + .| +|-+ +-|-.=.+..-++|.++.+-+|+|+.+ T Consensus 167 --LDPElv~EVL~vm~~LA---~---eG-mTMi--vVTHEM~FAr~VadrviFmd~G~iie~ 217 (240) T COG1126 167 --LDPELVGEVLDVMKDLA---E---EG-MTMI--IVTHEMGFAREVADRVIFMDQGKIIEE 217 (240) T ss_pred --CCHHHHHHHHHHHHHHH---H---CC-CEEE--EEECHHHHHHHHHHEEEEEECCEEEEE T ss_conf --79889999999999999---7---69-8699--995036799986222899528889875 No 151 >cd03369 ABCC_NFT1 Domain 2 of NFT1 (New full-length MRP-type transporter 1). NFT1 belongs to the MRP (mulrtidrug resisitance-associated protein) family of ABC transporters. Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions such as glutathione, glucuronate, and sulfate. Probab=92.78 E-value=0.073 Score=32.61 Aligned_cols=31 Identities=19% Similarity=0.358 Sum_probs=24.8 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 01111568336552477886115899999998 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) .=+.|-+||+.+|+|.+|+|||||+ .+|... T Consensus 27 isl~i~~Ge~v~ivG~sGsGKSTLl-~ll~g~ 57 (207) T cd03369 27 VSFKVKAGEKIGIVGRTGAGKSTLI-LALFRF 57 (207) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHHHH T ss_conf 5889869999999999998799999-999987 No 152 >PRK10522 multidrug transporter membrane component/ATP-binding component; Provisional Probab=92.74 E-value=0.076 Score=32.46 Aligned_cols=21 Identities=33% Similarity=0.680 Sum_probs=8.7 Q ss_pred CCCCCCEEEEECCCCCCCHHH Q ss_conf 115683365524778861158 Q gi|254780617|r 158 PIGRGQRELIIGDRKTGKTSI 178 (509) Q Consensus 158 pigrGQR~~I~g~~g~GKt~l 178 (509) ++-+|++.+|.|++|+||||| T Consensus 345 ~i~~Ge~vaiVG~SGsGKSTL 365 (547) T PRK10522 345 TIKRGELLFLIGGNGSGKSTL 365 (547) T ss_pred EECCCCEEEEECCCCCCHHHH T ss_conf 985998899989999977999 No 153 >PRK10419 nikE nickel transporter ATP-binding protein; Provisional Probab=92.73 E-value=0.07 Score=32.72 Aligned_cols=33 Identities=12% Similarity=0.330 Sum_probs=26.4 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ++++| .=+.+.+|+..+|+|.+|+|||||+. +| T Consensus 25 ~~~l~~vs~~i~~GE~l~ivGeSGsGKSTL~r-~i 58 (266) T PRK10419 25 QAVLNNVSLTLKSGETVALLGRSGCGKSTLAR-LL 58 (266) T ss_pred EEEEECCEEEECCCCEEEEECCCCCHHHHHHH-HH T ss_conf 68885817588899899999999977999999-99 No 154 >PRK13657 cyclic beta-1,2-glucan ABC transporter; Provisional Probab=92.73 E-value=0.075 Score=32.53 Aligned_cols=23 Identities=30% Similarity=0.517 Sum_probs=11.2 Q ss_pred CCCCCCCEEEEECCCCCCCHHHH Q ss_conf 11156833655247788611589 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +.+-+||+++|.|.+|+|||||+ T Consensus 356 l~i~~Ge~vaiVG~SGsGKSTL~ 378 (585) T PRK13657 356 FEAKPGQTVAIVGPTGAGKSTLI 378 (585) T ss_pred EEECCCCEEEEECCCCCCHHHHH T ss_conf 89759988999889898699999 No 155 >PRK10636 putative ABC transporter ATP-binding protein; Provisional Probab=92.68 E-value=0.072 Score=32.63 Aligned_cols=29 Identities=21% Similarity=0.472 Sum_probs=20.5 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 101111568336552477886115899999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |.=+.|..|+|++|+|..|+|||||+ .+| T Consensus 19 ~vsl~I~~Ge~vgLVG~NGsGKSTLl-klL 47 (638) T PRK10636 19 NATATINPGQKVGLVGKNGCGKSTLL-ALL 47 (638) T ss_pred CCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 76799989989999889998899999-998 No 156 >pfam05673 DUF815 Protein of unknown function (DUF815). This family consists of several bacterial proteins of unknown function. Probab=92.67 E-value=0.65 Score=25.72 Aligned_cols=24 Identities=38% Similarity=0.645 Sum_probs=16.8 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 8336552477886115899999998 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) -..+++.|.+|||||++.. +++|. T Consensus 53 AnnvLLwG~RGtGKSSlVK-all~~ 76 (248) T pfam05673 53 ANNVLLWGARGTGKSSLVK-ALLNE 76 (248) T ss_pred CCCEEEECCCCCCHHHHHH-HHHHH T ss_conf 6136767689898889999-99998 No 157 >pfam00004 AAA ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes. Probab=92.67 E-value=0.25 Score=28.67 Aligned_cols=22 Identities=27% Similarity=0.394 Sum_probs=16.6 Q ss_pred EEEECCCCCCCHHHHHHHHHHHH Q ss_conf 65524778861158999999988 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) +++.|++|||||+++ ..|.++. T Consensus 1 iLl~GppGtGKT~~a-~~la~~~ 22 (131) T pfam00004 1 LLLYGPPGTGKTTLA-KAVAKEL 22 (131) T ss_pred CEEECCCCCCHHHHH-HHHHHHH T ss_conf 987899999999999-9999997 No 158 >cd04147 Ras_dva Ras-dva subfamily. Ras-dva (Ras - dorsal-ventral anterior localization) subfamily consists of a set of proteins characterized only in Xenopus leavis, to date. In Xenopus Ras-dva expression is activated by the transcription factor Otx2 and begins during gastrulation throughout the anterior ectoderm. Ras-dva expression is inhibited in the anterior neural plate by factor Xanf1. Downregulation of Ras-dva results in head development abnormalities through the inhibition of several regulators of the anterior neural plate and folds patterning, including Otx2, BF-1, Xag2, Pax6, Slug, and Sox9. Downregulation of Ras-dva also interferes with the FGF-8a signaling within the anterior ectoderm. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Ras proteins. Probab=92.63 E-value=0.13 Score=30.84 Aligned_cols=20 Identities=30% Similarity=0.499 Sum_probs=16.5 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|..-.+ T Consensus 1 KIvvlGd~~VGKTSLi~rf~ 20 (198) T cd04147 1 RLVFMGAAGVGKTALIQRFL 20 (198) T ss_pred CEEEECCCCCCHHHHHHHHH T ss_conf 98999989977999999998 No 159 >cd03290 ABCC_SUR1_N The SUR domain 1. The sulfonylurea receptor SUR is an ATP transporter of the ABCC/MRP family with tandem ATPase binding domains. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=92.63 E-value=0.078 Score=32.39 Aligned_cols=29 Identities=34% Similarity=0.484 Sum_probs=23.6 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.|-+|++.+|+|.+|+|||||+ .+|. T Consensus 20 inl~i~~Ge~~~IvG~sGsGKSTLl-~~l~ 48 (218) T cd03290 20 INIRIPTGQLTMIVGQVGCGKSSLL-LAIL 48 (218) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 6999869999999999998099999-9985 No 160 >PRK13633 cobalt transporter ATP-binding subunit; Provisional Probab=92.62 E-value=0.076 Score=32.48 Aligned_cols=29 Identities=28% Similarity=0.523 Sum_probs=24.2 Q ss_pred HHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 331-01111568336552477886115899 Q gi|254780617|r 152 AID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 152 ~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) |+| .=+.|.+|+..+|+|.+|+|||||+. T Consensus 26 al~~Isl~i~~GE~v~iiG~nGsGKSTL~r 55 (281) T PRK13633 26 ALDDVNLEVKKGEFLVILGHNGSGKSTIAK 55 (281) T ss_pred EEECEEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 673407688799899999999984999999 No 161 >PRK11308 dppF dipeptide transporter ATP-binding subunit; Provisional Probab=92.59 E-value=0.053 Score=33.58 Aligned_cols=31 Identities=26% Similarity=0.539 Sum_probs=27.2 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 02331-01111568336552477886115899 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) |+|+| .=+.|.+|...+|+|.+|+|||||+. T Consensus 28 v~Av~~Vsl~i~~GE~lgiVGeSGsGKSTL~~ 59 (327) T PRK11308 28 VKALDGVSFNLERGKTLAVVGESGCGKSTLAR 59 (327) T ss_pred EEEECCEEEEECCCCEEEEECCCCHHHHHHHH T ss_conf 88850606798899999999998319999999 No 162 >cd03249 ABC_MTABC3_MDL1_MDL2 MTABC3 (also known as ABCB6) is a mitochondrial ATP-binding cassette protein involved in iron homeostasis and one of four ABC transporters expressed in the mitochondrial inner membrane, the other three being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast MDL1 (multidrug resistance-like protein 1) and MDL2 (multidrug resistance-like protein 2) transporters are also included in this CD. MDL1 is an ATP-dependent permease that acts as a high-copy suppressor of ATM1 and is thought to have a role in resistance to oxidative stress. Interestingly, subfamily B is more closely related to the carboxyl-terminal component of subfamily C than the two halves of ABCC molecules are with one another. Probab=92.58 E-value=0.083 Score=32.20 Aligned_cols=30 Identities=17% Similarity=0.376 Sum_probs=24.6 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.|-+|++++|+|.+|+|||||+ ..+. T Consensus 21 ~isl~i~~G~~iaIvG~sGsGKSTLl-~ll~ 50 (238) T cd03249 21 GLSLTIPPGKTVALVGSSGCGKSTVV-SLLE 50 (238) T ss_pred CEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 55899769999999999999899999-9982 No 163 >PRK08694 consensus Probab=92.56 E-value=0.16 Score=30.06 Aligned_cols=44 Identities=18% Similarity=0.284 Sum_probs=35.2 Q ss_pred HHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 44416023310111-156833655247788611589999999886 Q gi|254780617|r 145 PLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQKS 188 (509) Q Consensus 145 ~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~~ 188 (509) =+.||++.+|.++- +-+|+=+.|=|-++.|||++++.++.|-+. T Consensus 200 Gi~TG~~~LD~~t~Gl~~G~LiVIaaRPsmGKTalalnia~~~a~ 244 (468) T PRK08694 200 GVPTGFIDLDKKTSGLQPGDLIIVAGRPSMGKTAFSINIAEHVAV 244 (468) T ss_pred CCCCCHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHH T ss_conf 155796889876448887847999617865378999999999998 No 164 >cd03257 ABC_NikE_OppD_transporters The ABC transporter subfamily specific for the transport of dipeptides, oligopeptides (OppD), and nickel (NikDE). The NikABCDE system of E. coli belongs to this family and is composed of the periplasmic binding protein NikA, two integral membrane components (NikB and NikC), and two ATPase (NikD and NikE). The NikABCDE transporter is synthesized under anaerobic conditions to meet the increased demand for nickel resulting from hydrogenase synthesis. The molecular mechanism of nickel uptake in many bacteria and most archaea is not known. Many other members of this ABC family are also involved in the uptake of dipeptides and oligopeptides. The oligopeptide transport system (Opp) is a five-component ABC transport composed of a membrane-anchored substrate binding proteins (SRP), OppA, two transmembrane proteins, OppB and OppC, and two ATP-binding domains, OppD and OppF. Probab=92.55 E-value=0.077 Score=32.42 Aligned_cols=32 Identities=25% Similarity=0.515 Sum_probs=25.2 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 2331-01111568336552477886115899999 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ++++ .=+.+-+|+..+|+|++|+|||||+ .+| T Consensus 19 ~~L~dIsl~i~~Ge~~~iiG~sGsGKSTLl-~~i 51 (228) T cd03257 19 KALDDVSFSIKKGETLGLVGESGSGKSTLA-RAI 51 (228) T ss_pred EEEECEEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 998560789869989999999998699999-999 No 165 >cd03270 ABC_UvrA_I The excision repair protein UvrA domain I; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=92.53 E-value=0.09 Score=31.92 Aligned_cols=30 Identities=30% Similarity=0.413 Sum_probs=25.5 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.|-+|+-.+|+|.+|+|||||++.++. T Consensus 14 Vsl~i~~Ge~~aIvG~nGsGKSTL~~~~l~ 43 (226) T cd03270 14 VDVDIPRNKLVVITGVSGSGKSSLAFDTIY 43 (226) T ss_pred EEEEECCCCEEEEECCCCCHHHHHHHHHHH T ss_conf 489985998999987899609898361663 No 166 >PRK11160 cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed Probab=92.49 E-value=0.083 Score=32.18 Aligned_cols=20 Identities=5% Similarity=0.092 Sum_probs=10.3 Q ss_pred CCCCCHHHHHHHHHHHHHHH Q ss_conf 27989899999999999998 Q gi|254780617|r 486 QKVLTDDIRSKLINEIKVFL 505 (509) Q Consensus 486 ~~~l~de~~~~L~~~i~~~~ 505 (509) +-.++.++|+.+.+++++.. T Consensus 504 Ts~LD~~te~~i~~~l~~~~ 523 (575) T PRK11160 504 TEGLDAETERQILELLFEHA 523 (575) T ss_pred CCCCCHHHHHHHHHHHHHHC T ss_conf 66679999999999999866 No 167 >cd03246 ABCC_Protease_Secretion This family represents the ABC component of the protease secretion system PrtD, a 60-kDa integral membrane protein sharing 37% identity with HlyB, the ABC component of the alpha-hemolysin secretion pathway, in the C-terminal domain. They export degradative enzymes by using a type I protein secretion system and lack an N-terminal signal peptide, but contain a C-terminal secretion signal. The Type I secretion apparatus is made up of three components, an ABC transporter, a membrane fusion protein (MFP), and an outer membrane protein (OMP). For the HlyA transporter complex, HlyB (ABC transporter) and HlyD (MFP) reside in the inner membrane of E. coli. The OMP component is TolC, which is thought to interact with the MFP to form a continuous channel across the periplasm from the cytoplasm to the exterior. HlyB belongs to the family of ABC transporters, which are ubiquitous, ATP-dependent transmembrane pumps or channels. The spectrum of transport substra Probab=92.48 E-value=0.083 Score=32.20 Aligned_cols=30 Identities=27% Similarity=0.437 Sum_probs=24.4 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.+-+|++.+|+|.+|+|||||+ .++. T Consensus 20 ~i~l~i~~G~~vaIvG~sGsGKSTLl-~ll~ 49 (173) T cd03246 20 NVSFSIEPGESLAIIGPSGSGKSTLA-RLIL 49 (173) T ss_pred CEEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 76999859999999999998099999-9996 No 168 >cd03298 ABC_ThiQ_thiamine_transporter ABC-type thiamine tranport system; part of the binding-protein-dependent transport system tbpA-thiPQ for thiamine and TPP. Probably responsible for the translocation of thiamine across the membrane. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=92.45 E-value=0.069 Score=32.77 Aligned_cols=31 Identities=16% Similarity=0.419 Sum_probs=25.0 Q ss_pred HHCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 31011115683365524778861158999999 Q gi|254780617|r 153 IDSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 153 ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +|.=+.+.+|+...|+|++|+|||||+ .+|. T Consensus 15 v~i~l~i~~Ge~~~ilGpSGsGKSTLl-~li~ 45 (211) T cd03298 15 MHFDLTFAQGEITAIVGPSGSGKSTLL-NLIA 45 (211) T ss_pred EEEEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 627889889989999999995599999-9997 No 169 >PRK08840 replicative DNA helicase; Provisional Probab=92.45 E-value=0.14 Score=30.48 Aligned_cols=43 Identities=16% Similarity=0.261 Sum_probs=34.6 Q ss_pred HHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 44416023310111-15683365524778861158999999988 Q gi|254780617|r 145 PLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 145 ~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) -+.||++.+|.++. +-+|+=+.|=|-++.|||++++.++.|-+ T Consensus 199 Gi~TG~~~LD~~~~Gl~~G~LiviaaRPsmGKTalalnia~n~a 242 (464) T PRK08840 199 GVSTGFTDLNKKTAGLQGSDLIIVAARPSMGKTTFAMNLCENAA 242 (464) T ss_pred CCCCCHHHHHHHCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 68889899987536987576799983798736899999999999 No 170 >KOG1353 consensus Probab=92.45 E-value=0.17 Score=29.93 Aligned_cols=35 Identities=34% Similarity=0.520 Sum_probs=29.9 Q ss_pred EEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 15899728878889999988766666675420468 Q gi|254780617|r 228 YSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGY 262 (509) Q Consensus 228 ~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~ 262 (509) .++...++..|-|+..++++|++||++|||+++|+ T Consensus 79 ~~VkrTgaIvDVpvg~~LlgrvvdAlGn~idgkG~ 113 (340) T KOG1353 79 DTVKRTAAISDVPPLKALLGRVGCALGEPIDGNGK 113 (340) T ss_pred CEEEEEEEEECCCCHHHHHHHHHHHHCCEECCCCC T ss_conf 55896666512671677752204530674358887 No 171 >KOG0733 consensus Probab=92.43 E-value=0.87 Score=24.77 Aligned_cols=141 Identities=18% Similarity=0.129 Sum_probs=78.2 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCH---- Q ss_conf 6552477886115899999998863024667876069999516776789999985114674211589972887888---- Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPA---- 240 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~---- 240 (509) .++-|++|||||.|+ .+|.|+.+.....-....+ |=+--||.-.-++|+.+. .-...-++++.---|.=. T Consensus 226 vLlHGPPGCGKT~lA-~AiAgel~vPf~~isApei--vSGvSGESEkkiRelF~~---A~~~aPcivFiDeIDAI~pkRe 299 (802) T KOG0733 226 VLLHGPPGCGKTSLA-NAIAGELGVPFLSISAPEI--VSGVSGESEKKIRELFDQ---AKSNAPCIVFIDEIDAITPKRE 299 (802) T ss_pred EEEECCCCCCHHHHH-HHHHHHCCCCEEEECCHHH--HCCCCCCCHHHHHHHHHH---HHCCCCEEEEEECCCCCCCCHH T ss_conf 164489986478999-9975212885485141465--315575228999999998---7366975998511001364404 Q ss_pred -----HHHHHHHHHHHHHHHHHH--CCCCCEEEEE-----ECHH----HHHHHHHHHHH-----------------HCCC Q ss_conf -----999998876666667542--0468847996-----1647----89999987545-----------------3037 Q gi|254780617|r 241 -----PMQLLAPFAGCAMGEYFR--DNGYHALIAY-----DDLQ----KHAVAYRQLSL-----------------LLRR 287 (509) Q Consensus 241 -----~~r~~ap~~a~aiAEyfr--~~G~~VLi~~-----Ddlt----r~A~a~reisl-----------------~~~~ 287 (509) .+|.+.+..-+.|-|--+ ..|+.|||+- |+|. |--+.-|||.| -|+- T Consensus 300 ~aqreMErRiVaQLlt~mD~l~~~~~~g~~VlVIgATnRPDslDpaLRRaGRFdrEI~l~vP~e~aR~~IL~~~~~~lrl 379 (802) T KOG0733 300 EAQREMERRIVAQLLTSMDELSNEKTKGDPVLVIGATNRPDSLDPALRRAGRFDREICLGVPSETAREEILRIICRGLRL 379 (802) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCCHHHHCCCCCCCEEEECCCCHHHHHHHHHHHHHHCCC T ss_conf 57889999999999985100256666899769982478976558777325655323530689668899999999862777 Q ss_pred CCC------CCCCCCCCHHHHHHHHHHHHH Q ss_conf 875------433375200124678888750 Q gi|254780617|r 288 PPG------REAYPGDVFYLHSRLLERAAK 311 (509) Q Consensus 288 pp~------~~gyp~~vf~~~s~l~ERa~~ 311 (509) .+- .+--||-|-..+..|.-+||. T Consensus 380 ~g~~d~~qlA~lTPGfVGADL~AL~~~Aa~ 409 (802) T KOG0733 380 SGDFDFKQLAKLTPGFVGADLMALCREAAF 409 (802) T ss_pred CCCCCHHHHHHCCCCCCCHHHHHHHHHHHH T ss_conf 877689999751887521419999999999 No 172 >PRK13642 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=92.40 E-value=0.079 Score=32.37 Aligned_cols=33 Identities=21% Similarity=0.428 Sum_probs=26.2 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 1602331-01111568336552477886115899 Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) +.+.+++ .=+.|.+|++.+|+|.+|+||||++. T Consensus 18 ~~v~aL~~is~~i~~Ge~~aiiG~sGsGKSTL~~ 51 (277) T PRK13642 18 SDVNQLNGVSFSITKGEWVSIIGQNGSGKSTTAR 51 (277) T ss_pred CCCEEEECEEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 9886644307998899899999999968999999 No 173 >PRK11022 dppD dipeptide transporter ATP-binding subunit; Provisional Probab=92.40 E-value=0.081 Score=32.25 Aligned_cols=35 Identities=26% Similarity=0.437 Sum_probs=28.7 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 02331-0111156833655247788611589999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) ++|+| .=+.+.+|+..+|+|.+|+||||++. +|+. T Consensus 20 v~Av~~Vsf~i~~GEilgivGeSGsGKSTl~~-~ilg 55 (327) T PRK11022 20 FKAVDRISYSVKQGEVVGIVGESGSGKSVSSL-AIMG 55 (327) T ss_pred EEEEECEEEEECCCCEEEEECCCCCHHHHHHH-HHHC T ss_conf 99984418798899999999999878999999-9974 No 174 >PRK00080 ruvB Holliday junction DNA helicase RuvB; Reviewed Probab=92.39 E-value=0.25 Score=28.76 Aligned_cols=51 Identities=20% Similarity=0.195 Sum_probs=30.5 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 365524778861158999999988630246678760699995167767899999851146 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDR 223 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~ 223 (509) .+++.|++|.|||||+. .|.|.-+.+... .=+-.=||..++..+...+++. T Consensus 53 H~Ll~GPPGlGKTTLA~-iiA~E~~~~~~~--------tsGP~lek~~DL~~iLt~l~~~ 103 (328) T PRK00080 53 HVLLYGPPGLGKTTLAN-IIANEMGVNIRI--------TSGPALEKAGDLAALLTNLEEG 103 (328) T ss_pred CEEEECCCCCCHHHHHH-HHHHHHCCCCEE--------CCCCCCCCHHHHHHHHHHCCCC T ss_conf 05765889988999999-999986888156--------2450016747899999608878 No 175 >PRK13635 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=92.38 E-value=0.082 Score=32.23 Aligned_cols=27 Identities=22% Similarity=0.416 Sum_probs=23.7 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 101111568336552477886115899 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) |.=+.|.+|+..+|+|.+|+|||||+. T Consensus 25 ~isl~i~~GE~vaivG~nGsGKSTL~k 51 (279) T PRK13635 25 DVSFSVYEGEWVAIVGHNGSGKSTLAK 51 (279) T ss_pred EEEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 307688799899999999965999999 No 176 >cd03247 ABCC_cytochrome_bd The CYD subfamily implicated in cytochrome bd biogenesis. The CydC and CydD proteins are important for the formation of cytochrome bd terminal oxidase of E. coli and it has been proposed that they were necessary for biosynthesis of the cytochrome bd quinol oxidase and for periplasmic c-type cytochromes. CydCD were proposed to determine a heterooligomeric complex important for heme export into the periplasm or to be involved in the maintenance of the proper redox state of the periplasmic space. In Bacillus subtilius, the absence of CydCD does not affect the presence of halo-cytochrome c in the membrane and this observation suggests that CydCD proteins are not involved in the export of heme in this organism. Probab=92.37 E-value=0.083 Score=32.18 Aligned_cols=30 Identities=13% Similarity=0.441 Sum_probs=24.6 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 0111156833655247788611589999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .=+.+-+|++.+|.|.+|+|||||+ .+|.. T Consensus 21 i~l~i~~Ge~~aivG~sGsGKSTLl-~~l~G 50 (178) T cd03247 21 LSLELKQGEKIALLGRSGSGKSTLL-QLLTG 50 (178) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHHHH T ss_conf 5899869999999999987599999-99986 No 177 >cd03245 ABCC_bacteriocin_exporters ABC-type bacteriocin exporters. Many non-lantibiotic bacteriocins of lactic acid bacteria are produced as precursors which have N-terminal leader peptides that share similarities in amino acid sequence and contain a conserved processing site of two glycine residues in positions -1 and -2. A dedicated ATP-binding cassette (ABC) transporter is responsible for the proteolytic cleavage of the leader peptides and subsequent translocation of the bacteriocins across the cytoplasmic membrane. Probab=92.32 E-value=0.086 Score=32.07 Aligned_cols=29 Identities=24% Similarity=0.475 Sum_probs=23.9 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+-+|++++|.|.+|+|||||+ .+|. T Consensus 23 isl~i~~G~~v~ivG~sGsGKSTLl-~ll~ 51 (220) T cd03245 23 VSLTIRAGEKVAIIGRVGSGKSTLL-KLLA 51 (220) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 5999879999999999998599999-9996 No 178 >PRK07263 consensus Probab=92.31 E-value=0.15 Score=30.31 Aligned_cols=42 Identities=24% Similarity=0.393 Sum_probs=34.8 Q ss_pred HHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 4416023310111-15683365524778861158999999988 Q gi|254780617|r 146 LSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 146 l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) +.||++.+|.++- +-+|+=+-|=|-++.|||++++.++.|-+ T Consensus 186 i~TGf~~LD~~t~Gl~~GdLiviaaRPsmGKTa~alnia~~iA 228 (453) T PRK07263 186 LPTGFRDLDKITTGLHPDQLIILAARPAVGKTAFVLNIAQNVG 228 (453) T ss_pred CCCCHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 7588587997732899786899972788847899999999999 No 179 >TIGR03608 L_ocin_972_ABC putative bacteriocin export ABC transporter, lactococcin 972 group. A gene pair with a fairly wide distribution consists of a polypeptide related to the lactococcin 972 (see TIGR01653) and multiple-membrane-spanning putative immunity protein (see TIGR01654). This model represents a small clade within the ABC transporters that regularly are found adjacent to these bacteriocin system gene pairs and are likely serve as export proteins. Probab=92.30 E-value=0.089 Score=31.97 Aligned_cols=28 Identities=21% Similarity=0.532 Sum_probs=23.4 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 01111568336552477886115899999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .=+.|-+|+-..|+|++|+|||||+ .+| T Consensus 17 vsl~i~~Ge~~~i~GpSGsGKSTLL-~~i 44 (206) T TIGR03608 17 LNLTIEKGKMVAIVGESGSGKSTLL-NII 44 (206) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 0779869989999879997099999-999 No 180 >PRK11176 lipid transporter ATP-binding/permease protein; Provisional Probab=92.29 E-value=0.077 Score=32.41 Aligned_cols=26 Identities=23% Similarity=0.492 Sum_probs=16.7 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 111568336552477886115899999 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) +.|-+|++.+|.|.+|+|||||+ ..+ T Consensus 363 l~I~~G~~vaiVG~SGsGKSTL~-~LL 388 (581) T PRK11176 363 FKIPAGKTVALVGRSGSGKSTIA-NLL 388 (581) T ss_pred CCCCCCCEEECCCCCCCCHHHHH-HHH T ss_conf 35799944312289998678999-999 No 181 >cd00009 AAA The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases. Probab=92.24 E-value=0.39 Score=27.34 Aligned_cols=25 Identities=28% Similarity=0.381 Sum_probs=19.6 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 68336552477886115899999998 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) .+.-+++.|++|||||+++. .|.++ T Consensus 18 ~~~~ill~GppGtGKT~la~-~ia~~ 42 (151) T cd00009 18 PPKNLLLYGPPGTGKTTLAR-AIANE 42 (151) T ss_pred CCCEEEEECCCCCCHHHHHH-HHHHH T ss_conf 99808998999988659999-99997 No 182 >PRK10247 putative ABC transporter ATP-binding protein YbbL; Provisional Probab=92.21 E-value=0.097 Score=31.71 Aligned_cols=30 Identities=20% Similarity=0.390 Sum_probs=24.3 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.+-+|+..+|+|++|+|||||+ .+|. T Consensus 25 ~is~~i~~Ge~~~i~G~sGsGKSTLl-k~i~ 54 (225) T PRK10247 25 NINFSLRAGEFKLITGPSGCGKSTLL-KIVA 54 (225) T ss_pred CEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 51799859969999999999999999-9996 No 183 >cd03288 ABCC_SUR2 The SUR domain 2. The sulfonylurea receptor SUR is an ATP binding cassette (ABC) protein of the ABCC/MRP family. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=92.19 E-value=0.095 Score=31.75 Aligned_cols=29 Identities=34% Similarity=0.440 Sum_probs=23.4 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.|-+|++++|+|.+|+|||||+. .+. T Consensus 40 inl~I~~Ge~vaIvG~sGsGKSTL~~-ll~ 68 (257) T cd03288 40 VKAYIKPGQKVGICGRTGSGKSSLSL-AFF 68 (257) T ss_pred EEEEECCCCEEEEECCCCCCHHHHHH-HHH T ss_conf 38998799999999999981999999-996 No 184 >cd03254 ABCC_Glucan_exporter_like Glucan exporter ATP-binding protein. In A. tumefaciens cyclic beta-1, 2-glucan must be transported into the periplasmic space to exert its action as a virluence factor. This subfamily belongs to the MRP-like family and is involved in drug, peptide, and lipid export. The MRP-like family, similar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains each composed of six transmembrane (TM) helices and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=92.17 E-value=0.095 Score=31.77 Aligned_cols=30 Identities=27% Similarity=0.473 Sum_probs=24.0 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 0111156833655247788611589999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .=+.|-+|++..|+|.+|+|||||+ ..+.. T Consensus 22 inl~i~~Ge~vaivG~sGsGKSTLl-~ll~g 51 (229) T cd03254 22 INFSIKPGETVAIVGPTGAGKTTLI-NLLMR 51 (229) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHHHC T ss_conf 2999879999999999998099999-99966 No 185 >cd03294 ABC_Pro_Gly_Bertaine This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=92.14 E-value=0.089 Score=31.97 Aligned_cols=29 Identities=21% Similarity=0.437 Sum_probs=24.2 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 101111568336552477886115899999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |.=+.+.+|....|+|++|+|||||+ -+| T Consensus 42 ~vsl~i~~GE~~~ivG~SGsGKSTLL-r~i 70 (269) T cd03294 42 DVSLDVREGEIFVIMGLSGSGKSTLL-RCI 70 (269) T ss_pred CCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 74758889999999989984899999-999 No 186 >PRK13632 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=92.11 E-value=0.092 Score=31.85 Aligned_cols=29 Identities=28% Similarity=0.453 Sum_probs=23.5 Q ss_pred HHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 331-01111568336552477886115899 Q gi|254780617|r 152 AID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 152 ~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) ++| .=+.|-+|+..+|+|.+|+|||||+. T Consensus 25 ~L~~isl~i~~Ge~vaivG~nGsGKSTLlk 54 (273) T PRK13632 25 ALKNVSFTINEGEYVAILGHNGSGKSTISK 54 (273) T ss_pred EEEEEEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 066428898499899999999986999999 No 187 >PRK10789 putative multidrug transporter membrane\ATP-binding components; Provisional Probab=92.11 E-value=0.11 Score=31.43 Aligned_cols=27 Identities=22% Similarity=0.351 Sum_probs=16.5 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1115683365524778861158999999 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +.|-+|++.+|.|.+|+|||||+ ..+. T Consensus 336 l~I~~Ge~vaIVG~SGsGKSTLl-~LL~ 362 (569) T PRK10789 336 FTLKPGQMLGICGPTGSGKSTLL-SLIQ 362 (569) T ss_pred CEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 68889978998799999879999-9999 No 188 >TIGR01277 thiQ thiamine ABC transporter, ATP-binding protein; InterPro: IPR005968 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Thiamine pyrophosphate (TPP) is a required cofactor synthesized de novo in Salmonella typhimurium. The primary role for TPP is in central metabolism as an electron carrier and nucleophile for such enzymes as pyruvate dehydrogenase (1.2.4.1 from EC), acetolactate synthase (4.1.3.18 from EC), and alpha-ketoglutarate dehydrogenase (1.2.4.2 from EC). Despite its importance in cellular physiology, neither the de novo biosynthetic pathway nor the salvage systems for thiamine are fully understood in any organism. The model describes thiamine ABC transporter, ATP-binding protein, believed to be involved in the specific translocation of thiamine and its phosphoesters across the inner membrane The protein belongs to the larger ABC transport system which consists of at least three components: the inner membrane permease; thiamine binding protein and an ATP-binding subunit. This protein is found so far only in Proteobacteria, and is found in complete genomes only if the ThiB and ThiP subunits are also found. It has been experimentally demonstrated that mutants in the various steps in the de novo synthesis of thiamine and its biologically active form, namely thiamine pyrophosphate can be exogenously supplemented with thiamine, thiamine monophosphate or thiamine pyrophosphate. ; GO: 0005524 ATP binding, 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=92.11 E-value=0.091 Score=31.89 Aligned_cols=27 Identities=22% Similarity=0.537 Sum_probs=22.9 Q ss_pred CCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 1111568336552477886115899999 Q gi|254780617|r 156 LIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 156 l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) =+++-+|.++.|+|+||.|||||+ ..| T Consensus 18 ~L~V~~Ge~VAi~GpSGAGKSTLL-nLi 44 (213) T TIGR01277 18 DLSVEDGERVAILGPSGAGKSTLL-NLI 44 (213) T ss_pred ECCCCCCCEEEEECCCCCCHHHHH-HHH T ss_conf 041301776888758986278898-778 No 189 >PRK05800 cobU adenosylcobinamide kinase/adenosylcobinamide-phosphate guanylyltransferase; Validated Probab=92.11 E-value=0.63 Score=25.80 Aligned_cols=89 Identities=19% Similarity=0.240 Sum_probs=51.2 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCH-HHHHHHHHHHCCCCCCCEEEEEEECCCCCH Q ss_conf 8336552477886115899999998863024667876069999516776-789999985114674211589972887888 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKR-SSVARFVKALEDRGALSYSIVVVASASDPA 240 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~-~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~ 240 (509) |.-.+|.||+..|||..|-..+.. . ...+ +|++-++.. .|.++=++..++.-.-...++- .+- T Consensus 1 gmi~LVtGG~rSGKS~~AE~la~~-~--------~~~~--~YiAT~~~~D~Em~~RI~~Hr~~R~~~w~TiE-~p~---- 64 (170) T PRK05800 1 GMLILVTGGARSGKSRFAERLAAQ-S--------GLQV--LYIATAQPLDDEMAARIAHHRQRRPAHWQTVE-EPL---- 64 (170) T ss_pred CCEEEEECCCCCCHHHHHHHHHHH-C--------CCCE--EEEECCCCCCHHHHHHHHHHHHCCCCCCEEEE-CCC---- T ss_conf 989999798763489999999985-6--------9982--99975888887899999999973789957996-466---- Q ss_pred HHHHHHHHHHHHHHHHHH-CCCCCEEEEEECHHHHHHH Q ss_conf 999998876666667542-0468847996164789999 Q gi|254780617|r 241 PMQLLAPFAGCAMGEYFR-DNGYHALIAYDDLQKHAVA 277 (509) Q Consensus 241 ~~r~~ap~~a~aiAEyfr-~~G~~VLi~~Ddltr~A~a 277 (509) .+++.+. ....+-.+++|+||.|..- T Consensus 65 -----------~l~~~l~~~~~~~~~vLlDclt~wl~N 91 (170) T PRK05800 65 -----------DLAELLRADAAPGRCVLVDCLTTWVTN 91 (170) T ss_pred -----------CHHHHHHHHCCCCCEEEEHHHHHHHHH T ss_conf -----------789999874577886872267899999 No 190 >PRK08760 replicative DNA helicase; Provisional Probab=92.09 E-value=0.16 Score=30.05 Aligned_cols=43 Identities=19% Similarity=0.253 Sum_probs=34.0 Q ss_pred HHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 44416023310111-15683365524778861158999999988 Q gi|254780617|r 145 PLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 145 ~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) -+.||++.+|.++. +-+|+=+-|=|-++.|||++++.+..|-+ T Consensus 211 Gi~TG~~~LD~~t~Gl~~G~LiViaaRPsmGKTalalnia~~~A 254 (476) T PRK08760 211 GLPTGYNDFDAMTAGLQPTDLIILAARPAMGKTTFALNIAEYAA 254 (476) T ss_pred ECCCCCHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 36779688997446998777799987788747899999999999 No 191 >PRK10744 phosphate transporter subunit; Provisional Probab=92.08 E-value=0.1 Score=31.55 Aligned_cols=49 Identities=27% Similarity=0.369 Sum_probs=31.5 Q ss_pred CCCCCCCCCCHHHHHCCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 5773112660014441602331-01111568336552477886115899999 Q gi|254780617|r 133 APGIIQRQSVCEPLSTGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 133 ~p~~~~R~~i~~~l~TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) +|+.++-+-++--+ -+..+++ .=+.|.+|+..+|+|++|+|||||+ .+| T Consensus 7 ~~~~I~v~~ls~~y-g~~~aL~~vsl~i~~Ge~~~liG~nGaGKSTLl-k~i 56 (257) T PRK10744 7 APSKIQVRDLNFYY-GKFHALKNINLDIAKNQVTAFIGPSGCGKSTLL-RTF 56 (257) T ss_pred CCCCEEEEEEEEEE-CCEEEECCCEEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 99728998689996-997678142899889989999999998199999-999 No 192 >PRK11147 ABC transporter ATPase component; Reviewed Probab=92.08 E-value=0.099 Score=31.65 Aligned_cols=23 Identities=22% Similarity=0.553 Sum_probs=13.3 Q ss_pred CCCCCCCEEEEECCCCCCCHHHH Q ss_conf 11156833655247788611589 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +.|.+|+|++|+|..|+|||||+ T Consensus 24 l~I~~Ge~vgLVG~NGsGKSTLl 46 (632) T PRK11147 24 LHIEDNERVCLVGRNGAGKSTLM 46 (632) T ss_pred EEECCCCEEEEECCCCCHHHHHH T ss_conf 99989989999999998799999 No 193 >CHL00181 cbbX CbbX; Provisional Probab=92.04 E-value=0.58 Score=26.06 Aligned_cols=62 Identities=18% Similarity=0.183 Sum_probs=32.2 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEE------EECCCCHHHHHHHHH Q ss_conf 0111156833655247788611589999999886302466787606999------951677678999998 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIY------VAIGQKRSSVARFVK 218 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~------~~IGer~~ev~~~~~ 218 (509) ++-|...+--+...|+|||||||+| -++.+.-...+.-.+..++.+- .-||+...-.++.++ T Consensus 52 Gl~~~~~s~h~vF~GnPGTGKTTVA--Rl~a~il~~lG~L~~g~vve~~r~dLvg~yvG~Ta~kt~~~i~ 119 (287) T CHL00181 52 GLVSSSPGLHMSFTGSPGTGKTTVA--LKMADILYRLGYIKKGHLITVTRDDLVGQYIGHTAPKTKEVLK 119 (287) T ss_pred CCCCCCCCCEEEEECCCCCCHHHHH--HHHHHHHHHCCCCCCCEEEEECHHHHCCCCCCCCHHHHHHHHH T ss_conf 9998887653888789986799999--9999999986995589589953588416353521699999999 No 194 >PRK10584 putative ABC transporter ATP-binding protein YbbA; Provisional Probab=92.02 E-value=0.093 Score=31.82 Aligned_cols=48 Identities=17% Similarity=0.413 Sum_probs=31.8 Q ss_pred CCCCCCCHHHHHCCH---HHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 311266001444160---2331-011115683365524778861158999999 Q gi|254780617|r 136 IIQRQSVCEPLSTGI---KAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 136 ~~~R~~i~~~l~TGI---~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +++-+-++.-+.+|- .+++ .=+.+-+|+..+|+|.+|+|||||+ .+|. T Consensus 6 ilev~~lsk~y~~~~~~~~vL~~isl~i~~GE~v~ivG~sGsGKSTLl-~~i~ 57 (228) T PRK10584 6 IVEVHHLKKSVGQGEHELSILTGVELVVKRGETIALIGESGSGKSTLL-AILA 57 (228) T ss_pred EEEEECEEEEECCCCEEEEEEECEEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 899968499989998279998473889999989999999985899999-9996 No 195 >TIGR03015 pepcterm_ATPase putative secretion ATPase, PEP-CTERM locus subfamily. Members of this protein are marked as probable ATPases by the nucleotide binding P-loop motif GXXGXGKTT, a motif DEAQ similar to the DEAD/H box of helicases, and extensive homology to ATPases of MSHA-type pilus systems and to GspA proteins associated with type II protein secretion systems. Probab=92.01 E-value=0.98 Score=24.42 Aligned_cols=124 Identities=15% Similarity=0.231 Sum_probs=56.1 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHH Q ss_conf 36552477886115899999998863024667876069999516776789999985114674211589972887888999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQ 243 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r 243 (509) =..|.|+.|+|||+++-..+ .+. ..+.++++|.. . ..-+..+++..+...=.++ ....+.....+ T Consensus 45 ~~lltGe~GtGKTtllr~l~-~~l-------~~~~~~~~~i~-~-~~l~~~~ll~~i~~~lg~~-----~~~~~~~~~~~ 109 (269) T TIGR03015 45 FILITGEVGAGKTTLIRNLL-KRL-------DQERVVAAKLV-N-TRVDAEDLLRMVAADFGLE-----TEGRDKAALLR 109 (269) T ss_pred EEEEECCCCCCHHHHHHHHH-HHC-------CCCCEEEEEEC-C-CCCCHHHHHHHHHHHCCCC-----CCCCCHHHHHH T ss_conf 59997299898899999999-845-------93454899976-9-9999999999999985989-----88989999999 Q ss_pred HHHHHHHHHHHHHH---HCCCCCEEEEEECHHHHH-HHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 99887666666754---204688479961647899-99987545303787543337520012467888875 Q gi|254780617|r 244 LLAPFAGCAMGEYF---RDNGYHALIAYDDLQKHA-VAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAA 310 (509) Q Consensus 244 ~~ap~~a~aiAEyf---r~~G~~VLi~~Ddltr~A-~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~ 310 (509) .+-+|+ ..+|+.+++++|+--... ++..++-++++--.......--++.-.+.|-++-. T Consensus 110 --------~l~~~L~~~~~~g~~~vliIDEAq~L~~~~Le~Lr~L~n~e~~~~~ll~iiL~GqpeL~~~L~ 172 (269) T TIGR03015 110 --------ELEDFLIEQFAAGKRALLVVDEAQNLTPELLEELRMLSNFQTDNAKLLQIFLVGQPEFRETLQ 172 (269) T ss_pred --------HHHHHHHHHHHCCCCEEEEEECHHHCCHHHHHHHHHHHCCCCCCCCCEEEEEECCHHHHHHHC T ss_conf --------999999999966994699972422199999999999970135888704899957867999872 No 196 >PRK13640 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=92.00 E-value=0.098 Score=31.65 Aligned_cols=29 Identities=31% Similarity=0.488 Sum_probs=23.8 Q ss_pred HHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 331-01111568336552477886115899 Q gi|254780617|r 152 AID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 152 ~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) +++ .=+.|-+|+..+|+|++|+|||||+. T Consensus 23 aL~~vsl~I~~Ge~vaiiG~nGsGKSTL~~ 52 (283) T PRK13640 23 ALKDVSFSIPRGSWTALIGHNGSGKSTISK 52 (283) T ss_pred EEECCEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 787718699899999999999987999999 No 197 >cd03262 ABC_HisP_GlnQ_permeases HisP and GlnQ are the ATP-binding components of the bacterial periplasmic histidine and glutamine permeases, repectively. Histidine permease is a multisubunit complex containing the HisQ and HisM integral membrane subunits and two copies of HisP. HisP has properties intermediate between those of integral and peripheral membrane proteins and is accessible from both sides of the membrane, presumably by its interaction with HisQ and HisM. The two HisP subunits form a homodimer within the complex. The domain structure of the amino acid uptake systems is typical for prokaryote extracellular solute binding protein-dependent uptake systems. All of the amino acid uptake systems also have at least one, and in a few cases, two extracellular solute binding proteins located in the periplasm of Gram-negative bacteria, or attached to the cell membrane of Gram-positive bacteria. The best-studied member of the PAAT (polar amino acid transport) family is the HisJQM Probab=91.97 E-value=0.1 Score=31.62 Aligned_cols=28 Identities=21% Similarity=0.485 Sum_probs=23.4 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 01111568336552477886115899999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .=+.+-+|....|+|+||+|||||+ -+| T Consensus 19 vsl~i~~Ge~~~ivGpSGsGKSTLL-~~i 46 (213) T cd03262 19 IDLTVKKGEVVVIIGPSGSGKSTLL-RCI 46 (213) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 0759889989999999984499999-999 No 198 >PRK07004 replicative DNA helicase; Provisional Probab=91.95 E-value=0.18 Score=29.78 Aligned_cols=42 Identities=19% Similarity=0.290 Sum_probs=33.7 Q ss_pred HHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 4416023310111-15683365524778861158999999988 Q gi|254780617|r 146 LSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 146 l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) +.||++.+|.++- +-+|+=+-|=|-+++|||++++.+..|-+ T Consensus 196 i~TGf~~LD~~t~Gl~~gdLiIIAARPsmGKTafAlniA~n~A 238 (460) T PRK07004 196 TPTGFVDLDRMTSGMHGGELIIVAGRPSMGKTAFSMNIGEYVA 238 (460) T ss_pred CCCCHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 8679388986523898775799973687642699999999998 No 199 >COG0444 DppD ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism] Probab=91.95 E-value=0.11 Score=31.21 Aligned_cols=33 Identities=24% Similarity=0.449 Sum_probs=27.9 Q ss_pred HHHHHC-CCCCCCCCEEEEECCCCCCCHHHHHHH Q ss_conf 023310-111156833655247788611589999 Q gi|254780617|r 150 IKAIDS-LIPIGRGQRELIIGDRKTGKTSIILDT 182 (509) Q Consensus 150 I~~ID~-l~pigrGQR~~I~g~~g~GKt~l~~~~ 182 (509) ++++|. =+.+.+|+..+|+|.+|+|||+++.-. T Consensus 18 v~av~~vs~~i~~GE~lgiVGESGsGKS~~~~ai 51 (316) T COG0444 18 VKAVDGVSFELKKGEILGIVGESGSGKSVLAKAI 51 (316) T ss_pred EEEEECEEEEECCCCEEEEECCCCCCHHHHHHHH T ss_conf 7777140588758968999838978899999999 No 200 >PRK06321 replicative DNA helicase; Provisional Probab=91.94 E-value=0.35 Score=27.65 Aligned_cols=44 Identities=18% Similarity=0.205 Sum_probs=35.0 Q ss_pred HHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 44416023310111-156833655247788611589999999886 Q gi|254780617|r 145 PLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQKS 188 (509) Q Consensus 145 ~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~~ 188 (509) -+.||++-+|.++. +-+|+=+.|=|-++.|||++++.+..|-+. T Consensus 208 GipTGf~~LD~lt~Gl~~GdliviaaRPsmGKTalalnia~~~a~ 252 (472) T PRK06321 208 GIPTHFIDLDKMINGFSPSNLMILAARPAMGKTALALNIAENFCF 252 (472) T ss_pred CCCCCHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHH T ss_conf 225684889998559886757998538999779999999999998 No 201 >PRK13650 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=91.89 E-value=0.1 Score=31.51 Aligned_cols=29 Identities=24% Similarity=0.407 Sum_probs=24.1 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 101111568336552477886115899999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |.=+.|-+|+..+|+|.+|+||||++. +| T Consensus 22 ~vsl~i~~GE~vaivG~nGsGKSTL~~-~l 50 (276) T PRK13650 22 DVSFHVKQGEWLSIIGHNGSGKSTTVR-LI 50 (276) T ss_pred CCEEEECCCCEEEEECCCCCHHHHHHH-HH T ss_conf 758799899899999999987999999-99 No 202 >PRK05636 replicative DNA helicase; Provisional Probab=91.89 E-value=0.32 Score=27.92 Aligned_cols=57 Identities=12% Similarity=0.179 Sum_probs=24.3 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCC Q ss_conf 9999987545303787543337520012467888875045445887356753100367664 Q gi|254780617|r 274 HAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDV 334 (509) Q Consensus 274 ~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~ 334 (509) ...|..++.-+..+.. .+++.. +--.+...+++.-+..+ .+|.+|.+||=...=|++ T Consensus 204 ld~ae~~l~~i~~~~~-~~~~~~-~~d~l~~~~~~ie~~~~--~~g~~~Gi~TGf~~LD~~ 260 (507) T PRK05636 204 IDRAQQEVFAVSQKNQ-SEDYAV-LADILDETMAEIEMLES--DGGIATGIPTGFKDLDDL 260 (507) T ss_pred HHHHHHHHHHHHCCCC-CCCCEE-HHHHHHHHHHHHHHHHH--CCCCCEEEECCCHHHHHH T ss_conf 9999999999860377-888556-99999999999999985--799952565880889975 No 203 >KOG0057 consensus Probab=91.85 E-value=0.11 Score=31.27 Aligned_cols=27 Identities=33% Similarity=0.669 Sum_probs=14.3 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1115683365524778861158999999 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) ++|.+|-|.+|.|++|+||||+ +-+++ T Consensus 373 f~I~kGekVaIvG~nGsGKSTi-lr~Ll 399 (591) T KOG0057 373 FTIPKGEKVAIVGSNGSGKSTI-LRLLL 399 (591) T ss_pred EEECCCCEEEEECCCCCCHHHH-HHHHH T ss_conf 8864897898978999878899-99999 No 204 >PRK13637 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=91.83 E-value=0.097 Score=31.69 Aligned_cols=31 Identities=35% Similarity=0.510 Sum_probs=25.3 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 02331-01111568336552477886115899 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) .+|+| .=+.|.+|+..+|+|.+|+|||||+- T Consensus 20 ~~aL~~isl~i~~GE~v~iiG~nGsGKSTLl~ 51 (287) T PRK13637 20 KKALDNVNIEIEDGEFVALIGHTGSGKSTLIQ 51 (287) T ss_pred CEEEECEEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 81753207698799899999999939999999 No 205 >cd03295 ABC_OpuCA_Osmoprotection OpuCA is a the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment. ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition, to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=91.80 E-value=0.1 Score=31.47 Aligned_cols=33 Identities=27% Similarity=0.474 Sum_probs=26.0 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) -+++| .=+.+-+|+...|+|++|+||||++ -+| T Consensus 14 ~~al~~vsl~i~~Ge~~~ilGpSG~GKSTll-r~i 47 (242) T cd03295 14 KKAVNNLNLEIAKGEFLVLIGPSGSGKTTTM-KMI 47 (242) T ss_pred CEEEEEEEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 8898302768869989999999995699999-999 No 206 >cd03229 ABC_Class3 This class is comprised of all BPD (Binding Protein Dependent) systems that are largely represented in archaea and eubacteria and are primarily involved in scavenging solutes from the environment. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=91.79 E-value=0.11 Score=31.25 Aligned_cols=32 Identities=16% Similarity=0.415 Sum_probs=25.0 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 2331-01111568336552477886115899999 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) +++| .=+.+-+|....|+|++|+||||++ -+| T Consensus 14 ~~l~~vs~~i~~Ge~~~ivGpSG~GKSTll-r~i 46 (178) T cd03229 14 TVLNDVSLNIEAGEIVALLGPSGSGKSTLL-RCI 46 (178) T ss_pred EEECCCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 998370769889989999999998399999-999 No 207 >cd01122 GP4d_helicase GP4d_helicase is a homohexameric 5'-3' helicases. Helicases couple NTP hydrolysis to the unwinding of nucleic acid duplexes into their component strands. Probab=91.79 E-value=0.72 Score=25.37 Aligned_cols=65 Identities=17% Similarity=0.246 Sum_probs=42.5 Q ss_pred HHHHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHH Q ss_conf 144416023310111-156833655247788611589999999886302466787606999951677678999 Q gi|254780617|r 144 EPLSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVAR 215 (509) Q Consensus 144 ~~l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~ 215 (509) +-+.++...+|-++- +.+|+=..|-|.+|+|||+++..++.|-..++. .-+.|...=....++.. T Consensus 11 ~~~~~pf~~LD~~~~Gl~~GeL~viaarpg~GKT~f~~~~a~~~~~~~g-------~~vl~~SlEm~~~~~~~ 76 (271) T cd01122 11 EEVWWPFPVLNKLTKGLRKGELIILTAGTGVGKTTFLREYALDLITQHG-------VRVGTISLEEPVVRTAR 76 (271) T ss_pred CCCCCCCHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHCC-------CEEEEEECCCCHHHHHH T ss_conf 5665881668887379999808999968998699999999999999769-------90899970499999999 No 208 >cd04125 RabA_like RabA-like subfamily. RabA was first identified in D. discoideum, where its expression levels were compared to other Rabs in growing and developing cells. The RabA mRNA levels were below the level of detection by Northern blot analysis, suggesting a very low level of expression. The function of RabA remains unknown. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins. Probab=91.76 E-value=0.3 Score=28.11 Aligned_cols=19 Identities=37% Similarity=0.588 Sum_probs=16.1 Q ss_pred EEEEECCCCCCCHHHHHHH Q ss_conf 3655247788611589999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDT 182 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~ 182 (509) |+.++|++|||||+|+.-. T Consensus 2 KivvvGd~~VGKTsli~r~ 20 (188) T cd04125 2 KVVIIGDYGVGKSSLLKRF 20 (188) T ss_pred EEEEECCCCCCHHHHHHHH T ss_conf 7999999997899999999 No 209 >pfam07475 Hpr_kinase_C HPr Serine kinase C-terminal domain. This family represents the C terminal kinase domain of Hpr Serine/threonine kinase PtsK. This kinase is the sensor in a multicomponent phosphorelay system in control of carbon catabolic repression in bacteria. This kinase in unusual in that it recognizes the tertiary structure of its target and is a member of a novel family unrelated to any previously described protein phosphorylating enzymes. X-ray analysis of the full-length crystalline enzyme from Staphylococcus xylosus at a resolution of 1.95 A shows the enzyme to consist of two clearly separated domains that are assembled in a hexameric structure resembling a three-bladed propeller. Probab=91.75 E-value=0.42 Score=27.05 Aligned_cols=61 Identities=20% Similarity=0.286 Sum_probs=38.1 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEE-----EECCCCHHHHHHHHHHHCCCCCC Q ss_conf 56833655247788611589999999886302466787606999-----95167767899999851146742 Q gi|254780617|r 160 GRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIY-----VAIGQKRSSVARFVKALEDRGAL 226 (509) Q Consensus 160 grGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~-----~~IGer~~ev~~~~~~l~~~~~~ 226 (509) .-|.=++|.|++|.|||.++++.|-. +. .-..|+.+.++ .++|....-.+.|.+ ++.-|.+ T Consensus 16 v~G~GVLi~G~sgiGKSe~aLeLi~r--Gh---~LVaDD~v~~~~~~~~~l~G~~p~~~~~~lE-vRGlGii 81 (171) T pfam07475 16 VYGVGVLITGESGIGKSETALELIKR--GH---RLVADDAVEIKRIGEKTLVGRAPEILRHFLE-VRGLGII 81 (171) T ss_pred ECCEEEEEECCCCCCHHHHHHHHHHC--CC---EEEECCEEEEEEECCCEEEEECCHHHHHHHH-HCCCCEE T ss_conf 82779999858999877999999982--99---0880673899993699899979868741775-4588488 No 210 >PRK13634 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=91.71 E-value=0.1 Score=31.47 Aligned_cols=30 Identities=27% Similarity=0.453 Sum_probs=24.3 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 2331-01111568336552477886115899 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) +|++ .=+.|-+|+..+|+|.+|+|||||+. T Consensus 8 ~AL~~Vsl~i~~Ge~vaiiG~sGsGKSTLl~ 38 (276) T PRK13634 8 RALYDVNVSIPSGSYVAIIGHTGSGKSTLLQ 38 (276) T ss_pred EEEECCEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 6651447799899899999999969999999 No 211 >cd03261 ABC_Org_Solvent_Resistant ABC (ATP-binding cassette) transport system involved in resistant to organic solvents; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=91.67 E-value=0.12 Score=31.10 Aligned_cols=32 Identities=22% Similarity=0.411 Sum_probs=25.1 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 2331-01111568336552477886115899999 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .++| .=+.|-+|+-.+|+|.+|+||||++. +| T Consensus 14 ~~L~~isl~i~~Ge~~~iiG~SGsGKSTll~-~i 46 (235) T cd03261 14 TVLKGVDLDVRRGEILAIIGPSGSGKSTLLR-LI 46 (235) T ss_pred EEECCEEEEECCCCEEEEECCCCCHHHHHHH-HH T ss_conf 8882606488799899999999972999999-99 No 212 >cd03300 ABC_PotA_N PotA is an ABC-type transporter and the ATPase component of the spermidine/putrescine-preferential uptake system consisting of PotA, -B, -C, and -D. PotA has two domains with the N-terminal domain containing the ATPase activity and the residues required for homodimerization with PotA and heterdimerization with PotB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=91.67 E-value=0.12 Score=31.09 Aligned_cols=33 Identities=24% Similarity=0.404 Sum_probs=25.2 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ..++| .=+.+-+|....|+|++|+||||++ .+| T Consensus 13 ~~vl~~vsl~v~~Ge~~~iiGpSGsGKSTll-r~i 46 (232) T cd03300 13 FVALDGVSLDIKEGEFFTLLGPSGCGKTTLL-RLI 46 (232) T ss_pred EEEECCCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 8898761748879989999999998399999-999 No 213 >PRK10790 putative multidrug transporter membrane\ATP-binding components; Provisional Probab=91.65 E-value=0.13 Score=30.90 Aligned_cols=23 Identities=13% Similarity=0.375 Sum_probs=11.7 Q ss_pred CCCCCCCEEEEECCCCCCCHHHH Q ss_conf 11156833655247788611589 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +.|-+|++.+|.|.+|+|||||+ T Consensus 362 ~~I~~Ge~vaIVG~SGsGKSTL~ 384 (593) T PRK10790 362 LSVPSRNFVALVGHTGSGKSTLA 384 (593) T ss_pred CCCCCCCEEEEECCCCCCHHHHH T ss_conf 04489978998799988689999 No 214 >TIGR02769 nickel_nikE nickel import ATP-binding protein NikE; InterPro: IPR014137 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family represents the NikE subunit of a multisubunit nickel import ABC transporter complex. Nickel, once imported, may be used in urease and in certain classes of hydrogenase and superoxide dismutase.. Probab=91.64 E-value=0.1 Score=31.60 Aligned_cols=97 Identities=16% Similarity=0.258 Sum_probs=52.9 Q ss_pred CCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 66001444160233101111568336552477886115899999998863024667876069999516776789999985 Q gi|254780617|r 140 QSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKA 219 (509) Q Consensus 140 ~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~ 219 (509) +.-..+..++|. +.|..|-.+++.|-||+|||||+- .++ +-+++.- +-|-=+|..+.++-. T Consensus 21 ~~~~~~VL~~vs-----L~l~~Ge~~gLLG~SG~GKSTLAr-lLl--------GLe~P~~----G~v~F~G~dl~~L~~- 81 (267) T TIGR02769 21 KKQRKAVLDNVS-----LSLEEGETVGLLGRSGCGKSTLAR-LLL--------GLEKPAQ----GEVSFRGQDLSKLDR- 81 (267) T ss_pred HHHHHHHCCCCC-----EEECCCCEEEECCCCCCCHHHHHH-HHH--------HCCCCCC----CEEEECCCCHHHCCH- T ss_conf 454431036743-----231377505523678873778999-987--------5078887----404762564555081- Q ss_pred HCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 114674211589972887888999998876666667542 Q gi|254780617|r 220 LEDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFR 258 (509) Q Consensus 220 l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr 258 (509) +...+..|++=++=...-++..=-.+ +.-.|+|=+| T Consensus 82 -~~~rafrR~vQlvFQD~~savNPr~t--v~~ii~EPLr 117 (267) T TIGR02769 82 -KQRRAFRRDVQLVFQDSPSAVNPRKT--VREIIGEPLR 117 (267) T ss_pred -HHCCEEEEEEEEEEECCCCCCCHHHH--HHHHHHHHHH T ss_conf -10211121367898747431162578--9999862577 No 215 >cd03293 ABC_NrtD_SsuB_transporters NrtD and SsuB are the ATP-binding subunits of the bacterial ABC-type nitrate and sulfonate transport systems, respectively. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=91.63 E-value=0.11 Score=31.25 Aligned_cols=33 Identities=15% Similarity=0.441 Sum_probs=26.4 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ++++| .=+.+-+|+..+|+|++|+|||||+ .+| T Consensus 17 ~~al~~vsl~i~~Ge~~~iiGpsGsGKSTLl-~~i 50 (220) T cd03293 17 VTALEDISLSVEEGEFVALVGPSGCGKSTLL-RII 50 (220) T ss_pred EEEEECEEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 9999671889879989999999995799999-999 No 216 >KOG0060 consensus Probab=91.61 E-value=0.11 Score=31.40 Aligned_cols=25 Identities=24% Similarity=0.347 Sum_probs=14.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 8998999999999999999669998874 Q gi|254780617|r 409 DLDSSTQKFLSKGERLTELLKQPQFSPL 436 (509) Q Consensus 409 dlD~~t~~~l~rg~ri~e~LkQ~~~~p~ 436 (509) -|++..++.|..++ -|+.||+++-. T Consensus 570 vLS~GEqQRLa~AR---Lfy~kPk~AiL 594 (659) T KOG0060 570 VLSPGEQQRLAFAR---LFYHKPKFAIL 594 (659) T ss_pred HCCHHHHHHHHHHH---HHHCCCCEEEE T ss_conf 36988888999999---98608836876 No 217 >TIGR03420 DnaA_homol_Hda DnaA regulatory inactivator Hda. Members of this protein family are Hda (Homologous to DnaA). These proteins are about half the length of DnaA and homologous over length of Hda. In the model species Escherichia coli, the initiation of DNA replication requires DnaA bound to ATP rather than ADP; Hda helps facilitate the conversion of DnaA-ATP to DnaA-ADP. Probab=91.60 E-value=0.41 Score=27.12 Aligned_cols=47 Identities=11% Similarity=0.235 Sum_probs=27.1 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHH Q ss_conf 6833655247788611589999999886302466787606999951677678999 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVAR 215 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~ 215 (509) .++..-|+|++|+|||.|+ ..+.|+.... .. -++|....+-.....+ T Consensus 37 ~~~~l~i~G~~GsGKTHLl-~a~~~~~~~~-----~~--~~~yl~~~~~~~~~~~ 83 (226) T TIGR03420 37 GDRFLYLWGESGSGKSHLL-QAACAAAEER-----GK--SAIYLPLAELAQADPE 83 (226) T ss_pred CCCEEEEECCCCCCHHHHH-HHHHHHHHCC-----CC--CEEEECHHHHHHHHHH T ss_conf 8886999899999889999-9999998626-----99--5799529998775399 No 218 >PRK06526 transposase; Provisional Probab=91.58 E-value=0.18 Score=29.67 Aligned_cols=54 Identities=19% Similarity=0.306 Sum_probs=34.7 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 011115683365524778861158999999988630246678760699995167767899999851146 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDR 223 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~ 223 (509) .+-=|-+||.+.++|++|||||-|+. .+.+++.... ..| .|.- +.++++.|... T Consensus 91 ~~~fi~~~~Nvil~G~~GtGKThLA~-Alg~~A~~~G-----~~v--~f~~-------~~~L~~~L~~a 144 (254) T PRK06526 91 TLDFVTGKENVVFLGPPGTGKTHLAI-GLGIRACQAG-----HRV--LFAT-------AAQWVARLAAA 144 (254) T ss_pred CCCHHHCCCCEEEECCCCCCHHHHHH-HHHHHHHHCC-----CCE--EEEE-------HHHHHHHHHHH T ss_conf 37177658878998999986899999-9999999869-----967--9987-------79999999998 No 219 >COG3638 ABC-type phosphate/phosphonate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=91.58 E-value=0.47 Score=26.75 Aligned_cols=62 Identities=18% Similarity=0.258 Sum_probs=37.3 Q ss_pred CHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEE--ECCCCHHHHHHHHHH Q ss_conf 60233101111568336552477886115899999998863024667876069999--516776789999985 Q gi|254780617|r 149 GIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYV--AIGQKRSSVARFVKA 219 (509) Q Consensus 149 GI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~--~IGer~~ev~~~~~~ 219 (509) ++..|+ +.|-+|+.+.|+|.+|.|||||+- .+|. ..+.+..-+-+.. ..+.++.+.+++... T Consensus 19 aL~~Vn--l~I~~GE~VaiIG~SGaGKSTLLR--~lng-----l~d~t~G~i~~~g~~i~~~~~k~lr~~r~~ 82 (258) T COG3638 19 ALKDVN--LEINQGEMVAIIGPSGAGKSTLLR--SLNG-----LVDPTSGEILFNGVQITKLKGKELRKLRRD 82 (258) T ss_pred EEEEEE--EEECCCCEEEEECCCCCCHHHHHH--HHHC-----CCCCCCCEEEECCCCHHCCCHHHHHHHHHH T ss_conf 663576--775798689998788886899999--9866-----658886159965622102545789999974 No 220 >TIGR02746 TraC-F-type type-IV secretion system protein TraC; InterPro: IPR014117 The proteins in this entry are found in the F, P and I-like type IV secretion systems. Gene symbols include TraC (F-type), TrbE/VirB4 (P-type) and TraU (I-type). The proteins contain the Walker A and B motifs and so are putative nucleotide triphosphatases.. Probab=91.57 E-value=0.32 Score=27.90 Aligned_cols=61 Identities=16% Similarity=0.229 Sum_probs=41.2 Q ss_pred HHHHHCCCCCC---CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHC Q ss_conf 02331011115---6833655247788611589999999886302466787606999951677678999998511 Q gi|254780617|r 150 IKAIDSLIPIG---RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALE 221 (509) Q Consensus 150 I~~ID~l~pig---rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~ 221 (509) +.++|.|.-=. .++.+.|+|.+|.|||.++.++|.++.+.+.. ||+. .-|+....+-+.|. T Consensus 474 l~~~d~F~~~~~~i~NYN~~~~g~sG~GKSf~~Q~~~~~~L~~G~k---------v~vi--DvG~Sy~KLC~~lg 537 (900) T TIGR02746 474 LSAFDPFDSDLLNITNYNIAVVGGSGAGKSFFMQELIVSVLSRGGK---------VWVI--DVGRSYKKLCEMLG 537 (900) T ss_pred EEEEEECCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCE---------EEEE--EECCCHHHHHHHHC T ss_conf 8778303378667677148997288987168798999998517984---------8999--60788488876518 No 221 >cd03244 ABCC_MRP_domain2 Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate. Probab=91.56 E-value=0.12 Score=30.92 Aligned_cols=29 Identities=24% Similarity=0.432 Sum_probs=24.1 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+-+|++.+|+|.+|+|||||+ .+|. T Consensus 23 isl~i~~Ge~v~ivG~sGsGKSTLl-~ll~ 51 (221) T cd03244 23 ISFSIKPGEKVGIVGRTGSGKSSLL-LALF 51 (221) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 4899869989999999999899999-9996 No 222 >pfam07693 KAP_NTPase KAP family P-loop domain. The KAP (after Kidins220/ARMS and PifA) family of predicted NTPases are sporadically distributed across a wide phylogenetic range in bacteria and in animals. Many of the prokaryotic KAP NTPases are encoded in plasmids and tend to undergo disruption to form pseudogenes. A unique feature of all eukaryotic and certain bacterial KAP NTPases is the presence of two or four transmembrane helices inserted into the P-loop NTPase domain. These transmembrane helices anchor KAP NTPases in the membrane such that the P-loop domain is located on the intracellular side. Probab=91.56 E-value=0.79 Score=25.09 Aligned_cols=26 Identities=23% Similarity=0.289 Sum_probs=20.2 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 833655247788611589999999886 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKS 188 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~ 188 (509) +=-+||.|+=|+|||+++ .++.++-. T Consensus 20 ~~vIgl~G~WGsGKTs~l-~~~~~~L~ 45 (301) T pfam07693 20 GFVIGLYGAWGSGKTSFL-NLLEDELK 45 (301) T ss_pred CEEEEEECCCCCCHHHHH-HHHHHHHH T ss_conf 979999898999999999-99999986 No 223 >PRK08084 DNA replication initiation factor; Provisional Probab=91.54 E-value=0.3 Score=28.19 Aligned_cols=51 Identities=14% Similarity=0.230 Sum_probs=30.2 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 68336552477886115899999998863024667876069999516776789999985 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKA 219 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~ 219 (509) .++-.-|.|++|+|||.|+ .++.|+.... .. .++|....+...-..++.+. T Consensus 44 ~~~~l~l~G~~G~GKTHLL-qA~~~~~~~~-----~~--~~~yl~~~~~~~~~~~~l~~ 94 (235) T PRK08084 44 HSGYIYLWGREGAGRSHLL-HAACAELSQR-----GD--AVGYVPLDKRAWFVPEVLEG 94 (235) T ss_pred CCCEEEEECCCCCCHHHHH-HHHHHHHHCC-----CC--CEEEEEHHHHHHHHHHHHHH T ss_conf 9876999899998889999-9999999707-----98--57998779866517999987 No 224 >COG2274 SunT ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms] Probab=91.51 E-value=0.13 Score=30.72 Aligned_cols=41 Identities=27% Similarity=0.287 Sum_probs=26.7 Q ss_pred HHHHHCCEEEEECCHHHHCCCCC---------CCCCCCHHHCCCCCCCHH Q ss_conf 88540616996150122102665---------433444122034343898 Q gi|254780617|r 342 VISITDGQIFLETELFYQGIRPA---------INIGLSVSRVGSAAQVKA 382 (509) Q Consensus 342 ~~si~DG~i~l~~~l~~~g~~Pa---------id~~~S~SRv~~~~~~~~ 382 (509) .....+|+|-++.=-|.-+..++ |.++..+-=||.+...|. T Consensus 464 ~~~~~~g~I~~~nvsf~y~~~~~~vL~~isL~I~~Ge~vaIvG~SGsGKS 513 (709) T COG2274 464 HLPKLQGEIEFENVSFRYGPDDPPVLEDLSLEIPPGEKVAIVGRSGSGKS 513 (709) T ss_pred CCCCCCCEEEEEEEEEEECCCCCCHHHCEEEEECCCCEEEEECCCCCCHH T ss_conf 56665762899878998179984412150277679988999879999889 No 225 >PRK11701 phnK phosphonates transport ATP-binding protein; Provisional Probab=91.50 E-value=0.13 Score=30.77 Aligned_cols=35 Identities=20% Similarity=0.299 Sum_probs=27.2 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 602331-011115683365524778861158999999 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.+++| .=+.+.+|+..+|+|.+|+|||||+ .+|. T Consensus 18 ~~~aL~~Vs~~v~~GEi~~iiG~nGaGKSTLl-~~i~ 53 (258) T PRK11701 18 PRKGCRDVSFDLYPGEVLGIVGESGSGKTTLL-NALS 53 (258) T ss_pred CEEEECCEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 97887122778879979999888998899999-9985 No 226 >TIGR03375 type_I_sec_LssB type I secretion system ATPase, LssB family. Type I protein secretion is a system in some Gram-negative bacteria to export proteins (often proteases) across both inner and outer membranes to the extracellular medium. This is one of three proteins of the type I secretion apparatus. Targeted proteins are not cleaved at the N-terminus, but rather carry signals located toward the extreme C-terminus to direct type I secretion. This model is related to models TIGR01842 and TIGR01846, and to bacteriocin ABC transporters that cleave their substrates during export. Probab=91.44 E-value=0.13 Score=30.87 Aligned_cols=22 Identities=9% Similarity=0.303 Sum_probs=14.0 Q ss_pred HCCCCCHHHHHHHHHHHHHHHH Q ss_conf 3279898999999999999985 Q gi|254780617|r 485 KQKVLTDDIRSKLINEIKVFLE 506 (509) Q Consensus 485 ~~~~l~de~~~~L~~~i~~~~~ 506 (509) .+-.|+.+.|+++.+.++++++ T Consensus 628 ~ts~LD~~~e~~i~~~l~~~~~ 649 (694) T TIGR03375 628 PTSAMDNRSEERFKDRLKRWLA 649 (694) T ss_pred CCCCCCHHHHHHHHHHHHHHCC T ss_conf 7568899999999999998669 No 227 >cd03226 ABC_cobalt_CbiO_domain2 Domain II of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. The CbiMNQO family ABC transport system is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems. Probab=91.36 E-value=0.14 Score=30.63 Aligned_cols=34 Identities=18% Similarity=0.305 Sum_probs=26.2 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 02331-011115683365524778861158999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) -++++ .=+.+-+|+..+|+|++|+|||||+ .+|. T Consensus 13 ~~vL~~vsl~i~~Gei~~iiG~nGaGKSTLl-~~i~ 47 (205) T cd03226 13 TEILDDLSLDLYAGEIIALTGKNGAGKTTLA-KILA 47 (205) T ss_pred CEEEEEEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 9786403788869989999889999899999-9995 No 228 >PRK10575 iron-hydroxamate transporter ATP-binding subunit; Provisional Probab=91.34 E-value=0.13 Score=30.71 Aligned_cols=31 Identities=16% Similarity=0.387 Sum_probs=24.4 Q ss_pred HHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 331-01111568336552477886115899999 Q gi|254780617|r 152 AID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 152 ~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ++| .=+.+.+|+..+|+|+.|+|||||+ .+| T Consensus 26 vL~~vsl~i~~Ge~~~liG~NGaGKSTLl-~~l 57 (265) T PRK10575 26 LLHPLSLTFPAGKVTGLIGHNGSGKSTLL-KML 57 (265) T ss_pred EEECEEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 88150889879989999999998099999-999 No 229 >PRK08118 topology modulation protein; Reviewed Probab=91.33 E-value=0.12 Score=31.12 Aligned_cols=99 Identities=10% Similarity=0.142 Sum_probs=45.0 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHH Q ss_conf 33655247788611589999999886302466787606999951677678999998511467421158997288788899 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPM 242 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~ 242 (509) +|+.|+|.+|+||||++- .+-...+ .-...-|. +|=.=|-..+.-.+|.+.+.+.-. +..=|+-..-. ..+ T Consensus 2 ~rI~IiG~~GsGKSTlAr-~L~~~~~--ip~~~LD~---l~w~~~w~~~~~~e~~~~~~~~~~-~~~WIidGny~--~~~ 72 (167) T PRK08118 2 KKIILIGSGGSGKSTLAR-QLGEKLN--IPVHHLDA---LFWKPNWEGVPKEEQRTVQNELVK-EDEWIIDGNYG--GTM 72 (167) T ss_pred CEEEEECCCCCCHHHHHH-HHHHHHC--CCEEECCC---EEECCCCCCCCHHHHHHHHHHHHH-CCCEEEECCCH--HHH T ss_conf 679998899987999999-9999889--69796443---476689946888999999999983-89879947717--799 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf 999887666666754204688479961647899999875453 Q gi|254780617|r 243 QLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLL 284 (509) Q Consensus 243 r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~ 284 (509) ..+- .--|.+ ++=|+.++...+|-+.-. T Consensus 73 ~~r~-------------~~aD~i-I~Ld~p~~~~~~r~~kR~ 100 (167) T PRK08118 73 DIRL-------------NAADTI-IFLDIPRTICLYRAFKRR 100 (167) T ss_pred HHHH-------------HHCCEE-EEECCCHHHHHHHHHHHH T ss_conf 8779-------------769999-998598999999999999 No 230 >PRK08181 transposase; Validated Probab=91.33 E-value=0.21 Score=29.23 Aligned_cols=50 Identities=16% Similarity=0.276 Sum_probs=33.6 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 15683365524778861158999999988630246678760699995167767899999851146 Q gi|254780617|r 159 IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDR 223 (509) Q Consensus 159 igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~ 223 (509) |-++|.+.++|++|||||-|+. .+-+++.... ..| .|.- +.++++.|... T Consensus 103 i~~~~Nvil~Gp~GtGKThLA~-Alg~~A~~~G-----~~V--~f~~-------~~~L~~~L~~a 152 (269) T PRK08181 103 LAKGANLLLFGPPGGGKSHLAA-AIGLALIENG-----WRV--LFTR-------TTDLVQKLQVA 152 (269) T ss_pred HHCCCEEEEECCCCCCHHHHHH-HHHHHHHHCC-----CEE--EEEE-------HHHHHHHHHHH T ss_conf 8648708998999987889999-9999999879-----939--9978-------99999999997 No 231 >PRK11248 tauB taurine transporter ATP-binding subunit; Provisional Probab=91.33 E-value=0.13 Score=30.86 Aligned_cols=33 Identities=18% Similarity=0.420 Sum_probs=25.7 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ..++| .=+.+.+|.-.+|+|++|+|||||+ .+| T Consensus 14 ~~~L~dvsl~i~~Ge~~~lvGpnGaGKSTLl-~~i 47 (255) T PRK11248 14 KPALEDINLTLESGELLVVLGPSGCGKTTLL-NLI 47 (255) T ss_pred EEEEECEEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 9888131779869989999999984699999-999 No 232 >PRK10771 thiQ thiamine transporter ATP-binding subunit; Provisional Probab=91.31 E-value=0.13 Score=30.68 Aligned_cols=29 Identities=21% Similarity=0.502 Sum_probs=24.1 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.|-+|+..+|+|++|+|||||+ .+|. T Consensus 18 isl~i~~GE~v~iiG~nGaGKSTLl-~~i~ 46 (233) T PRK10771 18 FTLTVERGEQVAILGPSGAGKSTLL-NLIA 46 (233) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 7889889989999999998199999-9996 No 233 >TIGR03598 GTPase_YsxC ribosome biogenesis GTP-binding protein YsxC/EngB. Members of this protein family are a GTPase associated with ribosome biogenesis, typified by YsxC from Bacillus subutilis. The family is widely but not universally distributed among bacteria. Members commonly are called EngB based on homology to EngA, one of several other GTPases of ribosome biogenesis. Cutoffs as set find essentially all bacterial members, but also identify large numbers of eukaryotic (probably organellar) sequences. This protein is found in about 80 percent of bacterial genomes. Probab=91.30 E-value=1.1 Score=23.91 Aligned_cols=30 Identities=27% Similarity=0.542 Sum_probs=24.0 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 1115683365524778861158999999988 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) +|...+=++.|+|-+.+||||| +..+++++ T Consensus 13 ~p~~~~p~IaivGrpNvGKSTL-~N~L~g~k 42 (179) T TIGR03598 13 LPPDDGPEIAFAGRSNVGKSSL-INALTNRK 42 (179) T ss_pred CCCCCCCEEEEECCCCCCHHHH-HHHHHCCC T ss_conf 9998897899986999888999-99986898 No 234 >TIGR02211 LolD_lipo_ex lipoprotein releasing system, ATP-binding protein; InterPro: IPR011924 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents LolD, a member of the ABC transporter family. LolD is involved in localization of lipoproteins in some bacteria. It works with a transmembrane protein LolC, which in some species is a paralogous pair LolC and LolE. Depending on the residue immediately following the modified N-terminal Cys residue, the nascent lipoprotein may be carried further by LolA and LolB to the outer membrane, or remain at the inner membrane. Excluded from this entry are homologs from the archaeal genus Methanosarcina .; GO: 0005524 ATP binding, 0006810 transport, 0016020 membrane. Probab=91.29 E-value=0.12 Score=30.92 Aligned_cols=38 Identities=24% Similarity=0.497 Sum_probs=27.7 Q ss_pred CHHHHHCCHHHHHCC----CCCCCCCEEEEECCCCCCCHHHH Q ss_conf 001444160233101----11156833655247788611589 Q gi|254780617|r 142 VCEPLSTGIKAIDSL----IPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 142 i~~~l~TGI~~ID~l----~pigrGQR~~I~g~~g~GKt~l~ 179 (509) ++....+|=..++.| +-|.+|--+.|+|.||.|||||+ T Consensus 7 l~K~Y~~G~~~~~vL~g~sl~i~~GE~~~IvG~SGSGKSTLL 48 (221) T TIGR02211 7 LGKRYQEGKLQTRVLKGVSLSIGKGEIVAIVGSSGSGKSTLL 48 (221) T ss_pred CCCEEEECCEEEEEECCCCCEEECCCEEEEECCCCCCHHHHH T ss_conf 531354086236652274512306633798736787168999 No 235 >cd03252 ABCC_Hemolysin The ABC-transporter hemolysin B is a central component of the secretion machinery that translocates the toxin, hemolysin A, in a Sec-independent fashion across both membranes of E. coli. The hemolysin A (HlyA) transport machinery is composed of the ATP-binding cassette (ABC) transporter HlyB located in the inner membrane, hemolysin D (HlyD), also anchored in the inner membrane, and TolC, which resides in the outer membrane. HlyD apparently forms a continuous channel that bridges the entire periplasm, interacting with TolC and HlyB. This arrangement prevents the appearance of periplasmic intermediates of HlyA during substrate transport. Little is known about the molecular details of HlyA transport, but it is evident that ATP-hydrolysis by the ABC-transporter HlyB is a necessary source of energy. Probab=91.27 E-value=0.13 Score=30.73 Aligned_cols=28 Identities=21% Similarity=0.347 Sum_probs=22.8 Q ss_pred CCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 11115683365524778861158999999 Q gi|254780617|r 156 LIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 156 l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) =+.|-+|++.+|+|.+|+|||||+ ..+. T Consensus 22 s~~i~~G~~vaivG~sGsGKSTll-~ll~ 49 (237) T cd03252 22 SLRIKPGEVVGIVGRSGSGKSTLT-KLIQ 49 (237) T ss_pred EEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 899879999999999998599999-9996 No 236 >cd01865 Rab3 Rab3 subfamily. The Rab3 subfamily contains Rab3A, Rab3B, Rab3C, and Rab3D. All four isoforms were found in mouse brain and endocrine tissues, with varying levels of expression. Rab3A, Rab3B, and Rab3C localized to synaptic and secretory vesicles; Rab3D was expressed at high levels only in adipose tissue, exocrine glands, and the endocrine pituitary, where it is localized to cytoplasmic secretory granules. Rab3 appears to control Ca2+-regulated exocytosis. The appropriate GDP/GTP exchange cycle of Rab3A is required for Ca2+-regulated exocytosis to occur, and interaction of the GTP-bound form of Rab3A with effector molecule(s) is widely believed to be essential for this process. Functionally, most studies point toward a role for Rab3 in the secretion of hormones and neurotransmitters. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promot Probab=91.24 E-value=0.4 Score=27.23 Aligned_cols=18 Identities=44% Similarity=0.752 Sum_probs=15.7 Q ss_pred EEEEECCCCCCCHHHHHH Q ss_conf 365524778861158999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILD 181 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~ 181 (509) |+.++|++|||||+|+.- T Consensus 3 KivllGd~~VGKTsli~r 20 (165) T cd01865 3 KLLIIGNSSVGKTSFLFR 20 (165) T ss_pred EEEEECCCCCCHHHHHHH T ss_conf 999999999688999999 No 237 >PRK09183 transposase/IS protein; Provisional Probab=91.23 E-value=0.21 Score=29.21 Aligned_cols=51 Identities=20% Similarity=0.278 Sum_probs=33.5 Q ss_pred CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 115683365524778861158999999988630246678760699995167767899999851146 Q gi|254780617|r 158 PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDR 223 (509) Q Consensus 158 pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~ 223 (509) =|-+++.+.++|++|||||-|+. .+-+++.... ..| .|.- +.++++.|... T Consensus 97 fi~~~~Nvil~G~~GtGKThLA~-Alg~~A~~~G-----~~v--~f~~-------~~~L~~~L~~a 147 (258) T PRK09183 97 FIERNENIVLLGPSGVGKTHLAI-ALGYEAVRAG-----IKV--RFTT-------AADLLLQLSTA 147 (258) T ss_pred HHHCCCCEEEECCCCCCHHHHHH-HHHHHHHHCC-----CEE--EEEE-------HHHHHHHHHHH T ss_conf 66558867998999986899999-9999999879-----939--9978-------99999999999 No 238 >PRK11264 putative amino-acid ABC transporter ATP-binding protein YecC; Provisional Probab=91.21 E-value=0.14 Score=30.53 Aligned_cols=27 Identities=26% Similarity=0.449 Sum_probs=22.8 Q ss_pred CCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 1111568336552477886115899999 Q gi|254780617|r 156 LIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 156 l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) =+.+.+|...+|+|++|+|||||+ .+| T Consensus 21 sl~i~~Ge~~aliG~sGsGKSTLl-~~l 47 (248) T PRK11264 21 DLEVKPGEVVAIIGPSGSGKTTLL-RCI 47 (248) T ss_pred EEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 779879989999999998099999-999 No 239 >cd03255 ABC_MJ0796_Lo1CDE_FtsE This family is comprised of MJ0796 ATP-binding cassette, macrolide-specific ABC-type efflux carrier (MacAB), and proteins involved in cell division (FtsE), and release of liporoteins from the cytoplasmic membrane (LolCDE). They are clustered together phylogenetically. MacAB is an exporter that confers resistance to macrolides, while the LolCDE system is not a transporter at all. An FtsE null mutants showed filamentous growth and appeared viable on high salt medium only, indicating a role for FtsE in cell division and/or salt transport. The LolCDE complex catalyses the release of lipoproteins from the cytoplasmic membrane prior to their targeting to the outer membrane. Probab=91.18 E-value=0.14 Score=30.65 Aligned_cols=29 Identities=21% Similarity=0.439 Sum_probs=24.2 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 101111568336552477886115899999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |.=+.+.+|+..+|+|++|+||||++- +| T Consensus 22 ~isl~i~~Ge~~~iiG~sGsGKTTll~-~i 50 (218) T cd03255 22 GVSLSIEKGEFVAIVGPSGSGKSTLLN-IL 50 (218) T ss_pred CEEEEECCCCEEEEECCCCCHHHHHHH-HH T ss_conf 628998699899999999986999999-99 No 240 >cd03258 ABC_MetN_methionine_transporter MetN (also known as YusC) is an ABC-type transporter encoded by metN of the metNPQ operon in Bacillus subtilis that is involved in methionine transport. Other members of this system include the MetP permease and the MetQ substrate binding protein. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=91.18 E-value=0.13 Score=30.90 Aligned_cols=31 Identities=26% Similarity=0.450 Sum_probs=25.0 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 02331-01111568336552477886115899 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) +.++| .=+.|-+|...+|+|.+|+|||||+- T Consensus 18 ~~al~~vsl~i~~Ge~~~ivG~SGsGKSTllr 49 (233) T cd03258 18 VTALKDVSLSVPKGEIFGIIGRSGAGKSTLIR 49 (233) T ss_pred EEEEECCEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 99984828899999999998898058999999 No 241 >PRK13646 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=91.15 E-value=0.14 Score=30.47 Aligned_cols=30 Identities=27% Similarity=0.447 Sum_probs=24.6 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 2331-01111568336552477886115899 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) +|++ .=+.|-+|+..+|+|.+|+|||||+. T Consensus 21 ~aL~~vsl~I~~Ge~~aiiG~nGsGKSTLl~ 51 (286) T PRK13646 21 QAIHDVNTEFEQGKYYAIVGQTGSGKSTLIQ 51 (286) T ss_pred CEEECEEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 0662417798699899999999981999999 No 242 >PRK10070 glycine betaine transporter ATP-binding subunit; Provisional Probab=91.15 E-value=0.14 Score=30.64 Aligned_cols=35 Identities=26% Similarity=0.488 Sum_probs=28.8 Q ss_pred CC--HHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 16--0233101111568336552477886115899999 Q gi|254780617|r 148 TG--IKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 148 TG--I~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) || +.+=|.=+.|.+|.-..|+|.+|+|||||+ -+| T Consensus 38 tg~~vAv~dVsl~I~~GEi~~ivG~SGsGKSTLl-r~i 74 (400) T PRK10070 38 TGLSLGVKDASLAIEEGEIFVIMGLSGSGKSTMV-RLL 74 (400) T ss_pred HCCEEEEECCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 4999989740768879999999999984699999-999 No 243 >TIGR01846 type_I_sec_HlyB type I secretion system ATPase; InterPro: IPR010132 Type I protein secretion is a system in some Gram-negative bacteria to export proteins (often proteases) across both inner and outer membranes to the extracellular medium. This entry contains one of three proteins of the type I secretion apparatus. Targeted proteins are not cleaved at the N terminus, but rather carry signals located toward the extreme C terminus to direct type I secretion.; GO: 0005524 ATP binding, 0008565 protein transporter activity, 0015031 protein transport, 0016021 integral to membrane. Probab=91.12 E-value=0.15 Score=30.39 Aligned_cols=27 Identities=22% Similarity=0.340 Sum_probs=22.5 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 101111568336552477886115899 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) |-=+-|-.|+-+||.|+||+|||||.. T Consensus 483 nl~L~I~~Ge~IGIvGpSGSGKSTLTK 509 (703) T TIGR01846 483 NLSLDIKPGEVIGIVGPSGSGKSTLTK 509 (703) T ss_pred CCCCCCCCCEEEEEECCCCCCHHHHHH T ss_conf 268765786579987278986789999 No 244 >PRK13543 cytochrome c biogenesis protein CcmA; Provisional Probab=91.10 E-value=0.15 Score=30.39 Aligned_cols=33 Identities=21% Similarity=0.371 Sum_probs=25.1 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) -.+++ .=+.+-.|+..+|+|+.|+|||||+ .+| T Consensus 24 ~~vl~~isf~v~~Ge~~~l~GpNGaGKTTLl-r~l 57 (214) T PRK13543 24 EPVFGPLDFHVDAGEALLVQGDNGAGKTTLL-RVL 57 (214) T ss_pred EEEEECEEEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 9998263889818989999999998799999-999 No 245 >TIGR03265 PhnT2 putative 2-aminoethylphosphonate ABC transport system, ATP-binding protein component. This ABC transporter ATP-binding protein is found in a number of genomes in operon-like contexts strongly suggesting a substrate specificity for 2-aminoethylphosphonate (2-AEP). The characterized PhnSTUV system is absent in the genomes in which this system is found. These genomes encode systems for the catabolism of 2-AEP, making the need for a 2-AEP-specific transporter likely. Probab=91.05 E-value=0.14 Score=30.54 Aligned_cols=34 Identities=18% Similarity=0.415 Sum_probs=24.7 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 602331-01111568336552477886115899999 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) +..++| .=+.+.+|+-..|+|++|+||||++ .+| T Consensus 16 ~~~al~~v~l~v~~Ge~~~llGpSG~GKtTlL-r~i 50 (353) T TIGR03265 16 AFTALKDISLSVKKGEFVCLLGPSGCGKTTLL-RII 50 (353) T ss_pred CEEEECCCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 99888664869989999999999953599999-999 No 246 >PRK10938 putative molybdenum transport ATP-binding protein ModF; Provisional Probab=91.05 E-value=0.14 Score=30.58 Aligned_cols=23 Identities=17% Similarity=0.488 Sum_probs=14.7 Q ss_pred CCCCCCCEEEEECCCCCCCHHHH Q ss_conf 11156833655247788611589 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +.|.+|++.+|+|..|+|||||+ T Consensus 24 l~i~~Ge~~~LvG~NGaGKSTL~ 46 (490) T PRK10938 24 LTLNAGDSWAFVGSNGSGKSALA 46 (490) T ss_pred EEECCCCEEEEECCCCCHHHHHH T ss_conf 89989989999979997799999 No 247 >cd03256 ABC_PhnC_transporter ABC-type phosphate/phosphonate transport system. Phosphonates are a class of organophosphorus compounds characterized by a chemically stable carbon-to-phosphorus (C-P) bond. Phosphonates are widespread among naturally occurring compounds in all kingdoms of wildlife, but only procaryotic microorganisms are able to cleave this bond. Certain bacteria such as E. coli can use alkylphosphonates as a phosphorus source. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=91.02 E-value=0.15 Score=30.41 Aligned_cols=35 Identities=26% Similarity=0.488 Sum_probs=26.8 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 1602331-01111568336552477886115899999 Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .|..++| .=+.|-+|+-.+|+|++|+|||||+ -+| T Consensus 12 ~g~~~L~~isl~i~~Ge~~~iiGpsGsGKSTLl-~~i 47 (241) T cd03256 12 NGKKALKDVSLSINPGEFVALIGPSGAGKSTLL-RCL 47 (241) T ss_pred CCCEEEECCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 798899783889999989999999983399999-999 No 248 >TIGR02203 MsbA_lipidA lipid A export permease/ATP-binding protein MsbA; InterPro: IPR011917 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family consists of a single polypeptide chain transporter in the ATP-binding cassette (ABC) transporter family, MsbA, which exports lipid A. It may also act in multidrug resistance. Lipid A, a part of lipopolysaccharide, is found in the outer leaflet of the outer membrane of most Gram-negative bacteria. Members of this family are restricted to the Proteobacteria (although lipid A is more broadly distributed) and often are clustered with lipid A biosynthesis genes .; GO: 0005524 ATP binding, 0006869 lipid transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=90.98 E-value=0.18 Score=29.80 Aligned_cols=32 Identities=22% Similarity=0.525 Sum_probs=17.2 Q ss_pred CCHHHHHCC-CCCCCCCEEEEECCCCCCCHHHH Q ss_conf 160233101-11156833655247788611589 Q gi|254780617|r 148 TGIKAIDSL-IPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 148 TGI~~ID~l-~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +|++++|.. +-+--||.+.+.|-||.|||||+ T Consensus 373 ~~~~aL~~i~l~~~~G~~vALVGRSGSGKsTlv 405 (603) T TIGR02203 373 RDRPALDSISLVVEPGETVALVGRSGSGKSTLV 405 (603) T ss_pred CCHHHHCCCCCEECCCCEEEEECCCCCHHHHHH T ss_conf 872412366651158735998706885389998 No 249 >cd03231 ABC_CcmA_heme_exporter CcmA, the ATP-binding component of the bacterial CcmAB transporter. The CCM family is involved in bacterial cytochrome c biogenesis. Cytochrome c maturation in E. coli requires the ccm operon, which encodes eight membrane proteins (CcmABCDEFGH). CcmE is a periplasmic heme chaperone that binds heme covalently and transfers it onto apocytochrome c in the presence of CcmF, CcmG, and CcmH. The CcmAB proteins represent an ABC transporter and the CcmCD proteins participate in heme transfer to CcmE. Probab=90.96 E-value=0.19 Score=29.56 Aligned_cols=31 Identities=16% Similarity=0.325 Sum_probs=24.9 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 01111568336552477886115899999998 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) .=+.+.+|+..+|+|+.|+|||||+ .+|.+. T Consensus 19 vs~~i~~Ge~~~l~G~NGsGKSTLl-k~i~Gl 49 (201) T cd03231 19 LSFTLAAGEALQVTGPNGSGKTTLL-RILAGL 49 (201) T ss_pred CEEEECCCCEEEEECCCCCCHHHHH-HHHHCC T ss_conf 0788879959999999999999999-999667 No 250 >PRK04195 replication factor C large subunit; Provisional Probab=90.95 E-value=0.77 Score=25.16 Aligned_cols=110 Identities=16% Similarity=0.179 Sum_probs=51.3 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHH-HHHHHHHHCCCCCC---CEEEEEEECCCCCH Q ss_conf 655247788611589999999886302466787606999951677678-99999851146742---11589972887888 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSS-VARFVKALEDRGAL---SYSIVVVASASDPA 240 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~e-v~~~~~~l~~~~~~---~~tvvv~a~a~~~~ 240 (509) .++.|++|+||||++ ..+.++.+.+ ++= +=+-.+|..+ +.+++......+.+ ++.+++.--.|.-. T Consensus 43 lLL~GPpGvGKTT~a-~~lAk~~g~~--------viE-lNASD~R~~~~I~~~i~~~~~~~sl~~~~~KlIIlDEvD~l~ 112 (403) T PRK04195 43 LLLYGPPGVGKTSLA-HALANDYGWE--------VIE-LNASDQRTKDVIERVAGEASTSGSLFGAKRKLILLDEVDGIH 112 (403) T ss_pred EEEECCCCCCHHHHH-HHHHHHHCCC--------EEE-ECCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCC T ss_conf 998893998799999-9999984998--------599-771011478999999998760688778873499963434457 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECH-HHHHHHHHHHHHHCC Q ss_conf 9999988766666675420468847996164-789999987545303 Q gi|254780617|r 241 PMQLLAPFAGCAMGEYFRDNGYHALIAYDDL-QKHAVAYRQLSLLLR 286 (509) Q Consensus 241 ~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddl-tr~A~a~reisl~~~ 286 (509) ..--... .-++-+....--..+.++..|. ...-...|.....+. T Consensus 113 ~~~d~gg--~~al~~~ik~s~~PiIli~Nd~~~~~~~~lrs~c~~i~ 157 (403) T PRK04195 113 GNADRGG--VRAILEIIKKAKNPIILTANDPYDPSLRPLRNACLMIE 157 (403) T ss_pred CHHHHHH--HHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHCC T ss_conf 2444799--99999998548870899826845567177997661221 No 251 >PRK11124 artP arginine transporter ATP-binding subunit; Provisional Probab=90.93 E-value=0.15 Score=30.24 Aligned_cols=27 Identities=22% Similarity=0.462 Sum_probs=23.3 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 101111568336552477886115899 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) |.=+.+-+|+..+|+|.+|+|||||+- T Consensus 20 dvsl~i~~Ge~~~iiG~nGaGKSTLl~ 46 (242) T PRK11124 20 DITLDCPQGETLVLLGPSGAGKSSLLR 46 (242) T ss_pred EEEEEECCCCEEEEECCCCCCHHHHHH T ss_conf 207788799899999999971999999 No 252 >PRK09452 potA putrescine/spermidine ABC transporter ATPase protein; Reviewed Probab=90.92 E-value=0.15 Score=30.38 Aligned_cols=34 Identities=24% Similarity=0.409 Sum_probs=25.5 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 602331-01111568336552477886115899999 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |..++| .=+.|.+|+-..|+|++|+|||||+ .+| T Consensus 29 ~~~aL~~vsl~I~~GE~~~llGpSGsGKSTLl-r~i 63 (378) T PRK09452 29 GKEVISNLDLTINNGEFLTLLGPSGCGKTTVL-RLI 63 (378) T ss_pred CEEEEECCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 99999362779999989999989997699999-999 No 253 >cd03292 ABC_FtsE_transporter FtsE is a hydrophilic nucleotide-binding protein that binds FtsX to form a heterodimeric ATP-binding cassette (ABC)-type transporter that associates with the bacterial inner membrane. The FtsE/X transporter is thought to be involved in cell division and is important for assembly or stability of the septal ring. Probab=90.91 E-value=0.098 Score=31.65 Aligned_cols=37 Identities=24% Similarity=0.494 Sum_probs=29.3 Q ss_pred HHCCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 441602331-01111568336552477886115899999 Q gi|254780617|r 146 LSTGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 146 l~TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) +..|..|+| .=+.|-+|.-..|+|++|+|||||+ .+| T Consensus 10 y~~~~~aL~~vsl~i~~Ge~v~i~GpSGsGKSTLl-~~i 47 (214) T cd03292 10 YPNGTAALDGINISISAGEFVFLVGPSGAGKSTLL-KLI 47 (214) T ss_pred ECCCCEEEEEEEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 79898998221779859989999979995399999-999 No 254 >TIGR02785 addA_Gpos recombination helicase AddA; InterPro: IPR014152 AddAB, also called RexAB, substitutes for RecBCD in several bacterial lineages. These DNA recombination proteins act before synapse and are particularly important for DNA repair of double-stranded breaks by homologous recombination. The term AddAB is used broadly, with AddA homologues between the Firmicutes and the alphaproteobacteria, while the partner AddB proteins show no strong homology across the two groups of species.. Probab=90.91 E-value=0.2 Score=29.43 Aligned_cols=245 Identities=16% Similarity=0.174 Sum_probs=109.2 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHH-HHCCCCCCCEEEEEEECCCC Q ss_conf 56833655247788611589999999886302466787606999951677678999998-51146742115899728878 Q gi|254780617|r 160 GRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVK-ALEDRGALSYSIVVVASASD 238 (509) Q Consensus 160 grGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~-~l~~~~~~~~tvvv~a~a~~ 238 (509) -+||-+|+=|.+|.|||+++++-|+++......--.-+.-. |.------.+|+.+=++ .|.+. +.-..++ T Consensus 12 ~~G~nILVsAsAGSGKTaVLVERii~~i~~gE~pvdiDrLL-VVTFTnaAA~EMK~Ri~~~l~k~--------l~~~~~d 82 (1295) T TIGR02785 12 DRGQNILVSASAGSGKTAVLVERIIKKILRGENPVDIDRLL-VVTFTNAAAREMKERIEEALQKE--------LAQEPDD 82 (1295) T ss_pred HCCCCEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCCCCH-HHHHHHHHHHHHHHHHHHHHHHH--------HCCCCCC T ss_conf 36886476532266614789999999964788770202103-67888999999999999999997--------4368888 Q ss_pred CHHHHHHHHHHHHHHHHHHHCC---------------------CCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 8899999887666666754204---------------------6884799616478999998754530378754333752 Q gi|254780617|r 239 PAPMQLLAPFAGCAMGEYFRDN---------------------GYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGD 297 (509) Q Consensus 239 ~~~~r~~ap~~a~aiAEyfr~~---------------------G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~ 297 (509) .|..+ +|..--+|+|.| ++==||=+|=-+|-+.-..|=.||..|- T Consensus 83 ~P~~~-----ta~~~~~hl~~QL~lL~~A~ISTlhsFCL~Vir~~YYllDlDP~FRil~nd~e~~LL~~E~--------- 148 (1295) T TIGR02785 83 DPERR-----TAVDNSKHLRRQLALLNKANISTLHSFCLKVIRKHYYLLDLDPSFRILTNDTEELLLLKEV--------- 148 (1295) T ss_pred CCHHH-----HCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCEEECCCCCCCCCCHHHHHHHHHHH--------- T ss_conf 71232-----0443789999999644676510457899999766604355378876577567888988757--------- Q ss_pred CHHHHHHHHHHHHHCCCCCCCCCEEEEEE-EECCCC-CCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHH-- Q ss_conf 00124678888750454458873567531-003676-64431146688540616996150122102665433444122-- Q gi|254780617|r 298 VFYLHSRLLERAAKMSDALGAGSLTALPV-IETQVN-DVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSR-- 373 (509) Q Consensus 298 vf~~~s~l~ERa~~~~~~~g~GSiT~~~~-v~~~~~-D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SR-- 373 (509) ++=-|-|-..+ .+ =++|.- +.-=++ |=+|-=...++. +=-=|.+-+=-|=.|+.++-. T Consensus 149 ---~~d~~E~~y~~-~d------~~~F~~L~~~f~~rDR~D~~L~~li~--------~lY~F~~a~P~Pe~WL~~l~~~Y 210 (1295) T TIGR02785 149 ---VDDVLEEEYYK-ED------NEAFFELVDNFSGRDRSDDGLRDLIL--------KLYDFARATPNPEKWLNKLAEAY 210 (1295) T ss_pred ---HHHHHHHHHHH-CC------CHHHHHHHHHHHCCCCCCHHHHHHHH--------HHHHHHHCCCCHHHHHHHCCCCC T ss_conf ---99999998520-15------15899999865078977012899999--------99999744998678886265542 Q ss_pred -CCCCCCCHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCC--CCCCCCCHH Q ss_conf -0343438988775567-----------88879988767879876338998999999999999999669--998874889 Q gi|254780617|r 374 -VGSAAQVKAMKQVSGA-----------VKGELAQYREMSSFSKFSSDLDSSTQKFLSKGERLTELLKQ--PQFSPLAME 439 (509) Q Consensus 374 -v~~~~~~~~~~~~a~~-----------lr~~laqy~Ele~f~~fgsdlD~~t~~~l~rg~ri~e~LkQ--~~~~p~~~~ 439 (509) |......+ .+..+.. ++..|.+|.|+ +..-.+++++...........+.++.++ .+-+.-.+. T Consensus 211 ~V~~~~~~e-~~~~~~~ik~~~~~~~~~~~~~l~~~~~~--~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 287 (1295) T TIGR02785 211 EVKEDFLIE-SKLLQQQIKDLLKNELEGLQEKLQKALEL--FMAEDGELAPKLENFQLEAQNIKELIEEKLAQEDWNELR 287 (1295) T ss_pred CCCHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 234046788-89999998999999999999999999986--543202663168999999999999999888543699999 Q ss_pred HHHHHHHHH Q ss_conf 999999998 Q gi|254780617|r 440 EQVVMIFAG 448 (509) Q Consensus 440 ~qv~~l~a~ 448 (509) +.+...-.. T Consensus 288 ~~~~~~~~~ 296 (1295) T TIGR02785 288 EAVALVKFE 296 (1295) T ss_pred HHHHHHHHH T ss_conf 998640244 No 255 >cd03222 ABC_RNaseL_inhibitor The ABC ATPase RNase L inhibitor (RLI) is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI's are not transport proteins, and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLI's have an N-terminal Fe-S domain and two nucleotide-binding domains, which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology. Probab=90.90 E-value=0.14 Score=30.62 Aligned_cols=32 Identities=25% Similarity=0.311 Sum_probs=24.6 Q ss_pred HHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 331011115683365524778861158999999 Q gi|254780617|r 152 AIDSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 152 ~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +++-+..+-+|+..+|+|+.|+||||++ .+|. T Consensus 15 ~l~~~~~v~~GEiv~ilGpNGaGKSTll-k~i~ 46 (177) T cd03222 15 LLVELGVVKEGEVIGIVGPNGTGKTTAV-KILA 46 (177) T ss_pred EECCCCCCCCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 8658995589989999899999999999-9996 No 256 >PRK07773 replicative DNA helicase; Validated Probab=90.88 E-value=0.69 Score=25.50 Aligned_cols=54 Identities=20% Similarity=0.286 Sum_probs=40.6 Q ss_pred HHCCHHHHHCCCC-CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEE Q ss_conf 4416023310111-15683365524778861158999999988630246678760699995 Q gi|254780617|r 146 LSTGIKAIDSLIP-IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVA 205 (509) Q Consensus 146 l~TGI~~ID~l~p-igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~ 205 (509) +.||.+-+|.++- +-+|+=+-|=|-++.|||++++++..|-+..+ ..-+++|-+ T Consensus 186 vpTG~~~ld~~~~g~~~~~l~i~a~rp~~GKt~~~~~~a~~~a~~~------~~~v~~FSL 240 (868) T PRK07773 186 VPTGFYDLDEMTNGLHPGQMIIVAARPGVGKSTLGLDFARSCSIRH------RLASVIFSL 240 (868) T ss_pred CCCCHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHC------CCCEEEEEE T ss_conf 6889799987631888776799982897777789999999999865------996589730 No 257 >TIGR03258 PhnT 2-aminoethylphosphonate ABC transport system, ATP-binding component PhnT. This ATP-binding component of an ABC transport system is found in Salmonella and Burkholderia lineages in the vicinity of enzymes for the breakdown of 2-aminoethylphosphonate. Probab=90.87 E-value=0.15 Score=30.29 Aligned_cols=32 Identities=25% Similarity=0.452 Sum_probs=23.6 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 2331-01111568336552477886115899999 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .++| .=+.|.+|.-..|+|++|+||||++ -+| T Consensus 19 ~al~dvsl~i~~GE~~~llGpSG~GKTTlL-r~i 51 (362) T TIGR03258 19 TVLDDLSLEIEAGELLALIGKSGCGKTTLL-RAI 51 (362) T ss_pred EEECCCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 899376719999989999999974599999-999 No 258 >PRK13542 consensus Probab=90.86 E-value=0.16 Score=30.06 Aligned_cols=34 Identities=24% Similarity=0.398 Sum_probs=25.6 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 602331-01111568336552477886115899999 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |-++++ .=+.+.+|+..+|+|+.|+|||||+ .+| T Consensus 30 ~~~il~~isl~i~~Gei~~liGpNGaGKTTLl-k~l 64 (224) T PRK13542 30 GRAVFRGIDISLAPGDLLQVMGPNGSGKTSLL-RVL 64 (224) T ss_pred CEEEEECCEEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 99988461678759979999999999999999-999 No 259 >cd03260 ABC_PstB_phosphate_transporter Phosphate uptake is of fundamental importance in the cell physiology of bacteria because phosphate is required as a nutrient. The Pst system of E. coli comprises four distinct subunits encoded by the pstS, pstA, pstB, and pstC genes. The PstS protein is a phosphate-binding protein located in the periplasmic space. P stA and PstC are hydrophobic and they form the transmembrane portion of the Pst system. PstB is the catalytic subunit, which couples the energy of ATP hydrolysis to the import of phosphate across cellular membranes through the Pst system, often referred as ABC-protein. PstB belongs to one of the largest superfamilies of proteins characterized by a highly conserved adenosine triphosphate (ATP) binding cassette (ABC), which is also a nucleotide binding domain (NBD). Probab=90.84 E-value=0.16 Score=30.06 Aligned_cols=28 Identities=21% Similarity=0.426 Sum_probs=23.4 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 01111568336552477886115899999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .=+.+-+|+..+|+|.+|+||||++. +| T Consensus 19 isl~i~~Ge~~~iiG~SGsGKSTll~-~i 46 (227) T cd03260 19 ISLDIPKGEITALIGPSGCGKSTLLR-LL 46 (227) T ss_pred EEEEECCCCEEEEECCCCCCHHHHHH-HH T ss_conf 06788799899999999981999999-99 No 260 >PRK11819 putative ABC transporter ATP-binding protein; Reviewed Probab=90.81 E-value=0.16 Score=30.22 Aligned_cols=32 Identities=19% Similarity=0.306 Sum_probs=18.7 Q ss_pred CCCCHHHHHHHHHHHHHHH-----HHHHHHHHHHHHH Q ss_conf 6679899999999999999-----7415999998873 Q gi|254780617|r 454 DEVAVSQVRKFETNFLSHM-----RVSSQDILEDIRK 485 (509) Q Consensus 454 d~i~v~~I~~fe~~l~~~l-----~~~~~ei~~~I~~ 485 (509) +.++++.+...++.|.+|= -++...+++.+.+ T Consensus 474 n~LDi~s~e~Le~aL~~y~Gtvl~VSHDr~fi~~va~ 510 (556) T PRK11819 474 NDLDVETLRALEDALLEFPGCAVVISHDRWFLDRIAT 510 (556) T ss_pred CCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHCC T ss_conf 5679999999999998779969999789999998688 No 261 >cd04144 Ras2 Ras2 subfamily. The Ras2 subfamily, found exclusively in fungi, was first identified in Ustilago maydis. In U. maydis, Ras2 is regulated by Sql2, a protein that is homologous to GEFs (guanine nucleotide exchange factors) of the CDC25 family. Ras2 has been shown to induce filamentous growth, but the signaling cascade through which Ras2 and Sql2 regulate cell morphology is not known. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Ras proteins. Probab=90.80 E-value=0.58 Score=26.04 Aligned_cols=20 Identities=25% Similarity=0.567 Sum_probs=16.7 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 1 KivviGd~gVGKTsli~r~~ 20 (190) T cd04144 1 KLVVLGDGGVGKTALTIQLC 20 (190) T ss_pred CEEEECCCCCCHHHHHHHHH T ss_conf 98999989987899999996 No 262 >PRK13651 cobalt transporter ATP-binding subunit; Provisional Probab=90.78 E-value=0.17 Score=29.99 Aligned_cols=31 Identities=39% Similarity=0.544 Sum_probs=26.1 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 02331-01111568336552477886115899 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) ++|+| .=+.|-+|...+|+|..|+||||++. T Consensus 20 ~~AL~dvsl~I~~GE~v~IiG~nGsGKSTL~k 51 (304) T PRK13651 20 LKALDGVSTEINQGEFIAIIGQTGSGKTTFIE 51 (304) T ss_pred CEEEECEEEEECCCCEEEEECCCCCCHHHHHH T ss_conf 46863405798599899998799985999999 No 263 >PRK09544 znuC high-affinity zinc transporter ATPase; Reviewed Probab=90.77 E-value=0.17 Score=29.92 Aligned_cols=30 Identities=17% Similarity=0.399 Sum_probs=24.6 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 0111156833655247788611589999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .=+.+.+|+..+|+|+.|+|||||+ .+|.. T Consensus 23 vs~~i~~Gei~~LiGpNGaGKSTLl-k~I~G 52 (251) T PRK09544 23 VSLELKPGKILTLLGPNGAGKSTLV-RVVLG 52 (251) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHHC T ss_conf 0789879979999989998899999-99966 No 264 >PRK13652 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=90.77 E-value=0.15 Score=30.30 Aligned_cols=35 Identities=29% Similarity=0.467 Sum_probs=27.3 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 1602331-01111568336552477886115899999 Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .+..++| .=+.+.+|+..+|+|+.|+||||++ .+| T Consensus 15 g~~~aL~~is~~i~~Ge~~aliG~NGaGKSTLl-~~i 50 (277) T PRK13652 15 GSKEALNNINFIAGRKQRIAVIGPNGAGKSTLF-KHF 50 (277) T ss_pred CCCEEEECCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 998798672879989989999999994799999-999 No 265 >cd03271 ABC_UvrA_II The excision repair protein UvrA domain II; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=90.76 E-value=0.17 Score=29.97 Aligned_cols=24 Identities=29% Similarity=0.385 Sum_probs=21.2 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 111568336552477886115899 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) +.|-+|+..+|+|.+|+|||||+. T Consensus 16 l~i~~G~~~aIiG~sGsGKSTLl~ 39 (261) T cd03271 16 VDIPLGVLTCVTGVSGSGKSSLIN 39 (261) T ss_pred EEECCCCEEEEECCCCCCHHHHHH T ss_conf 898899999998799986999999 No 266 >cd03235 ABC_Metallic_Cations ABC component of the metal-type transporters. This family includes transporters involved in the uptake of various metallic cations such as iron, manganese, and zinc. The ATPases of this group of transporters are very similar to members of iron-siderophore uptake family suggesting that they share a common ancestor. The best characterized metal-type ABC transporters are the YfeABCD system of Y. pestis, the SitABCD system of Salmonella enterica serovar Typhimurium, and the SitABCD transporter of Shigella flexneri. Moreover other uncharacterized homologs of these metal-type transporters are mainly found in pathogens like Haemophilus or enteroinvasive E. coli isolates. Probab=90.73 E-value=0.17 Score=29.95 Aligned_cols=34 Identities=18% Similarity=0.416 Sum_probs=25.9 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 2331-0111156833655247788611589999999 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) +++| .=+.|-+|+..+|+|+.|+||||++ .+|.. T Consensus 13 ~~L~~is~~i~~Ge~~~liGpNGaGKSTll-k~i~G 47 (213) T cd03235 13 PVLEDVSFEVKPGEFLAIVGPNGAGKSTLL-KAILG 47 (213) T ss_pred EEEEEEEEEECCCCEEEEECCCCCCHHHHH-HHHHC T ss_conf 988502789859989999999998699999-99976 No 267 >PRK13644 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=90.70 E-value=0.16 Score=30.06 Aligned_cols=32 Identities=31% Similarity=0.512 Sum_probs=25.5 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 602331-01111568336552477886115899 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) |..+++ .=+.|-+|+..+|+|..|+|||||+. T Consensus 14 ~~~aL~~vsl~i~~Ge~vaiiG~nGsGKSTL~~ 46 (274) T PRK13644 14 GTPALENINLVIKKGEYIGIIGKNGSGKSTLAL 46 (274) T ss_pred CCEEEEEEEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 984663117798489999999999980999999 No 268 >cd03299 ABC_ModC_like Archeal protein closely related to ModC. ModC is an ABC-type transporter and the ATPase component of a molybdate transport system that also includes the periplasmic binding protein ModA and the membrane protein ModB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=90.69 E-value=0.16 Score=30.13 Aligned_cols=30 Identities=23% Similarity=0.510 Sum_probs=24.4 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.+.+|....|+|++|+|||||+ -+|. T Consensus 17 ~vs~~v~~Ge~~~iiGpSGsGKSTLl-r~i~ 46 (235) T cd03299 17 NVSLEVERGDYFVILGPTGSGKSVLL-ETIA 46 (235) T ss_pred CCEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 14879889989999999963599999-9997 No 269 >PRK11831 putative ABC transporter ATP-binding protein YrbF; Provisional Probab=90.69 E-value=0.16 Score=30.09 Aligned_cols=29 Identities=24% Similarity=0.398 Sum_probs=24.1 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+-+|+-.+|+|++|+|||||+ .+|. T Consensus 27 is~~i~~Gei~~iiGpnGsGKSTLl-k~i~ 55 (269) T PRK11831 27 ISLTVPRGKITAIMGPSGIGKTTLL-RLIG 55 (269) T ss_pred CEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 1668879989999939997599999-9996 No 270 >PRK13649 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=90.68 E-value=0.16 Score=30.16 Aligned_cols=27 Identities=30% Similarity=0.442 Sum_probs=23.5 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 101111568336552477886115899 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) |.=+.+.+|+..+|+|.+|+|||||+. T Consensus 25 ~Vsl~i~~Ge~~aiiG~nGsGKSTLl~ 51 (280) T PRK13649 25 DVNLDILDGSYTAFIGHTGSGKSTIMQ 51 (280) T ss_pred EEEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 026898799899999599986999999 No 271 >PRK11629 lolD lipoprotein transporter ATP-binding subunit; Provisional Probab=90.67 E-value=0.17 Score=29.96 Aligned_cols=29 Identities=24% Similarity=0.562 Sum_probs=24.0 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 101111568336552477886115899999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |.=+.+-+|...+|+|++|+|||||+ .+| T Consensus 27 ~vs~~i~~GE~v~iiG~sGsGKSTLl-~~i 55 (233) T PRK11629 27 NVSFSIGEGEMMAIVGSSGSGKSTLL-HLL 55 (233) T ss_pred CEEEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 62899889989999999994099999-999 No 272 >PRK13647 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=90.67 E-value=0.16 Score=30.08 Aligned_cols=34 Identities=32% Similarity=0.634 Sum_probs=26.4 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 602331-01111568336552477886115899999 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |.+++| .=+.+-+|+..+|+|+.|+||||++. +| T Consensus 17 g~~aL~~vs~~i~~Ge~~aiiG~NGsGKSTLl~-~l 51 (273) T PRK13647 17 GTKALDGVSLVIPEGSKTAILGPNGAGKSTLLL-HL 51 (273) T ss_pred CCEEEEEEEEEECCCCEEEEECCCCCHHHHHHH-HH T ss_conf 888988117898899899999999975999999-99 No 273 >pfam09378 HAS-barrel HAS barrel domain. The HAS barrel is named after HerA-ATP Synthase. In ATP synthases, this domain is implicated in the assembly of the catalytic toroid and docking of accessory subunits, such as the subunit of the ATP synthase complex. Similar roles in docking of the functional partner, the NurA nuclease, and assembly of the HerA toroid complex appear likely for the HAS-barrel of the HerA family. Probab=90.66 E-value=1.2 Score=23.80 Aligned_cols=43 Identities=49% Similarity=0.706 Sum_probs=36.1 Q ss_pred EEEEEEEEECCEEEEECCCCCCCCCEEEEC---CCEEEEEEEECCCE Q ss_conf 035899982758999758887737479957---98189999205985 Q gi|254780617|r 27 EIGRVLSIGDGIARVYGLNNIRAGEMVQFS---HGVYGMALNLEVDN 70 (509) Q Consensus 27 ~~G~V~~V~~giv~v~GL~~a~~GElv~~~---~g~~G~Vi~l~~d~ 70 (509) .+|.+.. .+.++.+.+..++++||.|.++ +.+.|+|.++.... T Consensus 4 vig~~~~-~~~~~~~~~~~~v~~Ge~V~i~~~~~~vlG~V~~i~~~~ 49 (88) T pfam09378 4 VIGTVSS-NDGIFRVEGLKDVRVGEYVEIELDGSKVLGMVLNIERGL 49 (88) T ss_pred EEEEEEC-CCCEEEEECCCCCCCCCEEEEEECCEEEEEEEEEEECCC T ss_conf 9988744-885799605663103328999977868998888650204 No 274 >PRK11432 fbpC ferric transporter ATP-binding subunit; Provisional Probab=90.65 E-value=0.15 Score=30.32 Aligned_cols=32 Identities=25% Similarity=0.415 Sum_probs=22.4 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 2331-01111568336552477886115899999 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .++| .=+.|.+|+-..|+|++|+||||++ -+| T Consensus 20 ~al~~vsl~i~~Ge~~~llGpSG~GKTTlL-r~i 52 (351) T PRK11432 20 TVIDNLDLTIKQGTMVTLLGPSGCGKTTVL-RLV 52 (351) T ss_pred EEECCCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 898445749889989999999964999999-999 No 275 >cd03234 ABCG_White The White subfamily represents ABC transporters homologous to the Drosophila white gene, which acts as a dimeric importer for eye pigment precursors. The eye pigmentation of Drosophila is developed from the synthesis and deposition in the cells of red pigments, which are synthesized from guanine, and brown pigments, which are synthesized from tryptophan. The pigment precursors are encoded by the white, brown, and scarlet genes, respectively. Evidence from genetic and biochemical studies suggest that the White and Brown proteins function as heterodimers to import guanine, while the White and Scarlet proteins function to import tryptophan. However, a recent study also suggests that White may be involved in the transport of a metabolite, such as 3-hydroxykynurenine, across intracellular membranes. Mammalian ABC transporters belonging to the White subfamily (ABCG1, ABCG5, and ABCG8) have been shown to be involved in the regulation of lipid-trafficking mechanisms in Probab=90.58 E-value=0.16 Score=30.04 Aligned_cols=35 Identities=23% Similarity=0.468 Sum_probs=26.6 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 02331-0111156833655247788611589999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .++++ .=+.+-+|+-.+|+|++|+|||||+ .+|.. T Consensus 20 ~~vL~~vs~~i~~Ge~~~ilGpnGsGKSTLl-~~i~G 55 (226) T cd03234 20 ARILNDVSLHVESGQVMAILGSSGSGKTTLL-DAISG 55 (226) T ss_pred EEEEECCEEEEECCCEEEEECCCCCHHHHHH-HHHHC T ss_conf 8998897789918809999989996099999-99967 No 276 >TIGR01842 type_I_sec_PrtD type I secretion system ATPase; InterPro: IPR010128 Type I protein secretion is a system in some Gram-negative bacteria to export proteins (often proteases) across both inner and outer membranes to the extracellular medium. This is one of three proteins of the type I secretion apparatus. Targeted proteins are not cleaved at the N terminus, but rather carry signals located toward the extreme C terminus to direct type I secretion.; GO: 0005524 ATP binding, 0015031 protein transport, 0016021 integral to membrane. Probab=90.58 E-value=0.2 Score=29.50 Aligned_cols=34 Identities=32% Similarity=0.494 Sum_probs=24.7 Q ss_pred CHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 001444160233101111568336552477886115899 Q gi|254780617|r 142 VCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 142 i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) +-.+..+||. +.|-.|.-.+|+|+||+|||||+- T Consensus 341 vPk~~l~gi~-----F~~~aGe~laIIGPSgSGKStLaR 374 (556) T TIGR01842 341 VPKPTLRGIS-----FAIQAGEALAIIGPSGSGKSTLAR 374 (556) T ss_pred CCHHHHCCCC-----EEECCCCEEEEECCCCCCHHHHHH T ss_conf 7614227862-----156377458887478652588987 No 277 >TIGR02315 ABC_phnC phosphonate ABC transporter, ATP-binding protein; InterPro: IPR012693 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Phosphonates are a class of phosphorus-containing organic compound with a stable direct C-P bond rather than a C-O-P linkage. A number of bacterial species have operons, typically about 14 genes in size, with genes for ATP-dependent transport of phosphonates, degradation, and regulation of the expression of the system. Members of this protein family are the ATP-binding cassette component of tripartite ABC transporters of phosphonates.; GO: 0005524 ATP binding, 0015416 phosphonate transmembrane-transporting ATPase activity, 0015716 phosphonate transport, 0016020 membrane. Probab=90.58 E-value=0.16 Score=30.08 Aligned_cols=182 Identities=15% Similarity=0.177 Sum_probs=84.7 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCC---CCCC------CEEEEE-ECC-CC-HHHHH Q ss_conf 1602331-0111156833655247788611589999999886302466---7876------069999-516-77-67899 Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGS---EKDK------VYCIYV-AIG-QK-RSSVA 214 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~---~~~~------v~~V~~-~IG-er-~~ev~ 214 (509) +|-.++. .=+.|.+|-=+.|+|.||.|||||+= =+|+ ... .+.. -|-+.. -|+ .| +.+.+ T Consensus 13 ~g~~~LK~inl~i~~GE~~~~IG~SGAGKSTLLR--~iNr-----L~~Gdk~~~Geilidf~i~~~g~~i~~~~~~k~LR 85 (253) T TIGR02315 13 NGKQALKNINLEINPGEFVAVIGPSGAGKSTLLR--CINR-----LVEGDKPSSGEILIDFSILLEGTDITKLRRGKKLR 85 (253) T ss_pred CCCEEEEEECCEEECCEEEEEECCCCCCHHHHHH--HHHH-----HCCCCCCCCCEEEEEEEEEECCCHHHHHHHHHHHH T ss_conf 9837885311434165179997378872679998--7753-----02688887650898888887273187675488999 Q ss_pred HHHHH----HCCCCCCCEEEEEEECCCCCHHHHHHHHHHHH--HHHHHH-HCCCCCEEEEEECHHHH------------- Q ss_conf 99985----11467421158997288788899999887666--666754-20468847996164789------------- Q gi|254780617|r 215 RFVKA----LEDRGALSYSIVVVASASDPAPMQLLAPFAGC--AMGEYF-RDNGYHALIAYDDLQKH------------- 274 (509) Q Consensus 215 ~~~~~----l~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~--aiAEyf-r~~G~~VLi~~Ddltr~------------- 274 (509) +.... |.+....+|. ++.+-.+.++.|. ++.-.| +...+|--.=+..|.|. T Consensus 86 ~~R~~igMIFQ~yNLi~R~---------~VL~NVL~GRLg~~~~~~~~~~~F~~~dk~~Al~~L~rVGl~~~A~~RaD~L 156 (253) T TIGR02315 86 KLRRKIGMIFQHYNLIERL---------TVLENVLHGRLGYKSTWRSLLGRFSEEDKERALSLLERVGLADKAYQRADQL 156 (253) T ss_pred HHHHHHCCEEHHHHCCCCC---------HHHHHHHHHHHCCCHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 9976435011010237830---------3667642143214503455632589989999998765238788986521321 Q ss_pred ----------HHHHHHH--HHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHH Q ss_conf ----------9999875--4530378754333752001246788887504544588735675310036766443114668 Q gi|254780617|r 275 ----------AVAYRQL--SLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNV 342 (509) Q Consensus 275 ----------A~a~rei--sl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~ 342 (509) |+|+-+= =+|..||-+. -=|..-.....++=|-.+ +. -||++ |-.-.=|+.--=+|-+ T Consensus 157 SGGQQQRVaIARAL~Q~P~lILADEPiAS--LDP~~s~~VMd~lk~In~---e~---GIT~i--~NLH~VdlA~~Y~dRi 226 (253) T TIGR02315 157 SGGQQQRVAIARALAQQPKLILADEPIAS--LDPKTSKQVMDYLKRINK---ED---GITVI--VNLHQVDLAKKYADRI 226 (253) T ss_pred CCCCHHHHHHHHHHCCCCCEEEECCCCCC--CCHHHHHHHHHHHHHHHH---HC---CCEEE--EECCCHHHHHHHHHHH T ss_conf 58511689999863689958970488433--675678999999999887---76---97799--9746178788763377 Q ss_pred HHHHCCEEEEECC Q ss_conf 8540616996150 Q gi|254780617|r 343 ISITDGQIFLETE 355 (509) Q Consensus 343 ~si~DG~i~l~~~ 355 (509) +.+=+|+||+|-. T Consensus 227 vGL~~G~iVFDG~ 239 (253) T TIGR02315 227 VGLKAGEIVFDGA 239 (253) T ss_pred HHHHCCCEEECCC T ss_conf 6666672145378 No 278 >PRK00635 excinuclease ABC subunit A; Provisional Probab=90.56 E-value=0.49 Score=26.59 Aligned_cols=37 Identities=30% Similarity=0.514 Sum_probs=31.9 Q ss_pred CCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 160233101111568336552477886115899999998 Q gi|254780617|r 148 TGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 148 TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) -..|-||.=+| |+|=.-|-|-||.|||+||.|+|.+. T Consensus 949 HNLKnIdv~IP--rnkLvViTGvSGSGKSSLAFDTLyAE 985 (1809) T PRK00635 949 HNLKHIDLSLP--RNALTAITGPSASGKHSLVFDILYAA 985 (1809) T ss_pred CCCCCCCCCCC--CCCEEEEECCCCCCHHHHHHHHHHHH T ss_conf 35566331467--98669997898886899999889999 No 279 >PRK12288 ribosome-associated GTPase; Reviewed Probab=90.55 E-value=1.1 Score=24.15 Aligned_cols=27 Identities=26% Similarity=0.512 Sum_probs=18.2 Q ss_pred HHCCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 310111156833655247788611589 Q gi|254780617|r 153 IDSLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 153 ID~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +|.|-+.-+|+.-.+.|.+|+|||||. T Consensus 198 l~~L~~~L~~ktsvf~GqSGVGKSSLi 224 (344) T PRK12288 198 LEPLEAALTGRISIFVGQSGVGKSSLI 224 (344) T ss_pred HHHHHHHHCCCEEEEEECCCCCHHHHH T ss_conf 999999876785999806876788887 No 280 >PRK11650 ugpC glycerol-3-phosphate transporter ATP-binding subunit; Provisional Probab=90.55 E-value=0.16 Score=30.03 Aligned_cols=33 Identities=15% Similarity=0.374 Sum_probs=22.8 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ..++| .=+.|.+|+...|+|++|+||||++ .+| T Consensus 17 ~~al~~vsl~i~~Ge~~~llGpsG~GKTTll-r~i 50 (358) T PRK11650 17 TQVIKGIDLDVADGEFIVLVGPSGCGKSTLL-RMV 50 (358) T ss_pred CEEEECCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 9798252779889989999999863699999-999 No 281 >TIGR03415 ABC_choXWV_ATP choline ABC transporter, ATP-binding protein. Members of this protein family are the ATP-binding subunit of a three-protein transporter. This family belongs, more broadly, to the family of proline and glycine-betaine transporters, but members have been identified by direct characterization and by bioinformatic means as choline transporters. Many species have several closely-related members of this family, probably with variable abilities to act additionally on related quaternary amines. Probab=90.49 E-value=0.17 Score=29.90 Aligned_cols=36 Identities=22% Similarity=0.462 Sum_probs=29.5 Q ss_pred HCC--HHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 416--0233101111568336552477886115899999 Q gi|254780617|r 147 STG--IKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 147 ~TG--I~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .|| +.+=|.=+.|-+|.-..|+|.||+||||++ -+| T Consensus 33 ~tg~~vAV~dvsl~I~~GEi~~lvGpSGsGKSTLL-r~i 70 (382) T TIGR03415 33 ETGLVVGVANASLDIEEGEICVLMGLSGSGKSSLL-RAV 70 (382) T ss_pred HHCCEEEEECCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 33999989651748879989999999973499999-999 No 282 >PRK13538 cytochrome c biogenesis protein CcmA; Provisional Probab=90.46 E-value=0.19 Score=29.62 Aligned_cols=29 Identities=24% Similarity=0.336 Sum_probs=24.1 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+.+|+..+|+|+.|+|||||+ .+|. T Consensus 20 vs~~i~~Ge~~~l~G~NGaGKSTLl-~~l~ 48 (204) T PRK13538 20 LSFTLNAGELVQIEGPNGAGKTSLL-RILA 48 (204) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 1779879989999999998599999-9997 No 283 >cd04148 RGK RGK subfamily. The RGK (Rem, Rem2, Rad, Gem/Kir) subfamily of Ras GTPases are expressed in a tissue-specific manner and are dynamically regulated by transcriptional and posttranscriptional mechanisms in response to environmental cues. RGK proteins bind to the beta subunit of L-type calcium channels, causing functional down-regulation of these voltage-dependent calcium channels, and either termination of calcium-dependent secretion or modulation of electrical conduction and contractile function. Inhibition of L-type calcium channels by Rem2 may provide a mechanism for modulating calcium-triggered exocytosis in hormone-secreting cells, and has been proposed to influence the secretion of insulin in pancreatic beta cells. RGK proteins also interact with and inhibit the Rho/Rho kinase pathway to modulate remodeling of the cytoskeleton. Two characteristics of RGK proteins cited in the literature are N-terminal and C-terminal extensions beyond the GTPase domain typical of Ra Probab=90.45 E-value=0.88 Score=24.75 Aligned_cols=18 Identities=28% Similarity=0.543 Sum_probs=15.7 Q ss_pred EEEEECCCCCCCHHHHHH Q ss_conf 365524778861158999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILD 181 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~ 181 (509) |+.++|++|||||+|+.- T Consensus 2 KVvllGd~gVGKTSLi~r 19 (221) T cd04148 2 RVVMLGSPGVGKSSLASQ 19 (221) T ss_pred EEEEECCCCCHHHHHHHH T ss_conf 799999899709999999 No 284 >TIGR00630 uvra excinuclease ABC, A subunit; InterPro: IPR004602 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). During the process of Escherichia coli nucleotide excision repair, DNA damage recognition and processing are achieved by the action of the uvrA, uvrB, and uvrC gene products . The UvrC protein contain 4 conserved regions: a central region which interact with UvrB (Uvr domain), a Helix hairpin Helix (HhH) domain important for 5 prime incision of damage DNA and the homology regions 1 and 2 of unknown function. UvrC homology region 2 is specific for UvrC proteins, whereas UvrC homology region 1 is also shared by few other nucleases.; GO: 0009381 excinuclease ABC activity, 0006289 nucleotide-excision repair, 0009380 excinuclease repair complex. Probab=90.45 E-value=0.22 Score=29.12 Aligned_cols=123 Identities=22% Similarity=0.331 Sum_probs=68.5 Q ss_pred HCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCC Q ss_conf 41602331011115683365524778861158999999988630246678760699995167767899999851146742 Q gi|254780617|r 147 STGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGAL 226 (509) Q Consensus 147 ~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~ 226 (509) .=.+|-||.=+|+ |+=..|=|-||.|||||..|+...-....-. +.....--++.....+.+ T Consensus 652 ~nNLK~i~v~iPL--G~~t~iTGVSGSGKSTLind~L~~~~~~~L~----------------~~~~~~g~~~~I~G~e~l 713 (956) T TIGR00630 652 ENNLKNITVSIPL--GLFTCITGVSGSGKSTLINDTLYPALARRLN----------------GAKTEPGRYKDIEGLEHL 713 (956) T ss_pred CCCCCCCEEEECC--CEEEEEECCCCCCHHHHHHHHHHHHHHHHHH----------------CCCCCCCCCEEEEEECCC T ss_conf 0564021177407--7179997445874577799999999999985----------------276456652036300023 Q ss_pred CEEEEEEECCCCCHHHHH-H---HHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCC-CCCCCCCCHHH Q ss_conf 115899728878889999-9---887666666754204688479961647899999875453037875-43337520012 Q gi|254780617|r 227 SYSIVVVASASDPAPMQL-L---APFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPG-REAYPGDVFYL 301 (509) Q Consensus 227 ~~tvvv~a~a~~~~~~r~-~---ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~-~~gyp~~vf~~ 301 (509) +|.|.| |++|.=|- + |.|++. +|+ -|++ =..-|-+ .+||.++=|++ T Consensus 714 DKvi~i----DQsPIGRTPRSNPATYtgv----------------Fd~-------IR~L--FA~~PeAk~RGY~~GRFSF 764 (956) T TIGR00630 714 DKVIEI----DQSPIGRTPRSNPATYTGV----------------FDE-------IREL--FAETPEAKARGYKPGRFSF 764 (956) T ss_pred CCEEEE----CCCCCCCCCCCCCCCCCCC----------------CHH-------HHHH--HHCCCHHHHCCCCCCCEEE T ss_conf 846886----5774343468864320134----------------305-------7776--4058537873889986146 Q ss_pred HHHH--HHHHHHCCCCCCCCCEE Q ss_conf 4678--88875045445887356 Q gi|254780617|r 302 HSRL--LERAAKMSDALGAGSLT 322 (509) Q Consensus 302 ~s~l--~ERa~~~~~~~g~GSiT 322 (509) ==+. +|.|. |-|.|+ T Consensus 765 NVkGGRCE~C~------GDG~ik 781 (956) T TIGR00630 765 NVKGGRCEACQ------GDGVIK 781 (956) T ss_pred CCCCCCCCCCC------CCCEEE T ss_conf 58888432426------885478 No 285 >cd04124 RabL2 RabL2 subfamily. RabL2 (Rab-like2) subfamily. RabL2s are novel Rab proteins identified recently which display features that are distinct from other Rabs, and have been termed Rab-like. RabL2 contains RabL2a and RabL2b, two very similar Rab proteins that share 98% sequence identity in humans. RabL2b maps to the subtelomeric region of chromosome 22q13.3 and RabL2a maps to 2q13, a region that suggests it is also a subtelomeric gene. Both genes are believed to be expressed ubiquitously, suggesting that RabL2s are the first example of duplicated genes in human proximal subtelomeric regions that are both expressed actively. Like other Rab-like proteins, RabL2s lack a prenylation site at the C-terminus. The specific functions of RabL2a and RabL2b remain unknown. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-b Probab=90.44 E-value=0.53 Score=26.36 Aligned_cols=19 Identities=21% Similarity=0.546 Sum_probs=16.1 Q ss_pred EEEEECCCCCCCHHHHHHH Q ss_conf 3655247788611589999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDT 182 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~ 182 (509) |+.++|+++||||+|+.-. T Consensus 2 KivllGd~~VGKTsli~r~ 20 (161) T cd04124 2 KIILLGDSAVGKSKLVERF 20 (161) T ss_pred EEEEECCCCCCHHHHHHHH T ss_conf 8999998996789999999 No 286 >PRK07261 topology modulation protein; Provisional Probab=90.44 E-value=1.4 Score=23.36 Aligned_cols=110 Identities=14% Similarity=0.169 Sum_probs=55.1 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCH-HH Q ss_conf 36552477886115899999998863024667876069999516776789999985114674211589972887888-99 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPA-PM 242 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~-~~ 242 (509) |+.|+|.+|.||||||- .+-.+.+.- ...-| .+|-.=|-..+.-.+|.+.+.+.-.-+ .=|+-..-.... .. T Consensus 2 rI~IiG~sGsGKSTlAr-~L~~~~~ip--~~~LD---~l~w~p~w~~~~~~e~~~~~~~~~~~~-~WIiDGny~~~~~~~ 74 (171) T PRK07261 2 KIAIIGYSGSGKSTLAR-FLGQHYNCP--VLHLD---QLHFSSNWQERDDDDMIADISNFLLKQ-DWIIEGNYSNCLYEE 74 (171) T ss_pred EEEEECCCCCCHHHHHH-HHHHHHCCC--EEEEC---CEEECCCCEECCHHHHHHHHHHHHHCC-CEEEECCCCCHHHHH T ss_conf 89998899986899999-999987979--79702---278889998888999999999998489-879947851247776 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCC-----CCCCCCCCCC Q ss_conf 99988766666675420468847996164789999987545303-----7875433375 Q gi|254780617|r 243 QLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLR-----RPPGREAYPG 296 (509) Q Consensus 243 r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~-----~pp~~~gyp~ 296 (509) |+.. -| +|++=|+.++...+|-+.-.+. ||.-.+|.|- T Consensus 75 rl~~---------------aD-~iI~Ld~p~~~~l~rvikR~l~~~~~~r~~m~~g~~E 117 (171) T PRK07261 75 RMAE---------------AD-QIIFLNFSRFNCLYRAFKRYLKYRGKTRPSMAENCPE 117 (171) T ss_pred HHHH---------------CC-EEEEECCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 7977---------------99-9999858499999999999999827758765789934 No 287 >CHL00131 ycf16 sulfate ABC transporter protein; Validated Probab=90.41 E-value=0.19 Score=29.57 Aligned_cols=33 Identities=18% Similarity=0.387 Sum_probs=25.4 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 2331-011115683365524778861158999999 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .+++ .=+.+-+|+..+|+|++|+|||||+ .+|. T Consensus 20 ~vL~~isl~i~~Gei~aiiG~nGsGKSTL~-~~i~ 53 (252) T CHL00131 20 EILKGINLSINAGEIHAIMGPNGSGKSTLS-KVIA 53 (252) T ss_pred EEEECCEEEECCCCEEEEECCCCCCHHHHH-HHHC T ss_conf 988561778879989999999999999999-9972 No 288 >cd03296 ABC_CysA_sulfate_importer Part of the ABC transporter complex cysAWTP involved in sulfate import. Responsible for energy coupling to the transport system. The complex is composed of two ATP-binding proteins (cysA), two transmembrane proteins (cysT and cysW), and a solute-binding protein (cysP). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=90.37 E-value=0.18 Score=29.80 Aligned_cols=34 Identities=24% Similarity=0.432 Sum_probs=26.4 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 602331-01111568336552477886115899999 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) +.+++| .=+.+-+|+-..|+|++|+|||||+ .+| T Consensus 14 ~~~~l~~is~~v~~Ge~~~iiGpSGsGKSTll-~~i 48 (239) T cd03296 14 DFVALDDVSLDIPSGELVALLGPSGSGKTTLL-RLI 48 (239) T ss_pred CEEEECCCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 98998663869889989999999997799999-999 No 289 >COG4987 CydC ABC-type transport system involved in cytochrome bd biosynthesis, fused ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones] Probab=90.37 E-value=0.19 Score=29.55 Aligned_cols=28 Identities=18% Similarity=0.514 Sum_probs=23.5 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 01111568336552477886115899999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .=+++-.|+|+.|+|.+|+||||++ ..+ T Consensus 357 ~~l~l~~GEkvAIlG~SGsGKSTll-qLl 384 (573) T COG4987 357 FNLTLAQGEKVAILGRSGSGKSTLL-QLL 384 (573) T ss_pred CCEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 6513258876888779998789999-999 No 290 >cd03225 ABC_cobalt_CbiO_domain1 Domain I of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. This ABC transport system of the CbiMNQO family is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most of cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems. Probab=90.37 E-value=0.19 Score=29.62 Aligned_cols=30 Identities=27% Similarity=0.512 Sum_probs=24.9 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.+-+|+-.+|+|+.|+|||||+ .+|. T Consensus 19 ~vsl~i~~Gei~~iiG~nGaGKSTLl-k~i~ 48 (211) T cd03225 19 DISLTIKKGEFVLIVGPNGSGKSTLL-RLLN 48 (211) T ss_pred EEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 31788849979999889999899999-9996 No 291 >PRK11231 fecE iron-dicitrate transporter ATP-binding subunit; Provisional Probab=90.35 E-value=0.19 Score=29.66 Aligned_cols=34 Identities=18% Similarity=0.464 Sum_probs=25.9 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 2331-0111156833655247788611589999999 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) +++| .=+.+-+|+-.+|+|+.|+|||||+ .+|.+ T Consensus 16 ~~L~~isl~i~~Gei~~liGpNGaGKSTLl-k~i~G 50 (255) T PRK11231 16 RILNDLSLSLPTGKITALIGPNGCGKSTLL-KCFAR 50 (255) T ss_pred EEEECCEEEECCCCEEEEECCCCCHHHHHH-HHHHC T ss_conf 998230889989979999999998199999-99975 No 292 >PRK13546 teichoic acids export protein ATP-binding subunit; Provisional Probab=90.33 E-value=0.19 Score=29.65 Aligned_cols=34 Identities=18% Similarity=0.211 Sum_probs=27.0 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 02331-011115683365524778861158999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +.|+| .=+.+-+|+..+|+|..|+|||||+ .+|. T Consensus 37 f~AL~dVsf~i~~GEivgllG~NGaGKSTLl-k~I~ 71 (264) T PRK13546 37 FFALDDISLKAYEGDVIGLVGINGSGKSTLS-NIIG 71 (264) T ss_pred EEEECCCEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 9995270788859989999989986199999-9996 No 293 >PRK09493 glnQ glutamine ABC transporter ATP-binding protein; Reviewed Probab=90.26 E-value=0.19 Score=29.51 Aligned_cols=31 Identities=23% Similarity=0.438 Sum_probs=25.1 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 10111156833655247788611589999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) |.=+.+.+|...+|+|++|+||||++ -+|.. T Consensus 19 dvs~~i~~Ge~~~liG~nGsGKSTll-~~i~G 49 (240) T PRK09493 19 NIDLNIDQGEVVVIIGPSGSGKSTLL-RCINK 49 (240) T ss_pred CEEEEECCCCEEEEECCCCCHHHHHH-HHHHC T ss_conf 30789879989999999998099999-99963 No 294 >cd03301 ABC_MalK_N The N-terminal ATPase domain of the maltose transporter, MalK. ATP binding cassette (ABC) proteins function from bacteria to human, mediating the translocation of substances into and out of cells or organelles. ABC transporters contain two transmembrane-spanning domains (TMDs) or subunits and two nucleotide binding domains (NBDs) or subunits that couple transport to the hydrolysis of ATP. In the maltose transport system, the periplasmic maltose binding protein (MBP) stimulates the ATPase activity of the membrane-associated transporter, which consists of two transmembrane subunits, MalF and MalG, and two copies of the ATP binding subunit, MalK, and becomes tightly bound to the transporter in the catalytic transition state, ensuring that maltose is passed to the transporter as ATP is hydrolyzed. Probab=90.23 E-value=0.2 Score=29.49 Aligned_cols=33 Identities=24% Similarity=0.457 Sum_probs=25.1 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ..++| .=+.+-+|.-..|+|++|+||||++ -+| T Consensus 13 ~~~l~~isl~v~~Ge~~~i~GpSG~GKSTlL-r~i 46 (213) T cd03301 13 VTALDDLNLDIADGEFVVLLGPSGCGKTTTL-RMI 46 (213) T ss_pred EEEECCCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 9998761779869989999999988099999-999 No 295 >PRK10418 nikD nickel transporter ATP-binding protein; Provisional Probab=90.22 E-value=0.24 Score=28.87 Aligned_cols=33 Identities=18% Similarity=0.330 Sum_probs=25.4 Q ss_pred HHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 331-0111156833655247788611589999999 Q gi|254780617|r 152 AID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 152 ~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) ++| .=+.+.+|.-.+|+|.+|+|||+++. ++++ T Consensus 18 vL~~Isl~v~~Ge~~aiiG~SGsGKStl~k-~llg 51 (254) T PRK10418 18 LVHGVSLTLQRGRVLALVGGSGSGKSLTCA-AALG 51 (254) T ss_pred EEECEEEEECCCCEEEEECCCCCHHHHHHH-HHHC T ss_conf 886607289899999999999878999999-9957 No 296 >TIGR03411 urea_trans_UrtD urea ABC transporter, ATP-binding protein UrtD. Members of this protein family are ABC transporter ATP-binding subunits associated with urea transport and metabolism. This protein is found in a conserved five-gene transport operon typically found adjacent to urease genes. It was shown in Cyanobacteria that disruption leads to the loss of high-affinity urea transport activity. Probab=90.21 E-value=0.17 Score=29.95 Aligned_cols=36 Identities=31% Similarity=0.567 Sum_probs=28.6 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1602331-011115683365524778861158999999 Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) -|.+++| .=+.+.+|+..+|+|+.|.||||++ .+|. T Consensus 13 g~~~al~~vsl~v~~Gei~~liGpNGaGKSTLl-~~i~ 49 (242) T TIGR03411 13 DGFKALNDLSLYVDPGELRVIIGPNGAGKTTMM-DVIT 49 (242) T ss_pred CCEEEECCCEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 998987450789889989999989997599999-9996 No 297 >TIGR03269 met_CoM_red_A2 methyl coenzyme M reductase system, component A2. The enzyme that catalyzes the final step in methanogenesis, methyl coenzyme M reductase, contains alpha, beta, and gamma chains. In older literature, the complex of alpha, beta, and gamma chains was termed component C, while this single chain protein was termed methyl coenzyme M reductase system component A2. Probab=90.18 E-value=0.18 Score=29.81 Aligned_cols=23 Identities=26% Similarity=0.462 Sum_probs=11.3 Q ss_pred CCCCCCCEEEEECCCCCCCHHHH Q ss_conf 11156833655247788611589 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +.+.+|+..+|+|.+|+|||||+ T Consensus 21 l~i~~Gei~~LvG~sGsGKSTL~ 43 (520) T TIGR03269 21 FTIEEGEVLGILGRSGAGKSVLM 43 (520) T ss_pred EEECCCCEEEEECCCCCHHHHHH T ss_conf 79999989999999996999999 No 298 >PRK13540 cytochrome c biogenesis protein CcmA; Provisional Probab=90.14 E-value=0.21 Score=29.31 Aligned_cols=29 Identities=17% Similarity=0.247 Sum_probs=23.7 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+-+||-.+|.|+.|+|||||+ .+|. T Consensus 20 is~~i~~G~i~~l~G~NGaGKSTLl-kli~ 48 (200) T PRK13540 20 ISFHLPAGGLLHLKGSNGAGKTTLL-KLIA 48 (200) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 2789879979999889998799999-9997 No 299 >cd04112 Rab26 Rab26 subfamily. First identified in rat pancreatic acinar cells, Rab26 is believed to play a role in recruiting mature granules to the plasma membrane upon beta-adrenergic stimulation. Rab26 belongs to the Rab functional group III, which are considered key regulators of intracellular vesicle transport during exocytosis. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins. Probab=90.13 E-value=0.67 Score=25.60 Aligned_cols=19 Identities=26% Similarity=0.641 Sum_probs=16.2 Q ss_pred EEEEECCCCCCCHHHHHHH Q ss_conf 3655247788611589999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDT 182 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~ 182 (509) |+.++|++|||||+|+.-. T Consensus 2 KIv~vGd~~VGKTsli~r~ 20 (191) T cd04112 2 KVMLLGDSGVGKTCLLVRF 20 (191) T ss_pred EEEEECCCCCCHHHHHHHH T ss_conf 8999994998799999999 No 300 >PRK10619 histidine/lysine/arginine/ornithine transporter subunit; Provisional Probab=90.12 E-value=0.2 Score=29.44 Aligned_cols=28 Identities=21% Similarity=0.374 Sum_probs=23.5 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 01111568336552477886115899999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .=+.+.+|...+|+|.+|+|||||+ .+| T Consensus 24 isl~i~~Gei~~iiG~sGsGKSTLl-~~i 51 (257) T PRK10619 24 VSLQANAGDVISIIGSSGSGKSTFL-RCI 51 (257) T ss_pred CEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 0668879979999989998199999-999 No 301 >TIGR02640 gas_vesic_GvpN gas vesicle protein GvpN; InterPro: IPR013462 The GvpN protein is associated with the production of gas vesicles produced in some prokaryotes to give cells buoyancy , . It belongs to a larger family of ATPases .; GO: 0000166 nucleotide binding, 0005524 ATP binding, 0031412 gas vesicle organization and biogenesis, 0031411 gas vesicle. Probab=90.10 E-value=0.18 Score=29.77 Aligned_cols=222 Identities=16% Similarity=0.193 Sum_probs=103.6 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC---CCCCCEEEEEEECCCC Q ss_conf 8336552477886115899999998863024667876069999516776789999985114---6742115899728878 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED---RGALSYSIVVVASASD 238 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~---~~~~~~tvvv~a~a~~ 238 (509) |-=+=+.|++|||||+||+. +.++. +-.|+.|.+ -.-=.-.+++-+..+ ....++=|==+...+| T Consensus 21 G~PvHl~GPaG~GKT~LA~h-vA~~r--------~RPV~l~~G---d~eL~~~DLvG~~~g~~~~kv~DqfihnV~K~~d 88 (265) T TIGR02640 21 GYPVHLRGPAGTGKTTLAMH-VARKR--------DRPVVLING---DAELTTSDLVGSYAGYTRKKVVDQFIHNVVKLED 88 (265) T ss_pred CCCEEEECCCCCCHHHHHHH-HHHHC--------CCCEEEEEC---CCCCCCCCCCCCCCCCEEEEEEECCEEEEECCCC T ss_conf 88667447888556899999-99736--------896899865---8232654423154675222232012111342512 Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 88999998876666667542046884799616478999998754530378754333752001246788887504544588 Q gi|254780617|r 239 PAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALGA 318 (509) Q Consensus 239 ~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~ 318 (509) ..-.+|.=-..-.|+=|-| =||||-.||= +|-..- =++|-|-||-=.+... T Consensus 89 ~~~~~W~D~rLt~Av~eG~-------TLVYdEF~Rs------------kP~~nN-------VLLSvlEE~vL~LPg~--- 139 (265) T TIGR02640 89 IVRQNWVDNRLTLAVREGF-------TLVYDEFTRS------------KPETNN-------VLLSVLEEGVLELPGK--- 139 (265) T ss_pred CCCCCCCCCHHHHHHHCCC-------EEEECCCCCC------------CCHHHH-------HHHHHHHHHHHCCCCC--- T ss_conf 2002667835789975697-------2766475788------------620456-------5675555232158887--- Q ss_pred CCEEEEEEEECCCCCCCCCHHHH--HHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 73567531003676644311466--8854061699615012210266543344412203434389887755678887998 Q gi|254780617|r 319 GSLTALPVIETQVNDVSAYIPTN--VISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMKQVSGAVKGELAQ 396 (509) Q Consensus 319 GSiT~~~~v~~~~~D~~~~i~~~--~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~~~a~~lr~~laq 396 (509) +.+ +.||-.+ .+ -||-|.--=..|+| ..+=+++.+|-.+... T Consensus 140 -~~~------------~~Yv~VhP~FR-----~IfTSNp~EYAGVh------------------~~QDALlDRL~ti~~D 183 (265) T TIGR02640 140 -RGE------------SRYVDVHPEFR-----VIFTSNPVEYAGVH------------------ETQDALLDRLVTISMD 183 (265) T ss_pred -CCC------------CCCEEECCCCC-----EEECCCCCCCCCCC------------------CHHHHHHHHHCCCCCC T ss_conf -877------------87225788702-----46314870105767------------------7166776644004578 Q ss_pred HHHHHHHHHHH---CCC-CHHHHHHHHHHHHHHHHHCCCCCCCCCHHHH-HHHHHHHHCCCCCCCCHHH Q ss_conf 87678798763---389-9899999999999999966999887488999-9999998669806679899 Q gi|254780617|r 397 YREMSSFSKFS---SDL-DSSTQKFLSKGERLTELLKQPQFSPLAMEEQ-VVMIFAGISGCLDEVAVSQ 460 (509) Q Consensus 397 y~Ele~f~~fg---sdl-D~~t~~~l~rg~ri~e~LkQ~~~~p~~~~~q-v~~l~a~~~G~ld~i~v~~ 460 (509) |.+.+.=..+. .+. +.....++.--+.++.-=...+-+++++.-. .+.-.|..++.=-+.+-+. T Consensus 184 ~~D~~~e~ai~~~~t~~~~~~a~~IV~lv~~~R~a~g~e~~~Gl~~RA~lMiA~~at~~dipv~~d~~~ 252 (265) T TIGR02640 184 YPDEDTETAILRAKTDVAEESAATIVRLVRELRLASGDEKTSGLSLRASLMIAEVATQEDIPVDVDDED 252 (265) T ss_pred CCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHH T ss_conf 544478999999860612467899999999984212553337436899999998754437986776025 No 302 >cd03237 ABC_RNaseL_inhibitor_domain2 The ATPase domain 2 of RNase L inhibitor. The ABC ATPase, RNase L inhibitor (RLI), is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI's are not transport proteins and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLI's have an N-terminal Fe-S domain and two nucleotide-binding domains which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity of more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology. Probab=90.10 E-value=0.19 Score=29.58 Aligned_cols=26 Identities=27% Similarity=0.373 Sum_probs=21.5 Q ss_pred CCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 115683365524778861158999999 Q gi|254780617|r 158 PIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 158 pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) ++-+|...+|+|+.|+|||||+ .+|. T Consensus 21 ti~~GEiv~liGpNGaGKSTLl-k~l~ 46 (246) T cd03237 21 SISESEVIGILGPNGIGKTTFI-KMLA 46 (246) T ss_pred CCCCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 4657989999979997699999-9997 No 303 >TIGR00750 lao LAO/AO transport system ATPase; InterPro: IPR005129 Bacterial periplasmic transport systems require the function of a specific substrate-binding protein, located in the periplasm, and several cytoplasmic membrane transport components. In Escherichia coli K-12, the arginine-ornithine transport system requires an arginine-ornithine-binding protein and the lysine-arginine-ornithine (LAO) transport system includes a LAO-binding protein. Both periplasmic proteins can be phosphorylated by a single kinase, ArgK resulting in reduced levels of transport activity of the periplasmic transport systems that include each of the binding proteins. The ArgK protein acts as an ATPase enzyme and as a kinase.. Probab=90.07 E-value=0.67 Score=25.59 Aligned_cols=27 Identities=22% Similarity=0.527 Sum_probs=22.4 Q ss_pred HHCCCC-CCCCCEEEEECCCCCCCHHHH Q ss_conf 310111-156833655247788611589 Q gi|254780617|r 153 IDSLIP-IGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 153 ID~l~p-igrGQR~~I~g~~g~GKt~l~ 179 (509) ++-++| +|+-|++||=|-||+||||++ T Consensus 28 l~~i~p~~GnA~~vG~TG~PGaGKSTl~ 55 (333) T TIGR00750 28 LERILPKTGNAHVVGITGVPGAGKSTLV 55 (333) T ss_pred HHHHHHHCCCCEEEEEECCCCCCHHHHH T ss_conf 9986243279078766468888577799 No 304 >PRK11247 ssuB aliphatic sulfonates transport ATP-binding subunit; Provisional Probab=90.06 E-value=0.19 Score=29.58 Aligned_cols=29 Identities=21% Similarity=0.331 Sum_probs=24.2 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+.+|...+|+|++|+||||++ -+|. T Consensus 31 Is~~I~~GEiv~LiG~nGaGKSTLl-r~i~ 59 (257) T PRK11247 31 LDLHIPAGQFVAVVGRSGCGKSTLL-RLLA 59 (257) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 0758879989999989988899999-9996 No 305 >PRK10982 galactose/methyl galaxtoside transporter ATP-binding protein; Provisional Probab=90.05 E-value=0.17 Score=29.89 Aligned_cols=11 Identities=9% Similarity=0.211 Sum_probs=5.3 Q ss_pred CCEEEEEECHH Q ss_conf 88479961647 Q gi|254780617|r 262 YHALIAYDDLQ 272 (509) Q Consensus 262 ~~VLi~~Ddlt 272 (509) +++++-+.+++ T Consensus 247 ~~~~l~~~~l~ 257 (491) T PRK10982 247 GEVILEVRNLT 257 (491) T ss_pred CCEEEEEECCC T ss_conf 76179995232 No 306 >cd03217 ABC_FeS_Assembly ABC-type transport system involved in Fe-S cluster assembly, ATPase component. Biosynthesis of iron-sulfur clusters (Fe-S) depends on multiprotein systems. The SUF system of E. coli and Erwinia chrysanthemi is important for Fe-S biogenesis under stressful conditions. The SUF system is made of six proteins: SufC is an atypical cytoplasmic ABC-ATPase, which forms a complex with SufB and SufD; SufA plays the role of a scaffold protein for assembly of iron-sulfur clusters and delivery to target proteins; SufS is a cysteine desulfurase which mobilizes the sulfur atom from cysteine and provides it to the cluster; SufE has no associated function yet. Probab=90.03 E-value=0.21 Score=29.22 Aligned_cols=34 Identities=18% Similarity=0.443 Sum_probs=26.0 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 02331-011115683365524778861158999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .+++| .=+.+-+|+..+|+|+.|.|||||+ .+|. T Consensus 13 ~~vL~~vsl~v~~Gei~~iiGpnGaGKSTLl-~~i~ 47 (200) T cd03217 13 KEILKGVNLTIKKGEVHALMGPNGSGKSTLA-KTIM 47 (200) T ss_pred EEEEECCEEEECCCCEEEEECCCCCCHHHHH-HHHC T ss_conf 9988550568879989999968999999999-9970 No 307 >PRK13643 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=90.00 E-value=0.19 Score=29.61 Aligned_cols=32 Identities=19% Similarity=0.395 Sum_probs=25.8 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 602331-01111568336552477886115899 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) |.++++ .=+.|-+|...+|+|..|+|||||+. T Consensus 18 ~~~aL~dIsl~I~~Ge~vaiiG~nGsGKSTLl~ 50 (288) T PRK13643 18 ASRALFDIDLEVKKGSYTALIGHTGSGKSTLLQ 50 (288) T ss_pred CCEEEEEEEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 273366336798599899999999947999999 No 308 >PRK04841 transcriptional regulator MalT; Provisional Probab=89.99 E-value=1.5 Score=23.11 Aligned_cols=129 Identities=17% Similarity=0.152 Sum_probs=65.7 Q ss_pred CCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHH Q ss_conf 77311266001444160233101111568336552477886115899999998863024667876069999516776789 Q gi|254780617|r 134 PGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSV 213 (509) Q Consensus 134 p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev 213 (509) ++.+.|....+-|+- ...+.=..|-|++|-||||++.+-.. .... |....+.+..++. T Consensus 14 ~~~v~R~rL~~~L~~----------~~~~~l~lv~APaG~GKTtl~a~w~~----------~~~~--~~Wlsld~~d~~~ 71 (903) T PRK04841 14 DHTVVRERLLAKLSG----------ANNYRLVLVRSPAGYGKTTLASQWAA----------GKNN--VGWYSLDESDNQP 71 (903) T ss_pred CCCCCCHHHHHHHHH----------HCCCCEEEEECCCCCCHHHHHHHHHH----------CCCC--EEEEECCCCCCCH T ss_conf 782554779999985----------15899799977999889999999983----------4998--5998588544899 Q ss_pred HHHHHHHCCC------CCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHH--HHHHHHHHHHC Q ss_conf 9999851146------742115899728878889999988766666675420468847996164789--99998754530 Q gi|254780617|r 214 ARFVKALEDR------GALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKH--AVAYRQLSLLL 285 (509) Q Consensus 214 ~~~~~~l~~~------~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~--A~a~reisl~~ 285 (509) ..|...+... +.....-......+.+......+ .+..-..+.+.++.+|+||+-.. ....+.+.-++ T Consensus 72 ~~f~~~l~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~l~~~l~~~~~~~~lvlDD~h~i~~~~~~~~l~~Ll 146 (903) T PRK04841 72 ERFASYLIAALQQATNGHCSKSEALAEKRQYASLTSLFA-----QLFIELSDWHQPLYLVLDDYHLITNPEIHEAMRFFL 146 (903) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCHHHHHH-----HHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHH T ss_conf 999999999999856543311231135566554999999-----999997558998699986836488868999999999 Q ss_pred CCCC Q ss_conf 3787 Q gi|254780617|r 286 RRPP 289 (509) Q Consensus 286 ~~pp 289 (509) ...| T Consensus 147 ~~~p 150 (903) T PRK04841 147 RHQP 150 (903) T ss_pred HHCC T ss_conf 8389 No 309 >COG0488 Uup ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only] Probab=89.95 E-value=0.26 Score=28.63 Aligned_cols=32 Identities=16% Similarity=0.453 Sum_probs=25.7 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 101111568336552477886115899999998 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) |.-+.+..|+|+|++|.-|+|||||+ -+|..+ T Consensus 21 ~~~l~~~~G~riGLvG~NGaGKSTLL-kilaG~ 52 (530) T COG0488 21 NVSLTLNPGERIGLVGRNGAGKSTLL-KILAGE 52 (530) T ss_pred CCCEEECCCCEEEEECCCCCCHHHHH-HHHHCC T ss_conf 78407689988999899989889999-998299 No 310 >COG4608 AppF ABC-type oligopeptide transport system, ATPase component [Amino acid transport and metabolism] Probab=89.94 E-value=0.17 Score=29.88 Aligned_cols=31 Identities=26% Similarity=0.445 Sum_probs=27.1 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 02331-01111568336552477886115899 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) ++|+| .=+.|-+|.-.+|+|.+|+||||++- T Consensus 26 v~avd~Vsf~i~~ge~~glVGESG~GKSTlgr 57 (268) T COG4608 26 VKAVDGVSFSIKEGETLGLVGESGCGKSTLGR 57 (268) T ss_pred EEEECCEEEEECCCCEEEEEECCCCCHHHHHH T ss_conf 48751156897589878999368887787999 No 311 >cd03220 ABC_KpsT_Wzt ABC_KpsT_Wzt The KpsT/Wzt ABC transporter subfamily is involved in extracellular polysaccharide export. Among the variety of membrane-linked or extracellular polysaccharides excreted by bacteria, only capsular polysaccharides, lipopolysaccharides, and teichoic acids have been shown to be exported by ABC transporters. A typical system is made of a conserved integral membrane and an ABC. In addition to these proteins, capsular polysaccharide exporter systems require two 'accessory' proteins to perform their function: a periplasmic (E.coli) or a lipid-anchored outer membrane protein called OMA (Neisseria meningitidis and Haemophilus influenzae) and a cytoplasmic membrane protein MPA2. Probab=89.91 E-value=0.21 Score=29.26 Aligned_cols=35 Identities=23% Similarity=0.405 Sum_probs=27.7 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 602331-011115683365524778861158999999 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .+.|+| .=+.+.+|+..+|+|..|+|||||+ .+|. T Consensus 34 ~~~AL~~isf~i~~GeivgilG~NGaGKSTLl-~~i~ 69 (224) T cd03220 34 EFWALKDVSFEVPRGERIGLIGRNGAGKSTLL-RLLA 69 (224) T ss_pred CEEEECCEEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 96987670789838989999979998199999-9997 No 312 >PRK03695 vitamin B12-transporter ATPase; Provisional Probab=89.87 E-value=0.21 Score=29.27 Aligned_cols=30 Identities=13% Similarity=0.254 Sum_probs=24.9 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.+.+|+..+|+|+.|+||||++ .+|. T Consensus 15 ~isl~v~~Ge~v~iiGpNGaGKSTLl-k~i~ 44 (245) T PRK03695 15 PLSGEVRAGEILHLVGPNGAGKSTLL-ARMA 44 (245) T ss_pred CEEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 74899959989999978994199999-9984 No 313 >cd03221 ABCF_EF-3 ABCF_EF-3 Elongation factor 3 (EF-3) is a cytosolic protein required by fungal ribosomes for in vitro protein synthesis and for in vivo growth. EF-3 stimulates the binding of the EF-1: GTP: aa-tRNA ternary complex to the ribosomal A site by facilitated release of the deacylated tRNA from the E site. The reaction requires ATP hydrolysis. EF-3 contains two ATP nucleotide binding sequence (NBS) motifs. NBSI is sufficient for the intrinsic ATPase activity. NBSII is essential for the ribosome-stimulated functions. Probab=89.84 E-value=0.22 Score=29.08 Aligned_cols=29 Identities=21% Similarity=0.452 Sum_probs=23.5 Q ss_pred CCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 111156833655247788611589999999 Q gi|254780617|r 156 LIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 156 l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) =+.+-+|+..+|+|+.|+||||++ .+|.. T Consensus 20 s~~i~~ge~~~l~G~NGsGKTTl~-~~l~G 48 (144) T cd03221 20 SLTINPGDRIGLVGRNGAGKSTLL-KLIAG 48 (144) T ss_pred EEEECCCCEEEEECCCCCCHHHHH-HHHHC T ss_conf 899879999999989998499999-99848 No 314 >PRK09580 sufC cysteine desulfurase ATPase component; Reviewed Probab=89.83 E-value=0.22 Score=29.05 Aligned_cols=27 Identities=22% Similarity=0.481 Sum_probs=22.9 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1115683365524778861158999999 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +.+-+|+..+|+|.+|+|||||+ .+|. T Consensus 22 l~i~~Gei~~iiG~nGaGKSTLl-~~i~ 48 (248) T PRK09580 22 LEVRPGEVHAIMGPNGSGKSTLS-ATLA 48 (248) T ss_pred EEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 89849979999999999999999-9983 No 315 >PRK10851 sulfate/thiosulfate transporter subunit; Provisional Probab=89.80 E-value=0.21 Score=29.26 Aligned_cols=33 Identities=21% Similarity=0.420 Sum_probs=23.6 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 2331-011115683365524778861158999999 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +++| .=+.|-+|+-..|+|++|+|||||+ .+|. T Consensus 16 ~al~~vsl~i~~GE~~~llGpSGsGKSTLl-r~ia 49 (352) T PRK10851 16 QVLNDISLDIPSGQMVALLGPSGSGKTTLL-RIIA 49 (352) T ss_pred EEEECCEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 999063769999989999999984699999-9997 No 316 >TIGR02142 modC_ABC molybdate ABC transporter, ATP-binding protein; InterPro: IPR011868 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents the ATP-binding cassette (ABC) protein of the three subunit molybdate ABC transporter. The three proteins of this complex are homologous to proteins of the sulphate ABC transporter. Molybdenum may be used in nitrogenases of nitrogen-fixing bacteria and in molybdopterin cofactors. In some cases, molybdate may be transported by a sulphate transporter rather than by a specific molybdate transporter.; GO: 0005524 ATP binding, 0015098 molybdate ion transmembrane transporter activity, 0015689 molybdate ion transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=89.79 E-value=0.18 Score=29.74 Aligned_cols=29 Identities=21% Similarity=0.322 Sum_probs=20.6 Q ss_pred HHHHCCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 23310111156833655247788611589 Q gi|254780617|r 151 KAIDSLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 151 ~~ID~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) -..|.=+++---....|||.||+|||||. T Consensus 12 f~Ld~~~~~pg~GvtAlFG~SGsGKTtli 40 (361) T TIGR02142 12 FELDVDLTLPGQGVTALFGRSGSGKTTLI 40 (361) T ss_pred EEEEEEEECCCCEEEEEECCCCCHHHHHH T ss_conf 47777653287406871258997078999 No 317 >PRK11000 maltose/maltodextrin transporter ATP-binding protein; Provisional Probab=89.78 E-value=0.21 Score=29.28 Aligned_cols=33 Identities=15% Similarity=0.318 Sum_probs=23.8 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ..++| .=+.|-+|+-..|+|++|+||||++ .+| T Consensus 16 ~~al~~vsl~i~~Ge~~~llGpsG~GKSTll-r~i 49 (369) T PRK11000 16 VVISKDINLDIHEGEFVVFVGPSGCGKSTLL-RMI 49 (369) T ss_pred EEEECCCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 9998643889879989999999973699999-999 No 318 >PRK09984 phosphonate/organophosphate ester transporter subunit; Provisional Probab=89.78 E-value=0.22 Score=29.10 Aligned_cols=28 Identities=18% Similarity=0.419 Sum_probs=23.7 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 01111568336552477886115899999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .=+.+.+|...+|+|+.|.|||||+ .+| T Consensus 23 isl~i~~GE~~~iiGpNGaGKSTLl-k~i 50 (262) T PRK09984 23 VDLNIHHGEMVALLGPSGSGKSTLL-RHL 50 (262) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 2789879989999989996099999-999 No 319 >PRK13547 hmuV hemin importer ATP-binding subunit; Provisional Probab=89.78 E-value=0.21 Score=29.21 Aligned_cols=29 Identities=17% Similarity=0.417 Sum_probs=24.2 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+.+|+..+|+|.+|+|||||+ .+|. T Consensus 20 vsl~i~~Ge~~aliG~nGaGKSTLl-~~i~ 48 (273) T PRK13547 20 LSLRIEPGRVTALLGRNGAGKSTLL-KVLA 48 (273) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 0889989989999999997699999-9995 No 320 >PRK13645 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=89.76 E-value=0.21 Score=29.24 Aligned_cols=32 Identities=22% Similarity=0.476 Sum_probs=26.7 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 602331-01111568336552477886115899 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) .++|+| .=+.|.+|...+|+|.+|+|||||+. T Consensus 23 ~~~AL~~vsl~i~~Ge~~aIiG~nGsGKSTL~~ 55 (289) T PRK13645 23 EFKALNNTSLTFKKNKVTCVIGTTGSGKSTMIQ 55 (289) T ss_pred CCEEEECCEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 365865328898899899999999957999999 No 321 >COG4178 ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only] Probab=89.73 E-value=0.23 Score=28.94 Aligned_cols=20 Identities=45% Similarity=0.745 Sum_probs=8.8 Q ss_pred CCCCCEEEEECCCCCCCHHH Q ss_conf 15683365524778861158 Q gi|254780617|r 159 IGRGQRELIIGDRKTGKTSI 178 (509) Q Consensus 159 igrGQR~~I~g~~g~GKt~l 178 (509) +..|.+.+|.|.+|+|||+| T Consensus 416 v~~G~~llI~G~SG~GKTsL 435 (604) T COG4178 416 VRPGERLLITGESGAGKTSL 435 (604) T ss_pred ECCCCEEEEECCCCCCHHHH T ss_conf 47998799878999878899 No 322 >cd04120 Rab12 Rab12 subfamily. Rab12 was first identified in canine cells, where it was localized to the Golgi complex. The specific function of Rab12 remains unknown, and inconsistent results about its cellular localization have been reported. More recent studies have identified Rab12 associated with post-Golgi vesicles, or with other small vesicle-like structures but not with the Golgi complex. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic Probab=89.71 E-value=0.64 Score=25.77 Aligned_cols=20 Identities=40% Similarity=0.567 Sum_probs=16.5 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 2 KIvllGd~gVGKTsLi~rf~ 21 (202) T cd04120 2 QVIIIGSRGVGKTSLMRRFT 21 (202) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 89999979972999999995 No 323 >TIGR03263 guanyl_kin guanylate kinase. Members of this family are the enzyme guanylate kinase, also called GMP kinase. This enzyme transfers a phosphate from ATP to GMP, yielding ADP and GDP. Probab=89.71 E-value=1.4 Score=23.26 Aligned_cols=85 Identities=19% Similarity=0.260 Sum_probs=41.9 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEC------CCCH------HHHHHHHHHHCCCCCCCEE Q ss_conf 833655247788611589999999886302466787606999951------6776------7899999851146742115 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAI------GQKR------SSVARFVKALEDRGALSYS 229 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~I------Ger~------~ev~~~~~~l~~~~~~~~t 229 (509) |+=+-|.|++|+||++|+- .++.+- ..+..+.-.+ ||.. -.-.+|-+......-+++. T Consensus 1 G~livl~GpsG~GK~tl~~-~l~~~~---------~~~~~~vs~TTR~~R~gE~dG~dY~Fvs~~~F~~~i~~~~FlE~~ 70 (180) T TIGR03263 1 GLLIVISGPSGVGKSTLVK-ALLEED---------PNLKFSISATTRKPRPGEVDGVDYFFVSKEEFEEMIAAGEFLEWA 70 (180) T ss_pred CEEEEEECCCCCCHHHHHH-HHHHHC---------CCCEEEEECCCCCCCCCCCCCCCEEEECHHHHHHHHHCCEEEEEE T ss_conf 9399998999889999999-999768---------994488704468979987788734785089999998649648876 Q ss_pred EEEEECCCCCHHHHHHHHHHHHHHHH--HHHCCCCCEEEEEE Q ss_conf 89972887888999998876666667--54204688479961 Q gi|254780617|r 230 IVVVASASDPAPMQLLAPFAGCAMGE--YFRDNGYHALIAYD 269 (509) Q Consensus 230 vvv~a~a~~~~~~r~~ap~~a~aiAE--yfr~~G~~VLi~~D 269 (509) .+- .-|.|++..+ -..++|+.|++-+| T Consensus 71 ~~~-------------g~~YGt~~~~i~~~~~~gk~vil~id 99 (180) T TIGR03263 71 EVH-------------GNYYGTPKSPVEEALAAGKDVLLEID 99 (180) T ss_pred EEC-------------CCCCCCCHHHHHHHHHHCCCEEEECC T ss_conf 771-------------67635747999999960998999878 No 324 >PRK10253 iron-enterobactin transporter ATP-binding protein; Provisional Probab=89.69 E-value=0.2 Score=29.49 Aligned_cols=35 Identities=23% Similarity=0.376 Sum_probs=26.0 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 02331-0111156833655247788611589999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) ..++| .=+.|-+|+-.+|+|+.|+|||||+ .+|.. T Consensus 20 ~~~l~~isl~I~~Ge~~~iiGpNGaGKSTLl-k~i~G 55 (265) T PRK10253 20 YTVAENLTVEIPDGHFTAIIGPNGCGKSTLL-RTLSR 55 (265) T ss_pred EEEEEEEEEEECCCCEEEEECCCCCHHHHHH-HHHHC T ss_conf 9998402889859979999999883999999-99974 No 325 >PRK13639 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=89.68 E-value=0.22 Score=29.11 Aligned_cols=32 Identities=25% Similarity=0.420 Sum_probs=25.4 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 602331-01111568336552477886115899 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) |..++| .=+.|-+|...+|+|..|+||||++. T Consensus 14 ~~~al~~vsl~I~~Ge~vaiiG~nGsGKSTLl~ 46 (275) T PRK13639 14 GTVALKGINFKAEEGEMIAILGPNGAGKSTLFL 46 (275) T ss_pred CCEEEECCEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 989998648899899899999999964999999 No 326 >PRK13539 cytochrome c biogenesis protein CcmA; Provisional Probab=89.68 E-value=0.23 Score=28.93 Aligned_cols=31 Identities=16% Similarity=0.351 Sum_probs=25.0 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 01111568336552477886115899999998 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) .=+.+.+|+..+|+|..|+|||||+ .+|... T Consensus 21 vsf~i~~Gei~~l~G~NGaGKTTLl-k~i~Gl 51 (206) T PRK13539 21 LSFTLAAGEALVLTGPNGSGKTTLL-RLLAGL 51 (206) T ss_pred CEEEECCCCEEEEECCCCCCHHHHH-HHHHCC T ss_conf 0789869949999899999899999-999588 No 327 >PRK10261 glutathione transporter ATP-binding protein; Provisional Probab=89.64 E-value=0.29 Score=28.28 Aligned_cols=31 Identities=29% Similarity=0.497 Sum_probs=22.6 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 02331-01111568336552477886115899 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) ++++| .=+.+.+|+..+|+|.+|+|||+++. T Consensus 29 v~al~~VSf~v~~GE~vaLvGeSGSGKSTl~~ 60 (623) T PRK10261 29 IAAVRNLSFSLQRGETLAIVGESGSGKSVTAL 60 (623) T ss_pred EEEEECCEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 89885617698899899999899978999999 No 328 >PRK11607 potG putrescine transporter ATP-binding subunit; Provisional Probab=89.59 E-value=0.22 Score=29.19 Aligned_cols=46 Identities=17% Similarity=0.415 Sum_probs=30.7 Q ss_pred CCCCCCHHHHHCCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 112660014441602331-011115683365524778861158999999 Q gi|254780617|r 137 IQRQSVCEPLSTGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 137 ~~R~~i~~~l~TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) ++-+-++.-+. |..++| .=+.|.+|.-..|+|++|+||||++ .+|. T Consensus 20 iev~nlsk~y~-~~~al~~vsl~I~~GE~~~llGpsGsGKSTll-r~i~ 66 (377) T PRK11607 20 LEIRNLTKSFD-GQHAVDDVSLTIYKGEIFALLGASGCGKSTLL-RMLA 66 (377) T ss_pred EEEEEEEEEEC-CEEEECCCEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 99985899999-99999051879999989999999984899999-9997 No 329 >cd00267 ABC_ATPase ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=89.57 E-value=0.25 Score=28.78 Aligned_cols=30 Identities=13% Similarity=0.391 Sum_probs=24.2 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 0111156833655247788611589999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .=+.+.+|+..+|+|+.|+|||||+ .+|.. T Consensus 18 i~~~i~~Ge~~~i~G~nGaGKSTLl-~~l~g 47 (157) T cd00267 18 VSLTLKAGEIVALVGPNGSGKSTLL-RAIAG 47 (157) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHHC T ss_conf 1789879979999878899989999-99958 No 330 >cd03297 ABC_ModC_molybdenum_transporter ModC is an ABC-type transporter and the ATPase component of a molybdate transport system that also includes the periplasmic binding protein ModA and the membrane protein ModB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=89.57 E-value=0.21 Score=29.20 Aligned_cols=28 Identities=14% Similarity=0.260 Sum_probs=22.1 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 101111568336552477886115899999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |.=+.+ .|.-.+|+|++|+||||++ .+| T Consensus 16 ~i~f~i-~ge~~~iiGpSGsGKSTll-~~i 43 (214) T cd03297 16 KIDFDL-NEEVTGIFGASGAGKSTLL-RCI 43 (214) T ss_pred EEEEEC-CCCEEEEECCCCCHHHHHH-HHH T ss_conf 999862-9979999999973599999-999 No 331 >pfam00580 UvrD-helicase UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Bacillus subtilis addA and Escherichia coli exodeoxyribonuclease V beta have large insertions near to the carboxy-terminus relative to other members of the family. Probab=89.52 E-value=0.88 Score=24.75 Aligned_cols=28 Identities=21% Similarity=0.273 Sum_probs=20.8 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 6833655247788611589999999886 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKS 188 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~ 188 (509) .++..+|.|++|+|||+.++.-|..... T Consensus 12 ~~~~llV~AgAGSGKT~~L~~Ri~~li~ 39 (494) T pfam00580 12 LGGPLLVLAGAGTGKTRVLTERIAYLIL 39 (494) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHHH T ss_conf 9999799971870689999999999998 No 332 >cd01895 EngA2 EngA2 subfamily. This CD represents the second GTPase domain of EngA and its orthologs, which are composed of two adjacent GTPase domains. Since the sequences of the two domains are more similar to each other than to other GTPases, it is likely that an ancient gene duplication, rather than a fusion of evolutionarily distinct GTPases, gave rise to this family. Although the exact function of these proteins has not been elucidated, studies have revealed that the E. coli EngA homolog, Der, and Neisseria gonorrhoeae EngA are essential for cell viability. A recent report suggests that E. coli Der functions in ribosome assembly and stability. Probab=89.50 E-value=1.1 Score=23.94 Aligned_cols=23 Identities=35% Similarity=0.611 Sum_probs=18.5 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 336552477886115899999998 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~~I~nq 186 (509) =|++|+|-+.+||||| +..+.++ T Consensus 3 ~~V~ivG~pN~GKSTL-~N~l~g~ 25 (174) T cd01895 3 IRIAIIGRPNVGKSSL-VNALLGE 25 (174) T ss_pred CEEEEECCCCCCHHHH-HHHHHCC T ss_conf 8999998999989999-9998389 No 333 >COG0488 Uup ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only] Probab=89.46 E-value=0.26 Score=28.59 Aligned_cols=31 Identities=10% Similarity=0.269 Sum_probs=15.3 Q ss_pred CCCCHHHHHHHHHHHHHHH-----HHHHHHHHHHHH Q ss_conf 6679899999999999999-----741599999887 Q gi|254780617|r 454 DEVAVSQVRKFETNFLSHM-----RVSSQDILEDIR 484 (509) Q Consensus 454 d~i~v~~I~~fe~~l~~~l-----~~~~~ei~~~I~ 484 (509) +.++++.+...++.|.+|- -++...+++.+. T Consensus 468 NhLDi~s~eaLe~aL~~f~Gtvl~VSHDr~Fl~~va 503 (530) T COG0488 468 NHLDIESLEALEEALLDFEGTVLLVSHDRYFLDRVA 503 (530) T ss_pred CCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHC T ss_conf 767988999999999858986999948999998333 No 334 >cd03236 ABC_RNaseL_inhibitor_domain1 The ATPase domain 1 of RNase L inhibitor. The ABC ATPase, RNase L inhibitor (RLI), is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI s are not transport proteins and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLIs have an N-terminal Fe-S domain and two nucleotide binding domains which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology. Probab=89.44 E-value=0.25 Score=28.78 Aligned_cols=24 Identities=21% Similarity=0.354 Sum_probs=19.9 Q ss_pred CCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 5683365524778861158999999 Q gi|254780617|r 160 GRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 160 grGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) -+|+-.+|+|++|+|||||+ .+|. T Consensus 24 k~GEi~gLiGpNGaGKSTLl-k~i~ 47 (255) T cd03236 24 REGQVLGLVGPNGIGKSTAL-KILA 47 (255) T ss_pred CCCEEEEEECCCCCCHHHHH-HHHH T ss_conf 89809999899997099999-9996 No 335 >cd03224 ABC_TM1139_LivF_branched LivF (TM1139) is part of the LIV-I bacterial ABC-type two-component transport system that imports neutral, branched-chain amino acids. The E. coli branched-chain amino acid transporter comprises a heterodimer of ABC transporters (LivF and LivG), a heterodimer of six-helix TM domains (LivM and LivH), and one of two alternative soluble periplasmic substrate binding proteins (LivK or LivJ). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. Probab=89.34 E-value=0.26 Score=28.58 Aligned_cols=34 Identities=18% Similarity=0.389 Sum_probs=26.4 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 02331-011115683365524778861158999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .+++| .=+.+-+|+-.+|+|+.|.||||++ .+|. T Consensus 13 ~~~L~~vs~~v~~Gei~~liG~nGaGKSTLl-~~i~ 47 (222) T cd03224 13 SQILFGVSLTVPEGEIVALLGRNGAGKTTLL-KTIM 47 (222) T ss_pred EEEECCEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 9998140899889989999999998599999-9997 No 336 >PRK00349 uvrA excinuclease ABC subunit A; Reviewed Probab=89.33 E-value=0.36 Score=27.61 Aligned_cols=29 Identities=24% Similarity=0.450 Sum_probs=14.0 Q ss_pred CCCCHHHHHHHHHHHCCCCCCCEEEEEEE Q ss_conf 16776789999985114674211589972 Q gi|254780617|r 206 IGQKRSSVARFVKALEDRGALSYSIVVVA 234 (509) Q Consensus 206 IGer~~ev~~~~~~l~~~~~~~~tvvv~a 234 (509) ||...++...+++.|++--.+-+||+|+- T Consensus 520 iGLHprD~~rLi~~L~~Lrd~GNTVlVVE 548 (944) T PRK00349 520 IGLHQRDNDRLIETLKRLRDLGNTLIVVE 548 (944) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 54798899999999999985798599983 No 337 >COG1127 Ttg2A ABC-type transport system involved in resistance to organic solvents, ATPase component [Secondary metabolites biosynthesis, transport, and catabolism] Probab=89.32 E-value=0.32 Score=27.93 Aligned_cols=32 Identities=22% Similarity=0.391 Sum_probs=25.0 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 011115683365524778861158999999988 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) .=+.+-||.=.+|+|+||+|||+|+ -.|.-|. T Consensus 27 v~l~V~~Gei~~iiGgSGsGKStlL-r~I~Gll 58 (263) T COG1127 27 VDLDVPRGEILAILGGSGSGKSTLL-RLILGLL 58 (263) T ss_pred CEEEECCCCEEEEECCCCCCHHHHH-HHHHCCC T ss_conf 3135507818999889886899999-9985657 No 338 >COG3842 PotA ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism] Probab=89.32 E-value=0.26 Score=28.55 Aligned_cols=45 Identities=20% Similarity=0.433 Sum_probs=31.1 Q ss_pred CCCCCCHHHHHCCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 112660014441602331-01111568336552477886115899999 Q gi|254780617|r 137 IQRQSVCEPLSTGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 137 ~~R~~i~~~l~TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ++-+-++..+. +..++| .=+.|.+|--.-++|+|||||||++ -+| T Consensus 6 l~~~~v~k~yg-~~~al~~isl~i~~Gef~tlLGPSGcGKTTlL-R~I 51 (352) T COG3842 6 LEIRNVSKSFG-DFTAVDDISLDIKKGEFVTLLGPSGCGKTTLL-RMI 51 (352) T ss_pred EEEEEEEEECC-CEEEEECCEEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 89974566628-72677321445448868999899888889999-999 No 339 >KOG0651 consensus Probab=89.19 E-value=0.53 Score=26.33 Aligned_cols=136 Identities=20% Similarity=0.181 Sum_probs=59.7 Q ss_pred EEEEEECCCCCCCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCCC--CCCCCCCCCHHCCCCCCCCCC--CCCHHH- Q ss_conf 999980586686789888977987886679469830003002314466--444544410010135773112--660014- Q gi|254780617|r 71 VGVVILGSYKEISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDGK--GPIKCEQRSCTEADAPGIIQR--QSVCEP- 145 (509) Q Consensus 71 v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg~--g~i~~~~~~~i~~~~p~~~~R--~~i~~~- 145 (509) ++|---.+...|+.|++|... ...++.+. .|+|-+|-.=+ .-.. +.+..+.-.-+.-.+|..++- -|+..| T Consensus 81 vg~~~~~D~~~i~~G~rv~ld--ittltIm~-~lprevd~vy~-m~~e~~~~~s~~~~ggl~~qirelre~ielpl~np~ 156 (388) T KOG0651 81 VGCRRSVDKEKIARGTRVVLD--ITTLTIMR-GLPREVDLVYN-MSHEDPRNISFENVGGLFYQIRELREVIELPLTNPE 156 (388) T ss_pred EECCCCCCHHHHCCCCEEEEE--EEEEEHHC-CCCHHHHHHHH-HHHCCCCCCCHHHHCCHHHHHHHHHHHEEEECCCCH T ss_conf 872554345551167413665--34322230-56447788877-541575546787717838888998865574024810 Q ss_pred --HHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC Q ss_conf --44160233101111568336552477886115899999998863024667876069999516776789999985114 Q gi|254780617|r 146 --LSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED 222 (509) Q Consensus 146 --l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~ 222 (509) +..||+. |. =.||.|++|+|||-+ ...|..|-+.|-..-.+... |=.-|||.+|=+++.+....+ T Consensus 157 lf~rvgIk~-----Pk----g~ll~GppGtGKTll-a~~Vaa~mg~nfl~v~ss~l--v~kyiGEsaRlIRemf~yA~~ 223 (388) T KOG0651 157 LFLRVGIKP-----PK----GLLLYGPPGTGKTLL-ARAVAATMGVNFLKVVSSAL--VDKYIGESARLIRDMFRYARE 223 (388) T ss_pred HCCCCCCCC-----CC----EEEEECCCCCCHHHH-HHHHHHHCCCCEEEEEHHHH--HHHHCCCHHHHHHHHHHHHHH T ss_conf 023457778-----82----568767999864599-99999865985477447666--330026578899999977865 No 340 >TIGR00929 VirB4_CagE type IV secretion/conjugal transfer ATPase, VirB4 family; InterPro: IPR004346 This family includes the Helicobacter pylori protein CagE (see examples), which together with other proteins from the cag pathogenicity island (PAI), encodes a type IV transporter secretion system. The precise role of CagE is not known, but studies in animal models have shown that it is essential for pathogenesis in Helicobacter pylori induced gastritis and peptic ulceration . Indeed, the expression of the cag PAI has been shown to be essential for stimulating human gastric epithelial cell apoptosis in vitro . Similar type IV transport systems are also found in other bacteria. This family includes proteins from the trb and Vir conjugal transfer systems in Agrobacterium tumefaciens and homologues of VirB proteins from other species.; GO: 0005524 ATP binding. Probab=89.16 E-value=0.32 Score=27.91 Aligned_cols=52 Identities=27% Similarity=0.425 Sum_probs=34.8 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC Q ss_conf 36552477886115899999998863024667876069999516776789999985114 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED 222 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~ 222 (509) -..|||+.|.||||| +..+++|..+...... .-|++. -|.+-..-|++.+.. T Consensus 518 hT~IfG~~G~GKTtL-l~fL~a~~~ky~~~~a----~~~~~f--Dkd~g~~~~~~a~gG 569 (931) T TIGR00929 518 HTLIFGPTGSGKTTL-LNFLLAQLQKYKPNFA----LTIFAF--DKDRGMEIFIRALGG 569 (931) T ss_pred CEEEECCCCCCHHHH-HHHHHHHHHCCCCCCC----EEEEEE--ECCCCCHHHHEEECC T ss_conf 777888889846999-9999999742488987----069998--878982104111174 No 341 >cd01918 HprK_C HprK/P, the bifunctional histidine-containing protein kinase/phosphatase, controls the phosphorylation state of the phosphocarrier protein HPr and regulates the utilization of carbon sources by gram-positive bacteria. It catalyzes both the ATP-dependent phosphorylation of Ser-46 of HPr and its dephosphorylation by phosphorolysis. The latter reaction uses inorganic phosphate as substrate and produces pyrophosphate. Phosphoenolpyruvate carboxykinase (PEPCK) and the C-terminal catalytic domain of HprK/P are structurally similar with conserved active site residues suggesting these two phosphotransferases have related functions. The HprK/P N-terminal domain is structurally similar to the N-terminal domains of the MurE and MurF amino acid ligases. Probab=89.15 E-value=0.57 Score=26.10 Aligned_cols=25 Identities=28% Similarity=0.405 Sum_probs=22.0 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 6833655247788611589999999 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .|.=.+|.|++|.|||+++++.|.. T Consensus 13 ~g~GvLi~G~SG~GKS~lal~Li~r 37 (149) T cd01918 13 GGIGVLITGPSGIGKSELALELIKR 37 (149) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHC T ss_conf 9989999878999989999999981 No 342 >PRK11153 metN DL-methionine transporter ATP-binding subunit; Provisional Probab=89.11 E-value=0.26 Score=28.62 Aligned_cols=33 Identities=27% Similarity=0.443 Sum_probs=26.2 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ++|+| .=+.|.+|+-.+|+|.+|+|||||+ .+| T Consensus 18 v~AL~~Vsl~I~~Gei~giIG~SGaGKSTLl-r~i 51 (343) T PRK11153 18 IQALNNVSLHVPAGQIYGVIGASGAGKSTLI-RCV 51 (343) T ss_pred EEEEECEEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 9999661889989989999999998699999-999 No 343 >COG3839 MalK ABC-type sugar transport systems, ATPase components [Carbohydrate transport and metabolism] Probab=89.11 E-value=0.36 Score=27.54 Aligned_cols=28 Identities=18% Similarity=0.408 Sum_probs=21.5 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 01111568336552477886115899999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .=+.|-.|.-.-|+|++||||||++ .+| T Consensus 22 i~l~i~~Gef~vllGPSGcGKSTlL-r~I 49 (338) T COG3839 22 VNLDIEDGEFVVLLGPSGCGKSTLL-RMI 49 (338) T ss_pred CEEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 2689747979999899988889999-999 No 344 >COG4619 ABC-type uncharacterized transport system, ATPase component [General function prediction only] Probab=89.10 E-value=0.35 Score=27.66 Aligned_cols=36 Identities=17% Similarity=0.301 Sum_probs=25.9 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1602331-011115683365524778861158999999 Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) ++-+.+| .-+.+-+|.++.|.|++|+|||+++ -++. T Consensus 14 ~~a~il~~isl~v~~Ge~iaitGPSG~GKStll-k~va 50 (223) T COG4619 14 GDAKILNNISLSVRAGEFIAITGPSGCGKSTLL-KIVA 50 (223) T ss_pred CCCEEECCEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 797443241366538854887678876688999-9998 No 345 >PRK11300 livG leucine/isoleucine/valine transporter ATP-binding subunit; Provisional Probab=89.08 E-value=0.27 Score=28.44 Aligned_cols=36 Identities=19% Similarity=0.392 Sum_probs=28.2 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1602331-011115683365524778861158999999 Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) -|..++| .=+.+.+|+..+|+|+.|.|||||+ .+|. T Consensus 16 g~~~al~~vs~~v~~Gei~~liGpnGaGKSTL~-~~i~ 52 (255) T PRK11300 16 GGLLAVNNVNLEVREQEVVSLIGPNGAGKTTVF-NCLT 52 (255) T ss_pred CCEEEECCEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 999988040889899979999989996499999-9996 No 346 >COG1132 MdlB ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms] Probab=89.06 E-value=0.27 Score=28.44 Aligned_cols=29 Identities=28% Similarity=0.526 Sum_probs=22.6 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+-+|++.+|.|.+|+||||++ ..+. T Consensus 348 is~~i~~Ge~vaiVG~sGsGKSTl~-~LL~ 376 (567) T COG1132 348 ISFSIEPGEKVAIVGPSGSGKSTLI-KLLL 376 (567) T ss_pred CEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 2277548987888558888578999-9998 No 347 >cd01862 Rab7 Rab7 subfamily. Rab7 is a small Rab GTPase that regulates vesicular traffic from early to late endosomal stages of the endocytic pathway. The yeast Ypt7 and mammalian Rab7 are both involved in transport to the vacuole/lysosome, whereas Ypt7 is also required for homotypic vacuole fusion. Mammalian Rab7 is an essential participant in the autophagic pathway for sequestration and targeting of cytoplasmic components to the lytic compartment. Mammalian Rab7 is also proposed to function as a tumor suppressor. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C- Probab=89.06 E-value=1.7 Score=22.63 Aligned_cols=20 Identities=35% Similarity=0.606 Sum_probs=16.7 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 2 KivlvGd~~VGKTsli~r~~ 21 (172) T cd01862 2 KVIILGDSGVGKTSLMNQYV 21 (172) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 89999989978999999995 No 348 >PRK13548 hmuV hemin importer ATP-binding subunit; Provisional Probab=89.05 E-value=0.28 Score=28.34 Aligned_cols=34 Identities=18% Similarity=0.399 Sum_probs=26.0 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 02331-011115683365524778861158999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) ..++| .=+.|.+|+..+|+|+.|+|||||+ .+|. T Consensus 15 ~~~L~~vsl~i~~Gei~~liGpNGaGKSTLl-k~i~ 49 (257) T PRK13548 15 KTVLDDVSLTLRPGEVVAILGPNGAGKSTLL-RALS 49 (257) T ss_pred EEEECCEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 9988033789869989999999998799999-9985 No 349 >TIGR00954 3a01203 Peroxysomal long chain fatty acyl transporter; InterPro: IPR005283 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). The members of this family are integral membrane proteins and they are involved in the import of activated long-chain fatty acids from the cytosol to the peroxisomal matrix. ; GO: 0005524 ATP binding, 0006810 transport, 0016021 integral to membrane. Probab=89.00 E-value=0.28 Score=28.41 Aligned_cols=36 Identities=33% Similarity=0.495 Sum_probs=28.1 Q ss_pred HHHHCCHHHHHCCCCC--CCCCEEEEECCCCCCCHHHH Q ss_conf 1444160233101111--56833655247788611589 Q gi|254780617|r 144 EPLSTGIKAIDSLIPI--GRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 144 ~~l~TGI~~ID~l~pi--grGQR~~I~g~~g~GKt~l~ 179 (509) ..++-=-|+=|.-+|. |.|.-.||+|+-|||||+|= T Consensus 536 nsiqdlskandiklPflqGsG~~lLi~GPNGCGKSSLF 573 (788) T TIGR00954 536 NSIQDLSKANDIKLPFLQGSGNHLLICGPNGCGKSSLF 573 (788) T ss_pred HHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCHHHHH T ss_conf 35665554302435512158876687688998647899 No 350 >cd01864 Rab19 Rab19 subfamily. Rab19 proteins are associated with Golgi stacks. Similarity analysis indicated that Rab41 is closely related to Rab19. However, the function of these Rabs is not yet chracterized. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation. Probab=88.90 E-value=0.74 Score=25.29 Aligned_cols=19 Identities=32% Similarity=0.632 Sum_probs=16.3 Q ss_pred EEEEECCCCCCCHHHHHHH Q ss_conf 3655247788611589999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDT 182 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~ 182 (509) |+.++|++|||||+|+.-- T Consensus 5 Kiv~lGd~~vGKTsli~r~ 23 (165) T cd01864 5 KIILIGDSNVGKTCVVQRF 23 (165) T ss_pred EEEEECCCCCCHHHHHHHH T ss_conf 9999999995899999999 No 351 >PRK13636 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=88.90 E-value=0.28 Score=28.39 Aligned_cols=35 Identities=26% Similarity=0.496 Sum_probs=27.5 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 1602331-01111568336552477886115899999 Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .|..++| .=+.|-+|+..+|+|+.|+||||++. +| T Consensus 17 ~g~~aL~~isl~I~~Ge~~aiiG~NGaGKSTLl~-~i 52 (285) T PRK13636 17 DGTHALKGININIKKGEVTAILGGNGAGKSTLFQ-NL 52 (285) T ss_pred CCCEEEEEEEEEECCCCEEEEECCCCCHHHHHHH-HH T ss_conf 9988876137898799899999999980999999-99 No 352 >TIGR01184 ntrCD nitrate ABC transporter, ATP-binding proteins C and D; InterPro: IPR005890 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This model describes the ATP binding subunits of nitrate transport in bacteria and archaea. This protein belongs to the ATP-binding cassette (ABC) superfamily. It is thought that the two subunits encoded by ntrC and ntrD form the binding surface for interaction with ATP. This model is restricted in identifying ATP binding subunit associated with the nitrate transport. Nitrate assimilation is aided by other proteins derived from the operon which among others include products of ntrA - a regulatory protein; ntrB - a hydropbobic transmembrane permease and narB - a reductase. ; GO: 0015112 nitrate transmembrane transporter activity, 0015706 nitrate transport, 0016020 membrane. Probab=88.90 E-value=0.32 Score=27.96 Aligned_cols=137 Identities=20% Similarity=0.251 Sum_probs=69.9 Q ss_pred CCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEEC Q ss_conf 11115683365524778861158999999988630246678760699995167767899999851146742115899728 Q gi|254780617|r 156 LIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVAS 235 (509) Q Consensus 156 l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~ 235 (509) =+.|-+|.-+.++|.|||||||| +.+|.- . ...+. +++--.|+++ .+.| -+|.||+-+- T Consensus 5 ~L~i~~GEFisliGHSGCGKSTL-LNli~G---l---~~P~~------G~v~L~G~~i-------~~PG-PdRMVVFQNY 63 (230) T TIGR01184 5 DLTIRQGEFISLIGHSGCGKSTL-LNLISG---L---AQPTS------GGVILEGKQI-------TEPG-PDRMVVFQNY 63 (230) T ss_pred EEEEECCCEEEEECCCCCCHHHH-HHHHHH---C---CCCCC------CCEEECCEEC-------CCCC-CCEEEEEECC T ss_conf 02552673699851278617899-999850---0---57777------6167626241-------7876-9604785062 Q ss_pred CCCCHHHHHHHHHH--HHHHHHHHHC--CCCCEEEEEEC-----HHHHHHH---------------HHHH-----HHHCC Q ss_conf 87888999998876--6666675420--46884799616-----4789999---------------9875-----45303 Q gi|254780617|r 236 ASDPAPMQLLAPFA--GCAMGEYFRD--NGYHALIAYDD-----LQKHAVA---------------YRQL-----SLLLR 286 (509) Q Consensus 236 a~~~~~~r~~ap~~--a~aiAEyfr~--~G~~VLi~~Dd-----ltr~A~a---------------~rei-----sl~~~ 286 (509) |= +=|++.+- ++|+-+-.++ +...--++-+. |++-|+- -|.+ -|+|. T Consensus 64 sL----lPW~tvr~NiaLAV~~v~~~~~~~E~~~iv~~~~~lVgL~~Aa~K~p~~lSGGMKQRVAIARaLs~RP~~LlLD 139 (230) T TIGR01184 64 SL----LPWLTVRENIALAVDRVLRDLSKSERRAIVEEHIDLVGLREAADKRPGQLSGGMKQRVAIARALSIRPKVLLLD 139 (230) T ss_pred CC----CCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCHHHHCCCCHHCCCCHHHHHHHHHHHHHCCEEEEEE T ss_conf 00----32256888999999999862135479999998886402123411780030584268999986653176012310 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEE Q ss_conf 787543337520012467888875045445887356753 Q gi|254780617|r 287 RPPGREAYPGDVFYLHSRLLERAAKMSDALGAGSLTALP 325 (509) Q Consensus 287 ~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~ 325 (509) ||-|.= |. -.-..|.|+-=+..++ -.+|++- T Consensus 140 EPFGAL----DA-lTr~~LQe~L~~I~~e---~~~T~~M 170 (230) T TIGR01184 140 EPFGAL----DA-LTRGNLQEELLQIVEE---ARVTVVM 170 (230) T ss_pred CCCCHH----HH-HHHHHHHHHHHHHHHH---CCCEEEE T ss_conf 874056----67-5268899999998731---6974899 No 353 >cd03215 ABC_Carb_Monos_II This family represents domain II of the carbohydrate uptake proteins that transport only monosaccharides (Monos). The Carb_Monos family is involved in the uptake of monosaccharides, such as pentoses (such as xylose, arabinose, and ribose) and hexoses (such as xylose, arabinose, and ribose), that cannot be broken down to simple sugars by hydrolysis. In members of Carb_Monos family the single hydrophobic gene product forms a homodimer, while the ABC protein represents a fusion of two nucleotide-binding domains. However, it is assumed that two copies of the ABC domains are present in the assembled transporter. Probab=88.89 E-value=0.29 Score=28.28 Aligned_cols=31 Identities=19% Similarity=0.270 Sum_probs=24.8 Q ss_pred HHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 331-01111568336552477886115899999 Q gi|254780617|r 152 AID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 152 ~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) |+| .=+.+-+|...+|+|..|+||||++ .+| T Consensus 15 aL~~vsl~v~~GEi~~liG~nGaGKSTll-~~l 46 (182) T cd03215 15 AVRDVSFEVRAGEIVGIAGLVGNGQTELA-EAL 46 (182) T ss_pred CEECEEEEECCCCEEEEECCCCCCCCHHH-HHH T ss_conf 76231789859969999888999926377-876 No 354 >TIGR01193 bacteriocin_ABC ABC-type bacteriocin transporter; InterPro: IPR005897 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains ABC-type bacteriocin transporter. In general, bacteriocins are agents which are responsible for killing or inhibiting the closely related species or even different strains of the same species. Bacteriocins are encoded by bacterial plasmids. Bacteriocins are named after the species and hence in literature one encounters various names e.g., leucocin from Leuconostic geldium; pedicocin from Pedicoccus acidilactici; sakacin from Lactobacillus sake etc. Peptide bacteriocins are exported across the cytoplasmic membrane by a dedicated ATP-binding cassette (ABC) transporter. These ABC-transporters have an N-terminal peptidase domain that belong to MEROPS peptidase family C39 (clan CA); a central multi-pass transmembrane region and a C-terminal ABC transporter domain. These transporters have dual function: (i) they remove the N-terminal leader peptide from its bacteriocin precursor by cleavage at a Gly-Gly bond and (ii) transport the mature bacteriocin across the cytoplasmic membrane. This represents a novel strategy for secretion of bacterial proteins . Many bacteria are known to regulate diverse physiological processes through this system, such as bioluminescence, regulation of sporulation, virulence factor expression, antibiotics production, competence for genetic transformation, and activation of biofilm formation .; GO: 0008234 cysteine-type peptidase activity, 0019534 toxin transporter activity, 0006810 transport, 0016020 membrane. Probab=88.87 E-value=0.27 Score=28.47 Aligned_cols=103 Identities=15% Similarity=0.246 Sum_probs=52.4 Q ss_pred HHHH-HCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECC---CCHHHHHHHHHHHCCCCC Q ss_conf 0233-101111568336552477886115899999998863024667876069999516---776789999985114674 Q gi|254780617|r 150 IKAI-DSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIG---QKRSSVARFVKALEDRGA 225 (509) Q Consensus 150 I~~I-D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IG---er~~ev~~~~~~l~~~~~ 225 (509) =+++ |.=++|-++++.-|.|-||.|||||+. ...| ... .......|. ..+.- --..+.|.++.-+-+.- T Consensus 487 ~~~L~di~L~I~~n~k~tiVGmSGSGKsTLaK-LLV~---Ffe-PQ~~sG~I~-Lng~~l~~iD~h~LRq~INYlPQeP- 559 (710) T TIGR01193 487 SNVLSDISLTIKKNEKITIVGMSGSGKSTLAK-LLVG---FFE-PQAESGEIL-LNGISLKDIDRHELRQFINYLPQEP- 559 (710) T ss_pred CCCCCCCEEEEECCCEEEEECCCCCCHHHHHH-HHHC---CCC-CCCCCCEEE-ECCCCHHHCCHHHHCCCCCCCCCCC- T ss_conf 73202642365078548997367974899998-7520---358-998877365-2782445337344412335568878- Q ss_pred CCEEEEEEEC--------CCCCHHHHHHHHHHHHHHHHHH-----HCCCCCEE Q ss_conf 2115899728--------8788899999887666666754-----20468847 Q gi|254780617|r 226 LSYSIVVVAS--------ASDPAPMQLLAPFAGCAMGEYF-----RDNGYHAL 265 (509) Q Consensus 226 ~~~tvvv~a~--------a~~~~~~r~~ap~~a~aiAEyf-----r~~G~~VL 265 (509) .++..+ |.+......+ ..||.+||== +.+|.+.= T Consensus 560 ----~IF~GsILeNLLlGak~~~~~~~i--~~A~~iAEIk~DIe~mp~Gy~T~ 606 (710) T TIGR01193 560 ----YIFSGSILENLLLGAKENVSQDEI--LKAVEIAEIKDDIEKMPLGYQTE 606 (710) T ss_pred ----EEECCCHHHHHHHCCCCCCCHHHH--HHHHHHCCCHHHHHHCCCCCCEE T ss_conf ----451231788765037899898999--98840201124583188876600 No 355 >TIGR00416 sms DNA repair protein RadA; InterPro: IPR004504 RadA/Sms is a highly conserved eubacterial protein that shares sequence similarity with both RecA strand transferase and lon protease. The RadA/Sms family are probable ATP-dependent proteases involved in both DNA repair and degradation of proteins, peptides, glycopeptides. They are classified in as non-peptidase homologues and unassigned peptidases in MEROPS peptidase family S16 (lon protease family, clan SJ). RadA/Sms is involved in recombination and recombinational repair, most likely involving the stabilisation or processing of branched DNA molecules or blocked replication forks because of its genetic redundancy with RecG and RuvABC .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0006281 DNA repair. Probab=88.87 E-value=0.59 Score=26.02 Aligned_cols=145 Identities=24% Similarity=0.211 Sum_probs=76.8 Q ss_pred HHHCCHHHHHCCC--CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC Q ss_conf 4441602331011--11568336552477886115899999998863024667876069999516776789999985114 Q gi|254780617|r 145 PLSTGIKAIDSLI--PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED 222 (509) Q Consensus 145 ~l~TGI~~ID~l~--pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~ 222 (509) .|.||..=+|-.+ =|.+|-=+||=|+||+|||||++.... |.+.+. ..+ .|+.==|....++-=-.+|-- T Consensus 84 rf~s~~~ElDrVLGGGivpGsliLiGG~PG~GKSTLLLqV~~-~LA~~~-----~~~--LYVsGEES~~Q~klRA~RLGi 155 (481) T TIGR00416 84 RFSSGFGELDRVLGGGIVPGSLILIGGDPGIGKSTLLLQVAC-QLAKNS-----MKV--LYVSGEESLQQIKLRATRLGI 155 (481) T ss_pred EEECCCCCCCEEECCCCCCCCEEEECCCCCCCHHHHHHHHHH-HHHHCC-----CCE--EEEEECCHHHHHHHHHHHCCC T ss_conf 053066410011067222441698468899635678999999-984048-----816--899723016778888754553 Q ss_pred CCCCCEE---EEEEECCCCCHHHHHH---HHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 6742115---8997288788899999---887666666754204688479961647899999875453037875433375 Q gi|254780617|r 223 RGALSYS---IVVVASASDPAPMQLL---APFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPG 296 (509) Q Consensus 223 ~~~~~~t---vvv~a~a~~~~~~r~~---ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~ 296 (509) ....+.+ -=+.+.+.|--. .++ --..=+..+| +=+--++|+||+=..- .|--..=|| T Consensus 156 t~~~~~sqaqdGinnlahdG~L-~~L~Et~~e~I~~~~e----~~~P~~~ViDSIQ~ly------------~~di~SaPG 218 (481) T TIGR00416 156 TDLVEPSQAQDGINNLAHDGNL-YVLSETNLEQICAEIE----ELNPQVVVIDSIQTLY------------LPDISSAPG 218 (481) T ss_pred CCCCCHHHHHHHHHHHHCCCCC-CCCCCCCHHHHHHHHH----HHCCCEEEEECCCCCC------------CHHHCCCCC T ss_conf 2478702344324543026753-2157579899999998----5299489991421000------------000025888 Q ss_pred CCHH--HHHHHHHHHHHCCC Q ss_conf 2001--24678888750454 Q gi|254780617|r 297 DVFY--LHSRLLERAAKMSD 314 (509) Q Consensus 297 ~vf~--~~s~l~ERa~~~~~ 314 (509) ||-- ...-+|.|-+|+.+ T Consensus 219 SVsQVRE~t~~Lmr~AKt~~ 238 (481) T TIGR00416 219 SVSQVRECTAELMRLAKTRG 238 (481) T ss_pred CHHHHHHHHHHHHHHHHHCC T ss_conf 42388899999987652168 No 356 >PRK07952 DNA replication protein DnaC; Validated Probab=88.85 E-value=0.67 Score=25.63 Aligned_cols=42 Identities=19% Similarity=0.248 Sum_probs=27.7 Q ss_pred HHHHCCHHHHHCCCCCCCCCEEEEE-CCCCCCCHHHHHHHHHHHHHH Q ss_conf 1444160233101111568336552-477886115899999998863 Q gi|254780617|r 144 EPLSTGIKAIDSLIPIGRGQRELII-GDRKTGKTSIILDTFLNQKSS 189 (509) Q Consensus 144 ~~l~TGI~~ID~l~pigrGQR~~I~-g~~g~GKt~l~~~~I~nq~~~ 189 (509) ..+.+-..-++.+- .|..-+|| |++|||||.||. .|.|.... T Consensus 80 ~al~~a~~y~enf~---~~~~gLlF~G~~GTGKThLA~-aIan~Li~ 122 (242) T PRK07952 80 NALSKARQYVEEFD---GNIASFIFSGKPGTGKNHLAA-AICNELLL 122 (242) T ss_pred HHHHHHHHHHHHHC---CCCCEEEEECCCCCCHHHHHH-HHHHHHHH T ss_conf 99999999998654---388717997899997899999-99999998 No 357 >KOG0080 consensus Probab=88.84 E-value=0.51 Score=26.44 Aligned_cols=23 Identities=35% Similarity=0.598 Sum_probs=19.8 Q ss_pred EEEEECCCCCCCHHHHHHHHHHH Q ss_conf 36552477886115899999998 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq 186 (509) +++|+|++|+||++|++..+-|. T Consensus 13 KiLlIGeSGVGKSSLllrFv~~~ 35 (209) T KOG0080 13 KILLIGESGVGKSSLLLRFVSNT 35 (209) T ss_pred EEEEECCCCCCHHHHHHHHHHCC T ss_conf 99998168765789999987643 No 358 >PRK10908 cell division protein FtsE; Provisional Probab=88.76 E-value=0.27 Score=28.51 Aligned_cols=33 Identities=15% Similarity=0.396 Sum_probs=25.3 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ..+++ .=+.+-+|...+|+|++|+|||||+ .+| T Consensus 15 ~~~L~~vsl~i~~Ge~~~liG~nGsGKSTLl-~~i 48 (222) T PRK10908 15 RQALQGVTFHMRPGEMAFLTGHSGAGKSTLL-KLI 48 (222) T ss_pred CEEEECCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 8798643879969989999999980799999-999 No 359 >COG1134 TagH ABC-type polysaccharide/polyol phosphate transport system, ATPase component [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane] Probab=88.75 E-value=0.3 Score=28.19 Aligned_cols=26 Identities=35% Similarity=0.563 Sum_probs=23.1 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 10111156833655247788611589 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) |.=+.+-+|.|+||+|..|.|||||+ T Consensus 45 disf~i~~Ge~vGiiG~NGaGKSTLl 70 (249) T COG1134 45 DISFEIYKGERVGIIGHNGAGKSTLL 70 (249) T ss_pred CCEEEEECCCEEEEECCCCCCHHHHH T ss_conf 73588607989989878998589999 No 360 >COG1136 SalX ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms] Probab=88.74 E-value=0.3 Score=28.13 Aligned_cols=30 Identities=27% Similarity=0.542 Sum_probs=24.2 Q ss_pred HHHH-HCCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 0233-10111156833655247788611589 Q gi|254780617|r 150 IKAI-DSLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 150 I~~I-D~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +.++ |.=+.|-+|--..|.|++|+|||||+ T Consensus 18 ~~~L~~v~l~i~~Ge~vaI~GpSGSGKSTLL 48 (226) T COG1136 18 VEALKDVNLEIEAGEFVAIVGPSGSGKSTLL 48 (226) T ss_pred EEECCCCEEEECCCCEEEEECCCCCCHHHHH T ss_conf 6752462588749989999899999899999 No 361 >TIGR02770 nickel_nikD nickel import ATP-binding protein NikD; InterPro: IPR014138 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family represents the NikD subunit of a multisubunit nickel import ABC transporter complex. Nickel, once imported, may be used in urease and in certain classes of hydrogenase and superoxide dismutase. NikD and NikE are related families.. Probab=88.66 E-value=0.45 Score=26.89 Aligned_cols=172 Identities=17% Similarity=0.153 Sum_probs=97.9 Q ss_pred HHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHH-HHHHCCCCCCCEEEE Q ss_conf 3101111568336552477886115899999998863024667876069999516776789999-985114674211589 Q gi|254780617|r 153 IDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARF-VKALEDRGALSYSIV 231 (509) Q Consensus 153 ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~-~~~l~~~~~~~~tvv 231 (509) -|.=+.+=||+-.+++|.||+|||.-|+- ++.- -.+.+.-+=+-|=-+|+++... .++.+..-..+-++| T Consensus 3 ~~~~L~lk~G~~~aLvG~SGSGKS~tc~A-~Lg~--------L~~~~~~~~G~i~l~G~~~~~~p~kemr~~Rg~~~~~i 73 (239) T TIGR02770 3 QDVNLSLKRGEVLALVGESGSGKSLTCLA-ILGL--------LPPGLTQTSGEILLDGRPLLALPKKEMRSIRGRHIATI 73 (239) T ss_pred CCCCCEEECCEEEEEECCCCCCHHHHHHH-HHCC--------CCCCCEEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 16752033243788874878758999999-8503--------58652231026677172314587368998866663011 Q ss_pred EEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEE-ECHH-HHHHHHHHHHHHCCCCCCCCCCC-------------- Q ss_conf 9728878889999988766666675420468847996-1647-89999987545303787543337-------------- Q gi|254780617|r 232 VVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAY-DDLQ-KHAVAYRQLSLLLRRPPGREAYP-------------- 295 (509) Q Consensus 232 v~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~-Ddlt-r~A~a~reisl~~~~pp~~~gyp-------------- 295 (509) +-++.+.=-+++ +|++++++.=+--.-+. ++=+ .-.+|.++++|--.+- --+-|| T Consensus 74 mQnPr~AFNPl~--------tm~~h~~ETl~~~~k~~k~~A~~~~~~~l~~VgL~~~~~-~L~~YPF~LSGGMLQRvMIA 144 (239) T TIGR02770 74 MQNPRTAFNPLL--------TMAEHAIETLRSLGKLSKKQARALILEALEAVGLEDPEE-VLKKYPFQLSGGMLQRVMIA 144 (239) T ss_pred CCCCCCCCHHHH--------HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHH-HHHHCCCCCCCCHHHHHHHH T ss_conf 028500001676--------399999999997217461017999999998638522799-99746641368627899999 Q ss_pred --------------C----C------CHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCC--CCHHHHHHHHHCCE Q ss_conf --------------5----2------00124678888750454458873567531003676644--31146688540616 Q gi|254780617|r 296 --------------G----D------VFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVS--AYIPTNVISITDGQ 349 (509) Q Consensus 296 --------------~----~------vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~--~~i~~~~~si~DG~ 349 (509) | | |-+.+-+|.|+-| |+ ++.-.-|+. +.++|.+-=.-+|+ T Consensus 145 lA~~~~~PfLiADEPTTdLDv~~Q~~vL~~L~~l~~~~G-----------~g---iLLiTHDlgVvA~~AD~V~VM~~G~ 210 (239) T TIGR02770 145 LALLLEPPFLIADEPTTDLDVVVQARVLKLLDELRQKRG-----------TG---ILLITHDLGVVARLADEVAVMDDGR 210 (239) T ss_pred HHHHHCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHHHC-----------CE---EEEEEHHHHHHHHHHHHHHHHHCCE T ss_conf 999855710141799838779999999999999987518-----------72---2652000789999987998876780 Q ss_pred EEEECCH Q ss_conf 9961501 Q gi|254780617|r 350 IFLETEL 356 (509) Q Consensus 350 i~l~~~l 356 (509) |+=.... T Consensus 211 IvE~~~v 217 (239) T TIGR02770 211 IVERGTV 217 (239) T ss_pred EEECCCH T ss_conf 6764887 No 362 >KOG1433 consensus Probab=88.62 E-value=1.8 Score=22.42 Aligned_cols=121 Identities=19% Similarity=0.154 Sum_probs=64.5 Q ss_pred CCCCCHHHHHCCHHHHHCCC--CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEE-ECCC-CHHHH Q ss_conf 12660014441602331011--11568336552477886115899999998863024667876069999-5167-76789 Q gi|254780617|r 138 QRQSVCEPLSTGIKAIDSLI--PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYV-AIGQ-KRSSV 213 (509) Q Consensus 138 ~R~~i~~~l~TGI~~ID~l~--pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~-~IGe-r~~ev 213 (509) +++.-.-.|.||.++.|-++ -++-|.=..|+|++++|||-+++-+- +.. ....+-|+|+ ..+. +.... T Consensus 85 ~~~~~~~~l~Tg~~~lD~lL~gGi~~g~~TEi~G~p~~GKtQlc~~~~--v~~------~gge~~~l~IDs~s~~~~~~~ 156 (326) T KOG1433 85 QLRSELGFLSTGSKALDKLLGGGIETGSLTELVGPPGSGKTQLCHTLA--VTC------GGGEGKVLYIDTESTFRLERL 156 (326) T ss_pred HHHCCCEEECCCCHHHHHHHCCCCCCCCEEEEECCCCCCHHHHHHHHH--HHC------CCCCCEEEEEECCHHCCCCHH T ss_conf 861676254155055667742675558356885589844778888998--870------687511899952111033211 Q ss_pred HHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHH---HHHHHHHHHHCCCCCEEEEEECHHHH Q ss_conf 99998511467421158997288788899999887---66666675420468847996164789 Q gi|254780617|r 214 ARFVKALEDRGALSYSIVVVASASDPAPMQLLAPF---AGCAMGEYFRDNGYHALIAYDDLQKH 274 (509) Q Consensus 214 ~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ap~---~a~aiAEyfr~~G~~VLi~~Ddltr~ 274 (509) .++-....... +. +.+.-...+..--. ...-.|+.|..+.+.-++++|+.|.. T Consensus 157 ~~ia~~~~~~~-~~-------~~~~l~~~~~~~~~~~l~~~~~a~~~~~~~~~~l~~vds~ta~ 212 (326) T KOG1433 157 TEIAGRSGLRG-RD-------TLSNLMLARAYNLDHQLQLIQEAEIMINQSRVKLLIVDSATAL 212 (326) T ss_pred HHHHHHHHHHH-HH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCC T ss_conf 35666630306-77-------8888899999856667777999999862031169994134541 No 363 >PRK11614 livF leucine/isoleucine/valine transporter ATP-binding subunit; Provisional Probab=88.59 E-value=0.3 Score=28.10 Aligned_cols=35 Identities=29% Similarity=0.516 Sum_probs=27.0 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 02331-0111156833655247788611589999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .++++ .=+.+-+|+..+|+|..|.|||||+ .+|.. T Consensus 18 ~~~L~~isl~i~~Gei~~liG~NGaGKSTLl-~~i~G 53 (237) T PRK11614 18 IQALHEVSLHINQGEIVTLIGANGAGKTTLL-GTLCG 53 (237) T ss_pred EEEEEEEEEEECCCCEEEEECCCCCCHHHHH-HHHHC T ss_conf 8888112789869979999879997599999-99967 No 364 >KOG0730 consensus Probab=88.59 E-value=0.25 Score=28.73 Aligned_cols=69 Identities=17% Similarity=0.126 Sum_probs=37.9 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEEC Q ss_conf 83365524778861158999999988630246678760699995167767899999851146742115899728 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVAS 235 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~ 235 (509) +--.+..|++|+|||-+ .+.+.|-.+.+....+...+ +--.-||..++.++..+.-..... .++++... T Consensus 218 prg~Ll~gppg~Gkt~l-~~aVa~e~~a~~~~i~~pel--i~k~~gEte~~LR~~f~~a~k~~~--psii~IdE 286 (693) T KOG0730 218 PRGLLLYGPPGTGKTFL-VRAVANEYGAFLFLINGPEL--ISKFPGETESNLRKAFAEALKFQV--PSIIFIDE 286 (693) T ss_pred CCCCCCCCCCCCCHHHH-HHHHHHHHCCEEEECCCHHH--HHHCCCCHHHHHHHHHHHHHCCCC--CEEEEHHH T ss_conf 98744438999981899-99999973722574062899--985246317789999999866599--80775876 No 365 >cd03214 ABC_Iron-Siderophores_B12_Hemin ABC transporters, involved in the uptake of siderophores, heme, and vitamin B12, are widely conserved in bacteria and archaea. Only very few species lack representatives of the siderophore family transporters. The E. coli BtuCD protein is an ABC transporter mediating vitamin B12 uptake. The two ATP-binding cassettes (BtuD) are in close contact with each other, as are the two membrane-spanning subunits (BtuC); this arrangement is distinct from that observed for the E. coli lipid flippase MsbA. The BtuC subunits provide 20 transmembrane helices grouped around a translocation pathway that is closed to the cytoplasm by a gate region, whereas the dimer arrangement of the BtuD subunits resembles the ATP-bound form of the Rad50 DNA repair enzyme. A prominent cytoplasmic loop of BtuC forms the contact region with the ATP-binding cassette and represent a conserved motif among the ABC transporters. Probab=88.42 E-value=0.33 Score=27.83 Aligned_cols=30 Identities=23% Similarity=0.405 Sum_probs=24.1 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 0111156833655247788611589999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .=+.+-+|+..+|+|.+|+|||||+ .+|.. T Consensus 18 is~~i~~Ge~~~liG~nGsGKTTLl-~~i~G 47 (180) T cd03214 18 LSLSIEAGEIVGILGPNGAGKSTLL-KTLAG 47 (180) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHHC T ss_conf 3778869979999989998899999-99957 No 366 >PRK10895 putative ABC transporter ATP-binding protein YhbG; Provisional Probab=88.34 E-value=0.32 Score=27.92 Aligned_cols=34 Identities=15% Similarity=0.324 Sum_probs=25.9 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 02331-011115683365524778861158999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .++++ .=+.+-+|+-.+|+|+.|+||||++ .+|. T Consensus 16 ~~~l~~vsl~i~~Gei~~liGpNGaGKSTLl-~~i~ 50 (241) T PRK10895 16 RRVVEDVSLTVNSGEIVGLLGPNGAGKTTTF-YMVV 50 (241) T ss_pred EEEEEEEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 9999520789839979999889998699999-9996 No 367 >PTZ00265 multidrug resistance protein (mdr1); Provisional Probab=88.32 E-value=0.35 Score=27.62 Aligned_cols=33 Identities=15% Similarity=0.399 Sum_probs=22.9 Q ss_pred HHHHHC-CCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 023310-1111568336552477886115899999 Q gi|254780617|r 150 IKAIDS-LIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID~-l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) +++++. =+.|-.||...|.|.+|+||||+. ..+ T Consensus 398 ~~VL~~~sl~i~~G~t~AlVG~SGsGKSTii-~LL 431 (1467) T PTZ00265 398 VEIYKDLNFTLKEGKTYAFVGESGCGKSTIL-KLI 431 (1467) T ss_pred CEECCCCEEEECCCCEEEEECCCCCCHHHHH-HHH T ss_conf 8300683388638977998668887566799-999 No 368 >cd00984 DnaB_C DnaB helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=88.28 E-value=1.8 Score=22.58 Aligned_cols=30 Identities=23% Similarity=0.268 Sum_probs=25.5 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 156833655247788611589999999886 Q gi|254780617|r 159 IGRGQRELIIGDRKTGKTSIILDTFLNQKS 188 (509) Q Consensus 159 igrGQR~~I~g~~g~GKt~l~~~~I~nq~~ 188 (509) +.+||=..|.|.+|+|||++++.+..|.+. T Consensus 10 ~~~G~L~vi~a~~g~GKS~~~~~la~~~a~ 39 (242) T cd00984 10 LQPGDLIIIAARPSMGKTAFALNIAENIAK 39 (242) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHHH T ss_conf 999818999968999999999999999999 No 369 >TIGR02528 EutP ethanolamine utilization protein, EutP; InterPro: IPR012381 Members of this family function in ethanolamine and propanediol degradation pathways. Both pathways require coenzyme B12 (adenosylcobalamin, AdoCbl). Bacteria that harbour these pathways can use ethanolamine as a source of carbon and nitrogen, or propanediol as a sole carbon and energy source, respectively. The exact roles of the EutP and PduV proteins in these respective pathways are not yet determined. Members of this family contain P-loop consensus motifs in the N-terminal part, and are distantly related to various GTPases and ATPases, including ATPase components of transport systems. Propanediol degradation is thought to be important for the natural Salmonella populations, since propanediol is produced by the fermentation of the common plant sugars rhamnose and fucose , . More than 10f the Salmonella enterica genome is devoted to the utilisation of propanediol and cobalamin biosynthesis. In vivo expression technology has indicated that propanediol utilisation (pdu) genes may be important for growth in host tissues, and competitive index studies with mice have shown that pdu mutations confer a virulence defect , . The pdu operon is contiguous and co-regulated with the cobalamin (B12) biosynthesis cob operon, indicating that propanediol catabolism may be the primary reason for de novo B12 synthesis in Salmonella , , . Please see IPR003207 from INTERPRO, IPR003208 from INTERPRO, IPR009204 from INTERPRO, IPR009191 from INTERPRO, IPR009192 from INTERPRO for more details on the propanediol utilisation pathway and the pdu operon.; GO: 0005524 ATP binding, 0006576 biogenic amine metabolic process. Probab=88.24 E-value=0.56 Score=26.19 Aligned_cols=71 Identities=28% Similarity=0.263 Sum_probs=42.5 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEE---EECCCCHHHHHH--HHHHHCCCCCCCEEEEEEECCCC Q ss_conf 3655247788611589999999886302466787606999---951677678999--99851146742115899728878 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIY---VAIGQKRSSVAR--FVKALEDRGALSYSIVVVASASD 238 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~---~~IGer~~ev~~--~~~~l~~~~~~~~tvvv~a~a~~ 238 (509) |+.++|..|||||||+ |+-++.....++..-+=| -.|.-.|.-+.. ++..|.-.-+-...++.+-.+++ T Consensus 2 r~~f~G~~gCGKTTL~------q~L~g~~~~YKKTQAvE~~~k~~IDTPGEY~enr~~Y~AL~vtaaDAd~i~lV~~a~~ 75 (144) T TIGR02528 2 RIMFIGSVGCGKTTLT------QALQGEEIKYKKTQAVEYKDKEAIDTPGEYVENRRYYSALIVTAADADVIALVQSATD 75 (144) T ss_pred EEEEEECCCCCHHHHH------HHCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHCCEEEEEEECCCC T ss_conf 1788715888744354------3116873210233445425888655985001575237888888721023667735776 Q ss_pred CH Q ss_conf 88 Q gi|254780617|r 239 PA 240 (509) Q Consensus 239 ~~ 240 (509) +. T Consensus 76 ~~ 77 (144) T TIGR02528 76 EE 77 (144) T ss_pred CC T ss_conf 42 No 370 >COG1123 ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only] Probab=88.23 E-value=0.34 Score=27.76 Aligned_cols=30 Identities=23% Similarity=0.479 Sum_probs=24.0 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.+.+|+..+|.|.+|+|||+++. +|+ T Consensus 27 ~vsf~v~~GE~lgIvGESGsGKSt~a~-~i~ 56 (539) T COG1123 27 DVSFEVEPGEILGIVGESGSGKSTLAL-ALM 56 (539) T ss_pred CCEEEECCCCEEEEECCCCCCHHHHHH-HHH T ss_conf 424786489589998689888899999-985 No 371 >TIGR02857 CydD ABC transporter, CydDC cysteine exporter (CydDC-E) family, permease/ATP-binding protein CydD; InterPro: IPR014216 The gene pair cydCD encodes an ABC-family transporter in which each gene contains an N-terminal membrane-spanning domain and a C-terminal ATP-binding domain. In Escherichia coli these genes were discovered as mutants which caused the terminal heme-copper oxidase complex cytochrome bd to fail to assemble. Recent work has shown that the transporter is involved in the export of redox-active thiol compounds such as cysteine and glutathione , . The linkage to assembly of the cytochrome bd complex is further supported by the conserved operon structure found outside the gammaproteobacteria (cydABCD) containing both the transporter and oxidase genes components. The genes used as the seed members for this entry are all either found in the gammaproteobacterial context, or the CydABCD context. All members of this entry scoring above the trusted cut off at the time of its creation were from genomes which encode a cytochrome bd complex. Unfortunately, the gene symbol nomenclature, based on this operon in Bacillus subtilis, assigns cydC to the third gene in the operon, where this gene is actually homologous to the E. coli cydD gene. We have chosen to name all homologues in this family in accordance with the precedence of publication of the E. coli name, CydD.. Probab=88.20 E-value=0.31 Score=28.01 Aligned_cols=83 Identities=25% Similarity=0.417 Sum_probs=45.4 Q ss_pred HHHHCCCC-HHHCCCCCCCCCCCCCCCHHCCC-----CCCCCCCCCCHHHHH-CCHH--HHHCC-CCCCCCCEEEEECCC Q ss_conf 69830003-00231446644454441001013-----577311266001444-1602--33101-111568336552477 Q gi|254780617|r 102 ELLGRVVD-ALGNPIDGKGPIKCEQRSCTEAD-----APGIIQRQSVCEPLS-TGIK--AIDSL-IPIGRGQRELIIGDR 171 (509) Q Consensus 102 ~lLGRVvD-~lG~PlDg~g~i~~~~~~~i~~~-----~p~~~~R~~i~~~l~-TGI~--~ID~l-~pigrGQR~~I~g~~ 171 (509) +-+.+|+| .-..|.-++.|+...+.+...+. .|. ++-+-++ ..+ -|-+ +++.+ +++-=|++..|.|.| T Consensus 310 ~~~~~~l~~~~~~p~~~~~p~~~~~~~~~~~~a~DG~~P~-l~~~~v~-~~y~~G~~~pa~~~~sf~~~pG~~vAl~G~S 387 (570) T TIGR02857 310 EALFTVLDAAEPRPLAGKAPLTAAPASALGRLAADGADPS-LEFEGVS-VAYAPGRDEPALRPVSFTVEPGERVALVGPS 387 (570) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCCE-EEEEEEE-EECCCCCHHHCCCCCCEEECCCCEEEEEECC T ss_conf 9999997368887788766776544301001003699840-7887516-8627888010378854166387048886279 Q ss_pred CCCCHHHHHHHHHHHH Q ss_conf 8861158999999988 Q gi|254780617|r 172 KTGKTSIILDTFLNQK 187 (509) Q Consensus 172 g~GKt~l~~~~I~nq~ 187 (509) |.||||| ++.++-.. T Consensus 388 GaGKSTL-L~lLLGf~ 402 (570) T TIGR02857 388 GAGKSTL-LNLLLGFV 402 (570) T ss_pred CCCHHHH-HHHHHCCC T ss_conf 9978899-99997157 No 372 >cd04102 RabL3 RabL3 (Rab-like3) subfamily. RabL3s are novel proteins that have high sequence similarity with Rab family members, but display features that are distinct from Rabs, and have been termed Rab-like. As in other Rab-like proteins, RabL3 lacks a prenylation site at the C-terminus. The specific function of RabL3 remains unknown. Probab=88.17 E-value=0.27 Score=28.44 Aligned_cols=20 Identities=35% Similarity=0.586 Sum_probs=16.4 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++||+|||||+|+.-.+ T Consensus 2 KIlllGDsgVGKTSL~~~~~ 21 (202) T cd04102 2 RVLVVGDSGVGKSSLVHLIC 21 (202) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 89999999989999999998 No 373 >TIGR02324 CP_lyasePhnL phosphonate C-P lyase system protein PhnL; InterPro: IPR012701 Members of this family are the PhnL protein of C-P lyase systems for utilization of phosphonates. These systems resemble phosphonatase-based systems in having a three-component ABC transporter, where IPR005769 from INTERPRO is the permease, IPR005770 from INTERPRO is the phosphonates binding protein, and IPR012693 from INTERPRO is the ATP-binding cassette (ABC) protein. They differ, however, in having, typically, ten or more additional genes, many of which are believed to form a membrane-associated C-P lyase complex. This protein (PhnL) and the adjacent-encoded PhnK (IPR012700 from INTERPRO) resemble transporter ATP-binding proteins but are suggested, based on mutagenesis studies, to be part of this C-P lyase complex rather than part of a transporter per se .. Probab=88.11 E-value=0.4 Score=27.24 Aligned_cols=117 Identities=16% Similarity=0.186 Sum_probs=58.3 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCC-CCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEE Q ss_conf 0111156833655247788611589999999886302466787-606999951677678999998511467421158997 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKD-KVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVV 233 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~-~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~ 233 (509) .=+.+--|+...+.|.+|.|||||+-..-.|=+-.....-... .- +.=+..-..+||.++.+.--. .+ T Consensus 27 v~l~V~aGEcv~L~G~SGaGKSTlLk~lYaNYlp~~G~i~~~H~G~--~~DL~~a~pr~vl~vRr~tiG---------YV 95 (224) T TIGR02324 27 VSLTVNAGECVALSGPSGAGKSTLLKSLYANYLPDSGRILVRHEGA--WVDLAQASPREVLEVRRKTIG---------YV 95 (224) T ss_pred CEEEEECCCEEEEECCCCCCHHHHHHHHHHCCCCCCCEEEEEECCH--HHHHHCCCHHHHHHHHHHHCC---------CE T ss_conf 4378736735885368887678999976630474686777762404--767507684577877300335---------15 Q ss_pred ECCCCCHHHHHH--HHHHHH--HHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCC--CCC-CCCCCCCCHH Q ss_conf 288788899999--887666--666754204688479961647899999875453037--875-4333752001 Q gi|254780617|r 234 ASASDPAPMQLL--APFAGC--AMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRR--PPG-REAYPGDVFY 300 (509) Q Consensus 234 a~a~~~~~~r~~--ap~~a~--aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~--pp~-~~gyp~~vf~ 300 (509) + ||+ -|...| -+||=....|- .++.|+.+-..+|.+ .|= -=.|||..|+ T Consensus 96 S--------QFLRViPRvsalevV~ePL~~~G~----------~~~~A~~~A~~LL~rLniPERLW~LpPaTFS 151 (224) T TIGR02324 96 S--------QFLRVIPRVSALEVVAEPLLERGV----------PREAARARARELLARLNIPERLWSLPPATFS 151 (224) T ss_pred E--------EEEEECCCCCHHHHHHHHHHHCCC----------HHHHHHHHHHHHHHHCCCCHHHCCCCCCCCC T ss_conf 5--------530312886728888787987289----------5899999999999755740244288788656 No 374 >PRK00300 gmk guanylate kinase; Provisional Probab=88.11 E-value=2 Score=22.20 Aligned_cols=89 Identities=18% Similarity=0.307 Sum_probs=48.8 Q ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCC-EEEEEEC-----CCCH------HHHHHHHHHHCCCCCC Q ss_conf 15683365524778861158999999988630246678760-6999951-----6776------7899999851146742 Q gi|254780617|r 159 IGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKV-YCIYVAI-----GQKR------SSVARFVKALEDRGAL 226 (509) Q Consensus 159 igrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v-~~V~~~I-----Ger~------~ev~~~~~~l~~~~~~ 226 (509) .-+|.=+.|.|++|+||++|+- .++++-. ..+ ++|-.-- ||.. -.-.+|-+.......+ T Consensus 4 ~~~g~livisGPSG~GK~tl~~-~L~~~~p--------~~~~~~vs~TTR~pR~~E~dG~dY~Fvs~eeF~~~i~~g~Fl 74 (208) T PRK00300 4 MRRGLLIVLSAPSGAGKSTLVR-ALLERDP--------NDLQLSVSATTRAPRPGEVDGVHYHFVSREEFEEMIENGEFL 74 (208) T ss_pred CCCCCEEEEECCCCCCHHHHHH-HHHHHCC--------CCEEEEEEEECCCCCCCCCCCCEEEEECHHHHHHHHHHCCCE T ss_conf 4188389999999889999999-9997299--------868998974688989987789657996199999998628366 Q ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHHH--HHHHCCCCCEEEEEE Q ss_conf 1158997288788899999887666666--754204688479961 Q gi|254780617|r 227 SYSIVVVASASDPAPMQLLAPFAGCAMG--EYFRDNGYHALIAYD 269 (509) Q Consensus 227 ~~tvvv~a~a~~~~~~r~~ap~~a~aiA--Eyfr~~G~~VLi~~D 269 (509) +++.+.-+ |.|+... +-...+|++|++-+| T Consensus 75 E~~~~~g~-------------~YGT~~~~I~~~~~~G~~vildid 106 (208) T PRK00300 75 EWAEVFGN-------------YYGTPREPVEEALAAGKDVLLEID 106 (208) T ss_pred EEEEECCC-------------CEECCHHHHHHHHHCCCCEEEECC T ss_conf 78998387-------------035246999999856998797467 No 375 >COG4988 CydD ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones] Probab=88.03 E-value=0.36 Score=27.55 Aligned_cols=33 Identities=27% Similarity=0.449 Sum_probs=26.3 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 1011115683365524778861158999999988 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) |.-+++-+||+..|.|.+|+||||+ ++.+.... T Consensus 339 ~l~~t~~~g~~talvG~SGaGKSTL-l~lL~G~~ 371 (559) T COG4988 339 DLNLTIKAGQLTALVGASGAGKSTL-LNLLLGFL 371 (559) T ss_pred CCEEEECCCCEEEEECCCCCCHHHH-HHHHHCCC T ss_conf 7106754896799988999978999-99984757 No 376 >PRK09536 btuD corrinoid ABC transporter ATPase; Reviewed Probab=88.02 E-value=0.32 Score=27.97 Aligned_cols=36 Identities=17% Similarity=0.327 Sum_probs=26.6 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 02331-01111568336552477886115899999998 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) ..++| .=+.+-+|+..+|+|+.|+|||||+ .+|... T Consensus 15 ~~~L~~Vsl~I~~GEi~gLIGPNGAGKSTLL-k~I~Gl 51 (409) T PRK09536 15 TTILDGVDLSVREGHLVGVVGPNGAGKTTLL-RAMNGL 51 (409) T ss_pred EEEEECEEEEECCCCEEEEECCCCCHHHHHH-HHHHCC T ss_conf 9989250889889989999999872799999-999668 No 377 >PRK13641 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=88.00 E-value=0.3 Score=28.13 Aligned_cols=30 Identities=23% Similarity=0.451 Sum_probs=24.7 Q ss_pred HHHH-CCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 2331-01111568336552477886115899 Q gi|254780617|r 151 KAID-SLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 151 ~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) +|+| .=+.|-+|...+|+|.+|+|||||+. T Consensus 21 ~aL~~vsl~I~~Ge~~~iiG~nGsGKSTLl~ 51 (286) T PRK13641 21 KGLDNISFELEDGSFVALIGHTGSGKSTLMQ 51 (286) T ss_pred CEEEEEEEEECCCCEEEEECCCCCHHHHHHH T ss_conf 1243106798699999999999839999999 No 378 >COG0003 ArsA Predicted ATPase involved in chromosome partitioning [Cell division and chromosome partitioning] Probab=87.90 E-value=2 Score=22.11 Aligned_cols=22 Identities=27% Similarity=0.228 Sum_probs=15.4 Q ss_pred CCEEEEECC-CCCCCHHHHHHHH Q ss_conf 833655247-7886115899999 Q gi|254780617|r 162 GQRELIIGD-RKTGKTSIILDTF 183 (509) Q Consensus 162 GQR~~I~g~-~g~GKt~l~~~~I 183 (509) +.|+.+|+| -||||||++..+- T Consensus 1 ~~riv~f~GKGGVGKTT~aaA~A 23 (322) T COG0003 1 MTRIVFFTGKGGVGKTTIAAATA 23 (322) T ss_pred CCEEEEEECCCCCCHHHHHHHHH T ss_conf 93799993688545899999999 No 379 >COG1484 DnaC DNA replication protein [DNA replication, recombination, and repair] Probab=87.87 E-value=1.3 Score=23.51 Aligned_cols=48 Identities=17% Similarity=0.391 Sum_probs=31.4 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 683365524778861158999999988630246678760699995167767899999851146 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDR 223 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~ 223 (509) ++--..+.|++|||||.|+. +|.|..... .+-|.|+- +.+++..|+.. T Consensus 104 ~~~nl~l~G~~G~GKthLa~-Ai~~~l~~~-------g~sv~f~~-------~~el~~~Lk~~ 151 (254) T COG1484 104 RGENLVLLGPPGVGKTHLAI-AIGNELLKA-------GISVLFIT-------APDLLSKLKAA 151 (254) T ss_pred CCCCEEEECCCCCCHHHHHH-HHHHHHHHC-------CCEEEEEE-------HHHHHHHHHHH T ss_conf 58828998999987999999-999999983-------98499988-------59999999998 No 380 >TIGR01448 recD_rel helicase, RecD/TraA family; InterPro: IPR006345 These sequences represent a family similar to RecD, the exodeoxyribonuclease V alpha chain of IPR006344 from INTERPRO. Members of this family, however, are not found in a context of RecB and RecC and are longer by about 200 amino acids at the amino end. Chlamydia muridarum has both a member of this family and a RecD. . Probab=87.87 E-value=1.3 Score=23.44 Aligned_cols=75 Identities=24% Similarity=0.320 Sum_probs=39.2 Q ss_pred ECCEEEEEC---CCCCCCCCEEEEC-----CCEEEEEEEECCCEEEEEEECCC-CCCCCCCEEEECCCEEEEECCHHHHH Q ss_conf 275899975---8887737479957-----98189999205985999980586-68678988897798788667946983 Q gi|254780617|r 35 GDGIARVYG---LNNIRAGEMVQFS-----HGVYGMALNLEVDNVGVVILGSY-KEISEGDIVKRTGRIVDVPVGLELLG 105 (509) Q Consensus 35 ~~giv~v~G---L~~a~~GElv~~~-----~g~~G~Vi~l~~d~v~i~~l~~~-~gI~~G~~V~~tg~~~~vpvG~~lLG 105 (509) +|..+.|.| ||+...|+.+.|. +..+|.++.++.-.-..||-..+ .|| |.......-=-||..+-+ T Consensus 33 Dd~~~tV~Gns~l~~~~~G~~~~f~Gq~~~h~~yG~q~~~~~~~~~amP~~~~~~gi-----~~Yl~S~~~KGvGk~~Aq 107 (769) T TIGR01448 33 DDEEVTVVGNSVLAALAEGDSYTFEGQWEEHAKYGKQFKAERMEKEAMPAPTSKEGI-----VAYLSSSSIKGVGKKLAQ 107 (769) T ss_pred CCCCCEEECCCCCCCCCCCCEEEEEEEEEECCCCCEEEEEEEEHHHHCCCCCHHHHH-----HHHHHCCCCCCCCHHHHH T ss_conf 888506863454675301661568898888754451523352014307886506787-----655522541672057789 Q ss_pred CCCCHHHCC Q ss_conf 000300231 Q gi|254780617|r 106 RVVDALGNP 114 (509) Q Consensus 106 RVvD~lG~P 114 (509) |+|+-||+- T Consensus 108 ~Iv~TfGe~ 116 (769) T TIGR01448 108 RIVETFGEA 116 (769) T ss_pred HHHHHHHHH T ss_conf 999987699 No 381 >cd03232 ABC_PDR_domain2 The pleiotropic drug resistance-like (PDR) family of ATP-binding cassette (ABC) transporters. PDR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. This PDR subfamily represents domain I of its (ABC-IM)2 organization. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=87.83 E-value=0.36 Score=27.55 Aligned_cols=30 Identities=20% Similarity=0.376 Sum_probs=24.7 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.+-+|+-.+|+|++|+|||||+ .+|. T Consensus 25 ~is~~i~~Ge~~~llGpnGaGKSTLl-~~l~ 54 (192) T cd03232 25 NISGYVKPGTLTALMGESGAGKTTLL-DVLA 54 (192) T ss_pred CCEEEEECCEEEEEECCCCCCHHHHH-HHHH T ss_conf 83889928839999999999889999-9983 No 382 >pfam05729 NACHT NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931. Probab=87.83 E-value=0.49 Score=26.57 Aligned_cols=22 Identities=23% Similarity=0.469 Sum_probs=18.3 Q ss_pred CEEEEECCCCCCCHHHHHHHHH Q ss_conf 3365524778861158999999 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~~I~ 184 (509) |++.|.|.+|.|||+++.-++. T Consensus 1 r~i~i~G~aG~GKTtll~kl~~ 22 (165) T pfam05729 1 RTVILQGEAGSGKTTLLQKLAL 22 (165) T ss_pred CEEEEECCCCCCHHHHHHHHHH T ss_conf 9899982798989999999999 No 383 >COG0563 Adk Adenylate kinase and related kinases [Nucleotide transport and metabolism] Probab=87.83 E-value=2 Score=22.09 Aligned_cols=101 Identities=20% Similarity=0.262 Sum_probs=52.6 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHH Q ss_conf 33655247788611589999999886302466787606999951677678999998511467421158997288788899 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPM 242 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~ 242 (509) .|+.|+|.+|.||||+|- .+.+. ....+-+...+ --..+-++ ++.....+.+...|.+ | .+.-. T Consensus 1 ~riiilG~pGaGK~T~A~-~La~~---~~i~hlstgd~-~r~~~~~~-t~lg~~~k~~i~~g~l-----v----~d~i~- 64 (178) T COG0563 1 MRILILGPPGAGKSTLAK-KLAKK---LGLPHLDTGDI-LRAAIAER-TELGEEIKKYIDKGEL-----V----PDEIV- 64 (178) T ss_pred CEEEEECCCCCCHHHHHH-HHHHH---CCCCEECCCCC-CCHHHCCC-CHHHHHHHHHHHCCCC-----C----CHHHH- T ss_conf 979998999998899999-99997---69978552201-11100323-6899999999875895-----0----41769- Q ss_pred HHHHHHHHHHHHHHHHC-CCCCEEEEEECHHHHHHHHHHHHHHCCC Q ss_conf 99988766666675420-4688479961647899999875453037 Q gi|254780617|r 243 QLLAPFAGCAMGEYFRD-NGYHALIAYDDLQKHAVAYRQLSLLLRR 287 (509) Q Consensus 243 r~~ap~~a~aiAEyfr~-~G~~VLi~~Ddltr~A~a~reisl~~~~ 287 (509) -..+-+.+.. ..+. .+++|+.-|...-.|.+--.+.+ T Consensus 65 -------~~~v~~rl~~~d~~~-~~I~dg~PR~~~qa~~l~r~l~~ 102 (178) T COG0563 65 -------NGLVKERLDEADCKA-GFILDGFPRTLCQARALKRLLKE 102 (178) T ss_pred -------HHHHHHHHHHCCCCC-EEEEECCCCHHHHHHHHHHHHHH T ss_conf -------979999997506577-29998998369999999999986 No 384 >COG1135 AbcC ABC-type metal ion transport system, ATPase component [Inorganic ion transport and metabolism] Probab=87.73 E-value=0.44 Score=26.96 Aligned_cols=72 Identities=17% Similarity=0.236 Sum_probs=43.0 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEE---CCCCHHHHHHHHHH---- Q ss_conf 1602331-011115683365524778861158999999988630246678760699995---16776789999985---- Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVA---IGQKRSSVARFVKA---- 219 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~---IGer~~ev~~~~~~---- 219 (509) +.+.++| .=+.|-+|+..||+|-||.|||||. -.+|-. ...+..-+ ++.+ -.-...+.+.+.+. T Consensus 17 ~~~~al~~vsL~I~~GeI~GIIG~SGAGKSTLi--R~iN~L-----e~PtsG~v-~v~G~di~~l~~~~Lr~~R~~IGMI 88 (339) T COG1135 17 GTVTALDDVSLEIPKGEIFGIIGYSGAGKSTLL--RLINLL-----ERPTSGSV-FVDGQDLTALSEAELRQLRQKIGMI 88 (339) T ss_pred CCEEEECCCEEEECCCCEEEEECCCCCCHHHHH--HHHHCC-----CCCCCCEE-EECCEECCCCCHHHHHHHHHHCCEE T ss_conf 751565140688738868999748888678899--998565-----79988649-9868740217867899988650278 Q ss_pred HCCCCCCC Q ss_conf 11467421 Q gi|254780617|r 220 LEDRGALS 227 (509) Q Consensus 220 l~~~~~~~ 227 (509) +.....|. T Consensus 89 FQhFnLLs 96 (339) T COG1135 89 FQHFNLLS 96 (339) T ss_pred ECCCCCCC T ss_conf 21521034 No 385 >PRK13409 putative ATPase RIL; Provisional Probab=87.69 E-value=0.36 Score=27.57 Aligned_cols=20 Identities=25% Similarity=0.456 Sum_probs=17.0 Q ss_pred CCCCEEEEECCCCCCCHHHH Q ss_conf 56833655247788611589 Q gi|254780617|r 160 GRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 160 grGQR~~I~g~~g~GKt~l~ 179 (509) -.||+.||+|.-|.||||++ T Consensus 97 ~~G~v~GLiG~NGaGKST~l 116 (590) T PRK13409 97 KEGKVTGILGPNGIGKSTAV 116 (590) T ss_pred CCCCEEEEECCCCCCHHHHH T ss_conf 99878998899999899999 No 386 >PRK05642 DNA replication initiation factor; Validated Probab=87.68 E-value=0.55 Score=26.22 Aligned_cols=23 Identities=9% Similarity=0.195 Sum_probs=17.0 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 655247788611589999999886 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKS 188 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~ 188 (509) .-|+|++|+|||.|+ .++.|+.. T Consensus 48 l~i~G~~G~GKTHLL-~A~~~~~~ 70 (234) T PRK05642 48 IYLWGKDGVGRSHLL-QAACLRFE 70 (234) T ss_pred EEEECCCCCCHHHHH-HHHHHHHH T ss_conf 899889999889999-99999998 No 387 >PRK10938 putative molybdenum transport ATP-binding protein ModF; Provisional Probab=87.63 E-value=0.37 Score=27.47 Aligned_cols=33 Identities=21% Similarity=0.210 Sum_probs=17.0 Q ss_pred CCCCCEEEECCCEEEEECCHHHHHCCCCHHHCCCCC Q ss_conf 678988897798788667946983000300231446 Q gi|254780617|r 82 ISEGDIVKRTGRIVDVPVGLELLGRVVDALGNPIDG 117 (509) Q Consensus 82 I~~G~~V~~tg~~~~vpvG~~lLGRVvD~lG~PlDg 117 (509) |.+|..+-..|.-- .|+..|-|++.+.-.|..| T Consensus 26 i~~Ge~~~LvG~NG---aGKSTL~k~l~G~l~~~~G 58 (490) T PRK10938 26 LNAGDSWAFVGSNG---SGKSALARALAGELPLLKG 58 (490) T ss_pred ECCCCEEEEECCCC---CHHHHHHHHHHCCCCCCCC T ss_conf 98998999997999---7799999999569987885 No 388 >COG2607 Predicted ATPase (AAA+ superfamily) [General function prediction only] Probab=87.55 E-value=2.1 Score=21.97 Aligned_cols=73 Identities=29% Similarity=0.414 Sum_probs=39.7 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHH Q ss_conf 36552477886115899999998863024667876069999516776789999985114674211589972887888999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQ 243 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r 243 (509) ..++.|.+|+|||+|. -+++|+- +.-|.| +|-.+-++-+.+- T Consensus 87 nVLLwGaRGtGKSSLV-KA~~~e~----------------~~~glr---------------------LVEV~k~dl~~Lp 128 (287) T COG2607 87 NVLLWGARGTGKSSLV-KALLNEY----------------ADEGLR---------------------LVEVDKEDLATLP 128 (287) T ss_pred CEEEECCCCCCHHHHH-HHHHHHH----------------HHCCCE---------------------EEEECHHHHHHHH T ss_conf 3677637777747999-9999998----------------741770---------------------7997688886579 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEECHHH--HHHHHHHHH Q ss_conf 998876666667542046884799616478--999998754 Q gi|254780617|r 244 LLAPFAGCAMGEYFRDNGYHALIAYDDLQK--HAVAYRQLS 282 (509) Q Consensus 244 ~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr--~A~a~reis 282 (509) .+-|-.+...+.+.++.||||= ...+|+.+- T Consensus 129 --------~l~~~Lr~~~~kFIlFcDDLSFe~gd~~yK~LK 161 (287) T COG2607 129 --------DLVELLRARPEKFILFCDDLSFEEGDDAYKALK 161 (287) T ss_pred --------HHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHH T ss_conf --------999999618860899956777777813899999 No 389 >COG1122 CbiO ABC-type cobalt transport system, ATPase component [Inorganic ion transport and metabolism] Probab=87.51 E-value=0.41 Score=27.17 Aligned_cols=27 Identities=33% Similarity=0.674 Sum_probs=23.6 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 101111568336552477886115899 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) |.-+.+.+|++.+|+|..|+|||||+. T Consensus 22 ~v~~~i~~Ge~~~i~G~nGsGKSTL~~ 48 (235) T COG1122 22 DVSLEIEKGERVLLIGPNGSGKSTLLK 48 (235) T ss_pred CCEEEECCCCEEEEECCCCCCHHHHHH T ss_conf 533898789899998899988999999 No 390 >PRK06217 hypothetical protein; Validated Probab=87.51 E-value=0.34 Score=27.72 Aligned_cols=150 Identities=19% Similarity=0.216 Sum_probs=65.4 Q ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCC-----CHHH--HHH-HHHHHCCCCCCCEEEEEEE Q ss_conf 3365524778861158999999988630246678760699995167-----7678--999-9985114674211589972 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQ-----KRSS--VAR-FVKALEDRGALSYSIVVVA 234 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGe-----r~~e--v~~-~~~~l~~~~~~~~tvvv~a 234 (509) +|+.|+|.+|+|||||+. .+....+ .-.-..|+ .|=.=|. +..+ -.. +.+.+...+. .|.+ T Consensus 2 ~rI~i~G~sGsGkSTla~-~La~~l~--~~~~~lD~---~~W~p~~~pf~~kR~~~eR~~ll~~~~~~~~~-----WV~s 70 (185) T PRK06217 2 MRIHITGASGSGTTTLGA-ALAEALD--LPHLDTDD---FFWLPTDPPFTTKREPEERLRLLLEDLRDSEG-----WILS 70 (185) T ss_pred CEEEEECCCCCCHHHHHH-HHHHHHC--CCEEECCC---CEECCCCCCCCCCCCHHHHHHHHHHHHHCCCC-----EEEE T ss_conf 679997899887899999-9999759--89686455---53568999756437999999999998637999-----8995 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHH----HHHHHHHCCCCCCCCCCC-CCCHHHHHHHHHHH Q ss_conf 8878889999988766666675420468847996164789999----987545303787543337-52001246788887 Q gi|254780617|r 235 SASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVA----YRQLSLLLRRPPGREAYP-GDVFYLHSRLLERA 309 (509) Q Consensus 235 ~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a----~reisl~~~~pp~~~gyp-~~vf~~~s~l~ERa 309 (509) -+-..-+.-.+ | --|.+|++| +-+.-.. .||..- -|.+=.| ||++-.|-.++|=| T Consensus 71 Gs~~~wgd~l~-p-------------~~DlvVfL~-lP~~irl~Rlr~RE~~R-----yG~ri~pgGdm~~~~~~Fl~Wa 130 (185) T PRK06217 71 GSLLGWGDPLE-P-------------LFDLVVFLY-IPPELRLERLRLREEQR-----YGNRILPGGDMHKASLEFLEWA 130 (185) T ss_pred CCCCCCCCCCC-C-------------CCCEEEEEE-CCHHHHHHHHHHHHHHH-----CCCCCCCCCCHHHHHHHHHHHH T ss_conf 77523232113-4-------------356899982-89899999999999986-----0786689941888889999999 Q ss_pred HHCCCCCCCC-CE---------EEEEEEECCCCCCCCCHHHHHHHHH Q ss_conf 5045445887-35---------6753100367664431146688540 Q gi|254780617|r 310 AKMSDALGAG-SL---------TALPVIETQVNDVSAYIPTNVISIT 346 (509) Q Consensus 310 ~~~~~~~g~G-Si---------T~~~~v~~~~~D~~~~i~~~~~si~ 346 (509) ....+..-.| |+ -.-|++..++. -++..++-.|+ T Consensus 131 ~~YD~~~~~gRsl~~He~Wl~~~~cPVlrldg~---~~~~~~~~~vl 174 (185) T PRK06217 131 AQYDNAGPEGRSLAAHEQWLADQSCPVLRLDGD---LTVEERLAQVL 174 (185) T ss_pred HHCCCCCCCCCCHHHHHHHHHHCCCCEEEECCC---CCHHHHHHHHH T ss_conf 867889877644999999995289867997698---98999999999 No 391 >KOG0743 consensus Probab=87.48 E-value=0.83 Score=24.92 Aligned_cols=87 Identities=26% Similarity=0.296 Sum_probs=40.9 Q ss_pred CCHHHHHCCCCHHHCCC-----------CCCCCC---CCCCCCHHCCCCCCCCCC-----CCCH---HHHHCCHHHHHCC Q ss_conf 79469830003002314-----------466444---544410010135773112-----6600---1444160233101 Q gi|254780617|r 99 VGLELLGRVVDALGNPI-----------DGKGPI---KCEQRSCTEADAPGIIQR-----QSVC---EPLSTGIKAIDSL 156 (509) Q Consensus 99 vG~~lLGRVvD~lG~Pl-----------Dg~g~i---~~~~~~~i~~~~p~~~~R-----~~i~---~~l~TGI~~ID~l 156 (509) |-..+|++|+--.-+-. .++..+ ....|+++..+.|...+- .--. +=|+-=++.=|-. T Consensus 148 V~~syl~~v~~~~k~I~~~~r~~kl~t~~~~~~~~~~~~~~W~~v~f~HpstF~TlaMd~~~K~~I~~Dl~~F~k~k~~Y 227 (457) T KOG0743 148 VTLSYLPYVVSKAKEILEENRELKLYTNSGKTVIYTAKGGEWRSVGFPHPSTFETLAMDPDLKERIIDDLDDFIKGKDFY 227 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCHHH T ss_conf 67867888999999999988987776158875112435776125689999874420148667899999999997223578 Q ss_pred CCCCCCCEE--EEECCCCCCCHHHHHHHHHHH Q ss_conf 111568336--552477886115899999998 Q gi|254780617|r 157 IPIGRGQRE--LIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 157 ~pigrGQR~--~I~g~~g~GKt~l~~~~I~nq 186 (509) --+||-=|= |++|++|||||++. -++.|- T Consensus 228 krvGkawKRGYLLYGPPGTGKSS~I-aAmAn~ 258 (457) T KOG0743 228 KRVGKAWKRGYLLYGPPGTGKSSFI-AAMANY 258 (457) T ss_pred HHCCCCCCCCCEEECCCCCCHHHHH-HHHHHH T ss_conf 8648450004120479999888999-999720 No 392 >cd03233 ABC_PDR_domain1 The pleiotropic drug resistance (PDR) family of ATP-binding cassette (ABC) transporters. PDR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. This PDR subfamily represents domain I of its (ABC-IM)2 organization. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=87.48 E-value=0.4 Score=27.27 Aligned_cols=29 Identities=14% Similarity=0.284 Sum_probs=23.8 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+-+|+..+|+|.+|+|||||+ .+|. T Consensus 26 is~~i~~Gei~~llG~nGsGKSTLl-~~l~ 54 (202) T cd03233 26 FSGVVKPGEMVLVLGRPGSGCSTLL-KALA 54 (202) T ss_pred EEEEECCCEEEEEECCCCCCHHHHH-HHHH T ss_conf 0889809849999989999889999-9983 No 393 >PRK10246 exonuclease subunit SbcC; Provisional Probab=87.47 E-value=0.19 Score=29.63 Aligned_cols=31 Identities=32% Similarity=0.410 Sum_probs=21.0 Q ss_pred HHHCC-CCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 33101-111568336552477886115899999 Q gi|254780617|r 152 AIDSL-IPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 152 ~ID~l-~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .||.= -|++...=-+|+|+-|.||||| +|+| T Consensus 19 ~IDFt~~~l~~~gLFlI~G~TGAGKSTI-lDAI 50 (1047) T PRK10246 19 KIDFTAEPFASNGLFAITGPTGAGKTTL-LDAI 50 (1047) T ss_pred EEECCCCCCCCCCEEEEECCCCCCHHHH-HHHH T ss_conf 6748986456688788889999988999-9999 No 394 >PRK13638 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=87.46 E-value=0.39 Score=27.32 Aligned_cols=28 Identities=11% Similarity=0.203 Sum_probs=23.0 Q ss_pred CCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 11115683365524778861158999999 Q gi|254780617|r 156 LIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 156 l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) =+.+-+|+-.+|+|+.|+||||++ .+|. T Consensus 21 sl~i~~Gei~~liGpNGaGKSTLl-k~l~ 48 (271) T PRK13638 21 NLDFSLSPVTGLVGANGCGKSTLF-MNLS 48 (271) T ss_pred EEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 879838979999999998099999-9996 No 395 >cd03269 ABC_putative_ATPase This subfamily is involved in drug resistance, nodulation, lipid transport, and bacteriocin and lantibiotic immunity. In eubacteria and archaea, the typical organization consists of one ABC and one or two IMs. Eukaryote systems of the ABCA subfamily display ABC domains strongly similar to this family. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region in addition to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=87.45 E-value=0.54 Score=26.31 Aligned_cols=36 Identities=25% Similarity=0.411 Sum_probs=27.4 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 02331-01111568336552477886115899999998 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) ..++| .=+.+.+|+-.+++|+.|.||||++ .+|... T Consensus 13 ~~al~~vs~~v~~Gei~gllG~NGaGKTTll-~~i~Gl 49 (210) T cd03269 13 VTALDDISFSVEKGEIFGLLGPNGAGKTTTI-RMILGI 49 (210) T ss_pred EEEECCEEEEECCCCEEEEECCCCCCHHHHH-HHHHCC T ss_conf 9997542678879959999989998499999-999600 No 396 >pfam06068 TIP49 TIP49 C-terminus. This family consists of the C-terminal region of several eukaryotic and archaeal RuvB-like 1 (Pontin or TIP49a) and RuvB-like 2 (Reptin or TIP49b) proteins. The N-terminal domain contains the pfam00004 domain. In zebrafish, the liebeskummer (lik) mutation, causes development of hyperplastic embryonic hearts. lik encodes Reptin, a component of a DNA-stimulated ATPase complex. Beta-catenin and Pontin, a DNA-stimulated ATPase that is often part of complexes with Reptin, are in the same genetic pathways. The Reptin/Pontin ratio serves to regulate heart growth during development, at least in part via the beta-catenin pathway. TBP-interacting protein 49 (TIP49) was originally identified as a TBP-binding protein, and two related proteins are encoded by individual genes, tip49a and b. Although the function of this gene family has not been elucidated, they are supposed to play a critical role in nuclear events because they interact with various kinds of nuclear Probab=87.44 E-value=0.46 Score=26.83 Aligned_cols=21 Identities=38% Similarity=0.580 Sum_probs=18.0 Q ss_pred CCEEEEECCCCCCCHHHHHHH Q ss_conf 833655247788611589999 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDT 182 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~ 182 (509) |+-+++-|++|||||+|++.+ T Consensus 50 GraiLlaGppGTGKTAlA~ai 70 (395) T pfam06068 50 GRAVLIAGPPGTGKTALAIAI 70 (395) T ss_pred CCEEEEECCCCCCHHHHHHHH T ss_conf 738998779998889999999 No 397 >cd03259 ABC_Carb_Solutes_like ABC Carbohydrate and Solute Transporters-like subgroup. This family is comprised of proteins involved in the transport of apparently unrelated solutes and proteins specific for di- and oligosaccharides and polyols. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=87.42 E-value=0.4 Score=27.27 Aligned_cols=33 Identities=21% Similarity=0.460 Sum_probs=25.4 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 02331-01111568336552477886115899999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) ..++| .=+.+-+|+..+|+|++|+||||++ .+| T Consensus 13 ~~al~~vs~~i~~Gei~~iiGpnGaGKSTl~-~~i 46 (213) T cd03259 13 VRALDDLSLTVEPGEFLALLGPSGCGKTTLL-RLI 46 (213) T ss_pred EEEECCCEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 9998461779889989999999997399999-999 No 398 >PRK00454 engB GTPase EngB; Reviewed Probab=87.41 E-value=2.2 Score=21.92 Aligned_cols=40 Identities=25% Similarity=0.441 Sum_probs=28.8 Q ss_pred HHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHH Q ss_conf 441602331011115683365524778861158999999988 Q gi|254780617|r 146 LSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQK 187 (509) Q Consensus 146 l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~ 187 (509) |.++...++-+ |--.+-++.|+|-+.+||||| +..+++++ T Consensus 9 f~~~~~~~~~~-p~~~~p~VaivGrpNvGKSTL-~N~L~g~k 48 (196) T PRK00454 9 FIISAPKIKQL-PPDDGPEIAFAGRSNVGKSSL-INALTNRK 48 (196) T ss_pred EEEECCCHHHC-CCCCCCEEEEECCCCCCHHHH-HHHHHCCC T ss_conf 99925897789-998896899984898889999-99986897 No 399 >TIGR02397 dnaX_nterm DNA polymerase III, subunits gamma and tau; InterPro: IPR012763 This entry represents the well-conserved first N-terminal domain of DnaX, approx. 365 aa. The full-length product of the dnaX gene in Escherichia coli encodes the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the extreme thermophile Thermus thermophilis.; GO: 0003887 DNA-directed DNA polymerase activity, 0005524 ATP binding, 0006260 DNA replication, 0009360 DNA polymerase III complex. Probab=87.39 E-value=0.26 Score=28.62 Aligned_cols=104 Identities=23% Similarity=0.283 Sum_probs=52.3 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHH Q ss_conf 55247788611589999999886302466787606999951677678999998511467421158997288788899999 Q gi|254780617|r 166 LIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQLL 245 (509) Q Consensus 166 ~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~~ 245 (509) +..|+|||||||+| -|++-+- =|- .--|+...+=..-.+-- +...++ .+=+=|-|+... .-.+ T Consensus 40 LF~GpRGtGKTS~A--RIfAKaL-----------NC~-~~~~~PCn~C~~C~~i~-~g~~~D-viEiDAASN~gV-D~IR 102 (363) T TIGR02397 40 LFSGPRGTGKTSIA--RIFAKAL-----------NCQ-GPDGEPCNECESCKEIN-SGSSLD-VIEIDAASNNGV-DDIR 102 (363) T ss_pred EECCCCCCCHHHHH--HHHHHHH-----------CCC-CCCCCCCCCCCHHHHHH-CCCCCC-EEEECCCCCCCH-HHHH T ss_conf 50285997635589--9999986-----------588-78778777750227765-289866-688648656878-8999 Q ss_pred HHHHHHHHHHHHHCCCCCEEEEEEC---HHHHHHHHHHHHHHCCCCCCC Q ss_conf 8876666667542046884799616---478999998754530378754 Q gi|254780617|r 246 APFAGCAMGEYFRDNGYHALIAYDD---LQKHAVAYRQLSLLLRRPPGR 291 (509) Q Consensus 246 ap~~a~aiAEyfr~~G~~VLi~~Dd---ltr~A~a~reisl~~~~pp~~ 291 (509) -- .-=..|---+||-=+.++|- ||+.| +.-+-=.|.|||-. T Consensus 103 ~l---~e~v~y~P~~~kYKvYIIDEVHMLS~~A--FNALLKTLEEPP~h 146 (363) T TIGR02397 103 EL---RENVKYAPSKGKYKVYIIDEVHMLSKSA--FNALLKTLEEPPEH 146 (363) T ss_pred HH---HHHHCCCCCCCCCCEEEEECCCCCCHHH--HHHHHHHHCCCCCC T ss_conf 99---8730368755443358873230286568--99987652279876 No 400 >COG1493 HprK Serine kinase of the HPr protein, regulates carbohydrate metabolism [Signal transduction mechanisms] Probab=87.38 E-value=0.63 Score=25.81 Aligned_cols=29 Identities=31% Similarity=0.444 Sum_probs=22.5 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +|+-| -|.=++|-|+||+|||.+|++.|- T Consensus 139 vl~~I-~GvGVLItG~SG~GKSElALeLi~ 167 (308) T COG1493 139 VLLDI-FGVGVLITGPSGAGKSELALELIK 167 (308) T ss_pred EEEEE-EEEEEEEECCCCCCHHHHHHHHHH T ss_conf 89999-425899987898877689999998 No 401 >PRK10535 macrolide transporter ATP-binding /permease protein; Provisional Probab=87.37 E-value=0.39 Score=27.28 Aligned_cols=22 Identities=27% Similarity=0.531 Sum_probs=9.7 Q ss_pred CCCCCCCEEEEECCCCCCCHHH Q ss_conf 1115683365524778861158 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSI 178 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l 178 (509) +.|.+|...+|+|++|+||||| T Consensus 29 l~i~~Ge~v~i~G~nGsGKSTl 50 (648) T PRK10535 29 LTIYAGEMVAIVGASGSGKSTL 50 (648) T ss_pred EEECCCCEEEEECCCCCHHHHH T ss_conf 9998998999999999629999 No 402 >TIGR01073 pcrA ATP-dependent DNA helicase PcrA; InterPro: IPR005751 The DNA helicase PcrA is found mainly in Gram-positive bacteria and belongs to a superfamily of helicases, together with Rep and UvrD helicases from Escherichia coli. The action of the 3'5' bacterial DNA helicase is based upon two separate but coupled activities, ssDNA translocation and duplex destabilisation, and is driven by energy derived from the continuous ATP-binding and hydrolysis events that take place in the active-site cleft. The resulting conformational changes that accompany these events underpin the coupling process and allow the helicase to translocate along the DNA, destabilising the duplex and separating the two strands in an active process .; GO: 0004003 ATP-dependent DNA helicase activity, 0006268 DNA unwinding during replication, 0005737 cytoplasm. Probab=87.28 E-value=1.3 Score=23.46 Aligned_cols=26 Identities=19% Similarity=0.312 Sum_probs=14.5 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHC Q ss_conf 65524778861158999999988630 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSH 190 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~ 190 (509) .||+||+|+|||.+++.-|.+=...+ T Consensus 20 LLi~AGAGSGKTRVLThRIA~L~~e~ 45 (811) T TIGR01073 20 LLIMAGAGSGKTRVLTHRIAHLLAEK 45 (811) T ss_pred CEEEECCCCCCCHHHHHHHHHHHHHC T ss_conf 10101378873105789999999722 No 403 >PRK09518 bifunctional cytidylate kinase/GTP-binding protein; Reviewed Probab=87.14 E-value=1.6 Score=22.81 Aligned_cols=56 Identities=18% Similarity=0.134 Sum_probs=33.9 Q ss_pred HHHHHHHHCCCCCEEEEE---ECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 666675420468847996---16478999998754530378754333752001246788887 Q gi|254780617|r 251 CAMGEYFRDNGYHALIAY---DDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERA 309 (509) Q Consensus 251 ~aiAEyfr~~G~~VLi~~---Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa 309 (509) ..+|+++|-.||.|+++. |+......++.=-+|-+++|----+.-| .-...|++.. T Consensus 377 ~~ia~~LRk~~KpvilvvNK~D~~~~e~~~~ef~~LG~~e~~~ISA~Hg---~G~~dLld~i 435 (714) T PRK09518 377 EVIARMLQRAGKPVVLVANKIDDQASEYDVAEFWKLGLGEPYSISAMHG---RGVADLLDVV 435 (714) T ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHCCCCCCEEEECCCC---CCHHHHHHHH T ss_conf 9999999856998899998978876401299999659999689847357---8989999999 No 404 >TIGR02982 heterocyst_DevA ABC exporter ATP-binding subunit, DevA family; InterPro: IPR014324 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents the ATP-binding subunit DevA, found mostly in the Cyanobacteria, but also in the Planctomycetes. Cyanobacterial examples are involved in heterocyst formation, by which some fraction of members of the colony undergo a developmental change and become capable of nitrogen fixation. The ABC transporter encoded by the devBCA operon is induced by nitrogen deficiency and is necessary for the formation of the laminated layer which envelops heterocysts , . It is thought to be involved in the export of either the heterocyst-specific glycolipids found in the laminated layer, or an enzyme essential for their formation.. Probab=87.11 E-value=0.44 Score=26.97 Aligned_cols=173 Identities=17% Similarity=0.257 Sum_probs=80.5 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEE--ECCCCHHHHHHHHHH----HCCCCCCC Q ss_conf 101111568336552477886115899999998863024667876069999--516776789999985----11467421 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYV--AIGQKRSSVARFVKA----LEDRGALS 227 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~--~IGer~~ev~~~~~~----l~~~~~~~ 227 (509) |.=+.|-.|-=+.+-|+||+|||||+ +.+---..-+ +.-.-|+. +-|-...+..++.+. |..+..+ T Consensus 23 di~L~i~~GEiViltGPSGSGKTTLL--tLiG~LR~~Q-----~G~L~vlg~~L~ga~~~~l~~~RR~iGyIFQ~HNLl- 94 (220) T TIGR02982 23 DINLEINPGEIVILTGPSGSGKTTLL--TLIGGLRSVQ-----EGSLKVLGQELKGASKKELVQVRRNIGYIFQAHNLL- 94 (220) T ss_pred CCCEEECCCEEEEEECCCCCCHHHHH--HHHHHHCCCC-----CCEEEEECCHHCCCCHHHHHHHHHHCCCEECCCHHC- T ss_conf 76317717647984378898468899--9887625655-----560478220102678889999987639144120001- Q ss_pred EEEEEEECCCCCHHHHH-----HHHHHHHHHHHHHH-C--CCCCEEEEEECHH-----HH--HHHHHHH--HHHCCCCCC Q ss_conf 15899728878889999-----98876666667542-0--4688479961647-----89--9999875--453037875 Q gi|254780617|r 228 YSIVVVASASDPAPMQL-----LAPFAGCAMGEYFR-D--NGYHALIAYDDLQ-----KH--AVAYRQL--SLLLRRPPG 290 (509) Q Consensus 228 ~tvvv~a~a~~~~~~r~-----~ap~~a~aiAEyfr-~--~G~~VLi~~Ddlt-----r~--A~a~rei--sl~~~~pp~ 290 (509) =+=||.+..-+-. ..+....+-|+-+- . -|+|.=-.=++|| |- |+|+=.= =++..||-+ T Consensus 95 ----~~LTA~QNVqM~~eL~~~~~~~~~~~~a~~~L~~VGL~~~~~y~P~~LSGGQKQRVAIARALv~~P~LvLADEPTA 170 (220) T TIGR02982 95 ----GFLTARQNVQMALELQPNLSAQEAREKARAMLEAVGLGDRLDYYPENLSGGQKQRVAIARALVARPKLVLADEPTA 170 (220) T ss_pred ----HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCEEECCCCCH T ss_conf ----0001778886489887611688999999999986060125540524367861689999999733897676257723 Q ss_pred C-CCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEE Q ss_conf 4-33375200124678888750454458873567531003676644311466885406169 Q gi|254780617|r 291 R-EAYPGDVFYLHSRLLERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQI 350 (509) Q Consensus 291 ~-~gyp~~vf~~~s~l~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i 350 (509) . +.-.| -..=.|+.+-|+ +. | -|.+ +|+- .+-|-| |+|-++.+=||++ T Consensus 171 ALD~~SG---r~VV~Lm~~lA~---eq--G-c~iL-~VTH-D~RIlD-vADRI~~MEdG~L 219 (220) T TIGR02982 171 ALDSKSG---RDVVELMQKLAR---EQ--G-CTIL-LVTH-DNRILD-VADRIVQMEDGKL 219 (220) T ss_pred HHHHCCC---HHHHHHHHHHHH---HH--C-CEEE-EEEC-CCCHHH-HHHHHHHCCCCCC T ss_conf 3221133---899999999887---71--9-8899-9836-731200-6544421168635 No 405 >PRK13541 cytochrome c biogenesis protein CcmA; Provisional Probab=87.06 E-value=0.56 Score=26.20 Aligned_cols=32 Identities=22% Similarity=0.255 Sum_probs=26.6 Q ss_pred HHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 310111156833655247788611589999999 Q gi|254780617|r 153 IDSLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 153 ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .|.=+++-+|+-.+|.|..|+|||||+ .+|.. T Consensus 17 ~disl~i~~G~i~~i~G~NGsGKSTLl-k~i~G 48 (195) T PRK13541 17 FDLSITFLPSAITYIKGANGCGKSSLL-RMIAG 48 (195) T ss_pred EEEEEEECCCCEEEEECCCCCCHHHHH-HHHHC T ss_conf 977778779979999999998199999-99967 No 406 >TIGR03410 urea_trans_UrtE urea ABC transporter, ATP-binding protein UrtE. Members of this protein family are ABC transporter ATP-binding subunits associated with urea transport and metabolism. This protein is found in a conserved five-gene transport operon typically found adjacent to urease genes. It was shown in Cyanobacteria that disruption leads to the loss of high-affinity urea transport activity. Probab=87.05 E-value=0.43 Score=27.04 Aligned_cols=28 Identities=21% Similarity=0.523 Sum_probs=23.9 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 01111568336552477886115899999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .=+.+.+|+-.+|+|+.|+|||||+ .+| T Consensus 19 vs~~v~~Gei~~liGpNGaGKSTL~-~~i 46 (230) T TIGR03410 19 VSLEVPKGEVTCVLGRNGVGKTTLL-KTL 46 (230) T ss_pred EEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 1779999979999999994099999-999 No 407 >cd03218 ABC_YhbG The ABC transporters belonging to the YhbG family are similar to members of the Mj1267_LivG family, which is involved in the transport of branched-chain amino acids. The genes yhbG and yhbN are located in a single operon and may function together in cell envelope during biogenesis. YhbG is the putative ATP-binding cassette component and YhbN is the putative periplasmic-binding protein. Depletion of each gene product leads to growth arrest, irreversible cell damage and loss of viability in E. coli. The YhbG homolog (NtrA) is essential in Rhizobium meliloti, a symbiotic nitrogen-fixing bacterium. Probab=87.00 E-value=0.55 Score=26.23 Aligned_cols=34 Identities=18% Similarity=0.338 Sum_probs=26.3 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 02331-011115683365524778861158999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) ..++| .=+.+.+|+-.+|+|+.|.||||++ .+|. T Consensus 13 ~~~l~~vs~~v~~Gei~~llGpNGAGKSTll-~~i~ 47 (232) T cd03218 13 RKVVNGVSLSVKQGEIVGLLGPNGAGKTTTF-YMIV 47 (232) T ss_pred EEEECCEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 9988260679899959999999996199999-9997 No 408 >PRK06893 DNA replication initiation factor; Validated Probab=86.98 E-value=1.6 Score=22.87 Aligned_cols=27 Identities=19% Similarity=0.300 Sum_probs=19.9 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 6833655247788611589999999886 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKS 188 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~ 188 (509) .++-.-|.|++|+|||.|+ .++.|+.. T Consensus 38 ~~~~l~i~G~~gsGKTHLL-qa~~~~~~ 64 (229) T PRK06893 38 KQPFFYIWGGKSSGKSHLL-KAVSNHYL 64 (229) T ss_pred CCCEEEEECCCCCCHHHHH-HHHHHHHH T ss_conf 6987999899999889999-99999999 No 409 >COG4181 Predicted ABC-type transport system involved in lysophospholipase L1 biosynthesis, ATPase component [Secondary metabolites biosynthesis, transport, and catabolism] Probab=86.96 E-value=0.46 Score=26.82 Aligned_cols=27 Identities=26% Similarity=0.602 Sum_probs=22.4 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 11156833655247788611589999999 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) +-+-+|+.++|+|.+|.|||+|+ +++. T Consensus 31 L~v~~Ge~vaivG~SGSGKSTLl--~vlA 57 (228) T COG4181 31 LVVKRGETVAIVGPSGSGKSTLL--AVLA 57 (228) T ss_pred EEECCCCEEEEECCCCCCHHHHH--HHHH T ss_conf 78627865888778887577699--9995 No 410 >PRK13544 consensus Probab=86.95 E-value=0.44 Score=26.90 Aligned_cols=29 Identities=24% Similarity=0.400 Sum_probs=23.6 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+.+-+|+..+|.|+.|+|||||+ .+|. T Consensus 20 vs~~i~~Gei~~l~G~NGsGKSTLl-~~i~ 48 (208) T PRK13544 20 LSFTAKQNSLTLVIGNNGSGKTSLL-RLLA 48 (208) T ss_pred CEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 1589829949999999999899999-9995 No 411 >PTZ00243 ABC transporter; Provisional Probab=86.82 E-value=0.46 Score=26.78 Aligned_cols=34 Identities=21% Similarity=0.676 Sum_probs=26.6 Q ss_pred HHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 0233101111568336552477886115899999998 Q gi|254780617|r 150 IKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 150 I~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) ++-|| +.+-+||-..|+|.-|+|||+|+ .+|+.. T Consensus 676 L~din--l~v~~G~L~~IvG~vGSGKSSLL-~aiLGE 709 (1560) T PTZ00243 676 LRDVS--VSVPRGKLTVVLGATGSGKSTLL-QSLLSQ 709 (1560) T ss_pred CCCEE--EEECCCCEEEEECCCCCCHHHHH-HHHHCC T ss_conf 01305--88659978999899998799999-999688 No 412 >PRK10261 glutathione transporter ATP-binding protein; Provisional Probab=86.78 E-value=0.46 Score=26.81 Aligned_cols=20 Identities=30% Similarity=0.391 Sum_probs=9.0 Q ss_pred CCCHHHHHHHHHHHHHHHHH Q ss_conf 98989999999999999851 Q gi|254780617|r 488 VLTDDIRSKLINEIKVFLED 507 (509) Q Consensus 488 ~l~de~~~~L~~~i~~~~~~ 507 (509) .|+-..++.+-+.+.+..+. T Consensus 493 ~LDv~~qa~il~Ll~~L~~~ 512 (623) T PRK10261 493 ALDVSIRGQIINLLLDLQRD 512 (623) T ss_pred CCCHHHHHHHHHHHHHHHHH T ss_conf 67999999999999999997 No 413 >cd04134 Rho3 Rho3 subfamily. Rho3 is a member of the Rho family found only in fungi. Rho3 is believed to regulate cell polarity by interacting with the diaphanous/formin family protein For3 to control both the actin cytoskeleton and microtubules. Rho3 is also believed to have a direct role in exocytosis that is independent of its role in regulating actin polarity. The function in exocytosis may be two-pronged: first, in the transport of post-Golgi vesicles from the mother cell to the bud, mediated by myosin (Myo2); second, in the docking and fusion of vesicles to the plasma membrane, mediated by an exocyst (Exo70) protein. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. Probab=86.76 E-value=0.38 Score=27.39 Aligned_cols=22 Identities=27% Similarity=0.410 Sum_probs=18.2 Q ss_pred CEEEEECCCCCCCHHHHHHHHH Q ss_conf 3365524778861158999999 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~~I~ 184 (509) +|+.++|++|||||+|+.-.+- T Consensus 1 ~KivlvGd~~VGKTsli~r~~~ 22 (189) T cd04134 1 RKVVVLGDGACGKTSLLNVFTR 22 (189) T ss_pred CEEEEECCCCCCHHHHHHHHHC T ss_conf 9899999799769999999970 No 414 >cd03263 ABC_subfamily_A The ABCA subfamily mediates the transport of a variety of lipid compounds. Mutations of members of ABCA subfamily are associated with human genetic diseases, such as, familial high-density lipoprotein (HDL) deficiency, neonatal surfactant deficiency, degenerative retinopathies, and congenital keratinization disorders. The ABCA1 protein is involved in disorders of cholesterol transport and high-density lipoprotein (HDL) biosynthesis. The ABCA4 (ABCR) protein transports vitamin A derivatives in the outer segments of photoreceptor cells, and therefore, performs a crucial step in the visual cycle. The ABCA genes are not present in yeast. However, evolutionary studies of ABCA genes indicate that they arose as transporters that subsequently duplicated and that certain sets of ABCA genes were lost in different eukaryotic lineages. Probab=86.73 E-value=0.46 Score=26.77 Aligned_cols=33 Identities=21% Similarity=0.372 Sum_probs=25.5 Q ss_pred HHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 331-0111156833655247788611589999999 Q gi|254780617|r 152 AID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 152 ~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) ++| .=+.+-+|+-.+|+|+.|.|||||+ .+|.. T Consensus 17 aL~~is~~i~~Gei~~llG~NGaGKSTLl-~~i~G 50 (220) T cd03263 17 AVDDLSLNVYKGEIFGLLGHNGAGKTTTL-KMLTG 50 (220) T ss_pred EEECCEEEECCCCEEEEECCCCCCHHHHH-HHHHC T ss_conf 98440889849959999989997399999-99966 No 415 >PRK00635 excinuclease ABC subunit A; Provisional Probab=86.71 E-value=0.67 Score=25.59 Aligned_cols=37 Identities=27% Similarity=0.533 Sum_probs=30.5 Q ss_pred CCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 160233101111568336552477886115899999998 Q gi|254780617|r 148 TGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 148 TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq 186 (509) -.+|-||.=+|+ |.=..|-|-+|.|||||..|++... T Consensus 609 nNLk~i~v~iPl--g~l~~VTGVSGSGKSTLi~~~L~~~ 645 (1809) T PRK00635 609 HNLKDLTISLPL--GRLTVVTGVSGSGKSSLINDTLVPA 645 (1809) T ss_pred CCCCCCCEEECC--CCEEEEECCCCCCHHHHHHHHHHHH T ss_conf 169875535069--8789995889999788488899999 No 416 >cd01134 V_A-ATPase_A V/A-type ATP synthase catalytic subunit A. These ATPases couple ATP hydrolysis to the build up of a H+ gradient, but V-type ATPases do not catalyze the reverse reaction. The Vacuolar (V-type) ATPase is found in the membranes of vacuoles, the golgi apparatus and in other coated vesicles in eukaryotes. Archaea have a protein which is similar in sequence to V-ATPases, but functions like an F-ATPase (called A-ATPase). A similar protein is also found in a few bacteria. Probab=86.70 E-value=0.13 Score=30.75 Aligned_cols=35 Identities=26% Similarity=0.325 Sum_probs=28.2 Q ss_pred EEEEECCHHHHHCCCCHHHCCCCCCCCCCCCCCCH Q ss_conf 78866794698300030023144664445444100 Q gi|254780617|r 94 IVDVPVGLELLGRVVDALGNPIDGKGPIKCEQRSC 128 (509) Q Consensus 94 ~~~vpvG~~lLGRVvD~lG~PlDg~g~i~~~~~~~ 128 (509) +++|..|+++|+.|+|++.|||++.......++.+ T Consensus 1 pLSveLGPGll~~I~DGIqRPL~~l~e~~~g~fi~ 35 (369) T cd01134 1 PLSVELGPGLLGSIYDGIQRPLDKIAEITKGIFIP 35 (369) T ss_pred CCCEEECCCEECCCCCCCCCCHHHHHHHHCCCEEC T ss_conf 96089689557120313566466887764795017 No 417 >COG3840 ThiQ ABC-type thiamine transport system, ATPase component [Coenzyme metabolism] Probab=86.70 E-value=0.46 Score=26.79 Aligned_cols=29 Identities=17% Similarity=0.356 Sum_probs=23.0 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .=+++-.|++..|+|++|.|||||+ ..|. T Consensus 18 fdl~v~~ge~vAi~GpSGaGKSTLL-nLIA 46 (231) T COG3840 18 FDLTVPAGEIVAILGPSGAGKSTLL-NLIA 46 (231) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 8876067857999778886578899-9987 No 418 >cd03213 ABCG_EPDR ABCG transporters are involved in eye pigment (EP) precursor transport, regulation of lipid-trafficking mechanisms, and pleiotropic drug resistance (DR). DR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. Compared to other members of the ABC transporter subfamilies, the ABCG transporter family is composed of proteins that have an ATP-binding cassette domain at the N-terminus and a TM (transmembrane) domain at the C-terminus. Probab=86.70 E-value=0.43 Score=26.99 Aligned_cols=37 Identities=16% Similarity=0.339 Sum_probs=26.7 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 1602331-0111156833655247788611589999999 Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) +|-++++ .=+.+-.|+..+|+|++|+|||||+ .+|.. T Consensus 20 ~~k~iL~~vs~~v~~Gei~~ilGpnGaGKSTLl-~~l~G 57 (194) T cd03213 20 SGKQLLKNVSGKAKPGELTAIMGPSGAGKSTLL-NALAG 57 (194) T ss_pred CCCEEEECCEEEEECCEEEEEECCCCCHHHHHH-HHHHC T ss_conf 999878883889908819999989995199999-99857 No 419 >PRK00149 dnaA chromosomal replication initiation protein; Reviewed Probab=86.69 E-value=0.66 Score=25.65 Aligned_cols=103 Identities=28% Similarity=0.344 Sum_probs=54.6 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHH Q ss_conf 65524778861158999999988630246678760699995167767899999851146742115899728878889999 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQL 244 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r~ 244 (509) .-|.|++|+|||.| +.+|-|..... .+...++|+-- |.. +.+|+..++... ... T Consensus 148 LfIyG~~GlGKTHL-l~AIgn~~~~~-----~p~~~v~Y~ta-e~F--~~~~v~al~~~~----------------~~~- 201 (447) T PRK00149 148 LFIYGGVGLGKTHL-LHAIGNYILEK-----NPNAKVVYVSS-EKF--TNDFVKALRNNA----------------MEE- 201 (447) T ss_pred EEEECCCCCCHHHH-HHHHHHHHHHH-----CCCCEEEEEEH-HHH--HHHHHHHHHCCC----------------HHH- T ss_conf 58977998878899-99999999985-----89972899549-999--999999985186----------------999- Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHH---H--------------HHHHHHCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 988766666675420468847996164789999---9--------------87545303787543337520012467888 Q gi|254780617|r 245 LAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVA---Y--------------RQLSLLLRRPPGREAYPGDVFYLHSRLLE 307 (509) Q Consensus 245 ~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a---~--------------reisl~~~~pp~~~gyp~~vf~~~s~l~E 307 (509) .-++| +.-||| ++||+--++-- + ++|-+...+||. ++-.+..||.- T Consensus 202 --------Fr~~y--r~~DvL-liDDiqfl~gk~~tqeeff~~fn~l~~~~kqiv~tsd~~P~------~l~~l~~rL~S 264 (447) T PRK00149 202 --------FKEKY--RSVDVL-LIDDIQFLAGKEKTQEEFFHTFNALHENNKQIVITSDRPPK------ELEGLEDRLRS 264 (447) T ss_pred --------HHHHH--HCCCEE-EECHHHHHHCCHHHHHHHHHHHHHHHHCCCEEEEECCCCHH------HCCCCCHHHHH T ss_conf --------99999--728854-32148886055779999999999999849968995788967------65651177886 Q ss_pred HHH Q ss_conf 875 Q gi|254780617|r 308 RAA 310 (509) Q Consensus 308 Ra~ 310 (509) |-. T Consensus 265 Rf~ 267 (447) T PRK00149 265 RFE 267 (447) T ss_pred HHH T ss_conf 763 No 420 >TIGR01187 potA polyamine ABC transporter, ATP-binding protein; InterPro: IPR005893 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family comprises the spermidine/putrescine ABC transporter, ATP binding subunit in bacteria and its equivalents in archaea. This transport system belongs to the larger ATP-Binding Cassette (ABC) transporter superfamily. Polyamines like spermidine and putrescine play a vital role in cell proliferation, differentiation, and ion homeostasis. The concentration of polyamines within the cell are regulated by biosynthesis, degradation and transport (uptake and efflux included).; GO: 0015417 polyamine-transporting ATPase activity, 0015846 polyamine transport, 0016020 membrane. Probab=86.68 E-value=0.27 Score=28.46 Aligned_cols=14 Identities=29% Similarity=0.641 Sum_probs=12.3 Q ss_pred EECCCCCCCHHHHH Q ss_conf 52477886115899 Q gi|254780617|r 167 IIGDRKTGKTSIIL 180 (509) Q Consensus 167 I~g~~g~GKt~l~~ 180 (509) |+|+||||||||+. T Consensus 1 LLGpSGcGKTTlLr 14 (331) T TIGR01187 1 LLGPSGCGKTTLLR 14 (331) T ss_pred CCCCCCCCHHHHHH T ss_conf 97888874799999 No 421 >COG1201 Lhr Lhr-like helicases [General function prediction only] Probab=86.64 E-value=1.5 Score=23.10 Aligned_cols=19 Identities=16% Similarity=-0.097 Sum_probs=12.5 Q ss_pred EEEEECHHHHHHHHHHHHH Q ss_conf 7996164789999987545 Q gi|254780617|r 265 LIAYDDLQKHAVAYRQLSL 283 (509) Q Consensus 265 Li~~Ddltr~A~a~reisl 283 (509) ..-.+++..|.+-+++.+. T Consensus 576 ~~~~~~i~~~~~eq~~~~~ 594 (814) T COG1201 576 ENAAKAIIQYLREQREYSG 594 (814) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 2447899999999987347 No 422 >cd03219 ABC_Mj1267_LivG_branched The Mj1267/LivG ABC transporter subfamily is involved in the transport of the hydrophobic amino acids leucine, isoleucine and valine. MJ1267 is a branched-chain amino acid transporter with 29% similarity to both the LivF and LivG components of the E. coli branched-chain amino acid transporter. MJ1267 contains an insertion from residues 114 to 123 characteristic of LivG (Leucine-Isoleucine-Valine) homologs. The branched-chain amino acid transporter from E. coli comprises a heterodimer of ABCs (LivF and LivG), a heterodimer of six-helix TM domains (LivM and LivH), and one of two alternative soluble periplasmic substrate binding proteins (LivK or LivJ). Probab=86.53 E-value=0.48 Score=26.64 Aligned_cols=35 Identities=26% Similarity=0.467 Sum_probs=26.6 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 602331-011115683365524778861158999999 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +..++| .=+.+.+|+-.+|+|..|.|||||+ .+|. T Consensus 12 ~~~aL~~vsl~i~~Gei~gliG~nGaGKSTL~-~~i~ 47 (236) T cd03219 12 GLVALDDVSFSVRPGEIHGLIGPNGAGKTTLF-NLIS 47 (236) T ss_pred CEEEECCEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 99987233899889989999989997399999-9996 No 423 >cd03216 ABC_Carb_Monos_I This family represents the domain I of the carbohydrate uptake proteins that transport only monosaccharides (Monos). The Carb_Monos family is involved in the uptake of monosaccharides, such as pentoses (such as xylose, arabinose, and ribose) and hexoses (such as xylose, arabinose, and ribose), that cannot be broken down to simple sugars by hydrolysis. Pentoses include xylose, arabinose, and ribose. Important hexoses include glucose, galactose, and fructose. In members of the Carb_monos family, the single hydrophobic gene product forms a homodimer while the ABC protein represents a fusion of two nucleotide-binding domains. However, it is assumed that two copies of the ABC domains are present in the assembled transporter. Probab=86.46 E-value=0.5 Score=26.53 Aligned_cols=34 Identities=24% Similarity=0.601 Sum_probs=26.2 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 02331-011115683365524778861158999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) ..++| .=+.+.+|+-.+|+|..|.||||++ .+|. T Consensus 13 ~~aL~~vsl~i~~Gei~~lvG~nGaGKSTl~-~~i~ 47 (163) T cd03216 13 VKALDGVSLSVRRGEVHALLGENGAGKSTLM-KILS 47 (163) T ss_pred EEEECCEEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 9988554889879989999988998999999-9995 No 424 >pfam08477 Miro Miro-like protein. Mitochondrial Rho proteins (Miro-1 and Miro-2), are atypical Rho GTPases. They have a unique domain organisation, with tandem GTP-binding domains and two EF hand domains (pfam00036), that may bind calcium. They are also larger than classical small GTPases. It has been proposed that they are involved in mitochondrial homeostasis and apoptosis. Probab=86.44 E-value=0.39 Score=27.30 Aligned_cols=21 Identities=29% Similarity=0.578 Sum_probs=17.2 Q ss_pred EEEEECCCCCCCHHHHHHHHH Q ss_conf 365524778861158999999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~ 184 (509) |+.++|++|+|||+|+.-.+- T Consensus 1 KivvvG~~~vGKTSLi~r~~~ 21 (118) T pfam08477 1 KVVVIGDKGSGKSSLLSQLVG 21 (118) T ss_pred CEEEECCCCCCHHHHHHHHHH T ss_conf 999999899789999999983 No 425 >cd01875 RhoG RhoG subfamily. RhoG is a GTPase with high sequence similarity to members of the Rac subfamily, including the regions involved in effector recognition and binding. However, RhoG does not bind to known Rac1 and Cdc42 effectors, including proteins containing a Cdc42/Rac interacting binding (CRIB) motif. Instead, RhoG interacts directly with Elmo, an upstream regulator of Rac1, in a GTP-dependent manner and forms a ternary complex with Dock180 to induce activation of Rac1. The RhoG-Elmo-Dock180 pathway is required for activation of Rac1 and cell spreading mediated by integrin, as well as for neurite outgrowth induced by nerve growth factor. Thus RhoG activates Rac1 through Elmo and Dock180 to control cell morphology. RhoG has also been shown to play a role in caveolar trafficking and has a novel role in signaling the neutrophil respiratory burst stimulated by G protein-coupled receptor (GPCR) agonists. Most Rho proteins contain a lipid modification site at the C-termin Probab=86.33 E-value=0.43 Score=26.98 Aligned_cols=21 Identities=24% Similarity=0.474 Sum_probs=17.5 Q ss_pred EEEEECCCCCCCHHHHHHHHH Q ss_conf 365524778861158999999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~ 184 (509) |+.++||+|||||+|+.-.+- T Consensus 5 KivlvGd~~VGKTsli~r~~~ 25 (191) T cd01875 5 KCVVVGDGAVGKTCLLICYTT 25 (191) T ss_pred EEEEECCCCCCHHHHHHHHHH T ss_conf 999999999899999999972 No 426 >PRK11054 helD DNA helicase IV; Provisional Probab=86.26 E-value=1.8 Score=22.48 Aligned_cols=29 Identities=17% Similarity=0.170 Sum_probs=22.8 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHC Q ss_conf 83365524778861158999999988630 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKSSH 190 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~~~ 190 (509) ....+|+||+|+|||+.++.-|..-.... T Consensus 209 ed~~LVLAGAGSGKT~vLt~RiayLI~~g 237 (684) T PRK11054 209 EDSLLVLAGAGSGKTSVLVARAGWLLARG 237 (684) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHCC T ss_conf 99648983389970779999999999759 No 427 >cd03278 ABC_SMC_barmotin Barmotin is a tight junction-associated protein expressed in rat epithelial cells which is thought to have an important regulatory role in tight junction barrier function. Barmotin belongs to the SMC protein family. SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, w Probab=86.23 E-value=0.64 Score=25.76 Aligned_cols=26 Identities=38% Similarity=0.657 Sum_probs=20.2 Q ss_pred CCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1115683365524778861158999999 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +.+-.| -..|+|++|+|||||+ ++|. T Consensus 18 l~ip~G-itaIvGpsGsGKSTLl-~~i~ 43 (197) T cd03278 18 IPFPPG-LTAIVGPNGSGKSNII-DAIR 43 (197) T ss_pred EECCCC-EEEEECCCCCCHHHHH-HHHH T ss_conf 973898-2899999999889999-9999 No 428 >cd01871 Rac1_like Rac1-like subfamily. The Rac1-like subfamily consists of Rac1, Rac2, and Rac3 proteins, plus the splice variant Rac1b that contains a 19-residue insertion near switch II relative to Rac1. While Rac1 is ubiquitously expressed, Rac2 and Rac3 are largely restricted to hematopoietic and neural tissues respectively. Rac1 stimulates the formation of actin lamellipodia and membrane ruffles. It also plays a role in cell-matrix adhesion and cell anoikis. In intestinal epithelial cells, Rac1 is an important regulator of migration and mediates apoptosis. Rac1 is also essential for RhoA-regulated actin stress fiber and focal adhesion complex formation. In leukocytes, Rac1 and Rac2 have distinct roles in regulating cell morphology, migration, and invasion, but are not essential for macrophage migration or chemotaxis. Rac3 has biochemical properties that are closely related to Rac1, such as effector interaction, nucleotide binding, and hydrolysis; Rac2 has a slower nucleoti Probab=86.20 E-value=0.44 Score=26.92 Aligned_cols=22 Identities=27% Similarity=0.566 Sum_probs=17.9 Q ss_pred EEEEECCCCCCCHHHHHHHHHH Q ss_conf 3655247788611589999999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~n 185 (509) |+.++|++|||||+|+.-.+-| T Consensus 3 KivlvGd~~VGKTsli~r~~~~ 24 (174) T cd01871 3 KCVVVGDGAVGKTCLLISYTTN 24 (174) T ss_pred EEEEECCCCCCHHHHHHHHHHC T ss_conf 9999899998699999999739 No 429 >TIGR02204 MsbA_rel ABC transporter, permease/ATP-binding protein; InterPro: IPR011918 This protein is related to a Proteobacterial ATP transporter that exports lipid A and to eukaryotic P-glycoproteins .. Probab=86.18 E-value=0.94 Score=24.53 Aligned_cols=113 Identities=14% Similarity=0.224 Sum_probs=67.7 Q ss_pred HHHHHCCHHHHHCC-CCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHC Q ss_conf 01444160233101-11156833655247788611589999999886302466787606999951677678999998511 Q gi|254780617|r 143 CEPLSTGIKAIDSL-IPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALE 221 (509) Q Consensus 143 ~~~l~TGI~~ID~l-~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~ 221 (509) +-|=.+-.+++|.+ +++--|-++-|.|+||-||||| .++++.- =++...-| -.-+|--|.-+=.+|.+.+ T Consensus 346 ~YP~RPd~~aL~~~~L~v~PGEtvAlVGPSGAGKSTl-f~LLLRF------YDP~~G~i-lLDGvd~r~~dP~~lR~~~- 416 (576) T TIGR02204 346 AYPARPDQPALDGLNLTVRPGETVALVGPSGAGKSTL-FQLLLRF------YDPQSGRI-LLDGVDIRDLDPADLRARI- 416 (576) T ss_pred CCCCCCCCHHHCCCCCEECCCCEEEEECCCCCHHHHH-HHHHHHC------CCCCCCEE-EECCCCHHHCCCHHHHHHC- T ss_conf 5879888042325772562776588766887627999-9999860------48887657-7466414016808788344- Q ss_pred CCCCCCEEEEEEECCCC--------CHHHHHHHHHHHHHHHHHHH--CCCCCEE Q ss_conf 46742115899728878--------88999998876666667542--0468847 Q gi|254780617|r 222 DRGALSYSIVVVASASD--------PAPMQLLAPFAGCAMGEYFR--DNGYHAL 265 (509) Q Consensus 222 ~~~~~~~tvvv~a~a~~--------~~~~r~~ap~~a~aiAEyfr--~~G~~VL 265 (509) .=.-..++++++++-| ..-++..++.-+....||-+ .+|+|.. T Consensus 417 -ALVpQdp~lFa~Sv~eNIRyGrpDAs~~~V~~AA~aA~A~EFI~~LP~GY~T~ 469 (576) T TIGR02204 417 -ALVPQDPALFAASVLENIRYGRPDASDEEVEAAARAAHAHEFISALPEGYDTY 469 (576) T ss_pred -CCCCCCCCCCHHCCHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf -40258864102030442135996666689999998754101565188754676 No 430 >COG3451 VirB4 Type IV secretory pathway, VirB4 components [Intracellular trafficking and secretion] Probab=86.08 E-value=1.8 Score=22.55 Aligned_cols=49 Identities=31% Similarity=0.352 Sum_probs=36.3 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC Q ss_conf 36552477886115899999998863024667876069999516776789999985114 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED 222 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~ 222 (509) -..|+|.+|.|||+| +..+++|..+... -+|++. -|.+...-|++.+.. T Consensus 438 hT~I~G~tGaGKTvL-l~~lla~~~k~~~-------~~iv~f--Dk~~g~~~~~~a~gG 486 (796) T COG3451 438 HTLIIGPTGAGKTVL-LSFLLAQALKYGN-------PQIVAF--DKDNGAYIFIEALGG 486 (796) T ss_pred CEEEECCCCCCHHHH-HHHHHHHHHHHCC-------CCEEEE--CCCCCHHHHHHHHCC T ss_conf 749988988878999-9999999987459-------818998--489735778887498 No 431 >COG2874 FlaH Predicted ATPases involved in biogenesis of archaeal flagella [Cell motility and secretion / Intracellular trafficking and secretion] Probab=85.98 E-value=2.6 Score=21.39 Aligned_cols=157 Identities=18% Similarity=0.148 Sum_probs=82.7 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC--CCCC----CEEEEEEE Q ss_conf 68336552477886115899999998863024667876069999516776789999985114--6742----11589972 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED--RGAL----SYSIVVVA 234 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~--~~~~----~~tvvv~a 234 (509) -||=++|.||.|+|||.|+..... ....+...+ -|+.- ..++++|+..... .+.. .-.-.++. T Consensus 27 ~GsL~lIEGd~~tGKSvLsqr~~Y------G~L~~g~~v--~yvsT---e~T~refi~qm~sl~ydv~~~~l~G~l~~~~ 95 (235) T COG2874 27 VGSLILIEGDNGTGKSVLSQRFAY------GFLMNGYRV--TYVST---ELTVREFIKQMESLSYDVSDFLLSGRLLFFP 95 (235) T ss_pred CCEEEEEECCCCCCHHHHHHHHHH------HHHHCCCEE--EEEEE---CHHHHHHHHHHHHCCCCCHHHHHCCEEEEEE T ss_conf 676999988898548899999999------887089548--99984---0359999998886388716877506268999 Q ss_pred ----CCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf ----8878889999988766666675420468847996164789999987545303787543337520012467888875 Q gi|254780617|r 235 ----SASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAA 310 (509) Q Consensus 235 ----~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~ 310 (509) +.+...-.+-. .=--+.|+-+-..+||+ ++||+|-+|.--.+ -..-.+++.+- T Consensus 96 ~~~~~~~~~~~~~~~---~L~~l~~~~k~~~~dVi-IIDSls~~~~~~~~-------------------~~vl~fm~~~r 152 (235) T COG2874 96 VNLEPVNWGRRSARK---LLDLLLEFIKRWEKDVI-IIDSLSAFATYDSE-------------------DAVLNFMTFLR 152 (235) T ss_pred ECCCCCCCCHHHHHH---HHHHHHHHHHHHCCCEE-EEECCCHHHHCCCH-------------------HHHHHHHHHHH T ss_conf 324542257377899---99999755775237789-99534377652649-------------------99999999999 Q ss_pred HCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHH Q ss_conf 04544588735675310036766443114668854061699615012 Q gi|254780617|r 311 KMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELF 357 (509) Q Consensus 311 ~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~ 357 (509) ++.+. |-+- ++++...-+++-.-+-++|+-|.++.|+..-+ T Consensus 153 ~l~d~---gKvI---ilTvhp~~l~e~~~~rirs~~d~~l~L~~~~~ 193 (235) T COG2874 153 KLSDL---GKVI---ILTVHPSALDEDVLTRIRSACDVYLRLRLEEL 193 (235) T ss_pred HHHHC---CCEE---EEEECHHHCCHHHHHHHHHHHHEEEEEEHHHH T ss_conf 98728---9789---99947343378999999875202589870231 No 432 >TIGR01166 cbiO cobalt ABC transporter, ATP-binding protein; InterPro: IPR005876 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents the ATP binding subunit of the multisubunit cobalt transporter in bacteria and its equivalents in archaea. This superfamily includes two groups, one which catalyses the uptake of small molecules, including ions from the external milieu and the other group which is engaged in the efflux of small molecular weight compounds and ions from within the cell. Energy derived from the hydrolysis of ATP drives both the process of uptake and efflux.; GO: 0006824 cobalt ion transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=85.94 E-value=0.54 Score=26.28 Aligned_cols=31 Identities=23% Similarity=0.550 Sum_probs=24.3 Q ss_pred HHHHHCC-CCCCCCCEEEEECCCCCCCHHHHH Q ss_conf 0233101-111568336552477886115899 Q gi|254780617|r 150 IKAIDSL-IPIGRGQRELIIGDRKTGKTSIIL 180 (509) Q Consensus 150 I~~ID~l-~pigrGQR~~I~g~~g~GKt~l~~ 180 (509) .++.+.+ +-+.+|....++|.-|+|||||++ T Consensus 5 ~~vL~gln~~~~~G~~~aLlG~NGaGKsTLl~ 36 (190) T TIGR01166 5 PEVLKGLNFAVERGEVLALLGANGAGKSTLLL 36 (190) T ss_pred CCCCCCCCEEECCCCEEEEECCCCCCHHHHHH T ss_conf 52035423022057168987289985789988 No 433 >PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed Probab=85.93 E-value=1.1 Score=23.93 Aligned_cols=115 Identities=23% Similarity=0.397 Sum_probs=57.1 Q ss_pred HHHCCHHHHHCCCCCCCCCEEEEECCCCC-CCHHHH----HHHHHHHHHHCCCCCCCCC---CEEEEEECCCCHH----- Q ss_conf 44416023310111156833655247788-611589----9999998863024667876---0699995167767----- Q gi|254780617|r 145 PLSTGIKAIDSLIPIGRGQRELIIGDRKT-GKTSII----LDTFLNQKSSHDKGSEKDK---VYCIYVAIGQKRS----- 211 (509) Q Consensus 145 ~l~TGI~~ID~l~pigrGQR~~I~g~~g~-GKt~l~----~~~I~nq~~~~~~~~~~~~---v~~V~~~IGer~~----- 211 (509) ||..+-+++=.=+-++.++|++|+-|+.+ |||+.+ +-+++.|.+...-+..... +--||+-||--.+ T Consensus 307 PLL~~~~vVP~di~l~~~~~~liITGPNtGGKTv~LKtvgL~~lMaq~Gl~vPa~e~s~~~~f~~i~adIGD~QSie~~L 386 (780) T PRK00409 307 PLLDGEKVVPKDIELGFDKTVLVITGPNTGGKTVTLKTLGLAALMAKSGLPIPAAEPSEIPVFKEVFADIGDEQSIEQSL 386 (780) T ss_pred CCCCCCCCCCCEEEECCCCEEEEEECCCCCCCEEHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEECCCHHHHHCH T ss_conf 76366772573478669841899968988885637999999999998299975168980423463899827712133265 Q ss_pred -----HHHHHHHHHCCCCCCCEEEEEE---ECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf -----8999998511467421158997---28878889999988766666675420468847 Q gi|254780617|r 212 -----SVARFVKALEDRGALSYSIVVV---ASASDPAPMQLLAPFAGCAMGEYFRDNGYHAL 265 (509) Q Consensus 212 -----ev~~~~~~l~~~~~~~~tvvv~---a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VL 265 (509) .+..+.+-+... -++|.|.. .+..||.- .+-.|+||-|||+++|--++ T Consensus 387 STFS~hm~~i~~il~~a--~~~sLVLlDElG~GTDP~E----GaALa~aile~l~~~~~~~i 442 (780) T PRK00409 387 STFSGHMTNIVRILEKA--DENSLVLFDELGAGTDPDE----GAALAISILDYLRKRGAKII 442 (780) T ss_pred HHHHHHHHHHHHHHHHC--CCCEEEEECCCCCCCCHHH----HHHHHHHHHHHHHHCCCEEE T ss_conf 24999999999999738--9980881232358998456----59999999999997799799 No 434 >cd01919 PEPCK Phosphoenolpyruvate carboxykinase (PEPCK), a critical gluconeogenic enzyme, catalyzes the first committed step in the diversion of tricarboxylic acid cycle intermediates toward gluconeogenesis. It catalyzes the reversible decarboxylation and phosphorylation of oxaloacetate to yield phosphoenolpyruvate and carbon dioxide, using a nucleotide molecule (ATP or GTP) for the phosphoryl transfer, and has a strict requirement for divalent metal ions for activity. PEPCK's separate into two phylogenetic groups based on their nucleotide substrate specificity (the ATP-, and GTP-dependent groups). Probab=85.70 E-value=0.45 Score=26.83 Aligned_cols=275 Identities=19% Similarity=0.201 Sum_probs=124.7 Q ss_pred CCHHHHHCCCCCC-CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCC Q ss_conf 1602331011115-683365524778861158999999988630246678760699995167767899999851146742 Q gi|254780617|r 148 TGIKAIDSLIPIG-RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGAL 226 (509) Q Consensus 148 TGI~~ID~l~pig-rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~ 226 (509) -|+...-+..-+| .|...+.||.||||||+|.+|- + ..-..|+. .+....+ + -. -+.|+. T Consensus 202 ~gvL~mHcsan~g~~gd~a~ffGlSGTGKTTLS~d~-------~-r~LIGDDe----hgw~~~G--v----fN-~EgGcy 262 (515) T cd01919 202 EGWLAMHMSANVGTNGDVLVFFGLSGTGKTTLSMDP-------K-RELIGDDE----HWWKDDG--V----FN-PEGGCY 262 (515) T ss_pred CCCCEEEEEEEECCCCCEEEEEECCCCCHHHHHHCC-------C-CEEECCCE----EEEECCC--E----EC-CCCEEE T ss_conf 484131033440568988999706666743387687-------8-55874625----5784385--5----63-365068 Q ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH-CCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH Q ss_conf 11589972887888999998876666667542-04688479961647899999875453037875433375200124678 Q gi|254780617|r 227 SYSIVVVASASDPAPMQLLAPFAGCAMGEYFR-DNGYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRL 305 (509) Q Consensus 227 ~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr-~~G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l 305 (509) .++ ..++-+.-..-|.|-.-+ ||-|... +... -+-++|.+. -|=+|-+||-+- T Consensus 263 ak~---~~ls~~~eP~i~~ai~~g-ti~eNV~~~~~~--~~d~~d~s~-------------t~NtR~~yPl~~------- 316 (515) T cd01919 263 AKA---IGLSVKTEPNIYKAIRKN-AIFENVAETSDG--GIDFEDISA-------------HPNTRVCYPASH------- 316 (515) T ss_pred EEE---CCCCCCCCHHHHHHHHHC-CEEEEEEECCCC--CEEECCCCC-------------CCCCCEEEEHHH------- T ss_conf 873---488877699999998637-650003675888--543024668-------------876666748887------- Q ss_pred HHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCCCHHHCCCCCCCHHHH- Q ss_conf 8887504544588735675310036766443114668854061699615012210266543344412203434389887- Q gi|254780617|r 306 LERAAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGLSVSRVGSAAQVKAMK- 384 (509) Q Consensus 306 ~ERa~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~S~SRv~~~~~~~~~~- 384 (509) +|-+..+.+ ....-+.-|||.++. -|+.|+| ||+ +..|.-.|. T Consensus 317 ------------------i~~~~~~~~---------~g~~p~~iifl~~da--~gvlPpv------~~l-~~~qa~y~f~ 360 (515) T cd01919 317 ------------------IPIIDAAWE---------SAGHIEGVIFLTRDA--FGVVPPV------YRL-TWQQGVFVFA 360 (515) T ss_pred ------------------CCCCCCHHH---------CCCCCCEEEEEEECC--CCCCCHH------HHH-CHHHHHHHHH T ss_conf ------------------987668332---------279862589961045--7876216------440-6888888877 Q ss_pred -----HHHHHHHHHHHHHHHHH-HHHH-HHC---CCCHHHHHHHHHHHHHHHHH------------CCCCCCCCCHHHHH Q ss_conf -----75567888799887678-7987-633---89989999999999999996------------69998874889999 Q gi|254780617|r 385 -----QVSGAVKGELAQYREMS-SFSK-FSS---DLDSSTQKFLSKGERLTELL------------KQPQFSPLAMEEQV 442 (509) Q Consensus 385 -----~~a~~lr~~laqy~Ele-~f~~-fgs---dlD~~t~~~l~rg~ri~e~L------------kQ~~~~p~~~~~qv 442 (509) ++|+.-+ .-.|.+ .|+- ||. .+.+..=..+ ..+++.+.. ....+...++.+.- T Consensus 361 sgyt~k~agte~----gv~e~~~tfs~cfg~PF~~~~~~~Ya~~-l~~~i~~~~~~i~lVNtgw~gg~G~g~r~~~~~tr 435 (515) T cd01919 361 AGRTAATAGTEA----GHKGKIPMFSPCFGRPFLGYHFTKYLEH-LLSMMQHPLPKIFLVNTGRKGKEGKFKRPGFGETR 435 (515) T ss_pred HCCCHHHHHHHH----CCCCCEEEECCCCCCCCCCCCHHHHHHH-HHHHHHHHCCCEEEEECCCCCCCCCEEECCHHHHH T ss_conf 425300123431----3578615325333676565689999999-99988861896899947740899967647857999 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 9999986698066798999999999999997415999998873279898999999999999985159 Q gi|254780617|r 443 VMIFAGISGCLDEVAVSQVRKFETNFLSHMRVSSQDILEDIRKQKVLTDDIRSKLINEIKVFLEDFN 509 (509) Q Consensus 443 ~~l~a~~~G~ld~i~v~~I~~fe~~l~~~l~~~~~ei~~~I~~~~~l~de~~~~L~~~i~~~~~~Fs 509 (509) .++-++..|-+++++-..-.-|-=.+-..+.....++++.-+. -+.-++..++.+++++.|+++|+ T Consensus 436 ~ii~ail~g~l~~~~~~~~pif~l~iP~~~~~v~~~~l~p~~~-w~~~~~y~~~~~~l~~~f~~nF~ 501 (515) T cd01919 436 AIIDAIFNGILDKAETKLTPIFNLYIPKALNLVGLGHLNPRNM-MELFSQSKEFWDKLVEDFEKYFV 501 (515) T ss_pred HHHHHHHCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCHHHH-HHCHHHHHHHHHHHHHHHHHHHH T ss_conf 9999996596367782886545866876469987011898886-54702459999999999999999 No 435 >cd01854 YjeQ_engC YjeQ/EngC. YjeQ (YloQ in Bacillus subtilis) represents a protein family whose members are broadly conserved in bacteria and have been shown to be essential to the growth of E. coli and B. subtilis. Proteins of the YjeQ family contain all sequence motifs typical of the vast class of P-loop-containing GTPases, but show a circular permutation, with a G4-G1-G3 pattern of motifs as opposed to the regular G1-G3-G4 pattern seen in most GTPases. All YjeQ family proteins display a unique domain architecture, which includes an N-terminal OB-fold RNA-binding domain, the central permuted GTPase domain, and a zinc knuckle-like C-terminal cysteine domain. This domain architecture suggests a role for YjeQ as a regulator of translation. Probab=85.70 E-value=0.46 Score=26.79 Aligned_cols=27 Identities=26% Similarity=0.593 Sum_probs=21.0 Q ss_pred HHCCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 310111156833655247788611589 Q gi|254780617|r 153 IDSLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 153 ID~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +|.|...-+|+...+.|.||+|||||. T Consensus 152 ~~~L~~~l~~k~~v~~G~SGvGKSSLi 178 (287) T cd01854 152 LDELREYLKGKTSVLVGQSGVGKSTLI 178 (287) T ss_pred HHHHHHHHCCCEEEEECCCCCCHHHHH T ss_conf 899998747988999889988889999 No 436 >COG4525 TauB ABC-type taurine transport system, ATPase component [Inorganic ion transport and metabolism] Probab=85.68 E-value=0.58 Score=26.08 Aligned_cols=26 Identities=27% Similarity=0.567 Sum_probs=21.6 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 10111156833655247788611589 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) |.-+.|..|+-.-+.|++|||||||+ T Consensus 23 ~vsL~ia~ge~vv~lGpSGcGKTTLL 48 (259) T COG4525 23 DVSLTIASGELVVVLGPSGCGKTTLL 48 (259) T ss_pred CCCEEECCCCEEEEECCCCCCHHHHH T ss_conf 35502358978999768886578899 No 437 >cd04101 RabL4 RabL4 (Rab-like4) subfamily. RabL4s are novel proteins that have high sequence similarity with Rab family members, but display features that are distinct from Rabs, and have been termed Rab-like. As in other Rab-like proteins, RabL4 lacks a prenylation site at the C-terminus. The specific function of RabL4 remains unknown. Probab=85.66 E-value=0.48 Score=26.68 Aligned_cols=17 Identities=35% Similarity=0.640 Sum_probs=15.2 Q ss_pred EEEEECCCCCCCHHHHH Q ss_conf 36552477886115899 Q gi|254780617|r 164 RELIIGDRKTGKTSIIL 180 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~ 180 (509) |+.++|++|||||+|+. T Consensus 2 KivllGd~gVGKTsli~ 18 (164) T cd04101 2 RCAVVGDPAVGKTAFVQ 18 (164) T ss_pred EEEEECCCCCCHHHHHH T ss_conf 99999959958899999 No 438 >pfam03193 DUF258 Protein of unknown function, DUF258. Probab=85.55 E-value=0.44 Score=26.90 Aligned_cols=28 Identities=25% Similarity=0.649 Sum_probs=22.7 Q ss_pred HHHCCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 3310111156833655247788611589 Q gi|254780617|r 152 AIDSLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 152 ~ID~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) .+|.|...-+|+...+.|.+|+|||||. T Consensus 25 g~~~L~~~l~~k~sv~~G~SGVGKSTLi 52 (161) T pfam03193 25 GIEELKPLLKGKTSVLAGQSGVGKSTLL 52 (161) T ss_pred CHHHHHHHHCCCEEEEECCCCCCHHHHH T ss_conf 9999999867985999889998899999 No 439 >KOG2028 consensus Probab=85.53 E-value=2.6 Score=21.38 Aligned_cols=19 Identities=26% Similarity=0.307 Sum_probs=14.8 Q ss_pred CEEEEECCCCCCCHHHHHH Q ss_conf 3365524778861158999 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILD 181 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~ 181 (509) --+-+.|++|||||+|+-- T Consensus 163 pSmIlWGppG~GKTtlArl 181 (554) T KOG2028 163 PSMILWGPPGTGKTTLARL 181 (554) T ss_pred CCEEEECCCCCCHHHHHHH T ss_conf 7058866998765889999 No 440 >TIGR02673 FtsE cell division ATP-binding protein FtsE; InterPro: IPR005286 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). FtsE is an ABC transporter ATP-binding protein. This protein and its permease partner FtsX, localize to the cell division site. In a number of species, the ftsEX gene pair is located next to ftsY, which encodes the signal recognition particle-docki ng protein.; GO: 0005524 ATP binding, 0051301 cell division. Probab=85.48 E-value=0.54 Score=26.30 Aligned_cols=70 Identities=13% Similarity=0.220 Sum_probs=43.8 Q ss_pred CHHHHHCCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEE--ECCCCHHHHHHHHH Q ss_conf 0014441602331-01111568336552477886115899999998863024667876069999--51677678999998 Q gi|254780617|r 142 VCEPLSTGIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYV--AIGQKRSSVARFVK 218 (509) Q Consensus 142 i~~~l~TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~--~IGer~~ev~~~~~ 218 (509) |+-...+|+.+.+ .=+.|-+||=.=+.|.||.|||||+ -+|+.+ ...+...|-|.+ .---+++++..+.+ T Consensus 7 V~~~Y~~~~~aL~~v~l~i~kG~F~FLtG~SGAGKttLL-KLl~~~------~~P~~G~v~~~G~~~~~l~~~~~P~LRR 79 (215) T TIGR02673 7 VSKSYPGGVEALHDVSLHIRKGEFLFLTGPSGAGKTTLL-KLLYGA------LTPSRGQVRVAGEDVSRLRGRQLPLLRR 79 (215) T ss_pred ECEECCCCCHHHCCCCEEECCCCEEEEECCCCCCHHHHH-HHHHHH------CCCCCCEEEECCEECCCCCCCCCHHHHC T ss_conf 210078985113276447527740788727786178999-999852------6987580888874046677564312213 No 441 >cd01861 Rab6 Rab6 subfamily. Rab6 is involved in microtubule-dependent transport pathways through the Golgi and from endosomes to the Golgi. Rab6A of mammals is implicated in retrograde transport through the Golgi stack, and is also required for a slow, COPI-independent, retrograde transport pathway from the Golgi to the endoplasmic reticulum (ER). This pathway may allow Golgi residents to be recycled through the ER for scrutiny by ER quality-control systems. Yeast Ypt6p, the homolog of the mammalian Rab6 GTPase, is not essential for cell viability. Ypt6p acts in endosome-to-Golgi, in intra-Golgi retrograde transport, and possibly also in Golgi-to-ER trafficking. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Probab=85.48 E-value=0.51 Score=26.49 Aligned_cols=20 Identities=40% Similarity=0.639 Sum_probs=16.9 Q ss_pred CEEEEECCCCCCCHHHHHHH Q ss_conf 33655247788611589999 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILDT 182 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~~ 182 (509) .|+.++|+++||||+|+.-. T Consensus 1 ~Ki~vvG~~~vGKTsli~r~ 20 (161) T cd01861 1 HKLVFLGDQSVGKTSIITRF 20 (161) T ss_pred CEEEEECCCCCCHHHHHHHH T ss_conf 97999997997899999999 No 442 >PRK13545 tagH teichoic acids export protein ATP-binding subunit; Provisional Probab=85.43 E-value=0.61 Score=25.92 Aligned_cols=30 Identities=17% Similarity=0.282 Sum_probs=24.6 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 1011115683365524778861158999999 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) |.=+.+.+|.+.||+|.-|.|||||+ .+|. T Consensus 42 nVSFeV~kGE~vGIIG~NGAGKSTLL-KiIa 71 (549) T PRK13545 42 NISFEVPEGEIVGIVGLNGSGKSTLS-NLIA 71 (549) T ss_pred CCEEEECCCCEEEEECCCCCCHHHHH-HHHH T ss_conf 72578648989999889999899999-9996 No 443 >COG1120 FepC ABC-type cobalamin/Fe3+-siderophores transport systems, ATPase components [Inorganic ion transport and metabolism / Coenzyme metabolism] Probab=85.42 E-value=0.6 Score=25.96 Aligned_cols=28 Identities=21% Similarity=0.417 Sum_probs=23.2 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 01111568336552477886115899999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .=+.+-+|.-..|+|+-|+|||||+- +| T Consensus 21 ls~~i~~G~i~~iiGpNG~GKSTLLk-~l 48 (258) T COG1120 21 LSFSIPKGEITGILGPNGSGKSTLLK-CL 48 (258) T ss_pred CEEEECCCCEEEEECCCCCCHHHHHH-HH T ss_conf 36886599799998998889999999-98 No 444 >COG4136 ABC-type uncharacterized transport system, ATPase component [General function prediction only] Probab=85.38 E-value=0.78 Score=25.14 Aligned_cols=95 Identities=15% Similarity=0.180 Sum_probs=48.2 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEE-EEECCCCHHH-HHH--------HHHHH-CCC Q ss_conf 011115683365524778861158999999988630246678760699-9951677678-999--------99851-146 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCI-YVAIGQKRSS-VAR--------FVKAL-EDR 223 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V-~~~IGer~~e-v~~--------~~~~l-~~~ 223 (509) .-+||.+|.-.-|+|++|+|||+|+.=|+=. -..+.-|- -..++++.-+ +-. |-+.+ -.. T Consensus 21 ~n~Tia~GeivtlMGPSGcGKSTLls~~~G~---------La~~F~~~G~~~l~~~~l~~lPa~qRq~GiLFQD~lLFph 91 (213) T COG4136 21 VNFTIAKGEIVTLMGPSGCGKSTLLSWMIGA---------LAGQFSCTGELWLNEQRLDMLPAAQRQIGILFQDALLFPH 91 (213) T ss_pred EEEEECCCCEEEEECCCCCCHHHHHHHHHHH---------CCCCCCEEEEEEECCEECCCCCHHHHHEEEEECCCCCCCC T ss_conf 5478637847998778886578899999862---------0667624568988872004361324322135425501444 Q ss_pred CCCCEEEEEEECCCCCHHHHHHHHHHHHH---HHHHHH Q ss_conf 74211589972887888999998876666---667542 Q gi|254780617|r 224 GALSYSIVVVASASDPAPMQLLAPFAGCA---MGEYFR 258 (509) Q Consensus 224 ~~~~~tvvv~a~a~~~~~~r~~ap~~a~a---iAEyfr 258 (509) =..-...+++-+++----.|..++|+|+. .++.|. T Consensus 92 lsVg~Nl~fAlp~~~KG~aRr~~a~aAL~~~gL~g~f~ 129 (213) T COG4136 92 LSVGQNLLFALPATLKGNARRNAANAALERSGLDGAFH 129 (213) T ss_pred CCCCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCHHHH T ss_conf 23121447853712142777766888888852300322 No 445 >TIGR00763 lon ATP-dependent protease La; InterPro: IPR004815 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This signature defines the bacterial and eukaryotic lon proteases, which are ATP-dependent serine peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SF). This family of sequences does not include the archaeal lon homologs, IPR004663 from INTERPRO. In the eukaryotes the majority of the proteins are located in the mitochondrial matrix , . In yeast, Pim1, is located in the mitochondrial matrix, is required for mitochondrial function, is constitutively expressed but is increased after thermal stress, suggesting that Pim1 may play a role in the heat shock response .; GO: 0004176 ATP-dependent peptidase activity, 0005524 ATP binding, 0006510 ATP-dependent proteolysis. Probab=85.30 E-value=0.5 Score=26.54 Aligned_cols=28 Identities=25% Similarity=0.208 Sum_probs=19.7 Q ss_pred CCCCCCCCC-EEEEECCCCCCCHHHHHHH Q ss_conf 011115683-3655247788611589999 Q gi|254780617|r 155 SLIPIGRGQ-RELIIGDRKTGKTSIILDT 182 (509) Q Consensus 155 ~l~pigrGQ-R~~I~g~~g~GKt~l~~~~ 182 (509) -|..=-+|- =+++.|++|||||||+-.. T Consensus 442 kL~~~~~GpqIlClvGPPGVGKTSlg~SI 470 (941) T TIGR00763 442 KLRGKMKGPQILCLVGPPGVGKTSLGKSI 470 (941) T ss_pred CCCCCCCCCEEEEEECCCCCCHHHHHHHH T ss_conf 44778888767872072695422278999 No 446 >cd04175 Rap1 Rap1 subgroup. The Rap1 subgroup is part of the Rap subfamily of the Ras family. It can be further divided into the Rap1a and Rap1b isoforms. In humans, Rap1a and Rap1b share 95% sequence homology, but are products of two different genes located on chromosomes 1 and 12, respectively. Rap1a is sometimes called smg p21 or Krev1 in the older literature. Rap1 proteins are believed to perform different cellular functions, depending on the isoform, its subcellular localization, and the effector proteins it binds. For example, in rat salivary gland, neutrophils, and platelets, Rap1 localizes to secretory granules and is believed to regulate exocytosis or the formation of secretory granules. Rap1 has also been shown to localize in the Golgi of rat fibroblasts, zymogen granules, plasma membrane, and the microsomal membrane of pancreatic acini, as well as in the endocytic compartment of skeletal muscle cells and fibroblasts. High expression of Rap1 has been observed in the n Probab=85.28 E-value=0.54 Score=26.27 Aligned_cols=20 Identities=15% Similarity=0.464 Sum_probs=17.0 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 3 KIvllGd~~VGKTsli~r~~ 22 (164) T cd04175 3 KLVVLGSGGVGKSALTVQFV 22 (164) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 99998999975999999997 No 447 >cd04138 H_N_K_Ras_like H-Ras/N-Ras/K-Ras subfamily. H-Ras, N-Ras, and K-Ras4A/4B are the prototypical members of the Ras family. These isoforms generate distinct signal outputs despite interacting with a common set of activators and effectors, and are strongly associated with oncogenic progression in tumor initiation. Mutated versions of Ras that are insensitive to GAP stimulation (and are therefore constitutively active) are found in a significant fraction of human cancers. Many Ras guanine nucleotide exchange factors (GEFs) have been identified. They are sequestered in the cytosol until activation by growth factors triggers recruitment to the plasma membrane or Golgi, where the GEF colocalizes with Ras. Active (GTP-bound) Ras interacts with several effector proteins that stimulate a variety of diverse cytoplasmic signaling activities. Some are known to positively mediate the oncogenic properties of Ras, including Raf, phosphatidylinositol 3-kinase (PI3K), RalGEFs, and Tiam1. Probab=85.27 E-value=0.57 Score=26.13 Aligned_cols=20 Identities=15% Similarity=0.503 Sum_probs=16.9 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 3 KvvlvGd~~VGKTsli~r~~ 22 (162) T cd04138 3 KLVVVGAGGVGKSALTIQLI 22 (162) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 99998999988999999998 No 448 >cd04127 Rab27A Rab27a subfamily. The Rab27a subfamily consists of Rab27a and its highly homologous isoform, Rab27b. Unlike most Rab proteins whose functions remain poorly defined, Rab27a has many known functions. Rab27a has multiple effector proteins, and depending on which effector it binds, Rab27a has different functions as well as tissue distribution and/or cellular localization. Putative functions have been assigned to Rab27a when associated with the effector proteins Slp1, Slp2, Slp3, Slp4, Slp5, DmSlp, rabphilin, Dm/Ce-rabphilin, Slac2-a, Slac2-b, Slac2-c, Noc2, JFC1, and Munc13-4. Rab27a has been associated with several human diseases, including hemophagocytic syndrome (Griscelli syndrome or GS), Hermansky-Pudlak syndrome, and choroidermia. In the case of GS, a rare, autosomal recessive disease, a Rab27a mutation is directly responsible for the disorder. When Rab27a is localized to the secretory granules of pancreatic beta cells, it is believed to mediate glucose-stimulated Probab=85.26 E-value=0.52 Score=26.44 Aligned_cols=20 Identities=35% Similarity=0.499 Sum_probs=16.9 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 6 KivvvGd~~VGKTsli~r~~ 25 (180) T cd04127 6 KFLALGDSGVGKTSFLYQYT 25 (180) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 99999999988899999996 No 449 >PRK09700 D-allose transporter ATP-binding protein; Provisional Probab=85.26 E-value=0.6 Score=25.98 Aligned_cols=187 Identities=14% Similarity=0.178 Sum_probs=77.5 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHH--CCCCCCCEEEEEEECCCC Q ss_conf 683365524778861158999999988630246678760699995167767899999851--146742115899728878 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKAL--EDRGALSYSIVVVASASD 238 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l--~~~~~~~~tvvv~a~a~~ 238 (509) .+-+++|+-.+-++=+.-..+.+.+-..... + ... -++.|-.+-.++.++.+.+ ...|. ++...+.++ T Consensus 162 ~~p~llilDEPTa~Ld~~~~~~l~~~l~~l~--~--~g~--tii~isH~l~~v~~~~Drv~vl~~G~----iv~~~~~~e 231 (510) T PRK09700 162 LDAKVIIMDEPTSSLTNKEVDYLFLIMNQLR--K--EGT--AIVYISHKLAEIRRICDRYTVMKDGS----SVCSGMVSD 231 (510) T ss_pred HCCCEEEECCCCCCCCHHHHHHHHHHHHHHH--H--CCC--CEEEEECCHHHHHHHCCEEEECCCCC----EEEEECHHH T ss_conf 5988499878856668678999999988888--7--287--17999523677886488699714981----996715464 Q ss_pred CHHHHHHHHHHHHHHHHHH-------HCCCCCEEEEEECHHHH-HHHHHHHHHHC--CCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 8899999887666666754-------20468847996164789-99998754530--37875433375200124678888 Q gi|254780617|r 239 PAPMQLLAPFAGCAMGEYF-------RDNGYHALIAYDDLQKH-AVAYRQLSLLL--RRPPGREAYPGDVFYLHSRLLER 308 (509) Q Consensus 239 ~~~~r~~ap~~a~aiAEyf-------r~~G~~VLi~~Ddltr~-A~a~reisl~~--~~pp~~~gyp~~vf~~~s~l~ER 308 (509) -....+.....+..+...+ .+....+++-..+++.. ..+.+.+|+-+ ||.-|--|--|+=-+.+.+++-. T Consensus 232 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~l~v~~l~~~~~~~l~~vsf~v~~GEi~gl~G~nGsGKsTL~~~l~G 311 (510) T PRK09700 232 VSNDDIVRLMVGRELQNRFNAMKENVSNLAHETVFEVRNVTSRDRKKVRDISFSVCRGEILGFAGLVGSGRTELMNCLFG 311 (510) T ss_pred CCHHHHHHHHHCCCCHHHCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEEEEEEECCCCEEEEECCCCCCHHHHHHHHHC T ss_conf 89999999873736123145566667777787179994652578886543357874881899976888628899999819 Q ss_pred HHHCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCEEEEECCHHHHCCCCCCCCCC Q ss_conf 7504544588735675310036766443114668854061699615012210266543344 Q gi|254780617|r 309 AAKMSDALGAGSLTALPVIETQVNDVSAYIPTNVISITDGQIFLETELFYQGIRPAINIGL 369 (509) Q Consensus 309 a~~~~~~~g~GSiT~~~~v~~~~~D~~~~i~~~~~si~DG~i~l~~~l~~~g~~Paid~~~ 369 (509) +... ..|+|. ..+.|++..-|...+.. |--++..+-...|.||...+.. T Consensus 312 ---l~~~-~~G~I~------~~G~~i~~~~~~~~~~~--gi~~v~~~r~~~~l~~~~sv~e 360 (510) T PRK09700 312 ---VDKR-AGGEIR------LNGKDISPRSPLDAVKK--GMAYITESRRDNGFFPNFSIAQ 360 (510) T ss_pred ---CCCC-CCEEEE------ECCEECCCCCHHHHHHC--CCEEEEEEHHHCCCCCCCCHHH T ss_conf ---8888-861899------99999998998999970--8677531033247488996899 No 450 >cd04137 RheB Rheb (Ras Homolog Enriched in Brain) subfamily. Rheb was initially identified in rat brain, where its expression is elevated by seizures or by long-term potentiation. It is expressed ubiquitously, with elevated levels in muscle and brain. Rheb functions as an important mediator between the tuberous sclerosis complex proteins, TSC1 and TSC2, and the mammalian target of rapamycin (TOR) kinase to stimulate cell growth. TOR kinase regulates cell growth by controlling nutrient availability, growth factors, and the energy status of the cell. TSC1 and TSC2 form a dimeric complex that has tumor suppressor activity, and TSC2 is a GTPase activating protein (GAP) for Rheb. The TSC1/TSC2 complex inhibits the activation of TOR kinase through Rheb. Rheb has also been shown to induce the formation of large cytoplasmic vacuoles in a process that is dependent on the GTPase cycle of Rheb, but independent of the TOR kinase, suggesting Rheb plays a role in endocytic trafficking that le Probab=85.11 E-value=0.54 Score=26.31 Aligned_cols=21 Identities=24% Similarity=0.547 Sum_probs=17.6 Q ss_pred CEEEEECCCCCCCHHHHHHHH Q ss_conf 336552477886115899999 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~~I 183 (509) +|+.++|++|||||+|+.-.+ T Consensus 2 ~KIvlvGd~~VGKTsli~r~~ 22 (180) T cd04137 2 RKIAVLGSRSVGKSSLTVQFV 22 (180) T ss_pred CEEEEECCCCCCHHHHHHHHH T ss_conf 889999989988999999997 No 451 >cd03267 ABC_NatA_like Similar in sequence to NatA, this is the ATPase component of a bacterial ABC-type Na+ transport system called NatAB, which catalyzes ATP-dependent electrogenic Na+ extrusion without mechanically coupled to proton or K+ uptake. NatB possess six putative membrane spanning regions at its C-terminus. In B. subtilis, NatAB is inducible by agents such as ethanol and protonophores, which lower the protonmotive force across the membrane. The closest sequence similarity to NatA is exhibited by DrrA of the two-component daunomycin- and doxorubicin-efflux system. Hence, the functional NatAB is presumably assembled with two copies of the single ATP-binding protein and the single intergral membrane protein. Probab=85.10 E-value=0.6 Score=25.96 Aligned_cols=35 Identities=23% Similarity=0.367 Sum_probs=27.9 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 602331-011115683365524778861158999999 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) +..|+| .=+.+-+|+-.+++|+.|+||||++ .+|. T Consensus 33 ~~~al~~vsf~i~~Gei~gLlGpNGaGKSTll-k~l~ 68 (236) T cd03267 33 EVEALKGISFTIEKGEIVGFIGPNGAGKTTTL-KILS 68 (236) T ss_pred CEEEECCCEEEECCCCEEEEECCCCCHHHHHH-HHHH T ss_conf 98986680578848959999999983099999-9996 No 452 >KOG1564 consensus Probab=85.09 E-value=2.7 Score=21.18 Aligned_cols=126 Identities=18% Similarity=0.192 Sum_probs=67.5 Q ss_pred HHHHCCHHHHHCCCCCC--CCCEEEEECCCCCCCHHHHHHHHHH-HHHHCCCCCCCCCCEEEEEEC--CCCHHHHHHHHH Q ss_conf 14441602331011115--6833655247788611589999999-886302466787606999951--677678999998 Q gi|254780617|r 144 EPLSTGIKAIDSLIPIG--RGQRELIIGDRKTGKTSIILDTFLN-QKSSHDKGSEKDKVYCIYVAI--GQKRSSVARFVK 218 (509) Q Consensus 144 ~~l~TGI~~ID~l~pig--rGQR~~I~g~~g~GKt~l~~~~I~n-q~~~~~~~~~~~~v~~V~~~I--Ger~~ev~~~~~ 218 (509) +-|.||-.+.|.++-=| -++=.-|+|.+|+|||-+++..-+- |--...++-++ -|||..- +-..+...++-. T Consensus 82 ~~lttgc~~LD~~L~GGi~~~giTEi~GeSg~GKtQL~lQL~L~VQLp~~~GGL~~---~~vYI~TE~~fP~rRL~qL~~ 158 (351) T KOG1564 82 SKLTTGCVALDECLRGGIPTDGITEICGESGCGKTQLLLQLSLCVQLPRSHGGLGG---GAVYICTESPFPTRRLHQLSH 158 (351) T ss_pred HHCCCCCHHHHHHHCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHCCHHHCCCCC---CEEEEECCCCCCHHHHHHHHH T ss_conf 01104558888874499334548888602577588999999988608551089777---439997688874778999998 Q ss_pred HHCCC-CC-----CC-E---EEEEEECCCCCHHHH---HHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHH Q ss_conf 51146-74-----21-1---589972887888999---9988766666675420468847996164789999987 Q gi|254780617|r 219 ALEDR-GA-----LS-Y---SIVVVASASDPAPMQ---LLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAVAYRQ 280 (509) Q Consensus 219 ~l~~~-~~-----~~-~---tvvv~a~a~~~~~~r---~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~a~re 280 (509) .+.+. .. .. + -|++.+-.|--+.+. +.-| ....+++=-|+++||+..+-+++-+ T Consensus 159 ~~~~rp~~~~~~~~~~Npgd~IFv~~~~d~d~l~~iv~~qlP--------iL~~r~~i~LVIiDSVAa~fR~E~d 225 (351) T KOG1564 159 TLPQRPNPEKELNYNDNPGDHIFVENVHDVDHLLHIVNRQLP--------ILLNRKKIKLVIIDSVAALFRSEFD 225 (351) T ss_pred HCCCCCCCCHHHHHCCCCCCEEEEEECCCHHHHHHHHHHHCC--------CEECCCCCEEEEEEHHHHHHHHHHC T ss_conf 550479943443421697764798733533568998866423--------1011573008997120577787732 No 453 >cd04136 Rap_like Rap-like subfamily. The Rap subfamily consists of the Rap1, Rap2, and RSR1. Rap subfamily proteins perform different cellular functions, depending on the isoform and its subcellular localization. For example, in rat salivary gland, neutrophils, and platelets, Rap1 localizes to secretory granules and is believed to regulate exocytosis or the formation of secretory granules. Rap1 has also been shown to localize in the Golgi of rat fibroblasts, zymogen granules, plasma membrane, and microsomal membrane of the pancreatic acini, as well as in the endocytic compartment of skeletal muscle cells and fibroblasts. Rap1 localizes in the nucleus of human oropharyngeal squamous cell carcinomas (SCCs) and cell lines. Rap1 plays a role in phagocytosis by controlling the binding of adhesion receptors (typically integrins) to their ligands. In yeast, Rap1 has been implicated in multiple functions, including activation and silencing of transcription and maintenance of telomeres. Probab=85.09 E-value=0.56 Score=26.17 Aligned_cols=20 Identities=15% Similarity=0.479 Sum_probs=16.7 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 3 KivllGd~~VGKTsli~r~~ 22 (163) T cd04136 3 KVVVLGSGGVGKSALTVQFV 22 (163) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 99998999988999999997 No 454 >cd03230 ABC_DR_subfamily_A This family of ATP-binding proteins belongs to a multisubunit transporter involved in drug resistance (BcrA and DrrA), nodulation, lipid transport, and lantibiotic immunity. In bacteria and archaea, these transporters usually include an ATP-binding protein and one or two integral membrane proteins. Eukaryote systems of the ABCA subfamily display ABC domains that are quite similar to this family. The ATP-binding domain shows the highest similarity between all members of the ABC transporter family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=85.04 E-value=0.64 Score=25.75 Aligned_cols=30 Identities=23% Similarity=0.445 Sum_probs=24.4 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 0111156833655247788611589999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) .=+.+-+|+-.+|+|..|+|||||+ .+|.. T Consensus 19 vsl~i~~Gei~gl~G~NGaGKSTLl-~~i~G 48 (173) T cd03230 19 ISLTVEKGEIYGLLGPNGAGKTTLI-KIILG 48 (173) T ss_pred EEEEECCCCEEEEECCCCCCHHHHH-HHHHC T ss_conf 0878879939999878997999999-99976 No 455 >cd01870 RhoA_like RhoA-like subfamily. The RhoA subfamily consists of RhoA, RhoB, and RhoC. RhoA promotes the formation of stress fibers and focal adhesions, regulating cell shape, attachment, and motility. RhoA can bind to multiple effector proteins, thereby triggering different downstream responses. In many cell types, RhoA mediates local assembly of the contractile ring, which is necessary for cytokinesis. RhoA is vital for muscle contraction; in vascular smooth muscle cells, RhoA plays a key role in cell contraction, differentiation, migration, and proliferation. RhoA activities appear to be elaborately regulated in a time- and space-dependent manner to control cytoskeletal changes. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. RhoA and RhoC are observed only in geranyl Probab=85.04 E-value=0.51 Score=26.44 Aligned_cols=21 Identities=29% Similarity=0.518 Sum_probs=17.7 Q ss_pred CEEEEECCCCCCCHHHHHHHH Q ss_conf 336552477886115899999 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~~I 183 (509) .|+.++|++|||||+|+.-.+ T Consensus 2 ~Ki~liGd~~VGKTsli~r~~ 22 (175) T cd01870 2 KKLVIVGDGACGKTCLLIVFS 22 (175) T ss_pred CEEEEECCCCCCHHHHHHHHH T ss_conf 099999989966999999997 No 456 >cd04130 Wrch_1 Wrch-1 subfamily. Wrch-1 (Wnt-1 responsive Cdc42 homolog) is a Rho family GTPase that shares significant sequence and functional similarity with Cdc42. Wrch-1 was first identified in mouse mammary epithelial cells, where its transcription is upregulated in Wnt-1 transformation. Wrch-1 contains N- and C-terminal extensions relative to cdc42, suggesting potential differences in cellular localization and function. The Wrch-1 N-terminal extension contains putative SH3 domain-binding motifs and has been shown to bind the SH3 domain-containing protein Grb2, which increases the level of active Wrch-1 in cells. Unlike Cdc42, which localizes to the cytosol and perinuclear membranes, Wrch-1 localizes extensively with the plasma membrane and endosomes. The membrane association, localization, and biological activity of Wrch-1 indicate an atypical model of regulation distinct from other Rho family GTPases. Most Rho proteins contain a lipid modification site at the C-terminus, Probab=85.02 E-value=0.54 Score=26.28 Aligned_cols=21 Identities=33% Similarity=0.564 Sum_probs=17.4 Q ss_pred EEEEECCCCCCCHHHHHHHHH Q ss_conf 365524778861158999999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~ 184 (509) |+.++|++|||||+|+.-.+- T Consensus 2 KvvlvGd~~VGKTsli~r~~~ 22 (173) T cd04130 2 KCVLVGDGAVGKTSLIVSYTT 22 (173) T ss_pred EEEEECCCCCCHHHHHHHHHH T ss_conf 899999899788999999961 No 457 >PRK05428 HPr kinase/phosphorylase; Provisional Probab=84.97 E-value=1.8 Score=22.55 Aligned_cols=24 Identities=29% Similarity=0.407 Sum_probs=20.9 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 683365524778861158999999 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) -|==++|.|+||+|||.++++.|- T Consensus 145 ~G~GVLI~G~SGiGKSE~aLeLI~ 168 (308) T PRK05428 145 YGIGVLITGESGIGKSETALELIK 168 (308) T ss_pred EEEEEEEECCCCCCHHHHHHHHHH T ss_conf 444899975888881289999998 No 458 >PRK13536 nodulation factor exporter subunit NodI; Provisional Probab=84.90 E-value=0.85 Score=24.85 Aligned_cols=35 Identities=17% Similarity=0.312 Sum_probs=27.1 Q ss_pred HHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 02331-0111156833655247788611589999999 Q gi|254780617|r 150 IKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 150 I~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I~n 185 (509) ..++| .=+.+.+|+-.+++|+.|.||||++ .+|.. T Consensus 20 ~~al~~vs~~v~~Gei~gllGpNGAGKSTli-~~l~G 55 (306) T PRK13536 20 KPVVNGLSFTVASGECFGLLGPNGAGKSTIA-RMILG 55 (306) T ss_pred EEEECCEEEEECCCCEEEEECCCCCCHHHHH-HHHHC T ss_conf 9987161778859969999999898099999-99967 No 459 >cd01893 Miro1 Miro1 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the N-terminal GTPase domain of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature. Probab=84.89 E-value=0.59 Score=26.03 Aligned_cols=20 Identities=40% Similarity=0.727 Sum_probs=16.8 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 2 Kiv~vGd~~VGKTsli~r~~ 21 (166) T cd01893 2 RIVLIGDEGVGKSSLIMSLV 21 (166) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 89999999989999999998 No 460 >PRK11288 araG L-arabinose transporter ATP-binding protein; Provisional Probab=84.87 E-value=0.6 Score=25.95 Aligned_cols=23 Identities=17% Similarity=0.380 Sum_probs=0.0 Q ss_pred CCCCCCCEEEEECCCCCCCHHHH Q ss_conf 11156833655247788611589 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +.+.+|...+|+|..|+|||||+ T Consensus 274 l~v~~GEivgivG~nGsGKSTL~ 296 (501) T PRK11288 274 FAVRRGEIVGFFGLVGAGRSELM 296 (501) T ss_pred EEEECCEEEEEECCCCCCHHHHH T ss_conf 78708839997568886487999 No 461 >pfam00308 Bac_DnaA Bacterial dnaA protein. Probab=84.85 E-value=0.88 Score=24.74 Aligned_cols=35 Identities=20% Similarity=0.283 Sum_probs=23.2 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEE Q ss_conf 65524778861158999999988630246678760699995 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVA 205 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~ 205 (509) ..|.|++|+|||.|+ .++.|+..... ....++|.- T Consensus 37 l~i~G~~G~GKTHLL-qA~~~~~~~~~-----~~~~v~yl~ 71 (219) T pfam00308 37 LFIYGGVGLGKTHLL-HAIGNYALRNF-----PNLRVVYLT 71 (219) T ss_pred EEEECCCCCCHHHHH-HHHHHHHHHHC-----CCCEEEEEE T ss_conf 699889999888999-99999999849-----998288843 No 462 >cd04176 Rap2 Rap2 subgroup. The Rap2 subgroup is part of the Rap subfamily of the Ras family. It consists of Rap2a, Rap2b, and Rap2c. Both isoform 3 of the human mitogen-activated protein kinase kinase kinase kinase 4 (MAP4K4) and Traf2- and Nck-interacting kinase (TNIK) are putative effectors of Rap2 in mediating the activation of c-Jun N-terminal kinase (JNK) to regulate the actin cytoskeleton. In human platelets, Rap2 was shown to interact with the cytoskeleton by binding the actin filaments. In embryonic Xenopus development, Rap2 is necessary for the Wnt/beta-catenin signaling pathway. The Rap2 interacting protein 9 (RPIP9) is highly expressed in human breast carcinomas and correlates with a poor prognosis, suggesting a role for Rap2 in breast cancer oncogenesis. Rap2b, but not Rap2a, Rap2c, Rap1a, or Rap1b, is expressed in human red blood cells, where it is believed to be involved in vesiculation. A number of additional effector proteins for Rap2 have been identified, incl Probab=84.79 E-value=0.56 Score=26.15 Aligned_cols=20 Identities=15% Similarity=0.479 Sum_probs=16.6 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 3 KivllGd~~VGKTsli~r~~ 22 (163) T cd04176 3 KVVVLGSGGVGKSALTVQFV 22 (163) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 99998999978999999997 No 463 >PRK08116 hypothetical protein; Validated Probab=84.72 E-value=2.5 Score=21.51 Aligned_cols=63 Identities=16% Similarity=0.296 Sum_probs=38.0 Q ss_pred HHHCCHHHHHCCCCCCC-CCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCC Q ss_conf 44416023310111156-8336552477886115899999998863024667876069999516776789999985114 Q gi|254780617|r 145 PLSTGIKAIDSLIPIGR-GQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALED 222 (509) Q Consensus 145 ~l~TGI~~ID~l~pigr-GQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~ 222 (509) .+....+-.+.+-.+.+ |.=+++.|++|||||-|+. .|.|+..... .-++|+- +.++++.++. T Consensus 90 a~~~a~~Y~~~f~~~~~~~~GLll~G~~GtGKThLa~-aIa~~l~~~g-------~~V~~~~-------~~~ll~~lk~ 153 (262) T PRK08116 90 AYKVAVKYVKKFEEMKKESVGLLLWGSPGNGKTYLAA-AIANELIEKG-------VPVVFVN-------VPELLNRIKS 153 (262) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHH-HHHHHHHHCC-------CEEEEEE-------HHHHHHHHHH T ss_conf 9999999999898736468618998989998999999-9999999879-------9399988-------9999999999 No 464 >KOG0731 consensus Probab=84.72 E-value=0.68 Score=25.56 Aligned_cols=73 Identities=16% Similarity=0.191 Sum_probs=35.1 Q ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHH Q ss_conf 6552477886115899999998863024667876069999516776789999985114674211589972887888999 Q gi|254780617|r 165 ELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASASDPAPMQ 243 (509) Q Consensus 165 ~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a~~~~~~r 243 (509) .++.|++|||||-||. +|..-++.--........+=.|++.| .+-++++... .....-++++...-|.....| T Consensus 347 vLL~GPPGTGKTLLAK-AiAGEAgVPF~svSGSEFvE~~~g~~--asrvr~lf~~---ar~~aP~iifideida~~~~r 419 (774) T KOG0731 347 VLLVGPPGTGKTLLAK-AIAGEAGVPFFSVSGSEFVEMFVGVG--ASRVRDLFPL---ARKNAPSIIFIDEIDAVGRKR 419 (774) T ss_pred EEEECCCCCCHHHHHH-HHHCCCCCCEEEECHHHHHHHHCCCC--HHHHHHHHHH---HHCCCCEEEEECCCCCCCCCC T ss_conf 1787899986789999-88530589646413378888760343--4888999987---432698079714542003125 No 465 >cd04140 ARHI_like ARHI subfamily. ARHI (A Ras homolog member I) is a member of the Ras family with several unique structural and functional properties. ARHI is expressed in normal human ovarian and breast tissue, but its expression is decreased or eliminated in breast and ovarian cancer. ARHI contains an N-terminal extension of 34 residues (human) that is required to retain its tumor suppressive activity. Unlike most other Ras family members, ARHI is maintained in the constitutively active (GTP-bound) state in resting cells and has modest GTPase activity. ARHI inhibits STAT3 (signal transducers and activators of transcription 3), a latent transcription factor whose abnormal activation plays a critical role in oncogenesis. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Ras proteins. Due to Probab=84.71 E-value=0.6 Score=25.97 Aligned_cols=21 Identities=29% Similarity=0.506 Sum_probs=17.4 Q ss_pred EEEEECCCCCCCHHHHHHHHH Q ss_conf 365524778861158999999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~ 184 (509) |+.++|++|||||+|+.-.+- T Consensus 3 KivllGd~~VGKTsli~r~~~ 23 (165) T cd04140 3 RVVVFGAGGVGKSSLVLRFVK 23 (165) T ss_pred EEEEECCCCCCHHHHHHHHHH T ss_conf 999989999769999999964 No 466 >PRK13549 xylose transporter ATP-binding subunit; Provisional Probab=84.69 E-value=0.66 Score=25.66 Aligned_cols=26 Identities=27% Similarity=0.378 Sum_probs=0.0 Q ss_pred HCCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 10111156833655247788611589 Q gi|254780617|r 154 DSLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 154 D~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) |.=+.+.+|...+|+|.+|+|||||+ T Consensus 280 ~vsf~v~~GEi~gi~G~nGsGKsTLl 305 (513) T PRK13549 280 DVSFSLRRGEILGIAGLVGAGRTELV 305 (513) T ss_pred CCEEEECCCCEEEEECCCCCCHHHHH T ss_conf 33578868848997479886589999 No 467 >COG0194 Gmk Guanylate kinase [Nucleotide transport and metabolism] Probab=84.66 E-value=2.9 Score=20.97 Aligned_cols=89 Identities=12% Similarity=0.175 Sum_probs=53.1 Q ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECC-CCHH----------HHHHHHHHHCCCCCCCEE Q ss_conf 68336552477886115899999998863024667876069999516-7767----------899999851146742115 Q gi|254780617|r 161 RGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYVAIG-QKRS----------SVARFVKALEDRGALSYS 229 (509) Q Consensus 161 rGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~~IG-er~~----------ev~~~~~~l~~~~~~~~t 229 (509) +|.=.-|.|+||+||+||+ .+++... + --+||=+--= .|.. +..+|-+-+.....+++. T Consensus 3 ~G~l~vlsgPSG~GKsTl~-k~L~~~~--~-------l~~SVS~TTR~pR~gEv~G~dY~Fvs~~EF~~~i~~~~fLE~a 72 (191) T COG0194 3 KGLLIVLSGPSGVGKSTLV-KALLEDD--K-------LRFSVSATTRKPRPGEVDGVDYFFVTEEEFEELIERDEFLEWA 72 (191) T ss_pred CCEEEEEECCCCCCHHHHH-HHHHHHC--C-------EEEEEEECCCCCCCCCCCCCEEEECCHHHHHHHHHCCCCEEEE T ss_conf 8639999899888889999-9998634--9-------3799985267999987578024757799999987568747887 Q ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf 89972887888999998876666667542046884799616 Q gi|254780617|r 230 IVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDD 270 (509) Q Consensus 230 vvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Dd 270 (509) -++-+- |= +...-.|--.++|+||++-+|= T Consensus 73 ~~~gny--------YG---T~~~~ve~~~~~G~~vildId~ 102 (191) T COG0194 73 EYHGNY--------YG---TSREPVEQALAEGKDVILDIDV 102 (191) T ss_pred EECCCC--------CC---CCHHHHHHHHHCCCEEEEEEEH T ss_conf 771973--------24---8688999998669908999853 No 468 >cd01874 Cdc42 Cdc42 subfamily. Cdc42 is an essential GTPase that belongs to the Rho family of Ras-like GTPases. These proteins act as molecular switches by responding to exogenous and/or endogenous signals and relaying those signals to activate downstream components of a biological pathway. Cdc42 transduces signals to the actin cytoskeleton to initiate and maintain polarized growth and to mitogen-activated protein morphogenesis. In the budding yeast Saccharomyces cerevisiae, Cdc42 plays an important role in multiple actin-dependent morphogenetic events such as bud emergence, mating-projection formation, and pseudohyphal growth. In mammalian cells, Cdc42 regulates a variety of actin-dependent events and induces the JNK/SAPK protein kinase cascade, which leads to the activation of transcription factors within the nucleus. Cdc42 mediates these processes through interactions with a myriad of downstream effectors, whose number and regulation we are just starting to understand. In addi Probab=84.61 E-value=0.6 Score=25.98 Aligned_cols=21 Identities=24% Similarity=0.532 Sum_probs=17.5 Q ss_pred EEEEECCCCCCCHHHHHHHHH Q ss_conf 365524778861158999999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~ 184 (509) |+.++|++|||||+|+.-.+- T Consensus 3 Kvv~lGd~~VGKTsli~r~~~ 23 (175) T cd01874 3 KCVVVGDGAVGKTCLLISYTT 23 (175) T ss_pred EEEEECCCCCCHHHHHHHHHH T ss_conf 999989999588999999964 No 469 >PRK11144 modC molybdate transporter ATP-binding protein; Provisional Probab=84.59 E-value=0.54 Score=26.26 Aligned_cols=31 Identities=23% Similarity=0.334 Sum_probs=24.1 Q ss_pred HHHCCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 33101111568336552477886115899999 Q gi|254780617|r 152 AIDSLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 152 ~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) .+|.=+.|.+|.-..|+|++|+|||||+ .+| T Consensus 14 ~ldv~l~i~~g~i~~l~GpsGaGKTTLl-~~i 44 (352) T PRK11144 14 CLTVNLTLPAQGITAIFGRSGAGKTSLI-NLI 44 (352) T ss_pred EEEEEEEECCCCEEEEECCCCCHHHHHH-HHH T ss_conf 9999999889989999999996299999-999 No 470 >TIGR02788 VirB11 P-type DNA transfer ATPase VirB11; InterPro: IPR014155 This entry contains VirB11, a protein that is found in the vir locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for DNA transfer . VirB11 is believed to be an ATPase and is a homologue of the P-like conjugation system TrbB protein and the Flp pilus system protein TadA .. Probab=84.50 E-value=0.71 Score=25.43 Aligned_cols=231 Identities=19% Similarity=0.204 Sum_probs=115.4 Q ss_pred EEEEEE-ECCEEEEECCCCCCCCCEEEE--CCC-EEE--EEEEECCCEEEEEEECCCCCCCCCCEEEECCCEEEEECCHH Q ss_conf 589998-275899975888773747995--798-189--99920598599998058668678988897798788667946 Q gi|254780617|r 29 GRVLSI-GDGIARVYGLNNIRAGEMVQF--SHG-VYG--MALNLEVDNVGVVILGSYKEISEGDIVKRTGRIVDVPVGLE 102 (509) Q Consensus 29 G~V~~V-~~giv~v~GL~~a~~GElv~~--~~g-~~G--~Vi~l~~d~v~i~~l~~~~gI~~G~~V~~tg~~~~vpvG~~ 102 (509) ++.++- ..+|.-++ || -||.++| +-- ..+ +.+.++.....=+.|++...=-.=+.++. . T Consensus 59 ~q~is~y~~P~LSa~-LP---~G~RvQ~V~PPAc~~~eTvs~tIRKpS~~~~sL~dy~~~G~F~~ar~----~------- 123 (328) T TIGR02788 59 KQSISEYENPILSAT-LP---GGERVQIVIPPACENDETVSITIRKPSLVDLSLDDYEEKGFFDTARA----V------- 123 (328) T ss_pred CCCCCCCCCCEEEEE-CC---CCCEEEEEECCCCCCCCEEEEEEECCCCCCCCHHHHHHCCCCCEEEE----E------- T ss_conf 862010228637888-69---99479998068758988589999526444554799962798544777----6------- Q ss_pred HHHCCCCHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCCCEEEEECCCCCCCHHHHHHH Q ss_conf 98300030023144664445444100101357731126600144416023310111156833655247788611589999 Q gi|254780617|r 103 LLGRVVDALGNPIDGKGPIKCEQRSCTEADAPGIIQRQSVCEPLSTGIKAIDSLIPIGRGQRELIIGDRKTGKTSIILDT 182 (509) Q Consensus 103 lLGRVvD~lG~PlDg~g~i~~~~~~~i~~~~p~~~~R~~i~~~l~TGI~~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~ 182 (509) ++...+.-.|-.. .-++ ..+-..+.+.|..= |-.++.+-|.||.|+||||++ .+ T Consensus 124 ----~~~~~~~~~d~~~-------~L~e-----l~~~g~~~~Fl~~A---------i~~~knIii~GGTgSGKTTf~-ka 177 (328) T TIGR02788 124 ----VVPASTELSDKDE-------ELLE-----LLDAGDIKEFLRLA---------IASRKNIIISGGTGSGKTTFL-KA 177 (328) T ss_pred ----ECCCCCCCCHHHH-------HHHH-----HHHCCCHHHHHHHH---------HHCCCEEEEEECCCCHHHHHH-HH T ss_conf ----3144344346899-------9999-----98628887999999---------873891999906897189999-99 Q ss_pred HHHHHHHCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECC-CCCHHHHHHHHHHHHHHHHHHH-CC Q ss_conf 999886302466787606999951677678999998511467421158997288-7888999998876666667542-04 Q gi|254780617|r 183 FLNQKSSHDKGSEKDKVYCIYVAIGQKRSSVARFVKALEDRGALSYSIVVVASA-SDPAPMQLLAPFAGCAMGEYFR-DN 260 (509) Q Consensus 183 I~nq~~~~~~~~~~~~v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvvv~a~a-~~~~~~r~~ap~~a~aiAEyfr-~~ 260 (509) ++++.=.+ ||==.+.+..|-.... --++.-++++.. ++... -..|-..+.-+=-+| |+ T Consensus 178 l~~~IP~~-----------------ER~iTIED~~E~~~~h-hpN~V~L~ysk~v~~g~~--~vt~~~Ll~scLRMrPDR 237 (328) T TIGR02788 178 LVKEIPKD-----------------ERLITIEDTRELFLPH-HPNKVHLFYSKGVGQGSA--KVTPKDLLESCLRMRPDR 237 (328) T ss_pred HHHCCCCC-----------------CCEEEEEEEECCCCCC-CCCEEEEEECCCCCCCCC--CCCHHHHHHHHHCCCCCH T ss_conf 97327622-----------------5278885201147888-986456553464234435--689899999971177405 Q ss_pred CCCEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCC-CCCEEEEEEEE----------- Q ss_conf 688479961647899999875453037875433375200124678888750454458-87356753100----------- Q gi|254780617|r 261 GYHALIAYDDLQKHAVAYRQLSLLLRRPPGREAYPGDVFYLHSRLLERAAKMSDALG-AGSLTALPVIE----------- 328 (509) Q Consensus 261 G~~VLi~~Ddltr~A~a~reisl~~~~pp~~~gyp~~vf~~~s~l~ERa~~~~~~~g-~GSiT~~~~v~----------- 328 (509) ++|+|.=|+|+| +.+ |+. ..| .||||.+=.=. T Consensus 238 ----------------------I~LgELRG~Eaf-----~F~-----~~~----nsGHpGsiTT~HA~s~~~Af~qla~l 281 (328) T TIGR02788 238 ----------------------ILLGELRGDEAF-----DFI-----RAV----NSGHPGSITTVHAGSPEEAFEQLALL 281 (328) T ss_pred ----------------------HHHHHHCCHHHH-----HHH-----HHH----CCCCCCCEEEEEECCHHHHHHHHHHH T ss_conf ----------------------767430332578-----888-----752----05988605678718989999999998 Q ss_pred ----CCCCCCC-CCHHHHHHHHHCCEEEEECCH Q ss_conf ----3676644-311466885406169961501 Q gi|254780617|r 329 ----TQVNDVS-AYIPTNVISITDGQIFLETEL 356 (509) Q Consensus 329 ----~~~~D~~-~~i~~~~~si~DG~i~l~~~l 356 (509) -.|-++. |+|-..++-+.|=-|++.++. T Consensus 282 ~k~s~~g~gL~~~~I~~~~~~~iDiVvq~~~dv 314 (328) T TIGR02788 282 VKESQAGLGLDYADIVKLLREVIDIVVQLERDV 314 (328) T ss_pred HHCCHHHCCCCHHHHHHHHHHCCCEEEEEECCE T ss_conf 720254358888999977421144789963143 No 471 >cd04139 RalA_RalB RalA/RalB subfamily. The Ral (Ras-like) subfamily consists of the highly homologous RalA and RalB. Ral proteins are believed to play a crucial role in tumorigenesis, metastasis, endocytosis, and actin cytoskeleton dynamics. Despite their high sequence similarity (80% sequence identity), nonoverlapping and opposing functions have been assigned to RalA and RalBs in tumor migration. In human bladder and prostate cancer cells, RalB promotes migration while RalA inhibits it. A Ral-specific set of GEFs has been identified that are activated by Ras binding. This RalGEF activity is enhanced by Ras binding to another of its target proteins, phosphatidylinositol 3-kinase (PI3K). Ral effectors include RLIP76/RalBP1, a Rac/cdc42 GAP, and the exocyst (Sec6/8) complex, a heterooctomeric protein complex that is involved in tethering vesicles to specific sites on the plasma membrane prior to exocytosis. In rat kidney cells, RalB is required for functional assembly of the exo Probab=84.50 E-value=0.6 Score=25.98 Aligned_cols=20 Identities=20% Similarity=0.549 Sum_probs=16.7 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 2 KvvlvGd~~VGKTsli~r~~ 21 (164) T cd04139 2 KVIVVGAGGVGKSALTLQFM 21 (164) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 79999999988999999997 No 472 >TIGR00968 3a0106s01 sulfate ABC transporter, ATP-binding protein; InterPro: IPR005666 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). These proteins are involved in the transmembrane transport of sulphate and thiosulphate.; GO: 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0016020 membrane. Probab=84.48 E-value=0.64 Score=25.75 Aligned_cols=23 Identities=17% Similarity=0.511 Sum_probs=20.2 Q ss_pred CCCCCCCEEEEECCCCCCCHHHH Q ss_conf 11156833655247788611589 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +.|-.|+=..++|++|.|||||+ T Consensus 21 l~v~~G~lvaLLGPSGSGKsTLL 43 (241) T TIGR00968 21 LEVPTGSLVALLGPSGSGKSTLL 43 (241) T ss_pred EEECCCEEEEEECCCCCCHHHHH T ss_conf 57438527985468987378999 No 473 >KOG1707 consensus Probab=84.48 E-value=0.77 Score=25.18 Aligned_cols=82 Identities=17% Similarity=0.172 Sum_probs=44.8 Q ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHC---------CCCCCCCC-CEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 83365524778861158999999988630---------24667876-069999516776789999985114674211589 Q gi|254780617|r 162 GQRELIIGDRKTGKTSIILDTFLNQKSSH---------DKGSEKDK-VYCIYVAIGQKRSSVARFVKALEDRGALSYSIV 231 (509) Q Consensus 162 GQR~~I~g~~g~GKt~l~~~~I~nq~~~~---------~~~~~~~~-v~~V~~~IGer~~ev~~~~~~l~~~~~~~~tvv 231 (509) -=|+.++||.|||||+|.+..+-+-=--+ .-.+..++ +-..++-++.+..+-..+.+++++.+ -..+ T Consensus 9 dVRIvliGD~G~GKtSLImSL~~eef~~~VP~rl~~i~IPadvtPe~vpt~ivD~ss~~~~~~~l~~EirkA~---vi~l 85 (625) T KOG1707 9 DVRIVLIGDEGVGKTSLIMSLLEEEFVDAVPRRLPRILIPADVTPENVPTSIVDTSSDSDDRLCLRKEIRKAD---VICL 85 (625) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHCC---EEEE T ss_conf 2599997788866899999987633545345557761158756767672188743666425688999986458---8999 Q ss_pred EEECCCCCHHHHHHH Q ss_conf 972887888999998 Q gi|254780617|r 232 VVASASDPAPMQLLA 246 (509) Q Consensus 232 v~a~a~~~~~~r~~a 246 (509) +.+-.+++...|.-. T Consensus 86 vyavd~~~T~D~ist 100 (625) T KOG1707 86 VYAVDDESTVDRIST 100 (625) T ss_pred EEECCCHHHHHHHHH T ss_conf 985387687654443 No 474 >cd04106 Rab23_lke Rab23-like subfamily. Rab23 is a member of the Rab family of small GTPases. In mouse, Rab23 has been shown to function as a negative regulator in the sonic hedgehog (Shh) signalling pathway. Rab23 mediates the activity of Gli2 and Gli3, transcription factors that regulate Shh signaling in the spinal cord, primarily by preventing Gli2 activation in the absence of Shh ligand. Rab23 also regulates a step in the cytoplasmic signal transduction pathway that mediates the effect of Smoothened (one of two integral membrane proteins that are essential components of the Shh signaling pathway in vertebrates). In humans, Rab23 is expressed in the retina. Mice contain an isoform that shares 93% sequence identity with the human Rab23 and an alternative splicing isoform that is specific to the brain. This isoform causes the murine open brain phenotype, indicating it may have a role in the development of the central nervous system. GTPase activating proteins (GAPs) interact with G Probab=84.46 E-value=0.64 Score=25.77 Aligned_cols=20 Identities=25% Similarity=0.574 Sum_probs=16.9 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 2 KivvlGd~~VGKTsLi~r~~ 21 (162) T cd04106 2 KVIVVGNGNVGKSSMIQRFV 21 (162) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 89999999988999999998 No 475 >cd04135 Tc10 TC10 subfamily. TC10 is a Rho family protein that has been shown to induce microspike formation and neurite outgrowth in vitro. Its expression changes dramatically after peripheral nerve injury, suggesting an important role in promoting axonal outgrowth and regeneration. TC10 regulates translocation of insulin-stimulated GLUT4 in adipocytes and has also been shown to bind directly to Golgi COPI coat proteins. GTP-bound TC10 in vitro can bind numerous potential effectors. Depending on its subcellular localization and distinct functional domains, TC10 can differentially regulate two types of filamentous actin in adipocytes. TC10 mRNAs are highly expressed in three types of mouse muscle tissues: leg skeletal muscle, cardiac muscle, and uterus; they were also present in brain, with higher levels in adults than in newborns. TC10 has also been shown to play a role in regulating the expression of cystic fibrosis transmembrane conductance regulator (CFTR) through interacti Probab=84.45 E-value=0.6 Score=25.98 Aligned_cols=20 Identities=25% Similarity=0.579 Sum_probs=17.0 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|+|||+|..-.+ T Consensus 2 Kiv~vGd~~VGKTsli~rf~ 21 (174) T cd04135 2 KCVVVGDGAVGKTCLLMSYA 21 (174) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 89999989985999999996 No 476 >cd00157 Rho Rho (Ras homology) family. Members of the Rho family include RhoA, Cdc42, Rac, Rnd, Wrch1, RhoBTB, and Rop. There are 22 human Rho family members identified currently. These proteins are all involved in the reorganization of the actin cytoskeleton in response to external stimuli. They also have roles in cell transformation by Ras in cytokinesis, in focal adhesion formation and in the stimulation of stress-activated kinase. These various functions are controlled through distinct effector proteins and mediated through a GTP-binding/GTPase cycle involving three classes of regulating proteins: GAPs (GTPase-activating proteins), GEFs (guanine nucleotide exchange factors), and GDIs (guanine nucleotide dissociation inhibitors). Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho protein Probab=84.44 E-value=0.54 Score=26.29 Aligned_cols=20 Identities=25% Similarity=0.586 Sum_probs=16.6 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 2 Ki~llGd~~VGKTsli~r~~ 21 (171) T cd00157 2 KIVVVGDGAVGKTCLLISYT 21 (171) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 89999999966999999996 No 477 >KOG3883 consensus Probab=84.39 E-value=0.99 Score=24.37 Aligned_cols=83 Identities=18% Similarity=0.269 Sum_probs=43.3 Q ss_pred CCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCE-EEEEECCCCHHHHHHHHHHHCCCCC----------- Q ss_conf 1156833655247788611589999999886302466787606-9999516776789999985114674----------- Q gi|254780617|r 158 PIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVY-CIYVAIGQKRSSVARFVKALEDRGA----------- 225 (509) Q Consensus 158 pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~-~V~~~IGer~~ev~~~~~~l~~~~~----------- 225 (509) -+||+-|..+.|..++|||+|+-..... .+....+....+ -||++-=|.+|-++|.+.-....|. T Consensus 5 kmGk~~kVvVcG~k~VGKTaileQl~yg---~~~~~~e~~pTiEDiY~~svet~rgarE~l~lyDTaGlq~~~~eLprhy 81 (198) T KOG3883 5 KMGKVCKVVVCGMKSVGKTAILEQLLYG---NHVPGTELHPTIEDIYVASVETDRGAREQLRLYDTAGLQGGQQELPRHY 81 (198) T ss_pred HHCCCEEEEEECCCCCCHHHHHHHHHHC---CCCCCCCCCCCHHHHEEEEEECCCCHHHEEEEEECCCCCCCHHHHHHHH T ss_conf 1176407999777452289999999851---6788976666344224676406878101688865244557600202767 Q ss_pred ---CCEEEEEEECCCCCHHHH Q ss_conf ---211589972887888999 Q gi|254780617|r 226 ---LSYSIVVVASASDPAPMQ 243 (509) Q Consensus 226 ---~~~tvvv~a~a~~~~~~r 243 (509) .+--++|.+++|--...| T Consensus 82 ~q~aDafVLVYs~~d~eSf~r 102 (198) T KOG3883 82 FQFADAFVLVYSPMDPESFQR 102 (198) T ss_pred HCCCCEEEEEECCCCHHHHHH T ss_conf 105765799963799889889 No 478 >PRK10787 DNA-binding ATP-dependent protease La; Provisional Probab=84.34 E-value=0.93 Score=24.56 Aligned_cols=31 Identities=26% Similarity=0.274 Sum_probs=25.2 Q ss_pred HHHCCCCCCCCCEEEEECCCCCCCHHHHHHH Q ss_conf 3310111156833655247788611589999 Q gi|254780617|r 152 AIDSLIPIGRGQRELIIGDRKTGKTSIILDT 182 (509) Q Consensus 152 ~ID~l~pigrGQR~~I~g~~g~GKt~l~~~~ 182 (509) |+-.+.+=.+|.=+++.|++|||||+|+... T Consensus 339 Av~~~~~~~kg~IlclvGpPGvGKTSl~~sI 369 (784) T PRK10787 339 AVQSRVNKIKGPILCLVGPPGVGKTSLGQSI 369 (784) T ss_pred HHHHHCCCCCCCEEEEECCCCCCHHHHHHHH T ss_conf 9998624677877996469987724699999 No 479 >cd01873 RhoBTB RhoBTB subfamily. Members of the RhoBTB subfamily of Rho GTPases are present in vertebrates, Drosophila, and Dictyostelium. RhoBTB proteins are characterized by a modular organization, consisting of a GTPase domain, a proline rich region, a tandem of two BTB (Broad-Complex, Tramtrack, and Bric a brac) domains, and a C-terminal region of unknown function. RhoBTB proteins may act as docking points for multiple components participating in signal transduction cascades. RhoBTB genes appeared upregulated in some cancer cell lines, suggesting a participation of RhoBTB proteins in the pathogenesis of particular tumors. Note that the Dictyostelium RacA GTPase domain is more closely related to Rac proteins than to RhoBTB proteins, where RacA actually belongs. Thus, the Dictyostelium RacA is not included here. Most Rho proteins contain a lipid modification site at the C-terminus; however, RhoBTB is one of few Rho subfamilies that lack this feature. Probab=84.34 E-value=0.71 Score=25.42 Aligned_cols=22 Identities=32% Similarity=0.543 Sum_probs=17.8 Q ss_pred EEEEECCCCCCCHHHHHHHHHH Q ss_conf 3655247788611589999999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~n 185 (509) |+.++||++||||+|++....| T Consensus 4 KiVlvGDs~VGKTsLl~~~~~n 25 (195) T cd01873 4 KCVVVGDNAVGKTRLICARACN 25 (195) T ss_pred EEEEECCCCCCHHHHHHHHHHC T ss_conf 9999878998989999778747 No 480 >COG0411 LivG ABC-type branched-chain amino acid transport systems, ATPase component [Amino acid transport and metabolism] Probab=84.31 E-value=0.39 Score=27.33 Aligned_cols=32 Identities=25% Similarity=0.482 Sum_probs=27.8 Q ss_pred CCHHHHH-CCCCCCCCCEEEEECCCCCCCHHHH Q ss_conf 1602331-0111156833655247788611589 Q gi|254780617|r 148 TGIKAID-SLIPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 148 TGI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~ 179 (509) -|++|+| .=+.+-+|...+|+|+-|.||||+. T Consensus 15 GGl~Al~~Vsl~v~~Gei~~LIGPNGAGKTTlf 47 (250) T COG0411 15 GGLTAVNDVSLEVRPGEIVGLIGPNGAGKTTLF 47 (250) T ss_pred CCEEEEECEEEEECCCEEEEEECCCCCCCEEEE T ss_conf 778997041478738728999889988824566 No 481 >cd01863 Rab18 Rab18 subfamily. Mammalian Rab18 is implicated in endocytic transport and is expressed most highly in polarized epithelial cells. However, trypanosomal Rab, TbRAB18, is upregulated in the BSF (Blood Stream Form) stage and localized predominantly to elements of the Golgi complex. In human and mouse cells, Rab18 has been identified in lipid droplets, organelles that store neutral lipids. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of mos Probab=84.08 E-value=0.61 Score=25.88 Aligned_cols=18 Identities=44% Similarity=0.807 Sum_probs=15.8 Q ss_pred EEEEECCCCCCCHHHHHH Q ss_conf 365524778861158999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILD 181 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~ 181 (509) |+.++|++|+|||+|..- T Consensus 2 KivvvG~~~vGKTsli~r 19 (161) T cd01863 2 KILLIGDSGVGKSSLLLR 19 (161) T ss_pred EEEEECCCCCCHHHHHHH T ss_conf 899999799579999999 No 482 >cd04107 Rab32_Rab38 Rab38/Rab32 subfamily. Rab32 and Rab38 are members of the Rab family of small GTPases. Human Rab32 was first identified in platelets but it is expressed in a variety of cell types, where it functions as an A-kinase anchoring protein (AKAP). Rab38 has been shown to be melanocyte-specific. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins. Probab=84.08 E-value=0.64 Score=25.77 Aligned_cols=20 Identities=50% Similarity=0.637 Sum_probs=16.8 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 2 KvvllGd~gVGKTsLi~rf~ 21 (201) T cd04107 2 KVLVIGDLGVGKTSIIKRYV 21 (201) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 79999999978999999998 No 483 >cd04119 RJL RJL (RabJ-Like) subfamily. RJLs are found in many protists and as chimeras with C-terminal DNAJ domains in deuterostome metazoa. They are not found in plants, fungi, and protostome metazoa, suggesting a horizontal gene transfer between protists and deuterostome metazoa. RJLs lack any known membrane targeting signal and contain a degenerate phosphate/magnesium-binding 3 (PM3) motif, suggesting an impaired ability to hydrolyze GTP. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Probab=84.05 E-value=0.62 Score=25.86 Aligned_cols=19 Identities=26% Similarity=0.473 Sum_probs=16.1 Q ss_pred EEEEECCCCCCCHHHHHHH Q ss_conf 3655247788611589999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDT 182 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~ 182 (509) |+.++|++|+|||+|+.-. T Consensus 2 KivvvG~~~vGKTSLi~r~ 20 (168) T cd04119 2 KVISMGNSGVGKSCIIKRY 20 (168) T ss_pred EEEEECCCCCCHHHHHHHH T ss_conf 8999995995689999999 No 484 >cd04110 Rab35 Rab35 subfamily. Rab35 is one of several Rab proteins to be found to participate in the regulation of osteoclast cells in rats. In addition, Rab35 has been identified as a protein that interacts with nucleophosmin-anaplastic lymphoma kinase (NPM-ALK) in human cells. Overexpression of NPM-ALK is a key oncogenic event in some anaplastic large-cell lymphomas; since Rab35 interacts with N|PM-ALK, it may provide a target for cancer treatments. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is Probab=84.02 E-value=0.61 Score=25.92 Aligned_cols=20 Identities=45% Similarity=0.670 Sum_probs=16.8 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 8 KIvlvGd~~VGKTSli~r~~ 27 (199) T cd04110 8 KLLIIGDSGVGKSSLLLRFA 27 (199) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 99999979988899999995 No 485 >cd04129 Rho2 Rho2 subfamily. Rho2 is a fungal GTPase that plays a role in cell morphogenesis, control of cell wall integrity, control of growth polarity, and maintenance of growth direction. Rho2 activates the protein kinase C homolog Pck2, and Pck2 controls Mok1, the major (1-3) alpha-D-glucan synthase. Together with Rho1 (RhoA), Rho2 regulates the construction of the cell wall. Unlike Rho1, Rho2 is not an essential protein, but its overexpression is lethal. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for proper intracellular localization via membrane attachment. As with other Rho family GTPases, the GDP/GTP cycling is regulated by GEFs (guanine nucleotide exchange factors), GAPs (GTPase-activating proteins) and GDIs (guanine nucleotide dissociation inhibitors). Probab=83.99 E-value=0.66 Score=25.66 Aligned_cols=21 Identities=33% Similarity=0.483 Sum_probs=17.4 Q ss_pred CEEEEECCCCCCCHHHHHHHH Q ss_conf 336552477886115899999 Q gi|254780617|r 163 QRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 163 QR~~I~g~~g~GKt~l~~~~I 183 (509) -|+.++|+++||||+|+.-.+ T Consensus 2 ~KivllGd~~VGKTsLi~r~~ 22 (187) T cd04129 2 RKLVIVGDGACGKTSLLSVFT 22 (187) T ss_pred EEEEEECCCCCCHHHHHHHHH T ss_conf 199999989976899999998 No 486 >cd00876 Ras Ras family. The Ras family of the Ras superfamily includes classical N-Ras, H-Ras, and K-Ras, as well as R-Ras, Rap, Ral, Rheb, Rhes, ARHI, RERG, Rin/Rit, RSR1, RRP22, Ras2, Ras-dva, and RGK proteins. Ras proteins regulate cell growth, proliferation and differentiation. Ras is activated by guanine nucleotide exchange factors (GEFs) that release GDP and allow GTP binding. Many RasGEFs have been identified. These are sequestered in the cytosol until activation by growth factors triggers recruitment to the plasma membrane or Golgi, where the GEF colocalizes with Ras. Active GTP-bound Ras interacts with several effector proteins: among the best characterized are the Raf kinases, phosphatidylinositol 3-kinase (PI3K), RalGEFs and NORE/MST1. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of m Probab=83.93 E-value=0.57 Score=26.10 Aligned_cols=21 Identities=24% Similarity=0.650 Sum_probs=16.6 Q ss_pred EEEEECCCCCCCHHHHHHHHHH Q ss_conf 3655247788611589999999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLN 185 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~n 185 (509) |+.|+|++|||||+|+. ..++ T Consensus 1 Ki~ivG~~~vGKTsli~-r~~~ 21 (160) T cd00876 1 KVVVLGAGGVGKSAITI-QFVK 21 (160) T ss_pred CEEEECCCCCCHHHHHH-HHHH T ss_conf 99999969967999999-9961 No 487 >PRK00098 ribosome-associated GTPase; Reviewed Probab=83.91 E-value=0.58 Score=26.04 Aligned_cols=15 Identities=20% Similarity=0.264 Sum_probs=10.3 Q ss_pred EEEEEEEECCEEEEE Q ss_conf 358999827589997 Q gi|254780617|r 28 IGRVLSIGDGIARVY 42 (509) Q Consensus 28 ~G~V~~V~~giv~v~ 42 (509) .|+|++..++...|. T Consensus 2 ~G~Vi~~~~~~y~V~ 16 (298) T PRK00098 2 EGKIIKALGGFYYVE 16 (298) T ss_pred EEEEEEEECCEEEEE T ss_conf 589999999999999 No 488 >cd04114 Rab30 Rab30 subfamily. Rab30 appears to be associated with the Golgi stack. It is expressed in a wide variety of tissue types and in humans maps to chromosome 11. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation. Probab=83.90 E-value=0.68 Score=25.55 Aligned_cols=20 Identities=25% Similarity=0.453 Sum_probs=16.8 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 9 KivllGd~~VGKTsli~r~~ 28 (169) T cd04114 9 KIVLIGNAGVGKTCLVRRFT 28 (169) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 99999989979999999998 No 489 >cd01860 Rab5_related Rab5-related subfamily. This subfamily includes Rab5 and Rab22 of mammals, Ypt51/Ypt52/Ypt53 of yeast, and RabF of plants. The members of this subfamily are involved in endocytosis and endocytic-sorting pathways. In mammals, Rab5 GTPases localize to early endosomes and regulate fusion of clathrin-coated vesicles to early endosomes and fusion between early endosomes. In yeast, Ypt51p family members similarly regulate membrane trafficking through prevacuolar compartments. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence mo Probab=83.83 E-value=0.65 Score=25.72 Aligned_cols=20 Identities=30% Similarity=0.659 Sum_probs=16.8 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|+|||+|+.-.+ T Consensus 3 KivviGd~~vGKTsli~r~~ 22 (163) T cd01860 3 KLVLLGDSSVGKSSLVLRFV 22 (163) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 99999959968999999994 No 490 >KOG1434 consensus Probab=83.83 E-value=0.57 Score=26.09 Aligned_cols=123 Identities=18% Similarity=0.274 Sum_probs=56.8 Q ss_pred HHHCCHHHHHCCCCCCCCCE--EEEECCCCCCCHHHHHHHHHH-HHHHCCCCCCCCCCEEEEEEC-C-CCHHHHHHHHHH Q ss_conf 44416023310111156833--655247788611589999999-886302466787606999951-6-776789999985 Q gi|254780617|r 145 PLSTGIKAIDSLIPIGRGQR--ELIIGDRKTGKTSIILDTFLN-QKSSHDKGSEKDKVYCIYVAI-G-QKRSSVARFVKA 219 (509) Q Consensus 145 ~l~TGI~~ID~l~pigrGQR--~~I~g~~g~GKt~l~~~~I~n-q~~~~~~~~~~~~v~~V~~~I-G-er~~ev~~~~~~ 219 (509) .+.||-.++|.++-=|---+ .-|||.+++|||-+....-.+ |.-...+... -++ ||.-- | =|...+.++-+. T Consensus 96 ~ItTgs~~lD~ILGGGI~~m~iTEifGefr~GKTQlshtLcVt~QlPr~~Gg~~-Gk~--ifIDTEgtFrpdRi~~IAe~ 172 (335) T KOG1434 96 SITTGSSALDDILGGGIPSMSITEIFGEFRCGKTQLSHTLCVTVQLPREMGGVG-GKA--IFIDTEGTFRPDRIKDIAER 172 (335) T ss_pred EEECCHHHHHHHHCCCCCCHHHHHHCCCCCCCCCCEEEEEEEEEECCHHHCCCC-CEE--EEEECCCCCCHHHHHHHHHH T ss_conf 463256777656248853102687707877670102358999765643218877-428--99925886145899999987 Q ss_pred H--CCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHH Q ss_conf 1--14674211589972887888999998876666667542046884799616478999 Q gi|254780617|r 220 L--EDRGALSYSIVVVASASDPAPMQLLAPFAGCAMGEYFRDNGYHALIAYDDLQKHAV 276 (509) Q Consensus 220 l--~~~~~~~~tvvv~a~a~~~~~~r~~ap~~a~aiAEyfr~~G~~VLi~~Ddltr~A~ 276 (509) + .....|++.. .+-+-.+ -+|+--. .-++++|-+.|+.+|++.|+++-+-+ T Consensus 173 ~~~d~d~~LdNI~--y~Ra~~s-e~qmelv---~~L~~~~se~g~~rlvIVDsIma~FR 225 (335) T KOG1434 173 FKVDPDFTLDNIL--YFRAYNS-EEQMELV---YLLGDFLSEHGKYRLVIVDSIMALFR 225 (335) T ss_pred HCCCHHHHHHHHH--HHHHCCH-HHHHHHH---HHHHHHHHHCCCEEEEEEECEEHHEE T ss_conf 4889889887778--8777286-9999999---99888874448579999800000302 No 491 >cd04122 Rab14 Rab14 subfamily. Rab14 GTPases are localized to biosynthetic compartments, including the rough ER, the Golgi complex, and the trans-Golgi network, and to endosomal compartments, including early endosomal vacuoles and associated vesicles. Rab14 is believed to function in both the biosynthetic and recycling pathways between the Golgi and endosomal compartments. Rab14 has also been identified on GLUT4 vesicles, and has been suggested to help regulate GLUT4 translocation. In addition, Rab14 is believed to play a role in the regulation of phagocytosis. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GT Probab=83.81 E-value=0.65 Score=25.72 Aligned_cols=20 Identities=30% Similarity=0.516 Sum_probs=16.5 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|++|||||+|+.-.+ T Consensus 4 KivlvGd~~VGKTsli~r~~ 23 (166) T cd04122 4 KYIIIGDMGVGKSCLLHQFT 23 (166) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 99999999957999999991 No 492 >cd04146 RERG_RasL11_like RERG/RasL11-like subfamily. RERG (Ras-related and Estrogen- Regulated Growth inhibitor) and Ras-like 11 are members of a novel subfamily of Ras that were identified based on their behavior in breast and prostate tumors, respectively. RERG expression was decreased or lost in a significant fraction of primary human breast tumors that lack estrogen receptor and are correlated with poor clinical prognosis. Elevated RERG expression correlated with favorable patient outcome in a breast tumor subtype that is positive for estrogen receptor expression. In contrast to most Ras proteins, RERG overexpression inhibited the growth of breast tumor cells in vitro and in vivo. RasL11 was found to be ubiquitously expressed in human tissue, but down-regulated in prostate tumors. Both RERG and RasL11 lack the C-terminal CaaX prenylation motif, where a = an aliphatic amino acid and X = any amino acid, and are localized primarily in the cytoplasm. Both are believed to have tu Probab=83.75 E-value=0.63 Score=25.81 Aligned_cols=20 Identities=15% Similarity=0.489 Sum_probs=16.3 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 36552477886115899999 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I 183 (509) |+.++|+++||||+|+.--+ T Consensus 1 Kiv~vGd~~VGKTsli~rf~ 20 (165) T cd04146 1 KIAVLGASGVGKSALVVRFL 20 (165) T ss_pred CEEEECCCCCCHHHHHHHHH T ss_conf 99999989977899999997 No 493 >COG1125 OpuBA ABC-type proline/glycine betaine transport systems, ATPase components [Amino acid transport and metabolism] Probab=83.75 E-value=0.94 Score=24.53 Aligned_cols=34 Identities=29% Similarity=0.473 Sum_probs=27.2 Q ss_pred CHHHHH-CCCCCCCCCEEEEECCCCCCCHHHHHHHH Q ss_conf 602331-01111568336552477886115899999 Q gi|254780617|r 149 GIKAID-SLIPIGRGQRELIIGDRKTGKTSIILDTF 183 (509) Q Consensus 149 GI~~ID-~l~pigrGQR~~I~g~~g~GKt~l~~~~I 183 (509) +-+++| .-++|-.|--.-++|.||+||||++ -|| T Consensus 13 ~~~av~~v~l~I~~gef~vliGpSGsGKTTtL-kMI 47 (309) T COG1125 13 NKKAVDDVNLTIEEGEFLVLIGPSGSGKTTTL-KMI 47 (309) T ss_pred CCEEEEEEEEEECCCEEEEEECCCCCCHHHHH-HHH T ss_conf 73233222577659728999878997578799-999 No 494 >TIGR02903 spore_lon_C ATP-dependent protease, Lon family; InterPro: IPR014252 This entry shows some relation to the widely distributed ATP-dependent protease La, also called Lon or LonA (IPR004815 from INTERPRO), but is more closely related to LonB (IPR014251 from INTERPRO), a LonA paralog found only in endospore-forming bacteria. Proteins in this entry are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ). They are restricted to a subset of endospore-forming species, and probably participate in the program of endospore formation. We propose the designation LonC.. Probab=83.73 E-value=0.58 Score=26.04 Aligned_cols=24 Identities=25% Similarity=0.450 Sum_probs=10.0 Q ss_pred HHHHHHHHHCCCCCCCEEEEEEEC Q ss_conf 899999851146742115899728 Q gi|254780617|r 212 SVARFVKALEDRGALSYSIVVVAS 235 (509) Q Consensus 212 ev~~~~~~l~~~~~~~~tvvv~a~ 235 (509) .|-++++.|-+.||-.-=|++=|| T Consensus 308 NvPkYIK~lFe~GAPADFvLIGAT 331 (616) T TIGR02903 308 NVPKYIKKLFEEGAPADFVLIGAT 331 (616) T ss_pred CCCHHHHHHHCCCCCCCEEEECCC T ss_conf 865588885226888256872661 No 495 >KOG0079 consensus Probab=83.72 E-value=0.36 Score=27.57 Aligned_cols=17 Identities=53% Similarity=0.882 Sum_probs=15.3 Q ss_pred EEEEECCCCCCCHHHHH Q ss_conf 36552477886115899 Q gi|254780617|r 164 RELIIGDRKTGKTSIIL 180 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~ 180 (509) +.+|+||+|+|||+|++ T Consensus 10 kllIigDsgVGKssLl~ 26 (198) T KOG0079 10 KLLIIGDSGVGKSSLLL 26 (198) T ss_pred HHHEECCCCCCHHHHHH T ss_conf 88832687645789999 No 496 >COG4167 SapF ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms] Probab=83.69 E-value=0.75 Score=25.25 Aligned_cols=29 Identities=24% Similarity=0.417 Sum_probs=24.4 Q ss_pred CCCCCCCCCEEEEECCCCCCCHHHHHHHHH Q ss_conf 011115683365524778861158999999 Q gi|254780617|r 155 SLIPIGRGQRELIIGDRKTGKTSIILDTFL 184 (509) Q Consensus 155 ~l~pigrGQR~~I~g~~g~GKt~l~~~~I~ 184 (509) .-+++-+||..+|+|..|.|||+|+. ||. T Consensus 32 vSFtL~~~QTlaiIG~NGSGKSTLak-Mla 60 (267) T COG4167 32 VSFTLREGQTLAIIGENGSGKSTLAK-MLA 60 (267) T ss_pred EEEEECCCCEEEEECCCCCCHHHHHH-HHH T ss_conf 57896079679998269974758999-983 No 497 >cd04177 RSR1 RSR1 subgroup. RSR1/Bud1p is a member of the Rap subfamily of the Ras family that is found in fungi. In budding yeasts, RSR1 is involved in selecting a site for bud growth on the cell cortex, which directs the establishment of cell polarization. The Rho family GTPase cdc42 and its GEF, cdc24, then establish an axis of polarized growth by organizing the actin cytoskeleton and secretory apparatus at the bud site. It is believed that cdc42 interacts directly with RSR1 in vivo. In filamentous fungi, polar growth occurs at the tips of hypha and at novel growth sites along the extending hypha. In Ashbya gossypii, RSR1 is a key regulator of hyphal growth, localizing at the tip region and regulating in apical polarization of the actin cytoskeleton. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key featu Probab=83.65 E-value=0.71 Score=25.43 Aligned_cols=23 Identities=17% Similarity=0.404 Sum_probs=0.0 Q ss_pred EEEEECCCCCCCHHHHHHHHHHH Q ss_conf 36552477886115899999998 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq 186 (509) |+.++|++|||||+|+.-.+-++ T Consensus 3 KivlvGd~~VGKTsli~rf~~~~ 25 (168) T cd04177 3 KIVVLGAGGVGKSALTVQFVQNV 25 (168) T ss_pred EEEEECCCCCCHHHHHHHHHHCC T ss_conf 99998999977999999996193 No 498 >COG4618 ArpD ABC-type protease/lipase transport system, ATPase and permease components [General function prediction only] Probab=83.64 E-value=0.77 Score=25.19 Aligned_cols=23 Identities=26% Similarity=0.463 Sum_probs=0.0 Q ss_pred CCCCCCCEEEEECCCCCCCHHHH Q ss_conf 11156833655247788611589 Q gi|254780617|r 157 IPIGRGQRELIIGDRKTGKTSII 179 (509) Q Consensus 157 ~pigrGQR~~I~g~~g~GKt~l~ 179 (509) +.+..|--.+|+|++|.|||||+ T Consensus 357 F~l~~G~~lgIIGPSgSGKSTLa 379 (580) T COG4618 357 FALQAGEALGIIGPSGSGKSTLA 379 (580) T ss_pred EEECCCCEEEEECCCCCCHHHHH T ss_conf 67658866788788876577899 No 499 >cd04116 Rab9 Rab9 subfamily. Rab9 is found in late endosomes, together with mannose 6-phosphate receptors (MPRs) and the tail-interacting protein of 47 kD (TIP47). Rab9 is a key mediator of vesicular transport from late endosomes to the trans-Golgi network (TGN) by redirecting the MPRs. Rab9 has been identified as a key component for the replication of several viruses, including HIV1, Ebola, Marburg, and measles, making it a potential target for inhibiting a variety of viruses. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CX Probab=83.64 E-value=0.69 Score=25.51 Aligned_cols=23 Identities=26% Similarity=0.524 Sum_probs=0.0 Q ss_pred EEEEECCCCCCCHHHHHHHHHHH Q ss_conf 36552477886115899999998 Q gi|254780617|r 164 RELIIGDRKTGKTSIILDTFLNQ 186 (509) Q Consensus 164 R~~I~g~~g~GKt~l~~~~I~nq 186 (509) |+.++|++|||||+|+.-.+.|+ T Consensus 7 KivvlGd~~VGKTsli~r~~~~~ 29 (170) T cd04116 7 KVILLGDGGVGKSSLMNRYVTNK 29 (170) T ss_pred EEEEECCCCCCHHHHHHHHHHCC T ss_conf 99999999978999999997398 No 500 >PRK08939 primosomal protein DnaI; Reviewed Probab=83.57 E-value=2.2 Score=21.85 Aligned_cols=46 Identities=17% Similarity=0.226 Sum_probs=0.0 Q ss_pred HHCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEE Q ss_conf 3101111568336552477886115899999998863024667876069999 Q gi|254780617|r 153 IDSLIPIGRGQRELIIGDRKTGKTSIILDTFLNQKSSHDKGSEKDKVYCIYV 204 (509) Q Consensus 153 ID~l~pigrGQR~~I~g~~g~GKt~l~~~~I~nq~~~~~~~~~~~~v~~V~~ 204 (509) +....|=..++=..|.|+.|||||-| +..|.|........ |+.|++ T Consensus 148 ~~~y~~~~~~kGlyl~G~~G~GKTyL-~~aian~La~~g~~-----v~~v~~ 193 (306) T PRK08939 148 LEAYKPGEKVKGLYLYGDFGVGKTYL-LAAIANELAKKGVS-----STLVHF 193 (306) T ss_pred HHHHCCCCCCCEEEEECCCCCCHHHH-HHHHHHHHHHCCCE-----EEEEEH T ss_conf 99737698887788989999989999-99999999986992-----999875 Done!