Query gi|254780561|ref|YP_003064974.1| thiamine transporter substrate binding subunit [Candidatus Liberibacter asiaticus str. psy62] Match_columns 338 No_of_seqs 180 out of 4246 Neff 11.4 Searched_HMMs 39220 Date Sun May 29 23:31:30 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780561.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK11205 tbpA thiamine transpo 100.0 0 0 385.9 30.7 318 19-337 9-326 (326) 2 COG4143 TbpA ABC-type thiamine 100.0 0 0 348.7 27.3 333 1-337 1-336 (336) 3 TIGR03227 PhnS 2-aminoethylpho 100.0 0 0 339.9 29.6 318 15-336 22-354 (367) 4 TIGR03261 phnS2 putative 2-ami 100.0 0 0 336.5 30.7 320 7-337 2-331 (334) 5 PRK09501 potD spermidine/putre 100.0 0 0 296.8 26.5 284 1-298 1-300 (348) 6 TIGR01254 sfuA ABC transporter 100.0 0 0 301.6 19.6 304 28-331 1-324 (324) 7 TIGR01276 thiB thiamin/thiamin 100.0 0 0 300.9 10.8 312 26-337 12-326 (326) 8 PRK10682 putrescine transporte 100.0 1.1E-43 0 273.0 25.6 288 1-298 1-321 (370) 9 COG0687 PotD Spermidine/putres 100.0 1.6E-41 0 260.4 25.1 288 1-298 2-314 (363) 10 COG1840 AfuA ABC-type Fe3+ tra 100.0 3.4E-33 8.6E-38 211.8 24.8 285 50-337 1-298 (299) 11 PRK11622 putative ABC transpor 100.0 2.4E-27 6.2E-32 177.6 24.9 299 1-300 1-352 (401) 12 PRK09474 malE maltose ABC tran 100.0 3.9E-26 9.8E-31 170.6 22.7 280 1-289 1-321 (394) 13 COG2182 MalE Maltose-binding p 99.9 5.9E-21 1.5E-25 140.4 17.6 292 1-298 4-347 (420) 14 PRK10752 sulfate transporter s 99.9 3.6E-20 9.1E-25 135.8 20.5 272 25-316 17-303 (329) 15 COG0725 ModA ABC-type molybdat 99.9 1.8E-19 4.5E-24 131.8 23.3 255 1-291 1-255 (258) 16 PRK10974 glycerol-3-phosphate 99.9 4.8E-20 1.2E-24 135.1 19.0 291 1-299 3-356 (439) 17 PRK10677 modA molybdate transp 99.9 5.4E-19 1.4E-23 129.0 23.4 248 2-291 3-255 (257) 18 PRK10852 thiosulfate transport 99.9 2.9E-19 7.3E-24 130.6 21.3 292 1-316 1-309 (338) 19 COG4134 ABC-type uncharacteriz 99.9 5E-20 1.3E-24 135.0 17.2 327 1-330 1-379 (384) 20 PRK04168 hypothetical protein; 99.8 1.5E-18 3.9E-23 126.3 22.1 263 7-295 9-319 (336) 21 COG1613 Sbp ABC-type sulfate t 99.8 2.1E-18 5.3E-23 125.5 20.3 281 16-316 21-320 (348) 22 COG1653 UgpB ABC-type sugar tr 99.8 8.1E-18 2.1E-22 122.1 20.8 297 1-299 1-358 (433) 23 pfam01547 SBP_bac_1 Bacterial 99.7 3.4E-16 8.6E-21 112.7 13.8 209 42-283 5-260 (260) 24 TIGR01256 modA molybdate ABC t 99.6 1.2E-13 3.2E-18 97.7 14.7 211 45-288 6-224 (225) 25 COG4588 AcfC Accessory coloniz 99.5 4.6E-11 1.2E-15 82.7 21.4 245 1-291 1-251 (252) 26 COG4150 CysP ABC-type sulfate 99.4 5.9E-12 1.5E-16 87.9 12.1 266 1-290 6-290 (341) 27 TIGR00971 3a0106s03 sulfate AB 99.4 6.5E-12 1.7E-16 87.7 11.2 249 39-307 20-289 (320) 28 COG2998 TupB ABC-type tungstat 99.3 1.8E-10 4.6E-15 79.3 15.8 229 26-288 26-261 (280) 29 PRK03537 hypothetical protein; 98.9 5.4E-08 1.4E-12 64.8 11.9 144 144-293 32-185 (188) 30 PRK09861 cytoplasmic membrane 98.8 7.4E-06 1.9E-10 52.3 21.8 253 1-289 1-264 (272) 31 COG3221 PhnD ABC-type phosphat 98.8 6E-06 1.5E-10 52.9 19.4 257 1-288 1-270 (299) 32 PRK11063 metQ DL-methionine tr 98.8 1.2E-05 3.1E-10 51.1 21.6 248 6-289 8-263 (271) 33 COG0226 PstS ABC-type phosphat 98.5 1.1E-05 2.7E-10 51.4 13.7 239 30-297 43-316 (318) 34 TIGR01098 3A0109s03R phosphona 98.5 1.2E-05 3.1E-10 51.1 13.7 257 1-278 1-298 (299) 35 TIGR03431 PhnD phosphonate ABC 98.5 9E-05 2.3E-09 46.0 20.1 250 1-286 1-260 (288) 36 PRK09495 glnH glutamine ABC tr 98.2 0.00041 1.1E-08 42.2 19.7 206 1-264 1-216 (247) 37 PRK11260 cystine transporter s 98.1 0.00055 1.4E-08 41.5 20.5 222 1-276 1-241 (264) 38 PRK10797 glutamate and asparta 98.0 0.0011 2.8E-08 39.7 20.7 245 1-296 3-276 (302) 39 PRK11553 alkanesulfonate trans 97.9 0.0015 3.8E-08 39.0 18.8 229 1-278 1-235 (314) 40 COG1464 NlpA ABC-type metal io 97.8 0.0021 5.4E-08 38.0 19.0 251 1-289 1-260 (268) 41 PRK10859 putative transglycosy 97.7 0.0035 8.9E-08 36.8 16.8 213 28-289 41-265 (507) 42 PRK11480 tauA taurine transpor 97.6 0.0046 1.2E-07 36.1 16.7 213 19-277 12-229 (320) 43 COG1732 OpuBC Periplasmic glyc 97.2 0.0067 1.7E-07 35.1 10.1 248 1-271 1-262 (300) 44 pfam02030 Lipoprotein_8 Hypoth 97.1 0.017 4.4E-07 32.7 12.8 256 1-277 1-334 (493) 45 PRK10918 phosphate transporter 96.9 0.026 6.5E-07 31.7 18.4 73 1-80 1-73 (346) 46 COG3181 Uncharacterized protei 96.5 0.051 1.3E-06 30.0 21.2 236 29-290 27-287 (319) 47 COG2113 ProX ABC-type proline/ 96.2 0.072 1.8E-06 29.1 15.0 99 1-105 1-103 (302) 48 PRK09701 D-allose transporter 96.0 0.068 1.7E-06 29.2 8.3 61 1-69 1-64 (311) 49 pfam04069 OpuAC Substrate bind 95.6 0.14 3.7E-06 27.3 15.4 226 30-287 1-237 (256) 50 pfam03401 Bug Tripartite trica 95.3 0.18 4.6E-06 26.8 20.7 245 48-337 3-271 (274) 51 pfam09084 NMT1 NMT1/THI5 like. 94.6 0.28 7.2E-06 25.7 13.8 189 49-277 10-203 (216) 52 PRK11119 proX glycine betaine 93.3 0.49 1.3E-05 24.2 19.6 76 209-285 194-291 (332) 53 COG0715 TauA ABC-type nitrate/ 93.3 0.49 1.3E-05 24.2 19.2 188 48-277 48-244 (335) 54 PRK06760 hypothetical protein; 92.1 0.19 4.8E-06 26.7 3.6 86 1-110 1-92 (223) 55 TIGR03414 ABC_choline_bnd chol 92.0 0.74 1.9E-05 23.2 9.7 221 28-287 6-243 (290) 56 PRK12450 foldase protein PrsA; 92.0 0.73 1.9E-05 23.2 6.6 15 49-63 63-77 (309) 57 PRK01326 prsA foldase protein 91.3 0.49 1.2E-05 24.3 5.0 63 1-63 1-75 (310) 58 PRK10653 D-ribose transporter 90.6 1 2.6E-05 22.4 8.6 62 1-70 3-67 (295) 59 PRK03002 prsA peptidylprolyl i 90.1 1 2.5E-05 22.5 5.8 24 1-24 1-24 (285) 60 PRK10598 hypothetical protein; 89.8 0.54 1.4E-05 24.0 4.2 47 1-61 1-49 (186) 61 COG1744 Med Uncharacterized AB 89.2 1.3 3.4E-05 21.7 10.4 100 1-102 1-114 (345) 62 PRK11917 bifunctional adhesin/ 89.1 1.4 3.5E-05 21.7 20.3 215 7-276 8-243 (259) 63 PRK09859 multidrug efflux syst 87.2 0.65 1.7E-05 23.5 3.2 26 1-26 1-26 (385) 64 PRK09558 ushA bifunctional UDP 87.2 1.2 3.2E-05 21.9 4.6 37 1-37 1-40 (551) 65 PRK04405 prsA peptidylprolyl i 86.4 2 5.1E-05 20.7 5.6 22 1-23 5-26 (298) 66 TIGR03427 ABC_peri_uca ABC tra 85.0 2.4 6E-05 20.3 19.7 184 48-274 23-209 (328) 67 pfam03466 LysR_substrate LysR 84.9 2.4 6.1E-05 20.3 17.2 200 27-277 3-205 (209) 68 COG4259 Uncharacterized protei 84.2 2.5 6.5E-05 20.1 5.0 35 1-41 1-36 (121) 69 PRK02998 prsA peptidylprolyl i 83.7 2.7 6.9E-05 19.9 6.4 23 1-23 1-24 (283) 70 PRK10833 putative assembly pro 82.9 2.9 7.4E-05 19.7 5.7 49 1-64 1-49 (617) 71 PRK06481 fumarate reductase fl 82.5 1.7 4.3E-05 21.1 3.5 25 1-25 1-25 (506) 72 pfam06316 Ail_Lom Enterobacter 81.8 1.3 3.4E-05 21.7 2.8 54 1-66 1-57 (199) 73 pfam07273 DUF1439 Protein of u 81.0 2.6 6.6E-05 20.0 4.1 45 1-60 1-47 (177) 74 pfam05170 AsmA AsmA family. Th 80.2 3.6 9.2E-05 19.2 4.8 49 1-64 1-49 (537) 75 PRK03095 prsA peptidylprolyl i 79.8 3.7 9.5E-05 19.1 6.1 22 1-24 1-22 (287) 76 COG4521 TauA ABC-type taurine 79.5 3.4 8.7E-05 19.3 4.3 185 52-277 49-237 (334) 77 pfam10566 Glyco_hydro_97 Glyco 77.7 3.1 7.9E-05 19.6 3.6 23 1-23 1-23 (643) 78 PRK06934 flavodoxin; Provision 76.1 4.8 0.00012 18.5 6.9 35 1-37 9-44 (221) 79 pfam08085 Entericidin Enterici 75.6 2.5 6.4E-05 20.1 2.7 21 1-21 1-21 (42) 80 COG2984 ABC-type uncharacteriz 75.3 5.1 0.00013 18.3 10.4 76 25-101 26-104 (322) 81 PRK02944 OxaA-like protein pre 75.1 3.7 9.4E-05 19.1 3.4 25 1-25 1-25 (255) 82 PRK10510 putative outer membra 73.7 3.7 9.3E-05 19.2 3.1 29 1-29 1-29 (219) 83 pfam03202 Lipoprotein_10 Putat 73.6 2.5 6.3E-05 20.2 2.2 28 253-280 5-32 (134) 84 TIGR02036 dsdC D-serine deamin 73.1 1.8 4.5E-05 21.0 1.4 192 29-278 95-299 (302) 85 COG3019 Predicted metal-bindin 72.6 5.9 0.00015 17.9 7.6 66 1-75 1-66 (149) 86 PRK11143 glpQ glycerophosphodi 72.1 6.1 0.00015 17.9 6.1 56 1-56 1-57 (359) 87 pfam05590 DUF769 Xylella fasti 71.2 4.1 0.00011 18.9 2.9 26 1-26 1-26 (279) 88 TIGR03042 PS_II_psbQ_bact phot 69.9 6.8 0.00017 17.6 4.0 49 1-58 1-49 (142) 89 pfam07901 DUF1672 Protein of u 69.9 6.8 0.00017 17.6 4.5 21 1-24 1-21 (304) 90 COG5567 Predicted small peripl 69.6 4.4 0.00011 18.7 2.8 22 1-23 1-22 (58) 91 COG5510 Predicted small secret 69.0 4.8 0.00012 18.5 2.8 22 1-22 2-23 (44) 92 PRK02710 plastocyanin; Provisi 68.0 7.5 0.00019 17.3 3.7 35 1-36 1-35 (119) 93 pfam03180 Lipoprotein_9 NLPA l 66.2 8.1 0.00021 17.1 17.5 211 43-289 10-228 (236) 94 PRK12684 transcriptional regul 66.0 8.2 0.00021 17.1 16.4 207 27-285 90-299 (313) 95 PRK12682 transcriptional regul 65.6 8.4 0.00021 17.1 14.9 209 27-287 90-301 (309) 96 PRK10802 peptidoglycan-associa 65.2 7.4 0.00019 17.4 3.2 23 1-24 3-25 (173) 97 pfam06097 DUF945 Bacterial pro 65.1 5.5 0.00014 18.1 2.5 53 1-68 1-56 (460) 98 PRK02515 psbU photosystem II c 64.2 7.9 0.0002 17.2 3.2 25 1-25 1-25 (144) 99 COG1638 DctP TRAP-type C4-dica 63.9 9 0.00023 16.9 6.9 83 1-91 1-91 (332) 100 PRK12473 hypothetical protein; 63.5 9.2 0.00024 16.8 4.8 85 2-110 1-92 (193) 101 TIGR01047 nspC carboxynorsperm 62.6 9.1 0.00023 16.8 3.3 32 35-69 223-255 (403) 102 PRK10866 outer membrane protei 62.4 9.7 0.00025 16.7 3.4 23 1-24 1-23 (243) 103 PRK10355 xylF D-xylose transpo 62.1 9.8 0.00025 16.7 9.2 71 24-98 20-96 (330) 104 PRK02463 OxaA-like protein pre 61.6 2 5.2E-05 20.7 -0.2 25 270-294 244-268 (307) 105 TIGR03659 IsdE heme ABC transp 60.6 9.1 0.00023 16.9 3.0 24 2-26 1-24 (289) 106 pfam12262 Lipase_bact_N Bacter 59.0 8.6 0.00022 17.0 2.6 24 1-26 1-24 (269) 107 PRK10449 heat-inducible protei 58.8 5.2 0.00013 18.3 1.5 21 1-25 1-21 (140) 108 smart00062 PBPb Bacterial peri 58.3 11 0.00029 16.3 15.3 178 44-275 23-201 (219) 109 PRK10936 periplasmic sensory p 57.8 12 0.0003 16.2 6.9 47 44-91 57-109 (340) 110 PRK11242 DNA-binding transcrip 57.5 12 0.0003 16.2 18.2 198 27-278 88-289 (292) 111 PRK12683 transcriptional regul 56.6 12 0.00031 16.1 14.9 207 28-285 91-300 (308) 112 PRK10341 DNA-binding transcrip 55.7 13 0.00032 16.0 17.2 199 29-277 96-295 (312) 113 TIGR01729 taurine_ABC_bnd taur 55.3 10 0.00026 16.6 2.5 196 49-288 19-221 (304) 114 PRK09534 btuF corrinoid ABC tr 54.6 13 0.00034 15.9 3.2 33 248-280 276-308 (364) 115 pfam09358 UBA_e1_C Ubiquitin-a 54.1 13 0.00034 15.9 4.0 39 31-69 28-67 (124) 116 TIGR02896 spore_III_AF stage I 54.0 13 0.00034 15.9 2.9 21 1-21 29-49 (113) 117 PRK09915 putative outer membra 52.9 11 0.00029 16.3 2.4 24 1-26 6-29 (488) 118 COG2358 Imp TRAP-type uncharac 52.1 15 0.00037 15.7 18.4 248 12-280 7-278 (321) 119 TIGR02747 TraV type IV conjuga 51.8 10 0.00026 16.5 2.1 22 1-23 1-22 (174) 120 PRK12679 cbl transcriptional r 51.6 15 0.00038 15.6 17.1 204 26-281 89-299 (316) 121 PRK09791 putative DNA-binding 51.0 15 0.00038 15.6 18.4 203 27-278 92-295 (302) 122 PRK09408 ompX outer membrane p 50.8 15 0.00039 15.5 2.9 52 1-66 1-53 (171) 123 PRK11627 hypothetical protein; 50.6 9 0.00023 16.9 1.6 26 1-34 2-27 (192) 124 TIGR03502 lipase_Pla1_cef extr 50.2 14 0.00035 15.8 2.5 22 1-24 1-22 (792) 125 PRK10270 hypothetical protein; 49.9 16 0.0004 15.5 2.8 21 1-21 1-21 (340) 126 PRK11548 hypothetical protein; 47.7 16 0.00042 15.4 2.5 23 1-24 1-23 (113) 127 COG4851 CamS Protein involved 47.2 15 0.00038 15.6 2.3 23 266-290 281-303 (382) 128 PRK10477 outer membrane lipopr 46.4 16 0.0004 15.5 2.3 66 1-68 1-68 (177) 129 COG3338 Cah Carbonic anhydrase 46.2 11 0.00028 16.4 1.5 50 1-58 1-50 (250) 130 cd00134 PBPb Bacterial peripla 45.2 19 0.00047 15.0 15.8 187 45-285 23-214 (218) 131 PRK09578 periplasmic multidrug 44.8 19 0.00048 15.0 3.1 23 4-26 6-28 (385) 132 COG4213 XylF ABC-type xylose t 44.6 19 0.00049 15.0 5.8 74 24-99 20-97 (341) 133 CHL00180 rbcR LysR transcripti 43.3 20 0.00051 14.9 15.8 206 28-278 93-301 (307) 134 TIGR03525 GldK gliding motilit 42.5 16 0.00042 15.3 1.9 26 1-33 1-26 (449) 135 pfam05481 Myco_19_kDa Mycobact 41.6 7.7 0.0002 17.3 0.1 24 1-24 1-25 (160) 136 PRK09967 putative outer membra 41.1 22 0.00055 14.7 3.0 22 1-24 1-22 (160) 137 pfam08139 LPAM_1 Prokaryotic m 41.0 15 0.00039 15.6 1.5 20 1-24 1-20 (26) 138 TIGR00439 ftsX putative protei 39.0 23 0.0006 14.5 3.3 61 27-92 67-138 (314) 139 pfam02402 Lysis_col Lysis prot 38.9 16 0.0004 15.5 1.3 20 1-22 1-20 (46) 140 PRK12680 transcriptional regul 38.1 24 0.00062 14.4 7.3 200 26-277 89-292 (327) 141 pfam06646 Mycoplasma_p37 High 36.3 26 0.00066 14.2 4.7 45 44-92 54-108 (383) 142 PRK11138 outer membrane protei 36.1 26 0.00066 14.2 3.1 23 1-24 1-23 (394) 143 TIGR01533 lipo_e_P4 5'-nucleot 35.7 26 0.00067 14.2 2.2 33 1-33 1-36 (295) 144 TIGR01326 OAH_OAS_sulfhy O-ace 35.3 27 0.00068 14.1 3.2 72 34-110 100-183 (434) 145 pfam06291 Lambda_Bor Bor prote 34.9 20 0.0005 14.9 1.3 20 1-24 1-20 (97) 146 cd06353 PBP1_BmpA_Med_like Per 34.7 27 0.0007 14.1 6.1 51 51-102 23-74 (258) 147 COG1512 Beta-propeller domains 33.9 28 0.00072 14.0 6.2 29 42-70 46-74 (271) 148 PRK13684 Ycf48-like protein; P 33.5 29 0.00073 13.9 3.3 25 1-25 1-25 (333) 149 PRK13883 conjugal transfer pro 33.5 24 0.00061 14.4 1.6 18 1-22 1-18 (156) 150 PRK09465 tolC outer membrane c 33.5 27 0.0007 14.1 1.9 13 1-13 1-13 (446) 151 PRK12696 flgH flagellar basal 33.0 17 0.00044 15.2 0.8 19 1-23 2-20 (238) 152 COG5294 Uncharacterized protei 32.9 29 0.00075 13.9 2.5 14 1-14 1-14 (113) 153 PRK10760 murein hydrolase B; P 32.3 30 0.00077 13.8 4.1 15 47-61 58-72 (357) 154 COG4594 FecB ABC-type Fe3+-cit 32.2 30 0.00077 13.8 4.0 27 1-28 3-29 (310) 155 COG2913 OlmA Outer membrane li 32.1 30 0.00077 13.8 2.1 24 1-24 3-26 (147) 156 pfam09508 Lact_bio_phlase Lact 32.0 28 0.00072 14.0 1.7 28 35-64 227-254 (716) 157 PRK01622 OxaA-like protein pre 32.0 31 0.00078 13.8 3.3 24 1-24 2-26 (266) 158 pfam09581 Spore_III_AF Stage I 31.2 31 0.0008 13.7 2.6 22 1-22 19-40 (185) 159 PRK12700 flgH flagellar basal 31.1 32 0.0008 13.7 2.7 22 1-23 3-24 (230) 160 pfam05079 DUF680 Protein of un 30.7 24 0.0006 14.4 1.1 20 1-20 1-20 (73) 161 PRK11189 lipoprotein NlpI; Pro 30.7 32 0.00082 13.7 3.1 24 1-25 1-24 (297) 162 COG4880 Secreted protein conta 30.3 33 0.00083 13.6 3.0 27 1-27 1-28 (603) 163 PRK12681 cysB transcriptional 30.2 33 0.00083 13.6 5.3 198 27-276 90-291 (324) 164 PRK05362 phosphopentomutase; P 30.2 33 0.00083 13.6 2.1 46 45-90 110-156 (393) 165 PRK11443 hypothetical protein; 29.9 31 0.00079 13.7 1.6 29 1-34 1-29 (120) 166 COG3317 NlpB Uncharacterized l 29.8 33 0.00085 13.6 2.7 25 1-26 1-25 (342) 167 PRK10363 cpxP periplasmic repr 29.7 33 0.00085 13.6 3.5 22 1-22 1-22 (166) 168 PRK12751 cpxP periplasmic stre 29.6 34 0.00086 13.5 3.5 22 1-22 1-22 (162) 169 PRK00059 prsA peptidylprolyl i 29.3 34 0.00087 13.5 6.0 24 1-24 1-24 (336) 170 COG4238 Murein lipoprotein [Ce 29.1 30 0.00076 13.9 1.4 24 1-24 1-24 (78) 171 PRK09973 putative outer membra 29.0 29 0.00073 13.9 1.3 24 1-25 1-24 (85) 172 COG2980 RlpB Rare lipoprotein 28.6 35 0.00089 13.4 5.4 58 1-64 1-64 (178) 173 pfam11105 CCAP Arthropod cardi 28.2 35 0.0009 13.4 3.0 72 1-78 1-74 (133) 174 COG4314 NosL Predicted lipopro 28.1 36 0.00091 13.4 2.7 22 1-25 1-22 (176) 175 PRK12701 flgH flagellar basal 27.8 36 0.00092 13.4 2.5 21 1-23 1-21 (230) 176 PRK11546 zraP zinc resistance 27.7 36 0.00093 13.3 2.9 61 1-63 1-65 (139) 177 pfam04076 BOF Bacterial OB fol 27.2 37 0.00094 13.3 4.5 20 1-20 1-20 (126) 178 PRK09906 DNA-binding transcrip 27.1 37 0.00095 13.3 18.5 197 28-277 88-288 (296) 179 COG5633 Predicted periplasmic 27.0 37 0.00095 13.3 1.8 21 1-25 1-21 (123) 180 PRK00031 lolA outer-membrane l 26.8 38 0.00096 13.3 2.5 13 1-13 1-13 (201) 181 PRK10837 putative DNA-binding 26.1 39 0.00099 13.2 16.9 202 27-278 86-288 (291) 182 cd06354 PBP1_BmpA_PnrA_like Pe 26.1 39 0.00099 13.2 6.3 50 51-101 24-73 (265) 183 TIGR03524 GldJ gliding motilit 26.0 39 0.00099 13.2 2.9 22 1-22 3-24 (559) 184 PRK11087 hypothetical protein; 25.7 40 0.001 13.1 3.5 36 1-36 1-38 (238) 185 PRK13697 cytochrome c6; Provis 25.6 40 0.001 13.1 2.2 16 1-16 1-16 (111) 186 PRK10954 periplasmic protein d 25.3 40 0.001 13.1 2.5 16 1-16 1-16 (207) 187 pfam11777 DUF3316 Protein of u 25.2 40 0.001 13.1 3.2 33 1-34 1-33 (114) 188 PRK10907 intramembrane serine 25.0 41 0.001 13.1 1.8 10 55-64 20-29 (276) 189 PRK11615 hypothetical protein; 24.3 42 0.0011 13.0 2.6 22 1-22 1-22 (185) 190 PRK11679 lipoprotein; Provisio 24.0 43 0.0011 12.9 2.4 22 1-22 6-27 (345) 191 PRK11013 DNA-binding transcrip 23.9 43 0.0011 12.9 15.5 205 28-286 92-299 (309) 192 PRK06707 amidase; Provisional 23.6 32 0.00083 13.6 0.8 53 1-63 1-59 (536) 193 PRK10455 periplasmic protein; 23.4 44 0.0011 12.9 2.9 19 1-19 1-19 (161) 194 COG3417 FlgN Collagen-binding 23.1 44 0.0011 12.8 3.0 22 1-23 1-22 (200) 195 PRK03598 hypothetical protein; 23.1 44 0.0011 12.8 2.8 21 1-21 1-21 (331) 196 COG4704 Uncharacterized protei 22.8 45 0.0011 12.8 3.0 35 1-35 1-35 (151) 197 PRK13528 outer membrane recept 22.3 46 0.0012 12.7 2.9 24 1-24 3-26 (727) 198 PRK13733 conjugal transfer pro 21.9 9 0.00023 16.9 -2.3 24 1-26 1-24 (171) 199 TIGR00658 orni_carb_tr ornithi 21.8 47 0.0012 12.7 8.1 34 31-65 191-224 (341) 200 TIGR01347 sucB 2-oxoglutarate 21.8 47 0.0012 12.7 1.7 26 252-277 303-332 (435) 201 pfam04507 DUF576 Protein of un 21.6 43 0.0011 12.9 1.1 24 1-24 1-26 (257) 202 pfam06551 DUF1120 Protein of u 21.4 48 0.0012 12.6 3.6 31 1-34 1-32 (145) 203 COG3470 Tpd Uncharacterized pr 21.0 49 0.0013 12.6 3.1 25 1-26 1-25 (179) 204 COG5298 Uncharacterized protei 20.9 49 0.0013 12.6 4.7 56 1-56 1-58 (530) 205 TIGR02346 chap_CCT_theta T-com 20.9 49 0.0013 12.6 3.9 57 41-102 272-328 (554) 206 PRK13484 putative iron-regulat 20.9 49 0.0013 12.6 2.9 17 47-63 64-80 (682) 207 pfam06649 DUF1161 Protein of u 20.4 50 0.0013 12.5 2.2 16 1-16 1-16 (75) 208 PRK08605 D-lactate dehydrogena 20.4 50 0.0013 12.5 4.8 41 31-77 3-43 (332) 209 PRK09934 putative fimbrial pro 20.4 50 0.0013 12.5 3.3 13 1-13 1-13 (171) 210 PRK09810 entericidin A; Provis 20.0 51 0.0013 12.5 1.9 19 1-22 2-20 (41) No 1 >PRK11205 tbpA thiamine transporter substrate binding subunit; Provisional Probab=100.00 E-value=0 Score=385.93 Aligned_cols=318 Identities=59% Similarity=1.091 Sum_probs=293.9 Q ss_pred HHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHH Q ss_conf 86146742348859999788778512233889999998848879999728868999788753126641011013337899 Q gi|254780561|r 19 SYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLID 98 (338) Q Consensus 19 ~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~ 98 (338) ..|+.+ +.++++|||||+.||..++...+.|+++|||+|||||+++..++++++++|+++++.++++||+++.+...+. T Consensus 9 ~~~a~~-~~~~~~l~vyt~dsf~~d~g~~~~l~~~Fek~tGikV~~v~~~dg~~ll~Rl~~Eg~np~ADVv~gvD~~~l~ 87 (326) T PRK11205 9 LLCAAP-AFAKPVLTVYTYDSFAADWGPGPAVKKAFEAECGCELKFVALEDGVSLLNRLRLEGKNSKADVVLGLDNNLLA 87 (326) T ss_pred HHHCCC-CCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHCCCCCCCCEEEECCHHHHH T ss_conf 986362-0029879999665444466864799999989879689999679679999999973899976589981767899 Q ss_pred HHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHH Q ss_conf 98875212613677000210001243211532431000110011124587645652120006828985047653036789 Q gi|254780561|r 99 LARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLL 178 (338) Q Consensus 99 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 178 (338) .+.+.+++.|+.+.......+.......+.++..++.+++||++++..+|+||.||++++|+|++++.++..+..+..++ T Consensus 88 ~A~~~gl~~p~~s~~~~~~ip~~~~d~~~vp~d~g~~a~vYn~~~~~~~P~S~~DL~~p~~kgki~i~dP~tSs~g~a~l 167 (326) T PRK11205 88 AAKKTGLFAPSGVDTSAVTVPGGWNDDTFVPYDYGYFAFVYDKEKLKNPPKSLKELVESDQNWKVIYQDPRTSTPGLGLL 167 (326) T ss_pred HHHHCCCCCCCCCCHHHCCCCCCCCCCCEECCCCCEEEEEEEHHHCCCCCCCHHHHHCCHHCCCEEECCCCCCCCHHHHH T ss_conf 99986996525686110248600269852113476589998422237887698987273303886633888785279999 Q ss_pred HHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEE Q ss_conf 99987177313789877762884683084789888622586569995044999998507997378831698088886424 Q gi|254780561|r 179 LWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAA 258 (338) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~ 258 (338) .++...++++..+++.+++.|..++.+++.+....+..|+++++++|..+..++...++........+++|++..+++++ T Consensus 168 ~~~i~~~Ge~~~e~W~~L~~N~~~~~~gws~a~~~~~~Ge~~ivlsy~~spa~~~~~~~~~~~~~~~~~eG~~~~iegag 247 (326) T PRK11205 168 LWMKKVYGDDAPQAWQKLAKKTVTVTKGWSEAYGLFLKGEADLVLSYTTSPAYHIIAEKKDNYAAANFSEGHYLQVEVAA 247 (326) T ss_pred HHHHHHHCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCEEEEEECCHHHHHHHCCCCCCCEEECCCCCEEEEEEEE T ss_conf 99999716328999999986384447883388887644767579985147787764136665422445887277766789 Q ss_pred EECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 3378999899999999963999999988504505055765882134122322234789899999999999999998741 Q gi|254780561|r 259 QLVRSKQPQLAQEFMQFMISPSFQRILPTTNWMYPVVDISMPDVYQAIRIPQKSLRFNAEETTRYRNQWISSWQNAVSR 337 (338) Q Consensus 259 i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~W~~~v~~ 337 (338) |+|+++|+++|++|||||+|+|+|+.+++.+++|||++..+|+.+.....|.+...++++.++++|++|+++|+++|+| T Consensus 248 I~k~akn~e~A~kFidfllS~eaQ~~~~~~n~~yPV~~~~lP~~~~~~~~p~~~~~~~~~~i~~~~~~w~~~W~~~v~r 326 (326) T PRK11205 248 RTAASKQPELAQKFLQFMVSPAFQNAIPTGNWMYPVAKVTLPAGFETLTKPATTLEFTPEEVAANRKAWISEWQRAVSR 326 (326) T ss_pred EECCCCCHHHHHHHHHHHCCHHHHHHHHHHCEEEECCCCCCCHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 8379999899999999975899999864405445248887976477457986454579999999999999999998619 No 2 >COG4143 TbpA ABC-type thiamine transport system, periplasmic component [Coenzyme metabolism] Probab=100.00 E-value=0 Score=348.73 Aligned_cols=333 Identities=49% Similarity=0.922 Sum_probs=301.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHH Q ss_conf 90279999999999999986146742348859999788778512233889999998848879999728868999788753 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRME 80 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~ 80 (338) |+|+.+.+. ++++ ++++|.+ +.++.++|+||+++|+..++...+++.++||++|+|+|+++..+++.++++|++.+ T Consensus 1 m~~~l~~~~--~~al-l~~~~~~-a~~~t~~LtVytydSF~~ewg~Gp~vk~~FE~~~~~~v~fV~~~d~v~llnRl~le 76 (336) T COG4143 1 MRRLLRALI--GLAL-LVSAALG-AQAATPTLTVYTYDSFASEWGPGPKVKKAFEAEYGCKVNFVALGDGVELLNRLILE 76 (336) T ss_pred CHHHHHHHH--HHHH-HHHHHHH-HHHCCCEEEEEEEEEEECCCCCCHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHC T ss_conf 902499999--9999-9998657-76348538999976342256885789999999859528999707589999999970 Q ss_pred HCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCC-CCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCC Q ss_conf 12664101101333789998875212613677000210001243211-53243100011001112458764565212000 Q gi|254780561|r 81 GENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIF-VPYDYGYLAFIYDKRQITQPPKNFDELINSTK 159 (338) Q Consensus 81 ~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~ 159 (338) +.++.+|++++.+...+..+.+.+++.++..+..+.+....+..... +|++++++.++||++++..+|+|.+||.+++| T Consensus 77 g~~~~ADvvvGldn~~l~~A~~~~lf~~~~~d~~~~~vp~~~~~d~f~~P~DyGy~a~vYd~~~~~~ppksL~dL~~~~y 156 (336) T COG4143 77 GKNPKADVVVGLDNNLLARARETGLFAPYGVDASDVPVPEGWKIDTFALPYDYGYFAFVYDKTKLKNPPKSLKDLVEPEY 156 (336) T ss_pred CCCCCCCEEEECCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECHHHHCCCCCCHHHHCCCCC T ss_conf 78988858993472778888855876667898557888765356751012454128999765775287626898638512 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHC-CCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCC Q ss_conf 68289850476530367899998717-73137898777628846830847898886225865699950449999985079 Q gi|254780561|r 160 PWKIIYQDPRTSTLGLGLLLWIQKIY-GDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQ 238 (338) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 238 (338) ++++.+++|..|..+..++.+++..+ ++...+++.+++++..++.+|+++.+..+..||.++.++|.++..++...++. T Consensus 157 ~~~ii~qdPrtSspGl~fLlW~~svyg~d~~~~~W~kL~~~~v~v~~gWSeaY~aFt~GEap~VLSYtTspay~~~~~~~ 236 (336) T COG4143 157 AGKIIYQDPRTSSPGLGFLLWTISVYGEDGAPEAWAKLADNGVTVTKGWSEAYGAFTKGEAPLVLSYTTSPAYHVYPEKK 236 (336) T ss_pred CCCEEECCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCEEECCCHHHHHHHHHCCCCCEEEEECCCCCHHCCCCCC T ss_conf 69578069887884078899998763623189999999858868658679889887478655799860682210253200 Q ss_pred CCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCEEECCC-CCCCCHHHHCCCCCCCCCCCH Q ss_conf 9737883169808888642433789998999999999639999999885045050557-658821341223222347898 Q gi|254780561|r 239 DDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWMYPVVD-ISMPDVYQAIRIPQKSLRFNA 317 (338) Q Consensus 239 ~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~-~~~p~~~~~~~~~~~~~~~~~ 317 (338) .+.......||.+..+++.++.|+++|+++|.+|++||+|+|+|..++++++++|+++ ..+|+.++....|.+...+++ T Consensus 237 ~~~~a~~f~eG~ylqiEgaa~~k~~knp~LA~~F~~FmlS~e~Q~~ip~~nwm~Pa~~~~~Lp~~f~~~~~p~~~~~~~~ 316 (336) T COG4143 237 DRYAAALFPEGHYLQVEGAAVLKGAKNPELADKFLQFMLSPEFQDAIPTTNWMYPAVKNVPLPAVFDALVKPAKTVSLSP 316 (336) T ss_pred CCCCHHCCCCCCEEEEEEEEECCCCCCHHHHHHHHHHHHCHHHHHHCCCCCEEEECCCCCCCCHHHHHHCCCCCCCCCCH T ss_conf 22100117888536776653112799879999999997098778407845433234789868777885178987656899 Q ss_pred HHHHHHHHHHHHHHHHHHHC Q ss_conf 99999999999999998741 Q gi|254780561|r 318 EETTRYRNQWISSWQNAVSR 337 (338) Q Consensus 318 ~~~~~~~~~~~~~W~~~v~~ 337 (338) +.+++|+.+|+++|.++|+| T Consensus 317 ~~v~~~~~~Wi~~W~~~v~~ 336 (336) T COG4143 317 KEVAANRKAWISEWQRAVSR 336 (336) T ss_pred HHHHHHHHHHHHHHHHHHHC T ss_conf 99999999999999998529 No 3 >TIGR03227 PhnS 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate binding protein. This ABC transporter periplasmic substrate binding protein component is found in a region of the salmonella typhimurium LT2 genome responsible for the catabolism of 2-aminoethylphosphonate via the phnWX pathway (GenProp0238). The protein contains a match to pfam01547 for the "Bacterial extracellular solute-binding protein" domain. Probab=100.00 E-value=0 Score=339.93 Aligned_cols=318 Identities=16% Similarity=0.241 Sum_probs=261.9 Q ss_pred HHHHHHHCCCCCCCCCEEEEEECCCCC-CCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCC Q ss_conf 999986146742348859999788778-5122338899999988488799997288689997887531266410110133 Q gi|254780561|r 15 TGVISYCTLDGLPAKPVLTVYTYNSFV-ADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFD 93 (338) Q Consensus 15 ~~~~~~~~~~~~~~~~~l~v~~~~~~~-~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~ 93 (338) ++++.+ ...++.++.+|+||+.++.. .+..+.+.++++|||+|||||+++..+ ++++++|+++++.++++||+++.+ T Consensus 22 ~~~~~~-~~~~a~a~~~~~vy~~~~~~~~~~~~~~~~~~aFek~tGikV~~v~~~-sge~laRl~aEk~nPqaDVv~~~d 99 (367) T TIGR03227 22 AALLQG-AAAPAQAAAVVLVYSADGLEDGDNSLYQDQFDAFEKAEGIKVNIVEAG-GGEVVERAAKEKGNPKADVIVTAP 99 (367) T ss_pred HHHHHH-CCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCEEEEEECC-HHHHHHHHHHCCCCCCCEEEEECC T ss_conf 999864-235554586599996466434443468887688888669838999897-799999999522799720999785 Q ss_pred HHHHHHHHHHHCCCCCCCCHHH-CCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCC Q ss_conf 3789998875212613677000-210001243211532431000110011124587645652120006828985047653 Q gi|254780561|r 94 NNLIDLARKTGLFAKSNIDASQ-LKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTST 172 (338) Q Consensus 94 ~~~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~ 172 (338) ..+.++...+|+.++.+.... ......+...++.++..+..+++||++.+..+|+||.||++|+|+|++.+.++..++ T Consensus 100 -~~~~~A~~~GLl~py~~~~~~~ip~~~k~~dg~~~~~~~~~~~~~yN~~~~k~~PkSW~DL~dP~ykgkI~~~dP~tS~ 178 (367) T TIGR03227 100 -PFIQQAAAEGLLANFNSDAAKAIPAIAKAADGLWAPFVKNYFSFAINPKLLKSAPASFADLLDADFKGKLAYSNPAQAA 178 (367) T ss_pred -HHHHHHHHCCCCCCCCCCCHHHCCHHHHCCCCCEEEEECCEEEEEEEHHHHCCCCCCHHHHHCCCCCCEEEECCCCCCH T ss_conf -7999999788956278966455777663779838988555799999857705799889987285337828952877661 Q ss_pred CHHHHHHHHHHHCC--CCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECC--- Q ss_conf 03678999987177--3137898777628846830847898886225865699950449999985079973788316--- Q gi|254780561|r 173 LGLGLLLWIQKIYG--DNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFS--- 247 (338) Q Consensus 173 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 247 (338) .+...+.++....+ +.+++++.++..|+..+..+......++.+|++.++.+.... .......++.++.+++|. T Consensus 179 ~g~~~l~~~~~~~G~eD~a~~~~~~L~~N~~~~~~s~~~~~~~~~~GE~~va~g~~~~-~~~~~~~~g~~v~~v~P~~~~ 257 (367) T TIGR03227 179 DGMAVIILAFALFGSEDAAFAYLAKLEANNKFHSAGTGKLNALLNKGEIAVANGDLQM-DLADAEHGGLNIKIFFPAADA 257 (367) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCEECCCCHHHHHHHHCCCEEEEEEECCH-HHHHHHHCCCCCEEEEECCCC T ss_conf 7999999999970985388999998874486724781799999857867899520204-367776279970289546777 Q ss_pred ---CCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCCCC-CCHH----HHCCCCCCCCCCCHHH Q ss_conf ---9808888642433789998999999999639999999885045050557658-8213----4122322234789899 Q gi|254780561|r 248 ---EGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWMYPVVDISM-PDVY----QAIRIPQKSLRFNAEE 319 (338) Q Consensus 248 ---eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~~-p~~~----~~~~~~~~~~~~~~~~ 319 (338) +|++..++.++|+|++||+++|++|||||+|+|+|+.+++.++.+|++..+. |+.. .......++..+||+. T Consensus 258 g~~~g~~~i~~~i~ivKga~n~e~AkkfidflLS~e~Q~~~a~~~~~~Pv~~~v~l~~~~~~~~~~~~~~~kl~~~Dw~~ 337 (367) T TIGR03227 258 GEPPSAFAIPYAIGLVKGAPNQDAGKKLIDFLLSADAQAKVPDLSFGIPGRSDVPLSDEHGEAAKAAIAGVKLIPPDWDA 337 (367) T ss_pred CCCCCEEEEEEHHHHHCCCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCHHHHHHCCCCCCCCEEECCCHHH T ss_conf 88873487522034644999989999999997299999999986016767788769835554221332335111589999 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999874 Q gi|254780561|r 320 TTRYRNQWISSWQNAVS 336 (338) Q Consensus 320 ~~~~~~~~~~~W~~~v~ 336 (338) ++++|++|++||++.|- T Consensus 338 i~~~r~~wi~rW~~eV~ 354 (367) T TIGR03227 338 VAAKKPADIERWHKEVI 354 (367) T ss_pred HHHHHHHHHHHHHHHHC T ss_conf 99878999999999857 No 4 >TIGR03261 phnS2 putative 2-aminoethylphosphonate ABC transport system, 1-aminoethylphosphonate-binding protein component. This ABC transporter extracellular solute-binding protein is found in a number of genomes in operon-like contexts strongly suggesting a substrate specificity for 2-aminoethylphosphonate (2-AEP). The characterized PhnSTUV system is absent in the genomes in which this system is found. These genomes encode systems for the catabolism of 2-AEP, making the need for a 2-AEP-specific transporter likely. Probab=100.00 E-value=0 Score=336.47 Aligned_cols=320 Identities=18% Similarity=0.206 Sum_probs=261.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCC Q ss_conf 9999999999998614674234885999978877851223388999999884-887999972886899978875312664 Q gi|254780561|r 7 IVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSA 85 (338) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~ 85 (338) +++++++++.++++| ++.++++|+|||. .++...++++++|||+| ||||+++. ++++++++|+++++.+++ T Consensus 2 ~~~~~~~~~~~~~a~---aa~~~~~l~VYss----~~~~~~~~i~~~Fek~~tgIkV~~v~-~~tgel~~rl~aEk~~p~ 73 (334) T TIGR03261 2 VVAGFIMTSLFFSAC---NSKANTELTVYTA----IEDELIAKYKDAFEKVNPDIKINWVR-DSTGIITAKLLAEKNNPQ 73 (334) T ss_pred HHHHHHHHHHHHHHH---HHCCCCEEEEEEC----CCHHHHHHHHHHHHHHCCCCEEEEEE-CCHHHHHHHHHHCCCCCC T ss_conf 889999999999831---0302995999969----98889999999999777994899998-972999999996457998 Q ss_pred CEEEECCCHHHHHHHHHHHCCCCCCCCHHHC-CCCCCCCC--CCCCCEEEEEEHHHHCCCCCCC----CCCCHHHHCCCC Q ss_conf 1011013337899988752126136770002-10001243--2115324310001100111245----876456521200 Q gi|254780561|r 86 ADIVLGFDNNLIDLARKTGLFAKSNIDASQL-KLPIKWDD--DIFVPYDYGYLAFIYDKRQITQ----PPKNFDELINST 158 (338) Q Consensus 86 ~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~~~~~~y~~~~~~~----~p~s~~~l~~~~ 158 (338) +||+++.+...+..+...+++.++.+..... .+.+.+.. .++.++.....++.||++.... .|+||.||++|+ T Consensus 74 aDV~~g~~~~~~~~~~~~gll~~y~~~~~~~l~~~~~d~~~~~~~~g~~~~~~~i~yN~~~~~~~~~p~P~sw~DL~dp~ 153 (334) T TIGR03261 74 ADVVWGLAASSLALLDKEGMLKPYKPKGLDALNPKFRDAKNPPHWVGMDAWMAAICFNTVEAKKKGLPKPTSWEDLTKPE 153 (334) T ss_pred CEEEEECCHHHHHHHHHCCCCCCCCCCCHHHCCHHHCCCCCCCEEEEEEEEEEEEEEEHHHCCCCCCCCCCCHHHHCCHH T ss_conf 65998456578999886043157775305447886529788851688752467899744440425899999999970947 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHH-CCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHC Q ss_conf 06828985047653036789999871-77313789877762884683084789888622586569995044999998507 Q gi|254780561|r 159 KPWKIIYQDPRTSTLGLGLLLWIQKI-YGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYG 237 (338) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 237 (338) |+|+++++++..++.+.....++... +.+.+++++.++..|+..+..+.......+.+|++.+++++.+.. ....+. T Consensus 154 ykg~i~~~~p~~sg~~~~~l~~~~~~~G~d~g~~~l~~l~~N~~~~~~~~~~~~~~v~~Ge~~ig~~~~~~~--~~~~~~ 231 (334) T TIGR03261 154 YKGHIVMPNPASSGTGFLDVSAWLQMMGEDKGWDYMDKLHKNIAVYTHSGSKPCKLAGMGEFPIGISMAYRA--LKEKKK 231 (334) T ss_pred HCCEEECCCCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHCCCEEEEEEECCHH--HHHHHC T ss_conf 489587348886338999999999973528899999999863241677763888998659857999735457--887643 Q ss_pred CCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCC-CCCCHHHHCCCCCCCCCCC Q ss_conf 997378831698088886424337899989999999996399999998850450505576-5882134122322234789 Q gi|254780561|r 238 QDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWMYPVVDI-SMPDVYQAIRIPQKSLRFN 316 (338) Q Consensus 238 ~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~-~~p~~~~~~~~~~~~~~~~ 316 (338) +.++.+++|+||+...+++++|+|++||+++|++|||||+|+|+|+.+++. +.+|+++. ..|+.........+++.+| T Consensus 232 g~pv~~v~P~eG~~~~~~~~~Ivk~a~n~~~Ak~FidfllS~e~Q~~~a~~-~~~p~~~~v~~p~~~~p~~~~~~l~~~D 310 (334) T TIGR03261 232 GAPIDVVFPKEGLGWDIEATAIIKGSKNNDAAKKLVDWSISDEAMELYAKN-YAVVATPGVAKPDAGFPKNVEDLLIKND 310 (334) T ss_pred CCCEEEEECCCCCEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHC-CEECCCCCCCCCCCCCCCCHHHHCCCCC T ss_conf 897338945898478773588887998989999999997699999999964-9034689887876679842354171779 Q ss_pred HHHHHHHHHHHHHHHHHHHHC Q ss_conf 899999999999999998741 Q gi|254780561|r 317 AEETTRYRNQWISSWQNAVSR 337 (338) Q Consensus 317 ~~~~~~~~~~~~~~W~~~v~~ 337 (338) ++.+++||++|+++|++.+.. T Consensus 311 ~~~~a~nr~~~l~~W~~~~g~ 331 (334) T TIGR03261 311 FVWAAANRDKILEEWSKRYGA 331 (334) T ss_pred HHHHHHHHHHHHHHHHHHHCC T ss_conf 999987599999999998453 No 5 >PRK09501 potD spermidine/putrescine ABC transporter periplasmic substrate-binding protein; Reviewed Probab=100.00 E-value=0 Score=296.78 Aligned_cols=284 Identities=15% Similarity=0.218 Sum_probs=220.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHH Q ss_conf 90279999999999999986146742348859999788778512233889999998848879999728868999788753 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRME 80 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~ 80 (338) |||+.+.++++++++..++++ +++++++|+||+|+.|.. ++++++||++|||+|++..++|++++++|++++ T Consensus 1 ~~~~~~~ll~~~a~~l~~~~~---~a~~~~~L~i~~W~~Yi~-----~~~i~~Fe~etGikV~~~~~~s~ee~~akL~a~ 72 (348) T PRK09501 1 MKKWSRHLLAAGALALGMSAA---HADDNNTLYFYNWTEYVP-----PGLLEQFTKETGIKVIYSTYESNETMYAKLKTY 72 (348) T ss_pred CHHHHHHHHHHHHHHHHCCHH---HHCCCCEEEEECCCCCCC-----HHHHHHHHHHHCCEEEEEECCCHHHHHHHHHCC T ss_conf 936899999999999735033---543389799987100189-----789999999989889999549999999999717 Q ss_pred HCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHC---------CCCCCCCCCCCCCEEEEEEHHHHCCCCCC-CCCCC Q ss_conf 126641011013337899988752126136770002---------10001243211532431000110011124-58764 Q gi|254780561|r 81 GENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQL---------KLPIKWDDDIFVPYDYGYLAFIYDKRQIT-QPPKN 150 (338) Q Consensus 81 ~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~y~~~~~~-~~p~s 150 (338) + +.++|+++. +...+..+...+++++.+....+. ......+..|.+|+.+++.++.||++++. .+++| T Consensus 73 ~-~~~yDvv~p-s~~~v~~l~~~gll~~LD~s~ipN~~nl~p~~~~~~~dp~~~y~vPy~wGt~Gi~yn~d~v~~~~~~S 150 (348) T PRK09501 73 K-DGAYDLVVP-STYYVDKMRKEGMIQKIDKSKLTNFSNLDPDMLNKPFDPNNDYSIPYIWGATAIGVNGDAVDPKSVTS 150 (348) T ss_pred C-CCCCEEEEE-CHHHHHHHHHCCCCCCCCHHHCCCHHHCCHHHHCCCCCCCCCEEEEEEEECCEEEEEECCCCCCCCCC T ss_conf 9-998289997-96899999977984547842284812269767447748998489889975115998501147888887 Q ss_pred HHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCC------CCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEE Q ss_conf 565212000682898504765303678999987177------31378987776288468308478988862258656999 Q gi|254780561|r 151 FDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYG------DNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLS 224 (338) Q Consensus 151 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 224 (338) |.+||+++++|++.+.+.........+...+...+. ....+.+.+++++...+.++ .....+.+|++.++.+ T Consensus 151 W~dL~dp~~kGki~~~d~~~~~~~~al~~lG~~~n~~d~~~l~~a~~~L~~~~~~i~~~~sd--~~~~~l~~Gev~ia~~ 228 (348) T PRK09501 151 WADLWKPEYKGSLLLTDDAREVFQMALRKLGYSGNTTDPKEIEAAYNELKKLMPNVAAFNSD--NPANPYMEGEVNLGMI 228 (348) T ss_pred HHHHCCHHHCCEEEECCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEEECC--CHHHHHHCCCCEEEEE T ss_conf 89854966479079716678999999998279988899999999999999851105898177--0546987388329987 Q ss_pred ECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCCC Q ss_conf 50449999985079973788316980888864243378999899999999963999999988504505055765 Q gi|254780561|r 225 YSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWMYPVVDIS 298 (338) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~ 298 (338) |.++. ....+.+.++.+++|+||+..|.++++|+|+++|+++|++|||||++||+++.+++..++.+.+..+ T Consensus 229 wsgda--~~~~~~~~~i~~v~PkEG~~~w~D~~~Ipk~A~n~e~A~~FInf~l~Pevaa~~a~~~~y~t~n~~A 300 (348) T PRK09501 229 WNGSA--FVARQAGTPIDVVWPKEGGIFWMDSLAIPANAKNKEGALKLINFLLRPDVAKQVAETIGYPTPNLAA 300 (348) T ss_pred ECHHH--HHHHHCCCCCCEEECCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHCHHHHHHHHHHHCCCCHHHHH T ss_conf 40589--9999708997189757886699899889899989899999999861979999999974889830888 No 6 >TIGR01254 sfuA ABC transporter periplasmic binding protein, thiB subfamily; InterPro: IPR005948 Bacterial high affinity transport systems are involved in active transport of solutes across the cytoplasmic membrane. The protein components of these traffic systems include one or two transmembrane protein components, one or two membrane-associated ATP-binding proteins and a high affinity periplasmic solute-binding protein. In Gram-positive bacteria, which are surrounded by a single membrane and therefore have no periplasmic region, the equivalent proteins are bound to the membrane via an N-terminal lipid anchor. These homologue proteins do not play an integral role in the transport process per se, but probably serve as receptors to trigger or initiate translocation of the solute through the membrane by binding to external sites of the integral membrane proteins of the efflux system. In addition at least some solute-binding proteins function in the initiation of sensory transduction pathways. Thiamine pyrophosphate 1 is a required cofactor synthesized de novo in Salmonella typhimurium. The primary role for TPP is in central metabolism as an electron carrier and nucleophile for such enzymes as pyruvate dehydrogenase (1.2.4.1 from EC), acetolactate synthase (4.1.3.18 from EC), and alpha-ketoglutarate dehydrogenase (1.2.4.2 from EC). Despite its importance in cellular physiology, neither the de novo biosynthetic pathway nor the salvage systems for thiamine are fully understood in any organism. The thiamine ABC transporter, periplasmic binding protein in bacteria is believed to be involved in the specific translocation of thiamine and its phosphoesters across the inner membrane. The protein belongs to the larger ABC transport system which consists of at least three components: the inner membrane permease; thiamine binding protein and an ATP-binding subunit. It has been experimentally demonstrated that mutants in the various steps in the de novo synthesis of thiamine and its biologically active form, namely thiamine pyrophosphate can be exogenously supplemented with thiamine, thiamine monophosphate (TMP) or thiamine pyrophosphate TPP. ; GO: 0005215 transporter activity, 0006810 transport, 0030288 outer membrane-bounded periplasmic space. Probab=100.00 E-value=0 Score=301.60 Aligned_cols=304 Identities=49% Similarity=0.906 Sum_probs=287.5 Q ss_pred CCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCC Q ss_conf 4885999978877851223388999999884-887999972886899978875312664101101333789998875212 Q gi|254780561|r 28 AKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLF 106 (338) Q Consensus 28 ~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~ 106 (338) ++++||||+++|+..+++..+.+.++||+.+ +|+|+++..+++.++++|++.++.++.+||+++.|+..+..+.+.+|+ T Consensus 1 A~~~l~v~tydSF~~~wG~~p~~e~~f~kA~~~C~~~fv~~~d~~~lLNrL~L~g~~~~ADvVlGLDN~ll~~A~~~Gll 80 (324) T TIGR01254 1 AQTTLTVYTYDSFAADWGLGPKVEKAFEKADAQCKVKFVALGDAGELLNRLRLEGKNPKADVVLGLDNNLLEKAEKTGLL 80 (324) T ss_pred CCCEEEEEEEECCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCEEEEEEEEECCCCCCCCEEECCCHHHHHHHHHCCCC T ss_conf 98558898863214787620478876300588862788843884332111231369884316765236789999871742 Q ss_pred CCCC-CCHHHCCCCCCCCCC-CCCCEEEEEEHHHHCCCCCCCC----CCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHH Q ss_conf 6136-770002100012432-1153243100011001112458----764565212000682898504765303678999 Q gi|254780561|r 107 AKSN-IDASQLKLPIKWDDD-IFVPYDYGYLAFIYDKRQITQP----PKNFDELINSTKPWKIIYQDPRTSTLGLGLLLW 180 (338) Q Consensus 107 ~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~y~~~~~~~~----p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (338) .+.. +...+...+..+.+. ...+++++++.+.|++.++..+ |+|.+||..+++..++++.+|.+|+.+.+++++ T Consensus 81 a~~~sv~~~~v~~p~~w~~~Gtf~p~DYGY~A~vYDk~~l~~~~L~LP~SL~eL~~~~~~~~viy~dPrTSsPG~GlLlw 160 (324) T TIGR01254 81 APAQSVDLAKVAVPTGWDDAGTFLPVDYGYVAFVYDKKKLQNPKLALPKSLKELVEPEYDLLVIYQDPRTSSPGLGLLLW 160 (324) T ss_pred CCCCCCCCCCCCCCCCCCCCCCEEEECCCEEEEEECHHHHCCCCCCCCCCHHHHCCHHHCCEEEECCCCCCCHHHHHHHH T ss_conf 33578753324788865767746000012134243158754888888867366347215263465388867256899998 Q ss_pred HHHHC-CCCHHHHHHHHHHCCCEEECCCHHHHH---HHHCCCCCEEEEECCHHHHHHHHHC--------CCCEEEEECCC Q ss_conf 98717-731378987776288468308478988---8622586569995044999998507--------99737883169 Q gi|254780561|r 181 IQKIY-GDNSAQVWKKIATKTATVTKGWTESYG---FFLKGESDFVLSYSTSPGFYLLNYG--------QDDYVAALFSE 248 (338) Q Consensus 181 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~e 248 (338) +...+ .+..+.+++.+..+.+++.+|+++.+. .+..|+.++.++|.++..++.+.+. ..+.......| T Consensus 161 ~~~~Yg~d~a~~aW~~l~~~tv~V~kGWSeAY~~DF~f~gG~~dLVLSY~tSPay~~~~~~nydP~K~~~~~~A~~~l~e 240 (324) T TIGR01254 161 MKSVYGEDKAPQAWKQLRKKTVTVTKGWSEAYYKDFAFLGGEYDLVLSYATSPAYHVLFEKNYDPAKLAKAPYANLNLSE 240 (324) T ss_pred HHHCCCHHHHHHHHHHHHHCCCEEEECCHHHHHHHHHHCCCCCCEEEEHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC T ss_conf 76237800379999999837947863332444455876179876453002337999998706882113776421101699 Q ss_pred CEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCCCCCCHHHHCCCCCCC-CCCCHHHHHHHHHHH Q ss_conf 8088886424337899989999999996399999998850450505576588213412232223-478989999999999 Q gi|254780561|r 249 GHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWMYPVVDISMPDVYQAIRIPQKS-LRFNAEETTRYRNQW 327 (338) Q Consensus 249 G~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~~p~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 327 (338) |++..+++.+.+|++|||++|.+||+||+||++|+.|.+.+++|||++...++.+..+..+.+. ..+....+++++..| T Consensus 241 G~~~QvE~aA~~k~~KqP~LA~kf~~Fl~S~~~Q~~iPT~~w~YPv~~~t~~~~~~~l~~~~~~~~~~~~~~v~~~~~~w 320 (324) T TIGR01254 241 GHYLQVEVAARLKGAKQPELADKFVQFLLSPEVQKAIPTGNWMYPVVKGTLLPGFDKLAQQPTTDTAPAPAEVTANRKRW 320 (324) T ss_pred CEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHCCCCCEEECEECCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHH T ss_conf 70686543311247998678999999830768775235454540201156562135536886567561157887665432 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780561|r 328 ISSW 331 (338) Q Consensus 328 ~~~W 331 (338) |++| T Consensus 321 i~~W 324 (324) T TIGR01254 321 ISEW 324 (324) T ss_pred HCCC T ss_conf 0149 No 7 >TIGR01276 thiB thiamin/thiamin pyrophosphate ABC transporter, thiamin/thiamin pyrophospate-binding protein; InterPro: IPR005967 Bacterial high affinity transport systems are involved in active transport of solutes across the cytoplasmic membrane. The protein components of these traffic systems include one or two transmembrane protein components, one or two membrane-associated ATP-binding proteins and a high affinity periplasmic solute-binding protein. In Gram-positive bacteria, which are surrounded by a single membrane and therefore have no periplasmic region, the equivalent proteins are bound to the membrane via an N-terminal lipid anchor. These homologue proteins do not play an integral role in the transport process per se, but probably serve as receptors to trigger or initiate translocation of the solute through the membrane by binding to external sites of the integral membrane proteins of the efflux system. In addition at least some solute-binding proteins function in the initiation of sensory transduction pathways. Thiamine pyrophosphate (TPP) is a required cofactor synthesized de novo in Salmonella typhimurium . The primary role for TPP is in central metabolism as an electron carrier and nucleophile for such enzymes as pyruvate dehydrogenase (1.2.4.1 from EC), acetolactate synthase (4.1.3.18 from EC), and alpha-ketoglutarate dehydrogenase (1.2.4.2 from EC). Despite its importance in cellular physiology, neither the de novo biosynthetic pathway nor the salvage systems for thiamine are fully understood in any organism. The thiamine ABC transporter, periplasmic binding protein in proteobacteria is believed to be involved in the specific translocation of thiamine and its phosphoesters across the inner membrane The protein belongs to the larger ABC transport system which consists of at least three components: the inner membrane permease; thiamine binding protein and an ATP-binding subunit. It has been experimentally demonstrated that mutants in the various steps in the de novo synthesis of thiamine and its biologically active form, namely thiamine pyrophosphate can be exogenously supplemented with thiamine, thiamine monophosphate or thiamine pyrophosphate.; GO: 0005524 ATP binding, 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0030288 outer membrane-bounded periplasmic space. Probab=100.00 E-value=0 Score=300.85 Aligned_cols=312 Identities=52% Similarity=0.954 Sum_probs=302.7 Q ss_pred CCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHH Q ss_conf 234885999978877851223388999999884-8879999728868999788753126641011013337899988752 Q gi|254780561|r 26 LPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTG 104 (338) Q Consensus 26 ~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~ 104 (338) ..+++.|+||+++|+..+|+..+.+.++|||++ .|.++++...+...+++|++.||.+..+||+++.|+.+++.+...| T Consensus 12 ~~A~~~~~VY~YDSF~~~WG~GP~VK~~FEkA~~~C~~~l~~lE~~~~LLNR~RlEGK~~KAD~VLGLDN~lL~~A~KTG 91 (326) T TIGR01276 12 VFAKQVLTVYSYDSFAADWGAGPKVKKDFEKADPNCELKLVALEDVVVLLNRLRLEGKKSKADVVLGLDNFLLDAAEKTG 91 (326) T ss_pred HHCCCEEEEEECCCCCCCCCCCCCCHHHHHCCCCCCCEEEEECCCCEEEEEEEEECCCCCCCCEEECCHHHHHHHHHHCC T ss_conf 65054016850224332357758421144057876413656413330322122303666543367633056787664166 Q ss_pred CCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHH Q ss_conf 12613677000210001243211532431000110011124587645652120006828985047653036789999871 Q gi|254780561|r 105 LFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKI 184 (338) Q Consensus 105 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (338) ++.+........+.+..|.+...+|++++++.++|++.|...||+|..+|-+..-.-++.+.||..|..+.+++.+|... T Consensus 92 ~FA~~~V~~~~~~~P~~W~N~TF~PFDyG~yAFVYDK~KL~NPP~SLKELVEs~Q~~~VIYQDPRTS~~G~GLL~WM~~V 171 (326) T TIGR01276 92 LFAKSNVDLDALDLPTKWANDTFLPFDYGYYAFVYDKTKLKNPPKSLKELVESDQDLRVIYQDPRTSSVGLGLLLWMNKV 171 (326) T ss_pred CCCCCCCCCEECCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCHHHHHHHHC T ss_conf 43556555000048877467621564457535787210146886445765203577268975687686443476655440 Q ss_pred C-CCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCC Q ss_conf 7-731378987776288468308478988862258656999504499999850799737883169808888642433789 Q gi|254780561|r 185 Y-GDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRS 263 (338) Q Consensus 185 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a 263 (338) + +++....++++.++.+++.+|+++.+-.+..||.|..++|.++..++.+.+..++.-.....||.+..++..+-++++ T Consensus 172 YP~D~~~~AW~~L~K~TVTV~KGWS~~YG~FLKGE~DLVLSY~TSP~YH~L~EKKDNYAA~~F~EGH~~QVE~AAR~A~~ 251 (326) T TIGR01276 172 YPADDIQSAWKKLDKKTVTVTKGWSDAYGLFLKGEADLVLSYSTSPLYHILEEKKDNYAAADFAEGHILQVELAARVAAS 251 (326) T ss_pred CCCCCHHHHHHHHCCCEEEEECCCHHHCCCCCCCCCCEEEEECCCCHHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHCC T ss_conf 78554478999744643664068255504000243102420000521321210243112302146623556667775258 Q ss_pred CCHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCCCCCCHHHHCCCCCCC-CCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9989999999996399999998850450505576588213412232223-4789899999999999999998741 Q gi|254780561|r 264 KQPQLAQEFMQFMISPSFQRILPTTNWMYPVVDISMPDVYQAIRIPQKS-LRFNAEETTRYRNQWISSWQNAVSR 337 (338) Q Consensus 264 ~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~~p~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~W~~~v~~ 337 (338) ++|++|.+||.||.||-.|..|.+.|+++||.+..+|..+..+..+.+. ..+.+...++.|..||..|+++|+| T Consensus 252 ~~P~~A~~Fl~Fl~SP~~Q~~I~T~N~M~PV~~~~~~~~F~~L~~~~~Tq~~~~P~~~~~Q~~~WIS~WQ~~~~~ 326 (326) T TIGR01276 252 KNPELADDFLAFLVSPAAQKAIVTANIMLPVINVELEAGFDKLKKQAKTQLEITPAVVAEQLKAWISEWQRALSK 326 (326) T ss_pred CCCHHHHHHHHHHCCHHHHHCCCCCCEECCEEEEEECCCHHHHCCCCCCCEEECHHHHHHHHHHHHHHHHHHHCC T ss_conf 880458999997318356301332321203353211501354166522210205167888887567777775149 No 8 >PRK10682 putrescine transporter subunit: periplasmic-binding component of ABC superfamily; Provisional Probab=100.00 E-value=1.1e-43 Score=273.04 Aligned_cols=288 Identities=12% Similarity=0.175 Sum_probs=210.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHH Q ss_conf 90279999999999999986146742348859999788778512233889999998848879999728868999788753 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRME 80 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~ 80 (338) |..+-+-.++.++++++++.+++..++++++|+||+|++|.. +.++++|||+|||+|++..++|.+++++|++++ T Consensus 1 ~~~~~~~~~~~l~~~~~~~~~~~~~aa~~~~Lnv~nW~~Yi~-----p~~i~~FeketGikV~~~~~~snee~~akl~ag 75 (370) T PRK10682 1 MTALNKKWLSGLVAGALMAVSVGTLAAEQKTLHIYNWSDYIA-----PDTVANFEKETGIKVVYDVFDSNEVLEGKLMAG 75 (370) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCHHHHCCCEEEEECCCCCCC-----HHHHHHHHHHHCCEEEEEECCCHHHHHHHHHCC T ss_conf 931778899999999999985205664279799981676579-----779999999979889999549989999999669 Q ss_pred HCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHH----CC-------CCCCCCCCCCCCEEEEEEHHHHCCCCCCC--- Q ss_conf 12664101101333789998875212613677000----21-------00012432115324310001100111245--- Q gi|254780561|r 81 GENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQ----LK-------LPIKWDDDIFVPYDYGYLAFIYDKRQITQ--- 146 (338) Q Consensus 81 ~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~----~~-------~~~~~~~~~~~~~~~~~~~~~y~~~~~~~--- 146 (338) + ..+||++. +...+......+++++.+....+ .. .....+..|.+|+.+++.++.||++++.. T Consensus 76 ~--~gyDvv~p-s~~~v~~~~~~gll~pLD~sklpN~~ni~p~~~~~~~~~dpg~~YsVPy~wGt~GI~Yn~d~v~~~~g 152 (370) T PRK10682 76 S--TGFDLVVP-SASFLERQLTAGVFQPLDKSKLPNWKNLDPELLKLVAKHDPDNKYAMPYMWATTGIGYNVDKVKAVLG 152 (370) T ss_pred C--CCCCEEEE-CHHHHHHHHHCCCCCCCCHHHCCCHHHCCHHHHHHHHCCCCCCEEEEEEEECCEEEEEECCCCCCCCC T ss_conf 9--99988997-77999999976984647700086744429998841003699975998888422599997654422346 Q ss_pred ---CCCCHHHHCCCCCCCE-----EEEEECCCCCCHHHHHHHHHHHCCCC-------HHHHHHHHHHCCCEEECCCHHHH Q ss_conf ---8764565212000682-----89850476530367899998717731-------37898777628846830847898 Q gi|254780561|r 147 ---PPKNFDELINSTKPWK-----IIYQDPRTSTLGLGLLLWIQKIYGDN-------SAQVWKKIATKTATVTKGWTESY 211 (338) Q Consensus 147 ---~p~s~~~l~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~ 211 (338) +++||++||+|++.+| +.+.+......+..+...+...+..+ ..+.+.+++.++..+.+ .+.. T Consensus 153 ~~~p~~sW~~l~~P~~~~k~k~~~v~~ld~~~~~~~~aL~~lG~~~n~~~~~d~~~aa~~~L~~~kp~v~~~~s--~~~~ 230 (370) T PRK10682 153 EDAPVDSWDLVLKPENLEKLKSCGVSFLDAPEEIFATVLNYLGKDPNSTKADDYTGPATDLLLKLRPNIRYFHS--SQYI 230 (370) T ss_pred CCCCCCHHHHHHCHHHHHHCCCCCEEEECCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCHHCCEECC--CHHH T ss_conf 78884629986492756530358558976888999999997699989999999999999999971222441037--2889 Q ss_pred HHHHCCCCCEEEEECCHHHHHH----HHHCCCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 8862258656999504499999----850799737883169808888642433789998999999999639999999885 Q gi|254780561|r 212 GFFLKGESDFVLSYSTSPGFYL----LNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPT 287 (338) Q Consensus 212 ~~~~~g~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~ 287 (338) ..+.+|++.++.+|.++..... ..+.+.++.+++|+||+..|.++++|+|+|+|+++|++|||||++||.++.++. T Consensus 231 ~~l~~Gei~~a~~wsGd~~~~~~~~~~~~~~~~i~yv~PkEG~~~w~D~~~Ipk~A~n~~~A~~FInf~l~Pe~aa~~~~ 310 (370) T PRK10682 231 NDLANGDICVAIGWAGDVWQASNRAKEAKNGVNVSFSIPKEGAMAFFDVFAMPADAKNKDEAYQFLNYLLRPDVIAHISD 310 (370) T ss_pred HHHHCCCEEEEEEECHHHHHHHHHHHHCCCCCEEEEEECCCCCEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHH T ss_conf 99854867999974089999999998635787069981588745888788886899996999999998509799999998 Q ss_pred CCCEEECCCCC Q ss_conf 04505055765 Q gi|254780561|r 288 TNWMYPVVDIS 298 (338) Q Consensus 288 ~~~~~P~~~~~ 298 (338) ..++.+.++.+ T Consensus 311 ~~~y~~~n~~A 321 (370) T PRK10682 311 HVFYANANKAA 321 (370) T ss_pred HHCCCCHHHHH T ss_conf 64788753898 No 9 >COG0687 PotD Spermidine/putrescine-binding periplasmic protein [Amino acid transport and metabolism] Probab=100.00 E-value=1.6e-41 Score=260.36 Aligned_cols=288 Identities=17% Similarity=0.244 Sum_probs=212.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC--CCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHH Q ss_conf 90279999999999999986146--7423488599997887785122338899999988488799997288689997887 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL--DGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLR 78 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~ 78 (338) ||++.+.. .++++++++++|+. .+.+++.+|+|++|++|.. +.++++|||+|||+|++..+++.+++.++++ T Consensus 2 ~k~~~~~~-~~~~~~~~~a~~~~~~~a~a~~~~l~v~~w~gy~~-----~~~~~~F~k~tGi~V~~~~~~sne~~~ak~~ 75 (363) T COG0687 2 MKKLSRRS-FLALAALALAALAGAAAAAAEGKTLNVYNWGGYID-----PALLKPFEKETGIKVVYDTFDSNEEMLAKLK 75 (363) T ss_pred CCHHHHHH-HHHHHHHHHHHHHCHHHHHHCCCEEEEECCCCCCC-----HHHHHHHHHHHCCEEEEECCCCHHHHHHHHH T ss_conf 41466667-89999999998600045662578599964787557-----7889989999899799953898699999998 Q ss_pred HHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHH---------CCCC-CCCCCCCCCCEEEEEEHHHHCCCCCC--C Q ss_conf 5312664101101333789998875212613677000---------2100-01243211532431000110011124--5 Q gi|254780561|r 79 MEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQ---------LKLP-IKWDDDIFVPYDYGYLAFIYDKRQIT--Q 146 (338) Q Consensus 79 a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~---------~~~~-~~~~~~~~~~~~~~~~~~~y~~~~~~--~ 146 (338) +++.+ +|++.. +...+.+....+++++.+....+ .... ...+..|.+|+.+++.++.||+++++ . T Consensus 76 ~g~~~--~Dvv~~-s~~~~~~~~~~gll~~lD~skl~n~~~l~~~~~~~~~~d~g~~y~vPy~~g~t~i~Yn~~~~~~~~ 152 (363) T COG0687 76 AGGGG--YDVVVP-SGYFVARLIKEGLLQPLDKSKLPNWKNLDPEFLNPPKFDPGNKYSVPYFWGTTGIAYNTDKVKDAP 152 (363) T ss_pred HCCCC--CCEEEC-CHHHHHHHHHCCCCCCCCHHHCCCHHHCCHHHHCCCCCCCCCEECEEEEEEEEEEEEECCCCCCCC T ss_conf 08999--887971-888999998667567589666648211698783577678888603167886469999800058999 Q ss_pred CCCCHHHHCCCC----CCCE-EEEEECCCCCCHHHHHHHHHHHCCC------CHHHHHHHHHHCCCEEECCCHHHHHHHH Q ss_conf 876456521200----0682-8985047653036789999871773------1378987776288468308478988862 Q gi|254780561|r 147 PPKNFDELINST----KPWK-IIYQDPRTSTLGLGLLLWIQKIYGD------NSAQVWKKIATKTATVTKGWTESYGFFL 215 (338) Q Consensus 147 ~p~s~~~l~~~~----~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (338) +|+||++||+++ ++++ +.+.+.........+...+...... ...+.+.+++. ...++.+..+..+.+. T Consensus 153 ~~~sW~~l~d~~~~~~~k~~~~~~~d~~~~~~~~al~~lg~~~n~~~~~~~~~a~~~L~~~kp-~~~~~~~~~~~~~~l~ 231 (363) T COG0687 153 PPTSWADLFDPEKFPGLKGKGVALLDDDREVFGAALALLGKSPNTTDPEDLKKAFDLLDKLKP-VNVYWFDGSQYVQLLA 231 (363) T ss_pred CCCCHHHHHCCHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCC-CEEEEECCHHHHHHHH T ss_conf 998899983842002232575301366057899999971899788998899999999997278-5089963627888986 Q ss_pred CCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCEEECC Q ss_conf 25865699950449999985079973788316980888864243378999899999999963999999988504505055 Q gi|254780561|r 216 KGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWMYPVV 295 (338) Q Consensus 216 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~ 295 (338) +|+++++.+|.++.......+++.++.++.|+||++.|.++++|+|+++|+++|++|||||++||+|+.+++..++.|.+ T Consensus 232 ~Gev~~a~~w~g~~~~~~~~~~~~~i~~~~p~eG~~~w~D~~~ipk~a~n~~~A~~fInf~~~pe~~a~~~~~~~y~~~n 311 (363) T COG0687 232 NGEVVLAMGWSGDAAAAKAAKNGAPIEFVIPKEGSILWFDNLAIPKGAKNVDAAYKFINFLLDPEVAAKLAEFVGYAPPN 311 (363) T ss_pred CCCEEEEEEECHHHHHHHHCCCCCEEEEECCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCC T ss_conf 69978999777789998625589805773478873478788998799999899999999851989999998732678888 Q ss_pred CCC Q ss_conf 765 Q gi|254780561|r 296 DIS 298 (338) Q Consensus 296 ~~~ 298 (338) +.+ T Consensus 312 ~~a 314 (363) T COG0687 312 KAA 314 (363) T ss_pred HHH T ss_conf 777 No 10 >COG1840 AfuA ABC-type Fe3+ transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=100.00 E-value=3.4e-33 Score=211.78 Aligned_cols=285 Identities=20% Similarity=0.205 Sum_probs=205.8 Q ss_pred HHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCC-CCCEEEECCCHHHHHHHHHHHCCCCCCCC-HHHCCCCCCCCCCC- Q ss_conf 9999998848879999728868999788753126-64101101333789998875212613677-00021000124321- Q gi|254780561|r 50 IKQAFERKCNCELKLIGLSDGVALLNKLRMEGEN-SAADIVLGFDNNLIDLARKTGLFAKSNID-ASQLKLPIKWDDDI- 126 (338) Q Consensus 50 i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~-~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~- 126 (338) ++++|++++||+|+++..+++ +.+.++.++..+ ..+||+++.+...+..+.+.+++.++.+. .........+.... T Consensus 1 ~~~~f~~~~gikv~~~~~~s~-~~~~~~~~e~~~~~~aDv~~~~~~~~~~~l~~~gll~~y~~~~~~~i~~~~~~~~~~~ 79 (299) T COG1840 1 LIKAFEKKPGIKVEFVDAGSG-ELLARIIAEKGNPQAADVLWGGAPVSLDQLKEAGLLQPYKSPELDEIPAWAPDPDGAY 79 (299) T ss_pred CHHHHHHCCCCEEEEEECCCH-HHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHCCCCCCCCCCCHHHHHHCCCCCCCCC T ss_conf 955673377966898748848-9999999722489988189606816999998579722269834656533134644466 Q ss_pred ---CCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEE Q ss_conf ---15324310001100111245876456521200068289850476530367899998717731378987776288468 Q gi|254780561|r 127 ---FVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATV 203 (338) Q Consensus 127 ---~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (338) +.......++..++++...+.|++|.||.+|+|+|++.+.++..+.............+...+++++..++.|...+ T Consensus 80 ~~~~~~~~~~~~n~~~~~~~~~~~P~~w~DL~~p~~kg~i~~~~p~~s~~~~~~~~~~~~~G~~~~~~~l~~l~~n~~~~ 159 (299) T COG1840 80 GGGYVGPLVFAVNTDYNKDLLKPVPKSWADLAKPEYKGKVQMADPTSSGTAYAALLALQAYGEEKGWAYLKGLAANLATY 159 (299) T ss_pred CCCEEEEEEEEEEEECCCCCCCCCCCCHHHHHCHHHCCCEEEECCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC T ss_conf 56447789999961103445689986799870988669846547885589999999998648799999999999851334 Q ss_pred E-CCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHH Q ss_conf 3-084789888622586569995044999998507997378831698088886424337899989999999996399999 Q gi|254780561|r 204 T-KGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQ 282 (338) Q Consensus 204 ~-~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q 282 (338) . .+..........|+.++++++..+..+......+.++.+++|.+|+.+.+.+++|+|+++|+++|++||+|++|+|+| T Consensus 160 ~~~~~~~~~~~va~Ge~~vg~~~~~~~~~~~~~~~~~~v~iv~P~~G~~v~~~~vaiik~a~~~e~Ak~fid~llS~egQ 239 (299) T COG1840 160 TGGSSSVVAKVVAGGEAAVGLGNLYYGAYAKDKAKGAPVEVVYPEEGTGVNPSGVALLKKAKNPEAAKLFIDFLLSKEGQ 239 (299) T ss_pred CCCCCHHHHHHHHCCCCCEEEEECCHHHHHHHHCCCCCEEEEECCCCCEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHH T ss_conf 57882889999747986478872107999975146985699948988689754355434899989999999998089889 Q ss_pred HHHHHCCC--EEECCCCCCCCHHHHCCCCCCCCCCCHHHHHH----HHHHHHHHHHHHHHC Q ss_conf 99885045--05055765882134122322234789899999----999999999998741 Q gi|254780561|r 283 RILPTTNW--MYPVVDISMPDVYQAIRIPQKSLRFNAEETTR----YRNQWISSWQNAVSR 337 (338) Q Consensus 283 ~~~a~~~~--~~P~~~~~~p~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~W~~~v~~ 337 (338) +.++..+. .+|++.... ..+....+.....++++.+.. .+.+++++|.+.+.. T Consensus 240 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 298 (299) T COG1840 240 EILAKKGYSPEIPVNPKAE--DSPAALDFADLIALDFETLGKLLEAEREKFAEKWVKEVGA 298 (299) T ss_pred HHHHHCCCCCCCCCCCCCC--CCCCCCCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9998417576678888766--6742015022024584430035788889999999998644 No 11 >PRK11622 putative ABC transporter solute-binding protein; Provisional Probab=99.97 E-value=2.4e-27 Score=177.61 Aligned_cols=299 Identities=18% Similarity=0.203 Sum_probs=211.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC--------CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHH Q ss_conf 902799999999999999861467--------423488599997887785122338899999988488799997288689 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD--------GLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVA 72 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~ 72 (338) |+|...+++++++++.+.++|+.. ..+..+|++.|.||+-..--..++.+.++.+++|||+++.+..++..+ T Consensus 1 ~~r~~~~~~~~l~~~~~~~~~~~~~~~W~~i~~~A~GqtV~f~aWGGs~~iN~yi~w~~~~l~~~y~ItL~~Vk~~Dt~~ 80 (401) T PRK11622 1 MMRHLLALLGLLALLSLAAAASDAANTWQQILEEAKGQTVYFNAWGGSPAINRYLDWVAKELKERYGITLKHVKLADIAE 80 (401) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCEEEEEEECCCHHHHHHHHHHHHHHHHHHCCEEEEECCCCHHH T ss_conf 95889999999998750145764448999999971698899997679577759999999999998596689952545899 Q ss_pred HHHHHHHHHC-----CCCCEEEECCCHHHHHHHHHHHCCC-CCC---CCHHHCC-------CCCCCCCCCCCCEEEEEEH Q ss_conf 9978875312-----6641011013337899988752126-136---7700021-------0001243211532431000 Q gi|254780561|r 73 LLNKLRMEGE-----NSAADIVLGFDNNLIDLARKTGLFA-KSN---IDASQLK-------LPIKWDDDIFVPYDYGYLA 136 (338) Q Consensus 73 ~~~~~~a~~~-----~~~~Dv~~~~~~~~~~~~~~~~l~~-~~~---~~~~~~~-------~~~~~~~~~~~~~~~~~~~ 136 (338) .++|+.+|+. ++.+|++|.....+... .+.+|+. +.. +.+...+ +.......+-.|+.....+ T Consensus 81 av~rvlaEk~AG~~~~GsvDliWINGENF~~m-K~~~LL~gp~~~~LPN~~~vd~~~~v~~DFg~PtdG~EaPWg~aQ~v 159 (401) T PRK11622 81 AVNRLLAEKQAGRDTGGSVDLVWINGENFRTL-KEAGLLYGPFAETLPNWRYVDTQLPVREDFGVPTEGLEAPWGVAQLV 159 (401) T ss_pred HHHHHHHHHHHCCCCCCCEEEEEECCHHHHHH-HHCCCCCCCHHHHCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEE T ss_conf 99999999860778898445899886128988-86783138766659404205887553555787366551676553189 Q ss_pred HHHCCCCCCCCCCCHHHHCC--CCCCCEEEEEECCCCCCHHHHHHHHHHHCCC------------------CHHHHHHHH Q ss_conf 11001112458764565212--0006828985047653036789999871773------------------137898777 Q gi|254780561|r 137 FIYDKRQITQPPKNFDELIN--STKPWKIIYQDPRTSTLGLGLLLWIQKIYGD------------------NSAQVWKKI 196 (338) Q Consensus 137 ~~y~~~~~~~~p~s~~~l~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~ 196 (338) +.|+..++..+|.|..+|.+ .+.+||+.++.+............+....++ ..|+++.++ T Consensus 160 f~ydsa~v~~pP~s~~eLl~wak~nPGrfTYP~PPdF~GstFlkq~L~~l~~~~~~l~~p~~~~~~~~~~~plW~yLd~l 239 (401) T PRK11622 160 FIYDSARVPQPPQSPAELLEWAKANPGRFTYPRPPDFTGTAFLKQLLIELTGDPAALKQPVDDATFARVTAPLWDYLDEL 239 (401) T ss_pred EEECHHHCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 99876655999989999999999789971489999875379999999997398567518678899998738999999985 Q ss_pred H----HCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHC--CCCEEEEECCCCEEEEEEEEEEECCCCCHHHHH Q ss_conf 6----2884683084789888622586569995044999998507--997378831698088886424337899989999 Q gi|254780561|r 197 A----TKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYG--QDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQ 270 (338) Q Consensus 197 ~----~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~ 270 (338) . +.+..|..+..+..+++.+||+++.+++...........+ ....+...+.+|++..+..++|+.||+|++.|+ T Consensus 240 ~P~LWr~G~~yP~~~~~l~~L~adGEi~~~~sfnp~~a~~~I~~G~lP~t~r~~~~~~GtigN~hflAIP~NA~~kagA~ 319 (401) T PRK11622 240 HPYLWRKGKTFPASPAELDQLLADGELDLAFTFNPNEAQAAIANGELPASTRSFAFSDGTIGNTHFVAIPFNANAKAGAK 319 (401) T ss_pred CHHHHHCCCCCCCCHHHHHHHHHCCCEEEEEECCHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCEEEECCCCCHHHHH T ss_conf 77888440126899999999984798778840387778889866999974026742698216765077207999877899 Q ss_pred HHHHHHCCHHHHHHHHHC-C-CEEECCC-CCCC Q ss_conf 999996399999998850-4-5050557-6588 Q gi|254780561|r 271 EFMQFMISPSFQRILPTT-N-WMYPVVD-ISMP 300 (338) Q Consensus 271 ~Fi~flls~e~Q~~~a~~-~-~~~P~~~-~~~p 300 (338) .++|||+|||+|..-++- . +..||.. ..++ T Consensus 320 VvaNfLLSPEAQ~~K~~p~~WGd~tVLd~~kL~ 352 (401) T PRK11622 320 VVANFLLSPEAQLRKADPAVWGDPSVLDPQKLP 352 (401) T ss_pred HHHHHHCCHHHHHHHCCCCCCCCCCCCCHHHCC T ss_conf 999996399999975386424885411645389 No 12 >PRK09474 malE maltose ABC transporter periplasmic protein; Reviewed Probab=99.96 E-value=3.9e-26 Score=170.63 Aligned_cols=280 Identities=15% Similarity=0.163 Sum_probs=182.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHH Q ss_conf 90279999999999999986146742348859999788778512233889999998848879999728868999788753 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRME 80 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~ 80 (338) |||.+++++..+++++++++|+ ++++++.+||||..+. .....+++++++|||+|||+|++... +++..|+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ltvW~~~~--~~~~~l~~~~~~Fe~~~gi~V~v~~~---~~~~~kl~~a 74 (394) T PRK09474 1 IKKGLRILALSALTTLMLSASA-LAKIEEGKLVIWINGD--KGYNGLAEVGKKFEKDTGIKVTVEHP---DKLEEKFPQV 74 (394) T ss_pred CCHHHHHHHHHHHHHHHHHCCC-CCCCCCCEEEEEECCC--HHHHHHHHHHHHHHHCCCCEEEEECC---HHHHHHHHHH T ss_conf 9415889999999999973643-2456678699995797--05699999999974540977999710---2689999974 Q ss_pred H-CCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHH--------CCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCH Q ss_conf 1-2664101101333789998875212613677000--------210001243211532431000110011124587645 Q gi|254780561|r 81 G-ENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQ--------LKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNF 151 (338) Q Consensus 81 ~-~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~ 151 (338) . .+..+||++ .+.+.+......+++.+....... .......+..|++|+...+..+.||++.++.+|+|| T Consensus 75 ~~~G~~PDv~~-~~~~~~~~~a~~G~l~pl~~~~~~~~~~~~~~~~a~tydGk~YgvP~~~~t~~L~YNKdl~~~pP~Tw 153 (394) T PRK09474 75 AATGDGPDIIF-WAHDRFGGYAQSGLLAEVTPSKAFQDKLVPFTWDAVRYNGKLIGYPIAVEALSLIYNKDLVPNPPKTW 153 (394) T ss_pred HCCCCCCCEEE-ECCHHHHHHHHCCCCCCCCCCHHHHHHCCHHHHHCEEECCEEEEECCCCCCEEEEEEHHHCCCCCCCH T ss_conf 20799995899-88476999997898663885644453046334524688988888630122306887400036899999 Q ss_pred HHHCCCC----CCCEEEEEECCCCCCHHHHHHHHHH--------------------HCCCCHHHHHHHHHHCCCEEE-CC Q ss_conf 6521200----0682898504765303678999987--------------------177313789877762884683-08 Q gi|254780561|r 152 DELINST----KPWKIIYQDPRTSTLGLGLLLWIQK--------------------IYGDNSAQVWKKIATKTATVT-KG 206 (338) Q Consensus 152 ~~l~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~-~~ 206 (338) +||.... ..|+..+.......+..+.+..... ....++.+++.++.+....-. .+ T Consensus 154 dEl~~~ak~l~~~g~~~~~~~~~~~~~~~~~~~~~Gg~~f~~~~gg~d~~~~~~ns~~~v~al~~~~~l~~~g~~~~~~~ 233 (394) T PRK09474 154 EEIPALDKELKKKGKSAIMWNLQEPYFTWPLIAADGGYAFKKENGGYDVKDVGVNNAGAKAGLQFLVDLIKNKHMNADTD 233 (394) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 99999999987538640452266418778999807884434378875554354489999999999999997599898888 Q ss_pred CHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECC--CC----EEEEEEEEEEECCCCCHHHHHHHHH-HHCCH Q ss_conf 47898886225865699950449999985079973788316--98----0888864243378999899999999-96399 Q gi|254780561|r 207 WTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFS--EG----HYLQIEVAAQLVRSKQPQLAQEFMQ-FMISP 279 (338) Q Consensus 207 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--eG----~~~~~~~~~i~k~a~n~~~A~~Fi~-flls~ 279 (338) .......+.+|.+.|.+.+.+. .....+.+.++.+...+ +| ++....+++|+++++|+++|++||+ |++|+ T Consensus 234 ~~~~~~~F~~Gk~am~i~GpW~--~~~~~~~~~~~gva~lP~~~g~~~~~~~G~~~~~v~~~s~nk~~A~~Fi~~~l~s~ 311 (394) T PRK09474 234 YSVAEAAFNKGETAMTINGPWA--WSNIDKSGVNYGVTVLPTFNGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTD 311 (394) T ss_pred CCHHHHHHHCCCEEEEEECCHH--HHHHHHCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHCCH T ss_conf 2167777755863499844154--54587649874088588889998875255568999899999999999999996399 Q ss_pred HHHHHHHHCC Q ss_conf 9999988504 Q gi|254780561|r 280 SFQRILPTTN 289 (338) Q Consensus 280 e~Q~~~a~~~ 289 (338) |.|+.+.... T Consensus 312 e~~~~~~~~~ 321 (394) T PRK09474 312 EGLKAVNKDK 321 (394) T ss_pred HHHHHHHHCC T ss_conf 9999986238 No 13 >COG2182 MalE Maltose-binding periplasmic proteins/domains [Carbohydrate transport and metabolism] Probab=99.88 E-value=5.9e-21 Score=140.38 Aligned_cols=292 Identities=18% Similarity=0.199 Sum_probs=183.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC--------CCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHH Q ss_conf 9027999999999999998614674--------23488599997887785122338899999988488799997288689 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG--------LPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVA 72 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~ 72 (338) ||+++.+.++.+++++.+..|+.++ ..++++||||..+ ......+..+.++||+++||+|+++..+.. + T Consensus 4 ~~~~~~~~~~av~~l~~l~a~~~~~~~~~~~~~~~~~~~ltvW~~~--~~~~~~i~~~~~kfek~~gi~V~i~~~~~~-d 80 (420) T COG2182 4 LKKVLILAVVAVLALSALAACTSSSSTTKSLSSTIAEKKLTVWVDG--EKEVDGIKEAAAKFEKETGIKVKIVEEDYD-D 80 (420) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEECC--CHHHHHHHHHHHHHHHHHCCEEEEEECCCC-C T ss_conf 7999999999999999887506777777665577678748997578--306799999999999987975899946750-4 Q ss_pred HHHHHHH-HHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHC--------CCCCCCCCCCCCCEEEEEEHHHHCCCC Q ss_conf 9978875-3126641011013337899988752126136770002--------100012432115324310001100111 Q gi|254780561|r 73 LLNKLRM-EGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQL--------KLPIKWDDDIFVPYDYGYLAFIYDKRQ 143 (338) Q Consensus 73 ~~~~~~a-~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~y~~~~ 143 (338) ...++.. ...+..+|++. ...+++..+...+++.+........ ......+..|++|....+..+.||++. T Consensus 81 ~~~~~~~~~paGqgpDi~~-~~hD~iG~~a~~G~l~~i~~~~~~~~~~~~~~~~a~~y~GkiYGlP~~~Et~~L~YNKdl 159 (420) T COG2182 81 LLEKFLLAAPAGQGPDIFV-WAHDRIGGLAEAGLLLEIDPYATKYTDFYPAALVAATYKGKLYGLPQAVETLALYYNKDL 159 (420) T ss_pred HHHHHHCCCCCCCCCCEEE-ECCHHHHHHHHCCCCCCCCCCHHHHHCCCHHHHHHHHCCCEEEECCHHHHHHHHHEECCC T ss_conf 8888601377788897798-561367799867810116763444404446666653238878725566645653010001 Q ss_pred CCCCCCCHHHHCCCCCCCE------E--EEEECCCCCCHHHHHHHH-------------------HHHCCCCHHHHHHHH Q ss_conf 2458764565212000682------8--985047653036789999-------------------871773137898777 Q gi|254780561|r 144 ITQPPKNFDELINSTKPWK------I--IYQDPRTSTLGLGLLLWI-------------------QKIYGDNSAQVWKKI 196 (338) Q Consensus 144 ~~~~p~s~~~l~~~~~~~~------~--~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~ 196 (338) ++.+|+||+||.....+.+ . .+.+....++...+.... ...+...+..++... T Consensus 160 v~~~P~T~~el~~~ak~~~~~~~g~~~~~~~~~~~~Y~~y~~~~~~GgyvF~~~~g~~~~~d~gl~n~G~i~g~~~~~~~ 239 (420) T COG2182 160 VPEPPKTWEELEALAKEAKDKGWGKSHGFLADLTNFYYSYGLIAGYGGYVFGKENGKYDPKDIGLANAGAIEGANFLKSW 239 (420) T ss_pred CCCCCCCHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCEEECCCCCCCCHHHCCCCCCCCHHHHHHHHHH T ss_conf 47884259999999998642457723332012242576400111478578357777634443167663506789999999 Q ss_pred HHCCCEEECCC-HHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEE-CCC--C----EEEEEEEEEEECCCCCHHH Q ss_conf 62884683084-78988862258656999504499999850799737883-169--8----0888864243378999899 Q gi|254780561|r 197 ATKTATVTKGW-TESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAAL-FSE--G----HYLQIEVAAQLVRSKQPQL 268 (338) Q Consensus 197 ~~~~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~e--G----~~~~~~~~~i~k~a~n~~~ 268 (338) ......-.... .-...++..|+..+.+.+.|+..... +.+.++.+.. |.- | .+..+-+++|...++|+++ T Consensus 240 ~~~g~~~~~~~~~~~~slF~~G~aa~~i~GPW~~~~~~--~~g~n~GvaplP~~~~g~~~~pf~Gv~~~~Vsa~skn~~~ 317 (420) T COG2182 240 YKKGLIPEDVAGDFAQSLFTEGKAAAIINGPWSISAYK--DAGINYGVAPLPTLNNGKKPKPFSGVKGYVVSAASKNKEA 317 (420) T ss_pred HHCCCCCCCCCCHHHHHHHHCCCCEEEECCCCCCCHHH--HCCCCCCEEECCCCCCCCCCCCCCCCEEEEECCCCCCHHH T ss_conf 97278883434489999986588407864787550235--3176534541687788888676655248896488976899 Q ss_pred HHHHHHHHCCHHHHHHHHHCCCEEECCCCC Q ss_conf 999999963999999988504505055765 Q gi|254780561|r 269 AQEFMQFMISPSFQRILPTTNWMYPVVDIS 298 (338) Q Consensus 269 A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~ 298 (338) |.+|++|++.++.|..+.+..+..|++..+ T Consensus 318 A~~f~~~~t~~~~~~~~~~~~~~ipa~~~~ 347 (420) T COG2182 318 AAKFVKYFTNPKNQKLLYDEAGEIPANKAA 347 (420) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHH T ss_conf 999999866489999888755874446775 No 14 >PRK10752 sulfate transporter subunit; Provisional Probab=99.88 E-value=3.6e-20 Score=135.85 Aligned_cols=272 Identities=15% Similarity=0.168 Sum_probs=183.8 Q ss_pred CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHH----HHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHH Q ss_conf 42348859999788778512233889999998----84887999972886899978875312664101101333789998 Q gi|254780561|r 25 GLPAKPVLTVYTYNSFVADEGAGPKIKQAFER----KCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLA 100 (338) Q Consensus 25 ~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek----~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~ 100 (338) +.+++.+|+..+ |....+.++++.+.|.+ +||.+|++...-+++..+++....+ .++||+.......+..+ T Consensus 17 ~~~~~~~llnvS---Yd~tRely~~~n~~F~~~wk~~tG~~V~i~qShGGSg~QARaVi~G--l~ADVVtlal~~Did~l 91 (329) T PRK10752 17 VMAKDIQLLNVS---YDPTRELYEQYNKAFSAHWKQQTGDNVVIRQSHGGSGKQATSVING--IEADVVTLALAYDVDAI 91 (329) T ss_pred HHHCCEEEEEEC---CHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHCC--CCCCEEEECCHHHHHHH T ss_conf 540552699614---0758999999989999999883299038985578762789998478--75677875142136899 Q ss_pred HHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHH Q ss_conf 87521261367700021000124321153243100011001112458764565212000682898504765303678999 Q gi|254780561|r 101 RKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLW 180 (338) Q Consensus 101 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (338) ...+++.+..-...+.+ ..|+. .++++. -++..+...++|+||..+. -++.+++|..|+...+-.++ T Consensus 92 ~~~GlI~~dW~~~~P~n---------s~pyt-Stivfl-VRkGNPK~IkdW~DL~k~g--V~VItPNPKTSG~aRwn~LA 158 (329) T PRK10752 92 AERGRIDKNWIKRLPDN---------SAPYT-STIVFL-VRKGNPKQIHDWNDLIKPG--VSVITPNPKSSGGARWNYLA 158 (329) T ss_pred HHCCCCCCCHHHHCCCC---------CCCEE-EEEEEE-ECCCCCCCCCCHHHHCCCC--CEEECCCCCCCCHHHHHHHH T ss_conf 86799991456548899---------96522-379999-7189956687746746489--78976899887317799999 Q ss_pred HH----HH-CCC--CHHHHHHHHHHCCCEEECCCHHHHHHHH-CCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEE Q ss_conf 98----71-773--1378987776288468308478988862-2586569995044999998507997378831698088 Q gi|254780561|r 181 IQ----KI-YGD--NSAQVWKKIATKTATVTKGWTESYGFFL-KGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYL 252 (338) Q Consensus 181 ~~----~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~ 252 (338) .. .. +++ .+.+++.++..|+..+.++.......+. +|..|+.++|.+.........+...+.+++|..-..+ T Consensus 159 awg~a~~~~gg~e~~a~~fv~~l~~NV~vld~gaR~AT~tF~~rg~GDVLlt~ENEa~l~~~~~g~~~~eiV~P~~sila 238 (329) T PRK10752 159 AWGYALHHNNNDQAKAQDFVKALYKNVEVLDSGARGSTNTFVERGIGDVLIAWENEALLAANELGKDKFEIVTPSESILA 238 (329) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCEEEEHHHHHHHHHHHCCCCCCEEECCCCCEEC T ss_conf 99999972499999999999999836852589734677899864867557634379999998608998628789963551 Q ss_pred EEEEEEEECCCC---CHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCCCCCCHHHHCCCCCCCCCCC Q ss_conf 886424337899---9899999999963999999988504505055765882134122322234789 Q gi|254780561|r 253 QIEVAAQLVRSK---QPQLAQEFMQFMISPSFQRILPTTNWMYPVVDISMPDVYQAIRIPQKSLRFN 316 (338) Q Consensus 253 ~~~~~~i~k~a~---n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~~p~~~~~~~~~~~~~~~~ 316 (338) -+-...+-++.+ ..+.|..|++||.|||+|+++|+. +++|+++..+.+....++ +.+++.++ T Consensus 239 e~pVAvVd~~vdk~GTr~~A~ayl~fLys~eaQ~i~Ak~-~yRP~~~~v~~~~~~~FP-~i~l~ti~ 303 (329) T PRK10752 239 EPTVSVVDKVVEKKGTREVAEAYLKYLYSPEGQEIAAKN-YYRPRDAEVAKKYENAFP-KLKLFTID 303 (329) T ss_pred CCCEEEEECCCCCCCCHHHHHHHHHHHCCHHHHHHHHHH-CCCCCCHHHHHHHHCCCC-CCCEEEEE T ss_conf 597599834456668899999999986597999999980-889899789887654399-85147633 No 15 >COG0725 ModA ABC-type molybdate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=99.87 E-value=1.8e-19 Score=131.78 Aligned_cols=255 Identities=18% Similarity=0.252 Sum_probs=174.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHH Q ss_conf 90279999999999999986146742348859999788778512233889999998848879999728868999788753 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRME 80 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~ 80 (338) ||+...+...+++++.+++|.+.+.+....+++|+..+|. ...++++.+.||++||++|++. +++++++..|+.. T Consensus 1 ~~~~~~i~~~l~~~~~~~~~~~~~~~~~~~~i~VfAAaSL---~~~l~~i~~~F~~~~~~~V~~~-f~gS~~l~~qIe~- 75 (258) T COG0725 1 GKKMKKILALLLLVLLALGCAAGSAAQEAATITVFAAASL---TDALEEIAKQFEKETGVKVEVE-FGGSGALARQIEQ- 75 (258) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEHHH---HHHHHHHHHHHHHHHCCEEEEE-ECCHHHHHHHHHC- T ss_conf 9426889999999999998615654456740999981566---8999999999999879879999-6138999999975- Q ss_pred HCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCC Q ss_conf 12664101101333789998875212613677000210001243211532431000110011124587645652120006 Q gi|254780561|r 81 GENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKP 160 (338) Q Consensus 81 ~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~ 160 (338) +.+.|+++..+...+......++..+.. ...+.....++++.+. ......+|.++.+. .. T Consensus 76 --Ga~~D~fiSa~~~~~~~l~~~g~~~~~~----------------~~~fa~n~lvl~~~~~-~~~~~~~~~~l~~~-~~ 135 (258) T COG0725 76 --GAPADLFISADDAYMDKLEDKGLIYADS----------------RIVFAGNRLVLAVPKG-SKKKIESLEDLLER-PD 135 (258) T ss_pred --CCCCCEEEECCHHHHHHHHHCCCCCCCC----------------EEEEECCEEEEEEECC-CCCCCCHHHHHHCC-CC T ss_conf --9986879988888899998668856671----------------5774257489998688-74565129887308-78 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCC Q ss_conf 82898504765303678999987177313789877762884683084789888622586569995044999998507997 Q gi|254780561|r 161 WKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDD 240 (338) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 240 (338) .++++.+|...-.+......+...+......- ......+..+....+..|+.+.++.|.++... ..... T Consensus 136 ~~lai~~p~~~P~G~ya~~~l~~~g~~~~~~~-------k~v~~~~v~~~l~~V~~G~ad~g~vy~sd~~~----~~~~~ 204 (258) T COG0725 136 VRLAIGDPKTVPAGKYAKEALELLGLWYTLKD-------KLVLATNVRQALAYVETGEADAGFVYVSDALL----SKKVK 204 (258) T ss_pred CEEEECCCCCCCCHHHHHHHHHHHCHHHHCCC-------CEEECCCHHHHHHHHHCCCCCEEEEEEEHHHH----CCCCE T ss_conf 57996488769714999999997051453367-------54653657989999863787779999763340----67760 Q ss_pred EEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCE Q ss_conf 378831698088886424337899989999999996399999998850450 Q gi|254780561|r 241 YVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWM 291 (338) Q Consensus 241 ~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~ 291 (338) ....+|.+......+.++|+++++|+++|++|++|+.|+++|+.+.+.++. T Consensus 205 ~~~~~~~~~~~Pi~y~iav~~~~~~~~~A~~f~~fl~s~~a~~il~~~Gf~ 255 (258) T COG0725 205 IVGVFPEDLHSPIVYPIAVLKNAKNPELAKEFVDFLLSPEAQEILEKYGFE 255 (258) T ss_pred EEEECCCCCCCCEEEEEEEECCCCCHHHHHHHHHHHHCHHHHHHHHHCCCC T ss_conf 899735666897167899973889859999999997187779999984998 No 16 >PRK10974 glycerol-3-phosphate transporter periplasmic binding protein; Provisional Probab=99.87 E-value=4.8e-20 Score=135.07 Aligned_cols=291 Identities=12% Similarity=0.087 Sum_probs=160.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHH Q ss_conf 9027999999999999998614674234885999978877851223388999999884-887999972886899978875 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRM 79 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a 79 (338) ||.+..+++++++.++ ..+++.+..+++.|+..++. ....++.|+++|+++| +|+|+.+..++.++.++++.+ T Consensus 3 ~~~~~~~~~~~~~~~~-----~~~~a~a~~eI~fWh~~~g~-~~~~l~~lv~eFn~~~p~i~V~~~~~g~y~~~l~k~~a 76 (439) T PRK10974 3 MKSLRSTALGLALGLA-----LSGNAQAVTEIPFWHSMEGE-LGKEVDSLAQRFNASQPDYKIVPVYKGNYEQNLAAGIA 76 (439) T ss_pred HHHHHHHHHHHHHHHH-----HCCCCCCCEEEEEECCCCCC-HHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHH T ss_conf 5789999999999998-----44556797699993589981-57999999999998785918999865899999999999 Q ss_pred H-HCCCCCEEEECCCHHHHHHHHHHHCCCCCCCC------------HHHC----CCCCCCCCCCCCCEEEEEEHHHHCCC Q ss_conf 3-12664101101333789998875212613677------------0002----10001243211532431000110011 Q gi|254780561|r 80 E-GENSAADIVLGFDNNLIDLARKTGLFAKSNID------------ASQL----KLPIKWDDDIFVPYDYGYLAFIYDKR 142 (338) Q Consensus 80 ~-~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~------------~~~~----~~~~~~~~~~~~~~~~~~~~~~y~~~ 142 (338) . ..+..+||+........ .....+.+.|.... +.+. ......+..|++|+...+.++.||++ T Consensus 77 a~~ag~~Pdv~~~~~~~~~-~~~~~~~l~pl~~~~~~~~~~~d~~~~~p~~~~~~~~~~~Gk~yglP~~~s~~vlyYNkd 155 (439) T PRK10974 77 AFRSGNAPAILQVYEVGTA-TMMASKAIKPVYDVFKDAGIPFDESQFVPTVAGYYSDAKTGHLLSQPFNSSTPVLYYNKD 155 (439) T ss_pred HHHCCCCCEEEEECCCCHH-HHHHCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCEEEEEEECCCCEEEEEEHH T ss_conf 8726999829998743099-999769977717989725878785653278887612577986999830377659998589 Q ss_pred CCC-------CCCCCHHHHCCCCCCCE-----EEEEECCCCCCH-HHHHH----HHHHH-CC------------C---CH Q ss_conf 124-------58764565212000682-----898504765303-67899----99871-77------------3---13 Q gi|254780561|r 143 QIT-------QPPKNFDELINSTKPWK-----IIYQDPRTSTLG-LGLLL----WIQKI-YG------------D---NS 189 (338) Q Consensus 143 ~~~-------~~p~s~~~l~~~~~~~~-----~~~~~~~~~~~~-~~~~~----~~~~~-~~------------~---~~ 189 (338) .+. .+|+||++|.+...+-+ ..+......... ..+.. ..... ++ . +. T Consensus 156 lf~kAGlDp~~pP~TWdEl~~~a~klk~~g~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~g~d~~~~~~~fnsp~~v~a 235 (439) T PRK10974 156 AFKKAGLDPEQPPKTWQDLAADAAKLKAAGMKCGYASGWQGWIQLENFSAWHGLPFASKNNGFDGTDAVLEFNKPEQVKH 235 (439) T ss_pred HHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCEEHHHHHHHCCHHHHHCCCCCCCCCCEEEECCHHHHHH T ss_conf 99976999889999999999999999855887640056653001378998657255306887357664487289999999 Q ss_pred HHHHHHHHHCCC-EEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECC-----CC----EEEEEEEEEE Q ss_conf 789877762884-6830847898886225865699950449999985079973788316-----98----0888864243 Q gi|254780561|r 190 AQVWKKIATKTA-TVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFS-----EG----HYLQIEVAAQ 259 (338) Q Consensus 190 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----eG----~~~~~~~~~i 259 (338) .+++.++.+... .......+....+..|.+.|.+...+..... ......++.+...+ .| ++....++.| T Consensus 236 l~~l~~l~~~g~~~~~~~~~~~~~~F~~G~~am~~~~~~~~~~~-~~~~~~~~gva~~P~~~~~~~~~~~~~~gG~~l~v 314 (439) T PRK10974 236 IALLEEMNKKGDFSYVGRKDESTEKFYNGDCAITTASSGSLANI-RKYAKFNYGVGMMPYDADAKGAPQNAIIGGASLWV 314 (439) T ss_pred HHHHHHHHHCCCCCCCCCCCHHHHHHHCCCEEEEEECHHHHHHH-HHHCCCCCEEEECCCCCCCCCCCCCCCCCCCEEEE T ss_conf 99999988627636667763465666368747997161766668-76178760478678787767888655347713898 Q ss_pred ECC--CCCHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCCCC Q ss_conf 378--9998999999999639999999885045050557658 Q gi|254780561|r 260 LVR--SKQPQLAQEFMQFMISPSFQRILPTTNWMYPVVDISM 299 (338) Q Consensus 260 ~k~--a~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~~ 299 (338) +++ ..++++|++||+||+|||.|..+++.+++.|+++.+. T Consensus 315 ~~~~~~~~~~aA~~Fikflts~e~~~~~~~~tG~lP~~~~a~ 356 (439) T PRK10974 315 MQGKDKETYKGVAKFLDFLAKPENAAEWHQKTGYLPITTAAY 356 (439) T ss_pred ECCCCHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCHHHH T ss_conf 668998999999999998459899999998539877756664 No 17 >PRK10677 modA molybdate transporter periplasmic protein; Provisional Probab=99.86 E-value=5.4e-19 Score=128.96 Aligned_cols=248 Identities=13% Similarity=0.176 Sum_probs=161.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHH Q ss_conf 02799999999999999861467423488599997887785122338899999988488799997288689997887531 Q gi|254780561|r 2 KKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEG 81 (338) Q Consensus 2 Kk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~ 81 (338) |++.++.+++++.. ..++++.+++.+|+|+...+. ...+++|.+.||++||++|++ .+++++.+..+++ T Consensus 3 ~~~~~~~~~~~~~~----~~~~~~~a~~~~l~V~aAAsL---~~~~~ei~~~Fek~~g~~v~~-~fgsSg~L~~QI~--- 71 (257) T PRK10677 3 RKWLNLFAGAALSF----AVAGNALADEGKITVFAAASL---TNAMQDIATQYKKEKGVDVVS-SFASSSTLARQIE--- 71 (257) T ss_pred HHHHHHHHHHHHHH----CCCCCCCCCCCEEEEEEECCC---HHHHHHHHHHHHHHHCCEEEE-EECCHHHHHHHHH--- T ss_conf 28999999999987----235533256885999992576---899999999998841984999-9664899999998--- Q ss_pred CCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCC-----CCCCCCHHHHCC Q ss_conf 266410110133378999887521261367700021000124321153243100011001112-----458764565212 Q gi|254780561|r 82 ENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQI-----TQPPKNFDELIN 156 (338) Q Consensus 82 ~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~-----~~~p~s~~~l~~ 156 (338) .+.++|+++..+...+..+.+.+++...... .+.....+++..+... .....+|..+. T Consensus 72 ~GAp~DvF~sAd~~~~~~l~~~~~i~~~~~~----------------~~a~n~lvli~pk~~~~~~~~i~~~~d~~~ll- 134 (257) T PRK10677 72 AGAPADLFISADQKWMDYAVDKKAIDTATRQ----------------TLLGNSLVVVAPKASEQKDFTIDKKTDWTSLL- 134 (257) T ss_pred CCCCCCEEEECCHHHHHHHHHCCCCCCCCEE----------------EEECCEEEEEEECCCCCCCCCCCCCCCHHHHC- T ss_conf 2899658997875778999867785677645----------------62147499998468777864434332377742- Q ss_pred CCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHH Q ss_conf 00068289850476530367899998717731378987776288468308478988862258656999504499999850 Q gi|254780561|r 157 STKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNY 236 (338) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 236 (338) ..+|+++.+|...-++......+...+.. ..+..+ ..+..+..+....+.+|+++.++.|.++.. .. T Consensus 135 --~~~riaia~P~~aP~G~ya~~~L~~~gl~------~~l~~k-lv~~~nV~~~l~~v~~G~adaG~Vy~Sda~----~~ 201 (257) T PRK10677 135 --NGGRLAVGDPDHVPAGIYAKEALQKLGAW------DTLSPK-LAPAEDVRGALALVERNEAPLGIVYGSDAV----AS 201 (257) T ss_pred --CCCCEEEECCCCCCCHHHHHHHHHHCCCH------HHHCCC-EEECCCHHHHHHHHHCCCCCEEEEECCHHH----CC T ss_conf --58808985888886579999999976766------550654-132576999999998299878999700022----26 Q ss_pred CCCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCE Q ss_conf 7997378831698088886424337899989999999996399999998850450 Q gi|254780561|r 237 GQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWM 291 (338) Q Consensus 237 ~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~ 291 (338) ........+|++-.....+.++++++++|+ .|++|++||+|||+|+.+.+.+|. T Consensus 202 ~~~~~~~~~p~~~~~pi~y~~ai~~~~~~~-~a~~F~~fl~S~~a~~Il~~~GFt 255 (257) T PRK10677 202 KGVKVVATFPEDSHKKVEYPMAIVEGHNNA-TVSAFYDYLKGPQAAEIFKRYGFT 255 (257) T ss_pred CCCEEEEECCHHHCCCCEEEEEEECCCCCH-HHHHHHHHHCCHHHHHHHHHHCCC T ss_conf 886499987745389614558997599999-999999997799999999982983 No 18 >PRK10852 thiosulfate transporter subunit; Provisional Probab=99.86 E-value=2.9e-19 Score=130.58 Aligned_cols=292 Identities=14% Similarity=0.163 Sum_probs=186.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHH----HHH-CCEEEEEECCCHHHHHH Q ss_conf 9027999999999999998614674234885999978877851223388999999----884-88799997288689997 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFE----RKC-NCELKLIGLSDGVALLN 75 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fe----k~~-gikV~~~~~~~~~~~~~ 75 (338) |++....-..+.++.+++..+ +++..+|.-.+ |....+.++++.+.|. +++ |-+|++...-+++..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g----~a~~~~LlnVS---Yd~tRely~~~n~~F~~~Wk~~~~g~~V~i~qShGGSg~QA 73 (338) T PRK10852 1 MAVNLLKKNSLALVASLLLAG----HAQATELLNSS---YDVSRELFAALNPPFEQQWAKDNGGDKLTIKQSHAGSSKQA 73 (338) T ss_pred CCHHHHHHHHHHHHHHHHHCC----CCCCCEEEECC---CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHH T ss_conf 905678662189999999737----87772688513---05479999999899999999756896189985478760578 Q ss_pred HHHHHHCCCCCEEEECCCHHHHHHHHHHH-CCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHH Q ss_conf 88753126641011013337899988752-12613677000210001243211532431000110011124587645652 Q gi|254780561|r 76 KLRMEGENSAADIVLGFDNNLIDLARKTG-LFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDEL 154 (338) Q Consensus 76 ~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l 154 (338) +.... +.++||+.......+..+.+.+ |+.+..-...+.+ ..|+. .++++.. ++..+...++|+|| T Consensus 74 RaVi~--GL~ADVVtlal~~Did~l~~~g~LI~~dW~~~lP~n---------s~pyt-StivFlV-RkGNPK~IkdW~DL 140 (338) T PRK10852 74 LAILQ--GLKADVVTYNQVTDVQILHDKGKLIPADWQSRLPNN---------SSPFY-STMGFLV-RKGNPKNIHDWNDL 140 (338) T ss_pred HHHHC--CCCCCEEEEECCCCHHHHHHCCCCCCCCHHHHCCCC---------CCCEE-EEEEEEE-ECCCCCCCCCHHHH T ss_conf 99847--875446875254324889862786780546538999---------96446-6799998-07995668775774 Q ss_pred CCCCCCCEEEEEECCCCCCHHHHHHHHH----HH-CCC--CHHHHHHHHHHCCCEEECCCHHHHHHH-HCCCCCEEEEEC Q ss_conf 1200068289850476530367899998----71-773--137898777628846830847898886-225865699950 Q gi|254780561|r 155 INSTKPWKIIYQDPRTSTLGLGLLLWIQ----KI-YGD--NSAQVWKKIATKTATVTKGWTESYGFF-LKGESDFVLSYS 226 (338) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~~ 226 (338) ..+. -++.+++|..|+...+-.++.. .. +++ ...++..++..|+..+.++.......+ .+|..|+.++|. T Consensus 141 ~k~g--V~VItPNPKTSGgaRwnyLAAwg~a~~~~gg~e~~a~~fv~~l~~NVpVld~gaR~AT~tF~~rg~GDVLl~~E 218 (338) T PRK10852 141 VRSD--VKLIFPNPKTSGNARYTYLAAWGAADKADGGDKAKTEQFMTQFLKNVEVFDTGGRGATTTFAERGLGDVLISFE 218 (338) T ss_pred CCCC--CEEECCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCEEEEHH T ss_conf 6489--78975799888407699999999999864999899999999998368515897167889998738675675216 Q ss_pred CHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCC---CHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCCCCCCHH Q ss_conf 44999998507997378831698088886424337899---989999999996399999998850450505576588213 Q gi|254780561|r 227 TSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSK---QPQLAQEFMQFMISPSFQRILPTTNWMYPVVDISMPDVY 303 (338) Q Consensus 227 ~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~---n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~~p~~~ 303 (338) +.........+...+.+++|..-..+-+-...+-|+.. .++.|..|++||.|||+|+++|+. +++|+++..+.+.. T Consensus 219 NEa~l~~~~~g~~~~eiV~Ps~sIlae~pVAvVDk~vdk~GTr~vA~AyL~yLyS~eaQ~i~Ak~-~yRP~~~~v~~~~~ 297 (338) T PRK10852 219 SEVNNIRKQYEAQGFEVVIPKTNILAEFPVAWVDKNVQANGTEKAAKAYLNWLYSPQAQTIITDY-YYRVNNPEVMDKLK 297 (338) T ss_pred HHHHHHHHHHCCCCCEEECCCCCEEECCCEEEEEECCCCCCCHHHHHHHHHHHCCHHHHHHHHHH-CCCCCCHHHHHHHH T ss_conf 78999999608997149889975660697489860123458899999999986598999999980-89989978987764 Q ss_pred HHCCCCCCCCCCC Q ss_conf 4122322234789 Q gi|254780561|r 304 QAIRIPQKSLRFN 316 (338) Q Consensus 304 ~~~~~~~~~~~~~ 316 (338) ..++ +.+++.++ T Consensus 298 ~~FP-~~~lfti~ 309 (338) T PRK10852 298 DKFP-QTELFRVE 309 (338) T ss_pred CCCC-CCCEEEEE T ss_conf 5499-85147653 No 19 >COG4134 ABC-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=99.86 E-value=5e-20 Score=134.98 Aligned_cols=327 Identities=15% Similarity=0.196 Sum_probs=201.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC----CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHH Q ss_conf 902799999999999999861467----4234885999978877851223388999999884887999972886899978 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD----GLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNK 76 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~----~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~ 76 (338) |||....++++++++++.+-.+.- +.+...++..-+||+-..--...+.....-|.+|||+|.++... +.+...+ T Consensus 1 M~~~~~~L~~~~la~~lat~a~~w~e~~akA~~q~vyfnawgg~~~~ns~~~W~~~~~~~kYgi~v~~~~l~-~~~a~~~ 79 (384) T COG4134 1 MKRHLGTLAALLLAFSLATHASDWDEVKAKAKGQTVYFNAWGGSTAINSYADWVSDEMKAKYGIKVAIVRLD-AADAVKR 79 (384) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCEEEEEECCCCCHHCCCCCCCCHHHCCCCEEEEEECCC-HHHHHHH T ss_conf 962378999999974566523678999986357736998505631000026532202320025178751241-8888888 Q ss_pred HHH---HHCCC--CCEEEECCCHHHHHHHHHHHCCCCCC----CCHHH------CCCCCCCCCCCCCCEEEEEEHHHHCC Q ss_conf 875---31266--41011013337899988752126136----77000------21000124321153243100011001 Q gi|254780561|r 77 LRM---EGENS--AADIVLGFDNNLIDLARKTGLFAKSN----IDASQ------LKLPIKWDDDIFVPYDYGYLAFIYDK 141 (338) Q Consensus 77 ~~a---~~~~~--~~Dv~~~~~~~~~~~~~~~~l~~~~~----~~~~~------~~~~~~~~~~~~~~~~~~~~~~~y~~ 141 (338) +.+ .+.+. ..|+++.....+-. +...+++.... +.+.. ......+...+..++.....++.|++ T Consensus 80 ~~a~~AAG~~e~g~VDl~~~~g~n~p~-lk~~~~Ly~~~~~~LPnwr~~d~ql~~tdf~~pt~Gl~~~~~~~q~~f~yd~ 158 (384) T COG4134 80 LIAELAAGRDEGGSVDLLWINGENFPT-LKEAGLLYTPWAEDLPNWRYVDHQLPRTDFSVPTEGLESPWGGNQLVFIYDR 158 (384) T ss_pred HHHHHHCCCCCCCCEEEEEECCCCCHH-HHHCCCCCCCHHHHCCCCHHHCHHCCCCCCCCCCCCCCCCCCCCCCEEEECC T ss_conf 999986425568726899835754133-4316620251232186601203003533257887762144027752388723 Q ss_pred CCCCCCCCCHHHHCC--CCCCCEEEEEECCCCCCHHHHHHHHHH-------------------HCCCCHHHHHHHHHHCC Q ss_conf 112458764565212--000682898504765303678999987-------------------17731378987776288 Q gi|254780561|r 142 RQITQPPKNFDELIN--STKPWKIIYQDPRTSTLGLGLLLWIQK-------------------IYGDNSAQVWKKIATKT 200 (338) Q Consensus 142 ~~~~~~p~s~~~l~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~ 200 (338) ++....|.|.++|.. .+.++|+.++.|..++.+..+...... ......|+++..+.... T Consensus 159 ~k~~~~P~s~eeLL~wA~aNP~rf~Y~~P~~gg~G~~Fl~~~~i~Lt~d~~~l~~avDpat~~k~~~p~WqYl~~Lh~~i 238 (384) T COG4134 159 DKVPQVPQSFEELLVWAKANPKRFGYPRPPNGGTGTAFLKQLLIMLTGDAAALQQAVDPATFAKVWAPLWQYLDELHPYI 238 (384) T ss_pred CCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHCCCCCHHHHHHHCCHHHHHHHHHHHHH T ss_conf 44788773999999999859675356899988860769998899854855765078880777762108999999861556 Q ss_pred CEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHC--CCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCC Q ss_conf 4683084789888622586569995044999998507--99737883169808888642433789998999999999639 Q gi|254780561|r 201 ATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYG--QDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMIS 278 (338) Q Consensus 201 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls 278 (338) -...++..++.+.+..|+..+.+.|.. .....+.-+ .....+..++.|.+.....++|++|++|+..|..|+|||+| T Consensus 239 ~r~~kg~Adml~lL~dG~L~l~~t~~~-~~~s~~~tG~lp~s~~~~~~~~G~vgn~~f~aIPaNa~~~A~alvl~n~lls 317 (384) T COG4134 239 WRTGKGPADMLQLLNDGTLYLTLTFPD-HASSAIATGDLPASARSFALEKGMVGNGHFMAIPANANAKAAALVLANFLLS 317 (384) T ss_pred HHCCCCHHHHHHHHCCCCEEEEECCHH-HHHCCHHCCCCCHHHHHHHHCCCCCCCCCEEEECCCCCCCHHHHHHHHHHCC T ss_conf 013898789998741784788604606-6403022167866766421016653675169822799970699999988639 Q ss_pred HHHHH-HHHHCCCEEE--CCCCCCCCHHHHC---CCC--CCCC--CCCHHHHHHHHHHHHHH Q ss_conf 99999-9885045050--5576588213412---232--2234--78989999999999999 Q gi|254780561|r 279 PSFQR-ILPTTNWMYP--VVDISMPDVYQAI---RIP--QKSL--RFNAEETTRYRNQWISS 330 (338) Q Consensus 279 ~e~Q~-~~a~~~~~~P--~~~~~~p~~~~~~---~~~--~~~~--~~~~~~~~~~~~~~~~~ 330 (338) ||.|. .+-...+.-| ..+..+|+..+.. ..| ..+. .+.+....+...+|.++ T Consensus 318 ~EaQlrk~d~~v~~~P~~l~pq~lpda~qe~~~~~~p~~~~P~l~~php~w~~a~~~ew~~~ 379 (384) T COG4134 318 PEAQLRKLDPAVWGDPSVLDPQLLPDAQQEFLQSRMPQDLPPVLAEPHPAWMMALEQEWARR 379 (384) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 88887651520148964258001746675543204754568501478606899999999987 No 20 >PRK04168 hypothetical protein; Provisional Probab=99.85 E-value=1.5e-18 Score=126.32 Aligned_cols=263 Identities=17% Similarity=0.221 Sum_probs=154.8 Q ss_pred HHHHHHHHHHHHHHHCCC-CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCC Q ss_conf 999999999999861467-4234885999978877851223388999999884-88799997288689997887531266 Q gi|254780561|r 7 IVVGIMMITGVISYCTLD-GLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENS 84 (338) Q Consensus 7 ~~~~~~~~~~~~~~~~~~-~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~ 84 (338) ++..++++..++++|.+. .+..+.+|+|+..+|.. ..++++.++||++| |++|+....|| ..+..++ .+ .+. T Consensus 9 ~~~~~~~~~~~~~g~~~~~~~~~~~~l~Vf~AGSL~---~pf~ei~~~Fe~~~p~v~v~~e~~GS-~~~ar~I-te-~g~ 82 (336) T PRK04168 9 VIVILLLLALVFLGCVNTSEAEPPGKLKIFHAGSLS---VPFEEYEKEFEAYYPNVDVQREAGGS-VACVRKI-TE-LGK 82 (336) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCEEEEEEECCCCH---HHHHHHHHHHHHHCCCCEEEEEECCH-HHHHHHH-HH-CCC T ss_conf 379999999997055577777886159999755258---99999999999878996599984738-9999999-95-799 Q ss_pred CCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCC-CCCC--CCCCHHHHCCCCCCC Q ss_conf 4101101333789998875212613677000210001243211532431000110011-1245--876456521200068 Q gi|254780561|r 85 AADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKR-QITQ--PPKNFDELINSTKPW 161 (338) Q Consensus 85 ~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~-~~~~--~p~s~~~l~~~~~~~ 161 (338) .+||+...+...+..+...+. ......+.....+++|+.+ ++.+ ....|.+.+.. ..- T Consensus 83 ~aDVfaSAD~~~i~~ll~p~~------------------adw~v~FA~N~lViayt~~Sk~a~ei~~~nWyeiL~~-~~V 143 (336) T PRK04168 83 PADILASADYTLIPKMMIPEY------------------ADWYVMFATNEIVLAYTDKSKYADEINSDNWYEILQR-PGV 143 (336) T ss_pred CCCEEEECCHHHHHHHHCCCC------------------CCCEEEECCCEEEEEECCCCCCCCCCCCCCHHHHHCC-CCC T ss_conf 988899898578788746443------------------3410121155389997688821100573318898577-985 Q ss_pred EEEEEECCCCCCHHHHHHHHH---HHCCCCH-HHHHHHH-------------------HHCCCEEECCCHHHHHHHHCCC Q ss_conf 289850476530367899998---7177313-7898777-------------------6288468308478988862258 Q gi|254780561|r 162 KIIYQDPRTSTLGLGLLLWIQ---KIYGDNS-AQVWKKI-------------------ATKTATVTKGWTESYGFFLKGE 218 (338) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~g~ 218 (338) +++..+|...-.+.-..+.+. ..+...+ .+.+... ......+.+...+....+..|+ T Consensus 144 k~g~sdP~~DP~Gyrt~~~~~LAe~~Y~~pgl~~~l~~~~~n~~~~~~~i~~p~~~~~n~~k~~~rp~e~~Ll~~lesG~ 223 (336) T PRK04168 144 KWGFSDPNDDPCGYRSLMVLQLAELYYNDPTIYDKLVLKNTNFYVNGTYIRSPKEILVNRPKIFVRPKEVELLGLLETGA 223 (336) T ss_pred EEEECCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCCCCCCCCCCHHHHCCCCCCCEECCCHHHHHHHHHCCC T ss_conf 79837998898608999999999986068417888864035533454223560431145543110552677777875588 Q ss_pred CCEEEEECCHHHHHHHHHC----------------CCCEEEEECCC-CEE---EEEEEEEEECCCCCHHHHHHHHHHHCC Q ss_conf 6569995044999998507----------------99737883169-808---888642433789998999999999639 Q gi|254780561|r 219 SDFVLSYSTSPGFYLLNYG----------------QDDYVAALFSE-GHY---LQIEVAAQLVRSKQPQLAQEFMQFMIS 278 (338) Q Consensus 219 ~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~e-G~~---~~~~~~~i~k~a~n~~~A~~Fi~flls 278 (338) .|.++.|.+....+.+..- -..+.+..... .+. ...++++|+++++|+++|.+|++||+| T Consensus 224 ~Dy~f~Y~S~A~q~~L~yv~LP~eINL~~~~~ad~Y~~vsv~v~~~~~t~~g~PIvY~iTV~~~A~n~e~A~~FV~flls 303 (336) T PRK04168 224 MDYLFIYKSVAVQHNLKYIELPDEINLGDYSYADFYKKVSVTLTGKKPTITAKPIVYGITVPKNAPNREAAIEFLKYLLD 303 (336) T ss_pred CCEEEEEEEHHHHCCCCCCCCCHHHCCCCCCCCCCCCEEEEEECCCCCCEECCCEEEEEECCCCCCCHHHHHHHHHHHHC T ss_conf 64698762448676997003833314678110020105899971787512147668987646899998999999999848 Q ss_pred HHHHHHHHHCCCEEECC Q ss_conf 99999988504505055 Q gi|254780561|r 279 PSFQRILPTTNWMYPVV 295 (338) Q Consensus 279 ~e~Q~~~a~~~~~~P~~ 295 (338) +++|+.+.+.+. -|.. T Consensus 304 ~~Gq~il~~~G~-~p~~ 319 (336) T PRK04168 304 EGGPKIFENNGQ-PPIV 319 (336) T ss_pred CHHHHHHHHCCC-CCCC T ss_conf 025899997599-9874 No 21 >COG1613 Sbp ABC-type sulfate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=99.84 E-value=2.1e-18 Score=125.52 Aligned_cols=281 Identities=17% Similarity=0.197 Sum_probs=180.4 Q ss_pred HHHHHHCCC---CCCCCCEEEEEECCCCCCCHHHHHHHHHHHH----HHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEE Q ss_conf 999861467---4234885999978877851223388999999----884887999972886899978875312664101 Q gi|254780561|r 16 GVISYCTLD---GLPAKPVLTVYTYNSFVADEGAGPKIKQAFE----RKCNCELKLIGLSDGVALLNKLRMEGENSAADI 88 (338) Q Consensus 16 ~~~~~~~~~---~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fe----k~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv 88 (338) .++++|+.. +.+.+.+|+..++ ....+.++++.++|. +++|-.|.+....+++.-+++... .+.++|| T Consensus 21 ~~~~~~~~~~~~a~~~~~tLlnVSY---d~tre~y~~~n~aF~~~WkaetG~~Vti~qShGGSgkQaRsVi--DGl~ADV 95 (348) T COG1613 21 GALGACSDVVRAAPAKDTTLLNVSY---DPTRELYKEYNPAFAKYWKAETGETVTIQQSHGGSGKQARSVI--DGLEADV 95 (348) T ss_pred HHHCCCCCCCCCCCCCCCEEEEEEC---CCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHC--CCCCCCE T ss_conf 5412566565554556725888521---5658889998899999988731983599965787620357660--6753324 Q ss_pred EECCCHHHHHHHHHHH-CCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEE Q ss_conf 1013337899988752-126136770002100012432115324310001100111245876456521200068289850 Q gi|254780561|r 89 VLGFDNNLIDLARKTG-LFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQD 167 (338) Q Consensus 89 ~~~~~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~ 167 (338) +-..-...+....+.+ ++++....-.+.+ ..||. .++++. -+...+...++|.||..+.- .+.+++ T Consensus 96 Vtlal~~Did~i~~~g~li~~dW~~rlp~n---------s~Pyt-StivFl-VRkGNPK~I~DW~DL~k~gV--~VItpN 162 (348) T COG1613 96 VTLALAYDIDAIAKAGGLIDKDWQKRLPNN---------SAPYT-STIVFL-VRKGNPKQIRDWDDLVKPGV--QVITPN 162 (348) T ss_pred EEEEHHHHHHHHHHHCCCCCCCHHHHCCCC---------CCCCC-CEEEEE-EECCCCCCCCCHHHHCCCCC--EEECCC T ss_conf 664113118999972687894657648699---------87733-069999-93699655675577465886--797689 Q ss_pred CCCCCCHHHHHHHHH----HH-CCC--CHHHHHHHHHHCCCEEECCCHHHHHHHHC-CCCCEEEEECCHHHHHHHHHCCC Q ss_conf 476530367899998----71-773--13789877762884683084789888622-58656999504499999850799 Q gi|254780561|r 168 PRTSTLGLGLLLWIQ----KI-YGD--NSAQVWKKIATKTATVTKGWTESYGFFLK-GESDFVLSYSTSPGFYLLNYGQD 239 (338) Q Consensus 168 ~~~~~~~~~~~~~~~----~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~ 239 (338) |..|+...+-+++.. .. +++ .+.++..++..|+....++.......+.+ |..++.++|.+.........++. T Consensus 163 PKTSGgARWN~Laawa~a~~~~~gdeaka~~fV~~L~~nvpvld~gaRgAT~tF~qrgiGDVLi~wENEA~la~~e~g~~ 242 (348) T COG1613 163 PKTSGGARWNYLAAWAYALKTNGGDEAKAKDFVGKLYKNVPVLDTGARGATTTFVQRGIGDVLIAWENEALLALNELGGD 242 (348) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCEEEEECHHHHHHHHHHCCC T ss_conf 98876522788999899871479988999999999984683245885212578886386647887241888879974688 Q ss_pred CEEEEECCCCEEEEEEEEEEECCCC---CHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCCCCCCHHHHCCCCCCCCCCC Q ss_conf 7378831698088886424337899---9899999999963999999988504505055765882134122322234789 Q gi|254780561|r 240 DYVAALFSEGHYLQIEVAAQLVRSK---QPQLAQEFMQFMISPSFQRILPTTNWMYPVVDISMPDVYQAIRIPQKSLRFN 316 (338) Q Consensus 240 ~~~~~~~~eG~~~~~~~~~i~k~a~---n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~~p~~~~~~~~~~~~~~~~ 316 (338) ++.+++|+--..+.+-..+|-++.. +.+.|..|++||.|||+|.+.|+. +++|+++.++......++ ..+++.++ T Consensus 243 ~feiV~Ps~si~aEpPVAVVd~~vdkkgtr~vAeAyl~yLys~~gQ~i~Ak~-~yRP~~p~v~a~~~~~FP-~v~l~ti~ 320 (348) T COG1613 243 KFEIVTPSVSILAEPPVAVVDKNVDKKGTRKVAEAYLKYLYSPEGQEIAAKH-YYRPRDPEVAAKFADQFP-KLKLFTID 320 (348) T ss_pred CCCEECCCEEEEECCCEEEEECCCCCCCCHHHHHHHHHHHCCHHHHHHHHHH-CCCCCCHHHHHHHHHHCC-CCEEEEEH T ss_conf 7668768602660697478751123335689999999875583888999982-778898788877786489-86056531 No 22 >COG1653 UgpB ABC-type sugar transport system, periplasmic component [Carbohydrate transport and metabolism] Probab=99.82 E-value=8.1e-18 Score=122.10 Aligned_cols=297 Identities=19% Similarity=0.215 Sum_probs=160.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCC-CEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHH Q ss_conf 90279999999999999986146742348-85999978877851223388999999884-88799997288689997887 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAK-PVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLR 78 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~ 78 (338) |++..++.+.++++++++++|+...+.++ .+++++++..........+.++++||++| ||+|+++..+.+ +...++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~i~~f~~~~p~ikV~~~~~~~~-~~~~~l~ 79 (433) T COG1653 1 LKKISKLLLLLLAAALGLAACGSAAASAAGVTITFWHSWTGGEEADALEELIKEFEKENPGIKVKVVNVPGD-DYLQKLL 79 (433) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCHH-HHHHHHH T ss_conf 921799999999999999999743024557439999833787315899999999997779847999856758-8999999 Q ss_pred HHH-CCCCCEEEECCCHHHHHHHHHHHC-CCCCCCCH----H-------HCCCCCCCCCCCCCCEEEEEEHHHHCCCCCC Q ss_conf 531-266410110133378999887521-26136770----0-------0210001243211532431000110011124 Q gi|254780561|r 79 MEG-ENSAADIVLGFDNNLIDLARKTGL-FAKSNIDA----S-------QLKLPIKWDDDIFVPYDYGYLAFIYDKRQIT 145 (338) Q Consensus 79 a~~-~~~~~Dv~~~~~~~~~~~~~~~~l-~~~~~~~~----~-------~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~ 145 (338) +.- .+...|++.... .........+. +.+..... . .......++..|++|.......+.||++.+. T Consensus 80 ~~~~ag~~pDv~~~~~-~~~~~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~dG~~y~~P~~~~~~~~~ynkd~~~ 158 (433) T COG1653 80 TALASGDAPDVVQLDP-EWLAAFAKAGGLLEDLTDLINPKLDLSDFPPALLDTTTYDGKLYGVPFNSSTPALFYNKDLFK 158 (433) T ss_pred HHHHCCCCCEEEEECC-HHHHHHHHHCCCCCCCCCCCCCCHHHHHCCHHHHHHEEECCEEEEEEEECCCCEEEEEHHHHH T ss_conf 9985799981999773-768999972676566641001230244304776742468986899984324756998789999 Q ss_pred ----CC-CCCHHHHCCCCC----CC--EEEEEECCC---CCCHHHHHHHHHH-------HCC------C----CHHHHHH Q ss_conf ----58-764565212000----68--289850476---5303678999987-------177------3----1378987 Q gi|254780561|r 146 ----QP-PKNFDELINSTK----PW--KIIYQDPRT---STLGLGLLLWIQK-------IYG------D----NSAQVWK 194 (338) Q Consensus 146 ----~~-p~s~~~l~~~~~----~~--~~~~~~~~~---~~~~~~~~~~~~~-------~~~------~----~~~~~~~ 194 (338) .. |++|+++..... .+ ...+..... ......+...... ... . ++.++.. T Consensus 159 ~aG~~~pp~tw~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (433) T COG1653 159 KAGLKPPPKTWDELLAAAKKLKEKGGDVYGIAGGGGWQGEDLFFTFALFLGGAGGGFLDKDGGEAFLNDPEAVEALEFLK 238 (433) T ss_pred HCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEECCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 76999999998999999999986047866400255454323299999998505875225777610479989999999999 Q ss_pred HHHHCCC-E---EECCCHHHHH-HHHCCCCCEEEEECCHHHHHHHHHC-CCCEEEEECCC----C---EEEEEEEEEEEC Q ss_conf 7762884-6---8308478988-8622586569995044999998507-99737883169----8---088886424337 Q gi|254780561|r 195 KIATKTA-T---VTKGWTESYG-FFLKGESDFVLSYSTSPGFYLLNYG-QDDYVAALFSE----G---HYLQIEVAAQLV 261 (338) Q Consensus 195 ~~~~~~~-~---~~~~~~~~~~-~~~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~e----G---~~~~~~~~~i~k 261 (338) ++..... . ...+..+... .+..|.+.+...+.+.......... ..++.+...+- + ......++++++ T Consensus 239 ~l~~~~~~~~~~~~~~~~~~~~~~f~~G~~a~~~~g~~~~~~~~~~~~~~~~~~~~~~P~~~~~~~~~~~~~~~~~~i~~ 318 (433) T COG1653 239 DLYKKGLLPKGASGYGWDDAGALAFGSGKVAMTIDGTWAIGYFKKAAGPKFDIGVAPLPAGPGGGGAAGGVGGGGLGVSK 318 (433) T ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEECHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCEEEEEEC T ss_conf 99862666767665357677554200364689970557766665134545664688268788878752211553799974 Q ss_pred CCCC-HHHHHHHHHHHCCHHHHHH-HHHCCCEEECCCCCC Q ss_conf 8999-8999999999639999999-885045050557658 Q gi|254780561|r 262 RSKQ-PQLAQEFMQFMISPSFQRI-LPTTNWMYPVVDISM 299 (338) Q Consensus 262 ~a~n-~~~A~~Fi~flls~e~Q~~-~a~~~~~~P~~~~~~ 299 (338) ++++ +++|++||+|+.|||.|.. +....+..|+++... T Consensus 319 ~~~~~~~aA~~fi~~l~s~e~q~~~~~~~~g~~P~~~~~~ 358 (433) T COG1653 319 KSKKHKEAAWKFLEFLTSPEAQAELLAKVTGGLPVRKSAY 358 (433) T ss_pred CCCCCHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHHH T ss_conf 8986699999999997690878888774148787625543 No 23 >pfam01547 SBP_bac_1 Bacterial extracellular solute-binding protein. This family also includes the bacterial extracellular solute-binding protein family POTD/POTF. Probab=99.71 E-value=3.4e-16 Score=112.66 Aligned_cols=209 Identities=16% Similarity=0.146 Sum_probs=114.8 Q ss_pred CCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHH--HHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCC Q ss_conf 51223388999999884-887999972886899978875--312664101101333789998875212613677000210 Q gi|254780561|r 42 ADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRM--EGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKL 118 (338) Q Consensus 42 ~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a--~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 118 (338) .+...+++++++|||+| ||||++...++ ++...++.+ ..++.++||+.. +...+....+.+++.+......... T Consensus 5 ~~~~~~~~li~~Fe~~~p~IkV~~~~~~~-~~~~~kl~~~~~ag~~~~Dv~~~-~~~~~~~~~~~g~l~~l~~~~~~~~- 81 (260) T pfam01547 5 TEGAALQALVAEFEKEHPGIKVEVESVGS-GSLAQKLTTAIAAGDGPADVFAS-ADDWLAKLAKAGLLLPLDDYVKNYL- 81 (260) T ss_pred CHHHHHHHHHHHHHHHCCCCEEEEEECCC-HHHHHHHHHHHHCCCCCCEEEEE-CCHHHHHHHHCCCEEECCHHHCCCC- T ss_conf 68999999999999888490899997882-79999999999769998669997-8365899987697450742210234- Q ss_pred CCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCC------CEEEEEE-CCCCCCHHHHHHHHHHHC------ Q ss_conf 001243211532431000110011124587645652120006------8289850-476530367899998717------ Q gi|254780561|r 119 PIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKP------WKIIYQD-PRTSTLGLGLLLWIQKIY------ 185 (338) Q Consensus 119 ~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~------~~~~~~~-~~~~~~~~~~~~~~~~~~------ 185 (338) .+|+++.+.... ....... .................. T Consensus 82 ------------------------------~~w~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (260) T pfam01547 82 ------------------------------VTWDELLEAAKKLKEKGKTPIGGAYGGDASGTLGYFTLALLASLGGPLFD 131 (260) T ss_pred ------------------------------CCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCCC T ss_conf ------------------------------77899999999977447875146630356662269999999852875434 Q ss_pred -------------------CCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHH---------- Q ss_conf -------------------731378987776288468308478988862258656999504499999850---------- Q gi|254780561|r 186 -------------------GDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNY---------- 236 (338) Q Consensus 186 -------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~---------- 236 (338) ....................+..+....+.+|+..+.+.+.+......... T Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (260) T pfam01547 132 KDGGALDNPEAVTAITYVEDLYAIVLLLKKLKNPGVAGADGREALALFEQGKAAMAIVGGGAASANKPGLKVAFAAPAPD 211 (260) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCEEEEEECHHHHHHHHHHCCCHHHHHCCC T ss_conf 43344466223326899999999998750357877666787999999866963699835899999874022001110244 Q ss_pred --CCCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHH Q ss_conf --79973788316980888864243378999899999999963999999 Q gi|254780561|r 237 --GQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQR 283 (338) Q Consensus 237 --~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~ 283 (338) ....+....+.+|+.....+++|+|+++|+++|++||+||+|||+|+ T Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~I~~~skn~eaA~~Fi~fl~s~e~Qa 260 (260) T pfam01547 212 PKGDIGYAPLPAGKATFVGGYGLAIPKGSKNKEAAKKFLDFLLSPEAQA 260 (260) T ss_pred CCCCCCEECCCCCCCCEECCEEEEEECCCCCHHHHHHHHHHHCCHHHCC T ss_conf 5777533038999865055479999899999999999999977997569 No 24 >TIGR01256 modA molybdate ABC transporter, periplasmic molybdate-binding protein; InterPro: IPR005950 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry describes the molybdate ABC transporter periplasmic binding protein in bacteria and archae. Several of the periplasmic receptors constitute a diverse class of binding proteins that differ widely in size, sequence and ligand specificity. It has been shown experimentally by radioactive labeling that ModA represents a hydrophylioc periplasmic-binding protein in Gram-negative organisms and its counterpart in Gram-positive organisms is a lipoprotein. The other components of the system include ModB, an integral membrane protein and ModC, the ATP-binding subunit. Almost all of them display a common beta/alpha folding motif and have similar tertiary structures consisting of two globular domains.; GO: 0015412 molybdate transmembrane-transporting ATPase activity, 0015689 molybdate ion transport, 0030288 outer membrane-bounded periplasmic space. Probab=99.58 E-value=1.2e-13 Score=97.68 Aligned_cols=211 Identities=16% Similarity=0.157 Sum_probs=147.1 Q ss_pred HHHHHHHHHHHHHHC-CEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCC Q ss_conf 233889999998848-8799997288689997887531266410110133378999887521261367700021000124 Q gi|254780561|r 45 GAGPKIKQAFERKCN-CELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWD 123 (338) Q Consensus 45 ~~~~~i~~~Fek~~g-ikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 123 (338) ..+++|.+.|+++|| .+|.+ ++++++.+..|+. .+.++|+++..+...+..+.+.++..+..... T Consensus 6 ~Al~~i~~~F~k~~g~~~Vv~-sFgsSg~L~~QI~---~GAP~D~F~SA~~~~~~~L~~~g~~v~~~~~~---------- 71 (225) T TIGR01256 6 DALKEIAKQFEKRTGKNKVVF-SFGSSGTLYTQIE---NGAPADVFISADEKRPKKLVDKGLVVKGSEFV---------- 71 (225) T ss_pred HHHHHHHHHHHCCCCCCEEEE-EECCCHHHHHHHH---CCCCEEEEECCCCCCCHHHHHCCCEECCCCEE---------- T ss_conf 899999988620089956999-8768178999997---17978876614634540466378730254458---------- Q ss_pred CCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCC-----CEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 3211532431000110011124587645652120006-----82898504765303678999987177313789877762 Q gi|254780561|r 124 DDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKP-----WKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIAT 198 (338) Q Consensus 124 ~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (338) +..+ -+..-.......-.+.+.+..+.+. +++++.+|..--++....-.+...+-. ..+.. T Consensus 72 ------~~~n--~LVL~~~~~~~~~~~~~~l~~p~~~~~l~~~~vaigdP~~~P~G~~A~E~L~~lgl~------~~l~~ 137 (225) T TIGR01256 72 ------YAGN--KLVLISPKNSRVVDSLDILKKPGWVIVLKDKTVAIGDPKHAPYGAAAKEVLQKLGLW------EKLKK 137 (225) T ss_pred ------EEEE--EEEEEECCCCCCCCCHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCC------HHHHC T ss_conf ------9862--189985355420130221138974899863702346824443579999999707976------25644 Q ss_pred CCCEEEC-CCHHHHHHHHCCCCCEEEEECCHHHHHHHHHC-CCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 8846830-84789888622586569995044999998507-997378831698088886424337899989999999996 Q gi|254780561|r 199 KTATVTK-GWTESYGFFLKGESDFVLSYSTSPGFYLLNYG-QDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFM 276 (338) Q Consensus 199 ~~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~fl 276 (338) +.+. .. +..+..+.+.+|+.++++++.++.. ... .....+++|++=.-..-+.++|+++++|..+|+.|++|| T Consensus 138 kLV~-g~P~vrQA~~~V~~~nA~~GiVa~Sdv~----~~~K~~~~v~~~p~~ly~Pi~yp~~i~~~g~n~~~a~~f~dyL 212 (225) T TIGR01256 138 KLVY-GEPDVRQALQFVETGNAPIGIVALSDVI----PSGKKVGSVAVIPEDLYKPIKYPAVIVKGGKNNAAAKAFIDYL 212 (225) T ss_pred CEEE-CCCCHHHHHHHHHHCCCCEEEEEHHEEE----ECCCCCCEEEEECHHHCCCEEEEEEEEECCCCHHHHHHHHHHH T ss_conf 8652-4888999988877258866687311022----0003662799963312796021178871687608999999870 Q ss_pred CCHHHHHHHHHC Q ss_conf 399999998850 Q gi|254780561|r 277 ISPSFQRILPTT 288 (338) Q Consensus 277 ls~e~Q~~~a~~ 288 (338) -|||++++|-++ T Consensus 213 ~sp~A~~il~~Y 224 (225) T TIGR01256 213 KSPEAKEILRKY 224 (225) T ss_pred CCHHHHHHHHCC T ss_conf 796476786305 No 25 >COG4588 AcfC Accessory colonization factor AcfC, contains ABC-type periplasmic domain [General function prediction only] Probab=99.49 E-value=4.6e-11 Score=82.74 Aligned_cols=245 Identities=15% Similarity=0.170 Sum_probs=144.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHH Q ss_conf 90279999999999999986146742348859999788778512233889999998848879999728868999788753 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRME 80 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~ 80 (338) ||+...+++.+++... +.....+++|..++ +...+..+.+.||++||++|++. +|.......+. T Consensus 1 Mk~~~~i~~~~~La~s---------~~~~adinlYGpGG---PhtaL~~vA~~~~ektg~kVnvt-~GPq~tW~~kA--- 64 (252) T COG4588 1 MKKAVLILLIFLLAFS---------SAANADINLYGPGG---PHTALKDVAKKYEEKTGIKVNVT-AGPQATWNEKA--- 64 (252) T ss_pred CCHHHHHHHHHHHHHH---------HHHCCEEEEECCCC---CCHHHHHHHHHHHHHHCEEEEEE-CCCCCHHHHHH--- T ss_conf 9336889999999724---------64045178766999---86789999999878748079994-18861044564--- Q ss_pred HCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCC Q ss_conf 12664101101333789998875212613677000210001243211532431000110011124587645652120006 Q gi|254780561|r 81 GENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKP 160 (338) Q Consensus 81 ~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~ 160 (338) ...+|++++.+...+....... ...+...+.. |..... .+..-+...+...+..+||+.+.- T Consensus 65 --kknADilfgaseqsalaia~~~-----~~~fs~~~i~---------ply~R~-aiIlvkkgNPknIk~~eDll~~gi- 126 (252) T COG4588 65 --KKNADILFGASEQSALAIAEDH-----KDSFSEKNIQ---------PLYLRP-AIILVKKGNPKNIKGFEDLLKPGI- 126 (252) T ss_pred --HCCCCEEECCCHHHHHHHHHHC-----CCCCCCCCCC---------EEEEEC-EEEEECCCCCCCCCCHHHHHCCCC- T ss_conf --1367456416488899999856-----3335520253---------135402-179962799544456888705786- Q ss_pred CEEEEEECCC----CCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHC-CCCCEEEEECCHHHHHHHH Q ss_conf 8289850476----5303678999987177313789877762884683084789888622-5865699950449999985 Q gi|254780561|r 161 WKIIYQDPRT----STLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLK-GESDFVLSYSTSPGFYLLN 235 (338) Q Consensus 161 ~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~ 235 (338) ++...+..+ ++.+.+.-.+ +-....+.+..++.|...|.+++......+.+ -.++..+.|.... T Consensus 127 -~ivV~dGaG~sntsgtgvwED~a----gr~~~ie~v~afR~NI~~fapnSgaArkaf~~~~~aDawItW~dWa------ 195 (252) T COG4588 127 -GIVVNDGAGVSNTSGTGVWEDIA----GRKGNIETVAAFRKNIVAFAPNSGAARKAFENQPDADAWITWADWA------ 195 (252) T ss_pred -EEEEECCCCCCCCCCCEEHHHHH----CCCCCHHHHHHHHHCEEEECCCCCHHHHHHHCCCCCCEEEEECCHH------ T ss_conf -29983797644777722048654----1011488999988523897468704899985499985588701323------ Q ss_pred HCCCCEEEEEC-CCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCE Q ss_conf 07997378831-698088886424337899989999999996399999998850450 Q gi|254780561|r 236 YGQDDYVAALF-SEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWM 291 (338) Q Consensus 236 ~~~~~~~~~~~-~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~ 291 (338) ...+.+.-.++ ..-..++-+.-++++..+|+ .|+.|++||.|.|+|+.+-.++|. T Consensus 196 ~snpdig~~v~~~~d~vIyRd~nv~~~~~a~~-ea~~F~dyl~S~EAq~ifkkygW~ 251 (252) T COG4588 196 KSNPDIGDAVEIEKDYVIYRDFNVALAKDANK-EARDFADYLQSDEAQKIFKKYGWT 251 (252) T ss_pred HHCCCHHCEEECCCCEEEEEECCEEECCCCCH-HHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 30985010045166617754020343689997-899999997217789999860767 No 26 >COG4150 CysP ABC-type sulfate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=99.41 E-value=5.9e-12 Score=87.91 Aligned_cols=266 Identities=16% Similarity=0.193 Sum_probs=147.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHH----HH-CCE--EEEEECCCHHHH Q ss_conf 90279999999999999986146742348859999788778512233889999998----84-887--999972886899 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFER----KC-NCE--LKLIGLSDGVAL 73 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek----~~-gik--V~~~~~~~~~~~ 73 (338) |||++. ..+..++++++.....++ ..+|.-. ||....+.+..+-++|++ ++ |-+ |+....|++.+. T Consensus 6 mkk~~~---~~~~~~~L~a~~~qa~as-~~~lLNs---SYDv~RElf~a~N~~F~~~w~~~~pG~~lti~QSHAGsSkQA 78 (341) T COG4150 6 MKKNLL---IGATILALFAALLQAGAS-ATELLNS---SYDVSRELFAALNPAFEAQWAKDNPGDKLTIKQSHAGSSKQA 78 (341) T ss_pred HHHHHH---HHHHHHHHHHHHHHCCCC-HHHHHHH---HHHHHHHHHHHHCHHHHHHHHHHCCCCEEEEECCCCCCHHHH T ss_conf 889887---669999999998761467-0667630---567899999983989998888608996468723456655889 Q ss_pred HHHHHHHHCCCCCEEEECCCHHHHHHHHHHH-CCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHH Q ss_conf 9788753126641011013337899988752-126136770002100012432115324310001100111245876456 Q gi|254780561|r 74 LNKLRMEGENSAADIVLGFDNNLIDLARKTG-LFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFD 152 (338) Q Consensus 74 ~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~ 152 (338) +...+ +..+||+.......+..+...+ |+...+-...+++.. | +++..-..-+...+...++|+ T Consensus 79 --raIlq--GL~ADVVTfNQVtDVqiLhdkg~lipAdWq~~lPnnsS---------P--fYS~~aFLVR~GNPK~IkdW~ 143 (341) T COG4150 79 --RAILQ--GLKADVVTFNQVTDVQILHDKGKLIPADWQSRLPNNSS---------P--FYSTMAFLVRKGNPKNIKDWD 143 (341) T ss_pred --HHHHH--HCCCCEEEEECCEEEEEEHHCCCCCCHHHHHHCCCCCC---------C--CCCCHHHHHHCCCCCCCCCHH T ss_conf --99985--42456677402102432110487046245640998889---------6--432327775368966676577 Q ss_pred HHCCCCCCCEEEEEECCCCCCHHHHHHHH-----HHHCCC--CHHHHHHHHHHCCCEEECCCHH-HHHHHHCCCCCEEEE Q ss_conf 52120006828985047653036789999-----871773--1378987776288468308478-988862258656999 Q gi|254780561|r 153 ELINSTKPWKIIYQDPRTSTLGLGLLLWI-----QKIYGD--NSAQVWKKIATKTATVTKGWTE-SYGFFLKGESDFVLS 224 (338) Q Consensus 153 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~ 224 (338) ||.-.. -++.+++|..|+....-.++. ...+++ ...+++.++..|+..|-.++.. ......+|-.++.+. T Consensus 144 DLvRdD--Vk~iFPNPKTSGNaRYTYLAA~~~a~~af~gD~~k~~ef~~k~~~Nv~VFDTGGRgATTtFveRglGDVLIt 221 (341) T COG4150 144 DLVRDD--VKLIFPNPKTSGNARYTYLAAWGAADEAFGGDKAKTEEFMTKFLKNVEVFDTGGRGATTTFVERGLGDVLIT 221 (341) T ss_pred HHHHHC--CEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCEECCCCCCCCCHHHHHCCCCEEEE T ss_conf 775503--447668998887600158898788876526858899999999961883421688642000455256647999 Q ss_pred ECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEE---ECCCCCHHHHHHHHHHHCCHHHHHHHHHCCC Q ss_conf 50449999985079973788316980888864243---3789998999999999639999999885045 Q gi|254780561|r 225 YSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQ---LVRSKQPQLAQEFMQFMISPSFQRILPTTNW 290 (338) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i---~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~ 290 (338) .......-+...+.....++.|+-.-++-.-...+ +...-..++|+.+++||.||++|.++++.++ T Consensus 222 FEsE~~~irkqyg~d~~evVvP~~siLAEFPVa~Vdkvv~k~Gt~~~AkaYl~~LYsp~~Q~i~a~~~~ 290 (341) T COG4150 222 FESEVNNIRKQYGADKFEVVVPKTSILAEFPVAWVDKVVDKNGTEKAAKAYLNYLYSPQAQTIIAEFYY 290 (341) T ss_pred EEHHHCCHHHHHCCCCCEEECCCHHHHHHCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHCC T ss_conf 406530278873866523755653355416526775565302629999999988609388799998353 No 27 >TIGR00971 3a0106s03 sulfate ABC transporter, sulfate-binding protein; InterPro: IPR005669 Thiosulphate-binding proteins (gene cysP) specifically binds thiosulphate and are involved in the transport systems for this nutrients , . There are two conserved regions in the protein, one located in the N-terminal region and the other in the central part of these proteins. The second pattern includes two adjacent amino acids (Ser-Gly) that, in sbp, are known to be essential for sulphate binding . ; GO: 0015419 sulfate transmembrane-transporting ATPase activity, 0008272 sulfate transport, 0030288 outer membrane-bounded periplasmic space. Probab=99.39 E-value=6.5e-12 Score=87.68 Aligned_cols=249 Identities=14% Similarity=0.166 Sum_probs=162.2 Q ss_pred CCCCCHHHHHHHHHHH----HHHHC-CEEEEEE--CCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC-CCCCC Q ss_conf 7785122338899999----98848-8799997--288689997887531266410110133378999887521-26136 Q gi|254780561|r 39 SFVADEGAGPKIKQAF----ERKCN-CELKLIG--LSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL-FAKSN 110 (338) Q Consensus 39 ~~~~~~~~~~~i~~~F----ek~~g-ikV~~~~--~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l-~~~~~ 110 (338) ||....+++..+-++| .+++| =+|.+.. .+++-+.++.+ .+..+||+.......+..+.+.+- ++... T Consensus 20 SYD~~RELy~~~N~~F~~~W~~~~GGD~~~i~QSH~gs~KQA~~v~----~G~~ADvVT~~~~~DV~~~~~~G~lI~~~W 95 (320) T TIGR00971 20 SYDVTRELYEQYNKAFEAHWKKDTGGDKVVIKQSHGGSGKQALSVL----NGLEADVVTLALAYDVQILAERGRLIDKDW 95 (320) T ss_pred CCCHHHHHHHHHCHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHCCCCCCHHH T ss_conf 3331069998718157875666448954789742688606889987----001255778888876899984387365468 Q ss_pred CCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHH-H----HHHC Q ss_conf 7700021000124321153243100011001112458764565212000682898504765303678999-9----8717 Q gi|254780561|r 111 IDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLW-I----QKIY 185 (338) Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~~ 185 (338) ...++.+-.+ ...++++..++ ..+...++|.||..+. -++.+++|..|+...+-.++ . ...+ T Consensus 96 ~~rLP~NS~P----------y~ST~~FLVRK-GNPK~I~DWnDL~~~~--v~~I~PNPK~SG~AR~~YLaAWG~A~~~~~ 162 (320) T TIGR00971 96 IKRLPDNSAP----------YTSTIVFLVRK-GNPKQIHDWNDLIKPG--VSVITPNPKSSGGARYNYLAAWGYALHADN 162 (320) T ss_pred HHHCCCCCCC----------CCEEEEEEEEC-CCCCCCCCHHHHHHCC--CEEEECCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 8628787886----------20133556754-7989646513463358--758607889864157899998766751378 Q ss_pred CC--CHHHHHHHHHHCCCEEECCCHH-HHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCC-----EEEEEEEE Q ss_conf 73--1378987776288468308478-9888622586569995044999998507997378831698-----08888642 Q gi|254780561|r 186 GD--NSAQVWKKIATKTATVTKGWTE-SYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEG-----HYLQIEVA 257 (338) Q Consensus 186 ~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG-----~~~~~~~~ 257 (338) ++ ...++...+.+|.-.+-+|... .....++|-.|+.++|..........-+.....++.|.+- ++.+++-. T Consensus 163 ~D~aK~~~F~~~l~KNvEVlD~G~RGaT~TF~ERG~GDVLI~~E~E~~~~~~~~~~~~FE~V~P~~~ILAE~~V~~VDK~ 242 (320) T TIGR00971 163 GDKAKAEQFVKALLKNVEVLDSGARGATNTFVERGLGDVLIAFENEALLIRKELEKDKFEIVIPSESILAEPTVSVVDKV 242 (320) T ss_pred CCHHHHHHHHHHHHCCEEEEECCCCCCCCCHHHCCCCHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCEEEEECC T ss_conf 76245899999985010332068886521011106330013457889999996334872686477432135650357433 Q ss_pred EEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCCCCCCHHHHCC Q ss_conf 43378999899999999963999999988504505055765882134122 Q gi|254780561|r 258 AQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWMYPVVDISMPDVYQAIR 307 (338) Q Consensus 258 ~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~~p~~~~~~~ 307 (338) +--+++ ++.|+++|+||.|||+|.+++.. |++|-++.++......++ T Consensus 243 V~~~~T--~~~A~AYL~yLYSP~~Q~I~~~~-yYR~~~~~v~~K~~~~FP 289 (320) T TIGR00971 243 VEKKGT--KKVAEAYLKYLYSPEGQEIIAKN-YYRVRDAEVAKKYEDKFP 289 (320) T ss_pred CCCCCH--HHHHHHHHHHHCCCCHHHHHHHC-CCCCCCHHHHHHHHCCCC T ss_conf 032345--89999999740494135676542-678888889867751688 No 28 >COG2998 TupB ABC-type tungstate transport system, permease component [Coenzyme metabolism] Probab=99.33 E-value=1.8e-10 Score=79.25 Aligned_cols=229 Identities=14% Similarity=0.128 Sum_probs=120.5 Q ss_pred CCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 23488599997887785122338899999988488799997288689997887531266410110133378999887521 Q gi|254780561|r 26 LPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 26 ~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) ++...+|.+.|..| .-+...++.+++.||+.+|++|+++..|.+..+ ..+.++..||++.+++..-......+. T Consensus 26 sa~~~~LrmATTTS-t~dtGLLd~l~p~fE~~~g~~v~~vAvGTG~AL-----kmge~gdvDvv~vHapk~E~~fv~~G~ 99 (280) T COG2998 26 SADAAELRMATTTS-TEDTGLLDVLLPKFEKYTGYDVQVVAVGTGKAL-----KMGERGDVDVVIVHAPKAEKEFVKDGF 99 (280) T ss_pred CCCCCEEEEEEECC-CCCCCCHHHHHHHHHHCCCCEEEEEEECCHHHH-----HHHCCCCCCEEEEECCHHHHHHHHCCC T ss_conf 75420378854302-455631888777664336965899981235877-----511268867899837077899987278 Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHH---HCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 261367700021000124321153243100011---00111245876456521200068289850476530367899998 Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFI---YDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQ 182 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (338) .....+ +++..+.++ -++.++....+--+.|..-...+...+.....|+........+. T Consensus 100 gv~r~~------------------vmYNdFiiVgp~~dpA~~k~~kn~~e~fe~Ia~~ka~FvSRGD~SGT~~~E~~lWk 161 (280) T COG2998 100 GVDRRP------------------VMYNDFIIVGPADDPAGIKDAKNGKEAFEKIAEEKAKFVSRGDNSGTDSKELSLWK 161 (280) T ss_pred CCCCCC------------------EEEEEEEEECCCCCCHHCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHH T ss_conf 866761------------------01311799877666111122000899999999728905742787784277888998 Q ss_pred HHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEE-- Q ss_conf 717731378987776288468308478988862258656999504499999850799737883169808888642433-- Q gi|254780561|r 183 KIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQL-- 260 (338) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~-- 260 (338) ..+.+.... ..-.....+..+...+-. ..-...+.- ...+ ..-...+.+.++...+..+..++++.++ T Consensus 162 ~~g~~p~~~------~wY~s~G~GMg~tL~~A~-Ek~aytLtD--rgTy-laYr~r~~L~iv~~gd~~L~N~Ysvi~vNP 231 (280) T COG2998 162 VTGIEPTVK------GWYISAGQGMGDTLNMAN-EKKAYTLTD--RGTY-LAYRNRPTLVIVLEGDPSLFNPYSVIAVNP 231 (280) T ss_pred HCCCCCCCC------CCEEECCCCHHHHHHHHH-HHCEEEEEC--CCEE-EEECCCCCEEEEECCCCCCCCCEEEEEECH T ss_conf 708788988------603652744577877765-512168613--6305-676276652899617811167125898761 Q ss_pred CCCC--CHHHHHHHHHHHCCHHHHHHHHHC Q ss_conf 7899--989999999996399999998850 Q gi|254780561|r 261 VRSK--QPQLAQEFMQFMISPSFQRILPTT 288 (338) Q Consensus 261 k~a~--n~~~A~~Fi~flls~e~Q~~~a~~ 288 (338) +..| |-.+|.+||+|++|+++|..|++. T Consensus 232 ~r~~~vny~~A~kfi~w~~s~~gq~~Ia~f 261 (280) T COG2998 232 KRVKGVNYTAATKFIEWLMSEKGQNLIAEF 261 (280) T ss_pred HCCCCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 007885711899999999647778888650 No 29 >PRK03537 hypothetical protein; Provisional Probab=98.88 E-value=5.4e-08 Score=64.81 Aligned_cols=144 Identities=16% Similarity=0.156 Sum_probs=86.1 Q ss_pred CCCCCCCHHH-HCCCCCCCEEEEEECCCCCCHHHHHHHH---HHHCCCCHHHHHHHHH-----HCCCEEECCCHHHHHHH Q ss_conf 2458764565-2120006828985047653036789999---8717731378987776-----28846830847898886 Q gi|254780561|r 144 ITQPPKNFDE-LINSTKPWKIIYQDPRTSTLGLGLLLWI---QKIYGDNSAQVWKKIA-----TKTATVTKGWTESYGFF 214 (338) Q Consensus 144 ~~~~p~s~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 214 (338) ......+|.| |.++.. |+++..+...-.+......+ .......+.....+.. .+......+.......+ T Consensus 32 ~~~~~~~~ld~L~~p~~--ri~~stP~~dP~G~Ya~~~l~~a~~~~~g~~~~l~~ka~~lvgg~~s~~~p~gr~~~~~~~ 109 (188) T PRK03537 32 VMLTSENLLDLLLDPDI--RLGTSTPGADPSGDYAWQLFDRAEALKAGAGEALRTKALQLVGGPNSAPIPAGRNAYEWLI 109 (188) T ss_pred CCCCCCCHHHHHCCCCE--EEEECCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHH T ss_conf 87674468888439974--9982699989724899999998875277617899998877417765666887520323331 Q ss_pred HCCCCCEEEEECCHHHHHHHHHCCCCEEEE-ECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCCCEEE Q ss_conf 225865699950449999985079973788-3169808888642433789998999999999639999999885045050 Q gi|254780561|r 215 LKGESDFVLSYSTSPGFYLLNYGQDDYVAA-LFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTNWMYP 293 (338) Q Consensus 215 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~~~~P 293 (338) .+|+.|++++|.++... ..+....+.++ .|.+-.+...+++++.++++ +.|+.|++|++|+|+|+++++.++.-| T Consensus 110 ~~geaDa~ivY~TdA~~--a~~~~~~l~~v~iP~~~~~~~~Y~iav~~~a~--~~A~~f~~flls~~gQ~iL~~~GF~pp 185 (188) T PRK03537 110 ENKQADIFIGYCTSAAL--ALKEVPDLQIVDLPEPLAVGANYGLAVLKDAS--EAARPLAEFILSPKGQAILAQYGFSPP 185 (188) T ss_pred CCCCCCEEEEEECCHHH--HHHCCCCCEEEECCCCCCCCCCEEEEECCCCC--HHHHHHHHHHCCHHHHHHHHHCCCCCC T ss_conf 05974579998514787--85158985499588000666323578736880--999999999739999999998089999 No 30 >PRK09861 cytoplasmic membrane lipoprotein-28; Provisional Probab=98.82 E-value=7.4e-06 Score=52.35 Aligned_cols=253 Identities=12% Similarity=0.111 Sum_probs=133.4 Q ss_pred CCH---HHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHH Q ss_conf 902---79999999999999986146742348859999788778512233889999998848879999728868999788 Q gi|254780561|r 1 MKK---FARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKL 77 (338) Q Consensus 1 MKk---~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~ 77 (338) ||+ +++..+++++++.++++|+.+ .+++++++|-+.. .+...+++.+.+.-.++.|++|+++.+.+...--..+ T Consensus 1 ~~~~~~~l~~~a~ll~~~~~l~~C~~~-~~~~k~ikVG~~~--~p~~~i~e~~~~~~~ek~G~~leiv~FsDy~~PN~AL 77 (272) T PRK09861 1 MKLTTHHLRAGAALLLAGILLAGCDQS-SSDAKHIKVGVIN--GAEQDVAEVAKKVAKEKYGLDVELVGFSGSLLPNDAT 77 (272) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHCCCC-CCCCCEEEEEECC--CCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHH T ss_conf 951099999999999999998745897-6789738999678--9869999999988876179768999946862262897 Q ss_pred HHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCC Q ss_conf 75312664101101333789998875212613677000210001243211532431000110011124587645652120 Q gi|254780561|r 78 RMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINS 157 (338) Q Consensus 78 ~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~ 157 (338) + ++..|.-+.-...++.......-.. ....+..+..| .+ .+...-+|.+++ T Consensus 78 -~---~G~iDaN~fQH~pyL~~~n~~~g~~-----------L~~v~~~~~~P-----~g------lYS~K~ksl~dl--- 128 (272) T PRK09861 78 -N---HGELDANVFQHRPFLEQDNQAHGYK-----------LVAVGNTFVFP-----MA------GYSKKIKTVAQI--- 128 (272) T ss_pred -H---CCCCCCHHHHHHHHHHHHHHHCCCC-----------EEEEEEEEEEE-----EE------ECCCCCCCHHHC--- T ss_conf -6---7983602455799999999986995-----------79973167830-----14------034465987584--- Q ss_pred CCCCEEEEEECCCCCCHHHHHHHHH---HHCCCC---HHHHHHHHHHCCCEEECCC-HHHHHHHHCCCCCEEEEECCHHH Q ss_conf 0068289850476530367899998---717731---3789877762884683084-78988862258656999504499 Q gi|254780561|r 158 TKPWKIIYQDPRTSTLGLGLLLWIQ---KIYGDN---SAQVWKKIATKTATVTKGW-TESYGFFLKGESDFVLSYSTSPG 230 (338) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~~~ 230 (338) ....++++++...+.......+.-. ...... .............++..-. .+..+.+...+++.++.+.+. T Consensus 129 p~Ga~IaIPnDpsN~~RAL~lL~~aGLI~Lk~~~~~~~t~~DI~~NPk~lk~~e~~aaql~rsl~Dp~vD~avin~n~-- 206 (272) T PRK09861 129 KEGATVAIPNDPTNLGRALLLLQKEKLITLKEGKGLLPTALDITDNPRHLQIMELEGAQLPRVLDDPKVDVAIISTTY-- 206 (272) T ss_pred CCCCEEEECCCHHHHHHHHHHHHHCCCEEECCCCCCCCCHHHHHCCCCCCEEEECCHHHHHHHHCCCCCCEEEECHHH-- T ss_conf 799989804781269999999998897897799997777445760887767998267774675057761689976057-- Q ss_pred HHHHHHCCCCE-EEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCC Q ss_conf 99985079973-788316980888864243378999899999999963999999988504 Q gi|254780561|r 231 FYLLNYGQDDY-VAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTN 289 (338) Q Consensus 231 ~~~~~~~~~~~-~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~ 289 (338) ....+..+. ...+.++..-.....+++-++.++.+..++|++-+.|+|+++.+-+.+ T Consensus 207 --a~~agl~p~~dal~~E~~~~py~niiavr~~~~d~~~ik~lv~a~~S~evk~~i~~~y 264 (272) T PRK09861 207 --IQQTGLSPVHDSVFIEDKNSPYVNILVAREDNKNAENVKEFLQSYQSPEVAKAAETIF 264 (272) T ss_pred --HHHCCCCCCCCCEEECCCCCCEEEEEEECCCCCCCHHHHHHHHHHCCHHHHHHHHHHC T ss_conf --9887779440020366899983799998276769989999999977999999999877 No 31 >COG3221 PhnD ABC-type phosphate/phosphonate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=98.77 E-value=6e-06 Score=52.91 Aligned_cols=257 Identities=15% Similarity=0.099 Sum_probs=137.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCC-----CEEEEEECCCC--CCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHH Q ss_conf 90279999999999999986146742348-----85999978877--851223388999999884887999972886899 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAK-----PVLTVYTYNSF--VADEGAGPKIKQAFERKCNCELKLIGLSDGVAL 73 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~l~v~~~~~~--~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~ 73 (338) |+|...++.+++.+.++++||+.+....+ ++|++---.+- .......+++.+-++++.|++|+.....+...+ T Consensus 1 m~~~~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~~l~~gi~p~e~~~~~~~~~~pl~~~L~~~lG~~V~~~~a~dy~~v 80 (299) T COG3221 1 MKKKKLLLASAAAVLALAGCCGSSESDAPAAEDPKELRVGIVPTENPTNLIPAWAPLADYLEKELGIPVEFFVATDYAAV 80 (299) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH T ss_conf 94278899999999999863257745432226875359997688886878876566998988986995435725768999 Q ss_pred HHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHH Q ss_conf 97887531266410110133378999887521261367700021000124321153243100011001112458764565 Q gi|254780561|r 74 LNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDE 153 (338) Q Consensus 74 ~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~ 153 (338) +..+. ++..|+.+......+..........+.... ...++. + .++..++.+++ .+.++..+ T Consensus 81 ieal~----~g~~D~A~~~~~a~~~a~~~~~~~e~~~~~------~~~dg~----~--~Y~S~~i~~~d---s~i~sl~d 141 (299) T COG3221 81 IEALR----AGQVDIAWLGPSAYVEAVDRALAGEPLAQT------VQKDGS----P--GYYSVIIVRAD---SPIKSLED 141 (299) T ss_pred HHHHH----CCCEEEEECCCHHHHHHHHHCCCCCCEEEE------ECCCCC----C--CEEEEEEEECC---CCCCHHHH T ss_conf 99985----896107741857789986412454201332------125898----6--51789999279---97433788 Q ss_pred HCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHH Q ss_conf 21200068289850476530367899998717731378987776288468308478988862258656999504499999 Q gi|254780561|r 154 LINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYL 233 (338) Q Consensus 154 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 233 (338) +. ..++++.++............+...+..+. ........+.++.......+.+|+++++..+...+.... T Consensus 142 lk----gk~~af~d~~StSG~l~P~~~L~~~g~~d~-----~~~f~~v~~~G~H~~a~~aV~nG~vDva~~~~~~~~~~~ 212 (299) T COG3221 142 LK----GKRFAFGDPDSTSGYLFPLYYLAKEGGIDP-----DKFFGEVIFSGGHDAAVLAVANGQVDVAAVNSSARGLLK 212 (299) T ss_pred HC----CCEEECCCCCCCCHHHHHHHHHHHHCCCCH-----HHHHCEEECCCHHHHHHHHHHCCCCEEEECCHHHHHHHH T ss_conf 26----996763699854124768999998438984-----654226503570799999997488308852288887776 Q ss_pred HHHCC---CCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCH---HHHHHHHHC Q ss_conf 85079---97378831698088886424337899989999999996399---999998850 Q gi|254780561|r 234 LNYGQ---DDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISP---SFQRILPTT 288 (338) Q Consensus 234 ~~~~~---~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~---e~Q~~~a~~ 288 (338) ..... ..+.++...+ ..+..-.++.+.=+++.-.++.++++.= |.++.+..- T Consensus 213 ~~~~~~~~~~l~vi~~S~---~iP~~pi~vr~~L~~~~k~kl~~af~~l~~~~~~~~l~~~ 270 (299) T COG3221 213 KAAPEGVAEKLRVIWKSP---LIPNDPIAVRSDLPADLKEKLRDAFLDLAKTEDKKILLDL 270 (299) T ss_pred HHCCCCCHHHCEEEEECC---CCCCCCEEEECCCCHHHHHHHHHHHHHCCCCCCCHHHHHH T ss_conf 404334533426999558---9899978883779999999999999843743113588975 No 32 >PRK11063 metQ DL-methionine transporter substrate-binding subunit; Provisional Probab=98.76 E-value=1.2e-05 Score=51.09 Aligned_cols=248 Identities=13% Similarity=0.043 Sum_probs=122.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCC Q ss_conf 99999999999998614674234885999978877851223388999999884887999972886899978875312664 Q gi|254780561|r 6 RIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSA 85 (338) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~ 85 (338) .+++++++++.++++|++. .++..+++|-... .+..++++...+..+++.|++|+++.+.+.-+--..+. ++. T Consensus 8 ~~~~~~l~~~l~l~gCg~~-~~~~~~ikVG~~~--gp~~ei~e~~~~~~~ek~G~~veiv~FsDy~~pN~AL~----~G~ 80 (271) T PRK11063 8 FAAVGALIGSLALVGCGQD-EKDPNHIKVGVIV--GAEQQVAEVAQKVAKEKYGLDVELVTFNDYVLPNEALS----KGD 80 (271) T ss_pred HHHHHHHHHHHHHCCCCCC-CCCCCEEEEEECC--CCHHHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHH----CCC T ss_conf 9999999999975005898-5679818998468--98799999988888860797689999368321558986----798 Q ss_pred CEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEE Q ss_conf 10110133378999887521261367700021000124321153243100011001112458764565212000682898 Q gi|254780561|r 86 ADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIY 165 (338) Q Consensus 86 ~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~ 165 (338) .|.-+.-...++.......-.. ...-+..+..|. ..+...-+|.+++ ....++++ T Consensus 81 iDaN~fQH~~yL~~~nk~~g~~-----------L~~v~~~~~~P~-----------glYS~K~ksl~el---p~Ga~IaI 135 (271) T PRK11063 81 IDANAFQHKPYLDQQIKDRGYK-----------LVAVGNTFVYPI-----------AGYSKKIKSLDEL---QDGSQVAV 135 (271) T ss_pred CCHHHHHCHHHHHHHHHHCCCC-----------EEEEEEEEEEEE-----------EEEECCCCCHHHC---CCCCEEEE T ss_conf 4423455799999999977995-----------799866778415-----------8642365886475---69999980 Q ss_pred EECCCCCCHHHHHHHH---HHHCCCC---HHHHHHHHHHCCCEEECC-CHHHHHHHHCCCCCEEEEECCHHHHHHHHHCC Q ss_conf 5047653036789999---8717731---378987776288468308-47898886225865699950449999985079 Q gi|254780561|r 166 QDPRTSTLGLGLLLWI---QKIYGDN---SAQVWKKIATKTATVTKG-WTESYGFFLKGESDFVLSYSTSPGFYLLNYGQ 238 (338) Q Consensus 166 ~~~~~~~~~~~~~~~~---~~~~~~~---~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 238 (338) ++...+.......+.- +...... .............++..- ..+....+...+++.++...+. ....+- T Consensus 136 PND~sN~~RAL~lL~~aGLIkLk~~~~~~~T~~DI~~Npk~l~~~e~daaql~rsl~D~dv~~aviN~~~----a~~agl 211 (271) T PRK11063 136 PNDPTNLGRSLLLLQKVGLIKLKDGVGLLPTVLDIVENPKNLKIVELEAPQLPRSLDDAQIALAVINTTY----ASQIGL 211 (271) T ss_pred CCCCCHHHHHHHHHHHCCCEEECCCCCCCCCHHHHHCCCCCCEEEECCHHHHHHHCCCCCEEEEEECHHH----HHHCCC T ss_conf 4885609999999998898897799998789668854887748999037764431357651089814778----977798 Q ss_pred CCE-EEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCC Q ss_conf 973-788316980888864243378999899999999963999999988504 Q gi|254780561|r 239 DDY-VAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTN 289 (338) Q Consensus 239 ~~~-~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~ 289 (338) .+. ...+.++..-.....+++-++.++.+..+++++-+.|+|+++.+.+.+ T Consensus 212 ~p~kdal~~E~~~~py~Nvivvr~~~kd~~~ik~lv~~~~S~evk~~I~~~~ 263 (271) T PRK11063 212 TPAKDGIFVEDKDSPYVNLIVAREDNKDAENVKKFVQAYQSDEVYEAANKVF 263 (271) T ss_pred CCCCCCEEECCCCCCEEEEEEECCCCCCCHHHHHHHHHHCCHHHHHHHHHHC T ss_conf 9342504777899983899998277779989999999977999999999877 No 33 >COG0226 PstS ABC-type phosphate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=98.48 E-value=1.1e-05 Score=51.44 Aligned_cols=239 Identities=12% Similarity=0.039 Sum_probs=107.9 Q ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHH---HHHHHHHHC Q ss_conf 85999978877851223388999999884-88799997288689997887531266410110133378---999887521 Q gi|254780561|r 30 PVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNL---IDLARKTGL 105 (338) Q Consensus 30 ~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~---~~~~~~~~l 105 (338) .+|++ -+| +......+.+.++|.+.| +++|++...|++... .+...++ .|+--...+.. ....... T Consensus 43 ~~l~~--~GS-ttv~p~~~~~a~~~~~~~~~~~v~~q~~GSg~G~-~~~~~~~----vDig~ss~p~~~~e~~~~~~~-- 112 (318) T COG0226 43 GSLTI--AGS-TTVAPLMEAWAEEFQKAYPNVKVNVQAGGSGAGI-KQALAGG----VDIGASSRPLKDSEKEADAKA-- 112 (318) T ss_pred CEEEE--ECC-CCHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHH-HHHHCCC----CCCCCCCCCCCCCHHHHHHHC-- T ss_conf 35998--365-7389999999999998789947876368661889-9875267----703135766663267998740-- Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCC--------------CCCCCCCHHHHCCCCCCCEEE-EEECCC Q ss_conf 26136770002100012432115324310001100111--------------245876456521200068289-850476 Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQ--------------ITQPPKNFDELINSTKPWKII-YQDPRT 170 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~--------------~~~~p~s~~~l~~~~~~~~~~-~~~~~~ 170 (338) .......+.......++||.+. +....+.|.|..-.....++. +..... T Consensus 113 ----------------~~~l~~~pva~d~i~vi~N~~~~~~~~Lt~~~l~~If~G~it~W~d~~~~~~n~~I~vv~R~~~ 176 (318) T COG0226 113 ----------------GKKLVEFPVAIDAIVVIVNLPNPDVLNLTKDQLADIFSGKITNWNDVGGQLLNPGITVVGRSDG 176 (318) T ss_pred ----------------CCCCEEEEEEEEEEEEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCC T ss_conf ----------------7986697788777999975898541347999999997088875556686558998568847877 Q ss_pred CCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHH--H----HHHCCCCEEEE Q ss_conf 53036789999871773137898777628846830847898886225865699950449999--9----85079973788 Q gi|254780561|r 171 STLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFY--L----LNYGQDDYVAA 244 (338) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~--~----~~~~~~~~~~~ 244 (338) ++....+...+.. ....................+.........+-...++..-....... . ..+.+.....+ T Consensus 177 SGT~~~F~~~l~~--~~~~~~~g~~~~~~~~~~~~~n~~v~~~v~~tp~aIGYv~~~y~~~n~~~~~~~~~~~~~~~~~v 254 (318) T COG0226 177 SGTTETFTEYLLG--WKKKVGAGKTVDWPAAVGGKGNEGVVAAVSQTPGAIGYVGYAYAVQNNLKLAALANKAGGVGPTV 254 (318) T ss_pred CCHHHHHHHHHHH--HHHCCCCCCCCCCCCEECCCCCHHHHHHHHCCCCCEEEEEEHHHHHCCCHHHHHHHCCCCCCCCC T ss_conf 7769999999876--53203456654456400368857889888539984599986796524305666420157766665 Q ss_pred ECCC-----CEE--EEEEEEEEECCCC---CHHHHHHHHHHHCCHHHHHHHHHCCCEEECCCC Q ss_conf 3169-----808--8886424337899---989999999996399999998850450505576 Q gi|254780561|r 245 LFSE-----GHY--LQIEVAAQLVRSK---QPQLAQEFMQFMISPSFQRILPTTNWMYPVVDI 297 (338) Q Consensus 245 ~~~e-----G~~--~~~~~~~i~k~a~---n~~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~ 297 (338) .|.+ +.+ ++-..+.+++..+ ...+.++|++|++++..|. +++..++.|..+. T Consensus 255 ~p~~~~i~~~~y~Pi~~~~~~~v~~~~~~~~~~~~~~Fi~~~~~~~~~~-~~~~~Gyvplp~~ 316 (318) T COG0226 255 EPTAENIANGKYSPIVRPLFIYVNKNPKDTKAPAVKAFLDWALSPGGQQ-IVEEVGYVPLPDA 316 (318) T ss_pred CCHHHHHHCCCCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHCCCCCE-EHHHCCEECCCCC T ss_conf 6457775337735512578999827766665889999999996787600-1520881618724 No 34 >TIGR01098 3A0109s03R phosphonate ABC transporter, periplasmic phosphonate-binding protein; InterPro: IPR005770 Bacterial binding protein-dependent transport systems , are multicomponent systems typically composed of a periplasmic substrate-binding protein, one or two reciprocally homologous integral inner-membrane proteins and one or two peripheral membrane ATP-binding proteins that couple energy to the active transport system. The integral inner-membrane proteins translocate the substrate across the membrane. It has been shown , that most of these proteins contain a conserved region located about 80 to 100 residues from their C-terminal extremity. This region seems to be located in a cytoplasmic loop between two transmembrane domains. Apart from the conserved region, the sequence of these proteins is quite divergent, and they have a variable number of transmembrane helices. This is a family of periplasmic proteins which are part of the transport system for alkylphosphonate uptake. ; GO: 0015604 phosphonate transmembrane transporter activity, 0015716 phosphonate transport, 0030288 outer membrane-bounded periplasmic space. Probab=98.46 E-value=1.2e-05 Score=51.11 Aligned_cols=257 Identities=13% Similarity=0.020 Sum_probs=136.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC-------C--CC---CC---------CCEEEEEECCCCCCC--HHHHHHHHHHHHHH Q ss_conf 90279999999999999986146-------7--42---34---------885999978877851--22338899999988 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL-------D--GL---PA---------KPVLTVYTYNSFVAD--EGAGPKIKQAFERK 57 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~-------~--~~---~~---------~~~l~v~~~~~~~~~--~~~~~~i~~~Fek~ 57 (338) |||++.+++++++++++.+|+++ . ++ .. .++|++---.+.+.. ....+++.+..||+ T Consensus 1 mkr~~~~~~~~~~~~~~~~~~~S~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~~g~~P~e~~~~~~~~~~pL~~~Lek~ 80 (299) T TIGR01098 1 MKRLLALLAALLVASLAAACGSSEIDNRDCAAKAADAADADADVEKSETPKELNFGILPGENASNLARRWEPLKDYLEKK 80 (299) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHCCCCHHHCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHH T ss_conf 94788999999999999875420014543565510010156212103677726899844887557777668999988875 Q ss_pred HCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHH Q ss_conf 48879999728868999788753126641011013337899988752126136770002100012432115324310001 Q gi|254780561|r 58 CNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAF 137 (338) Q Consensus 58 ~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (338) .|++|++....+...+ +++-+ ....|+.+......+......+.-.-... .......-+-..+...+ T Consensus 81 Lg~kV~~~~~~dY~a~---IEamr-~g~~D~a~~g~~sY~~A~~~~~~~~~a~~---------~~~~~~~~g~~~Y~S~~ 147 (299) T TIGR01098 81 LGIKVELFVATDYSAV---IEAMR-FGRVDIAWFGPSSYVLAHRRANAEAFALT---------AIRYVSTDGSPGYYSVI 147 (299) T ss_pred CCCCEEEEEECCCCCE---EEEEC-CCCEEEEEECCCHHHEEEECCCCCHHHHH---------HHHHCCCCCCCCEEEEE T ss_conf 5886799861683103---34232-68078983478512322001387012134---------43311688898348999 Q ss_pred HHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCC Q ss_conf 10011124587645652120006828985047653036789999871773137898777628846830847898886225 Q gi|254780561|r 138 IYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKG 217 (338) Q Consensus 138 ~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 217 (338) +.+. .-+.+++.||+......++++.++..++....-...+.......... .......-..+.++-......+.+| T Consensus 148 iv~~---dS~i~~~~DlL~~LKGK~~af~Dp~STSG~l~P~~~l~~~~~~~~~~-~d~~~F~~~~~~G~Hd~~~~~V~nG 223 (299) T TIGR01098 148 IVKA---DSPIKSLKDLLKKLKGKTFAFGDPASTSGYLVPRYQLLKEGGLDADK-DDDKFFSEVVFSGSHDASILAVANG 223 (299) T ss_pred EECC---CCCCCCHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCCC-CCCCCCCEEEEECCHHHHHHHHHHC T ss_conf 9838---99963389999750797598506887512477889987650789888-5345233268843158999998808 Q ss_pred CCCEEEEECCHHHHHHHHHC---------------CCCEEEEECCCCEEEEE-EEEEEECCCCCH--HHHHHHHHHHCC Q ss_conf 86569995044999998507---------------99737883169808888-642433789998--999999999639 Q gi|254780561|r 218 ESDFVLSYSTSPGFYLLNYG---------------QDDYVAALFSEGHYLQI-EVAAQLVRSKQP--QLAQEFMQFMIS 278 (338) Q Consensus 218 ~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~eG~~~~~-~~~~i~k~a~n~--~~A~~Fi~flls 278 (338) .+|++..+............ ...++++.- +...| +.+++-++=+ + +.-.+..+++++ T Consensus 224 ~vDaA~~~~~~~~~~~~~~~~~~~~~~~~~g~~D~~~~~~vI~~---S~~IPndp~~~R~~L~-~yP~~k~ki~~~F~~ 298 (299) T TIGR01098 224 KVDAATNNSSAIERLKKRGPSDKFKKKVAKGATDLMKKVRVIWK---SPLIPNDPIAVRSDLP-PYPELKEKIRDAFLD 298 (299) T ss_pred CCCEEEECHHHHHHHHHCCCCCCCCEEECCCCCCCHHCEEEEEE---CCCCCCCCEEEECCCC-CCHHHHHHHHHHHHC T ss_conf 85457653044665653032356515520477671370178752---3787787546525789-667799999999851 No 35 >TIGR03431 PhnD phosphonate ABC transporter, periplasmic phosphonate binding protein. Note that this model does not identify all phnD-subfamily genes with evident phosphonate context, but all sequences above the trusted context may be inferred to bind phosphonate compounds even in the absence of such context. Furthermore, there is ample evidence to suggest that many other members of the TIGR01098 subfamily have a different primary function. Probab=98.46 E-value=9e-05 Score=46.03 Aligned_cols=250 Identities=14% Similarity=0.063 Sum_probs=121.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCC--HHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHH Q ss_conf 9027999999999999998614674234885999978877851--22338899999988488799997288689997887 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVAD--EGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLR 78 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~--~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~ 78 (338) |+|. ++++++++++++++ ...+.....+|++..--..+.+ ...++++.+-.+++.|++|++....+-.++...+. T Consensus 1 m~~r--~l~~~~~~~~~~~~-~a~a~~~p~~l~~gv~P~~~~~~~~~~~~Pl~~yL~~~lg~~Ve~~~~~~y~~~~eal~ 77 (288) T TIGR03431 1 MLRR--LILSLVAAFMLISS-NAQAEDWPKELNFGIIPTENASDLKQRWEPLADYLSKKLGVKVKLFFATDYAGVIEGMR 77 (288) T ss_pred CCHH--HHHHHHHHHHHHCC-HHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHH T ss_conf 9088--99999999999743-22430488368999806999999999999999999998789789996899999999998 Q ss_pred HHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCC Q ss_conf 53126641011013337899988752126136770002100012432115324310001100111245876456521200 Q gi|254780561|r 79 MEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINST 158 (338) Q Consensus 79 a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~ 158 (338) +..+|+.+.............+. .+.-.... .. +.+. +...++.+++ .+.+++.||. T Consensus 78 ----~g~~Dia~~~p~~yv~a~~~~~~-~~la~~~~------~~----g~~~--y~s~iivr~D---s~i~sl~DLk--- 134 (288) T TIGR03431 78 ----FGKVDIAWYGPSSYAEAYQKANA-EAFAIEVN------AD----GSTG--YYSVLIVKKD---SPIKSLEDLK--- 134 (288) T ss_pred ----CCCCEEEEECCHHHHHHHHCCCC-EEEEEEEC------CC----CCCC--EEEEEEEECC---CCCCCHHHHC--- T ss_conf ----59830999884788998652597-48899851------58----8864--5799999899---9888778957--- Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCC Q ss_conf 06828985047653036789999871773137898777628846830847898886225865699950449999985079 Q gi|254780561|r 159 KPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQ 238 (338) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 238 (338) ..++++.++...+........+....+....+.+.. ..+.++.......+..|+++.+........ ....++. T Consensus 135 -Gk~iaf~~~~StSG~l~P~~~L~~~~g~~~~~~f~~-----v~~~gshd~~~~aV~~G~~Dag~~~~~~~~-~~~~~~~ 207 (288) T TIGR03431 135 -GKTFGFVDPNSTSGFLVPSYYLFKKNGIKPKEYFKK-----VTFSGSHEAAILAVANGTVDAATTNDENLD-RMIRKGQ 207 (288) T ss_pred -CCEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHH-----EEECCCHHHHHHHHHCCCCCEEEECHHHHH-HHHHCCC T ss_conf -987660287411113999999998659972545531-----331578789999998599329983388899-9997479 Q ss_pred C----CEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHCC----HHHHHHHH Q ss_conf 9----737883169808888642433789998999999999639----99999988 Q gi|254780561|r 239 D----DYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMIS----PSFQRILP 286 (338) Q Consensus 239 ~----~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls----~e~Q~~~a 286 (338) + .+.++...+. .-...+++.++- .++.-.++-+.|++ +++++.+- T Consensus 208 ~~~~~~lrvi~~S~~--iP~~~~~v~~~l-~~~~~~~i~~all~l~~~d~~~~~~l 260 (288) T TIGR03431 208 PDAMEDLRIIWKSPL--IPNGPIVYRKDL-PADLKAKIRKAFLNYHKTDKACFEKI 260 (288) T ss_pred CCHHHCEEEEEECCC--CCCCCEEEECCC-CHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 630313699986788--887718996999-99999999999980786787689987 No 36 >PRK09495 glnH glutamine ABC transporter periplasmic protein; Reviewed Probab=98.18 E-value=0.00041 Score=42.17 Aligned_cols=206 Identities=17% Similarity=0.096 Sum_probs=105.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCC--------CHHHHHHHHHHHHHHHCCEEEEEECCCHHH Q ss_conf 902799999999999999861467423488599997887785--------122338899999988488799997288689 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVA--------DEGAGPKIKQAFERKCNCELKLIGLSDGVA 72 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~--------~~~~~~~i~~~Fek~~gikV~~~~~~~~~~ 72 (338) |||++++++++++++.+++++ +++++|+|.+-..|.+ ..++.-.+.+++.++-|++++++.. +-.. T Consensus 1 mk~~~~~~~~~~~~~~~~~~~-----a~~~~l~Vg~~~~~pPf~f~~~g~~~GfdvDl~~~ia~~lg~~~e~~~~-~~~~ 74 (247) T PRK09495 1 MKSVLKVSLAALTLAFAVSSH-----AADKELVVATDTAFVPFEFKQGDKYVGFDVDLWAAIAKELKLDYTLKPM-DFSG 74 (247) T ss_pred CHHHHHHHHHHHHHHHHHHHH-----HCCCEEEEEECCCCCCEEECCCCEEEEEHHHHHHHHHHHHCCCEEEEEC-CHHH T ss_conf 977999999999999987665-----1499499998998798057679927882999999999996994699977-8899 Q ss_pred HHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHH Q ss_conf 99788753126641011013337899988752126136770002100012432115324310001100111245876456 Q gi|254780561|r 73 LLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFD 152 (338) Q Consensus 73 ~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~ 152 (338) ++..++. ..+|+++..-.....+...-.. ..||......+...++ ....++++ T Consensus 75 ~i~~l~~----g~~D~~~~~~~~T~eR~~~~~F---------------------S~py~~~~~~i~v~~~--~~~i~~~~ 127 (247) T PRK09495 75 IIPALQT----KNIDLALAGITITDERKKAIDF---------------------SDGYYKSGLLVMVKAN--NNDIKSVK 127 (247) T ss_pred HHHHHHC----CCCCEEECCCCCCHHHHHHCCC---------------------CCCCEEEEEEEEEECC--CCCCCCHH T ss_conf 9999976----9956364660147899843334---------------------7770772469999899--97779824 Q ss_pred HHCCCCCCCE-EEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHH-HHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHH Q ss_conf 5212000682-898504765303678999987177313789877-76288468308478988862258656999504499 Q gi|254780561|r 153 ELINSTKPWK-IIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKK-IATKTATVTKGWTESYGFFLKGESDFVLSYSTSPG 230 (338) Q Consensus 153 ~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 230 (338) ||. |+ ++.... +... ++... ..........+..+....+..|+++..+.-..... T Consensus 128 dL~-----Gk~v~v~~G--s~~~----------------~~~~~~~~~~~i~~~~~~~~~~~aL~~GrvDa~v~d~~~~~ 184 (247) T PRK09495 128 DLD-----GKVVAVKSG--TGSV----------------DYAKANIKTKDLRQFPNIDNAYLELGTGRADAVLHDTPNIL 184 (247) T ss_pred HCC-----CCEEEEECC--CHHH----------------HHHHHHCCCCCEEECCCHHHHHHHHHCCCEEEEEECHHHHH T ss_conf 548-----988998469--5899----------------99996388986696598899999876587349995689999 Q ss_pred HHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCC Q ss_conf 9998507997378831698088886424337899 Q gi|254780561|r 231 FYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSK 264 (338) Q Consensus 231 ~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~ 264 (338) +.....+...+..+.+ .......+++++|+++ T Consensus 185 ~~~~~~~~~~~~~~~~--~~~~~~~~ia~~k~~~ 216 (247) T PRK09495 185 YFIKTAGNGQFKAVGD--SLEAQQYGIAFPKGSD 216 (247) T ss_pred HHHHHCCCCCEEEECC--CCCCCEEEEEEECCHH T ss_conf 9998489997899467--7776459999928829 No 37 >PRK11260 cystine transporter subunit; Provisional Probab=98.12 E-value=0.00055 Score=41.47 Aligned_cols=222 Identities=14% Similarity=0.095 Sum_probs=111.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCC---------CEEEEEECCCCCC---------CHHHHHHHHHHHHHHHCCEE Q ss_conf 90279999999999999986146742348---------8599997887785---------12233889999998848879 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAK---------PVLTVYTYNSFVA---------DEGAGPKIKQAFERKCNCEL 62 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~l~v~~~~~~~~---------~~~~~~~i~~~Fek~~gikV 62 (338) |||+.+.++..+++++++++|+.++++++ ++|+|-+...+.+ ..++.-++.+++.++-|+++ T Consensus 1 ~~~~~~~~~~~~l~~~~~~~~~~~~~a~~~~L~~I~~~G~L~Vg~~~~~pPf~~~d~~G~~~GfdvDl~~~ia~~lg~~~ 80 (264) T PRK11260 1 LAHLGRQALMGVMAVALVAGMSVKSFADEGLLNKVKERGTLLVGLEGTYPPFSFQGEDGKLTGFEVEFAEALAKHLGVKA 80 (264) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCEEEEEECCCCCCCEEECCCCCEEEEEHHHHHHHHHHHCCCE T ss_conf 90467999999999999998357654342439999869989999778988936899999988871999999999849956 Q ss_pred EEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCC Q ss_conf 99972886899978875312664101101333789998875212613677000210001243211532431000110011 Q gi|254780561|r 63 KLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKR 142 (338) Q Consensus 63 ~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~ 142 (338) +++.. +-..++..+.. ..+|+++..-.....+... -.+..||......+...+. T Consensus 81 e~~~~-~~~~~i~~L~~----g~~Di~~~~~t~T~eR~~~---------------------~~Fs~PY~~~~~~~~v~~~ 134 (264) T PRK11260 81 SLKPT-KWDGMLASLDS----KRIDVVINQVTISDERKKK---------------------YDFSTPYTVSGIQALVKKG 134 (264) T ss_pred EEEEC-CHHHHHHHHHC----CCCCEEECCCCCCHHHHHH---------------------CCCCCCEEEEEEEEEEECC T ss_conf 99978-88999999866----9946672575358888851---------------------7767743774459999878 Q ss_pred CCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHH-HHHCCCEEECCCHHHHHHHHCCCCCE Q ss_conf 12458764565212000682898504765303678999987177313789877-76288468308478988862258656 Q gi|254780561|r 143 QITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKK-IATKTATVTKGWTESYGFFLKGESDF 221 (338) Q Consensus 143 ~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~ 221 (338) .....+++.||.. .+++.... +... +++.+ ..........+..+....+..|+++. T Consensus 135 -~~~~i~~~~dL~G----k~V~v~~g--s~~~----------------~~l~~~~~~~~i~~~~~~~~~~~~l~~G~vDa 191 (264) T PRK11260 135 -NEGTIKTAADLKG----KKVGVGLG--TNYE----------------EWLRQNVQGVDVRTYDDDPTKYQDLRVGRIDA 191 (264) T ss_pred -CCCCCCCHHHHCC----CEEEEECC--CHHH----------------HHHHHHCCCCCEEEECCHHHHHHHHHCCCCCE T ss_conf -9766689789668----71688569--7899----------------99986088981686366189999987598679 Q ss_pred EEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 9995044999998507997378831698088886424337899989999999996 Q gi|254780561|r 222 VLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFM 276 (338) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~fl 276 (338) .+.-.. ................ .+.......++++.|+ ++++...+=++| T Consensus 192 ~i~d~~-~~~~~~~~~~~~~~~~--~~~~~~~~~~iav~k~--~~~L~~~in~~L 241 (264) T PRK11260 192 ILVDRL-AALDLVKKTNDTLAVT--GEAFSRQESGVALRKG--NPDLLKAVNNAI 241 (264) T ss_pred EEECHH-HHHHHHHHCCCCEEEC--CCCCCCCEEEEEEECC--CHHHHHHHHHHH T ss_conf 994599-9999998689978980--6876665089999899--999999999999 No 38 >PRK10797 glutamate and aspartate transporter subunit; Provisional Probab=97.96 E-value=0.0011 Score=39.68 Aligned_cols=245 Identities=14% Similarity=0.096 Sum_probs=119.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCC--------CCEEEEEECCCCCC-------------CHHHHHHHHHHHHHHH- Q ss_conf 9027999999999999998614674234--------88599997887785-------------1223388999999884- Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPA--------KPVLTVYTYNSFVA-------------DEGAGPKIKQAFERKC- 58 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~l~v~~~~~~~~-------------~~~~~~~i~~~Fek~~- 58 (338) |||+..+++++++++++++||+..++++ .++|+|-+-.++.+ +-.....|.++..++- T Consensus 3 ~kk~~~~~l~~~l~~~~~~~~~~~~a~~~tL~~Ik~~G~l~vG~~~~~pPf~~~d~~g~~vGfdvDla~~ia~~l~~~l~ 82 (302) T PRK10797 3 LRKLATALLALGLSAGLAQAEDAAPAAGSTLDKIAKNGVIVVGHRESSVPFSYYDNQQKVVGYSQDYSNAIVEAVKKKLN 82 (302) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCEEEEEECCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 58999999999999999745677753227199998699899998999899716899998515417999999999987744 Q ss_pred --CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEH Q ss_conf --887999972886899978875312664101101333789998875212613677000210001243211532431000 Q gi|254780561|r 59 --NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLA 136 (338) Q Consensus 59 --gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 136 (338) |++|+++... .+..+.-++ +..+|+++.......++...- .+..||...... T Consensus 83 ~~~~~v~~v~~~-~~~rip~L~----~gk~Di~~~~~t~T~eR~k~v---------------------dFS~pY~~~~~~ 136 (302) T PRK10797 83 KPDLQVKLIPIT-SQNRIPLLQ----NGTFDFECGSTTNNLERQKQA---------------------AFSDTIFVVGTR 136 (302) T ss_pred CCCCEEEEEECC-HHHHHHHHH----CCCCCEEEECCCCCHHHHCCE---------------------EECCCCEECCEE T ss_conf 677349999757-888779997----898088973575697774124---------------------214662771468 Q ss_pred HHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHC Q ss_conf 11001112458764565212000682898504765303678999987177313789877762884683084789888622 Q gi|254780561|r 137 FIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLK 216 (338) Q Consensus 137 ~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (338) +...++ ...++++||. |+......+.... ......... ...........+..+....+.+ T Consensus 137 llv~k~---s~Iks~~DL~-----GK~V~V~~GTt~e--~~~~~~~~~----------~~~~~~iv~~~~~~~a~~~L~~ 196 (302) T PRK10797 137 LLTKKG---GDIKDFADLK-----GKAVVVTSGTTSE--VLLNKLNEE----------QKMNMRIISAKDHGDSFRTLES 196 (302) T ss_pred EEEECC---CCCCCHHHHC-----CCEEEEECCCHHH--HHHHHHHHH----------HCCCCEEEECCCHHHHHHHHHC T ss_conf 999778---8666756608-----9979996787399--999986675----------0677237850888999999985 Q ss_pred CCCCEEEEECCHHHHHHHH-HCCCCEEEEECCCCEEEEEEEEEEECCCCCHHHH---HHHHHHHC-CHHHHHHHHHCCCE Q ss_conf 5865699950449999985-0799737883169808888642433789998999---99999963-99999998850450 Q gi|254780561|r 217 GESDFVLSYSTSPGFYLLN-YGQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLA---QEFMQFMI-SPSFQRILPTTNWM 291 (338) Q Consensus 217 g~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A---~~Fi~fll-s~e~Q~~~a~~~~~ 291 (338) |.++..+.-..-....... .......++ .+.....+.++++.|+. +++- -+.|.-|. +-+.++ |.+..+. T Consensus 197 GrvDA~~~D~~~l~~~~~~~~~~~~~~iv--~~~~~~e~ygia~rKgd--~~L~~~Vn~aL~~l~~dGt~~~-I~~KWF~ 271 (302) T PRK10797 197 GRAVAFMMDDALLAGERAKAKKPDNWEIV--GKPQSQEAYGCMLRKDD--PQFKKLMDDTIAQAQTSGEAEK-WFDKWFK 271 (302) T ss_pred CCCCEEEECHHHHHHHHHHCCCCCCEEEE--CCCCCCCCEEEEEECCC--HHHHHHHHHHHHHHHHCCCHHH-HHHHHCC T ss_conf 99509981229888999863699754993--78677760799996899--9999999999999997782999-9998759 Q ss_pred EECCC Q ss_conf 50557 Q gi|254780561|r 292 YPVVD 296 (338) Q Consensus 292 ~P~~~ 296 (338) -|..+ T Consensus 272 ~p~P~ 276 (302) T PRK10797 272 NPIPP 276 (302) T ss_pred CCCCC T ss_conf 99999 No 39 >PRK11553 alkanesulfonate transporter substrate-binding subunit; Provisional Probab=97.89 E-value=0.0015 Score=38.95 Aligned_cols=229 Identities=12% Similarity=0.058 Sum_probs=110.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHH Q ss_conf 9027999999999999998614674234885999978877851223388999999884-887999972886899978875 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRM 79 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a 79 (338) |||++++.++.++++++++ +.++++..+|+|-... +.... .+..-..-|||++ |+||++..+.++..++.-+.+ T Consensus 1 ~~~~l~~~la~~~~~~~~~---~~~~~~~~~lrIgyq~-~~~~~-~~~~~~~~~ek~~~~~kV~w~~F~~G~~~~eAl~a 75 (314) T PRK11553 1 MRNIIKLALAGLLSVSTLA---VAAESSPEALRIGYQK-GSIGL-VLAKSHQLLEKRFPQTKISWVEFPAGPQMLEALNV 75 (314) T ss_pred CHHHHHHHHHHHHHHHHHH---HHHHCCCCEEEEEECC-CCHHH-HHHHHCCHHHHHCCCCCEEEEECCCCHHHHHHHHC T ss_conf 9779999999999998754---4433489759999617-74158-99985286766178983589978974899999976 Q ss_pred HHCCCCCEEEECCCHHHHHHHHHHHC-CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCC Q ss_conf 31266410110133378999887521-26136770002100012432115324310001100111245876456521200 Q gi|254780561|r 80 EGENSAADIVLGFDNNLIDLARKTGL-FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINST 158 (338) Q Consensus 80 ~~~~~~~Dv~~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~ 158 (338) ...|+-...+...+.... .+. +.... ..........+... ...+.+|..|| T Consensus 76 ----G~iD~~~~g~~p~i~a~a-~G~~~~~va----------------~~~~~~~~~~i~V~---~~S~I~s~aDL---- 127 (314) T PRK11553 76 ----GSIDLGSTGDIPPIFAQA-AGADLVYVG----------------VEPPKPKAEVILVA---ENSPIKTVADL---- 127 (314) T ss_pred ----CCCCEEEECCHHHHHHHH-CCCCEEEEE----------------EECCCCCCEEEEEE---CCCCCCCHHHH---- T ss_conf ----997545115858999986-699869999----------------86378864189984---89877888893---- Q ss_pred CCCE-EEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHC Q ss_conf 0682-898504765303678999987177313789877762884683084789888622586569995044999998507 Q gi|254780561|r 159 KPWK-IIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYG 237 (338) Q Consensus 159 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 237 (338) +|| +++. . .+.....+..++ ...+.+..+ +..+.-...+....+.+|.+|....|...... ....+ T Consensus 128 -kGKkVa~~-~-Gs~~~~~l~~aL-~~aGL~~~D--------V~~v~l~p~d~~aAl~~G~VDA~~~w~P~~~~-a~~~~ 194 (314) T PRK11553 128 -KGHKVAFQ-K-GSSSHNLLLRAL-RQAGLKFTD--------IQPTYLTPADARAAFQQGNVDAWAIWDPYYSA-ALLQG 194 (314) T ss_pred -CCCEEEEE-C-CCHHHHHHHHHH-HHCCCCHHH--------EEEEECCCHHHHHHHHCCCCCEEEECCHHHHH-HHHCC T ss_conf -89989741-7-973799999999-986998899--------18984593889999966997889975678999-98628 Q ss_pred CCCEEEEECCCCEEEEEEEEEEEC---CCCCHHHHHHHHHHHCC Q ss_conf 997378831698088886424337---89998999999999639 Q gi|254780561|r 238 QDDYVAALFSEGHYLQIEVAAQLV---RSKQPQLAQEFMQFMIS 278 (338) Q Consensus 238 ~~~~~~~~~~eG~~~~~~~~~i~k---~a~n~~~A~~Fi~flls 278 (338) + .++..-.+|..... .+.+.. -.+||+.-.+|++.+.. T Consensus 195 g--aRvl~dg~~~~~~~-~~~~~~~~fa~~~p~~v~~~l~~l~~ 235 (314) T PRK11553 195 G--VRVLKDGTDLNQTG-SFYLAARPYAEKNGAFIQQVLATLSE 235 (314) T ss_pred C--CEEEECCCCCCCCC-CEEEECHHHHHHCHHHHHHHHHHHHH T ss_conf 9--58985376667776-16886189888799999999999999 No 40 >COG1464 NlpA ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism] Probab=97.80 E-value=0.0021 Score=38.04 Aligned_cols=251 Identities=14% Similarity=0.087 Sum_probs=120.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHH Q ss_conf 90279999999999999986146742348859999788778512233889999998848879999728868999788753 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRME 80 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~ 80 (338) ||++.++.+++++++ ++++|+...+.+..+++|.+.. .+...+++.+++.=.++.|++++++.+.+...- +...+ T Consensus 1 m~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~I~vg~~~--~p~a~ile~~~k~~~~k~Gi~l~i~~FtDY~~P-N~AL~- 75 (268) T COG1464 1 MKKLKKLALALVALL-ALAACGAAAAKATKTIKVGATP--GPHAEILEVVVKPALKKKGLDLKIVEFTDYVQP-NEALA- 75 (268) T ss_pred CCHHHHHHHHHHHHH-HHHHHHHHCCCCCCCEEEEECC--CCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCH-HHHHH- T ss_conf 943889999999999-9975223113457717996368--965999999987788855965999981487665-57876- Q ss_pred HCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCC Q ss_conf 12664101101333789998875212613677000210001243211532431000110011124587645652120006 Q gi|254780561|r 81 GENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKP 160 (338) Q Consensus 81 ~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~ 160 (338) +...|.-..-...++.......-.. ....+..+..|.. .+...-++..+| ... T Consensus 76 --~gdiDaN~FQH~pyL~~~~k~~~~~-----------Lv~vg~~~i~Pmg-----------~YSkk~ksl~el---~~G 128 (268) T COG1464 76 --DGDIDANAFQHKPYLDQFNKEHGGK-----------LVAVGNTHIEPMG-----------LYSKKYKSLAEL---KDG 128 (268) T ss_pred --CCCCCCHHHHCHHHHHHHHHHCCCC-----------EEEEEEEEECCCE-----------ECCHHCCCHHHC---CCC T ss_conf --6884600121468899999973997-----------8987557761425-----------012200868467---999 Q ss_pred CEEEEEECCCCCCHHHHHHH---HHHHCC---CCHHHHHHHHHHCCCEEEC-CCHHHHHHHHCCCCCEEEEECCHHHHHH Q ss_conf 82898504765303678999---987177---3137898777628846830-8478988862258656999504499999 Q gi|254780561|r 161 WKIIYQDPRTSTLGLGLLLW---IQKIYG---DNSAQVWKKIATKTATVTK-GWTESYGFFLKGESDFVLSYSTSPGFYL 233 (338) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~---~~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 233 (338) ..+++++...+......... ++.... ...+...........++.. ...+....+. +++..+...++ + T Consensus 129 atIaiPNDpsN~gRAL~lL~~aGLIkLk~~~~~~aT~~DI~eNPK~lki~EldAaqlpRaLd--dvD~AvIN~ny----A 202 (268) T COG1464 129 ATIAIPNDPTNEGRALLLLQKAGLIKLKDGVNLLATPKDITENPKNLKIKELEAAQLPRALD--DVDAAVINTNY----A 202 (268) T ss_pred CEEECCCCCCCHHHHHHHHHHCCCEEECCCCCCCCCHHHHHHCCCCCEEEECCHHHCCCCCC--CCCEEEECCHH----H T ss_conf 98988789873567999999879679768876558887886194457568705675653203--56779874307----9 Q ss_pred HHHCCCCEEEEECCCC--EEEEEEEEEEECCCCCHHHHHHHHHHHCCHHHHHHHHHCC Q ss_conf 8507997378831698--0888864243378999899999999963999999988504 Q gi|254780561|r 234 LNYGQDDYVAALFSEG--HYLQIEVAAQLVRSKQPQLAQEFMQFMISPSFQRILPTTN 289 (338) Q Consensus 234 ~~~~~~~~~~~~~~eG--~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e~Q~~~a~~~ 289 (338) ...+-.+-+-.+..|+ .-.+...+++-++.++.+.-+++++-+.|+|+++.+-+.+ T Consensus 203 ~~AgL~p~kdai~~e~~~~spY~Niivvr~~d~d~~~ik~lv~a~qs~evk~~i~k~y 260 (268) T COG1464 203 LQAGLNPKKDALFEEDKDSSPYVNIIVVREEDKDDPAVKKLVEAYQSDEVKAFIEKKY 260 (268) T ss_pred HHCCCCCCCCCEECCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 7759991204214146667862479997155668889999999974999999999972 No 41 >PRK10859 putative transglycosylase; Provisional Probab=97.66 E-value=0.0035 Score=36.78 Aligned_cols=213 Identities=15% Similarity=0.115 Sum_probs=107.5 Q ss_pred CCCEEEEEECCC---C----CCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHH Q ss_conf 488599997887---7----851223388999999884887999972886899978875312664101101333789998 Q gi|254780561|r 28 AKPVLTVYTYNS---F----VADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLA 100 (338) Q Consensus 28 ~~~~l~v~~~~~---~----~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~ 100 (338) +.++|+|.|-.+ | ..+.++.=++++.|.+.-|++++++...+.++++..+..+ ..|+....-...-. T Consensus 41 ~rG~LrV~T~~spttYy~~~~~~~GFEYELak~FA~~LGV~L~i~~~~~~~el~~~L~~G----~~DiaAAgLt~t~~-- 114 (507) T PRK10859 41 ARGELRVSTIHSPLTYSTINGKPFGLDYELAKQFADYLGVKLKVTVRQNISQLFDDLDNG----NADLLAAGLVYNSE-- 114 (507) T ss_pred HCCEEEEEEECCCCEEEECCCCCCCHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCC----CCCEEECCCCCCHH-- T ss_conf 789689999758965994199647799999999999809917999749999999998679----98889557877853-- Q ss_pred HHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHH Q ss_conf 87521261367700021000124321153243100011001112458764565212000682898504765303678999 Q gi|254780561|r 101 RKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLW 180 (338) Q Consensus 101 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (338) ....+. .+.++....-.++|++ ....|++..||. |+......+ +.+...+ .. T Consensus 115 -R~~~~~------------------~~p~Y~~v~q~LVyr~--g~~rP~~l~DL~-----G~~I~V~~g-Ssy~e~L-~~ 166 (507) T PRK10859 115 -RVKNYQ------------------PGPTYYSVSQQLVYRV--GQYRPRTLGDLT-----GEQLTVAPG-HVVVNDL-QT 166 (507) T ss_pred -HHHCCC------------------CCCCCCCEEEEEEEEC--CCCCCCCHHHCC-----CCEEEECCC-CHHHHHH-HH T ss_conf -674043------------------5686503003999958--998988977837-----986998179-7599999-99 Q ss_pred HHHH-CCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEE--CCCCEEEEEEEE Q ss_conf 9871-7731378987776288468308478988862258656999504499999850799737883--169808888642 Q gi|254780561|r 181 IQKI-YGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAAL--FSEGHYLQIEVA 257 (338) Q Consensus 181 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~eG~~~~~~~~ 257 (338) +... .+.-.+. ..-..+..+....+.+|+++..++-+.... ..+.-.+++.+.+ ..+..+ .| T Consensus 167 Lk~~~~P~L~w~---------~~~~~~teeLL~~V~~g~IdyTVaDS~~~~--l~qr~~P~L~vaf~l~~~~~i----aW 231 (507) T PRK10859 167 LKETKFPELSWK---------VDDKKGSAELLEQVIEGKLDYTIADSVAIS--LFQRVHPELAVALDVTDEQPV----TW 231 (507) T ss_pred HHHCCCCCEEEE---------ECCCCCHHHHHHHHHCCCCCEEEECCHHHH--HHHHHCCCHHCCCCCCCCCCE----EE T ss_conf 885348971599---------558999999999997598776885548899--877648440122456887513----78 Q ss_pred EEECCCCCH--HHHHHHHHHHCCHHHHHHHHHCC Q ss_conf 433789998--99999999963999999988504 Q gi|254780561|r 258 AQLVRSKQP--QLAQEFMQFMISPSFQRILPTTN 289 (338) Q Consensus 258 ~i~k~a~n~--~~A~~Fi~flls~e~Q~~~a~~~ 289 (338) ++++++++. .++..|++-+.....-..+-++. T Consensus 232 a~~k~~d~sL~~al~~ff~~~~~~g~l~~L~erY 265 (507) T PRK10859 232 FSPLDDDNTLSAALLDFFNEMNEDGTLARLEEKY 265 (507) T ss_pred EECCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 8458997899999999999850484899999998 No 42 >PRK11480 tauA taurine transporter substrate binding subunit; Provisional Probab=97.58 E-value=0.0046 Score=36.08 Aligned_cols=213 Identities=13% Similarity=0.047 Sum_probs=104.3 Q ss_pred HHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHH Q ss_conf 86146742348859999788778512233889999998848879999728868999788753126641011013337899 Q gi|254780561|r 19 SYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLID 98 (338) Q Consensus 19 ~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~ 98 (338) +.|....+++..+|+|--..+ ..+. ..-.--..|||+.|++|++..+.++..++.-+.++ ..|+-...+..... T Consensus 12 ~~~~~~~~a~~~~v~igYq~~-~~p~-~~aka~g~~ek~~G~kV~W~~F~sG~~~~eAlasG----~vDig~~G~~P~~~ 85 (320) T PRK11480 12 ALAFIAFQAQAVNVTVAYQTS-AEPA-KVAQADNTFAKESGATVDWRKFDSGASIVRALASG----DVQIGNLGSSPLAV 85 (320) T ss_pred HHHHHHHHCCCCEEEEEEECC-CCCH-HHHHHCCCHHHHCCCCEEEEECCCHHHHHHHHHCC----CCCEECCCCCHHHH T ss_conf 986314215764599998069-8628-99986384567519812799779759999999669----97564577847999 Q ss_pred HHHHHHC-CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHH Q ss_conf 9887521-261367700021000124321153243100011001112458764565212000682898504765303678 Q gi|254780561|r 99 LARKTGL-FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGL 177 (338) Q Consensus 99 ~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~ 177 (338) ... .++ +.-... ...+ .....+..+ ...++..| .+||..-...+ +.....+ T Consensus 86 a~a-~g~~ik~v~i-----------~~~~-----~~~ealvv~-----~~I~s~~D-----LkGKkVav~~g-S~~hy~l 137 (320) T PRK11480 86 AAS-QQVPIEVFLL-----------ASKL-----GNSEALVVK-----KTISKPED-----LIGKRIAVPFI-STTHYSL 137 (320) T ss_pred HHH-CCCCEEEEEE-----------ECCC-----CCCEEEEEC-----CCCCCHHH-----CCCCEEECCCC-CCHHHHH T ss_conf 986-7998599998-----------5478-----974389956-----89999688-----49998960788-7318999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCC--EEEEEE Q ss_conf 999987177313789877762884683084789888622586569995044999998507997378831698--088886 Q gi|254780561|r 178 LLWIQKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEG--HYLQIE 255 (338) Q Consensus 178 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG--~~~~~~ 255 (338) ..++...+.. ..+ +..+.-...+....+.+|.+|....|.. ....+.+.+ ++......-+ .....+ T Consensus 138 l~aL~~~Gl~-~~d--------V~iv~m~p~d~~aA~~~G~vDa~~~W~P--~l~~l~~~G-~vL~~s~~~~~~G~pt~d 205 (320) T PRK11480 138 LAALKHWGIK-PGQ--------VEIVNLQPPAIIAAWQRGDIDGAYVWAP--AVNALEKDG-KVLTDSEQVGQWGAPTLD 205 (320) T ss_pred HHHHHHCCCC-HHH--------EEEEECCCHHHHHHHHCCCCCEEEECCH--HHHHHHHCC-CEEEECHHHHHCCCCCCC T ss_conf 9999985999-899--------0788449288999997699787962677--999998439-788641664204897455 Q ss_pred EEEEECC--CCCHHHHHHHHHHHC Q ss_conf 4243378--999899999999963 Q gi|254780561|r 256 VAAQLVR--SKQPQLAQEFMQFMI 277 (338) Q Consensus 256 ~~~i~k~--a~n~~~A~~Fi~fll 277 (338) .+++.+. .+|||.-.+|++-+. T Consensus 206 ~~vv~~~Fa~e~Pe~V~~flkv~~ 229 (320) T PRK11480 206 VWVVRKDFAEKHPEVVKAFAKSAI 229 (320) T ss_pred EEEECHHHHHHCHHHHHHHHHHHH T ss_conf 688639999889899999999999 No 43 >COG1732 OpuBC Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) [Cell envelope biogenesis, outer membrane] Probab=97.20 E-value=0.0067 Score=35.12 Aligned_cols=248 Identities=14% Similarity=0.069 Sum_probs=109.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC---CCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEE-CCCHHHHHHH Q ss_conf 90279999999999999986146---7423488599997887785122338899999988488799997-2886899978 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL---DGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIG-LSDGVALLNK 76 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~---~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~-~~~~~~~~~~ 76 (338) ||++..+...++..+.++++|+. +++.++.+++|-+=. .+|..++-.|+...=+++|++|+... .|++.-...- T Consensus 1 ~~~~~~~~~~~l~~~~~l~~~s~~~~~~~~~~~~I~VgsK~--~tE~~IL~~m~~~lle~~~~kv~~~~~lG~t~v~~~A 78 (300) T COG1732 1 SKRRMLLIKLLLGSLLLLAACSLPGLGSASAAKTIVVGSKI--FTEQYILGNILKQLLEKNGIKVEDKTGLGGTAVVRNA 78 (300) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEECCC--CCHHHHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHH T ss_conf 90899999999999999999603456642357887983477--8388999999999998659914541688852999999 Q ss_pred HHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCC----CCC-EEEEEEHHHHCCCCCC-CCCCC Q ss_conf 87531266410110133378999887521261367700021000124321----153-2431000110011124-58764 Q gi|254780561|r 77 LRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDI----FVP-YDYGYLAFIYDKRQIT-QPPKN 150 (338) Q Consensus 77 ~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~----~~~-~~~~~~~~~y~~~~~~-~~p~s 150 (338) +. ++.+|+..-.....+.......-....++...-..........+ ..| ....++.+...++.-. ...+| T Consensus 79 l~----~G~IDiYpEYTGt~~~~~lk~~~~~~~dp~~~y~~vK~~~~k~~~l~wl~p~~~nNTyA~av~~~~A~~~~i~T 154 (300) T COG1732 79 LK----SGDIDIYPEYTGTALFSFLKKDPPASKDPKKVYETVKKLDAKQFKLVWLKPAGFNNTYALAVRKDVAEKYNLET 154 (300) T ss_pred HH----CCCCCEEEEECCHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCEEEEECHHHHHHHCCCC T ss_conf 97----59877685432314566345676454798999999999888607978965468776058996198898819816 Q ss_pred HHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCC-CEEECCCHHHHHHHHCCCCCEEEEECCHH Q ss_conf 56521200068289850476530367899998717731378987776288-46830847898886225865699950449 Q gi|254780561|r 151 FDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKT-ATVTKGWTESYGFFLKGESDFVLSYSTSP 229 (338) Q Consensus 151 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 229 (338) ..||..-...-+.++..-- .....+..+......+...- ...... .-.++++.+|.++++.+|+++. T Consensus 155 iSDLak~~~~l~~g~~~eF-----------~~R~DG~~~l~k~Yg~~~~~~~~~m~~-gl~y~Al~~g~~d~~~~YsTDg 222 (300) T COG1732 155 ISDLAKHSNQLKLGADSEF-----------AERADGLPALQKAYGFDFKPDLRTMDG-GLTYQALKNGTVDAADAYSTDG 222 (300) T ss_pred HHHHHHHHHHCEECCCHHH-----------HCCCCCCHHHHHHHCCCCCCCCEECCC-HHHHHHHHCCCCCEEEECCCCC T ss_conf 9999986653462278423-----------136512289999848766887443381-5899998749977676314552 Q ss_pred HHHHHHHCCCCEEEEECCCCEEEEEEEEEEECC---CCCHHHHHH Q ss_conf 999985079973788316980888864243378---999899999 Q gi|254780561|r 230 GFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVR---SKQPQLAQE 271 (338) Q Consensus 230 ~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~---a~n~~~A~~ 271 (338) ... ...+.+.--..+.+..-++.-|++. .+||+++.. T Consensus 223 ~I~-----~~~L~VLkDDK~~fP~Y~~apvvre~vlk~~Pel~~~ 262 (300) T COG1732 223 RIA-----AYGLKVLKDDKGFFPPYQAAPVVREEVLKKHPELKTI 262 (300) T ss_pred CCC-----CCCCEEEECCCCCCCCCCCCCEECHHHHHHCHHHHHH T ss_conf 212-----2685797068767998765640218777679889999 No 44 >pfam02030 Lipoprotein_8 Hypothetical lipoprotein (MG045 family). This family includes hypothetical lipoproteins, the amino terminal part of this protein is related to pfam01547, a family of solute binding proteins. This suggests this family also has a solute binding function. Probab=97.09 E-value=0.017 Score=32.70 Aligned_cols=256 Identities=16% Similarity=0.207 Sum_probs=139.6 Q ss_pred CCHHHHH---HHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHH Q ss_conf 9027999---9999999999986146742348859999788778512233889999998848879999728868999788 Q gi|254780561|r 1 MKKFARI---VVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKL 77 (338) Q Consensus 1 MKk~~~~---~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~ 77 (338) |||-++. ++++.+. .++++|+ ..++++..+.+|..+ -+.+.-++.++ ..+..+.+.+.+..- T Consensus 1 mk~~~k~~f~~a~~t~~-~iltaCs------~~~fv~aNfesYmsp-----llle~~~~~~~--lsF~ty~tNE~L~~~- 65 (493) T pfam02030 1 MKKQLKYLFIFAGITFS-PILTACS------SSKFVVANFESYMSP-----LLLERAKRKRP--LTFLTYPNNEKLING- 65 (493) T ss_pred CCHHHHHHHHHHHCCHH-HHHHHCC------CCCEEEECHHHHCCH-----HHHHHHHHHCC--CCEEECCCHHHHHHH- T ss_conf 91145767764420132-6765426------784798411433387-----88999987258--742303756899864- Q ss_pred HHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCC---------------------------CCCCCC-----CC Q ss_conf 7531266410110133378999887521261367700021---------------------------000124-----32 Q gi|254780561|r 78 RMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLK---------------------------LPIKWD-----DD 125 (338) Q Consensus 78 ~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~---------------------------~~~~~~-----~~ 125 (338) -.+-.+||.+. +...+.++...+++.+.+...-++. ....+. -. T Consensus 66 ---Fk~ntYdVava-SaY~VakLa~~g~L~~IdWsKFNLk~~n~~~~~i~n~~DA~~LFT~~v~aIs~~ysd~K~~~LL~ 141 (493) T pfam02030 66 ---FANNTYDVAVA-SAYAVSELAKNGLLKPIDWAKFNLKKENNQSITVNNIEDAKKLFTKQIWAISNAYKDGKNDELLE 141 (493) T ss_pred ---HCCCCEEEEHH-HHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHH T ss_conf ---14462344203-58999999865971214587616544567752123546788862188999887651588324777 Q ss_pred CCCCEEEEEEHHHHCCCCCCCCC---CCHHHHCCC-------CCCCEEEEEECCCCCCHHHHHHHHHHHC------C--- Q ss_conf 11532431000110011124587---645652120-------0068289850476530367899998717------7--- Q gi|254780561|r 126 IFVPYDYGYLAFIYDKRQITQPP---KNFDELINS-------TKPWKIIYQDPRTSTLGLGLLLWIQKIY------G--- 186 (338) Q Consensus 126 ~~~~~~~~~~~~~y~~~~~~~~p---~s~~~l~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~--- 186 (338) |.+||..--..++|+-++...-+ -+|.+.... -...|++..+..+..+...-..-..... . T Consensus 142 y~VPYFlQdlvFaYrgkki~~L~~~dvsWsdi~k~l~~~~nrfN~~kl~~idDaRtiysia~li~~e~~N~ti~vNpk~~ 221 (493) T pfam02030 142 WMVPYFLQDLVFVYRGEKIPELEKKDVYWSDVIKAIVRHKDRFNKNRLIAIDDARTIFSLANIVQLENKNNIIDVNPKEL 221 (493) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCC T ss_conf 64478887678875146675655565269999999961835566541578512277875898886315573377654332 Q ss_pred --------CCHHHHHHHHHHC-C--CEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHH------------CCCCEEE Q ss_conf --------3137898777628-8--468308478988862258656999504499999850------------7997378 Q gi|254780561|r 187 --------DNSAQVWKKIATK-T--ATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNY------------GQDDYVA 243 (338) Q Consensus 187 --------~~~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~------------~~~~~~~ 243 (338) .+..... .++.+ . .-+-+........+..|...-+++|.++..+.++.. .+....+ T Consensus 222 ~~n~f~~~Yes~~~i-~~~~Nn~~~mf~NsDSNivLNelA~~~~~g~I~YNGD~~fAA~GGd~~~e~~~n~~p~~~nfHI 300 (493) T pfam02030 222 KTNYFLNVYESFSYL-GLKLNNLSNMFVNSDSNIVINELAMGRRQGGIVYNGDAVYAALGGDLRDEADENMKPTGDNFHI 300 (493) T ss_pred EECCCCCCHHHHHHH-HHHCCCHHHHHCCCCHHHHHHHHHCCCCCCCEEECCCEEEHHCCCCCCCCCCCCCCCCCCCEEE T ss_conf 211245600212046-5520526666305851788888751355662686441001014875545556567877774137 Q ss_pred EECCCCEEEEEEEEEEECC-CCCHHHHHHHHHHHC Q ss_conf 8316980888864243378-999899999999963 Q gi|254780561|r 244 ALFSEGHYLQIEVAAQLVR-SKQPQLAQEFMQFMI 277 (338) Q Consensus 244 ~~~~eG~~~~~~~~~i~k~-a~n~~~A~~Fi~fll 277 (338) +.|+. ++...++++|.|. ..|.++|++.|.=+. T Consensus 301 Vqpk~-t~~~LD~~ViNk~~~~~~q~Ay~~I~kla 334 (493) T pfam02030 301 VQPKD-SPVALDFLIINSQQKQFEQAAHEYINELA 334 (493) T ss_pred ECCCC-CCEEEEEEEECCCHHHHHHHHHHHHHHHH T ss_conf 44789-75354677751003555889999999997 No 45 >PRK10918 phosphate transporter subunit; Provisional Probab=96.90 E-value=0.026 Score=31.72 Aligned_cols=73 Identities=14% Similarity=0.141 Sum_probs=40.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHH Q ss_conf 90279999999999999986146742348859999788778512233889999998848879999728868999788753 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRME 80 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~ 80 (338) ||-...+..++++++..+++.+. .+..+|+ ..++ +....+++.-..+|.+++|++|+|...||+.- +.+..++ T Consensus 1 ~~~~~~~~~~~~~~~ls~~a~~a---~aa~~l~--GaGa-Tf~~p~~~~w~~~~~~~~g~~v~Y~~~GSG~G-~~~f~~g 73 (346) T PRK10918 1 MKVMRTTVATVVAATLSMSAFSA---FAAASLT--GAGA-TFPAPVYAKWADTYQKETGNKVNYQGIGSSGG-VKQIIAN 73 (346) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH---HHHCCCC--CCCC-CHHHHHHHHHHHHHHHHCCCEEEECCCCCHHH-HHHHHCC T ss_conf 90148899999999998877678---7616612--4143-34999999999999984598689826687899-9999738 No 46 >COG3181 Uncharacterized protein conserved in bacteria [Function unknown] Probab=96.50 E-value=0.051 Score=29.98 Aligned_cols=236 Identities=12% Similarity=0.035 Sum_probs=101.0 Q ss_pred CCEEEEEEC-CCCCCCHHHHHHHHHHHHHHHCCE--EEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 885999978-877851223388999999884887--99997288689997887531266410110133378999887521 Q gi|254780561|r 29 KPVLTVYTY-NSFVADEGAGPKIKQAFERKCNCE--LKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 29 ~~~l~v~~~-~~~~~~~~~~~~i~~~Fek~~gik--V~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) +..+|+..+ .-+..-+...+.+.+.-+|+.|-. |+.+..+++......+.....+.+.-+... ....+......++ T Consensus 27 ~~~it~Ivp~~~GGg~D~~aR~~~~~l~k~lg~~v~V~N~pGagG~ia~~~va~a~pG~t~~l~~~-~~~~~~~~~~~~~ 105 (319) T COG3181 27 ERPITIIVPAAAGGGTDQTARALAESLSKELGQPVVVDNKPGAGGAIAAGAVAKAAPGYTILLIAG-STPALLLPILGGL 105 (319) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHCCCCCCEEEEEC-CCCCCCCHHHCCC T ss_conf 787589996589875889999999999998089779995588731688999984589984488734-7643221544268 Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCC--CEEEEEECCCCCCHHHHHHHHHH Q ss_conf 2613677000210001243211532431000110011124587645652120006--82898504765303678999987 Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKP--WKIIYQDPRTSTLGLGLLLWIQK 183 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 183 (338) ...... +.. ........+..+..+ -..+.+|+.||.+.... +.+.....+...........+.. T Consensus 106 ~~~~~~---D~~--------pva~v~~~p~~l~v~---~~s~~~t~~dlv~~~k~~p~~v~~~~~g~Gs~dhl~~~~~~k 171 (319) T COG3181 106 PYYKLK---DFT--------PVASLVSDPGVLVVR---ADSPYKTLKDLVAYAKADPGSVIGGGSGLGSADHLAGALFAK 171 (319) T ss_pred CCCCHH---HCE--------EHHHEECCCCEEEEE---CCCCCCCHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHH T ss_conf 888544---230--------033111366159981---788736699999999859980895478889588999999999 Q ss_pred HCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEEC-------------CC-C Q ss_conf 177313789877762884683084789888622586569995044999998507997378831-------------69-8 Q gi|254780561|r 184 IYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALF-------------SE-G 249 (338) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~e-G 249 (338) ..+. +....-+++..+....+..|.+++.....++.. ...+.+...+-.++- .| | T Consensus 172 ~~Gi----------~~~~Vpy~g~gea~taLlgg~v~a~~~~~se~~-~~vksG~lr~Lav~s~eRl~~~pdvPT~~E~G 240 (319) T COG3181 172 AAGI----------KITYVPYKGGGEALTALLGGHVDAGSTNLSELL-SQVKSGTLRLLAVFSEERLPGLPDVPTLKEQG 240 (319) T ss_pred HHCC----------CEEEEEECCCCHHHHHHHCCCEEEEECCHHHHH-HHHCCCCEEEEEEECHHHCCCCCCCCCHHHCC T ss_conf 7389----------625774168648789986086654436712435-56305746787753431067799999867638 Q ss_pred ---EEEEEEEEEEECCCCCHHH---HHHHHHHHCCHHHHHHHHHCCC Q ss_conf ---0888864243378999899---9999999639999999885045 Q gi|254780561|r 250 ---HYLQIEVAAQLVRSKQPQL---AQEFMQFMISPSFQRILPTTNW 290 (338) Q Consensus 250 ---~~~~~~~~~i~k~a~n~~~---A~~Fi~flls~e~Q~~~a~~~~ 290 (338) .+....++..+++.+...- ...|=+-+-|||.|+.+.+..+ T Consensus 241 ~~~~~~~wrgvfap~g~~~e~~~~~~~a~kk~l~s~e~~~~~~~~~~ 287 (319) T COG3181 241 YDVVMSIWRGVFAPAGTPDEIIAKLSAALKKALASPEWQKRLKELGL 287 (319) T ss_pred CCEEEEEEEEEEECCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHCCC T ss_conf 75466443589807999999999999999998639889999985588 No 47 >COG2113 ProX ABC-type proline/glycine betaine transport systems, periplasmic components [Amino acid transport and metabolism] Probab=96.23 E-value=0.072 Score=29.09 Aligned_cols=99 Identities=13% Similarity=0.117 Sum_probs=46.6 Q ss_pred CCHHHHHHHHHHHHHH-HHHHHCCCCCCCCCEEEEE--ECCCCCCCHHHHHHHH-HHHHHHHCCEEEEEECCCHHHHHHH Q ss_conf 9027999999999999-9986146742348859999--7887785122338899-9999884887999972886899978 Q gi|254780561|r 1 MKKFARIVVGIMMITG-VISYCTLDGLPAKPVLTVY--TYNSFVADEGAGPKIK-QAFERKCNCELKLIGLSDGVALLNK 76 (338) Q Consensus 1 MKk~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~v~--~~~~~~~~~~~~~~i~-~~Fek~~gikV~~~~~~~~~~~~~~ 76 (338) |++..+++.+++++++ .+++|+........++++. +|.+.. ..-.+. ..+ |++|++|+++.. +..-+.+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~a~~a~~~~~~~tV~~a~~~W~s~~----~~t~v~~~iL-k~~Gy~v~~~~~-~~~~~~~s 74 (302) T COG2113 1 MKKLLALLAAVAAAALLAAAAAAAAAAEAGKTVRIADVGWTSGT----ATTNVAKKIL-KGLGYTVELVTL-DTAVMYQS 74 (302) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCHHH----HHHHHHHHHH-HHCCCCCEEEEC-CHHHHHHH T ss_conf 92389999999999998622001113455882588436961789----9999999999-967996246655-59999999 Q ss_pred HHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 87531266410110133378999887521 Q gi|254780561|r 77 LRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 77 ~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) +..+..+.....+.......+......+. T Consensus 75 la~gd~D~~~~~W~p~~~~~~~~~~~~~~ 103 (302) T COG2113 75 LAKGDLDVFPEAWLPTTPDDYKKAVKDKK 103 (302) T ss_pred HHCCCCCCCCCEECCCCHHHHHHHHCCCC T ss_conf 97379753201026887588998742685 No 48 >PRK09701 D-allose transporter subunit; Provisional Probab=95.96 E-value=0.068 Score=29.25 Aligned_cols=61 Identities=10% Similarity=0.092 Sum_probs=33.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHH---HHHHCCEEEEEECCC Q ss_conf 902799999999999999861467423488599997887785122338899999---988488799997288 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAF---ERKCNCELKLIGLSD 69 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~F---ek~~gikV~~~~~~~ 69 (338) |||++++++++++++++.+. +.++.++-|..-. ....++..+.+.+ -++.|++|++....+ T Consensus 1 m~k~~~~~~~~~~~~~~~~~-----a~a~~~iavi~k~---~~npf~~~~~~Ga~~aAk~~G~~v~v~~~~~ 64 (311) T PRK09701 1 MNKYLKYFSGTLVGLMLSTS-----AFAAAEYAVVLKT---LSNPFWVDMKKGIEDEAKTLGVSVDIFASPS 64 (311) T ss_pred CCHHHHHHHHHHHHHHHHHH-----HHHCCCEEEEECC---CCCHHHHHHHHHHHHHHHHHCCEEEEEECCC T ss_conf 93699999999999998887-----7436849999688---8998999999999999998699799992798 No 49 >pfam04069 OpuAC Substrate binding domain of ABC-type glycine betaine transport system. Part of a high affinity multicomponent binding-protein-dependent transport system involved in bacterial osmoregulation. This domain is often fused to the permease component of the transporter complex. Family members are often integral membrane proteins or predicted to be attached to the membrane by a lipid anchor. Glycine betaine is involved in protection from high osmolarity environments for example in Bacillus subtilis. The family member OpuBC is closely related, and involved in choline transport. Choline is necessary for the biosynthesis of glycine betaine. L-carnitine is important for osmoregulation in Listeria monocytogenes. Family also contains proteins binding l-proline (ProX), histidine (HisX) and taurine (TauA). Probab=95.55 E-value=0.14 Score=27.35 Aligned_cols=226 Identities=12% Similarity=0.085 Sum_probs=99.2 Q ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEE-CCCHHHHHHHHHHHCCCC Q ss_conf 8599997887785122338899999988488799997288689997887531266410110-133378999887521261 Q gi|254780561|r 30 PVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVL-GFDNNLIDLARKTGLFAK 108 (338) Q Consensus 30 ~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~-~~~~~~~~~~~~~~l~~~ 108 (338) ++++|-+- .+ .+..+...|.+..-++.|++|+.+..+++..+...+.. +..|++. ...+........ .+... T Consensus 1 ktI~ig~~-~w-~e~~i~a~i~~~~Le~~G~~V~~~~~~~~~~~~~al~~----G~iDi~~~~w~~~~~~~~~~-~~~~~ 73 (256) T pfam04069 1 KTIVIGSK-NW-TEQEILANIAAQLLEALGYVVELVGLGSTAVLFAALAS----GDIDLYPEEWTGTTYEAYKK-AVEEK 73 (256) T ss_pred CEEEEECC-CC-CHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHC----CCCCEEEHHCCCCHHHHHHH-HHHCC T ss_conf 92999268-86-28999999999999976981698547870899999976----99758633306625799998-75146 Q ss_pred CCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCC-CCCCCHHHHCCCCC-----CCEEEEEECCCCCCHHHHHHHHH Q ss_conf 3677000210001243211532431000110011124-58764565212000-----68289850476530367899998 Q gi|254780561|r 109 SNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQIT-QPPKNFDELINSTK-----PWKIIYQDPRTSTLGLGLLLWIQ 182 (338) Q Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~-~~p~s~~~l~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~ 182 (338) ... ... +........++++.+.... ...+|+.||.++.- .+++.-..++.... ....... T Consensus 74 ~~~--------~~~----~~~~~~~~~g~~Vp~~~a~~~~i~si~dL~~~~~~~~~~~~~~~~~~~G~~~~--~~~~~~~ 139 (256) T pfam04069 74 LGL--------LVL----GPLGAGNTYGLAVPKYVAEKPGIKSISDLAKPADDELGFKGEFIGRPDGWGCT--RSTEGLL 139 (256) T ss_pred CCE--------EEC----CCCCCCCEEEEEECHHHHHHCCCCCHHHHHCCCCCCCCCCCCEECCCCCCCCC--HHHHHHH T ss_conf 873--------861----35777875899985888835699879997274210248886553278885400--6789999 Q ss_pred HHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEE-- Q ss_conf 717731378987776288468308478988862258656999504499999850799737883169808888642433-- Q gi|254780561|r 183 KIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQL-- 260 (338) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~-- 260 (338) ..++.+..+.. ......-.........+|+..++..|..++..... ++.+.--+++.+.-.+.+..+ T Consensus 140 ~~yGl~~~~~~------~~s~~~~~a~~~~A~~~g~~~v~~~w~p~~~~~~~-----dl~~L~Dp~~~~~~~~~v~~v~~ 208 (256) T pfam04069 140 KAYGLDKYELV------EGSEAAMDALLYAAIKRGEPDVVYAWTPDWMIKKY-----DLVVLEDPKGLFPPAYNVVPVVR 208 (256) T ss_pred HHCCCCCEEEC------CCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHC-----CEEECCCCCCCCCCCCEEEEEEC T ss_conf 86397640123------58555459999999984998899963761655427-----96993688645797655875210 Q ss_pred CC--CCCHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 78--9998999999999639999999885 Q gi|254780561|r 261 VR--SKQPQLAQEFMQFMISPSFQRILPT 287 (338) Q Consensus 261 k~--a~n~~~A~~Fi~flls~e~Q~~~a~ 287 (338) ++ .+||+++..|=++-++.+.+..+.. T Consensus 209 ~~~~~~~P~~~~~l~~~~~t~~~~~~l~~ 237 (256) T pfam04069 209 KGFAEKHPEVAAFLNKLSLDTEDLNELNA 237 (256) T ss_pred HHHHHHCHHHHHHHHHCCCCHHHHHHHHH T ss_conf 46777798999999867899999999999 No 50 >pfam03401 Bug Tripartite tricarboxylate transporter family receptor. These probable extra-cytoplasmic solute receptors are strongly overrepresented in several beta-proteobacteria. This family, formerly known as Bug - Bordetella uptake gene (bug) product - is a family of bacterial tripartite tricarboxylate receptors of the extracytoplasmic solute binding receptor-dependent transporter group of families, distinct from the ABC and TRAP-T families. The TctABC system has been characterized in S. typhimurium, and TctC is the extracytoplasmic tricarboxylate-binding receptor which binds the transporters TctA and TctB, two integral membrane proteins. Complete three-component systems are found only in bacteria. Probab=95.28 E-value=0.18 Score=26.81 Aligned_cols=245 Identities=12% Similarity=0.078 Sum_probs=119.2 Q ss_pred HHHHHHHHHHHCCEEEEEECC-CHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCC Q ss_conf 889999998848879999728-8689997887531266410110133378999887521261367700021000124321 Q gi|254780561|r 48 PKIKQAFERKCNCELKLIGLS-DGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDI 126 (338) Q Consensus 48 ~~i~~~Fek~~gikV~~~~~~-~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 126 (338) +.+.+.++|+.|..|.++... .++..-....+......+-+.+.............++ ++++ ... -.+ T Consensus 3 R~~a~~l~~~lG~~vvV~N~pGagG~ig~~~v~~a~~DGyTl~~~~~~~~~~~~~~~~~--~~~~-~~d--------f~p 71 (274) T pfam03401 3 RALAQGMSKQLGQPVIVENKPGAGGIIGADAVAKAAPDGYTILLGSTGLAVAPHLYPNL--PYDP-IKD--------FTP 71 (274) T ss_pred HHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHCCC--CCCC-HHH--------HHH T ss_conf 89999999986998799966983069999999808999988999467898768884347--8870-887--------675 Q ss_pred CCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCC--CCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEE Q ss_conf 15324310001100111245876456521200--0682898504765303678999987177313789877762884683 Q gi|254780561|r 127 FVPYDYGYLAFIYDKRQITQPPKNFDELINST--KPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVT 204 (338) Q Consensus 127 ~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 204 (338) ..........+..+.+ .+-+|++||.+.. .++++.+...+..+........+....+.+ .....+ T Consensus 72 ia~~~~~~~~l~v~~d---sp~~t~~dli~~ak~~Pg~~~~g~~G~gs~~hl~~~~l~~~~G~~----------~~~Vpy 138 (274) T pfam03401 72 VSLLATSPMVLVVPAD---SPFKTLQELVAYAKANPGKLTFASAGIGTSNHLAGELLASKAGVQ----------LSHVPY 138 (274) T ss_pred HHHHHCCCEEEEECCC---CCCCCHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHCCCC----------EEEECC T ss_conf 8865016358897799---975689999999984899658732788863189999999970996----------488346 Q ss_pred CCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECC-------------C-C----EEEEEEEEEEECCCCCH Q ss_conf 0847898886225865699950449999985079973788316-------------9-8----08888642433789998 Q gi|254780561|r 205 KGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFS-------------E-G----HYLQIEVAAQLVRSKQP 266 (338) Q Consensus 205 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------e-G----~~~~~~~~~i~k~a~n~ 266 (338) ++..+....+..|++++.+...+.. ......+...+-.+.-. | | ......++..+++.|.. T Consensus 139 ~G~~~~~~allgG~vd~~~~~~~~~-~~~v~~G~lr~Lav~~~~R~~~~PdVPT~~E~G~~~~~~~~w~g~~ap~gtP~~ 217 (274) T pfam03401 139 KGSSPALQDLLGGRVDMMIDSLTST-APYIKAGKLRALAVTSPKRSPQLPDVPTVAELGLKGFEAGVWFGLVAPKGTPPA 217 (274) T ss_pred CCCHHHHHHHHCCEEEEEEECCHHH-HHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHCCCCCEEEEEEEEEECCCCCHH T ss_conf 7832668998678445998351320-445417982599997575675448999768848787147888887634988999 Q ss_pred ---HHHHHHHHHHCCHHHHHHHHHCCCEEECCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf ---99999999963999999988504505055765882134122322234789899999999999999998741 Q gi|254780561|r 267 ---QLAQEFMQFMISPSFQRILPTTNWMYPVVDISMPDVYQAIRIPQKSLRFNAEETTRYRNQWISSWQNAVSR 337 (338) Q Consensus 267 ---~~A~~Fi~flls~e~Q~~~a~~~~~~P~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~W~~~v~~ 337 (338) .+...|-+-+-+||.|+.+.+.++ . +...+.++..+...+=.++|.+.+.+ T Consensus 218 ~~~~l~~a~~~~~~dpe~~~~~~~~g~-~-------------------~~~~~~~e~~~~~~~e~~~~~~l~k~ 271 (274) T pfam03401 218 VVEKLNDAIKKALKDPEVVERLANLGM-E-------------------PVPGTPQQLAQFIQAEVKRWGPLIQE 271 (274) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHCCC-C-------------------CCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999749999999997789-5-------------------78899999999999999999999998 No 51 >pfam09084 NMT1 NMT1/THI5 like. This family contains the NMT1 and THI5 proteins. These proteins are proposed to be required for the biosynthesis of the pyrimidine moiety of thiamine. They are regulated by thiamine. Probab=94.56 E-value=0.28 Score=25.65 Aligned_cols=189 Identities=16% Similarity=0.102 Sum_probs=89.1 Q ss_pred HHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC-CCCCCCCHHHCCCCCCCCCCCC Q ss_conf 899999988488799997288689997887531266410110133378999887521-2613677000210001243211 Q gi|254780561|r 49 KIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL-FAKSNIDASQLKLPIKWDDDIF 127 (338) Q Consensus 49 ~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~ 127 (338) ...+.|=+++|++|+++.+.++.+.+..+.+ ...|+.+......+. ....+. +.... ...... T Consensus 10 A~~~G~f~~~GL~Ve~~~~~~~~~~~~al~s----G~~D~a~~~~~~~~~-~~~~g~~~~~i~--------~~~~~~--- 73 (216) T pfam09084 10 AQEKGYFKEEGLDVEIVEPADPSDAVQLVAA----GKADFGVSYQPSLLL-ARAKGLPVVSVA--------ALIQHP--- 73 (216) T ss_pred HHHCCCHHHCCCCEEEEECCCCHHHHHHHHC----CCCCEEECCCHHHHH-HHHCCCEEEEEE--------ECCCCC--- T ss_conf 9987946884980899966882789999976----986778537199999-997898499999--------643589--- Q ss_pred CCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCC Q ss_conf 53243100011001112458764565212000682898504765303678999987177313789877762884683084 Q gi|254780561|r 128 VPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGW 207 (338) Q Consensus 128 ~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (338) ...+...+ ....+++.|| +||......... .. .....+....+.+.. ....+.-+. T Consensus 74 ------~~~~~v~~---~~~i~s~~DL-----kGk~i~~~~~~~-~~-~~l~~~L~~~Gl~~~--------dv~~v~~~~ 129 (216) T pfam09084 74 ------PNGLISLK---DSGIKSPKDL-----KGKRIGYSGSPF-EE-ALLKALLAKAGGDPS--------DVTLVNVGG 129 (216) T ss_pred ------CEEEEEEC---CCCCCCHHHH-----CCCEEEEECCCH-HH-HHHHHHHHHCCCCHH--------HEEEEECCH T ss_conf ------72899978---8898997895-----898899637972-89-999999998599989--------908996784 Q ss_pred HHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCC-CEEEEEEEEEEECC---CCCHHHHHHHHHHHC Q ss_conf 78988862258656999504499999850799737883169-80888864243378---999899999999963 Q gi|254780561|r 208 TESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSE-GHYLQIEVAAQLVR---SKQPQLAQEFMQFMI 277 (338) Q Consensus 208 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e-G~~~~~~~~~i~k~---a~n~~~A~~Fi~fll 277 (338) .+....+.+|+++....+............+.......+.+ |....+....+.+. .+||+.+.+|++=+. T Consensus 130 ~~~~~al~~G~vDa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~f~~a~~ 203 (216) T pfam09084 130 TSLSPALLTGKVDAAIGGYYNWEGVELELEGVDLNIFALADYGVPDYYSLVLITNDATLKKNPELVKAFLRATK 203 (216) T ss_pred HHHHHHHHCCCCCEEEECCCCHHHHHHHHCCCCEEEEEHHHCCCCCCCCEEEEECHHHHHHCHHHHHHHHHHHH T ss_conf 88888985699558996246757999985688658970654388876622999989999889899999999999 No 52 >PRK11119 proX glycine betaine transporter periplasmic subunit; Provisional Probab=93.31 E-value=0.49 Score=24.24 Aligned_cols=76 Identities=13% Similarity=0.068 Sum_probs=36.7 Q ss_pred HHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEE-CC---------------CCE----EEEEEEEEEECC--CCCH Q ss_conf 8988862258656999504499999850799737883-16---------------980----888864243378--9998 Q gi|254780561|r 209 ESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAAL-FS---------------EGH----YLQIEVAAQLVR--SKQP 266 (338) Q Consensus 209 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~---------------eG~----~~~~~~~~i~k~--a~n~ 266 (338) +......+++--+..+|.-.+......-+ .++.+.- |. ++. ....-..+..|+ .+|| T Consensus 194 ~~~~a~~~~epiv~~~W~PHW~~~~~~pg-~Dv~~L~~P~~~~~~~~~~~~~~~~dp~~~g~~~~~i~~v~~k~f~e~~P 272 (332) T PRK11119 194 DTIARYKEGKPVLYYTWTPYWVSDVLKPG-KDVVWLQVPFSALPGDQKNADTKLPNGKNYGFPVSTMHIVANKKFAEKNP 272 (332) T ss_pred HHHHHHHCCCCEEEEEECCCHHHHHCCCC-CCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHCC T ss_conf 99999987999899996771134202565-55135126555577554564333577410378610277521003556394 Q ss_pred HHHHHHHHHHCCHHHHHHH Q ss_conf 9999999996399999998 Q gi|254780561|r 267 QLAQEFMQFMISPSFQRIL 285 (338) Q Consensus 267 ~~A~~Fi~flls~e~Q~~~ 285 (338) ++++-|=+|-++.+-+..+ T Consensus 273 ~a~~~l~n~~~t~~d~~~~ 291 (332) T PRK11119 273 AAAKLFEIMKLPLADINAQ 291 (332) T ss_pred HHHHHHHHCCCCHHHHHHH T ss_conf 8999999679999999999 No 53 >COG0715 TauA ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components [Inorganic ion transport and metabolism] Probab=93.30 E-value=0.49 Score=24.22 Aligned_cols=188 Identities=14% Similarity=0.091 Sum_probs=84.9 Q ss_pred HHHHHHHHHHHCC-EEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHH-HHHHHHCCCCCCCCHHHCCCCCCCCCC Q ss_conf 8899999988488-79999728868999788753126641011013337899-988752126136770002100012432 Q gi|254780561|r 48 PKIKQAFERKCNC-ELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLID-LARKTGLFAKSNIDASQLKLPIKWDDD 125 (338) Q Consensus 48 ~~i~~~Fek~~gi-kV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~ 125 (338) -...+.|=+++|+ +|+++...+.......+.++. .|+.....+.... .....+. +.. T Consensus 48 va~~kG~f~~~Gl~~V~~~~~~~~~~~~~~l~~G~----~D~a~~~~~~~~~~~~~~~g~--pv~--------------- 106 (335) T COG0715 48 VAKEKGFFKKEGLDDVELVEFTGGAPVLEALAAGA----LDFAVYYTGDTPLIAAGAAGA--PVK--------------- 106 (335) T ss_pred HHHHCCHHHHCCCCEEEEEECCCCHHHHHHHHCCC----CCEEEECCCCHHHHHHHCCCC--CEE--------------- T ss_conf 99978917774997589970798457999997699----888742378559999635699--979--------------- Q ss_pred CCCCEEEE---EEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCE Q ss_conf 11532431---000110011124587645652120006828985047653036789999871773137898777628846 Q gi|254780561|r 126 IFVPYDYG---YLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTAT 202 (338) Q Consensus 126 ~~~~~~~~---~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (338) .+..... ...+.. ......++++|| ...+++++.+.. . ....+.......+.+..+.. . T Consensus 107 -~va~~~~~~~~~~i~~---~~~~~i~~~adl----kGk~vg~~~~~~-~-~~~~l~~~L~~~Gl~~~dv~--------~ 168 (335) T COG0715 107 -VVAALVQNGNGIALLV---LKDSGIKSVADL----KGKKVGVPFGGS-T-SDFLLRYALAKAGLDPDDVE--------L 168 (335) T ss_pred -EEEEEEECCCCEEEEE---CCCCCCCCHHHC----CCCEEEEECCCC-C-HHHHHHHHHHHCCCCHHHCE--------E T ss_conf -9998743797329997---388776771333----899899818997-0-79999999998699989946--------8 Q ss_pred EECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEE-EEEEEEEECC---CCCHHHHHHHHHHHC Q ss_conf 83084789888622586569995044999998507997378831698088-8864243378---999899999999963 Q gi|254780561|r 203 VTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYL-QIEVAAQLVR---SKQPQLAQEFMQFMI 277 (338) Q Consensus 203 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~-~~~~~~i~k~---a~n~~~A~~Fi~fll 277 (338) ..-...+....+..|.++....|.. ........+ ...++....+... .+....+... .+||+.+.+|++-+. T Consensus 169 v~~~~~~~~~al~~g~vda~~~~ep--~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~v~a~~ 244 (335) T COG0715 169 VNLPPADAVAALAAGQVDAFVVWEP--WNAAAEGEG-GGRVLLDGADLWGNHPELVLVVRKEFIEANPEAVKAFLKALA 244 (335) T ss_pred EECCHHHHHHHHHCCCCCEEEECCH--HHHHHHHCC-CCEEEEECCCCCCCCCEEEEEECHHHHHHCHHHHHHHHHHHH T ss_conf 7369299999997598688996667--989973157-857986156667667426999558877669999999999999 No 54 >PRK06760 hypothetical protein; Provisional Probab=92.06 E-value=0.19 Score=26.66 Aligned_cols=86 Identities=23% Similarity=0.290 Sum_probs=53.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC--CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH----CCEEEEEECCCHHHHH Q ss_conf 902799999999999999861467--4234885999978877851223388999999884----8879999728868999 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD--GLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC----NCELKLIGLSDGVALL 74 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~--~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~----gikV~~~~~~~~~~~~ 74 (338) |||.+.+.++.+++++++++|+.. ...+.+.|.+ . ++.....|+..+.+.+ -.+|+.-+.+.. T Consensus 1 MKK~l~i~i~~iis~~~lsaCS~~~~~~PANGvlii-G------dE~~~~~I~d~yK~~tke~~~y~vK~g~~~~k---- 69 (223) T PRK06760 1 MKKTLTIFMLTILLLISFSACSKKENSFPANGVLII-G------DENNISPIINRYQEITKENEVFSVKKGEVGNG---- 69 (223) T ss_pred CCEEEEEHHHHHHHHHHHHHCCCCCCCCCCCEEEEE-E------CHHHHHHHHHHHHHHHHHHHEEEEEEEEECCE---- T ss_conf 950676499999999984003367878987437999-6------51677889998877643432046675137882---- Q ss_pred HHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCC Q ss_conf 788753126641011013337899988752126136 Q gi|254780561|r 75 NKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSN 110 (338) Q Consensus 75 ~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~ 110 (338) -++..+......+...+++...+ T Consensus 70 -------------kVLim~~sTAe~mvKk~llRkrD 92 (223) T PRK06760 70 -------------QVLILNESTAQALIKEKVFRKRD 92 (223) T ss_pred -------------EEEEEEHHHHHHHHHHCCEEECC T ss_conf -------------79998618999997612023045 No 55 >TIGR03414 ABC_choline_bnd choline ABC transporter, periplasmic binding protein. Partial phylogenetic profiling (PMID:16930487) vs. the genome property of glycine betaine biosynthesis from choline consistently reveals a member of this ABC transporter periplasmic binding protein as the best match, save for the betaine biosynthesis enzymes themselves. Genomes often carry several paralogs, one encoded together with the permease and ATP-binding components and another encoded next to a choline-sulfatase gene, suggesting that different members of this protein family interact with shared components and give some flexibility in substrate. Of two members from Sinorhizobium meliloti 1021, one designated ChoX has been shown experimentally to bind choline (though not various related compounds such as betaine) and to be required for about 60 % of choline uptake. Members of this protein have an invariant Cys residue near the N-terminus and likely are lipoproteins. Probab=92.00 E-value=0.74 Score=23.20 Aligned_cols=221 Identities=9% Similarity=-0.015 Sum_probs=89.6 Q ss_pred CCCEEEE--EECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 4885999--97887785122338899999988488799997288689997887531266410110133378999887521 Q gi|254780561|r 28 AKPVLTV--YTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 28 ~~~~l~v--~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) +.+++++ .+|.+.. ....|.+.-=++.|++|+++.. +...+..-+..+..+...++.+........+....+- T Consensus 6 ~~~~V~i~~~~W~s~~----~~t~v~~~iLE~~GY~Ve~~~~-~~~~~~~~la~GdiDv~~e~W~p~~~~~~~~~~~~g~ 80 (290) T TIGR03414 6 SCKTVRFADVGWTDIT----ATTALASVLLEGLGYQPKVTLL-SVPVTYAGLKDGDLDVFLGNWMPAMEPDIKPYLESGS 80 (290) T ss_pred CCCCEEEEECCCCHHH----HHHHHHHHHHHHCCCCEEEEEC-CHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHCCC T ss_conf 0996699606864899----9999999999974996489967-7499999997699707871114785778998863696 Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCC--CCCCEEEEEECCCCCCHHHHHHHHH- Q ss_conf 2613677000210001243211532431000110011124587645652120--0068289850476530367899998- Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINS--TKPWKIIYQDPRTSTLGLGLLLWIQ- 182 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~--~~~~~~~~~~~~~~~~~~~~~~~~~- 182 (338) +....... ... ..+..+-.|-.+ ...+||.||... .++|++....++..... ....... T Consensus 81 v~~lg~~~--------~~a------~~g~~VP~Yv~d---~~l~si~DL~~~~~~f~g~i~gi~~G~~~~~-~~~~~i~~ 142 (290) T TIGR03414 81 VEVLGPNL--------EGA------KYTLAVPTYVAD---AGLKSFADIAKFKDKLDGKIYGIEPGNDGNR-LIQKMIDK 142 (290) T ss_pred EEECCCCC--------CCC------EEEEEECHHHHH---CCCCCHHHHHHCHHHCCCCEECCCCCCCHHH-HHHHHHHH T ss_conf 89835666--------786------378985414776---5999999998586752997613679984778-99999876 Q ss_pred HHCCCCHHHHHHHHHHCCCEEE-CCCH----HHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEE---EE Q ss_conf 7177313789877762884683-0847----89888622586569995044999998507997378831698088---88 Q gi|254780561|r 183 KIYGDNSAQVWKKIATKTATVT-KGWT----ESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYL---QI 254 (338) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~---~~ 254 (338) ..++..+. .+. ++.. +......+++..+...|.-.+... ..++.+.-.+++.+. .. T Consensus 143 ~~ygL~~~-----------~l~~~S~aam~a~l~~A~~~~epiv~~~W~Phw~~~-----~~dl~~L~~~~~~~g~~~~~ 206 (290) T TIGR03414 143 NAFGLGGF-----------KLVESSEAGMLAQVARAVKRKEWVVFLGWEPHPMNT-----NFKMTYLTGGDDYFGPNYGG 206 (290) T ss_pred HCCCCCCC-----------EECCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHH-----HCCEEEECCCCCCCCCCCCH T ss_conf 40798870-----------213688899999999999869998998237754555-----17706705874457899862 Q ss_pred EEEE--EECC--CCCHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 6424--3378--9998999999999639999999885 Q gi|254780561|r 255 EVAA--QLVR--SKQPQLAQEFMQFMISPSFQRILPT 287 (338) Q Consensus 255 ~~~~--i~k~--a~n~~~A~~Fi~flls~e~Q~~~a~ 287 (338) ..+. +.++ .+||+++.-|=+|-++.+.|..+.. T Consensus 207 ~~v~~~~~~~f~~~~P~~~~~l~~~~~~~~~~~~~~~ 243 (290) T TIGR03414 207 ATVYTNTRKGYAAECPNVGKLLTNLTFTLDMENQIMG 243 (290) T ss_pred HHEEECCCHHHHHHCHHHHHHHHCCCCCHHHHHHHHH T ss_conf 2066414705878793899999615899899999999 No 56 >PRK12450 foldase protein PrsA; Reviewed Probab=92.00 E-value=0.73 Score=23.23 Aligned_cols=15 Identities=20% Similarity=0.366 Sum_probs=10.2 Q ss_pred HHHHHHHHHHCCEEE Q ss_conf 899999988488799 Q gi|254780561|r 49 KIKQAFERKCNCELK 63 (338) Q Consensus 49 ~i~~~Fek~~gikV~ 63 (338) .+-+.|+++||-+|. T Consensus 63 i~~KvLe~kYg~kVs 77 (309) T PRK12450 63 VISRVFETQYANKVS 77 (309) T ss_pred HHHHHHHHHHCCCCC T ss_conf 999999998535178 No 57 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=91.27 E-value=0.49 Score=24.26 Aligned_cols=63 Identities=19% Similarity=0.201 Sum_probs=31.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCC------CCEEEE---EECC--CCCCCHHHHH-HHHHHHHHHHCCEEE Q ss_conf 9027999999999999998614674234------885999---9788--7785122338-899999988488799 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPA------KPVLTV---YTYN--SFVADEGAGP-KIKQAFERKCNCELK 63 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~------~~~l~v---~~~~--~~~~~~~~~~-~i~~~Fek~~gikV~ 63 (338) |||.+.+.++.++.+.++++|+.+..+. .+++|. |.-- .-.....+.+ -+.+-|+++||-+|. T Consensus 1 MKKKl~~~~~~~~svl~LaaCs~~~~~~~vat~kgg~IT~~e~Y~~~K~~~~g~q~l~~mi~~kvLe~~Yg~kVs 75 (310) T PRK01326 1 MKKKLIAGAVTLLSVATLAACSKTNDNTKVISMKGDTITVSDFYNEVKNNEVAQQAMLNLVISRVFEKQYGDKVS 75 (310) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCEEHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 908899999999999999871699988068982499342999999984664289999999999999998626178 No 58 >PRK10653 D-ribose transporter subunit RbsB; Provisional Probab=90.65 E-value=1 Score=22.38 Aligned_cols=62 Identities=16% Similarity=0.200 Sum_probs=34.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHH---HHCCEEEEEECCCH Q ss_conf 90279999999999999986146742348859999788778512233889999998---84887999972886 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFER---KCNCELKLIGLSDG 70 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek---~~gikV~~~~~~~~ 70 (338) |||+..+++++++++ |....+.+++|+-|...+ ....++..+++..|+ ++|+++.+...... T Consensus 3 ~~k~~~~~~~~~~~~-----~~~a~a~a~~TIgvivp~---i~npff~~v~~gie~~a~~~Gy~l~v~~s~~d 67 (295) T PRK10653 3 MKKLATLVSAVALSA-----TVSANAMAKDTIALVVST---LNNPFFVSLKDGAQKEADKLGYNLVVLDSQNN 67 (295) T ss_pred HHHHHHHHHHHHHHH-----HCCCHHHCCCEEEEEECC---CCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 079999999998765-----212177529979999489---87979999999999999976998999839999 No 59 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=90.12 E-value=1 Score=22.46 Aligned_cols=24 Identities=13% Similarity=0.236 Sum_probs=17.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |||-+.+++..++++.++++|+.. T Consensus 1 mkkK~i~~~~~~~svl~LaaC~~~ 24 (285) T PRK03002 1 MRGKHIFIITALISILMLSACGQK 24 (285) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 960189999999999999984579 No 60 >PRK10598 hypothetical protein; Provisional Probab=89.79 E-value=0.54 Score=24.01 Aligned_cols=47 Identities=23% Similarity=0.540 Sum_probs=29.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHH--HHHHHCCE Q ss_conf 90279999999999999986146742348859999788778512233889999--99884887 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQA--FERKCNCE 61 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~--Fek~~gik 61 (338) |||+. ++++++++++++||.. |+=|+ .++.++.+.+.+. ||++.||. T Consensus 1 mkk~~--l~~~l~l~~llsGC~~--------l~qYs----ISEqEin~YL~~~~~~~k~iGi~ 49 (186) T PRK10598 1 MKKFL--LAAALLVSGLLVGCNQ--------LTQYT----ISEQEINQYLAKHNNFSKDIGLP 49 (186) T ss_pred CCHHH--HHHHHHHHHHHHCCCC--------CCCEE----ECHHHHHHHHHHHCCHHHHCCCC T ss_conf 91269--9999999999714353--------36440----07999999998743378771888 No 61 >COG1744 Med Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein [General function prediction only] Probab=89.25 E-value=1.3 Score=21.71 Aligned_cols=100 Identities=13% Similarity=0.041 Sum_probs=46.4 Q ss_pred CCHH-HHHHHHHHHHHHHHHHHCCC---CCCCCCEEEE-EECCCCCCCH----HHHHHHHHHHHHHHCCEEEE-----EE Q ss_conf 9027-99999999999999861467---4234885999-9788778512----23388999999884887999-----97 Q gi|254780561|r 1 MKKF-ARIVVGIMMITGVISYCTLD---GLPAKPVLTV-YTYNSFVADE----GAGPKIKQAFERKCNCELKL-----IG 66 (338) Q Consensus 1 MKk~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~l~v-~~~~~~~~~~----~~~~~i~~~Fek~~gikV~~-----~~ 66 (338) ||+. ......+++++.++.+|+.+ +.+....+.+ ....+...+. ..... .+.++++.|++++. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~D~s~n~~~~~g-~~~~~~~~g~~~~~~~~~~~~ 79 (345) T COG1744 1 MKFRRILGLKLALGAALLLAACGASLAGAAAAGKKKKVAVIDVGGIDDKSFNQSAYEG-LLKAKKELGLKVETYYWEYVQ 79 (345) T ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEECCCCCCCCCHHHHHHH-HHHHHHHCCCCEEEEEEECCC T ss_conf 9278899999999999999984465544434565248999951887775710889999-999988609844432220368 Q ss_pred CCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHH Q ss_conf 288689997887531266410110133378999887 Q gi|254780561|r 67 LSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARK 102 (338) Q Consensus 67 ~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~ 102 (338) ..+..++...++.- .....|++++....+...... T Consensus 80 ~~~~~~~~~~~~~~-a~~g~~lI~~~gf~~~d~~~~ 114 (345) T COG1744 80 SDSEADYERALRAL-AEDGYDLIFGTGFAFSDALEK 114 (345) T ss_pred CCCHHHHHHHHHHH-HHCCCCEEEECCHHHHHHHHH T ss_conf 87467899999999-866997899846205569999 No 62 >PRK11917 bifunctional adhesin/ABC transporter aspartate/glutamate-binding protein; Reviewed Probab=89.10 E-value=1.4 Score=21.65 Aligned_cols=215 Identities=13% Similarity=0.087 Sum_probs=97.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCC-------CCCEEEEEECCCCCC----C------HHHHHHHHHHHHHHH---CCEEEEEE Q ss_conf 999999999999861467423-------488599997887785----1------223388999999884---88799997 Q gi|254780561|r 7 IVVGIMMITGVISYCTLDGLP-------AKPVLTVYTYNSFVA----D------EGAGPKIKQAFERKC---NCELKLIG 66 (338) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~-------~~~~l~v~~~~~~~~----~------~~~~~~i~~~Fek~~---gikV~~~~ 66 (338) +.++++++++|++.|...+++ +.++|+|-+...+.+ + .++.-.+.++.-++- +++|+++. T Consensus 8 ~~~~~~~l~~~~~~~~~~a~~~~l~~I~~~G~L~vg~~~~~pPf~~~d~~~g~~~GfDvDla~~iAk~llg~~~~v~~v~ 87 (259) T PRK11917 8 LKLAVFALGACVAFSNANAAEGKLESIKSKGQLIVGVKNDVPHYALLDQATGEIKGFEIDVAKLLAKSILGDDKKIKLVA 87 (259) T ss_pred HHHHHHHHHHHHHCCCCHHHHCHHHHHHHCCEEEEEECCCCCCCCEEECCCCCEEEEEHHHHHHHHHHHCCCCCEEEEEE T ss_conf 99999999999863452111441999985998999987897886205679996788879999999999659985699997 Q ss_pred CCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCC Q ss_conf 28868999788753126641011013337899988752126136770002100012432115324310001100111245 Q gi|254780561|r 67 LSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQ 146 (338) Q Consensus 67 ~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~ 146 (338) ... ...+..+. +..+|+++..-.....+ ...-....||......+...++ . T Consensus 88 ~~~-~~~~p~L~----~gkvD~i~~~~tiT~eR---------------------~k~vdFS~PY~~~~~~ilv~k~---~ 138 (259) T PRK11917 88 VNA-KTRGPLLD----NGSVDAVIATFTITPER---------------------KRIYNFSEPYYQDAIGLLVLKE---K 138 (259) T ss_pred ECH-HHHHHHHH----CCCEEEEEECCCCCCCH---------------------HCCEEECCCEEEEEEEEEEECC---C T ss_conf 067-87768987----79855998664767654---------------------3057622476996559999899---9 Q ss_pred CCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHC-CCEEECCCHHHHHHHHCCCCCEEEEE Q ss_conf 87645652120006828985047653036789999871773137898777628-84683084789888622586569995 Q gi|254780561|r 147 PPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATK-TATVTKGWTESYGFFLKGESDFVLSY 225 (338) Q Consensus 147 ~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~~~~~ 225 (338) ..+++.||... +++.... +.......... ...... ......+..+....+..|.++....- T Consensus 139 ~i~~~~DL~gk----~Vgv~~g--st~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~l~~grvDa~~~d 200 (259) T PRK11917 139 NYKSLADMKGA----NIGVAQA--ATTKKAIGEAA------------KKIGIDVKFSEFPDYPSIKAALDAKRVDAFSVD 200 (259) T ss_pred CCCCHHHHCCC----CEEEEEC--CCHHHHHHHHH------------HHCCCCEEEEECCCHHHHHHHHHCCCEEEEECC T ss_conf 99998995898----1799927--84699999988------------860885169964999999999984980299826 Q ss_pred CCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 044999998507997378831698088886424337899989999999996 Q gi|254780561|r 226 STSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFM 276 (338) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~fl 276 (338) .. ......+.... +.+ +.-.....++++.|+. ++....+=+++ T Consensus 201 ~~--~~~~~~~~~~~---~~~-~~~~~~~~giavrKgd--~~l~~~vn~~i 243 (259) T PRK11917 201 KS--ILLGYVDDKSE---ILP-DSFEPQSYGIVTKKDD--PAFAKYVDDFV 243 (259) T ss_pred HH--HHHHHHHCCCC---CCC-CCCCCCEEEEEEECCC--HHHHHHHHHHH T ss_conf 99--99999861980---168-8887525899994899--89999999999 No 63 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=87.21 E-value=0.65 Score=23.53 Aligned_cols=26 Identities=8% Similarity=0.376 Sum_probs=21.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 90279999999999999986146742 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGL 26 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~ 26 (338) |||..+++++++++++++++|+.... T Consensus 1 m~~k~~~li~ll~~~~lL~gC~~~~~ 26 (385) T PRK09859 1 MNRRRKLLIPLLFCGAMLTACDDKSA 26 (385) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 98206789999999999953799976 No 64 >PRK09558 ushA bifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor; Reviewed Probab=87.21 E-value=1.2 Score=21.88 Aligned_cols=37 Identities=14% Similarity=0.113 Sum_probs=23.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCC-CC--CEEEEEEC Q ss_conf 902799999999999999861467423-48--85999978 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLP-AK--PVLTVYTY 37 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~l~v~~~ 37 (338) |+|+++.++++++++++..|.+...++ ++ -+|+|+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ltilht 40 (551) T PRK09558 1 MMKLLKGLVALALLAALALCGSTACAYEGDKTYKITVLHT 40 (551) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEE T ss_conf 9258999999999999874067664446898579999984 No 65 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=86.44 E-value=2 Score=20.69 Aligned_cols=22 Identities=23% Similarity=0.258 Sum_probs=10.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 90279999999999999986146 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL 23 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~ 23 (338) |||++..++++++ +.++++|+. T Consensus 5 MKK~~l~~~~~~~-~l~LaaCss 26 (298) T PRK04405 5 MKKWALAAASAGL-LLSLAGCSS 26 (298) T ss_pred HHHHHHHHHHHHH-HHHHHHCCC T ss_conf 8999999999999-999987179 No 66 >TIGR03427 ABC_peri_uca ABC transporter periplasmic binding protein, urea carboxylase region. Members of this family are ABC transporter periplasmic binding proteins associated with the urea carboxylase/allophanate hydrolase pathway, an alternative to urease for urea degradation. The protein is restricted to bacteria with the pathway, with its gene close to the urea carboxylase and allophanate hydrolase genes. The substrate for this transporter therefore is likely to be urea or a compound from which urea is easily derived. Probab=84.97 E-value=2.4 Score=20.27 Aligned_cols=184 Identities=11% Similarity=0.095 Sum_probs=86.1 Q ss_pred HHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCC Q ss_conf 88999999884887999972886899978875312664101101333789998875212613677000210001243211 Q gi|254780561|r 48 PKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIF 127 (338) Q Consensus 48 ~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 127 (338) +.+++++-++|||+|+++.+.+..+.+..+.++ +.|-+...+.+.+......+.-... ... T Consensus 23 ~GifkKw~~k~Gi~Vel~~f~dy~~si~a~~AG----~~Dg~~~Tn~Dal~i~aa~Gvd~~~---------------vlv 83 (328) T TIGR03427 23 QGIVDKWADKYGITIEVVQINDYVESINQYTAG----KFDGCTMTNMDALTIPAAGGVDTTA---------------LIV 83 (328) T ss_pred CCCHHHHHHHCCCEEEEEECCCHHHHHHHHHCC----CCCEEEEECCHHHHHHHCCCCCEEE---------------EEE T ss_conf 492655588759737999828848889999758----9774776261566513117997289---------------998 Q ss_pred CCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCC Q ss_conf 53243100011001112458764565212000682898504765303678999987177313789877762884683084 Q gi|254780561|r 128 VPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGW 207 (338) Q Consensus 128 ~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (338) ..+..+..++.. ...+|..|| +||.+-.... +.....+..++...+ ... ..+..+.-+. T Consensus 84 ~D~SnG~D~Iva------k~~~slaDL-----KGKkVav~~~-svshyLL~rALe~aG-Ls~--------~DV~vVn~~~ 142 (328) T TIGR03427 84 GDFSNGNDGIVL------KGGKSLADL-----KGQKVNLVEL-SVSHYLLARALESVG-LSE--------KDVKVVNTSD 142 (328) T ss_pred ECCCCCCCEEEE------CCCCCHHHH-----CCCEEECCCC-CCHHHHHHHHHHHCC-CCH--------HHEEEEECCC T ss_conf 525788747997------899887884-----8988832478-707999999999869-988--------9907884795 Q ss_pred HHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEE-ECCCCEEEEEEEEEEECC--CCCHHHHHHHHH Q ss_conf 7898886225865699950449999985079973788-316980888864243378--999899999999 Q gi|254780561|r 208 TESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAA-LFSEGHYLQIEVAAQLVR--SKQPQLAQEFMQ 274 (338) Q Consensus 208 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~eG~~~~~~~~~i~k~--a~n~~~A~~Fi~ 274 (338) .+....+..|.++..+.|.-.... .....+...-+. --..|- ..+.+++... ..||+.+++++. T Consensus 143 ~d~~aAf~sg~VdA~vTWeP~ls~-i~~~~ga~~iF~Ss~iPGe--I~D~lvV~~~~L~~nP~~~kAlv~ 209 (328) T TIGR03427 143 ADIVAAFITKDVTAVVTWNPQLSE-IKAQPGANEVFDSSQIPGE--ILDLMVVNTQTLKANPNLGKALTG 209 (328) T ss_pred HHHHHHHHCCCCCEEEEECHHHHH-HHHCCCCCEEEECCCCCCC--EEEEEEECHHHHHHCHHHHHHHHH T ss_conf 469999747998789997856899-9728998264425778884--037999858988879999999999 No 67 >pfam03466 LysR_substrate LysR substrate binding domain. The structure of this domain is known and is similar to the periplasmic binding proteins. Probab=84.92 E-value=2.4 Score=20.25 Aligned_cols=200 Identities=12% Similarity=0.089 Sum_probs=101.8 Q ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 34885999978877851223388999999884-88799997288689997887531266410110133378999887521 Q gi|254780561|r 27 PAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 27 ~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) ...++|+|.+..+. ....++.++..|.++| +|+|++... +..++...+. +...|+.+...... T Consensus 3 ~~~G~i~I~~~~~~--~~~~lp~~l~~f~~~~P~i~i~i~~~-~~~~i~~~l~----~g~~Di~i~~~~~~--------- 66 (209) T pfam03466 3 GPRGRLRIGAPPTF--AAYLLPPLLARFRERYPDVELELREG-DSEELLDLLA----EGELDLAIRRGPPD--------- 66 (209) T ss_pred CCCEEEEEEEEHHH--HHHHHHHHHHHHHHHCCCCEEEEEEC-CCHHHHHHHH----CCCCEEEEECCCCC--------- T ss_conf 98789999938899--99999999999999888928999989-8488999998----69814887317899--------- Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCC--CCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 2613677000210001243211532431000110011--12458764565212000682898504765303678999987 Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKR--QITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQK 183 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~--~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (338) . ..............++..++ .....+.++++|.+-.+ ........ ........... T Consensus 67 ----~------------~~~~~~~l~~~~~~~~~s~~~~l~~~~~i~~~~l~~~~~----i~~~~~~~-~~~~~~~~~~~ 125 (209) T pfam03466 67 ----D------------PGLEARPLFEEPLVLVAPPDHPLAAGEPVSLEDLADEPL----ILLEPGSG-LRDLVDRALER 125 (209) T ss_pred ----C------------CCEEEEEEEEEEEEEEECCCCHHHCCCCCCHHHHCCCCE----EEECCCCC-CHHHHHHHHHH T ss_conf ----9------------980899976241365533670554289754355146657----98569987-13799877876 Q ss_pred HCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCC Q ss_conf 17731378987776288468308478988862258656999504499999850799737883169808888642433789 Q gi|254780561|r 184 IYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRS 263 (338) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a 263 (338) . ....+......+..........|.. +++.......... ....+..+...+-.+....++.-.++. T Consensus 126 ~----------~~~~~~~~~~~~~~~~~~lv~~g~G---i~~lp~~~~~~~~-~~~~L~~i~~~~~~~~~~~~li~~~~~ 191 (209) T pfam03466 126 A----------GLRPRVALEVNSLEALLAAVAAGLG---IALLPRSAVAREL-ADGRLVVLPLPDPPLPRPIYLVYRKGR 191 (209) T ss_pred C----------CCCCCEEEEECCHHHHHHHHHHCCE---EEECCHHHHHHHH-HCCCEEEEECCCCCCCCEEEEEEECCC T ss_conf 4----------8763024674787999999994999---9991399999887-589989995888887447999998949 Q ss_pred CCHHHHHHHHHHHC Q ss_conf 99899999999963 Q gi|254780561|r 264 KQPQLAQEFMQFMI 277 (338) Q Consensus 264 ~n~~~A~~Fi~fll 277 (338) ....+++.|++|+. T Consensus 192 ~~s~~~~~f~~~l~ 205 (209) T pfam03466 192 RLSPAVRAFIDFLR 205 (209) T ss_pred CCCHHHHHHHHHHH T ss_conf 78999999999999 No 68 >COG4259 Uncharacterized protein conserved in bacteria [Function unknown] Probab=84.24 E-value=2.5 Score=20.08 Aligned_cols=35 Identities=11% Similarity=0.415 Sum_probs=21.1 Q ss_pred CCHHHHHH-HHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCC Q ss_conf 90279999-999999999986146742348859999788778 Q gi|254780561|r 1 MKKFARIV-VGIMMITGVISYCTLDGLPAKPVLTVYTYNSFV 41 (338) Q Consensus 1 MKk~~~~~-~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~ 41 (338) |++...+. +++++..+++++|+++ .-.+|.|++|. T Consensus 1 M~~~ktlsr~al~~av~~LagC~~g------pKslY~w~gYq 36 (121) T COG4259 1 MSKLKTLSRLALLLAVAALAGCGGG------PKSLYQWEGYQ 36 (121) T ss_pred CCCHHHHHHHHHHHHHHHHHHCCCC------CCCCCCCCCCC T ss_conf 9514889999999999999880579------86410137863 No 69 >PRK02998 prsA peptidylprolyl isomerase; Reviewed Probab=83.70 E-value=2.7 Score=19.94 Aligned_cols=23 Identities=26% Similarity=0.223 Sum_probs=13.6 Q ss_pred CCHHHHHHHHHHHH-HHHHHHHCC Q ss_conf 90279999999999-999986146 Q gi|254780561|r 1 MKKFARIVVGIMMI-TGVISYCTL 23 (338) Q Consensus 1 MKk~~~~~~~~~~~-~~~~~~~~~ 23 (338) |||...++++++++ +..+++|++ T Consensus 1 Mkkkk~~~~~~~~~~~l~LaaC~s 24 (283) T PRK02998 1 MKKKKLFIGTIISCVVLALSACGS 24 (283) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 935899999999999999986689 No 70 >PRK10833 putative assembly protein; Provisional Probab=82.86 E-value=2.9 Score=19.74 Aligned_cols=49 Identities=12% Similarity=0.201 Sum_probs=28.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEE Q ss_conf 9027999999999999998614674234885999978877851223388999999884887999 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKL 64 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~ 64 (338) |||++.++++++++..+.... |++.- + +..+.+.|.+..+++||-++.+ T Consensus 1 MKrll~~l~illvvvv~gl~a----------Lv~lv----d-PNd~k~~I~~qV~~~TGr~L~i 49 (617) T PRK10833 1 MRRFLTTLMILLVVLVAGLSA----------LVLLV----N-PNDFRAYMVKQVEARSGYQLQL 49 (617) T ss_pred CCHHHHHHHHHHHHHHHHHHH----------HHEEE----C-HHHHHHHHHHHHHHHHCCEEEE T ss_conf 941899999999999999998----------81504----8-8997999999999977987998 No 71 >PRK06481 fumarate reductase flavoprotein subunit; Validated Probab=82.46 E-value=1.7 Score=21.14 Aligned_cols=25 Identities=12% Similarity=0.226 Sum_probs=20.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9027999999999999998614674 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG 25 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~ 25 (338) |||++..+..++++..++++|+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~c~~~~ 25 (506) T PRK06481 1 KKKIWTTLGMLLLLALILTGCGSNT 25 (506) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 9218999999999999986507999 No 72 >pfam06316 Ail_Lom Enterobacterial Ail/Lom protein. This family consists of several bacterial and phage Ail/Lom-like proteins. The Yersinia enterocolitica Ail protein is a known virulence factor. Proteins in this family are predicted to consist of eight transmembrane beta-sheets and four cell surface-exposed loops. It is thought that Ail directly promotes invasion and loop 2 contains an active site, perhaps a receptor-binding domain. The phage protein Lom is expressed during lysogeny, and encode host-cell envelope proteins. Lom is found in the bacterial outer membrane, and is homologous to virulence proteins of two other enterobacterial genera. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=81.78 E-value=1.3 Score=21.72 Aligned_cols=54 Identities=11% Similarity=0.135 Sum_probs=27.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCC--CCCEEEEEECCCCC-CCHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf 902799999999999999861467423--48859999788778-5122338899999988488799997 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLP--AKPVLTVYTYNSFV-ADEGAGPKIKQAFERKCNCELKLIG 66 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~--~~~~l~v~~~~~~~-~~~~~~~~i~~~Fek~~gikV~~~~ 66 (338) |||+.. .+++++.++++|+.++.+ .+.|+.+ +|. ..- + ...|.+-.|+.++|.. T Consensus 1 m~~~~~---~~l~~~~~l~~~~~~a~A~~~~sTvS~----GYaQa~~---~--~~g~~~l~G~NlKYRY 57 (199) T pfam06316 1 MRKLCA---AILSAAICLAAAGTPASAAEHQSTLSA----GYLHAHT---D--APGSDDLNGINVKYRY 57 (199) T ss_pred CCHHHH---HHHHHHHHHHHHCCCCHHHCCCCEEEE----CEEEEEC---C--CCCCCCCCCEEEEEEE T ss_conf 901778---999999999972773112147866862----2100314---7--5667788853888664 No 73 >pfam07273 DUF1439 Protein of unknown function (DUF1439). This family consists of several hypothetical bacterial proteins of around 190 residues in length. Several members of this family are annotated as being putative lipoproteins and are often known as YceB. The function of this family is unknown. Probab=81.02 E-value=2.6 Score=20.04 Aligned_cols=45 Identities=22% Similarity=0.491 Sum_probs=26.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHH--HHHHHCC Q ss_conf 90279999999999999986146742348859999788778512233889999--9988488 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQA--FERKCNC 60 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~--Fek~~gi 60 (338) ||++..+ +++++.++++|.. |+.|+ .++.++.+.+.+. |+++.|| T Consensus 1 Mk~~~~~---~~~l~~~LsgC~~--------ls~ys----ISE~Ein~yL~k~~~~~k~~gi 47 (177) T pfam07273 1 MKKLLLL---ILLLVLLLSGCAS--------LSQYS----ISEAEINQYLQKEVHLEKKVGI 47 (177) T ss_pred CCHHHHH---HHHHHHHHHCCCC--------CCCEE----ECHHHHHHHHHHHCCHHHHHCC T ss_conf 9148999---9999999715575--------66252----6699999999874577786088 No 74 >pfam05170 AsmA AsmA family. The AsmA gene, whose product is involved in the assembly of outer membrane proteins in Escherichia coli. AsmA mutations were isolated as extragenic suppressors of an OmpF assembly mutant. AsmA may have a role in LPS biogenesis. Probab=80.23 E-value=3.6 Score=19.18 Aligned_cols=49 Identities=18% Similarity=0.312 Sum_probs=30.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEE Q ss_conf 9027999999999999998614674234885999978877851223388999999884887999 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKL 64 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~ 64 (338) |||++++++++++++.++... -+.++.| ..+.+.|.+.-+++||-+|.+ T Consensus 1 Mk~~lk~l~~i~~~lv~~~~~---------l~~~~d~------n~~k~~i~~~v~~~TGr~l~i 49 (537) T pfam05170 1 MKKALKILLIILIVLLLLIIA---------LIALFDP------NYFKPTIQQKVSAASGRPLQI 49 (537) T ss_pred CHHHHHHHHHHHHHHHHHHHH---------HEEEECH------HHHHHHHHHHHHHHHCCEEEE T ss_conf 945899999999999999998---------6075288------997999999999988985999 No 75 >PRK03095 prsA peptidylprolyl isomerase; Reviewed Probab=79.81 E-value=3.7 Score=19.10 Aligned_cols=22 Identities=23% Similarity=0.176 Sum_probs=12.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |||++..++ +++..++++|+.+ T Consensus 1 MKK~~la~~--~~svl~LaaC~~~ 22 (287) T PRK03095 1 MKKAMLALA--ATSVIALSACGTS 22 (287) T ss_pred CCHHHHHHH--HHHHHHHHHCCCC T ss_conf 907899999--9999999851689 No 76 >COG4521 TauA ABC-type taurine transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=79.50 E-value=3.4 Score=19.32 Aligned_cols=185 Identities=13% Similarity=0.100 Sum_probs=83.2 Q ss_pred HHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEE Q ss_conf 99998848879999728868999788753126641011013337899988752126136770002100012432115324 Q gi|254780561|r 52 QAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYD 131 (338) Q Consensus 52 ~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (338) ..|+|+.|.++.+..+.|+.++...+..+ +.++-. ++.++-......+..+.......... T Consensus 49 g~~aK~~gatiDWRkFdSG~~vv~AlASG--dvqiG~-iGSsplaaAaSr~vpie~f~~~~~ig---------------- 109 (334) T COG4521 49 GAFAKESGATIDWRKFDSGASIVRALASG--DVQIGN-IGSSPLAAAASRQVPIEVFLLASQIG---------------- 109 (334) T ss_pred CHHHHHCCCCCCHHHCCCHHHHHHHHHCC--CCCCCC-CCCCHHHHHHHCCCCEEEEEHHHHCC---------------- T ss_conf 35677607854511047536899998628--861046-57856667763588658853012128---------------- Q ss_pred EEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCCHHHH Q ss_conf 31000110011124587645652120006828985047653036789999871773137898777628846830847898 Q gi|254780561|r 132 YGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGWTESY 211 (338) Q Consensus 132 ~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (338) .+-.+..++- .....-.||. ..++..+-.. .....++.++..-+.+.+. +....-...+.. T Consensus 110 -~sEALVvr~g---sgI~kpeDL~----GK~iavPFvS--TtHysLLaaLkhw~idp~~---------V~IlNl~Pp~Ia 170 (334) T COG4521 110 -NSEALVVRKG---SGIEKPEDLI----GKRIAVPFVS--TTHYSLLAALKHWGIDPGQ---------VEILNLQPPAIA 170 (334) T ss_pred -CCCEEEEECC---CCCCCHHHHC----CCEECCCEEE--HHHHHHHHHHHHCCCCCCC---------EEEECCCCHHHH T ss_conf -6310355237---7869857864----6721042430--0089999999871878552---------468615978999 Q ss_pred HHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCC--EEEEEEEEEEECC--CCCHHHHHHHHHHHC Q ss_conf 88622586569995044999998507997378831698--0888864243378--999899999999963 Q gi|254780561|r 212 GFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEG--HYLQIEVAAQLVR--SKQPQLAQEFMQFMI 277 (338) Q Consensus 212 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG--~~~~~~~~~i~k~--a~n~~~A~~Fi~fll 277 (338) ..+.+|++|....|. .....+.+.+..+.-. -.-| .....+.+++-|+ -+|||.-..|.+-.+ T Consensus 171 AAwqRGDIDgAyVW~--PAl~el~ksGkVltDs-~qvgqwgaPTfdvwVvrkdfAekhPe~v~aFakv~~ 237 (334) T COG4521 171 AAWQRGDIDGAYVWA--PALSELKKSGKVLTDS-EQVGQWGAPTFDVWVVRKDFAEKHPEVVAAFAKVAL 237 (334) T ss_pred HHHHCCCCCCEEECC--HHHHHHHHCCCEECCH-HHHHCCCCCCEEEEEEEHHHHHHCHHHHHHHHHHHH T ss_conf 998707877225446--7688886258575268-886214798513688631766769699999999888 No 77 >pfam10566 Glyco_hydro_97 Glycoside hydrolase 97. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands. Probab=77.75 E-value=3.1 Score=19.57 Aligned_cols=23 Identities=26% Similarity=0.524 Sum_probs=17.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 90279999999999999986146 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL 23 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~ 23 (338) |||++.++++++++.+++++|+. T Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~ 23 (643) T pfam10566 1 MKKLTIILLAFLLLIGNLSLAAR 23 (643) T ss_pred CCCHHHHHHHHHHHHHHHHHHHC T ss_conf 95025789999999976567541 No 78 >PRK06934 flavodoxin; Provisional Probab=76.10 E-value=4.8 Score=18.45 Aligned_cols=35 Identities=20% Similarity=0.273 Sum_probs=16.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC-CCCCCEEEEEEC Q ss_conf 9027999999999999998614674-234885999978 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG-LPAKPVLTVYTY 37 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~v~~~ 37 (338) |+|.+.+++++++ ..++.|+.+. ..+++.|.+|=. T Consensus 9 ~~~~~~~~~~~~~--~~~~~~a~~~~q~~~k~LI~Yfs 44 (221) T PRK06934 9 LTKHLILFLAVAV--SSLSLIAEAADQNARRVLIVYFS 44 (221) T ss_pred HHHHHHHHHHHHH--HHHHHHHHCCCCCCCEEEEEEEC T ss_conf 4788999999999--98888750377677557999964 No 79 >pfam08085 Entericidin Entericidin EcnA/B family. This family consists of the entericidin antidote/toxin peptides. The entericidin locus is activated in stationary phase under high osmolarity conditions by rho-S and simultaneously repressed by the osmoregulatory EnvZ/OmpR signal transduction pathway. The entericidin locus encodes tandem paralogous genes (ecnAB) and directs the synthesis of two small cell-envelope lipoproteins which can maintain plasmids in bacterial population by means of post-segregational killing. Probab=75.64 E-value=2.5 Score=20.12 Aligned_cols=21 Identities=24% Similarity=0.641 Sum_probs=16.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 902799999999999999861 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYC 21 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~ 21 (338) |||.+.+++.+++++..+++| T Consensus 1 Mk~~~~~~~~l~~~~~~lagC 21 (42) T pfam08085 1 MKKLIALLLALLLLALVLAGC 21 (42) T ss_pred CHHHHHHHHHHHHHHHHHHCC T ss_conf 923699999999999998612 No 80 >COG2984 ABC-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=75.25 E-value=5.1 Score=18.32 Aligned_cols=76 Identities=7% Similarity=0.112 Sum_probs=36.2 Q ss_pred CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCC---EEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHH Q ss_conf 423488599997887785122338899999988488---79999728868999788753126641011013337899988 Q gi|254780561|r 25 GLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNC---ELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLAR 101 (338) Q Consensus 25 ~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gi---kV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~ 101 (338) ..+.+.++-|...-+...-+...+.+.++++++ |+ ++.+....+......++...-.....|+++........... T Consensus 26 ~~~~~~~VaI~~~veHpaLd~~~~G~~~aLk~~-G~~n~~i~~~na~~~~~~a~~iarql~~~~~dviv~i~tp~Aq~~~ 104 (322) T COG2984 26 AAADQITVAITQFVEHPALDAAREGVKEALKDA-GYKNVKIDYQNAQGDLGTAAQIARQLVGDKPDVIVAIATPAAQALV 104 (322) T ss_pred CCCCCEEEEEEEEECCHHHHHHHHHHHHHHHHC-CCCCEEEEEECCCCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHH T ss_conf 444433688998506655999999789999864-8667289962477885789999999614799679961778999999 No 81 >PRK02944 OxaA-like protein precursor; Validated Probab=75.07 E-value=3.7 Score=19.15 Aligned_cols=25 Identities=16% Similarity=0.319 Sum_probs=20.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9027999999999999998614674 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG 25 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~ 25 (338) |||...+++.+++++.+++||+... T Consensus 1 mkk~~~~~~~~~~~~~~lsgC~~~~ 25 (255) T PRK02944 1 MKKKLGLLAMVVALMAILAGCSEVN 25 (255) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 9289999999999999996246889 No 82 >PRK10510 putative outer membrane lipoprotein; Provisional Probab=73.67 E-value=3.7 Score=19.16 Aligned_cols=29 Identities=24% Similarity=0.285 Sum_probs=21.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 90279999999999999986146742348 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAK 29 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~ 29 (338) |||...++++++..+.++++|.......+ T Consensus 1 m~~r~~lia~~~a~~l~lsgC~t~p~tg~ 29 (219) T PRK10510 1 MKKRVYLIAAIVSGALAVSGCTTNPYTGE 29 (219) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 94268999999999987413567898766 No 83 >pfam03202 Lipoprotein_10 Putative mycoplasma lipoprotein, C-terminal region. Probab=73.62 E-value=2.5 Score=20.15 Aligned_cols=28 Identities=11% Similarity=0.235 Sum_probs=24.2 Q ss_pred EEEEEEEECCCCCHHHHHHHHHHHCCHH Q ss_conf 8864243378999899999999963999 Q gi|254780561|r 253 QIEVAAQLVRSKQPQLAQEFMQFMISPS 280 (338) Q Consensus 253 ~~~~~~i~k~a~n~~~A~~Fi~flls~e 280 (338) .|+-|+|..+.+.-+++++|++|+++.+ T Consensus 5 GPsLIGIhaNeked~aTk~FV~W~l~~~ 32 (134) T pfam03202 5 GPNLIGIHANEKENAETQKFVNWFLNTE 32 (134) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHCCC T ss_conf 8745553057655689999999996374 No 84 >TIGR02036 dsdC D-serine deaminase transcriptional activator; InterPro: IPR011781 This family, part of the LysR family of transcriptional regulators, activates transcription of the gene for D-serine deaminase, dsdA. Trusted members of this family so far are found adjacent to dsdA and only in Gammaproteobacteria, including Escherichia coli, Vibrio cholerae, and Colwellia psychrerythraea.. Probab=73.06 E-value=1.8 Score=20.98 Aligned_cols=192 Identities=16% Similarity=0.128 Sum_probs=93.7 Q ss_pred CCEEEEEECCCCCCCHHHHHHHHHHHHHHHC-CEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCC Q ss_conf 8859999788778512233889999998848-879999728868999788753126641011013337899988752126 Q gi|254780561|r 29 KPVLTVYTYNSFVADEGAGPKIKQAFERKCN-CELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFA 107 (338) Q Consensus 29 ~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~g-ikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~ 107 (338) .++||||+--|+. +.++-+-+.+|+++|+ |.+++.+ |+ +-++ -.+.++||.+..|+..-.++.=..++. T Consensus 95 SG~LT~YSRPSfA--QCWLVPri~~F~~~YPsIsL~~LT-GN--eNin-----fqg~GIDvAIYFDd~~p~kL~c~~im~ 164 (302) T TIGR02036 95 SGELTVYSRPSFA--QCWLVPRIADFKKRYPSISLKVLT-GN--ENIN-----FQGAGIDVAIYFDDKQPAKLSCKEIMD 164 (302) T ss_pred CCCEEECCCCCHH--HHHHHHHHHHHHHCCCCEEEEECC-CC--CCEE-----CCCCCCEEEEEECCCCCCCCCHHHHCC T ss_conf 2102002255333--443323232120038712211001-53--5320-----135750378870788874212012124 Q ss_pred CCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCC Q ss_conf 13677000210001243211532431000110011124587645652120006828985047653036789999871773 Q gi|254780561|r 108 KSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGD 187 (338) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (338) . +.-..-.|+|..+..+.....+.....++.--.+=+.+ T Consensus 165 E-----------------------------------------~I~PVCsPEYAq~~~L~~~~~NL~~cTLLHDnQAW~yd 203 (302) T TIGR02036 165 E-----------------------------------------TIVPVCSPEYAQKYALTSKVDNLSQCTLLHDNQAWSYD 203 (302) T ss_pred C-----------------------------------------CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 3-----------------------------------------31343481355555365445676521110001203788 Q ss_pred CHHHHHHHHHHC----------CCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEE--EEE Q ss_conf 137898777628----------84683084789888622586569995044999998507997378831698088--886 Q gi|254780561|r 188 NSAQVWKKIATK----------TATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYL--QIE 255 (338) Q Consensus 188 ~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~--~~~ 255 (338) .++|.+..++.+ ...|-.+.-.+..+...-.++|+= .+....++..|- +-..+ .+-.+. .-+ T Consensus 204 S~tDEW~~WAn~~~~n~L~~ss~IgFDRSDLAVIAA~NnaGvAMGR---~slv~~rl~sGE--L~~Pf-~d~~v~C~QrY 277 (302) T TIGR02036 204 SDTDEWKSWANAYAVNNLESSSSIGFDRSDLAVIAAMNNAGVAMGR---KSLVKKRLASGE--LVKPF-IDKKVLCKQRY 277 (302) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHCCH---HHHHHHHHCCCC--CCCCC-CCCEEEECCCE T ss_conf 8854789888632132113222367654479999998504411023---789886521675--32677-88655625630 Q ss_pred EEEEECCCCCHHHHHHHHHHHCC Q ss_conf 42433789998999999999639 Q gi|254780561|r 256 VAAQLVRSKQPQLAQEFMQFMIS 278 (338) Q Consensus 256 ~~~i~k~a~n~~~A~~Fi~flls 278 (338) +++-.-|.+|++- +.||+||-+ T Consensus 278 YvaTLPn~~n~Ki-elFI~WL~~ 299 (302) T TIGR02036 278 YVATLPNKKNTKI-ELFIEWLKE 299 (302) T ss_pred EEECCCCCCCCHH-HHHHHHHHH T ss_conf 6740888877057-888799996 No 85 >COG3019 Predicted metal-binding protein [General function prediction only] Probab=72.65 E-value=5.9 Score=17.94 Aligned_cols=66 Identities=18% Similarity=0.113 Sum_probs=33.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHH Q ss_conf 902799999999999999861467423488599997887785122338899999988488799997288689997 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLN 75 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~ 75 (338) |||+..+.+.+++. .....++.+...++++|- ++.=.=-..-++..| .+|++|+.+...+-..+.. T Consensus 1 Mrr~~~l~~l~a~~----l~~~~~~~a~~~~~~vyk----sPnCGCC~~w~~~mk-~~Gf~Vk~~~~~d~~alK~ 66 (149) T COG3019 1 MRRRAFLRSLAALG----LGSTGPAQAQATEMVVYK----SPNCGCCDEWAQHMK-ANGFEVKVVETDDFLALKR 66 (149) T ss_pred CCHHHHHHHHHHHH----HHCCCCHHCCEEEEEEEE----CCCCCCHHHHHHHHH-HCCCEEEEEECCCHHHHHH T ss_conf 91268999999998----531440010103578873----889861799999998-6795789852572899988 No 86 >PRK11143 glpQ glycerophosphodiester phosphodiesterase; Provisional Probab=72.15 E-value=6.1 Score=17.87 Aligned_cols=56 Identities=13% Similarity=0.077 Sum_probs=34.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECC-CCCCCHHHHHHHHHHHHH Q ss_conf 90279999999999999986146742348859999788-778512233889999998 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYN-SFVADEGAGPKIKQAFER 56 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~-~~~~~~~~~~~i~~~Fek 56 (338) ||-..+-+..++++.+++++.+..++++.+.|+|.+-| ++..+++-+..+..+++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~plVIAHRGasg~~PEnTl~Af~~Ai~~ 57 (359) T PRK11143 1 MKMTLKNLSLALLLASLVAGSSAMAADAAEKIVIAHRGASGYLPEHTLPAKAMAYAQ 57 (359) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHC T ss_conf 950288999999999998535644446899869968986788865789999999984 No 87 >pfam05590 DUF769 Xylella fastidiosa protein of unknown function (DUF769). This family consists of several uncharacterized hypothetical proteins of unknown function from Xylella fastidiosa, the organism that causes Pierce's disease in plants. Probab=71.20 E-value=4.1 Score=18.85 Aligned_cols=26 Identities=12% Similarity=0.250 Sum_probs=20.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 90279999999999999986146742 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGL 26 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~ 26 (338) |+...++..+++.++.++++|+++.- T Consensus 1 Msl~prL~~SllaAslLLagCSsgPp 26 (279) T pfam05590 1 MSLHPRLSCSLLAASLLLAGCSSGPP 26 (279) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCC T ss_conf 98666401699999999723788998 No 88 >TIGR03042 PS_II_psbQ_bact photosystem II protein PsbQ. This protein through the member sll1638 from Synechocystis sp. PCC 6803, was shown to be part of the cyanobacteria photosystem II. It is homologous to (but quite diverged from) the chloroplast PsbQ protein, called oxygen-evolving enhancer protein 3 (OEE3). We designate this cyanobacteria protein PsbQ by homology. Probab=69.94 E-value=6.8 Score=17.58 Aligned_cols=49 Identities=12% Similarity=0.274 Sum_probs=22.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9027999999999999998614674234885999978877851223388999999884 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC 58 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~ 58 (338) +|++..++++++++. +.+|++++++...+ |+ ..+-..++..++.|++.. T Consensus 1 lRsi~s~~L~~~~~f--LvsC~gp~a~~p~t---ys----p~~l~~Iq~~~~~i~~~r 49 (142) T TIGR03042 1 LRSLASLLLVLLLTF--LVSCSGPAAAVPPT---YS----PAQLAQIQRQAEGIEAAK 49 (142) T ss_pred CHHHHHHHHHHHHHH--HHHCCCCCCCCCCC---CC----HHHHHHHHHHHHHHHHHH T ss_conf 976999999999999--98838998889998---69----999999999998899999 No 89 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=69.86 E-value=6.8 Score=17.57 Aligned_cols=21 Identities=29% Similarity=0.420 Sum_probs=13.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |||++ .++++++.++++|+.- T Consensus 1 MkK~i---~~il~~~llL~GCs~m 21 (304) T pfam07901 1 MRKLI---KLILIATLLLSGCSTT 21 (304) T ss_pred CCHHH---HHHHHHHHHHCCCCCC T ss_conf 90189---9999999997133445 No 90 >COG5567 Predicted small periplasmic lipoprotein [Cell motility and secretion] Probab=69.57 E-value=4.4 Score=18.68 Aligned_cols=22 Identities=18% Similarity=0.403 Sum_probs=16.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 90279999999999999986146 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL 23 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~ 23 (338) ||+.++.++++++++ .+++|+. T Consensus 1 mk~~~~s~~ala~l~-sLA~CG~ 22 (58) T COG5567 1 MKNVFKSLLALATLF-SLAGCGL 22 (58) T ss_pred CHHHHHHHHHHHHHH-HHHHCCC T ss_conf 904899999999999-9985266 No 91 >COG5510 Predicted small secreted protein [Function unknown] Probab=69.01 E-value=4.8 Score=18.48 Aligned_cols=22 Identities=18% Similarity=0.469 Sum_probs=16.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9027999999999999998614 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCT 22 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~ 22 (338) |||++.+.+.+++++.++++|- T Consensus 2 mk~t~l~i~~vll~s~llaaCN 23 (44) T COG5510 2 MKKTILLIALVLLASTLLAACN 23 (44) T ss_pred CHHHHHHHHHHHHHHHHHHHHH T ss_conf 0489999999999999998866 No 92 >PRK02710 plastocyanin; Provisional Probab=68.04 E-value=7.5 Score=17.34 Aligned_cols=35 Identities=14% Similarity=0.037 Sum_probs=16.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEE Q ss_conf 902799999999999999861467423488599997 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYT 36 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~ 36 (338) |+|..+.+++++++..+..+++.+++.+ .+..|.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~A-~~~~v~m 35 (119) T PRK02710 1 MAKRLRSIAAALVAVVSSFGLGVSSASA-ATVEVKM 35 (119) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCC-CEEEEEE T ss_conf 9068899999999999999812570202-5289997 No 93 >pfam03180 Lipoprotein_9 NLPA lipoprotein. This family of bacterial lipoproteins contains several antigenic members, that may be involved in bacterial virulence. Their precise function is unknown. However they are probably distantly related to pfam00497 which are solute binding proteins. Probab=66.25 E-value=8.1 Score=17.13 Aligned_cols=211 Identities=13% Similarity=0.058 Sum_probs=99.6 Q ss_pred CHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCC Q ss_conf 12233889999998848879999728868999788753126641011013337899988752126136770002100012 Q gi|254780561|r 43 DEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKW 122 (338) Q Consensus 43 ~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 122 (338) ...+++.+.+ .-++.||+|+++.+.+.... ++..+ ++..|.-+.-...++.......-.. .... T Consensus 10 ~~~i~~~v~~-~~~~~Gi~veiv~F~Dy~~p-N~AL~---~GeIDaN~fQH~~yl~~~n~~~g~~-----------L~~v 73 (236) T pfam03180 10 HAEVLEVAKP-LAKKKGLDLEIVEFTDYVQP-NTALA---DGEIDANAFQHLPYLDQFNKEGGLD-----------LVAV 73 (236) T ss_pred HHHHHHHHHH-HHHHCCCEEEEEEECCCCCH-HHHHH---CCCCCHHHHCCHHHHHHHHHHCCCC-----------EEEE T ss_conf 6999999999-99964987999981686455-07977---8995524443899999999977996-----------7996 Q ss_pred CCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHH---HHCCCC---HHHHHHHH Q ss_conf 432115324310001100111245876456521200068289850476530367899998---717731---37898777 Q gi|254780561|r 123 DDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQ---KIYGDN---SAQVWKKI 196 (338) Q Consensus 123 ~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~---~~~~~~~~ 196 (338) +..+..|... +...-+|.+++ ....++++++...+......++.-. ...... ........ T Consensus 74 ~~~~~~p~gl-----------YS~k~ksl~~l---p~Ga~IaIpnD~sN~~RAL~lL~~aGLI~Lk~~~~~~~t~~DI~~ 139 (236) T pfam03180 74 GNTHVEPIGL-----------YSKKYKSLSEL---PDGATIAVPNDPSNEGRALLLLEKAGLIKLKDGAGLLATVKDITE 139 (236) T ss_pred EEEEECCEEE-----------EECCCCCHHHC---CCCCEEEECCCCCHHHHHHHHHHHCCCEEECCCCCCCCCHHHHHH T ss_conf 3046723795-----------43376887585---799989815884429999999998898897689998789888972 Q ss_pred HHCCCEEECCC-HHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEE-CCCCEEEEEEEEEEECCCCCHHHHHHHHH Q ss_conf 62884683084-78988862258656999504499999850799737883-16980888864243378999899999999 Q gi|254780561|r 197 ATKTATVTKGW-TESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAAL-FSEGHYLQIEVAAQLVRSKQPQLAQEFMQ 274 (338) Q Consensus 197 ~~~~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~ 274 (338) .....++..-. .+....+ .+++.++.+.+. ....+..+..... .++-.-.....+++-++-++.+..+++++ T Consensus 140 Npk~l~~~ev~a~ql~~~l--~dvD~avin~n~----a~~agl~p~~~~l~~e~~~~~y~n~ivvr~~~~d~~~ik~l~e 213 (236) T pfam03180 140 NPKNLKIKELEAAQLPRAL--DDVDAAVINTNY----ALQAGLDPKKDALFEEDKDSPYVNIIVVREDDKDDPAVKKLVK 213 (236) T ss_pred CCCCCEEEEECHHHHHHHC--CCCCEEEECHHH----HHHCCCCHHHHHHHCCCCCCCEEEEEEEECCCCCCHHHHHHHH T ss_conf 8677669991677755421--665789986567----9887949212133147889983799998177769999999999 Q ss_pred HHCCHHHHHHHHHCC Q ss_conf 963999999988504 Q gi|254780561|r 275 FMISPSFQRILPTTN 289 (338) Q Consensus 275 flls~e~Q~~~a~~~ 289 (338) -+.|+|+++.+.+.+ T Consensus 214 ~~~s~~vk~~i~~~y 228 (236) T pfam03180 214 AYQSEEVKAFIEKKY 228 (236) T ss_pred HHCCHHHHHHHHHHC T ss_conf 987999999999886 No 94 >PRK12684 transcriptional regulator CysB-like protein; Reviewed Probab=66.01 E-value=8.2 Score=17.10 Aligned_cols=207 Identities=12% Similarity=0.066 Sum_probs=97.7 Q ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 34885999978877851223388999999884-88799997288689997887531266410110133378999887521 Q gi|254780561|r 27 PAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 27 ~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) ...++|+|.+..+. -...++.++..|.++| +|+|++.. +++.++...+.. ...|+.+..... T Consensus 90 ~~~G~l~ig~~~s~--~~~~lp~~l~~f~~~~P~v~l~l~~-~~~~~~~~~l~~----g~~Dl~i~~~~~---------- 152 (313) T PRK12684 90 QDQGNLTIATTHTQ--ARYALPAAIAEFKKRYPKVRLSILQ-GSPTQIAEMVIH----DQADLAIATEAI---------- 152 (313) T ss_pred CCCCEEECCHHHHH--HHHCCCHHHHHHHHHCCCEEEEEEE-CCHHHHHHHHHC----CCCCEEECCCCC---------- T ss_conf 77751411024555--7515949999998448980899971-788999999977----996655043577---------- Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCC--CCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 261367700021000124321153243100011001--112458764565212000682898504765303678999987 Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDK--RQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQK 183 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~--~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (338) . ... .. ...+........+... ......+-++.||.+.. ..+.... +............ T Consensus 153 ~--~~~---~l---------~~~~l~~~~~~~v~~~~hpla~~~~v~l~dL~~~p----~I~~~~~-~~~r~~i~~~~~~ 213 (313) T PRK12684 153 A--DYK---EL---------VSLPCYQWNHAVVVPPDHPLLERKPLTLEDLAQYP----LITYDDA-FAGRSKINKAFAL 213 (313) T ss_pred C--CCC---CE---------EEEEEEECCEEEEECCCCCCCCCCCCCHHHHCCCC----EEECCCC-CCHHHHHHHHHHH T ss_conf 7--778---73---------89980322168995389854469999999985999----8962899-8589999999997 Q ss_pred HCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCC Q ss_conf 17731378987776288468308478988862258656999504499999850799737883169808888642433789 Q gi|254780561|r 184 IYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRS 263 (338) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a 263 (338) . ....+...-..+......++..|-. ++....... ..+....+..+...+-.....-+++..|+. T Consensus 214 ~----------g~~p~~~~e~~~~~~i~~~V~~GlG---i~ilP~~a~--~~~~~~~L~~lp~~~~~~~~~~~v~~~k~~ 278 (313) T PRK12684 214 R----------GLKPDIVLEAIDADVIKTYVELGLG---VGIVADMAF--DPERDRNLRAIPAGHLFGSNVTRVALKQGA 278 (313) T ss_pred C----------CCCCCEEEEECCHHHHHHHHHHCCC---HHHHHHHHH--CHHHCCCEEEEECCCCCCCEEEEEEEECCC T ss_conf 7----------9987479999979999999995985---725599984--676389879997846776605999997988 Q ss_pred CCHHHHHHHHHHHCCHHHHHHH Q ss_conf 9989999999996399999998 Q gi|254780561|r 264 KQPQLAQEFMQFMISPSFQRIL 285 (338) Q Consensus 264 ~n~~~A~~Fi~flls~e~Q~~~ 285 (338) .=+..|+.||+|+- +.-++.+ T Consensus 279 ~ls~~~r~FIe~l~-e~l~r~~ 299 (313) T PRK12684 279 YLRGYVYTFIELFS-PTLNRKL 299 (313) T ss_pred CCCHHHHHHHHHHH-HHCCHHH T ss_conf 66899999999987-4659899 No 95 >PRK12682 transcriptional regulator CysB-like protein; Reviewed Probab=65.64 E-value=8.4 Score=17.06 Aligned_cols=209 Identities=13% Similarity=0.070 Sum_probs=98.2 Q ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 34885999978877851223388999999884-88799997288689997887531266410110133378999887521 Q gi|254780561|r 27 PAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 27 ~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) ...++|+|.+..+ .....++.++.+|.++| +|++++... ++.++...+. +...|+.+.... + T Consensus 90 ~~~G~lrIg~~~~--~~~~~lp~~l~~f~~~~P~v~l~l~~~-~~~~~~~~l~----~g~~Dl~i~~~~----------~ 152 (309) T PRK12682 90 QDSGTLTIATTHT--QARYVLPRVVAKFRKRYPKVNLSLHQG-SPDEIAQMVI----SGEADIGIATES----------L 152 (309) T ss_pred CCCCCCCCCCCHH--HHHHCCCHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHH----CCCCCHHHHHCC----------C T ss_conf 6777412124313--332038625399998789837999837-8299999997----699764565077----------7 Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCC--CCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 2613677000210001243211532431000110011--12458764565212000682898504765303678999987 Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKR--QITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQK 183 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~--~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (338) .. . + .....+.......++..++ .....+-++.||.+.. ..+.... +............ T Consensus 153 ~~--~----~--------~l~~~~l~~~~~~~v~~~~h~la~~~~i~~~dL~~~p----~I~~~~~-~~~r~~~~~~~~~ 213 (309) T PRK12682 153 DD--D----P--------DLVTLPCYDWTHAVIVPSDHPLAQKERITLEDLAEYP----LITYHPG-FTGRSAIDKAFAA 213 (309) T ss_pred CC--C----C--------CEEEEEEECCCEEEEECCCCHHHCCCCCCHHHHCCCC----EEEECCC-CCHHHHHHHHHHH T ss_conf 78--9----8--------8089993315236761699721149999999982899----7972799-8679999999997 Q ss_pred HCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCC Q ss_conf 17731378987776288468308478988862258656999504499999850799737883169808888642433789 Q gi|254780561|r 184 IYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRS 263 (338) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a 263 (338) . ....+......+......+...|.. ++.......... ....+..+...+......-+++..|+. T Consensus 214 ~----------g~~p~~~~~~~~~~~~~~~v~~G~G---i~~lp~~~~~~~--~~~~Lv~v~~~~~~~~~~~~l~~rk~~ 278 (309) T PRK12682 214 A----------GLQPDIVLEAIDSDVIKTYVRLGLG---VGIIAEMAYRPD--RDGDLVALPAGHLFGPNTAWVAVKRGA 278 (309) T ss_pred C----------CCCCCEEEEECCHHHHHHHHHHCCE---EEHHHHHHHHHH--HCCCEEEEECCCCCCCCEEEEEEECCC T ss_conf 7----------9996269999989999999995994---242289983376--479989997988777617999996966 Q ss_pred CCHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 998999999999639999999885 Q gi|254780561|r 264 KQPQLAQEFMQFMISPSFQRILPT 287 (338) Q Consensus 264 ~n~~~A~~Fi~flls~e~Q~~~a~ 287 (338) .-+..++.||+|+. +...+.+.+ T Consensus 279 ~l~~~~~~FId~l~-e~l~~~~~~ 301 (309) T PRK12682 279 YLRNYVYDFIELLA-PHLSRELIE 301 (309) T ss_pred CCCHHHHHHHHHHH-HHCCHHHHH T ss_conf 11899999999999-872989999 No 96 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=65.17 E-value=7.4 Score=17.36 Aligned_cols=23 Identities=13% Similarity=0.306 Sum_probs=13.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |+|+++.++.++.++ ++++|++. T Consensus 3 ~nk~~K~l~ia~~~l-~LaACSS~ 25 (173) T PRK10802 3 LNKVLKGLMIALPVM-AIAACSSN 25 (173) T ss_pred HHHHHHHHHHHHHHH-HHHHCCCC T ss_conf 078999999999999-99855799 No 97 >pfam06097 DUF945 Bacterial protein of unknown function (DUF945). This family consists of several hypothetical bacterial proteins of unknown function. Probab=65.06 E-value=5.5 Score=18.14 Aligned_cols=53 Identities=13% Similarity=0.206 Sum_probs=28.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHH---HCCEEEEEECC Q ss_conf 902799999999999999861467423488599997887785122338899999988---48879999728 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERK---CNCELKLIGLS 68 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~---~gikV~~~~~~ 68 (338) ||| . ++++++.+++++.+|+ +|..+.--+..++..++...++ .+++|+++.+. T Consensus 1 MkK-~-va~gviv~l~a~~~g~-------------~wytG~~~E~~~~~~v~~lN~~l~~~~~~~~~~~y~ 56 (460) T pfam06097 1 MKK-S-VAIGVIVALGAVWTGG-------------PWYTGKQAEAEYQDAVAQLNAELKNPGLKLENVSYD 56 (460) T ss_pred CCE-E-EEEHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECC T ss_conf 961-6-6209899999998534-------------657678999999999999988742585799999846 No 98 >PRK02515 psbU photosystem II complex extrinsic protein precursor U; Provisional Probab=64.17 E-value=7.9 Score=17.21 Aligned_cols=25 Identities=12% Similarity=0.060 Sum_probs=20.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9027999999999999998614674 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG 25 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~ 25 (338) |||+.+.+..++++++++.+|.+.. T Consensus 1 mkrl~~~L~~ll~v~~L~lg~~g~~ 25 (144) T PRK02515 1 MKRLLSWLVRLLVVLSLLLGLLGWL 25 (144) T ss_pred CCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 9179999999999999999984466 No 99 >COG1638 DctP TRAP-type C4-dicarboxylate transport system, periplasmic component [Carbohydrate transport and metabolism] Probab=63.91 E-value=9 Score=16.87 Aligned_cols=83 Identities=17% Similarity=0.252 Sum_probs=36.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCC--CEEEEEECC-CCCCCHHHHHHHHHHHHHHHC--CEEEEEE---CCCHHH Q ss_conf 90279999999999999986146742348--859999788-778512233889999998848--8799997---288689 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAK--PVLTVYTYN-SFVADEGAGPKIKQAFERKCN--CELKLIG---LSDGVA 72 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~--~~l~v~~~~-~~~~~~~~~~~i~~~Fek~~g--ikV~~~~---~~~~~~ 72 (338) |++...+++++++++++++ ++.++. .++...... ...+.......+.+.-++++| ++|++.. .|+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~----~~~a~~~~~~l~~~~~~~~~~p~~~~~~~fa~~v~ekt~G~l~i~vfP~~qLG~~~~ 76 (332) T COG1638 1 MMKLLALLLALLLAAAALA----SAAAAAGALVLRFSHVTPEGHPKGKAAKKFAELVEEKTGGRLKIEVFPNSQLGGEAE 76 (332) T ss_pred CCCHHHHHHHHHHHHHHHH----HHHHHHCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCHHH T ss_conf 9513789999999999977----778740154887523579988078999999999998719938999766756687699 Q ss_pred HHHHHHHHHCCCCCEEEEC Q ss_conf 9978875312664101101 Q gi|254780561|r 73 LLNKLRMEGENSAADIVLG 91 (338) Q Consensus 73 ~~~~~~a~~~~~~~Dv~~~ 91 (338) ++.+++.+ ..|.... T Consensus 77 ~ie~l~~G----~id~~~~ 91 (332) T COG1638 77 MIEQLRSG----TLDIGVV 91 (332) T ss_pred HHHHHHCC----CCEEEEC T ss_conf 99999659----7307850 No 100 >PRK12473 hypothetical protein; Provisional Probab=63.45 E-value=9.2 Score=16.82 Aligned_cols=85 Identities=24% Similarity=0.301 Sum_probs=49.4 Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCC---CCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH----CCEEEEEECCCHHHHH Q ss_conf 027999999999999998614674---234885999978877851223388999999884----8879999728868999 Q gi|254780561|r 2 KKFARIVVGIMMITGVISYCTLDG---LPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC----NCELKLIGLSDGVALL 74 (338) Q Consensus 2 Kk~~~~~~~~~~~~~~~~~~~~~~---~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~----gikV~~~~~~~~~~~~ 74 (338) ||...+.+++++.++++++|+.-. ..+.+ +.++. ++...++++..+.+.+ =.+|+.-+.+... T Consensus 1 kK~~~l~l~a~is~~~ls~CSl~g~~~epANG-vli~G------dE~~~~~Iid~yK~~~ke~e~f~vK~g~~~~~k--- 70 (193) T PRK12473 1 KKLVGLGLVAAISFGALSGCSLLGMIAEKANG-FVLYG------DEEQVQQIMDKYKDEVKSKDFYKMKMGTLEGKK--- 70 (193) T ss_pred CCEEEEHHHHHHHHHHHCCCHHCCCCCCCCCE-EEEEE------CHHHHHHHHHHHHHHHHHHHEEEEEEEEECCCE--- T ss_conf 93675329999999985000104887789877-99996------568889999987776533002678986677817--- Q ss_pred HHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCC Q ss_conf 788753126641011013337899988752126136 Q gi|254780561|r 75 NKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSN 110 (338) Q Consensus 75 ~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~ 110 (338) ++..+......+...+++...+ T Consensus 71 --------------VLim~ksTAe~miK~~l~Rk~D 92 (193) T PRK12473 71 --------------VLVMDKTTAEKMVKKELLKKVD 92 (193) T ss_pred --------------EEEEECHHHHHHHHHHHHEECC T ss_conf --------------9998318999998722411303 No 101 >TIGR01047 nspC carboxynorspermidine decarboxylase; InterPro: IPR005730 Carboxynorspermidine synthase, mediates the nicotinamide-nucleotide-linked reduction of the Schiff base H2N(CH2)3N = CHCH2CH(NH2)COOH. This is formed from L-aspartic beta-semialdehyde (ASA) and 1,3-diaminopropane (DAP) and is reduced to carboxynorspermidine [H2N(CH2)3NH(CH2)2CH(NH2)COOH], an intermediate in the novel pathway for norspermidine biosynthesis shown in Vibrio alginolyticus.; GO: 0016830 carbon-carbon lyase activity, 0045312 nor-spermidine biosynthetic process. Probab=62.58 E-value=9.1 Score=16.84 Aligned_cols=32 Identities=9% Similarity=0.185 Sum_probs=16.4 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHHC-CEEEEEECCC Q ss_conf 9788778512233889999998848-8799997288 Q gi|254780561|r 35 YTYNSFVADEGAGPKIKQAFERKCN-CELKLIGLSD 69 (338) Q Consensus 35 ~~~~~~~~~~~~~~~i~~~Fek~~g-ikV~~~~~~~ 69 (338) .||.+|+.+ .+=.++|+|.++|+ |+|- .+.|+ T Consensus 223 ~T~~GYDv~--~LIa~~K~F~erh~~v~vi-LEPGe 255 (403) T TIGR01047 223 ITKKGYDVE--KLIAVIKAFKERHGNVQVI-LEPGE 255 (403) T ss_pred ECCCCCCHH--HHHHHHHHHHHCCCCEEEE-EECCC T ss_conf 457888889--9999986665117853899-71582 No 102 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=62.38 E-value=9.7 Score=16.70 Aligned_cols=23 Identities=13% Similarity=0.304 Sum_probs=13.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |+|+..+++.+++ +.++++|++. T Consensus 1 m~~~k~l~~~~~l-~l~l~gCs~~ 23 (243) T PRK10866 1 MTRMKYLVAAATL-SLFLAGCSGS 23 (243) T ss_pred CCHHHHHHHHHHH-HHHHHHCCCC T ss_conf 9458999999999-9999876899 No 103 >PRK10355 xylF D-xylose transporter subunit XylF; Provisional Probab=62.07 E-value=9.8 Score=16.67 Aligned_cols=71 Identities=6% Similarity=0.132 Sum_probs=30.9 Q ss_pred CCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHH---HHCCEEEEEECCCH-HHHHHHHHHHHCCCCCEEE--ECCCHHHH Q ss_conf 742348859999788778512233889999998---84887999972886-8999788753126641011--01333789 Q gi|254780561|r 24 DGLPAKPVLTVYTYNSFVADEGAGPKIKQAFER---KCNCELKLIGLSDG-VALLNKLRMEGENSAADIV--LGFDNNLI 97 (338) Q Consensus 24 ~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek---~~gikV~~~~~~~~-~~~~~~~~a~~~~~~~Dv~--~~~~~~~~ 97 (338) ++.+++.++=+. ..+. ....+..+.+.|++ +.|++|.+....+. ....+++.. -....+|.+ ...+...+ T Consensus 20 ~a~a~~~~IGv~-~~~~--~~~~~~~~~~~~~~~Ake~G~~v~v~dA~~D~~~Qi~qIe~-~I~qgvdaIiv~p~D~~al 95 (330) T PRK10355 20 AAHAKEVKIGMA-IDDL--RLERWQKDRDIFVKKAESLGAKVFVQSANGNEETQMSQIEN-MINRGVDVLVIIPYNGQVL 95 (330) T ss_pred HHHHCCCEEEEE-ECCC--CCHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHH-HHHCCCCEEEEECCCHHHH T ss_conf 112138679999-0688--88689999999999999759989997189998999999999-9985999999969987888 Q ss_pred H Q ss_conf 9 Q gi|254780561|r 98 D 98 (338) Q Consensus 98 ~ 98 (338) . T Consensus 96 ~ 96 (330) T PRK10355 96 S 96 (330) T ss_pred H T ss_conf 9 No 104 >PRK02463 OxaA-like protein precursor; Provisional Probab=61.62 E-value=2 Score=20.65 Aligned_cols=25 Identities=12% Similarity=0.279 Sum_probs=17.8 Q ss_pred HHHHHHHCCHHHHHHHHHCCCEEEC Q ss_conf 9999996399999998850450505 Q gi|254780561|r 270 QEFMQFMISPSFQRILPTTNWMYPV 294 (338) Q Consensus 270 ~~Fi~flls~e~Q~~~a~~~~~~P~ 294 (338) .-++||.+.|...+.+++...-.|. T Consensus 244 qliin~~~~p~~~~~i~~e~~~~p~ 268 (307) T PRK02463 244 QLITTYILKPRLRKQIKEEFAKNPP 268 (307) T ss_pred HHHHHCCCCHHHHHHHHHHHHHCCC T ss_conf 9999622676799999999872899 No 105 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=60.63 E-value=9.1 Score=16.85 Aligned_cols=24 Identities=17% Similarity=0.376 Sum_probs=15.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 0279999999999999986146742 Q gi|254780561|r 2 KKFARIVVGIMMITGVISYCTLDGL 26 (338) Q Consensus 2 Kk~~~~~~~~~~~~~~~~~~~~~~~ 26 (338) ||++.++++++++++ +++|+.+.. T Consensus 1 Kk~~~~~~~~~~~~~-l~~C~~~~~ 24 (289) T TIGR03659 1 KKILSLILLALLSLG-LTGCSSSKA 24 (289) T ss_pred CCHHHHHHHHHHHHH-HHHCCCCCC T ss_conf 912999999999999-843289987 No 106 >pfam12262 Lipase_bact_N Bacterial virulence factor lipase N-terminal. This domain family is found in bacteria, and is typically between 258 and 271 amino acids in length. There are two conserved sequence motifs: DGT and DGWST. This family is the N-terminal region of bacterial virulence factor lipase. The N-terminal region contains a potential signalling sequence. Probab=58.98 E-value=8.6 Score=17.01 Aligned_cols=24 Identities=21% Similarity=0.198 Sum_probs=12.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 90279999999999999986146742 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGL 26 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~ 26 (338) |||. ++.++++.+..+++|+.... T Consensus 1 MKk~--~l~~~i~SAL~LaGCg~~s~ 24 (269) T pfam12262 1 MKKK--FLALLLASALLLAGCGDDTE 24 (269) T ss_pred CCHH--HHHHHHHHHHHHCCCCCCCC T ss_conf 9247--99999999864111579965 No 107 >PRK10449 heat-inducible protein; Provisional Probab=58.82 E-value=5.2 Score=18.28 Aligned_cols=21 Identities=29% Similarity=0.547 Sum_probs=11.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9027999999999999998614674 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG 25 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~ 25 (338) |||++.+++ +..++++|+++. T Consensus 1 MKk~l~l~~----~~llLagC~s~~ 21 (140) T PRK10449 1 MKKVVALVA----LSLLMAGCVSSG 21 (140) T ss_pred CCHHHHHHH----HHHHHHHCCCCC T ss_conf 903499999----999997523899 No 108 >smart00062 PBPb Bacterial periplasmic substrate-binding proteins. bacterial proteins, eukaryotic ones are in PBPe Probab=58.26 E-value=11 Score=16.27 Aligned_cols=178 Identities=15% Similarity=0.115 Sum_probs=88.8 Q ss_pred HHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCC Q ss_conf 22338899999988488799997288689997887531266410110133378999887521261367700021000124 Q gi|254780561|r 44 EGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWD 123 (338) Q Consensus 44 ~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 123 (338) .++.-+|++++.++.|+++++... +....+..++ ...+|++............ .+ T Consensus 23 ~G~~~Dil~~~~~~lg~~i~~~~~-~~~r~~~~l~----~g~~D~~~~~~~~~~~r~~---~~----------------- 77 (219) T smart00062 23 TGFDVDLAKAIAKELGLKVEFVEV-SFDNLLTALK----SGKIDVVAAGMTITPERAK---QV----------------- 77 (219) T ss_pred EEEHHHHHHHHHHHHCCCEEEEEC-CHHHHHHHHH----CCCCCEEEEECCCCHHHHH---CC----------------- T ss_conf 882999999999997996799984-7899999997----5897705311357867750---02----------------- Q ss_pred CCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHC-CCE Q ss_conf 3211532431000110011124587645652120006828985047653036789999871773137898777628-846 Q gi|254780561|r 124 DDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATK-TAT 202 (338) Q Consensus 124 ~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 202 (338) .+..|+......+.++++ ..+.+..+|.. .++++... +... +.+.....+ ... T Consensus 78 -~~s~p~~~~~~~~~~~~~---~~~~~~~dL~g----~~i~~~~g--~~~~----------------~~l~~~~~~~~~~ 131 (219) T smart00062 78 -DFSDPYYKSGQVILVRKD---SPIKSLEDLKG----KKVAVVAG--TTGE----------------ELLKKLYPEAKIV 131 (219) T ss_pred -CCCCCEECCCEEEEEECC---CCCCCHHHHCC----CEEEEECC--CHHH----------------HHHHHHCCCCEEE T ss_conf -224550024447999658---99799799679----77999579--8799----------------9999728785289 Q ss_pred EECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHH Q ss_conf 8308478988862258656999504499999850799737883169808888642433789998999999999 Q gi|254780561|r 203 VTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQF 275 (338) Q Consensus 203 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~f 275 (338) ...+..+...++..|++++.++............+.....++... .....+.++++.++ ++++...|=+. T Consensus 132 ~~~~~~~~~~~l~~g~vD~~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~--~~~l~~~~n~~ 201 (219) T smart00062 132 SYDSQAEALAALKAGRADAAVADAPALAALVKQHGLPELKIVGDP-LDTPEGYAFAVRKG--DPELLDKINKA 201 (219) T ss_pred EECCHHHHHHHHHCCCEEEEEECHHHHHHHHHHCCCCCEEEECCC-CCCCCEEEEEEECC--CHHHHHHHHHH T ss_conf 859999999999839879999779999999984899983783467-77764199999599--99999999999 No 109 >PRK10936 periplasmic sensory protein associated with the TorRS two-component regulatory system; Provisional Probab=57.84 E-value=12 Score=16.23 Aligned_cols=47 Identities=13% Similarity=0.115 Sum_probs=22.9 Q ss_pred HHHHHHHHHHHH---HHHCCEEEEEECCCH---HHHHHHHHHHHCCCCCEEEEC Q ss_conf 223388999999---884887999972886---899978875312664101101 Q gi|254780561|r 44 EGAGPKIKQAFE---RKCNCELKLIGLSDG---VALLNKLRMEGENSAADIVLG 91 (338) Q Consensus 44 ~~~~~~i~~~Fe---k~~gikV~~~~~~~~---~~~~~~~~a~~~~~~~Dv~~~ 91 (338) ..++..+-...+ |+.||+|++...++. ...+.+++. -....+|-++. T Consensus 57 d~yW~~v~~G~~~~Ak~lGv~l~v~~a~~~~d~~~Q~~qie~-~i~~~vDAIil 109 (340) T PRK10936 57 DSYWLSVNYGLVQEAKRLGVDLKVLEAGGYYNLATQQQQLEQ-CVAWGADAILL 109 (340) T ss_pred CHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHH-HHHCCCCEEEE T ss_conf 869999999999999980986999858998899999999999-99759999998 No 110 >PRK11242 DNA-binding transcriptional regulator CynR; Provisional Probab=57.54 E-value=12 Score=16.20 Aligned_cols=198 Identities=12% Similarity=0.127 Sum_probs=100.2 Q ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 34885999978877851223388999999884-88799997288689997887531266410110133378999887521 Q gi|254780561|r 27 PAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 27 ~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) ...++|+|.+..++ -...+++++..|.++| +|+|++... +..++...+. +...|+.+...+. T Consensus 88 ~~~G~l~ig~~~~~--~~~~l~~~l~~f~~~~P~i~i~l~~~-~~~~~~~~l~----~~~~D~~i~~~~~---------- 150 (292) T PRK11242 88 LSRGSLRLAMTPTF--TAYLIGPLIDAFHARYPGITLTIREM-PQERIEALLA----DDELDVGIAFAPV---------- 150 (292) T ss_pred CCCCEEEEECHHHH--HHHHHHHHHHHHHHHCCCCCEEEEEC-CHHHHHHHHH----CCCCCEEEEECCC---------- T ss_conf 77765863024577--88873899888886488973489976-8899999986----7985579983688---------- Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCC---CCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 26136770002100012432115324310001100111---245876456521200068289850476530367899998 Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQ---ITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQ 182 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~---~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (338) ..+. ....+........+..++. ....+.++.+|.+.. ..+..... ........... T Consensus 151 ---~~~~------------l~~~~l~~~~~~~v~~~~h~la~~~~~is~~~L~~~~----~i~~~~~~-~~~~~~~~~~~ 210 (292) T PRK11242 151 ---HSPE------------IEAQPLFTETLALVVGRTHPLAARRKPLTLDELADEP----LVLLSAEF-ATREQIDRYFR 210 (292) T ss_pred ---CCCC------------CEEEEECCCEEEEEEECCCHHHHCCCCCCHHHHCCCC----EEEECCCC-CHHHHHHHHHH T ss_conf ---8877------------2565420450799990898042279999999981899----89967998-57999999999 Q ss_pred HHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECC Q ss_conf 71773137898777628846830847898886225865699950449999985079973788316980888864243378 Q gi|254780561|r 183 KIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVR 262 (338) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~ 262 (338) .. ....+...-.++......++..|.. +++ ...... ...+++..+...+..+...-+++-.|+ T Consensus 211 ~~----------~i~~~~~~~~~~~~~~~~~V~~g~G-i~i--lP~~~~----~~~~~l~~i~l~~~~~~r~i~lv~~k~ 273 (292) T PRK11242 211 RH----------GITPRVAIEANSISAVLEIVRRGRL-ATL--LPAAIA----REHDDLCAVALDPPLPQRTAALLRRKG 273 (292) T ss_pred HC----------CCCCCEEEEECCHHHHHHHHHHCCE-EEE--EHHHHH----HHCCCEEEEECCCCCCEEEEEEEEECC T ss_conf 76----------9998536753889999999997996-997--028986----008998999898998703999999893 Q ss_pred CCCHHHHHHHHHHHCC Q ss_conf 9998999999999639 Q gi|254780561|r 263 SKQPQLAQEFMQFMIS 278 (338) Q Consensus 263 a~n~~~A~~Fi~flls 278 (338) .....+++.|++|+.. T Consensus 274 ~~~s~~~~~Fid~l~e 289 (292) T PRK11242 274 AYRSAAARAFIELALE 289 (292) T ss_pred CCCCHHHHHHHHHHHH T ss_conf 9889999999999986 No 111 >PRK12683 transcriptional regulator CysB-like protein; Reviewed Probab=56.57 E-value=12 Score=16.10 Aligned_cols=207 Identities=14% Similarity=-0.005 Sum_probs=95.7 Q ss_pred CCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCC Q ss_conf 4885999978877851223388999999884-887999972886899978875312664101101333789998875212 Q gi|254780561|r 28 AKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLF 106 (338) Q Consensus 28 ~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~ 106 (338) ..++|.|.+..+. -...++.++..|.++| +|+|++... ++.++...+.. ...|+.+....... ..++ T Consensus 91 ~~G~l~Ig~~~~~--~~~~lp~~l~~f~~~~P~v~i~l~~~-~~~~~~~~L~~----g~~Dl~i~~~~~~~----~~~l- 158 (308) T PRK12683 91 DSGRLTVATTHTQ--ARYALPKVVRQFTEVFPKVHLALRQG-SPQEIAEMLLN----GEADIGIATEALDR----EPDL- 158 (308) T ss_pred CCCEEEECCCCHH--HHHCCCHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHHC----CCCCEEECCCCCCC----CCCC- T ss_conf 7762420013201--12016846689998789966999607-86999999977----99778806667788----8771- Q ss_pred CCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCC--CCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHH Q ss_conf 61367700021000124321153243100011001--1124587645652120006828985047653036789999871 Q gi|254780561|r 107 AKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDK--RQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKI 184 (338) Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~--~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (338) . ..+.......++..+ ......+-++.++.+.. ..+...+ +......-...... T Consensus 159 ----------~---------~~~l~~~~~~lv~~~~hpla~~~~v~l~dl~~~p----~I~~~~~-~~~r~~i~~~~~~~ 214 (308) T PRK12683 159 ----------V---------SFPYYSWHHVVVVPKGHPLTGRENLTLEALAEYP----IITYDQG-FTGRSHIDQAFAEA 214 (308) T ss_pred ----------E---------EEEECCCCEEEEECCCCCCCCCCCCCHHHHCCCC----EEECCCC-CCHHHHHHHHHHHC T ss_conf ----------6---------8761245158984699953369999999985999----8961899-82999999999977 Q ss_pred CCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCC Q ss_conf 77313789877762884683084789888622586569995044999998507997378831698088886424337899 Q gi|254780561|r 185 YGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSK 264 (338) Q Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~ 264 (338) + +..+...-..+......++..|-. ++...........+ ..+..+...+-.....-+++..|+.. T Consensus 215 g----------~~p~~~~e~~~~~~i~~~V~~g~G---i~ilp~~~~~~~~~--~~lv~l~~~~~~~~~~~~i~~rk~~~ 279 (308) T PRK12683 215 G----------AVPDIVLTALDADVIKTYVELGMG---VGIVAAMAYDPQRD--TGLVALDTQHLFEANTTRVALRRGAY 279 (308) T ss_pred C----------CCCCEEEEECCHHHHHHHHHHCCC---HHHHHHHHHHHHHC--CCEEEEECCCCCCCCEEEEEECCCCC T ss_conf 9----------998579999989999999997984---70789999878860--99799989788766179999909685 Q ss_pred CHHHHHHHHHHHCCHHHHHHH Q ss_conf 989999999996399999998 Q gi|254780561|r 265 QPQLAQEFMQFMISPSFQRIL 285 (338) Q Consensus 265 n~~~A~~Fi~flls~e~Q~~~ 285 (338) -+.+|++||+++...=.++.+ T Consensus 280 Ls~~a~~Fie~l~~~l~~~~~ 300 (308) T PRK12683 280 LRGYAYRFIEMFAPHLSEAEI 300 (308) T ss_pred CCHHHHHHHHHHHHHHCHHHH T ss_conf 799999999998886298999 No 112 >PRK10341 DNA-binding transcriptional activator TdcA; Provisional Probab=55.68 E-value=13 Score=16.02 Aligned_cols=199 Identities=11% Similarity=0.089 Sum_probs=91.6 Q ss_pred CCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCC Q ss_conf 885999978877851223388999999884-8879999728868999788753126641011013337899988752126 Q gi|254780561|r 29 KPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFA 107 (338) Q Consensus 29 ~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~ 107 (338) ...+.+.... ......++.++..|.++| +|+|++... ...++...+.. ...|+.++...+. T Consensus 96 ~~~l~ig~~~--~~~~~~lp~~l~~f~~~~P~v~i~l~~~-~~~~~~~~l~~----g~~D~~i~~~~~~----------- 157 (312) T PRK10341 96 VVDVSFGFPS--LIGFTFMSDMINKFKEVFPKAQVSMYEA-QLSSFLPAIRD----GRLDFAIGTLSNE----------- 157 (312) T ss_pred CEEEECCCCH--HHHHHHHHHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHHC----CCCEEEEEECCCC----------- T ss_conf 3257435406--8899999999999999888977999989-99999999966----9861999743687----------- Q ss_pred CCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCC Q ss_conf 13677000210001243211532431000110011124587645652120006828985047653036789999871773 Q gi|254780561|r 108 KSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGD 187 (338) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (338) .... ... ..+.......+...+......+.++.+|.+..+ .+.... .............. T Consensus 158 -~~~~--~~~---------~~~l~~~~~~lv~~~~~p~~~~~~l~~L~~~~~----il~~~~-~~~~~~~~~~~~~~--- 217 (312) T PRK10341 158 -MKLQ--DLH---------VEPLFESEFVLVASKSRTCTGTTTLESLKNEQW----VLPQTN-MGYYSELLTTLQRN--- 217 (312) T ss_pred -CCCC--CEE---------EEEEECCCEEEEECCCCCCCCCCCHHHHCCCCE----EEECCC-CCHHHHHHHHHHHC--- T ss_conf -7778--749---------999512428999767542348979899807986----860699-85899999999976--- Q ss_pred CHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCCCHH Q ss_conf 13789877762884683084789888622586569995044999998507997378831698088886424337899989 Q gi|254780561|r 188 NSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQPQ 267 (338) Q Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~ 267 (338) ....+.....++.......+..|.. +.+. ........ ....+..+..++..+....+++-.|+..-.. T Consensus 218 -------g~~~~~~~~~~s~~~i~~lv~~g~g-i~~l--P~~~~~~~--~~~~L~~lpl~~~~~~~~~~lv~~k~~~ls~ 285 (312) T PRK10341 218 -------GISIENIVKTDSVVTIYNLVLNADF-LTVI--PCDMTSPF--GSNQFITIPIKETLPVARYAAVWSKNYRIKK 285 (312) T ss_pred -------CCCCCCEEEECHHHHHHHHHHHCCE-EEEE--HHHHHHHH--CCCCEEEEECCCCCCEEEEEEEEECCCCCCH T ss_conf -------9998816998909999999997995-8972--69999663--1898999989998851189999989398899 Q ss_pred HHHHHHHHHC Q ss_conf 9999999963 Q gi|254780561|r 268 LAQEFMQFMI 277 (338) Q Consensus 268 ~A~~Fi~fll 277 (338) +|..||+++- T Consensus 286 a~~~fie~~k 295 (312) T PRK10341 286 AASVLVELAK 295 (312) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 113 >TIGR01729 taurine_ABC_bnd taurine ABC transporter, periplasmic binding protein; InterPro: IPR010068 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry identifies a cluster of ABC transporter periplasmic substrate binding proteins, apparently specific for taurine. Transport systems for taurine (NH2-CH2-CH2-SO3H), sulphonates, and sulphate esters import sulphur when sulphate levels are low. The most closely related proteins outside this family are putative aliphatic sulphonate binding proteins.. Probab=55.32 E-value=10 Score=16.56 Aligned_cols=196 Identities=13% Similarity=0.148 Sum_probs=98.2 Q ss_pred HHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEE-CCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCC Q ss_conf 899999988488799997288689997887531266410110-1333789998875212613677000210001243211 Q gi|254780561|r 49 KIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVL-GFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIF 127 (338) Q Consensus 49 ~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 127 (338) .-+++|+|++|.+|.+..++|+.++..-+..+ ..++-. +.++-......+-.+....-.. T Consensus 19 grfda~aKe~gaTiDWRkFdSGadi~~AlASG----~V~IG~~GSsplaaAASr~vpie~f~~~~--------------- 79 (304) T TIGR01729 19 GRFDAYAKEAGATIDWRKFDSGADIVAALASG----DVDIGVVGSSPLAAAASREVPIEVFLVVD--------------- 79 (304) T ss_pred CCHHHHHHCCCCEEEEEEECCHHHHHHHHHCC----CCCCCCCCCCHHHHHHHCCCCEEEEEEEC--------------- T ss_conf 62224543048802001104416555565338----63322224707899872688300201001--------------- Q ss_pred CCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEECCC Q ss_conf 53243100011001112458764565212000682898504765303678999987177313789877762884683084 Q gi|254780561|r 128 VPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIATKTATVTKGW 207 (338) Q Consensus 128 ~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (338) .+.--...+........-+.|- ...++..+-.. +....++.++.+...-+.-+ +..+.=.. T Consensus 80 --------~iG~SEALVar~g~GIeKpeDL-~GK~vaVPFvS--TtHysLLaaLk~vw~~dp~~--------V~IlNl~P 140 (304) T TIGR01729 80 --------KIGKSEALVAREGAGIEKPEDL-KGKKVAVPFVS--TTHYSLLAALKHVWKVDPKE--------VEILNLKP 140 (304) T ss_pred --------CCCCCCEEEEECCCCCCCHHHH-CCCEEECCCCC--HHHHHHHHHHHHHHCCCCCE--------EEEEECCC T ss_conf --------0287200245436788766671-78845027511--34589999888762678760--------57861796 Q ss_pred HHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCC--C--EEEEEEEEEEECC--CCCHHHHHHHHHHHCCHHH Q ss_conf 78988862258656999504499999850799737883169--8--0888864243378--9998999999999639999 Q gi|254780561|r 208 TESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSE--G--HYLQIEVAAQLVR--SKQPQLAQEFMQFMISPSF 281 (338) Q Consensus 208 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e--G--~~~~~~~~~i~k~--a~n~~~A~~Fi~flls~e~ 281 (338) .+....+.+|++|....| ......+.+.+.-+. -.| | .....|.+++-|+ -+||+.-+.|.+=+... - T Consensus 141 P~I~AAwqRGDIDaAYVW--~PAl~~l~k~GKV~~---DSeqvgawgaPTfD~wvvrkdfAeknPe~v~aF~Kv~~dA-y 214 (304) T TIGR01729 141 PEIVAAWQRGDIDAAYVW--DPALSELKKSGKVVS---DSEQVGAWGAPTFDAWVVRKDFAEKNPEVVKAFVKVLADA-Y 214 (304) T ss_pred HHHHHHHHCCCCCCCCCC--HHHHHHHHHCCCEEE---CHHHHHHCCCCCCCEEEEECHHHHHCHHHHHHHHHHHHHH-H T ss_conf 579987543882621037--456887753596685---2578741368731024652145430707899999889985-0 Q ss_pred HHHHHHC Q ss_conf 9998850 Q gi|254780561|r 282 QRILPTT 288 (338) Q Consensus 282 Q~~~a~~ 288 (338) |..+|.. T Consensus 215 a~Y~Anp 221 (304) T TIGR01729 215 ADYLANP 221 (304) T ss_pred HHHCCCC T ss_conf 1002388 No 114 >PRK09534 btuF corrinoid ABC transporter substrate-binding protein; Reviewed Probab=54.62 E-value=13 Score=15.91 Aligned_cols=33 Identities=6% Similarity=-0.029 Sum_probs=17.3 Q ss_pred CCEEEEEEEEEEECCCCCHHHHHHHHHHHCCHH Q ss_conf 980888864243378999899999999963999 Q gi|254780561|r 248 EGHYLQIEVAAQLVRSKQPQLAQEFMQFMISPS 280 (338) Q Consensus 248 eG~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e 280 (338) +|-+..++.-.+...+|.--.+.+=|.-.+-|| T Consensus 276 n~rv~~vd~d~i~RpGPRiv~glE~lA~~iHp~ 308 (364) T PRK09534 276 TGNVVTVNTNHINQPAPRIVEPMARMANAFHNT 308 (364) T ss_pred CCCEEEECCCCCCCCCCHHHHHHHHHHHHHCCC T ss_conf 483899676445799871999999999863823 No 115 >pfam09358 UBA_e1_C Ubiquitin-activating enzyme e1 C-terminal domain. This presumed domain found at the C-terminus of Ubiquitin-activating enzyme e1 proteins is functionally uncharacterized. Probab=54.05 E-value=13 Score=15.86 Aligned_cols=39 Identities=10% Similarity=0.152 Sum_probs=27.4 Q ss_pred EEEEEECCCCCCC-HHHHHHHHHHHHHHHCCEEEEEECCC Q ss_conf 5999978877851-22338899999988488799997288 Q gi|254780561|r 31 VLTVYTYNSFVAD-EGAGPKIKQAFERKCNCELKLIGLSD 69 (338) Q Consensus 31 ~l~v~~~~~~~~~-~~~~~~i~~~Fek~~gikV~~~~~~~ 69 (338) .+....|+.+..+ ..-++.+++.|+++||++|..+..|. T Consensus 28 ~~~~T~WDr~~v~~d~TL~~li~~f~~~~~lev~mis~g~ 67 (124) T pfam09358 28 DKEFTIWDRWEVNGDLTLQELIDYFEEKYGLEVTMLSQGV 67 (124) T ss_pred CEEEEEEEEEEECCCCCHHHHHHHHHHHHCCEEEEEEECC T ss_conf 7677457789987995099999999998196699997289 No 116 >TIGR02896 spore_III_AF stage III sporulation protein AF; InterPro: IPR014245 This family represents the stage III sporulation protein AF (SpoIIIAF) of the bacterial endospore formation program, which exists in some but not all members of the Firmicutes (formerly called low-GC Gram-positives). The C-terminal region of these proteins is poorly conserved. . Probab=54.03 E-value=13 Score=15.89 Aligned_cols=21 Identities=29% Similarity=0.681 Sum_probs=18.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 902799999999999999861 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYC 21 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~ 21 (338) |||+.++++|+++++..+... T Consensus 29 lkKYvk~v~GL~L~~viL~Pi 49 (113) T TIGR02896 29 LKKYVKFVVGLILIVVILNPI 49 (113) T ss_pred CCHHHHHHHHHHHHHHHHHHH T ss_conf 517999999999999999899 No 117 >PRK09915 putative outer membrane efflux protein MdtP; Provisional Probab=52.93 E-value=11 Score=16.26 Aligned_cols=24 Identities=25% Similarity=0.412 Sum_probs=13.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 90279999999999999986146742 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGL 26 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~ 26 (338) |||+ ++.++++++.++++|+.... T Consensus 6 ~~r~--~~~~~l~~~~lLagC~~~p~ 29 (488) T PRK09915 6 LSRL--LLCSILGSTTLISGCALVRK 29 (488) T ss_pred HHHH--HHHHHHHHHHHHCCCCCCCC T ss_conf 7699--99999999999752668999 No 118 >COG2358 Imp TRAP-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=52.06 E-value=15 Score=15.67 Aligned_cols=248 Identities=13% Similarity=0.019 Sum_probs=104.4 Q ss_pred HHHHHHHHHHCCC-CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHH-HHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEE Q ss_conf 9999999861467-42348859999788778512233889999998-848879999728868999788753126641011 Q gi|254780561|r 12 MMITGVISYCTLD-GLPAKPVLTVYTYNSFVADEGAGPKIKQAFER-KCNCELKLIGLSDGVALLNKLRMEGENSAADIV 89 (338) Q Consensus 12 ~~~~~~~~~~~~~-~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek-~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~ 89 (338) +.++++..+|.+. ..++...+++.|.+....=..+...|.+.-.+ ..||+++.+..+++-+.++.+.. ...|.- T Consensus 7 ~~la~~~~~~~~~~~~~~~~~itigTG~~~G~YY~ig~~ia~~~~~~~~~i~~~v~~tggSv~Nl~~i~~----Ge~d~a 82 (321) T COG2358 7 AALAAAGAGSVSTGPAAEPKFITIGTGSTGGVYYPIGGGLAQLLNKDEKGIECSVVPTGGSVENLKLLAS----GEADLA 82 (321) T ss_pred HHHHHHHHCCCCCCCCCCCEEEEEEECCCCCEEEEHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHC----CCCCHH T ss_conf 8989987344565666676379996069986223037899999851477818999606542999986753----762535 Q ss_pred ECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECC Q ss_conf 01333789998875212613677000210001243211532431000110011124587645652120006828985047 Q gi|254780561|r 90 LGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPR 169 (338) Q Consensus 90 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~ 169 (338) +..+........-.+.+.-.... .+...... .......++.+.+ ...++|.||. ||.....+. T Consensus 83 lvq~d~a~~ay~G~g~f~~~~~~-~~lr~v~~--------lype~~~vv~r~d---~~Ikti~DL~-----GKrV~iG~~ 145 (321) T COG2358 83 LVQSDVAYEAYNGTGSFEGKGKD-ENLRAVAA--------LYPEPFHVVTRKD---AGIKTIADLK-----GKRVAIGPP 145 (321) T ss_pred HHHHHHHHHHHHCCCCCCCCCCC-CCHHHHEE--------CCCCEEEEEEECC---CCCCEEHHCC-----CCEEEECCC T ss_conf 66678999987185311456655-14332213--------0542089999668---8965601038-----988862699 Q ss_pred CCCCHHHHHHHHHHHCCCCHHHHH-H---------HHHH--CCCEE-ECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHH Q ss_conf 653036789999871773137898-7---------7762--88468-308478988862258656999504499999850 Q gi|254780561|r 170 TSTLGLGLLLWIQKIYGDNSAQVW-K---------KIAT--KTATV-TKGWTESYGFFLKGESDFVLSYSTSPGFYLLNY 236 (338) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~-~---------~~~~--~~~~~-~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 236 (338) .|+........+...+........ . .+.. -+..+ ..+..........-..++.+.-..+.....+.+ T Consensus 146 gSgt~~~a~~il~a~Gi~~~~~~~~~~~~~a~~~~~l~~g~iDA~~~~~G~p~~ai~el~~~~~i~lv~i~~e~i~kl~~ 225 (321) T COG2358 146 GSGTEATARQILEALGITYDDYELDLGLGDAESADALKNGTIDAAFYVAGVPNPAISELATTCDIVLVPISGEEIDKLEE 225 (321) T ss_pred CCCCHHHHHHHHHHCCCCCCCHHHHHHCCCHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHCCEEEEECCHHHHHHHHH T ss_conf 97408899999997399976245666337306578763796328998158888319999861775899378799999974 Q ss_pred CCCCE-EEEECCC------C--EEEEEEEEEEECCCCCHHHHHHHHHHHCCHH Q ss_conf 79973-7883169------8--0888864243378999899999999963999 Q gi|254780561|r 237 GQDDY-VAALFSE------G--HYLQIEVAAQLVRSKQPQLAQEFMQFMISPS 280 (338) Q Consensus 237 ~~~~~-~~~~~~e------G--~~~~~~~~~i~k~a~n~~~A~~Fi~flls~e 280 (338) ..+-. ....|.. + ..+.+..+-+.+..=+.|.+++++.-+..-. T Consensus 226 ~~~~y~~~~IpagtY~g~~~~i~tv~v~a~lvt~~~v~ed~vY~~tKa~fen~ 278 (321) T COG2358 226 KYPYYAKATIPAGTYPGIDEDIPTVAVAAILVTSDDVSEDLVYKLTKALFENL 278 (321) T ss_pred CCCCCEEEECCCCCCCCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHHHH T ss_conf 24664135414545789888764157878998338998899999999999736 No 119 >TIGR02747 TraV type IV conjugative transfer system protein TraV; InterPro: IPR014118 This entry includes TraV, which is a component of conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK , , . The alignment contains three conserved cysteines in the N-terminal half.. Probab=51.77 E-value=10 Score=16.52 Aligned_cols=22 Identities=9% Similarity=0.274 Sum_probs=10.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 90279999999999999986146 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL 23 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~ 23 (338) |+|+.. ++.+.+++.+|+||+. T Consensus 1 Mmr~~~-L~~~~~~~f~LtGCsa 22 (174) T TIGR02747 1 MMRLKV-LLLLACVAFLLTGCSA 22 (174) T ss_pred CCHHHH-HHHHHHHHHHHHHHHH T ss_conf 906789-9999999872002577 No 120 >PRK12679 cbl transcriptional regulator Cbl; Reviewed Probab=51.63 E-value=15 Score=15.63 Aligned_cols=204 Identities=12% Similarity=0.042 Sum_probs=101.4 Q ss_pred CCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHH Q ss_conf 234885999978877851223388999999884-8879999728868999788753126641011013337899988752 Q gi|254780561|r 26 LPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTG 104 (338) Q Consensus 26 ~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~ 104 (338) ....++|+|.+..+ .-...++.++..|.++| +|++++.. +++.++...+.. ...|+.+..... T Consensus 89 ~~~~G~l~Ig~~~t--~a~~~LP~~l~~F~~~~P~v~l~l~~-~~~~~i~~~l~~----g~~Dl~i~~~~~--------- 152 (316) T PRK12679 89 NDTSGVLTIATTHT--QARYSLPEVIKAFRELFPEVRLELIQ-GTPQEIATLLQN----GEADIGIASERL--------- 152 (316) T ss_pred CCCCCEEEEEEEEC--CHHCCCCHHHHHHHHHCCCCEEEEEE-CCHHHHHHHHHC----CCCCEEECCCCC--------- T ss_conf 88776499998503--20103819999999868995389984-787999999987----998852224677--------- Q ss_pred CCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCC--CCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHH Q ss_conf 12613677000210001243211532431000110011--1245876456521200068289850476530367899998 Q gi|254780561|r 105 LFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKR--QITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQ 182 (338) Q Consensus 105 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~--~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (338) .. .+ .....|.......++..++ .....+-++.+|.+.. +.+...+ +......-.+.. T Consensus 153 -~~--~p------------~l~~~p~~~~~~~lvvp~~HpLa~~~~vsl~~L~~~p----lI~~~~g-~~~R~~id~~f~ 212 (316) T PRK12679 153 -SN--DP------------QLVAFPWFRWHHSLLVPLDHPLTQITPLTLESIAKWP----LITYRQG-ITGRSRIDDAFA 212 (316) T ss_pred -CC--CC------------CCEEEEEEECCEEEEECCCCCCCCCCCCCHHHHCCCC----EEEECCC-CHHHHHHHHHHH T ss_conf -89--97------------6079980553413560189951248998999987999----8950899-849999999999 Q ss_pred HHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEE-EEEEEEEEEC Q ss_conf 717731378987776288468308478988862258656999504499999850799737883169808-8886424337 Q gi|254780561|r 183 KIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHY-LQIEVAAQLV 261 (338) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~-~~~~~~~i~k 261 (338) ..+ ...+...-..+......++..|-. +++ -.... .......--+..+.++.+ ...-++++.| T Consensus 213 ~~G----------~~p~i~~e~~~~~~i~~~V~~GlG-vai--lp~~a---~~~~~~~~l~~~~~~~~~~~~~~~~~~~r 276 (316) T PRK12679 213 RKG----------LLADIVLSAQDSDVIKTYVALGLG-IGL--VAEQS---SGEQEEKNLIRLDTRHLFDANTVWLGLKR 276 (316) T ss_pred HCC----------CCCCEEEEECCHHHHHHHHHHCCC-EEE--CHHHH---CCCCCCCCEEEECCCCCCCCCEEEEEEEC T ss_conf 779----------999779999989999999998990-997--01445---07546898899326346766538999967 Q ss_pred CCCCHHHHHHHHHHH---CCHHH Q ss_conf 899989999999996---39999 Q gi|254780561|r 262 RSKQPQLAQEFMQFM---ISPSF 281 (338) Q Consensus 262 ~a~n~~~A~~Fi~fl---ls~e~ 281 (338) +.--...+++||+++ ++.|. T Consensus 277 ~~~l~~~~~~Fi~~~~~~l~~~~ 299 (316) T PRK12679 277 GQLQRNYVWRFLELCNAGLSVED 299 (316) T ss_pred CCCCHHHHHHHHHHHHCCCCHHH T ss_conf 87556999999997520289999 No 121 >PRK09791 putative DNA-binding transcriptional regulator; Provisional Probab=51.00 E-value=15 Score=15.57 Aligned_cols=203 Identities=11% Similarity=0.080 Sum_probs=96.1 Q ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 34885999978877851223388999999884-88799997288689997887531266410110133378999887521 Q gi|254780561|r 27 PAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 27 ~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) ...++++|....+ .....++.++..|.++| +|+|++.. ++..++..++.. ...|+.+..... T Consensus 92 ~~~G~l~Ig~~~~--~~~~~lp~~l~~f~~~~P~v~i~i~~-~~~~~~~~~l~~----g~~D~~i~~~~~---------- 154 (302) T PRK09791 92 QLAGQINIGMGAS--ISRSLMPAVISRFHQQHPQVKVRIME-GQLVSMINELRQ----GELDFTINTYYQ---------- 154 (302) T ss_pred CCCCEEEEEECHH--HHHHHHHHHHHHHHHHCCCCEEEEEE-CCHHHHHHHHHC----CCCCEEEECCCC---------- T ss_conf 7663399986268--89871199999999988797799998-999999999987----997789965678---------- Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHC Q ss_conf 26136770002100012432115324310001100111245876456521200068289850476530367899998717 Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIY 185 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (338) .+.+. .. ...+........+..++.......++.++.+.. ..+..+...... ......... T Consensus 155 -~~~~~---~l---------~~~~l~~~~~~~v~~~~hpl~~~~~l~~l~~~~----~~~~~~~~~~~~-~~~~~~~~~- 215 (302) T PRK09791 155 -GPYDH---EF---------TFEKLLEKQFAIFCRPGHPAIGARSIKQLLDYS----WTMPTPHGSYYK-QLSELLDDQ- 215 (302) T ss_pred -CCCCC---CE---------EEEEEEEEEEEEEECCCCCCCCCCCHHHHHCCC----EEEECCCCCHHH-HHHHHHHHC- T ss_conf -87887---64---------799977401599983898435798999984799----387579997999-999999967- Q ss_pred CCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCCC Q ss_conf 73137898777628846830847898886225865699950449999985079973788316980888864243378999 Q gi|254780561|r 186 GDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQ 265 (338) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n 265 (338) ....+......+..........|.. +++. ......... ....+..+...+......-+++-.++... T Consensus 216 ---------g~~p~~~~~~~~~~~~~~lv~~g~g-v~il--p~~~~~~~~-~~~~lv~lpl~~~~~~~~~~lv~r~~~~~ 282 (302) T PRK09791 216 ---------AQTPQVGVVCETFSACISLVAKSDF-LSIL--PEEMGCDPL-HGQGLVMLPVSEILPKAAYYLIQRRDSRQ 282 (302) T ss_pred ---------CCCCCEEEEECCHHHHHHHHHHCCE-EEEE--CHHHHHHHH-HCCCEEEEECCCCCCCEEEEEEEECCCCC T ss_conf ---------9998569999929999999997896-7963--199998787-48999999799988600899999890988 Q ss_pred HHHHHHHHHHHCC Q ss_conf 8999999999639 Q gi|254780561|r 266 PQLAQEFMQFMIS 278 (338) Q Consensus 266 ~~~A~~Fi~flls 278 (338) ..++..||+++-. T Consensus 283 spa~~~fi~~lr~ 295 (302) T PRK09791 283 TPLTASLITQFRR 295 (302) T ss_pred CHHHHHHHHHHHH T ss_conf 9999999999999 No 122 >PRK09408 ompX outer membrane protein X; Provisional Probab=50.77 E-value=15 Score=15.55 Aligned_cols=52 Identities=21% Similarity=0.096 Sum_probs=25.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHH-HHCCEEEEEE Q ss_conf 90279999999999999986146742348859999788778512233889999998-8488799997 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFER-KCNCELKLIG 66 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek-~~gikV~~~~ 66 (338) |||++.+.+.++ .++..+..+.+++.|+.+-=..+ =.+.|.. -.|+.+++.. T Consensus 1 mkk~~~~s~~~~----~~~~~a~~a~A~~sTvS~GYAQs----------~~~~~~n~l~G~NlKYrY 53 (171) T PRK09408 1 MKKIACLSALAA----VLAFTAGTAVAATSTVTGGYAQS----------DAQGVANKMGGFNLKYRY 53 (171) T ss_pred CCHHHHHHHHHH----HHHHHHHHHHCCCCEEEEEEEEE----------CCCCCCCCCCCEEEEEEE T ss_conf 922478999999----98856666642576698788642----------001346786744888766 No 123 >PRK11627 hypothetical protein; Provisional Probab=50.56 E-value=9 Score=16.87 Aligned_cols=26 Identities=19% Similarity=0.241 Sum_probs=14.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 9027999999999999998614674234885999 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTV 34 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v 34 (338) |||++.. ++++.+++||+.+ .++|++ T Consensus 2 mkk~~~~----l~a~~lL~gCas~----p~~l~l 27 (192) T PRK11627 2 LKKILFP----LVALFMLAGCATP----PTTLEV 27 (192) T ss_pred HHHHHHH----HHHHHHHHHHCCC----CCEEEE T ss_conf 2879999----9999999860699----763786 No 124 >TIGR03502 lipase_Pla1_cef extracellular lipase, Pla-1/cef family. Members of this protein family are bacterial lipoproteins largely from the Gammaproteobacteria. Characterized members are expressed in extracellularly and have esterase activity. Members include the lipase Pla-1 from Aeromonas hydrophila (AF092033) and CHO cell elongation factor (cef) from Vibrio hollisae Probab=50.21 E-value=14 Score=15.81 Aligned_cols=22 Identities=18% Similarity=0.131 Sum_probs=10.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |||.+ +..+++.+..+++|+++ T Consensus 1 MKk~~--l~~~iasal~LagCGg~ 22 (792) T TIGR03502 1 MKKLL--LSLAIASALGLAGCGDN 22 (792) T ss_pred CCHHH--HHHHHHHHHHHHCCCCC T ss_conf 93378--99999998750045788 No 125 >PRK10270 hypothetical protein; Provisional Probab=49.94 E-value=16 Score=15.47 Aligned_cols=21 Identities=24% Similarity=0.428 Sum_probs=13.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 902799999999999999861 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYC 21 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~ 21 (338) |||++.+++.+++++++.++. T Consensus 1 Mkk~~~~~~~l~~~l~i~~g~ 21 (340) T PRK10270 1 MKKVLLIILLLLVVLGIAAGV 21 (340) T ss_pred CCHHHHHHHHHHHHHHHHHHH T ss_conf 902899999999999999999 No 126 >PRK11548 hypothetical protein; Provisional Probab=47.67 E-value=16 Score=15.36 Aligned_cols=23 Identities=9% Similarity=0.192 Sum_probs=14.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) || +..+.++.++++.++++|+.. T Consensus 1 m~-~k~l~~a~ll~~lllsgCS~~ 23 (113) T PRK11548 1 MR-CKTLTAAAAVLLMLTAGCSTL 23 (113) T ss_pred CC-HHHHHHHHHHHHHHHCCCCCC T ss_conf 92-578899999999998011578 No 127 >COG4851 CamS Protein involved in sex pheromone biosynthesis [General function prediction only] Probab=47.15 E-value=15 Score=15.59 Aligned_cols=23 Identities=17% Similarity=0.322 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHCCC Q ss_conf 8999999999639999999885045 Q gi|254780561|r 266 PQLAQEFMQFMISPSFQRILPTTNW 290 (338) Q Consensus 266 ~~~A~~Fi~flls~e~Q~~~a~~~~ 290 (338) ++.---|++| ..+.|..+...++ T Consensus 281 ~ndndnF~rF--k~~iq~yf~nytg 303 (382) T COG4851 281 RNDNDNFLRF--KEEIQGYFPNYTG 303 (382) T ss_pred CCCHHHHHHH--HHHHHHHHHCCCE T ss_conf 6632789999--9999875102010 No 128 >PRK10477 outer membrane lipoprotein Blc; Provisional Probab=46.39 E-value=16 Score=15.49 Aligned_cols=66 Identities=12% Similarity=0.155 Sum_probs=26.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCC-CEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECC Q ss_conf 90279999999999999986146742348-85999978877851223388999999884-8879999728 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAK-PVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLS 68 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~ 68 (338) || ++.++++++++.+|++|.+ +...+. +++.=+.-.-|-..|..+..+-..||+.. +++.+|..-. T Consensus 1 ~~-~~~~~~~~~~~~~~~~c~~-~~~p~~~~~V~~~Dl~RYmG~WYEIAr~P~~fe~g~~~~ta~Y~l~~ 68 (177) T PRK10477 1 MR-LLPVVAAVTAAFLVVACSS-PTPPKGVTVVNNFDAKRYLGTWYEIARFDHRFERGLEKVTATYSLRD 68 (177) T ss_pred CC-HHHHHHHHHHHHHHHHCCC-CCCCCCCCCCCCCCHHHCCEEEEEEEECCCHHHCCCCCEEEEEEECC T ss_conf 94-0789999999999976469-99999884167347446488028997048668878713113899879 No 129 >COG3338 Cah Carbonic anhydrase [Inorganic ion transport and metabolism] Probab=46.19 E-value=11 Score=16.36 Aligned_cols=50 Identities=14% Similarity=0.108 Sum_probs=23.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9027999999999999998614674234885999978877851223388999999884 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC 58 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~ 58 (338) |||..-+ ++++.+++.||..+.+++ +-|.+.+-..+ +.+.++.++|+.-. T Consensus 1 Mkk~~l~---~~~L~~~~~~~~ts~a~~----~~WgY~g~~GP-EhWg~L~~df~~C~ 50 (250) T COG3338 1 MKKRFLI---VLALGASLIGAETSPAAE----VHWGYTGKEGP-EHWGELSPDFEACA 50 (250) T ss_pred CCEEEEH---HHHHHHHHHHHHCCCCCC----CCCCCCCCCCC-HHHHHCCCHHHHHH T ss_conf 9423207---898766454200264301----01465688580-56654084245665 No 130 >cd00134 PBPb Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD. Probab=45.24 E-value=19 Score=15.04 Aligned_cols=187 Identities=17% Similarity=0.137 Sum_probs=93.8 Q ss_pred HHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCCCCCCCCHHHCCCCCCCCC Q ss_conf 23388999999884887999972886899978875312664101101333789998875212613677000210001243 Q gi|254780561|r 45 GAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLFAKSNIDASQLKLPIKWDD 124 (338) Q Consensus 45 ~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 124 (338) ++.-++++++.++.|++++++... ....+..+. .+.+|++.........+. .. + T Consensus 23 G~~~dl~~~i~~~lg~~~~~~~~~-~~~~~~~l~----~g~~D~~~~~~~~t~~R~--~~-~------------------ 76 (218) T cd00134 23 GFDVDLAKAIAKELGVKVKFVEVD-WDGLITALK----SGKVDLIAAGMTITPERA--KQ-V------------------ 76 (218) T ss_pred EEHHHHHHHHHHHHCCCEEEEECC-HHHHHHHHH----CCCCCEEEEEECCCHHHH--CC-C------------------ T ss_conf 839999999999969966999758-899999985----799776888502585661--41-4------------------ Q ss_pred CCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCHHHHHHHHH-HCCCEE Q ss_conf 2115324310001100111245876456521200068289850476530367899998717731378987776-288468 Q gi|254780561|r 125 DIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIYGDNSAQVWKKIA-TKTATV 203 (338) Q Consensus 125 ~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 203 (338) .+..|+......+.++++ ....+..+|.. .+++.. ..+... +.+.... ...... T Consensus 77 ~fs~p~~~~~~~~~~~~~---~~~~~~~dl~g----~~i~v~--~g~~~~----------------~~l~~~~~~~~~~~ 131 (218) T cd00134 77 DFSDPYYKSGQVILVKKG---SPIKSVKDLKG----KKVAVQ--KGSTAE----------------KYLKKALPEAKVVS 131 (218) T ss_pred CCCCCEECCCEEEEEECC---CCCCCHHHHCC----CEEEEE--CCCHHH----------------HHHHHHHCCCEEEE T ss_conf 557644415707999868---99999899779----626795--898899----------------99997315754999 Q ss_pred ECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCCCHHHHHHHHHHHC----CH Q ss_conf 30847898886225865699950449999985079973788316980888864243378999899999999963----99 Q gi|254780561|r 204 TKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQPQLAQEFMQFMI----SP 279 (338) Q Consensus 204 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n~~~A~~Fi~fll----s~ 279 (338) ..+..+...++..|+++..+....... ............+.+.. ...+..+++..+..++++..+|=+.|. +- T Consensus 132 ~~~~~~~~~~l~~G~vD~~v~~~~~~~-~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~l~~~~n~~l~~l~~~G 208 (218) T cd00134 132 YDDNAEALAALENGRADAVIVDEIALA-ALLKKHPPELKIVGPSI--DLEPLGFGVAVGKDNKELLDAVNKALKELRADG 208 (218) T ss_pred ECCHHHHHHHHHCCCCEEEEECHHHHH-HHHHHCCCCCEEECCCC--CCCCEEEEEEECCCCHHHHHHHHHHHHHHHHCH T ss_conf 699999999998599469997699999-99986799819963566--767348999974999999999999999998682 Q ss_pred HHHHHH Q ss_conf 999998 Q gi|254780561|r 280 SFQRIL 285 (338) Q Consensus 280 e~Q~~~ 285 (338) +.++++ T Consensus 209 ~~~~I~ 214 (218) T cd00134 209 ELKKIS 214 (218) T ss_pred HHHHHH T ss_conf 999998 No 131 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=44.84 E-value=19 Score=15.00 Aligned_cols=23 Identities=9% Similarity=0.157 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 79999999999999986146742 Q gi|254780561|r 4 FARIVVGIMMITGVISYCTLDGL 26 (338) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~ 26 (338) ..++.+.+++++.++++|+.... T Consensus 6 r~~~~l~~l~~~~~L~gC~~~~~ 28 (385) T PRK09578 6 RRRLALAALVAAFALAGCGKGDS 28 (385) T ss_pred HHHHHHHHHHHHHHHHCCCCCCC T ss_conf 79999999999999816799987 No 132 >COG4213 XylF ABC-type xylose transport system, periplasmic component [Carbohydrate transport and metabolism] Probab=44.61 E-value=19 Score=14.98 Aligned_cols=74 Identities=5% Similarity=0.047 Sum_probs=32.1 Q ss_pred CCCCCCCEEEEEECCCCCCCH--HHHHHHHHHHHHHHCCEEEEEECCCH-HHHHHHHHHHHCC-CCCEEEECCCHHHHHH Q ss_conf 742348859999788778512--23388999999884887999972886-8999788753126-6410110133378999 Q gi|254780561|r 24 DGLPAKPVLTVYTYNSFVADE--GAGPKIKQAFERKCNCELKLIGLSDG-VALLNKLRMEGEN-SAADIVLGFDNNLIDL 99 (338) Q Consensus 24 ~~~~~~~~l~v~~~~~~~~~~--~~~~~i~~~Fek~~gikV~~~~~~~~-~~~~~~~~a~~~~-~~~Dv~~~~~~~~~~~ 99 (338) .+..++..+- .+|.....++ .-...+.+++|+- |.||.+....+. +..++++...... ...-|+...+...+.. T Consensus 20 aa~~~d~~IG-is~~d~~~eRW~~D~~~~~~~~e~~-g~k~~~q~A~~~~~~Q~~qien~i~qg~~vlvi~a~d~~~l~~ 97 (341) T COG4213 20 AAAAKDGVIG-ISMPDLRSERWIKDRDAFVKKAEAL-GAKVDVQSADGDEEKQLAQIENMINQGVKVLVIGAIDGGVLSN 97 (341) T ss_pred HHHCCCCEEE-EECCCHHHHHHHHHHHHHHHHHHHC-CCHHHHHHHCCCHHHHHHHHHHHHHCCCCEEEEEECCCHHHHH T ss_conf 2312487599-9768735766653268899999862-4323034322571679999999873599789998135325899 No 133 >CHL00180 rbcR LysR transcriptional regulator; Provisional Probab=43.28 E-value=20 Score=14.86 Aligned_cols=206 Identities=12% Similarity=0.023 Sum_probs=98.3 Q ss_pred CCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCC Q ss_conf 4885999978877851223388999999884-887999972886899978875312664101101333789998875212 Q gi|254780561|r 28 AKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLF 106 (338) Q Consensus 28 ~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~ 106 (338) ..++|+|.+..+ .....++.++..|.++| +++|++.. +++.++...+.. ...|+.+..... T Consensus 93 ~~g~lrI~~~~s--~~~~~lp~~l~~f~~~~P~v~i~l~~-~~~~~~~~~l~~----~~~D~~i~~~~~----------- 154 (307) T CHL00180 93 QGGTLIIGASQT--TGTYLMPRLIGLFRQKYPQIAVQLQV-HSTRRIAWSVAN----GQIDLAIIGGEV----------- 154 (307) T ss_pred CCCCCCCCCCCH--HHHHHHHHHHHHHHHHCCCCCEEEEE-CCHHHHHHHHHC----CCEEEEEECCCC----------- T ss_conf 258601010406--66664358899999988899727897-799999999987----980099975778----------- Q ss_pred CCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCC--CCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHH Q ss_conf 61367700021000124321153243100011001--1124587645652120006828985047653036789999871 Q gi|254780561|r 107 AKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDK--RQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKI 184 (338) Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~--~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (338) .+.... ... ..++.......+..+ ......+-++.||.+.. ...... .+......-...... T Consensus 155 ~~~~~~--~l~---------~~~l~~~~~~lv~~~~hpla~~~~v~~~dL~~~~----~i~~~~-~~~~~~~~~~~~~~~ 218 (307) T CHL00180 155 PTELKD--NLQ---------VTPYAEDELALILPKSHPFAKLKKIQKEDLYRLR----FIALDS-QSTIRKVIDNILIQN 218 (307) T ss_pred CCCCCC--CEE---------EEEEECCCEEEEECCCCHHHCCCCCCHHHHHCCC----EEEECC-CCCHHHHHHHHHHHC T ss_conf 866678--549---------9994324279997389802239999999981798----487179-996899999999976 Q ss_pred CCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCC Q ss_conf 77313789877762884683084789888622586569995044999998507997378831698088886424337899 Q gi|254780561|r 185 YGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSK 264 (338) Q Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~ 264 (338) +... . .-+.....++......++..|.. +++.......... ....+..+..++......-+++..++.. T Consensus 219 ~~~~-~------~~~~~~~~~~~~~l~~~v~~g~G---ia~lP~~~v~~~~-~~g~l~~~~~~~~~~~r~i~lv~~~~r~ 287 (307) T CHL00180 219 GIDS-S------RFKIEMELNSIEAIKNAVQSGLG---AAFVSVSAIEKEL-ELGTLHWAKIENITIKRTLSIITNPNRY 287 (307) T ss_pred CCCC-C------CCEEEEEECCHHHHHHHHHHCCE---EEECCHHHHHHHH-HCCCEEEEECCCCCCEEEEEEEEECCCC T ss_conf 9984-5------54279997809999999993994---9982599999898-7798899978999850499999979597 Q ss_pred CHHHHHHHHHHHCC Q ss_conf 98999999999639 Q gi|254780561|r 265 QPQLAQEFMQFMIS 278 (338) Q Consensus 265 n~~~A~~Fi~flls 278 (338) -..+++.|++|+++ T Consensus 288 ~s~a~~~F~~~l~~ 301 (307) T CHL00180 288 RSKAAETFSKEILT 301 (307) T ss_pred CCHHHHHHHHHHHH T ss_conf 79999999999998 No 134 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=42.48 E-value=16 Score=15.35 Aligned_cols=26 Identities=23% Similarity=0.245 Sum_probs=12.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEE Q ss_conf 902799999999999999861467423488599 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLT 33 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 33 (338) |||++.+++. ++++..|+.+ ..++|+ T Consensus 1 Mkk~~~~~~~----~~~~~SCg~~---~~GELv 26 (449) T TIGR03525 1 MKKYLVFAAL----VVLVYSCGSG---DKGELV 26 (449) T ss_pred CCHHHHHHHH----HHHHHHCCCC---CCCEEE T ss_conf 9124899999----9987210589---971387 No 135 >pfam05481 Myco_19_kDa Mycobacterium 19 kDa lipoprotein antigen. Most of the antigens of Mycobacterium leprae and M. tuberculosis that have been identified are members of stress protein families, which are highly conserved throughout many diverse species. Of the M. leprae and M. tuberculosis antigens identified by monoclonal antibodies, all except the 18-kDa M. leprae antigen and the 19-kDa M. tuberculosis antigen are strongly cross-reactive between these two species and are coded within very similar genes. Probab=41.60 E-value=7.7 Score=17.27 Aligned_cols=24 Identities=21% Similarity=0.389 Sum_probs=15.8 Q ss_pred CCHH-HHHHHHHHHHHHHHHHHCCC Q ss_conf 9027-99999999999999861467 Q gi|254780561|r 1 MKKF-ARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~-~~~~~~~~~~~~~~~~~~~~ 24 (338) |||- +..+.+++++++.+++|+++ T Consensus 1 m~r~~~vav~G~AilaaglsGCSs~ 25 (160) T pfam05481 1 MRRKLLAAVAGVTILAAGASGCSSG 25 (160) T ss_pred CCCEEEEEEHHHHHHHHHHCCCCCC T ss_conf 9620477651478887775025589 No 136 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=41.14 E-value=22 Score=14.66 Aligned_cols=22 Identities=18% Similarity=0.387 Sum_probs=12.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |+|. +++.+++++.++++|... T Consensus 1 m~k~--~~~~~l~~~l~lsgC~s~ 22 (160) T PRK09967 1 MIKH--LVAPLIFTSLILTGCQSP 22 (160) T ss_pred CHHH--HHHHHHHHHHHHHHCCCC T ss_conf 9078--999999999999845899 No 137 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=41.04 E-value=15 Score=15.57 Aligned_cols=20 Identities=20% Similarity=0.575 Sum_probs=10.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |||+..+++.++. +++|+.. T Consensus 1 Mkk~~~l~~~~l~----LagCas~ 20 (26) T pfam08139 1 MKKLLLLLLALLL----LAGCASX 20 (26) T ss_pred CHHHHHHHHHHHH----HHCCCCC T ss_conf 9669999999999----8233200 No 138 >TIGR00439 ftsX putative protein insertion permease FtsX; InterPro: IPR004513 FtsX is an integral membrane protein encoded in the same operon as signal recognition particle docking protein FtsY and FtsE. FtsE is a hydrophilic nucleotide-binding protein that associates with the inner membrane by means of association with FtsX; FtsE mutants are viable only in high salt, supporting possible roles in cell division, as previously indicated, or in transport.; GO: 0007049 cell cycle, 0016021 integral to membrane. Probab=39.00 E-value=23 Score=14.46 Aligned_cols=61 Identities=18% Similarity=0.265 Sum_probs=39.4 Q ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCE-EEEEECCCHHHHHHHHHH-H---------HCCCCCEEEECC Q ss_conf 34885999978877851223388999999884887-999972886899978875-3---------126641011013 Q gi|254780561|r 27 PAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCE-LKLIGLSDGVALLNKLRM-E---------GENSAADIVLGF 92 (338) Q Consensus 27 ~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gik-V~~~~~~~~~~~~~~~~a-~---------~~~~~~Dv~~~~ 92 (338) -.++.+|||--. +.+++..+.+.+..+++-|++ |+|+.- ++-++.++. + ..||-+||++.. T Consensus 67 ypspqiTvYl~k--~l~~~~a~~Vv~~l~~~kgve~~~y~sr---ed~L~Ef~sWsgfg~~l~mLd~NPLPaV~iV~ 138 (314) T TIGR00439 67 YPSPQITVYLEK--ALAEEDADTVVSKLRRDKGVEKINYISR---EDALAEFKSWSGFGELLEMLDDNPLPAVVIVT 138 (314) T ss_pred CCCCCHHHHHHH--HHCHHHHHHHHHHHHHHCCCCHHHCCCH---HHHHHHHHCCCCCCHHHHHHCCCCCCCEEEEE T ss_conf 788403578775--0134468999999987406202101135---66778863346430056764178797368870 No 139 >pfam02402 Lysis_col Lysis protein. These small bacterial proteins are required for colicin release and partial cell lysis. This family contains lysis proteins for several different forms of colicin. Bacillus subtilis lytA has been included in this family, the similarity is not highly significant, however it is also a short protein, that is involved in secretion of other proteins (Bateman A pers. obs.). Probab=38.89 E-value=16 Score=15.47 Aligned_cols=20 Identities=40% Similarity=0.745 Sum_probs=11.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9027999999999999998614 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCT 22 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~ 22 (338) |||+.. ..+++++.++++|- T Consensus 1 Mkki~~--~~i~~~~~~L~aCQ 20 (46) T pfam02402 1 MKKILF--IGILLLTVLLSACQ 20 (46) T ss_pred CCEEEE--HHHHHHHHHHHHHH T ss_conf 924201--39999999999855 No 140 >PRK12680 transcriptional regulator CysB-like protein; Reviewed Probab=38.14 E-value=24 Score=14.38 Aligned_cols=200 Identities=15% Similarity=0.050 Sum_probs=96.0 Q ss_pred CCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHH Q ss_conf 234885999978877851223388999999884-8879999728868999788753126641011013337899988752 Q gi|254780561|r 26 LPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTG 104 (338) Q Consensus 26 ~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~ 104 (338) ....++|+|.+..+ .-...++.+++.|.++| +|++++.. +++.++...+.. ...|+.+....... T Consensus 89 ~~~~G~L~Ig~~~t--~a~~~LP~vi~~f~~~yP~V~l~l~~-~~~~~l~e~l~~----g~~Diai~~~~~~~------- 154 (327) T PRK12680 89 RESQGQLTLTTTHT--QARFVLPPAVAQIKQAYPQVSVHLQQ-AAESAALDLLGQ----GDADIAVVSTAGGE------- 154 (327) T ss_pred CCCCEEEEEEEECH--HHHHHHHHHHHHHHHHCCCCEEEEEE-CCCHHHHHHHHC----CCCCEEEECCCCCC------- T ss_conf 88733799998422--77775689999999868996799998-982889999978----99409994477779------- Q ss_pred CCCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCC---CCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHH Q ss_conf 126136770002100012432115324310001100111---24587645652120006828985047653036789999 Q gi|254780561|r 105 LFAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQ---ITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWI 181 (338) Q Consensus 105 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~---~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (338) +.. ....+...+........+- ....+.++.+|.+.. +.+...+.. .....-.+. T Consensus 155 ---~~~--------------~~~~pl~~~~~~vvvP~~HpL~~~~~~i~L~~La~~p----lI~~~~g~~-~r~~id~af 212 (327) T PRK12680 155 ---PSA--------------GIAVPLYRWRRLVVVPRGHALDTPRTAPDMAALAEHP----LISYDSSTR-PGSSLQRAF 212 (327) T ss_pred ---CCC--------------CEEEEEEECCEEEEEECCCCCCCCCCCCCHHHHHCCC----EEECCCCCC-HHHHHHHHH T ss_conf ---976--------------5476752235799997898100589999999980898----896389986-799999999 Q ss_pred HHHCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEEC Q ss_conf 87177313789877762884683084789888622586569995044999998507997378831698088886424337 Q gi|254780561|r 182 QKIYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLV 261 (338) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k 261 (338) ... .+..+...-..+......++..|-. +++ --...+ ......+.......-.....-.+++-+ T Consensus 213 ~~~----------Gl~P~iv~ea~~~d~Ik~~V~~GlG-V~I--vp~mA~---~~~d~~l~~~~~~~l~~~~~~~~~~~~ 276 (327) T PRK12680 213 AQV----------GLEPSIALTALDADLIKTYVRAGLG-VGL--VAEMAV---NAFDEDLRAWPAPAPIAECIAWAVLPR 276 (327) T ss_pred HHC----------CCCCCEEEEECCHHHHHHHHHHCCE-EEE--CHHHHH---CCCCCCEEEEECCCCCCCCEEEEEEEC T ss_conf 977----------9976479999959999999994986-188--164762---876898389406457775605999978 Q ss_pred CCCCHHHHHHHHHHHC Q ss_conf 8999899999999963 Q gi|254780561|r 262 RSKQPQLAQEFMQFMI 277 (338) Q Consensus 262 ~a~n~~~A~~Fi~fll 277 (338) |+-=..-++.||+-+. T Consensus 277 ~~~lr~~~~~fi~~~~ 292 (327) T PRK12680 277 DRVLRDYALDLVHVLA 292 (327) T ss_pred CCHHHHHHHHHHHHHC T ss_conf 8468999999999866 No 141 >pfam06646 Mycoplasma_p37 High affinity transport system protein p37. This family consists of several high affinity transport system protein p37 sequences which are specific to Mycoplasma species. The p37 gene is part of an operon encoding two additional proteins which are highly similar to components of the periplasmic binding-protein-dependent transport systems of Gram-negative bacteria.It has been suggested that p37 is part of a homologous, high-affinity transport system in M. hyorhinis, a Gram-positive bacterium. Probab=36.28 E-value=26 Score=14.20 Aligned_cols=45 Identities=9% Similarity=0.141 Sum_probs=26.9 Q ss_pred HHHHHHHHHHHHHHH----------CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECC Q ss_conf 223388999999884----------8879999728868999788753126641011013 Q gi|254780561|r 44 EGAGPKIKQAFERKC----------NCELKLIGLSDGVALLNKLRMEGENSAADIVLGF 92 (338) Q Consensus 44 ~~~~~~i~~~Fek~~----------gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~ 92 (338) ...++.+.++|-+-. .|++++...+++...+.+++.. ..|+.+.. T Consensus 54 ~~Fl~~~~~~fnkLkn~~~~tk~~~dV~f~i~~~dD~~~~~~~l~~n----~~Dfai~n 108 (383) T pfam06646 54 NKFLKNFSNEFSKLKKANDKTKNLDDVTITVNIIDDSFTKIDNIQTG----KADFAFVN 108 (383) T ss_pred HHHHHHHHHHHHHHHCCCHHHCCCCCCEEEEEEECCCHHHHHHHHCC----CCCEEEEC T ss_conf 99999999999886517920146766548999824851798886468----87768853 No 142 >PRK11138 outer membrane protein assembly complex subunit YfgL; Provisional Probab=36.08 E-value=26 Score=14.19 Aligned_cols=23 Identities=17% Similarity=0.336 Sum_probs=13.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |+.. ++++.++++++++++|+.. T Consensus 1 ~~~~-~~~~~~l~~~~ll~gCs~~ 23 (394) T PRK11138 1 MQLR-KLLLPGLLSVALLSGCSLF 23 (394) T ss_pred CCHH-HHHHHHHHHHHHHHHCCCC T ss_conf 9516-7799999999998652378 No 143 >TIGR01533 lipo_e_P4 5'-nucleotidase, lipoprotein e(P4) family; InterPro: IPR006423 This group of sequences represents a set of bacterial lipoproteins belonging to a larger acid phosphatase family, which in turn belongs to the haloacid dehalogenase (HAD) superfamily of aspartate-dependent hydrolases. Members are found on the outer membrane of Gram-negative bacteria and the cytoplasmic membrane of Gram-positive bacteria. Most members have classic lipoprotein signal sequences. A critical role of this 5'-nucleotidase in Haemophilus influenzae is the degradation of external riboside in order to allow transport into the cell. An earlier suggested role in hemin transport is no longer current. This enzyme may also have other physiologically significant roles.. Probab=35.74 E-value=26 Score=14.15 Aligned_cols=33 Identities=24% Similarity=0.263 Sum_probs=16.4 Q ss_pred CCHHH-HHHHHHHHHHHH--HHHHCCCCCCCCCEEE Q ss_conf 90279-999999999999--9861467423488599 Q gi|254780561|r 1 MKKFA-RIVVGIMMITGV--ISYCTLDGLPAKPVLT 33 (338) Q Consensus 1 MKk~~-~~~~~~~~~~~~--~~~~~~~~~~~~~~l~ 33 (338) ||+.. ++++.+++.+++ +.+|+........++. T Consensus 1 M~~~~Lk~~~~a~L~lsVl~~~~C~~~q~~~~~~~~ 36 (295) T TIGR01533 1 MKKKLLKIVLIASLSLSVLLLKGCSSAQKKKEATLK 36 (295) T ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 960369999999999999985157663401355420 No 144 >TIGR01326 OAH_OAS_sulfhy O-acetylhomoserine aminocarboxypropyltransferase/cysteine synthase; InterPro: IPR006235 This family of sequences is a distinct clade of the Cys/Met metabolism pyridoxal phosphate-dependent enzyme superfamily. Members include examples of OAH/OAS sulfhydrylase, an enzyme with activity both as O-acetylhomoserine sulfhydrylase (OAH, 2.5.1.49 from EC) and O-acetylserine sulfhydrylase (OAS, 2.5.1.47 from EC). An alternate name for OAH sulfhydrylase is homocysteine synthase.; GO: 0016765 transferase activity transferring alkyl or aryl (other than methyl) groups, 0006520 amino acid metabolic process. Probab=35.29 E-value=27 Score=14.11 Aligned_cols=72 Identities=14% Similarity=0.026 Sum_probs=39.9 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCC-HHHHHHHHHHHH---------CCCCCEEEECCCHHHHHHHH-- Q ss_conf 997887785122338899999988488799997288-689997887531---------26641011013337899988-- Q gi|254780561|r 34 VYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSD-GVALLNKLRMEG---------ENSAADIVLGFDNNLIDLAR-- 101 (338) Q Consensus 34 v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~-~~~~~~~~~a~~---------~~~~~Dv~~~~~~~~~~~~~-- 101 (338) |+|..-|..-++.+..-+|.| ||+|+++..++ .++ +.++..+. +||..||.-......+.... T Consensus 100 VsS~~LYGGTynLF~~Tlkrl----GIevrFvd~dd~pe~-~~k~id~nTKAvf~EtIgNP~~~v~Die~~a~~Ah~~Ph 174 (434) T TIGR01326 100 VSSSYLYGGTYNLFKHTLKRL----GIEVRFVDGDDDPEE-LEKAIDENTKAVFAETIGNPALNVPDIEAVAEVAHAHPH 174 (434) T ss_pred EECCCCCCHHHHHHHHHHHHC----CEEEEEECCCCCHHH-HHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHCCC T ss_conf 980644422578999955544----814887278888789-997606675189840123877676785899999986789 Q ss_pred HHHCCCCCC Q ss_conf 752126136 Q gi|254780561|r 102 KTGLFAKSN 110 (338) Q Consensus 102 ~~~l~~~~~ 110 (338) ..-|+.... T Consensus 175 gvPliVDNT 183 (434) T TIGR01326 175 GVPLIVDNT 183 (434) T ss_pred CCEEEEECC T ss_conf 834887478 No 145 >pfam06291 Lambda_Bor Bor protein. This family consists of several Bacteriophage lambda Bor and Escherichia coli Iss proteins. Expression of bor significantly increases the survival of the Escherichia coli host cell in animal serum. This property is a well known bacterial virulence determinant indeed, bor and its adjacent sequences are highly homologous to the iss serum resistance locus of the plasmid ColV2-K94, which confers virulence in animals. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=34.89 E-value=20 Score=14.90 Aligned_cols=20 Identities=25% Similarity=0.338 Sum_probs=10.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |||+...+ +++.++++|+.. T Consensus 1 MKk~~~~~----~~allLtgCa~Q 20 (97) T pfam06291 1 MKKMLFAA----ALALLITGCAQQ 20 (97) T ss_pred CHHHHHHH----HHHHHHCCCCCE T ss_conf 92259999----999997213304 No 146 >cd06353 PBP1_BmpA_Med_like Periplasmic binding domain of the basic membrane lipoprotein Med in Bacillus and its close homologs from other bacteria and Archaea. Periplasmic binding domain of the basic membrane lipoprotein Med in Bacillus and its close homologs from other bacteria and Archaea. Med, a cell-surface localized protein, which regulates the competence transcription factor gene comK in Bacillus subtilis, lacks the DNA binding domain when compared with structures of transcription regulators from the LacI family. Nevertheless, Med has significant overall sequence homology to various periplasmic substrate-binding proteins. Moreover, the structure of Med shows a striking similarity to PnrA, a periplasmic nucleoside binding protein of an ATP-binding cassette transport system. Members of this group contain the type I periplasmic sugar-binding protein-like fold. Probab=34.71 E-value=27 Score=14.05 Aligned_cols=51 Identities=22% Similarity=0.285 Sum_probs=30.3 Q ss_pred HHHHHHHHCCEEEEEECC-CHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHH Q ss_conf 999998848879999728-8689997887531266410110133378999887 Q gi|254780561|r 51 KQAFERKCNCELKLIGLS-DGVALLNKLRMEGENSAADIVLGFDNNLIDLARK 102 (338) Q Consensus 51 ~~~Fek~~gikV~~~~~~-~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~ 102 (338) +++.+++.|+++.++... +.++....++. -....+|+++.....+.....+ T Consensus 23 ~~~~~~~~g~~~~~~e~~~~~~~~~~~l~~-~~~~g~dlIi~~g~~~~~~~~~ 74 (258) T cd06353 23 RKAAEKALGVEVTYVENVPEGADAERVLRE-LAAQGYDLIFGTSFGFMDAALK 74 (258) T ss_pred HHHHHHHHCCEEEEEECCCCHHHHHHHHHH-HHHCCCCEEEEECHHHHHHHHH T ss_conf 999999859869999779987899999999-9975998999944576699999 No 147 >COG1512 Beta-propeller domains of methanol dehydrogenase type [General function prediction only] Probab=33.94 E-value=28 Score=13.98 Aligned_cols=29 Identities=17% Similarity=0.310 Sum_probs=20.5 Q ss_pred CCHHHHHHHHHHHHHHHCCEEEEEECCCH Q ss_conf 51223388999999884887999972886 Q gi|254780561|r 42 ADEGAGPKIKQAFERKCNCELKLIGLSDG 70 (338) Q Consensus 42 ~~~~~~~~i~~~Fek~~gikV~~~~~~~~ 70 (338) .+...+..-++++|++++-+|-++...+. T Consensus 46 ~e~~~Leq~l~~L~~kt~~QiaVv~vpSt 74 (271) T COG1512 46 AERGALEQQLADLEQKTGAQIAVVTVPST 74 (271) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEEECCC T ss_conf 57778999999997355861899994589 No 148 >PRK13684 Ycf48-like protein; Provisional Probab=33.55 E-value=29 Score=13.94 Aligned_cols=25 Identities=8% Similarity=0.178 Sum_probs=13.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9027999999999999998614674 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG 25 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~ 25 (338) ||++.+.+..++++++++..|+++. T Consensus 1 m~~~~~~~~~l~l~~~~~~~~~~c~ 25 (333) T PRK13684 1 MKRLLKSLLNLLLLLALGLVLSGCS 25 (333) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 9268888999999999998632235 No 149 >PRK13883 conjugal transfer protein TrbH; Provisional Probab=33.47 E-value=24 Score=14.40 Aligned_cols=18 Identities=17% Similarity=0.329 Sum_probs=9.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9027999999999999998614 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCT 22 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~ 22 (338) |||++.+++. +.++++|+ T Consensus 1 MrK~~~~all----~laL~gCA 18 (156) T PRK13883 1 MRKILSLALL----ALALGGCA 18 (156) T ss_pred CHHHHHHHHH----HHHHHCCC T ss_conf 9268999999----99970331 No 150 >PRK09465 tolC outer membrane channel protein; Reviewed Probab=33.47 E-value=27 Score=14.06 Aligned_cols=13 Identities=31% Similarity=0.746 Sum_probs=8.5 Q ss_pred CCHHHHHHHHHHH Q ss_conf 9027999999999 Q gi|254780561|r 1 MKKFARIVVGIMM 13 (338) Q Consensus 1 MKk~~~~~~~~~~ 13 (338) |||++.+++++++ T Consensus 1 mkki~~~l~~l~l 13 (446) T PRK09465 1 MKKLLPLLIGLSL 13 (446) T ss_pred CCHHHHHHHHHHH T ss_conf 9429999999999 No 151 >PRK12696 flgH flagellar basal body L-ring protein; Reviewed Probab=32.96 E-value=17 Score=15.21 Aligned_cols=19 Identities=16% Similarity=0.265 Sum_probs=9.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 90279999999999999986146 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL 23 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~ 23 (338) |||++ ++++++.++++|+. T Consensus 2 m~~~l----~~~~~~~~L~GC~~ 20 (238) T PRK12696 2 IRKLL----AASCAVLLLSGCNA 20 (238) T ss_pred HHHHH----HHHHHHHHHHCCCC T ss_conf 58999----99999999516568 No 152 >COG5294 Uncharacterized protein conserved in bacteria [Function unknown] Probab=32.92 E-value=29 Score=13.88 Aligned_cols=14 Identities=29% Similarity=0.510 Sum_probs=9.9 Q ss_pred CCHHHHHHHHHHHH Q ss_conf 90279999999999 Q gi|254780561|r 1 MKKFARIVVGIMMI 14 (338) Q Consensus 1 MKk~~~~~~~~~~~ 14 (338) |||++..+++++++ T Consensus 1 MKkil~~ilall~~ 14 (113) T COG5294 1 MKKILIGILALLLI 14 (113) T ss_pred CCCHHHHHHHHHHH T ss_conf 90439999999999 No 153 >PRK10760 murein hydrolase B; Provisional Probab=32.25 E-value=30 Score=13.82 Aligned_cols=15 Identities=7% Similarity=0.122 Sum_probs=9.3 Q ss_pred HHHHHHHHHHHHCCE Q ss_conf 388999999884887 Q gi|254780561|r 47 GPKIKQAFERKCNCE 61 (338) Q Consensus 47 ~~~i~~~Fek~~gik 61 (338) .+.+++..-++||+. T Consensus 58 ~~~Fi~~mv~~~gf~ 72 (357) T PRK10760 58 AQQFIDKMVNKHGFD 72 (357) T ss_pred HHHHHHHHHHHCCCC T ss_conf 999999999961989 No 154 >COG4594 FecB ABC-type Fe3+-citrate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=32.23 E-value=30 Score=13.81 Aligned_cols=27 Identities=19% Similarity=0.401 Sum_probs=14.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9027999999999999998614674234 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPA 28 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~ 28 (338) |+++ .+.+.+++++.++++|+.+...+ T Consensus 3 ~~~~-~~i~~lll~lllva~C~~s~~~~ 29 (310) T COG4594 3 MKKT-AIILTLLLLLLLVAACSSSDNNQ 29 (310) T ss_pred CHHH-HHHHHHHHHHHHHHHHCCCCCCC T ss_conf 5035-99999999999997714767655 No 155 >COG2913 OlmA Outer membrane lipoprotein OmlA (small protein A) [Cell envelope biogenesis, outer membrane] Probab=32.11 E-value=30 Score=13.80 Aligned_cols=24 Identities=13% Similarity=0.217 Sum_probs=16.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |++...+...++++++++++|+.. T Consensus 3 ~~~~~~~~~~a~l~~~als~Cst~ 26 (147) T COG2913 3 LMATAILAIAALLGAAALSGCSTL 26 (147) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 788889999999999983148754 No 156 >pfam09508 Lact_bio_phlase Lacto-N-biose phosphorylase. The gene which codes for this protein in gut-bacteria is located in a novel putative operon for galactose metabolism. The protein appears to be a carbohydrate-processing phosphorolytic enzyme (EC:2.4.1.211), unlike either glycoside hydrolases or glycoside lyase. Intestinal colonisation by bifidobacteria is important for human health, especially in pediatrics, because colonisation seems to prevent infection by some pathogenic bacteria that cause diarrhoea or other illnesses. The operon seems to be involved in intestinal colonisation by bifidobacteria mediated by metabolism of mucin sugars. In addition, it may also resolve the question of the nature of the bifidus factor in human milk as the lacto-N-biose structure found in milk oligosaccharides. Probab=32.05 E-value=28 Score=13.99 Aligned_cols=28 Identities=18% Similarity=0.374 Sum_probs=21.4 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHHCCEEEE Q ss_conf 978877851223388999999884887999 Q gi|254780561|r 35 YTYNSFVADEGAGPKIKQAFERKCNCELKL 64 (338) Q Consensus 35 ~~~~~~~~~~~~~~~i~~~Fek~~gikV~~ 64 (338) .-|.+|.. ..-+.++++|||++|.+.+- T Consensus 227 vDWfGY~a--SVSp~aLe~FEke~GY~lrp 254 (716) T pfam09508 227 VDWFGYSA--SVSPYALEQFEKEYGYKLRP 254 (716) T ss_pred EECCCCCC--CCCHHHHHHHHHHHCCCCCH T ss_conf 33456563--23899999999984988887 No 157 >PRK01622 OxaA-like protein precursor; Validated Probab=31.96 E-value=31 Score=13.79 Aligned_cols=24 Identities=17% Similarity=0.362 Sum_probs=11.4 Q ss_pred CCHHHH-HHHHHHHHHHHHHHHCCC Q ss_conf 902799-999999999999861467 Q gi|254780561|r 1 MKKFAR-IVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~-~~~~~~~~~~~~~~~~~~ 24 (338) ||+... ++..+++++.+++||+.. T Consensus 2 ~k~~~~~l~~~~~~~~~~lsgc~~~ 26 (266) T PRK01622 2 LKSYRAVLISLSLLLVFVLSGCSNA 26 (266) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 5388999999999999999546899 No 158 >pfam09581 Spore_III_AF Stage III sporulation protein AF (Spore_III_AF). This family represents the stage III sporulation protein AF (Spore_III_AF) of the bacterial endospore formation program, which exists in some but not all members of the Firmicutes (formerly called low-GC Gram-positives). The C-terminal region of these proteins is poorly conserved. Probab=31.15 E-value=31 Score=13.71 Aligned_cols=22 Identities=27% Similarity=0.659 Sum_probs=18.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9027999999999999998614 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCT 22 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~ 22 (338) |||+.+++++++++++++.... T Consensus 19 ~~ky~~~~~gl~l~~~~l~pi~ 40 (185) T pfam09581 19 MKKYVKLVLGLLLIVIILNPVL 40 (185) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 6899999999999999999999 No 159 >PRK12700 flgH flagellar basal body L-ring protein; Reviewed Probab=31.13 E-value=32 Score=13.70 Aligned_cols=22 Identities=18% Similarity=0.391 Sum_probs=11.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 90279999999999999986146 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL 23 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~ 23 (338) ||...++.++ ++++++++||+. T Consensus 3 ~~t~~Rl~~~-~~~~~ll~GCa~ 24 (230) T PRK12700 3 LKTVLRLPVC-AALLALAAGCAM 24 (230) T ss_pred HHHHHHHHHH-HHHHHHHHCCCC T ss_conf 5889999999-999999744468 No 160 >pfam05079 DUF680 Protein of unknown function (DUF680). This family contains several uncharacterized proteins which seem to be found exclusively in Rhizobium loti. Probab=30.70 E-value=24 Score=14.44 Aligned_cols=20 Identities=30% Similarity=0.311 Sum_probs=12.5 Q ss_pred CCHHHHHHHHHHHHHHHHHH Q ss_conf 90279999999999999986 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISY 20 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~ 20 (338) |||+...+++++.+++...+ T Consensus 1 M~KiaLtaAAiL~asgsAFA 20 (73) T pfam05079 1 MKKIALTAAAILAASGSAFA 20 (73) T ss_pred CCHHHHHHHHHHHHCCCCCC T ss_conf 92248999999997065221 No 161 >PRK11189 lipoprotein NlpI; Provisional Probab=30.69 E-value=32 Score=13.66 Aligned_cols=24 Identities=21% Similarity=0.265 Sum_probs=14.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9027999999999999998614674 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG 25 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~ 25 (338) |+++++..+.+. .+.+++||+++. T Consensus 1 m~~~~r~~~~~~-~~l~LsGC~s~~ 24 (297) T PRK11189 1 MKPFLRWCFVAT-AALLLAGCASSN 24 (297) T ss_pred CCHHHHHHHHHH-HHHHHHHHCCCC T ss_conf 951899999999-999998640588 No 162 >COG4880 Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats [General function prediction only] Probab=30.30 E-value=33 Score=13.62 Aligned_cols=27 Identities=15% Similarity=0.132 Sum_probs=18.2 Q ss_pred CC-HHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 90-2799999999999999861467423 Q gi|254780561|r 1 MK-KFARIVVGIMMITGVISYCTLDGLP 27 (338) Q Consensus 1 MK-k~~~~~~~~~~~~~~~~~~~~~~~~ 27 (338) || |...++++++++.+++..|+....+ T Consensus 1 Mk~klvai~l~~~~~i~lf~~~~~~p~q 28 (603) T COG4880 1 MKKKLVAIALAFTALILLFWISSQAPPQ 28 (603) T ss_pred CCCHHHHHHHHHHHHHHHHEEECCCCCC T ss_conf 9603799999999999872774468876 No 163 >PRK12681 cysB transcriptional regulator CysB; Reviewed Probab=30.24 E-value=33 Score=13.61 Aligned_cols=198 Identities=12% Similarity=0.059 Sum_probs=95.2 Q ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 34885999978877851223388999999884-88799997288689997887531266410110133378999887521 Q gi|254780561|r 27 PAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 27 ~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) ...++|+|.+..+ .-...++.++..|.++| +|++++.. ++..++...+. +...|+.+..... .+ T Consensus 90 ~~~G~L~Ia~~~t--~a~~~LP~~i~~F~~~~P~v~l~l~~-~~~~~i~~~l~----~g~~DlaI~~e~~--------~~ 154 (324) T PRK12681 90 PDKGSLYIATTHT--QARYALPPVIKGFIKRYPRVSLHMHQ-GSPTQIAEAAA----KGEADFAIATEAL--------HL 154 (324) T ss_pred CCCCEEEEEECHH--HHHHHHHHHHHHHHHHCCCCCEEEEE-CCHHHHHHHHH----CCCCCCCCCCCCC--------CC T ss_conf 8763499985358--88877259999999878998446531-78799999998----7998723124666--------66 Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCC--CCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHH Q ss_conf 2613677000210001243211532431000110011--12458764565212000682898504765303678999987 Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKR--QITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQK 183 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~--~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (338) .......|...+....+..++ .-...+-++.+|.+.. ..+...+.. .....-.+... T Consensus 155 ----------------~~dl~~~P~~~~~~~vvvp~~HPLa~~~~itl~dL~~~p----lI~~~~g~~-~R~~id~~f~~ 213 (324) T PRK12681 155 ----------------YDDLIMLPCYHWNRSVVVPPDHPLAKKKKLTIEELAQYP----LVTYVFGFT-GRSELDTAFNR 213 (324) T ss_pred ----------------CCCCEEEEEEECCCCCCCCCCCCCCCCCCCCHHHHCCCC----EEEECCCCC-HHHHHHHHHHH T ss_conf ----------------887468640003222457999965579999999986998----596149982-99999999998 Q ss_pred HCCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCE-EEEEEEEEEECC Q ss_conf 1773137898777628846830847898886225865699950449999985079973788316980-888864243378 Q gi|254780561|r 184 IYGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGH-YLQIEVAAQLVR 262 (338) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~-~~~~~~~~i~k~ 262 (338) . .+..+...-..+......++..|-. +++ -....+. .....++ ...+.+.- ...+-.+++-+| T Consensus 214 ~----------g~~P~iv~e~~~~~~I~~~V~~GlG-V~i--lp~mA~~--~~~d~~l-~~~~~~~~f~~~~t~i~~r~g 277 (324) T PRK12681 214 A----------GLTPRIVFTATDADVIKTYVRLGLG-VGV--IASMAVD--PVADPDL-VAIDASHLFAHSTTKIGFRRG 277 (324) T ss_pred C----------CCCCEEEEEECCHHHHHHHHHHCCE-EEE--CHHHHCC--CCCCCCE-EEEECCCCCCCCEEEEEEECC T ss_conf 8----------9976079998989999999995991-397--0355519--6557881-784365567776479999799 Q ss_pred CCCHHHHHHHHHHH Q ss_conf 99989999999996 Q gi|254780561|r 263 SKQPQLAQEFMQFM 276 (338) Q Consensus 263 a~n~~~A~~Fi~fl 276 (338) .-=...++.||+-+ T Consensus 278 ~~l~~~~~~Fi~~~ 291 (324) T PRK12681 278 TFLRSYMYDFIERF 291 (324) T ss_pred CCHHHHHHHHHHHH T ss_conf 71579999999986 No 164 >PRK05362 phosphopentomutase; Provisional Probab=30.18 E-value=33 Score=13.61 Aligned_cols=46 Identities=15% Similarity=0.226 Sum_probs=27.7 Q ss_pred HHHHHHHHHHHHHHCCE-EEEEECCCHHHHHHHHHHHHCCCCCEEEE Q ss_conf 23388999999884887-99997288689997887531266410110 Q gi|254780561|r 45 GAGPKIKQAFERKCNCE-LKLIGLSDGVALLNKLRMEGENSAADIVL 90 (338) Q Consensus 45 ~~~~~i~~~Fek~~gik-V~~~~~~~~~~~~~~~~a~~~~~~~Dv~~ 90 (338) .+-++++++|++++|.+ |---...|+.+++..+-.+-......++. T Consensus 110 gFP~eli~~~~~~~g~~gvlgN~~aSGt~ii~~lG~eh~~tg~pIvY 156 (393) T PRK05362 110 GFPQELIDEIEERAGRKGILGNKHASGTEIIDELGEEHMKTGKPIVY 156 (393) T ss_pred CCHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHHHCCCCEEE T ss_conf 36099999999985898178246575049999998987620897698 No 165 >PRK11443 hypothetical protein; Provisional Probab=29.90 E-value=31 Score=13.75 Aligned_cols=29 Identities=34% Similarity=0.552 Sum_probs=14.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 9027999999999999998614674234885999 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTV 34 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v 34 (338) ||++... +++.++++|..+...-.++.+. T Consensus 1 Mk~~~~~-----~~~lll~GCq~~p~t~ap~~~~ 29 (120) T PRK11443 1 MKKFIAP-----LLALLVSGCQIDPYTHAPTLTS 29 (120) T ss_pred CHHHHHH-----HHHHHHHCCCCCCCCCCCCCCC T ss_conf 9158999-----9999996667896246865568 No 166 >COG3317 NlpB Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=29.80 E-value=33 Score=13.57 Aligned_cols=25 Identities=28% Similarity=0.555 Sum_probs=15.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 90279999999999999986146742 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGL 26 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~ 26 (338) ||....+++++++ .+++++|+.... T Consensus 1 ~k~~~~lv~~al~-v~~LaaCSs~~~ 25 (342) T COG3317 1 MKSSAKLVLGALL-VLLLAACSSDSE 25 (342) T ss_pred CCHHHHHHHHHHH-HHHHHHCCCCCC T ss_conf 9117889999999-998741468850 No 167 >PRK10363 cpxP periplasmic repressor CpxP; Reviewed Probab=29.65 E-value=33 Score=13.55 Aligned_cols=22 Identities=14% Similarity=0.193 Sum_probs=15.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9027999999999999998614 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCT 22 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~ 22 (338) |||+..++++..++++.++..+ T Consensus 1 MRklTA~vlAStLalgs~s~aa 22 (166) T PRK10363 1 MRIVTAAVMASTLAVSSLSHAA 22 (166) T ss_pred CHHHHHHHHHHHHHHHHHHHHH T ss_conf 9069999999999998999862 No 168 >PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed Probab=29.56 E-value=34 Score=13.54 Aligned_cols=22 Identities=23% Similarity=0.416 Sum_probs=15.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9027999999999999998614 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCT 22 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~ 22 (338) |+|+..++++..++++.++..+ T Consensus 1 MrK~tal~~AS~LaLGs~~a~A 22 (162) T PRK12751 1 MRKVTTLVMASMFVLGSSAAFA 22 (162) T ss_pred CHHHHHHHHHHHHHHHHHHHHH T ss_conf 9178999999999999999987 No 169 >PRK00059 prsA peptidylprolyl isomerase; Provisional Probab=29.29 E-value=34 Score=13.52 Aligned_cols=24 Identities=21% Similarity=0.234 Sum_probs=15.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) ||+..++++++++++.++++++++ T Consensus 1 ~~~~~ki~~~~ii~~~~~~~vg~~ 24 (336) T PRK00059 1 MKSAKKIVASLLVGVFIFSAVGCN 24 (336) T ss_pred CCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 950799999999999999983025 No 170 >COG4238 Murein lipoprotein [Cell envelope biogenesis, outer membrane] Probab=29.10 E-value=30 Score=13.86 Aligned_cols=24 Identities=13% Similarity=0.232 Sum_probs=14.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |++...++.+..+.+.+++||+.. T Consensus 1 m~~~~m~l~Avvlg~lllAGc~s~ 24 (78) T COG4238 1 MKTTKMTLGAVVLGSLLLAGCSSN 24 (78) T ss_pred CCCEEHHHHHHHHHHHHHHHCCCH T ss_conf 974201278999877999701108 No 171 >PRK09973 putative outer membrane lipoprotein; Provisional Probab=28.98 E-value=29 Score=13.94 Aligned_cols=24 Identities=25% Similarity=0.375 Sum_probs=14.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9027999999999999998614674 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG 25 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~ 25 (338) ||+.+. +-++.+++.++++|+..+ T Consensus 1 mrtk~v-lGaviLaS~LLaGCsn~~ 24 (85) T PRK09973 1 MKTIFT-VGAVVLATCLLSGCVNEQ 24 (85) T ss_pred CCCEEH-HHHHHHHHHHHHCCCCHH T ss_conf 960211-069999999998155558 No 172 >COG2980 RlpB Rare lipoprotein B [Cell envelope biogenesis, outer membrane] Probab=28.61 E-value=35 Score=13.45 Aligned_cols=58 Identities=9% Similarity=0.155 Sum_probs=27.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCC------CCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEE Q ss_conf 9027999999999999998614674234------885999978877851223388999999884887999 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPA------KPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKL 64 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~------~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~ 64 (338) |++ +..+ .+++++.++++|+..-... =.++.+.+.+.|. .....+-... ..+||+|.- T Consensus 1 M~~-L~~~-lL~~a~~~L~aCGFhLRg~~~ip~~l~~l~l~s~d~y~---~l~r~vrr~L-~~n~v~vv~ 64 (178) T COG2980 1 MRK-LKTL-LLLAAVLLLAACGFHLRGTTQIPPELKTLSLESSDPYG---PLTRAVRRQL-RLNGVDVVD 64 (178) T ss_pred CCC-HHHH-HHHHHHHHHHHCCEEECCCCCCCCCHHHEEEECCCCCC---HHHHHHHHHH-HHCCCEEEE T ss_conf 900-5899-99999999733453431788899411331554368987---6899999999-984964531 No 173 >pfam11105 CCAP Arthropod cardioacceleratory peptide 2a. CCAP exerts a reversible and dose-dependant cardio-stimulatory effect on the semi-isolated heart of experimental beetles. CCAP also increases free hemolymph sugar concentration in young larvae and adults of the meal-worm beetle. Probab=28.23 E-value=35 Score=13.41 Aligned_cols=72 Identities=13% Similarity=0.163 Sum_probs=33.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCC--HHHHHHHHH Q ss_conf 902799999999999999861467423488599997887785122338899999988488799997288--689997887 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSD--GVALLNKLR 78 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~--~~~~~~~~~ 78 (338) |+....++++++.++.|+-||..+......+.. +...... .....++..+| ||+--+-.++.+ ...++.+.. T Consensus 1 m~~s~~ilL~l~~~l~c~~c~~a~~eprn~~~~--~~e~~~~-~~~KrPFCNAF---TGCGrKRss~ps~pP~s~~kr~~ 74 (133) T pfam11105 1 MRTSMRILLLLVALLACLDCGVADREPRNYKQY--NMEPHKL-SPEKRPFCNAF---TGCGRKRSSYPSYPPASMFKRHD 74 (133) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC--CCCCCCC-CCCCCCCCHHH---CCCCCCCCCCCCCCCHHHHHHHH T ss_conf 903999999999999999974342152013423--4543225-75457530320---46565655689998188887654 No 174 >COG4314 NosL Predicted lipoprotein involved in nitrous oxide reduction [Energy production and conversion] Probab=28.05 E-value=36 Score=13.39 Aligned_cols=22 Identities=23% Similarity=0.549 Sum_probs=14.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9027999999999999998614674 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG 25 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~ 25 (338) ||| ++++++++++++++|.... T Consensus 1 mkr---~Lla~la~~~llAgC~~~e 22 (176) T COG4314 1 MKR---TLLAILAVTALLAGCRQAE 22 (176) T ss_pred CCH---HHHHHHHHHHHHHHCCHHH T ss_conf 940---3799999999987534311 No 175 >PRK12701 flgH flagellar basal body L-ring protein; Reviewed Probab=27.79 E-value=36 Score=13.36 Aligned_cols=21 Identities=14% Similarity=0.213 Sum_probs=11.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 90279999999999999986146 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL 23 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~ 23 (338) |+|.... ..+++++++++|+. T Consensus 1 m~r~~~~--~~~la~~lL~GC~~ 21 (230) T PRK12701 1 MNRLNIA--VSCLATALLFGCEA 21 (230) T ss_pred CCHHHHH--HHHHHHHHHHCCCC T ss_conf 9258999--99999999755668 No 176 >PRK11546 zraP zinc resistance protein; Provisional Probab=27.66 E-value=36 Score=13.35 Aligned_cols=61 Identities=13% Similarity=0.225 Sum_probs=27.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEE--CCCCCC-CHHHHHHHHHHHHHHH-CCEEE Q ss_conf 902799999999999999861467423488599997--887785-1223388999999884-88799 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYT--YNSFVA-DEGAGPKIKQAFERKC-NCELK 63 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~--~~~~~~-~~~~~~~i~~~Fek~~-gikV~ 63 (338) |||-.++.+.++.++.++.+.+ ++-+... --.|. ....+. +....+.|.++|..++ .++-+ T Consensus 1 mk~n~~~~~~l~~~~~~~~~~~-~a~a~~~-~~~~~~~~~~LT~EQQa~~qkL~~eF~~eTa~LRqq 65 (139) T PRK11546 1 MKRNTKIALVLMALSALAMGSG-SAFAHNG-HGMWQQGAAPLTTEQQAAAQKIHNDFYAQTSALRQQ 65 (139) T ss_pred CCCCCCCCHHHHHHHHHHHHCC-HHHCCCC-CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9865420599999999998035-2312578-777788889889999999999999999999999999 No 177 >pfam04076 BOF Bacterial OB fold (BOF) protein. Proteins in this family form an OB-fold. Analysis of the predicted binding site of BOF family proteins implies that they lack nucleic acid-binding properties. They contain an predicted N-terminal signal peptide which indicates that they localize in the periplasm where they may function to bind proteins, small molecules, or other typical OB-fold ligands. As hypothesized for the distantly related OB-fold containing bacterial enterotoxins, the loss of nucleotide-binding function and the rapid evolution of the BOF ligand-binding site may be associated with the presence of BOF proteins in mobile genetic elements and their potential role in bacterial pathogenicity. Probab=27.22 E-value=37 Score=13.30 Aligned_cols=20 Identities=25% Similarity=0.308 Sum_probs=11.2 Q ss_pred CCHHHHHHHHHHHHHHHHHH Q ss_conf 90279999999999999986 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISY 20 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~ 20 (338) |||.+..++++++++..++. T Consensus 1 MKK~~i~~~~~l~s~~~~Aa 20 (126) T pfam04076 1 MKKLAIALASALASTSALAA 20 (126) T ss_pred CCHHHHHHHHHHHHCHHHHH T ss_conf 95689999999984678797 No 178 >PRK09906 DNA-binding transcriptional regulator HcaR; Provisional Probab=27.13 E-value=37 Score=13.29 Aligned_cols=197 Identities=13% Similarity=0.029 Sum_probs=89.0 Q ss_pred CCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCC Q ss_conf 4885999978877851223388999999884-887999972886899978875312664101101333789998875212 Q gi|254780561|r 28 AKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLF 106 (338) Q Consensus 28 ~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~ 106 (338) ....|+|....+ .....++.++..|.++| +|+|++.... +.+....+.. ...|+.+...+... . T Consensus 88 ~~~~l~ig~~~~--~~~~~lp~~l~~f~~~~P~v~v~l~~~~-~~~~~~~l~~----g~~D~~i~~~~~~~-----~--- 152 (296) T PRK09906 88 EDRQLTIGFVPS--AEVNLLPKVLPMFRLRHPDTLIELVSLI-NTQQEEKLRR----GELDVGFMRHPVYS-----D--- 152 (296) T ss_pred CCCEEEEEECHH--HHHHHHHHHHHHHHHHCCCCCEEEEECC-CHHHHHHHHC----CCCCEEEEECCCCC-----C--- T ss_conf 771388321225--6999999999998740898634899758-0899999986----99767987368899-----8--- Q ss_pred CCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCC--CCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHH Q ss_conf 61367700021000124321153243100011001--1124587645652120006828985047653036789999871 Q gi|254780561|r 107 AKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDK--RQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKI 184 (338) Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~--~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (338) .. ...+......+..... ........++.+|.+.. ..+..+..+............. T Consensus 153 --------~l---------~~~~l~~~~~~~~~~~~hpla~~~~i~~~dl~~~~----~i~~~~~~~~~~~~~~~~~~~~ 211 (296) T PRK09906 153 --------EI---------DYLELLDEPLVVVLPVDHPLAHEKEITAAQLDGVN----FISPDPAQSGSLAPIIKAWFAQ 211 (296) T ss_pred --------CE---------EEEEEECCEEEEEEECCCCCCCCCCCCHHHHCCCC----EEEECCCCCCCHHHHHHHHHHH T ss_conf --------81---------89985046079998289810059998999975998----7963665476099999999997 Q ss_pred CCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEE-EEEEEEECCC Q ss_conf 773137898777628846830847898886225865699950449999985079973788316980888-8642433789 Q gi|254780561|r 185 YGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQ-IEVAAQLVRS 263 (338) Q Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~-~~~~~i~k~a 263 (338) . ....+......+......+...|.. ++....... . .....+. ..|-++.... .-+++-.++. T Consensus 212 ~---------g~~p~~~~~~~~~~~~~~~v~~G~G---i~~lP~~~~-~--~~~~~l~-~~pl~~~~~~~~~~l~~r~~~ 275 (296) T PRK09906 212 H---------NSQPNIVQVATNILVTMNLVGMGLG---CTIIPGYMN-N--FNTGQVV-FRPLAGNVPSIALLMAWKKGE 275 (296) T ss_pred C---------CCCCCEEEEECCHHHHHHHHHHCCE---EEECHHHHH-H--HCCCCEE-EEECCCCCCEEEEEEEECCCC T ss_conf 7---------9997469998989999999994989---998458773-1--0689999-998899987028999987999 Q ss_pred CCHHHHHHHHHHHC Q ss_conf 99899999999963 Q gi|254780561|r 264 KQPQLAQEFMQFMI 277 (338) Q Consensus 264 ~n~~~A~~Fi~fll 277 (338) . ..+++.||+|+- T Consensus 276 ~-spa~~~Fi~~l~ 288 (296) T PRK09906 276 M-KPALRDFIAIVQ 288 (296) T ss_pred C-CHHHHHHHHHHH T ss_conf 9-999999999999 No 179 >COG5633 Predicted periplasmic lipoprotein [General function prediction only] Probab=27.02 E-value=37 Score=13.28 Aligned_cols=21 Identities=14% Similarity=0.365 Sum_probs=10.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9027999999999999998614674 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDG 25 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~ 25 (338) |||...+.+++ .++.||++.. T Consensus 1 Mrk~~~~~l~~----~lLvGCsS~~ 21 (123) T COG5633 1 MRKLCLLSLAL----LLLVGCSSHQ 21 (123) T ss_pred CCEEHHHHHHH----HHHHCCCCCC T ss_conf 93003899999----9942047887 No 180 >PRK00031 lolA outer-membrane lipoprotein carrier protein; Reviewed Probab=26.84 E-value=38 Score=13.26 Aligned_cols=13 Identities=23% Similarity=0.094 Sum_probs=7.4 Q ss_pred CCHHHHHHHHHHH Q ss_conf 9027999999999 Q gi|254780561|r 1 MKKFARIVVGIMM 13 (338) Q Consensus 1 MKk~~~~~~~~~~ 13 (338) |||++.+++.+++ T Consensus 1 mk~~~~~~~~~~~ 13 (201) T PRK00031 1 MKKLLIAALLAAA 13 (201) T ss_pred CCHHHHHHHHHHH T ss_conf 9159999999999 No 181 >PRK10837 putative DNA-binding transcriptional regulator; Provisional Probab=26.10 E-value=39 Score=13.18 Aligned_cols=202 Identities=9% Similarity=0.004 Sum_probs=93.0 Q ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHC Q ss_conf 34885999978877851223388999999884-88799997288689997887531266410110133378999887521 Q gi|254780561|r 27 PAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGL 105 (338) Q Consensus 27 ~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l 105 (338) ...++|.|.+..+. -...++.++..|.++| +|+|++. .+++.+++..+.. ...|+.+...+. T Consensus 86 ~~~g~l~i~~~~~~--~~~~l~~~l~~f~~~~P~v~l~l~-~~~~~~~~~~l~~----~~~D~~i~~~~~---------- 148 (291) T PRK10837 86 EDNGAIRIYASSTI--GNYILPAMIARYRHDYPQLPLELS-VGNSQDVIQAVLD----FRVDIGLIEGPC---------- 148 (291) T ss_pred CCCCEEEEEEEHHH--HHHHHHHHHHHHHHHCCCCEEEEE-ECCHHHHHHHHHH----CCCCEEEECCCC---------- T ss_conf 78862999961799--999999999999964899359999-8888999999980----884166741788---------- Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCEEEEEEHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHHC Q ss_conf 26136770002100012432115324310001100111245876456521200068289850476530367899998717 Q gi|254780561|r 106 FAKSNIDASQLKLPIKWDDDIFVPYDYGYLAFIYDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKIY 185 (338) Q Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (338) ..+.. ........ ..... ...+.. ....|.++.+|.+..+ .+.... +............. T Consensus 149 ---~~~~l---~~~~l~~~----~~~~v---~~p~~~-l~~~~i~~~~l~~~~~----i~~~~~-~~~r~~~~~~~~~~- 208 (291) T PRK10837 149 ---HSTEI---ISEPWLED----ELVVF---AAPDSP-LARGPVTLEQLAAAPW----ILRERG-SGTREIVDYLLLSH- 208 (291) T ss_pred ---CCCCE---EEEEEECC----CEEEE---ECCCCH-HHCCCCCHHHHHCCCE----EEECCC-CCHHHHHHHHHHHC- T ss_conf ---99871---89994414----18999---878866-5379999899807987----994499-86899999999854- Q ss_pred CCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCCC Q ss_conf 73137898777628846830847898886225865699950449999985079973788316980888864243378999 Q gi|254780561|r 186 GDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSKQ 265 (338) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~n 265 (338) ....+.....++..........|.. ++...........+. ..+..+.++.......-+++-.|+..- T Consensus 209 ---------~~~~~~~~~~~~~~~i~~~v~~G~G---ia~lp~~~v~~~l~~-g~L~~l~~~~~~~~~~~~lv~~k~~~l 275 (291) T PRK10837 209 ---------LPRFEMAMELGNSEAIKHAVRHGLG---ISCLSRRVIADQLQA-GTLVEVAVPLPRLMRTLWRIHHRQKHL 275 (291) T ss_pred ---------CCCCCEEEECCCHHHHHHHHHHCCC---EEECCHHHHHHHHHC-CCEEEEECCCCCCCCEEEEEEECCCCC T ss_conf ---------8865301312979999999995991---897249999999878-997997078888612799999897865 Q ss_pred HHHHHHHHHHHCC Q ss_conf 8999999999639 Q gi|254780561|r 266 PQLAQEFMQFMIS 278 (338) Q Consensus 266 ~~~A~~Fi~flls 278 (338) +.+++.||+|+-. T Consensus 276 s~~~~~Fidfl~~ 288 (291) T PRK10837 276 SNALQRFLSYCQP 288 (291) T ss_pred CHHHHHHHHHHHH T ss_conf 9999999999862 No 182 >cd06354 PBP1_BmpA_PnrA_like Periplasmic binding domain of basic membrane lipoprotein, PnrA, in Treponema pallidum and its homologs from other bacteria and Archaea. Periplasmic binding domain of basic membrane lipoprotein, PnrA, in Treponema pallidum and its homologs from other bacteria and Archaea. The PnrA lipoprotein, also known as Tp0319 or TmpC, represents a novel family of bacterial purine nucleoside receptor encoded within an ATP-binding cassette (ABC) transport system (pnrABCDE). It shows a striking structural similarity to another basic membrane lipoprotein Med which regulates the competence transcription factor gene, comK, in Bacillus subtilis. The members of PnrA-like subgroup are likely to have similar nucleoside-binding functions and a similar type I periplasmic sugar-binding protein-like fold. Probab=26.08 E-value=39 Score=13.18 Aligned_cols=50 Identities=14% Similarity=0.182 Sum_probs=33.0 Q ss_pred HHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHH Q ss_conf 999998848879999728868999788753126641011013337899988 Q gi|254780561|r 51 KQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLAR 101 (338) Q Consensus 51 ~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~ 101 (338) ++++++++|++++++...+..+....++.-. ...+|+++........... T Consensus 24 ~~~~~~~~g~~~~~~e~~~~~d~~~~~~~~~-~~g~~lIi~~g~~~~~~~~ 73 (265) T cd06354 24 LERAAKELGIEYKYVESKSDADYEPNLEQLA-DAGYDLIVGVGFLLADALK 73 (265) T ss_pred HHHHHHHHCCEEEEEECCCHHHHHHHHHHHH-HCCCCEEEECCHHHHHHHH T ss_conf 9999998699389993799899999999999-8799999991756789999 No 183 >TIGR03524 GldJ gliding motility-associated lipoprotein GldJ. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldJ is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldJ abolish the gliding phenotype. GldJ is homologous to GldK. There is a GldJ homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=25.97 E-value=39 Score=13.16 Aligned_cols=22 Identities=18% Similarity=0.144 Sum_probs=12.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9027999999999999998614 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCT 22 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~ 22 (338) ||.++++++.+++++++.+|.. T Consensus 3 ~~~~~~~~~~~~~~~~~~sC~~ 24 (559) T TIGR03524 3 MKNVFKITFIVFVSLLFVSCKK 24 (559) T ss_pred HHHHHHHHHHHHHHHHHHHCCC T ss_conf 6788999999999999764258 No 184 >PRK11087 hypothetical protein; Provisional Probab=25.67 E-value=40 Score=13.13 Aligned_cols=36 Identities=17% Similarity=0.244 Sum_probs=14.9 Q ss_pred CC-HHHHHHHHHHHHHHHHHHHCC-CCCCCCCEEEEEE Q ss_conf 90-279999999999999986146-7423488599997 Q gi|254780561|r 1 MK-KFARIVVGIMMITGVISYCTL-DGLPAKPVLTVYT 36 (338) Q Consensus 1 MK-k~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~v~~ 36 (338) || |++.+.+.+++..++++.|.. ......++++|.. T Consensus 1 ~~~~~~~~~~~~~l~~~~~~~~~~a~~~p~~p~ItVtG 38 (238) T PRK11087 1 MKLKVLALAALLGLSASLLAMAAQAAELPDFPHIVTSG 38 (238) T ss_pred CCCHHHHHHHHHHHHHHHHCCCHHHCCCCCCCEEEEEE T ss_conf 93049999999998776632434434799998799998 No 185 >PRK13697 cytochrome c6; Provisional Probab=25.58 E-value=40 Score=13.12 Aligned_cols=16 Identities=25% Similarity=0.463 Sum_probs=9.7 Q ss_pred CCHHHHHHHHHHHHHH Q ss_conf 9027999999999999 Q gi|254780561|r 1 MKKFARIVVGIMMITG 16 (338) Q Consensus 1 MKk~~~~~~~~~~~~~ 16 (338) |||++.+++..++++. T Consensus 1 Mk~l~~~~l~~~~~~~ 16 (111) T PRK13697 1 MKKILKLVLLTLLLLT 16 (111) T ss_pred CHHHHHHHHHHHHHHH T ss_conf 9789999999999999 No 186 >PRK10954 periplasmic protein disulfide isomerase I; Provisional Probab=25.27 E-value=40 Score=13.09 Aligned_cols=16 Identities=25% Similarity=0.374 Sum_probs=10.0 Q ss_pred CCHHHHHHHHHHHHHH Q ss_conf 9027999999999999 Q gi|254780561|r 1 MKKFARIVVGIMMITG 16 (338) Q Consensus 1 MKk~~~~~~~~~~~~~ 16 (338) |||++.+++++++..+ T Consensus 1 mkk~~~~~~~llls~~ 16 (207) T PRK10954 1 MKKIWLALAGMVLAFS 16 (207) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 9159999999999977 No 187 >pfam11777 DUF3316 Protein of unknown function (DUF3316). This family of bacterial proteins has no known function. Several members are, however, annotated as being putative acyl-CoA synthetase, but this could not be confirmed. Probab=25.19 E-value=40 Score=13.08 Aligned_cols=33 Identities=12% Similarity=0.157 Sum_probs=15.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 9027999999999999998614674234885999 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTV 34 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v 34 (338) |||++.+++.+++.+.+.+ +......+..++.. T Consensus 1 mKk~~ll~~~ll~s~~AfA-~~~~~~~~~~~~~~ 33 (114) T pfam11777 1 MKKLILLALLLLLSATAFA-GNYVSTTQTDSLQT 33 (114) T ss_pred CHHHHHHHHHHHHHHHHHH-CCCCCCCCCEEEEE T ss_conf 9108999999999888875-55536556258971 No 188 >PRK10907 intramembrane serine protease GlpG; Provisional Probab=24.97 E-value=41 Score=13.05 Aligned_cols=10 Identities=10% Similarity=-0.007 Sum_probs=5.3 Q ss_pred HHHHCCEEEE Q ss_conf 9884887999 Q gi|254780561|r 55 ERKCNCELKL 64 (338) Q Consensus 55 ek~~gikV~~ 64 (338) =+..||..++ T Consensus 20 l~~~gi~~~v 29 (276) T PRK10907 20 MATQGVILTI 29 (276) T ss_pred HHHCCCCEEE T ss_conf 9867990798 No 189 >PRK11615 hypothetical protein; Provisional Probab=24.33 E-value=42 Score=12.98 Aligned_cols=22 Identities=14% Similarity=0.253 Sum_probs=14.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9027999999999999998614 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCT 22 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~ 22 (338) ||++.+.+...++++++.+|-. T Consensus 1 m~~l~Ky~giglLv~gLAACD~ 22 (185) T PRK11615 1 MRNLVKYVGIGLLVMGLAACDN 22 (185) T ss_pred CCHHHHHHHHHHHHHHHHHHCC T ss_conf 9107888869999988876238 No 190 >PRK11679 lipoprotein; Provisional Probab=23.97 E-value=43 Score=12.94 Aligned_cols=22 Identities=14% Similarity=0.160 Sum_probs=12.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9027999999999999998614 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCT 22 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~ 22 (338) ||-.+..++++++++.+.+|++ T Consensus 6 ~k~~~~~v~~~~lv~~L~ACss 27 (345) T PRK11679 6 QKSRLAKVAGVSLVLLLAACSS 27 (345) T ss_pred HHHHHHHHHHHHHHHHHHHCCC T ss_conf 1203777888999999986079 No 191 >PRK11013 DNA-binding transcriptional regulator LysR; Provisional Probab=23.94 E-value=43 Score=12.94 Aligned_cols=205 Identities=9% Similarity=0.001 Sum_probs=93.8 Q ss_pred CCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHCC Q ss_conf 4885999978877851223388999999884-887999972886899978875312664101101333789998875212 Q gi|254780561|r 28 AKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARKTGLF 106 (338) Q Consensus 28 ~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~~~l~ 106 (338) ..++|.|.+..+. -...++.++..|.++| +|+|++... ++..+...+.. ...|+.+...... T Consensus 92 ~~G~lri~~~~~~--~~~~lp~~l~~f~~~~P~v~v~i~~~-~~~~~~~~l~~----~~~Dl~l~~~~~~---------- 154 (309) T PRK11013 92 RQGQLSIACLPVF--SQSLLPGLCQPFLARYPDVSLNIVPQ-ESPLLEEWLSA----QRHDLGLTETLHT---------- 154 (309) T ss_pred CCCCCCCCCCHHH--HHHHCCHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHHC----CCCCEEEEECCCC---------- T ss_conf 3676565300678--98866523156784389807999989-85999999975----9977899607889---------- Q ss_pred CCCCCCHHHCCCCCCCCCCCCCCEEEEE-EHHH-HCCCCCCCCCCCHHHHCCCCCCCEEEEEECCCCCCHHHHHHHHHHH Q ss_conf 6136770002100012432115324310-0011-0011124587645652120006828985047653036789999871 Q gi|254780561|r 107 AKSNIDASQLKLPIKWDDDIFVPYDYGY-LAFI-YDKRQITQPPKNFDELINSTKPWKIIYQDPRTSTLGLGLLLWIQKI 184 (338) Q Consensus 107 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-y~~~~~~~~p~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 184 (338) +... . ..+..... ..+. -+.......+-++.||.+..+ ..... .+......-...... T Consensus 155 -~~~~---~-----------~~~l~~~~~~~v~~~~hpla~~~~i~~~dL~~~~~----i~~~~-~~~~r~~~~~~~~~~ 214 (309) T PRK11013 155 -PAGT---E-----------RTELLTLDEVCVLPDGHPLAAKKVLTPDDFQGENF----ISLSR-TDSYRQLLDQLFAEH 214 (309) T ss_pred -CCCC---E-----------EEEEEEEEEEEECCCCCCCCCCCCCCHHHHCCCCE----EECCC-CCCHHHHHHHHHHHC T ss_conf -9875---7-----------88864201467547898000499999999738986----96389-994999999999976 Q ss_pred CCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCEEEEEEEEEEECCCC Q ss_conf 77313789877762884683084789888622586569995044999998507997378831698088886424337899 Q gi|254780561|r 185 YGDNSAQVWKKIATKTATVTKGWTESYGFFLKGESDFVLSYSTSPGFYLLNYGQDDYVAALFSEGHYLQIEVAAQLVRSK 264 (338) Q Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eG~~~~~~~~~i~k~a~ 264 (338) + ...+...-..+......++..|.. +++ ......... ....+ .+.|-+......-+++.+++.+ T Consensus 215 g----------~~~~~~~e~~~~~~~~~lV~~G~G-vai--l~~~~~~~~--~~~g~-~~~pl~~~~~~~~~lv~~~~r~ 278 (309) T PRK11013 215 G----------VKRRMVVETHSAASVCAMVRAGLG-VSI--VNPLTALDY--AGSGL-VVRPFSISVPFTVSLIRPLHRP 278 (309) T ss_pred C----------CCCCEEEEECCHHHHHHHHHHCCE-EEE--ECHHHHHHH--HHCCE-EEEECCCCCCEEEEEEEECCCC T ss_conf 9----------997769998969999999997996-999--469999866--21990-9998888996189999989997 Q ss_pred CHHHHHHHHHHHCCHHHHHHHH Q ss_conf 9899999999963999999988 Q gi|254780561|r 265 QPQLAQEFMQFMISPSFQRILP 286 (338) Q Consensus 265 n~~~A~~Fi~flls~e~Q~~~a 286 (338) ...++..|++|+- .++++..+ T Consensus 279 ~s~~~~aF~~~Lr-~~a~al~~ 299 (309) T PRK11013 279 SSALVDAFSEHLQ-AQAKALVT 299 (309) T ss_pred CCHHHHHHHHHHH-HHHHHHHH T ss_conf 7999999999999-99999999 No 192 >PRK06707 amidase; Provisional Probab=23.57 E-value=32 Score=13.63 Aligned_cols=53 Identities=17% Similarity=0.270 Sum_probs=28.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC-----CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH-CCEEE Q ss_conf 902799999999999999861467-----4234885999978877851223388999999884-88799 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD-----GLPAKPVLTVYTYNSFVADEGAGPKIKQAFERKC-NCELK 63 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~l~v~~~~~~~~~~~~~~~i~~~Fek~~-gikV~ 63 (338) |||+.+....++.++.+++|.+.- -..++.+-.+|. .+.+++.|.++- ++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~ 59 (536) T PRK06707 1 MKKWVKVTLSIAGGIVLLACAGGYYVYKNYFPKEPERIVYD----------KERVLQPIHNQLKGINIE 59 (536) T ss_pred CCCCEEEHHHHEEHHEEEECCCEEEEEECCCCCCCCCCCCC----------HHHHHHHHHHHHHHCCHH T ss_conf 95340011105023215624613898533577775302026----------888874699888504788 No 193 >PRK10455 periplasmic protein; Reviewed Probab=23.40 E-value=44 Score=12.87 Aligned_cols=19 Identities=21% Similarity=0.424 Sum_probs=14.0 Q ss_pred CCHHHHHHHHHHHHHHHHH Q ss_conf 9027999999999999998 Q gi|254780561|r 1 MKKFARIVVGIMMITGVIS 19 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~ 19 (338) |||+..++++..+++++++ T Consensus 1 Mrkltal~~AS~l~lg~~~ 19 (161) T PRK10455 1 MRKLTALFVASTLALGAAN 19 (161) T ss_pred CHHHHHHHHHHHHHHHHHH T ss_conf 9179999999999983668 No 194 >COG3417 FlgN Collagen-binding surface adhesin SpaP (antigen I/II family) [General function prediction only] Probab=23.09 E-value=44 Score=12.84 Aligned_cols=22 Identities=23% Similarity=0.401 Sum_probs=12.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 90279999999999999986146 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTL 23 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~ 23 (338) |+|..+ .+.+++++.++++|+. T Consensus 1 mtk~k~-~~~il~~al~l~GCs~ 22 (200) T COG3417 1 MTKMKI-YASILLLALFLSGCSS 22 (200) T ss_pred CCHHHH-HHHHHHHHHHHHHCCC T ss_conf 925799-9999999999841136 No 195 >PRK03598 hypothetical protein; Provisional Probab=23.07 E-value=44 Score=12.84 Aligned_cols=21 Identities=24% Similarity=0.150 Sum_probs=14.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 902799999999999999861 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYC 21 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~ 21 (338) |||.+.+.++++++.++..+. T Consensus 1 mkk~~~~~~~~~~~~~~~~~~ 21 (331) T PRK03598 1 MKKPVVIGLAVVALVAVGAGG 21 (331) T ss_pred CCCHHHHHHHHHHHHHHHHHH T ss_conf 983389999999999999997 No 196 >COG4704 Uncharacterized protein conserved in bacteria [Function unknown] Probab=22.77 E-value=45 Score=12.80 Aligned_cols=35 Identities=17% Similarity=0.043 Sum_probs=14.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEE Q ss_conf 90279999999999999986146742348859999 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVY 35 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~ 35 (338) |.++....+.++..++.+.......+...+.|||. T Consensus 1 m~~~~~~~l~Ll~aa~sL~~~~aaaaeatgkLTvt 35 (151) T COG4704 1 MLNISRRRLFLLAAALSLVSLKAAAAEATGKLTVT 35 (151) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 95077999999999999874887777534856999 No 197 >PRK13528 outer membrane receptor FepA; Provisional Probab=22.28 E-value=46 Score=12.74 Aligned_cols=24 Identities=13% Similarity=0.126 Sum_probs=12.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 902799999999999999861467 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~ 24 (338) |++...+.++.+++++|++..... T Consensus 3 ~~~~~~~~~~~~~~~~l~~~~~a~ 26 (727) T PRK13528 3 MRANKILWLLTVVLAGLNSQLSAA 26 (727) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 304669999999999875055565 No 198 >PRK13733 conjugal transfer protein TraV; Provisional Probab=21.88 E-value=9 Score=16.89 Aligned_cols=24 Identities=25% Similarity=0.242 Sum_probs=12.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 90279999999999999986146742 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGL 26 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~ 26 (338) |||+. ++..++.+.+++||++-.+ T Consensus 1 mk~~~--~l~~l~~~llLtGCAG~nS 24 (171) T PRK13733 1 MKKIS--LLIPLLGTLLLSGCAGTNS 24 (171) T ss_pred CCEEE--EEHHHCCEEEEECCCCCCC T ss_conf 95167--5112026002602457775 No 199 >TIGR00658 orni_carb_tr ornithine carbamoyltransferase; InterPro: IPR002292 The ornithine carbamoyltransferases are a family of enzymes that catalyse the production of citrulline from carbamoyl-phosphate and ornithine . This reaction is part of the arginine biosynthetic pathway, and in some organisms the reaction is also part of the arginine deaminase pathway . In most prokaryotes, and some lower eukaryotes, the enzyme is found in the cytoplasm, but in higher eukaryotes, such as mammals, the enzyme is found in the mitochondrial matrix and functions as part of the urea cycle. In most organisms analysed to date, the enzyme has been shown to consist of a trimer of identical or nonidentical subunits . Extensive similarity has been found within the ornithine carbamoyltransferase family, some of the conserved areas being important in catalysis . The C-terminal region of these enzymes shows a degree of similarity to the aspartate carbamoyltransferases, which are also known to bind carbamoyl- phosphate . This region contains a highly conserved Cys residue (in a His-Cys-Lys-Pro motif) implicated in ornithine binding . ; GO: 0004585 ornithine carbamoyltransferase activity, 0006520 amino acid metabolic process, 0009348 ornithine carbamoyltransferase complex. Probab=21.83 E-value=47 Score=12.69 Aligned_cols=34 Identities=12% Similarity=0.102 Sum_probs=17.3 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEE Q ss_conf 59999788778512233889999998848879999 Q gi|254780561|r 31 VLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLI 65 (338) Q Consensus 31 ~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~ 65 (338) .+.|.|..+|.++..+.+ -+++|-+++|=+|+++ T Consensus 191 ~~~v~tP~gyeP~~~~v~-~A~~~a~e~Gg~~~lt 224 (341) T TIGR00658 191 DVVVATPEGYEPDAEIVK-KAKKIAKENGGSVELT 224 (341) T ss_pred EEEEECCCCCCCCHHHHH-HHHHHHHHCCCEEEEE T ss_conf 478878888887878999-9999997179859996 No 200 >TIGR01347 sucB 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase; InterPro: IPR006255 These sequences describe the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. Members include the mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus and are generally excluded from this group of sequences as are all examples of dihydrolipoamide acetyltransferase. ; GO: 0004149 dihydrolipoyllysine-residue succinyltransferase activity, 0006099 tricarboxylic acid cycle, 0045252 oxoglutarate dehydrogenase complex. Probab=21.79 E-value=47 Score=12.68 Aligned_cols=26 Identities=4% Similarity=-0.180 Sum_probs=9.8 Q ss_pred EEEEEEEEE--CCCCCHHHH--HHHHHHHC Q ss_conf 888642433--789998999--99999963 Q gi|254780561|r 252 LQIEVAAQL--VRSKQPQLA--QEFMQFMI 277 (338) Q Consensus 252 ~~~~~~~i~--k~a~n~~~A--~~Fi~fll 277 (338) ..+.|+++| .|+.....| .+=|.-|- T Consensus 303 sTd~GLVvPVvRnad~l~~AdIE~~I~~L~ 332 (435) T TIGR01347 303 STDRGLVVPVVRNADALSFADIEKEIKELG 332 (435) T ss_pred ECCCCCEEEEEECCCCCCHHHHHHHHHHHH T ss_conf 759983342272566578677779989888 No 201 >pfam04507 DUF576 Protein of unknown function, DUF576. This family contains several uncharacterized staphylococcal proteins. Probab=21.65 E-value=43 Score=12.93 Aligned_cols=24 Identities=21% Similarity=0.343 Sum_probs=14.0 Q ss_pred CCHHH--HHHHHHHHHHHHHHHHCCC Q ss_conf 90279--9999999999999861467 Q gi|254780561|r 1 MKKFA--RIVVGIMMITGVISYCTLD 24 (338) Q Consensus 1 MKk~~--~~~~~~~~~~~~~~~~~~~ 24 (338) |+... .+-+++++++..+++|+.. T Consensus 1 M~~~kk~~L~is~liliifI~Gcg~~ 26 (257) T pfam04507 1 MKYSKKLALYISVLILIIFISGCGFM 26 (257) T ss_pred CCCHHEEHHHHHHHHHHHHHEECCCC T ss_conf 95001003678899786520002235 No 202 >pfam06551 DUF1120 Protein of unknown function (DUF1120). This family consists of several hypothetical bacterial proteins of unknown function. Probab=21.37 E-value=48 Score=12.63 Aligned_cols=31 Identities=26% Similarity=0.278 Sum_probs=13.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCC-CCCEEEE Q ss_conf 902799999999999999861467423-4885999 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLP-AKPVLTV 34 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~v 34 (338) |||.+ +..+++++++.+|+...++ ...+|+| T Consensus 1 mkK~l---~~~~i~a~l~~~~ssa~aa~~ta~LkV 32 (145) T pfam06551 1 MKKNL---AATAIAAGLAVLATSAQAADSTAVLKV 32 (145) T ss_pred CCHHH---HHHHHHHHHHHHHCCHHHCCCCCEEEE T ss_conf 92378---999999999986031422577446999 No 203 >COG3470 Tpd Uncharacterized protein probably involved in high-affinity Fe2+ transport [Inorganic ion transport and metabolism] Probab=20.96 E-value=49 Score=12.58 Aligned_cols=25 Identities=16% Similarity=0.210 Sum_probs=13.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 90279999999999999986146742 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGL 26 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~ 26 (338) ||+. +++++.+++++++++|+..++ T Consensus 1 M~~~-k~l~~~~~~a~v~s~~a~~~a 25 (179) T COG3470 1 MKMK-KLLLSAAILASVFSAPAEGAA 25 (179) T ss_pred CCHH-HHHHHHHHHHHHHHHHHCCCC T ss_conf 9168-999999999999741000023 No 204 >COG5298 Uncharacterized protein conserved in bacteria [Function unknown] Probab=20.90 E-value=49 Score=12.57 Aligned_cols=56 Identities=13% Similarity=0.017 Sum_probs=31.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCC--CCCHHHHHHHHHHHHH Q ss_conf 9027999999999999998614674234885999978877--8512233889999998 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCTLDGLPAKPVLTVYTYNSF--VADEGAGPKIKQAFER 56 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~~~--~~~~~~~~~i~~~Fek 56 (338) |++.+.+.+.++|++..++..+..+.++.+++++|..-.. +.+..++..++--|.. T Consensus 1 ~~~~l~i~~llfca~~s~s~~~qa~~qD~gv~~~y~~ld~~~~~q~~Mld~Llghfns 58 (530) T COG5298 1 MRKKLIISILLFCAAFSVSTHPQAAKQDSGVVVFYDSLDKGTDNQGNMLDSLLGHFNS 58 (530) T ss_pred CCHHHHHHHHHHHHHEECCCCCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHC T ss_conf 9244778999988733500477776667877999982358875057799999987534 No 205 >TIGR02346 chap_CCT_theta T-complex protein 1, theta subunit; InterPro: IPR012721 Members of this eukaryotic family are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT theta chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding. Probab=20.88 E-value=49 Score=12.57 Aligned_cols=57 Identities=9% Similarity=-0.076 Sum_probs=33.5 Q ss_pred CCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHH Q ss_conf 85122338899999988488799997288689997887531266410110133378999887 Q gi|254780561|r 41 VADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKLRMEGENSAADIVLGFDNNLIDLARK 102 (338) Q Consensus 41 ~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~~a~~~~~~~Dv~~~~~~~~~~~~~~ 102 (338) -.|+...+.++|+| ++.||+|- ++.|+-+++ +++-.++.+ .-++=..+-..++++.. T Consensus 272 KgEE~~~e~~iK~i-ad~Gv~VI-v~gG~v~d~-AlHy~~~Y~--im~~k~~SKFELrRlCk 328 (554) T TIGR02346 272 KGEENQIEALIKAI-ADSGVKVI-VTGGSVGDM-ALHYVEKYN--IMVLKIPSKFELRRLCK 328 (554) T ss_pred CCHHHHHHHHHHHH-HHCCCEEE-EECCCHHHH-HHHHHHHCC--EEEEEECCCHHHHHHHH T ss_conf 02689999987899-86898599-946848899-998781558--26999778265889998 No 206 >PRK13484 putative iron-regulated outer membrane virulence protein; Provisional Probab=20.85 E-value=49 Score=12.57 Aligned_cols=17 Identities=6% Similarity=0.016 Sum_probs=9.2 Q ss_pred HHHHHHHHHHHHCCEEE Q ss_conf 38899999988488799 Q gi|254780561|r 47 GPKIKQAFERKCNCELK 63 (338) Q Consensus 47 ~~~i~~~Fek~~gikV~ 63 (338) ...|.+.++...||.|. T Consensus 64 ~~~l~d~L~~~pGv~v~ 80 (682) T PRK13484 64 VSDLVDAVKDVEGISIT 80 (682) T ss_pred CCCHHHHHHHCCCEEEE T ss_conf 99899998418998993 No 207 >pfam06649 DUF1161 Protein of unknown function (DUF1161). This family consists of several short, hypothetical bacterial proteins of unknown function. Probab=20.45 E-value=50 Score=12.52 Aligned_cols=16 Identities=19% Similarity=0.328 Sum_probs=7.8 Q ss_pred CCHHHHHHHHHHHHHH Q ss_conf 9027999999999999 Q gi|254780561|r 1 MKKFARIVVGIMMITG 16 (338) Q Consensus 1 MKk~~~~~~~~~~~~~ 16 (338) |||+......+++++. T Consensus 1 Mkk~~l~~~l~lla~~ 16 (75) T pfam06649 1 MKRLLLAVALLLLAAS 16 (75) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 9246999999997134 No 208 >PRK08605 D-lactate dehydrogenase; Validated Probab=20.42 E-value=50 Score=12.51 Aligned_cols=41 Identities=10% Similarity=0.104 Sum_probs=28.0 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCHHHHHHHH Q ss_conf 59999788778512233889999998848879999728868999788 Q gi|254780561|r 31 VLTVYTYNSFVADEGAGPKIKQAFERKCNCELKLIGLSDGVALLNKL 77 (338) Q Consensus 31 ~l~v~~~~~~~~~~~~~~~i~~~Fek~~gikV~~~~~~~~~~~~~~~ 77 (338) ++.+|+-. ....+++++|-+++|++|++......++.+.++ T Consensus 3 Ki~~~~~~------~~e~~~~~~~~~~~~~ev~~~~~~~~ee~i~~~ 43 (332) T PRK08605 3 KIKLMSVR------DEDAPYIKAWAEKHHVEVDLTKEALTDDNVEEV 43 (332) T ss_pred EEEEEECC------HHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHH T ss_conf 69998276------656999999888659079996699998999985 No 209 >PRK09934 putative fimbrial protein; Provisional Probab=20.40 E-value=50 Score=12.51 Aligned_cols=13 Identities=8% Similarity=0.171 Sum_probs=7.1 Q ss_pred CCHHHHHHHHHHH Q ss_conf 9027999999999 Q gi|254780561|r 1 MKKFARIVVGIMM 13 (338) Q Consensus 1 MKk~~~~~~~~~~ 13 (338) |||++..+++.++ T Consensus 1 mkk~~l~~~~~l~ 13 (171) T PRK09934 1 MRRVFIAIFCGLL 13 (171) T ss_pred CHHHHHHHHHHHH T ss_conf 9268999999999 No 210 >PRK09810 entericidin A; Provisional Probab=20.04 E-value=51 Score=12.46 Aligned_cols=19 Identities=21% Similarity=0.644 Sum_probs=10.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9027999999999999998614 Q gi|254780561|r 1 MKKFARIVVGIMMITGVISYCT 22 (338) Q Consensus 1 MKk~~~~~~~~~~~~~~~~~~~ 22 (338) |||+ ++.+++...++++|- T Consensus 2 mkrl---i~lil~~~~ll~gcn 20 (41) T PRK09810 2 MKRL---IVLVLLASTLLTGCN 20 (41) T ss_pred HHHH---HHHHHHHHHHHHCCC T ss_conf 3899---999999999982562 Done!