Query gi|254780342|ref|YP_003064755.1| substrate-binding region of ABC-type glycine betaine transport system [Candidatus Liberibacter asiaticus str. psy62] Match_columns 309 No_of_seqs 121 out of 1546 Neff 8.5 Searched_HMMs 39220 Date Sun May 29 15:45:09 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780342.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR03414 ABC_choline_bnd chol 100.0 0 0 467.7 27.5 288 19-307 1-290 (290) 2 PRK11119 proX glycine betaine 100.0 0 0 396.4 24.0 288 2-291 4-327 (332) 3 COG2113 ProX ABC-type proline/ 100.0 0 0 394.7 25.2 275 20-296 25-302 (302) 4 pfam04069 OpuAC Substrate bind 100.0 0 0 365.9 22.5 250 26-279 1-256 (256) 5 COG1732 OpuBC Periplasmic glyc 100.0 2.8E-45 0 314.3 23.5 248 22-280 29-300 (300) 6 PRK11553 alkanesulfonate trans 99.7 2.1E-15 5.2E-20 120.1 17.5 256 1-278 1-267 (314) 7 PRK11480 tauA taurine transpor 99.5 8.9E-13 2.3E-17 102.9 16.2 245 7-275 7-266 (320) 8 COG4521 TauA ABC-type taurine 99.2 1.7E-10 4.4E-15 88.0 10.3 272 1-292 4-296 (334) 9 COG0715 TauA ABC-type nitrate/ 99.1 2.9E-08 7.4E-13 73.6 17.3 207 24-248 29-243 (335) 10 pfam09084 NMT1 NMT1/THI5 like. 99.0 5.1E-08 1.3E-12 72.0 14.0 179 46-245 13-198 (216) 11 TIGR03431 PhnD phosphonate ABC 98.9 1.9E-07 4.7E-12 68.3 16.5 220 1-247 2-244 (288) 12 TIGR03427 ABC_peri_uca ABC tra 98.8 4.6E-07 1.2E-11 65.8 15.3 201 25-247 1-209 (328) 13 TIGR01729 taurine_ABC_bnd taur 98.8 2E-08 5.1E-13 74.6 7.0 221 49-291 24-269 (304) 14 TIGR01098 3A0109s03R phosphona 98.7 4.1E-07 1E-11 66.1 12.5 229 1-248 1-295 (299) 15 PRK09861 cytoplasmic membrane 98.7 2E-06 5E-11 61.6 14.0 225 6-258 12-259 (272) 16 PRK11063 metQ DL-methionine tr 98.6 2.5E-06 6.3E-11 61.0 14.0 231 1-259 5-259 (271) 17 PRK10677 modA molybdate transp 98.4 0.00011 2.8E-09 50.2 16.4 230 1-247 1-239 (257) 18 PRK10797 glutamate and asparta 98.3 7.8E-05 2E-09 51.3 14.9 214 1-247 3-251 (302) 19 COG3221 PhnD ABC-type phosphat 98.3 8.9E-05 2.3E-09 50.9 14.5 249 1-266 2-275 (299) 20 PRK09495 glnH glutamine ABC tr 98.3 1.9E-05 4.8E-10 55.2 10.8 173 1-202 1-183 (247) 21 COG2358 Imp TRAP-type uncharac 98.3 5.4E-06 1.4E-10 58.8 7.9 202 1-215 1-216 (321) 22 COG1464 NlpA ABC-type metal io 98.2 1.8E-05 4.6E-10 55.4 10.2 228 1-262 1-260 (268) 23 TIGR01728 SsuA_fam ABC transpo 98.2 4.4E-05 1.1E-09 52.9 12.0 234 28-280 1-257 (311) 24 TIGR03227 PhnS 2-aminoethylpho 98.2 0.00054 1.4E-08 45.8 17.2 284 2-298 13-360 (367) 25 COG0725 ModA ABC-type molybdat 98.1 0.00064 1.6E-08 45.3 16.5 186 22-219 26-214 (258) 26 COG4143 TbpA ABC-type thiamine 98.1 0.00012 3.2E-09 49.9 12.8 240 1-257 1-282 (336) 27 PRK09501 potD spermidine/putre 98.1 0.00017 4.3E-09 49.1 13.3 256 1-278 1-305 (348) 28 COG1638 DctP TRAP-type C4-dica 98.0 0.0019 4.7E-08 42.3 16.0 277 1-307 1-330 (332) 29 PRK11917 bifunctional adhesin/ 97.8 0.00083 2.1E-08 44.6 11.7 191 25-248 37-242 (259) 30 TIGR03261 phnS2 putative 2-ami 97.7 0.0044 1.1E-07 39.8 15.8 237 4-259 2-280 (334) 31 COG2998 TupB ABC-type tungstat 97.6 0.0036 9.1E-08 40.4 13.1 232 18-267 22-266 (280) 32 PRK11260 cystine transporter s 97.6 0.0016 4E-08 42.7 11.3 211 1-247 5-239 (264) 33 PRK10682 putrescine transporte 97.6 0.0052 1.3E-07 39.4 13.9 76 2-81 6-84 (370) 34 TIGR01096 3A0103s03R lysine-ar 97.0 0.011 2.8E-07 37.3 9.8 261 2-277 1-333 (333) 35 PRK12679 cbl transcriptional r 96.8 0.034 8.7E-07 34.1 10.9 210 22-260 89-305 (316) 36 COG0687 PotD Spermidine/putres 96.8 0.051 1.3E-06 32.9 12.9 174 22-206 28-249 (363) 37 PRK11205 tbpA thiamine transpo 96.4 0.084 2.2E-06 31.5 12.7 172 22-204 15-220 (326) 38 pfam03180 Lipoprotein_9 NLPA l 96.4 0.023 5.9E-07 35.2 7.7 202 28-260 1-225 (236) 39 PRK12681 cysB transcriptional 96.3 0.1 2.6E-06 30.9 10.7 212 22-259 89-304 (324) 40 cd00134 PBPb Bacterial peripla 96.1 0.1 2.6E-06 31.0 10.0 173 39-247 23-199 (218) 41 PRK12680 transcriptional regul 96.1 0.12 3.1E-06 30.5 12.8 209 22-259 89-304 (327) 42 COG4134 ABC-type uncharacteriz 95.9 0.16 4E-06 29.8 9.9 246 24-277 32-349 (384) 43 TIGR02424 TF_pcaQ pca operon t 95.5 0.058 1.5E-06 32.5 6.6 188 21-234 88-281 (300) 44 pfam03466 LysR_substrate LysR 95.5 0.22 5.7E-06 28.7 12.4 196 23-248 3-204 (209) 45 COG1910 Periplasmic molybdate- 95.4 0.12 3E-06 30.5 7.9 175 53-249 16-198 (223) 46 PRK09959 hybrid sensory histid 95.4 0.24 6E-06 28.6 10.1 55 25-79 55-121 (1197) 47 PRK09791 putative DNA-binding 94.9 0.32 8.2E-06 27.7 12.4 198 23-248 92-293 (302) 48 PRK04168 hypothetical protein; 94.9 0.33 8.4E-06 27.6 13.8 201 2-217 6-247 (336) 49 TIGR01256 modA molybdate ABC t 94.6 0.39 1E-05 27.2 9.8 200 39-248 6-224 (225) 50 smart00062 PBPb Bacterial peri 94.5 0.4 1E-05 27.1 9.9 170 39-247 24-200 (219) 51 PRK11242 DNA-binding transcrip 94.4 0.42 1.1E-05 26.9 11.5 193 24-246 89-285 (292) 52 PRK12683 transcriptional regul 94.4 0.44 1.1E-05 26.8 9.2 207 23-258 90-303 (308) 53 PRK09986 DNA-binding transcrip 94.2 0.46 1.2E-05 26.7 12.1 197 22-249 77-277 (278) 54 CHL00180 rbcR LysR transcripti 93.8 0.56 1.4E-05 26.1 11.1 201 23-248 92-299 (307) 55 PRK12684 transcriptional regul 93.6 0.61 1.6E-05 25.9 10.1 207 22-257 89-302 (313) 56 PRK10341 DNA-binding transcrip 93.2 0.69 1.8E-05 25.6 12.1 198 23-248 94-294 (312) 57 COG2984 ABC-type uncharacteriz 92.6 0.84 2.1E-05 25.0 12.0 94 1-94 2-107 (322) 58 PRK09508 leuO leucine transcri 92.4 0.89 2.3E-05 24.9 8.2 61 23-83 109-169 (314) 59 PRK12682 transcriptional regul 92.0 1 2.5E-05 24.5 9.2 204 23-255 90-300 (309) 60 PRK01066 porphobilinogen deami 91.7 0.95 2.4E-05 24.7 6.8 63 23-85 16-99 (234) 61 PRK11622 putative ABC transpor 90.9 1.3 3.3E-05 23.8 17.2 250 1-257 2-333 (401) 62 pfam00497 SBP_bac_3 Bacterial 90.3 1.5 3.7E-05 23.4 10.2 175 40-247 24-203 (224) 63 PRK10837 putative DNA-binding 90.0 1.6 4E-05 23.3 10.5 195 23-247 86-285 (291) 64 PRK13731 conjugal transfer sur 89.8 0.68 1.7E-05 25.6 4.7 60 1-61 3-87 (243) 65 pfam01379 Porphobil_deam Porph 89.1 1.8 4.6E-05 22.8 6.7 58 27-84 1-79 (213) 66 PRK12450 foldase protein PrsA; 89.0 1.4 3.7E-05 23.5 5.9 62 1-62 4-81 (309) 67 PRK09906 DNA-binding transcrip 88.9 1.8 4.7E-05 22.8 6.6 194 24-248 88-287 (296) 68 pfam00496 SBP_bac_5 Bacterial 87.9 2.1 5.5E-05 22.4 8.7 25 57-81 148-173 (361) 69 TIGR00363 TIGR00363 lipoprotei 87.6 0.19 4.9E-06 29.2 0.7 219 23-266 20-259 (264) 70 PRK09755 putative binding prot 86.9 2.4 6.2E-05 22.0 7.5 28 23-50 30-69 (535) 71 PRK11151 DNA-binding transcrip 86.8 1.7 4.3E-05 23.0 5.1 195 24-248 89-289 (305) 72 COG3181 Uncharacterized protei 86.1 2.7 6.8E-05 21.7 18.8 188 1-198 1-203 (319) 73 PRK09474 malE maltose ABC tran 86.0 2.7 6.9E-05 21.7 12.7 79 1-80 1-85 (394) 74 PRK03095 prsA peptidylprolyl i 85.6 2.8 7.2E-05 21.6 5.8 56 1-58 1-69 (287) 75 PRK00072 hemC porphobilinogen 85.1 3 7.6E-05 21.4 7.1 58 26-83 3-81 (300) 76 PRK10355 xylF D-xylose transpo 84.8 3.1 7.9E-05 21.3 7.2 77 1-82 3-88 (330) 77 PRK03601 transcriptional regul 84.7 3.1 7.9E-05 21.3 6.5 65 24-88 87-151 (275) 78 PRK01742 tolB translocation pr 84.4 0.42 1.1E-05 27.0 1.1 23 185-207 198-220 (430) 79 PRK11482 putative DNA-binding 84.0 3.3 8.5E-05 21.1 7.2 64 20-85 60-123 (266) 80 pfam03480 SBP_bac_7 Bacterial 83.2 3.6 9.2E-05 20.9 18.6 221 38-289 10-283 (285) 81 pfam06543 Lac_bphage_repr Lact 82.8 0.34 8.7E-06 27.5 0.1 30 275-308 11-40 (49) 82 PRK09701 D-allose transporter 81.9 4 0.0001 20.6 6.6 96 1-98 1-107 (311) 83 PRK10653 D-ribose transporter 81.0 4.3 0.00011 20.4 7.3 79 1-81 3-88 (295) 84 pfam03401 Bug Tripartite trica 80.7 4.4 0.00011 20.3 11.1 233 42-289 2-270 (274) 85 PRK04405 prsA peptidylprolyl i 80.6 3.8 9.7E-05 20.7 4.8 60 1-60 5-78 (298) 86 cd00494 HMBS Hydroxymethylbila 80.5 4.5 0.00011 20.3 6.7 58 28-85 1-79 (292) 87 PRK10859 putative transglycosy 79.4 4.9 0.00012 20.0 13.2 158 25-204 42-209 (507) 88 PRK09408 ompX outer membrane p 79.2 1.9 5E-05 22.6 2.9 52 1-61 1-53 (171) 89 PRK11013 DNA-binding transcrip 78.8 5.1 0.00013 19.9 10.8 205 24-262 92-301 (309) 90 PRK01326 prsA foldase protein 78.8 5.1 0.00013 19.9 6.2 26 36-61 52-78 (310) 91 PRK02889 tolB translocation pr 77.5 2 5.2E-05 22.5 2.6 23 185-207 190-212 (430) 92 PRK05137 tolB translocation pr 77.5 0.99 2.5E-05 24.5 1.0 22 186-207 200-221 (437) 93 TIGR03525 GldK gliding motilit 76.6 3.7 9.5E-05 20.8 3.8 25 26-50 67-91 (449) 94 PRK04792 tolB translocation pr 75.7 5.3 0.00013 19.8 4.3 23 185-207 214-236 (450) 95 PRK11074 putative DNA-binding 74.5 6.6 0.00017 19.2 7.2 58 24-81 90-147 (300) 96 COG1613 Sbp ABC-type sulfate t 74.3 6.7 0.00017 19.1 14.2 248 27-281 39-312 (348) 97 PRK11716 DNA-binding transcrip 74.0 6.8 0.00017 19.1 7.7 196 23-248 64-266 (269) 98 PRK03629 tolB translocation pr 73.9 3.8 9.6E-05 20.8 3.2 53 1-53 1-62 (430) 99 PRK11143 glpQ glycerophosphodi 73.7 5.9 0.00015 19.5 4.2 46 4-49 8-57 (359) 100 TIGR03339 phn_lysR aminoethylp 73.7 6.9 0.00018 19.0 13.4 191 23-246 84-277 (279) 101 COG0181 HemC Porphobilinogen d 72.9 7.2 0.00018 18.9 7.2 57 26-82 2-79 (307) 102 pfam10566 Glyco_hydro_97 Glyco 72.6 3.7 9.3E-05 20.9 2.9 18 1-18 1-18 (643) 103 PRK00178 tolB translocation pr 72.1 5.6 0.00014 19.6 3.8 22 186-207 195-216 (433) 104 COG0018 ArgS Arginyl-tRNA synt 71.4 7.8 0.0002 18.7 4.4 57 21-81 125-183 (577) 105 TIGR00456 argS arginyl-tRNA sy 70.1 8 0.0002 18.6 4.2 48 20-71 125-174 (600) 106 PRK10752 sulfate transporter s 69.5 8.6 0.00022 18.4 11.5 237 37-281 28-295 (329) 107 cd00671 ArgRS_core This is the 68.8 8.9 0.00023 18.3 4.5 51 20-74 7-59 (267) 108 KOG4164 consensus 68.3 2.1 5.4E-05 22.4 0.9 80 202-284 318-415 (497) 109 PRK10877 thiol:disulfide inter 67.7 5.3 0.00014 19.8 2.9 61 1-82 1-62 (232) 110 COG2182 MalE Maltose-binding p 65.8 10 0.00026 18.0 8.3 57 25-82 40-101 (420) 111 PRK06934 flavodoxin; Provision 65.5 10 0.00026 17.9 5.6 35 5-39 15-49 (221) 112 PRK09934 putative fimbrial pro 65.4 7.1 0.00018 19.0 3.1 29 1-29 1-30 (171) 113 PRK10216 DNA-binding transcrip 65.1 10 0.00027 17.9 8.1 58 24-82 64-121 (288) 114 PRK10918 phosphate transporter 64.9 11 0.00027 17.8 7.0 77 3-82 5-81 (346) 115 TIGR00212 hemC porphobilinogen 64.0 11 0.00028 17.7 4.6 54 28-82 1-78 (312) 116 pfam06316 Ail_Lom Enterobacter 63.9 8 0.0002 18.6 3.2 54 1-61 1-57 (199) 117 TIGR02995 ectoine_ehuB ectoine 62.8 9.2 0.00023 18.2 3.3 146 25-195 33-191 (285) 118 COG3019 Predicted metal-bindin 61.8 12 0.00031 17.5 4.6 57 1-63 1-59 (149) 119 COG4588 AcfC Accessory coloniz 61.2 12 0.00031 17.4 12.5 201 28-249 23-237 (252) 120 pfam00750 tRNA-synt_1d tRNA sy 61.2 12 0.00031 17.4 4.6 48 20-71 26-75 (345) 121 TIGR03042 PS_II_psbQ_bact phot 59.9 11 0.00028 17.8 3.3 46 1-50 1-46 (142) 122 PRK03757 hypothetical protein; 59.4 7.7 0.0002 18.8 2.4 22 1-22 1-22 (191) 123 PRK11528 hypothetical protein; 59.1 4.2 0.00011 20.4 1.0 38 1-38 1-41 (254) 124 pfam06572 DUF1131 Protein of u 58.3 5.5 0.00014 19.7 1.5 78 1-78 1-95 (192) 125 TIGR02478 6PF1K_euk 6-phosphof 57.3 7.6 0.00019 18.8 2.1 47 26-72 243-298 (777) 126 TIGR01328 met_gam_lyase methio 56.2 6.9 0.00018 19.1 1.7 191 101-302 38-300 (392) 127 PRK09765 2-O-a-mannosyl-D-glyc 56.2 9.7 0.00025 18.1 2.5 39 23-61 160-201 (638) 128 PRK10959 outer membrane protei 55.6 9.9 0.00025 18.0 2.5 19 1-19 1-19 (212) 129 TIGR01323 nitrile_alph nitrile 55.5 8 0.0002 18.6 2.0 21 215-235 157-177 (186) 130 PRK10808 outer membrane protei 54.8 16 0.0004 16.7 3.4 31 1-31 1-31 (347) 131 PRK08475 F0F1 ATP synthase sub 54.7 13 0.00034 17.2 3.0 24 1-24 1-24 (170) 132 COG0834 HisJ ABC-type amino ac 54.5 16 0.0004 16.7 10.4 136 44-204 65-202 (275) 133 PRK00031 lolA outer-membrane l 54.2 9.9 0.00025 18.0 2.3 19 1-19 1-19 (201) 134 PRK10598 hypothetical protein; 53.5 16 0.00042 16.6 4.2 40 1-48 1-40 (186) 135 PRK13697 cytochrome c6; Provis 53.2 4.7 0.00012 20.1 0.5 43 1-43 1-43 (111) 136 PRK13680 hypothetical protein; 52.9 17 0.00043 16.5 4.9 42 11-52 16-58 (121) 137 PRK01611 argS arginyl-tRNA syn 51.9 17 0.00044 16.4 4.2 49 21-73 126-176 (570) 138 pfam06510 DUF1102 Protein of u 51.0 14 0.00036 17.0 2.6 22 1-22 1-22 (183) 139 PRK09733 putative fimbrial pro 51.0 18 0.00045 16.4 3.1 30 1-31 2-31 (181) 140 pfam07901 DUF1672 Protein of u 50.9 13 0.00033 17.2 2.5 82 1-82 1-110 (304) 141 pfam11777 DUF3316 Protein of u 50.8 18 0.00046 16.3 4.2 32 1-32 1-35 (114) 142 PRK02710 plastocyanin; Provisi 50.7 18 0.00046 16.3 3.4 30 3-32 6-35 (119) 143 PRK10159 outer membrane phosph 49.7 18 0.00046 16.3 3.0 31 1-31 1-35 (351) 144 PRK10718 hypothetical protein; 49.7 6.2 0.00016 19.3 0.7 54 25-78 36-95 (191) 145 pfam11348 DUF3150 Protein of u 49.5 19 0.00048 16.2 3.2 13 111-123 73-85 (257) 146 pfam02379 consensus 49.5 9.5 0.00024 18.1 1.6 32 30-61 3-37 (103) 147 TIGR00829 FRU PTS system, Fru 49.3 8.1 0.00021 18.6 1.2 45 30-74 1-48 (85) 148 TIGR00787 dctP TRAP transporte 48.9 19 0.00049 16.1 8.8 193 36-247 9-235 (267) 149 PRK10427 putative fructose-lik 48.7 9.4 0.00024 18.2 1.4 34 29-62 5-41 (114) 150 PRK10002 outer membrane protei 48.5 20 0.0005 16.1 3.1 32 1-32 2-37 (362) 151 PRK09967 putative outer membra 47.7 20 0.00052 16.0 3.2 22 1-22 1-22 (160) 152 PRK10852 thiosulfate transport 47.3 20 0.00052 16.0 12.8 163 38-203 33-222 (338) 153 pfam09403 FadA Adhesion protei 47.1 12 0.00031 17.5 1.8 19 1-19 1-19 (126) 154 TIGR03502 lipase_Pla1_cef extr 46.9 19 0.00048 16.2 2.7 21 1-21 1-21 (792) 155 PRK12451 arginyl-tRNA syntheta 46.8 21 0.00053 15.9 4.4 47 21-71 121-169 (562) 156 pfam12262 Lipase_bact_N Bacter 46.5 18 0.00046 16.3 2.6 22 1-22 1-22 (269) 157 PRK09810 entericidin A; Provis 46.4 9.7 0.00025 18.1 1.2 29 1-29 2-30 (41) 158 PRK10954 periplasmic protein d 46.4 19 0.00047 16.3 2.7 30 1-30 1-30 (207) 159 PRK11404 putative PTS transpor 46.4 11 0.00029 17.6 1.6 32 30-61 7-41 (482) 160 PRK10449 heat-inducible protei 46.1 13 0.00033 17.3 1.8 20 1-21 1-20 (140) 161 cd05569 PTS_IIB_fructose PTS_I 46.1 12 0.0003 17.5 1.6 32 30-61 2-36 (96) 162 PRK04043 tolB translocation pr 45.8 22 0.00055 15.8 4.5 52 1-52 1-57 (419) 163 PRK10780 periplasmic chaperone 44.9 18 0.00045 16.4 2.4 29 1-35 1-29 (161) 164 PRK03002 prsA peptidylprolyl i 44.7 22 0.00057 15.7 5.7 13 206-218 216-228 (285) 165 pfam07273 DUF1439 Protein of u 43.7 23 0.00059 15.6 3.6 45 1-54 1-46 (177) 166 PRK09465 tolC outer membrane c 43.4 17 0.00044 16.5 2.1 21 1-21 1-21 (446) 167 PRK11233 nitrogen assimilation 42.4 24 0.00062 15.5 13.6 193 23-248 88-287 (305) 168 pfam07882 Fucose_iso_N2 L-fuco 42.3 24 0.00062 15.5 3.6 15 48-62 22-36 (181) 169 PRK02399 hypothetical protein; 42.2 24 0.00062 15.5 8.9 59 26-87 186-247 (407) 170 PRK13883 conjugal transfer pro 41.3 25 0.00064 15.4 4.5 47 1-50 1-47 (156) 171 PRK00040 rpsP 30S ribosomal pr 41.0 17 0.00044 16.5 1.8 27 277-308 48-74 (75) 172 PRK05697 flagellar basal body- 40.0 22 0.00056 15.8 2.2 25 1-25 1-25 (135) 173 TIGR02414 pepN_proteo aminopep 39.7 27 0.00068 15.2 3.6 71 237-309 720-800 (898) 174 pfam04076 BOF Bacterial OB fol 39.0 18 0.00046 16.3 1.7 56 1-59 1-62 (126) 175 COG1653 UgpB ABC-type sugar tr 38.9 27 0.0007 15.1 7.4 57 26-82 31-94 (433) 176 pfam03032 Brevenin Brevenin/es 38.6 20 0.00051 16.0 1.8 20 1-20 3-22 (46) 177 PRK09915 putative outer membra 38.4 28 0.00071 15.1 2.6 54 1-54 6-82 (488) 178 PRK13861 type IV secretion sys 38.2 22 0.00056 15.8 2.0 24 52-76 50-73 (293) 179 cd07373 2A5CPDO_A The alpha su 38.1 28 0.00072 15.1 2.7 22 183-204 158-181 (271) 180 PRK09818 putative kinase inhib 37.7 29 0.00073 15.0 2.6 74 1-85 1-74 (183) 181 KOG0404 consensus 37.6 29 0.00073 15.0 4.5 18 183-200 199-216 (322) 182 pfam09070 PFU PFU (PLAA family 37.4 29 0.00074 15.0 3.0 18 264-281 75-92 (115) 183 pfam11940 DUF3458 Domain of un 37.2 29 0.00074 15.0 4.8 71 238-308 231-308 (408) 184 pfam05170 AsmA AsmA family. Th 37.1 29 0.00075 15.0 4.0 50 1-60 1-50 (537) 185 PRK11062 nhaR transcriptional 37.0 29 0.00075 15.0 6.4 197 23-247 90-288 (296) 186 COG1445 FrwB Phosphotransferas 36.8 24 0.00062 15.5 2.0 32 30-61 5-39 (122) 187 cd00532 MGS-like MGS-like doma 36.6 30 0.00076 14.9 4.7 41 45-86 34-79 (112) 188 COG5510 Predicted small secret 36.5 24 0.00062 15.5 2.0 31 1-31 2-35 (44) 189 PRK11627 hypothetical protein; 36.4 30 0.00076 14.9 3.5 28 1-32 2-29 (192) 190 PRK10064 colicin I receptor; P 36.2 29 0.00075 15.0 2.4 77 6-82 11-105 (663) 191 pfam07437 YfaZ YfaZ precursor. 35.5 26 0.00066 15.3 2.0 19 1-19 1-19 (180) 192 COG2143 Thioredoxin-related pr 35.4 27 0.0007 15.1 2.1 22 1-22 1-22 (182) 193 TIGR01283 nifE nitrogenase MoF 35.3 31 0.0008 14.8 2.9 48 171-220 238-289 (470) 194 TIGR00389 glyS_dimeric glycyl- 35.1 31 0.0008 14.8 2.4 37 229-267 400-436 (606) 195 TIGR02052 MerP mercuric transp 34.9 32 0.00081 14.7 3.5 33 1-36 1-33 (92) 196 PRK10290 superoxide dismutase; 34.7 32 0.00081 14.7 2.4 21 1-21 1-21 (173) 197 pfam08085 Entericidin Enterici 34.5 30 0.00076 14.9 2.2 15 1-15 1-15 (42) 198 PHA01735 hypothetical protein 34.5 32 0.00082 14.7 3.2 26 256-281 19-46 (76) 199 PRK04922 tolB translocation pr 34.4 32 0.00082 14.7 3.2 22 186-207 199-220 (439) 200 pfam07424 TrbM TrbM. This fami 34.1 29 0.00075 15.0 2.1 18 1-18 1-18 (189) 201 PRK10712 fructose-specific PTS 33.8 33 0.00084 14.6 2.5 33 29-61 106-141 (563) 202 PRK12698 flgH flagellar basal 33.2 27 0.00068 15.2 1.8 16 1-16 1-16 (224) 203 TIGR02953 penta_MxKDx pentapep 33.1 30 0.00076 14.9 2.0 22 2-23 1-25 (89) 204 COG3470 Tpd Uncharacterized pr 33.1 32 0.00082 14.7 2.2 66 1-82 3-71 (179) 205 COG0228 RpsP Ribosomal protein 33.1 27 0.00068 15.2 1.8 16 265-280 62-77 (87) 206 pfam06792 UPF0261 Uncharacteri 32.7 34 0.00088 14.5 8.8 60 25-87 184-247 (403) 207 PRK09529 bifunctional acetyl-C 32.6 34 0.00088 14.5 3.3 71 36-107 278-358 (713) 208 PRK10554 outer membrane porin 32.4 35 0.00089 14.5 3.0 31 1-31 1-35 (373) 209 CHL00005 rps16 ribosomal prote 31.6 26 0.00065 15.4 1.5 24 280-308 49-72 (82) 210 PRK09558 ushA bifunctional UDP 31.5 36 0.00092 14.4 3.7 34 1-34 1-41 (551) 211 pfam03888 MucB_RseB MucB/RseB 30.7 37 0.00095 14.3 2.3 48 1-61 1-48 (314) 212 PRK11372 lysozyme inhibitor; P 30.6 29 0.00074 15.0 1.6 18 1-18 3-20 (109) 213 pfam01527 Transposase_8 Transp 30.3 38 0.00096 14.3 4.5 42 249-291 6-47 (75) 214 PRK10747 putative protoheme IX 28.8 40 0.001 14.1 4.1 36 1-36 1-36 (398) 215 PRK10833 putative assembly pro 28.7 40 0.001 14.1 4.7 50 1-60 1-50 (617) 216 PRK11548 hypothetical protein; 28.6 40 0.001 14.1 3.5 39 1-39 3-41 (113) 217 COG5567 Predicted small peripl 28.5 40 0.001 14.1 2.5 20 1-20 1-20 (58) 218 pfam11853 DUF3373 Protein of u 28.4 40 0.001 14.1 2.4 20 1-20 1-20 (485) 219 COG4851 CamS Protein involved 28.4 41 0.001 14.0 2.2 20 1-20 1-20 (382) 220 pfam08557 Lipid_DES Sphingolip 28.4 32 0.00082 14.7 1.5 25 222-246 10-34 (39) 221 TIGR00838 argH argininosuccina 27.0 22 0.00057 15.7 0.5 47 229-275 315-361 (469) 222 COG1840 AfuA ABC-type Fe3+ tra 26.4 44 0.0011 13.8 9.5 124 125-256 104-240 (299) 223 pfam08984 DUF1858 Domain of un 26.3 44 0.0011 13.8 3.2 31 233-263 9-42 (59) 224 PRK13616 lipoprotein LpqB; Pro 25.9 45 0.0011 13.8 2.7 32 2-33 6-37 (590) 225 cd00569 HTH_Hin_like Helix-tur 25.9 45 0.0011 13.8 4.1 27 249-276 5-31 (42) 226 pfam04959 ARS2 Arsenite-resist 25.8 45 0.0011 13.8 3.2 27 265-291 54-82 (211) 227 pfam08139 LPAM_1 Prokaryotic m 25.2 46 0.0012 13.7 2.1 13 1-13 1-13 (26) 228 TIGR02791 VirB5 P-type DNA tra 25.2 46 0.0012 13.7 2.7 28 239-266 174-203 (233) 229 cd05805 MPG1_transferase GTP-m 25.2 46 0.0012 13.7 15.1 58 24-82 33-91 (441) 230 PRK13916 plasmid segregation p 25.1 46 0.0012 13.7 3.8 35 235-269 14-51 (97) 231 COG5455 Predicted integral mem 25.1 46 0.0012 13.7 2.2 20 1-20 2-21 (129) 232 PRK11443 hypothetical protein; 25.0 45 0.0012 13.7 1.8 16 1-16 1-16 (120) 233 TIGR02294 nickel_nikA nickel A 24.5 47 0.0012 13.6 5.4 25 57-81 200-225 (513) 234 pfam03562 MltA MltA specific i 24.5 48 0.0012 13.6 2.1 18 272-289 87-104 (113) 235 PRK07369 dihydroorotase; Provi 23.8 46 0.0012 13.7 1.6 36 250-286 329-366 (419) 236 cd05803 PGM_like4 This PGM-lik 23.7 49 0.0013 13.5 12.7 54 24-78 36-90 (445) 237 PRK06760 hypothetical protein; 23.6 49 0.0013 13.5 5.3 57 1-62 1-66 (223) 238 KOG2892 consensus 23.6 49 0.0013 13.5 6.1 60 23-82 3-83 (320) 239 COG3417 FlgN Collagen-binding 23.5 50 0.0013 13.5 2.4 23 1-23 3-25 (200) 240 pfam07219 HemY_N HemY protein 23.2 50 0.0013 13.4 3.0 36 1-36 1-37 (134) 241 pfam09920 DUF2150 Uncharacteri 23.0 27 0.00068 15.2 0.3 37 255-294 141-177 (190) 242 PRK10894 hypothetical protein; 22.7 52 0.0013 13.4 3.0 34 1-34 7-42 (184) 243 TIGR03061 pip_yhgE_Nterm YhgE/ 21.9 53 0.0014 13.3 6.8 56 24-82 40-103 (164) 244 COG3583 Uncharacterized protei 21.6 54 0.0014 13.2 3.3 55 5-60 17-72 (309) 245 pfam09862 DUF2089 Protein of u 21.6 54 0.0014 13.2 2.3 33 44-77 69-104 (113) 246 TIGR03519 Bac_Flav_fam_1 Bacte 21.2 55 0.0014 13.2 2.0 17 1-17 1-17 (292) 247 pfam06291 Lambda_Bor Bor prote 21.2 55 0.0014 13.2 1.8 12 1-12 1-12 (97) 248 cd02017 TPP_E1_EcPDC_like Thia 21.2 55 0.0014 13.2 4.7 98 178-281 199-304 (386) 249 PRK11186 carboxy-terminal prot 20.7 57 0.0014 13.1 4.0 19 41-60 300-318 (673) 250 COG3218 ABC-type uncharacteriz 20.5 22 0.00057 15.8 -0.6 32 5-36 13-44 (205) 251 pfam03207 OspD Borrelia outer 20.4 57 0.0015 13.1 2.5 22 1-22 1-22 (264) No 1 >TIGR03414 ABC_choline_bnd choline ABC transporter, periplasmic binding protein. Partial phylogenetic profiling (PMID:16930487) vs. the genome property of glycine betaine biosynthesis from choline consistently reveals a member of this ABC transporter periplasmic binding protein as the best match, save for the betaine biosynthesis enzymes themselves. Genomes often carry several paralogs, one encoded together with the permease and ATP-binding components and another encoded next to a choline-sulfatase gene, suggesting that different members of this protein family interact with shared components and give some flexibility in substrate. Of two members from Sinorhizobium meliloti 1021, one designated ChoX has been shown experimentally to bind choline (though not various related compounds such as betaine) and to be required for about 60 % of choline uptake. Members of this protein have an invariant Cys residue near the N-terminus and likely are lipoproteins. Probab=100.00 E-value=0 Score=467.75 Aligned_cols=288 Identities=47% Similarity=0.889 Sum_probs=276.3 Q ss_pred CCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCC Q ss_conf 35834588379942588699999999999996106984899856405788885149813999840464135666655304 Q gi|254780342|r 19 ARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEG 98 (309) Q Consensus 19 ~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~ 98 (309) ++.|.+|+||+||..+|+++.++++|+++|||+ +||+|+++.++++++|++|++||||+++|.|.|+++..+.++.+++ T Consensus 1 aa~~a~~~~V~i~~~~W~s~~~~t~v~~~iLE~-~GY~Ve~~~~~~~~~~~~la~GdiDv~~e~W~p~~~~~~~~~~~~g 79 (290) T TIGR03414 1 AAEPASCKTVRFADVGWTDITATTALASVLLEG-LGYQPKVTLLSVPVTYAGLKDGDLDVFLGNWMPAMEPDIKPYLESG 79 (290) T ss_pred CCCHHCCCCEEEEECCCCHHHHHHHHHHHHHHH-CCCCEEEEECCHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHCC T ss_conf 977100996699606864899999999999997-4996489967749999999769970787111478577899886369 Q ss_pred CEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCC Q ss_conf 70230135668535678634464123455356555566544201357854320055587664035452210010366520 Q gi|254780342|r 99 SIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELA 178 (309) Q Consensus 99 ~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~ 178 (309) .++.++.++.+.+.+|+||.|+++.+|++|+||.++.+.|+++++||++++++.....+.++.+.|++.+++++.+|+++ T Consensus 80 ~v~~lg~~~~~a~~g~~VP~Yv~d~~l~si~DL~~~~~~f~g~i~gi~~G~~~~~~~~~~i~~~~ygL~~~~l~~~S~aa 159 (290) T TIGR03414 80 SVEVLGPNLEGAKYTLAVPTYVADAGLKSFADIAKFKDKLDGKIYGIEPGNDGNRLIQKMIDKNAFGLGGFKLVESSEAG 159 (290) T ss_pred CEEECCCCCCCCEEEEEECHHHHHCCCCCHHHHHHCHHHCCCCEECCCCCCCHHHHHHHHHHHHCCCCCCCEECCCCHHH T ss_conf 68983566678637898541477659999999985867529976136799847789999987640798870213688899 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEECCCCCCCC--CCHHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHH Q ss_conf 189999999659936999412422320158514007865467--786583544277778748899999874089989999 Q gi|254780342|r 179 SFSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYLPGGEEISG--FGEASVYTVVRSDYLDKCPNISRLLKNIKFSVALEN 256 (309) Q Consensus 179 ~~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~Ledp~~~~~--~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~ 256 (309) |++++.+|+++++|+++|+|+|||++.+||+++|+||++.++ ++...+++++|++|++++|++++||++|+||+++++ T Consensus 160 m~a~l~~A~~~~epiv~~~W~Phw~~~~~dl~~L~~~~~~~g~~~~~~~v~~~~~~~f~~~~P~~~~~l~~~~~~~~~~~ 239 (290) T TIGR03414 160 MLAQVARAVKRKEWVVFLGWEPHPMNTNFKMTYLTGGDDYFGPNYGGATVYTNTRKGYAAECPNVGKLLTNLTFTLDMEN 239 (290) T ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHCCEEEECCCCCCCCCCCCHHHEEECCCHHHHHHCHHHHHHHHCCCCCHHHHH T ss_conf 99999999986999899823775455517706705874457899862206641470587879389999961589989999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 999999845999899999999985999998754754567713799999874 Q gi|254780342|r 257 EMMKLILNNKQDRQFVGRTMLRTHPDLLKNWLIGVTTFDGQDPSRQLERFM 307 (309) Q Consensus 257 ~l~~~v~~~g~~~~~vA~~wL~~n~~~~~~Wl~~~~~~~~~~~~~~~~~~~ 307 (309) +|++.++.+|++++++|++||++|+++|++||+|+||.||+|++++||.+| T Consensus 240 ~~~~~i~~~g~~~~~aA~~Wi~~n~d~~~~Wl~g~~~~~~~~~~~~~~~~~ 290 (290) T TIGR03414 240 QIMGAILNDGKDPEAAARQWLKANPEVLDPWLAGVTTVDGKDGLAAVKAAL 290 (290) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHCHHHHHHHHCCCCCCCCCCHHHHHHHCC T ss_conf 999999975999999999999979799999967860678872799998419 No 2 >PRK11119 proX glycine betaine transporter periplasmic subunit; Provisional Probab=100.00 E-value=0 Score=396.42 Aligned_cols=288 Identities=16% Similarity=0.205 Sum_probs=252.0 Q ss_pred HHHHHHHHHHHHHHHHH--CCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEE-EEECCHHHHHHHHHCCCCEE Q ss_conf 57999999999998630--3583458837994258869999999999999610698489-98564057888851498139 Q gi|254780342|r 2 YKILAVCLFLTTFSISY--ARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTN-IKLLAVPVTFRSLKNKGIDI 78 (309) Q Consensus 2 kk~~~~~~~~~~~~~~~--~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve-~~~~~~~~~~~al~~G~iD~ 78 (309) +|+++++.+++.++.+. +..|++++||+|+..+|++..++++|++++||+ +||+|+ ++..+.+++|++|++||||+ T Consensus 4 ~~~l~~~a~~~~~~~~a~a~~~pg~g~tV~~a~~~W~~~~a~t~v~~~~Le~-lGY~Ve~~~~~~~~~~~~ala~GdiD~ 82 (332) T PRK11119 4 HKVLLATALATLISTSAFAADLPGKGITVQPAQSTIAEETFQTLLVSRALEK-LGYDVNKPKEVDYNVFYTSIASGDATF 82 (332) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHH-CCCCCCCCEECCHHHHHHHHHCCCCEE T ss_conf 8889999999985227652558999977876016840899999999999998-699854201444688999985588448 Q ss_pred EEEEECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHC-CCCCHHHHHH--HHHHH------HHCCCCCCCCC Q ss_conf 998404641356666553047023013566853567863446412-3455356555--56654------42013578543 Q gi|254780342|r 79 FMGYWYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAVNDVGFAL-GIKSYQDIAK--YKKEL------GAKIYGIEPGN 149 (309) Q Consensus 79 ~~e~w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~-~i~~~~dl~~--~~~~~------~~~~~g~~~~~ 149 (309) +++.|+|.++..+.+...++.++..+.++.+...+|+|+++.++. +|++|++|++ +++.| +++++||+|+| T Consensus 83 ~~~~WlP~~~~~~~~~~~~~~~~~~g~~~~ga~~G~~V~~~~~d~~~I~si~dLkdp~~a~~Fd~~~~GKg~i~g~~pGw 162 (332) T PRK11119 83 TAVNWFPLHDDQYEAAGGDKKFYREGVIVGGAAQGYLIDKKTADKYNITNIAQLKDPKIAKLFDTNGDGKADLTGCNPGW 162 (332) T ss_pred EECCCCCCCHHHHHHHCCCCEEEECCCCCCCCCEEEEEECCCCCCCCCCCHHHHHCHHHHHHCCCCCCCCEEEEECCCCC T ss_conf 65561765179999732776476424335665136895023321148578998628678986076778850288148872 Q ss_pred CCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCC----CCEEECCCCCC------- Q ss_conf 2005558766403545221001036652018999999965993699941242232015----85140078654------- Q gi|254780342|r 150 EGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDL----NIHYLPGGEEI------- 218 (309) Q Consensus 150 ~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~----dl~~Ledp~~~------- 218 (309) +++......++.|+|.. .+++.++|+++|++++.+|+++++|++||+|+|||++.+| |+++|++|+.. T Consensus 163 g~~~~~~~~i~~ygL~~-~~~~~~gs~~am~A~~~~a~~~~epiv~~~W~PHW~~~~~~pg~Dv~~L~~P~~~~~~~~~~ 241 (332) T PRK11119 163 GCEAVINHQLKAYGLTD-TVTHNQGNYAALMADTIARYKEGKPVLYYTWTPYWVSDVLKPGKDVVWLQVPFSALPGDQKN 241 (332) T ss_pred CHHHHHHHHHHHCCCCC-EEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHCCCCCCCEEECCCCCCCCCCCCC T ss_conf 06689999998679965-05463564899999999999879998999967711342025655513512655557755456 Q ss_pred ----------CCCCHHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHH---HHHHHHHHHHHHH Q ss_conf ----------677865835442777787488999998740899899999999998459998999---9999998599999 Q gi|254780342|r 219 ----------SGFGEASVYTVVRSDYLDKCPNISRLLKNIKFSVALENEMMKLILNNKQDRQFV---GRTMLRTHPDLLK 285 (309) Q Consensus 219 ----------~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~~l~~~v~~~g~~~~~v---A~~wL~~n~~~~~ 285 (309) +++++.++++++|++|++++|++++||++|+||+++++++++.|...+++++++ |++||++|+++|+ T Consensus 242 ~~~~~~dp~~~g~~~~~i~~v~~k~f~e~~P~a~~~l~n~~~t~~d~~~~~~~i~~~~~~~~d~~~~A~~WI~~N~d~v~ 321 (332) T PRK11119 242 ADTKLPNGKNYGFPVSTMHIVANKKFAEKNPAAAKLFEIMKLPLADINAQNARMHDGENSEADIQRHVDGWIKAHQAQFD 321 (332) T ss_pred CCCCCCCCCCCCCCCCEEEEEEECHHHHHCCHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHH T ss_conf 43335774103786102775210035563948999999679999999999999981699846699999999997989999 Q ss_pred HHHCCC Q ss_conf 875475 Q gi|254780342|r 286 NWLIGV 291 (309) Q Consensus 286 ~Wl~~~ 291 (309) +||+.+ T Consensus 322 ~WL~~A 327 (332) T PRK11119 322 GWVKEA 327 (332) T ss_pred HHHHHH T ss_conf 999999 No 3 >COG2113 ProX ABC-type proline/glycine betaine transport systems, periplasmic components [Amino acid transport and metabolism] Probab=100.00 E-value=0 Score=394.71 Aligned_cols=275 Identities=34% Similarity=0.573 Sum_probs=261.2 Q ss_pred CCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCC Q ss_conf 58345883799425886999999999999961069848998564057888851498139998404641356666553047 Q gi|254780342|r 20 RDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGS 99 (309) Q Consensus 20 ~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~ 99 (309) +.+.+|++|+|+..+|++..++++|.+.+|| .+||+|+++++++.+.|++|++|++|+.+++|.|.++..+++..+++. T Consensus 25 ~~~~~~~tV~~a~~~W~s~~~~t~v~~~iLk-~~Gy~v~~~~~~~~~~~~sla~gd~D~~~~~W~p~~~~~~~~~~~~~~ 103 (302) T COG2113 25 AAAEAGKTVRIADVGWTSGTATTNVAKKILK-GLGYTVELVTLDTAVMYQSLAKGDLDVFPEAWLPTTPDDYKKAVKDKK 103 (302) T ss_pred CCCCCCCEEEEEECCCCHHHHHHHHHHHHHH-HCCCCCEEEECCHHHHHHHHHCCCCCCCCCEECCCCHHHHHHHHCCCC T ss_conf 1345588258843696178999999999999-679962466555999999997379753201026887588998742685 Q ss_pred EEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCH Q ss_conf 02301356685356786344641234553565555665442013578543200555876640354522100103665201 Q gi|254780342|r 100 IKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELAS 179 (309) Q Consensus 100 ~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~ 179 (309) ++..+.+..|..++|+||+|+++.++++++|+.+..+.+.++++||+|+|+|+....++++.|++++..++++.+++++| T Consensus 104 v~~~~~~~~Ga~~g~~vp~y~~d~~iksi~D~~~~~d~~~g~i~g~~pG~g~~~~~~~~~k~y~~~~~~~~lv~~s~~am 183 (302) T COG2113 104 VELGGTNLEGAKQGWAVPKYVADAGIKSIADLAKFKDKFGGKIYGIEPGWGCMRVIEDAIKAYGLGLKGFELVESSEAAM 183 (302) T ss_pred EEECCCCCCCCEEEEEECEEHHHCCCCCHHHHHHCCCCCCCCEECCCCCCCHHHHHHHHHHHCCCCCCCEEEECCCHHHH T ss_conf 44135466772478982400342355656569866687778578158997354899999864365536448844747889 Q ss_pred HHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEECCC---CCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHH Q ss_conf 899999996599369994124223201585140078---65467786583544277778748899999874089989999 Q gi|254780342|r 180 FSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYLPGG---EEISGFGEASVYTVVRSDYLDKCPNISRLLKNIKFSVALEN 256 (309) Q Consensus 180 ~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~Ledp---~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~ 256 (309) ++++.++.++++|++||+|+|||++.+|++++|+|| ++... ...++.+++|++|.+++|.++++|++++|+.++.+ T Consensus 184 l~~~~~a~~~~~piv~~~WsPh~m~~k~~l~~L~~~~~~~~~~~-g~~~v~t~~~kg~~~~~p~v~~~l~~~~~~~~~~n 262 (302) T COG2113 184 LAAAIRAAKRKEPIVFYGWSPHWMFGKYDLKYLEDPGPPKGANG-GWETVHTVVRKGFAEEAPNVAKLLANFSFPLAEEN 262 (302) T ss_pred HHHHHHHHHCCCCEEEEEECCHHHHHCCCEEEECCCCCCCCCCC-CCEEEEHHHHHHHHHHCCHHHHHHHHCCCCHHHHH T ss_conf 99999997458987999827626652365466047886567788-61046315624666759069999997678888999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9999998459998999999999859999987547545677 Q gi|254780342|r 257 EMMKLILNNKQDRQFVGRTMLRTHPDLLKNWLIGVTTFDG 296 (309) Q Consensus 257 ~l~~~v~~~g~~~~~vA~~wL~~n~~~~~~Wl~~~~~~~~ 296 (309) +++..++.++.+|+++|++||++|+++|+.|++++.+.|+ T Consensus 263 ~~~~~~~~~~~~~e~aA~~wl~~~~~~~~~Wv~~~~~~~~ 302 (302) T COG2113 263 ALMAAMDDGGATPEEAAKQWLKANPEIWDGWVSGVAAADG 302 (302) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHCHHHHHHHHCHHHHCCC T ss_conf 9989998389988999999999787988755263543149 No 4 >pfam04069 OpuAC Substrate binding domain of ABC-type glycine betaine transport system. Part of a high affinity multicomponent binding-protein-dependent transport system involved in bacterial osmoregulation. This domain is often fused to the permease component of the transporter complex. Family members are often integral membrane proteins or predicted to be attached to the membrane by a lipid anchor. Glycine betaine is involved in protection from high osmolarity environments for example in Bacillus subtilis. The family member OpuBC is closely related, and involved in choline transport. Choline is necessary for the biosynthesis of glycine betaine. L-carnitine is important for osmoregulation in Listeria monocytogenes. Family also contains proteins binding l-proline (ProX), histidine (HisX) and taurine (TauA). Probab=100.00 E-value=0 Score=365.89 Aligned_cols=250 Identities=21% Similarity=0.313 Sum_probs=230.0 Q ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECC-HHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCC-CEEEE Q ss_conf 83799425886999999999999961069848998564-05788885149813999840464135666655304-70230 Q gi|254780342|r 26 TPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLA-VPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEG-SIKLV 103 (309) Q Consensus 26 ~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~-~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~-~~~~~ 103 (309) +||+||+++|+|+.++++|++++||+ +||+|+++..+ +.++|++|++|+||+++|.|.|+.+..+.+.+.++ .+..+ T Consensus 1 ktI~ig~~~w~e~~i~a~i~~~~Le~-~G~~V~~~~~~~~~~~~~al~~G~iDi~~~~w~~~~~~~~~~~~~~~~~~~~~ 79 (256) T pfam04069 1 KTIVIGSKNWTEQEILANIAAQLLEA-LGYVVELVGLGSTAVLFAALASGDIDLYPEEWTGTTYEAYKKAVEEKLGLLVL 79 (256) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHH-CCCCEEEEECCCCHHHHHHHHCCCCCEEEHHCCCCHHHHHHHHHHCCCCEEEC T ss_conf 92999268862899999999999997-69816985478708999999769975863330662579999875146873861 Q ss_pred CCCCCCCEEEEEEEHHHHH-CCCCCHHHHHHHHH---HHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCH Q ss_conf 1356685356786344641-23455356555566---5442013578543200555876640354522100103665201 Q gi|254780342|r 104 AENLQGAKYMLAVNDVGFA-LGIKSYQDIAKYKK---ELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELAS 179 (309) Q Consensus 104 ~~~~~~~~~~~~v~~~~~~-~~i~~~~dl~~~~~---~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~ 179 (309) +.++.++..+|+||+|+++ ++|+||+||+++.. .+++++++|+++|+++......++.||| .++++..+|+++| T Consensus 80 ~~~~~~~~~g~~Vp~~~a~~~~i~si~dL~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~~yGl--~~~~~~~~s~~~~ 157 (256) T pfam04069 80 GPLGAGNTYGLAVPKYVAEKPGIKSISDLAKPADDELGFKGEFIGRPDGWGCTRSTEGLLKAYGL--DKYELVEGSEAAM 157 (256) T ss_pred CCCCCCCEEEEEECHHHHHHCCCCCHHHHHCCCCCCCCCCCCEECCCCCCCCCHHHHHHHHHCCC--CCEEECCCCHHHH T ss_conf 35777875899985888835699879997274210248886553278885400678999986397--6401235855545 Q ss_pred HHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHHHHH Q ss_conf 89999999659936999412422320158514007865467786583544277778748899999874089989999999 Q gi|254780342|r 180 FSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNIKFSVALENEMM 259 (309) Q Consensus 180 ~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~~l~ 259 (309) ++++++|+++|+|+++++|+|||++.+|++++|+||+++++ |.+++.+++|++|+++||+++++|++++||+++|++|| T Consensus 158 ~a~~~~A~~~g~~~v~~~w~p~~~~~~~dl~~L~Dp~~~~~-~~~~v~~v~~~~~~~~~P~~~~~l~~~~~t~~~~~~l~ 236 (256) T pfam04069 158 DALLYAAIKRGEPDVVYAWTPDWMIKKYDLVVLEDPKGLFP-PAYNVVPVVRKGFAEKHPEVAAFLNKLSLDTEDLNELN 236 (256) T ss_pred HHHHHHHHHCCCCEEEEECCCCHHHHHCCEEECCCCCCCCC-CCCEEEEEECHHHHHHCHHHHHHHHHCCCCHHHHHHHH T ss_conf 99999999849988999637616554279699368864579-76558752104677779899999986789999999999 Q ss_pred HHHHHCCCCHHHHHHHHHHH Q ss_conf 99984599989999999998 Q gi|254780342|r 260 KLILNNKQDRQFVGRTMLRT 279 (309) Q Consensus 260 ~~v~~~g~~~~~vA~~wL~~ 279 (309) ++|+++|++|++||++||++ T Consensus 237 ~~v~~~g~~~~~vA~~wl~~ 256 (256) T pfam04069 237 AQVDVEGKDPEEVAKDWLKE 256 (256) T ss_pred HHHHHCCCCHHHHHHHHHHC T ss_conf 99997699999999999859 No 5 >COG1732 OpuBC Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=2.8e-45 Score=314.33 Aligned_cols=248 Identities=17% Similarity=0.164 Sum_probs=211.7 Q ss_pred CCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEE--CCHHHHHHHHHCCCCEEEEEEECCCCHHHHHH------ Q ss_conf 3458837994258869999999999999610698489985--64057888851498139998404641356666------ Q gi|254780342|r 22 ADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKL--LAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAP------ 93 (309) Q Consensus 22 ~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~--~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~------ 93 (309) ..++++|+||++++||+.|++||++++||++ |++|+.+. |++.+++.||.+|+||+|+||+++......++ T Consensus 29 ~~~~~~I~VgsK~~tE~~IL~~m~~~lle~~-~~kv~~~~~lG~t~v~~~Al~~G~IDiYpEYTGt~~~~~lk~~~~~~~ 107 (300) T COG1732 29 ASAAKTIVVGSKIFTEQYILGNILKQLLEKN-GIKVEDKTGLGGTAVVRNALKSGDIDIYPEYTGTALFSFLKKDPPASK 107 (300) T ss_pred CCCCCCEEEECCCCCHHHHHHHHHHHHHHHC-CCCEEECCCCCCHHHHHHHHHCCCCCEEEEECCHHHHHHCCCCCCCCC T ss_conf 2357887983477838899999999999865-991454168885299999997598776854323145663456764547 Q ss_pred ------------HHHCCCEEEECCCCCCCEEEEEEEHHHHH-CCCCCHHHHHHHHHHHHHCCCCCCCCCC-CCHHHHHHH Q ss_conf ------------55304702301356685356786344641-2345535655556654420135785432-005558766 Q gi|254780342|r 94 ------------YLEEGSIKLVAENLQGAKYMLAVNDVGFA-LGIKSYQDIAKYKKELGAKIYGIEPGNE-GNQRILDMI 159 (309) Q Consensus 94 ------------~~~~~~~~~~~~~~~~~~~~~~v~~~~~~-~~i~~~~dl~~~~~~~~~~~~g~~~~~~-~~~~~~~~~ 159 (309) ..+...++++.+...++.+.++|++..++ .+|.+++||.++.+.+ ..|.+.+|- +.++.+... T Consensus 108 dp~~~y~~vK~~~~k~~~l~wl~p~~~nNTyA~av~~~~A~~~~i~TiSDLak~~~~l---~~g~~~eF~~R~DG~~~l~ 184 (300) T COG1732 108 DPKKVYETVKKLDAKQFKLVWLKPAGFNNTYALAVRKDVAEKYNLETISDLAKHSNQL---KLGADSEFAERADGLPALQ 184 (300) T ss_pred CHHHHHHHHHHHHHHCCCEEEECCCCCCCCEEEEECHHHHHHHCCCCHHHHHHHHHHC---EECCCHHHHCCCCCCHHHH T ss_conf 9899999999988860797896546877605899619889881981699999866534---6227842313651228999 Q ss_pred HHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCH Q ss_conf 40354522100103665201899999996599369994124223201585140078654677865835442777787488 Q gi|254780342|r 160 NNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCP 239 (309) Q Consensus 160 ~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P 239 (309) +.|++.+..- +..++.+ ..|+|+++|+.+++.+||||+.+..|+|++|+||+++| |+|++.|++|++++++|| T Consensus 185 k~Yg~~~~~~-~~~m~~g----l~y~Al~~g~~d~~~~YsTDg~I~~~~L~VLkDDK~~f--P~Y~~apvvre~vlk~~P 257 (300) T COG1732 185 KAYGFDFKPD-LRTMDGG----LTYQALKNGTVDAADAYSTDGRIAAYGLKVLKDDKGFF--PPYQAAPVVREEVLKKHP 257 (300) T ss_pred HHHCCCCCCC-CEECCCH----HHHHHHHCCCCCEEEECCCCCCCCCCCCEEEECCCCCC--CCCCCCCEECHHHHHHCH T ss_conf 9848766887-4433815----89999874997767631455221226857970687679--987656402187776798 Q ss_pred HHHHHHHHC--CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 999998740--89989999999999845999899999999985 Q gi|254780342|r 240 NISRLLKNI--KFSVALENEMMKLILNNKQDRQFVGRTMLRTH 280 (309) Q Consensus 240 ~~~~~L~~~--~ls~~~~~~l~~~v~~~g~~~~~vA~~wL~~n 280 (309) ++.++|+++ +|++++|++||++|+++|++|+.||++||++| T Consensus 258 el~~~l~~l~~kid~~tMq~LNy~V~v~g~~~~~VA~dfL~~~ 300 (300) T COG1732 258 ELKTILNKLSGKIDTETMQALNYRVDVEGKDPQKVAADFLKKK 300 (300) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHCC T ss_conf 8999998875458999999987776506999899999998509 No 6 >PRK11553 alkanesulfonate transporter substrate-binding subunit; Provisional Probab=99.71 E-value=2.1e-15 Score=120.07 Aligned_cols=256 Identities=14% Similarity=0.146 Sum_probs=172.2 Q ss_pred CHHHHHHHHHHH-HHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCC-CCEEEEEEC-CHHHHHHHHHCCCCE Q ss_conf 957999999999-99863035834588379942588699999999999996106-984899856-405788885149813 Q gi|254780342|r 1 MYKILAVCLFLT-TFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEIL-GYKTNIKLL-AVPVTFRSLKNKGID 77 (309) Q Consensus 1 mkk~~~~~~~~~-~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~l-Gy~Ve~~~~-~~~~~~~al~~G~iD 77 (309) |||++.+.+..+ +++++.++.....++||||+....-..++ ..-..++|+.+ |++|+.+.. +.+.+.+||.+|+|| T Consensus 1 ~~~~l~~~la~~~~~~~~~~~~~~~~~~lrIgyq~~~~~~~~-~~~~~~~ek~~~~~kV~w~~F~~G~~~~eAl~aG~iD 79 (314) T PRK11553 1 MRNIIKLALAGLLSVSTLAVAAESSPEALRIGYQKGSIGLVL-AKSHQLLEKRFPQTKISWVEFPAGPQMLEALNVGSID 79 (314) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHH-HHHCCHHHHHCCCCCEEEEECCCCHHHHHHHHCCCCC T ss_conf 977999999999999875444334897599996177415899-9852867661789835899789748999999769975 Q ss_pred EEEEEECCCCHHHHHHHHHCCCEEEECC-CCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHH Q ss_conf 9998404641356666553047023013-566853567863446412345535655556654420135785432005558 Q gi|254780342|r 78 IFMGYWYPSLEKFIAPYLEEGSIKLVAE-NLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRIL 156 (309) Q Consensus 78 ~~~e~w~p~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~ 156 (309) +..-.-.|.. +.. .....+..++. ...+....+.|++. .+|++++|| +++.++...+...+.... T Consensus 80 ~~~~g~~p~i---~a~-a~G~~~~~va~~~~~~~~~~i~V~~~---S~I~s~aDL-------kGKkVa~~~Gs~~~~~l~ 145 (314) T PRK11553 80 LGSTGDIPPI---FAQ-AAGADLVYVGVEPPKPKAEVILVAEN---SPIKTVADL-------KGHKVAFQKGSSSHNLLL 145 (314) T ss_pred EEEECCHHHH---HHH-HCCCCEEEEEEECCCCCCEEEEEECC---CCCCCHHHH-------CCCEEEEECCCHHHHHHH T ss_conf 4511585899---998-66998699998637886418998489---877888893-------899897417973799999 Q ss_pred HHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHH--CCCCEEECCCCCCCCCCHHHHHHHHCHHH Q ss_conf 7664035452210010366520189999999659936999412422320--15851400786546778658354427777 Q gi|254780342|r 157 DMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINS--DLNIHYLPGGEEISGFGEASVYTVVRSDY 234 (309) Q Consensus 157 ~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~--~~dl~~Ledp~~~~~~p~~~~~~vv~~~~ 234 (309) ..++..||.....+++....+ +...|+++|+++.+..|.|..... +.+.++|.|..+.. ....+.+++.+| T Consensus 146 ~aL~~aGL~~~DV~~v~l~p~----d~~aAl~~G~VDA~~~w~P~~~~a~~~~gaRvl~dg~~~~---~~~~~~~~~~~f 218 (314) T PRK11553 146 RALRQAGLKFTDIQPTYLTPA----DARAAFQQGNVDAWAIWDPYYSAALLQGGVRVLKDGTDLN---QTGSFYLAARPY 218 (314) T ss_pred HHHHHCCCCHHHEEEEECCCH----HHHHHHHCCCCCEEEECCHHHHHHHHCCCCEEEECCCCCC---CCCCEEEECHHH T ss_conf 999986998899189845938----8999996699788997567899998628958985376667---776168861898 Q ss_pred HHHCHHHH-HHHHHC----CCCHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 87488999-998740----899899999999998459998999999999 Q gi|254780342|r 235 LDKCPNIS-RLLKNI----KFSVALENEMMKLILNNKQDRQFVGRTMLR 278 (309) Q Consensus 235 ~~~~P~~~-~~L~~~----~ls~~~~~~l~~~v~~~g~~~~~vA~~wL~ 278 (309) .++||++. +|++.+ .|..+.-.+....+..+-.-|.++++.++. T Consensus 219 a~~~p~~v~~~l~~l~~a~~w~~~~p~eaa~i~A~~~gl~~~v~~~~l~ 267 (314) T PRK11553 219 AEKNGAFIQQVLATLSEADALTRSQRAQSIALLAKTMGLPAPVIASYLD 267 (314) T ss_pred HHHCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 8879999999999999999999979999999999987929999999985 No 7 >PRK11480 tauA taurine transporter substrate binding subunit; Provisional Probab=99.54 E-value=8.9e-13 Score=102.91 Aligned_cols=245 Identities=14% Similarity=0.069 Sum_probs=157.8 Q ss_pred HHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCH-HHHHHHHHCCCCEEEEEEECC Q ss_conf 9999999986303583458837994258869999999999999610698489985640-578888514981399984046 Q gi|254780342|r 7 VCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAV-PVTFRSLKNKGIDIFMGYWYP 85 (309) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~-~~~~~al~~G~iD~~~e~w~p 85 (309) ..++++++.++++ .+.++||||+..-.+....+. ..-.+|+.+||+|+.+..++ ..+.+||.+|+||+..-.-.| T Consensus 7 ~~~~~~~~~~~~~---a~~~~v~igYq~~~~p~~~ak-a~g~~ek~~G~kV~W~~F~sG~~~~eAlasG~vDig~~G~~P 82 (320) T PRK11480 7 NTLLAALAFIAFQ---AQAVNVTVAYQTSAEPAKVAQ-ADNTFAKESGATVDWRKFDSGASIVRALASGDVQIGNLGSSP 82 (320) T ss_pred HHHHHHHHHHHHH---CCCCEEEEEEECCCCCHHHHH-HCCCHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEECCCCCH T ss_conf 9999998631421---576459999806986289998-638456751981279977975999999966997564577847 Q ss_pred CCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCC Q ss_conf 41356666553047023013566853567863446412345535655556654420135785432005558766403545 Q gi|254780342|r 86 SLEKFIAPYLEEGSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFS 165 (309) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~ 165 (309) .....-.. .+-.++.+...+ +....++|+. +|.+++|| +++.++.+.+...+......++..|+. T Consensus 83 ~~~a~a~g--~~ik~v~i~~~~-~~~ealvv~~-----~I~s~~DL-------kGKkVav~~gS~~hy~ll~aL~~~Gl~ 147 (320) T PRK11480 83 LAVAASQQ--VPIEVFLLASKL-GNSEALVVKK-----TISKPEDL-------IGKRIAVPFISTTHYSLLAALKHWGIK 147 (320) T ss_pred HHHHHHCC--CCEEEEEEECCC-CCCEEEEECC-----CCCCHHHC-------CCCEEECCCCCCHHHHHHHHHHHCCCC T ss_conf 99998679--985999985478-9743899568-----99996884-------999896078873189999999985999 Q ss_pred CCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEECCCCCC--CCCCHHHHHHHHCHHHHHHCHHHHH Q ss_conf 22100103665201899999996599369994124223201585140078654--6778658354427777874889999 Q gi|254780342|r 166 LKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYLPGGEEI--SGFGEASVYTVVRSDYLDKCPNISR 243 (309) Q Consensus 166 ~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~Ledp~~~--~~~p~~~~~~vv~~~~~~~~P~~~~ 243 (309) ....+++....+ +...|+.+|+++..+.|.|.-.-.+.+.++|-|.... .+.|..+ ..++|++|.++||++.+ T Consensus 148 ~~dV~iv~m~p~----d~~aA~~~G~vDa~~~W~P~l~~l~~~G~vL~~s~~~~~~G~pt~d-~~vv~~~Fa~e~Pe~V~ 222 (320) T PRK11480 148 PGQVEIVNLQPP----AIIAAWQRGDIDGAYVWAPAVNALEKDGKVLTDSEQVGQWGAPTLD-VWVVRKDFAEKHPEVVK 222 (320) T ss_pred HHHEEEEECCCH----HHHHHHHCCCCCEEEECCHHHHHHHHCCCEEEECHHHHHCCCCCCC-EEEECHHHHHHCHHHHH T ss_conf 899078844928----8999997699787962677999998439788641664204897455-68863999988989999 Q ss_pred HHHHC--------CCCHHHH---HHHHHHH-HHCCCCHHHHHHH Q ss_conf 98740--------8998999---9999999-8459998999999 Q gi|254780342|r 244 LLKNI--------KFSVALE---NEMMKLI-LNNKQDRQFVGRT 275 (309) Q Consensus 244 ~L~~~--------~ls~~~~---~~l~~~v-~~~g~~~~~vA~~ 275 (309) -+-+. .=..+.+ .++...+ ..-|++++++... T Consensus 223 ~flkv~~~A~~~~~~npd~~~~~~~~~~~iAk~~G~~~~~~~~~ 266 (320) T PRK11480 223 AFAKSAIDAQQPYIANPDAWLKQPENISKLARLSGVPEGDVPGL 266 (320) T ss_pred HHHHHHHHHHHHHHHCHHHHHCCCHHHHHHHHHHCCCHHHHHHH T ss_conf 99999999999999596744048488999999959898899999 No 8 >COG4521 TauA ABC-type taurine transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=99.21 E-value=1.7e-10 Score=88.00 Aligned_cols=272 Identities=14% Similarity=0.107 Sum_probs=174.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECC-HHHHHHHHHCCCCEEE Q ss_conf 957999999999998630358345883799425886999999999999961069848998564-0578888514981399 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLA-VPVTFRSLKNKGIDIF 79 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~-~~~~~~al~~G~iD~~ 79 (309) |||+..+..++..+.+...+...+..+|++|+..--|....+.--.. .++..|++++....+ ...+..||++|++++- T Consensus 4 ~~r~~~l~~~~aa~a~~l~~~~Aqa~~vtVgYQt~~eP~kvaqADg~-~aK~~gatiDWRkFdSG~~vv~AlASGdvqiG 82 (334) T COG4521 4 LRRISSLSSLLAALAFVLLAFAAQAVDVTVGYQTSAEPAKVAQADGA-FAKESGATIDWRKFDSGASIVRALASGDVQIG 82 (334) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCEEEEEEEECCCCCCCCCCCCH-HHHHCCCCCCHHHCCCHHHHHHHHHCCCCCCC T ss_conf 77788888899999988767664223168865403575202134635-67760785451104753689999862886104 Q ss_pred EEEECCCCHHHHHHHHHCCCEE-EECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHH Q ss_conf 9840464135666655304702-301356685356786344641234553565555665442013578543200555876 Q gi|254780342|r 80 MGYWYPSLEKFIAPYLEEGSIK-LVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDM 158 (309) Q Consensus 80 ~e~w~p~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~ 158 (309) .-.-.|...+-.++. .++ .+-....|....+++++.. +|..-+||. ++.++.+--...+...... T Consensus 83 ~iGSsplaaAaSr~v----pie~f~~~~~ig~sEALVvr~gs---gI~kpeDL~-------GK~iavPFvSTtHysLLaa 148 (334) T COG4521 83 NIGSSPLAAAASRQV----PIEVFLLASQIGNSEALVVRKGS---GIEKPEDLI-------GKRIAVPFVSTTHYSLLAA 148 (334) T ss_pred CCCCCHHHHHHHCCC----CEEEEEHHHHCCCCCEEEEECCC---CCCCHHHHC-------CCEECCCEEEHHHHHHHHH T ss_conf 657856667763588----65885301212863103552377---869857864-------6721042430008999999 Q ss_pred HHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEECCCCCC--CCCCHHHHHHHHCHHHHH Q ss_conf 640354522100103665201899999996599369994124223201585140078654--677865835442777787 Q gi|254780342|r 159 INNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYLPGGEEI--SGFGEASVYTVVRSDYLD 236 (309) Q Consensus 159 ~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~Ledp~~~--~~~p~~~~~~vv~~~~~~ 236 (309) ++..+.+....++..... .++.+|.+||..+-.|+|.|----.+-..++|-|.... .+-|..+ ..++|++|++ T Consensus 149 Lkhw~idp~~V~IlNl~P----p~IaAAwqRGDIDgAyVW~PAl~el~ksGkVltDs~qvgqwgaPTfd-vwVvrkdfAe 223 (334) T COG4521 149 LKHWGIDPGQVEILNLQP----PAIAAAWQRGDIDGAYVWAPALSELKKSGKVLTDSEQVGQWGAPTFD-VWVVRKDFAE 223 (334) T ss_pred HHHCCCCCCCEEEECCCC----HHHHHHHHCCCCCCEEECCHHHHHHHHCCCEECCHHHHHCCCCCCEE-EEEEEHHHHH T ss_conf 987187855246861597----89999987078772254467688886258575268886214798513-6886317667 Q ss_pred HCHHHHHHHHHCCCCH-------H-----HHHHHHHHHHHCCCCHHHHHH-----HHHHHHHHHHHHHHCCCC Q ss_conf 4889999987408998-------9-----999999999845999899999-----999985999998754754 Q gi|254780342|r 237 KCPNISRLLKNIKFSV-------A-----LENEMMKLILNNKQDRQFVGR-----TMLRTHPDLLKNWLIGVT 292 (309) Q Consensus 237 ~~P~~~~~L~~~~ls~-------~-----~~~~l~~~v~~~g~~~~~vA~-----~wL~~n~~~~~~Wl~~~~ 292 (309) +||++..-+-++.++. . +..+........|.++++|-. .|+......-+.||-|-+ T Consensus 224 khPe~v~aFakv~~da~a~Y~anp~~W~~~s~~v~k~arL~G~~~e~vp~llkg~tfpt~~eQ~sd~~Lggg~ 296 (334) T COG4521 224 KHPEVVAAFAKVALDAQADYLANPDVWLADSEQVSKLARLSGVPEEDVPELLKGNTFPTPQEQTSDKELGGGV 296 (334) T ss_pred HCHHHHHHHHHHHHHHHHHHHCCHHHHCCCCHHHHHHHHHCCCCHHHHHHHHCCCCCCCHHHHHHHHHHCCHH T ss_conf 6969999999988876677742902320675544333541589845658986067779878863476634618 No 9 >COG0715 TauA ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components [Inorganic ion transport and metabolism] Probab=99.09 E-value=2.9e-08 Score=73.55 Aligned_cols=207 Identities=17% Similarity=0.208 Sum_probs=137.8 Q ss_pred CCCCEEEECCCCCHHH-HHHHHHHHHHHHCCCC-EEEEEEC-CHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCE Q ss_conf 5883799425886999-9999999999610698-4899856-40578888514981399984046413566665530470 Q gi|254780342|r 24 SCTPVRFADTGWTDIA-ATTAMTSVILEEILGY-KTNIKLL-AVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSI 100 (309) Q Consensus 24 ~~~~I~ig~~~wte~~-v~a~i~~~iLE~~lGy-~Ve~~~~-~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~ 100 (309) ..-+|+++...++... ..-...+-.+++. |+ +|+++.. +.....++|.+|.+|+..-+. +......... ....+ T Consensus 29 ~~~~~~~~~~~~~~~~~~~va~~kG~f~~~-Gl~~V~~~~~~~~~~~~~~l~~G~~D~a~~~~-~~~~~~~~~~-~g~pv 105 (335) T COG0715 29 EKVTILLGWLPNPDHAPLYVAKEKGFFKKE-GLDDVELVEFTGGAPVLEALAAGALDFAVYYT-GDTPLIAAGA-AGAPV 105 (335) T ss_pred CCEEEEEEEECCHHHHHHHHHHHCCHHHHC-CCCEEEEEECCCCHHHHHHHHCCCCCEEEECC-CCHHHHHHHC-CCCCE T ss_conf 706999743337028999999978917774-99758997079845799999769988874237-8559999635-69997 Q ss_pred EEECCCCCC-CEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCC-CCHHHHHHHHHCCCCCCCEEEECCCCCC Q ss_conf 230135668-53567863446412345535655556654420135785432-0055587664035452210010366520 Q gi|254780342|r 101 KLVAENLQG-AKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNE-GNQRILDMINNNKFSLKGFRLIEASELA 178 (309) Q Consensus 101 ~~~~~~~~~-~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~s~~~ 178 (309) ..++....+ ....+.+.+. ..+++++|| +++.+|...+.. ........+...||+....+++..... T Consensus 106 ~~va~~~~~~~~~~i~~~~~---~~i~~~adl-------kGk~vg~~~~~~~~~~~l~~~L~~~Gl~~~dv~~v~~~~~- 174 (335) T COG0715 106 KVVAALVQNGNGIALLVLKD---SGIKSVADL-------KGKKVGVPFGGSTSDFLLRYALAKAGLDPDDVELVNLPPA- 174 (335) T ss_pred EEEEEEEECCCCEEEEECCC---CCCCCHHHC-------CCCEEEEECCCCCHHHHHHHHHHHCCCCHHHCEEEECCHH- T ss_conf 99998743797329997388---776771333-------8998998189970799999999986999899468736929- Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHHHHCC--CCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHH-HHHHHC Q ss_conf 18999999965993699941242232015--85140078654677865835442777787488999-998740 Q gi|254780342|r 179 SFSQIRRDQRNNIPAVFLSWEPHPINSDL--NIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNIS-RLLKNI 248 (309) Q Consensus 179 ~~a~l~~A~~~~~~~v~~~wsp~~~~~~~--dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~-~~L~~~ 248 (309) +..+|+.+|+++.+..|.|....... ..+++-+..+... ..+....++++++.++||++. +|++.+ T Consensus 175 ---~~~~al~~g~vda~~~~ep~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~p~~~~~~v~a~ 243 (335) T COG0715 175 ---DAVAALAAGQVDAFVVWEPWNAAAEGEGGGRVLLDGADLWG-NHPELVLVVRKEFIEANPEAVKAFLKAL 243 (335) T ss_pred ---HHHHHHHCCCCCEEEECCHHHHHHHHCCCCEEEEECCCCCC-CCCEEEEEECHHHHHHCHHHHHHHHHHH T ss_conf ---99999975986889966679899731578579861566676-6742699955887766999999999999 No 10 >pfam09084 NMT1 NMT1/THI5 like. This family contains the NMT1 and THI5 proteins. These proteins are proposed to be required for the biosynthesis of the pyrimidine moiety of thiamine. They are regulated by thiamine. Probab=98.96 E-value=5.1e-08 Score=71.97 Aligned_cols=179 Identities=11% Similarity=0.066 Sum_probs=113.8 Q ss_pred HHHHHHCCCCEEEEEEC-CHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHCC Q ss_conf 99996106984899856-40578888514981399984046413566665530470230135668535678634464123 Q gi|254780342|r 46 SVILEEILGYKTNIKLL-AVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAVNDVGFALG 124 (309) Q Consensus 46 ~~iLE~~lGy~Ve~~~~-~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~ 124 (309) +-..++. |.+||++.. +.....++|.+|++|+..-...+ ..........++.++.........+.++. ..+ T Consensus 13 ~G~f~~~-GL~Ve~~~~~~~~~~~~al~sG~~D~a~~~~~~----~~~~~~~g~~~~~i~~~~~~~~~~~~v~~---~~~ 84 (216) T pfam09084 13 KGYFKEE-GLDVEIVEPADPSDAVQLVAAGKADFGVSYQPS----LLLARAKGLPVVSVAALIQHPPNGLISLK---DSG 84 (216) T ss_pred CCCHHHC-CCCEEEEECCCCHHHHHHHHCCCCCEEECCCHH----HHHHHHCCCEEEEEEECCCCCCEEEEEEC---CCC T ss_conf 7946884-980899966882789999976986778537199----99999789849999964358972899978---889 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHH Q ss_conf 45535655556654420135785432005558766403545221001036652018999999965993699941242232 Q gi|254780342|r 125 IKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPIN 204 (309) Q Consensus 125 i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~ 204 (309) +++++|| +++.++..++..........++.+++.....+++..+.. ....|+++|+++.+..|.|+|.. T Consensus 85 i~s~~DL-------kGk~i~~~~~~~~~~~l~~~L~~~Gl~~~dv~~v~~~~~----~~~~al~~G~vDa~~~~~~~~~~ 153 (216) T pfam09084 85 IKSPKDL-------KGKRIGYSGSPFEEALLKALLAKAGGDPSDVTLVNVGGT----SLSPALLTGKVDAAIGGYYNWEG 153 (216) T ss_pred CCCHHHH-------CCCEEEEECCCHHHHHHHHHHHHCCCCHHHEEEEECCHH----HHHHHHHCCCCCEEEECCCCHHH T ss_conf 8997895-------898899637972899999999985999899089967848----88889856995589962467579 Q ss_pred HC-----CCCEEE-CCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHH Q ss_conf 01-----585140-078654677865835442777787488999998 Q gi|254780342|r 205 SD-----LNIHYL-PGGEEISGFGEASVYTVVRSDYLDKCPNISRLL 245 (309) Q Consensus 205 ~~-----~dl~~L-edp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L 245 (309) -. .....+ ....+.+.++. ...++|++|+++||++.+-+ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~p~~~~~f 198 (216) T pfam09084 154 VELELEGVDLNIFALADYGVPDYYS--LVLITNDATLKKNPELVKAF 198 (216) T ss_pred HHHHHCCCCEEEEEHHHCCCCCCCC--EEEEECHHHHHHCHHHHHHH T ss_conf 9998568865897065438887662--29999899998898999999 No 11 >TIGR03431 PhnD phosphonate ABC transporter, periplasmic phosphonate binding protein. Note that this model does not identify all phnD-subfamily genes with evident phosphonate context, but all sequences above the trusted context may be inferred to bind phosphonate compounds even in the absence of such context. Furthermore, there is ample evidence to suggest that many other members of the TIGR01098 subfamily have a different primary function. Probab=98.94 E-value=1.9e-07 Score=68.31 Aligned_cols=220 Identities=14% Similarity=0.122 Sum_probs=120.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHH---HHHHHCCCCEEEEEEC-CHHHHHHHHHCCCC Q ss_conf 9579999999999986303583458837994258869999999999---9996106984899856-40578888514981 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTS---VILEEILGYKTNIKLL-AVPVTFRSLKNKGI 76 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~---~iLE~~lGy~Ve~~~~-~~~~~~~al~~G~i 76 (309) |||+++++++++++.++.+.+.+..+|+++|..+.-+...+-+-+. ..|++.+|++|++... +...+.++|++|++ T Consensus 2 ~~r~l~~~~~~~~~~~~~a~a~~~p~~l~~gv~P~~~~~~~~~~~~Pl~~yL~~~lg~~Ve~~~~~~y~~~~eal~~g~~ 81 (288) T TIGR03431 2 LRRLILSLVAAFMLISSNAQAEDWPKELNFGIIPTENASDLKQRWEPLADYLSKKLGVKVKLFFATDYAGVIEGMRFGKV 81 (288) T ss_pred CHHHHHHHHHHHHHHCCHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCC T ss_conf 08899999999999743224304883689998069999999999999999999987897899968999999999985983 Q ss_pred EEEEEEECCCCHHHHHHHHHCCCEEEEC----CCC-CCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCC-CCCCCC Q ss_conf 3999840464135666655304702301----356-6853567863446412345535655556654420135-785432 Q gi|254780342|r 77 DIFMGYWYPSLEKFIAPYLEEGSIKLVA----ENL-QGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYG-IEPGNE 150 (309) Q Consensus 77 D~~~e~w~p~~~~~~~~~~~~~~~~~~~----~~~-~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g-~~~~~~ 150 (309) |+..-. |..--... .+.....+. ..+ ...+..+.+++ +.+|.+++||+ ++.++ .++... T Consensus 82 Dia~~~--p~~yv~a~---~~~~~~~la~~~~~~g~~~y~s~iivr~---Ds~i~sl~DLk-------Gk~iaf~~~~St 146 (288) T TIGR03431 82 DIAWYG--PSSYAEAY---QKANAEAFAIEVNADGSTGYYSVLIVKK---DSPIKSLEDLK-------GKTFGFVDPNST 146 (288) T ss_pred EEEEEC--CHHHHHHH---HCCCCEEEEEEECCCCCCCEEEEEEEEC---CCCCCCHHHHC-------CCEEEECCCCCC T ss_conf 099988--47889986---5259748899851588864579999989---99888778957-------987660287411 Q ss_pred CCHHHHH-HH-HHCCCCCCC-E-EEE-CCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHH--C------CCCEEECCCCC Q ss_conf 0055587-66-403545221-0-010-366520189999999659936999412422320--1------58514007865 Q gi|254780342|r 151 GNQRILD-MI-NNNKFSLKG-F-RLI-EASELASFSQIRRDQRNNIPAVFLSWEPHPINS--D------LNIHYLPGGEE 217 (309) Q Consensus 151 ~~~~~~~-~~-~~~~l~~~~-~-~~~-~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~--~------~dl~~Ledp~~ 217 (309) .....+. .+ +..++.... + ++. .++ ......++.+|+.++.......+... + -++++|--... T Consensus 147 SG~l~P~~~L~~~~g~~~~~~f~~v~~~gs----hd~~~~aV~~G~~Dag~~~~~~~~~~~~~~~~~~~~~lrvi~~S~~ 222 (288) T TIGR03431 147 SGFLVPSYYLFKKNGIKPKEYFKKVTFSGS----HEAAILAVANGTVDAATTNDENLDRMIRKGQPDAMEDLRIIWKSPL 222 (288) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHEEECCC----HHHHHHHHHCCCCCEEEECHHHHHHHHHCCCCCHHHCEEEEEECCC T ss_conf 113999999998659972545531331578----7899999985993299833888999997479630313699986788 Q ss_pred CCCCCHHHHHHHHCHHHHHHCHHHHHHHHH Q ss_conf 467786583544277778748899999874 Q gi|254780342|r 218 ISGFGEASVYTVVRSDYLDKCPNISRLLKN 247 (309) Q Consensus 218 ~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~ 247 (309) .|.. ..++|+++ .|+..+-+++ T Consensus 223 ---iP~~--~~~v~~~l---~~~~~~~i~~ 244 (288) T TIGR03431 223 ---IPNG--PIVYRKDL---PADLKAKIRK 244 (288) T ss_pred ---CCCC--CEEEECCC---CHHHHHHHHH T ss_conf ---8877--18996999---9999999999 No 12 >TIGR03427 ABC_peri_uca ABC transporter periplasmic binding protein, urea carboxylase region. Members of this family are ABC transporter periplasmic binding proteins associated with the urea carboxylase/allophanate hydrolase pathway, an alternative to urease for urea degradation. The protein is restricted to bacteria with the pathway, with its gene close to the urea carboxylase and allophanate hydrolase genes. The substrate for this transporter therefore is likely to be urea or a compound from which urea is easily derived. Probab=98.83 E-value=4.6e-07 Score=65.75 Aligned_cols=201 Identities=12% Similarity=0.095 Sum_probs=136.3 Q ss_pred CCCEEEECCCCCHHHHHHHH-----HHHHHHHCCCCEEEEEEC-CHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCC Q ss_conf 88379942588699999999-----999996106984899856-405788885149813999840464135666655304 Q gi|254780342|r 25 CTPVRFADTGWTDIAATTAM-----TSVILEEILGYKTNIKLL-AVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEG 98 (309) Q Consensus 25 ~~~I~ig~~~wte~~v~a~i-----~~~iLE~~lGy~Ve~~~~-~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~ 98 (309) +++++||.-.|++-.....- .+.-.++ .|++|+++.. +.....+++.+|++|...-.+.+....... ...+. T Consensus 1 k~~~kig~S~w~Gw~Pw~~A~e~GifkKw~~k-~Gi~Vel~~f~dy~~si~a~~AG~~Dg~~~Tn~Dal~i~aa-~Gvd~ 78 (328) T TIGR03427 1 KDKFKVCWSIYAGWMPWGYAAQQGIVDKWADK-YGITIEVVQINDYVESINQYTAGKFDGCTMTNMDALTIPAA-GGVDT 78 (328) T ss_pred CCCEEEEEECCCCHHHHHHHHHCCCHHHHHHH-CCCEEEEEECCCHHHHHHHHHCCCCCEEEEECCHHHHHHHC-CCCCE T ss_conf 99339997268538889999974926555887-59737999828848889999758977477626156651311-79972 Q ss_pred CEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCC Q ss_conf 70230135668535678634464123455356555566544201357854320055587664035452210010366520 Q gi|254780342|r 99 SIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELA 178 (309) Q Consensus 99 ~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~ 178 (309) ..+..+-...| ..++.++ ++++++|| +++.++.+.+...+....+.++..|+.....+++..+.+ T Consensus 79 ~~vlv~D~SnG-~D~Ivak------~~~slaDL-------KGKkVav~~~svshyLL~rALe~aGLs~~DV~vVn~~~~- 143 (328) T TIGR03427 79 TALIVGDFSNG-NDGIVLK------GGKSLADL-------KGQKVNLVELSVSHYLLARALESVGLSEKDVKVVNTSDA- 143 (328) T ss_pred EEEEEECCCCC-CCEEEEC------CCCCHHHH-------CCCEEECCCCCCHHHHHHHHHHHCCCCHHHEEEEECCCH- T ss_conf 89998525788-7479978------99887884-------898883247870799999999986998899078847954- Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHH-HH-CCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHH Q ss_conf 1899999996599369994124223-20-158514007865467786583544277778748899999874 Q gi|254780342|r 179 SFSQIRRDQRNNIPAVFLSWEPHPI-NS-DLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKN 247 (309) Q Consensus 179 ~~a~l~~A~~~~~~~v~~~wsp~~~-~~-~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~ 247 (309) ++.+|+.+|+++.+..|.|... +. .-+-+.|=|..+.-+ --.+ ..++|.+++++||+..+.|-. T Consensus 144 ---d~~aAf~sg~VdA~vTWeP~ls~i~~~~ga~~iF~Ss~iPG-eI~D-~lvV~~~~L~~nP~~~kAlv~ 209 (328) T TIGR03427 144 ---DIVAAFITKDVTAVVTWNPQLSEIKAQPGANEVFDSSQIPG-EILD-LMVVNTQTLKANPNLGKALTG 209 (328) T ss_pred ---HHHHHHHCCCCCEEEEECHHHHHHHHCCCCCEEEECCCCCC-CEEE-EEEECHHHHHHCHHHHHHHHH T ss_conf ---69999747998789997856899972899826442577888-4037-999858988879999999999 No 13 >TIGR01729 taurine_ABC_bnd taurine ABC transporter, periplasmic binding protein; InterPro: IPR010068 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry identifies a cluster of ABC transporter periplasmic substrate binding proteins, apparently specific for taurine. Transport systems for taurine (NH2-CH2-CH2-SO3H), sulphonates, and sulphate esters import sulphur when sulphate levels are low. The most closely related proteins outside this family are putative aliphatic sulphonate binding proteins.. Probab=98.78 E-value=2e-08 Score=74.58 Aligned_cols=221 Identities=15% Similarity=0.159 Sum_probs=159.8 Q ss_pred HHHCCCCEEEEEECCH-HHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEE-CCCCCCCEEEEEEEHHHHHCCCC Q ss_conf 9610698489985640-578888514981399984046413566665530470230-13566853567863446412345 Q gi|254780342|r 49 LEEILGYKTNIKLLAV-PVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLV-AENLQGAKYMLAVNDVGFALGIK 126 (309) Q Consensus 49 LE~~lGy~Ve~~~~~~-~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~~~~~~i~ 126 (309) .++..||+++-...++ ..+..||++|++||-.=.-.|...+-.+.. .++.. =....|....+++++.. +|. T Consensus 24 ~aKe~gaTiDWRkFdSGadi~~AlASG~V~IG~~GSsplaaAASr~v----pie~f~~~~~iG~SEALVar~g~---GIe 96 (304) T TIGR01729 24 YAKEAGATIDWRKFDSGADIVAALASGDVDIGVVGSSPLAAAASREV----PIEVFLVVDKIGKSEALVAREGA---GIE 96 (304) T ss_pred HHHCCCCEEEEEEECCHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCC----CEEEEEEECCCCCCCEEEEECCC---CCC T ss_conf 54304880200110441655556533863322224707899872688----30020100102872002454367---887 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHH-CCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHH Q ss_conf 53565555665442013578543200555876640-35452210010366520189999999659936999412422320 Q gi|254780342|r 127 SYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINN-NKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINS 205 (309) Q Consensus 127 ~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~ 205 (309) ..+||. |+.++.|--...+..+.-.++. -+.+....+++.+.++ ++.+|.+||.++..|+|.|=---- T Consensus 97 KpeDL~-------GK~vaVPFvSTtHysLLaaLk~vw~~dp~~V~IlNl~PP----~I~AAwqRGDIDaAYVW~PAl~~l 165 (304) T TIGR01729 97 KPEDLK-------GKKVAVPFVSTTHYSLLAALKHVWKVDPKEVEILNLKPP----EIVAAWQRGDIDAAYVWDPALSEL 165 (304) T ss_pred CHHHHC-------CCEEECCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCH----HHHHHHHCCCCCCCCCCHHHHHHH T ss_conf 666717-------884502751134589999888762678760578617965----799875438826210374568877 Q ss_pred CCCCEEECCCCCC--CCCCHHHHHHHHCHHHHHHCHHHHHHHHHC---------------CCCHHHHHHHHHHHHHCCCC Q ss_conf 1585140078654--677865835442777787488999998740---------------89989999999999845999 Q gi|254780342|r 206 DLNIHYLPGGEEI--SGFGEASVYTVVRSDYLDKCPNISRLLKNI---------------KFSVALENEMMKLILNNKQD 268 (309) Q Consensus 206 ~~dl~~Ledp~~~--~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~---------------~ls~~~~~~l~~~v~~~g~~ 268 (309) +-+.++|-|.+.. .+-|.-+ ..|+|++|+++||++.+-+-|. +-+.++++.|...+ |.+ T Consensus 166 ~k~GKV~~DSeqvgawgaPTfD-~wvvrkdfAeknPe~v~aF~Kv~~dAya~Y~Anpd~W~~~spq~~k~akL~---G~~ 241 (304) T TIGR01729 166 KKSGKVVSDSEQVGAWGAPTFD-AWVVRKDFAEKNPEVVKAFVKVLADAYADYLANPDAWKADSPQVSKLAKLI---GAD 241 (304) T ss_pred HHCCCEEECHHHHHHCCCCCCC-EEEEECHHHHHCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHCCEEEEE---CCC T ss_conf 5359668525787413687310-246521454307078999998899850100238875667862112224440---688 Q ss_pred HHHHHH-----HHHHHHHHHHHHHHCCC Q ss_conf 899999-----99998599999875475 Q gi|254780342|r 269 RQFVGR-----TMLRTHPDLLKNWLIGV 291 (309) Q Consensus 269 ~~~vA~-----~wL~~n~~~~~~Wl~~~ 291 (309) .++|.. -|+......-++||=|= T Consensus 242 ~e~vp~llkg~~fpT~~eQ~sdk~lGGg 269 (304) T TIGR01729 242 AEDVPELLKGLSFPTADEQVSDKLLGGG 269 (304) T ss_pred CCCHHHHHHCCCCCCHHHHHHHHHCCCC T ss_conf 2313777622346785452002321673 No 14 >TIGR01098 3A0109s03R phosphonate ABC transporter, periplasmic phosphonate-binding protein; InterPro: IPR005770 Bacterial binding protein-dependent transport systems , are multicomponent systems typically composed of a periplasmic substrate-binding protein, one or two reciprocally homologous integral inner-membrane proteins and one or two peripheral membrane ATP-binding proteins that couple energy to the active transport system. The integral inner-membrane proteins translocate the substrate across the membrane. It has been shown , that most of these proteins contain a conserved region located about 80 to 100 residues from their C-terminal extremity. This region seems to be located in a cytoplasmic loop between two transmembrane domains. Apart from the conserved region, the sequence of these proteins is quite divergent, and they have a variable number of transmembrane helices. This is a family of periplasmic proteins which are part of the transport system for alkylphosphonate uptake. ; GO: 0015604 phosphonate transmembrane transporter activity, 0015716 phosphonate transport, 0030288 outer membrane-bounded periplasmic space. Probab=98.74 E-value=4.1e-07 Score=66.10 Aligned_cols=229 Identities=15% Similarity=0.121 Sum_probs=126.7 Q ss_pred CHHHHHHHHHHHHHHHHHC----------CC--C----CC---------CCCEEEECCCC---CHHHHHHHHHHHHHHHC Q ss_conf 9579999999999986303----------58--3----45---------88379942588---69999999999999610 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYA----------RD--A----DS---------CTPVRFADTGW---TDIAATTAMTSVILEEI 52 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~----------~~--~----~~---------~~~I~ig~~~w---te~~v~a~i~~~iLE~~ 52 (309) |||+++++++++..+++.+ +. + .. -+++++|..+- ++..-...-++..||++ T Consensus 1 mkr~~~~~~~~~~~~~~~~~~~S~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~~g~~P~e~~~~~~~~~~pL~~~Lek~ 80 (299) T TIGR01098 1 MKRLLALLAALLVASLAAACGSSEIDNRDCAAKAADAADADADVEKSETPKELNFGILPGENASNLARRWEPLKDYLEKK 80 (299) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHCCCCHHHCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHH T ss_conf 94788999999999999875420014543565510010156212103677726899844887557777668999988875 Q ss_pred CCCEEEEEEC-CHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEEC----CCCC---CCEEEEEEEHHHHHCC Q ss_conf 6984899856-405788885149813999840464135666655304702301----3566---8535678634464123 Q gi|254780342|r 53 LGYKTNIKLL-AVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLVA----ENLQ---GAKYMLAVNDVGFALG 124 (309) Q Consensus 53 lGy~Ve~~~~-~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~----~~~~---~~~~~~~v~~~~~~~~ 124 (309) ||++|++... +...+.+||+.|.||+.. ++...-.......+....... .... +.+.-+.+ -++.+ T Consensus 81 Lg~kV~~~~~~dY~a~IEamr~g~~D~a~---~g~~sY~~A~~~~~~~~~a~~~~~~~~~~g~~~Y~S~~iv---~~dS~ 154 (299) T TIGR01098 81 LGIKVELFVATDYSAVIEAMRFGRVDIAW---FGPSSYVLAHRRANAEAFALTAIRYVSTDGSPGYYSVIIV---KADSP 154 (299) T ss_pred CCCCEEEEEECCCCCEEEEECCCCEEEEE---ECCCHHHEEEECCCCCHHHHHHHHHCCCCCCCCEEEEEEE---CCCCC T ss_conf 58867998616831033423268078983---4785123220013870121344331168889834899998---38999 Q ss_pred CCCHHHHHHHHHHHHHCCCC-CCCCCCCCHHHHH--HHHHCCCCCC------CEEEE-CCCCCCHHHHHHHHHHCCCCEE Q ss_conf 45535655556654420135-7854320055587--6640354522------10010-3665201899999996599369 Q gi|254780342|r 125 IKSYQDIAKYKKELGAKIYG-IEPGNEGNQRILD--MINNNKFSLK------GFRLI-EASELASFSQIRRDQRNNIPAV 194 (309) Q Consensus 125 i~~~~dl~~~~~~~~~~~~g-~~~~~~~~~~~~~--~~~~~~l~~~------~~~~~-~~s~~~~~a~l~~A~~~~~~~v 194 (309) |++++||.++. +++.++ .++-.......+. ....+++..+ |.+++ .|+ .....-++.||+.++ T Consensus 155 i~~~~DlL~~L---KGK~~af~Dp~STSG~l~P~~~l~~~~~~~~~~~d~~~F~~~~~~G~----Hd~~~~~V~nG~vDa 227 (299) T TIGR01098 155 IKSLKDLLKKL---KGKTFAFGDPASTSGYLVPRYQLLKEGGLDADKDDDKFFSEVVFSGS----HDASILAVANGKVDA 227 (299) T ss_pred CCCHHHHHHHC---CCCEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEECC----HHHHHHHHHHCCCCE T ss_conf 63389999750---79759850688751247788998765078988853452332688431----589999988088545 Q ss_pred EEEECCCHHHHCC--------------------CCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC Q ss_conf 9941242232015--------------------85140078654677865835442777787488999998740 Q gi|254780342|r 195 FLSWEPHPINSDL--------------------NIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI 248 (309) Q Consensus 195 ~~~wsp~~~~~~~--------------------dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~ 248 (309) ...|+..+-..+- +++++--.. ..|.+. .++|+++.+ +|+..+-|.++ T Consensus 228 A~~~~~~~~~~~~~~~~~~~~~~~~~g~~D~~~~~~vI~~S~---~IPndp--~~~R~~L~~-yP~~k~ki~~~ 295 (299) T TIGR01098 228 ATNNSSAIERLKKRGPSDKFKKKVAKGATDLMKKVRVIWKSP---LIPNDP--IAVRSDLPP-YPELKEKIRDA 295 (299) T ss_pred EEECHHHHHHHHHCCCCCCCCEEECCCCCCCHHCEEEEEECC---CCCCCC--EEEECCCCC-CHHHHHHHHHH T ss_conf 765304466565303235651552047767137017875237---877875--465257896-67799999999 No 15 >PRK09861 cytoplasmic membrane lipoprotein-28; Provisional Probab=98.66 E-value=2e-06 Score=61.62 Aligned_cols=225 Identities=12% Similarity=0.083 Sum_probs=122.8 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHHCCCCEEEEEEEC Q ss_conf 999999999863035834588379942588699999999999996106984899856-4057888851498139998404 Q gi|254780342|r 6 AVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLL-AVPVTFRSLKNKGIDIFMGYWY 84 (309) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~-~~~~~~~al~~G~iD~~~e~w~ 84 (309) +++++++++..+|+...++.++|+||...-+.+.+.-.+...+.|+ .||+++++.. +...--.||.+|+||.-.-- T Consensus 12 a~ll~~~~~l~~C~~~~~~~k~ikVG~~~~p~~~i~e~~~~~~~ek-~G~~leiv~FsDy~~PN~AL~~G~iDaN~fQ-- 88 (272) T PRK09861 12 AALLLAGILLAGCDQSSSDAKHIKVGVINGAEQDVAEVAKKVAKEK-YGLDVELVGFSGSLLPNDATNHGELDANVFQ-- 88 (272) T ss_pred HHHHHHHHHHHHCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHH-CCCEEEEEEECCCCCHHHHHHCCCCCCHHHH-- T ss_conf 9999999998745897678973899967898699999999888761-7976899994686226289767983602455-- Q ss_pred CCCHHHHHHHHHCCC--EEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHH---------HHHHHC-------CCCCC Q ss_conf 641356666553047--023013566853567863446412345535655556---------654420-------13578 Q gi|254780342|r 85 PSLEKFIAPYLEEGS--IKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYK---------KELGAK-------IYGIE 146 (309) Q Consensus 85 p~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~---------~~~~~~-------~~g~~ 146 (309) +..++..+.++.+ ++.++..+.. ..+++. ..++++++|++-+ ...+.. .+-.. T Consensus 89 --H~pyL~~~n~~~g~~L~~v~~~~~~-P~glYS------~K~ksl~dlp~Ga~IaIPnDpsN~~RAL~lL~~aGLI~Lk 159 (272) T PRK09861 89 --HRPFLEQDNQAHGYKLVAVGNTFVF-PMAGYS------KKIKTVAQIKEGATVAIPNDPTNLGRALLLLQKEKLITLK 159 (272) T ss_pred --HHHHHHHHHHHCCCCEEEEEEEEEE-EEEECC------CCCCCHHHCCCCCEEEECCCHHHHHHHHHHHHHCCCEEEC T ss_conf --7999999999869957997316783-014034------4659875847999898047812699999999988978977 Q ss_pred CCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCC----CCEEECCCCCCCCCC Q ss_conf 5432005558766403545221001036652018999999965993699941242232015----851400786546778 Q gi|254780342|r 147 PGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDL----NIHYLPGGEEISGFG 222 (309) Q Consensus 147 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~----dl~~Ledp~~~~~~p 222 (309) ++.+......+ +..+...+. +++.. -+++.+++++.++++..+......-..+ |--++|+.. . T Consensus 160 ~~~~~~~t~~D-I~~NPk~lk---~~e~~----aaql~rsl~Dp~vD~avin~n~a~~agl~p~~dal~~E~~~-----~ 226 (272) T PRK09861 160 EGKGLLPTALD-ITDNPRHLQ---IMELE----GAQLPRVLDDPKVDVAIISTTYIQQTGLSPVHDSVFIEDKN-----S 226 (272) T ss_pred CCCCCCCCHHH-HHCCCCCCE---EEECC----HHHHHHHHCCCCCCEEEECHHHHHHCCCCCCCCCEEECCCC-----C T ss_conf 99997777445-760887767---99826----77746750577616899760579887779440020366899-----9 Q ss_pred HHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHHHH Q ss_conf 658354427777874889999987408998999999 Q gi|254780342|r 223 EASVYTVVRSDYLDKCPNISRLLKNIKFSVALENEM 258 (309) Q Consensus 223 ~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~~l 258 (309) +|-...++|++= ++.|.+.++.+-++ .+++.+. T Consensus 227 py~niiavr~~~-~d~~~ik~lv~a~~--S~evk~~ 259 (272) T PRK09861 227 PYVNILVAREDN-KNAENVKEFLQSYQ--SPEVAKA 259 (272) T ss_pred CEEEEEEECCCC-CCCHHHHHHHHHHC--CHHHHHH T ss_conf 837999982767-69989999999977--9999999 No 16 >PRK11063 metQ DL-methionine transporter substrate-binding subunit; Provisional Probab=98.64 E-value=2.5e-06 Score=61.00 Aligned_cols=231 Identities=12% Similarity=0.101 Sum_probs=120.8 Q ss_pred CHHHHHHHHHHH-HHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHHCCCCEE Q ss_conf 957999999999-99863035834588379942588699999999999996106984899856-4057888851498139 Q gi|254780342|r 1 MYKILAVCLFLT-TFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLL-AVPVTFRSLKNKGIDI 78 (309) Q Consensus 1 mkk~~~~~~~~~-~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~-~~~~~~~al~~G~iD~ 78 (309) ||+++++.++++ +..++|++...+.++|+||...-+...+.-.+.....|+. ||+++++.. +...--.||..|+||+ T Consensus 5 ~k~~~~~~~l~~~l~l~gCg~~~~~~~~ikVG~~~gp~~ei~e~~~~~~~ek~-G~~veiv~FsDy~~pN~AL~~G~iDa 83 (271) T PRK11063 5 FKTFAAVGALIGSLALVGCGQDEKDPNHIKVGVIVGAEQQVAEVAQKVAKEKY-GLDVELVTFNDYVLPNEALSKGDIDA 83 (271) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHC-CCEEEEEEECCCHHHHHHHHCCCCCH T ss_conf 99999999999999750058985679818998468987999999888888607-97689999368321558986798442 Q ss_pred EEEEECCCCHHHHHHHHHCC--CEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHH---------HHHHH------- Q ss_conf 99840464135666655304--7023013566853567863446412345535655556---------65442------- Q gi|254780342|r 79 FMGYWYPSLEKFIAPYLEEG--SIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYK---------KELGA------- 140 (309) Q Consensus 79 ~~e~w~p~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~---------~~~~~------- 140 (309) -.-- +..++..+.++. .++.++..+.. ..+++. ..++++++|++-+ ...+. T Consensus 84 N~fQ----H~~yL~~~nk~~g~~L~~v~~~~~~-P~glYS------~K~ksl~elp~Ga~IaIPND~sN~~RAL~lL~~a 152 (271) T PRK11063 84 NAFQ----HKPYLDQQIKDRGYKLVAVGNTFVY-PIAGYS------KKIKSLDELQDGSQVAVPNDPTNLGRSLLLLQKV 152 (271) T ss_pred HHHH----CHHHHHHHHHHCCCCEEEEEEEEEE-EEEEEE------CCCCCHHHCCCCCEEEECCCCCHHHHHHHHHHHC T ss_conf 3455----7999999999779957998667784-158642------3658864756999998048856099999999988 Q ss_pred CCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHH----CCCCEEECCCC Q ss_conf 01357854320055587664035452210010366520189999999659936999412422320----15851400786 Q gi|254780342|r 141 KIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINS----DLNIHYLPGGE 216 (309) Q Consensus 141 ~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~----~~dl~~Ledp~ 216 (309) .++-..++.+......+.. .+...+. +++.. -+++.+++.+..+.+...-.....-. .-|--++|++. T Consensus 153 GLIkLk~~~~~~~T~~DI~-~Npk~l~---~~e~d----aaql~rsl~D~dv~~aviN~~~a~~agl~p~kdal~~E~~~ 224 (271) T PRK11063 153 GLIKLKDGVGLLPTVLDIV-ENPKNLK---IVELE----APQLPRSLDDAQIALAVINTTYASQIGLTPAKDGIFVEDKD 224 (271) T ss_pred CCEEECCCCCCCCCHHHHH-CCCCCCE---EEECC----HHHHHHHCCCCCEEEEEECHHHHHHCCCCCCCCCEEECCCC T ss_conf 9889779999878966885-4887748---99903----77644313576510898147789777989342504777899 Q ss_pred CCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHHHHH Q ss_conf 5467786583544277778748899999874089989999999 Q gi|254780342|r 217 EISGFGEASVYTVVRSDYLDKCPNISRLLKNIKFSVALENEMM 259 (309) Q Consensus 217 ~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~~l~ 259 (309) .+|-...++|++ -++.|.+.++.+-++ .+++.+.. T Consensus 225 -----~py~Nvivvr~~-~kd~~~ik~lv~~~~--S~evk~~I 259 (271) T PRK11063 225 -----SPYVNLIVARED-NKDAENVKKFVQAYQ--SDEVYEAA 259 (271) T ss_pred -----CCEEEEEEECCC-CCCCHHHHHHHHHHC--CHHHHHHH T ss_conf -----983899998277-779989999999977--99999999 No 17 >PRK10677 modA molybdate transporter periplasmic protein; Provisional Probab=98.35 E-value=0.00011 Score=50.24 Aligned_cols=230 Identities=8% Similarity=-0.000 Sum_probs=129.0 Q ss_pred CHHHHH-HHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCC-CCEE Q ss_conf 957999-99999999863035834588379942588699999999999996106984899856405788885149-8139 Q gi|254780342|r 1 MYKILA-VCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNK-GIDI 78 (309) Q Consensus 1 mkk~~~-~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G-~iD~ 78 (309) |-|.+. +.+..++.+..+++..++.++|+|.... +-..++.+|.++.=+ +.|.+|++..+++..+++.+..| ..|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~V~aAA-sL~~~~~ei~~~Fek-~~g~~v~~~fgsSg~L~~QI~~GAp~Dv 78 (257) T PRK10677 1 MARKWLNLFAGAALSFAVAGNALADEGKITVFAAA-SLTNAMQDIATQYKK-EKGVDVVSSFASSSTLARQIEAGAPADL 78 (257) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEC-CCHHHHHHHHHHHHH-HHCCEEEEEECCHHHHHHHHHCCCCCCE T ss_conf 91289999999999872355332568859999925-768999999999988-4198499996648999999982899658 Q ss_pred EEEEECCCCHHHHHHHHHCCCEEEEC-CCCCCCEEEEEEEHHHHHC--CCCCHHHHHHHHHHHHHCCCCCCCC-CCCCHH Q ss_conf 99840464135666655304702301-3566853567863446412--3455356555566544201357854-320055 Q gi|254780342|r 79 FMGYWYPSLEKFIAPYLEEGSIKLVA-ENLQGAKYMLAVNDVGFAL--GIKSYQDIAKYKKELGAKIYGIEPG-NEGNQR 154 (309) Q Consensus 79 ~~e~w~p~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~~~~~--~i~~~~dl~~~~~~~~~~~~g~~~~-~~~~~~ 154 (309) +...-... ......++.+.... ..+..++.-+++|+..... .+....++.+.... .++.-++|. ..+..- T Consensus 79 F~sAd~~~----~~~l~~~~~i~~~~~~~~a~n~lvli~pk~~~~~~~~i~~~~d~~~ll~~--~riaia~P~~aP~G~y 152 (257) T PRK10677 79 FISADQKW----MDYAVDKKAIDTATRQTLLGNSLVVVAPKASEQKDFTIDKKTDWTSLLNG--GRLAVGDPDHVPAGIY 152 (257) T ss_pred EEECCHHH----HHHHHHCCCCCCCCEEEEECCEEEEEEECCCCCCCCCCCCCCCHHHHCCC--CCEEEECCCCCCCHHH T ss_conf 99787577----89998677856776456214749999846877786443433237774258--8089858888865799 Q ss_pred HHHHHHHCCCCCCC-EEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEE-CCCCCCCCCCH-HHHHHHHC Q ss_conf 58766403545221-00103665201899999996599369994124223201585140-07865467786-58354427 Q gi|254780342|r 155 ILDMINNNKFSLKG-FRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYL-PGGEEISGFGE-ASVYTVVR 231 (309) Q Consensus 155 ~~~~~~~~~l~~~~-~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~L-edp~~~~~~p~-~~~~~vv~ 231 (309) ..+.++..++-..- -+++.+ .-..++.+.+.+|+.++..+|.+|..-.. ++..+ .-|.+.+. |- |.+. +++ T Consensus 153 a~~~L~~~gl~~~l~~klv~~---~nV~~~l~~v~~G~adaG~Vy~Sda~~~~-~~~~~~~~p~~~~~-pi~y~~a-i~~ 226 (257) T PRK10677 153 AKEALQKLGAWDTLSPKLAPA---EDVRGALALVERNEAPLGIVYGSDAVASK-GVKVVATFPEDSHK-KVEYPMA-IVE 226 (257) T ss_pred HHHHHHHCCCHHHHCCCEEEC---CCHHHHHHHHHCCCCCEEEEECCHHHCCC-CCEEEEECCHHHCC-CCEEEEE-EEC T ss_conf 999999767665506541325---76999999998299878999700022268-86499987745389-6145589-975 Q ss_pred HHHHHHCHHHHHHHHH Q ss_conf 7778748899999874 Q gi|254780342|r 232 SDYLDKCPNISRLLKN 247 (309) Q Consensus 232 ~~~~~~~P~~~~~L~~ 247 (309) ..++|.+.+|++= T Consensus 227 ---~~~~~~a~~F~~f 239 (257) T PRK10677 227 ---GHNNATVSAFYDY 239 (257) T ss_pred ---CCCCHHHHHHHHH T ss_conf ---9999999999999 No 18 >PRK10797 glutamate and aspartate transporter subunit; Provisional Probab=98.32 E-value=7.8e-05 Score=51.26 Aligned_cols=214 Identities=12% Similarity=0.110 Sum_probs=113.7 Q ss_pred CHHHHHHHHHHHHHHHHHCCC-C--C---------CCCCEEEEC----CCCC---------HH-HHHHHHHHHHHHHCC- Q ss_conf 957999999999998630358-3--4---------588379942----5886---------99-999999999996106- Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARD-A--D---------SCTPVRFAD----TGWT---------DI-AATTAMTSVILEEIL- 53 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~-~--~---------~~~~I~ig~----~~wt---------e~-~v~a~i~~~iLE~~l- 53 (309) |||+..+++.++++++..++. + . +.++|+||. ++|. +- .=+++.++..|...+ T Consensus 3 ~kk~~~~~l~~~l~~~~~~~~~~~~a~~~tL~~Ik~~G~l~vG~~~~~pPf~~~d~~g~~vGfdvDla~~ia~~l~~~l~ 82 (302) T PRK10797 3 LRKLATALLALGLSAGLAQAEDAAPAAGSTLDKIAKNGVIVVGHRESSVPFSYYDNQQKVVGYSQDYSNAIVEAVKKKLN 82 (302) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCEEEEEECCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 58999999999999999745677753227199998699899998999899716899998515417999999999987744 Q ss_pred --CCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHH Q ss_conf --984899856405788885149813999840464135666655304702301356685356786344641234553565 Q gi|254780342|r 54 --GYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDI 131 (309) Q Consensus 54 --Gy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl 131 (309) |.+|+.++.+....+.+|.+|.+|+.+..- +...+ +...+..+..|.-....+.|++. ..|+++.|| T Consensus 83 ~~~~~v~~v~~~~~~rip~L~~gk~Di~~~~~-t~T~e-------R~k~vdFS~pY~~~~~~llv~k~---s~Iks~~DL 151 (302) T PRK10797 83 KPDLQVKLIPITSQNRIPLLQNGTFDFECGST-TNNLE-------RQKQAAFSDTIFVVGTRLLTKKG---GDIKDFADL 151 (302) T ss_pred CCCCEEEEEECCHHHHHHHHHCCCCCEEEECC-CCCHH-------HHCCEEECCCCEECCEEEEEECC---CCCCCHHHH T ss_conf 67734999975788877999789808897357-56977-------74124214662771468999778---866675660 Q ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCC--HH--HHCC Q ss_conf 5556654420135785432005558766403545221001036652018999999965993699941242--23--2015 Q gi|254780342|r 132 AKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPH--PI--NSDL 207 (309) Q Consensus 132 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~--~~--~~~~ 207 (309) +++.+|...+................+. +++.... ..+...++++|+.+.+..=.+- +. ..+. T Consensus 152 -------~GK~V~V~~GTt~e~~~~~~~~~~~~~~---~iv~~~~---~~~a~~~L~~GrvDA~~~D~~~l~~~~~~~~~ 218 (302) T PRK10797 152 -------KGKAVVVTSGTTSEVLLNKLNEEQKMNM---RIISAKD---HGDSFRTLESGRAVAFMMDDALLAGERAKAKK 218 (302) T ss_pred -------CCCEEEEECCCHHHHHHHHHHHHHCCCC---EEEECCC---HHHHHHHHHCCCCCEEEECHHHHHHHHHHCCC T ss_conf -------8997999678739999998667506772---3785088---89999999859950998122988899986369 Q ss_pred --CCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHH Q ss_conf --8514007865467786583544277778748899999874 Q gi|254780342|r 208 --NIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKN 247 (309) Q Consensus 208 --dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~ 247 (309) .+.++.+|. ...+ ....+| +..|+..+.+++ T Consensus 219 ~~~~~iv~~~~---~~e~--ygia~r----Kgd~~L~~~Vn~ 251 (302) T PRK10797 219 PDNWEIVGKPQ---SQEA--YGCMLR----KDDPQFKKLMDD 251 (302) T ss_pred CCCEEEECCCC---CCCC--EEEEEE----CCCHHHHHHHHH T ss_conf 97549937867---7760--799996----899999999999 No 19 >COG3221 PhnD ABC-type phosphate/phosphonate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=98.28 E-value=8.9e-05 Score=50.86 Aligned_cols=249 Identities=14% Similarity=0.014 Sum_probs=117.4 Q ss_pred CHHHHHHHHHHHHHHH-HHCCCCCCC-------CCEEEECCCC---CHHHHHHHHHHHHHHHCCCCEEEEEE-CCHHHHH Q ss_conf 9579999999999986-303583458-------8379942588---69999999999999610698489985-6405788 Q gi|254780342|r 1 MYKILAVCLFLTTFSI-SYARDADSC-------TPVRFADTGW---TDIAATTAMTSVILEEILGYKTNIKL-LAVPVTF 68 (309) Q Consensus 1 mkk~~~~~~~~~~~~~-~~~~~~~~~-------~~I~ig~~~w---te~~v~a~i~~~iLE~~lGy~Ve~~~-~~~~~~~ 68 (309) |||.+..++.++.+++ .+++...+. +++++|...- +...-...=+...||+.+|.+|+... .+...+. T Consensus 2 ~~~~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~~l~~gi~p~e~~~~~~~~~~pl~~~L~~~lG~~V~~~~a~dy~~vi 81 (299) T COG3221 2 KKKKLLLASAAAVLALAGCCGSSESDAPAAEDPKELRVGIVPTENPTNLIPAWAPLADYLEKELGIPVEFFVATDYAAVI 81 (299) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH T ss_conf 42788999999999998632577454322268753599976888868788765669989889869954357257689999 Q ss_pred HHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEEC-C-CCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCC Q ss_conf 885149813999840464135666655304702301-3-56685356786344641234553565555665442013578 Q gi|254780342|r 69 RSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLVA-E-NLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIE 146 (309) Q Consensus 69 ~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~ 146 (309) +||..|++|+..-.+.+ +............+. . ...+...++-+---.++..|+++.|++...=. .+.+ T Consensus 82 eal~~g~~D~A~~~~~a----~~~a~~~~~~~e~~~~~~~~dg~~~Y~S~~i~~~ds~i~sl~dlkgk~~a-----f~d~ 152 (299) T COG3221 82 EALRAGQVDIAWLGPSA----YVEAVDRALAGEPLAQTVQKDGSPGYYSVIIVRADSPIKSLEDLKGKRFA-----FGDP 152 (299) T ss_pred HHHHCCCEEEEECCCHH----HHHHHHHCCCCCCEEEEECCCCCCCEEEEEEEECCCCCCHHHHHCCCEEE-----CCCC T ss_conf 99858961077418577----89986412454201332125898651789999279974337882699676-----3699 Q ss_pred CCCCCCHHHHHHHHHCC-CC-CCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCC--------CCEEECCCC Q ss_conf 54320055587664035-45-221001036652018999999965993699941242232015--------851400786 Q gi|254780342|r 147 PGNEGNQRILDMINNNK-FS-LKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDL--------NIHYLPGGE 216 (309) Q Consensus 147 ~~~~~~~~~~~~~~~~~-l~-~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~--------dl~~Ledp~ 216 (309) .-..+...-...+...+ .+ ..++..+.-+++ ......++.+|+.++...++.-+....- ++++|--.. T Consensus 153 ~StSG~l~P~~~L~~~g~~d~~~~f~~v~~~G~--H~~a~~aV~nG~vDva~~~~~~~~~~~~~~~~~~~~~l~vi~~S~ 230 (299) T COG3221 153 DSTSGYLFPLYYLAKEGGIDPDKFFGEVIFSGG--HDAAVLAVANGQVDVAAVNSSARGLLKKAAPEGVAEKLRVIWKSP 230 (299) T ss_pred CCCCHHHHHHHHHHHHCCCCHHHHHCEEECCCH--HHHHHHHHHCCCCEEEECCHHHHHHHHHHCCCCCHHHCEEEEECC T ss_conf 854124768999998438984654226503570--799999997488308852288887776404334533426999558 Q ss_pred CCCCCCHHHHHHHHCHHHHHHC-HHHHHHHHHCCCCHHHHHHHHHHHHHCC Q ss_conf 5467786583544277778748-8999998740899899999999998459 Q gi|254780342|r 217 EISGFGEASVYTVVRSDYLDKC-PNISRLLKNIKFSVALENEMMKLILNNK 266 (309) Q Consensus 217 ~~~~~p~~~~~~vv~~~~~~~~-P~~~~~L~~~~ls~~~~~~l~~~v~~~g 266 (309) - .| +...++|+++..+. -++.++|-++.- ++++..+....-..+ T Consensus 231 ~---iP--~~pi~vr~~L~~~~k~kl~~af~~l~~-~~~~~~l~~~~~~~~ 275 (299) T COG3221 231 L---IP--NDPIAVRSDLPADLKEKLRDAFLDLAK-TEDKKILLDLYGLGG 275 (299) T ss_pred C---CC--CCCEEEECCCCHHHHHHHHHHHHHCCC-CCCCHHHHHHCCCCC T ss_conf 9---89--997888377999999999999984374-311358897505676 No 20 >PRK09495 glnH glutamine ABC transporter periplasmic protein; Reviewed Probab=98.27 E-value=1.9e-05 Score=55.25 Aligned_cols=173 Identities=15% Similarity=0.159 Sum_probs=99.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECC-CCC-----HH----HHHHHHHHHHHHHCCCCEEEEEECCHHHHHHH Q ss_conf 957999999999998630358345883799425-886-----99----99999999999610698489985640578888 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADT-GWT-----DI----AATTAMTSVILEEILGYKTNIKLLAVPVTFRS 70 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~-~wt-----e~----~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~a 70 (309) |||++.++++++++++++++.++ .++|+||.. +|+ +. -+--.+.+.+-+ +||++++++..+-..++.+ T Consensus 1 mk~~~~~~~~~~~~~~~~~~~a~-~~~l~Vg~~~~~pPf~f~~~g~~~GfdvDl~~~ia~-~lg~~~e~~~~~~~~~i~~ 78 (247) T PRK09495 1 MKSVLKVSLAALTLAFAVSSHAA-DKELVVATDTAFVPFEFKQGDKYVGFDVDLWAAIAK-ELKLDYTLKPMDFSGIIPA 78 (247) T ss_pred CHHHHHHHHHHHHHHHHHHHHHC-CCEEEEEECCCCCCEEECCCCEEEEEHHHHHHHHHH-HHCCCEEEEECCHHHHHHH T ss_conf 97799999999999998766514-994999989987980576799278829999999999-9699469997788999999 Q ss_pred HHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 51498139998404641356666553047023013566853567863446412345535655556654420135785432 Q gi|254780342|r 71 LKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNE 150 (309) Q Consensus 71 l~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~ 150 (309) |.+|++|+..... +.+.+.- ........|......+.++.. ...+.+++||. ++.+|...+.. T Consensus 79 l~~g~~D~~~~~~-~~T~eR~-------~~~~FS~py~~~~~~i~v~~~--~~~i~~~~dL~-------Gk~v~v~~Gs~ 141 (247) T PRK09495 79 LQTKNIDLALAGI-TITDERK-------KAIDFSDGYYKSGLLVMVKAN--NNDIKSVKDLD-------GKVVAVKSGTG 141 (247) T ss_pred HHCCCCCEEECCC-CCCHHHH-------HHCCCCCCCEEEEEEEEEECC--CCCCCCHHHCC-------CCEEEEECCCH T ss_conf 9769956364660-1478998-------433347770772469999899--97779824548-------98899846958 Q ss_pred CCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCH Q ss_conf 0055587664035452210010366520189999999659936999412422 Q gi|254780342|r 151 GNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHP 202 (309) Q Consensus 151 ~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~ 202 (309) ..+ .++.. +........+. ..+...++.+|+.+++..=.+.. T Consensus 142 ~~~----~~~~~-~~~~~i~~~~~-----~~~~~~aL~~GrvDa~v~d~~~~ 183 (247) T PRK09495 142 SVD----YAKAN-IKTKDLRQFPN-----IDNAYLELGTGRADAVLHDTPNI 183 (247) T ss_pred HHH----HHHHH-CCCCCEEECCC-----HHHHHHHHHCCCEEEEEECHHHH T ss_conf 999----99963-88986696598-----89999987658734999568999 No 21 >COG2358 Imp TRAP-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=98.26 E-value=5.4e-06 Score=58.80 Aligned_cols=202 Identities=11% Similarity=0.066 Sum_probs=121.5 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHH-HHHHHHHHHHHHH-CCCCEEEEEE-CCHHHHHHHHHCCCCE Q ss_conf 95799999999999863035834588379942588699-9999999999961-0698489985-6405788885149813 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDI-AATTAMTSVILEE-ILGYKTNIKL-LAVPVTFRSLKNKGID 77 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~-~v~a~i~~~iLE~-~lGy~Ve~~~-~~~~~~~~al~~G~iD 77 (309) |+.+.++.++++++.+.+...+.....|+||+-+-++. +.++.=+++++++ ..|++++.++ +++-.....+.+|++| T Consensus 1 ~~~l~l~~la~~~~~~~~~~~~~~~~~itigTG~~~G~YY~ig~~ia~~~~~~~~~i~~~v~~tggSv~Nl~~i~~Ge~d 80 (321) T COG2358 1 MKNLALAALAAAGAGSVSTGPAAEPKFITIGTGSTGGVYYPIGGGLAQLLNKDEKGIECSVVPTGGSVENLKLLASGEAD 80 (321) T ss_pred CCEEEHHHHHHHHHCCCCCCCCCCCEEEEEEECCCCCEEEEHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHCCCCC T ss_conf 95021689899873445656666763799960699862230378999998514778189996065429999867537625 Q ss_pred EEEEEECCCCHHH-----HHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 9998404641356-----66655304702301356685356786344641234553565555665442013578543200 Q gi|254780342|r 78 IFMGYWYPSLEKF-----IAPYLEEGSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGN 152 (309) Q Consensus 78 ~~~e~w~p~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~ 152 (309) +..-.-......+ +........+..+...+.+... +++++ +.+|++|+||. ..|+.-.+++++.+ T Consensus 81 ~alvq~d~a~~ay~G~g~f~~~~~~~~lr~v~~lype~~~-vv~r~---d~~Ikti~DL~------GKrV~iG~~gSgt~ 150 (321) T COG2358 81 LALVQSDVAYEAYNGTGSFEGKGKDENLRAVAALYPEPFH-VVTRK---DAGIKTIADLK------GKRVAIGPPGSGTE 150 (321) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHEECCCCEEE-EEEEC---CCCCCEEHHCC------CCEEEECCCCCCCH T ss_conf 3566678999987185311456655143322130542089-99966---88965601038------98886269997408 Q ss_pred HHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHH------HHCCCCEEECCC Q ss_conf 555876640354522100103665201899999996599369994124223------201585140078 Q gi|254780342|r 153 QRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPI------NSDLNIHYLPGG 215 (309) Q Consensus 153 ~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~------~~~~dl~~Ledp 215 (309) ...+..++.+|+....+++...-.. ++...++++|+.+.++.-.+++. ....++++++-+ T Consensus 151 ~~a~~il~a~Gi~~~~~~~~~~~~~---a~~~~~l~~g~iDA~~~~~G~p~~ai~el~~~~~i~lv~i~ 216 (321) T COG2358 151 ATARQILEALGITYDDYELDLGLGD---AESADALKNGTIDAAFYVAGVPNPAISELATTCDIVLVPIS 216 (321) T ss_pred HHHHHHHHHCCCCCCCHHHHHHCCC---HHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHCCEEEEECC T ss_conf 8999999973999762456663373---06578763796328998158888319999861775899378 No 22 >COG1464 NlpA ABC-type metal ion transport system, periplasmic component/surface antigen [Inorganic ion transport and metabolism] Probab=98.23 E-value=1.8e-05 Score=55.39 Aligned_cols=228 Identities=18% Similarity=0.140 Sum_probs=114.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCC---CCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHHCCCC Q ss_conf 957999999999998630358345---88379942588699999999999996106984899856-40578888514981 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADS---CTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLL-AVPVTFRSLKNKGI 76 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~---~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~-~~~~~~~al~~G~i 76 (309) |+++...+++++++++..+|.+.. .++|+||...-+.+.+.-.+.+..+++. |++++++.. +...--.||..||| T Consensus 1 m~~~~~~~~~~~~~l~~~~~~~~~~~~~~~I~vg~~~~p~a~ile~~~k~~~~k~-Gi~l~i~~FtDY~~PN~AL~~gdi 79 (268) T COG1464 1 MKKLKKLALALVALLALAACGAAAAKATKTIKVGATPGPHAEILEVVVKPALKKK-GLDLKIVEFTDYVQPNEALADGDI 79 (268) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHC-CCEEEEEEECCCCCHHHHHHCCCC T ss_conf 9438899999999999975223113457717996368965999999987788855-965999981487665578766884 Q ss_pred EEEEEEECCCCHHHHHHHHHC--CCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCC--CCCCCC Q ss_conf 399984046413566665530--4702301356685356786344641234553565555665442013578--543200 Q gi|254780342|r 77 DIFMGYWYPSLEKFIAPYLEE--GSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIE--PGNEGN 152 (309) Q Consensus 77 D~~~e~w~p~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~--~~~~~~ 152 (309) |.-.=-..| ++..+..+ +.++.++..+..-. +++ ...++++++|++- ..++++ +....| T Consensus 80 DaN~FQH~p----yL~~~~k~~~~~Lv~vg~~~i~Pm-g~Y------Skk~ksl~el~~G------atIaiPNDpsN~gR 142 (268) T COG1464 80 DANAFQHKP----YLDQFNKEHGGKLVAVGNTHIEPM-GLY------SKKYKSLAELKDG------ATIAIPNDPTNEGR 142 (268) T ss_pred CCHHHHCHH----HHHHHHHHCCCCEEEEEEEEECCC-EEC------CHHCCCHHHCCCC------CEEECCCCCCCHHH T ss_conf 600121468----899999973997898755776142-501------2200868467999------98988789873567 Q ss_pred H-------------------HHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHH-----HCCC Q ss_conf 5-------------------558766403545221001036652018999999965993699941242232-----0158 Q gi|254780342|r 153 Q-------------------RILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPIN-----SDLN 208 (309) Q Consensus 153 ~-------------------~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~-----~~~d 208 (309) . ....-+...--.+.+.++.. +++-+++.+...-++ -+++.. .+-| T Consensus 143 AL~lL~~aGLIkLk~~~~~~aT~~DI~eNPK~lki~EldA-------aqlpRaLddvD~AvI---N~nyA~~AgL~p~kd 212 (268) T COG1464 143 ALLLLQKAGLIKLKDGVNLLATPKDITENPKNLKIKELEA-------AQLPRALDDVDAAVI---NTNYALQAGLNPKKD 212 (268) T ss_pred HHHHHHHCCCEEECCCCCCCCCHHHHHHCCCCCEEEECCH-------HHCCCCCCCCCEEEE---CCHHHHHCCCCCCCC T ss_conf 9999998796797688765588878861944575687056-------756532035677987---430797759991204 Q ss_pred CEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 514007865467786583544277778748899999874089989999999999 Q gi|254780342|r 209 IHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNIKFSVALENEMMKLI 262 (309) Q Consensus 209 l~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~~l~~~v 262 (309) .-++|+. +. .||--..++|+ --++.|.+.++++.++ |.+.-....... T Consensus 213 ai~~e~~-~~---spY~Niivvr~-~d~d~~~ik~lv~a~q-s~evk~~i~k~y 260 (268) T COG1464 213 ALFEEDK-DS---SPYVNIIVVRE-EDKDDPAVKKLVEAYQ-SDEVKAFIEKKY 260 (268) T ss_pred CEECCCC-CC---CCCEEEEEECC-CCCCCHHHHHHHHHHC-CHHHHHHHHHHH T ss_conf 2141466-67---86247999715-5668889999999974-999999999972 No 23 >TIGR01728 SsuA_fam ABC transporter, substrate-binding protein, aliphatic sulfonates family; InterPro: IPR010067 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Members of this family are substrate-binding periplasmic proteins of ABC transporters. This subfamily includes SsuA, a member of a transporter operon needed to obtain sulphur from aliphatic sulphonates. Related proteins include taurine (NH2-CH2-CH2-S03H) binding proteins, the probable sulphate ester binding protein AtsR, and the probable aromatic sulphonate binding protein AsfC. All these families make sulphur available when Cys and sulphate levels are low.; GO: 0006790 sulfur metabolic process, 0006810 transport, 0016020 membrane. Probab=98.22 E-value=4.4e-05 Score=52.85 Aligned_cols=234 Identities=20% Similarity=0.197 Sum_probs=149.6 Q ss_pred EEEECCC-CCHHHHHHHHHHHHHHHCC---CCEEEEEEC-CHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHC-CCEE Q ss_conf 7994258-8699999999999996106---984899856-40578888514981399984046413566665530-4702 Q gi|254780342|r 28 VRFADTG-WTDIAATTAMTSVILEEIL---GYKTNIKLL-AVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEE-GSIK 101 (309) Q Consensus 28 I~ig~~~-wte~~v~a~i~~~iLE~~l---Gy~Ve~~~~-~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~-~~~~ 101 (309) |+||+.. |......-..-+.++|+.+ |.+|+.+.. +.+...++|..|.||+---.-.|...... .. ..+. T Consensus 1 ~~~gy~~~~~~~~~~~~~~~g~~~~~~~~~g~~~~~~~f~~g~~~~~~l~~g~~d~g~~g~~p~~~~~~----~g~~~~~ 76 (311) T TIGR01728 1 VRIGYQPNLGHSPLLLAKEKGLLEKEGGKEGTKVEWVEFPGGPPELEALGAGSLDFGYIGPGPALNAYA----AGNADIK 76 (311) T ss_pred CEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCHHHHHHHCCCCCCCCCCCCCHHEEECC----CCCCCEE T ss_conf 921133773128999986400134403656612478862675477775205650014336731201000----3676467 Q ss_pred EECCCCCC-CEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCC------CCCCCEEEECC Q ss_conf 30135668-535678634464123455356555566544201357854320055587664035------45221001036 Q gi|254780342|r 102 LVAENLQG-AKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNK------FSLKGFRLIEA 174 (309) Q Consensus 102 ~~~~~~~~-~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~------l~~~~~~~~~~ 174 (309) .++....+ ......+.. ...++++.|| +++.++.+.+..........+...+ ++.....+... T Consensus 77 ~~~~~~~~~~~~~~~~~~---~~~i~~~~dl-------kGk~~a~~~g~~~~~~~~~~l~~~ggl~~~~~~~~d~~~~~~ 146 (311) T TIGR01728 77 AVGLSSDGPSATALVVGK---GSGIRSVKDL-------KGKRIAVPKGGSGHDLLLRALLKAGGLEDNLLSGDDVDILEL 146 (311) T ss_pred EEEEECCCCCCEEEEECC---CCCCCHHHHC-------CCCEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEC T ss_conf 998525776403566336---6773014334-------774566306661578999999973472000368763155402 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCC--CEEECCCCCCCCCCH--HHHHHHHCHHHHHHCHHHHHH-HHHC- Q ss_conf 6520189999999659936999412422320158--514007865467786--583544277778748899999-8740- Q gi|254780342|r 175 SELASFSQIRRDQRNNIPAVFLSWEPHPINSDLN--IHYLPGGEEISGFGE--ASVYTVVRSDYLDKCPNISRL-LKNI- 248 (309) Q Consensus 175 s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~d--l~~Ledp~~~~~~p~--~~~~~vv~~~~~~~~P~~~~~-L~~~- 248 (309) ... ....|+.+|+++.+-.|.|.+.....+ -+++-+....+. +. .....+++++|.+.||+...- ++.+ T Consensus 147 ~~~----~~~~a~~~g~~da~~~~~P~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~ 221 (311) T TIGR01728 147 GPS----DARAALAAGQVDAWGIWEPWLSALVEEGGARVLLNGEGIGL-PEWLVPGFLVVREEFAEEHPEQVERLLKVLV 221 (311) T ss_pred CCH----HHHHHHHCCCCCEEECCCCCHHHHHHHCCCEEEEECCCCCC-CCCCCCEEEEECHHHHHHCHHHHHHHHHHHH T ss_conf 604----56776540462212116871445432226247861456576-5310120034206787515488999999999 Q ss_pred ---CCCHHHHHHHHHHHHHCCC-CHHHHHHHHHHHH Q ss_conf ---8998999999999984599-9899999999985 Q gi|254780342|r 249 ---KFSVALENEMMKLILNNKQ-DRQFVGRTMLRTH 280 (309) Q Consensus 249 ---~ls~~~~~~l~~~v~~~g~-~~~~vA~~wL~~n 280 (309) .+-.+.-.+....+..+-. -+....+.++.+. T Consensus 222 ~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 257 (311) T TIGR01728 222 KATKWAEENPDESAEILAKELGDLPDEVLPEEVLNG 257 (311) T ss_pred HHHHHHHHCHHHHHHHHHHHHHCCCHHHHHHHHCCC T ss_conf 999988618657899999985102523456654055 No 24 >TIGR03227 PhnS 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate binding protein. This ABC transporter periplasmic substrate binding protein component is found in a region of the salmonella typhimurium LT2 genome responsible for the catabolism of 2-aminoethylphosphonate via the phnWX pathway (GenProp0238). The protein contains a match to pfam01547 for the "Bacterial extracellular solute-binding protein" domain. Probab=98.21 E-value=0.00054 Score=45.78 Aligned_cols=284 Identities=14% Similarity=0.074 Sum_probs=124.6 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCEEE-ECCCCC-HHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHH----CCC Q ss_conf 57999999999998630358345883799-425886-999999999999961069848998564057888851----498 Q gi|254780342|r 2 YKILAVCLFLTTFSISYARDADSCTPVRF-ADTGWT-DIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLK----NKG 75 (309) Q Consensus 2 kk~~~~~~~~~~~~~~~~~~~~~~~~I~i-g~~~wt-e~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~----~G~ 75 (309) ||+ +++..+.++..+.+..+....+|++ .+-++. .+....+-+--..|+..|++|+++.+++..+..-|. +-+ T Consensus 13 ~~~-~~~~~~~~~~~~~~~~a~a~~~~~vy~~~~~~~~~~~~~~~~~~aFek~tGikV~~v~~~sge~laRl~aEk~nPq 91 (367) T TIGR03227 13 RRF-ALAAGAAALLQGAAAPAQAAAVVLVYSADGLEDGDNSLYQDQFDAFEKAEGIKVNIVEAGGGEVVERAAKEKGNPK 91 (367) T ss_pred HHH-HHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHHHCCCCCC T ss_conf 999-9999999998642355545865999964664344434688876888886698389998977999999995227997 Q ss_pred CEEEEEEECCCCHHHHHHHHHCCCEEEE------------------CCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHH Q ss_conf 1399984046413566665530470230------------------1356685356786344641234553565555665 Q gi|254780342|r 76 IDIFMGYWYPSLEKFIAPYLEEGSIKLV------------------AENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKE 137 (309) Q Consensus 76 iD~~~e~w~p~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~ 137 (309) +|+..-. +. .......++.+... ...+.....++.+.......--++|.||.++ . T Consensus 92 aDVv~~~--d~---~~~~A~~~GLl~py~~~~~~~ip~~~k~~dg~~~~~~~~~~~~~yN~~~~k~~PkSW~DL~dP--~ 164 (367) T TIGR03227 92 ADVIVTA--PP---FIQQAAAEGLLANFNSDAAKAIPAIAKAADGLWAPFVKNYFSFAINPKLLKSAPASFADLLDA--D 164 (367) T ss_pred CEEEEEC--CH---HHHHHHHCCCCCCCCCCCHHHCCHHHHCCCCCEEEEECCEEEEEEEHHHHCCCCCCHHHHHCC--C T ss_conf 2099978--57---999999788956278966455777663779838988555799999857705799889987285--3 Q ss_pred HHHCCCCCCCCCCCCH--HHHHHHHHCCCCCCCE--------E--EECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHH- Q ss_conf 4420135785432005--5587664035452210--------0--1036652018999999965993699941242232- Q gi|254780342|r 138 LGAKIYGIEPGNEGNQ--RILDMINNNKFSLKGF--------R--LIEASELASFSQIRRDQRNNIPAVFLSWEPHPIN- 204 (309) Q Consensus 138 ~~~~~~g~~~~~~~~~--~~~~~~~~~~l~~~~~--------~--~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~- 204 (309) ++++++..+|...... ...-....+|-....+ + ....+ ...+...+.+||..+...+...... T Consensus 165 ykgkI~~~dP~tS~~g~~~l~~~~~~~G~eD~a~~~~~~L~~N~~~~~~s----~~~~~~~~~~GE~~va~g~~~~~~~~ 240 (367) T TIGR03227 165 FKGKLAYSNPAQAADGMAVIILAFALFGSEDAAFAYLAKLEANNKFHSAG----TGKLNALLNKGEIAVANGDLQMDLAD 240 (367) T ss_pred CCCEEEECCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCEECCC----CHHHHHHHHCCCEEEEEEECCHHHHH T ss_conf 37828952877661799999999997098538899999887448672478----17999998578678995202043677 Q ss_pred H---CCCCEEECCCCCCCCCCHHHHHHHHCHHHH--HHCHHHHHHHHHCCCCHHHHHHHHHHH-----HHCCCCHHHHH- Q ss_conf 0---158514007865467786583544277778--748899999874089989999999999-----84599989999- Q gi|254780342|r 205 S---DLNIHYLPGGEEISGFGEASVYTVVRSDYL--DKCPNISRLLKNIKFSVALENEMMKLI-----LNNKQDRQFVG- 273 (309) Q Consensus 205 ~---~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~--~~~P~~~~~L~~~~ls~~~~~~l~~~v-----~~~g~~~~~vA- 273 (309) . .+++.+. -|....+.++..+.....-... .+||+.++-|=++-+|.+-+..+.... ..+-.-+.+.+ T Consensus 241 ~~~~g~~v~~v-~P~~~~g~~~g~~~i~~~i~ivKga~n~e~AkkfidflLS~e~Q~~~a~~~~~~Pv~~~v~l~~~~~~ 319 (367) T TIGR03227 241 AEHGGLNIKIF-FPAADAGEPPSAFAIPYAIGLVKGAPNQDAGKKLIDFLLSADAQAKVPDLSFGIPGRSDVPLSDEHGE 319 (367) T ss_pred HHHCCCCCEEE-EECCCCCCCCCEEEEEEHHHHHCCCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCHHHHH T ss_conf 76279970289-54677788873487522034644999989999999997299999999986016767788769835554 Q ss_pred ---------H------HHHHHHHHHH-HHHHCCCCCCCCHH Q ss_conf ---------9------9999859999-98754754567713 Q gi|254780342|r 274 ---------R------TMLRTHPDLL-KNWLIGVTTFDGQD 298 (309) Q Consensus 274 ---------~------~wL~~n~~~~-~~Wl~~~~~~~~~~ 298 (309) + +++..|++.| ++|-.-|.-.-|++ T Consensus 320 ~~~~~~~~~kl~~~Dw~~i~~~r~~wi~rW~~eV~~~~~~~ 360 (367) T TIGR03227 320 AAKAAIAGVKLIPPDWDAVAAKKPADIERWHKEVIGDSGKQ 360 (367) T ss_pred HCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCE T ss_conf 22133233511158999999878999999999857666864 No 25 >COG0725 ModA ABC-type molybdate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=98.14 E-value=0.00064 Score=45.30 Aligned_cols=186 Identities=12% Similarity=0.036 Sum_probs=116.0 Q ss_pred CCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCC-CCEEEEEEECCCCHHHHHHHHHCCCE Q ss_conf 34588379942588699999999999996106984899856405788885149-81399984046413566665530470 Q gi|254780342|r 22 ADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNK-GIDIFMGYWYPSLEKFIAPYLEEGSI 100 (309) Q Consensus 22 ~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G-~iD~~~e~w~p~~~~~~~~~~~~~~~ 100 (309) ..++.+|+|.... +=..++.++.++.-++. |.+|+...+++..+.+-+..| ..|++...-.. ...+....+.+ T Consensus 26 ~~~~~~i~VfAAa-SL~~~l~~i~~~F~~~~-~~~V~~~f~gS~~l~~qIe~Ga~~D~fiSa~~~----~~~~l~~~g~~ 99 (258) T COG0725 26 AQEAATITVFAAA-SLTDALEEIAKQFEKET-GVKVEVEFGGSGALARQIEQGAPADLFISADDA----YMDKLEDKGLI 99 (258) T ss_pred CCCCCEEEEEEEH-HHHHHHHHHHHHHHHHH-CCEEEEEECCHHHHHHHHHCCCCCCEEEECCHH----HHHHHHHCCCC T ss_conf 4567409999815-66899999999999987-987999961389999999759986879988888----89999866885 Q ss_pred EE-ECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCE-EEECCCCCC Q ss_conf 23-0135668535678634464123455356555566544201357854320055587664035452210-010366520 Q gi|254780342|r 101 KL-VAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGF-RLIEASELA 178 (309) Q Consensus 101 ~~-~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~s~~~ 178 (309) .. -...+..++-.+++++...... .+|.++.+.. ...--+|.+.--.+.......++.-++-.... .++.. . T Consensus 100 ~~~~~~~fa~n~lvl~~~~~~~~~~-~~~~~l~~~~--~~~lai~~p~~~P~G~ya~~~l~~~g~~~~~~~k~v~~---~ 173 (258) T COG0725 100 YADSRIVFAGNRLVLAVPKGSKKKI-ESLEDLLERP--DVRLAIGDPKTVPAGKYAKEALELLGLWYTLKDKLVLA---T 173 (258) T ss_pred CCCCEEEEECCEEEEEEECCCCCCC-CHHHHHHCCC--CCEEEECCCCCCCCHHHHHHHHHHHCHHHHCCCCEEEC---C T ss_conf 6671577425748999868874565-1298873087--85799648876971499999999705145336754653---6 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEECCCCCCC Q ss_conf 18999999965993699941242232015851400786546 Q gi|254780342|r 179 SFSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYLPGGEEIS 219 (309) Q Consensus 179 ~~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~Ledp~~~~ 219 (309) ...++..-+++|+.+..++|.+++........+..-|...+ T Consensus 174 ~v~~~l~~V~~G~ad~g~vy~sd~~~~~~~~~~~~~~~~~~ 214 (258) T COG0725 174 NVRQALAYVETGEADAGFVYVSDALLSKKVKIVGVFPEDLH 214 (258) T ss_pred CHHHHHHHHHCCCCCEEEEEEEHHHHCCCCEEEEECCCCCC T ss_conf 57989999863787779999763340677608997356668 No 26 >COG4143 TbpA ABC-type thiamine transport system, periplasmic component [Coenzyme metabolism] Probab=98.14 E-value=0.00012 Score=49.92 Aligned_cols=240 Identities=13% Similarity=0.077 Sum_probs=120.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEC-CCCCHHHHHHHHHHHHHHHCCCCEEEEEE-CCHHHHHHHH-HCC--- Q ss_conf 95799999999999863035834588379942-58869999999999999610698489985-6405788885-149--- Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFAD-TGWTDIAATTAMTSVILEEILGYKTNIKL-LAVPVTFRSL-KNK--- 74 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~-~~wte~~v~a~i~~~iLE~~lGy~Ve~~~-~~~~~~~~al-~~G--- 74 (309) |+|.+.++++++++.+++++...+..++++.. -.|.+.--.+.-++.-.|...+++|+.+. ++...++.-+ .+| T Consensus 1 m~~~l~~~~~~all~~~~~~a~~~t~~LtVytydSF~~ewg~Gp~vk~~FE~~~~~~v~fV~~~d~v~llnRl~leg~~~ 80 (336) T COG4143 1 MRRLLRALIGLALLVSAALGAQAATPTLTVYTYDSFASEWGPGPKVKKAFEAEYGCKVNFVALGDGVELLNRLILEGKNP 80 (336) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEEECCCCCCHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHCCCCC T ss_conf 90249999999999998657763485389999763422568857899999998595289997075899999999707898 Q ss_pred CCEEEEEEECCCCHHHHHHHHHCCCEE-----------------EEC-CCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHH Q ss_conf 813999840464135666655304702-----------------301-35668535678634464123455356555566 Q gi|254780342|r 75 GIDIFMGYWYPSLEKFIAPYLEEGSIK-----------------LVA-ENLQGAKYMLAVNDVGFALGIKSYQDIAKYKK 136 (309) Q Consensus 75 ~iD~~~e~w~p~~~~~~~~~~~~~~~~-----------------~~~-~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~ 136 (309) .-|++.... + ..+.+....+.+. ..- ..++..+..+.+.......--+|+++|.+... T Consensus 81 ~ADvvvGld--n--~~l~~A~~~~lf~~~~~d~~~~~vp~~~~~d~f~~P~DyGy~a~vYd~~~~~~ppksL~dL~~~~y 156 (336) T COG4143 81 KADVVVGLD--N--NLLARARETGLFAPYGVDASDVPVPEGWKIDTFALPYDYGYFAFVYDKTKLKNPPKSLKDLVEPEY 156 (336) T ss_pred CCCEEEECC--H--HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECHHHHCCCCCCHHHHCCCCC T ss_conf 885899347--2--778888855876667898557888765356751012454128999765775287626898638512 Q ss_pred HHHHCCCCCCCCCCCCHH--HHHHHHHC----------CCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHH Q ss_conf 544201357854320055--58766403----------545221001036652018999999965993699941242232 Q gi|254780342|r 137 ELGAKIYGIEPGNEGNQR--ILDMINNN----------KFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPIN 204 (309) Q Consensus 137 ~~~~~~~g~~~~~~~~~~--~~~~~~~~----------~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~ 204 (309) . ++++--+|....... ..=....| .|......+.. -|.+.|.++.+||..+..-|+||... T Consensus 157 ~--~~ii~qdPrtSspGl~fLlW~~svyg~d~~~~~W~kL~~~~v~v~~-----gWSeaY~aFt~GEap~VLSYtTspay 229 (336) T COG4143 157 A--GKIIYQDPRTSSPGLGFLLWTISVYGEDGAPEAWAKLADNGVTVTK-----GWSEAYGAFTKGEAPLVLSYTTSPAY 229 (336) T ss_pred C--CCEEECCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCEEECC-----CHHHHHHHHHCCCCCEEEEECCCCCH T ss_conf 6--9578069887884078899998763623189999999858868658-----67988988747865579986068221 Q ss_pred HCCCCE-----EECCCCCCCCCCHHHHHHHHCHHHH-HHCHHHHHHHHHCCCCHHHHHH Q ss_conf 015851-----4007865467786583544277778-7488999998740899899999 Q gi|254780342|r 205 SDLNIH-----YLPGGEEISGFGEASVYTVVRSDYL-DKCPNISRLLKNIKFSVALENE 257 (309) Q Consensus 205 ~~~dl~-----~Ledp~~~~~~p~~~~~~vv~~~~~-~~~P~~~~~L~~~~ls~~~~~~ 257 (309) ..+.=. -...+++.+.+.+. +. .++ .++|++++-+=+|-||.+-+.. T Consensus 230 ~~~~~~~~~~~a~~f~eG~ylqiEg----aa--~~k~~knp~LA~~F~~FmlS~e~Q~~ 282 (336) T COG4143 230 HVYPEKKDRYAAALFPEGHYLQVEG----AA--VLKGAKNPELADKFLQFMLSPEFQDA 282 (336) T ss_pred HCCCCCCCCCCHHCCCCCCEEEEEE----EE--ECCCCCCHHHHHHHHHHHHCHHHHHH T ss_conf 0253200221001178885367766----53--11279987999999999709877840 No 27 >PRK09501 potD spermidine/putrescine ABC transporter periplasmic substrate-binding protein; Reviewed Probab=98.13 E-value=0.00017 Score=49.09 Aligned_cols=256 Identities=11% Similarity=0.078 Sum_probs=120.9 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC-CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHH-HHHHHHHC---CC Q ss_conf 9579999999999986303583-4588379942588699999999999996106984899856405-78888514---98 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA-DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVP-VTFRSLKN---KG 75 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~-~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~-~~~~al~~---G~ 75 (309) |||+...++.++.++++.++.. ..+++++| -+| +.++--.++++- |+++|++|++...+++ .++.-|+. |. T Consensus 1 ~~~~~~~ll~~~a~~l~~~~~~a~~~~~L~i--~~W-~~Yi~~~~i~~F-e~etGikV~~~~~~s~ee~~akL~a~~~~~ 76 (348) T PRK09501 1 MKKWSRHLLAAGALALGMSAAHADDNNTLYF--YNW-TEYVPPGLLEQF-TKETGIKVIYSTYESNETMYAKLKTYKDGA 76 (348) T ss_pred CHHHHHHHHHHHHHHHHCCHHHHCCCCEEEE--ECC-CCCCCHHHHHHH-HHHHCCEEEEEECCCHHHHHHHHHCCCCCC T ss_conf 9368999999999997350335433897999--871-001897899999-999898899995499999999997179998 Q ss_pred CEEEEEEE-------------------CCCCHHHHHHHHHCCCE---EEECCCCCCCEEEEEEEHHHHH-CCCCCHHHHH Q ss_conf 13999840-------------------46413566665530470---2301356685356786344641-2345535655 Q gi|254780342|r 76 IDIFMGYW-------------------YPSLEKFIAPYLEEGSI---KLVAENLQGAKYMLAVNDVGFA-LGIKSYQDIA 132 (309) Q Consensus 76 iD~~~e~w-------------------~p~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~v~~~~~~-~~i~~~~dl~ 132 (309) .|+..-.- .|+.... .+...+... ...+..+.-...++.+....-. ..+.+|.+|- T Consensus 77 yDvv~ps~~~v~~l~~~gll~~LD~s~ipN~~nl-~p~~~~~~~dp~~~y~vPy~wGt~Gi~yn~d~v~~~~~~SW~dL~ 155 (348) T PRK09501 77 YDLVVPSTYYVDKMRKEGMIQKIDKSKLTNFSNL-DPDMLNKPFDPNNDYSIPYIWGATAIGVNGDAVDPKSVTSWADLW 155 (348) T ss_pred CEEEEECHHHHHHHHHCCCCCCCCHHHCCCHHHC-CHHHHCCCCCCCCCEEEEEEEECCEEEEEECCCCCCCCCCHHHHC T ss_conf 2899979689999997798454784228481226-976744774899848988997511599850114788888789854 Q ss_pred HHHHHHHHCCCCCC----------------CCCCCCHHHHHHHHH-CCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEE Q ss_conf 55665442013578----------------543200555876640-3545221001036652018999999965993699 Q gi|254780342|r 133 KYKKELGAKIYGIE----------------PGNEGNQRILDMINN-NKFSLKGFRLIEASELASFSQIRRDQRNNIPAVF 195 (309) Q Consensus 133 ~~~~~~~~~~~g~~----------------~~~~~~~~~~~~~~~-~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~ 195 (309) +.+ +++++.-.+ +............+. ..+......+ .++ .....+.+|+..+. T Consensus 156 dp~--~kGki~~~d~~~~~~~~al~~lG~~~n~~d~~~l~~a~~~L~~~~~~i~~~--~sd-----~~~~~l~~Gev~ia 226 (348) T PRK09501 156 KPE--YKGSLLLTDDAREVFQMALRKLGYSGNTTDPKEIEAAYNELKKLMPNVAAF--NSD-----NPANPYMEGEVNLG 226 (348) T ss_pred CHH--HCCEEEECCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCEEEE--ECC-----CHHHHHHCCCCEEE T ss_conf 966--479079716678999999998279988899999999999999851105898--177-----05469873883299 Q ss_pred EEECCCHHHHCCCCEEEC--CCC-CCCCCCHHHHHHHHCHHHHHHCHH-HHHHHHHCCCCHHHHHHHHHHHHHCCCCHHH Q ss_conf 941242232015851400--786-546778658354427777874889-9999874089989999999999845999899 Q gi|254780342|r 196 LSWEPHPINSDLNIHYLP--GGE-EISGFGEASVYTVVRSDYLDKCPN-ISRLLKNIKFSVALENEMMKLILNNKQDRQF 271 (309) Q Consensus 196 ~~wsp~~~~~~~dl~~Le--dp~-~~~~~p~~~~~~vv~~~~~~~~P~-~~~~L~~~~ls~~~~~~l~~~v~~~g~~~~~ 271 (309) .+|+.+....+-.-..++ -|+ +...+ .+...+.+ -..||+ +++|++- -++.+...++...+. --+|.. T Consensus 227 ~~wsgda~~~~~~~~~i~~v~PkEG~~~w--~D~~~Ipk---~A~n~e~A~~FInf-~l~Pevaa~~a~~~~--y~t~n~ 298 (348) T PRK09501 227 MIWNGSAFVARQAGTPIDVVWPKEGGIFW--MDSLAIPA---NAKNKEGALKLINF-LLRPDVAKQVAETIG--YPTPNL 298 (348) T ss_pred EEECHHHHHHHHCCCCCCEEECCCCCEEE--EEEEEEEC---CCCCHHHHHHHHHH-HHCHHHHHHHHHHHC--CCCHHH T ss_conf 87405899999708997189757886699--89988989---99898999999998-619799999999748--898308 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254780342|r 272 VGRTMLR 278 (309) Q Consensus 272 vA~~wL~ 278 (309) .|++.|. T Consensus 299 ~A~~~l~ 305 (348) T PRK09501 299 AARKLLS 305 (348) T ss_pred HHHHHCC T ss_conf 8886189 No 28 >COG1638 DctP TRAP-type C4-dicarboxylate transport system, periplasmic component [Carbohydrate transport and metabolism] Probab=97.96 E-value=0.0019 Score=42.28 Aligned_cols=277 Identities=15% Similarity=0.129 Sum_probs=136.3 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCC--CCEEEECCC--CCHHHHHHHHHHHHHHHCCCCEEEEEE------CCHHHHHHH Q ss_conf 9579999999999986303583458--837994258--869999999999999610698489985------640578888 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSC--TPVRFADTG--WTDIAATTAMTSVILEEILGYKTNIKL------LAVPVTFRS 70 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~--~~I~ig~~~--wte~~v~a~i~~~iLE~~lGy~Ve~~~------~~~~~~~~a 70 (309) ||++...++.++++.+++++..... -+++.++.. .+-........++.+++++|.++++.. |+.....++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~l~~~~~~~~~~p~~~~~~~fa~~v~ekt~G~l~i~vfP~~qLG~~~~~ie~ 80 (332) T COG1638 1 MMKLLALLLALLLAAAALASAAAAAGALVLRFSHVTPEGHPKGKAAKKFAELVEEKTGGRLKIEVFPNSQLGGEAEMIEQ 80 (332) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHH T ss_conf 95137899999999999777787401548875235799880789999999999987199389997667566876999999 Q ss_pred HHCCCCEEEEEEE------CCC------------CH-----------HHHHHHHHCCCEEEECCCCCCCEEEEEEEHHHH Q ss_conf 5149813999840------464------------13-----------566665530470230135668535678634464 Q gi|254780342|r 71 LKNKGIDIFMGYW------YPS------------LE-----------KFIAPYLEEGSIKLVAENLQGAKYMLAVNDVGF 121 (309) Q Consensus 71 l~~G~iD~~~e~w------~p~------------~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~ 121 (309) |+.|.||+....- .|. .+ ..+.+..++.++..++....|.+..... T Consensus 81 l~~G~id~~~~s~~~l~~~~P~~~v~~lPflf~d~~~~~~~~~~~~g~~l~~~~e~~g~~~l~~~~~G~R~~t~~----- 155 (332) T COG1638 81 LRSGTLDIGVVSLGFLAGLVPEFGVFDLPFLFRDEEHARRVLDSEFGEELLKSLEAKGLKGLAFWENGFRQFTSN----- 155 (332) T ss_pred HHCCCCEEEECCCHHHCCCCCCCEEECCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCEEEEEC----- T ss_conf 965973078506523211387501224875518999999997138899999999976957999862752233346----- Q ss_pred HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 12345535655556654420135785432005558766403545221001036652018999999965993699941242 Q gi|254780342|r 122 ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPH 201 (309) Q Consensus 122 ~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~ 201 (309) ..-|.+.+||+ +..+=+ ..+.......+..|.. .++.+ ++++|.|++.|-++-...=.++ T Consensus 156 k~PI~~peDlk-------GlkiRv----~~s~~~~~~~~a~GA~-----P~pm~----f~Evy~aLqtGvVDGqEnp~~~ 215 (332) T COG1638 156 KRPIKTPEDLK-------GLKIRV----PQSPLLLAMFKALGAN-----PTPMP----FAEVYTALQTGVVDGQENPLSN 215 (332) T ss_pred CCCCCCHHHHC-------CCEEEC----CCCHHHHHHHHHCCCC-----CCCCC----HHHHHHHHHCCCCCCCCCCHHH T ss_conf 78778968967-------984635----8988999999986998-----97788----8999999974970044176766 Q ss_pred HHHHC-CCC-EEECCCCCCCCCCHHHHHHHHCHHHHHHC-HHHHHHHHHC---------CCCHHHHHHHHHHHHHCCCCH Q ss_conf 23201-585-14007865467786583544277778748-8999998740---------899899999999998459998 Q gi|254780342|r 202 PINSD-LNI-HYLPGGEEISGFGEASVYTVVRSDYLDKC-PNISRLLKNI---------KFSVALENEMMKLILNNKQDR 269 (309) Q Consensus 202 ~~~~~-~dl-~~Ledp~~~~~~p~~~~~~vv~~~~~~~~-P~~~~~L~~~---------~ls~~~~~~l~~~v~~~g~~~ 269 (309) ..-.+ |.. +++-.+.+. +.++ ..++++.+-+.- |+..++|.+. ++..+...++...+...|... T Consensus 216 i~~~k~~EVqky~t~tnH~--~~~~--~~~~s~~~w~~L~~e~q~il~~aa~e~~~~~~~~~~~~e~~~~e~lk~~Gv~v 291 (332) T COG1638 216 IYSAKLYEVQKYLTLTNHI--YLPL--AVLVSKAFWDSLPEEDQTILLEAAKEAAEEQRKLVEELEDELLEKLKEAGVEV 291 (332) T ss_pred HHCCCHHHHHHHHHHCCCC--CCCE--EEEECHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEE T ss_conf 6104265675665215665--5542--68973898830999999999999999999999999998999999999779989 Q ss_pred HH--HHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 99--999999985999998754754567713799999874 Q gi|254780342|r 270 QF--VGRTMLRTHPDLLKNWLIGVTTFDGQDPSRQLERFM 307 (309) Q Consensus 270 ~~--vA~~wL~~n~~~~~~Wl~~~~~~~~~~~~~~~~~~~ 307 (309) -+ ..++|-+.-...++.|.+-.- .-|++..++++... T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~l~e~~~~~~ 330 (332) T COG1638 292 VEPDAAEAFREAAKPVYDEFAKKAG-EIGAELIEAIEAAR 330 (332) T ss_pred ECCCCHHHHHHHHHHHHHHHHHHCC-HHHHHHHHHHHHHH T ss_conf 7477369999999899999986441-55799999999873 No 29 >PRK11917 bifunctional adhesin/ABC transporter aspartate/glutamate-binding protein; Reviewed Probab=97.78 E-value=0.00083 Score=44.57 Aligned_cols=191 Identities=13% Similarity=0.084 Sum_probs=101.6 Q ss_pred CCCEEEEC----CCCCH-----HHHH---HHHHHHHHHHCC--CCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHH Q ss_conf 88379942----58869-----9999---999999996106--9848998564057888851498139998404641356 Q gi|254780342|r 25 CTPVRFAD----TGWTD-----IAAT---TAMTSVILEEIL--GYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKF 90 (309) Q Consensus 25 ~~~I~ig~----~~wte-----~~v~---a~i~~~iLE~~l--Gy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~ 90 (309) .+++++|. ++|.- +.+. -.|...+-++-+ +.+++.++.+.....++|.+|.+|+.......+. T Consensus 37 ~G~L~vg~~~~~pPf~~~d~~~g~~~GfDvDla~~iAk~llg~~~~v~~v~~~~~~~~p~L~~gkvD~i~~~~tiT~--- 113 (259) T PRK11917 37 KGQLIVGVKNDVPHYALLDQATGEIKGFEIDVAKLLAKSILGDDKKIKLVAVNAKTRGPLLDNGSVDAVIATFTITP--- 113 (259) T ss_pred CCEEEEEECCCCCCCCEEECCCCCEEEEEHHHHHHHHHHHCCCCCEEEEEEECHHHHHHHHHCCCEEEEEECCCCCC--- T ss_conf 99899998789788620567999678887999999999965998569999706787768987798559986647676--- Q ss_pred HHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEE Q ss_conf 66655304702301356685356786344641234553565555665442013578543200555876640354522100 Q gi|254780342|r 91 IAPYLEEGSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFR 170 (309) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~ 170 (309) ++...+.....|......+.+++. .++++++||. ++.+|...+..............+......+ T Consensus 114 -----eR~k~vdFS~PY~~~~~~ilv~k~---~~i~~~~DL~-------gk~Vgv~~gst~~~~~~~~~~~~~~~~~~~~ 178 (259) T PRK11917 114 -----ERKRIYNFSEPYYQDAIGLLVLKE---KNYKSLADMK-------GANIGVAQAATTKKAIGEAAKKIGIDVKFSE 178 (259) T ss_pred -----CHHCCEEECCCEEEEEEEEEEECC---CCCCCHHHHC-------CCCEEEEECCCHHHHHHHHHHHCCCCEEEEE T ss_conf -----543057622476996559999899---9999989958-------9817999278469999998886088516996 Q ss_pred EECCCCCCHHHHHHHHHHCCCCEEEEEECCCHH-HHCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC Q ss_conf 103665201899999996599369994124223-201585140078654677865835442777787488999998740 Q gi|254780342|r 171 LIEASELASFSQIRRDQRNNIPAVFLSWEPHPI-NSDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI 248 (309) Q Consensus 171 ~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~-~~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~ 248 (309) + +. ..++..++++|+.+.+..=.+--. +..-..+.+.++ + .| ......+|+ .+|+..+.+++. T Consensus 179 ~-~~-----~~~~~~~l~~grvDa~~~d~~~~~~~~~~~~~~~~~~---~-~~-~~~giavrK----gd~~l~~~vn~~ 242 (259) T PRK11917 179 F-PD-----YPSIKAALDAKRVDAFSVDKSILLGYVDDKSEILPDS---F-EP-QSYGIVTKK----DDPAFAKYVDDF 242 (259) T ss_pred C-CC-----HHHHHHHHHCCCEEEEECCHHHHHHHHHCCCCCCCCC---C-CC-CEEEEEEEC----CCHHHHHHHHHH T ss_conf 4-99-----9999999984980299826999999986198016888---8-75-258999948----998999999999 No 30 >TIGR03261 phnS2 putative 2-aminoethylphosphonate ABC transport system, 1-aminoethylphosphonate-binding protein component. This ABC transporter extracellular solute-binding protein is found in a number of genomes in operon-like contexts strongly suggesting a substrate specificity for 2-aminoethylphosphonate (2-AEP). The characterized PhnSTUV system is absent in the genomes in which this system is found. These genomes encode systems for the catabolism of 2-AEP, making the need for a 2-AEP-specific transporter likely. Probab=97.74 E-value=0.0044 Score=39.84 Aligned_cols=237 Identities=14% Similarity=0.101 Sum_probs=108.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHC-CCCEEEEEECCHHHHHHHH----HCCCCEE Q ss_conf 9999999999986303583458837994258869999999999999610-6984899856405788885----1498139 Q gi|254780342|r 4 ILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEI-LGYKTNIKLLAVPVTFRSL----KNKGIDI 78 (309) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~-lGy~Ve~~~~~~~~~~~al----~~G~iD~ 78 (309) +++.++++++++++++ +.+.++|+|=+. ..+. ....|+.. .|+. .|++|++..+++..+..-| .+...|+ T Consensus 2 ~~~~~~~~~~~~~a~a--a~~~~~l~VYss-~~~~-~~~~i~~~-Fek~~tgIkV~~v~~~tgel~~rl~aEk~~p~aDV 76 (334) T TIGR03261 2 VVAGFIMTSLFFSACN--SKANTELTVYTA-IEDE-LIAKYKDA-FEKVNPDIKINWVRDSTGIITAKLLAEKNNPQADV 76 (334) T ss_pred HHHHHHHHHHHHHHHH--HCCCCEEEEEEC-CCHH-HHHHHHHH-HHHHCCCCEEEEEECCHHHHHHHHHHCCCCCCCEE T ss_conf 8899999999998310--302995999969-9888-99999999-99777994899998972999999996457998659 Q ss_pred EEEEECCCCHHHHHHHHHCCCE--------EEECCCC---CC---------CEEEEEEEHHH-HHCCC---CCHHHHHHH Q ss_conf 9984046413566665530470--------2301356---68---------53567863446-41234---553565555 Q gi|254780342|r 79 FMGYWYPSLEKFIAPYLEEGSI--------KLVAENL---QG---------AKYMLAVNDVG-FALGI---KSYQDIAKY 134 (309) Q Consensus 79 ~~e~w~p~~~~~~~~~~~~~~~--------~~~~~~~---~~---------~~~~~~v~~~~-~~~~i---~~~~dl~~~ 134 (309) ....-.+.... ...++.+ ..+...+ .+ ...++++.... ...++ ++|.||.++ T Consensus 77 ~~g~~~~~~~~----~~~~gll~~y~~~~~~~l~~~~~d~~~~~~~~g~~~~~~~i~yN~~~~~~~~~p~P~sw~DL~dp 152 (334) T TIGR03261 77 VWGLAASSLAL----LDKEGMLKPYKPKGLDALNPKFRDAKNPPHWVGMDAWMAAICFNTVEAKKKGLPKPTSWEDLTKP 152 (334) T ss_pred EEECCHHHHHH----HHHCCCCCCCCCCCHHHCCHHHCCCCCCCEEEEEEEEEEEEEEEHHHCCCCCCCCCCCHHHHCCH T ss_conf 98456578999----88604315777530544788652978885168875246789974444042589999999997094 Q ss_pred HHHHHHCCCCCCCCCCCCHH--HHHHHHHCCC--------CC-CCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHH Q ss_conf 66544201357854320055--5876640354--------52-2100103665201899999996599369994124223 Q gi|254780342|r 135 KKELGAKIYGIEPGNEGNQR--ILDMINNNKF--------SL-KGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPI 203 (309) Q Consensus 135 ~~~~~~~~~g~~~~~~~~~~--~~~~~~~~~l--------~~-~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~ 203 (309) . +++++....|....... ....+..+|- .+ ........+. ......+.+|+..+...|..+.. T Consensus 153 ~--ykg~i~~~~p~~sg~~~~~l~~~~~~~G~d~g~~~l~~l~~N~~~~~~~~----~~~~~~v~~Ge~~ig~~~~~~~~ 226 (334) T TIGR03261 153 E--YKGHIVMPNPASSGTGFLDVSAWLQMMGEDKGWDYMDKLHKNIAVYTHSG----SKPCKLAGMGEFPIGISMAYRAL 226 (334) T ss_pred H--HCCEEECCCCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHCCCCCCC----CHHHHHHHCCCEEEEEEECCHHH T ss_conf 7--48958734888633899999999997352889999999986324167776----38889986598579997354578 Q ss_pred HHCCCCEEE--CCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHHHHH Q ss_conf 201585140--07865467786583544277778748899999874089989999999 Q gi|254780342|r 204 NSDLNIHYL--PGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNIKFSVALENEMM 259 (309) Q Consensus 204 ~~~~dl~~L--edp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~~l~ 259 (309) ..+-+-.-+ -.|++-..+....+. ++. -.+||++++.|=.+-+|.+.+..+. T Consensus 227 ~~~~~g~pv~~v~P~eG~~~~~~~~~-Ivk---~a~n~~~Ak~FidfllS~e~Q~~~a 280 (334) T TIGR03261 227 KEKKKGAPIDVVFPKEGLGWDIEATA-IIK---GSKNNDAAKKLVDWSISDEAMELYA 280 (334) T ss_pred HHHHCCCCEEEEECCCCCEEEEEEEE-EEC---CCCCHHHHHHHHHHHCCHHHHHHHH T ss_conf 87643897338945898478773588-887---9989899999999976999999999 No 31 >COG2998 TupB ABC-type tungstate transport system, permease component [Coenzyme metabolism] Probab=97.64 E-value=0.0036 Score=40.43 Aligned_cols=232 Identities=12% Similarity=0.079 Sum_probs=116.8 Q ss_pred HCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHC Q ss_conf 03583458837994258869999999999999610698489985640578888514981399984046413566665530 Q gi|254780342|r 18 YARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEE 97 (309) Q Consensus 18 ~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~ 97 (309) .++++++...++++....++..-+=.++.-..|+..||+|+++..+|......=+.||.|+..-- .|..+..| +++ T Consensus 22 ~~a~sa~~~~LrmATTTSt~dtGLLd~l~p~fE~~~g~~v~~vAvGTG~ALkmge~gdvDvv~vH-apk~E~~f---v~~ 97 (280) T COG2998 22 YSASSADAAELRMATTTSTEDTGLLDVLLPKFEKYTGYDVQVVAVGTGKALKMGERGDVDVVIVH-APKAEKEF---VKD 97 (280) T ss_pred HHCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHCCCCEEEEEEECCHHHHHHHCCCCCCEEEEE-CCHHHHHH---HHC T ss_conf 86057542037885430245563188877766433696589998123587751126886789983-70778999---872 Q ss_pred CCEEEECCCCCCCEEEEEEEHHHHH-CCCCC---HHHHHHHHHHHHHCCCC--CCCCCCCCHHHHHHHHHCCCCCCCEEE Q ss_conf 4702301356685356786344641-23455---35655556654420135--785432005558766403545221001 Q gi|254780342|r 98 GSIKLVAENLQGAKYMLAVNDVGFA-LGIKS---YQDIAKYKKELGAKIYG--IEPGNEGNQRILDMINNNKFSLKGFRL 171 (309) Q Consensus 98 ~~~~~~~~~~~~~~~~~~v~~~~~~-~~i~~---~~dl~~~~~~~~~~~~g--~~~~~~~~~~~~~~~~~~~l~~~~~~~ 171 (309) |--+.-. .-.++.+.+..+..+ .++++ ..+-.+....-+..++. ..+|-...+. ..-+..++...+..- T Consensus 98 G~gv~r~---~vmYNdFiiVgp~~dpA~~k~~kn~~e~fe~Ia~~ka~FvSRGD~SGT~~~E~--~lWk~~g~~p~~~~w 172 (280) T COG2998 98 GFGVDRR---PVMYNDFIIVGPADDPAGIKDAKNGKEAFEKIAEEKAKFVSRGDNSGTDSKEL--SLWKVTGIEPTVKGW 172 (280) T ss_pred CCCCCCC---CEEEEEEEEECCCCCCHHCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCHHHH--HHHHHCCCCCCCCCC T ss_conf 7886676---10131179987766611112200089999999972890574278778427788--899870878898860 Q ss_pred ECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCC----CE-EECCCCCCCCCCHHHHHHHHCHH-HHH-HCHHHHHH Q ss_conf 0366520189999999659936999412422320158----51-40078654677865835442777-787-48899999 Q gi|254780342|r 172 IEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDLN----IH-YLPGGEEISGFGEASVYTVVRSD-YLD-KCPNISRL 244 (309) Q Consensus 172 ~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~d----l~-~Ledp~~~~~~p~~~~~~vv~~~-~~~-~~P~~~~~ 244 (309) ..+++.+|...+.=|-+.+- |.-|-.+.+.+|. |+ +|+.++..+ .+|.+. ++++. ..- .+.++..| T Consensus 173 Y~s~G~GMg~tL~~A~Ek~a----ytLtDrgTylaYr~r~~L~iv~~gd~~L~--N~Ysvi-~vNP~r~~~vny~~A~kf 245 (280) T COG2998 173 YISAGQGMGDTLNMANEKKA----YTLTDRGTYLAYRNRPTLVIVLEGDPSLF--NPYSVI-AVNPKRVKGVNYTAATKF 245 (280) T ss_pred EEECCCCHHHHHHHHHHHCE----EEEECCCEEEEECCCCCEEEEECCCCCCC--CCEEEE-EECHHCCCCCCCHHHHHH T ss_conf 36527445778777655121----68613630567627665289961781116--712589-876100788571189999 Q ss_pred HHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 87408998999999999984599 Q gi|254780342|r 245 LKNIKFSVALENEMMKLILNNKQ 267 (309) Q Consensus 245 L~~~~ls~~~~~~l~~~v~~~g~ 267 (309) ++-+ ..+.-.++.+..-++|+ T Consensus 246 i~w~--~s~~gq~~Ia~fkvng~ 266 (280) T COG2998 246 IEWL--MSEKGQNLIAEFKVNGQ 266 (280) T ss_pred HHHH--HHHHHHHHHHHCEECCE T ss_conf 9999--64777888865056673 No 32 >PRK11260 cystine transporter subunit; Provisional Probab=97.63 E-value=0.0016 Score=42.74 Aligned_cols=211 Identities=13% Similarity=0.060 Sum_probs=103.0 Q ss_pred CHHHHHHHHHHHHHHHHHC-CCCC--------CCCCEEEECC-CCC-HH------H---HHHHHHHHHHHHCCCCEEEEE Q ss_conf 9579999999999986303-5834--------5883799425-886-99------9---999999999961069848998 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYA-RDAD--------SCTPVRFADT-GWT-DI------A---ATTAMTSVILEEILGYKTNIK 60 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~-~~~~--------~~~~I~ig~~-~wt-e~------~---v~a~i~~~iLE~~lGy~Ve~~ 60 (309) +||+++.++.+++++.+.+ +.++ +.++|+||.. +|+ -+ . +--.+.+.+-+ .||++++++ T Consensus 5 ~~~~~~~~l~~~~~~~~~~~~~a~~~~L~~I~~~G~L~Vg~~~~~pPf~~~d~~G~~~GfdvDl~~~ia~-~lg~~~e~~ 83 (264) T PRK11260 5 GRQALMGVMAVALVAGMSVKSFADEGLLNKVKERGTLLVGLEGTYPPFSFQGEDGKLTGFEVEFAEALAK-HLGVKASLK 83 (264) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCEEEEEECCCCCCCEEECCCCCEEEEEHHHHHHHHH-HHCCCEEEE T ss_conf 7999999999999998357654342439999869989999778988936899999988871999999999-849956999 Q ss_pred ECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 56405788885149813999840464135666655304702301356685356786344641234553565555665442 Q gi|254780342|r 61 LLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGA 140 (309) Q Consensus 61 ~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~ 140 (309) +.+-..++.+|.+|++|+....- +.+.+... .......|......+.+++.. ...+.+++||. + T Consensus 84 ~~~~~~~i~~L~~g~~Di~~~~~-t~T~eR~~-------~~~Fs~PY~~~~~~~~v~~~~-~~~i~~~~dL~-------G 147 (264) T PRK11260 84 PTKWDGMLASLDSKRIDVVINQV-TISDERKK-------KYDFSTPYTVSGIQALVKKGN-EGTIKTAADLK-------G 147 (264) T ss_pred ECCHHHHHHHHHCCCCCEEECCC-CCCHHHHH-------HCCCCCCEEEEEEEEEEECCC-CCCCCCHHHHC-------C T ss_conf 78889999998669946672575-35888885-------177677437744599998789-76668978966-------8 Q ss_pred CCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCH--HHHCCC-C-EEECCCC Q ss_conf 01357854320055587664035452210010366520189999999659936999412422--320158-5-1400786 Q gi|254780342|r 141 KIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHP--INSDLN-I-HYLPGGE 216 (309) Q Consensus 141 ~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~--~~~~~d-l-~~Ledp~ 216 (309) +.+|...+.... +.++...-+.. ....... .+...++++|+.+++..=.+-. ...++. . +.+..| T Consensus 148 k~V~v~~gs~~~----~~l~~~~~~~~-i~~~~~~-----~~~~~~l~~G~vDa~i~d~~~~~~~~~~~~~~~~~~~~~- 216 (264) T PRK11260 148 KKVGVGLGTNYE----EWLRQNVQGVD-VRTYDDD-----PTKYQDLRVGRIDAILVDRLAALDLVKKTNDTLAVTGEA- 216 (264) T ss_pred CEEEEECCCHHH----HHHHHHCCCCC-EEEECCH-----HHHHHHHHCCCCCEEEECHHHHHHHHHHCCCCEEECCCC- T ss_conf 716885697899----99986088981-6863661-----899999875986799945999999998689978980687- Q ss_pred CCCCCCHHHHHHHHCHHHHHHCHHHHHHHHH Q ss_conf 5467786583544277778748899999874 Q gi|254780342|r 217 EISGFGEASVYTVVRSDYLDKCPNISRLLKN 247 (309) Q Consensus 217 ~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~ 247 (309) + ........+++ .+|+..+.+++ T Consensus 217 --~--~~~~~~iav~k----~~~~L~~~in~ 239 (264) T PRK11260 217 --F--SRQESGVALRK----GNPDLLKAVNN 239 (264) T ss_pred --C--CCCEEEEEEEC----CCHHHHHHHHH T ss_conf --6--66508999989----99999999999 No 33 >PRK10682 putrescine transporter subunit: periplasmic-binding component of ABC superfamily; Provisional Probab=97.63 E-value=0.0052 Score=39.37 Aligned_cols=76 Identities=16% Similarity=0.147 Sum_probs=54.6 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECC-HHHHHHHHHCCC--CEE Q ss_conf 57999999999998630358345883799425886999999999999961069848998564-057888851498--139 Q gi|254780342|r 2 YKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLA-VPVTFRSLKNKG--IDI 78 (309) Q Consensus 2 kk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~-~~~~~~al~~G~--iD~ 78 (309) ||++.-++++++++.+.++.+++.++++| -+|.+ ++--.++.. .|+++|++|++...+ ...++.-|.+|. .|+ T Consensus 6 ~~~~~~l~~~~~~~~~~~~~aa~~~~Lnv--~nW~~-Yi~p~~i~~-FeketGikV~~~~~~snee~~akl~ag~~gyDv 81 (370) T PRK10682 6 KKWLSGLVAGALMAVSVGTLAAEQKTLHI--YNWSD-YIAPDTVAN-FEKETGIKVVYDVFDSNEVLEGKLMAGSTGFDL 81 (370) T ss_pred HHHHHHHHHHHHHHHHCCHHHHCCCEEEE--ECCCC-CCCHHHHHH-HHHHHCCEEEEEECCCHHHHHHHHHCCCCCCCE T ss_conf 88999999999999852056642797999--81676-579779999-999979889999549989999999669999988 Q ss_pred EEE Q ss_conf 998 Q gi|254780342|r 79 FMG 81 (309) Q Consensus 79 ~~e 81 (309) ..- T Consensus 82 v~p 84 (370) T PRK10682 82 VVP 84 (370) T ss_pred EEE T ss_conf 997 No 34 >TIGR01096 3A0103s03R lysine-arginine-ornithine-binding periplasmic protein; InterPro: IPR005768 Bacterial high affinity transport systems are involved in active transport of solutes across the cytoplasmic membrane. The protein components of these traffic systems include one or two transmembrane protein components, one or two membrane-associated ATP-binding proteins (ABC transporters; see IPR003439 from INTERPRO) and a high affinity periplasmic solute-binding protein. The later are thought to bind the substrate in the vicinity of the inner membrane, and to transfer it to a complex of inner membrane proteins for concentration into the cytoplasm. In Gram-positive bacteria which are surrounded by a single membrane and have therefore no periplasmic region the equivalent proteins are bound to the membrane via an N-terminal lipid anchor. These homolog proteins do not play an integral role in the transport process per se, but probably serve as receptors to trigger or initiate translocation of the solute through the membrane by binding to external sites of the integral membrane proteins of the efflux system. In addition at least some solute-binding proteins function in the initiation of sensory transduction pathways. On the basis of sequence similarities, the vast majority of these solute-binding proteins can be grouped into eight families of clusters, which generally correlate with the nature of the solute bound. Family 3 groups together specific amino acids and opine-binding periplasmic proteins and a periplasmic homolog with catalytic activity. ; GO: 0005215 transporter activity, 0006810 transport, 0030288 outer membrane-bounded periplasmic space. Probab=97.02 E-value=0.011 Score=37.26 Aligned_cols=261 Identities=16% Similarity=0.114 Sum_probs=145.9 Q ss_pred HHHHHHHHHHHHHHHHHCC--CCCCCCCEEEEC----CCCCHHHHH-----------HHHHHHHHHHCCC---CEEEEEE Q ss_conf 5799999999999863035--834588379942----588699999-----------9999999961069---8489985 Q gi|254780342|r 2 YKILAVCLFLTTFSISYAR--DADSCTPVRFAD----TGWTDIAAT-----------TAMTSVILEEILG---YKTNIKL 61 (309) Q Consensus 2 kk~~~~~~~~~~~~~~~~~--~~~~~~~I~ig~----~~wte~~v~-----------a~i~~~iLE~~lG---y~Ve~~~ 61 (309) |+++.+.|+++..+++.++ ..-..+.|+||+ ++|...-.- .+|.++|-.+. | .+++.++ T Consensus 1 k~~l~~~L~ag~~~~~~a~~~a~~~~~~v~~gt~~~y~PF~~~~a~G~~~GFDvDl~~alc~~~~gD~-GWdh~~c~~~~ 79 (333) T TIGR01096 1 KLVLLAALVAGGDASSAATAAAAKEKGSVRIGTETAYPPFEFKDADGKLVGFDVDLANALCKEMKGDP-GWDHAKCKFVE 79 (333) T ss_pred CHHHHHHHHHHHHHHHHCCHHHHHHCCCEEEEECCCCCCCCCCCCCCCEEEHHHHHHHHHHHHHCCCC-CCCCCEEEEEC T ss_conf 85899999999999862156888528807998657879865518895456326789999999845899-96431488842 Q ss_pred CCHHHHHHHHHCCCCEEEEE-EECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHC---------CCCCHHH- Q ss_conf 64057888851498139998-404641356666553047023013566853567863446412---------3455356- Q gi|254780342|r 62 LAVPVTFRSLKNKGIDIFMG-YWYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAVNDVGFAL---------GIKSYQD- 130 (309) Q Consensus 62 ~~~~~~~~al~~G~iD~~~e-~w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~---------~i~~~~d- 130 (309) ..-..++++|..|+||+.+. .| +-+.+.-+ .+.-+..|......++|.+..... .+.++++ T Consensus 80 ~~~~~liPsL~~k~~Dai~an~~-~It~~R~~-------~i~F~~~yy~~~~~f~v~~~~~~~~~~~~~E~~~Ld~~A~~ 151 (333) T TIGR01096 80 QAFDGLIPSLKAKKVDAIMANAM-SITDERQK-------QIDFSDPYYATGAGFVVKKDSDLAKTLEDGENYSLDNQADA 151 (333) T ss_pred CCHHHHHHHHHCCCEEEEEECCC-CCCHHHHH-------HHCCCCCCCCCCCEEEEECCCCCCCCHHHHCCCCCCCCCHH T ss_conf 75665202545380689972665-54665663-------21144300004430222248875431453303672000379 Q ss_pred ----HHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCC--HH Q ss_conf ----555566544-20135785432005558766403545221001036652018999999965993699941242--23 Q gi|254780342|r 131 ----IAKYKKELG-AKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPH--PI 203 (309) Q Consensus 131 ----l~~~~~~~~-~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~--~~ 203 (309) +.+...-++ ++.+|...|.....-..+..+...++..-.+.. -..+++..+..|..++.+.=++- .. T Consensus 152 ~K~~~d~~~~~L~nG~~~gV~~GTT~~~~~~~~~~~~Nl~v~~~~Y~------~~~~~~~dL~~GR~Da~~~D~~~l~~~ 225 (333) T TIGR01096 152 AKKEIDDVTKFLKNGKKVGVQSGTTHEQYLKDEFKPKNLGVDIVEYD------SYDNANMDLKAGRIDAVLADASVLAEG 225 (333) T ss_pred HHHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHCCCCCCEEEEEEEC------CHHHHHHHHHCCCCEEEECCHHHHHHH T ss_conf 89989989998527548986268848999986244337426666408------867999997579621774365888888 Q ss_pred HHCC----CCEEECC--------CCCCCCCC---------------HHHHHHHHCH--H-HHHHCHHHHHHHHHC-C-CC Q ss_conf 2015----8514007--------86546778---------------6583544277--7-787488999998740-8-99 Q gi|254780342|r 204 NSDL----NIHYLPG--------GEEISGFG---------------EASVYTVVRS--D-YLDKCPNISRLLKNI-K-FS 251 (309) Q Consensus 204 ~~~~----dl~~Led--------p~~~~~~p---------------~~~~~~vv~~--~-~~~~~P~~~~~L~~~-~-ls 251 (309) +.+. |++++.+ |-..||-. ...+-+.+|+ . ....+|++..+|-.= - +. T Consensus 226 ~~~~~~gkd~~~~g~~~~~~PLGP~~~FG~g~W~DiV~W~~YAL~~A~E~Gig~rk~de~~~s~Np~i~r~lg~~~Dt~G 305 (333) T TIGR01096 226 FLKPPNGKDLKFVGPKVTDEPLGPVKYFGDGYWFDIVRWSAYALVAAEEVGIGLRKVDEVKASTNPEIKRLLGSEGDTIG 305 (333) T ss_pred HHCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHCCCCCCHHHHHCCCCCHHH T ss_conf 73789888637728322688989855578883001214466777634432125345588851888302322045565121 Q ss_pred HHHHHHHHHHHH--HCCCCHHHHHHHHH Q ss_conf 899999999998--45999899999999 Q gi|254780342|r 252 VALENEMMKLIL--NNKQDRQFVGRTML 277 (309) Q Consensus 252 ~~~~~~l~~~v~--~~g~~~~~vA~~wL 277 (309) ++--..+|.++. .+-.+.+.+.++|+ T Consensus 306 ~eL~~~~n~Al~~~~adGtY~~i~~kwf 333 (333) T TIGR01096 306 KELKAKFNKALKAIRADGTYQKISKKWF 333 (333) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHCC T ss_conf 8999999999999986587468886519 No 35 >PRK12679 cbl transcriptional regulator Cbl; Reviewed Probab=96.82 E-value=0.034 Score=34.07 Aligned_cols=210 Identities=14% Similarity=0.038 Sum_probs=128.6 Q ss_pred CCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEE Q ss_conf 34588379942588699999999999996106984899856405788885149813999840464135666655304702 Q gi|254780342|r 22 ADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIK 101 (309) Q Consensus 22 ~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~ 101 (309) .....+++||.......+.+..+++...+..=+.++++..+++..+.+.|.+|++|+..-.-.... +..+. T Consensus 89 ~~~~G~l~Ig~~~t~a~~~LP~~l~~F~~~~P~v~l~l~~~~~~~i~~~l~~g~~Dl~i~~~~~~~---------~p~l~ 159 (316) T PRK12679 89 NDTSGVLTIATTHTQARYSLPEVIKAFRELFPEVRLELIQGTPQEIATLLQNGEADIGIASERLSN---------DPQLV 159 (316) T ss_pred CCCCCEEEEEEEECCHHCCCCHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCCEEECCCCCCC---------CCCCE T ss_conf 887764999985032010381999999986899538998478799999998799885222467789---------97607 Q ss_pred EECCCCCCCEEEEEEEHHHHH--CCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCH Q ss_conf 301356685356786344641--234553565555665442013578543200555876640354522100103665201 Q gi|254780342|r 102 LVAENLQGAKYMLAVNDVGFA--LGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELAS 179 (309) Q Consensus 102 ~~~~~~~~~~~~~~v~~~~~~--~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~ 179 (309) .. +.+ .....+++|..-+- ..--++++|.+ ..++..+++++.+..........|+... ..+...+ T Consensus 160 ~~-p~~-~~~~~lvvp~~HpLa~~~~vsl~~L~~------~plI~~~~g~~~R~~id~~f~~~G~~p~-i~~e~~~---- 226 (316) T PRK12679 160 AF-PWF-RWHHSLLVPLDHPLTQITPLTLESIAK------WPLITYRQGITGRSRIDDAFARKGLLAD-IVLSAQD---- 226 (316) T ss_pred EE-EEE-ECCEEEEECCCCCCCCCCCCCHHHHCC------CCEEEECCCCHHHHHHHHHHHHCCCCCC-EEEEECC---- T ss_conf 99-805-534135601899512489989999879------9989508998499999999997799997-7999998---- Q ss_pred HHHHHHHHHCCCCEEEEEECCCHHH---HCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC--CCCHHH Q ss_conf 8999999965993699941242232---01585140078654677865835442777787488999998740--899899 Q gi|254780342|r 180 FSQIRRDQRNNIPAVFLSWEPHPIN---SDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI--KFSVAL 254 (309) Q Consensus 180 ~a~l~~A~~~~~~~v~~~wsp~~~~---~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~--~ls~~~ 254 (309) ...+.+.++.|-.+-+ -|.... ...+++.++-+ +. +++.....+.+++-. -.|.+++|++-+ .||.+. T Consensus 227 ~~~i~~~V~~GlGvai---lp~~a~~~~~~~~l~~~~~~-~~--~~~~~~~~~~~r~~~-l~~~~~~Fi~~~~~~l~~~~ 299 (316) T PRK12679 227 SDVIKTYVALGLGIGL---VAEQSSGEQEEKNLIRLDTR-HL--FDANTVWLGLKRGQL-QRNYVWRFLELCNAGLSVED 299 (316) T ss_pred HHHHHHHHHHCCCEEE---CHHHHCCCCCCCCEEEECCC-CC--CCCCEEEEEEECCCC-CHHHHHHHHHHHHCCCCHHH T ss_conf 9999999998990997---01445075468988993263-46--766538999967875-56999999997520289999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254780342|r 255 ENEMMK 260 (309) Q Consensus 255 ~~~l~~ 260 (309) +..... T Consensus 300 ~~~~~~ 305 (316) T PRK12679 300 IKRQVM 305 (316) T ss_pred HHHHHH T ss_conf 999987 No 36 >COG0687 PotD Spermidine/putrescine-binding periplasmic protein [Amino acid transport and metabolism] Probab=96.76 E-value=0.051 Score=32.91 Aligned_cols=174 Identities=14% Similarity=0.088 Sum_probs=80.7 Q ss_pred CCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCH-HHHHHHHHCCC--CEEEEEEE--------------- Q ss_conf 3458837994258869999999999999610698489985640-57888851498--13999840--------------- Q gi|254780342|r 22 ADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAV-PVTFRSLKNKG--IDIFMGYW--------------- 83 (309) Q Consensus 22 ~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~-~~~~~al~~G~--iD~~~e~w--------------- 83 (309) ..+.++++| .+|.+..--+ ++.. .|+++|++|.....++ ...+..+..|. .|+..-.. T Consensus 28 ~a~~~~l~v--~~w~gy~~~~-~~~~-F~k~tGi~V~~~~~~sne~~~ak~~~g~~~~Dvv~~s~~~~~~~~~~gll~~l 103 (363) T COG0687 28 AAEGKTLNV--YNWGGYIDPA-LLKP-FEKETGIKVVYDTFDSNEEMLAKLKAGGGGYDVVVPSGYFVARLIKEGLLQPL 103 (363) T ss_pred HHCCCEEEE--ECCCCCCCHH-HHHH-HHHHHCCEEEEECCCCHHHHHHHHHHCCCCCCEEECCHHHHHHHHHCCCCCCC T ss_conf 625785999--6478755778-8998-99998997999538986999999980899988797188899999866756758 Q ss_pred ----CCCCHHHHHHHHHCCCEEEEC----CCCCCCEEEEEEEHHHHH--CCCCCHHHHHHHHHHH--HHCCC-------- Q ss_conf ----464135666655304702301----356685356786344641--2345535655556654--42013-------- Q gi|254780342|r 84 ----YPSLEKFIAPYLEEGSIKLVA----ENLQGAKYMLAVNDVGFA--LGIKSYQDIAKYKKEL--GAKIY-------- 143 (309) Q Consensus 84 ----~p~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~v~~~~~~--~~i~~~~dl~~~~~~~--~~~~~-------- 143 (309) .|+........... ....-+ ..+.-...++++...... +...+|+++-+..... +++.+ T Consensus 104 D~skl~n~~~l~~~~~~~-~~~d~g~~y~vPy~~g~t~i~Yn~~~~~~~~~~~sW~~l~d~~~~~~~k~~~~~~~d~~~~ 182 (363) T COG0687 104 DKSKLPNWKNLDPEFLNP-PKFDPGNKYSVPYFWGTTGIAYNTDKVKDAPPPTSWADLFDPEKFPGLKGKGVALLDDDRE 182 (363) T ss_pred CHHHCCCHHHCCHHHHCC-CCCCCCCEECEEEEEEEEEEEEECCCCCCCCCCCCHHHHHCCHHHHHHCCCCCCCCCCCHH T ss_conf 966664821169878357-7678888603167886469999800058999998899983842002232575301366057 Q ss_pred ---------CCCCCCCCCHHHHHHHHHCCCCCCC-EEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHC Q ss_conf ---------5785432005558766403545221-00103665201899999996599369994124223201 Q gi|254780342|r 144 ---------GIEPGNEGNQRILDMINNNKFSLKG-FRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSD 206 (309) Q Consensus 144 ---------g~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~ 206 (309) |..+-.......+...+. -..+.. ......+ .+..+.+.+|+..+..+|+.+....+ T Consensus 183 ~~~~al~~lg~~~n~~~~~~~~~a~~~-L~~~kp~~~~~~~~-----~~~~~~l~~Gev~~a~~w~g~~~~~~ 249 (363) T COG0687 183 VFGAALALLGKSPNTTDPEDLKKAFDL-LDKLKPVNVYWFDG-----SQYVQLLANGEVVLAMGWSGDAAAAK 249 (363) T ss_pred HHHHHHHHCCCCCCCCCHHHHHHHHHH-HHHHCCCEEEEECC-----HHHHHHHHCCCEEEEEEECHHHHHHH T ss_conf 899999971899788998899999999-99727850899636-----27888986699789997777899986 No 37 >PRK11205 tbpA thiamine transporter substrate binding subunit; Provisional Probab=96.42 E-value=0.084 Score=31.50 Aligned_cols=172 Identities=12% Similarity=0.064 Sum_probs=79.3 Q ss_pred CCCCCCEEEECCC-CCHHHHHHHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHH-CC---CCEEEEEEECCCCHHHHHHHH Q ss_conf 3458837994258-8699999999999996106984899856-4057888851-49---813999840464135666655 Q gi|254780342|r 22 ADSCTPVRFADTG-WTDIAATTAMTSVILEEILGYKTNIKLL-AVPVTFRSLK-NK---GIDIFMGYWYPSLEKFIAPYL 95 (309) Q Consensus 22 ~~~~~~I~ig~~~-wte~~v~a~i~~~iLE~~lGy~Ve~~~~-~~~~~~~al~-~G---~iD~~~e~w~p~~~~~~~~~~ 95 (309) +.+..+++|-+.+ |.+..-.+..+....|+.+|++|+++.. +...+..-|. .| ..|+....--. .+.... T Consensus 15 ~~~~~~l~vyt~dsf~~d~g~~~~l~~~Fek~tGikV~~v~~~dg~~ll~Rl~~Eg~np~ADVv~gvD~~----~l~~A~ 90 (326) T PRK11205 15 AFAKPVLTVYTYDSFAADWGPGPAVKKAFEAECGCELKFVALEDGVSLLNRLRLEGKNSKADVVLGLDNN----LLAAAK 90 (326) T ss_pred CCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHCCCCCCCCEEEECCHH----HHHHHH T ss_conf 0029879999665444466864799999989879689999679679999999973899976589981767----899999 Q ss_pred HCCCEEEE---------C--------CCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCC--HHHH Q ss_conf 30470230---------1--------356685356786344641234553565555665442013578543200--5558 Q gi|254780342|r 96 EEGSIKLV---------A--------ENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGN--QRIL 156 (309) Q Consensus 96 ~~~~~~~~---------~--------~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~--~~~~ 156 (309) ..+.+.-. . ..+......+.+.+.......+++.||.+.+ +++++.-.+|..... .... T Consensus 91 ~~gl~~p~~s~~~~~~ip~~~~d~~~vp~d~g~~a~vYn~~~~~~~P~S~~DL~~p~--~kgki~i~dP~tSs~g~a~l~ 168 (326) T PRK11205 91 KTGLFAPSGVDTSAVTVPGGWNDDTFVPYDYGYFAFVYDKEKLKNPPKSLKELVESD--QNWKVIYQDPRTSTPGLGLLL 168 (326) T ss_pred HCCCCCCCCCCHHHCCCCCCCCCCCEECCCCCEEEEEEEHHHCCCCCCCHHHHHCCH--HCCCEEECCCCCCCCHHHHHH T ss_conf 869965256861102486002698521134765899984222378876989872733--038866338887852799999 Q ss_pred HHHHHCC---------CCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHH Q ss_conf 7664035---------45221001036652018999999965993699941242232 Q gi|254780342|r 157 DMINNNK---------FSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPIN 204 (309) Q Consensus 157 ~~~~~~~---------l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~ 204 (309) ..+..+| |......+..+ +.+.+.++.+|+..++..|+++... T Consensus 169 ~~i~~~Ge~~~e~W~~L~~N~~~~~~g-----ws~a~~~~~~Ge~~ivlsy~~spa~ 220 (326) T PRK11205 169 WMKKVYGDDAPQAWQKLAKKTVTVTKG-----WSEAYGLFLKGEADLVLSYTTSPAY 220 (326) T ss_pred HHHHHHCCCHHHHHHHHHHCCCCCCCC-----CHHHHHHHHCCCCCEEEEEECCHHH T ss_conf 999971632899999998638444788-----3388887644767579985147787 No 38 >pfam03180 Lipoprotein_9 NLPA lipoprotein. This family of bacterial lipoproteins contains several antigenic members, that may be involved in bacterial virulence. Their precise function is unknown. However they are probably distantly related to pfam00497 which are solute binding proteins. Probab=96.39 E-value=0.023 Score=35.17 Aligned_cols=202 Identities=15% Similarity=0.133 Sum_probs=101.5 Q ss_pred EEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCC--CEEEEC Q ss_conf 79942588699999999999996106984899856-405788885149813999840464135666655304--702301 Q gi|254780342|r 28 VRFADTGWTDIAATTAMTSVILEEILGYKTNIKLL-AVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEG--SIKLVA 104 (309) Q Consensus 28 I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~-~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~--~~~~~~ 104 (309) ||||...-+-+.+.-. ++..+++. ||+||++.. +...--.||.+|+||.-.-- +..++..+.++. .++.++ T Consensus 1 lkvG~~~~p~~~i~~~-v~~~~~~~-Gi~veiv~F~Dy~~pN~AL~~GeIDaN~fQ----H~~yl~~~n~~~g~~L~~v~ 74 (236) T pfam03180 1 LKVGATPGPHAEVLEV-AKPLAKKK-GLDLEIVEFTDYVQPNTALADGEIDANAFQ----HLPYLDQFNKEGGLDLVAVG 74 (236) T ss_pred CEEEECCCCHHHHHHH-HHHHHHHC-CCEEEEEEECCCCCHHHHHHCCCCCHHHHC----CHHHHHHHHHHCCCCEEEEE T ss_conf 9896438976999999-99999964-987999981686455079778995524443----89999999997799679963 Q ss_pred CCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHH---------HHHH-------CCCCCCCCCCCCHHHHHHHHHCCCCCCC Q ss_conf 35668535678634464123455356555566---------5442-------0135785432005558766403545221 Q gi|254780342|r 105 ENLQGAKYMLAVNDVGFALGIKSYQDIAKYKK---------ELGA-------KIYGIEPGNEGNQRILDMINNNKFSLKG 168 (309) Q Consensus 105 ~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~---------~~~~-------~~~g~~~~~~~~~~~~~~~~~~~l~~~~ 168 (309) ..+..- .+++. ..++++++|++-+. ..+. .++-..++.+......+... ....+. T Consensus 75 ~~~~~p-~glYS------~k~ksl~~lp~Ga~IaIpnD~sN~~RAL~lL~~aGLI~Lk~~~~~~~t~~DI~~-Npk~l~- 145 (236) T pfam03180 75 NTHVEP-IGLYS------KKYKSLSELPDGATIAVPNDPSNEGRALLLLEKAGLIKLKDGAGLLATVKDITE-NPKNLK- 145 (236) T ss_pred EEEECC-EEEEE------CCCCCHHHCCCCCEEEECCCCCHHHHHHHHHHHCCCEEECCCCCCCCCHHHHHH-CCCCCE- T ss_conf 046723-79543------376887585799989815884429999999998898897689998789888972-867766- Q ss_pred EEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCC----CEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHH Q ss_conf 0010366520189999999659936999412422320158----514007865467786583544277778748899999 Q gi|254780342|r 169 FRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDLN----IHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRL 244 (309) Q Consensus 169 ~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~d----l~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~ 244 (309) +++.. .+++.+++.+ .++.........-+..+ .-.+|+. + .+|....++|++= ++.|.+.++ T Consensus 146 --~~ev~----a~ql~~~l~d--vD~avin~n~a~~agl~p~~~~l~~e~~-~----~~y~n~ivvr~~~-~d~~~ik~l 211 (236) T pfam03180 146 --IKELE----AAQLPRALDD--VDAAVINTNYALQAGLDPKKDALFEEDK-D----SPYVNIIVVREDD-KDDPAVKKL 211 (236) T ss_pred --EEEEC----HHHHHHHCCC--CCEEEECHHHHHHCCCCHHHHHHHCCCC-C----CCEEEEEEEECCC-CCCHHHHHH T ss_conf --99916----7775542166--5789986567988794921213314788-9----9837999981777-699999999 Q ss_pred HHHCCCCHHHHHHHHH Q ss_conf 8740899899999999 Q gi|254780342|r 245 LKNIKFSVALENEMMK 260 (309) Q Consensus 245 L~~~~ls~~~~~~l~~ 260 (309) .+-++ .+.+.+... T Consensus 212 ~e~~~--s~~vk~~i~ 225 (236) T pfam03180 212 VKAYQ--SEEVKAFIE 225 (236) T ss_pred HHHHC--CHHHHHHHH T ss_conf 99987--999999999 No 39 >PRK12681 cysB transcriptional regulator CysB; Reviewed Probab=96.27 E-value=0.1 Score=30.95 Aligned_cols=212 Identities=7% Similarity=-0.004 Sum_probs=127.6 Q ss_pred CCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEE Q ss_conf 34588379942588699999999999996106984899856405788885149813999840464135666655304702 Q gi|254780342|r 22 ADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIK 101 (309) Q Consensus 22 ~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~ 101 (309) .....+|+||.......+++..+++...++.=+.++++..+++..+.+.+.+|++|+..-.-.+. .+ + .+. T Consensus 89 ~~~~G~L~Ia~~~t~a~~~LP~~i~~F~~~~P~v~l~l~~~~~~~i~~~l~~g~~DlaI~~e~~~---~~-----~-dl~ 159 (324) T PRK12681 89 WPDKGSLYIATTHTQARYALPPVIKGFIKRYPRVSLHMHQGSPTQIAEAAAKGEADFAIATEALH---LY-----D-DLI 159 (324) T ss_pred CCCCCEEEEEECHHHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCCCCCCCCCCC---CC-----C-CCE T ss_conf 88763499985358888772599999998789984465317879999999879987231246666---68-----8-746 Q ss_pred EECCCCCCCEEEEEEEHHHHHC--CCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCH Q ss_conf 3013566853567863446412--34553565555665442013578543200555876640354522100103665201 Q gi|254780342|r 102 LVAENLQGAKYMLAVNDVGFAL--GIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELAS 179 (309) Q Consensus 102 ~~~~~~~~~~~~~~v~~~~~~~--~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~ 179 (309) .-+.+... ..+++|..-+-. .--++++|.++ .++.-+.++..+....+.....|+... ..+...+ T Consensus 160 -~~P~~~~~-~~vvvp~~HPLa~~~~itl~dL~~~------plI~~~~g~~~R~~id~~f~~~g~~P~-iv~e~~~---- 226 (324) T PRK12681 160 -MLPCYHWN-RSVVVPPDHPLAKKKKLTIEELAQY------PLVTYVFGFTGRSELDTAFNRAGLTPR-IVFTATD---- 226 (324) T ss_pred -EEEEEECC-CCCCCCCCCCCCCCCCCCHHHHCCC------CEEEECCCCCHHHHHHHHHHHCCCCCE-EEEEECC---- T ss_conf -86400032-2245799996557999999998699------859614998299999999998899760-7999898---- Q ss_pred HHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC--CCCHHHHHH Q ss_conf 899999996599369994124223201585140078654677865835442777787488999998740--899899999 Q gi|254780342|r 180 FSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI--KFSVALENE 257 (309) Q Consensus 180 ~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~--~ls~~~~~~ 257 (309) ...+.+.++.|-.+-+.....---...-+|+.++-+ +.| ++.....+.|++.-= .+-+++|++-| .+|-+.+.+ T Consensus 227 ~~~I~~~V~~GlGV~ilp~mA~~~~~d~~l~~~~~~-~~f--~~~~t~i~~r~g~~l-~~~~~~Fi~~~~p~l~~~~~~~ 302 (324) T PRK12681 227 ADVIKTYVRLGLGVGVIASMAVDPVADPDLVAIDAS-HLF--AHSTTKIGFRRGTFL-RSYMYDFIERFAPHLTRDVVEK 302 (324) T ss_pred HHHHHHHHHHCCEEEECHHHHCCCCCCCCEEEEECC-CCC--CCCEEEEEEECCCCH-HHHHHHHHHHHHCCCCHHHHHH T ss_conf 999999999599139703555196557881784365-567--776479999799715-7999999998612048999999 Q ss_pred HH Q ss_conf 99 Q gi|254780342|r 258 MM 259 (309) Q Consensus 258 l~ 259 (309) -. T Consensus 303 ~~ 304 (324) T PRK12681 303 AV 304 (324) T ss_pred HH T ss_conf 98 No 40 >cd00134 PBPb Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD. Probab=96.15 E-value=0.1 Score=30.95 Aligned_cols=173 Identities=10% Similarity=0.067 Sum_probs=90.8 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEH Q ss_conf 99999999999610698489985640578888514981399984046413566665530470230135668535678634 Q gi|254780342|r 39 AATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAVND 118 (309) Q Consensus 39 ~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~ 118 (309) -+...++..+.+ .+|++++++..+-..++.+|.+|++|+.+..-..+ ..... .......+......+++++ T Consensus 23 G~~~dl~~~i~~-~lg~~~~~~~~~~~~~~~~l~~g~~D~~~~~~~~t-~~R~~-------~~~fs~p~~~~~~~~~~~~ 93 (218) T cd00134 23 GFDVDLAKAIAK-ELGVKVKFVEVDWDGLITALKSGKVDLIAAGMTIT-PERAK-------QVDFSDPYYKSGQVILVKK 93 (218) T ss_pred EEHHHHHHHHHH-HHCCCEEEEECCHHHHHHHHHCCCCCEEEEEECCC-HHHHC-------CCCCCCCEECCCEEEEEEC T ss_conf 839999999999-96996699975889999998579977688850258-56614-------1455764441570799986 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 46412345535655556654420135785432005558766403545221001036652018999999965993699941 Q gi|254780342|r 119 VGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSW 198 (309) Q Consensus 119 ~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~w 198 (309) . ..+.++++|. ++.+|...+..... .+.... . ..+++.-+ -..+..+++.+|+.+++..+ T Consensus 94 ~---~~~~~~~dl~-------g~~i~v~~g~~~~~----~l~~~~-~--~~~~~~~~---~~~~~~~~l~~G~vD~~v~~ 153 (218) T cd00134 94 G---SPIKSVKDLK-------GKKVAVQKGSTAEK----YLKKAL-P--EAKVVSYD---DNAEALAALENGRADAVIVD 153 (218) T ss_pred C---CCCCCHHHHC-------CCEEEEECCCHHHH----HHHHHH-C--CCEEEEEC---CHHHHHHHHHCCCCEEEEEC T ss_conf 8---9999989977-------96267958988999----999731-5--75499969---99999999985994699976 Q ss_pred CCCHH--HHCC--CCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHH Q ss_conf 24223--2015--8514007865467786583544277778748899999874 Q gi|254780342|r 199 EPHPI--NSDL--NIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKN 247 (309) Q Consensus 199 sp~~~--~~~~--dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~ 247 (309) .+... ..++ ++..+..+... .+..+...++ +.+|++.+.|++ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~----~~~~~l~~~~n~ 199 (218) T cd00134 154 EIALAALLKKHPPELKIVGPSIDL---EPLGFGVAVG----KDNKELLDAVNK 199 (218) T ss_pred HHHHHHHHHHCCCCCEEECCCCCC---CCEEEEEEEC----CCCHHHHHHHHH T ss_conf 999999998679981996356676---7348999974----999999999999 No 41 >PRK12680 transcriptional regulator CysB-like protein; Reviewed Probab=96.13 E-value=0.12 Score=30.51 Aligned_cols=209 Identities=9% Similarity=0.027 Sum_probs=129.1 Q ss_pred CCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEE Q ss_conf 34588379942588699999999999996106984899856405788885149813999840464135666655304702 Q gi|254780342|r 22 ADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIK 101 (309) Q Consensus 22 ~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~ 101 (309) .....+++||...-...+++..+++...++.=+.++.+..+++..+.+.+.+|++|+..-....... .. .. T Consensus 89 ~~~~G~L~Ig~~~t~a~~~LP~vi~~f~~~yP~V~l~l~~~~~~~l~e~l~~g~~Diai~~~~~~~~--------~~-~~ 159 (327) T PRK12680 89 RESQGQLTLTTTHTQARFVLPPAVAQIKQAYPQVSVHLQQAAESAALDLLGQGDADIAVVSTAGGEP--------SA-GI 159 (327) T ss_pred CCCCEEEEEEEECHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEECCCCCCC--------CC-CE T ss_conf 8873379999842277775689999999868996799998982889999978994099944777799--------76-54 Q ss_pred EECCCCCCCEEEEEEEHHHH---HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCC Q ss_conf 30135668535678634464---123455356555566544201357854320055587664035452210010366520 Q gi|254780342|r 102 LVAENLQGAKYMLAVNDVGF---ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELA 178 (309) Q Consensus 102 ~~~~~~~~~~~~~~v~~~~~---~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~ 178 (309) . -+.+.. ...+++|..-+ .....++++|.+ ..++.-+.+...+.......+..|+... ..+...+ T Consensus 160 ~-~pl~~~-~~~vvvP~~HpL~~~~~~i~L~~La~------~plI~~~~g~~~r~~id~af~~~Gl~P~-iv~ea~~--- 227 (327) T PRK12680 160 A-VPLYRW-RRLVVVPRGHALDTPRTAPDMAALAE------HPLISYDSSTRPGSSLQRAFAQVGLEPS-IALTALD--- 227 (327) T ss_pred E-EEEEEC-CEEEEEECCCCCCCCCCCCCHHHHHC------CCEEECCCCCCHHHHHHHHHHHCCCCCC-EEEEECC--- T ss_conf 7-675223-57999978981005899999999808------9889638998679999999997799764-7999995--- Q ss_pred HHHHHHHHHHCCCCEEE-EEECCCHHHHCCCCEEECCCCCCCCCCHHHHHHHHCH-HHHHHCHHHHHHHHHC--CCCHHH Q ss_conf 18999999965993699-9412422320158514007865467786583544277-7787488999998740--899899 Q gi|254780342|r 179 SFSQIRRDQRNNIPAVF-LSWEPHPINSDLNIHYLPGGEEISGFGEASVYTVVRS-DYLDKCPNISRLLKNI--KFSVAL 254 (309) Q Consensus 179 ~~a~l~~A~~~~~~~v~-~~wsp~~~~~~~dl~~Ledp~~~~~~p~~~~~~vv~~-~~~~~~P~~~~~L~~~--~ls~~~ 254 (309) ...+.+.++.|-.+-+ .-+.-|.. .-+++.++-+ +. |++.......|+ .|+..+ +|+|++-| .|+-+. T Consensus 228 -~d~Ik~~V~~GlGV~Ivp~mA~~~~--d~~l~~~~~~-~l--~~~~~~~~~~~~~~~lr~~--~~~fi~~~~p~l~~~~ 299 (327) T PRK12680 228 -ADLIKTYVRAGLGVGLVAEMAVNAF--DEDLRAWPAP-AP--IAECIAWAVLPRDRVLRDY--ALDLVHVLAPQIDKRD 299 (327) T ss_pred -HHHHHHHHHHCCEEEECHHHHHCCC--CCCEEEEECC-CC--CCCCEEEEEEECCCHHHHH--HHHHHHHHCCCCCHHH T ss_conf -9999999994986188164762876--8983894064-57--7756059999788468999--9999998668889999 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780342|r 255 ENEMM 259 (309) Q Consensus 255 ~~~l~ 259 (309) +.+.. T Consensus 300 ~~~~~ 304 (327) T PRK12680 300 LRRVL 304 (327) T ss_pred HHHHH T ss_conf 99982 No 42 >COG4134 ABC-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=95.88 E-value=0.16 Score=29.76 Aligned_cols=246 Identities=12% Similarity=0.085 Sum_probs=114.8 Q ss_pred CCCCEEEECCCCCHHHHHHHHH---HHHHHHCCCCEEEEEECC---HH-HHHHHHHC-----CCCEEEEEEE--CCCCH- Q ss_conf 5883799425886999999999---999961069848998564---05-78888514-----9813999840--46413- Q gi|254780342|r 24 SCTPVRFADTGWTDIAATTAMT---SVILEEILGYKTNIKLLA---VP-VTFRSLKN-----KGIDIFMGYW--YPSLE- 88 (309) Q Consensus 24 ~~~~I~ig~~~wte~~v~a~i~---~~iLE~~lGy~Ve~~~~~---~~-~~~~al~~-----G~iD~~~e~w--~p~~~- 88 (309) .+++|- -+-|-++..++.-. +.++|.++|.+|.++.++ .. .+...++. |.+|++..+- +|... T Consensus 32 ~~q~vy--fnawgg~~~~ns~~~W~~~~~~~kYgi~v~~~~l~~~~a~~~~~a~~AAG~~e~g~VDl~~~~g~n~p~lk~ 109 (384) T COG4134 32 KGQTVY--FNAWGGSTAINSYADWVSDEMKAKYGIKVAIVRLDAADAVKRLIAELAAGRDEGGSVDLLWINGENFPTLKE 109 (384) T ss_pred CCCCEE--EEEECCCCCHHCCCCCCCCHHHCCCCEEEEEECCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCHHHHH T ss_conf 577369--985056310000265322023200251787512418888888999986425568726899835754133431 Q ss_pred --HHH----------HH---HHHC----CCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCC-CCC Q ss_conf --566----------66---5530----470230135668535678634464123455356555566544201357-854 Q gi|254780342|r 89 --KFI----------AP---YLEE----GSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGI-EPG 148 (309) Q Consensus 89 --~~~----------~~---~~~~----~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~-~~~ 148 (309) -++ .. .+.+ .....+.....+++..+.........--.+.++|...++...+|+.-. ++. T Consensus 110 ~~~Ly~~~~~~LPnwr~~d~ql~~tdf~~pt~Gl~~~~~~~q~~f~yd~~k~~~~P~s~eeLL~wA~aNP~rf~Y~~P~~ 189 (384) T COG4134 110 AGLLYTPWAEDLPNWRYVDHQLPRTDFSVPTEGLESPWGGNQLVFIYDRDKVPQVPQSFEELLVWAKANPKRFGYPRPPN 189 (384) T ss_pred CCCCCCCHHHHCCCCHHHCHHCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCC T ss_conf 66202512321866012030035332578877621440277523887234478877399999999985967535689998 Q ss_pred -CCCCHHHHHHHHH--------------------------CCCCCCC-EEEECCCCCCHHHHHHHHHHCCCCEEEEEECC Q ss_conf -3200555876640--------------------------3545221-00103665201899999996599369994124 Q gi|254780342|r 149 -NEGNQRILDMINN--------------------------NKFSLKG-FRLIEASELASFSQIRRDQRNNIPAVFLSWEP 200 (309) Q Consensus 149 -~~~~~~~~~~~~~--------------------------~~l~~~~-~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp 200 (309) +.++......+-. |.-.+.. ......+.+.|+ +-+.+|+-.+...| | T Consensus 190 gg~G~~Fl~~~~i~Lt~d~~~l~~avDpat~~k~~~p~WqYl~~Lh~~i~r~~kg~Adml----~lL~dG~L~l~~t~-~ 264 (384) T COG4134 190 GGTGTAFLKQLLIMLTGDAAALQQAVDPATFAKVWAPLWQYLDELHPYIWRTGKGPADML----QLLNDGTLYLTLTF-P 264 (384) T ss_pred CCCCHHHHHHHHHHHCCCHHHHCCCCCHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHH----HHHCCCCEEEEECC-H T ss_conf 886076999889985485576507888077776210899999986155601389878999----87417847886046-0 Q ss_pred CHHHHCCCCEEECCCCCCCCCC---HHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHH-HHHHHHHHHCC-----CCHHH Q ss_conf 2232015851400786546778---658354427777874889999987408998999-99999998459-----99899 Q gi|254780342|r 201 HPINSDLNIHYLPGGEEISGFG---EASVYTVVRSDYLDKCPNISRLLKNIKFSVALE-NEMMKLILNNK-----QDRQF 271 (309) Q Consensus 201 ~~~~~~~dl~~Ledp~~~~~~p---~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~-~~l~~~v~~~g-----~~~~~ 271 (309) ++.++......|.+.-..+.+. ..+.+.++-. .-..++++.-+|-+|-||.|.+ ..++-.|-.+- +.-.+ T Consensus 265 ~~~~s~~~tG~lp~s~~~~~~~~G~vgn~~f~aIP-aNa~~~A~alvl~n~lls~EaQlrk~d~~v~~~P~~l~pq~lpd 343 (384) T COG4134 265 DHASSAIATGDLPASARSFALEKGMVGNGHFMAIP-ANANAKAAALVLANFLLSPEAQLRKLDPAVWGDPSVLDPQLLPD 343 (384) T ss_pred HHHHCCHHCCCCCHHHHHHHHCCCCCCCCCEEEEC-CCCCCCHHHHHHHHHHCCHHHHHHHHCCCCCCCCCCCCCCCCHH T ss_conf 66403022167866766421016653675169822-79997069999998863988887651520148964258001746 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254780342|r 272 VGRTML 277 (309) Q Consensus 272 vA~~wL 277 (309) +.++++ T Consensus 344 a~qe~~ 349 (384) T COG4134 344 AQQEFL 349 (384) T ss_pred HHHHHH T ss_conf 675543 No 43 >TIGR02424 TF_pcaQ pca operon transcription factor PcaQ; InterPro: IPR012787 Members of this family are LysR-family transcription factors associated with operons for catabolism of protocatechuate . Members occur only in proteobacteria.; GO: 0003677 DNA binding, 0016563 transcription activator activity, 0019619 protocatechuate catabolic process, 0045941 positive regulation of transcription. Probab=95.52 E-value=0.058 Score=32.54 Aligned_cols=188 Identities=13% Similarity=0.025 Sum_probs=109.2 Q ss_pred CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCC----HHHHHHHHH Q ss_conf 8345883799425886999999999999961069848998564057888851498139998404641----356666553 Q gi|254780342|r 21 DADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSL----EKFIAPYLE 96 (309) Q Consensus 21 ~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~----~~~~~~~~~ 96 (309) ...++.+++||.-..-+...+-+++++++++..+..|.+..|.+..+.+.|+.|+||+.+-= +|.- .-.|.. T Consensus 88 g~~~~~~~~~GaLPTVaA~L~P~~~~~f~~~~p~~~~~i~tG~n~~Lld~LR~G~LDlVvGR-L~ape~MqGl~Fe~--- 163 (300) T TIGR02424 88 GEGEGPTVRVGALPTVAARLMPEVVKRFLAEAPRLRLRIVTGENAYLLDQLRVGELDLVVGR-LGAPETMQGLSFEH--- 163 (300) T ss_pred HHCCCCEEEECCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHCCCCCEEEEEC-CCCCHHHCCCCCCC--- T ss_conf 21489648972646689999999999999616895589987785787987228980277606-88832115864251--- Q ss_pred CCCEEEECCCCCCCEEEEEEEHHHHHCC--CCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECC Q ss_conf 0470230135668535678634464123--45535655556654420135785432005558766403545221001036 Q gi|254780342|r 97 EGSIKLVAENLQGAKYMLAVNDVGFALG--IKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEA 174 (309) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~--i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 174 (309) .|.+ ...++|...=+-.. .-+.+.|.+|-=-+ -+.+...+....+.+-+.|+...-..+... T Consensus 164 ---------LY~E-~v~~VVragHPL~~~p~~~~~~L~~yPvl~------P~~g~~IR~~~erll~a~G~~~l~~r~Etv 227 (300) T TIGR02424 164 ---------LYNE-RVVFVVRAGHPLLAAPSLSVASLADYPVLL------PPEGSIIRPLVERLLIACGIPALPQRIETV 227 (300) T ss_pred ---------CCCC-CEEEEECCCCCCCCCCCCCHHHHHCCCCCC------CCCHHHHHHHHHHHHHHCCCCCCCCHHHHH T ss_conf ---------4477-337887085785578877778873388104------862226799999999874765566013443 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEECCCCCCCCCCHHHHHHHHCHHH Q ss_conf 652018999999965993699941242232015851400786546778658354427777 Q gi|254780342|r 175 SELASFSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYLPGGEEISGFGEASVYTVVRSDY 234 (309) Q Consensus 175 s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~ 234 (309) |.+ ++.-|- ..++-++++.-.-=-.-+...-|+-|+-+-+. +...+-...|.+. T Consensus 228 S~~--fgR~y~-~~sdA~WiiS~gvv~~dl~~G~L~~Lp~~~~~---t~GPVGL~~R~d~ 281 (300) T TIGR02424 228 SDS--FGRRYV-RESDAVWIISEGVVAKDLADGTLVELPVDTRE---TAGPVGLCTRPDE 281 (300) T ss_pred HHH--HHHHHH-HHCCCEEECCHHHHHHHHCCCCEEECCCCCCC---CCCCCCCCCCCCC T ss_conf 157--887776-45472577266789887528927764657577---8877021548887 No 44 >pfam03466 LysR_substrate LysR substrate binding domain. The structure of this domain is known and is similar to the periplasmic binding proteins. Probab=95.45 E-value=0.22 Score=28.74 Aligned_cols=196 Identities=12% Similarity=0.057 Sum_probs=110.6 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEE Q ss_conf 45883799425886999999999999961069848998564057888851498139998404641356666553047023 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKL 102 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~ 102 (309) ...++|+||..+......+..++....++.=+.++++..+++..+.+.|.+|++|+..-+-.+.. ..+.. T Consensus 3 ~~~G~i~I~~~~~~~~~~lp~~l~~f~~~~P~i~i~i~~~~~~~i~~~l~~g~~Di~i~~~~~~~----------~~~~~ 72 (209) T pfam03466 3 GPRGRLRIGAPPTFAAYLLPPLLARFRERYPDVELELREGDSEELLDLLAEGELDLAIRRGPPDD----------PGLEA 72 (209) T ss_pred CCCEEEEEEEEHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCEEEEECCCCCC----------CCEEE T ss_conf 98789999938899999999999999998889289999898488999998698148873178999----------98089 Q ss_pred ECCCCCCCEEEEEEEHHH--HHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHH Q ss_conf 013566853567863446--412345535655556654420135785432005558766403545221001036652018 Q gi|254780342|r 103 VAENLQGAKYMLAVNDVG--FALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASF 180 (309) Q Consensus 103 ~~~~~~~~~~~~~v~~~~--~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~ 180 (309) . ..+. ....+++++.- +.....++++|.++ ..+........+..........++.... .+...+ . T Consensus 73 ~-~l~~-~~~~~~~s~~~~l~~~~~i~~~~l~~~------~~i~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~----~ 139 (209) T pfam03466 73 R-PLFE-EPLVLVAPPDHPLAAGEPVSLEDLADE------PLILLEPGSGLRDLVDRALERAGLRPRV-ALEVNS----L 139 (209) T ss_pred E-EEEE-EEEEEEECCCCHHHCCCCCCHHHHCCC------CEEEECCCCCCHHHHHHHHHHCCCCCCE-EEEECC----H T ss_conf 9-9762-413655336705542897543551466------5798569987137998778764876302-467478----7 Q ss_pred HHHHHHHHCCCCEEEEEECCCHHHH----CCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC Q ss_conf 9999999659936999412422320----1585140078654677865835442777787488999998740 Q gi|254780342|r 181 SQIRRDQRNNIPAVFLSWEPHPINS----DLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI 248 (309) Q Consensus 181 a~l~~A~~~~~~~v~~~wsp~~~~~----~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~ 248 (309) ..+...+++|..+. +-|+.... ...++.+..+... ++..+..+.++ -....|.+..|++-+ T Consensus 140 ~~~~~lv~~g~Gi~---~lp~~~~~~~~~~~~L~~i~~~~~~---~~~~~~li~~~-~~~~s~~~~~f~~~l 204 (209) T pfam03466 140 EALLAAVAAGLGIA---LLPRSAVARELADGRLVVLPLPDPP---LPRPIYLVYRK-GRRLSPAVRAFIDFL 204 (209) T ss_pred HHHHHHHHHCCEEE---ECCHHHHHHHHHCCCEEEEECCCCC---CCCEEEEEEEC-CCCCCHHHHHHHHHH T ss_conf 99999999499999---9139999988758998999588888---74479999989-497899999999999 No 45 >COG1910 Periplasmic molybdate-binding protein/domain [Inorganic ion transport and metabolism] Probab=95.42 E-value=0.12 Score=30.52 Aligned_cols=175 Identities=14% Similarity=0.164 Sum_probs=109.3 Q ss_pred CCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCC-----HHHHHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHCCCCC Q ss_conf 69848998564057888851498139998404641-----3566665530470230135668535678634464123455 Q gi|254780342|r 53 LGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSL-----EKFIAPYLEEGSIKLVAENLQGAKYMLAVNDVGFALGIKS 127 (309) Q Consensus 53 lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~ 127 (309) -++..+.+..++..=+.+++.|+.|+..-.-.+.. ....++...+. ...+ .+.--.+|++|++.. -+++.+ T Consensus 16 ~~~~~~~~~vGS~gG~~~i~~G~adiag~hlld~~~g~ynip~v~~~~~~~-~~LI--~g~~Re~Gl~v~~gn-pk~i~~ 91 (223) T COG1910 16 AGETAKTVYVGSMGGLLAIKRGEADIAGIHLLDNESGTYNIPYVKKYLPGE-AVLI--RGYLREQGLMVQKGN-PKNISS 91 (223) T ss_pred CCCCCCEEEECCCCCHHHHHHCCCCEEEEEECCCCCCCEEHHHHHHCCCCC-EEEE--EEEEEEEEEEEECCC-CCCCCC T ss_conf 523450688604455476662545144421325778744230042307884-6998--536651036886489-776624 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCC---CCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHH Q ss_conf 35655556654420135785432005558766403545---221001036652018999999965993699941242232 Q gi|254780342|r 128 YQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFS---LKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPIN 204 (309) Q Consensus 128 ~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~---~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~ 204 (309) |+||.+. +-+++.-+.+.+.+....+.+..++.. ..+|... ...-..+.+|+..|+.++...-.+- - T Consensus 92 ~edl~~~----d~~fVNR~rGSGTR~LlD~~L~~~~~~~~~I~GY~~e----~~th~avA~aVa~G~AD~GvGlr~~--A 161 (223) T COG1910 92 LEDLLRK----DLRFVNRNRGSGTRILLDELLGELNILPDSIKGYSDE----ATTHDAVASAVASGRADAGVGLRHA--A 161 (223) T ss_pred HHHHHHC----CCEEEECCCCCCHHHHHHHHHHHCCCCCHHCCCCCCC----CCCCHHHHHHHHCCCCCCCCCHHHH--H T ss_conf 9988526----7578715899658999999998737670105784323----2120899999975898725017999--9 Q ss_pred HCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHCC Q ss_conf 015851400786546778658354427777874889999987408 Q gi|254780342|r 205 SDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNIK 249 (309) Q Consensus 205 ~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ 249 (309) .+|+|.+.+-.+.. | ..++|+++.+ .|.+.+|++.++ T Consensus 162 ~~~gL~Fipl~~E~-----Y--D~virke~~~-~~~vr~fi~~L~ 198 (223) T COG1910 162 EKYGLDFIPLGDEE-----Y--DFVIRKERLD-KPVVRAFIKALK 198 (223) T ss_pred HHCCCCEEECCCCE-----E--EEEEEHHHCC-CHHHHHHHHHHC T ss_conf 98498517745625-----7--8998456705-888999999861 No 46 >PRK09959 hybrid sensory histidine kinase in two-component regulatory system with EvgA; Provisional Probab=95.39 E-value=0.24 Score=28.60 Aligned_cols=55 Identities=16% Similarity=0.136 Sum_probs=40.1 Q ss_pred CCCEEEEC--CCC-----C----HHHHHHHHHHHHHHHCCCCEEEEEE-CCHHHHHHHHHCCCCEEE Q ss_conf 88379942--588-----6----9999999999999610698489985-640578888514981399 Q gi|254780342|r 25 CTPVRFAD--TGW-----T----DIAATTAMTSVILEEILGYKTNIKL-LAVPVTFRSLKNKGIDIF 79 (309) Q Consensus 25 ~~~I~ig~--~~w-----t----e~~v~a~i~~~iLE~~lGy~Ve~~~-~~~~~~~~al~~G~iD~~ 79 (309) .+++++|. .++ | ...-++.=|..+|.+.+|.++++.. .+...+++||++|+||+. T Consensus 55 ~~~l~vgv~~pd~~Pf~~~~~~~~yeGI~ADYl~li~~~L~~~i~v~~y~~r~~al~AL~~GeIDl~ 121 (1197) T PRK09959 55 KKNLVIAVHKSQTATLLHTDSQQRVRGINADYLNLLKRALNIKLTLREYADHQKAMDALEEGEVDIV 121 (1197) T ss_pred CCEEEEEEECCCCCCEEEECCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCCCEE T ss_conf 8858997106889996772799864684799999999986898799956999999999975997563 No 47 >PRK09791 putative DNA-binding transcriptional regulator; Provisional Probab=94.93 E-value=0.32 Score=27.73 Aligned_cols=198 Identities=13% Similarity=0.158 Sum_probs=108.3 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEE Q ss_conf 45883799425886999999999999961069848998564057888851498139998404641356666553047023 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKL 102 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~ 102 (309) ....+|+||.........+..++....++.=+.++++..+++..+.+.|.+|++|+......+.... ..+.. T Consensus 92 ~~~G~l~Ig~~~~~~~~~lp~~l~~f~~~~P~v~i~i~~~~~~~~~~~l~~g~~D~~i~~~~~~~~~--------~~l~~ 163 (302) T PRK09791 92 QLAGQINIGMGASISRSLMPAVISRFHQQHPQVKVRIMEGQLVSMINELRQGELDFTINTYYQGPYD--------HEFTF 163 (302) T ss_pred CCCCEEEEEECHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCCEEEECCCCCCCC--------CCEEE T ss_conf 7663399986268898711999999999887977999989999999999879977899656788788--------76479 Q ss_pred ECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHH Q ss_conf 01356685356786344641234553565555665442013578543200555876640354522100103665201899 Q gi|254780342|r 103 VAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQ 182 (309) Q Consensus 103 ~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~ 182 (309) .........++++..-+..+..++.++.++ ..+...+.........+.....+.... ..+...+ +.. T Consensus 164 --~~l~~~~~~~v~~~~hpl~~~~~l~~l~~~------~~~~~~~~~~~~~~~~~~~~~~g~~p~-~~~~~~~----~~~ 230 (302) T PRK09791 164 --EKLLEKQFAIFCRPGHPAIGARSIKQLLDY------SWTMPTPHGSYYKQLSELLDDQAQTPQ-VGVVCET----FSA 230 (302) T ss_pred --EEEEEEEEEEEECCCCCCCCCCCHHHHHCC------CEEEECCCCCHHHHHHHHHHHCCCCCC-EEEEECC----HHH T ss_conf --997740159998389843579899998479------938757999799999999996799985-6999992----999 Q ss_pred HHHHHHCCCCEEEEEECCCHH----HHCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC Q ss_conf 999996599369994124223----201585140078654677865835442777787488999998740 Q gi|254780342|r 183 IRRDQRNNIPAVFLSWEPHPI----NSDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI 248 (309) Q Consensus 183 l~~A~~~~~~~v~~~wsp~~~----~~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~ 248 (309) +...+.+|..+.+ -|... ....+++.++.+... |...+..+.+++- ...|.+..|++-+ T Consensus 231 ~~~lv~~g~gv~i---lp~~~~~~~~~~~~lv~lpl~~~~---~~~~~~lv~r~~~-~~spa~~~fi~~l 293 (302) T PRK09791 231 CISLVAKSDFLSI---LPEEMGCDPLHGQGLVMLPVSEIL---PKAAYYLIQRRDS-RQTPLTASLITQF 293 (302) T ss_pred HHHHHHHCCEEEE---ECHHHHHHHHHCCCEEEEECCCCC---CCEEEEEEEECCC-CCCHHHHHHHHHH T ss_conf 9999997896796---319999878748999999799988---6008999998909-8899999999999 No 48 >PRK04168 hypothetical protein; Provisional Probab=94.87 E-value=0.33 Score=27.63 Aligned_cols=201 Identities=11% Similarity=0.102 Sum_probs=98.1 Q ss_pred HHHHHHHHHHHHHHHHHCC---CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHC-C-CC Q ss_conf 5799999999999863035---83458837994258869999999999999610698489985640578888514-9-81 Q gi|254780342|r 2 YKILAVCLFLTTFSISYAR---DADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKN-K-GI 76 (309) Q Consensus 2 kk~~~~~~~~~~~~~~~~~---~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~-G-~i 76 (309) +++++++++++.+++++|. .+....+++|-....-.. ...+|-++--++.-|.+|....+++..+-+-+.. | .. T Consensus 6 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~Vf~AGSL~~-pf~ei~~~Fe~~~p~v~v~~e~~GS~~~ar~Ite~g~~a 84 (336) T PRK04168 6 RKIVIVILLLLALVFLGCVNTSEAEPPGKLKIFHAGSLSV-PFEEYEKEFEAYYPNVDVQREAGGSVACVRKITELGKPA 84 (336) T ss_pred EEHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEECCCCHH-HHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHCCCCC T ss_conf 2203799999999970555777778861599997552589-999999999987899659998473899999999579998 Q ss_pred EEEEEEECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEHHH---HHCCCCCHHHHHHHHHHHHHCCCCCCCCC-CCC Q ss_conf 39998404641356666553047023013566853567863446---41234553565555665442013578543-200 Q gi|254780342|r 77 DIFMGYWYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAVNDVG---FALGIKSYQDIAKYKKELGAKIYGIEPGN-EGN 152 (309) Q Consensus 77 D~~~e~w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~---~~~~i~~~~dl~~~~~~~~~~~~g~~~~~-~~~ 152 (309) |++.++-.........+...+ -...+..+.-.++++... .+.+-.+|-++....+ .++--++|.. .|. T Consensus 85 DVfaSAD~~~i~~ll~p~~ad-----w~v~FA~N~lViayt~~Sk~a~ei~~~nWyeiL~~~~---Vk~g~sdP~~DP~G 156 (336) T PRK04168 85 DILASADYTLIPKMMIPEYAD-----WYVMFATNEIVLAYTDKSKYADEINSDNWYEILQRPG---VKWGFSDPNDDPCG 156 (336) T ss_pred CEEEECCHHHHHHHHCCCCCC-----CEEEECCCEEEEEECCCCCCCCCCCCCCHHHHHCCCC---CEEEECCCCCCCCH T ss_conf 889989857878874644334-----1012115538999768882110057331889857798---57983799889860 Q ss_pred H--HHHHHHHHCCCCCCC------------------------------EEEECCCCCCHHHHHHHHHHCCCCEEEEEECC Q ss_conf 5--558766403545221------------------------------00103665201899999996599369994124 Q gi|254780342|r 153 Q--RILDMINNNKFSLKG------------------------------FRLIEASELASFSQIRRDQRNNIPAVFLSWEP 200 (309) Q Consensus 153 ~--~~~~~~~~~~l~~~~------------------------------~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp 200 (309) . ..--++....|+..+ ..++...+ .++...++.|+.+.++.|.. T Consensus 157 yrt~~~~~LAe~~Y~~pgl~~~l~~~~~n~~~~~~~i~~p~~~~~n~~k~~~rp~e----~~Ll~~lesG~~Dy~f~Y~S 232 (336) T PRK04168 157 YRSLMVLQLAELYYNDPTIYDKLVLKNTNFYVNGTYIRSPKEILVNRPKIFVRPKE----VELLGLLETGAMDYLFIYKS 232 (336) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHCCCCCCCCCCCCCHHHHCCCCCCCEECCCH----HHHHHHHHCCCCCEEEEEEE T ss_conf 89999999999860684178888640355334542235604311455431105526----77777875588646987624 Q ss_pred CHHHHCCCCEEECCCCC Q ss_conf 22320158514007865 Q gi|254780342|r 201 HPINSDLNIHYLPGGEE 217 (309) Q Consensus 201 ~~~~~~~dl~~Ledp~~ 217 (309) ...-+ +|.+++.|+. T Consensus 233 ~A~q~--~L~yv~LP~e 247 (336) T PRK04168 233 VAVQH--NLKYIELPDE 247 (336) T ss_pred HHHHC--CCCCCCCCHH T ss_conf 48676--9970038333 No 49 >TIGR01256 modA molybdate ABC transporter, periplasmic molybdate-binding protein; InterPro: IPR005950 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry describes the molybdate ABC transporter periplasmic binding protein in bacteria and archae. Several of the periplasmic receptors constitute a diverse class of binding proteins that differ widely in size, sequence and ligand specificity. It has been shown experimentally by radioactive labeling that ModA represents a hydrophylioc periplasmic-binding protein in Gram-negative organisms and its counterpart in Gram-positive organisms is a lipoprotein. The other components of the system include ModB, an integral membrane protein and ModC, the ATP-binding subunit. Almost all of them display a common beta/alpha folding motif and have similar tertiary structures consisting of two globular domains.; GO: 0015412 molybdate transmembrane-transporting ATPase activity, 0015689 molybdate ion transport, 0030288 outer membrane-bounded periplasmic space. Probab=94.57 E-value=0.39 Score=27.17 Aligned_cols=200 Identities=13% Similarity=0.053 Sum_probs=125.7 Q ss_pred HHHHHHHHHHHHHCCC-CEEEEEECCHHHHHHHHHCC-CCEEEEEEECCCCHHHHHHHHHCCCEEEECC-CCCCCEEEEE Q ss_conf 9999999999961069-84899856405788885149-8139998404641356666553047023013-5668535678 Q gi|254780342|r 39 AATTAMTSVILEEILG-YKTNIKLLAVPVTFRSLKNK-GIDIFMGYWYPSLEKFIAPYLEEGSIKLVAE-NLQGAKYMLA 115 (309) Q Consensus 39 ~v~a~i~~~iLE~~lG-y~Ve~~~~~~~~~~~al~~G-~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 115 (309) .++.+|.++-=+ +.| -+|-+..+++..++.-+.+| -.|++.++-. ..+++..+++.++.... .+.++..-++ T Consensus 6 ~Al~~i~~~F~k-~~g~~~Vv~sFgsSg~L~~QI~~GAP~D~F~SA~~----~~~~~L~~~g~~v~~~~~~~~~n~LVL~ 80 (225) T TIGR01256 6 DALKEIAKQFEK-RTGKNKVVFSFGSSGTLYTQIENGAPADVFISADE----KRPKKLVDKGLVVKGSEFVYAGNKLVLI 80 (225) T ss_pred HHHHHHHHHHHC-CCCCCEEEEEECCCHHHHHHHHCCCCEEEEECCCC----CCCHHHHHCCCEECCCCEEEEEEEEEEE T ss_conf 899999988620-08995699987681789999971797887661463----4540466378730254458986218998 Q ss_pred EEHHHHHCCCCCHHHHHHHH--H--HHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCC-EEEECCCCCCHHHHHHHHHHCC Q ss_conf 63446412345535655556--6--54420135785432005558766403545221-0010366520189999999659 Q gi|254780342|r 116 VNDVGFALGIKSYQDIAKYK--K--ELGAKIYGIEPGNEGNQRILDMINNNKFSLKG-FRLIEASELASFSQIRRDQRNN 190 (309) Q Consensus 116 v~~~~~~~~i~~~~dl~~~~--~--~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~s~~~~~a~l~~A~~~~ 190 (309) .+.... ..++.+.|.+.. . .++.--+|-|........-.+.+...++-... .+++.+.+ -..++.+=+++| T Consensus 81 ~~~~~~--~~~~~~~l~~p~~~~~l~~~~vaigdP~~~P~G~~A~E~L~~lgl~~~l~~kLV~g~P--~vrQA~~~V~~~ 156 (225) T TIGR01256 81 SPKNSR--VVDSLDILKKPGWVIVLKDKTVAIGDPKHAPYGAAAKEVLQKLGLWEKLKKKLVYGEP--DVRQALQFVETG 156 (225) T ss_pred ECCCCC--CCCCHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHCCEEECCC--CHHHHHHHHHHC T ss_conf 535542--0130221138974899863702346824443579999999707976256448652488--899998887725 Q ss_pred CCEEEEEECCCHHHHC-CCCEEECCCCCCCCCCHHHHHHHHCHH--------HHH--HCHHHHHHHHHC Q ss_conf 9369994124223201-585140078654677865835442777--------787--488999998740 Q gi|254780342|r 191 IPAVFLSWEPHPINSD-LNIHYLPGGEEISGFGEASVYTVVRSD--------YLD--KCPNISRLLKNI 248 (309) Q Consensus 191 ~~~v~~~wsp~~~~~~-~dl~~Ledp~~~~~~p~~~~~~vv~~~--------~~~--~~P~~~~~L~~~ 248 (309) +-.+..++.++-.-.. -+..++--|.+.+- |-.+...++..+ |.+ +.|++.++|+++ T Consensus 157 nA~~GiVa~Sdv~~~~K~~~~v~~~p~~ly~-Pi~yp~~i~~~g~n~~~a~~f~dyL~sp~A~~il~~Y 224 (225) T TIGR01256 157 NAPIGIVALSDVIPSGKKVGSVAVIPEDLYK-PIKYPAVIVKGGKNNAAAKAFIDYLKSPEAKEILRKY 224 (225) T ss_pred CCCEEEEEHHEEEECCCCCCEEEEECHHHCC-CEEEEEEEEECCCCHHHHHHHHHHHCCHHHHHHHHCC T ss_conf 8866687311022000366279996331279-6021178871687608999999870796476786305 No 50 >smart00062 PBPb Bacterial periplasmic substrate-binding proteins. bacterial proteins, eukaryotic ones are in PBPe Probab=94.52 E-value=0.4 Score=27.09 Aligned_cols=170 Identities=14% Similarity=0.168 Sum_probs=90.6 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEE--ECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEE Q ss_conf 99999999999610698489985640578888514981399984--0464135666655304702301356685356786 Q gi|254780342|r 39 AATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGY--WYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAV 116 (309) Q Consensus 39 ~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~--w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 116 (309) -+...++.++.+ .+|++++++..+..-++.++.+|++|+.+.. |.+.....+ .. ...+......+.+ T Consensus 24 G~~~Dil~~~~~-~lg~~i~~~~~~~~r~~~~l~~g~~D~~~~~~~~~~~r~~~~---------~~-s~p~~~~~~~~~~ 92 (219) T smart00062 24 GFDVDLAKAIAK-ELGLKVEFVEVSFDNLLTALKSGKIDVVAAGMTITPERAKQV---------DF-SDPYYKSGQVILV 92 (219) T ss_pred EEHHHHHHHHHH-HHCCCEEEEECCHHHHHHHHHCCCCCEEEEECCCCHHHHHCC---------CC-CCCEECCCEEEEE T ss_conf 829999999999-979967999847899999997589770531135786775002---------22-4550024447999 Q ss_pred EHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEE Q ss_conf 34464123455356555566544201357854320055587664035452210010366520189999999659936999 Q gi|254780342|r 117 NDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFL 196 (309) Q Consensus 117 ~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~ 196 (309) ++ ...+.++++|. +..+|...++... ..+.....+..... .. + ..++..++.+|+.+++. T Consensus 93 ~~---~~~~~~~~dL~-------g~~i~~~~g~~~~----~~l~~~~~~~~~~~-~~-~----~~~~~~~l~~g~vD~~i 152 (219) T smart00062 93 RK---DSPIKSLEDLK-------GKKVAVVAGTTGE----ELLKKLYPEAKIVS-YD-S----QAEALAALKAGRADAAV 152 (219) T ss_pred EC---CCCCCCHHHHC-------CCEEEEECCCHHH----HHHHHHCCCCEEEE-EC-C----HHHHHHHHHCCCEEEEE T ss_conf 65---89979979967-------9779995798799----99997287852898-59-9----99999999839879999 Q ss_pred EECCC--HHHHCCCC---EEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHH Q ss_conf 41242--23201585---14007865467786583544277778748899999874 Q gi|254780342|r 197 SWEPH--PINSDLNI---HYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKN 247 (309) Q Consensus 197 ~wsp~--~~~~~~dl---~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~ 247 (309) .+.+. +.+.+... .....+.. .+.....++++ .+|+..+.|++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~----~~~~l~~~~n~ 200 (219) T smart00062 153 ADAPALAALVKQHGLPELKIVGDPLD----TPEGYAFAVRK----GDPELLDKINK 200 (219) T ss_pred ECHHHHHHHHHHCCCCCEEEECCCCC----CCCEEEEEEEC----CCHHHHHHHHH T ss_conf 77999999998489998378346777----76419999959----99999999999 No 51 >PRK11242 DNA-binding transcriptional regulator CynR; Provisional Probab=94.41 E-value=0.42 Score=26.93 Aligned_cols=193 Identities=11% Similarity=0.069 Sum_probs=108.5 Q ss_pred CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEE Q ss_conf 58837994258869999999999999610698489985640578888514981399984046413566665530470230 Q gi|254780342|r 24 SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLV 103 (309) Q Consensus 24 ~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~ 103 (309) ..++|+||....-....+..++....++.=+.++++..+.+..+...|.+|++|+..-.-.+.... +. T Consensus 89 ~~G~l~ig~~~~~~~~~l~~~l~~f~~~~P~i~i~l~~~~~~~~~~~l~~~~~D~~i~~~~~~~~~----------l~-- 156 (292) T PRK11242 89 SRGSLRLAMTPTFTAYLIGPLIDAFHARYPGITLTIREMPQERIEALLADDELDVGIAFAPVHSPE----------IE-- 156 (292) T ss_pred CCCEEEEECHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCCCEEEEECCCCCCC----------CE-- T ss_conf 776586302457788873899888886488973489976889999998679855799836888877----------25-- Q ss_pred CCCCCCCEEEEEEEHHHHH---CCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHH Q ss_conf 1356685356786344641---2345535655556654420135785432005558766403545221001036652018 Q gi|254780342|r 104 AENLQGAKYMLAVNDVGFA---LGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASF 180 (309) Q Consensus 104 ~~~~~~~~~~~~v~~~~~~---~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~ 180 (309) ..........+++++.-+. ....++.+|.+. ..+...++...+..........++... ..+...+ + T Consensus 157 ~~~l~~~~~~~v~~~~h~la~~~~~is~~~L~~~------~~i~~~~~~~~~~~~~~~~~~~~i~~~-~~~~~~~----~ 225 (292) T PRK11242 157 AQPLFTETLALVVGRTHPLAARRKPLTLDELADE------PLVLLSAEFATREQIDRYFRRHGITPR-VAIEANS----I 225 (292) T ss_pred EEEECCCEEEEEEECCCHHHHCCCCCCHHHHCCC------CEEEECCCCCHHHHHHHHHHHCCCCCC-EEEEECC----H T ss_conf 6542045079999089804227999999998189------989967998579999999997699985-3675388----9 Q ss_pred HHHHHHHHCCCCEEEEEECCCHHHHCC-CCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHH Q ss_conf 999999965993699941242232015-851400786546778658354427777874889999987 Q gi|254780342|r 181 SQIRRDQRNNIPAVFLSWEPHPINSDL-NIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLK 246 (309) Q Consensus 181 a~l~~A~~~~~~~v~~~wsp~~~~~~~-dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~ 246 (309) ..+...++.|-.+.+ -|++....+ +++.+.-+... |.-.+..+.+++ ....|.+..|++ T Consensus 226 ~~~~~~V~~g~Gi~i---lP~~~~~~~~~l~~i~l~~~~---~~r~i~lv~~k~-~~~s~~~~~Fid 285 (292) T PRK11242 226 SAVLEIVRRGRLATL---LPAAIAREHDDLCAVALDPPL---PQRTAALLRRKG-AYRSAAARAFIE 285 (292) T ss_pred HHHHHHHHHCCEEEE---EHHHHHHHCCCEEEEECCCCC---CEEEEEEEEECC-CCCCHHHHHHHH T ss_conf 999999997996997---028986008998999898998---703999999893-988999999999 No 52 >PRK12683 transcriptional regulator CysB-like protein; Reviewed Probab=94.35 E-value=0.44 Score=26.85 Aligned_cols=207 Identities=11% Similarity=-0.042 Sum_probs=120.0 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEE Q ss_conf 45883799425886999999999999961069848998564057888851498139998404641356666553047023 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKL 102 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~ 102 (309) ....+|+||.......+.+..++....++.=+.++++..+++..+.+.|.+|++|+..-.-...... .+.. T Consensus 90 ~~~G~l~Ig~~~~~~~~~lp~~l~~f~~~~P~v~i~l~~~~~~~~~~~L~~g~~Dl~i~~~~~~~~~---------~l~~ 160 (308) T PRK12683 90 RDSGRLTVATTHTQARYALPKVVRQFTEVFPKVHLALRQGSPQEIAEMLLNGEADIGIATEALDREP---------DLVS 160 (308) T ss_pred CCCCEEEECCCCHHHHHCCCHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCCEEECCCCCCCCC---------CCEE T ss_conf 7776242001320112016846689998789966999607869999999779977880666778887---------7168 Q ss_pred ECCCCCCCEEEEEEEHHHHHC--CCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHH Q ss_conf 013566853567863446412--345535655556654420135785432005558766403545221001036652018 Q gi|254780342|r 103 VAENLQGAKYMLAVNDVGFAL--GIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASF 180 (309) Q Consensus 103 ~~~~~~~~~~~~~v~~~~~~~--~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~ 180 (309) . ........++++..-+.. .-.++++|.+ ..++..++++..+..........++... ..+...+ . T Consensus 161 ~--~l~~~~~~lv~~~~hpla~~~~v~l~dl~~------~p~I~~~~~~~~r~~i~~~~~~~g~~p~-~~~e~~~----~ 227 (308) T PRK12683 161 F--PYYSWHHVVVVPKGHPLTGRENLTLEALAE------YPIITYDQGFTGRSHIDQAFAEAGAVPD-IVLTALD----A 227 (308) T ss_pred E--EECCCCEEEEECCCCCCCCCCCCCHHHHCC------CCEEECCCCCCHHHHHHHHHHHCCCCCC-EEEEECC----H T ss_conf 7--612451589846999533699999999859------9989618998299999999997799985-7999998----9 Q ss_pred HHHHHHHHCCCCEEEEEECCCHHHHC---CCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC--CCCHHHH Q ss_conf 99999996599369994124223201---585140078654677865835442777787488999998740--8998999 Q gi|254780342|r 181 SQIRRDQRNNIPAVFLSWEPHPINSD---LNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI--KFSVALE 255 (309) Q Consensus 181 a~l~~A~~~~~~~v~~~wsp~~~~~~---~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~--~ls~~~~ 255 (309) ..+.+.+..|-.+.+ -|...+.. -+++.+.-+ +.+ ++..+..+.|++ ..-.|.+.+|++-+ +++.+++ T Consensus 228 ~~i~~~V~~g~Gi~i---lp~~~~~~~~~~~lv~l~~~-~~~--~~~~~~i~~rk~-~~Ls~~a~~Fie~l~~~l~~~~~ 300 (308) T PRK12683 228 DVIKTYVELGMGVGI---VAAMAYDPQRDTGLVALDTQ-HLF--EANTTRVALRRG-AYLRGYAYRFIEMFAPHLSEAEI 300 (308) T ss_pred HHHHHHHHHCCCHHH---HHHHHHHHHHCCCEEEEECC-CCC--CCCEEEEEECCC-CCCCHHHHHHHHHHHHHHCHHHH T ss_conf 999999997984707---89999878860997999897-887--661799999096-85799999999998886298999 Q ss_pred HHH Q ss_conf 999 Q gi|254780342|r 256 NEM 258 (309) Q Consensus 256 ~~l 258 (309) ..- T Consensus 301 ~~~ 303 (308) T PRK12683 301 AGA 303 (308) T ss_pred HHH T ss_conf 998 No 53 >PRK09986 DNA-binding transcriptional activator XapR; Provisional Probab=94.24 E-value=0.46 Score=26.69 Aligned_cols=197 Identities=10% Similarity=0.081 Sum_probs=107.5 Q ss_pred CCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEE Q ss_conf 34588379942588699999999999996106984899856405788885149813999840464135666655304702 Q gi|254780342|r 22 ADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIK 101 (309) Q Consensus 22 ~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~ 101 (309) .....+|+||...-.....+..+++...++.=+.++++..+.+....+.|.+|++|+..-.- +... . . .+. T Consensus 77 ~~~~g~l~Ig~~~~~~~~~l~~~l~~f~~~~P~i~v~i~~~~~~~~~~~l~~~~~D~a~~~~-~~~~---~----~-~~~ 147 (278) T PRK09986 77 RGEAGRIELGVVGTALWGRMRPAMRHFLKENPNVEVLFREKSPSMQMALLERRELDAGIWRM-ALEP---N----P-GFT 147 (278) T ss_pred CCCCCCEEEEECHHHHHHHHHHHHHHHHHHCCCEEEEEEECCHHHHHHHHHCCCCCEEEECC-CCCC---C----C-CCE T ss_conf 57777336874216888764289999998789839999978869999999779975899637-7778---8----7-742 Q ss_pred EECCCCCCCEEEEEEEHHHH--HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCH Q ss_conf 30135668535678634464--1234553565555665442013578543200555876640354522100103665201 Q gi|254780342|r 102 LVAENLQGAKYMLAVNDVGF--ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELAS 179 (309) Q Consensus 102 ~~~~~~~~~~~~~~v~~~~~--~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~ 179 (309) .. ........+++++.-+ ....-++++|.. ..++..++...............+..... ....+ - T Consensus 148 ~~--~l~~~~~~~~~~~~hpla~~~~i~~~~l~~------~~~i~~~~~~~~~~~~~~~~~~~g~~~~~--~~~~~---~ 214 (278) T PRK09986 148 SL--RLHESAFAVAVPEEHLLASKSSVPLKALRN------EYFVTLPSVHSDWGFLQRVCQQAGFSPQI--IREVN---E 214 (278) T ss_pred EE--EEEECCCEEEECCCCCCCCCCCCCHHHHCC------CCEEEECCCCCHHHHHHHHHHHCCCCCCE--EEEEC---C T ss_conf 68--877416379965987333699999999859------99798679986699999999977998517--99999---8 Q ss_pred HHHHHHHHHCCCCEEEEEECCCHHHHC--CCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHCC Q ss_conf 899999996599369994124223201--5851400786546778658354427777874889999987408 Q gi|254780342|r 180 FSQIRRDQRNNIPAVFLSWEPHPINSD--LNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNIK 249 (309) Q Consensus 180 ~a~l~~A~~~~~~~v~~~wsp~~~~~~--~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ 249 (309) ...+...++.|-.+.+ -|...... -+++++.-... ++.++..+-|++ ..+|.+.+|+..++ T Consensus 215 ~~~~~~~V~~G~Giai---lp~~~~~~~~~~v~~~Pl~~~----~~~~~~lv~~~~--~~~p~~~~fi~~l~ 277 (278) T PRK09986 215 PQTVLAMVSMGIGITL---VADSYAQMPWPGVVFRPLKER----IPADLYIVYHQQ--QATPALEKLLAALT 277 (278) T ss_pred HHHHHHHHHHCCEEEE---EHHHHHCCCCCCEEEEECCCC----CCEEEEEEECCC--CCCHHHHHHHHHHC T ss_conf 9999999997998998---417774237899999988999----803899998499--99999999999865 No 54 >CHL00180 rbcR LysR transcriptional regulator; Provisional Probab=93.78 E-value=0.56 Score=26.13 Aligned_cols=201 Identities=11% Similarity=0.094 Sum_probs=115.5 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEE-EECCCCHHHHHHHHHCCCEE Q ss_conf 45883799425886999999999999961069848998564057888851498139998-40464135666655304702 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMG-YWYPSLEKFIAPYLEEGSIK 101 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e-~w~p~~~~~~~~~~~~~~~~ 101 (309) ....+|+||........++..++....++.=+.++++..+++..+.+.|.+|++|+..- ...|... ...+. T Consensus 92 ~~~g~lrI~~~~s~~~~~lp~~l~~f~~~~P~v~i~l~~~~~~~~~~~l~~~~~D~~i~~~~~~~~~--------~~~l~ 163 (307) T CHL00180 92 LQGGTLIIGASQTTGTYLMPRLIGLFRQKYPQIAVQLQVHSTRRIAWSVANGQIDLAIIGGEVPTEL--------KDNLQ 163 (307) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCEEEEEECCCCCCCC--------CCCEE T ss_conf 1258601010406666643588999999888997278977999999999879800999757788666--------78549 Q ss_pred EECCCCCCCEEEEEEEHHHH--HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCH Q ss_conf 30135668535678634464--1234553565555665442013578543200555876640354522100103665201 Q gi|254780342|r 102 LVAENLQGAKYMLAVNDVGF--ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELAS 179 (309) Q Consensus 102 ~~~~~~~~~~~~~~v~~~~~--~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~ 179 (309) . ..+......++++..-+ .....+++||.+ ...++.+.+...+......+...++....+... .+... T Consensus 164 -~-~~l~~~~~~lv~~~~hpla~~~~v~~~dL~~------~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 233 (307) T CHL00180 164 -V-TPYAEDELALILPKSHPFAKLKKIQKEDLYR------LRFIALDSQSTIRKVIDNILIQNGIDSSRFKIE--MELNS 233 (307) T ss_pred -E-EEEECCCEEEEECCCCHHHCCCCCCHHHHHC------CCEEEECCCCCHHHHHHHHHHHCCCCCCCCEEE--EEECC T ss_conf -9-9943242799973898022399999999817------984871799968999999999769984554279--99780 Q ss_pred HHHHHHHHHCCCCEEEEEECCCHHHHC----CCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC Q ss_conf 899999996599369994124223201----585140078654677865835442777787488999998740 Q gi|254780342|r 180 FSQIRRDQRNNIPAVFLSWEPHPINSD----LNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI 248 (309) Q Consensus 180 ~a~l~~A~~~~~~~v~~~wsp~~~~~~----~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~ 248 (309) ...+..++..|-.+.+ -|...+.. ..++.++-+... ....+..+.+++ ....|.+..|++.+ T Consensus 234 ~~~l~~~v~~g~Gia~---lP~~~v~~~~~~g~l~~~~~~~~~---~~r~i~lv~~~~-r~~s~a~~~F~~~l 299 (307) T CHL00180 234 IEAIKNAVQSGLGAAF---VSVSAIEKELELGTLHWAKIENIT---IKRTLSIITNPN-RYRSKAAETFSKEI 299 (307) T ss_pred HHHHHHHHHHCCEEEE---CCHHHHHHHHHCCCEEEEECCCCC---CEEEEEEEEECC-CCCCHHHHHHHHHH T ss_conf 9999999993994998---259999989877988999789998---504999999795-97799999999999 No 55 >PRK12684 transcriptional regulator CysB-like protein; Reviewed Probab=93.58 E-value=0.61 Score=25.91 Aligned_cols=207 Identities=11% Similarity=0.019 Sum_probs=118.8 Q ss_pred CCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEE Q ss_conf 34588379942588699999999999996106984899856405788885149813999840464135666655304702 Q gi|254780342|r 22 ADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIK 101 (309) Q Consensus 22 ~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~ 101 (309) .....+++||.......+.+..++....++.=+.++++..+++..+.+.+.+|++|+..-.-.+.. ...+. T Consensus 89 ~~~~G~l~ig~~~s~~~~~lp~~l~~f~~~~P~v~l~l~~~~~~~~~~~l~~g~~Dl~i~~~~~~~---------~~~l~ 159 (313) T PRK12684 89 AQDQGNLTIATTHTQARYALPAAIAEFKKRYPKVRLSILQGSPTQIAEMVIHDQADLAIATEAIAD---------YKELV 159 (313) T ss_pred CCCCCEEECCHHHHHHHHCCCHHHHHHHHHCCCEEEEEEECCHHHHHHHHHCCCCCEEECCCCCCC---------CCCEE T ss_conf 677751411024555751594999999844898089997178899999997799665504357777---------78738 Q ss_pred EECCCCCCCEEEEEEEHHHH--HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCH Q ss_conf 30135668535678634464--1234553565555665442013578543200555876640354522100103665201 Q gi|254780342|r 102 LVAENLQGAKYMLAVNDVGF--ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELAS 179 (309) Q Consensus 102 ~~~~~~~~~~~~~~v~~~~~--~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~ 179 (309) .. +.+. ....++++..-+ ...--++++|.+ ...+..+++...+..........|+... ..+...+ T Consensus 160 ~~-~l~~-~~~~~v~~~~hpla~~~~v~l~dL~~------~p~I~~~~~~~~r~~i~~~~~~~g~~p~-~~~e~~~---- 226 (313) T PRK12684 160 SL-PCYQ-WNHAVVVPPDHPLLERKPLTLEDLAQ------YPLITYDDAFAGRSKINKAFALRGLKPD-IVLEAID---- 226 (313) T ss_pred EE-EEEE-CCEEEEECCCCCCCCCCCCCHHHHCC------CCEEECCCCCCHHHHHHHHHHHCCCCCC-EEEEECC---- T ss_conf 99-8032-21689953898544699999999859------9989628998589999999997799874-7999997---- Q ss_pred HHHHHHHHHCCCCEEEEEECCCHHHH---CCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC--CCCHHH Q ss_conf 89999999659936999412422320---1585140078654677865835442777787488999998740--899899 Q gi|254780342|r 180 FSQIRRDQRNNIPAVFLSWEPHPINS---DLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI--KFSVAL 254 (309) Q Consensus 180 ~a~l~~A~~~~~~~v~~~wsp~~~~~---~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~--~ls~~~ 254 (309) ...+.+.+..|-.+-+ -|...+. ..+++.++... . .++..+..+.+++ .--.|.+..|++-+ +|+-+. T Consensus 227 ~~~i~~~V~~GlGi~i---lP~~a~~~~~~~~L~~lp~~~-~--~~~~~~~v~~~k~-~~ls~~~r~FIe~l~e~l~r~~ 299 (313) T PRK12684 227 ADVIKTYVELGLGVGI---VADMAFDPERDRNLRAIPAGH-L--FGSNVTRVALKQG-AYLRGYVYTFIELFSPTLNRKL 299 (313) T ss_pred HHHHHHHHHHCCCHHH---HHHHHHCHHHCCCEEEEECCC-C--CCCEEEEEEEECC-CCCCHHHHHHHHHHHHHCCHHH T ss_conf 9999999995985725---599984676389879997846-7--7660599999798-8668999999999874659899 Q ss_pred HHH Q ss_conf 999 Q gi|254780342|r 255 ENE 257 (309) Q Consensus 255 ~~~ 257 (309) +.+ T Consensus 300 ~~~ 302 (313) T PRK12684 300 VEQ 302 (313) T ss_pred HHH T ss_conf 999 No 56 >PRK10341 DNA-binding transcriptional activator TdcA; Provisional Probab=93.24 E-value=0.69 Score=25.56 Aligned_cols=198 Identities=11% Similarity=0.098 Sum_probs=103.1 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEE Q ss_conf 45883799425886999999999999961069848998564057888851498139998404641356666553047023 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKL 102 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~ 102 (309) .....|+||........++..++....+..=+.++++..+.+..+.+.+.+|++|+....-..... .. .+.. T Consensus 94 ~~~~~l~ig~~~~~~~~~lp~~l~~f~~~~P~v~i~l~~~~~~~~~~~l~~g~~D~~i~~~~~~~~-------~~-~~~~ 165 (312) T PRK10341 94 EAVVDVSFGFPSLIGFTFMSDMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSNEMK-------LQ-DLHV 165 (312) T ss_pred CCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEECCCCCC-------CC-CEEE T ss_conf 873257435406889999999999999988897799998999999999966986199974368777-------78-7499 Q ss_pred ECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHH Q ss_conf 01356685356786344641234553565555665442013578543200555876640354522100103665201899 Q gi|254780342|r 103 VAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQ 182 (309) Q Consensus 103 ~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~ 182 (309) ...+ .....+++++.-+...-.++++|.+. ..+....+.+.+......+...++.... .+...+ +.. T Consensus 166 -~~l~-~~~~~lv~~~~~p~~~~~~l~~L~~~------~~il~~~~~~~~~~~~~~~~~~g~~~~~-~~~~~s----~~~ 232 (312) T PRK10341 166 -EPLF-ESEFVLVASKSRTCTGTTTLESLKNE------QWVLPQTNMGYYSELLTTLQRNGISIEN-IVKTDS----VVT 232 (312) T ss_pred -EEEE-CCCEEEEECCCCCCCCCCCHHHHCCC------CEEEECCCCCHHHHHHHHHHHCCCCCCC-EEEECH----HHH T ss_conf -9951-24289997675423489798998079------8686069985899999999976999881-699890----999 Q ss_pred HHHHHHCCCCEEEEEECCCHHH---HCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC Q ss_conf 9999965993699941242232---01585140078654677865835442777787488999998740 Q gi|254780342|r 183 IRRDQRNNIPAVFLSWEPHPIN---SDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI 248 (309) Q Consensus 183 l~~A~~~~~~~v~~~wsp~~~~---~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~ 248 (309) +...+.+|-.+.+. |.... ...+++.+.-+... |...+..+.+++ ..-.|.+..|++-+ T Consensus 233 i~~lv~~g~gi~~l---P~~~~~~~~~~~L~~lpl~~~~---~~~~~~lv~~k~-~~ls~a~~~fie~~ 294 (312) T PRK10341 233 IYNLVLNADFLTVI---PCDMTSPFGSNQFITIPIKETL---PVARYAAVWSKN-YRIKKAASVLVELA 294 (312) T ss_pred HHHHHHHCCEEEEE---HHHHHHHHCCCCEEEEECCCCC---CEEEEEEEEECC-CCCCHHHHHHHHHH T ss_conf 99999979958972---6999966318989999899988---511899999893-98899999999999 No 57 >COG2984 ABC-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=92.63 E-value=0.84 Score=25.01 Aligned_cols=94 Identities=18% Similarity=0.283 Sum_probs=58.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC--CCCCCEEEECCCCCHHHHHHHHHHHHHHH--CCCC-EEEEEE----CCHH---HHH Q ss_conf 9579999999999986303583--45883799425886999999999999961--0698-489985----6405---788 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA--DSCTPVRFADTGWTDIAATTAMTSVILEE--ILGY-KTNIKL----LAVP---VTF 68 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~--~~~~~I~ig~~~wte~~v~a~i~~~iLE~--~lGy-~Ve~~~----~~~~---~~~ 68 (309) |+|.+...+++.++++++.... ..+++++||...+-|+-.+-+...-+.|. ..|| ++++.. ++.. .+. T Consensus 2 ~~~~~~~~~~~~~L~~~~l~~~~~~~~~~~~VaI~~~veHpaLd~~~~G~~~aLk~~G~~n~~i~~~na~~~~~~a~~ia 81 (322) T COG2984 2 MLRSLSFYLFFAALALAFLSSAGSAAADQITVAITQFVEHPALDAAREGVKEALKDAGYKNVKIDYQNAQGDLGTAAQIA 81 (322) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEECCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHH T ss_conf 26888999999999999997412144443368899850665599999978999986486672899624778857899999 Q ss_pred HHHHCCCCEEEEEEECCCCHHHHHHH Q ss_conf 88514981399984046413566665 Q gi|254780342|r 69 RSLKNKGIDIFMGYWYPSLEKFIAPY 94 (309) Q Consensus 69 ~al~~G~iD~~~e~w~p~~~~~~~~~ 94 (309) .-+.....|+..-.-.|......... T Consensus 82 rql~~~~~dviv~i~tp~Aq~~~s~~ 107 (322) T COG2984 82 RQLVGDKPDVIVAIATPAAQALVSAT 107 (322) T ss_pred HHHHCCCCCEEEECCCHHHHHHHHHC T ss_conf 99614799679961778999999846 No 58 >PRK09508 leuO leucine transcriptional activator; Reviewed Probab=92.44 E-value=0.89 Score=24.85 Aligned_cols=61 Identities=10% Similarity=-0.042 Sum_probs=52.2 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEE Q ss_conf 4588379942588699999999999996106984899856405788885149813999840 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYW 83 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w 83 (309) ....+++|+..+..+..++..+++.+-+..=+..+++....+..+.+.|.+|++|+...+. T Consensus 109 ~~~~~~~i~~~~~~~~~llp~ll~~~~~~~P~v~l~~~~~~~~~~~~~L~~g~iDl~I~~~ 169 (314) T PRK09508 109 SSERVFNLCICSPLDNRLASQIYNRIEQIAPNIHLVFKSSLNQNIEHQLRYQETEFVISYE 169 (314) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEECC T ss_conf 0473699996307899999999999998689977999857706499999769952999678 No 59 >PRK12682 transcriptional regulator CysB-like protein; Reviewed Probab=92.01 E-value=1 Score=24.52 Aligned_cols=204 Identities=11% Similarity=0.054 Sum_probs=114.1 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEE Q ss_conf 45883799425886999999999999961069848998564057888851498139998404641356666553047023 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKL 102 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~ 102 (309) ....+|+||.........+..++....++.=+.++++..+.+..+.+.+..|++|+..-.- +. .....+.. T Consensus 90 ~~~G~lrIg~~~~~~~~~lp~~l~~f~~~~P~v~l~l~~~~~~~~~~~l~~g~~Dl~i~~~-~~--------~~~~~l~~ 160 (309) T PRK12682 90 QDSGTLTIATTHTQARYVLPRVVAKFRKRYPKVNLSLHQGSPDEIAQMVISGEADIGIATE-SL--------DDDPDLVT 160 (309) T ss_pred CCCCCCCCCCCHHHHHHCCCHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCCHHHHHC-CC--------CCCCCEEE T ss_conf 6777412124313332038625399998789837999837829999999769976456507-77--------78988089 Q ss_pred ECCCCCCCEEEEEEEHHHH--HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHH Q ss_conf 0135668535678634464--12345535655556654420135785432005558766403545221001036652018 Q gi|254780342|r 103 VAENLQGAKYMLAVNDVGF--ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASF 180 (309) Q Consensus 103 ~~~~~~~~~~~~~v~~~~~--~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~ 180 (309) . +.+. ....+++++.-+ .....+++||.+. ..+..+++...+..........++... ..+...+ . T Consensus 161 ~-~l~~-~~~~~v~~~~h~la~~~~i~~~dL~~~------p~I~~~~~~~~r~~~~~~~~~~g~~p~-~~~~~~~----~ 227 (309) T PRK12682 161 L-PCYD-WTHAVIVPSDHPLAQKERITLEDLAEY------PLITYHPGFTGRSAIDKAFAAAGLQPD-IVLEAID----S 227 (309) T ss_pred E-EEEC-CCEEEEECCCCHHHCCCCCCHHHHCCC------CEEEECCCCCHHHHHHHHHHHCCCCCC-EEEEECC----H T ss_conf 9-9331-523676169972114999999998289------979727998679999999997799962-6999998----9 Q ss_pred HHHHHHHHCCCCEEEEEECCCHHHH---CCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC--CCCHHHH Q ss_conf 9999999659936999412422320---1585140078654677865835442777787488999998740--8998999 Q gi|254780342|r 181 SQIRRDQRNNIPAVFLSWEPHPINS---DLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI--KFSVALE 255 (309) Q Consensus 181 a~l~~A~~~~~~~v~~~wsp~~~~~---~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~--~ls~~~~ 255 (309) ..+.+.+..|-.+.+ -|..... ..+++.+..+. .. ++..+..+.+++. ...|.+..|++=+ +|+-+.. T Consensus 228 ~~~~~~v~~G~Gi~~---lp~~~~~~~~~~~Lv~v~~~~-~~--~~~~~~l~~rk~~-~l~~~~~~FId~l~e~l~~~~~ 300 (309) T PRK12682 228 DVIKTYVRLGLGVGI---IAEMAYRPDRDGDLVALPAGH-LF--GPNTAWVAVKRGA-YLRNYVYDFIELLAPHLSRELI 300 (309) T ss_pred HHHHHHHHHCCEEEH---HHHHHHHHHHCCCEEEEECCC-CC--CCCEEEEEEECCC-CCCHHHHHHHHHHHHHCCHHHH T ss_conf 999999995994242---289983376479989997988-77--7617999996966-1189999999999987298999 No 60 >PRK01066 porphobilinogen deaminase; Provisional Probab=91.66 E-value=0.95 Score=24.66 Aligned_cols=63 Identities=11% Similarity=-0.084 Sum_probs=48.5 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCC-CCEEEEEE----CCH---H-------------HHHHHHHCCCCEEEEE Q ss_conf 4588379942588699999999999996106-98489985----640---5-------------7888851498139998 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEIL-GYKTNIKL----LAV---P-------------VTFRSLKNKGIDIFMG 81 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~l-Gy~Ve~~~----~~~---~-------------~~~~al~~G~iD~~~e 81 (309) .++++|+||+-+..=..+.+++++..|++.. ++++|++. |+. . .+-++|.+|+||+.+. T Consensus 16 ~~kk~IrIGTR~S~LAl~Qa~~V~~~L~~~~p~~~~ei~~i~T~GD~~~~~~L~~iggkG~Ftkele~aLl~g~iDiAVH 95 (234) T PRK01066 16 LGKRPLRIASRKSPLAKAQVHECLRLLRSFFPKLWFQISTTTTQGDLDQKTPLHLVENTGFFTDDVDFLVLSGKCDLAIH 95 (234) T ss_pred HCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCHHHCCCCCEEHHHHHHHHHCCCCCEEEE T ss_conf 06884699958879999999999999997689973999999633731457768774897454899999997698778987 Q ss_pred EECC Q ss_conf 4046 Q gi|254780342|r 82 YWYP 85 (309) Q Consensus 82 ~w~p 85 (309) .+-. T Consensus 96 SlKD 99 (234) T PRK01066 96 SAKD 99 (234) T ss_pred ECCC T ss_conf 1578 No 61 >PRK11622 putative ABC transporter solute-binding protein; Provisional Probab=90.90 E-value=1.3 Score=23.79 Aligned_cols=250 Identities=13% Similarity=0.144 Sum_probs=109.5 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC-----------CCCCCEEEECCCCCH-----HHHHHHHHHHHHHHCCCCEEEEEEC-C Q ss_conf 9579999999999986303583-----------458837994258869-----9999999999996106984899856-4 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA-----------DSCTPVRFADTGWTD-----IAATTAMTSVILEEILGYKTNIKLL-A 63 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~-----------~~~~~I~ig~~~wte-----~~v~a~i~~~iLE~~lGy~Ve~~~~-~ 63 (309) ||+++.++.+++++.++.++.. +.+.||..= -|-+ .++ ..+..++-| ++|.+++.+.. + T Consensus 2 ~r~~~~~~~~l~~~~~~~~~~~~~~~W~~i~~~A~GqtV~f~--aWGGs~~iN~yi-~w~~~~l~~-~y~ItL~~Vk~~D 77 (401) T PRK11622 2 MRHLLALLGLLALLSLAAAASDAANTWQQILEEAKGQTVYFN--AWGGSPAINRYL-DWVAKELKE-RYGITLKHVKLAD 77 (401) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCEEEEE--EECCCHHHHHHH-HHHHHHHHH-HHCCEEEEECCCC T ss_conf 588999999999875014576444899999997169889999--767957775999-999999999-8596689952545 Q ss_pred HHHHHHHH---------HCCCCEEEEEE-------------ECCCCHHHH-HHHHHCC-C-EEEECCCCCC-------CE Q ss_conf 05788885---------14981399984-------------046413566-6655304-7-0230135668-------53 Q gi|254780342|r 64 VPVTFRSL---------KNKGIDIFMGY-------------WYPSLEKFI-APYLEEG-S-IKLVAENLQG-------AK 111 (309) Q Consensus 64 ~~~~~~al---------~~G~iD~~~e~-------------w~p~~~~~~-~~~~~~~-~-~~~~~~~~~~-------~~ 111 (309) +...-.-| ..|.||+..-- ++|-...+- .+++... . ..+.+....| .+ T Consensus 78 t~~av~rvlaEk~AG~~~~GsvDliWINGENF~~mK~~~LL~gp~~~~LPN~~~vd~~~~v~~DFg~PtdG~EaPWg~aQ 157 (401) T PRK11622 78 IAEAVNRLLAEKQAGRDTGGSVDLVWINGENFRTLKEAGLLYGPFAETLPNWRYVDTQLPVREDFGVPTEGLEAPWGVAQ 157 (401) T ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEEECCHHHHHHHHCCCCCCCHHHHCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCE T ss_conf 89999999999860778898445899886128988867831387666594042058875535557873665516765531 Q ss_pred EEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCC-CCCCCCCCHHHHHHHHHCCCCCCC-EEE------------------ Q ss_conf 567863446412345535655556654420135-785432005558766403545221-001------------------ Q gi|254780342|r 112 YMLAVNDVGFALGIKSYQDIAKYKKELGAKIYG-IEPGNEGNQRILDMINNNKFSLKG-FRL------------------ 171 (309) Q Consensus 112 ~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g-~~~~~~~~~~~~~~~~~~~l~~~~-~~~------------------ 171 (309) ..+.+....-..--.|.+.|......-.+|+.- .+|.+.++...++.+-...-+... ... T Consensus 158 ~vf~ydsa~v~~pP~s~~eLl~wak~nPGrfTYP~PPdF~GstFlkq~L~~l~~~~~~l~~p~~~~~~~~~~~plW~yLd 237 (401) T PRK11622 158 LVFIYDSARVPQPPQSPAELLEWAKANPGRFTYPRPPDFTGTAFLKQLLIELTGDPAALKQPVDDATFARVTAPLWDYLD 237 (401) T ss_pred EEEEECHHHCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCHHHHCCCCCHHHHHHHHHHHHHHHH T ss_conf 89998766559999899999999997899714899998753799999999973985675186788999987389999999 Q ss_pred -------EC-CCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCCCEEECCCCCCCCCCHHHHHHHHCHHHH-----HHC Q ss_conf -------03-66520189999999659936999412422320158514007865467786583544277778-----748 Q gi|254780342|r 172 -------IE-ASELASFSQIRRDQRNNIPAVFLSWEPHPINSDLNIHYLPGGEEISGFGEASVYTVVRSDYL-----DKC 238 (309) Q Consensus 172 -------~~-~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~-----~~~ 238 (309) .. ...+.-.+++.+-+.+||.++...|.|.....+-.=-.|.+.-..|.+... .+.+.-|. ..+ T Consensus 238 ~l~P~LWr~G~~yP~~~~~l~~L~adGEi~~~~sfnp~~a~~~I~~G~lP~t~r~~~~~~G---tigN~hflAIP~NA~~ 314 (401) T PRK11622 238 ELHPYLWRKGKTFPASPAELDQLLADGELDLAFTFNPNEAQAAIANGELPASTRSFAFSDG---TIGNTHFVAIPFNANA 314 (401) T ss_pred HHCHHHHHCCCCCCCCHHHHHHHHHCCCEEEEEECCHHHHHHHHHCCCCCCCEEEEECCCC---CCCCCCCEEEECCCCC T ss_conf 8577888440126899999999984798778840387778889866999974026742698---2167650772079998 Q ss_pred HHHHHHHHHCCCCHHHHHH Q ss_conf 8999998740899899999 Q gi|254780342|r 239 PNISRLLKNIKFSVALENE 257 (309) Q Consensus 239 P~~~~~L~~~~ls~~~~~~ 257 (309) |+.+.++-+|=||.+.+.. T Consensus 315 kagA~VvaNfLLSPEAQ~~ 333 (401) T PRK11622 315 KAGAKVVANFLLSPEAQLR 333 (401) T ss_pred HHHHHHHHHHHCCHHHHHH T ss_conf 7789999999639999997 No 62 >pfam00497 SBP_bac_3 Bacterial extracellular solute-binding proteins, family 3. Probab=90.27 E-value=1.5 Score=23.43 Aligned_cols=175 Identities=15% Similarity=0.156 Sum_probs=87.1 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEHH Q ss_conf 99999999996106984899856405788885149813999840464135666655304702301356685356786344 Q gi|254780342|r 40 ATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAVNDV 119 (309) Q Consensus 40 v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 119 (309) +..+++..+.+. +|++++++...-.-+..++.+|++|+.+..+..+-+ . +..+.... .+......++++.. T Consensus 24 ~~~dil~~i~~~-lg~~~~~~~~p~~~~~~~l~~g~~D~~~~~~~~t~~-R------~~~~~~s~-p~~~~~~~~~~~~~ 94 (224) T pfam00497 24 FDVDLAKAIAKR-LGVKVEFVPVSWDGLIPALQSGKIDVIIAGMTITPE-R------KKKVDFSD-PYYYSGQVLVVRKD 94 (224) T ss_pred EHHHHHHHHHHH-HCCCEEEEECCHHHHHHHHHCCCCCEEECCCCCCHH-H------HHHHCCCC-CEEECCEEEEEECC T ss_conf 299999999999-799679998588999999976998766136778855-5------10212575-34404717999989 Q ss_pred HH--HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEEC-CCCCCHHHHHHHHHHCCCCEEEE Q ss_conf 64--1234553565555665442013578543200555876640354522100103-66520189999999659936999 Q gi|254780342|r 120 GF--ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIE-ASELASFSQIRRDQRNNIPAVFL 196 (309) Q Consensus 120 ~~--~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~s~~~~~a~l~~A~~~~~~~v~~ 196 (309) .. ...+.++.||. ++.+|...+.... ..+... .....++.. .+. .+...++.+|+.+++. T Consensus 95 ~~~~~~~~~~~~dL~-------~~~I~v~~g~~~~----~~l~~~--~~~~~~i~~~~~~----~~~l~~L~~gr~D~~i 157 (224) T pfam00497 95 SPPKIKSIKDLADLK-------GKKVGVQKGTTQE----DLLKEL--APKGAEIVLYDDQ----AEALQALAAGRVDAVV 157 (224) T ss_pred CCCCCCCCCCHHHHC-------CCEEEEECCCHHH----HHHHHH--CCCCCEEEEECCH----HHHHHHHHCCCEEEEE T ss_conf 876545658989977-------9779996897799----999860--6244516995427----8888898749768999 Q ss_pred EECCCH--HHHCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHH Q ss_conf 412422--320158514007865467786583544277778748899999874 Q gi|254780342|r 197 SWEPHP--INSDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKN 247 (309) Q Consensus 197 ~wsp~~--~~~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~ 247 (309) .-.... .+..++...+.-+... .+..+..++++ .+|++.+.+++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~k----~~~~l~~~~n~ 203 (224) T pfam00497 158 ADSPVAAYLIKKNPGLNLVVGEPL---SGEPYGIAVRK----GDPELLAAVNK 203 (224) T ss_pred ECHHHHHHHHHHCCCCEEEECCCC---CCCEEEEEEEC----CCHHHHHHHHH T ss_conf 579999999986899729943677---76149999979----99999999999 No 63 >PRK10837 putative DNA-binding transcriptional regulator; Provisional Probab=89.95 E-value=1.6 Score=23.27 Aligned_cols=195 Identities=10% Similarity=0.018 Sum_probs=101.9 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHH-HHHHHCCCEE Q ss_conf 458837994258869999999999999610698489985640578888514981399984046413566-6655304702 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFI-APYLEEGSIK 101 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~-~~~~~~~~~~ 101 (309) +...+++||...--...++..++....++.=+.++++..+++..+.+.+.+|++|+..-.. |.....+ ... T Consensus 86 ~~~g~l~i~~~~~~~~~~l~~~l~~f~~~~P~v~l~l~~~~~~~~~~~l~~~~~D~~i~~~-~~~~~~l~~~~------- 157 (291) T PRK10837 86 EDNGAIRIYASSTIGNYILPAMIARYRHDYPQLPLELSVGNSQDVIQAVLDFRVDIGLIEG-PCHSTEIISEP------- 157 (291) T ss_pred CCCCEEEEEEEHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHCCCCEEEECC-CCCCCCEEEEE------- T ss_conf 7886299996179999999999999996489935999988889999999808841667417-88998718999------- Q ss_pred EECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHH Q ss_conf 30135668535678634464123455356555566544201357854320055587664035452210010366520189 Q gi|254780342|r 102 LVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFS 181 (309) Q Consensus 102 ~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a 181 (309) ........++.|.+.....-.++++|.+. ..+..+.+.+.+..........+.... .....++ .. T Consensus 158 ----l~~~~~~~v~~p~~~l~~~~i~~~~l~~~------~~i~~~~~~~~r~~~~~~~~~~~~~~~-~~~~~~~----~~ 222 (291) T PRK10837 158 ----WLEDELVVFAAPDSPLARGPVTLEQLAAA------PWILRERGSGTREIVDYLLLSHLPRFE-MAMELGN----SE 222 (291) T ss_pred ----EECCCEEEEECCCCHHHCCCCCHHHHHCC------CEEEECCCCCHHHHHHHHHHHCCCCCC-EEEECCC----HH T ss_conf ----44141899987886653799998998079------879944998689999999985488653-0131297----99 Q ss_pred HHHHHHHCCCCEEEEEECCCHHH----HCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHH Q ss_conf 99999965993699941242232----0158514007865467786583544277778748899999874 Q gi|254780342|r 182 QIRRDQRNNIPAVFLSWEPHPIN----SDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKN 247 (309) Q Consensus 182 ~l~~A~~~~~~~v~~~wsp~~~~----~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~ 247 (309) .+..++..|-.+.+ -|...+ ....++.+..|... ....+..+.++ =..-.|.+..|++= T Consensus 223 ~i~~~v~~G~Gia~---lp~~~v~~~l~~g~L~~l~~~~~~---~~~~~~lv~~k-~~~ls~~~~~Fidf 285 (291) T PRK10837 223 AIKHAVRHGLGISC---LSRRVIADQLQAGTLVEVAVPLPR---LMRTLWRIHHR-QKHLSNALQRFLSY 285 (291) T ss_pred HHHHHHHHCCCEEE---CCHHHHHHHHHCCCEEEEECCCCC---CCCEEEEEEEC-CCCCCHHHHHHHHH T ss_conf 99999995991897---249999999878997997078888---61279999989-78659999999999 No 64 >PRK13731 conjugal transfer surface exclusion protein TraT; Provisional Probab=89.82 E-value=0.68 Score=25.61 Aligned_cols=60 Identities=17% Similarity=0.127 Sum_probs=36.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCC-----------------------CCCCCCEEEECCCCCHHHH--HHHHHHHHHHHCCCC Q ss_conf 957999999999998630358-----------------------3458837994258869999--999999999610698 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARD-----------------------ADSCTPVRFADTGWTDIAA--TTAMTSVILEEILGY 55 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~-----------------------~~~~~~I~ig~~~wte~~v--~a~i~~~iLE~~lGy 55 (309) |||++++.++.+.+++++|+. |.+.++|=+-.+|-++..+ +..-+...|+.+ || T Consensus 3 ~kk~~~~~~~~~~l~lsGC~a~~tai~krnL~VqTkMS~TIfLdP~s~ktVyv~vrNTS~~~~~~l~~~i~~~L~ak-GY 81 (243) T PRK13731 3 TKKLMMVALVSSTLALSGCGAMSTAIKKRNLEVKTQMSETIWLEPASERTVFLQIKNTSDKDMSGLQGKIADAVKAK-GY 81 (243) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCEECEECCCEEECCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHC-CE T ss_conf 37757999999999971568767787735752222000438866854557999985178772456789999999858-90 Q ss_pred EEEEEE Q ss_conf 489985 Q gi|254780342|r 56 KTNIKL 61 (309) Q Consensus 56 ~Ve~~~ 61 (309) .|.-.| T Consensus 82 ~iv~~P 87 (243) T PRK13731 82 QVVTSP 87 (243) T ss_pred EEECCH T ss_conf 986786 No 65 >pfam01379 Porphobil_deam Porphobilinogen deaminase, dipyromethane cofactor binding domain. Probab=89.05 E-value=1.8 Score=22.84 Aligned_cols=58 Identities=16% Similarity=0.077 Sum_probs=46.3 Q ss_pred CEEEECCCCCHHHHHHHHHHHHHHHCC-CCEEEEEEC----CH----------------HHHHHHHHCCCCEEEEEEEC Q ss_conf 379942588699999999999996106-984899856----40----------------57888851498139998404 Q gi|254780342|r 27 PVRFADTGWTDIAATTAMTSVILEEIL-GYKTNIKLL----AV----------------PVTFRSLKNKGIDIFMGYWY 84 (309) Q Consensus 27 ~I~ig~~~wte~~v~a~i~~~iLE~~l-Gy~Ve~~~~----~~----------------~~~~~al~~G~iD~~~e~w~ 84 (309) ||+||+-+..-..+.+++++..|++.. |+++|++.. +. ..+-++|.+|+||+.+..+- T Consensus 1 tIrIgtR~S~LAl~Qa~~v~~~L~~~~p~~~~ei~~i~T~GD~~~~~~l~~~ggkG~Ftkele~aLl~~~iDiAVHS~K 79 (213) T pfam01379 1 TIRIGTRKSKLALIQTNHVIEKLKELCPDIEVEIVIIKTTGDKILDKPLAKIGGKGLFTKELEEALLNGEIDLAVHSLK 79 (213) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCCCCCCCCEEEHHHHHHHHHCCCCCEEEEECC T ss_conf 9699958999999999999999998689986899999787844366653116984142699999997098787876046 No 66 >PRK12450 foldase protein PrsA; Reviewed Probab=88.97 E-value=1.4 Score=23.47 Aligned_cols=62 Identities=18% Similarity=0.181 Sum_probs=32.5 Q ss_pred CHHHHHHHHHHHHHH-HHHCCCCCC--------CCCEEEEC-----CC-CCHHHHH-HHHHHHHHHHCCCCEEEEEEC Q ss_conf 957999999999998-630358345--------88379942-----58-8699999-999999996106984899856 Q gi|254780342|r 1 MYKILAVCLFLTTFS-ISYARDADS--------CTPVRFAD-----TG-WTDIAAT-TAMTSVILEEILGYKTNIKLL 62 (309) Q Consensus 1 mkk~~~~~~~~~~~~-~~~~~~~~~--------~~~I~ig~-----~~-wte~~v~-a~i~~~iLE~~lGy~Ve~~~~ 62 (309) |||+++.+++++.++ +++|++.++ +++|+... ++ -.++.++ .-|+..+||+.+|-+|.-... T Consensus 4 mKK~~~~~~~~~svl~LaAC~s~~~~~~v~t~kgg~IT~~efy~~~K~t~~gq~~L~~mi~~KvLe~kYg~kVsdkeV 81 (309) T PRK12450 4 MNKLITGVVTLATVVTLSACQSSHNNTKLVSMKGDTITVSDFYNETKNTELAQKAMLSLVISRVFETQYANKVSDKEV 81 (309) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHH T ss_conf 999999999999999998605899983689835992409999999853415899999999999999985351788999 No 67 >PRK09906 DNA-binding transcriptional regulator HcaR; Provisional Probab=88.94 E-value=1.8 Score=22.79 Aligned_cols=194 Identities=6% Similarity=-0.001 Sum_probs=105.4 Q ss_pred CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEE Q ss_conf 58837994258869999999999999610698489985640578888514981399984046413566665530470230 Q gi|254780342|r 24 SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLV 103 (309) Q Consensus 24 ~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~ 103 (309) ....++||...-....++..+++...+..=+.++++..+++..+...|.+|++|+..-. .|.... .+... T Consensus 88 ~~~~l~ig~~~~~~~~~lp~~l~~f~~~~P~v~v~l~~~~~~~~~~~l~~g~~D~~i~~-~~~~~~---------~l~~~ 157 (296) T PRK09906 88 EDRQLTIGFVPSAEVNLLPKVLPMFRLRHPDTLIELVSLINTQQEEKLRRGELDVGFMR-HPVYSD---------EIDYL 157 (296) T ss_pred CCCEEEEEECHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHCCCCCEEEEE-CCCCCC---------CEEEE T ss_conf 77138832122569999999999987408986348997580899999986997679873-688998---------81899 Q ss_pred CCCCCCCEEEEEEEHHHHHC--CCCCHHHHHHHHHHHHHCCCCCCCC--CCCCHHHHHHHHHCCCCCCCEEEECCCCCCH Q ss_conf 13566853567863446412--3455356555566544201357854--3200555876640354522100103665201 Q gi|254780342|r 104 AENLQGAKYMLAVNDVGFAL--GIKSYQDIAKYKKELGAKIYGIEPG--NEGNQRILDMINNNKFSLKGFRLIEASELAS 179 (309) Q Consensus 104 ~~~~~~~~~~~~v~~~~~~~--~i~~~~dl~~~~~~~~~~~~g~~~~--~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~ 179 (309) ........++++..-+.. .-.++++|... ..+..++. ...+..........+..... .....+ T Consensus 158 --~l~~~~~~~~~~~~hpla~~~~i~~~dl~~~------~~i~~~~~~~~~~~~~~~~~~~~~g~~p~~-~~~~~~---- 224 (296) T PRK09906 158 --ELLDEPLVVVLPVDHPLAHEKEITAAQLDGV------NFISPDPAQSGSLAPIIKAWFAQHNSQPNI-VQVATN---- 224 (296) T ss_pred --EEECCEEEEEEECCCCCCCCCCCCHHHHCCC------CEEEECCCCCCCHHHHHHHHHHHCCCCCCE-EEEECC---- T ss_conf --8504607999828981005999899997599------879636654760999999999977999746-999898---- Q ss_pred HHHHHHHHHCCCCEEEEEECCCHHH--HCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC Q ss_conf 8999999965993699941242232--01585140078654677865835442777787488999998740 Q gi|254780342|r 180 FSQIRRDQRNNIPAVFLSWEPHPIN--SDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI 248 (309) Q Consensus 180 ~a~l~~A~~~~~~~v~~~wsp~~~~--~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~ 248 (309) +..+...+..|-.+.+ -|.+.. ...+++.++-+... |...+..+.|++- ..|.+..|++-+ T Consensus 225 ~~~~~~~v~~G~Gi~~---lP~~~~~~~~~~l~~~pl~~~~---~~~~~~l~~r~~~--~spa~~~Fi~~l 287 (296) T PRK09906 225 ILVTMNLVGMGLGCTI---IPGYMNNFNTGQVVFRPLAGNV---PSIALLMAWKKGE--MKPALRDFIAIV 287 (296) T ss_pred HHHHHHHHHHCCEEEE---CHHHHHHHCCCCEEEEECCCCC---CEEEEEEEECCCC--CCHHHHHHHHHH T ss_conf 9999999994989998---4587731068999999889998---7028999987999--999999999999 No 68 >pfam00496 SBP_bac_5 Bacterial extracellular solute-binding proteins, family 5 Middle. The borders of this family are based on the PDBSum definitions of the domain edges for the periplasmic oligopeptide-binding protein from Salmonella typhimurium. Probab=87.88 E-value=2.1 Score=22.35 Aligned_cols=25 Identities=12% Similarity=0.298 Sum_probs=12.3 Q ss_pred EEEEEC-CHHHHHHHHHCCCCEEEEE Q ss_conf 899856-4057888851498139998 Q gi|254780342|r 57 TNIKLL-AVPVTFRSLKNKGIDIFMG 81 (309) Q Consensus 57 Ve~~~~-~~~~~~~al~~G~iD~~~e 81 (309) |++... +......++.+|+||+... T Consensus 148 I~~~~~~~~~~~~~~l~~G~iD~~~~ 173 (361) T pfam00496 148 ITFRVIPDDAARLAALQSGEVDDVAG 173 (361) T ss_pred EEEEECCCCHHHHHHHHCCCEEEECC T ss_conf 99998688389999987698008846 No 69 >TIGR00363 TIGR00363 lipoprotein, YaeC family; InterPro: IPR004478 This family of putative lipoproteins contains a consensus site for lipoprotein signal sequence cleavage. Escherichia coli contains several lipoproteins in addition to the major outer membrane lipoprotein. Members of this family are often localized in the cytoplasmic membrane and are attached to it by a lipid anchor.. Probab=87.61 E-value=0.19 Score=29.18 Aligned_cols=219 Identities=11% Similarity=0.108 Sum_probs=124.7 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHH-HHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEE Q ss_conf 4588379942588699999999999996106984899856405-788885149813999840464135666655304702 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVP-VTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIK 101 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~-~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~ 101 (309) .....|++|...-.|+.+ ++|.+.+-++++|.+|+++..+.. .--+|+..|++|.-.=--.|..+....... ...++ T Consensus 20 ~~~~~ikvGv~~G~e~~v-ae~a~kvak~kyGldv~lv~f~~~~lPn~a~~~G~~dan~~qh~Pyl~~~~~~~~-~~~lv 97 (264) T TIGR00363 20 AAPKKIKVGVISGAEQQV-AEVAKKVAKEKYGLDVELVEFNDYVLPNEAVSKGDLDANAFQHKPYLDQDAKAKN-GYKLV 97 (264) T ss_pred HHHHHEEEEEEECCCHHH-HHHHHHHHHHHCCCEEEEEECCCCCCCHHHHCCCCCCHHHHHCCCCHHHHHHHHH-CCCEE T ss_conf 100010477861770348-9999987433216316886114520551231036510244422752125665410-24168 Q ss_pred EECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHH---------HHHHCC-------CCCCCCCCCCHHHHHHHHHCCCC Q ss_conf 30135668535678634464123455356555566---------544201-------35785432005558766403545 Q gi|254780342|r 102 LVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKK---------ELGAKI-------YGIEPGNEGNQRILDMINNNKFS 165 (309) Q Consensus 102 ~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~---------~~~~~~-------~g~~~~~~~~~~~~~~~~~~~l~ 165 (309) .+ ...+++|-......|+++..+.+-.. .++..+ +-...+.+.-....+..+. T Consensus 98 ~v-------GntfvyP~aGyskkik~~~~~~~G~~v~~PndPtn~Gr~l~ll~k~~li~lk~~~~ll~t~ld~~~n---- 166 (264) T TIGR00363 98 AV-------GNTFVYPLAGYSKKIKSVDELQDGAKVAVPNDPTNLGRALLLLQKEGLIKLKDGKGLLPTVLDIVEN---- 166 (264) T ss_pred EE-------ECEEEECCCHHHHHHHHHHHHHCCCEEEECCCCCCCCCEEEEEECCCCEEEECCCCHHHHHHHHHCC---- T ss_conf 87-------0502311210346664288651587797358863003011122100001210113035666655318---- Q ss_pred CCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCC----HHHHCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHH Q ss_conf 221001036652018999999965993699941242----2320158514007865467786583544277778748899 Q gi|254780342|r 166 LKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPH----PINSDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNI 241 (309) Q Consensus 166 ~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~----~~~~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~ 241 (309) .....+.+... +.+.+++.+.+..+..+-+|. ++....|-+++||-. .|| +..++...=-++.-.+ T Consensus 167 P~~l~i~e~~~----~~~~r~ldd~~~~~a~~~~ty~~q~Gl~~~~d~~f~edk~-----sPy-vn~~v~r~dnk~~~~v 236 (264) T TIGR00363 167 PKKLKIVELEA----AQLPRALDDAKVDLAVINTTYASQVGLSPAKDGVFVEDKE-----SPY-VNIIVAREDNKDAENV 236 (264) T ss_pred CCCEEEEEECC----HHHHHHCCCCCEEEEEEHHHHHHHHCCCCCCCEEEEEECC-----CCE-EEEEEEECCCCCHHHH T ss_conf 54116886130----1212220543012332001234440665021605886078-----875-8999851444006899 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHCC Q ss_conf 9998740899899999999998459 Q gi|254780342|r 242 SRLLKNIKFSVALENEMMKLILNNK 266 (309) Q Consensus 242 ~~~L~~~~ls~~~~~~l~~~v~~~g 266 (309) .+|++.+ .++++.+-...+..+| T Consensus 237 ~~f~~~y--q~~ev~~~a~~~f~~G 259 (264) T TIGR00363 237 KEFVQSY--QSEEVYKAAEKVFNGG 259 (264) T ss_pred HHHHHHC--CCHHHHHHHHHHHHCC T ss_conf 9999840--6577999999886077 No 70 >PRK09755 putative binding protein; Provisional Probab=86.89 E-value=2.4 Score=21.99 Aligned_cols=28 Identities=7% Similarity=0.000 Sum_probs=19.8 Q ss_pred CCCCCEEEEC------------CCCCHHHHHHHHHHHHHH Q ss_conf 4588379942------------588699999999999996 Q gi|254780342|r 23 DSCTPVRFAD------------TGWTDIAATTAMTSVILE 50 (309) Q Consensus 23 ~~~~~I~ig~------------~~wte~~v~a~i~~~iLE 50 (309) ..+++++++. .+.++..|+.+|+--++. T Consensus 30 ~~~~~l~~~~~~dp~tLDP~~~~d~~s~~v~~~iye~Lv~ 69 (535) T PRK09755 30 APQQVFRYNNHSDPGTLDPQKVEENTAAQIVLDLFEGLVW 69 (535) T ss_pred CCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHEE T ss_conf 7777899937999897171405773699999998646401 No 71 >PRK11151 DNA-binding transcriptional regulator OxyR; Provisional Probab=86.79 E-value=1.7 Score=23.02 Aligned_cols=195 Identities=13% Similarity=0.054 Sum_probs=103.0 Q ss_pred CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEE Q ss_conf 58837994258869999999999999610698489985640578888514981399984046413566665530470230 Q gi|254780342|r 24 SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLV 103 (309) Q Consensus 24 ~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~ 103 (309) ...+|+||.......+++..++....++.=+.++++..+++..+.+.|.+|++|+..-...+..... ... T Consensus 89 ~~G~lrig~~~~~~~~llp~~l~~f~~~~P~v~i~l~~~~~~~l~~~l~~g~~D~~i~~~~~~~~~~----------~~~ 158 (305) T PRK11151 89 MSGPLHIGLIPTVGPYLLPHIIPMLHQTFPKLEMYLHEAQTHQLLAQLDSGKLDCAILALVKESEAF----------IEV 158 (305) T ss_pred CCCEEEEEEHHHHHHHCCCHHHHHHHHCCCCCEEEEEECCCHHHHHHHHCCCCCEEEEECCCCCCCC----------EEE T ss_conf 7641431102223332382779998623898379999887577787865788537998458899872----------687 Q ss_pred CCCCCCCEEEEEEEHHHHHC--CCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHH Q ss_conf 13566853567863446412--3455356555566544201357854320055587664035452210010366520189 Q gi|254780342|r 104 AENLQGAKYMLAVNDVGFAL--GIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFS 181 (309) Q Consensus 104 ~~~~~~~~~~~~v~~~~~~~--~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a 181 (309) ........++++..-+-. .--++++|.+ ...+..+++...+..........+.... ..+...+ +. T Consensus 159 --~l~~e~l~lv~~~~hpla~~~~i~l~~L~~------~~~i~~~~~~~~r~~~~~~~~~~g~~~~-~~~~~~~----~~ 225 (305) T PRK11151 159 --PLFDEPMLLAVYEDHPWANRDRVPMSDLAG------EKLLMLEDGHCLRDQAMGFCFEAGADED-THFRATS----LE 225 (305) T ss_pred --EECCCCEEEEEECCCCCCCCCCCCHHHHCC------CCEEEECCCCCHHHHHHHHHHHCCCCCC-EEEEEEC----HH T ss_conf --620462799995898433699989999769------9889968998599999999997799987-3899944----79 Q ss_pred HHHHHHHCCCCEEEEEECCCHHHH----CCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC Q ss_conf 999999659936999412422320----1585140078654677865835442777787488999998740 Q gi|254780342|r 182 QIRRDQRNNIPAVFLSWEPHPINS----DLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI 248 (309) Q Consensus 182 ~l~~A~~~~~~~v~~~wsp~~~~~----~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~ 248 (309) .+.+.+..|-.+-+ -|..... .-+++.+.-.... |.-.+..+.|++- --.|.+..|++-+ T Consensus 226 ~i~~lV~~G~Gv~i---lp~~~v~~~~~~~~l~~~pl~~~~---~~r~i~lv~r~~~-~ls~~~~~~~e~i 289 (305) T PRK11151 226 TLRNMVAAGSGITL---LPALAVPNERKRDGVCYLPCIKPE---PRRTIGLVYRPGS-PLRSRYEQLAEAI 289 (305) T ss_pred HHHHHHHHCCCEEE---CCHHHHHHHCCCCCEEEEECCCCC---CEEEEEEEEECCC-CCCHHHHHHHHHH T ss_conf 99999992996898---359998622126988999888999---7369999998909-8789999999999 No 72 >COG3181 Uncharacterized protein conserved in bacteria [Function unknown] Probab=86.13 E-value=2.7 Score=21.73 Aligned_cols=188 Identities=13% Similarity=0.048 Sum_probs=86.9 Q ss_pred CHHHHHHHHHHHHHHHH--HCCCCCCCCCEEEE--CCCCCHHHHHHHHHHHHHHHCCCCEEEEE--EC-CHHHHHHHHHC Q ss_conf 95799999999999863--03583458837994--25886999999999999961069848998--56-40578888514 Q gi|254780342|r 1 MYKILAVCLFLTTFSIS--YARDADSCTPVRFA--DTGWTDIAATTAMTSVILEEILGYKTNIK--LL-AVPVTFRSLKN 73 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~--~~~~~~~~~~I~ig--~~~wte~~v~a~i~~~iLE~~lGy~Ve~~--~~-~~~~~~~al~~ 73 (309) |++++..+.+.+.++++ .++..--.+||+|- ...=-+--..+-.+++.|++++|-++.+. +| +..+.+..+.. T Consensus 1 ~~~~~~~~~~~a~~~~~~~~~a~~~P~~~it~Ivp~~~GGg~D~~aR~~~~~l~k~lg~~v~V~N~pGagG~ia~~~va~ 80 (319) T COG3181 1 MKALLVPLALAATGLLGASAAAQAYPERPITIIVPAAAGGGTDQTARALAESLSKELGQPVVVDNKPGAGGAIAAGAVAK 80 (319) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHH T ss_conf 91378889999988613332245588787589996589875889999999999998089779995588731688999984 Q ss_pred ---CCCEEEEEEECCCCH-HHHHHHHHCCCEEEECC--CCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCC Q ss_conf ---981399984046413-56666553047023013--566853567863446412345535655556654420135785 Q gi|254780342|r 74 ---KGIDIFMGYWYPSLE-KFIAPYLEEGSIKLVAE--NLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEP 147 (309) Q Consensus 74 ---G~iD~~~e~w~p~~~-~~~~~~~~~~~~~~~~~--~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~ 147 (309) |.=....-...+... ..+.+ .....+.+..+ ........++|+... ..+++.+|.++.+.-.+.+++... T Consensus 81 a~pG~t~~l~~~~~~~~~~~~~~~-~~~~~~~D~~pva~v~~~p~~l~v~~~s---~~~t~~dlv~~~k~~p~~v~~~~~ 156 (319) T COG3181 81 AAPGYTILLIAGSTPALLLPILGG-LPYYKLKDFTPVASLVSDPGVLVVRADS---PYKTLKDLVAYAKADPGSVIGGGS 156 (319) T ss_pred CCCCCCEEEEECCCCCCCCHHHCC-CCCCCHHHCEEHHHEECCCCEEEEECCC---CCCCHHHHHHHHHHCCCEEEECCC T ss_conf 589984488734764322154426-8888544230033111366159981788---736699999999859980895478 Q ss_pred CCCCCHHHH--HHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 432005558--766403545221001036652018999999965993699941 Q gi|254780342|r 148 GNEGNQRIL--DMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSW 198 (309) Q Consensus 148 ~~~~~~~~~--~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~w 198 (309) +.+..+... ...+..+..+. +....++..++ .++-.|++++...- T Consensus 157 g~Gs~dhl~~~~~~k~~Gi~~~-~Vpy~g~gea~-----taLlgg~v~a~~~~ 203 (319) T COG3181 157 GLGSADHLAGALFAKAAGIKIT-YVPYKGGGEAL-----TALLGGHVDAGSTN 203 (319) T ss_pred CCCCHHHHHHHHHHHHHCCCEE-EEEECCCCHHH-----HHHHCCCEEEEECC T ss_conf 8895889999999997389625-77416864878-----99860866544367 No 73 >PRK09474 malE maltose ABC transporter periplasmic protein; Reviewed Probab=86.04 E-value=2.7 Score=21.70 Aligned_cols=79 Identities=11% Similarity=-0.052 Sum_probs=45.0 Q ss_pred CHHHHHHHHH---HHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCH--HHHHHHHHCCC Q ss_conf 9579999999---999986303583458837994258869999999999999610698489985640--57888851498 Q gi|254780342|r 1 MYKILAVCLF---LTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAV--PVTFRSLKNKG 75 (309) Q Consensus 1 mkk~~~~~~~---~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~--~~~~~al~~G~ 75 (309) |||.+..|.+ +.++++++++.+.+..+++|=...=.+...+.++.+.-=| +.|++|+++..+. .-+..++.+|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ltvW~~~~~~~~~l~~~~~~Fe~-~~gi~V~v~~~~~~~~kl~~a~~~G~ 79 (394) T PRK09474 1 IKKGLRILALSALTTLMLSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEK-DTGIKVTVEHPDKLEEKFPQVAATGD 79 (394) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHHH-CCCCEEEEECCHHHHHHHHHHHCCCC T ss_conf 94158899999999999736432456678699995797056999999999745-40977999710268999997420799 Q ss_pred -CEEEE Q ss_conf -13999 Q gi|254780342|r 76 -IDIFM 80 (309) Q Consensus 76 -iD~~~ 80 (309) -|++. T Consensus 80 ~PDv~~ 85 (394) T PRK09474 80 GPDIIF 85 (394) T ss_pred CCCEEE T ss_conf 995899 No 74 >PRK03095 prsA peptidylprolyl isomerase; Reviewed Probab=85.56 E-value=2.8 Score=21.57 Aligned_cols=56 Identities=18% Similarity=0.104 Sum_probs=28.4 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCC------------HHHHHHHH-HHHHHHHCCCCEEE Q ss_conf 957999999999998630358345883799425886------------99999999-99999610698489 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWT------------DIAATTAM-TSVILEEILGYKTN 58 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wt------------e~~v~a~i-~~~iLE~~lGy~Ve 58 (309) |||.++++.+++.+++++|+..++..-++.-.-+-| ++.++.+| ...+|++. |+|. T Consensus 1 MKK~~la~~~~svl~LaaC~~~~~~~vat~kgg~IT~~e~y~~~k~~~g~~~l~~mi~~kvl~~~--ykVs 69 (287) T PRK03095 1 MKKAMLALAATSVIALSACGTSSSDKIVTSKAGDITKDEFYEQMKTQAGKQVLNNMVMEKVLIKN--YKVE 69 (287) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--CCCC T ss_conf 90789999999999998516899875898469827299999999988899999999999999996--4878 No 75 >PRK00072 hemC porphobilinogen deaminase; Reviewed Probab=85.09 E-value=3 Score=21.41 Aligned_cols=58 Identities=14% Similarity=0.017 Sum_probs=46.1 Q ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHCC-CCEEEEEE----CCH----------------HHHHHHHHCCCCEEEEEEE Q ss_conf 8379942588699999999999996106-98489985----640----------------5788885149813999840 Q gi|254780342|r 26 TPVRFADTGWTDIAATTAMTSVILEEIL-GYKTNIKL----LAV----------------PVTFRSLKNKGIDIFMGYW 83 (309) Q Consensus 26 ~~I~ig~~~wte~~v~a~i~~~iLE~~l-Gy~Ve~~~----~~~----------------~~~~~al~~G~iD~~~e~w 83 (309) ++|+||+-...=..+.++++...|++.. ++++|++. |+. .-+-++|.+|+||+.+..+ T Consensus 3 ~~irIgtR~S~LAl~Qa~~V~~~L~~~~p~~~~ei~~i~T~GD~~~~~pL~~iGgkG~Ftkele~aLl~g~iDiAVHSl 81 (300) T PRK00072 3 RKLRIGTRGSKLALWQAEWVKDRLKAAHPGLEVELVPIKTKGDRILDVPLAKIGGKGLFVKELEEALLDGEIDIAVHSL 81 (300) T ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCHHHCCCCEEEHHHHHHHHHHCCCCEEEEEC T ss_conf 7669997898999999999999999758997189999964674456875788289503589999999828755255413 No 76 >PRK10355 xylF D-xylose transporter subunit XylF; Provisional Probab=84.82 E-value=3.1 Score=21.33 Aligned_cols=77 Identities=14% Similarity=0.088 Sum_probs=37.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEE--CCCCCHHHHH--HHHHHHHHHHCCCCEEEEEEC--CHHH---HHHHH Q ss_conf 9579999999999986303583458837994--2588699999--999999996106984899856--4057---88885 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFA--DTGWTDIAAT--TAMTSVILEEILGYKTNIKLL--AVPV---TFRSL 71 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig--~~~wte~~v~--a~i~~~iLE~~lGy~Ve~~~~--~~~~---~~~al 71 (309) |||++++++++++++++.+ .++.++|| .++.+..+.. .+-+..-.+ .+|++|.+... +... ..+.+ T Consensus 3 ~k~~~~~~~~~~~~~~~~a----~a~~~~IGv~~~~~~~~~~~~~~~~~~~~Ak-e~G~~v~v~dA~~D~~~Qi~qIe~~ 77 (330) T PRK10355 3 IKNILLTLCAALLLTSVAA----HAKEVKIGMAIDDLRLERWQKDRDIFVKKAE-SLGAKVFVQSANGNEETQMSQIENM 77 (330) T ss_pred HHHHHHHHHHHHHHHCCHH----HHCCCEEEEEECCCCCHHHHHHHHHHHHHHH-HCCCEEEEECCCCCHHHHHHHHHHH T ss_conf 6899999999999844611----2138679999068888689999999999999-7599899971899989999999999 Q ss_pred HCCCCEEEEEE Q ss_conf 14981399984 Q gi|254780342|r 72 KNKGIDIFMGY 82 (309) Q Consensus 72 ~~G~iD~~~e~ 82 (309) -+-.+|...-. T Consensus 78 I~qgvdaIiv~ 88 (330) T PRK10355 78 INRGVDVLVII 88 (330) T ss_pred HHCCCCEEEEE T ss_conf 98599999996 No 77 >PRK03601 transcriptional regulator HdfR; Provisional Probab=84.71 E-value=3.1 Score=21.30 Aligned_cols=65 Identities=5% Similarity=0.062 Sum_probs=51.0 Q ss_pred CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCH Q ss_conf 58837994258869999999999999610698489985640578888514981399984046413 Q gi|254780342|r 24 SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLE 88 (309) Q Consensus 24 ~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~ 88 (309) ....++||...-.....+..++..+.++.-+.+.++..+++..+...|.+|++|+..-.-.|... T Consensus 87 ~~~~l~ig~~~~~~~~~l~~~l~~~~~~~P~v~l~l~~~~~~~~~~~L~~~~~Dl~i~~~~p~~~ 151 (275) T PRK03601 87 RHNEFSIGASASLWECMLNQWLGRLYQNQEELQFEARIAQRQSLVKQLHERQLDLLITTEAPKMD 151 (275) T ss_pred CCCCEEEEEEHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCCEEEECCCCCCC T ss_conf 77609999775778999999999999978896799998898999999977993589977799888 No 78 >PRK01742 tolB translocation protein TolB; Provisional Probab=84.38 E-value=0.42 Score=27.00 Aligned_cols=23 Identities=17% Similarity=0.291 Sum_probs=18.3 Q ss_pred HHHHCCCCEEEEEECCCHHHHCC Q ss_conf 99965993699941242232015 Q gi|254780342|r 185 RDQRNNIPAVFLSWEPHPINSDL 207 (309) Q Consensus 185 ~A~~~~~~~v~~~wsp~~~~~~~ 207 (309) .-.+.+++++-+.|+||+.--.| T Consensus 198 ~l~~~~~~ilsP~wSPdg~~iaY 220 (430) T PRK01742 198 IVNRSAQPIMSPAWSPDGQRLAY 220 (430) T ss_pred ECCCCCCCEEEEEECCCCCEEEE T ss_conf 85357972572018589988999 No 79 >PRK11482 putative DNA-binding transcriptional regulator; Provisional Probab=84.04 E-value=3.3 Score=21.11 Aligned_cols=64 Identities=9% Similarity=0.024 Sum_probs=47.2 Q ss_pred CCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECC Q ss_conf 583458837994258869999999999999610698489985640578888514981399984046 Q gi|254780342|r 20 RDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYP 85 (309) Q Consensus 20 ~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p 85 (309) ...+...+++|+..++.+..++..++..+-+..- ++.+.........+.|.+|++|+....-.. T Consensus 60 ~~~~~~~~~~i~~~~~~~~~~lp~l~~~l~~~aP--~i~l~~~~~~~~~~~L~~~~~Dl~i~~~~~ 123 (266) T PRK11482 60 GSYDKQRTITIATTPSVGALVLPVIYQAIKPHYP--QLLLRNIPISDAENQLSQFQTDLIIDTHSC 123 (266) T ss_pred CCCCCCEEEEEEECHHHHHHHHHHHHHHHHHHCC--CEEEEECCCCCHHHHHHCCCCCEEEECCCC T ss_conf 8888652599997588999999999999997799--869995686578899856998889856888 No 80 >pfam03480 SBP_bac_7 Bacterial extracellular solute-binding protein, family 7. This family of proteins are involved in binding extracellular solutes for transport across the bacterial cytoplasmic membrane. This family includes a C4-dicarboxylate-binding protein and the sialic acid-binding protein SiaP. The structure of the SiaP receptor has revealed an overall topology similar to ATP binding cassette ESR (extracytoplasmic solute receptors) proteins. Upon binding of sialic acid, SiaP undergoes domain closure about a hinge region and kinking of an alpha-helix hinge component. Probab=83.20 E-value=3.6 Score=20.89 Aligned_cols=221 Identities=15% Similarity=0.124 Sum_probs=103.7 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEE------CCHHHHHHHHHCCCCEEEEEEE--CCCCHH-------------------- Q ss_conf 999999999999610698489985------6405788885149813999840--464135-------------------- Q gi|254780342|r 38 IAATTAMTSVILEEILGYKTNIKL------LAVPVTFRSLKNKGIDIFMGYW--YPSLEK-------------------- 89 (309) Q Consensus 38 ~~v~a~i~~~iLE~~lGy~Ve~~~------~~~~~~~~al~~G~iD~~~e~w--~p~~~~-------------------- 89 (309) .....+-.++.+|+..|-++++.. ++...+++++++|.+|+..-.- .+.... T Consensus 10 ~~~~~~~fa~~V~e~T~G~v~i~~~~~g~L~~~~e~~~~v~~G~~d~~~~~~~~~~~~~p~~~~~~lP~~~~~~~~~~~~ 89 (285) T pfam03480 10 KGKAAEKFAKLVEEKTGGRLKIEVYPNSQLGGDREVIEALKNGTVDIAAPSSGYFGGLPPEIGVFDLPFLFDDEQHLERV 89 (285) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCHHHHHHHHHCCCCCEEEECCHHHCCCCHHHHHHHCCCCCCCHHHHHHH T ss_conf 89999999999998779908999955987588799999997799488976402221257123245275467989999999 Q ss_pred -------HHHHHHHCCCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHC Q ss_conf -------6666553047023013566853567863446412345535655556654420135785432005558766403 Q gi|254780342|r 90 -------FIAPYLEEGSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNN 162 (309) Q Consensus 90 -------~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 162 (309) .+.+...+.++..++....+....+. ....+.+.+||+ ++.+-.+ ........++.. T Consensus 90 ~~~~~~~~~~~~~~~~gv~~L~~~~~g~~~~~~-----~~~pi~~~~Dlk-------G~KiR~~----~~~~~~~~~~al 153 (285) T pfam03480 90 LDGEFGEALLKSLEAKGLKGLAFWNNGFRQFTN-----NKKPINSPEDLK-------GLKLRVP----PSPLLGEVFKAL 153 (285) T ss_pred HHCHHHHHHHHHHHHCCCEEEEEECCCCEEEEE-----CCCCCCCHHHHC-------CCEEEEC----CCHHHHHHHHHC T ss_conf 958478999999997696899862688457522-----788889979967-------9828835----867999999985 Q ss_pred CCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCCC----EEECCCCCCCCCCHHHHHHHHCHHHHHHC Q ss_conf 54522100103665201899999996599369994124223201585----14007865467786583544277778748 Q gi|254780342|r 163 KFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDLNI----HYLPGGEEISGFGEASVYTVVRSDYLDKC 238 (309) Q Consensus 163 ~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl----~~Ledp~~~~~~p~~~~~~vv~~~~~~~~ 238 (309) |. ..+..+ +.+++.|+++|-++-.. +.+.+. ..+++ ++.-.+. +++.. ...++++++-+.- T Consensus 154 Ga-----~pv~~~----~~evy~aLq~GvvDg~~-~~~~~~-~~~~~~ev~ky~~~~~--~~~~~--~~~~~n~~~w~~L 218 (285) T pfam03480 154 GA-----NPTPMP----FGEVYTALQTGVVDGQE-NPLSNI-YSQKFYEVQKYLTETN--HGYLD--YLVVMNKDTWDSL 218 (285) T ss_pred CC-----CCCCCC----HHHHHHHHHHCCEEEEE-CCCHHH-HHHCHHHHCCEEEECC--CHHHH--HHHHHHHHHHHCC T ss_conf 99-----740157----58899999708462675-371356-5404121133672256--33567--8999249999649 Q ss_pred H-HHHHHHHHC---------CCCHHHHHHHHHHHHHCCCC----HHHHHHHHHHHHHHHHHHHHC Q ss_conf 8-999998740---------89989999999999845999----899999999985999998754 Q gi|254780342|r 239 P-NISRLLKNI---------KFSVALENEMMKLILNNKQD----RQFVGRTMLRTHPDLLKNWLI 289 (309) Q Consensus 239 P-~~~~~L~~~---------~ls~~~~~~l~~~v~~~g~~----~~~vA~~wL~~n~~~~~~Wl~ 289 (309) | +..++|.+. ....+...+........|.. .++.-.+|-+.-...|++|.+ T Consensus 219 ~~~~q~~i~~a~~~a~~~~~~~~~~~~~~~~~~~~~~g~~v~~~s~~e~~~~~~a~~pv~~~~~~ 283 (285) T pfam03480 219 PPDLQAILEEAAKEATELQRKLAEELLKEKELLKAAGKTEVIVLTPEDREAFREALKPVYKEFEK 283 (285) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999998899999999789969999999999999996999999985 No 81 >pfam06543 Lac_bphage_repr Lactococcus bacteriophage repressor. This family represents the C-terminus of Lactococcus bacteriophage repressor proteins. Probab=82.78 E-value=0.34 Score=27.54 Aligned_cols=30 Identities=20% Similarity=0.440 Sum_probs=26.9 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHC Q ss_conf 9999859999987547545677137999998744 Q gi|254780342|r 275 TMLRTHPDLLKNWLIGVTTFDGQDPSRQLERFMN 308 (309) Q Consensus 275 ~wL~~n~~~~~~Wl~~~~~~~~~~~~~~~~~~~~ 308 (309) +.+..|+--|++|++ +||++-++-||.+|+ T Consensus 11 dlvDD~kvdWd~WVS----FdG~PlsDeVK~amk 40 (49) T pfam06543 11 DLVDDNKVDWDEWVS----FDGRPLSDEVKEAMK 40 (49) T ss_pred HHCCCCCCCHHHHEE----CCCCCCCHHHHHHHH T ss_conf 964676567477310----189337489999999 No 82 >PRK09701 D-allose transporter subunit; Provisional Probab=81.93 E-value=4 Score=20.59 Aligned_cols=96 Identities=15% Similarity=-0.044 Sum_probs=46.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHH--HHHHHHHHHCCCCEEEEEEC----CHH---HHHHHH Q ss_conf 957999999999998630358345883799425886999999--99999996106984899856----405---788885 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATT--AMTSVILEEILGYKTNIKLL----AVP---VTFRSL 71 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a--~i~~~iLE~~lGy~Ve~~~~----~~~---~~~~al 71 (309) |||++.+++.+++.++.++ ++....+|-+-.++.+..+... +=.+..-+ .+|++|++... +.. .+.+.+ T Consensus 1 m~k~~~~~~~~~~~~~~~~-~a~a~~~iavi~k~~~npf~~~~~~Ga~~aAk-~~G~~v~v~~~~~~~D~~~Qi~~Ie~~ 78 (311) T PRK09701 1 MNKYLKYFSGTLVGLMLST-SAFAAAEYAVVLKTLSNPFWVDMKKGIEDEAK-TLGVSVDIFASPSEGDFQSQLQLFEDL 78 (311) T ss_pred CCHHHHHHHHHHHHHHHHH-HHHHCCCEEEEECCCCCHHHHHHHHHHHHHHH-HHCCEEEEEECCCCCCHHHHHHHHHHH T ss_conf 9369999999999999888-77436849999688899899999999999999-869979999279878999999999999 Q ss_pred HCCCCEEEEEE-ECC-CCHHHHHHHHHCC Q ss_conf 14981399984-046-4135666655304 Q gi|254780342|r 72 KNKGIDIFMGY-WYP-SLEKFIAPYLEEG 98 (309) Q Consensus 72 ~~G~iD~~~e~-w~p-~~~~~~~~~~~~~ 98 (309) ..-.+|...-. ..+ ......++....+ T Consensus 79 I~~gvdaIii~p~d~~a~~~~i~~A~~aG 107 (311) T PRK09701 79 SNKNYKGIAFAPLSSVNLVMPVARAWKKG 107 (311) T ss_pred HHCCCCEEEEECCCHHHHHHHHHHHHHCC T ss_conf 97599999991898778899999999779 No 83 >PRK10653 D-ribose transporter subunit RbsB; Provisional Probab=81.00 E-value=4.3 Score=20.38 Aligned_cols=79 Identities=11% Similarity=0.103 Sum_probs=45.3 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHH--HHHHHHHHHCCCCEEEEEEC--CHH---HHHHHHHC Q ss_conf 957999999999998630358345883799425886999999--99999996106984899856--405---78888514 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATT--AMTSVILEEILGYKTNIKLL--AVP---VTFRSLKN 73 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a--~i~~~iLE~~lGy~Ve~~~~--~~~---~~~~al~~ 73 (309) |||+..+++.++++.+ .++.+...+||-+-.++.+...... .=+...+++ .||++-+... +.. ...+.+.. T Consensus 3 ~~k~~~~~~~~~~~~~-~~a~a~a~~TIgvivp~i~npff~~v~~gie~~a~~-~Gy~l~v~~s~~d~~~q~~~i~~li~ 80 (295) T PRK10653 3 MKKLATLVSAVALSAT-VSANAMAKDTIALVVSTLNNPFFVSLKDGAQKEADK-LGYNLVVLDSQNNPAKELANVQDLTV 80 (295) T ss_pred HHHHHHHHHHHHHHHH-CCCHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHHH-CCCEEEEEECCCCHHHHHHHHHHHHH T ss_conf 0799999999987652-121775299799994898797999999999999997-69989998399999999999999997 Q ss_pred CCCEEEEE Q ss_conf 98139998 Q gi|254780342|r 74 KGIDIFMG 81 (309) Q Consensus 74 G~iD~~~e 81 (309) ..+|...- T Consensus 81 ~~vdgiii 88 (295) T PRK10653 81 RGTKILLI 88 (295) T ss_pred CCCCEEEE T ss_conf 69988998 No 84 >pfam03401 Bug Tripartite tricarboxylate transporter family receptor. These probable extra-cytoplasmic solute receptors are strongly overrepresented in several beta-proteobacteria. This family, formerly known as Bug - Bordetella uptake gene (bug) product - is a family of bacterial tripartite tricarboxylate receptors of the extracytoplasmic solute binding receptor-dependent transporter group of families, distinct from the ABC and TRAP-T families. The TctABC system has been characterized in S. typhimurium, and TctC is the extracytoplasmic tricarboxylate-binding receptor which binds the transporters TctA and TctB, two integral membrane proteins. Complete three-component systems are found only in bacteria. Probab=80.68 E-value=4.4 Score=20.31 Aligned_cols=233 Identities=12% Similarity=0.033 Sum_probs=109.8 Q ss_pred HHHHHHHHHHCCCCEEEE--EECC-HHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHH-C------CCEEEECCCCCCCE Q ss_conf 999999996106984899--8564-057888851498139998404641356666553-0------47023013566853 Q gi|254780342|r 42 TAMTSVILEEILGYKTNI--KLLA-VPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLE-E------GSIKLVAENLQGAK 111 (309) Q Consensus 42 a~i~~~iLE~~lGy~Ve~--~~~~-~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~-~------~~~~~~~~~~~~~~ 111 (309) |-+++..|++.+|-+|-+ ++|. ..+....+.+-.-|-|.=...+... ....... + ..+..++. ..... T Consensus 2 AR~~a~~l~~~lG~~vvV~N~pGagG~ig~~~v~~a~~DGyTl~~~~~~~-~~~~~~~~~~~~~~~~df~pia~-~~~~~ 79 (274) T pfam03401 2 ARALAQGMSKQLGQPVIVENKPGAGGIIGADAVAKAAPDGYTILLGSTGL-AVAPHLYPNLPYDPIKDFTPVSL-LATSP 79 (274) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCCCEEEEECCHH-HHHHHHHCCCCCCCHHHHHHHHH-HHCCC T ss_conf 78999999998699879996698306999999980899998899946789-87688843478870887675886-50163 Q ss_pred EEEEEEHHHHHCCCCCHHHHHHHHHHHHHCC-CCCCC-CCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHC Q ss_conf 5678634464123455356555566544201-35785-432005558766403545221001036652018999999965 Q gi|254780342|r 112 YMLAVNDVGFALGIKSYQDIAKYKKELGAKI-YGIEP-GNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRN 189 (309) Q Consensus 112 ~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~-~g~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~ 189 (309) ..++|+. +..++++++|.+....-.+++ +|... +...+-......+ ..|... +.++-..+ ++...++-. T Consensus 80 ~~l~v~~---dsp~~t~~dli~~ak~~Pg~~~~g~~G~gs~~hl~~~~l~~--~~G~~~-~~Vpy~G~---~~~~~allg 150 (274) T pfam03401 80 MVLVVPA---DSPFKTLQELVAYAKANPGKLTFASAGIGTSNHLAGELLAS--KAGVQL-SHVPYKGS---SPALQDLLG 150 (274) T ss_pred EEEEECC---CCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHH--HCCCCE-EEECCCCC---HHHHHHHHC T ss_conf 5889779---99756899999999848996587327888631899999999--709964-88346783---266899867 Q ss_pred CCCEEEEEECCCHH--HHCCCCEEEC--CCCCCC-----------CCCHHH----HHHHHCHHHHHHCHHHHHHHHHC-- Q ss_conf 99369994124223--2015851400--786546-----------778658----35442777787488999998740-- Q gi|254780342|r 190 NIPAVFLSWEPHPI--NSDLNIHYLP--GGEEIS-----------GFGEAS----VYTVVRSDYLDKCPNISRLLKNI-- 248 (309) Q Consensus 190 ~~~~v~~~wsp~~~--~~~~dl~~Le--dp~~~~-----------~~p~~~----~~~vv~~~~~~~~P~~~~~L~~~-- 248 (309) |++++.....+... +...+++.|- .++..- ++|... ...++.++.. +++.+.|++. T Consensus 151 G~vd~~~~~~~~~~~~v~~G~lr~Lav~~~~R~~~~PdVPT~~E~G~~~~~~~~w~g~~ap~gtP---~~~~~~l~~a~~ 227 (274) T pfam03401 151 GRVDMMIDSLTSTAPYIKAGKLRALAVTSPKRSPQLPDVPTVAELGLKGFEAGVWFGLVAPKGTP---PAVVEKLNDAIK 227 (274) T ss_pred CEEEEEEECCHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCHHHHCCCCCEEEEEEEEEECCCCC---HHHHHHHHHHHH T ss_conf 84459983513204454179825999975756754489997688487871478888876349889---999999999999 Q ss_pred -CCCHHHHHHHHHHHHH--CCCCHHHHHHHHHHHHHHHHHHHHC Q ss_conf -8998999999999984--5999899999999985999998754 Q gi|254780342|r 249 -KFSVALENEMMKLILN--NKQDRQFVGRTMLRTHPDLLKNWLI 289 (309) Q Consensus 249 -~ls~~~~~~l~~~v~~--~g~~~~~vA~~wL~~n~~~~~~Wl~ 289 (309) .+..++..+....... .-+++++. ++|+++..+.|.+.++ T Consensus 228 ~~~~dpe~~~~~~~~g~~~~~~~~~e~-~~~~~~e~~~~~~l~k 270 (274) T pfam03401 228 KALKDPEVVERLANLGMEPVPGTPQQL-AQFIQAEVKRWGPLIQ 270 (274) T ss_pred HHHCCHHHHHHHHHCCCCCCCCCHHHH-HHHHHHHHHHHHHHHH T ss_conf 997499999999977895788999999-9999999999999999 No 85 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=80.59 E-value=3.8 Score=20.75 Aligned_cols=60 Identities=12% Similarity=-0.012 Sum_probs=29.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCC-------CCCEEEEC-----CC-CCHHHHHHH-HHHHHHHHCCCCEEEEE Q ss_conf 957999999999998630358345-------88379942-----58-869999999-99999961069848998 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADS-------CTPVRFAD-----TG-WTDIAATTA-MTSVILEEILGYKTNIK 60 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~-------~~~I~ig~-----~~-wte~~v~a~-i~~~iLE~~lGy~Ve~~ 60 (309) |||+++++++++++++..+|+.++ +++|+-.. ++ =.++.++.+ |+..+||+.+|-+|.-. T Consensus 5 MKK~~l~~~~~~~~l~LaaCss~~~~Vat~kg~~IT~~e~y~~~k~~~~g~~~l~~mi~~kvl~~~yg~kVsdk 78 (298) T PRK04405 5 MKKWALAAASAGLLLSLAGCSSNNKTVATYKGGKITESQYYKEMKQSSAGKQTLANMIIYRALEKQYGKKVSTK 78 (298) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 89999999999999999871799976998639872599999998644236799999999999999874606799 No 86 >cd00494 HMBS Hydroxymethylbilane synthase (HMBS), also known as porphobilinogen deaminase (PBGD), is an intermediate enzyme in the biosynthetic pathway of tetrapyrrolic ring systems, such as heme, chlorophylls, and vitamin B12. HMBS catalyzes the conversion of porphobilinogen (PBG) into hydroxymethylbilane (HMB). HMBS consists of three domains, and is believed to bind substrate through a hinge-bending motion of domains I and II. HMBS is found in all organisms except viruses. Probab=80.46 E-value=4.5 Score=20.26 Aligned_cols=58 Identities=17% Similarity=0.068 Sum_probs=45.2 Q ss_pred EEEECCCCCHHHHHHHHHHHHHHHCC-CCEEEEEE----CCH----------------HHHHHHHHCCCCEEEEEEECC Q ss_conf 79942588699999999999996106-98489985----640----------------578888514981399984046 Q gi|254780342|r 28 VRFADTGWTDIAATTAMTSVILEEIL-GYKTNIKL----LAV----------------PVTFRSLKNKGIDIFMGYWYP 85 (309) Q Consensus 28 I~ig~~~wte~~v~a~i~~~iLE~~l-Gy~Ve~~~----~~~----------------~~~~~al~~G~iD~~~e~w~p 85 (309) |+||+-...=..+.+++++..|++.. ++++|++. |+. .-+-++|.+|+||+.+..+-. T Consensus 1 IrIgTR~S~LAl~Qa~~V~~~L~~~~p~~~~ei~~i~T~GD~~~~~~L~~~ggkG~Ftkele~aLl~g~iDiAVHSlKD 79 (292) T cd00494 1 IRIGTRKSKLALIQTNKVIEKLKELCPGIEVEIVIIKTTGDKILDKPLAKIGGKGLFTKELEEALLNGEIDLAVHSLKD 79 (292) T ss_pred CEEEECCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCHHHCCCCCEEHHHHHHHHHCCCCCEEEEECCC T ss_conf 9899379899999999999999975899818999996657313676387749961447999999970987879860465 No 87 >PRK10859 putative transglycosylase; Provisional Probab=79.38 E-value=4.9 Score=20.04 Aligned_cols=158 Identities=9% Similarity=0.077 Sum_probs=83.5 Q ss_pred CCCEEEECCCCCHHHH---------HHHHHHHHHHHCCCCEEEEEEC-CHHHHHHHHHCCCCEEEEEEECCCCHHHHHHH Q ss_conf 8837994258869999---------9999999996106984899856-40578888514981399984046413566665 Q gi|254780342|r 25 CTPVRFADTGWTDIAA---------TTAMTSVILEEILGYKTNIKLL-AVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPY 94 (309) Q Consensus 25 ~~~I~ig~~~wte~~v---------~a~i~~~iLE~~lGy~Ve~~~~-~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~ 94 (309) .+.+++++-+-+..+. --++.+. +.+.+|.+.++... +...+|.+|.+|+.|+..-. ++....... T Consensus 42 rG~LrV~T~~spttYy~~~~~~~GFEYELak~-FA~~LGV~L~i~~~~~~~el~~~L~~G~~DiaAAg-Lt~t~~R~~-- 117 (507) T PRK10859 42 RGELRVSTIHSPLTYSTINGKPFGLDYELAKQ-FADYLGVKLKVTVRQNISQLFDDLDNGNADLLAAG-LVYNSERVK-- 117 (507) T ss_pred CCEEEEEEECCCCEEEECCCCCCCHHHHHHHH-HHHHCCCEEEEEECCCHHHHHHHHHCCCCCEEECC-CCCCHHHHH-- T ss_conf 89689999758965994199647799999999-99980991799974999999999867998889557-877853674-- Q ss_pred HHCCCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECC Q ss_conf 53047023013566853567863446412345535655556654420135785432005558766403545221001036 Q gi|254780342|r 95 LEEGSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEA 174 (309) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 174 (309) ....++.+.....-++..+... -..++.||. ++.+....+......... ++...+..-.+.. T Consensus 118 -----~~~~~p~Y~~v~q~LVyr~g~~--rP~~l~DL~-------G~~I~V~~gSsy~e~L~~-Lk~~~~P~L~w~~--- 179 (507) T PRK10859 118 -----NYQPGPTYYSVSQQLVYRVGQY--RPRTLGDLT-------GEQLTVAPGHVVVNDLQT-LKETKFPELSWKV--- 179 (507) T ss_pred -----CCCCCCCCCCEEEEEEEECCCC--CCCCHHHCC-------CCEEEECCCCHHHHHHHH-HHHCCCCCEEEEE--- T ss_conf -----0435686503003999958998--988977837-------986998179759999999-8853489715995--- Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCHHH Q ss_conf 652018999999965993699941242232 Q gi|254780342|r 175 SELASFSQIRRDQRNNIPAVFLSWEPHPIN 204 (309) Q Consensus 175 s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~ 204 (309) ++..-..++-+.+.+|+.+...+=+....+ T Consensus 180 ~~~~~teeLL~~V~~g~IdyTVaDS~~~~l 209 (507) T PRK10859 180 DDKKGSAELLEQVIEGKLDYTIADSVAISL 209 (507) T ss_pred CCCCCHHHHHHHHHCCCCCEEEECCHHHHH T ss_conf 589999999999975987768855488998 No 88 >PRK09408 ompX outer membrane protein X; Provisional Probab=79.18 E-value=1.9 Score=22.63 Aligned_cols=52 Identities=10% Similarity=0.146 Sum_probs=32.2 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHH-CCCCEEEEEE Q ss_conf 957999999999998630358345883799425886999999999999961-0698489985 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEE-ILGYKTNIKL 61 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~-~lGy~Ve~~~ 61 (309) |||++.+.++++.+.++..++.+...||.+|..-- .+-. ..+ .-|.++++.. T Consensus 1 mkk~~~~s~~~~~~~~~a~~a~A~~sTvS~GYAQs-------~~~~--~~n~l~G~NlKYrY 53 (171) T PRK09408 1 MKKIACLSALAAVLAFTAGTAVAATSTVTGGYAQS-------DAQG--VANKMGGFNLKYRY 53 (171) T ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEE-------CCCC--CCCCCCCEEEEEEE T ss_conf 92247899999998856666642576698788642-------0013--46786744888766 No 89 >PRK11013 DNA-binding transcriptional regulator LysR; Provisional Probab=78.77 E-value=5.1 Score=19.92 Aligned_cols=205 Identities=9% Similarity=-0.023 Sum_probs=111.0 Q ss_pred CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEE Q ss_conf 58837994258869999999999999610698489985640578888514981399984046413566665530470230 Q gi|254780342|r 24 SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLV 103 (309) Q Consensus 24 ~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~ 103 (309) ...+++||.......+.+..++....++.=+.++++..+++..+.+.+..|++|+..-..... ... .... T Consensus 92 ~~G~lri~~~~~~~~~~lp~~l~~f~~~~P~v~v~i~~~~~~~~~~~l~~~~~Dl~l~~~~~~-~~~---------~~~~ 161 (309) T PRK11013 92 RQGQLSIACLPVFSQSLLPGLCQPFLARYPDVSLNIVPQESPLLEEWLSAQRHDLGLTETLHT-PAG---------TERT 161 (309) T ss_pred CCCCCCCCCCHHHHHHHCCHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCCEEEEECCCC-CCC---------CEEE T ss_conf 367656530067898866523156784389807999989859999999759977899607889-987---------5788 Q ss_pred CCCCCCCEEEEEEEHHHH--HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHH Q ss_conf 135668535678634464--123455356555566544201357854320055587664035452210010366520189 Q gi|254780342|r 104 AENLQGAKYMLAVNDVGF--ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFS 181 (309) Q Consensus 104 ~~~~~~~~~~~~v~~~~~--~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a 181 (309) .... ....++++..-+ ....-+++||.+. ..+....+...+..........+.... ..+...+ .. T Consensus 162 -~l~~-~~~~~v~~~~hpla~~~~i~~~dL~~~------~~i~~~~~~~~r~~~~~~~~~~g~~~~-~~~e~~~----~~ 228 (309) T PRK11013 162 -ELLT-LDEVCVLPDGHPLAAKKVLTPDDFQGE------NFISLSRTDSYRQLLDQLFAEHGVKRR-MVVETHS----AA 228 (309) T ss_pred -EEEE-EEEEEECCCCCCCCCCCCCCHHHHCCC------CEEECCCCCCHHHHHHHHHHHCCCCCC-EEEEECC----HH T ss_conf -8642-014675478980004999999997389------869638999499999999997699977-6999896----99 Q ss_pred HHHHHHHCCCCEEEEEECCCHHH---HCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHHHH Q ss_conf 99999965993699941242232---015851400786546778658354427777874889999987408998999999 Q gi|254780342|r 182 QIRRDQRNNIPAVFLSWEPHPIN---SDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNIKFSVALENEM 258 (309) Q Consensus 182 ~l~~A~~~~~~~v~~~wsp~~~~---~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~~l 258 (309) .+.+.++.|-.+.+. |.... ..-+++++..+.. .+..+..+.++ -....|.+..|++-+ -++..+| T Consensus 229 ~~~~lV~~G~Gvail---~~~~~~~~~~~g~~~~pl~~~----~~~~~~lv~~~-~r~~s~~~~aF~~~L---r~~a~al 297 (309) T PRK11013 229 SVCAMVRAGLGVSIV---NPLTALDYAGSGLVVRPFSIS----VPFTVSLIRPL-HRPSSALVDAFSEHL---QAQAKAL 297 (309) T ss_pred HHHHHHHHCCEEEEE---CHHHHHHHHHCCEEEEECCCC----CCEEEEEEEEC-CCCCCHHHHHHHHHH---HHHHHHH T ss_conf 999999979969994---699998662199099988889----96189999989-997799999999999---9999999 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780342|r 259 MKLI 262 (309) Q Consensus 259 ~~~v 262 (309) +.++ T Consensus 298 ~~~~ 301 (309) T PRK11013 298 VTRL 301 (309) T ss_pred HHHH T ss_conf 9989 No 90 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=78.77 E-value=5.1 Score=19.92 Aligned_cols=26 Identities=23% Similarity=0.152 Sum_probs=17.2 Q ss_pred CHHHH-HHHHHHHHHHHCCCCEEEEEE Q ss_conf 69999-999999999610698489985 Q gi|254780342|r 36 TDIAA-TTAMTSVILEEILGYKTNIKL 61 (309) Q Consensus 36 te~~v-~a~i~~~iLE~~lGy~Ve~~~ 61 (309) .++.+ ..-|+..+||+.+|-+|.-.. T Consensus 52 ~g~q~l~~mi~~kvLe~~Yg~kVsdke 78 (310) T PRK01326 52 VAQQAMLNLVISRVFEKQYGDKVSDKE 78 (310) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHH T ss_conf 289999999999999998626178999 No 91 >PRK02889 tolB translocation protein TolB; Provisional Probab=77.51 E-value=2 Score=22.52 Aligned_cols=23 Identities=17% Similarity=0.359 Sum_probs=17.4 Q ss_pred HHHHCCCCEEEEEECCCHHHHCC Q ss_conf 99965993699941242232015 Q gi|254780342|r 185 RDQRNNIPAVFLSWEPHPINSDL 207 (309) Q Consensus 185 ~A~~~~~~~v~~~wsp~~~~~~~ 207 (309) .-.+.+++++-+.|+||+.--.| T Consensus 190 ~lt~~~~~~lsP~wSPdg~~iaY 212 (430) T PRK02889 190 IALRSTEPIISPAWSPDGTKVAY 212 (430) T ss_pred EECCCCCEEEEEEECCCCCEEEE T ss_conf 92168964585108799998999 No 92 >PRK05137 tolB translocation protein TolB; Provisional Probab=77.48 E-value=0.99 Score=24.53 Aligned_cols=22 Identities=5% Similarity=-0.035 Sum_probs=16.4 Q ss_pred HHHCCCCEEEEEECCCHHHHCC Q ss_conf 9965993699941242232015 Q gi|254780342|r 186 DQRNNIPAVFLSWEPHPINSDL 207 (309) Q Consensus 186 A~~~~~~~v~~~wsp~~~~~~~ 207 (309) -...+++++.+.|+||+.--.| T Consensus 200 l~~~~~~ilsP~wSPdg~~iaY 221 (437) T PRK05137 200 LTDGSSLVLTPRFSPTRQEITY 221 (437) T ss_pred ECCCCCCCCCCCCCCCCCEEEE T ss_conf 3047875424338589987999 No 93 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=76.58 E-value=3.7 Score=20.80 Aligned_cols=25 Identities=12% Similarity=0.034 Sum_probs=15.5 Q ss_pred CCEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 8379942588699999999999996 Q gi|254780342|r 26 TPVRFADTGWTDIAATTAMTSVILE 50 (309) Q Consensus 26 ~~I~ig~~~wte~~v~a~i~~~iLE 50 (309) ++|+|-+.=-.+..|+++.++|-++ T Consensus 67 k~VtV~sFyMDETEVTN~EYRQFV~ 91 (449) T TIGR03525 67 KTVTVRSFYMDETEITNSEYRQFVE 91 (449) T ss_pred CEEEECCEECCCCCCCHHHHHHHHH T ss_conf 0588803761464111899999999 No 94 >PRK04792 tolB translocation protein TolB; Provisional Probab=75.69 E-value=5.3 Score=19.81 Aligned_cols=23 Identities=17% Similarity=0.276 Sum_probs=18.0 Q ss_pred HHHHCCCCEEEEEECCCHHHHCC Q ss_conf 99965993699941242232015 Q gi|254780342|r 185 RDQRNNIPAVFLSWEPHPINSDL 207 (309) Q Consensus 185 ~A~~~~~~~v~~~wsp~~~~~~~ 207 (309) .-...+++++-+.|+||+.--.| T Consensus 214 ~lt~~~~~~lsP~wSPdg~~iaY 236 (450) T PRK04792 214 LVLRSKQPLMSPAWSPDGKKLAY 236 (450) T ss_pred ECCCCCCEEEEEEECCCCCEEEE T ss_conf 90279972774338589998999 No 95 >PRK11074 putative DNA-binding transcriptional regulator; Provisional Probab=74.55 E-value=6.6 Score=19.18 Aligned_cols=58 Identities=12% Similarity=0.045 Sum_probs=48.0 Q ss_pred CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEE Q ss_conf 5883799425886999999999999961069848998564057888851498139998 Q gi|254780342|r 24 SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMG 81 (309) Q Consensus 24 ~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e 81 (309) ...+++|+.........+..++....+..=+.++++.....+.+.+.+.+|++|+..- T Consensus 90 ~~G~lri~~~~~~~~~~l~~~l~~f~~~~P~v~l~l~~~~~~~~~~~l~~g~~Dlai~ 147 (300) T PRK11074 90 WRGQLSIALDNIVRPDRTRQLIADFYRHFDDVELIIRQEVFNGVWDALADGRADIAIG 147 (300) T ss_pred CCCEEEEEECCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHCCCCCEEEE T ss_conf 8425999865335789999999999998869579999766367999997699778994 No 96 >COG1613 Sbp ABC-type sulfate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=74.31 E-value=6.7 Score=19.14 Aligned_cols=248 Identities=10% Similarity=0.026 Sum_probs=112.9 Q ss_pred CEEEECCCCCHHHHHHHHHHHH--HHHCCCCEEEEEE--CCHHHHHHHHHCC-CCEEEEEEECCCCHHHHHH--HHHCCC Q ss_conf 3799425886999999999999--9610698489985--6405788885149-8139998404641356666--553047 Q gi|254780342|r 27 PVRFADTGWTDIAATTAMTSVI--LEEILGYKTNIKL--LAVPVTFRSLKNK-GIDIFMGYWYPSLEKFIAP--YLEEGS 99 (309) Q Consensus 27 ~I~ig~~~wte~~v~a~i~~~i--LE~~lGy~Ve~~~--~~~~~~~~al~~G-~iD~~~e~w~p~~~~~~~~--~~~~~~ 99 (309) ++...+-+=|...-...+-+.. -++..|-+|.++. |++.-+-.++..| +-|+..=.-.+..+.+... ...++. T Consensus 39 tLlnVSYd~tre~y~~~n~aF~~~WkaetG~~Vti~qShGGSgkQaRsViDGl~ADVVtlal~~Did~i~~~g~li~~dW 118 (348) T COG1613 39 TLLNVSYDPTRELYKEYNPAFAKYWKAETGETVTIQQSHGGSGKQARSVIDGLEADVVTLALAYDIDAIAKAGGLIDKDW 118 (348) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHCCCCCCCEEEEEHHHHHHHHHHHCCCCCCCH T ss_conf 58885215658889998899999988731983599965787620357660675332466411311899997268789465 Q ss_pred EEEECCCC--CCCEEEEEEEHHHHHCCCCCHHHHHHHHHHH---HHCCCCCCCCCCCCHHHHHHHHHCCCCCC------- Q ss_conf 02301356--6853567863446412345535655556654---42013578543200555876640354522------- Q gi|254780342|r 100 IKLVAENL--QGAKYMLAVNDVGFALGIKSYQDIAKYKKEL---GAKIYGIEPGNEGNQRILDMINNNKFSLK------- 167 (309) Q Consensus 100 ~~~~~~~~--~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~---~~~~~g~~~~~~~~~~~~~~~~~~~l~~~------- 167 (309) ...+..+. ......+.|++- .-.+|++|.||.+..-.. ..+.. +..+|-.-..+...++.++-+.. T Consensus 119 ~~rlp~ns~PytStivFlVRkG-NPK~I~DW~DL~k~gV~VItpNPKTS-GgARWN~Laawa~a~~~~~gdeaka~~fV~ 196 (348) T COG1613 119 QKRLPNNSAPYTSTIVFLVRKG-NPKQIRDWDDLVKPGVQVITPNPKTS-GGARWNYLAAWAYALKTNGGDEAKAKDFVG 196 (348) T ss_pred HHHCCCCCCCCCCEEEEEEECC-CCCCCCCHHHHCCCCCEEECCCCCCC-CCHHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 7648699877330699999369-96556755774658867976899887-652278899989987147998899999999 Q ss_pred -CEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHCCCC--E---EECCCCCCCCCCHHHHHHHHCHHHHHH-CHH Q ss_conf -100103665201899999996599369994124223201585--1---400786546778658354427777874-889 Q gi|254780342|r 168 -GFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINSDLNI--H---YLPGGEEISGFGEASVYTVVRSDYLDK-CPN 240 (309) Q Consensus 168 -~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~~~dl--~---~Ledp~~~~~~p~~~~~~vv~~~~~~~-~P~ 240 (309) -+.-++.-+.+.-.....=+++|..+|...|+-.......++ . .+--+-....-| ..-||.+...++ .-+ T Consensus 197 ~L~~nvpvld~gaRgAT~tF~qrgiGDVLi~wENEA~la~~e~g~~~feiV~Ps~si~aEp---PVAVVd~~vdkkgtr~ 273 (348) T COG1613 197 KLYKNVPVLDTGARGATTTFVQRGIGDVLIAWENEALLALNELGGDKFEIVTPSVSILAEP---PVAVVDKNVDKKGTRK 273 (348) T ss_pred HHHHCCCCCCCCCCHHHHHHHHCCCCCEEEEECHHHHHHHHHHCCCCCCEECCCEEEEECC---CEEEEECCCCCCCCHH T ss_conf 9984683245885212578886386647887241888879974688766876860266069---7478751123335689 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 99998740899899999999998459998999999999859 Q gi|254780342|r 241 ISRLLKNIKFSVALENEMMKLILNNKQDRQFVGRTMLRTHP 281 (309) Q Consensus 241 ~~~~L~~~~ls~~~~~~l~~~v~~~g~~~~~vA~~wL~~n~ 281 (309) +++.+-++-+|.+- +++.++--.--.+|+-+|+.-. .-| T Consensus 274 vAeAyl~yLys~~g-Q~i~Ak~~yRP~~p~v~a~~~~-~FP 312 (348) T COG1613 274 VAEAYLKYLYSPEG-QEIAAKHYYRPRDPEVAAKFAD-QFP 312 (348) T ss_pred HHHHHHHHHCCHHH-HHHHHHHCCCCCCHHHHHHHHH-HCC T ss_conf 99999987558388-8999982778898788877786-489 No 97 >PRK11716 DNA-binding transcriptional regulator IlvY; Provisional Probab=73.96 E-value=6.8 Score=19.09 Aligned_cols=196 Identities=9% Similarity=-0.013 Sum_probs=100.1 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEE Q ss_conf 45883799425886999999999999961069848998564057888851498139998404641356666553047023 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKL 102 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~ 102 (309) ....+|+||.........+..+++..-+..=+.++++..+++..+.+.|.+|++|+..-. .|.... + .+.. T Consensus 64 ~~~G~lri~~~~t~~~~~lp~~l~~f~~~~P~v~~~l~~~~~~~~~~~l~~g~~D~ai~~-~p~~~~-------~-~l~~ 134 (269) T PRK11716 64 SLSGELSLFCSVTAAYSHLPPILDRFRAEHPLVEIKLTTGDAADAVEKVQSGEADLAIAA-KPETLP-------S-SLAF 134 (269) T ss_pred CCCCCEEEEECHHHHHHHCCHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEEEE-CCCCCC-------C-CEEE T ss_conf 876626998514887775467899999988894489842899999999986995589993-787899-------9-7699 Q ss_pred ECCCCCCCEEEEEEEHHHH---HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCH Q ss_conf 0135668535678634464---1234553565555665442013578543200555876640354522100103665201 Q gi|254780342|r 103 VAENLQGAKYMLAVNDVGF---ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELAS 179 (309) Q Consensus 103 ~~~~~~~~~~~~~v~~~~~---~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~ 179 (309) . ........++.|.... .....+..|+.+. ..+..+.+ ..+..........++.... .....+ T Consensus 135 ~--~l~~~~l~~v~p~~~~~~~~~~~~~~~dl~~~------p~il~~~~-~~r~~~~~~~~~~~~~p~i-~~e~~~---- 200 (269) T PRK11716 135 S--PIDEIPLVLIAPALPCPVRQQVSQEKPDWSRI------PFILPEHG-PARRRIDLWFRRHKIKPNI-YAQVSG---- 200 (269) T ss_pred E--EEEEEEEEEEEECCCCHHHCCCCCCHHHHHCC------CEEECCCC-CHHHHHHHHHHHCCCCCCE-EEEECC---- T ss_conf 9--96564189999788415436356884468338------67815898-2799999999976999717-999897---- Q ss_pred HHHHHHHHHCCCCEEEEEECCCHHHHC----CCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC Q ss_conf 899999996599369994124223201----585140078654677865835442777787488999998740 Q gi|254780342|r 180 FSQIRRDQRNNIPAVFLSWEPHPINSD----LNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI 248 (309) Q Consensus 180 ~a~l~~A~~~~~~~v~~~wsp~~~~~~----~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~ 248 (309) ...+.+.++.|-.+.+ -|...... ..++.|..+... ++..+..+.+++ ....|.+.+|++-+ T Consensus 201 ~~~i~~~V~~G~Giai---lP~~~~~~~~~~~~l~~l~~~~~~---~~~~i~l~~~~~-r~~sPa~~aF~~~l 266 (269) T PRK11716 201 HEAIVSMVALGCGVGI---LPEVVLDNSPVRDRVQRLEVPPEL---TPFELGVCCQKK-RLHEPLIDAFWQLL 266 (269) T ss_pred HHHHHHHHHHCCEEEE---HHHHHHHCCCCCCCEEEEECCCCC---CCEEEEEEEECC-CCCCHHHHHHHHHH T ss_conf 9999999997994172---059998327557988997489988---736999999799-86799999999999 No 98 >PRK03629 tolB translocation protein TolB; Provisional Probab=73.92 E-value=3.8 Score=20.76 Aligned_cols=53 Identities=13% Similarity=0.013 Sum_probs=30.0 Q ss_pred CHHHHHHHHHHHHHHHHHCCC-------CCCCCCEEEECCCC--CHHHHHHHHHHHHHHHCC Q ss_conf 957999999999998630358-------34588379942588--699999999999996106 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARD-------ADSCTPVRFADTGW--TDIAATTAMTSVILEEIL 53 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~-------~~~~~~I~ig~~~w--te~~v~a~i~~~iLE~~l 53 (309) |||++.+++++.++.++.+.+ .+...++.||..+| .+.....+-+.+++...| T Consensus 1 mk~~~~~~f~~lll~~~~a~A~L~I~It~G~~~~~PIAV~pF~~~g~~~~~~~i~~II~~DL 62 (430) T PRK03629 1 MKQALRVAFGFLILWASVLHAEVRIEITQGVDSARPIGVVPFKWAGPGAAPEDIGGIVAADL 62 (430) T ss_pred CCHHHHHHHHHHHHHHHHHCEEEEEEEECCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHH T ss_conf 90148889989998754422379999934667732569928644788864678999998768 No 99 >PRK11143 glpQ glycerophosphodiester phosphodiesterase; Provisional Probab=73.74 E-value=5.9 Score=19.49 Aligned_cols=46 Identities=15% Similarity=0.116 Sum_probs=31.9 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCEEEECCC----CCHHHHHHHHHHHHH Q ss_conf 9999999999986303583458837994258----869999999999999 Q gi|254780342|r 4 ILAVCLFLTTFSISYARDADSCTPVRFADTG----WTDIAATTAMTSVIL 49 (309) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~----wte~~v~a~i~~~iL 49 (309) +..++++++++..+.+..++..+|+.||+-+ ++|++..+...+.-+ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~plVIAHRGasg~~PEnTl~Af~~Ai~~ 57 (359) T PRK11143 8 LSLALLLASLVAGSSAMAADAAEKIVIAHRGASGYLPEHTLPAKAMAYAQ 57 (359) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHC T ss_conf 99999999998535644446899869968986788865789999999984 No 100 >TIGR03339 phn_lysR aminoethylphosphonate catabolism associated LysR family transcriptional regulator. This group of sequences represents a number of related clades with numerous examples of members adjacent to operons for the degradation of 2-aminoethylphosphonate (AEP) in Pseudomonas, Ralstonia, Bordetella and Burkholderia species. These are transcriptional regulators of the LysR family which contain a helix-turn-helix (HTH) domain (pfam00126) and a periplasmic substrate-binding protein-like domain (pfam03466). Probab=73.71 E-value=6.9 Score=19.05 Aligned_cols=191 Identities=12% Similarity=0.055 Sum_probs=104.7 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEE Q ss_conf 45883799425886999999999999961069848998564057888851498139998404641356666553047023 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKL 102 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~ 102 (309) ....+++||. +....+..+++...++.-+.++++..+++..+.+.|.+|++|+..-.-.+... .+.. T Consensus 84 ~~~g~l~ig~---~~~~~l~~~l~~f~~~~P~v~i~i~~~~~~~~~~~L~~~~~Di~i~~~~~~~~----------~~~~ 150 (279) T TIGR03339 84 LREGSLRIAA---TAPYYVLDLVARFRQRYPGIEVSVRIGNSQEVLQALQSYRVDVAVSSEVVDDP----------RLDR 150 (279) T ss_pred CCCCEEEECC---CHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHCCCEEEEEECCCCCCC----------CEEE T ss_conf 8745168617---51778999999999858986057895580877754025965899972788999----------8399 Q ss_pred ECCCCCCCEEEEEEEHHHH--HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHH Q ss_conf 0135668535678634464--12345535655556654420135785432005558766403545221001036652018 Q gi|254780342|r 103 VAENLQGAKYMLAVNDVGF--ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASF 180 (309) Q Consensus 103 ~~~~~~~~~~~~~v~~~~~--~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~ 180 (309) .........++++..-+ ...-.++.++... ..+..+++...+..........++.... .+...+ . T Consensus 151 --~~l~~~~~~~~~~~~~~la~~~~i~~~~l~~~------~~i~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~----~ 217 (279) T TIGR03339 151 --VVLGNDPLVAVVHRQHPLAERESVTLEELAGQ------PLLMREPGSVTRQTTEEALAAAGVAPRP-ALEIGS----R 217 (279) T ss_pred --EEEECCCEEEEEECCCHHHCCCCCCHHHHCCC------CEEEECCCCCHHHHHHHHHHHCCCCCCE-EEEECC----H T ss_conf --99404448999918863323999999997699------9898289983999999999977998753-677896----9 Q ss_pred HHHHHHHHCCCCEEEEEECCCHHH-HCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHH Q ss_conf 999999965993699941242232-015851400786546778658354427777874889999987 Q gi|254780342|r 181 SQIRRDQRNNIPAVFLSWEPHPIN-SDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLK 246 (309) Q Consensus 181 a~l~~A~~~~~~~v~~~wsp~~~~-~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~ 246 (309) ..+.+.+.+|..+.+ -|.... ...+++.++-... .|.-.+..+.+++- ...|.+..|++ T Consensus 218 ~~~~~~v~~G~Giai---lP~~~v~~~~~l~~~pl~~~---~~~~~~~l~~~~~r-~~sp~~~aFie 277 (279) T TIGR03339 218 EAIREAVLAGLGVSV---VSAAEVGRDPRLRVLPIVGA---EPTMDEYLYCLKER-RGARLIAAFLE 277 (279) T ss_pred HHHHHHHHHCCEEEE---EEHHHHHHCCCEEEEECCCC---CCCCEEEEEEECCC-CCCHHHHHHHH T ss_conf 999999997993897---14899610989899989888---98627999998998-68999999996 No 101 >COG0181 HemC Porphobilinogen deaminase [Coenzyme metabolism] Probab=72.88 E-value=7.2 Score=18.92 Aligned_cols=57 Identities=16% Similarity=0.069 Sum_probs=44.7 Q ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHCCC-CEEEEEECCH--------------------HHHHHHHHCCCCEEEEEE Q ss_conf 83799425886999999999999961069-8489985640--------------------578888514981399984 Q gi|254780342|r 26 TPVRFADTGWTDIAATTAMTSVILEEILG-YKTNIKLLAV--------------------PVTFRSLKNKGIDIFMGY 82 (309) Q Consensus 26 ~~I~ig~~~wte~~v~a~i~~~iLE~~lG-y~Ve~~~~~~--------------------~~~~~al~~G~iD~~~e~ 82 (309) .+|+||+-...=..++++.+...|++..+ +++|++...| .-+-++|.+|++|+.+.. T Consensus 2 ~~irIgTR~S~LAl~Qa~~V~~~L~~~~p~~~~ei~~i~T~GD~i~d~pL~~iGgkGlFtkEle~all~g~~DiAVHS 79 (307) T COG0181 2 MKLRIGTRGSKLALAQANEVIERLKAAYPDLEVEIVTIKTKGDRILDKPLSKIGGKGLFTKELEQALLEGEIDIAVHS 79 (307) T ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCHHCCCHHHHCCCCEEEHHHHHHHHHCCCCCEEEEE T ss_conf 735885178788999999999999986899538999985145142013488848818779999999976997789760 No 102 >pfam10566 Glyco_hydro_97 Glycoside hydrolase 97. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands. Probab=72.59 E-value=3.7 Score=20.85 Aligned_cols=18 Identities=17% Similarity=0.102 Sum_probs=11.9 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 957999999999998630 Q gi|254780342|r 1 MYKILAVCLFLTTFSISY 18 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~ 18 (309) |||+++.++++++++++. T Consensus 1 mkk~~~~~~~~~~~~~~~ 18 (643) T pfam10566 1 MKKLTIILLAFLLLIGNL 18 (643) T ss_pred CCCHHHHHHHHHHHHHHH T ss_conf 950257899999999765 No 103 >PRK00178 tolB translocation protein TolB; Provisional Probab=72.07 E-value=5.6 Score=19.62 Aligned_cols=22 Identities=14% Similarity=0.223 Sum_probs=16.5 Q ss_pred HHHCCCCEEEEEECCCHHHHCC Q ss_conf 9965993699941242232015 Q gi|254780342|r 186 DQRNNIPAVFLSWEPHPINSDL 207 (309) Q Consensus 186 A~~~~~~~v~~~wsp~~~~~~~ 207 (309) -...+++++-+.|+||+.--.| T Consensus 195 lt~~~~~~lsP~wSPdg~~iaY 216 (433) T PRK00178 195 LLQSREPILSPRWSPDGKRIAY 216 (433) T ss_pred CCCCCCCEECCEECCCCCEEEE T ss_conf 2368974670007589988999 No 104 >COG0018 ArgS Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=71.41 E-value=7.8 Score=18.71 Aligned_cols=57 Identities=18% Similarity=0.036 Sum_probs=40.5 Q ss_pred CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEE--CCHHHHHHHHHCCCCEEEEE Q ss_conf 83458837994258869999999999999610698489985--64057888851498139998 Q gi|254780342|r 21 DADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKL--LAVPVTFRSLKNKGIDIFMG 81 (309) Q Consensus 21 ~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~--~~~~~~~~al~~G~iD~~~e 81 (309) +|+-.+|.+||+ .=+.|++..++.+|+. +||+|.... +|.+.++.-|..+=.-.+.| T Consensus 125 SaNptkplHiGH---lR~aiiGDsLaril~~-~Gy~V~r~~yvnD~G~Q~~~l~~~~~~~~~e 183 (577) T COG0018 125 SANPTGPLHIGH---LRNAIIGDSLARILEF-LGYDVTRENYVNDWGTQIGMLALSYEKRGRE 183 (577) T ss_pred CCCCCCCCCCCH---HHHHHHHHHHHHHHHH-CCCCEEEEEEECCHHHHHHHHHHHHHHHCCC T ss_conf 799999852332---3536889999999998-3997667866785799999999999995541 No 105 >TIGR00456 argS arginyl-tRNA synthetase; InterPro: IPR001278 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Arginyl-tRNA synthetase (6.1.1.19 from EC) has been crystallized and preliminary X-ray crystallographic analysis of yeast arginyl-tRNA synthetase-yeast tRNAArg complexes is available . ; GO: 0000166 nucleotide binding, 0004814 arginine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006420 arginyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=70.12 E-value=8 Score=18.62 Aligned_cols=48 Identities=21% Similarity=0.163 Sum_probs=37.0 Q ss_pred CCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEE--CCHHHHHHHH Q ss_conf 583458837994258869999999999999610698489985--6405788885 Q gi|254780342|r 20 RDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKL--LAVPVTFRSL 71 (309) Q Consensus 20 ~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~--~~~~~~~~al 71 (309) ++|+-.+|++||+ -=+.|++..++-+||- +||+|.+.. +|-+.++.-+ T Consensus 125 sSaNp~gplH~GH---lR~aiIGD~L~r~l~~-~G~~V~r~yyVND~G~Q~~~l 174 (600) T TIGR00456 125 SSANPAGPLHIGH---LRNAIIGDSLARILEF-LGYDVIREYYVNDWGRQIGLL 174 (600) T ss_pred ECCCCCCCCCHHH---HHHHHHHHHHHHHHHH-CCCCEEEEEEECCHHHHHHHH T ss_conf 6588897611000---0257778899999987-597079852005428999999 No 106 >PRK10752 sulfate transporter subunit; Provisional Probab=69.53 E-value=8.6 Score=18.44 Aligned_cols=237 Identities=13% Similarity=0.063 Sum_probs=99.7 Q ss_pred HHHHHHHHHHHHH-------HHCCCCEEEEEE--CCHHHHHHHHHCC-CCEEEEEEECCCCHHHHHH-HHHCCCEEEECC Q ss_conf 9999999999999-------610698489985--6405788885149-8139998404641356666-553047023013 Q gi|254780342|r 37 DIAATTAMTSVIL-------EEILGYKTNIKL--LAVPVTFRSLKNK-GIDIFMGYWYPSLEKFIAP-YLEEGSIKLVAE 105 (309) Q Consensus 37 e~~v~a~i~~~iL-------E~~lGy~Ve~~~--~~~~~~~~al~~G-~iD~~~e~w~p~~~~~~~~-~~~~~~~~~~~~ 105 (309) +.-++-++|+++. +++.|-+|++.. |++..+-.++..| +-|+..=.-.+..+.+..+ ....+....+.. T Consensus 28 SYd~tRely~~~n~~F~~~wk~~tG~~V~i~qShGGSg~QARaVi~Gl~ADVVtlal~~Did~l~~~GlI~~dW~~~~P~ 107 (329) T PRK10752 28 SYDPTRELYEQYNKAFSAHWKQQTGDNVVIRQSHGGSGKQATSVINGIEADVVTLALAYDVDAIAERGRIDKNWIKRLPD 107 (329) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHCCCCCCEEEECCHHHHHHHHHCCCCCCCHHHHCCC T ss_conf 40758999999989999999883299038985578762789998478756778751421368998679999145654889 Q ss_pred CC--CCCEEEEEEEHHHHHCCCCCHHHHHHHHHHH---HHCCCCCCCCCCCCHHHHHHHHHCC---------CCCCCEEE Q ss_conf 56--6853567863446412345535655556654---4201357854320055587664035---------45221001 Q gi|254780342|r 106 NL--QGAKYMLAVNDVGFALGIKSYQDIAKYKKEL---GAKIYGIEPGNEGNQRILDMINNNK---------FSLKGFRL 171 (309) Q Consensus 106 ~~--~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~---~~~~~g~~~~~~~~~~~~~~~~~~~---------l~~~~~~~ 171 (309) +. .....-+.|++- .-.+|++|.||.+..-.. ..+.. +...|..-..+...++..+ +..-+.++ T Consensus 108 ns~pytStivflVRkG-NPK~IkdW~DL~k~gV~VItPNPKTS-G~aRwn~LAawg~a~~~~gg~e~~a~~fv~~l~~NV 185 (329) T PRK10752 108 NSAPYTSTIVFLVRKG-NPKQIHDWNDLIKPGVSVITPNPKSS-GGARWNYLAAWGYALHHNNNDQAKAQDFVKALYKNV 185 (329) T ss_pred CCCCEEEEEEEEECCC-CCCCCCCHHHHCCCCCEEECCCCCCC-CHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC T ss_conf 9965223799997189-95668774674648978976899887-317799999999999724999999999999998368 Q ss_pred ECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHH--CCC---CEEECCCCCCCCCCHHHHHHHHCHHHHHH-CHHHHHHH Q ss_conf 0366520189999999659936999412422320--158---51400786546778658354427777874-88999998 Q gi|254780342|r 172 IEASELASFSQIRRDQRNNIPAVFLSWEPHPINS--DLN---IHYLPGGEEISGFGEASVYTVVRSDYLDK-CPNISRLL 245 (309) Q Consensus 172 ~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~--~~d---l~~Ledp~~~~~~p~~~~~~vv~~~~~~~-~P~~~~~L 245 (309) .....++ -.....=+++|..+|...|+-..... +++ +.++--+-....-|| ..+|.+-..++ ..++++.+ T Consensus 186 ~vld~ga-R~AT~tF~~rg~GDVLlt~ENEa~l~~~~~g~~~~eiV~P~~silae~p---VAvVd~~vdk~GTr~~A~ay 261 (329) T PRK10752 186 EVLDSGA-RGSTNTFVERGIGDVLIAWENEALLAANELGKDKFEIVTPSESILAEPT---VSVVDKVVEKKGTREVAEAY 261 (329) T ss_pred CCCCCCC-HHHHHHHHHCCCCCEEEEHHHHHHHHHHHCCCCCCEEECCCCCEECCCC---EEEEECCCCCCCCHHHHHHH T ss_conf 5258973-4677899864867557634379999998608998628789963551597---59983445666889999999 Q ss_pred HHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 740899899999999998459998999999999859 Q gi|254780342|r 246 KNIKFSVALENEMMKLILNNKQDRQFVGRTMLRTHP 281 (309) Q Consensus 246 ~~~~ls~~~~~~l~~~v~~~g~~~~~vA~~wL~~n~ 281 (309) -++-+|.+-+ ++.++.-.--.+++ |++++-+.-| T Consensus 262 l~fLys~eaQ-~i~Ak~~yRP~~~~-v~~~~~~~FP 295 (329) T PRK10752 262 LKYLYSPEGQ-EIAAKNYYRPRDAE-VAKKYENAFP 295 (329) T ss_pred HHHHCCHHHH-HHHHHHCCCCCCHH-HHHHHHCCCC T ss_conf 9986597999-99998088989978-9887654399 No 107 >cd00671 ArgRS_core This is the catalytic core domain of Arginyl tRNA synthetase (ArgRS). This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. There are at least three subgroups of ArgRS. One type contains both characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding. The other subtype lacks the KMSKS motif; however, it has a lysine N-terminal to the HIGH motif, which serves as the functional counterpart to the second lysine of the KMSKS motif. A third group, which is found primarily in archaea and a few bacteria, lacks both the KMSKS motif and the HIGH loop lysine. Probab=68.76 E-value=8.9 Score=18.34 Aligned_cols=51 Identities=18% Similarity=0.085 Sum_probs=38.9 Q ss_pred CCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEE--CCHHHHHHHHHCC Q ss_conf 583458837994258869999999999999610698489985--6405788885149 Q gi|254780342|r 20 RDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKL--LAVPVTFRSLKNK 74 (309) Q Consensus 20 ~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~--~~~~~~~~al~~G 74 (309) ++|+-.+|.+||. -=+.+++..++-+||. .||+|.... ++.+.++.-+..| T Consensus 7 ~S~Np~kplHvGH---lR~aiiGdslaril~~-~G~~V~r~~yvnD~G~Qi~~l~~~ 59 (267) T cd00671 7 VSANPTGPLHVGH---LRNAIIGDSLARILEF-AGYDVTREYYINDWGRQIGLLILS 59 (267) T ss_pred CCCCCCCCCCHHH---HHHHHHHHHHHHHHHH-CCCEEEEEEECCCCCHHHHHHHHH T ss_conf 5899999850310---3638999999999998-799588988507860999999999 No 108 >KOG4164 consensus Probab=68.35 E-value=2.1 Score=22.41 Aligned_cols=80 Identities=15% Similarity=0.089 Sum_probs=55.9 Q ss_pred HHHHCCCCEEECCCCCCCC------------------CCHHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 2320158514007865467------------------7865835442777787488999998740899899999999998 Q gi|254780342|r 202 PINSDLNIHYLPGGEEISG------------------FGEASVYTVVRSDYLDKCPNISRLLKNIKFSVALENEMMKLIL 263 (309) Q Consensus 202 ~~~~~~dl~~Ledp~~~~~------------------~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~~l~~~v~ 263 (309) +....|+--.|+||.--|+ .-+.++....++.|.+++|.+.=.|++++==-.+|++++ . T Consensus 318 ~d~~dydasllDdPq~~~gkhk~~l~F~symttvidyVrpsdlKkdmNe~FreKfP~IkLTLSKirSlKREMr~l~---~ 394 (497) T KOG4164 318 GDKCDYDASLLDDPQWPCGKHKRVLIFSSYMTTVIDYVRPSDLKKDMNETFREKFPHIKLTLSKIRSLKREMRELG---E 394 (497) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCEEEEEECHHHHHHHHHHHHHHHCCEEEEEHHHHHHHHHHHHHHH---H T ss_conf 4431358100258655444544236750003135778665788888768888748825776888888999999742---1 Q ss_pred HCCCCHHHHHHHHHHHHHHHH Q ss_conf 459998999999999859999 Q gi|254780342|r 264 NNKQDRQFVGRTMLRTHPDLL 284 (309) Q Consensus 264 ~~g~~~~~vA~~wL~~n~~~~ 284 (309) ..|-++..||..|+==-+=.. T Consensus 395 d~~id~~TVa~AyVYFEKliL 415 (497) T KOG4164 395 DCGIDVVTVAMAYVYFEKLIL 415 (497) T ss_pred CCCCCCEEEHHHHHHHHHHHH T ss_conf 048660110257999999998 No 109 >PRK10877 thiol:disulfide interchange protein DsbC; Provisional Probab=67.69 E-value=5.3 Score=19.78 Aligned_cols=61 Identities=21% Similarity=0.255 Sum_probs=35.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEE-EECCHHHHHHHHHCCCCEEE Q ss_conf 95799999999999863035834588379942588699999999999996106984899-85640578888514981399 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNI-KLLAVPVTFRSLKNKGIDIF 79 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~-~~~~~~~~~~al~~G~iD~~ 79 (309) |||.+.++.++++.+++++ .++ +. .+++-|++ +|.+|+- .+-.-..+|+-+..|+| +| T Consensus 1 mkk~l~~~~~l~~~~s~~a-~aD-------------ea-----~i~~~l~k-~g~~v~~V~~sPi~GlyeV~~~~~i-~Y 59 (232) T PRK10877 1 MKKGFMLFTLLAAAFSGFA-HAD-------------DA-----AIQQTLAK-LGIQSADIQPSPVAGMKTVLTESGV-LY 59 (232) T ss_pred CCHHHHHHHHHHHHHHHHH-HCC-------------HH-----HHHHHHHH-CCCCCCEECCCCCCCEEEEEECCEE-EE T ss_conf 9266999999999986252-437-------------99-----99999998-3998313556899982999989978-99 Q ss_pred EEE Q ss_conf 984 Q gi|254780342|r 80 MGY 82 (309) Q Consensus 80 ~e~ 82 (309) .+- T Consensus 60 ~s~ 62 (232) T PRK10877 60 ITD 62 (232) T ss_pred ECC T ss_conf 869 No 110 >COG2182 MalE Maltose-binding periplasmic proteins/domains [Carbohydrate transport and metabolism] Probab=65.76 E-value=10 Score=17.96 Aligned_cols=57 Identities=14% Similarity=0.040 Sum_probs=38.1 Q ss_pred CCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHH----HHHHHHHCCC-CEEEEEE Q ss_conf 88379942588699999999999996106984899856405----7888851498-1399984 Q gi|254780342|r 25 CTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVP----VTFRSLKNKG-IDIFMGY 82 (309) Q Consensus 25 ~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~----~~~~al~~G~-iD~~~e~ 82 (309) .++++|-..+-.+...+.++.+...++ +|++|+++..+-. ....+...|+ -|++... T Consensus 40 ~~~ltvW~~~~~~~~~i~~~~~kfek~-~gi~V~i~~~~~~d~~~~~~~~~paGqgpDi~~~~ 101 (420) T COG2182 40 EKKLTVWVDGEKEVDGIKEAAAKFEKE-TGIKVKIVEEDYDDLLEKFLLAAPAGQGPDIFVWA 101 (420) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHH-HCCEEEEEECCCCCHHHHHHCCCCCCCCCCEEEEC T ss_conf 874899757830679999999999998-79758999467504888860137778889779856 No 111 >PRK06934 flavodoxin; Provisional Probab=65.50 E-value=10 Score=17.93 Aligned_cols=35 Identities=9% Similarity=0.054 Sum_probs=21.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHH Q ss_conf 99999999998630358345883799425886999 Q gi|254780342|r 5 LAVCLFLTTFSISYARDADSCTPVRFADTGWTDIA 39 (309) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~ 39 (309) +++++++++++...++.+...+.|-|++..++|.. T Consensus 15 ~~~~~~~~~~~~~a~~~~q~~~k~LI~Yfs~~~~~ 49 (221) T PRK06934 15 LFLAVAVSSLSLIAEAADQNARRVLIVYFSQPEDV 49 (221) T ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCCC T ss_conf 99999999888875037767755799996475555 No 112 >PRK09934 putative fimbrial protein; Provisional Probab=65.40 E-value=7.1 Score=18.96 Aligned_cols=29 Identities=10% Similarity=0.001 Sum_probs=17.2 Q ss_pred CHHHHHHHHHHHHHHHHH-CCCCCCCCCEE Q ss_conf 957999999999998630-35834588379 Q gi|254780342|r 1 MYKILAVCLFLTTFSISY-ARDADSCTPVR 29 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~-~~~~~~~~~I~ 29 (309) |||++++++++++++... ++.+....+|+ T Consensus 1 mkk~~l~~~~~l~~~~~~~aa~~~g~g~i~ 30 (171) T PRK09934 1 MRRVFIAIFCGLLWSPLSQAASPLGEINIE 30 (171) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 926899999999986344211358805899 No 113 >PRK10216 DNA-binding transcriptional regulator YidZ; Provisional Probab=65.09 E-value=10 Score=17.88 Aligned_cols=58 Identities=9% Similarity=0.035 Sum_probs=46.7 Q ss_pred CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEE Q ss_conf 58837994258869999999999999610698489985640578888514981399984 Q gi|254780342|r 24 SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGY 82 (309) Q Consensus 24 ~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~ 82 (309) +..+.+|+..+..+..++..+++.+-++.=+..+++...+.. ..+.|.+|++|+.... T Consensus 64 ~~~~F~ia~~d~~~~~llp~l~~~i~~~AP~v~l~~~~~~~~-~~~~L~~g~iDl~i~~ 121 (288) T PRK10216 64 RGLKFELAAESPLMMIMLNALSQQIYQRYPQATIKLRNWDYD-SLDAITRGEVDIGFTG 121 (288) T ss_pred CCCEEEEEECCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCC-HHHHHHCCCCCEEEEC T ss_conf 570799996367899999999999997699977996248935-5999975997689835 No 114 >PRK10918 phosphate transporter subunit; Provisional Probab=64.85 E-value=11 Score=17.85 Aligned_cols=77 Identities=10% Similarity=0.035 Sum_probs=44.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEE Q ss_conf 79999999999986303583458837994258869999999999999610698489985640578888514981399984 Q gi|254780342|r 3 KILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGY 82 (309) Q Consensus 3 k~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~ 82 (309) |.+.+.++++.++++..+. ....+|+=+...|....... +..-..+..|.+|++...++..=...+.+|.+||-..= T Consensus 5 ~~~~~~~~~~~ls~~a~~a-~aa~~l~GaGaTf~~p~~~~--w~~~~~~~~g~~v~Y~~~GSG~G~~~f~~g~vdFa~SD 81 (346) T PRK10918 5 RTTVATVVAATLSMSAFSA-FAAASLTGAGATFPAPVYAK--WADTYQKETGNKVNYQGIGSSGGVKQIIANTVDFGASD 81 (346) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHCCCCCCCCCHHHHHHHH--HHHHHHHHCCCEEEECCCCCHHHHHHHHCCCCCEEECC T ss_conf 8899999999998877678-76166124143349999999--99999984598689826687899999973898566458 No 115 >TIGR00212 hemC porphobilinogen deaminase; InterPro: IPR000860 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin . The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) . This entry represents hydroxymethylbilane synthase (or porphobilinogen deaminase, 2.5.1.61 from EC), which functions during the second stage of tetrapyrrole biosynthesis. This enzyme catalyses the polymerisation of four PBG molecules into the tetrapyrrole structure, preuroporphyrinogen, with the concomitant release of four molecules of ammonia. This enzyme uses a unique dipyrro-methane cofactor made from two molecules of PBG, which is covalently attached to a cysteine side chain. The tetrapyrrole product is synthesized in an ordered, sequential fashion, by initial attachment of the first pyrrole unit (ring A) to the cofactor, followed by subsequent additions of the remaining pyrrole units (rings B, C, D) to the growing pyrrole chain . The link between the pyrrole ring and the cofactor is broken once all the pyrroles have been added. This enzyme is folded into three distinct domains that enclose a single, large active site that makes use of an aspartic acid as its one essential catalytic residue, acting as a general acid/base during catalysis , . A deficiency of hydroxymethylbilane synthase is implicated in the neuropathic disease, Acute Intermittent Porphyria (AIP) . ; GO: 0004418 hydroxymethylbilane synthase activity, 0033014 tetrapyrrole biosynthetic process. Probab=64.00 E-value=11 Score=17.75 Aligned_cols=54 Identities=19% Similarity=0.138 Sum_probs=41.8 Q ss_pred EEEECCCCCHHHHHHHHHHHHHHHCCC---CEEEEEE----CC---HHH-HH-------------HHHHCCCCEEEEEE Q ss_conf 799425886999999999999961069---8489985----64---057-88-------------88514981399984 Q gi|254780342|r 28 VRFADTGWTDIAATTAMTSVILEEILG---YKTNIKL----LA---VPV-TF-------------RSLKNKGIDIFMGY 82 (309) Q Consensus 28 I~ig~~~wte~~v~a~i~~~iLE~~lG---y~Ve~~~----~~---~~~-~~-------------~al~~G~iD~~~e~ 82 (309) |+||+-+..=..++|+.++..|++. + |++|++. || ... ++ +||-+|+||+.+.. T Consensus 1 l~IGTRgS~LAl~Qa~~V~~~l~~~-~~p~~~~ei~~~kT~GD~~~~~~~L~~~GgKGlFtKEle~aLL~geiD~AVHS 78 (312) T TIGR00212 1 LRIGTRGSKLALAQANLVKEQLKEV-YPPELDTEIVIIKTTGDKIQDKPLLSDIGGKGLFTKELEQALLDGEIDLAVHS 78 (312) T ss_pred CEEECCCCHHHHHHHHHHHHHHHHH-CCCCCCEEEEEEEECCCCCCCCCCEEECCCHHHHHHHHHHHHHCCCCCEEEEC T ss_conf 9784048689999999999999973-69740217999980486111610002137503668899999625984488843 No 116 >pfam06316 Ail_Lom Enterobacterial Ail/Lom protein. This family consists of several bacterial and phage Ail/Lom-like proteins. The Yersinia enterocolitica Ail protein is a known virulence factor. Proteins in this family are predicted to consist of eight transmembrane beta-sheets and four cell surface-exposed loops. It is thought that Ail directly promotes invasion and loop 2 contains an active site, perhaps a receptor-binding domain. The phage protein Lom is expressed during lysogeny, and encode host-cell envelope proteins. Lom is found in the bacterial outer membrane, and is homologous to virulence proteins of two other enterobacterial genera. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=63.92 E-value=8 Score=18.63 Aligned_cols=54 Identities=11% Similarity=0.062 Sum_probs=30.9 Q ss_pred CHHHHHHHHHHHHHHHHHCC---CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 95799999999999863035---83458837994258869999999999999610698489985 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYAR---DADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKL 61 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~---~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~ 61 (309) |||+.+.+++++.+.+..+. +.+...||.+|.. .+++=.+-.++--|.++++.. T Consensus 1 m~~~~~~~l~~~~~l~~~~~~a~A~~~~sTvS~GYa-------Qa~~~~~g~~~l~G~NlKYRY 57 (199) T pfam06316 1 MRKLCAAILSAAICLAAAGTPASAAEHQSTLSAGYL-------HAHTDAPGSDDLNGINVKYRY 57 (199) T ss_pred CCHHHHHHHHHHHHHHHHCCCCHHHCCCCEEEECEE-------EEECCCCCCCCCCCEEEEEEE T ss_conf 901778999999999972773112147866862210-------031475667788853888664 No 117 >TIGR02995 ectoine_ehuB ectoine/hydroxyectoine ABC transporter solute-binding protein; InterPro: IPR014337 Members of this entry are the extracellular solute-binding proteins of ABC transporters and they closely resemble amino acid transporters. The member from Sinorhizobium meliloti (Rhizobium meliloti) is involved in ectoine uptake, both for osmoprotection and for catabolism. All other members of the seed alignment are found associated with ectoine catabolic genes.. Probab=62.85 E-value=9.2 Score=18.24 Aligned_cols=146 Identities=17% Similarity=0.145 Sum_probs=81.2 Q ss_pred CCCEEEECCCC----------CHHHHHHHHHHHHHHHCCCCE-EEEEECCHHHHHHHHHCCCCEEEEEE--ECCCCHHHH Q ss_conf 88379942588----------699999999999996106984-89985640578888514981399984--046413566 Q gi|254780342|r 25 CTPVRFADTGW----------TDIAATTAMTSVILEEILGYK-TNIKLLAVPVTFRSLKNKGIDIFMGY--WYPSLEKFI 91 (309) Q Consensus 25 ~~~I~ig~~~w----------te~~v~a~i~~~iLE~~lGy~-Ve~~~~~~~~~~~al~~G~iD~~~e~--w~p~~~~~~ 91 (309) ..-||||..|= +..-+.-.|...||+ .||.+ |.-+.-.-+.+..+|+.|+.|+..-. -.|..= T Consensus 33 QG~~Ria~ANE~Pf~y~g~DGkv~G~~PdvA~aifK-~lGi~dVn~~~te~G~lIPGL~AgRfDaiaAg~~i~PeRC--- 108 (285) T TIGR02995 33 QGFVRIAIANEVPFAYVGADGKVAGAAPDVARAIFK-RLGIKDVNASVTEYGALIPGLKAGRFDAIAAGLFIKPERC--- 108 (285) T ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHH-HCCCEEEEEEECCCCCCCCCCCCCHHHHHHHCCCCCHHHH--- T ss_conf 530477400678640026885313685578999998-5481100000356403145410102568861157885666--- Q ss_pred HHHHHCCCEEEECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEE Q ss_conf 66553047023013566853567863446412345535655556654420135785432005558766403545221001 Q gi|254780342|r 92 APYLEEGSIKLVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRL 171 (309) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~ 171 (309) ..+....+++. ...++.|++-.+. .+.|..|+.+..+. .+++..|... .+..+.-|-..+..-+ T Consensus 109 ------kqVaft~P~~~-~~eallvkKGNPk-~L~sY~DiAknP~~----K~a~~~G~~e----~~~ar~~GVk~eqi~~ 172 (285) T TIGR02995 109 ------KQVAFTEPILS-LAEALLVKKGNPK-ELKSYKDIAKNPEV----KVAVVAGAVE----VKYAREAGVKEEQIVV 172 (285) T ss_pred ------HHHHHCCCCCC-CCCCCCCCCCCCC-CCCCCCCHHCCCCC----EEEECCCCHH----HHHHHHCCCCCCEEEE T ss_conf ------46440688644-7552112587888-87752020068231----2761247237----8999865898660788 Q ss_pred ECCCCCCHHHHHHHHHHCCCCEEE Q ss_conf 036652018999999965993699 Q gi|254780342|r 172 IEASELASFSQIRRDQRNNIPAVF 195 (309) Q Consensus 172 ~~~s~~~~~a~l~~A~~~~~~~v~ 195 (309) ++....++ ++++.+..++. T Consensus 173 vPd~~~~l-----~~vq~~RaDay 191 (285) T TIGR02995 173 VPDTASAL-----KAVQDKRADAY 191 (285) T ss_pred ECCCHHHH-----HHHHHHCCCEE T ss_conf 24871678-----88774034220 No 118 >COG3019 Predicted metal-binding protein [General function prediction only] Probab=61.79 E-value=12 Score=17.49 Aligned_cols=57 Identities=9% Similarity=0.047 Sum_probs=29.7 Q ss_pred CHHHHHH-HHHHHHHHHHHCCCCCCCCCEEE-ECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECC Q ss_conf 9579999-99999998630358345883799-425886999999999999961069848998564 Q gi|254780342|r 1 MYKILAV-CLFLTTFSISYARDADSCTPVRF-ADTGWTDIAATTAMTSVILEEILGYKTNIKLLA 63 (309) Q Consensus 1 mkk~~~~-~~~~~~~~~~~~~~~~~~~~I~i-g~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~ 63 (309) |||...+ .++++.+.+..++.+. ...+.+ =++| ---+..+.+.||.+ |++|+.+.-+ T Consensus 1 Mrr~~~l~~l~a~~l~~~~~~~a~-~~~~~vyksPn----CGCC~~w~~~mk~~-Gf~Vk~~~~~ 59 (149) T COG3019 1 MRRRAFLRSLAALGLGSTGPAQAQ-ATEMVVYKSPN----CGCCDEWAQHMKAN-GFEVKVVETD 59 (149) T ss_pred CCHHHHHHHHHHHHHHCCCCHHCC-EEEEEEEECCC----CCCHHHHHHHHHHC-CCEEEEEECC T ss_conf 912689999999985314400101-03578873889----86179999999867-9578985257 No 119 >COG4588 AcfC Accessory colonization factor AcfC, contains ABC-type periplasmic domain [General function prediction only] Probab=61.18 E-value=12 Score=17.42 Aligned_cols=201 Identities=16% Similarity=0.142 Sum_probs=93.7 Q ss_pred EEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEECCCC Q ss_conf 79942588699999999999996106984899856405788885149813999840464135666655304702301356 Q gi|254780342|r 28 VRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLVAENL 107 (309) Q Consensus 28 I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~~~~ 107 (309) |++=.++=+. .++-.+.. ..|+++|.+|+++.|... .|.--+.++-|+..........++......+-....+.+. T Consensus 23 inlYGpGGPh-taL~~vA~-~~~ektg~kVnvt~GPq~-tW~~kAkknADilfgaseqsalaia~~~~~~fs~~~i~pl- 98 (252) T COG4588 23 INLYGPGGPH-TALKDVAK-KYEEKTGIKVNVTAGPQA-TWNEKAKKNADILFGASEQSALAIAEDHKDSFSEKNIQPL- 98 (252) T ss_pred EEEECCCCCC-HHHHHHHH-HHHHHHCEEEEEECCCCC-HHHHHHHCCCCEEECCCHHHHHHHHHHCCCCCCCCCCCEE- T ss_conf 7876699986-78999999-987874807999418861-0445641367456416488899999856333552025313- Q ss_pred CCCEEEEEEEHHHHHCCCCCHHHHHHHHHHH---HHCCCCCCCCCCCCHHHHHHH------HH-CCCCCCCEEEECCCCC Q ss_conf 6853567863446412345535655556654---420135785432005558766------40-3545221001036652 Q gi|254780342|r 108 QGAKYMLAVNDVGFALGIKSYQDIAKYKKEL---GAKIYGIEPGNEGNQRILDMI------NN-NKFSLKGFRLIEASEL 177 (309) Q Consensus 108 ~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~---~~~~~g~~~~~~~~~~~~~~~------~~-~~l~~~~~~~~~~s~~ 177 (309) ..+......+...-.+|+.++||.+..-.. ++..+.--+ +...++++. +. -.+.....-+.+.|.+ T Consensus 99 -y~R~aiIlvkkgNPknIk~~eDll~~gi~ivV~dGaG~snts---gtgvwED~agr~~~ie~v~afR~NI~~fapnSga 174 (252) T COG4588 99 -YLRPAIILVKKGNPKNIKGFEDLLKPGIGIVVNDGAGVSNTS---GTGVWEDIAGRKGNIETVAAFRKNIVAFAPNSGA 174 (252) T ss_pred -EEECEEEEECCCCCCCCCCHHHHHCCCCEEEEECCCCCCCCC---CCEEHHHHHCCCCCHHHHHHHHHCEEEECCCCCH T ss_conf -540217996279954445688870578629983797644777---7220486541011488999988523897468704 Q ss_pred CHHHHHHHHHHC-CCCEEEEEECCCHHHHCCCC---EEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHCC Q ss_conf 018999999965-99369994124223201585---1400786546778658354427777874889999987408 Q gi|254780342|r 178 ASFSQIRRDQRN-NIPAVFLSWEPHPINSDLNI---HYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNIK 249 (309) Q Consensus 178 ~~~a~l~~A~~~-~~~~v~~~wsp~~~~~~~dl---~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ 249 (309) + ..|+++ -..++.-.| +||..+--|. +.++.+ .-.|-..++ +.+++ .++++.+|+.=++ T Consensus 175 A-----rkaf~~~~~aDawItW-~dWa~snpdig~~v~~~~d--~vIyRd~nv--~~~~~---a~~ea~~F~dyl~ 237 (252) T COG4588 175 A-----RKAFENQPDADAWITW-ADWAKSNPDIGDAVEIEKD--YVIYRDFNV--ALAKD---ANKEARDFADYLQ 237 (252) T ss_pred H-----HHHHHCCCCCCEEEEE-CCHHHHCCCHHCEEECCCC--EEEEEECCE--EECCC---CCHHHHHHHHHHH T ss_conf 8-----9998549998558870-1323309850100451666--177540203--43689---9978999999972 No 120 >pfam00750 tRNA-synt_1d tRNA synthetases class I (R). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only arginyl tRNA synthetase. Probab=61.16 E-value=12 Score=17.42 Aligned_cols=48 Identities=19% Similarity=0.192 Sum_probs=35.4 Q ss_pred CCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEE--CCHHHHHHHH Q ss_conf 583458837994258869999999999999610698489985--6405788885 Q gi|254780342|r 20 RDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKL--LAVPVTFRSL 71 (309) Q Consensus 20 ~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~--~~~~~~~~al 71 (309) ++|+-.+|.+||+ -=+.+++..++-+||. .||+|.... ++.+.++.-+ T Consensus 26 ~S~Np~kplHvGH---lR~aiiGdslarll~~-~G~~V~r~~yvnD~G~Qi~~l 75 (345) T pfam00750 26 SSPNPAKPIHVGH---LRSTIIGDALSRLLEF-LGYDVVRENYVGDWGTQFGML 75 (345) T ss_pred CCCCCCCCCCHHH---HHHHHHHHHHHHHHHH-CCCEEEEEEEECCHHHHHHHH T ss_conf 3899998850410---1527999999999998-799598998768648999999 No 121 >TIGR03042 PS_II_psbQ_bact photosystem II protein PsbQ. This protein through the member sll1638 from Synechocystis sp. PCC 6803, was shown to be part of the cyanobacteria photosystem II. It is homologous to (but quite diverged from) the chloroplast PsbQ protein, called oxygen-evolving enhancer protein 3 (OEE3). We designate this cyanobacteria protein PsbQ by homology. Probab=59.93 E-value=11 Score=17.75 Aligned_cols=46 Identities=11% Similarity=0.020 Sum_probs=26.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 95799999999999863035834588379942588699999999999996 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILE 50 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE 50 (309) +|+++.+|++++++++..|++|....|-+ --+++...-+.+..-++ T Consensus 1 lRsi~s~~L~~~~~fLvsC~gp~a~~p~t----ysp~~l~~Iq~~~~~i~ 46 (142) T TIGR03042 1 LRSLASLLLVLLLTFLVSCSGPAAAVPPT----YSPAQLAQIQRQAEGIE 46 (142) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCCC----CCHHHHHHHHHHHHHHH T ss_conf 97699999999999998838998889998----69999999999998899 No 122 >PRK03757 hypothetical protein; Provisional Probab=59.36 E-value=7.7 Score=18.75 Aligned_cols=22 Identities=18% Similarity=-0.091 Sum_probs=14.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9579999999999986303583 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA 22 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~ 22 (309) |||++++++++++++.+.++.+ T Consensus 1 mkk~~l~~~l~a~~~~~~~A~a 22 (191) T PRK03757 1 MKKTLLGLTLGSLLFSAGSAVA 22 (191) T ss_pred CCHHHHHHHHHHHHHHCCCCCC T ss_conf 9043999999999863040457 No 123 >PRK11528 hypothetical protein; Provisional Probab=59.14 E-value=4.2 Score=20.44 Aligned_cols=38 Identities=21% Similarity=0.293 Sum_probs=26.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCC---CCEEEECCCCCHH Q ss_conf 9579999999999986303583458---8379942588699 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSC---TPVRFADTGWTDI 38 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~---~~I~ig~~~wte~ 38 (309) |||.++++.+++.+.+...+.+.-. ..|.+...+|+.+ T Consensus 1 m~K~ll~ls~~~~~~~~~~a~ae~~~gfa~is~nylDWs~~ 41 (254) T PRK11528 1 MRKSLLALSLLATLALAAPAQAEYKGGFANISLNYLDWTSR 41 (254) T ss_pred CCHHHHHHHHHHHHCCCCCCCHHHHCCEEEEECCEEECCCC T ss_conf 94124568999873156642045525531001132322688 No 124 >pfam06572 DUF1131 Protein of unknown function (DUF1131). This family consists of several hypothetical bacterial proteins of unknown function. Probab=58.27 E-value=5.5 Score=19.71 Aligned_cols=78 Identities=14% Similarity=0.187 Sum_probs=40.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCC-----------CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC------C Q ss_conf 95799999999999863035834-----------588379942588699999999999996106984899856------4 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDAD-----------SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLL------A 63 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~-----------~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~------~ 63 (309) ||.+...++.+.++..++++.++ .+.++.|-...--+-...+-|-.+.|++.|+-+-++..| . T Consensus 1 ~~~~r~~l~~~~llL~GCst~s~fsWSslsP~nWFgssl~Vs~~GVG~Ita~Tpm~e~aI~~~L~g~Y~lRsGM~t~nG~ 80 (192) T pfam06572 1 MKSLRLFLLALPLLLTGCSTLSNFSWSSLSPWNWFGSSLEVTEQGVGGITAGTPLNEGAIEDALDGDYRLRSGMKTSNGQ 80 (192) T ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCHHHHHHHHCCCCEECCCCCCCCCE T ss_conf 94301368776688606556688542226872203784288326535212578368899998758974663655205981 Q ss_pred HHHHHHHHHCCCCEE Q ss_conf 057888851498139 Q gi|254780342|r 64 VPVTFRSLKNKGIDI 78 (309) Q Consensus 64 ~~~~~~al~~G~iD~ 78 (309) .-.+|+||+..++-+ T Consensus 81 iv~~fqAmk~d~V~l 95 (192) T pfam06572 81 IVRYFQAMKGDQVKL 95 (192) T ss_pred EEEEEEEECCCCEEE T ss_conf 999987606881799 No 125 >TIGR02478 6PF1K_euk 6-phosphofructokinase; InterPro: IPR009161 Members of this family are eukaryotic (with one exception) ATP-dependent 6-phosphofructokinases (2.7.1.11 from EC) in which two tandem copies of the phosphofructokinase are found . Members are found, often including several isozymes, in animals and fungi and in the bacterium Propionibacterium acnes KPA171202 (a human skin commensal). Phosphofructokinase (PFK) catalyses the phosphorylation of fructose-6-phosphate to fructose-1,6-biphosphate, which then enters the Embden-Meyerhof pathway. PFK is a key regulatory enzyme in glycolysis. For additional information please see , , .; GO: 0003872 6-phosphofructokinase activity, 0006096 glycolysis, 0005737 cytoplasm. Probab=57.32 E-value=7.6 Score=18.79 Aligned_cols=47 Identities=17% Similarity=0.177 Sum_probs=37.3 Q ss_pred CCEEEE---CCCCCHHHHHHHHHHHHHHHCCCCEEEEEE------CCHHHHHHHHH Q ss_conf 837994---258869999999999999610698489985------64057888851 Q gi|254780342|r 26 TPVRFA---DTGWTDIAATTAMTSVILEEILGYKTNIKL------LAVPVTFRSLK 72 (309) Q Consensus 26 ~~I~ig---~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~------~~~~~~~~al~ 72 (309) ++|+|- ..|-..+-|+++.+|++|+++|+++++++. |+.+.+|.-+. T Consensus 243 ~~iViVAEGA~D~~~~pItse~VK~vl~~~L~lDtRiT~LGHVQRGG~PsA~DR~L 298 (777) T TIGR02478 243 KNIVIVAEGAIDRELNPITSEDVKDVLVERLGLDTRITVLGHVQRGGAPSAYDRIL 298 (777) T ss_pred CEEEEEEECEECCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHH T ss_conf 40799842105467779688989888644018402476605224688722899999 No 126 >TIGR01328 met_gam_lyase methionine gamma-lyase; InterPro: IPR006237 This family of sequences is a methionine gamma-lyase subset of a family of PLP-dependent trans-sulphuration enzymes. The member from the parasite Trichomonas vaginalis is described as catalyzing alpha gamma- and alpha-beta eliminations and gamma-replacement reactions on methionine, cysteine, and some derivatives. Likewise, the enzyme from Pseudomonas degrades cysteine as well as methionine.. Probab=56.18 E-value=6.9 Score=19.06 Aligned_cols=191 Identities=13% Similarity=0.209 Sum_probs=99.5 Q ss_pred EEECCCCCCCEEEEEEEHHHHHCCCC----CHHHHHHHHHHH-------------------HHCCCCCCCCCCCCHHH-H Q ss_conf 23013566853567863446412345----535655556654-------------------42013578543200555-8 Q gi|254780342|r 101 KLVAENLQGAKYMLAVNDVGFALGIK----SYQDIAKYKKEL-------------------GAKIYGIEPGNEGNQRI-L 156 (309) Q Consensus 101 ~~~~~~~~~~~~~~~v~~~~~~~~i~----~~~dl~~~~~~~-------------------~~~~~g~~~~~~~~~~~-~ 156 (309) +.-|.-++|.+.|+.+.+-. .|.+. .++.|.+-++.. .-.++..+.-.+|+-.. . T Consensus 38 eqGGnrFAG~e~Gy~YsRlg-NPTv~~LE~riA~LEg~EAavAt~SGMgAi~A~l~~~lkAgDh~i~d~~lYGCTfall~ 116 (392) T TIGR01328 38 EQGGNRFAGEEEGYVYSRLG-NPTVDLLERRIAELEGAEAAVATASGMGAIAATLLSLLKAGDHVISDETLYGCTFALLE 116 (392) T ss_pred HCCCCCCCCCCCCEEEECCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHH T ss_conf 40555366743662685267-71377888898754204677887512899999999998528757743874201489987 Q ss_pred HHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECC-CHH-------------HHCCCCEEECCCCCCCCCC Q ss_conf 76640354522100103665201899999996599369994124-223-------------2015851400786546778 Q gi|254780342|r 157 DMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEP-HPI-------------NSDLNIHYLPGGEEISGFG 222 (309) Q Consensus 157 ~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp-~~~-------------~~~~dl~~Ledp~~~~~~p 222 (309) ..+..|+.... ++... ...++.++++.+.- ++|.=|| ++. ..+-+++++-| +.|.-| T Consensus 117 ~~ltkfgi~v~---f~~~~---~pee~~a~i~dnT~-ivYfETPaNPtm~liD~e~v~r~A~~~gG~kVivD--NTfatP 187 (392) T TIGR01328 117 DALTKFGIDVK---FVDMA---SPEEVKAAIKDNTK-IVYFETPANPTMKLIDLELVVRVAHEKGGVKVIVD--NTFATP 187 (392) T ss_pred HHHHHHCCEEE---EECCC---CHHHHHHHCCCCCE-EEEEECCCCCCHHHHHHHHHHHHHHHCCCEEEEEE--CCCCCH T ss_conf 32677375279---83178---77999984489977-99986599875257759999999975298289985--662770 Q ss_pred HHH------HHHHHCH--HHHHHCHHHHH--------HHHHCC------------------CCHHHHHHHHHHHHHCCCC Q ss_conf 658------3544277--77874889999--------987408------------------9989999999999845999 Q gi|254780342|r 223 EAS------VYTVVRS--DYLDKCPNISR--------LLKNIK------------------FSVALENEMMKLILNNKQD 268 (309) Q Consensus 223 ~~~------~~~vv~~--~~~~~~P~~~~--------~L~~~~------------------ls~~~~~~l~~~v~~~g~~ 268 (309) .=. +-.||++ ++..-|-++.. .|++++ |=..=+.=|+-+|+.+-++ T Consensus 188 ~lt~P~alG~D~VVHSATKYigGHgDvVaG~~~G~~e~~q~iR~~g~KD~tG~ViSPfdAwL~lRGLkTL~~RM~rhsen 267 (392) T TIGR01328 188 YLTRPVALGVDVVVHSATKYIGGHGDVVAGVLIGDAELLQRIRLVGVKDMTGAVISPFDAWLVLRGLKTLAVRMKRHSEN 267 (392) T ss_pred HHCCHHHCCCCEEEECCCCCCCCCHHHHHHHHCCCHHHHHHHCCCCCCCCCCCEECHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 01380361877689424121376122112211077566744011110005665005578999974366777888773053 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHH Q ss_conf 8999999999859999987547545677137999 Q gi|254780342|r 269 RQFVGRTMLRTHPDLLKNWLIGVTTFDGQDPSRQ 302 (309) Q Consensus 269 ~~~vA~~wL~~n~~~~~~Wl~~~~~~~~~~~~~~ 302 (309) ++.|| +||++||..=.=.==|-.-+.++|-+++ T Consensus 268 A~kvA-e~Lk~HPaVe~V~YPG~~~hp~hdiaak 300 (392) T TIGR01328 268 AEKVA-EYLKEHPAVEKVYYPGLEEHPDHDIAAK 300 (392) T ss_pred HHHHH-HHHHCCCCCCEEECCCCCCCCCCHHHHH T ss_conf 89999-9852388602123688778887408999 No 127 >PRK09765 2-O-a-mannosyl-D-glycerate specific PTS transporten components IIABC; Provisional Probab=56.16 E-value=9.7 Score=18.08 Aligned_cols=39 Identities=26% Similarity=0.169 Sum_probs=31.9 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHH---CCCCEEEEEE Q ss_conf 45883799425886999999999999961---0698489985 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEE---ILGYKTNIKL 61 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~---~lGy~Ve~~~ 61 (309) .+.++..+|...-+.+.+.++|.++-||+ ++||++++-. T Consensus 160 ~~~~~~IvavTaCPtGIAHTYMAaeaL~~aak~~g~~ikVET 201 (638) T PRK09765 160 FSNAPTIVCVTACPAGIAHTYMAAEYLEKAGRKLGVNVYVEK 201 (638) T ss_pred CCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 567862677403773167899999999999998398589974 No 128 >PRK10959 outer membrane protein W; Provisional Probab=55.62 E-value=9.9 Score=18.04 Aligned_cols=19 Identities=26% Similarity=0.233 Sum_probs=13.7 Q ss_pred CHHHHHHHHHHHHHHHHHC Q ss_conf 9579999999999986303 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYA 19 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~ 19 (309) |||+++++++++.++++.+ T Consensus 1 Mkk~~~a~~~~~a~~s~~A 19 (212) T PRK10959 1 MKKLTVAALAVATLLSGSA 19 (212) T ss_pred CCHHHHHHHHHHHHHHHHH T ss_conf 9378999999998751677 No 129 >TIGR01323 nitrile_alph nitrile hydratase, alpha subunit; InterPro: IPR004232 Nitrile hydratases (4.2.1.84 from EC) are bacterial enzymes that catalyse the hydration of nitrile compounds to the corresponding amides. They are used as biocatalysts in acrylamide production, one of the few commercial scale bioprocesses, as well as in environmental remediation for the removal of nitriles from waste streams. Nitrile hydratases are composed of two subunits, alpha and beta, and are normally active as a tetramer, alpha(2)beta(2). Nitrile hydratases contain either a non-haem iron or a non-corrinoid cobalt centre, both types sharing a highly conserved peptide sequence in the alpha subunit (CXLCSC) that provides all the residues involved in coordinating the metal ion. Each type of nitrile hydratase specifically incorporated its metal with the help of activator proteins encoded by flanking regions of the nitrile hydratase genes that are necessary for metal insertion. The Fe-containing enzyme is photo-regulated: in the dark the enzyme is inactivated due to the association of nitric oxide (NO) to the iron, while in the light the enzyme is active by photo-dissociation of NO. The NO is held in place by a claw setting formed through specific oxygen atoms in two modified cysteines and a serine residue in the active site , . The cobalt-containing enzyme is unaffected by NO, but was shown to undergo a similar effect with carbon monoxide , . Fe- and cobalt-containing enzymes also display different inhibition patterns with nitrophenols. This entry represents the alpha subunit, which is a duplication of two structural repeats, each consisting of 4 layers, alpha/beta/beta/alpha.; GO: 0018822 nitrile hydratase activity, 0046914 transition metal ion binding, 0006807 nitrogen compound metabolic process. Probab=55.54 E-value=8 Score=18.64 Aligned_cols=21 Identities=24% Similarity=0.320 Sum_probs=11.8 Q ss_pred CCCCCCCCHHHHHHHHCHHHH Q ss_conf 865467786583544277778 Q gi|254780342|r 215 GEEISGFGEASVYTVVRSDYL 235 (309) Q Consensus 215 p~~~~~~p~~~~~~vv~~~~~ 235 (309) |.+.-.+.+.++..+|++|.+ T Consensus 157 PaGte~~sEEqL~~LVtrDsl 177 (186) T TIGR01323 157 PAGTENFSEEQLAKLVTRDSL 177 (186) T ss_pred CCCCCCCCHHHHHHHHCCCCE T ss_conf 898888787888644100100 No 130 >PRK10808 outer membrane protein A; Reviewed Probab=54.85 E-value=16 Score=16.74 Aligned_cols=31 Identities=13% Similarity=-0.136 Sum_probs=20.2 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 9579999999999986303583458837994 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFA 31 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig 31 (309) |||++++++++++.+.+.+.++.+..+-=+| T Consensus 1 MKKt~ial~va~a~~a~~A~Aa~~dn~wY~G 31 (347) T PRK10808 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTG 31 (347) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 9467999999999777643120002887896 No 131 >PRK08475 F0F1 ATP synthase subunit B; Validated Probab=54.74 E-value=13 Score=17.21 Aligned_cols=24 Identities=17% Similarity=0.332 Sum_probs=16.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 957999999999998630358345 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADS 24 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~ 24 (309) |+|++.+++++.+++++..+++.+ T Consensus 1 m~~~~~~~~~~~~~~~a~~~~~~~ 24 (170) T PRK08475 1 MMKLFFLILLLPLYAFGSDIGSGE 24 (170) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 908999999999999863578886 No 132 >COG0834 HisJ ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms] Probab=54.52 E-value=16 Score=16.71 Aligned_cols=136 Identities=15% Similarity=0.118 Sum_probs=72.7 Q ss_pred HHHHHHHHCCCCE--EEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEEECCCCCCCEEEEEEEHHHH Q ss_conf 9999996106984--89985640578888514981399984046413566665530470230135668535678634464 Q gi|254780342|r 44 MTSVILEEILGYK--TNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKLVAENLQGAKYMLAVNDVGF 121 (309) Q Consensus 44 i~~~iLE~~lGy~--Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~ 121 (309) +...+-+ .++.. ++.+...-...+.++.+|.+|+.+.....+. +... .+.. ...+......+.+..... T Consensus 65 l~~~ia~-~l~~~~~~~~~~~~~~~~~~~l~~g~~D~~~~~~~~t~-~r~~------~~~f-s~py~~~~~~~~~~~~~~ 135 (275) T COG0834 65 LAKAIAK-RLGGDKKVEFVPVAWDGLIPALKAGKVDIIIAGMTITP-ERKK------KVDF-SDPYYYSGQVLLVKKDSD 135 (275) T ss_pred HHHHHHH-HHCCCCCEEEEECCHHHHHHHHHCCCCCEEEECCCCCH-HHHH------CCCC-CCCEEECCEEEEEECCCC T ss_conf 9999999-84677424885068877638987799888985673687-7752------0056-777365465899976885 Q ss_pred HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 12345535655556654420135785432005558766403545221001036652018999999965993699941242 Q gi|254780342|r 122 ALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPH 201 (309) Q Consensus 122 ~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~ 201 (309) . .+++++||.. +.+|...+.. ................... ... .+...++++|+.+++....+- T Consensus 136 ~-~~~~~~DL~g-------k~v~v~~gt~--~~~~~~~~~~~~~~~~~~~-~~~-----~~~~~al~~Gr~Da~~~d~~~ 199 (275) T COG0834 136 I-GIKSLEDLKG-------KKVGVQLGTT--DEAEEKAKKPGPNAKIVAY-DSN-----AEALLALKNGRADAVVSDSAV 199 (275) T ss_pred C-CCCCHHHHCC-------CEEEEECCCH--HHHHHHHHHCCCCCEEEEC-CCH-----HHHHHHHHCCCCCEEEECHHH T ss_conf 4-5589889589-------7699966845--8888776404677449981-888-----999999984998899706888 Q ss_pred HHH Q ss_conf 232 Q gi|254780342|r 202 PIN 204 (309) Q Consensus 202 ~~~ 204 (309) ... T Consensus 200 ~~~ 202 (275) T COG0834 200 LAG 202 (275) T ss_pred HHH T ss_conf 766 No 133 >PRK00031 lolA outer-membrane lipoprotein carrier protein; Reviewed Probab=54.24 E-value=9.9 Score=18.02 Aligned_cols=19 Identities=32% Similarity=0.368 Sum_probs=14.2 Q ss_pred CHHHHHHHHHHHHHHHHHC Q ss_conf 9579999999999986303 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYA 19 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~ 19 (309) |||++.++++++.++++.+ T Consensus 1 mk~~~~~~~~~~~l~~~~~ 19 (201) T PRK00031 1 MKKLLIAALLAAALVASSA 19 (201) T ss_pred CCHHHHHHHHHHHHHHHHH T ss_conf 9159999999999877664 No 134 >PRK10598 hypothetical protein; Provisional Probab=53.50 E-value=16 Score=16.60 Aligned_cols=40 Identities=10% Similarity=0.011 Sum_probs=24.5 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHH Q ss_conf 957999999999998630358345883799425886999999999999 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVI 48 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~i 48 (309) |||++.+++++....+++|+.-.+ -+-||+.+..++-+.+ T Consensus 1 mkk~~l~~~l~l~~llsGC~~l~q--------YsISEqEin~YL~~~~ 40 (186) T PRK10598 1 MKKFLLAAALLVSGLLVGCNQLTQ--------YTISEQEINQYLAKHN 40 (186) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCC--------EEECHHHHHHHHHHHC T ss_conf 912699999999999714353364--------4007999999998743 No 135 >PRK13697 cytochrome c6; Provisional Probab=53.20 E-value=4.7 Score=20.15 Aligned_cols=43 Identities=14% Similarity=0.037 Sum_probs=21.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHH Q ss_conf 9579999999999986303583458837994258869999999 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTA 43 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~ 43 (309) |||++..++++.++++....+++.+..+.-|..-|...=+.++ T Consensus 1 Mk~l~~~~l~~~~~~~~~~~~pA~Aad~~~G~~iF~~nCA~CH 43 (111) T PRK13697 1 MKKILKLVLLTLLLLTFAFTSPAFAADAANGEQIFSANCAGCH 43 (111) T ss_pred CHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCHHHC T ss_conf 9789999999999999980355444167531799985376514 No 136 >PRK13680 hypothetical protein; Provisional Probab=52.91 E-value=17 Score=16.54 Aligned_cols=42 Identities=14% Similarity=0.261 Sum_probs=29.9 Q ss_pred HHHHHHHHCCCCCCCCCEEEEC-CCCCHHHHHHHHHHHHHHHC Q ss_conf 9999863035834588379942-58869999999999999610 Q gi|254780342|r 11 LTTFSISYARDADSCTPVRFAD-TGWTDIAATTAMTSVILEEI 52 (309) Q Consensus 11 ~~~~~~~~~~~~~~~~~I~ig~-~~wte~~v~a~i~~~iLE~~ 52 (309) ++++++.+++.+.+.+.+++.+ .+-+.+.|.+.|=...|..+ T Consensus 16 ll~~s~~~~a~~ee~r~akf~~C~gL~~~qIAaqVKrDflQNR 58 (121) T PRK13680 16 LLACSSVFSANAEESKSVKFPKCEGLDAAGIAASVKRDYQQNR 58 (121) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 9998877766505765334454579898999999999998711 No 137 >PRK01611 argS arginyl-tRNA synthetase; Reviewed Probab=51.87 E-value=17 Score=16.44 Aligned_cols=49 Identities=18% Similarity=0.185 Sum_probs=33.6 Q ss_pred CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEE--CCHHHHHHHHHC Q ss_conf 83458837994258869999999999999610698489985--640578888514 Q gi|254780342|r 21 DADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKL--LAVPVTFRSLKN 73 (309) Q Consensus 21 ~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~--~~~~~~~~al~~ 73 (309) +|+-.+|.+||+ .=+.+++..++.+||. .||+|.... +|.+.++.-+.. T Consensus 126 S~Np~kplHvGH---~R~aiiGdslaril~~-~G~~V~r~~yvnD~G~Qi~~l~~ 176 (570) T PRK01611 126 SANPTGPLHVGH---LRSAVIGDALARILEF-AGYDVTREYYVNDAGTQIGMLAA 176 (570) T ss_pred CCCCCCCCCHHH---HHHHHHHHHHHHHHHH-CCCCEEEEEEECCCCHHHHHHHH T ss_conf 899999862335---8878999999999998-39907999997874279999999 No 138 >pfam06510 DUF1102 Protein of unknown function (DUF1102). This family consists of several hypothetical archaeal proteins of unknown function. Probab=51.04 E-value=14 Score=17.05 Aligned_cols=22 Identities=23% Similarity=0.427 Sum_probs=15.3 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9579999999999986303583 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA 22 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~ 22 (309) |||++..+++++.+++..++.+ T Consensus 1 Mkkl~gi~~ll~~l~L~vGa~a 22 (183) T pfam06510 1 MKKLIGILLLLAGLGLVVGAGA 22 (183) T ss_pred CHHHHHHHHHHHHHHHHCCCCC T ss_conf 9148999999999995400565 No 139 >PRK09733 putative fimbrial protein; Provisional Probab=50.95 E-value=18 Score=16.41 Aligned_cols=30 Identities=20% Similarity=0.110 Sum_probs=17.9 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 9579999999999986303583458837994 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFA 31 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig 31 (309) |||.++++++++.++.+..+ +....+|++- T Consensus 2 mKk~ll~~~~~~~~s~~~~A-~~~~G~V~F~ 31 (181) T PRK09733 2 FKKSLLVAGVAMILSNNVFA-DEGHGIVKFK 31 (181) T ss_pred HHHHHHHHHHHHHHHHHHHH-HCCCCEEEEE T ss_conf 06889999999998666576-4488689999 No 140 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=50.94 E-value=13 Score=17.25 Aligned_cols=82 Identities=10% Similarity=0.102 Sum_probs=42.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCC------CCCEEEECCCCCHH-------HH-----------HHHHHHHHHHHCCCCE Q ss_conf 957999999999998630358345------88379942588699-------99-----------9999999996106984 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADS------CTPVRFADTGWTDI-------AA-----------TTAMTSVILEEILGYK 56 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~------~~~I~ig~~~wte~-------~v-----------~a~i~~~iLE~~lGy~ 56 (309) |||++..+++++++..+|+..... +.|-.+...+.++| .= +.....+.+.++++.+ T Consensus 1 MkK~i~~il~~~llL~GCs~mn~~~~~e~ks~~e~~~vqdYvGQGfs~~~~~~~~e~akkhr~ei~k~~~qyfkd~y~~~ 80 (304) T pfam07901 1 MRKLIKLILIATLLLSGCSTTNNESNKETKSVPEEMEASKYVGQGFQPPAEKDAIEFAKKHKDKIAKRGEQFFMDNFGLK 80 (304) T ss_pred CCHHHHHHHHHHHHHCCCCCCCCCCCHHHCCCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCE T ss_conf 90189999999999713344555420010367751138660677668677304899999979999999999998751730 Q ss_pred EEEEE--CCHHHH--HHHHHCCCCEEEEEE Q ss_conf 89985--640578--888514981399984 Q gi|254780342|r 57 TNIKL--LAVPVT--FRSLKNKGIDIFMGY 82 (309) Q Consensus 57 Ve~~~--~~~~~~--~~al~~G~iD~~~e~ 82 (309) |+... +++..+ |---..-+|-+..+. T Consensus 81 VKv~NVVga~d~a~V~Vec~~~~i~F~tSi 110 (304) T pfam07901 81 VKATNVVGSGDGVEVYVHCDDHDIVFNASI 110 (304) T ss_pred EEEEEEECCCCCEEEEEECCCCCCEEEEEE T ss_conf 589778617981899998258883355423 No 141 >pfam11777 DUF3316 Protein of unknown function (DUF3316). This family of bacterial proteins has no known function. Several members are, however, annotated as being putative acyl-CoA synthetase, but this could not be confirmed. Probab=50.79 E-value=18 Score=16.33 Aligned_cols=32 Identities=16% Similarity=0.239 Sum_probs=20.5 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC---CCCCCEEEEC Q ss_conf 9579999999999986303583---4588379942 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA---DSCTPVRFAD 32 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~---~~~~~I~ig~ 32 (309) |||++++++++++.+.++++.- .+.+++..+. T Consensus 1 mKk~~ll~~~ll~s~~AfA~~~~~~~~~~~~~~~~ 35 (114) T pfam11777 1 MKKLILLALLLLLSATAFAGNYVSTTQTDSLQTDV 35 (114) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECC T ss_conf 91089999999998888755553655625897155 No 142 >PRK02710 plastocyanin; Provisional Probab=50.68 E-value=18 Score=16.32 Aligned_cols=30 Identities=17% Similarity=0.159 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCEEEEC Q ss_conf 799999999999863035834588379942 Q gi|254780342|r 3 KILAVCLFLTTFSISYARDADSCTPVRFAD 32 (309) Q Consensus 3 k~~~~~~~~~~~~~~~~~~~~~~~~I~ig~ 32 (309) |.++++++++.++++.+..+..++++.+-. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~a~A~~~~v~m 35 (119) T PRK02710 6 RSIAAALVAVVSSFGLGVSSASAATVEVKM 35 (119) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCEEEEEE T ss_conf 999999999999998125702025289997 No 143 >PRK10159 outer membrane phosphoporin protein E; Provisional Probab=49.73 E-value=18 Score=16.34 Aligned_cols=31 Identities=10% Similarity=0.038 Sum_probs=19.7 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC----CCCCCEEEE Q ss_conf 9579999999999986303583----458837994 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA----DSCTPVRFA 31 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~----~~~~~I~ig 31 (309) |||.+++++++++++++.+.++ .++.++.|+ T Consensus 1 MKk~~lA~ai~a~l~s~~a~AaeVYn~dg~~ldig 35 (351) T PRK10159 1 MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVY 35 (351) T ss_pred CCHHHHHHHHHHHHHCCCCCEEEEEECCCCEEEEE T ss_conf 94738999999998726531799997899889783 No 144 >PRK10718 hypothetical protein; Provisional Probab=49.70 E-value=6.2 Score=19.34 Aligned_cols=54 Identities=15% Similarity=0.271 Sum_probs=31.5 Q ss_pred CCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEEC------CHHHHHHHHHCCCCEE Q ss_conf 88379942588699999999999996106984899856------4057888851498139 Q gi|254780342|r 25 CTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLL------AVPVTFRSLKNKGIDI 78 (309) Q Consensus 25 ~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~------~~~~~~~al~~G~iD~ 78 (309) +.+++|-...--+-...+-|-.+.|++.|+-+-++..| ..-.+|+||+..++-+ T Consensus 36 gs~l~vs~~GVG~Ita~Tpm~e~aI~~aL~g~Y~lRsGM~t~~G~iv~~fqA~k~d~V~l 95 (191) T PRK10718 36 GSSTKVSEQGVGEITASTPLQEQAIADALDGDYRLRSGMKTANGNVVRFFEAMKGDQVAM 95 (191) T ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHHCCCCEECCCCEECCCEEEEEEEECCCCEEEE T ss_conf 883388426534313588367899998748973663655214984999885417881899 No 145 >pfam11348 DUF3150 Protein of unknown function (DUF3150). This bacterial family of proteins with unknown function appears to be restricted to Proteobacteria. Probab=49.55 E-value=19 Score=16.20 Aligned_cols=13 Identities=15% Similarity=-0.289 Sum_probs=8.1 Q ss_pred EEEEEEEHHHHHC Q ss_conf 3567863446412 Q gi|254780342|r 111 KYMLAVNDVGFAL 123 (309) Q Consensus 111 ~~~~~v~~~~~~~ 123 (309) ..||+||....+. T Consensus 73 lgG~aVP~~~~~~ 85 (257) T pfam11348 73 LGGYAVPEDKADE 85 (257) T ss_pred CCCEECCHHHHHH T ss_conf 3734456888999 No 146 >pfam02379 consensus Probab=49.47 E-value=9.5 Score=18.14 Aligned_cols=32 Identities=28% Similarity=0.298 Sum_probs=25.4 Q ss_pred EECCCCCHHHHHHHHHHHHHHH---CCCCEEEEEE Q ss_conf 9425886999999999999961---0698489985 Q gi|254780342|r 30 FADTGWTDIAATTAMTSVILEE---ILGYKTNIKL 61 (309) Q Consensus 30 ig~~~wte~~v~a~i~~~iLE~---~lGy~Ve~~~ 61 (309) ++...-+.+.+.++|.++.||+ .+||++++-. T Consensus 3 vaVTACptGIAHTymAAeaLe~aa~~~G~~ikVEt 37 (103) T pfam02379 3 VAVTACPTGIAHTYMAAEALEKAAKKLGWEVKVET 37 (103) T ss_pred EEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99955986087999999999999998799599995 No 147 >TIGR00829 FRU PTS system, Fru family, IIB component; InterPro: IPR003353 The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. The PTS catalyzes the phosphorylation of incoming sugar substrates concomitant with their translocation across the cell membrane. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred to enzyme-I (EI) of PTS which in turn transfers it to a phosphoryl carrier protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease which consists of at least three structurally distinct domains (IIA, IIB, and IIC) which can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). IIB (2.7.1.69 from EC) is is phosphorylated by phospho-IIA, before the phosphoryl group is transferred to the sugar substrate.; GO: 0005351 sugar:hydrogen ion symporter activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane. Probab=49.26 E-value=8.1 Score=18.59 Aligned_cols=45 Identities=20% Similarity=0.146 Sum_probs=28.7 Q ss_pred EECCCCCHHHHHHHHHHHHHHH---CCCCEEEEEECCHHHHHHHHHCC Q ss_conf 9425886999999999999961---06984899856405788885149 Q gi|254780342|r 30 FADTGWTDIAATTAMTSVILEE---ILGYKTNIKLLAVPVTFRSLKNK 74 (309) Q Consensus 30 ig~~~wte~~v~a~i~~~iLE~---~lGy~Ve~~~~~~~~~~~al~~G 74 (309) ||...=|.+.+++.|.++-||+ +.||+|++-..++-..-.+|..- T Consensus 1 vAvTACPTGiAHTFMAaEale~~Ak~~g~~vkVETQG~~G~~N~LT~~ 48 (85) T TIGR00829 1 VAVTACPTGIAHTFMAAEALEKAAKKRGWEVKVETQGSVGAQNALTAE 48 (85) T ss_pred CEECCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCHH T ss_conf 924348983003577789999999846997888852751876563867 No 148 >TIGR00787 dctP TRAP transporter solute receptor, DctP family; InterPro: IPR004682 TRAP-T family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-positive bacteria and archaea. Only one member of the family has been both sequenced and functionally characterised. This system is the DctPQM system of Rhodobacter capsulatus. DctP is a periplasmic dicarboxylate (malate, fumarate, succinate) binding receptor that is biochemically well-characterised.; GO: 0006810 transport, 0030288 outer membrane-bounded periplasmic space. Probab=48.87 E-value=19 Score=16.14 Aligned_cols=193 Identities=12% Similarity=0.048 Sum_probs=90.4 Q ss_pred CHHHHHHHHHHHHHHHCCCCEEEEEE------C--CHHHHHHHHHCCCCEEEEEEECCCCHHH-HHHH------------ Q ss_conf 69999999999999610698489985------6--4057888851498139998404641356-6665------------ Q gi|254780342|r 36 TDIAATTAMTSVILEEILGYKTNIKL------L--AVPVTFRSLKNKGIDIFMGYWYPSLEKF-IAPY------------ 94 (309) Q Consensus 36 te~~v~a~i~~~iLE~~lGy~Ve~~~------~--~~~~~~~al~~G~iD~~~e~w~p~~~~~-~~~~------------ 94 (309) +.....++.....+++.+.-++++.. + +-....++|+.|.||+..... +....+ .... T Consensus 9 ~p~~~AA~~Fa~lv~e~t~G~~~i~~~P~s~LgPa~D~~~~~~l~~G~~d~~~~s~-~~f~~la~p~~~v~dlPflF~d~ 87 (267) T TIGR00787 9 TPKHKAAEKFAKLVEEKTKGEIKISVFPSSQLGPASDREMLEALRGGALDMTAPSS-SKFTPLASPELAVLDLPFLFKDT 87 (267) T ss_pred CCHHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCHHHHHHHHHCCCEEEEECCC-CCCCCCCCCHHHHHCCCHHHHCH T ss_conf 71658999999999986588578997576347887769999984089548972576-75423330068771676243148 Q ss_pred --HHCCCEE-EECCCC---CCCE-EEEEEEHHHHHCCCCCHHHHHHHH---HHHHHCCCCCCCCCCCCHHHHHHHHHCCC Q ss_conf --5304702-301356---6853-567863446412345535655556---65442013578543200555876640354 Q gi|254780342|r 95 --LEEGSIK-LVAENL---QGAK-YMLAVNDVGFALGIKSYQDIAKYK---KELGAKIYGIEPGNEGNQRILDMINNNKF 164 (309) Q Consensus 95 --~~~~~~~-~~~~~~---~~~~-~~~~v~~~~~~~~i~~~~dl~~~~---~~~~~~~~g~~~~~~~~~~~~~~~~~~~l 164 (309) +.+...- ..|... .... .|+....+ .+.|.+.++.-+++. ++++|.. -+-..........+.-|- T Consensus 88 ~~~~Kvl~G~~vG~~L~~~~e~~~LG~~~L~~-w~nG~R~~t~~~~p~~~p~D~kGLK----iRi~~s~~~~~~~~~lGA 162 (267) T TIGR00787 88 NHVHKVLDGSEVGKELLKSLEKKNLGLKGLAY-WDNGFRQFTSSKKPVTKPEDLKGLK----IRIPASPVLEAQFKALGA 162 (267) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHH-HHCCHHHHHCCCCCCCCCHHHCCCC----CCCCCCHHHHHHHHHCCC T ss_conf 99998736658899999988876324000033-2331454221434124603325874----025986899999997089 Q ss_pred CCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHH-CCCC-EEECCCCCCCCCCHHHHHHHHCH-HHHHHCHHH Q ss_conf 52210010366520189999999659936999412422320-1585-14007865467786583544277-778748899 Q gi|254780342|r 165 SLKGFRLIEASELASFSQIRRDQRNNIPAVFLSWEPHPINS-DLNI-HYLPGGEEISGFGEASVYTVVRS-DYLDKCPNI 241 (309) Q Consensus 165 ~~~~~~~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~~~-~~dl-~~Ledp~~~~~~p~~~~~~vv~~-~~~~~~P~~ 241 (309) . +. .-.++++|.|+++|-++--..=.+.+.-. -|++ ++|--..| +|-++. + ++++ -+..--|++ T Consensus 163 ~-----P~----~~~F~EvY~ALq~gvVDgqENP~s~~~s~KlYEVqKyL~~tnH--~y~g~~-v-~~n~~~w~~L~~dl 229 (267) T TIGR00787 163 N-----PE----PMAFSEVYTALQTGVVDGQENPLSNIYSSKLYEVQKYLSLTNH--GYLGYL-V-VVNKAFWKSLPPDL 229 (267) T ss_pred C-----CC----CCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH--HHCCCC-E-EEEHHHHHCCCHHH T ss_conf 9-----84----1265899998754810100467277621021236434533103--032620-2-75036743187889 Q ss_pred HHHHHH Q ss_conf 999874 Q gi|254780342|r 242 SRLLKN 247 (309) Q Consensus 242 ~~~L~~ 247 (309) .+.++. T Consensus 230 ~~~~~~ 235 (267) T TIGR00787 230 QAVVKE 235 (267) T ss_pred HHHHHH T ss_conf 999999 No 149 >PRK10427 putative fructose-like phosphotransferase EIIB subunit 3; Provisional Probab=48.68 E-value=9.4 Score=18.19 Aligned_cols=34 Identities=24% Similarity=0.133 Sum_probs=27.1 Q ss_pred EEECCCCCHHHHHHHHHHHHHHH---CCCCEEEEEEC Q ss_conf 99425886999999999999961---06984899856 Q gi|254780342|r 29 RFADTGWTDIAATTAMTSVILEE---ILGYKTNIKLL 62 (309) Q Consensus 29 ~ig~~~wte~~v~a~i~~~iLE~---~lGy~Ve~~~~ 62 (309) -+|...-+.+.+.++|.++.||+ .+||++++-.. T Consensus 5 IVaVTACptGIAHTyMAAeaLe~aak~~G~~ikVETQ 41 (114) T PRK10427 5 LVAVTACVSGVAHTYMAAERLEKLCQLEKWGVKIETQ 41 (114) T ss_pred EEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEECC T ss_conf 8999538753989999999999999986996899748 No 150 >PRK10002 outer membrane protein F; Provisional Probab=48.46 E-value=20 Score=16.09 Aligned_cols=32 Identities=13% Similarity=0.097 Sum_probs=18.9 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC----CCCCCEEEEC Q ss_conf 9579999999999986303583----4588379942 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA----DSCTPVRFAD 32 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~----~~~~~I~ig~ 32 (309) |||.+++++++++++.+.+.++ .++.++.|+. T Consensus 2 MKK~~LA~ai~all~ag~a~AAeVYn~DGn~Ldl~G 37 (362) T PRK10002 2 MKRNILAVIVPALLVAGTANAAEIYNKDGNKVDLYG 37 (362) T ss_pred CHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEECC T ss_conf 308799999999986064417899858998898738 No 151 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=47.66 E-value=20 Score=16.02 Aligned_cols=22 Identities=23% Similarity=0.304 Sum_probs=12.7 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9579999999999986303583 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA 22 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~ 22 (309) |+|.+++.++++.+++++|.++ T Consensus 1 m~k~~~~~~l~~~l~lsgC~s~ 22 (160) T PRK09967 1 MIKHLVAPLIFTSLILTGCQSP 22 (160) T ss_pred CHHHHHHHHHHHHHHHHHCCCC T ss_conf 9078999999999999845899 No 152 >PRK10852 thiosulfate transporter subunit; Provisional Probab=47.34 E-value=20 Score=15.98 Aligned_cols=163 Identities=9% Similarity=-0.026 Sum_probs=73.0 Q ss_pred HHHHHHHHHHHHH--------HCCCCEEEEE--ECCHHHHHHHHHCC-CCEEEEEEECCCCHHHHHH--HHHCCCEEEEC Q ss_conf 9999999999996--------1069848998--56405788885149-8139998404641356666--55304702301 Q gi|254780342|r 38 IAATTAMTSVILE--------EILGYKTNIK--LLAVPVTFRSLKNK-GIDIFMGYWYPSLEKFIAP--YLEEGSIKLVA 104 (309) Q Consensus 38 ~~v~a~i~~~iLE--------~~lGy~Ve~~--~~~~~~~~~al~~G-~iD~~~e~w~p~~~~~~~~--~~~~~~~~~~~ 104 (309) .-++-++|.++.. +..|-+|++. .|++.-+-.++..| +-|+..=.-.+..+.+..+ ....+....+. T Consensus 33 Yd~tRely~~~n~~F~~~Wk~~~~g~~V~i~qShGGSg~QARaVi~GL~ADVVtlal~~Did~l~~~g~LI~~dW~~~lP 112 (338) T PRK10852 33 YDVSRELFAALNPPFEQQWAKDNGGDKLTIKQSHAGSSKQALAILQGLKADVVTYNQVTDVQILHDKGKLIPADWQSRLP 112 (338) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHCCCCCCEEEEECCCCHHHHHHCCCCCCCCHHHHCC T ss_conf 05479999999899999999756896189985478760578998478754468752543248898627867805465389 Q ss_pred CCCC--CCEEEEEEEHHHHHCCCCCHHHHHHHHHHH---HHCCCCCCCCCCCCHHHHHHHHHCCCC---------CCCEE Q ss_conf 3566--853567863446412345535655556654---420135785432005558766403545---------22100 Q gi|254780342|r 105 ENLQ--GAKYMLAVNDVGFALGIKSYQDIAKYKKEL---GAKIYGIEPGNEGNQRILDMINNNKFS---------LKGFR 170 (309) Q Consensus 105 ~~~~--~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~---~~~~~g~~~~~~~~~~~~~~~~~~~l~---------~~~~~ 170 (309) .+.. .....+.|++- .-.+|++|.||.+..-.. ..+.. +..+|..-..+....+.++-+ .-+.+ T Consensus 113 ~ns~pytStivFlVRkG-NPK~IkdW~DL~k~gV~VItPNPKTS-GgaRwnyLAAwg~a~~~~gg~e~~a~~fv~~l~~N 190 (338) T PRK10852 113 NNSSPFYSTMGFLVRKG-NPKNIHDWNDLVRSDVKLIFPNPKTS-GNARYTYLAAWGAADKADGGDKAKTEQFMTQFLKN 190 (338) T ss_pred CCCCCEEEEEEEEEECC-CCCCCCCHHHHCCCCCEEECCCCCCC-CCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC T ss_conf 99964466799998079-95668775774648978975799888-40769999999999986499989999999999836 Q ss_pred EECCCCCCHHHHHHHHHHCCCCEEEEEECCCHH Q ss_conf 103665201899999996599369994124223 Q gi|254780342|r 171 LIEASELASFSQIRRDQRNNIPAVFLSWEPHPI 203 (309) Q Consensus 171 ~~~~s~~~~~a~l~~A~~~~~~~v~~~wsp~~~ 203 (309) + +.-+.+--.....=+++|..+|...|+-... T Consensus 191 V-pVld~gaR~AT~tF~~rg~GDVLl~~ENEa~ 222 (338) T PRK10852 191 V-EVFDTGGRGATTTFAERGLGDVLISFESEVN 222 (338) T ss_pred C-CCCCCCCHHHHHHHHHCCCCCEEEEHHHHHH T ss_conf 8-5158971678899987386756752167899 No 153 >pfam09403 FadA Adhesion protein FadA. FadA (Fusobacterium adhesin A) is an adhesin which forms two alpha helices. Probab=47.06 E-value=12 Score=17.48 Aligned_cols=19 Identities=26% Similarity=0.207 Sum_probs=13.3 Q ss_pred CHHHHHHHHHHHHHHHHHC Q ss_conf 9579999999999986303 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYA 19 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~ 19 (309) |||+++++++++..++.++ T Consensus 1 MkK~ll~~~llls~~s~aa 19 (126) T pfam09403 1 MKKILLCSMLILSSLSYAA 19 (126) T ss_pred CHHHHHHHHHHHHHHHHHH T ss_conf 9169999999999999874 No 154 >TIGR03502 lipase_Pla1_cef extracellular lipase, Pla-1/cef family. Members of this protein family are bacterial lipoproteins largely from the Gammaproteobacteria. Characterized members are expressed in extracellularly and have esterase activity. Members include the lipase Pla-1 from Aeromonas hydrophila (AF092033) and CHO cell elongation factor (cef) from Vibrio hollisae Probab=46.88 E-value=19 Score=16.23 Aligned_cols=21 Identities=19% Similarity=0.251 Sum_probs=16.2 Q ss_pred CHHHHHHHHHHHHHHHHHCCC Q ss_conf 957999999999998630358 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARD 21 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~ 21 (309) |||+++.+++++.+.+.+|+. T Consensus 1 MKk~~l~~~iasal~LagCGg 21 (792) T TIGR03502 1 MKKLLLSLAIASALGLAGCGD 21 (792) T ss_pred CCHHHHHHHHHHHHHHHCCCC T ss_conf 933789999999875004578 No 155 >PRK12451 arginyl-tRNA synthetase; Reviewed Probab=46.80 E-value=21 Score=15.93 Aligned_cols=47 Identities=15% Similarity=0.093 Sum_probs=32.3 Q ss_pred CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEE--CCHHHHHHHH Q ss_conf 83458837994258869999999999999610698489985--6405788885 Q gi|254780342|r 21 DADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKL--LAVPVTFRSL 71 (309) Q Consensus 21 ~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~--~~~~~~~~al 71 (309) +|+-.+|.+||+ .=+.+++..++-+|+. .||+|.... +|.+.++.-+ T Consensus 121 SaNPtgpLHVGH---~R~aiiGdsLariL~~-~G~~V~reyYinD~G~Qi~~l 169 (562) T PRK12451 121 SPNIAKPFSMGH---LRSTMIGNSLKHIAEK-CGYEVVGINYIGDWGTQFGKL 169 (562) T ss_pred CCCCCCCCCHHH---HHHHHHHHHHHHHHHH-CCCEEEEEEEECCCCHHHHHH T ss_conf 899998862422---4627999999999998-698599999747730899999 No 156 >pfam12262 Lipase_bact_N Bacterial virulence factor lipase N-terminal. This domain family is found in bacteria, and is typically between 258 and 271 amino acids in length. There are two conserved sequence motifs: DGT and DGWST. This family is the N-terminal region of bacterial virulence factor lipase. The N-terminal region contains a potential signalling sequence. Probab=46.53 E-value=18 Score=16.31 Aligned_cols=22 Identities=18% Similarity=0.203 Sum_probs=16.0 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9579999999999986303583 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA 22 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~ 22 (309) |||+++.++++..+.+.+|+.. T Consensus 1 MKk~~l~~~i~SAL~LaGCg~~ 22 (269) T pfam12262 1 MKKKFLALLLASALLLAGCGDD 22 (269) T ss_pred CCHHHHHHHHHHHHHHCCCCCC T ss_conf 9247999999998641115799 No 157 >PRK09810 entericidin A; Provisional Probab=46.44 E-value=9.7 Score=18.08 Aligned_cols=29 Identities=10% Similarity=0.224 Sum_probs=18.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEE Q ss_conf 95799999999999863035834588379 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVR 29 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ 29 (309) |||++.+.++.+.+..++...-+.++.|+ T Consensus 2 mkrli~lil~~~~ll~gcntargfgedik 30 (41) T PRK09810 2 MKRLIVLVLLASTLLTGCNTARGFGEDIK 30 (41) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCHHHHH T ss_conf 38999999999999825621221217789 No 158 >PRK10954 periplasmic protein disulfide isomerase I; Provisional Probab=46.44 E-value=19 Score=16.26 Aligned_cols=30 Identities=13% Similarity=-0.122 Sum_probs=17.2 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEE Q ss_conf 957999999999998630358345883799 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRF 30 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~i 30 (309) |||++++++++++.+++.++.-.+++.-++ T Consensus 1 mkk~~~~~~~llls~~a~Aa~~~eG~~Y~~ 30 (207) T PRK10954 1 MKKIWLALAGMVLAFSASAAQFTDGKQYTT 30 (207) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 915999999999997722503548997558 No 159 >PRK11404 putative PTS transporter components IIBC; Provisional Probab=46.39 E-value=11 Score=17.64 Aligned_cols=32 Identities=28% Similarity=0.204 Sum_probs=26.5 Q ss_pred EECCCCCHHHHHHHHHHHHHHH---CCCCEEEEEE Q ss_conf 9425886999999999999961---0698489985 Q gi|254780342|r 30 FADTGWTDIAATTAMTSVILEE---ILGYKTNIKL 61 (309) Q Consensus 30 ig~~~wte~~v~a~i~~~iLE~---~lGy~Ve~~~ 61 (309) +|...-+.+.++++|.++-||+ ++||++++-. T Consensus 7 vAVTaCPTGIAHTYMAAeaL~~aak~~G~~ikVET 41 (482) T PRK11404 7 VAITNCPAGIAHTYMVAEALEQKARSLGHTIKVET 41 (482) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEC T ss_conf 99813862698999999999999998699689963 No 160 >PRK10449 heat-inducible protein; Provisional Probab=46.14 E-value=13 Score=17.28 Aligned_cols=20 Identities=15% Similarity=0.202 Sum_probs=10.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCC Q ss_conf 957999999999998630358 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARD 21 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~ 21 (309) |||++.++++.. ++.+++++ T Consensus 1 MKk~l~l~~~~l-lLagC~s~ 20 (140) T PRK10449 1 MKKVVALVALSL-LMAGCVSS 20 (140) T ss_pred CCHHHHHHHHHH-HHHHCCCC T ss_conf 903499999999-99752389 No 161 >cd05569 PTS_IIB_fructose PTS_IIB_fructose: subunit IIB of enzyme II (EII) of the fructose-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In this system, EII (also referred to as FruAB) is a fructose-specific permease made up of two proteins (FruA and FruB) each containing 3 domains. The FruA protein contains two tandem nonidentical IIB domains and a C-terminal IIC transmembrane domain. Both IIB domains of FruA are included in this alignment. The FruB protein (also referred to as diphosphoryl transfer protein) contains a IIA domain, a domain of unknown function, and an Hpr-like domain called FPr (fructose-inducible HPr). This familiy also includes the IIB domains of several fructose-like PTS permeases including the Frv permease encoded by the frvABXR operon, the Frw permease encoded by the frwACBD operon, the Frx permease encoded by the hrsA gene, and the Fry permease encoded by the fryABC (ypdDGH) operon. FruAB takes up exogenous fructose, releasing the 1-p Probab=46.07 E-value=12 Score=17.50 Aligned_cols=32 Identities=28% Similarity=0.292 Sum_probs=24.7 Q ss_pred EECCCCCHHHHHHHHHHHHHHH---CCCCEEEEEE Q ss_conf 9425886999999999999961---0698489985 Q gi|254780342|r 30 FADTGWTDIAATTAMTSVILEE---ILGYKTNIKL 61 (309) Q Consensus 30 ig~~~wte~~v~a~i~~~iLE~---~lGy~Ve~~~ 61 (309) ++...-+.+.+.++|.++.||+ .+||++.+-. T Consensus 2 vaVTaCptGiAHTymAAeaL~~aA~~~G~~ikVEt 36 (96) T cd05569 2 VAVTACPTGIAHTYMAAEALEKAAKKLGWEIKVET 36 (96) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 89921886098999999999999998799599994 No 162 >PRK04043 tolB translocation protein TolB; Provisional Probab=45.82 E-value=22 Score=15.83 Aligned_cols=52 Identities=15% Similarity=0.010 Sum_probs=24.8 Q ss_pred CHHHHHHHHHHHHHHHHHCC-----CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHC Q ss_conf 95799999999999863035-----83458837994258869999999999999610 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYAR-----DADSCTPVRFADTGWTDIAATTAMTSVILEEI 52 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~-----~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~ 52 (309) |||+++.+++++.++..-+. .....-+|-|....|.+..-+.+=+.+++... T Consensus 1 MKki~l~ll~~~~~~aae~~IeIt~~~~~~~~IAVvp~~~~~~~~l~~~i~~II~~D 57 (419) T PRK04043 1 MKKLVLFLLVSLGLFAVDATLDIVKKGQKLPKIVVEDSSDSNDANLKLKFFKILVND 57 (419) T ss_pred CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCEEEECCCCCCCCCHHHHHHHHHHHH T ss_conf 955778899997774504799999478787658996168677641368999999875 No 163 >PRK10780 periplasmic chaperone; Provisional Probab=44.94 E-value=18 Score=16.40 Aligned_cols=29 Identities=21% Similarity=0.064 Sum_probs=17.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCC Q ss_conf 95799999999999863035834588379942588 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGW 35 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~w 35 (309) |||++.++++.+.++.+ +.+. -+||..|. T Consensus 1 mKk~l~~~~l~l~l~~~--a~Aa----~KIgvVd~ 29 (161) T PRK10780 1 MKKWLLAAGLGLALATS--AQAA----DKIAIVNM 29 (161) T ss_pred CHHHHHHHHHHHHHHHH--HHHH----HCEEEEEH T ss_conf 95899999999985888--8775----35455329 No 164 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=44.68 E-value=22 Score=15.72 Aligned_cols=13 Identities=15% Similarity=0.279 Sum_probs=7.8 Q ss_pred CCCCEEECCCCCC Q ss_conf 1585140078654 Q gi|254780342|r 206 DLNIHYLPGGEEI 218 (309) Q Consensus 206 ~~dl~~Ledp~~~ 218 (309) .|.+..|.|.++. T Consensus 216 GyhIIk~~dk~~~ 228 (285) T PRK03002 216 GYHIIKLTDKKDL 228 (285) T ss_pred EEEEEEEEECCCC T ss_conf 1699998416889 No 165 >pfam07273 DUF1439 Protein of unknown function (DUF1439). This family consists of several hypothetical bacterial proteins of around 190 residues in length. Several members of this family are annotated as being putative lipoproteins and are often known as YceB. The function of this family is unknown. Probab=43.74 E-value=23 Score=15.63 Aligned_cols=45 Identities=20% Similarity=0.332 Sum_probs=25.2 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHH-HHHCCC Q ss_conf 957999999999998630358345883799425886999999999999-961069 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVI-LEEILG 54 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~i-LE~~lG 54 (309) ||+++.++++++++ +++|.. +..-+.||+.+..++-+.+ +|++.| T Consensus 1 Mk~~~~~~~~l~~~-LsgC~~--------ls~ysISE~Ein~yL~k~~~~~k~~g 46 (177) T pfam07273 1 MKKLLLLILLLVLL-LSGCAS--------LSQYSISEAEINQYLQKEVHLEKKVG 46 (177) T ss_pred CCHHHHHHHHHHHH-HHCCCC--------CCCEEECHHHHHHHHHHHCCHHHHHC T ss_conf 91489999999999-715575--------66252669999999987457778608 No 166 >PRK09465 tolC outer membrane channel protein; Reviewed Probab=43.40 E-value=17 Score=16.47 Aligned_cols=21 Identities=29% Similarity=0.273 Sum_probs=13.3 Q ss_pred CHHHHHHHHHHHHHHHHHCCC Q ss_conf 957999999999998630358 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARD 21 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~ 21 (309) ||||+.+++.+++.+++.++. T Consensus 1 mkki~~~l~~l~l~~~~~~a~ 21 (446) T PRK09465 1 MKKLLPLLIGLSLAGFSSASQ 21 (446) T ss_pred CCHHHHHHHHHHHHHCCHHCH T ss_conf 942999999999986131121 No 167 >PRK11233 nitrogen assimilation transcriptional regulator; Provisional Probab=42.40 E-value=24 Score=15.50 Aligned_cols=193 Identities=10% Similarity=-0.003 Sum_probs=95.9 Q ss_pred CCCCCEEEECCC-CCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEE Q ss_conf 458837994258-8699999999999996106984899856405788885149813999840464135666655304702 Q gi|254780342|r 23 DSCTPVRFADTG-WTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIK 101 (309) Q Consensus 23 ~~~~~I~ig~~~-wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~ 101 (309) ....+|+||... .....+.-.+++.+.+..=+.++++..+++..+.+.+.+|++|+..-+-.+.... +. T Consensus 88 ~~~G~l~ig~~p~~~~~~l~~p~l~~f~~~~P~v~v~l~~~~~~~l~~~l~~g~~Dlai~~~~~~~~~----------l~ 157 (305) T PRK11233 88 ALSGQVSIGLAPGTAASSITMPLLQAVRAEFPEIVIYLHENSGAVLNEKLINGQLDMAVIYEHSPVAG----------VS 157 (305) T ss_pred CCCCEEEEEEECCHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCEEEEEEECCCCCCC----------CE T ss_conf 87404999996215566536999999999789429997238879999998769847999827989888----------28 Q ss_pred EECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHH Q ss_conf 30135668535678634464123455356555566544201357854320055587664035452210010366520189 Q gi|254780342|r 102 LVAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFS 181 (309) Q Consensus 102 ~~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a 181 (309) .. ........++.+...+ ..-.++.++.+. ..+...++...+..........++.... .....+ .. T Consensus 158 ~~--~l~~~~l~lv~~~~~p-~~~i~l~~l~~~------~~i~~~~~~~~r~~~~~~~~~~~~~~~~-~~e~~~----~~ 223 (305) T PRK11233 158 SQ--ALLKEDLFLVGTQDCP-GQSVDLNAIAQM------NLFLPRDYSAVRLRVDEAFSLRRLTAKV-IGEIES----IA 223 (305) T ss_pred EE--EEEECCEEEEECCCCC-CCCCCHHHHCCC------CEEECCCCCHHHHHHHHHHHHCCCCCCE-EEEECC----HH T ss_conf 99--9985358999847788-787899998089------9898799984999999999977999847-999996----99 Q ss_pred HHHHHHHCCCCEEEEEECCCHHH----HCCCCEEE--CCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHHC Q ss_conf 99999965993699941242232----01585140--078654677865835442777787488999998740 Q gi|254780342|r 182 QIRRDQRNNIPAVFLSWEPHPIN----SDLNIHYL--PGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKNI 248 (309) Q Consensus 182 ~l~~A~~~~~~~v~~~wsp~~~~----~~~dl~~L--edp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~ 248 (309) .+.+++..|-.+.+ -|.... ...++..+ .+|. +.-.+..+.+++ ..-.|.+..|.+-+ T Consensus 224 ~i~~~V~~G~Giai---lP~~~~~~~~~~~~~~~~~~~~p~-----~~~~~~l~~~~~-r~lspaa~af~d~L 287 (305) T PRK11233 224 TLTAAIASGMGVTV---LPESAARSLCGAVNGWMSRITTPS-----MSLSLSLNLSAR-ANLSPQAQAVKEIL 287 (305) T ss_pred HHHHHHHHCCEEEE---EHHHHHHHHHHCCCEEEEECCCCC-----CEEEEEEEEECC-CCCCHHHHHHHHHH T ss_conf 99999997997898---719999987534976999259998-----665999999890-98899999999999 No 168 >pfam07882 Fucose_iso_N2 L-fucose isomerase, second N-terminal domain. The members of this family are similar to L-fucose isomerase expressed by E. coli (EC:5.3.1.3). This enzyme corresponds to glucose-6-phosphate isomerase in glycolysis, and converts an aldo-hexose to a ketose to prepare it for aldol cleavage. The enzyme is a hexamer, with each subunit being wedge-shaped and composed of three domains. Both domains 1 and 2 contain central parallel beta- sheets with surrounding alpha helices. The active centre is shared between pairs of subunits related along the molecular three-fold axis, with domains 2 and 3 from one subunit providing most of the substrate-contacting residues. Probab=42.27 E-value=24 Score=15.48 Aligned_cols=15 Identities=13% Similarity=0.381 Sum_probs=8.6 Q ss_pred HHHHCCCCEEEEEEC Q ss_conf 996106984899856 Q gi|254780342|r 48 ILEEILGYKTNIKLL 62 (309) Q Consensus 48 iLE~~lGy~Ve~~~~ 62 (309) ..++.+|..||.+.. T Consensus 22 f~q~ylGmrvE~VDm 36 (181) T pfam07882 22 FFQDYLGMRVEYVDM 36 (181) T ss_pred HHHHHHCCCCCEECH T ss_conf 999995992001229 No 169 >PRK02399 hypothetical protein; Provisional Probab=42.17 E-value=24 Score=15.47 Aligned_cols=59 Identities=14% Similarity=0.029 Sum_probs=37.5 Q ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEE--ECCHHHHHHHH-HCCCCEEEEEEECCCC Q ss_conf 83799425886999999999999961069848998--56405788885-1498139998404641 Q gi|254780342|r 26 TPVRFADTGWTDIAATTAMTSVILEEILGYKTNIK--LLAVPVTFRSL-KNKGIDIFMGYWYPSL 87 (309) Q Consensus 26 ~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~--~~~~~~~~~al-~~G~iD~~~e~w~p~~ 87 (309) ..|-+-+..-|...+ .-.+..||++ ||+|-.- .|...-.++.| ++|.+|...+.+.+.. T Consensus 186 p~I~~TmfGvTt~~v--~~~~~~Le~~-GyEv~VFHAtG~GG~aME~Li~~G~~~~VlDlTttE~ 247 (407) T PRK02399 186 PLIGLTMFGVTTPCV--QAARSELEAR-GYEVLVFHATGTGGRAMEKLADSGLIAGVLDLTTTEV 247 (407) T ss_pred CEEEEECCCCHHHHH--HHHHHHHHHC-CCEEEEECCCCCCHHHHHHHHHCCCEEEEEECCHHHH T ss_conf 668986058749999--9999999867-9708998368970789999997798059996644888 No 170 >PRK13883 conjugal transfer protein TrbH; Provisional Probab=41.27 E-value=25 Score=15.38 Aligned_cols=47 Identities=19% Similarity=0.174 Sum_probs=26.4 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 95799999999999863035834588379942588699999999999996 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILE 50 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE 50 (309) |||++.+++++..++. |+++..-+.=+. .... ..+++....++|+.+ T Consensus 1 MrK~~~~all~laL~g-CAtt~~YGNFv~-~~~~-~~q~lA~Dav~QL~~ 47 (156) T PRK13883 1 MRKILSLALLALALGG-CATTSQYGNFVQ-ASAA-DQQKLATDAVQQLAA 47 (156) T ss_pred CHHHHHHHHHHHHHHC-CCCCCCCCCCCC-CCHH-HHHHHHHHHHHHHHH T ss_conf 9268999999999703-313578877358-7403-548889999999998 No 171 >PRK00040 rpsP 30S ribosomal protein S16; Reviewed Probab=40.97 E-value=17 Score=16.49 Aligned_cols=27 Identities=19% Similarity=0.553 Sum_probs=20.4 Q ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHC Q ss_conf 99859999987547545677137999998744 Q gi|254780342|r 277 LRTHPDLLKNWLIGVTTFDGQDPSRQLERFMN 308 (309) Q Consensus 277 L~~n~~~~~~Wl~~~~~~~~~~~~~~~~~~~~ 308 (309) ++=|.+.+..||+ .|+++++.|++.++ T Consensus 48 ~~l~~eri~~Wl~-----~GAqpS~~V~~Llk 74 (75) T PRK00040 48 VKLDEERVLYWLG-----QGAQPTDTVARLLK 74 (75) T ss_pred EEECHHHHHHHHH-----CCCCCCHHHHHHHC T ss_conf 9984999999998-----78986899999863 No 172 >PRK05697 flagellar basal body-associated protein FliL-like protein; Validated Probab=40.02 E-value=22 Score=15.81 Aligned_cols=25 Identities=32% Similarity=0.384 Sum_probs=15.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9579999999999986303583458 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSC 25 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~ 25 (309) ||+++..+++++.++++..+.+.+. T Consensus 1 ~~~~~~~~l~l~~~~~~~~~~ae~~ 25 (135) T PRK05697 1 MKRLLAALLLLLSLLFSVAAAAEEE 25 (135) T ss_pred CHHHHHHHHHHHHHHCCCHHHHCCC T ss_conf 9259999999999744725443234 No 173 >TIGR02414 pepN_proteo aminopeptidase N; InterPro: IPR012779 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. The M1 family of zinc metallopeptidases contains a number of distinct, well-separated clades of proteins with aminopeptidase activity. Several are designated aminopeptidase N, 3.4.11.2 from EC, after the Escherichia coli enzyme, suggesting a similar activity profile (see P04825 from SWISSPROT for a description of catalytic activity). This family of zinc metallopeptidases belong to MEROPS peptidase family M1 (aminopeptidase N, clan MA); the majority are identified as alanyl aminopeptidases (proteobacteria) that are closely related to Escherichia coli PepN and presumed to have a similar (not identical) function. Nearly all are found in proteobacteria, but members are found also in cyanobacteria, plants, and apicomplexan parasites , . This family differs greatly in sequence from the family of aminopeptidases typified by Streptomyces lividans PepN (IPR012778 from INTERPRO) and from the membrane bound aminopeptidase N family in animals.; GO: 0004179 membrane alanyl aminopeptidase activity, 0008270 zinc ion binding. Probab=39.69 E-value=27 Score=15.23 Aligned_cols=71 Identities=13% Similarity=0.202 Sum_probs=51.8 Q ss_pred HCHHHHHHHH-HCC-C--CHHHHHHHHHHHHHCCCCHHH--H----HHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 4889999987-408-9--989999999999845999899--9----9999998599999875475456771379999987 Q gi|254780342|r 237 KCPNISRLLK-NIK-F--SVALENEMMKLILNNKQDRQF--V----GRTMLRTHPDLLKNWLIGVTTFDGQDPSRQLERF 306 (309) Q Consensus 237 ~~P~~~~~L~-~~~-l--s~~~~~~l~~~v~~~g~~~~~--v----A~~wL~~n~~~~~~Wl~~~~~~~~~~~~~~~~~~ 306 (309) +.|++.+... ++. = =|+.+++|...+..++...+. + .+.| ++.|=++|+|.+=..|+.. |.=+.||+. T Consensus 720 ~~~~~~~~~~~q~~~A~NMTD~~~ALs~l~~~~~~~~~~~~AL~~Fy~~W-~~dpLV~DkWF~~qAt~~~-d~Le~V~~L 797 (898) T TIGR02414 720 DDAEIRNLALEQFKSADNMTDRLAALSALVNAESDEREKAEALAAFYQKW-KDDPLVMDKWFALQATSPR-DTLERVKAL 797 (898) T ss_pred CCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH-CCCCHHEEHHEEEEECCCC-HHHHHHHHH T ss_conf 98889999999742688856899999998615887679999999999973-4484110000111314872-489999999 Q ss_pred HCC Q ss_conf 449 Q gi|254780342|r 307 MNN 309 (309) Q Consensus 307 ~~~ 309 (309) |+. T Consensus 798 ~~H 800 (898) T TIGR02414 798 LKH 800 (898) T ss_pred HCC T ss_conf 707 No 174 >pfam04076 BOF Bacterial OB fold (BOF) protein. Proteins in this family form an OB-fold. Analysis of the predicted binding site of BOF family proteins implies that they lack nucleic acid-binding properties. They contain an predicted N-terminal signal peptide which indicates that they localize in the periplasm where they may function to bind proteins, small molecules, or other typical OB-fold ligands. As hypothesized for the distantly related OB-fold containing bacterial enterotoxins, the loss of nucleotide-binding function and the rapid evolution of the BOF ligand-binding site may be associated with the presence of BOF proteins in mobile genetic elements and their potential role in bacterial pathogenicity. Probab=39.00 E-value=18 Score=16.32 Aligned_cols=56 Identities=11% Similarity=-0.030 Sum_probs=26.0 Q ss_pred CHHHHHHHHHHHHHHHHHCC-CCC-----CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 95799999999999863035-834-----588379942588699999999999996106984899 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYAR-DAD-----SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNI 59 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~-~~~-----~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~ 59 (309) |||++++++++++++.+.++ ..+ ...+..- .|++......-++++++..=+-.|.+ T Consensus 1 MKK~~i~~~~~l~s~~~~Aa~~gGf~~p~~~~~~qg---Gf~Gp~~~~~TV~~a~~~~Dd~~V~L 62 (126) T pfam04076 1 MKKLAIALASALASTSALAADQGGFSGPPPPHAQQG---GFQGPAPTKTTVKQAKSAADDAWVSL 62 (126) T ss_pred CCHHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCC---CCCCCCCCEEHHHHHHHCCCCCCEEE T ss_conf 956899999999846787976389889998744367---73487454883998865879993999 No 175 >COG1653 UgpB ABC-type sugar transport system, periplasmic component [Carbohydrate transport and metabolism] Probab=38.88 E-value=27 Score=15.15 Aligned_cols=57 Identities=12% Similarity=0.045 Sum_probs=36.4 Q ss_pred CCEEEECCCCCH--HHHHHHHHHHHHHHCCCCEEEEEECCH----HHHHHHHHCCCC-EEEEEE Q ss_conf 837994258869--999999999999610698489985640----578888514981-399984 Q gi|254780342|r 26 TPVRFADTGWTD--IAATTAMTSVILEEILGYKTNIKLLAV----PVTFRSLKNKGI-DIFMGY 82 (309) Q Consensus 26 ~~I~ig~~~wte--~~v~a~i~~~iLE~~lGy~Ve~~~~~~----~~~~~al~~G~i-D~~~e~ 82 (309) .+|++-.....+ ....-.++...-++.-|++|+....+. ..+..++.+|+. |++.-. T Consensus 31 ~~i~~~~~~~~~~~~~~~~~~i~~f~~~~p~ikV~~~~~~~~~~~~~l~~~~~ag~~pDv~~~~ 94 (433) T COG1653 31 VTITFWHSWTGGEEADALEELIKEFEKENPGIKVKVVNVPGDDYLQKLLTALASGDAPDVVQLD 94 (433) T ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCHHHHHHHHHHHHHCCCCCEEEEEC T ss_conf 4399998337873158999999999977798479998567588999999998579998199977 No 176 >pfam03032 Brevenin Brevenin/esculentin/gaegurin/rugosin family. This family contains a number of defence peptides secreted from the skin of amphibians, including the opiate-like dermorphins and deltorphins, and the antimicrobial dermoseptins and temporins. Probab=38.58 E-value=20 Score=16.03 Aligned_cols=20 Identities=35% Similarity=0.459 Sum_probs=17.7 Q ss_pred CHHHHHHHHHHHHHHHHHCC Q ss_conf 95799999999999863035 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYAR 20 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~ 20 (309) |||.++++++++++++|.|. T Consensus 3 lKKSlll~fFLG~islSlCe 22 (46) T pfam03032 3 LKKSLLLVLFLGLVSLSLCE 22 (46) T ss_pred CCHHHHHHHHHHHHHHHHHH T ss_conf 10538999998651476524 No 177 >PRK09915 putative outer membrane efflux protein MdtP; Provisional Probab=38.40 E-value=28 Score=15.10 Aligned_cols=54 Identities=17% Similarity=0.239 Sum_probs=28.4 Q ss_pred CHHHHHHHHHHHHHHHHHCCC-CCC--------------CCCEEEECCCCCHH--------HHHHHHHHHHHHHCCC Q ss_conf 957999999999998630358-345--------------88379942588699--------9999999999961069 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARD-ADS--------------CTPVRFADTGWTDI--------AATTAMTSVILEEILG 54 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~-~~~--------------~~~I~ig~~~wte~--------~v~a~i~~~iLE~~lG 54 (309) |||++++.++++.+++++|+. +.. .........+|+.. -.+..++.+.|+++.. T Consensus 6 ~~r~~~~~~l~~~~lLagC~~~p~~~~p~~~~~p~~~~~~~~~~~~~~~~~~~~WW~~F~Dp~L~~LI~~AL~~N~d 82 (488) T PRK09915 6 LSRLLLCSILGSTTLISGCALVRKDSAPHQQLKPEQIKLADDIHLASSGWPQAQWWKQLNDPQLDALIQRTLSGSHT 82 (488) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCHHHHHHHHHHHHCCHH T ss_conf 76999999999999975266899999999888964334466656677788726699867998799999999965986 No 178 >PRK13861 type IV secretion system protein VirB9; Provisional Probab=38.23 E-value=22 Score=15.80 Aligned_cols=24 Identities=8% Similarity=0.020 Sum_probs=12.2 Q ss_pred CCCCEEEEEECCHHHHHHHHHCCCC Q ss_conf 0698489985640578888514981 Q gi|254780342|r 52 ILGYKTNIKLLAVPVTFRSLKNKGI 76 (309) Q Consensus 52 ~lGy~Ve~~~~~~~~~~~al~~G~i 76 (309) ..||-+.++.++.+.+ ..+..|+- T Consensus 50 ~~G~~T~I~F~~dE~I-~~v~~Gd~ 73 (293) T PRK13861 50 AVGATLVVTFGANETV-TAVAVSNS 73 (293) T ss_pred ECCEEEEEEECCCCEE-EEECCCCC T ss_conf 8746999998899879-98615882 No 179 >cd07373 2A5CPDO_A The alpha subunit of the Class III extradiol dioxygenase, 2-amino-5-chlorophenol 1,6-dioxygenase, which catalyzes the oxidization and subsequent ring-opening of 2-amino-5-chlorophenol. 2-amino-5-chlorophenol 1,6-dioxygenase (2A5CPDO) catalyzes the oxidization and subsequent ring-opening of 2-amino-5-chlorophenol, which is an intermediate during p-chloronitrobenzene degradation. This enzyme is a member of the class III extradiol dioxygenase family, a group of enzymes which use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. The active enzyme is probably a heterotetramer, composed of two alpha and two beta subunits. The alpha and beta subunits share significant sequence similarity and may have evolved by gene duplication. This model describes the alpha subunit, which does not contain a potential metal binding site and may not possess catalytic activity. Probab=38.06 E-value=28 Score=15.07 Aligned_cols=22 Identities=9% Similarity=-0.207 Sum_probs=8.7 Q ss_pred HHHHHHC--CCCEEEEEECCCHHH Q ss_conf 9999965--993699941242232 Q gi|254780342|r 183 IRRDQRN--NIPAVFLSWEPHPIN 204 (309) Q Consensus 183 l~~A~~~--~~~~v~~~wsp~~~~ 204 (309) +.+|+++ +++.++-.=+=+.++ T Consensus 158 ~~~Ai~~s~krvvllASgsLSHr~ 181 (271) T cd07373 158 AADAAKDQNKRVAVVGVGGLSGSL 181 (271) T ss_pred HHHHHHHCCCCEEEEEECCHHHHH T ss_conf 999999829978999835324431 No 180 >PRK09818 putative kinase inhibitor; Provisional Probab=37.72 E-value=29 Score=15.03 Aligned_cols=74 Identities=8% Similarity=0.031 Sum_probs=38.5 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEE Q ss_conf 95799999999999863035834588379942588699999999999996106984899856405788885149813999 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFM 80 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~ 80 (309) |||+++.++++.+.++. +..+.++-+.+|.++..+..-+.. ..+|++ =...+-...|..+=.|---+.. T Consensus 1 ~~~~~~~~~~~~~~~~a------~a~~f~ltS~~f~dG~~ip~~~~~---~~~gc~--G~niSP~L~W~~~P~gTkSfAl 69 (183) T PRK09818 1 MKKLIVSSVLAFITFSA------QAAAFQVTSNEIKTGEQLTTSHVF---SGFGCE--GGNTSPSLTWSGAPEGTKSFAV 69 (183) T ss_pred CHHHHHHHHHHHHHCCC------CCCCEEEECCCCCCCCCCCHHHHC---CCCCCC--CCCCCCCEEECCCCCCCEEEEE T ss_conf 91389999999974435------568648878775794898876614---688888--8862877254489998768898 Q ss_pred EEECC Q ss_conf 84046 Q gi|254780342|r 81 GYWYP 85 (309) Q Consensus 81 e~w~p 85 (309) -...| T Consensus 70 ~~~DP 74 (183) T PRK09818 70 TVYDP 74 (183) T ss_pred EEECC T ss_conf 77679 No 181 >KOG0404 consensus Probab=37.59 E-value=29 Score=15.02 Aligned_cols=18 Identities=17% Similarity=0.134 Sum_probs=10.8 Q ss_pred HHHHHHCCCCEEEEEECC Q ss_conf 999996599369994124 Q gi|254780342|r 183 IRRDQRNNIPAVFLSWEP 200 (309) Q Consensus 183 l~~A~~~~~~~v~~~wsp 200 (309) .++|.++-...+.|.+.. T Consensus 199 q~ra~~npnI~v~~nt~~ 216 (322) T KOG0404 199 QQRAEKNPNIEVLYNTVA 216 (322) T ss_pred HHHHHCCCCEEEEECHHH T ss_conf 998753997699931122 No 182 >pfam09070 PFU PFU (PLAA family ubiquitin binding). This domain is found N terminal to pfam08324 and binds to ubiquitin. Probab=37.42 E-value=29 Score=15.00 Aligned_cols=18 Identities=6% Similarity=0.004 Sum_probs=12.5 Q ss_pred HCCCCHHHHHHHHHHHHH Q ss_conf 459998999999999859 Q gi|254780342|r 264 NNKQDRQFVGRTMLRTHP 281 (309) Q Consensus 264 ~~g~~~~~vA~~wL~~n~ 281 (309) +.+.+|.++|..||..|. T Consensus 75 N~~dnp~~~A~~F~~~~~ 92 (115) T pfam09070 75 NLGDNPYEAAEKFILRNE 92 (115) T ss_pred CCCCCHHHHHHHHHHHCC T ss_conf 089998999999999829 No 183 >pfam11940 DUF3458 Domain of unknown function (DUF3458). This presumed domain is functionally uncharacterized. This domain is found in bacteria, archaea and eukaryotes. This domain is typically between 402 to 419 amino acids in length. This domain is found associated with pfam01433. This domain has a conserved FSAPV sequence motif. Probab=37.16 E-value=29 Score=14.97 Aligned_cols=71 Identities=11% Similarity=0.186 Sum_probs=49.5 Q ss_pred CHHHHHHHH-HC---CCCHHHHHHHHHHHHHCCCCHHHHHHHHH---HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHC Q ss_conf 889999987-40---89989999999999845999899999999---9859999987547545677137999998744 Q gi|254780342|r 238 CPNISRLLK-NI---KFSVALENEMMKLILNNKQDRQFVGRTML---RTHPDLLKNWLIGVTTFDGQDPSRQLERFMN 308 (309) Q Consensus 238 ~P~~~~~L~-~~---~ls~~~~~~l~~~v~~~g~~~~~vA~~wL---~~n~~~~~~Wl~~~~~~~~~~~~~~~~~~~~ 308 (309) .|++.++.. ++ .==|+.+.+|...+..+....+++-.+|- ++++-.+++|.+-..+....+..+.|++-++ T Consensus 231 ~~~~~~la~~q~~~A~NMTD~laAL~~l~~~~~~~r~~~L~~Fy~~w~~d~LV~dkWF~~qA~~~~~~~l~~V~~L~~ 308 (408) T pfam11940 231 DPEALELALEQFEAADNMTDRLAALSALVNSDSPEREEALADFYERWKDDPLVMDKWFALQASSPRPDALERVKALMQ 308 (408) T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHC T ss_conf 878999999998637987599999999984898048999999999876693999999999843889988999999965 No 184 >pfam05170 AsmA AsmA family. The AsmA gene, whose product is involved in the assembly of outer membrane proteins in Escherichia coli. AsmA mutations were isolated as extragenic suppressors of an OmpF assembly mutant. AsmA may have a role in LPS biogenesis. Probab=37.09 E-value=29 Score=14.97 Aligned_cols=50 Identities=12% Similarity=0.027 Sum_probs=30.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 957999999999998630358345883799425886999999999999961069848998 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIK 60 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~ 60 (309) |||++..++.++.+..... .+...-|.-..+...|..++=+ .+|+++.+. T Consensus 1 Mk~~lk~l~~i~~~lv~~~---------~~l~~~~d~n~~k~~i~~~v~~-~TGr~l~i~ 50 (537) T pfam05170 1 MKKALKILLIILIVLLLLI---------IALIALFDPNYFKPTIQQKVSA-ASGRPLQID 50 (537) T ss_pred CHHHHHHHHHHHHHHHHHH---------HHHEEEECHHHHHHHHHHHHHH-HHCCEEEEC T ss_conf 9458999999999999999---------9860752889979999999999-889859994 No 185 >PRK11062 nhaR transcriptional activator NhaR; Provisional Probab=37.04 E-value=29 Score=14.96 Aligned_cols=197 Identities=11% Similarity=0.042 Sum_probs=99.8 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHCCCEEE Q ss_conf 45883799425886999999999999961069848998564057888851498139998404641356666553047023 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGYWYPSLEKFIAPYLEEGSIKL 102 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~w~p~~~~~~~~~~~~~~~~~ 102 (309) ....+++||..+-....+...+....+...=.....+..+.+..+.+.|..+++|+.... .|...... ..... T Consensus 90 ~~~~~lrvGi~~tl~k~~~~~lL~~~~~~~~~v~l~~~eg~~~~LL~~L~~~~LDlvLsd-~p~~~~~~------~~~~s 162 (296) T PRK11062 90 ESNLLFDVGVADALSKRLVSRVLLAAVPEDESIHLRCFESTHEMLLEQLSQHKLDMILSD-CPVDSTQQ------EGLFS 162 (296) T ss_pred CCCCEEEEEEHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCCEEEEC-CCCCCCCC------CCEEE T ss_conf 777059998524646989999999997228977999934987999999983898757606-88875568------86389 Q ss_pred ECCCCCCCEEEEEEEHHHHHCCCCCHHHHHHHHHHHHHC-CCCCCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHHH Q ss_conf 013566853567863446412345535655556654420-1357854320055587664035452210010366520189 Q gi|254780342|r 103 VAENLQGAKYMLAVNDVGFALGIKSYQDIAKYKKELGAK-IYGIEPGNEGNQRILDMINNNKFSLKGFRLIEASELASFS 181 (309) Q Consensus 103 ~~~~~~~~~~~~~v~~~~~~~~i~~~~dl~~~~~~~~~~-~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~s~~~~~a 181 (309) . .........++.++.. .. ++. ....+. .+-.......+.......+..++... ..-+-++.+|+. T Consensus 163 ~-~L~~~~v~~~~~~~l~-~~------~fP---~~L~~~plllp~~~S~lR~~ld~wf~~~gI~P~--Iv~E~dD~allk 229 (296) T PRK11062 163 K-KLGECGVSFFCTNPLP-EK------PFP---ACLEERRLLIPGRRTMLGRKLLNWFNSQGLNVE--ILGEFDDAALMK 229 (296) T ss_pred E-ECCCCCEEEEECCCCC-CC------CCH---HHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCCE--EEEEECCHHHHH T ss_conf 8-7466846898457444-58------795---775579840279987589999999996899844--999859899999 Q ss_pred HHHHHHHCCCCE-EEEEECCCHHHHCCCCEEECCCCCCCCCCHHHHHHHHCHHHHHHCHHHHHHHHH Q ss_conf 999999659936-999412422320158514007865467786583544277778748899999874 Q gi|254780342|r 182 QIRRDQRNNIPA-VFLSWEPHPINSDLNIHYLPGGEEISGFGEASVYTVVRSDYLDKCPNISRLLKN 247 (309) Q Consensus 182 ~l~~A~~~~~~~-v~~~wsp~~~~~~~dl~~Ledp~~~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~ 247 (309) .+ .+.|-.+ +.+...-+-....|.++.+..-.+.. -.++. +...-.-.||.+.++++. T Consensus 230 ~~---a~~G~Gva~~P~~v~~d~~~~~~l~~~g~~~~v~--e~fyA---It~~R~~~hP~v~~i~~~ 288 (296) T PRK11062 230 AF---GAYHDAIFVAPSIYAQDIYADHSVVEIGRVENVK--EEYHV---IFAERMIQHPAVQRICNT 288 (296) T ss_pred HH---HHHCCCEEECCHHHHHHHHHCCCCEEECCCCCCE--EEEEE---EECCCCCCCHHHHHHHHH T ss_conf 99---9819988987899999998579918944578842--67999---850446898699999999 No 186 >COG1445 FrwB Phosphotransferase system fructose-specific component IIB [Carbohydrate transport and metabolism] Probab=36.79 E-value=24 Score=15.48 Aligned_cols=32 Identities=28% Similarity=0.238 Sum_probs=26.2 Q ss_pred EECCCCCHHHHHHHHHHHHHHH---CCCCEEEEEE Q ss_conf 9425886999999999999961---0698489985 Q gi|254780342|r 30 FADTGWTDIAATTAMTSVILEE---ILGYKTNIKL 61 (309) Q Consensus 30 ig~~~wte~~v~a~i~~~iLE~---~lGy~Ve~~~ 61 (309) ++...-+.+.+.++|.++.||+ .+|+++++-. T Consensus 5 VAVTACPtGIAHTyMAAeaLe~~A~~~g~~IKVET 39 (122) T COG1445 5 VAVTACPTGIAHTYMAAEALEKAAKKLGVEIKVET 39 (122) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEC T ss_conf 99945872287899999999999998498699970 No 187 >cd00532 MGS-like MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase, which catalyzes the enolization of dihydroxyacetone phosphate (DHAP) to produce methylglyoxal. The family also includes the C-terminal domain in carbamoyl phosphate synthetase (CPS) where it catalyzes the last phosphorylation of a coaboxyphosphate intermediate to form the product carbamoyl phosphate and may also play a regulatory role. This family also includes inosine monophosphate cyclohydrolase. The known structures in this family show a common phosphate binding site. Probab=36.64 E-value=30 Score=14.92 Aligned_cols=41 Identities=17% Similarity=0.220 Sum_probs=28.1 Q ss_pred HHHHHHHCCCCEEEEEE----CCHHHHHHHHHC-CCCEEEEEEECCC Q ss_conf 99999610698489985----640578888514-9813999840464 Q gi|254780342|r 45 TSVILEEILGYKTNIKL----LAVPVTFRSLKN-KGIDIFMGYWYPS 86 (309) Q Consensus 45 ~~~iLE~~lGy~Ve~~~----~~~~~~~~al~~-G~iD~~~e~w~p~ 86 (309) .+..|+++ |++|+.+. ++.+.+...+.+ |+||+....+.+. T Consensus 34 Ta~~L~~~-Gi~~~~V~Ki~e~g~p~v~d~i~~~g~I~lVINt~~~~ 79 (112) T cd00532 34 TSRVLADA-GIPVRAVSKRHEDGEPTVDAAIAEKGKFDVVINLRDPR 79 (112) T ss_pred HHHHHHHC-CCEEEEEEECCCCCCCCHHHHHHCCCCEEEEEECCCCC T ss_conf 99999987-98348985048789988999986689679999888888 No 188 >COG5510 Predicted small secreted protein [Function unknown] Probab=36.46 E-value=24 Score=15.49 Aligned_cols=31 Identities=16% Similarity=0.222 Sum_probs=14.5 Q ss_pred CHH---HHHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 957---9999999999986303583458837994 Q gi|254780342|r 1 MYK---ILAVCLFLTTFSISYARDADSCTPVRFA 31 (309) Q Consensus 1 mkk---~~~~~~~~~~~~~~~~~~~~~~~~I~ig 31 (309) ||| +++++++.+++..++-...+.++.|.-+ T Consensus 2 mk~t~l~i~~vll~s~llaaCNT~rG~G~DIq~~ 35 (44) T COG5510 2 MKKTILLIALVLLASTLLAACNTMRGAGKDIQSG 35 (44) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 0489999999999999998866300223648888 No 189 >PRK11627 hypothetical protein; Provisional Probab=36.44 E-value=30 Score=14.90 Aligned_cols=28 Identities=18% Similarity=0.126 Sum_probs=13.7 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEC Q ss_conf 95799999999999863035834588379942 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFAD 32 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~ 32 (309) |||++.. +++++++.+|++.|. +|.+.- T Consensus 2 mkk~~~~-l~a~~lL~gCas~p~---~l~l~P 29 (192) T PRK11627 2 LKKILFP-LVALFMLAGCATPPT---TLEVSP 29 (192) T ss_pred HHHHHHH-HHHHHHHHHHCCCCC---EEEECC T ss_conf 2879999-999999986069976---378678 No 190 >PRK10064 colicin I receptor; Provisional Probab=36.19 E-value=29 Score=14.97 Aligned_cols=77 Identities=14% Similarity=0.089 Sum_probs=33.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHH--------------HHHH---HHHHHHHHHCCCCEEEEEEC-CHHHH Q ss_conf 999999999863035834588379942588699--------------9999---99999996106984899856-40578 Q gi|254780342|r 6 AVCLFLTTFSISYARDADSCTPVRFADTGWTDI--------------AATT---AMTSVILEEILGYKTNIKLL-AVPVT 67 (309) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~--------------~v~a---~i~~~iLE~~lGy~Ve~~~~-~~~~~ 67 (309) ++|++++.|+....+..+..++|++-....... .+.. .=+.++|+..-|..+.-..+ ...+. T Consensus 11 ~~~~~~~~~~~~~~a~~~~~e~vVVTatr~~~~~~~tp~sVtVIt~e~i~~~~~~~l~d~L~~~PGV~~~~~g~~~~~i~ 90 (663) T PRK10064 11 GLCLSAISCAWPVAAVDDDGETMVVTASSVEQNLKDAPASISVITQEDLQRKPVQNLKDVLKEVPGVQLTNEGDNRKGVS 90 (663) T ss_pred HHHHHHHHHCCCHHHCCCCCCEEEEECCCCCCCHHHCCCCEEEECHHHHHHHCCCCHHHHHHHCCCEEEECCCCCCCCEE T ss_conf 99999998636401212699839992358847878889757998999997608988999985289989957888766479 Q ss_pred HHHHHCCCCEEEEEE Q ss_conf 888514981399984 Q gi|254780342|r 68 FRSLKNKGIDIFMGY 82 (309) Q Consensus 68 ~~al~~G~iD~~~e~ 82 (309) ..++....+-++.+. T Consensus 91 iRG~~~~~~~vliDG 105 (663) T PRK10064 91 IRGLDSSYTLILVDG 105 (663) T ss_pred EECCCCCCCEEEECC T ss_conf 928788762899999 No 191 >pfam07437 YfaZ YfaZ precursor. This family contains the precursor of the bacterial protein YfaZ (approximately 180 residues long). Many members of this family are hypothetical proteins. Probab=35.48 E-value=26 Score=15.33 Aligned_cols=19 Identities=16% Similarity=0.030 Sum_probs=14.5 Q ss_pred CHHHHHHHHHHHHHHHHHC Q ss_conf 9579999999999986303 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYA 19 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~ 19 (309) |||++++|++.+++.++.+ T Consensus 1 mkk~~~~~a~~l~~~s~~A 19 (180) T pfam07437 1 MAKFRVASVALLLLVALSV 19 (180) T ss_pred CCHHHHHHHHHHHHHHHHC T ss_conf 9035899999998763210 No 192 >COG2143 Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones] Probab=35.36 E-value=27 Score=15.15 Aligned_cols=22 Identities=14% Similarity=0.428 Sum_probs=14.7 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9579999999999986303583 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA 22 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~ 22 (309) |+|++..++++.+++++.++.+ T Consensus 1 ~mRvl~i~Lliis~fl~a~~s~ 22 (182) T COG2143 1 VMRVLLIVLLIISLFLSACKSN 22 (182) T ss_pred CCCHHHHHHHHHHHHHHHHHCC T ss_conf 9001999999999999997178 No 193 >TIGR01283 nifE nitrogenase MoFe cofactor biosynthesis protein NifE; InterPro: IPR005973 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. ; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation. Probab=35.26 E-value=31 Score=14.78 Aligned_cols=48 Identities=13% Similarity=0.240 Sum_probs=23.4 Q ss_pred EECCCCCCHHHHHHHHHHCCCCEE----EEEECCCHHHHCCCCEEECCCCCCCC Q ss_conf 103665201899999996599369----99412422320158514007865467 Q gi|254780342|r 171 LIEASELASFSQIRRDQRNNIPAV----FLSWEPHPINSDLNIHYLPGGEEISG 220 (309) Q Consensus 171 ~~~~s~~~~~a~l~~A~~~~~~~v----~~~wsp~~~~~~~dl~~Ledp~~~~~ 220 (309) +-..++.+-..++..|=+..--+| ...+-+-+|-.+|+.-+++. .+|| T Consensus 238 ~~~~tGD~r~~ev~~aHrA~lN~v~CSks~~nlA~~m~~~YGIPyf~~--SFyG 289 (470) T TIGR01283 238 LATITGDSRYAEVQTAHRAKLNMVQCSKSMINLARKMEEKYGIPYFEV--SFYG 289 (470) T ss_pred EEECCCCCCHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHCCCCEEEE--CCCH T ss_conf 997378887899840411554335752367889999996468975872--1410 No 194 >TIGR00389 glyS_dimeric glycyl-tRNA synthetase; InterPro: IPR002315 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . In eubacteria, glycyl-tRNA synthetase (6.1.1.14 from EC) is an alpha2/beta2 tetramer composed of 2 different subunits , , . In some eubacteria, in archaea and eukaryota, glycyl-tRNA synthetase is an alpha2 dimer, this family. It belongs to class IIc and is one of the most complex synthetases. What is most interesting is the lack of similarity between the two types: divergence at the sequence level is so great that it is impossible to infer descent from common genes. The alpha (see IPR002310 from INTERPRO) and beta subunits (see IPR002311 from INTERPRO) also lack significant sequence similarity. However, they are translated from a single mRNA , and a single chain glycyl-tRNA synthetase from Chlamydia trachomatis has been found to have significant similarity with both domains, suggesting divergence from a single polypeptide chain . The sequence and crystal structure of the homodimeric glycyl-tRNA synthetase from Thermus thermophilus, shows that each monomer consists of an active site strongly resembling that of the aspartyl and seryl enzymes, a C-terminal anticodon recognition domain of 100 residues and a third domain unusually inserted between motifs 1 and 2 almost certainly interacting with the acceptor arm of tRNA(Gly). The C-terminal domain has a novel five-stranded parallel-antiparallel beta-sheet structure with three surrounding helices. The active site residues most probably responsible for substrate recognition, in particular in the Gly binding pocket, can be identified by inference from aspartyl-tRNA synthetase due to the conserved nature of the class II active site , .; GO: 0000166 nucleotide binding, 0004820 glycine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006426 glycyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=35.14 E-value=31 Score=14.77 Aligned_cols=37 Identities=16% Similarity=0.156 Sum_probs=24.2 Q ss_pred HHCHHHHHHCHHHHHHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 427777874889999987408998999999999984599 Q gi|254780342|r 229 VVRSDYLDKCPNISRLLKNIKFSVALENEMMKLILNNKQ 267 (309) Q Consensus 229 vv~~~~~~~~P~~~~~L~~~~ls~~~~~~l~~~v~~~g~ 267 (309) +.=+.|.++.+.+...|++ +|.+++.+....++.+|+ T Consensus 400 ~~G~~FrkdAk~ies~L~~--lse~dl~~~~~~l~~~G~ 436 (606) T TIGR00389 400 KLGPKFRKDAKKIESALEN--LSEDDLEEVEKELDEEGK 436 (606) T ss_pred CCCCHHHCHHHHHHHHHHH--CCHHHHHHHHHHHHCCCE T ss_conf 4684320117899999973--287478999998634986 No 195 >TIGR02052 MerP mercuric transport protein periplasmic component; InterPro: IPR011795 This entry represents the periplasmic mercury (II) binding protein of the bacterial mercury detoxification system which passes mercuric ion to the MerT transporter for subsequent reduction to Hg(0) by the mercuric reductase MerA. MerP contains a distinctive GMTCXXC motif associated with metal binding . MerP is related to a larger family of metal binding proteins.; GO: 0015097 mercury ion transmembrane transporter activity, 0045340 mercury ion binding, 0015694 mercury ion transport, 0046689 response to mercury ion, 0042597 periplasmic space. Probab=34.93 E-value=32 Score=14.75 Aligned_cols=33 Identities=21% Similarity=0.199 Sum_probs=20.4 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCC Q ss_conf 957999999999998630358345883799425886 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWT 36 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wt 36 (309) ||||..++.++++++...-+.+. ||++..+.-+ T Consensus 1 Mkkla~llaL~~l~s~pawAat~---TVTL~vPgMt 33 (92) T TIGR02052 1 MKKLAALLALAALVSLPAWAATQ---TVTLEVPGMT 33 (92) T ss_pred CHHHHHHHHHHHHHHHHHHHCCC---CEEEECCCCC T ss_conf 94789999999998511322276---3353068886 No 196 >PRK10290 superoxide dismutase; Provisional Probab=34.74 E-value=32 Score=14.73 Aligned_cols=21 Identities=19% Similarity=0.085 Sum_probs=15.3 Q ss_pred CHHHHHHHHHHHHHHHHHCCC Q ss_conf 957999999999998630358 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARD 21 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~ 21 (309) |||+++++++++++.+..++. T Consensus 1 mk~~~~~~~~l~~~~~~~a~~ 21 (173) T PRK10290 1 MKRFSLAILALVVCTGAQAAS 21 (173) T ss_pred CCHHHHHHHHHHHHHHHHHCC T ss_conf 910799999999976665144 No 197 >pfam08085 Entericidin Entericidin EcnA/B family. This family consists of the entericidin antidote/toxin peptides. The entericidin locus is activated in stationary phase under high osmolarity conditions by rho-S and simultaneously repressed by the osmoregulatory EnvZ/OmpR signal transduction pathway. The entericidin locus encodes tandem paralogous genes (ecnAB) and directs the synthesis of two small cell-envelope lipoproteins which can maintain plasmids in bacterial population by means of post-segregational killing. Probab=34.48 E-value=30 Score=14.91 Aligned_cols=15 Identities=33% Similarity=0.443 Sum_probs=9.0 Q ss_pred CHHHHHHHHHHHHHH Q ss_conf 957999999999998 Q gi|254780342|r 1 MYKILAVCLFLTTFS 15 (309) Q Consensus 1 mkk~~~~~~~~~~~~ 15 (309) |||++++++.+.+++ T Consensus 1 Mk~~~~~~~~l~~~~ 15 (42) T pfam08085 1 MKKLIALLLALLLLA 15 (42) T ss_pred CHHHHHHHHHHHHHH T ss_conf 923699999999999 No 198 >PHA01735 hypothetical protein Probab=34.45 E-value=32 Score=14.70 Aligned_cols=26 Identities=15% Similarity=0.211 Sum_probs=16.0 Q ss_pred HHHHHHHHHCCCCHHH--HHHHHHHHHH Q ss_conf 9999999845999899--9999999859 Q gi|254780342|r 256 NEMMKLILNNKQDRQF--VGRTMLRTHP 281 (309) Q Consensus 256 ~~l~~~v~~~g~~~~~--vA~~wL~~n~ 281 (309) .++..+|.-..-++.| ||++||+.|. T Consensus 19 ~ell~rIksGeAttaDL~aA~~~LKdN~ 46 (76) T PHA01735 19 NELLSRIKSGEATTADLRAACDWLKSND 46 (76) T ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHCC T ss_conf 9999986078887999999999999879 No 199 >PRK04922 tolB translocation protein TolB; Provisional Probab=34.43 E-value=32 Score=14.70 Aligned_cols=22 Identities=18% Similarity=0.280 Sum_probs=17.3 Q ss_pred HHHCCCCEEEEEECCCHHHHCC Q ss_conf 9965993699941242232015 Q gi|254780342|r 186 DQRNNIPAVFLSWEPHPINSDL 207 (309) Q Consensus 186 A~~~~~~~v~~~wsp~~~~~~~ 207 (309) -.+.+++++-+.|+||+.--.| T Consensus 199 lt~~~~~ilsP~wSPdg~~iaY 220 (439) T PRK04922 199 IVRSAEPLLSPNWSPDGKKLAY 220 (439) T ss_pred ECCCCCCEEEEEECCCCCEEEE T ss_conf 2268982684247689988999 No 200 >pfam07424 TrbM TrbM. This family contains the bacterial protein TrbM (approximately 180 residues long). In Comamonas testosteroni T-2, TrbM is derived from the IncP1beta plasmid pTSA, which encodes the widespread genes for p-toluenesulfonate (TSA) degradation. Probab=34.07 E-value=29 Score=14.97 Aligned_cols=18 Identities=28% Similarity=0.056 Sum_probs=10.4 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 957999999999998630 Q gi|254780342|r 1 MYKILAVCLFLTTFSISY 18 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~ 18 (309) |||++++++++..++.+. T Consensus 1 MKK~ll~~al~~~~~~s~ 18 (189) T pfam07424 1 MKKILLAVALAFGMFGSA 18 (189) T ss_pred CCHHHHHHHHHHHHHHCC T ss_conf 920179999999997147 No 201 >PRK10712 fructose-specific PTS system IIBC component; Provisional Probab=33.83 E-value=33 Score=14.63 Aligned_cols=33 Identities=21% Similarity=0.142 Sum_probs=27.6 Q ss_pred EEECCCCCHHHHHHHHHHHHHHH---CCCCEEEEEE Q ss_conf 99425886999999999999961---0698489985 Q gi|254780342|r 29 RFADTGWTDIAATTAMTSVILEE---ILGYKTNIKL 61 (309) Q Consensus 29 ~ig~~~wte~~v~a~i~~~iLE~---~lGy~Ve~~~ 61 (309) -++...-+.+.++++|.++-||+ ++||++++-. T Consensus 106 IVAVTACPTGIAHTYMAAEaL~~aak~~Gv~ikVET 141 (563) T PRK10712 106 VVAVTACPTGVAHTFMAAEAIETEAKKRGWWVKVET 141 (563) T ss_pred EEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEC T ss_conf 999815862598999999999999998699699963 No 202 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=33.22 E-value=27 Score=15.21 Aligned_cols=16 Identities=19% Similarity=0.171 Sum_probs=10.6 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9579999999999986 Q gi|254780342|r 1 MYKILAVCLFLTTFSI 16 (309) Q Consensus 1 mkk~~~~~~~~~~~~~ 16 (309) |||++++++++++..| T Consensus 1 M~r~~l~~~~l~L~gC 16 (224) T PRK12698 1 MARYILLALALLLAGC 16 (224) T ss_pred CHHHHHHHHHHHHHCC T ss_conf 9779999999998365 No 203 >TIGR02953 penta_MxKDx pentapeptide MXKDX repeat protein; InterPro: IPR014299 Members of this protein family are small bacterial proteins, each with an N-terminal signal sequence followed by up to 11 imperfect repeats of a pentapeptide. The pentapeptide repeat is usually of the form Met-Xaa-Lys-Asp-Xaa.. Probab=33.11 E-value=30 Score=14.93 Aligned_cols=22 Identities=18% Similarity=0.042 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHH---HHHHHCCCCC Q ss_conf 579999999999---9863035834 Q gi|254780342|r 2 YKILAVCLFLTT---FSISYARDAD 23 (309) Q Consensus 2 kk~~~~~~~~~~---~~~~~~~~~~ 23 (309) +||++++++++| +++++++.+. T Consensus 1 ~~i~~~~~~a~f~~~l~~ap~A~Aq 25 (89) T TIGR02953 1 KRIVLAISAAAFSAGLSLAPAAFAQ 25 (89) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 8799999999999998851055421 No 204 >COG3470 Tpd Uncharacterized protein probably involved in high-affinity Fe2+ transport [Inorganic ion transport and metabolism] Probab=33.10 E-value=32 Score=14.70 Aligned_cols=66 Identities=17% Similarity=0.092 Sum_probs=32.7 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEE---EEEECCHHHHHHHHHCCCCE Q ss_conf 957999999999998630358345883799425886999999999999961069848---99856405788885149813 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKT---NIKLLAVPVTFRSLKNKGID 77 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~V---e~~~~~~~~~~~al~~G~iD 77 (309) |+|+++.+++++.+++.++..+...+.+-||...- =. |+++ -+++.+-.+.=..+..-+-| T Consensus 3 ~~k~l~~~~~~a~v~s~~a~~~a~~~E~Pigep~e---------------vg-~meiaaVYlQpveMeP~G~~~~a~~aD 66 (179) T COG3470 3 MKKLLLSAAILASVFSAPAEGAAAFKEVPIGEPIE---------------VG-GMEIAAVYLQPVEMEPRGMQLPAAEAD 66 (179) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC---------------CC-CEEEEEEEEEEEECCCCCCCCCHHHCC T ss_conf 68999999999999741000023446346899732---------------18-748999996203034465798702136 Q ss_pred EEEEE Q ss_conf 99984 Q gi|254780342|r 78 IFMGY 82 (309) Q Consensus 78 ~~~e~ 82 (309) ++.|. T Consensus 67 iHlEA 71 (179) T COG3470 67 IHLEA 71 (179) T ss_pred CEEEE T ss_conf 33533 No 205 >COG0228 RpsP Ribosomal protein S16 [Translation, ribosomal structure and biogenesis] Probab=33.09 E-value=27 Score=15.23 Aligned_cols=16 Identities=13% Similarity=0.108 Sum_probs=7.5 Q ss_pred CCCCHHHHHHHHHHHH Q ss_conf 5999899999999985 Q gi|254780342|r 265 NKQDRQFVGRTMLRTH 280 (309) Q Consensus 265 ~g~~~~~vA~~wL~~n 280 (309) +|-.|.|-++..|++. T Consensus 62 ~GAqpSdtV~~ll~~~ 77 (87) T COG0228 62 QGAQPSDTVRRLLKKA 77 (87) T ss_pred CCCCCCHHHHHHHHHH T ss_conf 5997448999999985 No 206 >pfam06792 UPF0261 Uncharacterized protein family (UPF0261). Probab=32.66 E-value=34 Score=14.51 Aligned_cols=60 Identities=15% Similarity=0.101 Sum_probs=38.8 Q ss_pred CCC-EEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEE--ECCHHHHHHHH-HCCCCEEEEEEECCCC Q ss_conf 883-799425886999999999999961069848998--56405788885-1498139998404641 Q gi|254780342|r 25 CTP-VRFADTGWTDIAATTAMTSVILEEILGYKTNIK--LLAVPVTFRSL-KNKGIDIFMGYWYPSL 87 (309) Q Consensus 25 ~~~-I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~--~~~~~~~~~al-~~G~iD~~~e~w~p~~ 87 (309) .+| |-|-+..-|...+ ...+..||+. ||+|-.- .|...-.++.| .+|.+|...+.+.+.. T Consensus 184 ~kp~I~~TmfGvTt~~v--~~~~~~Le~~-GyEv~VFHAtG~GG~amE~Li~~G~~~~VlD~TttEl 247 (403) T pfam06792 184 EKPLIGITMFGVTTPCV--QAAREALEAR-GYEVLVFHATGTGGRAMESLIREGAFAGVLDLTTTEV 247 (403) T ss_pred CCCEEEEECCCCHHHHH--HHHHHHHHHC-CCEEEEECCCCCCHHHHHHHHHCCCEEEEEECCHHHH T ss_conf 88769997168749999--9999999866-9708998368970789999997798059996644888 No 207 >PRK09529 bifunctional acetyl-CoA decarbonylase/synthase complex subunit alpha/beta; Reviewed Probab=32.61 E-value=34 Score=14.51 Aligned_cols=71 Identities=17% Similarity=0.295 Sum_probs=47.4 Q ss_pred CHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCC----CEEEEEEECCCCHHHH------HHHHHCCCEEEECC Q ss_conf 6999999999999961069848998564057888851498----1399984046413566------66553047023013 Q gi|254780342|r 36 TDIAATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKG----IDIFMGYWYPSLEKFI------APYLEEGSIKLVAE 105 (309) Q Consensus 36 te~~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~----iD~~~e~w~p~~~~~~------~~~~~~~~~~~~~~ 105 (309) -++.-.-.|++.-||-+ |.+|+++..+-++.|...-.|+ -|+|+|.=+|...... ....+++++...|+ T Consensus 278 v~~~~~d~iv~~alE~R-giKi~vt~i~iPv~~gpafEGErIRk~dm~vE~gG~k~~~fELV~~~~~~eieDgki~viGP 356 (713) T PRK09529 278 LASPDYDKIVSKALEVR-GIKVTVTEIPIPVAFGPAFEGERVRKEDMYVEFGGGKTPAFELVRMAEMDEIEDGKVEVIGP 356 (713) T ss_pred CCCCCHHHHHHHHHHHC-CCEEEEEECCCCCCCCCCCCCCEECCCCEEEEECCCCCCCEEEEEECCCCCCCCCEEEEECC T ss_conf 14798899999888863-84689996688630265557634404774899389886517999963666454673799789 Q ss_pred CC Q ss_conf 56 Q gi|254780342|r 106 NL 107 (309) Q Consensus 106 ~~ 107 (309) .. T Consensus 357 di 358 (713) T PRK09529 357 DI 358 (713) T ss_pred CH T ss_conf 80 No 208 >PRK10554 outer membrane porin protein C; Provisional Probab=32.38 E-value=35 Score=14.48 Aligned_cols=31 Identities=13% Similarity=0.013 Sum_probs=18.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC----CCCCCEEEE Q ss_conf 9579999999999986303583----458837994 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA----DSCTPVRFA 31 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~----~~~~~I~ig 31 (309) |||.+++++++++++.+.+.++ .++.++.|+ T Consensus 1 MKk~~LA~~i~all~ag~a~AAeVYn~Dgn~Ldl~ 35 (373) T PRK10554 1 MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLY 35 (373) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEE T ss_conf 95408999999998607531799996799889763 No 209 >CHL00005 rps16 ribosomal protein S16 Probab=31.64 E-value=26 Score=15.35 Aligned_cols=24 Identities=13% Similarity=0.293 Sum_probs=14.7 Q ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHC Q ss_conf 59999987547545677137999998744 Q gi|254780342|r 280 HPDLLKNWLIGVTTFDGQDPSRQLERFMN 308 (309) Q Consensus 280 n~~~~~~Wl~~~~~~~~~~~~~~~~~~~~ 308 (309) |.+.+..||+ .|+|+.+.|++.++ T Consensus 49 ~~eri~~Wl~-----~GAqpT~tV~~Llk 72 (82) T CHL00005 49 NVPAILYFLE-----KGAQPTGTVYDILK 72 (82) T ss_pred CHHHHHHHHH-----CCCCCCHHHHHHHH T ss_conf 8999999999-----76988899999999 No 210 >PRK09558 ushA bifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor; Reviewed Probab=31.49 E-value=36 Score=14.39 Aligned_cols=34 Identities=15% Similarity=0.090 Sum_probs=15.8 Q ss_pred CHHHHHHHHHHHHHHHHHC-----CCCCCCCC--EEEECCC Q ss_conf 9579999999999986303-----58345883--7994258 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYA-----RDADSCTP--VRFADTG 34 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~-----~~~~~~~~--I~ig~~~ 34 (309) |||++..++.++++.+..+ +.+...+| |+|-+.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ltilhtn 41 (551) T PRK09558 1 MMKLLKGLVALALLAALALCGSTACAYEGDKTYKITVLHTN 41 (551) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEEC T ss_conf 92589999999999998740676644468985799999845 No 211 >pfam03888 MucB_RseB MucB/RseB family. Members of this family are regulators of the anti-sigma E protein RseD. Probab=30.66 E-value=37 Score=14.30 Aligned_cols=48 Identities=21% Similarity=0.245 Sum_probs=29.0 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 9579999999999986303583458837994258869999999999999610698489985 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIKL 61 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~~ 61 (309) |||+..++++++.++.+..+++.. +-...+..|....= .+.|+=..+. T Consensus 1 ~~~~~~~~~~~~~l~~~~~a~a~~-----------sa~~~L~~M~~A~~--~lnY~G~fvy 48 (314) T pfam03888 1 MKKLKSLTLLLGSLLASPTAQAED-----------SAQALLNRMNEASQ--QLSYEGSFVY 48 (314) T ss_pred CCHHHHHHHHHHHHHCCCCCCCCC-----------CHHHHHHHHHHHHH--HCCEEEEEEE T ss_conf 925887999999873573334667-----------89999999999887--5774799999 No 212 >PRK11372 lysozyme inhibitor; Provisional Probab=30.62 E-value=29 Score=15.00 Aligned_cols=18 Identities=33% Similarity=0.418 Sum_probs=13.9 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 957999999999998630 Q gi|254780342|r 1 MYKILAVCLFLTTFSISY 18 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~ 18 (309) |||++++++.+.+..|+. T Consensus 3 MKkll~~~~~llLsGCs~ 20 (109) T PRK11372 3 MKKLLIICLPVLLTGCSA 20 (109) T ss_pred HHHHHHHHHHHHHHCCCC T ss_conf 488999999999704543 No 213 >pfam01527 Transposase_8 Transposase. Transposase proteins are necessary for efficient DNA transposition. This family consists of various E. coli insertion elements and other bacterial transposases some of which are members of the IS3 family. Probab=30.30 E-value=38 Score=14.26 Aligned_cols=42 Identities=17% Similarity=0.233 Sum_probs=35.8 Q ss_pred CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 8998999999999984599989999999998599999875475 Q gi|254780342|r 249 KFSVALENEMMKLILNNKQDRQFVGRTMLRTHPDLLKNWLIGV 291 (309) Q Consensus 249 ~ls~~~~~~l~~~v~~~g~~~~~vA~~wL~~n~~~~~~Wl~~~ 291 (309) ++|.+.-.++...+...|.+..+||+++ -=|+.++-.|++-. T Consensus 6 rys~efK~~~V~~~~~~g~sv~~var~~-gi~~~~l~~W~k~~ 47 (75) T pfam01527 6 RYSEEFKARAVKESLEPGASVSELAREH-GVSPATLYKWRKKY 47 (75) T ss_pred CCCHHHHHHHHHHHHHCCCCHHHHHHHH-CCCHHHHHHHHHHH T ss_conf 8999999999999980998499999998-95999999999998 No 214 >PRK10747 putative protoheme IX biogenesis protein; Provisional Probab=28.78 E-value=40 Score=14.09 Aligned_cols=36 Identities=17% Similarity=0.102 Sum_probs=20.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCC Q ss_conf 957999999999998630358345883799425886 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWT 36 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wt 36 (309) |+|+++..++++...+.+.-..++.+=|-|...+|+ T Consensus 1 M~r~l~~~~~l~~g~~~g~~~~~~~GYVlI~~~~~~ 36 (398) T PRK10747 1 MLKVLLLFVLLIAGIVVGPMIAGHQGYVLIQTDNYN 36 (398) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCEE T ss_conf 906999999999999999998489986999889986 No 215 >PRK10833 putative assembly protein; Provisional Probab=28.73 E-value=40 Score=14.09 Aligned_cols=50 Identities=12% Similarity=0.094 Sum_probs=28.6 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 957999999999998630358345883799425886999999999999961069848998 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNIK 60 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~~ 60 (309) |||++..+.+++.+.+..... +. .-+.-.....+|..++=+ ++||++.+. T Consensus 1 MKrll~~l~illvvvv~gl~a------Lv---~lvdPNd~k~~I~~qV~~-~TGr~L~i~ 50 (617) T PRK10833 1 MRRFLTTLMILLVVLVAGLSA------LV---LLVNPNDFRAYMVKQVEA-RSGYQLQLD 50 (617) T ss_pred CCHHHHHHHHHHHHHHHHHHH------HH---EEECHHHHHHHHHHHHHH-HHCCEEEEC T ss_conf 941899999999999999998------81---504889979999999999-779879987 No 216 >PRK11548 hypothetical protein; Provisional Probab=28.61 E-value=40 Score=14.07 Aligned_cols=39 Identities=5% Similarity=-0.026 Sum_probs=15.4 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCHHH Q ss_conf 957999999999998630358345883799425886999 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDADSCTPVRFADTGWTDIA 39 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wte~~ 39 (309) ||++.++.++++++.++|+....--=.+.|-.-|.-++. T Consensus 3 ~k~l~~a~ll~~lllsgCS~~~~~vYk~dI~QGN~i~~~ 41 (113) T PRK11548 3 CKTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTPN 41 (113) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHH T ss_conf 578899999999998011578775657655756766999 No 217 >COG5567 Predicted small periplasmic lipoprotein [Cell motility and secretion] Probab=28.51 E-value=40 Score=14.06 Aligned_cols=20 Identities=20% Similarity=0.092 Sum_probs=13.3 Q ss_pred CHHHHHHHHHHHHHHHHHCC Q ss_conf 95799999999999863035 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYAR 20 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~ 20 (309) ||+.+...+++.++++..+| T Consensus 1 mk~~~~s~~ala~l~sLA~C 20 (58) T COG5567 1 MKNVFKSLLALATLFSLAGC 20 (58) T ss_pred CHHHHHHHHHHHHHHHHHHC T ss_conf 90489999999999999852 No 218 >pfam11853 DUF3373 Protein of unknown function (DUF3373). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length. Probab=28.42 E-value=40 Score=14.05 Aligned_cols=20 Identities=20% Similarity=0.384 Sum_probs=13.3 Q ss_pred CHHHHHHHHHHHHHHHHHCC Q ss_conf 95799999999999863035 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYAR 20 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~ 20 (309) |||++.+++++++++++..+ T Consensus 1 Mkk~~~l~~~aall~~s~~~ 20 (485) T pfam11853 1 MKKLVSLSLVAALLALSLPA 20 (485) T ss_pred CCHHHHHHHHHHHHHHCCCH T ss_conf 91147899999999750410 No 219 >COG4851 CamS Protein involved in sex pheromone biosynthesis [General function prediction only] Probab=28.37 E-value=41 Score=14.05 Aligned_cols=20 Identities=20% Similarity=0.140 Sum_probs=12.7 Q ss_pred CHHHHHHHHHHHHHHHHHCC Q ss_conf 95799999999999863035 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYAR 20 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~ 20 (309) |||.++.+++++.+++++|. T Consensus 1 Mkktl~i~~ta~vliLs~C~ 20 (382) T COG4851 1 MKKTLGIAATASVLILSGCF 20 (382) T ss_pred CCHHHHHHHHHHHHHHHHCC T ss_conf 94015589999999986116 No 220 >pfam08557 Lipid_DES Sphingolipid Delta4-desaturase (DES). Sphingolipids are important membrane signalling molecules involved in many different cellular functions in eukaryotes. Sphingolipid delta 4-desaturase catalyses the formation of (E)-sphing-4-enine. Some proteins in this family have bifunctional delta 4-desaturase/C-4-hydroxylase activity. Delta 4-desaturated sphingolipids may play a role in early signalling required for entry into meiotic and spermatid differentiation pathways during Drosophila spermatogenesis. This small domain associates with FA_desaturase pfam00487 and appears to be specific to sphingolipid delta 4-desaturase. Probab=28.35 E-value=32 Score=14.69 Aligned_cols=25 Identities=24% Similarity=0.331 Sum_probs=19.2 Q ss_pred CHHHHHHHHCHHHHHHCHHHHHHHH Q ss_conf 8658354427777874889999987 Q gi|254780342|r 222 GEASVYTVVRSDYLDKCPNISRLLK 246 (309) Q Consensus 222 p~~~~~~vv~~~~~~~~P~~~~~L~ 246 (309) ...+.+..-|+..+++||++.+++- T Consensus 10 ~t~ePH~~RRk~IL~kyPeIkkL~G 34 (39) T pfam08557 10 YTEEPHASRRKEILKKYPEIKKLMG 34 (39) T ss_pred CCCCCHHHHHHHHHHHCHHHHHHHC T ss_conf 7987349999999997978898708 No 221 >TIGR00838 argH argininosuccinate lyase; InterPro: IPR009049 This entry represents argininosuccinate lyase, but may include examples of avian delta crystallins, in which argininosuccinate lyase activity may or may not be present and the biological role is to provide the optically clear cellular protein of the eye lens. ; GO: 0004056 argininosuccinate lyase activity, 0042450 arginine biosynthetic process via ornithine. Probab=27.02 E-value=22 Score=15.73 Aligned_cols=47 Identities=15% Similarity=0.190 Sum_probs=41.8 Q ss_pred HHCHHHHHHCHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 42777787488999998740899899999999998459998999999 Q gi|254780342|r 229 VVRSDYLDKCPNISRLLKNIKFSVALENEMMKLILNNKQDRQFVGRT 275 (309) Q Consensus 229 vv~~~~~~~~P~~~~~L~~~~ls~~~~~~l~~~v~~~g~~~~~vA~~ 275 (309) ..|+|+.++.|.+++.|+.+..+-+-|+.|...+.++..-.++.|.+ T Consensus 315 aYNrDlQEdkE~LFD~v~t~~~~~~~~~~~l~~i~vn~er~~e~A~~ 361 (469) T TIGR00838 315 AYNRDLQEDKEPLFDAVKTVELSLEVMTGMLDTIKVNKERMEEAASA 361 (469) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHC T ss_conf 23510013783678899999999999999983081587899998644 No 222 >COG1840 AfuA ABC-type Fe3+ transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=26.43 E-value=44 Score=13.83 Aligned_cols=124 Identities=19% Similarity=0.089 Sum_probs=56.6 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHH--HCCC-----CCCCE-EEECCCCCCHHHHHHHHHHCCCCEEEE Q ss_conf 455356555566544201357854320055587664--0354-----52210-010366520189999999659936999 Q gi|254780342|r 125 IKSYQDIAKYKKELGAKIYGIEPGNEGNQRILDMIN--NNKF-----SLKGF-RLIEASELASFSQIRRDQRNNIPAVFL 196 (309) Q Consensus 125 i~~~~dl~~~~~~~~~~~~g~~~~~~~~~~~~~~~~--~~~l-----~~~~~-~~~~~s~~~~~a~l~~A~~~~~~~v~~ 196 (309) -++|.||.++.-.. ++....|...... ...... .+|- .+..+ .........-...+.+++.+|+..+++ T Consensus 104 P~~w~DL~~p~~kg--~i~~~~p~~s~~~-~~~~~~~~~~G~~~~~~~l~~l~~n~~~~~~~~~~~~~~~va~Ge~~vg~ 180 (299) T COG1840 104 PKSWADLAKPEYKG--KVQMADPTSSGTA-YAALLALQAYGEEKGWAYLKGLAANLATYTGGSSSVVAKVVAGGEAAVGL 180 (299) T ss_pred CCCHHHHHCHHHCC--CEEEECCCCCHHH-HHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCEEE T ss_conf 86799870988669--8465478855899-99999998648799999999999851334578828899997479864788 Q ss_pred EECCCHHHHCC--CC--EEECCCCC-CCCCCHHHHHHHHCHHHHHHCHHHHHHHHHCCCCHHHHH Q ss_conf 41242232015--85--14007865-467786583544277778748899999874089989999 Q gi|254780342|r 197 SWEPHPINSDL--NI--HYLPGGEE-ISGFGEASVYTVVRSDYLDKCPNISRLLKNIKFSVALEN 256 (309) Q Consensus 197 ~wsp~~~~~~~--dl--~~Ledp~~-~~~~p~~~~~~vv~~~~~~~~P~~~~~L~~~~ls~~~~~ 256 (309) .|.=.+..... +- +-+--|++ ... -...+..+.. ..||+.++.+=.+-+|.+-+. T Consensus 181 ~~~~~~~~~~~~~~~~~v~iv~P~~G~~v-~~~~vaiik~----a~~~e~Ak~fid~llS~egQ~ 240 (299) T COG1840 181 GNLYYGAYAKDKAKGAPVEVVYPEEGTGV-NPSGVALLKK----AKNPEAAKLFIDFLLSKEGQE 240 (299) T ss_pred EECCHHHHHHHHCCCCCEEEEECCCCCEE-EEEEEEEECC----CCCHHHHHHHHHHHCCHHHHH T ss_conf 72107999975146985699948988689-7543554348----999899999999980898899 No 223 >pfam08984 DUF1858 Domain of unknown function (DUF1858). This domain has no known function. It is found in various hypothetical bacterial proteins. Probab=26.35 E-value=44 Score=13.82 Aligned_cols=31 Identities=10% Similarity=0.269 Sum_probs=24.0 Q ss_pred HHHHHCHHHHHHHHHC---CCCHHHHHHHHHHHH Q ss_conf 7787488999998740---899899999999998 Q gi|254780342|r 233 DYLDKCPNISRLLKNI---KFSVALENEMMKLIL 263 (309) Q Consensus 233 ~~~~~~P~~~~~L~~~---~ls~~~~~~l~~~v~ 263 (309) ++.++||++.++|-.+ .|.+..|.+-..++- T Consensus 9 ~l~~~yPe~~~il~~~GF~~l~nP~m~~t~gr~~ 42 (59) T pfam08984 9 DLVERYPETIEILVELGFKPLANPAMRNTVGRVA 42 (59) T ss_pred HHHHHCHHHHHHHHHCCCHHCCCHHHHHHHHHHC T ss_conf 9998797899999986951106969998764455 No 224 >PRK13616 lipoprotein LpqB; Provisional Probab=25.85 E-value=45 Score=13.76 Aligned_cols=32 Identities=22% Similarity=0.202 Sum_probs=22.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCEEEECC Q ss_conf 57999999999998630358345883799425 Q gi|254780342|r 2 YKILAVCLFLTTFSISYARDADSCTPVRFADT 33 (309) Q Consensus 2 kk~~~~~~~~~~~~~~~~~~~~~~~~I~ig~~ 33 (309) +|.++++++.+.+..+|++.|.++.+-.|+.. T Consensus 6 ~~~~~~~~~~~~llaGCaslP~ss~pq~v~~~ 37 (590) T PRK13616 6 LKALAALLAVALLVAGCASLPSSSAPQAIGTV 37 (590) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 99999999999986120368999997233665 No 225 >cd00569 HTH_Hin_like Helix-turn-helix domain of Hin and related proteins, a family of DNA-binding domains unique to bacteria and represented by the Hin protein of Salmonella. The basic HTH domain is a simple fold comprised of three core helices that form a right-handed helical bundle. The principal DNA-protein interface is formed by the third helix, the recognition helix, inserting itself into the major groove of the DNA. A diverse array of HTH domains participate in a variety of functions that depend on their DNA-binding properties. HTH_Hin represents one of the simplest versions of the HTH domains; the characterization of homologous relationships between various sequence-diverse HTH domain families remains difficult. The Hin recombinase induces the site-specific inversion of a chromosomal DNA segment containing a promoter, which controls the alternate expression of two genes by reversibly switching orientation. The Hin recombinase consists of a single polypeptide chain containing a D Probab=25.85 E-value=45 Score=13.76 Aligned_cols=27 Identities=15% Similarity=0.207 Sum_probs=21.3 Q ss_pred CCCHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 8998999999999984599989999999 Q gi|254780342|r 249 KFSVALENEMMKLILNNKQDRQFVGRTM 276 (309) Q Consensus 249 ~ls~~~~~~l~~~v~~~g~~~~~vA~~w 276 (309) +||.+++.++-..++ +|.+..++|+.| T Consensus 5 ~lt~~q~~~ar~l~~-~G~~~~~iA~~~ 31 (42) T cd00569 5 KLTPEQIEEARRLLA-AGESVAEIARRL 31 (42) T ss_pred CCCHHHHHHHHHHHH-CCCCHHHHHHHH T ss_conf 699999999999999-789899999997 No 226 >pfam04959 ARS2 Arsenite-resistance protein 2. Arsenite is a carcinogenic compound which can act as a co-mutagen by inhibiting DNA repair. Arsenite-resistance protein 2 is thought to play a role in arsenite resistance. Probab=25.84 E-value=45 Score=13.76 Aligned_cols=27 Identities=15% Similarity=0.364 Sum_probs=12.9 Q ss_pred CCCCHHHHHHHHHHHHHHHHHH--HHCCC Q ss_conf 5999899999999985999998--75475 Q gi|254780342|r 265 NKQDRQFVGRTMLRTHPDLLKN--WLIGV 291 (309) Q Consensus 265 ~g~~~~~vA~~wL~~n~~~~~~--Wl~~~ 291 (309) ..+|+++.+++||++|-...++ ||-.. T Consensus 54 G~Kd~E~EVEkFi~aNTqEl~kdKWLCPL 82 (211) T pfam04959 54 GRKDPEQEVEKFVQANTQELAKDKWLCPL 82 (211) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEECCC T ss_conf 58888899999999857764122452467 No 227 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=25.25 E-value=46 Score=13.69 Aligned_cols=13 Identities=38% Similarity=0.370 Sum_probs=9.8 Q ss_pred CHHHHHHHHHHHH Q ss_conf 9579999999999 Q gi|254780342|r 1 MYKILAVCLFLTT 13 (309) Q Consensus 1 mkk~~~~~~~~~~ 13 (309) |||++.+++++.. T Consensus 1 Mkk~~~l~~~~l~ 13 (26) T pfam08139 1 MKKLLLLLLALLL 13 (26) T ss_pred CHHHHHHHHHHHH T ss_conf 9669999999999 No 228 >TIGR02791 VirB5 P-type DNA transfer protein VirB5; InterPro: IPR014158 This entry contains VirB5, a protein that is involved in the type IV DNA secretion systems typified by the Agrobacterium Ti plasmid vir system where it interacts with several other proteins essential for proper pilus formation . VirB5 is homologous to the IncN (N-type) conjugation system protein TraC as well as the P-type protein TrbJ and the F-type protein TraE .. Probab=25.24 E-value=46 Score=13.69 Aligned_cols=28 Identities=18% Similarity=0.133 Sum_probs=16.5 Q ss_pred HHHHHHHHHC--CCCHHHHHHHHHHHHHCC Q ss_conf 8999998740--899899999999998459 Q gi|254780342|r 239 PNISRLLKNI--KFSVALENEMMKLILNNK 266 (309) Q Consensus 239 P~~~~~L~~~--~ls~~~~~~l~~~v~~~g 266 (309) -+|-.+++++ .=+..++.+|-.++.+|. T Consensus 174 ~~I~~L~~~iN~~~DpK~iAdLQ~RiqvE~ 203 (233) T TIGR02791 174 SNIEQLRQKINEAGDPKAIADLQARIQVEQ 203 (233) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 999999987248998578999988899999 No 229 >cd05805 MPG1_transferase GTP-mannose-1-phosphate guanyltransferase (MPG1 transferase), also known as GDP-mannose pyrophosphorylase, is a bifunctional enzyme with both phosphomannose isomerase (PMI) activity and GDP-mannose phosphorylase (GMP) activity. The protein contains an N-terminal NTP transferase domain, an L-beta-H domain, and a C-terminal PGM-like domain that belongs to the alpha-D-phosphohexomutase superfamily. This subfamily is limited to bacteria and archaea. The alpha-D-phosphohexomutases include several related enzymes that catalyze a reversible intramolecular phosphoryl transfer on their sugar substrates. Members of this group appear to lack conserved residues necessary for metal binding and catalytic activity. Other members of this superfamily include the phosphoglucomutases (PGM1 and PGM2), phosphoglucosamine mutase (PNGM), phosphoacetylglucosamine mutase (PAGM), the bacterial phosphomannomutase ManB, the bacterial phosphoglucosamine mutase GlmM, and the bifunctional Probab=25.17 E-value=46 Score=13.68 Aligned_cols=58 Identities=12% Similarity=-0.006 Sum_probs=39.1 Q ss_pred CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEE-EECCHHHHHHHHHCCCCEEEEEE Q ss_conf 588379942588699999999999996106984899-85640578888514981399984 Q gi|254780342|r 24 SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNI-KLLAVPVTFRSLKNKGIDIFMGY 82 (309) Q Consensus 24 ~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~-~~~~~~~~~~al~~G~iD~~~e~ 82 (309) .+.+|.||.-.=.++..+++.+...|-.. |++|-. -...|+.++-++.....|.-.-. T Consensus 33 ~~~~V~VG~D~R~ss~~l~~a~~~gl~~~-G~~V~~~g~~pTP~~~~a~~~~~~~~GI~I 91 (441) T cd05805 33 PGSTVTVSRDASRASRMLKRALISGLLST-GVNVRDLGALPLPVARYAIRFLGASGGIHV 91 (441) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHC-CCEEEEECCCCHHHHHHHHHHCCCCEEEEE T ss_conf 99989999689855899999999999986-994999078771899999985599838998 No 230 >PRK13916 plasmid segregation protein ParR; Provisional Probab=25.11 E-value=46 Score=13.67 Aligned_cols=35 Identities=14% Similarity=0.293 Sum_probs=23.0 Q ss_pred HHHCHHHHHHHHHC---CCCHHHHHHHHHHHHHCCCCH Q ss_conf 87488999998740---899899999999998459998 Q gi|254780342|r 235 LDKCPNISRLLKNI---KFSVALENEMMKLILNNKQDR 269 (309) Q Consensus 235 ~~~~P~~~~~L~~~---~ls~~~~~~l~~~v~~~g~~~ 269 (309) .++||++++||.++ +.++-.-..|...|+.-|.++ T Consensus 14 r~~~~~iF~FL~~~P~GT~~~~iR~~L~rYI~~~G~~~ 51 (97) T PRK13916 14 KEDYPQIFDFLENVPRGTKTAHIREALRRYIEEIGENP 51 (97) T ss_pred HCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 10138999999977788741799999999999557899 No 231 >COG5455 Predicted integral membrane protein [Function unknown] Probab=25.08 E-value=46 Score=13.67 Aligned_cols=20 Identities=15% Similarity=0.276 Sum_probs=12.1 Q ss_pred CHHHHHHHHHHHHHHHHHCC Q ss_conf 95799999999999863035 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYAR 20 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~ 20 (309) |||+++++++.++.....+. T Consensus 2 mkrlv~~lla~s~~a~p~a~ 21 (129) T COG5455 2 MKRLVSALLAVSSVAAPMAL 21 (129) T ss_pred CHHHHHHHHHHHHHHCCHHH T ss_conf 14389999999985051110 No 232 >PRK11443 hypothetical protein; Provisional Probab=25.04 E-value=45 Score=13.74 Aligned_cols=16 Identities=31% Similarity=0.193 Sum_probs=11.2 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9579999999999986 Q gi|254780342|r 1 MYKILAVCLFLTTFSI 16 (309) Q Consensus 1 mkk~~~~~~~~~~~~~ 16 (309) ||+++++++++++..| T Consensus 1 Mk~~~~~~~~lll~GC 16 (120) T PRK11443 1 MKKFIAPLLALLVSGC 16 (120) T ss_pred CHHHHHHHHHHHHHCC T ss_conf 9158999999999666 No 233 >TIGR02294 nickel_nikA nickel ABC transporter, periplasmic nickel-binding protein; InterPro: IPR011980 Members of this family are periplasmic nickel-binding proteins of nickel ABC transporters. Nickel is bound specifically, albeit weakly, through water molecules positioned in the binding site. The amino acids whose side chains line the binding site include Tyr-44, Met-49, Trp-122, Arg-159, Trp-420, and Tyr-424 (numbering based on the precursor sequence of Escherichia coli NikA) with the Arg contributing a hydrogen bond indirectly through a water molecule .. Probab=24.55 E-value=47 Score=13.60 Aligned_cols=25 Identities=12% Similarity=0.389 Sum_probs=16.0 Q ss_pred EEEEEC-CHHHHHHHHHCCCCEEEEE Q ss_conf 899856-4057888851498139998 Q gi|254780342|r 57 TNIKLL-AVPVTFRSLKNKGIDIFMG 81 (309) Q Consensus 57 Ve~~~~-~~~~~~~al~~G~iD~~~e 81 (309) |..+-. +.+...=|+++||||+.-. T Consensus 200 V~vKVIPD~~TR~LAfe~Ge~dLi~G 225 (513) T TIGR02294 200 VTVKVIPDAETRALAFESGEVDLIFG 225 (513) T ss_pred EEEEEECCHHHHHHHHHCCCCCEEEC T ss_conf 37887367336888763186102323 No 234 >pfam03562 MltA MltA specific insert domain. This beta barrel domain is found inserted in the MltA a murein degrading transglycosylase enzyme. This domain may be involved in peptidoglycan binding. Probab=24.45 E-value=48 Score=13.59 Aligned_cols=18 Identities=33% Similarity=0.451 Sum_probs=12.3 Q ss_pred HHHHHHHHHHHHHHHHHC Q ss_conf 999999985999998754 Q gi|254780342|r 272 VGRTMLRTHPDLLKNWLI 289 (309) Q Consensus 272 vA~~wL~~n~~~~~~Wl~ 289 (309) .-++||++||+..++-|. T Consensus 87 ~Ik~wl~~nP~~~~~~l~ 104 (113) T pfam03562 87 AIRAWLRANPDRVDELLE 104 (113) T ss_pred HHHHHHHHCHHHHHHHHH T ss_conf 999999989899999997 No 235 >PRK07369 dihydroorotase; Provisional Probab=23.83 E-value=46 Score=13.69 Aligned_cols=36 Identities=14% Similarity=0.121 Sum_probs=23.4 Q ss_pred CCHHHHHHHHH--HHHHCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 99899999999--99845999899999999985999998 Q gi|254780342|r 250 FSVALENEMMK--LILNNKQDRQFVGRTMLRTHPDLLKN 286 (309) Q Consensus 250 ls~~~~~~l~~--~v~~~g~~~~~vA~~wL~~n~~~~~~ 286 (309) -..+++-.+.+ .|...+.+..+++ +.+..||.+.=. T Consensus 329 ~Gve~~lpll~~~~V~~g~lsl~~~v-~~~s~nPAki~g 366 (419) T PRK07369 329 IGLELALPLLWQNLVETGELSALQLW-QALSTNPARCLG 366 (419) T ss_pred CCHHHHHHHHHHHHHHCCCCCHHHHH-HHHHHHHHHHHC T ss_conf 43887999999999874998999999-997778999839 No 236 >cd05803 PGM_like4 This PGM-like (phosphoglucomutase-like) domain is located C-terminal to a mannose-1-phosphate guanyltransferase domain in a protein of unknown function that is found in both prokaryotes and eukaryotes. This domain belongs to the alpha-D-phosphohexomutase superfamily which includes several related enzymes that catalyze a reversible intramolecular phosphoryl transfer on their sugar substrates. Members of this superfamily include the phosphoglucomutases (PGM1 and PGM2), phosphoglucosamine mutase (PNGM), phosphoacetylglucosamine mutase (PAGM), the bacterial phosphomannomutase ManB, the bacterial phosphoglucosamine mutase GlmM, and the bifunctional phosphomannomutase/phosphoglucomutase (PMM/PGM). Each of these enzymes has four domains with a centrally located active site formed by four loops, one from each domain. All four domains are included in this alignment model. Probab=23.67 E-value=49 Score=13.50 Aligned_cols=54 Identities=7% Similarity=-0.031 Sum_probs=39.0 Q ss_pred CCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEE-EECCHHHHHHHHHCCCCEE Q ss_conf 588379942588699999999999996106984899-8564057888851498139 Q gi|254780342|r 24 SCTPVRFADTGWTDIAATTAMTSVILEEILGYKTNI-KLLAVPVTFRSLKNKGIDI 78 (309) Q Consensus 24 ~~~~I~ig~~~wte~~v~a~i~~~iLE~~lGy~Ve~-~~~~~~~~~~al~~G~iD~ 78 (309) ..++|.||.-.=.++...++.+...|... |++|-. -...|+.+.-+++.-..+. T Consensus 36 ~~~~VvVg~D~R~~s~~~~~~~a~gl~s~-G~~V~~~g~~pTP~~~~~v~~~~~~~ 90 (445) T cd05803 36 KGGKIVVGRDGRPSGPMLEKIVIGALLAC-GCDVIDLGIAPTPTVQVLVRQSQASG 90 (445) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHC-CCEEEECCCCCHHHHHHHHHHCCCCE T ss_conf 99849999699866899999999999977-98499889887389999998559985 No 237 >PRK06760 hypothetical protein; Provisional Probab=23.64 E-value=49 Score=13.49 Aligned_cols=57 Identities=14% Similarity=0.068 Sum_probs=30.8 Q ss_pred CHHHHHHHHH--HHHHHHHHCC-----CCCCCCCEEEECCCCCHHH--HHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 9579999999--9999863035-----8345883799425886999--99999999996106984899856 Q gi|254780342|r 1 MYKILAVCLF--LTTFSISYAR-----DADSCTPVRFADTGWTDIA--ATTAMTSVILEEILGYKTNIKLL 62 (309) Q Consensus 1 mkk~~~~~~~--~~~~~~~~~~-----~~~~~~~I~ig~~~wte~~--v~a~i~~~iLE~~lGy~Ve~~~~ 62 (309) |||++..+++ +++.++++|+ .|+.+- +-|| +|+. -+.+-|+...+++.-|+|+.... T Consensus 1 MKK~l~i~i~~iis~~~lsaCS~~~~~~PANGv-liiG----dE~~~~~I~d~yK~~tke~~~y~vK~g~~ 66 (223) T PRK06760 1 MKKTLTIFMLTILLLISFSACSKKENSFPANGV-LIIG----DENNISPIINRYQEITKENEVFSVKKGEV 66 (223) T ss_pred CCEEEEEHHHHHHHHHHHHHCCCCCCCCCCCEE-EEEE----CHHHHHHHHHHHHHHHHHHHEEEEEEEEE T ss_conf 950676499999999984003367878987437-9996----51677889998877643432046675137 No 238 >KOG2892 consensus Probab=23.60 E-value=49 Score=13.49 Aligned_cols=60 Identities=13% Similarity=0.052 Sum_probs=42.7 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHCC-CCEEEEEECC---HHHHH-----------------HHHHCCCCEEEEE Q ss_conf 4588379942588699999999999996106-9848998564---05788-----------------8851498139998 Q gi|254780342|r 23 DSCTPVRFADTGWTDIAATTAMTSVILEEIL-GYKTNIKLLA---VPVTF-----------------RSLKNKGIDIFMG 81 (309) Q Consensus 23 ~~~~~I~ig~~~wte~~v~a~i~~~iLE~~l-Gy~Ve~~~~~---~~~~~-----------------~al~~G~iD~~~e 81 (309) .+..+|+||+-...=..|.++.+..-||+.+ +++-++-..+ ..+++ .+|.+|++|+.+. T Consensus 3 ~~~~~irIGtRKSkLAvIQs~~v~~~Lek~YP~l~f~I~t~~T~GDkIl~k~L~~ig~KsLfTkELE~aL~~~~~divVH 82 (320) T KOG2892 3 PRTAVIRIGTRKSKLAVIQSYHVREKLEKKYPELAFEIITMSTTGDKILSKPLAKIGGKSLFTKELEDALINGHVDIVVH 82 (320) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHCHHHHHCCCCHHHHHHHHHHHCCCCCEEEE T ss_conf 86548971476661012307999999986598863579983143367761447662654100788999985398528987 Q ss_pred E Q ss_conf 4 Q gi|254780342|r 82 Y 82 (309) Q Consensus 82 ~ 82 (309) . T Consensus 83 S 83 (320) T KOG2892 83 S 83 (320) T ss_pred E T ss_conf 5 No 239 >COG3417 FlgN Collagen-binding surface adhesin SpaP (antigen I/II family) [General function prediction only] Probab=23.49 E-value=50 Score=13.47 Aligned_cols=23 Identities=26% Similarity=0.317 Sum_probs=12.2 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 95799999999999863035834 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDAD 23 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~ 23 (309) |+|+++++++++++..+|++.++ T Consensus 3 k~k~~~~il~~al~l~GCs~~~a 25 (200) T COG3417 3 KMKIYASILLLALFLSGCSSEPA 25 (200) T ss_pred HHHHHHHHHHHHHHHHHCCCCCC T ss_conf 57999999999999841136778 No 240 >pfam07219 HemY_N HemY protein N-terminus. This family represents the N-terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. This is a membrane protein involved in a late step of protoheme IX synthesis. Probab=23.17 E-value=50 Score=13.43 Aligned_cols=36 Identities=17% Similarity=0.174 Sum_probs=18.3 Q ss_pred CHHHHHHHHHHHHHHH-HHCCCCCCCCCEEEECCCCC Q ss_conf 9579999999999986-30358345883799425886 Q gi|254780342|r 1 MYKILAVCLFLTTFSI-SYARDADSCTPVRFADTGWT 36 (309) Q Consensus 1 mkk~~~~~~~~~~~~~-~~~~~~~~~~~I~ig~~~wt 36 (309) |+|+++.++++..+.. .+.-..++...|.|...+|+ T Consensus 1 M~R~l~~~~~~l~~~~~~~~~l~~~pGyV~I~~~~~~ 37 (134) T pfam07219 1 MLRVLLFLFLVLLLGLFGGAWLAGQPGYVVISYGGYR 37 (134) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEE T ss_conf 9789999999999999999999369995999998998 No 241 >pfam09920 DUF2150 Uncharacterized protein conserved in archaea (DUF2150). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=23.05 E-value=27 Score=15.24 Aligned_cols=37 Identities=11% Similarity=0.210 Sum_probs=25.9 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9999999984599989999999998599999875475456 Q gi|254780342|r 255 ENEMMKLILNNKQDRQFVGRTMLRTHPDLLKNWLIGVTTF 294 (309) Q Consensus 255 ~~~l~~~v~~~g~~~~~vA~~wL~~n~~~~~~Wl~~~~~~ 294 (309) ..++=++|-....-|..++.+ .++.+|++||.|++|- T Consensus 141 ~aq~Ga~Vi~G~~l~~~~v~d---~~~glv~EWldGiDsi 177 (190) T pfam09920 141 VAQIGARVIDGESLPEKYIED---LEYGLVAEWLDGIDSI 177 (190) T ss_pred HHHHCHHHCCCCCCCHHHCCC---CCCCHHHHHHHHHHHH T ss_conf 998272350788687766055---6740899999889999 No 242 >PRK10894 hypothetical protein; Provisional Probab=22.66 E-value=52 Score=13.37 Aligned_cols=34 Identities=21% Similarity=0.302 Sum_probs=17.4 Q ss_pred CHHH-HHHHHHHHHHHH-HHCCCCCCCCCEEEECCC Q ss_conf 9579-999999999986-303583458837994258 Q gi|254780342|r 1 MYKI-LAVCLFLTTFSI-SYARDADSCTPVRFADTG 34 (309) Q Consensus 1 mkk~-~~~~~~~~~~~~-~~~~~~~~~~~I~ig~~~ 34 (309) |+++ ++++.+++++++ ++|-.++...||.|-.-. T Consensus 7 ~~~~~l~l~~~ll~~S~~a~Al~sD~~qPI~I~AD~ 42 (184) T PRK10894 7 KLSLNLLLASSLLAASIPAFAVTGDTEQPIHIDSDQ 42 (184) T ss_pred CCHHHHHHHHHHHHHCCHHHHCCCCCCCCEEEEECE T ss_conf 420669999999871612431553446998999452 No 243 >TIGR03061 pip_yhgE_Nterm YhgE/Pip N-terminal domain. This family contains the N-terminal domain of a family of multiple membrane-spanning proteins of Gram-positive bacteria. One member was shown to be a host protein essential for phage infection, so many members of this family are called "phage infection protein". A separate model, TIGR03062, represents the conserved C-terminal domain. The domains are separated by regions highly variable in both length and sequence, often containing extended heptad repeats as described in model TIGR03057. Probab=21.88 E-value=53 Score=13.27 Aligned_cols=56 Identities=5% Similarity=-0.022 Sum_probs=38.0 Q ss_pred CCCCEEEECCCCCHH--------HHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCEEEEEE Q ss_conf 588379942588699--------99999999999610698489985640578888514981399984 Q gi|254780342|r 24 SCTPVRFADTGWTDI--------AATTAMTSVILEEILGYKTNIKLLAVPVTFRSLKNKGIDIFMGY 82 (309) Q Consensus 24 ~~~~I~ig~~~wte~--------~v~a~i~~~iLE~~lGy~Ve~~~~~~~~~~~al~~G~iD~~~e~ 82 (309) .-+.+.||..|=..+ .+-..|+..+.++. .++.+ ..+.....++|.+|+.++.... T Consensus 40 ~~~~lpvAVVN~D~g~~~~g~~~~~G~~lv~~L~~~~-~~~w~--~vs~~~A~~gl~~G~yy~~i~I 103 (164) T TIGR03061 40 NLDNLPVAVVNEDKGATYDGKTLNAGDDLVKELKKND-DLDWH--FVSAKEAEKGLADGKYYMVITI 103 (164) T ss_pred CCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCC-CCEEE--ECCHHHHHHHHHCCCEEEEEEE T ss_conf 6578718999778786508815228999999873388-86179--8489999988864956999997 No 244 >COG3583 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.59 E-value=54 Score=13.23 Aligned_cols=55 Identities=22% Similarity=0.189 Sum_probs=32.0 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCEEEECCCC-CHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 9999999999863035834588379942588-6999999999999961069848998 Q gi|254780342|r 5 LAVCLFLTTFSISYARDADSCTPVRFADTGW-TDIAATTAMTSVILEEILGYKTNIK 60 (309) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~w-te~~v~a~i~~~iLE~~lGy~Ve~~ 60 (309) +++|++++.++++++.+...+++|++-...= -...-++.-+..+|++. |.+|.-. T Consensus 17 l~~g~~l~v~~~a~~~~~~tk~~vtv~v~Ge~~~v~T~a~tV~d~l~e~-G~kv~d~ 72 (309) T COG3583 17 LAAGLLLVVLLIAGGVAEATKKTVTVTVDGEKMLVKTHASTVIDLLEEA-GIKVGDQ 72 (309) T ss_pred HHHHHHHHHHHHHHHHHHHHHCEEEEEECCEEEEEEHHHHHHHHHHHHC-CCCCCCC T ss_conf 8888999999987678875020389998785723201044688999860-9663431 No 245 >pfam09862 DUF2089 Protein of unknown function(DUF2089). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=21.56 E-value=54 Score=13.23 Aligned_cols=33 Identities=18% Similarity=0.258 Sum_probs=13.9 Q ss_pred HHHHHHHHCCCCEEEEEE---CCHHHHHHHHHCCCCE Q ss_conf 999999610698489985---6405788885149813 Q gi|254780342|r 44 MTSVILEEILGYKTNIKL---LAVPVTFRSLKNKGID 77 (309) Q Consensus 44 i~~~iLE~~lGy~Ve~~~---~~~~~~~~al~~G~iD 77 (309) ....+|+ +|||+++... .+...+...|.+|+|. T Consensus 69 rLd~ii~-~lGy~~~~~~~~~~~~~eIL~~Le~GEIs 104 (113) T pfam09862 69 RLDEIIA-ALGYEEEPEPEKRVDKKEILDKLEKGEIS 104 (113) T ss_pred HHHHHHH-HCCCCCCCCCCCCCCHHHHHHHHHCCCCC T ss_conf 9999999-80898777777762299999999849979 No 246 >TIGR03519 Bac_Flav_fam_1 Bacteroidetes-specific putative membrane protein. This model describes a protein family unique to, and greatly expanded in, the Bacteriodetes. Species in this lineage include several, such as Cytophaga hutchinsonii and Flavobacterium johnsoniae, that exhibit a poorly understood rapid gliding phenotype. Several members of this protein family are found in operons with other genes whose loss leads to a loss a this motility. Probab=21.23 E-value=55 Score=13.18 Aligned_cols=17 Identities=35% Similarity=0.524 Sum_probs=12.0 Q ss_pred CHHHHHHHHHHHHHHHH Q ss_conf 95799999999999863 Q gi|254780342|r 1 MYKILAVCLFLTTFSIS 17 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~ 17 (309) |||+++++++++++.+. T Consensus 1 Mkki~l~~~~~~~~~~~ 17 (292) T TIGR03519 1 MKKILLLLLLLLLLTVQ 17 (292) T ss_pred CCCHHHHHHHHHHHHHH T ss_conf 96057889999999877 No 247 >pfam06291 Lambda_Bor Bor protein. This family consists of several Bacteriophage lambda Bor and Escherichia coli Iss proteins. Expression of bor significantly increases the survival of the Escherichia coli host cell in animal serum. This property is a well known bacterial virulence determinant indeed, bor and its adjacent sequences are highly homologous to the iss serum resistance locus of the plasmid ColV2-K94, which confers virulence in animals. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=21.22 E-value=55 Score=13.18 Aligned_cols=12 Identities=25% Similarity=0.238 Sum_probs=8.3 Q ss_pred CHHHHHHHHHHH Q ss_conf 957999999999 Q gi|254780342|r 1 MYKILAVCLFLT 12 (309) Q Consensus 1 mkk~~~~~~~~~ 12 (309) |||++++.++++ T Consensus 1 MKk~~~~~~~al 12 (97) T pfam06291 1 MKKMLFAAALAL 12 (97) T ss_pred CHHHHHHHHHHH T ss_conf 922599999999 No 248 >cd02017 TPP_E1_EcPDC_like Thiamine pyrophosphate (TPP) family, E1 of E. coli PDC-like subfamily, TPP-binding module; composed of proteins similar to the E1 component of the Escherichia coli pyruvate dehydrogenase multienzyme complex (PDC). PDC catalyzes the oxidative decarboxylation of pyruvate and the subsequent acetylation of coenzyme A to acetyl-CoA. The E1 component of PDC catalyzes the first step of the multistep process, using TPP and a divalent cation as cofactors. E. coli PDC is a homodimeric enzyme. Probab=21.18 E-value=55 Score=13.18 Aligned_cols=98 Identities=14% Similarity=0.061 Sum_probs=58.4 Q ss_pred CHHHHHHHHHHCCCCEEEE-EECCCHH--HHCCCCEEECCCCCCCCCCHHHHH-----HHHCHHHHHHCHHHHHHHHHCC Q ss_conf 0189999999659936999-4124223--201585140078654677865835-----4427777874889999987408 Q gi|254780342|r 178 ASFSQIRRDQRNNIPAVFL-SWEPHPI--NSDLNIHYLPGGEEISGFGEASVY-----TVVRSDYLDKCPNISRLLKNIK 249 (309) Q Consensus 178 ~~~a~l~~A~~~~~~~v~~-~wsp~~~--~~~~dl~~Ledp~~~~~~p~~~~~-----~vv~~~~~~~~P~~~~~L~~~~ 249 (309) ..+.++.+.++.--|.|+- .|...|- |.+-.-..|..--+.+....|+.. -..|+.|..++|++.++++. T Consensus 199 kiiqele~~F~~aGW~VIkv~wG~~wd~lf~~d~~g~L~~~m~e~~dg~yQ~~~~~~Ga~~Re~ffg~~pel~~lv~~-- 276 (386) T cd02017 199 KIIQELEGIFRGAGWNVIKVIWGSKWDELLAKDGGGALRQRMEETVDGDYQTLKAKDGAYVREHFFGKYPELKALVTD-- 276 (386) T ss_pred CHHHHHHHHHHHCCCEEEEECCCHHHHHHHCCCCCCHHHHHHHHCCCHHHHHHHHCCHHHHHHHHCCCCHHHHHHHHC-- T ss_conf 258999999987798089853221499997678754699998866788889887247599999980898889999850-- Q ss_pred CCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 99899999999998459998999999999859 Q gi|254780342|r 250 FSVALENEMMKLILNNKQDRQFVGRTMLRTHP 281 (309) Q Consensus 250 ls~~~~~~l~~~v~~~g~~~~~vA~~wL~~n~ 281 (309) +|.+++.+|+. .|-|+..+...|-+.-. T Consensus 277 lsD~dl~~L~r----GGHD~~kl~aAy~~A~~ 304 (386) T cd02017 277 LSDEDLWALNR----GGHDPRKVYAAYKKAVE 304 (386) T ss_pred CCHHHHHHHHC----CCCCHHHHHHHHHHHHH T ss_conf 78878998762----89998999999999984 No 249 >PRK11186 carboxy-terminal protease; Provisional Probab=20.65 E-value=57 Score=13.11 Aligned_cols=19 Identities=16% Similarity=0.240 Sum_probs=8.6 Q ss_pred HHHHHHHHHHHCCCCEEEEE Q ss_conf 99999999961069848998 Q gi|254780342|r 41 TTAMTSVILEEILGYKTNIK 60 (309) Q Consensus 41 ~a~i~~~iLE~~lGy~Ve~~ 60 (309) +..+++.|=-.+ |-+|+++ T Consensus 300 lddvV~lIRG~k-GT~V~L~ 318 (673) T PRK11186 300 LDDVVALIKGPK-GSKVRLE 318 (673) T ss_pred HHHHHHHHCCCC-CCEEEEE T ss_conf 999999853899-8879999 No 250 >COG3218 ABC-type uncharacterized transport system, auxiliary component [General function prediction only] Probab=20.47 E-value=22 Score=15.75 Aligned_cols=32 Identities=16% Similarity=0.042 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCEEEECCCCC Q ss_conf 99999999998630358345883799425886 Q gi|254780342|r 5 LAVCLFLTTFSISYARDADSCTPVRFADTGWT 36 (309) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~I~ig~~~wt 36 (309) +++.+.+++++++|...+..-+++.+...+-. T Consensus 13 l~~~la~~a~L~gC~~~~~p~~~~~l~~~~~~ 44 (205) T COG3218 13 LAAALALAATLAGCGPEAAPNDTYSLSVHPPV 44 (205) T ss_pred HHHHHHHHHHHHCCCCCCCCCCEEECCCCCCC T ss_conf 99999999997234777897744641578866 No 251 >pfam03207 OspD Borrelia outer surface protein D (OspD). Probab=20.42 E-value=57 Score=13.07 Aligned_cols=22 Identities=18% Similarity=0.235 Sum_probs=15.9 Q ss_pred CHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9579999999999986303583 Q gi|254780342|r 1 MYKILAVCLFLTTFSISYARDA 22 (309) Q Consensus 1 mkk~~~~~~~~~~~~~~~~~~~ 22 (309) |||++..+++..|+.++.+|.- T Consensus 1 mkklikilllslflllsisc~h 22 (264) T pfam03207 1 MKKLIKILLLSLFLLLSISCSH 22 (264) T ss_pred CHHHHHHHHHHHHHHHHHHHCC T ss_conf 9159999999999998832225 Done!