Query gi|254780598|ref|YP_003065011.1| outer membrane protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 205 No_of_seqs 139 out of 814 Neff 7.4 Searched_HMMs 39220 Date Sun May 29 23:11:10 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780598.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3637 Opacity protein and re 100.0 3.2E-33 8E-38 188.5 18.8 178 1-205 1-199 (199) 2 PRK10808 outer membrane protei 100.0 6.1E-32 1.6E-36 181.9 17.9 173 1-205 1-195 (347) 3 pfam01389 OmpA_membrane OmpA-l 100.0 2.2E-28 5.5E-33 163.6 16.8 148 54-205 5-174 (175) 4 PRK09408 ompX outer membrane p 99.9 2.6E-20 6.6E-25 122.1 16.3 169 1-205 1-171 (171) 5 pfam06316 Ail_Lom Enterobacter 99.8 1.2E-17 3.2E-22 108.4 16.1 170 1-205 1-199 (199) 6 PRK10959 outer membrane protei 99.6 5.7E-14 1.4E-18 89.6 17.5 182 1-205 1-212 (212) 7 pfam02530 Porin_2 Porin subfam 99.5 9.9E-13 2.5E-17 83.2 14.7 78 114-192 245-357 (378) 8 pfam07437 YfaZ YfaZ precursor. 99.4 1.4E-11 3.7E-16 77.2 14.6 175 1-205 1-180 (180) 9 pfam02462 Opacity Opacity fami 99.3 1.2E-11 3.1E-16 77.6 10.3 115 83-205 2-126 (126) 10 pfam05736 OprF OprF membrane d 99.2 2.6E-10 6.5E-15 70.8 12.7 174 1-205 1-180 (184) 11 pfam03922 OmpW OmpW family. Th 99.1 1E-08 2.6E-13 62.6 15.6 130 72-205 32-189 (189) 12 COG3047 OmpW Outer membrane pr 99.0 1.8E-08 4.5E-13 61.3 14.2 130 72-205 54-213 (213) 13 PRK10716 long-chain fatty acid 98.4 7.1E-05 1.8E-09 42.8 15.6 79 126-205 297-393 (437) 14 pfam03349 Toluene_X Outer memb 98.3 0.00014 3.7E-09 41.2 15.8 85 120-205 261-366 (415) 15 COG2067 FadL Long-chain fatty 98.2 2.6E-05 6.5E-10 45.1 10.8 88 115-205 285-391 (440) 16 PRK10716 long-chain fatty acid 98.0 0.00026 6.6E-09 39.9 12.9 66 118-184 153-245 (437) 17 pfam03797 Autotransporter Auto 97.9 0.0011 2.8E-08 36.7 13.8 130 54-184 85-258 (265) 18 pfam03349 Toluene_X Outer memb 97.8 0.0016 4.1E-08 35.8 14.9 85 118-205 314-415 (415) 19 pfam10082 DUF2320 Uncharacteri 97.6 0.0026 6.7E-08 34.7 17.5 48 157-205 329-380 (380) 20 pfam06629 MipA MltA-interactin 97.6 0.0028 7.1E-08 34.6 17.4 117 78-205 87-226 (226) 21 pfam00267 Porin_1 Gram-negativ 97.6 0.0029 7.3E-08 34.5 15.4 43 162-205 286-334 (334) 22 COG2067 FadL Long-chain fatty 97.5 0.0037 9.5E-08 34.0 12.1 84 118-205 339-440 (440) 23 cd00342 gram_neg_porins Porins 97.4 0.0052 1.3E-07 33.2 11.8 24 163-187 275-298 (329) 24 COG3203 OmpC Outer membrane pr 97.3 0.0068 1.7E-07 32.6 11.7 49 156-205 298-354 (354) 25 pfam11854 DUF3374 Protein of u 97.3 0.0081 2.1E-07 32.2 13.3 42 162-205 221-283 (675) 26 cd00342 gram_neg_porins Porins 97.0 0.015 3.8E-07 30.9 17.5 141 52-205 139-283 (329) 27 TIGR03509 OMP_MtrB_PioB decahe 97.0 0.017 4.2E-07 30.6 13.8 67 119-185 486-559 (649) 28 COG3203 OmpC Outer membrane pr 96.9 0.019 4.7E-07 30.4 13.7 28 1-30 1-28 (354) 29 pfam05420 BCSC_C Cellulose syn 96.8 0.024 6.2E-07 29.8 17.3 50 153-205 287-336 (339) 30 pfam03797 Autotransporter Auto 96.7 0.029 7.3E-07 29.4 17.1 111 73-185 70-203 (265) 31 pfam05736 OprF OprF membrane d 96.6 0.004 1E-07 33.8 4.8 86 74-168 89-179 (184) 32 PRK10554 outer membrane porin 96.6 0.034 8.6E-07 29.0 11.9 43 162-205 324-373 (373) 33 TIGR03016 pepcterm_hypo_1 unch 96.6 0.035 9E-07 28.9 14.8 80 120-205 348-431 (431) 34 pfam10082 DUF2320 Uncharacteri 96.5 0.036 9.2E-07 28.9 16.6 45 160-205 295-343 (380) 35 pfam11854 DUF3374 Protein of u 96.5 0.038 9.6E-07 28.8 11.1 23 1-25 1-23 (675) 36 PRK09980 ompL outer membrane p 96.1 0.064 1.6E-06 27.6 13.3 77 124-205 152-230 (230) 37 PRK09752 adhesin; Provisional 96.1 0.066 1.7E-06 27.5 13.0 26 158-184 1205-1230(1250) 38 PRK11528 hypothetical protein; 96.0 0.071 1.8E-06 27.4 14.3 45 159-205 208-254 (254) 39 pfam11383 DUF3187 Protein of u 96.0 0.041 1E-06 28.6 7.3 119 79-205 117-243 (273) 40 pfam00267 Porin_1 Gram-negativ 96.0 0.076 2E-06 27.2 14.8 80 120-205 210-295 (334) 41 PRK09980 ompL outer membrane p 96.0 0.013 3.3E-07 31.2 4.7 63 120-187 106-171 (230) 42 PRK10993 outer membrane protea 95.7 0.1 2.6E-06 26.5 10.0 49 156-205 240-311 (311) 43 PRK13483 enterobactin receptor 95.4 0.13 3.4E-06 26.0 14.8 40 164-205 616-663 (663) 44 pfam09381 Porin_OmpG Outer mem 95.3 0.14 3.6E-06 25.8 8.5 81 120-205 193-280 (280) 45 COG3713 OmpV Outer membrane pr 95.2 0.15 3.8E-06 25.7 15.7 80 125-205 156-258 (258) 46 pfam10626 TraO Conjugative tra 95.2 0.15 3.8E-06 25.7 11.4 174 1-205 1-191 (193) 47 pfam01389 OmpA_membrane OmpA-l 94.8 0.2 5.2E-06 25.1 13.0 89 89-180 5-95 (175) 48 COG5338 Uncharacterized protei 94.7 0.019 4.9E-07 30.3 2.1 62 115-177 262-323 (468) 49 PRK12580 outer membrane protea 94.4 0.25 6.3E-06 24.6 9.7 47 158-205 242-312 (312) 50 pfam06178 KdgM Oligogalacturon 94.4 0.26 6.5E-06 24.5 15.6 51 154-205 167-218 (218) 51 COG4625 Uncharacterized protei 94.0 0.31 7.8E-06 24.1 11.2 46 157-205 532-577 (577) 52 PRK10455 periplasmic protein; 93.9 0.055 1.4E-06 28.0 3.1 29 1-29 1-29 (161) 53 pfam06727 DUF1207 Protein of u 93.7 0.36 9.1E-06 23.8 7.7 84 120-205 217-337 (337) 54 PRK11447 cellulose synthase su 93.6 0.36 9.2E-06 23.8 15.4 49 153-204 1089-1137(1158) 55 pfam01278 Omptin Omptin family 93.5 0.38 9.7E-06 23.6 9.3 49 156-205 223-295 (295) 56 pfam06551 DUF1120 Protein of u 93.3 0.055 1.4E-06 28.0 2.2 34 1-34 1-34 (145) 57 pfam04338 DUF481 Protein of un 93.1 0.44 1.1E-05 23.3 16.1 47 158-205 160-210 (210) 58 PRK10159 outer membrane phosph 93.0 0.46 1.2E-05 23.2 12.3 43 162-205 305-351 (351) 59 pfam11059 DUF2860 Protein of u 92.6 0.53 1.3E-05 22.9 13.6 74 131-204 218-297 (297) 60 PRK10002 outer membrane protei 92.1 0.62 1.6E-05 22.6 12.3 43 162-205 316-362 (362) 61 smart00869 Autotransporter Aut 92.0 0.63 1.6E-05 22.5 16.3 24 160-184 231-254 (261) 62 pfam05275 CopB Copper resistan 91.8 0.67 1.7E-05 22.4 9.0 42 163-205 159-210 (210) 63 PRK03757 hypothetical protein; 91.8 0.11 2.9E-06 26.3 2.3 24 1-25 1-24 (191) 64 pfam01617 Surface_Ag_2 Surface 91.3 0.77 2E-05 22.1 15.0 174 2-186 4-234 (279) 65 pfam03502 Channel_Tsx Nucleosi 90.9 0.84 2.1E-05 21.9 13.5 49 155-205 205-259 (259) 66 pfam02264 LamB LamB porin. Mal 90.2 0.97 2.5E-05 21.6 15.1 110 90-203 273-395 (399) 67 PRK09934 putative fimbrial pro 90.2 0.22 5.5E-06 24.9 2.4 23 1-23 1-23 (171) 68 TIGR01414 autotrans_barl outer 89.7 1.1 2.8E-05 21.3 11.2 134 69-205 294-454 (454) 69 PRK10177 hypothetical protein; 89.3 1.2 3E-05 21.1 10.9 49 153-205 246-294 (452) 70 PRK12696 flgH flagellar basal 88.7 0.79 2E-05 22.0 4.4 19 1-20 2-20 (238) 71 PRK13513 putative outer membra 88.1 1.4 3.6E-05 20.7 15.6 42 162-205 611-660 (660) 72 COG5571 Autotransporter protei 88.0 0.22 5.7E-06 24.8 1.2 133 68-205 81-239 (239) 73 PRK13861 type IV secretion sys 87.7 0.34 8.6E-06 23.9 2.0 28 1-31 2-29 (293) 74 pfam06082 DUF940 Bacterial put 86.6 1.1 2.8E-05 21.3 4.1 42 164-205 168-214 (661) 75 pfam11924 DUF3442 Protein of u 86.5 1.8 4.5E-05 20.2 14.2 49 153-205 201-249 (280) 76 TIGR03014 EpsL exopolysacchari 85.3 2.1 5.2E-05 19.9 15.7 43 162-205 335-381 (381) 77 PRK09782 bacteriophage N4 rece 85.0 0.29 7.3E-06 24.3 0.5 67 130-205 910-987 (987) 78 pfam07424 TrbM TrbM. This fami 83.8 0.68 1.7E-05 22.3 1.9 25 1-26 1-25 (189) 79 pfam12094 DUF3570 Protein of u 83.1 2.6 6.6E-05 19.4 12.3 76 125-202 210-302 (407) 80 PRK13697 cytochrome c6; Provis 83.1 0.8 2E-05 22.0 2.1 27 1-27 1-29 (111) 81 pfam05079 DUF680 Protein of un 83.1 0.73 1.9E-05 22.2 1.9 24 1-27 1-24 (73) 82 pfam07996 T4SS Type IV secreti 82.9 0.86 2.2E-05 21.8 2.2 26 1-27 1-26 (217) 83 PRK09701 D-allose transporter 82.2 0.96 2.4E-05 21.6 2.2 27 1-27 1-27 (311) 84 PRK00031 lolA outer-membrane l 81.9 0.79 2E-05 22.0 1.7 25 1-25 1-25 (201) 85 pfam11231 DUF3034 Protein of u 81.7 2.9 7.5E-05 19.1 6.6 57 127-184 148-204 (257) 86 PRK12751 cpxP periplasmic stre 81.4 1.7 4.3E-05 20.3 3.2 26 1-27 1-26 (162) 87 PRK10363 cpxP periplasmic repr 81.3 1.5 3.9E-05 20.5 3.0 24 1-24 1-24 (166) 88 pfam09411 PagL Lipid A 3-O-dea 81.1 3.1 7.9E-05 19.0 15.4 71 132-205 64-139 (139) 89 PRK09733 putative fimbrial pro 80.1 0.95 2.4E-05 21.6 1.6 23 1-26 2-24 (181) 90 PRK12698 flgH flagellar basal 80.0 3.4 8.7E-05 18.8 4.4 13 1-13 1-13 (224) 91 pfam04234 CopC Copper resistan 79.7 2.4 6E-05 19.6 3.5 35 1-35 1-36 (120) 92 PRK13893 conjugal transfer pro 79.2 1.1 2.7E-05 21.3 1.6 26 1-26 1-27 (193) 93 pfam04076 BOF Bacterial OB fol 79.1 1.6 4E-05 20.5 2.4 21 1-25 1-21 (126) 94 PRK03577 acid shock protein pr 79.0 1.5 3.7E-05 20.7 2.2 33 1-35 1-33 (102) 95 PRK12700 flgH flagellar basal 78.5 3.8 9.7E-05 18.5 4.4 20 1-20 3-23 (230) 96 COG3137 Putative salt-induced 77.3 4.1 0.00011 18.3 4.9 112 53-181 50-161 (262) 97 PRK09495 glnH glutamine ABC tr 76.2 1.9 4.9E-05 20.0 2.2 25 1-26 1-25 (247) 98 COG4571 OmpT Outer membrane pr 75.9 4.5 0.00012 18.1 14.6 47 158-205 245-314 (314) 99 PRK03641 hypothetical protein; 73.9 1.9 4.8E-05 20.1 1.7 27 1-30 1-27 (220) 100 PRK12701 flgH flagellar basal 73.9 5.2 0.00013 17.8 4.6 25 1-25 1-25 (230) 101 pfam04575 DUF560 Protein of un 73.3 5.4 0.00014 17.7 11.4 113 71-190 92-239 (301) 102 PRK12450 foldase protein PrsA; 73.1 2.3 5.7E-05 19.7 1.9 21 1-21 4-26 (309) 103 PRK02889 tolB translocation pr 72.6 2.2 5.6E-05 19.7 1.7 24 1-24 4-27 (430) 104 PRK11260 cystine transporter s 72.4 3.1 7.8E-05 19.0 2.5 26 1-26 5-30 (264) 105 PRK03095 prsA peptidylprolyl i 72.4 3 7.6E-05 19.0 2.4 25 1-25 1-25 (287) 106 PRK10954 periplasmic protein d 72.3 4.6 0.00012 18.1 3.3 21 1-25 1-21 (207) 107 TIGR02953 penta_MxKDx pentapep 72.2 2.7 6.9E-05 19.3 2.1 24 2-25 1-27 (89) 108 PRK10259 hypothetical protein; 70.7 3.3 8.5E-05 18.8 2.4 31 1-32 1-31 (86) 109 PRK10942 serine endoprotease; 70.1 4.3 0.00011 18.2 2.8 19 1-19 1-19 (474) 110 pfam10566 Glyco_hydro_97 Glyco 69.5 3.2 8E-05 18.9 2.0 32 1-32 1-32 (643) 111 pfam07396 Porin_O_P Phosphate- 69.4 6.6 0.00017 17.3 14.2 44 161-205 313-361 (361) 112 PRK10780 periplasmic chaperone 68.6 2.6 6.7E-05 19.3 1.5 22 1-25 1-22 (161) 113 cd07306 Porin3_VDAC Voltage-de 67.8 7.2 0.00018 17.1 13.8 16 170-185 190-205 (276) 114 PRK11627 hypothetical protein; 67.6 4.9 0.00012 17.9 2.7 16 1-17 2-17 (192) 115 pfam11336 DUF3138 Protein of u 67.5 3.3 8.5E-05 18.8 1.8 24 162-186 463-486 (514) 116 pfam10836 DUF2574 Protein of u 66.1 3.3 8.3E-05 18.8 1.5 26 1-27 1-26 (93) 117 PRK10355 xylF D-xylose transpo 66.0 5.1 0.00013 17.9 2.5 27 1-27 1-27 (330) 118 pfam07383 DUF1496 Protein of u 66.0 5.1 0.00013 17.9 2.5 31 1-31 1-31 (88) 119 PRK10672 rare lipoprotein A; P 65.9 7.9 0.0002 16.9 4.0 20 1-20 1-20 (369) 120 pfam03249 TSA Type specific an 65.7 7.8 0.0002 16.9 3.4 75 129-205 443-524 (524) 121 TIGR03525 GldK gliding motilit 65.3 6 0.00015 17.5 2.8 46 1-49 1-46 (449) 122 PRK12788 flgH flagellar basal 64.8 8.3 0.00021 16.8 4.1 18 1-18 1-18 (231) 123 pfam12099 DUF3575 Protein of u 64.3 8.5 0.00022 16.7 10.0 31 158-188 117-154 (186) 124 pfam11777 DUF3316 Protein of u 64.2 3.6 9.3E-05 18.6 1.4 21 1-25 1-21 (114) 125 COG4991 Uncharacterized protei 64.1 7.4 0.00019 17.0 3.0 24 1-24 10-33 (155) 126 PRK10877 thiol:disulfide inter 63.6 3.2 8.1E-05 18.9 1.1 25 1-27 1-25 (232) 127 COG3700 AphA Acid phosphatase 63.4 8.1 0.00021 16.8 3.1 27 1-27 1-27 (237) 128 pfam06291 Lambda_Bor Bor prote 63.3 4.3 0.00011 18.3 1.7 23 1-25 1-23 (97) 129 PRK10653 D-ribose transporter 63.2 5.1 0.00013 17.8 2.1 26 1-26 3-28 (295) 130 pfam07901 DUF1672 Protein of u 63.1 4.8 0.00012 18.0 1.9 22 1-22 1-22 (304) 131 pfam05538 Campylo_MOMP Campylo 62.8 9.1 0.00023 16.6 5.4 14 192-205 424-437 (437) 132 COG2829 PldA Outer membrane ph 61.9 9.5 0.00024 16.5 8.0 76 114-205 236-313 (317) 133 pfam12276 DUF3617 Protein of u 61.8 7.2 0.00018 17.1 2.6 17 1-17 1-17 (149) 134 PRK12407 flgH flagellar basal 61.6 9.6 0.00024 16.4 3.7 16 2-18 1-16 (220) 135 PRK09465 tolC outer membrane c 61.3 5.2 0.00013 17.8 1.8 29 1-30 1-29 (446) 136 PRK13484 putative iron-regulat 60.9 9.9 0.00025 16.4 15.3 85 120-205 585-682 (682) 137 PRK06763 F0F1 ATP synthase sub 60.5 1.8 4.7E-05 20.1 -0.6 18 1-18 3-20 (213) 138 PRK09501 potD spermidine/putre 60.2 7.2 0.00018 17.1 2.4 25 1-25 1-25 (348) 139 COG3678 CpxP P pilus assembly/ 60.0 6.4 0.00016 17.4 2.1 31 1-31 2-32 (160) 140 PRK10564 maltose regulon perip 60.0 8.5 0.00022 16.7 2.7 30 1-30 1-30 (303) 141 PRK12395 maltoporin; Provision 59.9 5.8 0.00015 17.5 1.9 25 1-25 1-25 (419) 142 PRK10044 ferrichrome outer mem 59.6 10 0.00027 16.2 15.0 25 161-186 313-337 (730) 143 cd01345 OM_channels Porin supe 58.9 11 0.00027 16.2 4.2 22 162-184 191-212 (253) 144 PRK10802 peptidoglycan-associa 58.8 11 0.00028 16.2 3.3 17 1-19 7-23 (173) 145 pfam06649 DUF1161 Protein of u 58.8 4.9 0.00013 17.9 1.3 26 1-30 1-26 (75) 146 pfam12262 Lipase_bact_N Bacter 57.6 6.9 0.00018 17.2 1.9 16 1-16 1-16 (269) 147 TIGR03502 lipase_Pla1_cef extr 56.0 7.8 0.0002 16.9 2.0 18 1-18 1-19 (792) 148 PRK09915 putative outer membra 53.2 14 0.00035 15.7 3.7 21 1-21 6-26 (488) 149 PRK10449 heat-inducible protei 52.5 10 0.00026 16.3 2.1 17 1-18 1-17 (140) 150 PRK11459 multidrug resistance 52.3 14 0.00036 15.6 3.8 28 1-28 1-28 (478) 151 COG4314 NosL Predicted lipopro 51.9 14 0.00036 15.6 2.7 23 1-23 1-23 (176) 152 pfam12071 DUF3551 Protein of u 51.2 9.9 0.00025 16.4 1.9 11 1-11 1-11 (82) 153 pfam11853 DUF3373 Protein of u 51.0 15 0.00038 15.5 9.8 54 131-184 332-392 (485) 154 pfam03895 YadA YadA-like C-ter 50.9 15 0.00038 15.5 6.1 38 162-205 41-78 (78) 155 PRK10049 pgaA outer membrane p 50.8 15 0.00038 15.5 6.4 48 157-205 769-818 (818) 156 PRK11553 alkanesulfonate trans 50.6 14 0.00035 15.7 2.5 25 1-25 1-25 (314) 157 PRK01326 prsA foldase protein 49.4 13 0.00034 15.7 2.3 18 1-18 1-21 (310) 158 PRK10894 hypothetical protein; 49.1 11 0.00029 16.0 1.9 26 1-26 7-32 (184) 159 PRK04405 prsA peptidylprolyl i 48.4 12 0.0003 16.0 1.9 15 1-15 5-19 (298) 160 COG5455 Predicted integral mem 48.3 14 0.00037 15.5 2.3 13 1-13 2-14 (129) 161 PRK10290 superoxide dismutase; 48.1 11 0.00029 16.0 1.8 22 1-26 1-22 (173) 162 PRK01904 hypothetical protein; 47.9 11 0.00027 16.2 1.6 27 1-29 1-27 (217) 163 COG3767 Uncharacterized low-co 47.9 12 0.0003 16.0 1.8 24 1-24 1-25 (95) 164 COG4851 CamS Protein involved 47.4 14 0.00035 15.6 2.1 25 1-26 1-26 (382) 165 TIGR01672 AphA HAD superfamily 47.1 17 0.00044 15.1 3.2 26 1-26 1-27 (248) 166 PRK00249 flgH flagellar basal 47.0 17 0.00044 15.1 4.6 25 1-25 1-25 (231) 167 PRK13680 hypothetical protein; 46.9 14 0.00034 15.7 2.0 25 1-25 5-29 (121) 168 COG2854 Ttg2D ABC-type transpo 46.3 12 0.00031 15.9 1.7 24 1-24 3-26 (202) 169 pfam04357 DUF490 Family of unk 46.2 18 0.00045 15.1 3.8 30 166-205 318-347 (349) 170 PRK11917 bifunctional adhesin/ 45.3 17 0.00042 15.2 2.3 18 1-18 3-20 (259) 171 COG3026 RseB Negative regulato 44.4 18 0.00047 15.0 2.4 27 1-27 1-27 (320) 172 PRK11119 proX glycine betaine 43.6 18 0.00046 15.0 2.3 25 2-29 4-28 (332) 173 pfam06518 DUF1104 Protein of u 43.6 19 0.00047 15.0 2.3 23 1-26 1-23 (142) 174 pfam09403 FadA Adhesion protei 43.1 17 0.00044 15.1 2.1 10 1-10 1-10 (126) 175 CHL00132 psaF photosystem I su 42.9 20 0.00051 14.8 2.6 31 1-31 1-31 (185) 176 PRK09967 putative outer membra 42.4 21 0.00052 14.7 2.6 20 1-20 1-21 (160) 177 TIGR02791 VirB5 P-type DNA tra 41.7 21 0.00054 14.7 2.5 16 1-16 1-16 (233) 178 pfam07585 DUF1551 Protein of u 41.4 21 0.00054 14.7 11.2 50 155-205 258-338 (338) 179 PRK04517 hypothetical protein; 41.4 16 0.00041 15.3 1.7 25 1-27 1-26 (216) 180 TIGR03431 PhnD phosphonate ABC 41.4 21 0.00054 14.7 2.5 25 1-26 1-25 (288) 181 PRK11397 dacD D-alanyl-D-alani 41.2 22 0.00055 14.6 3.0 27 1-27 4-30 (390) 182 PRK09838 periplasmic copper-bi 41.0 22 0.00055 14.6 3.2 26 1-27 1-26 (114) 183 PHA00025 major coat protein 40.8 19 0.00048 14.9 2.0 27 1-27 1-27 (76) 184 pfam05643 DUF799 Putative bact 40.7 20 0.00051 14.8 2.1 16 1-17 1-16 (215) 185 TIGR03227 PhnS 2-aminoethylpho 40.6 19 0.00048 15.0 1.9 27 1-27 12-38 (367) 186 pfam08139 LPAM_1 Prokaryotic m 40.2 20 0.00051 14.8 2.0 13 1-13 1-13 (26) 187 COG4771 FepA Outer membrane re 39.2 23 0.00059 14.5 7.6 44 160-205 639-699 (699) 188 TIGR03519 Bac_Flav_fam_1 Bacte 38.6 24 0.0006 14.4 16.8 11 1-11 1-11 (292) 189 PRK13731 conjugal transfer sur 38.5 21 0.00055 14.6 2.0 37 1-37 3-44 (243) 190 PRK10797 glutamate and asparta 38.3 19 0.00049 14.9 1.7 16 1-16 3-18 (302) 191 pfam05150 Legionella_OMP Legio 38.2 24 0.00061 14.4 11.9 41 2-48 5-45 (297) 192 PRK10154 hypothetical protein; 37.1 23 0.00059 14.5 2.0 27 1-27 1-27 (134) 193 PRK09818 putative kinase inhib 37.0 20 0.0005 14.8 1.6 27 1-31 1-27 (183) 194 PRK10316 hypothetical protein; 36.6 20 0.00051 14.8 1.6 29 1-31 1-29 (209) 195 PRK11443 hypothetical protein; 36.5 19 0.0005 14.9 1.5 13 1-13 1-13 (120) 196 COG4549 Uncharacterized protei 36.4 24 0.00062 14.4 2.0 23 1-23 1-23 (178) 197 PRK08482 F0F1 ATP synthase sub 35.7 16 0.00042 15.2 1.0 20 1-26 1-20 (105) 198 pfam02402 Lysis_col Lysis prot 35.6 17 0.00043 15.2 1.1 22 1-22 1-22 (46) 199 PRK09360 lamB maltoporin; Prov 34.9 27 0.0007 14.1 14.4 24 1-26 5-28 (417) 200 pfam11106 YjbE Exopolysacchari 34.4 28 0.00071 14.1 2.4 25 1-27 1-25 (80) 201 COG2825 HlpA Outer membrane pr 33.8 29 0.00073 14.0 2.2 25 1-25 1-25 (170) 202 PRK10957 iron-enterobactin tra 33.5 29 0.00074 14.0 2.8 32 2-33 5-36 (319) 203 pfam00879 Defensin_propep Defe 32.4 30 0.00077 13.9 2.9 29 1-33 1-29 (52) 204 PRK10173 glucose-1-phosphatase 32.2 30 0.00078 13.9 2.3 25 1-26 1-25 (413) 205 TIGR01653 lactococcin_972 bact 32.2 31 0.00078 13.9 2.0 25 1-25 1-28 (149) 206 COG4531 ZnuA ABC-type Zn2+ tra 32.2 31 0.00078 13.9 2.0 31 1-31 4-34 (318) 207 PRK10598 hypothetical protein; 31.8 31 0.00079 13.8 2.2 15 1-15 1-15 (186) 208 pfam10614 Tafi-CsgF Curli prod 31.3 32 0.00081 13.8 2.6 30 1-32 1-30 (142) 209 pfam01308 Chlam_OMP Chlamydia 31.1 32 0.00081 13.8 2.5 23 1-24 1-23 (389) 210 TIGR01098 3A0109s03R phosphona 30.1 33 0.00085 13.7 2.9 18 1-18 1-18 (299) 211 PRK04792 tolB translocation pr 30.1 33 0.00085 13.7 2.2 28 1-31 2-29 (450) 212 PRK10884 putative signal trans 29.4 29 0.00074 14.0 1.4 28 1-29 1-28 (206) 213 TIGR01779 TonB-B12 TonB-depend 29.0 34 0.00087 13.6 1.7 61 119-181 238-303 (644) 214 TIGR00031 UDP-GALP_mutase UDP- 28.7 14 0.00035 15.7 -0.3 18 172-189 291-308 (390) 215 COG4588 AcfC Accessory coloniz 28.3 36 0.00092 13.5 2.0 24 1-27 1-24 (252) 216 PRK11372 lysozyme inhibitor; P 28.2 29 0.00074 14.0 1.3 14 1-14 3-16 (109) 217 COG4150 CysP ABC-type sulfate 27.4 37 0.00095 13.4 1.9 24 1-24 6-29 (341) 218 COG5633 Predicted periplasmic 26.7 38 0.00097 13.4 1.6 25 1-27 1-25 (123) 219 pfam07273 DUF1439 Protein of u 26.4 39 0.001 13.3 2.1 18 1-18 1-18 (177) 220 COG5510 Predicted small secret 26.2 39 0.001 13.3 2.3 20 1-20 2-24 (44) 221 PRK10918 phosphate transporter 26.2 40 0.001 13.3 2.2 28 1-28 4-31 (346) 222 COG3133 SlyB Outer membrane li 26.1 38 0.00098 13.3 1.6 18 1-18 1-19 (154) 223 pfam03482 SIC sic protein. Ser 25.8 40 0.001 13.2 1.7 22 2-23 11-32 (306) 224 PRK03629 tolB translocation pr 25.5 41 0.001 13.2 2.1 22 1-24 1-22 (430) 225 PRK13211 N-acetylglucosamine-b 25.3 41 0.001 13.2 2.1 30 1-30 1-30 (475) 226 cd07303 Porin3 Eukaryotic pori 25.2 41 0.0011 13.2 9.3 18 162-180 213-230 (274) 227 PRK09558 ushA bifunctional UDP 25.0 42 0.0011 13.2 3.0 20 1-20 1-20 (551) 228 PRK10523 lipoprotein involved 24.9 42 0.0011 13.1 3.0 18 1-18 2-19 (234) 229 cd01346 Maltoporin-like The Ma 24.4 43 0.0011 13.1 14.7 113 85-203 260-388 (392) 230 TIGR01165 cbiN cobalt transpor 24.1 43 0.0011 13.1 2.1 16 1-16 3-21 (94) 231 PRK00059 prsA peptidylprolyl i 23.4 45 0.0011 13.0 2.1 21 1-21 4-24 (336) 232 PRK09810 entericidin A; Provis 23.4 45 0.0011 13.0 2.1 21 1-21 2-22 (41) 233 pfam04202 Mfp-3 Foot protein 3 23.4 45 0.0011 13.0 2.4 18 1-18 1-18 (71) 234 COG4143 TbpA ABC-type thiamine 23.2 45 0.0012 13.0 1.9 26 1-26 1-26 (336) 235 PRK03002 prsA peptidylprolyl i 22.4 47 0.0012 12.9 2.2 17 1-17 1-17 (285) 236 COG3489 Predicted periplasmic 21.9 46 0.0012 13.0 1.3 21 1-21 4-24 (359) 237 COG3248 Tsx Nucleoside-binding 21.7 49 0.0012 12.8 12.8 32 1-35 5-36 (284) 238 COG5514 Uncharacterized conser 21.6 49 0.0012 12.8 1.4 22 1-26 1-22 (203) 239 PRK10270 hypothetical protein; 21.6 49 0.0013 12.8 1.9 16 1-16 1-16 (340) 240 TIGR02967 guan_deamin guanine 21.6 24 0.00062 14.4 -0.2 14 121-134 179-193 (426) 241 COG3227 LasB Zinc metalloprote 21.4 50 0.0013 12.8 2.0 28 1-28 1-29 (507) 242 TIGR01655 yxeA_fam conserved h 20.9 51 0.0013 12.7 2.1 25 1-25 1-25 (121) 243 PRK12750 cpxP periplasmic repr 20.9 51 0.0013 12.7 2.0 22 2-24 5-26 (170) 244 pfam01103 Bac_surface_Ag Surfa 20.9 51 0.0013 12.7 11.6 70 133-205 245-317 (317) 245 PRK11289 ampC beta-lactamase; 20.8 51 0.0013 12.7 1.9 27 1-27 4-31 (387) 246 TIGR02448 TIGR02448 conserverd 20.7 48 0.0012 12.9 1.2 24 1-26 1-25 (107) 247 COG3652 Predicted outer membra 20.7 51 0.0013 12.7 2.2 26 1-26 1-26 (170) 248 pfam08194 DIM DIM protein. Dro 20.7 51 0.0013 12.7 2.3 18 1-18 1-19 (37) 249 PRK09474 malE maltose ABC tran 20.6 52 0.0013 12.7 2.3 16 1-16 1-16 (394) 250 PRK09840 catecholate sideropho 20.4 52 0.0013 12.7 11.5 16 171-186 623-638 (760) No 1 >COG3637 Opacity protein and related surface antigens [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=3.2e-33 Score=188.50 Aligned_cols=178 Identities=26% Similarity=0.367 Sum_probs=140.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCEEE Q ss_conf 94437899999999984100002466543566666555634554444441154210001223333446677-77760268 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHHGGRGVVPTIATNRYVPIRHDFNGPYAGLSALYNGSFGEE-AHHNAGGS 79 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~~~~~~~~~~~~~~~~~~~~~w~G~y~g~~~g~~~~~~~~-~~~~~~~G 79 (205) ||++++++++.+ ...++|.|||. ...|.+.|.+++.++.+...+. ...++..| T Consensus 1 mk~~l~~a~~~~---~~~~~a~aad~-----------------------~~~w~~~~~~g~a~~~~~~~~~~~~~~~~~G 54 (199) T COG3637 1 MKKLLAAAALAA---LLLSAAAAADA-----------------------AAGWYGTYSGGYAGSDWSGSDSSPSIGFGGG 54 (199) T ss_pred CHHHHHHHHHHH---HHHHHHHHHHH-----------------------HCCCCEEEEEEEEECCCCCCCCCCCCCCCCC T ss_conf 906999999999---87514444444-----------------------2387068863488426567764435665422 Q ss_pred EE-EEEEEECCCEEEEEEEEEEECCCCCC----CCCCCC--CCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEEC---C Q ss_conf 99-85464048379984510132155566----662122--32148999897543127677898854477644431---3 Q gi|254780598|r 80 IF-AGYNVEDSCIMYGVEGDVRYTVPVLA----DNIHSL--HGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYET---G 149 (205) Q Consensus 80 ~~-~GYn~q~~~~V~G~e~d~~~~~~~~~----~~~~~~--~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~---~ 149 (205) ++ .|||+|.++.|+|+|+++++...... ...... ..|.++++++++|.++++++||+.+|++..+++.+ . T Consensus 55 ~~~~g~n~~~~~~v~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~a~~~~~~~~~~~~Y~~aGvg~~~~k~~~~~~ 134 (199) T COG3637 55 LKLAGYNLKYRYSVLGVEGSFTYTGQSGRYSGGSGKNQVDNKAWYGSLRAGPDYRINDRFSPYGGAGVAYGKVKTSTDEV 134 (199) T ss_pred EEEEEEEEECCCEEEEEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEECCEECCCCEEEEEECCEEEEEEEEECCCC T ss_conf 16676873358467888988761024665447666444442057899998655105864444040105768998712566 Q ss_pred CCC-CCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCC---------CCCCCEEEEEEEEEEC Q ss_conf 455-5677520479973466100488679999999975056---------5566226999998539 Q gi|254780598|r 150 KAG-EITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSK---------VEGFYNTISLGVGMKF 205 (205) Q Consensus 150 ~~~-~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~---------~~~~~~~~~~G~~ykF 205 (205) ... ++++++.+|++|+|+|| .+||||++++||||++|++ .+.+.|+|++||+||| T Consensus 135 ~~~~~~s~~~~g~~~gAGv~~-~~tdnv~vd~~Y~y~~~g~~~~~~~~~~~~~~~~~v~vGv~YkF 199 (199) T COG3637 135 GGSADESKTKTGYAYGAGVQY-NPTDNVAIDLGYEYSDFGKKDGVDGGYSGDVKTHTVKVGVGYKF 199 (199) T ss_pred CCCCCCCCCEEEEEEEEEEEE-CCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEEC T ss_conf 741167655788678410798-15598799889998815886456754136401340899605659 No 2 >PRK10808 outer membrane protein A; Reviewed Probab=100.00 E-value=6.1e-32 Score=181.87 Aligned_cols=173 Identities=23% Similarity=0.322 Sum_probs=126.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--------CCCCC Q ss_conf 9443789999999998410000246654356666655563455444444115421000122333344--------66777 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHHGGRGVVPTIATNRYVPIRHDFNGPYAGLSALYNGS--------FGEEA 72 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~~~~~~~~~~~~~~~~~~~~~w~G~y~g~~~g~~~~--------~~~~~ 72 (205) |||++++++++.++++ +.|+||.. ... +|+|+.+|++.. ..+++ T Consensus 1 MKKt~ial~va~a~~a--~~A~Aa~~---------------dn~-----------wY~G~k~G~s~~~d~~~~~~~~~~d 52 (347) T PRK10808 1 MKKTAIAIAVALAGFA--TVAQAAPK---------------DNT-----------WYTGAKLGWSQYHDTGFINNNGPTH 52 (347) T ss_pred CCHHHHHHHHHHHHHH--HHHHCCCC---------------CCC-----------EEEEEEEEEEEECCCCCCCCCCCCC T ss_conf 9467999999999777--64312000---------------288-----------7896421100121331246677765 Q ss_pred CCCCEEEEEEEEEEECCCEEEEEEEEEEECCCCC---CCCCCCCCCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECC Q ss_conf 7760268998546404837998451013215556---6662122321489998975431276778988544776444313 Q gi|254780598|r 73 HHNAGGSIFAGYNVEDSCIMYGVEGDVRYTVPVL---ADNIHSLHGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETG 149 (205) Q Consensus 73 ~~~~~~G~~~GYn~q~~~~V~G~e~d~~~~~~~~---~~~~~~~~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~ 149 (205) ++.+++|+|+|||++. .+++|..|++.+... .........+..+|.+|+.|+++|++.+|+++|.++++.+... T Consensus 53 ~~~~g~G~f~GYqfn~---~~a~E~GYd~lG~~~~~g~~~~~~~~~~g~~La~k~~ypl~d~l~ly~k~G~~y~~~d~~~ 129 (347) T PRK10808 53 KNQLGAGAFGGYQVNP---YLGFEMGYDWLGRMPYKGDVSNGAFKAQGVQLTAKLGYPITDDLDVYTRLGGMVWRADTKS 129 (347) T ss_pred CCCCEEEEEEEEEECC---EEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCEEEEEEEEEEEECCCC T ss_conf 5661245675477254---4889870000285432454447640146788888881045677232578534899712344 Q ss_pred CC-----CCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEE-CCCC-----CCCCEEEEEEEEEEC Q ss_conf 45-----55677520479973466100488679999999975-0565-----566226999998539 Q gi|254780598|r 150 KA-----GEITPIAIGGTAGVGVEVGGLSESLVARLEYRASK-YSKV-----EGFYNTISLGVGMKF 205 (205) Q Consensus 150 ~~-----~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~d-f~~~-----~~~~~~~~~G~~ykF 205 (205) .. ...+++...+++|+|+|| +||++|.+|+||||.+ .++. +.+.|.+++|+.||| T Consensus 130 ~~~~~~~~~~~d~g~s~~~g~GveY-~lt~~~~~R~eYq~~~~igd~~~~g~~~d~~~~slGlsY~F 195 (347) T PRK10808 130 NVTGGARGKDHDTGVSPVFAGGVEY-AITPEWATRLEYQWTNNIGDANTIGTRPDNGLLSVGVSYRF 195 (347) T ss_pred CCCCCCCCCCCCCCCCCEEEEEEEE-EECCCCEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEECC T ss_conf 5566543235656736206720358-80637268888999810575555574677756999999637 No 3 >pfam01389 OmpA_membrane OmpA-like transmembrane domain. The structure of OmpA transmembrane domain shows that it consists of an eight stranded beta barrel. This family includes some other distantly related outer membrane proteins with low scores. Probab=99.96 E-value=2.2e-28 Score=163.63 Aligned_cols=148 Identities=20% Similarity=0.280 Sum_probs=112.4 Q ss_pred CCCCCCCCCCCCCC--------CCCCCCCCCEEEEEEEEEEECCCEEEEEEEEEEECCCCCCC---CCCCCCCEEEEEEE Q ss_conf 21000122333344--------66777776026899854640483799845101321555666---62122321489998 Q gi|254780598|r 54 NGPYAGLSALYNGS--------FGEEAHHNAGGSIFAGYNVEDSCIMYGVEGDVRYTVPVLAD---NIHSLHGIGGSLRI 122 (205) Q Consensus 54 ~G~y~g~~~g~~~~--------~~~~~~~~~~~G~~~GYn~q~~~~V~G~e~d~~~~~~~~~~---~~~~~~~~~~~~~~ 122 (205) ..+|+|+++|++.- ..+++++..++|+|+||||+. .+++|+.+.+.+..... ..........+|.+ T Consensus 5 n~~Y~G~~~G~s~~~~~~~~~~~~~~d~~~~~~glf~GYqfn~---~~a~E~gY~~lG~~~~~~~~~~~~~~~~~~~l~~ 81 (175) T pfam01389 5 NTWYTGAKAGWASFHDTGLIEGNGPTHRNSLTAGAFGGYQINN---YLAAELGYDYLGRMPYKGKGKNNAHKAQGVTLSL 81 (175) T ss_pred CCEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEECCC---EEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEE T ss_conf 9578966402568957634457876567871789998631031---1226874243165433553434540468999999 Q ss_pred EEEEEECCCEEEEEEEEEEEEEEEECCC-----CCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEE-CCC-----CC Q ss_conf 9754312767789885447764443134-----555677520479973466100488679999999975-056-----55 Q gi|254780598|r 123 RGGYEVSDSLLLYATVGPDVAQKYETGK-----AGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASK-YSK-----VE 191 (205) Q Consensus 123 r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~-----~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~d-f~~-----~~ 191 (205) |+.||++|++.+|+|+|+++++.+.+.. ....+++.+++++|+|+|| +|||||++|+||||++ .++ .+ T Consensus 82 k~~~pl~d~~~ly~K~G~a~~~~d~~~~~~~~~~~~~~d~~~~~~~G~G~eY-~it~~~~~r~eYq~~~~~g~~~~~~~~ 160 (175) T pfam01389 82 KLSYPLTDDLDVYGKVGGALVRADYKFYEDANGKTGNHDTGVSPLFALGVEY-AVTPELAVRLEYQYLNNIGDLHKQGKR 160 (175) T ss_pred EEEEEECCCEEEEEEEEEEEEECCCCEECCCCCCCCCCCCCCCEEEEEEEEE-EECCCCEEEEEEEEEHHCCCCCCCCCC T ss_conf 9888614856799971068996133210256664223567705388888899-807582788889992225853356766 Q ss_pred CCCEEEEEEEEEEC Q ss_conf 66226999998539 Q gi|254780598|r 192 GFYNTISLGVGMKF 205 (205) Q Consensus 192 ~~~~~~~~G~~ykF 205 (205) .++|.+++||.||| T Consensus 161 ~D~~~~slGlsYrF 174 (175) T pfam01389 161 PDNGSASLGISYRF 174 (175) T ss_pred CCCCEEEEEEEEEC T ss_conf 77777999999777 No 4 >PRK09408 ompX outer membrane protein X; Provisional Probab=99.86 E-value=2.6e-20 Score=122.14 Aligned_cols=169 Identities=20% Similarity=0.222 Sum_probs=121.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 94437899999999984100002466543566666555634554444441154210001223333446677777602689 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHHGGRGVVPTIATNRYVPIRHDFNGPYAGLSALYNGSFGEEAHHNAGGSI 80 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~~~~~~~~~~~~~~~~~~~~~w~G~y~g~~~g~~~~~~~~~~~~~~~G~ 80 (205) |||++.+.++++++...+.+|+|++.+ -+. ||+.++.....+ ..-|+ T Consensus 1 mkk~~~~s~~~~~~~~~a~~a~A~~sT-------------------------vS~-------GYAQs~~~~~~n-~l~G~ 47 (171) T PRK09408 1 MKKIACLSALAAVLAFTAGTAVAATST-------------------------VTG-------GYAQSDAQGVAN-KMGGF 47 (171) T ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCE-------------------------EEE-------EEEEECCCCCCC-CCCCE T ss_conf 922478999999988566666425766-------------------------987-------886420013467-86744 Q ss_pred EEEEEEECCCEEEEEEEEEEECCCCCCCC-CCCCCCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECCC-CCCCCCCC Q ss_conf 98546404837998451013215556666-21223214899989754312767789885447764443134-55567752 Q gi|254780598|r 81 FAGYNVEDSCIMYGVEGDVRYTVPVLADN-IHSLHGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETGK-AGEITPIA 158 (205) Q Consensus 81 ~~GYn~q~~~~V~G~e~d~~~~~~~~~~~-~~~~~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~-~~~~~~~~ 158 (205) -+-|++++++--+|+-+-+.|...+...+ .........+|.+-+.|.+||.+.+|+.+|.+..+++.... ..+.+.++ T Consensus 48 NlKYrYE~dn~~~Gvi~Sftyt~~d~~~~~~~~~~~~YySl~aGPsyR~NdyvS~Ygl~G~a~~k~~~~~~~~~~~s~s~ 127 (171) T PRK09408 48 NLKYRYEEDNSPLGVIGSFTYTEKSRTADSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTEYPTYKHDTSD 127 (171) T ss_pred EEEEEEEECCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEEEECCEEEECCEEEEEEECCEEEEEEEECCCCCCCCCCCC T ss_conf 88876673698677899999981665667753120469989534427862437522550133689973246764456110 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEEEEEC Q ss_conf 04799734661004886799999999750565566226999998539 Q gi|254780598|r 159 IGGTAGVGVEVGGLSESLVARLEYRASKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 159 ~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~~~~~~~G~~ykF 205 (205) ++..+|+|++++ ..+||++.+.|+++++.+++ .++|.+|++||| T Consensus 128 t~fayGAGvQ~N-P~~nvaid~gYE~S~~~~~~--~~~f~vGvGYRF 171 (171) T PRK09408 128 YGFSYGAGLQFN-PMENVALDFSYEQSRIRSVD--VGTWIAGVGYRF 171 (171) T ss_pred CCEEEEEEEEEC-CCCCEEEEEEEEECCCCCEE--ECCEEEEEEECC T ss_conf 214666547980-67787999876111458727--685998114139 No 5 >pfam06316 Ail_Lom Enterobacterial Ail/Lom protein. This family consists of several bacterial and phage Ail/Lom-like proteins. The Yersinia enterocolitica Ail protein is a known virulence factor. Proteins in this family are predicted to consist of eight transmembrane beta-sheets and four cell surface-exposed loops. It is thought that Ail directly promotes invasion and loop 2 contains an active site, perhaps a receptor-binding domain. The phage protein Lom is expressed during lysogeny, and encode host-cell envelope proteins. Lom is found in the bacterial outer membrane, and is homologous to virulence proteins of two other enterobacterial genera. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=99.79 E-value=1.2e-17 Score=108.36 Aligned_cols=170 Identities=20% Similarity=0.188 Sum_probs=112.7 Q ss_pred CHHHHHHHHHHHHHHHHCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 94437899999999984100-00246654356666655563455444444115421000122333344667777760268 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCS-AQAADPVRRAHHGGRGVVPTIATNRYVPIRHDFNGPYAGLSALYNGSFGEEAHHNAGGS 79 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~-A~Aad~~~~~~~~~~~~~~~~~~~~~~~~~~~w~G~y~g~~~g~~~~~~~~~~~~~~~G 79 (205) |||++..+..+++.|++++. |.|+. ....-+..|+..|..... ..-.-| T Consensus 1 m~~~~~~~l~~~~~l~~~~~~a~A~~-----------------------~~sTvS~GYaQa~~~~~g-------~~~l~G 50 (199) T pfam06316 1 MRKLCAAILSAAICLAAAGTPASAAE-----------------------HQSTLSAGYLHAHTDAPG-------SDDLNG 50 (199) T ss_pred CCHHHHHHHHHHHHHHHHCCCCHHHC-----------------------CCCEEEECEEEEECCCCC-------CCCCCC T ss_conf 90177899999999997277311214-----------------------786686221003147566-------778885 Q ss_pred EEEEEEEECCCEEEEEEEEEEECCCCCCCC----------CCCCCCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECC Q ss_conf 998546404837998451013215556666----------2122321489998975431276778988544776444313 Q gi|254780598|r 80 IFAGYNVEDSCIMYGVEGDVRYTVPVLADN----------IHSLHGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETG 149 (205) Q Consensus 80 ~~~GYn~q~~~~V~G~e~d~~~~~~~~~~~----------~~~~~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~ 149 (205) +.+-|++++++- +|+-+-+.|...+.+.. ...+.. ..+|.+-+.|.+||.+.+|+.+|++..+++... T Consensus 51 ~NlKYRYE~d~~-lGvIgSfTy~~~~~~~~~~~~~~~~~~~~~~~~-YySlmaGPsyR~Nd~vS~Ygl~G~a~~k~~~~~ 128 (199) T pfam06316 51 INVKYRYEFTDT-LGLITSFSYANAEDEQKTHYNDTRWHEDSVRNR-WFSVMAGPSVRVNEWFSAYALAGVAYARVSSFA 128 (199) T ss_pred EEEEEEEECCCC-EEEEEEEEEECCCCCCCEECCCCCCCCCCCCCE-EEEEEECCCEEECCEEEEEEECCEEEEEEEECC T ss_conf 388866441798-078999999615653120205762025642203-899973462685110120121231367764024 Q ss_pred ------------------CCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEEEEEC Q ss_conf ------------------45556775204799734661004886799999999750565566226999998539 Q gi|254780598|r 150 ------------------KAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 150 ------------------~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~~~~~~~G~~ykF 205 (205) ...+.+.+.++..+|+|++++ ..+|+++.+.|+++..++.+ .++|.+||+||| T Consensus 129 ~d~~~~~~~~~~~~~~~~~s~~~s~s~tsfAyGAGvQfN-P~~nvaiD~gYEgS~~~d~~--~~gf~vGVGYRF 199 (199) T pfam06316 129 GDYFRLTDDEGKKHDHLTGSDDARRSHTALAWGAGVQFN-PTENIAVDLAYEASGRGDWR--TDAFIVGTGYKF 199 (199) T ss_pred CCEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEC-CCCCEEEEEEEEECCCCCCE--ECCEEEEEEECC T ss_conf 650331255565455544444454443322442027971-57667999877630348705--175899514409 No 6 >PRK10959 outer membrane protein W; Provisional Probab=99.64 E-value=5.7e-14 Score=89.55 Aligned_cols=182 Identities=15% Similarity=0.059 Sum_probs=103.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 9443789999999998410000-246654356666655563455444444115421000122333344667777760268 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQ-AADPVRRAHHGGRGVVPTIATNRYVPIRHDFNGPYAGLSALYNGSFGEEAHHNAGGS 79 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~-Aad~~~~~~~~~~~~~~~~~~~~~~~~~~~w~G~y~g~~~g~~~~~~~~~~~~~~~G 79 (205) |||+.++++++++++++...|+ |.|...+.-.....+. .. ++.-.+.. ..-..++....+ T Consensus 1 Mkk~~~a~~~~~a~~s~~A~A~~aGd~~vr~g~~~V~P~--~~-----------s~~~~~~~------~~~~v~~~~~~~ 61 (212) T PRK10959 1 MKKLTVAALAVATLLSGSAFAHEAGDFIVRAGAATVRPN--EG-----------SDGVLGSL------GEFSVDNNTQLG 61 (212) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCC--CC-----------CCCCCCCC------CCEEECCCCEEE T ss_conf 937899999999875167788327998999668998347--88-----------88622566------512206885363 Q ss_pred EEEEEEEECCCEEEEEEEEEEEC---CCCCCCCCCCCCCEEEEEEEEEEEEE---CCCEEEEEEEEEEEEEEEECC---C Q ss_conf 99854640483799845101321---55566662122321489998975431---276778988544776444313---4 Q gi|254780598|r 80 IFAGYNVEDSCIMYGVEGDVRYT---VPVLADNIHSLHGIGGSLRIRGGYEV---SDSLLLYATVGPDVAQKYETG---K 150 (205) Q Consensus 80 ~~~GYn~q~~~~V~G~e~d~~~~---~~~~~~~~~~~~~~~~~~~~r~Gy~~---~~~~l~Y~~~G~a~~~~~~~~---~ 150 (205) +-..|-+..+ +++|.-.... +.........+.-+...-..-+.|-+ +.++-||+-+|+.++.+-.+. . T Consensus 62 lt~~Yf~tdn---i~vEll~~tpf~h~v~~~g~g~vg~~~~lPPtl~~qYhF~~~~~~~rPYVGaGvnYt~F~d~~~~~~ 138 (212) T PRK10959 62 LTFGYMATDN---IGVELLAATPFRHKVGLGGTGDIATVKHLPPTLMAQYYFGDAQSKLRPYVGAGVNYTTFFDNEFNDT 138 (212) T ss_pred EEEEEEEECC---EEEEEEEECCCCCEECCCCCCCEEEEEECCCEEEEEECCCCCCCCEEEEEEEEEEEEEEECCCCCCC T ss_conf 6899997188---3599999648700781467331587996498999997348999845135761067999971324565 Q ss_pred C------CCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCC--------------CCCCCEEEEEEEEEEC Q ss_conf 5------55677520479973466100488679999999975056--------------5566226999998539 Q gi|254780598|r 151 A------GEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSK--------------VEGFYNTISLGVGMKF 205 (205) Q Consensus 151 ~------~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~--------------~~~~~~~~~~G~~ykF 205 (205) . .-+-+..+|+.+-+|+|| +|++||+++++.+|++... ++.|--.+.+|++||| T Consensus 139 ~~~~~~~~l~l~~S~G~a~q~G~dy-~l~~~W~~n~dv~y~~i~T~a~~~~~~~~~~~~v~idP~v~~vgvgYrF 212 (212) T PRK10959 139 GKDAGLSDLSLKDSWGAAGQVGLDY-LINRDWLVNASVWYMDIDTDVKFKLGGAQQKISTRLDPWVFMFSAGYRF 212 (212) T ss_pred CCCCCCCCEEECCCEEEEEEEEEEE-EECCCEEEEEEEEEEECCCEEEEECCCCEEEEEEEECCEEEEEEEEEEC T ss_conf 5567745106627460799987899-9379839999999996234599963884066047708789997759989 No 7 >pfam02530 Porin_2 Porin subfamily. This family consists of porins from the alpha subdivision of Proteobacteria the members of this family are related to pfam00267. The porins form large aqueous channels in the cell membrane allowing the selective entry of hydrophilic compounds this so called 'molecular sieve' is found in the cell walls of gram negative bacteria. Probab=99.52 E-value=9.9e-13 Score=83.16 Aligned_cols=78 Identities=19% Similarity=0.140 Sum_probs=60.5 Q ss_pred CCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECC-----------------------------------CCCCCCCCC Q ss_conf 321489998975431276778988544776444313-----------------------------------455567752 Q gi|254780598|r 114 HGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETG-----------------------------------KAGEITPIA 158 (205) Q Consensus 114 ~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~-----------------------------------~~~~~~~~~ 158 (205) ..|.+.+|+|+.+.++|++.++..++|+....+... ......+.. T Consensus 245 ~~~~~Av~~~~~~~~g~g~s~~l~g~Y~~ga~~Y~~~~~~~~~~~~Wav~~~~~~~~t~k~~~~~~~~~~~~~~~~~~d~ 324 (378) T pfam02530 245 DKEEGAVRARLSVKAGDGDTLNIAGGYASGANRYTGDSEYHNWGGNWAVWAGYTYKATDKTAITPGAQWGGAGNVCNPDF 324 (378) T ss_pred CCCCEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEECCCCEEEEEECEEEECCCCCCCCCC T ss_conf 54250448999998679845899999815874223554443467750377656862333089820337502454347775 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCC Q ss_conf 0479973466100488679999999975056556 Q gi|254780598|r 159 IGGTAGVGVEVGGLSESLVARLEYRASKYSKVEG 192 (205) Q Consensus 159 ~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~ 192 (205) ..|.+|+.+.| .+.+|+.+++|++|+++++... T Consensus 325 ~~~~vg~nv~y-tpV~nl~~~~Ev~Yt~~d~k~~ 357 (378) T pfam02530 325 DGWAVGANVDY-TIVDNLTFTAEVRYTDLGQKYF 357 (378) T ss_pred CEEEEEEEEEE-EECCCEEEEEEEEEEECCCCCC T ss_conf 42688887887-8809968978999974687445 No 8 >pfam07437 YfaZ YfaZ precursor. This family contains the precursor of the bacterial protein YfaZ (approximately 180 residues long). Many members of this family are hypothetical proteins. Probab=99.42 E-value=1.4e-11 Score=77.21 Aligned_cols=175 Identities=15% Similarity=0.074 Sum_probs=108.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 94437899999999984100002466543566666555634554444441154210001223333446677777602689 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHHGGRGVVPTIATNRYVPIRHDFNGPYAGLSALYNGSFGEEAHHNAGGSI 80 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~~~~~~~~~~~~~~~~~~~~~w~G~y~g~~~g~~~~~~~~~~~~~~~G~ 80 (205) |||++++.+++.++++ ++|+|-++..+.-...... ..+-...-+ |......|...+ +++-.+++ T Consensus 1 mkk~~~~~a~~l~~~s--~~A~A~s~~~~~g~~~t~~--------~~~~g~~~~----gl~~~g~w~~sd--~dg~~~~~ 64 (180) T pfam07437 1 MAKFRVASVALLLLVA--LSVNASSFNLGLGNDYTNT--------GLGLGTNTD----GLAVSGNWIYSD--DDGDVAGG 64 (180) T ss_pred CCHHHHHHHHHHHHHH--HHCEEEEEECCCCCCEEEE--------EECCCCCCC----CEEEECCEEEEC--CCCCEECC T ss_conf 9035899999998763--2105788422466413777--------430378887----505421265787--99967324 Q ss_pred EEEEEEECCCEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCCCCCCCEE Q ss_conf 98546404837998451013215556666212232148999897543127677898854477644431345556775204 Q gi|254780598|r 81 FAGYNVEDSCIMYGVEGDVRYTVPVLADNIHSLHGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKAGEITPIAIG 160 (205) Q Consensus 81 ~~GYn~q~~~~V~G~e~d~~~~~~~~~~~~~~~~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g 160 (205) -.+++++.+...+.+-+-..+.+.+... +.+...+.+|+.+++.+++.+|+.+=|+=... .+..+.- T Consensus 65 gl~~~~~~Gp~~~~vGgK~~y~~~~~g~-----~G~Ava~GG~~~~~~~~~~sl~Ge~YyaP~~l--------ssGvd~y 131 (180) T pfam07437 65 GLGMNLPAGPHHATVGAKGSYLWPDEGP-----NGSAVAVGGGLALAIGPSFSLQGEAYYAPSVL--------SSGVDSY 131 (180) T ss_pred EEEEECCCCCEEEEEEEEEEEECCCCCC-----CCEEEEECCEEECCCCCCEEEEEEEEECCHHH--------CCCCHHH T ss_conf 0467405678389820369982766688-----63189862568624588578988789880673--------2562033 Q ss_pred EEEEEEEEEECCCCCEEEEEEEEEEECCCCCCC-----CEEEEEEEEEEC Q ss_conf 799734661004886799999999750565566-----226999998539 Q gi|254780598|r 161 GTAGVGVEVGGLSESLVARLEYRASKYSKVEGF-----YNTISLGVGMKF 205 (205) Q Consensus 161 ~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~-----~~~~~~G~~ykF 205 (205) .-+.+|++| .+.+++++.+.|||+++...|.. .+-+.||++.+| T Consensus 132 ~ea~~gv~~-~v~~pl~v~aGYRyi~~~~~dg~~d~~ladG~yvG~~~sF 180 (180) T pfam07437 132 YEANSGVRY-NINRPMALYVGYRYINFEGKDGARDNTFADGAYIGGKFRF 180 (180) T ss_pred EEECCCEEE-EECCCCEEEEEEEEEEEECCCCCCCCEECCCCEEEEEECC T ss_conf 373461699-9737728997579998663679877524237468337509 No 9 >pfam02462 Opacity Opacity family porin protein. Pathogenic Neisseria spp. possess a repertoire of phase-variable Opacity proteins that mediate various pathogen--host cell interactions. These proteins are integral membrane proteins related to other porins. Probab=99.34 E-value=1.2e-11 Score=77.58 Aligned_cols=115 Identities=22% Similarity=0.186 Sum_probs=81.0 Q ss_pred EEEEECCCEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEE--EECCCEEEEEEEEEEEEEEEECC-----CCCCCC Q ss_conf 54640483799845101321555666621223214899989754--31276778988544776444313-----455567 Q gi|254780598|r 83 GYNVEDSCIMYGVEGDVRYTVPVLADNIHSLHGIGGSLRIRGGY--EVSDSLLLYATVGPDVAQKYETG-----KAGEIT 155 (205) Q Consensus 83 GYn~q~~~~V~G~e~d~~~~~~~~~~~~~~~~~~~~~~~~r~Gy--~~~~~~l~Y~~~G~a~~~~~~~~-----~~~~~~ 155 (205) ||+|- +|=+ -+||.....-.++....+ . .++.+-+=| ..++++-||+-++++..+.+.+. ...+.+ T Consensus 2 GYdfg--~~R~--A~DY~~y~k~~~~~~~~~-~--~SlglsaiYDFdt~s~~kPYvGarv~~~~~~~~~~~~~~~~~s~S 74 (126) T pfam02462 2 GYRFG--NLRF--AVDYTRYGKVKANSTDFK-G--YSLGASVIYDFDTQSPVKPYVGARVSTNGFKVTASAELGGSDSFS 74 (126) T ss_pred CCCCC--CEEE--EECHHHHCCCCCCCCCEE-E--EEEEEEEEEECCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCC T ss_conf 61358--7999--820123152137875357-6--763678999715899521426756761222103555358755650 Q ss_pred CCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCC---CCCCEEEEEEEEEEC Q ss_conf 75204799734661004886799999999750565---566226999998539 Q gi|254780598|r 156 PIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKV---EGFYNTISLGVGMKF 205 (205) Q Consensus 156 ~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~---~~~~~~~~~G~~ykF 205 (205) +++.||-+.+||.| .|||||++.++|||..+++. +.+.|..++||-||| T Consensus 75 ~~~~g~gv~AGVgy-ditpnltLD~GYRY~~~G~le~tr~~theas~GvRY~F 126 (126) T pfam02462 75 DTRIGLGVLAGVSY-KLTPNVDLDTGYRYNRLGKVTVTNVRSGEASAGVRYKF 126 (126) T ss_pred CCEEEEEEEEEEEE-ECCCCEEEECCEEEEECCCCCCCEECCCCEEEEEEEEC T ss_conf 33599988743688-32787377125288004641343020121368866639 No 10 >pfam05736 OprF OprF membrane domain. This domain represents the presumed membrane spanning region of the OprF proteins. This region is involved in channel formation and is thought to form an 8-stranded beta-barrel. Probab=99.25 E-value=2.6e-10 Score=70.78 Aligned_cols=174 Identities=16% Similarity=0.118 Sum_probs=101.1 Q ss_pred CH-HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 94-43789999999998410000246654356666655563455444444115421000122333344667777760268 Q gi|254780598|r 1 MQ-KLFLAVGVSSLALASFCSAQAADPVRRAHHGGRGVVPTIATNRYVPIRHDFNGPYAGLSALYNGSFGEEAHHNAGGS 79 (205) Q Consensus 1 Mk-k~~la~av~~lal~~~~~A~Aad~~~~~~~~~~~~~~~~~~~~~~~~~~~w~G~y~g~~~g~~~~~~~~~~~~~~~G 79 (205) || |-.|-+++.+|+.+.+-.|.|...-..+ ..+ +.+.|. .....+..++..+| T Consensus 1 mklkntlg~aig~lvaa~s~~alAqgqgave----------------~e~---f~g~~~-------fDs~r~~~~g~~~G 54 (184) T pfam05736 1 MKLKNTLGFAIGSLVAATSFGALAQGQGAVE----------------IEA---FYKKQY-------NDSVRNMKNGDLPG 54 (184) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCCCCEE----------------EEE---CCCEEE-------CCCCCCCCCCCCCE T ss_conf 9532015578888888730167765467479----------------842---266475-------26313532489640 Q ss_pred EEEEEEEECCCEEEEEEEEEEECCCCCCCCCCCCCCE-EEEEEEEEEEEE---CCCEEEEEEEEEEEEEEEECCCCCCCC Q ss_conf 9985464048379984510132155566662122321-489998975431---276778988544776444313455567 Q gi|254780598|r 80 IFAGYNVEDSCIMYGVEGDVRYTVPVLADNIHSLHGI-GGSLRIRGGYEV---SDSLLLYATVGPDVAQKYETGKAGEIT 155 (205) Q Consensus 80 ~~~GYn~q~~~~V~G~e~d~~~~~~~~~~~~~~~~~~-~~~~~~r~Gy~~---~~~~l~Y~~~G~a~~~~~~~~~~~~~~ 155 (205) +-+||++..+ +.+|+-++..+...++......+. ....+...-|.| .+.+-||+.+|.+..+.+.......-. T Consensus 55 ~~iGY~lTdd---vsl~l~y~~~~~~r~~~~~g~~~i~g~~~rLDalYHF~~pg~~lrPYvs~G~gh~s~~~~~~g~~Gr 131 (184) T pfam05736 55 GSIGYFLTDD---VELNLSYDETHDTRSTDGTGNQKVGGDLTSLDAQYHFGTPGDGLRPYVSAGFAHQSITNVADGSQGR 131 (184) T ss_pred EEEEEECCCC---EEEEEEECCCCCHHCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCEEEEECCCCCEEECCCCCCCCC T ss_conf 3652571788---3899887342100013576661035634667779973898766337277314652000225687765 Q ss_pred CCCEEEEEEEEEEEECCCCCEEEEEEEEE-EECCCCCCCCEEEEEEEEEEC Q ss_conf 75204799734661004886799999999-750565566226999998539 Q gi|254780598|r 156 PIAIGGTAGVGVEVGGLSESLVARLEYRA-SKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 156 ~~~~g~~~G~Gvey~~it~~~~~r~EY~y-~df~~~~~~~~~~~~G~~ykF 205 (205) +..+=..+|+|+.| .|||||.+|+|-|- .++++-+.+ =..++|++|-| T Consensus 132 d~sT~~N~G~G~Ky-~itdn~~aRa~vra~y~lD~~~~d-~~a~vGl~~~F 180 (184) T pfam05736 132 DQSTFANIGAGAKY-YFTENFYARAGVEAQYGLDNGDWE-WAALVGLGVNF 180 (184) T ss_pred CCCEEEEECCCEEE-EECCCEEEEECHHHHCCCCCCCCC-EEEEEEEEEEC T ss_conf 64416763452689-960544776132531047788612-00455249960 No 11 >pfam03922 OmpW OmpW family. This family includes outer membrane protein W (OmpW) proteins from a variety of bacterial species. This protein may form the receptor for S4 colicins in E. coli. Probab=99.11 E-value=1e-08 Score=62.58 Aligned_cols=130 Identities=14% Similarity=0.088 Sum_probs=85.0 Q ss_pred CCCCCEEEEEEEEEEECCCEEEEEEEEEEEC---CCCCC---CCCCCCCCEEEEEEEEEEEEE---CCCEEEEEEEEEEE Q ss_conf 7776026899854640483799845101321---55566---662122321489998975431---27677898854477 Q gi|254780598|r 72 AHHNAGGSIFAGYNVEDSCIMYGVEGDVRYT---VPVLA---DNIHSLHGIGGSLRIRGGYEV---SDSLLLYATVGPDV 142 (205) Q Consensus 72 ~~~~~~~G~~~GYn~q~~~~V~G~e~d~~~~---~~~~~---~~~~~~~~~~~~~~~r~Gy~~---~~~~l~Y~~~G~a~ 142 (205) .++....++-..|-+..+ +++|+-.... +.... .....+.-+..-...-+.|.+ +.++.||+-+|+.+ T Consensus 32 v~~~~~~~l~~~Yf~tdn---ia~eli~g~p~~h~i~~~g~~~~~~lG~v~~lPPtlt~qYhf~~~~~~~rPYVGaGvnY 108 (189) T pfam03922 32 VGNDTQPGLTFTYFFTDN---VGVELLAATPFSHNIGGSGPGSNGLVGKTKHLPPTLTAQYHFGDASSGFRPYVGAGVNY 108 (189) T ss_pred ECCCCEEEEEEEEEEECC---EEEEEEEECCCEEEEECCCCCCCCCEEEEEECCCEEEEEEEECCCCCCEEEEEEEEEEE T ss_conf 768855857899997188---56999996697579834566678605879964989999993489999650567601689 Q ss_pred EEEEECCCC-----CCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCC--------------CCCCCEEEEEEEEE Q ss_conf 644431345-----55677520479973466100488679999999975056--------------55662269999985 Q gi|254780598|r 143 AQKYETGKA-----GEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSK--------------VEGFYNTISLGVGM 203 (205) Q Consensus 143 ~~~~~~~~~-----~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~--------------~~~~~~~~~~G~~y 203 (205) +.+-.+... .-+-+..+|+.+-+|++| +|+++|+++++.+|+++.. ++.|--.+.+|++| T Consensus 109 t~f~~~~~~~~~~~~~~~d~s~G~a~q~G~dy-~l~~~w~~n~dv~~~~i~t~at~~~g~~~~~~~v~lDP~v~~~gvgy 187 (189) T pfam03922 109 TTFFDEKLGSTGATDLKLKDSWGAAGQAGVDY-MVNDDWLVNMDVWYMFIDTTASYKLGGAKLKTKVKLDPWVFSIGLGY 187 (189) T ss_pred EEEECCCCCCCCCCEEEECCCEEEEEEEEEEE-EECCCEEEEEEEEEEECCCEEEEECCCCEEEEEEEECCEEEEEEEEE T ss_conf 99983445888854689706540899998899-96898399999999962456999728850785799588899987698 Q ss_pred EC Q ss_conf 39 Q gi|254780598|r 204 KF 205 (205) Q Consensus 204 kF 205 (205) || T Consensus 188 rF 189 (189) T pfam03922 188 RF 189 (189) T ss_pred EC T ss_conf 79 No 12 >COG3047 OmpW Outer membrane protein W [Cell envelope biogenesis, outer membrane] Probab=99.04 E-value=1.8e-08 Score=61.32 Aligned_cols=130 Identities=14% Similarity=0.104 Sum_probs=83.0 Q ss_pred CCCCCEEEEEEEEEEECCCEEEEEEEEEEEC--CCCCCC----CCCCCCCEEEEEEEEEEEE---ECCCEEEEEEEEEEE Q ss_conf 7776026899854640483799845101321--555666----6212232148999897543---127677898854477 Q gi|254780598|r 72 AHHNAGGSIFAGYNVEDSCIMYGVEGDVRYT--VPVLAD----NIHSLHGIGGSLRIRGGYE---VSDSLLLYATVGPDV 142 (205) Q Consensus 72 ~~~~~~~G~~~GYn~q~~~~V~G~e~d~~~~--~~~~~~----~~~~~~~~~~~~~~r~Gy~---~~~~~l~Y~~~G~a~ 142 (205) ..++...++-.-|-+..+ +++|.-.... ...+.. ...+..-|..-...-+.|- -...+-||+-+|+.+ T Consensus 54 v~~~~~pgL~~TY~ftdN---i~vEli~~t~~~h~~g~~~~~~~g~i~~~~~lPpTlt~Qyhf~~~~s~frPYvGaG~nY 130 (213) T COG3047 54 VSNNTQPGLTFTYFFTDN---IGVELIAATPFAHKIGVGGLGGLGKIGKTKHLPPTLTAQYHFGDASSKFRPYVGAGLNY 130 (213) T ss_pred ECCCCCCCEEEEEEEECC---EEEEEEEECCCCEEECCCCCCCCCCCCCEEEECCEEEEEEECCCCCCCCCCCEEECCCE T ss_conf 736764433689997377---24677861340203055676555311417880868899973168655544214503334 Q ss_pred EEEEECCCC-----CCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCC----------------CCCCCEEEEEEE Q ss_conf 644431345-----55677520479973466100488679999999975056----------------556622699999 Q gi|254780598|r 143 AQKYETGKA-----GEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSK----------------VEGFYNTISLGV 201 (205) Q Consensus 143 ~~~~~~~~~-----~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~----------------~~~~~~~~~~G~ 201 (205) +.+..+... .-.-+..+|...-+|++| +++++|.++..-+|.++.+ ++.|-=-+.+|+ T Consensus 131 t~f~~e~~~~~~~~~lkl~ns~G~a~q~G~Dy-~l~~~w~ln~dV~~~~i~t~a~~~~~~g~~~~~~kvkLdPwv~~~Gi 209 (213) T COG3047 131 TTFFDEKFGSTGLSDLKLKDSWGAAGQAGVDY-MLNDDWLLNMDVKYMFIDTTAKYKAGLGGAKLKSKVKLDPWVFMIGI 209 (213) T ss_pred EEEEEECCCCCCCCEEEECCCCCEEEEEEEEE-EECCCEEEEEEEEEEECCCCEEECCCCCCCEEECCEECCCEEEEEEE T ss_conf 89985014777754045135546116854558-90456188778899961554034246677505410341764999532 Q ss_pred EEEC Q ss_conf 8539 Q gi|254780598|r 202 GMKF 205 (205) Q Consensus 202 ~ykF 205 (205) +||| T Consensus 210 gyRF 213 (213) T COG3047 210 GYRF 213 (213) T ss_pred EECC T ss_conf 6619 No 13 >PRK10716 long-chain fatty acid outer membrane transporter; Provisional Probab=98.36 E-value=7.1e-05 Score=42.79 Aligned_cols=79 Identities=9% Similarity=-0.007 Sum_probs=50.2 Q ss_pred EEECCCEEEEEEEEEEEEEEE----ECCC-----CCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECC------C- Q ss_conf 431276778988544776444----3134-----55567752047997346610048867999999997505------6- Q gi|254780598|r 126 YEVSDSLLLYATVGPDVAQKY----ETGK-----AGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYS------K- 189 (205) Q Consensus 126 y~~~~~~l~Y~~~G~a~~~~~----~~~~-----~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~------~- 189 (205) +.++|++.+-+.+-+..+..- .... ...+.+++..|.+.+|+|| +++++|.+|+.|-|-+=. + T Consensus 297 ~~~~~~~~l~~d~~wt~WS~~~~l~~~~~~g~~~~~~~~nw~Dtw~~~~G~~Y-~~~~~l~LRaG~aYD~sPv~~~~~~~ 375 (437) T PRK10716 297 NRVAPQWAIHYSLAYTSWSQFQELKATSTNGDTLFQKHEGFKDAYRIALGTTY-YYDDNWTFRTGIAFDDSPVPAQNRSI 375 (437) T ss_pred EECCCCEEEEEEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCEEEEEEEEE-ECCCCCEEEEEEEECCCCCCCCCCCC T ss_conf 96385569999999965146552334437885132554577410079978999-87998189887787468888886562 Q ss_pred --CCCCCEEEEEEEEEEC Q ss_conf --5566226999998539 Q gi|254780598|r 190 --VEGFYNTISLGVGMKF 205 (205) Q Consensus 190 --~~~~~~~~~~G~~ykF 205 (205) =|.+.+-+++|++||+ T Consensus 376 ~iPd~dR~~~S~G~~y~~ 393 (437) T PRK10716 376 SIPDQDRFWLSAGTTYAF 393 (437) T ss_pred CCCCCCCEEEEEEEEEEC T ss_conf 076888859981268854 No 14 >pfam03349 Toluene_X Outer membrane protein transport protein (OMPP1/FadL/TodX). This family includes TodX from Pseudomonas putida F1 and TbuX from Ralstonia pickettii PKO1. These are membrane proteins of uncertain function that are involved in toluene catabolism. Related proteins involved in the degradation of similar aromatic hydrocarbons are also in this family, such as CymD. This family also includes FadL involved in translocation of long-chain fatty acids across the outer membrane. It is also a receptor for the bacteriophage T2. Probab=98.28 E-value=0.00014 Score=41.22 Aligned_cols=85 Identities=14% Similarity=0.098 Sum_probs=54.8 Q ss_pred EEEEEEEEECCCEEEEEEEEEEEEEE-EE---C--------CCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEEC Q ss_conf 99897543127677898854477644-43---1--------345556775204799734661004886799999999750 Q gi|254780598|r 120 LRIRGGYEVSDSLLLYATVGPDVAQK-YE---T--------GKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKY 187 (205) Q Consensus 120 ~~~r~Gy~~~~~~l~Y~~~G~a~~~~-~~---~--------~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df 187 (205) +..-+.|.++|++.+-+-.-+..+.. +. . .....+..++..|.+.+|+|| .++|+|.+|+.|.|.+= T Consensus 261 ~~~g~~~~~~~~~~l~~d~~~~~WS~~~~l~~~~~~~~~~~~~~~~~~~w~d~~~~~~G~eY-~~~~~l~LRaG~~yd~s 339 (415) T pfam03349 261 LELGISHQFNDQWAVHASVKRTFWSDFDKLIFVLFVFAKGGTDLSLPQNYRDTTTYALGTDY-QFNDRWTLRAGYAYDQS 339 (415) T ss_pred EEEEEEEECCCCEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEE-ECCCCEEEEEEEEECCC T ss_conf 99999998489889999999987300001022344556776453467684150089878999-85998699974765467 Q ss_pred C--C-------CCCCCEEEEEEEEEEC Q ss_conf 5--6-------5566226999998539 Q gi|254780598|r 188 S--K-------VEGFYNTISLGVGMKF 205 (205) Q Consensus 188 ~--~-------~~~~~~~~~~G~~ykF 205 (205) . + -+.+.+.+++|++|++ T Consensus 340 pv~~~~~~~~~Pd~dr~~~s~G~~y~~ 366 (415) T pfam03349 340 ATPAESRSARIPDTDRTWLSLGATYAF 366 (415) T ss_pred CCCCCCCCEECCCCCCEEEEEEEEEEC T ss_conf 878776360045778669994058985 No 15 >COG2067 FadL Long-chain fatty acid transport protein [Lipid metabolism] Probab=98.23 E-value=2.6e-05 Score=45.08 Aligned_cols=88 Identities=14% Similarity=0.001 Sum_probs=59.5 Q ss_pred CEEEEEEEEEEEEECCCEEEEEEEEEEEEE----EEECCC-----CCCCCC-CCEEEEEEEEEEEECCCCCEEEEEEEEE Q ss_conf 214899989754312767789885447764----443134-----555677-5204799734661004886799999999 Q gi|254780598|r 115 GIGGSLRIRGGYEVSDSLLLYATVGPDVAQ----KYETGK-----AGEITP-IAIGGTAGVGVEVGGLSESLVARLEYRA 184 (205) Q Consensus 115 ~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~----~~~~~~-----~~~~~~-~~~g~~~G~Gvey~~it~~~~~r~EY~y 184 (205) ....++.. ...+.|++++++.+=+..+. ++.... .....+ .+.+|++.+|++| .++|.|++|+.+.| T Consensus 285 P~~~el~~--~~~~~d~w~~~~s~~wT~WS~f~~l~~~~~~~~~~~~~~~~~yrD~wt~a~G~~Y-~~nd~~tlragiay 361 (440) T COG2067 285 PASAELSG--QHKVADQWAIHGSVKWTDWSSFDKLDFVFTFGKTLFAKTEDGYRDTWTVALGTTY-KFNDQWTLRAGIAY 361 (440) T ss_pred CCEEEEEE--EECCCCCEEEEEEEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCEEEEECEE-ECCCCCEEEEEEEE T ss_conf 85799722--6403887079988998651677448889768875224653341043179400106-73866337500334 Q ss_pred EEC----C--CC---CCCCEEEEEEEEEEC Q ss_conf 750----5--65---566226999998539 Q gi|254780598|r 185 SKY----S--KV---EGFYNTISLGVGMKF 205 (205) Q Consensus 185 ~df----~--~~---~~~~~~~~~G~~ykF 205 (205) .+= . +. +.|...+++|+.|+| T Consensus 362 D~s~s~~~~~~~~iPd~Dr~~~s~G~~Y~~ 391 (440) T COG2067 362 DQSPSPAQNRSISIPDTDRWWLSLGTTYKF 391 (440) T ss_pred CCCCCCCCCCCCCCCCCCCEEEECCCEEEC T ss_conf 578885445543247877579966647725 No 16 >PRK10716 long-chain fatty acid outer membrane transporter; Provisional Probab=98.04 E-value=0.00026 Score=39.93 Aligned_cols=66 Identities=17% Similarity=0.143 Sum_probs=31.7 Q ss_pred EEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCC---------------------------CCCCCCCEEEEEEEEEEEE Q ss_conf 8999897543127677898854477644431345---------------------------5567752047997346610 Q gi|254780598|r 118 GSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKA---------------------------GEITPIAIGGTAGVGVEVG 170 (205) Q Consensus 118 ~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~---------------------------~~~~~~~~g~~~G~Gvey~ 170 (205) ..+.+-+.|.++|++.+=+-.-+.+.+.+.+... ...+...+|+-+-+|+-| T Consensus 153 i~inPsvAykvnd~lSvG~G~~~~y~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~Gd~~g~G~nlG~~~- 231 (437) T PRK10716 153 MNLNLSGAYRLNNAWSFGLGFDAVYARAKIERFAGDLGQLVAGQGQALAATAGGIPSNTKIAHLNGNQWGFGWNAGILY- 231 (437) T ss_pred EEEEEEEEEECCCCEEEEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCEEEEEEEEEEE- T ss_conf 9995677786078789987069999888844213442321234556654445677876530366145357889999999- Q ss_pred CCCCCEEEEEEEEE Q ss_conf 04886799999999 Q gi|254780598|r 171 GLSESLVARLEYRA 184 (205) Q Consensus 171 ~it~~~~~r~EY~y 184 (205) .++|++.+.+-||- T Consensus 232 ~~~~~~~~GlsYRS 245 (437) T PRK10716 232 ELDKNNRYALTYRS 245 (437) T ss_pred EECCCCEEEEEEEC T ss_conf 81789789999852 No 17 >pfam03797 Autotransporter Autotransporter beta-domain. Secretion of protein products occurs by a number of different pathways in bacteria. One of these pathways known as the type V pathway was first described for the IgA1 protease. The protein component that mediates secretion through the outer membrane is contained within the secreted protein itself, hence the proteins secreted in this way are called autotransporters. This family corresponds to the presumed integral membrane beta-barrel domain that transports the protein. This domain is found at the C terminus of the proteins it occurs in. The N terminus contains the variable passenger domain that is translocated across the membrane. Once the passenger domain is exported it is cleaved auto-catalytically in some proteins, in others a different protease is used and in some cases no cleavage occurs. Probab=97.86 E-value=0.0011 Score=36.68 Aligned_cols=130 Identities=23% Similarity=0.243 Sum_probs=73.9 Q ss_pred CCCCCCCCCCCCCCCCC--------------CCCCCCEEEEEEEEEEEC-CCEEEEEEEEEEECCCCCCCC--------- Q ss_conf 21000122333344667--------------777760268998546404-837998451013215556666--------- Q gi|254780598|r 54 NGPYAGLSALYNGSFGE--------------EAHHNAGGSIFAGYNVED-SCIMYGVEGDVRYTVPVLADN--------- 109 (205) Q Consensus 54 ~G~y~g~~~g~~~~~~~--------------~~~~~~~~G~~~GYn~q~-~~~V~G~e~d~~~~~~~~~~~--------- 109 (205) .++|+-..++++....+ .....+...+.+||+|.. +++.+-..+.+.|.....+.- T Consensus 85 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~gy~~~~~~~~~i~P~~~l~y~~~~~~~f~E~g~~~~~ 164 (265) T pfam03797 85 GGLYLDGVLAYGRFDNDVKRLGTSTGTAKGDYDSHGLGASLEAGYRFKLSGNLTLTPFAGLQYVYLRQDGFTESGARSFA 164 (265) T ss_pred CCCEEEEEEEEEEEECEEEEECCCCCCCEEEEEEEEEEEEEEEEECCCCCCCEEEEEEEEEEEEEEECCCEEEECCCCCC T ss_conf 98369767888997150487306787302228024999999987623326975996999999999876876670998435 Q ss_pred CCC----CCCEEEEEEEEEEEEECC-CEEEEEEEEEEEEEEEECC---------------CCCCCCCCCEEEEEEEEEEE Q ss_conf 212----232148999897543127-6778988544776444313---------------45556775204799734661 Q gi|254780598|r 110 IHS----LHGIGGSLRIRGGYEVSD-SLLLYATVGPDVAQKYETG---------------KAGEITPIAIGGTAGVGVEV 169 (205) Q Consensus 110 ~~~----~~~~~~~~~~r~Gy~~~~-~~l~Y~~~G~a~~~~~~~~---------------~~~~~~~~~~g~~~G~Gvey 169 (205) ... .......+..|+.+.+.+ ++-+|..+++.+....... .....+..+....+++|+++ T Consensus 165 l~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~ 244 (265) T pfam03797 165 LSVDSQSLTNLSGPLGLRLEYRFSLRSLKPYATLGWAYEFDDYRQVPTNGATLLASGGSWVTSGTPLARNAAELGAGVEL 244 (265) T ss_pred EEEECCCCCEEEEEEEEEEEEEECCCEEEEEEEEEEEEECCCCCCCEEEEEEEECCCCCEEECCCCCCCEEEEEEEEEEE T ss_conf 17844666589999999999985287364899999999977887550399997268984299675666549999988999 Q ss_pred ECCCCCEEEEEEEEE Q ss_conf 004886799999999 Q gi|254780598|r 170 GGLSESLVARLEYRA 184 (205) Q Consensus 170 ~~it~~~~~r~EY~y 184 (205) .+++++++.++|+. T Consensus 245 -~~~~~~~l~~~~~~ 258 (265) T pfam03797 245 -ALGKNLSLFLNYSG 258 (265) T ss_pred -EECCCEEEEEEEEE T ss_conf -98998699999970 No 18 >pfam03349 Toluene_X Outer membrane protein transport protein (OMPP1/FadL/TodX). This family includes TodX from Pseudomonas putida F1 and TbuX from Ralstonia pickettii PKO1. These are membrane proteins of uncertain function that are involved in toluene catabolism. Related proteins involved in the degradation of similar aromatic hydrocarbons are also in this family, such as CymD. This family also includes FadL involved in translocation of long-chain fatty acids across the outer membrane. It is also a receptor for the bacteriophage T2. Probab=97.76 E-value=0.0016 Score=35.84 Aligned_cols=85 Identities=14% Similarity=0.152 Sum_probs=46.4 Q ss_pred EEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCC-CCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCC----- Q ss_conf 899989754312767789885447764443134555-67752047997346610048867999999997505655----- Q gi|254780598|r 118 GSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKAGE-ITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVE----- 191 (205) Q Consensus 118 ~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~-~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~----- 191 (205) +.++.-+.|.++|++ -.++|+++-+.-....... .-....-+.+.+|+.| .+++++++.+-|.|..+.+.+ T Consensus 314 ~~~~~G~eY~~~~~l--~LRaG~~yd~spv~~~~~~~~~Pd~dr~~~s~G~~y-~~~~~~~~d~ay~y~~~~~~~~~~~~ 390 (415) T pfam03349 314 TTYALGTDYQFNDRW--TLRAGYAYDQSATPAESRSARIPDTDRTWLSLGATY-AFTKDLTIDAGFSYLFGKKVSIKENN 390 (415) T ss_pred EEEEEEEEEECCCCE--EEEEEEEECCCCCCCCCCCEECCCCCCEEEEEEEEE-ECCCCCEEEEEEEEEECCCCCEEECC T ss_conf 089878999859986--999747654678787763600457786699940589-85999589999999960655310016 Q ss_pred -----------CCCEEEEEEEEEEC Q ss_conf -----------66226999998539 Q gi|254780598|r 192 -----------GFYNTISLGVGMKF 205 (205) Q Consensus 192 -----------~~~~~~~~G~~ykF 205 (205) .+.|-+.++++||| T Consensus 391 ~~~~~~~g~~~~~~~~~~~~~~y~F 415 (415) T pfam03349 391 VTNGTLPGTSSGSAHLFALSYNYRF 415 (415) T ss_pred CCCCEEEEEEEEEEEEEEEEEEEEC T ss_conf 7786078999748999999887679 No 19 >pfam10082 DUF2320 Uncharacterized protein conserved in bacteria (DUF2320). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=97.63 E-value=0.0026 Score=34.75 Aligned_cols=48 Identities=17% Similarity=0.063 Sum_probs=36.2 Q ss_pred CCEEEEEEEEEEEECCCCCEEEEEEEEEEECCC----CCCCCEEEEEEEEEEC Q ss_conf 520479973466100488679999999975056----5566226999998539 Q gi|254780598|r 157 IAIGGTAGVGVEVGGLSESLVARLEYRASKYSK----VEGFYNTISLGVGMKF 205 (205) Q Consensus 157 ~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~----~~~~~~~~~~G~~ykF 205 (205) ....+.+++|+.| .++++|.+.+.|+|.+... .+-+.|.|.|||..|+ T Consensus 329 ~D~~~~~~~~l~Y-~~~r~l~l~~~y~~~~~~S~~~~~~y~~n~v~l~l~~~~ 380 (380) T pfam10082 329 KDETYSAEAGLTY-KLNRWLDLTAGVRYEDLDSNIPGRSYDANVVGLGLTLQR 380 (380) T ss_pred CCCEEEEEEEEEE-EEECCEEEEEEEEEEEEECCCCCCCEEEEEEEEEEEEEC T ss_conf 0133999999999-841779999999999703788878625059999999979 No 20 >pfam06629 MipA MltA-interacting protein MipA. This family consists of several bacterial MltA-interacting protein (MipA) like sequences. As well as interacting with the membrane-bound lytic transglycosylase MltA, MipA is known to bind to PBP1B, a bifunctional murein transglycosylase/transpeptidase. MipA is considered to be a structural protein mediating the assembly of MltA to PBP1B into a complex. Probab=97.61 E-value=0.0028 Score=34.60 Aligned_cols=117 Identities=15% Similarity=0.145 Sum_probs=70.4 Q ss_pred EEEEEEEEEECCCEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCC------ Q ss_conf 68998546404837998451013215556666212232148999897543127677898854477644431345------ Q gi|254780598|r 78 GSIFAGYNVEDSCIMYGVEGDVRYTVPVLADNIHSLHGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKA------ 151 (205) Q Consensus 78 ~G~~~GYn~q~~~~V~G~e~d~~~~~~~~~~~~~~~~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~------ 151 (205) .|+.. +++.+.+.+.++...+-.+ ....+...++...-+++++++.+-..+++.+.+.+..... T Consensus 87 ~G~~~--~~~~~~~~~~~~~~~dv~~--------~~~G~~~~~~~~~~~~~~~~~~l~~~~~~~~~s~~y~~yyyGV~~~ 156 (226) T pfam06629 87 GGLKA--AYDTGWGTVRAELEQDISN--------GSRGFVADIGLAATYDLGDRLSLSPSLGLSYASRKYNDYYFGVSAA 156 (226) T ss_pred EEEEE--EEECCCEEEEEEEEEECCC--------CCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCHHHHHHHHCCCHH T ss_conf 17899--9994888999999993246--------7388999999999996399779999789998677675655377978 Q ss_pred -------CC-CCCCC-EEEEEEEEEEEECCCCCEEEEEEEEEEECCCC--------CCCCEEEEEEEEEEC Q ss_conf -------55-67752-04799734661004886799999999750565--------566226999998539 Q gi|254780598|r 152 -------GE-ITPIA-IGGTAGVGVEVGGLSESLVARLEYRASKYSKV--------EGFYNTISLGVGMKF 205 (205) Q Consensus 152 -------~~-~~~~~-~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~--------~~~~~~~~~G~~ykF 205 (205) .. ..+.. ..+.+++-+.| .|++||.+.+.-+|+.+.+. +-..-++.+|+.|+| T Consensus 157 ea~~s~~~~Y~a~~g~~~~~~~~~~~y-~l~~~w~~~~~~~~~~L~~~~~dSPiv~~~~~~~~~~g~~Y~F 226 (226) T pfam06629 157 ESARSGLAQYNAGAGLWSAGLGLSATY-MLTDHWSALGSASYTRLSGDAADSPIVERRGQPSAGLGVAYRF 226 (226) T ss_pred HHHHCCCCCCCCCCCEEEEEEEEEEEE-EECCCEEEEEEEEEEECCCCHHCCCCEEECCCEEEEEEEEEEC T ss_conf 961237864057877078999999999-9669999999999998488211499588678789999999989 No 21 >pfam00267 Porin_1 Gram-negative porin. Probab=97.60 E-value=0.0029 Score=34.55 Aligned_cols=43 Identities=16% Similarity=0.342 Sum_probs=35.5 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEECCCCCC------CCEEEEEEEEEEC Q ss_conf 9973466100488679999999975056556------6226999998539 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARLEYRASKYSKVEG------FYNTISLGVGMKF 205 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~EY~y~df~~~~~------~~~~~~~G~~ykF 205 (205) .+-+|+|| .++++..+-+||++....+.+. ..|.++|||-|+| T Consensus 286 q~~lg~dY-~fSKrt~~Y~~Y~~~~~d~~~~~~~~~~~dd~~~vGlRY~F 334 (334) T pfam00267 286 YVDVGATY-YFNKNMSAYVDYKINLLDKGDDFAAGINTDDAVAVGLVYQF 334 (334) T ss_pred EEEEEEEE-EECCCCEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEC T ss_conf 99999899-96588049999987625666544467776888999808819 No 22 >COG2067 FadL Long-chain fatty acid transport protein [Lipid metabolism] Probab=97.53 E-value=0.0037 Score=33.97 Aligned_cols=84 Identities=18% Similarity=0.170 Sum_probs=43.5 Q ss_pred EEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCC--CCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCC----- Q ss_conf 8999897543127677898854477644431345--556775204799734661004886799999999750565----- Q gi|254780598|r 118 GSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKA--GEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKV----- 190 (205) Q Consensus 118 ~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~--~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~----- 190 (205) +++..-.+|.++|++ -.++|+++-+.-..... ... .....|.+++|+-| .+|+++.+++-|-|....+. T Consensus 339 wt~a~G~~Y~~nd~~--tlragiayD~s~s~~~~~~~~i-Pd~Dr~~~s~G~~Y-~~t~n~~vd~g~ay~~~~~~s~~~~ 414 (440) T COG2067 339 WTVALGTTYKFNDQW--TLRAGIAYDQSPSPAQNRSISI-PDTDRWWLSLGTTY-KFTKNLEVDAGYAYLKGKSVSINEV 414 (440) T ss_pred CEEEEECEEECCCCC--EEEEEEEECCCCCCCCCCCCCC-CCCCCEEEECCCEE-ECCCCEEEEEEEEEEEECCCCCCCC T ss_conf 179400106738663--3750033457888544554324-78775799666477-2599838998889998414421024 Q ss_pred -----------CCCCEEEEEEEEEEC Q ss_conf -----------566226999998539 Q gi|254780598|r 191 -----------EGFYNTISLGVGMKF 205 (205) Q Consensus 191 -----------~~~~~~~~~G~~ykF 205 (205) +...+-+.+.++||| T Consensus 415 ~~~g~~~~~~~~~~a~~~~ls~~ykF 440 (440) T COG2067 415 RGAGVTLSGFSKGSAHLFGLSLNYKF 440 (440) T ss_pred CCCCCCCCCCCCEEEEEEEEEEEEEC T ss_conf 56773002311117999998877619 No 23 >cd00342 gram_neg_porins Porins form aqueous channels for the diffusion of small hydrophillic molecules across the outer membrane. Individual 16-strand anti-parallel beta-barrels form a central pore, and trimerizes thru mainly hydrophobic interactions at the interface. Trimers are stabilized by hytrophillic clamping of Loop L2. Loop 3 bends into the pore, creating an elliptical constriction of about 7 x 11A, large enough to allow passage of a glucose molecule without steric hindrance. Removal of the C-terminal residue (usuallly F) destabilizes the trimer and removal of the 16th beta-sheet abolishes trimerization. Unlike typical membrane proteins, porins lack long hydrophobic stretches. Short turns are found at the smooth, periplasmic end, longer irregular loops are found at the rough, extracellular end. C-terminal residue forms salt bridge with N-terminus. Probab=97.42 E-value=0.0052 Score=33.21 Aligned_cols=24 Identities=25% Similarity=0.282 Sum_probs=10.8 Q ss_pred EEEEEEEECCCCCEEEEEEEEEEEC Q ss_conf 9734661004886799999999750 Q gi|254780598|r 163 AGVGVEVGGLSESLVARLEYRASKY 187 (205) Q Consensus 163 ~G~Gvey~~it~~~~~r~EY~y~df 187 (205) +.+|++| .|+++..+-+||.|.+- T Consensus 275 ~~lG~~Y-~lSKrT~lY~~~~~~~n 298 (329) T cd00342 275 VALGADY-ALSKRTDLYAEYGYQKN 298 (329) T ss_pred EEEEEEE-EECCCHHHHHHHHHEEC T ss_conf 9999999-71554898665054110 No 24 >COG3203 OmpC Outer membrane protein (porin) [Cell envelope biogenesis, outer membrane] Probab=97.33 E-value=0.0068 Score=32.62 Aligned_cols=49 Identities=14% Similarity=0.211 Sum_probs=39.5 Q ss_pred CCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCC--------CCCEEEEEEEEEEC Q ss_conf 752047997346610048867999999997505655--------66226999998539 Q gi|254780598|r 156 PIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVE--------GFYNTISLGVGMKF 205 (205) Q Consensus 156 ~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~--------~~~~~~~~G~~ykF 205 (205) +......+-+|++| .+++|+++.+||.|.+..+.+ ...+.+.|||-|+| T Consensus 298 ~~~~~~~~~~ga~Y-~fsKrt~~Yv~y~~~~~~~~~~~~~~~~~~~~~~v~VGlr~~F 354 (354) T COG3203 298 DKDLVKYVDVGATY-YFSKNTSTYVDYGINLLDSDDNATVPGGGNTDNGVAVGLRYQF 354 (354) T ss_pred CCCCCEEEEEEEEE-EEECCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEC T ss_conf 88760599950179-9804724899988874156675304566654313899879869 No 25 >pfam11854 DUF3374 Protein of unknown function (DUF3374). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 665 to 712 amino acids in length. Probab=97.26 E-value=0.0081 Score=32.23 Aligned_cols=42 Identities=21% Similarity=0.312 Sum_probs=29.9 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEECCCC---------------------CCCCEEEEEEEEEEC Q ss_conf 99734661004886799999999750565---------------------566226999998539 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARLEYRASKYSKV---------------------EGFYNTISLGVGMKF 205 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~EY~y~df~~~---------------------~~~~~~~~~G~~ykF 205 (205) ++.+|++| ..++|.+.+.|.-+.|.+. |.+.|++++--.|++ T Consensus 221 ~l~agv~~--~Gd~W~~~l~Y~gS~f~Nd~~~L~~~~~~~~~~~~~a~~PDN~ahqlslsG~y~l 283 (675) T pfam11854 221 TLKAGVSL--SGDNWLADLSYNGSFFRNDISALSWDNAFNPYGGQLALAPDNQAHQLSLSGQYNL 283 (675) T ss_pred EEECCEEE--ECCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHEEEEEEEEEC T ss_conf 47204089--4565078667745676557430103777655445305599863436888778703 No 26 >cd00342 gram_neg_porins Porins form aqueous channels for the diffusion of small hydrophillic molecules across the outer membrane. Individual 16-strand anti-parallel beta-barrels form a central pore, and trimerizes thru mainly hydrophobic interactions at the interface. Trimers are stabilized by hytrophillic clamping of Loop L2. Loop 3 bends into the pore, creating an elliptical constriction of about 7 x 11A, large enough to allow passage of a glucose molecule without steric hindrance. Removal of the C-terminal residue (usuallly F) destabilizes the trimer and removal of the 16th beta-sheet abolishes trimerization. Unlike typical membrane proteins, porins lack long hydrophobic stretches. Short turns are found at the smooth, periplasmic end, longer irregular loops are found at the rough, extracellular end. C-terminal residue forms salt bridge with N-terminus. Probab=97.02 E-value=0.015 Score=30.90 Aligned_cols=141 Identities=16% Similarity=0.132 Sum_probs=69.3 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCEEEEEEEEEEECCCCC-CCCCCCCCCEEEEEEEEEEEEECC Q ss_conf 5421000122333344667777760268998546404837998451013215556-666212232148999897543127 Q gi|254780598|r 52 DFNGPYAGLSALYNGSFGEEAHHNAGGSIFAGYNVEDSCIMYGVEGDVRYTVPVL-ADNIHSLHGIGGSLRIRGGYEVSD 130 (205) Q Consensus 52 ~w~G~y~g~~~g~~~~~~~~~~~~~~~G~~~GYn~q~~~~V~G~e~d~~~~~~~~-~~~~~~~~~~~~~~~~r~Gy~~~~ 130 (205) .|.|+.++.+.+....... ...+...++-+-|... .+-+++--+-....... ......... -..++-+.|.+++ T Consensus 139 ~~~Gf~~~a~y~~g~~~~~-~~~~~~~~~~~~Y~~g--~~~~~~~y~~~~~~~~~~~~~~~~~~~--~~~~~ga~Y~~g~ 213 (329) T cd00342 139 FFGGLTFGAMYAFGNQAGS-TSNGRGYGAGLSYENG--PLSLGAAYDQQRNGGGAAGGAAGATSQ--RAYGAGASYDFGG 213 (329) T ss_pred CCCCEEEEEEEECCCCCCC-CCCCCEEEEEEEEECC--CEEEEEEEEEEECCCCCCCCCCCCCCE--EEEEEEEEEEECC T ss_conf 6577899999953667787-7778579999999529--999999999850676444566776514--8999999999689 Q ss_pred CEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCC---CCCCEEEEEEEEEEC Q ss_conf 677898854477644431345556775204799734661004886799999999750565---566226999998539 Q gi|254780598|r 131 SLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKV---EGFYNTISLGVGMKF 205 (205) Q Consensus 131 ~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~---~~~~~~~~~G~~ykF 205 (205) + +..+.|... +.+......+.....| -+|+.| .+++.+.+++.|.+.+-.+. +.+.+.+.+|+.|+| T Consensus 214 -~--~l~~~Y~~~--~~~~~~~~~~~~~~~~--~~ga~y-~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~lG~~Y~l 283 (329) T cd00342 214 -L--KLGAGYTNT--RNDNGGGGGSAKFNGY--ELGATY-QLTPALRLGAAYYYTKDRNDGGGDGKANQVALGADYAL 283 (329) T ss_pred -E--EEEEEEEEE--ECCCCCCCCCEEEEEE--EEEEEE-ECCCCEEEEEEEEEEECCCCCCCCCCEEEEEEEEEEEE T ss_conf -9--999999998--8357777875026899--999999-80787389999999944567688897189999999971 No 27 >TIGR03509 OMP_MtrB_PioB decaheme-associated outer membrane protein, MtrB/PioB family. Members of this protein family are integral proteins of the bacterial outer membrane, associated with multiheme c-type cytochromes involved in electron transfer. The MtrB protein of Shewanella oneidensis MR-1 (SO1776) has been shown to form a complex with 1:1:1 stochiometry with the small, periplasmic decaheme cytochrome MtrA and large, surface-exposed decaheme cytochrome MtrC. Probab=96.96 E-value=0.017 Score=30.63 Aligned_cols=67 Identities=22% Similarity=0.237 Sum_probs=36.8 Q ss_pred EEEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCC-CCC-----CCEEEEEEEEEEEECC-CCCEEEEEEEEEE Q ss_conf 99989754312767789885447764443134555-677-----5204799734661004-8867999999997 Q gi|254780598|r 119 SLRIRGGYEVSDSLLLYATVGPDVAQKYETGKAGE-ITP-----IAIGGTAGVGVEVGGL-SESLVARLEYRAS 185 (205) Q Consensus 119 ~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~-~~~-----~~~g~~~G~Gvey~~i-t~~~~~r~EY~y~ 185 (205) ....-+.|.++|.+.+|+-.+.-+.+.+-.+.... .++ ..---++|+|++|..| .++|++..+|.|+ T Consensus 486 ~ydl~~sy~~~~~ls~~Af~~~q~~ds~q~gs~~~s~~~W~~~~~D~~~~iG~G~~y~~l~d~kL~Lg~dysys 559 (649) T TIGR03509 486 GYDLDASYTFTDGLSLHAFYNQQWIDSNQAGSATYSTPDWTSDTTDSVTTIGAGLTYEGLLDLKLSLGGDYSYS 559 (649) T ss_pred EEEEEEEEEECCCEEEEEEEEEEEEHHHCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCCCEEEECCEEEE T ss_conf 16555889967981899988888753123566676754433544301468975147723666664874215650 No 28 >COG3203 OmpC Outer membrane protein (porin) [Cell envelope biogenesis, outer membrane] Probab=96.91 E-value=0.019 Score=30.39 Aligned_cols=28 Identities=29% Similarity=0.362 Sum_probs=16.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 944378999999999841000024665435 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRA 30 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~ 30 (205) |||.++++++++ ++++.++.|+.+-... T Consensus 1 ~~~~~~a~~~~a--~~aA~a~saatlYg~~ 28 (354) T COG3203 1 MKRKILALAVPA--LLAAGAANAAELYGKD 28 (354) T ss_pred CCHHHHHHHHHH--HHHHHHHHEEEEECCC T ss_conf 905689999987--6765322058995314 No 29 >pfam05420 BCSC_C Cellulose synthase operon protein C C-terminus (BCSC_C). This family contains the C-terminal regions of several bacterial cellulose synthase operon C (BCSC) proteins. BCSC is involved in cellulose synthesis although the exact function of this protein is unknown. Probab=96.77 E-value=0.024 Score=29.79 Aligned_cols=50 Identities=14% Similarity=0.034 Sum_probs=38.0 Q ss_pred CCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEEEEEC Q ss_conf 56775204799734661004886799999999750565566226999998539 Q gi|254780598|r 153 EITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 153 ~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~~~~~~~G~~ykF 205 (205) ..+.+..++++.+.+|| +|+++|.+.++-.+..+++=. .+...+=+.|-| T Consensus 287 g~S~~G~~y~l~a~~Ey-rls~~~~lG~~~~~~~a~dY~--p~~~~lYlRY~f 336 (339) T pfam05420 287 GDSGSGLGYSLRAAVEY-RLSPRWFLGGGLDIQRAGDYA--PSSALLYLRYAF 336 (339) T ss_pred CCCCCCEEEEEEEEEEE-EECCCEEEEEEEECCCCCCCC--CCCEEEEEEEEC T ss_conf 88888555788899999-946988998778002157888--340899999960 No 30 >pfam03797 Autotransporter Autotransporter beta-domain. Secretion of protein products occurs by a number of different pathways in bacteria. One of these pathways known as the type V pathway was first described for the IgA1 protease. The protein component that mediates secretion through the outer membrane is contained within the secreted protein itself, hence the proteins secreted in this way are called autotransporters. This family corresponds to the presumed integral membrane beta-barrel domain that transports the protein. This domain is found at the C terminus of the proteins it occurs in. The N terminus contains the variable passenger domain that is translocated across the membrane. Once the passenger domain is exported it is cleaved auto-catalytically in some proteins, in others a different protease is used and in some cases no cleavage occurs. Probab=96.68 E-value=0.029 Score=29.42 Aligned_cols=111 Identities=14% Similarity=0.152 Sum_probs=52.5 Q ss_pred CCCCEEEEEEEEEEECCCEEEEEEEEEEECCCCCCC-------CCCCCCCEEEEEEEEEEEEEC--CCEE--EEEEEEEE Q ss_conf 776026899854640483799845101321555666-------621223214899989754312--7677--89885447 Q gi|254780598|r 73 HHNAGGSIFAGYNVEDSCIMYGVEGDVRYTVPVLAD-------NIHSLHGIGGSLRIRGGYEVS--DSLL--LYATVGPD 141 (205) Q Consensus 73 ~~~~~~G~~~GYn~q~~~~V~G~e~d~~~~~~~~~~-------~~~~~~~~~~~~~~r~Gy~~~--~~~l--~Y~~~G~a 141 (205) .+....++++.|+.+. +|.+-..+.+.+.+.+... ....-..+..++.+++||.+. +++. ||+.+-+. T Consensus 70 ~~~~~~g~y~~~~~~~-~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~gy~~~~~~~~~i~P~~~l~y~ 148 (265) T pfam03797 70 SDSYGAGLYAQWNFDG-GLYLDGVLAYGRFDNDVKRLGTSTGTAKGDYDSHGLGASLEAGYRFKLSGNLTLTPFAGLQYV 148 (265) T ss_pred EEEEEEEEEEEEECCC-CCEEEEEEEEEEEECEEEEECCCCCCCEEEEEEEEEEEEEEEEECCCCCCCEEEEEEEEEEEE T ss_conf 6707999999998489-836976788899715048730678730222802499999998762332697599699999999 Q ss_pred EEEEEE---CC------CCCCCCCCCEEEEEEEEEEEECCCC---CEEEEEEEEEE Q ss_conf 764443---13------4555677520479973466100488---67999999997 Q gi|254780598|r 142 VAQKYE---TG------KAGEITPIAIGGTAGVGVEVGGLSE---SLVARLEYRAS 185 (205) Q Consensus 142 ~~~~~~---~~------~~~~~~~~~~g~~~G~Gvey~~it~---~~~~r~EY~y~ 185 (205) +.+.+. ++ .....+....-..+|+-+++ .+.+ ++.+++.+.|. T Consensus 149 ~~~~~~f~E~g~~~~~l~~~~~~~~~~~~~~G~~~~~-~~~~~~~~~~~~~~~~~~ 203 (265) T pfam03797 149 YLRQDGFTESGARSFALSVDSQSLTNLSGPLGLRLEY-RFSLRSLKPYATLGWAYE 203 (265) T ss_pred EEECCCEEEECCCCCCEEEECCCCCEEEEEEEEEEEE-EECCCEEEEEEEEEEEEE T ss_conf 9876876670998435178446665899999999999-852873648999999999 No 31 >pfam05736 OprF OprF membrane domain. This domain represents the presumed membrane spanning region of the OprF proteins. This region is involved in channel formation and is thought to form an 8-stranded beta-barrel. Probab=96.59 E-value=0.004 Score=33.81 Aligned_cols=86 Identities=13% Similarity=0.052 Sum_probs=42.6 Q ss_pred CCCEEEEEEEEEE-ECC----CEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEEC Q ss_conf 7602689985464-048----37998451013215556666212232148999897543127677898854477644431 Q gi|254780598|r 74 HNAGGSIFAGYNV-EDS----CIMYGVEGDVRYTVPVLADNIHSLHGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYET 148 (205) Q Consensus 74 ~~~~~G~~~GYn~-q~~----~~V~G~e~d~~~~~~~~~~~~~~~~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~ 148 (205) ++....|-+=|+| +.+ ..|-+.-+..+..+.... .....+..+..+.+-+.|-++|+ +|+++++.... T Consensus 89 ~g~~~rLDalYHF~~pg~~lrPYvs~G~gh~s~~~~~~g-~~Grd~sT~~N~G~G~Ky~itdn--~~aRa~vra~y---- 161 (184) T pfam05736 89 GGDLTSLDAQYHFGTPGDGLRPYVSAGFAHQSITNVADG-SQGRDQSTFANIGAGAKYYFTEN--FYARAGVEAQY---- 161 (184) T ss_pred CCCEEEEEEEEECCCCCCCCCEEEEECCCCCEEECCCCC-CCCCCCCEEEEECCCEEEEECCC--EEEEECHHHHC---- T ss_conf 563466777997389876633727731465200022568-77656441676345268996054--47761325310---- Q ss_pred CCCCCCCCCCEEEEEEEEEE Q ss_conf 34555677520479973466 Q gi|254780598|r 149 GKAGEITPIAIGGTAGVGVE 168 (205) Q Consensus 149 ~~~~~~~~~~~g~~~G~Gve 168 (205) . -+.+++.+.+++|+++- T Consensus 162 -~-lD~~~~d~~a~vGl~~~ 179 (184) T pfam05736 162 -G-LDNGDWEWAALVGLGVN 179 (184) T ss_pred -C-CCCCCCCEEEEEEEEEE T ss_conf -4-77886120045524996 No 32 >PRK10554 outer membrane porin protein C; Provisional Probab=96.58 E-value=0.034 Score=29.05 Aligned_cols=43 Identities=14% Similarity=0.326 Sum_probs=33.6 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEECCCCC-------CCCEEEEEEEEEEC Q ss_conf 997346610048867999999997505655-------66226999998539 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARLEYRASKYSKVE-------GFYNTISLGVGMKF 205 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~EY~y~df~~~~-------~~~~~~~~G~~ykF 205 (205) .+-+|.+| .+++|+.+-+||+.--+.+.+ ..-|.+.||+.|+| T Consensus 324 ~~~iga~Y-~FnkN~~~y~~Yk~NllD~~~~~~~~~~~~DD~~alGlrY~F 373 (373) T PRK10554 324 YVDVGATY-YFNKNMSTYVDYKINLLDDNDFTRDAGINTDDIVALGLVYQF 373 (373) T ss_pred EEEEEEEE-EECCCEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEC T ss_conf 38976799-977872168888751367653100036776776999628819 No 33 >TIGR03016 pepcterm_hypo_1 uncharacterized protein, PEP-CTERM system associated. Members of this protein family are found predominantly in exopolysaccharide biosynthesis operons marked by the presence of the EpsH-family putative exosortase and presence in the genome of the PEP-CTERM protein sorting signal. Members of this family may be distantly related to the EpsL family modeled in TIGR03014. Probab=96.55 E-value=0.035 Score=28.94 Aligned_cols=80 Identities=16% Similarity=0.111 Sum_probs=53.1 Q ss_pred EEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCC----CCCCCE Q ss_conf 9989754312767789885447764443134555677520479973466100488679999999975056----556622 Q gi|254780598|r 120 LRIRGGYEVSDSLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSK----VEGFYN 195 (205) Q Consensus 120 ~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~----~~~~~~ 195 (205) +.++..+.+.+++.+-+.+++...+.+ ... +....-+.+.+|+.| .|++++.+.+-|+|.+... .+-+.| T Consensus 348 ~~~~~~~~L~~rl~~~~~~~~~~~~~~--~~~---~~~d~~~~~~~gl~y-~l~~~~~~~l~Yry~~rdS~~~~~dy~~N 421 (431) T TIGR03016 348 GSVLLTHRLSPRLTANLGLTYERNEGS--GTN---DQETDRTSVSAGLSY-QLGENTTGSLTYRYQERDSNFEATDYTEN 421 (431) T ss_pred EEEEEEEEECCCCEEEEEEEEEEEECC--CCC---CCCEEEEEEEEEEEE-EECCCEEEEEEEEEEECCCCCCCCCHHHH T ss_conf 038999996513121489999972437--777---650469999998999-96386599999999970478766664551 Q ss_pred EEEEEEEEEC Q ss_conf 6999998539 Q gi|254780598|r 196 TISLGVGMKF 205 (205) Q Consensus 196 ~~~~G~~ykF 205 (205) .|.+.+.++| T Consensus 422 ~v~~~l~~~F 431 (431) T TIGR03016 422 RLMAEVTMTF 431 (431) T ss_pred EEEEEEEEEC T ss_conf 2157787409 No 34 >pfam10082 DUF2320 Uncharacterized protein conserved in bacteria (DUF2320). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=96.54 E-value=0.036 Score=28.91 Aligned_cols=45 Identities=20% Similarity=0.248 Sum_probs=24.9 Q ss_pred EEEEEEEEEEECCCCCEEEEEEEEE--EECCCC--CCCCEEEEEEEEEEC Q ss_conf 4799734661004886799999999--750565--566226999998539 Q gi|254780598|r 160 GGTAGVGVEVGGLSESLVARLEYRA--SKYSKV--EGFYNTISLGVGMKF 205 (205) Q Consensus 160 g~~~G~Gvey~~it~~~~~r~EY~y--~df~~~--~~~~~~~~~G~~ykF 205 (205) ...++++++. .++++|++.+.+.| .+|... +-+...+.+|+.|+| T Consensus 295 ~~~~~~~~~h-~~~~~l~~~~~~~y~~~~y~g~~r~D~~~~~~~~l~Y~~ 343 (380) T pfam10082 295 ETTYSLGVDH-ELRRNLTLSLRLSYGQDDYQGSGRKDETYSAEAGLTYKL 343 (380) T ss_pred EEEEEEEEEE-EEECCEEEEEEEEEEEEECCCCCCCCCEEEEEEEEEEEE T ss_conf 9999999999-730018999999999847468872013399999999984 No 35 >pfam11854 DUF3374 Protein of unknown function (DUF3374). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 665 to 712 amino acids in length. Probab=96.51 E-value=0.038 Score=28.81 Aligned_cols=23 Identities=39% Similarity=0.352 Sum_probs=11.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) ||..+-.+++++++++ ++|+|+| T Consensus 1 mkf~l~lialAll~~~--~~a~A~~ 23 (675) T pfam11854 1 MKFKLNLIALALLAAA--FSANAAD 23 (675) T ss_pred CCCCHHHHHHHHHHHC--CHHHHCC T ss_conf 9732229999998624--3333156 No 36 >PRK09980 ompL outer membrane porin L; Provisional Probab=96.12 E-value=0.064 Score=27.61 Aligned_cols=77 Identities=12% Similarity=0.048 Sum_probs=46.3 Q ss_pred EEEEECCCEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECC--CCCCCCEEEEEEE Q ss_conf 75431276778988544776444313455567752047997346610048867999999997505--6556622699999 Q gi|254780598|r 124 GGYEVSDSLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYS--KVEGFYNTISLGV 201 (205) Q Consensus 124 ~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~--~~~~~~~~~~~G~ 201 (205) .+|-+++.+.. .-- ..+.+.. ......+..+..|-.-+-+.| .++++|+=-.|+.|.|-. +.+-....+|||| T Consensus 152 ~~y~~~~~~~~-~~~-~tY~~~v--nd~~~~NgKk~~~E~n~~~~Y-k~~~~w~PY~E~~~ld~~~~~~~r~q~r~RVGv 226 (230) T PRK09980 152 WNFIITDKFSY-TFE-PHYFYNV--NDFNSSNGTKHHWEITNTFRY-RINEHWLPYFELRWLDRNVGPYHREQNQIRIGA 226 (230) T ss_pred EEEEECCCEEE-EEE-EEEEEEC--CCCEECCCCCHHHHEEEEEEE-ECCCCCCCEEEEEECCCCCCCCCCCEEEEEEEE T ss_conf 77897574379-973-0999850--770732786404431026888-428962431898742235676777135799988 Q ss_pred EEEC Q ss_conf 8539 Q gi|254780598|r 202 GMKF 205 (205) Q Consensus 202 ~ykF 205 (205) .|-| T Consensus 227 ~y~F 230 (230) T PRK09980 227 KYFF 230 (230) T ss_pred EEEC T ss_conf 8629 No 37 >PRK09752 adhesin; Provisional Probab=96.09 E-value=0.066 Score=27.54 Aligned_cols=26 Identities=23% Similarity=0.169 Sum_probs=12.1 Q ss_pred CEEEEEEEEEEEECCCCCEEEEEEEEE Q ss_conf 204799734661004886799999999 Q gi|254780598|r 158 AIGGTAGVGVEVGGLSESLVARLEYRA 184 (205) Q Consensus 158 ~~g~~~G~Gvey~~it~~~~~r~EY~y 184 (205) +.-.-+-+|||= .|+++|+++..-.+ T Consensus 1205 rNiGELKvGVEG-QINk~lnvWGNVaq 1230 (1250) T PRK09752 1205 KQRGEIKVGVTG-NISQRVSLRGSVAW 1230 (1250) T ss_pred CCCCEEECCEEE-EECCCEEEEEEEEE T ss_conf 130103611699-97674678987668 No 38 >PRK11528 hypothetical protein; Provisional Probab=96.03 E-value=0.071 Score=27.39 Aligned_cols=45 Identities=16% Similarity=0.180 Sum_probs=29.9 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEEEEE--CCCCCCCCEEEEEEEEEEC Q ss_conf 0479973466100488679999999975--0565566226999998539 Q gi|254780598|r 159 IGGTAGVGVEVGGLSESLVARLEYRASK--YSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 159 ~g~~~G~Gvey~~it~~~~~r~EY~y~d--f~~~~~~~~~~~~G~~ykF 205 (205) .|..-++-+-. -.+++|+..+.|||.| +++... .|.+-.-+.|.| T Consensus 208 ~G~NGa~alwW-~~t~~~ttgiqYRYAdNKLG~~~~-~D~iiYs~kYnF 254 (254) T PRK11528 208 DGINGAVALWW-NATKHLTTGIQYRYADNKLGEDFL-QDAIIYSIKYNF 254 (254) T ss_pred CCCCCEEEEEE-EECCCEEEEEEEEEHHCCCCCCCC-CCEEEEEEEECC T ss_conf 76235389988-405760042699800012573223-753899996249 No 39 >pfam11383 DUF3187 Protein of unknown function (DUF3187). This family of proteins with unknown function appear to be restricted to Proteobacteria. Probab=95.99 E-value=0.041 Score=28.61 Aligned_cols=119 Identities=14% Similarity=0.117 Sum_probs=66.2 Q ss_pred EEEEEEEEEC-C--CEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCCCC Q ss_conf 8998546404-8--379984510132155566662122321489998975431276778988544776444313455567 Q gi|254780598|r 79 SIFAGYNVED-S--CIMYGVEGDVRYTVPVLADNIHSLHGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKAGEIT 155 (205) Q Consensus 79 G~~~GYn~q~-~--~~V~G~e~d~~~~~~~~~~~~~~~~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~~~ 155 (205) .++..|++-. . ..=+|+-+-++. .. .+........-.+..--+|.- ++--+|.++|++..+.+ ....... T Consensus 117 t~Y~qYql~~~~~hglS~GgSLyyN~--v~--~g~F~~~~FEQ~lQlNysy~~-~~h~~~s~lG~t~r~~~--~~~~~l~ 189 (273) T pfam11383 117 TLYLQYQLFQNEHHGLSIGGSLYYNN--VS--SGPFKGESFEQGLQLNYSYQK-DRHHFYSTLGVVFRNSD--EALNDLP 189 (273) T ss_pred HHHHHEEEECCCCCCEEEEEEEEECC--CC--CCCCCCCCHHHHHEEEEEEEC-CCCEEEEEEEEEEECCC--CCCCCCC T ss_conf 87641243048875278779999546--46--776366652232113005543-77528889889983587--4236655 Q ss_pred CCCEEEEEEEEEEEECCCCCEEEEEEEEEEECC-C----CCCCCEEEEEEEEEEC Q ss_conf 752047997346610048867999999997505-6----5566226999998539 Q gi|254780598|r 156 PIAIGGTAGVGVEVGGLSESLVARLEYRASKYS-K----VEGFYNTISLGVGMKF 205 (205) Q Consensus 156 ~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~-~----~~~~~~~~~~G~~ykF 205 (205) -....|+.++|.|| ++++|-.+-+||.+.+=. + .+-..|.+.+|-.|++ T Consensus 190 ~k~~t~s~~~gY~Y-~l~~~H~ll~ey~~yqG~~~~~~~fse~SnE~~LGYRY~~ 243 (273) T pfam11383 190 YKKNTLSWAIGYEY-RLTERHHLLIEYHWYQGAVEGDTDFSEASNEAVLGYRYYM 243 (273) T ss_pred CCCCEEEEEEEEEE-EECCCHHHHHHHHHHCCCCCCCCHHCCCCCEEEEEEEEEE T ss_conf 34415777721267-5237614654334333642686100112413578665773 No 40 >pfam00267 Porin_1 Gram-negative porin. Probab=95.97 E-value=0.076 Score=27.22 Aligned_cols=80 Identities=16% Similarity=0.245 Sum_probs=47.0 Q ss_pred EEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCC------CCC Q ss_conf 99897543127677898854477644431345556775204799734661004886799999999750565------566 Q gi|254780598|r 120 LRIRGGYEVSDSLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKV------EGF 193 (205) Q Consensus 120 ~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~------~~~ 193 (205) .++-.+|.+++ + |.-+.|+..+............. .-.+-+.++| .+++.+..++.|.|..-.++ ... T Consensus 210 ~~~ga~Y~~~~-l--~la~~Y~~~~~~~~~~~~~~~~~--~~~~e~~~~Y-~~~~g~~~~~~Y~~~~~~~~~~~~~~~~~ 283 (334) T pfam00267 210 WRLGGKYDANN-V--YLAVAYAQTRNATPYGDVGFANK--NQNFEVAAQY-QFDFGLRPRVSYAQSKGKDLNGAYGDNDL 283 (334) T ss_pred EEEEEEEEECC-E--EEEEEEEEEEECCCCCCCCCCCC--CEEEEEEEEE-EECCCCCEEEEEEEEECCCCCCCCCCCCE T ss_conf 99999998054-8--99999865420101376433444--0469999999-74246316999998742223565577645 Q ss_pred CEEEEEEEEEEC Q ss_conf 226999998539 Q gi|254780598|r 194 YNTISLGVGMKF 205 (205) Q Consensus 194 ~~~~~~G~~ykF 205 (205) .|.+.+|+.|+| T Consensus 284 ~~q~~lg~dY~f 295 (334) T pfam00267 284 VKYVDVGATYYF 295 (334) T ss_pred EEEEEEEEEEEE T ss_conf 899999989996 No 41 >PRK09980 ompL outer membrane porin L; Provisional Probab=95.96 E-value=0.013 Score=31.16 Aligned_cols=63 Identities=10% Similarity=-0.006 Sum_probs=33.0 Q ss_pred EEEEEEEEECCCEEEEEEEEEEEEEEE---ECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEEC Q ss_conf 998975431276778988544776444---31345556775204799734661004886799999999750 Q gi|254780598|r 120 LRIRGGYEVSDSLLLYATVGPDVAQKY---ETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKY 187 (205) Q Consensus 120 ~~~r~Gy~~~~~~l~Y~~~G~a~~~~~---~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df 187 (205) --+|++|.+.|.+-+-++-=|-+.+.+ .++.....+..+. ..=+-| .+++.|....|+.|.+- T Consensus 106 PYLkl~Y~fd~~~~~~~RYRYd~~~yds~dlnG~~d~~d~hr~----d~Y~~y-~~~~~~~~~~~~tY~~~ 171 (230) T PRK09980 106 VYLDVNYKFTPWFNLTVRNRYNHNNYSSTDLNGELDNNDSYEI----GNYWNF-IITDKFSYTFEPHYFYN 171 (230) T ss_pred CCEEEEEEECCCCEEEEEEEECCCCCCCCCCCCCCCCCCEEEE----EEEEEE-EECCCEEEEEEEEEEEE T ss_conf 4177777665871588998852455674345776677750773----005778-97574379973099985 No 42 >PRK10993 outer membrane protease; Reviewed Probab=95.66 E-value=0.1 Score=26.55 Aligned_cols=49 Identities=16% Similarity=0.075 Sum_probs=34.4 Q ss_pred CCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCC-----------------------CCCCEEEEEEEEEEC Q ss_conf 75204799734661004886799999999750565-----------------------566226999998539 Q gi|254780598|r 156 PIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKV-----------------------EGFYNTISLGVGMKF 205 (205) Q Consensus 156 ~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~-----------------------~~~~~~~~~G~~ykF 205 (205) +..-=+.+.+-+-| .+|||..+.+|..|..+..- +...-++++|+-||| T Consensus 240 k~s~y~s~~~~agY-~vtp~~kl~~e~~~~ky~~~kg~T~i~d~~tG~~~~~~d~AGi~n~~~tvs~GLkY~F 311 (311) T PRK10993 240 KNSPYYSASINAGY-YVTPNAKLFVEGAYNKYFEGKGDTQIYDHATGESSYGKDAAGIENQNFTVTAGLKYRF 311 (311) T ss_pred CCCCEEEEEEEEEE-EECCCEEEEEEEEEEEEECCCCCCEEEECCCCCEEECCCCCCCCCCCEEEEEEEEEEC T ss_conf 67642899981004-6279815999999999851568747874478716614764220332159998777869 No 43 >PRK13483 enterobactin receptor protein; Provisional Probab=95.37 E-value=0.13 Score=25.98 Aligned_cols=40 Identities=28% Similarity=0.352 Sum_probs=17.3 Q ss_pred EEEEEEECCCCCEEEEE--------EEEEEECCCCCCCCEEEEEEEEEEC Q ss_conf 73466100488679999--------9999750565566226999998539 Q gi|254780598|r 164 GVGVEVGGLSESLVARL--------EYRASKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 164 G~Gvey~~it~~~~~r~--------EY~y~df~~~~~~~~~~~~G~~ykF 205 (205) -+.+.| .++++|.+.+ +|+|..++-..- -=++.+++.|+| T Consensus 616 dl~~~y-~~~~~~~l~~~v~NLfDk~Y~~~~~gy~~~-GR~~~~~~~y~F 663 (663) T PRK13483 616 DLGANY-QLTDSLKFSAGIYNLFDKEINYEEYGYVED-GRRYWLGMTYSF 663 (663) T ss_pred EEEEEE-EECCCEEEEEEEECCCCCCCCCCCCCCCCC-CEEEEEEEEEEC T ss_conf 979999-958987999999936699305334777889-909999999989 No 44 >pfam09381 Porin_OmpG Outer membrane protein G (OmpG). Porins are channel proteins in the outer membrane of gram negative bacteria which mediate the uptake of molecules required for growth and survival. Escherichia coli OmpG forms a 14 stranded beta-barrel and in contrast to most porins, appears to function as a monomer. The central pore of OmpG is wider than other E. coli porins and it is speculated that it may form a non-specific channel for the transport of larger oligosaccharides. Probab=95.29 E-value=0.14 Score=25.83 Aligned_cols=81 Identities=16% Similarity=0.162 Sum_probs=45.1 Q ss_pred EEEEEEEEECCC-EEEEEEEEEEEEEE---EECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCC---C Q ss_conf 998975431276-77898854477644---4313455567752047997346610048867999999997505655---6 Q gi|254780598|r 120 LRIRGGYEVSDS-LLLYATVGPDVAQK---YETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVE---G 192 (205) Q Consensus 120 ~~~r~Gy~~~~~-~l~Y~~~G~a~~~~---~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~---~ 192 (205) +|+-+-..+++. +-||.+.|+-.+.- ..+..-...+.++ +|+=++| .+.+.+++-+||-|---...+ . T Consensus 193 iR~YlP~~~g~tTiTPY~R~gld~wsn~~w~~~~~r~~~~~~R----lGl~~~~-~~~~glsmtLeYAyE~q~hd~~~~d 267 (280) T pfam09381 193 IRAYLPLTLGNHSVTPYTRIGLDRWSNWDWQDDIEREGHDFNR----VGLFYGY-DFQNGLSVSLEYAFEWQDHDEGDSD 267 (280) T ss_pred EECEEEEEECCCCCCCCEEEEHHHHCCCCCCCCHHHCCCCCCC----EEEEEEE-CCCCCCEEEEEEEEEECCCCCCCCC T ss_conf 0000345336730044214303232144223667660884000----1367853-1688737765233220204568744 Q ss_pred CCEEEEEEEEEEC Q ss_conf 6226999998539 Q gi|254780598|r 193 FYNTISLGVGMKF 205 (205) Q Consensus 193 ~~~~~~~G~~ykF 205 (205) ++|-.-||++|-| T Consensus 268 kFHyaGvG~~YsF 280 (280) T pfam09381 268 KFHYAGVGVNYSF 280 (280) T ss_pred CEEECCCCEEECC T ss_conf 2033030035319 No 45 >COG3713 OmpV Outer membrane protein V [Cell envelope biogenesis, outer membrane] Probab=95.22 E-value=0.15 Score=25.73 Aligned_cols=80 Identities=14% Similarity=-0.024 Sum_probs=51.8 Q ss_pred EEEECCCEEEEEEEEEEEEEEEECCCCC-------------CCC--CCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCC Q ss_conf 5431276778988544776444313455-------------567--7520479973466100488679999999975056 Q gi|254780598|r 125 GYEVSDSLLLYATVGPDVAQKYETGKAG-------------EIT--PIAIGGTAGVGVEVGGLSESLVARLEYRASKYSK 189 (205) Q Consensus 125 Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~-------------~~~--~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~ 189 (205) -+...+++.+=.-.|+.|.+.+...... ..+ .-....-+++=++| .||+||++.+--+|+.+.. T Consensus 156 ~~~~~~~~~ispg~g~s~~s~~Y~~yYfGVs~~esa~Sgl~~Y~p~~g~~s~~l~ls~~y-~~t~~w~~~~~~~~~RL~g 234 (258) T COG3713 156 PRMLIGRLSLSPGPGVSWASSKYNDYYFGVSAAESARSGLASYNPGGGAKSPGLGLSATY-MITDHWSVGGTAGYSRLSG 234 (258) T ss_pred EEECCCCEEECCCCCEEEECCCCCCEEECCCHHHHHHCCCCCCCCCCCCCCCCCEEEEEE-EECCCEEEEEEEEEEECCC T ss_conf 983059789854773277122334202052889973047774079988434672379999-7068779998887131033 Q ss_pred C--------CCCCEEEEEEEEEEC Q ss_conf 5--------566226999998539 Q gi|254780598|r 190 V--------EGFYNTISLGVGMKF 205 (205) Q Consensus 190 ~--------~~~~~~~~~G~~ykF 205 (205) . +...-.+.+||.||| T Consensus 235 daadSPiV~~~~q~~~~~~v~YrF 258 (258) T COG3713 235 DAADSPIVDRRNQFSVGTGVTYRF 258 (258) T ss_pred CCCCCCCEECCCCEEEEEEEEEEC T ss_conf 101497466156358998899859 No 46 >pfam10626 TraO Conjugative transposon protein TraO. This is a family of conjugative transposon proteins. Probab=95.22 E-value=0.15 Score=25.73 Aligned_cols=174 Identities=16% Similarity=0.111 Sum_probs=75.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 94437899999999984100002466543566666555634554444441154210001223333446677777602689 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHHGGRGVVPTIATNRYVPIRHDFNGPYAGLSALYNGSFGEEAHHNAGGSI 80 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~~~~~~~~~~~~~~~~~~~~~w~G~y~g~~~g~~~~~~~~~~~~~~~G~ 80 (205) |||.+..+.+..++ ..+..|+|.-..+--.-... ... --+|++. ..+..+.+...|+ T Consensus 1 MKk~~~i~~~~~l~-l~~~~a~AQR~LPkqkGie~-----------~~g--~~~g~~~---------~~~~~~~~y~~g~ 57 (193) T pfam10626 1 MKRKLFIILLALLA-LFAGQAHAQRCLPGMKGIQL-----------TAG--MVDGFHS---------FKKRNDFGYHFGL 57 (193) T ss_pred CCEEEEEHHHHHHH-HHCCHHHHHHHCCCCCEEEE-----------EEE--EECCCEE---------CCCCCCCCEEEEE T ss_conf 96188608899999-86498888764777711799-----------853--2146110---------4678877479999 Q ss_pred -EEEEEEECCCEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEEEE------CCCEEEEE----EEEEEEEEEEECC Q ss_conf -985464048379984510132155566662122321489998975431------27677898----8544776444313 Q gi|254780598|r 81 -FAGYNVEDSCIMYGVEGDVRYTVPVLADNIHSLHGIGGSLRIRGGYEV------SDSLLLYA----TVGPDVAQKYETG 149 (205) Q Consensus 81 -~~GYn~q~~~~V~G~e~d~~~~~~~~~~~~~~~~~~~~~~~~r~Gy~~------~~~~l~Y~----~~G~a~~~~~~~~ 149 (205) ..-|.-.-+.||+|+|-.-.+..-+. ..+.... ..+..||-+ +.++.+|+ .+||-..+....- T Consensus 58 als~Y~K~~n~wv~g~EY~~k~~~Yr~---~~IP~~q---ft~egGY~~~~LSD~~Knvfl~~G~SaL~GYEtvN~g~kl 131 (193) T pfam10626 58 AVSTYTKNGNKWVFGGEYLQKNYPYRD---TRIPVSQ---FTGEGGYYYNFLSDAGKNFFLSLGLSALAGYETVNWGEKL 131 (193) T ss_pred EEEEECCCCCEEEEEEEHHHHHCCCCC---CCCCHHH---HEEECCEEEEEEECCCCEEEEEECHHHEEEEEEECCCCCC T ss_conf 987631777589987102013289653---3650899---0540627999972598179997050116303675378730 Q ss_pred -CCCCCCCCCEEEEEE----EEEEEECCCCCEEEEEEEEEEE-CCCCCCCCEEEEEEEEEEC Q ss_conf -455567752047997----3466100488679999999975-0565566226999998539 Q gi|254780598|r 150 -KAGEITPIAIGGTAG----VGVEVGGLSESLVARLEYRASK-YSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 150 -~~~~~~~~~~g~~~G----~Gvey~~it~~~~~r~EY~y~d-f~~~~~~~~~~~~G~~ykF 205 (205) ..+..=..+....+| +-+|. .+||++..-+.-|-.- |+..-..+| ..+|++.|| T Consensus 132 L~DGatL~~~d~FIYGga~~l~~E~-yLtDrivll~~~r~R~L~Gsd~~~Fh-~~~g~Glkf 191 (193) T pfam10626 132 LPDGATLTDGDSFIYGGAVTLELET-YLTDRVVLLLNARERLLFGSDSGKFH-TQFGLGIKI 191 (193) T ss_pred CCCCCEEECCCCEEECCCCEEEEEE-EEEEEEEEEEECCEEEEECCCCCCEE-EEECCEEEE T ss_conf 6686662058845887730699999-88514999982306899625547586-600021799 No 47 >pfam01389 OmpA_membrane OmpA-like transmembrane domain. The structure of OmpA transmembrane domain shows that it consists of an eight stranded beta barrel. This family includes some other distantly related outer membrane proteins with low scores. Probab=94.78 E-value=0.2 Score=25.06 Aligned_cols=89 Identities=17% Similarity=0.089 Sum_probs=38.5 Q ss_pred CCEEEEEEEEEEECCCCC-CCCCCCCCCEEEEEEEEEEEEECCCEEEEEEEEEEEE-EEEECCCCCCCCCCCEEEEEEEE Q ss_conf 837998451013215556-6662122321489998975431276778988544776-44431345556775204799734 Q gi|254780598|r 89 SCIMYGVEGDVRYTVPVL-ADNIHSLHGIGGSLRIRGGYEVSDSLLLYATVGPDVA-QKYETGKAGEITPIAIGGTAGVG 166 (205) Q Consensus 89 ~~~V~G~e~d~~~~~~~~-~~~~~~~~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~-~~~~~~~~~~~~~~~~g~~~G~G 166 (205) +.|.+|..+-........ .......++.......-+||.++|.+-+- +||... +.+.+......+....++++.+= T Consensus 5 n~~Y~G~~~G~s~~~~~~~~~~~~~~d~~~~~~glf~GYqfn~~~a~E--~gY~~lG~~~~~~~~~~~~~~~~~~~l~~k 82 (175) T pfam01389 5 NTWYTGAKAGWASFHDTGLIEGNGPTHRNSLTAGAFGGYQINNYLAAE--LGYDYLGRMPYKGKGKNNAHKAQGVTLSLK 82 (175) T ss_pred CCEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEECCCEEEEE--EEEEECCCCCCCCCCCCCCEEEEEEEEEEE T ss_conf 957896640256895763445787656787178999863103112268--742431654335534345404689999999 Q ss_pred EEEECCCCCEEEEE Q ss_conf 66100488679999 Q gi|254780598|r 167 VEVGGLSESLVARL 180 (205) Q Consensus 167 vey~~it~~~~~r~ 180 (205) .-| .|+++|.+-+ T Consensus 83 ~~~-pl~d~~~ly~ 95 (175) T pfam01389 83 LSY-PLTDDLDVYG 95 (175) T ss_pred EEE-EECCCEEEEE T ss_conf 888-6148567999 No 48 >COG5338 Uncharacterized protein conserved in bacteria [Function unknown] Probab=94.69 E-value=0.019 Score=30.29 Aligned_cols=62 Identities=27% Similarity=0.189 Sum_probs=37.8 Q ss_pred CEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEE Q ss_conf 214899989754312767789885447764443134555677520479973466100488679 Q gi|254780598|r 115 GIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLV 177 (205) Q Consensus 115 ~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~ 177 (205) .-..++++|.||.+.|.+-+|..+-......+...+......+..+|.+++|+|+ -+.+++. T Consensus 262 ~n~~s~~~R~gYelSPal~pf~e~~vgk~~YD~ridt~G~eRSs~~Y~~r~G~~~-D~gEk~~ 323 (468) T COG5338 262 RNLASLTGRIGYELSPALIPFAEAEVGKRDYDTRIDTQGYERSSDGYGARVGAEF-DLGEKLR 323 (468) T ss_pred CCCCEEEEECCEEECCCCCEEEEEECCCCHHHHHCCCCCCCCCCCCCCCCCCEEE-CHHHHHC T ss_conf 4400023211123365422026762373102110244342234442000132466-0004216 No 49 >PRK12580 outer membrane protease; Reviewed Probab=94.43 E-value=0.25 Score=24.60 Aligned_cols=47 Identities=15% Similarity=0.125 Sum_probs=32.5 Q ss_pred CEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCC------------------------CCCEEEEEEEEEEC Q ss_conf 2047997346610048867999999997505655------------------------66226999998539 Q gi|254780598|r 158 AIGGTAGVGVEVGGLSESLVARLEYRASKYSKVE------------------------GFYNTISLGVGMKF 205 (205) Q Consensus 158 ~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~------------------------~~~~~~~~G~~ykF 205 (205) .-=+.+.+-+-| .+||+..+.+|..|..+.+.+ ...-++++|+-||| T Consensus 242 s~yys~~~~agY-~vtp~~klf~e~~~~ky~~~kgdT~i~d~~tG~s~~~~~d~AGi~n~n~tvs~GLkY~F 312 (312) T PRK12580 242 SRYYGTVINAGY-YVTPNAKVFAEFTYSKYDEGKGGTQTIDKNSGDSVSIGGDAAGISNKNYTVTAGLQYRF 312 (312) T ss_pred CCEEEEEEECCE-EECCCEEEEEEEEEEEEECCCCCCEEEECCCCCEEEECCCCCCCCCCCEEEEEEEEEEC T ss_conf 641899984025-76798569999999877325686288866888365405774000243249999878869 No 50 >pfam06178 KdgM Oligogalacturonate-specific porin protein (KdgM). This family consists of several bacterial proteins which are homologous to the oligogalacturonate-specific porin protein KdgM from Erwinia chrysanthemi. The phytopathogenic Gram-negative bacteria Erwinia chrysanthemi secretes pectinases, which are able to degrade the pectic polymers of plant cell walls, and uses the degradation products as a carbon source for growth. KdgM is a major outer membrane protein, whose synthesis is strongly induced in the presence of pectic derivatives. KdgM behaves like a voltage-dependent porin that is slightly selective for anions and that exhibits fast block in the presence of trigalacturonate. In contrast to most porins, KdgM seems to be monomeric. Probab=94.36 E-value=0.26 Score=24.52 Aligned_cols=51 Identities=10% Similarity=0.062 Sum_probs=36.6 Q ss_pred CCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCC-CCCCCEEEEEEEEEEC Q ss_conf 677520479973466100488679999999975056-5566226999998539 Q gi|254780598|r 154 ITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSK-VEGFYNTISLGVGMKF 205 (205) Q Consensus 154 ~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~-~~~~~~~~~~G~~ykF 205 (205) .+.....|..-.-++| .+.++|.--+|+.+.+-.+ .|.....+||||.|-| T Consensus 167 ~d~k~~~~e~n~~~~Y-k~~~~W~PyvEvg~v~~~~~~d~rq~RyRVGv~Y~F 218 (218) T pfam06178 167 SDGKKTNWEYNVKLAY-KIDKHWTPYVEVGNVDVNNNTDERQTRYRVGVQYSF 218 (218) T ss_pred CCCCCCEEEEEEEEEE-ECCCCCCEEEEEEECCCCCCCCCCEEEEEEEEEEEC T ss_conf 4798851688899999-828982304999842567888662767998889769 No 51 >COG4625 Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain [Function unknown] Probab=94.01 E-value=0.31 Score=24.12 Aligned_cols=46 Identities=13% Similarity=0.190 Sum_probs=34.4 Q ss_pred CCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEEEEEC Q ss_conf 5204799734661004886799999999750565566226999998539 Q gi|254780598|r 157 IAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 157 ~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~~~~~~~G~~ykF 205 (205) .+....+++|+.+ .+++|.++.+-|.+ .|++-.. -|.+.+++.+|| T Consensus 532 srdaa~i~~G~~l-~~s~n~sl~~~y~~-~fg~r~~-~~s~~~~l~~~F 577 (577) T COG4625 532 SRDAALLGLGAVL-PVSPNAGLWARYEG-QFGDRKV-DDSGGLRLQVRF 577 (577) T ss_pred CCCCEEEECCEEE-EECCCEEEEEEEEE-EECCCEE-CCCCCEEEEECC T ss_conf 6642477545168-61563268887767-9636245-145433457439 No 52 >PRK10455 periplasmic protein; Reviewed Probab=93.94 E-value=0.055 Score=27.97 Aligned_cols=29 Identities=38% Similarity=0.314 Sum_probs=25.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 94437899999999984100002466543 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRR 29 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~ 29 (205) |||+...+.+|.|+|++++.|+|||.... T Consensus 1 Mrkltal~~AS~l~lg~~~~A~aA~~~~~ 29 (161) T PRK10455 1 MRKLTALFVASTLALGAANLAHAADTTTA 29 (161) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 91799999999999836688887236899 No 53 >pfam06727 DUF1207 Protein of unknown function (DUF1207). This family consists of a number of hypothetical bacterial proteins of around 410 residues in length which seem to be specific to Chlamydia species. The function of this family is unknown. Probab=93.67 E-value=0.36 Score=23.79 Aligned_cols=84 Identities=19% Similarity=0.246 Sum_probs=56.2 Q ss_pred EEEEEEEEECCCEEEEEEEEEEEEEEE-------------ECCC-----------------CCCCCCCCEEEEEEEEEEE Q ss_conf 998975431276778988544776444-------------3134-----------------5556775204799734661 Q gi|254780598|r 120 LRIRGGYEVSDSLLLYATVGPDVAQKY-------------ETGK-----------------AGEITPIAIGGTAGVGVEV 169 (205) Q Consensus 120 ~~~r~Gy~~~~~~l~Y~~~G~a~~~~~-------------~~~~-----------------~~~~~~~~~g~~~G~Gvey 169 (205) +-.++.|.+ +..-+|+-+|+-..+.. ...+ ...+.++....++.+|+|. T Consensus 217 ~Dl~lS~e~-p~iRvYGG~G~i~~rd~s~~~p~~~e~GaElRsp~gl~~l~~rpv~A~d~k~~eq~~~~~d~s~~~G~E~ 295 (337) T pfam06727 217 VDLRLSYEL-PQIRVYGGGGGIFSRDPSPIKPWYIEYGAELRSPWGLDNLHARPIFAMDFKNWEQQKWNIDQSYRAGMEW 295 (337) T ss_pred EEEEEEEEC-CCEEEEECCEEEEECCCCCCCCEEEECCCEEECCCCCCCCCCEEEEEEEEEEHHHHCCCCCCEEEEEEEE T ss_conf 017997664-7579981401689548777887057617011076667863430689997523454145867248986676 Q ss_pred ECCC---CCEEEEEEEEEEECCCC----CCCCEEEEEEEEEEC Q ss_conf 0048---86799999999750565----566226999998539 Q gi|254780598|r 170 GGLS---ESLVARLEYRASKYSKV----EGFYNTISLGVGMKF 205 (205) Q Consensus 170 ~~it---~~~~~r~EY~y~df~~~----~~~~~~~~~G~~ykF 205 (205) ..+. .+|.+-+|| |.-|+.. +-..+-..+|+.|.| T Consensus 296 ~~~q~~gRk~~~llEY-~~G~Sp~GQF~re~~~Y~Gigl~Ygf 337 (337) T pfam06727 296 AKFQAFGRKLRLLLEY-HDGFSPTGQFFREKTDYLGFGLHYGF 337 (337) T ss_pred EHEECCCCEEEEEEEE-CCCCCCCCCEEEEECCEEEEEEEECC T ss_conf 2100437136776620-04999876057631327885358159 No 54 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=93.64 E-value=0.36 Score=23.75 Aligned_cols=49 Identities=12% Similarity=0.003 Sum_probs=33.4 Q ss_pred CCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEEEEE Q ss_conf 5677520479973466100488679999999975056556622699999853 Q gi|254780598|r 153 EITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVEGFYNTISLGVGMK 204 (205) Q Consensus 153 ~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~~~~~~~G~~yk 204 (205) ..+.+.+||++.+.+|| .|+++|++.+.-.+-.+++=. .+..-+=+-|- T Consensus 1089 g~s~~G~~y~l~a~~ey-r~~~~~~~G~~~~~~~s~dY~--p~~~~lY~RY~ 1137 (1158) T PRK11447 1089 GSSSQGFGYTARALIER-RVTSNWFVGTAVDIQQAKDYA--PSHALLYVRYS 1137 (1158) T ss_pred CCCCCCCCEEEEEEEEE-EECCCEEEECEEECCCCCCCC--CCCEEEEEEEE T ss_conf 88888632367788999-715867863105015335778--35146789896 No 55 >pfam01278 Omptin Omptin family. The omptin family is a family of serine proteases. Probab=93.53 E-value=0.38 Score=23.65 Aligned_cols=49 Identities=14% Similarity=0.086 Sum_probs=33.5 Q ss_pred CCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCC------------------------CCCEEEEEEEEEEC Q ss_conf 752047997346610048867999999997505655------------------------66226999998539 Q gi|254780598|r 156 PIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVE------------------------GFYNTISLGVGMKF 205 (205) Q Consensus 156 ~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~------------------------~~~~~~~~G~~ykF 205 (205) +..-=+.+.+-+-| .+||+..+.+|..|..+.+.+ -..-++++|+-||| T Consensus 223 kns~yys~~~~agY-~vtp~~klf~e~~~~ky~~~kg~T~i~d~~tG~~~~~~~d~AGi~n~~~tvs~GLkY~F 295 (295) T pfam01278 223 ENQNYYSVAVNAGY-YVTPNAKVFVEGSYSKVTNKKGDTSLYDIASGNTSGGEGDAAGIENYNFTVTAGLKYRF 295 (295) T ss_pred CCCCEEEEEEECCE-EECCCEEEEEEEEEEEEECCCCCCEEEECCCCCEEECCCCCCCCCCCCEEEEEEEEEEC T ss_conf 67641899984125-76798569999999877325687478873688656326775331322148998778869 No 56 >pfam06551 DUF1120 Protein of unknown function (DUF1120). This family consists of several hypothetical bacterial proteins of unknown function. Probab=93.25 E-value=0.055 Score=27.98 Aligned_cols=34 Identities=29% Similarity=0.207 Sum_probs=23.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 9443789999999998410000246654356666 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHHGG 34 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~~~ 34 (205) |||++++.++.+-++.+.++|+||+........+ T Consensus 1 mkK~l~~~~i~a~l~~~~ssa~aa~~ta~LkV~G 34 (145) T pfam06551 1 MKKNLAATAIAAGLAVLATSAQAADSTAVLKVTG 34 (145) T ss_pred CCHHHHHHHHHHHHHHHHCCHHHCCCCCEEEEEE T ss_conf 9237899999999998603142257744699999 No 57 >pfam04338 DUF481 Protein of unknown function, DUF481. This family includes several proteins of uncharacterized function. Probab=93.14 E-value=0.44 Score=23.31 Aligned_cols=47 Identities=21% Similarity=0.328 Sum_probs=28.5 Q ss_pred CEEEEEEEEEEEECCCCCEEEEEEEEEE--ECCCC--CCCCEEEEEEEEEEC Q ss_conf 2047997346610048867999999997--50565--566226999998539 Q gi|254780598|r 158 AIGGTAGVGVEVGGLSESLVARLEYRAS--KYSKV--EGFYNTISLGVGMKF 205 (205) Q Consensus 158 ~~g~~~G~Gvey~~it~~~~~r~EY~y~--df~~~--~~~~~~~~~G~~ykF 205 (205) ..-+..-.|+++ .|+++|++++.|.+. .-... +-.-.+++++++|+| T Consensus 160 d~~~~~~~~l~~-~l~~~lsl~~~~~~~yds~P~~~~~~~D~~~~~~l~y~f 210 (210) T pfam04338 160 NTRLNSETGLTV-KLTDNLSLKLSYEVDYNSDPPAGKKKTDTTTSVSLGYSF 210 (210) T ss_pred CEEEEEEEEEEE-EEECEEEEEEEEEEEECCCCCCCCCCCCEEEEEEEEEEC T ss_conf 679999999999-830107999999999937799998874539999999989 No 58 >PRK10159 outer membrane phosphoporin protein E; Provisional Probab=93.04 E-value=0.46 Score=23.24 Aligned_cols=43 Identities=14% Similarity=0.319 Sum_probs=34.2 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEECCCCC----CCCEEEEEEEEEEC Q ss_conf 997346610048867999999997505655----66226999998539 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARLEYRASKYSKVE----GFYNTISLGVGMKF 205 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~EY~y~df~~~~----~~~~~~~~G~~ykF 205 (205) .+-+|++| .+++|+.+-+||+...+.+.+ ..-|.+.+|+.|.| T Consensus 305 ~~~~ga~Y-~Fnkn~~~y~~Yk~n~ld~~~~~~~~~dd~~~~GlrY~F 351 (351) T PRK10159 305 YIDVGATY-YFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF 351 (351) T ss_pred EEEEEEEE-EECCCCEEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEC T ss_conf 69986799-957981589988601557755568876788999618859 No 59 >pfam11059 DUF2860 Protein of unknown function (DUF2860). This bacterial family of proteins has no known function. Probab=92.61 E-value=0.53 Score=22.91 Aligned_cols=74 Identities=14% Similarity=-0.062 Sum_probs=44.4 Q ss_pred CEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECC--CCCEEEEEEEEEEECC-CC---CCCCEEEEEEEEEE Q ss_conf 677898854477644431345556775204799734661004--8867999999997505-65---56622699999853 Q gi|254780598|r 131 SLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGL--SESLVARLEYRASKYS-KV---EGFYNTISLGVGMK 204 (205) Q Consensus 131 ~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~i--t~~~~~r~EY~y~df~-~~---~~~~~~~~~G~~yk 204 (205) +..+-.+++++..+.+...+..........|.+-+..||..+ -++|++-+-+-|..-. ++ |.+..-+++||+|| T Consensus 218 ~h~~~lt~~y~~~~Yd~~NPIF~ktr~d~~~~~f~~Y~y~~~fg~~~~s~~~~agy~~~dSnI~FYD~~~~~~svGi~Y~ 297 (297) T pfam11059 218 RHSLVLTLSYGKREYDAANPIFNKTREDNEYGAFLAYEYANLFGWKNWSFNALAGYSTTDSNIDFYDESEYLVSVGLNYK 297 (297) T ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCEEEEEEEEECCCCCCCCEECCCCEEEEEECCCC T ss_conf 61599987132223333487524212377433677465406588755269998621235678626646434999612339 No 60 >PRK10002 outer membrane protein F; Provisional Probab=92.09 E-value=0.62 Score=22.56 Aligned_cols=43 Identities=16% Similarity=0.322 Sum_probs=33.6 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEECCCCC----CCCEEEEEEEEEEC Q ss_conf 997346610048867999999997505655----66226999998539 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARLEYRASKYSKVE----GFYNTISLGVGMKF 205 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~EY~y~df~~~~----~~~~~~~~G~~ykF 205 (205) .+-+|++| .+++|+..-+||+.-.+.+.+ ..-|.+.+|+-|.| T Consensus 316 ~~~lga~Y-~FnkN~~ty~~Yk~NllD~~~~~~~~~DD~~avGlrY~F 362 (362) T PRK10002 316 YFEVGATY-YFNKNMSTYVDYIINQIDSDNKLGVGSDDTVAVGIVYQF 362 (362) T ss_pred EEEEEEEE-EECCCEEEEEEEEHHCCCCCCCCCCCCCCEEEEEEEEEC T ss_conf 59987599-955870268766000567655568877787999718829 No 61 >smart00869 Autotransporter Autotransporter beta-domain. Secretion of protein products occurs by a number of different pathways in bacteria. One of these pathways known as the type IV pathway was first described for the IgA1 protease. The protein component that mediates secretion through the outer membrane is contained within the secreted protein itself, hence the proteins secreted in this way are called autotransporters. This family corresponds to the presumed integral membrane beta-barrel domain that transports the protein. This domain is found at the C-terminus of the proteins it occurs in. The N-terminus contains the variable passenger domain that is translocated across the membrane. Once the passenger domain is exported it is cleaved auto-catalytically in some proteins, in others a different peptidase is used and in some cases no cleavage occurs. Probab=92.03 E-value=0.63 Score=22.52 Aligned_cols=24 Identities=29% Similarity=0.524 Sum_probs=9.9 Q ss_pred EEEEEEEEEEECCCCCEEEEEEEEE Q ss_conf 4799734661004886799999999 Q gi|254780598|r 160 GGTAGVGVEVGGLSESLVARLEYRA 184 (205) Q Consensus 160 g~~~G~Gvey~~it~~~~~r~EY~y 184 (205) ...+++|+++ .+.+.+.+.++|.+ T Consensus 231 ~~~~~~g~~~-~~~~~~~~~~~~~~ 254 (261) T smart00869 231 AAELGLGLSA-KLSNGLSLSLNYDG 254 (261) T ss_pred EEEEEEEEEE-EECCCEEEEEEEEE T ss_conf 8999999999-98998899999999 No 62 >pfam05275 CopB Copper resistance protein B precursor (CopB). This family consists of several bacterial copper resistance proteins. Copper is essential and serves as cofactor for more than 30 enzymes yet a surplus of copper is toxic and leads to radical formation and oxidation of biomolecules. Therefore, copper homeostasis is a key requisite for every organism. CopB serves to extrude copper when it approaches toxic levels. Probab=91.80 E-value=0.67 Score=22.38 Aligned_cols=42 Identities=14% Similarity=0.211 Sum_probs=20.1 Q ss_pred EEEEEEEECCCCCEEEEEEEEEE-ECCCC---------CCCCEEEEEEEEEEC Q ss_conf 97346610048867999999997-50565---------566226999998539 Q gi|254780598|r 163 AGVGVEVGGLSESLVARLEYRAS-KYSKV---------EGFYNTISLGVGMKF 205 (205) Q Consensus 163 ~G~Gvey~~it~~~~~r~EY~y~-df~~~---------~~~~~~~~~G~~ykF 205 (205) +|+=+-| -|+..+.--+.+.|. .||+. +.+...+.+|+...| T Consensus 159 ~GlRlRY-Ei~R~fAPYvGv~~~~~~G~tad~~r~~g~~~~~~~~v~GvR~wF 210 (210) T pfam05275 159 LGLRLRY-EISREFAPYVGVEWERKFGDTADLARAAGEDTDETRFVAGVRFWF 210 (210) T ss_pred EEEEEEE-EEEECCCCEEEEEEEECCCCHHHHHHHCCCCCCCEEEEEEEEEEC T ss_conf 6568888-984043454879998603767699997499976508999999869 No 63 >PRK03757 hypothetical protein; Provisional Probab=91.80 E-value=0.11 Score=26.33 Aligned_cols=24 Identities=42% Similarity=0.513 Sum_probs=17.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||++|+++++++++++ ++|+|++ T Consensus 1 mkk~~l~~~l~a~~~~~-~~A~a~~ 24 (191) T PRK03757 1 MKKTLLGLTLGSLLFSA-GSAVAAD 24 (191) T ss_pred CCHHHHHHHHHHHHHHC-CCCCCCC T ss_conf 90439999999998630-4045752 No 64 >pfam01617 Surface_Ag_2 Surface antigen. This family includes a number of bacterial surface antigens expressed on the surface of pathogens. Probab=91.27 E-value=0.77 Score=22.07 Aligned_cols=174 Identities=13% Similarity=0.023 Sum_probs=88.2 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-----------CCC---- Q ss_conf 44378999999999841000024665435666665556345544444411542100012233-----------334---- Q gi|254780598|r 2 QKLFLAVGVSSLALASFCSAQAADPVRRAHHGGRGVVPTIATNRYVPIRHDFNGPYAGLSAL-----------YNG---- 66 (205) Q Consensus 2 kk~~la~av~~lal~~~~~A~Aad~~~~~~~~~~~~~~~~~~~~~~~~~~~w~G~y~g~~~g-----------~~~---- 66 (205) ||+++..++..+++.....+++........ ......-...+-|+...++.|.+.-.-. ... T Consensus 4 kk~fi~~~l~~l~~~lp~~sfs~~~~~~~~----~~~~fYis~qYkPs~~~f~~FsikE~~~~t~~~~~l~~d~~~~~~~ 79 (279) T pfam01617 4 KKFFIGAALATLLSFLPGSSFSDPVGPISD----GEGSFYIGLKYNPSISHFGTFSAKETGGTTKGVFGLKKDWDGVVIS 79 (279) T ss_pred CEEEHHHHHHHHHHHCCCHHHCCCCCCCCC----CCCEEEEEEEECCCCCCCCCEEEEECCCCCEEEEEECCCCCCCCCC T ss_conf 034299999999987763454063456767----7730999898876766656178973588740048722456543002 Q ss_pred -------CC---CCC-CCCC-CEEEEEEEEEEECCCEEEEEEEEEEECCCCCCCCCCCCC-------------------- Q ss_conf -------46---677-7776-026899854640483799845101321555666621223-------------------- Q gi|254780598|r 67 -------SF---GEE-AHHN-AGGSIFAGYNVEDSCIMYGVEGDVRYTVPVLADNIHSLH-------------------- 114 (205) Q Consensus 67 -------~~---~~~-~~~~-~~~G~~~GYn~q~~~~V~G~e~d~~~~~~~~~~~~~~~~-------------------- 114 (205) .. ... .++. .+.+.-+||-.. +.=+=+|+-+...+.+...+....+ T Consensus 80 ~~~~~~~~~~~y~~~f~~n~~~gfsGaiGYs~~--~~RiE~E~sye~Fdvk~~~~~~~~da~~y~al~r~~~~~~~~~~y 157 (279) T pfam01617 80 KHANFDFSVPGYSFKYENNLFTGFGGAIGYSMD--GPRVELEVSYERFDVKNQGNNYKNDAHKYAALTRDAAIAAADSKY 157 (279) T ss_pred CCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCC--CEEEEEEEEEEEECCCCCCCCCCCCCEEEEEEECCCCCCCCCCCE T ss_conf 445677777899855056884206899968149--988999987658335899876146615899986002666667746 Q ss_pred -------CEEEEEEEEEEEEEC---CCEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEE Q ss_conf -------214899989754312---7677898854477644431345556775204799734661004886799999999 Q gi|254780598|r 115 -------GIGGSLRIRGGYEVS---DSLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRA 184 (205) Q Consensus 115 -------~~~~~~~~r~Gy~~~---~~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y 184 (205) -...++..-+=|++. -.+.||+-+|...--++. ......+..|..=+|+.| .|++++++-++-.| T Consensus 158 ~v~kN~gis~~S~m~N~CYD~~~~~~~i~PY~C~GiG~d~i~~----~~~~~~K~aYq~KlGvsY-~ls~~i~lF~~gyY 232 (279) T pfam01617 158 VVLKNDGLTDISLMLNACYDIAHEGIPVSPYVCAGVGADLISV----FGAINPKFSYQGKAGLSY-PITPEISLFAGGHY 232 (279) T ss_pred EEEEECCEEEEEEEEEEEEECCCCCCCCCCEEEEECCHHHHHH----CCCCCEEEEEEEEEEEEE-ECCCCEEEEEEEEE T ss_conf 9999677886657999999622388620447877437746323----267533589999523788-73888599998899 Q ss_pred EE Q ss_conf 75 Q gi|254780598|r 185 SK 186 (205) Q Consensus 185 ~d 186 (205) +. T Consensus 233 Hk 234 (279) T pfam01617 233 HK 234 (279) T ss_pred EE T ss_conf 87 No 65 >pfam03502 Channel_Tsx Nucleoside-specific channel-forming protein, Tsx. Probab=90.92 E-value=0.84 Score=21.89 Aligned_cols=49 Identities=18% Similarity=0.250 Sum_probs=35.0 Q ss_pred CCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECC------CCCCCCEEEEEEEEEEC Q ss_conf 7752047997346610048867999999997505------65566226999998539 Q gi|254780598|r 155 TPIAIGGTAGVGVEVGGLSESLVARLEYRASKYS------KVEGFYNTISLGVGMKF 205 (205) Q Consensus 155 ~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~------~~~~~~~~~~~G~~ykF 205 (205) ..++.|...-.++-+ - +|+|++...|||.+=. ..+..-....+-|+||| T Consensus 205 ~~~s~g~~~~~~l~w-~-~~~~~~g~~~ry~~n~~~~~d~g~~~~g~~~y~~v~Y~F 259 (259) T pfam03502 205 NRTSNGGNGALGLYW-H-TDRWAVGYGYRYYDNKGQWKDLGQDSTGDGHYYSVGYKF 259 (259) T ss_pred CCCCCCCCEEEEEEE-C-CCCEEEEEEEEEEECCCCCCCCCCCCCCCEEEEEEEECC T ss_conf 877665210278997-4-775898888787633564335775543414999998529 No 66 >pfam02264 LamB LamB porin. Maltoporin (LamB protein) forms a trimeric structure which facilitates the diffusion of maltodextrins across the outer membrane of Gram-negative bacteria. The membrane channel is formed by an antiparallel beta-barrel. Probab=90.24 E-value=0.97 Score=21.56 Aligned_cols=110 Identities=11% Similarity=-0.032 Sum_probs=62.4 Q ss_pred CEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCCCC----CCCEEEEEEE Q ss_conf 379984510132155566662122321489998975431276778988544776444313455567----7520479973 Q gi|254780598|r 90 CIMYGVEGDVRYTVPVLADNIHSLHGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKAGEIT----PIAIGGTAGV 165 (205) Q Consensus 90 ~~V~G~e~d~~~~~~~~~~~~~~~~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~~~----~~~~g~~~G~ 165 (205) +|.++-.+-+................| .++.+|+-|.+++.+-+-..+||...+.+........+ +.-..+++-+ T Consensus 273 ~~~~~~~~~yq~~~d~~~~~~~~~~~~-~s~~vRP~y~~~~~~~~~~E~Gy~~~~~~~~~~~~~~~~~~~K~TlApt~s~ 351 (399) T pfam02264 273 NWEVAHALVYQRGNDIYSTADKGSYEW-YTVVVRPMYKWNQIMSTAFEGGYFNVKNKSTGGNDREDNGQYKLTLAQTWKV 351 (399) T ss_pred CEEEEEEEEEEECCCCCCCCCCCCEEE-EEEEEEEEEEECCCEEEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEECCC T ss_conf 753605355674234367778887479-9998677379657258999988988723667886435478368999640312 Q ss_pred EEEEECCCCCEEEEEEEEEEECCC---------CCCCCEEEEEEEEE Q ss_conf 466100488679999999975056---------55662269999985 Q gi|254780598|r 166 GVEVGGLSESLVARLEYRASKYSK---------VEGFYNTISLGVGM 203 (205) Q Consensus 166 Gvey~~it~~~~~r~EY~y~df~~---------~~~~~~~~~~G~~y 203 (205) |-++ . .+=.+|+=..|.+..+ .+.+.+++.+|+-. T Consensus 352 g~~f-~--sRPelR~yaTy~~w~~~~~~~~~~~~~~~~~~~~fGvQ~ 395 (399) T pfam02264 352 GSSF-W--ARPEIRFYATYLKWDEKEGKGNTFDSGNDDDEWTFGAQM 395 (399) T ss_pred CCCC-C--CCCEEEEEEEEEECCCHHHCCCCCCCCCCCCCEEEEEEE T ss_conf 7987-7--895799999997135012033665778877878999999 No 67 >PRK09934 putative fimbrial protein; Provisional Probab=90.17 E-value=0.22 Score=24.91 Aligned_cols=23 Identities=22% Similarity=0.205 Sum_probs=15.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 94437899999999984100002 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQA 23 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~A 23 (205) |||++|++.++++..+.+.+|.+ T Consensus 1 mkk~~l~~~~~l~~~~~~~aa~~ 23 (171) T PRK09934 1 MRRVFIAIFCGLLWSPLSQAASP 23 (171) T ss_pred CHHHHHHHHHHHHHHHHHHHCCC T ss_conf 92689999999998634421135 No 68 >TIGR01414 autotrans_barl outer membrane autotransporter barrel domain; InterPro: IPR006315 The secretion of protein products occurs by a number of different pathways in bacteria and several secretion mechanisms have been described for Gram-negative bacteria , an increasing number employ a highly efficient but simple mechanism first described for the immunoglobulin A1 (IgA1) proteases , . The autotransporter secretion pathway is a distinct secretion mechanism, in which the protein moiety mediating export through the outer membrane is contained within the precursor of the secreted protein itself. Autotransporters have been implicated as important or putative virulence factors such as mediating adhesion to host cells or by mediating actin-promoted bacterial mobility . The key feature of an autotransporter is that it contains all the information for secretion in the precursor of the secreted protein itself . Autotransporters comprise three functional domains: 1) an N-terminal targeting domain (amino-terminal leader sequence) that functions as a signal peptide to mediate targeting to and translocation across the inner membrane 2) a C-terminal translocation domain (carboxy-terminal) that forms a beta-barrel pore to allow the secretion of 3) the passenger domain, the secreted mature protein . This entry shows the C-terminal autotransporter domain, it is about 400 amino acids in length and includes the aromatic amino acid-rich OMP signal, typically ending with a Phe or Trp residue, at the extreme C-terminus.; GO: 0019867 outer membrane. Probab=89.68 E-value=1.1 Score=21.31 Aligned_cols=134 Identities=21% Similarity=0.168 Sum_probs=86.6 Q ss_pred CCCCCCCCEEEEEEEEEEECCCEEEEEEEEEEECCCCCCC-----CC----CC--CCCEEEEEEEEEE---EEE----CC Q ss_conf 6777776026899854640483799845101321555666-----62----12--2321489998975---431----27 Q gi|254780598|r 69 GEEAHHNAGGSIFAGYNVEDSCIMYGVEGDVRYTVPVLAD-----NI----HS--LHGIGGSLRIRGG---YEV----SD 130 (205) Q Consensus 69 ~~~~~~~~~~G~~~GYn~q~~~~V~G~e~d~~~~~~~~~~-----~~----~~--~~~~~~~~~~r~G---y~~----~~ 130 (205) ......++...+.+||++... +.+-....+.+....... .. .. ...+...+..+++ +.+ .. T Consensus 294 ~~~~~~g~~~~~~~g~~~~~~-~~~~P~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 372 (454) T TIGR01414 294 GKYNSNGLGASLEAGYRYNLG-WYLEPQAQLSYLGVSGDDYKESNGTRVLGGGDSGDSLQGRLGLRLGYKASQFDLGTGR 372 (454) T ss_pred CCCCCCCEEEEEECCCEEEEE-EEEECCEEEEEEEECCCCEECCCCEEEEECCCCCCCEEEECCEEECCCHHHHHCCCCC T ss_conf 444333114543114011004-6750003688874056420104653676056653200000010000100000003673 Q ss_pred CEEEEEEEEEEEEEE--EEC------CCCCCCCCCCEEEEEEEEEEEECCC-CCEEEEEEEEEEECCCCCCCCEEEEEEE Q ss_conf 677898854477644--431------3455567752047997346610048-8679999999975056556622699999 Q gi|254780598|r 131 SLLLYATVGPDVAQK--YET------GKAGEITPIAIGGTAGVGVEVGGLS-ESLVARLEYRASKYSKVEGFYNTISLGV 201 (205) Q Consensus 131 ~~l~Y~~~G~a~~~~--~~~------~~~~~~~~~~~g~~~G~Gvey~~it-~~~~~r~EY~y~df~~~~~~~~~~~~G~ 201 (205) .+-+|....+..... +.. ............+.+++|++. .++ +++.+.+++.+....+... ..+..+|+ T Consensus 373 ~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~~~~~~~~~~~~~~~~~g~~~~~-~~~~~~g~ 450 (454) T TIGR01414 373 ALKPYLKANWLHEFGGLGGNGVLVNGGVTLRNDGSGTRGELGLGVNA-SLKSPNLSLYLDVDYQLGGKGET-PWSGNLGV 450 (454) T ss_pred EEEEEEEEEEEEECCCCCCCEEEECCCCCCCCCCCCCEEEEECCEEE-EECCCCEEEEEEEECCCCCCCCC-CEEEEEEE T ss_conf 22146777653101477753157516520024444422443124024-43364215765310013665333-20355535 Q ss_pred EEEC Q ss_conf 8539 Q gi|254780598|r 202 GMKF 205 (205) Q Consensus 202 ~ykF 205 (205) .|+| T Consensus 451 ~~~~ 454 (454) T TIGR01414 451 RYSF 454 (454) T ss_pred EEEC T ss_conf 7739 No 69 >PRK10177 hypothetical protein; Provisional Probab=89.25 E-value=1.2 Score=21.14 Aligned_cols=49 Identities=10% Similarity=0.059 Sum_probs=33.7 Q ss_pred CCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEEEEEC Q ss_conf 56775204799734661004886799999999750565566226999998539 Q gi|254780598|r 153 EITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 153 ~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~~~~~~~G~~ykF 205 (205) +++.-+.=..+-+|+.|- --|=+++.+|++.-+ -....+.+.+.++||| T Consensus 246 ~~~~q~nP~A~T~GlnYT-PvPLvT~~a~~k~G~---~g~~dt~~~l~lnYr~ 294 (452) T PRK10177 246 SGTGYHNPVALSLGLNYT-PVPLVTVTAQHKQGE---SGVSQNNLGLNLNYRF 294 (452) T ss_pred CCCCCCCCHHEEECCCCC-CCCCEEEECCEECCC---CCCCCCEEEEEEEEEC T ss_conf 998766964178114540-521289610002167---8998755757998745 No 70 >PRK12696 flgH flagellar basal body L-ring protein; Reviewed Probab=88.70 E-value=0.79 Score=22.01 Aligned_cols=19 Identities=26% Similarity=0.382 Sum_probs=10.0 Q ss_pred CHHHHHHHHHHHHHHHHCCC Q ss_conf 94437899999999984100 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCS 20 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~ 20 (205) |||++++ .++++.|+++++ T Consensus 2 m~~~l~~-~~~~~~L~GC~~ 20 (238) T PRK12696 2 IRKLLAA-SCAVLLLSGCNA 20 (238) T ss_pred HHHHHHH-HHHHHHHHCCCC T ss_conf 5899999-999999516568 No 71 >PRK13513 putative outer membrane receptor; Provisional Probab=88.14 E-value=1.4 Score=20.73 Aligned_cols=42 Identities=17% Similarity=0.092 Sum_probs=23.9 Q ss_pred EEEEEEEEECCCCCEEEEEE--------EEEEECCCCCCCCEEEEEEEEEEC Q ss_conf 99734661004886799999--------999750565566226999998539 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARLE--------YRASKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~E--------Y~y~df~~~~~~~~~~~~G~~ykF 205 (205) ++-+.+.| .++++|.+.+. |++...+... .--++.+|+.|+| T Consensus 611 ~~dl~~~y-~~~~~~~l~~~v~NLfDk~Y~~~~~~~~~-~GR~~~~~~~y~F 660 (660) T PRK13513 611 FVDLGANY-QLTKELRLMGGVYNLLDKRVDIDVNDKVL-DGRRYMVGASYDF 660 (660) T ss_pred EEEEEEEE-EECCCEEEEEEEEECCCCCCCCCCCCCCC-CCCEEEEEEEEEC T ss_conf 99989999-97999899999990559941102488758-9909999999989 No 72 >COG5571 Autotransporter protein or domain, integral membrane beta-barrel involved in protein secretion [Cell motility and secretion] Probab=88.01 E-value=0.22 Score=24.85 Aligned_cols=133 Identities=19% Similarity=0.129 Sum_probs=72.0 Q ss_pred CCCCCCCCCEEEEEEEEEEECCCEEEEEEEEEEECCCCCCCC-----CC-----CCCCE---EEEEEEEEEEEECCCEEE Q ss_conf 667777760268998546404837998451013215556666-----21-----22321---489998975431276778 Q gi|254780598|r 68 FGEEAHHNAGGSIFAGYNVEDSCIMYGVEGDVRYTVPVLADN-----IH-----SLHGI---GGSLRIRGGYEVSDSLLL 134 (205) Q Consensus 68 ~~~~~~~~~~~G~~~GYn~q~~~~V~G~e~d~~~~~~~~~~~-----~~-----~~~~~---~~~~~~r~Gy~~~~~~l~ 134 (205) .++.....++.++.+||+|-++.++.|.-+++.|...+.+.. .. ....| ..++.-|+-+..++ +-| T Consensus 81 ~get~~~~~t~~L~~Gyef~~~~l~tGpvA~L~~~~t~v~~y~E~~~ssTal~f~dq~~~S~~sslGwrlr~~~~~-~~P 159 (239) T COG5571 81 LGETNAESWTARLSAGYEFPFGELRTGPVAGLHYDKTTVSGYNEQNASSTALGFTDQLWHSLVSSLGWRLRLRLAD-LQP 159 (239) T ss_pred CCCCCHHHHHHHCCCCCCCCCCCEEEECCHHCCCCCCEECCCCCCCCCCCCCCCCCCCHHHHHHCCCEEEECCCCC-CCC T ss_conf 3788723122210133554322123201000000443001212246641101566511566530354677322244-440 Q ss_pred EEEEEEEEEEEEE-------------CCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEE Q ss_conf 9885447764443-------------134555677520479973466100488679999999975056556622699999 Q gi|254780598|r 135 YATVGPDVAQKYE-------------TGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVEGFYNTISLGV 201 (205) Q Consensus 135 Y~~~G~a~~~~~~-------------~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~~~~~~~G~ 201 (205) |+..=|-.---+. ..-..+.+++-. -+=+|+++ .+++++..-++---.+-. .+...-..++|| T Consensus 160 yaqi~y~~ef~~d~~~aqagL~~m~~~~~~~~qd~nw~--~i~~Ga~~-~l~~~i~~~a~~~q~~~~-t~~n~~~ytLgv 235 (239) T COG5571 160 YAQIRYVREFGDDIWKAQAGLSRMTATNVLPSQDKNWV--DITAGADF-LLNSSIGAFAAASQRENS-TENNQYLYTLGV 235 (239) T ss_pred EEEEEHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHH--HEEEEEEE-CCCCHHHHHHHHHHHHCC-CCCCCCEEEEEE T ss_conf 14565787650123656410246600243566676641--30441012-234106899876444305-666631057753 Q ss_pred EEEC Q ss_conf 8539 Q gi|254780598|r 202 GMKF 205 (205) Q Consensus 202 ~ykF 205 (205) +-|| T Consensus 236 s~~F 239 (239) T COG5571 236 SARF 239 (239) T ss_pred ECCC T ss_conf 0239 No 73 >PRK13861 type IV secretion system protein VirB9; Provisional Probab=87.73 E-value=0.34 Score=23.92 Aligned_cols=28 Identities=29% Similarity=0.345 Sum_probs=17.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9443789999999998410000246654356 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAH 31 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~ 31 (205) |||+++++ +++|+++++|+|++.+...+ T Consensus 2 mk~l~~~~---~~~l~~~~~a~A~~~P~~s~ 29 (293) T PRK13861 2 IKKLFLTL---ACLLFAAIGALAEDTPAAGK 29 (293) T ss_pred CHHHHHHH---HHHHHHHHHHHHHHCCCCCC T ss_conf 08999999---99998612667875698888 No 74 >pfam06082 DUF940 Bacterial putative lipoprotein (DUF940). This family consists of hypothetical bacterial proteins several of which are described as putative lipoproteins. Probab=86.57 E-value=1.1 Score=21.26 Aligned_cols=42 Identities=24% Similarity=0.194 Sum_probs=23.8 Q ss_pred EEEEEEECCCCCEEEEEEEEEEECCCC----CCC-CEEEEEEEEEEC Q ss_conf 734661004886799999999750565----566-226999998539 Q gi|254780598|r 164 GVGVEVGGLSESLVARLEYRASKYSKV----EGF-YNTISLGVGMKF 205 (205) Q Consensus 164 G~Gvey~~it~~~~~r~EY~y~df~~~----~~~-~~~~~~G~~ykF 205 (205) =+|+||....++|.+++||.=.++... ..+ .+.+.+|+.||+ T Consensus 168 FgGiey~~p~~~l~lk~Eyd~~~y~~e~~~~~~~~~SpfNfGl~Y~l 214 (661) T pfam06082 168 FGGVEYQTPWEPLRLKAEYDGNDYSAEFAGRGLEQDSPFNFGAVYRL 214 (661) T ss_pred CCCEEEECCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCEEEEEEEC T ss_conf 34159856888808999976765444445677476886148899982 No 75 >pfam11924 DUF3442 Protein of unknown function (DUF3442). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 334 to 948 amino acids in length. Probab=86.45 E-value=1.8 Score=20.20 Aligned_cols=49 Identities=14% Similarity=0.101 Sum_probs=33.7 Q ss_pred CCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEEEEEC Q ss_conf 56775204799734661004886799999999750565566226999998539 Q gi|254780598|r 153 EITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 153 ~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~~~~~~~G~~ykF 205 (205) ..+.......+-+|++| -..|.+++.++|+.. +...+...+.++++|+| T Consensus 201 ~~~~q~~p~a~t~gl~y-tP~pllt~~~~~~~~---~~~~~~t~~~l~~ny~l 249 (280) T pfam11924 201 SGNLQKDPHAVTLGLNY-TPVPLLTVGADYRDG---KGGQDDTFVGLGLNYRL 249 (280) T ss_pred CCCCCCCCCEEEEEEEE-CCCCCEEEEEEEEEC---CCCCCCCEEEEEEEECC T ss_conf 66555686117998884-036877998736806---89986668999999778 No 76 >TIGR03014 EpsL exopolysaccharide biosynthesis operon protein EpsL. The epsL gene is described as a component of the methanolan exopolysaccharide biosynthesis operon in Methylobacillus sp strain 12S, although no other information regarding its possible function is suggested. Homologs of this gene are found in several other exopolysaccharide operons in a small number of species. These operons contain a subset of the methanolan operon genes by homology and synteny, including the epsH gene which is proposed to act as an "exosortase" directing proteins with a C-terminal tag (PEP-CTERM) to the exopolysaccharide layer. Each of the genomes in which these genes and epsL are found also encode genes with these C-terminal tags. Probab=85.28 E-value=2.1 Score=19.88 Aligned_cols=43 Identities=9% Similarity=0.177 Sum_probs=20.4 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEECC----CCCCCCEEEEEEEEEEC Q ss_conf 997346610048867999999997505----65566226999998539 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARLEYRASKYS----KVEGFYNTISLGVGMKF 205 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~EY~y~df~----~~~~~~~~~~~G~~ykF 205 (205) .+++++.| .+...+.+.+.|+|.+-. ..+-+.|.+.|.+-+.| T Consensus 335 ~~~l~~~Y-~p~R~l~ls~~~~~~~RdSN~~~~~Yd~n~~~ls~~~tf 381 (381) T TIGR03014 335 SGSLSLDW-SPVRAVRISAAFQREKRDSNSDGFDFDSNSVSVSATLTF 381 (381) T ss_pred EEEEEEEE-EEEEEEEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEC T ss_conf 87789998-320238887579985245765557512323899999989 No 77 >PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional Probab=85.05 E-value=0.29 Score=24.27 Aligned_cols=67 Identities=21% Similarity=0.257 Sum_probs=42.4 Q ss_pred CCEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEE---------EECCCCCEEEEEEEEEEECC--CCCCCCEEEE Q ss_conf 767789885447764443134555677520479973466---------10048867999999997505--6556622699 Q gi|254780598|r 130 DSLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVE---------VGGLSESLVARLEYRASKYS--KVEGFYNTIS 198 (205) Q Consensus 130 ~~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gve---------y~~it~~~~~r~EY~y~df~--~~~~~~~~~~ 198 (205) ..++||+.+- .+.-..+.+...-..|+|+- |++...+++||+||+|+--+ +.-...|.+- T Consensus 910 QtL~PYa~vq---------~n~wr~d~~~g~~~aGVGvrW~~W~~e~rYNA~~~~~~V~~EyQ~tl~~~~~~~~~~n~~~ 980 (987) T PRK09782 910 QTLEPYAHVQ---------DNGYRDKGTQGAQLGGVGVRWNIWTGETHYNAWPHKVSVGVEYQHTFKAINQRNGERNNAF 980 (987) T ss_pred CEECCEEEEC---------CCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCEECHHHHHHHHHHHHHCCCCCCCE T ss_conf 4532314440---------6651001224654432215776503786445677540330538988998875225545642 Q ss_pred EEEEEEC Q ss_conf 9998539 Q gi|254780598|r 199 LGVGMKF 205 (205) Q Consensus 199 ~G~~ykF 205 (205) +|++++| T Consensus 981 ~~~~~~~ 987 (987) T PRK09782 981 LTIGVHW 987 (987) T ss_pred EEEECCC T ss_conf 6741469 No 78 >pfam07424 TrbM TrbM. This family contains the bacterial protein TrbM (approximately 180 residues long). In Comamonas testosteroni T-2, TrbM is derived from the IncP1beta plasmid pTSA, which encodes the widespread genes for p-toluenesulfonate (TSA) degradation. Probab=83.76 E-value=0.68 Score=22.34 Aligned_cols=25 Identities=40% Similarity=0.402 Sum_probs=16.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |||++++++++..++++. +|++.|+ T Consensus 1 MKK~ll~~al~~~~~~s~-~A~~~d~ 25 (189) T pfam07424 1 MKKILLAVALAFGMFGSA-SAQKLDV 25 (189) T ss_pred CCHHHHHHHHHHHHHHCC-CCCHHHH T ss_conf 920179999999997147-6420034 No 79 >pfam12094 DUF3570 Protein of unknown function (DUF3570). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 396 to 444 amino acids in length. Probab=83.15 E-value=2.6 Score=19.38 Aligned_cols=76 Identities=9% Similarity=0.011 Sum_probs=41.1 Q ss_pred EEEECCCEEEEEEEEEEEEEEEECCC-----------------CCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEEC Q ss_conf 54312767789885447764443134-----------------5556775204799734661004886799999999750 Q gi|254780598|r 125 GYEVSDSLLLYATVGPDVAQKYETGK-----------------AGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKY 187 (205) Q Consensus 125 Gy~~~~~~l~Y~~~G~a~~~~~~~~~-----------------~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df 187 (205) .=.++.++++=...++...+--.+.+ .+.-.+++.-..+++...| +|.++..+|++|||.. T Consensus 210 SQVl~k~~~~~~n~~~~~~~GyLsdPYk~v~~~D~~~~~~~~~~E~rPd~R~k~~~~~r~~~-~l~~~~~l~~~YRyy~- 287 (407) T pfam12094 210 TQVLTKNTLVQLNLEYSSSSGYLSDPYKVVSVVDSNGPAQLYLYENRPDTRTKQAVYLRAKY-YLPDRAALRFSYRYYS- 287 (407) T ss_pred EEEECCCEEEEEEEEEEECCEECCCCCEEEEEECCCCCCCCCCHHCCCCHHHHHHHHHHEEE-ECCCCCEEEEEEEEEE- T ss_conf 89955151999988788702363798179997347764122212119862332010102202-0587725678778884- Q ss_pred CCCCCCCEEEEEEEE Q ss_conf 565566226999998 Q gi|254780598|r 188 SKVEGFYNTISLGVG 202 (205) Q Consensus 188 ~~~~~~~~~~~~G~~ 202 (205) .+=.++.||+-+... T Consensus 288 DdWgi~sHT~~~~~~ 302 (407) T pfam12094 288 DDWGITSHTLEAEYR 302 (407) T ss_pred CCCCCCEEEEEEEEE T ss_conf 677751068616999 No 80 >PRK13697 cytochrome c6; Provisional Probab=83.09 E-value=0.8 Score=21.99 Aligned_cols=27 Identities=33% Similarity=0.289 Sum_probs=19.1 Q ss_pred CHHHHHHHHHHHHHHH--HCCCCCCCCCC Q ss_conf 9443789999999998--41000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALA--SFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~--~~~~A~Aad~~ 27 (205) |||++..+.++.+.+. ..++|+|+|+. T Consensus 1 Mk~l~~~~l~~~~~~~~~~~~pA~Aad~~ 29 (111) T PRK13697 1 MKKILKLVLLTLLLLTFAFTSPAFAADAA 29 (111) T ss_pred CHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 97899999999999999803554441675 No 81 >pfam05079 DUF680 Protein of unknown function (DUF680). This family contains several uncharacterized proteins which seem to be found exclusively in Rhizobium loti. Probab=83.09 E-value=0.73 Score=22.18 Aligned_cols=24 Identities=33% Similarity=0.268 Sum_probs=17.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||+.|+++. .|.+.++|||.... T Consensus 1 M~KiaLtaAA---iL~asgsAFAGSD~ 24 (73) T pfam05079 1 MKKIALTAAA---ILAASGSAFAGSDN 24 (73) T ss_pred CCHHHHHHHH---HHHHCCCCCCCCCC T ss_conf 9224899999---99970652213565 No 82 >pfam07996 T4SS Type IV secretion system proteins. Members of this family are components of the type IV secretion system. They mediate intracellular transfer of macromolecules via a mechanism ancestrally related to that of bacterial conjugation machineries. Probab=82.95 E-value=0.86 Score=21.81 Aligned_cols=26 Identities=31% Similarity=0.351 Sum_probs=18.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||.++++++ +++++.+++|+|+..+ T Consensus 1 Mkk~~~~~~~-~~~l~~~~~a~a~GIP 26 (217) T pfam07996 1 MKKKLIILAL-ALALLAASPAAAAGIP 26 (217) T ss_pred CCHHHHHHHH-HHHHHCCCHHHCCCCC T ss_conf 9337999999-9998477777608998 No 83 >PRK09701 D-allose transporter subunit; Provisional Probab=82.21 E-value=0.96 Score=21.58 Aligned_cols=27 Identities=26% Similarity=0.096 Sum_probs=21.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||++..++.++++++.+++|.|+.-. T Consensus 1 m~k~~~~~~~~~~~~~~~~~a~a~~~i 27 (311) T PRK09701 1 MNKYLKYFSGTLVGLMLSTSAFAAAEY 27 (311) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCE T ss_conf 936999999999999988877436849 No 84 >PRK00031 lolA outer-membrane lipoprotein carrier protein; Reviewed Probab=81.88 E-value=0.79 Score=22.02 Aligned_cols=25 Identities=36% Similarity=0.367 Sum_probs=16.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||+++++.+.++++++.+.|.+.+ T Consensus 1 mk~~~~~~~~~~~l~~~~~~A~~~~ 25 (201) T PRK00031 1 MKKLLIAALLAAALVASSAFADAAS 25 (201) T ss_pred CCHHHHHHHHHHHHHHHHHCCCHHH T ss_conf 9159999999999877664502899 No 85 >pfam11231 DUF3034 Protein of unknown function (DUF3034). This family of proteins with unknown function appears to be restricted to Proteobacteria. Probab=81.73 E-value=2.9 Score=19.08 Aligned_cols=57 Identities=23% Similarity=0.140 Sum_probs=30.7 Q ss_pred EECCCEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEE Q ss_conf 3127677898854477644431345556775204799734661004886799999999 Q gi|254780598|r 127 EVSDSLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRA 184 (205) Q Consensus 127 ~~~~~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y 184 (205) .++.++|+=+++=+..++...=-.++.+.....-+..-+-+-+ +++++|.+.+|||. T Consensus 148 ~~g~nll~N~TlR~TkANQ~GLLGFGg~~~d~y~~~~E~S~a~-ll~~~~avG~EYR~ 204 (257) T pfam11231 148 LFGRNLLVNGTLRATKANQLGLLGFGGDRNDSYELQPEGSVAV-LLSRNLAVGVEYRQ 204 (257) T ss_pred CCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEE-EECCCEEEEEEEEC T ss_conf 2560179987886502431340112788898606867148999-83586488777202 No 86 >PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed Probab=81.36 E-value=1.7 Score=20.32 Aligned_cols=26 Identities=38% Similarity=0.289 Sum_probs=20.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |+|+...+.+|.|+|+++ +|+|++.. T Consensus 1 MrK~tal~~AS~LaLGs~-~a~Aa~~~ 26 (162) T PRK12751 1 MRKVTTLVMASMFVLGSS-AAFAADNT 26 (162) T ss_pred CHHHHHHHHHHHHHHHHH-HHHHCCCC T ss_conf 917899999999999999-99874378 No 87 >PRK10363 cpxP periplasmic repressor CpxP; Reviewed Probab=81.33 E-value=1.5 Score=20.53 Aligned_cols=24 Identities=33% Similarity=0.333 Sum_probs=19.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 944378999999999841000024 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAA 24 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aa 24 (205) |||+...+.+|.|++++.+.|.++ T Consensus 1 MRklTA~vlAStLalgs~s~aa~~ 24 (166) T PRK10363 1 MRIVTAAVMASTLAVSSLSHAAEV 24 (166) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 906999999999999899986245 No 88 >pfam09411 PagL Lipid A 3-O-deacylase (PagL). PagL is an outer membrane protein with lipid A 3-O-deacylase activity. It forms an 8 stranded beta barrel structure. Probab=81.13 E-value=3.1 Score=18.97 Aligned_cols=71 Identities=20% Similarity=0.236 Sum_probs=38.4 Q ss_pred EEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCC-----CCCCEEEEEEEEEEC Q ss_conf 77898854477644431345556775204799734661004886799999999750565-----566226999998539 Q gi|254780598|r 132 LLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKV-----EGFYNTISLGVGMKF 205 (205) Q Consensus 132 ~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~-----~~~~~~~~~G~~ykF 205 (205) .-+|..+|....=.+.+.....+=.+..-+...+|+-| .+.+++ +++|||..+++. .-.+|++.+-++|+| T Consensus 64 ~~~f~E~GiG~~~~s~~~~~~~~lg~~~~F~~~~g~g~-~~~~~~--~~~~~~~H~SNa~i~~~N~G~~~~~l~~~~~F 139 (139) T pfam09411 64 FKPFVEAGIGVALLSRTRDEDKDLGTRFQFRDQLGAGY-QFGGRP--RVGYRYQHYSNAGIKDPNPGLNSVGLRLGYSF 139 (139) T ss_pred CCEEEEEEEEEEEECCCCCCCCCCCCEEEEECCEEEEE-EECCCC--EEEEEEEEECCCCCCCCCCCCCEEEEEEEEEC T ss_conf 47389987654886152138986666087200188899-976886--58999999517887999889015879998439 No 89 >PRK09733 putative fimbrial protein; Provisional Probab=80.07 E-value=0.95 Score=21.60 Aligned_cols=23 Identities=22% Similarity=0.260 Sum_probs=15.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |||.+++++++++ ++++++|++- T Consensus 2 mKk~ll~~~~~~~---~s~~~~A~~~ 24 (181) T PRK09733 2 FKKSLLVAGVAMI---LSNNVFADEG 24 (181) T ss_pred HHHHHHHHHHHHH---HHHHHHHHCC T ss_conf 0688999999999---8666576448 No 90 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=79.99 E-value=3.4 Score=18.76 Aligned_cols=13 Identities=23% Similarity=0.161 Sum_probs=9.2 Q ss_pred CHHHHHHHHHHHH Q ss_conf 9443789999999 Q gi|254780598|r 1 MQKLFLAVGVSSL 13 (205) Q Consensus 1 Mkk~~la~av~~l 13 (205) |||+++.+++..| T Consensus 1 M~r~~l~~~~l~L 13 (224) T PRK12698 1 MARYILLALALLL 13 (224) T ss_pred CHHHHHHHHHHHH T ss_conf 9779999999998 No 91 >pfam04234 CopC Copper resistance protein CopC. CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm. Probab=79.69 E-value=2.4 Score=19.56 Aligned_cols=35 Identities=31% Similarity=0.348 Sum_probs=24.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCC-CCCCCCCCCCCC Q ss_conf 94437899999999984100002-466543566666 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQA-ADPVRRAHHGGR 35 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~A-ad~~~~~~~~~~ 35 (205) ||++.+.+.+.++++..+.+|+| +.+....|.... T Consensus 1 M~~~~~~~~~~~~~~~~~~~A~AH~~l~~s~P~~ga 36 (120) T pfam04234 1 MRRLLLAAALLALLLLLAGAASAHAKLVSSTPAAGA 36 (120) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 977999999999998740321315442432799949 No 92 >PRK13893 conjugal transfer protein TrbM; Provisional Probab=79.22 E-value=1.1 Score=21.35 Aligned_cols=26 Identities=35% Similarity=0.379 Sum_probs=18.2 Q ss_pred CHHHHHHHHHHHHHHH-HCCCCCCCCC Q ss_conf 9443789999999998-4100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALA-SFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~-~~~~A~Aad~ 26 (205) |||.++.+++.++++. +++++.|.|+ T Consensus 1 m~kk~l~~a~~~~a~~~~~g~~~a~d~ 27 (193) T PRK13893 1 MKKKLLALAAAVAALGSAAGTASAQDV 27 (193) T ss_pred CCCHHHHHHHHHHHHCCCCCCCCHHHH T ss_conf 961488999999997047564414355 No 93 >pfam04076 BOF Bacterial OB fold (BOF) protein. Proteins in this family form an OB-fold. Analysis of the predicted binding site of BOF family proteins implies that they lack nucleic acid-binding properties. They contain an predicted N-terminal signal peptide which indicates that they localize in the periplasm where they may function to bind proteins, small molecules, or other typical OB-fold ligands. As hypothesized for the distantly related OB-fold containing bacterial enterotoxins, the loss of nucleotide-binding function and the rapid evolution of the BOF ligand-binding site may be associated with the presence of BOF proteins in mobile genetic elements and their potential role in bacterial pathogenicity. Probab=79.07 E-value=1.6 Score=20.50 Aligned_cols=21 Identities=52% Similarity=0.521 Sum_probs=12.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||+++++++++ ++ .+++|++ T Consensus 1 MKK~~i~~~~~l---~s-~~~~Aa~ 21 (126) T pfam04076 1 MKKLAIALASAL---AS-TSALAAD 21 (126) T ss_pred CCHHHHHHHHHH---HH-CHHHHHH T ss_conf 956899999999---84-6787976 No 94 >PRK03577 acid shock protein precursor; Provisional Probab=78.97 E-value=1.5 Score=20.65 Aligned_cols=33 Identities=24% Similarity=0.202 Sum_probs=22.0 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 94437899999999984100002466543566666 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHHGGR 35 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~~~~ 35 (205) |||+|..+-. ..++.++.++|++.....+..+. T Consensus 1 MkKvLalvva--A~mGLSSvafAaeT~~~a~~a~~ 33 (102) T PRK03577 1 MKKVLALVVA--AAMGLSSAAFAAETATTAAPAAT 33 (102) T ss_pred CCHHHHHHHH--HHHCHHHHHHHHCCCCCCCCCCC T ss_conf 9058999999--98241277787514667887764 No 95 >PRK12700 flgH flagellar basal body L-ring protein; Reviewed Probab=78.47 E-value=3.8 Score=18.50 Aligned_cols=20 Identities=25% Similarity=0.388 Sum_probs=7.5 Q ss_pred CHHHHHHHHHHHH-HHHHCCC Q ss_conf 9443789999999-9984100 Q gi|254780598|r 1 MQKLFLAVGVSSL-ALASFCS 20 (205) Q Consensus 1 Mkk~~la~av~~l-al~~~~~ 20 (205) ||+++....+.++ ++.+.|+ T Consensus 3 ~~t~~Rl~~~~~~~~ll~GCa 23 (230) T PRK12700 3 LKTVLRLPVCAALLALAAGCA 23 (230) T ss_pred HHHHHHHHHHHHHHHHHHCCC T ss_conf 588999999999999974446 No 96 >COG3137 Putative salt-induced outer membrane protein [Cell envelope biogenesis, outer membrane] Probab=77.32 E-value=4.1 Score=18.32 Aligned_cols=112 Identities=17% Similarity=0.130 Sum_probs=47.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEEEECCCE Q ss_conf 42100012233334466777776026899854640483799845101321555666621223214899989754312767 Q gi|254780598|r 53 FNGPYAGLSALYNGSFGEEAHHNAGGSIFAGYNVEDSCIMYGVEGDVRYTVPVLADNIHSLHGIGGSLRIRGGYEVSDSL 132 (205) Q Consensus 53 w~G~y~g~~~g~~~~~~~~~~~~~~~G~~~GYn~q~~~~V~G~e~d~~~~~~~~~~~~~~~~~~~~~~~~r~Gy~~~~~~ 132 (205) |.| .+..||....++........-+-.=|+-- ..--++.++++... ..+...++..+ .+..|--|.+.|+. T Consensus 50 w~g---sve~Gy~a~sGNT~sesLnarlk~~y~~g-~~~a~~lw~n~sn~---s~n~e~s~~ry--~~~~~sry~l~~~~ 120 (262) T COG3137 50 WEG---SVEAGYLAQSGNTTSESLNARLKMTYYGG-QITAYSLWGNASNT---SSNDERSSERY--AQGGRSRYNLTDYD 120 (262) T ss_pred CCC---CCCEEEEECCCCCCCHHCCCCCEEEEECC-CCCHHHHHHHHHCC---CCCCHHHHHHH--HHCCEEEECCCCCE T ss_conf 666---52300251368853210022100155157-64108876544126---65431357887--53021222356622 Q ss_pred EEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEE Q ss_conf 7898854477644431345556775204799734661004886799999 Q gi|254780598|r 133 LLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLE 181 (205) Q Consensus 133 l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~E 181 (205) ++|+..=+- .-+.. . -+.+.-.+.|.|-++ +-+|.+++|.| T Consensus 121 y~~Gqf~~l--tdRy~--~---Y~~~~tlt~GyG~qf-~ng~~~sl~~e 161 (262) T COG3137 121 YLFGQFSWL--TDRYN--G---YDRRDTLTAGYGRQF-LNGPVHSLRFE 161 (262) T ss_pred EEEEEEECH--HHHHC--C---CCCCCEEECCCCEEE-ECCCCEEEEEE T ss_conf 676322210--12321--3---111121203443243-04872478984 No 97 >PRK09495 glnH glutamine ABC transporter periplasmic protein; Reviewed Probab=76.18 E-value=1.9 Score=20.04 Aligned_cols=25 Identities=32% Similarity=0.305 Sum_probs=16.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |||++..+.+ ++++..+++|+|++- T Consensus 1 mk~~~~~~~~-~~~~~~~~~~~a~~~ 25 (247) T PRK09495 1 MKSVLKVSLA-ALTLAFAVSSHAADK 25 (247) T ss_pred CHHHHHHHHH-HHHHHHHHHHHHCCC T ss_conf 9779999999-999999876651499 No 98 >COG4571 OmpT Outer membrane protease [Cell envelope biogenesis, outer membrane] Probab=75.94 E-value=4.5 Score=18.11 Aligned_cols=47 Identities=15% Similarity=0.133 Sum_probs=32.2 Q ss_pred CEEEEEEEEEEEECCCCCEEEEEEEEEEECCCC-----------------------CCCCEEEEEEEEEEC Q ss_conf 204799734661004886799999999750565-----------------------566226999998539 Q gi|254780598|r 158 AIGGTAGVGVEVGGLSESLVARLEYRASKYSKV-----------------------EGFYNTISLGVGMKF 205 (205) Q Consensus 158 ~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~-----------------------~~~~~~~~~G~~ykF 205 (205) ..=+.+++-+-| .+|+|-.+-+|--|.+...- +....++++|+-||| T Consensus 245 s~yy~~~i~~GY-y~t~nak~fve~ay~k~~~~kG~tqt~d~~g~t~~~~~d~aGi~~~~~tvt~Glky~F 314 (314) T COG4571 245 SRYYSVVINAGY-YVTENAKVFVEGAYNKVTEGKGGTQTYDKAGDTSDYGGDAAGIENYNYTVTAGLKYRF 314 (314) T ss_pred CCEEEEEEEEEE-EECCCCEEEEEEEEEEEECCCCCCEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEECC T ss_conf 834899873147-9867834999877777750568725675168636667753333333148998777619 No 99 >PRK03641 hypothetical protein; Provisional Probab=73.94 E-value=1.9 Score=20.09 Aligned_cols=27 Identities=22% Similarity=0.071 Sum_probs=19.0 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 944378999999999841000024665435 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRA 30 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~ 30 (205) |||.+++ ++|++|.+.+++|+.+..+. T Consensus 1 m~~~~i~---~llal~~S~~~~A~tL~lp~ 27 (220) T PRK03641 1 MKTGIVT---TLLALCLPVSVFATTLRLSP 27 (220) T ss_pred CCHHHHH---HHHHHHHHHHHHEEEEECCC T ss_conf 9116999---99999862364504664798 No 100 >PRK12701 flgH flagellar basal body L-ring protein; Reviewed Probab=73.93 E-value=5.2 Score=17.83 Aligned_cols=25 Identities=32% Similarity=0.319 Sum_probs=16.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |+++.+++.+.+++|.+.|++.... T Consensus 1 m~r~~~~~~~la~~lL~GC~~~~~~ 25 (230) T PRK12701 1 MNRLNIAVSCLATALLFGCEALHPP 25 (230) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 9258999999999997556688999 No 101 >pfam04575 DUF560 Protein of unknown function (DUF560). Family of hypothetical bacterial proteins. Probab=73.27 E-value=5.4 Score=17.74 Aligned_cols=113 Identities=14% Similarity=0.042 Sum_probs=56.6 Q ss_pred CCCCCCEEEEEEEEEEECCC--EEEEEEEEEEECCCCCCC-------------------------------CCCCCCCEE Q ss_conf 77776026899854640483--799845101321555666-------------------------------621223214 Q gi|254780598|r 71 EAHHNAGGSIFAGYNVEDSC--IMYGVEGDVRYTVPVLAD-------------------------------NIHSLHGIG 117 (205) Q Consensus 71 ~~~~~~~~G~~~GYn~q~~~--~V~G~e~d~~~~~~~~~~-------------------------------~~~~~~~~~ 117 (205) .+.+.....+.+||.++..+ +-++...+..+.+.+.-. .....+... T Consensus 92 ~~yd~~~~r~~~G~~y~~~~~~~~~~P~~e~~~~g~~~y~~~~G~~~~~~~~l~~~~~~~~~~~~~~~~y~~~~~~dg~~ 171 (301) T pfam04575 92 KKYNELSVRASLGYGYKNAKQDVSVLPFVEKRWYGGDRYSWANGVRLEGSYWLSPKWQLSTALEYKQERYFERKHLDGNI 171 (301) T ss_pred CCCCHHHHCEEECEEECCCCCEEEEEEEEEEEECCCCHHHHCCCEEEEEEEECCCCCEEEEHHHHHHHHCCCCCCCCCCC T ss_conf 54341221036240040677589997869999838970321456789999987947197321345665426010657621 Q ss_pred EEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEE--EEEEECCCC Q ss_conf 8999897543127677898854477644431345556775204799734661004886799999--999750565 Q gi|254780598|r 118 GSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLE--YRASKYSKV 190 (205) Q Consensus 118 ~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~E--Y~y~df~~~ 190 (205) ..+.+-+-|..+++..+|+-+.+...+.+. ...+....|..+|.+-|. ...+++++. |+..+|.+. T Consensus 172 ~~~~~tl~~~~~~~~~~~~g~~~~~~~~~~----~~~sy~~~g~rlG~~~~~---~~gls~~l~~s~~~r~y~~~ 239 (301) T pfam04575 172 HFVSLTLIYLPNPSQYWFGGLDFYREDTRD----KADSYDRKGVRLGWGQEW---GGGLSTRLSASYAKRQYKDK 239 (301) T ss_pred EEEEEEEEEEECCCCEEEEEEEEEEECCCC----CHHHHHHCEEEEEEEEEE---CCCEEEEEEEEEEEEECCCC T ss_conf 689999999938983999989998702456----101122123899999980---79888999999888504776 No 102 >PRK12450 foldase protein PrsA; Reviewed Probab=73.07 E-value=2.3 Score=19.67 Aligned_cols=21 Identities=29% Similarity=0.347 Sum_probs=14.6 Q ss_pred CHHHHHHHH--HHHHHHHHCCCC Q ss_conf 944378999--999999841000 Q gi|254780598|r 1 MQKLFLAVG--VSSLALASFCSA 21 (205) Q Consensus 1 Mkk~~la~a--v~~lal~~~~~A 21 (205) |||+++++. +++++|+++++. T Consensus 4 mKK~~~~~~~~~svl~LaAC~s~ 26 (309) T PRK12450 4 MNKLITGVVTLATVVTLSACQSS 26 (309) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99999999999999999860589 No 103 >PRK02889 tolB translocation protein TolB; Provisional Probab=72.59 E-value=2.2 Score=19.74 Aligned_cols=24 Identities=38% Similarity=0.216 Sum_probs=20.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 944378999999999841000024 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAA 24 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aa 24 (205) |+|+.+.+.+++++++++++|+|. T Consensus 4 ~~~l~~~~~~~~~l~~~~~~A~A~ 27 (430) T PRK02889 4 MTKLGLRALVASCLIAAGGAAHAQ 27 (430) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 998999999999998621020278 No 104 >PRK11260 cystine transporter subunit; Provisional Probab=72.40 E-value=3.1 Score=19.00 Aligned_cols=26 Identities=12% Similarity=0.063 Sum_probs=13.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) +||++++++..+++.+...++.|++. T Consensus 5 ~~~~~~~~l~~~~~~~~~~~~~a~~~ 30 (264) T PRK11260 5 GRQALMGVMAVALVAGMSVKSFADEG 30 (264) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 79999999999999983576543424 No 105 >PRK03095 prsA peptidylprolyl isomerase; Reviewed Probab=72.39 E-value=3 Score=19.04 Aligned_cols=25 Identities=32% Similarity=0.534 Sum_probs=14.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||.++++++.+++..++|+....+ T Consensus 1 MKK~~la~~~~svl~LaaC~~~~~~ 25 (287) T PRK03095 1 MKKAMLALAATSVIALSACGTSSSD 25 (287) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9078999999999999851689987 No 106 >PRK10954 periplasmic protein disulfide isomerase I; Provisional Probab=72.29 E-value=4.6 Score=18.08 Aligned_cols=21 Identities=48% Similarity=0.517 Sum_probs=12.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||++++++... ++. +|+|++ T Consensus 1 mkk~~~~~~~ll--ls~--~a~Aa~ 21 (207) T PRK10954 1 MKKIWLALAGMV--LAF--SASAAQ 21 (207) T ss_pred CCHHHHHHHHHH--HHH--HHHHHH T ss_conf 915999999999--997--722503 No 107 >TIGR02953 penta_MxKDx pentapeptide MXKDX repeat protein; InterPro: IPR014299 Members of this protein family are small bacterial proteins, each with an N-terminal signal sequence followed by up to 11 imperfect repeats of a pentapeptide. The pentapeptide repeat is usually of the form Met-Xaa-Lys-Asp-Xaa.. Probab=72.18 E-value=2.7 Score=19.27 Aligned_cols=24 Identities=25% Similarity=0.379 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHH--HHHHCCCCCCC-C Q ss_conf 443789999999--99841000024-6 Q gi|254780598|r 2 QKLFLAVGVSSL--ALASFCSAQAA-D 25 (205) Q Consensus 2 kk~~la~av~~l--al~~~~~A~Aa-d 25 (205) |+|++++.+++| .|+.+..|+|. | T Consensus 1 ~~i~~~~~~a~f~~~l~~ap~A~AqnD 27 (89) T TIGR02953 1 KRIVLAISAAAFSAGLSLAPAAFAQND 27 (89) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 879999999999999885105542157 No 108 >PRK10259 hypothetical protein; Provisional Probab=70.74 E-value=3.3 Score=18.79 Aligned_cols=31 Identities=29% Similarity=0.399 Sum_probs=21.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 94437899999999984100002466543566 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHH 32 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~ 32 (205) ||.+-..+++.+|...++ .++||+++...+. T Consensus 1 MK~ik~~~aa~aLs~~SF-gsfAA~~v~~~qa 31 (86) T PRK10259 1 MKTINTVVAAMALSTLSF-GVFAAEPVTASQA 31 (86) T ss_pred CCHHHHHHHHHHHHHHHH-HHHHCCCCCCCCC T ss_conf 914899999999998226-7654141463221 No 109 >PRK10942 serine endoprotease; Provisional Probab=70.12 E-value=4.3 Score=18.23 Aligned_cols=19 Identities=32% Similarity=0.298 Sum_probs=13.7 Q ss_pred CHHHHHHHHHHHHHHHHCC Q ss_conf 9443789999999998410 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFC 19 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~ 19 (205) |||..|++++..|+++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~ 19 (474) T PRK10942 1 MKKTTLALSALALSLGLAL 19 (474) T ss_pred CCCHHHHHHHHHHHHHHHC T ss_conf 9721457999999987640 No 110 >pfam10566 Glyco_hydro_97 Glycoside hydrolase 97. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands. Probab=69.51 E-value=3.2 Score=18.92 Aligned_cols=32 Identities=16% Similarity=0.106 Sum_probs=19.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 94437899999999984100002466543566 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHH 32 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~ 32 (205) |||+++.+.+..++++..+.+.+-......|. T Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~k~~~v~SPD 32 (643) T pfam10566 1 MKKLTIILLAFLLLIGNLSLAARKNYQLTSPN 32 (643) T ss_pred CCCHHHHHHHHHHHHHHHHHHHCCEEEEECCC T ss_conf 95025789999999976567541623899989 No 111 >pfam07396 Porin_O_P Phosphate-selective porin O and P. This family represents a conserved region approximately 400 residues long within the bacterial phosphate-selective porins O and P. These are anion-specific porins, the binding site of which has a higher affinity for phosphate than chloride ions. Porin O has a higher affinity for polyphosphates, while porin P has a higher affinity for orthophosphate. In P. aeruginosa, porin O was found to be expressed only under phosphate-starvation conditions during the stationary growth phase. Probab=69.44 E-value=6.6 Score=17.27 Aligned_cols=44 Identities=14% Similarity=0.205 Sum_probs=32.7 Q ss_pred EEEEEEEEEECCCCCEEEEEEEEEEECCCC-----CCCCEEEEEEEEEEC Q ss_conf 799734661004886799999999750565-----566226999998539 Q gi|254780598|r 161 GTAGVGVEVGGLSESLVARLEYRASKYSKV-----EGFYNTISLGVGMKF 205 (205) Q Consensus 161 ~~~G~Gvey~~it~~~~~r~EY~y~df~~~-----~~~~~~~~~G~~ykF 205 (205) =.+.+|+.| -+++|+-+.++|.|+++.+. +.+.+.+.+.+-|.| T Consensus 313 ~~~t~GlNw-Y~n~n~r~~~nY~~~~~~~~~~~~~~~~~~~~~~R~Q~~F 361 (361) T pfam07396 313 DSWTLGVNW-YVKGNLKFMLNYIKVDSDRRTPATNDDTPDAVTMRLQYDF 361 (361) T ss_pred EEEEEEEEE-EECCCEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEC T ss_conf 999999999-9878899999999856466665678888676999998729 No 112 >PRK10780 periplasmic chaperone; Provisional Probab=68.65 E-value=2.6 Score=19.32 Aligned_cols=22 Identities=59% Similarity=0.667 Sum_probs=13.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||+++++++ .|.++.+|.|+. T Consensus 1 mKk~l~~~~l---~l~l~~~a~Aa~ 22 (161) T PRK10780 1 MKKWLLAAGL---GLALATSAQAAD 22 (161) T ss_pred CHHHHHHHHH---HHHHHHHHHHHH T ss_conf 9589999999---998588887753 No 113 >cd07306 Porin3_VDAC Voltage-dependent anion channel of the outer mitochondrial membrane. The voltage-dependent anion channel (VDAC) regulates the flux of mostly anionic metabolites through the outer mitochondrial membrane, which is highly permeable to small molecules. VDAC is the most abundant protein in the outer membrane, and membrane potentials can toggle VDAC between open or high-conducting and closed or low-conducting forms. VDAC binds to and is regulated in part by hexokinase, an interaction that renders mitochondria less susceptible to pro-apoptotic signals, most likely by intefering with VDAC's capability to respond to Bcl-2 family proteins. While VDAC appears to play a key role in mitochondrially induced cell death, a proposed involvement in forming the mitochondrial permeability transition pore, which is characteristic for damaged mitochondria and apoptosis, has been challenged by more recent studies. Probab=67.82 E-value=7.2 Score=17.09 Aligned_cols=16 Identities=13% Similarity=-0.056 Sum_probs=7.7 Q ss_pred ECCCCCEEEEEEEEEE Q ss_conf 0048867999999997 Q gi|254780598|r 170 GGLSESLVARLEYRAS 185 (205) Q Consensus 170 ~~it~~~~~r~EY~y~ 185 (205) +.++++|.+.+|-.|. T Consensus 190 ~~v~~~~~va~~~~~~ 205 (276) T cd07306 190 HKVSPRLAVGAKVTWY 205 (276) T ss_pred EECCCCEEEEEEEEEE T ss_conf 9938764899999995 No 114 >PRK11627 hypothetical protein; Provisional Probab=67.62 E-value=4.9 Score=17.95 Aligned_cols=16 Identities=19% Similarity=0.401 Sum_probs=9.3 Q ss_pred CHHHHHHHHHHHHHHHH Q ss_conf 94437899999999984 Q gi|254780598|r 1 MQKLFLAVGVSSLALAS 17 (205) Q Consensus 1 Mkk~~la~av~~lal~~ 17 (205) |||+++.++. +++|++ T Consensus 2 mkk~~~~l~a-~~lL~g 17 (192) T PRK11627 2 LKKILFPLVA-LFMLAG 17 (192) T ss_pred HHHHHHHHHH-HHHHHH T ss_conf 2879999999-999986 No 115 >pfam11336 DUF3138 Protein of unknown function (DUF3138). This family of proteins with unknown function appear to be restricted to Proteobacteria. Probab=67.51 E-value=3.3 Score=18.80 Aligned_cols=24 Identities=13% Similarity=0.207 Sum_probs=17.0 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEE Q ss_conf 9973466100488679999999975 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARLEYRASK 186 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~EY~y~d 186 (205) .+-+++-| ..|+.+++|+||||.. T Consensus 463 a~~~dLlf-ypTqq~TvK~EYRhDw 486 (514) T pfam11336 463 AITADLLF-YPTQQITVKVEYRHDW 486 (514) T ss_pred HHHHHHHH-CCCCCEEEEEEEEECC T ss_conf 42200310-4453158999985133 No 116 >pfam10836 DUF2574 Protein of unknown function (DUF2574). This family of proteins appears to be restricted to Enterobacteriaceae. Members of the family are annotated as yehE however currently no function is known. Probab=66.12 E-value=3.3 Score=18.84 Aligned_cols=26 Identities=31% Similarity=0.381 Sum_probs=20.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||-++. ..-.|+.+++.+++|.|.. T Consensus 1 Mkk~ll~-GiI~laYGls~P~faSdtA 26 (93) T pfam10836 1 MKKYLLM-GIIVLAYGLSSPVFASDTA 26 (93) T ss_pred CCCHHHH-HHHHHHHHCCCCCCCCCCE T ss_conf 9842435-4688865315642014616 No 117 >PRK10355 xylF D-xylose transporter subunit XylF; Provisional Probab=66.01 E-value=5.1 Score=17.86 Aligned_cols=27 Identities=15% Similarity=0.111 Sum_probs=16.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) ||..-++++++++.++++++|.|.+.. T Consensus 1 mk~k~~~~~~~~~~~~~~~~a~a~~~~ 27 (330) T PRK10355 1 MKIKNILLTLCAALLLTSVAAHAKEVK 27 (330) T ss_pred CCHHHHHHHHHHHHHHHCCHHHHCCCE T ss_conf 966899999999999844611213867 No 118 >pfam07383 DUF1496 Protein of unknown function (DUF1496). This family consists of several bacterial proteins of around 90 residues in length. Members of this family seem to be found exclusively in the Orders Vibrionales and Enterobacteriales. The function of this family is unknown. Probab=66.00 E-value=5.1 Score=17.86 Aligned_cols=31 Identities=23% Similarity=0.336 Sum_probs=19.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9443789999999998410000246654356 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAH 31 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~ 31 (205) ||++++....+.+++.+.+...++|.....| T Consensus 1 Mk~~~~~~~~~~~~~~~la~~~~~dv~~~~~ 31 (88) T pfam07383 1 MKRLLILCFAALLSLVALANSVAADVIVTPP 31 (88) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCEECCCC T ss_conf 9218999999999999861657886661699 No 119 >PRK10672 rare lipoprotein A; Provisional Probab=65.93 E-value=7.9 Score=16.88 Aligned_cols=20 Identities=30% Similarity=0.413 Sum_probs=16.5 Q ss_pred CHHHHHHHHHHHHHHHHCCC Q ss_conf 94437899999999984100 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCS 20 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~ 20 (205) |||-+|.+.+++++|+++.. T Consensus 1 mr~~wl~~~~~~~ll~~c~~ 20 (369) T PRK10672 1 MRKQWLGICIAAGLLAACTS 20 (369) T ss_pred CCHHHHHHHHHHHHHHHCCC T ss_conf 93335899999999865258 No 120 >pfam03249 TSA Type specific antigen. There are several antigenic variants in Rickettsia tsutsugamushi, and a type-specific antigen (TSA) of 56-kilodaltons located on the rickettsial surface is responsible for the variation. TSA proteins are probably integral membrane proteins. Probab=65.67 E-value=7.8 Score=16.89 Aligned_cols=75 Identities=27% Similarity=0.406 Sum_probs=26.8 Q ss_pred CCCEEEEEEEE--EEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEE-EECCCCCCCC--EE--EEEEE Q ss_conf 27677898854--477644431345556775204799734661004886799999999-7505655662--26--99999 Q gi|254780598|r 129 SDSLLLYATVG--PDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRA-SKYSKVEGFY--NT--ISLGV 201 (205) Q Consensus 129 ~~~~l~Y~~~G--~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y-~df~~~~~~~--~~--~~~G~ 201 (205) +++|.+|+-.| +|++.-+.+...-. -.+..-.+--+||-.+ ..+.+-+.+|=-| +.|++++-++ |. .++|| T Consensus 443 teSfSiYaG~gaGlAyT~gkID~kdiK-~htGmVaSgalgvain-aaegvyvdieggymhsfskieekysvnalmas~gv 520 (524) T pfam03249 443 TESFSIYAGVGAGLAYTSGKIDGKDIK-AHTGMVASGALGVAIN-AAEGVYVDIEGGYMHSFSKIEEKYSVNALMASVGV 520 (524) T ss_pred CCEEEEEECCCCCCEEECCCCCCCCCC-CCCCEEEECCHHEEEE-CCCEEEEEECCCEEEEHHHHHHHHCHHHHHHHHCC T ss_conf 131567622464212321554674334-5443253042013662-23506996205423302445454044478774040 Q ss_pred EEEC Q ss_conf 8539 Q gi|254780598|r 202 GMKF 205 (205) Q Consensus 202 ~ykF 205 (205) .|.| T Consensus 521 rynf 524 (524) T pfam03249 521 RYNF 524 (524) T ss_pred CCCC T ss_conf 2469 No 121 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=65.34 E-value=6 Score=17.48 Aligned_cols=46 Identities=15% Similarity=0.073 Sum_probs=23.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9443789999999998410000246654356666655563455444444 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHHGGRGVVPTIATNRYVPI 49 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~~~~~~~~~~~~~~~~~~ 49 (205) |||+++.+.+.++++++ +...-..++..... +...+.+...+.+|. T Consensus 1 Mkk~~~~~~~~~~~~SC-g~~~~GELvGv~~~--~w~~~~PyGMV~IPg 46 (449) T TIGR03525 1 MKKYLVFAALVVLVYSC-GSGDKGELVGVEGK--KWYPEKPYGMVLVPG 46 (449) T ss_pred CCHHHHHHHHHHHHHHC-CCCCCCEEEEECCC--CCCCCCCCEEEEECC T ss_conf 91248999999987210-58997138731487--557899983589668 No 122 >PRK12788 flgH flagellar basal body L-ring protein; Reviewed Probab=64.76 E-value=8.3 Score=16.76 Aligned_cols=18 Identities=22% Similarity=0.143 Sum_probs=11.5 Q ss_pred CHHHHHHHHHHHHHHHHC Q ss_conf 944378999999999841 Q gi|254780598|r 1 MQKLFLAVGVSSLALASF 18 (205) Q Consensus 1 Mkk~~la~av~~lal~~~ 18 (205) ||.+++.+++.+|+-|+. T Consensus 1 mr~l~~~~a~l~LsGCa~ 18 (231) T PRK12788 1 MRLLVAILACLALAGCAN 18 (231) T ss_pred CCHHHHHHHHHHHHHHCC T ss_conf 908999999999764038 No 123 >pfam12099 DUF3575 Protein of unknown function (DUF3575). This family of proteins are functionally uncharacterized. This family is only found in bacteria. Proteins in this family are typically between 187 to 236 amino acids in length. Probab=64.34 E-value=8.5 Score=16.72 Aligned_cols=31 Identities=26% Similarity=0.315 Sum_probs=17.1 Q ss_pred CEEEEEEEEEEEE---CCCCCEEEEEE----EEEEECC Q ss_conf 2047997346610---04886799999----9997505 Q gi|254780598|r 158 AIGGTAGVGVEVG---GLSESLVARLE----YRASKYS 188 (205) Q Consensus 158 ~~g~~~G~Gvey~---~it~~~~~r~E----Y~y~df~ 188 (205) --|+.+|+|+.|. .|+++|.+-++ |.++++. T Consensus 117 yqG~~~g~GisyGY~~~Ls~~wnlE~~iG~Gy~~~~Yd 154 (186) T pfam12099 117 YQGWLYGGGISYGYQWPLSRRWNLEFSIGLGYLHLEYD 154 (186) T ss_pred CCCEEEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEEE T ss_conf 03156836799999879679478999999658986870 No 124 >pfam11777 DUF3316 Protein of unknown function (DUF3316). This family of bacterial proteins has no known function. Several members are, however, annotated as being putative acyl-CoA synthetase, but this could not be confirmed. Probab=64.17 E-value=3.6 Score=18.61 Aligned_cols=21 Identities=38% Similarity=0.401 Sum_probs=13.0 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||+++++.+ +++ +.+|+|.. T Consensus 1 mKk~~ll~~~--ll~--s~~AfA~~ 21 (114) T pfam11777 1 MKKLILLALL--LLL--SATAFAGN 21 (114) T ss_pred CHHHHHHHHH--HHH--HHHHHHCC T ss_conf 9108999999--999--88887555 No 125 >COG4991 Uncharacterized protein with a bacterial SH3 domain homologue [Function unknown] Probab=64.05 E-value=7.4 Score=17.01 Aligned_cols=24 Identities=29% Similarity=0.182 Sum_probs=15.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 944378999999999841000024 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAA 24 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aa 24 (205) |+|.++.+.+..+++.+.+.|+|+ T Consensus 10 ~~~~~~~A~a~~~~l~~~~~a~aa 33 (155) T COG4991 10 SMKTLMRASAFGLALLMPAAAYAA 33 (155) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 677899999999998747888887 No 126 >PRK10877 thiol:disulfide interchange protein DsbC; Provisional Probab=63.61 E-value=3.2 Score=18.91 Aligned_cols=25 Identities=28% Similarity=0.260 Sum_probs=16.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||.++++.+.++++ ++.|+|.+.. T Consensus 1 mkk~l~~~~~l~~~~--s~~a~aDea~ 25 (232) T PRK10877 1 MKKGFMLFTLLAAAF--SGFAHADDAA 25 (232) T ss_pred CCHHHHHHHHHHHHH--HHHHHCCHHH T ss_conf 926699999999998--6252437999 No 127 >COG3700 AphA Acid phosphatase (class B) [General function prediction only] Probab=63.42 E-value=8.1 Score=16.82 Aligned_cols=27 Identities=26% Similarity=0.208 Sum_probs=20.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||+..++.+..|..+.+.+|.|-... T Consensus 1 Mkki~~als~v~ll~~~a~pa~A~g~s 27 (237) T COG3700 1 MKKITQALSAVCLLFALASPAVALGSS 27 (237) T ss_pred CHHHHHHHHHHHHHHHHCCHHHHCCCC T ss_conf 914889999999999862616640799 No 128 >pfam06291 Lambda_Bor Bor protein. This family consists of several Bacteriophage lambda Bor and Escherichia coli Iss proteins. Expression of bor significantly increases the survival of the Escherichia coli host cell in animal serum. This property is a well known bacterial virulence determinant indeed, bor and its adjacent sequences are highly homologous to the iss serum resistance locus of the plasmid ColV2-K94, which confers virulence in animals. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=63.34 E-value=4.3 Score=18.25 Aligned_cols=23 Identities=26% Similarity=0.605 Sum_probs=13.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||++++++++ ++|++ ++.+... T Consensus 1 MKk~~~~~~~a-llLtg-Ca~QT~~ 23 (97) T pfam06291 1 MKKMLFAAALA-LLITG-CAQQTFT 23 (97) T ss_pred CHHHHHHHHHH-HHHCC-CCCEEEE T ss_conf 92259999999-99721-3304999 No 129 >PRK10653 D-ribose transporter subunit RbsB; Provisional Probab=63.18 E-value=5.1 Score=17.84 Aligned_cols=26 Identities=31% Similarity=0.290 Sum_probs=20.0 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |||+..++....++.+..++|.|.+. T Consensus 3 ~~k~~~~~~~~~~~~~~~a~a~a~~T 28 (295) T PRK10653 3 MKKLATLVSAVALSATVSANAMAKDT 28 (295) T ss_pred HHHHHHHHHHHHHHHHCCCHHHCCCE T ss_conf 07999999999876521217752997 No 130 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=63.10 E-value=4.8 Score=17.97 Aligned_cols=22 Identities=23% Similarity=0.388 Sum_probs=18.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCC Q ss_conf 9443789999999998410000 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQ 22 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~ 22 (205) |||++..+.+++|+|++++.-. T Consensus 1 MkK~i~~il~~~llL~GCs~mn 22 (304) T pfam07901 1 MRKLIKLILIATLLLSGCSTTN 22 (304) T ss_pred CCHHHHHHHHHHHHHCCCCCCC T ss_conf 9018999999999971334455 No 131 >pfam05538 Campylo_MOMP Campylobacter major outer membrane protein. This family consists of Campylobacter major outer membrane proteins. The major outer membrane protein (MOMP), a putative porin and a multifunction surface protein of Campylobacter jejuni, may play an important role in the adaptation of the organism to various host environments. Probab=62.79 E-value=9.1 Score=16.56 Aligned_cols=14 Identities=21% Similarity=0.197 Sum_probs=10.8 Q ss_pred CCCEEEEEEEEEEC Q ss_conf 66226999998539 Q gi|254780598|r 192 GFYNTISLGVGMKF 205 (205) Q Consensus 192 ~~~~~~~~G~~ykF 205 (205) .+.|++|+-+-|+| T Consensus 424 ~~~d~~R~eAkYsF 437 (437) T pfam05538 424 ADHDAVRLQALYKF 437 (437) T ss_pred CCCCEEEEEEEECC T ss_conf 45432899998439 No 132 >COG2829 PldA Outer membrane phospholipase A [Cell envelope biogenesis, outer membrane] Probab=61.88 E-value=9.5 Score=16.47 Aligned_cols=76 Identities=16% Similarity=0.273 Sum_probs=43.5 Q ss_pred CCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCC--CC Q ss_conf 3214899989754312767789885447764443134555677520479973466100488679999999975056--55 Q gi|254780598|r 114 HGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKAGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSK--VE 191 (205) Q Consensus 114 ~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~--~~ 191 (205) .++++-...|++|..+|....+- |+ +.+. ++.-| .+-+|.-| .|++++-.-+.| +.-+|. +| T Consensus 236 ~~YMGy~dvkl~Y~~~d~~f~l~--~~---~yn~--------~t~~G-a~e~~ysY-pI~~~v~~Y~Q~-fnGYGEsLiD 299 (317) T COG2829 236 TKYMGYGDVKLGYRLGDHQFSLS--GY---RYNP--------NTGYG-AIELGYSY-PITKHVRLYTQV-FNGYGESLID 299 (317) T ss_pred HHHCCCCCEEEEEECCCCEEEEE--EE---EECC--------CCCCC-EEEEEECC-CCCCCEEEEEEE-ECCCCCHHHH T ss_conf 87526651363364165057765--35---7146--------78875-06875012-135860899998-5267720432 Q ss_pred CCCEEEEEEEEEEC Q ss_conf 66226999998539 Q gi|254780598|r 192 GFYNTISLGVGMKF 205 (205) Q Consensus 192 ~~~~~~~~G~~ykF 205 (205) -+.+.-|||+|.++ T Consensus 300 Yn~~~~RiGvGi~l 313 (317) T COG2829 300 YNFNQNRIGVGIML 313 (317) T ss_pred HHHCCCCEEEEEEE T ss_conf 12102423577982 No 133 >pfam12276 DUF3617 Protein of unknown function (DUF3617). This family of proteins is found in bacteria. Proteins in this family are typically between 155 and 179 amino acids in length. There is a single completely conserved residue C that may be functionally important. Probab=61.80 E-value=7.2 Score=17.09 Aligned_cols=17 Identities=29% Similarity=0.340 Sum_probs=10.6 Q ss_pred CHHHHHHHHHHHHHHHH Q ss_conf 94437899999999984 Q gi|254780598|r 1 MQKLFLAVGVSSLALAS 17 (205) Q Consensus 1 Mkk~~la~av~~lal~~ 17 (205) |||.++.+++.++++.. T Consensus 1 M~~~~~~~~~~~~~~~~ 17 (149) T pfam12276 1 MKRLLLLALLLLLLLLA 17 (149) T ss_pred CCHHHHHHHHHHHHHHH T ss_conf 95057899999999862 No 134 >PRK12407 flgH flagellar basal body L-ring protein; Reviewed Probab=61.60 E-value=9.6 Score=16.44 Aligned_cols=16 Identities=19% Similarity=0.403 Sum_probs=7.5 Q ss_pred HHHHHHHHHHHHHHHHC Q ss_conf 44378999999999841 Q gi|254780598|r 2 QKLFLAVGVSSLALASF 18 (205) Q Consensus 2 kk~~la~av~~lal~~~ 18 (205) ||+++.+++ +++|+++ T Consensus 1 kr~l~l~~~-~l~LsGC 16 (220) T PRK12407 1 KRFLILTPM-VLALCGC 16 (220) T ss_pred CCCHHHHHH-HHHHHCC T ss_conf 915799999-9997365 No 135 >PRK09465 tolC outer membrane channel protein; Reviewed Probab=61.29 E-value=5.2 Score=17.82 Aligned_cols=29 Identities=21% Similarity=0.033 Sum_probs=17.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 944378999999999841000024665435 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRA 30 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~ 30 (205) |||++..+.. .++.+.+++|+|.++.... T Consensus 1 mkki~~~l~~-l~l~~~~~~a~a~tL~eal 29 (446) T PRK09465 1 MKKLLPLLIG-LSLAGFSSASQAEDLLQVY 29 (446) T ss_pred CCHHHHHHHH-HHHHHCCHHCHHCCHHHHH T ss_conf 9429999999-9998613112122099999 No 136 >PRK13484 putative iron-regulated outer membrane virulence protein; Provisional Probab=60.91 E-value=9.9 Score=16.37 Aligned_cols=85 Identities=8% Similarity=0.167 Sum_probs=34.1 Q ss_pred EEEEEEEEECCCEEEEEEEEEEEEEEEEC-CCCCCCCCCCEEEE-EEEEEEEECCCCCEEEEEEEE------EEECCCCC Q ss_conf 99897543127677898854477644431-34555677520479-973466100488679999999------97505655 Q gi|254780598|r 120 LRIRGGYEVSDSLLLYATVGPDVAQKYET-GKAGEITPIAIGGT-AGVGVEVGGLSESLVARLEYR------ASKYSKVE 191 (205) Q Consensus 120 ~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~-~~~~~~~~~~~g~~-~G~Gvey~~it~~~~~r~EY~------y~df~~~~ 191 (205) +.+++.|.+.+.+.+++...+.-.+.... .+.........+|+ +-+.+.| .++++|.+++.-+ |.+..... T Consensus 585 ~~~~~~~~~~~~~~~~~~~~y~g~~~~~~~~~~~~~~~~~~~y~~~dl~~~Y-~~~~~~~l~~~V~NLfDk~y~~~~~~~ 663 (682) T PRK13484 585 ANAKLEWDYTQDITFYSSLNYTGKQIWAAQRNGAKVPRVRNGFTSMDIGLNY-QILPDTLINFAVLNVTDRKSEDIDTID 663 (682) T ss_pred EEEEEEEEECCCEEEEEEEEEEEEEEEECCCCCCCCCEECCCEEEEEEEEEE-ECCCCCEEEEEEECCCCCCEEEEECCC T ss_conf 8999999975772698889998667770347776564202777999879999-879990999999977488611440457 Q ss_pred -----CCCEEEEEEEEEEC Q ss_conf -----66226999998539 Q gi|254780598|r 192 -----GFYNTISLGVGMKF 205 (205) Q Consensus 192 -----~~~~~~~~G~~ykF 205 (205) ..--++.++|.|+| T Consensus 664 ~~~~~~~gR~~~~~v~y~F 682 (682) T PRK13484 664 GNWQVDEGRRYWANVRVSF 682 (682) T ss_pred CCCCCCCCCEEEEEEEEEC T ss_conf 7472389969999999989 No 137 >PRK06763 F0F1 ATP synthase subunit alpha; Validated Probab=60.51 E-value=1.8 Score=20.14 Aligned_cols=18 Identities=33% Similarity=0.377 Sum_probs=13.6 Q ss_pred CHHHHHHHHHHHHHHHHC Q ss_conf 944378999999999841 Q gi|254780598|r 1 MQKLFLAVGVSSLALASF 18 (205) Q Consensus 1 Mkk~~la~av~~lal~~~ 18 (205) |||++||-|+..++|++. T Consensus 3 MKKvILAgALGiAALaGT 20 (213) T PRK06763 3 MKKVILAGALGIAALSGT 20 (213) T ss_pred CCEEEEECHHHHHHHHCC T ss_conf 513888130333545348 No 138 >PRK09501 potD spermidine/putrescine ABC transporter periplasmic substrate-binding protein; Reviewed Probab=60.19 E-value=7.2 Score=17.08 Aligned_cols=25 Identities=20% Similarity=0.139 Sum_probs=14.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||+.+.+..+.++..+.+++.|++ T Consensus 1 ~~~~~~~ll~~~a~~l~~~~~~a~~ 25 (348) T PRK09501 1 MKKWSRHLLAAGALALGMSAAHADD 25 (348) T ss_pred CHHHHHHHHHHHHHHHHCCHHHHCC T ss_conf 9368999999999997350335433 No 139 >COG3678 CpxP P pilus assembly/Cpx signaling pathway, periplasmic inhibitor/zinc-resistance associated protein [Intracellular trafficking and secretion / Cell motility and secretio / Signal transduction mechanisms / Inorganic ion transport and metabolism] Probab=59.96 E-value=6.4 Score=17.36 Aligned_cols=31 Identities=35% Similarity=0.389 Sum_probs=21.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9443789999999998410000246654356 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAH 31 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~ 31 (205) ||+...++..++++++++..+++|......+ T Consensus 2 mr~~~~~v~a~al~~~a~~la~~Aa~~~~~~ 32 (160) T COG3678 2 MRKVTAAVGAAALALTASALAHAAATAAAPP 32 (160) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 3368999999999865789998887435899 No 140 >PRK10564 maltose regulon periplasmic protein; Provisional Probab=59.96 E-value=8.5 Score=16.71 Aligned_cols=30 Identities=30% Similarity=0.193 Sum_probs=19.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 944378999999999841000024665435 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRA 30 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~ 30 (205) |||.++++.+++.++...-.+--||+...+ T Consensus 1 m~k~l~~~~l~~~l~~~~~~~~~a~~~~~~ 30 (303) T PRK10564 1 MKKSLIALCLSAGLLASVPGISLADVNYVP 30 (303) T ss_pred CCHHHHHHHHHHHHHHCCCCHHHCCCCCCC T ss_conf 940099999998886058620111663246 No 141 >PRK12395 maltoporin; Provisional Probab=59.90 E-value=5.8 Score=17.55 Aligned_cols=25 Identities=20% Similarity=0.078 Sum_probs=18.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) ||..++.+++++++.+.+.+|+|+. T Consensus 1 m~~~~~~~~~a~~~~~~~~~~~a~~ 25 (419) T PRK12395 1 MKTSLRTLSVALAAALVSPSVLAIE 25 (419) T ss_pred CCCCHHHHHHHHHHHHHHHHCCHHH T ss_conf 9612357899999998642113211 No 142 >PRK10044 ferrichrome outer membrane transporter; Provisional Probab=59.56 E-value=10 Score=16.25 Aligned_cols=25 Identities=12% Similarity=0.298 Sum_probs=17.1 Q ss_pred EEEEEEEEEECCCCCEEEEEEEEEEE Q ss_conf 79973466100488679999999975 Q gi|254780598|r 161 GTAGVGVEVGGLSESLVARLEYRASK 186 (205) Q Consensus 161 ~~~G~Gvey~~it~~~~~r~EY~y~d 186 (205) ..++.-+|. .+++.|+++..++|.+ T Consensus 313 ~~~~~~~~~-~~~d~~t~~~~~~y~~ 337 (730) T PRK10044 313 KMVGYSFDH-EFNDTFTVRQNLRFAE 337 (730) T ss_pred EEEEEEEEE-EECCCEEEEEEEEEEE T ss_conf 999999999-7079879999999865 No 143 >cd01345 OM_channels Porin superfamily. These outer membrane channels share a beta-barrel structure that differ in strand and shear number. Classical (gram-negative ) porins are non-specific channels for small hydrophillic molecules and form 16 beta-stranded barrels (16,20), which associate as trimers. Maltoporin-like channels have specificities for various sugars and form 18 beta-stranded barrels (18,22), which associate as trimers. Ligand-gated protein channels cooperate with a TonB associated inner membrane complex to actively transport ligands via the proton motive force and they form monomeric, (22,24) barrels. The 150-200 N-terminal residues form a plug that blocks the channel from the periplasmic end. Probab=58.86 E-value=11 Score=16.18 Aligned_cols=22 Identities=9% Similarity=0.193 Sum_probs=18.4 Q ss_pred EEEEEEEEECCCCCEEEEEEEEE Q ss_conf 99734661004886799999999 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARLEYRA 184 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~EY~y 184 (205) .+++|+.| .+++++++++.+.- T Consensus 191 ~~~i~~~Y-~~nkn~~l~~Gv~N 212 (253) T cd01345 191 IVGLSATW-DVTKNVSLTGGYDN 212 (253) T ss_pred EEEECCEE-EECCCEEEEECCCC T ss_conf 88403158-86255588402020 No 144 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=58.85 E-value=11 Score=16.18 Aligned_cols=17 Identities=29% Similarity=0.557 Sum_probs=6.4 Q ss_pred CHHHHHHHHHHHHHHHHCC Q ss_conf 9443789999999998410 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFC 19 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~ 19 (205) ||.+++++++ ++|++++ T Consensus 7 ~K~l~ia~~~--l~LaACS 23 (173) T PRK10802 7 LKGLMIALPV--MAIAACS 23 (173) T ss_pred HHHHHHHHHH--HHHHHCC T ss_conf 9999999999--9998557 No 145 >pfam06649 DUF1161 Protein of unknown function (DUF1161). This family consists of several short, hypothetical bacterial proteins of unknown function. Probab=58.77 E-value=4.9 Score=17.92 Aligned_cols=26 Identities=42% Similarity=0.424 Sum_probs=15.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 944378999999999841000024665435 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRA 30 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~ 30 (205) |||.++++++..++ .++.||.-..+. T Consensus 1 Mkk~~l~~~l~lla----~~alAA~~sCE~ 26 (75) T pfam06649 1 MKRLLLAVALLLLA----ASALAAPKSCEE 26 (75) T ss_pred CCHHHHHHHHHHHH----HHHHHCCCCHHH T ss_conf 92469999999971----345315588899 No 146 >pfam12262 Lipase_bact_N Bacterial virulence factor lipase N-terminal. This domain family is found in bacteria, and is typically between 258 and 271 amino acids in length. There are two conserved sequence motifs: DGT and DGWST. This family is the N-terminal region of bacterial virulence factor lipase. The N-terminal region contains a potential signalling sequence. Probab=57.56 E-value=6.9 Score=17.17 Aligned_cols=16 Identities=44% Similarity=0.484 Sum_probs=12.0 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9443789999999998 Q gi|254780598|r 1 MQKLFLAVGVSSLALA 16 (205) Q Consensus 1 Mkk~~la~av~~lal~ 16 (205) |||+++.+++++.+.. T Consensus 1 MKk~~l~~~i~SAL~L 16 (269) T pfam12262 1 MKKKFLALLLASALLL 16 (269) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 9247999999998641 No 147 >TIGR03502 lipase_Pla1_cef extracellular lipase, Pla-1/cef family. Members of this protein family are bacterial lipoproteins largely from the Gammaproteobacteria. Characterized members are expressed in extracellularly and have esterase activity. Members include the lipase Pla-1 from Aeromonas hydrophila (AF092033) and CHO cell elongation factor (cef) from Vibrio hollisae Probab=55.95 E-value=7.8 Score=16.89 Aligned_cols=18 Identities=39% Similarity=0.600 Sum_probs=13.5 Q ss_pred CHHHHHHHHHHHHH-HHHC Q ss_conf 94437899999999-9841 Q gi|254780598|r 1 MQKLFLAVGVSSLA-LASF 18 (205) Q Consensus 1 Mkk~~la~av~~la-l~~~ 18 (205) |||+++.+++++.+ |+++ T Consensus 1 MKk~~l~~~iasal~LagC 19 (792) T TIGR03502 1 MKKLLLSLAIASALGLAGC 19 (792) T ss_pred CCHHHHHHHHHHHHHHHCC T ss_conf 9337899999998750045 No 148 >PRK09915 putative outer membrane efflux protein MdtP; Provisional Probab=53.24 E-value=14 Score=15.67 Aligned_cols=21 Identities=29% Similarity=0.414 Sum_probs=13.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCC Q ss_conf 944378999999999841000 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSA 21 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A 21 (205) ||+++++..+++.++.+.|+. T Consensus 6 ~~r~~~~~~l~~~~lLagC~~ 26 (488) T PRK09915 6 LSRLLLCSILGSTTLISGCAL 26 (488) T ss_pred HHHHHHHHHHHHHHHHCCCCC T ss_conf 769999999999999752668 No 149 >PRK10449 heat-inducible protein; Provisional Probab=52.51 E-value=10 Score=16.29 Aligned_cols=17 Identities=29% Similarity=0.352 Sum_probs=9.9 Q ss_pred CHHHHHHHHHHHHHHHHC Q ss_conf 944378999999999841 Q gi|254780598|r 1 MQKLFLAVGVSSLALASF 18 (205) Q Consensus 1 Mkk~~la~av~~lal~~~ 18 (205) |||++..++++. ++++. T Consensus 1 MKk~l~l~~~~l-lLagC 17 (140) T PRK10449 1 MKKVVALVALSL-LMAGC 17 (140) T ss_pred CCHHHHHHHHHH-HHHHC T ss_conf 903499999999-99752 No 150 >PRK11459 multidrug resistance outer membrane protein MdtQ; Provisional Probab=52.28 E-value=14 Score=15.59 Aligned_cols=28 Identities=18% Similarity=0.062 Sum_probs=15.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 9443789999999998410000246654 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVR 28 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~ 28 (205) |||..+--.+++|+++..-+++|..+.. T Consensus 1 ~~~~~f~~~~a~l~~~llLaGCa~~~~~ 28 (478) T PRK11459 1 MNRDSFYAATASLPLFILLAGCAPMHET 28 (478) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 9974058889999999998378899999 No 151 >COG4314 NosL Predicted lipoprotein involved in nitrous oxide reduction [Energy production and conversion] Probab=51.95 E-value=14 Score=15.59 Aligned_cols=23 Identities=26% Similarity=0.296 Sum_probs=16.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 94437899999999984100002 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQA 23 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~A 23 (205) ||+.++++....+++++...+.- T Consensus 1 mkr~Lla~la~~~llAgC~~~ed 23 (176) T COG4314 1 MKRTLLAILAVTALLAGCRQAED 23 (176) T ss_pred CCHHHHHHHHHHHHHHHCCHHHC T ss_conf 94037999999999875343113 No 152 >pfam12071 DUF3551 Protein of unknown function (DUF3551). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 79 to 104 amino acids in length. This protein has a single completely conserved residue C that may be functionally important. Probab=51.15 E-value=9.9 Score=16.38 Aligned_cols=11 Identities=36% Similarity=0.329 Sum_probs=8.2 Q ss_pred CHHHHHHHHHH Q ss_conf 94437899999 Q gi|254780598|r 1 MQKLFLAVGVS 11 (205) Q Consensus 1 Mkk~~la~av~ 11 (205) ||+++++..+. T Consensus 1 MR~~~~a~~a~ 11 (82) T pfam12071 1 MRGLLLAALAL 11 (82) T ss_pred CHHHHHHHHHH T ss_conf 94799999999 No 153 >pfam11853 DUF3373 Protein of unknown function (DUF3373). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length. Probab=51.02 E-value=15 Score=15.48 Aligned_cols=54 Identities=19% Similarity=0.091 Sum_probs=28.0 Q ss_pred CEEEEEEEEEEEEEEEECCC-------CCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEE Q ss_conf 67789885447764443134-------5556775204799734661004886799999999 Q gi|254780598|r 131 SLLLYATVGPDVAQKYETGK-------AGEITPIAIGGTAGVGVEVGGLSESLVARLEYRA 184 (205) Q Consensus 131 ~~l~Y~~~G~a~~~~~~~~~-------~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y 184 (205) ++-.++..|+..++-..... .-...++++||.+=+|++|..+++.--+++||.| T Consensus 332 d~~~F~S~a~S~t~Pn~~~~~~~~~~~~l~~~~sktG~sv~vG~~~p~~~~~~K~G~EYN~ 392 (485) T pfam11853 332 NLNAFVSFGWSKTHPNGNVSANAGFQGMLGSTDSKTGYSVYVGARYDLLPSGTKFGLEYNH 392 (485) T ss_pred CEEEEEEEEEEECCCCCCCCCHHHHHHCCCCCCCCCEEEEEEEEECCCCCCCCEEEEEEEC T ss_conf 6448998888850589875501567752289989860599996414677878657789615 No 154 >pfam03895 YadA YadA-like C-terminal region. This region represents the C-terminal 120 amino acids of a family of surface-exposed bacterial proteins. YadA, an adhesin from Yersinia, was the first member of this family to be characterized. UspA2 from Moraxella was second. The Eib immunoglobulin-binding proteins from E. coli were third, followed by the DsrA proteins of Haemophilus ducreyi and others. These proteins are homologous at their C-terminal and have predicted signal sequences, but they diverge elsewhere. The C-terminal 9 amino acids, consisting of alternating hydrophobic amino acids ending in F or W, comprise a targeting motif for the outer membrane of the Gram negative cell envelope. This region is important for oligomerisation. Probab=50.95 E-value=15 Score=15.47 Aligned_cols=38 Identities=21% Similarity=0.223 Sum_probs=22.2 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEEEEEC Q ss_conf 99734661004886799999999750565566226999998539 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARLEYRASKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~~~~~~~G~~ykF 205 (205) .+.+|+.+ .+++||.+|+.--+.. . ..-.+.+|++|.| T Consensus 41 A~AvG~s~-~~n~~~~~~~~~s~~s----~-~~~~~gaGv~~~w 78 (78) T pfam03895 41 AVAVGASY-RSNGNVVVKLGVSTSS----G-GDVGGGAGVGYQW 78 (78) T ss_pred EEEEEEEE-EECCCEEEEEEEEECC----C-CCEEEEEEEEEEC T ss_conf 89999889-8779899999898718----9-9879997764589 No 155 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=50.75 E-value=15 Score=15.46 Aligned_cols=48 Identities=17% Similarity=0.220 Sum_probs=22.3 Q ss_pred CCEEEEEEEEEEEECCCCCEEEEEEEEE--EECCCCCCCCEEEEEEEEEEC Q ss_conf 5204799734661004886799999999--750565566226999998539 Q gi|254780598|r 157 IAIGGTAGVGVEVGGLSESLVARLEYRA--SKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 157 ~~~g~~~G~Gvey~~it~~~~~r~EY~y--~df~~~~~~~~~~~~G~~ykF 205 (205) +...|.++-|=+. .+++.|.+..+-.+ .-+.-..-..+.+.+++++|| T Consensus 769 ~~~~w~l~Y~h~w-~~~~~~~~~ygv~~~k~~YDG~rE~~~~~~~~l~~rF 818 (818) T PRK10049 769 TGVVTTLGYGQRV-SWNDVIDTGATLRWDKRPYDGDREHNLYLSFDLNYRF 818 (818) T ss_pred CCCCCEEECEEEE-EECCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEECC T ss_conf 7865202011489-9767144577887436888998664747999988439 No 156 >PRK11553 alkanesulfonate transporter substrate-binding subunit; Provisional Probab=50.65 E-value=14 Score=15.65 Aligned_cols=25 Identities=12% Similarity=0.215 Sum_probs=16.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) ||+++.+...++++++....+.++. T Consensus 1 ~~~~l~~~la~~~~~~~~~~~~~~~ 25 (314) T PRK11553 1 MRNIIKLALAGLLSVSTLAVAAESS 25 (314) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9779999999999998754443348 No 157 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=49.39 E-value=13 Score=15.69 Aligned_cols=18 Identities=39% Similarity=0.409 Sum_probs=10.6 Q ss_pred CHHHHHHHHH---HHHHHHHC Q ss_conf 9443789999---99999841 Q gi|254780598|r 1 MQKLFLAVGV---SSLALASF 18 (205) Q Consensus 1 Mkk~~la~av---~~lal~~~ 18 (205) |||.+++.++ ++++|+++ T Consensus 1 MKKKl~~~~~~~~svl~LaaC 21 (310) T PRK01326 1 MKKKLIAGAVTLLSVATLAAC 21 (310) T ss_pred CCHHHHHHHHHHHHHHHHHHH T ss_conf 908899999999999999871 No 158 >PRK10894 hypothetical protein; Provisional Probab=49.13 E-value=11 Score=16.04 Aligned_cols=26 Identities=27% Similarity=0.097 Sum_probs=14.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |+++.|.++.++|+++.++-|.-+|- T Consensus 7 ~~~~~l~l~~~ll~~S~~a~Al~sD~ 32 (184) T PRK10894 7 KLSLNLLLASSLLAASIPAFAVTGDT 32 (184) T ss_pred CCHHHHHHHHHHHHHCCHHHHCCCCC T ss_conf 42066999999987161243155344 No 159 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=48.42 E-value=12 Score=15.98 Aligned_cols=15 Identities=40% Similarity=0.452 Sum_probs=10.2 Q ss_pred CHHHHHHHHHHHHHH Q ss_conf 944378999999999 Q gi|254780598|r 1 MQKLFLAVGVSSLAL 15 (205) Q Consensus 1 Mkk~~la~av~~lal 15 (205) |||++++++.+++++ T Consensus 5 MKK~~l~~~~~~~~l 19 (298) T PRK04405 5 MKKWALAAASAGLLL 19 (298) T ss_pred HHHHHHHHHHHHHHH T ss_conf 899999999999999 No 160 >COG5455 Predicted integral membrane protein [Function unknown] Probab=48.33 E-value=14 Score=15.54 Aligned_cols=13 Identities=31% Similarity=0.245 Sum_probs=7.7 Q ss_pred CHHHHHHHHHHHH Q ss_conf 9443789999999 Q gi|254780598|r 1 MQKLFLAVGVSSL 13 (205) Q Consensus 1 Mkk~~la~av~~l 13 (205) |||+++++..++. T Consensus 2 mkrlv~~lla~s~ 14 (129) T COG5455 2 MKRLVSALLAVSS 14 (129) T ss_pred CHHHHHHHHHHHH T ss_conf 1438999999998 No 161 >PRK10290 superoxide dismutase; Provisional Probab=48.12 E-value=11 Score=16.05 Aligned_cols=22 Identities=36% Similarity=0.442 Sum_probs=12.0 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |||+.++++. |+++ +.++|+.. T Consensus 1 mk~~~~~~~~--l~~~--~~~~a~~~ 22 (173) T PRK10290 1 MKRFSLAILA--LVVC--TGAQAASE 22 (173) T ss_pred CCHHHHHHHH--HHHH--HHHHHCCC T ss_conf 9107999999--9997--66651446 No 162 >PRK01904 hypothetical protein; Provisional Probab=47.92 E-value=11 Score=16.23 Aligned_cols=27 Identities=15% Similarity=-0.005 Sum_probs=17.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 94437899999999984100002466543 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRR 29 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~ 29 (205) ||....+++ +++|+.+++++|+.+..+ T Consensus 1 MK~~~~~~a--l~~l~~S~~a~A~tL~lp 27 (217) T PRK01904 1 MKLRAAALA--VATLLTSTASFAGMVTTS 27 (217) T ss_pred CCCCHHHHH--HHHHHHHHHHHHHHCCCC T ss_conf 970079999--999986089887514078 No 163 >COG3767 Uncharacterized low-complexity protein [Function unknown] Probab=47.87 E-value=12 Score=16.02 Aligned_cols=24 Identities=38% Similarity=0.310 Sum_probs=14.0 Q ss_pred CHHHHHH-HHHHHHHHHHCCCCCCC Q ss_conf 9443789-99999999841000024 Q gi|254780598|r 1 MQKLFLA-VGVSSLALASFCSAQAA 24 (205) Q Consensus 1 Mkk~~la-~av~~lal~~~~~A~Aa 24 (205) |||+.++ ..+.++.|+.+..++|. T Consensus 1 mkk~~~~~alagA~~l~~a~~~~a~ 25 (95) T COG3767 1 MKKLATAAALAGALLLGAAAVAQAE 25 (95) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9037889999878999988888987 No 164 >COG4851 CamS Protein involved in sex pheromone biosynthesis [General function prediction only] Probab=47.43 E-value=14 Score=15.65 Aligned_cols=25 Identities=24% Similarity=0.191 Sum_probs=13.2 Q ss_pred CHHHHHH-HHHHHHHHHHCCCCCCCCC Q ss_conf 9443789-9999999984100002466 Q gi|254780598|r 1 MQKLFLA-VGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la-~av~~lal~~~~~A~Aad~ 26 (205) |||.+.. +.+++|.|+++. -+-.|. T Consensus 1 Mkktl~i~~ta~vliLs~C~-~~~dd~ 26 (382) T COG4851 1 MKKTLGIAATASVLILSGCF-PFVDDT 26 (382) T ss_pred CCHHHHHHHHHHHHHHHHCC-CCCCCC T ss_conf 94015589999999986116-766873 No 165 >TIGR01672 AphA HAD superfamily (subfamily IIIB) phosphatase, TIGR01672; InterPro: IPR010025 This family of proteins is a member of the IIIB subfamily (IPR001001 from INTERPRO) of the haloacid dehalogenase (HAD) superfamily of hydrolases. All characterised members of subfamily III and most characterised members of the HAD superfamily are phosphatases. HAD superfamily phosphatases contain active site residues in several conserved catalytic motifs , all of which are found conserved in this family. AphA is a periplasmic acid phosphatase of Escherichia coli belonging to class B bacterial phosphatases , which is part of the DDDD superfamily of phosphohydrolases. The crystal structure of AphA has been determined at 2.2A and its resolution extended to 1.7A. Despite the lack of sequence homology, the AphA structure reveals a haloacid dehalogenase-like fold. This finding suggests that this fold could be conserved among members of the DDDD superfamily of phosphohydrolases. The active enzyme is a homotetramer built by using an extended N-terminal arm intertwining the four monomers. The active site of the native enzyme hosts a magnesium ion, which can be replaced by other metal ions. The structure explains the non-specific behaviour of AphA towards substrates, while a structure-based alignment with other phosphatases provides clues about the catalytic mechanism . ; GO: 0003993 acid phosphatase activity, 0030288 outer membrane-bounded periplasmic space. Probab=47.06 E-value=17 Score=15.14 Aligned_cols=26 Identities=31% Similarity=0.208 Sum_probs=18.9 Q ss_pred CHHHHHH-HHHHHHHHHHCCCCCCCCC Q ss_conf 9443789-9999999984100002466 Q gi|254780598|r 1 MQKLFLA-VGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la-~av~~lal~~~~~A~Aad~ 26 (205) |||++-+ +.+.+|+.+.+.+|.|... T Consensus 1 Mkk~~~~ils~v~ll~a~a~pa~A~g~ 27 (248) T TIGR01672 1 MKKVTKAILSVVALLTALAVPAVALGK 27 (248) T ss_pred CHHHHHHHHHHHHHHHHHHCCEEEECC T ss_conf 915788999999999997423122058 No 166 >PRK00249 flgH flagellar basal body L-ring protein; Reviewed Probab=47.03 E-value=17 Score=15.14 Aligned_cols=25 Identities=24% Similarity=0.376 Sum_probs=11.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |+++++.+.+.++++.+.|+....+ T Consensus 1 ~~~~~~~~~~~~~~~LsgCa~~~~~ 25 (231) T PRK00249 1 MAMMLIALALLLLLLLSGCASIPPK 25 (231) T ss_pred CCCHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9338999999999998753489998 No 167 >PRK13680 hypothetical protein; Provisional Probab=46.89 E-value=14 Score=15.68 Aligned_cols=25 Identities=32% Similarity=0.178 Sum_probs=17.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||.-+++.+.+++++...++++-+ T Consensus 5 mkk~G~~l~~lll~~s~~~~a~~ee 29 (121) T PRK13680 5 MKKRGALLVLLLLACSSVFSANAEE 29 (121) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 7665299999999988777665057 No 168 >COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism] Probab=46.26 E-value=12 Score=15.88 Aligned_cols=24 Identities=29% Similarity=0.297 Sum_probs=15.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 944378999999999841000024 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAA 24 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aa 24 (205) |||+++..++.+.+++++.++.++ T Consensus 3 m~k~l~~~~ll~~a~a~~~~~~~~ 26 (202) T COG2854 3 MKKSLTILALLVIAFASSLAAAAP 26 (202) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 899999999999999988876375 No 169 >pfam04357 DUF490 Family of unknown function (DUF490). Probab=46.19 E-value=18 Score=15.07 Aligned_cols=30 Identities=17% Similarity=0.180 Sum_probs=14.3 Q ss_pred EEEEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEEEEEC Q ss_conf 4661004886799999999750565566226999998539 Q gi|254780598|r 166 GVEVGGLSESLVARLEYRASKYSKVEGFYNTISLGVGMKF 205 (205) Q Consensus 166 Gvey~~it~~~~~r~EY~y~df~~~~~~~~~~~~G~~ykF 205 (205) .+|| .++++|++++++... ..-.+++-||| T Consensus 318 ~i~y-~l~~~~~l~~~~~~~---------~~~~~~l~y~~ 347 (349) T pfam04357 318 TLEY-RLTPNLSLEASSDSG---------GESGVDLFYKF 347 (349) T ss_pred EEEE-EECCCEEEEEEECCC---------CCCEEEEEEEE T ss_conf 9999-976998999999679---------86389899997 No 170 >PRK11917 bifunctional adhesin/ABC transporter aspartate/glutamate-binding protein; Reviewed Probab=45.27 E-value=17 Score=15.21 Aligned_cols=18 Identities=22% Similarity=0.266 Sum_probs=11.6 Q ss_pred CHHHHHHHHHHHHHHHHC Q ss_conf 944378999999999841 Q gi|254780598|r 1 MQKLFLAVGVSSLALASF 18 (205) Q Consensus 1 Mkk~~la~av~~lal~~~ 18 (205) +||.++++++.++++|.. T Consensus 3 ~~~~l~~~~~~~l~~~~~ 20 (259) T PRK11917 3 FRKSLLKLAVFALGACVA 20 (259) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 279999999999999986 No 171 >COG3026 RseB Negative regulator of sigma E activity [Signal transduction mechanisms] Probab=44.36 E-value=18 Score=14.99 Aligned_cols=27 Identities=19% Similarity=0.175 Sum_probs=20.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) ||++..++.+....++++..|.|.++. T Consensus 1 mk~l~~s~~ll~~sl~~s~~a~ae~~s 27 (320) T COG3026 1 MKQLWFSLLLLLGSLLLSAAASAESAS 27 (320) T ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 923789999999998610443046865 No 172 >PRK11119 proX glycine betaine transporter periplasmic subunit; Provisional Probab=43.60 E-value=18 Score=15.02 Aligned_cols=25 Identities=36% Similarity=0.279 Sum_probs=14.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 4437899999999984100002466543 Q gi|254780598|r 2 QKLFLAVGVSSLALASFCSAQAADPVRR 29 (205) Q Consensus 2 kk~~la~av~~lal~~~~~A~Aad~~~~ 29 (205) ||++++.++ +++..++|+|+|++.+ T Consensus 4 ~~~l~~~a~---~~~~~~~a~a~~~pg~ 28 (332) T PRK11119 4 HKVLLATAL---ATLISTSAFAADLPGK 28 (332) T ss_pred HHHHHHHHH---HHHHHHHHHHCCCCCC T ss_conf 888999999---9985227652558999 No 173 >pfam06518 DUF1104 Protein of unknown function (DUF1104). This family consists of several hypothetical proteins of unknown function which appear to be found exclusively in Helicobacter pylori. Probab=43.59 E-value=19 Score=14.96 Aligned_cols=23 Identities=30% Similarity=0.223 Sum_probs=16.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |||++..+.+++|+++. ..|+|- T Consensus 1 MKk~~~~~l~~aL~~s~---l~AaDF 23 (142) T pfam06518 1 MKKALSLCLVGALLFSF---LNAADF 23 (142) T ss_pred CHHHHHHHHHHHHHHHH---HCHHHH T ss_conf 92689999999999998---552642 No 174 >pfam09403 FadA Adhesion protein FadA. FadA (Fusobacterium adhesin A) is an adhesin which forms two alpha helices. Probab=43.11 E-value=17 Score=15.14 Aligned_cols=10 Identities=30% Similarity=0.521 Sum_probs=7.6 Q ss_pred CHHHHHHHHH Q ss_conf 9443789999 Q gi|254780598|r 1 MQKLFLAVGV 10 (205) Q Consensus 1 Mkk~~la~av 10 (205) |||+++...+ T Consensus 1 MkK~ll~~~l 10 (126) T pfam09403 1 MKKILLCSML 10 (126) T ss_pred CHHHHHHHHH T ss_conf 9169999999 No 175 >CHL00132 psaF photosystem I subunit III; Validated Probab=42.89 E-value=20 Score=14.79 Aligned_cols=31 Identities=23% Similarity=0.113 Sum_probs=20.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9443789999999998410000246654356 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAH 31 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~ 31 (205) ||+++..+.+..+.++.+-....||...-.| T Consensus 1 Mrr~~a~~l~~~l~~~~ap~~a~AdvagLtP 31 (185) T CHL00132 1 MKKFNLIFLLAALLLTLNPNIALADVAGLTP 31 (185) T ss_pred CHHHHHHHHHHHHHHHCCCCHHHCCCCCCCC T ss_conf 9069999999999993686221101138860 No 176 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=42.40 E-value=21 Score=14.74 Aligned_cols=20 Identities=30% Similarity=0.270 Sum_probs=11.2 Q ss_pred CHHHHH-HHHHHHHHHHHCCC Q ss_conf 944378-99999999984100 Q gi|254780598|r 1 MQKLFL-AVGVSSLALASFCS 20 (205) Q Consensus 1 Mkk~~l-a~av~~lal~~~~~ 20 (205) |+|.++ .+.+++|+|+++++ T Consensus 1 m~k~~~~~~l~~~l~lsgC~s 21 (160) T PRK09967 1 MIKHLVAPLIFTSLILTGCQS 21 (160) T ss_pred CHHHHHHHHHHHHHHHHHCCC T ss_conf 907899999999999984589 No 177 >TIGR02791 VirB5 P-type DNA transfer protein VirB5; InterPro: IPR014158 This entry contains VirB5, a protein that is involved in the type IV DNA secretion systems typified by the Agrobacterium Ti plasmid vir system where it interacts with several other proteins essential for proper pilus formation . VirB5 is homologous to the IncN (N-type) conjugation system protein TraC as well as the P-type protein TrbJ and the F-type protein TraE .. Probab=41.65 E-value=21 Score=14.68 Aligned_cols=16 Identities=31% Similarity=0.432 Sum_probs=11.8 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9443789999999998 Q gi|254780598|r 1 MQKLFLAVGVSSLALA 16 (205) Q Consensus 1 Mkk~~la~av~~lal~ 16 (205) |||++++.++|+|.|. T Consensus 1 mKK~~~~~a~salml~ 16 (233) T TIGR02791 1 MKKHAVSCALSALMLV 16 (233) T ss_pred CHHHHHHHHHHHHHHH T ss_conf 9256799999999999 No 178 >pfam07585 DUF1551 Protein of unknown function (DUF1551). A family of proteins identified in Rhodopirellula baltica. Probab=41.42 E-value=21 Score=14.66 Aligned_cols=50 Identities=16% Similarity=0.170 Sum_probs=35.0 Q ss_pred CCCCEEEE--EEEEEEEECCCCCEEEEEEEEEEECCCC-----------------------------CCCCEEEEEEEEE Q ss_conf 77520479--9734661004886799999999750565-----------------------------5662269999985 Q gi|254780598|r 155 TPIAIGGT--AGVGVEVGGLSESLVARLEYRASKYSKV-----------------------------EGFYNTISLGVGM 203 (205) Q Consensus 155 ~~~~~g~~--~G~Gvey~~it~~~~~r~EY~y~df~~~-----------------------------~~~~~~~~~G~~y 203 (205) +++..+++ +++-+.| .+|++|++++.|+..-+++| +.-.|=+.+|+-| T Consensus 258 ~~~~fa~v~e~~~~~~y-~~t~~~~l~~GY~~l~~s~V~ra~dqid~~~~~~~~~~~~~p~~~~~~s~~~~~Gl~~G~e~ 336 (338) T pfam07585 258 ARDRFAFVPELGLNLGY-QLTPNLSLTVGYSFLYLSDVALAGDQIDRNVNTANLTGAVRPAFVDSNSSLLAHGLTFGGEY 336 (338) T ss_pred ECCEEEEEEECCEEEEE-EECCCEEEEEEEEEEEECCCCCCHHCCCCCCCCCCCCCCCCCCEEECCCCEEEEEEEEEEEE T ss_conf 11308998424079999-96487899986789998143652120155668888877667623434687599715876898 Q ss_pred EC Q ss_conf 39 Q gi|254780598|r 204 KF 205 (205) Q Consensus 204 kF 205 (205) +| T Consensus 337 ~f 338 (338) T pfam07585 337 SF 338 (338) T ss_pred EC T ss_conf 39 No 179 >PRK04517 hypothetical protein; Provisional Probab=41.40 E-value=16 Score=15.28 Aligned_cols=25 Identities=32% Similarity=0.065 Sum_probs=15.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCC-CCC Q ss_conf 944378999999999841000024-665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAA-DPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aa-d~~ 27 (205) ||.+...+ ++++|+.+++|+|+ .+. T Consensus 1 MK~~~~~~--~ll~l~~s~sa~A~vtL~ 26 (216) T PRK04517 1 MKLIKPLT--CLLALAMSGSAFADVTVE 26 (216) T ss_pred CCCHHHHH--HHHHHHHHHHHHHHCEEE T ss_conf 96027999--999999713553320783 No 180 >TIGR03431 PhnD phosphonate ABC transporter, periplasmic phosphonate binding protein. Note that this model does not identify all phnD-subfamily genes with evident phosphonate context, but all sequences above the trusted context may be inferred to bind phosphonate compounds even in the absence of such context. Furthermore, there is ample evidence to suggest that many other members of the TIGR01098 subfamily have a different primary function. Probab=41.39 E-value=21 Score=14.66 Aligned_cols=25 Identities=28% Similarity=0.227 Sum_probs=14.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |+|-++.+.+++++++ +++|.|.+. T Consensus 1 m~~r~l~~~~~~~~~~-~~~a~a~~~ 25 (288) T TIGR03431 1 MLRRLILSLVAAFMLI-SSNAQAEDW 25 (288) T ss_pred CCHHHHHHHHHHHHHH-CCHHHHCCC T ss_conf 9088999999999997-432243048 No 181 >PRK11397 dacD D-alanyl-D-alanine carboxypeptidase; Provisional Probab=41.16 E-value=22 Score=14.64 Aligned_cols=27 Identities=19% Similarity=0.199 Sum_probs=10.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||++.++++.++.+..++.+++.-.. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 30 (390) T PRK11397 4 KRRLIIAASLFVFNLSSAFAAENIPFS 30 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 789999999999999998740457789 No 182 >PRK09838 periplasmic copper-binding protein; Provisional Probab=40.98 E-value=22 Score=14.62 Aligned_cols=26 Identities=27% Similarity=0.307 Sum_probs=20.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) ||+++-++..+++.+.++ .++|.|.- T Consensus 1 m~~~~~~~~~~~~s~~~~-~~~a~~h~ 26 (114) T PRK09838 1 MKKALKVAMFSLFSVIGF-NAQANEHH 26 (114) T ss_pred CCHHHHHHHHHHHHHHHH-HHHCCCCC T ss_conf 932789999999998732-32025653 No 183 >PHA00025 major coat protein Probab=40.84 E-value=19 Score=14.92 Aligned_cols=27 Identities=22% Similarity=0.224 Sum_probs=20.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||.....++.+.++.....++|||+. T Consensus 1 mkks~v~ka~~a~~l~vsa~sFAAe~~ 27 (76) T PHA00025 1 MKKSKVKKASGAVTLLVSAGSFAAEPN 27 (76) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 926699999765553001210003776 No 184 >pfam05643 DUF799 Putative bacterial lipoprotein (DUF799). This family consists of several bacterial proteins of unknown function. Some of the family members are described as putative lipoproteins. Probab=40.72 E-value=20 Score=14.81 Aligned_cols=16 Identities=31% Similarity=0.472 Sum_probs=11.0 Q ss_pred CHHHHHHHHHHHHHHHH Q ss_conf 94437899999999984 Q gi|254780598|r 1 MQKLFLAVGVSSLALAS 17 (205) Q Consensus 1 Mkk~~la~av~~lal~~ 17 (205) |||+++.+++.. .|++ T Consensus 1 mK~lil~l~~~l-~L~g 16 (215) T pfam05643 1 MKPLILGLAAVL-ALSA 16 (215) T ss_pred CHHHHHHHHHHH-HHHH T ss_conf 931499999999-9862 No 185 >TIGR03227 PhnS 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate binding protein. This ABC transporter periplasmic substrate binding protein component is found in a region of the salmonella typhimurium LT2 genome responsible for the catabolism of 2-aminoethylphosphonate via the phnWX pathway (GenProp0238). The protein contains a match to pfam01547 for the "Bacterial extracellular solute-binding protein" domain. Probab=40.56 E-value=19 Score=14.96 Aligned_cols=27 Identities=33% Similarity=0.429 Sum_probs=21.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||+.++++.++|+.+...+|+|++.. T Consensus 12 ~~~~~~~~~~~~~~~~~~~~a~a~~~~ 38 (367) T TIGR03227 12 LRRFALAAGAAALLQGAAAPAQAAAVV 38 (367) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCEE T ss_conf 999999999999986423555458659 No 186 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=40.21 E-value=20 Score=14.80 Aligned_cols=13 Identities=38% Similarity=0.388 Sum_probs=9.5 Q ss_pred CHHHHHHHHHHHH Q ss_conf 9443789999999 Q gi|254780598|r 1 MQKLFLAVGVSSL 13 (205) Q Consensus 1 Mkk~~la~av~~l 13 (205) |||+++...+..+ T Consensus 1 Mkk~~~l~~~~l~ 13 (26) T pfam08139 1 MKKLLLLLLALLL 13 (26) T ss_pred CHHHHHHHHHHHH T ss_conf 9669999999999 No 187 >COG4771 FepA Outer membrane receptor for ferrienterochelin and colicins [Inorganic ion transport and metabolism] Probab=39.15 E-value=23 Score=14.47 Aligned_cols=44 Identities=9% Similarity=0.079 Sum_probs=27.8 Q ss_pred EEE-EEEEEEEECCCCCEEEEEEEEEE---EC-------------CCCCCCCEEEEEEEEEEC Q ss_conf 479-97346610048867999999997---50-------------565566226999998539 Q gi|254780598|r 160 GGT-AGVGVEVGGLSESLVARLEYRAS---KY-------------SKVEGFYNTISLGVGMKF 205 (205) Q Consensus 160 g~~-~G~Gvey~~it~~~~~r~EY~y~---df-------------~~~~~~~~~~~~G~~ykF 205 (205) .|+ +++|+-| .+|+||++++.-.-. ++ +.+. +--++.++|.|+| T Consensus 639 ~Yti~~~ga~y-~~tkn~sL~agV~NLfDK~l~~~~~~a~~~g~~~y~e-pGRry~~s~~~~F 699 (699) T COG4771 639 AYTIVDLGATY-DVTKNVSLNAGVYNLFDKRLSRDGNYAFGAGAYTYNE-PGRRYYMSLNYSF 699 (699) T ss_pred CEEEEECCEEE-EECCCEEEEHHHHHHCCHHHCCCCCCCCCCCCCCCCC-CCCEEEEEEECCC T ss_conf 64898424146-6323426601012204422123554333445432236-7725899853049 No 188 >TIGR03519 Bac_Flav_fam_1 Bacteroidetes-specific putative membrane protein. This model describes a protein family unique to, and greatly expanded in, the Bacteriodetes. Species in this lineage include several, such as Cytophaga hutchinsonii and Flavobacterium johnsoniae, that exhibit a poorly understood rapid gliding phenotype. Several members of this protein family are found in operons with other genes whose loss leads to a loss a this motility. Probab=38.62 E-value=24 Score=14.42 Aligned_cols=11 Identities=27% Similarity=0.416 Sum_probs=8.4 Q ss_pred CHHHHHHHHHH Q ss_conf 94437899999 Q gi|254780598|r 1 MQKLFLAVGVS 11 (205) Q Consensus 1 Mkk~~la~av~ 11 (205) |||+++.+.+. T Consensus 1 Mkki~l~~~~~ 11 (292) T TIGR03519 1 MKKILLLLLLL 11 (292) T ss_pred CCCHHHHHHHH T ss_conf 96057889999 No 189 >PRK13731 conjugal transfer surface exclusion protein TraT; Provisional Probab=38.48 E-value=21 Score=14.65 Aligned_cols=37 Identities=24% Similarity=0.185 Sum_probs=22.5 Q ss_pred CHHHHHHHHHHH-HHHHHCCCCCCC----CCCCCCCCCCCCC Q ss_conf 944378999999-999841000024----6654356666655 Q gi|254780598|r 1 MQKLFLAVGVSS-LALASFCSAQAA----DPVRRAHHGGRGV 37 (205) Q Consensus 1 Mkk~~la~av~~-lal~~~~~A~Aa----d~~~~~~~~~~~~ 37 (205) |||+++...+++ |+|+++++.+.+ |+........... T Consensus 3 ~kk~~~~~~~~~~l~lsGC~a~~tai~krnL~VqTkMS~TIf 44 (243) T PRK13731 3 TKKLMMVALVSSTLALSGCGAMSTAIKKRNLEVKTQMSETIW 44 (243) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCEECEECCCEE T ss_conf 377579999999999715687677877357522220004388 No 190 >PRK10797 glutamate and aspartate transporter subunit; Provisional Probab=38.28 E-value=19 Score=14.91 Aligned_cols=16 Identities=25% Similarity=0.221 Sum_probs=8.3 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9443789999999998 Q gi|254780598|r 1 MQKLFLAVGVSSLALA 16 (205) Q Consensus 1 Mkk~~la~av~~lal~ 16 (205) |||+++.+...++.++ T Consensus 3 ~kk~~~~~l~~~l~~~ 18 (302) T PRK10797 3 LRKLATALLALGLSAG 18 (302) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 5899999999999999 No 191 >pfam05150 Legionella_OMP Legionella pneumophila major outer membrane protein precursor. This family consists of major outer membrane protein precursors from Legionella pneumophila. Probab=38.24 E-value=24 Score=14.39 Aligned_cols=41 Identities=24% Similarity=0.109 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 44378999999999841000024665435666665556345544444 Q gi|254780598|r 2 QKLFLAVGVSSLALASFCSAQAADPVRRAHHGGRGVVPTIATNRYVP 48 (205) Q Consensus 2 kk~~la~av~~lal~~~~~A~Aad~~~~~~~~~~~~~~~~~~~~~~~ 48 (205) ||..++ +|+|++ +.++|.++-+ +-.+.-...|.....+..+ T Consensus 5 KKtt~A----vl~lgs-s~~~AGtMGP-VcTPGnVtVPCe~~~wd~g 45 (297) T pfam05150 5 KKTAAA----VFALGS-SALFAGTMGP-VCTPGNVTVPCERTAWDIG 45 (297) T ss_pred HHHHHH----HHHHHC-CCEECCCCCC-CCCCCCEECCCCCCCEECC T ss_conf 777999----987621-3300244677-4578855047876520213 No 192 >PRK10154 hypothetical protein; Provisional Probab=37.11 E-value=23 Score=14.48 Aligned_cols=27 Identities=30% Similarity=0.452 Sum_probs=19.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) ||..+...+...+.+..++.|+|.|-. T Consensus 1 mks~~t~sa~l~l~~~~~~~a~anD~~ 27 (134) T PRK10154 1 MKSVFTISASLAISLMLCCTAQANDHK 27 (134) T ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCE T ss_conf 941323489999999876544227746 No 193 >PRK09818 putative kinase inhibitor; Provisional Probab=36.96 E-value=20 Score=14.84 Aligned_cols=27 Identities=30% Similarity=0.306 Sum_probs=16.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9443789999999998410000246654356 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAH 31 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~ 31 (205) |||+++.+.++.+.+ +|+|+......+ T Consensus 1 ~~~~~~~~~~~~~~~----~a~a~~f~ltS~ 27 (183) T PRK09818 1 MKKLIVSSVLAFITF----SAQAAAFQVTSN 27 (183) T ss_pred CHHHHHHHHHHHHHC----CCCCCCEEEECC T ss_conf 913899999999744----355686488787 No 194 >PRK10316 hypothetical protein; Provisional Probab=36.62 E-value=20 Score=14.80 Aligned_cols=29 Identities=24% Similarity=0.304 Sum_probs=18.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9443789999999998410000246654356 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAH 31 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~ 31 (205) |||++++..+++... .+++.|++....++ T Consensus 1 ~~r~l~a~~l~~~l~--~~~~~aa~~a~~~~ 29 (209) T PRK10316 1 MKRLIMATMVTAILA--SSTVWAADNAAAQQ 29 (209) T ss_pred CHHHHHHHHHHHHHH--CCHHHHHHHHCCCC T ss_conf 926999999999982--35145544311430 No 195 >PRK11443 hypothetical protein; Provisional Probab=36.46 E-value=19 Score=14.86 Aligned_cols=13 Identities=15% Similarity=0.161 Sum_probs=9.1 Q ss_pred CHHHHHHHHHHHH Q ss_conf 9443789999999 Q gi|254780598|r 1 MQKLFLAVGVSSL 13 (205) Q Consensus 1 Mkk~~la~av~~l 13 (205) |||+.+++.+..| T Consensus 1 Mk~~~~~~~~lll 13 (120) T PRK11443 1 MKKFIAPLLALLV 13 (120) T ss_pred CHHHHHHHHHHHH T ss_conf 9158999999999 No 196 >COG4549 Uncharacterized protein conserved in bacteria [Function unknown] Probab=36.40 E-value=24 Score=14.36 Aligned_cols=23 Identities=26% Similarity=0.127 Sum_probs=14.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 94437899999999984100002 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQA 23 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~A 23 (205) |||+.+.+.+..++..++..|.| T Consensus 1 mk~i~~~l~~~~~~~~~~~~AsA 23 (178) T COG4549 1 MKKIKLSLCALFALSLSTFTASA 23 (178) T ss_pred CCCHHHHHHHHHHHHHCCCCCEE T ss_conf 94036889999998752332217 No 197 >PRK08482 F0F1 ATP synthase subunit C; Validated Probab=35.70 E-value=16 Score=15.25 Aligned_cols=20 Identities=50% Similarity=0.411 Sum_probs=12.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |||+++.. ++.++.|+|+|- T Consensus 1 mkk~~~l~------~a~a~~Afa~e~ 20 (105) T PRK08482 1 MKKLLLLF------LALAGVAFAADG 20 (105) T ss_pred CHHHHHHH------HHHHHHHHHHCC T ss_conf 91599999------999999876047 No 198 >pfam02402 Lysis_col Lysis protein. These small bacterial proteins are required for colicin release and partial cell lysis. This family contains lysis proteins for several different forms of colicin. Bacillus subtilis lytA has been included in this family, the similarity is not highly significant, however it is also a short protein, that is involved in secretion of other proteins (Bateman A pers. obs.). Probab=35.61 E-value=17 Score=15.16 Aligned_cols=22 Identities=27% Similarity=0.321 Sum_probs=10.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCC Q ss_conf 9443789999999998410000 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQ 22 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~ 22 (205) |||++....+....+..++.|+ T Consensus 1 Mkki~~~~i~~~~~~L~aCQaN 22 (46) T pfam02402 1 MKKILFIGILLLTVLLSACQAN 22 (46) T ss_pred CCEEEEHHHHHHHHHHHHHHHH T ss_conf 9242013999999999985550 No 199 >PRK09360 lamB maltoporin; Provisional Probab=34.89 E-value=27 Score=14.10 Aligned_cols=24 Identities=38% Similarity=0.246 Sum_probs=13.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |||+.|+++|++.+ ++.+|+|.|- T Consensus 5 ~~~~~~a~~~~~~~--~~~~a~a~eF 28 (417) T PRK09360 5 LRKLPLAVAVAAGL--LSTQAMAVDF 28 (417) T ss_pred HHHHHHHHHHHHHH--CCCCEEEEEE T ss_conf 32578999999764--0465148985 No 200 >pfam11106 YjbE Exopolysaccharide production protein YjbE. YjbE is part of a four gene operon which is involved in exopolysaccharide production. The expression of YjbE is higher than the rest of the operon yjbEFGH. It appears to be restricted to Enterobacteriaceae. Probab=34.38 E-value=28 Score=14.05 Aligned_cols=25 Identities=32% Similarity=0.260 Sum_probs=13.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||++..+-. ++..++.+++|+++. T Consensus 1 mkkil~gifa--i~al~a~s~~aapv~ 25 (80) T pfam11106 1 MKKILSGIFA--IAALAATSAQAAPVQ 25 (80) T ss_pred CHHHHHHHHH--HHHHHHHHCCCCCEE T ss_conf 9037788899--999776102336612 No 201 >COG2825 HlpA Outer membrane protein [Cell envelope biogenesis, outer membrane] Probab=33.83 E-value=29 Score=14.00 Aligned_cols=25 Identities=36% Similarity=0.246 Sum_probs=12.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) ||+.++...+..+++.+++++.+++ T Consensus 1 Mk~~~~~~~l~l~la~~~s~~~~~~ 25 (170) T COG2825 1 MKKRLLAALLGLALATSASAAQAAP 25 (170) T ss_pred CCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 9638999999999998877775275 No 202 >PRK10957 iron-enterobactin transporter periplasmic binding protein; Provisional Probab=33.45 E-value=29 Score=13.97 Aligned_cols=32 Identities=31% Similarity=0.224 Sum_probs=22.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 44378999999999841000024665435666 Q gi|254780598|r 2 QKLFLAVGVSSLALASFCSAQAADPVRRAHHG 33 (205) Q Consensus 2 kk~~la~av~~lal~~~~~A~Aad~~~~~~~~ 33 (205) +|..|++.+.+++++++++|++++-+...... T Consensus 5 ~~~~~~~~~~~~~~~g~~~~~~~~~prtV~h~ 36 (319) T PRK10957 5 PLYRLALLLLGLLLSGIAAAQAADWPRTITDS 36 (319) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCEEEECC T ss_conf 78999999999997478755458988789719 No 203 >pfam00879 Defensin_propep Defensin propeptide. Probab=32.41 E-value=30 Score=13.88 Aligned_cols=29 Identities=38% Similarity=0.358 Sum_probs=16.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 944378999999999841000024665435666 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHHG 33 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~~ 33 (205) ||++.|..++..+++ .|+|-.+..+.... T Consensus 1 MrTl~lLaA~lL~al----qaQAeplq~~~de~ 29 (52) T pfam00879 1 MRTLALLAALLLLAL----QAQAEPLQERADEA 29 (52) T ss_pred CCHHHHHHHHHHHHH----HHHCCCCCCCCCCC T ss_conf 913999999999999----98605642332325 No 204 >PRK10173 glucose-1-phosphatase/inositol phosphatase; Provisional Probab=32.24 E-value=30 Score=13.86 Aligned_cols=25 Identities=36% Similarity=0.353 Sum_probs=15.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |||.+++++++...+. +.+++|... T Consensus 1 ~~~~~~~~~~~~~~~~-~~~~~a~~~ 25 (413) T PRK10173 1 MKKSLLAAAVAGAVLL-SSAAQAQTV 25 (413) T ss_pred CCHHHHHHHHHHHHHH-HHHHHCCCC T ss_conf 9245999999999998-877740589 No 205 >TIGR01653 lactococcin_972 bacteriocin, lactococcin 972 family; InterPro: IPR006540 These sequences represent bacteriocins related to lactococcin 972 . Members tend to be found in association with a seven transmembrane putative immunity protein. . Probab=32.23 E-value=31 Score=13.86 Aligned_cols=25 Identities=32% Similarity=0.190 Sum_probs=15.6 Q ss_pred CHHHH---HHHHHHHHHHHHCCCCCCCC Q ss_conf 94437---89999999998410000246 Q gi|254780598|r 1 MQKLF---LAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~---la~av~~lal~~~~~A~Aad 25 (205) |||+. +++.++.+.|+....+.++- T Consensus 1 MkkK~ki~v~~~~~~~iL~~t~gg~~~q 28 (149) T TIGR01653 1 MKKKVKIGVVSLVSLTILATTLGGLAAQ 28 (149) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHEEH T ss_conf 9504788999999999875446553001 No 206 >COG4531 ZnuA ABC-type Zn2+ transport system, periplasmic component/surface adhesin [Inorganic ion transport and metabolism] Probab=32.22 E-value=31 Score=13.86 Aligned_cols=31 Identities=29% Similarity=0.210 Sum_probs=13.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9443789999999998410000246654356 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAH 31 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~ 31 (205) ||++++..+.++++++++.++-+++++.+.. T Consensus 4 ~~~~~l~~A~~A~~~~~~~a~A~~~Vv~SIK 34 (318) T COG4531 4 HKKTLLLSALFALLLGSAPAAAAAAVVTSIK 34 (318) T ss_pred HHHHHHHHHHHHHHHCCCHHCCCCCCEEEEC T ss_conf 5779999999999844530004632146504 No 207 >PRK10598 hypothetical protein; Provisional Probab=31.81 E-value=31 Score=13.82 Aligned_cols=15 Identities=33% Similarity=0.470 Sum_probs=9.4 Q ss_pred CHHHHHHHHHHHHHH Q ss_conf 944378999999999 Q gi|254780598|r 1 MQKLFLAVGVSSLAL 15 (205) Q Consensus 1 Mkk~~la~av~~lal 15 (205) |||+++++++....+ T Consensus 1 mkk~~l~~~l~l~~l 15 (186) T PRK10598 1 MKKFLLAAALLVSGL 15 (186) T ss_pred CCHHHHHHHHHHHHH T ss_conf 912699999999999 No 208 >pfam10614 Tafi-CsgF Curli production assembly/transport component CsgF. Fimbriae are cell-surface protein polymers, of eg. E coli and Salmonella spp, that mediate interactions important for host and environmental persistence, development of biofilms, motility, colonisation and invasion of cells, and conjugation. Four general assembly pathways for different fimbriae have been proposed, one of which is extracellular nucleation-precipitation (ENP), that differs from the others in that fibre-growth occurs extracellularly. Thin aggregative fimbriae (Tafi) are the only fimbriae dependent on the ENP pathway. Tafi were first identified in Salmonella spp and the controlling operon termed agf; however subsequent isolation of the homologous operon in E coli led to its being called csg. Tafi are known as curli because, in the absence of extracellular polysaccharides, their morphology appears curled; however, when expressed with such polysaccharides their morphology appears as a tangled amorphous m Probab=31.26 E-value=32 Score=13.77 Aligned_cols=30 Identities=27% Similarity=0.105 Sum_probs=19.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 94437899999999984100002466543566 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHH 32 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~ 32 (205) ||++.+.+.+ +.+..+..|+|.+++..+.- T Consensus 1 mk~~~~~~~l--~~l~~s~~a~A~elvy~f~N 30 (142) T pfam10614 1 MKYIGVWAIL--LFLAASPTAWAGDLVYQPVN 30 (142) T ss_pred CCCHHHHHHH--HHHHHCCHHHHHHEEECCCC T ss_conf 9142899999--99997043015133466459 No 209 >pfam01308 Chlam_OMP Chlamydia major outer membrane protein. The major outer membrane protein of Chlamydia contains four symmetrically spaced variable domains (VDs I to IV). This protein is believed to be an integral part to the pathogenesis, possibly adhesion. Along with the lipopolysaccharide, the major out membrane protein (MOMP) makes up the surface of the elementary body cell. The MOMP is the protein used to determine the different serotypes. Probab=31.14 E-value=32 Score=13.76 Aligned_cols=23 Identities=35% Similarity=0.262 Sum_probs=15.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 944378999999999841000024 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAA 24 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aa 24 (205) |||++-.+.+.+. ++++.+-+|- T Consensus 1 mkkllks~l~~aa-~~s~~SLqAl 23 (389) T pfam01308 1 MKKLLKSALLFAA-LGSASSLQAL 23 (389) T ss_pred CHHHHHHHHHHHH-HCCCCCHHCC T ss_conf 9378899999998-6464410003 No 210 >TIGR01098 3A0109s03R phosphonate ABC transporter, periplasmic phosphonate-binding protein; InterPro: IPR005770 Bacterial binding protein-dependent transport systems , are multicomponent systems typically composed of a periplasmic substrate-binding protein, one or two reciprocally homologous integral inner-membrane proteins and one or two peripheral membrane ATP-binding proteins that couple energy to the active transport system. The integral inner-membrane proteins translocate the substrate across the membrane. It has been shown , that most of these proteins contain a conserved region located about 80 to 100 residues from their C-terminal extremity. This region seems to be located in a cytoplasmic loop between two transmembrane domains. Apart from the conserved region, the sequence of these proteins is quite divergent, and they have a variable number of transmembrane helices. This is a family of periplasmic proteins which are part of the transport system for alkylphosphonate uptake. ; GO: 0015604 phosphonate transmembrane transporter activity, 0015716 phosphonate transport, 0030288 outer membrane-bounded periplasmic space. Probab=30.12 E-value=33 Score=13.66 Aligned_cols=18 Identities=17% Similarity=0.187 Sum_probs=12.5 Q ss_pred CHHHHHHHHHHHHHHHHC Q ss_conf 944378999999999841 Q gi|254780598|r 1 MQKLFLAVGVSSLALASF 18 (205) Q Consensus 1 Mkk~~la~av~~lal~~~ 18 (205) |||+++.+++.++++.+. T Consensus 1 mkr~~~~~~~~~~~~~~~ 18 (299) T TIGR01098 1 MKRLLALLAALLVASLAA 18 (299) T ss_pred CHHHHHHHHHHHHHHHHH T ss_conf 947889999999999998 No 211 >PRK04792 tolB translocation protein TolB; Provisional Probab=30.08 E-value=33 Score=13.66 Aligned_cols=28 Identities=25% Similarity=0.140 Sum_probs=14.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9443789999999998410000246654356 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAH 31 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~ 31 (205) |||+++++.+..+.+ +..|+|. +..+-. T Consensus 2 ~k~~~~~l~lll~~~--s~~A~A~-L~IeIt 29 (450) T PRK04792 2 LKRLILGLFVLLLSF--SQVANAA-LELVIT 29 (450) T ss_pred CHHHHHHHHHHHHHH--HHHHEEE-EEEEEE T ss_conf 389999999999986--5752024-999995 No 212 >PRK10884 putative signal transduction protein; Provisional Probab=29.36 E-value=29 Score=13.98 Aligned_cols=28 Identities=32% Similarity=0.367 Sum_probs=18.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 94437899999999984100002466543 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRR 29 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~ 29 (205) |+|+-+. .++.+.++.+..++|....+. T Consensus 1 m~~l~~~-~~~~l~~~~~~~~~aeeTrYV 28 (206) T PRK10884 1 MPKLRLI-GLTLLALSATAVSHAEETRYV 28 (206) T ss_pred CCCCCHH-HHHHHHHHHHHHHHHCCCEEE T ss_conf 9740139-999999864577752141577 No 213 >TIGR01779 TonB-B12 TonB-dependent vitamin B12 receptor; InterPro: IPR010101 The outer membrane is an essential component of Gram-negative bacteria, providing them with increased resistance to antibiotics, digestive enzymes, detergents and immune surveillance . The outer membrane is permeable to small hydrophilic molecules because of the presence of aqueous diffusion channels (e.g. porins). Small molecules present at high concentration diffuse down their concentration gradients into the periplasmic space. Porins are inadequate for the efficient acquisition of iron siderophores, cobalamins (Cbl), and other molecules that are present at very low concentrations and that are too bulky to pass through the lumen of the porin. Another class of outer membrane proteins binds these substrates with high specificity and carries out active transport across the outer membrane. This active transport process requires an energy source and a second protein, TonB. Outer membrane active-transport proteins interact with the transperiplasmic protein TonB through a conserved sequence, the "Ton-box". Interaction with TonB couples the energy of the proton motive force of the inner membrane to drive an outer membrane transport cycle . To date, crystal structures of four TonB-dependent transporters have been solved. Three of these are iron-siderophore transporters: ferrichrome transporter FhuA , ; ferric enterobactin transporter FepA ; and ferric dicitrate transporter FecA . The fourth structure is the cobalamin transporter BtuB . All of these structures are composed of two domains, a conserved N-terminal globular domain (hatch) and a 22-stranded beta-barrel (barrel). The hatch domain resides within the barrel and occludes the large pore of the large barrel domain. The hatch domains are composed of a central core of four beta-strands connected by loops. The conserved Ton-box is located near the periplasmic opening of the barrel and precedes the conserved hatch core. The barrels have large extracellular loops and short periplasmic turns connecting the beta-strands. TonB-dependent transporters bind their cognate substrates using residues from hatch loops, from the interior surfaces of beta-strands in the barrel wall, and from extracellular loops of the barrel . This entry represents the TonB-dependent outer membrane receptor found in gamma-proteobacteria responsible for translocating the cobalt-containing vitamin B12 (cobalamin). ; GO: 0004872 receptor activity, 0015889 cobalamin transport, 0016020 membrane. Probab=29.00 E-value=34 Score=13.61 Aligned_cols=61 Identities=18% Similarity=0.090 Sum_probs=29.5 Q ss_pred EEEEEEEEEECCCEEEEEEEEEEEEEEEECC--CCCCC---CCCCEEEEEEEEEEEECCCCCEEEEEE Q ss_conf 9998975431276778988544776444313--45556---775204799734661004886799999 Q gi|254780598|r 119 SLRIRGGYEVSDSLLLYATVGPDVAQKYETG--KAGEI---TPIAIGGTAGVGVEVGGLSESLVARLE 181 (205) Q Consensus 119 ~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~--~~~~~---~~~~~g~~~G~Gvey~~it~~~~~r~E 181 (205) ++.++.+|.|++++.+++++=-+-.+.+++. ..+.. ...+.--.+=+|..| .+|.|..++. T Consensus 238 ~~~~~~~h~F~~~w~~f~~~~~y~~~~~YD~y~sfG~~~~~~~~~~~q~~~~G~~~--~~e~~~~~L~ 303 (644) T TIGR01779 238 SLLLAGGHRFDERWSLFVRVLRYENRNDYDAYSSFGSRDYDEAEKGEQSLVGGRAR--YSERDASKLT 303 (644) T ss_pred CCCCCEEEEECCCCCEEEEEEEECCCCCCCCEECCCCCCHHHHHCCCCEEEEEEEE--CCCCEEEEEE T ss_conf 42310357876754202356420462101541248985511220355002443222--1764023246 No 214 >TIGR00031 UDP-GALP_mutase UDP-galactopyranose mutase; InterPro: IPR004379 UDP-galactopyranose mutase (5.4.99.9 from EC) is involved in the conversion of UDP-GALP into UDP-GALF through a 2-keto intermediate, and contains FAD as a cofactor. The gene is known as glf, ceoA, and rfbD. It is known experimentally in Escherichia coli, Mycobacterium tuberculosis, and Klebsiella pneumoniae.; GO: 0008767 UDP-galactopyranose mutase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=28.71 E-value=14 Score=15.65 Aligned_cols=18 Identities=17% Similarity=0.248 Sum_probs=8.1 Q ss_pred CCCCEEEEEEEEEEECCC Q ss_conf 488679999999975056 Q gi|254780598|r 172 LSESLVARLEYRASKYSK 189 (205) Q Consensus 172 it~~~~~r~EY~y~df~~ 189 (205) ....++=-+||.|-+-.+ T Consensus 291 ~~~~~TR~~EyK~L~~~~ 308 (390) T TIGR00031 291 LDVDITRIVEYKHLTPTG 308 (390) T ss_pred CCCCCCEEEEEEEECCCC T ss_conf 769743256544326677 No 215 >COG4588 AcfC Accessory colonization factor AcfC, contains ABC-type periplasmic domain [General function prediction only] Probab=28.27 E-value=36 Score=13.49 Aligned_cols=24 Identities=38% Similarity=0.347 Sum_probs=14.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||..+.++.++|++++ +++ ||.. T Consensus 1 Mk~~~~i~~~~~La~s~--~~~-adin 24 (252) T COG4588 1 MKKAVLILLIFLLAFSS--AAN-ADIN 24 (252) T ss_pred CCHHHHHHHHHHHHHHH--HHC-CEEE T ss_conf 93368899999997246--404-5178 No 216 >PRK11372 lysozyme inhibitor; Provisional Probab=28.15 E-value=29 Score=13.97 Aligned_cols=14 Identities=29% Similarity=0.294 Sum_probs=9.3 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 94437899999999 Q gi|254780598|r 1 MQKLFLAVGVSSLA 14 (205) Q Consensus 1 Mkk~~la~av~~la 14 (205) |||++.+.++..|+ T Consensus 3 MKkll~~~~~llLs 16 (109) T PRK11372 3 MKKLLIICLPVLLT 16 (109) T ss_pred HHHHHHHHHHHHHH T ss_conf 48899999999970 No 217 >COG4150 CysP ABC-type sulfate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=27.37 E-value=37 Score=13.40 Aligned_cols=24 Identities=25% Similarity=0.185 Sum_probs=15.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 944378999999999841000024 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAA 24 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aa 24 (205) |||.++..++.+.++++...|.|. T Consensus 6 mkk~~~~~~~~~~L~a~~~qa~as 29 (341) T COG4150 6 MKKNLLIGATILALFAALLQAGAS 29 (341) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 889887669999999998761467 No 218 >COG5633 Predicted periplasmic lipoprotein [General function prediction only] Probab=26.72 E-value=38 Score=13.38 Aligned_cols=25 Identities=32% Similarity=0.325 Sum_probs=13.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 944378999999999841000024665 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~ 27 (205) |||.+++ -+++++|.+. .++-+-++ T Consensus 1 Mrk~~~~-~l~~~lLvGC-sS~~~i~~ 25 (123) T COG5633 1 MRKLCLL-SLALLLLVGC-SSHQEILV 25 (123) T ss_pred CCEEHHH-HHHHHHHHCC-CCCCCCCC T ss_conf 9300389-9999994204-78877522 No 219 >pfam07273 DUF1439 Protein of unknown function (DUF1439). This family consists of several hypothetical bacterial proteins of around 190 residues in length. Several members of this family are annotated as being putative lipoproteins and are often known as YceB. The function of this family is unknown. Probab=26.39 E-value=39 Score=13.30 Aligned_cols=18 Identities=33% Similarity=0.421 Sum_probs=11.2 Q ss_pred CHHHHHHHHHHHHHHHHC Q ss_conf 944378999999999841 Q gi|254780598|r 1 MQKLFLAVGVSSLALASF 18 (205) Q Consensus 1 Mkk~~la~av~~lal~~~ 18 (205) ||++++++.++++.++++ T Consensus 1 Mk~~~~~~~~l~~~LsgC 18 (177) T pfam07273 1 MKKLLLLILLLVLLLSGC 18 (177) T ss_pred CCHHHHHHHHHHHHHHCC T ss_conf 914899999999997155 No 220 >COG5510 Predicted small secreted protein [Function unknown] Probab=26.25 E-value=39 Score=13.29 Aligned_cols=20 Identities=30% Similarity=0.240 Sum_probs=9.2 Q ss_pred CHHHH---HHHHHHHHHHHHCCC Q ss_conf 94437---899999999984100 Q gi|254780598|r 1 MQKLF---LAVGVSSLALASFCS 20 (205) Q Consensus 1 Mkk~~---la~av~~lal~~~~~ 20 (205) |||.+ .++.+++..+++... T Consensus 2 mk~t~l~i~~vll~s~llaaCNT 24 (44) T COG5510 2 MKKTILLIALVLLASTLLAACNT 24 (44) T ss_pred CHHHHHHHHHHHHHHHHHHHHHH T ss_conf 04899999999999999988663 No 221 >PRK10918 phosphate transporter subunit; Provisional Probab=26.16 E-value=40 Score=13.28 Aligned_cols=28 Identities=21% Similarity=0.296 Sum_probs=19.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 9443789999999998410000246654 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVR 28 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~ 28 (205) ||.+..++++++|.+++.+++.|+++.. T Consensus 4 ~~~~~~~~~~~~ls~~a~~a~aa~~l~G 31 (346) T PRK10918 4 MRTTVATVVAATLSMSAFSAFAAASLTG 31 (346) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 4889999999999887767876166124 No 222 >COG3133 SlyB Outer membrane lipoprotein [Cell envelope biogenesis, outer membrane] Probab=26.08 E-value=38 Score=13.35 Aligned_cols=18 Identities=39% Similarity=0.429 Sum_probs=12.1 Q ss_pred CHHHHHHHH-HHHHHHHHC Q ss_conf 944378999-999999841 Q gi|254780598|r 1 MQKLFLAVG-VSSLALASF 18 (205) Q Consensus 1 Mkk~~la~a-v~~lal~~~ 18 (205) |||..++++ +.+|.|... T Consensus 1 M~k~~~avA~~~al~L~GC 19 (154) T COG3133 1 MIKTALAVAALMALSLTGC 19 (154) T ss_pred CCHHHHHHHHHHHHHCCCC T ss_conf 9425789999988740011 No 223 >pfam03482 SIC sic protein. Serotype M1 group A Streptococcus strains cause epidemic waves of human infections. This family includes the sic protein an extracellular protein (streptococcal inhibitor of complement) that inhibits human complement. Probab=25.84 E-value=40 Score=13.25 Aligned_cols=22 Identities=18% Similarity=0.153 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCC Q ss_conf 4437899999999984100002 Q gi|254780598|r 2 QKLFLAVGVSSLALASFCSAQA 23 (205) Q Consensus 2 kk~~la~av~~lal~~~~~A~A 23 (205) |.+|++.+|++++|++..+.-| T Consensus 11 ktLLFTSLvAVALLGATQPvSA 32 (306) T pfam03482 11 KTLLFTSLVAVALLGATQPVSA 32 (306) T ss_pred HHHHHHHHHHHHHHCCCCCCCH T ss_conf 2567899999998447776524 No 224 >PRK03629 tolB translocation protein TolB; Provisional Probab=25.52 E-value=41 Score=13.22 Aligned_cols=22 Identities=18% Similarity=0.013 Sum_probs=14.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 944378999999999841000024 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAA 24 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aa 24 (205) |||+++.+-...+.++.. |+|. T Consensus 1 mk~~~~~~f~~lll~~~~--a~A~ 22 (430) T PRK03629 1 MKQALRVAFGFLILWASV--LHAE 22 (430) T ss_pred CCHHHHHHHHHHHHHHHH--HCEE T ss_conf 901488899899987544--2237 No 225 >PRK13211 N-acetylglucosamine-binding protein A; Reviewed Probab=25.34 E-value=41 Score=13.20 Aligned_cols=30 Identities=23% Similarity=0.111 Sum_probs=17.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 944378999999999841000024665435 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRA 30 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~ 30 (205) ||+.+..++++++++..++.|+|-.-+..+ T Consensus 1 ~~~~~~~~~~a~a~~l~s~~A~AHGyv~~P 30 (475) T PRK13211 1 MKKKLLKTLLAAALLLVSGGALAHGYVSSP 30 (475) T ss_pred CCCCHHHHHHHHHHHHCCCCCEECCCCCCC T ss_conf 974135799999998615340314054797 No 226 >cd07303 Porin3 Eukaryotic porin family that forms channels in the mitochondrial outer membrane. The porin family 3 contains two sub-families that play vital roles in the mitochondrial outer membrane, a translocase for unfolded pre-proteins (Tom40) and the voltage-dependent anion channel (VDAC) that regulates the flux of mostly anionic metabolites through the outer mitochondrial membrane. Probab=25.24 E-value=41 Score=13.19 Aligned_cols=18 Identities=6% Similarity=0.082 Sum_probs=8.0 Q ss_pred EEEEEEEEECCCCCEEEEE Q ss_conf 9973466100488679999 Q gi|254780598|r 162 TAGVGVEVGGLSESLVARL 180 (205) Q Consensus 162 ~~G~Gvey~~it~~~~~r~ 180 (205) .+++|..| ++.+.-++|+ T Consensus 213 ~~~vg~ky-~lD~~~~vka 230 (274) T cd07303 213 RFGIAAKY-QVDPDACFSA 230 (274) T ss_pred EEEEEEEE-ECCCCCEEEE T ss_conf 69989889-6199977999 No 227 >PRK09558 ushA bifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor; Reviewed Probab=24.97 E-value=42 Score=13.16 Aligned_cols=20 Identities=25% Similarity=0.114 Sum_probs=12.5 Q ss_pred CHHHHHHHHHHHHHHHHCCC Q ss_conf 94437899999999984100 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCS 20 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~ 20 (205) |+|++..+++.++.++.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~ 20 (551) T PRK09558 1 MMKLLKGLVALALLAALALC 20 (551) T ss_pred CHHHHHHHHHHHHHHHHHHC T ss_conf 92589999999999998740 No 228 >PRK10523 lipoprotein involved with copper homeostasis and adhesion; Provisional Probab=24.85 E-value=42 Score=13.15 Aligned_cols=18 Identities=22% Similarity=0.143 Sum_probs=10.7 Q ss_pred CHHHHHHHHHHHHHHHHC Q ss_conf 944378999999999841 Q gi|254780598|r 1 MQKLFLAVGVSSLALASF 18 (205) Q Consensus 1 Mkk~~la~av~~lal~~~ 18 (205) |||+++.+++++-+++.. T Consensus 2 mkk~~~~~~~a~g~~~l~ 19 (234) T PRK10523 2 MKKAILTALAAVGLFALM 19 (234) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 058999999999999876 No 229 >cd01346 Maltoporin-like The Maltoporin-like channels (LamB porin) form a trimeric structure which facilitate the diffusion of maltodextrins and other sugars across the outer membrane of Gram-negative bacteria. The membrane channel is formed by an 18-strand antiparallel beta-barrel (18,22). Loop 3 folds into the core to constrict pore size. Long irregular loops are found on the extracelllular side, while short turns are in the periplasm.Tightly-bound water molecules are found in the eyelet of the passage, and only substrates that can displace and replace the broken hydrogen bonds are likely to enter the pore. In the MPR structure, loops 4,6, and 9 have the greatest mobility and are highly variable; these are postulated to attract maltodextrins. Probab=24.44 E-value=43 Score=13.10 Aligned_cols=113 Identities=16% Similarity=0.106 Sum_probs=63.4 Q ss_pred EEECC-CEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEEEEEECCCEEEEEEEEEEEEEEEECCCC---CCCCCCCEE Q ss_conf 64048-37998451013215556666212232148999897543127677898854477644431345---556775204 Q gi|254780598|r 85 NVEDS-CIMYGVEGDVRYTVPVLADNIHSLHGIGGSLRIRGGYEVSDSLLLYATVGPDVAQKYETGKA---GEITPIAIG 160 (205) Q Consensus 85 n~q~~-~~V~G~e~d~~~~~~~~~~~~~~~~~~~~~~~~r~Gy~~~~~~l~Y~~~G~a~~~~~~~~~~---~~~~~~~~g 160 (205) .++.. +|-+..++-+......... .....| .++.+|+-|.+++.+-+-..+||...+.+..... .+-.+.-.. T Consensus 260 ~~~~~~~~~~~~~~~yq~~~~~~~~--~~~~~~-~s~~vRP~Y~~~~~~~~~~E~Gy~~~~~~~~~~~~~~~~~~K~TlA 336 (392) T cd01346 260 EWQLGDRFGGGVALVYQRGNDPYGY--GDDYQW-ASVGVRPAYKWSDNFKTAFEVGYDTVKADDGGGVTDDGSLYKLTVA 336 (392) T ss_pred EEECCCCEEEEEEEEEEECCCCCCC--CCCEEE-EEEEEEEEEEECCCEEEEEEEEEEEEECCCCCCCCCCCCEEEEEEE T ss_conf 8824797625043656843577778--897599-9999886079868368999978999934888875567865799997 Q ss_pred EEEEEEEEEECCCCCEEEEEEEEEEECCCC------------CCCCEEEEEEEEE Q ss_conf 799734661004886799999999750565------------5662269999985 Q gi|254780598|r 161 GTAGVGVEVGGLSESLVARLEYRASKYSKV------------EGFYNTISLGVGM 203 (205) Q Consensus 161 ~~~G~Gvey~~it~~~~~r~EY~y~df~~~------------~~~~~~~~~G~~y 203 (205) .++-+|-++ . .+=.+|+=+.|.+..+. +.+.+.+.+||-. T Consensus 337 ~t~s~g~~f-~--~RPelR~faTy~~w~~~~~~~~~~~~~~~~~~~~~~~fGvQ~ 388 (392) T cd01346 337 PTFSAGTDF-W--SRPELRFYATYSNWNDAALRAFTAFGSAFGNSKDGWNFGVQV 388 (392) T ss_pred EECCCCCCC-C--CCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEEEE T ss_conf 704108897-7--896899999998143010122444454456888868999999 No 230 >TIGR01165 cbiN cobalt transport protein; InterPro: IPR003705 The cobalt transport protein CbiN is part of the active cobalt transport system involved in uptake of cobalt in to the cell involved with cobalamin biosynthesis (vitamin B12). It has been suggested that CbiN may function as the periplasmic binding protein component of the active cobalt transport system .; GO: 0015087 cobalt ion transmembrane transporter activity, 0006824 cobalt ion transport, 0009236 cobalamin biosynthetic process, 0016020 membrane. Probab=24.14 E-value=43 Score=13.07 Aligned_cols=16 Identities=31% Similarity=0.468 Sum_probs=9.7 Q ss_pred CHHHH---HHHHHHHHHHH Q ss_conf 94437---89999999998 Q gi|254780598|r 1 MQKLF---LAVGVSSLALA 16 (205) Q Consensus 1 Mkk~~---la~av~~lal~ 16 (205) |||++ |.+.|++|++. T Consensus 3 ~k~~ina~LL~~va~L~~l 21 (94) T TIGR01165 3 MKKTINALLLAAVAALVVL 21 (94) T ss_pred CCCHHHHHHHHHHHHHHHH T ss_conf 5308999999999999999 No 231 >PRK00059 prsA peptidylprolyl isomerase; Provisional Probab=23.44 E-value=45 Score=13.00 Aligned_cols=21 Identities=10% Similarity=0.165 Sum_probs=16.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCC Q ss_conf 944378999999999841000 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSA 21 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A 21 (205) |||+++.+.+.++++++++.- T Consensus 4 ~~ki~~~~ii~~~~~~~vg~~ 24 (336) T PRK00059 4 AKKIVASLLVGVFIFSAVGCN 24 (336) T ss_pred HHHHHHHHHHHHHHHHHHCCC T ss_conf 799999999999999983025 No 232 >PRK09810 entericidin A; Provisional Probab=23.42 E-value=45 Score=13.00 Aligned_cols=21 Identities=29% Similarity=0.351 Sum_probs=13.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCC Q ss_conf 944378999999999841000 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSA 21 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A 21 (205) ||+++..+......+.....| T Consensus 2 mkrli~lil~~~~ll~gcnta 22 (41) T PRK09810 2 MKRLIVLVLLASTLLTGCNTA 22 (41) T ss_pred HHHHHHHHHHHHHHHHCCCCC T ss_conf 389999999999998256212 No 233 >pfam04202 Mfp-3 Foot protein 3. Mytilus foot protein-3 (Mfp-3) is a highly polymorphic protein family located in the byssal adhesive plaques of blue mussels. Probab=23.38 E-value=45 Score=12.99 Aligned_cols=18 Identities=28% Similarity=0.340 Sum_probs=14.8 Q ss_pred CHHHHHHHHHHHHHHHHC Q ss_conf 944378999999999841 Q gi|254780598|r 1 MQKLFLAVGVSSLALASF 18 (205) Q Consensus 1 Mkk~~la~av~~lal~~~ 18 (205) |+++.++++|+.+++.++ T Consensus 1 Mnn~Sv~VLvaLVLIGsF 18 (71) T pfam04202 1 MNNSSVSVLVALVLIGSF 18 (71) T ss_pred CCCCHHHHHHHHHHHHHE T ss_conf 974005899999985120 No 234 >COG4143 TbpA ABC-type thiamine transport system, periplasmic component [Coenzyme metabolism] Probab=23.19 E-value=45 Score=12.97 Aligned_cols=26 Identities=35% Similarity=0.368 Sum_probs=16.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |++++.++..+++++++++.++++.+ T Consensus 1 m~~~l~~~~~~all~~~~~~a~~~t~ 26 (336) T COG4143 1 MRRLLRALIGLALLVSAALGAQAATP 26 (336) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 90249999999999998657763485 No 235 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=22.44 E-value=47 Score=12.89 Aligned_cols=17 Identities=6% Similarity=0.094 Sum_probs=9.1 Q ss_pred CHHHHHHHHHHHHHHHH Q ss_conf 94437899999999984 Q gi|254780598|r 1 MQKLFLAVGVSSLALAS 17 (205) Q Consensus 1 Mkk~~la~av~~lal~~ 17 (205) |||.++.++++++++.+ T Consensus 1 mkkK~i~~~~~~~svl~ 17 (285) T PRK03002 1 MRGKHIFIITALISILM 17 (285) T ss_pred CCHHHHHHHHHHHHHHH T ss_conf 96018999999999999 No 236 >COG3489 Predicted periplasmic lipoprotein [General function prediction only] Probab=21.85 E-value=46 Score=12.96 Aligned_cols=21 Identities=33% Similarity=0.354 Sum_probs=12.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCC Q ss_conf 944378999999999841000 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSA 21 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A 21 (205) |||++..++++.++++...++ T Consensus 4 ~~~l~~vLav~Ll~~~p~a~a 24 (359) T COG3489 4 MRKLVTVLAVALLAACPPAPA 24 (359) T ss_pred HHHHHHHHHHHHHHCCCCCCC T ss_conf 678889999999832898743 No 237 >COG3248 Tsx Nucleoside-binding outer membrane protein [Cell envelope biogenesis, outer membrane] Probab=21.66 E-value=49 Score=12.80 Aligned_cols=32 Identities=28% Similarity=0.173 Sum_probs=18.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 94437899999999984100002466543566666 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADPVRRAHHGGR 35 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~~~~~~~~~~ 35 (205) |||.+||+.+ +++..+++.|.|.....|.... T Consensus 5 m~ktlLA~gl---laA~sa~~~~~d~~~~~~~~~~ 36 (284) T COG3248 5 MKKTLLAAGL---LAALSASFTANDAENDKPQYLS 36 (284) T ss_pred HHHHHHHHHH---HHHCCCCEECCCCCCCCCCCCC T ss_conf 8899999889---8733663350201258866443 No 238 >COG5514 Uncharacterized conserved protein [Function unknown] Probab=21.64 E-value=49 Score=12.81 Aligned_cols=22 Identities=23% Similarity=0.333 Sum_probs=13.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |||++.+ ..+++..++++|.|. T Consensus 1 mkkivaa----~v~~s~~~~~hA~~d 22 (203) T COG5514 1 MKKIVAA----QVIFSEPSPNHAPDD 22 (203) T ss_pred CHHHHHH----HHHHCCCCCCCCCCC T ss_conf 9047665----777617774448876 No 239 >PRK10270 hypothetical protein; Provisional Probab=21.61 E-value=49 Score=12.80 Aligned_cols=16 Identities=25% Similarity=0.391 Sum_probs=11.3 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9443789999999998 Q gi|254780598|r 1 MQKLFLAVGVSSLALA 16 (205) Q Consensus 1 Mkk~~la~av~~lal~ 16 (205) |||+++.+.+..++++ T Consensus 1 Mkk~~~~~~~l~~~l~ 16 (340) T PRK10270 1 MKKVLLIILLLLVVLG 16 (340) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 9028999999999999 No 240 >TIGR02967 guan_deamin guanine deaminase; InterPro: IPR014311 This entry describes guanine deaminase, which hydrolyzes guanine to xanthine and ammonia. Xanthine can then be converted to urate by xanthine dehydrogenase. In some bacteria, the guanine deaminase gene is found near the xdhABC genes for xanthine dehydrogenase. Non-homologous forms of guanine deaminase also exist, as well as more distantly related homologs which are not included in this entry. . Probab=21.59 E-value=24 Score=14.38 Aligned_cols=14 Identities=21% Similarity=0.107 Sum_probs=7.4 Q ss_pred EE-EEEEEECCCEEE Q ss_conf 98-975431276778 Q gi|254780598|r 121 RI-RGGYEVSDSLLL 134 (205) Q Consensus 121 ~~-r~Gy~~~~~~l~ 134 (205) |+ |+.|+++|||-| T Consensus 179 kGnR~~YavTPRFAP 193 (426) T TIGR02967 179 KGNRLLYAVTPRFAP 193 (426) T ss_pred CCCCCCCEECCCCCC T ss_conf 056367335466776 No 241 >COG3227 LasB Zinc metalloprotease (elastase) [Amino acid transport and metabolism] Probab=21.41 E-value=50 Score=12.77 Aligned_cols=28 Identities=36% Similarity=0.290 Sum_probs=15.5 Q ss_pred CHHHHHH-HHHHHHHHHHCCCCCCCCCCC Q ss_conf 9443789-999999998410000246654 Q gi|254780598|r 1 MQKLFLA-VGVSSLALASFCSAQAADPVR 28 (205) Q Consensus 1 Mkk~~la-~av~~lal~~~~~A~Aad~~~ 28 (205) |||++.. .+++...+..+.+..|+++.. T Consensus 1 mKKls~~l~~l~~~~~~~~~~~~aa~~~t 29 (507) T COG3227 1 MKKLSSVLTGLFLASAGSFPSVAAADLET 29 (507) T ss_pred CCHHHHHHHHHHHHHHCCCCCHHHCCCCC T ss_conf 94178999999998740342323315232 No 242 >TIGR01655 yxeA_fam conserved hypothetical protein; InterPro: IPR006542 These are a family of small (about 115 amino acids) uncharacterised proteins with N-terminal signal sequences, found exclusively in Gram-positive organisms. Most genomes that have any members of this family have at least two members.. Probab=20.93 E-value=51 Score=12.72 Aligned_cols=25 Identities=8% Similarity=-0.048 Sum_probs=19.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9443789999999998410000246 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAAD 25 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad 25 (205) |||.++.+++.++++..++.-+-.. T Consensus 1 MKk~~~~Lla~i~~~~i~~~~~~~~ 25 (121) T TIGR01655 1 MKKVLAGLLALIVVIIIVGLLFFHP 25 (121) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9355688999999999988766403 No 243 >PRK12750 cpxP periplasmic repressor CpxP; Reviewed Probab=20.88 E-value=51 Score=12.71 Aligned_cols=22 Identities=45% Similarity=0.409 Sum_probs=12.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 44378999999999841000024 Q gi|254780598|r 2 QKLFLAVGVSSLALASFCSAQAA 24 (205) Q Consensus 2 kk~~la~av~~lal~~~~~A~Aa 24 (205) ||++|++.+..++|++ .+|+|. T Consensus 5 kk~~laav~lpl~lg~-~sa~a~ 26 (170) T PRK12750 5 KKLVLAAVVLPLTLGT-ASAFAF 26 (170) T ss_pred HHHHHHHHHHHHHHHH-HHHHCC T ss_conf 9999999999998421-777506 No 244 >pfam01103 Bac_surface_Ag Surface antigen. This entry includes the following surface antigens; D15 antigen from H.influenzae, OMA87 from P.multocida, OMP85 from N.meningitidis and N.gonorrhoeae. The family also includes a number of eukaryotic proteins that are members of the UPF0140 family. There also appears to be a relationship to pfam03865 (personal obs: C Yeats). In eukaryotes, it appears that these proteins are not surface antigens; S. cerevisiae YNL026W (SAM50) is an essential component of the Sorting and Assembly Machinery (SAM) of the mitochondrial outer membrane. The protein was localized to the mitochondria. Probab=20.85 E-value=51 Score=12.71 Aligned_cols=70 Identities=21% Similarity=0.274 Sum_probs=40.5 Q ss_pred EEEEEEEEEEEEEEECCC-CCCCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCCCC--EEEEEEEEEEC Q ss_conf 789885447764443134-55567752047997346610048867999999997505655662--26999998539 Q gi|254780598|r 133 LLYATVGPDVAQKYETGK-AGEITPIAIGGTAGVGVEVGGLSESLVARLEYRASKYSKVEGFY--NTISLGVGMKF 205 (205) Q Consensus 133 l~Y~~~G~a~~~~~~~~~-~~~~~~~~~g~~~G~Gvey~~it~~~~~r~EY~y~df~~~~~~~--~~~~~G~~ykF 205 (205) .++.-+|-.+........ .........-+.+|+|+.+ .+|=--+|++|-+ -+.+.+.+. ..+.++++..| T Consensus 245 ~~F~D~G~~~~~~~~~~~~~~~~~~~~~~~s~G~Glr~--~tP~gpi~ld~a~-pl~~~~~d~~~~~f~F~iG~~F 317 (317) T pfam01103 245 ALFVDAGNVWNSGGDTPGGSDGVRSKNIRASVGVGLRW--DSPFGPLRFDYAI-PLKKPDGDRTEFRFYFGIGTEF 317 (317) T ss_pred EEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEEEE--ECCEEEEEEEEEE-CCCCCCCCCCCEEEEEECCCCC T ss_conf 99998017504567777544435678777999999999--7781879999998-7667778872208999840569 No 245 >PRK11289 ampC beta-lactamase; Provisional Probab=20.81 E-value=51 Score=12.71 Aligned_cols=27 Identities=30% Similarity=0.171 Sum_probs=14.0 Q ss_pred CHHHHHH-HHHHHHHHHHCCCCCCCCCC Q ss_conf 9443789-99999999841000024665 Q gi|254780598|r 1 MQKLFLA-VGVSSLALASFCSAQAADPV 27 (205) Q Consensus 1 Mkk~~la-~av~~lal~~~~~A~Aad~~ 27 (205) |||.+.. +.++...+++...|+||++. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 31 (387) T PRK11289 4 MMKSLLAALLLTASISAFAAPAAAATSQ 31 (387) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCHH T ss_conf 9999999999999998616765569979 No 246 >TIGR02448 TIGR02448 conserverd hypothetical protein; InterPro: IPR012661 This family consists of small hypothetical proteins, about 100 amino acids in length. The family includes five members (three in tandem) in Pseudomonas aeruginosa PAO1, and also in Pseudomonas putida (strain KT2440), four in Pseudomonas syringae DC3000, and single members in several other Proteobacteria. The function is unknown.. Probab=20.72 E-value=48 Score=12.86 Aligned_cols=24 Identities=33% Similarity=0.263 Sum_probs=11.9 Q ss_pred CH-HHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94-437899999999984100002466 Q gi|254780598|r 1 MQ-KLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mk-k~~la~av~~lal~~~~~A~Aad~ 26 (205) || +..|..++..|.|++. |.+... T Consensus 1 MR~r~~Ll~~~lLl~~A~~--a~a~~~ 25 (107) T TIGR02448 1 MRLRKLLLAAALLLSLAAG--AAAASF 25 (107) T ss_pred CCHHHHHHHHHHHHHCCHH--HHHHHH T ss_conf 9424789999998741033--777886 No 247 >COG3652 Predicted outer membrane protein [Function unknown] Probab=20.68 E-value=51 Score=12.69 Aligned_cols=26 Identities=31% Similarity=0.308 Sum_probs=18.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 94437899999999984100002466 Q gi|254780598|r 1 MQKLFLAVGVSSLALASFCSAQAADP 26 (205) Q Consensus 1 Mkk~~la~av~~lal~~~~~A~Aad~ 26 (205) |-..++...++++++.+...|+++|+ T Consensus 1 mi~~~m~~~aal~llla~p~A~aad~ 26 (170) T COG3652 1 MIRHFMKAAAALALLLALPFAQAADK 26 (170) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 90367777999999997365551489 No 248 >pfam08194 DIM DIM protein. Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila. This family includes DIMs 1 to 4 that have masses below 5 kDa. Probab=20.68 E-value=51 Score=12.69 Aligned_cols=18 Identities=44% Similarity=0.344 Sum_probs=11.5 Q ss_pred CHHHHHHHHHHHHH-HHHC Q ss_conf 94437899999999-9841 Q gi|254780598|r 1 MQKLFLAVGVSSLA-LASF 18 (205) Q Consensus 1 Mkk~~la~av~~la-l~~~ 18 (205) ||-+.++.++..|+ ++.+ T Consensus 1 MK~lsl~~~l~LLa~La~a 19 (37) T pfam08194 1 MKFLSLAFALGLLATLAFA 19 (37) T ss_pred CHHHHHHHHHHHHHHHHHH T ss_conf 9089999999999999972 No 249 >PRK09474 malE maltose ABC transporter periplasmic protein; Reviewed Probab=20.55 E-value=52 Score=12.68 Aligned_cols=16 Identities=19% Similarity=0.237 Sum_probs=9.5 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9443789999999998 Q gi|254780598|r 1 MQKLFLAVGVSSLALA 16 (205) Q Consensus 1 Mkk~~la~av~~lal~ 16 (205) |||.+-.++++.++.. T Consensus 1 ~~~~~~~~~~~~~~~~ 16 (394) T PRK09474 1 IKKGLRILALSALTTL 16 (394) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 9415889999999999 No 250 >PRK09840 catecholate siderophore receptor Fiu; Provisional Probab=20.44 E-value=52 Score=12.66 Aligned_cols=16 Identities=13% Similarity=0.202 Sum_probs=7.8 Q ss_pred CCCCCEEEEEEEEEEE Q ss_conf 0488679999999975 Q gi|254780598|r 171 GLSESLVARLEYRASK 186 (205) Q Consensus 171 ~it~~~~~r~EY~y~d 186 (205) .++++|.+.+-|.|.| T Consensus 623 ~~~~~~~l~~~yty~d 638 (760) T PRK09840 623 NITPAWQVIAGYTQQK 638 (760) T ss_pred EECCCEEEEEEEEEEE T ss_conf 0069879999999997 Done!