Query gi|254780374|ref|YP_003064787.1| flagellar basal body L-ring protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 238 No_of_seqs 135 out of 642 Neff 6.7 Searched_HMMs 39220 Date Sun May 29 16:03:17 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780374.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK12701 flgH flagellar basal 100.0 0 0 465.5 22.7 227 1-238 1-230 (230) 2 PRK00249 flgH flagellar basal 100.0 0 0 464.0 23.7 227 4-238 5-231 (231) 3 PRK12696 flgH flagellar basal 100.0 0 0 464.5 23.2 225 1-238 1-238 (238) 4 PRK12788 flgH flagellar basal 100.0 0 0 462.8 23.3 228 6-238 4-231 (231) 5 PRK12698 flgH flagellar basal 100.0 0 0 456.2 22.0 221 1-238 1-224 (224) 6 PRK12407 flgH flagellar basal 100.0 0 0 451.2 21.6 218 2-238 1-220 (220) 7 PRK12700 flgH flagellar basal 100.0 0 0 449.9 22.5 182 56-238 46-230 (230) 8 PRK12697 flgH flagellar basal 100.0 0 0 447.0 22.3 211 9-238 15-227 (227) 9 PRK12699 flgH flagellar basal 100.0 0 0 442.1 22.6 216 7-238 22-246 (246) 10 COG2063 FlgH Flagellar basal b 100.0 0 0 415.8 20.9 225 6-235 4-230 (230) 11 pfam02107 FlgH Flagellar L-rin 100.0 0 0 407.4 19.8 178 55-237 3-181 (181) 12 PRK11627 hypothetical protein; 94.9 0.024 6.1E-07 34.7 2.9 25 1-25 1-25 (192) 13 PRK09837 copper/silver efflux 91.5 0.29 7.5E-06 27.8 4.1 19 1-19 1-19 (460) 14 PRK00022 lolB outer membrane l 91.0 0.22 5.5E-06 28.6 3.0 23 1-23 1-23 (203) 15 TIGR02747 TraV type IV conjuga 90.4 0.16 4E-06 29.5 1.9 20 1-20 1-22 (174) 16 PRK09915 putative outer membra 90.2 0.4 1E-05 26.9 3.8 21 1-21 6-27 (488) 17 TIGR02722 lp_ uncharacterized 88.7 1 2.5E-05 24.5 4.9 86 3-91 4-97 (215) 18 pfam06291 Lambda_Bor Bor prote 88.5 0.43 1.1E-05 26.8 2.9 25 1-26 1-25 (97) 19 PRK10449 heat-inducible protei 88.2 0.76 1.9E-05 25.2 4.0 20 1-21 1-20 (140) 20 COG3056 Uncharacterized lipopr 88.0 0.51 1.3E-05 26.3 3.0 23 146-168 132-154 (204) 21 PRK09967 putative outer membra 87.8 0.37 9.5E-06 27.1 2.2 21 1-21 1-22 (160) 22 pfam05643 DUF799 Putative bact 87.1 0.62 1.6E-05 25.8 3.0 23 1-24 1-23 (215) 23 pfam07901 DUF1672 Protein of u 86.8 0.69 1.8E-05 25.5 3.1 21 1-21 1-21 (304) 24 PRK10760 murein hydrolase B; P 86.8 1.4 3.7E-05 23.5 4.8 23 1-24 1-23 (357) 25 pfam06474 MLTD_N MLTD_N. 86.2 1.2 3.1E-05 23.9 4.1 17 7-23 5-21 (93) 26 PRK10866 outer membrane protei 85.7 0.69 1.7E-05 25.5 2.7 22 1-22 1-24 (243) 27 PRK13883 conjugal transfer pro 85.3 0.58 1.5E-05 25.9 2.1 22 1-23 1-22 (156) 28 PRK09810 entericidin A; Provis 84.8 0.57 1.5E-05 26.0 1.9 21 1-21 2-22 (41) 29 COG3417 FlgN Collagen-binding 84.6 1.3 3.2E-05 23.8 3.6 20 2-21 4-23 (200) 30 PRK13616 lipoprotein LpqB; Pro 84.2 1.1 2.9E-05 24.1 3.3 22 3-24 7-28 (590) 31 TIGR03502 lipase_Pla1_cef extr 83.6 1.7 4.3E-05 23.0 3.9 20 1-20 1-21 (792) 32 PRK10598 hypothetical protein; 81.4 1.1 2.7E-05 24.3 2.2 20 1-20 1-21 (186) 33 COG4314 NosL Predicted lipopro 81.1 1.4 3.5E-05 23.6 2.7 23 1-23 1-23 (176) 34 COG4851 CamS Protein involved 81.1 1.2 3.1E-05 23.9 2.4 24 1-24 1-25 (382) 35 PRK00276 infA translation init 81.0 3.7 9.4E-05 20.9 4.9 57 153-214 8-70 (72) 36 COG4764 Uncharacterized protei 80.8 1 2.6E-05 24.4 1.9 24 1-26 5-28 (197) 37 COG3065 Slp Starvation-inducib 80.6 3 7.6E-05 21.4 4.3 58 3-87 8-66 (191) 38 PRK11138 outer membrane protei 80.5 3.4 8.7E-05 21.1 4.5 19 2-20 4-22 (394) 39 PRK11459 multidrug resistance 80.3 2.5 6.5E-05 21.9 3.8 20 1-20 1-24 (478) 40 pfam08139 LPAM_1 Prokaryotic m 80.2 1.1 2.8E-05 24.1 2.0 21 1-22 1-21 (26) 41 pfam07273 DUF1439 Protein of u 79.6 1.3 3.4E-05 23.6 2.2 20 1-20 1-20 (177) 42 pfam06572 DUF1131 Protein of u 79.4 1 2.6E-05 24.4 1.6 20 1-20 1-20 (192) 43 PRK09859 multidrug efflux syst 78.8 1.8 4.5E-05 22.9 2.6 20 1-20 1-23 (385) 44 PRK11067 outer membrane protei 78.8 1.1 2.8E-05 24.2 1.5 32 61-95 64-95 (801) 45 PRK10672 rare lipoprotein A; P 78.4 4.8 0.00012 20.2 4.7 25 151-175 161-185 (369) 46 PRK09578 periplasmic multidrug 78.0 2 5.2E-05 22.5 2.7 20 2-21 7-26 (385) 47 TIGR02887 spore_ger_x_C germin 77.6 1.4 3.5E-05 23.6 1.8 21 1-21 1-21 (400) 48 TIGR00752 slp outer membrane l 76.5 5.4 0.00014 19.8 4.6 62 4-91 4-66 (190) 49 COG5633 Predicted periplasmic 75.4 1.7 4.4E-05 23.0 1.8 22 1-23 1-22 (123) 50 PRK11189 lipoprotein NlpI; Pro 74.4 3.1 7.9E-05 21.3 2.9 20 1-20 1-22 (297) 51 COG5510 Predicted small secret 73.6 2.4 6.1E-05 22.1 2.1 20 1-20 2-24 (44) 52 PRK10718 hypothetical protein; 73.3 1.3 3.2E-05 23.8 0.7 21 1-21 1-21 (191) 53 COG3009 Uncharacterized protei 73.2 3.8 9.8E-05 20.8 3.1 29 152-180 115-145 (190) 54 CHL00010 infA translation init 73.1 5.7 0.00015 19.7 4.0 59 153-216 8-72 (78) 55 TIGR03300 assembly_YfgL outer 73.1 3.6 9.1E-05 20.9 2.9 16 5-20 3-18 (377) 56 COG5645 Predicted periplasmic 72.6 1.7 4.4E-05 23.0 1.2 26 1-27 1-26 (80) 57 PRK02944 OxaA-like protein pre 72.2 4.1 0.0001 20.6 3.1 21 1-21 1-24 (255) 58 PRK10781 rcsF outer membrane l 72.1 6.6 0.00017 19.3 4.1 19 1-21 1-19 (133) 59 PRK11443 hypothetical protein; 71.5 3.2 8.2E-05 21.2 2.4 18 1-20 1-18 (120) 60 PRK13733 conjugal transfer pro 71.3 0.42 1.1E-05 26.8 -2.1 31 173-203 127-157 (171) 61 cd04451 S1_IF1 S1_IF1: Transla 71.2 8 0.0002 18.8 4.7 39 153-196 2-40 (64) 62 COG5461 Type IV pili component 71.0 5.4 0.00014 19.8 3.5 20 71-90 74-93 (224) 63 PRK10783 mltD membrane-bound l 68.6 9.1 0.00023 18.4 4.5 17 8-24 6-22 (449) 64 PRK11548 hypothetical protein; 68.2 3.3 8.5E-05 21.1 1.9 21 1-21 3-23 (113) 65 pfam12262 Lipase_bact_N Bacter 68.0 7.4 0.00019 18.9 3.6 21 1-21 1-22 (269) 66 pfam11153 DUF2931 Protein of u 67.8 3.6 9.1E-05 21.0 2.0 19 152-170 129-147 (202) 67 PRK13731 conjugal transfer sur 67.6 3 7.6E-05 21.5 1.5 21 1-21 3-24 (243) 68 pfam08085 Entericidin Enterici 67.3 3.9 9.9E-05 20.7 2.1 20 1-20 1-23 (42) 69 PRK11251 DNA-binding transcrip 66.1 5 0.00013 20.0 2.5 23 1-23 1-23 (109) 70 PRK12450 foldase protein PrsA; 65.5 3.9 1E-04 20.7 1.8 21 1-21 4-26 (309) 71 TIGR03352 VI_chp_3 type VI sec 64.9 3.3 8.3E-05 21.2 1.3 15 7-21 5-19 (146) 72 TIGR02522 pilus_cpaD pilus (Ca 64.6 7.5 0.00019 18.9 3.1 79 6-90 2-85 (211) 73 TIGR03302 OM_YfiO outer membra 64.0 5.5 0.00014 19.8 2.4 19 4-22 3-21 (235) 74 PRK11023 hypothetical protein; 63.7 4.7 0.00012 20.2 2.0 21 1-21 1-22 (191) 75 PRK12442 translation initiatio 63.5 11 0.00029 17.8 4.7 63 152-219 7-75 (87) 76 PRK06481 fumarate reductase fl 63.4 11 0.00029 17.8 5.1 41 149-190 203-243 (506) 77 PRK10722 hypothetical protein; 63.2 5.9 0.00015 19.6 2.4 20 4-23 17-36 (248) 78 PRK06033 hypothetical protein; 61.9 12 0.00031 17.6 5.2 32 158-191 50-81 (87) 79 TIGR02269 TIGR02269 Myxococcus 61.7 5.7 0.00015 19.7 2.1 19 6-24 2-20 (228) 80 TIGR02062 RNase_B exoribonucle 60.5 4.6 0.00012 20.3 1.4 25 73-97 635-659 (664) 81 TIGR03525 GldK gliding motilit 60.1 6.3 0.00016 19.4 2.1 20 1-21 1-20 (449) 82 PRK03095 prsA peptidylprolyl i 60.0 7.4 0.00019 19.0 2.4 20 1-20 1-21 (287) 83 TIGR02544 III_secr_YscJ type I 60.0 5.8 0.00015 19.6 1.9 18 148-165 125-142 (203) 84 COG3017 LolB Outer membrane li 59.8 7.1 0.00018 19.1 2.3 21 1-21 4-25 (206) 85 TIGR03042 PS_II_psbQ_bact phot 59.7 9.2 0.00023 18.4 2.9 22 1-22 1-23 (142) 86 PRK01326 prsA foldase protein 58.8 8.2 0.00021 18.7 2.5 20 1-20 1-23 (310) 87 pfam07269 consensus 58.8 6.2 0.00016 19.4 1.9 19 1-22 1-19 (55) 88 PRK10802 peptidoglycan-associa 58.4 7.7 0.0002 18.9 2.3 14 10-23 14-27 (173) 89 PRK02463 OxaA-like protein pre 58.2 10 0.00026 18.1 2.9 14 8-21 14-27 (307) 90 PRK10533 putative lipoprotein; 57.9 7.4 0.00019 19.0 2.1 21 1-21 1-22 (171) 91 pfam05590 DUF769 Xylella fasti 57.9 12 0.00032 17.5 3.3 18 3-20 6-23 (279) 92 pfam09533 DUF2380 Predicted li 56.9 11 0.00028 17.9 2.8 23 1-23 1-24 (220) 93 pfam06604 consensus 56.3 15 0.00039 17.0 4.8 17 4-20 7-23 (181) 94 PRK10523 lipoprotein involved 56.1 11 0.00029 17.8 2.8 24 1-24 2-27 (234) 95 TIGR02238 recomb_DMC1 meiotic 56.1 1.9 4.9E-05 22.7 -1.1 31 203-235 199-229 (314) 96 pfam06788 UPF0257 Uncharacteri 55.5 7.9 0.0002 18.8 1.9 21 1-22 1-21 (236) 97 COG3126 Uncharacterized protei 54.8 16 0.00041 16.8 8.9 83 1-105 1-84 (158) 98 PRK11372 lysozyme inhibitor; P 54.8 8.9 0.00023 18.5 2.1 18 1-20 3-20 (109) 99 PRK10510 putative outer membra 54.0 9.3 0.00024 18.3 2.1 20 1-20 1-23 (219) 100 PRK10397 hypothetical protein; 53.7 8.9 0.00023 18.5 2.0 18 1-20 1-18 (137) 101 PRK11671 mltC murein transglyc 53.1 11 0.00028 17.9 2.4 25 1-25 1-25 (360) 102 PRK13835 conjugal transfer pro 52.9 15 0.00038 17.1 3.0 21 1-22 1-21 (144) 103 COG3521 Predicted component of 52.8 17 0.00044 16.6 3.7 22 1-22 1-24 (159) 104 pfam10671 TcpQ Toxin co-regula 52.6 4.5 0.00011 20.4 0.3 22 1-22 1-22 (178) 105 COG4380 Uncharacterized protei 52.4 15 0.00039 17.0 3.0 22 3-24 2-23 (216) 106 TIGR02521 type_IV_pilW type IV 51.5 11 0.00029 17.8 2.2 15 8-22 2-16 (247) 107 PRK02939 hypothetical protein; 51.3 14 0.00036 17.2 2.7 22 1-23 1-22 (236) 108 COG3218 ABC-type uncharacteriz 51.3 17 0.00044 16.7 3.1 22 197-218 157-178 (205) 109 pfam02402 Lysis_col Lysis prot 51.0 8.8 0.00023 18.5 1.6 20 1-20 1-21 (46) 110 COG4607 CeuA ABC-type enteroch 50.6 13 0.00033 17.4 2.4 22 1-22 2-23 (320) 111 PRK04405 prsA peptidylprolyl i 50.4 11 0.00027 17.9 1.9 21 1-21 5-27 (298) 112 PRK13613 lipoprotein LpqB; Pro 50.3 14 0.00035 17.2 2.5 20 3-22 2-21 (603) 113 PRK11162 mltA murein transglyc 50.2 19 0.00048 16.4 3.9 30 154-183 164-193 (362) 114 COG4785 NlpI Lipoprotein NlpI, 49.9 16 0.00042 16.8 2.8 26 1-26 1-27 (297) 115 TIGR02898 spore_YhcN_YlaJ spor 49.7 9.8 0.00025 18.2 1.6 18 5-22 2-21 (185) 116 PRK10838 spr putative outer me 48.6 19 0.00048 16.4 3.0 18 6-23 14-31 (188) 117 COG3168 PilP Tfp pilus assembl 48.5 20 0.00051 16.2 4.6 39 137-181 97-135 (170) 118 PRK03002 prsA peptidylprolyl i 48.4 16 0.0004 16.9 2.5 20 1-20 1-23 (285) 119 PRK13861 type IV secretion sys 48.2 17 0.00043 16.7 2.7 18 1-19 1-18 (293) 120 PRK13614 lipoprotein LpqB; Pro 48.0 21 0.00052 16.2 4.1 15 7-21 13-27 (573) 121 TIGR03431 PhnD phosphonate ABC 47.4 15 0.00038 17.0 2.3 24 1-24 1-24 (288) 122 pfam10447 EXOSC1 Exosome compo 46.8 21 0.00054 16.1 5.1 42 153-199 7-49 (72) 123 pfam06316 Ail_Lom Enterobacter 46.8 17 0.00043 16.7 2.5 22 1-22 1-22 (199) 124 PRK11260 cystine transporter s 46.5 15 0.00039 17.0 2.3 20 1-20 4-23 (264) 125 COG3015 CutF Uncharacterized l 45.9 14 0.00036 17.2 2.0 23 1-23 2-24 (178) 126 COG0361 InfA Translation initi 45.7 22 0.00057 15.9 4.4 59 151-214 6-70 (75) 127 COG3470 Tpd Uncharacterized pr 45.5 21 0.00053 16.1 2.8 21 1-21 3-23 (179) 128 COG4669 EscJ Type III secretor 45.4 14 0.00035 17.2 1.9 17 148-164 119-135 (246) 129 pfam11839 DUF3359 Protein of u 44.6 16 0.0004 16.9 2.0 23 1-23 1-25 (97) 130 pfam01298 Lipoprotein_5 Transf 44.6 23 0.00059 15.8 5.9 10 11-20 13-22 (589) 131 TIGR03516 ppisom_GldI peptidyl 44.2 16 0.0004 16.9 2.0 19 75-93 83-101 (177) 132 PRK05698 fliN flagellar motor 43.6 24 0.00061 15.7 5.5 35 158-194 115-149 (155) 133 cd05695 S1_Rrp5_repeat_hs3 S1_ 43.6 24 0.00061 15.7 6.7 49 154-215 4-57 (66) 134 pfam12046 DUF3529 Protein of u 43.5 13 0.00033 17.4 1.5 28 144-173 130-157 (173) 135 COG3133 SlyB Outer membrane li 43.1 15 0.00038 17.0 1.7 21 1-21 1-22 (154) 136 COG2980 RlpB Rare lipoprotein 43.1 22 0.00057 15.9 2.7 28 1-28 1-29 (178) 137 TIGR01845 outer_NodT efflux tr 42.8 19 0.00049 16.4 2.3 11 10-20 2-12 (477) 138 TIGR02567 YscW type III secret 42.7 7.1 0.00018 19.1 0.1 13 9-21 2-14 (131) 139 PRK10796 LPS-assembly lipoprot 42.3 22 0.00057 15.9 2.5 25 1-25 1-26 (196) 140 COG4594 FecB ABC-type Fe3+-cit 41.8 21 0.00053 16.1 2.3 18 4-21 8-25 (310) 141 COG4238 Murein lipoprotein [Ce 40.6 19 0.00047 16.4 1.9 18 4-21 7-24 (78) 142 pfam11873 DUF3393 Domain of un 40.4 22 0.00055 16.0 2.2 13 10-22 8-20 (200) 143 COG2834 LolA Outer membrane li 38.9 19 0.00049 16.3 1.8 21 2-22 6-26 (211) 144 PRK11530 hypothetical protein; 38.8 17 0.00043 16.7 1.5 18 4-21 3-20 (176) 145 PRK10175 hypothetical protein; 38.2 16 0.00041 16.8 1.3 20 7-26 5-24 (75) 146 TIGR01781 Trep_dent_lipo Trepo 38.0 24 0.00061 15.8 2.1 21 3-24 7-27 (463) 147 TIGR01433 CyoA ubiquinol oxida 37.7 14 0.00036 17.2 0.9 13 7-19 2-14 (228) 148 PRK08983 fliN flagellar motor 37.5 30 0.00076 15.2 5.6 35 158-194 86-120 (126) 149 TIGR01004 PulS_OutS lipoprotei 37.2 30 0.00077 15.1 2.7 15 6-20 12-26 (136) 150 PRK10957 iron-enterobactin tra 37.0 24 0.00062 15.7 2.0 19 3-21 6-24 (319) 151 COG2143 Thioredoxin-related pr 36.7 29 0.00075 15.2 2.5 24 1-24 1-25 (182) 152 PRK01622 OxaA-like protein pre 36.5 31 0.00079 15.0 2.9 13 8-20 13-25 (266) 153 TIGR03659 IsdE heme ABC transp 36.4 28 0.00071 15.3 2.3 17 7-23 8-24 (289) 154 pfam10566 Glyco_hydro_97 Glyco 36.2 26 0.00066 15.5 2.1 16 1-16 1-16 (643) 155 PRK07963 fliN flagellar motor 36.1 31 0.0008 15.0 5.6 35 158-194 96-130 (137) 156 PRK11281 potassium efflux prot 35.9 18 0.00047 16.5 1.3 13 75-87 935-947 (1107) 157 PRK10929 hypothetical protein; 35.9 18 0.00047 16.5 1.3 12 75-86 933-944 (1109) 158 COG5567 Predicted small peripl 35.8 32 0.00081 15.0 4.5 20 1-20 1-22 (58) 159 PRK13684 Ycf48-like protein; P 35.7 32 0.00081 15.0 2.9 16 5-20 11-26 (333) 160 smart00108 B_lectin Bulb-type 34.8 25 0.00063 15.7 1.8 18 156-173 87-104 (114) 161 TIGR03511 GldH_lipo gliding mo 34.8 16 0.00041 16.8 0.9 19 1-19 3-21 (156) 162 PRK11616 hypothetical protein; 34.8 31 0.00078 15.1 2.3 16 13-28 15-30 (109) 163 PRK10936 periplasmic sensory p 34.7 33 0.00084 14.9 3.7 24 1-24 1-24 (340) 164 pfam09158 MotCF Bacteriophage 34.5 33 0.00085 14.8 2.9 20 156-175 43-62 (103) 165 COG1021 EntE Peptide arylation 33.7 24 0.00061 15.8 1.6 33 153-185 416-448 (542) 166 COG2935 Putative arginyl-tRNA: 32.8 27 0.0007 15.4 1.8 17 153-169 162-178 (253) 167 COG4259 Uncharacterized protei 32.7 36 0.00091 14.7 2.6 20 1-20 1-24 (121) 168 COG1729 Uncharacterized protei 32.3 36 0.00092 14.6 2.5 23 1-23 1-23 (262) 169 PRK02889 tolB translocation pr 32.1 27 0.00069 15.4 1.6 20 1-20 4-23 (430) 170 COG4143 TbpA ABC-type thiamine 31.2 38 0.00096 14.5 2.3 19 1-19 1-19 (336) 171 TIGR01067 rplN_bact ribosomal 30.8 37 0.00094 14.6 2.1 34 74-107 29-62 (128) 172 PRK09973 putative outer membra 30.7 27 0.00069 15.4 1.5 20 1-20 1-22 (85) 173 PRK10871 nlpD lipoprotein NlpD 30.7 37 0.00095 14.5 2.2 48 148-202 285-336 (374) 174 cd00028 B_lectin Bulb-type man 30.7 35 0.00088 14.7 2.0 17 156-172 88-104 (116) 175 pfam09676 TraV Type IV conjuga 30.3 19 0.00048 16.4 0.6 20 161-180 113-132 (134) 176 PRK04792 tolB translocation pr 30.1 38 0.00097 14.5 2.1 19 1-19 1-20 (450) 177 TIGR01645 half-pint poly-U bin 30.0 18 0.00046 16.5 0.4 34 165-198 667-707 (791) 178 COG3317 NlpB Uncharacterized l 29.8 40 0.001 14.4 6.8 25 70-94 85-109 (342) 179 cd05734 Ig7_DSCAM Seventh immu 29.6 34 0.00086 14.8 1.8 14 159-172 38-51 (79) 180 COG4939 Major membrane immunog 29.4 38 0.00096 14.5 2.0 21 1-21 1-22 (147) 181 PRK10477 outer membrane lipopr 29.1 41 0.001 14.3 3.2 17 5-21 6-22 (177) 182 PRK10959 outer membrane protei 29.0 41 0.001 14.3 2.1 18 1-18 1-18 (212) 183 PRK09495 glnH glutamine ABC tr 28.9 40 0.001 14.3 2.1 19 1-19 1-19 (247) 184 pfam01203 GSPII_N Bacterial ty 28.7 26 0.00067 15.5 1.1 17 52-68 36-52 (251) 185 pfam03866 HAP Hydrophobic abun 28.3 42 0.0011 14.2 2.2 19 1-20 1-19 (167) 186 PHA00407 phage lambda Rz1-like 28.1 42 0.0011 14.2 2.7 15 8-22 39-53 (84) 187 TIGR02134 transald_staph trans 27.6 21 0.00053 16.1 0.4 25 61-86 173-198 (237) 188 pfam07119 DUF1375 Protein of u 26.8 28 0.00072 15.3 1.0 19 10-28 3-21 (70) 189 PRK02998 prsA peptidylprolyl i 26.7 45 0.0011 14.0 2.5 20 1-20 1-24 (283) 190 KOG1923 consensus 26.6 45 0.0011 14.0 4.8 33 10-42 264-296 (830) 191 COG3107 LppC Putative lipoprot 26.3 46 0.0012 14.0 2.1 15 8-22 17-31 (604) 192 PRK10554 outer membrane porin 26.1 46 0.0012 14.0 2.0 20 1-20 1-20 (373) 193 PRK03757 hypothetical protein; 26.0 46 0.0012 13.9 2.2 15 1-15 1-15 (191) 194 pfam01315 Ald_Xan_dh_C Aldehyd 26.0 46 0.0012 13.9 3.0 31 68-98 69-99 (111) 195 PRK08190 bifunctional enoyl-Co 25.8 47 0.0012 13.9 2.5 20 153-172 103-122 (465) 196 pfam07241 consensus 25.7 47 0.0012 13.9 3.6 30 162-191 53-82 (87) 197 TIGR00008 infA translation ini 25.7 47 0.0012 13.9 4.0 39 153-196 6-44 (69) 198 PRK09752 adhesin; Provisional 25.5 39 0.00099 14.4 1.5 22 59-80 957-980 (1250) 199 PRK11063 metQ DL-methionine tr 25.5 47 0.0012 13.9 2.2 15 8-22 13-27 (271) 200 TIGR03044 PS_II_psb27 photosys 24.9 40 0.001 14.3 1.5 16 5-20 12-27 (135) 201 pfam06924 DUF1281 Protein of u 24.9 42 0.0011 14.2 1.6 15 10-24 32-46 (134) 202 PRK11669 pbpG D-alanyl-D-alani 24.9 49 0.0012 13.8 3.4 19 3-21 2-20 (308) 203 PRK10525 cytochrome o ubiquino 24.8 43 0.0011 14.1 1.6 13 7-19 14-26 (305) 204 pfam11353 DUF3153 Protein of u 24.4 27 0.00069 15.4 0.5 19 7-25 3-21 (210) 205 PRK10002 outer membrane protei 24.2 50 0.0013 13.7 2.0 19 1-19 2-20 (362) 206 PRK06687 chlorohydrolase; Vali 24.1 37 0.00094 14.5 1.2 15 77-91 205-219 (422) 207 COG1462 CsgG Uncharacterized p 23.5 52 0.0013 13.6 4.7 28 136-163 139-166 (252) 208 COG4880 Secreted protein conta 23.3 52 0.0013 13.6 2.5 49 158-208 182-250 (603) 209 PRK10461 thiamine biosynthesis 23.3 52 0.0013 13.6 2.6 17 5-21 7-23 (351) 210 PRK10759 hypothetical protein; 23.1 40 0.001 14.3 1.2 13 7-19 4-16 (106) 211 pfam06085 Rz1 Lipoprotein Rz1 22.9 53 0.0014 13.6 3.4 21 1-21 1-23 (59) 212 KOG1193 consensus 22.8 53 0.0014 13.6 1.8 29 139-167 302-331 (511) 213 pfam09619 YscW Type III secret 22.4 28 0.00073 15.3 0.3 15 9-23 2-16 (124) 214 TIGR02917 PEP_TPR_lipo putativ 22.3 54 0.0014 13.5 2.3 18 4-21 7-26 (924) 215 pfam10368 YkyA Putative cell-w 21.9 37 0.00094 14.6 0.8 11 10-20 3-13 (205) 216 PRK11479 hypothetical protein; 21.7 56 0.0014 13.4 4.6 20 5-24 7-26 (273) 217 pfam05968 Bacillus_PapR Bacill 21.6 56 0.0014 13.4 2.1 19 1-19 1-19 (48) 218 PRK11679 lipoprotein; Provisio 21.4 57 0.0015 13.4 4.4 21 70-90 90-110 (345) 219 PRK11186 carboxy-terminal prot 21.3 57 0.0015 13.4 2.2 32 130-165 237-268 (673) 220 PHA00144 major head protein 21.1 57 0.0015 13.3 4.3 105 68-177 329-436 (438) 221 TIGR01935 NOT-MenG RraA family 21.0 58 0.0015 13.3 2.0 31 69-99 21-53 (155) 222 PRK06760 hypothetical protein; 20.9 58 0.0015 13.3 1.7 38 69-108 56-93 (223) 223 PRK13697 cytochrome c6; Provis 20.9 58 0.0015 13.3 2.3 19 1-19 1-19 (111) 224 pfam03304 Mlp Mlp lipoprotein 20.9 58 0.0015 13.3 1.7 19 3-21 2-21 (150) 225 pfam10023 DUF2265 Predicted am 20.5 43 0.0011 14.1 0.9 14 7-20 2-15 (338) 226 COG4775 Outer membrane protein 20.4 57 0.0015 13.3 1.5 28 69-96 69-96 (766) 227 TIGR02236 recomb_radA DNA repa 20.0 17 0.00043 16.7 -1.2 22 204-225 223-244 (333) No 1 >PRK12701 flgH flagellar basal body L-ring protein; Reviewed Probab=100.00 E-value=0 Score=465.48 Aligned_cols=227 Identities=28% Similarity=0.394 Sum_probs=195.7 Q ss_pred CCHHHHHHHHHH-HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCC Q ss_conf 911799999999-99854336720025687536677776644445644555444444455761215775-5211232236 Q gi|254780374|r 1 MYRYSFIVLCYS-SLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRDSH-AALFKDSRAL 78 (238) Q Consensus 1 M~r~~l~~l~~~-~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~~-~~lf~D~RA~ 78 (238) |+|+.+++.|++ ++|+||++..++ .+.+.|.|+|++++.+.+..+.+.+||||+.++ .+||+|+||+ T Consensus 1 m~r~~~~~~~la~~lL~GC~~~~~~-----------~p~~~p~~aP~~p~~p~p~~~~~~~GSl~~~~~~~~lF~D~RA~ 69 (230) T PRK12701 1 MNRLNIAVSCLATALLFGCEALHPP-----------APGDNPDYAPTYPVTPDPKELRKVSGAIYSSETALPLFETPRAR 69 (230) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCCC-----------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 9258999999999997556688999-----------99999886776888889765778997530378876442555336 Q ss_pred CCCCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHEECEEEEEEEEEEEE Q ss_conf 76876999997241010000122234565445554333455445566655465555543211000000014899999999 Q gi|254780374|r 79 NVGDILTVDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQTPQMNGNLNYDGGGASSGKGSISRAEKLNLLIAAIV 158 (238) Q Consensus 79 ~VGDIiTV~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~G~~~~~~~l~~~isa~V 158 (238) +|||||||+|.|+++|++++++..+|+++.+...+.+++.....+......++.+++.+|+|+|+++|+++|+++|+|+| T Consensus 70 ~VGDilTV~i~E~t~Asks~~t~~~k~~s~~~~~~~~~g~~~~~~~~~~~~~~~~s~~~f~G~G~~~qs~~l~gtIta~V 149 (230) T PRK12701 70 HPGDILTVFLIEKTDAQKNATTTQRKNDTTKITNKLFLGRPISLGSGYSMDFDLDNQRQFNGEGRSIQNNKLAGSISVTV 149 (230) T ss_pred CCCCEEEEEEEECCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEE T ss_conf 78987999999733433311323254565445653224654324566654446666543556430102214789999999 Q ss_pred EEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEEEECEEEECCCCCCCHHHHHHHH-HCC Q ss_conf 9980797289978877984188799999979847767899738242055439998020331233486668999998-558 Q gi|254780374|r 159 TAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARISYGGKGRTTELLRPPIGHQLIEN-LSP 237 (238) Q Consensus 159 ~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~y~g~G~~~d~~~~gwl~r~~~~-i~P 237 (238) ++|||||||+|+|+|||+||+|.|+|+|+|||||+||++||||.|+|||||||+|.|+|.++|+|+||||+|||.. +|| T Consensus 150 ~~VLpNGNLvI~GeK~v~vN~e~e~i~lsGiVRP~DI~~~NtV~S~~IAdArI~Y~G~G~l~d~~~~GWltr~f~s~~~P 229 (230) T PRK12701 150 AKVLANGNMVVQGEKWVRINQGNEFVRLSGIVRPQDIKADNTITSDRIANARISYGGTGQINNTNAQGWLSRILWGPLFP 229 (230) T ss_pred EEECCCCCEEEEEEEEEEECCCEEEEEEEEEECHHHCCCCCEECCCEEEEEEEEECCEECCCCCCCCCHHHHHHCCCCCC T ss_conf 99866997999998999987988999999998878879998697102243799982202205342559899975435588 Q ss_pred C Q ss_conf 9 Q gi|254780374|r 238 L 238 (238) Q Consensus 238 l 238 (238) - T Consensus 230 ~ 230 (230) T PRK12701 230 T 230 (230) T ss_pred C T ss_conf 9 No 2 >PRK00249 flgH flagellar basal body L-ring protein; Reviewed Probab=100.00 E-value=0 Score=464.04 Aligned_cols=227 Identities=37% Similarity=0.546 Sum_probs=196.6 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 79999999999854336720025687536677776644445644555444444455761215775521123223676876 Q gi|254780374|r 4 YSFIVLCYSSLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRDSHAALFKDSRALNVGDI 83 (238) Q Consensus 4 ~~l~~l~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~~~~lf~D~RA~~VGDI 83 (238) ++.++++++++|+||++..++ ...|.++|+.++.+.+.+.+. ++..+..++||||+.++++||+|+||++|||| T Consensus 5 ~~~~~~~~~~~LsgCa~~~~~-~~~p~~~p~~~~~~~p~~~~~-----~~~Gsiy~~~sl~~~~~~~Lf~D~RA~~VGDi 78 (231) T PRK00249 5 LIALALLLLLLLSGCASIPPK-PLIPALAPPTSPTPLPGYAPV-----EPNGSIFQAGSLWNYGYRPLFKDRRARNVGDI 78 (231) T ss_pred HHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCCCCCCCCCCCCC-----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE T ss_conf 999999999998753489998-888776898888788876777-----89887457645546555635123443789996 Q ss_pred EEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHEECEEEEEEEEEEEEEEECC Q ss_conf 99999724101000012223456544555433345544556665546555554321100000001489999999999807 Q gi|254780374|r 84 LTVDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQTPQMNGNLNYDGGGASSGKGSISRAEKLNLLIAAIVTAILE 163 (238) Q Consensus 84 iTV~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~G~~~~~~~l~~~isa~V~~Vlp 163 (238) |||+|+|+++|+++++++.+|+++.+.+++.+++..... .....++.+++++|+|+|+++|+++|+++|+|+|++||| T Consensus 79 iTV~i~E~~~As~~~~t~~~r~~~~~~~~~~~~g~~~~~--~~~~~~~~~~~~~f~G~G~~~~s~~ltgtIta~V~~Vlp 156 (231) T PRK00249 79 LTVVLQENTSASKSNSTNRSRDGSESFGLPSLFGYLQGG--DANASLETSSDNSFKGKGSANRSNTLSGTITVTVTQVLP 156 (231) T ss_pred EEEEEEECCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEEEEEECC T ss_conf 999999704011035643144565454774325665666--665443345553135654111001589999999999878 Q ss_pred CCCEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEEEECEEEECCCCCCCHHHHHHHHHCCC Q ss_conf 972899788779841887999999798477678997382420554399980203312334866689999985589 Q gi|254780374|r 164 NGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARISYGGKGRTTELLRPPIGHQLIENLSPL 238 (238) Q Consensus 164 NGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~y~g~G~~~d~~~~gwl~r~~~~i~Pl 238 (238) ||||+|+|+|+|+||+|.|+|+|+|||||+||+++|+|.|+|||||||+|.|+|.++|+|+||||+|||+++||. T Consensus 157 NGnL~I~G~K~v~vN~e~e~i~~sGvVRP~DI~~~NtV~S~~IAdArI~Y~G~G~l~d~~~~gWl~r~~~~i~PF 231 (231) T PRK00249 157 NGNLVIRGEKEVRVNQGDEFLRVSGVVRPRDISGDNTISSTRIADARISYGGRGAISDAQQPGWGQRFFDSLSPF 231 (231) T ss_pred CCCEEEEEEEEEEECCCEEEEEEEEEECHHHCCCCCEECCCEEEEEEEEECCCCCCCCCCCCCHHHHHHHHCCCC T ss_conf 996899999999989988999999999978889997596104244799980201315233679999998861899 No 3 >PRK12696 flgH flagellar basal body L-ring protein; Reviewed Probab=100.00 E-value=0 Score=464.55 Aligned_cols=225 Identities=32% Similarity=0.440 Sum_probs=192.5 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCC Q ss_conf 91179999999999854336720025687536677776644445644555444444455761215775-52112322367 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRDSH-AALFKDSRALN 79 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~~-~~lf~D~RA~~ 79 (238) |+|.+++.+|++++|+||++..+.....|.+.| +++++.|+....++||||++++ .+||+|+||++ T Consensus 1 mm~~~l~~~~~~~~L~GC~~~~~~~~~~~~~~p-------------p~~~~~p~~~~~~~GSl~~~~~~~~LF~D~RA~~ 67 (238) T PRK12696 1 MIRKLLAASCAVLLLSGCNAARQQPSPLPPVTP-------------PQAYVEPEDAAANPGSLFSDAEADLLFSDSRARR 67 (238) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC-------------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 958999999999995165688888888899889-------------9776676667899986455898775535553356 Q ss_pred CCCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC------------CCCCCCCCCCCCCCCCCCHHEECE Q ss_conf 68769999972410100001222345654455543334554455------------666554655555432110000000 Q gi|254780374|r 80 VGDILTVDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQT------------PQMNGNLNYDGGGASSGKGSISRA 147 (238) Q Consensus 80 VGDIiTV~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~g~G~~~~~ 147 (238) |||||||+|.|+++|+++++++.+|+++.+.+++++||...... ....+.++.+++++|+|+|+++|+ T Consensus 68 VGDilTV~i~E~t~A~~sa~t~~~r~ss~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~f~G~G~~~~s 147 (238) T PRK12696 68 VGDIVLVKIVENAKAKNKADTTADKDSTNEYGVSAFFGQSSASINPMNPVGAFGGKVGTNPLLQTSSKSKLSGTGETKRE 147 (238) T ss_pred CCCEEEEEEEEECHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEC T ss_conf 89869999998145432036441445565547641136543234655444445666565422234554434565301001 Q ss_pred EEEEEEEEEEEEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEEEECEEEECCCCCCCH Q ss_conf 14899999999998079728997887798418879999997984776789973824205543999802033123348666 Q gi|254780374|r 148 EKLNLLIAAIVTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARISYGGKGRTTELLRPPI 227 (238) Q Consensus 148 ~~l~~~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~y~g~G~~~d~~~~gw 227 (238) ++|+++|+|+|++|||||||+|+|+|+|++|+|.|+|+|+|||||+||+++|+|.|+|||||||+|.|+|.++|+|+||| T Consensus 148 ~~lt~tIaa~Vv~VLpNGnL~I~G~k~v~vn~~~~~i~~sGiVrp~DI~~~NtV~S~~iAdarI~y~G~G~l~d~q~~gW 227 (238) T PRK12696 148 NTVTATIAARVLRVMPGGLLQVEGARETRVNDETQYIVVSGLVRSRDVASDNSVMSTQMADARIAYYGKGVLADKQRPGW 227 (238) T ss_pred CEEEEEEEEEEEEECCCCCEEEEEEEEEEECCCEEEEEEEEEECHHHCCCCCEECCCEEEEEEEEECCCEEECCCCCCCH T ss_conf 35899999999998779959999989999889889999999988899799986961032547999812012364436699 Q ss_pred HHHHHHHHCCC Q ss_conf 89999985589 Q gi|254780374|r 228 GHQLIENLSPL 238 (238) Q Consensus 228 l~r~~~~i~Pl 238 (238) |+|||+++||. T Consensus 228 l~r~~~~v~PF 238 (238) T PRK12696 228 FTRLMDNLWPF 238 (238) T ss_pred HHHHHHHCCCC T ss_conf 99997620899 No 4 >PRK12788 flgH flagellar basal body L-ring protein; Reviewed Probab=100.00 E-value=0 Score=462.81 Aligned_cols=228 Identities=38% Similarity=0.624 Sum_probs=196.4 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 99999999985433672002568753667777664444564455544444445576121577552112322367687699 Q gi|254780374|r 6 FIVLCYSSLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRDSHAALFKDSRALNVGDILT 85 (238) Q Consensus 6 l~~l~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~~~~lf~D~RA~~VGDIiT 85 (238) +++++++++|+||+++++++++.|.++|+.++.... .+| .|..+........+|||+++ .+||+|+||++|||||| T Consensus 4 l~~~~a~l~LsGCa~~~~~~~~~P~~sPv~~~~~~~-~~~--~P~~p~~~~~~~~~Sl~~~~-~~LF~D~RA~~VGDIlT 79 (231) T PRK12788 4 LVAILACLALAGCANNLADALQAPRMSPIGQGLRAD-ARA--IPVEPRMRTPVSRRSLWDER-TDLYRDPRASRTGDLLT 79 (231) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC-CCC--CCCCCCCCCCCCCCCCHHCC-CCCCCCCCCCCCCCEEE T ss_conf 999999999764038743367899878888876666-777--78887777777778411145-53336664233798499 Q ss_pred EEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHEECEEEEEEEEEEEEEEECCCC Q ss_conf 99972410100001222345654455543334554455666554655555432110000000148999999999980797 Q gi|254780374|r 86 VDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQTPQMNGNLNYDGGGASSGKGSISRAEKLNLLIAAIVTAILENG 165 (238) Q Consensus 86 V~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~G~~~~~~~l~~~isa~V~~VlpNG 165 (238) |+|.|+++|+++++++++|+++.+.+++.+++.... ........+.+++.+|+|+|+++|+++|+++|+|+|++||||| T Consensus 80 V~i~e~~~As~~~~t~~sr~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~G~G~~~rs~~lt~tIaa~V~~VLPNG 158 (231) T PRK12788 80 VTISMNDKATLDNKTDRSRDSQTKFGLDYLAEFLPW-SDKGKLTANVSSQTSTKGTGKIDRTEEVKFSVAAIVVDVLPNG 158 (231) T ss_pred EEEEECCHHHHHCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEECCCC T ss_conf 999872432220355414345635475101013665-6666533345566434564211002437899999999987899 Q ss_pred CEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEEEECEEEECCCCCCCHHHHHHHHHCCC Q ss_conf 2899788779841887999999798477678997382420554399980203312334866689999985589 Q gi|254780374|r 166 NLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARISYGGKGRTTELLRPPIGHQLIENLSPL 238 (238) Q Consensus 166 nL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~y~g~G~~~d~~~~gwl~r~~~~i~Pl 238 (238) ||+|+|+|+|++|+|.|+|+|+|||||+||+++|+|.|+|||||||+|.|+|.++|+|+||||+|||++|||. T Consensus 159 NLvI~G~k~v~vN~e~e~i~vsGiVRP~DI~~~Ntv~s~~iAdarI~Y~G~G~l~d~q~pgW~~r~~~~i~PF 231 (231) T PRK12788 159 NLLISGSQEVRVNYEMRVLNVGGIVRPLDITRNNTIAYEKVAEARISYGGRGRLMEVQQPAWGHQVYDQVVPF 231 (231) T ss_pred CEEEEEEEEEEECCCEEEEEEEEEECHHHCCCCCEECCCEEEEEEEEECCEEEECCCCCCCHHHHHHHHCCCC T ss_conf 5899999999988986999999998879979997498212256899980203236222659999998760899 No 5 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=100.00 E-value=0 Score=456.18 Aligned_cols=221 Identities=26% Similarity=0.364 Sum_probs=184.9 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCC Q ss_conf 911799999999998543367200-25687536677776644445644555444444455761215775-5211232236 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISE-ITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRDSH-AALFKDSRAL 78 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~-~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~~-~~lf~D~RA~ 78 (238) |+|+++++++ ++|+||++..++ ....|. +.|+.+ ..++.+....||||++++ .+||+|+||+ T Consensus 1 M~r~~l~~~~--l~L~gCa~~~~~p~~~~p~------------~aP~~p--~~~~~~~~~~GSi~~~~~~~~lf~D~RA~ 64 (224) T PRK12698 1 MARYILLALA--LLLAGCSSTPKKPIADDPY------------YAPVYP--EAPPTKIAATGSIFQDSQANSLYSDIKAH 64 (224) T ss_pred CHHHHHHHHH--HHHHCCCCCCCCCCCCCCC------------CCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 9779999999--9983656889888888988------------678679--98876669997402588777542352104 Q ss_pred CCCCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHEECEEEEEEEEEEEE Q ss_conf 76876999997241010000122234565445554333455445566655465555543211000000014899999999 Q gi|254780374|r 79 NVGDILTVDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQTPQMNGNLNYDGGGASSGKGSISRAEKLNLLIAAIV 158 (238) Q Consensus 79 ~VGDIiTV~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~G~~~~~~~l~~~isa~V 158 (238) +|||||||+|+|+++|+++++++++|+++.+++.+.+++...... .....++.+++++|+|+|+++|+++|+++|+|+| T Consensus 65 ~VGDIvTV~i~E~~~A~~s~~t~~~r~~~~~~~~~~~~g~~~~~~-~~~~~~~~~~~~~f~G~G~~~~s~~ltgtIta~V 143 (224) T PRK12698 65 KVGDIITVVLKESTQAKKSANNEIKKGSDLSLDPIYAGGSNVSIG-GVPLDLGYKDSMNTKRESDADQSNSLSGSISANV 143 (224) T ss_pred CCCCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCEECCCEEEEEEEEEE T ss_conf 599879999987043321146400344553436652366432357-7654344456653456330101024899999999 Q ss_pred EEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEEEECEEEECCCCCCCHHHHHHH-HHCC Q ss_conf 998079728997887798418879999997984776789973824205543999802033123348666899999-8558 Q gi|254780374|r 159 TAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARISYGGKGRTTELLRPPIGHQLIE-NLSP 237 (238) Q Consensus 159 ~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~y~g~G~~~d~~~~gwl~r~~~-~i~P 237 (238) +||||||||+|+|+|+|+||+|.|+|+|+|||||+||++||+|.|+|||||||+|.|+|.++|+|+||||+|||+ .+|| T Consensus 144 ~~VlpNGNL~I~G~k~i~vN~~~e~i~~sGvVRP~DI~~~NtV~S~~IAdArI~Y~G~G~l~d~q~~GWl~r~f~~~~wP 223 (224) T PRK12698 144 MQVLPNGNLVIRGEKWISINNGDEFIRLTGIVRSQDITPDNTIDSTRVANARIQYSGTGTFADAQKVGWLSQFFMSDWWP 223 (224) T ss_pred EEECCCCCEEEEEEEEEEECCCEEEEEEEEEECHHHCCCCCEECCCEEEEEEEEECCEEEECCCCCCCHHHHHHCCCCCC T ss_conf 99866980999998999988987999999999878889998697002243799983224126332448889976305699 Q ss_pred C Q ss_conf 9 Q gi|254780374|r 238 L 238 (238) Q Consensus 238 l 238 (238) . T Consensus 224 F 224 (224) T PRK12698 224 F 224 (224) T ss_pred C T ss_conf 9 No 6 >PRK12407 flgH flagellar basal body L-ring protein; Reviewed Probab=100.00 E-value=0 Score=451.19 Aligned_cols=218 Identities=26% Similarity=0.352 Sum_probs=180.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCC Q ss_conf 117999999999985433672002568753667777664444564455544444445576121577-5521123223676 Q gi|254780374|r 2 YRYSFIVLCYSSLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRDS-HAALFKDSRALNV 80 (238) Q Consensus 2 ~r~~l~~l~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~-~~~lf~D~RA~~V 80 (238) +|+ +++.+++++|+||++....... +.+.+.|+. +..++.....+||||+.+ +.+||+|+||++| T Consensus 1 kr~-l~l~~~~l~LsGC~~~~~~~~~-----------~~~~~app~--~~~~p~~~~~~GSi~~~~~~~~lf~D~RA~~V 66 (220) T PRK12407 1 KRF-LILTPMVLALCGCESPALLVQK-----------DDAEFAPPA--NLIQPATVTEGGGLFQPAYSWSLLQDRRAYRI 66 (220) T ss_pred CCC-HHHHHHHHHHHCCCCCCCCCCC-----------CCCCCCCCC--CCCCCCCCCCCCCEEECCCCCCCCCCCCCCCC T ss_conf 915-7999999997365888877888-----------987878977--76778888999730007987755365544568 Q ss_pred CCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHEECEEEEEEEEEEEEEE Q ss_conf 87699999724101000012223456544555433345544556665546555554321100000001489999999999 Q gi|254780374|r 81 GDILTVDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQTPQMNGNLNYDGGGASSGKGSISRAEKLNLLIAAIVTA 160 (238) Q Consensus 81 GDIiTV~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~G~~~~~~~l~~~isa~V~~ 160 (238) ||||||+|.|+++|+++++++.+|+++.+.+++.+++.... . ...+.+++++|+|+|+++|+++|+++|+|+|++ T Consensus 67 GDIlTV~i~E~t~As~s~~t~~~r~~~~~~~~~~~~g~~~~---~--~~~~~~~~~~f~G~G~~~qs~~ltg~itvtV~~ 141 (220) T PRK12407 67 GDILTVILDESTQSSKQAKTNFGKKNDMSLGVPEVLGKKLN---K--FGGSISGKRDFDGSATSAQQNMLRGSITVAVHQ 141 (220) T ss_pred CCEEEEEEEECCHHHHHCCCCCCCCCCCCCCCCCHHCCCCC---C--CCCCCCCCCCCCCCEEEEEEEEEEEEEEEEEEE T ss_conf 98699999871443320254424456645466202044443---3--555444564346730220132588899999999 Q ss_pred ECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEEEECEEEECCCCCCCHHHHHHHH-HCCC Q ss_conf 80797289978877984188799999979847767899738242055439998020331233486668999998-5589 Q gi|254780374|r 161 ILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARISYGGKGRTTELLRPPIGHQLIEN-LSPL 238 (238) Q Consensus 161 VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~y~g~G~~~d~~~~gwl~r~~~~-i~Pl 238 (238) |||||||+|+|+|+|++|+|.|+|+|+|||||+||++||+|.|+|||||||+|.|+|.++|+|+||||+|||+. +||| T Consensus 142 VlpNGNLvI~G~K~v~vN~e~e~irvsGiVRP~DI~~dNtV~S~~iAdArI~Y~G~G~l~d~~~~GWlsrff~~~~~p~ 220 (220) T PRK12407 142 VLPNGVLVIRGEKWLTLNQGDEYMRVTGLVRADDVARDNSVSSQRIANARISYAGRGALSDANSAGWLTRFFNHPLFPI 220 (220) T ss_pred ECCCCCEEEEEEEEEEECCCEEEEEEEEEECHHHCCCCCEECCCEEEEEEEEEEEEEEECCCCCCCHHHHHHCCCCCCC T ss_conf 8789968999979999899879999999988788799985972132448999820151363425388899757675789 No 7 >PRK12700 flgH flagellar basal body L-ring protein; Reviewed Probab=100.00 E-value=0 Score=449.86 Aligned_cols=182 Identities=27% Similarity=0.408 Sum_probs=165.1 Q ss_pred CCCCCCCCCCCC---CCCCCCCCCCCCCCCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 445576121577---55211232236768769999972410100001222345654455543334554455666554655 Q gi|254780374|r 56 STKKSYSLWRDS---HAALFKDSRALNVGDILTVDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQTPQMNGNLNY 132 (238) Q Consensus 56 ~~~~~gSL~~~~---~~~lf~D~RA~~VGDIiTV~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (238) ....+||||+.+ +.+||+|+||++|||||||+|.|+++|+++++++.+|+++.+.+++..++...... .....++. T Consensus 46 ~~~~~Gsiyq~~~~g~~~Lf~D~RA~~VGDilTV~i~E~~~Ask~~~t~~~r~~s~~~~~~~~~~~~~~~~-~~~~~~~~ 124 (230) T PRK12700 46 SARPNGSIYQPSAYGNYPLFEDRRPRNVGDIVTIVLEEKTNAAKGVATNTSRDGSATLGVAAAPRFMDGVI-NDKLDTDI 124 (230) T ss_pred CCCCCCCCCCCCCCCCCHHHCCCCCCCCCCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCC T ss_conf 78999643357666754000666447899869999987344322045543545663436555665456655-66556565 Q ss_pred CCCCCCCCCHHEECEEEEEEEEEEEEEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEE Q ss_conf 55543211000000014899999999998079728997887798418879999997984776789973824205543999 Q gi|254780374|r 133 DGGGASSGKGSISRAEKLNLLIAAIVTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARIS 212 (238) Q Consensus 133 ~~~~~~~g~G~~~~~~~l~~~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~ 212 (238) +++++|+|+|+++|+++|+++|+|+|+||||||||+|+|||||++|+|.|+|+|+|||||+||+++|+|.|+|||||||+ T Consensus 125 ~~~~~f~G~G~~~~sn~ltgtItvtV~~VlpNGNL~I~GeK~v~vN~~~e~Ir~sGiVRP~DI~~~NtV~S~~IAdArI~ 204 (230) T PRK12700 125 SGGNTANGTGKSSANNTFTGTITTTVIGVLPNGNLQIAGEKQIAINRGSEYVRFSGVVDPRSITGSNTVSSTRVADARIE 204 (230) T ss_pred CCCCCCCCCCCEECCCEEEEEEEEEEEEECCCCCEEEEEEEEEEECCCEEEEEEEEEECHHHCCCCCEECCCEEEEEEEE T ss_conf 55543356543001135899999999998669958999989999889889999999998788899986970033548999 Q ss_pred EECEEEECCCCCCCHHHHHHHHHCCC Q ss_conf 80203312334866689999985589 Q gi|254780374|r 213 YGGKGRTTELLRPPIGHQLIENLSPL 238 (238) Q Consensus 213 y~g~G~~~d~~~~gwl~r~~~~i~Pl 238 (238) |.|+|.++|+|+||||+|||++|||. T Consensus 205 Y~G~G~l~d~q~~GWl~rff~~v~PF 230 (230) T PRK12700 205 YRSKGVMDEVQTMGWLQRFFLIASPF 230 (230) T ss_pred ECCEECCCCCCCCCHHHHHHHHHCCC T ss_conf 84301215211549999998861899 No 8 >PRK12697 flgH flagellar basal body L-ring protein; Reviewed Probab=100.00 E-value=0 Score=446.96 Aligned_cols=211 Identities=27% Similarity=0.354 Sum_probs=176.6 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCEEEE Q ss_conf 99999985433672002568753667777664444564455544444445576121577--5521123223676876999 Q gi|254780374|r 9 LCYSSLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRDS--HAALFKDSRALNVGDILTV 86 (238) Q Consensus 9 l~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~--~~~lf~D~RA~~VGDIiTV 86 (238) .+++++|+||++...+.. .++ |.+.++ +++...+.+||||+++ +++||+|+||++||||||| T Consensus 15 al~~l~LsGCa~~~~~p~-----------~~~----p~~~~~-~~~~~~~~~GSi~~~~~~~~~LF~D~RA~~VGDIiTV 78 (227) T PRK12697 15 ALAALALAGCAQIPREPI-----------TQQ----PMTAQP-PMPPSMQAPGSIYNPGYAGRPLFEDQRPRNVGDILTI 78 (227) T ss_pred HHHHHHHHCCCCCCCCCC-----------CCC----CCCCCC-CCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCEEEE T ss_conf 999999726567899998-----------889----878889-9987779997500587666600146654789996999 Q ss_pred EEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHEECEEEEEEEEEEEEEEECCCCC Q ss_conf 99724101000012223456544555433345544556665546555554321100000001489999999999807972 Q gi|254780374|r 87 DIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQTPQMNGNLNYDGGGASSGKGSISRAEKLNLLIAAIVTAILENGN 166 (238) Q Consensus 87 ~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~G~~~~~~~l~~~isa~V~~VlpNGn 166 (238) +|.|+++|+++++++.+|+++.+.+++....+.+. .....++.+++++|+|+|+++|+++|+++|+|+|+||||||| T Consensus 79 ~i~E~~~Asks~~t~~~r~~~~~~~~~~~~~l~~~---~~~~~~~~~~s~~f~G~G~~~~sn~ltgtItatV~~VLPNGN 155 (227) T PRK12697 79 VIAENINATKSSGANTNRQGNTSFDVPTAGFLGGL---FNKANLSATGANKFAATGGASAANTFNGTITVTVTNVLPNGN 155 (227) T ss_pred EEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEEECCCCC T ss_conf 99981311120356535567744367654333665---665543555564245643200111588999999999877993 Q ss_pred EEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEEEECEEEECCCCCCCHHHHHHHHHCCC Q ss_conf 899788779841887999999798477678997382420554399980203312334866689999985589 Q gi|254780374|r 167 LIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARISYGGKGRTTELLRPPIGHQLIENLSPL 238 (238) Q Consensus 167 L~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~y~g~G~~~d~~~~gwl~r~~~~i~Pl 238 (238) |+|+|+|+|++|+|.|+|+|+|||||+||++||||.|+|||||||+|.|+|.++|+|+||||+|||++|||. T Consensus 156 LvI~GeK~v~vN~e~e~IrlsGvVRP~DI~~dNtV~S~~IAdArI~Y~G~G~l~d~q~~GWl~r~f~~i~PF 227 (227) T PRK12697 156 LVVSGEKQMLINQGNEFVRFSGVVNPNTISGQNSVYSTQVADARIEYSAKGYINEAETMGWLQRFFLNIAPW 227 (227) T ss_pred EEEEEEEEEEECCCEEEEEEEEEECHHHCCCCCEECCCEEEEEEEEECCCEEECCCCCCCHHHHHHHHHCCC T ss_conf 999999999988987999999898778779998596005354799980202135222559999998864889 No 9 >PRK12699 flgH flagellar basal body L-ring protein; Reviewed Probab=100.00 E-value=0 Score=442.08 Aligned_cols=216 Identities=23% Similarity=0.341 Sum_probs=180.8 Q ss_pred HHHHHHHHHHHHCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--------CCCCCCCCCC Q ss_conf 999999998543367200-2568753667777664444564455544444445576121577--------5521123223 Q gi|254780374|r 7 IVLCYSSLLFGCHSSISE-ITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRDS--------HAALFKDSRA 77 (238) Q Consensus 7 ~~l~~~~lL~GCas~~~~-~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~--------~~~lf~D~RA 77 (238) +++++.++|+||+...++ ....|. ..+ +|.++...+.+..||+|+.. +++||+|+|| T Consensus 22 ~~~~~~~~l~gC~~~~~~p~~~~p~------------~a~--p~~~~~~~p~~~~Gsiy~~~~~~~~~g~~~~LF~D~RA 87 (246) T PRK12699 22 VLIVMLALVGGCSLPTPAPKVQQPM------------SAR--PPDLPLIPPSQGNGGIYQAAFDYQGRGRYHPLFEDRRA 87 (246) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCC------------CCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 9999999860446899987678875------------468--99755568888888602255444466654567576755 Q ss_pred CCCCCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHEECEEEEEEEEEEE Q ss_conf 67687699999724101000012223456544555433345544556665546555554321100000001489999999 Q gi|254780374|r 78 LNVGDILTVDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQTPQMNGNLNYDGGGASSGKGSISRAEKLNLLIAAI 157 (238) Q Consensus 78 ~~VGDIiTV~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~G~~~~~~~l~~~isa~ 157 (238) ++|||||||+|.|+++|+++++++.+|+++.+.+++.+++....... ....++.+++++|+|+|+++|+++|+++|+|+ T Consensus 88 ~~VGDIlTV~I~EstsAsk~a~t~~sr~~s~~~~~~~~~G~~~~~~~-~~~~~~~~ss~~f~G~G~~~~sn~ltgtItat 166 (246) T PRK12699 88 RQIGDTIIVLLNEKTNASKKSGSNAQRNGDISLSVPSFLGLPFKSIF-QDLDLEASSSNKFDGKGESSSNNNFTGTITVT 166 (246) T ss_pred CCCCCEEEEEEEECCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEE T ss_conf 54898599999973543221465536466635467421477643456-67675654454334642000103589999999 Q ss_pred EEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEEEECEEEECCCCCCCHHHHHHHHHCC Q ss_conf 99980797289978877984188799999979847767899738242055439998020331233486668999998558 Q gi|254780374|r 158 VTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARISYGGKGRTTELLRPPIGHQLIENLSP 237 (238) Q Consensus 158 V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~y~g~G~~~d~~~~gwl~r~~~~i~P 237 (238) |+||||||||+|+|+|||+||+|+|+|+|+|||||+|| .||+|.|+|||||||+|.|+|.++|+|+||||+|||++||| T Consensus 167 Vv~VLpNGNLvI~GeK~v~vN~e~e~IrlsGiVRP~DI-~dNtV~St~IAdArI~Y~G~G~i~d~q~~GWl~R~f~~i~P 245 (246) T PRK12699 167 VVDILPNGNLLVSGEKQIGINQGHEFIRLSGVINPINV-INNTVSSIQVADARIEYRGNGYLDEAQTMGWLSRFFLSISP 245 (246) T ss_pred EEEECCCCCEEEEEEEEEEECCCEEEEEEEEEECHHHC-CCCEECHHEEEEEEEEECCCEEECCCCCCCHHHHHHHHHCC T ss_conf 99987899889998799998898799999999783436-78708300125489998231112522255999999876189 Q ss_pred C Q ss_conf 9 Q gi|254780374|r 238 L 238 (238) Q Consensus 238 l 238 (238) . T Consensus 246 F 246 (246) T PRK12699 246 F 246 (246) T ss_pred C T ss_conf 9 No 10 >COG2063 FlgH Flagellar basal body L-ring protein [Cell motility and secretion] Probab=100.00 E-value=0 Score=415.84 Aligned_cols=225 Identities=35% Similarity=0.505 Sum_probs=189.1 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 99999999985433672002568753667777664444564455544444445576121577552112322367687699 Q gi|254780374|r 6 FIVLCYSSLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRDSHAALFKDSRALNVGDILT 85 (238) Q Consensus 6 l~~l~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~~~~lf~D~RA~~VGDIiT 85 (238) ++++++..+|++|++ .+...|++.|+.++...... |.+.+.+..+......+|||..++++||+|+||++|||||| T Consensus 4 ~~~~~~l~~la~c~~---e~~~~p~~~p~~~~~~~a~~-~~~~~~p~~p~~~~~~~sl~~~~~~~Lf~D~RA~~vGDilT 79 (230) T COG2063 4 LYALLALGLLAGCGT---ECAKVPPLSPIVPGATVAQP-PQPGPAPIQPGSIFQGYSLWNYGYQPLFEDRRASNVGDILT 79 (230) T ss_pred HHHHHHHHHHHHHHH---HHCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCCEEE T ss_conf 379999999987326---53168875546788542437-76555677887634543556733220122543256898799 Q ss_pred EEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCHHEECEEEEEEEEEEEEEEECC Q ss_conf 99972410100001222345654455543334554--4556665546555554321100000001489999999999807 Q gi|254780374|r 86 VDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFG--QQTPQMNGNLNYDGGGASSGKGSISRAEKLNLLIAAIVTAILE 163 (238) Q Consensus 86 V~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~G~~~~~~~l~~~isa~V~~Vlp 163 (238) |+|.|+++|++++++..+|+++....+....+... ........ +...++++|+|+|+..|+|+|+++|||+|++||| T Consensus 80 Vvi~E~~~As~~~~t~~~r~~~~~~~~~~~~g~~~~~~~~~d~~~-~~~~s~n~~~G~g~~~~s~~l~~tIta~V~~VLp 158 (230) T COG2063 80 IVIQENTSASKSSSTNRSRKNSTGFGLTAVGGVLLGGNNGGDDSS-YETSSNNDFKGKGGAKRSESLTGTITATVVQVLP 158 (230) T ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHH-HHCCCCCCCCCCCCCCCCCEEEEEEEEEEEEECC T ss_conf 999854520200232101236788672333120347556787024-2225886656775533234279999999999868 Q ss_pred CCCEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEEEECEEEECCCCCCCHHHHHHHHH Q ss_conf 972899788779841887999999798477678997382420554399980203312334866689999985 Q gi|254780374|r 164 NGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARISYGGKGRTTELLRPPIGHQLIENL 235 (238) Q Consensus 164 NGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~y~g~G~~~d~~~~gwl~r~~~~i 235 (238) ||||+|+|+|||+||+|.|+|+|+|||||+||+.+|||.|+|||||||+|.|+|.++|.|+||||+|||+.+ T Consensus 159 NGNL~I~G~Kev~vN~~~e~i~vsGvVRP~DI~~~NtI~S~~IAdARI~Y~G~G~lse~q~~gwlq~~~~~l 230 (230) T COG2063 159 NGNLVIEGEKEVRVNGEKEIIRVSGVVRPDDISGDNTISSTRIADARIEYGGRGYLSEAQQPGWLQRFFLAL 230 (230) T ss_pred CCCEEEEEEEEEEECCCEEEEEEEEEECHHCCCCCCEECCHHEEEEEEEECCCEEECCCCCCCHHHHHHHHC T ss_conf 997899977999977844899994268601057777143101132589865725663466865899998529 No 11 >pfam02107 FlgH Flagellar L-ring protein. Probab=100.00 E-value=0 Score=407.39 Aligned_cols=178 Identities=35% Similarity=0.537 Sum_probs=166.8 Q ss_pred CCCCCCCCCCCCCC-CCCCCCCCCCCCCCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 44455761215775-52112322367687699999724101000012223456544555433345544556665546555 Q gi|254780374|r 55 DSTKKSYSLWRDSH-AALFKDSRALNVGDILTVDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQTPQMNGNLNYD 133 (238) Q Consensus 55 ~~~~~~gSL~~~~~-~~lf~D~RA~~VGDIiTV~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (238) ++.+.+||||+++. .+||+|+||++|||||||+|.|+++|+++++++.+|+++.+..++.+++... ....++.+ T Consensus 3 ~~~~~~GSl~~~~~~~~lf~D~rA~~VGDivTV~i~E~~~A~~s~~t~~~r~~~~~~~~~~~~g~~~-----~~~~~~~~ 77 (181) T pfam02107 3 PPAPANGSLYQAGSARPLFEDRRARRVGDILTVVLSENTSASKSASTSTSRSSSISLGIPTLFGLGG-----LSLSLGAS 77 (181) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCC-----CCCCCCCC T ss_conf 8988997532589866542455324589869999998044321145430334443315643257777-----66321346 Q ss_pred CCCCCCCCHHEECEEEEEEEEEEEEEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEEHHEEEEEEEEE Q ss_conf 55432110000000148999999999980797289978877984188799999979847767899738242055439998 Q gi|254780374|r 134 GGGASSGKGSISRAEKLNLLIAAIVTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVSYDKIAEARISY 213 (238) Q Consensus 134 ~~~~~~g~G~~~~~~~l~~~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S~~iAda~I~y 213 (238) ++++|+|+|+++|+++|+++|+|+|++|||||||+|+|+|+|+||+|.|+|+|+|||||+||+++|+|.|++||||||+| T Consensus 78 ~~~~~~G~G~~~~~~~lt~~ita~V~~VlpNGnLvI~G~K~i~vn~e~e~i~lsGiVRp~DI~~~NtV~S~~IAdA~I~y 157 (181) T pfam02107 78 SSSSFKGSGSASQSNTLTGTITVTVTDVLPNGNLVIRGEKEVRVNQGDEYIRLSGIVRPEDISPDNTVSSTRIADARIEY 157 (181) T ss_pred CCCCCCCCCEEEEEEEEEEEEEEEEEEECCCCCEEEEEEEEEEECCCEEEEEEEEEECHHHCCCCCEECCCEEEEEEEEE T ss_conf 55434564305333069999999999986899899999999998998899999999998997999759600335589998 Q ss_pred ECEEEECCCCCCCHHHHHHHHHCC Q ss_conf 020331233486668999998558 Q gi|254780374|r 214 GGKGRTTELLRPPIGHQLIENLSP 237 (238) Q Consensus 214 ~g~G~~~d~~~~gwl~r~~~~i~P 237 (238) .|+|.++|+|+||||+|||+++|| T Consensus 158 ~G~G~l~d~~~~gwl~r~~~~i~P 181 (181) T pfam02107 158 GGRGAIADAQQPGWLQRFFDSVWP 181 (181) T ss_pred ECEEEECCCCCCCHHHHHHHHHCC T ss_conf 010523644367999999886683 No 12 >PRK11627 hypothetical protein; Provisional Probab=94.86 E-value=0.024 Score=34.70 Aligned_cols=25 Identities=20% Similarity=0.355 Sum_probs=18.3 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9117999999999985433672002 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISEI 25 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~~ 25 (238) |+|.+++.++..++|+||+++...+ T Consensus 1 ~mkk~~~~l~a~~lL~gCas~p~~l 25 (192) T PRK11627 1 MLKKILFPLVALFMLAGCATPPTTL 25 (192) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 9287999999999998606997637 No 13 >PRK09837 copper/silver efflux system outer membrane protein CusC; Provisional Probab=91.49 E-value=0.29 Score=27.81 Aligned_cols=19 Identities=26% Similarity=0.597 Sum_probs=17.5 Q ss_pred CCHHHHHHHHHHHHHHHHC Q ss_conf 9117999999999985433 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCH 19 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCa 19 (238) |++..++.++++++|+||+ T Consensus 1 ~~~~rll~l~~~l~L~gC~ 19 (460) T PRK09837 1 MSPCKLLPFCVALALTGCS 19 (460) T ss_pred CCHHHHHHHHHHHHHCCCC T ss_conf 9757899999999982477 No 14 >PRK00022 lolB outer membrane lipoprotein LolB; Provisional Probab=90.98 E-value=0.22 Score=28.64 Aligned_cols=23 Identities=30% Similarity=0.397 Sum_probs=16.8 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 91179999999999854336720 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~ 23 (238) |.|+..++++++++|+||++... T Consensus 1 ~~r~~~~l~~~~llLsgCat~p~ 23 (203) T PRK00022 1 MFRLLRLLPLAALLLAGCASLPP 23 (203) T ss_pred CHHHHHHHHHHHHHHHHCCCCCC T ss_conf 90638999999999866148999 No 15 >TIGR02747 TraV type IV conjugative transfer system protein TraV; InterPro: IPR014118 This entry includes TraV, which is a component of conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK , , . The alignment contains three conserved cysteines in the N-terminal half.. Probab=90.43 E-value=0.16 Score=29.51 Aligned_cols=20 Identities=30% Similarity=0.544 Sum_probs=15.2 Q ss_pred CCHHHHHHHHHHHH--HHHHCC Q ss_conf 91179999999999--854336 Q gi|254780374|r 1 MYRYSFIVLCYSSL--LFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~~l--L~GCas 20 (238) |||+.+++++++++ |+||++ T Consensus 1 Mmr~~~L~~~~~~~f~LtGCsa 22 (174) T TIGR02747 1 MMRLKVLLLLACVAFLLTGCSA 22 (174) T ss_pred CCHHHHHHHHHHHHHHHHHHHH T ss_conf 9067899999999872002577 No 16 >PRK09915 putative outer membrane efflux protein MdtP; Provisional Probab=90.19 E-value=0.4 Score=26.93 Aligned_cols=21 Identities=24% Similarity=0.360 Sum_probs=15.2 Q ss_pred CCHHHHHHHHHH-HHHHHHCCC Q ss_conf 911799999999-998543367 Q gi|254780374|r 1 MYRYSFIVLCYS-SLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~-~lL~GCas~ 21 (238) |+|++++.++.+ ++|+||+.. T Consensus 6 ~~r~~~~~~l~~~~lLagC~~~ 27 (488) T PRK09915 6 LSRLLLCSILGSTTLISGCALV 27 (488) T ss_pred HHHHHHHHHHHHHHHHCCCCCC T ss_conf 7699999999999997526689 No 17 >TIGR02722 lp_ uncharacterized lipoprotein; InterPro: IPR014094 Members of this protein family are restricted to the Proteobacteria, and all are predicted lipoproteins. In genomes that contain the thiK gene for the salvage enzyme thiamin kinase, the member of this family is encoded nearby.. Probab=88.73 E-value=1 Score=24.46 Aligned_cols=86 Identities=17% Similarity=0.186 Sum_probs=43.6 Q ss_pred HHHHHHHHHHHHHHHHCCCC-CCCCCCCC--CCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 17999999999985433672-00256875--366777--76644445644555444444455761215775521123223 Q gi|254780374|r 3 RYSFIVLCYSSLLFGCHSSI-SEITGIPQ--MSPMGS--SLDENNRMPFLGIDFKNSDSTKKSYSLWRDSHAALFKDSRA 77 (238) Q Consensus 3 r~~l~~l~~~~lL~GCas~~-~~~~~~p~--~tp~~~--~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~~~~lf~D~RA 77 (238) ++++++++++++|+||.... ......+. ..|+.+ .......++....++.||....- =|+.-..|.|+.-++ T Consensus 4 ~~l~~~~~~Al~L~GC~~~~~~y~~~~~~~~~~~~~~tl~~~~~d~~~~a~~~~~pPk~~~~---dwsm~~~P~v~~~~~ 80 (215) T TIGR02722 4 KILIFVALLALLLSGCVSQRVAYTDATDPQPVEPVTPTLGLSSTDLEPIANKMIQPPKVRKI---DWSMLSSPAVAQITP 80 (215) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCHHHHHHHCCCCCHHHH---HHHHHCCCCCCCCCC T ss_conf 87899999999985468872665688756677760577668886723347651388741256---466526884022367 Q ss_pred C-C--CCCEEEEEEEEC Q ss_conf 6-7--687699999724 Q gi|254780374|r 78 L-N--VGDILTVDIRID 91 (238) Q Consensus 78 ~-~--VGDIiTV~i~E~ 91 (238) . . -+-||-|----| T Consensus 81 ~~gv~~~~il~vd~i~N 97 (215) T TIGR02722 81 GDGVETRPILLVDSIKN 97 (215) T ss_pred CCCCCCCCEEEEEEEEC T ss_conf 78721267789851216 No 18 >pfam06291 Lambda_Bor Bor protein. This family consists of several Bacteriophage lambda Bor and Escherichia coli Iss proteins. Expression of bor significantly increases the survival of the Escherichia coli host cell in animal serum. This property is a well known bacterial virulence determinant indeed, bor and its adjacent sequences are highly homologous to the iss serum resistance locus of the plasmid ColV2-K94, which confers virulence in animals. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=88.53 E-value=0.43 Score=26.79 Aligned_cols=25 Identities=20% Similarity=0.368 Sum_probs=17.7 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 91179999999999854336720025 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISEIT 26 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~~~ 26 (238) |+|+.+... ++++|+||+++.-.+. T Consensus 1 MKk~~~~~~-~allLtgCa~QT~~~~ 25 (97) T pfam06291 1 MKKMLFAAA-LALLITGCAQQTFTVG 25 (97) T ss_pred CHHHHHHHH-HHHHHCCCCCEEEEEC T ss_conf 922599999-9999721330499935 No 19 >PRK10449 heat-inducible protein; Provisional Probab=88.17 E-value=0.76 Score=25.19 Aligned_cols=20 Identities=35% Similarity=0.539 Sum_probs=14.6 Q ss_pred CCHHHHHHHHHHHHHHHHCCC Q ss_conf 911799999999998543367 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~ 21 (238) |+|+ +.++++.++|+||++. T Consensus 1 MKk~-l~l~~~~llLagC~s~ 20 (140) T PRK10449 1 MKKV-VALVALSLLMAGCVSS 20 (140) T ss_pred CCHH-HHHHHHHHHHHHCCCC T ss_conf 9034-9999999999752389 No 20 >COG3056 Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=87.96 E-value=0.51 Score=26.30 Aligned_cols=23 Identities=17% Similarity=0.315 Sum_probs=16.3 Q ss_pred CEEEEEEEEEEEEEEECCCCCEE Q ss_conf 00148999999999980797289 Q gi|254780374|r 146 RAEKLNLLIAAIVTAILENGNLI 168 (238) Q Consensus 146 ~~~~l~~~isa~V~~VlpNGnL~ 168 (238) -...+.+.+-++|+..-|||.|+ T Consensus 132 lryni~tkv~i~I~A~~~~Gk~~ 154 (204) T COG3056 132 LRYNIDTKVDITITATAANGKMV 154 (204) T ss_pred EEECCCCEEEEEEEEECCCCCEE T ss_conf 23302545889999973787155 No 21 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=87.77 E-value=0.37 Score=27.15 Aligned_cols=21 Identities=29% Similarity=0.474 Sum_probs=14.0 Q ss_pred CCHHHHH-HHHHHHHHHHHCCC Q ss_conf 9117999-99999998543367 Q gi|254780374|r 1 MYRYSFI-VLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~-~l~~~~lL~GCas~ 21 (238) |+|.++. +++++++|+||++. T Consensus 1 m~k~~~~~~l~~~l~lsgC~s~ 22 (160) T PRK09967 1 MIKHLVAPLIFTSLILTGCQSP 22 (160) T ss_pred CHHHHHHHHHHHHHHHHHCCCC T ss_conf 9078999999999999845899 No 22 >pfam05643 DUF799 Putative bacterial lipoprotein (DUF799). This family consists of several bacterial proteins of unknown function. Some of the family members are described as putative lipoproteins. Probab=87.10 E-value=0.62 Score=25.77 Aligned_cols=23 Identities=17% Similarity=0.122 Sum_probs=15.0 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 911799999999998543367200 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISE 24 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~ 24 (238) ||++++ .+..+++|+||+++..+ T Consensus 1 mK~lil-~l~~~l~L~gCa~~~~~ 23 (215) T pfam05643 1 MKPLIL-GLAAVLALSACQVQKAP 23 (215) T ss_pred CHHHHH-HHHHHHHHHHCCCCCCC T ss_conf 931499-99999998623489998 No 23 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=86.80 E-value=0.69 Score=25.46 Aligned_cols=21 Identities=29% Similarity=0.512 Sum_probs=18.5 Q ss_pred CCHHHHHHHHHHHHHHHHCCC Q ss_conf 911799999999998543367 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~ 21 (238) |++++..++++.+||+||++- T Consensus 1 MkK~i~~il~~~llL~GCs~m 21 (304) T pfam07901 1 MRKLIKLILIATLLLSGCSTT 21 (304) T ss_pred CCHHHHHHHHHHHHHCCCCCC T ss_conf 901899999999997133445 No 24 >PRK10760 murein hydrolase B; Provisional Probab=86.80 E-value=1.4 Score=23.46 Aligned_cols=23 Identities=30% Similarity=0.332 Sum_probs=15.3 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 911799999999998543367200 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISE 24 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~ 24 (238) |+|++-+ +.++++|+||++...+ T Consensus 1 ~r~~~~~-~~~~~~l~~css~p~~ 23 (357) T PRK10760 1 KRRYVTL-LPLFVLLAACSSKPKP 23 (357) T ss_pred CCCHHHH-HHHHHHHHHHCCCCCC T ss_conf 9404868-9999999985189999 No 25 >pfam06474 MLTD_N MLTD_N. Probab=86.22 E-value=1.2 Score=23.92 Aligned_cols=17 Identities=41% Similarity=0.671 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHCCCCC Q ss_conf 99999999854336720 Q gi|254780374|r 7 IVLCYSSLLFGCHSSIS 23 (238) Q Consensus 7 ~~l~~~~lL~GCas~~~ 23 (238) +++++++||+||.+... T Consensus 5 ail~~alLLaGCQst~~ 21 (93) T pfam06474 5 AVLAAALLLVGCQSTQA 21 (93) T ss_pred HHHHHHHHHHHHCCCCC T ss_conf 99999999998137999 No 26 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=85.72 E-value=0.69 Score=25.48 Aligned_cols=22 Identities=36% Similarity=0.496 Sum_probs=14.3 Q ss_pred CCH--HHHHHHHHHHHHHHHCCCC Q ss_conf 911--7999999999985433672 Q gi|254780374|r 1 MYR--YSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 1 M~r--~~l~~l~~~~lL~GCas~~ 22 (238) |+| +++++++++++|+||++.. T Consensus 1 m~~~k~l~~~~~l~l~l~gCs~~~ 24 (243) T PRK10866 1 MTRMKYLVAAATLSLFLAGCSGSK 24 (243) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 945899999999999998768998 No 27 >PRK13883 conjugal transfer protein TrbH; Provisional Probab=85.27 E-value=0.58 Score=25.95 Aligned_cols=22 Identities=23% Similarity=0.259 Sum_probs=14.6 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 91179999999999854336720 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~ 23 (238) |+|++++.+++ ++|+||++..+ T Consensus 1 MrK~~~~all~-laL~gCAtt~~ 22 (156) T PRK13883 1 MRKILSLALLA-LALGGCATTSQ 22 (156) T ss_pred CHHHHHHHHHH-HHHHCCCCCCC T ss_conf 92689999999-99703313578 No 28 >PRK09810 entericidin A; Provisional Probab=84.81 E-value=0.57 Score=25.99 Aligned_cols=21 Identities=43% Similarity=0.684 Sum_probs=17.1 Q ss_pred CCHHHHHHHHHHHHHHHHCCC Q ss_conf 911799999999998543367 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~ 21 (238) |+|++.++++...+|+||.+. T Consensus 2 mkrli~lil~~~~ll~gcnta 22 (41) T PRK09810 2 MKRLIVLVLLASTLLTGCNTA 22 (41) T ss_pred HHHHHHHHHHHHHHHHCCCCC T ss_conf 389999999999998256212 No 29 >COG3417 FlgN Collagen-binding surface adhesin SpaP (antigen I/II family) [General function prediction only] Probab=84.56 E-value=1.3 Score=23.82 Aligned_cols=20 Identities=25% Similarity=0.399 Sum_probs=13.3 Q ss_pred CHHHHHHHHHHHHHHHHCCC Q ss_conf 11799999999998543367 Q gi|254780374|r 2 YRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 2 ~r~~l~~l~~~~lL~GCas~ 21 (238) +|++..+++++++|+||++. T Consensus 4 ~k~~~~il~~al~l~GCs~~ 23 (200) T COG3417 4 MKIYASILLLALFLSGCSSE 23 (200) T ss_pred HHHHHHHHHHHHHHHHCCCC T ss_conf 79999999999998411367 No 30 >PRK13616 lipoprotein LpqB; Provisional Probab=84.23 E-value=1.1 Score=24.07 Aligned_cols=22 Identities=27% Similarity=0.307 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCC Q ss_conf 1799999999998543367200 Q gi|254780374|r 3 RYSFIVLCYSSLLFGCHSSISE 24 (238) Q Consensus 3 r~~l~~l~~~~lL~GCas~~~~ 24 (238) |.+.++++++++|+||++-... T Consensus 7 ~~~~~~~~~~~llaGCaslP~s 28 (590) T PRK13616 7 KALAALLAVALLVAGCASLPSS 28 (590) T ss_pred HHHHHHHHHHHHHHHCCCCCCC T ss_conf 9999999999986120368999 No 31 >TIGR03502 lipase_Pla1_cef extracellular lipase, Pla-1/cef family. Members of this protein family are bacterial lipoproteins largely from the Gammaproteobacteria. Characterized members are expressed in extracellularly and have esterase activity. Members include the lipase Pla-1 from Aeromonas hydrophila (AF092033) and CHO cell elongation factor (cef) from Vibrio hollisae Probab=83.65 E-value=1.7 Score=23.05 Aligned_cols=20 Identities=25% Similarity=0.250 Sum_probs=13.7 Q ss_pred CCHHHHHHHHHH-HHHHHHCC Q ss_conf 911799999999-99854336 Q gi|254780374|r 1 MYRYSFIVLCYS-SLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~-~lL~GCas 20 (238) |||+++.+.++. +.|+||.. T Consensus 1 MKk~~l~~~iasal~LagCGg 21 (792) T TIGR03502 1 MKKLLLSLAIASALGLAGCGD 21 (792) T ss_pred CCHHHHHHHHHHHHHHHCCCC T ss_conf 933789999999875004578 No 32 >PRK10598 hypothetical protein; Provisional Probab=81.37 E-value=1.1 Score=24.30 Aligned_cols=20 Identities=35% Similarity=0.680 Sum_probs=15.0 Q ss_pred CCHHHH-HHHHHHHHHHHHCC Q ss_conf 911799-99999999854336 Q gi|254780374|r 1 MYRYSF-IVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l-~~l~~~~lL~GCas 20 (238) |+|+.+ .+++++.+|+||++ T Consensus 1 mkk~~l~~~l~l~~llsGC~~ 21 (186) T PRK10598 1 MKKFLLAAALLVSGLLVGCNQ 21 (186) T ss_pred CCHHHHHHHHHHHHHHHCCCC T ss_conf 912699999999999714353 No 33 >COG4314 NosL Predicted lipoprotein involved in nitrous oxide reduction [Energy production and conversion] Probab=81.10 E-value=1.4 Score=23.60 Aligned_cols=23 Identities=30% Similarity=0.532 Sum_probs=18.8 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 91179999999999854336720 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~ 23 (238) |+|.++..+++.++|+||....+ T Consensus 1 mkr~Lla~la~~~llAgC~~~ed 23 (176) T COG4314 1 MKRTLLAILAVTALLAGCRQAED 23 (176) T ss_pred CCHHHHHHHHHHHHHHHCCHHHC T ss_conf 94037999999999875343113 No 34 >COG4851 CamS Protein involved in sex pheromone biosynthesis [General function prediction only] Probab=81.08 E-value=1.2 Score=23.88 Aligned_cols=24 Identities=17% Similarity=0.358 Sum_probs=18.4 Q ss_pred CCH-HHHHHHHHHHHHHHHCCCCCC Q ss_conf 911-799999999998543367200 Q gi|254780374|r 1 MYR-YSFIVLCYSSLLFGCHSSISE 24 (238) Q Consensus 1 M~r-~~l~~l~~~~lL~GCas~~~~ 24 (238) |+| +.+.+++++++|+||+..+.+ T Consensus 1 Mkktl~i~~ta~vliLs~C~~~~dd 25 (382) T COG4851 1 MKKTLGIAATASVLILSGCFPFVDD 25 (382) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9401558999999998611676687 No 35 >PRK00276 infA translation initiation factor IF-1; Validated Probab=81.02 E-value=3.7 Score=20.86 Aligned_cols=57 Identities=18% Similarity=0.238 Sum_probs=37.1 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCC--CCC----EEEHHEEEEEEEEEE Q ss_conf 99999999807972899788779841887999999798477678--997----382420554399980 Q gi|254780374|r 153 LIAAIVTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVD--AHN----SVSYDKIAEARISYG 214 (238) Q Consensus 153 ~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~--~~N----tV~S~~iAda~I~y~ 214 (238) .+..+|+|+|||++..|+=+ |+..-.-.+||-+|-.-|. ++. .++-..+.-.||.|. T Consensus 8 ~~~G~V~e~lpn~~F~V~Le-----ng~~v~a~~sGKmR~~~Iril~GD~V~vE~spYDltkGRIv~R 70 (72) T PRK00276 8 EMEGTVLETLPNAMFRVELE-----NGHEVLAHISGKMRKNYIRILPGDKVTVELSPYDLTKGRITYR 70 (72) T ss_pred EEEEEEEEECCCCEEEEEEC-----CCCEEEEEECHHEEEEEEEECCCCEEEEEECCCCCCCEEEEEE T ss_conf 99999999859988999978-----9999999974131101699758998999988667995789997 No 36 >COG4764 Uncharacterized protein conserved in bacteria [Function unknown] Probab=80.83 E-value=1 Score=24.45 Aligned_cols=24 Identities=33% Similarity=0.560 Sum_probs=16.5 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 91179999999999854336720025 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISEIT 26 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~~~ 26 (238) |+|++|.++ .+.|+||++...++. T Consensus 5 ~~r~~~~v~--lL~LagCaTaP~~~~ 28 (197) T COG4764 5 MMRLVFAVV--LLALAGCATAPSQVN 28 (197) T ss_pred HHHHHHHHH--HHHHHHCCCCCCCCC T ss_conf 698999999--999703036876765 No 37 >COG3065 Slp Starvation-inducible outer membrane lipoprotein [Cell envelope biogenesis, outer membrane] Probab=80.59 E-value=3 Score=21.44 Aligned_cols=58 Identities=24% Similarity=0.299 Sum_probs=31.5 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCC Q ss_conf 1799999999998543367200256875366777766444456445554444444557612157-755211232236768 Q gi|254780374|r 3 RYSFIVLCYSSLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRD-SHAALFKDSRALNVG 81 (238) Q Consensus 3 r~~l~~l~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~-~~~~lf~D~RA~~VG 81 (238) ++-.++..++++|+||.+-.+.+...- |. ..-|+... ....+|--.-||.=| T Consensus 8 ~~~~l~~~laflLsgC~tiPk~l~g~~---------------~~------------s~~s~~~~~~~~~~~~gq~aR~GG 60 (191) T COG3065 8 KKGALIGTLAFLLSGCVTIPKALKGES---------------PT------------SQQSLVRVMSQPQLYVGQQARFGG 60 (191) T ss_pred HHHHHHHHHHHHHHHCCCCCHHHCCCC---------------CC------------CHHHEEEECCCCCCCCCCEEEECC T ss_conf 799999999999763035783126899---------------74------------222011002577652044235176 Q ss_pred CEEEEE Q ss_conf 769999 Q gi|254780374|r 82 DILTVD 87 (238) Q Consensus 82 DIiTV~ 87 (238) -||.|. T Consensus 61 kVvnv~ 66 (191) T COG3065 61 KVVNVI 66 (191) T ss_pred EEEEEE T ss_conf 899876 No 38 >PRK11138 outer membrane protein assembly complex subunit YfgL; Provisional Probab=80.49 E-value=3.4 Score=21.10 Aligned_cols=19 Identities=26% Similarity=0.293 Sum_probs=13.5 Q ss_pred CHHHHHHHHHHHHHHHHCC Q ss_conf 1179999999999854336 Q gi|254780374|r 2 YRYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 2 ~r~~l~~l~~~~lL~GCas 20 (238) +|+++..++.+.+|+||+. T Consensus 4 ~~~~~~~l~~~~ll~gCs~ 22 (394) T PRK11138 4 RKLLLPGLLSVALLSGCSL 22 (394) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 6779999999999865237 No 39 >PRK11459 multidrug resistance outer membrane protein MdtQ; Provisional Probab=80.31 E-value=2.5 Score=21.91 Aligned_cols=20 Identities=45% Similarity=0.534 Sum_probs=13.5 Q ss_pred CCHHHHH----HHHHHHHHHHHCC Q ss_conf 9117999----9999999854336 Q gi|254780374|r 1 MYRYSFI----VLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~----~l~~~~lL~GCas 20 (238) |+|..|. .+.++++|+||+. T Consensus 1 ~~~~~f~~~~a~l~~~llLaGCa~ 24 (478) T PRK11459 1 MNRDSFYAATASLPLFILLAGCAP 24 (478) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCC T ss_conf 997405888999999999837889 No 40 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=80.16 E-value=1.1 Score=24.15 Aligned_cols=21 Identities=33% Similarity=0.587 Sum_probs=14.0 Q ss_pred CCHHHHHHHHHHHHHHHHCCCC Q ss_conf 9117999999999985433672 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~ 22 (238) |+|++++.+++ ++|+||++.. T Consensus 1 Mkk~~~l~~~~-l~LagCas~~ 21 (26) T pfam08139 1 MKKLLLLLLAL-LLLAGCASXX 21 (26) T ss_pred CHHHHHHHHHH-HHHHCCCCCC T ss_conf 96699999999-9982332001 No 41 >pfam07273 DUF1439 Protein of unknown function (DUF1439). This family consists of several hypothetical bacterial proteins of around 190 residues in length. Several members of this family are annotated as being putative lipoproteins and are often known as YceB. The function of this family is unknown. Probab=79.59 E-value=1.3 Score=23.64 Aligned_cols=20 Identities=35% Similarity=0.580 Sum_probs=15.6 Q ss_pred CCHHHHHHHHHHHHHHHHCC Q ss_conf 91179999999999854336 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas 20 (238) |+++.+++++++++|+||++ T Consensus 1 Mk~~~~~~~~l~~~LsgC~~ 20 (177) T pfam07273 1 MKKLLLLILLLVLLLSGCAS 20 (177) T ss_pred CCHHHHHHHHHHHHHHCCCC T ss_conf 91489999999999715575 No 42 >pfam06572 DUF1131 Protein of unknown function (DUF1131). This family consists of several hypothetical bacterial proteins of unknown function. Probab=79.44 E-value=1 Score=24.39 Aligned_cols=20 Identities=30% Similarity=0.577 Sum_probs=16.8 Q ss_pred CCHHHHHHHHHHHHHHHHCC Q ss_conf 91179999999999854336 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas 20 (238) |+.+.++++.+.++|+||++ T Consensus 1 ~~~~r~~l~~~~llL~GCst 20 (192) T pfam06572 1 MKSLRLFLLALPLLLTGCST 20 (192) T ss_pred CCCHHHHHHHHHHHHHCCCC T ss_conf 94301368776688606556 No 43 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=78.82 E-value=1.8 Score=22.88 Aligned_cols=20 Identities=25% Similarity=0.423 Sum_probs=11.4 Q ss_pred CCH--H-HHHHHHHHHHHHHHCC Q ss_conf 911--7-9999999999854336 Q gi|254780374|r 1 MYR--Y-SFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r--~-~l~~l~~~~lL~GCas 20 (238) |+| + ++.+++++++|+||.. T Consensus 1 m~~k~~~li~ll~~~~lL~gC~~ 23 (385) T PRK09859 1 MNRRRKLLIPLLFCGAMLTACDD 23 (385) T ss_pred CCCHHHHHHHHHHHHHHHHCCCC T ss_conf 98206789999999999953799 No 44 >PRK11067 outer membrane protein assembly factor YaeT; Provisional Probab=78.75 E-value=1.1 Score=24.20 Aligned_cols=32 Identities=28% Similarity=0.434 Sum_probs=25.7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCHHH Q ss_conf 61215775521123223676876999997241010 Q gi|254780374|r 61 YSLWRDSHAALFKDSRALNVGDILTVDIRIDDQAV 95 (238) Q Consensus 61 gSL~~~~~~~lf~D~RA~~VGDIiTV~i~E~~~a~ 95 (238) -+||. ..+|.|-+...-|++|+|.|.|..... T Consensus 64 k~L~~---tg~F~dV~v~~~g~~L~~~V~E~P~I~ 95 (801) T PRK11067 64 RALFA---TGNFEDVRVLRDGDTLLVQVKERPTIA 95 (801) T ss_pred HHHHH---CCCCCEEEEEEECCEEEEEEEECCEEE T ss_conf 99985---899524899971999999999577389 No 45 >PRK10672 rare lipoprotein A; Provisional Probab=78.39 E-value=4.8 Score=20.17 Aligned_cols=25 Identities=16% Similarity=0.145 Sum_probs=17.4 Q ss_pred EEEEEEEEEEECCCCCEEEEEEEEE Q ss_conf 9999999999807972899788779 Q gi|254780374|r 151 NLLIAAIVTAILENGNLIISGSQEV 175 (238) Q Consensus 151 ~~~isa~V~~VlpNGnL~I~G~k~i 175 (238) ++.+-+.++.|-|+|+|.=-|..-. T Consensus 161 ~a~VrVe~I~vapdg~lsGpGt~gt 185 (369) T PRK10672 161 NTKVRIDPIIVAQDGSLSGPGMAGT 185 (369) T ss_pred CCEEEEEEEEECCCCCCCCCCCCCC T ss_conf 7505787776357655356655453 No 46 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=78.00 E-value=2 Score=22.51 Aligned_cols=20 Identities=25% Similarity=0.386 Sum_probs=15.0 Q ss_pred CHHHHHHHHHHHHHHHHCCC Q ss_conf 11799999999998543367 Q gi|254780374|r 2 YRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 2 ~r~~l~~l~~~~lL~GCas~ 21 (238) +|+.+++++++++|+||... T Consensus 7 ~~~~l~~l~~~~~L~gC~~~ 26 (385) T PRK09578 7 RRLALAALVAAFALAGCGKG 26 (385) T ss_pred HHHHHHHHHHHHHHHCCCCC T ss_conf 99999999999998167999 No 47 >TIGR02887 spore_ger_x_C germination protein, Ger(x)C family; InterPro: IPR008844 The GerAA, -AB, and -AC proteins of the Bacillus subtilis spore are required for the germination response to L-alanine as the sole germinant. Members of GerAC family are thought to be located in the inner spore membrane. Although the function of this family is unclear, they are likely to encode the components of the germination apparatus that respond directly to this germinant, mediating the spore's response .; GO: 0009847 spore germination, 0016020 membrane. Probab=77.65 E-value=1.4 Score=23.55 Aligned_cols=21 Identities=19% Similarity=0.249 Sum_probs=15.9 Q ss_pred CCHHHHHHHHHHHHHHHHCCC Q ss_conf 911799999999998543367 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~ 21 (238) +++.++++++++++|+||-.+ T Consensus 1 ~~~~~~~~~~~~~lL~GCwd~ 21 (400) T TIGR02887 1 KKLKILLLILALLLLTGCWDK 21 (400) T ss_pred CHHHHHHHHHHHHHHCCCCCC T ss_conf 946678999999983266562 No 48 >TIGR00752 slp outer membrane lipoprotein, Slp family; InterPro: IPR004658 Slp superfamily members are present in the Gram-negative gamma proteobacteria Escherichia coli (which also contains a close paralog), Haemophilus influenzae and Pasteurella multocida and Vibrio cholerae. The known members of the family to date share a motif LX[GA]C near the N-terminus, which is compatible with the possibility that the protein is modified into a lipoprotein with Cys as the new N-terminus. Slp from E. coli is known to be a lipoprotein of the outer membrane and to be expressed in response to carbon starvation.; GO: 0019867 outer membrane. Probab=76.51 E-value=5.4 Score=19.84 Aligned_cols=62 Identities=24% Similarity=0.314 Sum_probs=37.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCC Q ss_conf 79999999999854336720025687536677776644445644555444444455761215-77552112322367687 Q gi|254780374|r 4 YSFIVLCYSSLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWR-DSHAALFKDSRALNVGD 82 (238) Q Consensus 4 ~~l~~l~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~-~~~~~lf~D~RA~~VGD 82 (238) .-++.+.++++|+||.+..+.+.+..+ |... .|+=+ .....||--+.||.=|- T Consensus 4 ~g~l~~~l~f~L~gC~~~P~~ikg~~q--------------P~~~------------~s~v~v~~qp~lyvg~~aRfGGK 57 (190) T TIGR00752 4 KGLLITALAFLLTGCIAVPKAIKGERQ--------------PFSQ------------KSYVEVSSQPLLYVGQEARFGGK 57 (190) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCC--------------CCCC------------HHHHEECCCCCEEECCEEEECCE T ss_conf 145688899887531467423455679--------------8620------------02321107997231105753775 Q ss_pred EEEEEEEEC Q ss_conf 699999724 Q gi|254780374|r 83 ILTVDIRID 91 (238) Q Consensus 83 IiTV~i~E~ 91 (238) ||-|....+ T Consensus 58 vvnv~~~~~ 66 (190) T TIGR00752 58 VVNVTNLKN 66 (190) T ss_pred EEEEEECCC T ss_conf 999876488 No 49 >COG5633 Predicted periplasmic lipoprotein [General function prediction only] Probab=75.44 E-value=1.7 Score=22.97 Aligned_cols=22 Identities=27% Similarity=0.386 Sum_probs=15.0 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 91179999999999854336720 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~ 23 (238) |+++.+ +++..++|.||++..+ T Consensus 1 Mrk~~~-~~l~~~lLvGCsS~~~ 22 (123) T COG5633 1 MRKLCL-LSLALLLLVGCSSHQE 22 (123) T ss_pred CCEEHH-HHHHHHHHHCCCCCCC T ss_conf 930038-9999999420478877 No 50 >PRK11189 lipoprotein NlpI; Provisional Probab=74.40 E-value=3.1 Score=21.34 Aligned_cols=20 Identities=30% Similarity=0.473 Sum_probs=11.9 Q ss_pred CCH-HHHHHHHH-HHHHHHHCC Q ss_conf 911-79999999-999854336 Q gi|254780374|r 1 MYR-YSFIVLCY-SSLLFGCHS 20 (238) Q Consensus 1 M~r-~~l~~l~~-~~lL~GCas 20 (238) |++ +.++.+++ +++|+||++ T Consensus 1 m~~~~r~~~~~~~~l~LsGC~s 22 (297) T PRK11189 1 MKPFLRWCFVATAALLLAGCAS 22 (297) T ss_pred CCHHHHHHHHHHHHHHHHHHCC T ss_conf 9518999999999999986405 No 51 >COG5510 Predicted small secreted protein [Function unknown] Probab=73.62 E-value=2.4 Score=22.06 Aligned_cols=20 Identities=35% Similarity=0.524 Sum_probs=12.7 Q ss_pred CCH---HHHHHHHHHHHHHHHCC Q ss_conf 911---79999999999854336 Q gi|254780374|r 1 MYR---YSFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r---~~l~~l~~~~lL~GCas 20 (238) |+| +++++++.+.+|+||.+ T Consensus 2 mk~t~l~i~~vll~s~llaaCNT 24 (44) T COG5510 2 MKKTILLIALVLLASTLLAACNT 24 (44) T ss_pred CHHHHHHHHHHHHHHHHHHHHHH T ss_conf 04899999999999999988663 No 52 >PRK10718 hypothetical protein; Provisional Probab=73.31 E-value=1.3 Score=23.80 Aligned_cols=21 Identities=29% Similarity=0.556 Sum_probs=17.1 Q ss_pred CCHHHHHHHHHHHHHHHHCCC Q ss_conf 911799999999998543367 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~ 21 (238) |+.+.++++...++|+||++- T Consensus 1 ~~~lr~~l~~~~llLsGCst~ 21 (191) T PRK10718 1 MKSLRLLLLALPLLLTGCSTM 21 (191) T ss_pred CCCCCCHHHHHHHHHHCCCCC T ss_conf 985310231224764065666 No 53 >COG3009 Uncharacterized protein conserved in bacteria [Function unknown] Probab=73.20 E-value=3.8 Score=20.76 Aligned_cols=29 Identities=17% Similarity=0.155 Sum_probs=18.9 Q ss_pred EEEEEEEEE--ECCCCCEEEEEEEEEEEECC Q ss_conf 999999999--80797289978877984188 Q gi|254780374|r 152 LLIAAIVTA--ILENGNLIISGSQEVRVNDE 180 (238) Q Consensus 152 ~~isa~V~~--VlpNGnL~I~G~k~i~vn~e 180 (238) -++.++|.+ =-|.|+-+|+|+=.+.-.++ T Consensus 115 ~~l~vtv~rF~gry~g~a~i~g~W~L~~~~G 145 (190) T COG3009 115 YKLNVTVQRFDGRYTGKAVISGEWRLLHDKG 145 (190) T ss_pred EEEEEEEEEECCCCCCCEEEEEEEEEECCCC T ss_conf 7999999621573468548999999745666 No 54 >CHL00010 infA translation initiation factor 1 Probab=73.12 E-value=5.7 Score=19.66 Aligned_cols=59 Identities=20% Similarity=0.338 Sum_probs=39.3 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCC--CCC----EEEHHEEEEEEEEEECE Q ss_conf 99999999807972899788779841887999999798477678--997----38242055439998020 Q gi|254780374|r 153 LIAAIVTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVD--AHN----SVSYDKIAEARISYGGK 216 (238) Q Consensus 153 ~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~--~~N----tV~S~~iAda~I~y~g~ 216 (238) .+..+|+|+|||+.+.|+=+ |+..-.-.+||-+|-.-|. ++- .++-..+--.||.|.-+ T Consensus 8 e~eG~V~e~Lpn~~F~V~Le-----ng~~Vla~~sGKmR~~~Iril~GD~V~VE~SpYDltkGRI~~R~k 72 (78) T CHL00010 8 IMEGLVTESLPNGMFRVRLD-----NGDLVLGYISGKIRRNSIRILPGDRVKVELSPYDLTKGRIIYRLR 72 (78) T ss_pred EEEEEEEEECCCCEEEEEEC-----CCCEEEEEECCHHEECEEEECCCCEEEEEECCCCCCCEEEEEEEC T ss_conf 99899999869988999978-----999999999805200318974899899998756788578999904 No 55 >TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ. Probab=73.11 E-value=3.6 Score=20.95 Aligned_cols=16 Identities=31% Similarity=0.565 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHHCC Q ss_conf 9999999999854336 Q gi|254780374|r 5 SFIVLCYSSLLFGCHS 20 (238) Q Consensus 5 ~l~~l~~~~lL~GCas 20 (238) ++++++++++|+||+. T Consensus 3 ~~~~~~~~~~l~gCs~ 18 (377) T TIGR03300 3 LALVIALAALLSGCSW 18 (377) T ss_pred CHHHHHHHHHHHHCCC T ss_conf 3899999999855657 No 56 >COG5645 Predicted periplasmic lipoprotein [General function prediction only] Probab=72.65 E-value=1.7 Score=22.95 Aligned_cols=26 Identities=35% Similarity=0.338 Sum_probs=16.3 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 911799999999998543367200256 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISEITG 27 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~~~~ 27 (238) |++ +++.+.+.++|+||++-+....+ T Consensus 1 mr~-i~l~l~v~lllSGC~SV~s~t~~ 26 (80) T COG5645 1 MRN-ILLSLMVLLLLSGCGSVISLTIP 26 (80) T ss_pred CCE-EHHHHHHHHHHCCCCEEEEECCC T ss_conf 915-32789999983764103774268 No 57 >PRK02944 OxaA-like protein precursor; Validated Probab=72.18 E-value=4.1 Score=20.58 Aligned_cols=21 Identities=19% Similarity=0.330 Sum_probs=12.1 Q ss_pred CCHHHHHHH---HHHHHHHHHCCC Q ss_conf 911799999---999998543367 Q gi|254780374|r 1 MYRYSFIVL---CYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l---~~~~lL~GCas~ 21 (238) |+|...++. .++++|+||+.. T Consensus 1 mkk~~~~~~~~~~~~~~lsgC~~~ 24 (255) T PRK02944 1 MKKKLGLLAMVVALMAILAGCSEV 24 (255) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 928999999999999999624688 No 58 >PRK10781 rcsF outer membrane lipoprotein; Reviewed Probab=72.08 E-value=6.6 Score=19.26 Aligned_cols=19 Identities=26% Similarity=0.457 Sum_probs=11.2 Q ss_pred CCHHHHHHHHHHHHHHHHCCC Q ss_conf 911799999999998543367 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~ 21 (238) |+.+.+.++ +++|+||+.. T Consensus 1 MR~lpi~ll--al~ltGCs~l 19 (133) T PRK10781 1 MRALPICLL--ALMLTGCSML 19 (133) T ss_pred CCHHHHHHH--HHHHHCCCCC T ss_conf 920899999--9998431344 No 59 >PRK11443 hypothetical protein; Provisional Probab=71.52 E-value=3.2 Score=21.23 Aligned_cols=18 Identities=28% Similarity=0.519 Sum_probs=12.4 Q ss_pred CCHHHHHHHHHHHHHHHHCC Q ss_conf 91179999999999854336 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas 20 (238) |+++++.+++ +||+||.. T Consensus 1 Mk~~~~~~~~--lll~GCq~ 18 (120) T PRK11443 1 MKKFIAPLLA--LLVSGCQI 18 (120) T ss_pred CHHHHHHHHH--HHHHCCCC T ss_conf 9158999999--99966678 No 60 >PRK13733 conjugal transfer protein TraV; Provisional Probab=71.29 E-value=0.42 Score=26.80 Aligned_cols=31 Identities=19% Similarity=0.310 Sum_probs=21.7 Q ss_pred EEEEEECCEEEEEEEEEECHHHCCCCCEEEH Q ss_conf 7798418879999997984776789973824 Q gi|254780374|r 173 QEVRVNDEIRSLNVTGIVRPQDVDAHNSVSY 203 (238) Q Consensus 173 k~i~vn~e~~~i~isGiVRp~DI~~~NtV~S 203 (238) +-++.-+....|=|.=+|..+|.=.+-++-+ T Consensus 127 ~P~R~~~~~~~~WIAP~vD~dd~~Hqp~~V~ 157 (171) T PRK13733 127 RPLRTGEQTAALWIAPYIDNQDVYHQPSSVF 157 (171) T ss_pred CCCCCHHHEEEEEEEEEECCCCCCCCCCEEE T ss_conf 8742430003467740373788603675699 No 61 >cd04451 S1_IF1 S1_IF1: Translation Initiation Factor IF1, S1-like RNA-binding domain. IF1 contains an S1-like RNA-binding domain, which is found in a wide variety of RNA-associated proteins. Translation initiation includes a number of interrelated steps preceding the formation of the first peptide bond. In Escherichia coli, the initiation mechanism requires, in addition to mRNA, fMet-tRNA, and ribosomal subunits, the presence of three additional proteins (initiation factors IF1, IF2, and IF3) and at least one GTP molecule. The three initiation factors influence both the kinetics and the stability of ternary complex formation. IF1 is the smallest of the three factors. IF1 enhances the rate of 70S ribosome subunit association and dissociation and the interaction of 30S ribosomal subunit with IF2 and IF3. It stimulates 30S complex formation. In addition, by binding to the A-site of the 30S ribosomal subunit, IF1 may contribute to the fidelity of the selection of the initiation site of th Probab=71.24 E-value=8 Score=18.75 Aligned_cols=39 Identities=18% Similarity=0.206 Sum_probs=27.3 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCC Q ss_conf 99999999807972899788779841887999999798477678 Q gi|254780374|r 153 LIAAIVTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVD 196 (238) Q Consensus 153 ~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~ 196 (238) .+...|+|+||||++.|+=+ |+..-.-.+||-+|-.-|. T Consensus 2 e~~G~V~e~lpn~~F~V~Le-----ng~~v~a~~sGKmr~~~Ir 40 (64) T cd04451 2 EMEGVVTEALPNAMFRVELE-----NGHEVLAHISGKMRMNYIR 40 (64) T ss_pred EEEEEEEEECCCCEEEEEEC-----CCCEEEEEECCHHEEEEEE T ss_conf 78899999879988999978-----9989999972330412698 No 62 >COG5461 Type IV pili component [Cell motility and secretion] Probab=70.99 E-value=5.4 Score=19.82 Aligned_cols=20 Identities=25% Similarity=0.240 Sum_probs=14.2 Q ss_pred CCCCCCCCCCCCEEEEEEEE Q ss_conf 11232236768769999972 Q gi|254780374|r 71 LFKDSRALNVGDILTVDIRI 90 (238) Q Consensus 71 lf~D~RA~~VGDIiTV~i~E 90 (238) -|.|+-...--|+|+|.+-- T Consensus 74 gf~~~y~s~~a~~l~i~ip~ 93 (224) T COG5461 74 GFLDRYSSASADALHIQIPS 93 (224) T ss_pred HHHHHHHHCCCCCEEEECCC T ss_conf 99987643257716998158 No 63 >PRK10783 mltD membrane-bound lytic murein transglycosylase D; Provisional Probab=68.62 E-value=9.1 Score=18.40 Aligned_cols=17 Identities=47% Similarity=0.743 Sum_probs=10.1 Q ss_pred HHHHHHHHHHHCCCCCC Q ss_conf 99999998543367200 Q gi|254780374|r 8 VLCYSSLLFGCHSSISE 24 (238) Q Consensus 8 ~l~~~~lL~GCas~~~~ 24 (238) +|+.+++|+||.+...+ T Consensus 6 ~l~~~~ll~gcq~~~~~ 22 (449) T PRK10783 6 ILLASVLLVGCQSSKND 22 (449) T ss_pred HHHHHHHHHHCCCCCCC T ss_conf 99999999725578888 No 64 >PRK11548 hypothetical protein; Provisional Probab=68.18 E-value=3.3 Score=21.14 Aligned_cols=21 Identities=14% Similarity=0.134 Sum_probs=14.4 Q ss_pred CCHHHHHHHHHHHHHHHHCCC Q ss_conf 911799999999998543367 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~ 21 (238) |+.+....+++.++|+||+.- T Consensus 3 ~k~l~~a~ll~~lllsgCS~~ 23 (113) T PRK11548 3 CKTLTAAAAVLLMLTAGCSTL 23 (113) T ss_pred HHHHHHHHHHHHHHHCCCCCC T ss_conf 578899999999998011578 No 65 >pfam12262 Lipase_bact_N Bacterial virulence factor lipase N-terminal. This domain family is found in bacteria, and is typically between 258 and 271 amino acids in length. There are two conserved sequence motifs: DGT and DGWST. This family is the N-terminal region of bacterial virulence factor lipase. The N-terminal region contains a potential signalling sequence. Probab=67.98 E-value=7.4 Score=18.95 Aligned_cols=21 Identities=33% Similarity=0.426 Sum_probs=14.3 Q ss_pred CCHHHHHHHHHH-HHHHHHCCC Q ss_conf 911799999999-998543367 Q gi|254780374|r 1 MYRYSFIVLCYS-SLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~-~lL~GCas~ 21 (238) |||+++.++++. ++|+||..- T Consensus 1 MKk~~l~~~i~SAL~LaGCg~~ 22 (269) T pfam12262 1 MKKKFLALLLASALLLAGCGDD 22 (269) T ss_pred CCHHHHHHHHHHHHHHCCCCCC T ss_conf 9247999999998641115799 No 66 >pfam11153 DUF2931 Protein of unknown function (DUF2931). Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function. Probab=67.79 E-value=3.6 Score=20.96 Aligned_cols=19 Identities=16% Similarity=0.277 Sum_probs=13.9 Q ss_pred EEEEEEEEEECCCCCEEEE Q ss_conf 9999999998079728997 Q gi|254780374|r 152 LLIAAIVTAILENGNLIIS 170 (238) Q Consensus 152 ~~isa~V~~VlpNGnL~I~ 170 (238) ..-.-.|....|+|+.+|- T Consensus 129 ~~r~~~~~GLaPgG~v~vw 147 (202) T pfam11153 129 CYRDNIVLGLAPGGKVVVW 147 (202) T ss_pred EEEEEEEEEECCCCEEEEE T ss_conf 6354499997899889999 No 67 >PRK13731 conjugal transfer surface exclusion protein TraT; Provisional Probab=67.61 E-value=3 Score=21.47 Aligned_cols=21 Identities=24% Similarity=0.351 Sum_probs=13.8 Q ss_pred CCHHHHHHHHHH-HHHHHHCCC Q ss_conf 911799999999-998543367 Q gi|254780374|r 1 MYRYSFIVLCYS-SLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~-~lL~GCas~ 21 (238) |+++.+..++.+ ++|+||++. T Consensus 3 ~kk~~~~~~~~~~l~lsGC~a~ 24 (243) T PRK13731 3 TKKLMMVALVSSTLALSGCGAM 24 (243) T ss_pred HHHHHHHHHHHHHHHHCCHHHH T ss_conf 3775799999999997156876 No 68 >pfam08085 Entericidin Entericidin EcnA/B family. This family consists of the entericidin antidote/toxin peptides. The entericidin locus is activated in stationary phase under high osmolarity conditions by rho-S and simultaneously repressed by the osmoregulatory EnvZ/OmpR signal transduction pathway. The entericidin locus encodes tandem paralogous genes (ecnAB) and directs the synthesis of two small cell-envelope lipoproteins which can maintain plasmids in bacterial population by means of post-segregational killing. Probab=67.31 E-value=3.9 Score=20.74 Aligned_cols=20 Identities=25% Similarity=0.501 Sum_probs=11.5 Q ss_pred CCHHHHHHHHHH---HHHHHHCC Q ss_conf 911799999999---99854336 Q gi|254780374|r 1 MYRYSFIVLCYS---SLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~---~lL~GCas 20 (238) |+|++.++++++ +.|+||.| T Consensus 1 Mk~~~~~~~~l~~~~~~lagCNT 23 (42) T pfam08085 1 MKKLIALLLALLLLALVLAGCNT 23 (42) T ss_pred CHHHHHHHHHHHHHHHHHHCCHH T ss_conf 92369999999999999861222 No 69 >PRK11251 DNA-binding transcriptional activator OsmE; Provisional Probab=66.14 E-value=5 Score=20.03 Aligned_cols=23 Identities=13% Similarity=0.098 Sum_probs=16.9 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 91179999999999854336720 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~ 23 (238) |+..+|...+.+.+|+||+++.+ T Consensus 1 ~~~~~l~aaa~~~~LaGCt~~d~ 23 (109) T PRK11251 1 NMAGILSAAAVLTMLAGCTAYDR 23 (109) T ss_pred CCHHHHHHHHHHHHHHHCCCCCC T ss_conf 90258899999999971447767 No 70 >PRK12450 foldase protein PrsA; Reviewed Probab=65.47 E-value=3.9 Score=20.71 Aligned_cols=21 Identities=29% Similarity=0.262 Sum_probs=13.4 Q ss_pred CCHHHHHHH--HHHHHHHHHCCC Q ss_conf 911799999--999998543367 Q gi|254780374|r 1 MYRYSFIVL--CYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l--~~~~lL~GCas~ 21 (238) |+++++.++ +.+++|+||.+. T Consensus 4 mKK~~~~~~~~~svl~LaAC~s~ 26 (309) T PRK12450 4 MNKLITGVVTLATVVTLSACQSS 26 (309) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99999999999999999860589 No 71 >TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PMID:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al. Probab=64.88 E-value=3.3 Score=21.20 Aligned_cols=15 Identities=40% Similarity=0.713 Sum_probs=9.6 Q ss_pred HHHHHHHHHHHHCCC Q ss_conf 999999998543367 Q gi|254780374|r 7 IVLCYSSLLFGCHSS 21 (238) Q Consensus 7 ~~l~~~~lL~GCas~ 21 (238) ++++++++|+||++. T Consensus 5 ~~l~~~lllagC~s~ 19 (146) T TIGR03352 5 VLLAACLLLAGCSSA 19 (146) T ss_pred HHHHHHHHHHHCCCC T ss_conf 589999998532599 No 72 >TIGR02522 pilus_cpaD pilus (Caulobacter type) biogenesis lipoprotein CpaD; InterPro: IPR013361 Proteins in this entry consist of a pilus biogenesis protein, CpaD, from Caulobacter, and homologues in other bacteria, including three in the root nodule bacterium Bradyrhizobium japonicum. The molecular function of the homologues is not known.. Probab=64.62 E-value=7.5 Score=18.92 Aligned_cols=79 Identities=23% Similarity=0.214 Sum_probs=34.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCC--CCCCCCCCC Q ss_conf 99999999985433672002568753667777664444564455544444---4455761215775521--123223676 Q gi|254780374|r 6 FIVLCYSSLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSD---STKKSYSLWRDSHAAL--FKDSRALNV 80 (238) Q Consensus 6 l~~l~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~---~~~~~gSL~~~~~~~l--f~D~RA~~V 80 (238) .+++.+.++|+||++..+.. .+-..+.+++...|.......... ..+..+-|-.+....| |-+.--+.- T Consensus 2 ~~~~a~~~~LagC~~~~P~~------~~~~~p~d~~~~~PI~v~~~~~~~~l~v~~~~~Gl~a~~~~~l~~~l~~~~~~~ 75 (211) T TIGR02522 2 VLLLALTALLAGCASTDPEI------DVGALPDDYRQRHPIAVQEESKSIDLLVAPGDRGLSASQQDQLLGFLKDWSRAS 75 (211) T ss_pred HHHHHHHHHHHHCCCCCCCC------CCCCCCCCCCCCCCCEECCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 78988999975305888755------534323453458873551177013455448887679888999999999986305 Q ss_pred CCEEEEEEEE Q ss_conf 8769999972 Q gi|254780374|r 81 GDILTVDIRI 90 (238) Q Consensus 81 GDIiTV~i~E 90 (238) -+.|+|.+.. T Consensus 76 ~~~l~~~~p~ 85 (211) T TIGR02522 76 AQTLVVVIPS 85 (211) T ss_pred CCEEEEECCC T ss_conf 8537897588 No 73 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=63.99 E-value=5.5 Score=19.76 Aligned_cols=19 Identities=32% Similarity=0.485 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 7999999999985433672 Q gi|254780374|r 4 YSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 4 ~~l~~l~~~~lL~GCas~~ 22 (238) .++++++++++|+||++.. T Consensus 3 ~~~~~~~~~~~l~sCs~~~ 21 (235) T TIGR03302 3 LLILLLALLLLLAGCSSKK 21 (235) T ss_pred HHHHHHHHHHHHHHCCCCC T ss_conf 9999999999998738997 No 74 >PRK11023 hypothetical protein; Provisional Probab=63.65 E-value=4.7 Score=20.18 Aligned_cols=21 Identities=38% Similarity=0.475 Sum_probs=12.2 Q ss_pred CCHHHHH-HHHHHHHHHHHCCC Q ss_conf 9117999-99999998543367 Q gi|254780374|r 1 MYRYSFI-VLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~-~l~~~~lL~GCas~ 21 (238) |+.+..+ +++++++|+||++. T Consensus 1 mk~~~~l~~l~~~~~L~GC~~~ 22 (191) T PRK11023 1 MKALSPLAVLISALLLQGCVAA 22 (191) T ss_pred CCHHHHHHHHHHHHHHCCCHHH T ss_conf 9448899999999997268233 No 75 >PRK12442 translation initiation factor IF-1; Reviewed Probab=63.49 E-value=11 Score=17.78 Aligned_cols=63 Identities=17% Similarity=0.272 Sum_probs=43.4 Q ss_pred EEEEEEEEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCC--CCC----EEEHHEEEEEEEEEECEEEE Q ss_conf 999999999807972899788779841887999999798477678--997----38242055439998020331 Q gi|254780374|r 152 LLIAAIVTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVD--AHN----SVSYDKIAEARISYGGKGRT 219 (238) Q Consensus 152 ~~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~--~~N----tV~S~~iAda~I~y~g~G~~ 219 (238) -.+.-+|+|+|||+.+.|+=+ |+..-.-.+||-+|-.-|. ++. .++-..+--.||.|.-+..- T Consensus 7 ie~eG~V~e~lpn~~FrV~Le-----Ng~~Vla~~sGKmR~~~IrIl~GD~V~VElSpYDltkGRIvyR~k~~~ 75 (87) T PRK12442 7 IELDGIVDEVLPDSRYRVTLE-----NGVEVGAYASGRMRKHRIRILAGDRVTLELSPYDLTKGRINFRHKDER 75 (87) T ss_pred EEEEEEEEEECCCCEEEEEEC-----CCCEEEEEEECCEEEEEEEECCCCEEEEEECCCCCCCEEEEEEECCCC T ss_conf 998789999879987999977-----998999998550133027776899899998877788748998847888 No 76 >PRK06481 fumarate reductase flavoprotein subunit; Validated Probab=63.42 E-value=11 Score=17.77 Aligned_cols=41 Identities=24% Similarity=0.344 Sum_probs=20.2 Q ss_pred EEEEEEEEEEEEECCCCCEEEEEEEEEEEECCEEEEEEEEEE Q ss_conf 489999999999807972899788779841887999999798 Q gi|254780374|r 149 KLNLLIAAIVTAILENGNLIISGSQEVRVNDEIRSLNVTGIV 190 (238) Q Consensus 149 ~l~~~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiV 190 (238) .++....+.|+++... +=.|.|-+--.-+++.+.+.-.+|| T Consensus 203 gv~i~~~t~~~~L~~~-dG~V~GV~~~~~~g~~~~i~akaVI 243 (506) T PRK06481 203 KIPLFVNADVTKITEK-DGKVTGVKVKINGKETKTISSKAVV 243 (506) T ss_pred CCEEEECCEEEEEEEC-CCCEEEEEEEECCCEEEEEECCEEE T ss_conf 9879987783232305-8937899999689849999735699 No 77 >PRK10722 hypothetical protein; Provisional Probab=63.18 E-value=5.9 Score=19.57 Aligned_cols=20 Identities=20% Similarity=0.178 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHHHHHCCCCC Q ss_conf 79999999999854336720 Q gi|254780374|r 4 YSFIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 4 ~~l~~l~~~~lL~GCas~~~ 23 (238) ..+++.+.+++|+||+.... T Consensus 17 ~~~~~~~~~l~L~gC~~~~~ 36 (248) T PRK10722 17 RLWIAGLPCLALAGCVQNAN 36 (248) T ss_pred HHHHHHHHHHHHHHCCCCCC T ss_conf 99997149998863047888 No 78 >PRK06033 hypothetical protein; Validated Probab=61.93 E-value=12 Score=17.60 Aligned_cols=32 Identities=28% Similarity=0.336 Sum_probs=28.8 Q ss_pred EEEECCCCCEEEEEEEEEEEECCEEEEEEEEEEC Q ss_conf 9998079728997887798418879999997984 Q gi|254780374|r 158 VTAILENGNLIISGSQEVRVNDEIRSLNVTGIVR 191 (238) Q Consensus 158 V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVR 191 (238) =++++-||.|+=+|| +.+.++..-++++-||. T Consensus 50 pvdi~vN~~liArGE--VvV~~d~~gVriTeIv~ 81 (87) T PRK06033 50 EVKILANNHPIASGV--VLVDRNRIAVEVKRMLP 81 (87) T ss_pred CEEEEECCEEEEEEE--EEEECCEEEEEEEEEEC T ss_conf 489999999998885--99989888999999304 No 79 >TIGR02269 TIGR02269 Myxococcus xanthus paralogous lipoprotein family TIGR02269; InterPro: IPR011755 This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N-terminus; the others are predicted lipoproteins. The function is unknown.. Probab=61.65 E-value=5.7 Score=19.67 Aligned_cols=19 Identities=32% Similarity=0.667 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHCCCCCC Q ss_conf 9999999998543367200 Q gi|254780374|r 6 FIVLCYSSLLFGCHSSISE 24 (238) Q Consensus 6 l~~l~~~~lL~GCas~~~~ 24 (238) +++.++++||.||++..+. T Consensus 2 l~~~lla~ll~GCa~~~~~ 20 (228) T TIGR02269 2 LLLLLLALLLVGCASTAPK 20 (228) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 5799889998752057877 No 80 >TIGR02062 RNase_B exoribonuclease II; InterPro: IPR011804 This family consists of exoribonuclease II (RNase II), the product of the rnb gene, as found in a number of gamma proteobacteria. In Escherichia coli, it is one of eight different exoribonucleases. It is involved in mRNA degradation and tRNA precursor end processing.; GO: 0003723 RNA binding, 0008859 exoribonuclease II activity, 0006401 RNA catabolic process. Probab=60.46 E-value=4.6 Score=20.25 Aligned_cols=25 Identities=16% Similarity=0.252 Sum_probs=18.8 Q ss_pred CCCCCCCCCCEEEEEEEECCHHHHC Q ss_conf 2322367687699999724101000 Q gi|254780374|r 73 KDSRALNVGDILTVDIRIDDQAVFD 97 (238) Q Consensus 73 ~D~RA~~VGDIiTV~i~E~~~a~~~ 97 (238) .--+-++|||+|+|+|.|=-...+| T Consensus 635 kge~~YKi~D~~~v~LtEv~~~tRS 659 (664) T TIGR02062 635 KGEVVYKIGDVIEVVLTEVREETRS 659 (664) T ss_pred EEEEEEEEEEEEEEEEEEEECCCCC T ss_conf 1078986610699998454013450 No 81 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=60.12 E-value=6.3 Score=19.40 Aligned_cols=20 Identities=25% Similarity=0.517 Sum_probs=13.8 Q ss_pred CCHHHHHHHHHHHHHHHHCCC Q ss_conf 911799999999998543367 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~ 21 (238) |+|++++++++ +++.+|.+. T Consensus 1 Mkk~~~~~~~~-~~~~SCg~~ 20 (449) T TIGR03525 1 MKKYLVFAALV-VLVYSCGSG 20 (449) T ss_pred CCHHHHHHHHH-HHHHHCCCC T ss_conf 91248999999-987210589 No 82 >PRK03095 prsA peptidylprolyl isomerase; Reviewed Probab=60.04 E-value=7.4 Score=18.95 Aligned_cols=20 Identities=20% Similarity=0.238 Sum_probs=13.6 Q ss_pred CCHHHHHHHHHH-HHHHHHCC Q ss_conf 911799999999-99854336 Q gi|254780374|r 1 MYRYSFIVLCYS-SLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~-~lL~GCas 20 (238) |++.++.+.+++ +.|+||++ T Consensus 1 MKK~~la~~~~svl~LaaC~~ 21 (287) T PRK03095 1 MKKAMLALAATSVIALSACGT 21 (287) T ss_pred CCHHHHHHHHHHHHHHHHCCC T ss_conf 907899999999999985168 No 83 >TIGR02544 III_secr_YscJ type III secretion apparatus lipoprotein, YscJ/HrcJ family; InterPro: IPR003282 Secretion of virulence factors in Gram-negative bacteria involves transportation of the protein across two membranes to reach the cell exterior. There have been four secretion systems described in animal enteropathogens such as Salmonella and Yersinia, with further sequence similarities in plant pathogens like Ralstonia and Erwinia. The type III secretion system is of great interest as it is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. The protein subunits of the system are very similar to those of bacterial flagellar biosynthesis . However, while the latter forms a ring structure to allow secretion of flagellin and is an integral part of the flagellum itself, type III subunits in the outer membrane translocate secreted proteins through a channel-like structure. One of the outer membrane protein subunit families, termed "K" here for nomenclature purposes, aids in the structural assembly of the invasion complex . It is also described as a lipoprotein. Members of this family include the Salmonella PrgK and SsaJ genes, MxiJ from Shigella, YscJ from Yersinia, and from the plant enteropathogens NolT (Rhizobium) and HrcJ Erwinia). The flagellar M-ring protein FliF also shares a low level of similarity, presumably due to evolution of the type III secretion system from the flagellar biosynthetic pathway.; GO: 0009306 protein secretion. Probab=60.03 E-value=5.8 Score=19.63 Aligned_cols=18 Identities=22% Similarity=0.377 Sum_probs=14.8 Q ss_pred EEEEEEEEEEEEEECCCC Q ss_conf 148999999999980797 Q gi|254780374|r 148 EKLNLLIAAIVTAILENG 165 (238) Q Consensus 148 ~~l~~~isa~V~~VlpNG 165 (238) ..+.|-|+|+|.=|||.- T Consensus 125 s~IdGV~~ArVhVvlP~~ 142 (203) T TIGR02544 125 SQIDGVISARVHVVLPEN 142 (203) T ss_pred HHCCCEEEEEEEEEECCC T ss_conf 017926866799972167 No 84 >COG3017 LolB Outer membrane lipoprotein involved in outer membrane biogenesis [Cell envelope biogenesis, outer membrane] Probab=59.79 E-value=7.1 Score=19.09 Aligned_cols=21 Identities=29% Similarity=0.374 Sum_probs=12.1 Q ss_pred CCHHH-HHHHHHHHHHHHHCCC Q ss_conf 91179-9999999998543367 Q gi|254780374|r 1 MYRYS-FIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~-l~~l~~~~lL~GCas~ 21 (238) |++.. +++.+++++|+||+.. T Consensus 4 ~~~~~~~l~~~As~LL~aC~~~ 25 (206) T COG3017 4 MKRLLFLLLALASLLLTACTLT 25 (206) T ss_pred HHHHHHHHHHHHHHHHHHCCCC T ss_conf 7778789999999999756566 No 85 >TIGR03042 PS_II_psbQ_bact photosystem II protein PsbQ. This protein through the member sll1638 from Synechocystis sp. PCC 6803, was shown to be part of the cyanobacteria photosystem II. It is homologous to (but quite diverged from) the chloroplast PsbQ protein, called oxygen-evolving enhancer protein 3 (OEE3). We designate this cyanobacteria protein PsbQ by homology. Probab=59.69 E-value=9.2 Score=18.36 Aligned_cols=22 Identities=14% Similarity=0.378 Sum_probs=12.1 Q ss_pred CCHHHHHHHHHH-HHHHHHCCCC Q ss_conf 911799999999-9985433672 Q gi|254780374|r 1 MYRYSFIVLCYS-SLLFGCHSSI 22 (238) Q Consensus 1 M~r~~l~~l~~~-~lL~GCas~~ 22 (238) ++++.-++++++ .+|.+|++.. T Consensus 1 lRsi~s~~L~~~~~fLvsC~gp~ 23 (142) T TIGR03042 1 LRSLASLLLVLLLTFLVSCSGPA 23 (142) T ss_pred CHHHHHHHHHHHHHHHHHCCCCC T ss_conf 97699999999999998838998 No 86 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=58.84 E-value=8.2 Score=18.67 Aligned_cols=20 Identities=15% Similarity=0.092 Sum_probs=12.7 Q ss_pred CCHHHHHHH---HHHHHHHHHCC Q ss_conf 911799999---99999854336 Q gi|254780374|r 1 MYRYSFIVL---CYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l---~~~~lL~GCas 20 (238) |++.+++.. +.+++|+||++ T Consensus 1 MKKKl~~~~~~~~svl~LaaCs~ 23 (310) T PRK01326 1 MKKKLIAGAVTLLSVATLAACSK 23 (310) T ss_pred CCHHHHHHHHHHHHHHHHHHHCC T ss_conf 90889999999999999987169 No 87 >pfam07269 consensus Probab=58.77 E-value=6.2 Score=19.44 Aligned_cols=19 Identities=21% Similarity=0.424 Sum_probs=11.0 Q ss_pred CCHHHHHHHHHHHHHHHHCCCC Q ss_conf 9117999999999985433672 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~ 22 (238) |++.++++++ .|+||.+.- T Consensus 1 MKy~ll~l~l---~LAgC~T~d 19 (55) T pfam07269 1 MKYCLLLLIL---ALAGCQTND 19 (55) T ss_pred CHHHHHHHHH---HHHHCCCCC T ss_conf 9014999999---987511467 No 88 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=58.40 E-value=7.7 Score=18.85 Aligned_cols=14 Identities=14% Similarity=0.145 Sum_probs=9.2 Q ss_pred HHHHHHHHHCCCCC Q ss_conf 99999854336720 Q gi|254780374|r 10 CYSSLLFGCHSSIS 23 (238) Q Consensus 10 ~~~~lL~GCas~~~ 23 (238) +.+++|+||++... T Consensus 14 ~~~l~LaACSS~~~ 27 (173) T PRK10802 14 LPVMAIAACSSNKN 27 (173) T ss_pred HHHHHHHHCCCCCC T ss_conf 99999985579998 No 89 >PRK02463 OxaA-like protein precursor; Provisional Probab=58.23 E-value=10 Score=18.11 Aligned_cols=14 Identities=29% Similarity=0.396 Sum_probs=8.9 Q ss_pred HHHHHHHHHHHCCC Q ss_conf 99999998543367 Q gi|254780374|r 8 VLCYSSLLFGCHSS 21 (238) Q Consensus 8 ~l~~~~lL~GCas~ 21 (238) ++.++++|+||.+. T Consensus 14 ~~~~~l~LsgC~~~ 27 (307) T PRK02463 14 ALSMLLTLTGCVGR 27 (307) T ss_pred HHHHHHHHHCCCCC T ss_conf 99999999634689 No 90 >PRK10533 putative lipoprotein; Provisional Probab=57.94 E-value=7.4 Score=18.98 Aligned_cols=21 Identities=24% Similarity=0.355 Sum_probs=14.6 Q ss_pred CCHHHH-HHHHHHHHHHHHCCC Q ss_conf 911799-999999998543367 Q gi|254780374|r 1 MYRYSF-IVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l-~~l~~~~lL~GCas~ 21 (238) |++..| +++-++++|++|++. T Consensus 1 Mr~~~~~ll~PlallLSACttV 22 (171) T PRK10533 1 MRYSKLALLLPCALLLSACTTV 22 (171) T ss_pred CCCCHHHHHHHHHHHHHHHCCC T ss_conf 9862038999999999872247 No 91 >pfam05590 DUF769 Xylella fastidiosa protein of unknown function (DUF769). This family consists of several uncharacterized hypothetical proteins of unknown function from Xylella fastidiosa, the organism that causes Pierce's disease in plants. Probab=57.87 E-value=12 Score=17.54 Aligned_cols=18 Identities=50% Similarity=0.635 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHHHHHCC Q ss_conf 179999999999854336 Q gi|254780374|r 3 RYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 3 r~~l~~l~~~~lL~GCas 20 (238) ++...++++.+||+||++ T Consensus 6 rL~~SllaAslLLagCSs 23 (279) T pfam05590 6 RLSCSLLAASLLLAGCSS 23 (279) T ss_pred CHHHHHHHHHHHHHCCCC T ss_conf 401699999999723788 No 92 >pfam09533 DUF2380 Predicted lipoprotein of unknown function (DUF2380). This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N-terminus; the others are predicted lipoproteins. The function is unknown. Probab=56.89 E-value=11 Score=17.90 Aligned_cols=23 Identities=30% Similarity=0.505 Sum_probs=14.6 Q ss_pred CCH-HHHHHHHHHHHHHHHCCCCC Q ss_conf 911-79999999999854336720 Q gi|254780374|r 1 MYR-YSFIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 1 M~r-~~l~~l~~~~lL~GCas~~~ 23 (238) |++ +.+.++++++++.||++..+ T Consensus 1 ~r~~l~~~l~vLal~~~GCasaa~ 24 (220) T pfam09533 1 MRRALVLWLLVLALLWVGCASAAP 24 (220) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 906899999999999845015789 No 93 >pfam06604 consensus Probab=56.34 E-value=15 Score=17.00 Aligned_cols=17 Identities=24% Similarity=0.231 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHHHHHCC Q ss_conf 79999999999854336 Q gi|254780374|r 4 YSFIVLCYSSLLFGCHS 20 (238) Q Consensus 4 ~~l~~l~~~~lL~GCas 20 (238) .+|+.+.++++|+||.+ T Consensus 7 ~~~~~~~~v~~laGCq~ 23 (181) T pfam06604 7 GLVAGLAAVLVLAGCQR 23 (181) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 78999999999734453 No 94 >PRK10523 lipoprotein involved with copper homeostasis and adhesion; Provisional Probab=56.09 E-value=11 Score=17.83 Aligned_cols=24 Identities=21% Similarity=0.362 Sum_probs=15.2 Q ss_pred CCHHHHHHHHHHH--HHHHHCCCCCC Q ss_conf 9117999999999--98543367200 Q gi|254780374|r 1 MYRYSFIVLCYSS--LLFGCHSSISE 24 (238) Q Consensus 1 M~r~~l~~l~~~~--lL~GCas~~~~ 24 (238) |+|+++.+++++. .|.||..+.+. T Consensus 2 mkk~~~~~~~a~g~~~l~GC~~r~~~ 27 (234) T PRK10523 2 MKKAILTALAAVGLFALMGCNNRAEV 27 (234) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 05899999999999987634665434 No 95 >TIGR02238 recomb_DMC1 meiotic recombinase Dmc1; InterPro: IPR011940 This entry describes DMC1, a subfamily of a larger family of DNA repair and recombination proteins. It is eukaryotic only and most closely related to eukaryotic RAD51. It also resembles archaeal RadA (IPR011938 from INTERPRO) and RadB (IPR011939 from INTERPRO) and bacterial RecA. It has been characterised for human as a recombinase active only in meiosis .; GO: 0003677 DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0007131 meiotic recombination, 0005634 nucleus. Probab=56.08 E-value=1.9 Score=22.66 Aligned_cols=31 Identities=23% Similarity=0.367 Sum_probs=22.9 Q ss_pred HHEEEEEEEEEECEEEECCCCCCCHHHHHHHHH Q ss_conf 420554399980203312334866689999985 Q gi|254780374|r 203 YDKIAEARISYGGKGRTTELLRPPIGHQLIENL 235 (238) Q Consensus 203 S~~iAda~I~y~g~G~~~d~~~~gwl~r~~~~i 235 (238) -+=||..|+-|.|||.|++-|+. |.+.|..| T Consensus 199 DSI~alFRVDfsGRGELseRQQK--L~~~l~~L 229 (314) T TIGR02238 199 DSIMALFRVDFSGRGELSERQQK--LAQMLSRL 229 (314) T ss_pred HHHHHHHCCCCCCCCCCHHHHHH--HHHHHHHH T ss_conf 20255413121257640278899--99999998 No 96 >pfam06788 UPF0257 Uncharacterized protein family (UPF0257). Probab=55.48 E-value=7.9 Score=18.79 Aligned_cols=21 Identities=24% Similarity=0.567 Sum_probs=13.8 Q ss_pred CCHHHHHHHHHHHHHHHHCCCC Q ss_conf 9117999999999985433672 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~ 22 (238) |+|..++.++ +++|+||.-.. T Consensus 1 ~k~~~~~~~l-a~~LsgCD~~~ 21 (236) T pfam06788 1 VKKYLLVMLL-ALVLTGCDNPD 21 (236) T ss_pred CCCHHHHHHH-HHHEECCCCCC T ss_conf 9401368878-77212557888 No 97 >COG3126 Uncharacterized protein conserved in bacteria [Function unknown] Probab=54.84 E-value=16 Score=16.85 Aligned_cols=83 Identities=23% Similarity=0.202 Sum_probs=43.5 Q ss_pred CCH-HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 911-7999999999985433672002568753667777664444564455544444445576121577552112322367 Q gi|254780374|r 1 MYR-YSFIVLCYSSLLFGCHSSISEITGIPQMSPMGSSLDENNRMPFLGIDFKNSDSTKKSYSLWRDSHAALFKDSRALN 79 (238) Q Consensus 1 M~r-~~l~~l~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~~~p~~~~~~~~~~~~~~~gSL~~~~~~~lf~D~RA~~ 79 (238) |+. .+.+.++++++|++|+....++ ++|. . ...+......... -.+.-||..+-|.- T Consensus 1 Mk~~~~~~g~~~~~~La~c~~~~~~~-----~~p~--~--------~~~~~~~~~a~~s-------v~G~V~yReriALP 58 (158) T COG3126 1 MKLVHIPIGLLAAILLAACASKSADI-----PAPA--S--------LTATQQSAAAQKS-------VSGTVLYRERIALP 58 (158) T ss_pred CCCEEHHHHHHHHHHHHHCCCCCCCC-----CCCC--E--------EECCCCCCCCCCC-------CCCCEEEEEEECCC T ss_conf 97032146789999987531245678-----8863--0--------3124544243322-------12416998875179 Q ss_pred CCCEEEEEEEECCHHHHCCCCCCCCC Q ss_conf 68769999972410100001222345 Q gi|254780374|r 80 VGDILTVDIRIDDQAVFDNQTGRSRN 105 (238) Q Consensus 80 VGDIiTV~i~E~~~a~~~~~~~~~r~ 105 (238) .|-.|||.+..-+-|-.-+..-.+++ T Consensus 59 p~AvltV~L~DvSlADaPsrvla~~t 84 (158) T COG3126 59 PGAVLTVTLSDVSLADAPSRVLAEQT 84 (158) T ss_pred CCCEEEEEEEECCCCCCHHHHHHHHE T ss_conf 99789999975121247367665432 No 98 >PRK11372 lysozyme inhibitor; Provisional Probab=54.84 E-value=8.9 Score=18.46 Aligned_cols=18 Identities=33% Similarity=0.912 Sum_probs=11.1 Q ss_pred CCHHHHHHHHHHHHHHHHCC Q ss_conf 91179999999999854336 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas 20 (238) |+++ ++.++.++|+||+- T Consensus 3 MKkl--l~~~~~llLsGCs~ 20 (109) T PRK11372 3 MKKL--LIICLPVLLTGCSA 20 (109) T ss_pred HHHH--HHHHHHHHHHCCCC T ss_conf 4889--99999999704543 No 99 >PRK10510 putative outer membrane lipoprotein; Provisional Probab=54.02 E-value=9.3 Score=18.34 Aligned_cols=20 Identities=15% Similarity=0.464 Sum_probs=11.7 Q ss_pred CCHHHHHHHHH---HHHHHHHCC Q ss_conf 91179999999---999854336 Q gi|254780374|r 1 MYRYSFIVLCY---SSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~---~~lL~GCas 20 (238) |+|.++++.++ +++|+||++ T Consensus 1 m~~r~~lia~~~a~~l~lsgC~t 23 (219) T PRK10510 1 MKKRVYLIAAIVSGALAVSGCTT 23 (219) T ss_pred CCHHHHHHHHHHHHHHHHHCCCC T ss_conf 94268999999999987413567 No 100 >PRK10397 hypothetical protein; Provisional Probab=53.72 E-value=8.9 Score=18.47 Aligned_cols=18 Identities=28% Similarity=0.431 Sum_probs=12.3 Q ss_pred CCHHHHHHHHHHHHHHHHCC Q ss_conf 91179999999999854336 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas 20 (238) |+++++...++ .|+||+. T Consensus 1 mkk~~~~g~~l--~LsGCa~ 18 (137) T PRK10397 1 MKKLAIAGALM--LLAGCAE 18 (137) T ss_pred CCHHHHHHHHH--HHHCCCC T ss_conf 91246778999--9735324 No 101 >PRK11671 mltC murein transglycosylase C; Provisional Probab=53.08 E-value=11 Score=17.87 Aligned_cols=25 Identities=24% Similarity=0.300 Sum_probs=14.5 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9117999999999985433672002 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISEI 25 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~~ 25 (238) |++.++++++++.||.+|++..... T Consensus 1 ~~~k~~~~~~~~~~l~~c~~~~~~~ 25 (360) T PRK11671 1 MMKKYLALALIAPLLISCSSSKKGD 25 (360) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9146889999999998710468887 No 102 >PRK13835 conjugal transfer protein TrbH; Provisional Probab=52.87 E-value=15 Score=17.07 Aligned_cols=21 Identities=29% Similarity=0.499 Sum_probs=13.3 Q ss_pred CCHHHHHHHHHHHHHHHHCCCC Q ss_conf 9117999999999985433672 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~ 22 (238) |+|+ +..+++.++|+||.+.. T Consensus 1 mr~l-l~~~i~a~~lsgCqta~ 21 (144) T PRK13835 1 MRRL-LAACILALLLSGCQTAT 21 (144) T ss_pred CHHH-HHHHHHHHHHHHCCCCC T ss_conf 9559-99999999864041368 No 103 >COG3521 Predicted component of the type VI protein secretion system [Intracellular trafficking, secretion, and vesicular transport] Probab=52.77 E-value=17 Score=16.64 Aligned_cols=22 Identities=32% Similarity=0.375 Sum_probs=11.9 Q ss_pred CCH--HHHHHHHHHHHHHHHCCCC Q ss_conf 911--7999999999985433672 Q gi|254780374|r 1 MYR--YSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 1 M~r--~~l~~l~~~~lL~GCas~~ 22 (238) |.+ ..+..+++.++++||++.. T Consensus 1 M~~~~~a~~~l~al~~~sgCsss~ 24 (159) T COG3521 1 MNSSRKAVLALFALLVLSGCSSSK 24 (159) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCC T ss_conf 942577899999999852412588 No 104 >pfam10671 TcpQ Toxin co-regulated pilus biosynthesis protein Q. The toxin-coregulated pilus (TCP) of Vibrio cholerae and the soluble TcpF protein that is secreted via the TCP biogenesis apparatus are essential for intestinal colonisation in the disease of cholera. TcpQ is part of an outer membrane complex of the TCP biogenesis apparatus, comprised of TcpC and TcpQ, and the TcpQ is required for proper localisation of TcpC to the outer membrane. Probab=52.59 E-value=4.5 Score=20.36 Aligned_cols=22 Identities=32% Similarity=0.483 Sum_probs=17.2 Q ss_pred CCHHHHHHHHHHHHHHHHCCCC Q ss_conf 9117999999999985433672 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~ 22 (238) |+|-.++..|++++++||++-. T Consensus 1 mkknyii~~~iaii~sgcssie 22 (178) T pfam10671 1 MKKNYIIGACIAIILSGCSSIE 22 (178) T ss_pred CCCCEEEHHHHHHHHCCCCCHH T ss_conf 9752431545999965763023 No 105 >COG4380 Uncharacterized protein conserved in bacteria [Function unknown] Probab=52.41 E-value=15 Score=16.97 Aligned_cols=22 Identities=14% Similarity=0.125 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHHHHHCCCCCC Q ss_conf 1799999999998543367200 Q gi|254780374|r 3 RYSFIVLCYSSLLFGCHSSISE 24 (238) Q Consensus 3 r~~l~~l~~~~lL~GCas~~~~ 24 (238) +.+++.+..+++|+||+.+..+ T Consensus 2 ~~l~~~~~~vl~ls~c~~~~a~ 23 (216) T COG4380 2 KPLILGLAAVLALSACQVQKAP 23 (216) T ss_pred HHHHHHHHHHHHHHHCCCCCCC T ss_conf 0689999999998653577676 No 106 >TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW; InterPro: IPR013360 Proteins in this entry are designated PilF and PilW . This outer membrane protein is required both for pilus stability and for pilus functions such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.. Probab=51.47 E-value=11 Score=17.78 Aligned_cols=15 Identities=27% Similarity=0.379 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999985433672 Q gi|254780374|r 8 VLCYSSLLFGCHSSI 22 (238) Q Consensus 8 ~l~~~~lL~GCas~~ 22 (238) .++++++|+||++.. T Consensus 2 ~l~~~~~l~GC~~~~ 16 (247) T TIGR02521 2 ALVLLLALTGCVTTP 16 (247) T ss_pred HHHHHHHHHCCCCCC T ss_conf 887846785157887 No 107 >PRK02939 hypothetical protein; Reviewed Probab=51.33 E-value=14 Score=17.21 Aligned_cols=22 Identities=27% Similarity=0.317 Sum_probs=13.5 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 91179999999999854336720 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~ 23 (238) |++..++.+ ++++|+||--... T Consensus 1 ~k~~~~~~~-l~~~L~gCD~~~a 22 (236) T PRK02939 1 MKYKLLPCL-LAILLTGCDRTEV 22 (236) T ss_pred CCCHHHHHH-HHHHHHCCCCCCC T ss_conf 962347899-9998762688887 No 108 >COG3218 ABC-type uncharacterized transport system, auxiliary component [General function prediction only] Probab=51.28 E-value=17 Score=16.65 Aligned_cols=22 Identities=23% Similarity=0.369 Sum_probs=16.0 Q ss_pred CCCEEEHHEEEEEEEEEECEEE Q ss_conf 9973824205543999802033 Q gi|254780374|r 197 AHNSVSYDKIAEARISYGGKGR 218 (238) Q Consensus 197 ~~NtV~S~~iAda~I~y~g~G~ 218 (238) .|++|-++|+-++.+-..|+|. T Consensus 157 ~n~~v~A~r~F~a~~pv~g~~~ 178 (205) T COG3218 157 RNGTVRASRVFRASQPVDGKGN 178 (205) T ss_pred CCCCEEEEEEEEEEECCCCCCC T ss_conf 8886888777898601468983 No 109 >pfam02402 Lysis_col Lysis protein. These small bacterial proteins are required for colicin release and partial cell lysis. This family contains lysis proteins for several different forms of colicin. Bacillus subtilis lytA has been included in this family, the similarity is not highly significant, however it is also a short protein, that is involved in secretion of other proteins (Bateman A pers. obs.). Probab=51.02 E-value=8.8 Score=18.48 Aligned_cols=20 Identities=30% Similarity=0.519 Sum_probs=12.7 Q ss_pred CCHHHHHHHH-HHHHHHHHCC Q ss_conf 9117999999-9999854336 Q gi|254780374|r 1 MYRYSFIVLC-YSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~-~~~lL~GCas 20 (238) |+|++++.++ +.++|+||-. T Consensus 1 Mkki~~~~i~~~~~~L~aCQa 21 (46) T pfam02402 1 MKKILFIGILLLTVLLSACQA 21 (46) T ss_pred CCEEEEHHHHHHHHHHHHHHH T ss_conf 924201399999999998555 No 110 >COG4607 CeuA ABC-type enterochelin transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=50.61 E-value=13 Score=17.42 Aligned_cols=22 Identities=27% Similarity=0.609 Sum_probs=18.0 Q ss_pred CCHHHHHHHHHHHHHHHHCCCC Q ss_conf 9117999999999985433672 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~ 22 (238) |.++.++..+++++|.+|.+.. T Consensus 2 ~~~~~~~a~~~~lll~ac~~~~ 23 (320) T COG4607 2 MLRLLYLAACVALLLTACSPNS 23 (320) T ss_pred CCHHHHHHHHHHHHHHHHCCCC T ss_conf 4121699999999998726786 No 111 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=50.35 E-value=11 Score=17.94 Aligned_cols=21 Identities=24% Similarity=0.430 Sum_probs=12.6 Q ss_pred CCHHHHHHHHHH--HHHHHHCCC Q ss_conf 911799999999--998543367 Q gi|254780374|r 1 MYRYSFIVLCYS--SLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~--~lL~GCas~ 21 (238) |+++++.+.+++ +.|+||++. T Consensus 5 MKK~~l~~~~~~~~l~LaaCss~ 27 (298) T PRK04405 5 MKKWALAAASAGLLLSLAGCSSN 27 (298) T ss_pred HHHHHHHHHHHHHHHHHHHCCCC T ss_conf 89999999999999999871799 No 112 >PRK13613 lipoprotein LpqB; Provisional Probab=50.26 E-value=14 Score=17.24 Aligned_cols=20 Identities=35% Similarity=0.418 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 17999999999985433672 Q gi|254780374|r 3 RYSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 3 r~~l~~l~~~~lL~GCas~~ 22 (238) |..-++-|.+++|+||++-. T Consensus 2 ~~~~~~~~~~vllaGCasiP 21 (603) T PRK13613 2 RAAAYTGCGAVLLAGCASMP 21 (603) T ss_pred CHHHHHHHHHHEEECCCCCC T ss_conf 06665452440210113699 No 113 >PRK11162 mltA murein transglycosylase A; Provisional Probab=50.22 E-value=19 Score=16.38 Aligned_cols=30 Identities=20% Similarity=0.132 Sum_probs=17.6 Q ss_pred EEEEEEEECCCCCEEEEEEEEEEEECCEEE Q ss_conf 999999980797289978877984188799 Q gi|254780374|r 154 IAAIVTAILENGNLIISGSQEVRVNDEIRS 183 (238) Q Consensus 154 isa~V~~VlpNGnL~I~G~k~i~vn~e~~~ 183 (238) .=|-+-+-..+=-|.|+|+=.|++-++.+. T Consensus 164 ELaW~~dpvD~FfLqIQGSGrv~l~DG~~l 193 (362) T PRK11162 164 ELAYSNSLIDNFIMEVQGSGYVDFGDGDPL 193 (362) T ss_pred EEEEECCHHHHHHEEECCCEEEECCCCCEE T ss_conf 699848867645365417479984899889 No 114 >COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only] Probab=49.91 E-value=16 Score=16.79 Aligned_cols=26 Identities=19% Similarity=0.273 Sum_probs=17.3 Q ss_pred CCHHHHHHHHHHHH-HHHHCCCCCCCC Q ss_conf 91179999999999-854336720025 Q gi|254780374|r 1 MYRYSFIVLCYSSL-LFGCHSSISEIT 26 (238) Q Consensus 1 M~r~~l~~l~~~~l-L~GCas~~~~~~ 26 (238) |+-..++..|+.++ |+||+++....- T Consensus 1 mk~f~~~~~~~~~~~LagC~~~~~~~~ 27 (297) T COG4785 1 MKPFLRWCFVATALTLAGCSNQNETFW 27 (297) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 960477899999998775304667410 No 115 >TIGR02898 spore_YhcN_YlaJ sporulation lipoprotein, YhcN/YlaJ family; InterPro: IPR014247 This entry contains YhcN and YlaJ, which are predicted lipoproteins that have been detected as spore proteins but not vegetative proteins in Bacillus subtilis. Both appear to be expressed under control of the RNA polymerase sigma-G factor. The YlaJ-like members of this family have a low-complexity, strongly acidic, 40-residue C-terminal domain. . Probab=49.66 E-value=9.8 Score=18.20 Aligned_cols=18 Identities=39% Similarity=0.630 Sum_probs=11.4 Q ss_pred HHHHHHHHHHH--HHHCCCC Q ss_conf 99999999998--5433672 Q gi|254780374|r 5 SFIVLCYSSLL--FGCHSSI 22 (238) Q Consensus 5 ~l~~l~~~~lL--~GCas~~ 22 (238) ++++|+++++| +||+-.. T Consensus 2 l~~~ll~~~~l~~tgC~~~~ 21 (185) T TIGR02898 2 LFIILLLLLLLLATGCTNAQ 21 (185) T ss_pred HHHHHHHHHHHHHHCCCCCC T ss_conf 68899999999873134100 No 116 >PRK10838 spr putative outer membrane lipoprotein; Provisional Probab=48.61 E-value=19 Score=16.43 Aligned_cols=18 Identities=22% Similarity=0.364 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHHCCCCC Q ss_conf 999999999854336720 Q gi|254780374|r 6 FIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 6 l~~l~~~~lL~GCas~~~ 23 (238) +..++++++|++|++... T Consensus 14 ~~~~~~~~~l~ac~~~~~ 31 (188) T PRK10838 14 IPAIAVAVLLSACSSNNT 31 (188) T ss_pred HHHHHHHHHHHHHCCCCC T ss_conf 179999998886338985 No 117 >COG3168 PilP Tfp pilus assembly protein PilP [Cell motility and secretion / Intracellular trafficking and secretion] Probab=48.48 E-value=20 Score=16.21 Aligned_cols=39 Identities=18% Similarity=0.124 Sum_probs=19.8 Q ss_pred CCCCCHHEECEEEEEEEEEEEEEEECCCCCEEEEEEEEEEEECCE Q ss_conf 321100000001489999999999807972899788779841887 Q gi|254780374|r 137 ASSGKGSISRAEKLNLLIAAIVTAILENGNLIISGSQEVRVNDEI 181 (238) Q Consensus 137 ~~~g~G~~~~~~~l~~~isa~V~~VlpNGnL~I~G~k~i~vn~e~ 181 (238) .|+=.|..++...+.+-|.+ |.|...|.--+.+..|.+. T Consensus 97 ~~rlvGtm~~g~~~~A~i~~------~~~v~~V~vG~YlGqN~Gr 135 (170) T COG3168 97 TFRLVGTLKSGQGVSALIEA------PGGVYRVRVGQYLGQNYGR 135 (170) T ss_pred HEEEEEEECCCCCEEEEEEC------CCCEEEEEECCEEECCCCE T ss_conf 60367785278836899973------8926998501075024766 No 118 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=48.41 E-value=16 Score=16.93 Aligned_cols=20 Identities=25% Similarity=0.272 Sum_probs=10.6 Q ss_pred CCHHHH-H--HHHHHHHHHHHCC Q ss_conf 911799-9--9999999854336 Q gi|254780374|r 1 MYRYSF-I--VLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l-~--~l~~~~lL~GCas 20 (238) |++..+ + .++.+++|+||+. T Consensus 1 mkkK~i~~~~~~~svl~LaaC~~ 23 (285) T PRK03002 1 MRGKHIFIITALISILMLSACGQ 23 (285) T ss_pred CCHHHHHHHHHHHHHHHHHHHCC T ss_conf 96018999999999999998457 No 119 >PRK13861 type IV secretion system protein VirB9; Provisional Probab=48.23 E-value=17 Score=16.69 Aligned_cols=18 Identities=17% Similarity=0.176 Sum_probs=10.6 Q ss_pred CCHHHHHHHHHHHHHHHHC Q ss_conf 9117999999999985433 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCH 19 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCa 19 (238) |||.+++++++++ +..|. T Consensus 1 mmk~l~~~~~~~l-~~~~~ 18 (293) T PRK13861 1 MIKKLFLTLACLL-FAAIG 18 (293) T ss_pred CCHHHHHHHHHHH-HHHHH T ss_conf 9089999999999-86126 No 120 >PRK13614 lipoprotein LpqB; Provisional Probab=47.96 E-value=21 Score=16.16 Aligned_cols=15 Identities=13% Similarity=0.308 Sum_probs=9.5 Q ss_pred HHHHHHHHHHHHCCC Q ss_conf 999999998543367 Q gi|254780374|r 7 IVLCYSSLLFGCHSS 21 (238) Q Consensus 7 ~~l~~~~lL~GCas~ 21 (238) ++++++++|+||++- T Consensus 13 ll~~~~v~LagCasi 27 (573) T PRK13614 13 LLVLLGVTLSACAQI 27 (573) T ss_pred HHHHHHHHHHHHCCC T ss_conf 999999886540469 No 121 >TIGR03431 PhnD phosphonate ABC transporter, periplasmic phosphonate binding protein. Note that this model does not identify all phnD-subfamily genes with evident phosphonate context, but all sequences above the trusted context may be inferred to bind phosphonate compounds even in the absence of such context. Furthermore, there is ample evidence to suggest that many other members of the TIGR01098 subfamily have a different primary function. Probab=47.37 E-value=15 Score=17.03 Aligned_cols=24 Identities=17% Similarity=0.292 Sum_probs=16.1 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 911799999999998543367200 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISE 24 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~ 24 (238) |+|..+++++++++|..|.+.-++ T Consensus 1 m~~r~l~~~~~~~~~~~~~a~a~~ 24 (288) T TIGR03431 1 MLRRLILSLVAAFMLISSNAQAED 24 (288) T ss_pred CCHHHHHHHHHHHHHHCCHHHHCC T ss_conf 908899999999999743224304 No 122 >pfam10447 EXOSC1 Exosome component EXOSC1/CSL4. This family of proteins are components of the exosome 3'-5' exoribonuclease complex. The exosome mediates degradation of unstable mRNAs that contain AU-rich elements (AREs) within their 3' untranslated regions. Probab=46.84 E-value=21 Score=16.06 Aligned_cols=42 Identities=29% Similarity=0.321 Sum_probs=23.4 Q ss_pred EEEEEEEEECCCC-CEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCC Q ss_conf 9999999980797-2899788779841887999999798477678997 Q gi|254780374|r 153 LIAAIVTAILENG-NLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHN 199 (238) Q Consensus 153 ~isa~V~~VlpNG-nL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~N 199 (238) .+.|+|+.+.+.= +..| +.+++..-.=.+.|+||-+||.+.. T Consensus 7 iV~arVtrv~~~~a~~~I-----l~v~~~~l~~~f~G~IR~~DVRate 49 (72) T pfam10447 7 IVLARVTRVTPRQAYCEI-----LAVGGTALKETFKGIIRKQDVRATE 49 (72) T ss_pred EEEEEEEEECHHEEEEEE-----EEECCEECCCCEEEEEEHHHCCCCC T ss_conf 999999997501579999-----9999998788555898947666564 No 123 >pfam06316 Ail_Lom Enterobacterial Ail/Lom protein. This family consists of several bacterial and phage Ail/Lom-like proteins. The Yersinia enterocolitica Ail protein is a known virulence factor. Proteins in this family are predicted to consist of eight transmembrane beta-sheets and four cell surface-exposed loops. It is thought that Ail directly promotes invasion and loop 2 contains an active site, perhaps a receptor-binding domain. The phage protein Lom is expressed during lysogeny, and encode host-cell envelope proteins. Lom is found in the bacterial outer membrane, and is homologous to virulence proteins of two other enterobacterial genera. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=46.75 E-value=17 Score=16.68 Aligned_cols=22 Identities=14% Similarity=0.114 Sum_probs=17.9 Q ss_pred CCHHHHHHHHHHHHHHHHCCCC Q ss_conf 9117999999999985433672 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~ 22 (238) |+|+...+|+++++|++|+... T Consensus 1 m~~~~~~~l~~~~~l~~~~~~a 22 (199) T pfam06316 1 MRKLCAAILSAAICLAAAGTPA 22 (199) T ss_pred CCHHHHHHHHHHHHHHHHCCCC T ss_conf 9017789999999999727731 No 124 >PRK11260 cystine transporter subunit; Provisional Probab=46.52 E-value=15 Score=16.96 Aligned_cols=20 Identities=15% Similarity=0.183 Sum_probs=15.4 Q ss_pred CCHHHHHHHHHHHHHHHHCC Q ss_conf 91179999999999854336 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas 20 (238) |.|.+|+.++.+.+++||++ T Consensus 4 ~~~~~~~~~l~~~~~~~~~~ 23 (264) T PRK11260 4 LGRQALMGVMAVALVAGMSV 23 (264) T ss_pred HHHHHHHHHHHHHHHHHHCC T ss_conf 67999999999999998357 No 125 >COG3015 CutF Uncharacterized lipoprotein NlpE involved in copper resistance [Cell envelope biogenesis, outer membrane / Inorganic ion transport and metabolism] Probab=45.93 E-value=14 Score=17.17 Aligned_cols=23 Identities=13% Similarity=0.313 Sum_probs=16.9 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 91179999999999854336720 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~ 23 (238) |+|.++.++++.+.|.||..+.+ T Consensus 2 vkk~i~tl~a~~l~l~gcnnr~~ 24 (178) T COG3015 2 VKKAIVTLMAVILTLMGCNNRAE 24 (178) T ss_pred HHHHHHHHHHHHHHHHCCCCHHH T ss_conf 07899999999999845776332 No 126 >COG0361 InfA Translation initiation factor 1 (IF-1) [Translation, ribosomal structure and biogenesis] Probab=45.75 E-value=22 Score=15.95 Aligned_cols=59 Identities=22% Similarity=0.313 Sum_probs=35.7 Q ss_pred EEEEEEEEEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECH--HHCCCCCEE----EHHEEEEEEEEEE Q ss_conf 999999999980797289978877984188799999979847--767899738----2420554399980 Q gi|254780374|r 151 NLLIAAIVTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRP--QDVDAHNSV----SYDKIAEARISYG 214 (238) Q Consensus 151 ~~~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp--~DI~~~NtV----~S~~iAda~I~y~ 214 (238) .+.+-++|+++|||+.+.|+= -|+....-+|+|-.|- -=|.++-.| .-.+.--++|.|. T Consensus 6 ~~e~~g~V~e~L~~~~f~v~~-----eng~~~~ahI~GKmr~~~i~I~~GD~V~Ve~~~~d~~kg~I~~R 70 (75) T COG0361 6 EIEMEGTVIEMLPNGRFRVEL-----ENGHERLAHISGKMRKNRIRILPGDVVLVELSPYDLTKGRIVYR 70 (75) T ss_pred CCEEEEEEEEECCCCEEEEEE-----CCCCEEEEECCCCCCEEEEEECCCCEEEEEECCCCCCCCCEEEE T ss_conf 428999999973897899994-----69969998825874400488579999999745665333558998 No 127 >COG3470 Tpd Uncharacterized protein probably involved in high-affinity Fe2+ transport [Inorganic ion transport and metabolism] Probab=45.53 E-value=21 Score=16.14 Aligned_cols=21 Identities=10% Similarity=0.039 Sum_probs=13.9 Q ss_pred CCHHHHHHHHHHHHHHHHCCC Q ss_conf 911799999999998543367 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~ 21 (238) |+|+++...+++++++.|+.. T Consensus 3 ~~k~l~~~~~~a~v~s~~a~~ 23 (179) T COG3470 3 MKKLLLSAAILASVFSAPAEG 23 (179) T ss_pred HHHHHHHHHHHHHHHHHHHCC T ss_conf 689999999999997410000 No 128 >COG4669 EscJ Type III secretory pathway, lipoprotein EscJ [Intracellular trafficking and secretion] Probab=45.40 E-value=14 Score=17.24 Aligned_cols=17 Identities=29% Similarity=0.311 Sum_probs=14.7 Q ss_pred EEEEEEEEEEEEEECCC Q ss_conf 14899999999998079 Q gi|254780374|r 148 EKLNLLIAAIVTAILEN 164 (238) Q Consensus 148 ~~l~~~isa~V~~VlpN 164 (238) ..+.|-+.|+|.=+||+ T Consensus 119 s~mDGVi~ArV~I~lp~ 135 (246) T COG4669 119 SKMDGVISARVHISLPE 135 (246) T ss_pred HHCCCEEEEEEEEECCC T ss_conf 75676588889997577 No 129 >pfam11839 DUF3359 Protein of unknown function (DUF3359). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are about 80 amino acids in length. Probab=44.59 E-value=16 Score=16.93 Aligned_cols=23 Identities=30% Similarity=0.468 Sum_probs=14.1 Q ss_pred CCHHH-HHHHHHHHHH-HHHCCCCC Q ss_conf 91179-9999999998-54336720 Q gi|254780374|r 1 MYRYS-FIVLCYSSLL-FGCHSSIS 23 (238) Q Consensus 1 M~r~~-l~~l~~~~lL-~GCas~~~ 23 (238) |++++ |..+.+..+| .||++..+ T Consensus 1 mkkllk~s~i~laallaggcaslse 25 (97) T pfam11839 1 MKKLLKFSAIALAALLAGGCASLSE 25 (97) T ss_pred CCHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 9348999999999999743123218 No 130 >pfam01298 Lipoprotein_5 Transferrin binding protein-like solute binding protein. This family of proteins are distantly related to other families of solute binding proteins. Probab=44.57 E-value=23 Score=15.84 Aligned_cols=10 Identities=30% Similarity=0.557 Sum_probs=7.0 Q ss_pred HHHHHHHHCC Q ss_conf 9999854336 Q gi|254780374|r 11 YSSLLFGCHS 20 (238) Q Consensus 11 ~~~lL~GCas 20 (238) +++||++|.. T Consensus 13 ~~~LLSAC~G 22 (589) T pfam01298 13 AAGLLSACSG 22 (589) T ss_pred HHHHHHHHCC T ss_conf 9999997346 No 131 >TIGR03516 ppisom_GldI peptidyl-prolyl isomerase, gliding motility-associated. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldI is a FKBP-type peptidyl-prolyl cis-trans isomerase (pfam00254) linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockout of this gene abolishes the gliding phenotype. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. This family is only found in Bacteroidetes containing the suite of genes proposed to confer the gliding motility phenotype. Probab=44.22 E-value=16 Score=16.88 Aligned_cols=19 Identities=21% Similarity=0.378 Sum_probs=13.5 Q ss_pred CCCCCCCCEEEEEEEECCH Q ss_conf 2236768769999972410 Q gi|254780374|r 75 SRALNVGDILTVDIRIDDQ 93 (238) Q Consensus 75 ~RA~~VGDIiTV~i~E~~~ 93 (238) -..-..||.|||.-.-..- T Consensus 83 ~~~P~~gd~V~~~Y~~~~l 101 (177) T TIGR03516 83 GTTPEFGDLVTFEYDIRAL 101 (177) T ss_pred CCCCCCCCEEEEEEEEEEC T ss_conf 8899999989999999976 No 132 >PRK05698 fliN flagellar motor switch protein; Validated Probab=43.59 E-value=24 Score=15.74 Aligned_cols=35 Identities=31% Similarity=0.515 Sum_probs=29.4 Q ss_pred EEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHH Q ss_conf 9998079728997887798418879999997984776 Q gi|254780374|r 158 VTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQD 194 (238) Q Consensus 158 V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~D 194 (238) =++|+=||.|+=+|+ +.+-++..=|||+-||.|.+ T Consensus 115 PvdI~VNg~liA~GE--VVVv~d~fGVRIteIisp~e 149 (155) T PRK05698 115 PLDVLVNGTLIAHGE--VVVVNEKFGIRLTDVISPSE 149 (155) T ss_pred CEEEEECCEEEEEEE--EEEECCEEEEEEEEECCHHH T ss_conf 548999999999886--99999977899998459899 No 133 >cd05695 S1_Rrp5_repeat_hs3 S1_Rrp5_repeat_hs3: Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in Saccharomyces cerevisiae Rrp5 and 14 S1 repeats in Homo sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions. Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes H. sapiens S1 repeat 3 (hs3). Rrp5 is found in eukaryotes but not in prokaryotes or archaea. Probab=43.56 E-value=24 Score=15.74 Aligned_cols=49 Identities=31% Similarity=0.374 Sum_probs=35.4 Q ss_pred EEEEEEEECCCCCEEEEEEEEEEEECCEEEE-EEEEEECHHHCCCCCE----EEHHEEEEEEEEEEC Q ss_conf 9999999807972899788779841887999-9997984776789973----824205543999802 Q gi|254780374|r 154 IAAIVTAILENGNLIISGSQEVRVNDEIRSL-NVTGIVRPQDVDAHNS----VSYDKIAEARISYGG 215 (238) Q Consensus 154 isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i-~isGiVRp~DI~~~Nt----V~S~~iAda~I~y~g 215 (238) +-++|.+|+|||-.+ +++ .++|.|.+-+.++.-. =.-.+-..|||-|.- T Consensus 4 V~a~V~kV~~~Gl~~-------------~fl~~F~G~Vd~~HL~~~~~~~~~Y~~~~~v~ArIL~Vd 57 (66) T cd05695 4 VNARVKKVLSNGLIL-------------DFLSSFTGTVDFLHLDPEKSSKSTYKEGQKVRARILYVD 57 (66) T ss_pred EEEEEEEECCCCEEE-------------EEECCEEEEEEHHHCCCCCCCCCCCCCCCEEEEEEEEEC T ss_conf 999999981794899-------------965128999887676877687000467988999999998 No 134 >pfam12046 DUF3529 Protein of unknown function (DUF3529). This family of proteins is functionally uncharacterized. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 176 to 190 amino acids in length. Probab=43.49 E-value=13 Score=17.39 Aligned_cols=28 Identities=18% Similarity=0.240 Sum_probs=11.7 Q ss_pred EECEEEEEEEEEEEEEEECCCCCEEEEEEE Q ss_conf 000014899999999998079728997887 Q gi|254780374|r 144 ISRAEKLNLLIAAIVTAILENGNLIISGSQ 173 (238) Q Consensus 144 ~~~~~~l~~~isa~V~~VlpNGnL~I~G~k 173 (238) +.|.|++...+-. .+--+--.+.|+|+| T Consensus 130 A~R~E~v~lkl~~--~~~~~~s~l~v~ahR 157 (173) T pfam12046 130 ASRPEQVELKLVE--ADDDGGSDLRVRAHR 157 (173) T ss_pred CCCCCEEEEEEEE--CCCCCCCEEEEEECH T ss_conf 7875018999984--689987369999639 No 135 >COG3133 SlyB Outer membrane lipoprotein [Cell envelope biogenesis, outer membrane] Probab=43.07 E-value=15 Score=17.04 Aligned_cols=21 Identities=24% Similarity=0.398 Sum_probs=14.7 Q ss_pred CCHHHHHHH-HHHHHHHHHCCC Q ss_conf 911799999-999998543367 Q gi|254780374|r 1 MYRYSFIVL-CYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l-~~~~lL~GCas~ 21 (238) |+|..+++- +.++.|.||++. T Consensus 1 M~k~~~avA~~~al~L~GCan~ 22 (154) T COG3133 1 MIKTALAVAALMALSLTGCANN 22 (154) T ss_pred CCHHHHHHHHHHHHHCCCCCCC T ss_conf 9425789999988740011058 No 136 >COG2980 RlpB Rare lipoprotein B [Cell envelope biogenesis, outer membrane] Probab=43.06 E-value=22 Score=15.92 Aligned_cols=28 Identities=25% Similarity=0.322 Sum_probs=16.8 Q ss_pred CCHHHHHHH-HHHHHHHHHCCCCCCCCCC Q ss_conf 911799999-9999985433672002568 Q gi|254780374|r 1 MYRYSFIVL-CYSSLLFGCHSSISEITGI 28 (238) Q Consensus 1 M~r~~l~~l-~~~~lL~GCas~~~~~~~~ 28 (238) |+++..+++ +++++|+||.=++...... T Consensus 1 M~~L~~~lL~~a~~~L~aCGFhLRg~~~i 29 (178) T COG2980 1 MRKLKTLLLLAAVLLLAACGFHLRGTTQI 29 (178) T ss_pred CCCHHHHHHHHHHHHHHHCCEEECCCCCC T ss_conf 90058999999999973345343178889 No 137 >TIGR01845 outer_NodT efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family; InterPro: IPR010131 Members of group are outer membrane lipoproteins from the NodT family of the RND (Resistance-Nodulation-cell Division) type efflux systems. These proteins work with an inner membrane ABC transporter ATPase and an adapter called a membrane fusion protein. Most members of this family are likely to export primarily small molecules rather than proteins, but are related to the type I protein secretion outer membrane proteins TolC and PrtF.; GO: 0005215 transporter activity, 0008289 lipid binding, 0006810 transport, 0016020 membrane. Probab=42.81 E-value=19 Score=16.37 Aligned_cols=11 Identities=27% Similarity=0.398 Sum_probs=8.3 Q ss_pred HHHHHHHHHCC Q ss_conf 99999854336 Q gi|254780374|r 10 CYSSLLFGCHS 20 (238) Q Consensus 10 ~~~~lL~GCas 20 (238) +++++|+||+. T Consensus 2 ~~a~~LsGCa~ 12 (477) T TIGR01845 2 VAALALSGCAV 12 (477) T ss_pred HHHHHHHHHCC T ss_conf 78899985206 No 138 >TIGR02567 YscW type III secretion system chaperone YscW; InterPro: IPR013400 This entry is encoded within type III secretion operons. The protein has been characterised as a chaperone for the outer membrane pore component YscC (IPR003522 from INTERPRO). YscW is a lipoprotein which is itself localized to the outer membrane and, it is believed, facilitates the oligomerisation and localization of YscC.. Probab=42.68 E-value=7.1 Score=19.07 Aligned_cols=13 Identities=54% Similarity=0.641 Sum_probs=9.6 Q ss_pred HHHHHHHHHHCCC Q ss_conf 9999998543367 Q gi|254780374|r 9 LCYSSLLFGCHSS 21 (238) Q Consensus 9 l~~~~lL~GCas~ 21 (238) +++++||+||++. T Consensus 2 l~~~lLL~GC~~s 14 (131) T TIGR02567 2 LILLLLLTGCVSS 14 (131) T ss_pred HHHHHHHHCCCCC T ss_conf 3541002100012 No 139 >PRK10796 LPS-assembly lipoprotein RlpB; Provisional Probab=42.32 E-value=22 Score=15.94 Aligned_cols=25 Identities=16% Similarity=0.267 Sum_probs=14.4 Q ss_pred CCHHHHHHHHHH-HHHHHHCCCCCCC Q ss_conf 911799999999-9985433672002 Q gi|254780374|r 1 MYRYSFIVLCYS-SLLFGCHSSISEI 25 (238) Q Consensus 1 M~r~~l~~l~~~-~lL~GCas~~~~~ 25 (238) |+.+..+++.++ ++++||.=++... T Consensus 1 ~r~l~~l~l~lavll~agCGFhLRg~ 26 (196) T PRK10796 1 MRYLATLLLSLAVLVTAGCGWHLRGT 26 (196) T ss_pred CHHHHHHHHHHHHHHHCCCCCEECCC T ss_conf 95699999999999974767157689 No 140 >COG4594 FecB ABC-type Fe3+-citrate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=41.78 E-value=21 Score=16.12 Aligned_cols=18 Identities=28% Similarity=0.347 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHHHCCC Q ss_conf 799999999998543367 Q gi|254780374|r 4 YSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 4 ~~l~~l~~~~lL~GCas~ 21 (238) +++.+++++++.+||++. T Consensus 8 ~i~~lll~lllva~C~~s 25 (310) T COG4594 8 IILTLLLLLLLVAACSSS 25 (310) T ss_pred HHHHHHHHHHHHHHHCCC T ss_conf 999999999999771476 No 141 >COG4238 Murein lipoprotein [Cell envelope biogenesis, outer membrane] Probab=40.59 E-value=19 Score=16.45 Aligned_cols=18 Identities=39% Similarity=0.307 Sum_probs=11.0 Q ss_pred HHHHHHHHHHHHHHHCCC Q ss_conf 799999999998543367 Q gi|254780374|r 4 YSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 4 ~~l~~l~~~~lL~GCas~ 21 (238) .++..++..++|+||++. T Consensus 7 ~l~Avvlg~lllAGc~s~ 24 (78) T COG4238 7 TLGAVVLGSLLLAGCSSN 24 (78) T ss_pred HHHHHHHHHHHHHHCCCH T ss_conf 278999877999701108 No 142 >pfam11873 DUF3393 Domain of unknown function (DUF3393). This domain is functionally uncharacterized. This domain is found in bacteria. This presumed domain is typically between 188 to 206 amino acids in length. This domain is found associated with pfam01464. Probab=40.42 E-value=22 Score=16.02 Aligned_cols=13 Identities=38% Similarity=0.516 Sum_probs=8.0 Q ss_pred HHHHHHHHHCCCC Q ss_conf 9999985433672 Q gi|254780374|r 10 CYSSLLFGCHSSI 22 (238) Q Consensus 10 ~~~~lL~GCas~~ 22 (238) ++++||+||+... T Consensus 8 ~~~llL~~Cs~~~ 20 (200) T pfam11873 8 LILLLLSGCSSQF 20 (200) T ss_pred HHHHHHHHCCCCC T ss_conf 9999984157765 No 143 >COG2834 LolA Outer membrane lipoprotein-sorting protein [Cell envelope biogenesis, outer membrane] Probab=38.87 E-value=19 Score=16.34 Aligned_cols=21 Identities=10% Similarity=0.124 Sum_probs=13.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCC Q ss_conf 117999999999985433672 Q gi|254780374|r 2 YRYSFIVLCYSSLLFGCHSSI 22 (238) Q Consensus 2 ~r~~l~~l~~~~lL~GCas~~ 22 (238) +++++++++++++++||+... T Consensus 6 ~~~~~~~~l~~~f~~~~~~~~ 26 (211) T COG2834 6 MKLLLALALLLLFLSACAQAG 26 (211) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999988887745 No 144 >PRK11530 hypothetical protein; Provisional Probab=38.82 E-value=17 Score=16.71 Aligned_cols=18 Identities=17% Similarity=0.242 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHHHHCCC Q ss_conf 799999999998543367 Q gi|254780374|r 4 YSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 4 ~~l~~l~~~~lL~GCas~ 21 (238) +.++++...++|+||+.+ T Consensus 3 ~~~~ll~sll~LsgCa~q 20 (176) T PRK11530 3 LRLLLLGSLLSLSACAQQ 20 (176) T ss_pred EEHHHHHHHHHHHHCCCC T ss_conf 003488899998642475 No 145 >PRK10175 hypothetical protein; Provisional Probab=38.22 E-value=16 Score=16.81 Aligned_cols=20 Identities=40% Similarity=0.522 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999854336720025 Q gi|254780374|r 7 IVLCYSSLLFGCHSSISEIT 26 (238) Q Consensus 7 ~~l~~~~lL~GCas~~~~~~ 26 (238) ++.+.+++|+||.+-+.+.. T Consensus 5 ~~~~~~lllsGC~SIms~t~ 24 (75) T PRK10175 5 VVSIMVTLLSGCGSIISRTI 24 (75) T ss_pred HHHHHHHHHCCCHHHHCCCC T ss_conf 88889999816424430557 No 146 >TIGR01781 Trep_dent_lipo Treponema denticola clustered lipoprotein; InterPro: IPR010103 This entry represents a family of six predicted lipoproteins from a region of about 20 tandemly arranged genes in the Treponema denticola genome. Two other neighbouring genes share the lipoprotein signal peptide region but do not show more extensive homology. The function of this locus is unknown.. Probab=38.02 E-value=24 Score=15.75 Aligned_cols=21 Identities=33% Similarity=0.812 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHHHHCCCCCC Q ss_conf 1799999999998543367200 Q gi|254780374|r 3 RYSFIVLCYSSLLFGCHSSISE 24 (238) Q Consensus 3 r~~l~~l~~~~lL~GCas~~~~ 24 (238) |++|+++ +|+||..|...... T Consensus 7 KlIFIL~-LAvLLFSC~ke~ke 27 (463) T TIGR01781 7 KLIFILM-LAVLLFSCKKEVKE 27 (463) T ss_pred HHHHHHH-HHHHHHCCCCCCCC T ss_conf 6888999-99986047621133 No 147 >TIGR01433 CyoA ubiquinol oxidase, subunit II; InterPro: IPR006333 Members of this domain catalyze the oxidation of ubiquinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. Subunit II is responsible for binding and oxidation of the ubiquinone substrate. This sequence is closely related to QoxA, which oxidizes quinol in Gram-positive bacteria but which is in complex with subunits which utilise cytochrome a in the reduction of molecular oxygen. Slightly more distantly related is subunit II of cytochrome c oxidase which uses cytochrome c as the oxidant . ; GO: 0016682 oxidoreductase activity acting on diphenols and related substances as donors oxygen as acceptor, 0006118 electron transport, 0016020 membrane. Probab=37.66 E-value=14 Score=17.22 Aligned_cols=13 Identities=38% Similarity=0.608 Sum_probs=8.4 Q ss_pred HHHHHHHHHHHHC Q ss_conf 9999999985433 Q gi|254780374|r 7 IVLCYSSLLFGCH 19 (238) Q Consensus 7 ~~l~~~~lL~GCa 19 (238) .++++.+||+||. T Consensus 2 ~~~~~~~lLsGCN 14 (228) T TIGR01433 2 SLILASLLLSGCN 14 (228) T ss_pred HHHHHHHHHHCCC T ss_conf 5778999861256 No 148 >PRK08983 fliN flagellar motor switch protein; Validated Probab=37.53 E-value=30 Score=15.15 Aligned_cols=35 Identities=23% Similarity=0.419 Sum_probs=28.8 Q ss_pred EEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHH Q ss_conf 9998079728997887798418879999997984776 Q gi|254780374|r 158 VTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQD 194 (238) Q Consensus 158 V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~D 194 (238) =++++-||.++=+|+ +.+-++..=+||+-||.|++ T Consensus 86 pvdI~VNg~liA~GE--vVvv~d~fGVRIteIvs~~e 120 (126) T PRK08983 86 PLDVMVNGTLIAHGE--VVVVNDKFGIRLTDVISQTE 120 (126) T ss_pred CEEEEECCEEEEEEE--EEEECCEEEEEEEEECCHHH T ss_conf 589999999999988--99999988999998459899 No 149 >TIGR01004 PulS_OutS lipoprotein, PulS/OutS family; InterPro: IPR005699 This family comprises lipoproteins from four gamma proteobacterial species: PulS protein of Klebsiella pneumoniae (P20440 from SWISSPROT), the OutS protein of Erwinia chrysanthemi (Q01567 from SWISSPROT) and Pectobacterium chrysanthemi, and the functionally uncharacterised E. coli protein EtpO. PulS and OutS have been shown to interact with and facilitate insertion of secretins into the outer membrane, suggesting a chaperone-like, or piloting function for members of this family.; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport. Probab=37.24 E-value=30 Score=15.12 Aligned_cols=15 Identities=33% Similarity=0.607 Sum_probs=9.7 Q ss_pred HHHHHHHHHHHHHCC Q ss_conf 999999999854336 Q gi|254780374|r 6 FIVLCYSSLLFGCHS 20 (238) Q Consensus 6 l~~l~~~~lL~GCas 20 (238) ++.+++++.|+||.+ T Consensus 12 ~~~~~~~v~lsgCqt 26 (136) T TIGR01004 12 VLSLLVVVLLSGCQT 26 (136) T ss_pred HHHHHHHHHHHHCCC T ss_conf 999999998640478 No 150 >PRK10957 iron-enterobactin transporter periplasmic binding protein; Provisional Probab=36.97 E-value=24 Score=15.72 Aligned_cols=19 Identities=26% Similarity=0.375 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 1799999999998543367 Q gi|254780374|r 3 RYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 3 r~~l~~l~~~~lL~GCas~ 21 (238) ++++++.+++++++||++. T Consensus 6 ~~~~~~~~~~~~~~g~~~~ 24 (319) T PRK10957 6 LYRLALLLLGLLLSGIAAA 24 (319) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 8999999999997478755 No 151 >COG2143 Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones] Probab=36.73 E-value=29 Score=15.17 Aligned_cols=24 Identities=25% Similarity=0.412 Sum_probs=14.8 Q ss_pred CCHHHH-HHHHHHHHHHHHCCCCCC Q ss_conf 911799-999999998543367200 Q gi|254780374|r 1 MYRYSF-IVLCYSSLLFGCHSSISE 24 (238) Q Consensus 1 M~r~~l-~~l~~~~lL~GCas~~~~ 24 (238) |+|.++ .++++.+++.+|++..+. T Consensus 1 ~mRvl~i~Lliis~fl~a~~s~~e~ 25 (182) T COG2143 1 VMRVLLIVLLIISLFLSACKSNNEK 25 (182) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCHH T ss_conf 9001999999999999997178044 No 152 >PRK01622 OxaA-like protein precursor; Validated Probab=36.51 E-value=31 Score=15.05 Aligned_cols=13 Identities=31% Similarity=0.501 Sum_probs=8.2 Q ss_pred HHHHHHHHHHHCC Q ss_conf 9999999854336 Q gi|254780374|r 8 VLCYSSLLFGCHS 20 (238) Q Consensus 8 ~l~~~~lL~GCas 20 (238) ++.++++|+||+. T Consensus 13 ~~~~~~~lsgc~~ 25 (266) T PRK01622 13 SLLLVFVLSGCSN 25 (266) T ss_pred HHHHHHHHHCCCC T ss_conf 9999999954689 No 153 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=36.36 E-value=28 Score=15.34 Aligned_cols=17 Identities=35% Similarity=0.407 Sum_probs=10.2 Q ss_pred HHHHHHHHHHHHCCCCC Q ss_conf 99999999854336720 Q gi|254780374|r 7 IVLCYSSLLFGCHSSIS 23 (238) Q Consensus 7 ~~l~~~~lL~GCas~~~ 23 (238) +++++++.|+||++... T Consensus 8 ~~~~~~~~l~~C~~~~~ 24 (289) T TIGR03659 8 LLALLSLGLTGCSSSKA 24 (289) T ss_pred HHHHHHHHHHHCCCCCC T ss_conf 99999999843289987 No 154 >pfam10566 Glyco_hydro_97 Glycoside hydrolase 97. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands. Probab=36.19 E-value=26 Score=15.54 Aligned_cols=16 Identities=25% Similarity=0.422 Sum_probs=11.5 Q ss_pred CCHHHHHHHHHHHHHH Q ss_conf 9117999999999985 Q gi|254780374|r 1 MYRYSFIVLCYSSLLF 16 (238) Q Consensus 1 M~r~~l~~l~~~~lL~ 16 (238) |+|+++++++++++++ T Consensus 1 mkk~~~~~~~~~~~~~ 16 (643) T pfam10566 1 MKKLTIILLAFLLLIG 16 (643) T ss_pred CCCHHHHHHHHHHHHH T ss_conf 9502578999999997 No 155 >PRK07963 fliN flagellar motor switch protein FliN; Validated Probab=36.09 E-value=31 Score=15.01 Aligned_cols=35 Identities=34% Similarity=0.494 Sum_probs=29.3 Q ss_pred EEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHH Q ss_conf 9998079728997887798418879999997984776 Q gi|254780374|r 158 VTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQD 194 (238) Q Consensus 158 V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~D 194 (238) =++++-||.|+=+|+ +.+-++..=+||+-||.|++ T Consensus 96 pvdI~VNg~liA~GE--VVvv~dk~GVRIteIisp~e 130 (137) T PRK07963 96 PLDILINGYLIAQGE--VVVVADKYGVRITDIITPSE 130 (137) T ss_pred CEEEEECCEEEEEEE--EEEECCEEEEEEEEECCHHH T ss_conf 569999999999987--99999878899998559899 No 156 >PRK11281 potassium efflux protein KefA; Provisional Probab=35.94 E-value=18 Score=16.48 Aligned_cols=13 Identities=31% Similarity=0.733 Sum_probs=9.1 Q ss_pred CCCCCCCCEEEEE Q ss_conf 2236768769999 Q gi|254780374|r 75 SRALNVGDILTVD 87 (238) Q Consensus 75 ~RA~~VGDIiTV~ 87 (238) -|..||||+|||- T Consensus 935 ErPiRiGD~vTi~ 947 (1107) T PRK11281 935 ERPVRIGDTVTIG 947 (1107) T ss_pred CCCCCCCCEEEEC T ss_conf 1671247668987 No 157 >PRK10929 hypothetical protein; Provisional Probab=35.90 E-value=18 Score=16.47 Aligned_cols=12 Identities=25% Similarity=0.764 Sum_probs=9.2 Q ss_pred CCCCCCCCEEEE Q ss_conf 223676876999 Q gi|254780374|r 75 SRALNVGDILTV 86 (238) Q Consensus 75 ~RA~~VGDIiTV 86 (238) -|..||||+||| T Consensus 933 ErPiRiGD~VTi 944 (1109) T PRK10929 933 EKPIRIGDTVTI 944 (1109) T ss_pred CCCCCCCCEEEE T ss_conf 167124766898 No 158 >COG5567 Predicted small periplasmic lipoprotein [Cell motility and secretion] Probab=35.79 E-value=32 Score=14.98 Aligned_cols=20 Identities=25% Similarity=0.207 Sum_probs=11.8 Q ss_pred CCHHHHHHHHHH--HHHHHHCC Q ss_conf 911799999999--99854336 Q gi|254780374|r 1 MYRYSFIVLCYS--SLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~--~lL~GCas 20 (238) |++.+...++++ +.|+||.- T Consensus 1 mk~~~~s~~ala~l~sLA~CG~ 22 (58) T COG5567 1 MKNVFKSLLALATLFSLAGCGL 22 (58) T ss_pred CHHHHHHHHHHHHHHHHHHCCC T ss_conf 9048999999999999985266 No 159 >PRK13684 Ycf48-like protein; Provisional Probab=35.69 E-value=32 Score=14.97 Aligned_cols=16 Identities=25% Similarity=0.619 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHHCC Q ss_conf 9999999999854336 Q gi|254780374|r 5 SFIVLCYSSLLFGCHS 20 (238) Q Consensus 5 ~l~~l~~~~lL~GCas 20 (238) ++++++++++|+||.+ T Consensus 11 l~l~~~~~~~~~~c~~ 26 (333) T PRK13684 11 LLLLLALGLVLSGCST 26 (333) T ss_pred HHHHHHHHHHHCCCCC T ss_conf 9999999986322356 No 160 >smart00108 B_lectin Bulb-type mannose-specific lectin. Probab=34.83 E-value=25 Score=15.67 Aligned_cols=18 Identities=39% Similarity=0.661 Sum_probs=13.6 Q ss_pred EEEEEECCCCCEEEEEEE Q ss_conf 999998079728997887 Q gi|254780374|r 156 AIVTAILENGNLIISGSQ 173 (238) Q Consensus 156 a~V~~VlpNGnL~I~G~k 173 (238) ..+..++++|||||.+.. T Consensus 87 ~~~~~L~d~GNlVl~~~~ 104 (114) T smart00108 87 NYVLVLLDDGNLVIYDSD 104 (114) T ss_pred CEEEEEECCCCEEEECCC T ss_conf 779999479779998799 No 161 >TIGR03511 GldH_lipo gliding motility-associated lipoprotein GldH. Members of this protein family are predicted lipoproteins, exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). Members include GldH, a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family may have gliding motility. Probab=34.83 E-value=16 Score=16.83 Aligned_cols=19 Identities=21% Similarity=0.382 Sum_probs=13.0 Q ss_pred CCHHHHHHHHHHHHHHHHC Q ss_conf 9117999999999985433 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCH 19 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCa 19 (238) |+|-.++.++++++|.+|. T Consensus 3 ~~~~~i~~~l~~~ll~SC~ 21 (156) T TIGR03511 3 LVRNSISFFLGACVLVSCT 21 (156) T ss_pred HHHHHHHHHHHHHHEEEEC T ss_conf 6854499999999707638 No 162 >PRK11616 hypothetical protein; Provisional Probab=34.82 E-value=31 Score=15.07 Aligned_cols=16 Identities=44% Similarity=0.627 Sum_probs=11.4 Q ss_pred HHHHHHCCCCCCCCCC Q ss_conf 9985433672002568 Q gi|254780374|r 13 SLLFGCHSSISEITGI 28 (238) Q Consensus 13 ~lL~GCas~~~~~~~~ 28 (238) ++++||++-+.+.++. T Consensus 15 l~~sGCSSvMtHtG~~ 30 (109) T PRK11616 15 LLLSGCSSVMSHTGGK 30 (109) T ss_pred HHHCCCHHHHCCCCCC T ss_conf 9875751230114798 No 163 >PRK10936 periplasmic sensory protein associated with the TorRS two-component regulatory system; Provisional Probab=34.73 E-value=33 Score=14.87 Aligned_cols=24 Identities=21% Similarity=0.065 Sum_probs=16.0 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 911799999999998543367200 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSISE 24 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~~ 24 (238) |+|+++.++++.+++.+|++.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~ 24 (340) T PRK10936 1 MMRILLFLLLSLFLLSAFAADNLL 24 (340) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 932799999999999870467434 No 164 >pfam09158 MotCF Bacteriophage T4 MotA, C-terminal. Members of this family adopt a compact alpha/beta structure comprising three alpha-helices and six beta-strands in the order: alpha1-beta1-beta2-beta3-beta4-alpha2-beta5-beta6-alpha3. The beta-strands form a single anti-parallel beta-sheet and the three alpha-helices pack side-by-side onto one surface of the beta-sheet. In this architecture, the domain's hydrophobic core is at the sheet-helix interface, and the second surface of the beta-sheet is completely exposed. The domain is a DNA-binding motif, with a consensus sequence containing nine base pairs (5'-TTTGCTTTA-3'), that appears to bind to various mot boxes, allowing access to the minor groove towards the 5'-end of this sequence and the major groove towards the 3'-end. Probab=34.50 E-value=33 Score=14.85 Aligned_cols=20 Identities=30% Similarity=0.267 Sum_probs=16.9 Q ss_pred EEEEEECCCCCEEEEEEEEE Q ss_conf 99999807972899788779 Q gi|254780374|r 156 AIVTAILENGNLIISGSQEV 175 (238) Q Consensus 156 a~V~~VlpNGnL~I~G~k~i 175 (238) -+=.||+-||+|.|.|.|.. T Consensus 43 iR~fEi~n~G~~RIfgYkm~ 62 (103) T pfam09158 43 IRNFEINNNGNMRIFGYKMM 62 (103) T ss_pred EEEEEEECCCCEEEEEEECC T ss_conf 26899815881899875223 No 165 >COG1021 EntE Peptide arylation enzymes [Secondary metabolites biosynthesis, transport, and catabolism] Probab=33.70 E-value=24 Score=15.76 Aligned_cols=33 Identities=12% Similarity=0.169 Sum_probs=27.5 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEEECCEEEEE Q ss_conf 999999998079728997887798418879999 Q gi|254780374|r 153 LIAAIVTAILENGNLIISGSQEVRVNDEIRSLN 185 (238) Q Consensus 153 ~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~ 185 (238) -.|--++...+.|+|+|+|+-+=.||.+-|.|- T Consensus 416 YrsGD~V~~~~dGyl~V~GR~KDQINRgGEKIA 448 (542) T COG1021 416 YRSGDLVRRDPDGYLVVEGRVKDQINRGGEKIA 448 (542) T ss_pred EECCCEEEECCCCEEEEEEEEHHHHCCCCCHHH T ss_conf 353754576688618997310565413521134 No 166 >COG2935 Putative arginyl-tRNA:protein arginylyltransferase [Posttranslational modification, protein turnover, chaperones] Probab=32.76 E-value=27 Score=15.38 Aligned_cols=17 Identities=35% Similarity=0.511 Sum_probs=13.6 Q ss_pred EEEEEEEEECCCCCEEE Q ss_conf 99999999807972899 Q gi|254780374|r 153 LIAAIVTAILENGNLII 169 (238) Q Consensus 153 ~isa~V~~VlpNGnL~I 169 (238) -|++.|+||||+|-=-| T Consensus 162 LvAVavtDvL~dGlSsV 178 (253) T COG2935 162 LVAVAVTDVLPDGLSSV 178 (253) T ss_pred EEEEEEEECCCCCCEEE T ss_conf 79998621166763048 No 167 >COG4259 Uncharacterized protein conserved in bacteria [Function unknown] Probab=32.69 E-value=36 Score=14.66 Aligned_cols=20 Identities=25% Similarity=0.381 Sum_probs=11.2 Q ss_pred CCHHHHHH----HHHHHHHHHHCC Q ss_conf 91179999----999999854336 Q gi|254780374|r 1 MYRYSFIV----LCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~----l~~~~lL~GCas 20 (238) |+++..+. ++.++.|+||++ T Consensus 1 M~~~ktlsr~al~~av~~LagC~~ 24 (121) T COG4259 1 MSKLKTLSRLALLLAVAALAGCGG 24 (121) T ss_pred CCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 951488999999999999988057 No 168 >COG1729 Uncharacterized protein conserved in bacteria [Function unknown] Probab=32.27 E-value=36 Score=14.62 Aligned_cols=23 Identities=35% Similarity=0.225 Sum_probs=17.9 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 91179999999999854336720 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHSSIS 23 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas~~~ 23 (238) |++.+...++..+++.||+.... T Consensus 1 m~~~~~~~~~~l~~~~~~~~~~~ 23 (262) T COG1729 1 MKSNLRLVLLGLSLLVGLAAPAS 23 (262) T ss_pred CCHHHHHHHHHHHHHHHHCCCCC T ss_conf 90269999999999876225334 No 169 >PRK02889 tolB translocation protein TolB; Provisional Probab=32.05 E-value=27 Score=15.43 Aligned_cols=20 Identities=20% Similarity=0.202 Sum_probs=12.5 Q ss_pred CCHHHHHHHHHHHHHHHHCC Q ss_conf 91179999999999854336 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas 20 (238) |+|+.+.++++++|+..|.. T Consensus 4 ~~~l~~~~~~~~~l~~~~~~ 23 (430) T PRK02889 4 MTKLGLRALVASCLIAAGGA 23 (430) T ss_pred HHHHHHHHHHHHHHHHCCCC T ss_conf 99899999999999862102 No 170 >COG4143 TbpA ABC-type thiamine transport system, periplasmic component [Coenzyme metabolism] Probab=31.24 E-value=38 Score=14.51 Aligned_cols=19 Identities=16% Similarity=0.086 Sum_probs=13.4 Q ss_pred CCHHHHHHHHHHHHHHHHC Q ss_conf 9117999999999985433 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCH 19 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCa 19 (238) |+|+...++++++++++|. T Consensus 1 m~~~l~~~~~~all~~~~~ 19 (336) T COG4143 1 MRRLLRALIGLALLVSAAL 19 (336) T ss_pred CHHHHHHHHHHHHHHHHHH T ss_conf 9024999999999999865 No 171 >TIGR01067 rplN_bact ribosomal protein L14; InterPro: IPR005745 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L14 is one of the proteins from the large ribosomal subunit. In bacteria, L14 is known to bind directly to the 23S rRNA. It belongs to a family of ribosomal proteins which have been grouped on the basis of sequence similarities . L14 is a protein of 119 to 137 amino-acid residues. This family distinguishes bacterial and most organellar examples of ribosomal protein L14 from all archaeal and eukaryotic forms. ; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015934 large ribosomal subunit. Probab=30.82 E-value=37 Score=14.57 Aligned_cols=34 Identities=15% Similarity=0.144 Sum_probs=22.2 Q ss_pred CCCCCCCCCEEEEEEEECCHHHHCCCCCCCCCCC Q ss_conf 3223676876999997241010000122234565 Q gi|254780374|r 74 DSRALNVGDILTVDIRIDDQAVFDNQTGRSRNNS 107 (238) Q Consensus 74 D~RA~~VGDIiTV~i~E~~~a~~~~~~~~~r~~~ 107 (238) .+|--.|||+|+|.|.+-.-.........=|.++ T Consensus 29 ~~~~A~vGD~Iv~~vK~a~P~~~~~~~~~vKKGd 62 (128) T TIGR01067 29 RRRYAKVGDVIVVVVKDAIPNGKGKYKLKVKKGD 62 (128) T ss_pred CCCEECCCCEEEEEEEEEECCCCCCEEECCCCCC T ss_conf 8220212989999876630598763010231133 No 172 >PRK09973 putative outer membrane lipoprotein; Provisional Probab=30.70 E-value=27 Score=15.39 Aligned_cols=20 Identities=25% Similarity=0.438 Sum_probs=11.0 Q ss_pred CCH-HHHHHH-HHHHHHHHHCC Q ss_conf 911-799999-99999854336 Q gi|254780374|r 1 MYR-YSFIVL-CYSSLLFGCHS 20 (238) Q Consensus 1 M~r-~~l~~l-~~~~lL~GCas 20 (238) |+. +++-.+ +...+|+||++ T Consensus 1 mrtk~vlGaviLaS~LLaGCsn 22 (85) T PRK09973 1 MKTIFTVGAVVLATCLLSGCVN 22 (85) T ss_pred CCCEEHHHHHHHHHHHHHCCCC T ss_conf 9602110699999999981555 No 173 >PRK10871 nlpD lipoprotein NlpD; Provisional Probab=30.70 E-value=37 Score=14.53 Aligned_cols=48 Identities=19% Similarity=0.099 Sum_probs=23.6 Q ss_pred EEEEEEEEEEEEEEC----CCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCCCCCEEE Q ss_conf 148999999999980----7972899788779841887999999798477678997382 Q gi|254780374|r 148 EKLNLLIAAIVTAIL----ENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVDAHNSVS 202 (238) Q Consensus 148 ~~l~~~isa~V~~Vl----pNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~~~NtV~ 202 (238) .-+.+.-.-+|+-.- --|||+| |.++..++..-+--..-.+..+-.|. T Consensus 285 ~pV~Aaa~G~VvyaG~~l~gyGnlii-------IkH~~~~lsaYahn~~~lVkeg~~V~ 336 (374) T PRK10871 285 QAIIATADGRVVYAGNALRGYGNLII-------IKHNDDYLSAYAHNDTMLVREQQEVK 336 (374) T ss_pred CEEEECCCEEEEEECCCCCCCCEEEE-------EECCCCCEEEEECCCCCCCCCCCEEC T ss_conf 81785378389992478787762899-------98499867861676626678889988 No 174 >cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses. Probab=30.66 E-value=35 Score=14.74 Aligned_cols=17 Identities=35% Similarity=0.659 Sum_probs=13.4 Q ss_pred EEEEEECCCCCEEEEEE Q ss_conf 99999807972899788 Q gi|254780374|r 156 AIVTAILENGNLIISGS 172 (238) Q Consensus 156 a~V~~VlpNGnL~I~G~ 172 (238) +.+...+.+|||||.+. T Consensus 88 ~~~a~L~d~GNLVl~~~ 104 (116) T cd00028 88 NYVLVLLDDGNLVLYDS 104 (116) T ss_pred CEEEEEECCCCEEEECC T ss_conf 75999938988999869 No 175 >pfam09676 TraV Type IV conjugative transfer system lipoprotein (TraV). This entry includes TraV, which is a component of conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK. The alignment contains three conserved cysteines in the N-terminal half. Probab=30.34 E-value=19 Score=16.42 Aligned_cols=20 Identities=20% Similarity=0.134 Sum_probs=12.8 Q ss_pred ECCCCCEEEEEEEEEEEECC Q ss_conf 80797289978877984188 Q gi|254780374|r 161 ILENGNLIISGSQEVRVNDE 180 (238) Q Consensus 161 VlpNGnL~I~G~k~i~vn~e 180 (238) +..+|+|.-.+.=.+.|... T Consensus 113 ~d~~g~~~~~~~vy~~v~~~ 132 (134) T pfam09676 113 IDADGDLHDPRYVYFVVDEG 132 (134) T ss_pred ECCCCCEECCCEEEEEECCC T ss_conf 87999886233699996478 No 176 >PRK04792 tolB translocation protein TolB; Provisional Probab=30.09 E-value=38 Score=14.48 Aligned_cols=19 Identities=5% Similarity=-0.076 Sum_probs=10.4 Q ss_pred CCHHH-HHHHHHHHHHHHHC Q ss_conf 91179-99999999985433 Q gi|254780374|r 1 MYRYS-FIVLCYSSLLFGCH 19 (238) Q Consensus 1 M~r~~-l~~l~~~~lL~GCa 19 (238) |+|.+ +.++++++++++|| T Consensus 1 M~k~~~~~l~lll~~~s~~A 20 (450) T PRK04792 1 MLKRLILGLFVLLLSFSQVA 20 (450) T ss_pred CCHHHHHHHHHHHHHHHHHH T ss_conf 93899999999999865752 No 177 >TIGR01645 half-pint poly-U binding splicing factor, half-pint family; InterPro: IPR006532 The proteins contain three RNA recognition motifs and have been characterised as poly-pyrimidine tract binding proteins associated with RNA splicing factors , , . In the case of PUF60, in complex with p54, and in the presence of U2AF, it facilitates association of U2 snRNP with pre-mRNA . . Probab=30.01 E-value=18 Score=16.53 Aligned_cols=34 Identities=24% Similarity=0.495 Sum_probs=29.5 Q ss_pred CCEEEEEEE-------EEEEECCEEEEEEEEEECHHHCCCC Q ss_conf 728997887-------7984188799999979847767899 Q gi|254780374|r 165 GNLIISGSQ-------EVRVNDEIRSLNVTGIVRPQDVDAH 198 (238) Q Consensus 165 GnL~I~G~k-------~i~vn~e~~~i~isGiVRp~DI~~~ 198 (238) =||-|+|.- ++.-.++...|.|+=.|.|+||+-+ T Consensus 667 E~m~I~G~sARhlvM~kLmR~~~S~VivLRNMV~P~DiDe~ 707 (791) T TIGR01645 667 EGMSIKGNSARHLVMQKLMRKNESRVIVLRNMVDPKDIDED 707 (791) T ss_pred CCCEECCCHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCC T ss_conf 17422336489999887346467707898347888433210 No 178 >COG3317 NlpB Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=29.77 E-value=40 Score=14.35 Aligned_cols=25 Identities=32% Similarity=0.248 Sum_probs=17.9 Q ss_pred CCCCCCCCCCCCCEEEEEEEECCHH Q ss_conf 2112322367687699999724101 Q gi|254780374|r 70 ALFKDSRALNVGDILTVDIRIDDQA 94 (238) Q Consensus 70 ~lf~D~RA~~VGDIiTV~i~E~~~a 94 (238) .+..+.|+-+=||--.+++.-...+ T Consensus 85 ~~Isg~r~erdGd~~~lv~~~~~~~ 109 (342) T COG3317 85 ALISGSRAERDGDTRWLVVENQPAA 109 (342) T ss_pred HHCCCCHHEECCCEEEEEEECCCHH T ss_conf 2246522111587369998179678 No 179 >cd05734 Ig7_DSCAM Seventh immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM). Ig7_DSCAM: the seventh immunoglobulin (Ig)-like domain of Down Syndrome Cell Adhesion molecule (DSCAM). DSCAM is a cell adhesion molecule expressed largely in the developing nervous system. The gene encoding DSCAM is located at human chromosome 21q22, the locus associated with the mental retardation phenotype of Down Syndrome. DSCAM is predicted to be the largest member of the IG superfamily. It has been demonstrated that DSCAM can mediate cation-independent homophilic intercellular adhesion. Probab=29.57 E-value=34 Score=14.82 Aligned_cols=14 Identities=36% Similarity=0.619 Sum_probs=8.3 Q ss_pred EEECCCCCEEEEEE Q ss_conf 99807972899788 Q gi|254780374|r 159 TAILENGNLIISGS 172 (238) Q Consensus 159 ~~VlpNGnL~I~G~ 172 (238) ++++|||.|+|+.- T Consensus 38 ~~v~~nGsL~I~~v 51 (79) T cd05734 38 IQLLSNGSLLIKHV 51 (79) T ss_pred EEECCCCCEEEEEC T ss_conf 89836986999435 No 180 >COG4939 Major membrane immunogen, membrane-anchored lipoprotein [Function unknown] Probab=29.39 E-value=38 Score=14.49 Aligned_cols=21 Identities=29% Similarity=0.262 Sum_probs=13.6 Q ss_pred CCH-HHHHHHHHHHHHHHHCCC Q ss_conf 911-799999999998543367 Q gi|254780374|r 1 MYR-YSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 1 M~r-~~l~~l~~~~lL~GCas~ 21 (238) |.+ .+...+++.+||.||... T Consensus 1 mvk~g~~~~~~~~~LL~aCg~s 22 (147) T COG4939 1 MVKYGLVGMIVALSLLTACGKS 22 (147) T ss_pred CCEEHHHHHHHHHHHHHHHCCC T ss_conf 9130365999999999870666 No 181 >PRK10477 outer membrane lipoprotein Blc; Provisional Probab=29.08 E-value=41 Score=14.28 Aligned_cols=17 Identities=18% Similarity=0.317 Sum_probs=11.0 Q ss_pred HHHHHHHHHHHHHHCCC Q ss_conf 99999999998543367 Q gi|254780374|r 5 SFIVLCYSSLLFGCHSS 21 (238) Q Consensus 5 ~l~~l~~~~lL~GCas~ 21 (238) +++.+.++++|.||++. T Consensus 6 ~~~~~~~~~~~~~c~~~ 22 (177) T PRK10477 6 VVAAVTAAFLVVACSSP 22 (177) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 99999999999764699 No 182 >PRK10959 outer membrane protein W; Provisional Probab=28.95 E-value=41 Score=14.27 Aligned_cols=18 Identities=28% Similarity=0.440 Sum_probs=13.0 Q ss_pred CCHHHHHHHHHHHHHHHH Q ss_conf 911799999999998543 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGC 18 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GC 18 (238) |||+.+.+++++.++++- T Consensus 1 Mkk~~~a~~~~~a~~s~~ 18 (212) T PRK10959 1 MKKLTVAALAVATLLSGS 18 (212) T ss_pred CCHHHHHHHHHHHHHHHH T ss_conf 937899999999875167 No 183 >PRK09495 glnH glutamine ABC transporter periplasmic protein; Reviewed Probab=28.90 E-value=40 Score=14.31 Aligned_cols=19 Identities=21% Similarity=0.167 Sum_probs=12.3 Q ss_pred CCHHHHHHHHHHHHHHHHC Q ss_conf 9117999999999985433 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCH 19 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCa 19 (238) |++++.+++++++++.+|+ T Consensus 1 mk~~~~~~~~~~~~~~~~~ 19 (247) T PRK09495 1 MKSVLKVSLAALTLAFAVS 19 (247) T ss_pred CHHHHHHHHHHHHHHHHHH T ss_conf 9779999999999999876 No 184 >pfam01203 GSPII_N Bacterial type II secretion system protein N. Probab=28.74 E-value=26 Score=15.50 Aligned_cols=17 Identities=18% Similarity=0.288 Sum_probs=11.0 Q ss_pred CCCCCCCCCCCCCCCCC Q ss_conf 44444455761215775 Q gi|254780374|r 52 KNSDSTKKSYSLWRDSH 68 (238) Q Consensus 52 ~~~~~~~~~gSL~~~~~ 68 (238) +.-......|++|+.+- T Consensus 36 ~~v~l~gvsGTlW~G~a 52 (251) T pfam01203 36 AGVSLSGVSGTLWQGRA 52 (251) T ss_pred CCEEEECCCCCEEECEE T ss_conf 85389454556571347 No 185 >pfam03866 HAP Hydrophobic abundant protein (HAP). Expression of HAP is thought to be developmentally regulated and possibly involved in spherule cell wall formation. Probab=28.30 E-value=42 Score=14.19 Aligned_cols=19 Identities=37% Similarity=0.838 Sum_probs=13.9 Q ss_pred CCHHHHHHHHHHHHHHHHCC Q ss_conf 91179999999999854336 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas 20 (238) ||||+|+.+|+.... +-++ T Consensus 1 mmkyifialc~faiv-alas 19 (167) T pfam03866 1 MMKYIFIALCFFAIV-ALAS 19 (167) T ss_pred CCHHHHHHHHHHHHH-HHHH T ss_conf 913589999999999-9962 No 186 >PHA00407 phage lambda Rz1-like protein Probab=28.13 E-value=42 Score=14.18 Aligned_cols=15 Identities=27% Similarity=0.465 Sum_probs=9.1 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999985433672 Q gi|254780374|r 8 VLCYSSLLFGCHSSI 22 (238) Q Consensus 8 ~l~~~~lL~GCas~~ 22 (238) ++.+++.++||++.. T Consensus 39 llicv~tISGCaSes 53 (84) T PHA00407 39 LLICVATISGCASES 53 (84) T ss_pred HHHHHHHHHHHHHCC T ss_conf 999999976665025 No 187 >TIGR02134 transald_staph transaldolase; InterPro: IPR011861 This small family of proteins belong to the transaldolases. Coxiella and Staphylococcus lack members of the known transaldolase families and appear to require a transaldolase activity for completion of the pentose phosphate pathway.. Probab=27.58 E-value=21 Score=16.12 Aligned_cols=25 Identities=40% Similarity=0.579 Sum_probs=20.9 Q ss_pred CCCCCCCCCCCCCCCCCCCCC-CEEEE Q ss_conf 612157755211232236768-76999 Q gi|254780374|r 61 YSLWRDSHAALFKDSRALNVG-DILTV 86 (238) Q Consensus 61 gSL~~~~~~~lf~D~RA~~VG-DIiTV 86 (238) -=||. +.|-||.=..|-++| ||||+ T Consensus 173 eLLWA-SpRElfNiiQAd~iG~dIITc 198 (237) T TIGR02134 173 ELLWA-SPRELFNIIQADRIGVDIITC 198 (237) T ss_pred HHHCC-CCHHHHHHHHHHHCCCEEEEC T ss_conf 00024-511456677473428406762 No 188 >pfam07119 DUF1375 Protein of unknown function (DUF1375). This family consists of several hypothetical, putative lipoproteins of around 80 residues in length. Members of this family seem to be specific to the Class Gammaproteobacteria. The function of this family is unknown. Probab=26.78 E-value=28 Score=15.28 Aligned_cols=19 Identities=26% Similarity=0.366 Sum_probs=12.3 Q ss_pred HHHHHHHHHCCCCCCCCCC Q ss_conf 9999985433672002568 Q gi|254780374|r 10 CYSSLLFGCHSSISEITGI 28 (238) Q Consensus 10 ~~~~lL~GCas~~~~~~~~ 28 (238) ++.++|+||++-+....+. T Consensus 3 ~~~~~lsGC~si~t~~~~~ 21 (70) T pfam07119 3 ALLLLLSGCGTIVTLTGGD 21 (70) T ss_pred HHHHHHCCCHHHEECCCCC T ss_conf 7998873451012415788 No 189 >PRK02998 prsA peptidylprolyl isomerase; Reviewed Probab=26.74 E-value=45 Score=14.02 Aligned_cols=20 Identities=20% Similarity=0.150 Sum_probs=10.1 Q ss_pred CCHHH--HHHHHH--HHHHHHHCC Q ss_conf 91179--999999--999854336 Q gi|254780374|r 1 MYRYS--FIVLCY--SSLLFGCHS 20 (238) Q Consensus 1 M~r~~--l~~l~~--~~lL~GCas 20 (238) |++.- +.++++ ++.|+||++ T Consensus 1 Mkkkk~~~~~~~~~~~l~LaaC~s 24 (283) T PRK02998 1 MKKKKLFIGTIISCVVLALSACGS 24 (283) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 935899999999999999986689 No 190 >KOG1923 consensus Probab=26.64 E-value=45 Score=14.01 Aligned_cols=33 Identities=21% Similarity=0.354 Sum_probs=18.4 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 999998543367200256875366777766444 Q gi|254780374|r 10 CYSSLLFGCHSSISEITGIPQMSPMGSSLDENN 42 (238) Q Consensus 10 ~~~~lL~GCas~~~~~~~~p~~tp~~~~~~~~~ 42 (238) ...+.+++|.++.......|...|.+...+... T Consensus 264 a~gs~asA~~s~~~S~s~ppppap~p~~~~~~a 296 (830) T KOG1923 264 ALGSLASACDSQPGSGSGPPPPAPLPHTAQSDA 296 (830) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 666665403567888999999999987655667 No 191 >COG3107 LppC Putative lipoprotein [General function prediction only] Probab=26.33 E-value=46 Score=13.97 Aligned_cols=15 Identities=20% Similarity=0.476 Sum_probs=6.6 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999985433672 Q gi|254780374|r 8 VLCYSSLLFGCHSSI 22 (238) Q Consensus 8 ~l~~~~lL~GCas~~ 22 (238) .++++++|+||++.. T Consensus 17 ~illa~vla~C~t~~ 31 (604) T COG3107 17 PILLALVLAGCSTFL 31 (604) T ss_pred HHHHHHHHHHHCCCC T ss_conf 999999997604678 No 192 >PRK10554 outer membrane porin protein C; Provisional Probab=26.12 E-value=46 Score=13.96 Aligned_cols=20 Identities=20% Similarity=0.164 Sum_probs=15.7 Q ss_pred CCHHHHHHHHHHHHHHHHCC Q ss_conf 91179999999999854336 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCHS 20 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCas 20 (238) |||.+|.+++.+++++|.+. T Consensus 1 MKk~~LA~~i~all~ag~a~ 20 (373) T PRK10554 1 MKVKVLSLLVPALLVAGAAN 20 (373) T ss_pred CCHHHHHHHHHHHHHHCCCC T ss_conf 95408999999998607531 No 193 >PRK03757 hypothetical protein; Provisional Probab=25.99 E-value=46 Score=13.93 Aligned_cols=15 Identities=20% Similarity=-0.009 Sum_probs=9.4 Q ss_pred CCHHHHHHHHHHHHH Q ss_conf 911799999999998 Q gi|254780374|r 1 MYRYSFIVLCYSSLL 15 (238) Q Consensus 1 M~r~~l~~l~~~~lL 15 (238) |+|+++.++++++++ T Consensus 1 mkk~~l~~~l~a~~~ 15 (191) T PRK03757 1 MKKTLLGLTLGSLLF 15 (191) T ss_pred CCHHHHHHHHHHHHH T ss_conf 904399999999986 No 194 >pfam01315 Ald_Xan_dh_C Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain. Probab=25.98 E-value=46 Score=13.93 Aligned_cols=31 Identities=23% Similarity=0.302 Sum_probs=25.8 Q ss_pred CCCCCCCCCCCCCCCEEEEEEEECCHHHHCC Q ss_conf 5521123223676876999997241010000 Q gi|254780374|r 68 HAALFKDSRALNVGDILTVDIRIDDQAVFDN 98 (238) Q Consensus 68 ~~~lf~D~RA~~VGDIiTV~i~E~~~a~~~~ 98 (238) ..++|.+.|-+.+|+.|-++|.|+..+...+ T Consensus 69 ~~p~la~~~V~y~GqpvA~VvAet~~~A~~A 99 (111) T pfam01315 69 PDALLADDKVRFVGQPIAAVVADDEETARDA 99 (111) T ss_pred CCCCCCCCEEEECCCEEEEEEECCHHHHHHH T ss_conf 7330568989888988999999999999999 No 195 >PRK08190 bifunctional enoyl-CoA hydratase/phosphate acetyltransferase; Validated Probab=25.76 E-value=47 Score=13.91 Aligned_cols=20 Identities=5% Similarity=0.122 Sum_probs=9.1 Q ss_pred EEEEEEEEECCCCCEEEEEE Q ss_conf 99999999807972899788 Q gi|254780374|r 153 LIAAIVTAILENGNLIISGS 172 (238) Q Consensus 153 ~isa~V~~VlpNGnL~I~G~ 172 (238) +++++|+|..|.-+.++--- T Consensus 103 tatVtV~ek~~~k~~V~L~~ 122 (465) T PRK08190 103 TVTVTVREKRPEKRIVLLDC 122 (465) T ss_pred EEEEEEEEECCCCCEEEEEE T ss_conf 99999999717889899997 No 196 >pfam07241 consensus Probab=25.74 E-value=47 Score=13.91 Aligned_cols=30 Identities=23% Similarity=0.271 Sum_probs=25.6 Q ss_pred CCCCCEEEEEEEEEEEECCEEEEEEEEEEC Q ss_conf 079728997887798418879999997984 Q gi|254780374|r 162 LENGNLIISGSQEVRVNDEIRSLNVTGIVR 191 (238) Q Consensus 162 lpNGnL~I~G~k~i~vn~e~~~i~isGiVR 191 (238) +.+|.+.|+|+.-+--.-..+.|.++|.|+ T Consensus 53 ~~~g~i~I~G~~L~I~~i~~eei~I~G~I~ 82 (87) T pfam07241 53 LKVGQIIIKGENFVIKFILPEEILLEGKIK 82 (87) T ss_pred CCCEEEEEECCCEEEEEECCCEEEEEEEEE T ss_conf 685799999534999996362899999998 No 197 >TIGR00008 infA translation initiation factor IF-1; InterPro: IPR004368 This family consists of translation initiation factor IF-1 as found in bacteria and chloroplasts. This protein, about 70 residues in length, consists largely of an S1 RNA binding domain (IPR003029 from INTERPRO).; GO: 0003743 translation initiation factor activity, 0006413 translational initiation. Probab=25.73 E-value=47 Score=13.90 Aligned_cols=39 Identities=15% Similarity=0.165 Sum_probs=29.7 Q ss_pred EEEEEEEEECCCCCEEEEEEEEEEEECCEEEEEEEEEECHHHCC Q ss_conf 99999999807972899788779841887999999798477678 Q gi|254780374|r 153 LIAAIVTAILENGNLIISGSQEVRVNDEIRSLNVTGIVRPQDVD 196 (238) Q Consensus 153 ~isa~V~~VlpNGnL~I~G~k~i~vn~e~~~i~isGiVRp~DI~ 196 (238) .+.-.|+|.|||+.+.|+=+ |+..=.-.+||-+|-..|. T Consensus 6 ~~~G~v~e~l~n~~f~V~Le-----N~~~v~a~iSGk~r~~~ir 44 (69) T TIGR00008 6 EVEGKVVESLPNAMFRVELE-----NGHEVLAHISGKIRKNYIR 44 (69) T ss_pred EECCEEEECCCCCEEEEEEC-----CCCEEEEEECCEEECCEEE T ss_conf 74238985436982699947-----9978989971034135175 No 198 >PRK09752 adhesin; Provisional Probab=25.54 E-value=39 Score=14.43 Aligned_cols=22 Identities=14% Similarity=-0.002 Sum_probs=10.4 Q ss_pred CCCCCCCCC--CCCCCCCCCCCCC Q ss_conf 576121577--5521123223676 Q gi|254780374|r 59 KSYSLWRDS--HAALFKDSRALNV 80 (238) Q Consensus 59 ~~gSL~~~~--~~~lf~D~RA~~V 80 (238) -.||-+..- ...||.-++.-|- T Consensus 957 EaGSYlANLaAANtLFvtrLhDRl 980 (1250) T PRK09752 957 KVGGYLNNLRAANQAFVMERRDHA 980 (1250) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 111157799999987665116424 No 199 >PRK11063 metQ DL-methionine transporter substrate-binding subunit; Provisional Probab=25.51 E-value=47 Score=13.88 Aligned_cols=15 Identities=33% Similarity=0.372 Sum_probs=9.2 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999985433672 Q gi|254780374|r 8 VLCYSSLLFGCHSSI 22 (238) Q Consensus 8 ~l~~~~lL~GCas~~ 22 (238) ++++++.|+||.+.. T Consensus 13 ~l~~~l~l~gCg~~~ 27 (271) T PRK11063 13 ALIGSLALVGCGQDE 27 (271) T ss_pred HHHHHHHHCCCCCCC T ss_conf 999999750058985 No 200 >TIGR03044 PS_II_psb27 photosystem II protein Psb27. Members of this family are the Psb27 protein of the cyanobacterial photosynthetic supracomplex, photosystem II. Although most protein components of both cyanobacterial and chloroplast versions of photosystem II are closely related and described together by single HMM families, this family is strictly bacterial. Some uncharacterized proteins with highly divergent sequences, from Arabidopsis, score between trusted and noise cutoffs for this model but are not at this time assigned as functionally equivalent photosystem II proteins. Probab=24.94 E-value=40 Score=14.31 Aligned_cols=16 Identities=31% Similarity=0.414 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHHCC Q ss_conf 9999999999854336 Q gi|254780374|r 5 SFIVLCYSSLLFGCHS 20 (238) Q Consensus 5 ~l~~l~~~~lL~GCas 20 (238) +-+.|+++++|.||+. T Consensus 12 lal~L~~~l~l~~cs~ 27 (135) T TIGR03044 12 LALVLGLCLLLTACSG 27 (135) T ss_pred HHHHHHHHHHHHCCCC T ss_conf 9999999999734567 No 201 >pfam06924 DUF1281 Protein of unknown function (DUF1281). This family consists of several hypothetical enterobacterial proteins of around 170 residues in length. Members of this family are found in Escherichia coli, Salmonella typhimurium and Shigella species. The function of this family is unknown. Probab=24.91 E-value=42 Score=14.21 Aligned_cols=15 Identities=20% Similarity=0.432 Sum_probs=10.4 Q ss_pred HHHHHHHHHCCCCCC Q ss_conf 999998543367200 Q gi|254780374|r 10 CYSSLLFGCHSSISE 24 (238) Q Consensus 10 ~~~~lL~GCas~~~~ 24 (238) .+-+||+|||.-+++ T Consensus 32 si~LFLAGCAGiLkP 46 (134) T pfam06924 32 SIKLFLAGCAGILKP 46 (134) T ss_pred HHHHHHHHHCCCCCC T ss_conf 999999701111566 No 202 >PRK11669 pbpG D-alanyl-D-alanine endopeptidase; Provisional Probab=24.86 E-value=49 Score=13.80 Aligned_cols=19 Identities=21% Similarity=0.231 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 1799999999998543367 Q gi|254780374|r 3 RYSFIVLCYSSLLFGCHSS 21 (238) Q Consensus 3 r~~l~~l~~~~lL~GCas~ 21 (238) |+.+.++++.++++||... T Consensus 2 k~~~~~~~~~~~~~~~~~~ 20 (308) T PRK11669 2 KIRVSLLSLLLLLAGVPVA 20 (308) T ss_pred CHHHHHHHHHHHHHHHCCC T ss_conf 3269999999999740036 No 203 >PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional Probab=24.85 E-value=43 Score=14.14 Aligned_cols=13 Identities=31% Similarity=0.603 Sum_probs=8.5 Q ss_pred HHHHHHHHHHHHC Q ss_conf 9999999985433 Q gi|254780374|r 7 IVLCYSSLLFGCH 19 (238) Q Consensus 7 ~~l~~~~lL~GCa 19 (238) .++..+++|+||. T Consensus 14 ~l~~~~~lLsGC~ 26 (305) T PRK10525 14 SLFAGTVLLSGCN 26 (305) T ss_pred HHHHHHHHHCCCC T ss_conf 9999999866787 No 204 >pfam11353 DUF3153 Protein of unknown function (DUF3153). This family of proteins with unknown function appear to be restricted to Cyanobacteria. Some members are annotated as membrane proteins however this cannot be confirmed. Probab=24.38 E-value=27 Score=15.41 Aligned_cols=19 Identities=26% Similarity=0.508 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHCCCCCCC Q ss_conf 9999999985433672002 Q gi|254780374|r 7 IVLCYSSLLFGCHSSISEI 25 (238) Q Consensus 7 ~~l~~~~lL~GCas~~~~~ 25 (238) +++.++++|+||......+ T Consensus 3 vlllltlLLSGCV~~~~~i 21 (210) T pfam11353 3 VLLLLTLLLSGCVRIDADL 21 (210) T ss_pred HHHHHHHHHCCEEEEECEE T ss_conf 2999999847818984303 No 205 >PRK10002 outer membrane protein F; Provisional Probab=24.20 E-value=50 Score=13.72 Aligned_cols=19 Identities=26% Similarity=0.279 Sum_probs=9.3 Q ss_pred CCHHHHHHHHHHHHHHHHC Q ss_conf 9117999999999985433 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCH 19 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCa 19 (238) |+|.+|.+++.+++++|.+ T Consensus 2 MKK~~LA~ai~all~ag~a 20 (362) T PRK10002 2 MKRNILAVIVPALLVAGTA 20 (362) T ss_pred CHHHHHHHHHHHHHHHCCC T ss_conf 3087999999999860644 No 206 >PRK06687 chlorohydrolase; Validated Probab=24.13 E-value=37 Score=14.55 Aligned_cols=15 Identities=7% Similarity=0.208 Sum_probs=6.5 Q ss_pred CCCCCCEEEEEEEEC Q ss_conf 367687699999724 Q gi|254780374|r 77 ALNVGDILTVDIRID 91 (238) Q Consensus 77 A~~VGDIiTV~i~E~ 91 (238) |++-|=.+.+-+.|. T Consensus 205 a~~~~~~~~~H~~e~ 219 (422) T PRK06687 205 AVENQTMVHIHLSET 219 (422) T ss_pred HHHCCCEEEEEECCC T ss_conf 997198499996452 No 207 >COG1462 CsgG Uncharacterized protein involved in formation of curli polymers [Cell envelope biogenesis, outer membrane] Probab=23.50 E-value=52 Score=13.64 Aligned_cols=28 Identities=11% Similarity=0.118 Sum_probs=15.0 Q ss_pred CCCCCCHHEECEEEEEEEEEEEEEEECC Q ss_conf 4321100000001489999999999807 Q gi|254780374|r 136 GASSGKGSISRAEKLNLLIAAIVTAILE 163 (238) Q Consensus 136 ~~~~g~G~~~~~~~l~~~isa~V~~Vlp 163 (238) ..+-|-+.-.....-...|..+||+|-- T Consensus 139 ~~l~G~~~r~~~q~a~iavdLRvVnv~T 166 (252) T COG1462 139 AQLFGIGARGKKQIAYIAVDLRVVNVST 166 (252) T ss_pred CCCEEEECCCCCCEEEEEEEEEEEECCC T ss_conf 5401021143010112677888886144 No 208 >COG4880 Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats [General function prediction only] Probab=23.32 E-value=52 Score=13.62 Aligned_cols=49 Identities=29% Similarity=0.318 Sum_probs=27.7 Q ss_pred EEEECCCCCEEEEEEEE------EEEECCEEEEEE--------------EEEECHHHCCCCCEEEHHEEEE Q ss_conf 99980797289978877------984188799999--------------9798477678997382420554 Q gi|254780374|r 158 VTAILENGNLIISGSQE------VRVNDEIRSLNV--------------TGIVRPQDVDAHNSVSYDKIAE 208 (238) Q Consensus 158 V~~VlpNGnL~I~G~k~------i~vn~e~~~i~i--------------sGiVRp~DI~~~NtV~S~~iAd 208 (238) |-.-|-||.|.|-.+|. +.+|+- +|-- +=+|---|+..+|.=.|-.++- T Consensus 182 vdaRlynG~lYiv~rk~~~~cPiv~iN~~--ei~~~~~y~p~~pm~~dttyii~~vd~ksg~vens~~vvG 250 (603) T COG4880 182 VDARLYNGELYIVARKSSIDCPIVVINGR--EINEAPLYSPDFPMDPDTTYIIAGVDLKSGNVENSAYVVG 250 (603) T ss_pred EEEEEECCEEEEEECCCCCCCCEEEECCE--EECCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCEEEEEC T ss_conf 43034388899998036899988998776--8235532266677788714999987526786021589945 No 209 >PRK10461 thiamine biosynthesis lipoprotein ApbE; Provisional Probab=23.30 E-value=52 Score=13.62 Aligned_cols=17 Identities=29% Similarity=0.591 Sum_probs=11.0 Q ss_pred HHHHHHHHHHHHHHCCC Q ss_conf 99999999998543367 Q gi|254780374|r 5 SFIVLCYSSLLFGCHSS 21 (238) Q Consensus 5 ~l~~l~~~~lL~GCas~ 21 (238) .+.+|.++++|+||... T Consensus 7 ~~~ll~~~~~l~gC~~~ 23 (351) T PRK10461 7 RVALLAAALLLVGCDQA 23 (351) T ss_pred HHHHHHHHHHHHHHCCC T ss_conf 99999999999872488 No 210 >PRK10759 hypothetical protein; Provisional Probab=23.11 E-value=40 Score=14.35 Aligned_cols=13 Identities=31% Similarity=0.603 Sum_probs=8.2 Q ss_pred HHHHHHHHHHHHC Q ss_conf 9999999985433 Q gi|254780374|r 7 IVLCYSSLLFGCH 19 (238) Q Consensus 7 ~~l~~~~lL~GCa 19 (238) ++++++++|+||+ T Consensus 4 ~~~~~~l~~sgc~ 16 (106) T PRK10759 4 LIVPLLLLLSGCS 16 (106) T ss_pred HHHHHHHHHCCCH T ss_conf 9999999984423 No 211 >pfam06085 Rz1 Lipoprotein Rz1 precursor. This family consists of several bacteria and phage lipoprotein Rz1 precursors. Rz1 is a proline-rich lipoprotein from bacteriophage lambda which is known to have fusogenic properties. Rz1-induced liposome fusion is thought to be mediated primarily by the generation of local perturbation in the bilayer lipid membrane and to a lesser extent by electrostatic forces. Probab=22.87 E-value=53 Score=13.56 Aligned_cols=21 Identities=14% Similarity=0.278 Sum_probs=11.2 Q ss_pred CCHHHHHHH--HHHHHHHHHCCC Q ss_conf 911799999--999998543367 Q gi|254780374|r 1 MYRYSFIVL--CYSSLLFGCHSS 21 (238) Q Consensus 1 M~r~~l~~l--~~~~lL~GCas~ 21 (238) |+++.+.+. .+.+.++||.+. T Consensus 1 Mr~lkm~l~~~m~~L~vsaC~S~ 23 (59) T pfam06085 1 MRKLKMKLCVMMLPLVVSACSSK 23 (59) T ss_pred CCCCHHHHHHHHHHHHHHHHCCC T ss_conf 95202569999989999982699 No 212 >KOG1193 consensus Probab=22.78 E-value=53 Score=13.55 Aligned_cols=29 Identities=31% Similarity=0.251 Sum_probs=22.7 Q ss_pred CCCHHEECEEEEEEE-EEEEEEEECCCCCE Q ss_conf 110000000148999-99999998079728 Q gi|254780374|r 139 SGKGSISRAEKLNLL-IAAIVTAILENGNL 167 (238) Q Consensus 139 ~g~G~~~~~~~l~~~-isa~V~~VlpNGnL 167 (238) -|-|+.-|.-.+.+. |||.|+++||++-= T Consensus 302 ~gyGSfHqqY~lDgkLIAVgViDILP~cvS 331 (511) T KOG1193 302 CGYGSFHQQYRLDGKLIAVGVIDILPKCVS 331 (511) T ss_pred CCCCCHHHEEEECCEEEEEEEEECCCCCCC T ss_conf 665556650234786899987322777531 No 213 >pfam09619 YscW Type III secretion system lipoprotein chaperone (YscW). This entry is encoded within type III secretion operons. The protein has been characterized as a chaperone for the outer membrane pore component YscC. YscW is a lipoprotein which is itself localized to the outer membrane and, it is believed, facilitates the oligomerisation and localisation of YscC. Probab=22.39 E-value=28 Score=15.27 Aligned_cols=15 Identities=33% Similarity=0.392 Sum_probs=10.1 Q ss_pred HHHHHHHHHHCCCCC Q ss_conf 999999854336720 Q gi|254780374|r 9 LCYSSLLFGCHSSIS 23 (238) Q Consensus 9 l~~~~lL~GCas~~~ 23 (238) +.++++|+||.+... T Consensus 2 l~l~~lL~GCVt~p~ 16 (124) T pfam09619 2 LPLLLLLTGCVTTPM 16 (124) T ss_pred HHHHHHHHCCCCCCC T ss_conf 203324606556777 No 214 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=22.30 E-value=54 Score=13.49 Aligned_cols=18 Identities=28% Similarity=0.305 Sum_probs=9.7 Q ss_pred HHHHHHHHHHHHHHH--CCC Q ss_conf 799999999998543--367 Q gi|254780374|r 4 YSFIVLCYSSLLFGC--HSS 21 (238) Q Consensus 4 ~~l~~l~~~~lL~GC--as~ 21 (238) ++.+++...++|+|| +.. T Consensus 7 l~~~~~~a~~lLsaCsGg~~ 26 (924) T TIGR02917 7 LLTILLAAILLLSACSGGDQ 26 (924) T ss_pred HHHHHHHHHHHHHHCCCCCC T ss_conf 99999999999863057888 No 215 >pfam10368 YkyA Putative cell-wall binding lipoprotein. YkyA is a family of proteins containing a lipoprotein signal and a hydrolase domain. It is similar to cell wall binding proteins and might also be recognisable by a host immune defence system. It is thus likely to belong to pathways important for pathogenicity. Probab=21.86 E-value=37 Score=14.56 Aligned_cols=11 Identities=36% Similarity=0.540 Sum_probs=7.4 Q ss_pred HHHHHHHHHCC Q ss_conf 99999854336 Q gi|254780374|r 10 CYSSLLFGCHS 20 (238) Q Consensus 10 ~~~~lL~GCas 20 (238) +.++||+||.+ T Consensus 3 ~s~~lLaGC~~ 13 (205) T pfam10368 3 LSILLLTGCTT 13 (205) T ss_pred HHHHHHHHHCC T ss_conf 89999986269 No 216 >PRK11479 hypothetical protein; Provisional Probab=21.74 E-value=56 Score=13.42 Aligned_cols=20 Identities=30% Similarity=0.433 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999998543367200 Q gi|254780374|r 5 SFIVLCYSSLLFGCHSSISE 24 (238) Q Consensus 5 ~l~~l~~~~lL~GCas~~~~ 24 (238) .++++...++|+||+.-+.. T Consensus 7 ~~~~~~~~lll~~c~~dv~~ 26 (273) T PRK11479 7 CRLLLPWFLLLSACTVDISQ 26 (273) T ss_pred HHHHHHHHHHHHHHHHCCCC T ss_conf 99999999998775412466 No 217 >pfam05968 Bacillus_PapR Bacillus PapR protein. This family consists of the Bacillus species specific PapR protein. The papR gene belongs to the PlcR regulon and is located 70 bp downstream from plcR. It encodes a 48-amino-acid peptide. Disruption of the papR gene abolishes expression of the PlcR regulon, resulting in a large decrease in haemolysis and virulence in insect larvae. A processed form of PapR activates the PlcR regulon by allowing PlcR to bind to its DNA target. This activating mechanism is strain specific. Probab=21.59 E-value=56 Score=13.40 Aligned_cols=19 Identities=21% Similarity=0.240 Sum_probs=16.4 Q ss_pred CCHHHHHHHHHHHHHHHHC Q ss_conf 9117999999999985433 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCH 19 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCa 19 (238) |+|+++..++...+++|-+ T Consensus 1 MKKlLigslltlAM~wGis 19 (48) T pfam05968 1 MKKLLIGSLLTLAMLWGIS 19 (48) T ss_pred CCHHHHHHHHHHHHHHCCC T ss_conf 9047886899999995332 No 218 >PRK11679 lipoprotein; Provisional Probab=21.35 E-value=57 Score=13.37 Aligned_cols=21 Identities=29% Similarity=0.276 Sum_probs=14.2 Q ss_pred CCCCCCCCCCCCCEEEEEEEE Q ss_conf 211232236768769999972 Q gi|254780374|r 70 ALFKDSRALNVGDILTVDIRI 90 (238) Q Consensus 70 ~lf~D~RA~~VGDIiTV~i~E 90 (238) +|-..-|.-+=||..++.+.- T Consensus 90 ~l~~gsr~e~~g~~~~~lv~~ 110 (345) T PRK11679 90 ALISGARTQFNGDTATLLLEN 110 (345) T ss_pred CCCCCCEEEECCCEEEEEEEC T ss_conf 547883687379717999837 No 219 >PRK11186 carboxy-terminal protease; Provisional Probab=21.33 E-value=57 Score=13.37 Aligned_cols=32 Identities=13% Similarity=0.325 Sum_probs=13.8 Q ss_pred CCCCCCCCCCCCHHEECEEEEEEEEEEEEEEECCCC Q ss_conf 655555432110000000148999999999980797 Q gi|254780374|r 130 LNYDGGGASSGKGSISRAEKLNLLIAAIVTAILENG 165 (238) Q Consensus 130 ~~~~~~~~~~g~G~~~~~~~l~~~isa~V~~VlpNG 165 (238) ++.+-+.+|.|-|..-+.+. +. ++|+++.|+| T Consensus 237 F~~~msgs~eGIGa~l~~~d--~~--~~Iv~~i~Gg 268 (673) T PRK11186 237 FNTEMNLSLEGIGAVLQMDD--DY--TVIKSLVAGG 268 (673) T ss_pred HHHHCCCEEEEEEEEEEEEC--CE--EEEEEECCCC T ss_conf 98741672766889999989--98--9999706899 No 220 >PHA00144 major head protein Probab=21.15 E-value=57 Score=13.34 Aligned_cols=105 Identities=14% Similarity=0.105 Sum_probs=54.0 Q ss_pred CCCCCCCCCCCCCCCEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCC-CCCCCCCCCCCCCHHE Q ss_conf 552112322367687699999724101000012223456544555433345544556--6655-4655555432110000 Q gi|254780374|r 68 HAALFKDSRALNVGDILTVDIRIDDQAVFDNQTGRSRNNSLHRKLSGGFSLFGQQTP--QMNG-NLNYDGGGASSGKGSI 144 (238) Q Consensus 68 ~~~lf~D~RA~~VGDIiTV~i~E~~~a~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~~~~g~G~~ 144 (238) +.+.|+.--|+--||.+-++-.--.++...+ ....|...+..-+-. +...+- .+.. +.+.+.+.+..|.=++ T Consensus 329 S~S~FanavAFv~~~~~~~v~qviv~p~i~~-lk~g~~~~~~a~ir~----p~~k~~~v~W~ve~~s~Gt~is~dgvl~i 403 (438) T PHA00144 329 SASRFANAVAFVTGDNTIVVTQVIVSPEIAS-LKQGKSQGFTAYIRA----PDNKGHAVVWYVEGGSTGTSISNDGVLTI 403 (438) T ss_pred CCCCCCCCEEEECCCCCCCEEEEEECCCEEE-EECCCCCCCEEEEEC----CCCCCCEEEEEEECCCCCCEECCCCEEEE T ss_conf 0355655036504787631003885443121-001676760379877----77766347899607865512226735985 Q ss_pred ECEEEEEEEEEEEEEEECCCCCEEEEEEEEEEE Q ss_conf 000148999999999980797289978877984 Q gi|254780374|r 145 SRAEKLNLLIAAIVTAILENGNLIISGSQEVRV 177 (238) Q Consensus 145 ~~~~~l~~~isa~V~~VlpNGnL~I~G~k~i~v 177 (238) .-++--++++-|+|.---+|||--|.||-.|.| T Consensus 404 ~~nE~~~~~v~v~Vd~~~~~~nK~v~gEa~i~v 436 (438) T PHA00144 404 GANEPNQLTVKVTVDIGTADGNKPVVGEAVISV 436 (438) T ss_pred CCCCCCEEEEEEEEECCCCCCCCCCCCEEEEEE T ss_conf 389987147999997158889865443368873 No 221 >TIGR01935 NOT-MenG RraA family; InterPro: IPR010203 This entry includes a number of closely related sequences bacteria and plants. The Escherichia coli member of this family has been characterised as a regulator of RNase E (see IPR004659 from INTERPRO), and its crystal structure has been analysed . E. coli RraA acts as a trans-acting modulator of RNA turnover, binding essential endonuclease RNase E and inhibiting RNA processing . RNase E forms the core of a large RNA-catalysis machine termed the degradosomes. RraA (and RraB) causes remodelling of degradosome composition, which is associated with alterations in RNA decay and global transcript abundance and as such is a bacterial mechanism for the regulation of RNA cleavage .; GO: 0008428 ribonuclease inhibitor activity, 0051252 regulation of RNA metabolic process. Probab=20.99 E-value=58 Score=13.32 Aligned_cols=31 Identities=29% Similarity=0.360 Sum_probs=23.4 Q ss_pred CCCCCC--CCCCCCCCEEEEEEEECCHHHHCCC Q ss_conf 521123--2236768769999972410100001 Q gi|254780374|r 69 AALFKD--SRALNVGDILTVDIRIDDQAVFDNQ 99 (238) Q Consensus 69 ~~lf~D--~RA~~VGDIiTV~i~E~~~a~~~~~ 99 (238) .++|++ .|+..-|-|+||.+-|+.+--|+-= T Consensus 21 ~P~Fr~FGgr~~F~G~ivTvkcFEDNslvr~~L 53 (155) T TIGR01935 21 EPVFRNFGGRAAFAGPIVTVKCFEDNSLVREVL 53 (155) T ss_pred CHHHHHCCCCCCCCCCCEEEEEEECCHHHHHHH T ss_conf 234321144232478403899970473688998 No 222 >PRK06760 hypothetical protein; Provisional Probab=20.95 E-value=58 Score=13.32 Aligned_cols=38 Identities=13% Similarity=-0.055 Sum_probs=18.0 Q ss_pred CCCCCCCCCCCCCCEEEEEEEECCHHHHCCCCCCCCCCCC Q ss_conf 5211232236768769999972410100001222345654 Q gi|254780374|r 69 AALFKDSRALNVGDILTVDIRIDDQAVFDNQTGRSRNNSL 108 (238) Q Consensus 69 ~~lf~D~RA~~VGDIiTV~i~E~~~a~~~~~~~~~r~~~~ 108 (238) ..+|.-..+.-=|- .|+|-..+.|..--....-|.... T Consensus 56 ~~~y~vK~g~~~~k--kVLim~~sTAe~mvKk~llRkrDn 93 (223) T PRK06760 56 NEVFSVKKGEVGNG--QVLILNESTAQALIKEKVFRKRDN 93 (223) T ss_pred HHEEEEEEEEECCE--EEEEEEHHHHHHHHHHCCEEECCC T ss_conf 32046675137882--799986189999976120230458 No 223 >PRK13697 cytochrome c6; Provisional Probab=20.94 E-value=58 Score=13.32 Aligned_cols=19 Identities=26% Similarity=0.044 Sum_probs=13.0 Q ss_pred CCHHHHHHHHHHHHHHHHC Q ss_conf 9117999999999985433 Q gi|254780374|r 1 MYRYSFIVLCYSSLLFGCH 19 (238) Q Consensus 1 M~r~~l~~l~~~~lL~GCa 19 (238) |+|++.+++++.+++..+- T Consensus 1 Mk~l~~~~l~~~~~~~~~~ 19 (111) T PRK13697 1 MKKILKLVLLTLLLLTFAF 19 (111) T ss_pred CHHHHHHHHHHHHHHHHHC T ss_conf 9789999999999999980 No 224 >pfam03304 Mlp Mlp lipoprotein family. The Mlp (for Multicopy Lipoprotein) family of lipoproteins is found in Borrelia species. This family were previously known as 2.9 lipoprotein genes. These surface expressed genes may represent new candidate vaccinogens for Lyme disease. Members of this family generally are downstream of four ORFs called A,B,C and D that are involved in hemolytic activity. Probab=20.87 E-value=58 Score=13.31 Aligned_cols=19 Identities=32% Similarity=0.672 Sum_probs=10.6 Q ss_pred HHHHHHHHH-HHHHHHHCCC Q ss_conf 179999999-9998543367 Q gi|254780374|r 3 RYSFIVLCY-SSLLFGCHSS 21 (238) Q Consensus 3 r~~l~~l~~-~~lL~GCas~ 21 (238) +++=+++|+ .++|.||.+. T Consensus 2 KIINilfclfll~Ln~Cnsn 21 (150) T pfam03304 2 KIINILFCLFLLMLNGCNSN 21 (150) T ss_pred CEEHHHHHHHHHHHCCCCCC T ss_conf 43149999999998265678 No 225 >pfam10023 DUF2265 Predicted aminopeptidase (DUF2265). Members of this family of bacterial proteins comprise various hypothetical proteins and putative aminopeptidases. Their exact function, has not, as yet, been defined. Probab=20.48 E-value=43 Score=14.13 Aligned_cols=14 Identities=36% Similarity=0.823 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHHCC Q ss_conf 99999999854336 Q gi|254780374|r 7 IVLCYSSLLFGCHS 20 (238) Q Consensus 7 ~~l~~~~lL~GCas 20 (238) ++++++++|+||++ T Consensus 2 ~ll~~~l~l~gC~~ 15 (338) T pfam10023 2 LLLALALLLAGCQT 15 (338) T ss_pred HHHHHHHHHCCCCC T ss_conf 39999999868863 No 226 >COG4775 Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane] Probab=20.35 E-value=57 Score=13.35 Aligned_cols=28 Identities=21% Similarity=0.260 Sum_probs=23.8 Q ss_pred CCCCCCCCCCCCCCEEEEEEEECCHHHH Q ss_conf 5211232236768769999972410100 Q gi|254780374|r 69 AALFKDSRALNVGDILTVDIRIDDQAVF 96 (238) Q Consensus 69 ~~lf~D~RA~~VGDIiTV~i~E~~~a~~ 96 (238) ..+|+|-++.+.|++|.|.|.|....+. T Consensus 69 tg~F~~v~~~~~~~~Lvv~V~e~p~I~~ 96 (766) T COG4775 69 TGYFEDVRVEREGGTLVVTVKERPVINS 96 (766) T ss_pred CCCCCCEEEEECCCEEEEEEECCCCEEE T ss_conf 6897613788709879999962772878 No 227 >TIGR02236 recomb_radA DNA repair and recombination protein RadA; InterPro: IPR011938 This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO), and archaeal RadB (IPR011939 from INTERPRO). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=20.02 E-value=17 Score=16.70 Aligned_cols=22 Identities=23% Similarity=0.332 Sum_probs=14.0 Q ss_pred HEEEEEEEEEECEEEECCCCCC Q ss_conf 2055439998020331233486 Q gi|254780374|r 204 DKIAEARISYGGKGRTTELLRP 225 (238) Q Consensus 204 ~~iAda~I~y~g~G~~~d~~~~ 225 (238) +-+|-.|=||.|||.|++-||. T Consensus 223 SLtsHFRaEY~GRg~LA~RQQK 244 (333) T TIGR02236 223 SLTSHFRAEYVGRGNLAERQQK 244 (333) T ss_pred CCCCCCCCCCCCHHHHHHHHHH T ss_conf 2100223676450247889869 Done!