Query gi|254780803|ref|YP_003065216.1| hypothetical protein CLIBASIA_03470 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 165 No_of_seqs 110 out of 354 Neff 8.0 Searched_HMMs 39220 Date Sun May 29 18:03:04 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780803.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG5468 Predicted secreted (pe 99.9 7.4E-24 1.9E-28 165.4 18.6 160 4-163 2-171 (172) 2 pfam09927 DUF2159 Predicted se 99.9 3.7E-21 9.5E-26 148.9 15.7 118 41-158 1-120 (120) 3 PRK10796 LPS-assembly lipoprot 98.8 2.1E-06 5.4E-11 58.7 16.7 153 5-162 2-164 (196) 4 COG2980 RlpB Rare lipoprotein 98.7 1E-06 2.6E-11 60.7 14.1 150 7-160 4-160 (178) 5 pfam04390 RplB Rare lipoprotei 98.1 0.0004 1E-08 44.9 15.2 124 36-159 11-141 (145) 6 PRK11627 hypothetical protein; 94.9 0.3 7.7E-06 27.2 12.1 115 6-126 2-152 (192) 7 pfam05643 DUF799 Putative bact 94.1 0.45 1.1E-05 26.2 11.3 33 96-128 127-159 (215) 8 PRK10802 peptidoglycan-associa 93.2 0.075 1.9E-06 31.0 2.8 28 1-28 1-28 (173) 9 PRK12700 flgH flagellar basal 92.1 0.14 3.6E-06 29.3 3.0 25 1-25 1-25 (230) 10 PRK13731 conjugal transfer sur 90.6 0.67 1.7E-05 25.2 5.2 78 1-79 1-99 (243) 11 PRK10866 outer membrane protei 89.7 0.24 6.2E-06 27.8 2.3 22 3-24 1-22 (243) 12 PRK00178 tolB translocation pr 85.6 2.7 7E-05 21.4 10.0 134 7-158 4-155 (433) 13 PRK01622 OxaA-like protein pre 84.7 1.1 2.9E-05 23.8 3.4 27 1-27 2-29 (266) 14 TIGR03302 OM_YfiO outer membra 82.7 0.79 2E-05 24.7 1.9 20 6-25 1-20 (235) 15 COG5567 Predicted small peripl 81.9 1.1 2.8E-05 23.8 2.4 29 3-31 1-30 (58) 16 TIGR02887 spore_ger_x_C germin 79.8 1.1 2.9E-05 23.7 1.9 17 6-22 2-18 (400) 17 PRK01742 tolB translocation pr 79.5 4.7 0.00012 20.0 9.6 140 1-158 1-161 (430) 18 PRK02889 tolB translocation pr 79.4 4.8 0.00012 19.9 9.8 45 98-158 112-156 (430) 19 PRK10525 cytochrome o ubiquino 78.4 1.9 4.7E-05 22.4 2.6 31 1-32 4-34 (305) 20 TIGR03525 GldK gliding motilit 76.0 1.4 3.7E-05 23.1 1.5 19 6-24 1-19 (449) 21 PRK11372 lysozyme inhibitor; P 75.8 2.1 5.3E-05 22.1 2.3 22 6-27 2-24 (109) 22 PRK02463 OxaA-like protein pre 74.1 3.2 8.1E-05 21.0 2.9 25 3-27 5-29 (307) 23 TIGR03516 ppisom_GldI peptidyl 73.4 4.9 0.00013 19.9 3.7 31 97-128 90-120 (177) 24 PRK02998 prsA peptidylprolyl i 71.6 4.1 0.00011 20.3 3.0 23 1-23 1-23 (283) 25 PRK13684 Ycf48-like protein; P 71.2 6.9 0.00018 18.9 4.0 47 1-47 3-56 (333) 26 PRK00249 flgH flagellar basal 71.1 2.5 6.4E-05 21.6 1.8 21 5-25 2-22 (231) 27 PRK12696 flgH flagellar basal 70.7 3.1 8E-05 21.1 2.2 20 5-24 1-20 (238) 28 PRK11023 hypothetical protein; 68.5 3.8 9.6E-05 20.6 2.2 19 5-23 2-20 (191) 29 TIGR03524 GldJ gliding motilit 68.3 5.2 0.00013 19.7 2.9 25 1-25 1-25 (559) 30 PRK12788 flgH flagellar basal 68.0 2.7 6.8E-05 21.5 1.4 18 7-24 1-18 (231) 31 TIGR03659 IsdE heme ABC transp 67.3 3.9 9.9E-05 20.5 2.1 22 4-25 1-22 (289) 32 COG3017 LolB Outer membrane li 66.9 5.4 0.00014 19.6 2.8 23 1-23 1-23 (206) 33 PRK10533 putative lipoprotein; 66.7 4.5 0.00011 20.1 2.3 25 8-32 5-30 (171) 34 pfam01514 YscJ_FliF Secretory 66.5 6.8 0.00017 19.0 3.2 26 7-32 9-35 (206) 35 PRK04405 prsA peptidylprolyl i 66.4 5.5 0.00014 19.6 2.7 22 3-24 5-26 (298) 36 PRK12450 foldase protein PrsA; 65.9 5.6 0.00014 19.5 2.7 22 3-24 4-25 (309) 37 PRK09837 copper/silver efflux 64.5 4.9 0.00013 19.8 2.2 21 11-31 7-27 (460) 38 PRK03002 prsA peptidylprolyl i 64.2 5.6 0.00014 19.5 2.4 23 1-24 1-23 (285) 39 PRK02944 OxaA-like protein pre 63.4 7.2 0.00018 18.9 2.8 22 3-24 2-23 (255) 40 PRK12698 flgH flagellar basal 62.6 4.6 0.00012 20.0 1.7 28 95-122 133-161 (224) 41 PRK09578 periplasmic multidrug 62.2 7.8 0.0002 18.6 2.8 20 4-23 5-24 (385) 42 TIGR03511 GldH_lipo gliding mo 62.1 12 0.0003 17.5 4.9 99 5-118 2-109 (156) 43 PRK10175 hypothetical protein; 61.9 2.9 7.5E-05 21.2 0.6 18 7-24 1-18 (75) 44 PRK03629 tolB translocation pr 61.0 12 0.00032 17.4 9.8 136 3-158 1-157 (430) 45 pfam03304 Mlp Mlp lipoprotein 60.8 4 0.0001 20.4 1.2 19 5-24 2-20 (150) 46 PRK09859 multidrug efflux syst 59.1 8.6 0.00022 18.4 2.6 25 1-26 1-25 (385) 47 PRK13616 lipoprotein LpqB; Pro 59.0 9 0.00023 18.2 2.7 29 1-29 1-29 (590) 48 TIGR02747 TraV type IV conjuga 58.8 6.3 0.00016 19.2 1.9 21 7-27 4-26 (174) 49 PRK11459 multidrug resistance 58.3 7.8 0.0002 18.6 2.3 30 1-30 1-30 (478) 50 PRK09915 putative outer membra 58.2 9.1 0.00023 18.2 2.6 31 1-31 1-33 (488) 51 pfam04507 DUF576 Protein of un 57.4 5.3 0.00014 19.6 1.3 21 5-25 6-26 (257) 52 PRK04792 tolB translocation pr 56.2 15 0.00038 16.9 8.9 144 12-158 9-177 (450) 53 TIGR01074 rep ATP-dependent DN 55.9 15 0.00039 16.9 3.5 75 17-111 39-114 (677) 54 COG3056 Uncharacterized lipopr 55.1 16 0.0004 16.8 11.7 127 4-133 13-175 (204) 55 TIGR03352 VI_chp_3 type VI sec 55.0 3.7 9.5E-05 20.6 0.2 21 8-28 2-22 (146) 56 PRK04043 tolB translocation pr 54.9 16 0.0004 16.8 11.7 46 98-158 102-147 (419) 57 TIGR01433 CyoA ubiquinol oxida 54.1 7 0.00018 18.9 1.5 22 11-32 2-23 (228) 58 COG4594 FecB ABC-type Fe3+-cit 53.8 11 0.00028 17.7 2.5 25 1-25 1-25 (310) 59 pfam07901 DUF1672 Protein of u 53.2 8.7 0.00022 18.3 1.8 20 8-27 4-23 (304) 60 PRK11443 hypothetical protein; 52.9 8 0.0002 18.5 1.6 21 7-27 1-21 (120) 61 PRK01326 prsA foldase protein 52.8 13 0.00032 17.4 2.6 21 5-25 4-24 (310) 62 COG4669 EscJ Type III secretor 52.2 12 0.0003 17.5 2.4 26 4-30 2-27 (246) 63 KOG0604 consensus 51.6 9.2 0.00024 18.2 1.8 16 21-36 257-272 (400) 64 COG3317 NlpB Uncharacterized l 51.4 12 0.0003 17.6 2.3 25 4-28 2-26 (342) 65 TIGR02898 spore_YhcN_YlaJ spor 51.4 10 0.00026 17.9 2.0 17 10-26 3-21 (185) 66 PRK11548 hypothetical protein; 51.3 11 0.00029 17.6 2.2 16 14-29 12-29 (113) 67 PRK10477 outer membrane lipopr 51.2 11 0.00028 17.7 2.1 21 4-24 1-21 (177) 68 COG5645 Predicted periplasmic 51.0 5.6 0.00014 19.5 0.6 17 8-24 3-19 (80) 69 COG4939 Major membrane immunog 51.0 9.5 0.00024 18.1 1.7 18 9-26 6-23 (147) 70 pfam03886 DUF330 Protein of un 50.8 18 0.00046 16.4 11.7 92 49-153 59-159 (160) 71 PRK10838 spr putative outer me 50.5 13 0.00033 17.3 2.4 27 3-29 7-33 (188) 72 COG3417 FlgN Collagen-binding 50.2 13 0.00032 17.3 2.3 20 5-24 3-22 (200) 73 COG4808 Uncharacterized protei 50.2 15 0.00038 16.9 2.7 22 3-24 4-25 (152) 74 COG3065 Slp Starvation-inducib 50.1 15 0.00038 16.9 2.7 25 3-27 3-28 (191) 75 PRK11063 metQ DL-methionine tr 49.7 17 0.00043 16.6 2.8 13 12-24 13-25 (271) 76 PRK12699 flgH flagellar basal 49.4 11 0.00027 17.8 1.8 21 4-25 16-36 (246) 77 pfam08139 LPAM_1 Prokaryotic m 48.8 12 0.00031 17.5 2.0 19 7-25 2-20 (26) 78 PRK13883 conjugal transfer pro 48.2 12 0.00029 17.6 1.8 25 6-30 1-25 (156) 79 pfam06474 MLTD_N MLTD_N. 47.6 8.9 0.00023 18.3 1.2 19 8-26 2-20 (93) 80 TIGR02722 lp_ uncharacterized 47.4 18 0.00045 16.4 2.7 30 4-33 1-31 (215) 81 PRK10449 heat-inducible protei 47.4 11 0.00027 17.8 1.5 21 7-27 2-22 (140) 82 PRK10797 glutamate and asparta 46.9 21 0.00054 16.0 3.0 25 1-25 1-25 (302) 83 COG4764 Uncharacterized protei 43.6 15 0.00039 16.8 1.9 21 7-27 5-25 (197) 84 TIGR00752 slp outer membrane l 42.8 12 0.0003 17.5 1.2 23 6-28 2-24 (190) 85 PRK10760 murein hydrolase B; P 42.1 13 0.00034 17.2 1.4 17 117-134 326-342 (357) 86 PRK06760 hypothetical protein; 42.0 7.1 0.00018 18.9 -0.0 23 3-25 1-24 (223) 87 PRK12407 flgH flagellar basal 41.8 12 0.0003 17.5 1.1 16 8-23 2-17 (220) 88 pfam08085 Entericidin Enterici 41.5 22 0.00056 15.9 2.4 19 4-22 2-21 (42) 89 PRK11162 mltA murein transglyc 41.5 18 0.00046 16.4 2.0 22 6-27 2-23 (362) 90 PRK00022 lolB outer membrane l 41.3 17 0.00042 16.6 1.7 16 12-27 8-23 (203) 91 TIGR01742 SA_tandem_lipo Staph 40.9 14 0.00036 17.1 1.3 24 1-24 1-26 (267) 92 pfam06085 Rz1 Lipoprotein Rz1 40.7 15 0.00038 16.9 1.5 24 5-28 3-26 (59) 93 TIGR02643 T_phosphoryl thymidi 40.7 15 0.00039 16.8 1.5 22 13-34 95-117 (440) 94 TIGR02544 III_secr_YscJ type I 40.6 24 0.00061 15.7 2.5 24 8-31 3-27 (203) 95 pfam02352 Decorin_bind Decorin 40.0 23 0.00059 15.7 2.3 22 1-25 1-23 (166) 96 pfam06604 consensus 39.9 24 0.00062 15.6 2.4 22 1-23 1-22 (181) 97 COG3168 PilP Tfp pilus assembl 38.3 13 0.00034 17.2 0.9 18 8-25 2-19 (170) 98 PRK03095 prsA peptidylprolyl i 37.7 25 0.00065 15.5 2.2 12 13-24 10-21 (287) 99 pfam06291 Lambda_Bor Bor prote 37.0 20 0.00052 16.1 1.7 23 7-29 2-24 (97) 100 pfam11153 DUF2931 Protein of u 36.6 17 0.00043 16.6 1.2 16 8-23 2-17 (202) 101 pfam11254 DUF3053 Protein of u 35.7 26 0.00067 15.4 2.1 20 5-24 2-21 (229) 102 COG2834 LolA Outer membrane li 33.8 30 0.00077 15.0 2.1 26 1-26 1-26 (211) 103 PRK11653 hypothetical protein; 33.3 27 0.00069 15.3 1.8 23 5-27 20-42 (223) 104 pfam07854 DUF1646 Protein of u 32.9 31 0.0008 14.9 2.1 24 4-28 226-249 (347) 105 PRK10270 hypothetical protein; 32.4 36 0.00091 14.6 3.0 20 3-22 1-20 (340) 106 pfam12262 Lipase_bact_N Bacter 32.2 31 0.00079 15.0 2.0 20 4-24 2-21 (269) 107 PRK11138 outer membrane protei 32.1 34 0.00086 14.8 2.1 22 1-24 1-22 (394) 108 TIGR01781 Trep_dent_lipo Trepo 31.9 35 0.0009 14.6 2.2 22 4-25 3-24 (463) 109 pfam11873 DUF3393 Domain of un 31.9 18 0.00045 16.5 0.6 19 8-26 2-20 (200) 110 PRK06193 hypothetical protein; 31.0 36 0.00091 14.6 2.1 21 5-25 2-22 (211) 111 PRK11671 mltC murein transglyc 31.0 29 0.00073 15.2 1.6 17 6-22 2-18 (360) 112 PRK09967 putative outer membra 31.0 31 0.0008 14.9 1.8 10 15-24 12-21 (160) 113 TIGR01329 cysta_beta_ly_E cyst 30.7 24 0.00062 15.6 1.2 17 15-31 274-290 (415) 114 PRK10510 putative outer membra 30.6 38 0.00097 14.4 2.5 24 4-27 3-26 (219) 115 COG5510 Predicted small secret 29.2 40 0.001 14.3 2.6 23 1-23 1-23 (44) 116 PRK10215 hypothetical protein; 29.0 32 0.00082 14.9 1.6 24 4-27 6-30 (219) 117 PRK11189 lipoprotein NlpI; Pro 28.7 40 0.001 14.3 2.1 20 4-23 2-21 (297) 118 TIGR03529 GldK_short gliding m 28.2 18 0.00046 16.4 0.2 21 5-25 1-21 (344) 119 TIGR02917 PEP_TPR_lipo putativ 27.8 43 0.0011 14.1 2.4 17 6-22 4-21 (924) 120 pfam01298 Lipoprotein_5 Transf 27.7 30 0.00076 15.1 1.2 21 1-23 1-21 (589) 121 COG1704 LemA Uncharacterized c 27.0 37 0.00095 14.5 1.6 26 4-29 2-27 (185) 122 PRK10781 rcsF outer membrane l 27.0 32 0.00082 14.9 1.3 16 8-23 2-17 (133) 123 pfam12273 RCR Chitin synthesis 26.8 43 0.0011 14.1 1.9 22 10-31 9-38 (124) 124 TIGR03300 assembly_YfgL outer 26.3 31 0.00079 15.0 1.1 17 8-24 2-18 (377) 125 KOG2791 consensus 26.2 38 0.00097 14.4 1.6 24 1-29 2-25 (455) 126 PRK09861 cytoplasmic membrane 25.9 46 0.0012 13.9 2.9 19 6-24 8-26 (272) 127 COG3218 ABC-type uncharacteriz 25.6 47 0.0012 13.9 12.6 143 4-160 8-199 (205) 128 COG0386 BtuE Glutathione perox 25.5 17 0.00043 16.6 -0.3 12 19-30 34-45 (162) 129 COG3957 Phosphoketolase [Carbo 24.7 46 0.0012 13.9 1.8 16 16-31 232-247 (793) 130 pfam09716 ETRAMP Malarial earl 24.3 47 0.0012 13.9 1.7 25 5-31 2-26 (84) 131 KOG4544 consensus 24.1 24 0.0006 15.7 0.2 25 4-28 72-96 (144) 132 TIGR01734 D-ala-DACP-lig D-ala 24.0 50 0.0013 13.7 2.2 16 12-27 66-81 (513) 133 pfam11810 DUF3332 Domain of un 23.7 51 0.0013 13.7 2.1 19 4-22 2-20 (175) 134 COG5633 Predicted periplasmic 23.7 32 0.00082 14.9 0.8 16 12-27 7-22 (123) 135 PRK11669 pbpG D-alanyl-D-alani 22.9 53 0.0013 13.6 2.0 19 10-28 5-23 (308) 136 PRK10461 thiamine biosynthesis 22.9 53 0.0013 13.5 4.4 18 9-26 7-24 (351) 137 TIGR02522 pilus_cpaD pilus (Ca 22.7 50 0.0013 13.7 1.6 21 10-30 2-23 (211) 138 PRK10759 hypothetical protein; 22.7 50 0.0013 13.7 1.6 14 10-23 3-16 (106) 139 TIGR02521 type_IV_pilW type IV 22.5 40 0.001 14.3 1.1 23 12-34 2-25 (247) 140 PRK11397 dacD D-alanyl-D-alani 22.4 54 0.0014 13.5 2.5 25 1-25 1-26 (390) 141 PRK05137 tolB translocation pr 22.4 54 0.0014 13.5 10.0 45 98-158 117-161 (437) 142 TIGR03042 PS_II_psbQ_bact phot 22.3 54 0.0014 13.5 2.3 23 3-27 1-23 (142) 143 pfam03783 CsgG Curli productio 21.9 55 0.0014 13.4 8.5 84 71-154 89-199 (207) 144 pfam02402 Lysis_col Lysis prot 21.8 30 0.00075 15.1 0.3 16 12-27 9-24 (46) 145 pfam05715 zf-piccolo Piccolo Z 21.7 27 0.00068 15.4 0.1 10 19-28 33-42 (61) 146 COG4826 Serine protease inhibi 21.5 56 0.0014 13.4 2.5 56 9-64 3-58 (410) 147 pfam08627 CRT-like CRT-like. T 20.7 52 0.0013 13.6 1.4 11 19-29 93-103 (130) 148 PRK04168 hypothetical protein; 20.6 58 0.0015 13.3 1.6 21 3-23 4-24 (336) 149 PRK09558 ushA bifunctional UDP 20.6 59 0.0015 13.3 2.5 25 3-27 1-25 (551) 150 pfam04549 CD47 CD47 transmembr 20.5 59 0.0015 13.2 1.7 20 9-30 106-125 (156) 151 COG2859 Uncharacterized protei 20.4 45 0.0011 14.0 1.0 18 8-25 8-25 (237) No 1 >COG5468 Predicted secreted (periplasmic) protein [Function unknown] Probab=99.93 E-value=7.4e-24 Score=165.44 Aligned_cols=160 Identities=21% Similarity=0.352 Sum_probs=135.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHCCCCC--C-CCCEEEEEEECC Q ss_conf 7889999999999998368514334157767202232578407774144899999997314777--7-671689997100 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVYPFYYLKNQDRTEYIHPIKVLVVSKNNKNEIYRSINFLTSTVR--T-KNLYQLEVNIDS 80 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~PlY~~~~~~~~~~l~~I~i~~~~~r~~~~i~n~L~~~~~~~~--~-~~~y~l~i~~~~ 80 (165) +|.+....+.+-+.++++|.+.|+|+....+....+..|+|.+..+|.||.+||+|.+.++... + .+.|.|.+.+++ T Consensus 2 ~r~l~~va~a~S~~~l~~C~v~PLYsg~~a~~~~~LAsI~i~~a~~r~gqevRnqLifl~s~gaGePA~p~y~Lsl~vs~ 81 (172) T COG5468 2 KRTLARVALAFSLATLTACQVRPLYSGERAGTESGLASIEISPAGGRGGQEVRNQLIFLFSGGAGEPANPQYYLSLSVSR 81 (172) T ss_pred HHHHHHHHHHHHHHHHHCCEEEECCCCCCCCHHCCEEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEEE T ss_conf 01899999999999863114554135787530102026787426765889999999999616899988821689987431 Q ss_pred CEEECC--C-----CCCCCCEEEEEEEEEEEEEECCCCCEEEEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 000000--2-----467650489999999999980589587742579998606776157799999999999999999999 Q gi|254780803|r 81 FTDHAI--S-----NAFFKNIGRITLKAKYYFKEISGKNILYENNTDVTSLFDFSDQQFSQLRSHKSSEEKAIQELSENI 153 (165) Q Consensus 81 ~~~~~i--~-----~~~~~~~~~~~~~~~y~L~~~~~~~~i~~~~~~~~~~Y~~~~~~f~~~~~e~~a~erl~~~lae~I 153 (165) ....+. + .++.++..++.|+++|.|.|.++|+++..|..+..++||...|+|+.+++++|+|+|+++++||.| T Consensus 82 ~a~~~~~~nv~~~~d~d~p~ag~v~mt~~y~Ltdak~G~~v~~Grr~vtas~D~p~QefA~irA~rDaenRAArElAe~i 161 (172) T COG5468 82 FARGVRLVNVGNDTDRDRPSAGRVVMTADYVLTDAKDGAEVHKGRRSVTASFDRPRQEFAKIRAQRDAENRAARELAEII 161 (172) T ss_pred EEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEECCCCCEEEECCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 03457887405655578987757999865898754788776524315788734751478888887606778999999999 Q ss_pred HHHHHHHHHC Q ss_conf 9999999740 Q gi|254780803|r 154 YIDIISFIRT 163 (165) Q Consensus 154 ~~~i~~~l~~ 163 (165) ..+|+.||-. T Consensus 162 ~ldlA~~L~r 171 (172) T COG5468 162 QLDLAAWLGR 171 (172) T ss_pred HHHHHHHHHC T ss_conf 8899999635 No 2 >pfam09927 DUF2159 Predicted secreted (periplasmic) protein (DUF2159). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=99.88 E-value=3.7e-21 Score=148.92 Aligned_cols=118 Identities=14% Similarity=0.215 Sum_probs=108.6 Q ss_pred EEEEECCCCHHHHHHHHHHHHHCCCC-CCCCCEEEEEEECCCEEEC-CCCCCCCCEEEEEEEEEEEEEECCCCCEEEEEE Q ss_conf 57840777414489999999731477-7767168999710000000-024676504899999999999805895877425 Q gi|254780803|r 41 PIKVLVVSKNNKNEIYRSINFLTSTV-RTKNLYQLEVNIDSFTDHA-ISNAFFKNIGRITLKAKYYFKEISGKNILYENN 118 (165) Q Consensus 41 ~I~i~~~~~r~~~~i~n~L~~~~~~~-~~~~~y~l~i~~~~~~~~~-i~~~~~~~~~~~~~~~~y~L~~~~~~~~i~~~~ 118 (165) .|+|.+|++|.||.++|+|.+++|.+ +..|.|.|.+.++++...+ +..++.++++++.++++|+|++..+|++|.+|+ T Consensus 1 ~I~v~~~~~R~g~~lr~~L~~rLg~g~~~~p~Y~L~v~~~~~~~~~~i~~~~~~tr~~~~~~a~y~L~d~~tg~~v~sG~ 80 (120) T pfam09927 1 AIEVAPIDGRQGQLLRNALEDRLGGGRPAAPRYRLSVRLDTSRESLGIRPDNDTTRYNLTGTATYTLVDLATGKVVASGT 80 (120) T ss_pred CEEECCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEEEEEEEEECCCCEEEEEEEEEEEEEEEECCCCCEEEEEE T ss_conf 96880899934599999999984689999972899999988886434704895079999999999999889997999507 Q ss_pred EEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7999860677615779999999999999999999999999 Q gi|254780803|r 119 TDVTSLFDFSDQQFSQLRSHKSSEEKAIQELSENIYIDII 158 (165) Q Consensus 119 ~~~~~~Y~~~~~~f~~~~~e~~a~erl~~~lae~I~~~i~ 158 (165) ++.+++|+..+++|++.++|+||++|++..|||+|..+|+ T Consensus 81 ~~~~tsy~~~~s~~A~~~A~~DA~~Rla~~lAd~I~~rLa 120 (120) T pfam09927 81 VTSFTSYDATGSTYATLAAERDAEERLARELADQIVTRLA 120 (120) T ss_pred EEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999867476878999999999999999999999998629 No 3 >PRK10796 LPS-assembly lipoprotein RlpB; Provisional Probab=98.76 E-value=2.1e-06 Score=58.74 Aligned_cols=153 Identities=10% Similarity=0.039 Sum_probs=93.5 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCEEEEEEEEECC--CCHHHHHHHHHHHHHC----CCCCC-CCCEEEEEE Q ss_conf 8899999999999983685143341577672022325784077--7414489999999731----47777-671689997 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSVYPFYYLKNQDRTEYIHPIKVLVV--SKNNKNEIYRSINFLT----STVRT-KNLYQLEVN 77 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf~PlY~~~~~~~~~~l~~I~i~~~--~~r~~~~i~n~L~~~~----~~~~~-~~~y~l~i~ 77 (165) |.++.+++...+++++||||+ +-+ +..+...+..+.+... .+...+.++.+|.-.. ..+.. ++.-.|.+. T Consensus 2 r~l~~l~l~lavll~agCGFh-LRg--~~~lP~~l~tL~l~S~Dpys~Ltr~vr~qLr~~gV~lv~~~~~~~~vp~L~i~ 78 (196) T PRK10796 2 RYLATLLLSLAVLVTAGCGWH-LRG--TTQVPSEMKTMILDSGDPNGPLSRAVRNQLRLNGVELLDKDTTRKDVPSLRLG 78 (196) T ss_pred HHHHHHHHHHHHHHHCCCCCE-ECC--CCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCEEEEC T ss_conf 569999999999997476715-768--99998644458998048786799999999987795896378752458569971 Q ss_pred ECC-CEEE-CCCCCCCCCEEEEEEEEEEEEEECCCCCEEEEEEEEEEEEECCCCCH-HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 100-0000-00246765048999999999998058958774257999860677615-77999999999999999999999 Q gi|254780803|r 78 IDS-FTDH-AISNAFFKNIGRITLKAKYYFKEISGKNILYENNTDVTSLFDFSDQQ-FSQLRSHKSSEEKAIQELSENIY 154 (165) Q Consensus 78 ~~~-~~~~-~i~~~~~~~~~~~~~~~~y~L~~~~~~~~i~~~~~~~~~~Y~~~~~~-f~~~~~e~~a~erl~~~lae~I~ 154 (165) ... .... .+..+|.+-.|.+..+|+|++.. - +..++.-+++..-+|-...+. -+...-++-..+-+=+.+|.+|. T Consensus 79 ~e~~~~rtlSl~~~Gr~AEYeL~~~V~~~v~~-P-~~~~~p~~~~v~R~y~dnp~~aLAKs~E~elL~~EMR~~aA~Qii 156 (196) T PRK10796 79 GVSISQDTASVFRNGQTAEYQMVMTVNAQVLI-P-GHDIYPISVKVFRSFFDNPQMALAKDNEQDMIVQEMYDQAAEQLI 156 (196) T ss_pred CCCCCCEEEEECCCCHHHHHEEEEEEEEEEEE-C-CCCCEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 56555316988157465201788899999995-7-986056688999823479778762699999999999999999999 Q ss_pred HHHHHHHH Q ss_conf 99999974 Q gi|254780803|r 155 IDIISFIR 162 (165) Q Consensus 155 ~~i~~~l~ 162 (165) .||+...+ T Consensus 157 RrL~~v~~ 164 (196) T PRK10796 157 RKLLSVRA 164 (196) T ss_pred HHHHHHHH T ss_conf 99866408 No 4 >COG2980 RlpB Rare lipoprotein B [Cell envelope biogenesis, outer membrane] Probab=98.72 E-value=1e-06 Score=60.70 Aligned_cols=150 Identities=12% Similarity=0.098 Sum_probs=90.7 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCCCEEEE--EEEEECCCCHHHHHHHHHHHHHCCCC--CCCCCEEEEEEE-CCC Q ss_conf 9999999999998368514334157767202232--57840777414489999999731477--776716899971-000 Q gi|254780803|r 7 LRILVVFNFFLLINSCSVYPFYYLKNQDRTEYIH--PIKVLVVSKNNKNEIYRSINFLTSTV--RTKNLYQLEVNI-DSF 81 (165) Q Consensus 7 ~k~l~~~~~~~~ls~CGf~PlY~~~~~~~~~~l~--~I~i~~~~~r~~~~i~n~L~~~~~~~--~~~~~y~l~i~~-~~~ 81 (165) ++.++++....+|+||||+ +-+ +.+....+. .++...+-+.....+++.|..-...- ..+..-.|.+.. +.. T Consensus 4 L~~~lL~~a~~~L~aCGFh-LRg--~~~ip~~l~~l~l~s~d~y~~l~r~vrr~L~~n~v~vv~~~~~~p~Lri~se~~~ 80 (178) T COG2980 4 LKTLLLLAAVLLLAACGFH-LRG--TTQIPPELKTLSLESSDPYGPLTRAVRRQLRLNGVDVVDTAKDVPVLRILSESTS 80 (178) T ss_pred HHHHHHHHHHHHHHHCCEE-ECC--CCCCCCCHHHEEEECCCCCCHHHHHHHHHHHHCCCEEEECCCCCCEEEECCEECC T ss_conf 5899999999997334534-317--8889941133155436898768999999999849645314778607886022215 Q ss_pred -EEECCCCCCCCCEEEEEEEEEEEEEECCCCCEEEEEEEEEEEEECCCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Q ss_conf -000002467650489999999999980589587742579998606776157799999999-999999999999999999 Q gi|254780803|r 82 -TDHAISNAFFKNIGRITLKAKYYFKEISGKNILYENNTDVTSLFDFSDQQFSQLRSHKSS-EEKAIQELSENIYIDIIS 159 (165) Q Consensus 82 -~~~~i~~~~~~~~~~~~~~~~y~L~~~~~~~~i~~~~~~~~~~Y~~~~~~f~~~~~e~~a-~erl~~~lae~I~~~i~~ 159 (165) ....+...|.+-.|.++++++|++.+- ++.+++.-+++++-+|--+....--..+|+.+ ..-+=+.+|++|..+|.+ T Consensus 81 ~~t~Si~~~gr~aEy~L~~~v~a~v~~~-~~~v~~p~~v~V~R~~~dn~~~aLaK~~E~e~l~~EMR~daa~QLvrrl~s 159 (178) T COG2980 81 QRTASIFRQGRAAEYQLTLTVEAQVLDP-NDDVLYPISVKVFRSFFDNPNMALAKDEEREMLWNEMRRDAAEQLVRRLAS 159 (178) T ss_pred CCEEEEEHHHHHHHHHEEEEEEEEEECC-CCCCCCCEEEEEEEEECCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 5327863214666561246889999818-987568647899885105877775233789999999999999999999987 Q ss_pred H Q ss_conf 9 Q gi|254780803|r 160 F 160 (165) Q Consensus 160 ~ 160 (165) - T Consensus 160 l 160 (178) T COG2980 160 L 160 (178) T ss_pred H T ss_conf 1 No 5 >pfam04390 RplB Rare lipoprotein B family. The Escherichia coli family member has been named Rare lipoprotein B (RplB). Thioglyceride and N-fatty acyl residues may be attached to the N-terminal cysteine, which is conserved in this family. RplB is speculated to be involved in cell duplication. Probab=98.13 E-value=0.0004 Score=44.86 Aligned_cols=124 Identities=6% Similarity=0.020 Sum_probs=80.9 Q ss_pred CEEEEEEEEE--CCCCHHHHHHHHHHHHHCC--CCCCCCCEEEEEEECCCEEE--CCCCCCCCCEEEEEEEEEEEEEECC Q ss_conf 0223257840--7774144899999997314--77776716899971000000--0024676504899999999999805 Q gi|254780803|r 36 TEYIHPIKVL--VVSKNNKNEIYRSINFLTS--TVRTKNLYQLEVNIDSFTDH--AISNAFFKNIGRITLKAKYYFKEIS 109 (165) Q Consensus 36 ~~~l~~I~i~--~~~~r~~~~i~n~L~~~~~--~~~~~~~y~l~i~~~~~~~~--~i~~~~~~~~~~~~~~~~y~L~~~~ 109 (165) ...+..+.+. ++.......+++.|....- ...++..+.|.+........ .++..|.+..|.+.++++|++.+.. T Consensus 11 ~~~~~~l~l~s~~~~~~l~~~l~~~L~~~gv~v~~~~~a~~~L~i~~~~~~rr~lSv~~~grv~EYeL~~~v~~~v~~~~ 90 (145) T pfam04390 11 PLALKTLYLSSSDPYSELTRELRRQLRLSGVKVVDDAAADPVLRLLSESRSRRTLSLFSQGRVAEYELTYTVNFRVRDPN 90 (145) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHHCCCEECCCCCCCEEEEECCCCCEEEEEEECCCCEEEEEEEEEEEEEEEECCC T ss_conf 98867799993699987999999999887988826877856999824611079998747972102899999999998699 Q ss_pred CCCEEEEEEEEEEEEECCCCCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89587742579998606776157-799999999999999999999999999 Q gi|254780803|r 110 GKNILYENNTDVTSLFDFSDQQF-SQLRSHKSSEEKAIQELSENIYIDIIS 159 (165) Q Consensus 110 ~~~~i~~~~~~~~~~Y~~~~~~f-~~~~~e~~a~erl~~~lae~I~~~i~~ 159 (165) +...+-..++...-+|...++.. +...-|+-..+-+-+.+|.+|..||.. T Consensus 91 g~~l~~~~~i~~~R~~~~d~~~~Lak~~Ee~~L~~eM~~d~a~qIlrRL~~ 141 (145) T pfam04390 91 GELLIPPTTVEVRRDYLDDPNNALAKSQEEELLRREMRRDAAQQIIRRLAA 141 (145) T ss_pred CCCCCCCEEEEEEEEEECCHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 987138768999998412956762319999999999999999999999874 No 6 >PRK11627 hypothetical protein; Provisional Probab=94.87 E-value=0.3 Score=27.24 Aligned_cols=115 Identities=10% Similarity=0.089 Sum_probs=53.2 Q ss_pred HHHHHHHHHHHHHHHCCCCEE-------EEEC--CCCCCCEEEEEEEEEC----------------------CCCHHHHH Q ss_conf 899999999999983685143-------3415--7767202232578407----------------------77414489 Q gi|254780803|r 6 ILRILVVFNFFLLINSCSVYP-------FYYL--KNQDRTEYIHPIKVLV----------------------VSKNNKNE 54 (165) Q Consensus 6 i~k~l~~~~~~~~ls~CGf~P-------lY~~--~~~~~~~~l~~I~i~~----------------------~~~r~~~~ 54 (165) ++|+++.+..+++|+||.=+| -+.. .++.... ..+.+.. +....... T Consensus 2 mkk~~~~l~a~~lL~gCas~p~~l~l~P~~~~~~~~~~~~~--~~vsv~s~D~R~~~~va~~~~~g~~~~L~~s~~l~~~ 79 (192) T PRK11627 2 LKKILFPLVALFMLAGCATPPTTLEVSPKITLPQQDPSLMG--VTVSINGADQRQDQALAKVTRDNQLVTLTPSRDLRFL 79 (192) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCC--EEEEEEEEECCCCCEEEEECCCCCEEEECCCCCHHHH T ss_conf 28799999999999860699763786788766434554577--0599987753665326798069946770478688999 Q ss_pred HHHHHHHHCCCC----CCCCCEEEEEEECCCEEECCCCCCCCCEEEEEEEEEEEEEEC-CCCCEEEEEEEEEEEEEC Q ss_conf 999999731477----776716899971000000002467650489999999999980-589587742579998606 Q gi|254780803|r 55 IYRSINFLTSTV----RTKNLYQLEVNIDSFTDHAISNAFFKNIGRITLKAKYYFKEI-SGKNILYENNTDVTSLFD 126 (165) Q Consensus 55 i~n~L~~~~~~~----~~~~~y~l~i~~~~~~~~~i~~~~~~~~~~~~~~~~y~L~~~-~~~~~i~~~~~~~~~~Y~ 126 (165) +.+.|...+... ++.....+.+.+.+-. .+.+...-+|++...+...+.-. .+|.. +.+.+++..++. T Consensus 80 lq~~l~~ql~sqG~~i~~~a~~~l~v~i~~l~---a~V~q~~~~y~i~t~v~i~I~v~~~ng~k-~tK~y~~~~~~e 152 (192) T PRK11627 80 LQEVLEKQMTARGYMIGPNGAVNLQIIVNQLY---ADVSQGNVRYNIATKADISIIATAQNGNK-MTKNYRASYNVE 152 (192) T ss_pred HHHHHHHHHHHCCEEECCCCCCEEEEEHHHHC---CCCCCCEEEEEECCEEEEEEEEECCCCCE-EEEEEECCEECC T ss_conf 99999999985881715887625999968931---60255504777524799999998378977-778872322115 No 7 >pfam05643 DUF799 Putative bacterial lipoprotein (DUF799). This family consists of several bacterial proteins of unknown function. Some of the family members are described as putative lipoproteins. Probab=94.12 E-value=0.45 Score=26.20 Aligned_cols=33 Identities=15% Similarity=0.100 Sum_probs=24.9 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEEEEEEECCC Q ss_conf 999999999998058958774257999860677 Q gi|254780803|r 96 RITLKAKYYFKEISGKNILYENNTDVTSLFDFS 128 (165) Q Consensus 96 ~~~~~~~y~L~~~~~~~~i~~~~~~~~~~Y~~~ 128 (165) ...++++++|+|..+|+++++|+.+....-+++ T Consensus 127 ~t~V~~~a~LvdlrTG~~LW~g~a~~~~~~~~~ 159 (215) T pfam05643 127 VTTVSAKARLVDSRNGKELWSGSASIREGSNNS 159 (215) T ss_pred EEEEEEEEEEEECCCCCEEEEEEEEECCCCCCC T ss_conf 379999999984567867653103400168888 No 8 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=93.24 E-value=0.075 Score=30.96 Aligned_cols=28 Identities=21% Similarity=0.275 Sum_probs=24.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 9157889999999999998368514334 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVYPFY 28 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~PlY 28 (165) |.+.|++|.+++...+++|++|+-+|-- T Consensus 1 M~~nk~~K~l~ia~~~l~LaACSS~~~~ 28 (173) T PRK10802 1 MQLNKVLKGLMIALPVMAIAACSSNKNA 28 (173) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9607899999999999999855799988 No 9 >PRK12700 flgH flagellar basal body L-ring protein; Reviewed Probab=92.07 E-value=0.14 Score=29.29 Aligned_cols=25 Identities=28% Similarity=0.571 Sum_probs=23.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCE Q ss_conf 9157889999999999998368514 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~ 25 (165) |+++-++|+.+++.++++|+||.-. T Consensus 1 ~~~~t~~Rl~~~~~~~~ll~GCa~~ 25 (230) T PRK12700 1 MMLKTVLRLPVCAALLALAAGCAMI 25 (230) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 9158899999999999997444689 No 10 >PRK13731 conjugal transfer surface exclusion protein TraT; Provisional Probab=90.63 E-value=0.67 Score=25.15 Aligned_cols=78 Identities=17% Similarity=0.211 Sum_probs=39.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEE-EEECCCCCCCEEE-EEEEEECCC------------C----HHHHHHHHHHHHH Q ss_conf 91578899999999999983685143-3415776720223-257840777------------4----1448999999973 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVYP-FYYLKNQDRTEYI-HPIKVLVVS------------K----NNKNEIYRSINFL 62 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~P-lY~~~~~~~~~~l-~~I~i~~~~------------~----r~~~~i~n~L~~~ 62 (165) |.+|+++...++ +..++|+||+-.- .-+.++..+.-++ ..|-+++++ + .+...|.+.|... T Consensus 1 m~~kk~~~~~~~-~~~l~lsGC~a~~tai~krnL~VqTkMS~TIfLdP~s~ktVyv~vrNTS~~~~~~l~~~i~~~L~ak 79 (243) T PRK13731 1 MKTKKLMMVALV-SSTLALSGCGAMSTAIKKRNLEVKTQMSETIWLEPASERTVFLQIKNTSDKDMSGLQGKIADAVKAK 79 (243) T ss_pred CCHHHHHHHHHH-HHHHHHCCHHHHHHHHHHCCCCEECEECCCEEECCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHC T ss_conf 933775799999-9999971568767787735752222000438866854557999985178772456789999999858 Q ss_pred C---CCCCCCCCEEEEEEEC Q ss_conf 1---4777767168999710 Q gi|254780803|r 63 T---STVRTKNLYQLEVNID 79 (165) Q Consensus 63 ~---~~~~~~~~y~l~i~~~ 79 (165) - ..++.+..|.|..++- T Consensus 80 GY~iv~~P~~A~Y~LQaNVL 99 (243) T PRK13731 80 GYQVVTSPDKAYYWIQANVL 99 (243) T ss_pred CEEEECCHHHCEEEEEEEEH T ss_conf 90986786576077274240 No 11 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=89.67 E-value=0.24 Score=27.83 Aligned_cols=22 Identities=18% Similarity=0.375 Sum_probs=18.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 5788999999999999836851 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSV 24 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf 24 (165) ++|++++++++++.++|+||+= T Consensus 1 m~~~k~l~~~~~l~l~l~gCs~ 22 (243) T PRK10866 1 MTRMKYLVAAATLSLFLAGCSG 22 (243) T ss_pred CCHHHHHHHHHHHHHHHHHCCC T ss_conf 9458999999999999987689 No 12 >PRK00178 tolB translocation protein TolB; Provisional Probab=85.59 E-value=2.7 Score=21.40 Aligned_cols=134 Identities=7% Similarity=0.056 Sum_probs=66.4 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCCCEEEEEEEEECCC--------CHHHHHHHHHHHHHCCCCCC-CCCEEEEE- Q ss_conf 999999999999836851433415776720223257840777--------41448999999973147777-67168999- Q gi|254780803|r 7 LRILVVFNFFLLINSCSVYPFYYLKNQDRTEYIHPIKVLVVS--------KNNKNEIYRSINFLTSTVRT-KNLYQLEV- 76 (165) Q Consensus 7 ~k~l~~~~~~~~ls~CGf~PlY~~~~~~~~~~l~~I~i~~~~--------~r~~~~i~n~L~~~~~~~~~-~~~y~l~i- 76 (165) |+.++++.++++++++-+.+|+-+=..+.. .--.|.|.+.. ..+.+++++.|.. .|-... .+...+.. T Consensus 4 l~~~~~l~~~l~~~~~a~A~l~I~It~g~~-~~ipIAI~pF~~~~~~~~~~~i~~iI~~DL~~-sG~F~~l~~~~~~~~p 81 (433) T PRK00178 4 LKRIALLVLCLLAGGAAAAELNIEITSGSD-RAIPIAVVPFGWQGGSVLPEDMAQIIGNDLRN-SGYFEPIPRQNMISLP 81 (433) T ss_pred HHHHHHHHHHHHHHHHHCEEEEEEEECCCC-CEEEEEECCCCCCCCCCHHHHHHHHHHHHHHC-CCCCEECCHHHCCCCC T ss_conf 799999999998767622039999934777-60406985766668762248999999988862-8885524835524688 Q ss_pred ----EECCCEEECCCCC----CCCCEEEEEEEEEEEEEECCCCCEEEEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHH Q ss_conf ----7100000000246----76504899999999999805895877425799986067761577999999999999999 Q gi|254780803|r 77 ----NIDSFTDHAISNA----FFKNIGRITLKAKYYFKEISGKNILYENNTDVTSLFDFSDQQFSQLRSHKSSEEKAIQE 148 (165) Q Consensus 77 ----~~~~~~~~~i~~~----~~~~~~~~~~~~~y~L~~~~~~~~i~~~~~~~~~~Y~~~~~~f~~~~~e~~a~erl~~~ 148 (165) .+....=..+..+ |......-.+.++|+|.|...++.+..+.+.+. .+--.+++-. T Consensus 82 ~~~~~i~f~~W~~lgad~Lv~G~i~~~gg~~~v~~~L~DV~~~~~l~g~~~~~~----------------~~~~R~~aH~ 145 (433) T PRK00178 82 SQASEVIFRDWKALGAQYVMVGNIVPAGGRLQVQYALFNVATEQQVLTGSVSGS----------------TDQLRDMAHY 145 (433) T ss_pred CCCCCCCHHHHHHCCCCEEEEEEEEECCCEEEEEEEEEECCCCEEEECCEEEEC----------------HHHHHHHHHH T ss_conf 854575756877509878999879945997999999997145207603269957----------------8998999999 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780803|r 149 LSENIYIDII 158 (165) Q Consensus 149 lae~I~~~i~ 158 (165) +|+.|...|. T Consensus 146 iaD~Iy~~lT 155 (433) T PRK00178 146 IADQSFEKLT 155 (433) T ss_pred HHHHHHHHHH T ss_conf 9999999830 No 13 >PRK01622 OxaA-like protein precursor; Validated Probab=84.66 E-value=1.1 Score=23.78 Aligned_cols=27 Identities=19% Similarity=0.369 Sum_probs=20.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCE-EE Q ss_conf 9157889999999999998368514-33 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVY-PF 27 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~-Pl 27 (165) |+.+|.+-+...+...++|+||+.. |. T Consensus 2 ~k~~~~~l~~~~~~~~~~lsgc~~~~~~ 29 (266) T PRK01622 2 LKSYRAVLISLSLLLVFVLSGCSNAAPI 29 (266) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 5388999999999999999546899998 No 14 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=82.71 E-value=0.79 Score=24.71 Aligned_cols=20 Identities=25% Similarity=0.683 Sum_probs=15.9 Q ss_pred HHHHHHHHHHHHHHHCCCCE Q ss_conf 89999999999998368514 Q gi|254780803|r 6 ILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 6 i~k~l~~~~~~~~ls~CGf~ 25 (165) +++++++++++++|+|||.. T Consensus 1 ~~~~~~~~~~~~~l~sCs~~ 20 (235) T TIGR03302 1 LLLLILLLALLLLLAGCSSK 20 (235) T ss_pred CHHHHHHHHHHHHHHHCCCC T ss_conf 97999999999999873899 No 15 >COG5567 Predicted small periplasmic lipoprotein [Cell motility and secretion] Probab=81.88 E-value=1.1 Score=23.83 Aligned_cols=29 Identities=17% Similarity=0.583 Sum_probs=21.3 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCE-EEEECC Q ss_conf 57889999999999998368514-334157 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSVY-PFYYLK 31 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf~-PlY~~~ 31 (165) +|+.++-+..+..++.|+|||-+ |+|-.- T Consensus 1 mk~~~~s~~ala~l~sLA~CG~KGPLy~Pp 30 (58) T COG5567 1 MKNVFKSLLALATLFSLAGCGLKGPLYFPP 30 (58) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCCCH T ss_conf 904899999999999998526678762782 No 16 >TIGR02887 spore_ger_x_C germination protein, Ger(x)C family; InterPro: IPR008844 The GerAA, -AB, and -AC proteins of the Bacillus subtilis spore are required for the germination response to L-alanine as the sole germinant. Members of GerAC family are thought to be located in the inner spore membrane. Although the function of this family is unclear, they are likely to encode the components of the germination apparatus that respond directly to this germinant, mediating the spore's response .; GO: 0009847 spore germination, 0016020 membrane. Probab=79.82 E-value=1.1 Score=23.74 Aligned_cols=17 Identities=35% Similarity=0.727 Sum_probs=8.9 Q ss_pred HHHHHHHHHHHHHHHCC Q ss_conf 89999999999998368 Q gi|254780803|r 6 ILRILVVFNFFLLINSC 22 (165) Q Consensus 6 i~k~l~~~~~~~~ls~C 22 (165) +.+++++++.+++|+|| T Consensus 2 ~~~~~~~~~~~~lL~GC 18 (400) T TIGR02887 2 KLKILLLILALLLLTGC 18 (400) T ss_pred HHHHHHHHHHHHHHCCC T ss_conf 46678999999983266 No 17 >PRK01742 tolB translocation protein TolB; Provisional Probab=79.48 E-value=4.7 Score=19.96 Aligned_cols=140 Identities=9% Similarity=0.070 Sum_probs=60.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCEEEEEEEEECC------CCHHHHHHHHHHHHHCCCCCC-CCCEE Q ss_conf 91578899999999999983685143341577672022325784077------741448999999973147777-67168 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVYPFYYLKNQDRTEYIHPIKVLVV------SKNNKNEIYRSINFLTSTVRT-KNLYQ 73 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~PlY~~~~~~~~~~l~~I~i~~~------~~r~~~~i~n~L~~~~~~~~~-~~~y~ 73 (165) |++.|-+..+++++|.++++++...+|.-.=..+.. .--.|.|.+- ...+.++|++.|... |-... .+... T Consensus 1 m~m~~~~~~~~~~l~~~~~~~~a~a~L~I~It~G~~-~~~pIAi~pF~~~~~~~~~i~~vI~~DL~~S-G~F~~i~~~~~ 78 (430) T PRK01742 1 MKLLKRLVSVFAVLFAVISNAFAGDEVRIVIDEGVD-GARPIAVVPFKWNGSAPEDIAKIIAADLRNS-GKFNPIAVSQM 78 (430) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCC-CCCCEEEECCCCCCCCHHHHHHHHHHHHHCC-CCCCCCCHHHC T ss_conf 916999999999999998778505755999945767-7404599253678976678999998777608-47262596674 Q ss_pred EEE-----EECCCEEECCCCC----CCCCEEEEEEEEEEEEEECCC-----CCEEEEEEEEEEEEECCCCCHHHHHHHHH Q ss_conf 999-----7100000000246----765048999999999998058-----95877425799986067761577999999 Q gi|254780803|r 74 LEV-----NIDSFTDHAISNA----FFKNIGRITLKAKYYFKEISG-----KNILYENNTDVTSLFDFSDQQFSQLRSHK 139 (165) Q Consensus 74 l~i-----~~~~~~~~~i~~~----~~~~~~~~~~~~~y~L~~~~~-----~~~i~~~~~~~~~~Y~~~~~~f~~~~~e~ 139 (165) +.. .+....=..+..+ |..+...=.+.++|+|.|.-. ++++..+. |... . T Consensus 79 ~~~p~~~~~v~~~~W~~lga~~LV~G~v~~~~~~~~v~f~L~Dv~~~~~~~g~~l~~~~------~~~~----------~ 142 (430) T PRK01742 79 PQRPTSASEVNPEAWSNLGIDAVVVGQVTPSGNGYSIAYQLVDTVGASGTAGTVLMQNS------YTVT----------N 142 (430) T ss_pred CCCCCCCCCCCHHHHHHCCCCEEEEEEEEECCCEEEEEEEEEEEECCCCCCCEEEEEEE------EECC----------H T ss_conf 45898503349778874399999999899759917999999995101246652886416------6158----------8 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999 Q gi|254780803|r 140 SSEEKAIQELSENIYIDII 158 (165) Q Consensus 140 ~a~erl~~~lae~I~~~i~ 158 (165) +-..+++-.+|+.|+.+|. T Consensus 143 ~~~R~~aH~iaD~Iye~lT 161 (430) T PRK01742 143 KWLRYGAHTVSDEVFEKLT 161 (430) T ss_pred HHHHHHHHHHHHHHHHHHC T ss_conf 9989888999999999862 No 18 >PRK02889 tolB translocation protein TolB; Provisional Probab=79.39 E-value=4.8 Score=19.94 Aligned_cols=45 Identities=11% Similarity=0.206 Sum_probs=31.2 Q ss_pred EEEEEEEEEECCCCCEEEEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999998058958774257999860677615779999999999999999999999999 Q gi|254780803|r 98 TLKAKYYFKEISGKNILYENNTDVTSLFDFSDQQFSQLRSHKSSEEKAIQELSENIYIDII 158 (165) Q Consensus 98 ~~~~~y~L~~~~~~~~i~~~~~~~~~~Y~~~~~~f~~~~~e~~a~erl~~~lae~I~~~i~ 158 (165) .+.++|+|.|...++.+....+... .+--.+++-.+|++|+.+|. T Consensus 112 ~~~v~~~L~Dv~~~~~l~g~~~~~~----------------~~~~R~~aH~iaD~Iy~~lT 156 (430) T PRK02889 112 QYEVRFRLYDTVKQQSLGGLSLTVT----------------EDQLRATAHKIADYIYEKLT 156 (430) T ss_pred CEEEEEEEEECCCCCEEEEEEEEEC----------------HHHHHHHHHHHHHHHHHHHH T ss_conf 4899999997466618743389958----------------89989999999999999861 No 19 >PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional Probab=78.43 E-value=1.9 Score=22.44 Aligned_cols=31 Identities=23% Similarity=0.330 Sum_probs=23.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 91578899999999999983685143341577 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVYPFYYLKN 32 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~PlY~~~~ 32 (165) ++|.|.++.+.++...++|+||+.. +...++ T Consensus 4 ~~~~~~~~~l~l~~~~~lLsGC~~~-~LdPkG 34 (305) T PRK10525 4 RKYNKSLGWLSLFAGTVLLSGCNSA-LLDPKG 34 (305) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC-CCCCCC T ss_conf 6777789999999999986678763-289998 No 20 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=75.97 E-value=1.4 Score=23.12 Aligned_cols=19 Identities=26% Similarity=0.527 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 8999999999999836851 Q gi|254780803|r 6 ILRILVVFNFFLLINSCSV 24 (165) Q Consensus 6 i~k~l~~~~~~~~ls~CGf 24 (165) |+|+++++.+++++.+||= T Consensus 1 Mkk~~~~~~~~~~~~SCg~ 19 (449) T TIGR03525 1 MKKYLVFAALVVLVYSCGS 19 (449) T ss_pred CCHHHHHHHHHHHHHHCCC T ss_conf 9124899999998721058 No 21 >PRK11372 lysozyme inhibitor; Provisional Probab=75.78 E-value=2.1 Score=22.14 Aligned_cols=22 Identities=23% Similarity=0.555 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHHCCCC-EEE Q ss_conf 8999999999999836851-433 Q gi|254780803|r 6 ILRILVVFNFFLLINSCSV-YPF 27 (165) Q Consensus 6 i~k~l~~~~~~~~ls~CGf-~Pl 27 (165) .||.+++.+..++|+||++ +|+ T Consensus 2 ~MKkll~~~~~llLsGCs~~~~~ 24 (109) T PRK11372 2 SMKKLLIICLPVLLTGCSAYNQF 24 (109) T ss_pred CHHHHHHHHHHHHHHCCCCCCCC T ss_conf 54889999999997045432212 No 22 >PRK02463 OxaA-like protein precursor; Provisional Probab=74.13 E-value=3.2 Score=21.00 Aligned_cols=25 Identities=24% Similarity=0.304 Sum_probs=18.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 5788999999999999836851433 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSVYPF 27 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf~Pl 27 (165) +||+.-..+++.++++|+||+-.|. T Consensus 5 ~k~~~~~~~~~~~~l~LsgC~~~~~ 29 (307) T PRK02463 5 LKRILFSGLALSMLLTLTGCVGRDK 29 (307) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9999999999999999963468999 No 23 >TIGR03516 ppisom_GldI peptidyl-prolyl isomerase, gliding motility-associated. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldI is a FKBP-type peptidyl-prolyl cis-trans isomerase (pfam00254) linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockout of this gene abolishes the gliding phenotype. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. This family is only found in Bacteroidetes containing the suite of genes proposed to confer the gliding motility phenotype. Probab=73.42 E-value=4.9 Score=19.86 Aligned_cols=31 Identities=6% Similarity=0.242 Sum_probs=19.7 Q ss_pred EEEEEEEEEEECCCCCEEEEEEEEEEEEECCC Q ss_conf 99999999998058958774257999860677 Q gi|254780803|r 97 ITLKAKYYFKEISGKNILYENNTDVTSLFDFS 128 (165) Q Consensus 97 ~~~~~~y~L~~~~~~~~i~~~~~~~~~~Y~~~ 128 (165) =.++++|+..+.+ |+++++..-.+-.+|.+. T Consensus 90 d~V~~~Y~~~~ld-G~~~~ss~~~gp~~f~v~ 120 (177) T TIGR03516 90 DLVTFEYDIRALD-GDVIYSEEELGPQTYKVD 120 (177) T ss_pred CEEEEEEEEEECC-CCEEEEECCCCCEEEEEC T ss_conf 9899999999768-988874034686569855 No 24 >PRK02998 prsA peptidylprolyl isomerase; Reviewed Probab=71.55 E-value=4.1 Score=20.31 Aligned_cols=23 Identities=17% Similarity=0.348 Sum_probs=17.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCC Q ss_conf 91578899999999999983685 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCS 23 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CG 23 (165) |.-++++-..++++..+.|++|| T Consensus 1 Mkkkk~~~~~~~~~~~l~LaaC~ 23 (283) T PRK02998 1 MKKKKLFIGTIISCVVLALSACG 23 (283) T ss_pred CCHHHHHHHHHHHHHHHHHHHCC T ss_conf 93589999999999999998668 No 25 >PRK13684 Ycf48-like protein; Provisional Probab=71.20 E-value=6.9 Score=18.94 Aligned_cols=47 Identities=19% Similarity=0.268 Sum_probs=31.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCC--EEEEECC-----CCCCCEEEEEEEEECC Q ss_conf 915788999999999999836851--4334157-----7672022325784077 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSV--YPFYYLK-----NQDRTEYIHPIKVLVV 47 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf--~PlY~~~-----~~~~~~~l~~I~i~~~ 47 (165) ++++...++++++++.++|++|.- -|+-... .......+..|.+..+ T Consensus 3 ~~~~~~~~l~l~~~~~~~~~~c~~~~~p~~~~~pw~~i~l~t~~~~~dI~F~d~ 56 (333) T PRK13684 3 RLLKSLLNLLLLLALGLVLSGCSTTRVPMASSSPWQVIDLPTEANPLDIAFTDP 56 (333) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEECCCCCCEEEEEEECC T ss_conf 688889999999999986322356677655789876770577782543798469 No 26 >PRK00249 flgH flagellar basal body L-ring protein; Reviewed Probab=71.09 E-value=2.5 Score=21.63 Aligned_cols=21 Identities=19% Similarity=0.528 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCE Q ss_conf 889999999999998368514 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf~ 25 (165) +++++.+++.++++|+||.-. T Consensus 2 ~~~~~~~~~~~~~~LsgCa~~ 22 (231) T PRK00249 2 AMMLIALALLLLLLLSGCASI 22 (231) T ss_pred CCHHHHHHHHHHHHHHHCCCC T ss_conf 338999999999998753489 No 27 >PRK12696 flgH flagellar basal body L-ring protein; Reviewed Probab=70.75 E-value=3.1 Score=21.06 Aligned_cols=20 Identities=25% Similarity=0.461 Sum_probs=15.2 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 88999999999999836851 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSV 24 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf 24 (165) |+.|++++++.+++|+||.- T Consensus 1 mm~~~l~~~~~~~~L~GC~~ 20 (238) T PRK12696 1 MIRKLLAASCAVLLLSGCNA 20 (238) T ss_pred CHHHHHHHHHHHHHHHCCCC T ss_conf 95899999999999516568 No 28 >PRK11023 hypothetical protein; Provisional Probab=68.55 E-value=3.8 Score=20.56 Aligned_cols=19 Identities=32% Similarity=0.476 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 8899999999999983685 Q gi|254780803|r 5 RILRILVVFNFFLLINSCS 23 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CG 23 (165) |.++.+++++..++|+||- T Consensus 2 k~~~~l~~l~~~~~L~GC~ 20 (191) T PRK11023 2 KALSPLAVLISALLLQGCV 20 (191) T ss_pred CHHHHHHHHHHHHHHCCCH T ss_conf 4488999999999972682 No 29 >TIGR03524 GldJ gliding motility-associated lipoprotein GldJ. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldJ is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldJ abolish the gliding phenotype. GldJ is homologous to GldK. There is a GldJ homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=68.34 E-value=5.2 Score=19.73 Aligned_cols=25 Identities=24% Similarity=0.357 Sum_probs=21.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCE Q ss_conf 9157889999999999998368514 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~ 25 (165) |-++-++|+++++++.++|.+|+=+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~sC~~~ 25 (559) T TIGR03524 1 VNMKNVFKITFIVFVSLLFVSCKKK 25 (559) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 9467889999999999997642588 No 30 >PRK12788 flgH flagellar basal body L-ring protein; Reviewed Probab=68.03 E-value=2.7 Score=21.48 Aligned_cols=18 Identities=28% Similarity=0.534 Sum_probs=14.9 Q ss_pred HHHHHHHHHHHHHHCCCC Q ss_conf 999999999999836851 Q gi|254780803|r 7 LRILVVFNFFLLINSCSV 24 (165) Q Consensus 7 ~k~l~~~~~~~~ls~CGf 24 (165) ||+++.+..+++|+||.- T Consensus 1 mr~l~~~~a~l~LsGCa~ 18 (231) T PRK12788 1 MRLLVAILACLALAGCAN 18 (231) T ss_pred CCHHHHHHHHHHHHHHCC T ss_conf 908999999999764038 No 31 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=67.34 E-value=3.9 Score=20.47 Aligned_cols=22 Identities=27% Similarity=0.469 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCE Q ss_conf 7889999999999998368514 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~ 25 (165) ||++..++++.+.++++|||=+ T Consensus 1 Kk~~~~~~~~~~~~~l~~C~~~ 22 (289) T TIGR03659 1 KKILSLILLALLSLGLTGCSSS 22 (289) T ss_pred CCHHHHHHHHHHHHHHHHCCCC T ss_conf 9129999999999998432899 No 32 >COG3017 LolB Outer membrane lipoprotein involved in outer membrane biogenesis [Cell envelope biogenesis, outer membrane] Probab=66.89 E-value=5.4 Score=19.61 Aligned_cols=23 Identities=26% Similarity=0.379 Sum_probs=18.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCC Q ss_conf 91578899999999999983685 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCS 23 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CG 23 (165) |.+.|.++.++..+..++|+||. T Consensus 1 ~~~~~~~~~~l~~~As~LL~aC~ 23 (206) T COG3017 1 MPMMKRLLFLLLALASLLLTACT 23 (206) T ss_pred CCHHHHHHHHHHHHHHHHHHHCC T ss_conf 95277787899999999997565 No 33 >PRK10533 putative lipoprotein; Provisional Probab=66.71 E-value=4.5 Score=20.10 Aligned_cols=25 Identities=24% Similarity=0.552 Sum_probs=19.1 Q ss_pred HHHHHHHHHHHHHCCC-CEEEEECCC Q ss_conf 9999999999983685-143341577 Q gi|254780803|r 8 RILVVFNFFLLINSCS-VYPFYYLKN 32 (165) Q Consensus 8 k~l~~~~~~~~ls~CG-f~PlY~~~~ 32 (165) ++.+++..-++|++|+ .+|.|...+ T Consensus 5 ~~~ll~PlallLSACttV~Pa~kd~g 30 (171) T PRK10533 5 KLALLLPCALLLSACTTVTPAYKDNG 30 (171) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHCC T ss_conf 03899999999987224761143037 No 34 >pfam01514 YscJ_FliF Secretory protein of YscJ/FliF family. This family includes proteins that are related to the YscJ lipoprotein, and the amino terminus of FliF, the flageller M-ring protein. The members of the YscJ family are thought to be involved in secretion of several proteins. The FliF protein ring is thought to be part of the export apparatus for flageller proteins, based on the similarity to YscJ proteins. Probab=66.51 E-value=6.8 Score=19.00 Aligned_cols=26 Identities=19% Similarity=0.466 Sum_probs=18.5 Q ss_pred HHHHHHHHHHHHHHCC-CCEEEEECCC Q ss_conf 9999999999998368-5143341577 Q gi|254780803|r 7 LRILVVFNFFLLINSC-SVYPFYYLKN 32 (165) Q Consensus 7 ~k~l~~~~~~~~ls~C-Gf~PlY~~~~ 32 (165) +-++.++++++++.+| .|.|+|+.-+ T Consensus 9 ~~~i~~l~~l~~~~~~p~y~~Ly~~L~ 35 (206) T pfam01514 9 LAVVALLLLLLLLAGCPDYVTLYTGLS 35 (206) T ss_pred HHHHHHHHHHHHHHCCCCCEECCCCCC T ss_conf 999999999999847998561115999 No 35 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=66.36 E-value=5.5 Score=19.57 Aligned_cols=22 Identities=18% Similarity=0.262 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 5788999999999999836851 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSV 24 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf 24 (165) +++++..+..++..+.|+|||= T Consensus 5 MKK~~l~~~~~~~~l~LaaCss 26 (298) T PRK04405 5 MKKWALAAASAGLLLSLAGCSS 26 (298) T ss_pred HHHHHHHHHHHHHHHHHHHCCC T ss_conf 8999999999999999987179 No 36 >PRK12450 foldase protein PrsA; Reviewed Probab=65.89 E-value=5.6 Score=19.49 Aligned_cols=22 Identities=9% Similarity=0.380 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 5788999999999999836851 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSV 24 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf 24 (165) +++++--+..++-.++|+|||= T Consensus 4 mKK~~~~~~~~~svl~LaAC~s 25 (309) T PRK12450 4 MNKLITGVVTLATVVTLSACQS 25 (309) T ss_pred HHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999999986058 No 37 >PRK09837 copper/silver efflux system outer membrane protein CusC; Provisional Probab=64.51 E-value=4.9 Score=19.84 Aligned_cols=21 Identities=29% Similarity=0.464 Sum_probs=14.6 Q ss_pred HHHHHHHHHHCCCCEEEEECC Q ss_conf 999999998368514334157 Q gi|254780803|r 11 VVFNFFLLINSCSVYPFYYLK 31 (165) Q Consensus 11 ~~~~~~~~ls~CGf~PlY~~~ 31 (165) +.++..++|+||...|-|... T Consensus 7 l~l~~~l~L~gC~~~P~y~~P 27 (460) T PRK09837 7 LPFCVALALTGCSLAPDYQRP 27 (460) T ss_pred HHHHHHHHHCCCCCCCCCCCC T ss_conf 999999998247789999996 No 38 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=64.21 E-value=5.6 Score=19.50 Aligned_cols=23 Identities=26% Similarity=0.508 Sum_probs=15.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 915788999999999999836851 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSV 24 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf 24 (165) |..+.++-...+++. ++|+|||- T Consensus 1 mkkK~i~~~~~~~sv-l~LaaC~~ 23 (285) T PRK03002 1 MRGKHIFIITALISI-LMLSACGQ 23 (285) T ss_pred CCHHHHHHHHHHHHH-HHHHHHCC T ss_conf 960189999999999-99998457 No 39 >PRK02944 OxaA-like protein precursor; Validated Probab=63.36 E-value=7.2 Score=18.86 Aligned_cols=22 Identities=18% Similarity=0.400 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 5788999999999999836851 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSV 24 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf 24 (165) .||..-...++...++|+||+. T Consensus 2 kk~~~~~~~~~~~~~~lsgC~~ 23 (255) T PRK02944 2 KKKLGLLAMVVALMAILAGCSE 23 (255) T ss_pred CHHHHHHHHHHHHHHHHHCCCC T ss_conf 2899999999999999962468 No 40 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=62.56 E-value=4.6 Score=20.04 Aligned_cols=28 Identities=4% Similarity=0.098 Sum_probs=15.0 Q ss_pred EEEEEEEEEEEEEC-CCCCEEEEEEEEEE Q ss_conf 89999999999980-58958774257999 Q gi|254780803|r 95 GRITLKAKYYFKEI-SGKNILYENNTDVT 122 (165) Q Consensus 95 ~~~~~~~~y~L~~~-~~~~~i~~~~~~~~ 122 (165) ..++.++..++++. .+|+.+.+|.-... T Consensus 133 ~~ltgtIta~V~~VlpNGNL~I~G~k~i~ 161 (224) T PRK12698 133 NSLSGSISANVMQVLPNGNLVIRGEKWIS 161 (224) T ss_pred CEEEEEEEEEEEEECCCCCEEEEEEEEEE T ss_conf 24899999999998669809999989999 No 41 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=62.22 E-value=7.8 Score=18.63 Aligned_cols=20 Identities=15% Similarity=0.248 Sum_probs=13.0 Q ss_pred HHHHHHHHHHHHHHHHHCCC Q ss_conf 78899999999999983685 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCS 23 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CG 23 (165) ||..-.++++...++|+||| T Consensus 5 ~r~~~~l~~l~~~~~L~gC~ 24 (385) T PRK09578 5 RRRRLALAALVAAFALAGCG 24 (385) T ss_pred HHHHHHHHHHHHHHHHHCCC T ss_conf 87999999999999981679 No 42 >TIGR03511 GldH_lipo gliding motility-associated lipoprotein GldH. Members of this protein family are predicted lipoproteins, exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). Members include GldH, a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family may have gliding motility. Probab=62.08 E-value=12 Score=17.52 Aligned_cols=99 Identities=11% Similarity=0.029 Sum_probs=44.4 Q ss_pred HHHHHHH-HHHHHHHHHCCCCE-EEEECCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCE Q ss_conf 8899999-99999998368514-334157767202232578407774144899999997314777767168999710000 Q gi|254780803|r 5 RILRILV-VFNFFLLINSCSVY-PFYYLKNQDRTEYIHPIKVLVVSKNNKNEIYRSINFLTSTVRTKNLYQLEVNIDSFT 82 (165) Q Consensus 5 ri~k~l~-~~~~~~~ls~CGf~-PlY~~~~~~~~~~l~~I~i~~~~~r~~~~i~n~L~~~~~~~~~~~~y~l~i~~~~~~ 82 (165) +++|..+ ++.+.+++.||+-. ++|-. ...+.. .|..-.+.+.+.+...+...+|.+.+.+..+. T Consensus 2 ~~~~~~i~~~l~~~ll~SC~~~~~vy~~----------y~~~~~----~gW~~~d~l~F~~~~~d~~~~y~l~l~lR~~~ 67 (156) T TIGR03511 2 SLVRNSISFFLGACVLVSCTENTDVYHS----------YQSTPH----GGWQKSDTLDFEIPITDYTASYRLFLLIRNDN 67 (156) T ss_pred CHHHHHHHHHHHHHHEEEECCCCCEEEE----------EEECCC----CCCCCCCCEEEEEECCCCCCCEEEEEEEECCC T ss_conf 0685449999999970763899633567----------679775----78888997899976177875089999997589 Q ss_pred ----EEC--C-CCCCCCCEEEEEEEEEEEEEECCCCCEEEEEE Q ss_conf ----000--0-24676504899999999999805895877425 Q gi|254780803|r 83 ----DHA--I-SNAFFKNIGRITLKAKYYFKEISGKNILYENN 118 (165) Q Consensus 83 ----~~~--i-~~~~~~~~~~~~~~~~y~L~~~~~~~~i~~~~ 118 (165) .++ + ............=|++|.|-| .+|+-+-+|. T Consensus 68 ~YpysNi~l~~~~~~~~~~~i~~DTle~~Lad-~~G~w~G~G~ 109 (156) T TIGR03511 68 RYPYRNLWALIHENLEDSTVIFTDTVDLTLAD-STGQWTGKGW 109 (156) T ss_pred CCCEEEEEEEEEEECCCCCEEEEEEEEEEEEC-CCCCEEECCC T ss_conf 75300079999997189977888616899788-9998874450 No 43 >PRK10175 hypothetical protein; Provisional Probab=61.89 E-value=2.9 Score=21.22 Aligned_cols=18 Identities=28% Similarity=0.624 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHHCCCC Q ss_conf 999999999999836851 Q gi|254780803|r 7 LRILVVFNFFLLINSCSV 24 (165) Q Consensus 7 ~k~l~~~~~~~~ls~CGf 24 (165) ++++++.++.++|+|||= T Consensus 1 ~r~i~~~~~~lllsGC~S 18 (75) T PRK10175 1 MRLIVVSIMVTLLSGCGS 18 (75) T ss_pred CEEHHHHHHHHHHCCCHH T ss_conf 934488889999816424 No 44 >PRK03629 tolB translocation protein TolB; Provisional Probab=60.99 E-value=12 Score=17.40 Aligned_cols=136 Identities=13% Similarity=0.065 Sum_probs=60.6 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCEEEEEEEEECC--------CCHHHHHHHHHHHHHCCCCCCC-CCEE Q ss_conf 578899999999999983685143341577672022325784077--------7414489999999731477776-7168 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSVYPFYYLKNQDRTEYIHPIKVLVV--------SKNNKNEIYRSINFLTSTVRTK-NLYQ 73 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf~PlY~~~~~~~~~~l~~I~i~~~--------~~r~~~~i~n~L~~~~~~~~~~-~~y~ 73 (165) ||+++++++++.+|+ +++.+-.++-+-..+.. .--.|.|.+- ...+.++|++.|.+- |....- +... T Consensus 1 mk~~~~~~f~~lll~--~~~a~A~L~I~It~G~~-~~~PIAV~pF~~~g~~~~~~~i~~II~~DL~rS-G~F~~l~~~~~ 76 (430) T PRK03629 1 MKQALRVAFGFLILW--ASVLHAEVRIEITQGVD-SARPIGVVPFKWAGPGAAPEDIGGIVAADLRNS-GKFNPLDRARL 76 (430) T ss_pred CCHHHHHHHHHHHHH--HHHHCEEEEEEEECCCC-CCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHC-CCCEECCHHHC T ss_conf 901488899899987--54422379999934667-732569928644788864678999998768757-78150396785 Q ss_pred EEE-----EECCCEEECCCCC----CCCCE-EEEEEEEEEEEEECCC--CCEEEEEEEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 999-----7100000000246----76504-8999999999998058--9587742579998606776157799999999 Q gi|254780803|r 74 LEV-----NIDSFTDHAISNA----FFKNI-GRITLKAKYYFKEISG--KNILYENNTDVTSLFDFSDQQFSQLRSHKSS 141 (165) Q Consensus 74 l~i-----~~~~~~~~~i~~~----~~~~~-~~~~~~~~y~L~~~~~--~~~i~~~~~~~~~~Y~~~~~~f~~~~~e~~a 141 (165) +.. .+....=..+..+ |..+. ..=.+.++|+|.|... ++++..+.+.. .. +. T Consensus 77 ~~~p~~~~~v~~~dWr~lgad~LV~G~v~~~~~g~~~v~f~L~DV~~~~g~~l~~~~~~~------~~----------~~ 140 (430) T PRK03629 77 PQQPGTAQEVQPAAWSALGIDAVVVGQVTPNPDGSYNVAYQLVDTGGAPGTVLAQNSYKV------NK----------QW 140 (430) T ss_pred CCCCCCCCCCCHHHHHHCCCCEEEEEEEEECCCCEEEEEEEEEEEECCCCEEEEEEEEEC------CH----------HH T ss_conf 458984134585577760898899999997689619999999994166233864458845------87----------99 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254780803|r 142 EEKAIQELSENIYIDII 158 (165) Q Consensus 142 ~erl~~~lae~I~~~i~ 158 (165) -.+++-.+|+.|+.+|. T Consensus 141 lR~~aH~isD~Iy~~lT 157 (430) T PRK03629 141 LRYAGHTASDEVFEKLT 157 (430) T ss_pred HHHHHHHHHHHHHHHHC T ss_conf 99999999999999851 No 45 >pfam03304 Mlp Mlp lipoprotein family. The Mlp (for Multicopy Lipoprotein) family of lipoproteins is found in Borrelia species. This family were previously known as 2.9 lipoprotein genes. These surface expressed genes may represent new candidate vaccinogens for Lyme disease. Members of this family generally are downstream of four ORFs called A,B,C and D that are involved in hemolytic activity. Probab=60.85 E-value=4 Score=20.37 Aligned_cols=19 Identities=37% Similarity=0.948 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 88999999999999836851 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSV 24 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf 24 (165) ||+-|+|++ ||++|+||.- T Consensus 2 KIINilfcl-fll~Ln~Cns 20 (150) T pfam03304 2 KIINILFCL-FLLMLNGCNS 20 (150) T ss_pred CEEHHHHHH-HHHHHCCCCC T ss_conf 431499999-9999826567 No 46 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=59.06 E-value=8.6 Score=18.38 Aligned_cols=25 Identities=16% Similarity=0.211 Sum_probs=12.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEE Q ss_conf 91578899999999999983685143 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVYP 26 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~P 26 (165) |--||.+- +.+++..++|+||+=+| T Consensus 1 m~~k~~~l-i~ll~~~~lL~gC~~~~ 25 (385) T PRK09859 1 MNRRRKLL-IPLLFCGAMLTACDDKS 25 (385) T ss_pred CCCHHHHH-HHHHHHHHHHHCCCCCC T ss_conf 98206789-99999999995379997 No 47 >PRK13616 lipoprotein LpqB; Provisional Probab=59.04 E-value=9 Score=18.25 Aligned_cols=29 Identities=21% Similarity=0.393 Sum_probs=24.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 91578899999999999983685143341 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVYPFYY 29 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~PlY~ 29 (165) |++.|+.....+++..++++||+==|-=+ T Consensus 1 ~~~~~~~~~~~~~~~~~llaGCaslP~ss 29 (590) T PRK13616 1 KLMTRLKALAALLAVALLVAGCASLPSSS 29 (590) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 98899999999999999861203689999 No 48 >TIGR02747 TraV type IV conjugative transfer system protein TraV; InterPro: IPR014118 This entry includes TraV, which is a component of conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK , , . The alignment contains three conserved cysteines in the N-terminal half.. Probab=58.85 E-value=6.3 Score=19.19 Aligned_cols=21 Identities=24% Similarity=0.616 Sum_probs=12.1 Q ss_pred HHHHHHHHH-HHHHHCCC-CEEE Q ss_conf 999999999-99983685-1433 Q gi|254780803|r 7 LRILVVFNF-FLLINSCS-VYPF 27 (165) Q Consensus 7 ~k~l~~~~~-~~~ls~CG-f~Pl 27 (165) ++.|+++++ .|+|+||+ -.++ T Consensus 4 ~~~L~~~~~~~f~LtGCsa~~g~ 26 (174) T TIGR02747 4 LKVLLLLACVAFLLTGCSASLGL 26 (174) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC T ss_conf 78999999998720025774565 No 49 >PRK11459 multidrug resistance outer membrane protein MdtQ; Provisional Probab=58.26 E-value=7.8 Score=18.62 Aligned_cols=30 Identities=13% Similarity=0.194 Sum_probs=24.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 915788999999999999836851433415 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVYPFYYL 30 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~PlY~~ 30 (165) |--+++......+.+.++|+||.--|-|.. T Consensus 1 ~~~~~f~~~~a~l~~~llLaGCa~~~~~~~ 30 (478) T PRK11459 1 MNRDSFYAATASLPLFILLAGCAPMHETRQ 30 (478) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 997405888999999999837889999999 No 50 >PRK09915 putative outer membrane efflux protein MdtP; Provisional Probab=58.24 E-value=9.1 Score=18.23 Aligned_cols=31 Identities=19% Similarity=0.310 Sum_probs=20.7 Q ss_pred CCHHHHHHHHHHHHH--HHHHHCCCCEEEEECC Q ss_conf 915788999999999--9998368514334157 Q gi|254780803|r 1 MLYKRILRILVVFNF--FLLINSCSVYPFYYLK 31 (165) Q Consensus 1 ml~~ri~k~l~~~~~--~~~ls~CGf~PlY~~~ 31 (165) |.-+.|.+++++..+ .++|+||..-|-|... T Consensus 1 ~~~~~~~r~~~~~~l~~~~lLagC~~~p~~~~p 33 (488) T PRK09915 1 MINRQLSRLLLCSILGSTTLISGCALVRKDSAP 33 (488) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 911577699999999999997526689999999 No 51 >pfam04507 DUF576 Protein of unknown function, DUF576. This family contains several uncharacterized staphylococcal proteins. Probab=57.38 E-value=5.3 Score=19.63 Aligned_cols=21 Identities=14% Similarity=0.488 Sum_probs=11.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCE Q ss_conf 889999999999998368514 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf~ 25 (165) |+...+.++.+.++++|||+. T Consensus 6 k~~L~is~liliifI~Gcg~~ 26 (257) T pfam04507 6 KLALYISVLILIIFISGCGFM 26 (257) T ss_pred EEHHHHHHHHHHHHHEECCCC T ss_conf 003678899786520002235 No 52 >PRK04792 tolB translocation protein TolB; Provisional Probab=56.19 E-value=15 Score=16.90 Aligned_cols=144 Identities=8% Similarity=0.095 Sum_probs=59.9 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCEEEEEEEEECC--------CCHHHHHHHHHHHHHCCCCCC-CCCEEEEEE----- Q ss_conf 999999983685143341577672022325784077--------741448999999973147777-671689997----- Q gi|254780803|r 12 VFNFFLLINSCSVYPFYYLKNQDRTEYIHPIKVLVV--------SKNNKNEIYRSINFLTSTVRT-KNLYQLEVN----- 77 (165) Q Consensus 12 ~~~~~~~ls~CGf~PlY~~~~~~~~~~l~~I~i~~~--------~~r~~~~i~n~L~~~~~~~~~-~~~y~l~i~----- 77 (165) ++.+++++++|.+..|.-+=..+.. .--.|.|.+- ...+.++|++.|.+ .|-... .+...+..- T Consensus 9 l~lll~~~s~~A~A~L~IeIt~G~~-~~~PIAV~pF~~~g~~~~~~~i~~II~~DL~r-SG~F~~l~~~~~~~~P~~~~~ 86 (450) T PRK04792 9 LFVLLLSFSQVANAALELVITDGID-SARPIAIVPFKWEGEGKLPTDISAVIASDLQR-SGKFSPVPTSKMPQTPYSESE 86 (450) T ss_pred HHHHHHHHHHHHEEEEEEEEECCCC-CCCCEEEECCCCCCCCCHHHHHHHHHHHHHHH-CCCCEECCHHHCCCCCCCCCC T ss_conf 9999998657520249999954777-74254993754568743257999999988760-878552596674458885478 Q ss_pred ECCCEEECCCCC----CCCCE-EEEEEEEEEEEEECCCCCEEEEEEEEEEE---EECCCCCH--HHHH-HHHHHHHHHHH Q ss_conf 100000000246----76504-89999999999980589587742579998---60677615--7799-99999999999 Q gi|254780803|r 78 IDSFTDHAISNA----FFKNI-GRITLKAKYYFKEISGKNILYENNTDVTS---LFDFSDQQ--FSQL-RSHKSSEEKAI 146 (165) Q Consensus 78 ~~~~~~~~i~~~----~~~~~-~~~~~~~~y~L~~~~~~~~i~~~~~~~~~---~Y~~~~~~--f~~~-~~e~~a~erl~ 146 (165) +....=..+..+ |..+. ..=.+.|+|+|.|...++. ..|...+.+ .+-.+.++ +... .....--.+++ T Consensus 87 v~f~dWr~lgad~LV~G~v~~~~~g~~~V~f~L~DV~~g~~-~~g~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~R~~a 165 (450) T PRK04792 87 INFDAWTKMGVDALLTGSVTQNAEGKYVVNYQLVDVVRGQL-TKGQSKALSDDGQLVLSKDHVLFNKVATVSAPRMREYA 165 (450) T ss_pred CCHHHHHHCCCCEEEEEEEEECCCCEEEEEEEEEECCCCCC-CCCCCCCCCCCCCCCCCCCCEECCCEECCCHHHHHHHH T ss_conf 37789764389889999999889974999999998016510-45532122444321123210000202104778999999 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|254780803|r 147 QELSENIYIDII 158 (165) Q Consensus 147 ~~lae~I~~~i~ 158 (165) -.+|+.|+.+|. T Consensus 166 H~isD~Iy~~lT 177 (450) T PRK04792 166 HRISDLVYEQLT 177 (450) T ss_pred HHHHHHHHHHHC T ss_conf 999999999970 No 53 >TIGR01074 rep ATP-dependent DNA helicase Rep; InterPro: IPR005752 RepA hexameric DNA helicase contain ATP-binding domains similar to those seen in monomeric helicases but which are arranged in a ring. There is compelling evidence to suggest that a single ssDNA molecule passes through the centre of the hexameric ring. Activity of the enzyme is based upon two separate but coupled activities, ssDNA translocation and duplex destabilisation, and is driven by energy derived from the continuous ATP-binding and hydrolysis events that take place in the active-site cleft. The resulting conformational changes that accompany these events underpin the coupling process and allow the helicase to translocate along the DNA, destabilizing the duplex and separating the two strands in an active process ; GO: 0004003 ATP-dependent DNA helicase activity, 0006268 DNA unwinding during replication, 0005737 cytoplasm. Probab=55.91 E-value=15 Score=16.87 Aligned_cols=75 Identities=13% Similarity=0.179 Sum_probs=47.2 Q ss_pred HHHHCCCCEEEEECCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCE-EECCCCCCCCCEE Q ss_conf 998368514334157767202232578407774144899999997314777767168999710000-0000246765048 Q gi|254780803|r 17 LLINSCSVYPFYYLKNQDRTEYIHPIKVLVVSKNNKNEIYRSINFLTSTVRTKNLYQLEVNIDSFT-DHAISNAFFKNIG 95 (165) Q Consensus 17 ~~ls~CGf~PlY~~~~~~~~~~l~~I~i~~~~~r~~~~i~n~L~~~~~~~~~~~~y~l~i~~~~~~-~~~i~~~~~~~~~ 95 (165) -|+.+|||+|=+ +.-|. -+|+...++++++-..++....++ |-+..-.+- =.++... .. T Consensus 39 yLi~~cgY~a~~----------IaAvT---FTNKAA~EMkERVA~~L~~~~~~G---L~isTFH~LGL~Ii~~E----~~ 98 (677) T TIGR01074 39 YLIQNCGYKAKN----------IAAVT---FTNKAAREMKERVAKTLGKGQAKG---LTISTFHTLGLKIIRRE----HN 98 (677) T ss_pred HHHHHCCCCCCC----------EEEEE---CCHHHHHHHHHHHHHHCCCCCCCC---CEEECCHHHHHHHHHHH----HH T ss_conf 987515878761----------68973---523777999999985226545585---44752057338999999----98 Q ss_pred EEEEEEEEEEEECCCC Q ss_conf 9999999999980589 Q gi|254780803|r 96 RITLKAKYYFKEISGK 111 (165) Q Consensus 96 ~~~~~~~y~L~~~~~~ 111 (165) .+.+..+|+|.|..|. T Consensus 99 ~lG~K~nFSlFD~~D~ 114 (677) T TIGR01074 99 ALGLKSNFSLFDETDQ 114 (677) T ss_pred HCCCCCCCCCCCHHHH T ss_conf 6488999642067889 No 54 >COG3056 Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=55.08 E-value=16 Score=16.79 Aligned_cols=127 Identities=10% Similarity=0.123 Sum_probs=58.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECC-------CCCCCEEEEEEEEECCC----------------------CHHHHH Q ss_conf 7889999999999998368514334157-------76720223257840777----------------------414489 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVYPFYYLK-------NQDRTEYIHPIKVLVVS----------------------KNNKNE 54 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~PlY~~~-------~~~~~~~l~~I~i~~~~----------------------~r~~~~ 54 (165) -+.+|+++++...++|+||.-.|---.. +++-.-.-..+.|...+ -...+. T Consensus 13 ~~t~k~L~~laa~~lLagC~a~~~tl~~tpp~~~~sqn~v~~~~~vsit~kD~R~~q~vA~v~~~~~lv~L~as~~vr~l 92 (204) T COG3056 13 NMTKKILFPLAAIFLLAGCAAPPTTLIVTPPAPLPSQNPVLMGVTVSITGKDQRSDQAVAKVTRDNQLVKLTASRDVRFL 92 (204) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCCHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEECCCCCHHHH T ss_conf 88999999999999997537997503136997664437233141899970550166788762047978983478139999 Q ss_pred HHHHHHHHCCCCC----CCCCEEEEEEECCCEEECCCCCCCCCEEEEEEEEEEEEEEC-CCCCEEEE--EEEEEEEEECC Q ss_conf 9999997314777----76716899971000000002467650489999999999980-58958774--25799986067 Q gi|254780803|r 55 IYRSINFLTSTVR----TKNLYQLEVNIDSFTDHAISNAFFKNIGRITLKAKYYFKEI-SGKNILYE--NNTDVTSLFDF 127 (165) Q Consensus 55 i~n~L~~~~~~~~----~~~~y~l~i~~~~~~~~~i~~~~~~~~~~~~~~~~y~L~~~-~~~~~i~~--~~~~~~~~Y~~ 127 (165) +.+-|...+++.+ ....-.+.+.+++-. .+..-..-||++...+..++... ..|+.+++ ++.+.+..++. T Consensus 93 lq~vl~~q~~srGfrig~n~~~n~~i~V~qly---a~V~qgnlryni~tkv~i~I~A~~~~Gk~~Kny~at~~~~GafsA 169 (204) T COG3056 93 LQEVLEKQMTSRGFRVGPNGEVNLQITVDQLY---ADVEQGNLRYNIDTKVDITITATAANGKMVKNYRATYNVEGAFSA 169 (204) T ss_pred HHHHHHHHHHHCCEEECCCCCCEEEEEHHHHH---HHHHHCCEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEEEEEEC T ss_conf 99999998653436753689831899849966---355633123302545889999973787155324556666655405 Q ss_pred CCCHHH Q ss_conf 761577 Q gi|254780803|r 128 SDQQFS 133 (165) Q Consensus 128 ~~~~f~ 133 (165) +.++.+ T Consensus 170 sN~eI~ 175 (204) T COG3056 170 SNDEIA 175 (204) T ss_pred CCHHHH T ss_conf 886799 No 55 >TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PMID:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al. Probab=55.04 E-value=3.7 Score=20.58 Aligned_cols=21 Identities=33% Similarity=0.483 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHCCCCEEEE Q ss_conf 999999999998368514334 Q gi|254780803|r 8 RILVVFNFFLLINSCSVYPFY 28 (165) Q Consensus 8 k~l~~~~~~~~ls~CGf~PlY 28 (165) +..+++...++|+|||-.|-+ T Consensus 2 r~~~~l~~~lllagC~s~~~~ 22 (146) T TIGR03352 2 RFAVLLAACLLLAGCSSAPPP 22 (146) T ss_pred CEEHHHHHHHHHHHCCCCCCC T ss_conf 075589999998532599998 No 56 >PRK04043 tolB translocation protein TolB; Provisional Probab=54.94 E-value=16 Score=16.77 Aligned_cols=46 Identities=9% Similarity=0.067 Sum_probs=31.1 Q ss_pred EEEEEEEEEECCCCCEEEEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999998058958774257999860677615779999999999999999999999999 Q gi|254780803|r 98 TLKAKYYFKEISGKNILYENNTDVTSLFDFSDQQFSQLRSHKSSEEKAIQELSENIYIDII 158 (165) Q Consensus 98 ~~~~~y~L~~~~~~~~i~~~~~~~~~~Y~~~~~~f~~~~~e~~a~erl~~~lae~I~~~i~ 158 (165) .+.++|+|.|...++.+..+.+.... .+...+++-.+|++|..+|. T Consensus 102 ~~~v~~~L~Dv~~~~~l~~~~~~~~~---------------~~~~r~iAH~iaD~Iye~lt 147 (419) T PRK04043 102 KLSLRIKLYDANTGSLKFEKDYTINS---------------LDRYPFLAHKSAIDINNYLK 147 (419) T ss_pred EEEEEEEEEECCCCCEEEEEEEEECC---------------HHHHHHHHHHHHHHHHHHHC T ss_conf 69999999986477196540797065---------------45557789999999999855 No 57 >TIGR01433 CyoA ubiquinol oxidase, subunit II; InterPro: IPR006333 Members of this domain catalyze the oxidation of ubiquinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. Subunit II is responsible for binding and oxidation of the ubiquinone substrate. This sequence is closely related to QoxA, which oxidizes quinol in Gram-positive bacteria but which is in complex with subunits which utilise cytochrome a in the reduction of molecular oxygen. Slightly more distantly related is subunit II of cytochrome c oxidase which uses cytochrome c as the oxidant . ; GO: 0016682 oxidoreductase activity acting on diphenols and related substances as donors oxygen as acceptor, 0006118 electron transport, 0016020 membrane. Probab=54.14 E-value=7 Score=18.90 Aligned_cols=22 Identities=18% Similarity=0.224 Sum_probs=16.1 Q ss_pred HHHHHHHHHHCCCCEEEEECCC Q ss_conf 9999999983685143341577 Q gi|254780803|r 11 VVFNFFLLINSCSVYPFYYLKN 32 (165) Q Consensus 11 ~~~~~~~~ls~CGf~PlY~~~~ 32 (165) .++..++||+||.=+++-..++ T Consensus 2 ~~~~~~~lLsGCNn~~LL~PkG 23 (228) T TIGR01433 2 SLILASLLLSGCNNLVLLDPKG 23 (228) T ss_pred HHHHHHHHHHCCCCCCCCCCCC T ss_conf 5778999861256765538888 No 58 >COG4594 FecB ABC-type Fe3+-citrate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=53.82 E-value=11 Score=17.72 Aligned_cols=25 Identities=32% Similarity=0.526 Sum_probs=18.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCE Q ss_conf 9157889999999999998368514 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~ 25 (165) |-+++..-+.+++..+++.+|||-+ T Consensus 1 ~~~~~~~~i~~lll~lllva~C~~s 25 (310) T COG4594 1 MHMKKTAIILTLLLLLLLVAACSSS 25 (310) T ss_pred CCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 9750359999999999999771476 No 59 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=53.16 E-value=8.7 Score=18.33 Aligned_cols=20 Identities=25% Similarity=0.321 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHCCCCEEE Q ss_conf 99999999999836851433 Q gi|254780803|r 8 RILVVFNFFLLINSCSVYPF 27 (165) Q Consensus 8 k~l~~~~~~~~ls~CGf~Pl 27 (165) +++.+++.+|+|+||..-+= T Consensus 4 ~i~~il~~~llL~GCs~mn~ 23 (304) T pfam07901 4 LIKLILIATLLLSGCSTTNN 23 (304) T ss_pred HHHHHHHHHHHHCCCCCCCC T ss_conf 89999999999713344555 No 60 >PRK11443 hypothetical protein; Provisional Probab=52.93 E-value=8 Score=18.55 Aligned_cols=21 Identities=19% Similarity=0.613 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHHHHCCCCEEE Q ss_conf 999999999999836851433 Q gi|254780803|r 7 LRILVVFNFFLLINSCSVYPF 27 (165) Q Consensus 7 ~k~l~~~~~~~~ls~CGf~Pl 27 (165) ||.+.+..+.++|+||--.|- T Consensus 1 Mk~~~~~~~~lll~GCq~~p~ 21 (120) T PRK11443 1 MKKFIAPLLALLVSGCQIDPY 21 (120) T ss_pred CHHHHHHHHHHHHHCCCCCCC T ss_conf 915899999999966678962 No 61 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=52.84 E-value=13 Score=17.37 Aligned_cols=21 Identities=14% Similarity=0.207 Sum_probs=11.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCE Q ss_conf 889999999999998368514 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf~ 25 (165) +++-.++.+.-.++|+|||=. T Consensus 4 Kl~~~~~~~~svl~LaaCs~~ 24 (310) T PRK01326 4 KLIAGAVTLLSVATLAACSKT 24 (310) T ss_pred HHHHHHHHHHHHHHHHHHCCC T ss_conf 899999999999999871699 No 62 >COG4669 EscJ Type III secretory pathway, lipoprotein EscJ [Intracellular trafficking and secretion] Probab=52.24 E-value=12 Score=17.53 Aligned_cols=26 Identities=23% Similarity=0.494 Sum_probs=20.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 788999999999999836851433415 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVYPFYYL 30 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~PlY~~ 30 (165) +++.+.++++.+.++|+||- +-+|.. T Consensus 2 k~~~~~l~~l~l~l~L~gCk-~~Ly~g 27 (246) T COG4669 2 KSLRKYLFLLLLILLLTGCK-VDLYTG 27 (246) T ss_pred CHHHHHHHHHHHHHHHHCCH-HHHHCC T ss_conf 02899999999999983356-988707 No 63 >KOG0604 consensus Probab=51.61 E-value=9.2 Score=18.18 Aligned_cols=16 Identities=25% Similarity=0.445 Sum_probs=12.8 Q ss_pred CCCCEEEEECCCCCCC Q ss_conf 6851433415776720 Q gi|254780803|r 21 SCSVYPFYYLKNQDRT 36 (165) Q Consensus 21 ~CGf~PlY~~~~~~~~ 36 (165) =|||-|.|+..+...+ T Consensus 257 LCGyPPFYS~hg~ais 272 (400) T KOG0604 257 LCGYPPFYSNHGLAIS 272 (400) T ss_pred HCCCCCCCCCCCCCCC T ss_conf 5179850136786578 No 64 >COG3317 NlpB Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=51.44 E-value=12 Score=17.58 Aligned_cols=25 Identities=24% Similarity=0.380 Sum_probs=18.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 7889999999999998368514334 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVYPFY 28 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~PlY 28 (165) +...+.++.+.+..+|+||+-.|=| T Consensus 2 k~~~~lv~~al~v~~LaaCSs~~~~ 26 (342) T COG3317 2 KSSAKLVLGALLVLLLAACSSDSEY 26 (342) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 1178899999999987414688500 No 65 >TIGR02898 spore_YhcN_YlaJ sporulation lipoprotein, YhcN/YlaJ family; InterPro: IPR014247 This entry contains YhcN and YlaJ, which are predicted lipoproteins that have been detected as spore proteins but not vegetative proteins in Bacillus subtilis. Both appear to be expressed under control of the RNA polymerase sigma-G factor. The YlaJ-like members of this family have a low-complexity, strongly acidic, 40-residue C-terminal domain. . Probab=51.38 E-value=10 Score=17.90 Aligned_cols=17 Identities=18% Similarity=0.614 Sum_probs=10.2 Q ss_pred HHHHHHHHHH--HCCCCEE Q ss_conf 9999999998--3685143 Q gi|254780803|r 10 LVVFNFFLLI--NSCSVYP 26 (165) Q Consensus 10 l~~~~~~~~l--s~CGf~P 26 (165) ++++.++|+| +||+..- T Consensus 3 ~~~ll~~~~l~~tgC~~~~ 21 (185) T TIGR02898 3 FIILLLLLLLLATGCTNAQ 21 (185) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 8899999999873134100 No 66 >PRK11548 hypothetical protein; Provisional Probab=51.34 E-value=11 Score=17.61 Aligned_cols=16 Identities=25% Similarity=0.563 Sum_probs=9.4 Q ss_pred HHHHHHHCCCCE-E-EEE Q ss_conf 999998368514-3-341 Q gi|254780803|r 14 NFFLLINSCSVY-P-FYY 29 (165) Q Consensus 14 ~~~~~ls~CGf~-P-lY~ 29 (165) .++++++||++- | +|. T Consensus 12 l~~lllsgCS~~~~~vYk 29 (113) T PRK11548 12 VLLMLTAGCSTLERVVYR 29 (113) T ss_pred HHHHHHCCCCCCCCCCCC T ss_conf 999998011578775657 No 67 >PRK10477 outer membrane lipoprotein Blc; Provisional Probab=51.19 E-value=11 Score=17.70 Aligned_cols=21 Identities=24% Similarity=0.493 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 788999999999999836851 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSV 24 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf 24 (165) .|++.+++.+.+.++|.||+= T Consensus 1 ~~~~~~~~~~~~~~~~~~c~~ 21 (177) T PRK10477 1 MRLLPVVAAVTAAFLVVACSS 21 (177) T ss_pred CCHHHHHHHHHHHHHHHHCCC T ss_conf 940789999999999976469 No 68 >COG5645 Predicted periplasmic lipoprotein [General function prediction only] Probab=51.01 E-value=5.6 Score=19.52 Aligned_cols=17 Identities=29% Similarity=0.667 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999999836851 Q gi|254780803|r 8 RILVVFNFFLLINSCSV 24 (165) Q Consensus 8 k~l~~~~~~~~ls~CGf 24 (165) ++++.+...++++|||= T Consensus 3 ~i~l~l~v~lllSGC~S 19 (80) T COG5645 3 NILLSLMVLLLLSGCGS 19 (80) T ss_pred EEHHHHHHHHHHCCCCE T ss_conf 53278999998376410 No 69 >COG4939 Major membrane immunogen, membrane-anchored lipoprotein [Function unknown] Probab=50.96 E-value=9.5 Score=18.11 Aligned_cols=18 Identities=11% Similarity=0.285 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHCCCCEE Q ss_conf 999999999983685143 Q gi|254780803|r 9 ILVVFNFFLLINSCSVYP 26 (165) Q Consensus 9 ~l~~~~~~~~ls~CGf~P 26 (165) +.+.++.++||.+||++. T Consensus 6 ~~~~~~~~~LL~aCg~sd 23 (147) T COG4939 6 LVGMIVALSLLTACGKSD 23 (147) T ss_pred HHHHHHHHHHHHHHCCCC T ss_conf 659999999998706664 No 70 >pfam03886 DUF330 Protein of unknown function (DUF330). The proteins in this family are uncharacterized. The proteins are 170-190 amino residues in length. Probab=50.84 E-value=18 Score=16.37 Aligned_cols=92 Identities=14% Similarity=0.273 Sum_probs=52.4 Q ss_pred CHHHHHHHHHHHHHCCCC---------CCCCCEEEEEEECCCEEECCCCCCCCCEEEEEEEEEEEEEECCCCCEEEEEEE Q ss_conf 414489999999731477---------77671689997100000000246765048999999999998058958774257 Q gi|254780803|r 49 KNNKNEIYRSINFLTSTV---------RTKNLYQLEVNIDSFTDHAISNAFFKNIGRITLKAKYYFKEISGKNILYENNT 119 (165) Q Consensus 49 ~r~~~~i~n~L~~~~~~~---------~~~~~y~l~i~~~~~~~~~i~~~~~~~~~~~~~~~~y~L~~~~~~~~i~~~~~ 119 (165) ..+.+.+...|...++.. ...+.|.|.+.+..... ..+| +..+++.|+|.+..++.++....| T Consensus 59 ~~i~~~L~~~L~~~l~~~~V~~~~~~~~~~~~~~L~v~i~~F~~---~~~g-----~~~l~~~w~l~~~~~~~~~~~~~f 130 (160) T pfam03886 59 DQIRRALARDLSQRLPSARVAAPPWPLGGTPDYRLQVDVQRFEG---DPGG-----TAVLSAQWRLLDAGGQRVLASRSF 130 (160) T ss_pred HHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCEEEEEEEEEEEE---ECCC-----EEEEEEEEEEECCCCCEEEEEEEE T ss_conf 99999999999965898629836888788866799999999974---0797-----499999999982899646777779 Q ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9998606776157799999999999999999999 Q gi|254780803|r 120 DVTSLFDFSDQQFSQLRSHKSSEEKAIQELSENI 153 (165) Q Consensus 120 ~~~~~Y~~~~~~f~~~~~e~~a~erl~~~lae~I 153 (165) .....-. .++.-+...+.. ++...+|++| T Consensus 131 ~~~~p~~-~~~~~~lvaAl~----~al~~la~~I 159 (160) T pfam03886 131 RISEPLA-GDGYAALVAALS----QALAQLADQI 159 (160) T ss_pred EEEEECC-CCCHHHHHHHHH----HHHHHHHHHH T ss_conf 9998668-899799999999----9999999975 No 71 >PRK10838 spr putative outer membrane lipoprotein; Provisional Probab=50.48 E-value=13 Score=17.27 Aligned_cols=27 Identities=22% Similarity=0.288 Sum_probs=21.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 578899999999999983685143341 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSVYPFYY 29 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf~PlY~ 29 (165) ++.|++.+..+++..+|++|+=.|-.. T Consensus 7 ~r~~l~~~~~~~~~~~l~ac~~~~~~~ 33 (188) T PRK10838 7 LRYILRGIPAIAVAVLLSACSSNNTAK 33 (188) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 999998717999999888633898555 No 72 >COG3417 FlgN Collagen-binding surface adhesin SpaP (antigen I/II family) [General function prediction only] Probab=50.18 E-value=13 Score=17.32 Aligned_cols=20 Identities=15% Similarity=0.374 Sum_probs=12.0 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 88999999999999836851 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSV 24 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf 24 (165) |+.++..++..-++++||.= T Consensus 3 k~k~~~~il~~al~l~GCs~ 22 (200) T COG3417 3 KMKIYASILLLALFLSGCSS 22 (200) T ss_pred HHHHHHHHHHHHHHHHHCCC T ss_conf 57999999999999841136 No 73 >COG4808 Uncharacterized protein conserved in bacteria [Function unknown] Probab=50.16 E-value=15 Score=16.88 Aligned_cols=22 Identities=9% Similarity=0.454 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 5788999999999999836851 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSV 24 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf 24 (165) +++...+++.++..+.|+|||= T Consensus 4 l~kl~~~~~alil~~sl~gCgd 25 (152) T COG4808 4 LNKLFSLVVALVLVFSLAGCGD 25 (152) T ss_pred HHHHHHHHHHHHHHHHHHHCCC T ss_conf 8878899999999887540486 No 74 >COG3065 Slp Starvation-inducible outer membrane lipoprotein [Cell envelope biogenesis, outer membrane] Probab=50.09 E-value=15 Score=16.88 Aligned_cols=25 Identities=12% Similarity=0.230 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHH-HHHHHCCCCEEE Q ss_conf 5788999999999-999836851433 Q gi|254780803|r 3 YKRILRILVVFNF-FLLINSCSVYPF 27 (165) Q Consensus 3 ~~ri~k~l~~~~~-~~~ls~CGf~Pl 27 (165) ..+.+++-.++.. -|+|+||---|= T Consensus 3 ~~m~~~~~~l~~~laflLsgC~tiPk 28 (191) T COG3065 3 SQMNMKKGALIGTLAFLLSGCVTIPK 28 (191) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCH T ss_conf 58764799999999999763035783 No 75 >PRK11063 metQ DL-methionine transporter substrate-binding subunit; Provisional Probab=49.71 E-value=17 Score=16.61 Aligned_cols=13 Identities=15% Similarity=0.143 Sum_probs=8.3 Q ss_pred HHHHHHHHHCCCC Q ss_conf 9999999836851 Q gi|254780803|r 12 VFNFFLLINSCSV 24 (165) Q Consensus 12 ~~~~~~~ls~CGf 24 (165) .+...++|+|||= T Consensus 13 ~l~~~l~l~gCg~ 25 (271) T PRK11063 13 ALIGSLALVGCGQ 25 (271) T ss_pred HHHHHHHHCCCCC T ss_conf 9999997500589 No 76 >PRK12699 flgH flagellar basal body L-ring protein; Reviewed Probab=49.35 E-value=11 Score=17.80 Aligned_cols=21 Identities=24% Similarity=0.540 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCE Q ss_conf 7889999999999998368514 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~ 25 (165) .|.++.+++ +.+++|+||... T Consensus 16 ~~~~~~~~~-~~~~~l~gC~~~ 36 (246) T PRK12699 16 GRLLGPVLI-VMLALVGGCSLP 36 (246) T ss_pred CCHHHHHHH-HHHHHHCCCCCC T ss_conf 324888999-999986044689 No 77 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=48.80 E-value=12 Score=17.48 Aligned_cols=19 Identities=21% Similarity=0.565 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHHHCCCCE Q ss_conf 9999999999998368514 Q gi|254780803|r 7 LRILVVFNFFLLINSCSVY 25 (165) Q Consensus 7 ~k~l~~~~~~~~ls~CGf~ 25 (165) .|+++++.+.++|+||.=. T Consensus 2 kk~~~l~~~~l~LagCas~ 20 (26) T pfam08139 2 KKLLLLLLALLLLAGCASX 20 (26) T ss_pred HHHHHHHHHHHHHHCCCCC T ss_conf 6699999999998233200 No 78 >PRK13883 conjugal transfer protein TrbH; Provisional Probab=48.22 E-value=12 Score=17.59 Aligned_cols=25 Identities=20% Similarity=0.293 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 8999999999999836851433415 Q gi|254780803|r 6 ILRILVVFNFFLLINSCSVYPFYYL 30 (165) Q Consensus 6 i~k~l~~~~~~~~ls~CGf~PlY~~ 30 (165) |.|++.+..|.++|+||--+-.||+ T Consensus 1 MrK~~~~all~laL~gCAtt~~YGN 25 (156) T PRK13883 1 MRKILSLALLALALGGCATTSQYGN 25 (156) T ss_pred CHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 9268999999999703313578877 No 79 >pfam06474 MLTD_N MLTD_N. Probab=47.62 E-value=8.9 Score=18.27 Aligned_cols=19 Identities=26% Similarity=0.352 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHHCCCCEE Q ss_conf 9999999999983685143 Q gi|254780803|r 8 RILVVFNFFLLINSCSVYP 26 (165) Q Consensus 8 k~l~~~~~~~~ls~CGf~P 26 (165) |...++.+.+||+||--.| T Consensus 2 k~kail~~alLLaGCQst~ 20 (93) T pfam06474 2 RAKAVLAAALLLVGCQSTQ 20 (93) T ss_pred CHHHHHHHHHHHHHHCCCC T ss_conf 1899999999999813799 No 80 >TIGR02722 lp_ uncharacterized lipoprotein; InterPro: IPR014094 Members of this protein family are restricted to the Proteobacteria, and all are predicted lipoproteins. In genomes that contain the thiK gene for the salvage enzyme thiamin kinase, the member of this family is encoded nearby.. Probab=47.44 E-value=18 Score=16.44 Aligned_cols=30 Identities=20% Similarity=0.256 Sum_probs=20.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCCE-EEEECCCC Q ss_conf 7889999999999998368514-33415776 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVY-PFYYLKNQ 33 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~-PlY~~~~~ 33 (165) ||.+-++++++.-++|+||.=+ |-|..... T Consensus 1 k~~~~l~~~~~~Al~L~GC~~~~~~y~~~~~ 31 (215) T TIGR02722 1 KKYKILIFVALLALLLSGCVSQRVAYTDATD 31 (215) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 8378789999999998546887266568875 No 81 >PRK10449 heat-inducible protein; Provisional Probab=47.38 E-value=11 Score=17.83 Aligned_cols=21 Identities=14% Similarity=0.314 Sum_probs=15.8 Q ss_pred HHHHHHHHHHHHHHCCCCEEE Q ss_conf 999999999999836851433 Q gi|254780803|r 7 LRILVVFNFFLLINSCSVYPF 27 (165) Q Consensus 7 ~k~l~~~~~~~~ls~CGf~Pl 27 (165) +|++.++.+.++++||.-.|. T Consensus 2 Kk~l~l~~~~llLagC~s~~~ 22 (140) T PRK10449 2 KKVVALVALSLLMAGCVSSGK 22 (140) T ss_pred CHHHHHHHHHHHHHHCCCCCC T ss_conf 034999999999975238998 No 82 >PRK10797 glutamate and aspartate transporter subunit; Provisional Probab=46.87 E-value=21 Score=16.00 Aligned_cols=25 Identities=8% Similarity=0.121 Sum_probs=20.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCE Q ss_conf 9157889999999999998368514 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~ 25 (165) |.+|++...++.+.++..+.+|+-. T Consensus 1 m~~kk~~~~~l~~~l~~~~~~~~~~ 25 (302) T PRK10797 1 MQLRKLATALLALGLSAGLAQAEDA 25 (302) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 9458999999999999999745677 No 83 >COG4764 Uncharacterized protein conserved in bacteria [Function unknown] Probab=43.63 E-value=15 Score=16.83 Aligned_cols=21 Identities=24% Similarity=0.393 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHHCCCCEEE Q ss_conf 999999999999836851433 Q gi|254780803|r 7 LRILVVFNFFLLINSCSVYPF 27 (165) Q Consensus 7 ~k~l~~~~~~~~ls~CGf~Pl 27 (165) +..+++...++.|+||+-.|- T Consensus 5 ~~r~~~~v~lL~LagCaTaP~ 25 (197) T COG4764 5 MMRLVFAVVLLALAGCATAPS 25 (197) T ss_pred HHHHHHHHHHHHHHHCCCCCC T ss_conf 698999999999703036876 No 84 >TIGR00752 slp outer membrane lipoprotein, Slp family; InterPro: IPR004658 Slp superfamily members are present in the Gram-negative gamma proteobacteria Escherichia coli (which also contains a close paralog), Haemophilus influenzae and Pasteurella multocida and Vibrio cholerae. The known members of the family to date share a motif LX[GA]C near the N-terminus, which is compatible with the possibility that the protein is modified into a lipoprotein with Cys as the new N-terminus. Slp from E. coli is known to be a lipoprotein of the outer membrane and to be expressed in response to carbon starvation.; GO: 0019867 outer membrane. Probab=42.81 E-value=12 Score=17.50 Aligned_cols=23 Identities=17% Similarity=0.375 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 89999999999998368514334 Q gi|254780803|r 6 ILRILVVFNFFLLINSCSVYPFY 28 (165) Q Consensus 6 i~k~l~~~~~~~~ls~CGf~PlY 28 (165) +.|-+++.+|.|+|+||---|== T Consensus 2 ~~~g~l~~~l~f~L~gC~~~P~~ 24 (190) T TIGR00752 2 VKKGLLITALAFLLTGCIAVPKA 24 (190) T ss_pred CCHHHHHHHHHHHHHHHCCCCCC T ss_conf 20145688899887531467423 No 85 >PRK10760 murein hydrolase B; Provisional Probab=42.06 E-value=13 Score=17.19 Aligned_cols=17 Identities=6% Similarity=0.211 Sum_probs=6.9 Q ss_pred EEEEEEEEECCCCCHHHH Q ss_conf 257999860677615779 Q gi|254780803|r 117 NNTDVTSLFDFSDQQFSQ 134 (165) Q Consensus 117 ~~~~~~~~Y~~~~~~f~~ 134 (165) ..|.+..-||. +..|+. T Consensus 326 ~NF~vI~rYN~-S~~YAl 342 (357) T PRK10760 326 PNFYTITRYNH-STHYAM 342 (357) T ss_pred CCHHHHHHHCC-CHHHHH T ss_conf 67698877408-698999 No 86 >PRK06760 hypothetical protein; Provisional Probab=41.95 E-value=7.1 Score=18.87 Aligned_cols=23 Identities=22% Similarity=0.449 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHH-HHHHHCCCCE Q ss_conf 5788999999999-9998368514 Q gi|254780803|r 3 YKRILRILVVFNF-FLLINSCSVY 25 (165) Q Consensus 3 ~~ri~k~l~~~~~-~~~ls~CGf~ 25 (165) |+|.+.++++..+ +.+|+||+|- T Consensus 1 MKK~l~i~i~~iis~~~lsaCS~~ 24 (223) T PRK06760 1 MKKTLTIFMLTILLLISFSACSKK 24 (223) T ss_pred CCEEEEEHHHHHHHHHHHHHCCCC T ss_conf 950676499999999984003367 No 87 >PRK12407 flgH flagellar basal body L-ring protein; Reviewed Probab=41.80 E-value=12 Score=17.52 Aligned_cols=16 Identities=25% Similarity=0.607 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHCCC Q ss_conf 9999999999983685 Q gi|254780803|r 8 RILVVFNFFLLINSCS 23 (165) Q Consensus 8 k~l~~~~~~~~ls~CG 23 (165) |++++..++++|+||. T Consensus 2 r~l~l~~~~l~LsGC~ 17 (220) T PRK12407 2 RFLILTPMVLALCGCE 17 (220) T ss_pred CCHHHHHHHHHHHCCC T ss_conf 1579999999973658 No 88 >pfam08085 Entericidin Entericidin EcnA/B family. This family consists of the entericidin antidote/toxin peptides. The entericidin locus is activated in stationary phase under high osmolarity conditions by rho-S and simultaneously repressed by the osmoregulatory EnvZ/OmpR signal transduction pathway. The entericidin locus encodes tandem paralogous genes (ecnAB) and directs the synthesis of two small cell-envelope lipoproteins which can maintain plasmids in bacterial population by means of post-segregational killing. Probab=41.50 E-value=22 Score=15.85 Aligned_cols=19 Identities=21% Similarity=0.648 Sum_probs=10.4 Q ss_pred HHHH-HHHHHHHHHHHHHCC Q ss_conf 7889-999999999998368 Q gi|254780803|r 4 KRIL-RILVVFNFFLLINSC 22 (165) Q Consensus 4 ~ri~-k~l~~~~~~~~ls~C 22 (165) ||.+ .++.+....+.|++| T Consensus 2 k~~~~~~~~l~~~~~~lagC 21 (42) T pfam08085 2 KKLIALLLALLLLALVLAGC 21 (42) T ss_pred HHHHHHHHHHHHHHHHHHCC T ss_conf 23699999999999998612 No 89 >PRK11162 mltA murein transglycosylase A; Provisional Probab=41.46 E-value=18 Score=16.41 Aligned_cols=22 Identities=23% Similarity=0.365 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHCCCCEEE Q ss_conf 8999999999999836851433 Q gi|254780803|r 6 ILRILVVFNFFLLINSCSVYPF 27 (165) Q Consensus 6 i~k~l~~~~~~~~ls~CGf~Pl 27 (165) +.|-+++.+.+.+|.+|+=+|. T Consensus 2 ~~~~~~~~~~~~~l~~c~~~p~ 23 (362) T PRK11162 2 WVKYLLMGIVVALLAACSSKPT 23 (362) T ss_pred HHHHHHHHHHHHHHHHHCCCCC T ss_conf 5899999999999976047987 No 90 >PRK00022 lolB outer membrane lipoprotein LolB; Provisional Probab=41.25 E-value=17 Score=16.63 Aligned_cols=16 Identities=25% Similarity=0.349 Sum_probs=10.5 Q ss_pred HHHHHHHHHCCCCEEE Q ss_conf 9999999836851433 Q gi|254780803|r 12 VFNFFLLINSCSVYPF 27 (165) Q Consensus 12 ~~~~~~~ls~CGf~Pl 27 (165) ++...++|+||.=.|. T Consensus 8 l~~~~llLsgCat~p~ 23 (203) T PRK00022 8 LPLAALLLAGCASLPP 23 (203) T ss_pred HHHHHHHHHHCCCCCC T ss_conf 9999999866148999 No 91 >TIGR01742 SA_tandem_lipo Staphylococcus tandem lipoproteins; InterPro: IPR007595 This family contains several uncharacterised staphylococcal proteins.. Probab=40.90 E-value=14 Score=17.07 Aligned_cols=24 Identities=17% Similarity=0.326 Sum_probs=14.5 Q ss_pred CC-HHHHHHHHHHHHH-HHHHHCCCC Q ss_conf 91-5788999999999-999836851 Q gi|254780803|r 1 ML-YKRILRILVVFNF-FLLINSCSV 24 (165) Q Consensus 1 ml-~~ri~k~l~~~~~-~~~ls~CGf 24 (165) |. .+|+...+.++++ ..++.|||- T Consensus 1 Mky~~k~~ly~s~~il~I~~vvgC~~ 26 (267) T TIGR01742 1 MKYAKKLALYISVLILIISFVVGCGK 26 (267) T ss_pred CCHHHHHHHHHHHHHHHEEEEECCCC T ss_conf 91357999999998643000125875 No 92 >pfam06085 Rz1 Lipoprotein Rz1 precursor. This family consists of several bacteria and phage lipoprotein Rz1 precursors. Rz1 is a proline-rich lipoprotein from bacteriophage lambda which is known to have fusogenic properties. Rz1-induced liposome fusion is thought to be mediated primarily by the generation of local perturbation in the bilayer lipid membrane and to a lesser extent by electrostatic forces. Probab=40.74 E-value=15 Score=16.90 Aligned_cols=24 Identities=25% Similarity=0.401 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 889999999999998368514334 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSVYPFY 28 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf~PlY 28 (165) +..-.++...+.+.++|||-+|=- T Consensus 3 ~lkm~l~~~m~~L~vsaC~S~p~v 26 (59) T pfam06085 3 KLKMKLCVMMLPLVVSACSSKPPV 26 (59) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCC T ss_conf 202569999989999982699986 No 93 >TIGR02643 T_phosphoryl thymidine phosphorylase; InterPro: IPR013465 Thymidine phosphorylase (alternate name: pyrimidine phosphorylase) 2.4.2.4 from EC is involved, in E. coli and other Proteobacteria, in (deoxy)nucleotide degradation. It is often encoded in an operon together with a deoxyribose-phosphate aldolase, a phosphopentomutase and a purine nucleoside phosphorylase. In many other lineages, the corresponding enzyme is designated pyrimidine-nucleoside phosphorylase (2.4.2.2 from EC); the naming convention here follows standard literature practice.; GO: 0009032 thymidine phosphorylase activity. Probab=40.72 E-value=15 Score=16.82 Aligned_cols=22 Identities=9% Similarity=0.221 Sum_probs=17.1 Q ss_pred HHHHHHHHCCC-CEEEEECCCCC Q ss_conf 99999983685-14334157767 Q gi|254780803|r 13 FNFFLLINSCS-VYPFYYLKNQD 34 (165) Q Consensus 13 ~~~~~~ls~CG-f~PlY~~~~~~ 34 (165) +.+-.+.++|| |-||-|..+.+ T Consensus 95 LmLgPiVAACGgyVPMISGRGLG 117 (440) T TIGR02643 95 LMLGPIVAACGGYVPMISGRGLG 117 (440) T ss_pred HHHHHHHHHHCCCCCCCCCCCCC T ss_conf 88788999745852562578887 No 94 >TIGR02544 III_secr_YscJ type III secretion apparatus lipoprotein, YscJ/HrcJ family; InterPro: IPR003282 Secretion of virulence factors in Gram-negative bacteria involves transportation of the protein across two membranes to reach the cell exterior. There have been four secretion systems described in animal enteropathogens such as Salmonella and Yersinia, with further sequence similarities in plant pathogens like Ralstonia and Erwinia. The type III secretion system is of great interest as it is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. The protein subunits of the system are very similar to those of bacterial flagellar biosynthesis . However, while the latter forms a ring structure to allow secretion of flagellin and is an integral part of the flagellum itself, type III subunits in the outer membrane translocate secreted proteins through a channel-like structure. One of the outer membrane protein subunit families, termed "K" here for nomenclature purposes, aids in the structural assembly of the invasion complex . It is also described as a lipoprotein. Members of this family include the Salmonella PrgK and SsaJ genes, MxiJ from Shigella, YscJ from Yersinia, and from the plant enteropathogens NolT (Rhizobium) and HrcJ Erwinia). The flagellar M-ring protein FliF also shares a low level of similarity, presumably due to evolution of the type III secretion system from the flagellar biosynthetic pathway.; GO: 0009306 protein secretion. Probab=40.60 E-value=24 Score=15.66 Aligned_cols=24 Identities=29% Similarity=0.503 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHHHCCCCEE-EEECC Q ss_conf 9999999999983685143-34157 Q gi|254780803|r 8 RILVVFNFFLLINSCSVYP-FYYLK 31 (165) Q Consensus 8 k~l~~~~~~~~ls~CGf~P-lY~~~ 31 (165) ..++++.++++|+||.=+- ||+.= T Consensus 3 ~~~~~~~l~llL~GC~~~~~Ly~gL 27 (203) T TIGR02544 3 RKLLLLLLLLLLTGCKVDLELYSGL 27 (203) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 5889999999971787405650478 No 95 >pfam02352 Decorin_bind Decorin binding protein. This family consists of decorin binding proteins from Borrelia. The decorin binding protein of Borrelia burgdorferi the lyme disease spirochetes adheres to the proteoglycan decorin found on collagen fibres. Probab=40.03 E-value=23 Score=15.75 Aligned_cols=22 Identities=18% Similarity=0.588 Sum_probs=12.6 Q ss_pred CCHHHHHHHHHHHHHHH-HHHCCCCE Q ss_conf 91578899999999999-98368514 Q gi|254780803|r 1 MLYKRILRILVVFNFFL-LINSCSVY 25 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~-~ls~CGf~ 25 (165) |.|-+ +++.+.+|+ +|.+||.. T Consensus 1 ~Ky~K---nllkltLlv~LLvaCgLt 23 (166) T pfam02352 1 IKYNK---NLLKLTLLVSLLVACGLT 23 (166) T ss_pred CCHHH---HHHHHHHHHHHHHHCCCC T ss_conf 92588---899999999999981574 No 96 >pfam06604 consensus Probab=39.91 E-value=24 Score=15.63 Aligned_cols=22 Identities=14% Similarity=0.436 Sum_probs=15.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCC Q ss_conf 91578899999999999983685 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCS 23 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CG 23 (165) |-+.|+ +++.++.+.++|+||. T Consensus 1 M~~sk~-~~~~~~~~v~~laGCq 22 (181) T pfam06604 1 MNFSKA-GLVAGLAAVLVLAGCQ 22 (181) T ss_pred CCCCHH-HHHHHHHHHHHHHCCC T ss_conf 962067-7899999999973445 No 97 >COG3168 PilP Tfp pilus assembly protein PilP [Cell motility and secretion / Intracellular trafficking and secretion] Probab=38.34 E-value=13 Score=17.20 Aligned_cols=18 Identities=11% Similarity=0.410 Sum_probs=11.5 Q ss_pred HHHHHHHHHHHHHCCCCE Q ss_conf 999999999998368514 Q gi|254780803|r 8 RILVVFNFFLLINSCSVY 25 (165) Q Consensus 8 k~l~~~~~~~~ls~CGf~ 25 (165) |.+++++.++.++|||-. T Consensus 2 k~~~~l~slL~lvgC~~~ 19 (170) T COG3168 2 KLILLLISLLALVGCGAG 19 (170) T ss_pred CCHHHHHHHHHHHHCCCC T ss_conf 403688999999750468 No 98 >PRK03095 prsA peptidylprolyl isomerase; Reviewed Probab=37.69 E-value=25 Score=15.50 Aligned_cols=12 Identities=8% Similarity=0.384 Sum_probs=8.0 Q ss_pred HHHHHHHHCCCC Q ss_conf 999999836851 Q gi|254780803|r 13 FNFFLLINSCSV 24 (165) Q Consensus 13 ~~~~~~ls~CGf 24 (165) .+.+++|+|||- T Consensus 10 ~~svl~LaaC~~ 21 (287) T PRK03095 10 ATSVIALSACGT 21 (287) T ss_pred HHHHHHHHHCCC T ss_conf 999999985168 No 99 >pfam06291 Lambda_Bor Bor protein. This family consists of several Bacteriophage lambda Bor and Escherichia coli Iss proteins. Expression of bor significantly increases the survival of the Escherichia coli host cell in animal serum. This property is a well known bacterial virulence determinant indeed, bor and its adjacent sequences are highly homologous to the iss serum resistance locus of the plasmid ColV2-K94, which confers virulence in animals. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=37.00 E-value=20 Score=16.08 Aligned_cols=23 Identities=26% Similarity=0.447 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 99999999999983685143341 Q gi|254780803|r 7 LRILVVFNFFLLINSCSVYPFYY 29 (165) Q Consensus 7 ~k~l~~~~~~~~ls~CGf~PlY~ 29 (165) +|+++...+-++|+||.=+...- T Consensus 2 Kk~~~~~~~allLtgCa~QT~~~ 24 (97) T pfam06291 2 KKMLFAAALALLITGCAQQTFTV 24 (97) T ss_pred HHHHHHHHHHHHHCCCCCEEEEE T ss_conf 22599999999972133049993 No 100 >pfam11153 DUF2931 Protein of unknown function (DUF2931). Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function. Probab=36.61 E-value=17 Score=16.55 Aligned_cols=16 Identities=25% Similarity=0.752 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHHHCCC Q ss_conf 9999999999983685 Q gi|254780803|r 8 RILVVFNFFLLINSCS 23 (165) Q Consensus 8 k~l~~~~~~~~ls~CG 23 (165) +.++++.++++++||. T Consensus 2 ~~i~~ll~~lll~aCs 17 (202) T pfam11153 2 KKILLLLLLLLLTACS 17 (202) T ss_pred HHHHHHHHHHHHHHHC T ss_conf 0148999999987602 No 101 >pfam11254 DUF3053 Protein of unknown function (DUF3053). Some members in this family of proteins are annotated as the membrane protein YiaF. No function is currently known. Probab=35.71 E-value=26 Score=15.39 Aligned_cols=20 Identities=20% Similarity=0.353 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 88999999999999836851 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSV 24 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf 24 (165) |.+.-++.+...+.|+|||= T Consensus 2 R~~~p~~al~~v~~LagCgd 21 (229) T pfam11254 2 RWFAPLLALLVVLQLAGCGD 21 (229) T ss_pred CHHHHHHHHHHHHHHHHCCC T ss_conf 33888999999999850368 No 102 >COG2834 LolA Outer membrane lipoprotein-sorting protein [Cell envelope biogenesis, outer membrane] Probab=33.84 E-value=30 Score=15.02 Aligned_cols=26 Identities=15% Similarity=0.356 Sum_probs=18.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEE Q ss_conf 91578899999999999983685143 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVYP 26 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~P 26 (165) |..+++..+++++...+++++|.... T Consensus 1 ~~~~~~~~~~~~~l~~~f~~~~~~~~ 26 (211) T COG2834 1 MMLMMMKLLLALALLLLFLSACAQAG 26 (211) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 92378999999999999988887745 No 103 >PRK11653 hypothetical protein; Provisional Probab=33.34 E-value=27 Score=15.34 Aligned_cols=23 Identities=4% Similarity=0.202 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 88999999999999836851433 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSVYPF 27 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf~Pl 27 (165) |+.-+.+.+...|+|+||+=++- T Consensus 20 ~l~P~alav~avF~LagCe~~~e 42 (223) T PRK11653 20 HLTPVALAVAAVFMLAGCEKSDE 42 (223) T ss_pred CCHHHHHHHHHHHHHHHCCCCCC T ss_conf 31278889889998650478866 No 104 >pfam07854 DUF1646 Protein of unknown function (DUF1646). Some of the members of this family are hypothetical bacterial and archaeal proteins, but others are annotated as being cation transporters expressed by the archaebacterium Methanosarcina mazei. Probab=32.87 E-value=31 Score=14.93 Aligned_cols=24 Identities=17% Similarity=0.296 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 7889999999999998368514334 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVYPFY 28 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~PlY 28 (165) -|-.|+.+++..+-+| |-||+|+- T Consensus 226 iRa~KVy~Fi~AL~lL-G~gFkPli 249 (347) T pfam07854 226 IRAVKVFVFIFALELL-GEGFKPLI 249 (347) T ss_pred HHHHHHHHHHHHHHHH-HCCCHHHH T ss_conf 9899999999999998-25556679 No 105 >PRK10270 hypothetical protein; Provisional Probab=32.37 E-value=36 Score=14.59 Aligned_cols=20 Identities=25% Similarity=0.378 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHHHHHHHHCC Q ss_conf 57889999999999998368 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSC 22 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~C 22 (165) +||++.+++++.+++++.++ T Consensus 1 Mkk~~~~~~~l~~~l~i~~g 20 (340) T PRK10270 1 MKKVLLIILLLLVVLGIAAG 20 (340) T ss_pred CCHHHHHHHHHHHHHHHHHH T ss_conf 90289999999999999999 No 106 >pfam12262 Lipase_bact_N Bacterial virulence factor lipase N-terminal. This domain family is found in bacteria, and is typically between 258 and 271 amino acids in length. There are two conserved sequence motifs: DGT and DGWST. This family is the N-terminal region of bacterial virulence factor lipase. The N-terminal region contains a potential signalling sequence. Probab=32.19 E-value=31 Score=14.97 Aligned_cols=20 Identities=25% Similarity=0.419 Sum_probs=9.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 788999999999999836851 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSV 24 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf 24 (165) ||.++..+ ++.-++|+|||= T Consensus 2 Kk~~l~~~-i~SAL~LaGCg~ 21 (269) T pfam12262 2 KKKFLALL-LASALLLAGCGD 21 (269) T ss_pred CHHHHHHH-HHHHHHHCCCCC T ss_conf 24799999-999864111579 No 107 >PRK11138 outer membrane protein assembly complex subunit YfgL; Provisional Probab=32.11 E-value=34 Score=14.75 Aligned_cols=22 Identities=23% Similarity=0.475 Sum_probs=11.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 915788999999999999836851 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSV 24 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf 24 (165) |-++|. ++..++.+.+|+||+. T Consensus 1 ~~~~~~--~~~~l~~~~ll~gCs~ 22 (394) T PRK11138 1 MQLRKL--LLPGLLSVALLSGCSL 22 (394) T ss_pred CCHHHH--HHHHHHHHHHHHHCCC T ss_conf 951677--9999999999865237 No 108 >TIGR01781 Trep_dent_lipo Treponema denticola clustered lipoprotein; InterPro: IPR010103 This entry represents a family of six predicted lipoproteins from a region of about 20 tandemly arranged genes in the Treponema denticola genome. Two other neighbouring genes share the lipoprotein signal peptide region but do not show more extensive homology. The function of this locus is unknown.. Probab=31.87 E-value=35 Score=14.63 Aligned_cols=22 Identities=23% Similarity=0.466 Sum_probs=16.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCCE Q ss_conf 7889999999999998368514 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~ 25 (165) |--+|++|++.+=+||-||.=+ T Consensus 3 KdKlKlIFIL~LAvLLFSC~ke 24 (463) T TIGR01781 3 KDKLKLIFILMLAVLLFSCKKE 24 (463) T ss_pred CCHHHHHHHHHHHHHHHCCCCC T ss_conf 5404688899999986047621 No 109 >pfam11873 DUF3393 Domain of unknown function (DUF3393). This domain is functionally uncharacterized. This domain is found in bacteria. This presumed domain is typically between 188 to 206 amino acids in length. This domain is found associated with pfam01464. Probab=31.86 E-value=18 Score=16.48 Aligned_cols=19 Identities=26% Similarity=0.494 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHHCCCCEE Q ss_conf 9999999999983685143 Q gi|254780803|r 8 RILVVFNFFLLINSCSVYP 26 (165) Q Consensus 8 k~l~~~~~~~~ls~CGf~P 26 (165) |.+.++++.++|.||+=+| T Consensus 2 k~l~~~~~~llL~~Cs~~~ 20 (200) T pfam11873 2 KYLSLLLILLLLSGCSSQF 20 (200) T ss_pred CEEHHHHHHHHHHHCCCCC T ss_conf 3107999999984157765 No 110 >PRK06193 hypothetical protein; Provisional Probab=31.04 E-value=36 Score=14.60 Aligned_cols=21 Identities=19% Similarity=0.298 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCE Q ss_conf 889999999999998368514 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf~ 25 (165) |.+++++.+...++|++||-. T Consensus 2 ~~~~~~~~l~~~~~l~~~~~~ 22 (211) T PRK06193 2 RAAALLCALLVAALLAGCGRE 22 (211) T ss_pred CHHHHHHHHHHHHHHCCCCCC T ss_conf 179999999999994252200 No 111 >PRK11671 mltC murein transglycosylase C; Provisional Probab=30.99 E-value=29 Score=15.19 Aligned_cols=17 Identities=24% Similarity=0.336 Sum_probs=10.8 Q ss_pred HHHHHHHHHHHHHHHCC Q ss_conf 89999999999998368 Q gi|254780803|r 6 ILRILVVFNFFLLINSC 22 (165) Q Consensus 6 i~k~l~~~~~~~~ls~C 22 (165) +.|+++++.+.+||.+| T Consensus 2 ~~k~~~~~~~~~~l~~c 18 (360) T PRK11671 2 MKKYLALALIAPLLISC 18 (360) T ss_pred CHHHHHHHHHHHHHHHH T ss_conf 14688999999999871 No 112 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=30.98 E-value=31 Score=14.94 Aligned_cols=10 Identities=20% Similarity=0.541 Sum_probs=6.9 Q ss_pred HHHHHHCCCC Q ss_conf 9999836851 Q gi|254780803|r 15 FFLLINSCSV 24 (165) Q Consensus 15 ~~~~ls~CGf 24 (165) ..++|+||.= T Consensus 12 ~~l~lsgC~s 21 (160) T PRK09967 12 TSLILTGCQS 21 (160) T ss_pred HHHHHHHCCC T ss_conf 9999984589 No 113 >TIGR01329 cysta_beta_ly_E cystathionine beta-lyase; InterPro: IPR006238 This group of sequences represent cystathionine beta-lyase (alternate name: beta-cystathionase), one of several pyridoxal-dependent enzymes of cysteine, methionine, and homocysteine metabolism. This enzyme is involved in the biosynthesis of Met from Cys.. Probab=30.68 E-value=24 Score=15.60 Aligned_cols=17 Identities=29% Similarity=0.321 Sum_probs=11.3 Q ss_pred HHHHHHCCCCEEEEECC Q ss_conf 99998368514334157 Q gi|254780803|r 15 FFLLINSCSVYPFYYLK 31 (165) Q Consensus 15 ~~~~ls~CGf~PlY~~~ 31 (165) ..+|=++|||+|-.+.+ T Consensus 274 A~fL~nsCGfkP~~~~v 290 (415) T TIGR01329 274 ALFLENSCGFKPTHSRV 290 (415) T ss_pred HHHHHHCCCCCCCCCCC T ss_conf 99863216888668831 No 114 >PRK10510 putative outer membrane lipoprotein; Provisional Probab=30.59 E-value=38 Score=14.41 Aligned_cols=24 Identities=25% Similarity=0.553 Sum_probs=17.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 788999999999999836851433 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVYPF 27 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~Pl 27 (165) +|++.+..++...++|+||---|- T Consensus 3 ~r~~lia~~~a~~l~lsgC~t~p~ 26 (219) T PRK10510 3 KRVYLIAAIVSGALAVSGCTTNPY 26 (219) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 268999999999987413567898 No 115 >COG5510 Predicted small secreted protein [Function unknown] Probab=29.16 E-value=40 Score=14.26 Aligned_cols=23 Identities=35% Similarity=0.499 Sum_probs=13.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCC Q ss_conf 91578899999999999983685 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCS 23 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CG 23 (165) |..+-++.+.+++..-++|++|. T Consensus 1 mmk~t~l~i~~vll~s~llaaCN 23 (44) T COG5510 1 MMKKTILLIALVLLASTLLAACN 23 (44) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH T ss_conf 90489999999999999998866 No 116 >PRK10215 hypothetical protein; Provisional Probab=29.04 E-value=32 Score=14.85 Aligned_cols=24 Identities=13% Similarity=0.292 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHCC-CCEEE Q ss_conf 7889999999999998368-51433 Q gi|254780803|r 4 KRILRILVVFNFFLLINSC-SVYPF 27 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~C-Gf~Pl 27 (165) +-+.|-+.++.+.++|+|| .|.-+ T Consensus 6 k~~~k~~~l~~~~f~LSgC~t~~L~ 30 (219) T PRK10215 6 KGFFKAAGLLPLAFFLSGCISFGLV 30 (219) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 3544452476689999877665542 No 117 >PRK11189 lipoprotein NlpI; Provisional Probab=28.75 E-value=40 Score=14.27 Aligned_cols=20 Identities=35% Similarity=0.489 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHHHHHCCC Q ss_conf 78899999999999983685 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCS 23 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CG 23 (165) ++.++..++..+.++|+||. T Consensus 2 ~~~~r~~~~~~~~l~LsGC~ 21 (297) T PRK11189 2 KPFLRWCFVATAALLLAGCA 21 (297) T ss_pred CHHHHHHHHHHHHHHHHHHC T ss_conf 51899999999999998640 No 118 >TIGR03529 GldK_short gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. This model represents a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture than that found in Flavobacterium johnsoniae and related species (represented by (TIGR03525). Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=28.21 E-value=18 Score=16.39 Aligned_cols=21 Identities=24% Similarity=0.569 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCE Q ss_conf 889999999999998368514 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSVY 25 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf~ 25 (165) |++.+.+.+.++++|++||+- T Consensus 1 ~~~~~~~~~~~~~~~~~c~~~ 21 (344) T TIGR03529 1 RLLSITLGLAVATLLPGCGLF 21 (344) T ss_pred CEEEEHHHHHHHHHHCCCCCC T ss_conf 903300467999762024667 No 119 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=27.76 E-value=43 Score=14.11 Aligned_cols=17 Identities=29% Similarity=0.667 Sum_probs=8.9 Q ss_pred HHHHHHHHHH-HHHHHCC Q ss_conf 8999999999-9998368 Q gi|254780803|r 6 ILRILVVFNF-FLLINSC 22 (165) Q Consensus 6 i~k~l~~~~~-~~~ls~C 22 (165) +..++++.+. +++|+|| T Consensus 4 ~~~l~~~~~~a~~lLsaC 21 (924) T TIGR02917 4 ITALLTILLAAILLLSAC 21 (924) T ss_pred HHHHHHHHHHHHHHHHHC T ss_conf 999999999999998630 No 120 >pfam01298 Lipoprotein_5 Transferrin binding protein-like solute binding protein. This family of proteins are distantly related to other families of solute binding proteins. Probab=27.66 E-value=30 Score=15.05 Aligned_cols=21 Identities=29% Similarity=0.260 Sum_probs=10.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCC Q ss_conf 91578899999999999983685 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCS 23 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CG 23 (165) |..+ ++-.++....+||++|+ T Consensus 1 M~~~--l~~~~lal~~~LLSAC~ 21 (589) T pfam01298 1 MNIP--LNKLALALAAGLLSACS 21 (589) T ss_pred CCCC--CCHHHHHHHHHHHHHHC T ss_conf 9656--54549999999999734 No 121 >COG1704 LemA Uncharacterized conserved protein [Function unknown] Probab=27.03 E-value=37 Score=14.47 Aligned_cols=26 Identities=27% Similarity=0.333 Sum_probs=18.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 78899999999999983685143341 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVYPFYY 29 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~PlY~ 29 (165) ++++.++.++..+.++..|||.-+-. T Consensus 2 ~~~~i~l~vi~il~ll~~~~yN~lv~ 27 (185) T COG1704 2 RRFLIILAVIVILLLLAVGGYNGLVK 27 (185) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 23699999999999999996514999 No 122 >PRK10781 rcsF outer membrane lipoprotein; Reviewed Probab=26.98 E-value=32 Score=14.88 Aligned_cols=16 Identities=31% Similarity=0.611 Sum_probs=10.2 Q ss_pred HHHHHHHHHHHHHCCC Q ss_conf 9999999999983685 Q gi|254780803|r 8 RILVVFNFFLLINSCS 23 (165) Q Consensus 8 k~l~~~~~~~~ls~CG 23 (165) |.+.+-.+-|+|+||+ T Consensus 2 R~lpi~llal~ltGCs 17 (133) T PRK10781 2 RALPICLLALMLTGCS 17 (133) T ss_pred CHHHHHHHHHHHHCCC T ss_conf 2089999999984313 No 123 >pfam12273 RCR Chitin synthesis regulation, resistance to Congo red. RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5. Probab=26.80 E-value=43 Score=14.12 Aligned_cols=22 Identities=9% Similarity=0.386 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHCC--------CCEEEEECC Q ss_conf 9999999998368--------514334157 Q gi|254780803|r 10 LVVFNFFLLINSC--------SVYPFYYLK 31 (165) Q Consensus 10 l~~~~~~~~ls~C--------Gf~PlY~~~ 31 (165) +++++++||+.+| |-+|||+.. T Consensus 9 i~~~~i~~f~~~~~n~RRrr~G~~Pi~GT~ 38 (124) T pfam12273 9 IIALLILFFLTARINRRRRRRGLQPIYGTA 38 (124) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 999999999999873999875888876776 No 124 >TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ. Probab=26.33 E-value=31 Score=14.98 Aligned_cols=17 Identities=29% Similarity=0.587 Sum_probs=13.0 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999999836851 Q gi|254780803|r 8 RILVVFNFFLLINSCSV 24 (165) Q Consensus 8 k~l~~~~~~~~ls~CGf 24 (165) ++++++.+.++|+||++ T Consensus 2 ~~~~~~~~~~~l~gCs~ 18 (377) T TIGR03300 2 RLALVIALAALLSGCSW 18 (377) T ss_pred CCHHHHHHHHHHHHCCC T ss_conf 23899999999855657 No 125 >KOG2791 consensus Probab=26.22 E-value=38 Score=14.42 Aligned_cols=24 Identities=25% Similarity=0.452 Sum_probs=16.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 91578899999999999983685143341 Q gi|254780803|r 1 MLYKRILRILVVFNFFLLINSCSVYPFYY 29 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~ls~CGf~PlY~ 29 (165) |.|+|+.+++-.+ +++|+|--... T Consensus 2 mvYrRmhrlanaV-----iAcC~f~~~~v 25 (455) T KOG2791 2 MVYRRMHRLANAV-----IACCMFNQSHV 25 (455) T ss_pred CHHHHHHHHHHHH-----HHHHHHCEEEE T ss_conf 0369999999999-----99873120344 No 126 >PRK09861 cytoplasmic membrane lipoprotein-28; Provisional Probab=25.92 E-value=46 Score=13.90 Aligned_cols=19 Identities=11% Similarity=0.141 Sum_probs=10.9 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 8999999999999836851 Q gi|254780803|r 6 ILRILVVFNFFLLINSCSV 24 (165) Q Consensus 6 i~k~l~~~~~~~~ls~CGf 24 (165) +.....++...++|+|||= T Consensus 8 l~~~a~ll~~~~~l~~C~~ 26 (272) T PRK09861 8 LRAGAALLLAGILLAGCDQ 26 (272) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 9999999999999874589 No 127 >COG3218 ABC-type uncharacterized transport system, auxiliary component [General function prediction only] Probab=25.63 E-value=47 Score=13.87 Aligned_cols=143 Identities=10% Similarity=0.091 Sum_probs=72.7 Q ss_pred HHHHHHHHHHHHHHHHHCCCC--EEE--EECC-----CC-CCC----------------EEEEEEEEECCC--------- Q ss_conf 788999999999999836851--433--4157-----76-720----------------223257840777--------- Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSV--YPF--YYLK-----NQ-DRT----------------EYIHPIKVLVVS--------- 48 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf--~Pl--Y~~~-----~~-~~~----------------~~l~~I~i~~~~--------- 48 (165) .|..+..+...+.++|+|||= +|. |... .. +-. ..-.+|-+.... T Consensus 8 ~~~~~l~~~la~~a~L~gC~~~~~p~~~~~l~~~~~~~~~~~~~~~~qllia~p~a~kal~s~rivV~~~~geiqy~~~a 87 (205) T COG3218 8 LRRLSLAAALALAATLAGCGPEAAPNDTYSLSVHPPVDSNGPKRKTWQLLIARPTALKALDSPRIVVRPSRGEIQYYAGA 87 (205) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCEEEECCCCHHHHHCCC T ss_conf 89999999999999972347778977446415788665777766541698537763220259866984286466785377 Q ss_pred ---CHHHHHHHHHHHHHCC---------CCCC--CCCEEEEEEECCCEEECCCCCCCCCEEEEEEEEEEEEEECCCCCEE Q ss_conf ---4144899999997314---------7777--6716899971000000002467650489999999999980589587 Q gi|254780803|r 49 ---KNNKNEIYRSINFLTS---------TVRT--KNLYQLEVNIDSFTDHAISNAFFKNIGRITLKAKYYFKEISGKNIL 114 (165) Q Consensus 49 ---~r~~~~i~n~L~~~~~---------~~~~--~~~y~l~i~~~~~~~~~i~~~~~~~~~~~~~~~~y~L~~~~~~~~i 114 (165) ++.-.+++..|...+. +.++ +-.|.|..++........ + ..-.-.+.+..+|.+..++.++ T Consensus 88 ~W~d~~~~l~q~~l~~aFe~sg~~~gl~~~g~gL~aDy~l~~dlr~FE~~y~--~---~~~~A~Iei~v~Ll~~~n~~v~ 162 (205) T COG3218 88 QWGDPLPNLVQDRLVQAFEDSGRFAGLGRPGSGLAADYQLILDLRAFEIQYV--T---GAPTAVIEISVRLLNDRNGTVR 162 (205) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCEEECCCCCCCCEEEEEEEEHHHHHHHHCC--C---CCCEEEEEEEEEEECCCCCCEE T ss_conf 4368506899999999986238542025898650565654331434523225--8---8863899999887316888688 Q ss_pred EEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7425799986067761577999999999999999999999999999 Q gi|254780803|r 115 YENNTDVTSLFDFSDQQFSQLRSHKSSEEKAIQELSENIYIDIISF 160 (165) Q Consensus 115 ~~~~~~~~~~Y~~~~~~f~~~~~e~~a~erl~~~lae~I~~~i~~~ 160 (165) .+..|+.+.--+..+.. +...|-..|- .++..+|..| T Consensus 163 A~r~F~a~~pv~g~~~~-a~v~A~dkA~--------~q~~~EIV~W 199 (205) T COG3218 163 ASRVFRASQPVDGKGNA-AVVKAFDKAF--------AQLTNEIVGW 199 (205) T ss_pred EEEEEEEEECCCCCCCH-HHHHHHHHHH--------HHHHHHHHHH T ss_conf 87778986014689837-8999999999--------9998887866 No 128 >COG0386 BtuE Glutathione peroxidase [Posttranslational modification, protein turnover, chaperones] Probab=25.54 E-value=17 Score=16.57 Aligned_cols=12 Identities=25% Similarity=0.445 Sum_probs=9.4 Q ss_pred HHCCCCEEEEEC Q ss_conf 836851433415 Q gi|254780803|r 19 INSCSVYPFYYL 30 (165) Q Consensus 19 ls~CGf~PlY~~ 30 (165) -|.|||+|=|.. T Consensus 34 ASkCGfTpQYeg 45 (162) T COG0386 34 ASKCGFTPQYEG 45 (162) T ss_pred CCCCCCCHHHHH T ss_conf 544688675899 No 129 >COG3957 Phosphoketolase [Carbohydrate transport and metabolism] Probab=24.70 E-value=46 Score=13.90 Aligned_cols=16 Identities=13% Similarity=0.121 Sum_probs=11.1 Q ss_pred HHHHHCCCCEEEEECC Q ss_conf 9998368514334157 Q gi|254780803|r 16 FLLINSCSVYPFYYLK 31 (165) Q Consensus 16 ~~~ls~CGf~PlY~~~ 31 (165) -.++.||||+|++-.+ T Consensus 232 ~~~f~G~Gy~p~~veg 247 (793) T COG3957 232 KALFEGYGYEPVFVEG 247 (793) T ss_pred HHHHHHCCCCEEEECC T ss_conf 9998517984058558 No 130 >pfam09716 ETRAMP Malarial early transcribed membrane protein (ETRAMP). These sequences represent a family of proteins from the malaria parasite Plasmodium falciparum, several of which have been shown to be expressed specifically in the ring stage as well as the rodent parasite Plasmodium yoelii. A homologue from Plasmodium chabaudi was localized to the parasitophorous vacuole membrane. Members have an initial hydrophobic, Phe/Tyr-rich, stretch long enough to span the membrane, a highly charged region rich in Lys, a second putative transmembrane region and a second highly charged, low complexity sequence region. Some members have up to 100 residues of additional C-terminal sequence. These genes have been shown to be found in the sub-telomeric regions of both Plasmodium falciparum and P. yoelii chromosomes. Probab=24.29 E-value=47 Score=13.88 Aligned_cols=25 Identities=28% Similarity=0.406 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 889999999999998368514334157 Q gi|254780803|r 5 RILRILVVFNFFLLINSCSVYPFYYLK 31 (165) Q Consensus 5 ri~k~l~~~~~~~~ls~CGf~PlY~~~ 31 (165) |+-|++.++.|++.+ |=|.|=+... T Consensus 2 KisKil~f~a~Llai--n~l~p~~~~~ 26 (84) T pfam09716 2 KVSKILYFFAFLLAI--NLLAPGLNNN 26 (84) T ss_pred CHHHHHHHHHHHHHH--HHCCCCCCCH T ss_conf 178999999999999--9703431111 No 131 >KOG4544 consensus Probab=24.06 E-value=24 Score=15.68 Aligned_cols=25 Identities=12% Similarity=-0.071 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 7889999999999998368514334 Q gi|254780803|r 4 KRILRILVVFNFFLLINSCSVYPFY 28 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~CGf~PlY 28 (165) .+..+...++.++.+...|||+-=+ T Consensus 72 ~~~Kr~~~liPIvPL~f~~gYqyd~ 96 (144) T KOG4544 72 FHHKRLLHLIPIVPLAFFIGYQYDF 96 (144) T ss_pred HHHHCCCHHEECHHHHHHHHHEEEC T ss_conf 8770400100011467632111111 No 132 >TIGR01734 D-ala-DACP-lig D-alanine-activating enzyme; InterPro: IPR010072 This entry represents the enzyme (also called D-alanine-D-alanyl carrier protein ligase) which activates D-alanine as an adenylate via the reaction D-ala + ATP to D-ala-AMP + PPi, and further catalyses the condensation of the amino acid adenylate with the D-alanyl carrier protein (D-ala-ACP). The D-alanine is then further transferred to teichoic acid in the biosynthesis of lipoteichoic acid (LTA) and wall teichoic acid (WTA) in Gram-positive bacteria, both polysacchatides .; GO: 0016208 AMP binding, 0047473 D-alanine-poly(phosphoribitol) ligase activity, 0019350 teichoic acid biosynthetic process. Probab=23.98 E-value=50 Score=13.68 Aligned_cols=16 Identities=13% Similarity=0.011 Sum_probs=11.9 Q ss_pred HHHHHHHHHCCCCEEE Q ss_conf 9999999836851433 Q gi|254780803|r 12 VFNFFLLINSCSVYPF 27 (165) Q Consensus 12 ~~~~~~~ls~CGf~Pl 27 (165) ..++.+..+||+|-|+ T Consensus 66 ~~flg~~KsGhaYiPv 81 (513) T TIGR01734 66 VAFLGSIKSGHAYIPV 81 (513) T ss_pred HHHHHHHHCCCCCCCC T ss_conf 9999975168964343 No 133 >pfam11810 DUF3332 Domain of unknown function (DUF3332). This family of proteins are functionally uncharacterized. This family is only found in bacteria. Probab=23.73 E-value=51 Score=13.65 Aligned_cols=19 Identities=21% Similarity=0.427 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHCC Q ss_conf 7889999999999998368 Q gi|254780803|r 4 KRILRILVVFNFFLLINSC 22 (165) Q Consensus 4 ~ri~k~l~~~~~~~~ls~C 22 (165) +++++..++.+..+.++|| T Consensus 2 k~~~~v~a~~~~~~~~s~C 20 (175) T pfam11810 2 KKILAVVAALAGSTSLSGC 20 (175) T ss_pred CCHHHHHHHHHHHHHHHHH T ss_conf 1169999999999885220 No 134 >COG5633 Predicted periplasmic lipoprotein [General function prediction only] Probab=23.72 E-value=32 Score=14.87 Aligned_cols=16 Identities=25% Similarity=0.233 Sum_probs=9.9 Q ss_pred HHHHHHHHHCCCCEEE Q ss_conf 9999999836851433 Q gi|254780803|r 12 VFNFFLLINSCSVYPF 27 (165) Q Consensus 12 ~~~~~~~ls~CGf~Pl 27 (165) ++..+++|.||+=.|= T Consensus 7 ~~l~~~lLvGCsS~~~ 22 (123) T COG5633 7 LSLALLLLVGCSSHQE 22 (123) T ss_pred HHHHHHHHHCCCCCCC T ss_conf 9999999420478877 No 135 >PRK11669 pbpG D-alanyl-D-alanine endopeptidase; Provisional Probab=22.93 E-value=53 Score=13.55 Aligned_cols=19 Identities=21% Similarity=0.436 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHCCCCEEEE Q ss_conf 9999999998368514334 Q gi|254780803|r 10 LVVFNFFLLINSCSVYPFY 28 (165) Q Consensus 10 l~~~~~~~~ls~CGf~PlY 28 (165) +.++++++++++|.+.|-- T Consensus 5 ~~~~~~~~~~~~~~~~~~~ 23 (308) T PRK11669 5 VSLLSLLLLLAGVPVAPQA 23 (308) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 9999999999740036554 No 136 >PRK10461 thiamine biosynthesis lipoprotein ApbE; Provisional Probab=22.87 E-value=53 Score=13.55 Aligned_cols=18 Identities=22% Similarity=0.432 Sum_probs=11.3 Q ss_pred HHHHHHHHHHHHCCCCEE Q ss_conf 999999999983685143 Q gi|254780803|r 9 ILVVFNFFLLINSCSVYP 26 (165) Q Consensus 9 ~l~~~~~~~~ls~CGf~P 26 (165) .+.+++++++|+||+=.| T Consensus 7 ~~~ll~~~~~l~gC~~~~ 24 (351) T PRK10461 7 RVALLAAALLLVGCDQAP 24 (351) T ss_pred HHHHHHHHHHHHHHCCCC T ss_conf 999999999998724888 No 137 >TIGR02522 pilus_cpaD pilus (Caulobacter type) biogenesis lipoprotein CpaD; InterPro: IPR013361 Proteins in this entry consist of a pilus biogenesis protein, CpaD, from Caulobacter, and homologues in other bacteria, including three in the root nodule bacterium Bradyrhizobium japonicum. The molecular function of the homologues is not known.. Probab=22.67 E-value=50 Score=13.70 Aligned_cols=21 Identities=10% Similarity=0.255 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHCCCCEE-EEEC Q ss_conf 99999999983685143-3415 Q gi|254780803|r 10 LVVFNFFLLINSCSVYP-FYYL 30 (165) Q Consensus 10 l~~~~~~~~ls~CGf~P-lY~~ 30 (165) .++..+.++|.||+-++ --.. T Consensus 2 ~~~~a~~~~LagC~~~~P~~~~ 23 (211) T TIGR02522 2 VLLLALTALLAGCASTDPEIDV 23 (211) T ss_pred HHHHHHHHHHHHCCCCCCCCCC T ss_conf 7898899997530588875553 No 138 >PRK10759 hypothetical protein; Provisional Probab=22.66 E-value=50 Score=13.69 Aligned_cols=14 Identities=29% Similarity=0.778 Sum_probs=10.5 Q ss_pred HHHHHHHHHHHCCC Q ss_conf 99999999983685 Q gi|254780803|r 10 LVVFNFFLLINSCS 23 (165) Q Consensus 10 l~~~~~~~~ls~CG 23 (165) +++++.+++++||. T Consensus 3 ~~~~~~~l~~sgc~ 16 (106) T PRK10759 3 ALIVPLLLLLSGCS 16 (106) T ss_pred HHHHHHHHHHCCCH T ss_conf 39999999984423 No 139 >TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW; InterPro: IPR013360 Proteins in this entry are designated PilF and PilW . This outer membrane protein is required both for pilus stability and for pilus functions such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.. Probab=22.51 E-value=40 Score=14.25 Aligned_cols=23 Identities=13% Similarity=0.277 Sum_probs=15.7 Q ss_pred HHHHHHHHHCCCCE-EEEECCCCC Q ss_conf 99999998368514-334157767 Q gi|254780803|r 12 VFNFFLLINSCSVY-PFYYLKNQD 34 (165) Q Consensus 12 ~~~~~~~ls~CGf~-PlY~~~~~~ 34 (165) .++++++|+||+-+ |-+.+++.+ T Consensus 2 ~l~~~~~l~GC~~~~P~~q~~~~~ 25 (247) T TIGR02521 2 ALVLLLALTGCVTTPPTVQEQEVP 25 (247) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCC T ss_conf 887846785157887897787888 No 140 >PRK11397 dacD D-alanyl-D-alanine carboxypeptidase; Provisional Probab=22.41 E-value=54 Score=13.49 Aligned_cols=25 Identities=32% Similarity=0.238 Sum_probs=9.8 Q ss_pred CCHHHHHHHHHHHHHHHH-HHCCCCE Q ss_conf 915788999999999999-8368514 Q gi|254780803|r 1 MLYKRILRILVVFNFFLL-INSCSVY 25 (165) Q Consensus 1 ml~~ri~k~l~~~~~~~~-ls~CGf~ 25 (165) |+++|.+-+.++++.+.+ .+.|++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~ 26 (390) T PRK11397 1 MLLKRRLIIAASLFVFNLSSAFAAEN 26 (390) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 94378999999999999999874045 No 141 >PRK05137 tolB translocation protein TolB; Provisional Probab=22.38 E-value=54 Score=13.49 Aligned_cols=45 Identities=13% Similarity=0.234 Sum_probs=30.6 Q ss_pred EEEEEEEEEECCCCCEEEEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999998058958774257999860677615779999999999999999999999999 Q gi|254780803|r 98 TLKAKYYFKEISGKNILYENNTDVTSLFDFSDQQFSQLRSHKSSEEKAIQELSENIYIDII 158 (165) Q Consensus 98 ~~~~~y~L~~~~~~~~i~~~~~~~~~~Y~~~~~~f~~~~~e~~a~erl~~~lae~I~~~i~ 158 (165) .+.++|+|.|...++.+....+.+. .+--.+++-.+|+.|+.+|. T Consensus 117 ~~~v~f~L~DV~~~~~l~g~~~~~~----------------~~~~R~~aH~iaD~Iye~LT 161 (437) T PRK05137 117 RLKVEFRLWDVFAGQQLLGQQFFTS----------------PENWRRAAHKIADAIYERLT 161 (437) T ss_pred EEEEEEEEEECCCHHEECCEEEEEC----------------HHHHHHHHHHHHHHHHHHHC T ss_conf 4999999997464010226089966----------------89999999999999999860 No 142 >TIGR03042 PS_II_psbQ_bact photosystem II protein PsbQ. This protein through the member sll1638 from Synechocystis sp. PCC 6803, was shown to be part of the cyanobacteria photosystem II. It is homologous to (but quite diverged from) the chloroplast PsbQ protein, called oxygen-evolving enhancer protein 3 (OEE3). We designate this cyanobacteria protein PsbQ by homology. Probab=22.30 E-value=54 Score=13.48 Aligned_cols=23 Identities=26% Similarity=0.501 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 5788999999999999836851433 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSVYPF 27 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf~Pl 27 (165) .+++..+++++...| |-+|| .|- T Consensus 1 lRsi~s~~L~~~~~f-LvsC~-gp~ 23 (142) T TIGR03042 1 LRSLASLLLVLLLTF-LVSCS-GPA 23 (142) T ss_pred CHHHHHHHHHHHHHH-HHHCC-CCC T ss_conf 976999999999999-98838-998 No 143 >pfam03783 CsgG Curli production assembly/transport component CsgG. CsgG is an outer membrane-located lipoprotein that is highly resistant to protease digestion. During curli assembly, an adhesive surface fibre, CsgG is required to maintain the stability of CsgA and CsgB. Probab=21.89 E-value=55 Score=13.43 Aligned_cols=84 Identities=13% Similarity=0.037 Sum_probs=46.8 Q ss_pred CEEEEEEECCCEEECCCC---------CCCCCEEEEEEEEEEEEEECCCCCEEEEEEEEEEE-EECCCCC---------- Q ss_conf 168999710000000024---------67650489999999999980589587742579998-6067761---------- Q gi|254780803|r 71 LYQLEVNIDSFTDHAISN---------AFFKNIGRITLKAKYYFKEISGKNILYENNTDVTS-LFDFSDQ---------- 130 (165) Q Consensus 71 ~y~l~i~~~~~~~~~i~~---------~~~~~~~~~~~~~~y~L~~~~~~~~i~~~~~~~~~-~Y~~~~~---------- 130 (165) .|.+.=.+.+.-.++.+. -+........++++.+++|..+++++.+-+.+... +++.... T Consensus 89 ~~~i~G~It~~~~n~~sgg~g~~~~Gig~~~~~~~~~V~vdLRlVdv~TgeVl~s~t~~k~i~s~~~~~~~f~~g~~~~~ 168 (207) T pfam03783 89 DYLIEGSITEFGRNVSTGGGGAQYFGILGSGKYQVAYVKVDLRAVDVRTGEVLASVTGEGEASSSNREVGLFVSGFGLLA 168 (207) T ss_pred CEEEEEEEEEECCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEEECCCCEEEEEEEEEEEEEEEECCCCEEECCCCCEE T ss_conf 88999367775133223765654402235316999999999999972452799999988799997223224750354357 Q ss_pred -------HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf -------577999999999999999999999 Q gi|254780803|r 131 -------QFSQLRSHKSSEEKAIQELSENIY 154 (165) Q Consensus 131 -------~f~~~~~e~~a~erl~~~lae~I~ 154 (165) .=....|-+.+.|+++..|..+.. T Consensus 169 ~~~g~~~nep~~~Avr~aIe~aV~~li~~g~ 199 (207) T pfam03783 169 GYDGTTNNKPLGLAIREAIEKAVEELITKGW 199 (207) T ss_pred EEECCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7404115876999999999999999986625 No 144 >pfam02402 Lysis_col Lysis protein. These small bacterial proteins are required for colicin release and partial cell lysis. This family contains lysis proteins for several different forms of colicin. Bacillus subtilis lytA has been included in this family, the similarity is not highly significant, however it is also a short protein, that is involved in secretion of other proteins (Bateman A pers. obs.). Probab=21.83 E-value=30 Score=15.09 Aligned_cols=16 Identities=13% Similarity=0.434 Sum_probs=9.6 Q ss_pred HHHHHHHHHCCCCEEE Q ss_conf 9999999836851433 Q gi|254780803|r 12 VFNFFLLINSCSVYPF 27 (165) Q Consensus 12 ~~~~~~~ls~CGf~Pl 27 (165) ++...++|++|.-.-+ T Consensus 9 i~~~~~~L~aCQaNyi 24 (46) T pfam02402 9 ILLLTVLLSACQANYI 24 (46) T ss_pred HHHHHHHHHHHHHHHE T ss_conf 9999999998555041 No 145 >pfam05715 zf-piccolo Piccolo Zn-finger. This (predicted) Zinc finger is found in the bassoon and piccolo proteins. There are eight conserved cysteines, suggesting that it coordinates two zinc ligands. Probab=21.75 E-value=27 Score=15.38 Aligned_cols=10 Identities=30% Similarity=0.421 Sum_probs=7.3 Q ss_pred HHCCCCEEEE Q ss_conf 8368514334 Q gi|254780803|r 19 INSCSVYPFY 28 (165) Q Consensus 19 ls~CGf~PlY 28 (165) -+-|||.|+= T Consensus 33 CnlCGFNP~P 42 (61) T pfam05715 33 CNLCGFNPNP 42 (61) T ss_pred HCCCCCCCCC T ss_conf 4155889980 No 146 >COG4826 Serine protease inhibitor [Posttranslational modification, protein turnover, chaperones] Probab=21.54 E-value=56 Score=13.38 Aligned_cols=56 Identities=18% Similarity=0.227 Sum_probs=34.2 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHCC Q ss_conf 99999999998368514334157767202232578407774144899999997314 Q gi|254780803|r 9 ILVVFNFFLLINSCSVYPFYYLKNQDRTEYIHPIKVLVVSKNNKNEIYRSINFLTS 64 (165) Q Consensus 9 ~l~~~~~~~~ls~CGf~PlY~~~~~~~~~~l~~I~i~~~~~r~~~~i~n~L~~~~~ 64 (165) +++++.+-++..||==-.--..++.-.........|...++..++-+...|.++-+ T Consensus 3 ~~~i~~l~llc~gcie~~~v~~~~~in~dsv~dy~ia~ANNaF~FdlyS~~~~~~~ 58 (410) T COG4826 3 ILLIFLLALLCIGCIEDSAVNTKNTINTDSVEDYDIAAANNAFAFDLYSELAKQEG 58 (410) T ss_pred EEHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHCC T ss_conf 71999999997441354441443343777655333222254034789999986526 No 147 >pfam08627 CRT-like CRT-like. This region is found in proteins related to Plasmodium falciparum chloroquine resistance transporter (CRT). Probab=20.74 E-value=52 Score=13.61 Aligned_cols=11 Identities=9% Similarity=0.353 Sum_probs=5.6 Q ss_pred HHCCCCEEEEE Q ss_conf 83685143341 Q gi|254780803|r 19 INSCSVYPFYY 29 (165) Q Consensus 19 ls~CGf~PlY~ 29 (165) |..-||-|+++ T Consensus 93 LtnygyvPIf~ 103 (130) T pfam08627 93 LTNYGYVPIFG 103 (130) T ss_pred HHHCCHHHHHH T ss_conf 88427368999 No 148 >PRK04168 hypothetical protein; Provisional Probab=20.63 E-value=58 Score=13.29 Aligned_cols=21 Identities=19% Similarity=0.417 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCC Q ss_conf 578899999999999983685 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCS 23 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CG 23 (165) .++.+.+.+++.+.++++||. T Consensus 4 ~~~~~~~~~~~~~~~~~~g~~ 24 (336) T PRK04168 4 GRRKIVIVILLLLALVFLGCV 24 (336) T ss_pred CCEEHHHHHHHHHHHHHCCCC T ss_conf 532203799999999970555 No 149 >PRK09558 ushA bifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor; Reviewed Probab=20.62 E-value=59 Score=13.26 Aligned_cols=25 Identities=16% Similarity=0.345 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 5788999999999999836851433 Q gi|254780803|r 3 YKRILRILVVFNFFLLINSCSVYPF 27 (165) Q Consensus 3 ~~ri~k~l~~~~~~~~ls~CGf~Pl 27 (165) |+|+++.++.+.+.+.++.|+-.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~ 25 (551) T PRK09558 1 MMKLLKGLVALALLAALALCGSTAC 25 (551) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 9258999999999999874067664 No 150 >pfam04549 CD47 CD47 transmembrane region. This family represents the transmembrane region of CD47 leukocyte antigen. Probab=20.49 E-value=59 Score=13.25 Aligned_cols=20 Identities=35% Similarity=0.637 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHCCCCEEEEEC Q ss_conf 9999999999836851433415 Q gi|254780803|r 9 ILVVFNFFLLINSCSVYPFYYL 30 (165) Q Consensus 9 ~l~~~~~~~~ls~CGf~PlY~~ 30 (165) ++.+..+.+-+++| .|+|+. T Consensus 106 ilsvvgl~Ls~~~C--~~v~G~ 125 (156) T pfam04549 106 ILSVVGLALSLSGC--NPVYGY 125 (156) T ss_pred HHHHHHHHHHHHCC--CHHHHH T ss_conf 99999999999448--403279 No 151 >COG2859 Uncharacterized protein conserved in bacteria [Function unknown] Probab=20.41 E-value=45 Score=14.00 Aligned_cols=18 Identities=11% Similarity=0.093 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHCCCCE Q ss_conf 999999999998368514 Q gi|254780803|r 8 RILVVFNFFLLINSCSVY 25 (165) Q Consensus 8 k~l~~~~~~~~ls~CGf~ 25 (165) ..+..+.+++.|++|||- T Consensus 8 ~~~~~~ll~l~l~a~g~i 25 (237) T COG2859 8 PALATGLLSLILSASGFI 25 (237) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 789999999999988999 Done!