Query gi|254780589|ref|YP_003065002.1| hypothetical protein CLIBASIA_02380 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 239 No_of_seqs 189 out of 410 Neff 4.2 Searched_HMMs 39220 Date Sun May 29 21:38:18 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780589.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3034 Uncharacterized protei 100.0 0 0 509.7 15.9 235 1-237 1-236 (298) 2 pfam03734 YkuD L,D-transpeptid 99.6 6.4E-15 1.6E-19 112.4 8.9 103 52-168 2-115 (122) 3 PRK06132 hypothetical protein; 98.6 9.3E-07 2.4E-11 62.1 10.5 99 46-169 53-161 (365) 4 PRK10260 hypothetical protein; 98.6 4.9E-07 1.2E-11 63.8 8.4 101 54-169 101-227 (306) 5 PRK10190 hypothetical protein; 98.5 5.3E-07 1.3E-11 63.6 8.2 101 54-169 98-224 (310) 6 COG3786 Uncharacterized protei 98.4 2.1E-06 5.4E-11 59.9 9.2 105 65-169 58-209 (217) 7 COG1376 ErfK Uncharacterized p 97.8 0.00012 3.1E-09 49.0 7.4 108 56-169 95-222 (232) 8 PRK12472 hypothetical protein; 97.7 0.00037 9.4E-09 46.1 8.5 103 47-169 52-159 (512) 9 pfam10908 DUF2778 Protein of u 93.2 0.18 4.6E-06 29.5 4.8 81 92-181 19-118 (121) 10 TIGR03352 VI_chp_3 type VI sec 87.1 0.29 7.5E-06 28.2 1.4 53 4-57 2-54 (146) 11 PRK10594 hypothetical protein; 86.4 2.3 6E-05 22.6 5.7 120 52-186 375-577 (615) 12 PRK12696 flgH flagellar basal 85.2 0.79 2E-05 25.5 2.8 23 1-24 1-23 (238) 13 TIGR02522 pilus_cpaD pilus (Ca 83.1 2.7 7E-05 22.2 4.8 49 7-69 2-51 (211) 14 COG2989 Uncharacterized protei 82.7 2.3 5.9E-05 22.6 4.3 48 138-186 449-523 (561) 15 PRK09967 putative outer membra 82.6 2.2 5.6E-05 22.8 4.2 127 1-184 1-132 (160) 16 COG3521 Predicted component of 81.9 1.3 3.2E-05 24.3 2.7 99 1-118 1-112 (159) 17 PRK09578 periplasmic multidrug 80.7 4.1 0.0001 21.1 5.0 91 2-109 6-96 (385) 18 PRK12407 flgH flagellar basal 76.2 2 5.1E-05 23.1 2.3 20 4-23 1-20 (220) 19 PRK09859 multidrug efflux syst 75.7 5.1 0.00013 20.5 4.3 92 1-109 1-94 (385) 20 COG2134 Cdh CDP-diacylglycerol 74.8 7.1 0.00018 19.6 5.4 136 1-150 1-148 (252) 21 COG3065 Slp Starvation-inducib 74.5 4.6 0.00012 20.8 3.8 65 1-66 5-77 (191) 22 TIGR00752 slp outer membrane l 72.4 8.2 0.00021 19.3 4.7 63 5-67 4-74 (190) 23 pfam12262 Lipase_bact_N Bacter 72.3 3.7 9.4E-05 21.4 2.8 24 1-24 1-24 (269) 24 PRK11548 hypothetical protein; 69.6 9.4 0.00024 18.9 7.9 87 1-90 1-99 (113) 25 TIGR02747 TraV type IV conjuga 66.2 3.4 8.8E-05 21.6 1.6 22 1-22 1-23 (174) 26 COG5633 Predicted periplasmic 65.4 11 0.00029 18.4 5.4 80 5-90 4-83 (123) 27 pfam02402 Lysis_col Lysis prot 64.2 2.9 7.4E-05 22.0 1.0 22 1-22 1-22 (46) 28 PRK10760 murein hydrolase B; P 62.9 5.5 0.00014 20.3 2.2 22 3-24 1-22 (357) 29 pfam06474 MLTD_N MLTD_N. 61.9 8.4 0.00021 19.2 3.0 19 12-30 9-27 (93) 30 TIGR02544 III_secr_YscJ type I 60.3 10 0.00026 18.6 3.2 45 4-48 2-46 (203) 31 PRK10533 putative lipoprotein; 60.2 6.5 0.00017 19.9 2.2 23 1-23 1-23 (171) 32 PRK00022 lolB outer membrane l 55.6 17 0.00043 17.3 6.6 94 4-108 3-103 (203) 33 PRK10866 outer membrane protei 55.2 8.7 0.00022 19.1 2.1 22 2-23 3-24 (243) 34 TIGR02722 lp_ uncharacterized 54.5 12 0.00031 18.2 2.7 27 6-32 6-32 (215) 35 TIGR03502 lipase_Pla1_cef extr 54.2 12 0.00031 18.2 2.7 21 1-21 1-21 (792) 36 PRK11627 hypothetical protein; 54.0 9.3 0.00024 18.9 2.1 23 1-24 1-23 (192) 37 PRK10781 rcsF outer membrane l 54.0 9.8 0.00025 18.8 2.2 20 3-23 1-20 (133) 38 PRK06052 5-methyltetrahydropte 53.6 18 0.00047 17.1 5.0 35 93-131 103-138 (343) 39 PRK02944 OxaA-like protein pre 52.7 15 0.00038 17.6 3.0 22 1-22 1-24 (255) 40 PRK03095 prsA peptidylprolyl i 52.4 11 0.00029 18.4 2.4 23 1-23 1-23 (287) 41 PRK13835 conjugal transfer pro 52.4 13 0.00034 17.9 2.7 24 3-26 1-24 (144) 42 TIGR03302 OM_YfiO outer membra 49.9 20 0.00052 16.8 3.3 20 4-23 2-21 (235) 43 pfam07901 DUF1672 Protein of u 49.3 17 0.00042 17.4 2.8 21 1-22 1-21 (304) 44 PRK09810 entericidin A; Provis 47.3 15 0.00038 17.6 2.3 25 1-25 1-25 (41) 45 TIGR01433 CyoA ubiquinol oxida 47.1 16 0.00042 17.4 2.5 30 7-36 1-30 (228) 46 PRK09837 copper/silver efflux 47.0 22 0.00055 16.7 3.1 32 4-36 3-34 (460) 47 pfam10543 ORF6N ORF6N domain. 46.1 10 0.00026 18.7 1.3 52 150-202 8-63 (87) 48 PRK10523 lipoprotein involved 46.1 24 0.00061 16.4 6.3 73 1-82 2-77 (234) 49 COG4851 CamS Protein involved 45.6 15 0.00039 17.6 2.1 23 1-23 1-23 (382) 50 PRK04792 tolB translocation pr 43.8 23 0.00059 16.5 2.8 21 1-21 1-21 (450) 51 TIGR02607 antidote_HigA addict 43.2 18 0.00046 17.1 2.2 29 190-219 52-80 (81) 52 TIGR00668 apaH bis(5'-nucleosy 42.7 7.3 0.00019 19.6 0.1 33 71-110 214-246 (280) 53 PRK10449 heat-inducible protei 42.3 18 0.00045 17.2 2.0 35 137-174 77-112 (140) 54 KOG3935 consensus 41.8 10 0.00026 18.6 0.8 19 197-215 409-427 (446) 55 pfam08139 LPAM_1 Prokaryotic m 41.5 19 0.00049 17.0 2.1 20 4-23 2-21 (26) 56 PRK01326 prsA foldase protein 41.3 20 0.00052 16.8 2.2 23 1-23 1-25 (310) 57 PRK13731 conjugal transfer sur 41.1 29 0.00073 15.9 3.9 98 1-127 3-101 (243) 58 PRK00249 flgH flagellar basal 41.1 25 0.00065 16.2 2.7 23 1-23 1-23 (231) 59 COG2980 RlpB Rare lipoprotein 39.8 30 0.00077 15.8 5.9 79 1-79 1-100 (178) 60 PRK12698 flgH flagellar basal 39.4 24 0.0006 16.4 2.3 11 137-147 148-158 (224) 61 COG3056 Uncharacterized lipopr 38.9 22 0.00056 16.6 2.1 18 148-165 162-179 (204) 62 PRK11063 metQ DL-methionine tr 38.5 27 0.00068 16.1 2.4 24 1-24 3-28 (271) 63 COG4314 NosL Predicted lipopro 38.3 27 0.0007 16.0 2.5 21 3-23 2-22 (176) 64 COG5645 Predicted periplasmic 38.1 17 0.00044 17.2 1.5 28 3-30 1-28 (80) 65 TIGR02733 desat_CrtD C-3',4' d 38.0 18 0.00046 17.1 1.5 70 33-111 305-381 (499) 66 PRK11023 hypothetical protein; 37.7 25 0.00064 16.2 2.2 21 1-21 1-21 (191) 67 PRK11459 multidrug resistance 37.6 32 0.00083 15.6 3.0 22 1-22 1-25 (478) 68 COG3093 VapI Plasmid maintenan 37.5 26 0.00066 16.2 2.3 30 192-222 57-86 (104) 69 KOG3135 consensus 37.4 25 0.00063 16.3 2.2 54 100-157 107-173 (203) 70 KOG3413 consensus 37.4 30 0.00076 15.8 2.6 59 78-146 72-130 (156) 71 pfam05590 DUF769 Xylella fasti 36.7 32 0.00081 15.6 2.6 31 3-33 5-35 (279) 72 pfam08085 Entericidin Enterici 36.6 25 0.00064 16.2 2.1 23 1-23 1-25 (42) 73 COG3417 FlgN Collagen-binding 36.5 32 0.0008 15.6 2.6 19 3-21 4-22 (200) 74 PRK03002 prsA peptidylprolyl i 35.5 28 0.00072 15.9 2.2 22 1-22 1-24 (285) 75 pfam04288 MukE MukE-like famil 35.4 16 0.00041 17.4 1.0 104 98-213 57-201 (227) 76 pfam07269 consensus 35.0 29 0.00073 15.9 2.2 20 4-25 2-21 (55) 77 TIGR02887 spore_ger_x_C germin 34.6 28 0.00071 16.0 2.0 20 4-23 3-22 (400) 78 PRK05256 condesin subunit E; P 34.3 17 0.00045 17.2 1.0 57 156-213 142-208 (230) 79 COG3133 SlyB Outer membrane li 33.9 33 0.00084 15.5 2.3 26 1-26 1-26 (154) 80 COG4669 EscJ Type III secretor 33.7 38 0.00096 15.2 2.9 68 120-196 127-202 (246) 81 PRK11138 outer membrane protei 32.9 39 0.00099 15.1 2.7 23 1-23 1-24 (394) 82 TIGR02422 protocat_beta protoc 32.7 14 0.00035 17.9 0.2 76 28-108 40-127 (224) 83 PRK11397 dacD D-alanyl-D-alani 32.1 40 0.001 15.0 3.6 14 1-14 3-16 (390) 84 TIGR03431 PhnD phosphonate ABC 31.9 33 0.00084 15.5 2.1 22 1-22 1-22 (288) 85 COG5461 Type IV pili component 31.0 42 0.0011 14.9 4.1 45 7-65 11-55 (224) 86 PRK02998 prsA peptidylprolyl i 30.8 42 0.0011 14.9 3.2 26 1-26 1-29 (283) 87 COG2143 Thioredoxin-related pr 30.7 42 0.0011 14.9 4.3 51 1-59 1-51 (182) 88 PRK02201 putative inner membra 30.7 40 0.001 15.0 2.3 19 4-22 37-55 (392) 89 COG3095 MukE Uncharacterized p 30.6 22 0.00055 16.6 0.9 14 98-111 64-77 (238) 90 smart00095 TR_THY Transthyreti 30.5 42 0.0011 14.8 5.8 44 63-109 24-67 (121) 91 PRK10965 multicopper oxidase; 30.0 43 0.0011 14.8 4.3 19 1-19 1-19 (530) 92 PRK10796 LPS-assembly lipoprot 29.8 44 0.0011 14.8 7.5 78 1-79 1-103 (196) 93 PRK12701 flgH flagellar basal 29.5 44 0.0011 14.7 3.2 22 1-22 1-22 (230) 94 pfam07615 Ykof YKOF-related Fa 29.2 42 0.0011 14.9 2.2 60 153-225 9-71 (81) 95 PRK09915 putative outer membra 29.2 45 0.0011 14.7 2.9 23 1-23 6-28 (488) 96 PRK11189 lipoprotein NlpI; Pro 28.9 45 0.0012 14.7 3.5 40 1-40 1-44 (297) 97 PRK10783 mltD membrane-bound l 28.6 44 0.0011 14.8 2.2 22 1-25 1-22 (449) 98 pfam09476 Pilus_CpaD Pilus bio 28.5 46 0.0012 14.6 2.8 42 9-66 2-43 (200) 99 PRK04405 prsA peptidylprolyl i 28.5 36 0.00092 15.3 1.8 23 1-23 5-28 (298) 100 KOG0749 consensus 28.4 22 0.00056 16.6 0.7 64 15-82 16-86 (298) 101 PRK13859 type IV secretion sys 28.0 43 0.0011 14.8 2.1 23 4-28 2-24 (55) 102 TIGR03525 GldK gliding motilit 27.8 43 0.0011 14.8 2.0 19 4-22 2-20 (449) 103 COG5510 Predicted small secret 27.7 37 0.00094 15.2 1.7 23 1-23 1-26 (44) 104 pfam00820 Lipoprotein_1 Borrel 27.7 47 0.0012 14.6 4.9 58 4-82 2-60 (273) 105 TIGR03112 6_pyr_pter_rel 6-pyr 27.4 48 0.0012 14.5 4.2 61 143-211 22-89 (113) 106 PRK11251 DNA-binding transcrip 27.3 45 0.0011 14.7 2.1 33 11-48 10-42 (109) 107 TIGR01819 F420_cofD LPPG:Fo 2- 26.9 23 0.00059 16.5 0.6 50 85-147 4-55 (359) 108 pfam06604 consensus 26.8 49 0.0013 14.5 2.9 18 6-23 8-25 (181) 109 PRK13883 conjugal transfer pro 26.4 50 0.0013 14.4 3.1 31 104-136 112-145 (156) 110 PRK10672 rare lipoprotein A; P 26.4 50 0.0013 14.4 2.8 22 1-23 1-22 (369) 111 TIGR00784 citMHS citrate trans 26.0 16 0.00042 17.4 -0.3 16 118-133 268-283 (439) 112 PRK12788 flgH flagellar basal 25.9 51 0.0013 14.3 2.6 19 3-22 1-19 (231) 113 PRK13733 conjugal transfer pro 25.5 11 0.00027 18.6 -1.4 25 1-25 1-25 (171) 114 KOG4824 consensus 25.0 53 0.0014 14.3 2.1 35 1-35 6-40 (224) 115 PRK13616 lipoprotein LpqB; Pro 24.9 39 0.00099 15.1 1.4 20 2-21 5-24 (590) 116 TIGR03659 IsdE heme ABC transp 24.7 54 0.0014 14.2 2.8 19 6-24 6-24 (289) 117 KOG0884 consensus 24.3 42 0.0011 14.9 1.5 24 134-159 59-93 (161) 118 PRK10510 putative outer membra 24.3 55 0.0014 14.2 2.3 49 134-182 139-187 (219) 119 PRK13736 conjugal transfer pro 24.2 55 0.0014 14.2 2.2 23 150-172 128-150 (245) 120 TIGR00693 thiE thiamine-phosph 24.0 56 0.0014 14.1 4.9 59 152-224 2-71 (210) 121 TIGR00212 hemC porphobilinogen 23.7 38 0.00096 15.2 1.2 19 92-110 236-254 (312) 122 PRK02463 OxaA-like protein pre 23.2 58 0.0015 14.0 2.6 16 8-23 13-28 (307) 123 TIGR02824 quinone_pig3 putativ 22.7 21 0.00053 16.8 -0.3 74 99-180 112-196 (334) 124 TIGR01626 ytfJ_HI0045 conserve 22.6 59 0.0015 14.0 2.3 53 1-67 1-53 (184) 125 KOG2100 consensus 22.5 59 0.0015 14.0 3.7 37 140-186 685-722 (755) 126 KOG2292 consensus 22.3 23 0.00059 16.5 -0.2 51 86-137 624-687 (751) 127 COG3009 Uncharacterized protei 22.2 60 0.0015 13.9 2.0 15 8-22 5-19 (190) 128 pfam07172 GRP Glycine rich pro 22.0 35 0.00089 15.4 0.7 12 1-12 1-12 (91) 129 pfam02157 Man-6-P_recep Cation 21.6 38 0.00098 15.1 0.9 133 75-219 71-248 (278) 130 COG4939 Major membrane immunog 21.6 62 0.0016 13.8 1.9 23 1-23 1-23 (147) 131 pfam08345 YscJ_FliF_C Flagella 21.5 62 0.0016 13.8 3.2 34 153-186 127-162 (162) 132 PRK09250 fructose-bisphosphate 21.0 64 0.0016 13.8 3.2 54 124-185 138-201 (348) 133 PRK12389 glutamate-1-semialdeh 20.8 64 0.0016 13.7 3.0 12 150-161 276-287 (429) 134 COG2605 Predicted kinase relat 20.7 58 0.0015 14.0 1.6 35 154-188 262-303 (333) 135 PRK13908 putative recombinatio 20.5 45 0.0012 14.7 1.0 32 139-170 157-203 (204) 136 pfam05643 DUF799 Putative bact 20.3 66 0.0017 13.7 2.1 19 4-22 2-20 (215) 137 COG3317 NlpB Uncharacterized l 20.3 66 0.0017 13.7 2.5 21 2-22 3-23 (342) 138 PRK12450 foldase protein PrsA; 20.1 57 0.0015 14.0 1.5 12 11-22 15-26 (309) 139 TIGR03300 assembly_YfgL outer 20.1 67 0.0017 13.6 3.5 18 5-22 2-19 (377) No 1 >COG3034 Uncharacterized protein conserved in bacteria [Function unknown] Probab=100.00 E-value=0 Score=509.73 Aligned_cols=235 Identities=41% Similarity=0.682 Sum_probs=218.9 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEECCCCCEEEEEE Q ss_conf 94046899999999860466533-22234446755456777775187757847999983786999986078896689998 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSR-SLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEIWKRNVDAEYVLLKE 79 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~-~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~~~~~~g~~~l~kt 79 (239) |....+.++..++.+++.|.... ....+...+.+..+.+.++.+++..+++++|||||+|++|||| .+.++.|+++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~c~~~gl~~~~~~~~~~s~~l~~k~~~k~~~~~~~~lv~i~K~e~~Lel~-~~~~~~yk~~~s 79 (298) T COG3034 1 MLFVLLKAALILAELLALCLSQGLLLVGKDKYFWSSELPVKNEAKGLYREKVVLVRIFKEERKLELY-EKDGNIYKLLRS 79 (298) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCHHCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCEEHH-HCCCCHHHHHHC T ss_conf 9177899999999999999860211104664332088888654120245685699973256632021-147853123220 Q ss_pred EEECCCCCCCCCHHCCCCCCCCCEEEEEEECCCCCCCCCCEEEECCCCCHHHHHCCCCCCCEEECCCCCCCCCEEECHHH Q ss_conf 65213238988701017874750799984012587576521420799898899729998826871677866718706466 Q gi|254780589|r 80 YKICAWSGTFGPKIETGDEQAPEGFYYIGWNNLNPNSKYFLSINIGFPNEFDKAHNRTGADLMIHGECASAGCYAMNNKQ 159 (239) Q Consensus 80 Y~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~~~~np~S~y~ls~~i~YPN~~Dr~~grtG~~I~IHG~~~S~GC~amtd~~ 159 (239) |+||+|||++|||.+|||+|||||||.|+++++||||.||+|++|+|||++|+++||||++|||||.|+|.||++|||++ T Consensus 80 y~i~~~sg~lGpK~~EGD~ktPEGfY~l~~~~~~pds~Y~~s~~i~YPN~yD~a~grtG~~ImIHG~~~s~g~y~~~~a~ 159 (298) T COG3034 80 YKICLWSGKLGPKQREGDLKTPEGFYRLTRKQLNPDSYYYLSINIGYPNAYDKALGRTGGGIMIHGACLSDGCYSMTDAQ 159 (298) T ss_pred CCEEECCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCHHHEEEEECCCCHHHHHCCCCCCCEEEECCCCCCCCCCHHHHH T ss_conf 43343148888622236778984478831123687551123675048986788628788825994265788855105666 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCEEEECCCC Q ss_conf 999999999999750856111699555689889875425867378985356888999858973279727668944888 Q gi|254780589|r 160 MQEIYAIVRDSLRGNMQSHIQIQAFPFRMTSKNMQLYQNNPNYSFWNMLKLGHDYFEKNHQEPFIQIINKQYVFFKGQ 237 (239) Q Consensus 160 i~EIy~~v~~a~~~~gQ~~~~v~~~Pfrmt~~n~~~~~~~~~~~FW~~Lk~gyd~Fe~~~~~p~v~v~~~~Yvf~~~~ 237 (239) |+|||.++++||.+ ||..++|+++|||||.+||.+|+++++++||++||+|||+||.|+.||.|+||+|||||+.+. T Consensus 160 i~~i~~~~a~a~~~-~q~~~~~~~~~f~m~~~nm~~~~~~~~~~fw~~lk~gyd~fe~t~~~p~v~v~~~~yv~~~~~ 236 (298) T COG3034 160 IDEIYGCVAKALQG-GQESVQVVADPFRMTIENMVRHRLSPLFSFWKALKPGYDNFEVTFYPPTVSVCDGRYVFSFPT 236 (298) T ss_pred HHHHHHHHHHHCCC-CCEEEEEEECCCCCCHHHHHHHHHCHHHHHHHHHCCCCCCEEEEECCCCEEECCCEEEECCCC T ss_conf 88898899997136-764689874461366789998751526589998344563001340599646437678844544 No 2 >pfam03734 YkuD L,D-transpeptidase catalytic domain. This family of proteins are found in a range of bacteria. It has been shown that this domain can act as an L,D-transpeptidase that gives rise to an alternative pathway for peptidoglycan cross-linking. This gives bacteria resistance to beta-lactam antibiotics that inhibit PBPs which usually carry out the cross-linking reaction. The conserved region contains a conserved histidine and cysteine, with the cysteine thought to be an active site residue. Several members of this family contain peptidoglycan binding domains. The molecular structure of YkuD protein shows this domain has a novel tertiary fold consisting of a beta-sandwich with two mixed sheets, one containing five strands and the other, six strands. The two beta-sheets form a cradle capped by an alpha-helix. This family was formerly called the ErfK/YbiS/YcfS/YnhG family, but is now named after the first protein of known structure. Probab=99.59 E-value=6.4e-15 Score=112.42 Aligned_cols=103 Identities=25% Similarity=0.354 Sum_probs=87.5 Q ss_pred EEEEEECCCCEE-EEEEECCCCCEEEEEEEEECCCCCCCCCHHCCCCCCCCCEEEEEEECCCCCCCCCCEEEECCCCCHH Q ss_conf 799998378699-9986078896689998652132389887010178747507999840125875765214207998988 Q gi|254780589|r 52 TVIRIFKNENIL-EIWKRNVDAEYVLLKEYKICAWSGTFGPKIETGDEQAPEGFYYIGWNNLNPNSKYFLSINIGFPNEF 130 (239) Q Consensus 52 i~Irv~K~e~~L-el~~~~~~g~~~l~ktY~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~~~~np~S~y~ls~~i~YPN~~ 130 (239) ..|.|.++++.| .+|..+ +++++|+|+.| ++|.+||+|.|.|..+..+|.+..+.+....|++.. T Consensus 2 ~~I~Vd~~~~~l~~~~~~g-----~~v~~~~vs~G---------~~~~~TP~G~~~i~~k~~~~~~~~~~~~~~~~~~~~ 67 (122) T pfam03734 2 RVIVVDLSEQRLLLLYENG-----KLVLTYPVSVG---------RGDTPTPLGTFTITEKVENPTWAPGPGNGLGYVKFL 67 (122) T ss_pred CEEEEECCCCEEEEEEECC-----EEEEEEEEEEC---------CCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCC T ss_conf 2999989438999997599-----99999778858---------899988665699899875884578675554687655 Q ss_pred HHHCCCCCCCEEECCC----------CCCCCCEEECHHHHHHHHHHHH Q ss_conf 9972999882687167----------7866718706466999999999 Q gi|254780589|r 131 DKAHNRTGADLMIHGE----------CASAGCYAMNNKQMQEIYAIVR 168 (239) Q Consensus 131 Dr~~grtG~~I~IHG~----------~~S~GC~amtd~~i~EIy~~v~ 168 (239) .......+++|+|||+ +.|+|||+|.|+++++||..+. T Consensus 68 ~~~~~~~~~~i~iH~~~~~~~~~~g~~~ShGCIrl~~~d~~~l~~~v~ 115 (122) T pfam03734 68 DPWAFPNGGGIYIHGTGTPDLFSGGAPRSHGCIRLSNEDAKELYDWVL 115 (122) T ss_pred CEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 246872798438966888842247984677601579999999997099 No 3 >PRK06132 hypothetical protein; Provisional Probab=98.58 E-value=9.3e-07 Score=62.10 Aligned_cols=99 Identities=21% Similarity=0.374 Sum_probs=75.5 Q ss_pred CCCCCCEEEEEECCCCEEEEEEECCCCCEEEEEEEEECCCCCCCCCHHCCCCCCCCCEEEEEEECCCCCCCCCCEEEECC Q ss_conf 77578479999837869999860788966899986521323898870101787475079998401258757652142079 Q gi|254780589|r 46 TSPFHPTVIRIFKNENILEIWKRNVDAEYVLLKEYKICAWSGTFGPKIETGDEQAPEGFYYIGWNNLNPNSKYFLSINIG 125 (239) Q Consensus 46 ~~~~~pi~Irv~K~e~~Lel~~~~~~g~~~l~ktY~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~~~~np~S~y~ls~~i~ 125 (239) ..+..|+.|.|..+++.+.+|..+ .++.+-+|+. |.-| .-||.|.|.|..+..+-.|+.| T Consensus 53 ~~p~gp~~i~vsl~~Q~~~~y~~~-----~~i~~s~vst--G~~g-------~~TP~G~F~i~~K~~~h~S~~Y------ 112 (365) T PRK06132 53 RKPQGPLVIVVSLTEQRLYVYDNG-----ILIAVSTVST--GKRG-------HETPTGVFSILQKDKDHRSNIY------ 112 (365) T ss_pred CCCCCCEEEEEECCCEEEEEEECC-----EEEEEEEECC--CCCC-------CCCCCEEEEEEECCCCCCCCCC------ T ss_conf 789888899998320089999899-----8999988425--8888-------8788616788870241235665------ Q ss_pred CCCHHHHH-----CCCCCCCEEECCCC-----CCCCCEEECHHHHHHHHHHHHH Q ss_conf 98988997-----29998826871677-----8667187064669999999999 Q gi|254780589|r 126 FPNEFDKA-----HNRTGADLMIHGEC-----ASAGCYAMNNKQMQEIYAIVRD 169 (239) Q Consensus 126 YPN~~Dr~-----~grtG~~I~IHG~~-----~S~GC~amtd~~i~EIy~~v~~ 169 (239) |-+ .--|-++|.+||.. .|+|||.|-.+--+-+|.+.+- T Consensus 113 -----~~ApMP~~~rlt~~GialH~g~lpgypaSHGCirlP~~fA~~lf~~t~~ 161 (365) T PRK06132 113 -----SNAPMPYMQRLTWSGIALHAGNLPGYPASHGCVRLPMAFAKKLFGWTRM 161 (365) T ss_pred -----CCCCCCCEEEECCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHCCCCC T ss_conf -----7888864798516637971366799877676456978999998551458 No 4 >PRK10260 hypothetical protein; Provisional Probab=98.55 E-value=4.9e-07 Score=63.83 Aligned_cols=101 Identities=20% Similarity=0.277 Sum_probs=69.6 Q ss_pred EEEECCCCEEEEEEECCCCCEEEEEEEEECCCCCCCCCHHCCCCCCCCCE-EEEEEECCCCCCCCCC--EEEE------- Q ss_conf 99983786999986078896689998652132389887010178747507-9998401258757652--1420------- Q gi|254780589|r 54 IRIFKNENILEIWKRNVDAEYVLLKEYKICAWSGTFGPKIETGDEQAPEG-FYYIGWNNLNPNSKYF--LSIN------- 123 (239) Q Consensus 54 Irv~K~e~~Lel~~~~~~g~~~l~ktY~Ic~~sg~lGpK~~eGD~kTPEG-~Y~i~~~~~np~S~y~--ls~~------- 123 (239) |.|--.|.+|.-|-.+.+ .+-+|||.. |..| ..||.| ...|+.+..|| .|+ -|+. T Consensus 101 IVINlaEmRLYYfp~~~~----~V~t~PVGI--Gr~g-------~~TPlg~~t~I~~K~~~P--tW~pp~sir~e~~~~G 165 (306) T PRK10260 101 IVINSAEMRLYYYPKGTN----TVIVLPIGI--GQLG-------KDTPINWTTKVERKKAGP--TWTPTAKMHAEYRAAG 165 (306) T ss_pred EEEECHHHEEEEECCCCC----EEEEEEEEC--CCCC-------CCCCCCCEEEEEECCCCC--CCCCCHHHHHHHHHCC T ss_conf 698532424478438998----699982443--6688-------878876417999725799--8889878889999729 Q ss_pred C----CCC----CHHHHHCCCCCCCEEECCCC--------CCCCCEEECHHHHHHHHHHHHH Q ss_conf 7----998----98899729998826871677--------8667187064669999999999 Q gi|254780589|r 124 I----GFP----NEFDKAHNRTGADLMIHGEC--------ASAGCYAMNNKQMQEIYAIVRD 169 (239) Q Consensus 124 i----~YP----N~~Dr~~grtG~~I~IHG~~--------~S~GC~amtd~~i~EIy~~v~~ 169 (239) . -.| |+.-...=+.|....|||+. +|+|||.|.+++|+++|..|.- T Consensus 166 ~~LP~vvPpGPdNPLG~~Al~l~~~YlIHGTN~p~gIG~rvS~GCIRl~peDI~~Lf~~V~v 227 (306) T PRK10260 166 EPLPAVVPAGPDNPMGLYALYIGRLYAIHGTNANFGIGLRVSHGCVRLRNEDIKFLFEKVPV 227 (306) T ss_pred CCCCCCCCCCCCCCHHHHHHHHCCCEEEECCCCCCCCCCCCCCCEECCCHHHHHHHHHCCCC T ss_conf 98876689999896646777627840577789987513201687512375719999950999 No 5 >PRK10190 hypothetical protein; Provisional Probab=98.53 E-value=5.3e-07 Score=63.60 Aligned_cols=101 Identities=19% Similarity=0.284 Sum_probs=69.2 Q ss_pred EEEECCCCEEEEEEECCCCCEEEEEEEEECCCCCCCCCHHCCCCCCCCCE-EEEEEECCCCCCCCCC--EEE-------E Q ss_conf 99983786999986078896689998652132389887010178747507-9998401258757652--142-------0 Q gi|254780589|r 54 IRIFKNENILEIWKRNVDAEYVLLKEYKICAWSGTFGPKIETGDEQAPEG-FYYIGWNNLNPNSKYF--LSI-------N 123 (239) Q Consensus 54 Irv~K~e~~Lel~~~~~~g~~~l~ktY~Ic~~sg~lGpK~~eGD~kTPEG-~Y~i~~~~~np~S~y~--ls~-------~ 123 (239) |.|--.|.+|..|..+.+ .+.+|||.. |..| ..||+| +-.|..+.-|| .|+ -|+ + T Consensus 98 IVINlaEmRLYYfp~~~~----~V~t~PIGI--Gr~g-------~~TP~g~~t~i~~K~~nP--tW~Pp~sir~e~~~~G 162 (310) T PRK10190 98 IVVNVAEMRLYYYPPDSN----TVEVFPIGI--GQAG-------RETPRNWVTTVERKQEAP--TWTPTPNTRREYAKRG 162 (310) T ss_pred EEEEHHHHEEEEECCCCC----EEEEEEEEC--CCCC-------CCCCCCCEEEEEECCCCC--CCCCCHHHHHHHHHCC T ss_conf 487603502378559998----799971464--6588-------878887548998625699--8779777889999719 Q ss_pred CC----CC----CHHHHHCCCCCCCEEECCCC--------CCCCCEEECHHHHHHHHHHHHH Q ss_conf 79----98----98899729998826871677--------8667187064669999999999 Q gi|254780589|r 124 IG----FP----NEFDKAHNRTGADLMIHGEC--------ASAGCYAMNNKQMQEIYAIVRD 169 (239) Q Consensus 124 i~----YP----N~~Dr~~grtG~~I~IHG~~--------~S~GC~amtd~~i~EIy~~v~~ 169 (239) .. .| |+.-...=+.|....|||+. +|+|||.|.+++|+++|..|.- T Consensus 163 ~~LP~vvP~GPdNPLG~~Al~l~~~YlIHGTN~p~gIG~rvS~GCIRl~peDI~~Lf~~V~v 224 (310) T PRK10190 163 ESLPAFVPAGPDNPMGLYAIYIGRLYAIHGTNANFGIGLRVSQGCIRLRNDDIKYLFDNVPV 224 (310) T ss_pred CCCCCCCCCCCCCCHHHHHHHHCCCEEEECCCCCCCCCCHHCCCCCCCCHHHHHHHHHCCCC T ss_conf 98877779999895777888627860687889997502110687514487869999842999 No 6 >COG3786 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.44 E-value=2.1e-06 Score=59.90 Aligned_cols=105 Identities=17% Similarity=0.306 Sum_probs=71.3 Q ss_pred EEEECCCCCEEEEEEEEECCCCCCCCCHHCCCCCCCCCEEEEEEECC---CCCCC--------------------CCCEE Q ss_conf 98607889668999865213238988701017874750799984012---58757--------------------65214 Q gi|254780589|r 65 IWKRNVDAEYVLLKEYKICAWSGTFGPKIETGDEQAPEGFYYIGWNN---LNPNS--------------------KYFLS 121 (239) Q Consensus 65 l~~~~~~g~~~l~ktY~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~~~---~np~S--------------------~y~ls 121 (239) .+++++.+-+.+-.+++-..|+++.-++.+|||.+||+|.|.+.... -||.+ .|... T Consensus 58 ~~~k~~a~~~~~~~~~~~aiGk~G~s~~krEGd~aTP~gv~rll~~fg~~pnpgT~Lp~~~v~p~~~Wc~~~~dp~YNr~ 137 (217) T COG3786 58 PYQKTRAGWQPLKRVFTCAIGKAGISPAKREGDGATPLGVYRLLSGFGQAPNPGTKLPYTAVGPNDGWCDAPNDPNYNRP 137 (217) T ss_pred CCCCCCCCEEECCEEEEEEECCCCCCHHHCCCCCCCCCEEEEEEEEEEECCCCCCCCEEEECCCCCCCCCCCCCCCCCCC T ss_conf 43416532034124677665266657132058999961367774334504799764414742788603079998566887 Q ss_pred EECCCCCHHHHHC-------------------C-CCCCCEEECCCC----CCCCCEEECHHHHHHHHHHHHH Q ss_conf 2079989889972-------------------9-998826871677----8667187064669999999999 Q gi|254780589|r 122 INIGFPNEFDKAH-------------------N-RTGADLMIHGEC----ASAGCYAMNNKQMQEIYAIVRD 169 (239) Q Consensus 122 ~~i~YPN~~Dr~~-------------------g-rtG~~I~IHG~~----~S~GC~amtd~~i~EIy~~v~~ 169 (239) -.+.|+-..++.. - -.||.|+.|=.. -|.||+|+++..+.+|-...+. T Consensus 138 v~~pf~as~eem~r~d~~Yd~~~vm~~N~~~~~~G~GSAIFfHlarpgy~pT~GCVAi~~a~l~~iL~~~~~ 209 (217) T COG3786 138 VECPFSASHEEMLRIDSLYDYCVVMDWNIAPRKRGKGSAIFFHLARPGYEPTEGCVAISAATLVRILRWLRP 209 (217) T ss_pred CCCCCCCCHHHHCCCCCCEEEEEEEECCCCCCCCCCCCEEEEEECCCCCCCCCCEEEECHHHHHHHHHHCCC T ss_conf 547788467886147560237999741457777897216999964899999776576368879999986498 No 7 >COG1376 ErfK Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.78 E-value=0.00012 Score=48.99 Aligned_cols=108 Identities=15% Similarity=0.170 Sum_probs=63.6 Q ss_pred EECCCCEEEEEEECCCCCEEEEEEEEECCCCCCCC---C----HHCCCCCCCCCEEEEEEECCC-CCCCCCCEEEECCCC Q ss_conf 98378699998607889668999865213238988---7----010178747507999840125-875765214207998 Q gi|254780589|r 56 IFKNENILEIWKRNVDAEYVLLKEYKICAWSGTFG---P----KIETGDEQAPEGFYYIGWNNL-NPNSKYFLSINIGFP 127 (239) Q Consensus 56 v~K~e~~Lel~~~~~~g~~~l~ktY~Ic~~sg~lG---p----K~~eGD~kTPEG~Y~i~~~~~-np~S~y~ls~~i~YP 127 (239) +++.+.++..+....+ ....|+|..+...+. . -.+++..-||.|.|.+..+.- ...+.. -+.-+.. T Consensus 95 ~~~~~~~~~~~~~~~~----~~~~~~v~~g~~~~~~~~~~~vs~~~~~p~~Tp~g~~~~~~~~~~~~~~~~--~~~~~p~ 168 (232) T COG1376 95 VVDTGLRLLYLVDDSG----TAQRYPVGIGKEGLDWPGTAKVSRGKEGPTWTPTGEFIVREKKGGFYMPNS--GVPPGPN 168 (232) T ss_pred ECCCCCEEEEEECCCC----CEEEEEEEECCCCCCCCCEEEEECCCCCCCCCCCCEEEEECCCCCCCCCCC--CCCCCCC T ss_conf 5167635899835888----169998875872320253378756764797278855785034534443266--6688898 Q ss_pred CHHH-HHCC---CCCCCEEECCCC--------CCCCCEEECHHHHHHHHHHHHH Q ss_conf 9889-9729---998826871677--------8667187064669999999999 Q gi|254780589|r 128 NEFD-KAHN---RTGADLMIHGEC--------ASAGCYAMNNKQMQEIYAIVRD 169 (239) Q Consensus 128 N~~D-r~~g---rtG~~I~IHG~~--------~S~GC~amtd~~i~EIy~~v~~ 169 (239) |.+- .-.. ..+....|||+. .|+|||.|+|+++.++|..|.. T Consensus 169 np~G~~y~~~~~~~~~~y~IHgt~~~~~iG~~~ShGCIRL~n~Da~~ly~~v~~ 222 (232) T COG1376 169 NPLGALYAVRSSPSDTGYGIHGTPEPASIGKAVSHGCIRLSNQDAKDLYNRVPV 222 (232) T ss_pred CCCCCEEEEEECCCCCEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCC T ss_conf 975520357404788507886887767778625887671774889999835899 No 8 >PRK12472 hypothetical protein; Provisional Probab=97.67 E-value=0.00037 Score=46.07 Aligned_cols=103 Identities=17% Similarity=0.225 Sum_probs=71.0 Q ss_pred CCCCCEEEEEECCCCEEEEEEECCCCCEEEEEEEEECCCCCCCCCHHCCCCCCCCCEEEEEEECCCCCCCCCCEEEECCC Q ss_conf 75784799998378699998607889668999865213238988701017874750799984012587576521420799 Q gi|254780589|r 47 SPFHPTVIRIFKNENILEIWKRNVDAEYVLLKEYKICAWSGTFGPKIETGDEQAPEGFYYIGWNNLNPNSKYFLSINIGF 126 (239) Q Consensus 47 ~~~~pi~Irv~K~e~~Lel~~~~~~g~~~l~ktY~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~~~~np~S~y~ls~~i~Y 126 (239) ..+.||++.|--++.++.+|..+ | ++..-||+. |..| +-||-|+|.|-.+...-.|.-|- +- T Consensus 52 ~~gePi~aiVSi~~Q~vt~YDa~--G---~~~~apVST--G~~G-------~~TP~GVFsiiqK~k~HrSniY~----~A 113 (512) T PRK12472 52 EAGEPIMAIVSIKSQQVTLYDAD--G---WILRAPVST--GTTG-------RETPAGVFAIVEKDKDHHSTMYD----DA 113 (512) T ss_pred CCCCCEEEEEEECCCEEEEECCC--C---EEEECCCCC--CCCC-------CCCCCEEEEEEECCCHHHHCCCC----CC T ss_conf 68996599998513268998168--7---289655347--8777-------87775345664215202203347----88 Q ss_pred CCHHHHHCCCCCCCEEECCCC-----CCCCCEEECHHHHHHHHHHHHH Q ss_conf 898899729998826871677-----8667187064669999999999 Q gi|254780589|r 127 PNEFDKAHNRTGADLMIHGEC-----ASAGCYAMNNKQMQEIYAIVRD 169 (239) Q Consensus 127 PN~~Dr~~grtG~~I~IHG~~-----~S~GC~amtd~~i~EIy~~v~~ 169 (239) |-+|- +--|=++|-+||.. .|+||+.|--.--+-+|.+.+- T Consensus 114 ~MP~M--QRiTWsGiALH~G~lPGYpASHGCvRmP~~FA~~lf~~T~~ 159 (512) T PRK12472 114 WMPNM--QRITWNGVALHGGPLPGYAASHGCVRMPYGFAEKLFDKTRI 159 (512) T ss_pred CCCCE--EEEEECCEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCC T ss_conf 87500--01212122321677899867676242765899987400026 No 9 >pfam10908 DUF2778 Protein of unknown function (DUF2778). This is a bacterial family of uncharacterized proteins. Probab=93.23 E-value=0.18 Score=29.47 Aligned_cols=81 Identities=17% Similarity=0.204 Sum_probs=50.5 Q ss_pred HHCCCCCCCCCEEEEEEECCCCCCCCCCEEEECC-CCCHHH-----HHCCCCCCCEEECCCC-------------CCCCC Q ss_conf 0101787475079998401258757652142079-989889-----9729998826871677-------------86671 Q gi|254780589|r 92 KIETGDEQAPEGFYYIGWNNLNPNSKYFLSINIG-FPNEFD-----KAHNRTGADLMIHGEC-------------ASAGC 152 (239) Q Consensus 92 K~~eGD~kTPEG~Y~i~~~~~np~S~y~ls~~i~-YPN~~D-----r~~grtG~~I~IHG~~-------------~S~GC 152 (239) -.....+-||-|.|+|..| .|.+++.--+. -++..+ +..+..+...||||.- .|.|| T Consensus 19 ~~v~~~GpiPpg~Y~IvdR----~sgg~~~~~~~~~~d~~~wfalyr~dg~idd~t~i~gv~Rg~FRLHP~G~~G~S~GC 94 (121) T pfam10908 19 VAVRMVGPTPPGTYYIVDR----ESGGRLGRLRDLILDLGEWFALYRDDGKIDDETLFHGVERGAFRLHPVGPRGDSNGC 94 (121) T ss_pred CCCCCCCCCCCCEEEEEEC----CCCCCCHHHHHHHCCCCCCEEEECCCCCCCCCEEECCEECCCEEECCCCCCCCCCCC T ss_conf 0156778999950698876----777732245443035430024550578757513661310163675268888665877 Q ss_pred EEECHHHHHHHHHHHHHHHHHCCCCCEEE Q ss_conf 87064669999999999997508561116 Q gi|254780589|r 153 YAMNNKQMQEIYAIVRDSLRGNMQSHIQI 181 (239) Q Consensus 153 ~amtd~~i~EIy~~v~~a~~~~gQ~~~~v 181 (239) |.+.|.. =|..++.++.. ++.+.| T Consensus 95 IT~~~~~---~F~~~r~~l~~--~~~v~V 118 (121) T pfam10908 95 ITFKDYA---RFLRARKFLES--KRLVVV 118 (121) T ss_pred EEECCHH---HHHHHHHHCCC--CCCEEC T ss_conf 7766889---99999981566--653668 No 10 >TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PMID:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al. Probab=87.05 E-value=0.29 Score=28.18 Aligned_cols=53 Identities=19% Similarity=0.394 Sum_probs=23.4 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEE Q ss_conf 468999999998604665332223444675545677777518775784799998 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIF 57 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~ 57 (239) |+.++++ ++++|+||+++..........+.-..-..+-.......+|+.|||| T Consensus 2 r~~~~l~-~~lllagC~s~~~~~~~~~v~l~l~a~~~lNpd~~G~p~PvvVrvy 54 (146) T TIGR03352 2 RFAVLLA-ACLLLAGCSSAPPPKEPTYVTLTLTAAPDVNPDEDGRASPVVVRVY 54 (146) T ss_pred CEEHHHH-HHHHHHHCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCEEEEEE T ss_conf 0755899-9999853259999888748999999568869798999357599999 No 11 >PRK10594 hypothetical protein; Provisional Probab=86.35 E-value=2.3 Score=22.61 Aligned_cols=120 Identities=16% Similarity=0.257 Sum_probs=66.1 Q ss_pred EEEEEECCCCEEEEEEECCCCCEEEEEEEEECCCCCCCCCHHCCCCCCCCCEEEEEEECCCCCCCCCCEEE--------- Q ss_conf 79999837869999860788966899986521323898870101787475079998401258757652142--------- Q gi|254780589|r 52 TVIRIFKNENILEIWKRNVDAEYVLLKEYKICAWSGTFGPKIETGDEQAPEGFYYIGWNNLNPNSKYFLSI--------- 122 (239) Q Consensus 52 i~Irv~K~e~~Lel~~~~~~g~~~l~ktY~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~~~~np~S~y~ls~--------- 122 (239) -.|.|-=-+-.|++|..+. ++-+-+|.- | | -|++||+=--.|+.--+||.=.-..|+ T Consensus 375 ~~IlVNIP~y~L~~~~~G~-----~v~~srVIV-----G-k---p~r~TPifss~I~~vV~NP~W~VP~SI~rkDiLPkl 440 (615) T PRK10594 375 TGIMVNIPAYSLVYYQNGN-----QVLDSRVIV-----G-R---PDRKTPMMSSALNNVVVNPPWNVPPTLARKDILPKV 440 (615) T ss_pred CEEEEECCCCEEEEEECCC-----EEEEEEEEE-----C-C---CCCCCHHHHCCCCEEEECCCCCCCHHHHHHHHHHHH T ss_conf 6589984640589898893-----899875685-----5-7---888882011210089978998897618888765775 Q ss_pred --------------ECCC-----------------CCHH----------HHHCCC------CCCCEEECCCCC------- Q ss_conf --------------0799-----------------8988----------997299------988268716778------- Q gi|254780589|r 123 --------------NIGF-----------------PNEF----------DKAHNR------TGADLMIHGECA------- 148 (239) Q Consensus 123 --------------~i~Y-----------------PN~~----------Dr~~gr------tG~~I~IHG~~~------- 148 (239) --++ |+.+ +-+.|+ ....|.+|.++. T Consensus 441 ~~dP~YL~~~~~~V~~g~~~~~~vdp~~IdW~~~~~~~fpYrlrQ~PG~~NALGrvKF~FPN~~sIYLHDTP~k~LF~r~ 520 (615) T PRK10594 441 RNDPGYLERHGYTVMRGWNSAEAIDPWQVDWSTITPSNLPFRFQQAPGARNSLGRYKFNMPSSDAIYLHDTPNHNLFQRD 520 (615) T ss_pred HHCHHHHHHCCEEEEECCCCCEECCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCEEEECCCCCCEECCCCCCHHHHCCC T ss_conf 43868898799299866888735470324722257566865788699998867035884479986413689985774878 Q ss_pred ----CCCCEEECHHHHHHHHHH---------HHHHHHHCCCC-------CEEEEEECC Q ss_conf ----667187064669999999---------99999750856-------111699555 Q gi|254780589|r 149 ----SAGCYAMNNKQMQEIYAI---------VRDSLRGNMQS-------HIQIQAFPF 186 (239) Q Consensus 149 ----S~GC~amtd~~i~EIy~~---------v~~a~~~~gQ~-------~~~v~~~Pf 186 (239) |+|||.+.+..-.--|.+ +.++++. |+. .++||+.=+ T Consensus 521 ~RAfSsGCVRVe~p~eLA~~LL~~~gw~~~ri~~~l~~-g~t~~v~L~~~IPV~l~Y~ 577 (615) T PRK10594 521 IRALSSGCVRVNKASDLANMLLQDAGWNDARISDALKQ-GDTRYVNIRQRIPVNLYYL 577 (615) T ss_pred CCCCCCCCEECCCHHHHHHHHHHCCCCCHHHHHHHHHC-CCEEEEECCCCCCEEEEEE T ss_conf 64247770341899999999975359999999999856-9857976699788899998 No 12 >PRK12696 flgH flagellar basal body L-ring protein; Reviewed Probab=85.23 E-value=0.79 Score=25.53 Aligned_cols=23 Identities=22% Similarity=0.507 Sum_probs=14.6 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCC Q ss_conf 940468999999998604665332 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRS 24 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~ 24 (239) |.+| ++++++++++|+||..... T Consensus 1 mm~~-~l~~~~~~~~L~GC~~~~~ 23 (238) T PRK12696 1 MIRK-LLAASCAVLLLSGCNAARQ 23 (238) T ss_pred CHHH-HHHHHHHHHHHHCCCCCCC T ss_conf 9589-9999999999516568888 No 13 >TIGR02522 pilus_cpaD pilus (Caulobacter type) biogenesis lipoprotein CpaD; InterPro: IPR013361 Proteins in this entry consist of a pilus biogenesis protein, CpaD, from Caulobacter, and homologues in other bacteria, including three in the root nodule bacterium Bradyrhizobium japonicum. The molecular function of the homologues is not known.. Probab=83.07 E-value=2.7 Score=22.18 Aligned_cols=49 Identities=18% Similarity=0.398 Sum_probs=29.8 Q ss_pred HHHHHHHHHHCCCCCCC-CCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEEC Q ss_conf 99999999860466533-2223444675545677777518775784799998378699998607 Q gi|254780589|r 7 ILLFALFIFLNGCHHSR-SLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEIWKRN 69 (239) Q Consensus 7 ~Llfal~i~L~gc~~~~-~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~~~~ 69 (239) +++.++++.|+||.... ... ..+...+.....||+|+ .++..++|.... T Consensus 2 ~~~~a~~~~LagC~~~~P~~~------------~~~~p~d~~~~~PI~v~--~~~~~~~l~v~~ 51 (211) T TIGR02522 2 VLLLALTALLAGCASTDPEID------------VGALPDDYRQRHPIAVQ--EESKSIDLLVAP 51 (211) T ss_pred HHHHHHHHHHHHCCCCCCCCC------------CCCCCCCCCCCCCCEEC--CCCEEEEEEECC T ss_conf 789889999753058887555------------34323453458873551--177013455448 No 14 >COG2989 Uncharacterized protein conserved in bacteria [Function unknown] Probab=82.71 E-value=2.3 Score=22.64 Aligned_cols=48 Identities=15% Similarity=0.339 Sum_probs=33.7 Q ss_pred CCCEEECCCCC-----------CCCCEEECHHHHHHHHHH---------HHHHHHHCCC-------CCEEEEEECC Q ss_conf 88268716778-----------667187064669999999---------9999975085-------6111699555 Q gi|254780589|r 138 GADLMIHGECA-----------SAGCYAMNNKQMQEIYAI---------VRDSLRGNMQ-------SHIQIQAFPF 186 (239) Q Consensus 138 G~~I~IHG~~~-----------S~GC~amtd~~i~EIy~~---------v~~a~~~~gQ-------~~~~v~~~Pf 186 (239) -..|..|-+++ |+||+.+-|..=---+.| +.+.|+. |+ ..|+||++=| T Consensus 449 ~~aIYmHDTP~kslF~r~mRalSsGCVRvq~~rdla~~lL~d~Gws~~~v~~~ik~-g~t~~i~v~~~vPVyl~Y~ 523 (561) T COG2989 449 SHAIYLHDTPSKSLFNRDMRALSSGCVRVQKPRDLANALLKDPGWSVDRVEETLKS-GKTTPIKVRQPVPVYLYYF 523 (561) T ss_pred CCCEEEECCCCHHHHHHHHHHHCCCCEEECCHHHHHHHHHHCCCCCHHHHHHHHCC-CCCEEEECCCCCCEEEEEE T ss_conf 76235206861555665777753585574488999999985579888998767416-8840200377786799999 No 15 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=82.63 E-value=2.2 Score=22.78 Aligned_cols=127 Identities=16% Similarity=0.201 Sum_probs=62.7 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEECCCCCEEEEEEE Q ss_conf 94046899999999860466533222344467554567777751877578479999837869999860788966899986 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEIWKRNVDAEYVLLKEY 80 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~~~~~~g~~~l~ktY 80 (239) |.+++|..++.++++|+||+... .++.+..+..+..-+... .++- T Consensus 1 m~k~~~~~~l~~~l~lsgC~s~~-------~~~t~~~~~~~~~~~~~~---------------------t~~~------- 45 (160) T PRK09967 1 MIKHLVAPLIFTSLILTGCQSPQ-------GKFTPEQVAAMQSYGFTE---------------------SAGD------- 45 (160) T ss_pred CHHHHHHHHHHHHHHHHHCCCCC-------CCCCHHHHHHHHHCCCCC---------------------CCCC------- T ss_conf 90789999999999998458999-------998989988887437522---------------------3543------- Q ss_pred EECCCCCCCCCHHCCCCCCCCCEEEE-EEECCCCCCCCCCEEEECCCCCHHHHHCCCCCCCEEECCCCCCCCCEEECHHH Q ss_conf 52132389887010178747507999-84012587576521420799898899729998826871677866718706466 Q gi|254780589|r 81 KICAWSGTFGPKIETGDEQAPEGFYY-IGWNNLNPNSKYFLSINIGFPNEFDKAHNRTGADLMIHGECASAGCYAMNNKQ 159 (239) Q Consensus 81 ~Ic~~sg~lGpK~~eGD~kTPEG~Y~-i~~~~~np~S~y~ls~~i~YPN~~Dr~~grtG~~I~IHG~~~S~GC~amtd~~ 159 (239) |.-++ |+-+|+ .+...++|.+.--|. ......+..+...|-|.|-+-+.|--+- |.. T Consensus 46 ----~~~g~-----------~d~V~Fd~d~~~L~~~s~~~L~------~~A~~L~~~p~~~v~I~GHTD~~G~~~y-N~~ 103 (160) T PRK09967 46 ----WSLGL-----------SDAILFAKNDYKLLPESQQQIQ------TMAAKLASTGLTHARMDGHTDNYGEDSY-NEG 103 (160) T ss_pred ----CCCCC-----------CCCEEECCCCCCCCHHHHHHHH------HHHHHHHHCCCCEEEEEEECCCCCCHHH-HHH T ss_conf ----14489-----------9837926995011998999999------9999999789962999874799899899-999 Q ss_pred HHHHHH-HHHHHHHHCC-CCC--EEEEEE Q ss_conf 999999-9999997508-561--116995 Q gi|254780589|r 160 MQEIYA-IVRDSLRGNM-QSH--IQIQAF 184 (239) Q Consensus 160 i~EIy~-~v~~a~~~~g-Q~~--~~v~~~ 184 (239) ..|-=+ -|.++|..+| -+. +.+.-| T Consensus 104 LS~rRA~aV~~~l~~~g~i~~~ri~~~g~ 132 (160) T PRK09967 104 LSLKRANVVADAWAMGGQIPRSNLTTQGL 132 (160) T ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEEEC T ss_conf 99999999999998537854435899964 No 16 >COG3521 Predicted component of the type VI protein secretion system [Intracellular trafficking, secretion, and vesicular transport] Probab=81.87 E-value=1.3 Score=24.28 Aligned_cols=99 Identities=22% Similarity=0.381 Sum_probs=44.8 Q ss_pred CCC--HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCC-----CCCCCEEEEEE--CCCCEEEEEEECCC Q ss_conf 940--4689999999986046653322234446755456777775187-----75784799998--37869999860788 Q gi|254780589|r 1 MTN--RYNILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRT-----SPFHPTVIRIF--KNENILEIWKRNVD 71 (239) Q Consensus 1 M~~--R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~-----~~~~pi~Irv~--K~e~~Lel~~~~~~ 71 (239) |.. +.+.++|++++ ++||+++... .|...-.+...+..++ ...+|+.||+| |++... T Consensus 1 M~~~~~a~~~l~al~~-~sgCsss~~~-----~pp~~l~l~l~a~~~~Np~~~g~a~Pl~VrlyeLk~d~~F-------- 66 (159) T COG3521 1 MNSSRKAVLALFALLV-LSGCSSSKPL-----LPPSRLDLTLTAAPDLNPNANGEAAPLEVRLYELKDDSKF-------- 66 (159) T ss_pred CCHHHHHHHHHHHHHH-HHHHCCCCCC-----CCCCEEEEEEEECCCCCCCCCCCCCCEEEEEEEECCCCCC-------- T ss_conf 9425778999999998-5241258877-----8971888999706876989778836639999997686544-------- Q ss_pred CCEEEEEEEEE--CCCCCCCCCHHCCCCC--CCCCEEEEEEECCCCCCCCC Q ss_conf 96689998652--1323898870101787--47507999840125875765 Q gi|254780589|r 72 AEYVLLKEYKI--CAWSGTFGPKIETGDE--QAPEGFYYIGWNNLNPNSKY 118 (239) Q Consensus 72 g~~~l~ktY~I--c~~sg~lGpK~~eGD~--kTPEG~Y~i~~~~~np~S~y 118 (239) . -..|.- |...-.||.-...-|- -.|-.++.+++. ++.+.+| T Consensus 67 ---~-~adY~~l~~~~~~tLg~dlv~~~e~~L~Pge~k~l~~~-l~e~t~~ 112 (159) T COG3521 67 ---L-SADYDQLASDLKSTLGGDLVDVDEYILLPGETKTLSRT-LDEQTRA 112 (159) T ss_pred ---C-CCCHHHHHCCCCHHCCCCCCCCEEEEECCCCEEEECCC-CCCCCEE T ss_conf ---5-44436650562000052200113689759830770578-5635446 No 17 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=80.65 E-value=4.1 Score=21.12 Aligned_cols=91 Identities=14% Similarity=0.173 Sum_probs=41.1 Q ss_pred CCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEECCCCCEEEEEEEE Q ss_conf 40468999999998604665332223444675545677777518775784799998378699998607889668999865 Q gi|254780589|r 2 TNRYNILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEIWKRNVDAEYVLLKEYK 81 (239) Q Consensus 2 ~~R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~~~~~~g~~~l~ktY~ 81 (239) ++|+.+++|+++++|+||+...... ..+.|.+ -.+...+...+...- -+.=.....+..+|--+- +|+ +..-. T Consensus 6 r~~~~l~~l~~~~~L~gC~~~~~~~-~~~~~~~-V~v~~v~~~~~~~~~-~~~G~v~a~~~v~l~~~v-~G~---v~~v~ 78 (385) T PRK09578 6 RRRLALAALVAAFALAGCGKGDSDA-AAEAPRE-ATVVTVRPTSVPMTV-ELPGRLDAYRQAEVRARV-AGI---VTART 78 (385) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCC-CCCCCCE-EEEEEEEEECCCEEE-EEEEEEEEEEEEEEEEEC-CEE---EEEEE T ss_conf 7999999999999981679998787-8999960-389998773322699-999999987999999664-469---99998 Q ss_pred ECCCCCCCCCHHCCCCCCCCCEEEEEEE Q ss_conf 2132389887010178747507999840 Q gi|254780589|r 82 ICAWSGTFGPKIETGDEQAPEGFYYIGW 109 (239) Q Consensus 82 Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~ 109 (239) + ..|+..+.||. ...|+. T Consensus 79 v-----~~Gd~V~kGq~-----La~lD~ 96 (385) T PRK09578 79 Y-----EEGQEVKQGAV-----LFRIDP 96 (385) T ss_pred E-----CCCCEECCCCE-----EEEECC T ss_conf 0-----79998738988-----999853 No 18 >PRK12407 flgH flagellar basal body L-ring protein; Reviewed Probab=76.17 E-value=2 Score=23.06 Aligned_cols=20 Identities=20% Similarity=0.487 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHHCCCCCCC Q ss_conf 46899999999860466533 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~ 23 (239) |.|+++.+++++|+||.... T Consensus 1 kr~l~l~~~~l~LsGC~~~~ 20 (220) T PRK12407 1 KRFLILTPMVLALCGCESPA 20 (220) T ss_pred CCCHHHHHHHHHHHCCCCCC T ss_conf 91579999999973658888 No 19 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=75.70 E-value=5.1 Score=20.51 Aligned_cols=92 Identities=16% Similarity=0.172 Sum_probs=39.1 Q ss_pred CCCHHHHHHH--HHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEECCCCCEEEEE Q ss_conf 9404689999--99998604665332223444675545677777518775784799998378699998607889668999 Q gi|254780589|r 1 MTNRYNILLF--ALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEIWKRNVDAEYVLLK 78 (239) Q Consensus 1 M~~R~~~Llf--al~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~~~~~~g~~~l~k 78 (239) |++|..+|+. .+.++|+||.......... +.|+-.+............ -+.=.....++-+|.-+- +|+ +. T Consensus 1 m~~k~~~li~ll~~~~lL~gC~~~~~~~~~~--~~p~V~v~~v~~~~~~~~~-~~~G~v~~~~~~~l~~~V-sG~---v~ 73 (385) T PRK09859 1 MNRRRKLLIPLLFCGAMLTACDDKSAENAAA--MTPEVGVVTLSPGSVNVLS-ELPGRTVPYEVAEIRPQV-GGI---II 73 (385) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCCCCCCCC--CCCCEEEEEEEEEEEEEEE-EEEEEEEEEEEEEEEEEC-CEE---EE T ss_conf 9820678999999999995379997687899--9981489998675101589-987999962899998145-629---99 Q ss_pred EEEECCCCCCCCCHHCCCCCCCCCEEEEEEE Q ss_conf 8652132389887010178747507999840 Q gi|254780589|r 79 EYKICAWSGTFGPKIETGDEQAPEGFYYIGW 109 (239) Q Consensus 79 tY~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~ 109 (239) .-++ ..|+..++||. ...|+. T Consensus 74 ~~~v-----~~Gd~VkkGq~-----La~LD~ 94 (385) T PRK09859 74 KRNF-----IEGDKVNQGDS-----LYQIDP 94 (385) T ss_pred EEEC-----CCCCEEECCCE-----EEEECC T ss_conf 9983-----89998848988-----998284 No 20 >COG2134 Cdh CDP-diacylglycerol pyrophosphatase [Lipid metabolism] Probab=74.82 E-value=7.1 Score=19.63 Aligned_cols=136 Identities=19% Similarity=0.211 Sum_probs=68.1 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC-CHHHHHHHHHCCCC---CCCCEEEEEECCCCEEEEEEECCC-CCEE Q ss_conf 940468999999998604665332223444675-54567777751877---578479999837869999860788-9668 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSLIDKAEHPL-SENLIISMQKKRTS---PFHPTVIRIFKNENILEIWKRNVD-AEYV 75 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~~~~~~~pl-~~~~~~~~~~~~~~---~~~pi~Irv~K~e~~Lel~~~~~~-g~~~ 75 (239) |+++.++++.++++.+.+. .......+.+.|- --+++...-.-++. -.+|-. -.|+..---|+|...| -.|- T Consensus 1 mkka~~L~l~~i~I~~~ag-a~i~~~~~~~npdaLwKIV~e~C~p~~~~~~~paPCa--eV~~~AG~av~Kd~~gPlQyL 77 (252) T COG2134 1 MKKAGYLFLVGIVIAVLAG-ALIAYKLRPNNPDALWKIVHERCLPNYQQNQNPAPCA--EVKPQAGYAVLKDRNGPLQYL 77 (252) T ss_pred CCEEEHHHHHHHHHHHHCC-CCEEEECCCCCHHHHHHHHHHHCCCCHHCCCCCCCCE--EECCCCCEEEEECCCCCCEEE T ss_conf 9501014999999999626-5157545999848999999987076610159999863--645877648881368970157 Q ss_pred EEEEEEECCCCCCCCCHHCCCCCCCCCEEEE-------EEECCCCCCCCCCEEEECCCCCHHHHHCCCCCCCEEECCCCC Q ss_conf 9998652132389887010178747507999-------840125875765214207998988997299988268716778 Q gi|254780589|r 76 LLKEYKICAWSGTFGPKIETGDEQAPEGFYY-------IGWNNLNPNSKYFLSINIGFPNEFDKAHNRTGADLMIHGECA 148 (239) Q Consensus 76 l~ktY~Ic~~sg~lGpK~~eGD~kTPEG~Y~-------i~~~~~np~S~y~ls~~i~YPN~~Dr~~grtG~~I~IHG~~~ 148 (239) |+-+|+| +|--.|-..| --||-=||- ++.+.-||-...-.|+.||= ..|||-..+-||=+|. T Consensus 78 LmPt~rI---tGiE~P~L~e--~atpNyf~~AWqAR~fms~kyg~~ipd~dvsLaINs------k~gRtQdqlHIHISCi 146 (252) T COG2134 78 LMPTARI---TGIESPLLLE--PATPNYFYLAWQARDFMSKKYGNPIPDSDVSLAINS------KNGRTQDQLHIHISCI 146 (252) T ss_pred EEEEECC---CCCCCHHHCC--CCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEECC------CCCCCCCCEEEEEEEC T ss_conf 7300114---6776734407--788648999998888999971899996012888647------6686543158999845 Q ss_pred CC Q ss_conf 66 Q gi|254780589|r 149 SA 150 (239) Q Consensus 149 S~ 150 (239) |- T Consensus 147 sp 148 (252) T COG2134 147 SP 148 (252) T ss_pred CH T ss_conf 90 No 21 >COG3065 Slp Starvation-inducible outer membrane lipoprotein [Cell envelope biogenesis, outer membrane] Probab=74.49 E-value=4.6 Score=20.82 Aligned_cols=65 Identities=22% Similarity=0.235 Sum_probs=30.1 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCH-HHHHHHHHCCCCCCCCEE-------EEEECCCCEEEEE Q ss_conf 94046899999999860466533222344467554-567777751877578479-------9998378699998 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSLIDKAEHPLSE-NLIISMQKKRTSPFHPTV-------IRIFKNENILEIW 66 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~-~~~~~~~~~~~~~~~pi~-------Irv~K~e~~Lel~ 66 (239) |+...-.|+.+++++|+||.+......+.. |.+. ++...+...+...++++. +..-|..-.||+- T Consensus 5 m~~~~~~l~~~laflLsgC~tiPk~l~g~~-~~s~~s~~~~~~~~~~~~gq~aR~GGkVvnv~n~~~rTrlEi~ 77 (191) T COG3065 5 MNMKKGALIGTLAFLLSGCVTIPKALKGES-PTSQQSLVRVMSQPQLYVGQQARFGGKVVNVINQKGRTRLEIA 77 (191) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHCCCC-CCCHHHEEEECCCCCCCCCCEEEECCEEEEEECCCCCEEEEEE T ss_conf 764799999999999763035783126899-7422201100257765204423517689987627895699999 No 22 >TIGR00752 slp outer membrane lipoprotein, Slp family; InterPro: IPR004658 Slp superfamily members are present in the Gram-negative gamma proteobacteria Escherichia coli (which also contains a close paralog), Haemophilus influenzae and Pasteurella multocida and Vibrio cholerae. The known members of the family to date share a motif LX[GA]C near the N-terminus, which is compatible with the possibility that the protein is modified into a lipoprotein with Cys as the new N-terminus. Slp from E. coli is known to be a lipoprotein of the outer membrane and to be expressed in response to carbon starvation.; GO: 0019867 outer membrane. Probab=72.38 E-value=8.2 Score=19.27 Aligned_cols=63 Identities=22% Similarity=0.310 Sum_probs=38.7 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHH-HCCCCCCCCE-------EEEEECCCCEEEEEE Q ss_conf 68999999998604665332223444675545677777-5187757847-------999983786999986 Q gi|254780589|r 5 YNILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQ-KKRTSPFHPT-------VIRIFKNENILEIWK 67 (239) Q Consensus 5 ~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~-~~~~~~~~pi-------~Irv~K~e~~Lel~~ 67 (239) .-+|+.++++.|+||.+......+.++|.+-+...... +..+..+..+ -+.+-|.+-+||+-. T Consensus 4 ~g~l~~~l~f~L~gC~~~P~~ikg~~qP~~~~s~v~v~~qp~lyvg~~aRfGGKvvnv~~~~~~T~lEi~~ 74 (190) T TIGR00752 4 KGLLITALAFLLTGCIAVPKAIKGERQPFSQKSYVEVSSQPLLYVGQEARFGGKVVNVTNLKNKTKLEILS 74 (190) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHEECCCCCEEECCEEEECCEEEEEEECCCCEEEEEEE T ss_conf 14568889988753146742345567986200232110799723110575377599987648870589987 No 23 >pfam12262 Lipase_bact_N Bacterial virulence factor lipase N-terminal. This domain family is found in bacteria, and is typically between 258 and 271 amino acids in length. There are two conserved sequence motifs: DGT and DGWST. This family is the N-terminal region of bacterial virulence factor lipase. The N-terminal region contains a potential signalling sequence. Probab=72.30 E-value=3.7 Score=21.41 Aligned_cols=24 Identities=29% Similarity=0.602 Sum_probs=20.1 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCC Q ss_conf 940468999999998604665332 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRS 24 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~ 24 (239) |++.++.++++.++.|+||.+... T Consensus 1 MKk~~l~~~i~SAL~LaGCg~~s~ 24 (269) T pfam12262 1 MKKKFLALLLASALLLAGCGDDTE 24 (269) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCC T ss_conf 924799999999864111579965 No 24 >PRK11548 hypothetical protein; Provisional Probab=69.58 E-value=9.4 Score=18.89 Aligned_cols=87 Identities=13% Similarity=0.204 Sum_probs=50.0 Q ss_pred CCCHHHHHHHH-HHHHHCCCCCCCCCCCCCC----CCCCHHHHHHHHHCCCC-------CCCCEEEEEECCCCEEEEEEE Q ss_conf 94046899999-9998604665332223444----67554567777751877-------578479999837869999860 Q gi|254780589|r 1 MTNRYNILLFA-LFIFLNGCHHSRSLIDKAE----HPLSENLIISMQKKRTS-------PFHPTVIRIFKNENILEIWKR 68 (239) Q Consensus 1 M~~R~~~Llfa-l~i~L~gc~~~~~~~~~~~----~pl~~~~~~~~~~~~~~-------~~~pi~Irv~K~e~~Lel~~~ 68 (239) |+.+.+...+. ++++|+||+......-+.. .-+..+.+.++ ..||+ .++|++.-.|..+|--.+|.. T Consensus 1 m~~k~l~~a~ll~~lllsgCS~~~~~vYk~dI~QGN~i~~~~v~~L-~~GMTk~QV~~iLGtP~l~d~F~~~rWdYvy~~ 79 (113) T PRK11548 1 MRCKTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTPNDVAKI-RVGMTQQQVAYALGTPMMSDPFGTNTWFYVFRQ 79 (113) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHH-CCCCCHHHHHHHCCCCCCCCCCCCCCEEEEEEE T ss_conf 9257889999999999801157877565765575676699999876-189999999987189632388788746999999 Q ss_pred CCCCCEEEEEEEEECCCCCCCC Q ss_conf 7889668999865213238988 Q gi|254780589|r 69 NVDAEYVLLKEYKICAWSGTFG 90 (239) Q Consensus 69 ~~~g~~~l~ktY~Ic~~sg~lG 90 (239) ..++. ......+..++-.-| T Consensus 80 ~~g~~--~~~~~~ltl~Fd~~~ 99 (113) T PRK11548 80 QPGHE--KVTQQTLTLTFNSSG 99 (113) T ss_pred ECCCC--CEEEEEEEEEECCCC T ss_conf 60898--358889999983999 No 25 >TIGR02747 TraV type IV conjugative transfer system protein TraV; InterPro: IPR014118 This entry includes TraV, which is a component of conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK , , . The alignment contains three conserved cysteines in the N-terminal half.. Probab=66.18 E-value=3.4 Score=21.57 Aligned_cols=22 Identities=36% Similarity=0.578 Sum_probs=14.3 Q ss_pred CCCHHHHHHHHHH-HHHCCCCCC Q ss_conf 9404689999999-986046653 Q gi|254780589|r 1 MTNRYNILLFALF-IFLNGCHHS 22 (239) Q Consensus 1 M~~R~~~Llfal~-i~L~gc~~~ 22 (239) |+++.++++++++ ++|.||++. T Consensus 1 Mmr~~~L~~~~~~~f~LtGCsa~ 23 (174) T TIGR02747 1 MMRLKVLLLLACVAFLLTGCSAS 23 (174) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH T ss_conf 90678999999998720025774 No 26 >COG5633 Predicted periplasmic lipoprotein [General function prediction only] Probab=65.41 E-value=11 Score=18.37 Aligned_cols=80 Identities=15% Similarity=0.250 Sum_probs=42.5 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEECCCCCEEEEEEEEECC Q ss_conf 68999999998604665332223444675545677777518775784799998378699998607889668999865213 Q gi|254780589|r 5 YNILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEIWKRNVDAEYVLLKEYKICA 84 (239) Q Consensus 5 ~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~~~~~~g~~~l~ktY~Ic~ 84 (239) +++ ++..+++|.||.+......-.++.+.-.. ..-.++++...|.++.+.-+.+...+-+++....-. -+|++ - T Consensus 4 ~~~-~~l~~~lLvGCsS~~~i~~~~~q~vvm~~--s~l~~~Isae~~~l~~sd~~~~~~s~l~N~~q~pv~--v~Yrf-Y 77 (123) T COG5633 4 LCL-LSLALLLLVGCSSHQEILVNDEQSVVMET--SVLIAGISAEKPVLSESDGQPSASSVLKNKRQEPVT--VHYRF-Y 77 (123) T ss_pred EHH-HHHHHHHHHCCCCCCCCCCCCCCEEEECC--CCEECCCCCCCCEEEEECCCCCEEEEEECCCCCCEE--EEEEE-E T ss_conf 038-99999994204788775225502034035--513414130587026631532115897515568638--99999-9 Q ss_pred CCCCCC Q ss_conf 238988 Q gi|254780589|r 85 WSGTFG 90 (239) Q Consensus 85 ~sg~lG 90 (239) |.-..| T Consensus 78 WYD~~G 83 (123) T COG5633 78 WYDAQG 83 (123) T ss_pred EECCCC T ss_conf 982888 No 27 >pfam02402 Lysis_col Lysis protein. These small bacterial proteins are required for colicin release and partial cell lysis. This family contains lysis proteins for several different forms of colicin. Bacillus subtilis lytA has been included in this family, the similarity is not highly significant, however it is also a short protein, that is involved in secretion of other proteins (Bateman A pers. obs.). Probab=64.21 E-value=2.9 Score=22.04 Aligned_cols=22 Identities=23% Similarity=0.425 Sum_probs=18.6 Q ss_pred CCCHHHHHHHHHHHHHCCCCCC Q ss_conf 9404689999999986046653 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHS 22 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~ 22 (239) |+++.++.+|.+.++|+||+.. T Consensus 1 Mkki~~~~i~~~~~~L~aCQaN 22 (46) T pfam02402 1 MKKILFIGILLLTVLLSACQAN 22 (46) T ss_pred CCEEEEHHHHHHHHHHHHHHHH T ss_conf 9242013999999999985550 No 28 >PRK10760 murein hydrolase B; Provisional Probab=62.86 E-value=5.5 Score=20.32 Aligned_cols=22 Identities=27% Similarity=0.623 Sum_probs=17.3 Q ss_pred CHHHHHHHHHHHHHCCCCCCCC Q ss_conf 0468999999998604665332 Q gi|254780589|r 3 NRYNILLFALFIFLNGCHHSRS 24 (239) Q Consensus 3 ~R~~~Llfal~i~L~gc~~~~~ 24 (239) +|.|+.++.++++|++|++... T Consensus 1 ~r~~~~~~~~~~~l~~css~p~ 22 (357) T PRK10760 1 KRRYVTLLPLFVLLAACSSKPK 22 (357) T ss_pred CCCHHHHHHHHHHHHHHCCCCC T ss_conf 9404868999999998518999 No 29 >pfam06474 MLTD_N MLTD_N. Probab=61.90 E-value=8.4 Score=19.18 Aligned_cols=19 Identities=16% Similarity=0.487 Sum_probs=11.5 Q ss_pred HHHHHCCCCCCCCCCCCCC Q ss_conf 9998604665332223444 Q gi|254780589|r 12 LFIFLNGCHHSRSLIDKAE 30 (239) Q Consensus 12 l~i~L~gc~~~~~~~~~~~ 30 (239) ++++|+||+.+........ T Consensus 9 ~alLLaGCQst~~~~~~t~ 27 (93) T pfam06474 9 AALLLVGCQSTQAVSSTTS 27 (93) T ss_pred HHHHHHHHCCCCCCCCCCC T ss_conf 9999998137999999877 No 30 >TIGR02544 III_secr_YscJ type III secretion apparatus lipoprotein, YscJ/HrcJ family; InterPro: IPR003282 Secretion of virulence factors in Gram-negative bacteria involves transportation of the protein across two membranes to reach the cell exterior. There have been four secretion systems described in animal enteropathogens such as Salmonella and Yersinia, with further sequence similarities in plant pathogens like Ralstonia and Erwinia. The type III secretion system is of great interest as it is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. The protein subunits of the system are very similar to those of bacterial flagellar biosynthesis . However, while the latter forms a ring structure to allow secretion of flagellin and is an integral part of the flagellum itself, type III subunits in the outer membrane translocate secreted proteins through a channel-like structure. One of the outer membrane protein subunit families, termed "K" here for nomenclature purposes, aids in the structural assembly of the invasion complex . It is also described as a lipoprotein. Members of this family include the Salmonella PrgK and SsaJ genes, MxiJ from Shigella, YscJ from Yersinia, and from the plant enteropathogens NolT (Rhizobium) and HrcJ Erwinia). The flagellar M-ring protein FliF also shares a low level of similarity, presumably due to evolution of the type III secretion system from the flagellar biosynthetic pathway.; GO: 0009306 protein secretion. Probab=60.29 E-value=10 Score=18.65 Aligned_cols=45 Identities=18% Similarity=0.402 Sum_probs=31.4 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCC Q ss_conf 468999999998604665332223444675545677777518775 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSP 48 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~ 48 (239) |..++++.++++|+||....-..+.-..-=..+.++.+..+|+.. T Consensus 2 ~~~~~~~~l~llL~GC~~~~~Ly~gL~e~eANe~lA~L~~~gI~A 46 (203) T TIGR02544 2 RRKLLLLLLLLLLTGCKVDLELYSGLSEREANEMLALLMRHGIDA 46 (203) T ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHCCCCE T ss_conf 358899999999717874056504789878999999999669834 No 31 >PRK10533 putative lipoprotein; Provisional Probab=60.20 E-value=6.5 Score=19.88 Aligned_cols=23 Identities=22% Similarity=0.335 Sum_probs=20.4 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 94046899999999860466533 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~ 23 (239) |++..+.|++.++++|++|.... T Consensus 1 Mr~~~~~ll~PlallLSACttV~ 23 (171) T PRK10533 1 MRYSKLALLLPCALLLSACTTVT 23 (171) T ss_pred CCCCHHHHHHHHHHHHHHHCCCC T ss_conf 98620389999999998722476 No 32 >PRK00022 lolB outer membrane lipoprotein LolB; Provisional Probab=55.56 E-value=17 Score=17.30 Aligned_cols=94 Identities=17% Similarity=0.168 Sum_probs=38.5 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCH---HHHHHH-HHCCCCCCCCEEEEEECC-CCEEEEEEECCCCCEEEEE Q ss_conf 46899999999860466533222344467554---567777-751877578479999837-8699998607889668999 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSRSLIDKAEHPLSE---NLIISM-QKKRTSPFHPTVIRIFKN-ENILEIWKRNVDAEYVLLK 78 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~---~~~~~~-~~~~~~~~~pi~Irv~K~-e~~Lel~~~~~~g~~~l~k 78 (239) |.+.+++..+++|+||............+... +-...+ ....-.....+.|+--++ .+-=--|.+. +. T Consensus 3 r~~~~l~~~~llLsgCat~p~~~~~~~~~~~~~w~~~q~~l~~l~~w~~~Gkiav~~~~~~~sa~f~W~q~-~~------ 75 (203) T PRK00022 3 RLLRLLPLAALLLAGCASLPPPKGPGKSPTSPQWQQHQQQLQQIQQYQARGRFAYISPQQRVSGRFDWQQQ-GD------ 75 (203) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCEEEEEEEEEECCCCEEEEEEEEEEC-CC------ T ss_conf 63899999999986614899988877775667499999999734234884689998788547899999975-98------ Q ss_pred EEEECCCCCCCCCHHC--CCCCCCCCEEEEEE Q ss_conf 8652132389887010--17874750799984 Q gi|254780589|r 79 EYKICAWSGTFGPKIE--TGDEQAPEGFYYIG 108 (239) Q Consensus 79 tY~Ic~~sg~lGpK~~--eGD~kTPEG~Y~i~ 108 (239) +|.|.+ +|++|--.. +|| |.|..-.+ T Consensus 76 ~~~i~L-~~pLG~~~~~i~~~---~~~~~L~~ 103 (203) T PRK00022 76 RYRLLL-TSPLGSTELELTGR---PGGATLED 103 (203) T ss_pred EEEEEE-EECCCCEEEEEEEC---CCEEEEEE T ss_conf 169999-80266549999987---99799997 No 33 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=55.15 E-value=8.7 Score=19.09 Aligned_cols=22 Identities=32% Similarity=0.609 Sum_probs=16.2 Q ss_pred CCHHHHHHHHHHHHHCCCCCCC Q ss_conf 4046899999999860466533 Q gi|254780589|r 2 TNRYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 2 ~~R~~~Llfal~i~L~gc~~~~ 23 (239) +++.++++.+++++|+||++.. T Consensus 3 ~~k~l~~~~~l~l~l~gCs~~~ 24 (243) T PRK10866 3 RMKYLVAAATLSLFLAGCSGSK 24 (243) T ss_pred HHHHHHHHHHHHHHHHHCCCCC T ss_conf 5899999999999998768998 No 34 >TIGR02722 lp_ uncharacterized lipoprotein; InterPro: IPR014094 Members of this protein family are restricted to the Proteobacteria, and all are predicted lipoproteins. In genomes that contain the thiK gene for the salvage enzyme thiamin kinase, the member of this family is encoded nearby.. Probab=54.48 E-value=12 Score=18.22 Aligned_cols=27 Identities=30% Similarity=0.512 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 899999999860466533222344467 Q gi|254780589|r 6 NILLFALFIFLNGCHHSRSLIDKAEHP 32 (239) Q Consensus 6 ~~Llfal~i~L~gc~~~~~~~~~~~~p 32 (239) ++++++++++|+||......+..++.| T Consensus 6 l~~~~~~Al~L~GC~~~~~~y~~~~~~ 32 (215) T TIGR02722 6 LIFVALLALLLSGCVSQRVAYTDATDP 32 (215) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCC T ss_conf 899999999985468872665688756 No 35 >TIGR03502 lipase_Pla1_cef extracellular lipase, Pla-1/cef family. Members of this protein family are bacterial lipoproteins largely from the Gammaproteobacteria. Characterized members are expressed in extracellularly and have esterase activity. Members include the lipase Pla-1 from Aeromonas hydrophila (AF092033) and CHO cell elongation factor (cef) from Vibrio hollisae Probab=54.17 E-value=12 Score=18.20 Aligned_cols=21 Identities=29% Similarity=0.293 Sum_probs=18.6 Q ss_pred CCCHHHHHHHHHHHHHCCCCC Q ss_conf 940468999999998604665 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHH 21 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~ 21 (239) |++.++.+.++.++.|+||.+ T Consensus 1 MKk~~l~~~iasal~LagCGg 21 (792) T TIGR03502 1 MKKLLLSLAIASALGLAGCGD 21 (792) T ss_pred CCHHHHHHHHHHHHHHHCCCC T ss_conf 933789999999875004578 No 36 >PRK11627 hypothetical protein; Provisional Probab=54.03 E-value=9.3 Score=18.93 Aligned_cols=23 Identities=26% Similarity=0.498 Sum_probs=14.7 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCC Q ss_conf 940468999999998604665332 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRS 24 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~ 24 (239) |.++.++.+++ +++|+||+.+.. T Consensus 1 ~mkk~~~~l~a-~~lL~gCas~p~ 23 (192) T PRK11627 1 MLKKILFPLVA-LFMLAGCATPPT 23 (192) T ss_pred CHHHHHHHHHH-HHHHHHHCCCCC T ss_conf 92879999999-999986069976 No 37 >PRK10781 rcsF outer membrane lipoprotein; Reviewed Probab=54.00 E-value=9.8 Score=18.79 Aligned_cols=20 Identities=25% Similarity=0.383 Sum_probs=12.4 Q ss_pred CHHHHHHHHHHHHHCCCCCCC Q ss_conf 046899999999860466533 Q gi|254780589|r 3 NRYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 3 ~R~~~Llfal~i~L~gc~~~~ 23 (239) +|++.+.| ++++|+||..-. T Consensus 1 MR~lpi~l-lal~ltGCs~l~ 20 (133) T PRK10781 1 MRALPICL-LALMLTGCSMLS 20 (133) T ss_pred CCHHHHHH-HHHHHHCCCCCC T ss_conf 92089999-999984313445 No 38 >PRK06052 5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase; Provisional Probab=53.62 E-value=18 Score=17.11 Aligned_cols=35 Identities=9% Similarity=0.125 Sum_probs=18.8 Q ss_pred HCCCCCCCCCEEEEEEECCCCCCCCCCEEEECC-CCCHHH Q ss_conf 101787475079998401258757652142079-989889 Q gi|254780589|r 93 IETGDEQAPEGFYYIGWNNLNPNSKYFLSINIG-FPNEFD 131 (239) Q Consensus 93 ~~eGD~kTPEG~Y~i~~~~~np~S~y~ls~~i~-YPN~~D 131 (239) .+|--++|++=.--|+ .|-.-|+..|+-. |-..++ T Consensus 103 yke~~g~tl~VRVCVt----GP~eLY~~~FG~t~y~Di~~ 138 (343) T PRK06052 103 YKEETGETLEVRVCVT----GPTELYLQEFGGTRYTDIYL 138 (343) T ss_pred HHHHCCCCCCEEEEEC----CCHHHHHHHHCCCHHHHHHH T ss_conf 8885088550799944----82799999848820899999 No 39 >PRK02944 OxaA-like protein precursor; Validated Probab=52.67 E-value=15 Score=17.64 Aligned_cols=22 Identities=32% Similarity=0.677 Sum_probs=14.9 Q ss_pred CCCHHHHHHH--HHHHHHCCCCCC Q ss_conf 9404689999--999986046653 Q gi|254780589|r 1 MTNRYNILLF--ALFIFLNGCHHS 22 (239) Q Consensus 1 M~~R~~~Llf--al~i~L~gc~~~ 22 (239) |++|+.+++. +++++|+||... T Consensus 1 mkk~~~~~~~~~~~~~~lsgC~~~ 24 (255) T PRK02944 1 MKKKLGLLAMVVALMAILAGCSEV 24 (255) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 928999999999999999624688 No 40 >PRK03095 prsA peptidylprolyl isomerase; Reviewed Probab=52.37 E-value=11 Score=18.35 Aligned_cols=23 Identities=26% Similarity=0.271 Sum_probs=16.5 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 94046899999999860466533 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~ 23 (239) |+...+.+..+.++.|+||+.+. T Consensus 1 MKK~~la~~~~svl~LaaC~~~~ 23 (287) T PRK03095 1 MKKAMLALAATSVIALSACGTSS 23 (287) T ss_pred CCHHHHHHHHHHHHHHHHCCCCC T ss_conf 90789999999999998516899 No 41 >PRK13835 conjugal transfer protein TrbH; Provisional Probab=52.37 E-value=13 Score=17.92 Aligned_cols=24 Identities=21% Similarity=0.372 Sum_probs=17.0 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 046899999999860466533222 Q gi|254780589|r 3 NRYNILLFALFIFLNGCHHSRSLI 26 (239) Q Consensus 3 ~R~~~Llfal~i~L~gc~~~~~~~ 26 (239) +|.+++++.+.++|+||+.+.... T Consensus 1 mr~ll~~~i~a~~lsgCqta~dgl 24 (144) T PRK13835 1 MRRLLAACILALLLSGCQTATDGL 24 (144) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 955999999999864041368864 No 42 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=49.93 E-value=20 Score=16.84 Aligned_cols=20 Identities=40% Similarity=0.770 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHCCCCCCC Q ss_conf 46899999999860466533 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~ 23 (239) +.++++++++++|+||++.. T Consensus 2 ~~~~~~~~~~~~l~sCs~~~ 21 (235) T TIGR03302 2 LLLILLLALLLLLAGCSSKK 21 (235) T ss_pred HHHHHHHHHHHHHHHCCCCC T ss_conf 79999999999998738997 No 43 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=49.34 E-value=17 Score=17.36 Aligned_cols=21 Identities=24% Similarity=0.496 Sum_probs=14.0 Q ss_pred CCCHHHHHHHHHHHHHCCCCCC Q ss_conf 9404689999999986046653 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHS 22 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~ 22 (239) |+.+ .+++++++++|.||... T Consensus 1 MkK~-i~~il~~~llL~GCs~m 21 (304) T pfam07901 1 MRKL-IKLILIATLLLSGCSTT 21 (304) T ss_pred CCHH-HHHHHHHHHHHCCCCCC T ss_conf 9018-99999999997133445 No 44 >PRK09810 entericidin A; Provisional Probab=47.31 E-value=15 Score=17.63 Aligned_cols=25 Identities=28% Similarity=0.603 Sum_probs=17.9 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 9404689999999986046653322 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSL 25 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~ 25 (239) |.+|...|++-...+|.||+.+... T Consensus 1 mmkrli~lil~~~~ll~gcntargf 25 (41) T PRK09810 1 MMKRLIVLVLLASTLLTGCNTARGF 25 (41) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 9389999999999998256212212 No 45 >TIGR01433 CyoA ubiquinol oxidase, subunit II; InterPro: IPR006333 Members of this domain catalyze the oxidation of ubiquinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. Subunit II is responsible for binding and oxidation of the ubiquinone substrate. This sequence is closely related to QoxA, which oxidizes quinol in Gram-positive bacteria but which is in complex with subunits which utilise cytochrome a in the reduction of molecular oxygen. Slightly more distantly related is subunit II of cytochrome c oxidase which uses cytochrome c as the oxidant . ; GO: 0016682 oxidoreductase activity acting on diphenols and related substances as donors oxygen as acceptor, 0006118 electron transport, 0016020 membrane. Probab=47.13 E-value=16 Score=17.40 Aligned_cols=30 Identities=27% Similarity=0.376 Sum_probs=16.9 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCHH Q ss_conf 999999998604665332223444675545 Q gi|254780589|r 7 ILLFALFIFLNGCHHSRSLIDKAEHPLSEN 36 (239) Q Consensus 7 ~Llfal~i~L~gc~~~~~~~~~~~~pl~~~ 36 (239) ++++++.++|+||+...-...+..--+..+ T Consensus 1 L~~~~~~~lLsGCNn~~LL~PkG~ig~eer 30 (228) T TIGR01433 1 LSLILASLLLSGCNNLVLLDPKGQIGLEER 30 (228) T ss_pred CHHHHHHHHHHCCCCCCCCCCCCCEEHHHH T ss_conf 857789998612567655388886530246 No 46 >PRK09837 copper/silver efflux system outer membrane protein CusC; Provisional Probab=47.00 E-value=22 Score=16.67 Aligned_cols=32 Identities=19% Similarity=0.454 Sum_probs=17.4 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHH Q ss_conf 468999999998604665332223444675545 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSRSLIDKAEHPLSEN 36 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~ 36 (239) ++-+|.++++++|+||... ....++..|+|.. T Consensus 3 ~~rll~l~~~l~L~gC~~~-P~y~~P~~~~p~~ 34 (460) T PRK09837 3 PCKLLPFCVALALTGCSLA-PDYQRPAMPVPQQ 34 (460) T ss_pred HHHHHHHHHHHHHCCCCCC-CCCCCCCCCCCCC T ss_conf 5789999999998247789-9999969998642 No 47 >pfam10543 ORF6N ORF6N domain. This domain was identified by Iyer and colleagues. Probab=46.15 E-value=10 Score=18.71 Aligned_cols=52 Identities=15% Similarity=0.382 Sum_probs=41.2 Q ss_pred CCCEEECHHHHHHHHHH----HHHHHHHCCCCCEEEEEECCCCCHHHHHHHHCCCCH Q ss_conf 67187064669999999----999997508561116995556898898754258673 Q gi|254780589|r 150 AGCYAMNNKQMQEIYAI----VRDSLRGNMQSHIQIQAFPFRMTSKNMQLYQNNPNY 202 (239) Q Consensus 150 ~GC~amtd~~i~EIy~~----v~~a~~~~gQ~~~~v~~~Pfrmt~~n~~~~~~~~~~ 202 (239) .|+=.|||.+++|+|.. +..+++.+ +..|+--.+=|..|.+.++..+..... T Consensus 8 rgqrV~t~~~LA~lyg~~t~~i~~~~~rN-~~RF~eg~~~F~L~~~E~~~l~~~~~~ 63 (87) T pfam10543 8 RGQRVMTDEDLAELYGVETGTINQAFKRN-RDRFPEGKFYFELTGEELKELRSQFVT 63 (87) T ss_pred CCEEEEEHHHHHHHHCCCHHHHHHHHHHH-HHHCCCCCEEEEECHHHHHHHHHHHCC T ss_conf 89689778999999596857999999985-764778747999638999999986130 No 48 >PRK10523 lipoprotein involved with copper homeostasis and adhesion; Provisional Probab=46.06 E-value=24 Score=16.38 Aligned_cols=73 Identities=19% Similarity=0.195 Sum_probs=33.8 Q ss_pred CCCHHHHHHHH-HHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECC--CCEEEEEEECCCCCEEEE Q ss_conf 94046899999-999860466533222344467554567777751877578479999837--869999860788966899 Q gi|254780589|r 1 MTNRYNILLFA-LFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKN--ENILEIWKRNVDAEYVLL 77 (239) Q Consensus 1 M~~R~~~Llfa-l~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~--e~~Lel~~~~~~g~~~l~ 77 (239) |++..+.++++ -++.|.||+..... ...+|.....+..|.+ + ..-++=..+=+ +-.|.|| .||.|.+- T Consensus 2 mkk~~~~~~~a~g~~~l~GC~~r~~~--~~~~~~~~~~lkpm~q---s-f~GvLPCADCsGI~ttL~L~---~DGtfvl~ 72 (234) T PRK10523 2 MKKAILTALAAVGLFALMGCNNRAEV--DTLSPAQAAELKPMQQ---S-WRGVLPCADCEGIDTSLFLE---KDGTWVMN 72 (234) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCC--CCCCCCHHHCCCCHHH---H-HCCEEECCCCCCCEEEEEEC---CCCCEEEE T ss_conf 05899999999999987634665434--6667532202670335---5-34602034667843589986---89878977 Q ss_pred EEEEE Q ss_conf 98652 Q gi|254780589|r 78 KEYKI 82 (239) Q Consensus 78 ktY~I 82 (239) .+|-= T Consensus 73 ~~YlG 77 (234) T PRK10523 73 ERYLG 77 (234) T ss_pred EEEEC T ss_conf 78704 No 49 >COG4851 CamS Protein involved in sex pheromone biosynthesis [General function prediction only] Probab=45.63 E-value=15 Score=17.56 Aligned_cols=23 Identities=26% Similarity=0.378 Sum_probs=18.9 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 94046899999999860466533 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~ 23 (239) |++...+++.|+.++|+||.+-. T Consensus 1 Mkktl~i~~ta~vliLs~C~~~~ 23 (382) T COG4851 1 MKKTLGIAATASVLILSGCFPFV 23 (382) T ss_pred CCHHHHHHHHHHHHHHHHCCCCC T ss_conf 94015589999999986116766 No 50 >PRK04792 tolB translocation protein TolB; Provisional Probab=43.77 E-value=23 Score=16.48 Aligned_cols=21 Identities=24% Similarity=0.374 Sum_probs=15.0 Q ss_pred CCCHHHHHHHHHHHHHCCCCC Q ss_conf 940468999999998604665 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHH 21 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~ 21 (239) |.+|+++.+|.++++++.+.. T Consensus 1 M~k~~~~~l~lll~~~s~~A~ 21 (450) T PRK04792 1 MLKRLILGLFVLLLSFSQVAN 21 (450) T ss_pred CCHHHHHHHHHHHHHHHHHHE T ss_conf 938999999999998657520 No 51 >TIGR02607 antidote_HigA addiction module antidote protein, HigA family; InterPro: IPR013430 Proteins in this entry form a distinct group of helix-turn-helix proteins, which are strictly bacterial and nearly always shorter than 110 amino acids. They include the characterised member HigA, without which the killer protein HigB cannot be cloned. The hig (host inhibition of growth) system is noted to be unusual in that killer protein is encoded by the upstream member of the gene pair.; GO: 0003677 DNA binding. Probab=43.17 E-value=18 Score=17.14 Aligned_cols=29 Identities=14% Similarity=0.216 Sum_probs=20.0 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 889875425867378985356888999858 Q gi|254780589|r 190 SKNMQLYQNNPNYSFWNMLKLGHDYFEKNH 219 (239) Q Consensus 190 ~~n~~~~~~~~~~~FW~~Lk~gyd~Fe~~~ 219 (239) +-.+++.-. ..-+||-+|+..||.-+..+ T Consensus 52 ALRL~~~lG-~sp~~WLnlQ~~YDL~~a~~ 80 (81) T TIGR02607 52 ALRLAKALG-TSPEFWLNLQNAYDLWIAEQ 80 (81) T ss_pred HHHHHHHHC-CCHHHHHHHHHHHHHHHHHC T ss_conf 999999738-98768998774899999841 No 52 >TIGR00668 apaH bis(5'-nucleosyl)-tetraphosphatase (symmetrical); InterPro: IPR004617 Diadenosine 5',5"'-P1,P4-tetraphosphate (Ap4A) is a regulatory metabolite of stress conditions. It is hydrolyzed to two ADP by Bis(5'-nucleosyl)-tetraphosphatase (symmetrical) (3.6.1.41 from EC). Alternate names include diadenosine-tetraphosphatase and Ap4A hydrolase .; GO: 0008803 bis(5'-nucleosyl)-tetraphosphatase (symmetrical) activity. Probab=42.65 E-value=7.3 Score=19.57 Aligned_cols=33 Identities=39% Similarity=0.538 Sum_probs=24.9 Q ss_pred CCCEEEEEEEEECCCCCCCCCHHCCCCCCCCCEEEEEEEC Q ss_conf 8966899986521323898870101787475079998401 Q gi|254780589|r 71 DAEYVLLKEYKICAWSGTFGPKIETGDEQAPEGFYYIGWN 110 (239) Q Consensus 71 ~g~~~l~ktY~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~~ 110 (239) +++ +.++|+|.. ||=.-=-|.-||+|+|.++++ T Consensus 214 ~~P--~~~~~~I~F-----GHWASL~~~~TP~gIYaLDTG 246 (280) T TIGR00668 214 DNP--LYKEYSIVF-----GHWASLVDEGTPEGIYALDTG 246 (280) T ss_pred CCC--HHHCCCEEE-----EEEEECCCCCCCCCCCCCCCC T ss_conf 863--222455033-----022311788788743213354 No 53 >PRK10449 heat-inducible protein; Provisional Probab=42.26 E-value=18 Score=17.18 Aligned_cols=35 Identities=17% Similarity=0.428 Sum_probs=23.6 Q ss_pred CCCCEEECCCCCCC-CCEEECHHHHHHHHHHHHHHHHHC Q ss_conf 98826871677866-718706466999999999999750 Q gi|254780589|r 137 TGADLMIHGECASA-GCYAMNNKQMQEIYAIVRDSLRGN 174 (239) Q Consensus 137 tG~~I~IHG~~~S~-GC~amtd~~i~EIy~~v~~a~~~~ 174 (239) .++.|.|.+--+|. .| .|+++.++=..+-++|..+ T Consensus 77 ~~~~L~v~~lasT~m~C---~dp~l~~~E~~i~~~L~~~ 112 (140) T PRK10449 77 SDGELTVKGLAMTRMMC---ADPQLNELDNTISEMLKKG 112 (140) T ss_pred CCCEEEECHHHCCCCCC---CCHHHHHHHHHHHHHHHCC T ss_conf 59879983011005468---9868999999999996247 No 54 >KOG3935 consensus Probab=41.80 E-value=10 Score=18.62 Aligned_cols=19 Identities=42% Similarity=0.782 Sum_probs=16.1 Q ss_pred HCCCCHHHHHHHHHHHHHH Q ss_conf 2586737898535688899 Q gi|254780589|r 197 QNNPNYSFWNMLKLGHDYF 215 (239) Q Consensus 197 ~~~~~~~FW~~Lk~gyd~F 215 (239) +++.+|.||++||.|-.+. T Consensus 409 ~n~DSynF~~~l~gGahhi 427 (446) T KOG3935 409 QNSDSYNFWRQLKGGAHHI 427 (446) T ss_pred HCCCCHHHHHHHCCCCCEE T ss_conf 6375077999852665346 No 55 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=41.46 E-value=19 Score=16.97 Aligned_cols=20 Identities=25% Similarity=0.593 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHHCCCCCCC Q ss_conf 46899999999860466533 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~ 23 (239) |.+++++++++.|+||++.. T Consensus 2 kk~~~l~~~~l~LagCas~~ 21 (26) T pfam08139 2 KKLLLLLLALLLLAGCASXX 21 (26) T ss_pred HHHHHHHHHHHHHHCCCCCC T ss_conf 66999999999982332001 No 56 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=41.26 E-value=20 Score=16.81 Aligned_cols=23 Identities=13% Similarity=0.180 Sum_probs=15.6 Q ss_pred CCCHHHHHHHH--HHHHHCCCCCCC Q ss_conf 94046899999--999860466533 Q gi|254780589|r 1 MTNRYNILLFA--LFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llfa--l~i~L~gc~~~~ 23 (239) |+.++++.+.+ .++.|+||+... T Consensus 1 MKKKl~~~~~~~~svl~LaaCs~~~ 25 (310) T PRK01326 1 MKKKLIAGAVTLLSVATLAACSKTN 25 (310) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 9088999999999999998716999 No 57 >PRK13731 conjugal transfer surface exclusion protein TraT; Provisional Probab=41.08 E-value=29 Score=15.90 Aligned_cols=98 Identities=17% Similarity=0.264 Sum_probs=49.7 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEECCCCCEEEEEEE Q ss_conf 94046899999999860466533222344467554567777751877578479999837869999860788966899986 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEIWKRNVDAEYVLLKEY 80 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~~~~~~g~~~l~ktY 80 (239) |++-++..+.+.+++|+||........+-+..+. ..|+ +-|++.= .++|+..|=.+|..++-. T Consensus 3 ~kk~~~~~~~~~~l~lsGC~a~~tai~krnL~Vq----TkMS-------~TIfLdP-~s~ktVyv~vrNTS~~~~----- 65 (243) T PRK13731 3 TKKLMMVALVSSTLALSGCGAMSTAIKKRNLEVK----TQMS-------ETIWLEP-ASERTVFLQIKNTSDKDM----- 65 (243) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCEE----CEEC-------CCEEECC-CCCCEEEEEEECCCCCCH----- T ss_conf 3775799999999997156876778773575222----2000-------4388668-545579999851787724----- Q ss_pred EECCCCCCCCCHHCCCCCCCCCEEEEEEECCCCCCCCCC-EEEECCCC Q ss_conf 521323898870101787475079998401258757652-14207998 Q gi|254780589|r 81 KICAWSGTFGPKIETGDEQAPEGFYYIGWNNLNPNSKYF-LSINIGFP 127 (239) Q Consensus 81 ~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~~~~np~S~y~-ls~~i~YP 127 (239) ..+.+|..+ +.++ .| |.|.. ||+..+| |.-|+=|- T Consensus 66 ------~~l~~~i~~-~L~a-kG-Y~iv~---~P~~A~Y~LQaNVL~~ 101 (243) T PRK13731 66 ------SGLQGKIAD-AVKA-KG-YQVVT---SPDKAYYWIQANVLKA 101 (243) T ss_pred ------HHHHHHHHH-HHHH-CC-EEEEC---CHHHCEEEEEEEEHHC T ss_conf ------567899999-9985-89-09867---8657607727424023 No 58 >PRK00249 flgH flagellar basal body L-ring protein; Reviewed Probab=41.08 E-value=25 Score=16.23 Aligned_cols=23 Identities=26% Similarity=0.452 Sum_probs=16.5 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 94046899999999860466533 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~ 23 (239) |.+..+.++++++++|+||.... T Consensus 1 ~~~~~~~~~~~~~~~LsgCa~~~ 23 (231) T PRK00249 1 MAMMLIALALLLLLLLSGCASIP 23 (231) T ss_pred CCCHHHHHHHHHHHHHHHCCCCC T ss_conf 93389999999999987534899 No 59 >COG2980 RlpB Rare lipoprotein B [Cell envelope biogenesis, outer membrane] Probab=39.81 E-value=30 Score=15.78 Aligned_cols=79 Identities=18% Similarity=0.224 Sum_probs=40.5 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCC--------------CCCCCCCCHHHHHHHHHCCCC-----CCCCEEEE--EECC Q ss_conf 94046899999999860466533222--------------344467554567777751877-----57847999--9837 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSLI--------------DKAEHPLSENLIISMQKKRTS-----PFHPTVIR--IFKN 59 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~~--------------~~~~~pl~~~~~~~~~~~~~~-----~~~pi~Ir--v~K~ 59 (239) |.+-..+++.+++.+|+||+.-.... .....|+.-.+...+...+.. .+.|++-. +--. T Consensus 1 M~~L~~~lL~~a~~~L~aCGFhLRg~~~ip~~l~~l~l~s~d~y~~l~r~vrr~L~~n~v~vv~~~~~~p~Lri~se~~~ 80 (178) T COG2980 1 MRKLKTLLLLAAVLLLAACGFHLRGTTQIPPELKTLSLESSDPYGPLTRAVRRQLRLNGVDVVDTAKDVPVLRILSESTS 80 (178) T ss_pred CCCHHHHHHHHHHHHHHHCCEEECCCCCCCCCHHHEEEECCCCCCHHHHHHHHHHHHCCCEEEECCCCCCEEEECCEECC T ss_conf 90058999999999973345343178889941133155436898768999999999849645314778607886022215 Q ss_pred CCEEEEEEECCCCCEEEEEE Q ss_conf 86999986078896689998 Q gi|254780589|r 60 ENILEIWKRNVDAEYVLLKE 79 (239) Q Consensus 60 e~~Lel~~~~~~g~~~l~kt 79 (239) +++..+++++...+|.+.-+ T Consensus 81 ~~t~Si~~~gr~aEy~L~~~ 100 (178) T COG2980 81 QRTASIFRQGRAAEYQLTLT 100 (178) T ss_pred CCEEEEEHHHHHHHHHEEEE T ss_conf 53278632146665612468 No 60 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=39.35 E-value=24 Score=16.42 Aligned_cols=11 Identities=36% Similarity=0.631 Sum_probs=8.2 Q ss_pred CCCCEEECCCC Q ss_conf 98826871677 Q gi|254780589|r 137 TGADLMIHGEC 147 (239) Q Consensus 137 tG~~I~IHG~~ 147 (239) +.|.+.|+|.- T Consensus 148 pNGNL~I~G~k 158 (224) T PRK12698 148 PNGNLVIRGEK 158 (224) T ss_pred CCCCEEEEEEE T ss_conf 69809999989 No 61 >COG3056 Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=38.92 E-value=22 Score=16.59 Aligned_cols=18 Identities=11% Similarity=0.482 Sum_probs=12.9 Q ss_pred CCCCCEEECHHHHHHHHH Q ss_conf 866718706466999999 Q gi|254780589|r 148 ASAGCYAMNNKQMQEIYA 165 (239) Q Consensus 148 ~S~GC~amtd~~i~EIy~ 165 (239) +-.|-+.-+|++|+..-. T Consensus 162 ~~~GafsAsN~eI~~vln 179 (204) T COG3056 162 NVEGAFSASNDEIADVLN 179 (204) T ss_pred EEEEEEECCCHHHHHHHH T ss_conf 666554058867999999 No 62 >PRK11063 metQ DL-methionine transporter substrate-binding subunit; Provisional Probab=38.48 E-value=27 Score=16.11 Aligned_cols=24 Identities=21% Similarity=0.220 Sum_probs=15.6 Q ss_pred CCCHHHHHHHHH--HHHHCCCCCCCC Q ss_conf 940468999999--998604665332 Q gi|254780589|r 1 MTNRYNILLFAL--FIFLNGCHHSRS 24 (239) Q Consensus 1 M~~R~~~Llfal--~i~L~gc~~~~~ 24 (239) |+++.|+++.++ ++.|+||+...+ T Consensus 3 ~~~k~~~~~~~l~~~l~l~gCg~~~~ 28 (271) T PRK11063 3 FKFKTFAAVGALIGSLALVGCGQDEK 28 (271) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 02999999999999997500589856 No 63 >COG4314 NosL Predicted lipoprotein involved in nitrous oxide reduction [Energy production and conversion] Probab=38.28 E-value=27 Score=16.03 Aligned_cols=21 Identities=24% Similarity=0.561 Sum_probs=14.9 Q ss_pred CHHHHHHHHHHHHHCCCCCCC Q ss_conf 046899999999860466533 Q gi|254780589|r 3 NRYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 3 ~R~~~Llfal~i~L~gc~~~~ 23 (239) +|.+++.++..++|+||..+. T Consensus 2 kr~Lla~la~~~llAgC~~~e 22 (176) T COG4314 2 KRTLLAILAVTALLAGCRQAE 22 (176) T ss_pred CHHHHHHHHHHHHHHHCCHHH T ss_conf 403799999999987534311 No 64 >COG5645 Predicted periplasmic lipoprotein [General function prediction only] Probab=38.12 E-value=17 Score=17.23 Aligned_cols=28 Identities=29% Similarity=0.421 Sum_probs=17.6 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 0468999999998604665332223444 Q gi|254780589|r 3 NRYNILLFALFIFLNGCHHSRSLIDKAE 30 (239) Q Consensus 3 ~R~~~Llfal~i~L~gc~~~~~~~~~~~ 30 (239) +|.+++.++..+.|+||++..+.....+ T Consensus 1 mr~i~l~l~v~lllSGC~SV~s~t~~~e 28 (80) T COG5645 1 MRNILLSLMVLLLLSGCGSVISLTIPKE 28 (80) T ss_pred CCEEHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 9153278999998376410377426888 No 65 >TIGR02733 desat_CrtD C-3',4' desaturase CrtD; InterPro: IPR014104 Members of this family are encoded by slr1293, a carotenoid biosynthesis protein which was shown to be the C-3',4' desaturase (CrtD) of myxoxanthophyll biosynthesis in Synechocystis sp. (strain PCC 6803), and close homologues (presumed to be functionally equivalent) from other cyanobacteria, where myxoxanthophyll biosynthesis is either known or expected. This enzyme can act on neurosporene and so presumably catalyses the first step that is committed to myxoxanthophyll.. Probab=38.03 E-value=18 Score=17.12 Aligned_cols=70 Identities=16% Similarity=0.139 Sum_probs=36.2 Q ss_pred CCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEECCCCCEEEEEEEEECCCCCCCCC-------HHCCCCCCCCCEEE Q ss_conf 55456777775187757847999983786999986078896689998652132389887-------01017874750799 Q gi|254780589|r 33 LSENLIISMQKKRTSPFHPTVIRIFKNENILEIWKRNVDAEYVLLKEYKICAWSGTFGP-------KIETGDEQAPEGFY 105 (239) Q Consensus 33 l~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~~~~~~g~~~l~ktY~Ic~~sg~lGp-------K~~eGD~kTPEG~Y 105 (239) ++..=..+++..-...++=|+=-..|++..=. +--++++.+-.+. ||+|. =-+|||+|+|.|.- T Consensus 305 ~~~~Yr~Rl~~LP~pSGA~V~Y~gvk~~alP~----~cp~HlQf~~D~~-----GP~gENNSLFVSiS~egDGRAP~G~~ 375 (499) T TIGR02733 305 LPPGYRKRLKKLPEPSGAFVLYLGVKRAALPV----DCPPHLQFLSDHQ-----GPIGENNSLFVSISQEGDGRAPAGEA 375 (499) T ss_pred CCHHHHHHHHHCCCCCCEEEEEECCCHHHCCC----CCCCCCEEECCCC-----CCCCCCCCEEEEECCCCCCCCCCCCE T ss_conf 66147999972778897288600013333777----7878604500788-----88200364027865786888775753 Q ss_pred EEEECC Q ss_conf 984012 Q gi|254780589|r 106 YIGWNN 111 (239) Q Consensus 106 ~i~~~~ 111 (239) +|-... T Consensus 376 TlIAS~ 381 (499) T TIGR02733 376 TLIASS 381 (499) T ss_pred EEEEEE T ss_conf 678742 No 66 >PRK11023 hypothetical protein; Provisional Probab=37.71 E-value=25 Score=16.25 Aligned_cols=21 Identities=19% Similarity=0.384 Sum_probs=15.5 Q ss_pred CCCHHHHHHHHHHHHHCCCCC Q ss_conf 940468999999998604665 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHH 21 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~ 21 (239) |+.-..++++.++++|+||.+ T Consensus 1 mk~~~~l~~l~~~~~L~GC~~ 21 (191) T PRK11023 1 MKALSPLAVLISALLLQGCVA 21 (191) T ss_pred CCHHHHHHHHHHHHHHCCCHH T ss_conf 944889999999999726823 No 67 >PRK11459 multidrug resistance outer membrane protein MdtQ; Provisional Probab=37.65 E-value=32 Score=15.57 Aligned_cols=22 Identities=32% Similarity=0.523 Sum_probs=14.4 Q ss_pred CCCHHHHHH---HHHHHHHCCCCCC Q ss_conf 940468999---9999986046653 Q gi|254780589|r 1 MTNRYNILL---FALFIFLNGCHHS 22 (239) Q Consensus 1 M~~R~~~Ll---fal~i~L~gc~~~ 22 (239) |+++.|+-. ++++++|+||... T Consensus 1 ~~~~~f~~~~a~l~~~llLaGCa~~ 25 (478) T PRK11459 1 MNRDSFYAATASLPLFILLAGCAPM 25 (478) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCC T ss_conf 9974058889999999998378899 No 68 >COG3093 VapI Plasmid maintenance system antidote protein [General function prediction only] Probab=37.50 E-value=26 Score=16.17 Aligned_cols=30 Identities=17% Similarity=0.252 Sum_probs=21.4 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9875425867378985356888999858973 Q gi|254780589|r 192 NMQLYQNNPNYSFWNMLKLGHDYFEKNHQEP 222 (239) Q Consensus 192 n~~~~~~~~~~~FW~~Lk~gyd~Fe~~~~~p 222 (239) .+++. ....-+||.+|+..||.-+....+- T Consensus 57 rL~k~-fGtspefWlNlQ~~ydL~~~~~~~~ 86 (104) T COG3093 57 RLAKV-FGTSPEFWLNLQNAYDLWEAENAVR 86 (104) T ss_pred HHHHH-HCCCHHHHHHHHHHHHHHHHHCCHH T ss_conf 99999-2998999999999988999971425 No 69 >KOG3135 consensus Probab=37.40 E-value=25 Score=16.28 Aligned_cols=54 Identities=19% Similarity=0.329 Sum_probs=31.7 Q ss_pred CCCEEEEEEECC--------CCCCCCC----CEEEECCCCCHHHHHCCCCCCCEEECCCC-CCCCCEEECH Q ss_conf 750799984012--------5875765----21420799898899729998826871677-8667187064 Q gi|254780589|r 100 APEGFYYIGWNN--------LNPNSKY----FLSINIGFPNEFDKAHNRTGADLMIHGEC-ASAGCYAMNN 157 (239) Q Consensus 100 TPEG~Y~i~~~~--------~np~S~y----~ls~~i~YPN~~Dr~~grtG~~I~IHG~~-~S~GC~amtd 157 (239) -|-|++.-+..+ ++--|.. -+...|||-|.+--.-.-.- |||.| |-+||||=.| T Consensus 107 K~AG~F~Stgs~gGgqE~talta~t~LvHHGmifVPlGYkn~~a~m~~me~----V~Ggsp~GAGt~Ag~D 173 (203) T KOG3135 107 KPAGIFVSTGSQGGGQETTALTAITQLVHHGMIFVPLGYKNFGAEMFEMEE----VHGGSPWGAGTFAGID 173 (203) T ss_pred CCEEEEEECCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHC----CCCCCCCCCCEEECCC T ss_conf 730489852677885486899999999865528985141045554421321----3588987774034587 No 70 >KOG3413 consensus Probab=37.40 E-value=30 Score=15.79 Aligned_cols=59 Identities=22% Similarity=0.404 Sum_probs=39.0 Q ss_pred EEEEECCCCCCCCCHHCCCCCCCCCEEEEEEECCCCCCCCCCEEEECCCCCHHHHHCCCCCCCEEECCC Q ss_conf 986521323898870101787475079998401258757652142079989889972999882687167 Q gi|254780589|r 78 KEYKICAWSGTFGPKIETGDEQAPEGFYYIGWNNLNPNSKYFLSINIGFPNEFDKAHNRTGADLMIHGE 146 (239) Q Consensus 78 ktY~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~~~~np~S~y~ls~~i~YPN~~Dr~~grtG~~I~IHG~ 146 (239) ..|.|....|-|-=|. | --|.|-|+.. .||-+-.||=+++=|+.||-.+ .|..|-.|-. T Consensus 72 ~~~Dv~y~~GVLTl~l--g----~~GTYViNKQ--~PnkQIWlSSP~SGPkryD~~~--~~~Wiy~r~g 130 (156) T KOG3413 72 EGFDVDYADGVLTLKL--G----SVGTYVINKQ--PPNKQIWLSSPVSGPKRYDLCR--AGEWIYLRDG 130 (156) T ss_pred CCCCCCCCCCEEEEEE--C----CCEEEEECCC--CCCCEEEEECCCCCCCCCCCCC--CCCEEEECCC T ss_conf 6666320565389973--6----8435886378--9754046538888973222015--6745883387 No 71 >pfam05590 DUF769 Xylella fastidiosa protein of unknown function (DUF769). This family consists of several uncharacterized hypothetical proteins of unknown function from Xylella fastidiosa, the organism that causes Pierce's disease in plants. Probab=36.70 E-value=32 Score=15.62 Aligned_cols=31 Identities=23% Similarity=0.485 Sum_probs=20.6 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 0468999999998604665332223444675 Q gi|254780589|r 3 NRYNILLFALFIFLNGCHHSRSLIDKAEHPL 33 (239) Q Consensus 3 ~R~~~Llfal~i~L~gc~~~~~~~~~~~~pl 33 (239) -|+...++|..++|+||++..-+..+...|. T Consensus 5 prL~~SllaAslLLagCSsgPpIDsrTGKPM 35 (279) T pfam05590 5 PRLSCSLLAASLLLAGCSSGPPIDSRTGKPM 35 (279) T ss_pred CCHHHHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 6401699999999723788998667779843 No 72 >pfam08085 Entericidin Entericidin EcnA/B family. This family consists of the entericidin antidote/toxin peptides. The entericidin locus is activated in stationary phase under high osmolarity conditions by rho-S and simultaneously repressed by the osmoregulatory EnvZ/OmpR signal transduction pathway. The entericidin locus encodes tandem paralogous genes (ecnAB) and directs the synthesis of two small cell-envelope lipoproteins which can maintain plasmids in bacterial population by means of post-segregational killing. Probab=36.61 E-value=25 Score=16.24 Aligned_cols=23 Identities=30% Similarity=0.508 Sum_probs=12.2 Q ss_pred CCCHHHHHHH--HHHHHHCCCCCCC Q ss_conf 9404689999--9999860466533 Q gi|254780589|r 1 MTNRYNILLF--ALFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llf--al~i~L~gc~~~~ 23 (239) |+++..+++. .+++.|+||+... T Consensus 1 Mk~~~~~~~~l~~~~~~lagCNTv~ 25 (42) T pfam08085 1 MKKLIALLLALLLLALVLAGCNTVA 25 (42) T ss_pred CHHHHHHHHHHHHHHHHHHCCHHHH T ss_conf 9236999999999999986122211 No 73 >COG3417 FlgN Collagen-binding surface adhesin SpaP (antigen I/II family) [General function prediction only] Probab=36.53 E-value=32 Score=15.64 Aligned_cols=19 Identities=26% Similarity=0.583 Sum_probs=13.0 Q ss_pred CHHHHHHHHHHHHHCCCCC Q ss_conf 0468999999998604665 Q gi|254780589|r 3 NRYNILLFALFIFLNGCHH 21 (239) Q Consensus 3 ~R~~~Llfal~i~L~gc~~ 21 (239) ++.+..++.++++|+||.. T Consensus 4 ~k~~~~il~~al~l~GCs~ 22 (200) T COG3417 4 MKIYASILLLALFLSGCSS 22 (200) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 7999999999999841136 No 74 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=35.45 E-value=28 Score=15.94 Aligned_cols=22 Identities=23% Similarity=0.534 Sum_probs=15.4 Q ss_pred CCCHHHHHHHHHH--HHHCCCCCC Q ss_conf 9404689999999--986046653 Q gi|254780589|r 1 MTNRYNILLFALF--IFLNGCHHS 22 (239) Q Consensus 1 M~~R~~~Llfal~--i~L~gc~~~ 22 (239) |++++++++.+++ +.|+||+.. T Consensus 1 mkkK~i~~~~~~~svl~LaaC~~~ 24 (285) T PRK03002 1 MRGKHIFIITALISILMLSACGQK 24 (285) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 960189999999999999984579 No 75 >pfam04288 MukE MukE-like family. Bacterial protein involved in chromosome partitioning, MukE Probab=35.40 E-value=16 Score=17.42 Aligned_cols=104 Identities=24% Similarity=0.377 Sum_probs=52.7 Q ss_pred CCCCCEEEEEEECCCCCCCCCCEEEECCCCCHHHHHCCC------------CCCCEEECCC------------------- Q ss_conf 747507999840125875765214207998988997299------------9882687167------------------- Q gi|254780589|r 98 EQAPEGFYYIGWNNLNPNSKYFLSINIGFPNEFDKAHNR------------TGADLMIHGE------------------- 146 (239) Q Consensus 98 ~kTPEG~Y~i~~~~~np~S~y~ls~~i~YPN~~Dr~~gr------------tG~~I~IHG~------------------- 146 (239) .++|||||++-++. . .-++.+-=|+.|-.-|+ .-.+||.+.. T Consensus 57 i~apEgffYLrPr~-----t--tlIprsvLSELdMlvGkvLc~LyLsperLa~~gift~qeL~deLl~l~dE~kllklv~ 129 (227) T pfam04288 57 IRAPEGFFYLRPRS-----T--TLINRSVLSELDMLVGKVLCYLYLSPERLAHEGIFTNQELYDELLTLADEKKLLKLVN 129 (227) T ss_pred EECCCCEEEECCCC-----C--CCCCHHHHHHHHHHHHHHHHHHCCCHHHHHCCCEEEHHHHHHHHHHHHCHHHHHHHHH T ss_conf 97687427865665-----5--7532766448888998899997069989605881218999999997415799999982 Q ss_pred CCCCCCEEECHHHHHHHHHHHHHHHHHCCCCC--EE-E--EEECCCCCHHHH-----HHHHCCCCHHHHHHHHHHHH Q ss_conf 78667187064669999999999997508561--11-6--995556898898-----75425867378985356888 Q gi|254780589|r 147 CASAGCYAMNNKQMQEIYAIVRDSLRGNMQSH--IQ-I--QAFPFRMTSKNM-----QLYQNNPNYSFWNMLKLGHD 213 (239) Q Consensus 147 ~~S~GC~amtd~~i~EIy~~v~~a~~~~gQ~~--~~-v--~~~Pfrmt~~n~-----~~~~~~~~~~FW~~Lk~gyd 213 (239) ..+.| ||-+.+-|..=|+.+|+.= ++- |+ + ..=-||.|++-+ .|.-++|.-+--+....|+- T Consensus 130 ~R~~g----sDlD~~Kl~ekvr~sLrrL-rRlgmI~~l~~d~~kF~iteAvfRF~advR~~~d~~eaq~RlIrdGea 201 (227) T pfam04288 130 NRSSG----SDLDRQKLFEKVRTSLRRL-RRLGMVITIGDNSGKFRITESVFRFGADVRLGDDPREAQLRLIRDGEA 201 (227) T ss_pred CCCCC----CHHHHHHHHHHHHHHHHHH-HHCCCEEEECCCCCEEEECHHHHHCCCCCCCCCCHHHHHHHHHHCCCE T ss_conf 66886----5230999999999999999-854533312366660774178886055234678867999998852764 No 76 >pfam07269 consensus Probab=34.98 E-value=29 Score=15.90 Aligned_cols=20 Identities=40% Similarity=0.811 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCCC Q ss_conf 4689999999986046653322 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSRSL 25 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~~~ 25 (239) +.++|++.++ |+||+.++.. T Consensus 2 Ky~ll~l~l~--LAgC~T~d~l 21 (55) T pfam07269 2 KYCLLLLILA--LAGCQTNDVL 21 (55) T ss_pred HHHHHHHHHH--HHHCCCCCCC T ss_conf 0149999999--8751146711 No 77 >TIGR02887 spore_ger_x_C germination protein, Ger(x)C family; InterPro: IPR008844 The GerAA, -AB, and -AC proteins of the Bacillus subtilis spore are required for the germination response to L-alanine as the sole germinant. Members of GerAC family are thought to be located in the inner spore membrane. Although the function of this family is unclear, they are likely to encode the components of the germination apparatus that respond directly to this germinant, mediating the spore's response .; GO: 0009847 spore germination, 0016020 membrane. Probab=34.56 E-value=28 Score=15.98 Aligned_cols=20 Identities=30% Similarity=0.602 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHCCCCCCC Q ss_conf 46899999999860466533 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~ 23 (239) +.++++++++++|+||-... T Consensus 3 ~~~~~~~~~~~lL~GCwd~~ 22 (400) T TIGR02887 3 LKILLLILALLLLTGCWDKR 22 (400) T ss_pred HHHHHHHHHHHHHCCCCCCH T ss_conf 66789999999832665620 No 78 >PRK05256 condesin subunit E; Provisional Probab=34.32 E-value=17 Score=17.23 Aligned_cols=57 Identities=19% Similarity=0.279 Sum_probs=30.0 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCC--EE-E--EEECCCCCHHHH-----HHHHCCCCHHHHHHHHHHHH Q ss_conf 64669999999999997508561--11-6--995556898898-----75425867378985356888 Q gi|254780589|r 156 NNKQMQEIYAIVRDSLRGNMQSH--IQ-I--QAFPFRMTSKNM-----QLYQNNPNYSFWNMLKLGHD 213 (239) Q Consensus 156 td~~i~EIy~~v~~a~~~~gQ~~--~~-v--~~~Pfrmt~~n~-----~~~~~~~~~~FW~~Lk~gyd 213 (239) ||-+.+-|..=|+.+|+.= ++- |+ + ..=-|+.|++-+ .|.-+++.-+--+..++|+- T Consensus 142 sDlD~~Kl~ekvr~sL~rL-~Rlg~I~~l~~d~~kf~it~AvfRF~a~vR~~dd~~Eaq~RLIrdGea 208 (230) T PRK05256 142 SDLDRQKLQEKVRTSLRRL-RRLGMVWFMGHDSSKFRITESVFRFGADVRSGDDPREAQLRLIRDGEA 208 (230) T ss_pred CHHHHHHHHHHHHHHHHHH-HHHHHHHHHCCCCCEEEECHHHHHHCCCCCCCCCHHHHHHHHHHCCCC T ss_conf 4442999999999999999-872179885066440553278886044123568866899999862661 No 79 >COG3133 SlyB Outer membrane lipoprotein [Cell envelope biogenesis, outer membrane] Probab=33.92 E-value=33 Score=15.53 Aligned_cols=26 Identities=19% Similarity=0.299 Sum_probs=20.3 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 94046899999999860466533222 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSLI 26 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~~ 26 (239) |++..+.+...+++.|.||......+ T Consensus 1 M~k~~~avA~~~al~L~GCan~d~~S 26 (154) T COG3133 1 MIKTALAVAALMALSLTGCANNDTLS 26 (154) T ss_pred CCHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 94257899999887400110587436 No 80 >COG4669 EscJ Type III secretory pathway, lipoprotein EscJ [Intracellular trafficking and secretion] Probab=33.73 E-value=38 Score=15.18 Aligned_cols=68 Identities=16% Similarity=0.182 Sum_probs=35.7 Q ss_pred EEEECCCCCHHHHHCCCCCCCEEECCCCCCCCCEEEC------HHHHHHHHHHHHHHHHHCCCCCEEEEEEC--CCCCHH Q ss_conf 1420799898899729998826871677866718706------46699999999999975085611169955--568988 Q gi|254780589|r 120 LSINIGFPNEFDKAHNRTGADLMIHGECASAGCYAMN------NKQMQEIYAIVRDSLRGNMQSHIQIQAFP--FRMTSK 191 (239) Q Consensus 120 ls~~i~YPN~~Dr~~grtG~~I~IHG~~~S~GC~amt------d~~i~EIy~~v~~a~~~~gQ~~~~v~~~P--frmt~~ 191 (239) -+.+|+||+..| .|++ ..++|+--+... +..+-.|=.+|.+++.|=--.+|-|-.-| +++-.. T Consensus 127 ArV~I~lp~~~~--~g~~-------~~P~saSVfIky~~~~nl~~~v~~IK~LV~nSv~gL~YenISVVl~~~~~~~~~~ 197 (246) T COG4669 127 ARVHISLPEDDD--EGKN-------ALPSSASVFIKYSPDVNLSIYVSQIKRLVANSVPGLQYENISVVLVPASDSDPAI 197 (246) T ss_pred EEEEEECCCCCC--CCCC-------CCCCEEEEEEEECCCCCHHHHHHHHHHHHHHCCCCCCHHCEEEEEEECCCCCCCC T ss_conf 889997577786--6788-------8874068999956889816768999999981567776000699982044344446 Q ss_pred HHHHH Q ss_conf 98754 Q gi|254780589|r 192 NMQLY 196 (239) Q Consensus 192 n~~~~ 196 (239) +-.+- T Consensus 198 ~~~~~ 202 (246) T COG4669 198 PGPRA 202 (246) T ss_pred CCCCC T ss_conf 67543 No 81 >PRK11138 outer membrane protein assembly complex subunit YfgL; Provisional Probab=32.94 E-value=39 Score=15.10 Aligned_cols=23 Identities=26% Similarity=0.233 Sum_probs=11.7 Q ss_pred CCCHHHHHHHH-HHHHHCCCCCCC Q ss_conf 94046899999-999860466533 Q gi|254780589|r 1 MTNRYNILLFA-LFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llfa-l~i~L~gc~~~~ 23 (239) |..|.+++... .+.+|+||.... T Consensus 1 ~~~~~~~~~~l~~~~ll~gCs~~~ 24 (394) T PRK11138 1 MQLRKLLLPGLLSVALLSGCSLFD 24 (394) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 951677999999999986523788 No 82 >TIGR02422 protocat_beta protocatechuate 3,4-dioxygenase, beta subunit; InterPro: IPR012785 Protocatechuate (3,4-dihydroxybenzene, PCA) is an aromatic compound which is a key intermediate in the degradation of the plant biopolymer lignin and other aromatic compounds. The key step of PCA degradation is the ring-cleavage performed by dioxygenases adding both atoms from molecular oxygen to specific carbon atoms within the ring. This step can be performed by two distinct mechanisms; intradiol cleavage and extradiol cleavage. In intradiol cleavage the oxygen atoms are added to the carbons carrying the hydroxyl groups, producing two carboxylate groups. In extradiol cleavage the oxygens are added to one carbon carrying a hydroxyl group and another carrying a hydrogen, resulting in the formation of a carboxylate group and an aldehydic group. For further information see . PCA dioxygenases fall into the broader category of catechol dioxygenases. These are metalloenzymes which bind non-haeme iron. The extradiol dioxygenases use Fe(II) to activate oxygen for nucleophilic attack on the aromatic substrate, while the intradiol dioxygenases use Fe(III) to activate the aromatic substrate for an electrophilic attack by oxygen . This entry represents the beta subunit of protocatechuate 3,4-dioxygenase, an enzyme which cleaves the PCA ring by an intradiol mechanism. It is composed of two subunits, alpha (IPR012786 from INTERPRO) and beta which are highly similar in structure and are thought to share a common ancestor , , . The core of each subunit is two four-stranded beta-sheets that fold upon each other to form a beta sandwhich. The active site cavity contains the Fe(III)-binding site and is located between the two subunits. All Fe(III) ligands are contributed by the beta subunit.; GO: 0005506 iron ion binding, 0018578 protocatechuate 34-dioxygenase activity, 0019619 protocatechuate catabolic process. Probab=32.75 E-value=14 Score=17.91 Aligned_cols=76 Identities=16% Similarity=0.412 Sum_probs=51.4 Q ss_pred CCCCCCCHHHHHHHHHCC-CCCCCCEEE--EEE------CCCCEEEEEEECCCCCEEEEEEEEECCCCCCCCCHHCCCCC Q ss_conf 444675545677777518-775784799--998------37869999860788966899986521323898870101787 Q gi|254780589|r 28 KAEHPLSENLIISMQKKR-TSPFHPTVI--RIF------KNENILEIWKRNVDAEYVLLKEYKICAWSGTFGPKIETGDE 98 (239) Q Consensus 28 ~~~~pl~~~~~~~~~~~~-~~~~~pi~I--rv~------K~e~~Lel~~~~~~g~~~l~ktY~Ic~~sg~lGpK~~eGD~ 98 (239) ..-.|+..-+....+..| ...+..|.| ||. -....+|+|..|.+|||+..+. .|..+|-|- +-|=+ T Consensus 40 ~~~~~~DnDLt~N~~~~GE~pIGERI~V~GrVlD~~GrPVp~~LVE~WQANA~GRYrHk~D----~~~APLDPN-F~G~G 114 (224) T TIGR02422 40 EDLGPLDNDLTKNFAHGGEEPIGERIIVHGRVLDEDGRPVPNTLVEVWQANAAGRYRHKRD----QYLAPLDPN-FGGAG 114 (224) T ss_pred HHCCCCHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCCCCCCEEHHEECCCCCCCCCCCC----CCCCCCCCC-CCCCE T ss_conf 2246000157887640788877775789768865888858874100011035786246777----868888888-77512 Q ss_pred CC---CCEEEEEE Q ss_conf 47---50799984 Q gi|254780589|r 99 QA---PEGFYYIG 108 (239) Q Consensus 99 kT---PEG~Y~i~ 108 (239) || .+|.|.-. T Consensus 115 RtlTD~~G~Y~F~ 127 (224) T TIGR02422 115 RTLTDEDGYYRFR 127 (224) T ss_pred EEEECCCCCEEEE T ss_conf 2343799826888 No 83 >PRK11397 dacD D-alanyl-D-alanine carboxypeptidase; Provisional Probab=32.05 E-value=40 Score=15.01 Aligned_cols=14 Identities=14% Similarity=0.140 Sum_probs=6.2 Q ss_pred CCCHHHHHHHHHHH Q ss_conf 94046899999999 Q gi|254780589|r 1 MTNRYNILLFALFI 14 (239) Q Consensus 1 M~~R~~~Llfal~i 14 (239) |++|.++.++.+++ T Consensus 3 ~~~~~~~~~~~~~~ 16 (390) T PRK11397 3 LKRRLIIAASLFVF 16 (390) T ss_pred HHHHHHHHHHHHHH T ss_conf 37899999999999 No 84 >TIGR03431 PhnD phosphonate ABC transporter, periplasmic phosphonate binding protein. Note that this model does not identify all phnD-subfamily genes with evident phosphonate context, but all sequences above the trusted context may be inferred to bind phosphonate compounds even in the absence of such context. Furthermore, there is ample evidence to suggest that many other members of the TIGR01098 subfamily have a different primary function. Probab=31.91 E-value=33 Score=15.53 Aligned_cols=22 Identities=23% Similarity=0.403 Sum_probs=15.0 Q ss_pred CCCHHHHHHHHHHHHHCCCCCC Q ss_conf 9404689999999986046653 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHS 22 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~ 22 (239) |.+|.++++++++++++++..+ T Consensus 1 m~~r~l~~~~~~~~~~~~~a~a 22 (288) T TIGR03431 1 MLRRLILSLVAAFMLISSNAQA 22 (288) T ss_pred CCHHHHHHHHHHHHHHCCHHHH T ss_conf 9088999999999997432243 No 85 >COG5461 Type IV pili component [Cell motility and secretion] Probab=31.04 E-value=42 Score=14.91 Aligned_cols=45 Identities=31% Similarity=0.464 Sum_probs=23.6 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEE Q ss_conf 99999999860466533222344467554567777751877578479999837869999 Q gi|254780589|r 7 ILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEI 65 (239) Q Consensus 7 ~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel 65 (239) +|+.+++.+|+||.+.......+ +| .+....+||.|+ +.|.++.+ T Consensus 11 VL~~a~~allagCAs~d~~~~~a---ip---------ddyrt~hpI~i~--e~~~~~di 55 (224) T COG5461 11 VLLVAATALLAGCASRDPSTTGA---IP---------DDYRTRHPIVIR--EVEPEIDI 55 (224) T ss_pred HHHHHHHHHHHHCCCCCCCCCCC---CC---------CCCCCCCCEEEE--CCCCCCCE T ss_conf 99999999863213689655677---65---------323578876864--25777660 No 86 >PRK02998 prsA peptidylprolyl isomerase; Reviewed Probab=30.75 E-value=42 Score=14.88 Aligned_cols=26 Identities=15% Similarity=0.317 Sum_probs=16.3 Q ss_pred CCCHHHHHHHH---HHHHHCCCCCCCCCC Q ss_conf 94046899999---999860466533222 Q gi|254780589|r 1 MTNRYNILLFA---LFIFLNGCHHSRSLI 26 (239) Q Consensus 1 M~~R~~~Llfa---l~i~L~gc~~~~~~~ 26 (239) |+.+.+++.++ +++.|+||+++..+. T Consensus 1 Mkkkk~~~~~~~~~~~l~LaaC~s~~~va 29 (283) T PRK02998 1 MKKKKLFIGTIISCVVLALSACGSSDNVV 29 (283) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCEE T ss_conf 93589999999999999998668998179 No 87 >COG2143 Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones] Probab=30.69 E-value=42 Score=14.87 Aligned_cols=51 Identities=18% Similarity=0.364 Sum_probs=27.1 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECC Q ss_conf 94046899999999860466533222344467554567777751877578479999837 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKN 59 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~ 59 (239) |.+-..+++|++.++|.+|...... +.+-. ..-..+...+....++.+|-+ T Consensus 1 ~mRvl~i~Lliis~fl~a~~s~~e~--~s~~~------~~~d~ksi~~~~Kylllmfes 51 (182) T COG2143 1 VMRVLLIVLLIISLFLSACKSNNEK--RSNID------VFDDNKSISPNDKYLLLMFES 51 (182) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCHH--HHHHH------HHHHHHHCCCCCCEEEEEECC T ss_conf 9001999999999999997178044--32255------677787459567589999758 No 88 >PRK02201 putative inner membrane protein translocase component YidC; Provisional Probab=30.69 E-value=40 Score=15.02 Aligned_cols=19 Identities=37% Similarity=0.440 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHHHCCCCCC Q ss_conf 4689999999986046653 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHS 22 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~ 22 (239) +.++-+|...+.|.||--+ T Consensus 37 Ki~~~~fl~~~~l~GC~Q~ 55 (392) T PRK02201 37 KIFFYLFLFGIGLWGCIQS 55 (392) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999866775 No 89 >COG3095 MukE Uncharacterized protein involved in chromosome partitioning [Cell division and chromosome partitioning] Probab=30.55 E-value=22 Score=16.64 Aligned_cols=14 Identities=43% Similarity=0.878 Sum_probs=11.2 Q ss_pred CCCCCEEEEEEECC Q ss_conf 74750799984012 Q gi|254780589|r 98 EQAPEGFYYIGWNN 111 (239) Q Consensus 98 ~kTPEG~Y~i~~~~ 111 (239) -++|||||++.++. T Consensus 64 irapegffylrprs 77 (238) T COG3095 64 IRAPEGFFYLRPRS 77 (238) T ss_pred EECCCCEEEECCCC T ss_conf 22677526865663 No 90 >smart00095 TR_THY Transthyretin. Probab=30.47 E-value=42 Score=14.85 Aligned_cols=44 Identities=20% Similarity=0.388 Sum_probs=25.1 Q ss_pred EEEEEECCCCCEEEEEEEEECCCCCCCCCHHCCCCCCCCCEEEEEEE Q ss_conf 99986078896689998652132389887010178747507999840 Q gi|254780589|r 63 LEIWKRNVDAEYVLLKEYKICAWSGTFGPKIETGDEQAPEGFYYIGW 109 (239) Q Consensus 63 Lel~~~~~~g~~~l~ktY~Ic~~sg~lGpK~~eGD~kTPEG~Y~i~~ 109 (239) ++|+++..++.|+++.+=. ..-.|.... .-.+ .+-.+|.|.+.- T Consensus 24 V~L~~~~~~~~w~~ia~~~-Tn~DGR~~~-l~~~-~~~~~G~YrL~F 67 (121) T smart00095 24 VKVFKKTEEGTWEPFASGK-TNESGEIHE-LTTD-EKFVEGLYKVEF 67 (121) T ss_pred EEEEEECCCCCEEEEEEEE-ECCCCCCCC-CCCC-CCCCCEEEEEEE T ss_conf 9999987999779978876-389988567-6771-446765499999 No 91 >PRK10965 multicopper oxidase; Provisional Probab=30.01 E-value=43 Score=14.80 Aligned_cols=19 Identities=21% Similarity=0.051 Sum_probs=12.3 Q ss_pred CCCHHHHHHHHHHHHHCCC Q ss_conf 9404689999999986046 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGC 19 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc 19 (239) |++|-|+.+.+.+..++++ T Consensus 1 l~RR~fl~~~~~~~~~~~l 19 (530) T PRK10965 1 MQRRDFLKLSAALGVASAL 19 (530) T ss_pred CCHHHHHHHHHHHHHHHHC T ss_conf 9789999999999999867 No 92 >PRK10796 LPS-assembly lipoprotein RlpB; Provisional Probab=29.79 E-value=44 Score=14.78 Aligned_cols=78 Identities=17% Similarity=0.284 Sum_probs=40.2 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCC--------------CCCCCCCCHHHHHHHHHCCCCCC---C-----CEEEEEE- Q ss_conf 94046899999999860466533222--------------34446755456777775187757---8-----4799998- Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSLI--------------DKAEHPLSENLIISMQKKRTSPF---H-----PTVIRIF- 57 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~~--------------~~~~~pl~~~~~~~~~~~~~~~~---~-----pi~Irv~- 57 (239) |+.-..+++..++++++||+.-.... .....|+...+..++...++..- + |+ +++. T Consensus 1 ~r~l~~l~l~lavll~agCGFhLRg~~~lP~~l~tL~l~S~Dpys~Ltr~vr~qLr~~gV~lv~~~~~~~~vp~-L~i~~ 79 (196) T PRK10796 1 MRYLATLLLSLAVLVTAGCGWHLRGTTQVPSEMKTMILDSGDPNGPLSRAVRNQLRLNGVELLDKDTTRKDVPS-LRLGG 79 (196) T ss_pred CHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCE-EEECC T ss_conf 95699999999999974767157689999864445899804878679999999998779589637875245856-99715 Q ss_pred --CCCCEEEEEEECCCCCEEEEEE Q ss_conf --3786999986078896689998 Q gi|254780589|r 58 --KNENILEIWKRNVDAEYVLLKE 79 (239) Q Consensus 58 --K~e~~Lel~~~~~~g~~~l~kt 79 (239) -+++.+.|+.++.--+|+|.-+ T Consensus 80 e~~~~rtlSl~~~Gr~AEYeL~~~ 103 (196) T PRK10796 80 VSISQDTASVFRNGQTAEYQMVMT 103 (196) T ss_pred CCCCCEEEEECCCCHHHHHEEEEE T ss_conf 655531698815746520178889 No 93 >PRK12701 flgH flagellar basal body L-ring protein; Reviewed Probab=29.45 E-value=44 Score=14.74 Aligned_cols=22 Identities=18% Similarity=0.152 Sum_probs=12.5 Q ss_pred CCCHHHHHHHHHHHHHCCCCCC Q ss_conf 9404689999999986046653 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHS 22 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~ 22 (239) |++-.+.+...++++|+||... T Consensus 1 m~r~~~~~~~la~~lL~GC~~~ 22 (230) T PRK12701 1 MNRLNIAVSCLATALLFGCEAL 22 (230) T ss_pred CCHHHHHHHHHHHHHHHCCCCC T ss_conf 9258999999999997556688 No 94 >pfam07615 Ykof YKOF-related Family. Probab=29.23 E-value=42 Score=14.88 Aligned_cols=60 Identities=8% Similarity=0.132 Sum_probs=32.4 Q ss_pred EEE-CHHHHHHHHHHHHHHHHHCCCCCEEEEE--ECCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 870-6466999999999999750856111699--555689889875425867378985356888999858973279 Q gi|254780589|r 153 YAM-NNKQMQEIYAIVRDSLRGNMQSHIQIQA--FPFRMTSKNMQLYQNNPNYSFWNMLKLGHDYFEKNHQEPFIQ 225 (239) Q Consensus 153 ~am-td~~i~EIy~~v~~a~~~~gQ~~~~v~~--~Pfrmt~~n~~~~~~~~~~~FW~~Lk~gyd~Fe~~~~~p~v~ 225 (239) |.| +|+.|++||.-++.+=..| +..+. |-=|+ +.+. -+-+.-|+..+.++++....-... T Consensus 9 YPMg~~d~m~~I~~ai~~~~~~g----v~~~s~h~sT~l--------~G~~-~~Vf~~l~~~f~~a~~~~~Hv~l~ 71 (81) T pfam07615 9 YPMGTSDFVDVILGALDALKTSG----VWDESDHYSTRL--------RGDV-GDVFDALEALFRNAAQSGGHVTLD 71 (81) T ss_pred EECCCCCHHHHHHHHHHHHHHCC----CEECCCEEEEEE--------ECCH-HHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 72688557999999999875339----266246168898--------4788-999999999999855789759999 No 95 >PRK09915 putative outer membrane efflux protein MdtP; Provisional Probab=29.16 E-value=45 Score=14.71 Aligned_cols=23 Identities=13% Similarity=0.312 Sum_probs=14.3 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 94046899999999860466533 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~ 23 (239) |++-.+..+++++++|+||.... T Consensus 6 ~~r~~~~~~l~~~~lLagC~~~p 28 (488) T PRK09915 6 LSRLLLCSILGSTTLISGCALVR 28 (488) T ss_pred HHHHHHHHHHHHHHHHCCCCCCC T ss_conf 76999999999999975266899 No 96 >PRK11189 lipoprotein NlpI; Provisional Probab=28.94 E-value=45 Score=14.69 Aligned_cols=40 Identities=20% Similarity=0.255 Sum_probs=18.7 Q ss_pred CCCHHHHHHHH-HHHHHCCCCCCCCCC---CCCCCCCCHHHHHH Q ss_conf 94046899999-999860466533222---34446755456777 Q gi|254780589|r 1 MTNRYNILLFA-LFIFLNGCHHSRSLI---DKAEHPLSENLIIS 40 (239) Q Consensus 1 M~~R~~~Llfa-l~i~L~gc~~~~~~~---~~~~~pl~~~~~~~ 40 (239) |++..-+++++ .+++|+||.++.+.. ...-.|+.|..... T Consensus 1 m~~~~r~~~~~~~~l~LsGC~s~~~~~~~~~~~a~plqp~~q~E 44 (297) T PRK11189 1 MKPFLRWCFVATAALLLAGCASSNTSWRKSEVLAVPLQPSYQQE 44 (297) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHH T ss_conf 95189999999999999864058876444865144678779999 No 97 >PRK10783 mltD membrane-bound lytic murein transglycosylase D; Provisional Probab=28.63 E-value=44 Score=14.77 Aligned_cols=22 Identities=27% Similarity=0.492 Sum_probs=12.6 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 9404689999999986046653322 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSL 25 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~ 25 (239) |+.|.+++ .+++|+||+.+... T Consensus 1 mk~~~~l~---~~~ll~gcq~~~~~ 22 (449) T PRK10783 1 MKAKAILL---ASVLLVGCQSSKND 22 (449) T ss_pred CCHHHHHH---HHHHHHHCCCCCCC T ss_conf 91899999---99999725578888 No 98 >pfam09476 Pilus_CpaD Pilus biogenesis CpaD protein (pilus_cpaD). Proteins in this entry consist of a pilus biogenesis protein, CpaD, from Caulobacter, and homologues in other bacteria, including three in the root nodule bacterium Bradyrhizobium japonicum. The molecular function of the homologues is not known. Probab=28.53 E-value=46 Score=14.64 Aligned_cols=42 Identities=26% Similarity=0.467 Sum_probs=22.7 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEE Q ss_conf 9999998604665332223444675545677777518775784799998378699998 Q gi|254780589|r 9 LFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEIW 66 (239) Q Consensus 9 lfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~ 66 (239) ++++++.|+||.... .... .| .+....+||.|. .++..+.|. T Consensus 2 ll~la~~L~aCa~~~-~~~~--~p-----------~dyr~~hpI~V~--~~~~~~di~ 43 (200) T pfam09476 2 LLALALALAGCASTA-PTAS--AP-----------DDYRTRHPIVVE--EADRTLDLP 43 (200) T ss_pred HHHHHHHHHCCCCCC-CCCC--CC-----------CCHHHCCCCEEE--ECCEEEEEE T ss_conf 678998873347876-4565--78-----------747552896687--435488873 No 99 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=28.51 E-value=36 Score=15.28 Aligned_cols=23 Identities=22% Similarity=0.219 Sum_probs=12.6 Q ss_pred CCCHHHHHHHHHH-HHHCCCCCCC Q ss_conf 9404689999999-9860466533 Q gi|254780589|r 1 MTNRYNILLFALF-IFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llfal~-i~L~gc~~~~ 23 (239) |+...+.++.+++ +.|+||++.. T Consensus 5 MKK~~l~~~~~~~~l~LaaCss~~ 28 (298) T PRK04405 5 MKKWALAAASAGLLLSLAGCSSNN 28 (298) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 899999999999999998717999 No 100 >KOG0749 consensus Probab=28.42 E-value=22 Score=16.63 Aligned_cols=64 Identities=19% Similarity=0.401 Sum_probs=37.0 Q ss_pred HHCCCCCCCCCCCCCCCCCCHHHH-HHHHHCCCCCCC------CEEEEEECCCCEEEEEEECCCCCEEEEEEEEE Q ss_conf 860466533222344467554567-777751877578------47999983786999986078896689998652 Q gi|254780589|r 15 FLNGCHHSRSLIDKAEHPLSENLI-ISMQKKRTSPFH------PTVIRIFKNENILEIWKRNVDAEYVLLKEYKI 82 (239) Q Consensus 15 ~L~gc~~~~~~~~~~~~pl~~~~~-~~~~~~~~~~~~------pi~Irv~K~e~~Lel~~~~~~g~~~l~ktY~I 82 (239) +|+|. .+.+++..+..|+.-..+ .+.+...|.... ..++|+-|||..+.+|+.|.. -+++-||- T Consensus 16 fl~Gg-vaAavsKTavAPIERVKLlLQ~Q~~~~~~~~~YkGi~Dc~~r~~~eqG~~sfWRGN~a---nViRyfPt 86 (298) T KOG0749 16 FLAGG-VAAAVSKTAVAPIERVKLLLQVQDSEIAADKRYKGIVDCFVRIPKEQGFLSFWRGNLA---NVIRYFPT 86 (298) T ss_pred HHCCH-HHHHHHHHCCCCHHHHHHHHHHCCCHHHHCCCCCCHHHEEEECHHHHHHHHEECCCCC---CHHHCCCH T ss_conf 87120-7766554120527887888771311343226756622224624033112110226643---21210753 No 101 >PRK13859 type IV secretion system lipoprotein VirB7; Provisional Probab=27.98 E-value=43 Score=14.83 Aligned_cols=23 Identities=30% Similarity=0.608 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 4689999999986046653322234 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSRSLIDK 28 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~~~~~~ 28 (239) ++++|+ +++.|++|+.++....- T Consensus 2 KY~lL~--l~l~La~CqT~D~lAtc 24 (55) T PRK13859 2 KYCLLC--LALALAGCQTNDTLASC 24 (55) T ss_pred CHHHHH--HHHHHHHCCCCCCCCCC T ss_conf 013999--99999860046831124 No 102 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=27.76 E-value=43 Score=14.84 Aligned_cols=19 Identities=16% Similarity=0.460 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHHHCCCCCC Q ss_conf 4689999999986046653 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHS 22 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~ 22 (239) |-++++++++++|++|+.. T Consensus 2 kk~~~~~~~~~~~~SCg~~ 20 (449) T TIGR03525 2 KKYLVFAALVVLVYSCGSG 20 (449) T ss_pred CHHHHHHHHHHHHHHCCCC T ss_conf 1248999999987210589 No 103 >COG5510 Predicted small secreted protein [Function unknown] Probab=27.74 E-value=37 Score=15.22 Aligned_cols=23 Identities=22% Similarity=0.400 Sum_probs=12.4 Q ss_pred CCCH---HHHHHHHHHHHHCCCCCCC Q ss_conf 9404---6899999999860466533 Q gi|254780589|r 1 MTNR---YNILLFALFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R---~~~Llfal~i~L~gc~~~~ 23 (239) |.+| ..+++.+..+.|++|+... T Consensus 1 mmk~t~l~i~~vll~s~llaaCNT~r 26 (44) T COG5510 1 MMKKTILLIALVLLASTLLAACNTMR 26 (44) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 90489999999999999998866300 No 104 >pfam00820 Lipoprotein_1 Borrelia lipoprotein. This family of lipoproteins is found in Borrelia spirochetes. The function of these proteins is uncertain. Probab=27.72 E-value=47 Score=14.55 Aligned_cols=58 Identities=21% Similarity=0.234 Sum_probs=29.5 Q ss_pred HHHHHHHHHHHHHCCCCCCC-CCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEEECCCCCEEEEEEEEE Q ss_conf 46899999999860466533-22234446755456777775187757847999983786999986078896689998652 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHSR-SLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEIWKRNVDAEYVLLKEYKI 82 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~~-~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~~~~~~g~~~l~ktY~I 82 (239) +-++|.|++.+.|.+|.-.. +...+... ..+++-+-. |.|-||. +.+|+|.|+.+-.- T Consensus 2 kkyllgi~LilaliaCkQn~ss~dek~s~-----------svdLp~~~k--vlvSKEK--------nkdGKY~L~AtVDk 60 (273) T pfam00820 2 KKYLLGIGLILALIACKQNVSSLDEKNSV-----------SVDLPGEME--VLVSKEK--------NKDGKYSLRATVDT 60 (273) T ss_pred CEEEHHHHHHHHHHHHHCCCCCCCCCCCC-----------EECCCCCEE--EEEECCC--------CCCCCEEEEEEEEE T ss_conf 44311589999998760335554435661-----------022899708--9998124--------77875788888754 No 105 >TIGR03112 6_pyr_pter_rel 6-pyruvoyl tetrahydropterin synthase-related domain. Members of this family are small proteins, or small domains of larger proteins, that occur in certain Firmicutes in the same regions as members of families TIGR03110 and TIGR03111. Members of TIGR03110 resemble exosortase, a proposed protein sorting transpeptidase (see TIGR02602). TIGR03111 represents a small clade among the group 2 glycosyltransferases. Members of the current protein family resemble eukaryotic known and prokaryotic predicted 6-pyruvoyl tetrahydropterin synthases. Probab=27.45 E-value=48 Score=14.52 Aligned_cols=61 Identities=18% Similarity=0.235 Sum_probs=39.7 Q ss_pred ECCCCCCCCCEEECHH----HHHHHHHHHHHHHHHCCCCCEEEEEECCC---CCHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 7167786671870646----69999999999997508561116995556---898898754258673789853568 Q gi|254780589|r 143 IHGECASAGCYAMNNK----QMQEIYAIVRDSLRGNMQSHIQIQAFPFR---MTSKNMQLYQNNPNYSFWNMLKLG 211 (239) Q Consensus 143 IHG~~~S~GC~amtd~----~i~EIy~~v~~a~~~~gQ~~~~v~~~Pfr---mt~~n~~~~~~~~~~~FW~~Lk~g 211 (239) +||-.|-+-|...+++ +..+|=..+++.+.. -+..+==.+-||. .|.+|+++| ||..|++- T Consensus 22 lHgHtw~v~v~v~~~~~~~idF~~lk~~v~~vl~~-ldh~~LNel~pF~~~NPT~ENia~y-------i~~~l~~~ 89 (113) T TIGR03112 22 KHPHTWEITIFVIKKEDKFILFNDVEKKVEKYLKP-YQNKYLNDLEPFDKINPTLENIGDY-------FFDEIKKL 89 (113) T ss_pred CCCCCEEEEEEEEECCCCEEEHHHHHHHHHHHHHH-HCCCHHCCCCCCCCCCCCHHHHHHH-------HHHHHHHH T ss_conf 76986799999980888687299999999999987-5751010487535689878999999-------99999987 No 106 >PRK11251 DNA-binding transcriptional activator OsmE; Provisional Probab=27.27 E-value=45 Score=14.70 Aligned_cols=33 Identities=21% Similarity=0.343 Sum_probs=19.1 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCC Q ss_conf 99998604665332223444675545677777518775 Q gi|254780589|r 11 ALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSP 48 (239) Q Consensus 11 al~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~ 48 (239) +.+.+|+||..-.........||...+ .+||+. T Consensus 10 a~~~~LaGCt~~d~~~~~~~ePlVkdV-----~~GMT~ 42 (109) T PRK11251 10 AVLTMLAGCTAYDRNPVQYVEPVVKDV-----KKGMTR 42 (109) T ss_pred HHHHHHHHCCCCCCCHHHHCCHHHHHH-----HCCCCH T ss_conf 999999714477678134315069888-----747579 No 107 >TIGR01819 F420_cofD LPPG:Fo 2-phospho-L-lactate transferase; InterPro: IPR010115 This entry represents LPPG:Fo 2-phospho-L-lactate transferase (CofD), which catalyses the fourth step in the biosynthesis of coenzyme F420, a flavin derivative found in methanogens, Mycobacteria, and several other lineages. This enzyme is characterised so far in Methanococcus jannaschii but appears restricted to F420-containing species and is predicted to carry out the same function in these other species. . Probab=26.92 E-value=23 Score=16.47 Aligned_cols=50 Identities=26% Similarity=0.350 Sum_probs=33.6 Q ss_pred CCCCC-CCHHCCCCCCC-CCEEEEEEECCCCCCCCCCEEEECCCCCHHHHHCCCCCCCEEECCCC Q ss_conf 23898-87010178747-50799984012587576521420799898899729998826871677 Q gi|254780589|r 85 WSGTF-GPKIETGDEQA-PEGFYYIGWNNLNPNSKYFLSINIGFPNEFDKAHNRTGADLMIHGEC 147 (239) Q Consensus 85 ~sg~l-GpK~~eGD~kT-PEG~Y~i~~~~~np~S~y~ls~~i~YPN~~Dr~~grtG~~I~IHG~~ 147 (239) .||+. |||+-+|=+|. |+|-|..+..+. +..--++- -+-||.++||||.- T Consensus 4 LaGGtGtpkLL~Glk~v~~~Gqfaansahs--dad~el~V-----------vvNTgdD~w~~Gl~ 55 (359) T TIGR01819 4 LAGGTGTPKLLQGLKRVLPEGQFAANSAHS--DADAELTV-----------VVNTGDDVWIGGLK 55 (359) T ss_pred EECCCCCHHHHHHHHHHCCCCCEEHHCCCC--CCCCCEEE-----------EEECCCEEEECCCE T ss_conf 635867078999999874886311000025--53444488-----------98311304450752 No 108 >pfam06604 consensus Probab=26.79 E-value=49 Score=14.45 Aligned_cols=18 Identities=33% Similarity=0.578 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHCCCCCCC Q ss_conf 899999999860466533 Q gi|254780589|r 6 NILLFALFIFLNGCHHSR 23 (239) Q Consensus 6 ~~Llfal~i~L~gc~~~~ 23 (239) |+.+.++.++|+||+... T Consensus 8 ~~~~~~~v~~laGCq~~~ 25 (181) T pfam06604 8 LVAGLAAVLVLAGCQRSR 25 (181) T ss_pred HHHHHHHHHHHHCCCCCC T ss_conf 899999999973445333 No 109 >PRK13883 conjugal transfer protein TrbH; Provisional Probab=26.39 E-value=50 Score=14.41 Aligned_cols=31 Identities=13% Similarity=0.239 Sum_probs=17.0 Q ss_pred EEEEEECCCCCCCCCCEEEECCCC---CHHHHHCCC Q ss_conf 999840125875765214207998---988997299 Q gi|254780589|r 104 FYYIGWNNLNPNSKYFLSINIGFP---NEFDKAHNR 136 (239) Q Consensus 104 ~Y~i~~~~~np~S~y~ls~~i~YP---N~~Dr~~gr 136 (239) +|-||.- ....-|.+++.||.- .+|...+|. T Consensus 112 ~YVlDq~--~~~nLYRvtl~VG~QslSRaY~~qng~ 145 (156) T PRK13883 112 RYVLDQA--GDSNLYRLTLMVGNQSITRAYLAQNGT 145 (156) T ss_pred EEEEECC--CCCCEEEEEEEECCEEECHHHHHHCCC T ss_conf 8997037--888648999998999830788863895 No 110 >PRK10672 rare lipoprotein A; Provisional Probab=26.36 E-value=50 Score=14.40 Aligned_cols=22 Identities=18% Similarity=0.285 Sum_probs=16.0 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 94046899999999860466533 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~ 23 (239) |++-|+++.++.+ +|+||.... T Consensus 1 mr~~wl~~~~~~~-ll~~c~~~~ 22 (369) T PRK10672 1 MRKQWLGICIAAG-LLAACTSDD 22 (369) T ss_pred CCHHHHHHHHHHH-HHHHCCCCC T ss_conf 9333589999999-986525886 No 111 >TIGR00784 citMHS citrate transporter; InterPro: IPR014738 This entry consists of known and predicted citrate transporters. It includes two characterised citrate/proton symporters, CitM and CitH, from Bacillus subtilis . Both of these proteins couple the uptake of citrate in complex with a divalent cation to the uptake of protons, but differ in their metal ion specificities. CitM transports citrate in complex with Mg(2+), Ni(2+), Mn(2+), Co(2+), and Zn2(+), while CitH transports citrate in complex with Ca(2+), Ba(2+), and Sr(2+). A third paralog, YraO, exists in B. subtilis, but the function of this protein is not known. . Probab=25.99 E-value=16 Score=17.39 Aligned_cols=16 Identities=13% Similarity=0.557 Sum_probs=12.2 Q ss_pred CCEEEECCCCCHHHHH Q ss_conf 5214207998988997 Q gi|254780589|r 118 YFLSINIGFPNEFDKA 133 (239) Q Consensus 118 y~ls~~i~YPN~~Dr~ 133 (239) |-+|+-+||||...+. T Consensus 268 F~~AL~vNYPNvk~Q~ 283 (439) T TIGR00784 268 FVVALIVNYPNVKEQK 283 (439) T ss_pred HHHHHHHCCCCHHHHH T ss_conf 9999870587746789 No 112 >PRK12788 flgH flagellar basal body L-ring protein; Reviewed Probab=25.85 E-value=51 Score=14.34 Aligned_cols=19 Identities=26% Similarity=0.539 Sum_probs=12.3 Q ss_pred CHHHHHHHHHHHHHCCCCCC Q ss_conf 04689999999986046653 Q gi|254780589|r 3 NRYNILLFALFIFLNGCHHS 22 (239) Q Consensus 3 ~R~~~Llfal~i~L~gc~~~ 22 (239) +|++++++++ ++|+||... T Consensus 1 mr~l~~~~a~-l~LsGCa~~ 19 (231) T PRK12788 1 MRLLVAILAC-LALAGCANN 19 (231) T ss_pred CCHHHHHHHH-HHHHHHCCC T ss_conf 9089999999-997640387 No 113 >PRK13733 conjugal transfer protein TraV; Provisional Probab=25.53 E-value=11 Score=18.57 Aligned_cols=25 Identities=20% Similarity=0.297 Sum_probs=19.4 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 9404689999999986046653322 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSL 25 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~ 25 (239) |+...|++.++..++|+||.+..+. T Consensus 1 mk~~~~l~~l~~~llLtGCAG~nSd 25 (171) T PRK13733 1 MKKISLLIPLLGTLLLSGCAGTNSE 25 (171) T ss_pred CCEEEEEHHHCCEEEEECCCCCCCC T ss_conf 9516751120260026024577754 No 114 >KOG4824 consensus Probab=24.98 E-value=53 Score=14.25 Aligned_cols=35 Identities=17% Similarity=0.284 Sum_probs=23.5 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCH Q ss_conf 94046899999999860466533222344467554 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSLIDKAEHPLSE 35 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~ 35 (239) |.+..|+|+|+..+++++...+.......+.|+.+ T Consensus 6 ~l~~~~~l~~~a~L~~~a~gqa~~lg~cpnypv~k 40 (224) T KOG4824 6 ALRKCFVLGFAAILLLSAAGQAIALGECPNYPVAK 40 (224) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCC T ss_conf 89999999889999864156467304689985346 No 115 >PRK13616 lipoprotein LpqB; Provisional Probab=24.94 E-value=39 Score=15.09 Aligned_cols=20 Identities=20% Similarity=0.394 Sum_probs=15.7 Q ss_pred CCHHHHHHHHHHHHHCCCCC Q ss_conf 40468999999998604665 Q gi|254780589|r 2 TNRYNILLFALFIFLNGCHH 21 (239) Q Consensus 2 ~~R~~~Llfal~i~L~gc~~ 21 (239) ++|.+.++.+++++|+||.+ T Consensus 5 ~~~~~~~~~~~~~llaGCas 24 (590) T PRK13616 5 RLKALAALLAVALLVAGCAS 24 (590) T ss_pred HHHHHHHHHHHHHHHHHCCC T ss_conf 99999999999998612036 No 116 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=24.69 E-value=54 Score=14.21 Aligned_cols=19 Identities=42% Similarity=0.571 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHCCCCCCCC Q ss_conf 8999999998604665332 Q gi|254780589|r 6 NILLFALFIFLNGCHHSRS 24 (239) Q Consensus 6 ~~Llfal~i~L~gc~~~~~ 24 (239) +++++.+++.|+||+.... T Consensus 6 ~~~~~~~~~~l~~C~~~~~ 24 (289) T TIGR03659 6 LILLALLSLGLTGCSSSKA 24 (289) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 9999999999843289987 No 117 >KOG0884 consensus Probab=24.27 E-value=42 Score=14.90 Aligned_cols=24 Identities=25% Similarity=0.412 Sum_probs=14.9 Q ss_pred CCCCCCCEEECCCC-----------CCCCCEEECHHH Q ss_conf 29998826871677-----------866718706466 Q gi|254780589|r 134 HNRTGADLMIHGEC-----------ASAGCYAMNNKQ 159 (239) Q Consensus 134 ~grtG~~I~IHG~~-----------~S~GC~amtd~~ 159 (239) -||.|..|| |.. .+.||++|.|.. T Consensus 59 tgrgg~siw--g~~fede~~~~lkh~~rg~vsmanng 93 (161) T KOG0884 59 TGRGGNSIW--GKKFEDEYSEYLKHNVRGVVSMANNG 93 (161) T ss_pred CCCCCCCCC--CCCCHHHHHHHHHCCCCEEEECCCCC T ss_conf 688875424--87432889998750631068713279 No 118 >PRK10510 putative outer membrane lipoprotein; Provisional Probab=24.25 E-value=55 Score=14.16 Aligned_cols=49 Identities=16% Similarity=0.119 Sum_probs=24.7 Q ss_pred CCCCCCCEEECCCCCCCCCEEECHHHHHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 2999882687167786671870646699999999999975085611169 Q gi|254780589|r 134 HNRTGADLMIHGECASAGCYAMNNKQMQEIYAIVRDSLRGNMQSHIQIQ 182 (239) Q Consensus 134 ~grtG~~I~IHG~~~S~GC~amtd~~i~EIy~~v~~a~~~~gQ~~~~v~ 182 (239) +.++...|-|-|-.-+.|--+..-.--++==.-|.+-|...|-..-++. T Consensus 139 ~~yp~~~V~I~GHTD~~G~~~yN~~LS~~RA~aV~~yL~~~GV~~~Ri~ 187 (219) T PRK10510 139 KEYPKTAVNVVGYTDSTGSHDLNMRLSQQRADSVASALITQGVDASRIR 187 (219) T ss_pred HHCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHEE T ss_conf 9889927999987689671999999999999999999998599956489 No 119 >PRK13736 conjugal transfer protein TraK; Provisional Probab=24.22 E-value=55 Score=14.16 Aligned_cols=23 Identities=9% Similarity=0.044 Sum_probs=11.9 Q ss_pred CCCEEECHHHHHHHHHHHHHHHH Q ss_conf 67187064669999999999997 Q gi|254780589|r 150 AGCYAMNNKQMQEIYAIVRDSLR 172 (239) Q Consensus 150 ~GC~amtd~~i~EIy~~v~~a~~ 172 (239) .--..-++.+..-|-.+.+.-.+ T Consensus 128 a~~~e~s~~Y~~~l~~l~~am~~ 150 (245) T PRK13736 128 AKAWETSQPYESLLVALSQAVRG 150 (245) T ss_pred CCCCCCCCCHHHHHHHHHHHHHC T ss_conf 44222138788999999999973 No 120 >TIGR00693 thiE thiamine-phosphate pyrophosphorylase; InterPro: IPR003733 Thiamine monophosphate synthase (TMP) (2.5.1.3 from EC) catalyzes the substitution of the pyrophosphate of 2-methyl-4-amino-5- hydroxymethylpyrimidine pyrophosphate by 4-methyl-5- (beta-hydroxyethyl)thiazole phosphate to yield thiamine phosphate in the thiamine biosynthesis pathway . TENI, a protein from Bacillus subtilis that regulates the production of several extracellular enzymes by reducing alkaline protease production belongs to this group .; GO: 0004789 thiamin-phosphate diphosphorylase activity, 0009228 thiamin biosynthetic process. Probab=23.99 E-value=56 Score=14.13 Aligned_cols=59 Identities=14% Similarity=0.194 Sum_probs=40.6 Q ss_pred CEEECHHHH----HHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHCCC-------CHHHHHHHHHHHHHHHHHCC Q ss_conf 187064669----9999999999975085611169955568988987542586-------73789853568889998589 Q gi|254780589|r 152 CYAMNNKQM----QEIYAIVRDSLRGNMQSHIQIQAFPFRMTSKNMQLYQNNP-------NYSFWNMLKLGHDYFEKNHQ 220 (239) Q Consensus 152 C~amtd~~i----~EIy~~v~~a~~~~gQ~~~~v~~~Pfrmt~~n~~~~~~~~-------~~~FW~~Lk~gyd~Fe~~~~ 220 (239) =|.+||+++ ++....++.||+| |-+.+|. |.++++ -..||+.-++-=+.=.++.- T Consensus 2 LYlit~~~~~~~~~~~~~~ve~Al~G-GV~~~Ql-------------R~K~~~~~~~yGE~~~~~~~A~~l~~lc~~y~~ 67 (210) T TIGR00693 2 LYLITDPQLFPDLKDLLARVEAALKG-GVTLVQL-------------RDKGSNALQQYGETRERLELAEKLRELCRKYGV 67 (210) T ss_pred CEEEECCCCCCCHHHHHHHHHHHHHC-CCEEEEE-------------EECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 05630874677534489999999858-9629998-------------405875421258588999999999999987089 Q ss_pred CCEE Q ss_conf 7327 Q gi|254780589|r 221 EPFI 224 (239) Q Consensus 221 ~p~v 224 (239) |-.| T Consensus 68 ~f~v 71 (210) T TIGR00693 68 PFIV 71 (210) T ss_pred CEEE T ss_conf 7688 No 121 >TIGR00212 hemC porphobilinogen deaminase; InterPro: IPR000860 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin . The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) . This entry represents hydroxymethylbilane synthase (or porphobilinogen deaminase, 2.5.1.61 from EC), which functions during the second stage of tetrapyrrole biosynthesis. This enzyme catalyses the polymerisation of four PBG molecules into the tetrapyrrole structure, preuroporphyrinogen, with the concomitant release of four molecules of ammonia. This enzyme uses a unique dipyrro-methane cofactor made from two molecules of PBG, which is covalently attached to a cysteine side chain. The tetrapyrrole product is synthesized in an ordered, sequential fashion, by initial attachment of the first pyrrole unit (ring A) to the cofactor, followed by subsequent additions of the remaining pyrrole units (rings B, C, D) to the growing pyrrole chain . The link between the pyrrole ring and the cofactor is broken once all the pyrroles have been added. This enzyme is folded into three distinct domains that enclose a single, large active site that makes use of an aspartic acid as its one essential catalytic residue, acting as a general acid/base during catalysis , . A deficiency of hydroxymethylbilane synthase is implicated in the neuropathic disease, Acute Intermittent Porphyria (AIP) . ; GO: 0004418 hydroxymethylbilane synthase activity, 0033014 tetrapyrrole biosynthetic process. Probab=23.72 E-value=38 Score=15.16 Aligned_cols=19 Identities=32% Similarity=0.158 Sum_probs=16.1 Q ss_pred HHCCCCCCCCCEEEEEEEC Q ss_conf 0101787475079998401 Q gi|254780589|r 92 KIETGDEQAPEGFYYIGWN 110 (239) Q Consensus 92 K~~eGD~kTPEG~Y~i~~~ 110 (239) +.-+|+++||.|.|..... T Consensus 236 ~~L~GgC~~Pig~~a~~~~ 254 (312) T TIGR00212 236 KELDGGCQTPIGAYAEYEG 254 (312) T ss_pred HHHCCCCCCCEEEEEEEEC T ss_conf 8728896433146889833 No 122 >PRK02463 OxaA-like protein precursor; Provisional Probab=23.22 E-value=58 Score=14.04 Aligned_cols=16 Identities=25% Similarity=0.625 Sum_probs=10.7 Q ss_pred HHHHHHHHHCCCCCCC Q ss_conf 9999999860466533 Q gi|254780589|r 8 LLFALFIFLNGCHHSR 23 (239) Q Consensus 8 Llfal~i~L~gc~~~~ 23 (239) ++.+++++|+||.... T Consensus 13 ~~~~~~l~LsgC~~~~ 28 (307) T PRK02463 13 LALSMLLTLTGCVGRD 28 (307) T ss_pred HHHHHHHHHHCCCCCC T ss_conf 9999999996346899 No 123 >TIGR02824 quinone_pig3 putative NAD(P)H quinone oxidoreductase, PIG3 family; InterPro: IPR014189 Members of this family are putative quinone oxidoreductases that belong to the broader superfamily of zinc-dependent alcohol (of medium chain length) dehydrogenases and quinone oxidoreductases. An alignment shows no motif of conserved Cys residues as are found in zinc-binding members of the superfamily, consequently members of this entry are likely to be quinone oxidoreductases. A member of this family in Homo sapiens (Human), PIG3, is induced by p53 but is otherwise uncharacterised.. Probab=22.73 E-value=21 Score=16.76 Aligned_cols=74 Identities=23% Similarity=0.406 Sum_probs=47.7 Q ss_pred CCCCEEEEEEECCCCCCCCCCEEEECCCCCHHHHHCCCCCCCEEECCCCC-----------CCCCEEECHHHHHHHHHHH Q ss_conf 47507999840125875765214207998988997299988268716778-----------6671870646699999999 Q gi|254780589|r 99 QAPEGFYYIGWNNLNPNSKYFLSINIGFPNEFDKAHNRTGADLMIHGECA-----------SAGCYAMNNKQMQEIYAIV 167 (239) Q Consensus 99 kTPEG~Y~i~~~~~np~S~y~ls~~i~YPN~~Dr~~grtG~~I~IHG~~~-----------S~GC~amtd~~i~EIy~~v 167 (239) .-|+|+=.+...-+ |- -|-+=|=|.|++++=+.|.-+-|||.-| ..||...|-..-+|=-..+ T Consensus 112 P~P~Gls~veAAAl-PE-----t~fTVW~NLF~~g~L~~GEtvLiHGGaSGIGttAIqLAKA~Ga~V~~TaGS~eK~~~a 185 (334) T TIGR02824 112 PVPEGLSLVEAAAL-PE-----TFFTVWSNLFQRGGLKAGETVLIHGGASGIGTTAIQLAKAFGARVFTTAGSDEKCATA 185 (334) T ss_pred ECCCCCCHHHHHCC-HH-----HHHHHHHHHHHHCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCEEEEEECCHHHHHHH T ss_conf 06898877976125-14-----7888888687221137897289971367367999999985697599982898999999 Q ss_pred HHHHHHCCCCCEE Q ss_conf 9999750856111 Q gi|254780589|r 168 RDSLRGNMQSHIQ 180 (239) Q Consensus 168 ~~a~~~~gQ~~~~ 180 (239) .++| | -+..|. T Consensus 186 ~~~L-G-Ad~aIN 196 (334) T TIGR02824 186 CEAL-G-ADIAIN 196 (334) T ss_pred HHHC-C-CEEEEE T ss_conf 9860-9-807860 No 124 >TIGR01626 ytfJ_HI0045 conserved hypothetical protein YtfJ-family, TIGR01626; InterPro: IPR006513 These are sequences from gamma proteobacteria that are related to the E. coli protein, YtfJ. . Probab=22.61 E-value=59 Score=13.96 Aligned_cols=53 Identities=11% Similarity=0.228 Sum_probs=31.7 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCEEEEEE Q ss_conf 9404689999999986046653322234446755456777775187757847999983786999986 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSRSLIDKAEHPLSENLIISMQKKRTSPFHPTVIRIFKNENILEIWK 67 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~pi~Irv~K~e~~Lel~~ 67 (239) |++|.++++.++++...+|.--. ..+.++|+--++.-.+ |.+-+.+-.-.-|. T Consensus 1 Mk~~~l~a~~~~~~p~~a~AHNL----~v~q~vP~Vgv~~~GE----------ivL~~~~~~yq~W~ 53 (184) T TIGR01626 1 MKKKKLLALLLVLLPILAFAHNL----EVEQSVPAVGVEERGE----------IVLSDDDISYQKWR 53 (184) T ss_pred CCHHHHHHHHHHHHHHHHHHHCC----CCCCCCCCCCCCCCEE----------EEECCCCCCCCCCC T ss_conf 91478999999998775654201----2488589711135401----------67527840015874 No 125 >KOG2100 consensus Probab=22.55 E-value=59 Score=13.95 Aligned_cols=37 Identities=22% Similarity=0.335 Sum_probs=20.3 Q ss_pred CEEECCCCCCCCCEEECHHHH-HHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf 268716778667187064669-99999999999750856111699555 Q gi|254780589|r 140 DLMIHGECASAGCYAMNNKQM-QEIYAIVRDSLRGNMQSHIQIQAFPF 186 (239) Q Consensus 140 ~I~IHG~~~S~GC~amtd~~i-~EIy~~v~~a~~~~gQ~~~~v~~~Pf 186 (239) .+-|||+- |+.+ -+=.+...+||+..|-. +++++||- T Consensus 685 ~LliHGt~---------DdnVh~q~s~~~~~aL~~~gv~-~~~~vypd 722 (755) T KOG2100 685 LLLIHGTE---------DDNVHFQQSAILIKALQNAGVP-FRLLVYPD 722 (755) T ss_pred EEEEECCC---------CCCCCHHHHHHHHHHHHHCCCC-EEEEECCC T ss_conf 79997466---------7665846699999999976994-48997389 No 126 >KOG2292 consensus Probab=22.27 E-value=23 Score=16.47 Aligned_cols=51 Identities=24% Similarity=0.468 Sum_probs=34.9 Q ss_pred CCCCCC-HHCCCCCCCCCEEEEEEECCCCCC--------CCCCEE--EECCC--CCHHHHHCCCC Q ss_conf 389887-010178747507999840125875--------765214--20799--89889972999 Q gi|254780589|r 86 SGTFGP-KIETGDEQAPEGFYYIGWNNLNPN--------SKYFLS--INIGF--PNEFDKAHNRT 137 (239) Q Consensus 86 sg~lGp-K~~eGD~kTPEG~Y~i~~~~~np~--------S~y~ls--~~i~Y--PN~~Dr~~grt 137 (239) +|+.-| -.+|-|.-||+|-|.++..- .|. -+||.+ +.+.| |.-|||.++.. T Consensus 624 ~~g~~p~~IkE~dy~t~~GEfRvD~~a-sptmlNcLmYKmsYyrfge~~~~~~~~~GyDR~R~~e 687 (751) T KOG2292 624 GGGEHPKHIKERDYFTPTGEFRVDAEA-SPTMLNCLMYKMSYYRFGELYTEFNGPPGYDRVRNAE 687 (751) T ss_pred CCCCCHHHCCCCCCCCCCCCEECCCCC-CHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH T ss_conf 377683540534333788736437777-8789999999887643444101368998740365400 No 127 >COG3009 Uncharacterized protein conserved in bacteria [Function unknown] Probab=22.18 E-value=60 Score=13.91 Aligned_cols=15 Identities=27% Similarity=0.656 Sum_probs=10.2 Q ss_pred HHHHHHHHHCCCCCC Q ss_conf 999999986046653 Q gi|254780589|r 8 LLFALFIFLNGCHHS 22 (239) Q Consensus 8 Llfal~i~L~gc~~~ 22 (239) ++..+++.|+||.+. T Consensus 5 l~~~aal~L~~Cas~ 19 (190) T COG3009 5 LMIIAALLLAGCASG 19 (190) T ss_pred HHHHHHHHHHHCCCC T ss_conf 999999998753799 No 128 >pfam07172 GRP Glycine rich protein family. This family of proteins includes several glycine rich proteins as well as two nodulins 16 and 24. The family also contains proteins that are induced in response to various stresses. Probab=21.96 E-value=35 Score=15.38 Aligned_cols=12 Identities=33% Similarity=0.326 Sum_probs=7.6 Q ss_pred CCCHHHHHHHHH Q ss_conf 940468999999 Q gi|254780589|r 1 MTNRYNILLFAL 12 (239) Q Consensus 1 M~~R~~~Llfal 12 (239) |..|+|+||..+ T Consensus 1 MaSKa~llLgll 12 (91) T pfam07172 1 MASKALLLLGLL 12 (91) T ss_pred CCCHHHHHHHHH T ss_conf 950579999999 No 129 >pfam02157 Man-6-P_recep Cation-dependent mannose-6-phosphate receptor. Probab=21.64 E-value=38 Score=15.11 Aligned_cols=133 Identities=17% Similarity=0.237 Sum_probs=58.2 Q ss_pred EEEEEEEECCCCCC--CCC-----HHCCCCCCCCCEEEEEEECCCCCCCCCCEEEECCCCCHHHHHCCC--CCCCEEECC Q ss_conf 89998652132389--887-----010178747507999840125875765214207998988997299--988268716 Q gi|254780589|r 75 VLLKEYKICAWSGT--FGP-----KIETGDEQAPEGFYYIGWNNLNPNSKYFLSINIGFPNEFDKAHNR--TGADLMIHG 145 (239) Q Consensus 75 ~l~ktY~Ic~~sg~--lGp-----K~~eGD~kTPEG~Y~i~~~~~np~S~y~ls~~i~YPN~~Dr~~gr--tG~~I~IHG 145 (239) ..--+|.||..-|. .|- +..+++ -|-.|.|.-+.- ---|.|- -+--.+-+.||.--++ .-..|||-= T Consensus 71 ~Y~Y~F~VC~~ag~~~~gaGLVQ~~k~~~k-~tviGriN~T~i--~~GSdWI-mLiYk~GDkYdsHC~~E~RkA~IMIsC 146 (278) T pfam02157 71 MYSYIFRVCREAGNHSSGAGLVQINKSNDK-ETVVGRFNETHI--FNGSNWI-MLIYKGGDEYDNHCGKEQRRAVVMISC 146 (278) T ss_pred CEEEEEEEECCCCCCCCCCEEEEECCCCCC-CEEEEEECCEEE--ECCCCEE-EEEECCCCCCCCCCCHHHHHCEEEEEE T ss_conf 258999972013788888516886057886-269875012278--3587459-998437861134344122203589873 Q ss_pred CC---------------------------CCCCCEEECH-----HHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHH Q ss_conf 77---------------------------8667187064-----669999999999997508561116995556898898 Q gi|254780589|r 146 EC---------------------------ASAGCYAMNN-----KQMQEIYAIVRDSLRGNMQSHIQIQAFPFRMTSKNM 193 (239) Q Consensus 146 ~~---------------------------~S~GC~amtd-----~~i~EIy~~v~~a~~~~gQ~~~~v~~~Pfrmt~~n~ 193 (239) .. +|+-|=|... .-+.-|+...-.++.-+|- -.+ -|-+-+..| T Consensus 147 n~~~~~~~f~vv~Eer~k~~~CfYLFEmdSsaaCp~~~s~LS~GSIllIif~~l~a~Ylv~G~---lyq--r~v~GAkG~ 221 (278) T pfam02157 147 NRHTLAANFNPVSEERGKVQDCFYLFEMDSSLACSPEISHLSVGSILLVIFASLVAVYIIGGF---LYQ--RLVVGAKGM 221 (278) T ss_pred CCCCCCCCCEEEEHHCCCCCCEEEEEEECCCCCCCCCCCCCCCCCEEHHHHHHHHHHHHHHHH---HHH--HHHHCCCCH T ss_conf 466556662232010165444179998436656786545677663301458999999999999---999--986425640 Q ss_pred HHHHCCCCHHHHHHHH----HHHHHHHHHC Q ss_conf 7542586737898535----6888999858 Q gi|254780589|r 194 QLYQNNPNYSFWNMLK----LGHDYFEKNH 219 (239) Q Consensus 194 ~~~~~~~~~~FW~~Lk----~gyd~Fe~~~ 219 (239) ..-||++||+++. .|-|+.-+++ T Consensus 222 ---Eq~Pn~~fW~~~gnL~ADGCdfvCR~~ 248 (278) T pfam02157 222 ---EQFPHLAFWQDLGNLVADGCDFVCRSK 248 (278) T ss_pred ---HCCCCHHHHHHCCCCCCCCCCEEEECC T ss_conf ---007656899872763346763477438 No 130 >COG4939 Major membrane immunogen, membrane-anchored lipoprotein [Function unknown] Probab=21.56 E-value=62 Score=13.83 Aligned_cols=23 Identities=17% Similarity=0.255 Sum_probs=17.8 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 94046899999999860466533 Q gi|254780589|r 1 MTNRYNILLFALFIFLNGCHHSR 23 (239) Q Consensus 1 M~~R~~~Llfal~i~L~gc~~~~ 23 (239) |..+-+..+++++++|.+|+.+. T Consensus 1 mvk~g~~~~~~~~~LL~aCg~sd 23 (147) T COG4939 1 MVKYGLVGMIVALSLLTACGKSD 23 (147) T ss_pred CCEEHHHHHHHHHHHHHHHCCCC T ss_conf 91303659999999998706664 No 131 >pfam08345 YscJ_FliF_C Flagellar M-ring protein C-terminal. This domain is found in bacterial flagellar M-ring (FliF) proteins together with the YscJ/FliF domain (pfam01514). Probab=21.49 E-value=62 Score=13.82 Aligned_cols=34 Identities=15% Similarity=0.375 Sum_probs=25.6 Q ss_pred EEECHHHHHHHHHHHHHHHHHCCC--CCEEEEEECC Q ss_conf 870646699999999999975085--6111699555 Q gi|254780589|r 153 YAMNNKQMQEIYAIVRDSLRGNMQ--SHIQIQAFPF 186 (239) Q Consensus 153 ~amtd~~i~EIy~~v~~a~~~~gQ--~~~~v~~~Pf 186 (239) -.+++++|++|-.||+.|+--..+ ..+.|..+|| T Consensus 127 ~~~s~ee~~~i~~lV~~AiG~d~~RGD~v~V~~~~F 162 (162) T pfam08345 127 VPRSPEELAQIEALVKSAVGFDAARGDSVTVVSMPF 162 (162) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCC T ss_conf 667999999999999996088877899899994689 No 132 >PRK09250 fructose-bisphosphate aldolase; Provisional Probab=21.00 E-value=64 Score=13.76 Aligned_cols=54 Identities=17% Similarity=0.315 Sum_probs=33.0 Q ss_pred CCCCCHHHHHCCCCCCCEEECCCCCCCCCEEEC----------HHHHHHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf 799898899729998826871677866718706----------46699999999999975085611169955 Q gi|254780589|r 124 IGFPNEFDKAHNRTGADLMIHGECASAGCYAMN----------NKQMQEIYAIVRDSLRGNMQSHIQIQAFP 185 (239) Q Consensus 124 i~YPN~~Dr~~grtG~~I~IHG~~~S~GC~amt----------d~~i~EIy~~v~~a~~~~gQ~~~~v~~~P 185 (239) +.|||.+|++.--+= ...+-.||.||. +++|+|+-.+.++|-.- |-. +=+-+|| T Consensus 138 L~~~~~~dQ~~~~sV------~dA~rLGa~AVG~TIY~GS~~~~~mi~E~~~i~~eAh~~-GL~-tVlW~Yp 201 (348) T PRK09250 138 LSYPNTYDQALTASV------EDALRLGAVAVGATIYFGSEESRRQIEEISEAFEEAHEL-GLA-TVLWSYL 201 (348) T ss_pred CCCCCCCCCCCCCCH------HHHHHCCCCEEEEEEECCCHHHHHHHHHHHHHHHHHHHC-CCE-EEEEECC T ss_conf 689998753400469------999854764654278469754799999999999999976-980-8999724 No 133 >PRK12389 glutamate-1-semialdehyde aminotransferase; Provisional Probab=20.80 E-value=64 Score=13.73 Aligned_cols=12 Identities=25% Similarity=0.465 Sum_probs=4.8 Q ss_pred CCCEEECHHHHH Q ss_conf 671870646699 Q gi|254780589|r 150 AGCYAMNNKQMQ 161 (239) Q Consensus 150 ~GC~amtd~~i~ 161 (239) +|++..+++-|+ T Consensus 276 i~av~g~~eim~ 287 (429) T PRK12389 276 IGAYGGRQDIME 287 (429) T ss_pred EEEEEEHHHHHH T ss_conf 488767799998 No 134 >COG2605 Predicted kinase related to galactokinase and mevalonate kinase [General function prediction only] Probab=20.68 E-value=58 Score=14.01 Aligned_cols=35 Identities=11% Similarity=0.189 Sum_probs=22.5 Q ss_pred EECHHHHHHHHHHHHHH-------HHHCCCCCEEEEEECCCC Q ss_conf 70646699999999999-------975085611169955568 Q gi|254780589|r 154 AMNNKQMQEIYAIVRDS-------LRGNMQSHIQIQAFPFRM 188 (239) Q Consensus 154 amtd~~i~EIy~~v~~a-------~~~~gQ~~~~v~~~Pfrm 188 (239) .+||+.||.||.++.+. +-+|||-..-|-+=|=+- T Consensus 262 ~ISN~~IDriy~~A~~~GA~~gKl~GaG~gGFllf~~~p~k~ 303 (333) T COG2605 262 RISNDAIDRIYELALKNGAYGGKLSGAGGGGFLLFFCDPSKR 303 (333) T ss_pred CCCCHHHHHHHHHHHHCCCHHCEEECCCCCCEEEEEECCCCH T ss_conf 767477899999998667322223026886279999685424 No 135 >PRK13908 putative recombination protein RecO; Provisional Probab=20.55 E-value=45 Score=14.67 Aligned_cols=32 Identities=16% Similarity=0.407 Sum_probs=23.5 Q ss_pred CCEEECCCCC---------------CCCCEEECHHHHHHHHHHHHHH Q ss_conf 8268716778---------------6671870646699999999999 Q gi|254780589|r 139 ADLMIHGECA---------------SAGCYAMNNKQMQEIYAIVRDS 170 (239) Q Consensus 139 ~~I~IHG~~~---------------S~GC~amtd~~i~EIy~~v~~a 170 (239) +.+--|..|. +-+|+-..|++|+++|.++.++ T Consensus 157 ~flpaH~~C~~~~~f~~~ki~~~f~~k~ti~l~D~evd~Lw~il~~G 203 (204) T PRK13908 157 GFLPAHPSCILGKSFDLKKIKAFFRTKSTIFLEDEEVEELWKILLEG 203 (204) T ss_pred HHHCCCHHHHCCCCCCHHHHHHHHHHCCCEECCHHHHHHHHHHHHCC T ss_conf 75123765527554589999999974784304878899999999753 No 136 >pfam05643 DUF799 Putative bacterial lipoprotein (DUF799). This family consists of several bacterial proteins of unknown function. Some of the family members are described as putative lipoproteins. Probab=20.33 E-value=66 Score=13.67 Aligned_cols=19 Identities=26% Similarity=0.357 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHCCCCCC Q ss_conf 4689999999986046653 Q gi|254780589|r 4 RYNILLFALFIFLNGCHHS 22 (239) Q Consensus 4 R~~~Llfal~i~L~gc~~~ 22 (239) +.+++.++..++|+||... T Consensus 2 K~lil~l~~~l~L~gCa~~ 20 (215) T pfam05643 2 KPLILGLAAVLALSACQVQ 20 (215) T ss_pred HHHHHHHHHHHHHHHCCCC T ss_conf 3149999999998623489 No 137 >COG3317 NlpB Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=20.28 E-value=66 Score=13.66 Aligned_cols=21 Identities=19% Similarity=0.328 Sum_probs=13.8 Q ss_pred CCHHHHHHHHHHHHHCCCCCC Q ss_conf 404689999999986046653 Q gi|254780589|r 2 TNRYNILLFALFIFLNGCHHS 22 (239) Q Consensus 2 ~~R~~~Llfal~i~L~gc~~~ 22 (239) ..+-.++.+++++.|+||+.. T Consensus 3 ~~~~lv~~al~v~~LaaCSs~ 23 (342) T COG3317 3 SSAKLVLGALLVLLLAACSSD 23 (342) T ss_pred HHHHHHHHHHHHHHHHHCCCC T ss_conf 178899999999987414688 No 138 >PRK12450 foldase protein PrsA; Reviewed Probab=20.11 E-value=57 Score=14.05 Aligned_cols=12 Identities=33% Similarity=0.617 Sum_probs=8.6 Q ss_pred HHHHHHCCCCCC Q ss_conf 999986046653 Q gi|254780589|r 11 ALFIFLNGCHHS 22 (239) Q Consensus 11 al~i~L~gc~~~ 22 (239) +.++.|+||+++ T Consensus 15 ~svl~LaAC~s~ 26 (309) T PRK12450 15 ATVVTLSACQSS 26 (309) T ss_pred HHHHHHHHHCCC T ss_conf 999999860589 No 139 >TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ. Probab=20.06 E-value=67 Score=13.64 Aligned_cols=18 Identities=33% Similarity=0.477 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHCCCCCC Q ss_conf 689999999986046653 Q gi|254780589|r 5 YNILLFALFIFLNGCHHS 22 (239) Q Consensus 5 ~~~Llfal~i~L~gc~~~ 22 (239) .++++++++++|+||+.. T Consensus 2 ~~~~~~~~~~~l~gCs~~ 19 (377) T TIGR03300 2 RLALVIALAALLSGCSWF 19 (377) T ss_pred CCHHHHHHHHHHHHCCCC T ss_conf 238999999998556576 Done!