Query gi|254780745|ref|YP_003065158.1| OstA family protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 181 No_of_seqs 112 out of 1078 Neff 8.1 Searched_HMMs 39220 Date Sun May 29 19:11:24 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780745.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10894 hypothetical protein; 100.0 1E-37 2.6E-42 222.9 19.6 162 1-179 1-168 (184) 2 TIGR03002 outer_YhbN cell enve 100.0 2.4E-34 6.2E-39 204.6 16.4 135 36-178 1-150 (150) 3 COG1934 Uncharacterized protei 100.0 3.7E-32 9.4E-37 192.8 17.4 140 31-179 19-164 (173) 4 pfam03968 OstA OstA-like prote 100.0 7.4E-30 1.9E-34 180.3 17.3 116 37-160 20-141 (141) 5 PRK03761 organic solvent toler 99.7 5.9E-15 1.5E-19 99.5 19.5 123 39-165 46-198 (782) 6 COG1452 Imp Organic solvent to 99.5 2.2E-12 5.5E-17 85.6 17.2 83 40-131 47-131 (784) 7 PRK04423 organic solvent toler 99.5 1.2E-12 3.2E-17 86.9 15.5 110 42-165 57-197 (798) 8 pfam06835 DUF1239 Protein of u 98.9 2.8E-07 7.1E-12 57.9 17.0 121 38-175 46-175 (176) 9 PRK03761 organic solvent toler 98.7 2.2E-07 5.6E-12 58.5 10.9 83 70-166 48-136 (782) 10 COG1452 Imp Organic solvent to 98.6 1.9E-07 5E-12 58.8 8.2 112 45-165 82-195 (784) 11 pfam06835 DUF1239 Protein of u 98.6 9.4E-07 2.4E-11 55.0 11.4 82 74-164 53-139 (176) 12 COG1934 Uncharacterized protei 98.4 1.1E-05 2.9E-10 49.2 13.1 87 44-139 61-157 (173) 13 PRK04423 organic solvent toler 98.4 2.1E-06 5.4E-11 53.1 8.5 81 71-166 57-137 (798) 14 PRK10893 hypothetical protein; 98.3 5.8E-05 1.5E-09 45.4 14.5 118 45-180 60-187 (190) 15 pfam03968 OstA OstA-like prote 98.3 1.7E-05 4.3E-10 48.3 10.9 86 72-166 26-118 (141) 16 TIGR03002 outer_YhbN cell enve 98.1 4.8E-05 1.2E-09 45.8 10.7 82 45-135 39-134 (150) 17 PRK10894 hypothetical protein; 97.8 8.8E-05 2.3E-09 44.4 7.3 86 44-138 64-160 (184) 18 COG3117 Uncharacterized protei 97.3 0.0034 8.7E-08 35.7 9.6 122 42-181 57-188 (188) 19 PRK10893 hypothetical protein; 96.8 0.017 4.3E-07 32.0 9.2 111 35-162 31-144 (190) 20 COG3117 Uncharacterized protei 94.4 0.081 2.1E-06 28.3 4.8 109 37-164 33-146 (188) 21 COG5375 Uncharacterized protei 93.4 0.46 1.2E-05 24.2 9.7 122 40-178 87-214 (216) 22 PRK11397 dacD D-alanyl-D-alani 92.7 0.33 8.3E-06 25.0 5.5 54 9-70 6-60 (390) 23 PRK09918 putative periplasmic 83.4 2.8 7.2E-05 19.9 5.2 19 8-26 3-21 (230) 24 PRK11385 hypothetical protein; 78.0 4.4 0.00011 18.9 5.2 22 1-22 10-31 (245) 25 PRK10064 colicin I receptor; P 78.0 1.7 4.2E-05 21.2 2.3 39 5-51 1-39 (663) 26 PRK11669 pbpG D-alanyl-D-alani 76.1 5 0.00013 18.6 4.3 29 42-70 33-62 (308) 27 PRK10001 D-alanyl-D-alanine ca 67.0 8.2 0.00021 17.4 3.9 54 5-68 5-59 (400) 28 TIGR00247 TIGR00247 conserved 61.1 10 0.00027 16.9 3.3 28 1-28 7-34 (373) 29 pfam07172 GRP Glycine rich pro 60.8 11 0.00028 16.8 3.8 10 37-46 32-41 (91) 30 PRK13483 enterobactin receptor 60.7 11 0.00028 16.8 3.5 44 5-51 2-45 (663) 31 PRK04517 hypothetical protein; 55.9 13 0.00034 16.3 5.5 42 48-89 50-93 (216) 32 PRK09926 putative chaperone pr 50.5 16 0.00042 15.8 4.8 19 11-29 6-24 (245) 33 PRK13524 outer membrane recept 48.8 17 0.00045 15.6 3.2 31 21-51 15-45 (742) 34 TIGR02856 spore_yqfC sporulati 48.5 18 0.00045 15.6 4.2 38 129-166 25-62 (87) 35 COG4856 Uncharacterized protei 46.5 19 0.00048 15.4 6.4 10 1-10 1-10 (403) 36 PRK02889 tolB translocation pr 46.2 17 0.00042 15.8 2.4 24 4-27 3-26 (430) 37 PRK04405 prsA peptidylprolyl i 43.2 21 0.00055 15.2 3.0 26 1-26 1-26 (298) 38 PRK13513 putative outer membra 42.4 22 0.00057 15.1 4.4 45 1-52 1-45 (660) 39 pfam07241 consensus 36.4 28 0.00071 14.6 3.9 41 45-85 25-69 (87) 40 PRK06518 hypothetical protein; 34.5 30 0.00076 14.4 5.5 48 2-76 5-53 (192) 41 PRK10626 hypothetical protein; 33.4 31 0.0008 14.3 3.8 43 38-86 25-68 (239) 42 TIGR01626 ytfJ_HI0045 conserve 31.5 26 0.00066 14.7 1.5 58 13-71 8-65 (184) 43 PRK10871 nlpD lipoprotein NlpD 30.6 35 0.00089 14.0 4.8 76 99-181 289-364 (374) 44 COG3054 Predicted transcriptio 30.0 36 0.00091 14.0 2.9 26 45-71 40-65 (184) 45 PRK10793 D-alanyl-D-alanine ca 28.9 37 0.00096 13.9 4.1 28 41-68 38-66 (403) 46 TIGR00383 corA magnesium and c 27.2 40 0.001 13.7 2.5 25 2-26 267-291 (339) 47 COG2968 Uncharacterized conser 24.6 45 0.0012 13.4 3.1 14 119-132 103-117 (243) 48 PRK13684 Ycf48-like protein; P 24.4 46 0.0012 13.4 13.9 65 1-70 1-70 (333) 49 pfam12046 DUF3529 Protein of u 24.1 46 0.0012 13.4 5.3 18 53-70 62-79 (173) 50 cd00982 gltB_C gltb_C. This do 24.1 46 0.0012 13.4 2.7 47 40-87 15-61 (251) 51 PRK01904 hypothetical protein; 23.6 47 0.0012 13.3 5.8 42 48-89 47-91 (217) 52 pfam07873 YabP YabP family. Th 23.4 48 0.0012 13.3 4.2 32 52-83 12-46 (66) 53 pfam10661 EssA Firmicute eSAT- 22.8 39 0.00099 13.8 1.1 14 41-54 32-45 (166) 54 pfam05984 Cytomega_UL20A Cytom 22.4 50 0.0013 13.2 3.4 17 61-77 44-60 (102) 55 PRK00378 nucleoid-associated p 22.1 51 0.0013 13.1 1.6 23 112-134 298-320 (334) 56 pfam04245 NA37 37-kD nucleoid- 21.4 53 0.0013 13.1 1.5 22 112-133 301-322 (334) 57 pfam10916 DUF2712 Protein of u 20.8 54 0.0014 13.0 2.9 27 1-27 1-27 (146) 58 pfam07271 Cytadhesin_P30 Cytad 20.8 54 0.0014 13.0 1.5 26 4-29 5-30 (279) No 1 >PRK10894 hypothetical protein; Provisional Probab=100.00 E-value=1e-37 Score=222.87 Aligned_cols=162 Identities=18% Similarity=0.174 Sum_probs=134.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEE Q ss_conf 91346788999999999999987774332100012244577784799911699992388799980089997875999968 Q gi|254780745|r 1 MSAFLNSIHLYIHLLALVFVLLKADLSQAAASYMTRFKVLGNEKIHIKADMLEVKDAVQKAFFKGNVFMTQEDFSLQADK 80 (181) Q Consensus 1 ~~~~~~~~~~~~~~~~~l~~~~~~~~a~~~~~~~~~~~~d~~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~qg~~~i~aD~ 80 (181) |+.+.+...+.+.++++|++++.+++| ++.|++|||+|+||++++|+++++++|+|||+++||+++|+||+ T Consensus 1 ~~~~~~~~~~~l~l~~~ll~~S~~a~A---------l~sD~~qPI~I~AD~~~~d~~~~~avytGNVvitQGsl~I~ADk 71 (184) T PRK10894 1 MKFKTNKLSLNLLLASSLLAASIPAFA---------VTGDTEQPIHIDSDQQSLDMQGNVVTFTGNVVVTQGTIKINADK 71 (184) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCHHHH---------CCCCCCCCEEEEECEEEEECCCCEEEEEEEEEEEEEEEEEEEEE T ss_conf 953223420669999999871612431---------55344699899945289431599899992599996259999358 Q ss_pred EEEEECCCCCCCCCCEEEEEECCCCCC------CCCCCCEEEEEEEEECCCCEEEEEECCCCEEEEEECCCEEEEEEEEE Q ss_conf 999961766544420012220122202------45565332112799416658999947863499971797699229999 Q gi|254780745|r 81 MTIDYNNTNRDVSNKINRMDVERNIFI------QSGEINVIASNGYVDFQKRILVLNGDRADKVILKEKLNTFLGCKLIV 154 (181) Q Consensus 81 l~i~~~~~~~~~~~~i~~~~a~gnv~~------~~~~~~~~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~~~i~gdki~y 154 (181) ++++++++.. ....+.|.|++.. ..+...++|++++|+..++.++|+|||. |+|+++.++|++|+| T Consensus 72 v~v~~~~~~~----~~~~i~a~G~pA~f~Q~~~~g~~i~a~A~~i~Y~~~~~~i~L~GnA~----L~Q~~~~v~G~~I~Y 143 (184) T PRK10894 72 VVVTRPGGEQ----GKEVIDGYGNPATFYQMQDNGKPVKGHASKMHYELAKDFVVLTGNAY----LEQLDSNIKGDKITY 143 (184) T ss_pred EEEEECCCCC----CEEEEEEECCCEEEEEECCCCCCEEEEEEEEEEECCCCEEEEECCEE----EEECCCEEEECEEEE T ss_conf 9999769975----37999996893799996568981376974899982699999976589----996899899378999 Q ss_pred EECCCEEEEEECCCCEEEEEEECCC Q ss_conf 9216589999289980999981187 Q gi|254780745|r 155 NIDTSFASLQGCESDQVQSIIRYDG 179 (181) Q Consensus 155 n~~tg~~~~~g~~~~rV~~i~~~~~ 179 (181) |++++++.+.|++++||++||+|.. T Consensus 144 dl~~~~~~a~s~~~~RV~~vi~P~~ 168 (184) T PRK10894 144 LVKEQKMQAFSDKGKRVTTVLVPSQ 168 (184) T ss_pred EECCCEEEEECCCCCEEEEEECCCC T ss_conf 9066799996799980799989401 No 2 >TIGR03002 outer_YhbN cell envelope biogenesis protein YhbN; InterPro: IPR014340 Members of this entry include YhbN, which is an essential protein in Escherichia coli. It is implicated in cell envelope integrity, and has numerous homologues in other Proteobacteria in a conserved location near lipopolysaccharide inner core biosynthesis genes. This entry is distantly related to the organic solvent tolerance protein (OstA).. Probab=100.00 E-value=2.4e-34 Score=204.60 Aligned_cols=135 Identities=24% Similarity=0.300 Sum_probs=121.6 Q ss_pred CCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCEEEEEECCC-C-CCC----C Q ss_conf 244577784799911699992388799980089997875999968999961766544420012220122-2-024----5 Q gi|254780745|r 36 RFKVLGNEKIHIKADMLEVKDAVQKAFFKGNVFMTQEDFSLQADKMTIDYNNTNRDVSNKINRMDVERN-I-FIQ----S 109 (181) Q Consensus 36 ~~~~d~~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~qg~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gn-v-~~~----~ 109 (181) ++++|+++||+|+||++++|+++++++|+|||+|+||++.|+||+|+++++.. .+.+..+.|+|+ + ++. . T Consensus 1 a~~~D~~qPi~I~Ad~~~~D~~~~~~~f~GnV~itQG~~~I~ADkv~v~~~~~----~~~~~~~~A~G~~PA~F~q~~d~ 76 (150) T TIGR03002 1 ALSSDREQPIEIEADSQTLDDKKNVVTFTGNVVITQGTLKIRADKVVVTRPND----GGGIEKATATGNKPATFRQKLDD 76 (150) T ss_pred CCCCCCCCCEEEEEEEEEEECCCCEEEEECCEEEEECCEEEECCEEEEEECCC----CCCEEEEEEECCCCEEEEEEECC T ss_conf 97787568689994006654028838886565878297598522889882688----78608999980781898899559 Q ss_pred C-C-CCEEEEEEEEECCCCEEEEEECCCCEEEEEECC-CEEEEEEEEEEECCCEEEEEECCC------CEEEEEEECC Q ss_conf 5-6-533211279941665899994786349997179-769922999992165899992899------8099998118 Q gi|254780745|r 110 G-E-INVIASNGYVDFQKRILVLNGDRADKVILKEKL-NTFLGCKLIVNIDTSFASLQGCES------DQVQSIIRYD 178 (181) Q Consensus 110 ~-~-~~~~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~-~~i~gdki~yn~~tg~~~~~g~~~------~rV~~i~~~~ 178 (181) + . ..++|++++|++.++.++|+|||+ |+|++ |+++|++|+||+.++.+.+.|+++ +||++||+|. T Consensus 77 g~~~~~g~A~~i~Y~~~~~~~~Ltg~A~----v~Q~~k~~~~G~~I~Y~~~~q~~~a~~~~~~~~~~~~RV~~ii~P~ 150 (150) T TIGR03002 77 GKEYIEGQANRIEYDPAKDTVVLTGNAR----VKQGGKNEVSGDVITYDTKKQKVTAKGGGGSAKGASGRVTTIIQPK 150 (150) T ss_pred CCEEEEEECCEEEEECCCCEEEEECCEE----EEECCCCEEECCEEEEECEEEEEEEEECCCCCCCCCCCEEEEEECC T ss_conf 9827999711899932577999912349----9788977787668898440129989807888998697179998409 No 3 >COG1934 Uncharacterized protein conserved in bacteria [Function unknown] Probab=100.00 E-value=3.7e-32 Score=192.76 Aligned_cols=140 Identities=23% Similarity=0.261 Sum_probs=119.6 Q ss_pred HCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCC----C Q ss_conf 000122445777847999116999923887999800899978759999689999617665444200122201222----0 Q gi|254780745|r 31 ASYMTRFKVLGNEKIHIKADMLEVKDAVQKAFFKGNVFMTQEDFSLQADKMTIDYNNTNRDVSNKINRMDVERNI----F 106 (181) Q Consensus 31 ~~~~~~~~~d~~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~qg~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv----~ 106 (181) .+.+.+++.|+++||+|+||++++|++++.++|+|||+++||+++|+||+++++++.+... ..+.+.|+. . T Consensus 19 ~~~a~al~~d~~qPi~i~ad~~el~~~~~~a~ftGNV~i~qG~~~l~Adkv~v~~~~~~~~-----~~i~a~gn~~~~~q 93 (173) T COG1934 19 SSPAFALKGDRDQPITIEADQQELDDKNGVATFTGNVVVTQGTITLRADKVTVTRDKEGGK-----EVIEANGNPATFSQ 93 (173) T ss_pred CCHHHHCCCCCCCCEEEECCCEEEECCCCEEEEECCEEEEECCCEEEEEEEEEECCCCCCC-----EEEEEECCEEEEEE T ss_conf 0102322356558779972522430668779997638998466179713699980578985-----48999546667887 Q ss_pred CC--CCCCCEEEEEEEEECCCCEEEEEECCCCEEEEEECCCEEEEEEEEEEECCCEEEEEECCCCEEEEEEECCC Q ss_conf 24--55653321127994166589999478634999717976992299999216589999289980999981187 Q gi|254780745|r 107 IQ--SGEINVIASNGYVDFQKRILVLNGDRADKVILKEKLNTFLGCKLIVNIDTSFASLQGCESDQVQSIIRYDG 179 (181) Q Consensus 107 ~~--~~~~~~~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~~~i~gdki~yn~~tg~~~~~g~~~~rV~~i~~~~~ 179 (181) .. .....++|+++.|+..+++++|+||++ +.+.+|.++|++|+|++++|.+.++|+.++|||++|.|+. T Consensus 94 ~~~~~k~~~g~ad~~~Y~~~~~~~vLtGna~----v~q~gn~i~G~kIty~~~~~~~~a~~~~~~RV~~V~~p~~ 164 (173) T COG1934 94 KVDNGKTQKGTADQAIYDLKNDTIVLTGNAY----VLQLGNVIRGDKITYLTKTGRAVAESSKGGRVRTVLDPED 164 (173) T ss_pred EECCCCCEEEEEEEEEEEECCCEEEEECCEE----EEECCCEEEEEEEEEECCCCEEEECCCCCCCEEEEECCHH T ss_conf 6507850388851699982078999947799----9857988876399999434579962889984799988055 No 4 >pfam03968 OstA OstA-like protein. This family of proteins are mostly uncharacterized. However the family does include E. coli OstA that has been characterized as an organic solvent tolerance protein. Probab=99.97 E-value=7.4e-30 Score=180.28 Aligned_cols=116 Identities=29% Similarity=0.408 Sum_probs=102.4 Q ss_pred CCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCEEEEEECC------CCCCCCC Q ss_conf 4457778479991169999238879998008999787599996899996176654442001222012------2202455 Q gi|254780745|r 37 FKVLGNEKIHIKADMLEVKDAVQKAFFKGNVFMTQEDFSLQADKMTIDYNNTNRDVSNKINRMDVER------NIFIQSG 110 (181) Q Consensus 37 ~~~d~~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~qg~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~g------nv~~~~~ 110 (181) ++.|+++||+|+||++++|++++.++|+|||+++||+++|+||+++++++++.. .+..+.+.| ++....+ T Consensus 20 ~~~D~~~PI~I~AD~~~~d~~~~~~~~~GnV~i~QG~~~i~ad~~~i~~~~~~~----~~~~~~a~g~~~~~~~~~~~~~ 95 (141) T pfam03968 20 AQADRDQPIEIEADNLEADDKKGVATFTGNVVVTQGSITLRADKVVVTQDKKSN----GIERLEATGPATFRQKVDLVGR 95 (141) T ss_pred CCCCCCCCEEEEEEEEEEECCCCEEEEECCEEEEECCEEEEEEEEEEEECCCCC----CEEEEEEECCEEEEEEEECCCC T ss_conf 432679998999457999827999999874999808709996489999868889----6799999776049998614888 Q ss_pred CCCEEEEEEEEECCCCEEEEEECCCCEEEEEECCCEEEEEEEEEEECCCE Q ss_conf 65332112799416658999947863499971797699229999921658 Q gi|254780745|r 111 EINVIASNGYVDFQKRILVLNGDRADKVILKEKLNTFLGCKLIVNIDTSF 160 (181) Q Consensus 111 ~~~~~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~~~i~gdki~yn~~tg~ 160 (181) ...++|++++|++.++.++|+||+ .++|++|.++|++|+||+++|+ T Consensus 96 ~~~a~a~~~~Y~~~~~~i~L~Gna----~l~q~~n~i~G~~I~ydl~~g~ 141 (141) T pfam03968 96 QIRGTADRAEYDSKKRIIVLTGNA----VLTQGGNSIRGDKITYDIKEGK 141 (141) T ss_pred CEEEEEEEEEEEECCCEEEEECCE----EEEECCCEEEECEEEEECCCCC T ss_conf 679997189999169999997339----9998999999689999976798 No 5 >PRK03761 organic solvent tolerance protein; Provisional Probab=99.71 E-value=5.9e-15 Score=99.52 Aligned_cols=123 Identities=19% Similarity=0.268 Sum_probs=95.3 Q ss_pred CCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEE Q ss_conf 57778479991169999238879998008999787599996899996176654442001222012220245565332112 Q gi|254780745|r 39 VLGNEKIHIKADMLEVKDAVQKAFFKGNVFMTQEDFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEINVIASN 118 (181) Q Consensus 39 ~d~~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~qg~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~~~~a~~ 118 (181) ...++||+|+||++++++ .+.++|+|||.|.||+..|.||++++..+++.. .....+.|.|||.+.++...+.|++ T Consensus 46 ~~~~~pi~i~AD~~~~~~-~~~~~~~GnV~i~~g~~~l~Ad~v~~~q~~~~~---~~~~~~~a~Gnv~~~~~~~~~~g~~ 121 (782) T PRK03761 46 DPNDLPVTINADNAEANY-PDKAVYTGNVDIKQGNSRLQADEVQLHQQEAPG---EPVRTVDAEGNVHYDDNQIILKGPK 121 (782) T ss_pred CCCCCCEEEEECCEEECC-CCEEEEEEEEEEEECCEEEECCEEEEECCCCCC---CCCEEEEEECCEEEECCCEEEEECE T ss_conf 988884799933246258-977999933999979989995679995237866---6502899868989968961898146 Q ss_pred EEEECCCCEEEEEE------------CCCC-------EE-EEEEC----------CCEEEEEEEEEEECCCEEEEEE Q ss_conf 79941665899994------------7863-------49-99717----------9769922999992165899992 Q gi|254780745|r 119 GYVDFQKRILVLNG------------DRAD-------KV-ILKEK----------LNTFLGCKLIVNIDTSFASLQG 165 (181) Q Consensus 119 ~~y~~~~~~~~l~G------------na~~-------~v-~l~q~----------~~~i~gdki~yn~~tg~~~~~g 165 (181) +.++..++...+.. .|.. .. +|.++ .=.|++.+|++|.+++.+.+.. T Consensus 122 ~~~~~~~~~~~~~~~~y~~~~~~~~g~A~~~~~~~~~~~~~l~~~~yTtC~p~~~~W~i~A~~I~~D~~~~~~~~~n 198 (782) T PRK03761 122 AWSNLNTKDTNVWNGDYQMVGRQGRGKADLMKQRGENRYTILENGSFTSCLPGDNSWSVVGSEIIHDREEEVAEIWN 198 (782) T ss_pred EEEECCCCEEEEECCEEEEECCCCEEEEEEEEECCCCCEEEECCEEECCCCCCCCCEEEEEEEEEEECCCCEEEEEE T ss_conf 99953776459962489997378604700259917984899817188308998993399945899917778899980 No 6 >COG1452 Imp Organic solvent tolerance protein OstA [Cell envelope biogenesis, outer membrane] Probab=99.53 E-value=2.2e-12 Score=85.60 Aligned_cols=83 Identities=25% Similarity=0.247 Sum_probs=73.7 Q ss_pred CCCCC-EEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCC-CCCCEEEE Q ss_conf 77784-79991169999238879998008999787599996899996176654442001222012220245-56533211 Q gi|254780745|r 40 LGNEK-IHIKADMLEVKDAVQKAFFKGNVFMTQEDFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQS-GEINVIAS 117 (181) Q Consensus 40 d~~~P-i~I~AD~~~~d~~~~~~~~~GNV~i~qg~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~-~~~~~~a~ 117 (181) ....| +.|+||.++++.+++.++|+|||+|.+|+..|.||++.+..+++ .+.|.|||.+.. +....+|+ T Consensus 47 ~~~~p~~~iea~~~~~~~~~~~ita~GnV~i~~~~~~l~Ad~v~y~q~t~---------~~~A~GnV~~~~~~g~~i~gd 117 (784) T COG1452 47 EQNLPKILIEADALEYNRKDQIITAEGNVEILYGDYRLLADKVDYDQKTD---------RVTATGNVEYIDGDGILIKGD 117 (784) T ss_pred CCCCCCEEEEHHHHCCCCCCCEEEEECEEEEEECCEEEEEEEEEEECCCC---------EEEEECCEEEECCCCCEEECC T ss_conf 45698458980020014778569996629999688899875899723478---------899977799982898099875 Q ss_pred EEEEECCCCEEEEE Q ss_conf 27994166589999 Q gi|254780745|r 118 NGYVDFQKRILVLN 131 (181) Q Consensus 118 ~~~y~~~~~~~~l~ 131 (181) ++.++.+.....+. T Consensus 118 ~a~~~ld~~~g~~~ 131 (784) T COG1452 118 KADVNLDTKKGFVS 131 (784) T ss_pred EEEEECCCCCCEEE T ss_conf 68996145764586 No 7 >PRK04423 organic solvent tolerance protein; Provisional Probab=99.52 E-value=1.2e-12 Score=86.91 Aligned_cols=110 Identities=19% Similarity=0.223 Sum_probs=85.2 Q ss_pred CCCEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEEEE Q ss_conf 78479991169999238879998008999787599996899996176654442001222012220245565332112799 Q gi|254780745|r 42 NEKIHIKADMLEVKDAVQKAFFKGNVFMTQEDFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEINVIASNGYV 121 (181) Q Consensus 42 ~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~qg~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~~~~a~~~~y 121 (181) .+|+.|+||+++... .+++|+|||.+.||+..|.||++++...++ .+.|.|||.+.++.....|+++.+ T Consensus 57 ~~p~~v~aD~~~~~~--~~~~f~GnV~i~qg~~~l~AD~v~ydq~~~---------~v~A~GnV~~~~~~~~~~g~~~~~ 125 (798) T PRK04423 57 QLPTDIEGDQLSGTS--TTPQYQGNVALKRGDQFLGADNLRMDTETG---------NYIAEGNVRYQDTSIRMVADRAEG 125 (798) T ss_pred CCCEEEECCCCCCCC--CCCEEEEEEEEEECCEEEEEEEEEEECCCC---------EEEEECCEEEECCCCEEECCEEEE T ss_conf 698588764023557--751798559999799899911799986789---------899977789967985898142899 Q ss_pred ECCCCEEEEEECCCCEEEEEECCC-------------------------------EEEEEEEEEEECCCEEEEEE Q ss_conf 416658999947863499971797-------------------------------69922999992165899992 Q gi|254780745|r 122 DFQKRILVLNGDRADKVILKEKLN-------------------------------TFLGCKLIVNIDTSFASLQG 165 (181) Q Consensus 122 ~~~~~~~~l~Gna~~~v~l~q~~~-------------------------------~i~gdki~yn~~tg~~~~~g 165 (181) +..+....|.+-. ..+.+... .+++.+|.+|.+++.+.+.. T Consensus 126 ~l~~~~g~~~~~~---y~l~~~~~~G~A~~~~~~g~~~~l~~~~~ttC~~~~~~W~~~a~~i~~d~~~~~~~~~~ 197 (798) T PRK04423 126 NQDTDTHKITNIQ---YQLVERRGNGDAESVDLQGQVGQMHRSTYTTCDPSQPIWRLRAPEIDVDNDEGFGTARN 197 (798) T ss_pred ECCCCEEEEECCE---EEEEECCCCEEEEEEEECCCEEEEECCEECCCCCCCCCEEEEECEEEEECCCCEEEEEE T ss_conf 8267647996678---99994677236677674486479957198328988983599945799807788899994 No 8 >pfam06835 DUF1239 Protein of unknown function (DUF1239). This family consists of several hypothetical bacterial proteins of around 190 residues in length. The function of this family is unknown. Probab=98.93 E-value=2.8e-07 Score=57.91 Aligned_cols=121 Identities=19% Similarity=0.216 Sum_probs=95.2 Q ss_pred CCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEEC----CEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCC-- Q ss_conf 45777847999116999923887999800899978----759999689999617665444200122201222024556-- Q gi|254780745|r 38 KVLGNEKIHIKADMLEVKDAVQKAFFKGNVFMTQE----DFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGE-- 111 (181) Q Consensus 38 ~~d~~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~qg----~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~-- 111 (181) ..+.....+++|+++++...++.+.++.-+..... ...++|++..++-+.+ .+...|+|.+.+.. T Consensus 46 d~~G~l~~~l~a~~~~~~~~~~~~~l~~P~~~~~~~~~~~w~i~A~~g~~~~~~~---------~i~L~g~V~i~~~~~~ 116 (176) T pfam06835 46 DEQGKLQYRLTADKLEHYPDDDTTEFTNPVLIIYQDGKPPWTITADRAKLNKDGQ---------TLYLYGNVVIQSLEDK 116 (176) T ss_pred CCCCCEEEEEEECEEEEECCCCCEEEEEEEEEEECCCCCEEEEEECEEEEECCCC---------EEEEECCEEEEECCCC T ss_conf 9999899999904579965899689996399997799836999979499937996---------9999899999976899 Q ss_pred ---CCEEEEEEEEECCCCEEEEEECCCCEEEEEECCCEEEEEEEEEEECCCEEEEEECCCCEEEEEE Q ss_conf ---5332112799416658999947863499971797699229999921658999928998099998 Q gi|254780745|r 112 ---INVIASNGYVDFQKRILVLNGDRADKVILKEKLNTFLGCKLIVNIDTSFASLQGCESDQVQSII 175 (181) Q Consensus 112 ---~~~~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~~~i~gdki~yn~~tg~~~~~g~~~~rV~~i~ 175 (181) .....+.+.|++.++.+.-.. +|.+.++.+.++|..+.+|++++++++.| +||+.+ T Consensus 117 ~~~~~l~T~~l~~~~~~~~~~td~----~V~i~~~~~~~~g~gm~~~l~~~~~~l~~----~Vr~~y 175 (176) T pfam06835 117 SRLTTITTENLTVNLDTQDASTDQ----PVTITGPNFVSTGTGMKANLKTGTAELLS----NVRGRY 175 (176) T ss_pred CCCEEEEEHEEEEECCCCEEEECC----CEEEEECCCEEEEEEEEEECCCCEEEEEC----CCEEEE T ss_conf 875699922099994777999487----49999099799988899997589999943----616995 No 9 >PRK03761 organic solvent tolerance protein; Provisional Probab=98.73 E-value=2.2e-07 Score=58.49 Aligned_cols=83 Identities=17% Similarity=0.197 Sum_probs=71.0 Q ss_pred EECCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEEEEECCCC------EEEEEECCCCEEEEEEC Q ss_conf 978759999689999617665444200122201222024556533211279941665------89999478634999717 Q gi|254780745|r 70 TQEDFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEINVIASNGYVDFQKR------ILVLNGDRADKVILKEK 143 (181) Q Consensus 70 ~qg~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~~~~a~~~~y~~~~~------~~~l~Gna~~~v~l~q~ 143 (181) .+-...|.||++++..+. ++.+.|||.+.+++....||++.|+..++ +++..|| |.+.++ T Consensus 48 ~~~pi~i~AD~~~~~~~~----------~~~~~GnV~i~~g~~~l~Ad~v~~~q~~~~~~~~~~~~a~Gn----v~~~~~ 113 (782) T PRK03761 48 NDLPVTINADNAEANYPD----------KAVYTGNVDIKQGNSRLQADEVQLHQQEAPGEPVRTVDAEGN----VHYDDN 113 (782) T ss_pred CCCCEEEEECCEEECCCC----------EEEEEEEEEEEECCEEEECCEEEEECCCCCCCCCEEEEEECC----EEEECC T ss_conf 888479993324625897----------799993399997998999567999523786665028998689----899689 Q ss_pred CCEEEEEEEEEEECCCEEEEEEC Q ss_conf 97699229999921658999928 Q gi|254780745|r 144 LNTFLGCKLIVNIDTSFASLQGC 166 (181) Q Consensus 144 ~~~i~gdki~yn~~tg~~~~~g~ 166 (181) ...++|+++.+|+++..+.+... T Consensus 114 ~~~~~g~~~~~~~~~~~~~~~~~ 136 (782) T PRK03761 114 QIILKGPKAWSNLNTKDTNVWNG 136 (782) T ss_pred CEEEEECEEEEECCCCEEEEECC T ss_conf 61898146999537764599624 No 10 >COG1452 Imp Organic solvent tolerance protein OstA [Cell envelope biogenesis, outer membrane] Probab=98.63 E-value=1.9e-07 Score=58.76 Aligned_cols=112 Identities=15% Similarity=0.185 Sum_probs=65.8 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEEE-CCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEEEEEC Q ss_conf 799911699992388799980089997-8759999689999617665444200122201222024556533211279941 Q gi|254780745|r 45 IHIKADMLEVKDAVQKAFFKGNVFMTQ-EDFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEINVIASNGYVDF 123 (181) Q Consensus 45 i~I~AD~~~~d~~~~~~~~~GNV~i~q-g~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~~~~a~~~~y~~ 123 (181) ..|.||++.+|++++.++++|||.+.. +.+.|.+|++.+..+.+...... . .+.+........|+.++-.- T Consensus 82 ~~l~Ad~v~y~q~t~~~~A~GnV~~~~~~g~~i~gd~a~~~ld~~~g~~~n----~----~~~~~~~~~~~~A~~a~~~d 153 (784) T COG1452 82 YRLLADKVDYDQKTDRVTATGNVEYIDGDGILIKGDKADVNLDTKKGFVSN----F----RYQTVGRQGRFVAKSAERRD 153 (784) T ss_pred EEEEEEEEEEECCCCEEEEECCEEEECCCCCEEECCEEEEECCCCCCEEEE----E----EEEECCCEEEEEEEEEEECC T ss_conf 899875899723478899977799982898099875689961457645860----0----89852522579985522148 Q ss_pred CCCEEEEEECCCCEEEEEEC-CCEEEEEEEEEEECCCEEEEEE Q ss_conf 66589999478634999717-9769922999992165899992 Q gi|254780745|r 124 QKRILVLNGDRADKVILKEK-LNTFLGCKLIVNIDTSFASLQG 165 (181) Q Consensus 124 ~~~~~~l~Gna~~~v~l~q~-~~~i~gdki~yn~~tg~~~~~g 165 (181) .+..+|+.-..-.+..++. .=.+++.+|..|.+++.+.+.- T Consensus 154 -~~~~~l~~at~TtC~~~dnp~W~v~A~~i~~~~~~~~~~~~n 195 (784) T COG1452 154 -NDVTILKNATYTTCEPCDNPIWSVKASKIIQNREKQFAVFWN 195 (784) T ss_pred -CEEEEEEEEEEECCCCCCCCCEEEEEEEEEECCCCCEEEEEE T ss_conf -706999510882368889972599964899817785899995 No 11 >pfam06835 DUF1239 Protein of unknown function (DUF1239). This family consists of several hypothetical bacterial proteins of around 190 residues in length. The function of this family is unknown. Probab=98.62 E-value=9.4e-07 Score=55.04 Aligned_cols=82 Identities=22% Similarity=0.270 Sum_probs=45.5 Q ss_pred EEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCEEEEEECC-----CEEE Q ss_conf 59999689999617665444200122201222024556533211279941665899994786349997179-----7699 Q gi|254780745|r 74 FSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEINVIASNGYVDFQKRILVLNGDRADKVILKEKL-----NTFL 148 (181) Q Consensus 74 ~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~~~~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~-----~~i~ 148 (181) .+|+|++++.|-+.+........-...-. ....-.++|+++.++.+.+.+.|.|| |++.+.. ..+. T Consensus 53 ~~l~a~~~~~~~~~~~~~l~~P~~~~~~~-----~~~~w~i~A~~g~~~~~~~~i~L~g~----V~i~~~~~~~~~~~l~ 123 (176) T pfam06835 53 YRLTADKLEHYPDDDTTEFTNPVLIIYQD-----GKPPWTITADRAKLNKDGQTLYLYGN----VVIQSLEDKSRLTTIT 123 (176) T ss_pred EEEEECEEEEECCCCCEEEEEEEEEEECC-----CCCEEEEEECEEEEECCCCEEEEECC----EEEEECCCCCCCEEEE T ss_conf 99990457996589968999639999779-----98369999794999379969999899----9999768998756999 Q ss_pred EEEEEEEECCCEEEEE Q ss_conf 2299999216589999 Q gi|254780745|r 149 GCKLIVNIDTSFASLQ 164 (181) Q Consensus 149 gdki~yn~~tg~~~~~ 164 (181) .+.+.|+.+++.+.-. T Consensus 124 T~~l~~~~~~~~~~td 139 (176) T pfam06835 124 TENLTVNLDTQDASTD 139 (176) T ss_pred EHEEEEECCCCEEEEC T ss_conf 2209999477799948 No 12 >COG1934 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.44 E-value=1.1e-05 Score=49.19 Aligned_cols=87 Identities=15% Similarity=0.080 Sum_probs=68.1 Q ss_pred CEEEEEEEEEEEC--CCC--EEEEEEE----EEEEEC--CEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCC Q ss_conf 4799911699992--388--7999800----899978--75999968999961766544420012220122202455653 Q gi|254780745|r 44 KIHIKADMLEVKD--AVQ--KAFFKGN----VFMTQE--DFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEIN 113 (181) Q Consensus 44 Pi~I~AD~~~~d~--~~~--~~~~~GN----V~i~qg--~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~~ 113 (181) |+.|.||.+++.. +.+ .+++.|| +..... ...=+||+++|+..++ .+...||+.+.+.+.. T Consensus 61 ~~~l~Adkv~v~~~~~~~~~~i~a~gn~~~~~q~~~~~k~~~g~ad~~~Y~~~~~---------~~vLtGna~v~q~gn~ 131 (173) T COG1934 61 TITLRADKVTVTRDKEGGKEVIEANGNPATFSQKVDNGKTQKGTADQAIYDLKND---------TIVLTGNAYVLQLGNV 131 (173) T ss_pred CCEEEEEEEEEECCCCCCCEEEEEECCEEEEEEEECCCCCEEEEEEEEEEEECCC---------EEEEECCEEEEECCCE T ss_conf 6179713699980578985489995466678876507850388851699982078---------9999477999857988 Q ss_pred EEEEEEEEECCCCEEEEEECCCCEEE Q ss_conf 32112799416658999947863499 Q gi|254780745|r 114 VIASNGYVDFQKRILVLNGDRADKVI 139 (181) Q Consensus 114 ~~a~~~~y~~~~~~~~l~Gna~~~v~ 139 (181) ++|++++|+.+++.+...|.+..+|. T Consensus 132 i~G~kIty~~~~~~~~a~~~~~~RV~ 157 (173) T COG1934 132 IRGDKITYLTKTGRAVAESSKGGRVR 157 (173) T ss_pred EEEEEEEEECCCCEEEECCCCCCCEE T ss_conf 87639999943457996288998479 No 13 >PRK04423 organic solvent tolerance protein; Provisional Probab=98.40 E-value=2.1e-06 Score=53.12 Aligned_cols=81 Identities=15% Similarity=0.145 Sum_probs=69.6 Q ss_pred ECCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCEEEEEECCCEEEEE Q ss_conf 78759999689999617665444200122201222024556533211279941665899994786349997179769922 Q gi|254780745|r 71 QEDFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEINVIASNGYVDFQKRILVLNGDRADKVILKEKLNTFLGC 150 (181) Q Consensus 71 qg~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~~~~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~~~i~gd 150 (181) |-...+.||+++..-+ .....|||.+.+++..+.||++.|+..++.++++|| |.+.+++..++|+ T Consensus 57 ~~p~~v~aD~~~~~~~-----------~~~f~GnV~i~qg~~~l~AD~v~ydq~~~~v~A~Gn----V~~~~~~~~~~g~ 121 (798) T PRK04423 57 QLPTDIEGDQLSGTST-----------TPQYQGNVALKRGDQFLGADNLRMDTETGNYIAEGN----VRYQDTSIRMVAD 121 (798) T ss_pred CCCEEEECCCCCCCCC-----------CCEEEEEEEEEECCEEEEEEEEEEECCCCEEEEECC----EEEECCCCEEECC T ss_conf 6985887640235577-----------517985599997998999117999867898999777----8996798589814 Q ss_pred EEEEEECCCEEEEEEC Q ss_conf 9999921658999928 Q gi|254780745|r 151 KLIVNIDTSFASLQGC 166 (181) Q Consensus 151 ki~yn~~tg~~~~~g~ 166 (181) ++.+|++++...+... T Consensus 122 ~~~~~l~~~~g~~~~~ 137 (798) T PRK04423 122 RAEGNQDTDTHKITNI 137 (798) T ss_pred EEEEECCCCEEEEECC T ss_conf 2899826764799667 No 14 >PRK10893 hypothetical protein; Provisional Probab=98.33 E-value=5.8e-05 Score=45.37 Aligned_cols=118 Identities=16% Similarity=0.211 Sum_probs=93.9 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEEECC-----EEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCC-----CCCCE Q ss_conf 79991169999238879998008999787-----599996899996176654442001222012220245-----56533 Q gi|254780745|r 45 IHIKADMLEVKDAVQKAFFKGNVFMTQED-----FSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQS-----GEINV 114 (181) Q Consensus 45 i~I~AD~~~~d~~~~~~~~~GNV~i~qg~-----~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~-----~~~~~ 114 (181) .+++||+++.-...+...|+--|.....+ =.++|++.+++-+ ..+...|||.+.. .-+.. T Consensus 60 y~i~A~~~eHY~~~~~T~F~~Pvl~ly~~~~~~~W~VsA~~a~L~~d----------~~L~L~gnV~i~~L~~~s~iq~I 129 (190) T PRK10893 60 YRLIAQHVEYYSDQAVSWFTQPVLTTFDKDKVPTWSVKADKAKLTND----------RMLYLYGHVEVNALTPDSQLRRI 129 (190) T ss_pred EEEEECHHHHCCCCCCEEEECCEEEEECCCCCCCEEEEECEEEECCC----------CEEEEECCEEEEECCCCCCCCEE T ss_conf 89997142013137958981865999889995327999456798648----------88999777999853897545389 Q ss_pred EEEEEEEECCCCEEEEEECCCCEEEEEECCCEEEEEEEEEEECCCEEEEEECCCCEEEEEEECCCC Q ss_conf 211279941665899994786349997179769922999992165899992899809999811876 Q gi|254780745|r 115 IASNGYVDFQKRILVLNGDRADKVILKEKLNTFLGCKLIVNIDTSFASLQGCESDQVQSIIRYDGR 180 (181) Q Consensus 115 ~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~~~i~gdki~yn~~tg~~~~~g~~~~rV~~i~~~~~r 180 (181) ..+.+..+..++.+.=. ..|.+...+-..+|.+|.-|++++++++. +.|++.+++..+ T Consensus 130 ~T~~l~idL~tqd~~SD----~~VtI~G~~f~s~G~gm~anL~~~~a~L~----~~Vkt~YEi~~~ 187 (190) T PRK10893 130 TTDNAQINLVTQDVTSD----DLVTLYGTTFNSSGLKMRGNLRSKNAELI----EKVRTSYEIQNK 187 (190) T ss_pred ECCCEEEECCCCEEECC----CEEEEECCCCEEEEEEEEEEHHHCEEEEH----HHCCEEEECCCC T ss_conf 82758980466468638----70999879839986108999365888904----506199963600 No 15 >pfam03968 OstA OstA-like protein. This family of proteins are mostly uncharacterized. However the family does include E. coli OstA that has been characterized as an organic solvent tolerance protein. Probab=98.28 E-value=1.7e-05 Score=48.28 Aligned_cols=86 Identities=17% Similarity=0.159 Sum_probs=57.4 Q ss_pred CCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEEEEECCC-----CEEEEEECCCCE--EEEEECC Q ss_conf 875999968999961766544420012220122202455653321127994166-----589999478634--9997179 Q gi|254780745|r 72 EDFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEINVIASNGYVDFQK-----RILVLNGDRADK--VILKEKL 144 (181) Q Consensus 72 g~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~~~~a~~~~y~~~~-----~~~~l~Gna~~~--v~l~q~~ 144 (181) ..+.|+||.+++..+.+ .....|||.+.+++..++||+++..... ..+...|++... +.+.++. T Consensus 26 ~PI~I~AD~~~~d~~~~---------~~~~~GnV~i~QG~~~i~ad~~~i~~~~~~~~~~~~~a~g~~~~~~~~~~~~~~ 96 (141) T pfam03968 26 QPIEIEADNLEADDKKG---------VATFTGNVVVTQGSITLRADKVVVTQDKKSNGIERLEATGPATFRQKVDLVGRQ 96 (141) T ss_pred CCEEEEEEEEEEECCCC---------EEEEECCEEEEECCEEEEEEEEEEEECCCCCCEEEEEEECCEEEEEEEECCCCC T ss_conf 99899945799982799---------999987499980870999648999986888967999997760499986148886 Q ss_pred CEEEEEEEEEEECCCEEEEEEC Q ss_conf 7699229999921658999928 Q gi|254780745|r 145 NTFLGCKLIVNIDTSFASLQGC 166 (181) Q Consensus 145 ~~i~gdki~yn~~tg~~~~~g~ 166 (181) -.-+|++++|+..++...++|+ T Consensus 97 ~~a~a~~~~Y~~~~~~i~L~Gn 118 (141) T pfam03968 97 IRGTADRAEYDSKKRIIVLTGN 118 (141) T ss_pred EEEEEEEEEEEECCCEEEEECC T ss_conf 7999718999916999999733 No 16 >TIGR03002 outer_YhbN cell envelope biogenesis protein YhbN; InterPro: IPR014340 Members of this entry include YhbN, which is an essential protein in Escherichia coli. It is implicated in cell envelope integrity, and has numerous homologues in other Proteobacteria in a conserved location near lipopolysaccharide inner core biosynthesis genes. This entry is distantly related to the organic solvent tolerance protein (OstA).. Probab=98.14 E-value=4.8e-05 Score=45.79 Aligned_cols=82 Identities=22% Similarity=0.228 Sum_probs=49.7 Q ss_pred EEEEEEEEEEEC--CC---CEEEEEEE--EEEEE----CC--EEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCC Q ss_conf 799911699992--38---87999800--89997----87--59999689999617665444200122201222024556 Q gi|254780745|r 45 IHIKADMLEVKD--AV---QKAFFKGN--VFMTQ----ED--FSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGE 111 (181) Q Consensus 45 i~I~AD~~~~d~--~~---~~~~~~GN--V~i~q----g~--~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~ 111 (181) +.|.||.+.+.+ .. ..++++|| |...| |+ +.-+|++++|+..++ .+...||..+.+++ T Consensus 39 ~~I~ADkv~v~~~~~~~~~~~~~A~G~~PA~F~q~~d~g~~~~~g~A~~i~Y~~~~~---------~~~Ltg~A~v~Q~~ 109 (150) T TIGR03002 39 LKIRADKVVVTRPNDGGGIEKATATGNKPATFRQKLDDGKEYIEGQANRIEYDPAKD---------TVVLTGNARVKQGG 109 (150) T ss_pred EEEECCEEEEEECCCCCCEEEEEEECCCCEEEEEEECCCCEEEEEECCEEEEECCCC---------EEEEECCEEEEECC T ss_conf 598522889882688786089999807818988995599827999711899932577---------99991234997889 Q ss_pred -CCEEEEEEEEECCCCEEEEEECCC Q ss_conf -533211279941665899994786 Q gi|254780745|r 112 -INVIASNGYVDFQKRILVLNGDRA 135 (181) Q Consensus 112 -~~~~a~~~~y~~~~~~~~l~Gna~ 135 (181) ...+|+.++||+.++.+.-.|... T Consensus 110 k~~~~G~~I~Y~~~~q~~~a~~~~~ 134 (150) T TIGR03002 110 KNEVSGDVITYDTKKQKVTAKGGGG 134 (150) T ss_pred CCEEECCEEEEECEEEEEEEEECCC T ss_conf 7778766889844012998980788 No 17 >PRK10894 hypothetical protein; Provisional Probab=97.83 E-value=8.8e-05 Score=44.35 Aligned_cols=86 Identities=17% Similarity=0.158 Sum_probs=46.4 Q ss_pred CEEEEEEEEEEECC---C--CEEEEEEE-EEEEE----C-CEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCC Q ss_conf 47999116999923---8--87999800-89997----8-7599996899996176654442001222012220245565 Q gi|254780745|r 44 KIHIKADMLEVKDA---V--QKAFFKGN-VFMTQ----E-DFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEI 112 (181) Q Consensus 44 Pi~I~AD~~~~d~~---~--~~~~~~GN-V~i~q----g-~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~ 112 (181) .+.|.||.+++... . ..++..|| +...| | .+.-+|++++|.... ..+...||..+.++.. T Consensus 64 sl~I~ADkv~v~~~~~~~~~~~i~a~G~pA~f~Q~~~~g~~i~a~A~~i~Y~~~~---------~~i~L~GnA~L~Q~~~ 134 (184) T PRK10894 64 TIKINADKVVVTRPGGEQGKEVIDGYGNPATFYQMQDNGKPVKGHASKMHYELAK---------DFVVLTGNAYLEQLDS 134 (184) T ss_pred EEEEEEEEEEEEECCCCCCEEEEEEECCCEEEEEECCCCCCEEEEEEEEEEECCC---------CEEEEECCEEEEECCC T ss_conf 5999935899997699753799999689379999656898137697489998269---------9999976589996899 Q ss_pred CEEEEEEEEECCCCEEEEEECCCCEE Q ss_conf 33211279941665899994786349 Q gi|254780745|r 113 NVIASNGYVDFQKRILVLNGDRADKV 138 (181) Q Consensus 113 ~~~a~~~~y~~~~~~~~l~Gna~~~v 138 (181) ..+|++++||..++.+...|+...+| T Consensus 135 ~v~G~~I~Ydl~~~~~~a~s~~~~RV 160 (184) T PRK10894 135 NIKGDKITYLVKEQKMQAFSDKGKRV 160 (184) T ss_pred EEEECEEEEEECCCEEEEECCCCCEE T ss_conf 89937899990667999967999807 No 18 >COG3117 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.31 E-value=0.0034 Score=35.74 Aligned_cols=122 Identities=14% Similarity=0.128 Sum_probs=75.0 Q ss_pred CCCEEEEEEEEEEECCCCEEEEEEEEEEEEC----CEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCC-----C-CC Q ss_conf 7847999116999923887999800899978----759999689999617665444200122201222024-----5-56 Q gi|254780745|r 42 NEKIHIKADMLEVKDAVQKAFFKGNVFMTQE----DFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQ-----S-GE 111 (181) Q Consensus 42 ~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~qg----~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~-----~-~~ 111 (181) ..-.+++|+.++....+..+.|+--+..... .-++.||+.++.-+.. +.-.++|... . .. T Consensus 57 ~~~y~l~a~~~eh~~~~~~t~ft~P~l~lf~e~~q~w~v~Ad~a~l~~d~~----------l~l~~~vvv~~~t~d~~q~ 126 (188) T COG3117 57 KLKYRLTAQHVEHYSDSSDTHFTAPVLTLFRENRQTWTVQADRAKLTKDGK----------LYLLGDVVVVALTPDSGQL 126 (188) T ss_pred CEEEEEEHHHHHHCCCCCCEEEECCEEEEECCCCCCEEEEHHHHEECCCCE----------EEEEECEEEEEECCCCCCC T ss_conf 056776323454154566557715359988388762799810202157855----------9987120677514776676 Q ss_pred CCEEEEEEEEECCCCEEEEEECCCCEEEEEECCCEEEEEEEEEEECCCEEEEEECCCCEEEEEEECCCCC Q ss_conf 5332112799416658999947863499971797699229999921658999928998099998118769 Q gi|254780745|r 112 INVIASNGYVDFQKRILVLNGDRADKVILKEKLNTFLGCKLIVNIDTSFASLQGCESDQVQSIIRYDGRP 181 (181) Q Consensus 112 ~~~~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~~~i~gdki~yn~~tg~~~~~g~~~~rV~~i~~~~~r~ 181 (181) ...+.+++..+++++.+.-.- .|.+..+...-+|.++.+|+.++.+...+ .||+.+.+.+.| T Consensus 127 ~~l~T~~~~V~l~tqd~~te~----~V~l~~~~~~~tg~gm~~n~k~~~a~L~~----kV~t~yE~~~~~ 188 (188) T COG3117 127 QRLTTESAQVNLKTQDAQTED----PVTLYGGQFVSTANGMTGNLKTQQANLLS----KVRTRYEVLKAR 188 (188) T ss_pred CEEEEEEEEECCCCCCEECCC----CEEEECCCCCCCCCCEEEEHHHHHHHHHH----CCCCEEEECCCC T ss_conf 458710237613310000377----56674354402320147754655666421----245178724689 No 19 >PRK10893 hypothetical protein; Provisional Probab=96.80 E-value=0.017 Score=32.01 Aligned_cols=111 Identities=13% Similarity=0.068 Sum_probs=64.4 Q ss_pred CCCCCCCCCCEEEEEE--EEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCC Q ss_conf 2244577784799911--69999238879998008999787599996899996176654442001222012220245565 Q gi|254780745|r 35 TRFKVLGNEKIHIKAD--MLEVKDAVQKAFFKGNVFMTQEDFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEI 112 (181) Q Consensus 35 ~~~~~d~~~Pi~I~AD--~~~~d~~~~~~~~~GNV~i~qg~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~ 112 (181) .....|.++|--+..+ +..||. +|.. ..++.||+++.|-..+.-.....+-.+.-.+ ....= T Consensus 31 ~~~~~~~~~P~y~a~~l~t~~y~~-~G~l-----------sy~i~A~~~eHY~~~~~T~F~~Pvl~ly~~~----~~~~W 94 (190) T PRK10893 31 AQVVVNNNDPTYKSEHTDTLVYNP-EGAL-----------SYRLIAQHVEYYSDQAVSWFTQPVLTTFDKD----KVPTW 94 (190) T ss_pred CCCCCCCCCCCEEEECCEEEEECC-CCCE-----------EEEEEECHHHHCCCCCCEEEECCEEEEECCC----CCCCE T ss_conf 556578999966975336888999-9978-----------5899971420131379589818659998899----95327 Q ss_pred CEEEEEEEEECCCCEEEEEECCCCEEEEEE-CCCEEEEEEEEEEECCCEEE Q ss_conf 332112799416658999947863499971-79769922999992165899 Q gi|254780745|r 113 NVIASNGYVDFQKRILVLNGDRADKVILKE-KLNTFLGCKLIVNIDTSFAS 162 (181) Q Consensus 113 ~~~a~~~~y~~~~~~~~l~Gna~~~v~l~q-~~~~i~gdki~yn~~tg~~~ 162 (181) .++|+++..+. ++.+.|.||....-.... .-..|+.+.+.+|+.+..+. T Consensus 95 ~VsA~~a~L~~-d~~L~L~gnV~i~~L~~~s~iq~I~T~~l~idL~tqd~~ 144 (190) T PRK10893 95 SVKADKAKLTN-DRMLYLYGHVEVNALTPDSQLRRITTDNAQINLVTQDVT 144 (190) T ss_pred EEEECEEEECC-CCEEEEECCEEEEECCCCCCCCEEECCCEEEECCCCEEE T ss_conf 99945679864-888999777999853897545389827589804664686 No 20 >COG3117 Uncharacterized protein conserved in bacteria [Function unknown] Probab=94.42 E-value=0.081 Score=28.31 Aligned_cols=109 Identities=16% Similarity=0.134 Sum_probs=60.6 Q ss_pred CCCCCCCCEEEEEEEEE---EECCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCC Q ss_conf 44577784799911699---992388799980089997875999968999961766544420012220122202455653 Q gi|254780745|r 37 FKVLGNEKIHIKADMLE---VKDAVQKAFFKGNVFMTQEDFSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQSGEIN 113 (181) Q Consensus 37 ~~~d~~~Pi~I~AD~~~---~d~~~~~~~~~GNV~i~qg~~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~~~~~ 113 (181) ....++.|. -+++.++ +|. .|+- +.++.|++++-|.+......+..+-.+... ....-+ T Consensus 33 v~~~~d~p~-Y~~e~~~~~~~de-~G~~-----------~y~l~a~~~eh~~~~~~t~ft~P~l~lf~e-----~~q~w~ 94 (188) T COG3117 33 VRPNPDEPA-YTMEGLDTTVYDE-QGKL-----------KYRLTAQHVEHYSDSSDTHFTAPVLTLFRE-----NRQTWT 94 (188) T ss_pred CCCCCCCCC-EEECCCCEEEECC-CCCE-----------EEEEEHHHHHHCCCCCCEEEECCEEEEECC-----CCCCEE T ss_conf 104999864-0205742367777-7605-----------677632345415456655771535998838-----876279 Q ss_pred EEEEEEEEECCCCEEEEEECCCCEEEEEE--CCCEEEEEEEEEEECCCEEEEE Q ss_conf 32112799416658999947863499971--7976992299999216589999 Q gi|254780745|r 114 VIASNGYVDFQKRILVLNGDRADKVILKE--KLNTFLGCKLIVNIDTSFASLQ 164 (181) Q Consensus 114 ~~a~~~~y~~~~~~~~l~Gna~~~v~l~q--~~~~i~gdki~yn~~tg~~~~~ 164 (181) ++|+++.+.. ++.+.+.|+........+ ....++.+++.+|++++.+..+ T Consensus 95 v~Ad~a~l~~-d~~l~l~~~vvv~~~t~d~~q~~~l~T~~~~V~l~tqd~~te 146 (188) T COG3117 95 VQADRAKLTK-DGKLYLLGDVVVVALTPDSGQLQRLTTESAQVNLKTQDAQTE 146 (188) T ss_pred EEHHHHEECC-CCEEEEEECEEEEEECCCCCCCCEEEEEEEEECCCCCCEECC T ss_conf 9810202157-855998712067751477667645871023761331000037 No 21 >COG5375 Uncharacterized protein conserved in bacteria [Function unknown] Probab=93.41 E-value=0.46 Score=24.20 Aligned_cols=122 Identities=15% Similarity=0.167 Sum_probs=74.6 Q ss_pred CCCCCEEEEEEEEEEECC-CCEEEEEE-EEEEEECC---EEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCC-CCCC Q ss_conf 777847999116999923-88799980-08999787---599996899996176654442001222012220245-5653 Q gi|254780745|r 40 LGNEKIHIKADMLEVKDA-VQKAFFKG-NVFMTQED---FSLQADKMTIDYNNTNRDVSNKINRMDVERNIFIQS-GEIN 113 (181) Q Consensus 40 d~~~Pi~I~AD~~~~d~~-~~~~~~~G-NV~i~qg~---~~i~aD~l~i~~~~~~~~~~~~i~~~~a~gnv~~~~-~~~~ 113 (181) ....|..|+|+++.=|.+ .+.++++| +.++--|+ ..++|..-.+..+.+ .+.....|.+.. +... T Consensus 87 k~~~~ysitA~RAiqD~~n~~~i~Le~i~a~~pvg~~~~a~v~Ak~Gvyd~~~n---------tL~l~~dv~i~~~eGi~ 157 (216) T COG5375 87 KDGRPYSITAERAIQDVKNPNIITLEGILAILPVGEDNPAQVTAKSGVYDRDTN---------TLDLTDDVKITTDEGIR 157 (216) T ss_pred CCCCEEEEEHHHHHHCCCCCCEEEEEEEEEECCCCCCCCEEEEECCCEEECCCC---------EEECCCCEEEECCHHHH T ss_conf 789750330777765026887489960015524799973468712342666787---------13246876885354331 Q ss_pred EEEEEEEEECCCCEEEEEECCCCEEEEEECCCEEEEEEEEEEECCCEEEEEECCCCEEEEEEECC Q ss_conf 32112799416658999947863499971797699229999921658999928998099998118 Q gi|254780745|r 114 VIASNGYVDFQKRILVLNGDRADKVILKEKLNTFLGCKLIVNIDTSFASLQGCESDQVQSIIRYD 178 (181) Q Consensus 114 ~~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~~~i~gdki~yn~~tg~~~~~g~~~~rV~~i~~~~ 178 (181) +..+.+..|..++.+ +|+. .+-+.-++..|+++.+.+--+.....++| +||+.|.+. T Consensus 158 a~~~sA~vDl~~G~~--sg~~--pv~i~~~~gsI~ads~~v~D~G~iivF~g----~Vr~~id~s 214 (216) T COG5375 158 ADFQSAFVDLRTGNM--SGDS--PVDISGKGGSITADSYSVTDKGKIIVFEG----NVRTRIDPS 214 (216) T ss_pred CCCCHHEEEECCCCC--CCCC--CEEECCCCCCEEEEEEEEECCCCEEEEEC----CEEEEECCC T ss_conf 356012466025610--2678--63330799848761367522884899956----568984477 No 22 >PRK11397 dacD D-alanyl-D-alanine carboxypeptidase; Provisional Probab=92.72 E-value=0.33 Score=25.01 Aligned_cols=54 Identities=19% Similarity=0.127 Sum_probs=35.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEE-EEEECCCCEEEEEEEEEEE Q ss_conf 9999999999999877743321000122445777847999116-9999238879998008999 Q gi|254780745|r 9 HLYIHLLALVFVLLKADLSQAAASYMTRFKVLGNEKIHIKADM-LEVKDAVQKAFFKGNVFMT 70 (181) Q Consensus 9 ~~~~~~~~~l~~~~~~~~a~~~~~~~~~~~~d~~~Pi~I~AD~-~~~d~~~~~~~~~GNV~i~ 70 (181) ++++.++++++++..++.+.+ .+...+|-.|.|.+ +-+|..+|.+.|..|..-. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~--------~~~~p~~P~i~A~sailiD~~TG~vLy~kN~d~~ 60 (390) T PRK11397 6 RLIIAASLFVFNLSSAFAAEN--------IPFSPQPPAIDAGSWVLMDYTTGQILTAGNEHQQ 60 (390) T ss_pred HHHHHHHHHHHHHHHHHHCCC--------CCCCCCCCCCCCEEEEEEECCCCCEEEEECCCCC T ss_conf 999999999999999874045--------7789999965533999999999969201292998 No 23 >PRK09918 putative periplasmic chaperone protein; Provisional Probab=83.42 E-value=2.8 Score=19.95 Aligned_cols=19 Identities=16% Similarity=0.462 Sum_probs=7.1 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 8999999999999987774 Q gi|254780745|r 8 IHLYIHLLALVFVLLKADL 26 (181) Q Consensus 8 ~~~~~~~~~~l~~~~~~~~ 26 (181) .++|+..+++++++++.+. T Consensus 3 ~~~~~~f~l~~~l~~~~~a 21 (230) T PRK09918 3 FNIFLLFTLLVLLLSSSAV 21 (230) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 2658999999999998877 No 24 >PRK11385 hypothetical protein; Provisional Probab=78.02 E-value=4.4 Score=18.90 Aligned_cols=22 Identities=23% Similarity=0.229 Sum_probs=14.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9134678899999999999998 Q gi|254780745|r 1 MSAFLNSIHLYIHLLALVFVLL 22 (181) Q Consensus 1 ~~~~~~~~~~~~~~~~~l~~~~ 22 (181) |++..+++.+++++++++++.. T Consensus 10 ~~~~~~r~~~~i~~~~~ll~s~ 31 (245) T PRK11385 10 MTNTWNRLALLIFAVLSLLVAG 31 (245) T ss_pred EEHHHHHHHHHHHHHHHHHHHH T ss_conf 7006899999999999999865 No 25 >PRK10064 colicin I receptor; Provisional Probab=77.97 E-value=1.7 Score=21.19 Aligned_cols=39 Identities=10% Similarity=-0.009 Sum_probs=21.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEE Q ss_conf 67889999999999999877743321000122445777847999116 Q gi|254780745|r 5 LNSIHLYIHLLALVFVLLKADLSQAAASYMTRFKVLGNEKIHIKADM 51 (181) Q Consensus 5 ~~~~~~~~~~~~~l~~~~~~~~a~~~~~~~~~~~~d~~~Pi~I~AD~ 51 (181) |.+|+.|..+.+|+.++.++.-+++ ..+..+||.++|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a--------~~~~~e~vVVTatr 39 (663) T PRK10064 1 MFRLNPFVRVGLCLSAISCAWPVAA--------VDDDGETMVVTASS 39 (663) T ss_pred CCCCCHHHHHHHHHHHHHHCCCHHH--------CCCCCCEEEEECCC T ss_conf 9414678999999999986364012--------12699839992358 No 26 >PRK11669 pbpG D-alanyl-D-alanine endopeptidase; Provisional Probab=76.14 E-value=5 Score=18.61 Aligned_cols=29 Identities=10% Similarity=0.157 Sum_probs=22.3 Q ss_pred CCCEEEEEEE-EEEECCCCEEEEEEEEEEE Q ss_conf 7847999116-9999238879998008999 Q gi|254780745|r 42 NEKIHIKADM-LEVKDAVQKAFFKGNVFMT 70 (181) Q Consensus 42 ~~Pi~I~AD~-~~~d~~~~~~~~~GNV~i~ 70 (181) -.|-+|.|.+ +-+|..+|.+.|..|..-. T Consensus 33 ~~~P~i~A~sail~D~~TG~VLy~kn~d~~ 62 (308) T PRK11669 33 ASQQELASGSAMVVDLNTNKVIYSSNPDLV 62 (308) T ss_pred CCCCCCCCEEEEEEECCCCCEEEEECCCCC T ss_conf 994964540899999999969001582998 No 27 >PRK10001 D-alanyl-D-alanine carboxypeptidase fraction C; Provisional Probab=67.02 E-value=8.2 Score=17.42 Aligned_cols=54 Identities=17% Similarity=0.130 Sum_probs=29.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEE-EEEECCCCEEEEEEEEE Q ss_conf 67889999999999999877743321000122445777847999116-99992388799980089 Q gi|254780745|r 5 LNSIHLYIHLLALVFVLLKADLSQAAASYMTRFKVLGNEKIHIKADM-LEVKDAVQKAFFKGNVF 68 (181) Q Consensus 5 ~~~~~~~~~~~~~l~~~~~~~~a~~~~~~~~~~~~d~~~Pi~I~AD~-~~~d~~~~~~~~~GNV~ 68 (181) -.+++....-.++++++.+.+.+.. ..-+|-.|.|.+ +-+|..+|.+.|..|.. T Consensus 5 ~~~~~~~~~~~a~~~~~~~~~~~~~----------~~~~~P~i~A~sail~D~~TG~VLy~Kn~d 59 (400) T PRK10001 5 SSSLRGLAAGSAFLFLFAPTAFAAE----------QTVEAPSVDARAWILMDYASGKVLAEGNAD 59 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC----------CCCCCCCCCCEEEEEEECCCCCCHHHCCCC T ss_conf 6789888887899999877765140----------589879745318999999999594530939 No 28 >TIGR00247 TIGR00247 conserved hypothetical protein TIGR00247; InterPro: IPR003770 This family contains several aminodeoxychorismate lyase proteins. Aminodeoxychorismate lyase is a pyridoxal 5'-phosphate-dependent enzyme that converts 4-aminodeoxychorismate to pyruvate and p-aminobenzoate, a precursor of folic acid in bacteria . . Probab=61.10 E-value=10 Score=16.85 Aligned_cols=28 Identities=25% Similarity=0.369 Sum_probs=22.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9134678899999999999998777433 Q gi|254780745|r 1 MSAFLNSIHLYIHLLALVFVLLKADLSQ 28 (181) Q Consensus 1 ~~~~~~~~~~~~~~~~~l~~~~~~~~a~ 28 (181) ||.|+|+-++-+..+.+++++.++.+.. T Consensus 7 ~Kk~Lnkkri~i~~~~vll~~~gs~~~y 34 (373) T TIGR00247 7 MKKVLNKKRIIILLLLVLLLIIGSILLY 34 (373) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4556656689999999999999889988 No 29 >pfam07172 GRP Glycine rich protein family. This family of proteins includes several glycine rich proteins as well as two nodulins 16 and 24. The family also contains proteins that are induced in response to various stresses. Probab=60.75 E-value=11 Score=16.76 Aligned_cols=10 Identities=10% Similarity=0.109 Sum_probs=3.8 Q ss_pred CCCCCCCCEE Q ss_conf 4457778479 Q gi|254780745|r 37 FKVLGNEKIH 46 (181) Q Consensus 37 ~~~d~~~Pi~ 46 (181) .+.+.+.+++ T Consensus 32 ~K~e~e~~v~ 41 (91) T pfam07172 32 EKSESENEVQ 41 (91) T ss_pred CCCCCCCCEE T ss_conf 4555424021 No 30 >PRK13483 enterobactin receptor protein; Provisional Probab=60.71 E-value=11 Score=16.76 Aligned_cols=44 Identities=18% Similarity=0.109 Sum_probs=23.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEE Q ss_conf 67889999999999999877743321000122445777847999116 Q gi|254780745|r 5 LNSIHLYIHLLALVFVLLKADLSQAAASYMTRFKVLGNEKIHIKADM 51 (181) Q Consensus 5 ~~~~~~~~~~~~~l~~~~~~~~a~~~~~~~~~~~~d~~~Pi~I~AD~ 51 (181) |++..+..++++.++++++.+.+++++... ....-++|.|+|.+ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~a~a~~~a~~~---~~~~Le~vvVTatr 45 (663) T PRK13483 2 LSRKKYLAQVIAALCLLPAVPVANALADDA---PDKMMETIVVTASG 45 (663) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCC---CCCCCCCEEEECCC T ss_conf 641447999999999977765032334578---65678849994268 No 31 >PRK04517 hypothetical protein; Provisional Probab=55.89 E-value=13 Score=16.29 Aligned_cols=42 Identities=17% Similarity=0.373 Sum_probs=29.6 Q ss_pred EEEEEEEECCCCEEEEEEEEEEEECCE--EEEEEEEEEEECCCC Q ss_conf 911699992388799980089997875--999968999961766 Q gi|254780745|r 48 KADMLEVKDAVQKAFFKGNVFMTQEDF--SLQADKMTIDYNNTN 89 (181) Q Consensus 48 ~AD~~~~d~~~~~~~~~GNV~i~qg~~--~i~aD~l~i~~~~~~ 89 (181) .++++++++.++.++|.=+-.+.+|+. .+++|-+++.++..+ T Consensus 50 ~~~~l~L~~G~hQIv~Ry~~~~~~~~~~~~~~S~piI~~F~a~d 93 (216) T PRK04517 50 SEKTLELPDGENQIVFRYSPYFSQGNDRISVESDVIIATFDAAD 93 (216) T ss_pred CCCEEEECCCCEEEEEEEECEECCCCCEEEEECCCEEEEEECCC T ss_conf 78537857994389999833154899625885697999996566 No 32 >PRK09926 putative chaperone protein EcpD; Provisional Probab=50.52 E-value=16 Score=15.81 Aligned_cols=19 Identities=16% Similarity=0.013 Sum_probs=7.3 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999987774332 Q gi|254780745|r 11 YIHLLALVFVLLKADLSQA 29 (181) Q Consensus 11 ~~~~~~~l~~~~~~~~a~~ 29 (181) |...+++++|++++..+++ T Consensus 6 ~~~~l~~~~~~~~s~~a~A 24 (245) T PRK09926 6 HTTALCFATCMAFSSSSIA 24 (245) T ss_pred HHHHHHHHHHHHHHHHCEE T ss_conf 7999999999998764037 No 33 >PRK13524 outer membrane receptor FepA; Provisional Probab=48.77 E-value=17 Score=15.65 Aligned_cols=31 Identities=19% Similarity=0.001 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCEEEEEEE Q ss_conf 9877743321000122445777847999116 Q gi|254780745|r 21 LLKADLSQAAASYMTRFKVLGNEKIHIKADM 51 (181) Q Consensus 21 ~~~~~~a~~~~~~~~~~~~d~~~Pi~I~AD~ 51 (181) +...+++.+++.+.........++|.++|.+ T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~ivvta~~ 45 (742) T PRK13524 15 LGIYGAAQAAEAEDQAAAPSHDDTIVVTAAE 45 (742) T ss_pred HHHCCCHHHCCCCCCCCCCCCCCEEEEEEEE T ss_conf 8754420103332235677766479995654 No 34 >TIGR02856 spore_yqfC sporulation protein YqfC; InterPro: IPR009877 This family consists of several hypothetical bacterial proteins of around 90 residues in length. The function of this family is unknown.. Probab=48.46 E-value=18 Score=15.62 Aligned_cols=38 Identities=16% Similarity=0.114 Sum_probs=22.6 Q ss_pred EEEECCCCEEEEEECCCEEEEEEEEEEECCCEEEEEEC Q ss_conf 99947863499971797699229999921658999928 Q gi|254780745|r 129 VLNGDRADKVILKEKLNTFLGCKLIVNIDTSFASLQGC 166 (181) Q Consensus 129 ~l~Gna~~~v~l~q~~~~i~gdki~yn~~tg~~~~~g~ 166 (181) ++.|+.+.-+.-.||--.++-+++.++...|...+.|. T Consensus 25 ~~~G~~~i~iENHrGl~~f~~~~vkl~~~~G~i~i~G~ 62 (87) T TIGR02856 25 TLIGKEHIYIENHRGLVVFSPEEVKLNSTNGKITIEGK 62 (87) T ss_pred EEECCEEEEEECCEEEEEECCCEEEEECCCCEEEEEEC T ss_conf 98888589985233168775865898337868999737 No 35 >COG4856 Uncharacterized protein conserved in bacteria [Function unknown] Probab=46.50 E-value=19 Score=15.45 Aligned_cols=10 Identities=60% Similarity=0.733 Sum_probs=7.2 Q ss_pred CCHHHHHHHH Q ss_conf 9134678899 Q gi|254780745|r 1 MSAFLNSIHL 10 (181) Q Consensus 1 ~~~~~~~~~~ 10 (181) |..|+|+--+ T Consensus 1 MdK~lns~W~ 10 (403) T COG4856 1 MDKFLNSPWL 10 (403) T ss_pred CCCHHCCCHH T ss_conf 9612037376 No 36 >PRK02889 tolB translocation protein TolB; Provisional Probab=46.17 E-value=17 Score=15.78 Aligned_cols=24 Identities=8% Similarity=0.058 Sum_probs=14.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 467889999999999999877743 Q gi|254780745|r 4 FLNSIHLYIHLLALVFVLLKADLS 27 (181) Q Consensus 4 ~~~~~~~~~~~~~~l~~~~~~~~a 27 (181) -|.+|++.+++++|||++...+-| T Consensus 3 ~~~~l~~~~~~~~~l~~~~~~A~A 26 (430) T PRK02889 3 LMTKLGLRALVASCLIAAGGAAHA 26 (430) T ss_pred HHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 799899999999999862102027 No 37 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=43.23 E-value=21 Score=15.16 Aligned_cols=26 Identities=15% Similarity=0.087 Sum_probs=19.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 91346788999999999999987774 Q gi|254780745|r 1 MSAFLNSIHLYIHLLALVFVLLKADL 26 (181) Q Consensus 1 ~~~~~~~~~~~~~~~~~l~~~~~~~~ 26 (181) |+..|+++.+-+..+++++.+..|+. T Consensus 1 m~~~MKK~~l~~~~~~~~l~LaaCss 26 (298) T PRK04405 1 MKKKMKKWALAAASAGLLLSLAGCSS 26 (298) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 90678999999999999999987179 No 38 >PRK13513 putative outer membrane receptor; Provisional Probab=42.38 E-value=22 Score=15.09 Aligned_cols=45 Identities=9% Similarity=0.063 Sum_probs=22.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEE Q ss_conf 9134678899999999999998777433210001224457778479991169 Q gi|254780745|r 1 MSAFLNSIHLYIHLLALVFVLLKADLSQAAASYMTRFKVLGNEKIHIKADML 52 (181) Q Consensus 1 ~~~~~~~~~~~~~~~~~l~~~~~~~~a~~~~~~~~~~~~d~~~Pi~I~AD~~ 52 (181) |.-.-.++++.+..++++++++++...++. .....++|.++|.+. T Consensus 1 ~~~~~~~~~~~~~~a~~~~~~~~~~~~aa~-------~~~~le~ivVTAtr~ 45 (660) T PRK13513 1 MAKVNQRFRKTHSAALVIAAVISSQGYAAE-------KTTPTDTMVVTASGF 45 (660) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCC-------CCCCCCEEEEEECCC T ss_conf 930555443799999999998756886376-------788987299970487 No 39 >pfam07241 consensus Probab=36.45 E-value=28 Score=14.56 Aligned_cols=41 Identities=10% Similarity=0.218 Sum_probs=29.4 Q ss_pred EEEEEE-EEEEECCCCEEEEEEEEEEEE---CCEEEEEEEEEEEE Q ss_conf 799911-699992388799980089997---87599996899996 Q gi|254780745|r 45 IHIKAD-MLEVKDAVQKAFFKGNVFMTQ---EDFSLQADKMTIDY 85 (181) Q Consensus 45 i~I~AD-~~~~d~~~~~~~~~GNV~i~q---g~~~i~aD~l~i~~ 85 (181) |++..+ ++.+..-.+...|+.+.+-.+ |...+.+..+.+.+ T Consensus 25 It~iG~~ei~IENhrGIl~y~~~~Iri~~~~g~i~I~G~~L~I~~ 69 (87) T pfam07241 25 ITVIGKIHIYIENHRGILEFSDNEIRLNLKVGQIIIKGENFVIKF 69 (87) T ss_pred EEEECCCEEEEECCCEEEEECCCEEEEECCCEEEEEECCCEEEEE T ss_conf 999878489998560499987889999868579999953499999 No 40 >PRK06518 hypothetical protein; Provisional Probab=34.53 E-value=30 Score=14.38 Aligned_cols=48 Identities=19% Similarity=0.180 Sum_probs=29.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEECCE-EE Q ss_conf 1346788999999999999987774332100012244577784799911699992388799980089997875-99 Q gi|254780745|r 2 SAFLNSIHLYIHLLALVFVLLKADLSQAAASYMTRFKVLGNEKIHIKADMLEVKDAVQKAFFKGNVFMTQEDF-SL 76 (181) Q Consensus 2 ~~~~~~~~~~~~~~~~l~~~~~~~~a~~~~~~~~~~~~d~~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~qg~~-~i 76 (181) +++.+++.+++.+.+.+.++..... +.++|+ .-+|.|-|.++.|+. +| T Consensus 5 ~~~~~~i~~~~~~~~~~~~~~~~~~-------------~~~~~~--------------~~i~~Gra~V~dG~T~~i 53 (192) T PRK06518 5 AAYSDKIALSDIVAAPLEVYKPKAS-------------VASQPL--------------VVIFHGRAQVTSGVTFKL 53 (192) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCC-------------CCCCCC--------------CCEEEEEEEEEECCEEEE T ss_conf 8887431099999887998614532-------------125776--------------602551489963878998 No 41 >PRK10626 hypothetical protein; Provisional Probab=33.35 E-value=31 Score=14.28 Aligned_cols=43 Identities=19% Similarity=0.319 Sum_probs=32.2 Q ss_pred CCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEE-ECCEEEEEEEEEEEEC Q ss_conf 457778479991169999238879998008999-7875999968999961 Q gi|254780745|r 38 KVLGNEKIHIKADMLEVKDAVQKAFFKGNVFMT-QEDFSLQADKMTIDYN 86 (181) Q Consensus 38 ~~d~~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~-qg~~~i~aD~l~i~~~ 86 (181) .......|.|+++++++.. ..||..|. +|+..+.+..+.+.-. T Consensus 25 ~V~pqdDIiItpq~VqV~g------ang~l~I~pdG~l~~nGk~v~Ls~~ 68 (239) T PRK10626 25 SVTPRDDVIISPQTVQVKG------ENGNLVITPDGNVMRNGKQYTLNAA 68 (239) T ss_pred CCCCCCCEEECCCEEEEEC------CCCCEEECCCCCEEECCEEEECCHH T ss_conf 8887877667576379971------7786788789876887876301899 No 42 >TIGR01626 ytfJ_HI0045 conserved hypothetical protein YtfJ-family, TIGR01626; InterPro: IPR006513 These are sequences from gamma proteobacteria that are related to the E. coli protein, YtfJ. . Probab=31.53 E-value=26 Score=14.72 Aligned_cols=58 Identities=16% Similarity=0.148 Sum_probs=33.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEE Q ss_conf 99999999987774332100012244577784799911699992388799980089997 Q gi|254780745|r 13 HLLALVFVLLKADLSQAAASYMTRFKVLGNEKIHIKADMLEVKDAVQKAFFKGNVFMTQ 71 (181) Q Consensus 13 ~~~~~l~~~~~~~~a~~~~~~~~~~~~d~~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~q 71 (181) .++++++..+.++==....+..++..-...-+|.+.-+...| ++=+.|.+-|.|++.+ T Consensus 8 a~~~~~~p~~a~AHNL~v~q~vP~Vgv~~~GEivL~~~~~~y-q~W~SA~LaGKVRV~~ 65 (184) T TIGR01626 8 ALLLVLLPILAFAHNLEVEQSVPAVGVEERGEIVLSDDDISY-QKWRSAELAGKVRVVL 65 (184) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEECCCCCCC-CCCCCHHHHHHHHHHH T ss_conf 999999877565420124885897111354016752784001-5874155641577656 No 43 >PRK10871 nlpD lipoprotein NlpD; Provisional Probab=30.60 E-value=35 Score=14.02 Aligned_cols=76 Identities=13% Similarity=0.189 Sum_probs=35.5 Q ss_pred EEECCCCCCCCCCCCEEEEEEEEECCCCEEEEEECCCCEEEEEECCCEEEEEEEEEEECCCEEEEEECCCCEEEEEEECC Q ss_conf 22012220245565332112799416658999947863499971797699229999921658999928998099998118 Q gi|254780745|r 99 MDVERNIFIQSGEINVIASNGYVDFQKRILVLNGDRADKVILKEKLNTFLGCKLIVNIDTSFASLQGCESDQVQSIIRYD 178 (181) Q Consensus 99 ~~a~gnv~~~~~~~~~~a~~~~y~~~~~~~~l~Gna~~~v~l~q~~~~i~gdki~yn~~tg~~~~~g~~~~rV~~i~~~~ 178 (181) ..+.|.|........+.|.-+.....++.+...+... ...+..+...-.|++|-.-=.+|. ..-++..-|+.+ T Consensus 289 Aaa~G~VvyaG~~l~gyGnliiIkH~~~~lsaYahn~-~~lVkeg~~V~~Gq~Ia~~G~tg~------~~~~LHFEiR~~ 361 (374) T PRK10871 289 ATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHND-TMLVREQQEVKAGQKIATMGSTGT------SSTRLHFEIRYK 361 (374) T ss_pred ECCCEEEEEECCCCCCCCEEEEEECCCCCEEEEECCC-CCCCCCCCEECCCCEEEEECCCCC------CCCEEEEEEEEC T ss_conf 5378389992478787762899984998678616766-266788899889998986437899------983089999779 Q ss_pred CCC Q ss_conf 769 Q gi|254780745|r 179 GRP 181 (181) Q Consensus 179 ~r~ 181 (181) ||| T Consensus 362 g~p 364 (374) T PRK10871 362 GKS 364 (374) T ss_pred CEE T ss_conf 986 No 44 >COG3054 Predicted transcriptional regulator [General function prediction only] Probab=29.98 E-value=36 Score=13.96 Aligned_cols=26 Identities=19% Similarity=0.239 Sum_probs=14.1 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEEE Q ss_conf 799911699992388799980089997 Q gi|254780745|r 45 IHIKADMLEVKDAVQKAFFKGNVFMTQ 71 (181) Q Consensus 45 i~I~AD~~~~d~~~~~~~~~GNV~i~q 71 (181) +.+.-|...| +.=+.+.+-|.|++.| T Consensus 40 l~l~~~~~~y-~~W~SAqL~GKvRV~~ 65 (184) T COG3054 40 LVLDKDQFSY-KTWNSAQLVGKVRVLQ 65 (184) T ss_pred EEECCCCEEE-CCCCHHHCCCHHHHHH T ss_conf 8862762001-2452654044032233 No 45 >PRK10793 D-alanyl-D-alanine carboxypeptidase fraction A; Provisional Probab=28.93 E-value=37 Score=13.85 Aligned_cols=28 Identities=14% Similarity=0.003 Sum_probs=18.0 Q ss_pred CCCCEEEEEEE-EEEECCCCEEEEEEEEE Q ss_conf 77847999116-99992388799980089 Q gi|254780745|r 41 GNEKIHIKADM-LEVKDAVQKAFFKGNVF 68 (181) Q Consensus 41 ~~~Pi~I~AD~-~~~d~~~~~~~~~GNV~ 68 (181) -.+|=+|+|.+ +-+|..+|.+.|.-|.. T Consensus 38 ~p~~P~i~A~sailiD~~TG~VLy~kNad 66 (403) T PRK10793 38 IPGVPQIDAESYILIDYNSGKVLAEQNAD 66 (403) T ss_pred CCCCCCCCCEEEEEEECCCCCCHHHCCCC T ss_conf 79989755419999999999491310949 No 46 >TIGR00383 corA magnesium and cobalt transport protein CorA; InterPro: IPR004488 The CorA transport system is the primary Mg2+ influx system of Salmonella typhimurium and Escherichia coli. It has an unusual membrane topology, with a large, soluble, highly charged periplasmic N-terminal domain with three transmembrane segments in a shorter, hydrophobic C-terminal domain. It has been suggested that the CorA Mg2+ transport system forms the major Mg2+ uptake system in the bacteria and archaea but that some family members may have a function other than Mg2+ transport .; GO: 0015087 cobalt ion transmembrane transporter activity, 0015095 magnesium ion transmembrane transporter activity, 0006824 cobalt ion transport, 0015693 magnesium ion transport, 0016020 membrane. Probab=27.16 E-value=40 Score=13.68 Aligned_cols=25 Identities=20% Similarity=0.254 Sum_probs=12.8 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1346788999999999999987774 Q gi|254780745|r 2 SAFLNSIHLYIHLLALVFVLLKADL 26 (181) Q Consensus 2 ~~~~~~~~~~~~~~~~l~~~~~~~~ 26 (181) +.+||++++++.++...|..+.+.. T Consensus 267 s~~mN~ImK~lTvvs~iFiPlTfIA 291 (339) T TIGR00383 267 SNKMNEIMKILTVVSTIFIPLTFIA 291 (339) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 7988689999999999986024520 No 47 >COG2968 Uncharacterized conserved protein [Function unknown] Probab=24.58 E-value=45 Score=13.41 Aligned_cols=14 Identities=21% Similarity=0.221 Sum_probs=6.8 Q ss_pred EEEECC-CCEEEEEE Q ss_conf 799416-65899994 Q gi|254780745|r 119 GYVDFQ-KRILVLNG 132 (181) Q Consensus 119 ~~y~~~-~~~~~l~G 132 (181) -.|+.. ++.-+|+| T Consensus 103 P~Y~y~~~~~~~ltG 117 (243) T COG2968 103 PQYDYPENGEPELTG 117 (243) T ss_pred CCEECCCCCCCEEEE T ss_conf 304147789834899 No 48 >PRK13684 Ycf48-like protein; Provisional Probab=24.42 E-value=46 Score=13.39 Aligned_cols=65 Identities=17% Similarity=0.144 Sum_probs=36.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEEEE--EE---EECCCCEEEEEEEEEEE Q ss_conf 913467889999999999999877743321000122445777847999116--99---99238879998008999 Q gi|254780745|r 1 MSAFLNSIHLYIHLLALVFVLLKADLSQAAASYMTRFKVLGNEKIHIKADM--LE---VKDAVQKAFFKGNVFMT 70 (181) Q Consensus 1 ~~~~~~~~~~~~~~~~~l~~~~~~~~a~~~~~~~~~~~~d~~~Pi~I~AD~--~~---~d~~~~~~~~~GNV~i~ 70 (181) |++++.++.-+++++++++++..|..... +.....+.+.|.+.-|. +. .|.+.+-++=+-.+++. T Consensus 1 m~~~~~~~~~l~l~~~~~~~~~~c~~~~~-----p~~~~~pw~~i~l~t~~~~~dI~F~d~~hGwlVGs~~tIle 70 (333) T PRK13684 1 MKRLLKSLLNLLLLLALGLVLSGCSTTRV-----PMASSSPWQVIDLPTEANPLDIAFTDPNHGWLVGSNRTLLE 70 (333) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCCC-----CCCCCCCCEEEECCCCCCEEEEEEECCCCCEEECCCCEEEE T ss_conf 92688889999999999986322356677-----65578987677057778254379846997779736865999 No 49 >pfam12046 DUF3529 Protein of unknown function (DUF3529). This family of proteins is functionally uncharacterized. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 176 to 190 amino acids in length. Probab=24.14 E-value=46 Score=13.36 Aligned_cols=18 Identities=17% Similarity=0.039 Sum_probs=12.3 Q ss_pred EEECCCCEEEEEEEEEEE Q ss_conf 999238879998008999 Q gi|254780745|r 53 EVKDAVQKAFFKGNVFMT 70 (181) Q Consensus 53 ~~d~~~~~~~~~GNV~i~ 70 (181) ..|-..++.+|.|+|.-. T Consensus 62 ~~d~~~~~itFeG~V~~S 79 (173) T pfam12046 62 GGDAEGEVITFEGIVAPS 79 (173) T ss_pred CCCCCCCEEEEEEEECCC T ss_conf 577556789998785475 No 50 >cd00982 gltB_C gltb_C. This domain is found at the C-terminus of the large subunit (gltB) of glutamate synthase (GltS). GltS encodes a complex iron-sulfur flavoprotein that catalyzes the synthesis of L-glutamate from L-glutamine and 2-oxoglutarate. It requires the transfer of ammonia and electrons among three distinct active centers that carry out L-Gln hydrolysis, conversion of 2-oxoglutarate into L-Glu, and electron uptake from a donor. These catalytic sites appear to occur in other domains within the protein, and not the domain in this CD. This particular domain has no known function, but it likely has a structural role as it interacts with the amidotransferase and FMN-binding domains of gltS. Probab=24.11 E-value=46 Score=13.35 Aligned_cols=47 Identities=15% Similarity=0.292 Sum_probs=30.3 Q ss_pred CCCCCEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEEECC Q ss_conf 777847999116999923887999800899978759999689999617 Q gi|254780745|r 40 LGNEKIHIKADMLEVKDAVQKAFFKGNVFMTQEDFSLQADKMTIDYNN 87 (181) Q Consensus 40 d~~~Pi~I~AD~~~~d~~~~~~~~~GNV~i~qg~~~i~aD~l~i~~~~ 87 (181) ++.+|++++..--..|+.-+ +.+.|-+.-..|+.-|.-|.+.+.+.- T Consensus 15 ~~~~~v~~~~~I~N~dRsvG-t~ls~~I~k~yG~~gLp~~ti~i~~~G 61 (251) T cd00982 15 ENGEPVTLEYPIRNTDRAVG-TMLSGEIAKRYGEEGLPEDTIKIKFEG 61 (251) T ss_pred HCCCEEEEEEEEEECCCCHH-HHHHHHHHHHHCCCCCCCCEEEEEEEC T ss_conf 55982999986680675675-889999999957348998759999876 No 51 >PRK01904 hypothetical protein; Provisional Probab=23.62 E-value=47 Score=13.30 Aligned_cols=42 Identities=14% Similarity=0.297 Sum_probs=26.6 Q ss_pred EEEEEEEECCC-CEEEEEEEEEEEECC--EEEEEEEEEEEECCCC Q ss_conf 91169999238-879998008999787--5999968999961766 Q gi|254780745|r 48 KADMLEVKDAV-QKAFFKGNVFMTQED--FSLQADKMTIDYNNTN 89 (181) Q Consensus 48 ~AD~~~~d~~~-~~~~~~GNV~i~qg~--~~i~aD~l~i~~~~~~ 89 (181) .++++++++.. +.++|.=+-.+.+|+ ....+|-+.+.++.+. T Consensus 47 ~~~sl~L~~G~~HQIvfRy~~~v~~~~~~~~~~S~p~IvtF~a~~ 91 (217) T PRK01904 47 KAKSFNVDDTQVHQVVVRVSEIVRSGSDRSLFESDPIIVTFQGTT 91 (217) T ss_pred CCCCEECCCCCCEEEEEEEEEEECCCCCEEEEECCCEEEEEECCC T ss_conf 776156379970489999802674799644895688999995588 No 52 >pfam07873 YabP YabP family. This family of proteins is involved in spore coat assembly during the process of sporulation. Probab=23.40 E-value=48 Score=13.28 Aligned_cols=32 Identities=13% Similarity=0.185 Sum_probs=11.0 Q ss_pred EEEECCCCEEEEEEEEEEE---ECCEEEEEEEEEE Q ss_conf 9999238879998008999---7875999968999 Q gi|254780745|r 52 LEVKDAVQKAFFKGNVFMT---QEDFSLQADKMTI 83 (181) Q Consensus 52 ~~~d~~~~~~~~~GNV~i~---qg~~~i~aD~l~i 83 (181) +.+..-.++..|.-+.+.. +|.+.|.+..+.+ T Consensus 12 l~Ien~~~I~~y~~~~I~l~t~~G~l~I~G~~L~I 46 (66) T pfam07873 12 LYIENVKGVISFDEEEIRLETKQGVLIIKGENLHI 46 (66) T ss_pred EEEECCCCEEEECCCEEEEEECCEEEEEECCCEEE T ss_conf 99988305899988899999285799999434798 No 53 >pfam10661 EssA Firmicute eSAT-6 protein secretion system EssA. The Wss (WXG100 protein secretion system) in S. aureus seems to be encoded by a locus of eight ORFs, called ess (eSAT-6 secretion system). This locus encodes, amongst several other proteins, EssA, a protein predicted to possess one transmembrane domain. Due to its predicted membrane location and its absolute requirement for WXG100 protein secretion, it has been speculated that EssA could form a secretion apparatus in conjunction with YukC and YukAB. Proteins homologous to EssA, YukC, EsaA and YukD were absent from mycobacteria. Probab=22.80 E-value=39 Score=13.77 Aligned_cols=14 Identities=29% Similarity=0.470 Sum_probs=6.4 Q ss_pred CCCCEEEEEEEEEE Q ss_conf 77847999116999 Q gi|254780745|r 41 GNEKIHIKADMLEV 54 (181) Q Consensus 41 ~~~Pi~I~AD~~~~ 54 (181) .+-.++|.-|+++- T Consensus 32 ~~Gkmd~KtDRlq~ 45 (166) T pfam10661 32 DDGKMQFKVDRLQK 45 (166) T ss_pred CCCCEEEEHHHHHH T ss_conf 78854332487776 No 54 >pfam05984 Cytomega_UL20A Cytomegalovirus UL20A protein. This family consists of several Cytomegalovirus UL20A proteins. UL20A is thought to be a glycoprotein. Probab=22.36 E-value=50 Score=13.16 Aligned_cols=17 Identities=12% Similarity=0.153 Sum_probs=12.4 Q ss_pred EEEEEEEEEEECCEEEE Q ss_conf 99980089997875999 Q gi|254780745|r 61 AFFKGNVFMTQEDFSLQ 77 (181) Q Consensus 61 ~~~~GNV~i~qg~~~i~ 77 (181) .+..|||...||.-.-. T Consensus 44 ~T~~~~~~~sq~G~t~D 60 (102) T pfam05984 44 ITLQGELKESQDGETPD 60 (102) T ss_pred CCCCCCEEECCCCCCCC T ss_conf 44066446647887678 No 55 >PRK00378 nucleoid-associated protein NdpA; Validated Probab=22.09 E-value=51 Score=13.13 Aligned_cols=23 Identities=13% Similarity=0.273 Sum_probs=14.2 Q ss_pred CCEEEEEEEEECCCCEEEEEECC Q ss_conf 53321127994166589999478 Q gi|254780745|r 112 INVIASNGYVDFQKRILVLNGDR 134 (181) Q Consensus 112 ~~~~a~~~~y~~~~~~~~l~Gna 134 (181) ..+-|+++.||+.+++++++|=+ T Consensus 298 ~~lLg~~I~yD~~~dtLtIk~~P 320 (334) T PRK00378 298 AHLLGERIFYDPATDTLTIKGTP 320 (334) T ss_pred HHHCCCCEEECCCCCEEEEECCC T ss_conf 57679717863889979980899 No 56 >pfam04245 NA37 37-kD nucleoid-associated bacterial protein. Probab=21.42 E-value=53 Score=13.05 Aligned_cols=22 Identities=14% Similarity=0.265 Sum_probs=13.0 Q ss_pred CCEEEEEEEEECCCCEEEEEEC Q ss_conf 5332112799416658999947 Q gi|254780745|r 112 INVIASNGYVDFQKRILVLNGD 133 (181) Q Consensus 112 ~~~~a~~~~y~~~~~~~~l~Gn 133 (181) ....|+++.||+.+.+++++|= T Consensus 301 ~~llge~I~yD~~~dtLtIk~~ 322 (334) T pfam04245 301 SELLGERVFYDPETDTLTIKGL 322 (334) T ss_pred HHHCCCCEEECCCCCEEEEECC T ss_conf 6777984587487897898089 No 57 >pfam10916 DUF2712 Protein of unknown function (DUF2712). This family of proteins with unknown function appear to be restricted to Bacillales. Probab=20.76 E-value=54 Score=12.97 Aligned_cols=27 Identities=15% Similarity=0.235 Sum_probs=19.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 913467889999999999999877743 Q gi|254780745|r 1 MSAFLNSIHLYIHLLALVFVLLKADLS 27 (181) Q Consensus 1 ~~~~~~~~~~~~~~~~~l~~~~~~~~a 27 (181) |+.|+.+--.|++.+++..+++.++.. T Consensus 1 ~~kf~kkn~r~~la~a~Gi~~~a~s~f 27 (146) T pfam10916 1 IKKFAKKNFRFILAAAIGILLFAPSFF 27 (146) T ss_pred CCHHHHHHHHHHHHHHHHHEEECHHHH T ss_conf 904777778999999754404331121 No 58 >pfam07271 Cytadhesin_P30 Cytadhesin P30/P32. This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localized on the tip organelle. It is thought that it is important in cytadherence and virulence. Probab=20.75 E-value=54 Score=13.00 Aligned_cols=26 Identities=23% Similarity=0.239 Sum_probs=16.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 46788999999999999987774332 Q gi|254780745|r 4 FLNSIHLYIHLLALVFVLLKADLSQA 29 (181) Q Consensus 4 ~~~~~~~~~~~~~~l~~~~~~~~a~~ 29 (181) |+...+||++++++...+++.++... T Consensus 5 ~~r~kKLfi~~~ll~~s~lfs~li~L 30 (279) T pfam07271 5 FFRLKKLFIKSKLLGVSFLFSGLIAL 30 (279) T ss_pred HHHHHHHHHHHHEEEHHHHHHHHHHH T ss_conf 13477765230000046888889997 Done!