Query gi|254780746|ref|YP_003065159.1| hypothetical protein CLIBASIA_03165 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 215 No_of_seqs 158 out of 565 Neff 7.9 Searched_HMMs 39220 Date Sun May 29 19:05:18 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780746.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG5375 Uncharacterized protei 100.0 1.9E-41 0 253.2 12.5 195 13-210 21-216 (216) 2 pfam06835 DUF1239 Protein of u 99.9 2.5E-21 6.4E-26 139.4 24.1 168 35-206 4-176 (176) 3 PRK10893 hypothetical protein; 99.8 3.2E-18 8.1E-23 121.8 19.8 182 21-211 2-188 (190) 4 COG3117 Uncharacterized protei 99.6 1.2E-14 3.1E-19 101.5 14.1 176 26-209 6-186 (188) 5 COG1934 Uncharacterized protei 98.7 1.9E-06 4.8E-11 55.3 14.9 116 70-213 48-168 (173) 6 pfam03968 OstA OstA-like prote 98.7 7.9E-07 2E-11 57.4 12.9 44 120-164 97-140 (141) 7 pfam06835 DUF1239 Protein of u 98.7 3.2E-06 8.2E-11 53.9 15.3 106 66-176 69-175 (176) 8 PRK10894 hypothetical protein; 98.6 9.7E-06 2.5E-10 51.2 16.0 116 71-214 52-173 (184) 9 COG1452 Imp Organic solvent to 98.5 9.2E-06 2.3E-10 51.4 13.6 122 78-208 47-170 (784) 10 COG1452 Imp Organic solvent to 98.4 4E-06 1E-10 53.4 9.7 78 120-198 53-131 (784) 11 PRK03761 organic solvent toler 98.4 5.6E-05 1.4E-09 46.9 14.9 127 70-207 40-173 (782) 12 PRK03761 organic solvent toler 98.3 1.2E-05 3.1E-10 50.7 11.2 78 119-198 51-134 (782) 13 PRK04423 organic solvent toler 98.2 0.00024 6.1E-09 43.4 15.3 119 77-208 54-173 (798) 14 pfam03968 OstA OstA-like prote 98.2 0.00036 9.3E-09 42.3 16.1 88 119-207 27-125 (141) 15 PRK10894 hypothetical protein; 98.2 5.5E-05 1.4E-09 47.0 11.8 115 78-200 31-155 (184) 16 COG1934 Uncharacterized protei 98.1 0.00015 3.7E-09 44.6 12.5 81 120-200 62-151 (173) 17 TIGR03002 outer_YhbN cell enve 98.0 0.00029 7.4E-09 42.9 12.5 123 78-208 5-150 (150) 18 TIGR03002 outer_YhbN cell enve 97.9 0.0013 3.2E-08 39.3 14.4 94 117-211 7-114 (150) 19 PRK04423 organic solvent toler 97.8 0.00024 6.1E-09 43.4 9.7 79 118-199 58-136 (798) 20 COG3117 Uncharacterized protei 97.3 0.0085 2.2E-07 34.6 11.9 152 21-176 4-182 (188) 21 PRK10893 hypothetical protein; 97.3 0.0098 2.5E-07 34.2 11.6 105 67-177 77-183 (190) 22 COG5375 Uncharacterized protei 93.9 0.41 1E-05 25.1 8.9 105 66-178 109-213 (216) 23 COG3008 PqiB Paraquat-inducibl 87.7 1.2 3.1E-05 22.4 4.8 64 26-89 21-94 (553) 24 PRK10807 paraquat-inducible pr 87.2 1.3 3.2E-05 22.3 4.6 23 27-49 18-40 (546) 25 TIGR02099 TIGR02099 conserved 80.7 3.8 9.6E-05 19.6 4.8 100 79-184 1171-1281(1362) 26 COG4384 Mu-like prophage prote 62.9 10 0.00026 17.1 7.4 74 123-203 98-175 (203) 27 KOG0860 consensus 52.8 15 0.00039 16.2 3.5 24 2-25 60-84 (116) 28 PRK01904 hypothetical protein; 45.5 20 0.00052 15.5 6.0 96 51-152 25-126 (217) 29 pfam05449 DUF754 Protein of un 42.5 22 0.00057 15.2 2.6 31 11-41 15-45 (83) 30 PRK04517 hypothetical protein; 39.3 26 0.00065 14.9 5.8 49 27-75 1-50 (216) 31 pfam05137 PilN Fimbrial assemb 31.9 34 0.00086 14.2 3.8 20 70-89 109-128 (162) 32 pfam11696 DUF3292 Protein of u 30.0 37 0.00093 14.0 3.6 20 179-198 597-618 (641) 33 pfam03961 DUF342 Protein of un 27.0 41 0.0011 13.7 14.1 106 79-185 138-253 (450) 34 TIGR01000 bacteriocin_acc bact 26.5 42 0.0011 13.7 2.7 22 18-41 15-36 (476) 35 COG3088 CcmH Uncharacterized p 22.1 52 0.0013 13.2 2.1 25 25-49 105-129 (153) 36 TIGR01433 CyoA ubiquinol oxida 22.0 29 0.00075 14.6 0.3 51 19-69 72-124 (228) 37 pfam06890 Phage_Mu_Gp45 Bacter 22.0 52 0.0013 13.2 6.1 18 124-141 78-95 (162) 38 pfam05568 ASFV_J13L African sw 21.2 54 0.0014 13.1 1.6 14 59-72 61-74 (189) 39 pfam12166 DUF3595 Protein of u 20.4 56 0.0014 13.0 3.3 16 70-85 110-125 (416) No 1 >COG5375 Uncharacterized protein conserved in bacteria [Function unknown] Probab=100.00 E-value=1.9e-41 Score=253.17 Aligned_cols=195 Identities=22% Similarity=0.330 Sum_probs=176.0 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCCCCEEEEECEEEEEECCCCCEEEEEEEEEE Q ss_conf 3552010368899999999999999999876420034444-454431345543058711089988499888999960113 Q gi|254780746|r 13 QRSKYRQHVRFIQFLKFFFPFATVTIVGWFLLSSWIRIST-FSKSSIDLLDFEPMIMKKFILSDYSKDRVKYSLVAERAK 91 (215) Q Consensus 13 ~~~~~~r~Sr~V~~lKi~Lpl~al~i~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~m~~P~~~G~~~~g~~~~itA~~A~ 91 (215) ...+++||||+|+.||+.||++|.++++.+.+.+++..+. .....-...+..+++|.+|+|.|++.+|+||.++|++|+ T Consensus 21 ~y~~a~Rhs~rVr~Lki~lpl~a~~~a~~fi~~s~~rt~~~~~~~~~a~~~~gkivm~~P~f~G~nk~~~~ysitA~RAi 100 (216) T COG5375 21 AYARARRHSRRVRRLKIGLPLAAGGLALAFIAQSVWRTPALAVSLETAEIETGKIVLKKPRFNGVNKDGRPYSITAERAI 100 (216) T ss_pred HHHHHHHHCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEHHHHH T ss_conf 89998642121022201578998888875344443113677630032412357479706613256278975033077776 Q ss_pred ECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEECCCCEEEEECCEEEEECCCCEEEEEEEEEEEEEEEEEECCC Q ss_conf 35889874499995689998168845679999345899735646999777799951573999821279972038995385 Q gi|254780746|r 92 TSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFAHSARFNLANHTLYISQPFKMKVKDNLRLDFETAVLDVKNITINSSDP 171 (215) Q Consensus 92 ~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~t~~~~~d~~~~~i~s~~~ 171 (215) ||. .+|+++.|+.|.|.++..+.. ...++|++|+||+++++|.|..+|+|.+++|+.+..+++.+|+.+|.+.++.| T Consensus 101 qD~-~n~~~i~Le~i~a~~pvg~~~--~a~v~Ak~Gvyd~~~ntL~l~~dv~i~~~eGi~a~~~sA~vDl~~G~~sg~~p 177 (216) T COG5375 101 QDV-KNPNIITLEGILAILPVGEDN--PAQVTAKSGVYDRDTNTLDLTDDVKITTDEGIRADFQSAFVDLRTGNMSGDSP 177 (216) T ss_pred HCC-CCCCEEEEEEEEEECCCCCCC--CEEEEECCCEEECCCCEEECCCCEEEECCHHHHCCCCHHEEEECCCCCCCCCC T ss_conf 502-688748996001552479997--34687123426667871324687688535433135601246602561026786 Q ss_pred EEEEECCCEEEEEEEEEEECCCEEEEECCEEEEECHHHH Q ss_conf 899907718997079999278689998896999831132 Q gi|254780746|r 172 VIITHSDFVLSANFARIENSSRSAVFAGQVSVVVNPGVL 210 (215) Q Consensus 172 V~~~~~~g~i~A~~m~i~~~~~~i~F~g~V~l~i~p~~~ 210 (215) |.+.+++|.|+|++|.+.++|++|+|+|+||++|+|+.+ T Consensus 178 v~i~~~~gsI~ads~~v~D~G~iivF~g~Vr~~id~s~~ 216 (216) T COG5375 178 VDISGKGGSITADSYSVTDKGKIIVFEGNVRTRIDPSDI 216 (216) T ss_pred EEECCCCCCEEEEEEEEECCCCEEEEECCEEEEECCCCC T ss_conf 333079984876136752288489995656898447789 No 2 >pfam06835 DUF1239 Protein of unknown function (DUF1239). This family consists of several hypothetical bacterial proteins of around 190 residues in length. The function of this family is unknown. Probab=99.91 E-value=2.5e-21 Score=139.35 Aligned_cols=168 Identities=14% Similarity=0.160 Sum_probs=137.2 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECEEEEEECCCCC-EEEEEEEEEEECCCCCCCEEEEEEEEEEEEEC Q ss_conf 99999987642003444445443134554305871108998849988-89999601133588987449999568999816 Q gi|254780746|r 35 TVTIVGWFLLSSWIRISTFSKSSIDLLDFEPMIMKKFILSDYSKDRV-KYSLVAERAKTSFNSGKGIIFLQDFELTVPTQ 113 (215) Q Consensus 35 al~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~P~~~G~~~~g~-~~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~ 113 (215) +++++++++|+.......+..............|++..+..++.+|+ .|.++|+++.... +.+...|+.|...++.. T Consensus 4 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~pDy~~~~~~~~~~d~~G~l~~~l~a~~~~~~~--~~~~~~l~~P~~~~~~~ 81 (176) T pfam06835 4 ALLLAGWLLWLLSEESSQDEPQVKPDPSEPDYTAENLTTTQYDEQGKLQYRLTADKLEHYP--DDDTTEFTNPVLIIYQD 81 (176) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCEEEEEEECEEEEEC--CCCCEEEEEEEEEEECC T ss_conf 9999999998663765333553568999998999922999999999899999904579965--89968999639999779 Q ss_pred CCCEEEEEEEEEEEEEECCCCEEEEECCEEEEECC----CCEEEEEEEEEEEEEEEEEECCCEEEEECCCEEEEEEEEEE Q ss_conf 88456799993458997356469997777999515----73999821279972038995385899907718997079999 Q gi|254780746|r 114 RSEYGDMYLFAHSARFNLANHTLYISQPFKMKVKD----NLRLDFETAVLDVKNITINSSDPVIITHSDFVLSANFARIE 189 (215) Q Consensus 114 ~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i~~~~----G~~l~t~~~~~d~~~~~i~s~~~V~~~~~~g~i~A~~m~i~ 189 (215) ++ ..+.++|++|.++.+.+.+.|.|+|++.... ..+|.|+.+.|+.+++.+.|+.||.+.+|++.++|.||.++ T Consensus 82 ~~--~~w~i~A~~g~~~~~~~~i~L~g~V~i~~~~~~~~~~~l~T~~l~~~~~~~~~~td~~V~i~~~~~~~~g~gm~~~ 159 (176) T pfam06835 82 GK--PPWTITADRAKLNKDGQTLYLYGNVVIQSLEDKSRLTTITTENLTVNLDTQDASTDQPVTITGPNFVSTGTGMKAN 159 (176) T ss_pred CC--CEEEEEECEEEEECCCCEEEEECCEEEEECCCCCCCEEEEEHEEEEECCCCEEEECCCEEEEECCCEEEEEEEEEE T ss_conf 98--3699997949993799699998999999768998756999220999947779994874999909979998889999 Q ss_pred ECCCEEEEECCEEEEEC Q ss_conf 27868999889699983 Q gi|254780746|r 190 NSSRSAVFAGQVSVVVN 206 (215) Q Consensus 190 ~~~~~i~F~g~V~l~i~ 206 (215) .+.+++.|.|+||..|+ T Consensus 160 l~~~~~~l~~~Vr~~ye 176 (176) T pfam06835 160 LKTGTAELLSNVRGRYE 176 (176) T ss_pred CCCCEEEEECCCEEEEC T ss_conf 75899999436169959 No 3 >PRK10893 hypothetical protein; Provisional Probab=99.83 E-value=3.2e-18 Score=121.82 Aligned_cols=182 Identities=14% Similarity=0.176 Sum_probs=142.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECEEEEEECCCCC-EEEEEEEEEEECCCCCCC Q ss_conf 6889999999999999999987642003444445443134554305871108998849988-899996011335889874 Q gi|254780746|r 21 VRFIQFLKFFFPFATVTIVGWFLLSSWIRISTFSKSSIDLLDFEPMIMKKFILSDYSKDRV-KYSLVAERAKTSFNSGKG 99 (215) Q Consensus 21 Sr~V~~lKi~Lpl~al~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~P~~~G~~~~g~-~~~itA~~A~~~~~~~~~ 99 (215) |+.-+|+-++|-++|+++++|.+. ...... ......+.......+-+=.-++.+|. .|.++|+++ .... ..+ T Consensus 2 ~~~~~w~~~~l~~~a~~l~~w~~~----~~~~~~-~~~~~~~~P~y~a~~l~t~~y~~~G~lsy~i~A~~~-eHY~-~~~ 74 (190) T PRK10893 2 SKTRRWVIIVLSLAALVLIGWNMA----DKDDTA-QVVVNNNDPTYKSEHTDTLVYNPEGALSYRLIAQHV-EYYS-DQA 74 (190) T ss_pred CHHHHHHHHHHHHHHHHHHEEEEC----CCCCCC-CCCCCCCCCCEEEECCEEEEECCCCCEEEEEEECHH-HHCC-CCC T ss_conf 525779999999999998505510----368765-565789999669753368889999978589997142-0131-379 Q ss_pred EEEEEEEEEEEEECCCCEEEEEEEEEEEEEECCCCEEEEECCEEEEEC--CC--CEEEEEEEEEEEEEEEEEECCCEEEE Q ss_conf 499995689998168845679999345899735646999777799951--57--39998212799720389953858999 Q gi|254780746|r 100 IIFLQDFELTVPTQRSEYGDMYLFAHSARFNLANHTLYISQPFKMKVK--DN--LRLDFETAVLDVKNITINSSDPVIIT 175 (215) Q Consensus 100 ~i~L~~v~a~i~~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i~~~--~G--~~l~t~~~~~d~~~~~i~s~~~V~~~ 175 (215) ...++.|...++..++ ...+.|+|++|.+..+ +.|.|.|+|.+..- ++ -++.|+.+.+|+.+..+.|+.||.+. T Consensus 75 ~T~F~~Pvl~ly~~~~-~~~W~VsA~~a~L~~d-~~L~L~gnV~i~~L~~~s~iq~I~T~~l~idL~tqd~~SD~~VtI~ 152 (190) T PRK10893 75 VSWFTQPVLTTFDKDK-VPTWSVKADKAKLTND-RMLYLYGHVEVNALTPDSQLRRITTDNAQINLVTQDVTSDDLVTLY 152 (190) T ss_pred CEEEECCEEEEECCCC-CCCEEEEECEEEECCC-CEEEEECCEEEEECCCCCCCCEEECCCEEEECCCCEEECCCEEEEE T ss_conf 5898186599988999-5327999456798648-8899977799985389754538982758980466468638709998 Q ss_pred ECCCEEEEEEEEEEECCCEEEEECCEEEEECHHHHC Q ss_conf 077189970799992786899988969998311320 Q gi|254780746|r 176 HSDFVLSANFARIENSSRSAVFAGQVSVVVNPGVLQ 211 (215) Q Consensus 176 ~~~g~i~A~~m~i~~~~~~i~F~g~V~l~i~p~~~~ 211 (215) |+++..+|.||..+.+.+.+.+..+||..|+++..+ T Consensus 153 G~~f~s~G~gm~anL~~~~a~L~~~Vkt~YEi~~~~ 188 (190) T PRK10893 153 GTTFNSSGLKMRGNLRSKNAELIEKVRTSYEIQNKQ 188 (190) T ss_pred CCCCEEEEEEEEEEHHHCEEEEHHHCCEEEECCCCC T ss_conf 798399861089993658889045061999636003 No 4 >COG3117 Uncharacterized protein conserved in bacteria [Function unknown] Probab=99.64 E-value=1.2e-14 Score=101.55 Aligned_cols=176 Identities=16% Similarity=0.196 Sum_probs=135.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECEEEEEECCCCC-EEEEEEEEEEECCCCCCCEEEEE Q ss_conf 99999999999999987642003444445443134554305871108998849988-89999601133588987449999 Q gi|254780746|r 26 FLKFFFPFATVTIVGWFLLSSWIRISTFSKSSIDLLDFEPMIMKKFILSDYSKDRV-KYSLVAERAKTSFNSGKGIIFLQ 104 (215) Q Consensus 26 ~lKi~Lpl~al~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~P~~~G~~~~g~-~~~itA~~A~~~~~~~~~~i~L~ 104 (215) |--++|-++|+.+++|+.+.... ........+.+. ..+|++..-.-++.+|. .|.++|... +... .++..+.+ T Consensus 6 w~~~ILll~a~~~~~w~~~~~~~---~~~~v~~~~d~p-~Y~~e~~~~~~~de~G~~~y~l~a~~~-eh~~-~~~~t~ft 79 (188) T COG3117 6 WVYLILLLAALALSGWLLGLEQD---EIEQVRPNPDEP-AYTMEGLDTTVYDEQGKLKYRLTAQHV-EHYS-DSSDTHFT 79 (188) T ss_pred HHHHHHHHHHHHHHHHHHHCCCC---CCCCCCCCCCCC-CEEECCCCEEEECCCCCEEEEEEHHHH-HHCC-CCCCEEEE T ss_conf 99999999999999987620135---524410499986-402057423677777605677632345-4154-56655771 Q ss_pred EEEEEEEECCCCEEEEEEEEEEEEEECCCCEEEEECCEEEEEC-CC---CEEEEEEEEEEEEEEEEEECCCEEEEECCCE Q ss_conf 5689998168845679999345899735646999777799951-57---3999821279972038995385899907718 Q gi|254780746|r 105 DFELTVPTQRSEYGDMYLFAHSARFNLANHTLYISQPFKMKVK-DN---LRLDFETAVLDVKNITINSSDPVIITHSDFV 180 (215) Q Consensus 105 ~v~a~i~~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i~~~-~G---~~l~t~~~~~d~~~~~i~s~~~V~~~~~~g~ 180 (215) .|...++-.+ ...++++|++|.++.+...+.+..+|+.... +. -.++|+++.+++++..+.++.||++.++.+. T Consensus 80 ~P~l~lf~e~--~q~w~v~Ad~a~l~~d~~l~l~~~vvv~~~t~d~~q~~~l~T~~~~V~l~tqd~~te~~V~l~~~~~~ 157 (188) T COG3117 80 APVLTLFREN--RQTWTVQADRAKLTKDGKLYLLGDVVVVALTPDSGQLQRLTTESAQVNLKTQDAQTEDPVTLYGGQFV 157 (188) T ss_pred CCEEEEECCC--CCCEEEEHHHHEECCCCEEEEEECEEEEEECCCCCCCCEEEEEEEEECCCCCCEECCCCEEEECCCCC T ss_conf 5359988388--76279981020215785599871206775147766764587102376133100003775667435440 Q ss_pred EEEEEEEEEECCCEEEEECCEEEEECHHH Q ss_conf 99707999927868999889699983113 Q gi|254780746|r 181 LSANFARIENSSRSAVFAGQVSVVVNPGV 209 (215) Q Consensus 181 i~A~~m~i~~~~~~i~F~g~V~l~i~p~~ 209 (215) -+|.+|..+.+.+.+.|.++||..|++.. T Consensus 158 ~tg~gm~~n~k~~~a~L~~kV~t~yE~~~ 186 (188) T COG3117 158 STANGMTGNLKTQQANLLSKVRTRYEVLK 186 (188) T ss_pred CCCCCEEEEHHHHHHHHHHCCCCEEEECC T ss_conf 23201477546556664212451787246 No 5 >COG1934 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.70 E-value=1.9e-06 Score=55.27 Aligned_cols=116 Identities=15% Similarity=0.090 Sum_probs=73.3 Q ss_pred CEEEEEECCCCC-EEEEEEEEEEECCCCCCC--EEEEEEEEEEEE--ECCCCEEEEEEEEEEEEEECCCCEEEEECCEEE Q ss_conf 108998849988-899996011335889874--499995689998--168845679999345899735646999777799 Q gi|254780746|r 70 KFILSDYSKDRV-KYSLVAERAKTSFNSGKG--IIFLQDFELTVP--TQRSEYGDMYLFAHSARFNLANHTLYISQPFKM 144 (215) Q Consensus 70 ~P~~~G~~~~g~-~~~itA~~A~~~~~~~~~--~i~L~~v~a~i~--~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i 144 (215) ..+|+|...=.+ +..+.|++..-..+.+.+ .+....=.+.+. ...|+ .+.-+|+++.|+..++++.|+|++.+ T Consensus 48 ~a~ftGNV~i~qG~~~l~Adkv~v~~~~~~~~~~i~a~gn~~~~~q~~~~~k--~~~g~ad~~~Y~~~~~~~vLtGna~v 125 (173) T COG1934 48 VATFTGNVVVTQGTITLRADKVTVTRDKEGGKEVIEANGNPATFSQKVDNGK--TQKGTADQAIYDLKNDTIVLTGNAYV 125 (173) T ss_pred EEEEECCEEEEECCCEEEEEEEEEECCCCCCCEEEEEECCEEEEEEEECCCC--CEEEEEEEEEEEECCCEEEEECCEEE T ss_conf 7999763899846617971369998057898548999546667887650785--03888516999820789999477999 Q ss_pred EECCCCEEEEEEEEEEEEEEEEEECCCEEEEECCCEEEEEEEEEEECCCEEEEECCEEEEECHHHHCCC Q ss_conf 951573999821279972038995385899907718997079999278689998896999831132012 Q gi|254780746|r 145 KVKDNLRLDFETAVLDVKNITINSSDPVIITHSDFVLSANFARIENSSRSAVFAGQVSVVVNPGVLQKK 213 (215) Q Consensus 145 ~~~~G~~l~t~~~~~d~~~~~i~s~~~V~~~~~~g~i~A~~m~i~~~~~~i~F~g~V~l~i~p~~~~~~ 213 (215) .... -.++.+.++|+.+++.+..++. =.++|+++|.|...+++ T Consensus 126 ~q~g-n~i~G~kIty~~~~~~~~a~~~-------------------------~~~RV~~V~~p~~~q~~ 168 (173) T COG1934 126 LQLG-NVIRGDKITYLTKTGRAVAESS-------------------------KGGRVRTVLDPEDSQQK 168 (173) T ss_pred EECC-CEEEEEEEEEECCCCEEEECCC-------------------------CCCCEEEEECCHHCCCC T ss_conf 8579-8887639999943457996288-------------------------99847999880552644 No 6 >pfam03968 OstA OstA-like protein. This family of proteins are mostly uncharacterized. However the family does include E. coli OstA that has been characterized as an organic solvent tolerance protein. Probab=98.70 E-value=7.9e-07 Score=57.38 Aligned_cols=44 Identities=9% Similarity=0.073 Sum_probs=27.6 Q ss_pred EEEEEEEEEEECCCCEEEEECCEEEEECCCCEEEEEEEEEEEEEE Q ss_conf 999934589973564699977779995157399982127997203 Q gi|254780746|r 120 MYLFAHSARFNLANHTLYISQPFKMKVKDNLRLDFETAVLDVKNI 164 (215) Q Consensus 120 i~i~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~t~~~~~d~~~~ 164 (215) +..+|+.+.|+..++.+.|.||+++... +-.+..+.+.||+++| T Consensus 97 ~~a~a~~~~Y~~~~~~i~L~Gna~l~q~-~n~i~G~~I~ydl~~g 140 (141) T pfam03968 97 IRGTADRAEYDSKKRIIVLTGNAVLTQG-GNSIRGDKITYDIKEG 140 (141) T ss_pred EEEEEEEEEEEECCCEEEEECCEEEEEC-CCEEEECEEEEECCCC T ss_conf 7999718999916999999733999989-9999968999997679 No 7 >pfam06835 DUF1239 Protein of unknown function (DUF1239). This family consists of several hypothetical bacterial proteins of around 190 residues in length. The function of this family is unknown. Probab=98.67 E-value=3.2e-06 Score=53.94 Aligned_cols=106 Identities=13% Similarity=0.086 Sum_probs=87.2 Q ss_pred EEEECEEEEEECCCCCEEEEEEEEEEECCCCCCCEEEEEE-EEEEEEECCCCEEEEEEEEEEEEEECCCCEEEEECCEEE Q ss_conf 5871108998849988899996011335889874499995-689998168845679999345899735646999777799 Q gi|254780746|r 66 MIMKKFILSDYSKDRVKYSLVAERAKTSFNSGKGIIFLQD-FELTVPTQRSEYGDMYLFAHSARFNLANHTLYISQPFKM 144 (215) Q Consensus 66 ~~m~~P~~~G~~~~g~~~~itA~~A~~~~~~~~~~i~L~~-v~a~i~~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i 144 (215) ..+++|++.-++.+|.+|.++|++|.-.. +.+.+.|.+ |.+.- ..+......+.++.-.|+..++.+.=..+|++ T Consensus 69 ~~l~~P~~~~~~~~~~~w~i~A~~g~~~~--~~~~i~L~g~V~i~~--~~~~~~~~~l~T~~l~~~~~~~~~~td~~V~i 144 (176) T pfam06835 69 TEFTNPVLIIYQDGKPPWTITADRAKLNK--DGQTLYLYGNVVIQS--LEDKSRLTTITTENLTVNLDTQDASTDQPVTI 144 (176) T ss_pred EEEEEEEEEEECCCCCEEEEEECEEEEEC--CCCEEEEECCEEEEE--CCCCCCCEEEEEHEEEEECCCCEEEECCCEEE T ss_conf 89996399997799836999979499937--996999989999997--68998756999220999947779994874999 Q ss_pred EECCCCEEEEEEEEEEEEEEEEEECCCEEEEE Q ss_conf 95157399982127997203899538589990 Q gi|254780746|r 145 KVKDNLRLDFETAVLDVKNITINSSDPVIITH 176 (215) Q Consensus 145 ~~~~G~~l~t~~~~~d~~~~~i~s~~~V~~~~ 176 (215) ...+ ..++...+.+|++++.+.-.+.|.+.. T Consensus 145 ~~~~-~~~~g~gm~~~l~~~~~~l~~~Vr~~y 175 (176) T pfam06835 145 TGPN-FVSTGTGMKANLKTGTAELLSNVRGRY 175 (176) T ss_pred EECC-CEEEEEEEEEECCCCEEEEECCCEEEE T ss_conf 9099-799988899997589999943616995 No 8 >PRK10894 hypothetical protein; Provisional Probab=98.59 E-value=9.7e-06 Score=51.23 Aligned_cols=116 Identities=14% Similarity=0.106 Sum_probs=70.2 Q ss_pred EEEEEECC-CCCEEEEEEEEEEECCCC-CCCE--EEEEEEEEEEE--ECCCCEEEEEEEEEEEEEECCCCEEEEECCEEE Q ss_conf 08998849-988899996011335889-8744--99995689998--168845679999345899735646999777799 Q gi|254780746|r 71 FILSDYSK-DRVKYSLVAERAKTSFNS-GKGI--IFLQDFELTVP--TQRSEYGDMYLFAHSARFNLANHTLYISQPFKM 144 (215) Q Consensus 71 P~~~G~~~-~g~~~~itA~~A~~~~~~-~~~~--i~L~~v~a~i~--~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i 144 (215) ..|+|.-. .-....|+|++..-..+. ..+. +...+=-|.+. ..+|++ +.-.|++..|+..++.+.|.||..+ T Consensus 52 avytGNVvitQGsl~I~ADkv~v~~~~~~~~~~~i~a~G~pA~f~Q~~~~g~~--i~a~A~~i~Y~~~~~~i~L~GnA~L 129 (184) T PRK10894 52 VTFTGNVVVTQGTIKINADKVVVTRPGGEQGKEVIDGYGNPATFYQMQDNGKP--VKGHASKMHYELAKDFVVLTGNAYL 129 (184) T ss_pred EEEEEEEEEEEEEEEEEEEEEEEEECCCCCCEEEEEEECCCEEEEEECCCCCC--EEEEEEEEEEECCCCEEEEECCEEE T ss_conf 99992599996259999358999976997537999996893799996568981--3769748999826999999765899 Q ss_pred EECCCCEEEEEEEEEEEEEEEEEECCCEEEEECCCEEEEEEEEEEECCCEEEEECCEEEEECHHHHCCCC Q ss_conf 9515739998212799720389953858999077189970799992786899988969998311320125 Q gi|254780746|r 145 KVKDNLRLDFETAVLDVKNITINSSDPVIITHSDFVLSANFARIENSSRSAVFAGQVSVVVNPGVLQKKE 214 (215) Q Consensus 145 ~~~~G~~l~t~~~~~d~~~~~i~s~~~V~~~~~~g~i~A~~m~i~~~~~~i~F~g~V~l~i~p~~~~~~~ 214 (215) .... -.+..+.+.||++++.+...+ . + .++|++++.|...+++. T Consensus 130 ~Q~~-~~v~G~~I~Ydl~~~~~~a~s------~-------------~------~~RV~~vi~P~~~~~k~ 173 (184) T PRK10894 130 EQLD-SNIKGDKITYLVKEQKMQAFS------D-------------K------GKRVTTVLVPSQLQDKN 173 (184) T ss_pred EECC-CEEEECEEEEEECCCEEEEEC------C-------------C------CCEEEEEECCCCCCCCC T ss_conf 9689-989937899990667999967------9-------------9------98079998940158655 No 9 >COG1452 Imp Organic solvent tolerance protein OstA [Cell envelope biogenesis, outer membrane] Probab=98.49 E-value=9.2e-06 Score=51.37 Aligned_cols=122 Identities=9% Similarity=0.084 Sum_probs=91.9 Q ss_pred CCCCE-EEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEECCCCEEEEECCEEEEECCCCEEEEEE Q ss_conf 99888-99996011335889874499995689998168845679999345899735646999777799951573999821 Q gi|254780746|r 78 KDRVK-YSLVAERAKTSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFAHSARFNLANHTLYISQPFKMKVKDNLRLDFET 156 (215) Q Consensus 78 ~~g~~-~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~t~~ 156 (215) ..+.| +.+.|+.+.-+ .+.++|..++ .+.| ..|. .+|.|++-.||++++.+...|||.+...+|..+.++. T Consensus 47 ~~~~p~~~iea~~~~~~--~~~~~ita~G-nV~i--~~~~---~~l~Ad~v~y~q~t~~~~A~GnV~~~~~~g~~i~gd~ 118 (784) T COG1452 47 EQNLPKILIEADALEYN--RKDQIITAEG-NVEI--LYGD---YRLLADKVDYDQKTDRVTATGNVEYIDGDGILIKGDK 118 (784) T ss_pred CCCCCCEEEEHHHHCCC--CCCCEEEEEC-EEEE--EECC---EEEEEEEEEEECCCCEEEEECCEEEECCCCCEEECCE T ss_conf 45698458980020014--7785699966-2999--9688---8998758997234788999777999828980998756 Q ss_pred EEEEEEEEEEEECC-CEEEEECCCEEEEEEEEEEECCCEEEEECCEEEEECHH Q ss_conf 27997203899538-58999077189970799992786899988969998311 Q gi|254780746|r 157 AVLDVKNITINSSD-PVIITHSDFVLSANFARIENSSRSAVFAGQVSVVVNPG 208 (215) Q Consensus 157 ~~~d~~~~~i~s~~-~V~~~~~~g~i~A~~m~i~~~~~~i~F~g~V~l~i~p~ 208 (215) +.++++.+...-.. -++..+..+.+.|+.++..+ ++...++.++-+.-.|. T Consensus 119 a~~~ld~~~g~~~n~~~~~~~~~~~~~A~~a~~~d-~~~~~l~~at~TtC~~~ 170 (784) T COG1452 119 ADVNLDTKKGFVSNFRYQTVGRQGRFVAKSAERRD-NDVTILKNATYTTCEPC 170 (784) T ss_pred EEEECCCCCCEEEEEEEEECCCEEEEEEEEEEECC-CEEEEEEEEEEECCCCC T ss_conf 89961457645860089852522579985522148-70699951088236888 No 10 >COG1452 Imp Organic solvent tolerance protein OstA [Cell envelope biogenesis, outer membrane] Probab=98.39 E-value=4e-06 Score=53.41 Aligned_cols=78 Identities=12% Similarity=0.182 Sum_probs=58.6 Q ss_pred EEEEEEEEEEECCCCEEEEECCEEEEECCCCEEEEEEEEEEEEEEEEEECCCEEEEE-CCCEEEEEEEEEEECCCEEEEE Q ss_conf 999934589973564699977779995157399982127997203899538589990-7718997079999278689998 Q gi|254780746|r 120 MYLFAHSARFNLANHTLYISQPFKMKVKDNLRLDFETAVLDVKNITINSSDPVIITH-SDFVLSANFARIENSSRSAVFA 198 (215) Q Consensus 120 i~i~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~t~~~~~d~~~~~i~s~~~V~~~~-~~g~i~A~~m~i~~~~~~i~F~ 198 (215) +.|.|+.-.|+.+.+.+.+.|||.|.+++ +.+.++++.||.+++.+...|+|++.. .+..|.|+.+.++++.+.-.+. T Consensus 53 ~~iea~~~~~~~~~~~ita~GnV~i~~~~-~~l~Ad~v~y~q~t~~~~A~GnV~~~~~~g~~i~gd~a~~~ld~~~g~~~ 131 (784) T COG1452 53 ILIEADALEYNRKDQIITAEGNVEILYGD-YRLLADKVDYDQKTDRVTATGNVEYIDGDGILIKGDKADVNLDTKKGFVS 131 (784) T ss_pred EEEEHHHHCCCCCCCEEEEECEEEEEECC-EEEEEEEEEEECCCCEEEEECCEEEECCCCCEEECCEEEEECCCCCCEEE T ss_conf 58980020014778569996629999688-89987589972347889997779998289809987568996145764586 No 11 >PRK03761 organic solvent tolerance protein; Provisional Probab=98.36 E-value=5.6e-05 Score=46.93 Aligned_cols=127 Identities=9% Similarity=0.070 Sum_probs=91.5 Q ss_pred CEEEEEECCCCCEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEECCCC------EEEEECCEE Q ss_conf 10899884998889999601133588987449999568999816884567999934589973564------699977779 Q gi|254780746|r 70 KFILSDYSKDRVKYSLVAERAKTSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFAHSARFNLANH------TLYISQPFK 143 (215) Q Consensus 70 ~P~~~G~~~~g~~~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A~~G~y~~~~~------~l~L~g~V~ 143 (215) .|..+| +.+..|..|.|+++.-. .++.+.++ +.+.+..|. ..|.|++..|++.++ .+...|||. T Consensus 40 ~~~~~~-~~~~~pi~i~AD~~~~~---~~~~~~~~---GnV~i~~g~---~~l~Ad~v~~~q~~~~~~~~~~~~a~Gnv~ 109 (782) T PRK03761 40 RPLVTG-DPNDLPVTINADNAEAN---YPDKAVYT---GNVDIKQGN---SRLQADEVQLHQQEAPGEPVRTVDAEGNVH 109 (782) T ss_pred CCCCCC-CCCCCCEEEEECCEEEC---CCCEEEEE---EEEEEEECC---EEEECCEEEEECCCCCCCCCEEEEEECCEE T ss_conf 674269-98888479993324625---89779999---339999799---899956799952378666502899868989 Q ss_pred EEECCCCEEEEEEEEEEEEEEEEEECCC-EEEEECCCEEEEEEEEEEECCCEEEEECCEEEEECH Q ss_conf 9951573999821279972038995385-899907718997079999278689998896999831 Q gi|254780746|r 144 MKVKDNLRLDFETAVLDVKNITINSSDP-VIITHSDFVLSANFARIENSSRSAVFAGQVSVVVNP 207 (215) Q Consensus 144 i~~~~G~~l~t~~~~~d~~~~~i~s~~~-V~~~~~~g~i~A~~m~i~~~~~~i~F~g~V~l~i~p 207 (215) +. +.+..+..+.+.+++.......... ..+.+..+.-.|+.+....+++..+|++++-+.-.| T Consensus 110 ~~-~~~~~~~g~~~~~~~~~~~~~~~~~~y~~~~~~~~g~A~~~~~~~~~~~~~l~~~~yTtC~p 173 (782) T PRK03761 110 YD-DNQIILKGPKAWSNLNTKDTNVWNGDYQMVGRQGRGKADLMKQRGENRYTILENGSFTSCLP 173 (782) T ss_pred EE-CCCEEEEECEEEEECCCCEEEEECCEEEEECCCCEEEEEEEEECCCCCEEEECCEEECCCCC T ss_conf 96-89618981469995377645996248999737860470025991798489981718830899 No 12 >PRK03761 organic solvent tolerance protein; Provisional Probab=98.34 E-value=1.2e-05 Score=50.66 Aligned_cols=78 Identities=14% Similarity=0.141 Sum_probs=39.9 Q ss_pred EEEEEEEEEEEECCCCEEEEECCEEEEECCCCEEEEEEEEEEEEEE------EEEECCCEEEEECCCEEEEEEEEEEECC Q ss_conf 7999934589973564699977779995157399982127997203------8995385899907718997079999278 Q gi|254780746|r 119 DMYLFAHSARFNLANHTLYISQPFKMKVKDNLRLDFETAVLDVKNI------TINSSDPVIITHSDFVLSANFARIENSS 192 (215) Q Consensus 119 ~i~i~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~t~~~~~d~~~~------~i~s~~~V~~~~~~g~i~A~~m~i~~~~ 192 (215) +|+|.|++..++. .+.+.++|||.|...+ ..|.++++.++-+++ ++...|+|.+..|...+.|+++.++.+. T Consensus 51 pi~i~AD~~~~~~-~~~~~~~GnV~i~~g~-~~l~Ad~v~~~q~~~~~~~~~~~~a~Gnv~~~~~~~~~~g~~~~~~~~~ 128 (782) T PRK03761 51 PVTINADNAEANY-PDKAVYTGNVDIKQGN-SRLQADEVQLHQQEAPGEPVRTVDAEGNVHYDDNQIILKGPKAWSNLNT 128 (782) T ss_pred CEEEEECCEEECC-CCEEEEEEEEEEEECC-EEEECCEEEEECCCCCCCCCEEEEEECCEEEECCCEEEEECEEEEECCC T ss_conf 4799933246258-9779999339999799-8999567999523786665028998689899689618981469995377 Q ss_pred CEEEEE Q ss_conf 689998 Q gi|254780746|r 193 RSAVFA 198 (215) Q Consensus 193 ~~i~F~ 198 (215) +...+. T Consensus 129 ~~~~~~ 134 (782) T PRK03761 129 KDTNVW 134 (782) T ss_pred CEEEEE T ss_conf 645996 No 13 >PRK04423 organic solvent tolerance protein; Provisional Probab=98.20 E-value=0.00024 Score=43.38 Aligned_cols=119 Identities=8% Similarity=0.099 Sum_probs=84.5 Q ss_pred CCCCCEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEECCCCEEEEECCEEEEECCCCEEEEEE Q ss_conf 49988899996011335889874499995689998168845679999345899735646999777799951573999821 Q gi|254780746|r 77 SKDRVKYSLVAERAKTSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFAHSARFNLANHTLYISQPFKMKVKDNLRLDFET 156 (215) Q Consensus 77 ~~~g~~~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~t~~ 156 (215) .....|..|.|++.. - .+++. ..++.+.+..|. -.|.|++..||+.++.+...|||.+. +.|..+..+. T Consensus 54 ~~~~~p~~v~aD~~~-~---~~~~~---~f~GnV~i~qg~---~~l~AD~v~ydq~~~~v~A~GnV~~~-~~~~~~~g~~ 122 (798) T PRK04423 54 MRQQLPTDIEGDQLS-G---TSTTP---QYQGNVALKRGD---QFLGADNLRMDTETGNYIAEGNVRYQ-DTSIRMVADR 122 (798) T ss_pred CCCCCCEEEECCCCC-C---CCCCC---EEEEEEEEEECC---EEEEEEEEEEECCCCEEEEECCEEEE-CCCCEEECCE T ss_conf 455698588764023-5---57751---798559999799---89991179998678989997778996-7985898142 Q ss_pred EEEEEEEEEEEECCCE-EEEECCCEEEEEEEEEEECCCEEEEECCEEEEECHH Q ss_conf 2799720389953858-999077189970799992786899988969998311 Q gi|254780746|r 157 AVLDVKNITINSSDPV-IITHSDFVLSANFARIENSSRSAVFAGQVSVVVNPG 208 (215) Q Consensus 157 ~~~d~~~~~i~s~~~V-~~~~~~g~i~A~~m~i~~~~~~i~F~g~V~l~i~p~ 208 (215) +.++++.+.....++- ......+.-.|..+++ .++...++.+.-+.-.|+ T Consensus 123 ~~~~l~~~~g~~~~~~y~l~~~~~~G~A~~~~~--~g~~~~l~~~~~ttC~~~ 173 (798) T PRK04423 123 AEGNQDTDTHKITNIQYQLVERRGNGDAESVDL--QGQVGQMHRSTYTTCDPS 173 (798) T ss_pred EEEECCCCEEEEECCEEEEEECCCCEEEEEEEE--CCCEEEEECCEECCCCCC T ss_conf 899826764799667899994677236677674--486479957198328988 No 14 >pfam03968 OstA OstA-like protein. This family of proteins are mostly uncharacterized. However the family does include E. coli OstA that has been characterized as an organic solvent tolerance protein. Probab=98.19 E-value=0.00036 Score=42.34 Aligned_cols=88 Identities=10% Similarity=0.122 Sum_probs=39.4 Q ss_pred EEEEEEEEEEEECCCCEEEEECCEEEEECCCCEEEEEEEEEEEEEE-----EEEECCCEEE------EECCCEEEEEEEE Q ss_conf 7999934589973564699977779995157399982127997203-----8995385899------9077189970799 Q gi|254780746|r 119 DMYLFAHSARFNLANHTLYISQPFKMKVKDNLRLDFETAVLDVKNI-----TINSSDPVII------THSDFVLSANFAR 187 (215) Q Consensus 119 ~i~i~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~t~~~~~d~~~~-----~i~s~~~V~~------~~~~g~i~A~~m~ 187 (215) +|.|+|+...++..++...++|||.+...+ +.++++.+.+..+++ .+...+|+.. .++...-+|+.+. T Consensus 27 PI~I~AD~~~~d~~~~~~~~~GnV~i~QG~-~~i~ad~~~i~~~~~~~~~~~~~a~g~~~~~~~~~~~~~~~~a~a~~~~ 105 (141) T pfam03968 27 PIEIEADNLEADDKKGVATFTGNVVVTQGS-ITLRADKVVVTQDKKSNGIERLEATGPATFRQKVDLVGRQIRGTADRAE 105 (141) T ss_pred CEEEEEEEEEEECCCCEEEEECCEEEEECC-EEEEEEEEEEEECCCCCCEEEEEEECCEEEEEEEECCCCCEEEEEEEEE T ss_conf 989994579998279999998749998087-0999648999986888967999997760499986148886799971899 Q ss_pred EEECCCEEEEECCEEEEECH Q ss_conf 99278689998896999831 Q gi|254780746|r 188 IENSSRSAVFAGQVSVVVNP 207 (215) Q Consensus 188 i~~~~~~i~F~g~V~l~i~p 207 (215) ++...+.+.|+|+|++.-.. T Consensus 106 Y~~~~~~i~L~Gna~l~q~~ 125 (141) T pfam03968 106 YDSKKRIIVLTGNAVLTQGG 125 (141) T ss_pred EEECCCEEEEECCEEEEECC T ss_conf 99169999997339999899 No 15 >PRK10894 hypothetical protein; Provisional Probab=98.18 E-value=5.5e-05 Score=46.98 Aligned_cols=115 Identities=3% Similarity=-0.028 Sum_probs=80.1 Q ss_pred CCCCEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEECC-----CCEEEEECC-EEEEE--CCC Q ss_conf 9988899996011335889874499995689998168845679999345899735-----646999777-79995--157 Q gi|254780746|r 78 KDRVKYSLVAERAKTSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFAHSARFNLA-----NHTLYISQP-FKMKV--KDN 149 (215) Q Consensus 78 ~~g~~~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A~~G~y~~~-----~~~l~L~g~-V~i~~--~~G 149 (215) +..+|+.|.|+++.-|... ++... .+.+.+.-|+ +.|.|++-+.... ...+.-.|+ +.... ++| T Consensus 31 D~~qPI~I~AD~~~~d~~~--~~avy---tGNVvitQGs---l~I~ADkv~v~~~~~~~~~~~i~a~G~pA~f~Q~~~~g 102 (184) T PRK10894 31 DTEQPIHIDSDQQSLDMQG--NVVTF---TGNVVVTQGT---IKINADKVVVTRPGGEQGKEVIDGYGNPATFYQMQDNG 102 (184) T ss_pred CCCCCEEEEECEEEEECCC--CEEEE---EEEEEEEEEE---EEEEEEEEEEEECCCCCCEEEEEEECCCEEEEEECCCC T ss_conf 4469989994528943159--98999---9259999625---99993589999769975379999968937999965689 Q ss_pred --CEEEEEEEEEEEEEEEEEECCCEEEEECCCEEEEEEEEEEECCCEEEEECC Q ss_conf --399982127997203899538589990771899707999927868999889 Q gi|254780746|r 150 --LRLDFETAVLDVKNITINSSDPVIITHSDFVLSANFARIENSSRSAVFAGQ 200 (215) Q Consensus 150 --~~l~t~~~~~d~~~~~i~s~~~V~~~~~~g~i~A~~m~i~~~~~~i~F~g~ 200 (215) +..+++.+.|+.+++.+...+...++.....++++.+.++.+.+++.-.|+ T Consensus 103 ~~i~a~A~~i~Y~~~~~~i~L~GnA~L~Q~~~~v~G~~I~Ydl~~~~~~a~s~ 155 (184) T PRK10894 103 KPVKGHASKMHYELAKDFVVLTGNAYLEQLDSNIKGDKITYLVKEQKMQAFSD 155 (184) T ss_pred CCEEEEEEEEEEECCCCEEEEECCEEEEECCCEEEECEEEEEECCCEEEEECC T ss_conf 81376974899982699999976589996899899378999906679999679 No 16 >COG1934 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.10 E-value=0.00015 Score=44.57 Aligned_cols=81 Identities=14% Similarity=0.055 Sum_probs=59.0 Q ss_pred EEEEEEEEEEEC--CCC--EEEEECC----EEEEEC-CCCEEEEEEEEEEEEEEEEEECCCEEEEECCCEEEEEEEEEEE Q ss_conf 999934589973--564--6999777----799951-5739998212799720389953858999077189970799992 Q gi|254780746|r 120 MYLFAHSARFNL--ANH--TLYISQP----FKMKVK-DNLRLDFETAVLDVKNITINSSDPVIITHSDFVLSANFARIEN 190 (215) Q Consensus 120 i~i~A~~G~y~~--~~~--~l~L~g~----V~i~~~-~G~~l~t~~~~~d~~~~~i~s~~~V~~~~~~g~i~A~~m~i~~ 190 (215) +.|.|++-.+.. ... .+...|+ +...+. .-+..+.+.+.|+++++++...+++.+...+..|+|+.|.+.. T Consensus 62 ~~l~Adkv~v~~~~~~~~~~i~a~gn~~~~~q~~~~~k~~~g~ad~~~Y~~~~~~~vLtGna~v~q~gn~i~G~kIty~~ 141 (173) T COG1934 62 ITLRADKVTVTRDKEGGKEVIEANGNPATFSQKVDNGKTQKGTADQAIYDLKNDTIVLTGNAYVLQLGNVIRGDKITYLT 141 (173) T ss_pred CEEEEEEEEEECCCCCCCEEEEEECCEEEEEEEECCCCCEEEEEEEEEEEECCCEEEEECCEEEEECCCEEEEEEEEEEC T ss_conf 17971369998057898548999546667887650785038885169998207899994779998579888763999994 Q ss_pred CCCEEEEECC Q ss_conf 7868999889 Q gi|254780746|r 191 SSRSAVFAGQ 200 (215) Q Consensus 191 ~~~~i~F~g~ 200 (215) +.+.+.-.+. T Consensus 142 ~~~~~~a~~~ 151 (173) T COG1934 142 KTGRAVAESS 151 (173) T ss_pred CCCEEEECCC T ss_conf 3457996288 No 17 >TIGR03002 outer_YhbN cell envelope biogenesis protein YhbN; InterPro: IPR014340 Members of this entry include YhbN, which is an essential protein in Escherichia coli. It is implicated in cell envelope integrity, and has numerous homologues in other Proteobacteria in a conserved location near lipopolysaccharide inner core biosynthesis genes. This entry is distantly related to the organic solvent tolerance protein (OstA).. Probab=98.00 E-value=0.00029 Score=42.90 Aligned_cols=123 Identities=7% Similarity=0.085 Sum_probs=81.6 Q ss_pred CCCCEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEEC--CCC---EEEEECC--EEEEE--C- Q ss_conf 998889999601133588987449999568999816884567999934589973--564---6999777--79995--1- Q gi|254780746|r 78 KDRVKYSLVAERAKTSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFAHSARFNL--ANH---TLYISQP--FKMKV--K- 147 (215) Q Consensus 78 ~~g~~~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A~~G~y~~--~~~---~l~L~g~--V~i~~--~- 147 (215) +..+|+.|.|+++.-|..+ +++.. .+.+.+.-|+ +.|+|++=+... ..+ .+.-+|+ +.... + T Consensus 5 D~~qPi~I~Ad~~~~D~~~--~~~~f---~GnV~itQG~---~~I~ADkv~v~~~~~~~~~~~~~A~G~~PA~F~q~~d~ 76 (150) T TIGR03002 5 DREQPIEIEADSQTLDDKK--NVVTF---TGNVVITQGT---LKIRADKVVVTRPNDGGGIEKATATGNKPATFRQKLDD 76 (150) T ss_pred CCCCCEEEEEEEEEEECCC--CEEEE---ECCEEEEECC---EEEECCEEEEEECCCCCCEEEEEEECCCCEEEEEEECC T ss_conf 7568689994006654028--83888---6565878297---59852288988268878608999980781898899559 Q ss_pred -CC-CEEEEEEEEEEEEEEEEEECCCEEEEECC-CEEEEEEEEEEECCCEEEE----E------CCEEEEECHH Q ss_conf -57-39998212799720389953858999077-1899707999927868999----8------8969998311 Q gi|254780746|r 148 -DN-LRLDFETAVLDVKNITINSSDPVIITHSD-FVLSANFARIENSSRSAVF----A------GQVSVVVNPG 208 (215) Q Consensus 148 -~G-~~l~t~~~~~d~~~~~i~s~~~V~~~~~~-g~i~A~~m~i~~~~~~i~F----~------g~V~l~i~p~ 208 (215) +. +.-.++.+.|+..+..+...+.-.+...+ ..++|+.+.++...+.+.- . ++|+++|.|+ T Consensus 77 g~~~~~g~A~~i~Y~~~~~~~~Ltg~A~v~Q~~k~~~~G~~I~Y~~~~q~~~a~~~~~~~~~~~~RV~~ii~P~ 150 (150) T TIGR03002 77 GKEYIEGQANRIEYDPAKDTVVLTGNARVKQGGKNEVSGDVITYDTKKQKVTAKGGGGSAKGASGRVTTIIQPK 150 (150) T ss_pred CCEEEEEECCEEEEECCCCEEEEECCEEEEECCCCEEECCEEEEECEEEEEEEEECCCCCCCCCCCEEEEEECC T ss_conf 98279997118999325779999123499788977787668898440129989807888998697179998409 No 18 >TIGR03002 outer_YhbN cell envelope biogenesis protein YhbN; InterPro: IPR014340 Members of this entry include YhbN, which is an essential protein in Escherichia coli. It is implicated in cell envelope integrity, and has numerous homologues in other Proteobacteria in a conserved location near lipopolysaccharide inner core biosynthesis genes. This entry is distantly related to the organic solvent tolerance protein (OstA).. Probab=97.91 E-value=0.0013 Score=39.29 Aligned_cols=94 Identities=10% Similarity=0.104 Sum_probs=62.5 Q ss_pred EEEEEEEEEEEEEECCCCEEEEECCEEEEECCCCEEEEEEEEEEE--EE---EEEEECCC--EEEEE----CC--CEEEE Q ss_conf 567999934589973564699977779995157399982127997--20---38995385--89990----77--18997 Q gi|254780746|r 117 YGDMYLFAHSARFNLANHTLYISQPFKMKVKDNLRLDFETAVLDV--KN---ITINSSDP--VIITH----SD--FVLSA 183 (215) Q Consensus 117 ~~~i~i~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~t~~~~~d~--~~---~~i~s~~~--V~~~~----~~--g~i~A 183 (215) ..+|+|.|+...+|..++...|+|||.|+-.+ +.+.++.+.+.- +. ..+.+.|. +.+.. .. ..=.| T Consensus 7 ~qPi~I~Ad~~~~D~~~~~~~f~GnV~itQG~-~~I~ADkv~v~~~~~~~~~~~~~A~G~~PA~F~q~~d~g~~~~~g~A 85 (150) T TIGR03002 7 EQPIEIEADSQTLDDKKNVVTFTGNVVITQGT-LKIRADKVVVTRPNDGGGIEKATATGNKPATFRQKLDDGKEYIEGQA 85 (150) T ss_pred CCCEEEEEEEEEEECCCCEEEEECCEEEEECC-EEEECCEEEEEECCCCCCEEEEEEECCCCEEEEEEECCCCEEEEEEC T ss_conf 68689994006654028838886565878297-59852288988268878608999980781898899559982799971 Q ss_pred EEEEEEECCCEEEEECCEEEEECH-HHHC Q ss_conf 079999278689998896999831-1320 Q gi|254780746|r 184 NFARIENSSRSAVFAGQVSVVVNP-GVLQ 211 (215) Q Consensus 184 ~~m~i~~~~~~i~F~g~V~l~i~p-~~~~ 211 (215) +.|+++...+.++++|+.++.=.. +.++ T Consensus 86 ~~i~Y~~~~~~~~Ltg~A~v~Q~~k~~~~ 114 (150) T TIGR03002 86 NRIEYDPAKDTVVLTGNARVKQGGKNEVS 114 (150) T ss_pred CEEEEECCCCEEEEECCEEEEECCCCEEE T ss_conf 18999325779999123499788977787 No 19 >PRK04423 organic solvent tolerance protein; Provisional Probab=97.84 E-value=0.00024 Score=43.36 Aligned_cols=79 Identities=10% Similarity=-0.027 Sum_probs=56.9 Q ss_pred EEEEEEEEEEEEECCCCEEEEECCEEEEECCCCEEEEEEEEEEEEEEEEEECCCEEEEECCCEEEEEEEEEEECCCEEEE Q ss_conf 67999934589973564699977779995157399982127997203899538589990771899707999927868999 Q gi|254780746|r 118 GDMYLFAHSARFNLANHTLYISQPFKMKVKDNLRLDFETAVLDVKNITINSSDPVIITHSDFVLSANFARIENSSRSAVF 197 (215) Q Consensus 118 ~~i~i~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~t~~~~~d~~~~~i~s~~~V~~~~~~g~i~A~~m~i~~~~~~i~F 197 (215) .++.|.|++-.-.. +...++|||.|...+ -.+.++.+.||-+++.+...|+|.+..++..+.|+.+++..+.+...+ T Consensus 58 ~p~~v~aD~~~~~~--~~~~f~GnV~i~qg~-~~l~AD~v~ydq~~~~v~A~GnV~~~~~~~~~~g~~~~~~l~~~~g~~ 134 (798) T PRK04423 58 LPTDIEGDQLSGTS--TTPQYQGNVALKRGD-QFLGADNLRMDTETGNYIAEGNVRYQDTSIRMVADRAEGNQDTDTHKI 134 (798) T ss_pred CCEEEECCCCCCCC--CCCEEEEEEEEEECC-EEEEEEEEEEECCCCEEEEECCEEEECCCCEEECCEEEEECCCCEEEE T ss_conf 98588764023557--751798559999799-899911799986789899977789967985898142899826764799 Q ss_pred EC Q ss_conf 88 Q gi|254780746|r 198 AG 199 (215) Q Consensus 198 ~g 199 (215) .+ T Consensus 135 ~~ 136 (798) T PRK04423 135 TN 136 (798) T ss_pred EC T ss_conf 66 No 20 >COG3117 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.34 E-value=0.0085 Score=34.60 Aligned_cols=152 Identities=14% Similarity=0.105 Sum_probs=106.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC-----------CCCC----------------CCEEEEECEEE Q ss_conf 68899999999999999999876420034444454431-----------3455----------------43058711089 Q gi|254780746|r 21 VRFIQFLKFFFPFATVTIVGWFLLSSWIRISTFSKSSI-----------DLLD----------------FEPMIMKKFIL 73 (215) Q Consensus 21 Sr~V~~lKi~Lpl~al~i~~~~~~~~~~~~~~~~~~~~-----------~~~~----------------~~~~~m~~P~~ 73 (215) .|+|...-.++-++.++..+...-..........+.+. ++.- ....-.+.|.+ T Consensus 4 ~Rw~~~ILll~a~~~~~w~~~~~~~~~~~v~~~~d~p~Y~~e~~~~~~~de~G~~~y~l~a~~~eh~~~~~~t~ft~P~l 83 (188) T COG3117 4 RRWVYLILLLAALALSGWLLGLEQDEIEQVRPNPDEPAYTMEGLDTTVYDEQGKLKYRLTAQHVEHYSDSSDTHFTAPVL 83 (188) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEECCCCEEEECCCCCEEEEEEHHHHHHCCCCCCEEEECCEE T ss_conf 52999999999999999876201355244104999864020574236777776056776323454154566557715359 Q ss_pred EEECCCCCEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEECCCCEEEEECCEEEEECCCCEEE Q ss_conf 98849988899996011335889874499995689998168845679999345899735646999777799951573999 Q gi|254780746|r 74 SDYSKDRVKYSLVAERAKTSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFAHSARFNLANHTLYISQPFKMKVKDNLRLD 153 (215) Q Consensus 74 ~G~~~~g~~~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~ 153 (215) .=++.+.++|.++|++|.-. ++..++.+.++...-...+.. ..-.++.++...+.+++.+.-...|+|+.. +...+ T Consensus 84 ~lf~e~~q~w~v~Ad~a~l~--~d~~l~l~~~vvv~~~t~d~~-q~~~l~T~~~~V~l~tqd~~te~~V~l~~~-~~~~t 159 (188) T COG3117 84 TLFRENRQTWTVQADRAKLT--KDGKLYLLGDVVVVALTPDSG-QLQRLTTESAQVNLKTQDAQTEDPVTLYGG-QFVST 159 (188) T ss_pred EEECCCCCCEEEEHHHHEEC--CCCEEEEEECEEEEEECCCCC-CCCEEEEEEEEECCCCCCEECCCCEEEECC-CCCCC T ss_conf 98838876279981020215--785599871206775147766-764587102376133100003775667435-44023 Q ss_pred EEEEEEEEEEEEEEECCCEEEEE Q ss_conf 82127997203899538589990 Q gi|254780746|r 154 FETAVLDVKNITINSSDPVIITH 176 (215) Q Consensus 154 t~~~~~d~~~~~i~s~~~V~~~~ 176 (215) ...+.++++.+.+...++|...- T Consensus 160 g~gm~~n~k~~~a~L~~kV~t~y 182 (188) T COG3117 160 ANGMTGNLKTQQANLLSKVRTRY 182 (188) T ss_pred CCCEEEEHHHHHHHHHHCCCCEE T ss_conf 20147754655666421245178 No 21 >PRK10893 hypothetical protein; Provisional Probab=97.27 E-value=0.0098 Score=34.24 Aligned_cols=105 Identities=15% Similarity=0.222 Sum_probs=82.9 Q ss_pred EEECEEEEEECCCCCE-EEEEEEEEEECCCCCCCEEEEE-EEEEEEEECCCCEEEEEEEEEEEEEECCCCEEEEECCEEE Q ss_conf 8711089988499888-9999601133588987449999-5689998168845679999345899735646999777799 Q gi|254780746|r 67 IMKKFILSDYSKDRVK-YSLVAERAKTSFNSGKGIIFLQ-DFELTVPTQRSEYGDMYLFAHSARFNLANHTLYISQPFKM 144 (215) Q Consensus 67 ~m~~P~~~G~~~~g~~-~~itA~~A~~~~~~~~~~i~L~-~v~a~i~~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i 144 (215) ..++|.|.=++.++.+ |.|+|++|.=. . ++.+.|. +|.++-...++.. -.|+.+....|..++.+.-...|.| T Consensus 77 ~F~~Pvl~ly~~~~~~~W~VsA~~a~L~--~-d~~L~L~gnV~i~~L~~~s~i--q~I~T~~l~idL~tqd~~SD~~VtI 151 (190) T PRK10893 77 WFTQPVLTTFDKDKVPTWSVKADKAKLT--N-DRMLYLYGHVEVNALTPDSQL--RRITTDNAQINLVTQDVTSDDLVTL 151 (190) T ss_pred EEECCEEEEECCCCCCCEEEEECEEEEC--C-CCEEEEECCEEEEECCCCCCC--CEEECCCEEEECCCCEEECCCEEEE T ss_conf 9818659998899953279994567986--4-888999777999853897545--3898275898046646863870999 Q ss_pred EECCCCEEEEEEEEEEEEEEEEEECCCEEEEEC Q ss_conf 951573999821279972038995385899907 Q gi|254780746|r 145 KVKDNLRLDFETAVLDVKNITINSSDPVIITHS 177 (215) Q Consensus 145 ~~~~G~~l~t~~~~~d~~~~~i~s~~~V~~~~~ 177 (215) . ..|+..+...+..|+++..+..-+.|...-. T Consensus 152 ~-G~~f~s~G~gm~anL~~~~a~L~~~Vkt~YE 183 (190) T PRK10893 152 Y-GTTFNSSGLKMRGNLRSKNAELIEKVRTSYE 183 (190) T ss_pred E-CCCCEEEEEEEEEEHHHCEEEEHHHCCEEEE T ss_conf 8-7983998610899936588890450619996 No 22 >COG5375 Uncharacterized protein conserved in bacteria [Function unknown] Probab=93.89 E-value=0.41 Score=25.07 Aligned_cols=105 Identities=11% Similarity=0.027 Sum_probs=79.9 Q ss_pred EEEECEEEEEECCCCCEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEECCCCEEEEECCEEEE Q ss_conf 58711089988499888999960113358898744999956899981688456799993458997356469997777999 Q gi|254780746|r 66 MIMKKFILSDYSKDRVKYSLVAERAKTSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFAHSARFNLANHTLYISQPFKMK 145 (215) Q Consensus 66 ~~m~~P~~~G~~~~g~~~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i~ 145 (215) ++++.|.-.=--.+-++-.++|.+.+-|- +.++..|+++- ++.+.+| +.-..+++..|..++.+.=...|.+. T Consensus 109 i~Le~i~a~~pvg~~~~a~v~Ak~Gvyd~--~~ntL~l~~dv-~i~~~eG----i~a~~~sA~vDl~~G~~sg~~pv~i~ 181 (216) T COG5375 109 ITLEGILAILPVGEDNPAQVTAKSGVYDR--DTNTLDLTDDV-KITTDEG----IRADFQSAFVDLRTGNMSGDSPVDIS 181 (216) T ss_pred EEEEEEEEECCCCCCCCEEEEECCCEEEC--CCCEEECCCCE-EEECCHH----HHCCCCHHEEEECCCCCCCCCCEEEC T ss_conf 89960015524799973468712342666--78713246876-8853543----31356012466025610267863330 Q ss_pred ECCCCEEEEEEEEEEEEEEEEEECCCEEEEECC Q ss_conf 515739998212799720389953858999077 Q gi|254780746|r 146 VKDNLRLDFETAVLDVKNITINSSDPVIITHSD 178 (215) Q Consensus 146 ~~~G~~l~t~~~~~d~~~~~i~s~~~V~~~~~~ 178 (215) +.+| .+++++..+--+..+++..+.|...-.. T Consensus 182 ~~~g-sI~ads~~v~D~G~iivF~g~Vr~~id~ 213 (216) T COG5375 182 GKGG-SITADSYSVTDKGKIIVFEGNVRTRIDP 213 (216) T ss_pred CCCC-CEEEEEEEEECCCCEEEEECCEEEEECC T ss_conf 7998-4876136752288489995656898447 No 23 >COG3008 PqiB Paraquat-inducible protein B [General function prediction only] Probab=87.66 E-value=1.2 Score=22.40 Aligned_cols=64 Identities=11% Similarity=0.060 Sum_probs=34.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECEEEEEE----------CCCCCEEEEEEEE Q ss_conf 999999999999999876420034444454431345543058711089988----------4998889999601 Q gi|254780746|r 26 FLKFFFPFATVTIVGWFLLSSWIRISTFSKSSIDLLDFEPMIMKKFILSDY----------SKDRVKYSLVAER 89 (215) Q Consensus 26 ~lKi~Lpl~al~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~P~~~G~----------~~~g~~~~itA~~ 89 (215) ..-|+||++|+++.+|+++..+...-..-...+...++-.-.=+.-+|.|+ ++|-+..+++|+- T Consensus 21 SpiWllPivAl~igawL~~~~~~~~G~~Itl~f~saeGIeaGKT~Iry~gvdVG~V~~v~LsdDl~~V~V~a~i 94 (553) T COG3008 21 SPIWLLPIVALLIGAWLLFQHVQDRGPEITLTFESAEGIEAGKTPIRYQGVDVGVVTDIKLSDDLKGVEVTAQI 94 (553) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCEEEECCEEEEEEEEEEECCCCCCEEEEEEE T ss_conf 80689999999999999999997049759999547666214860588513136678898853676616999996 No 24 >PRK10807 paraquat-inducible protein B; Provisional Probab=87.16 E-value=1.3 Score=22.31 Aligned_cols=23 Identities=26% Similarity=0.592 Sum_probs=19.1 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99999999999999876420034 Q gi|254780746|r 27 LKFFFPFATVTIVGWFLLSSWIR 49 (215) Q Consensus 27 lKi~Lpl~al~i~~~~~~~~~~~ 49 (215) .=|++|++|+++.+|+++..+.. T Consensus 18 ~iWlvPivA~~ig~wl~~~~~~~ 40 (546) T PRK10807 18 PVWIFPIVTALIGAWILFYHYSH 40 (546) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 58999999999999999999982 No 25 >TIGR02099 TIGR02099 conserved hypothetical protein TIGR02099; InterPro: IPR011836 This entry describes a family of long proteins, over 1250 amino acids in length and present in the Proteobacteria. The degree of sequence similarity is low between sequences from different genera. Apparent membrane-spanning regions at the N-terminus and C-terminus suggest the protein is inserted into (or exported through) the membrane.. Probab=80.69 E-value=3.8 Score=19.62 Aligned_cols=100 Identities=12% Similarity=0.132 Sum_probs=59.3 Q ss_pred CCCEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEE--EEEE------ECCC--CEEEEE-CCEEEEEC Q ss_conf 98889999601133588987449999568999816884567999934--5899------7356--469997-77799951 Q gi|254780746|r 79 DRVKYSLVAERAKTSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFAH--SARF------NLAN--HTLYIS-QPFKMKVK 147 (215) Q Consensus 79 ~g~~~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A~--~G~y------~~~~--~~l~L~-g~V~i~~~ 147 (215) .+.+|.|.++...+..+...++-+|.+ ++++.+.+|.- ..|... -|.. |.-- =.++|+ |||- + T Consensus 1171 ~~a~f~~~~~~~W~G~P~~~~l~~L~G-~l~~~L~kG~f--~~V~~Ga~AaRlL~LlSl~sL~Rkl~LDF~PGD~F---~ 1244 (1362) T TIGR02099 1171 RNASFELNADLSWRGSPWSFQLATLSG-KLKVDLGKGQF--LDVSGGAKAARLLGLLSLDSLLRKLRLDFSPGDVF---S 1244 (1362) T ss_pred CCCEEEEEEEEECCCCCCCCHHHHCCC-EEEEEECCCEE--EEECCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC---C T ss_conf 441059998863599861111233352-69999878089--97668435778888851888887985248888856---6 Q ss_pred CCCEEEEEEEEEEEEEEEEEECCCEEEEECCCEEEEE Q ss_conf 5739998212799720389953858999077189970 Q gi|254780746|r 148 DNLRLDFETAVLDVKNITINSSDPVIITHSDFVLSAN 184 (215) Q Consensus 148 ~G~~l~t~~~~~d~~~~~i~s~~~V~~~~~~g~i~A~ 184 (215) +|+-|.+=.-...+++|.++|++|+++.+|-+.|.=. T Consensus 1245 ~G~aFD~i~G~~~~~~GV~~T~~P~~~~g~~a~i~~~ 1281 (1362) T TIGR02099 1245 KGFAFDSIRGTASLSDGVLRTDDPIKMDGPAADIAIR 1281 (1362) T ss_pred CCCCCCEEEEEEEEECCEEEECCCEEEEEEEECEEEE T ss_conf 7866521579899965689807867998776163873 No 26 >COG4384 Mu-like prophage protein gp45 [Function unknown] Probab=62.86 E-value=10 Score=17.15 Aligned_cols=74 Identities=11% Similarity=0.168 Sum_probs=44.3 Q ss_pred EEEEEEEECCCCEEEEECCEEEEECCCCEEEEEEEEEEEEEEEEEECCCEEEE----ECCCEEEEEEEEEEECCCEEEEE Q ss_conf 93458997356469997777999515739998212799720389953858999----07718997079999278689998 Q gi|254780746|r 123 FAHSARFNLANHTLYISQPFKMKVKDNLRLDFETAVLDVKNITINSSDPVIIT----HSDFVLSANFARIENSSRSAVFA 198 (215) Q Consensus 123 ~A~~G~y~~~~~~l~L~g~V~i~~~~G~~l~t~~~~~d~~~~~i~s~~~V~~~----~~~g~i~A~~m~i~~~~~~i~F~ 198 (215) ..+.++|+.....+.|..+..|.-+.. .+ .++- ...+..+-|+... .|.+.+.+++-.--..+....|+ T Consensus 98 aGeT~iY~~eG~~i~Lteg~~Ie~~ck-~~-----~v~a-~~~v~v~Ap~v~~t~~~t~~~~i~GN~~~~v~g~S~aTft 170 (203) T COG4384 98 AGETVIYNHEGAKIVLTEGGIIEADCK-TL-----TVNA-ATGVKVNAPLVEFTAVLTPQGQINGNGGMAVEGGSGATFT 170 (203) T ss_pred CCCEEEEECCCCEEEECCCCEEEEECC-EE-----EEEC-CCCEEECCCEEEEEEEECCCEEECCCCEEEECCCCCCCEE T ss_conf 774688822574899725968998561-78-----9745-8852663641588777555438816511562257753064 Q ss_pred CCEEE Q ss_conf 89699 Q gi|254780746|r 199 GQVSV 203 (215) Q Consensus 199 g~V~l 203 (215) +.+-+ T Consensus 171 ~~~~~ 175 (203) T COG4384 171 GSVNQ 175 (203) T ss_pred CCHHH T ss_conf 23232 No 27 >KOG0860 consensus Probab=52.81 E-value=15 Score=16.16 Aligned_cols=24 Identities=21% Similarity=0.411 Sum_probs=13.0 Q ss_pred CCHH-HHCCCCHHHHHCCCHHHHHH Q ss_conf 6314-54112023552010368899 Q gi|254780746|r 2 DHLD-RRNKILLQRSKYRQHVRFIQ 25 (215) Q Consensus 2 ~~~~-rr~~~~~~~~~~~r~Sr~V~ 25 (215) ++|+ |...+-...+.+++.++.++ T Consensus 60 ~~L~drad~L~~~as~F~~~A~klk 84 (116) T KOG0860 60 DELDDRADQLQAGASQFEKTAVKLK 84 (116) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7899899999999999999999999 No 28 >PRK01904 hypothetical protein; Provisional Probab=45.51 E-value=20 Score=15.49 Aligned_cols=96 Identities=14% Similarity=0.049 Sum_probs=40.8 Q ss_pred CCCCCCCCCCCCCCEEEEECEEEEEEC----CCC--CEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEE Q ss_conf 444544313455430587110899884----998--88999960113358898744999956899981688456799993 Q gi|254780746|r 51 STFSKSSIDLLDFEPMIMKKFILSDYS----KDR--VKYSLVAERAKTSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFA 124 (215) Q Consensus 51 ~~~~~~~~~~~~~~~~~m~~P~~~G~~----~~g--~~~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A 124 (215) ..+..+.+-..+++++ ....|.+.+ .+| +.+.++=..-+++. .+...+.=.-+-+++.. ....+.+.. T Consensus 25 ~lp~~ielL~vdG~k~--~~slf~~~~sl~L~~G~~HQIvfRy~~~v~~~-~~~~~~~S~p~IvtF~a---~~~~l~l~~ 98 (217) T PRK01904 25 TTSSNVEFLAIDGQKA--SKSLLGKAKSFNVDDTQVHQVVVRVSEIVRSG-SDRSLFESDPIIVTFQG---TTEDIVISA 98 (217) T ss_pred CCCCCCEEEEECCEEC--CCCCCCCCCCEECCCCCCEEEEEEEEEEECCC-CCEEEEECCCEEEEEEC---CCEEEEEEC T ss_conf 0788713899768277--76545677615637997048999980267479-96448956889999955---881899967 Q ss_pred EEEEEECCCCEEEEECCEEEEECCCCEE Q ss_conf 4589973564699977779995157399 Q gi|254780746|r 125 HSARFNLANHTLYISQPFKMKVKDNLRL 152 (215) Q Consensus 125 ~~G~y~~~~~~l~L~g~V~i~~~~G~~l 152 (215) ....-..+.+...=+..+.+...+|-.+ T Consensus 99 P~~~~~~~ak~F~~~P~~~L~d~~g~~V 126 (217) T PRK01904 99 PKLTNERDVDKFKKSPNITVKTASGKEI 126 (217) T ss_pred CCCCCHHHHHHHHHCCCEEEEECCCCEE T ss_conf 9878889999875099289995899888 No 29 >pfam05449 DUF754 Protein of unknown function (DUF754). This domain appears to be found in a group of prophage proteins. Probab=42.53 E-value=22 Score=15.25 Aligned_cols=31 Identities=16% Similarity=0.105 Sum_probs=21.8 Q ss_pred CHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0235520103688999999999999999998 Q gi|254780746|r 11 LLQRSKYRQHVRFIQFLKFFFPFATVTIVGW 41 (215) Q Consensus 11 ~~~~~~~~r~Sr~V~~lKi~Lpl~al~i~~~ 41 (215) +.=|....||.+.++|+-++|.+.+...-.- T Consensus 15 ~~yrR~garhr~~~swlAyll~~~~~~~pi~ 45 (83) T pfam05449 15 LTYRRNGARHRPGVSWLAYLLIVAAGSQPIR 45 (83) T ss_pred HEEECCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 4045389654745789999999999888999 No 30 >PRK04517 hypothetical protein; Provisional Probab=39.28 E-value=26 Score=14.93 Aligned_cols=49 Identities=6% Similarity=0.002 Sum_probs=20.6 Q ss_pred HHHHHHHHHHHHHHH-HHHHCCCCCCCCCCCCCCCCCCCEEEEECEEEEE Q ss_conf 999999999999998-7642003444445443134554305871108998 Q gi|254780746|r 27 LKFFFPFATVTIVGW-FLLSSWIRISTFSKSSIDLLDFEPMIMKKFILSD 75 (215) Q Consensus 27 lKi~Lpl~al~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~m~~P~~~G 75 (215) ||++.++.+++.+++ ...+.-.....+....+-.+++++..+.+-.|++ T Consensus 1 MK~~~~~~~ll~l~~s~sa~A~vtL~lp~~ielLaVnG~k~~~sg~l~~~ 50 (216) T PRK04517 1 MKLIKPLTCLLALAMSGSAFADVTVEVPDDVDLLVANGEKPKLSGGFFAS 50 (216) T ss_pred CCCHHHHHHHHHHHHHHHHHHHCEEECCCCEEEEEECCEECCCCCCCCCC T ss_conf 96027999999999713553320783699777999889733456653357 No 31 >pfam05137 PilN Fimbrial assembly protein (PilN). Probab=31.95 E-value=34 Score=14.24 Aligned_cols=20 Identities=5% Similarity=0.041 Sum_probs=11.4 Q ss_pred CEEEEEECCCCCEEEEEEEE Q ss_conf 10899884998889999601 Q gi|254780746|r 70 KFILSDYSKDRVKYSLVAER 89 (215) Q Consensus 70 ~P~~~G~~~~g~~~~itA~~ 89 (215) +-.++..+.+|..+.|++-. T Consensus 109 gvwLt~l~~~~~~l~i~G~a 128 (162) T pfam05137 109 GVYLTKIKMEGDQLTISGFS 128 (162) T ss_pred CEEEEEEEECCCEEEEEEEE T ss_conf 75877888729889999888 No 32 >pfam11696 DUF3292 Protein of unknown function (DUF3292). This eukaryotic family of proteins has no known function. Probab=30.01 E-value=37 Score=14.05 Aligned_cols=20 Identities=10% Similarity=-0.099 Sum_probs=10.6 Q ss_pred CEEEEEEEEEE--ECCCEEEEE Q ss_conf 18997079999--278689998 Q gi|254780746|r 179 FVLSANFARIE--NSSRSAVFA 198 (215) Q Consensus 179 g~i~A~~m~i~--~~~~~i~F~ 198 (215) +.=.+||+.|. +.++.+.++ T Consensus 597 ~kEv~DGL~I~g~~~~q~y~lT 618 (641) T pfam11696 597 SKEVVDGLVISGDEPEQSYQLT 618 (641) T ss_pred CCCCCCCEEEEECCCCCEEEEE T ss_conf 7410256599505788679998 No 33 >pfam03961 DUF342 Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length. Probab=27.01 E-value=41 Score=13.74 Aligned_cols=106 Identities=8% Similarity=0.093 Sum_probs=60.7 Q ss_pred CCCEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEECCCCEEEEECCEEEEE--CCCCEEEEE- Q ss_conf 98889999601133588987449999568999816884567999934589973564699977779995--157399982- Q gi|254780746|r 79 DRVKYSLVAERAKTSFNSGKGIIFLQDFELTVPTQRSEYGDMYLFAHSARFNLANHTLYISQPFKMKV--KDNLRLDFE- 155 (215) Q Consensus 79 ~g~~~~itA~~A~~~~~~~~~~i~L~~v~a~i~~~~g~~~~i~i~A~~G~y~~~~~~l~L~g~V~i~~--~~G~~l~t~- 155 (215) .|+.+.+.+.+-+.-.+.++. ...-.+.+.....++...-.-+---.|..|.++..+.|.|+|.|.- .+||...+. T Consensus 138 ~g~~~~~~~g~~~~~~~eDg~-~liA~~~G~~~~~~~~i~V~~vl~v~gdVd~~TGNI~F~G~V~I~G~V~~Gf~V~A~g 216 (450) T pfam03961 138 PGKDLPLKAGKGTELSEEDGN-KLIATIDGQPVVVKGGISVDDVLEIKGDVDLSTGNIDFKGSVIIRGDVEDGMKVKATG 216 (450) T ss_pred CCCCCCCCCCCCEEEECCCCC-EEEEECCCEEEEECCEEEEEEEEEECCCCCCCCCCEEEEEEEEEEEEECCCCEEEECC T ss_conf 987664307998398269998-9999549779998998999998997687787734467777799974728997899486 Q ss_pred EEEEE--EEEEEEEECCCEEEEEC-----CCEEEEEE Q ss_conf 12799--72038995385899907-----71899707 Q gi|254780746|r 156 TAVLD--VKNITINSSDPVIITHS-----DFVLSANF 185 (215) Q Consensus 156 ~~~~d--~~~~~i~s~~~V~~~~~-----~g~i~A~~ 185 (215) .+.+. .....+.+.+.+.+.+. .+.|.|++ T Consensus 217 dI~V~G~Ve~A~I~AgG~I~i~gGi~G~~~~~I~Agg 253 (450) T pfam03961 217 DITVGGFVESATIEAGGDIIVKGGIIGRGKGKIKAGG 253 (450) T ss_pred CEEEEEEEEEEEEEECCCEEEECCEECCCEEEEEECC T ss_conf 2899558978899979988995866618806999898 No 34 >TIGR01000 bacteriocin_acc bacteriocin secretion accessory protein; InterPro: IPR005696 Bacteriocins are antibacterial proteinaceous compounds produced by bacteria. In Gram-positive bacteria, they are divided into four classes. Within the class II, constituted by non-modified peptides produced mainly by lactic acid bacteria, bacteriocins of the subclass IIa, such as mesentericin Y105, are of particular interest. They are active against the foodborne pathogen Listeria monocytogenes and share a similar primary structure, with a conserved N-terminal motif (YGNGV). Subclass IIa bacteriocins induce membrane permeabilization of sensitive strains, but their target specificity and their molecular mode of action remain elusive. This family of proteins is involved in the secretion of such bacteriocins although the mechanism of this process is not well understood. ; GO: 0005215 transporter activity, 0006810 transport. Probab=26.47 E-value=42 Score=13.68 Aligned_cols=22 Identities=9% Similarity=0.078 Sum_probs=9.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 103688999999999999999998 Q gi|254780746|r 18 RQHVRFIQFLKFFFPFATVTIVGW 41 (215) Q Consensus 18 ~r~Sr~V~~lKi~Lpl~al~i~~~ 41 (215) +||..|= .-+++|++-|+++++ T Consensus 15 ~Ry~NFs--~l~I~P~~lLLv~~~ 36 (476) T TIGR01000 15 KRYHNFS--TLVIVPIFLLLVFLV 36 (476) T ss_pred CHHHCCH--HHHHHHHHHHHHHHH T ss_conf 0211012--689999999999999 No 35 >COG3088 CcmH Uncharacterized protein involved in biosynthesis of c-type cytochromes [Posttranslational modification, protein turnover, chaperones] Probab=22.11 E-value=52 Score=13.20 Aligned_cols=25 Identities=20% Similarity=0.280 Sum_probs=17.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999999999999999876420034 Q gi|254780746|r 25 QFLKFFFPFATVTIVGWFLLSSWIR 49 (215) Q Consensus 25 ~~lKi~Lpl~al~i~~~~~~~~~~~ 49 (215) -++-+++|+..+++.+++++..... T Consensus 105 T~lLW~~Pv~llllG~~~~~~~~rr 129 (153) T COG3088 105 TLLLWGLPVVLLLLGGVLLVRRARR 129 (153) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8999995999999999999999765 No 36 >TIGR01433 CyoA ubiquinol oxidase, subunit II; InterPro: IPR006333 Members of this domain catalyze the oxidation of ubiquinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. Subunit II is responsible for binding and oxidation of the ubiquinone substrate. This sequence is closely related to QoxA, which oxidizes quinol in Gram-positive bacteria but which is in complex with subunits which utilise cytochrome a in the reduction of molecular oxygen. Slightly more distantly related is subunit II of cytochrome c oxidase which uses cytochrome c as the oxidant . ; GO: 0016682 oxidoreductase activity acting on diphenols and related substances as donors oxygen as acceptor, 0006118 electron transport, 0016020 membrane. Probab=22.03 E-value=29 Score=14.57 Aligned_cols=51 Identities=14% Similarity=0.203 Sum_probs=30.4 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHC--CCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 0368899999999999999999876420--03444445443134554305871 Q gi|254780746|r 19 QHVRFIQFLKFFFPFATVTIVGWFLLSS--WIRISTFSKSSIDLLDFEPMIMK 69 (215) Q Consensus 19 r~Sr~V~~lKi~Lpl~al~i~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~m~ 69 (215) .||.+|=..=+..|.+.+++++++.|.+ ..+...|......+++.|=+.|. T Consensus 72 ~HS~kiE~VVW~iP~~Ii~fLg~ltW~~THkLDP~rPL~s~~kPI~iEVVAlD 124 (228) T TIGR01433 72 HHSTKIEIVVWTIPILIIVFLGVLTWISTHKLDPYRPLASDVKPITIEVVALD 124 (228) T ss_pred CCCCEEEEEEHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCEEEEEEEEC T ss_conf 66777786021234888878778531000145888764245895279998706 No 37 >pfam06890 Phage_Mu_Gp45 Bacteriophage Mu Gp45 protein. This family consists of Bacteriophage Mu Gp45 related proteins from both phages and bacteria. The function of this family is unknown although it has been suggested that family members may be involved in baseplate assembly. Probab=21.95 E-value=52 Score=13.18 Aligned_cols=18 Identities=11% Similarity=0.255 Sum_probs=11.0 Q ss_pred EEEEEEECCCCEEEEECC Q ss_conf 345899735646999777 Q gi|254780746|r 124 AHSARFNLANHTLYISQP 141 (215) Q Consensus 124 A~~G~y~~~~~~l~L~g~ 141 (215) .+-+.|+.+-+.+.|..+ T Consensus 78 GEvalY~~~G~~I~Lt~~ 95 (162) T pfam06890 78 GDVALYHDEGHRVRLTRD 95 (162) T ss_pred CCEEEEECCCCEEEEEEC T ss_conf 858998068999999018 No 38 >pfam05568 ASFV_J13L African swine fever virus J13L protein. This family consists of several African swine fever virus J13L proteins. Probab=21.16 E-value=54 Score=13.08 Aligned_cols=14 Identities=7% Similarity=-0.012 Sum_probs=7.6 Q ss_pred CCCCCCEEEEECEE Q ss_conf 34554305871108 Q gi|254780746|r 59 DLLDFEPMIMKKFI 72 (215) Q Consensus 59 ~~~~~~~~~m~~P~ 72 (215) ..++++.+...||. T Consensus 61 aaieeediqfinpy 74 (189) T pfam05568 61 AAIEEEDIQFINPY 74 (189) T ss_pred HHHHHHCCEEECCC T ss_conf 32004203020733 No 39 >pfam12166 DUF3595 Protein of unknown function (DUF3595). This family of proteins is functionally uncharacterized.This family of proteins is found in eukaryotes. Proteins in this family are typically between 578 and 2525 amino acids in length. Probab=20.39 E-value=56 Score=12.99 Aligned_cols=16 Identities=0% Similarity=-0.151 Sum_probs=8.8 Q ss_pred CEEEEEECCCCCEEEE Q ss_conf 1089988499888999 Q gi|254780746|r 70 KFILSDYSKDRVKYSL 85 (215) Q Consensus 70 ~P~~~G~~~~g~~~~i 85 (215) .|.|+....++.-..+ T Consensus 110 ~pif~~~a~~~~~~~~ 125 (416) T pfam12166 110 GPLYTMTAQSSEVTLI 125 (416) T ss_pred CCCEEEEECCCCEECC T ss_conf 6521476036720126 Done!