Query gi|254780395|ref|YP_003064808.1| organic solvent tolerance protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 762 No_of_seqs 218 out of 1458 Neff 8.8 Searched_HMMs 39220 Date Sun May 29 16:05:27 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780395.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK04423 organic solvent toler 100.0 0 0 977.0 72.0 733 1-760 1-762 (798) 2 PRK03761 organic solvent toler 100.0 0 0 964.0 79.2 717 1-757 1-763 (782) 3 COG1452 Imp Organic solvent to 100.0 0 0 848.1 54.7 676 48-756 48-764 (784) 4 pfam04453 OstA_C Organic solve 100.0 0 0 394.0 43.8 371 301-697 1-382 (382) 5 COG1452 Imp Organic solvent to 99.3 1.2E-07 3.2E-12 68.0 40.9 179 52-246 82-270 (784) 6 PRK04423 organic solvent toler 99.3 2.1E-07 5.3E-12 66.5 55.5 196 51-264 86-289 (798) 7 pfam03968 OstA OstA-like prote 99.2 3.4E-09 8.6E-14 78.5 17.6 60 47-106 23-87 (141) 8 PRK10894 hypothetical protein; 99.2 1.3E-09 3.2E-14 81.4 15.3 83 46-128 30-122 (184) 9 COG1934 Uncharacterized protei 99.1 3.2E-08 8.3E-13 71.9 16.8 83 48-131 29-121 (173) 10 PRK03761 organic solvent toler 99.0 2E-06 5E-11 60.0 63.4 115 50-165 78-204 (782) 11 TIGR03002 outer_YhbN cell enve 99.0 1.2E-07 3.2E-12 68.0 16.2 102 46-149 4-119 (150) 12 COG1934 Uncharacterized protei 98.5 7.8E-06 2E-10 56.0 13.6 83 47-130 57-149 (173) 13 pfam06835 DUF1239 Protein of u 98.3 3E-05 7.6E-10 52.1 13.0 108 50-158 51-168 (176) 14 pfam03968 OstA OstA-like prote 98.3 2.5E-05 6.3E-10 52.7 12.3 76 49-125 54-140 (141) 15 PRK10894 hypothetical protein; 98.1 4.3E-05 1.1E-09 51.0 9.5 81 48-129 61-152 (184) 16 TIGR03002 outer_YhbN cell enve 98.0 3E-05 7.7E-10 52.1 7.5 109 79-198 8-131 (150) 17 pfam06835 DUF1239 Protein of u 97.6 0.0053 1.4E-07 37.1 19.6 80 49-129 83-167 (176) 18 COG5375 Uncharacterized protei 97.3 0.0053 1.4E-07 37.1 10.9 120 46-167 86-213 (216) 19 PRK10893 hypothetical protein; 96.8 0.037 9.5E-07 31.4 14.2 115 52-168 60-185 (190) 20 cd07306 Porin3_VDAC Voltage-de 96.7 0.041 1E-06 31.1 18.5 104 620-734 130-244 (276) 21 pfam01459 Porin_3 Eukaryotic p 96.6 0.051 1.3E-06 30.5 17.5 117 620-749 132-259 (272) 22 PRK10716 long-chain fatty acid 92.1 0.68 1.7E-05 23.0 18.3 51 681-731 381-433 (437) 23 PRK09980 ompL outer membrane p 90.9 0.89 2.3E-05 22.2 12.0 99 623-721 108-229 (230) 24 KOG3126 consensus 86.0 1.9 4.8E-05 20.0 13.5 47 621-667 136-182 (281) 25 TIGR03509 OMP_MtrB_PioB decahe 84.8 2.1 5.4E-05 19.7 18.6 26 681-706 581-606 (649) 26 TIGR03014 EpsL exopolysacchari 83.5 2.4 6.2E-05 19.3 15.8 147 565-731 214-379 (381) 27 pfam05096 Glu_cyclase_2 Glutam 81.3 2.9 7.4E-05 18.8 8.0 44 89-133 72-118 (264) 28 pfam06178 KdgM Oligogalacturon 81.1 3 7.5E-05 18.7 21.7 75 647-721 133-217 (218) 29 COG3117 Uncharacterized protei 75.8 4.2 0.00011 17.7 11.7 82 52-134 60-150 (188) 30 TIGR01778 TonB-copper TonB-dep 74.5 2.7 6.8E-05 19.0 2.6 52 141-196 34-92 (668) 31 pfam02462 Opacity Opacity fami 73.0 4.9 0.00013 17.2 9.2 38 684-721 81-125 (126) 32 pfam12600 DUF3769 Protein of u 70.6 5.5 0.00014 16.9 24.4 164 140-328 8-177 (403) 33 pfam00267 Porin_1 Gram-negativ 69.0 6 0.00015 16.7 11.0 81 650-731 247-332 (334) 34 PRK10554 outer membrane porin 67.4 6.4 0.00016 16.5 14.1 85 647-732 278-372 (373) 35 PRK10159 outer membrane phosph 66.1 6.8 0.00017 16.3 9.9 80 649-731 266-349 (351) 36 PRK09840 catecholate sideropho 63.4 7.6 0.00019 16.0 16.0 18 459-476 459-476 (760) 37 PRK10808 outer membrane protei 61.6 8.2 0.00021 15.7 15.9 17 311-327 153-169 (347) 38 pfam12094 DUF3570 Protein of u 59.7 8.8 0.00022 15.5 19.2 28 561-588 345-372 (407) 39 pfam04338 DUF481 Protein of un 55.1 10 0.00027 15.0 20.6 41 552-595 75-115 (210) 40 PRK10523 lipoprotein involved 54.7 11 0.00027 15.0 7.2 106 49-169 58-169 (234) 41 pfam05326 SVA Seminal vesicle 52.8 11 0.00028 14.9 2.3 27 158-186 80-106 (124) 42 pfam11751 DUF3308 Protein of u 49.9 13 0.00032 14.5 12.5 73 624-707 184-257 (274) 43 PRK04922 tolB translocation pr 48.2 11 0.00027 15.0 1.7 25 1-25 1-25 (439) 44 PRK11433 putative xanthine deh 45.8 15 0.00037 14.1 3.0 19 1-19 10-28 (217) 45 PRK10002 outer membrane protei 45.8 15 0.00037 14.1 14.6 82 647-731 275-360 (362) 46 COG4313 Protein involved in me 44.7 15 0.00039 14.0 9.2 65 685-749 219-302 (304) 47 cd07303 Porin3 Eukaryotic pori 44.3 15 0.00039 13.9 16.6 84 629-722 139-222 (274) 48 PRK13524 outer membrane recept 43.4 16 0.0004 13.8 16.7 15 460-474 446-460 (742) 49 pfam11854 DUF3374 Protein of u 43.2 16 0.00041 13.8 25.0 25 682-706 609-633 (675) 50 pfam11231 DUF3034 Protein of u 42.3 16 0.00042 13.7 8.3 24 683-706 182-205 (257) 51 PRK10049 pgaA outer membrane p 42.3 16 0.00042 13.7 12.3 16 564-579 654-669 (818) 52 PRK10695 hypothetical protein; 42.2 16 0.00042 13.7 5.8 41 53-93 56-97 (876) 53 pfam09694 Gcw_chp Bacterial pr 41.3 17 0.00043 13.6 12.1 52 682-733 137-196 (226) 54 pfam11924 DUF3442 Protein of u 38.8 19 0.00047 13.4 16.1 21 621-641 211-231 (280) 55 COG5338 Uncharacterized protei 38.2 19 0.00048 13.3 5.2 28 557-588 299-326 (468) 56 PRK11447 cellulose synthase su 36.9 20 0.00051 13.2 7.6 41 685-725 1099-1141(1158) 57 PRK09501 potD spermidine/putre 36.6 19 0.00048 13.3 1.4 19 1-19 1-19 (348) 58 pfam04245 NA37 37-kD nucleoid- 35.6 14 0.00036 14.2 0.7 10 266-275 103-112 (334) 59 COG3056 Uncharacterized lipopr 35.2 21 0.00054 13.0 2.9 27 6-32 12-39 (204) 60 PRK00378 nucleoid-associated p 34.0 16 0.0004 13.8 0.7 13 264-276 98-110 (334) 61 pfam11276 DUF3078 Protein of u 33.9 22 0.00056 12.9 5.5 28 679-706 58-85 (92) 62 PRK11667 hypothetical protein; 33.8 22 0.00056 12.9 2.0 23 1-23 1-23 (162) 63 smart00869 Autotransporter Aut 30.2 25 0.00065 12.5 11.1 84 442-528 117-201 (261) 64 TIGR03016 pepcterm_hypo_1 unch 29.8 26 0.00066 12.4 19.4 66 684-749 347-429 (431) 65 TIGR03519 Bac_Flav_fam_1 Bacte 29.4 26 0.00067 12.4 13.8 73 624-707 200-273 (292) 66 PRK13874 conjugal transfer pro 28.5 27 0.00069 12.3 2.3 28 12-39 8-35 (233) 67 COG3637 Opacity protein and re 25.9 30 0.00077 12.0 5.2 23 684-706 148-170 (199) 68 COG2067 FadL Long-chain fatty 24.4 32 0.00082 11.8 16.0 24 684-707 382-405 (440) 69 pfam01389 OmpA_membrane OmpA-l 21.9 36 0.00091 11.5 12.1 51 684-734 78-148 (175) 70 pfam04234 CopC Copper resistan 21.2 37 0.00094 11.4 5.7 35 6-42 2-36 (120) 71 PRK00022 lolB outer membrane l 21.0 37 0.00095 11.4 4.6 20 68-87 49-68 (203) 72 pfam04420 CHD5 CHD5-like prote 20.8 37 0.00093 11.4 0.6 21 200-220 114-134 (159) 73 PRK10301 hypothetical protein; 20.8 38 0.00096 11.3 6.5 39 1-42 1-39 (124) 74 COG3823 Glutamine cyclotransfe 20.7 38 0.00096 11.3 7.0 45 89-133 72-119 (262) No 1 >PRK04423 organic solvent tolerance protein; Provisional Probab=100.00 E-value=0 Score=977.02 Aligned_cols=733 Identities=17% Similarity=0.203 Sum_probs=590.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC-CCCCC---------CCCCCCCCCCEEEEEEEEEEEECCCCEEEE Q ss_conf 95331024767899999999876324510025556-78654---------566677666607999207999766897999 Q gi|254780395|r 1 MRYNYRHLIGATTFFLAFNIITPQGTPPSNEQTTT-STPSK---------IKKNETNRHSELDISSDEIVLNSEGSTTTA 70 (762) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~---------~~~~~~~~~~~~~~~ad~~~~~~~~~~~~a 70 (762) ||+.-|.| +++|++|..+-+.++.+|..-..++ ..|++ ++..+.....|+.++||+|+...+ +-+. T Consensus 1 ~~~~~~~~--~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~aD~~~~~~~--~~~f 76 (798) T PRK04423 1 MRRALRLL--PLPLSIAICLPAMADEKPLNWGLCPATDPLPGFDGAPAADPKAAEMRQQLPTDIEGDQLSGTST--TPQY 76 (798) T ss_pred CCCHHHHH--HHHHHHHCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCC--CCEE T ss_conf 96202013--2354241004401210467756687677887656787655543444556985887640235577--5179 Q ss_pred EEEEEEEECCEEEEEEEEEEECCCCEEEEECCEEEECCCCCEEEEEEEEEECCCCEEEEEEEEEEEECCCEEEEEEEEEE Q ss_conf 98799998996999518999863787999734899838987899706999430255999847999706750210246872 Q gi|254780395|r 71 VGNVKIEYKGYHLSARDITFNHKNHRIIASGNIKLIEPDKRQIHAEYLDITDDFTNGIIKNLTIKIPADETYLTASSAQI 150 (762) Q Consensus 71 ~G~v~i~~~~~~l~Ad~i~y~~~~~~~~a~G~v~~~~~~g~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 150 (762) +|||+|.+++.+|+||+|+|||+++.|.|+|||++.++ |..+.|+++++++++++|.+.++++.+.+.+.++.|+.+++ T Consensus 77 ~GnV~i~qg~~~l~AD~v~ydq~~~~v~A~GnV~~~~~-~~~~~g~~~~~~l~~~~g~~~~~~y~l~~~~~~G~A~~~~~ 155 (798) T PRK04423 77 QGNVALKRGDQFLGADNLRMDTETGNYIAEGNVRYQDT-SIRMVADRAEGNQDTDTHKITNIQYQLVERRGNGDAESVDL 155 (798) T ss_pred EEEEEEEECCEEEEEEEEEEECCCCEEEEECCEEEECC-CCEEECCEEEEECCCCEEEEECCEEEEEECCCCEEEEEEEE T ss_conf 85599997998999117999867898999777899679-85898142899826764799667899994677236677674 Q ss_pred CCCEEEEEEEEEEECCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEECCEEEECCEEEEECCCEECCCCCCCCCCCCCCE Q ss_conf 39759999879973137665778885089996699996777779994428998270686325400124555454541000 Q gi|254780395|r 151 IGQRTIFDKGTYTACSSCSKPNSRPPFWIVKSKRAILNRKTHTIRLEKPYLEIFGNSIFYFPLIEIPDETVVRKTGFLTP 230 (762) Q Consensus 151 ~~~~~~~~~~~~T~C~~~~~~~~~~P~W~i~a~~i~~d~~~~~~~~~~a~l~i~~vPv~y~Py~~~P~~~~~R~sGfL~P 230 (762) .++..++++|+||||+|++ |+|+|+|++|++|.++++++++||+|+|+||||||+|||.||+ +.+||||||+| T Consensus 156 ~g~~~~l~~~~~ttC~~~~------~~W~~~a~~i~~d~~~~~~~~~~a~~~~~~vPv~y~Py~~fp~-~~~R~sG~L~P 228 (798) T PRK04423 156 QGQVGQMHRSTYTTCDPSQ------PIWRLRAPEIDVDNDEGFGTARNAVLQIGKVPVLYFPWFKFPI-DDRRQTGLLFP 228 (798) T ss_pred CCCEEEEECCEECCCCCCC------CCEEEEECEEEEECCCCEEEEEEEEEEECCEEEEECCCEECCC-CCCCCCEEECC T ss_conf 4864799571983289889------8359994579980778889999459999989179347684526-99843243646 Q ss_pred EEECCCCCCCCEECEEEEECCCCCCEEEEEEECCCCEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCCCEEEEEEECC Q ss_conf 56413333430001034302465201455420243101100134553025607999987505556765432011232021 Q gi|254780395|r 231 LFSSGEKQRFGVGIPYYLVISDNSDATFTFSPHPKKGILGEMELRKYFHSGKHTLHAAYMYNNNVESGEERHQAMLASIA 310 (762) Q Consensus 231 ~~~~s~~~G~~~~~Pyy~~i~~~~D~Tltp~~~s~rG~~l~~eyRy~~~~g~~~~~~~~~~~d~~~~~~~~~r~~~~~~~ 310 (762) ++|+|+++|+++.+|||||||||+|+||+|++|++||+++++||||+.+.+.+++.++|+++|+... ++ ||.+..+| T Consensus 229 ~~~~s~~~G~~~~~PyY~niapn~D~T~tpr~ms~RG~~l~~EfRYL~~~~~G~~~~~yLp~D~~~~-~~--r~~~~~~~ 305 (798) T PRK04423 229 QLGLSGRNGFDYAQPIYLNLAPNYDATLLPRYMSKRGFMFDTEFRYLYDGGRGEVTGNFLPNDKLRD-KD--RGRVFFSG 305 (798) T ss_pred CCCCCCCCCEEEEEEEEEECCCCCCCEEEHHHHHHCCCCCCCEEEECCCCCCCEEEEEECCCCCCCC-CC--CCCEEEEE T ss_conf 6641489976999999885277644187011442126310325897267887418644447753454-44--54044522 Q ss_pred EECCEEEEEEEEEEEEEECCHHHHCCCCCCC---CCCEEEEEEEECCCCCCEEEEEEEEEEEEECCCCCCCCCCCEEEEC Q ss_conf 0000003566789999514101100123443---2110011222203355303577777776413566665544201212 Q gi|254780395|r 311 EFEINPIWNLGWHLKKQTSGQLSYNYYSDAL---SKRININQIYLTGTGEKNSFDMRALHYHIQEPLSKNEKKFPQANIY 387 (762) Q Consensus 311 ~~~l~~~~~~~~~~~~~SD~~y~~dy~~d~~---~~~~~~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 387 (762) .+.++++|+..++++.|||++|++|+..+.. ....+...+.+.+.++.+.+.+++..||+++.... +.. .....+ T Consensus 306 ~~~~~~~w~~~~d~~~VSD~~Yf~Df~s~~~~~~s~~~L~q~~~~~~~~~nw~~~l~~q~yQ~l~~~l~-~~~-~PY~rl 383 (798) T PRK04423 306 YHNVNSHWQARASLAWVSDTRYVEDFTSRLVGMGSASSLQSTVGIYGTGETWTAGLMADRWQLTDYTLD-ERA-LPYNRQ 383 (798) T ss_pred EEECCCCCEEEEEEEECCCCCHHHHCCCCCCCCCCHHHHHHHHEEEECCCCEEEEEEEEEEEECCCCCC-CCC-CCHHHC T ss_conf 543189807998888704617565424565663547888633303222687246688887880367766-678-876658 Q ss_pred CCCCCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCC--C-----CCCCCCCCCCCCCEEEEEEEEEEEEEEECCCCEEE Q ss_conf 3311100146666651689756788777502454433--2-----23454445556521466777567764321671799 Q gi|254780395|r 388 PLIDYRYVDLQYAKSQQISITGNITAISRAKEKNTIN--Q-----MKHDSQPWIPNGINRRLSIEADWRKKIIGPLGILF 460 (762) Q Consensus 388 P~i~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~--~-----~~~~~~~~~~~~~~~R~~~~~~~~~~~~~~~G~~~ 460 (762) |++.+++.... ..+..+++..+++.+.+....-... . .............++|++++|++++|+..++| .+ T Consensus 384 PQl~~ny~~~~-~~~~d~~l~~e~t~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~R~~~~P~is~P~~~~~g-~l 461 (798) T PRK04423 384 PRLYFNWDKPV-FGILEAGVYAEAVRFTHDDSYLVQPPNDGNNDNYVRTGIRNQEYGGGSRLDVKPYVSMPLSGAAW-YV 461 (798) T ss_pred CEEEEEEECCC-CCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEEECCCE-EE T ss_conf 65788612377-88501787888898660654434444455543222345443355641589987788773225742-68 Q ss_pred EEEEEEEEEEEEEECCCCCCC--CCCCCCCCEEEEEEEEEEEEEEECCEECCCCCEEEEEEEEEEEEECCCCCCCCCCCC Q ss_conf 988899877887512566666--654456520236898989999831021367632589877789994376666535655 Q gi|254780395|r 461 TPIANIRGDLHYLSFNRDLSS--DTISNNPNFVASKMLTAGLDIRYPIVAVTQKSRHILEGIAQVYAATDEKYIKTIPNE 538 (762) Q Consensus 461 tp~~~lr~~~Y~~~~~~~~~~--~~~~~~~~~~~r~~~~~~l~~~~p~~~~~~~~~h~LeP~~q~~~~~~~~~~~~iPn~ 538 (762) +|.++++.+.|.++....... ......++.++.+..++|+.|++++...+.+++||||||++ |.+.|+++|+.||+| T Consensus 462 tP~~~l~~t~Y~ld~~~~~~~~~~~~~s~sRtlP~fslDsGl~FER~~~l~g~~~~QTLEPRlf-YlYvPYrDQs~iP~F 540 (798) T PRK04423 462 TPTLAWRYTAYQLDSGLAETAPLTGNRSPSRSLPIASLDAGLYFDRETSLFGTNYLNTLEPRMY-YLYVPYRNQDDLPVF 540 (798) T ss_pred CHHHEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEECCEEEEEECCCCCCCCEEEEEEEEE-EEECCCCCCCCCCCC T ss_conf 7666014368870467555555466677434402689805269998533567753488701699-995573455668865 Q ss_pred CCCCCCCCCHHHCCCCCCCCCCEECCCEEEEEEEEEEEECC--CCCEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 44432247021002122357400046347887888888428--8606632300378841333334543336667788876 Q gi|254780395|r 539 DSHSLVLNSTSLFTQNRFSGFDRIEGGNRTNLGIRYIGSFN--NLFTINGVIGRSIHILGTNSFSIPDSIGIGQNSGLED 616 (762) Q Consensus 539 Ds~~~~fd~~nLF~~nrfsG~DRie~~~r~~~Gl~~~~~~~--~~~~~~~~iGQ~~~~~~~~~~~~~~~~~~~~~sgl~~ 616 (762) ||..++|++.+||+.|||+|.|||.|+||+++||+.+.... +.|.++++|||+|||.++++-..+. ....+. T Consensus 541 DT~~l~fsy~qLFrdnRFSG~DRI~DANQlTlGlTSR~~D~~tG~Er~r~SiGQi~YF~DrrV~L~~~------~~~~~~ 614 (798) T PRK04423 541 DTRPFTFSWGQLFRDSRYTGADRQNDANQLTLAVTSRWLDQDTGKEKLSLSAGQILYFSDSLVTINNS------EQTIEQ 614 (798) T ss_pred CCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCEEEEEEEEEEEEECCCCCCCCCC------CCCCCC T ss_conf 57766789899456884556444452114202247788717898078888635899943760015898------866657 Q ss_pred CCCCEEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEECCCCE-EEEEEEEECCCCCCCCCCEEEEEEEEEEECCC Q ss_conf 64437999988436777999999861676606688999999647408-99999872267544666325899899997256 Q gi|254780395|r 617 KFSDYAAAVRLSLSPKITFSTQTLINPKDWSTRRTDTTIDYTMGSYE-ANISYTHIPAYPLYAYDARKTIQSRIKFKIND 695 (762) Q Consensus 617 ~~SD~v~~~~~~~~~~~~l~~~~~~d~~~~~~~r~~~~~~~~~~~~~-~~~~y~~~~~~~~~~~~~~~~~~~~~~~~l~~ 695 (762) ..||+++++.+++.+.|.+.+..+||++..++.+.++.+.|+++..+ ++++|+|..... ...+..+|+.+++.|||++ T Consensus 615 ~~S~~~~e~~~~l~~~w~l~~~~QYd~~~~~~~~~~~~l~Y~p~~~~~lnl~YRy~r~~~-~~~~~i~Q~d~s~~wPi~~ 693 (798) T PRK04423 615 GKSAWVADANYMINDRWTLGATYQWNPNSRREDLASLRARYLLNNDGIINLAYRYRRNLI-DNSDQLKQADFSFLYPINP 693 (798) T ss_pred CCCCEEEEEEEEECCCEEEEEEEEECCCCCCCCEEEEEEEEECCCCCEEEEEEEEEECCC-CCCCCEEEEEEEEEEEECC T ss_conf 877003788898558889974799899988100267899997699968988857860577-7888516877999999438 Q ss_pred CEEEEEEEEECCCCCCC-EEEEEEEEECCEEEEEEEEEEC---CCCCCCCEEEEEEEEECCCCCCCCCC Q ss_conf 56999999951654541-2256569964669999999971---47788735999999702445587667 Q gi|254780395|r 696 VLSADASLKWNMRAELP-SHSIGLAYQNDCATVKIIYKNE---SPNIHGYKIQARLSLRTIGDFNPIDI 760 (762) Q Consensus 696 ~w~~~~~~~ydl~~~~~-~~~~Gl~Y~~~Cw~~~l~y~r~---~~~~~~~~i~~~~~Lk~LG~~~~~~~ 760 (762) +|++.|+++||+++++. +..+||+|++|||+++++++|. .+...+++|+|+|+|||||++.+.+. T Consensus 694 ~W~~vGr~~Ydl~~~r~~E~l~GleY~sCCWa~R~v~~R~~~~~~~~~~~~i~~q~eL~GL~~~G~~~~ 762 (798) T PRK04423 694 RWSAVGRYYYSLLDKKPLEIIGGVQWDSCCLAVRGVVRRFVRNRDGEMDNSIQLEFVLKGLSSFGQDTD 762 (798) T ss_pred CEEEEEEEEECCCCCCCHHHCCCEEECCCCEEEEEEEEEEEECCCCCCCEEEEEEEEECCCCCCCCCHH T ss_conf 889999999718788112531070453887168979999981688762305999999736435699879 No 2 >PRK03761 organic solvent tolerance protein; Provisional Probab=100.00 E-value=0 Score=963.99 Aligned_cols=717 Identities=15% Similarity=0.182 Sum_probs=582.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCC-CCCCCCCCCCCCC--CCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEE Q ss_conf 953310247678999999998763245-1002555678654--5666776666079992079997668979999879999 Q gi|254780395|r 1 MRYNYRHLIGATTFFLAFNIITPQGTP-PSNEQTTTSTPSK--IKKNETNRHSELDISSDEIVLNSEGSTTTAVGNVKIE 77 (762) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ad~~~~~~~~~~~~a~G~v~i~ 77 (762) ||+..+-||..+++++.... .+.+ .+..|+...-|.- +.....|.+.||.|+||+++++. .+.++++|||+|. T Consensus 1 m~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pi~i~AD~~~~~~-~~~~~~~GnV~i~ 76 (782) T PRK03761 1 MKKRSPTLLATMIATALYSQ---QGLAADLASQCMLGVPSYDRPLVTGDPNDLPVTINADNAEANY-PDKAVYTGNVDIK 76 (782) T ss_pred CCCCHHHHHHHHHHHHHHCH---HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECCEEECC-CCEEEEEEEEEEE T ss_conf 98405789999999988470---7678988866640587667674269988884799933246258-9779999339999 Q ss_pred ECCEEEEEEEEEEECCCC------EEEEECCEEEECCCCCEEEEEEEEEECCCCEEEEEEEEEEEECCCEEEEEEEEEEC Q ss_conf 899699951899986378------79997348998389878997069994302559998479997067502102468723 Q gi|254780395|r 78 YKGYHLSARDITFNHKNH------RIIASGNIKLIEPDKRQIHAEYLDITDDFTNGIIKNLTIKIPADETYLTASSAQII 151 (762) Q Consensus 78 ~~~~~l~Ad~i~y~~~~~------~~~a~G~v~~~~~~g~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 151 (762) +++.+|.||+|+|||+++ +|.|+|||.+.++ +..+.|+++.++++.++|.+.++.+.+.+.+.++.|+.+... T Consensus 77 ~g~~~l~Ad~v~~~q~~~~~~~~~~~~a~Gnv~~~~~-~~~~~g~~~~~~~~~~~~~~~~~~y~~~~~~~~g~A~~~~~~ 155 (782) T PRK03761 77 QGNSRLQADEVQLHQQEAPGEPVRTVDAEGNVHYDDN-QIILKGPKAWSNLNTKDTNVWNGDYQMVGRQGRGKADLMKQR 155 (782) T ss_pred ECCEEEECCEEEEECCCCCCCCCEEEEEECCEEEECC-CEEEEECEEEEECCCCEEEEECCEEEEECCCCEEEEEEEEEC T ss_conf 7998999567999523786665028998689899689-618981469995377645996248999737860470025991 Q ss_pred -CCE-EEEEEEEEEECCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEECCEEEECCEEEEECCCEECCCCCCCCCCCCCC Q ss_conf -975-999987997313766577888508999669999677777999442899827068632540012455545454100 Q gi|254780395|r 152 -GQR-TIFDKGTYTACSSCSKPNSRPPFWIVKSKRAILNRKTHTIRLEKPYLEIFGNSIFYFPLIEIPDETVVRKTGFLT 229 (762) Q Consensus 152 -~~~-~~~~~~~~T~C~~~~~~~~~~P~W~i~a~~i~~d~~~~~~~~~~a~l~i~~vPv~y~Py~~~P~~~~~R~sGfL~ 229 (762) ++. ++|++|+||+|+||++ .|+|+|++|+||+++++++++||+|+|+||||||+|||.||+++ +||||||+ T Consensus 156 ~~~~~~~l~~~~yTtC~p~~~------~W~i~A~~I~~D~~~~~~~~~nA~l~i~gvPVfY~Pyl~fP~~d-~RkSGfL~ 228 (782) T PRK03761 156 GENRYTILENGSFTSCLPGDN------SWSVVGSEIIHDREEEVAEIWNARFKVGDVPVFYSPYLQLPVGD-KRRSGFLI 228 (782) T ss_pred CCCCEEEECCEEECCCCCCCC------CEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCEECCCCC-CCCCCEEC T ss_conf 798489981718830899899------33999458999177788999801999968106956736066788-63011377 Q ss_pred EEEECCCCCCCCEECEEEEECCCCCCEEEEEEECCCCEEEEEEEEEEEECCCCEEEEEEEEECCCCCCC-----CCCEEE Q ss_conf 056413333430001034302465201455420243101100134553025607999987505556765-----432011 Q gi|254780395|r 230 PLFSSGEKQRFGVGIPYYLVISDNSDATFTFSPHPKKGILGEMELRKYFHSGKHTLHAAYMYNNNVESG-----EERHQA 304 (762) Q Consensus 230 P~~~~s~~~G~~~~~Pyy~~i~~~~D~Tltp~~~s~rG~~l~~eyRy~~~~g~~~~~~~~~~~d~~~~~-----~~~~r~ 304 (762) |++++|+++|+++++|||||||||+|+||+|++|++||+++++||||+.+.|.+++.++++++|+.... ..+.|| T Consensus 229 Ps~~~ss~~G~~~~~PYY~nIAPn~D~TltP~~~s~RG~~l~~EfRyl~~~g~g~i~~~~l~~D~~~~~~~~~~~~~~R~ 308 (782) T PRK03761 229 PNAKYSSKNGFEFELPYYWNIAPNYDATITPHYMSRRGWQWENEFRYLTQAGAGLMEGEYLPSDKVYDDEHPNDDNSSRW 308 (782) T ss_pred CCCEECCCCCEEEEEEEEEECCCCCCEEEEEEEECCCCCCEEEEEEECCCCCCCEEEEEECCCCCCCCCCCCCCCCCCEE T ss_conf 73204488975998889996478764199448986776230067998367877249999726874344445666778506 Q ss_pred EEEECCEECCEEEEEEEEEEEEEECCHHHHCCCCCCC--CCCEEEEEEEECCCCCCEEEEEEEEEEEEECCCCCCCCCCC Q ss_conf 2320210000003566789999514101100123443--21100112222033553035777777764135666655442 Q gi|254780395|r 305 MLASIAEFEINPIWNLGWHLKKQTSGQLSYNYYSDAL--SKRININQIYLTGTGEKNSFDMRALHYHIQEPLSKNEKKFP 382 (762) Q Consensus 305 ~~~~~~~~~l~~~~~~~~~~~~~SD~~y~~dy~~d~~--~~~~~~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 382 (762) .+.++|+..++++|++++++..|||++|++|+..... ....+.+.+.+.+....+.+.+++..||.++... .. T Consensus 309 ~~~~~h~g~~~~~w~~~~d~~~vSD~~Yf~D~~s~~~~~~~~~L~q~~~~~y~~~~~~~~~~~~~fQ~l~~~~-----~~ 383 (782) T PRK03761 309 LFYWNHSGVMDQVWRFNVDYTKVSDPSYFNDFDSKYGSSTDGYATQKFSVGYAQQNWDATLSTKQFQVFDEQN-----TS 383 (782) T ss_pred EEEEEEEEEECCCCEEEEEEEEECCHHHHHHCCCCCCCCCCCCEEEEEEEEEECCCCEEEEEEEEEEECCCCC-----CC T ss_conf 9998674120788179888888313667764445556534001778999988237826989988758617776-----78 Q ss_pred EEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEEECCCCEEEEE Q ss_conf 01212331110014666665168975678877750245443322345444555652146677756776432167179998 Q gi|254780395|r 383 QANIYPLIDYRYVDLQYAKSQQISITGNITAISRAKEKNTINQMKHDSQPWIPNGINRRLSIEADWRKKIIGPLGILFTP 462 (762) Q Consensus 383 ~~~~lP~i~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~R~~~~~~~~~~~~~~~G~~~tp 462 (762) ...++|++++++..+. ..+..+++.++.+.+.+... .....+|++++|.|++|+..++|. ++| T Consensus 384 py~~lPql~~~~~~~~-~~~~~~~l~~~~~~F~~~~~---------------~~~~~~R~~~~p~l~~p~~~~~g~-l~~ 446 (782) T PRK03761 384 PYRAEPQLDFNYYQND-LGPFDFRLYSQAVRFVNDSD---------------NMPEATRVHLEPTLNLPLSNRWGS-LNT 446 (782) T ss_pred CHHCCCEEEEEEECCC-CCCEEEEEEEEEEEEECCCC---------------CCCCEEEEEECCEEEEEEECCCEE-EEE T ss_conf 7210617763457478-88633999977898533788---------------876448997522777774447527-989 Q ss_pred EEEEEEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEEEEEECCEE----CCCCCEEEEEEEEEEEEECCCCCCCCCCCC Q ss_conf 8899877887512566666654456520236898989999831021----367632589877789994376666535655 Q gi|254780395|r 463 IANIRGDLHYLSFNRDLSSDTISNNPNFVASKMLTAGLDIRYPIVA----VTQKSRHILEGIAQVYAATDEKYIKTIPNE 538 (762) Q Consensus 463 ~~~lr~~~Y~~~~~~~~~~~~~~~~~~~~~r~~~~~~l~~~~p~~~----~~~~~~h~LeP~~q~~~~~~~~~~~~iPn~ 538 (762) .+++++++|..+........ ....+..+.|.+|.++++..++|.+ .+.+++|||||++||.+.| +++|+.|||| T Consensus 447 ~~~l~~t~Y~~~~~~~~~~~-~~~~~~~~~R~~P~~~ld~~l~fer~~~~~~~~~tQtLEPr~qyly~P-~~dQs~IPnf 524 (782) T PRK03761 447 EAKLLATHYQQDNLDSYNAN-NEKLDESVNRVIPQFKVDGKMVLERDMDMLAPGYTQTLEPRVQYLYVP-YRDQSNIYNY 524 (782) T ss_pred EEEEEEEEEEECCCCCCCCC-CCCCCCCEEEECCEEEEECEEEEEEECCCCCCCCEEEEEEEEEEEECC-CCCCCCCCCC T ss_conf 86788899981167545556-556666501314348860216999842235786249995089999757-7667789765 Q ss_pred CCCCCCCCCHHHCCCCCCCCCCEECCCEEEEEEEEEEEECCC-CCEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 444322470210021223574000463478878888884288-6066323003788413333345433366677888766 Q gi|254780395|r 539 DSHSLVLNSTSLFTQNRFSGFDRIEGGNRTNLGIRYIGSFNN-LFTINGVIGRSIHILGTNSFSIPDSIGIGQNSGLEDK 617 (762) Q Consensus 539 Ds~~~~fd~~nLF~~nrfsG~DRie~~~r~~~Gl~~~~~~~~-~~~~~~~iGQ~~~~~~~~~~~~~~~~~~~~~sgl~~~ 617 (762) ||+.++||++|||+.|||+|+|||++++|+++||+++..... .+.+++++||+|++.++++.... .......... T Consensus 525 DS~~~~fd~~~LF~~nRFsG~DRI~d~nq~t~GltsR~~d~~g~E~~~~siGQi~yf~d~rv~~~~----~~~~~~~~~~ 600 (782) T PRK03761 525 DSTLLQSDYYGLFRDRTYSGLDRIASANQVTTGATTRFYDDAAVERFNISVGQIYYFTESRTGDDN----ITWENDDKTG 600 (782) T ss_pred CCCCCCCCHHHCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCEEEEEEEEEEEEECCCCCCCCC----CCCCCCCCCC T ss_conf 575455576674565665675544787688887789998688824689998689992476444677----7766666566 Q ss_pred CCCEEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEECCCCE-EEEEEEEECCCCC------------CCCCCEEE Q ss_conf 4437999988436777999999861676606688999999647408-9999987226754------------46663258 Q gi|254780395|r 618 FSDYAAAVRLSLSPKITFSTQTLINPKDWSTRRTDTTIDYTMGSYE-ANISYTHIPAYPL------------YAYDARKT 684 (762) Q Consensus 618 ~SD~v~~~~~~~~~~~~l~~~~~~d~~~~~~~r~~~~~~~~~~~~~-~~~~y~~~~~~~~------------~~~~~~~~ 684 (762) .|+++++..+++.+.|.+.+..+||++..++.+.+..+.|+++..+ ++++|+|...... ...+..+| T Consensus 601 ~~~~a~~~~~~~~~~~~~~~~~qyd~~~~~~~~~~~~l~y~~~~~~~~~l~Yry~~~~yi~~~l~~~~~~~~~~~~~i~q 680 (782) T PRK03761 601 SLVWAGETYWRISDRWGLRGGIQYDTRLDSVQLANSSLEYRRDEDRLIQLNYRYASPEYIQATLPKYYSTAEQYQQGISQ 680 (782) T ss_pred CCHHHHHEEEEECCCEEEEEEEEECCCCCCEEEEEEEEEEECCCCCEEEEEEEEECCHHHHCCCCCCCCCCCCCCCCCEE T ss_conf 20312310033057389864589877678403588899994489949974523651301002344334554334576327 Q ss_pred EEEEEEEECCCCEEEEEEEEECCCCCC-CEEEEEEEEECCEEEEEEEEEECCC--------CCCCCEEEEEEEEECCCC- Q ss_conf 998999972565699999995165454-1225656996466999999997147--------788735999999702445- Q gi|254780395|r 685 IQSRIKFKINDVLSADASLKWNMRAEL-PSHSIGLAYQNDCATVKIIYKNESP--------NIHGYKIQARLSLRTIGD- 754 (762) Q Consensus 685 ~~~~~~~~l~~~w~~~~~~~ydl~~~~-~~~~~Gl~Y~~~Cw~~~l~y~r~~~--------~~~~~~i~~~~~Lk~LG~- 754 (762) +.+++.|||+++|+++|.++||++.+. ++..+||+|++|||+++|+|+|... ...+++|+|+|+|||||+ T Consensus 681 ~~~s~~~pi~~~W~~~g~~~YDl~~~~~~e~~~GleY~~cCW~~rl~~~R~~~~~~~~~~~~~~d~~i~~q~~lkGLg~~ 760 (782) T PRK03761 681 VGAVASWPIADRWSIVGAYYYDTNANKPAEQLLGLQYNSCCWAIGVGYERKLTGWDNDKQHPVYDNNIGFNIELRGLSSN 760 (782) T ss_pred EEEEEEEEECCCEEEEEEEEEECCCCEEEEEEEEEEEECCCEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEEEECCCCC T ss_conf 88999998038889999999847689282258533796786999999999972776776663004249999998423677 Q ss_pred CCC Q ss_conf 587 Q gi|254780395|r 755 FNP 757 (762) Q Consensus 755 ~~~ 757 (762) +.. T Consensus 761 ~g~ 763 (782) T PRK03761 761 YGL 763 (782) T ss_pred CCC T ss_conf 788 No 3 >COG1452 Imp Organic solvent tolerance protein OstA [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=848.14 Aligned_cols=676 Identities=24% Similarity=0.354 Sum_probs=557.7 Q ss_pred CCCE-EEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEECCCCEEEEECCEEEECCCCCEEEEEEEEEECCCCE Q ss_conf 6660-799920799976689799998799998996999518999863787999734899838987899706999430255 Q gi|254780395|r 48 RHSE-LDISSDEIVLNSEGSTTTAVGNVKIEYKGYHLSARDITFNHKNHRIIASGNIKLIEPDKRQIHAEYLDITDDFTN 126 (762) Q Consensus 48 ~~~~-~~~~ad~~~~~~~~~~~~a~G~v~i~~~~~~l~Ad~i~y~~~~~~~~a~G~v~~~~~~g~~~~~~~~~~~~~~~~ 126 (762) ...| +.++||.++++.+.++|+|+|||+|.+++++|.||+|+||+++++++|+|||++.+++|..+.|+++++++++++ T Consensus 48 ~~~p~~~iea~~~~~~~~~~~ita~GnV~i~~~~~~l~Ad~v~y~q~t~~~~A~GnV~~~~~~g~~i~gd~a~~~ld~~~ 127 (784) T COG1452 48 QNLPKILIEADALEYNRKDQIITAEGNVEILYGDYRLLADKVDYDQKTDRVTATGNVEYIDGDGILIKGDKADVNLDTKK 127 (784) T ss_pred CCCCCEEEEHHHHCCCCCCCEEEEECEEEEEECCEEEEEEEEEEECCCCEEEEECCEEEECCCCCEEECCEEEEECCCCC T ss_conf 56984589800200147785699966299996888998758997234788999777999828980998756899614576 Q ss_pred EEEEEEEEEEECCCEEEEEEEEEE-CCCEEEEEEEEEEECCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEECCEEEECC Q ss_conf 999847999706750210246872-3975999987997313766577888508999669999677777999442899827 Q gi|254780395|r 127 GIIKNLTIKIPADETYLTASSAQI-IGQRTIFDKGTYTACSSCSKPNSRPPFWIVKSKRAILNRKTHTIRLEKPYLEIFG 205 (762) Q Consensus 127 g~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~T~C~~~~~~~~~~P~W~i~a~~i~~d~~~~~~~~~~a~l~i~~ 205 (762) |.+.++++.+.+..++++|+.+++ +++.+++++|+||+|+||++ |+|+|+|++|.+|.+++.+.++||+|+|+| T Consensus 128 g~~~n~~~~~~~~~~~~~A~~a~~~d~~~~~l~~at~TtC~~~dn-----p~W~v~A~~i~~~~~~~~~~~~na~f~v~g 202 (784) T COG1452 128 GFVSNFRYQTVGRQGRFVAKSAERRDNDVTILKNATYTTCEPCDN-----PIWSVKASKIIQNREKQFAVFWNARFEVGG 202 (784) T ss_pred CEEEEEEEEECCCEEEEEEEEEEECCCEEEEEEEEEEECCCCCCC-----CCEEEEEEEEEECCCCCEEEEEEEEEEECC T ss_conf 458600898525225799855221487069995108823688899-----725999648998177858999956999888 Q ss_pred EEEEECCCEECCCCCCCCCCCCCCEEEECCCCCCCCEECEEEEECCCCCCEEEEEEECCCCEEEEEEEEEEEECCCCEEE Q ss_conf 06863254001245554545410005641333343000103430246520145542024310110013455302560799 Q gi|254780395|r 206 NSIFYFPLIEIPDETVVRKTGFLTPLFSSGEKQRFGVGIPYYLVISDNSDATFTFSPHPKKGILGEMELRKYFHSGKHTL 285 (762) Q Consensus 206 vPv~y~Py~~~P~~~~~R~sGfL~P~~~~s~~~G~~~~~Pyy~~i~~~~D~Tltp~~~s~rG~~l~~eyRy~~~~g~~~~ 285 (762) |||||+|||.||+++ +||||||+|++++++++|+++.+|||||||||+|+||+|+||+|||.++++||||++..|.+.+ T Consensus 203 vPVfYlPyf~~P~~~-~RkSGFL~P~~~~ss~~Gf~~~~PyY~~laPn~DlTltp~y~skrG~~l~~e~Ryl~~~g~~~~ 281 (784) T COG1452 203 VPVFYLPYFFFPDDD-RRKSGFLIPSFGYSSRNGFSYGQPYYWNLAPNYDLTLTPQYMSKRGFGLEGEFRYLFSNGAGQL 281 (784) T ss_pred CCEEEEEEEECCCCC-CCCCCCCCCCCCCCCCCCCEEECCEEEECCCCCCCEECHHHHHHHCCCCCEEEEEECCCCCCEE T ss_conf 025950036140798-4422211354244588880783614784268988498667886608542236998614898427 Q ss_pred EEEEEECCCCCCCC-------CCEEEEEEECCEECCEEEEEEEEEEEEEECCHHHHCCCCCCCCCCEEEEEEEECCCCCC Q ss_conf 99875055567654-------32011232021000000356678999951410110012344321100112222033553 Q gi|254780395|r 286 HAAYMYNNNVESGE-------ERHQAMLASIAEFEINPIWNLGWHLKKQTSGQLSYNYYSDALSKRININQIYLTGTGEK 358 (762) Q Consensus 286 ~~~~~~~d~~~~~~-------~~~r~~~~~~~~~~l~~~~~~~~~~~~~SD~~y~~dy~~d~~~~~~~~s~~~l~~~~~~ 358 (762) .+.++++++..... .++|+++..+|.|.++ .|.+++++..+||..|.+||... ......++.+++..... T Consensus 282 ~~~~l~~~~~~~~~~~~~~~~~r~~~~~~~~~~f~~~-~w~~~v~~~~vSD~~y~~d~~~~--~~~~~~s~~y~t~~~~~ 358 (784) T COG1452 282 RGAGLPQDDSYYFGDDTRGNEVRWRYYWNHSGDFDQN-RWKFGVDVERVSDPNYFNDFDIR--YSSVSTSDGYLTQLGRR 358 (784) T ss_pred ECCCCCCCCHHHCCCCCCCCCCCCEEHHHCCCCHHHC-CEEECCCEEEECCHHHHHHHCCC--CCEEEEEEEEECCEEEE T ss_conf 5131356563231554334433210033305752003-33303542586373254331156--63589802023023015 Q ss_pred EEEEEEEEEEEEECC---CCCCCCCCCEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCC Q ss_conf 035777777764135---66665544201212331110014666665168975678877750245443322345444555 Q gi|254780395|r 359 NSFDMRALHYHIQEP---LSKNEKKFPQANIYPLIDYRYVDLQYAKSQQISITGNITAISRAKEKNTINQMKHDSQPWIP 435 (762) Q Consensus 359 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~lP~i~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~ 435 (762) .||..+.+.+..+.. ...++.....+.++|+++++|.......+ .++++.+.+...+...... T Consensus 359 ~Y~~~~~~~~~l~~~~f~~~~~~~~~~~~~~~P~l~~~y~~~~~~~~-~L~~~~~s~~~~f~~~~~~------------- 424 (784) T COG1452 359 NYFALENLNFGLQIKYFQNLTNEKDNKPYNALPQLDYNYYLNDPVLG-ELNFTLDSQFVRFSNDDDY------------- 424 (784) T ss_pred EEEECCCCEEEEEEECCCCHHHHHCCCCCCCCCEEEECCCCCCCCCC-CEEEEEEEEEEEEECCCCC------------- T ss_conf 67604673599885100001233035764215426533546876554-2134300799998415666------------- Q ss_pred CCCEEEEEEEEEEEEEEECCCCEEEEEEEEEEEEEEEEECCCCCCCC------CCCCCCCEE-EEEEEEEEEEEEECCEE Q ss_conf 65214667775677643216717999888998778875125666666------544565202-36898989999831021 Q gi|254780395|r 436 NGINRRLSIEADWRKKIIGPLGILFTPIANIRGDLHYLSFNRDLSSD------TISNNPNFV-ASKMLTAGLDIRYPIVA 508 (762) Q Consensus 436 ~~~~~R~~~~~~~~~~~~~~~G~~~tp~~~lr~~~Y~~~~~~~~~~~------~~~~~~~~~-~r~~~~~~l~~~~p~~~ 508 (762) .....|++.+|.++.++..++| .+++..++..+.|..+.......+ ......+.. .+.+...++..+.|+.. T Consensus 425 ~~~~~r~~~~p~~~~~~~~~~~-~~~~~~gl~~t~y~~d~g~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~R~~~~~ 503 (784) T COG1452 425 MPERTRLHVEPSLRLSLSNSWG-SFGTESGLLATPYALDRGDAYDVDTNTADKASKFVNRSLPQRNMDSGGTLERDPLLF 503 (784) T ss_pred CCCEEECCCCCCCCCEEEECCE-EECCCCCEEECHHHHHCCCCCCCCCCCHHHHCCCCCCCCEEEEECCCCEEEEEEEEC T ss_conf 6430202356765513781021-334685306415565368554455443012213544320279832554088532413 Q ss_pred CCCCCEEEEEEEEEEEEECCCCCCCCCC--CCCCCCCCCCCHHHCCCCCCCCCCEECCCEEEEEEEEEEEECCCC-CEEE Q ss_conf 3676325898777899943766665356--554443224702100212235740004634788788888842886-0663 Q gi|254780395|r 509 VTQKSRHILEGIAQVYAATDEKYIKTIP--NEDSHSLVLNSTSLFTQNRFSGFDRIEGGNRTNLGIRYIGSFNNL-FTIN 585 (762) Q Consensus 509 ~~~~~~h~LeP~~q~~~~~~~~~~~~iP--n~Ds~~~~fd~~nLF~~nrfsG~DRie~~~r~~~Gl~~~~~~~~~-~~~~ 585 (762) .. +..|||||++|+.+. |+++|+.+| ++|++.++|+++|||+.|||+|+|||++++|+++|++.+..+..+ +.++ T Consensus 504 ~~-~~~~tlEP~~~y~y~-p~~~q~~~~~~~~Ds~~~~f~~~~LFr~~~ysG~Dri~~~n~ls~g~~~~~~d~~g~e~~~ 581 (784) T COG1452 504 GI-GYTQTLEPRAQYLYR-PYEDQSGIPLYNEDSQSLQFDYSNLFRDNRYSGYDRIEGANQLSLGVRSRFTDDAGFERLN 581 (784) T ss_pred CC-CCEEEECCCEEEEEC-CHHHCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCCHHHHH T ss_conf 67-513641550047666-8788547765455502200254223012776762102778636565542344156242231 Q ss_pred EEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEECCCCE-E Q ss_conf 230037884133333454333666778887664437999988436777999999861676606688999999647408-9 Q gi|254780395|r 586 GVIGRSIHILGTNSFSIPDSIGIGQNSGLEDKFSDYAAAVRLSLSPKITFSTQTLINPKDWSTRRTDTTIDYTMGSYE-A 664 (762) Q Consensus 586 ~~iGQ~~~~~~~~~~~~~~~~~~~~~sgl~~~~SD~v~~~~~~~~~~~~l~~~~~~d~~~~~~~r~~~~~~~~~~~~~-~ 664 (762) +.+||++++...+.. .+.........|+++....+++...|.+.+..+||.+...+...+....|+++... + T Consensus 582 ~~~GQ~~y~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~y~~~~~~~~ 654 (784) T COG1452 582 ASAGQSYYFKGSRLL-------IPSDDKVPVGRSPWESAATYDINDGLNLSGSVQYDPSLNRRSLGSSGAEYRPEFKKII 654 (784) T ss_pred CCCCEEEEECCCCCC-------CCCCCCCCCCCCCCHHHCCCCHHHCEEEEEEEEECHHHCCCCCCCCHHEECCCCCCCE T ss_conf 014707774365355-------7765555434345555405164121666212650632254300210110045545440 Q ss_pred EEEEEEECCCCCCC--------CCCEEEEEEEEEEECCCCEEEEEEEEECCCCC-CCEEEEEEEEECCEEEEEEEEEECC Q ss_conf 99998722675446--------66325899899997256569999999516545-4122565699646699999999714 Q gi|254780395|r 665 NISYTHIPAYPLYA--------YDARKTIQSRIKFKINDVLSADASLKWNMRAE-LPSHSIGLAYQNDCATVKIIYKNES 735 (762) Q Consensus 665 ~~~y~~~~~~~~~~--------~~~~~~~~~~~~~~l~~~w~~~~~~~ydl~~~-~~~~~~Gl~Y~~~Cw~~~l~y~r~~ 735 (762) .++|+|........ .+..+|+.+++.|+|+++|+++|.+.||+..+ ..++.+|++|.+|||.+++.+++.. T Consensus 655 ~~~y~y~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~i~~~ws~~g~~~yd~~~~~~~~~~lG~~Y~~~Cw~~~~~~~~~~ 734 (784) T COG1452 655 QLNYRYLSDEYIYISLQSSYGYKNKISQLGASAGYPLTKNWSVFGRIYYDLRADKPLERFLGVAYQSSCWAVRLGLSRYS 734 (784) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCEEEEEEEEECCCCCCHHHHHCCEEEECCCHHEEEEEEEEE T ss_conf 24455404311311345444667652698677788820537999778613135843433117277121101453346885 Q ss_pred C---------CCCCCEEEEEEEEECCCCCC Q ss_conf 7---------78873599999970244558 Q gi|254780395|r 736 P---------NIHGYKIQARLSLRTIGDFN 756 (762) Q Consensus 736 ~---------~~~~~~i~~~~~Lk~LG~~~ 756 (762) + ...+.++.|.++|++||++. T Consensus 735 ~~~~~~~~~~~~~~~~i~f~i~lr~lg~~~ 764 (784) T COG1452 735 NSRNNNDRLHLKFDSSIGFNISLRGLGDIG 764 (784) T ss_pred ECCCCCCCEEEEECCCCEEEEEEEECCCCC T ss_conf 023688504663126517999997448765 No 4 >pfam04453 OstA_C Organic solvent tolerance protein. Family involved in organic solvent tolerance in bacteria. The region contains several highly conserved, potentially catalytic, residues. Probab=100.00 E-value=0 Score=393.99 Aligned_cols=371 Identities=23% Similarity=0.304 Sum_probs=311.0 Q ss_pred CEEEEEEECCEECCEEEEEEEEEEEEEECCHHHHCCCCCC--CCCCEEEEEEEECCCCCCEEEEEEEEEEEEECCCCCCC Q ss_conf 2011232021000000356678999951410110012344--32110011222203355303577777776413566665 Q gi|254780395|r 301 RHQAMLASIAEFEINPIWNLGWHLKKQTSGQLSYNYYSDA--LSKRININQIYLTGTGEKNSFDMRALHYHIQEPLSKNE 378 (762) Q Consensus 301 ~~r~~~~~~~~~~l~~~~~~~~~~~~~SD~~y~~dy~~d~--~~~~~~~s~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (762) +.||++.++|.+.|+++|+++++++.|||++|++||..+. .....+.+.+.+.+..+++++.+++.+||.+.....+ T Consensus 1 ~~Rg~~~~~~~~~l~~~w~~~~d~~~vSD~~Yl~d~~~~~~~~s~~~L~~~~~~~y~~~~~~~~~~~~~fq~L~~~~~~- 79 (382) T pfam04453 1 KNRGYVFAKHQFRINDNWSFGVDYTYVSDKNYFRDYSSNLGLRSQTYLLRQGRLSYRGENWDFSLRAQYYQTLRDSIIS- 79 (382) T ss_pred CCCEEEEEEEEEECCCCEEEEEEEEEECHHHHHHHCCCCCCCCCHHHEEEEEEEEEECCCCEEEEEEEEEEECCCCCCC- T ss_conf 9313498574376599979999999943688897618775542412421576788883772899999999933777666- Q ss_pred CCCCEEEECCCCCCCCCCCCCCC-CCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEEECCCC Q ss_conf 54420121233111001466666-51689756788777502454433223454445556521466777567764321671 Q gi|254780395|r 379 KKFPQANIYPLIDYRYVDLQYAK-SQQISITGNITAISRAKEKNTINQMKHDSQPWIPNGINRRLSIEADWRKKIIGPLG 457 (762) Q Consensus 379 ~~~~~~~~lP~i~~~~~~~~~~~-~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~R~~~~~~~~~~~~~~~G 457 (762) ..+...++|.+++++....... +..++++++.+++.|..+.. ..+|++++|.|++|+..++| T Consensus 80 -~~~py~~lP~l~~~~~~~~~~~~~l~~~~~~~~~~f~~~~~~~----------------~~~R~~~~~~~~~~~~~~~g 142 (382) T pfam04453 80 -NDTPYQRLPQLDYNYYLPDLFLGGLDFSFDSQYRNFARDDDDQ----------------PGTRLHLEPELSLPFSTPWG 142 (382) T ss_pred -CCCCCEEEEEEEEEEECCCCCCCCEEEEEEEEEEEEECCCCCC----------------CEEEEEEEEEEEEEEECCCE T ss_conf -6777366259999988178776637999998899987058997----------------25899999999777876876 Q ss_pred EEEEEEEEEEEEEEEEECCCCCCCCCCCCCCCEEEEEEEEEEEEEEECCEEC---CCCCEEEEEEEEEEEEECCCCCCCC Q ss_conf 7999888998778875125666666544565202368989899998310213---6763258987778999437666653 Q gi|254780395|r 458 ILFTPIANIRGDLHYLSFNRDLSSDTISNNPNFVASKMLTAGLDIRYPIVAV---TQKSRHILEGIAQVYAATDEKYIKT 534 (762) Q Consensus 458 ~~~tp~~~lr~~~Y~~~~~~~~~~~~~~~~~~~~~r~~~~~~l~~~~p~~~~---~~~~~h~LeP~~q~~~~~~~~~~~~ 534 (762) + ++|.++++++.|.++..................|.+|.+++++++||.+. +.+++|+|||+++|.+.| +.+|+. T Consensus 143 ~-~~~~~~l~~~~Y~~~~~~~~~~~~~~~~~~~~~r~~p~~~l~~~~~f~R~~~~~~~~~~~LeP~~~y~~~p-~~~q~~ 220 (382) T pfam04453 143 F-LTPELSLRGTYYDLDVNYGPGNRITGGLDSSVSRVLPTFSLDSGLPFERDTELGGGYTQTLEPRAQYLYVP-YRDQGN 220 (382) T ss_pred E-EEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEEEEEEEEECCCCCCEEEEEEEEEEEEEC-CCCCCC T ss_conf 9-96999998779983157777655566667742066046888864289998436887349997899999805-766665 Q ss_pred CCCCCCCCCCCCCHHHCCCCCCCCCCEECCCEEEEEEEEEEEECCC-CCEEEEEEEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 5655444322470210021223574000463478878888884288-606632300378841333334543336667788 Q gi|254780395|r 535 IPNEDSHSLVLNSTSLFTQNRFSGFDRIEGGNRTNLGIRYIGSFNN-LFTINGVIGRSIHILGTNSFSIPDSIGIGQNSG 613 (762) Q Consensus 535 iPn~Ds~~~~fd~~nLF~~nrfsG~DRie~~~r~~~Gl~~~~~~~~-~~~~~~~iGQ~~~~~~~~~~~~~~~~~~~~~sg 613 (762) ||||||..++||++|||+.|||+|.|||++++++++||+++....+ ++.+++.+||++++.+.+.+.. +..++ T Consensus 221 iP~~Ds~~~~f~~~nLF~~nrfsG~DRi~~~n~~t~Gl~~~~~~~~~~~~~~~~~GQ~~~~~~~~~~l~------~~~~~ 294 (382) T pfam04453 221 LPNFDSSSLDFDYTNLFRDNRFSGYDRIEDANRLTYGLTSRFFDDNGGERLRLSLGQSFYLSDSRVFLV------GDESG 294 (382) T ss_pred CCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEEECCCCCCCCC------CCCCC T ss_conf 767654313569678632055777665588868999999999916996599999879999278754357------77777 Q ss_pred CCCCCCCEEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEECCCC-EEEEEEEEECCCCCCCC---CCEEEEEEEE Q ss_conf 8766443799998843677799999986167660668899999964740-89999987226754466---6325899899 Q gi|254780395|r 614 LEDKFSDYAAAVRLSLSPKITFSTQTLINPKDWSTRRTDTTIDYTMGSY-EANISYTHIPAYPLYAY---DARKTIQSRI 689 (762) Q Consensus 614 l~~~~SD~v~~~~~~~~~~~~l~~~~~~d~~~~~~~r~~~~~~~~~~~~-~~~~~y~~~~~~~~~~~---~~~~~~~~~~ 689 (762) +++++||+++++.++|.++|++.++++||++..++.+.++++.|++++. .++++|+|.+..+.... +..+|+.+++ T Consensus 295 ~~~~~Sd~~~~~~~~~~~~~~l~~~~~~d~~~~~~~~~~~~~~y~~~~~~~~~l~Yry~~~~~~~~~~~~~~~~q~~~s~ 374 (382) T pfam04453 295 LTRRFSDLVAELSFNPNEGLSLSSSIQYDPNTNRFERAEVGLSYRPDRGNSINLGYRYRRDDPDYQYATNNGISQIGLSA 374 (382) T ss_pred CCCCCCCEEEEEEEEECCCEEEEEEEEECCCCCCEEEEEEEEEEECCCCCEEEEEEEEECCCCCCCCCCCCCCEEEEEEE T ss_conf 55663411467899866978999889988888940899999999879998999998885275432346778724699999 Q ss_pred EEECCCCE Q ss_conf 99725656 Q gi|254780395|r 690 KFKINDVL 697 (762) Q Consensus 690 ~~~l~~~w 697 (762) .|||+++| T Consensus 375 ~~pi~~~W 382 (382) T pfam04453 375 QYPLARNW 382 (382) T ss_pred EEECCCCC T ss_conf 96768899 No 5 >COG1452 Imp Organic solvent tolerance protein OstA [Cell envelope biogenesis, outer membrane] Probab=99.30 E-value=1.2e-07 Score=68.01 Aligned_cols=179 Identities=17% Similarity=0.110 Sum_probs=111.3 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEEE-CCEEEEEEEEEEECCCCEEEEE-CCEEEECCCCCEEEEEEEEEECCCCEEEE Q ss_conf 799920799976689799998799998-9969995189998637879997-34899838987899706999430255999 Q gi|254780395|r 52 LDISSDEIVLNSEGSTTTAVGNVKIEY-KGYHLSARDITFNHKNHRIIAS-GNIKLIEPDKRQIHAEYLDITDDFTNGII 129 (762) Q Consensus 52 ~~~~ad~~~~~~~~~~~~a~G~v~i~~-~~~~l~Ad~i~y~~~~~~~~a~-G~v~~~~~~g~~~~~~~~~~~~~~~~g~~ 129 (762) ..|.||++.||+.++.++|+|||++.. ++..+.+|++.++.+++.-.+. +-+.+.+. +....|+.++... .....+ T Consensus 82 ~~l~Ad~v~y~q~t~~~~A~GnV~~~~~~g~~i~gd~a~~~ld~~~g~~~n~~~~~~~~-~~~~~A~~a~~~d-~~~~~l 159 (784) T COG1452 82 YRLLADKVDYDQKTDRVTATGNVEYIDGDGILIKGDKADVNLDTKKGFVSNFRYQTVGR-QGRFVAKSAERRD-NDVTIL 159 (784) T ss_pred EEEEEEEEEEECCCCEEEEECCEEEECCCCCEEECCEEEEECCCCCCEEEEEEEEECCC-EEEEEEEEEEECC-CEEEEE T ss_conf 89987589972347889997779998289809987568996145764586008985252-2579985522148-706999 Q ss_pred EEEEEEEEC---CC-EEEEEEEEEECC--CEEEEEEEEEEECCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEECCEEEE Q ss_conf 847999706---75-021024687239--759999879973137665778885089996699996777779994428998 Q gi|254780395|r 130 KNLTIKIPA---DE-TYLTASSAQIIG--QRTIFDKGTYTACSSCSKPNSRPPFWIVKSKRAILNRKTHTIRLEKPYLEI 203 (762) Q Consensus 130 ~~~~~~~~~---~~-~~~~~~~~~~~~--~~~~~~~~~~T~C~~~~~~~~~~P~W~i~a~~i~~d~~~~~~~~~~a~l~i 203 (762) +++.+-.++ +. ..+.|+.+..+. +..++.+|.+--- ..|..++=.=.+=.+...+-+-. -+.+.. T Consensus 160 ~~at~TtC~~~dnp~W~v~A~~i~~~~~~~~~~~~na~f~v~--------gvPVfYlPyf~~P~~~~RkSGFL-~P~~~~ 230 (784) T COG1452 160 KNATYTTCEPCDNPIWSVKASKIIQNREKQFAVFWNARFEVG--------GVPVFYLPYFFFPDDDRRKSGFL-IPSFGY 230 (784) T ss_pred EEEEEECCCCCCCCCEEEEEEEEEECCCCCEEEEEEEEEEEC--------CCCEEEEEEEECCCCCCCCCCCC-CCCCCC T ss_conf 510882368889972599964899817785899995699988--------80259500361407984422211-354244 Q ss_pred CC-EE-EEECCCEECCCCCCCCCCCCCCEEEECCCCCCCCEECEE Q ss_conf 27-06-863254001245554545410005641333343000103 Q gi|254780395|r 204 FG-NS-IFYFPLIEIPDETVVRKTGFLTPLFSSGEKQRFGVGIPY 246 (762) Q Consensus 204 ~~-vP-v~y~Py~~~P~~~~~R~sGfL~P~~~~s~~~G~~~~~Py 246 (762) .. -+ .++.||+..+-|.-. ==|.| .+-+++|+.+..-| T Consensus 231 ss~~Gf~~~~PyY~~laPn~D---lTltp--~y~skrG~~l~~e~ 270 (784) T COG1452 231 SSRNGFSYGQPYYWNLAPNYD---LTLTP--QYMSKRGFGLEGEF 270 (784) T ss_pred CCCCCCEEECCEEEECCCCCC---CEECH--HHHHHHCCCCCEEE T ss_conf 588880783614784268988---49866--78866085422369 No 6 >PRK04423 organic solvent tolerance protein; Provisional Probab=99.26 E-value=2.1e-07 Score=66.52 Aligned_cols=196 Identities=16% Similarity=0.117 Sum_probs=101.4 Q ss_pred EEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEECCCCEEEEEC-CEEEECCCCCEEEEEEEEEECCCCEEEE Q ss_conf 079992079997668979999879999899699951899986378799973-4899838987899706999430255999 Q gi|254780395|r 51 ELDISSDEIVLNSEGSTTTAVGNVKIEYKGYHLSARDITFNHKNHRIIASG-NIKLIEPDKRQIHAEYLDITDDFTNGII 129 (762) Q Consensus 51 ~~~~~ad~~~~~~~~~~~~a~G~v~i~~~~~~l~Ad~i~y~~~~~~~~a~G-~v~~~~~~g~~~~~~~~~~~~~~~~g~~ 129 (762) ...|.||++.|+++++.+.|+|||.+..++.+|.||++.++..++...+.+ ...+.+.. ..-.|+.++++. +...+ T Consensus 86 ~~~l~AD~v~ydq~~~~v~A~GnV~~~~~~~~~~g~~~~~~l~~~~g~~~~~~y~l~~~~-~~G~A~~~~~~g--~~~~l 162 (798) T PRK04423 86 DQFLGADNLRMDTETGNYIAEGNVRYQDTSIRMVADRAEGNQDTDTHKITNIQYQLVERR-GNGDAESVDLQG--QVGQM 162 (798) T ss_pred CEEEEEEEEEEECCCCEEEEECCEEEECCCCEEECCEEEEECCCCEEEEECCEEEEEECC-CCEEEEEEEECC--CEEEE T ss_conf 989991179998678989997778996798589814289982676479966789999467-723667767448--64799 Q ss_pred EEEEEEEEC---CCEEEEEEEEEEC--CCEEEEEEEEEEECCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEECCEEEEC Q ss_conf 847999706---7502102468723--97599998799731376657788850899966999967777799944289982 Q gi|254780395|r 130 KNLTIKIPA---DETYLTASSAQII--GQRTIFDKGTYTACSSCSKPNSRPPFWIVKSKRAILNRKTHTIRLEKPYLEIF 204 (762) Q Consensus 130 ~~~~~~~~~---~~~~~~~~~~~~~--~~~~~~~~~~~T~C~~~~~~~~~~P~W~i~a~~i~~d~~~~~~~~~~a~l~i~ 204 (762) +++.+-... ....+.|+.+..+ .+.-+..++++---. .|..++=-=+.-++.+.+-+..- ..+..- T Consensus 163 ~~~~~ttC~~~~~~W~~~a~~i~~d~~~~~~~~~~a~~~~~~--------vPv~y~Py~~fp~~~~R~sG~L~-P~~~~s 233 (798) T PRK04423 163 HRSTYTTCDPSQPIWRLRAPEIDVDNDEGFGTARNAVLQIGK--------VPVLYFPWFKFPIDDRRQTGLLF-PQLGLS 233 (798) T ss_pred ECCEECCCCCCCCCEEEEECEEEEECCCCEEEEEEEEEEECC--------EEEEECCCEECCCCCCCCCEEEC-CCCCCC T ss_conf 571983289889835999457998077888999945999998--------91793476845269984324364-666414 Q ss_pred C--EEEEECCCEECCCCCCCCCCCCCCEEEECCCCCCCCEECEEEEECCCCCCEEEEEEECC Q ss_conf 7--06863254001245554545410005641333343000103430246520145542024 Q gi|254780395|r 205 G--NSIFYFPLIEIPDETVVRKTGFLTPLFSSGEKQRFGVGIPYYLVISDNSDATFTFSPHP 264 (762) Q Consensus 205 ~--vPv~y~Py~~~P~~~~~R~sGfL~P~~~~s~~~G~~~~~Pyy~~i~~~~D~Tltp~~~s 264 (762) . =-.+-+||..---|. .---|.| .+-+++|+.+..-|-+ +.++..-++...|+. T Consensus 234 ~~~G~~~~~PyY~niapn---~D~T~tp--r~ms~RG~~l~~EfRY-L~~~~~G~~~~~yLp 289 (798) T PRK04423 234 GRNGFDYAQPIYLNLAPN---YDATLLP--RYMSKRGFMFDTEFRY-LYDGGRGEVTGNFLP 289 (798) T ss_pred CCCCEEEEEEEEEECCCC---CCCEEEH--HHHHHCCCCCCCEEEE-CCCCCCCEEEEEECC T ss_conf 899769999998852776---4418701--1442126310325897-267887418644447 No 7 >pfam03968 OstA OstA-like protein. This family of proteins are mostly uncharacterized. However the family does include E. coli OstA that has been characterized as an organic solvent tolerance protein. Probab=99.23 E-value=3.4e-09 Score=78.50 Aligned_cols=60 Identities=20% Similarity=0.363 Sum_probs=49.9 Q ss_pred CCCCEEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEECCC-----CEEEEECCEEEE Q ss_conf 666607999207999766897999987999989969995189998637-----879997348998 Q gi|254780395|r 47 NRHSELDISSDEIVLNSEGSTTTAVGNVKIEYKGYHLSARDITFNHKN-----HRIIASGNIKLI 106 (762) Q Consensus 47 ~~~~~~~~~ad~~~~~~~~~~~~a~G~v~i~~~~~~l~Ad~i~y~~~~-----~~~~a~G~v~~~ 106 (762) ....||.|+||+++++...+.++.+|||+|.|++.+|+||+++...+. ..+.|.|++.+. T Consensus 23 D~~~PI~I~AD~~~~d~~~~~~~~~GnV~i~QG~~~i~ad~~~i~~~~~~~~~~~~~a~g~~~~~ 87 (141) T pfam03968 23 DRDQPIEIEADNLEADDKKGVATFTGNVVVTQGSITLRADKVVVTQDKKSNGIERLEATGPATFR 87 (141) T ss_pred CCCCCEEEEEEEEEEECCCCEEEEECCEEEEECCEEEEEEEEEEEECCCCCCEEEEEEECCEEEE T ss_conf 67999899945799982799999987499980870999648999986888967999997760499 No 8 >PRK10894 hypothetical protein; Provisional Probab=99.22 E-value=1.3e-09 Score=81.35 Aligned_cols=83 Identities=17% Similarity=0.228 Sum_probs=58.5 Q ss_pred CCCCCEEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEECCC-----CEEEEECCEE-EEC--CCCCEEE--E Q ss_conf 7666607999207999766897999987999989969995189998637-----8799973489-983--8987899--7 Q gi|254780395|r 46 TNRHSELDISSDEIVLNSEGSTTTAVGNVKIEYKGYHLSARDITFNHKN-----HRIIASGNIK-LIE--PDKRQIH--A 115 (762) Q Consensus 46 ~~~~~~~~~~ad~~~~~~~~~~~~a~G~v~i~~~~~~l~Ad~i~y~~~~-----~~~~a~G~v~-~~~--~~g~~~~--~ 115 (762) .+.+.||.|+||++++|...++.+.+|||+|.||..+|.||+++..... ..+.|.|+.. +.+ .+|..+. | T Consensus 30 sD~~qPI~I~AD~~~~d~~~~~avytGNVvitQGsl~I~ADkv~v~~~~~~~~~~~i~a~G~pA~f~Q~~~~g~~i~a~A 109 (184) T PRK10894 30 GDTEQPIHIDSDQQSLDMQGNVVTFTGNVVVTQGTIKINADKVVVTRPGGEQGKEVIDGYGNPATFYQMQDNGKPVKGHA 109 (184) T ss_pred CCCCCCEEEEECEEEEECCCCEEEEEEEEEEEEEEEEEEEEEEEEEECCCCCCEEEEEEECCCEEEEEECCCCCCEEEEE T ss_conf 34469989994528943159989999259999625999935899997699753799999689379999656898137697 Q ss_pred EEEEEECCCCEEE Q ss_conf 0699943025599 Q gi|254780395|r 116 EYLDITDDFTNGI 128 (762) Q Consensus 116 ~~~~~~~~~~~g~ 128 (762) ++++|+.....-. T Consensus 110 ~~i~Y~~~~~~i~ 122 (184) T PRK10894 110 SKMHYELAKDFVV 122 (184) T ss_pred EEEEEECCCCEEE T ss_conf 4899982699999 No 9 >COG1934 Uncharacterized protein conserved in bacteria [Function unknown] Probab=99.07 E-value=3.2e-08 Score=71.92 Aligned_cols=83 Identities=19% Similarity=0.226 Sum_probs=61.2 Q ss_pred CCCEEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEEC--CCC--EEEEECC----EEEECCCCC--EEEEEE Q ss_conf 666079992079997668979999879999899699951899986--378--7999734----899838987--899706 Q gi|254780395|r 48 RHSELDISSDEIVLNSEGSTTTAVGNVKIEYKGYHLSARDITFNH--KNH--RIIASGN----IKLIEPDKR--QIHAEY 117 (762) Q Consensus 48 ~~~~~~~~ad~~~~~~~~~~~~a~G~v~i~~~~~~l~Ad~i~y~~--~~~--~~~a~G~----v~~~~~~g~--~~~~~~ 117 (762) .+.||.++||..+.+..+++++.+|||+|.++..+|+||++++.. +.+ .++|.|| +...++ |. .-+|++ T Consensus 29 ~~qPi~i~ad~~el~~~~~~a~ftGNV~i~qG~~~l~Adkv~v~~~~~~~~~~i~a~gn~~~~~q~~~~-~k~~~g~ad~ 107 (173) T COG1934 29 RDQPITIEADQQELDDKNGVATFTGNVVVTQGTITLRADKVTVTRDKEGGKEVIEANGNPATFSQKVDN-GKTQKGTADQ 107 (173) T ss_pred CCCCEEEECCCEEEECCCCEEEEECCEEEEECCCEEEEEEEEEECCCCCCCEEEEEECCEEEEEEEECC-CCCEEEEEEE T ss_conf 558779972522430668779997638998466179713699980578985489995466678876507-8503888516 Q ss_pred EEEECCCCEEEEEE Q ss_conf 99943025599984 Q gi|254780395|r 118 LDITDDFTNGIIKN 131 (762) Q Consensus 118 ~~~~~~~~~g~~~~ 131 (762) +.|+...+.-.+.+ T Consensus 108 ~~Y~~~~~~~vLtG 121 (173) T COG1934 108 AIYDLKNDTIVLTG 121 (173) T ss_pred EEEEECCCEEEEEC T ss_conf 99982078999947 No 10 >PRK03761 organic solvent tolerance protein; Provisional Probab=99.05 E-value=2e-06 Score=60.02 Aligned_cols=115 Identities=13% Similarity=0.110 Sum_probs=85.2 Q ss_pred CEEEEEEEEEEEECCCC------EEEEEEEEEEEECCEEEEEEEEEEECCCCEEEEE-CCEEEECCCCCEEEEEEEEEEC Q ss_conf 60799920799976689------7999987999989969995189998637879997-3489983898789970699943 Q gi|254780395|r 50 SELDISSDEIVLNSEGS------TTTAVGNVKIEYKGYHLSARDITFNHKNHRIIAS-GNIKLIEPDKRQIHAEYLDITD 122 (762) Q Consensus 50 ~~~~~~ad~~~~~~~~~------~~~a~G~v~i~~~~~~l~Ad~i~y~~~~~~~~a~-G~v~~~~~~g~~~~~~~~~~~~ 122 (762) +...|.||++.+++.++ +|+|+|||.+..++..+.+|++.+|..++...+. +..+|.... ..-.|++++... T Consensus 78 g~~~l~Ad~v~~~q~~~~~~~~~~~~a~Gnv~~~~~~~~~~g~~~~~~~~~~~~~~~~~~y~~~~~~-~~g~A~~~~~~~ 156 (782) T PRK03761 78 GNSRLQADEVQLHQQEAPGEPVRTVDAEGNVHYDDNQIILKGPKAWSNLNTKDTNVWNGDYQMVGRQ-GRGKADLMKQRG 156 (782) T ss_pred CCEEEECCEEEEECCCCCCCCCEEEEEECCEEEECCCEEEEECEEEEECCCCEEEEECCEEEEECCC-CEEEEEEEEECC T ss_conf 9989995679995237866650289986898996896189814699953776459962489997378-604700259917 Q ss_pred CCCEEEEEEEEEEEE---CCCEEEEEEEEEECC--CEEEEEEEEEEEC Q ss_conf 025599984799970---675021024687239--7599998799731 Q gi|254780395|r 123 DFTNGIIKNLTIKIP---ADETYLTASSAQIIG--QRTIFDKGTYTAC 165 (762) Q Consensus 123 ~~~~g~~~~~~~~~~---~~~~~~~~~~~~~~~--~~~~~~~~~~T~C 165 (762) +.+--+++++.+-.+ ++...+.|+.+..+. +...+++|.+--- T Consensus 157 ~~~~~~l~~~~yTtC~p~~~~W~i~A~~I~~D~~~~~~~~~nA~l~i~ 204 (782) T PRK03761 157 ENRYTILENGSFTSCLPGDNSWSVVGSEIIHDREEEVAEIWNARFKVG 204 (782) T ss_pred CCCEEEECCEEECCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEEC T ss_conf 984899817188308998993399945899917778899980199996 No 11 >TIGR03002 outer_YhbN cell envelope biogenesis protein YhbN; InterPro: IPR014340 Members of this entry include YhbN, which is an essential protein in Escherichia coli. It is implicated in cell envelope integrity, and has numerous homologues in other Proteobacteria in a conserved location near lipopolysaccharide inner core biosynthesis genes. This entry is distantly related to the organic solvent tolerance protein (OstA).. Probab=98.96 E-value=1.2e-07 Score=68.03 Aligned_cols=102 Identities=17% Similarity=0.293 Sum_probs=71.8 Q ss_pred CCCCCEEEEEEEEEEEECCCCEEEEEEEEEEEECCEEEEEEEEEEEC--CCC---EEEEECC--EEEEC--CCCCE-EE- Q ss_conf 76666079992079997668979999879999899699951899986--378---7999734--89983--89878-99- Q gi|254780395|r 46 TNRHSELDISSDEIVLNSEGSTTTAVGNVKIEYKGYHLSARDITFNH--KNH---RIIASGN--IKLIE--PDKRQ-IH- 114 (762) Q Consensus 46 ~~~~~~~~~~ad~~~~~~~~~~~~a~G~v~i~~~~~~l~Ad~i~y~~--~~~---~~~a~G~--v~~~~--~~g~~-~~- 114 (762) .+.+.||.|+||+.++|..+++++.+|||+|.||...|+||+|+..+ ..+ .+.|.|+ +.|.+ .+|.. +. T Consensus 4 ~D~~qPi~I~Ad~~~~D~~~~~~~f~GnV~itQG~~~I~ADkv~v~~~~~~~~~~~~~A~G~~PA~F~q~~d~g~~~~~g 83 (150) T TIGR03002 4 SDREQPIEIEADSQTLDDKKNVVTFTGNVVITQGTLKIRADKVVVTRPNDGGGIEKATATGNKPATFRQKLDDGKEYIEG 83 (150) T ss_pred CCCCCCEEEEEEEEEEECCCCEEEEECCEEEEECCEEEECCEEEEEECCCCCCEEEEEEECCCCEEEEEEECCCCEEEEE T ss_conf 87568689994006654028838886565878297598522889882688786089999807818988995599827999 Q ss_pred -EEEEEEECCCCEEEEE-EEEEEEECCC-EEEEEEEEE Q ss_conf -7069994302559998-4799970675-021024687 Q gi|254780395|r 115 -AEYLDITDDFTNGIIK-NLTIKIPADE-TYLTASSAQ 149 (762) Q Consensus 115 -~~~~~~~~~~~~g~~~-~~~~~~~~~~-~~~~~~~~~ 149 (762) |++++|+.....-.|. ++++ .+.+ --+.|+.|. T Consensus 84 ~A~~i~Y~~~~~~~~Ltg~A~v--~Q~~k~~~~G~~I~ 119 (150) T TIGR03002 84 QANRIEYDPAKDTVVLTGNARV--KQGGKNEVSGDVIT 119 (150) T ss_pred ECCEEEEECCCCEEEEECCEEE--EECCCCEEECCEEE T ss_conf 7118999325779999123499--78897778766889 No 12 >COG1934 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.51 E-value=7.8e-06 Score=56.01 Aligned_cols=83 Identities=17% Similarity=0.287 Sum_probs=67.5 Q ss_pred CCCCEEEEEEEEEEEEC--CCC--EEEEEEE----EEEEECCE--EEEEEEEEEECCCCEEEEECCEEEECCCCCEEEEE Q ss_conf 66660799920799976--689--7999987----99998996--99951899986378799973489983898789970 Q gi|254780395|r 47 NRHSELDISSDEIVLNS--EGS--TTTAVGN----VKIEYKGY--HLSARDITFNHKNHRIIASGNIKLIEPDKRQIHAE 116 (762) Q Consensus 47 ~~~~~~~~~ad~~~~~~--~~~--~~~a~G~----v~i~~~~~--~l~Ad~i~y~~~~~~~~a~G~v~~~~~~g~~~~~~ 116 (762) ....++.|.||.++... +.+ .+.|.|| ++..+.+. .=+||+++||-+++++.+.||+.+.+ +|+.++|+ T Consensus 57 i~qG~~~l~Adkv~v~~~~~~~~~~i~a~gn~~~~~q~~~~~k~~~g~ad~~~Y~~~~~~~vLtGna~v~q-~gn~i~G~ 135 (173) T COG1934 57 VTQGTITLRADKVTVTRDKEGGKEVIEANGNPATFSQKVDNGKTQKGTADQAIYDLKNDTIVLTGNAYVLQ-LGNVIRGD 135 (173) T ss_pred EEECCCEEEEEEEEEECCCCCCCEEEEEECCEEEEEEEECCCCCEEEEEEEEEEEECCCEEEEECCEEEEE-CCCEEEEE T ss_conf 98466179713699980578985489995466678876507850388851699982078999947799985-79888763 Q ss_pred EEEEECCCCEEEEE Q ss_conf 69994302559998 Q gi|254780395|r 117 YLDITDDFTNGIIK 130 (762) Q Consensus 117 ~~~~~~~~~~g~~~ 130 (762) ++.++.+.+...+. T Consensus 136 kIty~~~~~~~~a~ 149 (173) T COG1934 136 KITYLTKTGRAVAE 149 (173) T ss_pred EEEEECCCCEEEEC T ss_conf 99999434579962 No 13 >pfam06835 DUF1239 Protein of unknown function (DUF1239). This family consists of several hypothetical bacterial proteins of around 190 residues in length. The function of this family is unknown. Probab=98.33 E-value=3e-05 Score=52.10 Aligned_cols=108 Identities=16% Similarity=0.214 Sum_probs=70.6 Q ss_pred CEEEEEEEEEEEECCCCEEEEEEEEEEEE-CC---EEEEEEEEEEECCCCEEEEECCEEEECCCC----CEEEEEEEEEE Q ss_conf 60799920799976689799998799998-99---699951899986378799973489983898----78997069994 Q gi|254780395|r 50 SELDISSDEIVLNSEGSTTTAVGNVKIEY-KG---YHLSARDITFNHKNHRIIASGNIKLIEPDK----RQIHAEYLDIT 121 (762) Q Consensus 50 ~~~~~~ad~~~~~~~~~~~~a~G~v~i~~-~~---~~l~Ad~i~y~~~~~~~~a~G~v~~~~~~g----~~~~~~~~~~~ 121 (762) -...+.|+++.+..+.+...++.=+...+ ++ .++.|++..++.+.+.+.+.|+|++....+ ..+..+.+.++ T Consensus 51 l~~~l~a~~~~~~~~~~~~~l~~P~~~~~~~~~~~w~i~A~~g~~~~~~~~i~L~g~V~i~~~~~~~~~~~l~T~~l~~~ 130 (176) T pfam06835 51 LQYRLTADKLEHYPDDDTTEFTNPVLIIYQDGKPPWTITADRAKLNKDGQTLYLYGNVVIQSLEDKSRLTTITTENLTVN 130 (176) T ss_pred EEEEEEECEEEEECCCCCEEEEEEEEEEECCCCCEEEEEECEEEEECCCCEEEEECCEEEEECCCCCCCEEEEEHEEEEE T ss_conf 99999904579965899689996399997799836999979499937996999989999997689987569992209999 Q ss_pred CCCCEEEEEEEEEEEECCCEEEEEEEEEE--CCCEEEEE Q ss_conf 30255999847999706750210246872--39759999 Q gi|254780395|r 122 DDFTNGIIKNLTIKIPADETYLTASSAQI--IGQRTIFD 158 (762) Q Consensus 122 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 158 (762) .+.+...-..+.. +..+.....|...+. ..+...|. T Consensus 131 ~~~~~~~td~~V~-i~~~~~~~~g~gm~~~l~~~~~~l~ 168 (176) T pfam06835 131 LDTQDASTDQPVT-ITGPNFVSTGTGMKANLKTGTAELL 168 (176) T ss_pred CCCCEEEECCCEE-EEECCCEEEEEEEEEECCCCEEEEE T ss_conf 4777999487499-9909979998889999758999994 No 14 >pfam03968 OstA OstA-like protein. This family of proteins are mostly uncharacterized. However the family does include E. coli OstA that has been characterized as an organic solvent tolerance protein. Probab=98.31 E-value=2.5e-05 Score=52.67 Aligned_cols=76 Identities=14% Similarity=0.246 Sum_probs=48.7 Q ss_pred CCEEEEEEEEEEEECCC-----CEEEEEEEE------EEEECCEEEEEEEEEEECCCCEEEEECCEEEECCCCCEEEEEE Q ss_conf 66079992079997668-----979999879------9998996999518999863787999734899838987899706 Q gi|254780395|r 49 HSELDISSDEIVLNSEG-----STTTAVGNV------KIEYKGYHLSARDITFNHKNHRIIASGNIKLIEPDKRQIHAEY 117 (762) Q Consensus 49 ~~~~~~~ad~~~~~~~~-----~~~~a~G~v------~i~~~~~~l~Ad~i~y~~~~~~~~a~G~v~~~~~~g~~~~~~~ 117 (762) .+.+.|.||+|+...+. ..+.|.|++ .+..+...-+||+++|+..++.+.+.||+++.++ ++++.|++ T Consensus 54 QG~~~i~ad~~~i~~~~~~~~~~~~~a~g~~~~~~~~~~~~~~~~a~a~~~~Y~~~~~~i~L~Gna~l~q~-~n~i~G~~ 132 (141) T pfam03968 54 QGSITLRADKVVVTQDKKSNGIERLEATGPATFRQKVDLVGRQIRGTADRAEYDSKKRIIVLTGNAVLTQG-GNSIRGDK 132 (141) T ss_pred ECCEEEEEEEEEEEECCCCCCEEEEEEECCEEEEEEEECCCCCEEEEEEEEEEEECCCEEEEECCEEEEEC-CCEEEECE T ss_conf 08709996489999868889679999977604999861488867999718999916999999733999989-99999689 Q ss_pred EEEECCCC Q ss_conf 99943025 Q gi|254780395|r 118 LDITDDFT 125 (762) Q Consensus 118 ~~~~~~~~ 125 (762) +.++++.+ T Consensus 133 I~ydl~~g 140 (141) T pfam03968 133 ITYDIKEG 140 (141) T ss_pred EEEECCCC T ss_conf 99997679 No 15 >PRK10894 hypothetical protein; Provisional Probab=98.08 E-value=4.3e-05 Score=51.05 Aligned_cols=81 Identities=15% Similarity=0.216 Sum_probs=60.0 Q ss_pred CCCEEEEEEEEEEEEC--C---CCEEEEEEEEEEEE----CCE--EEEEEEEEEECCCCEEEEECCEEEECCCCCEEEEE Q ss_conf 6660799920799976--6---89799998799998----996--99951899986378799973489983898789970 Q gi|254780395|r 48 RHSELDISSDEIVLNS--E---GSTTTAVGNVKIEY----KGY--HLSARDITFNHKNHRIIASGNIKLIEPDKRQIHAE 116 (762) Q Consensus 48 ~~~~~~~~ad~~~~~~--~---~~~~~a~G~v~i~~----~~~--~l~Ad~i~y~~~~~~~~a~G~v~~~~~~g~~~~~~ 116 (762) .++.+.|.||.|+.-. . ...+.|.|+...++ ++. .=+|++|.|+..++++.+.||+++.++ ++.+.|+ T Consensus 61 tQGsl~I~ADkv~v~~~~~~~~~~~i~a~G~pA~f~Q~~~~g~~i~a~A~~i~Y~~~~~~i~L~GnA~L~Q~-~~~v~G~ 139 (184) T PRK10894 61 TQGTIKINADKVVVTRPGGEQGKEVIDGYGNPATFYQMQDNGKPVKGHASKMHYELAKDFVVLTGNAYLEQL-DSNIKGD 139 (184) T ss_pred EEEEEEEEEEEEEEEECCCCCCEEEEEEECCCEEEEEECCCCCCEEEEEEEEEEECCCCEEEEECCEEEEEC-CCEEEEC T ss_conf 962599993589999769975379999968937999965689813769748999826999999765899968-9989937 Q ss_pred EEEEECCCCEEEE Q ss_conf 6999430255999 Q gi|254780395|r 117 YLDITDDFTNGII 129 (762) Q Consensus 117 ~~~~~~~~~~g~~ 129 (762) ++.|+.+.+.-.+ T Consensus 140 ~I~Ydl~~~~~~a 152 (184) T PRK10894 140 KITYLVKEQKMQA 152 (184) T ss_pred EEEEEECCCEEEE T ss_conf 8999906679999 No 16 >TIGR03002 outer_YhbN cell envelope biogenesis protein YhbN; InterPro: IPR014340 Members of this entry include YhbN, which is an essential protein in Escherichia coli. It is implicated in cell envelope integrity, and has numerous homologues in other Proteobacteria in a conserved location near lipopolysaccharide inner core biosynthesis genes. This entry is distantly related to the organic solvent tolerance protein (OstA).. Probab=98.00 E-value=3e-05 Score=52.06 Aligned_cols=109 Identities=12% Similarity=0.148 Sum_probs=55.3 Q ss_pred CCEEEEEEEEEEECCCCEEEEECCEEEECCCCCEEEEEEEEEEC--CCCE---EEE--E-EEEEE--EECCC--EEEEEE Q ss_conf 99699951899986378799973489983898789970699943--0255---999--8-47999--70675--021024 Q gi|254780395|r 79 KGYHLSARDITFNHKNHRIIASGNIKLIEPDKRQIHAEYLDITD--DFTN---GII--K-NLTIK--IPADE--TYLTAS 146 (762) Q Consensus 79 ~~~~l~Ad~i~y~~~~~~~~a~G~v~~~~~~g~~~~~~~~~~~~--~~~~---g~~--~-~~~~~--~~~~~--~~~~~~ 146 (762) ....|.||+..+|.+++.+..+|||.++++. ..|.||++.+.. +.+. ... . -+.|. +.++. ..+.|+ T Consensus 8 qPi~I~Ad~~~~D~~~~~~~f~GnV~itQG~-~~I~ADkv~v~~~~~~~~~~~~~A~G~~PA~F~q~~d~g~~~~~g~A~ 86 (150) T TIGR03002 8 QPIEIEADSQTLDDKKNVVTFTGNVVITQGT-LKIRADKVVVTRPNDGGGIEKATATGNKPATFRQKLDDGKEYIEGQAN 86 (150) T ss_pred CCEEEEEEEEEEECCCCEEEEECCEEEEECC-EEEECCEEEEEECCCCCCEEEEEEECCCCEEEEEEECCCCEEEEEECC T ss_conf 8689994006654028838886565878297-598522889882688786089999807818988995599827999711 Q ss_pred EEEE--CCCEEEE-EEEEEEECCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEC Q ss_conf 6872--3975999-98799731376657788850899966999967777799944 Q gi|254780395|r 147 SAQI--IGQRTIF-DKGTYTACSSCSKPNSRPPFWIVKSKRAILNRKTHTIRLEK 198 (762) Q Consensus 147 ~~~~--~~~~~~~-~~~~~T~C~~~~~~~~~~P~W~i~a~~i~~d~~~~~~~~~~ 198 (762) .++- ..+..+| .+|...-= . + =.|++++|++|..++.+.+.. T Consensus 87 ~i~Y~~~~~~~~Ltg~A~v~Q~--~-----k---~~~~G~~I~Y~~~~q~~~a~~ 131 (150) T TIGR03002 87 RIEYDPAKDTVVLTGNARVKQG--G-----K---NEVSGDVITYDTKKQKVTAKG 131 (150) T ss_pred EEEEECCCCEEEEECCEEEEEC--C-----C---CEEECCEEEEECEEEEEEEEE T ss_conf 8999325779999123499788--9-----7---778766889844012998980 No 17 >pfam06835 DUF1239 Protein of unknown function (DUF1239). This family consists of several hypothetical bacterial proteins of around 190 residues in length. The function of this family is unknown. Probab=97.59 E-value=0.0053 Score=37.05 Aligned_cols=80 Identities=16% Similarity=0.184 Sum_probs=53.0 Q ss_pred CCEEEEEEEEEEEECCCCEEEEEEEEEEEEC-----CEEEEEEEEEEECCCCEEEEECCEEEECCCCCEEEEEEEEEECC Q ss_conf 6607999207999766897999987999989-----96999518999863787999734899838987899706999430 Q gi|254780395|r 49 HSELDISSDEIVLNSEGSTTTAVGNVKIEYK-----GYHLSARDITFNHKNHRIIASGNIKLIEPDKRQIHAEYLDITDD 123 (762) Q Consensus 49 ~~~~~~~ad~~~~~~~~~~~~a~G~v~i~~~-----~~~l~Ad~i~y~~~~~~~~a~G~v~~~~~~g~~~~~~~~~~~~~ 123 (762) ..++.+.|+.-.+..+.+.+.+.|||+|.+. ..+|..+.+.|+.+++.+.-..+|++..+ +..+.|..+.++++ T Consensus 83 ~~~w~i~A~~g~~~~~~~~i~L~g~V~i~~~~~~~~~~~l~T~~l~~~~~~~~~~td~~V~i~~~-~~~~~g~gm~~~l~ 161 (176) T pfam06835 83 KPPWTITADRAKLNKDGQTLYLYGNVVIQSLEDKSRLTTITTENLTVNLDTQDASTDQPVTITGP-NFVSTGTGMKANLK 161 (176) T ss_pred CCEEEEEECEEEEECCCCEEEEECCEEEEECCCCCCCEEEEEHEEEEECCCCEEEECCCEEEEEC-CCEEEEEEEEEECC T ss_conf 83699997949993799699998999999768998756999220999947779994874999909-97999888999975 Q ss_pred CCEEEE Q ss_conf 255999 Q gi|254780395|r 124 FTNGII 129 (762) Q Consensus 124 ~~~g~~ 129 (762) .+.-.+ T Consensus 162 ~~~~~l 167 (176) T pfam06835 162 TGTAEL 167 (176) T ss_pred CCEEEE T ss_conf 899999 No 18 >COG5375 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.34 E-value=0.0053 Score=37.07 Aligned_cols=120 Identities=11% Similarity=0.156 Sum_probs=80.8 Q ss_pred CCCCCEEEEEEEEEEEECCC-CEEEEEE-EEEEEEC---CEEEEEEEEEEECCCCEEEEECCEEEECCCCCEEEEEEEEE Q ss_conf 76666079992079997668-9799998-7999989---96999518999863787999734899838987899706999 Q gi|254780395|r 46 TNRHSELDISSDEIVLNSEG-STTTAVG-NVKIEYK---GYHLSARDITFNHKNHRIIASGNIKLIEPDKRQIHAEYLDI 120 (762) Q Consensus 46 ~~~~~~~~~~ad~~~~~~~~-~~~~a~G-~v~i~~~---~~~l~Ad~i~y~~~~~~~~a~G~v~~~~~~g~~~~~~~~~~ 120 (762) ....-|..+.|++...|..+ +.++++| ++.+-.+ -.++.|.+=+||+.++.+.+.-.|.+...+|.....+++.+ T Consensus 86 nk~~~~ysitA~RAiqD~~n~~~i~Le~i~a~~pvg~~~~a~v~Ak~Gvyd~~~ntL~l~~dv~i~~~eGi~a~~~sA~v 165 (216) T COG5375 86 NKDGRPYSITAERAIQDVKNPNIITLEGILAILPVGEDNPAQVTAKSGVYDRDTNTLDLTDDVKITTDEGIRADFQSAFV 165 (216) T ss_pred CCCCCEEEEEHHHHHHCCCCCCEEEEEEEEEECCCCCCCCEEEEECCCEEECCCCEEECCCCEEEECCHHHHCCCCHHEE T ss_conf 27897503307777650268874899600155247999734687123426667871324687688535433135601246 Q ss_pred ECCCCEEEEEE-EEEEEECCCEEEEEEEEE--ECCCEEEEEEEEEEECCC Q ss_conf 43025599984-799970675021024687--239759999879973137 Q gi|254780395|r 121 TDDFTNGIIKN-LTIKIPADETYLTASSAQ--IIGQRTIFDKGTYTACSS 167 (762) Q Consensus 121 ~~~~~~g~~~~-~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~T~C~~ 167 (762) |+ ++|.+.. ..+........+.++..+ -+|+..+|+.++=|..+| T Consensus 166 Dl--~~G~~sg~~pv~i~~~~gsI~ads~~v~D~G~iivF~g~Vr~~id~ 213 (216) T COG5375 166 DL--RTGNMSGDSPVDISGKGGSITADSYSVTDKGKIIVFEGNVRTRIDP 213 (216) T ss_pred EE--CCCCCCCCCCEEECCCCCCEEEEEEEEECCCCEEEEECCEEEEECC T ss_conf 60--2561026786333079984876136752288489995656898447 No 19 >PRK10893 hypothetical protein; Provisional Probab=96.75 E-value=0.037 Score=31.42 Aligned_cols=115 Identities=10% Similarity=0.097 Sum_probs=72.7 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEEEC-C----EEEEEEEEEEECCCCEEEEECCEEEEC--CC--CCEEEEEEEEEEC Q ss_conf 7999207999766897999987999989-9----699951899986378799973489983--89--8789970699943 Q gi|254780395|r 52 LDISSDEIVLNSEGSTTTAVGNVKIEYK-G----YHLSARDITFNHKNHRIIASGNIKLIE--PD--KRQIHAEYLDITD 122 (762) Q Consensus 52 ~~~~ad~~~~~~~~~~~~a~G~v~i~~~-~----~~l~Ad~i~y~~~~~~~~a~G~v~~~~--~~--g~~~~~~~~~~~~ 122 (762) -.+.||++++=.+.+....+-=|...|+ + ..++|+++..+.+ +.+.++|||++.. ++ -..+..+.+++++ T Consensus 60 y~i~A~~~eHY~~~~~T~F~~Pvl~ly~~~~~~~W~VsA~~a~L~~d-~~L~L~gnV~i~~L~~~s~iq~I~T~~l~idL 138 (190) T PRK10893 60 YRLIAQHVEYYSDQAVSWFTQPVLTTFDKDKVPTWSVKADKAKLTND-RMLYLYGHVEVNALTPDSQLRRITTDNAQINL 138 (190) T ss_pred EEEEECHHHHCCCCCCEEEECCEEEEECCCCCCCEEEEECEEEECCC-CEEEEECCEEEEECCCCCCCCEEECCCEEEEC T ss_conf 89997142013137958981865999889995327999456798648-88999777999853897545389827589804 Q ss_pred CCCEEEEEEEEEEEECCCEEEEEEEEEE--CCCEEEEEEEEEEECCCC Q ss_conf 0255999847999706750210246872--397599998799731376 Q gi|254780395|r 123 DFTNGIIKNLTIKIPADETYLTASSAQI--IGQRTIFDKGTYTACSSC 168 (762) Q Consensus 123 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~T~C~~~ 168 (762) .+++..-+.... +...+....|...+. ......|.+-+-|.-++. T Consensus 139 ~tqd~~SD~~Vt-I~G~~f~s~G~gm~anL~~~~a~L~~~Vkt~YEi~ 185 (190) T PRK10893 139 VTQDVTSDDLVT-LYGTTFNSSGLKMRGNLRSKNAELIEKVRTSYEIQ 185 (190) T ss_pred CCCEEECCCEEE-EECCCCEEEEEEEEEEHHHCEEEEHHHCCEEEECC T ss_conf 664686387099-98798399861089993658889045061999636 No 20 >cd07306 Porin3_VDAC Voltage-dependent anion channel of the outer mitochondrial membrane. The voltage-dependent anion channel (VDAC) regulates the flux of mostly anionic metabolites through the outer mitochondrial membrane, which is highly permeable to small molecules. VDAC is the most abundant protein in the outer membrane, and membrane potentials can toggle VDAC between open or high-conducting and closed or low-conducting forms. VDAC binds to and is regulated in part by hexokinase, an interaction that renders mitochondria less susceptible to pro-apoptotic signals, most likely by intefering with VDAC's capability to respond to Bcl-2 family proteins. While VDAC appears to play a key role in mitochondrially induced cell death, a proposed involvement in forming the mitochondrial permeability transition pore, which is characteristic for damaged mitochondria and apoptosis, has been challenged by more recent studies. Probab=96.69 E-value=0.041 Score=31.15 Aligned_cols=104 Identities=14% Similarity=0.260 Sum_probs=79.8 Q ss_pred CEEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCCCEEEEEEEEEEECCCCEEE Q ss_conf 37999988436777999999861676606688999999647408999998722675446663258998999972565699 Q gi|254780395|r 620 DYAAAVRLSLSPKITFSTQTLINPKDWSTRRTDTTIDYTMGSYEANISYTHIPAYPLYAYDARKTIQSRIKFKINDVLSA 699 (762) Q Consensus 620 D~v~~~~~~~~~~~~l~~~~~~d~~~~~~~r~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~l~~~w~~ 699 (762) +.+.-.-+-..++|.+.+++-||.....+.+..+.+.|+.+.+.+++.- . +-+++.++.-.+++++|.+ T Consensus 130 p~~~~s~v~g~~~~~~G~~~~yd~~~~~~~k~~~~~gy~~~d~~~~l~l---~--------n~~~~~~S~~~~v~~~~~v 198 (276) T cd07306 130 PLVGASAVLGYKGFLLGAEVVYDTAKSKFTKYNFALGYTNGDFELSLKL---N--------NGKTLRGSYFHKVSPRLAV 198 (276) T ss_pred CEEEEEEEECCCCEEEEEEEEEECCCCCEEEEEEEEEEECCCCEEEEEE---C--------CCCEEEEEEEEECCCCEEE T ss_conf 7657999964676798999998433483412789999845881899997---6--------9778999999993876489 Q ss_pred EEEEEECCCCCCCEEEEEEEEECCE-----------EEEEEEEEEC Q ss_conf 9999951654541225656996466-----------9999999971 Q gi|254780395|r 700 DASLKWNMRAELPSHSIGLAYQNDC-----------ATVKIIYKNE 734 (762) Q Consensus 700 ~~~~~ydl~~~~~~~~~Gl~Y~~~C-----------w~~~l~y~r~ 734 (762) .+...|..........+|-+|.-++ -.+++.|+.. T Consensus 199 a~~~~~~~~~~~~~~~vG~~y~ld~~t~vKaKvn~~g~~gl~yq~~ 244 (276) T cd07306 199 GAKVTWYSGTNETTFAVGGQYALDPDALVKAKVNNDGQLGLSYQHK 244 (276) T ss_pred EEEEEEECCCCCCEEEEEEEEECCCCCEEEEEECCCCEEEEEEEEE T ss_conf 9999995289971899999998079978999997897399999995 No 21 >pfam01459 Porin_3 Eukaryotic porin. Probab=96.55 E-value=0.051 Score=30.52 Aligned_cols=117 Identities=13% Similarity=0.297 Sum_probs=85.2 Q ss_pred CEEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCCCEEEEEEEEEEECCCCEEE Q ss_conf 37999988436777999999861676606688999999647408999998722675446663258998999972565699 Q gi|254780395|r 620 DYAAAVRLSLSPKITFSTQTLINPKDWSTRRTDTTIDYTMGSYEANISYTHIPAYPLYAYDARKTIQSRIKFKINDVLSA 699 (762) Q Consensus 620 D~v~~~~~~~~~~~~l~~~~~~d~~~~~~~r~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~l~~~w~~ 699 (762) .+.+.+-..+. +|.+.+..-||.....+......++|..+.+.+.+... +...++.++.-.+++++|.+ T Consensus 132 ~~~~s~~~~~~-~~~lG~e~~y~~~~~~~~~~~~~~~y~~~~~~~~~~~~----------~~~~~l~~Sy~~kvs~~~~~ 200 (272) T pfam01459 132 IVVGSALQGVT-GLALGAEAGYDTASGKLTKYNAALGYAARDYIASLTLN----------NKGDTLTASYYHKVSDKLEV 200 (272) T ss_pred EEEEEEEEECC-CEEEEEEEEEECCCCCCCCEEEEEEEECCCEEEEEEEC----------CCCCEEEEEEEEECCCCEEE T ss_conf 89999999047-47889999995377862223699999769839999991----------78876999999991885799 Q ss_pred EEEEEECCCCCCCEEEEEEEEE-----------CCEEEEEEEEEECCCCCCCCEEEEEEEE Q ss_conf 9999951654541225656996-----------4669999999971477887359999997 Q gi|254780395|r 700 DASLKWNMRAELPSHSIGLAYQ-----------NDCATVKIIYKNESPNIHGYKIQARLSL 749 (762) Q Consensus 700 ~~~~~ydl~~~~~~~~~Gl~Y~-----------~~Cw~~~l~y~r~~~~~~~~~i~~~~~L 749 (762) .+...++.........+|.+|. +.+|.+++.|+..-. +.-.+.+..++ T Consensus 201 g~e~~~~~~~~~~~~~vG~qy~l~~~~~~kakids~g~v~~~~~~~l~--~~~~ltls~~~ 259 (272) T pfam01459 201 GAELTWNFSSNENTTTVGYKYDLDPSTTVKAKVDSNGKVGLLLEKRLR--PGVTLTLSAEV 259 (272) T ss_pred EEEEEEECCCCCEEEEEEEEEECCCCCEEEEEECCCCEEEEEEEEECC--CCCEEEEEEEE T ss_conf 999998027897189999999739998899998899999999999638--99399999998 No 22 >PRK10716 long-chain fatty acid outer membrane transporter; Provisional Probab=92.10 E-value=0.68 Score=22.98 Aligned_cols=51 Identities=8% Similarity=0.090 Sum_probs=28.7 Q ss_pred CEEEEEEEEEEECCCCEEEEEEEEECCCCC--CCEEEEEEEEECCEEEEEEEE Q ss_conf 325899899997256569999999516545--412256569964669999999 Q gi|254780395|r 681 ARKTIQSRIKFKINDVLSADASLKWNMRAE--LPSHSIGLAYQNDCATVKIIY 731 (762) Q Consensus 681 ~~~~~~~~~~~~l~~~w~~~~~~~ydl~~~--~~~~~~Gl~Y~~~Cw~~~l~y 731 (762) +|.-+.+++.|+++++|++...+.|=.-++ +.+..+..+|+...|.+++.+ T Consensus 381 dR~~~S~G~~y~~~~~~tvD~ay~yi~~~~~~i~~~~~~~~~~~~a~i~g~~~ 433 (437) T PRK10716 381 DRFWLSAGTTYAFNKDASVDVGVSYMHGQSVKINEGPYQFESEGKAWLFGTNF 433 (437) T ss_pred CCEEEEEEEEEECCCCCEEEEEEEEEECCCCCEECCCEEEEEEEEEEEEEEEE T ss_conf 88599812688548895799988999426641542403568723489999888 No 23 >PRK09980 ompL outer membrane porin L; Provisional Probab=90.92 E-value=0.89 Score=22.18 Aligned_cols=99 Identities=11% Similarity=0.215 Sum_probs=56.4 Q ss_pred EEEEEECCCCEEEEEEEEECCCC------------CEEEEEEEEEEEEC-CCCEEEEEEEEECCCCCC--C--CCCEEEE Q ss_conf 99988436777999999861676------------60668899999964-740899999872267544--6--6632589 Q gi|254780395|r 623 AAVRLSLSPKITFSTQTLINPKD------------WSTRRTDTTIDYTM-GSYEANISYTHIPAYPLY--A--YDARKTI 685 (762) Q Consensus 623 ~~~~~~~~~~~~l~~~~~~d~~~------------~~~~r~~~~~~~~~-~~~~~~~~y~~~~~~~~~--~--~~~~~~~ 685 (762) +.+.++..+.+.+..|.|||..+ .+..|.+.-+.+.. +.|.+...+.|....+.. . .+...|. T Consensus 108 Lkl~Y~fd~~~~~~~RYRYd~~~yds~dlnG~~d~~d~hr~d~Y~~y~~~~~~~~~~~~tY~~~vnd~~~~NgKk~~~E~ 187 (230) T PRK09980 108 LDVNYKFTPWFNLTVRNRYNHNNYSSTDLNGELDNNDSYEIGNYWNFIITDKFSYTFEPHYFYNVNDFNSSNGTKHHWEI 187 (230) T ss_pred EEEEEEECCCCEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCEEEEEEEEEEEECCCCEECCCCCHHHHE T ss_conf 77777665871588998852455674345776677750773005778975743799730999850770732786404431 Q ss_pred EEEEEEECCCCEEEEEEEEE-CCC-C--CCC--EEEEEEEEE Q ss_conf 98999972565699999995-165-4--541--225656996 Q gi|254780395|r 686 QSRIKFKINDVLSADASLKW-NMR-A--ELP--SHSIGLAYQ 721 (762) Q Consensus 686 ~~~~~~~l~~~w~~~~~~~y-dl~-~--~~~--~~~~Gl~Y~ 721 (762) ++.+.+++++.|.=...+.| |.. . ++. +-++|+.|. T Consensus 188 n~~~~Yk~~~~w~PY~E~~~ld~~~~~~~r~q~r~RVGv~y~ 229 (230) T PRK09980 188 TNTFRYRINEHWLPYFELRWLDRNVGPYHREQNQIRIGAKYF 229 (230) T ss_pred EEEEEEECCCCCCCEEEEEECCCCCCCCCCCEEEEEEEEEEE T ss_conf 026888428962431898742235676777135799988862 No 24 >KOG3126 consensus Probab=85.97 E-value=1.9 Score=20.03 Aligned_cols=47 Identities=6% Similarity=0.089 Sum_probs=37.3 Q ss_pred EEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEECCCCEEEEE Q ss_conf 79999884367779999998616766066889999996474089999 Q gi|254780395|r 621 YAAAVRLSLSPKITFSTQTLINPKDWSTRRTDTTIDYTMGSYEANIS 667 (762) Q Consensus 621 ~v~~~~~~~~~~~~l~~~~~~d~~~~~~~r~~~~~~~~~~~~~~~~~ 667 (762) ++...-+-..++|-+.+++-||.....+.+..+.+.|+.+...+.+. T Consensus 136 ~i~~s~v~g~~g~l~G~~~~fDt~~~~~t~~n~~lgy~~~d~~l~~~ 182 (281) T KOG3126 136 LILGSLVLGHEGWLLGYETTFDTASGKLTKYNAALGYTTEDFTLHLN 182 (281) T ss_pred EEEEEEEECCCCEEEEEEEEEECCCCCEEEEEEEEEEECCCCEEEEE T ss_conf 38888994444347777788851688374267788752687389887 No 25 >TIGR03509 OMP_MtrB_PioB decaheme-associated outer membrane protein, MtrB/PioB family. Members of this protein family are integral proteins of the bacterial outer membrane, associated with multiheme c-type cytochromes involved in electron transfer. The MtrB protein of Shewanella oneidensis MR-1 (SO1776) has been shown to form a complex with 1:1:1 stochiometry with the small, periplasmic decaheme cytochrome MtrA and large, surface-exposed decaheme cytochrome MtrC. Probab=84.82 E-value=2.1 Score=19.67 Aligned_cols=26 Identities=8% Similarity=0.208 Sum_probs=20.7 Q ss_pred CEEEEEEEEEEECCCCEEEEEEEEEC Q ss_conf 32589989999725656999999951 Q gi|254780395|r 681 ARKTIQSRIKFKINDVLSADASLKWN 706 (762) Q Consensus 681 ~~~~~~~~~~~~l~~~w~~~~~~~yd 706 (762) ....+.+.+.++++++..+...+.|. T Consensus 581 ~~h~l~~~~~Y~ls~~~~lrl~y~yE 606 (649) T TIGR03509 581 NQHRLKLYGKYQLSKSSSLRLDYRYE 606 (649) T ss_pred EEEEEEEEEEEEECCCCEEEEEEEEE T ss_conf 57999888657626671788778899 No 26 >TIGR03014 EpsL exopolysaccharide biosynthesis operon protein EpsL. The epsL gene is described as a component of the methanolan exopolysaccharide biosynthesis operon in Methylobacillus sp strain 12S, although no other information regarding its possible function is suggested. Homologs of this gene are found in several other exopolysaccharide operons in a small number of species. These operons contain a subset of the methanolan operon genes by homology and synteny, including the epsH gene which is proposed to act as an "exosortase" directing proteins with a C-terminal tag (PEP-CTERM) to the exopolysaccharide layer. Each of the genomes in which these genes and epsL are found also encode genes with these C-terminal tags. Probab=83.48 E-value=2.4 Score=19.30 Aligned_cols=147 Identities=14% Similarity=0.123 Sum_probs=60.2 Q ss_pred CEEEEEEEEEEEECCCCCEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEE--EEEEEECCCCEEEEEEEE-- Q ss_conf 3478878888884288606632300378841333334543336667788876644379--999884367779999998-- Q gi|254780395|r 565 GNRTNLGIRYIGSFNNLFTINGVIGRSIHILGTNSFSIPDSIGIGQNSGLEDKFSDYA--AAVRLSLSPKITFSTQTL-- 640 (762) Q Consensus 565 ~~r~~~Gl~~~~~~~~~~~~~~~iGQ~~~~~~~~~~~~~~~~~~~~~sgl~~~~SD~v--~~~~~~~~~~~~l~~~~~-- 640 (762) .+++.+++.|...-+ ..+.+.+|-..+-.+..+ .+++|-+. +.+.|+|.....|..... T Consensus 214 q~e~~l~~~W~~TgK--t~~~~klGy~~r~fd~~s---------------~rDfsG~~~~v~~~W~pT~kTtl~~~~sR~ 276 (381) T TIGR03014 214 QTEHELRFDWAVTGK--SKLQGSIGYVDREHDHLS---------------QRDFSGVIGRLNADWMVTGKTSLNAAISRE 276 (381) T ss_pred EEEEEEEEEEEECCE--EEEEEEEEEEEEECCCCC---------------CCCCCCCCEEEEEEEEECCCEEEEEEEEEE T ss_conf 326667538986150--799999847873245556---------------445667555778899732558999999764 Q ss_pred ECCCC---C-EEEEEEEEEEEEC---CCCEEEEEEEEECCC----CCC----CCCCEEEEEEEEEEECCCCEEEEEEEEE Q ss_conf 61676---6-0668899999964---740899999872267----544----6663258998999972565699999995 Q gi|254780395|r 641 INPKD---W-STRRTDTTIDYTM---GSYEANISYTHIPAY----PLY----AYDARKTIQSRIKFKINDVLSADASLKW 705 (762) Q Consensus 641 ~d~~~---~-~~~r~~~~~~~~~---~~~~~~~~y~~~~~~----~~~----~~~~~~~~~~~~~~~l~~~w~~~~~~~y 705 (762) ....+ . -..+..+.+.... ++..+.+...|...+ +.. -.|+...+.+++.|++...+.+.+.+.| T Consensus 277 ~~~~~~~~ssy~~~~~~~l~~~w~~T~kv~~~~~~~y~~rdY~G~~~~~~~~R~D~~~~~~l~~~Y~p~R~l~ls~~~~~ 356 (381) T TIGR03014 277 LANYQTVTSSYYRNRGTSIGPTWQATSKIAVRGRLDYEERDFEGDPLVGPPARSDRTRSGSLSLDWSPVRAVRISAAFQR 356 (381) T ss_pred ECCCCCCCCEEEEEEEEEEEEEEEEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEEEEEEEEEEEEEE T ss_conf 03666777406995338842105211239998888998412358645676675432488778999832023888757998 Q ss_pred CCCCCCCEEEEEEEEECCEEEEEEEE Q ss_conf 16545412256569964669999999 Q gi|254780395|r 706 NMRAELPSHSIGLAYQNDCATVKIIY 731 (762) Q Consensus 706 dl~~~~~~~~~Gl~Y~~~Cw~~~l~y 731 (762) .-++. ..-++.|...-+.+++.+ T Consensus 357 ~~RdS---N~~~~~Yd~n~~~ls~~~ 379 (381) T TIGR03014 357 EKRDS---NSDGFDFDSNSVSVSATL 379 (381) T ss_pred EECCC---CCCCCCCCCCEEEEEEEE T ss_conf 52457---655575123238999999 No 27 >pfam05096 Glu_cyclase_2 Glutamine cyclotransferase. This family of enzymes EC:2.3.2.5 catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes. Probab=81.34 E-value=2.9 Score=18.78 Aligned_cols=44 Identities=5% Similarity=-0.043 Sum_probs=24.5 Q ss_pred EEECCCCEEEEECCE--EEECCCCCEEEEEEE-EEECCCCEEEEEEEE Q ss_conf 998637879997348--998389878997069-994302559998479 Q gi|254780395|r 89 TFNHKNHRIIASGNI--KLIEPDKRQIHAEYL-DITDDFTNGIIKNLT 133 (762) Q Consensus 89 ~y~~~~~~~~a~G~v--~~~~~~g~~~~~~~~-~~~~~~~~g~~~~~~ 133 (762) .+|.+++.+..+-++ .+ -++|.++.++++ .|+.+.+.|++.+.. T Consensus 72 ~~d~~tg~v~~~~~L~~~~-FgEGiT~~~d~i~qLTWk~~~~fvyd~~ 118 (264) T pfam05096 72 VYDLTQEIFSEKIAFPDTV-FGEGLTVVEDYVYLLTWKDGVAYKYDKA 118 (264) T ss_pred EEECCCCCEEEEEECCHHC-CEEEEEEECCEEEEEEECCCEEEEECCC T ss_conf 9978988699999678000-2000799899999999517827997275 No 28 >pfam06178 KdgM Oligogalacturonate-specific porin protein (KdgM). This family consists of several bacterial proteins which are homologous to the oligogalacturonate-specific porin protein KdgM from Erwinia chrysanthemi. The phytopathogenic Gram-negative bacteria Erwinia chrysanthemi secretes pectinases, which are able to degrade the pectic polymers of plant cell walls, and uses the degradation products as a carbon source for growth. KdgM is a major outer membrane protein, whose synthesis is strongly induced in the presence of pectic derivatives. KdgM behaves like a voltage-dependent porin that is slightly selective for anions and that exhibits fast block in the presence of trigalacturonate. In contrast to most porins, KdgM seems to be monomeric. Probab=81.05 E-value=3 Score=18.72 Aligned_cols=75 Identities=15% Similarity=0.190 Sum_probs=38.6 Q ss_pred EEEEEEEEEEEEC-CCCEEEEEEEEECC-CCCC---CCCCEEEEEEEEEEECCCCEEEEEEEEECC----CCC-CCEEEE Q ss_conf 0668899999964-74089999987226-7544---666325899899997256569999999516----545-412256 Q gi|254780395|r 647 STRRTDTTIDYTM-GSYEANISYTHIPA-YPLY---AYDARKTIQSRIKFKINDVLSADASLKWNM----RAE-LPSHSI 716 (762) Q Consensus 647 ~~~r~~~~~~~~~-~~~~~~~~y~~~~~-~~~~---~~~~~~~~~~~~~~~l~~~w~~~~~~~ydl----~~~-~~~~~~ 716 (762) ...|.+..+.|+. +.|.+.....|... +... +.+...+.++.+.+++.++|.=...+.+=- .++ ..+.++ T Consensus 133 ~~~r~d~~~~Y~~~d~~~~~ye~~y~~~~~~~~~~d~k~~~~e~n~~~~Yk~~~~W~PyvEvg~v~~~~~~d~rq~RyRV 212 (218) T pfam06178 133 DQYRIDGYIDYKINDKFSYTYEPTYYKKVNDFKSSDGKKTNWEYNVKLAYKIDKHWTPYVEVGNVDVNNNTDERQTRYRV 212 (218) T ss_pred CCEEEEEEEEEECCCCEEEEEEEEEEEECCCEECCCCCCCEEEEEEEEEEECCCCCCEEEEEEECCCCCCCCCCEEEEEE T ss_conf 60799899953216755899889999953835604798851688899999828982304999842567888662767998 Q ss_pred EEEEE Q ss_conf 56996 Q gi|254780395|r 717 GLAYQ 721 (762) Q Consensus 717 Gl~Y~ 721 (762) ||+|. T Consensus 213 Gv~Y~ 217 (218) T pfam06178 213 GVQYS 217 (218) T ss_pred EEEEE T ss_conf 88976 No 29 >COG3117 Uncharacterized protein conserved in bacteria [Function unknown] Probab=75.77 E-value=4.2 Score=17.69 Aligned_cols=82 Identities=11% Similarity=0.221 Sum_probs=41.8 Q ss_pred EEEEEEEEEEECCCCEEEEEEEE-EEEE---CCEEEEEEEEEEECCCCEEEEECCEEEEC--CC---CCEEEEEEEEEEC Q ss_conf 79992079997668979999879-9998---99699951899986378799973489983--89---8789970699943 Q gi|254780395|r 52 LDISSDEIVLNSEGSTTTAVGNV-KIEY---KGYHLSARDITFNHKNHRIIASGNIKLIE--PD---KRQIHAEYLDITD 122 (762) Q Consensus 52 ~~~~ad~~~~~~~~~~~~a~G~v-~i~~---~~~~l~Ad~i~y~~~~~~~~a~G~v~~~~--~~---g~~~~~~~~~~~~ 122 (762) ..+.|+++.+-.+.+....+-=+ .+.. ...++.||++++..+.. +...|||+-.. ++ -..+..+++.+++ T Consensus 60 y~l~a~~~eh~~~~~~t~ft~P~l~lf~e~~q~w~v~Ad~a~l~~d~~-l~l~~~vvv~~~t~d~~q~~~l~T~~~~V~l 138 (188) T COG3117 60 YRLTAQHVEHYSDSSDTHFTAPVLTLFRENRQTWTVQADRAKLTKDGK-LYLLGDVVVVALTPDSGQLQRLTTESAQVNL 138 (188) T ss_pred EEEEHHHHHHCCCCCCEEEECCEEEEECCCCCCEEEEHHHHEECCCCE-EEEEECEEEEEECCCCCCCCEEEEEEEEECC T ss_conf 776323454154566557715359988388762799810202157855-9987120677514776676458710237613 Q ss_pred CCCEEEEEEEEE Q ss_conf 025599984799 Q gi|254780395|r 123 DFTNGIIKNLTI 134 (762) Q Consensus 123 ~~~~g~~~~~~~ 134 (762) +.+...-+.+.- T Consensus 139 ~tqd~~te~~V~ 150 (188) T COG3117 139 KTQDAQTEDPVT 150 (188) T ss_pred CCCCEECCCCEE T ss_conf 310000377566 No 30 >TIGR01778 TonB-copper TonB-dependent copper receptor; InterPro: IPR010100 This entry represents a family of proteobacterial TonB-dependent outer membrane receptor/transporters which bind and translocate copper ions. Two characterised members of this family exist, outer membrane protein C (OprC) from Pseudomonas aeruginosa and NosA from Pseudomonas stutzeri, which is responsible for providing copper for the copper-containing N2O reductase .. Probab=74.45 E-value=2.7 Score=18.99 Aligned_cols=52 Identities=17% Similarity=0.280 Sum_probs=26.7 Q ss_pred EEEEEEEEEE-CCCEE-EEEEE--EEEECCCCCCCCCCCCEEEEEEE---EEEEECCCCEEEE Q ss_conf 0210246872-39759-99987--99731376657788850899966---9999677777999 Q gi|254780395|r 141 TYLTASSAQI-IGQRT-IFDKG--TYTACSSCSKPNSRPPFWIVKSK---RAILNRKTHTIRL 196 (762) Q Consensus 141 ~~~~~~~~~~-~~~~~-~~~~~--~~T~C~~~~~~~~~~P~W~i~a~---~i~~d~~~~~~~~ 196 (762) +...-=.+|- .+.+. |+.+| ++=+|+ .-- .+|+=+|..+ +|++.+.-+.|.+ T Consensus 34 G~ggDP~~RG~~gsRLNI~~DG~~v~GaCp-nRM---D~Pt~Yi~pEsYd~~TViKGpQtV~y 92 (668) T TIGR01778 34 GSGGDPVFRGQFGSRLNIVTDGGFVLGACP-NRM---DPPTSYISPESYDKVTVIKGPQTVKY 92 (668) T ss_pred CCCCCCEEECCCCCEEEEEECCCEEECCCC-CCC---CCCCCCCCCCCCCEEEEEECCCCEEE T ss_conf 878871065478634678657866887777-888---87645678751461169856720000 No 31 >pfam02462 Opacity Opacity family porin protein. Pathogenic Neisseria spp. possess a repertoire of phase-variable Opacity proteins that mediate various pathogen--host cell interactions. These proteins are integral membrane proteins related to other porins. Probab=72.99 E-value=4.9 Score=17.24 Aligned_cols=38 Identities=16% Similarity=0.369 Sum_probs=24.6 Q ss_pred EEEEEEEEECCCCEEEEEEEEECC----CCCC-C--EEEEEEEEE Q ss_conf 899899997256569999999516----5454-1--225656996 Q gi|254780395|r 684 TIQSRIKFKINDVLSADASLKWNM----RAEL-P--SHSIGLAYQ 721 (762) Q Consensus 684 ~~~~~~~~~l~~~w~~~~~~~ydl----~~~~-~--~~~~Gl~Y~ 721 (762) ++-++..+.|++||.+.+.++|+- ++.+ . +..+|+.|. T Consensus 81 gv~AGVgyditpnltLD~GYRY~~~G~le~tr~~theas~GvRY~ 125 (126) T pfam02462 81 GVLAGVSYKLTPNVDLDTGYRYNRLGKVTVTNVRSGEASAGVRYK 125 (126) T ss_pred EEEEEEEEECCCCEEEECCEEEEECCCCCCCEECCCCEEEEEEEE T ss_conf 887436883278737712528800464134302012136886663 No 32 >pfam12600 DUF3769 Protein of unknown function (DUF3769). This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 560 and 931 amino acids in length. Probab=70.58 E-value=5.5 Score=16.89 Aligned_cols=164 Identities=10% Similarity=0.049 Sum_probs=89.2 Q ss_pred CEEEEEEEEEECCCEEEEEEEEEEECCCCCCCCCCCCEEEEEEEEEEEECC-CCEEEEECCEEEECCEEEEECCC--EEC Q ss_conf 502102468723975999987997313766577888508999669999677-77799944289982706863254--001 Q gi|254780395|r 140 ETYLTASSAQIIGQRTIFDKGTYTACSSCSKPNSRPPFWIVKSKRAILNRK-THTIRLEKPYLEIFGNSIFYFPL--IEI 216 (762) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~T~C~~~~~~~~~~P~W~i~a~~i~~d~~-~~~~~~~~a~l~i~~vPv~y~Py--~~~ 216 (762) ..+..++.+..+++...-+++.||-- |=.+|--.|+|..++.-+. .+.+..+..+|-+-.. +-+|. =.+ T Consensus 8 rwRF~a~~i~i~~~~W~a~~~~fTND------pf~ppq~~l~a~~~~~~~~~~~~i~s~~~~Li~d~~--~~iP~~~r~~ 79 (403) T pfam12600 8 RWRFQADRISFDGDGWKAERVRFTND------PFSPPELEIRAESVTAREISDGELKSRRNRLILDQR--LSIPLGRRRL 79 (403) T ss_pred EEEEECCEEEECCCCCEEEEEEECCC------CCCCHHEEEECCCEEEEECCCCCEEECCCEEEEECC--CCCCCCCCEE T ss_conf 89998038898077777715788089------999713699927269998799817962356998343--0135666405 Q ss_pred CCCCCCCCCCCCCEEEEC--CCCCCCCEECEE-EEECCCCCCEEEEEEECCCCEEEEEEEEEEEECCCCEEEEEEEEECC Q ss_conf 245554545410005641--333343000103-43024652014554202431011001345530256079999875055 Q gi|254780395|r 217 PDETVVRKTGFLTPLFSS--GEKQRFGVGIPY-YLVISDNSDATFTFSPHPKKGILGEMELRKYFHSGKHTLHAAYMYNN 293 (762) Q Consensus 217 P~~~~~R~sGfL~P~~~~--s~~~G~~~~~Py-y~~i~~~~D~Tltp~~~s~rG~~l~~eyRy~~~~g~~~~~~~~~~~d 293 (762) .. ..++...-. .+|+ -.+.|+.|.-.+ -..|.++.-+.++|+++-+|.+..++.= + ..++ T Consensus 80 ~~-~~~~~~~~w--~~G~D~~drdG~fi~R~~~~i~~~~~~~l~l~PQfliQRai~g~t~s--------~------~s~~ 142 (403) T pfam12600 80 VI-DREEVENRW--VIGYDNDDRDGLFVGRNLNPIDLTENTSLSLQPQFLIQRAINGETNS--------I------TSPK 142 (403) T ss_pred EC-CCCCCCCCE--EEEECCCCCCEEEEECCCCEEECCCCEEEEEEEHHHHHHHHCCCCCC--------C------CCCC T ss_conf 40-542346745--87422677760799627860730795479980389966765378776--------6------5765 Q ss_pred CCCCCCCCEEEEEEECCEECCEEEEEEEEEEEEEE Q ss_conf 56765432011232021000000356678999951 Q gi|254780395|r 294 NVESGEERHQAMLASIAEFEINPIWNLGWHLKKQT 328 (762) Q Consensus 294 ~~~~~~~~~r~~~~~~~~~~l~~~~~~~~~~~~~S 328 (762) ......-..-..+.++-+..++++|.+..+.++-| T Consensus 143 v~~~~~~~D~FgL~a~l~g~~~~~w~l~~~~~L~S 177 (403) T pfam12600 143 VDQDNNLPDLFGLKARLNGNLSPNTSLELDADLSS 177 (403) T ss_pred CCCCCCCCCCEEEEEEEEEECCCCCEEEEEEEECC T ss_conf 44467654426799998105089825999752335 No 33 >pfam00267 Porin_1 Gram-negative porin. Probab=69.02 E-value=6 Score=16.67 Aligned_cols=81 Identities=19% Similarity=0.284 Sum_probs=46.7 Q ss_pred EEEEEEEEECCC-CEEEEEEEEECCCCC----CCCCCEEEEEEEEEEECCCCEEEEEEEEECCCCCCCEEEEEEEEECCE Q ss_conf 889999996474-089999987226754----466632589989999725656999999951654541225656996466 Q gi|254780395|r 650 RTDTTIDYTMGS-YEANISYTHIPAYPL----YAYDARKTIQSRIKFKINDVLSADASLKWNMRAELPSHSIGLAYQNDC 724 (762) Q Consensus 650 r~~~~~~~~~~~-~~~~~~y~~~~~~~~----~~~~~~~~~~~~~~~~l~~~w~~~~~~~ydl~~~~~~~~~Gl~Y~~~C 724 (762) ..|+.+.|.... +...+.|.+...... ...+...++.+.+.|.|+++..+.+.+..++.++......++. +++- T Consensus 247 ~~e~~~~Y~~~~g~~~~~~Y~~~~~~~~~~~~~~~~~~~q~~lg~dY~fSKrt~~Y~~Y~~~~~d~~~~~~~~~~-~dd~ 325 (334) T pfam00267 247 NFEVAAQYQFDFGLRPRVSYAQSKGKDLNGAYGDNDLVKYVDVGATYYFNKNMSAYVDYKINLLDKGDDFAAGIN-TDDA 325 (334) T ss_pred EEEEEEEEEECCCCCEEEEEEEEECCCCCCCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCC-CCCE T ss_conf 699999997424631699999874222356557764589999998999658804999998762566654446777-6888 Q ss_pred EEEEEEE Q ss_conf 9999999 Q gi|254780395|r 725 ATVKIIY 731 (762) Q Consensus 725 w~~~l~y 731 (762) |.++|.| T Consensus 326 ~~vGlRY 332 (334) T pfam00267 326 VAVGLVY 332 (334) T ss_pred EEEEEEE T ss_conf 9998088 No 34 >PRK10554 outer membrane porin protein C; Provisional Probab=67.37 E-value=6.4 Score=16.45 Aligned_cols=85 Identities=14% Similarity=0.257 Sum_probs=51.7 Q ss_pred EEEEEEEEEEEECCC-CEEEEEEEEECCCCCC--------CCCCEEEEEEEEEEECCCCEEEEEEEEECCCCCC-CEEEE Q ss_conf 066889999996474-0899999872267544--------6663258998999972565699999995165454-12256 Q gi|254780395|r 647 STRRTDTTIDYTMGS-YEANISYTHIPAYPLY--------AYDARKTIQSRIKFKINDVLSADASLKWNMRAEL-PSHSI 716 (762) Q Consensus 647 ~~~r~~~~~~~~~~~-~~~~~~y~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~w~~~~~~~ydl~~~~-~~~~~ 716 (762) +..-.|+.+.|+++. ++.++.|.+-+..... ..+..+.+.+++.|.+++|.+....++.||-++- ..... T Consensus 278 k~~~~E~~a~Y~fd~glrp~~~Y~~~~~~d~~~~~~~~~~~~d~~~~~~iga~Y~FnkN~~~y~~Yk~NllD~~~~~~~~ 357 (373) T PRK10554 278 KAQNFEVVAQYQFDFGLRPSVAYLQSKGKDLGNIYGRNYGDQDILKYVDVGATYYFNKNMSTYVDYKINLLDDNDFTRDA 357 (373) T ss_pred CCCCEEEEEEEEECCCCCEEEEEEHHHCCCCCCCCCCCCCCCCHHHEEEEEEEEEECCCEEEEEEEEEECCCCCCCCCCC T ss_conf 23337899997623441101001010011123455566573000023897679997787216888875136765310003 Q ss_pred EEEEECCEEEEEEEEE Q ss_conf 5699646699999999 Q gi|254780395|r 717 GLAYQNDCATVKIIYK 732 (762) Q Consensus 717 Gl~Y~~~Cw~~~l~y~ 732 (762) |+. .++-+.++|.|+ T Consensus 358 ~~~-~DD~~alGlrY~ 372 (373) T PRK10554 358 GIN-TDDIVALGLVYQ 372 (373) T ss_pred CCC-CCCEEEEEEEEE T ss_conf 677-677699962881 No 35 >PRK10159 outer membrane phosphoporin protein E; Provisional Probab=66.06 E-value=6.8 Score=16.29 Aligned_cols=80 Identities=15% Similarity=0.198 Sum_probs=49.5 Q ss_pred EEEEEEEEEECCC-CEEEEEEEEECCCCCC---CCCCEEEEEEEEEEECCCCEEEEEEEEECCCCCCCEEEEEEEEECCE Q ss_conf 6889999996474-0899999872267544---66632589989999725656999999951654541225656996466 Q gi|254780395|r 649 RRTDTTIDYTMGS-YEANISYTHIPAYPLY---AYDARKTIQSRIKFKINDVLSADASLKWNMRAELPSHSIGLAYQNDC 724 (762) Q Consensus 649 ~r~~~~~~~~~~~-~~~~~~y~~~~~~~~~---~~~~~~~~~~~~~~~l~~~w~~~~~~~ydl~~~~~~~~~Gl~Y~~~C 724 (762) ...|+.+.|+.+. ++.+++|.+.+..... ..+..+.+.+.+.|.+++|......+..++-++-. .+++. .++- T Consensus 266 ~~~E~~a~Y~fd~gl~~~~~y~~~~~~d~~~~~~~d~~~~~~~ga~Y~Fnkn~~~y~~Yk~n~ld~~~--~~~~~-~dd~ 342 (351) T PRK10159 266 QNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDN--KLNIN-NDDI 342 (351) T ss_pred CCEEEEEEEEECCCCEEEEEEEEEECCCCCCCCCCEEEEEEEEEEEEEECCCCEEEEEEEECCCCCCC--CCCCC-CCCE T ss_conf 33789898731145200322566512345577861322069986799957981589988601557755--56887-6788 Q ss_pred EEEEEEE Q ss_conf 9999999 Q gi|254780395|r 725 ATVKIIY 731 (762) Q Consensus 725 w~~~l~y 731 (762) |.+++.| T Consensus 343 ~~~GlrY 349 (351) T PRK10159 343 VAVGMTY 349 (351) T ss_pred EEEEEEE T ss_conf 9996188 No 36 >PRK09840 catecholate siderophore receptor Fiu; Provisional Probab=63.43 E-value=7.6 Score=15.96 Aligned_cols=18 Identities=22% Similarity=0.122 Sum_probs=12.1 Q ss_pred EEEEEEEEEEEEEEEECC Q ss_conf 999888998778875125 Q gi|254780395|r 459 LFTPIANIRGDLHYLSFN 476 (762) Q Consensus 459 ~~tp~~~lr~~~Y~~~~~ 476 (762) .++..+++|.+.|..+.+ T Consensus 459 ~l~l~~G~R~d~~~~~~~ 476 (760) T PRK09840 459 DFELNGGIRLDNYHTEYD 476 (760) T ss_pred CEEEEEEEEEEEEEECCC T ss_conf 579999888875430111 No 37 >PRK10808 outer membrane protein A; Reviewed Probab=61.57 E-value=8.2 Score=15.75 Aligned_cols=17 Identities=24% Similarity=0.538 Sum_probs=8.2 Q ss_pred EECCEEEEEEEEEEEEE Q ss_conf 00000035667899995 Q gi|254780395|r 311 EFEINPIWNLGWHLKKQ 327 (762) Q Consensus 311 ~~~l~~~~~~~~~~~~~ 327 (762) .+.|+++|.++.++++. T Consensus 153 eY~lt~~~~~R~eYq~~ 169 (347) T PRK10808 153 EYAITPEWATRLEYQWT 169 (347) T ss_pred EEEECCCCEEEEEEEEE T ss_conf 58806372688889998 No 38 >pfam12094 DUF3570 Protein of unknown function (DUF3570). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 396 to 444 amino acids in length. Probab=59.75 E-value=8.8 Score=15.54 Aligned_cols=28 Identities=11% Similarity=0.190 Sum_probs=20.1 Q ss_pred EECCCEEEEEEEEEEEECCCCCEEEEEE Q ss_conf 0046347887888888428860663230 Q gi|254780395|r 561 RIEGGNRTNLGIRYIGSFNNLFTINGVI 588 (762) Q Consensus 561 Rie~~~r~~~Gl~~~~~~~~~~~~~~~i 588 (762) |+..-.-.++||++.....++..+.+.. T Consensus 345 rLs~f~a~t~Gl~~~~~l~~~~~~~~~~ 372 (407) T pfam12094 345 RLSAFGAYTIGLKFGKKLSGGLEIDLRY 372 (407) T ss_pred CHHCCCCEEEEEEEEEEECCCCEEEEEE T ss_conf 4002766488367999937994898541 No 39 >pfam04338 DUF481 Protein of unknown function, DUF481. This family includes several proteins of uncharacterized function. Probab=55.13 E-value=10 Score=15.04 Aligned_cols=41 Identities=20% Similarity=0.230 Sum_probs=24.0 Q ss_pred CCCCCCCCCEECCCEEEEEEEEEEEECCCCCEEEEEEEEEEEEC Q ss_conf 21223574000463478878888884288606632300378841 Q gi|254780395|r 552 TQNRFSGFDRIEGGNRTNLGIRYIGSFNNLFTINGVIGRSIHIL 595 (762) Q Consensus 552 ~~nrfsG~DRie~~~r~~~Gl~~~~~~~~~~~~~~~iGQ~~~~~ 595 (762) ..++|.|+|.. .-+..|+.+.........+.+.+|-.|..+ T Consensus 75 e~D~f~~i~~r---~~~g~G~Gy~~~~~~~~~l~~~~G~~y~~e 115 (210) T pfam04338 75 ERDRFSGYDYR---ATLGAGLGYRFIDTDKTKLSVEAGPGYRYE 115 (210) T ss_pred EECCCCCEEEE---EEECCCEEEEEEECCCEEEEEEECCEEEEE T ss_conf 98422364589---997140149999689849999966639999 No 40 >PRK10523 lipoprotein involved with copper homeostasis and adhesion; Provisional Probab=54.74 E-value=11 Score=15.00 Aligned_cols=106 Identities=13% Similarity=0.254 Sum_probs=51.0 Q ss_pred CCEEEEEEEEEEEECCCCEEEEEEEE-EEEECCEEEEEEEEEEECCCCEEE---E-ECCEEEECCCCCEEEEEEEEEECC Q ss_conf 66079992079997668979999879-999899699951899986378799---9-734899838987899706999430 Q gi|254780395|r 49 HSELDISSDEIVLNSEGSTTTAVGNV-KIEYKGYHLSARDITFNHKNHRII---A-SGNIKLIEPDKRQIHAEYLDITDD 123 (762) Q Consensus 49 ~~~~~~~ad~~~~~~~~~~~~a~G~v-~i~~~~~~l~Ad~i~y~~~~~~~~---a-~G~v~~~~~~g~~~~~~~~~~~~~ 123 (762) +..+.+.+|....-++.-.=...|+- .-.+|.....||+++..-..++-. + .++++|.+.+|+.|.. . +|+. T Consensus 58 ~ttL~L~~DGtfvl~~~YlG~~~g~~~F~e~GrW~rtadklvLt~s~gek~yfr~~g~~L~mLD~~G~pI~S-~--lNY~ 134 (234) T PRK10523 58 DTSLFLEKDGTWVMNERYLGAREGPSSFASYGTWARTADKLVLTDSKGEKSYYRAKGDALEMLDREGNPIES-Q--LNYT 134 (234) T ss_pred EEEEEECCCCCEEEEEEEECCCCCCCCHHHCCEEEECCCEEEEECCCCCEEEEEECCCEEEEECCCCCCCCC-C--CCCE T ss_conf 358998689878977787045478863322460363288589747998740135628804671568980214-7--8862 Q ss_pred CCEEEEEEEEEEEECCCEEEEEEEEEECCCEEEE-EEEEEEECCCCC Q ss_conf 2559998479997067502102468723975999-987997313766 Q gi|254780395|r 124 FTNGIIKNLTIKIPADETYLTASSAQIIGQRTIF-DKGTYTACSSCS 169 (762) Q Consensus 124 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~T~C~~~~ 169 (762) +.. +. ..++ ..-....|....+ +-++||-|.-+. T Consensus 135 L~~--v~---~~lp-------~tp~~l~G~y~YmADAAtFtdCaTGk 169 (234) T PRK10523 135 LEP--VQ---QSLP-------KTPMPLRGMYFYMADAATFTDCATGK 169 (234) T ss_pred EEE--CC---CCCC-------CCCCCCCEEEEEECCCCEEEECCCCC T ss_conf 202--23---5678-------88741112577505772256747898 No 41 >pfam05326 SVA Seminal vesicle autoantigen (SVA). This family consists of seminal vesicle autoantigen and prolactin-inducible (PIP) proteins. Seminal vesicle autoantigen (SVA) is specifically present in the seminal plasma of mice. This 19-kDa secretory glycoprotein suppresses the motility of spermatozoa by interacting with phospholipid. PIP, has several known functions. In saliva, this protein plays a role in host defence by binding to microorganisms such as Streptococcus. PIP is an aspartyl proteinase and it acts as a factor capable of suppressing T-cell apoptosis through its interaction with CD4. Probab=52.80 E-value=11 Score=14.91 Aligned_cols=27 Identities=30% Similarity=0.627 Sum_probs=19.2 Q ss_pred EEEEEEECCCCCCCCCCCCEEEEEEEEEE Q ss_conf 98799731376657788850899966999 Q gi|254780395|r 158 DKGTYTACSSCSKPNSRPPFWIVKSKRAI 186 (762) Q Consensus 158 ~~~~~T~C~~~~~~~~~~P~W~i~a~~i~ 186 (762) -+..||+| .|+++|. .=.|.|.+++-. T Consensus 80 ~n~~yT~C-LC~~~~~-~FfWdi~v~~t~ 106 (124) T pfam05326 80 FNYKYTAC-LCNDYPN-TFFWDIQVNRTV 106 (124) T ss_pred CCEEEEEE-ECCCCCC-CEEEEEEECCEE T ss_conf 22266138-3279950-258999977479 No 42 >pfam11751 DUF3308 Protein of unknown function (DUF3308). Some members of this family of bacterial proteins are annotated as being one of the several TonB-dependent siderophore receptors, but this could not be confirmed. Probab=49.93 E-value=13 Score=14.50 Aligned_cols=73 Identities=16% Similarity=0.244 Sum_probs=31.9 Q ss_pred EEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEECC-CCEEEEEEEEECCCCCCCCCCEEEEEEEEEEECCCCEEEEEE Q ss_conf 9988436777999999861676606688999999647-408999998722675446663258998999972565699999 Q gi|254780395|r 624 AVRLSLSPKITFSTQTLINPKDWSTRRTDTTIDYTMG-SYEANISYTHIPAYPLYAYDARKTIQSRIKFKINDVLSADAS 702 (762) Q Consensus 624 ~~~~~~~~~~~l~~~~~~d~~~~~~~r~~~~~~~~~~-~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~l~~~w~~~~~ 702 (762) ...+.+.+.+.+.-.+.+-.........++++.+... ++.+.++||.. ..+.+.+.++++++|.+... T Consensus 184 gy~~~l~~~~~i~Ps~l~~~~~~~~~~~d~~~~~~~~~~~~~G~~YR~~-----------~ai~~~~G~~~~~~~~~gYs 252 (274) T pfam11751 184 GYVFDLNEELKLKPSVLYKYQEGAPLSLDLNANLLYNDKFWLGASYRNN-----------DAISALVGFNINDNLRIGYS 252 (274) T ss_pred EEEEECCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECEEEEEEEECCC-----------CCEEEEEEEEECCCEEEEEE T ss_conf 8999868982899999999975997128778999990339999998489-----------95999996998896999999 Q ss_pred EEECC Q ss_conf 99516 Q gi|254780395|r 703 LKWNM 707 (762) Q Consensus 703 ~~ydl 707 (762) +.+.+ T Consensus 253 Yd~~~ 257 (274) T pfam11751 253 YDYTT 257 (274) T ss_pred EEEEC T ss_conf 97024 No 43 >PRK04922 tolB translocation protein TolB; Provisional Probab=48.17 E-value=11 Score=14.98 Aligned_cols=25 Identities=24% Similarity=0.385 Sum_probs=15.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9533102476789999999987632 Q gi|254780395|r 1 MRYNYRHLIGATTFFLAFNIITPQG 25 (762) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~ 25 (762) ||...|-|....+|++.+..++.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~a~~a~a 25 (439) T PRK04922 1 MKKPLRWLAALTALLLPLSALAQQQ 25 (439) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9616899999999999877886168 No 44 >PRK11433 putative xanthine dehydrogenase iron-sulfur-binding subunit; Provisional Probab=45.85 E-value=15 Score=14.09 Aligned_cols=19 Identities=21% Similarity=0.182 Sum_probs=10.2 Q ss_pred CCCCCHHHHHHHHHHHHHH Q ss_conf 9533102476789999999 Q gi|254780395|r 1 MRYNYRHLIGATTFFLAFN 19 (762) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~ 19 (762) ++.++|.++..+...++.. T Consensus 10 ~~~srR~~~~~~~~~~a~~ 28 (217) T PRK11433 10 LKLTRRDLLIASAATAATA 28 (217) T ss_pred CCEEHHHHHHHHHHHHHHH T ss_conf 4502999999999999987 No 45 >PRK10002 outer membrane protein F; Provisional Probab=45.80 E-value=15 Score=14.08 Aligned_cols=82 Identities=18% Similarity=0.231 Sum_probs=53.8 Q ss_pred EEEEEEEEEEEECCC-CEEEEEEEEECCCCCCC---CCCEEEEEEEEEEECCCCEEEEEEEEECCCCCCCEEEEEEEEEC Q ss_conf 066889999996474-08999998722675446---66325899899997256569999999516545412256569964 Q gi|254780395|r 647 STRRTDTTIDYTMGS-YEANISYTHIPAYPLYA---YDARKTIQSRIKFKINDVLSADASLKWNMRAELPSHSIGLAYQN 722 (762) Q Consensus 647 ~~~r~~~~~~~~~~~-~~~~~~y~~~~~~~~~~---~~~~~~~~~~~~~~l~~~w~~~~~~~ydl~~~~~~~~~Gl~Y~~ 722 (762) +....|+.+.|+++. ++..+.|.+.+...... .+..+.+.+.+.|.+++|.+....+..++-++-. .+|+. .+ T Consensus 275 k~~~~El~a~Y~fd~gl~p~~~Y~~~~~~d~~~~~~~d~~~~~~lga~Y~FnkN~~ty~~Yk~NllD~~~--~~~~~-~D 351 (362) T PRK10002 275 KTQDVLLVAQYQFDFGLRPSIAYTKSKAKDVEGIGDVDLVNYFEVGATYYFNKNMSTYVDYIINQIDSDN--KLGVG-SD 351 (362) T ss_pred CCCCEEEEEEEEECCCCCEEEEEEEEECCCCCCCCCCEEEEEEEEEEEEEECCCEEEEEEEEHHCCCCCC--CCCCC-CC T ss_conf 1212679889961453112456776422334566765100059987599955870268766000567655--56887-77 Q ss_pred CEEEEEEEE Q ss_conf 669999999 Q gi|254780395|r 723 DCATVKIIY 731 (762) Q Consensus 723 ~Cw~~~l~y 731 (762) +-|.++|.| T Consensus 352 D~~avGlrY 360 (362) T PRK10002 352 DTVAVGIVY 360 (362) T ss_pred CEEEEEEEE T ss_conf 879997188 No 46 >COG4313 Protein involved in meta-pathway of phenol degradation [Energy production and conversion] Probab=44.74 E-value=15 Score=13.98 Aligned_cols=65 Identities=12% Similarity=0.160 Sum_probs=29.1 Q ss_pred EEEEEEEECCCCEEEEEEEEEC--CCCCCC---------EEE--EE--EEEECC-EEEEEEEEEECCC---CCCCCEEEE Q ss_conf 9989999725656999999951--654541---------225--65--699646-6999999997147---788735999 Q gi|254780395|r 685 IQSRIKFKINDVLSADASLKWN--MRAELP---------SHS--IG--LAYQND-CATVKIIYKNESP---NIHGYKIQA 745 (762) Q Consensus 685 ~~~~~~~~l~~~w~~~~~~~yd--l~~~~~---------~~~--~G--l~Y~~~-Cw~~~l~y~r~~~---~~~~~~i~~ 745 (762) .+.++++.|+++|.+.....|- +++++. .+. +| +-|..+ -|.+.+.|.++.. +..++.+++ T Consensus 219 ~d~s~s~~i~~~~~vGl~GY~lqQitdDr~~g~~~~g~R~rvfAiGPav~y~~~~~~~l~l~~y~Ef~arNR~eG~~~~l 298 (304) T COG4313 219 ADYSASYAITEHWRVGLGGYYLQQITDDRGNGATVPGFRERVFAIGPAVMYKFDFKGYLFLNWYKEFGARNRPEGDQLWL 298 (304) T ss_pred EEEHHHHCCCCCEEEEEEEEEEEECCCCCCCCCCCCCCHHEEEEECCEEEECCCCCCEEEEEHHHHHCCCCCCCCCEEEE T ss_conf 22212224478869841147888703666687425785010577566336614776469986223325345786654888 Q ss_pred EEEE Q ss_conf 9997 Q gi|254780395|r 746 RLSL 749 (762) Q Consensus 746 ~~~L 749 (762) ++.+ T Consensus 299 r~a~ 302 (304) T COG4313 299 RYAI 302 (304) T ss_pred EECC T ss_conf 7514 No 47 >cd07303 Porin3 Eukaryotic porin family that forms channels in the mitochondrial outer membrane. The porin family 3 contains two sub-families that play vital roles in the mitochondrial outer membrane, a translocase for unfolded pre-proteins (Tom40) and the voltage-dependent anion channel (VDAC) that regulates the flux of mostly anionic metabolites through the outer mitochondrial membrane. Probab=44.34 E-value=15 Score=13.94 Aligned_cols=84 Identities=15% Similarity=0.148 Sum_probs=47.6 Q ss_pred CCCCEEEEEEEEECCCCCEEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEECCC Q ss_conf 36777999999861676606688999999647408999998722675446663258998999972565699999995165 Q gi|254780395|r 629 LSPKITFSTQTLINPKDWSTRRTDTTIDYTMGSYEANISYTHIPAYPLYAYDARKTIQSRIKFKINDVLSADASLKWNMR 708 (762) Q Consensus 629 ~~~~~~l~~~~~~d~~~~~~~r~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~l~~~w~~~~~~~ydl~ 708 (762) -.+.|.+.+..-||... ...+....+.|..+..-+.+. ..+ ++-+.+.++...+++++|.+.+...|+.. T Consensus 139 g~~g~l~G~~~~fDt~~-~~tk~n~algy~~~~~d~~~~-~~l--------nn~~~~~aSyyhkVn~~~~vga~~~~~~~ 208 (274) T cd07303 139 GYEGWLAGYQMVFETVS-RVTQSNFAVGYKTDYNEFQAH-TNV--------NDGTEFGGSIYHKVNDKLEVGVNLAATAG 208 (274) T ss_pred CCCCEEEEEEEEEECCC-CCEEEEEEEECCCCCCCEEEE-EEE--------CCCCEEEEEEEEECCCCEEEEEEEEEECC T ss_conf 21417876676321115-530588899705686417999-992--------68888889999993677679999998227 Q ss_pred CCCCEEEEEEEEEC Q ss_conf 45412256569964 Q gi|254780395|r 709 AELPSHSIGLAYQN 722 (762) Q Consensus 709 ~~~~~~~~Gl~Y~~ 722 (762) .....-.+|-+|.- T Consensus 209 ~~~~~~~vg~ky~l 222 (274) T cd07303 209 NSNTRFGIAAKYQV 222 (274) T ss_pred CCCCEEEEEEEEEC T ss_conf 89726998988961 No 48 >PRK13524 outer membrane receptor FepA; Provisional Probab=43.40 E-value=16 Score=13.85 Aligned_cols=15 Identities=20% Similarity=0.135 Sum_probs=9.4 Q ss_pred EEEEEEEEEEEEEEE Q ss_conf 998889987788751 Q gi|254780395|r 460 FTPIANIRGDLHYLS 474 (762) Q Consensus 460 ~tp~~~lr~~~Y~~~ 474 (762) ++..+++|.+.|... T Consensus 446 l~l~~G~Ryd~~~~~ 460 (742) T PRK13524 446 TMLTPGLRFDHHSIV 460 (742) T ss_pred EEEECCEEEEEEECC T ss_conf 265022588732012 No 49 >pfam11854 DUF3374 Protein of unknown function (DUF3374). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 665 to 712 amino acids in length. Probab=43.17 E-value=16 Score=13.82 Aligned_cols=25 Identities=4% Similarity=0.315 Sum_probs=17.3 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEC Q ss_conf 2589989999725656999999951 Q gi|254780395|r 682 RKTIQSRIKFKINDVLSADASLKWN 706 (762) Q Consensus 682 ~~~~~~~~~~~l~~~w~~~~~~~yd 706 (762) .+.+.+.+.|+++++..+...++|. T Consensus 609 ~h~l~~~~~Y~lse~~~lrl~y~yE 633 (675) T pfam11854 609 SHNLKLYADYQLSESMSLRLDYRYE 633 (675) T ss_pred EEEEEEEEEEEECCCCEEEEEEEEE T ss_conf 4688898645516661567888999 No 50 >pfam11231 DUF3034 Protein of unknown function (DUF3034). This family of proteins with unknown function appears to be restricted to Proteobacteria. Probab=42.29 E-value=16 Score=13.73 Aligned_cols=24 Identities=4% Similarity=0.079 Sum_probs=17.4 Q ss_pred EEEEEEEEEECCCCEEEEEEEEEC Q ss_conf 589989999725656999999951 Q gi|254780395|r 683 KTIQSRIKFKINDVLSADASLKWN 706 (762) Q Consensus 683 ~~~~~~~~~~l~~~w~~~~~~~yd 706 (762) -+...+..+-++++|.+.+.+|.- T Consensus 182 ~~~E~S~a~ll~~~~avG~EYR~K 205 (257) T pfam11231 182 LQPEGSVAVLLSRNLAVGVEYRQK 205 (257) T ss_pred EEEEEEEEEEECCCEEEEEEEECC T ss_conf 867148999835864887772028 No 51 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=42.28 E-value=16 Score=13.73 Aligned_cols=16 Identities=13% Similarity=0.144 Sum_probs=8.0 Q ss_pred CCEEEEEEEEEEEECC Q ss_conf 6347887888888428 Q gi|254780395|r 564 GGNRTNLGIRYIGSFN 579 (762) Q Consensus 564 ~~~r~~~Gl~~~~~~~ 579 (762) .++.+..+++|+.... T Consensus 654 ~~~~~~~~v~yr~~er 669 (818) T PRK10049 654 TANSASAYVRWYQNER 669 (818) T ss_pred CCCCCCEEEEEEECCC T ss_conf 6311333789887463 No 52 >PRK10695 hypothetical protein; Provisional Probab=42.24 E-value=16 Score=13.73 Aligned_cols=41 Identities=12% Similarity=0.232 Sum_probs=30.4 Q ss_pred EEEEEEEEEECCCCEEEEEEEEEEEECCE-EEEEEEEEEECC Q ss_conf 99920799976689799998799998996-999518999863 Q gi|254780395|r 53 DISSDEIVLNSEGSTTTAVGNVKIEYKGY-HLSARDITFNHK 93 (762) Q Consensus 53 ~~~ad~~~~~~~~~~~~a~G~v~i~~~~~-~l~Ad~i~y~~~ 93 (762) -+.-+++.|-...-.+..-+++.+.+... +|.||.+..|.. T Consensus 56 ~l~lp~~~~~~~~C~la~~~~~~l~~~~~w~l~~~~l~lD~a 97 (876) T PRK10695 56 GLRLPDLRYLVGDCTLAHIDNLSLSWPSRWQLSIGTLTLDSA 97 (876) T ss_pred CEECCCEEEEECCCCEEECCCCEECCCCCEEEECCCCCCCHH T ss_conf 555587488818852588535188256300785030321589 No 53 >pfam09694 Gcw_chp Bacterial protein of unknown function (Gcw_chp). This entry represents a conserved hypothetical protein about 240 residues in length found so far in Proteobacteria including Shewanella oneidensis and Ralstonia solanacearum, usually as part of a paralogous family. The function is unknown. Probab=41.28 E-value=17 Score=13.63 Aligned_cols=52 Identities=13% Similarity=0.189 Sum_probs=37.2 Q ss_pred EEEEEEEEEEECCC-CEEEEEEEEEC-CCC-----CCCEEEEEEEEEC-CEEEEEEEEEE Q ss_conf 25899899997256-56999999951-654-----5412256569964-66999999997 Q gi|254780395|r 682 RKTIQSRIKFKIND-VLSADASLKWN-MRA-----ELPSHSIGLAYQN-DCATVKIIYKN 733 (762) Q Consensus 682 ~~~~~~~~~~~l~~-~w~~~~~~~yd-l~~-----~~~~~~~Gl~Y~~-~Cw~~~l~y~r 733 (762) ...+.+...++|.. .|.+.+...|. ... +...+.+|+.+.- .-|.++|.|.. T Consensus 137 s~Y~~~~~~~~l~~~~~tl~~h~G~q~~~~~~~~~~Y~Dw~vGvs~~~~gg~~~~laY~d 196 (226) T pfam09694 137 SGYLDLDAGVDLGLTGVTLNAHVGYQDVRNGFTAYSYTDWKIGLSKALDGGLTLSAAYVD 196 (226) T ss_pred CEEEEEEECEECCCCCEEEEEEEEEEECCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEE T ss_conf 578861444032877669999856680137767787589999898984798899999990 No 54 >pfam11924 DUF3442 Protein of unknown function (DUF3442). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 334 to 948 amino acids in length. Probab=38.81 E-value=19 Score=13.38 Aligned_cols=21 Identities=10% Similarity=0.066 Sum_probs=9.8 Q ss_pred EEEEEEEECCCCEEEEEEEEE Q ss_conf 799998843677799999986 Q gi|254780395|r 621 YAAAVRLSLSPKITFSTQTLI 641 (762) Q Consensus 621 ~v~~~~~~~~~~~~l~~~~~~ 641 (762) +-..+.+.|.+.+++....+. T Consensus 211 ~t~gl~ytP~pllt~~~~~~~ 231 (280) T pfam11924 211 VTLGLNYTPVPLLTVGADYRD 231 (280) T ss_pred EEEEEEECCCCCEEEEEEEEE T ss_conf 799888403687799873680 No 55 >COG5338 Uncharacterized protein conserved in bacteria [Function unknown] Probab=38.21 E-value=19 Score=13.32 Aligned_cols=28 Identities=21% Similarity=0.475 Sum_probs=15.8 Q ss_pred CCCCEECCCEEEEEEEEEEEECCCCCEEEEEE Q ss_conf 57400046347887888888428860663230 Q gi|254780395|r 557 SGFDRIEGGNRTNLGIRYIGSFNNLFTINGVI 588 (762) Q Consensus 557 sG~DRie~~~r~~~Gl~~~~~~~~~~~~~~~i 588 (762) .|++|-.++--+.+|+. ++-++.++..+ T Consensus 299 ~G~eRSs~~Y~~r~G~~----~D~gEk~~GE~ 326 (468) T COG5338 299 QGYERSSDGYGARVGAE----FDLGEKLRGEV 326 (468) T ss_pred CCCCCCCCCCCCCCCEE----ECHHHHHCCCC T ss_conf 34223444200013246----60004216500 No 56 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=36.89 E-value=20 Score=13.19 Aligned_cols=41 Identities=5% Similarity=0.072 Sum_probs=25.4 Q ss_pred EEEEEEEECCCCEEEEEEEEECCCCCCCEEE--EEEEEECCEE Q ss_conf 9989999725656999999951654541225--6569964669 Q gi|254780395|r 685 IQSRIKFKINDVLSADASLKWNMRAELPSHS--IGLAYQNDCA 725 (762) Q Consensus 685 ~~~~~~~~l~~~w~~~~~~~ydl~~~~~~~~--~Gl~Y~~~Cw 725 (762) +...+.++++++|.+.+...+|-..+..... +-+.|.=+-| T Consensus 1099 l~a~~eyr~~~~~~~G~~~~~~~s~dY~p~~~~lY~RY~f~~~ 1141 (1158) T PRK11447 1099 ARALIERRVTSNWFVGTAVDIQQAKDYAPSHALLYVRYSAAGW 1141 (1158) T ss_pred EEEEEEEEECCCEEEECEEECCCCCCCCCCCEEEEEEEECCCC T ss_conf 7788999715867863105015335778351467898963776 No 57 >PRK09501 potD spermidine/putrescine ABC transporter periplasmic substrate-binding protein; Reviewed Probab=36.60 E-value=19 Score=13.33 Aligned_cols=19 Identities=32% Similarity=0.457 Sum_probs=13.4 Q ss_pred CCCCCHHHHHHHHHHHHHH Q ss_conf 9533102476789999999 Q gi|254780395|r 1 MRYNYRHLIGATTFFLAFN 19 (762) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~ 19 (762) ||+-.|+||+++.|.+... T Consensus 1 ~~~~~~~ll~~~a~~l~~~ 19 (348) T PRK09501 1 MKKWSRHLLAAGALALGMS 19 (348) T ss_pred CHHHHHHHHHHHHHHHHCC T ss_conf 9368999999999997350 No 58 >pfam04245 NA37 37-kD nucleoid-associated bacterial protein. Probab=35.64 E-value=14 Score=14.19 Aligned_cols=10 Identities=20% Similarity=0.009 Sum_probs=3.3 Q ss_pred CEEEEEEEEE Q ss_conf 1011001345 Q gi|254780395|r 266 KGILGEMELR 275 (762) Q Consensus 266 rG~~l~~eyR 275 (762) .|..+-.+|+ T Consensus 103 gG~llf~~Y~ 112 (334) T pfam04245 103 GGHLLFAHYQ 112 (334) T ss_pred CCEEEEEEEE T ss_conf 7289999996 No 59 >COG3056 Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=35.17 E-value=21 Score=13.01 Aligned_cols=27 Identities=11% Similarity=0.177 Sum_probs=12.9 Q ss_pred HHHHHH-HHHHHHHHHHHHHCCCCCCCC Q ss_conf 024767-899999999876324510025 Q gi|254780395|r 6 RHLIGA-TTFFLAFNIITPQGTPPSNEQ 32 (762) Q Consensus 6 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 32 (762) +|++.. .+++++.+++++-+.||..-. T Consensus 12 k~~t~k~L~~laa~~lLagC~a~~~tl~ 39 (204) T COG3056 12 KNMTKKILFPLAAIFLLAGCAAPPTTLI 39 (204) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCEEE T ss_conf 7889999999999999975379975031 No 60 >PRK00378 nucleoid-associated protein NdpA; Validated Probab=34.04 E-value=16 Score=13.85 Aligned_cols=13 Identities=15% Similarity=0.069 Sum_probs=4.8 Q ss_pred CCCEEEEEEEEEE Q ss_conf 4310110013455 Q gi|254780395|r 264 PKKGILGEMELRK 276 (762) Q Consensus 264 s~rG~~l~~eyRy 276 (762) ...|..+-.+|++ T Consensus 98 a~gG~llf~~Y~~ 110 (334) T PRK00378 98 AEGGFLLFCHYQY 110 (334) T ss_pred CCCCEEEEEEEEC T ss_conf 5561899999972 No 61 >pfam11276 DUF3078 Protein of unknown function (DUF3078). This bacterial family of proteins has no known function. Probab=33.89 E-value=22 Score=12.87 Aligned_cols=28 Identities=18% Similarity=0.158 Sum_probs=17.1 Q ss_pred CCCEEEEEEEEEEECCCCEEEEEEEEEC Q ss_conf 6632589989999725656999999951 Q gi|254780395|r 679 YDARKTIQSRIKFKINDVLSADASLKWN 706 (762) Q Consensus 679 ~~~~~~~~~~~~~~l~~~w~~~~~~~yd 706 (762) .+++=++.....+++.++|...+...+- T Consensus 58 tdD~i~l~S~~G~~~~~~W~yS~~~~f~ 85 (92) T pfam11276 58 TDDRLRLNSLLGYKASSNWYYSALGNFR 85 (92) T ss_pred EEEEEEEEEHHHHHHHCCEEEEEEEEEE T ss_conf 1119999505346410658899999999 No 62 >PRK11667 hypothetical protein; Provisional Probab=33.85 E-value=22 Score=12.87 Aligned_cols=23 Identities=17% Similarity=0.219 Sum_probs=17.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 95331024767899999999876 Q gi|254780395|r 1 MRYNYRHLIGATTFFLAFNIITP 23 (762) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~ 23 (762) |+...+|++.|+.+.++++.... T Consensus 1 m~~~~~~~~~~~~l~~~ll~~~a 23 (162) T PRK11667 1 MMKTIKHLLCCAALAAALISTSA 23 (162) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH T ss_conf 91489999999999999999999 No 63 >smart00869 Autotransporter Autotransporter beta-domain. Secretion of protein products occurs by a number of different pathways in bacteria. One of these pathways known as the type IV pathway was first described for the IgA1 protease. The protein component that mediates secretion through the outer membrane is contained within the secreted protein itself, hence the proteins secreted in this way are called autotransporters. This family corresponds to the presumed integral membrane beta-barrel domain that transports the protein. This domain is found at the C-terminus of the proteins it occurs in. The N-terminus contains the variable passenger domain that is translocated across the membrane. Once the passenger domain is exported it is cleaved auto-catalytically in some proteins, in others a different peptidase is used and in some cases no cleavage occurs. Probab=30.16 E-value=25 Score=12.47 Aligned_cols=84 Identities=13% Similarity=0.035 Sum_probs=35.5 Q ss_pred EEEEEEEEEEEECCCCEEEEEEEEEEEEEEEEECCCCCCCCC-CCCCCCEEEEEEEEEEEEEEECCEECCCCCEEEEEEE Q ss_conf 677756776432167179998889987788751256666665-4456520236898989999831021367632589877 Q gi|254780395|r 442 LSIEADWRKKIIGPLGILFTPIANIRGDLHYLSFNRDLSSDT-ISNNPNFVASKMLTAGLDIRYPIVAVTQKSRHILEGI 520 (762) Q Consensus 442 ~~~~~~~~~~~~~~~G~~~tp~~~lr~~~Y~~~~~~~~~~~~-~~~~~~~~~r~~~~~~l~~~~p~~~~~~~~~h~LeP~ 520 (762) +.+..++.+++.....+.++|.+++....-..+......... ..............+|+.+.+.+..... ..+.|. T Consensus 117 ~~~~~~~g~~~~~~~~~~i~P~~~l~~~~~~~~~~~E~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~---~~~~~~ 193 (261) T smart00869 117 YGASLEAGYPFKLGGSLTLTPFAGLAYSYVRQDGFTESGGAFALSVDSQSLDSLSLPLGLRLGYAFALGDG---STLTPY 193 (261) T ss_pred EEEEEEECCCCCCCCCEEEEEEEEEEEEEEECCCEEECCCCHHCEECCCCCCEEEEEEEEEEEEEEECCCC---CEEEEE T ss_conf 99999733012234678999899999999975976771960211321566437999999999989985899---699999 Q ss_pred EEEEEECC Q ss_conf 78999437 Q gi|254780395|r 521 AQVYAATD 528 (762) Q Consensus 521 ~q~~~~~~ 528 (762) +.+.+... T Consensus 194 ~~~~~~~~ 201 (261) T smart00869 194 LSLAYQHD 201 (261) T ss_pred EEEEEEEH T ss_conf 99999943 No 64 >TIGR03016 pepcterm_hypo_1 uncharacterized protein, PEP-CTERM system associated. Members of this protein family are found predominantly in exopolysaccharide biosynthesis operons marked by the presence of the EpsH-family putative exosortase and presence in the genome of the PEP-CTERM protein sorting signal. Members of this family may be distantly related to the EpsL family modeled in TIGR03014. Probab=29.76 E-value=26 Score=12.43 Aligned_cols=66 Identities=20% Similarity=0.292 Sum_probs=37.4 Q ss_pred EEEEEEEEECCCCEEEEEEEEECC---CC--C----CCEEEEEEEEEC-CEEEEEEEEEE---CCCCC----CCCEEEEE Q ss_conf 899899997256569999999516---54--5----412256569964-66999999997---14778----87359999 Q gi|254780395|r 684 TIQSRIKFKINDVLSADASLKWNM---RA--E----LPSHSIGLAYQN-DCATVKIIYKN---ESPNI----HGYKIQAR 746 (762) Q Consensus 684 ~~~~~~~~~l~~~w~~~~~~~ydl---~~--~----~~~~~~Gl~Y~~-~Cw~~~l~y~r---~~~~~----~~~~i~~~ 746 (762) -+.+...+.|.+++.+.+...|.- .. + ...-.+|+.|.= .-..+++.|+. +.+.. .++.|++. T Consensus 347 ~~~~~~~~~L~~rl~~~~~~~~~~~~~~~~~~~~d~~~~~~~gl~y~l~~~~~~~l~Yry~~rdS~~~~~dy~~N~v~~~ 426 (431) T TIGR03016 347 GGSVLLTHRLSPRLTANLGLTYERNEGSGTNDQETDRTSVSAGLSYQLGENTTGSLTYRYQERDSNFEATDYTENRLMAE 426 (431) T ss_pred EEEEEEEEEECCCCEEEEEEEEEEEECCCCCCCCEEEEEEEEEEEEEECCCEEEEEEEEEEECCCCCCCCCHHHHEEEEE T ss_conf 10389999965131214899999724377776504699999989999638659999999997047876666455121577 Q ss_pred EEE Q ss_conf 997 Q gi|254780395|r 747 LSL 749 (762) Q Consensus 747 ~~L 749 (762) +++ T Consensus 427 l~~ 429 (431) T TIGR03016 427 VTM 429 (431) T ss_pred EEE T ss_conf 874 No 65 >TIGR03519 Bac_Flav_fam_1 Bacteroidetes-specific putative membrane protein. This model describes a protein family unique to, and greatly expanded in, the Bacteriodetes. Species in this lineage include several, such as Cytophaga hutchinsonii and Flavobacterium johnsoniae, that exhibit a poorly understood rapid gliding phenotype. Several members of this protein family are found in operons with other genes whose loss leads to a loss a this motility. Probab=29.35 E-value=26 Score=12.38 Aligned_cols=73 Identities=15% Similarity=0.189 Sum_probs=27.7 Q ss_pred EEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEECC-CCEEEEEEEEECCCCCCCCCCEEEEEEEEEEECCCCEEEEEE Q ss_conf 9988436777999999861676606688999999647-408999998722675446663258998999972565699999 Q gi|254780395|r 624 AVRLSLSPKITFSTQTLINPKDWSTRRTDTTIDYTMG-SYEANISYTHIPAYPLYAYDARKTIQSRIKFKINDVLSADAS 702 (762) Q Consensus 624 ~~~~~~~~~~~l~~~~~~d~~~~~~~r~~~~~~~~~~-~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~l~~~w~~~~~ 702 (762) ...+.+.+.+.+.-.+.+-.....-...++++..... .+.+.++||.. ..+.+.+.+++.+++.+... T Consensus 200 gy~~~l~~~~~i~Ps~l~~~~~~~~~~~d~~~~~~~~~~~~~G~~YR~~-----------~ai~~~~g~~~~~~~~~gys 268 (292) T TIGR03519 200 GYKFPLNEEIKITPSILYKYQKGAPFQLDLGANALYNDKLWAGAGYRGN-----------DAVIGLVGFNLNKRLSIGYS 268 (292) T ss_pred EEEEECCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCEEEEEEEEECC-----------CCEEEEEEEEECCCEEEEEE T ss_conf 5799548984999999999973897079888999972269999999669-----------95999997997798999999 Q ss_pred EEECC Q ss_conf 99516 Q gi|254780395|r 703 LKWNM 707 (762) Q Consensus 703 ~~ydl 707 (762) +.+.+ T Consensus 269 Yd~~~ 273 (292) T TIGR03519 269 YDFST 273 (292) T ss_pred EEEEC T ss_conf 97023 No 66 >PRK13874 conjugal transfer protein TrbJ; Provisional Probab=28.49 E-value=27 Score=12.28 Aligned_cols=28 Identities=18% Similarity=0.236 Sum_probs=18.5 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 8999999998763245100255567865 Q gi|254780395|r 12 TTFFLAFNIITPQGTPPSNEQTTTSTPS 39 (762) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 39 (762) ...+++.++++|...+|++|+....+|. T Consensus 8 ~~~~~~~~~~~~~~~~pa~A~~~V~Dpt 35 (233) T PRK13874 8 AAALAAALALSPALATPAHAQWIVYDPT 35 (233) T ss_pred HHHHHHHHHHCCCCCCCCCCCCEEECCH T ss_conf 9999999874235888731450246701 No 67 >COG3637 Opacity protein and related surface antigens [Cell envelope biogenesis, outer membrane] Probab=25.90 E-value=30 Score=11.98 Aligned_cols=23 Identities=9% Similarity=0.322 Sum_probs=15.7 Q ss_pred EEEEEEEEECCCCEEEEEEEEEC Q ss_conf 89989999725656999999951 Q gi|254780395|r 684 TIQSRIKFKINDVLSADASLKWN 706 (762) Q Consensus 684 ~~~~~~~~~l~~~w~~~~~~~yd 706 (762) .+.++.++.+++||.+...++|. T Consensus 148 ~~gAGv~~~~tdnv~vd~~Y~y~ 170 (199) T COG3637 148 AYGAGVQYNPTDNVAIDLGYEYS 170 (199) T ss_pred EEEEEEEECCCCCEEEEEEEEEE T ss_conf 78410798155987998899988 No 68 >COG2067 FadL Long-chain fatty acid transport protein [Lipid metabolism] Probab=24.40 E-value=32 Score=11.80 Aligned_cols=24 Identities=17% Similarity=0.212 Sum_probs=11.1 Q ss_pred EEEEEEEEECCCCEEEEEEEEECC Q ss_conf 899899997256569999999516 Q gi|254780395|r 684 TIQSRIKFKINDVLSADASLKWNM 707 (762) Q Consensus 684 ~~~~~~~~~l~~~w~~~~~~~ydl 707 (762) -+.+++.|++++++.+....-|=+ T Consensus 382 ~~s~G~~Y~~t~n~~vd~g~ay~~ 405 (440) T COG2067 382 WLSLGTTYKFTKNLEVDAGYAYLK 405 (440) T ss_pred EEECCCEEECCCCEEEEEEEEEEE T ss_conf 996664772599838998889998 No 69 >pfam01389 OmpA_membrane OmpA-like transmembrane domain. The structure of OmpA transmembrane domain shows that it consists of an eight stranded beta barrel. This family includes some other distantly related outer membrane proteins with low scores. Probab=21.88 E-value=36 Score=11.48 Aligned_cols=51 Identities=16% Similarity=0.318 Sum_probs=30.4 Q ss_pred EEEEEEEEECCCCEEEEEEEEE-----CCCC------------CC--CEEEEEEEEE-CCEEEEEEEEEEC Q ss_conf 8998999972565699999995-----1654------------54--1225656996-4669999999971 Q gi|254780395|r 684 TIQSRIKFKINDVLSADASLKW-----NMRA------------EL--PSHSIGLAYQ-NDCATVKIIYKNE 734 (762) Q Consensus 684 ~~~~~~~~~l~~~w~~~~~~~y-----dl~~------------~~--~~~~~Gl~Y~-~~Cw~~~l~y~r~ 734 (762) ++...+.+||+++|.+.++.-. |.+. +- ....+|++|. .+-|.+++.|++. T Consensus 78 ~l~~k~~~pl~d~~~ly~K~G~a~~~~d~~~~~~~~~~~~~~d~~~~~~~G~G~eY~it~~~~~r~eYq~~ 148 (175) T pfam01389 78 TLSLKLSYPLTDDLDVYGKVGGALVRADYKFYEDANGKTGNHDTGVSPLFALGVEYAVTPELAVRLEYQYL 148 (175) T ss_pred EEEEEEEEEECCCEEEEEEEEEEEEECCCCEECCCCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEE T ss_conf 99999888614856799971068996133210256664223567705388888899807582788889992 No 70 >pfam04234 CopC Copper resistance protein CopC. CopC is a bacterial blue copper protein that binds 1 atom of copper per protein molecule. Along with CopA, CopC mediates copper resistance by sequestration of copper in the periplasm. Probab=21.17 E-value=37 Score=11.38 Aligned_cols=35 Identities=17% Similarity=0.281 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 0247678999999998763245100255567865456 Q gi|254780395|r 6 RHLIGATTFFLAFNIITPQGTPPSNEQTTTSTPSKIK 42 (762) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 42 (762) |.++....+++..+++++.| -+++.-..+.|+..+ T Consensus 2 ~~~~~~~~~~~~~~~~~~~A--~AH~~l~~s~P~~ga 36 (120) T pfam04234 2 RRLLLAAALLALLLLLAGAA--SAHAKLVSSTPAAGA 36 (120) T ss_pred HHHHHHHHHHHHHHHHHHHC--CCCCCCCCCCCCCCC T ss_conf 77999999999998740321--315442432799949 No 71 >PRK00022 lolB outer membrane lipoprotein LolB; Provisional Probab=20.97 E-value=37 Score=11.36 Aligned_cols=20 Identities=20% Similarity=0.032 Sum_probs=11.0 Q ss_pred EEEEEEEEEEECCEEEEEEE Q ss_conf 99998799998996999518 Q gi|254780395|r 68 TTAVGNVKIEYKGYHLSARD 87 (762) Q Consensus 68 ~~a~G~v~i~~~~~~l~Ad~ 87 (762) -.++|.+-|..++....|-- T Consensus 49 w~~~Gkiav~~~~~~~sa~f 68 (203) T PRK00022 49 YQARGRFAYISPQQRVSGRF 68 (203) T ss_pred EEEEEEEEEECCCCEEEEEE T ss_conf 48846899987885478999 No 72 >pfam04420 CHD5 CHD5-like protein. Members of this family are probably coiled-coil proteins that are similar to the CHD5 (Congenital heart disease 5) protein. In Saccharomyces cerevisiae this protein localizes to the ER and is thought to play a homeostatic role. Probab=20.82 E-value=37 Score=11.40 Aligned_cols=21 Identities=14% Similarity=0.079 Sum_probs=16.0 Q ss_pred EEEECCEEEEECCCEECCCCC Q ss_conf 899827068632540012455 Q gi|254780395|r 200 YLEIFGNSIFYFPLIEIPDET 220 (762) Q Consensus 200 ~l~i~~vPv~y~Py~~~P~~~ 220 (762) .++....|||++|...+|-|- T Consensus 114 ~~~y~k~pv~~lp~~~~p~p~ 134 (159) T pfam04420 114 RFWYRKTPVFVLPSGWFPYPV 134 (159) T ss_pred HHEECCCEEEECCCCHHHHHH T ss_conf 630068668980851005788 No 73 >PRK10301 hypothetical protein; Provisional Probab=20.76 E-value=38 Score=11.33 Aligned_cols=39 Identities=21% Similarity=0.112 Sum_probs=23.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 953310247678999999998763245100255567865456 Q gi|254780395|r 1 MRYNYRHLIGATTFFLAFNIITPQGTPPSNEQTTTSTPSKIK 42 (762) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 42 (762) |--+.|+|..+.+++++.++ ++.| -++++-..+.|+... T Consensus 1 ~~~~~~~~r~~l~l~~~~l~-~~~A--~AHa~L~~S~Pa~ga 39 (124) T PRK10301 1 MASTARSLRYALAILTTSLV-TPAV--WAHAHLTHQYPAANA 39 (124) T ss_pred CCCHHHHHHHHHHHHHHHHH-CCCC--CCCCCCCCCCCCCCC T ss_conf 97317889999999999876-5013--005552544799979 No 74 >COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones] Probab=20.68 E-value=38 Score=11.32 Aligned_cols=45 Identities=9% Similarity=-0.006 Sum_probs=18.3 Q ss_pred EEECCCCEEEEECCEE--EECCCCCEEEEEEE-EEECCCCEEEEEEEE Q ss_conf 9986378799973489--98389878997069-994302559998479 Q gi|254780395|r 89 TFNHKNHRIIASGNIK--LIEPDKRQIHAEYL-DITDDFTNGIIKNLT 133 (762) Q Consensus 89 ~y~~~~~~~~a~G~v~--~~~~~g~~~~~~~~-~~~~~~~~g~~~~~~ 133 (762) .+|-.++.+.++-++- -.-++|.+..++.. .++.+.+.++..++. T Consensus 72 ~~~L~~gq~~~s~~l~~~~~FgEGit~~gd~~y~LTw~egvaf~~d~~ 119 (262) T COG3823 72 VSDLTTGQEIFSEKLAPDTVFGEGITKLGDYFYQLTWKEGVAFKYDAD 119 (262) T ss_pred EEECCCCEEEEEEECCCCCCCCCCEEECCCEEEEEEECCCEEEEECHH T ss_conf 884367508997406973200235003055589999425604898767 Done!