Query gi|254781078|ref|YP_003065491.1| hypothetical protein CLIBASIA_04900 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 242 No_of_seqs 152 out of 223 Neff 5.0 Searched_HMMs 39220 Date Mon May 30 04:22:43 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781078.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3637 Opacity protein and re 100.0 4E-34 1E-38 205.4 18.7 191 7-242 2-199 (199) 2 pfam01389 OmpA_membrane OmpA-l 99.9 7.1E-23 1.8E-27 143.8 19.1 165 55-242 5-174 (175) 3 PRK10808 outer membrane protei 99.9 1.1E-21 2.9E-26 137.2 18.4 190 6-242 2-195 (347) 4 pfam02462 Opacity Opacity fami 99.4 8E-12 2E-16 83.3 11.0 121 94-242 2-126 (126) 5 pfam02530 Porin_2 Porin subfam 99.3 4.5E-11 1.2E-15 79.2 12.7 88 127-214 246-358 (378) 6 PRK09408 ompX outer membrane p 99.2 2E-09 5E-14 70.2 16.6 125 90-242 46-171 (171) 7 PRK10959 outer membrane protei 99.1 4.6E-08 1.2E-12 62.7 17.4 108 131-242 103-212 (212) 8 pfam06316 Ail_Lom Enterobacter 99.0 4.7E-08 1.2E-12 62.7 15.5 133 90-242 50-199 (199) 9 pfam03922 OmpW OmpW family. Th 98.9 2.5E-07 6.3E-12 58.7 16.2 104 131-242 84-189 (189) 10 pfam03349 Toluene_X Outer memb 98.8 2.2E-06 5.7E-11 53.5 19.3 61 182-242 355-415 (415) 11 COG2067 FadL Long-chain fatty 98.5 6.8E-06 1.7E-10 50.8 13.6 147 85-242 216-391 (440) 12 PRK10716 long-chain fatty acid 98.5 6.5E-05 1.7E-09 45.5 19.1 53 185-242 385-437 (437) 13 PRK10716 long-chain fatty acid 98.5 2.2E-05 5.5E-10 48.1 15.2 60 176-242 334-393 (437) 14 COG3047 OmpW Outer membrane pr 98.4 1.9E-06 4.9E-11 53.9 8.9 109 128-242 103-213 (213) 15 pfam07437 YfaZ YfaZ precursor. 98.3 8.4E-05 2.2E-09 44.9 15.5 102 85-211 58-160 (180) 16 pfam03797 Autotransporter Auto 98.3 0.00016 4.2E-09 43.3 15.5 151 54-205 84-258 (265) 17 pfam05736 OprF OprF membrane d 98.1 7.4E-05 1.9E-09 45.2 11.0 125 86-242 50-180 (184) 18 COG2067 FadL Long-chain fatty 97.9 0.00082 2.1E-08 39.5 13.9 104 128-242 335-440 (440) 19 pfam03349 Toluene_X Outer memb 97.8 0.00063 1.6E-08 40.1 11.9 146 86-242 195-366 (415) 20 smart00869 Autotransporter Aut 97.8 0.002 5E-08 37.4 14.3 153 55-207 79-256 (261) 21 pfam06629 MipA MltA-interactin 97.5 0.0056 1.4E-07 34.9 18.6 136 90-242 86-226 (226) 22 cd00342 gram_neg_porins Porins 97.2 0.013 3.3E-07 32.9 15.1 58 182-242 272-329 (329) 23 pfam12099 DUF3575 Protein of u 96.9 0.024 6E-07 31.5 11.8 38 8-45 7-44 (186) 24 COG3203 OmpC Outer membrane pr 96.7 0.033 8.4E-07 30.7 13.2 57 178-242 298-354 (354) 25 smart00869 Autotransporter Aut 96.5 0.047 1.2E-06 29.8 16.9 152 87-241 67-241 (261) 26 pfam10082 DUF2320 Uncharacteri 96.1 0.084 2.1E-06 28.5 19.1 50 181-242 331-380 (380) 27 pfam03797 Autotransporter Auto 95.8 0.11 2.9E-06 27.7 18.6 181 57-241 46-245 (265) 28 PRK09752 adhesin; Provisional 95.7 0.12 3E-06 27.7 14.4 145 55-206 1077-1231(1250) 29 pfam00267 Porin_1 Gram-negativ 95.7 0.13 3.3E-06 27.5 17.5 49 184-242 286-334 (334) 30 pfam11854 DUF3374 Protein of u 95.6 0.13 3.4E-06 27.3 13.3 18 10-27 5-23 (675) 31 pfam05736 OprF OprF membrane d 95.5 0.027 7E-07 31.1 4.9 99 8-113 9-122 (184) 32 COG5338 Uncharacterized protei 95.3 0.056 1.4E-06 29.4 5.9 78 118-206 253-331 (468) 33 TIGR01414 autotrans_barl outer 95.1 0.2 5E-06 26.4 12.3 167 55-242 265-454 (454) 34 pfam05420 BCSC_C Cellulose syn 94.9 0.23 6E-06 26.0 17.3 51 174-242 286-336 (339) 35 pfam11854 DUF3374 Protein of u 94.8 0.25 6.4E-06 25.9 15.6 58 185-242 612-675 (675) 36 PRK11447 cellulose synthase su 94.7 0.25 6.4E-06 25.9 15.1 32 175-206 1089-1120(1158) 37 pfam11383 DUF3187 Protein of u 94.4 0.12 2.9E-06 27.7 5.6 87 132-242 157-243 (273) 38 PRK12580 outer membrane protea 94.0 0.35 9E-06 25.0 7.4 61 182-242 244-312 (312) 39 COG3468 AidA Type V secretory 93.7 0.23 5.9E-06 26.1 6.0 163 38-206 380-570 (592) 40 pfam06082 DUF940 Bacterial put 93.4 0.29 7.4E-06 25.5 6.1 21 185-205 420-440 (661) 41 pfam01278 Omptin Omptin family 93.3 0.49 1.2E-05 24.3 7.5 60 181-242 226-295 (295) 42 pfam01617 Surface_Ag_2 Surface 92.7 0.6 1.5E-05 23.8 20.0 110 87-214 101-241 (279) 43 pfam07585 DUF1551 Protein of u 92.2 0.72 1.8E-05 23.4 11.9 122 121-242 189-338 (338) 44 COG4625 Uncharacterized protei 91.8 0.79 2E-05 23.1 11.0 46 179-242 532-577 (577) 45 PRK09980 ompL outer membrane p 91.2 0.93 2.4E-05 22.7 13.3 50 179-242 181-230 (230) 46 PRK10993 outer membrane protea 91.1 0.94 2.4E-05 22.7 13.1 61 182-242 244-311 (311) 47 PRK10159 outer membrane phosph 90.7 0.85 2.2E-05 23.0 5.8 47 184-242 305-351 (351) 48 TIGR03509 OMP_MtrB_PioB decahe 88.8 1.5 3.8E-05 21.6 17.0 59 184-242 584-649 (649) 49 TIGR03016 pepcterm_hypo_1 unch 88.4 1.6 4E-05 21.5 14.7 50 181-242 382-431 (431) 50 PRK10002 outer membrane protei 86.5 2 5E-05 21.0 5.3 46 185-242 317-362 (362) 51 COG3713 OmpV Outer membrane pr 86.2 2.1 5.4E-05 20.8 14.6 96 139-242 159-258 (258) 52 TIGR03014 EpsL exopolysacchari 85.9 2.2 5.6E-05 20.7 17.1 54 182-241 290-343 (381) 53 COG4571 OmpT Outer membrane pr 84.0 2.7 6.9E-05 20.2 7.3 150 90-242 136-314 (314) 54 PRK13484 putative iron-regulat 80.7 3.7 9.3E-05 19.5 6.3 53 190-242 540-593 (682) 55 pfam09381 Porin_OmpG Outer mem 80.5 3.3 8.3E-05 19.8 4.4 46 184-242 235-280 (280) 56 PRK13486 bifunctional enteroba 70.8 7 0.00018 18.0 5.9 43 194-242 538-580 (696) 57 PRK13483 enterobactin receptor 68.1 8 0.0002 17.6 6.1 19 190-208 533-551 (663) 58 COG5571 Autotransporter protei 65.7 2.1 5.4E-05 20.8 0.5 135 88-242 90-239 (239) 59 pfam04338 DUF481 Protein of un 65.4 9.1 0.00023 17.3 15.9 49 182-242 162-210 (210) 60 pfam06727 DUF1207 Protein of u 64.7 9.4 0.00024 17.3 5.2 99 129-242 214-337 (337) 61 pfam09411 PagL Lipid A 3-O-dea 64.1 9.6 0.00025 17.2 15.3 87 131-239 52-138 (139) 62 COG4771 FepA Outer membrane re 63.8 9.7 0.00025 17.2 4.0 59 182-242 639-699 (699) 63 PRK10177 hypothetical protein; 61.0 11 0.00028 16.9 6.7 25 181-205 252-276 (452) 64 pfam06178 KdgM Oligogalacturon 59.5 12 0.0003 16.7 17.3 51 177-242 168-218 (218) 65 PRK10554 outer membrane porin 55.2 14 0.00035 16.3 15.8 49 185-242 325-373 (373) 66 pfam11336 DUF3138 Protein of u 55.0 14 0.00036 16.3 8.6 41 177-217 456-496 (514) 67 PRK13513 putative outer membra 54.3 14 0.00037 16.2 6.2 18 191-208 532-549 (660) 68 pfam12094 DUF3570 Protein of u 47.0 19 0.00048 15.6 12.3 29 178-206 258-286 (407) 69 pfam09982 DUF2219 Uncharacteri 44.7 21 0.00053 15.4 13.7 53 93-152 127-180 (293) 70 pfam11924 DUF3442 Protein of u 43.9 21 0.00054 15.3 12.8 67 132-207 162-233 (280) 71 pfam10677 DUF2490 Protein of u 43.8 21 0.00055 15.3 6.3 31 174-208 147-177 (191) 72 pfam07172 GRP Glycine rich pro 43.1 5.9 0.00015 18.3 -0.3 43 5-53 3-47 (91) 73 COG2930 Uncharacterized conser 42.3 23 0.00058 15.2 3.0 56 52-109 62-118 (227) 74 TIGR01778 TonB-copper TonB-dep 40.4 24 0.00062 15.0 2.8 30 88-117 515-545 (668) 75 PRK11528 hypothetical protein; 39.1 25 0.00065 14.9 11.2 30 179-208 206-235 (254) 76 pfam04357 DUF490 Family of unk 30.9 35 0.0009 14.1 3.0 15 189-203 319-333 (349) 77 PRK03761 organic solvent toler 28.4 39 0.00099 13.9 5.6 21 185-205 681-701 (782) 78 TIGR00459 aspS_bact aspartyl-t 28.0 14 0.00037 16.2 -0.3 20 192-211 126-145 (653) 79 PRK03641 hypothetical protein; 26.3 43 0.0011 13.7 1.8 30 7-36 2-31 (220) 80 COG1956 GAF domain-containing 25.2 32 0.00081 14.3 1.0 11 51-61 50-60 (163) 81 PRK09840 catecholate sideropho 24.4 46 0.0012 13.4 9.3 20 190-209 621-640 (760) 82 PRK04517 hypothetical protein; 20.4 54 0.0014 13.1 1.4 28 9-36 5-33 (216) No 1 >COG3637 Opacity protein and related surface antigens [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=4e-34 Score=205.39 Aligned_cols=191 Identities=18% Similarity=0.244 Sum_probs=143.1 Q ss_pred CHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCEEEECCCCCCEEEEEEEEEEEEEEEEEEECCCCCCCCCCCCCCC Q ss_conf 05678888987436520234332025434575321103303766530000574446441033222145556666676675 Q gi|254781078|r 7 GALFFVVFLSIVVPNHSLAVDLYLPRKIDLFNEADNNVEYQDDEYGIWSGNYVGLHISRLYETHPLADTINRKTYNSLLP 86 (242) Q Consensus 7 ~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~WtG~YiG~~~G~~~~~~~~~d~~~~~~~n~~~~ 86 (242) |-++.+++++++.-..+.|+|. ...|++.|.+++.+..|.+.+.. ...+ T Consensus 2 k~~l~~a~~~~~~~~~a~aad~----------------------~~~w~~~~~~g~a~~~~~~~~~~---------~~~~ 50 (199) T COG3637 2 KKLLAAAALAALLLSAAAAADA----------------------AAGWYGTYSGGYAGSDWSGSDSS---------PSIG 50 (199) T ss_pred HHHHHHHHHHHHHHHHHHHHHH----------------------HCCCCEEEEEEEEECCCCCCCCC---------CCCC T ss_conf 0699999999987514444444----------------------23870688634884265677644---------3566 Q ss_pred CEEEEE-EEEEEEECCEEEEEEEEEEEECCCCC---CCCCCCC--CEEEEEEEEEEEEEE-CCEEEEEEEEEEEEEEECC Q ss_conf 504799-88655445779999988674047776---4323101--003679899986331-4403444401145753011 Q gi|254781078|r 87 NGLGIE-LGHNIQLEDFVFGINCHTTAAKDDST---FYRLKEK--YFIYGDVVLKAGYSV-DSLLIYGMGGFGGAYVIDS 159 (242) Q Consensus 87 ~g~G~~-~GYN~Q~~~~V~GvE~d~~~~~~~~~---~~~~~~k--~~~~Gsvr~r~Gya~-~~~l~Y~~gG~A~~~~~~~ 159 (242) .+.|+| +|||+|+++.|+|+|+++++...+.. ......+ ..|.+++++|.+|++ ++++||+.+|++.++++.+ T Consensus 51 ~~~G~~~~g~n~~~~~~v~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~a~~~~~~~~~~~~Y~~aGvg~~~~k~~ 130 (199) T COG3637 51 FGGGLKLAGYNLKYRYSVLGVEGSFTYTGQSGRYSGGSGKNQVDNKAWYGSLRAGPDYRINDRFSPYGGAGVAYGKVKTS 130 (199) T ss_pred CCCCEEEEEEEEECCCEEEEEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEECCEECCCCEEEEEECCEEEEEEEEE T ss_conf 54221667687335846788898876102466544766644444205789999865510586444404010576899871 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 00023666655554345565303798731667327876888898887448730003666666442201234138999886 Q gi|254781078|r 160 SLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGIN 239 (242) Q Consensus 160 ~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~Gin 239 (242) ....... ...++++.||++|||+||++|+||++++||||.|+++++ .. +. .++.+.+.|++|+||+ T Consensus 131 ---~~~~~~~----~~~s~~~~g~~~gAGv~~~~tdnv~vd~~Y~y~~~g~~~--~~----~~-~~~~~~~~~~v~vGv~ 196 (199) T COG3637 131 ---TDEVGGS----ADESKTKTGYAYGAGVQYNPTDNVAIDLGYEYSDFGKKD--GV----DG-GYSGDVKTHTVKVGVG 196 (199) T ss_pred ---CCCCCCC----CCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCCCC--CC----CC-CCCCCCCCCEEEEEEE T ss_conf ---2566741----167655788678410798155987998899988158864--56----75-4136401340899605 Q ss_pred EEC Q ss_conf 209 Q gi|254781078|r 240 MHF 242 (242) Q Consensus 240 Y~F 242 (242) |+| T Consensus 197 YkF 199 (199) T COG3637 197 YKF 199 (199) T ss_pred EEC T ss_conf 659 No 2 >pfam01389 OmpA_membrane OmpA-like transmembrane domain. The structure of OmpA transmembrane domain shows that it consists of an eight stranded beta barrel. This family includes some other distantly related outer membrane proteins with low scores. Probab=99.92 E-value=7.1e-23 Score=143.81 Aligned_cols=165 Identities=18% Similarity=0.152 Sum_probs=111.3 Q ss_pred EEEEEEEEEEEEEEEEECCCC-CCCCCCCCCCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCC-CCCCCCEE-EEE Q ss_conf 005744464410332221455-566666766755047998865544577999998867404777643-23101003-679 Q gi|254781078|r 55 SGNYVGLHISRLYETHPLADT-INRKTYNSLLPNGLGIELGHNIQLEDFVFGINCHTTAAKDDSTFY-RLKEKYFI-YGD 131 (242) Q Consensus 55 tG~YiG~~~G~~~~~~~~~d~-~~~~~~n~~~~~g~G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~~~-~~~~k~~~-~Gs 131 (242) ..||+|+++|.+.-...+... ...++.+ ..++|+++|| ||.++ |++|+.+......+... ....+.+. --+ T Consensus 5 n~~Y~G~~~G~s~~~~~~~~~~~~~~d~~---~~~~glf~GY--qfn~~-~a~E~gY~~lG~~~~~~~~~~~~~~~~~~~ 78 (175) T pfam01389 5 NTWYTGAKAGWASFHDTGLIEGNGPTHRN---SLTAGAFGGY--QINNY-LAAELGYDYLGRMPYKGKGKNNAHKAQGVT 78 (175) T ss_pred CCEEEEEEEEEEEECCCCCCCCCCCCCCC---CCEEEEEEEE--ECCCE-EEEEEEEEECCCCCCCCCCCCCCEEEEEEE T ss_conf 95789664025689576344578765678---7178999863--10311-226874243165433553434540468999 Q ss_pred EEEEEEEEEC-CEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEEC-C Q ss_conf 8999863314-4034444011457530110002366665555434556530379873166732787688889888744-8 Q gi|254781078|r 132 VVLKAGYSVD-SLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPH-H 209 (242) Q Consensus 132 vr~r~Gya~~-~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~-g 209 (242) +.+|..|.+. ++-+|+|+|+++.+.+.+.... .......+.+.+++++|+|+||+|||||++|+||||++. + T Consensus 79 l~~k~~~pl~d~~~ly~K~G~a~~~~d~~~~~~------~~~~~~~~d~~~~~~~G~G~eY~it~~~~~r~eYq~~~~~g 152 (175) T pfam01389 79 LSLKLSYPLTDDLDVYGKVGGALVRADYKFYED------ANGKTGNHDTGVSPLFALGVEYAVTPELAVRLEYQYLNNIG 152 (175) T ss_pred EEEEEEEEECCCEEEEEEEEEEEEECCCCEECC------CCCCCCCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEHHCC T ss_conf 999988861485679997106899613321025------66642235677053888888998075827888899922258 Q ss_pred CCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 730003666666442201234138999886209 Q gi|254781078|r 210 IHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +. .....+.|.|.+.+||+||| T Consensus 153 ~~-----------~~~~~~~D~~~~slGlsYrF 174 (175) T pfam01389 153 DL-----------HKQGKRPDNGSASLGISYRF 174 (175) T ss_pred CC-----------CCCCCCCCCCEEEEEEEEEC T ss_conf 53-----------35676677777999999777 No 3 >PRK10808 outer membrane protein A; Reviewed Probab=99.90 E-value=1.1e-21 Score=137.21 Aligned_cols=190 Identities=14% Similarity=0.111 Sum_probs=119.5 Q ss_pred CCHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCEEEECCCCCCEEEEEEEEEEEEEEEEEEECCCCCCCCCCCCCC Q ss_conf 20567888898743652023433202543457532110330376653000057444644103322214555666667667 Q gi|254781078|r 6 YGALFFVVFLSIVVPNHSLAVDLYLPRKIDLFNEADNNVEYQDDEYGIWSGNYVGLHISRLYETHPLADTINRKTYNSLL 85 (242) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~WtG~YiG~~~G~~~~~~~~~d~~~~~~~n~~~ 85 (242) +|-++.+.++.++..+.|+|+. +- ..||+|+.+|..+-. +.... +.....+-. T Consensus 2 KKt~ial~va~a~~a~~A~Aa~----~d---------------------n~wY~G~k~G~s~~~-d~~~~-~~~~~~d~~ 54 (347) T PRK10808 2 KKTAIAIAVALAGFATVAQAAP----KD---------------------NTWYTGAKLGWSQYH-DTGFI-NNNGPTHKN 54 (347) T ss_pred CHHHHHHHHHHHHHHHHHHCCC----CC---------------------CCEEEEEEEEEEEEC-CCCCC-CCCCCCCCC T ss_conf 4679999999997776431200----02---------------------887896421100121-33124-667776556 Q ss_pred CCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCC-CCCCCE-EEEEEEEEEEEEEC-CEEEEEEEEEEEEEEECCCCC Q ss_conf 550479988655445779999988674047776432-310100-36798999863314-403444401145753011000 Q gi|254781078|r 86 PNGLGIELGHNIQLEDFVFGINCHTTAAKDDSTFYR-LKEKYF-IYGDVVLKAGYSVD-SLLIYGMGGFGGAYVIDSSLE 162 (242) Q Consensus 86 ~~g~G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~~~~-~~~k~~-~~Gsvr~r~Gya~~-~~l~Y~~gG~A~~~~~~~~~~ 162 (242) ..++|+++|| ||.++ |++|..+.+.+.-..... ...+.+ .-.++-++..|.+. ++-+|++.|.++......... T Consensus 55 ~~g~G~f~GY--qfn~~-~a~E~GYd~lG~~~~~g~~~~~~~~~~g~~La~k~~ypl~d~l~ly~k~G~~y~~~d~~~~~ 131 (347) T PRK10808 55 QLGAGAFGGY--QVNPY-LGFEMGYDWLGRMPYKGDVSNGAFKAQGVQLTAKLGYPITDDLDVYTRLGGMVWRADTKSNV 131 (347) T ss_pred CCEEEEEEEE--EECCE-EEEEEEECCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCEEEEEEEEEEEECCCCCC T ss_conf 6124567547--72544-88987000028543245444764014678888888104567723257853489971234455 Q ss_pred CCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEEC-CCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEE Q ss_conf 2366665555434556530379873166732787688889888744-873000366666644220123413899988620 Q gi|254781078|r 163 KVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPH-HIHSVNNSNAKSDVERVDRKGNAHIASLGINMH 241 (242) Q Consensus 163 ~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~-g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~ 241 (242) ........+.+...+++|+|+||+|++||++|+||||++- +. ...+..+.|.|.+.+||+|| T Consensus 132 ------~~~~~~~~~d~g~s~~~g~GveY~lt~~~~~R~eYq~~~~igd-----------~~~~g~~~d~~~~slGlsY~ 194 (347) T PRK10808 132 ------TGGARGKDHDTGVSPVFAGGVEYAITPEWATRLEYQWTNNIGD-----------ANTIGTRPDNGLLSVGVSYR 194 (347) T ss_pred ------CCCCCCCCCCCCCCCEEEEEEEEEECCCCEEEEEEEEEEECCC-----------CCCCCCCCCCCEEEEEEEEC T ss_conf ------6654323565673620672035880637268888999810575-----------55557467775699999963 Q ss_pred C Q ss_conf 9 Q gi|254781078|r 242 F 242 (242) Q Consensus 242 F 242 (242) | T Consensus 195 F 195 (347) T PRK10808 195 F 195 (347) T ss_pred C T ss_conf 7 No 4 >pfam02462 Opacity Opacity family porin protein. Pathogenic Neisseria spp. possess a repertoire of phase-variable Opacity proteins that mediate various pathogen--host cell interactions. These proteins are integral membrane proteins related to other porins. Probab=99.37 E-value=8e-12 Score=83.31 Aligned_cols=121 Identities=15% Similarity=0.168 Sum_probs=83.7 Q ss_pred EEEEEECCEEEEEEEEEEEE-CCCCCCCCCCCCCEEEEEEEEEEEEEE---CCEEEEEEEEEEEEEEECCCCCCCCCCCC Q ss_conf 86554457799999886740-477764323101003679899986331---44034444011457530110002366665 Q gi|254781078|r 94 GHNIQLEDFVFGINCHTTAA-KDDSTFYRLKEKYFIYGDVVLKAGYSV---DSLLIYGMGGFGGAYVIDSSLEKVESDNS 169 (242) Q Consensus 94 GYN~Q~~~~V~GvE~d~~~~-~~~~~~~~~~~k~~~~Gsvr~r~Gya~---~~~l~Y~~gG~A~~~~~~~~~~~~~~~~~ 169 (242) || .|++|.+ -+|++.- +.+.....+ |.. |+-.-+=|.| +++-||+.+++++.+++.+..... T Consensus 2 GY--dfg~~R~--A~DY~~y~k~~~~~~~~--~~~---SlglsaiYDFdt~s~~kPYvGarv~~~~~~~~~~~~~----- 67 (126) T pfam02462 2 GY--RFGNLRF--AVDYTRYGKVKANSTDF--KGY---SLGASVIYDFDTQSPVKPYVGARVSTNGFKVTASAEL----- 67 (126) T ss_pred CC--CCCCEEE--EECHHHHCCCCCCCCCE--EEE---EEEEEEEEECCCCCCCCEEEEEEEEECCCCCCCCCCC----- T ss_conf 61--3587999--82012315213787535--767---6367899971589952142675676122210355535----- Q ss_pred CCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 5554345565303798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 170 KNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 170 ~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) ....+.+.++.||-+.|||.|.|||||+|.++|||...|+- + +.+.+.|-+++||.|+| T Consensus 68 -~~~~s~S~~~~g~gv~AGVgyditpnltLD~GYRY~~~G~l-----------e--~tr~~theas~GvRY~F 126 (126) T pfam02462 68 -GGSDSFSDTRIGLGVLAGVSYKLTPNVDLDTGYRYNRLGKV-----------T--VTNVRSGEASAGVRYKF 126 (126) T ss_pred -CCCCCCCCCEEEEEEEEEEEEECCCCEEEECCEEEEECCCC-----------C--CCEECCCCEEEEEEEEC T ss_conf -87556503359998874368832787377125288004641-----------3--43020121368866639 No 5 >pfam02530 Porin_2 Porin subfamily. This family consists of porins from the alpha subdivision of Proteobacteria the members of this family are related to pfam00267. The porins form large aqueous channels in the cell membrane allowing the selective entry of hydrophilic compounds this so called 'molecular sieve' is found in the cell walls of gram negative bacteria. Probab=99.33 E-value=4.5e-11 Score=79.19 Aligned_cols=88 Identities=13% Similarity=0.110 Sum_probs=60.8 Q ss_pred EEEEEEEEEEEEEEC-CEEEEEEEEEEEEEEECCCCCC-----------------------CCCCC-CCCCCCCCCCCCE Q ss_conf 036798999863314-4034444011457530110002-----------------------36666-5555434556530 Q gi|254781078|r 127 FIYGDVVLKAGYSVD-SLLIYGMGGFGGAYVIDSSLEK-----------------------VESDN-SKNAKGRFDGHGS 181 (242) Q Consensus 127 ~~~Gsvr~r~Gya~~-~~l~Y~~gG~A~~~~~~~~~~~-----------------------~~~~~-~~~~~~~~d~t~~ 181 (242) .|.+.+|+|+...++ +..+++.|+++-+-.+...... ..... ............. T Consensus 246 ~~~~Av~~~~~~~~g~g~s~~l~g~Y~~ga~~Y~~~~~~~~~~~~Wav~~~~~~~~t~k~~~~~~~~~~~~~~~~~~d~~ 325 (378) T pfam02530 246 KEEGAVRARLSVKAGDGDTLNIAGGYASGANRYTGDSEYHNWGGNWAVWAGYTYKATDKTAITPGAQWGGAGNVCNPDFD 325 (378) T ss_pred CCCEEEEEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEECCCCEEEEEECEEEECCCCCCCCCCC T ss_conf 42504489999986798458999998158742235544434677503776568623330898203375024543477754 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEECCCCEEE Q ss_conf 379873166732787688889888744873000 Q gi|254781078|r 182 SVVLGIGLDYMVNYDISLSASYRYIPHHIHSVN 214 (242) Q Consensus 182 G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~ 214 (242) .|.+|+.+.|.+.+|+.+++|++|.|++.+... T Consensus 326 ~~~vg~nv~ytpV~nl~~~~Ev~Yt~~d~k~~g 358 (378) T pfam02530 326 GWAVGANVDYTIVDNLTFTAEVRYTDLGQKYFG 358 (378) T ss_pred EEEEEEEEEEEECCCEEEEEEEEEEECCCCCCC T ss_conf 268888788788099689789999746874456 No 6 >PRK09408 ompX outer membrane protein X; Provisional Probab=99.23 E-value=2e-09 Score=70.22 Aligned_cols=125 Identities=17% Similarity=0.198 Sum_probs=91.6 Q ss_pred EEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEEECC-EEEEEEEEEEEEEEECCCCCCCCCCC Q ss_conf 79988655445779999988674047776432310100367989998633144-03444401145753011000236666 Q gi|254781078|r 90 GIELGHNIQLEDFVFGINCHTTAAKDDSTFYRLKEKYFIYGDVVLKAGYSVDS-LLIYGMGGFGGAYVIDSSLEKVESDN 168 (242) Q Consensus 90 G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~~~~~~~k~~~~Gsvr~r~Gya~~~-~l~Y~~gG~A~~~~~~~~~~~~~~~~ 168 (242) |+.+-|+|++++-.+|+-+.+++...|...........-+.++.+---|-+++ +=+|+..|++.++++...... T Consensus 46 G~NlKYrYE~dn~~~Gvi~Sftyt~~d~~~~~~~~~~~~YySl~aGPsyR~NdyvS~Ygl~G~a~~k~~~~~~~~----- 120 (171) T PRK09408 46 GFNLKYRYEEDNSPLGVIGSFTYTEKSRTADSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTEYPT----- 120 (171) T ss_pred CEEEEEEEEECCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEEEECCEEEECCEEEEEEECCEEEEEEEECCCCC----- T ss_conf 448887667369867789999998166566775312046998953442786243752255013368997324676----- Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 55554345565303798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 169 SKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 169 ~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) .+.+...++.+.||||++-..+||++.+.|.+-.+. +.+..+..+||.||| T Consensus 121 -----~~~s~s~t~fayGAGvQ~NP~~nvaid~gYE~S~~~------------------~~~~~~f~vGvGYRF 171 (171) T PRK09408 121 -----YKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIR------------------SVDVGTWIAGVGYRF 171 (171) T ss_pred -----CCCCCCCCCEEEEEEEEECCCCCEEEEEEEEECCCC------------------CEEECCEEEEEEECC T ss_conf -----445611021466654798067787999876111458------------------727685998114139 No 7 >PRK10959 outer membrane protein W; Provisional Probab=99.07 E-value=4.6e-08 Score=62.73 Aligned_cols=108 Identities=13% Similarity=0.191 Sum_probs=71.5 Q ss_pred EEEEEEEE-EE-CCEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEEC Q ss_conf 98999863-31-44034444011457530110002366665555434556530379873166732787688889888744 Q gi|254781078|r 131 DVVLKAGY-SV-DSLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPH 208 (242) Q Consensus 131 svr~r~Gy-a~-~~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~ 208 (242) ++.++-=+ .. .++-||+..|+.|...-+........ .......+. +..+|+.+=+|+||++++||++.+..+|++. T Consensus 103 tl~~qYhF~~~~~~~rPYVGaGvnYt~F~d~~~~~~~~-~~~~~~l~l-~~S~G~a~q~G~dy~l~~~W~~n~dv~y~~i 180 (212) T PRK10959 103 TLMAQYYFGDAQSKLRPYVGAGVNYTTFFDNEFNDTGK-DAGLSDLSL-KDSWGAAGQVGLDYLINRDWLVNASVWYMDI 180 (212) T ss_pred EEEEEECCCCCCCCEEEEEEEEEEEEEEECCCCCCCCC-CCCCCCEEE-CCCEEEEEEEEEEEEECCCEEEEEEEEEEEC T ss_conf 99999734899984513576106799997132456555-677451066-2746079998789993798399999999962 Q ss_pred CCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 8730003666666442201234138999886209 Q gi|254781078|r 209 HIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 209 g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) .++..-..++ .....+.+.|.-++.+||.||| T Consensus 181 ~T~a~~~~~~--~~~~~~v~idP~v~~vgvgYrF 212 (212) T PRK10959 181 DTDVKFKLGG--AQQKISTRLDPWVFMFSAGYRF 212 (212) T ss_pred CCEEEEECCC--CEEEEEEEECCEEEEEEEEEEC T ss_conf 3459996388--4066047708789997759989 No 8 >pfam06316 Ail_Lom Enterobacterial Ail/Lom protein. This family consists of several bacterial and phage Ail/Lom-like proteins. The Yersinia enterocolitica Ail protein is a known virulence factor. Proteins in this family are predicted to consist of eight transmembrane beta-sheets and four cell surface-exposed loops. It is thought that Ail directly promotes invasion and loop 2 contains an active site, perhaps a receptor-binding domain. The phage protein Lom is expressed during lysogeny, and encode host-cell envelope proteins. Lom is found in the bacterial outer membrane, and is homologous to virulence proteins of two other enterobacterial genera. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=99.01 E-value=4.7e-08 Score=62.68 Aligned_cols=133 Identities=13% Similarity=0.176 Sum_probs=88.2 Q ss_pred EEEEEEEEEECCEEEEEEEEEEEECCCCCCCCC---------CCCCEEEEEEEEEEEEEECC-EEEEEEEEEEEEEEECC Q ss_conf 799886554457799999886740477764323---------10100367989998633144-03444401145753011 Q gi|254781078|r 90 GIELGHNIQLEDFVFGINCHTTAAKDDSTFYRL---------KEKYFIYGDVVLKAGYSVDS-LLIYGMGGFGGAYVIDS 159 (242) Q Consensus 90 G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~~~~~---------~~k~~~~Gsvr~r~Gya~~~-~l~Y~~gG~A~~~~~~~ 159 (242) |+.+-|+|++++- +|+-+-+++...+...... ..+.+ +.++.+=--|-++. +=+|+..|+|..+++.. T Consensus 50 G~NlKYRYE~d~~-lGvIgSfTy~~~~~~~~~~~~~~~~~~~~~~~~-YySlmaGPsyR~Nd~vS~Ygl~G~a~~k~~~~ 127 (199) T pfam06316 50 GINVKYRYEFTDT-LGLITSFSYANAEDEQKTHYNDTRWHEDSVRNR-WFSVMAGPSVRVNEWFSAYALAGVAYARVSSF 127 (199) T ss_pred CEEEEEEEECCCC-EEEEEEEEEECCCCCCCEECCCCCCCCCCCCCE-EEEEEECCCEEECCEEEEEEECCEEEEEEEEC T ss_conf 5388866441798-078999999615653120205762025642203-89997346268511012012123136776402 Q ss_pred CCCCCC--C-----CCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEE Q ss_conf 000236--6-----665555434556530379873166732787688889888744873000366666644220123413 Q gi|254781078|r 160 SLEKVE--S-----DNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAH 232 (242) Q Consensus 160 ~~~~~~--~-----~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h 232 (242) ...... . ........+.+...++.+.||||++-..+||++.+-|.+-.. .+.+.+ T Consensus 128 ~~d~~~~~~~~~~~~~~~~~s~~~s~s~tsfAyGAGvQfNP~~nvaiD~gYEgS~~------------------~d~~~~ 189 (199) T pfam06316 128 AGDYFRLTDDEGKKHDHLTGSDDARRSHTALAWGAGVQFNPTENIAVDLAYEASGR------------------GDWRTD 189 (199) T ss_pred CCCEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCEEEEEEEEECCC------------------CCCEEC T ss_conf 46503312555654555444444544433224420279715766799987763034------------------870517 Q ss_pred EEEEEEEEEC Q ss_conf 8999886209 Q gi|254781078|r 233 IASLGINMHF 242 (242) Q Consensus 233 ~v~~GinY~F 242 (242) ...+||.||| T Consensus 190 gf~vGVGYRF 199 (199) T pfam06316 190 AFIVGTGYKF 199 (199) T ss_pred CEEEEEEECC T ss_conf 5899514409 No 9 >pfam03922 OmpW OmpW family. This family includes outer membrane protein W (OmpW) proteins from a variety of bacterial species. This protein may form the receptor for S4 colicins in E. coli. Probab=98.91 E-value=2.5e-07 Score=58.73 Aligned_cols=104 Identities=15% Similarity=0.186 Sum_probs=71.9 Q ss_pred EEEEEEEE--EECCEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEEC Q ss_conf 98999863--3144034444011457530110002366665555434556530379873166732787688889888744 Q gi|254781078|r 131 DVVLKAGY--SVDSLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPH 208 (242) Q Consensus 131 svr~r~Gy--a~~~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~ 208 (242) ++.++-=+ +-.++-||+..|+.|...-....... .....+. +..+|+++=+|++|++++||++.+..+|++. T Consensus 84 tlt~qYhf~~~~~~~rPYVGaGvnYt~f~~~~~~~~-----~~~~~~~-d~s~G~a~q~G~dy~l~~~w~~n~dv~~~~i 157 (189) T pfam03922 84 TLTAQYHFGDASSGFRPYVGAGVNYTTFFDEKLGST-----GATDLKL-KDSWGAAGQAGVDYMVNDDWLVNMDVWYMFI 157 (189) T ss_pred EEEEEEEECCCCCCEEEEEEEEEEEEEEECCCCCCC-----CCCEEEE-CCCEEEEEEEEEEEEECCCEEEEEEEEEEEC T ss_conf 999999348999965056760168999983445888-----8546897-0654089999889996898399999999962 Q ss_pred CCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 8730003666666442201234138999886209 Q gi|254781078|r 209 HIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 209 g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) .++..-.... .....+.+.|.-++.+||.||| T Consensus 158 ~t~at~~~g~--~~~~~~v~lDP~v~~~gvgyrF 189 (189) T pfam03922 158 DTTASYKLGG--AKLKTKVKLDPWVFSIGLGYRF 189 (189) T ss_pred CCEEEEECCC--CEEEEEEEECCEEEEEEEEEEC T ss_conf 4569997288--5078579958889998769879 No 10 >pfam03349 Toluene_X Outer membrane protein transport protein (OMPP1/FadL/TodX). This family includes TodX from Pseudomonas putida F1 and TbuX from Ralstonia pickettii PKO1. These are membrane proteins of uncertain function that are involved in toluene catabolism. Related proteins involved in the degradation of similar aromatic hydrocarbons are also in this family, such as CymD. This family also includes FadL involved in translocation of long-chain fatty acids across the outer membrane. It is also a receptor for the bacteriophage T2. Probab=98.85 E-value=2.2e-06 Score=53.49 Aligned_cols=61 Identities=23% Similarity=0.414 Sum_probs=37.8 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 3798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 182 SVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 182 G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) -+.+.+|+.|.++++|++.+-|.|+...+.+++..+.....-..+.+.+.|++.+++|||| T Consensus 355 r~~~s~G~~y~~~~~~~~d~ay~y~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~F 415 (415) T pfam03349 355 RTWLSLGATYAFTKDLTIDAGFSYLFGKKVSIKENNVTNGTLPGTSSGSAHLFALSYNYRF 415 (415) T ss_pred CEEEEEEEEEECCCCCEEEEEEEEEECCCCCEEECCCCCCEEEEEEEEEEEEEEEEEEEEC T ss_conf 6699940589859995899999999606553100167786078999748999999887679 No 11 >COG2067 FadL Long-chain fatty acid transport protein [Lipid metabolism] Probab=98.52 E-value=6.8e-06 Score=50.84 Aligned_cols=147 Identities=18% Similarity=0.143 Sum_probs=74.9 Q ss_pred CCCEEEEEEEEEEEEC-CEEEEE----EEEEEEECCCC------------------CCCC--CCCCCEEEEEEEEEEEEE Q ss_conf 7550479988655445-779999----98867404777------------------6432--310100367989998633 Q gi|254781078|r 85 LPNGLGIELGHNIQLE-DFVFGI----NCHTTAAKDDS------------------TFYR--LKEKYFIYGDVVLKAGYS 139 (242) Q Consensus 85 ~~~g~G~~~GYN~Q~~-~~V~Gv----E~d~~~~~~~~------------------~~~~--~~~k~~~~Gsvr~r~Gya 139 (242) ...++|-++|-.|+++ +.-+|+ |.+....+.-. .... .+.+.....++-..+-+. T Consensus 216 ~~~g~G~~lG~~ye~~d~~r~g~~YrS~V~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~l~lP~~~el~~~~~ 295 (440) T COG2067 216 EDWGFGWNLGVAYEINDNNRIGLSYRSKVKMDFKGGYESSLNLGMAGLPIGPLALGGVTPGSGKLTLSLPASAELSGQHK 295 (440) T ss_pred CCCCCEEEEEEEECCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEEECCCEEEEEEEEC T ss_conf 56422067888981686405778973055674257732102554444444322447430234347770585799722640 Q ss_pred E-CCEEEEEEEEEEEEEEECCCCCCCCCCC--CCCCCCCCCC-CCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEEC Q ss_conf 1-4403444401145753011000236666--5555434556-5303798731667327876888898887448730003 Q gi|254781078|r 140 V-DSLLIYGMGGFGGAYVIDSSLEKVESDN--SKNAKGRFDG-HGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNN 215 (242) Q Consensus 140 ~-~~~l~Y~~gG~A~~~~~~~~~~~~~~~~--~~~~~~~~d~-t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~ 215 (242) + +++++.... . ++..+.....+-+. .......... +|.+|++..|++|.+++.|++|+.+.|=.-..+..+. T Consensus 296 ~~d~w~~~~s~--~--wT~WS~f~~l~~~~~~~~~~~~~~~~~yrD~wt~a~G~~Y~~nd~~tlragiayD~s~s~~~~~ 371 (440) T COG2067 296 VADQWAIHGSV--K--WTDWSSFDKLDFVFTFGKTLFAKTEDGYRDTWTVALGTTYKFNDQWTLRAGIAYDQSPSPAQNR 371 (440) T ss_pred CCCCEEEEEEE--E--EEECCCCCEEEEEECCCCCCCCCCCCCCCCCCEEEEECEEECCCCCEEEEEEEECCCCCCCCCC T ss_conf 38870799889--9--8651677448889768875224653341043179400106738663375003345788854455 Q ss_pred CCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 666666442201234138999886209 Q gi|254781078|r 216 SNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 216 ~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) .. .-.+.|.+.+.+|++|+| T Consensus 372 ~~-------~iPd~Dr~~~s~G~~Y~~ 391 (440) T COG2067 372 SI-------SIPDTDRWWLSLGTTYKF 391 (440) T ss_pred CC-------CCCCCCCEEEECCCEEEC T ss_conf 43-------247877579966647725 No 12 >PRK10716 long-chain fatty acid outer membrane transporter; Provisional Probab=98.48 E-value=6.5e-05 Score=45.49 Aligned_cols=53 Identities=23% Similarity=0.384 Sum_probs=26.3 Q ss_pred EEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 8731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 185 LGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 185 vGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +.+|+.|.++++|++.+-|.|++...++++..+- ..+.+.+.|++-+.+||+| T Consensus 385 ~S~G~~y~~~~~~tvD~ay~yi~~~~~~i~~~~~-----~~~~~~~a~i~g~~~~Y~F 437 (437) T PRK10716 385 LSAGTTYAFNKDASVDVGVSYMHGQSVKINEGPY-----QFESEGKAWLFGTNFNYAF 437 (437) T ss_pred EEEEEEEECCCCCEEEEEEEEEECCCCCEECCCE-----EEEEEEEEEEEEEEEEEEC T ss_conf 9812688548895799988999426641542403-----5687234899998888869 No 13 >PRK10716 long-chain fatty acid outer membrane transporter; Provisional Probab=98.47 E-value=2.2e-05 Score=48.10 Aligned_cols=60 Identities=13% Similarity=0.214 Sum_probs=45.0 Q ss_pred CCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 5565303798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 176 FDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 176 ~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) ...|+..|.+..|+||.++++|.+|+.|-| +...++..... ..=.+.|.+.+.+|.+|++ T Consensus 334 ~~nw~Dtw~~~~G~~Y~~~~~l~LRaG~aY---D~sPv~~~~~~----~~iPd~dR~~~S~G~~y~~ 393 (437) T PRK10716 334 HEGFKDAYRIALGTTYYYDDNWTFRTGIAF---DDSPVPAQNRS----ISIPDQDRFWLSAGTTYAF 393 (437) T ss_pred CCCCCCCEEEEEEEEEECCCCCEEEEEEEE---CCCCCCCCCCC----CCCCCCCCEEEEEEEEEEC T ss_conf 457741007997899987998189887787---46888888656----2076888859981268854 No 14 >COG3047 OmpW Outer membrane protein W [Cell envelope biogenesis, outer membrane] Probab=98.43 E-value=1.9e-06 Score=53.86 Aligned_cols=109 Identities=14% Similarity=0.117 Sum_probs=75.3 Q ss_pred EEEEEEEEEE--EEECCEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEE Q ss_conf 3679899986--33144034444011457530110002366665555434556530379873166732787688889888 Q gi|254781078|r 128 IYGDVVLKAG--YSVDSLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRY 205 (242) Q Consensus 128 ~~Gsvr~r~G--ya~~~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y 205 (242) +.-+++++-= -+.+.+-||+..|+.|...-........ ...-.-+..+|..+=+|++|+++++|.++..-+| T Consensus 103 lPpTlt~Qyhf~~~~s~frPYvGaG~nYt~f~~e~~~~~~------~~~lkl~ns~G~a~q~G~Dy~l~~~w~ln~dV~~ 176 (213) T COG3047 103 LPPTLTAQYHFGDASSKFRPYVGAGLNYTTFFDEKFGSTG------LSDLKLKDSWGAAGQAGVDYMLNDDWLLNMDVKY 176 (213) T ss_pred ECCEEEEEEECCCCCCCCCCCEEECCCEEEEEEECCCCCC------CCEEEECCCCCEEEEEEEEEEECCCEEEEEEEEE T ss_conf 0868899973168655544214503334899850147777------5404513554611685455890456188778899 Q ss_pred EECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 7448730003666666442201234138999886209 Q gi|254781078|r 206 IPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 206 ~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +++.++.--..+........+.|.|.=++.+||.||| T Consensus 177 ~~i~t~a~~~~~~g~~~~~~kvkLdPwv~~~GigyRF 213 (213) T COG3047 177 MFIDTTAKYKAGLGGAKLKSKVKLDPWVFMIGIGYRF 213 (213) T ss_pred EECCCCEEECCCCCCCEEECCEECCCEEEEEEEEECC T ss_conf 9615540342466775054103417649995326619 No 15 >pfam07437 YfaZ YfaZ precursor. This family contains the precursor of the bacterial protein YfaZ (approximately 180 residues long). Many members of this family are hypothetical proteins. Probab=98.34 E-value=8.4e-05 Score=44.87 Aligned_cols=102 Identities=18% Similarity=0.095 Sum_probs=67.9 Q ss_pred CCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEEECC-EEEEEEEEEEEEEEECCCCCC Q ss_conf 7550479988655445779999988674047776432310100367989998633144-034444011457530110002 Q gi|254781078|r 85 LPNGLGIELGHNIQLEDFVFGINCHTTAAKDDSTFYRLKEKYFIYGDVVLKAGYSVDS-LLIYGMGGFGGAYVIDSSLEK 163 (242) Q Consensus 85 ~~~g~G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~~~~~~~k~~~~Gsvr~r~Gya~~~-~l~Y~~gG~A~~~~~~~~~~~ 163 (242) ++...++-.+++.|++++.+.+-+-..+.+.+.... .+--.+-+|+.+...+ +=+|+.+=+|= +. T Consensus 58 dg~~~~~gl~~~~~~Gp~~~~vGgK~~y~~~~~g~~------G~Ava~GG~~~~~~~~~~sl~Ge~YyaP-----~~--- 123 (180) T pfam07437 58 DGDVAGGGLGMNLPAGPHHATVGAKGSYLWPDEGPN------GSAVAVGGGLALAIGPSFSLQGEAYYAP-----SV--- 123 (180) T ss_pred CCCEECCEEEEECCCCCEEEEEEEEEEEECCCCCCC------CEEEEECCEEECCCCCCEEEEEEEEECC-----HH--- T ss_conf 996732404674056783898203699827666886------3189862568624588578988789880-----67--- Q ss_pred CCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCC Q ss_conf 366665555434556530379873166732787688889888744873 Q gi|254781078|r 164 VESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIH 211 (242) Q Consensus 164 ~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~ 211 (242) . .+ ....=+-+.+|++|.+.+++++.+.|||+++..+ T Consensus 124 -----l-----ss-Gvd~y~ea~~gv~~~v~~pl~v~aGYRyi~~~~~ 160 (180) T pfam07437 124 -----L-----SS-GVDSYYEANSGVRYNINRPMALYVGYRYINFEGK 160 (180) T ss_pred -----H-----CC-CCHHHEEECCCEEEEECCCCEEEEEEEEEEEECC T ss_conf -----3-----25-6203337346169997377289975799986636 No 16 >pfam03797 Autotransporter Autotransporter beta-domain. Secretion of protein products occurs by a number of different pathways in bacteria. One of these pathways known as the type V pathway was first described for the IgA1 protease. The protein component that mediates secretion through the outer membrane is contained within the secreted protein itself, hence the proteins secreted in this way are called autotransporters. This family corresponds to the presumed integral membrane beta-barrel domain that transports the protein. This domain is found at the C terminus of the proteins it occurs in. The N terminus contains the variable passenger domain that is translocated across the membrane. Once the passenger domain is exported it is cleaved auto-catalytically in some proteins, in others a different protease is used and in some cases no cleavage occurs. Probab=98.26 E-value=0.00016 Score=43.29 Aligned_cols=151 Identities=13% Similarity=0.068 Sum_probs=88.0 Q ss_pred EEEEEEEEEEEEEEEEEECCCCC--CCCCCCCC--CCCEEEEEEEEEEEE-CCEEEEEEEEEEEECCC--C------CCC Q ss_conf 00057444644103322214555--66666766--755047998865544-57799999886740477--7------643 Q gi|254781078|r 54 WSGNYVGLHISRLYETHPLADTI--NRKTYNSL--LPNGLGIELGHNIQL-EDFVFGINCHTTAAKDD--S------TFY 120 (242) Q Consensus 54 WtG~YiG~~~G~~~~~~~~~d~~--~~~~~n~~--~~~g~G~~~GYN~Q~-~~~V~GvE~d~~~~~~~--~------~~~ 120 (242) +.++|+-+.+++++...+..... ......+. ....+.+.+||+|++ ++|.+-..+.+.+.... . ... T Consensus 84 ~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~gy~~~~~~~~~i~P~~~l~y~~~~~~~f~E~g~~~~ 163 (265) T pfam03797 84 DGGLYLDGVLAYGRFDNDVKRLGTSTGTAKGDYDSHGLGASLEAGYRFKLSGNLTLTPFAGLQYVYLRQDGFTESGARSF 163 (265) T ss_pred CCCCEEEEEEEEEEEECEEEEECCCCCCCEEEEEEEEEEEEEEEEECCCCCCCEEEEEEEEEEEEEEECCCEEEECCCCC T ss_conf 89836976788899715048730678730222802499999998762332697599699999999987687667099843 Q ss_pred CCCCCCEEEEEEEEEEEEEE----C--CEEEEEEEEEEEEEEECCCCCCCC-----CCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 23101003679899986331----4--403444401145753011000236-----666555543455653037987316 Q gi|254781078|r 121 RLKEKYFIYGDVVLKAGYSV----D--SLLIYGMGGFGGAYVIDSSLEKVE-----SDNSKNAKGRFDGHGSSVVLGIGL 189 (242) Q Consensus 121 ~~~~k~~~~Gsvr~r~Gya~----~--~~l~Y~~gG~A~~~~~~~~~~~~~-----~~~~~~~~~~~d~t~~G~tvGaGV 189 (242) .++.+..-+-+++.++|..+ . ++-||+..++++... ........ ...........+..+....+++|+ T Consensus 164 ~l~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~ 242 (265) T pfam03797 164 ALSVDSQSLTNLSGPLGLRLEYRFSLRSLKPYATLGWAYEFD-DYRQVPTNGATLLASGGSWVTSGTPLARNAAELGAGV 242 (265) T ss_pred CEEEECCCCCEEEEEEEEEEEEEECCCEEEEEEEEEEEEECC-CCCCCEEEEEEEECCCCCEEECCCCCCCEEEEEEEEE T ss_conf 517844666589999999999985287364899999999977-8875503999972689842996756665499999889 Q ss_pred EEECCCCEEEEEEEEE Q ss_conf 6732787688889888 Q gi|254781078|r 190 DYMVNYDISLSASYRY 205 (242) Q Consensus 190 EYait~nwslraEY~Y 205 (242) ++.+++++++.++|+. T Consensus 243 ~~~~~~~~~l~~~~~~ 258 (265) T pfam03797 243 ELALGKNLSLFLNYSG 258 (265) T ss_pred EEEECCCEEEEEEEEE T ss_conf 9998998699999970 No 17 >pfam05736 OprF OprF membrane domain. This domain represents the presumed membrane spanning region of the OprF proteins. This region is involved in channel formation and is thought to form an 8-stranded beta-barrel. Probab=98.10 E-value=7.4e-05 Score=45.18 Aligned_cols=125 Identities=18% Similarity=0.195 Sum_probs=73.5 Q ss_pred CCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCC-CCCEEEE-EEEEEEEEEE----CCEEEEEEEEEEEEEEECC Q ss_conf 55047998865544577999998867404777643231-0100367-9899986331----4403444401145753011 Q gi|254781078|r 86 PNGLGIELGHNIQLEDFVFGINCHTTAAKDDSTFYRLK-EKYFIYG-DVVLKAGYSV----DSLLIYGMGGFGGAYVIDS 159 (242) Q Consensus 86 ~~g~G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~~~~~~-~k~~~~G-svr~r~Gya~----~~~l~Y~~gG~A~~~~~~~ 159 (242) +..+|+-+||++.-+ +|.+..+...+.....-. ...+.-| ..|.-.=|-| +.+-||+.+|++-+...+. T Consensus 50 g~~~G~~iGY~lTdd-----vsl~l~y~~~~~~r~~~~~g~~~i~g~~~rLDalYHF~~pg~~lrPYvs~G~gh~s~~~~ 124 (184) T pfam05736 50 GDLPGGSIGYFLTDD-----VELNLSYDETHDTRSTDGTGNQKVGGDLTSLDAQYHFGTPGDGLRPYVSAGFAHQSITNV 124 (184) T ss_pred CCCCEEEEEEECCCC-----EEEEEEECCCCCHHCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCEEEEECCCCCEEECC T ss_conf 896403652571788-----389988734210001357666103563466777997389876633727731465200022 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 00023666655554345565303798731667327876888898887448730003666666442201234138999886 Q gi|254781078|r 160 SLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGIN 239 (242) Q Consensus 160 ~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~Gin 239 (242) . .....+..+| =..+|+|+.|-||+||.+|+|-|=+- .+|.....-.+++||+ T Consensus 125 ~---------~g~~Grd~sT--~~N~G~G~Ky~itdn~~aRa~vra~y----------------~lD~~~~d~~a~vGl~ 177 (184) T pfam05736 125 A---------DGSQGRDQST--FANIGAGAKYYFTENFYARAGVEAQY----------------GLDNGDWEWAALVGLG 177 (184) T ss_pred C---------CCCCCCCCCE--EEEECCCEEEEECCCEEEEECHHHHC----------------CCCCCCCCEEEEEEEE T ss_conf 5---------6877656441--67634526899605447761325310----------------4778861200455249 Q ss_pred EEC Q ss_conf 209 Q gi|254781078|r 240 MHF 242 (242) Q Consensus 240 Y~F 242 (242) |.| T Consensus 178 ~~F 180 (184) T pfam05736 178 VNF 180 (184) T ss_pred EEC T ss_conf 960 No 18 >COG2067 FadL Long-chain fatty acid transport protein [Lipid metabolism] Probab=97.94 E-value=0.00082 Score=39.45 Aligned_cols=104 Identities=18% Similarity=0.213 Sum_probs=65.7 Q ss_pred EEE--EEEEEEEEEECCEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEE Q ss_conf 367--989998633144034444011457530110002366665555434556530379873166732787688889888 Q gi|254781078|r 128 IYG--DVVLKAGYSVDSLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRY 205 (242) Q Consensus 128 ~~G--svr~r~Gya~~~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y 205 (242) |.- +++.=+.|.++.-| =+.+|++|-+. .+..... ..+. --...|++++|+-|.+++|+++++-|-| T Consensus 335 yrD~wt~a~G~~Y~~nd~~-tlragiayD~s---~s~~~~~------~~~i-Pd~Dr~~~s~G~~Y~~t~n~~vd~g~ay 403 (440) T COG2067 335 YRDTWTVALGTTYKFNDQW-TLRAGIAYDQS---PSPAQNR------SISI-PDTDRWWLSLGTTYKFTKNLEVDAGYAY 403 (440) T ss_pred CCCCCEEEEECEEECCCCC-EEEEEEEECCC---CCCCCCC------CCCC-CCCCCEEEECCCEEECCCCEEEEEEEEE T ss_conf 1043179400106738663-37500334578---8854455------4324-7877579966647725998389988899 Q ss_pred EECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 7448730003666666442201234138999886209 Q gi|254781078|r 206 IPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 206 ~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +-..+.+++...........+.+...|.+.+.+||+| T Consensus 404 ~~~~~~s~~~~~~~g~~~~~~~~~~a~~~~ls~~ykF 440 (440) T COG2067 404 LKGKSVSINEVRGAGVTLSGFSKGSAHLFGLSLNYKF 440 (440) T ss_pred EEECCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEC T ss_conf 9841442102456773002311117999998877619 No 19 >pfam03349 Toluene_X Outer membrane protein transport protein (OMPP1/FadL/TodX). This family includes TodX from Pseudomonas putida F1 and TbuX from Ralstonia pickettii PKO1. These are membrane proteins of uncertain function that are involved in toluene catabolism. Related proteins involved in the degradation of similar aromatic hydrocarbons are also in this family, such as CymD. This family also includes FadL involved in translocation of long-chain fatty acids across the outer membrane. It is also a receptor for the bacteriophage T2. Probab=97.84 E-value=0.00063 Score=40.08 Aligned_cols=146 Identities=14% Similarity=0.109 Sum_probs=79.4 Q ss_pred CCEEEEEEEEEEEEC-CEEEEEE----EEEEE------ECCCC---------CCCCCCCCCEEEEEEEEEEEEEEC-CEE Q ss_conf 550479988655445-7799999----88674------04777---------643231010036798999863314-403 Q gi|254781078|r 86 PNGLGIELGHNIQLE-DFVFGIN----CHTTA------AKDDS---------TFYRLKEKYFIYGDVVLKAGYSVD-SLL 144 (242) Q Consensus 86 ~~g~G~~~GYN~Q~~-~~V~GvE----~d~~~------~~~~~---------~~~~~~~k~~~~Gsvr~r~Gya~~-~~l 144 (242) ..++|.++|-.||.. ++.+|+- .+... ..... .....+.+.+.-..+..=+.|.+. ++. T Consensus 195 ~~g~G~~~G~~~~~~~~~~~G~sYrS~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~~~~g~~~~~~~~~~ 274 (415) T pfam03349 195 AWGVGWNLGVLYEPTDNTRIGASYRSKVKLDLKGKYTLSAVSGAAGNLAYGGSPLSGELRIQLPASLELGISHQFNDQWA 274 (415) T ss_pred EEEEEEEEEEEEECCCCCEEEEEEECCEEEEEECCEEEEECCCCCCCCCCCCCCCCCEEEECCCCEEEEEEEEECCCCEE T ss_conf 69840899999980899889999943127998521435505654443456776644258735891899999998489889 Q ss_pred EEEEEEEEEEEEECCCCCCC-----CCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCC Q ss_conf 44440114575301100023-----6666555543455653037987316673278768888988874487300036666 Q gi|254781078|r 145 IYGMGGFGGAYVIDSSLEKV-----ESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAK 219 (242) Q Consensus 145 ~Y~~gG~A~~~~~~~~~~~~-----~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~ 219 (242) +-+.. -| +..+..... ..............++.-|.+-.|+||.++++|.+|+.|.|-.- .+.... T Consensus 275 l~~d~--~~--~~WS~~~~l~~~~~~~~~~~~~~~~~~~w~d~~~~~~G~eY~~~~~l~LRaG~~yd~s---pv~~~~-- 345 (415) T pfam03349 275 VHASV--KR--TFWSDFDKLIFVLFVFAKGGTDLSLPQNYRDTTTYALGTDYQFNDRWTLRAGYAYDQS---ATPAES-- 345 (415) T ss_pred EEEEE--EE--EEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEECCC---CCCCCC-- T ss_conf 99999--99--8730000102234455677645346768415008987899985998699974765467---878776-- Q ss_pred CCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 66442201234138999886209 Q gi|254781078|r 220 SDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 220 ~~~~~~~~~~~~h~v~~GinY~F 242 (242) ....-.+.+.|.+.+|..|++ T Consensus 346 --~~~~~Pd~dr~~~s~G~~y~~ 366 (415) T pfam03349 346 --RSARIPDTDRTWLSLGATYAF 366 (415) T ss_pred --CCEECCCCCCEEEEEEEEEEC T ss_conf --360045778669994058985 No 20 >smart00869 Autotransporter Autotransporter beta-domain. Secretion of protein products occurs by a number of different pathways in bacteria. One of these pathways known as the type IV pathway was first described for the IgA1 protease. The protein component that mediates secretion through the outer membrane is contained within the secreted protein itself, hence the proteins secreted in this way are called autotransporters. This family corresponds to the presumed integral membrane beta-barrel domain that transports the protein. This domain is found at the C-terminus of the proteins it occurs in. The N-terminus contains the variable passenger domain that is translocated across the membrane. Once the passenger domain is exported it is cleaved auto-catalytically in some proteins, in others a different peptidase is used and in some cases no cleavage occurs. Probab=97.79 E-value=0.002 Score=37.40 Aligned_cols=153 Identities=13% Similarity=0.089 Sum_probs=71.4 Q ss_pred EEEEEEEEEEEEEEEEECCCCCCCCC----CC--CCCCCEEEEEEEEEEEEC-CEEEEEEEEEEEECCCCCCCCCC---- Q ss_conf 00574446441033222145556666----67--667550479988655445-77999998867404777643231---- Q gi|254781078|r 55 SGNYVGLHISRLYETHPLADTINRKT----YN--SLLPNGLGIELGHNIQLE-DFVFGINCHTTAAKDDSTFYRLK---- 123 (242) Q Consensus 55 tG~YiG~~~G~~~~~~~~~d~~~~~~----~n--~~~~~g~G~~~GYN~Q~~-~~V~GvE~d~~~~~~~~~~~~~~---- 123 (242) .++|+.+.+++++.+.+......... .. +.....+.+.+||++.++ .|.+...+.+.+..........+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~i~P~~~l~~~~~~~~~~~E~~~~~ 158 (261) T smart00869 79 NGLYLDAQLGYGRSDNDTKRTVTLGGAGTAKGSYDGTTYGASLEAGYPFKLGGSLTLTPFAGLAYSYVRQDGFTESGGAF 158 (261) T ss_pred CCEEEEEEEEEEEEEEEEEEEEECCCCCCEEEEEECEEEEEEEEECCCCCCCCCEEEEEEEEEEEEEEECCCEEECCCCH T ss_conf 98899999899998626899980488751167770408999997330122346789998999999999759767719602 Q ss_pred ---CCCEEEEEEEEEEEEEECC---------EEEEEEEEEEEEEEECCCC--CCCCCCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf ---0100367989998633144---------0344440114575301100--0236666555543455653037987316 Q gi|254781078|r 124 ---EKYFIYGDVVLKAGYSVDS---------LLIYGMGGFGGAYVIDSSL--EKVESDNSKNAKGRFDGHGSSVVLGIGL 189 (242) Q Consensus 124 ---~k~~~~Gsvr~r~Gya~~~---------~l~Y~~gG~A~~~~~~~~~--~~~~~~~~~~~~~~~d~t~~G~tvGaGV 189 (242) .+..-.-+++.|+|..++. +.|++..++...-...... ...............+..+....+++|+ T Consensus 159 ~~~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 238 (261) T smart00869 159 ALSVDSQSLDSLSLPLGLRLGYAFALGDGSTLTPYLSLAYQHDFYDKNPVVTASLLGAGGSFTTSGTDLSRNAAELGLGL 238 (261) T ss_pred HCEECCCCCCEEEEEEEEEEEEEEECCCCCEEEEEEEEEEEEHHCCCCHHEEEEEECCCCCEEEECCCCCCCEEEEEEEE T ss_conf 11321566437999999999989985899699999999999435399935699997589852993767787489999999 Q ss_pred EEECCCCEEEEEEEEEEE Q ss_conf 673278768888988874 Q gi|254781078|r 190 DYMVNYDISLSASYRYIP 207 (242) Q Consensus 190 EYait~nwslraEY~Y~d 207 (242) ++.+.+.+++.++|.+.- T Consensus 239 ~~~~~~~~~~~~~~~~~~ 256 (261) T smart00869 239 SAKLSNGLSLSLNYDGEF 256 (261) T ss_pred EEEECCCEEEEEEEEEEE T ss_conf 999899889999999998 No 21 >pfam06629 MipA MltA-interacting protein MipA. This family consists of several bacterial MltA-interacting protein (MipA) like sequences. As well as interacting with the membrane-bound lytic transglycosylase MltA, MipA is known to bind to PBP1B, a bifunctional murein transglycosylase/transpeptidase. MipA is considered to be a structural protein mediating the assembly of MltA to PBP1B into a complex. Probab=97.49 E-value=0.0056 Score=34.91 Aligned_cols=136 Identities=18% Similarity=0.102 Sum_probs=77.6 Q ss_pred EEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEEEC-CEEEEEEEEEEEEEEECCCCCCCCC-- Q ss_conf 7998865544577999998867404777643231010036798999863314-4034444011457530110002366-- Q gi|254781078|r 90 GIELGHNIQLEDFVFGINCHTTAAKDDSTFYRLKEKYFIYGDVVLKAGYSVD-SLLIYGMGGFGGAYVIDSSLEKVES-- 166 (242) Q Consensus 90 G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~~~~~~~k~~~~Gsvr~r~Gya~~-~~l~Y~~gG~A~~~~~~~~~~~~~~-- 166 (242) .+-+...|+.+.+.+.++......+.. ..+...++...-+.++ ++.+-...|+.|+.-+....-.--+ T Consensus 86 ~~G~~~~~~~~~~~~~~~~~~dv~~~~---------~G~~~~~~~~~~~~~~~~~~l~~~~~~~~~s~~y~~yyyGV~~~ 156 (226) T pfam06629 86 VGGLKAAYDTGWGTVRAELEQDISNGS---------RGFVADIGLAATYDLGDRLSLSPSLGLSYASRKYNDYYFGVSAA 156 (226) T ss_pred EEEEEEEEECCCEEEEEEEEEECCCCC---------CCEEEEEEEEEEEECCCCEEEEEEEEEEECCHHHHHHHHCCCHH T ss_conf 217899999488899999999324673---------88999999999996399779999789998677675655377978 Q ss_pred CCCCC--CCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 66555--54345565303798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 167 DNSKN--AKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 167 ~~~~~--~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +.... +....+.....+.+++-+.|.+++||++.+.-+|..+.....|-+ + |+.+ ..-++.+||+|+| T Consensus 157 ea~~s~~~~Y~a~~g~~~~~~~~~~~y~l~~~w~~~~~~~~~~L~~~~~dSP-----i--v~~~-~~~~~~~g~~Y~F 226 (226) T pfam06629 157 ESARSGLAQYNAGAGLWSAGLGLSATYMLTDHWSALGSASYTRLSGDAADSP-----I--VERR-GQPSAGLGVAYRF 226 (226) T ss_pred HHHHCCCCCCCCCCCEEEEEEEEEEEEEECCCEEEEEEEEEEECCCCHHCCC-----C--EEEC-CCEEEEEEEEEEC T ss_conf 9612378640578770789999999999669999999999998488211499-----5--8867-8789999999989 No 22 >cd00342 gram_neg_porins Porins form aqueous channels for the diffusion of small hydrophillic molecules across the outer membrane. Individual 16-strand anti-parallel beta-barrels form a central pore, and trimerizes thru mainly hydrophobic interactions at the interface. Trimers are stabilized by hytrophillic clamping of Loop L2. Loop 3 bends into the pore, creating an elliptical constriction of about 7 x 11A, large enough to allow passage of a glucose molecule without steric hindrance. Removal of the C-terminal residue (usuallly F) destabilizes the trimer and removal of the 16th beta-sheet abolishes trimerization. Unlike typical membrane proteins, porins lack long hydrophobic stretches. Short turns are found at the smooth, periplasmic end, longer irregular loops are found at the rough, extracellular end. C-terminal residue forms salt bridge with N-terminus. Probab=97.18 E-value=0.013 Score=32.91 Aligned_cols=58 Identities=19% Similarity=0.248 Sum_probs=37.9 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 3798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 182 SVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 182 G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) -..+..|+||.|+.+..+-+||.|++.......... .........+.+.+.+||.|+| T Consensus 272 ~~~~~lG~~Y~lSKrT~lY~~~~~~~n~~~~~~~~~---~~~~~~~~~~~~~~~~Girh~F 329 (329) T cd00342 272 ANQVALGADYALSKRTDLYAEYGYQKNSGGASTGLA---GGGGPSSGNNQDGVAVGIRHKF 329 (329) T ss_pred EEEEEEEEEEEECCCHHHHHHHHHEECCCCCCEECC---CCCCCCCCCCEEEEEEEEEEEC T ss_conf 189999999971554898665054110687504325---6777789986599999819989 No 23 >pfam12099 DUF3575 Protein of unknown function (DUF3575). This family of proteins are functionally uncharacterized. This family is only found in bacteria. Proteins in this family are typically between 187 to 236 amino acids in length. Probab=96.91 E-value=0.024 Score=31.48 Aligned_cols=38 Identities=18% Similarity=0.266 Sum_probs=20.1 Q ss_pred HHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCEEE Q ss_conf 56788889874365202343320254345753211033 Q gi|254781078|r 8 ALFFVVFLSIVVPNHSLAVDLYLPRKIDLFNEADNNVE 45 (242) Q Consensus 8 ~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~ 45 (242) .+++++.+..++|.++|..-+-+--.-+.+-++...+| T Consensus 7 ~~~~~~~~~~~~~~~aQ~~alKTNll~~a~~tpNlg~E 44 (186) T pfam12099 7 LLLFLLLFCSLSPARAQVVAVKTNLLYDATTTPNLGVE 44 (186) T ss_pred EHHHHHHHHHCCCCEEEEEEEEECHHHHHHHCCCEEEE T ss_conf 07689998715532016898980455688629734899 No 24 >COG3203 OmpC Outer membrane protein (porin) [Cell envelope biogenesis, outer membrane] Probab=96.73 E-value=0.033 Score=30.67 Aligned_cols=57 Identities=11% Similarity=0.142 Sum_probs=41.7 Q ss_pred CCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 65303798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 178 GHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 178 ~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) ....-..+..|++|.|+.|.++-+||.|....+.... ..+ . .......+.|||.|+| T Consensus 298 ~~~~~~~~~~ga~Y~fsKrt~~Yv~y~~~~~~~~~~~--~~~----~--~~~~~~~v~VGlr~~F 354 (354) T COG3203 298 DKDLVKYVDVGATYYFSKNTSTYVDYGINLLDSDDNA--TVP----G--GGNTDNGVAVGLRYQF 354 (354) T ss_pred CCCCCEEEEEEEEEEEECCEEEEEEEEEEECCCCCCC--CCC----C--CCCCCCEEEEEEEEEC T ss_conf 8876059995017998047248999888741566753--045----6--6654313899879869 No 25 >smart00869 Autotransporter Autotransporter beta-domain. Secretion of protein products occurs by a number of different pathways in bacteria. One of these pathways known as the type IV pathway was first described for the IgA1 protease. The protein component that mediates secretion through the outer membrane is contained within the secreted protein itself, hence the proteins secreted in this way are called autotransporters. This family corresponds to the presumed integral membrane beta-barrel domain that transports the protein. This domain is found at the C-terminus of the proteins it occurs in. The N-terminus contains the variable passenger domain that is translocated across the membrane. Once the passenger domain is exported it is cleaved auto-catalytically in some proteins, in others a different peptidase is used and in some cases no cleavage occurs. Probab=96.51 E-value=0.047 Score=29.82 Aligned_cols=152 Identities=19% Similarity=0.173 Sum_probs=66.4 Q ss_pred CEEEEEEEEEEEEC-CEEEEEEEEEEEECCCCCC-------CCCCCC-CEEEEEEEEEEEEEE--C-CEEEEEEEEEEEE Q ss_conf 50479988655445-7799999886740477764-------323101-003679899986331--4-4034444011457 Q gi|254781078|r 87 NGLGIELGHNIQLE-DFVFGINCHTTAAKDDSTF-------YRLKEK-YFIYGDVVLKAGYSV--D-SLLIYGMGGFGGA 154 (242) Q Consensus 87 ~g~G~~~GYN~Q~~-~~V~GvE~d~~~~~~~~~~-------~~~~~k-~~~~Gsvr~r~Gya~--~-~~l~Y~~gG~A~~ 154 (242) ..+|+++.+ ++. +|-+-..+.+.+.+.+... ...+.+ ..+.-...+++||.+ + ++.+=-.+++.|. T Consensus 67 ~~~g~y~~~--~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~i~P~~~l~~~ 144 (261) T smart00869 67 YSLGLYAGY--SLGNGLYLDAQLGYGRSDNDTKRTVTLGGAGTAKGSYDGTTYGASLEAGYPFKLGGSLTLTPFAGLAYS 144 (261) T ss_pred EEEEEEEEE--ECCCCEEEEEEEEEEEEEEEEEEEEECCCCCCEEEEEECEEEEEEEEECCCCCCCCCEEEEEEEEEEEE T ss_conf 899999999--969988999998999986268999804887511677704089999973301223467899989999999 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCC------CEEEEEEEEEEECCCCEE---EC--CCCCCCCC Q ss_conf 53011000236666555543455653037987316673278------768888988874487300---03--66666644 Q gi|254781078|r 155 YVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNY------DISLSASYRYIPHHIHSV---NN--SNAKSDVE 223 (242) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~------nwslraEY~Y~d~g~~~~---~~--~~~~~~~~ 223 (242) +++.+........... .....+.+..-..+|+-+++.+.. .+.+++.|.+--.++... .. ........ T Consensus 145 ~~~~~~~~E~~~~~~~-~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (261) T smart00869 145 YVRQDGFTESGGAFAL-SVDSQSLDSLSLPLGLRLGYAFALGDGSTLTPYLSLAYQHDFYDKNPVVTASLLGAGGSFTTS 223 (261) T ss_pred EEECCCEEECCCCHHC-EECCCCCCEEEEEEEEEEEEEEECCCCCEEEEEEEEEEEEHHCCCCHHEEEEEECCCCCEEEE T ss_conf 9975976771960211-321566437999999999989985899699999999999435399935699997589852993 Q ss_pred CCCCCCCEEEEEEEEEEE Q ss_conf 220123413899988620 Q gi|254781078|r 224 RVDRKGNAHIASLGINMH 241 (242) Q Consensus 224 ~~~~~~~~h~v~~GinY~ 241 (242) ..+.+.+...+.+|+.++ T Consensus 224 ~~~~~~~~~~~~~g~~~~ 241 (261) T smart00869 224 GTDLSRNAAELGLGLSAK 241 (261) T ss_pred CCCCCCCEEEEEEEEEEE T ss_conf 767787489999999999 No 26 >pfam10082 DUF2320 Uncharacterized protein conserved in bacteria (DUF2320). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=96.08 E-value=0.084 Score=28.47 Aligned_cols=50 Identities=26% Similarity=0.393 Sum_probs=38.5 Q ss_pred EEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 03798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 181 SSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 181 ~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) .-+.+++|+.|.++++|.+.+.|+|.+... + . -..+++.++|.+||..++ T Consensus 331 ~~~~~~~~l~Y~~~r~l~l~~~y~~~~~~S------~----~--~~~~y~~n~v~l~l~~~~ 380 (380) T pfam10082 331 ETYSAEAGLTYKLNRWLDLTAGVRYEDLDS------N----I--PGRSYDANVVGLGLTLQR 380 (380) T ss_pred CEEEEEEEEEEEEECCEEEEEEEEEEEEEC------C----C--CCCCEEEEEEEEEEEEEC T ss_conf 339999999998417799999999997037------8----8--878625059999999979 No 27 >pfam03797 Autotransporter Autotransporter beta-domain. Secretion of protein products occurs by a number of different pathways in bacteria. One of these pathways known as the type V pathway was first described for the IgA1 protease. The protein component that mediates secretion through the outer membrane is contained within the secreted protein itself, hence the proteins secreted in this way are called autotransporters. This family corresponds to the presumed integral membrane beta-barrel domain that transports the protein. This domain is found at the C terminus of the proteins it occurs in. The N terminus contains the variable passenger domain that is translocated across the membrane. Once the passenger domain is exported it is cleaved auto-catalytically in some proteins, in others a different protease is used and in some cases no cleavage occurs. Probab=95.78 E-value=0.11 Score=27.72 Aligned_cols=181 Identities=14% Similarity=0.072 Sum_probs=81.8 Q ss_pred EEEEEEEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCC-----CCCCCCC-CEEEE Q ss_conf 57444644103322214555666667667550479988655445779999988674047776-----4323101-00367 Q gi|254781078|r 57 NYVGLHISRLYETHPLADTINRKTYNSLLPNGLGIELGHNIQLEDFVFGINCHTTAAKDDST-----FYRLKEK-YFIYG 130 (242) Q Consensus 57 ~YiG~~~G~~~~~~~~~d~~~~~~~n~~~~~g~G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~-----~~~~~~k-~~~~G 130 (242) .-+|+.+++..++.+..+.. ...+.....+++++.|+.+ ++|-+-....+.+.+.+.. ....+.+ ..+.- T Consensus 46 ~~~G~~~~~~~~~~~~~~~~---~~~~~~~~~~g~y~~~~~~-~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~ 121 (265) T pfam03797 46 LILGLAFGYSFSKSKSDDGG---GKGKSDSYGAGLYAQWNFD-GGLYLDGVLAYGRFDNDVKRLGTSTGTAKGDYDSHGL 121 (265) T ss_pred EEEEEEEEEEEEECCCCCCC---CCEEEEEEEEEEEEEEECC-CCCEEEEEEEEEEEECEEEEECCCCCCCEEEEEEEEE T ss_conf 59999999985222117888---7677670799999999848-9836976788899715048730678730222802499 Q ss_pred EEEEEEEEEEC---CEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCC---CEEEEEEEE Q ss_conf 98999863314---403444401145753011000236666555543455653037987316673278---768888988 Q gi|254781078|r 131 DVVLKAGYSVD---SLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNY---DISLSASYR 204 (242) Q Consensus 131 svr~r~Gya~~---~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~---nwslraEY~ 204 (242) ++.+++||.+. ++.+--..++.|.+++.+...........-.....+....-..+|.-+|+.+.. ++.+++.+. T Consensus 122 ~~~~~~gy~~~~~~~~~i~P~~~l~y~~~~~~~f~E~g~~~~~l~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~ 201 (265) T pfam03797 122 GASLEAGYRFKLSGNLTLTPFAGLQYVYLRQDGFTESGARSFALSVDSQSLTNLSGPLGLRLEYRFSLRSLKPYATLGWA 201 (265) T ss_pred EEEEEEEECCCCCCCEEEEEEEEEEEEEEECCCEEEECCCCCCEEEECCCCCEEEEEEEEEEEEEECCCEEEEEEEEEEE T ss_conf 99999876233269759969999999998768766709984351784466658999999999998528736489999999 Q ss_pred EE--ECCCCEEEC---CC--CCCCCCCCCCCCCEEEEEEEEEEE Q ss_conf 87--448730003---66--666644220123413899988620 Q gi|254781078|r 205 YI--PHHIHSVNN---SN--AKSDVERVDRKGNAHIASLGINMH 241 (242) Q Consensus 205 Y~--d~g~~~~~~---~~--~~~~~~~~~~~~~~h~v~~GinY~ 241 (242) |. +........ .. ....+...+.+.+.-.+.+|+++. T Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~ 245 (265) T pfam03797 202 YEFDDYRQVPTNGATLLASGGSWVTSGTPLARNAAELGAGVELA 245 (265) T ss_pred EECCCCCCCEEEEEEEECCCCCEEECCCCCCCEEEEEEEEEEEE T ss_conf 99778875503999972689842996756665499999889999 No 28 >PRK09752 adhesin; Provisional Probab=95.75 E-value=0.12 Score=27.65 Aligned_cols=145 Identities=8% Similarity=-0.056 Sum_probs=75.4 Q ss_pred EEEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEEECC-EEEEEEEEEEEECCCCCC----CCCCCCCEEE Q ss_conf 00574446441033222145556666676675504799886554457-799999886740477764----3231010036 Q gi|254781078|r 55 SGNYVGLHISRLYETHPLADTINRKTYNSLLPNGLGIELGHNIQLED-FVFGINCHTTAAKDDSTF----YRLKEKYFIY 129 (242) Q Consensus 55 tG~YiG~~~G~~~~~~~~~d~~~~~~~n~~~~~g~G~~~GYN~Q~~~-~V~GvE~d~~~~~~~~~~----~~~~~k~~~~ 129 (242) +|+||-.-+=|+|-..++...-.....-+..+..+-+..||-|-.++ |.+=-.+.+.|...+... +.-+.+.+-- T Consensus 1077 ~GAYVDSWaQYSWFNNsV~GqdLAsEsYDSkG~TASLESGYtfklGe~~~IQPQAQVIWMGVKADsHTEaNGTrVsgeGd 1156 (1250) T PRK09752 1077 QGAWLDSWLQYAWFNNDVSEQDDGVDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANHARVSQSQG 1156 (1250) T ss_pred CCEEEEEEEEEEEECCEECCCCCCHHEECCCCEEEEEECCCEEECCCCEEEEEEEEEEEEECCCCCCCCCCCCEEECCCC T ss_conf 85368554675556461026865200013511799985587796898389802159999731455331157878864689 Q ss_pred EEEEEEEEEEE-----CCEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEE Q ss_conf 79899986331-----4403444401145753011000236666555543455653037987316673278768888988 Q gi|254781078|r 130 GDVVLKAGYSV-----DSLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYR 204 (242) Q Consensus 130 Gsvr~r~Gya~-----~~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~ 204 (242) ++++.|+|... ..+.||+-.- |-+.........+ ...-+.+..+.=--+-.|||=.++++|++..--. T Consensus 1157 dNIQTRLGvRse~kk~~~fqPFVEaN--WIHNTkdfgvsMD-----GvtVkQDGarNiGELKvGVEGQINk~lnvWGNVa 1229 (1250) T PRK09752 1157 DDIQTRLGLHSEWRTAVHVIPTLDLN--YYHDPHSTEIEED-----GSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVA 1229 (1250) T ss_pred CCEEEEEEEEECCCCCCCCCCEEEEE--EEECCCCCEEEEC-----CEEEECCCCCCCCEEECCEEEEECCCEEEEEEEE T ss_conf 96676335886146677645016877--6653886237717-----9065013331301036116999767467898766 Q ss_pred EE Q ss_conf 87 Q gi|254781078|r 205 YI 206 (242) Q Consensus 205 Y~ 206 (242) +- T Consensus 1230 qQ 1231 (1250) T PRK09752 1230 WQ 1231 (1250) T ss_pred EE T ss_conf 88 No 29 >pfam00267 Porin_1 Gram-negative porin. Probab=95.66 E-value=0.13 Score=27.47 Aligned_cols=49 Identities=14% Similarity=0.226 Sum_probs=34.4 Q ss_pred EEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 98731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 184 VLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 184 tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) -+-+|+||.|+.+.++-+||.+....+.... ..+. .+...+.|||.|+| T Consensus 286 q~~lg~dY~fSKrt~~Y~~Y~~~~~d~~~~~---------~~~~-~~dd~~~vGlRY~F 334 (334) T pfam00267 286 YVDVGATYYFNKNMSAYVDYKINLLDKGDDF---------AAGI-NTDDAVAVGLVYQF 334 (334) T ss_pred EEEEEEEEEECCCCEEEEEEEEECCCCCCCC---------CCCC-CCCCEEEEEEEEEC T ss_conf 9999989996588049999987625666544---------4677-76888999808819 No 30 >pfam11854 DUF3374 Protein of unknown function (DUF3374). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 665 to 712 amino acids in length. Probab=95.60 E-value=0.13 Score=27.34 Aligned_cols=18 Identities=17% Similarity=0.254 Sum_probs=8.0 Q ss_pred HHHHHHHHHCC-CCEEEEE Q ss_conf 78888987436-5202343 Q gi|254781078|r 10 FFVVFLSIVVP-NHSLAVD 27 (242) Q Consensus 10 ~~~~~~~~~~~-~~~~a~d 27 (242) |++++++|++- ..+.|+| T Consensus 5 l~lialAll~~~~~a~A~~ 23 (675) T pfam11854 5 LNLIALALLAAAFSANAAD 23 (675) T ss_pred HHHHHHHHHHHCCHHHHCC T ss_conf 2299999986243333156 No 31 >pfam05736 OprF OprF membrane domain. This domain represents the presumed membrane spanning region of the OprF proteins. This region is involved in channel formation and is thought to form an 8-stranded beta-barrel. Probab=95.53 E-value=0.027 Score=31.13 Aligned_cols=99 Identities=8% Similarity=-0.028 Sum_probs=47.9 Q ss_pred HHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCEEEECCCCCCEEEEEEEEEEEEEEEEE--------EECCCC--CC Q ss_conf 56788889874365202343320254345753211033037665300005744464410332--------221455--56 Q gi|254781078|r 8 ALFFVVFLSIVVPNHSLAVDLYLPRKIDLFNEADNNVEYQDDEYGIWSGNYVGLHISRLYET--------HPLADT--IN 77 (242) Q Consensus 8 ~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~WtG~YiG~~~G~~~~~--------~~~~d~--~~ 77 (242) +..-.++++.+.|..||-.--+--+|--+++.-|-..+.- .|.=.|+.+||-+.. .+..+. .. T Consensus 9 ~aig~lvaa~s~~alAqgqgave~e~f~g~~~fDs~r~~~-------~g~~~G~~iGY~lTddvsl~l~y~~~~~~r~~~ 81 (184) T pfam05736 9 FAIGSLVAATSFGALAQGQGAVEIEAFYKKQYNDSVRNMK-------NGDLPGGSIGYFLTDDVELNLSYDETHDTRSTD 81 (184) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEECCCEEECCCCCCCC-------CCCCCEEEEEEECCCCEEEEEEECCCCCHHCCC T ss_conf 7888888873016776546747984226647526313532-------489640365257178838998873421000135 Q ss_pred CCCCCCCCCCEEEEEEEEEE-EE----CCEEEEEEEEEEEE Q ss_conf 66667667550479988655-44----57799999886740 Q gi|254781078|r 78 RKTYNSLLPNGLGIELGHNI-QL----EDFVFGINCHTTAA 113 (242) Q Consensus 78 ~~~~n~~~~~g~G~~~GYN~-Q~----~~~V~GvE~d~~~~ 113 (242) .....++.+.-+.+-+=|++ +- -|+|-+.-++.+.. T Consensus 82 ~~g~~~i~g~~~rLDalYHF~~pg~~lrPYvs~G~gh~s~~ 122 (184) T pfam05736 82 GTGNQKVGGDLTSLDAQYHFGTPGDGLRPYVSAGFAHQSIT 122 (184) T ss_pred CCCCCCCCCCEEEEEEEEECCCCCCCCCEEEEECCCCCEEE T ss_conf 76661035634667779973898766337277314652000 No 32 >COG5338 Uncharacterized protein conserved in bacteria [Function unknown] Probab=95.32 E-value=0.056 Score=29.43 Aligned_cols=78 Identities=10% Similarity=0.207 Sum_probs=41.6 Q ss_pred CCCCCCCCCEEEEEEEEEEEEEECCEE-EEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCC Q ss_conf 643231010036798999863314403-4444011457530110002366665555434556530379873166732787 Q gi|254781078|r 118 TFYRLKEKYFIYGDVVLKAGYSVDSLL-IYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYD 196 (242) Q Consensus 118 ~~~~~~~k~~~~Gsvr~r~Gya~~~~l-~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~n 196 (242) +....+.+-.-..++++|+||.+.|.| ||+..-+. +...-..++. ..+.....+|.+++|+|+.+.+. T Consensus 253 ttvsqkDRD~n~~s~~~R~gYelSPal~pf~e~~vg----k~~YD~ridt-------~G~eRSs~~Y~~r~G~~~D~gEk 321 (468) T COG5338 253 TTVSQKDRDRNLASLTGRIGYELSPALIPFAEAEVG----KRDYDTRIDT-------QGYERSSDGYGARVGAEFDLGEK 321 (468) T ss_pred CCCCHHCCCCCCCEEEEECCEEECCCCCEEEEEECC----CCHHHHHCCC-------CCCCCCCCCCCCCCCEEECHHHH T ss_conf 723320135440002321112336542202676237----3102110244-------34223444200013246600042 Q ss_pred EEEEEEEEEE Q ss_conf 6888898887 Q gi|254781078|r 197 ISLSASYRYI 206 (242) Q Consensus 197 wslraEY~Y~ 206 (242) +.--+.--|+ T Consensus 322 ~~GE~aiGYm 331 (468) T COG5338 322 LRGEVAIGYM 331 (468) T ss_pred HCCCCHHHHH T ss_conf 1650000011 No 33 >TIGR01414 autotrans_barl outer membrane autotransporter barrel domain; InterPro: IPR006315 The secretion of protein products occurs by a number of different pathways in bacteria and several secretion mechanisms have been described for Gram-negative bacteria , an increasing number employ a highly efficient but simple mechanism first described for the immunoglobulin A1 (IgA1) proteases , . The autotransporter secretion pathway is a distinct secretion mechanism, in which the protein moiety mediating export through the outer membrane is contained within the precursor of the secreted protein itself. Autotransporters have been implicated as important or putative virulence factors such as mediating adhesion to host cells or by mediating actin-promoted bacterial mobility . The key feature of an autotransporter is that it contains all the information for secretion in the precursor of the secreted protein itself . Autotransporters comprise three functional domains: 1) an N-terminal targeting domain (amino-terminal leader sequence) that functions as a signal peptide to mediate targeting to and translocation across the inner membrane 2) a C-terminal translocation domain (carboxy-terminal) that forms a beta-barrel pore to allow the secretion of 3) the passenger domain, the secreted mature protein . This entry shows the C-terminal autotransporter domain, it is about 400 amino acids in length and includes the aromatic amino acid-rich OMP signal, typically ending with a Phe or Trp residue, at the extreme C-terminus.; GO: 0019867 outer membrane. Probab=95.11 E-value=0.2 Score=26.43 Aligned_cols=167 Identities=16% Similarity=0.128 Sum_probs=101.6 Q ss_pred EEEEEEEEEEEEEEEEECCCCCCC----CCCCCCCCCEEEEEEEEEEEECCEEEEEEEEEEEECCCCCC--------CCC Q ss_conf 005744464410332221455566----66676675504799886554457799999886740477764--------323 Q gi|254781078|r 55 SGNYVGLHISRLYETHPLADTINR----KTYNSLLPNGLGIELGHNIQLEDFVFGINCHTTAAKDDSTF--------YRL 122 (242) Q Consensus 55 tG~YiG~~~G~~~~~~~~~d~~~~----~~~n~~~~~g~G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~~--------~~~ 122 (242) +|+|+-..+-+.|....+...... .......+.+..+.+||.|.+. |.+--.+.+.+....... ... T Consensus 265 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~g~~~~~~-~~~~P~~~~~~~~~~~~~~~~~~g~~~~~ 343 (454) T TIGR01414 265 TGAYVDGWLKYSWFNNDLSSLGSGGKSVSGKYNSNGLGASLEAGYRYNLG-WYLEPQAQLSYLGVSGDDYKESNGTRVLG 343 (454) T ss_pred CCEEEEHEEEEEHHCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCEEEEE-EEEECCEEEEEEEECCCCEECCCCEEEEE T ss_conf 31121000000000020100135542013444333114543114011004-67500036888740564201046536760 Q ss_pred CC--CCEEEEEEEEEEE---EEE-----CCEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEE Q ss_conf 10--1003679899986---331-----4403444401145753011000236666555543455653037987316673 Q gi|254781078|r 123 KE--KYFIYGDVVLKAG---YSV-----DSLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYM 192 (242) Q Consensus 123 ~~--k~~~~Gsvr~r~G---ya~-----~~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYa 192 (242) .. ...+.+.+..+++ +.+ ..+-||....+....... ........ .......+..+..+.++.|++.. T Consensus 344 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~~~~~~~~~~g~~~~ 420 (454) T TIGR01414 344 GGDSGDSLQGRLGLRLGYKASQFDLGTGRALKPYLKANWLHEFGGL-GGNGVLVN--GGVTLRNDGSGTRGELGLGVNAS 420 (454) T ss_pred CCCCCCCEEEECCEEECCCHHHHHCCCCCEEEEEEEEEEEEECCCC-CCCEEEEC--CCCCCCCCCCCCEEEEECCEEEE T ss_conf 5665320000001000010000000367322146777653101477-75315751--65200244444224431240244 Q ss_pred CC-CCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 27-876888898887448730003666666442201234138999886209 Q gi|254781078|r 193 VN-YDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 193 it-~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +. +++++..++.|...+ .........+|+.|.| T Consensus 421 ~~~~~~~~~~~~~~~~g~-----------------~~~~~~~~~~g~~~~~ 454 (454) T TIGR01414 421 LKSPNLSLYLDVDYQLGG-----------------KGETPWSGNLGVRYSF 454 (454) T ss_pred ECCCCEEEEEEEECCCCC-----------------CCCCCEEEEEEEEEEC T ss_conf 336421576531001366-----------------5333203555357739 No 34 >pfam05420 BCSC_C Cellulose synthase operon protein C C-terminus (BCSC_C). This family contains the C-terminal regions of several bacterial cellulose synthase operon C (BCSC) proteins. BCSC is involved in cellulose synthesis although the exact function of this protein is unknown. Probab=94.85 E-value=0.23 Score=26.02 Aligned_cols=51 Identities=16% Similarity=0.095 Sum_probs=37.9 Q ss_pred CCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 345565303798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 174 GRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 174 ~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) ...+++-.||++.|.+||.++++|.+.++-.+-.. .+++.....+-|.|-| T Consensus 286 ~g~S~~G~~y~l~a~~Eyrls~~~~lG~~~~~~~a------------------~dY~p~~~~lYlRY~f 336 (339) T pfam05420 286 PGDSGSGLGYSLRAAVEYRLSPRWFLGGGLDIQRA------------------GDYAPSSALLYLRYAF 336 (339) T ss_pred CCCCCCCEEEEEEEEEEEEECCCEEEEEEEECCCC------------------CCCCCCCEEEEEEEEC T ss_conf 68888855578889999994698899877800215------------------7888340899999960 No 35 >pfam11854 DUF3374 Protein of unknown function (DUF3374). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 665 to 712 amino acids in length. Probab=94.75 E-value=0.25 Score=25.88 Aligned_cols=58 Identities=21% Similarity=0.242 Sum_probs=26.2 Q ss_pred EEEEEEEECCCCEEEEEEEEEEECCCCEEE------CCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 873166732787688889888744873000------3666666442201234138999886209 Q gi|254781078|r 185 LGIGLDYMVNYDISLSASYRYIPHHIHSVN------NSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 185 vGaGVEYait~nwslraEY~Y~d~g~~~~~------~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +=+=.+|+++++.++|+.|+|-.|.....- ..........+..+++.|.+++=++|+| T Consensus 612 l~~~~~Y~lse~~~lrl~y~yEry~d~D~an~~~~d~i~~~~t~G~~n~dYnAh~v~ls~sY~l 675 (675) T pfam11854 612 LKLYADYQLSESMSLRLDYRYERYSDTDYANDVTPDAIPNLLTLGDLNHNYNAHQVMLSFSYKL 675 (675) T ss_pred EEEEEEEEECCCCEEEEEEEEEEECCCCHHHCCCCCCCCCEEECCCCCCCCHHHEEEEEEEECC T ss_conf 8898645516661567888999740454332357777644244044557713411336788509 No 36 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=94.75 E-value=0.25 Score=25.87 Aligned_cols=32 Identities=6% Similarity=-0.044 Sum_probs=26.5 Q ss_pred CCCCCCEEEEEEEEEEEECCCCEEEEEEEEEE Q ss_conf 45565303798731667327876888898887 Q gi|254781078|r 175 RFDGHGSSVVLGIGLDYMVNYDISLSASYRYI 206 (242) Q Consensus 175 ~~d~t~~G~tvGaGVEYait~nwslraEY~Y~ 206 (242) .-+++-.||++.+.+||.|+++|.+++.-.|- T Consensus 1089 g~s~~G~~y~l~a~~eyr~~~~~~~G~~~~~~ 1120 (1158) T PRK11447 1089 GSSSQGFGYTARALIERRVTSNWFVGTAVDIQ 1120 (1158) T ss_pred CCCCCCCCEEEEEEEEEEECCCEEEECEEECC T ss_conf 88888632367788999715867863105015 No 37 >pfam11383 DUF3187 Protein of unknown function (DUF3187). This family of proteins with unknown function appear to be restricted to Proteobacteria. Probab=94.44 E-value=0.12 Score=27.71 Aligned_cols=87 Identities=17% Similarity=0.183 Sum_probs=54.2 Q ss_pred EEEEEEEEECCEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCC Q ss_conf 89998633144034444011457530110002366665555434556530379873166732787688889888744873 Q gi|254781078|r 132 VVLKAGYSVDSLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIH 211 (242) Q Consensus 132 vr~r~Gya~~~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~ 211 (242) +-..-+|..++--+|.+.|+.+-+-+ .+......-...|+.++|.||++++|-.+-+||.+..--.+ T Consensus 157 lQlNysy~~~~h~~~s~lG~t~r~~~-------------~~~~~l~~k~~t~s~~~gY~Y~l~~~H~ll~ey~~yqG~~~ 223 (273) T pfam11383 157 LQLNYSYQKDRHHFYSTLGVVFRNSD-------------EALNDLPYKKNTLSWAIGYEYRLTERHHLLIEYHWYQGAVE 223 (273) T ss_pred HEEEEEEECCCCEEEEEEEEEEECCC-------------CCCCCCCCCCCEEEEEEEEEEEECCCHHHHHHHHHHCCCCC T ss_conf 11300554377528889889983587-------------42366553441577772126752376146543343336426 Q ss_pred EEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 0003666666442201234138999886209 Q gi|254781078|r 212 SVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 212 ~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) ..-+.+...|-+.+|-.|++ T Consensus 224 -----------~~~~fse~SnE~~LGYRY~~ 243 (273) T pfam11383 224 -----------GDTDFSEASNEAVLGYRYYM 243 (273) T ss_pred -----------CCCHHCCCCCEEEEEEEEEE T ss_conf -----------86100112413578665773 No 38 >PRK12580 outer membrane protease; Reviewed Probab=94.04 E-value=0.35 Score=25.04 Aligned_cols=61 Identities=13% Similarity=0.178 Sum_probs=40.1 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEECCCCE-----EECCCCCC---CCCCCCCCCCEEEEEEEEEEEC Q ss_conf 3798731667327876888898887448730-----00366666---6442201234138999886209 Q gi|254781078|r 182 SVVLGIGLDYMVNYDISLSASYRYIPHHIHS-----VNNSNAKS---DVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 182 G~tvGaGVEYait~nwslraEY~Y~d~g~~~-----~~~~~~~~---~~~~~~~~~~~h~v~~GinY~F 242 (242) =|-+-+-+-|.+|||+.+.+|..|-.+-... +|...+.. .-...-.+....++++||-|+| T Consensus 244 yys~~~~agY~vtp~~klf~e~~~~ky~~~kgdT~i~d~~tG~s~~~~~d~AGi~n~n~tvs~GLkY~F 312 (312) T PRK12580 244 YYGTVINAGYYVTPNAKVFAEFTYSKYDEGKGGTQTIDKNSGDSVSIGGDAAGISNKNYTVTAGLQYRF 312 (312) T ss_pred EEEEEEECCEEECCCEEEEEEEEEEEEECCCCCCEEEECCCCCEEEECCCCCCCCCCCEEEEEEEEEEC T ss_conf 189998402576798569999999877325686288866888365405774000243249999878869 No 39 >COG3468 AidA Type V secretory pathway, adhesin AidA [Cell envelope biogenesis, outer membrane / Intracellular trafficking and secretion] Probab=93.70 E-value=0.23 Score=26.05 Aligned_cols=163 Identities=14% Similarity=0.073 Sum_probs=97.9 Q ss_pred CCCCCEEEECCCCCCEE-------EEEEEEEEEEEEEEEEECCCCCC-CCCCCCCCCCEEEEEEEEEEEEC--CEEEEEE Q ss_conf 53211033037665300-------00574446441033222145556-66667667550479988655445--7799999 Q gi|254781078|r 38 NEADNNVEYQDDEYGIW-------SGNYVGLHISRLYETHPLADTIN-RKTYNSLLPNGLGIELGHNIQLE--DFVFGIN 107 (242) Q Consensus 38 ~~~~~~~~~~~~~~~~W-------tG~YiG~~~G~~~~~~~~~d~~~-~~~~n~~~~~g~G~~~GYN~Q~~--~~V~GvE 107 (242) ...+....|..-.|..| +|+|+.....+.|..+.+..... ..+.....+...-+..||+++++ .|++-.. T Consensus 380 ~~~~~~~~y~~g~~~tw~~~~~~~~g~y~d~~~q~~~f~n~~~~~~~~~~~~~~~~g~~~slE~g~~~~l~~~~~~~~pq 459 (592) T COG3468 380 LGADKTTGYSLGAYATWYASSAKATGAYLDGWAQYSWFDNGVNSLDNTGADGYLSSGYTASLEGGYRFLLGEASLTLQPQ 459 (592) T ss_pred CCCCCCCEECCCCEEEEEECCCCCCCEEEEEEHHHEECCCCCCCCCCCCCCCCCCCCEEEEEECCCEEECCCCCEEECCE T ss_conf 44453132435526897504775533366000122131465324455555434553223101035113413350886770 Q ss_pred EEEEEECCCCC----CCCCCCCCEEEEEEEEEEEEEECCE-------------EEEEEEEEEEEEEECCCCCCCCCCCCC Q ss_conf 88674047776----4323101003679899986331440-------------344440114575301100023666655 Q gi|254781078|r 108 CHTTAAKDDST----FYRLKEKYFIYGDVVLKAGYSVDSL-------------LIYGMGGFGGAYVIDSSLEKVESDNSK 170 (242) Q Consensus 108 ~d~~~~~~~~~----~~~~~~k~~~~Gsvr~r~Gya~~~~-------------l~Y~~gG~A~~~~~~~~~~~~~~~~~~ 170 (242) +++.+...... ....+...+-..++++|+|...+.. .||+..-.-. +.+.... + ... T Consensus 460 aq~~~~~v~~~~~~~~~~s~v~~~~~~~~~~rlg~~~~~~~~~~~~~~~~~~~~~~~~~n~~~-~~~~~~~--v---~~~ 533 (592) T COG3468 460 AQLTYQGVTVDDFTDSNGSEVSGDNGDSLQGRLGVALGYVTTVQGLDGLNAEEGPFVLQNWLH-EFGGDTE--V---NVD 533 (592) T ss_pred EEEEEEECCCCCCEECCCCEEECCCCCCHHHCCCCCCCCEEECCCCCCCCCCCCCEEEEEHHH-HCCCCCE--E---EEC T ss_conf 799997636542220268676157785321112100341011033333344445157530133-3267737--9---965 Q ss_pred CCCCCCCCCCEEEEEEEE-EEEECCCCEEEEEEEEEE Q ss_conf 554345565303798731-667327876888898887 Q gi|254781078|r 171 NAKGRFDGHGSSVVLGIG-LDYMVNYDISLSASYRYI 206 (242) Q Consensus 171 ~~~~~~d~t~~G~tvGaG-VEYait~nwslraEY~Y~ 206 (242) ......+..+.-+.++.| ++-.++.+.++.++-+|. T Consensus 534 ~~~~~~~~~~~~~e~~~g~~~a~~~~~~~~~g~~~~~ 570 (592) T COG3468 534 GVAEGNDGDRNTAEGGVGGVAAKLNKQTSLGGAAAYQ 570 (592) T ss_pred CCEEECCCCEEEEECCCCCEEEEECCCCCCHHHHHHH T ss_conf 7312214410244204554000112130402332233 No 40 >pfam06082 DUF940 Bacterial putative lipoprotein (DUF940). This family consists of hypothetical bacterial proteins several of which are described as putative lipoproteins. Probab=93.43 E-value=0.29 Score=25.51 Aligned_cols=21 Identities=29% Similarity=0.321 Sum_probs=9.7 Q ss_pred EEEEEEEECCCCEEEEEEEEE Q ss_conf 873166732787688889888 Q gi|254781078|r 185 LGIGLDYMVNYDISLSASYRY 205 (242) Q Consensus 185 vGaGVEYait~nwslraEY~Y 205 (242) +-+..||.+++||++...++| T Consensus 420 l~~~~ey~f~~nl~lsg~l~~ 440 (661) T pfam06082 420 LNANASYRLWPGLELSGSLRV 440 (661) T ss_pred EEEEEEEEECCCCEEEEEEEE T ss_conf 755589998699099999965 No 41 >pfam01278 Omptin Omptin family. The omptin family is a family of serine proteases. Probab=93.34 E-value=0.49 Score=24.28 Aligned_cols=60 Identities=10% Similarity=0.153 Sum_probs=40.8 Q ss_pred EEEEEEEEEEEECCCCEEEEEEEEEEECCCC-----EEECCCCC-----CCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 0379873166732787688889888744873-----00036666-----66442201234138999886209 Q gi|254781078|r 181 SSVVLGIGLDYMVNYDISLSASYRYIPHHIH-----SVNNSNAK-----SDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 181 ~G~tvGaGVEYait~nwslraEY~Y~d~g~~-----~~~~~~~~-----~~~~~~~~~~~~h~v~~GinY~F 242 (242) -=|-+-+-+-|.+|||+.+.+|..|-.+-.. -+|...+. .+.. -.+....++++||-|+| T Consensus 226 ~yys~~~~agY~vtp~~klf~e~~~~ky~~~kg~T~i~d~~tG~~~~~~~d~A--Gi~n~~~tvs~GLkY~F 295 (295) T pfam01278 226 NYYSVAVNAGYYVTPNAKVFVEGSYSKVTNKKGDTSLYDIASGNTSGGEGDAA--GIENYNFTVTAGLKYRF 295 (295) T ss_pred CEEEEEEECCEEECCCEEEEEEEEEEEEECCCCCCEEEECCCCCEEECCCCCC--CCCCCCEEEEEEEEEEC T ss_conf 41899984125767985699999998773256874788736886563267753--31322148998778869 No 42 >pfam01617 Surface_Ag_2 Surface antigen. This family includes a number of bacterial surface antigens expressed on the surface of pathogens. Probab=92.75 E-value=0.6 Score=23.78 Aligned_cols=110 Identities=18% Similarity=0.155 Sum_probs=69.2 Q ss_pred CEEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCC--------------------------CCCCCEEEEEEEEEEEEEE Q ss_conf 50479988655445779999988674047776432--------------------------3101003679899986331 Q gi|254781078|r 87 NGLGIELGHNIQLEDFVFGINCHTTAAKDDSTFYR--------------------------LKEKYFIYGDVVLKAGYSV 140 (242) Q Consensus 87 ~g~G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~~~~--------------------------~~~k~~~~Gsvr~r~Gya~ 140 (242) .++++-+||- +++..+=+|+-+.--+.+..... ++.+.=-..++-..+-|.+ T Consensus 101 ~gfsGaiGYs--~~~~RiE~E~sye~Fdvk~~~~~~~~da~~y~al~r~~~~~~~~~~y~v~kN~gis~~S~m~N~CYD~ 178 (279) T pfam01617 101 TGFGGAIGYS--MDGPRVELEVSYERFDVKNQGNNYKNDAHKYAALTRDAAIAAADSKYVVLKNDGLTDISLMLNACYDI 178 (279) T ss_pred CEEEEEEEEC--CCCEEEEEEEEEEEECCCCCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEECCEEEEEEEEEEEEEC T ss_conf 2068999681--49988999987658335899876146615899986002666667746999967788665799999962 Q ss_pred --C--CEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEE-CCCCEEE Q ss_conf --4--403444401145753011000236666555543455653037987316673278768888988874-4873000 Q gi|254781078|r 141 --D--SLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIP-HHIHSVN 214 (242) Q Consensus 141 --~--~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d-~g~~~~~ 214 (242) . ++.||+-+|+..--++ ..+....+..|-.=+|+.|.++|++++-++-.|.. .|++ +. T Consensus 179 ~~~~~~i~PY~C~GiG~d~i~---------------~~~~~~~K~aYq~KlGvsY~ls~~i~lF~~gyYHkV~gn~-F~ 241 (279) T pfam01617 179 AHEGIPVSPYVCAGVGADLIS---------------VFGAINPKFSYQGKAGLSYPITPEISLFAGGHYHKVIGNE-FD 241 (279) T ss_pred CCCCCCCCCEEEEECCHHHHH---------------HCCCCCEEEEEEEEEEEEEECCCCEEEEEEEEEEEECCCC-CC T ss_conf 238862044787743774632---------------3267533589999523788738885999988998731775-57 No 43 >pfam07585 DUF1551 Protein of unknown function (DUF1551). A family of proteins identified in Rhodopirellula baltica. Probab=92.17 E-value=0.72 Score=23.36 Aligned_cols=122 Identities=16% Similarity=0.216 Sum_probs=64.1 Q ss_pred CCCCCCEEEEE-EEEEEEEEECCEEEEEEEEEEEEEEECCCCCCCC----C-CCC-------CCCCCCCCCCCEEEE--E Q ss_conf 23101003679-8999863314403444401145753011000236----6-665-------555434556530379--8 Q gi|254781078|r 121 RLKEKYFIYGD-VVLKAGYSVDSLLIYGMGGFGGAYVIDSSLEKVE----S-DNS-------KNAKGRFDGHGSSVV--L 185 (242) Q Consensus 121 ~~~~k~~~~Gs-vr~r~Gya~~~~l~Y~~gG~A~~~~~~~~~~~~~----~-~~~-------~~~~~~~d~t~~G~t--v 185 (242) ..+.+.+.+|- +-.|.-+-.+++-+-+.+-+|++.++.+...... . ... .......+..+.+++ + T Consensus 189 ~~~t~N~f~G~Q~G~~~~~~~~r~sl~~~~Kvalg~n~~~~~~~g~~~~~~~~~~~ggl~~~~tn~~~~~~~~fa~v~e~ 268 (338) T pfam07585 189 SFSTENRFYGAQLGLRLTFNEGRWSLDALTKVGVGGNRQTQTILGSTTLTAPDPTDGGLLAQSTNIGTAARDRFAFVPEL 268 (338) T ss_pred EEEEEEEEECEEEEEEEEEECCCEEEEEEEEEEEEEEEEEEEEEEEEEECCCCCCCCCEEEECCCCCEEECCEEEEEEEC T ss_conf 88764011150667899984386899888889984115889983025203666557725751256542611308998424 Q ss_pred EEEEEEECCCCEEEEEEEEEEECCCCEE-----ECC-C-CC------CCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 7316673278768888988874487300-----036-6-66------66442201234138999886209 Q gi|254781078|r 186 GIGLDYMVNYDISLSASYRYIPHHIHSV-----NNS-N-AK------SDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 186 GaGVEYait~nwslraEY~Y~d~g~~~~-----~~~-~-~~------~~~~~~~~~~~~h~v~~GinY~F 242 (242) +.-+-|.++++|++++.|+++..+.... |.. + +. +......-+.-.|=+.+|+.|+| T Consensus 269 ~~~~~y~~t~~~~l~~GY~~l~~s~V~ra~dqid~~~~~~~~~~~~~p~~~~~~s~~~~~Gl~~G~e~~f 338 (338) T pfam07585 269 GLNLGYQLTPNLSLTVGYSFLYLSDVALAGDQIDRNVNTANLTGAVRPAFVDSNSSLLAHGLTFGGEYSF 338 (338) T ss_pred CEEEEEEECCCEEEEEEEEEEEECCCCCCHHCCCCCCCCCCCCCCCCCCEEECCCCEEEEEEEEEEEEEC T ss_conf 0799999648789998678999814365212015566888887766762343468759971587689839 No 44 >COG4625 Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain [Function unknown] Probab=91.81 E-value=0.79 Score=23.13 Aligned_cols=46 Identities=20% Similarity=0.199 Sum_probs=28.9 Q ss_pred CCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 5303798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 179 HGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 179 t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) -|..-.|++|+.+.++||.++.+-|...- +.+ .-.+.+++++.++| T Consensus 532 srdaa~i~~G~~l~~s~n~sl~~~y~~~f-g~r-----------------~~~~s~~~~l~~~F 577 (577) T COG4625 532 SRDAALLGLGAVLPVSPNAGLWARYEGQF-GDR-----------------KVDDSGGLRLQVRF 577 (577) T ss_pred CCCCEEEECCEEEEECCCEEEEEEEEEEE-CCC-----------------EECCCCCEEEEECC T ss_conf 66424775451686156326888776796-362-----------------45145433457439 No 45 >PRK09980 ompL outer membrane porin L; Provisional Probab=91.16 E-value=0.93 Score=22.74 Aligned_cols=50 Identities=10% Similarity=0.137 Sum_probs=36.2 Q ss_pred CCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 5303798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 179 HGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 179 t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) ..+-|-.-+-+-|.++++|+--+|++|+|-... +.+-+.-..||||.|.| T Consensus 181 Kk~~~E~n~~~~Yk~~~~w~PY~E~~~ld~~~~--------------~~~r~q~r~RVGv~y~F 230 (230) T PRK09980 181 TKHHWEITNTFRYRINEHWLPYFELRWLDRNVG--------------PYHREQNQIRIGAKYFF 230 (230) T ss_pred CCHHHHEEEEEEEECCCCCCCEEEEEECCCCCC--------------CCCCCEEEEEEEEEEEC T ss_conf 640443102688842896243189874223567--------------67771357999888629 No 46 >PRK10993 outer membrane protease; Reviewed Probab=91.14 E-value=0.94 Score=22.73 Aligned_cols=61 Identities=13% Similarity=0.149 Sum_probs=40.6 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEECCCC-----EEECCCCCC--CCCCCCCCCCEEEEEEEEEEEC Q ss_conf 379873166732787688889888744873-----000366666--6442201234138999886209 Q gi|254781078|r 182 SVVLGIGLDYMVNYDISLSASYRYIPHHIH-----SVNNSNAKS--DVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 182 G~tvGaGVEYait~nwslraEY~Y~d~g~~-----~~~~~~~~~--~~~~~~~~~~~h~v~~GinY~F 242 (242) =|-+-+-+-|.+|||+++.+|..|..+-.. .+|...+.. .-...-.+.+..++++||-|+| T Consensus 244 y~s~~~~agY~vtp~~kl~~e~~~~ky~~~kg~T~i~d~~tG~~~~~~d~AGi~n~~~tvs~GLkY~F 311 (311) T PRK10993 244 YYSASINAGYYVTPNAKLFVEGAYNKYFEGKGDTQIYDHATGESSYGKDAAGIENQNFTVTAGLKYRF 311 (311) T ss_pred EEEEEEEEEEEECCCEEEEEEEEEEEEECCCCCCEEEECCCCCEEECCCCCCCCCCCEEEEEEEEEEC T ss_conf 28999810046279815999999999851568747874478716614764220332159998777869 No 47 >PRK10159 outer membrane phosphoporin protein E; Provisional Probab=90.71 E-value=0.85 Score=22.96 Aligned_cols=47 Identities=19% Similarity=0.413 Sum_probs=33.2 Q ss_pred EEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 98731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 184 VLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 184 tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) .+-.|++|.|++|+++-+||+.-.+.... . ... ..-..+.+||+|.| T Consensus 305 ~~~~ga~Y~Fnkn~~~y~~Yk~n~ld~~~-----~------~~~-~~dd~~~~GlrY~F 351 (351) T PRK10159 305 YIDVGATYYFNKNMSAFVDYKINQLDSDN-----K------LNI-NNDDIVAVGMTYQF 351 (351) T ss_pred EEEEEEEEEECCCCEEEEEEEECCCCCCC-----C------CCC-CCCCEEEEEEEEEC T ss_conf 69986799957981589988601557755-----5------688-76788999618859 No 48 >TIGR03509 OMP_MtrB_PioB decaheme-associated outer membrane protein, MtrB/PioB family. Members of this protein family are integral proteins of the bacterial outer membrane, associated with multiheme c-type cytochromes involved in electron transfer. The MtrB protein of Shewanella oneidensis MR-1 (SO1776) has been shown to form a complex with 1:1:1 stochiometry with the small, periplasmic decaheme cytochrome MtrA and large, surface-exposed decaheme cytochrome MtrC. Probab=88.78 E-value=1.5 Score=21.64 Aligned_cols=59 Identities=19% Similarity=0.183 Sum_probs=27.2 Q ss_pred EEEEEEEEECCCCEEEEEEEEEEECCCCEEE-------CCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 9873166732787688889888744873000-------3666666442201234138999886209 Q gi|254781078|r 184 VLGIGLDYMVNYDISLSASYRYIPHHIHSVN-------NSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 184 tvGaGVEYait~nwslraEY~Y~d~g~~~~~-------~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) .|=+=.+|++++++++|+.|+|-.|-....- ...........+.+++.|.+.+=++|+| T Consensus 584 ~l~~~~~Y~ls~~~~lrl~y~yE~Y~d~D~~~~~~~~d~i~~~~t~G~~n~nYnAh~v~ls~sY~f 649 (649) T TIGR03509 584 RLKLYGKYQLSKSSSLRLDYRYERYSDNDYAYNNTAYDTIATVTTLGDQNPNYNAHYLGVSYSYLF 649 (649) T ss_pred EEEEEEEEEECCCCEEEEEEEEEEEECCCHHHCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEC T ss_conf 998886576266717887788999741642104767665434232454788864269999999629 No 49 >TIGR03016 pepcterm_hypo_1 uncharacterized protein, PEP-CTERM system associated. Members of this protein family are found predominantly in exopolysaccharide biosynthesis operons marked by the presence of the EpsH-family putative exosortase and presence in the genome of the PEP-CTERM protein sorting signal. Members of this family may be distantly related to the EpsL family modeled in TIGR03014. Probab=88.38 E-value=1.6 Score=21.49 Aligned_cols=50 Identities=22% Similarity=0.327 Sum_probs=38.0 Q ss_pred EEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 03798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 181 SSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 181 ~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) .=|.+.+|+-|.|+++.++.+-|||.+... + ... .+++.+.|.++|++.| T Consensus 382 ~~~~~~~gl~y~l~~~~~~~l~Yry~~rdS------~----~~~--~dy~~N~v~~~l~~~F 431 (431) T TIGR03016 382 DRTSVSAGLSYQLGENTTGSLTYRYQERDS------N----FEA--TDYTENRLMAEVTMTF 431 (431) T ss_pred EEEEEEEEEEEEECCCEEEEEEEEEEECCC------C----CCC--CCHHHHEEEEEEEEEC T ss_conf 699999989999638659999999997047------8----766--6645512157787409 No 50 >PRK10002 outer membrane protein F; Provisional Probab=86.47 E-value=2 Score=20.95 Aligned_cols=46 Identities=20% Similarity=0.278 Sum_probs=31.9 Q ss_pred EEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 8731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 185 LGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 185 vGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +=.|++|.|++|++.-+||+.-.+... +. .. .. .-..+.+||.|.| T Consensus 317 ~~lga~Y~FnkN~~ty~~Yk~NllD~~-----~~----~~--~~-~DD~~avGlrY~F 362 (362) T PRK10002 317 FEVGATYYFNKNMSTYVDYIINQIDSD-----NK----LG--VG-SDDTVAVGIVYQF 362 (362) T ss_pred EEEEEEEEECCCEEEEEEEEHHCCCCC-----CC----CC--CC-CCCEEEEEEEEEC T ss_conf 998759995587026876600056765-----55----68--87-7787999718829 No 51 >COG3713 OmpV Outer membrane protein V [Cell envelope biogenesis, outer membrane] Probab=86.24 E-value=2.1 Score=20.79 Aligned_cols=96 Identities=15% Similarity=0.190 Sum_probs=55.8 Q ss_pred EECCEEEEEEEEEEEEEEECCCCCCCC--CCCCCCCCCCCC--CCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEE Q ss_conf 314403444401145753011000236--666555543455--6530379873166732787688889888744873000 Q gi|254781078|r 139 SVDSLLIYGMGGFGGAYVIDSSLEKVE--SDNSKNAKGRFD--GHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVN 214 (242) Q Consensus 139 a~~~~l~Y~~gG~A~~~~~~~~~~~~~--~~~~~~~~~~~d--~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~ 214 (242) ..+++.+=...|+.|+.-+....-+-- .+...+.....+ .-.+-.-+++=++|+||+||++.+--.|..+....-+ T Consensus 159 ~~~~~~ispg~g~s~~s~~Y~~yYfGVs~~esa~Sgl~~Y~p~~g~~s~~l~ls~~y~~t~~w~~~~~~~~~RL~gdaad 238 (258) T COG3713 159 LIGRLSLSPGPGVSWASSKYNDYYFGVSAAESARSGLASYNPGGGAKSPGLGLSATYMITDHWSVGGTAGYSRLSGDAAD 238 (258) T ss_pred CCCCEEECCCCCEEEECCCCCCEEECCCHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEECCCEEEEEEEEEEECCCCCCC T ss_conf 05978985477327712233420205288997304777407998843467237999970687799988871310331014 Q ss_pred CCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 3666666442201234138999886209 Q gi|254781078|r 215 NSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 215 ~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) -+ +-.+.+.-++.+|+.||| T Consensus 239 SP--------iV~~~~q~~~~~~v~YrF 258 (258) T COG3713 239 SP--------IVDRRNQFSVGTGVTYRF 258 (258) T ss_pred CC--------CEECCCCEEEEEEEEEEC T ss_conf 97--------466156358998899859 No 52 >TIGR03014 EpsL exopolysaccharide biosynthesis operon protein EpsL. The epsL gene is described as a component of the methanolan exopolysaccharide biosynthesis operon in Methylobacillus sp strain 12S, although no other information regarding its possible function is suggested. Homologs of this gene are found in several other exopolysaccharide operons in a small number of species. These operons contain a subset of the methanolan operon genes by homology and synteny, including the epsH gene which is proposed to act as an "exosortase" directing proteins with a C-terminal tag (PEP-CTERM) to the exopolysaccharide layer. Each of the genomes in which these genes and epsL are found also encode genes with these C-terminal tags. Probab=85.91 E-value=2.2 Score=20.69 Aligned_cols=54 Identities=15% Similarity=0.189 Sum_probs=23.0 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEE Q ss_conf 379873166732787688889888744873000366666644220123413899988620 Q gi|254781078|r 182 SVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMH 241 (242) Q Consensus 182 G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~ 241 (242) ...+-.+..+.+|++++++++..|-. ..+. +. +-+...+.+.+.+.+.+++.|. T Consensus 290 ~~~~~l~~~w~~T~kv~~~~~~~y~~---rdY~--G~-~~~~~~~R~D~~~~~~l~~~Y~ 343 (381) T TIGR03014 290 NRGTSIGPTWQATSKIAVRGRLDYEE---RDFE--GD-PLVGPPARSDRTRSGSLSLDWS 343 (381) T ss_pred EEEEEEEEEEEEEEEEEEEEEEEEEE---CCCC--CC-CCCCCCCCCCCEEEEEEEEEEE T ss_conf 53388421052112399988889984---1235--86-4567667543248877899983 No 53 >COG4571 OmpT Outer membrane protease [Cell envelope biogenesis, outer membrane] Probab=84.01 E-value=2.7 Score=20.19 Aligned_cols=150 Identities=11% Similarity=0.120 Sum_probs=71.5 Q ss_pred EEEEEEEEEEC-CEEEEEEE-----EEEEECCCCCC--CCCCCCCEEEEEEEEEEEEEECCEEEEEEEEEEEEEEEC--- Q ss_conf 79988655445-77999998-----86740477764--323101003679899986331440344440114575301--- Q gi|254781078|r 90 GIELGHNIQLE-DFVFGINC-----HTTAAKDDSTF--YRLKEKYFIYGDVVLKAGYSVDSLLIYGMGGFGGAYVID--- 158 (242) Q Consensus 90 G~~~GYN~Q~~-~~V~GvE~-----d~~~~~~~~~~--~~~~~k~~~~Gsvr~r~Gya~~~~l~Y~~gG~A~~~~~~--- 158 (242) -+.++-.+|.+ +..+|+.+ +++|.--.+.. .+...-.+..+.+|+ |||.--=-|||+ |+|..+..+ T Consensus 136 dlnlk~wlL~~~ny~~gv~~GyqeT~~swTA~GGsy~Y~~G~~ig~fP~g~r~-IgY~Qrf~m~yi--Glag~Yr~~~~e 212 (314) T COG4571 136 DLNLKGWLLQDPNYRLGVTAGYQETRVSWTAYGGSYIYSNGFYIGNFPNGVRA-IGYKQRFKMPYI--GLAGSYRYNDFE 212 (314) T ss_pred EEEECEEEECCCCEEECCCCCCEEEEEEEEECCCCEEECCCCCCCCCCCCCEE-EEECCCCCCEEE--ECCCEEEEEEEE T ss_conf 22204034038761212323503645679970661684077461448998256-653001565055--012317886544 Q ss_pred -------CCCCCCC---CCCCCCCC-CCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCC-----EEECCC--CCC Q ss_conf -------1000236---66655554-34556530379873166732787688889888744873-----000366--666 Q gi|254781078|r 159 -------SSLEKVE---SDNSKNAK-GRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIH-----SVNNSN--AKS 220 (242) Q Consensus 159 -------~~~~~~~---~~~~~~~~-~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~-----~~~~~~--~~~ 220 (242) +.+.... .+.-..-. .+..+..-=|.++.-+-|-+|+|-++-+|--|..+... .+|-.+ +.. T Consensus 213 ~~~~~Kys~wv~a~~~D~Hymr~ltfrd~~~~s~yy~~~i~~GYy~t~nak~fve~ay~k~~~~kG~tqt~d~~g~t~~~ 292 (314) T COG4571 213 FGALFKYSDWVNAHDNDEHYMRDLTFRDKTKNSRYYSVVINAGYYVTENAKVFVEGAYNKVTEGKGGTQTYDKAGDTSDY 292 (314) T ss_pred EEEEEEECCEEECCCCHHHHHHCCEEECCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEEECCCCCCEEEECCCCCCCC T ss_conf 30499505356647760345423323204578834899873147986783499987777775056872567516863666 Q ss_pred CCCCCCCCCCEEEEEEEEEEEC Q ss_conf 6442201234138999886209 Q gi|254781078|r 221 DVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 221 ~~~~~~~~~~~h~v~~GinY~F 242 (242) .-...-.+....+|++||-|+| T Consensus 293 ~~d~aGi~~~~~tvt~Glky~F 314 (314) T COG4571 293 GGDAAGIENYNYTVTAGLKYRF 314 (314) T ss_pred CCCCCCCCCCCEEEEEEEEECC T ss_conf 7753333333148998777619 No 54 >PRK13484 putative iron-regulated outer membrane virulence protein; Provisional Probab=80.73 E-value=3.7 Score=19.49 Aligned_cols=53 Identities=15% Similarity=0.161 Sum_probs=25.7 Q ss_pred EEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCC-CCCEEEEEEEEEEEC Q ss_conf 67327876888898887448730003666666442201-234138999886209 Q gi|254781078|r 190 DYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDR-KGNAHIASLGINMHF 242 (242) Q Consensus 190 EYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~-~~~~h~v~~GinY~F 242 (242) .+.++++|++.+-|-|+|-..++-+.........+-.. ..-.|++.+.+.|.| T Consensus 540 ~~~~~~~~~l~~~yty~da~~~~~~~~~~g~~~~g~~~~~~P~~~~~~~~~~~~ 593 (682) T PRK13484 540 QIPVYDKWHVSANYTFTDSRRKSDDESLNGKSLKGEPLERTPRHAANAKLEWDY 593 (682) T ss_pred EEECCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEE T ss_conf 997369889999999986574247643346545688335687607899999997 No 55 >pfam09381 Porin_OmpG Outer membrane protein G (OmpG). Porins are channel proteins in the outer membrane of gram negative bacteria which mediate the uptake of molecules required for growth and survival. Escherichia coli OmpG forms a 14 stranded beta-barrel and in contrast to most porins, appears to function as a monomer. The central pore of OmpG is wider than other E. coli porins and it is speculated that it may form a non-specific channel for the transport of larger oligosaccharides. Probab=80.54 E-value=3.3 Score=19.76 Aligned_cols=46 Identities=22% Similarity=0.351 Sum_probs=34.4 Q ss_pred EEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 98731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 184 VLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 184 tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) .+|.=++|.+.+.+|+-.||-|-.- . -|.+ ....+|-.-+||||.| T Consensus 235 RlGl~~~~~~~~glsmtLeYAyE~q-~--hd~~----------~~dkFHyaGvG~~YsF 280 (280) T pfam09381 235 RVGLFYGYDFQNGLSVSLEYAFEWQ-D--HDEG----------DSDKFHYAGVGVNYSF 280 (280) T ss_pred CEEEEEEECCCCCCEEEEEEEEEEC-C--CCCC----------CCCCEEECCCCEEECC T ss_conf 0136785316887377652332202-0--4568----------7442033030035319 No 56 >PRK13486 bifunctional enterobactin receptor/adhesin protein; Provisional Probab=70.76 E-value=7 Score=17.96 Aligned_cols=43 Identities=19% Similarity=0.319 Sum_probs=22.3 Q ss_pred CCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 7876888898887448730003666666442201234138999886209 Q gi|254781078|r 194 NYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 194 t~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +++|++.+.|-|.+-..+.-+.... .++ ..-.|.++++++|+| T Consensus 538 ~~~~~l~~~yty~~~~~~d~~~~g~-----~l~-~~P~~~~~~~~~y~~ 580 (696) T PRK13486 538 SEDVTLSLNYTWTRSEQRDGDNKGA-----PLS-YTPEHMVNAKLNWQI 580 (696) T ss_pred CCCEEEEEEEEEEEEEECCCCCCCC-----CCC-CCCCCEEEEEEEEEE T ss_conf 1678999999998314226765686-----266-788103899988886 No 57 >PRK13483 enterobactin receptor protein; Provisional Probab=68.06 E-value=8 Score=17.63 Aligned_cols=19 Identities=26% Similarity=0.625 Sum_probs=11.2 Q ss_pred EEECCCCEEEEEEEEEEEC Q ss_conf 6732787688889888744 Q gi|254781078|r 190 DYMVNYDISLSASYRYIPH 208 (242) Q Consensus 190 EYait~nwslraEY~Y~d~ 208 (242) .+.++++|++.+-|-|+|- T Consensus 533 ~~~~~~~~~l~~~yty~d~ 551 (663) T PRK13483 533 SYPITSNLSLTGNYTYTDS 551 (663) T ss_pred EEEECCCEEEEEEEEEEEE T ss_conf 9982799899999999923 No 58 >COG5571 Autotransporter protein or domain, integral membrane beta-barrel involved in protein secretion [Cell motility and secretion] Probab=65.74 E-value=2.1 Score=20.79 Aligned_cols=135 Identities=15% Similarity=0.147 Sum_probs=71.2 Q ss_pred EEEEEEEEEEEECCEEEEEEEEEEEECCCCCCCCCCC--------CCEEEEEEEEEEEEEE----CCEEEEEEEEEEEEE Q ss_conf 0479988655445779999988674047776432310--------1003679899986331----440344440114575 Q gi|254781078|r 88 GLGIELGHNIQLEDFVFGINCHTTAAKDDSTFYRLKE--------KYFIYGDVVLKAGYSV----DSLLIYGMGGFGGAY 155 (242) Q Consensus 88 g~G~~~GYN~Q~~~~V~GvE~d~~~~~~~~~~~~~~~--------k~~~~Gsvr~r~Gya~----~~~l~Y~~gG~A~~~ 155 (242) +.++.+||+|-++.++.|.-++..+...+-..++... .-.-.-++++++|+-+ ..+-||+.--+-- + T Consensus 90 t~~L~~Gyef~~~~l~tGpvA~L~~~~t~v~~y~E~~~ssTal~f~dq~~~S~~sslGwrlr~~~~~~~Pyaqi~y~~-e 168 (239) T COG5571 90 TARLSAGYEFPFGELRTGPVAGLHYDKTTVSGYNEQNASSTALGFTDQLWHSLVSSLGWRLRLRLADLQPYAQIRYVR-E 168 (239) T ss_pred HHHCCCCCCCCCCCEEEECCHHCCCCCCEECCCCCCCCCCCCCCCCCCCHHHHHHCCCEEEECCCCCCCCEEEEEHHH-H T ss_conf 221013355432212320100000044300121224664110156651156653035467732224444014565787-6 Q ss_pred EECCCCCCCCCC---CCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEE Q ss_conf 301100023666---65555434556530379873166732787688889888744873000366666644220123413 Q gi|254781078|r 156 VIDSSLEKVESD---NSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAH 232 (242) Q Consensus 156 ~~~~~~~~~~~~---~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h 232 (242) ...+........ .........|++-. -+=+|++..+++++..-++--=.+ ...+.+.. T Consensus 169 f~~d~~~aqagL~~m~~~~~~~~qd~nw~--~i~~Ga~~~l~~~i~~~a~~~q~~-----------------~~t~~n~~ 229 (239) T COG5571 169 FGDDIWKAQAGLSRMTATNVLPSQDKNWV--DITAGADFLLNSSIGAFAAASQRE-----------------NSTENNQY 229 (239) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCHH--HEEEEEEECCCCHHHHHHHHHHHH-----------------CCCCCCCC T ss_conf 50123656410246600243566676641--304410122341068998764443-----------------05666631 Q ss_pred EEEEEEEEEC Q ss_conf 8999886209 Q gi|254781078|r 233 IASLGINMHF 242 (242) Q Consensus 233 ~v~~GinY~F 242 (242) .+++|+|-+| T Consensus 230 ~ytLgvs~~F 239 (239) T COG5571 230 LYTLGVSARF 239 (239) T ss_pred EEEEEEECCC T ss_conf 0577530239 No 59 >pfam04338 DUF481 Protein of unknown function, DUF481. This family includes several proteins of uncharacterized function. Probab=65.41 E-value=9.1 Score=17.34 Aligned_cols=49 Identities=18% Similarity=0.219 Sum_probs=31.3 Q ss_pred EEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 3798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 182 SVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 182 G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) =+..=.|++..|+++|++++.|.+--... .+ .+.+..-...+++|.|.| T Consensus 162 ~~~~~~~l~~~l~~~lsl~~~~~~~yds~-------P~-----~~~~~~D~~~~~~l~y~f 210 (210) T pfam04338 162 RLNSETGLTVKLTDNLSLKLSYEVDYNSD-------PP-----AGKKKTDTTTSVSLGYSF 210 (210) T ss_pred EEEEEEEEEEEEECEEEEEEEEEEEECCC-------CC-----CCCCCCCEEEEEEEEEEC T ss_conf 99999999998301079999999999377-------99-----998874539999999989 No 60 >pfam06727 DUF1207 Protein of unknown function (DUF1207). This family consists of a number of hypothetical bacterial proteins of around 410 residues in length which seem to be specific to Chlamydia species. The function of this family is unknown. Probab=64.65 E-value=9.4 Score=17.25 Aligned_cols=99 Identities=16% Similarity=0.144 Sum_probs=56.5 Q ss_pred EEEEEEEEEEEECCEEEEEEEEEEEEEEECCCCCCCC----C-C-----CC----CC-------CCCCCCCCCEEEEEEE Q ss_conf 6798999863314403444401145753011000236----6-6-----65----55-------5434556530379873 Q gi|254781078|r 129 YGDVVLKAGYSVDSLLIYGMGGFGGAYVIDSSLEKVE----S-D-----NS----KN-------AKGRFDGHGSSVVLGI 187 (242) Q Consensus 129 ~Gsvr~r~Gya~~~~l~Y~~gG~A~~~~~~~~~~~~~----~-~-----~~----~~-------~~~~~d~t~~G~tvGa 187 (242) +-.+-.|+.|.++.+-+|+.+|.-.-+- ......- . + .. .. -..+..++...++..+ T Consensus 214 ~E~~Dl~lS~e~p~iRvYGG~G~i~~rd--~s~~~p~~~e~GaElRsp~gl~~l~~rpv~A~d~k~~eq~~~~~d~s~~~ 291 (337) T pfam06727 214 YEGVDLRLSYELPQIRVYGGGGGIFSRD--PSPIKPWYIEYGAELRSPWGLDNLHARPIFAMDFKNWEQQKWNIDQSYRA 291 (337) T ss_pred CHHEEEEEEEECCCEEEEECCEEEEECC--CCCCCCEEEECCCEEECCCCCCCCCCEEEEEEEEEEHHHHCCCCCCEEEE T ss_conf 0010179976647579981401689548--77788705761701107666786343068999752345414586724898 Q ss_pred EEEEECC----CCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 1667327----876888898887448730003666666442201234138999886209 Q gi|254781078|r 188 GLDYMVN----YDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 188 GVEYait----~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) |+|++-. .+|.+-+||-- +.+ .++.=.+...+-.-+|+.||| T Consensus 292 G~E~~~~q~~gRk~~~llEY~~------------G~S-p~GQF~re~~~Y~Gigl~Ygf 337 (337) T pfam06727 292 GMEWAKFQAFGRKLRLLLEYHD------------GFS-PTGQFFREKTDYLGFGLHYGF 337 (337) T ss_pred EEEEEHEECCCCEEEEEEEECC------------CCC-CCCCEEEEECCEEEEEEEECC T ss_conf 6676210043713677662004------------999-876057631327885358159 No 61 >pfam09411 PagL Lipid A 3-O-deacylase (PagL). PagL is an outer membrane protein with lipid A 3-O-deacylase activity. It forms an 8 stranded beta barrel structure. Probab=64.06 E-value=9.6 Score=17.19 Aligned_cols=87 Identities=20% Similarity=0.189 Sum_probs=44.2 Q ss_pred EEEEEEEEEECCEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCC Q ss_conf 98999863314403444401145753011000236666555543455653037987316673278768888988874487 Q gi|254781078|r 131 DVVLKAGYSVDSLLIYGMGGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHI 210 (242) Q Consensus 131 svr~r~Gya~~~~l~Y~~gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~ 210 (242) +...|....-.+.-+|+-+|++.......... + +.=.++.=+...+|+-|.+.+++ +++|||..+.+ T Consensus 52 ~p~~~~~~~~~~~~~f~E~GiG~~~~s~~~~~----~-------~~lg~~~~F~~~~g~g~~~~~~~--~~~~~~~H~SN 118 (139) T pfam09411 52 SPVVRLEFTGGSFKPFVEAGIGVALLSRTRDE----D-------KDLGTRFQFRDQLGAGYQFGGRP--RVGYRYQHYSN 118 (139) T ss_pred EEEEEEEECCCCCCEEEEEEEEEEEECCCCCC----C-------CCCCCEEEEECCEEEEEEECCCC--EEEEEEEEECC T ss_conf 79999973688747389987654886152138----9-------86666087200188899976886--58999999517 Q ss_pred CEEECCCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 30003666666442201234138999886 Q gi|254781078|r 211 HSVNNSNAKSDVERVDRKGNAHIASLGIN 239 (242) Q Consensus 211 ~~~~~~~~~~~~~~~~~~~~~h~v~~Gin 239 (242) ..+...| ...|.+.++++.+ T Consensus 119 a~i~~~N---------~G~~~~~l~~~~~ 138 (139) T pfam09411 119 AGIKDPN---------PGLNSVGLRLGYS 138 (139) T ss_pred CCCCCCC---------CCCCEEEEEEEEE T ss_conf 8879998---------8901587999843 No 62 >COG4771 FepA Outer membrane receptor for ferrienterochelin and colicins [Inorganic ion transport and metabolism] Probab=63.84 E-value=9.7 Score=17.17 Aligned_cols=59 Identities=15% Similarity=0.200 Sum_probs=33.6 Q ss_pred EE-EEEEEEEEECCCCEEEEEEEEEEECCCCEEE-CCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 37-9873166732787688889888744873000-3666666442201234138999886209 Q gi|254781078|r 182 SV-VLGIGLDYMVNYDISLSASYRYIPHHIHSVN-NSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 182 G~-tvGaGVEYait~nwslraEY~Y~d~g~~~~~-~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) .| +++.|+-|++|.||+|++--.= +..|.+. ..+-........+-.+--..-++|+|+| T Consensus 639 ~Yti~~~ga~y~~tkn~sL~agV~N--LfDK~l~~~~~~a~~~g~~~y~epGRry~~s~~~~F 699 (699) T COG4771 639 AYTIVDLGATYDVTKNVSLNAGVYN--LFDKRLSRDGNYAFGAGAYTYNEPGRRYYMSLNYSF 699 (699) T ss_pred CEEEEECCEEEEECCCEEEEHHHHH--HCCHHHCCCCCCCCCCCCCCCCCCCCEEEEEEECCC T ss_conf 6489842414663234266010122--044221235543334454322367725899853049 No 63 >PRK10177 hypothetical protein; Provisional Probab=60.96 E-value=11 Score=16.87 Aligned_cols=25 Identities=24% Similarity=0.404 Sum_probs=18.2 Q ss_pred EEEEEEEEEEEECCCCEEEEEEEEE Q ss_conf 0379873166732787688889888 Q gi|254781078|r 181 SSVVLGIGLDYMVNYDISLSASYRY 205 (242) Q Consensus 181 ~G~tvGaGVEYait~nwslraEY~Y 205 (242) .=..+-+||.|-=-|=+++++|++- T Consensus 252 nP~A~T~GlnYTPvPLvT~~a~~k~ 276 (452) T PRK10177 252 NPVALSLGLNYTPVPLVTVTAQHKQ 276 (452) T ss_pred CCHHEEECCCCCCCCCEEEECCEEC T ss_conf 9641781145405212896100021 No 64 >pfam06178 KdgM Oligogalacturonate-specific porin protein (KdgM). This family consists of several bacterial proteins which are homologous to the oligogalacturonate-specific porin protein KdgM from Erwinia chrysanthemi. The phytopathogenic Gram-negative bacteria Erwinia chrysanthemi secretes pectinases, which are able to degrade the pectic polymers of plant cell walls, and uses the degradation products as a carbon source for growth. KdgM is a major outer membrane protein, whose synthesis is strongly induced in the presence of pectic derivatives. KdgM behaves like a voltage-dependent porin that is slightly selective for anions and that exhibits fast block in the presence of trigalacturonate. In contrast to most porins, KdgM seems to be monomeric. Probab=59.55 E-value=12 Score=16.73 Aligned_cols=51 Identities=14% Similarity=0.237 Sum_probs=35.6 Q ss_pred CCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 565303798731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 177 DGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 177 d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +.-..-|..-.-++|.+.++|+--+|+.+++-.+. .|. ..-..||||.|.| T Consensus 168 d~k~~~~e~n~~~~Yk~~~~W~PyvEvg~v~~~~~-------------~d~--rq~RyRVGv~Y~F 218 (218) T pfam06178 168 DGKKTNWEYNVKLAYKIDKHWTPYVEVGNVDVNNN-------------TDE--RQTRYRVGVQYSF 218 (218) T ss_pred CCCCCEEEEEEEEEEECCCCCCEEEEEEECCCCCC-------------CCC--CEEEEEEEEEEEC T ss_conf 79885168889999982898230499984256788-------------866--2767998889769 No 65 >PRK10554 outer membrane porin protein C; Provisional Probab=55.21 E-value=14 Score=16.32 Aligned_cols=49 Identities=18% Similarity=0.332 Sum_probs=33.3 Q ss_pred EEEEEEEECCCCEEEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEEEEEEEEC Q ss_conf 8731667327876888898887448730003666666442201234138999886209 Q gi|254781078|r 185 LGIGLDYMVNYDISLSASYRYIPHHIHSVNNSNAKSDVERVDRKGNAHIASLGINMHF 242 (242) Q Consensus 185 vGaGVEYait~nwslraEY~Y~d~g~~~~~~~~~~~~~~~~~~~~~~h~v~~GinY~F 242 (242) +=.|++|.|++|+++-+||+.-......+.... . ...-..+.+||.|.| T Consensus 325 ~~iga~Y~FnkN~~~y~~Yk~NllD~~~~~~~~------~---~~~DD~~alGlrY~F 373 (373) T PRK10554 325 VDVGATYYFNKNMSTYVDYKINLLDDNDFTRDA------G---INTDDIVALGLVYQF 373 (373) T ss_pred EEEEEEEEECCCEEEEEEEEEECCCCCCCCCCC------C---CCCCCEEEEEEEEEC T ss_conf 897679997787216888875136765310003------6---776776999628819 No 66 >pfam11336 DUF3138 Protein of unknown function (DUF3138). This family of proteins with unknown function appear to be restricted to Proteobacteria. Probab=55.01 E-value=14 Score=16.30 Aligned_cols=41 Identities=10% Similarity=0.041 Sum_probs=31.6 Q ss_pred CCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEECCC Q ss_conf 56530379873166732787688889888744873000366 Q gi|254781078|r 177 DGHGSSVVLGIGLDYMVNYDISLSASYRYIPHHIHSVNNSN 217 (242) Q Consensus 177 d~t~~G~tvGaGVEYait~nwslraEY~Y~d~g~~~~~~~~ 217 (242) .+-..-+.+-+++-|..|+++.+|+||||-+-..+.+-..+ T Consensus 456 cKGA~R~a~~~dLlfypTqq~TvK~EYRhDwA~~~VFl~~d 496 (514) T pfam11336 456 CKGANRQAITADLLFYPTQQITVKVEYRHDWANNKVFLRND 496 (514) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCEEECCC T ss_conf 55524554220031044531589999851333464577158 No 67 >PRK13513 putative outer membrane receptor; Provisional Probab=54.29 E-value=14 Score=16.23 Aligned_cols=18 Identities=22% Similarity=0.458 Sum_probs=9.0 Q ss_pred EECCCCEEEEEEEEEEEC Q ss_conf 732787688889888744 Q gi|254781078|r 191 YMVNYDISLSASYRYIPH 208 (242) Q Consensus 191 Yait~nwslraEY~Y~d~ 208 (242) +.++++|++.+-|-|+|- T Consensus 532 ~~~~~~~~l~~~yty~~~ 549 (660) T PRK13513 532 WDINQAWSLATNYTFTQS 549 (660) T ss_pred EEECCCEEEEEEEEEEEE T ss_conf 985599899999999942 No 68 >pfam12094 DUF3570 Protein of unknown function (DUF3570). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 396 to 444 amino acids in length. Probab=47.01 E-value=19 Score=15.57 Aligned_cols=29 Identities=21% Similarity=0.311 Sum_probs=13.4 Q ss_pred CCCEEEEEEEEEEEECCCCEEEEEEEEEE Q ss_conf 65303798731667327876888898887 Q gi|254781078|r 178 GHGSSVVLGIGLDYMVNYDISLSASYRYI 206 (242) Q Consensus 178 ~t~~G~tvGaGVEYait~nwslraEY~Y~ 206 (242) .+|.-..+++..-|.+.++.+++.+|||+ T Consensus 258 d~R~k~~~~~r~~~~l~~~~~l~~~YRyy 286 (407) T pfam12094 258 DTRTKQAVYLRAKYYLPDRAALRFSYRYY 286 (407) T ss_pred CHHHHHHHHHHEEEECCCCCEEEEEEEEE T ss_conf 62332010102202058772567877888 No 69 >pfam09982 DUF2219 Uncharacterized protein conserved in bacteria (DUF2219). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=44.68 E-value=21 Score=15.37 Aligned_cols=53 Identities=17% Similarity=0.032 Sum_probs=26.2 Q ss_pred EEEEEEECC-EEEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEEECCEEEEEEEEEE Q ss_conf 886554457-799999886740477764323101003679899986331440344440114 Q gi|254781078|r 93 LGHNIQLED-FVFGINCHTTAAKDDSTFYRLKEKYFIYGDVVLKAGYSVDSLLIYGMGGFG 152 (242) Q Consensus 93 ~GYN~Q~~~-~V~GvE~d~~~~~~~~~~~~~~~k~~~~Gsvr~r~Gya~~~~l~Y~~gG~A 152 (242) -|++.|+.| +++-+..+..+.-.-. . ..+.-+++.-++-.++++.-|+.+|+- T Consensus 127 ~GWd~Qi~ne~~~~l~~e~~~~~~~~---~----~~~~~dl~~~~~~~lGn~~t~~~~G~~ 180 (293) T pfam09982 127 QGWDNQIENEPGFNLSYERARRLFRA---T----DGGGFDLSPFAEAALGNLETYASAGAT 180 (293) T ss_pred CCHHHHCCCCEEEEEEEEEEEHHCCC---C----CCCCEEEEEEEEEEEEEEEEEEECCEE T ss_conf 75387567640899984134001112---6----898589988768996117887524569 No 70 >pfam11924 DUF3442 Protein of unknown function (DUF3442). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 334 to 948 amino acids in length. Probab=43.90 E-value=21 Score=15.30 Aligned_cols=67 Identities=16% Similarity=0.140 Sum_probs=38.5 Q ss_pred EEEEEEEEE--CCEEEEEE---EEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEE Q ss_conf 899986331--44034444---0114575301100023666655554345565303798731667327876888898887 Q gi|254781078|r 132 VVLKAGYSV--DSLLIYGM---GGFGGAYVIDSSLEKVESDNSKNAKGRFDGHGSSVVLGIGLDYMVNYDISLSASYRYI 206 (242) Q Consensus 132 vr~r~Gya~--~~~l~Y~~---gG~A~~~~~~~~~~~~~~~~~~~~~~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~ 206 (242) -|..-||.+ .-.|||.- +-+.+.+...+..... ...+.-+..+.+-.||+|.-.|.+++.+||+.. T Consensus 162 Er~a~G~D~~~~~~lP~~p~lg~~l~yeqy~gd~v~lf---------~~~~~q~~p~a~t~gl~ytP~pllt~~~~~~~~ 232 (280) T pfam11924 162 ERPASGYDLRLGARLPAYPQLGGKLKYEQYYGDNVALF---------GSGNLQKDPHAVTLGLNYTPVPLLTVGADYRDG 232 (280) T ss_pred EEECCCEEEEEEEECCCCCCCCCEEEEEEEECCCCCCC---------CCCCCCCCCCEEEEEEEECCCCCEEEEEEEEEC T ss_conf 46359732674244686756674699999765753323---------666555686117998884036877998736806 Q ss_pred E Q ss_conf 4 Q gi|254781078|r 207 P 207 (242) Q Consensus 207 d 207 (242) . T Consensus 233 ~ 233 (280) T pfam11924 233 K 233 (280) T ss_pred C T ss_conf 8 No 71 >pfam10677 DUF2490 Protein of unknown function (DUF2490). This is a bacterial family of uncharacterized proteins. Probab=43.82 E-value=21 Score=15.29 Aligned_cols=31 Identities=29% Similarity=0.420 Sum_probs=24.2 Q ss_pred CCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEEEC Q ss_conf 34556530379873166732787688889888744 Q gi|254781078|r 174 GRFDGHGSSVVLGIGLDYMVNYDISLSASYRYIPH 208 (242) Q Consensus 174 ~~~d~t~~G~tvGaGVEYait~nwslraEY~Y~d~ 208 (242) ..+|.. -+.+|+.|.+++++++.+-|+|.-. T Consensus 147 ~~fdqn----R~~~Gl~~~ls~~~~le~gY~~q~~ 177 (191) T pfam10677 147 SGFDQN----RLFAGLGYKLSKNLKLELGYLYQFI 177 (191) T ss_pred CCHHHH----HEEEEEEEEECCCCEEEEEEEEEEC T ss_conf 635250----1205611894899789999889983 No 72 >pfam07172 GRP Glycine rich protein family. This family of proteins includes several glycine rich proteins as well as two nodulins 16 and 24. The family also contains proteins that are induced in response to various stresses. Probab=43.05 E-value=5.9 Score=18.34 Aligned_cols=43 Identities=26% Similarity=0.308 Sum_probs=21.3 Q ss_pred CCCHHHHHHH--HHHHCCCCEEEEEEECCCCCCCCCCCCCEEEECCCCCCE Q ss_conf 2205678888--987436520234332025434575321103303766530 Q gi|254781078|r 5 GYGALFFVVF--LSIVVPNHSLAVDLYLPRKIDLFNEADNNVEYQDDEYGI 53 (242) Q Consensus 5 ~~~~~~~~~~--~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 53 (242) .++.||+-++ ..|++.+-+-|+++-.+++.+.. -+++++.+.= T Consensus 3 SKa~llLglllA~~lLIsSevaa~~ls~~~K~e~e------~~v~~~~~gg 47 (91) T pfam07172 3 SKALLLLGLLLAAVLLISSEVAAADLSNTEKSESE------NEVQDDKYGG 47 (91) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC------CCEEECCCCC T ss_conf 05799999999999999999999997128455542------4021146457 No 73 >COG2930 Uncharacterized conserved protein [Function unknown] Probab=42.31 E-value=23 Score=15.16 Aligned_cols=56 Identities=20% Similarity=0.206 Sum_probs=37.0 Q ss_pred CEEEEEEEEEEEEEEEEEEECCCCCCC-CCCCCCCCCEEEEEEEEEEEECCEEEEEEEE Q ss_conf 300005744464410332221455566-6667667550479988655445779999988 Q gi|254781078|r 52 GIWSGNYVGLHISRLYETHPLADTINR-KTYNSLLPNGLGIELGHNIQLEDFVFGINCH 109 (242) Q Consensus 52 ~~WtG~YiG~~~G~~~~~~~~~d~~~~-~~~n~~~~~g~G~~~GYN~Q~~~~V~GvE~d 109 (242) .+=.||-+|++.|-+--..+..+.++. ..+-...+.++|.|+|. |...+|+=+..| T Consensus 62 vLkaGFvigGr~GqGvl~~r~~~nTWs~p~~v~~~g~siG~q~G~--qs~d~v~i~~~~ 118 (227) T COG2930 62 VLKAGFVIGGRYGQGVLVARLPDNTWSAPSFVKMAGASIGGQAGV--QSTDFVIILNTD 118 (227) T ss_pred HCEEEEEEECCCCCEEEEECCCCCCCCCCHHHHHCCCCCCCCCCC--EEEEEEEEECCH T ss_conf 103007885343534787327998744414441105553420242--022289998065 No 74 >TIGR01778 TonB-copper TonB-dependent copper receptor; InterPro: IPR010100 This entry represents a family of proteobacterial TonB-dependent outer membrane receptor/transporters which bind and translocate copper ions. Two characterised members of this family exist, outer membrane protein C (OprC) from Pseudomonas aeruginosa and NosA from Pseudomonas stutzeri, which is responsible for providing copper for the copper-containing N2O reductase .. Probab=40.36 E-value=24 Score=14.98 Aligned_cols=30 Identities=20% Similarity=0.215 Sum_probs=14.8 Q ss_pred EEEEEEEEEEEEC-CEEEEEEEEEEEECCCC Q ss_conf 0479988655445-77999998867404777 Q gi|254781078|r 88 GLGIELGHNIQLE-DFVFGINCHTTAAKDDS 117 (242) Q Consensus 88 g~G~~~GYN~Q~~-~~V~GvE~d~~~~~~~~ 117 (242) .+|+-+|..+||. +|+-++=.-+.|...+. T Consensus 515 l~GGEa~~~Y~l~~~W~~~~sLAY~~G~N~T 545 (668) T TIGR01778 515 LLGGEAGLSYQLDEQWKTEASLAYAWGENET 545 (668) T ss_pred EECCHHHEEHHCCCCCEEEEEEEEEECCCCC T ss_conf 3220210001006586120489997133235 No 75 >PRK11528 hypothetical protein; Provisional Probab=39.13 E-value=25 Score=14.88 Aligned_cols=30 Identities=13% Similarity=0.240 Sum_probs=24.6 Q ss_pred CCEEEEEEEEEEEECCCCEEEEEEEEEEEC Q ss_conf 530379873166732787688889888744 Q gi|254781078|r 179 HGSSVVLGIGLDYMVNYDISLSASYRYIPH 208 (242) Q Consensus 179 t~~G~tvGaGVEYait~nwslraEY~Y~d~ 208 (242) -+.|.-=++-+=+-.+++|+..+.|||.|. T Consensus 206 ~~~G~NGa~alwW~~t~~~ttgiqYRYAdN 235 (254) T PRK11528 206 GKDGINGAVALWWNATKHLTTGIQYRYADN 235 (254) T ss_pred CCCCCCCEEEEEEEECCCEEEEEEEEEHHC T ss_conf 977623538998840576004269980001 No 76 >pfam04357 DUF490 Family of unknown function (DUF490). Probab=30.86 E-value=35 Score=14.11 Aligned_cols=15 Identities=40% Similarity=0.580 Sum_probs=7.2 Q ss_pred EEEECCCCEEEEEEE Q ss_conf 667327876888898 Q gi|254781078|r 189 LDYMVNYDISLSASY 203 (242) Q Consensus 189 VEYait~nwslraEY 203 (242) +||.++++|++++++ T Consensus 319 i~y~l~~~~~l~~~~ 333 (349) T pfam04357 319 LEYRLTPNLSLEASS 333 (349) T ss_pred EEEEECCCEEEEEEE T ss_conf 999976998999999 No 77 >PRK03761 organic solvent tolerance protein; Provisional Probab=28.41 E-value=39 Score=13.87 Aligned_cols=21 Identities=19% Similarity=0.360 Sum_probs=10.1 Q ss_pred EEEEEEEECCCCEEEEEEEEE Q ss_conf 873166732787688889888 Q gi|254781078|r 185 LGIGLDYMVNYDISLSASYRY 205 (242) Q Consensus 185 vGaGVEYait~nwslraEY~Y 205 (242) ++.-..|.|+++|++-+.|+| T Consensus 681 ~~~s~~~pi~~~W~~~g~~~Y 701 (782) T PRK03761 681 VGAVASWPIADRWSIVGAYYY 701 (782) T ss_pred EEEEEEEEECCCEEEEEEEEE T ss_conf 889999980388899999998 No 78 >TIGR00459 aspS_bact aspartyl-tRNA synthetase; InterPro: IPR004524 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Aspartyl tRNA synthetase 6.1.1.12 from EC is an alpha2 dimer that belongs to class IIb. Structural analysis combined with mutagenesis and enzymology data on the yeast enzyme point to a tRNA binding process that starts by a recognition event between the tRNA anticodon loop and the synthetase anticodon binding module . This family represents aspartyl-tRNA synthetases from the bacteria and from mitochondria. In some species, this enzyme aminoacylates tRNA for both Asp and Asn; Asp-tRNA(asn) is subsequently transamidated to Asn-tRNA(asn). ; GO: 0000166 nucleotide binding, 0004815 aspartate-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006422 aspartyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=28.03 E-value=14 Score=16.24 Aligned_cols=20 Identities=15% Similarity=0.189 Sum_probs=13.3 Q ss_pred ECCCCEEEEEEEEEEECCCC Q ss_conf 32787688889888744873 Q gi|254781078|r 192 MVNYDISLSASYRYIPHHIH 211 (242) Q Consensus 192 ait~nwslraEY~Y~d~g~~ 211 (242) |.-+.=-+|.+|||+|+=.. T Consensus 126 a~e~~EDvRLkYRYLDLRrp 145 (653) T TIGR00459 126 AEEEREDVRLKYRYLDLRRP 145 (653) T ss_pred CCCCCCCCCCCCCCHHHCCH T ss_conf 00010166610022220487 No 79 >PRK03641 hypothetical protein; Provisional Probab=26.27 E-value=43 Score=13.65 Aligned_cols=30 Identities=23% Similarity=0.392 Sum_probs=23.2 Q ss_pred CHHHHHHHHHHHCCCCEEEEEEECCCCCCC Q ss_conf 056788889874365202343320254345 Q gi|254781078|r 7 GALFFVVFLSIVVPNHSLAVDLYLPRKIDL 36 (242) Q Consensus 7 ~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~ 36 (242) |..+..|++++++...++|+.|-.|.-+++ T Consensus 2 ~~~~i~~llal~~S~~~~A~tL~lp~~iel 31 (220) T PRK03641 2 KTGIVTTLLALCLPVSVFATTLRLSPEVDL 31 (220) T ss_pred CHHHHHHHHHHHHHHHHHEEEEECCCCEEE T ss_conf 116999999998623645046647986469 No 80 >COG1956 GAF domain-containing protein [Signal transduction mechanisms] Probab=25.22 E-value=32 Score=14.34 Aligned_cols=11 Identities=27% Similarity=0.311 Sum_probs=9.0 Q ss_pred CCEEEEEEEEE Q ss_conf 53000057444 Q gi|254781078|r 51 YGIWSGNYVGL 61 (242) Q Consensus 51 ~~~WtG~YiG~ 61 (242) ..+|.|||+=- T Consensus 50 ~~nW~GFYl~~ 60 (163) T COG1956 50 DVNWVGFYLLE 60 (163) T ss_pred CCCEEEEEEEE T ss_conf 87369999940 No 81 >PRK09840 catecholate siderophore receptor Fiu; Provisional Probab=24.38 E-value=46 Score=13.45 Aligned_cols=20 Identities=10% Similarity=-0.044 Sum_probs=14.5 Q ss_pred EEECCCCEEEEEEEEEEECC Q ss_conf 67327876888898887448 Q gi|254781078|r 190 DYMVNYDISLSASYRYIPHH 209 (242) Q Consensus 190 EYait~nwslraEY~Y~d~g 209 (242) .+.++++|++.+-|-|+|.. T Consensus 621 ~~~~~~~~~l~~~yty~da~ 640 (760) T PRK09840 621 AGNITPAWQVIAGYTQQKAT 640 (760) T ss_pred EEEECCCEEEEEEEEEEEEE T ss_conf 87006987999999999779 No 82 >PRK04517 hypothetical protein; Provisional Probab=20.39 E-value=54 Score=13.08 Aligned_cols=28 Identities=18% Similarity=0.375 Sum_probs=22.4 Q ss_pred HHHHHHHHHHCCCCEEEE-EEECCCCCCC Q ss_conf 678888987436520234-3320254345 Q gi|254781078|r 9 LFFVVFLSIVVPNHSLAV-DLYLPRKIDL 36 (242) Q Consensus 9 ~~~~~~~~~~~~~~~~a~-d~~~~~~~~~ 36 (242) ...+|+++|+..+.++|+ .|..|+.+++ T Consensus 5 ~~~~~ll~l~~s~sa~A~vtL~lp~~iel 33 (216) T PRK04517 5 KPLTCLLALAMSGSAFADVTVEVPDDVDL 33 (216) T ss_pred HHHHHHHHHHHHHHHHHHCEEECCCCEEE T ss_conf 79999999997135533207836997779 Done!